CSPI05G28350 (gene) Wild cucumber (PI 183967)

Name: CSPI05G28350
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Ty3/gypsy retrotransposon protein
Location: Chr5 : 26800083 .. 26803444 (+)
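
The location value above can be turned into structured coordinates and a span length. The following is a minimal Python sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the record itself does not state.

```python
import re

# Matches values such as "Chr5 : 26800083 .. 26803444 (+)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr5 : 26800083 .. 26803444 (+)")
print(chrom, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 3362 bases on the plus strand of Chr5.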
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGGATTGGAGGAACCTAACCATGTCCTTTTTTCATAACAGTAAAAAGGTGGTACTAAAAGGAGATCCGAGTTTAACAAAAACTCAAGTGAGTCTAAAAAACCTCACTAAATCATGGGTAGAAACGGACACGGGATATCTGATTGAATGCAGAACACTGGAGGCATGCCAAATAAAAACTGAAGAAAACGAAACCGAACTGGAAAGCATCTTGACAGTTCTAACACAGTATGAGGACGTCTTCGAGGAGCCTAAGGAACTACCCCCCAACAGAAATATCGAACACCAGATACACATAAGGGGTGGAGCAGACCCGGTGAACGTCCGGCCTTATCGGTATGCATTCCAACAGAAAGAAGAATTGGAAAAACTGGTGGACGAAATGATGGCATCAGGAATTATACGCCCTAGCACAAGCCCCTACTCCAGCCCTGTTCTCCTGGTCAAAAAAAAAGACGGAAGCTGGCGATTCTGCGTGGATTATCGGGCGCTCAACAACATAACTGTTCCAGATAAGTTTCCCATCCCTGTTGTGGAAGAACTGTTTGATGAGTTACACGGTGCCAGCCTCTTTTCCAAAATAGACCTGAAGGCCGGTTATCATCAACTGAGAATGTGTAGTCGGGATATAGAGAAAACGGCCTTCAGAACTCATGAGGGACATTACGAGTTCTTAGTAATGCCGTTTGGACTCACCAATGCACCAGCAACCTTCCAATCACTAATGAATTCTATCTTCAGATCCTACCTGAGGAAATTTGTCTTGGTTTTCTTTGACGATATACTAGTGTATAGCAGGAACTTAGAGGAACACTGCCAACACATGGAACTGGTGCTGGAAGTTTTAAGGGAGCATAAGTTGTTTGCCAACCGGAAGAAATGCTGCTTCGCGAGTGCAAAGGTAGAATACTTGGGACATGTATTATCGGGAAGAGGAGTAGAAGTTGACCCTGAGAAAATCCGCGCAGTCAAGCAATGGCCAGTACCAACTAACGTTCGTGAGGTTAGAGGATTCTTGGGACTGACCGGTTATTACCGACGTTTTGTACACCATTATGGATCCTTGGCAGCACCTCTAACGCAGTTGCTCAAGCTCGGCGCATTCAAATGGGATGAAGAAACACAGGAGGCGTTTGAGAAGCTTAAAAGAGCCATGATGACGGTACCCATATTAACTCTACCCGACTTCAGTATACCCTTCGAAGTAGAGACAGATGCGTCGGGCTATGGAATTGGGGCGGTACTAATGCAGAGTAAAAGACCAATAGCGTTTTACAGCCACACATTGGCACTGCGTGACCGAGTCAAACCAGTATACGAGAGGGAATTAATGGCAGTGGTAATGGCAGTTCAACGCTGGCGACCCTATTTACTTGGGAGGACGTTTATAGTTAAAACAGACCAGAAATCACTGAAATTCTTGCTGGAGCAGAGAGTCATCCAACCGCAATATCAGAAATGGATTGCAAAGCTGCTGGGGTACTCTTTCGAGGTGATGTACAAACCAGGGTTGGAAAACAAGGCAGCAGATAGCCTCTCACGAATACCTCCAACTGCACATCTTAACCAACTAACCGCTCACACTCTGGTCGACATCAAAGTAATCCGAGAAGAGGTTGACAAAGATGAATATCTGAAGAATATTATAGACAGAATTCAGAAGGAGGAAGAGGTAAAGAACTACACTCTGCAACAAGGCATACTGAAATACAAAGGAAGGTTAGTAATCGCAAAGAACTCCTCATTAAGATCAGCGATTCTGCATACCTATCATGATTCAGTCCTAGGGGGCCATTCAGGATTCCTGAGAACATATAAACGGATAACAGGAGAGTTGTTTTGGGTAGGAATGAAGGGCGAAGTACGCAAGTACTGTGAAGAATGCATGACATGCCAGCGGAATAAAACCTTAGCCTTATCTCCAGCAGGATTATTGACTCCTCTCGAGGTACCAAAGAGAGTTTGGGAGGATATAACCATGGATTTCATCGAAGGATTGCC
TAAATCAATGGGGTTTAACGTCATATTCGTAGTGGTAGACCGCTTCAGCAAATATGCGCACTTCCTCAGCCTTAAACATCCCTTTGACGCAAAAATGGTAGCTGAATTATTCGTTAAAGAAGTGGTAAGATTACACGGGTTTCCACAGTCAATCGTATCTGATAGGGACAAGATCTTTCTGAGTCACTTTTGGAAAGAACTTTTTAGATTAGCGGGTACCAAGCTAAACCGAAGCACCGCCTACCATCCTCAGACAGACGGACAGACAGAGGTGGTCAACAGGTCAGTTGAAATCTATCTAAGATGCTTTTGCGGGGAAAGACCGAAGGATTGGGTGAAATGGTTATCCTGGGCTGAGTATTGGTATAATACAACATTCCAAAAATCGTTGGGGGTGACACCATTCCAAGCTGTGTACGGGAGGACCCCACCAGCCCTGCTATATTACGGAGAACGGGAAACTCCCAACTCAACCTTGGATGAACAACTGAAGGAAAGAGATGTAGCATTGGGAGCTTTGAAGGAACACCTACGCATAGCTCAAGACAAGATGAAAAGTTATGCTGACAAGAAAAGGAGACATGTCGAATTCGAAGAAGGAGATCAAGTGTTCCTAAAAATTCGACCCTACAGACAAGTGTCCTTACGGAAAAAAAGGAACGAGAAGCTATCACCGAAGTATTTCGGGCCATATAAGATAGTAAAGCGAATTGGTCAGGTGGCATATCGGCTGGAACTACGAGCGGCAGCAACAATCCACCCCGTGTTCCATGTGTCACAATTAAAAAAAAAGCCTTTGGAGAAAGTGCGAATAACGAGGAGCTGTTGCCATTTCTGACTGCCAACCACGAGTGGAAAGCCGTGCCACAGGAGACTCATGGTTATAGAAAAAACGAAGCAGGGGGATGGGAGGTTTTAATAAATTGGGAAGGCCTACCGCATCATGAAGCCACCTGGGAAGGCTATGACGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAACTGGACCGGGAAGGTACTGTTAGACCACCCATCATACACCAGTATAGTAGGAGAAAGAACAGGAAAGAGAAAAAGGACGAGGAGTTGGTTATGTAACCAACTGCAGGTGGGGACCACAAGGAAATGGAGTGCCGACGCACGATCCCCTAAGGGGGACCATGGCAGGGGAGTGAGGGAGAAAATATAAATAGAGATGAGAAGGAAGGGAGGGGGCAGAGAATTTTGGTGATAAGAATCCTACTGTTATTCCTTGAGAGATAGGAAGGCAGCTGAGGGAGAGAGGGAGTCTTCGACGGAGAGCCTTTTTATTTTCTTCTGTATTTTTCATTAGCAGCAATATTTCCT

mRNA sequence

ATGGATTGGAGGAACCTAACCATGTCCTTTTTTCATAACAGTAAAAAGGTGGTACTAAAAGGAGATCCGAGTTTAACAAAAACTCAAGTGAGTCTAAAAAACCTCACTAAATCATGGGTAGAAACGGACACGGGATATCTGATTGAATGCAGAACACTGGAGGCATGCCAAATAAAAACTGAAGAAAACGAAACCGAACTGGAAAGCATCTTGACAGTTCTAACACAGTATGAGGACGTCTTCGAGGAGCCTAAGGAACTACCCCCCAACAGAAATATCGAACACCAGATACACATAAGGGGTGGAGCAGACCCGGTGAACGTCCGGCCTTATCGGTATGCATTCCAACAGAAAGAAGAATTGGAAAAACTGGTGGACGAAATGATGGCATCAGGAATTATACGCCCTAGCACAAGCCCCTACTCCAGCCCTGTTCTCCTGGTCAAAAAAAAAGACGGAAGCTGGCGATTCTGCGTGGATTATCGGGCGCTCAACAACATAACTGTTCCAGATAAGTTTCCCATCCCTGTTGTGGAAGAACTGTTTGATGAGTTACACGGTGCCAGCCTCTTTTCCAAAATAGACCTGAAGGCCGGTTATCATCAACTGAGAATGTGTAGTCGGGATATAGAGAAAACGGCCTTCAGAACTCATGAGGGACATTACGAGTTCTTAGTAATGCCGTTTGGACTCACCAATGCACCAGCAACCTTCCAATCACTAATGAATTCTATCTTCAGATCCTACCTGAGGAAATTTGTCTTGGTTTTCTTTGACGATATACTAGTGTATAGCAGGAACTTAGAGGAACACTGCCAACACATGGAACTGGTGCTGGAAGTTTTAAGGGAGCATAAGTTGTTTGCCAACCGGAAGAAATGCTGCTTCGCGAGTGCAAAGGTAGAATACTTGGGACATGTATTATCGGGAAGAGGAGTAGAAGTTGACCCTGAGAAAATCCGCGCAGTCAAGCAATGGCCAGTACCAACTAACGTTCGTGAGGTTAGAGGATTCTTGGGACTGACCGGTTATTACCGACGTTTTGTACACCATTATGGATCCTTGGCAGCACCTCTAACGCAGTTGCTCAAGCTCGGCGCATTCAAATGGGATGAAGAAACACAGGAGGCGTTTGAGAAGCTTAAAAGAGCCATGATGACGGTACCCATATTAACTCTACCCGACTTCAGTATACCCTTCGAAGTAGAGACAGATGCGTCGGGCTATGGAATTGGGGCGGTACTAATGCAGAGTAAAAGACCAATAGCGTTTTACAGCCACACATTGGCACTGCGTGACCGAGTCAAACCAGTATACGAGAGGGAATTAATGGCAGTGGTAATGGCAGTTCAACGCTGGCGACCCTATTTACTTGGGAGGACGTTTATAGTTAAAACAGACCAGAAATCACTGAAATTCTTGCTGGAGCAGAGAGTCATCCAACCGCAATATCAGAAATGGATTGCAAAGCTGCTGGGGTACTCTTTCGAGGTGATGTACAAACCAGGGTTGGAAAACAAGGCAGCAGATAGCCTCTCACGAATACCTCCAACTGCACATCTTAACCAACTAACCGCTCACACTCTGGTCGACATCAAAGTAATCCGAGAAGAGGTTGACAAAGATGAATATCTGAAGAATATTATAGACAGAATTCAGAAGGAGGAAGAGGTAAAGAACTACACTCTGCAACAAGGCATACTGAAATACAAAGGAAGGTTAGTAATCGCAAAGAACTCCTCATTAAGATCAGCGATTCTGCATACCTATCATGATTCAGTCCTAGGGGGCCATTCAGGATTCCTGAGAACATATAAACGGATAACAGGAGAGTTGTTTTGGGTAGGAATGAAGGGCGAAGTACGCAAGTACTGTGAAGAATGCATGACATGCCAGCGGAATAAAACCTTAGCCTTATCTCCAGCAGGATTATTGACTCCTCTCGAGGTACCAAAGAGAGTTTGGGAGGATATAACCATGGATTTCATCGAAGGATTGCC
TAAATCAATGGGGTTTAACGTCATATTCGTAGTGGTAGACCGCTTCAGCAAATATGCGCACTTCCTCAGCCTTAAACATCCCTTTGACGCAAAAATGGTAGCTGAATTATTCGTTAAAGAAGTGGTAAGATTACACGGGTTTCCACAGTCAATCGTATCTGATAGGGACAAGATCTTTCTGAGTCACTTTTGGAAAGAACTTTTTAGATTAGCGGGTACCAAGCTAAACCGAAGCACCGCCTACCATCCTCAGACAGACGGACAGACAGAGGTGGTCAACAGGTCAGTTGAAATCTATCTAAGATGCTTTTGCGGGGAAAGACCGAAGGATTGGGTGAAATGGTTATCCTGGGCTGAGTATTGGTATAATACAACATTCCAAAAATCGTTGGGGGTGACACCATTCCAAGCTGTGTACGGGAGGACCCCACCAGCCCTGCTATATTACGGAGAACGGGAAACTCCCAACTCAACCTTGGATGAACAACTGAAGGAAAGAGATGTAGCATTGGGAGCTTTGAAGGAACACCTACGCATAGCTCAAGACAAGATGAAAAGTTATGCTGACAAGAAAAGGAGACATGTCGAATTCGAAGAAGGAGATCAAGTTAAAGCGAATTGGTCAGGTGGCATATCGGCTGGAACTACGAGCGGCAGCAACAATCCACCCCGTGTTCCATGTGTCACAATTAAAAAAAAAGCCTTTGGAGAAAGTGCGAATAACGAGGAGCTGTTGCCATTTCTGACTGCCAACCACGAGTGGAAAGCCGTGCCACAGGAGACTCATGGTTATAGAAAAAACGAAGCAGGGGGATGGGAGGTTTTAATAAATTGGGAAGGCCTACCGCATCATGAAGCCACCTGGGAAGGCTATGACGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAACTGGACCGGGAAGGTGGGGACCACAAGGAAATGGAGTGCCGACGCACGATCCCCTAA

Coding sequence (CDS)

ATGGATTGGAGGAACCTAACCATGTCCTTTTTTCATAACAGTAAAAAGGTGGTACTAAAAGGAGATCCGAGTTTAACAAAAACTCAAGTGAGTCTAAAAAACCTCACTAAATCATGGGTAGAAACGGACACGGGATATCTGATTGAATGCAGAACACTGGAGGCATGCCAAATAAAAACTGAAGAAAACGAAACCGAACTGGAAAGCATCTTGACAGTTCTAACACAGTATGAGGACGTCTTCGAGGAGCCTAAGGAACTACCCCCCAACAGAAATATCGAACACCAGATACACATAAGGGGTGGAGCAGACCCGGTGAACGTCCGGCCTTATCGGTATGCATTCCAACAGAAAGAAGAATTGGAAAAACTGGTGGACGAAATGATGGCATCAGGAATTATACGCCCTAGCACAAGCCCCTACTCCAGCCCTGTTCTCCTGGTCAAAAAAAAAGACGGAAGCTGGCGATTCTGCGTGGATTATCGGGCGCTCAACAACATAACTGTTCCAGATAAGTTTCCCATCCCTGTTGTGGAAGAACTGTTTGATGAGTTACACGGTGCCAGCCTCTTTTCCAAAATAGACCTGAAGGCCGGTTATCATCAACTGAGAATGTGTAGTCGGGATATAGAGAAAACGGCCTTCAGAACTCATGAGGGACATTACGAGTTCTTAGTAATGCCGTTTGGACTCACCAATGCACCAGCAACCTTCCAATCACTAATGAATTCTATCTTCAGATCCTACCTGAGGAAATTTGTCTTGGTTTTCTTTGACGATATACTAGTGTATAGCAGGAACTTAGAGGAACACTGCCAACACATGGAACTGGTGCTGGAAGTTTTAAGGGAGCATAAGTTGTTTGCCAACCGGAAGAAATGCTGCTTCGCGAGTGCAAAGGTAGAATACTTGGGACATGTATTATCGGGAAGAGGAGTAGAAGTTGACCCTGAGAAAATCCGCGCAGTCAAGCAATGGCCAGTACCAACTAACGTTCGTGAGGTTAGAGGATTCTTGGGACTGACCGGTTATTACCGACGTTTTGTACACCATTATGGATCCTTGGCAGCACCTCTAACGCAGTTGCTCAAGCTCGGCGCATTCAAATGGGATGAAGAAACACAGGAGGCGTTTGAGAAGCTTAAAAGAGCCATGATGACGGTACCCATATTAACTCTACCCGACTTCAGTATACCCTTCGAAGTAGAGACAGATGCGTCGGGCTATGGAATTGGGGCGGTACTAATGCAGAGTAAAAGACCAATAGCGTTTTACAGCCACACATTGGCACTGCGTGACCGAGTCAAACCAGTATACGAGAGGGAATTAATGGCAGTGGTAATGGCAGTTCAACGCTGGCGACCCTATTTACTTGGGAGGACGTTTATAGTTAAAACAGACCAGAAATCACTGAAATTCTTGCTGGAGCAGAGAGTCATCCAACCGCAATATCAGAAATGGATTGCAAAGCTGCTGGGGTACTCTTTCGAGGTGATGTACAAACCAGGGTTGGAAAACAAGGCAGCAGATAGCCTCTCACGAATACCTCCAACTGCACATCTTAACCAACTAACCGCTCACACTCTGGTCGACATCAAAGTAATCCGAGAAGAGGTTGACAAAGATGAATATCTGAAGAATATTATAGACAGAATTCAGAAGGAGGAAGAGGTAAAGAACTACACTCTGCAACAAGGCATACTGAAATACAAAGGAAGGTTAGTAATCGCAAAGAACTCCTCATTAAGATCAGCGATTCTGCATACCTATCATGATTCAGTCCTAGGGGGCCATTCAGGATTCCTGAGAACATATAAACGGATAACAGGAGAGTTGTTTTGGGTAGGAATGAAGGGCGAAGTACGCAAGTACTGTGAAGAATGCATGACATGCCAGCGGAATAAAACCTTAGCCTTATCTCCAGCAGGATTATTGACTCCTCTCGAGGTACCAAAGAGAGTTTGGGAGGATATAACCATGGATTTCATCGAAGGATTGCC
TAAATCAATGGGGTTTAACGTCATATTCGTAGTGGTAGACCGCTTCAGCAAATATGCGCACTTCCTCAGCCTTAAACATCCCTTTGACGCAAAAATGGTAGCTGAATTATTCGTTAAAGAAGTGGTAAGATTACACGGGTTTCCACAGTCAATCGTATCTGATAGGGACAAGATCTTTCTGAGTCACTTTTGGAAAGAACTTTTTAGATTAGCGGGTACCAAGCTAAACCGAAGCACCGCCTACCATCCTCAGACAGACGGACAGACAGAGGTGGTCAACAGGTCAGTTGAAATCTATCTAAGATGCTTTTGCGGGGAAAGACCGAAGGATTGGGTGAAATGGTTATCCTGGGCTGAGTATTGGTATAATACAACATTCCAAAAATCGTTGGGGGTGACACCATTCCAAGCTGTGTACGGGAGGACCCCACCAGCCCTGCTATATTACGGAGAACGGGAAACTCCCAACTCAACCTTGGATGAACAACTGAAGGAAAGAGATGTAGCATTGGGAGCTTTGAAGGAACACCTACGCATAGCTCAAGACAAGATGAAAAGTTATGCTGACAAGAAAAGGAGACATGTCGAATTCGAAGAAGGAGATCAAGTTAAAGCGAATTGGTCAGGTGGCATATCGGCTGGAACTACGAGCGGCAGCAACAATCCACCCCGTGTTCCATGTGTCACAATTAAAAAAAAAGCCTTTGGAGAAAGTGCGAATAACGAGGAGCTGTTGCCATTTCTGACTGCCAACCACGAGTGGAAAGCCGTGCCACAGGAGACTCATGGTTATAGAAAAAACGAAGCAGGGGGATGGGAGGTTTTAATAAATTGGGAAGGCCTACCGCATCATGAAGCCACCTGGGAAGGCTATGACGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAACTGGACCGGGAAGGTGGGGACCACAAGGAAATGGAGTGCCGACGCACGATCCCCTAA
BLAST of CSPI05G28350 vs. Swiss-Prot
Match: YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-I PE=3 SV=2)

HSP 1 Score: 473.8 bits (1218), Expect = 4.7e-132
Identity = 286/831 (34.42%), Positives = 441/831 (53.07%), Query Frame = 1

Query: 86   ELPP------NRNIEHQIHIRGGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTS 145
            +LPP      N  ++H I I+ GA    ++PY    + ++E+ K+V +++ +  I PS S
Sbjct: 597  DLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKS 656

Query: 146  PYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEELFDELHGASLFSKIDLKAG 205
            P SSPV+LV KKDG++R CVDYR LN  T+ D FP+P ++ L   +  A +F+ +DL +G
Sbjct: 657  PCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSG 716

Query: 206  YHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFD 265
            YHQ+ M  +D  KTAF T  G YE+ VMPFGL NAP+TF   M   FR    +FV V+ D
Sbjct: 717  YHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYLD 776

Query: 266  DILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAKVEYLGHVLSGRGVEVDPEK 325
            DIL++S + EEH +H++ VLE L+   L   +KKC FAS + E+LG+ +  + +     K
Sbjct: 777  DILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHK 836

Query: 326  IRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLTQLLKLGAFKWDEETQEAFE 385
              A++ +P P  V++ + FLG+  YYRRF+ +   +A P+ QL      +W E+  +A E
Sbjct: 837  CAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPI-QLFICDKSQWTEKQDKAIE 896

Query: 386  KLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKRP------IAFYSHTLALRD 445
            KLK A+   P+L   +    + + TDAS  GIGAVL +          + ++S +L    
Sbjct: 897  KLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQ 956

Query: 446  RVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVIQPQYQKWIAKLLG 505
            +  P  E EL+ ++ A+  +R  L G+ F ++TD  SL  L  +     + Q+W+  L  
Sbjct: 957  KNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLAT 1016

Query: 506  YSFEVMYKPGLENKAADSLSR-----IPPTA-----------------------HLNQLT 565
            Y F + Y  G +N  AD++SR      P T+                       H+ +LT
Sbjct: 1017 YDFTLEYLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELT 1076

Query: 566  AHTLV--DIKVIREEVDKDEYLKNIIDRIQKEEEVKNYTLQQGILKYKGRLVIAKNSSLR 625
             H +   D+   R    K E           E   KNY+L+  ++ Y+ RLV+      +
Sbjct: 1077 QHNVTPEDMSAFRSYQKKLEL---------SETFRKNYSLEDEMIYYQDRLVVPIKQ--Q 1136

Query: 626  SAILHTYHDSVL-GGHSGFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPA 685
            +A++  YHD  L GGH G   T  +I+   +W  ++  + +Y   C+ CQ  K+      
Sbjct: 1137 NAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLH 1196

Query: 686  GLLTPLEVPKRVWEDITMDFIEGL-PKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVA 745
            GLL PL + +  W DI+MDF+ GL P S   N+I VVVDRFSK AHF++ +   DA  + 
Sbjct: 1197 GLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLI 1256

Query: 746  ELFVKEVVRLHGFPQSIVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNR 805
            +L  + +   HGFP++I SDRD    +  ++EL +  G K   S+A HPQTDGQ+E   +
Sbjct: 1257 DLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQ 1316

Query: 806  SVEIYLRCFCGERPKDWVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTP--PALLYYGER 865
            ++   LR +     ++W  +L   E+ YN+T  ++LG +PF+   G  P  PA+    E 
Sbjct: 1317 TLNRLLRAYVSTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEV 1376

Query: 866  ETPNSTLDEQLKERDVALGALKEHLRIAQDKMKSYADKKRRHVEFEEGDQV 871
               + T  E  K         KE L  AQ +M++  +++R+ +    GD V
Sbjct: 1377 NARSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHV 1413
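
The HSP summary lines that head each alignment (Score, Expect, Identity, Positives) follow a fixed layout, so their numbers can be extracted and the reported percentages re-derived from the counts. The following is a minimal Python sketch under that assumption; `parse_hsp` and the two regular expressions are hypothetical helpers written for this page's layout, not part of any BLAST toolkit. The pattern accepts both "Positives" and the "Postives" spelling that some report exports use.

```python
import re

# "HSP 1 Score: 473.8 bits (1218), Expect = 4.7e-132"
SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
# "Identity = 286/831 (34.42%), Positives = 441/831 (53.07%), Query Frame = 1"
IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp(score_line, identity_line):
    """Extract HSP statistics and recompute the percentages from the counts."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("line does not look like an HSP summary")
    identical, aligned, positive, aligned_pos = (int(g) for g in ident.groups())
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positive / aligned_pos, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 473.8 bits (1218), Expect = 4.7e-132",
    "Identity = 286/831 (34.42%), Positives = 441/831 (53.07%), Query Frame = 1",
)
print(hsp)
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, gives a quick consistency check when many hits are compared.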

BLAST of CSPI05G28350 vs. Swiss-Prot
Match: YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 472.6 bits (1215), Expect = 1.0e-131
Identity = 285/831 (34.30%), Positives = 441/831 (53.07%), Query Frame = 1

Query: 86   ELPP------NRNIEHQIHIRGGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTS 145
            +LPP      N  ++H I I+ GA    ++PY    + ++E+ K+V +++ +  I PS S
Sbjct: 571  DLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKS 630

Query: 146  PYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEELFDELHGASLFSKIDLKAG 205
            P SSPV+LV KKDG++R CVDYR LN  T+ D FP+P ++ L   +  A +F+ +DL +G
Sbjct: 631  PCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSG 690

Query: 206  YHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFD 265
            YHQ+ M  +D  KTAF T  G YE+ VMPFGL NAP+TF   M   FR    +FV V+ D
Sbjct: 691  YHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYLD 750

Query: 266  DILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAKVEYLGHVLSGRGVEVDPEK 325
            DIL++S + EEH +H++ VLE L+   L   +KKC FAS + E+LG+ +  + +     K
Sbjct: 751  DILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHK 810

Query: 326  IRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLTQLLKLGAFKWDEETQEAFE 385
              A++ +P P  V++ + FLG+  YYRRF+ +   +A P+ QL      +W E+  +A +
Sbjct: 811  CAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPI-QLFICDKSQWTEKQDKAID 870

Query: 386  KLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKRP------IAFYSHTLALRD 445
            KLK A+   P+L   +    + + TDAS  GIGAVL +          + ++S +L    
Sbjct: 871  KLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQ 930

Query: 446  RVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVIQPQYQKWIAKLLG 505
            +  P  E EL+ ++ A+  +R  L G+ F ++TD  SL  L  +     + Q+W+  L  
Sbjct: 931  KNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLAT 990

Query: 506  YSFEVMYKPGLENKAADSLSR-----IPPTA-----------------------HLNQLT 565
            Y F + Y  G +N  AD++SR      P T+                       H+ +LT
Sbjct: 991  YDFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELT 1050

Query: 566  AHTLV--DIKVIREEVDKDEYLKNIIDRIQKEEEVKNYTLQQGILKYKGRLVIAKNSSLR 625
             H +   D+   R    K E           E   KNY+L+  ++ Y+ RLV+      +
Sbjct: 1051 QHNVTPEDMSAFRSYQKKLEL---------SETFRKNYSLEDEMIYYQDRLVVPIKQ--Q 1110

Query: 626  SAILHTYHDSVL-GGHSGFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPA 685
            +A++  YHD  L GGH G   T  +I+   +W  ++  + +Y   C+ CQ  K+      
Sbjct: 1111 NAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLH 1170

Query: 686  GLLTPLEVPKRVWEDITMDFIEGL-PKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVA 745
            GLL PL + +  W DI+MDF+ GL P S   N+I VVVDRFSK AHF++ +   DA  + 
Sbjct: 1171 GLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLI 1230

Query: 746  ELFVKEVVRLHGFPQSIVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNR 805
            +L  + +   HGFP++I SDRD    +  ++EL +  G K   S+A HPQTDGQ+E   +
Sbjct: 1231 DLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQ 1290

Query: 806  SVEIYLRCFCGERPKDWVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTP--PALLYYGER 865
            ++   LR +     ++W  +L   E+ YN+T  ++LG +PF+   G  P  PA+    E 
Sbjct: 1291 TLNRLLRAYASTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEV 1350

Query: 866  ETPNSTLDEQLKERDVALGALKEHLRIAQDKMKSYADKKRRHVEFEEGDQV 871
               + T  E  K         KE L  AQ +M++  +++R+ +    GD V
Sbjct: 1351 NARSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHV 1387

BLAST of CSPI05G28350 vs. Swiss-Prot
Match: TF26_SCHPO (Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-6 PE=3 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 5.0e-126
Identity = 272/841 (32.34%), Positives = 441/841 (52.44%), Query Frame = 1

Query: 57   QIKTEENETELESILTVLTQYEDVFEEP--KELP-PNRNIEHQIHIRGGADPVNVRPYRY 116
            Q+    N  +   +  +  +++D+  E   ++LP P + +E ++ +      + +R Y  
Sbjct: 361  QMNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPL 420

Query: 117  AFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKF 176
               + + +   +++ + SGIIR S +  + PV+ V KK+G+ R  VDY+ LN    P+ +
Sbjct: 421  PPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIY 480

Query: 177  PIPVVEELFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTN 236
            P+P++E+L  ++ G+++F+K+DLK+ YH +R+   D  K AFR   G +E+LVMP+G++ 
Sbjct: 481  PLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGIST 540

Query: 237  APATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKK 296
            APA FQ  +N+I        V+ + DDIL++S++  EH +H++ VL+ L+   L  N+ K
Sbjct: 541  APAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAK 600

Query: 297  CCFASAKVEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYG 356
            C F  ++V+++G+ +S +G     E I  V QW  P N +E+R FLG   Y R+F+    
Sbjct: 601  CEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTS 660

Query: 357  SLAAPLTQLLKLGA-FKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIG 416
             L  PL  LLK    +KW     +A E +K+ +++ P+L   DFS    +ETDAS   +G
Sbjct: 661  QLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVG 720

Query: 417  AVLMQSK-----RPIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLG--RTFIVK 476
            AVL Q        P+ +YS  ++       V ++E++A++ +++ WR YL      F + 
Sbjct: 721  AVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKIL 780

Query: 477  TDQKSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSR-------IP 536
            TD ++L  +   E      +  +W   L  ++FE+ Y+PG  N  AD+LSR       IP
Sbjct: 781  TDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIP 840

Query: 537  PTAHLNQLTAHTLVDI-----KVIREEVDKDEYLKNIIDRIQKEEEVKNYTLQQGIL-KY 596
              +  N +     + I       +  E   D  L N+++   K  E +N  L+ G+L   
Sbjct: 841  KDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVE-ENIQLKDGLLINS 900

Query: 597  KGRLVIAKNSSLRSAILHTYHDSVLGGHSGFLRTYKRITGELFWVGMKGEVRKYCEECMT 656
            K ++++  ++ L   I+  YH+     H G       I     W G++ ++++Y + C T
Sbjct: 901  KDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHT 960

Query: 657  CQRNKTLALSPAGLLTPLEVPKRVWEDITMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLS 716
            CQ NK+    P G L P+   +R WE ++MDFI  LP+S G+N +FVVVDRFSK A  + 
Sbjct: 961  CQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVP 1020

Query: 717  LKHPFDAKMVAELFVKEVVRLHGFPQSIVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHP 776
                  A+  A +F + V+   G P+ I++D D IF S  WK+        +  S  Y P
Sbjct: 1021 CTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRP 1080

Query: 777  QTDGQTEVVNRSVEIYLRCFCGERPKDWVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTP 836
            QTDGQTE  N++VE  LRC C   P  WV  +S  +  YN     +  +TPF+ V+ R  
Sbjct: 1081 QTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-RYS 1140

Query: 837  PALLYYGERETPNSTLDEQLKERDVALGALKEHLRIAQDKMKSYADKKRRHV-EFEEGDQ 871
            PAL    E  + +   DE  +E       +KEHL     KMK Y D K + + EF+ GD 
Sbjct: 1141 PALSPL-ELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDL 1198

BLAST of CSPI05G28350 vs. Swiss-Prot
Match: TF25_SCHPO (Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 5.0e-126
Identity = 272/841 (32.34%), Positives = 441/841 (52.44%), Query Frame = 1

Query: 57   QIKTEENETELESILTVLTQYEDVFEEP--KELP-PNRNIEHQIHIRGGADPVNVRPYRY 116
            Q+    N  +   +  +  +++D+  E   ++LP P + +E ++ +      + +R Y  
Sbjct: 361  QMNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPL 420

Query: 117  AFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKF 176
               + + +   +++ + SGIIR S +  + PV+ V KK+G+ R  VDY+ LN    P+ +
Sbjct: 421  PPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIY 480

Query: 177  PIPVVEELFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTN 236
            P+P++E+L  ++ G+++F+K+DLK+ YH +R+   D  K AFR   G +E+LVMP+G++ 
Sbjct: 481  PLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGIST 540

Query: 237  APATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKK 296
            APA FQ  +N+I        V+ + DDIL++S++  EH +H++ VL+ L+   L  N+ K
Sbjct: 541  APAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAK 600

Query: 297  CCFASAKVEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYG 356
            C F  ++V+++G+ +S +G     E I  V QW  P N +E+R FLG   Y R+F+    
Sbjct: 601  CEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTS 660

Query: 357  SLAAPLTQLLKLGA-FKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIG 416
             L  PL  LLK    +KW     +A E +K+ +++ P+L   DFS    +ETDAS   +G
Sbjct: 661  QLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVG 720

Query: 417  AVLMQSK-----RPIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLG--RTFIVK 476
            AVL Q        P+ +YS  ++       V ++E++A++ +++ WR YL      F + 
Sbjct: 721  AVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKIL 780

Query: 477  TDQKSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSR-------IP 536
            TD ++L  +   E      +  +W   L  ++FE+ Y+PG  N  AD+LSR       IP
Sbjct: 781  TDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIP 840

Query: 537  PTAHLNQLTAHTLVDI-----KVIREEVDKDEYLKNIIDRIQKEEEVKNYTLQQGIL-KY 596
              +  N +     + I       +  E   D  L N+++   K  E +N  L+ G+L   
Sbjct: 841  KDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVE-ENIQLKDGLLINS 900

Query: 597  KGRLVIAKNSSLRSAILHTYHDSVLGGHSGFLRTYKRITGELFWVGMKGEVRKYCEECMT 656
            K ++++  ++ L   I+  YH+     H G       I     W G++ ++++Y + C T
Sbjct: 901  KDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHT 960

Query: 657  CQRNKTLALSPAGLLTPLEVPKRVWEDITMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLS 716
            CQ NK+    P G L P+   +R WE ++MDFI  LP+S G+N +FVVVDRFSK A  + 
Sbjct: 961  CQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVP 1020

Query: 717  LKHPFDAKMVAELFVKEVVRLHGFPQSIVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHP 776
                  A+  A +F + V+   G P+ I++D D IF S  WK+        +  S  Y P
Sbjct: 1021 CTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRP 1080

Query: 777  QTDGQTEVVNRSVEIYLRCFCGERPKDWVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTP 836
            QTDGQTE  N++VE  LRC C   P  WV  +S  +  YN     +  +TPF+ V+ R  
Sbjct: 1081 QTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-RYS 1140

Query: 837  PALLYYGERETPNSTLDEQLKERDVALGALKEHLRIAQDKMKSYADKKRRHV-EFEEGDQ 871
            PAL    E  + +   DE  +E       +KEHL     KMK Y D K + + EF+ GD 
Sbjct: 1141 PALSPL-ELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDL 1198

BLAST of CSPI05G28350 vs. Swiss-Prot
Match: TF24_SCHPO (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 5.0e-126
Identity = 272/841 (32.34%), Positives = 441/841 (52.44%), Query Frame = 1

Query: 57   QIKTEENETELESILTVLTQYEDVFEEP--KELP-PNRNIEHQIHIRGGADPVNVRPYRY 116
            Q+    N  +   +  +  +++D+  E   ++LP P + +E ++ +      + +R Y  
Sbjct: 361  QMNKVSNIVKEPELPDIYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPL 420

Query: 117  AFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKF 176
               + + +   +++ + SGIIR S +  + PV+ V KK+G+ R  VDY+ LN    P+ +
Sbjct: 421  PPGKMQAMNDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIY 480

Query: 177  PIPVVEELFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTN 236
            P+P++E+L  ++ G+++F+K+DLK+ YH +R+   D  K AFR   G +E+LVMP+G++ 
Sbjct: 481  PLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGIST 540

Query: 237  APATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKK 296
            APA FQ  +N+I        V+ + DDIL++S++  EH +H++ VL+ L+   L  N+ K
Sbjct: 541  APAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAK 600

Query: 297  CCFASAKVEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYG 356
            C F  ++V+++G+ +S +G     E I  V QW  P N +E+R FLG   Y R+F+    
Sbjct: 601  CEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTS 660

Query: 357  SLAAPLTQLLKLGA-FKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIG 416
             L  PL  LLK    +KW     +A E +K+ +++ P+L   DFS    +ETDAS   +G
Sbjct: 661  QLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVG 720

Query: 417  AVLMQSK-----RPIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLG--RTFIVK 476
            AVL Q        P+ +YS  ++       V ++E++A++ +++ WR YL      F + 
Sbjct: 721  AVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKIL 780

Query: 477  TDQKSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSR-------IP 536
            TD ++L  +   E      +  +W   L  ++FE+ Y+PG  N  AD+LSR       IP
Sbjct: 781  TDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIP 840

Query: 537  PTAHLNQLTAHTLVDI-----KVIREEVDKDEYLKNIIDRIQKEEEVKNYTLQQGIL-KY 596
              +  N +     + I       +  E   D  L N+++   K  E +N  L+ G+L   
Sbjct: 841  KDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVE-ENIQLKDGLLINS 900

Query: 597  KGRLVIAKNSSLRSAILHTYHDSVLGGHSGFLRTYKRITGELFWVGMKGEVRKYCEECMT 656
            K ++++  ++ L   I+  YH+     H G       I     W G++ ++++Y + C T
Sbjct: 901  KDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHT 960

Query: 657  CQRNKTLALSPAGLLTPLEVPKRVWEDITMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLS 716
            CQ NK+    P G L P+   +R WE ++MDFI  LP+S G+N +FVVVDRFSK A  + 
Sbjct: 961  CQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVP 1020

Query: 717  LKHPFDAKMVAELFVKEVVRLHGFPQSIVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHP 776
                  A+  A +F + V+   G P+ I++D D IF S  WK+        +  S  Y P
Sbjct: 1021 CTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRP 1080

Query: 777  QTDGQTEVVNRSVEIYLRCFCGERPKDWVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTP 836
            QTDGQTE  N++VE  LRC C   P  WV  +S  +  YN     +  +TPF+ V+ R  
Sbjct: 1081 QTDGQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVH-RYS 1140

Query: 837  PALLYYGERETPNSTLDEQLKERDVALGALKEHLRIAQDKMKSYADKKRRHV-EFEEGDQ 871
            PAL    E  + +   DE  +E       +KEHL     KMK Y D K + + EF+ GD 
Sbjct: 1141 PALSPL-ELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDL 1198

BLAST of CSPI05G28350 vs. TrEMBL
Match: A0A087G3S6_ARAAL (Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs46225U000100 PE=4 SV=1)

HSP 1 Score: 1037.7 bits (2682), Expect = 9.0e-300
Identity = 526/1013 (51.92%), Positives = 691/1013 (68.21%), Query Frame = 1

Query: 1    MDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKT 60
            ++W+  TM F  N + V L+GD  L    +SLK L KS  +   G L+E   L+A ++ T
Sbjct: 538  VNWKLQTMKFMLNEELVKLQGDAGLCCAPISLKALWKSLADQGQGVLVEYCGLQA-ELHT 597

Query: 61   EENETEL-ESILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKE 120
            +    +L   +LTVL Q+  VFE+P+ LPP+R  EH I +   A PV+VRP+RY   Q+E
Sbjct: 598  QRRREQLPHQLLTVLEQFARVFEDPQGLPPSRGKEHNIVLEPNAKPVSVRPFRYPQAQRE 657

Query: 121  ELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVE 180
            E+EK V  M+A+G+I+ S SP+SSPVLLVKKKDGSWRFCVDYRALN +T+PD FPIP+++
Sbjct: 658  EVEKQVASMLAAGLIQASGSPFSSPVLLVKKKDGSWRFCVDYRALNKVTIPDSFPIPMID 717

Query: 181  ELFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQ 240
            +L DELHGA++FSK+DLK+GYHQ+ + + D+ KTAFRTH+GHYEFLVMPFGLTNAPATFQ
Sbjct: 718  QLLDELHGATIFSKLDLKSGYHQILVKAEDVAKTAFRTHDGHYEFLVMPFGLTNAPATFQ 777

Query: 241  SLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASA 300
            SLMN +FR YLRKFVLVFFDDILVYS++L+EH QH+ LVLE+L++H+LFAN+KKC F   
Sbjct: 778  SLMNDVFRGYLRKFVLVFFDDILVYSKSLQEHQQHLGLVLELLQQHQLFANKKKCEFGRT 837

Query: 301  KVEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPL 360
            ++EYLGHV+SG+GV  DPEKI+A+  WP P NV+ +RGFLGLTGYYR+FV  YG +A PL
Sbjct: 838  ELEYLGHVVSGKGVAADPEKIQAMVSWPEPQNVKALRGFLGLTGYYRKFVQRYGEIARPL 897

Query: 361  TQLLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSK 420
            T LLK   F+W  E   AF+KLK+AM TVP+L L DF+  F VE+DASG G+GAVLMQS+
Sbjct: 898  TALLKKDQFQWTAEATVAFQKLKKAMSTVPVLALVDFTEQFVVESDASGTGLGAVLMQSQ 957

Query: 421  RPIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRV 480
            RP+A++S  L  R R+K VYERELMA+V A+Q+WR YLLGR F+V+TDQKSLKFLLEQR 
Sbjct: 958  RPLAYFSQALTERQRLKSVYERELMAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLLEQRE 1017

Query: 481  IQPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSRIPPTAHLNQLTAHTLVDIKVIREEV 540
            I  +YQKW+ KLLG+ FE+ YKPGLENKAAD+LSR      L  L+    + ++ I  EV
Sbjct: 1018 INMEYQKWLTKLLGFDFEIQYKPGLENKAADALSRKDMALQLCALSIPAAIQLEQINTEV 1077

Query: 541  DKDEYLKNIIDRI-QKEEEVKNYTLQQGILKYKGRLVIAKNSSLRSAILHTYHDSVLGGH 600
            D D  L+ + + + Q       +++ QG L  KG+LV+   S L + IL  +H+  LGGH
Sbjct: 1078 DNDPDLRKLKEEVLQDAASHSEFSVVQGRLLRKGKLVVPAQSRLVNVILQEFHNGKLGGH 1137

Query: 601  SGFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKRVWEDI 660
             G L+T KR+    +W GM   +R++   C  CQR+K   L+PAGLL PL +P +VWEDI
Sbjct: 1138 GGVLKTQKRVEAIFYWKGMMSRIREFVAACQVCQRHKYSTLAPAGLLQPLPIPDQVWEDI 1197

Query: 661  TMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHGFPQSI 720
            +MDF+EGLPKS GF V+ VVVDR +KYAHF+S+KHP  A  VA +F KEVV+LHGFP++I
Sbjct: 1198 SMDFVEGLPKSEGFEVVMVVVDRLTKYAHFISMKHPVTAVEVALIFTKEVVKLHGFPKTI 1257

Query: 721  VSDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGERPKDW 780
            VSDRD +F   FW E+FRLAGT L  STAYHPQ+DGQTEV NR +E  LRCF  ++P+ W
Sbjct: 1258 VSDRDPLFTGRFWTEMFRLAGTSLCFSTAYHPQSDGQTEVTNRGMETLLRCFSSDKPRCW 1317

Query: 781  VKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKERDVALG 840
            V++L WAE  YNT++  ++ ++PFQAVYGR PP L+ +    T N+ L+ +L+ERD  + 
Sbjct: 1318 VQFLHWAELCYNTSYHTAIKMSPFQAVYGREPPTLIKFETGSTSNADLEGKLRERDAMIH 1377

Query: 841  ALKEHLRIAQDKMKSYADKKRRHVEFEEGD--------------------QVKANWSGGI 900
             +K+H+  AQ  MK++AD  RR V F  GD                    ++ A + G  
Sbjct: 1378 IIKQHILKAQQTMKNHADGHRREVVFSVGDLVFLRLKPYRQKTLAKRVNEKLAARFYGPY 1437

Query: 901  SAGTTSGS-NNPPRVPC---------VTIKKKAFGESANNEELLPFLTANHEWKAVPQET 960
                  G+     ++P          V++ K A G S     L   LT     +  P+  
Sbjct: 1438 EVEERIGAVAYKLKLPVGSKIHNTFHVSLLKPAIGSSLEPATLPTQLTDERVLEVAPEAH 1497

Query: 961  HGYRKNE-AGGWEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREG 981
             G+R +   G  EVLI W+ LP H++TWE      + FP+F LEDKV     G
Sbjct: 1498 MGFRIHPITGQEEVLIKWKELPEHDSTWEWTRVMAEQFPEFDLEDKVLFKAPG 1549

BLAST of CSPI05G28350 vs. TrEMBL
Match: E2DMZ5_BETVU (Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1)

HSP 1 Score: 1036.6 bits (2679), Expect = 2.0e-299
Identity = 523/1014 (51.58%), Positives = 687/1014 (67.75%), Query Frame = 1

Query: 2    DWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKTE 61
            +W+  T+ +   ++ V L+G+P+L++T+VSLK + ++  +   G+L++   + + +    
Sbjct: 520  NWKTQTLQYKEGNETVTLRGNPALSRTEVSLKAMYRTLRKEGGGFLVDLNQMASHEGLPR 579

Query: 62   ENETELESILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKEEL 121
            E       +  +L+ Y+ VF  P  LPP+R   H I+++ G +PV+VRPYRY   QK+E+
Sbjct: 580  ELPEVPSCLQPLLSSYQQVFNMPLGLPPDRGHVHAINLQHGTNPVSVRPYRYPQSQKDEI 639

Query: 122  EKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEEL 181
            E+L+ +M+A+GII+ S S +SSPVLLVKKKDGSWRFCVDYRALNN+TVPDK+PIP+++EL
Sbjct: 640  EQLIHDMLAAGIIQQSHSAFSSPVLLVKKKDGSWRFCVDYRALNNVTVPDKYPIPIIDEL 699

Query: 182  FDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSL 241
             DELHGA +FSK+DLK+GYHQ++M   D+ KTAFRTHEGHYEFLVMPFGLTNAPATFQ+L
Sbjct: 700  LDELHGACVFSKLDLKSGYHQIKMKPSDVHKTAFRTHEGHYEFLVMPFGLTNAPATFQAL 759

Query: 242  MNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAKV 301
            MN +F+ YLRKFVLVFFDDILVYS +LE+H  H+ +VL +L  + LFAN KKC F   +V
Sbjct: 760  MNEVFKPYLRKFVLVFFDDILVYSTSLEQHMHHLNVVLGLLATNHLFANLKKCEFGKEEV 819

Query: 302  EYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLTQ 361
             YLGH++S +GV +DP K++A+  W +P+ +RE+RGFLGLTGYYRRFV  Y S+A PLT 
Sbjct: 820  AYLGHIISSKGVAMDPSKVQAMMDWSIPSTLRELRGFLGLTGYYRRFVKGYASIAHPLTN 879

Query: 362  LLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKRP 421
             LK  +F W      AFE LKRA+   P+L +P+FS+PF +E DASGYG+GAVL+Q   P
Sbjct: 880  QLKKDSFGWSPAATRAFETLKRALTEAPVLQMPNFSLPFVIEADASGYGLGAVLLQQGHP 939

Query: 422  IAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVIQ 481
            IA++S TL  R R K +YE+ELMAVVMAVQ+W+ +LLGR F++ +DQ+SL+ LL QR I 
Sbjct: 940  IAYFSKTLGERARAKSIYEKELMAVVMAVQKWKHFLLGRHFVIHSDQQSLRHLLNQREIG 999

Query: 482  PQYQKWIAKLLGYSFEVMYKPGLENKAADSLSR-IPPTAHLNQLTAHTLVDIKVIREEVD 541
            P YQKW+ KLLG+ FE+ YKPG  NK AD+LSR  PP A  N LT+      ++I + + 
Sbjct: 1000 PAYQKWVGKLLGFDFEIKYKPGGHNKVADALSRKHPPEAEYNLLTSSHSPHQELIAQAIR 1059

Query: 542  KDEYLKNIIDRIQK-EEEVKNYTLQQGILKYKGRLVIAKNSSLRSAILHTYHDSVLGGHS 601
            +D  L++++  +      ++ +T++ G+LKY GRLVI KN  L + +L  YH S +GGHS
Sbjct: 1060 QDADLQHLMAEVTAGRTPLQGFTVEHGLLKYNGRLVIPKNVPLTTTLLEEYHSSPMGGHS 1119

Query: 602  GFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKRVWEDIT 661
            G  +TYKR+ GE +W GMK +V  + + C  CQ+ KT  LSPAGLL PL +P  +WEDI+
Sbjct: 1120 GIFKTYKRLAGEWYWKGMKKDVTTFVQNCQICQQFKTSTLSPAGLLQPLPIPLAIWEDIS 1179

Query: 662  MDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHGFPQSIV 721
            MDF+EGLPKS G++ I VVVDR SKYAHF++LKHPF A  VA +F+KE+V+LHGFP +IV
Sbjct: 1180 MDFVEGLPKSQGWDTILVVVDRLSKYAHFITLKHPFTAPTVAAVFIKEIVKLHGFPSTIV 1239

Query: 722  SDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGERPKDWV 781
            SDRDK+F+S FWKELF+L GT L+RSTAYHPQ+DGQTEVVN+S+E YLRCFC  RPK W 
Sbjct: 1240 SDRDKVFMSLFWKELFKLQGTLLHRSTAYHPQSDGQTEVVNKSLEAYLRCFCNGRPKAWA 1299

Query: 782  KWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKERDVALGA 841
            +W+SWAEYWYNT+   S   TPF+ VYGR  P L  + +  T   +L+EQL +RD  L  
Sbjct: 1300 QWISWAEYWYNTSTHSSSHFTPFKIVYGRDSPPLFRFEKGSTAIFSLEEQLLDRDATLDE 1359

Query: 842  LKEHLRIAQDKMKSYADKKRRHVEFEEGDQV--KANWSGGISAGTTSGSNNPPRV--PCV 901
            LK HL  AQ+ MK   DK RR V FE G  V  K       S          PR   P  
Sbjct: 1360 LKFHLLEAQNSMKIQEDKHRRAVHFEPGAMVYLKIQPYRHQSLAKKRNEKLAPRFYGPFS 1419

Query: 902  TIK--------------------------KKAFGESANNEELLPFLTANHEWKAVPQETH 961
             +K                          KKA G   ++  + P LT +    A P+   
Sbjct: 1420 VLKRIGQVAYQLQLPLGAKLHPVFHISQLKKAVGSLQSSPTIPPQLTNDLVLDAQPESLL 1479

Query: 962  GYR---KNEAGGWEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREG 981
              R   +  A   EVLI W  LP  EATWE    F   FPDFHLEDKV L+ EG
Sbjct: 1480 NIRSHPQKPAEVTEVLIKWLNLPAFEATWEDAALFNARFPDFHLEDKV-LNWEG 1532
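
The percentages in each HSP summary line follow directly from the residue counts. The sketch below parses the summary line of the Beta vulgaris hit above and recomputes both percentages. The function name `parse_hsp_header` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern only targets the line layout used on this page.

```python
import re

# Summary line copied from the E2DMZ5_BETVU hit above.
HEADER = "Identity = 523/1014 (51.58%), Postives = 687/1014 (67.75%), Query Frame = 1"

# "Posi?tives" accepts both the "Postives" and "Positives" spellings of the label.
PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_header(line):
    """Return counts and reported percentages from one HSP summary line."""
    match = PATTERN.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    if length != pos_length:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned columns, before rounding."""
    return 100.0 * count / length


hsp = parse_hsp_header(HEADER)
identity_check = recomputed_pct(hsp["identities"], hsp["length"])
positives_check = recomputed_pct(hsp["positives"], hsp["length"])

# The reported values are rounded to two decimals, so allow half a unit
# in the last reported digit.
assert abs(identity_check - hsp["identity_pct"]) <= 0.005 + 1e-9
assert abs(positives_check - hsp["positives_pct"]) <= 0.005 + 1e-9
print(hsp)
```

The denominator is the alignment length (aligned columns, gaps included), not the length of the query or the subject protein.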

BLAST of CSPI05G28350 vs. TrEMBL
Match: A0A087GEK8_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1)

HSP 1 Score: 1026.5 bits (2653), Expect = 2.1e-296
Identity = 504/1012 (49.80%), Positives = 693/1012 (68.48%), Query Frame = 1

Query: 1    MDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKT 60
            ++W    M F    +  VL+GDP    + +SLK+L ++  +   G L+E   L++     
Sbjct: 512  VNWGRQYMRFSLGGETAVLQGDPGQGCSAISLKSLMRAVKDQGVGLLVEYNGLQSLDQVA 571

Query: 61   EENETELESILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKEE 120
                   +++++V+ Q+  VFE+P+ LPP R   H+I++  GA  V+VRP+RY   QK E
Sbjct: 572  GFTTEVPQALVSVMDQFPQVFEDPQGLPPTRGRAHEINLESGAKAVSVRPFRYPQTQKAE 631

Query: 121  LEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEE 180
            +EK V  M+A+GII+ STS +SSPVLLVKKKDGSWRFC+DYRALN +T+PD FPIP++++
Sbjct: 632  IEKQVTAMLAAGIIQESTSTFSSPVLLVKKKDGSWRFCIDYRALNKVTIPDSFPIPMIDQ 691

Query: 181  LFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQS 240
            L DELHGA++FSK+DLK+GYHQ+ +  +++ KTAFRTH+GHYEFLVMPFGLTNAP TFQ+
Sbjct: 692  LLDELHGATVFSKLDLKSGYHQILVKPQNVPKTAFRTHDGHYEFLVMPFGLTNAPTTFQA 751

Query: 241  LMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAK 300
            LMN +FR++LRKFVLVFFDDILVYS +L+EH +H+ +VL++L + +LFAN+KKC F S+ 
Sbjct: 752  LMNEVFRAHLRKFVLVFFDDILVYSSSLQEHQEHLRVVLQILFQQQLFANKKKCQFGSSS 811

Query: 301  VEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLT 360
            +EYLGHV+SG GV  DP K++A+  WP+P N++ +RGFLGLTGYYRRFV  YGS+A PLT
Sbjct: 812  IEYLGHVISGEGVSADPSKLQAMVSWPLPKNIKALRGFLGLTGYYRRFVQGYGSIAKPLT 871

Query: 361  QLLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKR 420
             LLK   F+W EE   AFEKLK AM TVP+L L DFS  F VE+DASG G+GAVL+Q ++
Sbjct: 872  SLLKKDKFQWSEEATVAFEKLKVAMSTVPVLALVDFSELFVVESDASGIGLGAVLLQKQK 931

Query: 421  PIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVI 480
            P+A++S  L  R ++K VYERELMA+V A+Q+WR YLLGR F+V+TDQKSLKFLLEQR +
Sbjct: 932  PVAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKFLVRTDQKSLKFLLEQREV 991

Query: 481  QPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSRIPPTAHLNQLTAHTLVDIKVIREEVD 540
              +YQ+W+ K+LG++F++ YKPGLENKAAD+LSR+     L  L+    + ++ I EEVD
Sbjct: 992  NLEYQQWLTKILGFNFDIHYKPGLENKAADALSRVEGLPQLYALSVPAAIQLEEINEEVD 1051

Query: 541  KDEYLKNIIDRIQKEEEVKN-YTLQQGILKYKGRLVIAKNSSLRSAILHTYHDSVLGGHS 600
            ++   K I + +  +    + Y++ QG L Y G+LV+ K S L   +LH +H+S +GGH 
Sbjct: 1052 RNPVSKKIKEEVLLDASTHSGYSVVQGRLLYNGKLVLPKESYLIKVLLHEFHNSRMGGHG 1111

Query: 601  GFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKRVWEDIT 660
            G L+T + +    +W GM  +++ +  EC+ CQ++K   L+P+GLL PL +P +VWEDI+
Sbjct: 1112 GVLKTQRHLGALFYWQGMMADIKTFVAECVVCQKHKYSTLAPSGLLQPLPIPTQVWEDIS 1171

Query: 661  MDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHGFPQSIV 720
            +DF+EGLPKS GF+ I VVVDR +KYAHF+ L+HPF AK +A +F++E+VRLHG+P ++V
Sbjct: 1172 LDFVEGLPKSEGFDAILVVVDRLTKYAHFIKLQHPFGAKEIAAVFIQEIVRLHGYPSTMV 1231

Query: 721  SDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGERPKDWV 780
            SDRD +F   FW ELFRLAGT LN STAYHPQTDGQTEV NR +E  LRCF  ++PK W 
Sbjct: 1232 SDRDTLFTGMFWTELFRLAGTSLNFSTAYHPQTDGQTEVTNRGLETILRCFTSDKPKKWA 1291

Query: 781  KWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKERDVALGA 840
             +L WAE+ YN+++  ++ +TPF+A+YGR PP+LL + +  T N+ L+ QLKERD  +  
Sbjct: 1292 AYLPWAEFCYNSSYHSAIQMTPFKALYGRDPPSLLRFEDGSTTNANLETQLKERDAMIVI 1351

Query: 841  LKEHLRIAQDKMKSYADKKRRHVEFEEGDQV--------KANWSGGISAGTTSGSNNPPR 900
            LK+++  AQ  MK  AD  RR VEF+ GD V        + + +  ++    +    P  
Sbjct: 1352 LKQNILKAQQLMKHRADGHRREVEFKVGDMVFLKLKPYRQQSLARRVNEKLAARFYGPYE 1411

Query: 901  VPC----------------------VTIKKKAFGESANNEELLPFLTANHEWKAVPQETH 960
            V                        V+  K A G S     L P LTA +  +A P+   
Sbjct: 1412 VLARVGVVAYQLKLPADSKIHDTFHVSQLKLAVGSSFQPAALPPHLTAENVLEAEPEAHM 1471

Query: 961  GYRKNEAGGW-EVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREG 981
            G R N   G  EVLI W+GLP  ++TWE     Q+ FP+F LEDK      G
Sbjct: 1472 GVRINSRSGQQEVLIKWKGLPECDSTWEWVGVIQEQFPEFDLEDKALFKAAG 1523

BLAST of CSPI05G28350 vs. TrEMBL
Match: Q9SQW9_ARATH (Putative retroelement pol polyprotein OS=Arabidopsis thaliana GN=F23H6.1 PE=4 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 3.6e-296
Identity = 518/1020 (50.78%), Positives = 683/1020 (66.96%), Query Frame = 1

Query: 2    DWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKTE 61
            +WR+L +S+      V L GDP L + Q+S++++ +    T T YL+E  +L   + K +
Sbjct: 613  NWRDLRISWQIGRTWVSLYGDPDLCRGQISMRSMERVIKYTGTAYLLELASL--FESKKQ 672

Query: 62   ENETELE-SILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKEE 121
            E +T L+ +I  +L QY+ VF+ P+ LPP RN EH I ++ G+ PVN+RPYRY+F QK E
Sbjct: 673  EEQTALQPAIQRLLDQYQGVFQTPQLLPPVRNREHAITLQEGSSPVNIRPYRYSFAQKNE 732

Query: 122  LEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEE 181
            +EKLV EM+ + IIRPS SPYSSPVLLVKKKDG WRFCVDYRALN  T+PDK+PIPV+EE
Sbjct: 733  IEKLVREMLNAQIIRPSVSPYSSPVLLVKKKDGGWRFCVDYRALNEATIPDKYPIPVIEE 792

Query: 182  LFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQS 241
            L DEL GA++FSK+DLK+GY Q+RM   D+EKTAF+THEGHYEFLVMPFGLTNAP+TFQS
Sbjct: 793  LLDELKGATVFSKLDLKSGYFQIRMKLSDVEKTAFKTHEGHYEFLVMPFGLTNAPSTFQS 852

Query: 242  LMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAK 301
            +MN +FR YLRKFVLVFFDDILVYS +++ H +H+E VL++L  H+ +AN KKC F S +
Sbjct: 853  VMNDLFRPYLRKFVLVFFDDILVYSPDMKTHLKHLETVLQLLHLHQFYANFKKCTFGSTR 912

Query: 302  VEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLT 361
            + YLGH++S +GV  DPEK+ A+ QWP+P +V E+RGFLG TGYYRRFV +YG +A PL 
Sbjct: 913  ISYLGHIISEQGVATDPEKVEAMLQWPLPKSVTELRGFLGFTGYYRRFVKNYGQIARPLR 972

Query: 362  QLLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKR 421
              LK  +F W+E    AF+ LK A+  +P+L LPDF   F VETDASG GIGAVL Q+KR
Sbjct: 973  DQLKKNSFDWNEAATSAFQALKAAVSALPVLVLPDFQQEFTVETDASGMGIGAVLSQNKR 1032

Query: 422  PIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVI 481
             IAF S   + + R++ VYEREL+A+V AV +W+ YL  + FI+KTDQ+SL+ LLEQ+ +
Sbjct: 1033 LIAFLSQAFSSQGRIRSVYERELLAIVKAVTKWKHYLSSKEFIIKTDQRSLRHLLEQKSV 1092

Query: 482  QPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSRIPPTAHLNQL--TAHTLVDIKVIREE 541
                Q+W +KL G  + + YKPG++NK AD+LSR PPT  L+QL  T    +D+  ++ E
Sbjct: 1093 STIQQRWASKLSGLKYRIEYKPGVDNKVADALSRRPPTEALSQLTITGPPTIDLTALKAE 1152

Query: 542  VDKDEYLKNIIDR-IQKEEEVKNYTLQQGILKYKGRLVIAKNSSLRSAILHTYHDSVLGG 601
            + +D  L  I+    Q +    ++T+  G++  KG LVI   S     +L  +H S +GG
Sbjct: 1153 IQQDHELSQILKNWAQGDHHDSDFTVADGLIYRKGCLVIPVGSPFIPKMLEKFHTSPIGG 1212

Query: 602  HSGFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKRVWED 661
            H G L+T+KR+T E++W G++ +V  Y + C  CQ NK   LSPAGLL+PL +P+++W D
Sbjct: 1213 HEGALKTFKRLTSEVYWRGLRKDVVNYIKGCQICQENKYSTLSPAGLLSPLPIPQQIWSD 1272

Query: 662  ITMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHGFPQS 721
            +++DF+EGLP S  FN I VVVDR SKY+HF+ LKHPF AK V E F+++VV+LHGFP +
Sbjct: 1273 VSLDFVEGLPSSNRFNCILVVVDRLSKYSHFIPLKHPFTAKTVVEAFIRDVVKLHGFPNT 1332

Query: 722  IVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGERPKD 781
            +VSDRD+IFLS FW ELF+L GT L +STAYHPQTDGQTEVVNR +E YLRCF G RP  
Sbjct: 1333 LVSDRDRIFLSGFWSELFKLQGTGLQKSTAYHPQTDGQTEVVNRCLESYLRCFAGRRPTS 1392

Query: 782  WVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKERDVAL 841
            W +WL WAEYWYNT++  +   TPFQAVYGR PP LL YG+  T N+ ++E LK+RD  L
Sbjct: 1393 WFQWLPWAEYWYNTSYHSATKTTPFQAVYGREPPVLLRYGDIPTNNANVEELLKDRDGML 1452

Query: 842  GALKEHLRIAQDKMKSYADKKRRHVEFEEGDQVKANWSGGISAGTTSGSNNP-------- 901
              L+E+L IAQ +MK  ADK RR V FE  + V         +      N          
Sbjct: 1453 VELRENLEIAQAQMKKAADKSRRDVAFEIDEWVYLKLRPYRQSSVAHRKNEKLSQRYFGP 1512

Query: 902  ----PRVPCVTIK------------------KKAFGESANNEELLPFLTANHEWKAVPQE 961
                 R+  V  K                  K+A   S   +EL   L+   EW   P++
Sbjct: 1513 FKVLHRIGQVAYKLQLPEHSTIHPVFHVSQLKRAVPPSFTPQELPKILSPTLEWNTGPEK 1572

Query: 962  THGYRK-NEAGGWEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREGGDHKEM 987
                R+ N   G EVL+ W GL   E+TWE      Q +PDF LEDKV L R   D  ++
Sbjct: 1573 LLDIRQSNTNSGPEVLVQWSGLSTLESTWEPLLTLVQQYPDFDLEDKVSLLRGSIDRLQV 1630

BLAST of CSPI05G28350 vs. TrEMBL
Match: A0A087HNF3_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA1G155400 PE=4 SV=1)

HSP 1 Score: 1017.3 bits (2629), Expect = 1.3e-293
Identity = 526/1019 (51.62%), Positives = 680/1019 (66.73%), Query Frame = 1

Query: 2    DWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKTE 61
            +W+   + F  N + V LKGDPSL  T VSLK L K+  +   G ++E       QI  E
Sbjct: 478  NWKLQVLQFQVNGEWVSLKGDPSLCCTPVSLKALWKTVEQQGQGMVVE---FGGMQITDE 537

Query: 62   ENETEL-ESILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKEE 121
               T++ + +  ++  ++ VFEEP ELPP+R  EH I +  GA PVNVRP+RY   QKEE
Sbjct: 538  NWSTKVPQGLRALIKSFQGVFEEPHELPPSRGREHIICLEPGAPPVNVRPFRYPQIQKEE 597

Query: 122  LEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEE 181
            +E+ V  M+ +GI+R S SP+SSPVLLVKKKDGSWRFCVDYRALN  TV D +PIP++++
Sbjct: 598  IERQVASMLGAGIVRDSRSPFSSPVLLVKKKDGSWRFCVDYRALNKATVSDSYPIPMIDQ 657

Query: 182  LFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQS 241
            L DELHGA++FSK+DL++GYHQ+ + + D+ KTAFRTH+GHYEFLVMPFGL NAPATFQ+
Sbjct: 658  LLDELHGANIFSKLDLRSGYHQILVKAEDVAKTAFRTHDGHYEFLVMPFGLKNAPATFQA 717

Query: 242  LMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAK 301
            LMN +FR +LR+FVLVFFDDILVYS NLEEH +H+ +VL++L+ +KLFAN KKC F S++
Sbjct: 718  LMNDLFRPHLRRFVLVFFDDILVYSSNLEEHKEHLTMVLQILQNNKLFANPKKCQFGSSE 777

Query: 302  VEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLT 361
            +EYLGH++SG+GV  D EKI+A+ +WP P NV+ +RGFLGLTGYYR+FV  YG  A PLT
Sbjct: 778  IEYLGHIISGQGVSADQEKIKAMIEWPEPRNVKALRGFLGLTGYYRKFVSRYGEKAKPLT 837

Query: 362  QLLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKR 421
             LLK   FKW +E   AF  LK AM +V +L L DF+  F VE+DASG G+GAVLMQ ++
Sbjct: 838  TLLKKDQFKWGKEAAVAFTTLKEAMTSVSVLALADFNELFVVESDASGTGLGAVLMQKQK 897

Query: 422  PIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVI 481
            P+AF+S  L  R R+K VYERELMA+V A+Q+WR YLLGR F+V+TDQKSLKFL EQR I
Sbjct: 898  PLAFFSQALTERQRMKSVYERELMAIVFAIQKWRHYLLGRRFLVRTDQKSLKFLFEQREI 957

Query: 482  QPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSRIPPTAHLNQLTAHTLVDIKVIREEVD 541
              +YQKW+ K+LG++FE+ YKPGLEN+AAD+LSR      L  L+   ++ +  I   VD
Sbjct: 958  NLEYQKWLTKILGFNFEIQYKPGLENRAADALSRKEAVPLLFALSIPAVLQLNEIESAVD 1017

Query: 542  KDEYLKNIIDR-IQKEEEVKNYTLQQGILKYKGRLVIAKNSSLRSAILHTYHDSVLGGHS 601
            +D  LK I D  +Q      +YT+ QG L +KGRLVI   S+    IL  +HD  +GGH 
Sbjct: 1018 QDPVLKKIKDDWLQDPSSQPDYTVVQGRLLWKGRLVIPTGSAWIEVILKEFHDGKVGGHG 1077

Query: 602  GFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKRVWEDIT 661
            G L+T +RI+   FW GM G++R+Y   C  CQR+K   L+PAGLL PL +P+ VWEDI+
Sbjct: 1078 GVLKTQRRISALFFWKGMLGKIREYVAACHVCQRHKYSTLAPAGLLQPLPIPEAVWEDIS 1137

Query: 662  MDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHGFPQSIV 721
            MDFIEGLPKS G  +I VVVDR +KY HF+ LKHP DA  VA +F++E+VRLHGFP+++V
Sbjct: 1138 MDFIEGLPKSAGMELIMVVVDRLTKYGHFVGLKHPLDATTVASVFIQEIVRLHGFPKTLV 1197

Query: 722  SDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGERPKDWV 781
            SDRD++F   FW E+F+L GTKL  STAYHPQ+DGQ+EV NR +E Y RCF  ++P+ W 
Sbjct: 1198 SDRDRLFTGKFWGEMFKLVGTKLCFSTAYHPQSDGQSEVTNRGLETYPRCFTSDKPQTWA 1257

Query: 782  KWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKERDVALGA 841
            ++L WAE  YNT++  S+ +TPFQAVYGR PPAL  Y    T  + L+ +L+ERD  L  
Sbjct: 1258 QFLPWAELSYNTSYHSSIHMTPFQAVYGREPPALRRYENGSTHVADLETKLQERDSMLQL 1317

Query: 842  LKEHLRIAQDKMKSYADKKRRHVEFEEGDQVKANWSGGISAGTTSGSNN----------- 901
            LK+HL  AQ  MK+ AD  RR V F  GD V               SN            
Sbjct: 1318 LKQHLLRAQQMMKARADGHRRDVVFAVGDWVYLKLRPYRQQSLARRSNEKLSARYYGPYE 1377

Query: 902  -PPRVPCVTIK------------------KKAFGESANNEELLPFLTANHEWKAVPQETH 961
               RV  V  K                  K A G      +L P L      +AVP+   
Sbjct: 1378 IEARVGAVAYKLKLPKDSKVHHTFHVSLLKAAIGSPFTPTDLPPQLNIEGILEAVPEAVL 1437

Query: 962  GYRKNE-AGGWEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREGGDHKEME 988
            G R N+  G  E+LI W+GLP H+ +WE     +  FP+  LEDKV L   G D  E++
Sbjct: 1438 GTRINQRTGQEELLIKWKGLPPHDNSWEWKGVIEDQFPNLDLEDKVCLKERGIDTIELD 1493

BLAST of CSPI05G28350 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 151.0 bits (380), Expect = 3.9e-36
Identity = 66/129 (51.16%), Positives = 89/129 (68.99%), Query Frame = 1

Query: 274 HMELVLEVLREHKLFANRKKCCFASAKVEYLGH--VLSGRGVEVDPEKIRAVKQWPVPTN 333
           H+ +VL++  +H+ +ANRKKC F   ++ YLGH  ++SG GV  DP K+ A+  WP P N
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62

Query: 334 VREVRGFLGLTGYYRRFVHHYGSLAAPLTQLLKLGAFKWDEETQEAFEKLKRAMMTVPIL 393
             E+RGFLGLTGYYRRFV +YG +  PLT+LLK  + KW E    AF+ LK A+ T+P+L
Sbjct: 63  TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPVL 122

Query: 394 TLPDFSIPF 401
            LPD  +PF
Sbjct: 123 ALPDLKLPF 131

BLAST of CSPI05G28350 vs. TAIR10
Match: ATMG00850.1 (ATMG00850.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 50.8 bits (120), Expect = 5.5e-06
Identity = 22/39 (56.41%), Positives = 30/39 (76.92%), Query Frame = 1

Query: 117 QKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSW 156
           ++  L+  + EM+ + II+PS SPYSSPVLLV+KKDG W
Sbjct: 41  RRTRLKNWLGEMLEARIIQPSISPYSSPVLLVQKKDGGW 79
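
In these alignments the middle line marks each column: a letter for an identical residue, `+` for a non-identical substitution that scores as positive, and a space otherwise. As a minimal sketch, the counts reported for the short ATMG00850.1 hit above can be recovered from its three alignment rows. The helper `midline_counts` is a hypothetical name introduced for illustration, and the three strings are copied from the alignment with their coordinate prefixes removed.

```python
# Rows copied from the ATMG00850.1 alignment above (coordinates stripped).
QUERY = "QKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSW"
MIDLINE = "++  L+  + EM+ + II+PS SPYSSPVLLV+KKDG W"
SBJCT = "RRTRLKNWLGEMLEARIIQPSISPYSSPVLLVQKKDGGW"


def midline_counts(query, midline, sbjct):
    """Count identities and positives for one gapless alignment block.

    Identities are columns where query and subject carry the same residue.
    Positives are all columns the midline marks (letters and '+'), so
    identities are included in the positives count.
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    positives = sum(1 for m in midline if m != " ")
    return identities, positives, len(query)


identities, positives, length = midline_counts(QUERY, MIDLINE, SBJCT)
print(f"Identity = {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"Positives = {positives}/{length} ({100 * positives / length:.2f}%)")
```

For this block the script reproduces the 22/39 and 30/39 counts given in the summary line. Longer hits span several 60-column blocks, so the rows would need to be concatenated before counting.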

BLAST of CSPI05G28350 vs. NCBI nr
Match: gi|729344250|ref|XP_010541181.1| (PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana])

HSP 1 Score: 1107.4 bits (2863), Expect = 0.0e+00
Identity = 549/1018 (53.93%), Positives = 724/1018 (71.12%), Query Frame = 1

Query: 1    MDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKT 60
            MD+++L + F   +  V + GDP+L  + V+L++L KS  + D  YL++  TLE  Q+  
Sbjct: 749  MDFQDLELKFNQGTSWVTVTGDPTLHSSLVTLRSLIKSVCDGDQSYLVKLETLEE-QVGV 808

Query: 61   EENETELESILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKEE 120
            + N  E   +  VL ++  VFE P ELPP R  EH I+++ G  PV+VRPYRY    KEE
Sbjct: 809  DSNLPE--KLQAVLEEFGPVFEIPTELPPERGREHPINLKEGTGPVSVRPYRYPHAHKEE 868

Query: 121  LEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEE 180
            +EKLV +M+ +GI+RPS SP+SSPVLLVKKKDGSWRFC+DYRALN +TV DKFPIP++++
Sbjct: 869  IEKLVKDMLKAGIVRPSQSPFSSPVLLVKKKDGSWRFCIDYRALNKVTVLDKFPIPMIDQ 928

Query: 181  LFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQS 240
            L DELHGA +FSK+DL++GYHQ+RM + DI KTAFRTH+GHYEFLVMPFGLTNAPATFQ+
Sbjct: 929  LLDELHGARVFSKLDLRSGYHQIRMKTEDIPKTAFRTHDGHYEFLVMPFGLTNAPATFQA 988

Query: 241  LMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAK 300
            LMN IFR YLRKFVLVFFDDILVYS +L++H  H++ VL VL++HKL+AN+KKC F   +
Sbjct: 989  LMNEIFRPYLRKFVLVFFDDILVYSCSLQDHATHLQTVLAVLQKHKLYANKKKCEFGRQQ 1048

Query: 301  VEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLT 360
            ++YLGH++S  GV  DP K  A+++WP P+NV+E+RGFLGLTGYYRRFV +YG++A PLT
Sbjct: 1049 IDYLGHIISQEGVSTDPAKTAAMQKWPTPSNVKELRGFLGLTGYYRRFVQNYGTIARPLT 1108

Query: 361  QLLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKR 420
             LLK   F W E+   AF KLK+AM + P+L LPDF   F VETDASG+GIGAVLMQ  R
Sbjct: 1109 DLLKKDGFNWSEDASSAFRKLKQAMTSAPVLGLPDFREDFVVETDASGFGIGAVLMQKHR 1168

Query: 421  PIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVI 480
            PIAF+S  L+ R+R+KPVYERELMAVV+++QRWR YLLGR+F+V TDQK+LKFLLEQR +
Sbjct: 1169 PIAFFSQALSERERLKPVYERELMAVVLSIQRWRHYLLGRSFLVCTDQKALKFLLEQREV 1228

Query: 481  QPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSRIP------PTAHLNQLTAHTLVDIKV 540
              +YQ+W+ KLLGY F+++Y+PG+ENKAAD LSR+P      PT     +T    + +  
Sbjct: 1229 SMEYQRWLTKLLGYDFQIVYRPGVENKAADGLSRMPHNTILEPTCMGLAITIPRNIQLVE 1288

Query: 541  IREEVDKDEYLKNIIDRIQK-EEEVKNYTLQQGILKYKGRLVIAKNSSLRSAILHTYHDS 600
            + +E+ +D  LK I+ ++++ E +V  Y L QG+L+YK RLV++K+SS    IL  +HDS
Sbjct: 1289 VEKEIGEDSDLKEIVSKLKEGETKVGKYHLLQGMLRYKNRLVVSKHSSFIPTILAEFHDS 1348

Query: 601  VLGGHSGFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKR 660
             +GGHSG LRT KRI     WVGMK +++KY  EC  CQ  K   L+PAGLL PL +P+ 
Sbjct: 1349 KMGGHSGVLRTLKRIQELFHWVGMKADIKKYVAECAVCQSQKYSTLAPAGLLQPLPIPEH 1408

Query: 661  VWEDITMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHG 720
            +WEDI+MDFIEGLP+S G+NV+ VVVDR SKYAHF++LKHPF A +VA++FV+EVVRLHG
Sbjct: 1409 IWEDISMDFIEGLPRSAGYNVVLVVVDRLSKYAHFIALKHPFTAMVVAKVFVQEVVRLHG 1468

Query: 721  FPQSIVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGE 780
            FP+SIVSDRDK+FLS+FW ELFR+AGTKL  STAYHPQTDGQTEV+NR +E YLRC+  +
Sbjct: 1469 FPKSIVSDRDKVFLSNFWSELFRIAGTKLKFSTAYHPQTDGQTEVLNRCLETYLRCYAND 1528

Query: 781  RPKDWVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKER 840
             P+ W+++LSWAE+WYNT+F  +L  TPFQ VYGR PP LL Y E  T N  L++ L+ER
Sbjct: 1529 HPRKWIQFLSWAEFWYNTSFHTALQSTPFQIVYGREPPTLLKYEEGSTSNFELEKALRER 1588

Query: 841  DVALGALKEHLRIAQDKMKSYADKKRRHVEFEEGD--------------------QVKAN 900
            D  +  +K+ L+ AQ +MK  ADK RR +    G+                    ++ A 
Sbjct: 1589 DRMILEIKQKLQAAQQRMKVSADKGRRDLTLTVGEWVYLKIRPYRQNTLAARSNQKLAAR 1648

Query: 901  WSGGISAGTTSG----------SNNPPRVPCVTIKKKAFGESANNEELLPFLTANHEWKA 960
            + G     +  G            N   V  ++  KKA G +    +L   LT + E + 
Sbjct: 1649 YYGPFQIESRMGEVAYKLKLPKGCNIHPVFHISQLKKALGGNIQPNQLPRQLTRDLELQV 1708

Query: 961  VPQETHGYRKNEAGGWEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREGG 982
             P++    R  + G  EVL+ W+ LP HE+TWE  +DF + FP F LEDK++  ++GG
Sbjct: 1709 QPKDIKDSRYTKEGRLEVLVEWQDLPEHESTWEVAEDFNKQFPSFQLEDKLR--QKGG 1761

BLAST of CSPI05G28350 vs. NCBI nr
Match: gi|674230743|gb|KFK24528.1| (hypothetical protein AALP_AAs46225U000100, partial [Arabis alpina])

HSP 1 Score: 1037.7 bits (2682), Expect = 1.3e-299
Identity = 526/1013 (51.92%), Positives = 691/1013 (68.21%), Query Frame = 1

Query: 1    MDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKT 60
            ++W+  TM F  N + V L+GD  L    +SLK L KS  +   G L+E   L+A ++ T
Sbjct: 538  VNWKLQTMKFMLNEELVKLQGDAGLCCAPISLKALWKSLADQGQGVLVEYCGLQA-ELHT 597

Query: 61   EENETEL-ESILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKE 120
            +    +L   +LTVL Q+  VFE+P+ LPP+R  EH I +   A PV+VRP+RY   Q+E
Sbjct: 598  QRRREQLPHQLLTVLEQFARVFEDPQGLPPSRGKEHNIVLEPNAKPVSVRPFRYPQAQRE 657

Query: 121  ELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVE 180
            E+EK V  M+A+G+I+ S SP+SSPVLLVKKKDGSWRFCVDYRALN +T+PD FPIP+++
Sbjct: 658  EVEKQVASMLAAGLIQASGSPFSSPVLLVKKKDGSWRFCVDYRALNKVTIPDSFPIPMID 717

Query: 181  ELFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQ 240
            +L DELHGA++FSK+DLK+GYHQ+ + + D+ KTAFRTH+GHYEFLVMPFGLTNAPATFQ
Sbjct: 718  QLLDELHGATIFSKLDLKSGYHQILVKAEDVAKTAFRTHDGHYEFLVMPFGLTNAPATFQ 777

Query: 241  SLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASA 300
            SLMN +FR YLRKFVLVFFDDILVYS++L+EH QH+ LVLE+L++H+LFAN+KKC F   
Sbjct: 778  SLMNDVFRGYLRKFVLVFFDDILVYSKSLQEHQQHLGLVLELLQQHQLFANKKKCEFGRT 837

Query: 301  KVEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPL 360
            ++EYLGHV+SG+GV  DPEKI+A+  WP P NV+ +RGFLGLTGYYR+FV  YG +A PL
Sbjct: 838  ELEYLGHVVSGKGVAADPEKIQAMVSWPEPQNVKALRGFLGLTGYYRKFVQRYGEIARPL 897

Query: 361  TQLLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSK 420
            T LLK   F+W  E   AF+KLK+AM TVP+L L DF+  F VE+DASG G+GAVLMQS+
Sbjct: 898  TALLKKDQFQWTAEATVAFQKLKKAMSTVPVLALVDFTEQFVVESDASGTGLGAVLMQSQ 957

Query: 421  RPIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRV 480
            RP+A++S  L  R R+K VYERELMA+V A+Q+WR YLLGR F+V+TDQKSLKFLLEQR 
Sbjct: 958  RPLAYFSQALTERQRLKSVYERELMAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLLEQRE 1017

Query: 481  IQPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSRIPPTAHLNQLTAHTLVDIKVIREEV 540
            I  +YQKW+ KLLG+ FE+ YKPGLENKAAD+LSR      L  L+    + ++ I  EV
Sbjct: 1018 INMEYQKWLTKLLGFDFEIQYKPGLENKAADALSRKDMALQLCALSIPAAIQLEQINTEV 1077

Query: 541  DKDEYLKNIIDRI-QKEEEVKNYTLQQGILKYKGRLVIAKNSSLRSAILHTYHDSVLGGH 600
            D D  L+ + + + Q       +++ QG L  KG+LV+   S L + IL  +H+  LGGH
Sbjct: 1078 DNDPDLRKLKEEVLQDAASHSEFSVVQGRLLRKGKLVVPAQSRLVNVILQEFHNGKLGGH 1137

Query: 601  SGFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKRVWEDI 660
             G L+T KR+    +W GM   +R++   C  CQR+K   L+PAGLL PL +P +VWEDI
Sbjct: 1138 GGVLKTQKRVEAIFYWKGMMSRIREFVAACQVCQRHKYSTLAPAGLLQPLPIPDQVWEDI 1197

Query: 661  TMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHGFPQSI 720
            +MDF+EGLPKS GF V+ VVVDR +KYAHF+S+KHP  A  VA +F KEVV+LHGFP++I
Sbjct: 1198 SMDFVEGLPKSEGFEVVMVVVDRLTKYAHFISMKHPVTAVEVALIFTKEVVKLHGFPKTI 1257

Query: 721  VSDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGERPKDW 780
            VSDRD +F   FW E+FRLAGT L  STAYHPQ+DGQTEV NR +E  LRCF  ++P+ W
Sbjct: 1258 VSDRDPLFTGRFWTEMFRLAGTSLCFSTAYHPQSDGQTEVTNRGMETLLRCFSSDKPRCW 1317

Query: 781  VKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKERDVALG 840
            V++L WAE  YNT++  ++ ++PFQAVYGR PP L+ +    T N+ L+ +L+ERD  + 
Sbjct: 1318 VQFLHWAELCYNTSYHTAIKMSPFQAVYGREPPTLIKFETGSTSNADLEGKLRERDAMIH 1377

Query: 841  ALKEHLRIAQDKMKSYADKKRRHVEFEEGD--------------------QVKANWSGGI 900
             +K+H+  AQ  MK++AD  RR V F  GD                    ++ A + G  
Sbjct: 1378 IIKQHILKAQQTMKNHADGHRREVVFSVGDLVFLRLKPYRQKTLAKRVNEKLAARFYGPY 1437

Query: 901  SAGTTSGS-NNPPRVPC---------VTIKKKAFGESANNEELLPFLTANHEWKAVPQET 960
                  G+     ++P          V++ K A G S     L   LT     +  P+  
Sbjct: 1438 EVEERIGAVAYKLKLPVGSKIHNTFHVSLLKPAIGSSLEPATLPTQLTDERVLEVAPEAH 1497

Query: 961  HGYRKNE-AGGWEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREG 981
             G+R +   G  EVLI W+ LP H++TWE      + FP+F LEDKV     G
Sbjct: 1498 MGFRIHPITGQEEVLIKWKELPEHDSTWEWTRVMAEQFPEFDLEDKVLFKAPG 1549

BLAST of CSPI05G28350 vs. NCBI nr
Match: gi|261865347|gb|ACY01928.1| (hypothetical protein [Beta vulgaris])

HSP 1 Score: 1036.6 bits (2679), Expect = 2.9e-299
Identity = 523/1014 (51.58%), Positives = 687/1014 (67.75%), Query Frame = 1

Query: 2    DWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKTE 61
            +W+  T+ +   ++ V L+G+P+L++T+VSLK + ++  +   G+L++   + + +    
Sbjct: 520  NWKTQTLQYKEGNETVTLRGNPALSRTEVSLKAMYRTLRKEGGGFLVDLNQMASHEGLPR 579

Query: 62   ENETELESILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKEEL 121
            E       +  +L+ Y+ VF  P  LPP+R   H I+++ G +PV+VRPYRY   QK+E+
Sbjct: 580  ELPEVPSCLQPLLSSYQQVFNMPLGLPPDRGHVHAINLQHGTNPVSVRPYRYPQSQKDEI 639

Query: 122  EKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEEL 181
            E+L+ +M+A+GII+ S S +SSPVLLVKKKDGSWRFCVDYRALNN+TVPDK+PIP+++EL
Sbjct: 640  EQLIHDMLAAGIIQQSHSAFSSPVLLVKKKDGSWRFCVDYRALNNVTVPDKYPIPIIDEL 699

Query: 182  FDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSL 241
             DELHGA +FSK+DLK+GYHQ++M   D+ KTAFRTHEGHYEFLVMPFGLTNAPATFQ+L
Sbjct: 700  LDELHGACVFSKLDLKSGYHQIKMKPSDVHKTAFRTHEGHYEFLVMPFGLTNAPATFQAL 759

Query: 242  MNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAKV 301
            MN +F+ YLRKFVLVFFDDILVYS +LE+H  H+ +VL +L  + LFAN KKC F   +V
Sbjct: 760  MNEVFKPYLRKFVLVFFDDILVYSTSLEQHMHHLNVVLGLLATNHLFANLKKCEFGKEEV 819

Query: 302  EYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLTQ 361
             YLGH++S +GV +DP K++A+  W +P+ +RE+RGFLGLTGYYRRFV  Y S+A PLT 
Sbjct: 820  AYLGHIISSKGVAMDPSKVQAMMDWSIPSTLRELRGFLGLTGYYRRFVKGYASIAHPLTN 879

Query: 362  LLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKRP 421
             LK  +F W      AFE LKRA+   P+L +P+FS+PF +E DASGYG+GAVL+Q   P
Sbjct: 880  QLKKDSFGWSPAATRAFETLKRALTEAPVLQMPNFSLPFVIEADASGYGLGAVLLQQGHP 939

Query: 422  IAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVIQ 481
            IA++S TL  R R K +YE+ELMAVVMAVQ+W+ +LLGR F++ +DQ+SL+ LL QR I 
Sbjct: 940  IAYFSKTLGERARAKSIYEKELMAVVMAVQKWKHFLLGRHFVIHSDQQSLRHLLNQREIG 999

Query: 482  PQYQKWIAKLLGYSFEVMYKPGLENKAADSLSR-IPPTAHLNQLTAHTLVDIKVIREEVD 541
            P YQKW+ KLLG+ FE+ YKPG  NK AD+LSR  PP A  N LT+      ++I + + 
Sbjct: 1000 PAYQKWVGKLLGFDFEIKYKPGGHNKVADALSRKHPPEAEYNLLTSSHSPHQELIAQAIR 1059

Query: 542  KDEYLKNIIDRIQK-EEEVKNYTLQQGILKYKGRLVIAKNSSLRSAILHTYHDSVLGGHS 601
            +D  L++++  +      ++ +T++ G+LKY GRLVI KN  L + +L  YH S +GGHS
Sbjct: 1060 QDADLQHLMAEVTAGRTPLQGFTVEHGLLKYNGRLVIPKNVPLTTTLLEEYHSSPMGGHS 1119

Query: 602  GFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKRVWEDIT 661
            G  +TYKR+ GE +W GMK +V  + + C  CQ+ KT  LSPAGLL PL +P  +WEDI+
Sbjct: 1120 GIFKTYKRLAGEWYWKGMKKDVTTFVQNCQICQQFKTSTLSPAGLLQPLPIPLAIWEDIS 1179

Query: 662  MDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHGFPQSIV 721
            MDF+EGLPKS G++ I VVVDR SKYAHF++LKHPF A  VA +F+KE+V+LHGFP +IV
Sbjct: 1180 MDFVEGLPKSQGWDTILVVVDRLSKYAHFITLKHPFTAPTVAAVFIKEIVKLHGFPSTIV 1239

Query: 722  SDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGERPKDWV 781
            SDRDK+F+S FWKELF+L GT L+RSTAYHPQ+DGQTEVVN+S+E YLRCFC  RPK W 
Sbjct: 1240 SDRDKVFMSLFWKELFKLQGTLLHRSTAYHPQSDGQTEVVNKSLEAYLRCFCNGRPKAWA 1299

Query: 782  KWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKERDVALGA 841
            +W+SWAEYWYNT+   S   TPF+ VYGR  P L  + +  T   +L+EQL +RD  L  
Sbjct: 1300 QWISWAEYWYNTSTHSSSHFTPFKIVYGRDSPPLFRFEKGSTAIFSLEEQLLDRDATLDE 1359

Query: 842  LKEHLRIAQDKMKSYADKKRRHVEFEEGDQV--KANWSGGISAGTTSGSNNPPRV--PCV 901
            LK HL  AQ+ MK   DK RR V FE G  V  K       S          PR   P  
Sbjct: 1360 LKFHLLEAQNSMKIQEDKHRRAVHFEPGAMVYLKIQPYRHQSLAKKRNEKLAPRFYGPFS 1419

Query: 902  TIK--------------------------KKAFGESANNEELLPFLTANHEWKAVPQETH 961
             +K                          KKA G   ++  + P LT +    A P+   
Sbjct: 1420 VLKRIGQVAYQLQLPLGAKLHPVFHISQLKKAVGSLQSSPTIPPQLTNDLVLDAQPESLL 1479

Query: 962  GYR---KNEAGGWEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREG 981
              R   +  A   EVLI W  LP  EATWE    F   FPDFHLEDKV L+ EG
Sbjct: 1480 NIRSHPQKPAEVTEVLIKWLNLPAFEATWEDAALFNARFPDFHLEDKV-LNWEG 1532

BLAST of CSPI05G28350 vs. NCBI nr
Match: gi|923869199|ref|XP_013709039.1| (PREDICTED: uncharacterized protein LOC106412673 [Brassica napus])

HSP 1 Score: 1036.2 bits (2678), Expect = 3.8e-299
Identity = 529/1019 (51.91%), Positives = 692/1019 (67.91%), Query Frame = 1

Query: 1    MDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKT 60
            +DW     SF +   +VVL G+P+L  + VSLK L+      + G+ IE +++     K 
Sbjct: 494  VDWEKNEWSFDYEGCQVVLTGEPALHSSNVSLKTLSSEVTMQNEGWEIELKSMGP---KG 553

Query: 61   EENETELESILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQKEE 120
            E  E   + I  +L QYE VF++P  LPP R+ EH I ++    PV+VRPYRY    KE 
Sbjct: 554  EHEEVVPQLIADMLLQYEAVFQKPTGLPPLRDREHAIVLQDKTKPVSVRPYRYPHAHKEI 613

Query: 121  LEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVVEE 180
            +EKLV EM++ G+IRPS SP+SSPVLLVKKKD S RFCVDYRALN  TV DKFPIP++ +
Sbjct: 614  MEKLVQEMLSEGLIRPSHSPFSSPVLLVKKKDNSHRFCVDYRALNRATVQDKFPIPMIYQ 673

Query: 181  LFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQS 240
            L DELHGA  F+K+DL++GYHQ+RM   DI+KTAFRTH+GH+EFLVMPFGLTNAPATFQ+
Sbjct: 674  LLDELHGARYFTKLDLRSGYHQIRMREEDIDKTAFRTHDGHFEFLVMPFGLTNAPATFQA 733

Query: 241  LMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFASAK 300
            LMN +F+ +LRKFVLVFFDDIL+YS NLE+H +H+ LVL+V  E +LFAN+KKC FA  K
Sbjct: 734  LMNEVFKKFLRKFVLVFFDDILIYSDNLEDHKKHVALVLDVFVEMRLFANKKKCSFAQTK 793

Query: 301  VEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAPLT 360
            VEYLGH++S  GV  D +KI AV++WP+P  V+E+RGFLGLTGYYRRFV HYGS+A  LT
Sbjct: 794  VEYLGHIISREGVATDSKKIEAVQRWPIPRTVKELRGFLGLTGYYRRFVQHYGSIAKSLT 853

Query: 361  QLLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQSKR 420
            +LLK   F W +  QEAF+KLK AM+T P+L LPDF+ PF VE+DASG+G+GAVLMQ+  
Sbjct: 854  ELLKKEQFLWTQLAQEAFDKLKIAMVTAPVLALPDFTKPFIVESDASGFGLGAVLMQNNH 913

Query: 421  PIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQRVI 480
            PIA++SH L  R+++KP+YERELMA+VM++Q+WR YLLGR F+V+TDQ+SLK+LLEQR I
Sbjct: 914  PIAYFSHGLTPREQLKPIYERELMAIVMSIQKWRHYLLGRRFVVRTDQQSLKYLLEQREI 973

Query: 481  QPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSRIPPTA------HLNQLTAHTLVDIKV 540
               YQ+W+ ++LGY F++ YK G ENK AD LSRI  T        L  LT    + ++ 
Sbjct: 974  TLDYQRWLTRILGYEFDIEYKVGSENKVADGLSRIDHTVIDEAGLTLLALTVPVTLQMQD 1033

Query: 541  IREEVDKDEYLKNIIDRIQKEEEVKN-YTLQQGILKYKGRLVIAKNSSLRSAILHTYHDS 600
            +  E+D+DE ++ +I ++ + E VK  + L  G L YK +LVI ++S+    IL   HD+
Sbjct: 1034 LYREIDEDEEIQGMIAKLLQGEGVKQGFCLVHGRLFYKQKLVIPRSSNQIPVILQECHDT 1093

Query: 601  VLGGHSGFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKR 660
            ++GGH+G LRT +R+    +W  M+  V++Y   C  CQ +K   LSPAGLL P+E+P R
Sbjct: 1094 IMGGHAGVLRTLQRVKAMFYWPKMRSVVQEYVAACSVCQTHKYSTLSPAGLLQPIELPVR 1153

Query: 661  VWEDITMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHG 720
            +WEDI MDF+EGLP S G NVI VVVDR SKY HF++LKHPF A  VA+ FVKEVVRLHG
Sbjct: 1154 IWEDIAMDFVEGLPVSQGVNVILVVVDRLSKYGHFITLKHPFTAVEVAQKFVKEVVRLHG 1213

Query: 721  FPQSIVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGE 780
            FP+SI+SDRDKIFLS FWKE FR++GT+L  STA+HPQ+DGQTEV+NR +E YLRCF   
Sbjct: 1214 FPKSIISDRDKIFLSKFWKECFRVSGTRLRFSTAFHPQSDGQTEVLNRCLETYLRCFAST 1273

Query: 781  RPKDWVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKER 840
             PK W K+LSWAE WYNT +  +L  TPF+ VYGR PP L+ Y +  T N  +D  LKER
Sbjct: 1274 HPKSWSKYLSWAELWYNTAYHTALKCTPFKLVYGRDPPTLMPYEDGATQNFEVDMMLKER 1333

Query: 841  DVALGALKEHLRIAQDKMKSYADKKRRHVEFEEGDQV--------------------KAN 900
            ++ L ++K++L  AQ  MKS ADK RR +EF  G++V                     A 
Sbjct: 1334 ELVLTSIKDNLTRAQAIMKSNADKHRRDLEFRVGEKVYLKLRPYRQQSVSRRLFQKLAAR 1393

Query: 901  WSGGISAGTTSG----------SNNPPRVPCVTIKKKAFGESANNEELLPFLTANHEWKA 960
            + G        G          S+    V  ++  K   G S     L P L+ + +   
Sbjct: 1394 YYGPFEVVARIGKVAYRLALPVSSKIHPVFHISQLKPVVGSSEVVIPLPPILSDSADLLI 1453

Query: 961  VPQETHGYRKNEAGGWEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKLDREGGD 983
             P+     R +E G  E+L+ W+ LP HE++W    + +Q FP F LEDK+ L   G D
Sbjct: 1454 EPEAVLDRRYDEQGFLEILVKWKHLPDHESSWLRVGELKQQFPSFSLEDKLNLGEGGID 1509

BLAST of CSPI05G28350 vs. NCBI nr
Match: gi|731338584|ref|XP_010680400.1| (PREDICTED: transposon Tf2-1 polyprotein isoform X1 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1029.6 bits (2661), Expect = 3.5e-297
Identity = 516/1013 (50.94%), Positives = 690/1013 (68.11%), Query Frame = 1

Query: 2    DWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKNLTKSWVETDTGYLIECRTLEACQIKTE 61
            +W++  M F    ++V L+GDPSL +T++SLK + ++      G L+E   +E  +    
Sbjct: 533  NWKSQLMKFKIGREEVTLQGDPSLDRTRISLKAMLRALRIEGQGVLVEMNHIEREKEPPG 592

Query: 62   ENETELE---SILTVLTQYEDVFEEPKELPPNRNIEHQIHIRGGADPVNVRPYRYAFQQK 121
            + + E+E    +  +L QY  VF  P  LPP+R  EH I ++ G++PV+VRPYRY   QK
Sbjct: 593  KWDIEVEVPRPLQPLLNQYSQVFNMPSGLPPSRGREHSITLKEGSNPVSVRPYRYPHVQK 652

Query: 122  EELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITVPDKFPIPVV 181
             E+E+LV +M+A+GII+PSTSP+SSPVLLVKKKDGSWRFCVDYRALN  TVPDK+PIPV+
Sbjct: 653  GEIERLVKDMLAAGIIQPSTSPFSSPVLLVKKKDGSWRFCVDYRALNKETVPDKYPIPVI 712

Query: 182  EELFDELHGASLFSKIDLKAGYHQLRMCSRDIEKTAFRTHEGHYEFLVMPFGLTNAPATF 241
            +EL DEL+G+ +FSK+DLK+GYHQ+R+   DI KTAFRTHEGHYEFLVMPFGLTNAPATF
Sbjct: 713  DELLDELYGSVVFSKLDLKSGYHQIRVRKEDIHKTAFRTHEGHYEFLVMPFGLTNAPATF 772

Query: 242  QSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHMELVLEVLREHKLFANRKKCCFAS 301
            QSLMN +FR +LRKFVLVFFDDILVYS + E H  H+E VL +L E+ L+AN +KC F  
Sbjct: 773  QSLMNEVFRPFLRKFVLVFFDDILVYSPDEETHFHHLEQVLHILAENSLYANLEKCEFGR 832

Query: 302  AKVEYLGHVLSGRGVEVDPEKIRAVKQWPVPTNVREVRGFLGLTGYYRRFVHHYGSLAAP 361
             +V YLGHV+S +GV  D +KI+A+ +WP+P  +RE+RGFLGLTGYYR+F+ +Y  +A+P
Sbjct: 833  QQVAYLGHVISAQGVAADMDKIKAMVEWPLPKTIRELRGFLGLTGYYRKFIANYAKVASP 892

Query: 362  LTQLLKLGAFKWDEETQEAFEKLKRAMMTVPILTLPDFSIPFEVETDASGYGIGAVLMQS 421
            LT  L+  ++ W     +AFE LK+AM+  P+L +PDFS  F +E DASG+G+GAVLMQ+
Sbjct: 893  LTDQLRKDSYAWTPAATQAFEALKKAMVAAPVLAMPDFSQQFVIEADASGFGLGAVLMQN 952

Query: 422  KRPIAFYSHTLALRDRVKPVYERELMAVVMAVQRWRPYLLGRTFIVKTDQKSLKFLLEQR 481
             RPIAFYSH L  R R+K +YE+ELMA+VMAVQ+WR YLLGR F+++TDQKSLKF++EQR
Sbjct: 953  NRPIAFYSHILGPRGRLKSIYEKELMAIVMAVQKWRHYLLGRRFVIRTDQKSLKFIMEQR 1012

Query: 482  VIQPQYQKWIAKLLGYSFEVMYKPGLENKAADSLSRIPPT-AHLNQLTAHTLVDIKVIRE 541
             +  +YQ+W++KL+G+ FE+ YKPG+ N+ AD+LSR  P    L  L + +   ++ ++ 
Sbjct: 1013 EVGAEYQRWVSKLMGFEFEIHYKPGIANRVADALSRQNPAQTELKALLSSSGPSLEAVQN 1072

Query: 542  EVDKDEYLKNIIDRIQKE-EEVKNYTLQQGILKYKGRLVIAKNSSLRSAILHTYHDSVLG 601
            ++  D Y++ I+  +Q +   ++ ++++ G++ YKGR+V+   S L   +L  YHDS  G
Sbjct: 1073 QLKADPYIQQIMAELQGDGPPMEGFSVENGLVMYKGRIVLPPKSPLTHELLKFYHDSPNG 1132

Query: 602  GHSGFLRTYKRITGELFWVGMKGEVRKYCEECMTCQRNKTLALSPAGLLTPLEVPKRVWE 661
            GHSG L+TY R+  E +WVGM+  V +Y ++C  CQ+NKT   +PAGLL PL  P +VWE
Sbjct: 1133 GHSGDLKTYLRMASEWYWVGMRKNVAQYVKDCQICQQNKTSTQNPAGLLQPLPPPNQVWE 1192

Query: 662  DITMDFIEGLPKSMGFNVIFVVVDRFSKYAHFLSLKHPFDAKMVAELFVKEVVRLHGFPQ 721
            DITMDF+EGLP S G + I VVVDRF+K+AHFL LKHPF A  VA  F+KE+VRLHGFP 
Sbjct: 1193 DITMDFVEGLPPSRGVDTILVVVDRFTKFAHFLGLKHPFTAATVAGTFIKEIVRLHGFPA 1252

Query: 722  SIVSDRDKIFLSHFWKELFRLAGTKLNRSTAYHPQTDGQTEVVNRSVEIYLRCFCGERPK 781
            SI+SDRD++F+S FWKELFRL GTKL RSTAYHPQTDGQ+E VN+++E YLRCF   +P+
Sbjct: 1253 SIISDRDRVFMSLFWKELFRLQGTKLKRSTAYHPQTDGQSENVNKALETYLRCFVNGQPR 1312

Query: 782  DWVKWLSWAEYWYNTTFQKSLGVTPFQAVYGRTPPALLYYGERETPNSTLDEQLKERDVA 841
             W  WL W E+WYNT+   S  +TPF+A+YGR PP L+  G  +TP  +LD  L+ERD  
Sbjct: 1313 KWAGWLPWVEFWYNTSPHVSTKMTPFKALYGRDPPPLVRTGHNQTPVDSLDSYLQERDAV 1372

Query: 842  LGALKEHLRIAQDKMKSYADKKRRHVEFEEG--------------------DQVKANWSG 901
            L  L+ +L  AQ KMK +ADK+RR +  E G                    +++ A + G
Sbjct: 1373 LDDLRVNLLRAQQKMKFWADKRRRDILLEVGSFVYLKLQPYRQKSLARRPYEKLAARYYG 1432

Query: 902  GISAGTTSGS----------NNPPRVPCVTIKKKAFGESANNEELLPFLTANHEWKAVPQ 961
                    G+          +    V  V+  K A G      +L   LT + E    P+
Sbjct: 1433 PYQVLERIGAVAYRLDLPATSKIHPVFHVSQLKPAAGNIHQPSQLPEQLTQDLELIVEPE 1492

Query: 962  ETHGYRKNEAGG---WEVLINWEGLPHHEATWEGYDDFQQSFPDFHLEDKVKL 977
                 R    G     EVLI W+ LP  EATWE      Q FP FHLEDKV L
Sbjct: 1493 ALLDVRYGAPGHKKPLEVLIKWKHLPETEATWEDLTAMVQRFPTFHLEDKVNL 1545
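
The percentages in the HSP header for the Beta vulgaris match follow directly from the counts over the aligned length. A minimal sketch of that arithmetic, assuming the percentages are plain ratios rounded to two decimals (the helper name `percent` is hypothetical, not part of any BLAST tool):

```python
def percent(matches: int, aligned: int) -> float:
    """Return matches / aligned as a percentage rounded to two decimals."""
    return round(100 * matches / aligned, 2)

# Counts copied from the HSP header: 516 identical and 690 positive
# positions over an aligned length of 1013 columns (gaps included).
identity = percent(516, 1013)
positives = percent(690, 1013)

print(f"Identity  = 516/1013 ({identity}%)")
print(f"Positives = 690/1013 ({positives}%)")
```

Both values agree with the header as reported (50.94% and 68.11%).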

The following BLAST results are available for this feature:
Match Name         E-value    Identity  Description
YI31B_YEAST        4.7e-132   34.42     Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
YG31B_YEAST        1.0e-131   34.30     Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
TF26_SCHPO         5.0e-126   32.34     Transposon Tf2-6 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF25_SCHPO         5.0e-126   32.34     Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
TF24_SCHPO         5.0e-126   32.34     Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name         E-value    Identity  Description
A0A087G3S6_ARAAL   9.0e-300   51.92     Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs46225U000100 PE=4...
E2DMZ5_BETVU       2.0e-299   51.58     Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1
A0A087GEK8_ARAAL   2.1e-296   49.80     Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1
Q9SQW9_ARATH       3.6e-296   50.78     Putative retroelement pol polyprotein OS=Arabidopsis thaliana GN=F23H6.1 PE=4 SV...
A0A087HNF3_ARAAL   1.3e-293   51.62     Uncharacterized protein OS=Arabis alpina GN=AALP_AA1G155400 PE=4 SV=1

Match Name         E-value    Identity  Description
ATMG00860.1        3.9e-36    51.16     ATMG00860.1 DNA/RNA polymerases superfamily protein
ATMG00850.1        5.5e-06    56.41     ATMG00850.1 DNA/RNA polymerases superfamily protein

Match Name                          E-value    Identity  Description
gi|729344250|ref|XP_010541181.1|    0.0e+00    53.93     PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana]
gi|674230743|gb|KFK24528.1|         1.3e-299   51.92     hypothetical protein AALP_AAs46225U000100, partial [Arabis alpina]
gi|261865347|gb|ACY01928.1|         2.9e-299   51.58     hypothetical protein [Beta vulgaris]
gi|923869199|ref|XP_013709039.1|    3.8e-299   51.91     PREDICTED: uncharacterized protein LOC106412673 [Brassica napus]
gi|731338584|ref|XP_010680400.1|    3.5e-297   50.94     PREDICTED: transposon Tf2-1 polyprotein isoform X1 [Beta vulgaris subsp. vulgari...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR000477   RT_dom
IPR001584   Integrase_cat-core
IPR012337   RNaseH-like_sf
IPR016197   Chromo-like_dom_sf
IPR023780   Chromo_domain
Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0044238 primary metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CSPI05G28350.1   CSPI05G28350.1   mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description               Source   Source Term        Source Description   Alignment
IPR000477  Reverse transcriptase domain  PFAM     PF00078            RVT_1                coord: 148..307; score: 2.6
IPR000477  Reverse transcriptase domain  PROFILE  PS50878            RT_POL               coord: 129..308; score: 14
IPR001584  Integrase, catalytic core     PFAM     PF00665            rve                  coord: 656..763; score: 1.1
IPR001584  Integrase, catalytic core     PROFILE  PS50994            INTEGRASE            coord: 647..809; score: 1
IPR012337  Ribonuclease H-like domain    GENE3D   G3DSA:3.30.420.10                       coord: 658..810; score: 4.3
IPR012337  Ribonuclease H-like domain    unknown  SSF53098           Ribonuclease H-like  coord: 648..804; score: 5.92
IPR016197  Chromo domain-like            unknown  SSF54160           Chromo domain-like   coord: 916..968; score: 1.4
IPR023780  Chromo domain                 PFAM     PF00385            Chromo               coord: 937..967; score: 7.
None       No IPR available              GENE3D   G3DSA:2.40.50.40                        coord: 933..961; score: 6.
None       No IPR available              GENE3D   G3DSA:3.10.10.10                        coord: 97..227; score: 3.1
None       No IPR available              GENE3D   G3DSA:3.30.70.270                       coord: 228..309; score: 5.9
None       No IPR available              PANTHER  PTHR24559          FAMILY NOT NAMED     coord: 141..968; score:
None       No IPR available              PANTHER  PTHR24559:SF186    SUBFAMILY NOT NAMED  coord: 141..968; score:
None       No IPR available              unknown  SSF56672           DNA/RNA polymerases  coord: 73..501; score: 6.49E
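
The PFAM rows above give the positions of the three domain hits along the predicted protein. A small sketch of how their lengths and N-to-C order can be read off the `coord:` values, assuming the coordinates are 1-based, inclusive amino-acid positions (that convention is an assumption here, and the names `domains` and `span_length` are illustrative only):

```python
# (source description, start, end) copied from the PFAM rows of the table.
# Assumption: coordinates are 1-based and inclusive.
domains = [
    ("RVT_1", 148, 307),   # reverse transcriptase domain (PF00078)
    ("rve", 656, 763),     # integrase catalytic core (PF00665)
    ("Chromo", 937, 967),  # chromo domain (PF00385)
]

def span_length(start: int, end: int) -> int:
    """Length of an inclusive interval [start, end]."""
    return end - start + 1

lengths = {name: span_length(start, end) for name, start, end in domains}
order = [name for name, start, _ in sorted(domains, key=lambda d: d[1])]

print(lengths)
print(" -> ".join(order))
```

Under that assumption the hits span 160, 108 and 31 residues, in the order reverse transcriptase, integrase, chromo domain.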

The following gene(s) are orthologous to this gene:
Gene          Orthologue     Organism                      Block
CSPI05G28350  ClCG01G014040  Watermelon (Charleston Gray)  cpiwcgB384
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism                      Block
CSPI05G28350  Wild cucumber (PI 183967)     cpicpiB036
CSPI05G28350  Cucumber (Gy14) v1            cgycpiB110
CSPI05G28350  Cucumber (Gy14) v1            cgycpiB236
CSPI05G28350  Cucurbita maxima (Rimu)       cmacpiB304
CSPI05G28350  Cucurbita maxima (Rimu)       cmacpiB730
CSPI05G28350  Cucurbita moschata (Rifu)     cmocpiB216
CSPI05G28350  Cucurbita moschata (Rifu)     cmocpiB296
CSPI05G28350  Cucurbita moschata (Rifu)     cmocpiB722
CSPI05G28350  Cucumber (Chinese Long) v2    cpicuB219
CSPI05G28350  Cucumber (Chinese Long) v2    cpicuB232
CSPI05G28350  Cucumber (Chinese Long) v2    cpicuB239
CSPI05G28350  Melon (DHL92) v3.5.1          cpimeB349
CSPI05G28350  Melon (DHL92) v3.5.1          cpimeB382
CSPI05G28350  Watermelon (Charleston Gray)  cpiwcgB356
CSPI05G28350  Watermelon (Charleston Gray)  cpiwcgB360
CSPI05G28350  Watermelon (97103) v1         cpiwmB370
CSPI05G28350  Cucurbita pepo (Zucchini)     cpecpiB177
CSPI05G28350  Cucurbita pepo (Zucchini)     cpecpiB414
CSPI05G28350  Bottle gourd (USVL1VR-Ls)     cpilsiB355
CSPI05G28350  Bottle gourd (USVL1VR-Ls)     cpilsiB352
CSPI05G28350  Melon (DHL92) v3.6.1          cpimedB339
CSPI05G28350  Melon (DHL92) v3.6.1          cpimedB377
CSPI05G28350  Cucumber (Gy14) v2            cgybcpiB035
CSPI05G28350  Cucumber (Gy14) v2            cgybcpiB137
CSPI05G28350  Cucumber (Gy14) v2            cgybcpiB225
CSPI05G28350  Silver-seed gourd             carcpiB0297
CSPI05G28350  Silver-seed gourd             carcpiB0722
CSPI05G28350  Cucumber (Chinese Long) v3    cpicucB259
CSPI05G28350  Cucumber (Chinese Long) v3    cpicucB277
CSPI05G28350  Cucumber (Chinese Long) v3    cpicucB284
CSPI05G28350  Watermelon (97103) v2         cpiwmbB357
CSPI05G28350  Watermelon (97103) v2         cpiwmbB429
CSPI05G28350  Wax gourd                     cpiwgoB473