CSPI03G20180 (gene) Wild cucumber (PI 183967)

Name: CSPI03G20180
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Polyprotein
Location: Chr3 : 15974922 .. 15977876 (-)
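
The location string above can be split into chromosome, start, end, and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention); `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genomic interval (hypothetical helper type)."""
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


# Matches strings shaped like "Chr3 : 15974922 .. 15977876 (-)".
_LOCATION = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Parse a 'chrom : start .. end (strand)' string into a Location."""
    match = _LOCATION.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


loc = parse_location("Chr3 : 15974922 .. 15977876 (-)")
print(loc.chrom, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the interval spans 2955 bases; with a half-open convention it would be one base shorter.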
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACCCAAATCCATGGCTATTCCGTGCAGATAGGTACTTCTAAATACACAAACTGACGGATTCTGAAAAACTCACGGTCGCTACAATTAGTTTTGAAGGCCCCGCACTCAACTGGTATCGATCGTAGGAGGAGAGAGACAAATTTACTTGTTGGTTAAACTTAAAAGAACGATTACTAATCCGATTTCGATCGTCCCGCGAAGGCTCCCTATATGGTCGGTTCTTACGTATTCAGCAGGAATCAAGTGTAGAGGAATACCGGAATCTATTCGATAAGTGGGTGGCACCGTTATCGGACATTCCGGAAAAGATTGTGGAAGAGACGTTCATGGGAGGGCTGTTACCATGGATTAAGGTGGAGATGGAATTCTGCAATTCCGTGGGATTAGCCGAGATGATGAGATACGCGCAAATGGTGGAACAATGGGAGATCCTGAGGAGAGAAACAAATTTCCCAGTTATTCTGGAGCGAAAGTTCCAAATTACACCTATAATACGGCCAAAACAAATTCAGTTATGAAAGAACAAGGGAACAAGGAGAACACAATATTTCCAATACAAACAATCACGTTGAGGGGATCACCGGCAAAGGAGATTAAGAAAGACGGACCATCCAAATGGCTTTCCGACGCAGAATTCCAGGCCAAGGAGAAAGGACTCTGTTTCAAATGCGATGAGAAGTATTACTCCGGGCACAAATGCAGGGTGAAGGAAATACGTGAGTTACATATGTCCGTGGTAAGAGCGGACGACGTGGAGGAAGAAATTATTGAAGAAGACGAGTATGACTTGAAGGAACTGAAAACGATGGAGTTGCAGAATGACCTTGGGGAAGTAAAGGAGTTATGTATTAACTCGGTAGTGGGATTGACGAATCCAGGTACCATGAAGATAAGGGGAAAAGTTCAAAGCAAGGAGGTTGTCGTGTTAGTGGATTGCGGAGCCACCCACAATTTCATATCCGACAGGCTAGTGATGACACTGAAATTACCCACAAAGGAGACTTCTAACTATGGGGTAATACTGGGATCAGGAATAGCCATCAAAGGCAAGGGAGTGTGTGAAAAAGTAGAGTTGGATCTCAATGGATGGACAGTCCTTGAAAACTTTCTACCGCTGGAACTGGGAGGGGTAGACGTGATACTTGGGATGCAATGGTTACACTCATTGGGAGTGACTGAGATGGACTGGAAGAACTTAACCATGTCATTTTTCCATGACAACAAAAAAATAGTGATAAAAGGGGATCCAAGCTTAACCAAAACTCAAGTGAGCTTGAAGAATTTAACTAAATCGTGGACGGTGTCAGACATGGGGTACTTGATTGAGTGCAGAACCCTAGAAGCCCACATAGCCGAGATAGAACCAGAGAACAATAACGTACCTGAGAGCATACTGACAGCCCTGAATCAGTATAATGATGTTTTCGATTGGCCCAAAGAATTGCCTCCAAGAAGGGATATCGAACATCATATACATATAAAGGGAGGGGCAGAACCGGTGAATGTCCGGCCCCATCGGTATGCGTTTCAGCAGAAGGAAGAAATGGAAAAACTGGTGGACGAAATGCTAACCTCAGGAATTATCCGCCCCAGCACAAGCCCCTACTCAAGCACCGTACTATTGGTCAAAAAGAAGGACGGAAGCTGGCGATTCTACGTGGACTACAGGGCACTCAACAACATAACTATTCCAGATAAGTTTCCTATCCCGGTTGTGGAAGAGCTGTTTGACGAGCTAAATGGTGCAAATCTATTCTCTAAAATTGACTTGAAAGCGGGATATCATCAACTTAGAATGTGTAGTCAAGATATAGAGAAGACGGCCTTTAGAACTCATGAAGGACATTATGAGTTTTTGGTGATGCCGTTTGGACTCACAAACGCACCAGCAACTTTCCAATCACTAATGAACTCGATTTTTAGATCGTATTTGAGGAAGTTCGTCTTGGTATTCTTTGACGATATACTGGTTTATAGTAGAAACTTAGAGGAACATT
GCCAGCACATTGAGCTAGTTCTGGAAGTATTGAGGAGACATAAGCTGTTTGCTAATCGAAAGAAATGCAGTTTTGCGTACTCAAAGGTGGAGTATTTAGGACACATATTGTCGGGAAAAGGAGTAGAAGTCGACCTGAAAAAATCAGAGCAATCAAACAATGGCCAACTCCAACAAATGTCCAGGAAGTTAGAGGGTTTCTGGGGTTGACTGGTTACTACCGCCATTTTGTACAGCACTATGGGTCCATAGCAGCACTTCTAACTCAACTACTTAAGCTGGGATCATTTAAATGGAATGAGGGAGCACAAGAAGCGTTTGAAAAGCTTCAACGAGCAATGATGACCCTGCCTATACTAGCTCTTCCAGATTTTAACGCACCATTCAAAGTAGAGACAGATGCATTAGGCTATGGGGTAGGGACAGTGCTAATGCAGAACAAGAGACCAATTGCTTTTTATAGCCATACACTAGCCTTGAGAGACCAAGCCAAACCAGTTTACGAGAGGGAGTTAATGGCAGTAGTGTTAGCAGTCCAACGTTGGCGACCCTATTTGTTAGGAAGAACCTTCATAGTTAAGACAGATCAGCGATCACTTAAGTTCCTGCTGGAACAGAGAGTCATACAACCGCAATATCAGAAGTGGATTGCAAAATTGTTGGGTTATTCATTTGAGGTGGTGTATAAACCGGGCTTGGAAAACAAGGCAGCAGATGCCCTTTCACGAGTACCACCAACTGTCCATCTTAACCAACTAACAGCCCCCACCTTGGTAGACATAAAGGTAATCGGAGAGGAGGTTGACAAGGATGACTACTTGAAAGATATAATCAACCAGATGGGAGGAGGAGGTAAAGAATTACACTATGCAACAAGGAATACTGAGATACAAAGGGAGATTAGTGATTGCGAAGAACTCTTCATTGATATCTGCCATTATGCACACATATCATGA

mRNA sequence

ATGACCCAAATCCATGGCTATTCCGTGCAGATAGGCTCCCTATATGGTCGGTTCTTACGTATTCAGCAGGAATCAAGTGTAGAGGAATACCGGAATCTATTCGATAAGTGGGTGGCACCGTTATCGGACATTCCGGAAAAGATTGTGGAAGAGACGTTCATGGGAGGGCTCCGAGATGATGAGATACGCGCAAATGGTGGAACAATGGGAGATCCTGAGGAGAGAAACAAATTTCCCAGTTATTCTGGAGCGAAAGTTCCAAATTACACCTATAATACGGCCAAAACAAATTCAGTTATGAAAGAACAAGGGAACAAGGAGAACACAATATTTCCAATACAAACAATCACGTTGAGGGGATCACCGGCAAAGGAGATTAAGAAAGACGGACCATCCAAATGGCTTTCCGACGCAGAATTCCAGGCCAAGGAGAAAGGACTCTGTTTCAAATGCGATGAGAAGTATTACTCCGGGCACAAATGCAGGGTGAAGGAAATACGTGAGTTACATATGTCCGTGGTAAGAGCGGACGACGTGGAGGAAGAAATTATTGAAGAAGACGAGTATGACTTGAAGGAACTGAAAACGATGGAGTTGCAGAATGACCTTGGGGAAGTAAAGGAGTTATGTATTAACTCGGTAGTGGGATTGACGAATCCAGGTACCATGAAGATAAGGGGAAAAGTTCAAAGCAAGGAGGTTGTCGTGTTAGTGGATTGCGGAGCCACCCACAATTTCATATCCGACAGGCTAGTGATGACACTGAAATTACCCACAAAGGAGACTTCTAACTATGGGGTAATACTGGGATCAGGAATAGCCATCAAAGGCAAGGGAGTGTGTGAAAAAGTAGAGTTGGATCTCAATGGATGGACAGTCCTTGAAAACTTTCTACCGCTGGAACTGGGAGGGGTAGACGTGATACTTGGGATGCAATGGTTACACTCATTGGGAGTGACTGAGATGGACTGGAAGAACTTAACCATGTCATTTTTCCATGACAACAAAAAAATAGTGATAAAAGGGGATCCAAGCTTAACCAAAACTCAAGTGAGCTTGAAGAATTTAACTAAATCGTGGACGGTGTCAGACATGGGGTACTTGATTGAGTGCAGAACCCTAGAAGCCCACATAGCCGAGATAGAACCAGAGAACAATAACGTACCTGAGAGCATACTGACAGCCCTGAATCAGTATAATGATGTTTTCGATTGGCCCAAAGAATTGCCTCCAAGAAGGGATATCGAACATCATATACATATAAAGGGAGGGGCAGAACCGGTGAATGTCCGGCCCCATCGGTATGCGTTTCAGCAGAAGGAAGAAATGGAAAAACTGGTGGACGAAATGCTAACCTCAGGAATTATCCGCCCCAGCACAAGCCCCTACTCAAGCACCGTACTATTGGTCAAAAAGAAGGACGGAAGCTGGCGATTCTACGTGGACTACAGGGCACTCAACAACATAACTATTCCAGATAAGTTTCCTATCCCGGTTGTGGAAGAGCTGTTTGACGAGCTAAATGGTGCAAATCTATTCTCTAAAATTGACTTGAAAGCGGGATATCATCAACTTAGAATGTGTAGTCAAGATATAGAGAAGACGGCCTTTAGAACTCATGAAGGACATTATGAGTTTTTGGTGATGCCGTTTGGACTCACAAACGCACCAGCAACTTTCCAATCACTAATGAACTCGATTTTTAGATCGTATTTGAGGAAGTTCGTCTTGGTATTCTTTGACGATATACTGGTTTATAGTAGAAACTTAGAGGAACATTGCCAGCACATTGAGCTAGTTCTGGAAGTATTGAGGAGACATAAGCTGTTTGCTAATCGAAAGAAATGCAGTTTTGCGTACTCAAAGGAAGTTAGAGGGTTTCTGGGGTTGACTGGTTACTACCGCCATTTTGTACAGCACTATGGGTCCATAGCAGCACTTCTAACTCAACTACTTAAGCTGGGATCATTTAAATGGAATGAGGGAGCACAAGAAGCGTT
TGAAAAGCTTCAACGAGCAATGATGACCCTGCCTATACTAGCTCTTCCAGATTTTAACGCACCATTCAAAGTAGAGACAGATGCATTAGGCTATGGGGTAGGGACAGTGCTAATGCAGAACAAGAGACCAATTGCTTTTTATAGCCATACACTAGCCTTGAGAGACCAAGCCAAACCAGTTTACGAGAGGGAGTTAATGGCAGTAGTGTTAGCAGTCCAACGTTGGCGACCCTATTTGTTAGGAAGAACCTTCATAGTTAAGACAGATCAGCGATCACTTAAGTTCCTGCTGGAACAGAGAGTCATACAACCGCAATATCAGAAGTGGATTGCAAAATTGTTGGGTTATTCATTTGAGGTGGTGTATAAACCGGGCTTGGAAAACAAGGCAGCAGATGCCCTTTCACGAGTACCACCAACTGTCCATCTTAACCAACTAACAGCCCCCACCTTGGTAGACATAAAGGTAATCGGAGAGGAGGTTGACAAGGATGACTACTTGAAAGATATAATCAACCAGATGGGAGGAGGAGGTAAAGAATTACACTATGCAACAAGGAATACTGAGATACAAAGGGAGATTAGTGATTGCGAAGAACTCTTCATTGATATCTGCCATTATGCACACATATCATGA

Coding sequence (CDS)

ATGACCCAAATCCATGGCTATTCCGTGCAGATAGGCTCCCTATATGGTCGGTTCTTACGTATTCAGCAGGAATCAAGTGTAGAGGAATACCGGAATCTATTCGATAAGTGGGTGGCACCGTTATCGGACATTCCGGAAAAGATTGTGGAAGAGACGTTCATGGGAGGGCTCCGAGATGATGAGATACGCGCAAATGGTGGAACAATGGGAGATCCTGAGGAGAGAAACAAATTTCCCAGTTATTCTGGAGCGAAAGTTCCAAATTACACCTATAATACGGCCAAAACAAATTCAGTTATGAAAGAACAAGGGAACAAGGAGAACACAATATTTCCAATACAAACAATCACGTTGAGGGGATCACCGGCAAAGGAGATTAAGAAAGACGGACCATCCAAATGGCTTTCCGACGCAGAATTCCAGGCCAAGGAGAAAGGACTCTGTTTCAAATGCGATGAGAAGTATTACTCCGGGCACAAATGCAGGGTGAAGGAAATACGTGAGTTACATATGTCCGTGGTAAGAGCGGACGACGTGGAGGAAGAAATTATTGAAGAAGACGAGTATGACTTGAAGGAACTGAAAACGATGGAGTTGCAGAATGACCTTGGGGAAGTAAAGGAGTTATGTATTAACTCGGTAGTGGGATTGACGAATCCAGGTACCATGAAGATAAGGGGAAAAGTTCAAAGCAAGGAGGTTGTCGTGTTAGTGGATTGCGGAGCCACCCACAATTTCATATCCGACAGGCTAGTGATGACACTGAAATTACCCACAAAGGAGACTTCTAACTATGGGGTAATACTGGGATCAGGAATAGCCATCAAAGGCAAGGGAGTGTGTGAAAAAGTAGAGTTGGATCTCAATGGATGGACAGTCCTTGAAAACTTTCTACCGCTGGAACTGGGAGGGGTAGACGTGATACTTGGGATGCAATGGTTACACTCATTGGGAGTGACTGAGATGGACTGGAAGAACTTAACCATGTCATTTTTCCATGACAACAAAAAAATAGTGATAAAAGGGGATCCAAGCTTAACCAAAACTCAAGTGAGCTTGAAGAATTTAACTAAATCGTGGACGGTGTCAGACATGGGGTACTTGATTGAGTGCAGAACCCTAGAAGCCCACATAGCCGAGATAGAACCAGAGAACAATAACGTACCTGAGAGCATACTGACAGCCCTGAATCAGTATAATGATGTTTTCGATTGGCCCAAAGAATTGCCTCCAAGAAGGGATATCGAACATCATATACATATAAAGGGAGGGGCAGAACCGGTGAATGTCCGGCCCCATCGGTATGCGTTTCAGCAGAAGGAAGAAATGGAAAAACTGGTGGACGAAATGCTAACCTCAGGAATTATCCGCCCCAGCACAAGCCCCTACTCAAGCACCGTACTATTGGTCAAAAAGAAGGACGGAAGCTGGCGATTCTACGTGGACTACAGGGCACTCAACAACATAACTATTCCAGATAAGTTTCCTATCCCGGTTGTGGAAGAGCTGTTTGACGAGCTAAATGGTGCAAATCTATTCTCTAAAATTGACTTGAAAGCGGGATATCATCAACTTAGAATGTGTAGTCAAGATATAGAGAAGACGGCCTTTAGAACTCATGAAGGACATTATGAGTTTTTGGTGATGCCGTTTGGACTCACAAACGCACCAGCAACTTTCCAATCACTAATGAACTCGATTTTTAGATCGTATTTGAGGAAGTTCGTCTTGGTATTCTTTGACGATATACTGGTTTATAGTAGAAACTTAGAGGAACATTGCCAGCACATTGAGCTAGTTCTGGAAGTATTGAGGAGACATAAGCTGTTTGCTAATCGAAAGAAATGCAGTTTTGCGTACTCAAAGGAAGTTAGAGGGTTTCTGGGGTTGACTGGTTACTACCGCCATTTTGTACAGCACTATGGGTCCATAGCAGCACTTCTAACTCAACTACTTAAGCTGGGATCATTTAAATGGAATGAGGGAGCACAAGAAGCGTT
TGAAAAGCTTCAACGAGCAATGATGACCCTGCCTATACTAGCTCTTCCAGATTTTAACGCACCATTCAAAGTAGAGACAGATGCATTAGGCTATGGGGTAGGGACAGTGCTAATGCAGAACAAGAGACCAATTGCTTTTTATAGCCATACACTAGCCTTGAGAGACCAAGCCAAACCAGTTTACGAGAGGGAGTTAATGGCAGTAGTGTTAGCAGTCCAACGTTGGCGACCCTATTTGTTAGGAAGAACCTTCATAGTTAAGACAGATCAGCGATCACTTAAGTTCCTGCTGGAACAGAGAGTCATACAACCGCAATATCAGAAGTGGATTGCAAAATTGTTGGGTTATTCATTTGAGGTGGTGTATAAACCGGGCTTGGAAAACAAGGCAGCAGATGCCCTTTCACGAGTACCACCAACTGTCCATCTTAACCAACTAACAGCCCCCACCTTGGTAGACATAAAGGTAATCGGAGAGGAGGTTGACAAGGATGACTACTTGAAAGATATAATCAACCAGATGGGAGGAGGAGGTAAAGAATTACACTATGCAACAAGGAATACTGAGATACAAAGGGAGATTAGTGATTGCGAAGAACTCTTCATTGATATCTGCCATTATGCACACATATCATGA
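
A coding sequence like the one above is commonly given a few basic sanity checks: an ATG start, a length divisible by three, and a terminal stop codon. The sketch below is one possible way to run them, assuming the standard stop codons TAA, TAG, and TGA; `cds_checks` is a hypothetical helper, and no claim is made here about which checks this record's CDS passes. Whitespace is stripped first because the sequence is wrapped over more than one line.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)


def cds_checks(seq: str) -> dict:
    """Return basic structural checks for a nucleotide coding sequence."""
    # Remove line breaks and spaces, then normalise case.
    s = "".join(seq.split()).upper()
    return {
        "length": len(s),
        "starts_with_atg": s.startswith("ATG"),
        "length_multiple_of_3": len(s) % 3 == 0,
        "ends_with_stop": s[-3:] in STOP_CODONS,
    }


# Toy input only; paste the CDS lines from the record to check them.
print(cds_checks("ATGAAA\nTGA"))
```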
BLAST of CSPI03G20180 vs. Swiss-Prot
Match: POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 2.4e-79
Identity = 200/615 (32.52%), Positives = 308/615 (50.08%), Query Frame = 1

Query: 237 LVDCGATHNFISDRLVMTLKLPTKE------TSNYGVILGSGIAIKGKGVCEKVELDLNG 296
           L+D G+T N I++ +     LP +       TSN  + L   I +    + +K E     
Sbjct: 28  LLDTGSTINMINENIFC---LPIQNSRCEVLTSNGPITLNDLIMLPRNSIFKKTE----- 87

Query: 297 WTVLENFLPLELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIVIKGDPSLTKTQ 356
                 ++       D+++G + L +   + +++KN T++ F    K++     S     
Sbjct: 88  ----PFYVHRFSNNYDMLIGRKLLKN-AQSVINYKNDTVTLFDQTYKLITS--ESERNQN 147

Query: 357 VSLKNLTKSWTVSDMGYLIECRTLEAHIAEIEPENNNVPESILTALNQYNDV-FDWPKEL 416
           + ++   +S   SD   +   + L+     ++  N      +   LN++ ++ +   ++L
Sbjct: 148 LYIQRTPESIASSDQESI---KKLDFSQFRLDHLNQEETFKLKGLLNKFRNLEYKEGEKL 207

Query: 417 PPRRDIEHHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLL 476
                I+H ++    + P+  + +  A   + E+E  V EML  G+IR S SPY+S   +
Sbjct: 208 TFTNTIKHVLNTTHNS-PIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWV 267

Query: 477 VKKKDGS-----WRFYVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQ 536
           V KK  +     +R  +DYR LN ITIPD++PIP ++E+  +L     F+ IDL  G+HQ
Sbjct: 268 VPKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQ 327

Query: 537 LRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDIL 596
           + M  + I KTAF T  GHYE+L MPFGL NAPATFQ  MN+I R  L K  LV+ DDI+
Sbjct: 328 IEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDII 387

Query: 597 VYSRNLEEHCQHIELVLEVLRRHKLFANRKKCSF-------------------------- 656
           ++S +L EH   I+LV   L    L     KC F                          
Sbjct: 388 IFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKA 447

Query: 657 -------AYSKEVRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWNEGAQ--EAFEK 716
                     KE+R FLGLTGYYR F+ +Y  IA  +T  LK  +    +  +  EAFEK
Sbjct: 448 IVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEK 507

Query: 717 LQRAMMTLPILALPDFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRDQAKPVYE 776
           L+  ++  PIL LPDF   F + TDA    +G VL QN  PI+F S TL   +      E
Sbjct: 508 LKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIE 567

Query: 777 RELMAVVLAVQRWRPYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVY 805
           +EL+A+V A + +R YLLGR F++ +D + L++L   +    + ++W  +L  Y F++ Y
Sbjct: 568 KELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDY 623

BLAST of CSPI03G20180 vs. Swiss-Prot
Match: POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 297.0 bits (759), Expect = 6.9e-79
Identity = 199/620 (32.10%), Positives = 309/620 (49.84%), Query Frame = 1

Query: 228 KVQSKEVVVLVDCGATHNFISDRLVMTLKLPTKETSNYGVILGSGIAIKGKGVC--EKVE 287
           K +   +  L+D G+T N  S  +     LP + TS + +   +G  I  K +    K+ 
Sbjct: 19  KYKENNLKCLIDTGSTVNMTSKNI---FDLPIQNTSTF-IHTSNGPLIVNKSIIIPSKIL 78

Query: 288 LDLNGWTVLENFLPLELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIVIKGDPS 347
                  +L  F        D++LG + L     T + +++  ++ ++ NK  +I+G  +
Sbjct: 79  FPTTNEFLLHPFSE----NYDLLLGRKLLAEAKAT-ISYRDQEVTLYN-NKYKLIEGIAT 138

Query: 348 LTKTQVSLKNLTKSWTVSDMGYLIECRTLEAHIAEIEPENNNVPESILTALNQYNDV-FD 407
             ++     N+     +     +     LE+ +  +E  NN   + +   L +Y+D+ + 
Sbjct: 139 HEQSHFQNVNMIPDTMLRQPNKISPI--LESDLYRLEHLNNEEKQRLCALLQKYHDIQYH 198

Query: 408 WPKELPPRRDIEHHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYS 467
              +L      +H I+ K      +   +  A++Q  E+E  + +ML  GIIR S SPY+
Sbjct: 199 EGDKLTFTNQTKHTINTKHNLPLYSKYSYPQAYEQ--EVESQIQDMLNQGIIRTSNSPYN 258

Query: 468 STVLLV-KKKDGS----WRFYVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLK 527
           S + +V KK+D S    +R  +DYR LN IT+ D+ PIP ++E+  +L   N F+ IDL 
Sbjct: 259 SPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLA 318

Query: 528 AGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVF 587
            G+HQ+ M  + + KTAF T  GHYE+L MPFGL NAPATFQ  MN I R  L K  LV+
Sbjct: 319 KGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVY 378

Query: 588 FDDILVYSRNLEEHCQHIELVLEVLRRHKLFANRKKCSFAYS------------------ 647
            DDI+V+S +L+EH Q + LV E L +  L     KC F                     
Sbjct: 379 LDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNP 438

Query: 648 ---------------KEVRGFLGLTGYYRHFVQHYGSIAALLTQLLK--LGSFKWNEGAQ 707
                          KE++ FLGLTGYYR F+ ++  IA  +T+ LK  +     N    
Sbjct: 439 EKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYD 498

Query: 708 EAFEKLQRAMMTLPILALPDFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRDQA 767
            AF+KL+  +   PIL +PDF   F + TDA    +G VL Q+  P+++ S TL   +  
Sbjct: 499 SAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSYISRTLNEHEIN 558

Query: 768 KPVYERELMAVVLAVQRWRPYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYS 805
               E+EL+A+V A + +R YLLGR F + +D + L +L   +    +  +W  KL  + 
Sbjct: 559 YSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFD 618

BLAST of CSPI03G20180 vs. Swiss-Prot
Match: POL5_DROME (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 272.7 bits (696), Expect = 1.4e-71
Identity = 158/444 (35.59%), Positives = 238/444 (53.60%), Query Frame = 1

Query: 426 EPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKK-----DGSWRFY 485
           +P+  + + Y    + E+E+ +DE+L  GIIRPS SPY+S + +V KK     +  +R  
Sbjct: 122 DPIYAKSYPYPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMV 181

Query: 486 VDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTH 545
           VD++ LN +TIPD +PIP +      L  A  F+ +DL +G+HQ+ M   DI KTAF T 
Sbjct: 182 VDFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTL 241

Query: 546 EGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELV 605
            G YEFL +PFGL NAPA FQ +++ I R ++ K   V+ DDI+V+S + + H +++ LV
Sbjct: 242 NGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLV 301

Query: 606 LEVLRRHKLFANRKKCSFAYS---------------------------------KEVRGF 665
           L  L +  L  N +K  F  +                                 KE++ F
Sbjct: 302 LASLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRF 361

Query: 666 LGLTGYYRHFVQHYGSIAALLTQLLK--LGSFK----------WNEGAQEAFEKLQRAMM 725
           LG+T YYR F+Q Y  +A  LT L +    + K           +E A ++F  L+  + 
Sbjct: 362 LGMTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILC 421

Query: 726 TLPILALPDFNAPFKVETDALGYGVGTVLMQN----KRPIAFYSHTLALRDQAKPVYERE 785
           +  ILA P F  PF + TDA  + +G VL Q+     RPIA+ S +L   ++     E+E
Sbjct: 422 SSEILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKE 481

Query: 786 LMAVVLAVQRWRPYLLGR-TFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYK 815
           ++A++ ++   R YL G  T  V TD + L F L  R    + ++W A++  Y+ E++YK
Sbjct: 482 MLAIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYK 541

BLAST of CSPI03G20180 vs. Swiss-Prot
Match: POLY_DROME (Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 1.1e-60
Identity = 147/431 (34.11%), Positives = 224/431 (51.97%), Query Frame = 1

Query: 426 EPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKK------DGSWRF 485
           EPV  R +       + +   V ++L  GIIRPS SPY+S   +V KK      + + R 
Sbjct: 180 EPVYSRAYPTLMGVSDFVNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRL 239

Query: 486 YVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRT 545
            +D+R LN  TIPD++P+P +  +   L  A  F+ +DLK+GYHQ+ +   D EKT+F  
Sbjct: 240 VIDFRKLNEKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSV 299

Query: 546 HEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIEL 605
           + G YEF  +PFGL NA + FQ  ++ + R  + K   V+ DD++++S N  +H +HI+ 
Sbjct: 300 NGGKYEFCRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDT 359

Query: 606 VLEVLRRHKLFANRKKC----------SFAYSKE-----------------------VRG 665
           VL+ L    +  +++K            F  SK+                       VR 
Sbjct: 360 VLKCLIDANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRS 419

Query: 666 FLGLTGYYRHFVQHYGSIAALLTQLLK--LGSF----------KWNEGAQEAFEKLQRAM 725
           FLGL  YYR F++ + +IA  +T +LK   GS           ++NE  + AF++L+  +
Sbjct: 420 FLGLASYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNIL 479

Query: 726 MTLP-ILALPDFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRDQAKPVYERELM 785
            +   IL  PDF  PF + TDA   G+G VL Q  RPI   S TL   +Q     EREL+
Sbjct: 480 ASEDVILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELL 539

Query: 786 AVVLAVQRWRPYLLG-RTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPG 804
           A+V A+ + + +L G R   + TD + L F +  R    + ++W + +  ++ +V YKPG
Sbjct: 540 AIVWALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPG 599

BLAST of CSPI03G20180 vs. Swiss-Prot
Match: POL4_DROME (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster GN=POL PE=3 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 5.9e-54
Identity = 137/467 (29.34%), Positives = 221/467 (47.32%), Query Frame = 1

Query: 387 NVPESILTAL----NQYNDVFDWPKE-LPPRRDIEHHIHIKGGAEPVNVRPHRYAFQQKE 446
           N PE   + L    ++Y D+F    E +      +  + +K   EPV  + +R    Q E
Sbjct: 270 NFPELFKSQLENICSEYIDIFALESEPITVNNLYKQQLRLKDD-EPVYTKNYRSPHSQVE 329

Query: 447 EMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKKDG------SWRFYVDYRALNNITIPDKF 506
           E++  V +++   I+ PS S Y+S +LLV KK         WR  +DYR +N   + DKF
Sbjct: 330 EIQAQVQKLIKDKIVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKF 389

Query: 507 PIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTN 566
           P+P ++++ D+L  A  FS +DL +G+HQ+ +     + T+F T  G Y F  +PFGL  
Sbjct: 390 PLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKI 449

Query: 567 APATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLRRHKLFANRKK 626
           AP +FQ +M   F         ++ DD++V   + +   +++  V    R + L  + +K
Sbjct: 450 APNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEK 509

Query: 627 CSFAYSK---------------------------------EVRGFLGLTGYYRHFVQHYG 686
           CSF   +                                   R F+    YYR F++++ 
Sbjct: 510 CSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFA 569

Query: 687 SIAALLTQLLKLG-SFKWNEGAQEAFEKLQRAMMTLPILALPDFNAPFKVETDALGYGVG 746
             +  +T+L K    F+W +  Q+AF  L+  ++   +L  PDF+  F + TDA     G
Sbjct: 570 DYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACG 629

Query: 747 TVLMQN----KRPIAFYSHTLALRDQAKPVYERELMAVVLAVQRWRPYLLGRTFIVKTDQ 805
            VL QN    + P+A+ S      +  K   E+EL A+  A+  +RPY+ G+ F VKTD 
Sbjct: 630 AVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGKHFTVKTDH 689

BLAST of CSPI03G20180 vs. TrEMBL
Match: A5B2I6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043911 PE=4 SV=1)

HSP 1 Score: 776.2 bits (2003), Expect = 4.3e-221
Identity = 425/874 (48.63%), Positives = 572/874 (65.45%), Query Frame = 1

Query: 10   QIGSLYGRFLRIQQESSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLRDDEIRANGGTM 69
            Q GSL  +FL ++Q+ +V  Y   F+    PL  I E+++E TFM GL   EIRA    +
Sbjct: 847  QEGSLCEQFLAVRQQGTVAAYWREFEILETPLKGISEEVMESTFMNGLLP-EIRAEQRLL 906

Query: 70   GD------------PEERNKFPSYSGAKVPNYTYNTAKTNSVMKEQGNKENTIFPIQTIT 129
                           E+RN   +   A+ PN   +T   ++  + +  K    F  + + 
Sbjct: 907  QPYGLGHLMEMAQRVEDRNL--AMRAAREPNGPKSTKMLSTANRGEW-KIGENFQTRAVA 966

Query: 130  LRGSPAKEIKKDGPSKWLSDAEFQAK-EKGLCFKCDEKYYSGHKCRVKEIRELHMSVVRA 189
            + G      +++ P K L+++E QA+ EKGL FKC+EK+  GH+C+ KE+R L +     
Sbjct: 967  V-GEKTMSQRREIPIKRLTESELQARREKGLWFKCEEKFSPGHRCK-KELRVLLVH---- 1026

Query: 190  DDVEEEIIEEDEYDLKELKTMELQNDLGEVKELCINSVVGLTNPGTMKIRGKVQSKEVVV 249
            +D EE+  + D+   +E   +EL++ +    EL +NSVVGLT PGTMKI+G + SKEV++
Sbjct: 1027 EDEEEDDNQFDDRATEEPALIELKDAV----ELSLNSVVGLTTPGTMKIKGTIGSKEVII 1086

Query: 250  LVDCGATHNFISDRLVMTLKLPTKETSNYGVILGSGIAIKGKGVCEKVELDLNGWTVLEN 309
            LVD GATHNF+S  LV  L LP   T++YGV++G+GI++KGKG+C  V + + G TV+E+
Sbjct: 1087 LVDSGATHNFLSLELVQQLTLPLTTTTSYGVMMGTGISVKGKGICRGVCISMQGLTVVED 1146

Query: 310  FLPLELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIVIKGDPSLTKTQVSLKNL 369
            FLPLELG  DVILGM WL +LG  +++WK LTM        +V+KGDPSL++T+ S    
Sbjct: 1147 FLPLELGNTDVILGMPWLGTLGDVKVNWKMLTMKIKMGKAVMVLKGDPSLSRTETS---- 1206

Query: 370  TKSWTVSDMGYLIECRTLEAHIAEIEPENNNVPESILTALNQYNDVFDWPKELPPRRDIE 429
                T SD+   ++                 VP+++   L Q+  +F+    LPP RDI+
Sbjct: 1207 ----TTSDLSEGVQ----------------EVPKTVKEVLAQHQQIFEPITGLPPSRDID 1266

Query: 430  HHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKKDGS 489
            H I +  GA PVNVRP+RY    K E+++LV EML +GI+RPS SP+SS VLLVKKKDG 
Sbjct: 1267 HAIQLILGASPVNVRPYRYPHILKNEIKRLVQEMLEAGIVRPSLSPFSSPVLLVKKKDGG 1326

Query: 490  WRFYVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTA 549
            WRF +DYRALN +T+PD+FPIPV++EL D+L+GA +FSK+DLK+GYHQ+R+  QDI KTA
Sbjct: 1327 WRFCIDYRALNKVTVPDRFPIPVIDELLDKLHGATIFSKLDLKSGYHQIRVRQQDIPKTA 1386

Query: 550  FRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQH 609
            FRTHEGHYEFLVMPFGLTNAPATFQSLMN IF  +L KFVLVFF DILVYS++L+EHC H
Sbjct: 1387 FRTHEGHYEFLVMPFGLTNAPATFQSLMNRIFWPHLWKFVLVFFYDILVYSKDLKEHCDH 1446

Query: 610  IELVLEVLRRHKLFANRKKCSFAYS---------------------------------KE 669
            ++ VL +L  H+L  N KKC FA                                   KE
Sbjct: 1447 LQTVLSILANHQLHVNGKKCLFAKLQLEYLGHLVSAKGVAADPNKISAMVEWPTPKSLKE 1506

Query: 670  VRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWNEGAQEAFEKLQRAMMTLPILALP 729
            +RGFLGLTGYYR FV+ YG+I+  LTQ LK  +F WN  A+ AF+KL+  M T+P+LALP
Sbjct: 1507 LRGFLGLTGYYRRFVEGYGAISWPLTQELKKDAFNWNLEAEVAFQKLKTTMTTIPVLALP 1566

Query: 730  DFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRDQAKPVYERELMAVVLAVQRWR 789
            +F+  F VE DA GYG+GTVLMQ+ RP+A++S  L  R++ K +YERELMA+VLAVQ+WR
Sbjct: 1567 NFSQLFIVEMDASGYGLGTVLMQSHRPVAYFSQVLTARERQKSIYERELMAIVLAVQKWR 1626

Query: 790  PYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSR 838
             YLLGR FIV+TDQ SLKFLLEQR++   YQKW+AKL GY FE+ ++PG ENKAADALSR
Sbjct: 1627 HYLLGRHFIVRTDQSSLKFLLEQRIVNESYQKWVAKLFGYDFEIQFRPGXENKAADALSR 1682

BLAST of CSPI03G20180 vs. TrEMBL
Match: A0A087G3S6_ARAAL (Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs46225U000100 PE=4 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 1.7e-201
Identity = 379/751 (50.47%), Positives = 500/751 (66.58%), Query Frame = 1

Query: 121  SPAKEIKKDGPSKWLSDAEF-QAKEKGLCFKCDEKYYSGHKCRVKEIRELHMSVVRADDV 180
            SP +  +   P + L+  E  Q K    C++CDE  +  H C  KE   L   VV+ D  
Sbjct: 344  SPNQSDRVTPPYRKLTAEEVAQRKAANQCYRCDEVGHMRHMCPKKEFGVL---VVQTDGS 403

Query: 181  EEEIIEEDEYDLKELKTMELQNDLGEVKELCINSVVGLTNPGTMKIRGKVQSKEVVVLVD 240
              E+ EED    K     +   +  E+  L +NS+VG+++P TMK+RG++QS  VVV++D
Sbjct: 404  YREL-EED----KPGNPGDEGQEEPELAALSLNSIVGISSPRTMKLRGQLQSATVVVMID 463

Query: 241  CGATHNFISDRLVMTLKLPTKETSNYGVILGSGIAIKGKGVCEKVELDLNGWTVLENFLP 300
             GA+HNF+S ++V TL L   E S YGV+ G+G+ ++G G    ++L++    V   FLP
Sbjct: 464  SGASHNFVSTKVVSTLGLVIDEASRYGVVTGTGMTVQGFGSPLLLQLEIQEIMVRAEFLP 523

Query: 301  LELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIVIKGDPSLTKTQVSLKNLTKS 360
            LELG  DVILGMQWL SLG   ++WK  TM F  + + + ++GD  L    +SLK L KS
Sbjct: 524  LELGTADVILGMQWLESLGDMTVNWKLQTMKFMLNEELVKLQGDAGLCCAPISLKALWKS 583

Query: 361  WTVSDMGYLIECRTLEAHIAEIEPENNNVPESILTALNQYNDVFDWPKELPPRRDIEHHI 420
                  G L+E   L+A +   +     +P  +LT L Q+  VF+ P+ LPP R  EH+I
Sbjct: 584  LADQGQGVLVEYCGLQAEL-HTQRRREQLPHQLLTVLEQFARVFEDPQGLPPSRGKEHNI 643

Query: 421  HIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKKDGSWRF 480
             ++  A+PV+VRP RY   Q+EE+EK V  ML +G+I+ S SP+SS VLLVKKKDGSWRF
Sbjct: 644  VLEPNAKPVSVRPFRYPQAQREEVEKQVASMLAAGLIQASGSPFSSPVLLVKKKDGSWRF 703

Query: 481  YVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRT 540
             VDYRALN +TIPD FPIP++++L DEL+GA +FSK+DLK+GYHQ+ + ++D+ KTAFRT
Sbjct: 704  CVDYRALNKVTIPDSFPIPMIDQLLDELHGATIFSKLDLKSGYHQILVKAEDVAKTAFRT 763

Query: 541  HEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIEL 600
            H+GHYEFLVMPFGLTNAPATFQSLMN +FR YLRKFVLVFFDDILVYS++L+EH QH+ L
Sbjct: 764  HDGHYEFLVMPFGLTNAPATFQSLMNDVFRGYLRKFVLVFFDDILVYSKSLQEHQQHLGL 823

Query: 601  VLEVLRRHKLFANRKKCSFAYS---------------------------------KEVRG 660
            VLE+L++H+LFAN+KKC F  +                                 K +RG
Sbjct: 824  VLELLQQHQLFANKKKCEFGRTELEYLGHVVSGKGVAADPEKIQAMVSWPEPQNVKALRG 883

Query: 661  FLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWNEGAQEAFEKLQRAMMTLPILALPDFN 720
            FLGLTGYYR FVQ YG IA  LT LLK   F+W   A  AF+KL++AM T+P+LAL DF 
Sbjct: 884  FLGLTGYYRKFVQRYGEIARPLTALLKKDQFQWTAEATVAFQKLKKAMSTVPVLALVDFT 943

Query: 721  APFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRDQAKPVYERELMAVVLAVQRWRPYL 780
              F VE+DA G G+G VLMQ++RP+A++S  L  R + K VYERELMA+V A+Q+WR YL
Sbjct: 944  EQFVVESDASGTGLGAVLMQSQRPLAYFSQALTERQRLKSVYERELMAIVFAIQKWRHYL 1003

Query: 781  LGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPP 838
            LGR F+V+TDQ+SLKFLLEQR I  +YQKW+ KLLG+ FE+ YKPGLENKAADALSR   
Sbjct: 1004 LGRKFVVRTDQKSLKFLLEQREINMEYQKWLTKLLGFDFEIQYKPGLENKAADALSRKDM 1063

BLAST of CSPI03G20180 vs. TrEMBL
Match: A0A087HFW3_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA2G074100 PE=4 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 1.7e-201
Identity = 404/885 (45.65%), Positives = 535/885 (60.45%), Query Frame = 1

Query: 17   RFLRIQQESSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLRDDEIRA------------ 76
            R L ++Q  SV++Y   F        +I E  +E  FM GL    IRA            
Sbjct: 203  RLLTLRQTGSVKDYCREFIALATNAPEIQETTLELAFMVGLTP-AIRARTKTFEPRTLKQ 262

Query: 77   --------NGGTMGDPEERNKFPSYSGAKVPNYTYNTA-KTNSVMKEQGNKENTIFPIQT 136
                    +G +  D     +F S   +        ++  TN V    G   N   P  +
Sbjct: 263  MMGIAQRIDGWSTADDSPSQRFSSGGRSDAKGVQLRSSGPTNQVSGRSGYGPNNPKPTNS 322

Query: 137  --ITLRGSPAKEIKKDGPS----------KWLSDAEFQAKEKGLCFKCDEKYYSGHKCRV 196
               T R +  ++     P+          K  +D   Q K    C++CDE  +  H C  
Sbjct: 323  SSFTTRATSFQKAGYRSPNQSERVTPPYRKLTADEVAQRKAANQCYRCDEVGHMRHMCPK 382

Query: 197  KEIRELHMSVVRADDVEEEIIEEDEYDLKELKTMELQNDLG----EVKELCINSVVGLTN 256
            KE   L   VV+ D    E+ EE+            QN  G    E+ EL +NS+VG+++
Sbjct: 383  KEFGVL---VVQTDGSYRELEEEEN---------GTQNGEGQEEPELAELSLNSIVGISS 442

Query: 257  PGTMKIRGKVQSKEVVVLVDCGATHNFISDRLVMTLKLPTKETSNYGVILGSGIAIKGKG 316
            P TMK+RG++QS  VVV++D GA+HNF+S ++V +L L   + S+YGV+ G+G+ ++G G
Sbjct: 443  PRTMKLRGQLQSVPVVVMIDSGASHNFVSTKVVSSLGLSIDKASSYGVVTGTGMTVQGIG 502

Query: 317  VCEKVELDLNGWTVLENFLPLELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIV 376
                + L++    V   FLPLELG  DVILGMQWL SLG   ++WK  TM F  +++ + 
Sbjct: 503  SPMLLRLEIQEIVVAAEFLPLELGTADVILGMQWLESLGEMTVNWKLQTMRFLLNDEAVG 562

Query: 377  IKGDPSLTKTQVSLKNLTKSWTVSDMGYLIECRTLEAHIAEIEPENNNVPESILTALNQY 436
            ++GD  L    +SLK L KS      G L+E   L+  +  ++     +P  +L  L Q+
Sbjct: 563  LQGDVGLCCAPISLKALWKSLADQGQGVLVEFCGLQTELL-LKNRKEQLPPQLLEVLEQF 622

Query: 437  NDVFDWPKELPPRRDIEHHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPS 496
              VF+ P+ LPP R  EH I ++  A+PV+VRP RY   Q+EE+EK V  ML +GII+ S
Sbjct: 623  AGVFEDPQGLPPSRGKEHSIVLEPNAKPVSVRPFRYPQAQREEIEKQVASMLAAGIIQAS 682

Query: 497  TSPYSSTVLLVKKKDGSWRFYVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLK 556
             SP+SS VLLVKKKDGSWRF VDYRALN +TIPD FPIP++++L DEL+GA +FSK+DLK
Sbjct: 683  GSPFSSPVLLVKKKDGSWRFCVDYRALNKVTIPDSFPIPMIDQLLDELHGATIFSKLDLK 742

Query: 557  AGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVF 616
            +GYHQ+ + ++D+ KTAFRTH+GHYEFLVMPFGLTNAPATFQSLMN +FR YLRKFVLVF
Sbjct: 743  SGYHQILVKAEDVAKTAFRTHDGHYEFLVMPFGLTNAPATFQSLMNDVFRGYLRKFVLVF 802

Query: 617  FDDILVYSRNLEEHCQHIELVLEVLRRHKLFANRKKCSFAYSK----------------- 676
            FDDILVYS++L+EH QH+  VL +L++H+LFAN+KKC F  SK                 
Sbjct: 803  FDDILVYSKSLQEHQQHLGQVLALLQKHQLFANQKKCDFGRSKLEYLGHVVSGQGVAADP 862

Query: 677  ----------------EVRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWNEGAQEA 736
                             +RGFLGLTGYYR FVQ YG IA  LT LLK   F W   A  A
Sbjct: 863  EKIQAMVSWSEPQNVKALRGFLGLTGYYRKFVQGYGEIARPLTALLKKDQFHWTAEATMA 922

Query: 737  FEKLQRAMMTLPILALPDFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRDQAKP 796
            F++L++AM T+P+LAL DF   F VE+DA G G+G VLMQ +RP+A++S  L  R + K 
Sbjct: 923  FQQLKKAMSTVPVLALVDFTEQFVVESDASGTGLGAVLMQQQRPLAYFSQALTERQRLKS 982

Query: 797  VYERELMAVVLAVQRWRPYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYSFE 832
            VYERELMA+V A+Q+WR YLLGR F+V+TDQ+SLKFLLEQR I  +YQKW+ KLLG+ FE
Sbjct: 983  VYERELMAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLLEQREINLEYQKWLTKLLGFDFE 1042

BLAST of CSPI03G20180 vs. TrEMBL
Match: A0A087GEK8_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1)

HSP 1 Score: 710.7 bits (1833), Expect = 2.2e-201
Identity = 367/755 (48.61%), Positives = 514/755 (68.08%), Query Frame = 1

Query: 117  TLRGSPAKEIKKDGPSKWLSDAEF-QAKEKGLCFKCDEKYYSGHKCRVKEIRELHMSVVR 176
            T + +P    +   P + L+  E  Q K  GLCF+CDEK++  H+C  KE+  L   +V+
Sbjct: 315  TEKRNPTTHNRVKPPYRRLTPIEMAQRKADGLCFRCDEKWHIRHQCPKKEVNVL---LVQ 374

Query: 177  ADDVEEEIIEEDEYDLKELKTMELQNDLGEVKELCINSVVGLTNPGTMKIRGKVQSKEVV 236
             D    +I+ E + D  +     +     E+ EL +NS+VG+++P TMK+ G +Q+ EVV
Sbjct: 375  EDG--PDILWEADDDFTDATDQAIT----ELAELSLNSMVGISSPSTMKLMGTIQTTEVV 434

Query: 237  VLVDCGATHNFISDRLVMTLKLPTKETSNYGVILGSGIAIKGKGVCEKVELDLNGWTVLE 296
            VL+D GA+HNF+S++LV  L L + +T +YGV+ G G+ ++G GVC  + L L G  + +
Sbjct: 435  VLIDSGASHNFVSEQLVHRLGLQSAKTGSYGVLTGGGMTVRGAGVCRGLVLLLQGLRIRD 494

Query: 297  NFLPLELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIVIKGDPSLTKTQVSLKN 356
            +FLPLELG  DVILG++WL SLG  +++W    M F    +  V++GDP    + +SLK+
Sbjct: 495  DFLPLELGSADVILGIKWLSSLGEMKVNWGRQYMRFSLGGETAVLQGDPGQGCSAISLKS 554

Query: 357  LTKSWTVSDMGYLIECRTLEAHIAEIEPENNNVPESILTALNQYNDVFDWPKELPPRRDI 416
            L ++     +G L+E   L++ + ++      VP+++++ ++Q+  VF+ P+ LPP R  
Sbjct: 555  LMRAVKDQGVGLLVEYNGLQS-LDQVAGFTTEVPQALVSVMDQFPQVFEDPQGLPPTRGR 614

Query: 417  EHHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKKDG 476
             H I+++ GA+ V+VRP RY   QK E+EK V  ML +GII+ STS +SS VLLVKKKDG
Sbjct: 615  AHEINLESGAKAVSVRPFRYPQTQKAEIEKQVTAMLAAGIIQESTSTFSSPVLLVKKKDG 674

Query: 477  SWRFYVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKT 536
            SWRF +DYRALN +TIPD FPIP++++L DEL+GA +FSK+DLK+GYHQ+ +  Q++ KT
Sbjct: 675  SWRFCIDYRALNKVTIPDSFPIPMIDQLLDELHGATVFSKLDLKSGYHQILVKPQNVPKT 734

Query: 537  AFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQ 596
            AFRTH+GHYEFLVMPFGLTNAP TFQ+LMN +FR++LRKFVLVFFDDILVYS +L+EH +
Sbjct: 735  AFRTHDGHYEFLVMPFGLTNAPTTFQALMNEVFRAHLRKFVLVFFDDILVYSSSLQEHQE 794

Query: 597  HIELVLEVLRRHKLFANRKKCSFAYS---------------------------------K 656
            H+ +VL++L + +LFAN+KKC F  S                                 K
Sbjct: 795  HLRVVLQILFQQQLFANKKKCQFGSSSIEYLGHVISGEGVSADPSKLQAMVSWPLPKNIK 854

Query: 657  EVRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWNEGAQEAFEKLQRAMMTLPILAL 716
             +RGFLGLTGYYR FVQ YGSIA  LT LLK   F+W+E A  AFEKL+ AM T+P+LAL
Sbjct: 855  ALRGFLGLTGYYRRFVQGYGSIAKPLTSLLKKDKFQWSEEATVAFEKLKVAMSTVPVLAL 914

Query: 717  PDFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRDQAKPVYERELMAVVLAVQRW 776
             DF+  F VE+DA G G+G VL+Q ++P+A++S  L  R + K VYERELMA+V A+Q+W
Sbjct: 915  VDFSELFVVESDASGIGLGAVLLQKQKPVAYFSQALTDRQKLKSVYERELMAIVFAIQKW 974

Query: 777  RPYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALS 836
            R YLLGR F+V+TDQ+SLKFLLEQR +  +YQ+W+ K+LG++F++ YKPGLENKAADALS
Sbjct: 975  RHYLLGRKFLVRTDQKSLKFLLEQREVNLEYQQWLTKILGFNFDIHYKPGLENKAADALS 1034

Query: 837  RVPPTVHLNQLTAPTLVDIKVIGEEVDKDDYLKDI 838
            RV     L  L+ P  + ++ I EEVD++   K I
Sbjct: 1035 RVEGLPQLYALSVPAAIQLEEINEEVDRNPVSKKI 1059

BLAST of CSPI03G20180 vs. TrEMBL
Match: A0A087G291_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AAs48021U000700 PE=4 SV=1)

HSP 1 Score: 706.8 bits (1823), Expect = 3.2e-200
Identity = 365/741 (49.26%), Positives = 500/741 (67.48%), Query Frame = 1

Query: 131  PSKWLSDAEFQ-AKEKGLCFKCDEKYYSGHKCRVKEIRELHMSVVRADDVEEEIIEEDEY 190
            P + L+ AE +  K +G+CF+CDEK +S  +C  KE      +V+   D   EI  EDE 
Sbjct: 320  PYRKLTQAEIEWRKAEGMCFRCDEKGHSRSQCPHKEY-----AVLIVQDDGSEIEWEDEG 379

Query: 191  DLKELKTMELQNDLGEVKELCINSVVGLTNPGTMKIRGKVQSKEVVVLVDCGATHNFISD 250
              ++++ +    D  EV EL +NS+VG+++P T+K+RG ++ + V+V++D GA+HNF+S+
Sbjct: 380  GEEKIEAIL---DTAEVAELSLNSMVGISSPRTVKLRGSIRDEPVIVMIDSGASHNFVSE 439

Query: 251  RLVMTLKLPTKETSNYGVILGSGIAIKGKGVCEKVELDLNGWTVLENFLPLELGGVDVIL 310
            ++V+ L L   ET  YGV+ G+G+ ++G+GVC+ VEL L G  V+  FLPLELG  DVIL
Sbjct: 440  KMVVKLGLTATETKGYGVVTGTGLTVQGRGVCKDVELHLQGLVVVAPFLPLELGSADVIL 499

Query: 311  GMQWLHSLGVTEMDWKNLTMSFFHDNKKIVIKGDPSLTKTQVSLKNLTKSWTVSDMGYLI 370
            G+QWL SLG    +WK   ++F  + K++ ++GDPS+  + V+LK L K+      G ++
Sbjct: 500  GIQWLGSLGDMRCNWKLQKIAFMVEGKEVELQGDPSICCSPVTLKGLWKALDQEGQGVIV 559

Query: 371  ECRTLEAHIAEIEPENNNVPESILTALNQYNDVFDWPKELPPRRDIEHHIHIKGGAEPVN 430
            E   L+A     E     VPE++ T L ++  VF+ P+ LPP R  EH I +K  A PV 
Sbjct: 560  EYGGLQAQNPRSEKP---VPEALSTVLAEFTGVFEEPRGLPPSRGKEHEITLKQEASPVC 619

Query: 431  VRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKKDGSWRFYVDYRALNNI 490
            VRP RY   Q+EE+E+ V  ML +GI + S SP+SS VLLVKKKDGSWRF VDYRALN +
Sbjct: 620  VRPFRYPQAQREELERQVATMLAAGITKESNSPFSSPVLLVKKKDGSWRFCVDYRALNKV 679

Query: 491  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVM 550
            T+ D +PIP++++L DEL+G+ +FSK+DL+AGYHQ+R+ ++D+ KTAFRTH+GHYEFLVM
Sbjct: 680  TVGDSYPIPMIDQLLDELHGSVIFSKLDLRAGYHQIRVKAEDVPKTAFRTHDGHYEFLVM 739

Query: 551  PFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLRRHKL 610
            PFGLTNAP TFQSLMN +FR +LR+FVLVFFDDIL+YS+   EH +H+ LVL+ L  ++L
Sbjct: 740  PFGLTNAPGTFQSLMNEVFRKFLRRFVLVFFDDILIYSKTEVEHQEHLRLVLKALAENQL 799

Query: 611  FANRKKCSFAYS---------------------------------KEVRGFLGLTGYYRH 670
             ANRKKC F                                    K +RGFLGLTGYYR 
Sbjct: 800  VANRKKCEFGRVEIEYLGHVISAKGVAADPAKVQAMVEWPSPGNIKALRGFLGLTGYYRK 859

Query: 671  FVQHYGSIAALLTQLLKLGSFKWNEGAQEAFEKLQRAMMTLPILALPDFNAPFKVETDAL 730
            FV+ YG IA  LT LLK   FKW+  A+EAF+ L+ AM T+P+LAL DF+  F VE+DA 
Sbjct: 860  FVKKYGEIARPLTALLKKDQFKWSPAAEEAFKSLKIAMSTVPVLALVDFSVQFVVESDAS 919

Query: 731  GYGVGTVLMQNKRPIAFYSHTLALRDQAKPVYERELMAVVLAVQRWRPYLLGRTFIVKTD 790
            G G+G VLMQ ++PIA++S  L  R + K VYERELMA+V A+++WR YLLGR F+V+TD
Sbjct: 920  GIGLGAVLMQQQQPIAYFSQALTERQRLKSVYERELMAIVFAIRKWRHYLLGRKFLVRTD 979

Query: 791  QRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPPTVHLNQLTAP 838
            Q+SLKFLLEQR +  +Y KW+ K+LG+ F++ YKPG+ENKAADALSRV     L  L+ P
Sbjct: 980  QKSLKFLLEQREVNMEYHKWLTKILGFDFDIQYKPGMENKAADALSRVEGP-QLFALSMP 1039

BLAST of CSPI03G20180 vs. TAIR10
Match: AT3G29750.1 (AT3G29750.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 97.8 bits (242), Expect = 3.5e-20
Identity = 70/230 (30.43%), Positives = 115/230 (50.00%), Query Frame = 1

Query: 113 IQTITLRGSPAKEIKKDGPSKWLSDAEFQAKEKGLCFKCDEKYYSGHKCRVKEIRELHMS 172
           ++++TL G   +E+   G    L  A  + K  G+         + ++ R  E+  L + 
Sbjct: 30  LRSVTLPGQGFEEMFLQGLQPSLQTAVRELKPNGI---------NSYQSRQAELMSLTLV 89

Query: 173 VVRADDVEEEIIEEDEYDLKELKTMELQNDLGEVKELCINSVVGLTNPGTMKIRGKVQSK 232
             + D     ++++ +  + EL+  EL+ D   +++     V+ LT    M+  G +   
Sbjct: 90  QAKLD-----VVKKKKGVINELE--ELEQDSYTLRQGMEQLVIDLTRNKGMRFYGFILDH 149

Query: 233 EVVVLVDCGATHNFISDRLVMTLKLPTKETSNYGVILGSGIAIKGKGVCEKVELDLNGWT 292
           +VVV +D GAT NFI   L  +LKLPT  T+   V+LG    I+  G C  + L +    
Sbjct: 150 KVVVAIDSGATDNFILVELAFSLKLPTSITNQASVLLGQRQCIQSVGTCLGIRLWVQEVE 209

Query: 293 VLENFLPLELG--GVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIVI 341
           + ENFL L+L    VDVILG +WL  LG T ++W+N   SF H+ + I +
Sbjct: 210 ITENFLLLDLAKTDVDVILGYEWLSKLGETMVNWQNQDFSFSHNQQWITL 243

BLAST of CSPI03G20180 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 96.3 bits (238), Expect = 1.0e-19
Identity = 54/129 (41.86%), Positives = 71/129 (55.04%), Query Frame = 1

Query: 596 HIELVLEVLRRHKLFANRKKCSF-----AY------------------------------ 655
           H+ +VL++  +H+ +ANRKKC+F     AY                              
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62

Query: 656 SKEVRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWNEGAQEAFEKLQRAMMTLPIL 690
           + E+RGFLGLTGYYR FV++YG I   LT+LLK  S KW E A  AF+ L+ A+ TLP+L
Sbjct: 63  TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPVL 122

BLAST of CSPI03G20180 vs. TAIR10
Match: AT3G30770.1 (AT3G30770.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 84.7 bits (208), Expect = 3.0e-16
Identity = 51/144 (35.42%), Positives = 75/144 (52.08%), Query Frame = 1

Query: 199 LQNDLGEVKELCINSVVGLTNPGTMKIRGKVQSKEVVVLVDCGATHNFISDRLVMTLKLP 258
           L  D   ++++   S    T    M+  G +   +VVV++D GAT+NFISD L + LKLP
Sbjct: 260 LLEDFKTIRQVKRQSTTEFTKGKDMRFYGFISCHKVVVVIDSGATNNFISDELALVLKLP 319

Query: 259 TKETSNYGVILGSGIAIKGKGVCEKVELDLNGWTVLENFLPLEL--GGVDVILGMQWLHS 318
           T  T+   V+LG    I+  G C  + L +    + ENFL L+L    VDVILG     +
Sbjct: 320 TSTTNQASVLLGQRQCIQTIGTCFGINLLVQEVEINENFLLLDLTKTDVDVILGYGGSQN 379

Query: 319 LGVTEMDWKNLTMSFFHDNKKIVI 341
           L    + W N   SFFH+ + + +
Sbjct: 380 LERQWLIWLNQDFSFFHNQQWVTL 403

BLAST of CSPI03G20180 vs. TAIR10
Match: AT3G42723.1 (AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding)

HSP 1 Score: 51.2 bits (121), Expect = 3.7e-06
Identity = 23/65 (35.38%), Positives = 40/65 (61.54%), Query Frame = 1

Query: 278 KGVCEKVELDLNGWTVLENFLPLELGG--VDVILGMQWLHSLGVTEMDWKNLTMSFFHDN 337
           K  C+++ L +N   ++E++   +L    VDVILG +WL  LG TE++W+N + SF H+ 
Sbjct: 503 KRSCQEISLRINDIDIVEDYCVWDLKRDVVDVILGYEWLSKLGETEVNWQNQSFSFIHNQ 562

Query: 338 KKIVI 341
             + +
Sbjct: 563 DWVTL 567

BLAST of CSPI03G20180 vs. NCBI nr
Match: gi|729344250|ref|XP_010541181.1| (PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana])

HSP 1 Score: 803.9 bits (2075), Expect = 2.8e-229
Identity = 436/914 (47.70%), Positives = 592/914 (64.77%), Query Frame = 1

Query: 10   QIGSLYGRFLRIQQESSVEEYRNLFDKWVAPLSDIPEKIVEETF---------------- 69
            ++GS + R L ++Q  +VEEY   F++ +A +   PE++VE TF                
Sbjct: 417  KLGSPFDRLLSLRQTGTVEEYLCEFEELLAQVPHTPEEMVESTFKNGLKPEILEILQIFR 476

Query: 70   ---MGGLRDDEIRANG-------GTMGDPEERNKFPSYSGAK-----VP--NYTYNTAKT 129
               M  + D  +   G       G  G  + +N    Y+G+      VP  N  Y     
Sbjct: 477  PKGMEEIVDVALSIEGSKLSAVCGGKGGSDGKNWRTGYTGSSFRTVSVPTENQKYQNQSY 536

Query: 130  NSVMKEQGNKENTIFPIQTITLRGSPAKEI----KKDGPSKWLSDAEFQAK-EKGLCFKC 189
             S  +E+G + N           G   KE     +K    K +SDAEF+ K +KGLCF+C
Sbjct: 537  RSNFQERGRQING----------GKGPKEEGGVQEKKSTFKRMSDAEFEEKRKKGLCFRC 596

Query: 190  DEKYYSGHKCRVKEIRELHMSVVRADDVEEEIIEEDEYDLKELKTMELQN--DLGEVKEL 249
            DEK++ GH+C+ KE++ +         + EEI E  E +L+E +  E  N  D GE  EL
Sbjct: 597  DEKFFVGHRCKQKELQVI---------LAEEITETGE-ELEEEQDNEAGNREDEGEFAEL 656

Query: 250  CINSVVGLTNPGTMKIRGKVQSKEVVVLVDCGATHNFISDRLVMTLKLPTKETSNYGVIL 309
             +NSVVGLT+P T+KIRG ++ +EVVVL+D GATHNFIS +L+  LKL  +  + +GV L
Sbjct: 657  SLNSVVGLTSPKTLKIRGSIEGQEVVVLIDSGATHNFISLKLMKKLKLRPEGNTQFGVSL 716

Query: 310  GSGIAIKGKGVCEKVELDLNGWTVLENFLPLELGGVDVILGMQWLHSLGVTEMDWKNLTM 369
            G+G+ +KGKG+C+ V L L    V+E+FLPLELG  D+ILG+QWL  LG  +MD+++L +
Sbjct: 717  GTGMKVKGKGICKAVHLQLQQIEVVEDFLPLELGSADLILGVQWLQKLGKVQMDFQDLEL 776

Query: 370  SFFHDNKKIVIKGDPSLTKTQVSLKNLTKSWTVSDMGYLIECRTLEAHIAEIEPENNNVP 429
             F      + + GDP+L  + V+L++L KS    D  YL++  TLE  +      ++N+P
Sbjct: 777  KFNQGTSWVTVTGDPTLHSSLVTLRSLIKSVCDGDQSYLVKLETLEEQVGV----DSNLP 836

Query: 430  ESILTALNQYNDVFDWPKELPPRRDIEHHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDE 489
            E +   L ++  VF+ P ELPP R  EH I++K G  PV+VRP+RY    KEE+EKLV +
Sbjct: 837  EKLQAVLEEFGPVFEIPTELPPERGREHPINLKEGTGPVSVRPYRYPHAHKEEIEKLVKD 896

Query: 490  MLTSGIIRPSTSPYSSTVLLVKKKDGSWRFYVDYRALNNITIPDKFPIPVVEELFDELNG 549
            ML +GI+RPS SP+SS VLLVKKKDGSWRF +DYRALN +T+ DKFPIP++++L DEL+G
Sbjct: 897  MLKAGIVRPSQSPFSSPVLLVKKKDGSWRFCIDYRALNKVTVLDKFPIPMIDQLLDELHG 956

Query: 550  ANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFR 609
            A +FSK+DL++GYHQ+RM ++DI KTAFRTH+GHYEFLVMPFGLTNAPATFQ+LMN IFR
Sbjct: 957  ARVFSKLDLRSGYHQIRMKTEDIPKTAFRTHDGHYEFLVMPFGLTNAPATFQALMNEIFR 1016

Query: 610  SYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLRRHKLFANRKKCSFAYS-------- 669
             YLRKFVLVFFDDILVYS +L++H  H++ VL VL++HKL+AN+KKC F           
Sbjct: 1017 PYLRKFVLVFFDDILVYSCSLQDHATHLQTVLAVLQKHKLYANKKKCEFGRQQIDYLGHI 1076

Query: 670  -------------------------KEVRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGS 729
                                     KE+RGFLGLTGYYR FVQ+YG+IA  LT LLK   
Sbjct: 1077 ISQEGVSTDPAKTAAMQKWPTPSNVKELRGFLGLTGYYRRFVQNYGTIARPLTDLLKKDG 1136

Query: 730  FKWNEGAQEAFEKLQRAMMTLPILALPDFNAPFKVETDALGYGVGTVLMQNKRPIAFYSH 789
            F W+E A  AF KL++AM + P+L LPDF   F VETDA G+G+G VLMQ  RPIAF+S 
Sbjct: 1137 FNWSEDASSAFRKLKQAMTSAPVLGLPDFREDFVVETDASGFGIGAVLMQKHRPIAFFSQ 1196

Query: 790  TLALRDQAKPVYERELMAVVLAVQRWRPYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKW 845
             L+ R++ KPVYERELMAVVL++QRWR YLLGR+F+V TDQ++LKFLLEQR +  +YQ+W
Sbjct: 1197 ALSERERLKPVYERELMAVVLSIQRWRHYLLGRSFLVCTDQKALKFLLEQREVSMEYQRW 1256

BLAST of CSPI03G20180 vs. NCBI nr
Match: gi|147854459|emb|CAN78588.1| (hypothetical protein VITISV_043911 [Vitis vinifera])

HSP 1 Score: 776.2 bits (2003), Expect = 6.2e-221
Identity = 425/874 (48.63%), Positives = 572/874 (65.45%), Query Frame = 1

Query: 10   QIGSLYGRFLRIQQESSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLRDDEIRANGGTM 69
            Q GSL  +FL ++Q+ +V  Y   F+    PL  I E+++E TFM GL   EIRA    +
Sbjct: 847  QEGSLCEQFLAVRQQGTVAAYWREFEILETPLKGISEEVMESTFMNGLLP-EIRAEQRLL 906

Query: 70   GD------------PEERNKFPSYSGAKVPNYTYNTAKTNSVMKEQGNKENTIFPIQTIT 129
                           E+RN   +   A+ PN   +T   ++  + +  K    F  + + 
Sbjct: 907  QPYGLGHLMEMAQRVEDRNL--AMRAAREPNGPKSTKMLSTANRGEW-KIGENFQTRAVA 966

Query: 130  LRGSPAKEIKKDGPSKWLSDAEFQAK-EKGLCFKCDEKYYSGHKCRVKEIRELHMSVVRA 189
            + G      +++ P K L+++E QA+ EKGL FKC+EK+  GH+C+ KE+R L +     
Sbjct: 967  V-GEKTMSQRREIPIKRLTESELQARREKGLWFKCEEKFSPGHRCK-KELRVLLVH---- 1026

Query: 190  DDVEEEIIEEDEYDLKELKTMELQNDLGEVKELCINSVVGLTNPGTMKIRGKVQSKEVVV 249
            +D EE+  + D+   +E   +EL++ +    EL +NSVVGLT PGTMKI+G + SKEV++
Sbjct: 1027 EDEEEDDNQFDDRATEEPALIELKDAV----ELSLNSVVGLTTPGTMKIKGTIGSKEVII 1086

Query: 250  LVDCGATHNFISDRLVMTLKLPTKETSNYGVILGSGIAIKGKGVCEKVELDLNGWTVLEN 309
            LVD GATHNF+S  LV  L LP   T++YGV++G+GI++KGKG+C  V + + G TV+E+
Sbjct: 1087 LVDSGATHNFLSLELVQQLTLPLTTTTSYGVMMGTGISVKGKGICRGVCISMQGLTVVED 1146

Query: 310  FLPLELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIVIKGDPSLTKTQVSLKNL 369
            FLPLELG  DVILGM WL +LG  +++WK LTM        +V+KGDPSL++T+ S    
Sbjct: 1147 FLPLELGNTDVILGMPWLGTLGDVKVNWKMLTMKIKMGKAVMVLKGDPSLSRTETS---- 1206

Query: 370  TKSWTVSDMGYLIECRTLEAHIAEIEPENNNVPESILTALNQYNDVFDWPKELPPRRDIE 429
                T SD+   ++                 VP+++   L Q+  +F+    LPP RDI+
Sbjct: 1207 ----TTSDLSEGVQ----------------EVPKTVKEVLAQHQQIFEPITGLPPSRDID 1266

Query: 430  HHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKKDGS 489
            H I +  GA PVNVRP+RY    K E+++LV EML +GI+RPS SP+SS VLLVKKKDG 
Sbjct: 1267 HAIQLILGASPVNVRPYRYPHILKNEIKRLVQEMLEAGIVRPSLSPFSSPVLLVKKKDGG 1326

Query: 490  WRFYVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTA 549
            WRF +DYRALN +T+PD+FPIPV++EL D+L+GA +FSK+DLK+GYHQ+R+  QDI KTA
Sbjct: 1327 WRFCIDYRALNKVTVPDRFPIPVIDELLDKLHGATIFSKLDLKSGYHQIRVRQQDIPKTA 1386

Query: 550  FRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQH 609
            FRTHEGHYEFLVMPFGLTNAPATFQSLMN IF  +L KFVLVFF DILVYS++L+EHC H
Sbjct: 1387 FRTHEGHYEFLVMPFGLTNAPATFQSLMNRIFWPHLWKFVLVFFYDILVYSKDLKEHCDH 1446

Query: 610  IELVLEVLRRHKLFANRKKCSFAYS---------------------------------KE 669
            ++ VL +L  H+L  N KKC FA                                   KE
Sbjct: 1447 LQTVLSILANHQLHVNGKKCLFAKLQLEYLGHLVSAKGVAADPNKISAMVEWPTPKSLKE 1506

Query: 670  VRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWNEGAQEAFEKLQRAMMTLPILALP 729
            +RGFLGLTGYYR FV+ YG+I+  LTQ LK  +F WN  A+ AF+KL+  M T+P+LALP
Sbjct: 1507 LRGFLGLTGYYRRFVEGYGAISWPLTQELKKDAFNWNLEAEVAFQKLKTTMTTIPVLALP 1566

Query: 730  DFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRDQAKPVYERELMAVVLAVQRWR 789
            +F+  F VE DA GYG+GTVLMQ+ RP+A++S  L  R++ K +YERELMA+VLAVQ+WR
Sbjct: 1567 NFSQLFIVEMDASGYGLGTVLMQSHRPVAYFSQVLTARERQKSIYERELMAIVLAVQKWR 1626

Query: 790  PYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSR 838
             YLLGR FIV+TDQ SLKFLLEQR++   YQKW+AKL GY FE+ ++PG ENKAADALSR
Sbjct: 1627 HYLLGRHFIVRTDQSSLKFLLEQRIVNESYQKWVAKLFGYDFEIQFRPGXENKAADALSR 1682

BLAST of CSPI03G20180 vs. NCBI nr
Match: gi|731338584|ref|XP_010680400.1| (PREDICTED: transposon Tf2-1 polyprotein isoform X1 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 741.1 bits (1912), Expect = 2.2e-210
Identity = 403/904 (44.58%), Positives = 567/904 (62.72%), Query Frame = 1

Query: 12   GSLYGRFLRIQQESSVEEYRNLFDKWVA-----P--------LSDIPEKIVEETFMGG-- 71
            GSLY ++L ++QE SV +Y+  F ++ A     P        +  + E I  E  M G  
Sbjct: 200  GSLYEQWLTVEQEGSVMDYKRRFIEYAAPLENIPESIVMGQFIKGLKENIKAEVHMMGPI 259

Query: 72   ----LRDDEIRANGGTMGDP--EERNKFPSYSGAKVPNYT-----YNTAKTNSVMKEQGN 131
                  D  ++A      +P   +    P+ +    PN +     +N  K  S+   + N
Sbjct: 260  SVDQAMDLALKAEVKINSNPYLNKNRTLPTITPFPTPNRSQISPAHNIIKPTSLTYPRNN 319

Query: 132  KENTIFPIQTITLRGSPAKEIKKDG----PSKWLSDAEFQ-AKEKGLCFKCDEKYYSGHK 191
               T +  Q  T + +  K   ++     P + L++ E Q  +E GLCF+CD+K+  GH+
Sbjct: 320  P--TTYQSQPTTPKITATKNSYQNPRTQLPIRRLTEQELQFRRENGLCFRCDDKWSQGHR 379

Query: 192  CRVKEIRELHMSVVRADDVEEEIIEEDEYDLKELKTMELQNDLGEVKELCINSVVGLTNP 251
            C+ KE+     SV+  +  E+   EE+E ++ +    ++  ++  V EL +NSVVGLT+P
Sbjct: 380  CQKKEV-----SVLVMEGEEDPPPEEEEEEVNDASA-DVSAEVTTV-ELSLNSVVGLTSP 439

Query: 252  GTMKIRGKVQSKEVVVLVDCGATHNFISDRLVMTLKLPTKETSNYGVILGSGIAIKGKGV 311
             TMK+ G +  +EVVV+VD GATHNFIS R V  L +P    +N+GV LG+G  +KGKG 
Sbjct: 440  RTMKLTGVINGQEVVVMVDPGATHNFISLRAVEKLAIPLIGEANFGVSLGTGTMVKGKGE 499

Query: 312  CEKVELDLNGWTVLENFLPLELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFHDNKKIVI 371
            C+ V L++ G  + ENFLPL+LG  D+ILG+QWL  LG    +WK+  M F    +++ +
Sbjct: 500  CQGVMLEIQGLVIRENFLPLDLGNSDIILGVQWLEKLGSVTTNWKSQLMKFKIGREEVTL 559

Query: 372  KGDPSLTKTQVSLKNLTKSWTVSDMGYLIECRTLEAHIAEIEPENN-----NVPESILTA 431
            +GDPSL +T++SLK + ++  +   G L+E   +E    E EP         VP  +   
Sbjct: 560  QGDPSLDRTRISLKAMLRALRIEGQGVLVEMNHIER---EKEPPGKWDIEVEVPRPLQPL 619

Query: 432  LNQYNDVFDWPKELPPRRDIEHHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTSGI 491
            LNQY+ VF+ P  LPP R  EH I +K G+ PV+VRP+RY   QK E+E+LV +ML +GI
Sbjct: 620  LNQYSQVFNMPSGLPPSRGREHSITLKEGSNPVSVRPYRYPHVQKGEIERLVKDMLAAGI 679

Query: 492  IRPSTSPYSSTVLLVKKKDGSWRFYVDYRALNNITIPDKFPIPVVEELFDELNGANLFSK 551
            I+PSTSP+SS VLLVKKKDGSWRF VDYRALN  T+PDK+PIPV++EL DEL G+ +FSK
Sbjct: 680  IQPSTSPFSSPVLLVKKKDGSWRFCVDYRALNKETVPDKYPIPVIDELLDELYGSVVFSK 739

Query: 552  IDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLRKF 611
            +DLK+GYHQ+R+  +DI KTAFRTHEGHYEFLVMPFGLTNAPATFQSLMN +FR +LRKF
Sbjct: 740  LDLKSGYHQIRVRKEDIHKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFRPFLRKF 799

Query: 612  VLVFFDDILVYSRNLEEHCQHIELVLEVLRRHKLFANRKKCSFAYS-------------- 671
            VLVFFDDILVYS + E H  H+E VL +L  + L+AN +KC F                 
Sbjct: 800  VLVFFDDILVYSPDEETHFHHLEQVLHILAENSLYANLEKCEFGRQQVAYLGHVISAQGV 859

Query: 672  -------------------KEVRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWNEG 731
                               +E+RGFLGLTGYYR F+ +Y  +A+ LT  L+  S+ W   
Sbjct: 860  AADMDKIKAMVEWPLPKTIRELRGFLGLTGYYRKFIANYAKVASPLTDQLRKDSYAWTPA 919

Query: 732  AQEAFEKLQRAMMTLPILALPDFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLALRD 791
            A +AFE L++AM+  P+LA+PDF+  F +E DA G+G+G VLMQN RPIAFYSH L  R 
Sbjct: 920  ATQAFEALKKAMVAAPVLAMPDFSQQFVIEADASGFGLGAVLMQNNRPIAFYSHILGPRG 979

Query: 792  QAKPVYERELMAVVLAVQRWRPYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKLLG 846
            + K +YE+ELMA+V+AVQ+WR YLLGR F+++TDQ+SLKF++EQR +  +YQ+W++KL+G
Sbjct: 980  RLKSIYEKELMAIVMAVQKWRHYLLGRRFVIRTDQKSLKFIMEQREVGAEYQRWVSKLMG 1039

BLAST of CSPI03G20180 vs. NCBI nr
Match: gi|923614274|ref|XP_013745228.1| (PREDICTED: uncharacterized protein LOC106447810 [Brassica napus])

HSP 1 Score: 728.0 bits (1878), Expect = 1.9e-206
Identity = 394/841 (46.85%), Positives = 530/841 (63.02%), Query Frame = 1

Query: 47   KIVEETFMGGLRDDEI-----RANGGTMGDPEERNKFPSYSGAKVPNYTYNTAKTNSVMK 106
            K+VE+   GG   +E      + +    G P+ +N  P+       +   N A + S   
Sbjct: 283  KLVEDWSEGGDTTEETSEEKDKTSRSVNGRPQAQNNKPAQQSGNGSSPNKNKAGSGSTTS 342

Query: 107  EQGN----KENTIFPIQTITLRGSPAKEIKKDGPSKWLSDAEFQAKEKGLCFKCDEKYYS 166
            +         N + P         P + +     +KW        K +GLC++CDEKY  
Sbjct: 343  QNNTTTKPNHNRLKP---------PFRRLTPAEVAKW--------KAEGLCYRCDEKYVY 402

Query: 167  GHKCRVKEIRELHMSVVRADDVEEEI----IEEDEYDLKELKTMELQNDLGEVKELCINS 226
             H+C   E+  +   +V  D  E +I    +E DE      +  E+     EV E+ I+S
Sbjct: 403  PHRCSQAELLVI---MVLEDGTEVDISSLAVEMDEMG----EAAEI-----EVAEISISS 462

Query: 227  VVGLTNPGTMKIRGKVQSKEVVVLVDCGATHNFISDRLVMTLKLPTKETSNYGVILGSGI 286
            +VG++   T+K++G V  KEV+VL+D GATHNF+S  L+  L L T +T  Y V+   G+
Sbjct: 463  LVGISTSRTIKLKGTVMGKEVIVLIDSGATHNFVSRELMKQLDLGTDDTQGYSVLTAGGV 522

Query: 287  AIKGKGVCEKVELDLNGWTVLENFLPLELGGVDVILGMQWLHSLGVTEMDWKNLTMSFFH 346
              KG G+C+++E++L G TV+ NFLPLELG  DVI GMQWL +LG  +++WK   + F  
Sbjct: 523  TFKGAGLCKEMEVELQGCTVVSNFLPLELGSADVIWGMQWLETLGNMKVNWKLQILRFKI 582

Query: 347  DNKKIVIKGDPSLTKTQVSLKNLTKSWTVSDMGYLIECRTLEAHIAEIEPENNNVPESIL 406
             + K V++GDP L  +  SLK++ K+        LIE   L+    E E    ++P+ + 
Sbjct: 583  GDNKYVLQGDPGLCCSAASLKSIWKTVQQGGEAMLIEYNGLQL---EEEKGGGSIPQPLQ 642

Query: 407  TALNQYNDVFDWPKELPPRRDIEHHIHIKGGAEPVNVRPHRYAFQQKEEMEKLVDEMLTS 466
              L +Y +VF  P+ LPP R  EH I +K  A PV+VRP RY   Q+EE+EK V  ML++
Sbjct: 643  NILKEYEEVFAEPQGLPPSRGKEHAIVLKTDASPVSVRPFRYPQAQREEIEKQVALMLSA 702

Query: 467  GIIRPSTSPYSSTVLLVKKKDGSWRFYVDYRALNNITIPDKFPIPVVEELFDELNGANLF 526
            GIIR S+SP+SS VLLVKKKDGSWRF VDYRALN +TI D +PIP++++L DEL GA +F
Sbjct: 703  GIIRDSSSPFSSPVLLVKKKDGSWRFCVDYRALNKVTIADSYPIPMIDQLLDELQGAKVF 762

Query: 527  SKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNSIFRSYLR 586
            SK+DLK+GYHQ+ + ++D++KTAFRTH+GHYEFLVMPFGL+NAPATFQSLMN IFRSYLR
Sbjct: 763  SKLDLKSGYHQILVKAEDVQKTAFRTHDGHYEFLVMPFGLSNAPATFQSLMNEIFRSYLR 822

Query: 587  KFVLVFFDDILVYSRNLEEHCQHIELVLEVLRRHKLFANRKKCSFAYS------------ 646
            KFVLVFFDDILVYS+   EH +H+ LVLEVL+   L+ANRKKC F  S            
Sbjct: 823  KFVLVFFDDILVYSQTQSEHEEHLRLVLEVLKEQGLYANRKKCEFGSSRIEYLGHVISAE 882

Query: 647  ---------------------KEVRGFLGLTGYYRHFVQHYGSIAALLTQLLKLGSFKWN 706
                                 KE+RGFLGLTGYYR FVQ YG IA  LT LL+   FKW+
Sbjct: 883  GVAADEGKVRAMLDWMEPKAVKELRGFLGLTGYYRKFVQGYGDIARPLTSLLRKDQFKWS 942

Query: 707  EGAQEAFEKLQRAMMTLPILALPDFNAPFKVETDALGYGVGTVLMQNKRPIAFYSHTLAL 766
              A  AF+KL++AM T+P+LALPDFN  F +E+DA G G+G VLMQ +RPIA++S  L  
Sbjct: 943  GEAALAFQKLKQAMATVPVLALPDFNEQFVIESDASGVGLGAVLMQRQRPIAYFSQALTE 1002

Query: 767  RDQAKPVYERELMAVVLAVQRWRPYLLGRTFIVKTDQRSLKFLLEQRVIQPQYQKWIAKL 826
            R Q K VYERELMA+V A+Q+WR YLLGR F+V+TDQ+SLKFLLEQR I  +YQ+W+ K+
Sbjct: 1003 RQQMKSVYERELMAIVFAIQKWRHYLLGRKFVVRTDQKSLKFLLEQREINMEYQRWLTKI 1062

Query: 827  LGYSFEVVYKPGLENKAADALSRVPPTVHLNQLTAPTLVDIKVIGEEVDKDDYLKDIINQ 842
            LG+ F++ YKPGLENKAADALSR  P   L  ++ P  + ++ +G EV++D  L  +I +
Sbjct: 1063 LGFDFDIHYKPGLENKAADALSRKSPVTELFAVSVPVSIQLEEVGSEVERDSELSKLIQE 1091

BLAST of CSPI03G20180 vs. NCBI nr
Match: gi|922423376|ref|XP_013617706.1| (PREDICTED: uncharacterized protein LOC106324252 [Brassica oleracea var. oleracea])

HSP 1 Score: 712.6 bits (1838), Expect = 8.4e-202
Identity = 374/745 (50.20%), Positives = 506/745 (67.92%), Query Frame = 1

Query: 131  PSKWLSDAEFQAKEK-GLCFKCDEKYYSGHKCRVKEIRELHMSVVRADDVEEEIIEEDEY 190
            P + L+ AE + + + GLCF+CDEK+   H C   E+  +   +V  D  E E+ EE   
Sbjct: 371  PFRRLTPAEIEQRRRDGLCFRCDEKFGYKHVCARAEMLVV---MVMEDGTEIEMAEEQWG 430

Query: 191  DLKELKTMELQNDLGEVKELCINSVVGLTNPGTMKIRGKVQSKEVVVLVDCGATHNFISD 250
            D ++  T ++Q    EV EL +NSVVGL++P TMK+RG +  + VV+L+D GA+HNFIS+
Sbjct: 431  DGEDDPT-KMQ---AEVAELSLNSVVGLSSPKTMKVRGTIHGEAVVILIDNGASHNFISE 490

Query: 251  RLVMTLKLPTKETSNYGVILGSGIAIKGKGVCEKVELDLNGWTVLENFLPLELGGVDVIL 310
            ++V  L L  +  ++YGV++  G  ++G+GV   +EL L G+ V+ +FLPLELG  DVIL
Sbjct: 491  KIVTKLNLQKRAVASYGVMVAGGATLEGQGVIIGLELRLPGYVVVTDFLPLELGIADVIL 550

Query: 311  GMQWLHSLGVTEMDWKNLTMSFFHDNKKIVIKGDPSLTKTQVSLKNLTKSWTVSDMGYLI 370
            G+QWL +LG   ++WK   M +    ++I+++GDPSL    VSLK++ K+      G L+
Sbjct: 551  GVQWLDTLGDVNVNWKLQCMRYHDGEEEIILQGDPSLHSASVSLKSMWKTLQKEGEGVLL 610

Query: 371  ECRTLEAHIAEIEPENNNVPESILTALNQYNDVFDWPKELPPRRDIEHHIHIKGGAEPVN 430
            E   L A    + P     PE +   L QY  VF  P+ LPP R  EH I ++ GA+PV+
Sbjct: 611  EFGGLRASEDVVLPVA--WPEELKEVLEQYTQVFSEPRGLPPSRGREHTIILENGAKPVS 670

Query: 431  VRPHRYAFQQKEEMEKLVDEMLTSGIIRPSTSPYSSTVLLVKKKDGSWRFYVDYRALNNI 490
            +RP RY   QKEE+E+ +  ML +GII+ ++SP+SS VLLV+KKDGSWRF VDYRALN  
Sbjct: 671  IRPFRYPHAQKEEIEQQIASMLAAGIIQETSSPFSSPVLLVRKKDGSWRFCVDYRALNKY 730

Query: 491  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVM 550
            T+ DK+PIP++++L DEL+GA +FSKIDL++GYHQ+R+ ++D+ KTAFRTH+GHYEFLVM
Sbjct: 731  TVADKYPIPMIDQLLDELHGATIFSKIDLRSGYHQIRVRAEDVPKTAFRTHDGHYEFLVM 790

Query: 551  PFGLTNAPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLRRHKL 610
            PFGL+NAPATFQ+LMN IFR YLRKFVLVFFDDIL+YSR++EEH QH+ +VL +L   +L
Sbjct: 791  PFGLSNAPATFQALMNDIFRPYLRKFVLVFFDDILIYSRSVEEHKQHLAIVLMILEEQEL 850

Query: 611  FANRKKCSFAYS---------------------------------KEVRGFLGLTGYYRH 670
            FANRKKC+F  S                                 K +RGFLGLTGYY+ 
Sbjct: 851  FANRKKCTFGSSSVEYLGHIISAEGVAADMNKVKAMMDWAEPRNVKALRGFLGLTGYYQK 910

Query: 671  FVQHYGSIAALLTQLLKLGSFKWNEGAQEAFEKLQRAMMTLPILALPDFNAPFKVETDAL 730
            FV+ YG IA  LT LLK   F W+  A +AF+KL+ AM T+P+LALPDFN  F VE++A 
Sbjct: 911  FVRGYGDIARPLTILLKKEGFVWSVAAGDAFQKLKIAMSTVPVLALPDFNEVFVVESEAS 970

Query: 731  GYGVGTVLMQNKRPIAFYSHTLALRDQAKPVYERELMAVVLAVQRWRPYLLGRTFIVKTD 790
            G G+G VLMQ +RPIA++S  L  R + K VYERELMA+V A+ +WR YLLGR FIV+TD
Sbjct: 971  GVGLGAVLMQGQRPIAYFSQALTERQKLKSVYERELMAIVFALMKWRHYLLGRHFIVRTD 1030

Query: 791  QRSLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPPTVHLNQLTAP 842
            Q+SLKFLLEQR I   YQ+W++K+LG+ FE+ YKPGLENKAADALSR      +  ++ P
Sbjct: 1031 QKSLKFLLEQREINMDYQRWLSKILGFDFEIHYKPGLENKAADALSRKAMVAEVFAVSVP 1090

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
POL2_DROME    2.4e-79    32.52       Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
POL3_DROME    6.9e-79    32.10       Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
POL5_DROME    1.4e-71    35.59       Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast...
POLY_DROME    1.1e-60    34.11       Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogas...
POL4_DROME    5.9e-54    29.34       Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaste...

Match Name          E-value     Identity    Description
A5B2I6_VITVI        4.3e-221    48.63       Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043911 PE=4 SV=1
A0A087G3S6_ARAAL    1.7e-201    50.47       Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs46225U000100 PE=4...
A0A087HFW3_ARAAL    1.7e-201    45.65       Uncharacterized protein OS=Arabis alpina GN=AALP_AA2G074100 PE=4 SV=1
A0A087GEK8_ARAAL    2.2e-201    48.61       Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1
A0A087G291_ARAAL    3.2e-200    49.26       Uncharacterized protein OS=Arabis alpina GN=AALP_AAs48021U000700 PE=4 SV=1

Match Name     E-value    Identity    Description
AT3G29750.1    3.5e-20    30.43       Eukaryotic aspartyl protease family protein
ATMG00860.1    1.0e-19    41.86       DNA/RNA polymerases superfamily protein
AT3G30770.1    3.0e-16    35.42       Eukaryotic aspartyl protease family protein
AT3G42723.1    3.7e-06    35.38       aminoacyl-tRNA ligases;ATP binding;nucleotide binding

Match Name                          E-value     Identity    Description
gi|729344250|ref|XP_010541181.1|    2.8e-229    47.70       PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana]
gi|147854459|emb|CAN78588.1|        6.2e-221    48.63       hypothetical protein VITISV_043911 [Vitis vinifera]
gi|731338584|ref|XP_010680400.1|    2.2e-210    44.58       PREDICTED: transposon Tf2-1 polyprotein isoform X1 [Beta vulgaris subsp. vulgari...
gi|923614274|ref|XP_013745228.1|    1.9e-206    46.85       PREDICTED: uncharacterized protein LOC106447810 [Brassica napus]
gi|922423376|ref|XP_013617706.1|    8.4e-202    50.20       PREDICTED: uncharacterized protein LOC106324252 [Brassica oleracea var. oleracea...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR000477    RT_dom
IPR013242    Retroviral aspartyl protease
IPR021109    Peptidase_aspartic_dom_sf

GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI03G20180.1    CSPI03G20180.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description               Source   Source Term        Source Description   Alignment
IPR000477  Reverse transcriptase domain  PFAM     PF00078            RVT_1                coord: 470..631, score: 7.9
IPR000477  Reverse transcriptase domain  PROFILE  PS50878            RT_POL               coord: 451..632, score: 15
IPR013242  Retroviral aspartyl protease  PFAM     PF08284            RVP_2                coord: 228..316, score: 3.9
IPR021109  Aspartic peptidase domain     unknown  SSF50630           Acid proteases       coord: 220..320, score: 7.1
None       No IPR available              GENE3D   G3DSA:3.10.10.10   -                    coord: 419..549, score: 4.4
None       No IPR available              GENE3D   G3DSA:3.30.70.270  -                    coord: 550..618, score: 8.3
None       No IPR available              PANTHER  PTHR24559          FAMILY NOT NAMED     coord: 463..860, score: 3.5E
None       No IPR available              PANTHER  PTHR24559:SF186    SUBFAMILY NOT NAMED  coord: 463..860, score: 3.5E
None       No IPR available              unknown  SSF56672           DNA/RNA polymerases  coord: 396..790, score: 1.05E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None