Cucsa.163250 (gene) Cucumber (Gy14) v1

Name: Cucsa.163250
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Transposon Ty3-I Gag-Pol polyprotein
Location: scaffold01144 : 2652948 .. 2655509 (+)
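
The location field can be split into scaffold, coordinates, and strand to derive the length of the feature. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its four parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    scaffold, start, end, strand = m.groups()
    return scaffold, int(start), int(end), strand

scaffold, start, end, strand = parse_location(
    "scaffold01144 : 2652948 .. 2655509 (+)"
)
# Assumption: both endpoints are included in the feature.
span = end - start + 1
print(scaffold, strand, span)  # scaffold01144 + 2562
```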
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGTTGAAAGGGGATATAAAAGGGCAAGAAGTGGTGATCCTCATTGATAGTGGAGCCACAAATAACTTCATACATGAGATAGTAGCGGAGGCACAAGGGCTAAACATAGAACCAGGAACACAGTTCGGAGTGACGATTGGAGATGGAACTCGTTGTAAAGGCAAGGGGATTTGTAGAAGAGTGGAACTGAGATTGAAGGAACTGACGATTGTAGCTGACTTCTTGGCAGTAGAATTGGGGAATGTTGATGTAGTTCTAGGAATGCAGTGGTTAGATACCACTGGAACAACGAAGGTACATTGGCCATCCTTAACCATGACCTTCTGGGTTAAGGACAGACAGATTGTATTGAAGGGAGACCCATCTCTAATTAAGGCAGAATGTTCCTTGAAAACACTGGAGAAAACGTGGGATGCTGAAGACCAAGGATTCCTGCTAGAATTCAAGAATTATGGAGTGGAAATAGAGGATAAGTATGACACCGAAACAGAGAAAAGAGGAGATAAGGAAGAATTGTCTATGATCCAATCTTGCTCAAGTACTATTCTGATATTTTTGAAACACCAAAGGACTTGCCTCCAAAAAGGGTCATTGACCACCGCATACTCACTCTACCAGACCATAGACCTATCAACGTAAGACCTTACAAATATGGTCACGTGCAGGAGGAGATTGAAAAGCTAATCACTGAAATGCTTCAGGCAGGGGTGATTAGACCGAGCCACAATCCTTATTCCAGTCCGGTGCTATTAGTAAAGAAGAAGGATGGGGGATGGAGATTTTGTGTTGATTATCGCAAGCGAAACCAAGTGATGGTGTCAGACAAATTTCCAATTCCAGTAATCGAAGAGCTGCTTGATGAGCTAAATGGGGCAGACTGTTTTTTCTAAACTAGATCTGAAATCTGGTTATCACCATATAAGAATGAGAGAGGAAGACAGAAAAAACGGCTTTGCGCACACATGAGGGCCATTACGAGTTCCTTCTCATGCCTTTTGGCCTCACAAATGCCTGGGCTACCTTTCAATTGCTGAAGAATCAGGTATTTAAGCCTTTCCTTAGACGCTACATATTAGTATTTTTTGATGACATACTAGTCTACAGTGCTGATATAGATGAGCATGCTAAGCATTTGGGAATGGTCTTTAACATACTGAAGGATAATCAACCATTTGCTAATAAGAAGAATGTGTTATAGCCCATTCTCAAATATAATACCTCGGGCATTTGATATCTAAAAGAGGGGTAGAGGCAGACGAAGAGAAGATCAGGGATATGATAAACTGGCCACTTCCGAAAGACGTCTCTAGTTTGAGGGGATTTCTTGGTCTCACAGGATACTATAGAAGATTTGTAAAGAGTTATGGAGAGATAGCTGCCCCATTAACCAAGTTACAAAAAAaTGACTTCAAATGGAGTGAGGAAGCAACTACGGCCTTCGAAAGTTTAAAGTTAGCAATGGACCATTCTAATGCTAGCCTTACTCGCTGGAACTCACCATTCACCATTGAGTTGGATGCGTCTGGAATAGGTTTAGGAGCTGTAATCTCCCAAAAAGGCCACCCTATTGCTTTTTTCAGCCAAAAACTATCTCCAAGGGCCCAAACCAAGTCTATTTATGAAAGGGAATTGATGGTTGTAGTTCTTTCCGTCCAAAAATGGAGGCACTATTTACTTGGAAGAAAGTTCACCATAATTTCTGACCAAAAAGCCCTGAAATTTTTATTAGAGCAGAGGGAAGTACAACCCCAATTTCAGAAATGGTTGACGACGCTTCTTGGCTATGATTTTGAAATATTATACCAGCTTGGACTTCAAAACAAGGCAGCGGATGCCCTTTCAAGAATGGAGCAGCCTTTGGAATTGAACAGCATGACAACCACTGGAATTGTAGACGTGGAACTGATTTGTAAGGAGGTTGAAAATGATGAAGAACTTAAAAAAATCATAAGGGAACTGGAAGGAAGCTCAGAAGAAGGAAAGAAGTACCAATGGGT
AAATGGGAGGCTACTATATAAGGGACGGATGGTACTCTCTAAAGTTTCTTCTCTCGTACCGAAAATACTACAGACTTTCCACGATTCTATTCTAGGAGGTCACTTTGGATTTCTACGATGTAAAAAGCTATGTCGAGCAATGTGATGTATGCCAGCGGAACAAATAAGAAGCAGCTAAACCTGCAAGGGTCCTTCAACCAATTCCTATACCAGATCGAATATTAGAAGACTGGACGATGGACTTCATTGAGGGGTTACCCCTCGCCAGAGGAATGAATGTTATTATGGTGGTGGTGGATAGATTGAGTAAATACTCCTACTTCATATCGTTAAGACATCCATTTTCTGCTAAGCAAGTGGCTGCCATATTCATCGATAGAATAGTAAGGAAGCATGGCATTCCCATGTCCATTATCACAGATAGGGATAAAATTTTCCTTAGTAACTTTTGGAAAGAATTGTTTGCAACAATGGGAACTATTTTGAAAATGAGTACAGCATTCCATCCACAAACAGATGGACAGACAAAGAAGGTGAATAGATGCCTTGAAACCTACTTGAG

mRNA sequence

atgaagttgaaaggggatataaaagggcaagaagtggtgatcctcattgatagtggagccacaaataacttcatacatgagatagtagcggaggcacaagggctaaacatagaaccaggaacacagttcggagtgacgattggagatggaactcgttgtaaaggcaaggggatttgtagaagagtggaactgagattgaaggaactgacgattgtagctgacttcttggcagtagaattggggaatgttgatgtagttctaggaatgcagtggttagataccactggaacaacgaaggtacattggccatccttaaccatgaccttctgggttaaggacagacagattgtattgaagggagacccatctctaattaaggcagaatgttccttgaaaacactggagaaaacgtgggatgctgaagaccaaggattcctgctagaattcaagaattatggagtggaaatagaggataaggtcattgaccaccgcatactcactctaccagaccatagacctatcaacgtaagaccttacaaatatggtcacgtgcaggaggagattgaaaagctaatcactgaaatgcttcaggcaggggtgattagaccgagccacaatccttattccagtccggtgctattagtaaagaagaaggatgggggatggagattttgtgttgattatcgcaagcgaaaccaagtgatggtgtcagacaaatttccaattccagtaatcgaagagctgcttgatgagctaaatggggcagactgggtagaggcagacgaagagaagatcagggatatgataaactggccacttccgaaagacgtctctagtttgaggggatttcttggtctcacaggatactatagaagatttgtaaagagttatggagagatagctgccccattaaccaagttacaaaaaaatgacttcaaatggagtgaggaagcaactacggccttcgaaagtttaaagttagcaatggaccattctaatgctagccttactcgctggaactcaccattcaccattgagttggatgcgtctggaataggtttaggagctgtaatctcccaaaaaggccaccctattgcttttttcagccaaaaactatctccaagggcccaaaccaagtctatttatgaaagggaattgatggttgtagttctttccgtccaaaaatggaggcactatttacttggaagaaagttcaccataatttctgaccaaaaagccctgaaatttttattagagcagagggaagtacaaccccaatttcagaaatggttgacgacgcttcttggctatgattttgaaatattataccagcttggacttcaaaacaaggcagcggatgccctttcaagaatggagcagcctttggaattgaacagcatgacaaccactggaattgtagacgtggaactgatttgtaaggaggttgaaaatgatgaagaacttaaaaaaatcataagggaactggaaggaagctcagaagaaggaaagaagtaccaatggactttccacgattctattctaggaggtcactttggattTCTACgatctaaacctgcaagggtccttcaaccaattcctataccagatcgaatattagaagactggacgatggacttcattgaggggttacccctcgccagaggaatgaatgttattatggtggtggtggatagattgagtaaatactcctacttcatatcgttaagacatccattttctgctaagcaagtggctgccatattcatcgatagaatagtaaggaagcatggcattcccatgtccattatcacagatagggataaaattttccttagtaacttttggaaagaattgtttgcaacaatgggaactattttgaaaatgagtacagcattccatccacaaacagatggacagacaaagaaggtgaatagaTGCCTTgaaacctacttgag

Coding sequence (CDS)

ATGAAGTTGAAAGGGGATATAAAAGGGCAAGAAGTGGTGATCCTCATTGATAGTGGAGCCACAAATAACTTCATACATGAGATAGTAGCGGAGGCACAAGGGCTAAACATAGAACCAGGAACACAGTTCGGAGTGACGATTGGAGATGGAACTCGTTGTAAAGGCAAGGGGATTTGTAGAAGAGTGGAACTGAGATTGAAGGAACTGACGATTGTAGCTGACTTCTTGGCAGTAGAATTGGGGAATGTTGATGTAGTTCTAGGAATGCAGTGGTTAGATACCACTGGAACAACGAAGGTACATTGGCCATCCTTAACCATGACCTTCTGGGTTAAGGACAGACAGATTGTATTGAAGGGAGACCCATCTCTAATTAAGGCAGAATGTTCCTTGAAAACACTGGAGAAAACGTGGGATGCTGAAGACCAAGGATTCCTGCTAGAATTCAAGAATTATGGAGTGGAAATAGAGGATAAGGTCATTGACCACCGCATACTCACTCTACCAGACCATAGACCTATCAACGTAAGACCTTACAAATATGGTCACGTGCAGGAGGAGATTGAAAAGCTAATCACTGAAATGCTTCAGGCAGGGGTGATTAGACCGAGCCACAATCCTTATTCCAGTCCGGTGCTATTAGTAAAGAAGAAGGATGGGGGATGGAGATTTTGTGTTGATTATCGCAAGCGAAACCAAGTGATGGTGTCAGACAAATTTCCAATTCCAGTAATCGAAGAGCTGCTTGATGAGCTAAATGGGGCAGACTGGGTAGAGGCAGACGAAGAGAAGATCAGGGATATGATAAACTGGCCACTTCCGAAAGACGTCTCTAGTTTGAGGGGATTTCTTGGTCTCACAGGATACTATAGAAGATTTGTAAAGAGTTATGGAGAGATAGCTGCCCCATTAACCAAGTTACAAAAAAaTGACTTCAAATGGAGTGAGGAAGCAACTACGGCCTTCGAAAGTTTAAAGTTAGCAATGGACCATTCTAATGCTAGCCTTACTCGCTGGAACTCACCATTCACCATTGAGTTGGATGCGTCTGGAATAGGTTTAGGAGCTGTAATCTCCCAAAAAGGCCACCCTATTGCTTTTTTCAGCCAAAAACTATCTCCAAGGGCCCAAACCAAGTCTATTTATGAAAGGGAATTGATGGTTGTAGTTCTTTCCGTCCAAAAATGGAGGCACTATTTACTTGGAAGAAAGTTCACCATAATTTCTGACCAAAAAGCCCTGAAATTTTTATTAGAGCAGAGGGAAGTACAACCCCAATTTCAGAAATGGTTGACGACGCTTCTTGGCTATGATTTTGAAATATTATACCAGCTTGGACTTCAAAACAAGGCAGCGGATGCCCTTTCAAGAATGGAGCAGCCTTTGGAATTGAACAGCATGACAACCACTGGAATTGTAGACGTGGAACTGATTTGTAAGGAGGTTGAAAATGATGAAGAACTTAAAAAAATCATAAGGGAACTGGAAGGAAGCTCAGAAGAAGGAAAGAAGTACCAATGGACTTTCCACGATTCTATTCTAGGAGGTCACTTTGGATTTCTACGATCTAAACCTGCAAGGGTCCTTCAACCAATTCCTATACCAGATCGAATATTAGAAGACTGGACGATGGACTTCATTGAGGGGTTACCCCTCGCCAGAGGAATGAATGTTATTATGGTGGTGGTGGATAGATTGAGTAAATACTCCTACTTCATATCGTTAAGACATCCATTTTCTGCTAAGCAAGTGGCTGCCATATTCATCGATAGAATAGTAAGGAAGCATGGCATTCCCATGTCCATTATCACAGATAGGGATAAAATTTTCCTTAGTAACTTTTGGAAAGAATTGTTTGCAACAATGGGAACTATTTTGAAAATGAGTACAGCATTCCATCCACAAACAGATGGACAGACAAAGAAGGTGAATAGATGCCTTGAAACCTACTTGAG

Protein sequence

MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICRRVELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKGDPSLIKAECSLKTLEKTWDAEDQGFLLEFKNYGVEIEDKVIDHRILTLPDHRPINVRPYKYGHVQEEIEKLITEMLQAGVIRPSHNPYSSPVLLVKKKDGGWRFCVDYRKRNQVMVSDKFPIPVIEELLDELNGADWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTKLQKNDFKWSEEATTAFESLKLAMDHSNASLTRWNSPFTIELDASGIGLGAVISQKGHPIAFFSQKLSPRAQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWLTTLLGYDFEILYQLGLQNKAADALSRMEQPLELNSMTTTGIVDVELICKEVENDEELKKIIRELEGSSEEGKKYQWTFHDSILGGHFGFLRSKPARVLQPIPIPDRILEDWTMDFIEGLPLARGMNVIMVVVDRLSKYSYFISLRHPFSAKQVAAIFIDRIVRKHGIPMSIITDRDKIFLSNFWKELFATMGTILKMSTAFHPQTDGQTKKVNRCLETYLX
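
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code and assuming that the trailing `X` stands for the incomplete final codon (the record does not say how it was produced), the first and last residues can be reproduced as follows. The function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1; unknown or partial codons become 'X'."""
    cds = cds.upper()  # the CDS above contains a lowercase base
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)

# First ten codons of the CDS above.
print(translate("ATGAAGTTGAAAGGGGATATAAAAGGGCAA"))  # MKLKGDIKGQ
# Last eight bases of the CDS above: two codons plus a partial one.
print(translate("TACTTGAG"))  # YLX
```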
BLAST of Cucsa.163250 vs. Swiss-Prot
Match: POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 3.9e-34
Identity = 80/208 (38.46%), Positives = 122/208 (58.65%), Query Frame = 1

Query: 256 DWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTKLQKNDFKWS 315
           D ++ +  K++ ++++P+P     +R FLGLTGYYR+F+ +Y +IA P+T   K   K  
Sbjct: 417 DGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKID 476

Query: 316 E---EATTAFESLK-LAMDHSNASLTRWNSPFTIELDASGIGLGAVISQKGHPIAFFSQK 375
               E   AFE LK L +      L  +   F +  DAS + LGAV+SQ GHPI+F S+ 
Sbjct: 477 TQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRT 536

Query: 376 LSPRAQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWL 435
           L+      S  E+EL+ +V + + +RHYLLGR+F I SD + L++L   +E   + ++W 
Sbjct: 537 LNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWLHNLKEPGAKLERWR 596

Query: 436 TTLLGYDFEILYQLGLQNKAADALSRME 460
             L  Y F+I Y  G +N  ADALSR++
Sbjct: 597 VRLSEYQFKIDYIKGKENSVADALSRIK 624
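
The percentages on each HSP statistics line are the matched counts divided by the alignment length. The following sketch recomputes them from the HSP 1 line above; the helper name `hsp_percentages` is hypothetical and the parsing pattern is an assumption about this page's text layout, not part of any BLAST tool.

```python
import re

def hsp_percentages(stat_line):
    """Recompute 'name = n/total' ratios on an HSP statistics line as percentages."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", stat_line)
    return {name: round(100 * int(n) / int(total), 2) for name, n, total in pairs}

line = "Identity = 80/208 (38.46%), Positives = 122/208 (58.65%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': 38.46, 'Positives': 58.65}
```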


HSP 2 Score: 72.8 bits (177), Expect = 1.6e-11
Identity = 84/347 (24.21%), Positives = 149/347 (42.94%), Query Frame = 1

Query: 8   KGQEVVILIDSGATNNFIHEIV--AEAQGLNIEPGTQFG-VTIGDGTRCKGKGICRRVEL 67
           KG+    L+D+G+T N I+E +     Q    E  T  G +T+ D        I ++ E 
Sbjct: 21  KGRSYKCLLDTGSTINMINENIFCLPIQNSRCEVLTSNGPITLNDLIMLPRNSIFKKTE- 80

Query: 68  RLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFW-----------VKD 127
                     ++     N D+++G + L    +  +++ + T+T +            ++
Sbjct: 81  --------PFYVHRFSNNYDMLIGRKLLKNAQSV-INYKNDTVTLFDQTYKLITSESERN 140

Query: 128 RQIVLKGDPSLIKA--ECSLKTLEKTWDAED----------QGFLLEFKNYGVEIEDKV- 187
           + + ++  P  I +  + S+K L+ +    D          +G L +F+N   +  +K+ 
Sbjct: 141 QNLYIQRTPESIASSDQESIKKLDFSQFRLDHLNQEETFKLKGLLNKFRNLEYKEGEKLT 200

Query: 188 ----IDHRILTLPDHRPINVRPYKYGHVQE-EIEKLITEMLQAGVIRPSHNPYSSPVLLV 247
               I H +L    + PI  + Y      E E+E  + EML  G+IR S++PY+SP  +V
Sbjct: 201 FTNTIKH-VLNTTHNSPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVV 260

Query: 248 KKKDGG-----WRFCVDYRKRNQVMVSDKFPIPVIEELLDEL------------NGADWV 306
            KK        +R  +DYRK N++ + D++PIP ++E+L +L             G   +
Sbjct: 261 PKKPDASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQI 320

BLAST of Cucsa.163250 vs. Swiss-Prot
Match: POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 6.6e-34
Identity = 77/208 (37.02%), Positives = 123/208 (59.13%), Query Frame = 1

Query: 256 DWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTKLQKNDFK-- 315
           D ++ + EKI  +  +P+P     ++ FLGLTGYYR+F+ ++ +IA P+TK  K + K  
Sbjct: 418 DGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKID 477

Query: 316 -WSEEATTAFESLK-LAMDHSNASLTRWNSPFTIELDASGIGLGAVISQKGHPIAFFSQK 375
             + E  +AF+ LK L  +     +  +   FT+  DAS + LGAV+SQ GHP+++ S+ 
Sbjct: 478 TTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSYISRT 537

Query: 376 LSPRAQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWL 435
           L+      S  E+EL+ +V + + +RHYLLGR F I SD + L +L   ++   +  +W 
Sbjct: 538 LNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWR 597

Query: 436 TTLLGYDFEILYQLGLQNKAADALSRME 460
             L  +DF+I Y  G +N  ADALSR++
Sbjct: 598 VKLSEFDFDIKYIKGKENCVADALSRIK 625


HSP 2 Score: 66.2 bits (160), Expect = 1.5e-09
Identity = 33/86 (38.37%), Positives = 54/86 (62.79%), Query Frame = 1

Query: 180 KYGHVQ---EEIEKLITEMLQAGVIRPSHNPYSSPVLLVKKKDGG-----WRFCVDYRKR 239
           KY + Q   +E+E  I +ML  G+IR S++PY+SP+ +V KK        +R  +DYRK 
Sbjct: 212 KYSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKL 271

Query: 240 NQVMVSDKFPIPVIEELLDELNGADW 258
           N++ V D+ PIP ++E+L +L   ++
Sbjct: 272 NEITVGDRHPIPNMDEILGKLGRCNY 297

BLAST of Cucsa.163250 vs. Swiss-Prot
Match: POL5_DROME (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 121.7 bits (304), Expect = 3.0e-26
Identity = 76/242 (31.40%), Positives = 123/242 (50.83%), Query Frame = 1

Query: 247 ELLDELNGADWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTK 306
           E L  +  AD ++AD +K+R +   P P  V  L+ FLG+T YYR+F++ Y ++A PLT 
Sbjct: 325 EFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTN 384

Query: 307 L-------------QKNDFKWSEEATTAFESLKLAMDHSN-ASLTRWNSPFTIELDASGI 366
           L              K      E A  +F  LK  +  S   +   +  PF +  DAS  
Sbjct: 385 LTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNW 444

Query: 367 GLGAVISQ----KGHPIAFFSQKLSPRAQTKSIYERELMVVVLSVQKWRHYLLGR-KFTI 426
            +GAV+SQ    +  PIA+ S+ L+   +  +  E+E++ ++ S+   R YL G     +
Sbjct: 445 AIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKV 504

Query: 427 ISDQKALKFLLEQREVQPQFQKWLTTLLGYDFEILYQLGLQNKAADALSRMEQPLELNSM 470
            +D + L F L  R    + ++W   +  Y+ E++Y+ G  N  ADALSR+  P +LN +
Sbjct: 505 YTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSRI--PPQLNQL 564


HSP 2 Score: 67.0 bits (162), Expect = 8.6e-10
Identity = 35/91 (38.46%), Positives = 56/91 (61.54%), Query Frame = 1

Query: 173 PINVRPYKYG-HVQEEIEKLITEMLQAGVIRPSHNPYSSPVLLVKKK-----DGGWRFCV 232
           PI  + Y Y  +++ E+E+ I E+LQ G+IRPS++PY+SP+ +V KK     +  +R  V
Sbjct: 123 PIYAKSYPYPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVV 182

Query: 233 DYRKRNQVMVSDKFPIPVIEELLDELNGADW 258
           D+++ N V + D +PIP I   L  L  A +
Sbjct: 183 DFKRLNTVTIPDTYPIPDINATLASLGNAKY 213

BLAST of Cucsa.163250 vs. Swiss-Prot
Match: POLY_DROME (Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 1.9e-25
Identity = 71/227 (31.28%), Positives = 119/227 (52.42%), Query Frame = 1

Query: 247 ELLDELNGADWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTK 306
           E L  +   D  ++D EK++ +  +P P  V  +R FLGL  YYR F+K +  IA P+T 
Sbjct: 384 EYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAAIARPITD 443

Query: 307 LQKND-------------FKWSEEATTAFESLK--LAMDHSNASLTRWNSPFTIELDASG 366
           + K +              +++E    AF+ L+  LA +        +  PF +  DAS 
Sbjct: 444 ILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLTTDASA 503

Query: 367 IGLGAVISQKGHPIAFFSQKLSPRAQTKSIYERELMVVVLSVQKWRHYLLG-RKFTIISD 426
            G+GAV+SQ+G PI   S+ L    Q  +  EREL+ +V ++ K +++L G R+  I +D
Sbjct: 504 SGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSREINIFTD 563

Query: 427 QKALKFLLEQREVQPQFQKWLTTLLGYDFEILYQLGLQNKAADALSR 458
            + L F +  R    + ++W + +  ++ ++ Y+ G +N  ADALSR
Sbjct: 564 HQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVADALSR 610


HSP 2 Score: 55.5 bits (132), Expect = 2.6e-06
Identity = 65/273 (23.81%), Positives = 112/273 (41.03%), Query Frame = 1

Query: 7   IKGQEVVILIDSGATNNFIHEIVAEAQGLNIEP-GTQFGVT-IGDGTRCKGKGICRRVEL 66
           + G+ + +LID+ A  N+I  +    +  N+ P  + F V+ I   T  K K + +  + 
Sbjct: 19  LAGRTLKMLIDTDAAKNYIRPV---KELKNVMPVASPFSVSSIHGSTEIKHKCLMKVFK- 78

Query: 67  RLKELTIVADFLAVELGNVDVVLGMQWLDTTGTT---------------KVHWPSLTMTF 126
                 I   FL   L   D ++G+  L   G                 K+H+ S     
Sbjct: 79  -----HISPFFLLDSLNAFDAIIGLDLLTQAGVKLNLAEDSLEYQGIAEKLHYFSCPSVN 138

Query: 127 WVKDRQIVLKGDPSLIKAECSLKTLEKTWDAEDQGFLLEFKNYGVEIEDKVIDHRILTLP 186
           +     IV+   P  +K E     + +          L F           +   I T+ 
Sbjct: 139 FTDVNDIVV---PDSVKKEFKDTIIRRKKAFSTTNEALPFNT--------AVTATIRTV- 198

Query: 187 DHRPINVRPYK-YGHVQEEIEKLITEMLQAGVIRPSHNPYSSPVLLVKKK------DGGW 246
           D+ P+  R Y     V + +   + ++L+ G+IRPS +PY+SP  +V KK      +   
Sbjct: 199 DNEPVYSRAYPTLMGVSDFVNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNK 258

Query: 247 RFCVDYRKRNQVMVSDKFPIPVIEELLDELNGA 256
           R  +D+RK N+  + D++P+P I  +L  L  A
Sbjct: 259 RLVIDFRKLNEKTIPDRYPMPSIPMILANLGKA 270

BLAST of Cucsa.163250 vs. Swiss-Prot
Match: TF25_SCHPO (Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1)

HSP 1 Score: 110.5 bits (275), Expect = 6.8e-23
Identity = 76/266 (28.57%), Positives = 128/266 (48.12%), Query Frame = 1

Query: 262 EEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTKLQKND--FKWSEEAT 321
           +E I  ++ W  PK+   LR FLG   Y R+F+    ++  PL  L K D  +KW+   T
Sbjct: 624 QENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTPTQT 683

Query: 322 TAFESLKLAMDHSNASLTR---WNSPFTIELDASGIGLGAVISQKG-----HPIAFFSQK 381
            A E++K  +   +  + R   ++    +E DAS + +GAV+SQK      +P+ ++S K
Sbjct: 684 QAIENIKQCL--VSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSAK 743

Query: 382 LSPRAQTKSIYERELMVVVLSVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQPQF 441
           +S      S+ ++E++ ++ S++ WRHYL      F I++D + L  +   E      + 
Sbjct: 744 MSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRL 803

Query: 442 QKWLTTLLGYDFEILYQLGLQNKAADALSRMEQPLE----------LNSMTTTGIVD--V 501
            +W   L  ++FEI Y+ G  N  ADALSR+    E          +N +    I D   
Sbjct: 804 ARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISITDDFK 863


HSP 2 Score: 100.9 bits (250), Expect = 5.4e-20
Identity = 52/128 (40.62%), Positives = 71/128 (55.47%), Query Frame = 1

Query: 524  KPARVLQPIPIPDRILEDWTMDFIEGLPLARGMNVIMVVVDRLSKYSYFISLRHPFSAKQ 583
            KP   LQPIP  +R  E  +MDFI  LP + G N + VVVDR SK +  +      +A+Q
Sbjct: 969  KPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQ 1028

Query: 584  VAAIFIDRIVRKHGIPMSIITDRDKIFLSNFWKELFATMGTILKMSTAFHPQTDGQTKKV 643
             A +F  R++   G P  II D D IF S  WK+       ++K S  + PQTDGQT++ 
Sbjct: 1029 TARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERT 1088

Query: 644  NRCLETYL 652
            N+ +E  L
Sbjct: 1089 NQTVEKLL 1096

BLAST of Cucsa.163250 vs. TrEMBL
Match: A0A151R5M0_CAJCA (Transposon Ty3-I Gag-Pol polyprotein OS=Cajanus cajan GN=KK1_041123 PE=4 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 3.2e-136
Identity = 281/686 (40.96%), Positives = 408/686 (59.48%), Query Frame = 1

Query: 1   MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
           M+ +G I+G  + IL+DSG+ +NF+   +A    L +EP + F V +G+G     +G+ +
Sbjct: 1   MRFQGSIQGVSIQILLDSGSYDNFLQPQLANYLKLPVEPISSFQVMVGNGNSLTVEGLIQ 60

Query: 61  RVELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKG 120
            +++ ++  T+      + +   D+VLG  WL T G     + +LT+ F++    I L G
Sbjct: 61  ELKVSVQGHTLTLPVYLLPVSGADLVLGASWLATLGPHISDYSALTLKFYLNGEFITLHG 120

Query: 121 DPSLIKAECS---LKTLEKTWDAEDQGFLLEFK---------------NYGVEIED---- 180
           D S +        ++ +  T    +   L  ++               N+ + +      
Sbjct: 121 DNSKLPTPAQFHHIRRMSHTHAIAESRILHNYRSVFDKPSGLPPDRSHNHQIPLLPDTNL 180

Query: 181 -KVIDHRILTLPDHRPINVRPYKYGHVQ-EEIEKLITEMLQAGVIRPSHNPYSSPVLLVK 240
            KV  ++I  LP   P+ VRPY+Y H Q E+IE ++ EML+ G+I PS++P+SSP+LLVK
Sbjct: 181 VKVRPYQIPLLPGTNPVKVRPYRYPHSQKEQIENMVAEMLKDGIISPSNSPFSSPILLVK 240

Query: 241 KKDGGWRFCVDYRKRNQVMVSDKFPIPVIEELLDELNGADWVEADEEKIRDMINWPLPKD 300
           KKDG WRFC+DYR  N + V D FPIP ++EL+DEL GA +    + +         P+D
Sbjct: 241 KKDGTWRFCIDYRALNTITVKDHFPIPTVDELIDELCGAQYFSKLDLRSGYHQILVAPED 300

Query: 301 ----VSSLRGFLGLTGYYRRFVKSYGEIAAPLTK-LQKNDFKWSEEATTAFESLKLAMDH 360
               V  LRGFLGLTGYYRRF+K Y  IAAPLT  L+K +F W  +AT AF++LK A+  
Sbjct: 301 RFKTVKQLRGFLGLTGYYRRFIKGYASIAAPLTNLLKKANFHWDSQATLAFDNLKKALTE 360

Query: 361 SNA-SLTRWNSPFTIELDASGIGLGAVISQKGHPIAFFSQKLSPRAQTKSIYERELMVVV 420
           +   +L  ++ PF +E DASGIG+GAV+SQ  HP+AFFS+KLSP+ Q +S Y RE   + 
Sbjct: 361 APVLALLDFSKPFILETDASGIGIGAVLSQSQHPLAFFSKKLSPQMQKQSAYTREFHAIT 420

Query: 421 LSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWLTTLLGYDFEILYQLGLQNK 480
            ++ K+RHYL+G KF I +DQK+LK L+EQ    P+ Q WL   LGYDF I Y+ G +N 
Sbjct: 421 TAIAKFRHYLIGHKFVIRTDQKSLKCLMEQPIHTPEQQAWLHKFLGYDFTIEYKPGKENL 480

Query: 481 AADALSR-----MEQPLELNSMTTTGIVDVELICKEVENDEELKKIIRELEGSSEEGKKY 540
            ADALSR       QP + + +   GI         V +      + ++++   ++   Y
Sbjct: 481 VADALSRSYFMAFSQP-QWDFIANLGIART---LARVSSQFYWPGMHQDIKSYVQQCLIY 540

Query: 541 QWTFHDSILGGHFGFLRSKPARVLQPIPIPDRILEDWTMDFIEGLPLARGMNVIMVVVDR 600
           Q     + L          PA +LQP+PIP +I +D  MDFI GLP + G  VIMVV+DR
Sbjct: 541 QQAKSSTTL----------PAGLLQPLPIPQQIWDDLAMDFIVGLPPSYGFTVIMVVIDR 600

Query: 601 LSKYSYFISLRHPFSAKQVAAIFIDRIVRKHGIPMSIITDRDKIFLSNFWKELFATMGTI 652
           LSKY++F  L+  +S+KQVA +F+  IVR HGIP SI++DRD++F SNFW++L    GT 
Sbjct: 601 LSKYAHFCQLKADYSSKQVAEVFMKSIVRLHGIPKSIVSDRDRVFTSNFWQQLCKLSGTT 660

BLAST of Cucsa.163250 vs. TrEMBL
Match: A5C633_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_034528 PE=4 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 8.4e-129
Identity = 279/696 (40.09%), Positives = 405/696 (58.19%), Query Frame = 1

Query: 1    MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
            M++   I   +VV+LIDSG+T+NFI E VA+   L + P   F V + +GT  K +G   
Sbjct: 336  MRITAKIGQHKVVVLIDSGSTHNFISEKVADMLHLPVVPTKPFTVKVANGTPLKCQGRFE 395

Query: 61   RVELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKG 120
             V + L+ +       ++ L  +D+VLG+QWL+   T   +W  LTM F  +++   L+G
Sbjct: 396  HVHVILQGIPFSLTLYSLPLTGLDLVLGVQWLEQLETVVCNWKKLTMEFQWENQTHKLQG 455

Query: 121  DPSLIKAECSLKTLEKTWDAEDQGFLLEFKNYGVEIEDKV-------------------- 180
              +      SLK + K        F +  ++   E++  +                    
Sbjct: 456  TNTQTIQVASLKAVSKELRQGSSMFAICLQSTSNEVQQAIHLDMQQLIKAFEDIFQEPNQ 515

Query: 181  ------IDHRILTLPDHRPINVRPYKYGHVQE-EIEKLITEMLQAGVIRPSHNPYSSPVL 240
                  IDHRI       P+NVRPY+Y + Q+ EIEK + +ML+ G+IR S +P+SSPVL
Sbjct: 516  LPLAREIDHRITLKEGTEPVNVRPYRYAYFQKAEIEKQVXDMLKLGLIRASTSPFSSPVL 575

Query: 241  LVKKKDGGWRFCVDYRKRNQVMVSDKFPIPVIEELLDELNGADW-----VEADEEKIRDM 300
            LVKKKDG WRFC DYR  N V + D+FPIP ++++LDEL+ A +     + A   ++R  
Sbjct: 576  LVKKKDGTWRFCTDYRALNVVTIKDRFPIPTVDDMLDELHRATYFTKLDLRAGYHQVR-- 635

Query: 301  INWP-LPKDVSSLRG----FLGLT-GYYRRFVKSYGEIAAPLTKL-QKNDFKWSEEATTA 360
            ++ P +PK           +L +  GYYR+FV +Y  IA   T L +K  F W+++A TA
Sbjct: 636  VHPPDIPKTAFRTHNGHYEYLVMPFGYYRKFVSNYDIIARAFTNLLKKGXFAWTKDAETA 695

Query: 361  FESLKLAMDHS-NASLTRWNSPFTIELDASGIGLGAVISQKGHPIAFFSQKLSPRAQTKS 420
            F+ LK AM  +   ++  +N PF IE DA G G+G V++Q+G PIAF S+ L    ++ S
Sbjct: 696  FQXLKQAMTSTPTLAMPNFNEPFVIEFDAXGDGIGVVLTQQGKPIAFMSRALGVSKRSWS 755

Query: 421  IYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWLTTLLGYDFE 480
            IY RE++ +V ++Q WR YLLGRKF I +DQ++LK+LLEQR   P  Q+W+  LLGYD+E
Sbjct: 756  IYAREMLAIVHAIQTWRPYLLGRKFYIQTDQRSLKYLLEQRIXTPXQQEWVAKLLGYDYE 815

Query: 481  ILYQLGLQNKAADALSRMEQPLELNSMTTTGIVDVELICKEVENDEELKKIIRELEGSSE 540
            I Y+ G +N AADALSR+     LN++        + I  E      + KI        +
Sbjct: 816  ITYKXGRENSAADALSRVVSSPSLNALFVPQAPLWDEIKAEAIKHPYMDKI--------D 875

Query: 541  EGKKYQWTFHDSILGGHFGFLRSKP-----ARVLQPIPIPDRILEDWTMDFIEGLPLARG 600
            +   +Q T  D +   +    R K      A +LQP+PIP  + +D TMDFIEGLP + G
Sbjct: 876  KLANWQRTVQDYV-SSYDVCQRIKSETLARAGLLQPLPIPCLVWDDITMDFIEGLPTSNG 935

Query: 601  MNVIMVVVDRLSKYSYFISLRHPFSAKQVAAIFIDRIVRKHGIPMSIITDRDKIFLSNFW 652
             N I+VVVDRLSK ++F++L HPF AK V   F++ +V+ HG+P SII+DRD +F+S FW
Sbjct: 936  KNTILVVVDRLSKSAHFLALAHPFXAKMVXEKFVEGVVKLHGMPKSIISDRDXVFMSQFW 995

BLAST of Cucsa.163250 vs. TrEMBL
Match: A0A087G0A8_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AAs43195U000200 PE=4 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 2.6e-114
Identity = 215/437 (49.20%), Positives = 293/437 (67.05%), Query Frame = 1

Query: 247  ELLDELNGADWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTK 306
            E L  +   D V AD +KI+ M++WP PK++ +LRGFLGLTGYYR+FV+ YG+IA PLT 
Sbjct: 782  EYLGHIISGDGVAADPQKIQAMVSWPEPKNIKALRGFLGLTGYYRKFVRGYGDIAKPLTS 841

Query: 307  LQKND-FKWSEEATTAFESLKLAMDHSNA-SLTRWNSPFTIELDASGIGLGAVISQKGHP 366
            L K D F+WSE A+ AF+ LK AM      SL  ++  F +E DASGIGLGAV+ Q+  P
Sbjct: 842  LLKKDQFQWSEAASGAFQQLKQAMTTVPVLSLVDFSELFVVESDASGIGLGAVLMQQQRP 901

Query: 367  IAFFSQKLSPRAQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQ 426
            IA++SQ L+ R + KS+YERELM +V ++Q+WRHYLLGRKF + +DQK+LKFLLEQREV 
Sbjct: 902  IAYYSQALTDRQKLKSVYERELMAIVFAIQRWRHYLLGRKFLVRTDQKSLKFLLEQREVN 961

Query: 427  PQFQKWLTTLLGYDFEILYQLGLQNKAADALSRMEQPLELNSMTTTGIVDVELICKEVEN 486
             ++Q+WLT +LG+DF+I+Y+ GL+NKAADALSR E   +L +++    + +E+I  EV+ 
Sbjct: 962  TEYQQWLTKILGFDFDIVYKPGLENKAADALSRREVMPQLFALSVPAAIQLEMITAEVDK 1021

Query: 487  DEELKKIIRELEGSSEEGKKYQWT-------------------------FHDSILGGHFG 546
            D E +KI  E+ G  +    Y                            FH + +GGH  
Sbjct: 1022 DAESRKIKEEVLGDVDAHPGYSVVQGRLLKQGKLVIPKASPLVGVLLHEFHSTKMGGHGV 1081

Query: 547  FLRSK-----PARVLQPIPIPDRILEDWTMDFIEGLPLARGMNVIMVVVDRLSKYSYFIS 606
              + K     PA +LQP+PIPD++ ED ++DF+EGLP + G +V+MVVVDRLSKY++F+ 
Sbjct: 1082 CQKHKYSSLAPAGLLQPLPIPDKVWEDISLDFVEGLPKSEGFDVVMVVVDRLSKYAHFLK 1141

Query: 607  LRHPFSAKQVAAIFIDRIVRKHGIPMSIITDRDKIFLSNFWKELFATMGTILKMSTAFHP 652
            L+HP+ A  VA +F+  IVR HG P +I++DRDK F   FW EL    GT+L  ST++HP
Sbjct: 1142 LKHPYEASAVALLFVQEIVRLHGFPRTIVSDRDKTFTGRFWSELMKLAGTLLNFSTSYHP 1201

BLAST of Cucsa.163250 vs. TrEMBL
Match: A0A087G0A8_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AAs43195U000200 PE=4 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 1.4e-35
Identity = 98/258 (37.98%), Positives = 143/258 (55.43%), Query Frame = 1

Query: 1   MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
           MK+ G +  +EVV++IDSGA++NFI + + +   L       +GV  G G   KG GICR
Sbjct: 435 MKMTGTVGAEEVVVMIDSGASHNFISQGLVKLLDLKPNRTGNYGVLTGAGVTVKGDGICR 494

Query: 61  RVELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKG 120
            ++L ++ L + ADFL + LG+ D V                  + +     + Q V + 
Sbjct: 495 GLDLMIQGLRVRADFLPLSLGSADAV--------------EHGEMGVVVEYNELQSVKQE 554

Query: 121 DPSLIKAECSLKTLEKTWD--AEDQGFLLEFKNYGVEIEDKVIDHRILTLPDHRPINVRP 180
           + +L       + LE+  +  AE  G              +   H I   P  RP++VRP
Sbjct: 555 ESALPVPVGLQRVLERHPEVFAEPTGLP----------PSRGRAHAINLEPGVRPVSVRP 614

Query: 181 YKYGHVQ-EEIEKLITEMLQAGVIRPSHNPYSSPVLLVKKKDGGWRFCVDYRKRNQVMVS 240
           ++Y   Q EEIE+ +T ML AG+I+ S +P+SSPVLLVKKKDG WRFC+DYR  N+V + 
Sbjct: 615 FRYPQAQKEEIERQVTAMLAAGIIQESGSPFSSPVLLVKKKDGSWRFCIDYRALNKVTIP 668

Query: 241 DKFPIPVIEELLDELNGA 256
             FPIP+I++LLDEL+GA
Sbjct: 675 HSFPIPMIDQLLDELHGA 668


HSP 2 Score: 398.3 bits (1022), Expect = 1.8e-107
Identity = 216/467 (46.25%), Positives = 293/467 (62.74%), Query Frame = 1

Query: 247  ELLDELNGADWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTK 306
            E L  +   + V AD  K++ M++WPLPK++ +LRGFLGLTGYYRRFV+ YG IA PLT 
Sbjct: 813  EYLGHVISGEGVSADPSKLQAMVSWPLPKNIKALRGFLGLTGYYRRFVQGYGSIAKPLTS 872

Query: 307  LQKND-FKWSEEATTAFESLKLAMDHSNA-SLTRWNSPFTIELDASGIGLGAVISQKGHP 366
            L K D F+WSEEAT AFE LK+AM      +L  ++  F +E DASGIGLGAV+ QK  P
Sbjct: 873  LLKKDKFQWSEEATVAFEKLKVAMSTVPVLALVDFSELFVVESDASGIGLGAVLLQKQKP 932

Query: 367  IAFFSQKLSPRAQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQ 426
            +A+FSQ L+ R + KS+YERELM +V ++QKWRHYLLGRKF + +DQK+LKFLLEQREV 
Sbjct: 933  VAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKFLVRTDQKSLKFLLEQREVN 992

Query: 427  PQFQKWLTTLLGYDFEILYQLGLQNKAADALSRMEQPLELNSMTTTGI---------VDV 486
             ++Q+WLT +LG++F+I Y+ GL+NKAADALSR+E   +L +++             VD 
Sbjct: 993  LEYQQWLTKILGFNFDIHYKPGLENKAADALSRVEGLPQLYALSVPAAIQLEEINEEVDR 1052

Query: 487  ELICKEVENDEELKKIIRE----------------LEGSSEEGKKYQWTFHDSILGGHFG 546
              + K+++ +  L                      L   S   K     FH+S +GGH G
Sbjct: 1053 NPVSKKIKEEVLLDASTHSGYSVVQGRLLYNGKLVLPKESYLIKVLLHEFHNSRMGGHGG 1112

Query: 547  FLRSK-----------------------------------PARVLQPIPIPDRILEDWTM 606
             L+++                                   P+ +LQP+PIP ++ ED ++
Sbjct: 1113 VLKTQRHLGALFYWQGMMADIKTFVAECVVCQKHKYSTLAPSGLLQPLPIPTQVWEDISL 1172

Query: 607  DFIEGLPLARGMNVIMVVVDRLSKYSYFISLRHPFSAKQVAAIFIDRIVRKHGIPMSIIT 652
            DF+EGLP + G + I+VVVDRL+KY++FI L+HPF AK++AA+FI  IVR HG P ++++
Sbjct: 1173 DFVEGLPKSEGFDAILVVVDRLTKYAHFIKLQHPFGAKEIAAVFIQEIVRLHGYPSTMVS 1232

BLAST of Cucsa.163250 vs. TrEMBL
Match: A0A087GEK8_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 2.2e-44
Identity = 125/339 (36.87%), Positives = 176/339 (51.92%), Query Frame = 1

Query: 1   MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
           MKL G I+  EVV+LIDSGA++NF+ E +    GL       +GV  G G   +G G+CR
Sbjct: 413 MKLMGTIQTTEVVVLIDSGASHNFVSEQLVHRLGLQSAKTGSYGVLTGGGMTVRGAGVCR 472

Query: 61  RVELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKG 120
            + L L+ L I  DFL +ELG+ DV+LG++WL + G  KV+W    M F +     VL+G
Sbjct: 473 GLVLLLQGLRIRDDFLPLELGSADVILGIKWLSSLGEMKVNWGRQYMRFSLGGETAVLQG 532

Query: 121 DPSLIKAECSLKTLEKTWDAEDQGFLLEFKNYGVEIEDKV------IDHRILTLPDHRPI 180
           DP    +  SLK+L +    +  G L+E+   G++  D+V      +   ++++ D  P 
Sbjct: 533 DPGQGCSAISLKSLMRAVKDQGVGLLVEYN--GLQSLDQVAGFTTEVPQALVSVMDQFPQ 592

Query: 181 NVR-----PYKYGHVQE-----------------------EIEKLITEMLQAGVIRPSHN 240
                   P   G   E                       EIEK +T ML AG+I+ S +
Sbjct: 593 VFEDPQGLPPTRGRAHEINLESGAKAVSVRPFRYPQTQKAEIEKQVTAMLAAGIIQESTS 652

Query: 241 PYSSPVLLVKKKDGGWRFCVDYRKRNQVMVSDKFPIPVIEELLDELNGADWVEADEEKIR 300
            +SSPVLLVKKKDG WRFC+DYR  N+V + D FPIP+I++LLDEL+GA      + K  
Sbjct: 653 TFSSPVLLVKKKDGSWRFCIDYRALNKVTIPDSFPIPMIDQLLDELHGATVFSKLDLKSG 712

Query: 301 DMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLT 306
                  P++V     F    G+Y   V  +G   AP T
Sbjct: 713 YHQILVKPQNVPK-TAFRTHDGHYEFLVMPFGLTNAPTT 748


HSP 2 Score: 385.6 bits (989), Expect = 1.2e-103
Identity = 252/665 (37.89%), Positives = 362/665 (54.44%), Query Frame = 1

Query: 78   VELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQI--------------VLKGDPS 137
            ++LG  D++LGM WL+  G     W    + F  +D+ I               L  + +
Sbjct: 397  LDLGGYDMILGMDWLEQWGEMTCQWKEKWVRFNYQDQLITLQGITKSDKPGLQELSVEQA 456

Query: 138  L-------IKAECSL----KTLEKTWDAEDQGFLLEFKNYGVEIED----KVIDHRILTL 197
            +       I A   L     T   T   E Q  + EF +   + +     + +DH I  +
Sbjct: 457  MRWHRGNDIWATALLVPVHNTKSNTIHPEVQVVIDEFADVFNDPKSLPPTRPLDHAIHLI 516

Query: 198  PDHRPINVRPYKYGHVQ-EEIEKLITEMLQAGVIRPSHNPYSSPVLLVKKKDGGWRFCVD 257
            P   P+NVRPY+Y  +Q +EIEK + EML+AG+I PS +P++SPVLLVKKKDG WRFCVD
Sbjct: 517  PGAVPVNVRPYRYSPLQKDEIEKQVAEMLEAGLITPSVSPFASPVLLVKKKDGTWRFCVD 576

Query: 258  YRKRN------------------------------------QVMVSDKFPIPVIE----- 317
            YRK N                                    QV+  ++F   + +     
Sbjct: 577  YRKLNSITVKRNRKCVVIFMDDILVFSESLEEHVNHLREVFQVLRENQFYAKLSKCTFAQ 636

Query: 318  ---ELLDELNGADWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAP 377
               E L  +   + V  D EK + M++WP+P++V+ L GFLGLTGYYR+FV+ YG IA P
Sbjct: 637  QKLEYLGHIISDEGVATDHEKTKVMMDWPVPQNVTELSGFLGLTGYYRKFVRHYGIIAKP 696

Query: 378  LTKL-QKNDFKWSEEATTAFESLKLAMDHSNA-SLTRWNSPFTIELDASGIGLGAVISQK 437
            LT+L QKN F WS+EA  AF+ LKLAM  +    L  +N  FTIE DA   G+GAV+ Q 
Sbjct: 697  LTQLLQKNSFAWSDEAHLAFDKLKLAMSTTPVLGLPDFNKQFTIETDACSTGIGAVLIQD 756

Query: 438  GHPIAFFSQKLSPRAQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQR 497
             HP+AF+S+ L  + Q  SIYE+E + ++++V+KWR Y+    F I +D ++L  L +Q 
Sbjct: 757  AHPVAFYSKALGIKNQQLSIYEKEFLAIMMAVEKWRAYVQRGPFIIKTDHQSLCQLGDQV 816

Query: 498  EVQPQFQKWLTTLLGYDFEILYQLGLQNKAADALSRMEQPLELNSMTTTGIVDVE----L 557
                   K +T L+G  F+  Y+      A    SR  QP+ L  +  +  VD +    L
Sbjct: 817  LTSDLQGKAMTKLVGLQFQFQYKKVGHLLAITTTSR-SQPVWLQEVLNSYEVDPQAQQLL 876

Query: 558  ICKEVENDE----ELKKIIRELE------GSSEEGKKYQWTFHDSILGGHFG-FLRSKPA 617
                + ND      L++ I +++       +S    K    FH S LGGH       K  
Sbjct: 877  QQLAIANDNVEGFSLQQGIIKMQDRIWIGANSALKTKLISAFHASALGGHSAKHEHCKYP 936

Query: 618  RVLQPIPIPDRILEDWTMDFIEGLPLARGMNVIMVVVDRLSKYSYFISLRHPFSAKQVAA 652
             +L P+P+P+   +   MDF+EGLP + G +VI+VVVDR +KY++FI LRHPFSA  VA 
Sbjct: 937  GLLNPLPVPEGPWQHVAMDFVEGLPKSAGYSVILVVVDRYTKYAHFIPLRHPFSAPVVAK 996

BLAST of Cucsa.163250 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 90.5 bits (223), Expect = 4.1e-18
Identity = 43/73 (58.90%), Positives = 55/73 (75.34%), Query Frame = 1

Query: 258 VEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTK-LQKNDFKWSE 317
           V AD  K+  M+ WP PK+ + LRGFLGLTGYYRRFVK+YG+I  PLT+ L+KN  KW+E
Sbjct: 44  VSADPAKLEAMVGWPEPKNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTE 103

Query: 318 EATTAFESLKLAM 330
            A  AF++LK A+
Sbjct: 104 MAALAFKALKGAV 116
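
The percentages in each HSP summary line are the match counts divided by the alignment length. A minimal sketch, assuming the summary line keeps the layout shown on this page, that parses one line and recomputes both percentages. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical helpers and not part of any BLAST tool; the pattern accepts either spelling of the Positives label.

```python
import re

# Pattern for the HSP summary line as printed on this page (assumed layout).
# "Posi?tives" tolerates both "Positives" and the misspelt "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return match counts and recomputed percentages for one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, aligned, _, positive, pos_aligned, _ = m.groups()
    return {
        "identical": int(identical),
        "aligned": int(aligned),
        "positive": int(positive),
        # Recompute instead of trusting the printed value.
        "identity_pct": 100.0 * int(identical) / int(aligned),
        "positives_pct": 100.0 * int(positive) / int(pos_aligned),
    }


stats = parse_hsp_stats(
    "Identity = 43/73 (58.90%), Postives = 55/73 (75.34%), Query Frame = 1"
)
print(round(stats["identity_pct"], 2), round(stats["positives_pct"], 2))
```

For the ATMG00860.1 hit above, 43 identical and 55 positive positions over 73 aligned columns reproduce the printed 58.90% and 75.34%.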

BLAST of Cucsa.163250 vs. TAIR10
Match: AT3G29750.1 (AT3G29750.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 66.6 bits (161), Expect = 6.4e-11
Identity = 45/148 (30.41%), Positives = 74/148 (50.00%), Query Frame = 1

Query: 1   MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
           M+  G I   +VV+ IDSGAT+NFI   +A +  L      Q  V +G     +  G C 
Sbjct: 124 MRFYGFILDHKVVVAIDSGATDNFILVELAFSLKLPTSITNQASVLLGQRQCIQSVGTCL 183

Query: 61  RVELRLKELTIVADFLAVELG--NVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVL 120
            + L ++E+ I  +FL ++L   +VDV+LG +WL   G T V+W +   +F    + I L
Sbjct: 184 GIRLWVQEVEITENFLLLDLAKTDVDVILGYEWLSKLGETMVNWQNQDFSFSHNQQWITL 243

Query: 121 KGDP---SLIKAECSLKTLEKTWDAEDQ 144
             +      +  +  +K+  +  D E+Q
Sbjct: 244 CAEHEELEQVTTKVKMKSENEQEDIEEQ 271

BLAST of Cucsa.163250 vs. TAIR10
Match: AT3G30770.1 (AT3G30770.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 51.6 bits (122), Expect = 2.1e-06
Identity = 40/152 (26.32%), Positives = 70/152 (46.05%), Query Frame = 1

Query: 1   MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
           M+  G I   +VV++IDSGATNNFI + +A    L      Q  V +G     +  G C 
Sbjct: 284 MRFYGFISCHKVVVVIDSGATNNFISDELALVLKLPTSTTNQASVLLGQRQCIQTIGTCF 343

Query: 61  RVELRLKELTIVADFLAVEL--GNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVL 120
            + L ++E+ I  +FL ++L   +VDV+LG           + W +   +F+   + + L
Sbjct: 344 GINLLVQEVEINENFLLLDLTKTDVDVILGYGGSQNLERQWLIWLNQDFSFFHNQQWVTL 403

Query: 121 KGDPSLIKAECSLKTLEKTWDAEDQGFLLEFK 151
                 ++   +   ++  ++ E     LE K
Sbjct: 404 CAKDKELEQVTTKVKMKSEYEQEKIDHYLEDK 435

BLAST of Cucsa.163250 vs. TAIR10
Match: ATMG00850.1 (ATMG00850.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 51.2 bits (121), Expect = 2.8e-06
Identity = 22/35 (62.86%), Positives = 30/35 (85.71%), Query Frame = 1

Query: 188 IEKLITEMLQAGVIRPSHNPYSSPVLLVKKKDGGW 223
           ++  + EML+A +I+PS +PYSSPVLLV+KKDGGW
Sbjct: 45  LKNWLGEMLEARIIQPSISPYSSPVLLVQKKDGGW 79

BLAST of Cucsa.163250 vs. NCBI nr
Match: gi|1012325802|gb|KYP37665.1| (Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan])

HSP 1 Score: 493.8 bits (1270), Expect = 4.6e-136
Identity = 281/686 (40.96%), Positives = 408/686 (59.48%), Query Frame = 1

Query: 1   MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
           M+ +G I+G  + IL+DSG+ +NF+   +A    L +EP + F V +G+G     +G+ +
Sbjct: 1   MRFQGSIQGVSIQILLDSGSYDNFLQPQLANYLKLPVEPISSFQVMVGNGNSLTVEGLIQ 60

Query: 61  RVELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKG 120
            +++ ++  T+      + +   D+VLG  WL T G     + +LT+ F++    I L G
Sbjct: 61  ELKVSVQGHTLTLPVYLLPVSGADLVLGASWLATLGPHISDYSALTLKFYLNGEFITLHG 120

Query: 121 DPSLIKAECS---LKTLEKTWDAEDQGFLLEFK---------------NYGVEIED---- 180
           D S +        ++ +  T    +   L  ++               N+ + +      
Sbjct: 121 DNSKLPTPAQFHHIRRMSHTHAIAESRILHNYRSVFDKPSGLPPDRSHNHQIPLLPDTNL 180

Query: 181 -KVIDHRILTLPDHRPINVRPYKYGHVQ-EEIEKLITEMLQAGVIRPSHNPYSSPVLLVK 240
            KV  ++I  LP   P+ VRPY+Y H Q E+IE ++ EML+ G+I PS++P+SSP+LLVK
Sbjct: 181 VKVRPYQIPLLPGTNPVKVRPYRYPHSQKEQIENMVAEMLKDGIISPSNSPFSSPILLVK 240

Query: 241 KKDGGWRFCVDYRKRNQVMVSDKFPIPVIEELLDELNGADWVEADEEKIRDMINWPLPKD 300
           KKDG WRFC+DYR  N + V D FPIP ++EL+DEL GA +    + +         P+D
Sbjct: 241 KKDGTWRFCIDYRALNTITVKDHFPIPTVDELIDELCGAQYFSKLDLRSGYHQILVAPED 300

Query: 301 ----VSSLRGFLGLTGYYRRFVKSYGEIAAPLTK-LQKNDFKWSEEATTAFESLKLAMDH 360
               V  LRGFLGLTGYYRRF+K Y  IAAPLT  L+K +F W  +AT AF++LK A+  
Sbjct: 301 RFKTVKQLRGFLGLTGYYRRFIKGYASIAAPLTNLLKKANFHWDSQATLAFDNLKKALTE 360

Query: 361 SNA-SLTRWNSPFTIELDASGIGLGAVISQKGHPIAFFSQKLSPRAQTKSIYERELMVVV 420
           +   +L  ++ PF +E DASGIG+GAV+SQ  HP+AFFS+KLSP+ Q +S Y RE   + 
Sbjct: 361 APVLALLDFSKPFILETDASGIGIGAVLSQSQHPLAFFSKKLSPQMQKQSAYTREFHAIT 420

Query: 421 LSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWLTTLLGYDFEILYQLGLQNK 480
            ++ K+RHYL+G KF I +DQK+LK L+EQ    P+ Q WL   LGYDF I Y+ G +N 
Sbjct: 421 TAIAKFRHYLIGHKFVIRTDQKSLKCLMEQPIHTPEQQAWLHKFLGYDFTIEYKPGKENL 480

Query: 481 AADALSR-----MEQPLELNSMTTTGIVDVELICKEVENDEELKKIIRELEGSSEEGKKY 540
            ADALSR       QP + + +   GI         V +      + ++++   ++   Y
Sbjct: 481 VADALSRSYFMAFSQP-QWDFIANLGIART---LARVSSQFYWPGMHQDIKSYVQQCLIY 540

Query: 541 QWTFHDSILGGHFGFLRSKPARVLQPIPIPDRILEDWTMDFIEGLPLARGMNVIMVVVDR 600
           Q     + L          PA +LQP+PIP +I +D  MDFI GLP + G  VIMVV+DR
Sbjct: 541 QQAKSSTTL----------PAGLLQPLPIPQQIWDDLAMDFIVGLPPSYGFTVIMVVIDR 600

Query: 601 LSKYSYFISLRHPFSAKQVAAIFIDRIVRKHGIPMSIITDRDKIFLSNFWKELFATMGTI 652
           LSKY++F  L+  +S+KQVA +F+  IVR HGIP SI++DRD++F SNFW++L    GT 
Sbjct: 601 LSKYAHFCQLKADYSSKQVAEVFMKSIVRLHGIPKSIVSDRDRVFTSNFWQQLCKLSGTT 660

BLAST of Cucsa.163250 vs. NCBI nr
Match: gi|147860532|emb|CAN81876.1| (hypothetical protein VITISV_034528 [Vitis vinifera])

HSP 1 Score: 469.2 bits (1206), Expect = 1.2e-128
Identity = 279/696 (40.09%), Positives = 405/696 (58.19%), Query Frame = 1

Query: 1    MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
            M++   I   +VV+LIDSG+T+NFI E VA+   L + P   F V + +GT  K +G   
Sbjct: 336  MRITAKIGQHKVVVLIDSGSTHNFISEKVADMLHLPVVPTKPFTVKVANGTPLKCQGRFE 395

Query: 61   RVELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKG 120
             V + L+ +       ++ L  +D+VLG+QWL+   T   +W  LTM F  +++   L+G
Sbjct: 396  HVHVILQGIPFSLTLYSLPLTGLDLVLGVQWLEQLETVVCNWKKLTMEFQWENQTHKLQG 455

Query: 121  DPSLIKAECSLKTLEKTWDAEDQGFLLEFKNYGVEIEDKV-------------------- 180
              +      SLK + K        F +  ++   E++  +                    
Sbjct: 456  TNTQTIQVASLKAVSKELRQGSSMFAICLQSTSNEVQQAIHLDMQQLIKAFEDIFQEPNQ 515

Query: 181  ------IDHRILTLPDHRPINVRPYKYGHVQE-EIEKLITEMLQAGVIRPSHNPYSSPVL 240
                  IDHRI       P+NVRPY+Y + Q+ EIEK + +ML+ G+IR S +P+SSPVL
Sbjct: 516  LPLAREIDHRITLKEGTEPVNVRPYRYAYFQKAEIEKQVXDMLKLGLIRASTSPFSSPVL 575

Query: 241  LVKKKDGGWRFCVDYRKRNQVMVSDKFPIPVIEELLDELNGADW-----VEADEEKIRDM 300
            LVKKKDG WRFC DYR  N V + D+FPIP ++++LDEL+ A +     + A   ++R  
Sbjct: 576  LVKKKDGTWRFCTDYRALNVVTIKDRFPIPTVDDMLDELHRATYFTKLDLRAGYHQVR-- 635

Query: 301  INWP-LPKDVSSLRG----FLGLT-GYYRRFVKSYGEIAAPLTKL-QKNDFKWSEEATTA 360
            ++ P +PK           +L +  GYYR+FV +Y  IA   T L +K  F W+++A TA
Sbjct: 636  VHPPDIPKTAFRTHNGHYEYLVMPFGYYRKFVSNYDIIARAFTNLLKKGXFAWTKDAETA 695

Query: 361  FESLKLAMDHS-NASLTRWNSPFTIELDASGIGLGAVISQKGHPIAFFSQKLSPRAQTKS 420
            F+ LK AM  +   ++  +N PF IE DA G G+G V++Q+G PIAF S+ L    ++ S
Sbjct: 696  FQXLKQAMTSTPTLAMPNFNEPFVIEFDAXGDGIGVVLTQQGKPIAFMSRALGVSKRSWS 755

Query: 421  IYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWLTTLLGYDFE 480
            IY RE++ +V ++Q WR YLLGRKF I +DQ++LK+LLEQR   P  Q+W+  LLGYD+E
Sbjct: 756  IYAREMLAIVHAIQTWRPYLLGRKFYIQTDQRSLKYLLEQRIXTPXQQEWVAKLLGYDYE 815

Query: 481  ILYQLGLQNKAADALSRMEQPLELNSMTTTGIVDVELICKEVENDEELKKIIRELEGSSE 540
            I Y+ G +N AADALSR+     LN++        + I  E      + KI        +
Sbjct: 816  ITYKXGRENSAADALSRVVSSPSLNALFVPQAPLWDEIKAEAIKHPYMDKI--------D 875

Query: 541  EGKKYQWTFHDSILGGHFGFLRSKP-----ARVLQPIPIPDRILEDWTMDFIEGLPLARG 600
            +   +Q T  D +   +    R K      A +LQP+PIP  + +D TMDFIEGLP + G
Sbjct: 876  KLANWQRTVQDYV-SSYDVCQRIKSETLARAGLLQPLPIPCLVWDDITMDFIEGLPTSNG 935

Query: 601  MNVIMVVVDRLSKYSYFISLRHPFSAKQVAAIFIDRIVRKHGIPMSIITDRDKIFLSNFW 652
             N I+VVVDRLSK ++F++L HPF AK V   F++ +V+ HG+P SII+DRD +F+S FW
Sbjct: 936  KNTILVVVDRLSKSAHFLALAHPFXAKMVXEKFVEGVVKLHGMPKSIISDRDXVFMSQFW 995

BLAST of Cucsa.163250 vs. NCBI nr
Match: gi|922560347|ref|XP_013608444.1| (PREDICTED: uncharacterized protein LOC106315243 [Brassica oleracea var. oleracea])

HSP 1 Score: 448.4 bits (1152), Expect = 2.2e-122
Identity = 272/716 (37.99%), Positives = 393/716 (54.89%), Query Frame = 1

Query: 2    KLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICRR 61
            K++G IK QEV++++DSGA++NFI   V     L     +   V +G+G      G+C  
Sbjct: 404  KMRGYIKNQEVIVMLDSGASHNFISPEVVNKLRLKFSADSSLDVLLGNGVTVNALGVCHA 463

Query: 62   VELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKGD 121
            V  +L +    +DF+++EL NVDV+LG+QWL+T G  +V W    ++F     ++ L G+
Sbjct: 464  VTFQLIQTNFTSDFISLELRNVDVILGIQWLETLGVCEVDWREQVLSFVYGGNKVTLLGE 523

Query: 122  PSLIKAECSLKTLEKTWDAEDQGFLL------------EFKNYGVEIEDKVID------- 181
             SL   + S K+L+  + +  +G  +            E +N    I  +  D       
Sbjct: 524  KSLHCTKFSFKSLKPVYTSGKKGGEVPLASSIATSAFPEVRNQLSMILQEYADVFAVPTS 583

Query: 182  --------HRILTLPDHRPINVRPYKYGHVQE-EIEKLITEMLQAGVIRPSHNPYSSPVL 241
                    H I+  P    ++VRPY+Y H  +  +E ++ E+L++G+IRPS + +SSPVL
Sbjct: 584  LPPVRGKEHAIILKPGVSSVSVRPYRYPHASKIAMEDMVNEILRSGIIRPSTSLFSSPVL 643

Query: 242  LVKKKDGGWRFCVDYRKRNQVMVSDKFPIPVIEELLDELNGAD-WVEADEEKIRDMINWP 301
            LVKKKDG  RFCVDYR  N+  V DK+PI VI++LLDEL+GA  + + D       I   
Sbjct: 644  LVKKKDGSLRFCVDYRGLNRATVLDKYPIHVIDQLLDELHGAKVFTKLDLRSGYHQIRM- 703

Query: 302  LPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLT--KLQKNDFK-WSEEATTAF--ESLKL 361
            +  D+     F  + G+Y   V  +G   AP T   L    FK +       F  + L  
Sbjct: 704  MESDIEKT-AFRTVEGHYEFLVMPFGLTNAPATFQALMNQVFKPFLRRFVLVFFDDILIY 763

Query: 362  AMDHS------------NASLTRWNSPFTIELDASGIGLGAVISQKGHPIAFFSQKLSPR 421
            ++DH             N +L  +   F IE DASG GLGAV+ Q   PIAFFS  L+PR
Sbjct: 764  SVDHETHEEHARRVLQDNLALPNFQEVFVIESDASGFGLGAVLMQNKRPIAFFSHALTPR 823

Query: 422  AQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQPQFQKWLTTLL 481
             Q K  YER+LM +V++++KW+HYLLGRKF + +DQ++LKFLLEQ+EV  ++Q+WLT +L
Sbjct: 824  EQMKPAYERKLMAIVMAIRKWKHYLLGRKFHVHTDQRSLKFLLEQKEVNMEYQRWLTKIL 883

Query: 482  GYDFEILYQLGLQNKAADALSR-MEQPLELNSMTTTGIVDVELICKEVENDEELKKIIRE 541
            G+DF+I Y+ G +NK AD LSR M     L ++T   ++  E + KE+ +D  ++  I++
Sbjct: 884  GFDFDIFYKPGPENKVADGLSRSMSVSSLLLALTVPTVLQWEDLYKEIADDTRIQATIKQ 943

Query: 542  LEGSSEEGKKYQWTFHDSILGGHFGFLRS-------------------KPARVLQPIPIP 601
            L       KKYQ            G  +                     PA +LQP+PIP
Sbjct: 944  LLSGELSSKKYQVVDGKLWSKRRLGMYKQIQKYVAACGICQTHKHSTLSPAGLLQPLPIP 1003

Query: 602  DRILEDWTMDFIEGLPLARGMNVIMVVVDRLSKYSYFISLRHPFSAKQVAAIFIDRIVRK 652
            D + +D  MDFIEGLP + G NVI+VV+DRLSK+++FISL+HPF+A  VA  F++ +V+ 
Sbjct: 1004 DLVWDDINMDFIEGLPTSNGFNVILVVIDRLSKFAHFISLKHPFTALDVAKKFVNEVVKL 1063

BLAST of Cucsa.163250 vs. NCBI nr
Match: gi|674229525|gb|KFK23310.1| (hypothetical protein AALP_AAs43195U000200 [Arabis alpina])

HSP 1 Score: 421.0 bits (1081), Expect = 3.8e-114
Identity = 215/437 (49.20%), Positives = 293/437 (67.05%), Query Frame = 1

Query: 247  ELLDELNGADWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTK 306
            E L  +   D V AD +KI+ M++WP PK++ +LRGFLGLTGYYR+FV+ YG+IA PLT 
Sbjct: 782  EYLGHIISGDGVAADPQKIQAMVSWPEPKNIKALRGFLGLTGYYRKFVRGYGDIAKPLTS 841

Query: 307  LQKND-FKWSEEATTAFESLKLAMDHSNA-SLTRWNSPFTIELDASGIGLGAVISQKGHP 366
            L K D F+WSE A+ AF+ LK AM      SL  ++  F +E DASGIGLGAV+ Q+  P
Sbjct: 842  LLKKDQFQWSEAASGAFQQLKQAMTTVPVLSLVDFSELFVVESDASGIGLGAVLMQQQRP 901

Query: 367  IAFFSQKLSPRAQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQ 426
            IA++SQ L+ R + KS+YERELM +V ++Q+WRHYLLGRKF + +DQK+LKFLLEQREV 
Sbjct: 902  IAYYSQALTDRQKLKSVYERELMAIVFAIQRWRHYLLGRKFLVRTDQKSLKFLLEQREVN 961

Query: 427  PQFQKWLTTLLGYDFEILYQLGLQNKAADALSRMEQPLELNSMTTTGIVDVELICKEVEN 486
             ++Q+WLT +LG+DF+I+Y+ GL+NKAADALSR E   +L +++    + +E+I  EV+ 
Sbjct: 962  TEYQQWLTKILGFDFDIVYKPGLENKAADALSRREVMPQLFALSVPAAIQLEMITAEVDK 1021

Query: 487  DEELKKIIRELEGSSEEGKKYQWT-------------------------FHDSILGGHFG 546
            D E +KI  E+ G  +    Y                            FH + +GGH  
Sbjct: 1022 DAESRKIKEEVLGDVDAHPGYSVVQGRLLKQGKLVIPKASPLVGVLLHEFHSTKMGGHGV 1081

Query: 547  FLRSK-----PARVLQPIPIPDRILEDWTMDFIEGLPLARGMNVIMVVVDRLSKYSYFIS 606
              + K     PA +LQP+PIPD++ ED ++DF+EGLP + G +V+MVVVDRLSKY++F+ 
Sbjct: 1082 CQKHKYSSLAPAGLLQPLPIPDKVWEDISLDFVEGLPKSEGFDVVMVVVDRLSKYAHFLK 1141

Query: 607  LRHPFSAKQVAAIFIDRIVRKHGIPMSIITDRDKIFLSNFWKELFATMGTILKMSTAFHP 652
            L+HP+ A  VA +F+  IVR HG P +I++DRDK F   FW EL    GT+L  ST++HP
Sbjct: 1142 LKHPYEASAVALLFVQEIVRLHGFPRTIVSDRDKTFTGRFWSELMKLAGTLLNFSTSYHP 1201

BLAST of Cucsa.163250 vs. NCBI nr
Match: gi|674229525|gb|KFK23310.1| (hypothetical protein AALP_AAs43195U000200 [Arabis alpina])

HSP 1 Score: 159.5 bits (402), Expect = 2.0e-35
Identity = 98/258 (37.98%), Positives = 143/258 (55.43%), Query Frame = 1

Query: 1   MKLKGDIKGQEVVILIDSGATNNFIHEIVAEAQGLNIEPGTQFGVTIGDGTRCKGKGICR 60
           MK+ G +  +EVV++IDSGA++NFI + + +   L       +GV  G G   KG GICR
Sbjct: 435 MKMTGTVGAEEVVVMIDSGASHNFISQGLVKLLDLKPNRTGNYGVLTGAGVTVKGDGICR 494

Query: 61  RVELRLKELTIVADFLAVELGNVDVVLGMQWLDTTGTTKVHWPSLTMTFWVKDRQIVLKG 120
            ++L ++ L + ADFL + LG+ D V                  + +     + Q V + 
Sbjct: 495 GLDLMIQGLRVRADFLPLSLGSADAV--------------EHGEMGVVVEYNELQSVKQE 554

Query: 121 DPSLIKAECSLKTLEKTWD--AEDQGFLLEFKNYGVEIEDKVIDHRILTLPDHRPINVRP 180
           + +L       + LE+  +  AE  G              +   H I   P  RP++VRP
Sbjct: 555 ESALPVPVGLQRVLERHPEVFAEPTGLP----------PSRGRAHAINLEPGVRPVSVRP 614

Query: 181 YKYGHVQ-EEIEKLITEMLQAGVIRPSHNPYSSPVLLVKKKDGGWRFCVDYRKRNQVMVS 240
           ++Y   Q EEIE+ +T ML AG+I+ S +P+SSPVLLVKKKDG WRFC+DYR  N+V + 
Sbjct: 615 FRYPQAQKEEIERQVTAMLAAGIIQESGSPFSSPVLLVKKKDGSWRFCIDYRALNKVTIP 668

Query: 241 DKFPIPVIEELLDELNGA 256
             FPIP+I++LLDEL+GA
Sbjct: 675 HSFPIPMIDQLLDELHGA 668


HSP 2 Score: 407.5 bits (1046), Expect = 4.3e-110
Identity = 224/473 (47.36%), Positives = 291/473 (61.52%), Query Frame = 1

Query: 247  ELLDELNGADWVEADEEKIRDMINWPLPKDVSSLRGFLGLTGYYRRFVKSYGEIAAPLTK 306
            + L  L  A  V  D  K   M+ WP P  V  LRGFLGLTGYYR FV+ YG IA PLT+
Sbjct: 1149 DYLGHLISASGVSTDPSKTAAMMKWPTPGSVKELRGFLGLTGYYRCFVRGYGVIARPLTE 1208

Query: 307  LQKND-FKWSEEATTAFESLKLAMDHSNA-SLTRWNSPFTIELDASGIGLGAVISQKGHP 366
            L + D F+WS +A  AFE+LK AM  +   +L  +   F +E DASG GLGAV+ Q   P
Sbjct: 1209 LLRKDMFEWSAKAQLAFEALKEAMSSAPVLALPNFAKTFVVEADASGYGLGAVLMQDRRP 1268

Query: 367  IAFFSQKLSPRAQTKSIYERELMVVVLSVQKWRHYLLGRKFTIISDQKALKFLLEQREVQ 426
            IAFFS  L+PR Q K +YERELM +VL+VQKW+HYL+GR+FT+ +DQK+LKFLLEQREV 
Sbjct: 1269 IAFFSVGLTPREQLKPVYERELMAIVLAVQKWKHYLMGRRFTVHTDQKSLKFLLEQREVT 1328

Query: 427  PQFQKWLTTLLGYDFEILYQLGLQNKAADALSRM------EQPLELNSMTTTGIVDVELI 486
              +Q+WLT LL YDFEI+Y+ G+ NKAAD LSR+         ++L ++T   ++ ++ I
Sbjct: 1329 MDYQRWLTKLLPYDFEIVYKAGVDNKAADGLSRILHSTGSVSAMDLFAITVPSVIQMQDI 1388

Query: 487  CKEVENDEELKKIIRELEG--------SSEEGKKYQWTF-----------------HDSI 546
             KE+E D E+++ IRE +           ++GK +  T                  H+S 
Sbjct: 1389 FKEIEEDVEIQRRIREFDSLKMASRGFEVKDGKLWFKTKLVIPPTSKFIPLILDVFHNSQ 1448

Query: 547  LGGHFGFLRS-----------------------------------KPARVLQPIPIPDRI 606
             GGH G L++                                    PA +LQP+PIP  +
Sbjct: 1449 FGGHSGVLKTVKRIQLSFYWPRMLRMVRKYVSECAICQTHKSSTLSPAGLLQPLPIPQAV 1508

Query: 607  LEDWTMDFIEGLPLARGMNVIMVVVDRLSKYSYFISLRHPFSAKQVAAIFIDRIVRKHGI 652
              D  MDF+EGLP ++G N I+VVVDRLSKY +FI+L+HPFSA  VA  F+  IVR HG 
Sbjct: 1509 WSDVNMDFVEGLPASQGFNAILVVVDRLSKYGHFIALKHPFSASDVAQKFVAEIVRLHGF 1568

The following BLAST results are available for this feature:
Match Name   E-value   Identity   Description
POL2_DROME   3.9e-34   38.46      Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
POL3_DROME   6.6e-34   37.02      Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
POL5_DROME   3.0e-26   31.40      Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast...
POLY_DROME   1.9e-25   31.28      Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogas...
TF25_SCHPO   6.8e-23   28.57      Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
Match Name         E-value    Identity   Description
A0A151R5M0_CAJCA   3.2e-136   40.96      Transposon Ty3-I Gag-Pol polyprotein OS=Cajanus cajan GN=KK1_041123 PE=4 SV=1
A5C633_VITVI       8.4e-129   40.09      Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_034528 PE=4 SV=1
A0A087G0A8_ARAAL   2.6e-114   49.20      Uncharacterized protein OS=Arabis alpina GN=AALP_AAs43195U000200 PE=4 SV=1
A0A087G0A8_ARAAL   1.4e-35    37.98      Uncharacterized protein OS=Arabis alpina GN=AALP_AAs43195U000200 PE=4 SV=1
A0A087GEK8_ARAAL   2.2e-44    36.87      Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1
Match Name    E-value   Identity   Description
ATMG00860.1   4.1e-18   58.90      ATMG00860.1 DNA/RNA polymerases superfamily protein
AT3G29750.1   6.4e-11   30.41      Eukaryotic aspartyl protease family protein
AT3G30770.1   2.1e-06   26.32      Eukaryotic aspartyl protease family protein
ATMG00850.1   2.8e-06   62.86      ATMG00850.1 DNA/RNA polymerases superfamily protein
Match Name                         E-value    Identity   Description
gi|1012325802|gb|KYP37665.1|       4.6e-136   40.96      Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]
gi|147860532|emb|CAN81876.1|       1.2e-128   40.09      hypothetical protein VITISV_034528 [Vitis vinifera]
gi|922560347|ref|XP_013608444.1|   2.2e-122   37.99      PREDICTED: uncharacterized protein LOC106315243 [Brassica oleracea var. oleracea...
gi|674229525|gb|KFK23310.1|        3.8e-114   49.20      hypothetical protein AALP_AAs43195U000200 [Arabis alpina]
gi|674229525|gb|KFK23310.1|        2.0e-35    37.98      hypothetical protein AALP_AAs43195U000200 [Arabis alpina]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001584   Integrase_cat-core
IPR012337   RNaseH-like_sf
IPR013242   Retroviral aspartyl protease
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Biological Process
Term         Definition
GO:0015074   DNA integration
Vocabulary: Molecular Function
Term         Definition
GO:0003676   nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0015074       DNA integration
cellular_component   GO:0005622       intracellular
molecular_function   GO:0003676       nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cucsa.163250.1   Cucsa.163250.1   mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term    IPR Description               Source    Source Term         Source Description    Alignment
IPR001584   Integrase, catalytic core     PFAM      PF00665             rve                   coord: 539..649, score: 1.
IPR001584   Integrase, catalytic core     PROFILE   PS50994             INTEGRASE             coord: 529..652, score: 19
IPR012337   Ribonuclease H-like domain    GENE3D    G3DSA:3.30.420.10   -                     coord: 534..650, score: 4.5
IPR012337   Ribonuclease H-like domain    unknown   SSF53098            Ribonuclease H-like   coord: 531..651, score: 1.97
IPR013242   Retroviral aspartyl protease  PFAM      PF08284             RVP_2                 coord: 11..93, score: 1.
IPR021109   Aspartic peptidase domain     unknown   SSF50630            Acid proteases        coord: 4..97, score: 5.5
None        No IPR available              GENE3D    G3DSA:3.10.10.10    -                     coord: 169..255, score: 5.7
None        No IPR available              PANTHER   PTHR24559           FAMILY NOT NAMED      coord: 208..651, score: 4.7E
None        No IPR available              PANTHER   PTHR24559:SF186     SUBFAMILY NOT NAMED   coord: 208..651, score: 4.7E
None        No IPR available              unknown   SSF56672            DNA/RNA polymerases   coord: 159..443, score: 3.34
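
The coord fields above give each hit's position on the protein, so sorting by start coordinate lays out the domain order along the sequence. A minimal sketch, assuming the coordinates are 1-based residue positions; the tuples are copied by hand from the table, and `by_position` is a hypothetical helper name.

```python
# (start, end, label) copied from the "coord:" fields of the InterPro table.
# Labels combine the Source Term and, where given, the Source Description.
domains = [
    (539, 649, "PF00665 rve"),
    (529, 652, "PS50994 INTEGRASE"),
    (534, 650, "G3DSA:3.30.420.10"),
    (531, 651, "SSF53098 Ribonuclease H-like"),
    (11, 93, "PF08284 RVP_2"),
    (4, 97, "SSF50630 Acid proteases"),
    (169, 255, "G3DSA:3.10.10.10"),
    (208, 651, "PTHR24559 FAMILY NOT NAMED"),
    (208, 651, "PTHR24559:SF186 SUBFAMILY NOT NAMED"),
    (159, 443, "SSF56672 DNA/RNA polymerases"),
]


def by_position(hits):
    """Order hits by start coordinate, then by end coordinate."""
    return sorted(hits, key=lambda hit: (hit[0], hit[1]))


ordered = by_position(domains)
for start, end, label in ordered:
    print(f"{start:>3}..{end:<3}  {label}")
```

Read in this order, the hits run from the aspartic protease signatures near the start of the protein, through the DNA/RNA polymerase region, to the integrase and Ribonuclease H-like signatures at the end.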

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None