Cucsa.291820 (gene) Cucumber (Gy14) v1

NameCucsa.291820
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationscaffold02766 : 424791 .. 429439 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTAATAAAAGTTGCAGATGATGCTAAGAAATGCATAGTCTATGAAGCTGTTGAAGGAGATATGCTAAACTATTACAAAATGCTTAGAGCAAGGGTTGAAGCTGTTAATGGAAGATCAAACGAAATAGGAGAAAGCTTTGCAGAATGGACTGTTGAGTTTGAAAAGGCTGATGAAAATGTGCCTTTACCTCAAACCCACTTGGATTTGTTTGTCGAAATGTCCAAAGCCGTCGATGCTTATTGTTTTTCCTGTTCAAAATGAAACATGCAATATGTGAGAATTGACTTATGAGGCTTTAGAACAAATTAAAAGAGGTGAGTTAAAATGTTAAATTGTGTTTAATTTGTGGTATGAGTTAAGCTCGTGAAATGTTGTAATCTAAATCATCCAAATGTGAAGTAGAATACACATCAAGAAAGTTATGGGTCTTCCAATAAATTTGTTTTATTTCCTCGCTCTCAAATATTCTATTTGGTGATTATTTATTTGGGGTGTTTATCATCACTCGCTTCTAACAAAGTGGTATTAGAGCTTGAGTTTTGAGTTTTAAAAATTTGTTTTTACAAATGGAAAATTAAAAAaTAATTTAGTTTCATTCCAAGTTCCTCGACTTACGAAAGAAAATTATAGCAGTTGGTGTATTCGAATGAAAGCTCTACTTGGTTCACAAAATGTGTGGGACATTATTAATAATGGTTATGAAGAGCCAGAAAGTGATGCAACTTTGAGCCAAGCTCAATGAGAAACTTTAAAAAaTACAAGGAAAAAAGATCAAAAGGCTTTCACCATCATTCATTCATACATTGATGATTCAAATTTTGAAAAAAaTTCTGGAGCAACTACTACACATCAAGCATGGCAAATTTTGGAGAATACGTATAAAGAAGTAGATCGAGTCAAGAAGGTTCGCATTCAAAAATTAAGAGTTGATTATAAATCATTGCTTATGAAGGAGTCTTAATCAGTTTCAGATTATACTTCAAAATTGCTAGCAGTAGTAAATGAAATGAAAAGATCCTATGGTGAGATAATAAGTGATGAGCAAGTAGTAGAAAAGATACTTTGCTCATTGAATGAGAAATTCAATTTCATTGTTGTAGCTATTGAAAAGTCAAAGGATTTGAGTTCAATGTCCACTGATCAACTTATGGGTTCTTTACAAGCCTATAAAAAGAAACTTCTTAAAAAGAACAAGCATACAATTGAACAACTTTTTCAGTCAAAGATGCAATAAAAaGACAAGACAGGCTAAAAAaGGGCAATAGAGGTCGAGGACGTGGTGGCAATCATGGACGTGGTGATTTCAGAGGTCGTGGTCAAGGAAACTAAGGTCAAAGAAAATTTTATGAGAGTAATTCAAGCTCAAGTTCATCAAGAGATCGTGGAAGACAATATTATTCGAGATCAAATGGAGAAAGATCAAATAATGACAGACGGTATGACAAAAGATAGGTTAAATGTTATAATTGTCATAAATAAAACCATTATTCCTGAGAATGTAGAAATAGGGTTGAAGAAAATGCAAATTATGCTAAGAAAGATGAAGAAAGAGGCGATTCATCATTGCTTCTAGCATGCAAATGTGTGAAAACATGTGAAAACAATGCATGGTATCTCGATACTGGTGCAAGCAATCAAATGTGTGGAAGTAAATCGATGTTCATGGAGCTTGATGAACCTATTGGTGGTGATATTGTATTTGGTGATGCCACAAAAAATTCAATTAAAGGAAAATGTAAGATTTTGATCCATTTGAAGAATGAGAAGCATGAGTTTATCATTAATGTTTATTTTGTGCCTAATATAAAAAaCAACATTTTGAGTTTGAGACAACTCTTAGAGAAAGGTTATAATATTTTGATGAAGGATTATAGTCTTTTGATAAGAGATAATCATGACAATATGATTGCTAAAAAGTGCAAATGACGAAAAATAGAATGTTTTTATTAAACATTCAAACTGATGGTGCTAAATGTTTAAACTCATGTTTGAAAGATCCAAATTGGAGATGACACTTGAGATTTGGGCATTTAAACTTTGATGGCTTGAAACTATTAGTCAGGAAGGACAAGGTGAAAGGGTTGTCATATGTCAAACATCCAGATCAATTCTGTGAAGGTTGTCTTTATGGCAAACAATCAAGGAAGAGTTTTCCACAAGAATCATCTTAAAGAGCAAGGAGACCACTAAAGTTAGTTCACACTGGTCTTTGTGGACCAATCAAACCAAATTCTTTCGGTAAGAATAATTATTTCTTATTATTTATTGATGCTTTTAACCGAAAAATTTGAGTTTATTTTGTCAAGGAGAAATCAGAAGTATTTGGCATGTTTAAGAGATTTAAAGCCCTTGTTGGAAAAGAAAGTGGTTATTACATTAAAGCTTTGAGATCATATAGGAAAGGTGAATTCACTTCAAATGAATTCAAAACTTTTTGTGCAGAAAATGGAATTCATCGACCTATGACAGTTCCATTTACTCCTACATAAAATGGTGTTTTTGAGAGAAAGAACCAAACAATACTTAACATGACTCGAATCATGTTGAAGGGCAAGAAGATACCAAAAGAATTTTGAGCGTAGGTTATTGAATGGGCAGTGTACTTGTCAAATCGTTTCCCTACTAGAAGCTTATGGAAAAAACCTCCTAAGCAAGCACAAACGGGAAGAAAACCATCCATTGCTCATTTGAGGGTATTTGGATGTATCACATATGCACATATGCTTGATCAAAAGCTTAGCAAGCTAGATCATAAAAGTGAGAAACATGTTTTTATTAGCTATGATGCAAGCTCAAAAAACTACCAGCTTTATAATTCTATTACAAAGAAGACGATGGTAAGTAGAAATGTTTTATTTGATGAAGAAGCATCATGGAATTGGAAATGATGAAGAAGCATCATGGAATTGGAATGCAACAAGACGACAAATTTTTGCTTTTTCTTGATGATCAAGATGAGCCTAGTGACATTATTGCTTCTACATCTACACCACCAACATCGCTAATCACTCCACAACAAAGCACATCTTCATCATCTACAAGTTCAAGTGAAGGACCTCGTGGCATGAGAAGTTTTCGAGACATATATGATGAAACTGAAGTTAAGTCCATGTTTTAATAACCTTACTCTCTTTTGTCTATTTGATGACAGTCAACCTTTTAGTTTTGAAGAAACTTCACAAAATTATAAATGAAAGATTTCTATGGATGAAGAGATAAAAACCATAAAAAAGAATGATGCGTGAGAACTTTCTACTCTTCCAAATGGAAAGAAAGCCGTAGGTGTCAAATGGGTGTTCAAGATTAAAAGAAATGAAAAGGGAGAAGTGGAGAGATACAAAATAAGATTAGTTGTAAAAGATTATTCTCAAAGAAAAGGCATTGATTATGAAGTATTTGCTCCTGTTGCTCGTTTGGAAATTATAAGGTTGTTAGTTGTTCTTGCTGCTCAAAATAATTGGAAGATCTTTTAGATTGATGTCAAATCAACATTTTTGAATGGATATCTAGAAGAAGAAGTCTACTTAGAACAACcTCTTGGTTATTATGTGAGAGGTCAAGAGGATAAAGTTCTAAAATTGAAGAAGACATTATACGGATTGAAACAAGCACCAAGAATGTGGAATACCAAAATCAACAAATATTTCCTTGATAATGGGTATTTGAGGTGCCCTTATGAACATTATCTTTCTATTAAGACTATTGGTCATAGAGATACTTTGGCGGTTTGTTTGTATGTGGATGACTTAATTTTTATAGGAAATTGTGCACATATGTTTGAAGATCTCAAGAAGTCGATTTCCCAAGAATTTAAAATGACAAATATCGGGTTGATGTCATATTATCTTGACATTGTGGTGGAGCAATCAAAGGAAGGTATTTTCATCTTTCAAGAACAATATACTAGAGAAATTCTAGAGAAGTTCAATATGATCAATTCTAAGACTGTCACAACTTGAATTGAAACTGGGACCAAACTGTCCAAATATGAAGAAGGACTAGATGTTGATCCTTCATATTTCAAAAGTTTGGTTGGGAGTTTGAGATATTTGACTTGCACACAACCATATATTCTTTTTAGCGTTCTAATGGTGAGTCAATTTATGCAATCTCCTACAACTACTCATTTAAAAGTGGCAAAGAGAATTCTTCGTTACCTCAAAGGTACGCTTGATTATGAGTTATTTTATTCTTCATCTAAATAATTCAAGCTTGAAGGCTATTATGATAGTGAAGGGGCTGGAGATACTAACGATCGAAAAAGCACTAGTGGATTTATTTTCTTCATTGGTAATATTGCATTTACTTGGAGTTTTAAGAAGCATCCTATTGTGACATTATCTACTTGTGAGACAGAATACGTTGCTGCAACTTCATGTGTCTGTCATGTCATTTGGTTAAGAAATTTGTTAAATACAGTTGGATTTTGCAAGATGATCCAATTCTGATCCATGTAGACAATAAGTCAACAATTGCTCTAGCAAAGAATCATGTGTTCCATGATCGTCTCAAACACATTGATACAAGATTTCATTTCATCAGAGGTTGCATTTCAAGGAAGAAGGTTCAAGTTGAATACGTGAAGACTGAAGATCAAATTGCAGACATTTTCATGAAGCCACTCAAAGCTAATCACTCCACATAA

mRNA sequence

ATGGTTAATAAAAGTTtttcattccaagttcctcgacttacgaaagaaaattatagcagttggtgtattcgaatgaaagctctacttggttcacaaaatgtgtgggacattattaataatggttatgaagagccagaaaaaactttaaaaaatacaaggaaaaaagatcaaaaggctttcaccatcattcattcatacattgatgattcaaattttgaaaaaaattctggagcaactactacacatcaagcatggcaaattttggagaatacgtataaagaagtagatcgagtcaagaagataataagtgatgagcaagtagtagaaaagatactttgctcattgaatgagaaattcaatttcattgttgtagctattgaaaagtcaaaggatttgagttcaatgtccactgatcaacttatgggttctttacaagcctataaaaagaaacttcttaaaaagaacaagcatacaattgaacaactttttcaaaatagggttgaagaaaatgcaaattatgctaagaaagatgaagaaagaggcgattcatcattgcttctagcatgcaaatgtgtgaaaacatgtgaaaacaatgcatggtatctcgatactggtgcaagcaatcaaatgtgtggaagtaaatcgatgttcatggagcttgatgaacctattggtggtgatattgtatttggtgatgccacaaaaaattcaattaaaggaaaatgtaagattttgatccatttgaagaatgagaagcatgagtttatcattaatgtttattttgtgcctaatataaaaaacaacattttgagtttgagacaactcttagagaaaggttataatattttgatgaaggattatatttattttgtcaaggagaaatcagaagtatttggcatgtttaagagatttaaagcccttgttggaaaagaaagtggttattacattaaagctttgagatcatataggaaaggtgaattcacttcaaatgaattcaaaactttttgtgcagaaaatggaattcatcgacctatgacagttattgaatgggcagtgtacttgtcaaatcgtttccctactagaagcttatggaaaaaacctcctaagcaagcacaaacgggaagaaaaccatccattgctcatttgagggtatttggatgtatcacatatgcacatatgcttgatcaaaagcttagcaagctagatcataaaagtgagaaacatgtttttattagctatgatgcaagctcaaaaaactaccagctttataattctattacaaagaagacgatgaagcatcatggaattggaatgcaacaagacgacaaatttttgctttttcttgatgatcaagatgagcctagtgacattattgcttctacatctacaccaccaacatcgctaatcactccacaacaaagcacatcttcatcatctacaagttcaagtgaaggacctcgtggcatgagaagttttcgagacatatatgatgaaactgaaaaagccgtaggtgtcaaatgggtgttcaagattaaaagaaatgaaaagggagaagtggagagatacaaaataagattagttgtaaaagattattctcaaagaaaaggcattgattatgaagtatttgctcctgttgctcgtttggaaattataaggttgttagttattgatgtcaaatcaacatttttgaatggatatctagaagaagaagtctacttagaacaacctcttggttattatgtgagaggtcaagaggataaagttctaaaattgaagaagacattatacggattgaaacaagcaccaagaatgtggaataccaaaatcaacaaatatttccttgataatgggtatttgaggtgcccttatgaacattatctttctattaagactattggtcatagagatactttggcggtttgtttgtatgtggatgacttaatttttataggaaattgtgcacatatgtttgaagatctcaagaagtcgatttcccaagaatttaaaatgacaaatatcgggttgatgtcatattatcttgacattgtggtggagcaatcaaaggaaggtattttcatctttcaagaacaatatactagagaaattctagagaagttcaatatgatcaattctaagactcttgaaggctattatgatagtgaaggggctggagatactaacgatcgaaaaagcactagtggatttattttcttcattggtaatattgcatttacttggagttttaagaagcatcctattgtgacattatctacTTGTGAGACAGAATACGTTGCTGCAACTTCATGTGTCTATGATCCAATTCTGATCCATGTAGACAATAAGTCAACAATTGCTCTAGCAAAGAATCATGTGTTCCATGATCGTCTCAAACACATTGATACAAGATTTCATTTCATCAGAGGTTGCATTTCAAGGAAGAAGGTTCAAGTTGAATACGTGAAGACTGAAGATCAAATTGCAGACATTTTCATGAAGCCACTCAAAGCTAATCACTCCAcataa

Coding sequence (CDS)

ATGGTTAATAAAAGTTTTTCATTCCAAGTTCCTCGACTTACGAAAGAAAATTATAGCAGTTGGTGTATTCGAATGAAAGCTCTACTTGGTTCACAAAATGTGTGGGACATTATTAATAATGGTTATGAAGAGCCAGAAAAAACTTTAAAAAaTACAAGGAAAAAAGATCAAAAGGCTTTCACCATCATTCATTCATACATTGATGATTCAAATTTTGAAAAAAaTTCTGGAGCAACTACTACACATCAAGCATGGCAAATTTTGGAGAATACGTATAAAGAAGTAGATCGAGTCAAGAAGATAATAAGTGATGAGCAAGTAGTAGAAAAGATACTTTGCTCATTGAATGAGAAATTCAATTTCATTGTTGTAGCTATTGAAAAGTCAAAGGATTTGAGTTCAATGTCCACTGATCAACTTATGGGTTCTTTACAAGCCTATAAAAAGAAACTTCTTAAAAAGAACAAGCATACAATTGAACAACTTTTTCAAAATAGGGTTGAAGAAAATGCAAATTATGCTAAGAAAGATGAAGAAAGAGGCGATTCATCATTGCTTCTAGCATGCAAATGTGTGAAAACATGTGAAAACAATGCATGGTATCTCGATACTGGTGCAAGCAATCAAATGTGTGGAAGTAAATCGATGTTCATGGAGCTTGATGAACCTATTGGTGGTGATATTGTATTTGGTGATGCCACAAAAAATTCAATTAAAGGAAAATGTAAGATTTTGATCCATTTGAAGAATGAGAAGCATGAGTTTATCATTAATGTTTATTTTGTGCCTAATATAAAAAaCAACATTTTGAGTTTGAGACAACTCTTAGAGAAAGGTTATAATATTTTGATGAAGGATTATATTTATTTTGTCAAGGAGAAATCAGAAGTATTTGGCATGTTTAAGAGATTTAAAGCCCTTGTTGGAAAAGAAAGTGGTTATTACATTAAAGCTTTGAGATCATATAGGAAAGGTGAATTCACTTCAAATGAATTCAAAACTTTTTGTGCAGAAAATGGAATTCATCGACCTATGACAGTTATTGAATGGGCAGTGTACTTGTCAAATCGTTTCCCTACTAGAAGCTTATGGAAAAAACCTCCTAAGCAAGCACAAACGGGAAGAAAACCATCCATTGCTCATTTGAGGGTATTTGGATGTATCACATATGCACATATGCTTGATCAAAAGCTTAGCAAGCTAGATCATAAAAGTGAGAAACATGTTTTTATTAGCTATGATGCAAGCTCAAAAAACTACCAGCTTTATAATTCTATTACAAAGAAGACGATGAAGCATCATGGAATTGGAATGCAACAAGACGACAAATTTTTGCTTTTTCTTGATGATCAAGATGAGCCTAGTGACATTATTGCTTCTACATCTACACCACCAACATCGCTAATCACTCCACAACAAAGCACATCTTCATCATCTACAAGTTCAAGTGAAGGACCTCGTGGCATGAGAAGTTTTCGAGACATATATGATGAAACTGAAAAAGCCGTAGGTGTCAAATGGGTGTTCAAGATTAAAAGAAATGAAAAGGGAGAAGTGGAGAGATACAAAATAAGATTAGTTGTAAAAGATTATTCTCAAAGAAAAGGCATTGATTATGAAGTATTTGCTCCTGTTGCTCGTTTGGAAATTATAAGGTTGTTAGTTATTGATGTCAAATCAACATTTTTGAATGGATATCTAGAAGAAGAAGTCTACTTAGAACAACcTCTTGGTTATTATGTGAGAGGTCAAGAGGATAAAGTTCTAAAATTGAAGAAGACATTATACGGATTGAAACAAGCACCAAGAATGTGGAATACCAAAATCAACAAATATTTCCTTGATAATGGGTATTTGAGGTGCCCTTATGAACATTATCTTTCTATTAAGACTATTGGTCATAGAGATACTTTGGCGGTTTGTTTGTATGTGGATGACTTAATTTTTATAGGAAATTGTGCACATATGTTTGAAGATCTCAAGAAGTCGATTTCCCAAGAATTTAAAATGACAAATATCGGGTTGATGTCATATTATCTTGACATTGTGGTGGAGCAATCAAAGGAAGGTATTTTCATCTTTCAAGAACAATATACTAGAGAAATTCTAGAGAAGTTCAATATGATCAATTCTAAGACTCTTGAAGGCTATTATGATAGTGAAGGGGCTGGAGATACTAACGATCGAAAAAGCACTAGTGGATTTATTTTCTTCATTGGTAATATTGCATTTACTTGGAGTTTTAAGAAGCATCCTATTGTGACATTATCTACTTGTGAGACAGAATACGTTGCTGCAACTTCATGTGTCTATGATCCAATTCTGATCCATGTAGACAATAAGTCAACAATTGCTCTAGCAAAGAATCATGTGTTCCATGATCGTCTCAAACACATTGATACAAGATTTCATTTCATCAGAGGTTGCATTTCAAGGAAGAAGGTTCAAGTTGAATACGTGAAGACTGAAGATCAAATTGCAGACATTTTCATGAAGCCACTCAAAGCTAATCACTCCACATAA

Protein sequence

MVNKSFSFQVPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEEPEKTLKNTRKKDQKAFTIIHSYIDDSNFEKNSGATTTHQAWQILENTYKEVDRVKKIISDEQVVEKILCSLNEKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQLFQNRVEENANYAKKDEERGDSSLLLACKCVKTCENNAWYLDTGASNQMCGSKSMFMELDEPIGGDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLRQLLEKGYNILMKDYIYFVKEKSEVFGMFKRFKALVGKESGYYIKALRSYRKGEFTSNEFKTFCAENGIHRPMTVIEWAVYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEKHVFISYDASSKNYQLYNSITKKTMKHHGIGMQQDDKFLLFLDDQDEPSDIIASTSTPPTSLITPQQSTSSSSTSSSEGPRGMRSFRDIYDETEKAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDYEVFAPVARLEIIRLLVIDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPRMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKKSISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTREILEKFNMINSKTLEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVAATSCVYDPILIHVDNKSTIALAKNHVFHDRLKHIDTRFHFIRGCISRKKVQVEYVKTEDQIADIFMKPLKANHST*
BLAST of Cucsa.291820 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 1.5e-38
Identity = 98/291 (33.68%), Postives = 156/291 (53.61%), Query Frame = 1

Query: 443  KFLLFLDDQDEPSDIIASTSTPPTSLITPQQSTSSSSTSSS------EGPRGMRSFRDIY 502
            +++L  DD+ EP  +    S P  + +         S   +      E P+G R  +   
Sbjct: 801  EYVLISDDR-EPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLK--- 860

Query: 503  DETEKAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDY-EVFAPVARLEIIRLLV 562
                     KWVFK+K++   ++ RYK RLVVK + Q+KGID+ E+F+PV ++  IR ++
Sbjct: 861  --------CKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTIL 920

Query: 563  ------------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPR 622
                        +DVK+ FL+G LEEE+Y+EQP G+ V G++  V KL K+LYGLKQAPR
Sbjct: 921  SLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPR 980

Query: 623  MWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKKS 682
             W  K + +     YL+   +  +  K     + + + LYVDD++ +G    +   LK  
Sbjct: 981  QWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGD 1040

Query: 683  ISQEFKMTNIGLMSYYL--DIVVEQSKEGIFIFQEQYTREILEKFNMINSK 713
            +S+ F M ++G     L   IV E++   +++ QE+Y   +LE+FNM N+K
Sbjct: 1041 LSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAK 1079


HSP 2 Score: 90.5 bits (223), Expect = 9.4e-17
Identity = 48/140 (34.29%), Postives = 78/140 (55.71%), Query Frame = 1

Query: 714  LEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVAATSCVYDP 773
            L+GY D++ AGD ++RKS++G++F     A +W  K    V LST E EY+AAT    + 
Sbjct: 1175 LKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEM 1234

Query: 774  I----------------LIHVDNKSTIALAKNHVFHDRLKHIDTRFHFIRGCISRKKVQV 833
            I                +++ D++S I L+KN ++H R KHID R+H+IR  +  + ++V
Sbjct: 1235 IWLKRFLQELGLHQKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKV 1294

Query: 834  EYVKTEDQIADIFMKPLKAN 838
              + T +  AD+  K +  N
Sbjct: 1295 LKISTNENPADMLTKVVPRN 1314


HSP 3 Score: 68.6 bits (166), Expect = 3.8e-10
Identity = 38/138 (27.54%), Postives = 73/138 (52.90%), Query Frame = 1

Query: 348 IEWAVYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEK 407
           ++ A YL NR P+  L  + P++  T ++ S +HL+VFGC  +AH+  ++ +KLD KS  
Sbjct: 614 VQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIP 673

Query: 408 HVFISYDASSKNYQLYNSITKKTMKHHGIGMQQDDKFLLFLDDQDEPSDIIASTSTPPTS 467
            +FI Y      Y+L++ + KK ++   +  ++ +        +   + II +  T P++
Sbjct: 674 CIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPNFVTIPST 733

Query: 468 LITPQQSTSSSSTSSSEG 486
              P  + S++   S +G
Sbjct: 734 SNNPTSAESTTDEVSEQG 751


HSP 4 Score: 61.2 bits (147), Expect = 6.1e-08
Identity = 28/63 (44.44%), Postives = 43/63 (68.25%), Query Frame = 1

Query: 285 KDYIYFVKEKSEVFGMFKRFKALVGKESGYYIKALRSYRKGEFTSNEFKTFCAENGIHRP 344
           K ++Y +K K +VF +F++F ALV +E+G  +K LRS   GE+TS EF+ +C+ +GI   
Sbjct: 513 KLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHE 572

Query: 345 MTV 348
            TV
Sbjct: 573 KTV 575

BLAST of Cucsa.291820 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 136.7 bits (343), Expect = 1.1e-30
Identity = 77/227 (33.92%), Postives = 124/227 (54.63%), Query Frame = 1

Query: 498  ETEKAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDY-EVFAPVARLEIIRLLV- 557
            E +  V  +WVF +K NE G   RYK RLV + ++Q+  IDY E FAPVAR+   R ++ 
Sbjct: 930  ENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILS 989

Query: 558  -----------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPRM 617
                       +DVK+ FLNG L+EE+Y+  P G  +    D V KL K +YGLKQA R 
Sbjct: 990  LVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQG--ISCNSDNVCKLNKAIYGLKQAARC 1049

Query: 618  WNTKINKYFLDNGYLRCPYEHYLSIKTIGH-RDTLAVCLYVDDLIFIGNCAHMFEDLKKS 677
            W     +   +  ++    +  + I   G+  + + V LYVDD++          + K+ 
Sbjct: 1050 WFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRY 1109

Query: 678  ISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTREILEKFNMIN 711
            + ++F+MT++  + +++ I +E  ++ I++ Q  Y ++IL KFNM N
Sbjct: 1110 LMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMEN 1154


HSP 2 Score: 94.0 bits (232), Expect = 8.5e-18
Identity = 54/154 (35.06%), Postives = 82/154 (53.25%), Query Frame = 1

Query: 701  EILEKFNMINSKTLEGYYDSEGAGDTNDRKSTSGFIFFIGNI-AFTWSFKKHPIVTLSTC 760
            +++ K N+     + GY DS+ AG   DRKST+G++F + +     W+ K+   V  S+ 
Sbjct: 1235 KLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASST 1294

Query: 761  ETEYVAATSCVY-----------------DPILIHVDNKSTIALAKNHVFHDRLKHIDTR 820
            E EY+A    V                  +PI I+ DN+  I++A N   H R KHID +
Sbjct: 1295 EAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIK 1354

Query: 821  FHFIRGCISRKKVQVEYVKTEDQIADIFMKPLKA 837
            +HF R  +    + +EY+ TE+Q+ADIF KPL A
Sbjct: 1355 YHFAREQVQNNVICLEYIPTENQLADIFTKPLPA 1388


HSP 3 Score: 55.8 bits (133), Expect = 2.6e-06
Identity = 30/81 (37.04%), Postives = 47/81 (58.02%), Query Frame = 1

Query: 351 AVYLSNRFPTRSLW--KKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEKH 410
           A YL NR P+R+L    K P +    +KP + HLRVFG   Y H +  K  K D KS K 
Sbjct: 617 ATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVH-IKNKQGKFDDKSFKS 676

Query: 411 VFISYDASSKNYQLYNSITKK 430
           +F+ Y+ +   ++L++++ +K
Sbjct: 677 IFVGYEPN--GFKLWDAVNEK 694


HSP 4 Score: 40.4 bits (93), Expect = 1.1e-01
Identity = 26/79 (32.91%), Postives = 40/79 (50.63%), Query Frame = 1

Query: 277 EKGYNILMKDYI------YFVKEKSEVFGMFKRFKALVGKESGYYIKALRSY--RKGEFT 336
           +K Y ++  D        Y +K KS+VF MF+ F A    E+ + +K +  Y     E+ 
Sbjct: 499 DKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVA--KSEAHFNLKVVYLYIDNGREYL 558

Query: 337 SNEFKTFCAENGIHRPMTV 348
           SNE + FC + GI   +TV
Sbjct: 559 SNEMRQFCVKKGISYHLTV 575

BLAST of Cucsa.291820 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 80.5 bits (197), Expect = 9.7e-14
Identity = 41/135 (30.37%), Postives = 74/135 (54.81%), Query Frame = 1

Query: 556 IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPRMWNTKINKYFLD 615
           +DV + FLN  ++E +Y++QP G+      D V +L   +YGLKQAP +WN  IN     
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 616 NGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKKSISQEFKMTNIGL 675
            G+ R   EH L  ++      + + +YVDDL+       +++ +K+ +++ + M ++G 
Sbjct: 61  IGFCRHEGEHGLYFRSTSD-GPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGK 120

Query: 676 MSYYLDIVVEQSKEG 691
           +  +L + + QS  G
Sbjct: 121 VDKFLGLNIHQSSNG 134

BLAST of Cucsa.291820 vs. Swiss-Prot
Match: YN12B_YEAST (Transposon Ty1-NL2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-NL2 PE=3 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 3.6e-08
Identity = 33/121 (27.27%), Postives = 63/121 (52.07%), Query Frame = 1

Query: 556  IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPRMWNTKINKYFLD 615
            +D+ S +L   ++EE+Y+  P      G  DK+++LKK+LYGLKQ+   W   I  Y ++
Sbjct: 1339 LDISSAYLYADIKEELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIE 1398

Query: 616  NGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKKSISQEF--KMTNI 675
                +C  E       +     + +CL+VDD+I      +  + +  ++ +++  K+ N+
Sbjct: 1399 ----QCDMEEVRGWSCVFKNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINL 1452

BLAST of Cucsa.291820 vs. Swiss-Prot
Match: YL14B_YEAST (Transposon Ty1-LR4 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY1B-LR4 PE=5 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 4.7e-08
Identity = 41/160 (25.62%), Postives = 74/160 (46.25%), Query Frame = 1

Query: 556  IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPRMWNTKINKYFLD 615
            +D+ S +L   ++EE+Y+  P      G  DK+++LKK+LYGLKQ+   W   I  Y + 
Sbjct: 1345 LDISSAYLYADIKEELYIRPPPHL---GMNDKLIRLKKSLYGLKQSGANWYETIKSYLIQ 1404

Query: 616  NGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKKSISQEF--KMTNI 675
                +C  E       +     + +CL+VDD++      +  + + + +  ++  K+ N+
Sbjct: 1405 ----QCGMEEVRGWSCVFKNSQVTICLFVDDMVLFSKNLNSNKRIIEKLKMQYDTKIINL 1464

Query: 676  GLMSYYLDIVVEQSKEGIFIFQEQYTREILEKFNMINSKT 714
            G          E+ +  I   + +Y R    K  M NS T
Sbjct: 1465 GESD-------EEIQYDILGLEIKYQRGKYMKLGMENSLT 1490

BLAST of Cucsa.291820 vs. TrEMBL
Match: Q9C536_ARATH (Copia-type polyprotein, putative OS=Arabidopsis thaliana GN=T18I24.5 PE=4 SV=1)

HSP 1 Score: 370.2 bits (949), Expect = 6.9e-99
Identity = 207/407 (50.86%), Postives = 262/407 (64.37%), Query Frame = 1

Query: 351  AVYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEKHVF 410
            AVYL NR PT+S+  K P++A +GRKP ++HLRVFG I +AH+ D+K SKLD KSEK++F
Sbjct: 663  AVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIF 722

Query: 411  ISYDASSKNYQLYNSITKKTMKHHGIGMQQDDK------------FLLFLDDQDEPS--- 470
            I YD +SK Y+LYN  TKKT+    I   ++ +            F  F +D+ EP+   
Sbjct: 723  IGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPEPTREE 782

Query: 471  DIIASTSTPPTSLITPQQSTSSSSTSSSEGPRGMRSFRDIYDET---------------- 530
                  +TPPTS  + Q           E     +++R+  DE                 
Sbjct: 783  PPSEEPTTPPTSPTSSQIEEKCEPMDFQEAIE-KKTWRNAMDEEIKSIQKNDTWELTSLP 842

Query: 531  --EKAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDY-EVFAPVARLEIIRLLV- 590
               KA+GVKWV+K K+N KGEVERYK RLV K YSQR GIDY EVFAPVARLE +RL++ 
Sbjct: 843  NGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEVFAPVARLETVRLIIS 902

Query: 591  -----------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPRM 650
                       +DVKS FLNG LEEEVY+EQP GY V+G+EDKVL+LKK LYGLKQAPR 
Sbjct: 903  LAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRA 962

Query: 651  WNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKKSI 710
            WNT+I+KYF +  +++CPYEH L IK I   D L  CLYVDDLIF GN   MFE+ KK +
Sbjct: 963  WNTRIDKYFKEKDFIKCPYEHALYIK-IQKEDILIACLYVDDLIFTGNNPSMFEEFKKEM 1022

Query: 711  SQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTREILEKFNMINS 712
            ++EF+MT+IGLMSYYL I V+Q   GIFI QE Y +E+L+KF M +S
Sbjct: 1023 TKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKMDDS 1067

BLAST of Cucsa.291820 vs. TrEMBL
Match: Q9C536_ARATH (Copia-type polyprotein, putative OS=Arabidopsis thaliana GN=T18I24.5 PE=4 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 1.1e-40
Identity = 89/139 (64.03%), Postives = 101/139 (72.66%), Query Frame = 1

Query: 714  LEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVAATSCVY-- 773
            L GY DS+  GD +DRKSTSGF+F+IG+ AFTW  KK PIVTLSTCE EYVAATSCV   
Sbjct: 1158 LVGYSDSDWGGDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHA 1217

Query: 774  ---------------DPILIHVDNKSTIALAKNHVFHDRLKHIDTRFHFIRGCISRKKVQ 833
                           +P  I VDNKS IALAKN VFHDR KHIDTR+H+IR C+S+K VQ
Sbjct: 1218 IWLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQ 1277

Query: 834  VEYVKTEDQIADIFMKPLK 836
            +EYVKT DQ+ADIF KPLK
Sbjct: 1278 LEYVKTHDQVADIFTKPLK 1296


HSP 2 Score: 134.8 bits (338), Expect = 4.8e-28
Identity = 65/121 (53.72%), Postives = 86/121 (71.07%), Query Frame = 1

Query: 166 RVEENANYAKKDEERGDSSLLLACKCVKTCENNAWYLDTGASNQMCGSKSMFMELDEPIG 225
           + EE ANY ++  +  D  L+ + K  +  EN+ WYLD+GASN MCG KSMF ELDE + 
Sbjct: 301 KFEEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDESVR 360

Query: 226 GDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLRQLLEKGYNILMK 285
           G++  GD +K  +KGK  ILI LKN  H+FI NVY++P++K NILSL QLLEKGY+I +K
Sbjct: 361 GNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLK 420

Query: 286 D 287
           D
Sbjct: 421 D 421


HSP 3 Score: 99.0 bits (245), Expect = 2.9e-17
Identity = 46/111 (41.44%), Postives = 71/111 (63.96%), Query Frame = 1

Query: 1   MVNKSFSFQVPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEEPEKT----------LK 60
           M + +  FQVP LTK NY +W +RMKA+LG+ +VW+I+  G+ EPE            L+
Sbjct: 1   MASNNVPFQVPVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLR 60

Query: 61  NTRKKDQKAFTIIHSYIDDSNFEKNSGATTTHQAWQILENTYKEVDRVKKI 102
           ++RK+D+KA  +I+  +D+  FEK   AT+  +AW+ L  +YK  D+VKK+
Sbjct: 61  DSRKRDKKALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKV 111


HSP 4 Score: 81.3 bits (199), Expect = 6.4e-12
Identity = 39/63 (61.90%), Postives = 48/63 (76.19%), Query Frame = 1

Query: 285 KDYIYFVKEKSEVFGMFKRFKALVGKESGYYIKALRSYRKGEFTSNEFKTFCAENGIHRP 344
           K ++YF+KEKSEVF +FK+FKA V KESG  IK +RS R GEFTS EF  +C +NGI R 
Sbjct: 559 KTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQ 618

Query: 345 MTV 348
           +TV
Sbjct: 619 LTV 621


HSP 5 Score: 61.2 bits (147), Expect = 6.8e-06
Identity = 30/71 (42.25%), Postives = 50/71 (70.42%), Query Frame = 1

Query: 102 ISDEQVVEKILCSLNEKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQ 161
           + D +++EK+L SL+ KF  IV  IE++KDL +M+ +QL+GSLQAY++K  KK +  +EQ
Sbjct: 152 LDDVRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEK-KKKKEDIVEQ 211

Query: 162 LFQNRVEENAN 173
           +   ++ +  N
Sbjct: 212 VLNMQITKEEN 221


HSP 6 Score: 316.2 bits (809), Expect = 1.2e-82
Identity = 184/493 (37.32%), Postives = 266/493 (53.96%), Query Frame = 1

Query: 287 YIYFVKEKSEVFGMFKRFKALVGKESGYYIKALRSYRKGEFTSNEFKTFCAENGIHRPMT 346
           ++YF+KEKS    +FK+FKA+V  +S   IK LRS + GE+ S EF+ +C   GI R +T
Sbjct: 502 WVYFLKEKSAALEIFKKFKAMVENQSNRKIKVLRSDQGGEYISKEFEKYCENAGIRRQLT 561

Query: 347 --------------------------------------VIEWAVYLSNRFPTRSLWKKPP 406
                                                  +  A+Y+ NR PT+++  + P
Sbjct: 562 AGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSFWAEAVNTAIYILNRSPTKAVPNRTP 621

Query: 407 KQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEKHVFISYDASSKNYQLYNSITK 466
            +A  G+KP I H+RVFGCI YA +  QK  K D+KS++ +F+ Y    K Y+LYN   K
Sbjct: 622 FEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNKSDRCIFVGYADGIKGYRLYNLEKK 681

Query: 467 KTMKHHGIGMQQDDKFLLFLDDQDEPSDIIASTSTPPTSLITPQQSTSSSSTSSSEGPRG 526
           K +    +   +   +               +  +P  S++ PQ     S   + +    
Sbjct: 682 KIIISRDVIFDESATW---------------NWKSPEASIVEPQ-----SFQEAEKHDNW 741

Query: 527 MRSFRDIYDETEK--------------AVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQR 586
           +++  D     EK               +GVKWV+K K N  G V++YK RLV K + Q+
Sbjct: 742 IKAMEDEIHMIEKNNTWELVDRPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGFKQK 801

Query: 587 KGID-YEVFAPVARLEIIRLLV------------IDVKSTFLNGYLEEEVYLEQPLGYYV 646
            GID YE +APVARLE IR ++            +DVKS FLNGYL+EE+Y+EQP G+ V
Sbjct: 802 PGIDYYETYAPVARLETIRTIIALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQPEGFSV 861

Query: 647 RGQEDKVLKLKKTLYGLKQAPRMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVC 706
           +G E+KV +LKK LYGLKQAPR W ++I+KYF+  G+ +   E  L +   G  D L V 
Sbjct: 862 QGGENKVFRLKKALYGLKQAPRAWYSQIDKYFIQKGFAKSISEPTLYVNKTG-TDILIVS 921

Query: 707 LYVDDLIFIGNCAHMFEDLKKSISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTRE 715
           LYVDDLI+ GN   M +D KK +   ++M+++GL+ Y+L + V QS EGIFI Q +Y + 
Sbjct: 922 LYVDDLIYTGNSEKMMQDFKKDMMHTYEMSDLGLLHYFLGMEVHQSDEGIFISQRKYAKN 973

BLAST of Cucsa.291820 vs. TrEMBL
Match: Q2QLK1_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=LOC_Os12g44200 PE=4 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 6.1e-23
Identity = 63/171 (36.84%), Postives = 105/171 (61.40%), Query Frame = 1

Query: 10  VPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEE----------PEKTLKNTRKKDQKA 69
           VP    ENY  W I+M+ LL SQ +WDI+ NGY+E           +K+L   R  D KA
Sbjct: 6   VPVFAGENYDIWSIKMRTLLLSQGLWDIVENGYQEYSAGETLTAEQKKSLAEDRMSDAKA 65

Query: 70  FTIIHSYIDDSNFEKNSGATTTHQAWQILENTYKEVDRVK---KIISDEQVVEKILCSLN 129
             +I   + +S F +  GA  + +AW  L+  ++   +++   + I+D++VVEKIL SL 
Sbjct: 66  LFLIQQGVAESLFPRIIGAKKSKEAWDKLKEEFQGSQKMRLYGEDINDQKVVEKILISLP 125

Query: 130 EKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQLFQNRV 168
           EK+ +IV AIE+SKD+ +++  QLM SL++++++ L++   +IE  FQ+++
Sbjct: 126 EKYEYIVAAIEESKDMLTLTIQQLMSSLESHEERKLQREGSSIENAFQSKL 176


HSP 2 Score: 88.6 bits (218), Expect = 4.0e-14
Identity = 50/146 (34.25%), Postives = 77/146 (52.74%), Query Frame = 1

Query: 706  FNMINSKTLEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVA 765
            +  +    L GY DS+ AG  +D KSTS + F +G+                  E EYVA
Sbjct: 1053 YKPVKESKLIGYTDSDWAGCLDDMKSTSSYAFSLGS-----------------AEAEYVA 1112

Query: 766  ATSCV-----------------YDPILIHVDNKSTIALAKNHVFHDRLKHIDTRFHFIRG 825
            A+  V                 Y P  I+ D+KS IA+++N V HDR KHI  ++H+IR 
Sbjct: 1113 ASKAVSQVVWLRRIMEDLGEKQYQPTTIYCDSKSAIAISENPVSHDRTKHIAIKYHYIRE 1172

Query: 826  CISRKKVQVEYVKTEDQIADIFMKPL 835
             + R++V++++ +T++Q+ADIF K L
Sbjct: 1173 AVDRQEVKLKFCRTDEQLADIFTKAL 1181


HSP 3 Score: 67.8 bits (164), Expect = 7.3e-08
Identity = 36/134 (26.87%), Postives = 71/134 (52.99%), Query Frame = 1

Query: 154 KNKHTIEQLFQNRVEENANYAKKDEERGDSSLLLACKCVKTCENNAWYLDTGASNQMCGS 213
           K K  I +  + R    AN++++ E+     ++ +C   +  +++ W +D+G +N M   
Sbjct: 233 KRKGHIAKYCRTREINRANFSQEKEK--SEEMVFSCHTAQEEKDDVWVIDSGCTNHMAAD 292

Query: 214 KSMFMELDEPIGGDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLR 273
            ++F E+D      I  G+ +    +GK  + +   +   +FI +V  VP++K N+LS+ 
Sbjct: 293 PNLFREMDSSYHAKIHMGNGSIAQSEGKGTVAVQTADGP-KFIKDVLLVPDLKQNLLSIG 352

Query: 274 QLLEKGYNILMKDY 288
           QLLE GY +  +D+
Sbjct: 353 QLLEHGYAVYFEDF 363


HSP 4 Score: 314.3 bits (804), Expect = 4.5e-82
Identity = 219/703 (31.15%), Postives = 337/703 (47.94%), Query Frame = 1

Query: 253  HEFIINVYFVPNIKNNILSLRQLLEKGYNILMKDYIYFVKEKSEVFGMFKRFKALVGKES 312
            H +I     + ++ NN+     + +    +    ++YF+K KS+V  MFK FK +V  +S
Sbjct: 499  HSYICGPMSIASLSNNVYFALFIDD----LSRMTWVYFLKTKSQVLSMFKSFKKMVETQS 558

Query: 313  GYYIKALRSYRKGEFTSNEFKTFCA------ENGIHRPMTVIE---------------WA 372
            G  +K L     GE+ S EF           E    +  TV+E               WA
Sbjct: 559  GQNVKVLIIDNGGEYISKEFNLTAPYLPQQNEVSERKNKTVMEMARCMLFEKRLPKLLWA 618

Query: 373  ------VYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKS 432
                  VYL NR PT+S+  K P +A  G KPS+ HL+VFG + Y H+   K  KLD ++
Sbjct: 619  EAVNTSVYLLNRLPTKSVQSKTPIEAWFGVKPSVKHLKVFGSLCYLHVPSVKRGKLDERA 678

Query: 433  EKHVFISYDASSKNYQLYNSITKKTMKHHGIGMQQDDKFLLFLD-----DQDEPS----- 492
            EK VF+ Y A SK Y++Y+    K +    +   ++  +   L      DQ+ PS     
Sbjct: 679  EKGVFVGYAAESKGYRIYSLSRMKIVISRDVHFDENSYWNWDLKKVHKCDQNTPSILEPA 738

Query: 493  ----------DIIASTSTP-----PTS-------LITPQQSTSSSSTSSSEGPRGMRSFR 552
                      D+ A++ TP     P S       L+  + +  + +    E  + M++  
Sbjct: 739  IESIIIEGPLDVEATSDTPMLKMRPLSDVYERCNLVHAEPTCYTEAARFLEWIKAMKAEI 798

Query: 553  DIYD-----------ETEKAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDY-EV 612
            D  +           E + A+GVKWVF+ K N  G + R+K RLVVK ++Q   +DY + 
Sbjct: 799  DAIERNGTWKLTELPEAKNAIGVKWVFRTKFNSDGSIFRHKARLVVKGFAQVARVDYGDT 858

Query: 613  FAPVARLEIIRLLV------------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVL 672
            FA VA+ + IRLL+            ++VKS FLNG L EE+Y++QP G+ V G E KV 
Sbjct: 859  FALVAKHDTIRLLLALASQMGWKVYHLNVKSAFLNGILLEEIYVQQPEGFEVIGHEHKVY 918

Query: 673  KLKKTLYGLKQAPRMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIF 732
            KL K +YGLKQAPR W ++I+ + +  G+ R   E  L +K       L V LYVDD++ 
Sbjct: 919  KLHKAVYGLKQAPRAWYSRIDSHLIQLGFRRSENEATLYLKQNDDGLQLVVSLYVDDMLV 978

Query: 733  IGNCAHMFEDLKKSISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTREILEKFNMI 792
             G+   +  D K  +   F+M+++G+M+Y+L + + Q   GIFI Q +Y  +IL+KF + 
Sbjct: 979  TGSNVKLLADFKMEMQDVFEMSDLGIMNYFLGMEIYQCSWGIFISQRKYAMDILKKFKLE 1038

Query: 793  NSKT-----------------------------------LEGYYDSEGAGDTNDRKSTSG 835
            + K                                    L+GY DS+ AG  +D KSTSG
Sbjct: 1039 SCKEVATPLAQNEKISKNDGEKLEEPFAYRSLLKTGGVKLDGYADSDWAGSVDDMKSTSG 1098

BLAST of Cucsa.291820 vs. TrEMBL
Match: A5CA01_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_011061 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 2.1e-07
Identity = 44/181 (24.31%), Postives = 86/181 (47.51%), Query Frame = 1

Query: 10  VPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEEPE----------KTLKNTRKKDQKA 69
           +P    E+Y  W ++M+  L SQ +W+++ +  + P           K  +  + K  KA
Sbjct: 11  IPVFNGEHYHIWAVKMRFYLRSQGLWNVVMSEDDPPPLGANPTVAQMKAYEEEKLKKDKA 70

Query: 70  FTIIHSYIDDSNFEKNSGATTTHQA-WQILE---------------NTYKEVDRVKKIIS 129
            T +HS + D  F K     T  Q  +++++               +   ++  + +  +
Sbjct: 71  ITCLHSGLADHIFTKIMNLETPKQREFELMKMKDDESVKDYSGRLMDVVNQMRLLGEAFT 130

Query: 130 DEQVVEKILCSLNEKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQLF 165
           D++VVEKI+ S+ +KF   + AIE+S DL +++  +L   L A ++++L +     E  F
Sbjct: 131 DQKVVEKIMVSVPQKFEAKISAIEESCDLQTLTIVELTSKLHAQEQRVLMRGDKATEGAF 190


HSP 2 Score: 65.9 bits (159), Expect = 2.8e-07
Identity = 36/133 (27.07%), Postives = 68/133 (51.13%), Query Frame = 1

Query: 164 QNRVEENANYAKKDEERGDSSLLLACKCVKTCENNAWYLDTGASNQMCGSKSMFMELDEP 223
           Q + E+NA+  ++++   D  L +A + + + E N W +D+G ++ M    S+F  +D  
Sbjct: 271 QQQPEKNASVIEENKN-DDEHLFMASQTLSSHELNTWLIDSGCTSHMTKYLSIFTSIDRS 330

Query: 224 IGGDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLRQLLEKGYNIL 283
           +   +  G+      KGK  I I  K    + + NV ++P++  N+LS+ Q+L  GY + 
Sbjct: 331 VQPKVKLGNGEVVQAKGKGTIAISTKRGT-KIVTNVLYIPDLDQNLLSVAQMLRNGYAVS 390

Query: 284 MKDYIYFVKEKSE 297
            K+   F+    E
Sbjct: 391 FKENFCFITNVQE 401


HSP 3 Score: 312.4 bits (799), Expect = 1.7e-81
Identity = 179/436 (41.06%), Postives = 251/436 (57.57%), Query Frame = 1

Query: 348 IEWAVYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEK 407
           +  AVY+ NR PT+S+  K P++A +GRKPSI HLR+FGCI YAH+ DQ   KLD K EK
Sbjct: 551 VSTAVYILNRCPTKSVCDKTPEEAWSGRKPSIRHLRIFGCIAYAHVPDQLRKKLDDKGEK 610

Query: 408 HVFISYDASSKNYQLYNSITKKTMKHHGIGMQQDDKF----------LLFLDDQDEPSDI 467
            +FI Y  +SK Y+LYN +TKK +    +   ++  +           +  ++ +E +  
Sbjct: 611 CIFIGYSTNSKAYKLYNPVTKKVIISRDVTFDEEGMWDWSFKAQKVPAVNSENYEEENGH 670

Query: 468 IASTSTPPTSLITPQQS-----------TSSSSTSSSEGPRGMRSFRDIYDET--EKAVG 527
           + +T   P +   PQ+              + +  S E       F D    T  E +  
Sbjct: 671 VDTTPDEPETSSRPQRQRRLPARLEDYVVGNDNDPSDEEIINFALFADCEPVTFEEASNN 730

Query: 528 VKW-------VFKIKRNE--------------------------KGEVERYKIRLVVKDY 587
             W       +  I++N+                           GE++R+K RLV K Y
Sbjct: 731 QYWRKAMDEEIHAIEKNQTWELTDLPANKRQIGVKWVYKTKYKSNGEIDRFKARLVAKGY 790

Query: 588 SQRKGIDY-EVFAPVARLEIIRLLV------------IDVKSTFLNGYLEEEVYLEQPLG 647
            Q+ GIDY EVFAPVARL+ IR+L+            +DVKS FLNG LEEEVY+EQP G
Sbjct: 791 KQKPGIDYFEVFAPVARLDTIRMLISISAQNNWKIHQMDVKSAFLNGTLEEEVYVEQPAG 850

Query: 648 YYVRGQEDKVLKLKKTLYGLKQAPRMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTL 707
           Y ++G+EDKV +LKK LYGLKQAPR W  KI+ YF+DNG+ RCP+EH L IK++   + L
Sbjct: 851 YKIKGKEDKVYRLKKALYGLKQAPRAWYKKIDSYFVDNGFQRCPFEHTLYIKSVDPDNIL 910

Query: 708 AVCLYVDDLIFIGNCAHMFEDLKKSISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQY 715
            VCLYVDDLIF GN   MF + ++++ + F+MT++GLMSY+L I V+Q  +GIFI Q+++
Sbjct: 911 IVCLYVDDLIFTGNNPKMFAEFREAMVKSFEMTDLGLMSYFLGIEVDQRDDGIFISQKKF 970

BLAST of Cucsa.291820 vs. TrEMBL
Match: A0A151UCJ8_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_021271 PE=4 SV=1)

HSP 1 Score: 132.1 bits (331), Expect = 3.1e-27
Identity = 70/140 (50.00%), Postives = 86/140 (61.43%), Query Frame = 1

Query: 714  LEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVAATSCVY-- 773
            L GY DS+ AGD   RKSTSG+ F +G+   +WS KK  +V LST E EY+AA SC    
Sbjct: 1074 LVGYTDSDWAGDIETRKSTSGYAFNLGSGTISWSSKKQQVVALSTAEAEYIAAASCATQA 1133

Query: 774  ---------------DPILIHVDNKSTIALAKNHVFHDRLKHIDTRFHFIRGCISRKKVQ 833
                           +P +I  DNKS IA+ KN VFH+R KHID RFH IR  ++ K+V 
Sbjct: 1134 VWLRRMLEVMHQKQDNPTVIFCDNKSAIAICKNLVFHERSKHIDIRFHKIRELVTEKEVL 1193

Query: 834  VEYVKTEDQIADIFMKPLKA 837
            + Y  TE+QIADIF KPLKA
Sbjct: 1194 INYCHTEEQIADIFTKPLKA 1213


HSP 2 Score: 90.5 bits (223), Expect = 1.0e-14
Identity = 53/127 (41.73%), Postives = 73/127 (57.48%), Query Frame = 1

Query: 166 RVEENANYAKKD-EERGDSS-----LLLACKCVKTCENNAWYLDTGASNQMCGSKSMFME 225
           R ++ AN A+   E+ G+ S     LLLA       E   WYLDTG SN MCG K +F  
Sbjct: 190 RFKQQANIAENQYEQTGEISDNPQTLLLATNNFSGNEA-IWYLDTGCSNHMCGKKELFSS 249

Query: 226 LDEPIGGDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLRQLLEKG 285
           LDE +   + FG+ +   I GK ++ I LK+    FI +V++ P + +N+LSL QL EKG
Sbjct: 250 LDETVKSTVKFGNNSNIPILGKGQVAIRLKDGTQNFISDVFYAPGLHHNLLSLGQLSEKG 309

Query: 286 YNILMKD 287
           YNI + D
Sbjct: 310 YNIQIHD 315


HSP 3 Score: 48.5 bits (114), Expect = 4.6e-02
Identity = 28/62 (45.16%), Postives = 37/62 (59.68%), Query Frame = 1

Query: 285 KDYIYFVKEKSEVFGMFKRFKALVGKESGYYIKALRSYRKGEFTSNEFKTFCAENGIHRP 344
           K ++YF+K+KSE    FK FKALV K+S   IKALR+ R  E+ +     F   +GI   
Sbjct: 452 KSWVYFLKQKSEACDAFKSFKALVEKQSSCKIKALRTDRGQEYLA--CADFIDHHGIQHQ 511

Query: 345 MT 347
           MT
Sbjct: 512 MT 511


HSP 4 Score: 46.2 bits (108), Expect = 2.3e-01
Identity = 24/82 (29.27%), Postives = 50/82 (60.98%), Query Frame = 1

Query: 102 ISDEQVVEKILCSLNEKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQ 161
           I + +VVEKIL ++  KF+ +V  I +S D+  M+  +L GS++++  ++L+K +   E+
Sbjct: 28  IPESKVVEKILRTMPMKFDHVVTTIIESHDIEIMTVAELQGSIESHVSRILEKTEKINEE 87

Query: 162 LFQNRVE--ENANYAKKDEERG 182
             +++V     A  ++ ++ RG
Sbjct: 88  ALKSQVNFTNIAEPSRNEDSRG 109


HSP 5 Score: 307.4 bits (786), Expect = 5.5e-80
Identity = 210/657 (31.96%), Postives = 314/657 (47.79%), Query Frame = 1

Query: 154 KNKHTIEQLFQNRVEENANYAKKDEERGDSSLLLACKCVKTCENNAWYLDTGASNQMCGS 213
           K K  I +  + R    AN++++ E+     ++ +C   +  +++ W +D+G +N M   
Sbjct: 288 KRKGHIAKYCRTREINRANFSQEKEK--SEEMVFSCHTAQEEKDDVWVIDSGCTNHMAAD 347

Query: 214 KSMFMELDEPIGGDIVFGDATKNSIKGKCKILIHLKN-----EKHEFIINVYFVPNIKNN 273
            ++F E+D              +S   K    IH+ N      K +   N YF+  I + 
Sbjct: 348 PNLFREMD--------------SSYHAK----IHMGNGSIAQSKGKEGGNWYFITFIDDY 407

Query: 274 ILSLRQLLEKGYNILMKDYIYFVKEKSEVFGMFKRFKALVGKESGYYIKALRSYRKGEFT 333
              +              ++YF+KEKS    +FK+FKA+V  +S   IK LRS + GE+ 
Sbjct: 408 TRMI--------------WVYFLKEKSAALEIFKKFKAMVENQSNRKIKVLRSDQGGEYI 467

Query: 334 SNEFKTFCAENGIHRPMT--------------------------------------VIEW 393
           S EF+ +C   GI R +T                                       +  
Sbjct: 468 SKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSFWAEAVNT 527

Query: 394 AVYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEKHVF 453
           AVY+ NR PT+++  + P +A  G+KP I H+RVFGCI YA +  QK  K D+KS++ +F
Sbjct: 528 AVYILNRSPTKAVPNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNKSDRCIF 587

Query: 454 ISYDASSKNYQLYNSITKKTMKHHGIGMQQDDKFLLFLDDQDEPSDIIASTSTPPTSLIT 513
           + Y     + Q  + +     ++H    Q               S   AS+ + P+S   
Sbjct: 588 VGYADGQPHMQGTHEV-----EYHPPSPQ----------SSSPRSSSSASSDSSPSSEEQ 647

Query: 514 PQQSTSSSSTSSSEGPRGMRSFR---------DIYDETEK-------------------- 573
              +  SS T S+   RG                + E EK                    
Sbjct: 648 ISYTGISSKTESTSQQRGSEQHEFCNYSVVEPQSFQEAEKHDNWIKAMEDEIHMIEKNNT 707

Query: 574 -----------AVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGID-YEVFAPVARLE 633
                       +GVKWV+K K N  G V++YK RLV K + Q+ GID YE +A VARLE
Sbjct: 708 WELVDRPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGFKQKPGIDYYETYAHVARLE 767

Query: 634 IIRLLV------------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYG 693
            I  ++            +DVKS FLNGYL+EE+Y+EQP  + V+G E+KV +LKK LYG
Sbjct: 768 TIHTIIALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQPERFSVQGGENKVFRLKKALYG 827

Query: 694 LKQAPRMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMF 715
           LKQAPR W ++I+KYF+  G+ +   E  L +K  G  D L V LYVDDLI+ GN   + 
Sbjct: 828 LKQAPRAWYSQIDKYFIQKGFAKSISEPILYVKKTG-TDILIVSLYVDDLIYTGNSEKLM 887

BLAST of Cucsa.291820 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 149.1 bits (375), Expect = 1.3e-35
Identity = 82/230 (35.65%), Postives = 134/230 (58.26%), Query Frame = 1

Query: 500 EKAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDY-EVFAPVARLEIIRLLV--- 559
           +K +G KWV+KIK N  G +ERYK RLV K Y+Q++GID+ E F+PV +L  ++L++   
Sbjct: 124 KKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAIS 183

Query: 560 ---------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQE----DKVLKLKKTLYGLKQAP 619
                    +D+ + FLNG L+EE+Y++ P GY  R  +    + V  LKK++YGLKQA 
Sbjct: 184 AIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQAS 243

Query: 620 RMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKK 679
           R W  K +   +  G+++   +H   +K I     L V +YVDD+I   N     ++LK 
Sbjct: 244 RQWFLKFSVTLIGFGFVQSHSDHTYFLK-ITATLFLCVLVYVDDIIICSNNDAAVDELKS 303

Query: 680 SISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTREILEKFNMINSK 713
            +   FK+ ++G + Y+L + + +S  GI I Q +Y  ++L++  ++  K
Sbjct: 304 QLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCK 352


HSP 2 Score: 59.7 bits (143), Expect = 1.0e-08
Identity = 33/111 (29.73%), Postives = 54/111 (48.65%), Query Frame = 1

Query: 714 LEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVAATSCVYD- 773
           L+ + D+      + R+ST+G+  F+G    +W  KK  +V+ S+ E EY A +    + 
Sbjct: 442 LQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEM 501

Query: 774 ----------------PILIHVDNKSTIALAKNHVFHDRLKHIDTRFHFIR 808
                           P L+  DN + I +A N VFH+R KHI++  H +R
Sbjct: 502 MWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVR 552

BLAST of Cucsa.291820 vs. TAIR10
Match: AT1G48720.1 (AT1G48720.1 unknown protein)

HSP 1 Score: 78.2 bits (191), Expect = 2.7e-14
Identity = 37/90 (41.11%), Postives = 57/90 (63.33%), Query Frame = 1

Query: 1  MVNKSFSFQVPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEEPEKT----------LK 60
          M + +  FQVP LTK NY +W +RMKA+LG+ +VW+I+  G+ EPE            L+
Sbjct: 1  MASNNVPFQVPVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLR 60

Query: 61 NTRKKDQKAFTIIHSYIDDSNFEKNSGATT 81
          ++RK+D+KA  +I+  +D+  FEK   AT+
Sbjct: 61 DSRKRDKKALCLIYQGLDEDTFEKVVEATS 90

BLAST of Cucsa.291820 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 50.4 bits (119), Expect = 6.1e-06
Identity = 26/57 (45.61%), Postives = 38/57 (66.67%), Query Frame = 1

Query: 710 NSK-TLEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVA 766
           NSK  ++ + DS+ AG T+ R+ST+GF  F+G    +WS K+ P V+ S+ ETEY A
Sbjct: 159 NSKLNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRA 215

BLAST of Cucsa.291820 vs. NCBI nr
Match: gi|922485402|ref|XP_013583262.1| (PREDICTED: LOW QUALITY PROTEIN: copia protein [Brassica oleracea var. oleracea])

HSP 1 Score: 381.3 bits (978), Expect = 4.3e-102
Identity = 220/402 (54.73%), Postives = 261/402 (64.93%), Query Frame = 1

Query: 501  KAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDY-EVFAPVARLEIIRLLV---- 560
            KA+GVKWV+K K+N KGEVERYK RLV K YSQR GIDY EVFAPVARLE +RL++    
Sbjct: 762  KAIGVKWVYKAKKNSKGEVERYKARLVAKCYSQRAGIDYDEVFAPVARLETVRLIISLAA 821

Query: 561  --------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPRMWNT 620
                    +DVKS FLNG LEEEVY+EQP GY V+G+EDKVL+LKK LYGLKQAPR WNT
Sbjct: 822  QKSWRIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRAWNT 881

Query: 621  KINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKKSISQE 680
            +I+K F + G+++CPYEH L IKT  + D L  CLYVDDLIF GN   MFED K  +++E
Sbjct: 882  QIDKCFKEKGFIKCPYEHALYIKT-QNNDLLIACLYVDDLIFTGNNPIMFEDFKMEMTKE 941

Query: 681  FKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTR-EILEKFNMINSKTLE----------- 740
            F MT+IGLMSYYL I V Q + GIFI QE Y + +IL    ++ S+ +E           
Sbjct: 942  FMMTDIGLMSYYLGIEV-QEENGIFITQEGYAKPDILHAVGVV-SRYMEHPTTTHFKAAK 1001

Query: 741  ------------GYYDSEGA-----GDTNDR--------KSTSGFIFFIGNIAFTWSFKK 800
                        G Y S        G ++          K+TSGF+FFIG   FT   KK
Sbjct: 1002 RILRYIKGTINFGLYYSISEYYKLFGYSDSDWGGDVDDRKNTSGFVFFIGETVFTXMSKK 1061

Query: 801  HPIVTLSTCETEYVAATSCV-----------------YDPILIHVDNKSTIALAKNHVFH 836
             PIVTLSTCE EYVAATSCV                  +P  I VDNKS IALAKN VFH
Sbjct: 1062 QPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELNLPQEEPTKIFVDNKSAIALAKNPVFH 1121

BLAST of Cucsa.291820 vs. NCBI nr
Match: gi|922485402|ref|XP_013583262.1| (PREDICTED: LOW QUALITY PROTEIN: copia protein [Brassica oleracea var. oleracea])

HSP 1 Score: 168.7 bits (426), Expect = 4.3e-38
Identity = 100/218 (45.87%), Postives = 134/218 (61.47%), Query Frame = 1

Query: 164 QNRVEENANYAKKDEERGDSSLLLACKCVKTCENNAWY----LDTGASNQMCGSKSMFME 223
           +NRVEE +NY ++  +  D  LL+A K  +  E + WY    LD+GASN MCG+KSMF+E
Sbjct: 298 KNRVEEKSNYVEERSKEEDM-LLMAYKKDEPNEVHKWYXXWCLDSGASNHMCGNKSMFVE 357

Query: 224 LDEPIGGDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLRQLLEKG 283
           LDE +  ++  GD +K  +KGK  ILI L ++K  FI NVY++P++K NILSL QLLEKG
Sbjct: 358 LDESV--NMALGDESKMEVKGKGNILIRLXDDK--FISNVYYIPSMKTNILSLGQLLEKG 417

Query: 284 YNILMKD------------------------------YIYFVKEKSEVFGMFKRFKALVG 343
           Y+I +KD                              ++YF+K+KSEVF  FK+FK  V 
Sbjct: 418 YDIRLKDNSLSLRDNANNLITKVPMSILFIDDFSRKTWVYFLKQKSEVFENFKKFKTHVE 477

Query: 344 KESGYYIKALRSYRKGEFTSNEFKTFCAENGIHRPMTV 348
           KESG  IK++RS R GEF S EF  +C +NGI R +TV
Sbjct: 478 KESGLKIKSMRSDRGGEFMSKEFMKYCEDNGIRRQLTV 510


HSP 2 Score: 127.1 bits (318), Expect = 1.5e-25
Identity = 67/157 (42.68%), Postives = 100/157 (63.69%), Query Frame = 1

Query: 351 AVYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEKHVF 410
           AVY+SNR PT+S+ +K P++A +GRKP ++HLRVFG I +AH+ D+K SKLD KSEK++F
Sbjct: 552 AVYISNRSPTKSVLEKTPQEAXSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIF 611

Query: 411 ISYDASSKNYQLYNSITKKTMKHHGIGMQQDDKFLLFLDDQD-------EPSDIIASTST 470
           I YDA+SK Y+LYN  TKKT+ +  +   ++ ++    +++D       E  ++      
Sbjct: 612 IGYDANSKGYKLYNPETKKTIINRNVIFDEEGEWDWRSNNEDYNFFPSFEEDNVEQPREE 671

Query: 471 PPTSLITPQQSTSSSSTSSSEGPRGMRSFRDIYDETE 501
           P T   +P  S+    +SS   PR  RS +DIY+ TE
Sbjct: 672 PTTPPTSPTTSSQGDESSSERTPR-FRSLQDIYEVTE 707


HSP 3 Score: 95.5 bits (236), Expect = 4.7e-16
Identity = 43/111 (38.74%), Postives = 69/111 (62.16%), Query Frame = 1

Query: 1   MVNKSFSFQVPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEEPEKT----------LK 60
           M N     QVP LTK NY +W +RM A+LG+ +VW+I+   + EPE            L+
Sbjct: 1   MANNGVPLQVPLLTKSNYDNWSLRMMAILGAHDVWEIVEKCFNEPENDGGLSQTQKDGLR 60

Query: 61  NTRKKDQKAFTIIHSYIDDSNFEKNSGATTTHQAWQILENTYKEVDRVKKI 102
           + +K+D+KA  +I+  +D+  FEK +GA T+ +AW+ L+ +YK  ++VKK+
Sbjct: 61  DAKKRDKKALCLIYQGLDEDTFEKVAGAKTSKEAWEKLQTSYKGAEQVKKV 111


HSP 4 Score: 65.1 bits (157), Expect = 6.8e-07
Identity = 31/80 (38.75%), Postives = 54/80 (67.50%), Query Frame = 1

Query: 102 ISDEQVVEKILCSLNEKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQ 161
           + + +++EK+L SL+ KF  IV  IE++KDL +M+ +QL+GSLQAY++K  KK +  +EQ
Sbjct: 152 LDEVRIMEKVLRSLDSKFEHIVTIIEETKDLETMTMEQLLGSLQAYEEK-KKKKEDIVEQ 211

Query: 162 LFQNRVEENANYAKKDEERG 182
           + + R+++     +    RG
Sbjct: 212 VLKMRIDQKEESGRNHPRRG 230


HSP 5 Score: 370.2 bits (949), Expect = 9.8e-99
Identity = 207/407 (50.86%), Postives = 262/407 (64.37%), Query Frame = 1

Query: 351  AVYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEKHVF 410
            AVYL NR PT+S+  K P++A +GRKP ++HLRVFG I +AH+ D+K SKLD KSEK++F
Sbjct: 663  AVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIF 722

Query: 411  ISYDASSKNYQLYNSITKKTMKHHGIGMQQDDK------------FLLFLDDQDEPS--- 470
            I YD +SK Y+LYN  TKKT+    I   ++ +            F  F +D+ EP+   
Sbjct: 723  IGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPEPTREE 782

Query: 471  DIIASTSTPPTSLITPQQSTSSSSTSSSEGPRGMRSFRDIYDET---------------- 530
                  +TPPTS  + Q           E     +++R+  DE                 
Sbjct: 783  PPSEEPTTPPTSPTSSQIEEKCEPMDFQEAIE-KKTWRNAMDEEIKSIQKNDTWELTSLP 842

Query: 531  --EKAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDY-EVFAPVARLEIIRLLV- 590
               KA+GVKWV+K K+N KGEVERYK RLV K YSQR GIDY EVFAPVARLE +RL++ 
Sbjct: 843  NGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEVFAPVARLETVRLIIS 902

Query: 591  -----------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVLKLKKTLYGLKQAPRM 650
                       +DVKS FLNG LEEEVY+EQP GY V+G+EDKVL+LKK LYGLKQAPR 
Sbjct: 903  LAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRA 962

Query: 651  WNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIFIGNCAHMFEDLKKSI 710
            WNT+I+KYF +  +++CPYEH L IK I   D L  CLYVDDLIF GN   MFE+ KK +
Sbjct: 963  WNTRIDKYFKEKDFIKCPYEHALYIK-IQKEDILIACLYVDDLIFTGNNPSMFEEFKKEM 1022

Query: 711  SQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTREILEKFNMINS 712
            ++EF+MT+IGLMSYYL I V+Q   GIFI QE Y +E+L+KF M +S
Sbjct: 1023 TKEFEMTDIGLMSYYLGIEVKQEDNGIFITQEGYAKEVLKKFKMDDS 1067

BLAST of Cucsa.291820 vs. NCBI nr
Match: gi|12321254|gb|AAG50698.1|AC079604_5 (copia-type polyprotein, putative [Arabidopsis thaliana])

HSP 1 Score: 176.8 bits (447), Expect = 1.6e-40
Identity = 89/139 (64.03%), Postives = 101/139 (72.66%), Query Frame = 1

Query: 714  LEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVAATSCVY-- 773
            L GY DS+  GD +DRKSTSGF+F+IG+ AFTW  KK PIVTLSTCE EYVAATSCV   
Sbjct: 1158 LVGYSDSDWGGDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHA 1217

Query: 774  ---------------DPILIHVDNKSTIALAKNHVFHDRLKHIDTRFHFIRGCISRKKVQ 833
                           +P  I VDNKS IALAKN VFHDR KHIDTR+H+IR C+S+K VQ
Sbjct: 1218 IWLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQ 1277

Query: 834  VEYVKTEDQIADIFMKPLK 836
            +EYVKT DQ+ADIF KPLK
Sbjct: 1278 LEYVKTHDQVADIFTKPLK 1296


HSP 2 Score: 134.8 bits (338), Expect = 7.0e-28
Identity = 65/121 (53.72%), Postives = 86/121 (71.07%), Query Frame = 1

Query: 166 RVEENANYAKKDEERGDSSLLLACKCVKTCENNAWYLDTGASNQMCGSKSMFMELDEPIG 225
           + EE ANY ++  +  D  L+ + K  +  EN+ WYLD+GASN MCG KSMF ELDE + 
Sbjct: 301 KFEEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDESVR 360

Query: 226 GDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLRQLLEKGYNILMK 285
           G++  GD +K  +KGK  ILI LKN  H+FI NVY++P++K NILSL QLLEKGY+I +K
Sbjct: 361 GNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLK 420

Query: 286 D 287
           D
Sbjct: 421 D 421


HSP 3 Score: 99.0 bits (245), Expect = 4.2e-17
Identity = 46/111 (41.44%), Postives = 71/111 (63.96%), Query Frame = 1

Query: 1   MVNKSFSFQVPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEEPEKT----------LK 60
           M + +  FQVP LTK NY +W +RMKA+LG+ +VW+I+  G+ EPE            L+
Sbjct: 1   MASNNVPFQVPVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLR 60

Query: 61  NTRKKDQKAFTIIHSYIDDSNFEKNSGATTTHQAWQILENTYKEVDRVKKI 102
           ++RK+D+KA  +I+  +D+  FEK   AT+  +AW+ L  +YK  D+VKK+
Sbjct: 61  DSRKRDKKALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKV 111


HSP 4 Score: 81.3 bits (199), Expect = 9.1e-12
Identity = 39/63 (61.90%), Postives = 48/63 (76.19%), Query Frame = 1

Query: 285 KDYIYFVKEKSEVFGMFKRFKALVGKESGYYIKALRSYRKGEFTSNEFKTFCAENGIHRP 344
           K ++YF+KEKSEVF +FK+FKA V KESG  IK +RS R GEFTS EF  +C +NGI R 
Sbjct: 559 KTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQ 618

Query: 345 MTV 348
           +TV
Sbjct: 619 LTV 621


HSP 5 Score: 61.2 bits (147), Expect = 9.8e-06
Identity = 30/71 (42.25%), Postives = 50/71 (70.42%), Query Frame = 1

Query: 102 ISDEQVVEKILCSLNEKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQ 161
           + D +++EK+L SL+ KF  IV  IE++KDL +M+ +QL+GSLQAY++K  KK +  +EQ
Sbjct: 152 LDDVRIMEKVLRSLDLKFEHIVTVIEETKDLEAMTIEQLLGSLQAYEEK-KKKKEDIVEQ 211

Query: 162 LFQNRVEENAN 173
           +   ++ +  N
Sbjct: 212 VLNMQITKEEN 221


HSP 6 Score: 316.2 bits (809), Expect = 1.7e-82
Identity = 184/493 (37.32%), Postives = 266/493 (53.96%), Query Frame = 1

Query: 287 YIYFVKEKSEVFGMFKRFKALVGKESGYYIKALRSYRKGEFTSNEFKTFCAENGIHRPMT 346
           ++YF+KEKS    +FK+FKA+V  +S   IK LRS + GE+ S EF+ +C   GI R +T
Sbjct: 502 WVYFLKEKSAALEIFKKFKAMVENQSNRKIKVLRSDQGGEYISKEFEKYCENAGIRRQLT 561

Query: 347 --------------------------------------VIEWAVYLSNRFPTRSLWKKPP 406
                                                  +  A+Y+ NR PT+++  + P
Sbjct: 562 AGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSFWAEAVNTAIYILNRSPTKAVPNRTP 621

Query: 407 KQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEKHVFISYDASSKNYQLYNSITK 466
            +A  G+KP I H+RVFGCI YA +  QK  K D+KS++ +F+ Y    K Y+LYN   K
Sbjct: 622 FEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNKSDRCIFVGYADGIKGYRLYNLEKK 681

Query: 467 KTMKHHGIGMQQDDKFLLFLDDQDEPSDIIASTSTPPTSLITPQQSTSSSSTSSSEGPRG 526
           K +    +   +   +               +  +P  S++ PQ     S   + +    
Sbjct: 682 KIIISRDVIFDESATW---------------NWKSPEASIVEPQ-----SFQEAEKHDNW 741

Query: 527 MRSFRDIYDETEK--------------AVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQR 586
           +++  D     EK               +GVKWV+K K N  G V++YK RLV K + Q+
Sbjct: 742 IKAMEDEIHMIEKNNTWELVDRPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGFKQK 801

Query: 587 KGID-YEVFAPVARLEIIRLLV------------IDVKSTFLNGYLEEEVYLEQPLGYYV 646
            GID YE +APVARLE IR ++            +DVKS FLNGYL+EE+Y+EQP G+ V
Sbjct: 802 PGIDYYETYAPVARLETIRTIIALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQPEGFSV 861

Query: 647 RGQEDKVLKLKKTLYGLKQAPRMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVC 706
           +G E+KV +LKK LYGLKQAPR W ++I+KYF+  G+ +   E  L +   G  D L V 
Sbjct: 862 QGGENKVFRLKKALYGLKQAPRAWYSQIDKYFIQKGFAKSISEPTLYVNKTG-TDILIVS 921

Query: 707 LYVDDLIFIGNCAHMFEDLKKSISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTRE 715
           LYVDDLI+ GN   M +D KK +   ++M+++GL+ Y+L + V QS EGIFI Q +Y + 
Sbjct: 922 LYVDDLIYTGNSEKMMQDFKKDMMHTYEMSDLGLLHYFLGMEVHQSDEGIFISQRKYAKN 973

BLAST of Cucsa.291820 vs. NCBI nr
Match: gi|77556816|gb|ABA99612.1| (retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group])

HSP 1 Score: 117.9 bits (294), Expect = 8.8e-23
Identity = 63/171 (36.84%), Postives = 105/171 (61.40%), Query Frame = 1

Query: 10  VPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEE----------PEKTLKNTRKKDQKA 69
           VP    ENY  W I+M+ LL SQ +WDI+ NGY+E           +K+L   R  D KA
Sbjct: 6   VPVFAGENYDIWSIKMRTLLLSQGLWDIVENGYQEYSAGETLTAEQKKSLAEDRMSDAKA 65

Query: 70  FTIIHSYIDDSNFEKNSGATTTHQAWQILENTYKEVDRVK---KIISDEQVVEKILCSLN 129
             +I   + +S F +  GA  + +AW  L+  ++   +++   + I+D++VVEKIL SL 
Sbjct: 66  LFLIQQGVAESLFPRIIGAKKSKEAWDKLKEEFQGSQKMRLYGEDINDQKVVEKILISLP 125

Query: 130 EKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQLFQNRV 168
           EK+ +IV AIE+SKD+ +++  QLM SL++++++ L++   +IE  FQ+++
Sbjct: 126 EKYEYIVAAIEESKDMLTLTIQQLMSSLESHEERKLQREGSSIENAFQSKL 176


HSP 2 Score: 88.6 bits (218), Expect = 5.7e-14
Identity = 50/146 (34.25%), Postives = 77/146 (52.74%), Query Frame = 1

Query: 706  FNMINSKTLEGYYDSEGAGDTNDRKSTSGFIFFIGNIAFTWSFKKHPIVTLSTCETEYVA 765
            +  +    L GY DS+ AG  +D KSTS + F +G+                  E EYVA
Sbjct: 1053 YKPVKESKLIGYTDSDWAGCLDDMKSTSSYAFSLGS-----------------AEAEYVA 1112

Query: 766  ATSCV-----------------YDPILIHVDNKSTIALAKNHVFHDRLKHIDTRFHFIRG 825
            A+  V                 Y P  I+ D+KS IA+++N V HDR KHI  ++H+IR 
Sbjct: 1113 ASKAVSQVVWLRRIMEDLGEKQYQPTTIYCDSKSAIAISENPVSHDRTKHIAIKYHYIRE 1172

Query: 826  CISRKKVQVEYVKTEDQIADIFMKPL 835
             + R++V++++ +T++Q+ADIF K L
Sbjct: 1173 AVDRQEVKLKFCRTDEQLADIFTKAL 1181


HSP 3 Score: 67.8 bits (164), Expect = 1.0e-07
Identity = 36/134 (26.87%), Postives = 71/134 (52.99%), Query Frame = 1

Query: 154 KNKHTIEQLFQNRVEENANYAKKDEERGDSSLLLACKCVKTCENNAWYLDTGASNQMCGS 213
           K K  I +  + R    AN++++ E+     ++ +C   +  +++ W +D+G +N M   
Sbjct: 233 KRKGHIAKYCRTREINRANFSQEKEK--SEEMVFSCHTAQEEKDDVWVIDSGCTNHMAAD 292

Query: 214 KSMFMELDEPIGGDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLR 273
            ++F E+D      I  G+ +    +GK  + +   +   +FI +V  VP++K N+LS+ 
Sbjct: 293 PNLFREMDSSYHAKIHMGNGSIAQSEGKGTVAVQTADGP-KFIKDVLLVPDLKQNLLSIG 352

Query: 274 QLLEKGYNILMKDY 288
           QLLE GY +  +D+
Sbjct: 353 QLLEHGYAVYFEDF 363


HSP 4 Score: 314.3 bits (804), Expect = 6.4e-82
Identity = 219/703 (31.15%), Postives = 337/703 (47.94%), Query Frame = 1

Query: 253  HEFIINVYFVPNIKNNILSLRQLLEKGYNILMKDYIYFVKEKSEVFGMFKRFKALVGKES 312
            H +I     + ++ NN+     + +    +    ++YF+K KS+V  MFK FK +V  +S
Sbjct: 499  HSYICGPMSIASLSNNVYFALFIDD----LSRMTWVYFLKTKSQVLSMFKSFKKMVETQS 558

Query: 313  GYYIKALRSYRKGEFTSNEFKTFCA------ENGIHRPMTVIE---------------WA 372
            G  +K L     GE+ S EF           E    +  TV+E               WA
Sbjct: 559  GQNVKVLIIDNGGEYISKEFNLTAPYLPQQNEVSERKNKTVMEMARCMLFEKRLPKLLWA 618

Query: 373  ------VYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKS 432
                  VYL NR PT+S+  K P +A  G KPS+ HL+VFG + Y H+   K  KLD ++
Sbjct: 619  EAVNTSVYLLNRLPTKSVQSKTPIEAWFGVKPSVKHLKVFGSLCYLHVPSVKRGKLDERA 678

Query: 433  EKHVFISYDASSKNYQLYNSITKKTMKHHGIGMQQDDKFLLFLD-----DQDEPS----- 492
            EK VF+ Y A SK Y++Y+    K +    +   ++  +   L      DQ+ PS     
Sbjct: 679  EKGVFVGYAAESKGYRIYSLSRMKIVISRDVHFDENSYWNWDLKKVHKCDQNTPSILEPA 738

Query: 493  ----------DIIASTSTP-----PTS-------LITPQQSTSSSSTSSSEGPRGMRSFR 552
                      D+ A++ TP     P S       L+  + +  + +    E  + M++  
Sbjct: 739  IESIIIEGPLDVEATSDTPMLKMRPLSDVYERCNLVHAEPTCYTEAARFLEWIKAMKAEI 798

Query: 553  DIYD-----------ETEKAVGVKWVFKIKRNEKGEVERYKIRLVVKDYSQRKGIDY-EV 612
            D  +           E + A+GVKWVF+ K N  G + R+K RLVVK ++Q   +DY + 
Sbjct: 799  DAIERNGTWKLTELPEAKNAIGVKWVFRTKFNSDGSIFRHKARLVVKGFAQVARVDYGDT 858

Query: 613  FAPVARLEIIRLLV------------IDVKSTFLNGYLEEEVYLEQPLGYYVRGQEDKVL 672
            FA VA+ + IRLL+            ++VKS FLNG L EE+Y++QP G+ V G E KV 
Sbjct: 859  FALVAKHDTIRLLLALASQMGWKVYHLNVKSAFLNGILLEEIYVQQPEGFEVIGHEHKVY 918

Query: 673  KLKKTLYGLKQAPRMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTLAVCLYVDDLIF 732
            KL K +YGLKQAPR W ++I+ + +  G+ R   E  L +K       L V LYVDD++ 
Sbjct: 919  KLHKAVYGLKQAPRAWYSRIDSHLIQLGFRRSENEATLYLKQNDDGLQLVVSLYVDDMLV 978

Query: 733  IGNCAHMFEDLKKSISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQYTREILEKFNMI 792
             G+   +  D K  +   F+M+++G+M+Y+L + + Q   GIFI Q +Y  +IL+KF + 
Sbjct: 979  TGSNVKLLADFKMEMQDVFEMSDLGIMNYFLGMEIYQCSWGIFISQRKYAMDILKKFKLE 1038

Query: 793  NSKT-----------------------------------LEGYYDSEGAGDTNDRKSTSG 835
            + K                                    L+GY DS+ AG  +D KSTSG
Sbjct: 1039 SCKEVATPLAQNEKISKNDGEKLEEPFAYRSLLKTGGVKLDGYADSDWAGSVDDMKSTSG 1098

BLAST of Cucsa.291820 vs. NCBI nr
Match: gi|147800844|emb|CAN71037.1| (hypothetical protein VITISV_011061 [Vitis vinifera])

HSP 1 Score: 66.2 bits (160), Expect = 3.0e-07
Identity = 44/181 (24.31%), Postives = 86/181 (47.51%), Query Frame = 1

Query: 10  VPRLTKENYSSWCIRMKALLGSQNVWDIINNGYEEPE----------KTLKNTRKKDQKA 69
           +P    E+Y  W ++M+  L SQ +W+++ +  + P           K  +  + K  KA
Sbjct: 11  IPVFNGEHYHIWAVKMRFYLRSQGLWNVVMSEDDPPPLGANPTVAQMKAYEEEKLKKDKA 70

Query: 70  FTIIHSYIDDSNFEKNSGATTTHQA-WQILE---------------NTYKEVDRVKKIIS 129
            T +HS + D  F K     T  Q  +++++               +   ++  + +  +
Sbjct: 71  ITCLHSGLADHIFTKIMNLETPKQREFELMKMKDDESVKDYSGRLMDVVNQMRLLGEAFT 130

Query: 130 DEQVVEKILCSLNEKFNFIVVAIEKSKDLSSMSTDQLMGSLQAYKKKLLKKNKHTIEQLF 165
           D++VVEKI+ S+ +KF   + AIE+S DL +++  +L   L A ++++L +     E  F
Sbjct: 131 DQKVVEKIMVSVPQKFEAKISAIEESCDLQTLTIVELTSKLHAQEQRVLMRGDKATEGAF 190


HSP 2 Score: 65.9 bits (159), Expect = 4.0e-07
Identity = 36/133 (27.07%), Postives = 68/133 (51.13%), Query Frame = 1

Query: 164 QNRVEENANYAKKDEERGDSSLLLACKCVKTCENNAWYLDTGASNQMCGSKSMFMELDEP 223
           Q + E+NA+  ++++   D  L +A + + + E N W +D+G ++ M    S+F  +D  
Sbjct: 271 QQQPEKNASVIEENKN-DDEHLFMASQTLSSHELNTWLIDSGCTSHMTKYLSIFTSIDRS 330

Query: 224 IGGDIVFGDATKNSIKGKCKILIHLKNEKHEFIINVYFVPNIKNNILSLRQLLEKGYNIL 283
           +   +  G+      KGK  I I  K    + + NV ++P++  N+LS+ Q+L  GY + 
Sbjct: 331 VQPKVKLGNGEVVQAKGKGTIAISTKRGT-KIVTNVLYIPDLDQNLLSVAQMLRNGYAVS 390

Query: 284 MKDYIYFVKEKSE 297
            K+   F+    E
Sbjct: 391 FKENFCFITNVQE 401


HSP 3 Score: 312.4 bits (799), Expect = 2.4e-81
Identity = 179/436 (41.06%), Postives = 251/436 (57.57%), Query Frame = 1

Query: 348 IEWAVYLSNRFPTRSLWKKPPKQAQTGRKPSIAHLRVFGCITYAHMLDQKLSKLDHKSEK 407
           +  AVY+ NR PT+S+  K P++A +GRKPSI HLR+FGCI YAH+ DQ   KLD K EK
Sbjct: 551 VSTAVYILNRCPTKSVCDKTPEEAWSGRKPSIRHLRIFGCIAYAHVPDQLRKKLDDKGEK 610

Query: 408 HVFISYDASSKNYQLYNSITKKTMKHHGIGMQQDDKF----------LLFLDDQDEPSDI 467
            +FI Y  +SK Y+LYN +TKK +    +   ++  +           +  ++ +E +  
Sbjct: 611 CIFIGYSTNSKAYKLYNPVTKKVIISRDVTFDEEGMWDWSFKAQKVPAVNSENYEEENGH 670

Query: 468 IASTSTPPTSLITPQQS-----------TSSSSTSSSEGPRGMRSFRDIYDET--EKAVG 527
           + +T   P +   PQ+              + +  S E       F D    T  E +  
Sbjct: 671 VDTTPDEPETSSRPQRQRRLPARLEDYVVGNDNDPSDEEIINFALFADCEPVTFEEASNN 730

Query: 528 VKW-------VFKIKRNE--------------------------KGEVERYKIRLVVKDY 587
             W       +  I++N+                           GE++R+K RLV K Y
Sbjct: 731 QYWRKAMDEEIHAIEKNQTWELTDLPANKRQIGVKWVYKTKYKSNGEIDRFKARLVAKGY 790

Query: 588 SQRKGIDY-EVFAPVARLEIIRLLV------------IDVKSTFLNGYLEEEVYLEQPLG 647
            Q+ GIDY EVFAPVARL+ IR+L+            +DVKS FLNG LEEEVY+EQP G
Sbjct: 791 KQKPGIDYFEVFAPVARLDTIRMLISISAQNNWKIHQMDVKSAFLNGTLEEEVYVEQPAG 850

Query: 648 YYVRGQEDKVLKLKKTLYGLKQAPRMWNTKINKYFLDNGYLRCPYEHYLSIKTIGHRDTL 707
           Y ++G+EDKV +LKK LYGLKQAPR W  KI+ YF+DNG+ RCP+EH L IK++   + L
Sbjct: 851 YKIKGKEDKVYRLKKALYGLKQAPRAWYKKIDSYFVDNGFQRCPFEHTLYIKSVDPDNIL 910

Query: 708 AVCLYVDDLIFIGNCAHMFEDLKKSISQEFKMTNIGLMSYYLDIVVEQSKEGIFIFQEQY 715
            VCLYVDDLIF GN   MF + ++++ + F+MT++GLMSY+L I V+Q  +GIFI Q+++
Sbjct: 911 IVCLYVDDLIFTGNNPKMFAEFREAMVKSFEMTDLGLMSYFLGIEVDQRDDGIFISQKKF 970

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC1.5e-3833.68Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME1.1e-3033.92Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
YCH4_YEAST9.7e-1430.37Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
YN12B_YEAST3.6e-0827.27Transposon Ty1-NL2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
YL14B_YEAST4.7e-0825.63Transposon Ty1-LR4 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
Q9C536_ARATH6.9e-9950.86Copia-type polyprotein, putative OS=Arabidopsis thaliana GN=T18I24.5 PE=4 SV=1[more]
Q9C536_ARATH1.1e-4064.03Copia-type polyprotein, putative OS=Arabidopsis thaliana GN=T18I24.5 PE=4 SV=1[more]
Q2QLK1_ORYSJ6.1e-2336.84Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ... [more]
A5CA01_VITVI2.1e-0724.31Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_011061 PE=4 SV=1[more]
A0A151UCJ8_CAJCA3.1e-2750.00Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=... [more]
Match NameE-valueIdentityDescription
AT4G23160.11.3e-3535.65 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
AT1G48720.12.7e-1441.11 unknown protein[more]
ATMG00810.16.1e-0645.61ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
Match NameE-valueIdentityDescription
gi|922485402|ref|XP_013583262.1|4.3e-10254.73PREDICTED: LOW QUALITY PROTEIN: copia protein [Brassica oleracea var. oleracea][more]
gi|922485402|ref|XP_013583262.1|4.3e-3845.87PREDICTED: LOW QUALITY PROTEIN: copia protein [Brassica oleracea var. oleracea][more]
gi|12321254|gb|AAG50698.1|AC079604_51.6e-4064.03copia-type polyprotein, putative [Arabidopsis thaliana][more]
gi|77556816|gb|ABA99612.1|8.8e-2336.84retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group][more]
gi|147800844|emb|CAN71037.1|3.0e-0724.31hypothetical protein VITISV_011061 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR012337RNaseH-like_sf
IPR013103RVT_2
IPR025314DUF4219
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006259 DNA metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.291820.1Cucsa.291820.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 287..385
score: 6.4
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 498..714
score: 1.7
IPR025314Domain of unknown function DUF4219PFAMPF13961DUF4219coord: 13..39
score: 1.
NoneNo IPR availableunknownCoilCoilcoord: 159..179
scor
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 11..767
score: 2.8E
NoneNo IPR availablePANTHERPTHR11439:SF127SUBFAMILY NOT NAMEDcoord: 11..767
score: 2.8E
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 53..101
score: 7.
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 501..698
score: 3.87E-10coord: 725..804
score: 3.87

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None