Cucsa.300520 (gene) Cucumber (Gy14) v1

NameCucsa.300520
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationscaffold02931 : 25938 .. 29955 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTATAGCAAGAGTAGAAATTGAGAAGTTTGATGGAAAGGGAGACTTTGCATTATGGAAAGCAAAGATCAAAGCCTTGCTTGGACAGCAAAAGTCTCATAAAGCCCTTTTAGATCCTTCAGAACTTCCAACAACCCTCACAGCAACACAAAAAGAAGAAATGAAATTAAATGCCTATGGAACACTAATACTAAACCTTAGTGACAACATAATTAGGCAAGTTCTAGAAGAAGAAACAGCATATAAAGTTTGGATGAAACTAGAAAGCCTATATGCCTCTAAAGATCTCCCAAACAAAATGTACCTAAGGGAAAAAAATTTCACATATAGAATGGATCCTTCTAAAACCTTATCTGAAAACTTAGATGAATTCAAGAAAATAGTTTCAGATTTTAAAACCCTTGAGGACAAACTAAGTGATGAAAATGAGGCATTGTCCTTTTAAATTCTTTACCCGAGGCATACAAAGAAGTAAAAAATGCTCTAAAATATGGAAGAGACTCAGTAAAAACAGATGTCATAATATCAGCTCTAAGAACTAGAGAATTAGAAATACATTCATCACACAAAGAAAATCATAGTGGTGATGGATTGTTTGTCAGAGGTAAATCCCAAAACAATCAAAGTAAAAACAGTAACAAATCCTTTTCTAATGAAGACAAGAAAGGAAAACAGAAAAAGAAATGTAAATAAGGTAAAAAGAAGGGACACCTCATACAAGAATGTTTTTtCCTTAAAAGAAAGAATCAAGAAAAAGAAAAAGATTCTAAAGGGAAACAACCAGAAGTCTCTATAGTAGAAAGCTCTTTTACCTACACAGATGCCTTGGCATCAACCTTAGACCAAGCCAACCATGTCAACCCCTTAGGAAAACATGATTGGGTCGTAGACTCAGGATGCACCTACCATATGACACCTTTTAGAGCATGGTTCAATACCTATAGAGAGATCAGTGGAGAATCTGTGTTCATAGGGAATAATAATGAATGTAACATTGCTGGAATTGGATCAGTTACCATGAAACTAAAAGATAAGACTGTAAAACTCCTTAGAAATGTAAGACATGTTCCTCACCTTAAAAGAAATTTAATCTCCCTAAGAATGTTAGACTCTCTAGGGTGTGAATATAAAGGAAAATGTGGAGTTTTCCAAGTGTTTATGGGATCTAAGTTAGTCTTGGTTGGGGAAAAGGTAAATGATTTGTTCATAATAAAAGGAGTAGAAATGATAGAGGAGGCAAATACAGTTTTATCTCTAAACCTAATAGAAGTTGATATTTGGCATAAAAGATTGTCCCACATTAGTCAGAAGGGTCTTGAGGCACTATCTAAACAGGGCATTCTGCCTCAAGACATATGCAGCAAGTTGTCCTTTTGTGAACACTGTGTACTAGGCAAAACAAGAAAACAAAACTTCACCAAAGCACAACACACAACAAGAGGAATCCTAGACTACATCCATTTGGATCTATGGGGTCCTGCATCCACTCCAAGCCTAAGTGGCTCAAGGTATTTTCTATCTTTTACTGATGATTTTTCTAGAAAAAGTTGGATTTGTTTCTTATAAACTAAAGATCAAGTTTTTGAAAAACTTAAAGAATGGAAACTAATGATATAAAAACAAACTGATAAAAATGTTAGATATCTTAGAACTGACAATGGTTTGGAGTTTTGTGGAGAGTTGTTTAATAACTTTTGCAGGCAAAGTGGAATAACAAGACACAAAACAGTGACCTACACACCCCAACAAAACAGAGTGGCAGAAAGACTAAACAAAACTATAATGGAAAGGGTAAGGTGTCTATTATCAGATGTCATCCTAAAAGAAAAGTTTTGGGCTGAAGCTGCTGCCTATGTTATGCACACGTTGAACAGAAGCCCTCATACCTCCTTAGGTCTCCTAACACCTGAGAAAAAGTGATCCAAACACCCACCAAGTCTAAGTGACCTTAAAGTGTTCGGATGTGTAGGGTATGCTCACCAAAATAAAGGGAAACTAAAGGTCAGGGCTGCTAAGTGTATGTTTATTGGTTTCACAGAAGGGGTAAAGGGTTTCAAGTTGTGGCATCCTACTAACAAGAGGTTTATAATCAGTAGGGATGTTCTTTTTAGAGAATAAAAAATGTTTATGCAAGGTAAAAGCAGCACTGAAGGGAACTTTGATGACACACAATCCTATACTACTCAGATTGAGGTGGAGAATACAGGAAAAAGTGTTCAACCTACTGAGGAACCTACAGCTATTGAACAAGAACAAGTGGAGAACTTAAGTGAAGAACAAGATGAAATGCTTGAAGAACAACCTGACTTGAGCCAATATTCCCTAGCAAGAGACAGACAAAGAAGGATAATTGTCCCTCCAGCCAGGTATGCTGAATCTAATTACATAAGTTTTGTTTTGAATGCTACTGTAGTTCCTAATGATTCAGAACCAAGTTCCTTTGATGAAGCTGTGAACAGTAGCAATGCAAGACAGTGGATTGAAGCTATGAATGAAGAAATAAATTCACTAAATGTGAATGATACTTGGACACTAGCTTCTCTACCTAAAGGATGCAAACCAATAGCATCTAAGTGGATTTTCAAACTCAAAGAAGGAATTACTAAAAACTCGCAACCAAGGTACAAGGCAAGGCTGGTAGCAAAGGGTTTCACTCAAAGAGAAGGTATTGACTATTCTGAAATCTTCTCCCCTGTAGTTAAACAAACCTCTATTAGACTTCTCCTATCCCTAGTTGCTCAAAACAACCTAGAACTGGATCAACTTGATGTAAAAACAACTTTTCTCCATGACTATCTAGAAGAGACAATCTATATGGTTCAACCTAAGGGTTATGAGGTTCAAGGTAAGGAAGACCTCTATTATTTACTAAAGAAGTCTATATATGGGTTAAAACAATCTCCTAGATGTTGGTATAGAAGATTTGATGATTTCATTACTAGTTTAGGTTTTCAAAAAAGTTCCTATGATATGTGTGTTTATATATAAACACAACAACCTACAAAGACAAGGTCTACTTGCTACTCAATGTGGATGATATGCTCCTTGCAGGAAGCTCTAAAGAAGACCTGGTACATGTAAAAAATCTTTTAAGTAAATAATTTGACATGAAAGACTTAAGGGAATCTAAGAAGATCCTAGGTATAGACATCATTAGAGATAGGGACCAATCTACACTGAGCATAAACCAAACATCCTACTGTGAAAAAGTGATAAGAAGATTCAACCTCACTAATGCTAGACCAGTTACTCTCCCTATTGCTCATCATTTTAAGCTATCAGCTGTCAATTCCCCTAGTGAAACTGATATAGAACACAAGCTACAAATGAAAAATGTTCCTTACAGTCAAGCAGTGGGAAGTTTAATGTACCTAATGATCTCTACAAGACCTGATCTATCCTATTCAGCAAGCCTAGTCAGCAAATATATGGCTAATTCTGGAAGAAGACACTGGGAAGCTACTAAATGGATAATCAGATACTTAATTTGGTCAAAGAATGCTAGATTGAACTACCAAAGGACCACTGAAATAGAACTAGAACTAATAGGATATGTAGACTCAGACTTTGCTGGTGACAGTGATAAAAGGAGAAGCCTAACCGGTTACGTTTTTCTCTACGGTCGTAATCTAATAAGTTGGAAAGCAATCCTACAATCCATTGTTGCTCTCTCAACTACCGAAGCAGAATATATTGCACTATCAGAAGGTGTAAAAGAGAGATTATGGCTTAAAGGATTGATGAGGGACTTTGGAATCAAATAGTCGATTGTTAAAATCTTTTATGACAACCAAAGTGCCATTCGCCTATCCAAAAATCCTCAATACCACAGCCGAACAAAGCACATAGACATAAAATATCACTTCATACGAGAGAAAATTGAAGCTGGAGAAATTCAAGTATTGAAAGTTCATACCTCTGAGAACGCCGCTGATATGCTCACTAAGTCGGTCTCAACATTGAAGCTTCAGAAGTGCTTCGAACTGATAGGTTTCGACCTACCTGAAAAAGGATAG

mRNA sequence

atggctatagcaagagtagaaattgagaagtttgatggaaagggagactttgcattatggaaagcaaagatcaaagccttgcttggacagcaaaagtctcataaagcccttttagatccttcagaacttccaacaaccctcacagcaacacaaaaagaagaaatgaaattaaatgcctatggaacactaatactaaaccttagtgacaacataattaggcaagttctagaagaagaaacagcatataaagtttggatgaaactagaaagcattgtccttttaaattctttacccgaggcatacaaagaagtaaaaaatgctctaaaatatggaagagactcagtaaaaacagatgtcataatatcagctctaagaactagagaattagaaatacattcatcacacaaagaaaatcatagtggtgatggattgtttgtcagaggtaaatcccaaaacaatcaaagtaaaaacagtaacaaatccttttctaatgaagacaagaaaggaaaacagaaaaagaaatattctaaagggaaacaaccagaagtctctatagtagaaagctcttttacctacacagatgccttggcatcaaccttagaccaagccaaccatgtcaaccccttaggaaaacatgattgggtcgtagactcaggatgcacctaccatatgacaccttttagagcatggttcaatacctatagagagatcagtggagaatctgtgttcatagggaataataatgaatgtaacattgctggaattggatcagttaccatgaaactaaaagataagactgtaaaactccttagaaatgtaagacatgttcctcaccttaaaagaaatttaatctccctaagaatgttagactctctagggtgtgaatataaaggaaaatgtggagttttccaagtgtttatgggatctaagttagtcttggttggggaaaaggtaaatgatttgttcataataaaaggagtagaaatgatagaggaggcaaatacagttttatctctaaacctaatagaagttgatatttggcataaaagattgtcccacattagtcagaagggtcttgaggcactatctaaacagggcattctgcctcaagacatatgcagcaagttgtccttttgtgaacactgtgtactaggcaaaacaagaaaacaaaacttcaccaaagcacaacacacaacaagaggaatcctagactacatccatttggatctatggggtcctgcatccactccaagcctaagtggctcaaggtattttctatcttttactgatgatttttctagaaaaagttggatttgtaaaagcagcactgaagggaactttgatgacacacaatcctatactactcagattgaggtggagaatacaggaaaaagtgttcaacctactgaggaacctacagctattgaacaagaacaagtggagaacttaagtgaagaacaagatgaaatgcttgaagaacaacctgacttgagccaatattccctagcaagagacagacaaagaaggataattgtccctccagccaggtatgctgaatctaattacataagttttgttttgaatgctactgtagttcctaatgattcagaaccaaGTTCCTTTGATGAAGCTGTGAACAGTAGCAATGCAAGACAGTGGATTGAAGCTATGAATgaagaaataaattcactaaatgtgaatgatacttggacactagcttctctacctaaaggatgcaaaccaatagcatctaagtggattttcaaactcaaagaaggaattactaaaaactcgcaaccaaggtacaaggcaaggctggtagcaaagggtttcactcaaagagaaggtattgactattctgaaatcttctcccctgtagttaaacaaacctctattagacttctcctatccctagttgctcaaaacaacctagaactggatcaacttgatgtaaaaacaacttttctccatgactatctagaagagacaatctatatggttcaacctaagggttatgaggttcaaggtaaggaagacctctattatttactaaagaagtctatatatgggttaaaacaatctcctagatgttggtatagaagatttgatgatttcattactagtttaggttttcaaaaaagttcctatgatatggaatctaagaagatcctaggtatagacatcattagagatagggaccaatctacactgagcataaaccaaacatcctactgtgaaaaagtgataagaagattcaacctcactaatgctagaccagttactctccctattgctcatcattttaagctatcagctgtcaattcccctagtgaaactgatatagaacacaagctacaaatgaaaaatgttccttacagtcaagcagtgggaagtttaatgtacctaatgatctctacaagacctgatctatcctattcagcaagcctagtcagcaaatatatggctaattctggaagaagacactgggaagctactaaatggataatcagatacttaatttggtcaaagaatgctagattgaactaccaaaggaccactgaaatagaactagaactaataggatatgtagactcagactttgctggtgacagtgataaaaggagaagcctaaccggttacgtttttctctacggtcgtaatctaataagttggaaagcaatcctacaatccattgttgctctctcaactaccgaagcagaatatattgcactatcagaaggtgtaaaagagagattatggcttaaaggattgatgagggactttggaatcaaatatgccattcgcctatccaaaaatcctcaataccacagccgaacaaagcacatagacataaaatatcacttcatacgagagaaaattgaagctggagaaattcaagtattgaaagttcatacctctgagaacgccgctgatatgctcactaagtcggtctcaacattgaagcttcagaagtgcttcgaactgataggtttcgacctacctgaaaaaggatag

Coding sequence (CDS)

ATGGCTATAGCAAGAGTAGAAATTGAGAAGTTTGATGGAAAGGGAGACTTTGCATTATGGAAAGCAAAGATCAAAGCCTTGCTTGGACAGCAAAAGTCTCATAAAGCCCTTTTAGATCCTTCAGAACTTCCAACAACCCTCACAGCAACACAAAAAGAAGAAATGAAATTAAATGCCTATGGAACACTAATACTAAACCTTAGTGACAACATAATTAGGCAAGTTCTAGAAGAAGAAACAGCATATAAAGTTTGGATGAAACTAGAAAGCATTGTCCTTTTAAATTCTTTACCCGAGGCATACAAAGAAGTAAAAAATGCTCTAAAATATGGAAGAGACTCAGTAAAAACAGATGTCATAATATCAGCTCTAAGAACTAGAGAATTAGAAATACATTCATCACACAAAGAAAATCATAGTGGTGATGGATTGTTTGTCAGAGGTAAATCCCAAAACAATCAAAGTAAAAACAGTAACAAATCCTTTTCTAATGAAGACAAGAAAGGAAAACAGAAAAAGAAATATTCTAAAGGGAAACAACCAGAAGTCTCTATAGTAGAAAGCTCTTTTACCTACACAGATGCCTTGGCATCAACCTTAGACCAAGCCAACCATGTCAACCCCTTAGGAAAACATGATTGGGTCGTAGACTCAGGATGCACCTACCATATGACACCTTTTAGAGCATGGTTCAATACCTATAGAGAGATCAGTGGAGAATCTGTGTTCATAGGGAATAATAATGAATGTAACATTGCTGGAATTGGATCAGTTACCATGAAACTAAAAGATAAGACTGTAAAACTCCTTAGAAATGTAAGACATGTTCCTCACCTTAAAAGAAATTTAATCTCCCTAAGAATGTTAGACTCTCTAGGGTGTGAATATAAAGGAAAATGTGGAGTTTTCCAAGTGTTTATGGGATCTAAGTTAGTCTTGGTTGGGGAAAAGGTAAATGATTTGTTCATAATAAAAGGAGTAGAAATGATAGAGGAGGCAAATACAGTTTTATCTCTAAACCTAATAGAAGTTGATATTTGGCATAAAAGATTGTCCCACATTAGTCAGAAGGGTCTTGAGGCACTATCTAAACAGGGCATTCTGCCTCAAGACATATGCAGCAAGTTGTCCTTTTGTGAACACTGTGTACTAGGCAAAACAAGAAAACAAAACTTCACCAAAGCACAACACACAACAAGAGGAATCCTAGACTACATCCATTTGGATCTATGGGGTCCTGCATCCACTCCAAGCCTAAGTGGCTCAAGGTATTTTCTATCTTTTACTGATGATTTTTCTAGAAAAAGTTGGATTTGTAAAAGCAGCACTGAAGGGAACTTTGATGACACACAATCCTATACTACTCAGATTGAGGTGGAGAATACAGGAAAAAGTGTTCAACCTACTGAGGAACCTACAGCTATTGAACAAGAACAAGTGGAGAACTTAAGTGAAGAACAAGATGAAATGCTTGAAGAACAACCTGACTTGAGCCAATATTCCCTAGCAAGAGACAGACAAAGAAGGATAATTGTCCCTCCAGCCAGGTATGCTGAATCTAATTACATAAGTTTTGTTTTGAATGCTACTGTAGTTCCTAATGATTCAGAACCAAGTTCCTTTGATGAAGCTGTGAACAGTAGCAATGCAAGACAGTGGATTGAAGCTATGAATGAAGAAATAAATTCACTAAATGTGAATGATACTTGGACACTAGCTTCTCTACCTAAAGGATGCAAACCAATAGCATCTAAGTGGATTTTCAAACTCAAAGAAGGAATTACTAAAAACTCGCAACCAAGGTACAAGGCAAGGCTGGTAGCAAAGGGTTTCACTCAAAGAGAAGGTATTGACTATTCTGAAATCTTCTCCCCTGTAGTTAAACAAACCTCTATTAGACTTCTCCTATCCCTAGTTGCTCAAAACAACCTAGAACTGGATCAACTTGATGTAAAAACAACTTTTCTCCATGACTATCTAGAAGAGACAATCTATATGGTTCAACCTAAGGGTTATGAGGTTCAAGGTAAGGAAGACCTCTATTATTTACTAAAGAAGTCTATATATGGGTTAAAACAATCTCCTAGATGTTGGTATAGAAGATTTGATGATTTCATTACTAGTTTAGGTTTTCAAAAAAGTTCCTATGATATGGAATCTAAGAAGATCCTAGGTATAGACATCATTAGAGATAGGGACCAATCTACACTGAGCATAAACCAAACATCCTACTGTGAAAAAGTGATAAGAAGATTCAACCTCACTAATGCTAGACCAGTTACTCTCCCTATTGCTCATCATTTTAAGCTATCAGCTGTCAATTCCCCTAGTGAAACTGATATAGAACACAAGCTACAAATGAAAAATGTTCCTTACAGTCAAGCAGTGGGAAGTTTAATGTACCTAATGATCTCTACAAGACCTGATCTATCCTATTCAGCAAGCCTAGTCAGCAAATATATGGCTAATTCTGGAAGAAGACACTGGGAAGCTACTAAATGGATAATCAGATACTTAATTTGGTCAAAGAATGCTAGATTGAACTACCAAAGGACCACTGAAATAGAACTAGAACTAATAGGATATGTAGACTCAGACTTTGCTGGTGACAGTGATAAAAGGAGAAGCCTAACCGGTTACGTTTTTCTCTACGGTCGTAATCTAATAAGTTGGAAAGCAATCCTACAATCCATTGTTGCTCTCTCAACTACCGAAGCAGAATATATTGCACTATCAGAAGGTGTAAAAGAGAGATTATGGCTTAAAGGATTGATGAGGGACTTTGGAATCAAATATGCCATTCGCCTATCCAAAAATCCTCAATACCACAGCCGAACAAAGCACATAGACATAAAATATCACTTCATACGAGAGAAAATTGAAGCTGGAGAAATTCAAGTATTGAAAGTTCATACCTCTGAGAACGCCGCTGATATGCTCACTAAGTCGGTCTCAACATTGAAGCTTCAGAAGTGCTTCGAACTGATAGGTTTCGACCTACCTGAAAAAGGATAG

Protein sequence

MAIARVEIEKFDGKGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESIVLLNSLPEAYKEVKNALKYGRDSVKTDVIISALRTRELEIHSSHKENHSGDGLFVRGKSQNNQSKNSNKSFSNEDKKGKQKKKYSKGKQPEVSIVESSFTYTDALASTLDQANHVNPLGKHDWVVDSGCTYHMTPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYFLSFTDDFSRKSWICKSSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQKSSYDMESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMRDFGIKYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIGFDLPEKG*
BLAST of Cucsa.300520 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 4.4e-122
Identity = 257/599 (42.90%), Postives = 347/599 (57.93%), Query Frame = 1

Query: 460  ENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAE 519
            E+T   V    E      EQ E L E  +E+        Q+   R R  R  V   RY  
Sbjct: 741  ESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLR-RSERPRVESRRYPS 800

Query: 520  SNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGC 579
            + Y+       ++ +D EP S  E ++     Q ++AM EE+ SL  N T+ L  LPKG 
Sbjct: 801  TEYV-------LISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGK 860

Query: 580  KPIASKWIFKLK-EGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSL 639
            +P+  KW+FKLK +G  K    RYKARLV KGF Q++GID+ EIFSPVVK TSIR +LSL
Sbjct: 861  RPLKCKWVFKLKKDGDCK--LVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSL 920

Query: 640  VAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCW 699
             A  +LE++QLDVKT FLH  LEE IYM QP+G+EV GK+ +   L KS+YGLKQ+PR W
Sbjct: 921  AASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQW 980

Query: 700  YRRFDDFITSL---------------------------------------------GFQK 759
            Y +FD F+ S                                              G   
Sbjct: 981  YMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLS 1040

Query: 760  SSYDME----SKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFK 819
             S+DM+    +++ILG+ I+R+R    L ++Q  Y E+V+ RFN+ NA+PV+ P+A H K
Sbjct: 1041 KSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLK 1100

Query: 820  LSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHW 879
            LS    P  T +E K  M  VPYS AVGSLMY M+ TRPD++++  +VS+++ N G+ HW
Sbjct: 1101 LSKKMCP--TTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHW 1160

Query: 880  EATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLI 939
            EA KWI+RYL  +    L +  +  I   L GY D+D AGD D R+S TGY+F +    I
Sbjct: 1161 EAVKWILRYLRGTTGDCLCFGGSDPI---LKGYTDADMAGDIDNRKSSTGYLFTFSGGAI 1220

Query: 940  SWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMRDFGI-----------KYAIRLSK 998
            SW++ LQ  VALSTTEAEYIA +E  KE +WLK  +++ G+           + AI LSK
Sbjct: 1221 SWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAIDLSK 1280


HSP 2 Score: 144.4 bits (363), Expect = 6.6e-33
Identity = 130/489 (26.58%), Postives = 217/489 (44.38%), Query Frame = 1

Query: 57  LNAYGTLILNLSDNIIRQVLEEETAYKVWMKLES-------IVLLNSLPEAYKEVKNALK 116
           LN +  LI  L+ N+  ++ EE+ A  +   L S        +L        K+V +AL 
Sbjct: 122 LNVFNGLITQLA-NLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 181

Query: 117 YGRDSVKT-----DVIISALRTRELEIHSSHKENHSGDGLFVRGKSQNNQSKNSNKSFSN 176
                 K        +I+  R R  +  SS+    SG     RGKS+N         ++ 
Sbjct: 182 LNEKMRKKPENQGQALITEGRGRSYQ-RSSNNYGRSG----ARGKSKNRSKSRVRNCYNC 241

Query: 177 ED-----------KKGKQKKKYSKGKQPEVSIVESSFTYTDALASTLDQANHVNPLG--K 236
                        +KGK +    K      ++V+++    D +   +++      L   +
Sbjct: 242 NQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNN----DNVVLFINEEEECMHLSGPE 301

Query: 237 HDWVVDSGCTYHMTPFRAWFNTYREISGESVFIGNNNECNIAGIGSVTMKLKDKTVKLLR 296
            +WVVD+  ++H TP R  F  Y      +V +GN +   IAGIG + +K       +L+
Sbjct: 302 SEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLK 361

Query: 297 NVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIE 356
           +VRHVP L+ NLIS   LD  G E       +++  GS ++  G     L+         
Sbjct: 362 DVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQG 421

Query: 357 EANTVLSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKTRKQN 416
           E N   + + I VD+WHKR+ H+S+KGL+ L+K+ ++     + +  C++C+ GK  + +
Sbjct: 422 ELNA--AQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVS 481

Query: 417 FTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYFLSFTDDFSRKSWICKSSTEGN-FDDT 476
           F  +      ILD ++ D+ GP    S+ G++YF++F DD SRK W+    T+   F   
Sbjct: 482 FQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVF 541

Query: 477 QSYTTQIEVENTGKSVQPTEEPTAIE---QEQVENLSEEQDEMLEEQPDLSQYSLARDRQ 517
           Q +   +E E TG+ ++        E   +E  E  S       +  P   Q++   +R 
Sbjct: 542 QKFHALVERE-TGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERM 597


HSP 3 Score: 65.9 bits (159), Expect = 3.0e-09
Identity = 31/97 (31.96%), Postives = 58/97 (59.79%), Query Frame = 1

Query: 1  MAIARVEIEKFDGKGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMKLNAY 60
          M+  + E+ KF+G   F+ W+ +++ LL QQ  HK L   S+ P T+ A    ++   A 
Sbjct: 1  MSGVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAA 60

Query: 61 GTLILNLSDNIIRQVLEEETAYKVWMKLESIVLLNSL 98
            + L+LSD+++  +++E+TA  +W +LES+ +  +L
Sbjct: 61 SAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTL 97

BLAST of Cucsa.300520 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 181.4 bits (459), Expect = 4.8e-44
Identity = 102/292 (34.93%), Postives = 160/292 (54.79%), Query Frame = 1

Query: 719  ESKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSE 778
            E K  +GI I    D+  LS  Q++Y +K++ +FN+ N   V+ P+        +NS  +
Sbjct: 1119 EIKHFIGIRIEMQEDKIYLS--QSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDED 1178

Query: 779  TDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRY 838
                      N P    +G LMY+M+ TRPDL+ + +++S+Y + +    W+  K ++RY
Sbjct: 1179 C---------NTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRY 1238

Query: 839  LIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVF-LYGRNLISWKAILQS 898
            L  + + +L +++    E ++IGYVDSD+AG    R+S TGY+F ++  NLI W    Q+
Sbjct: 1239 LKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQN 1298

Query: 899  IVALSTTEAEYIALSEGVKERLWLKGLMRDFGIKY------------AIRLSKNPQYHSR 958
             VA S+TEAEY+AL E V+E LWLK L+    IK              I ++ NP  H R
Sbjct: 1299 SVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKR 1358

Query: 959  TKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLTKSVSTLKLQKCFELIG 998
             KHIDIKYHF RE+++   I +  + T    AD+ TK +   +  +  + +G
Sbjct: 1359 AKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLG 1399


HSP 2 Score: 162.5 bits (410), Expect = 2.3e-38
Identity = 103/300 (34.33%), Postives = 158/300 (52.67%), Query Frame = 1

Query: 418  SLSGSRYFLSFTDDFSRKSWICKSSTEGNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQ 477
            S   ++YFL+ +    R   + +S   GN ++++   T   ++  G      + PT  + 
Sbjct: 796  SKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIG-----IDNPTKNDG 855

Query: 478  EQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAESNYISFVLNATVVPNDSE 537
             ++ N    + E L+ +P +S                    +++    VLNA  + ND  
Sbjct: 856  IEIIN---RRSERLKTKPQISYNE----------------EDNSLNKVVLNAHTIFNDV- 915

Query: 538  PSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKN 597
            P+SFDE     +   W EA+N E+N+  +N+TWT+   P+    + S+W+F +K     N
Sbjct: 916  PNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGN 975

Query: 598  SQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLH 657
               RYKARLVA+GFTQ+  IDY E F+PV + +S R +LSLV Q NL++ Q+DVKT FL+
Sbjct: 976  PI-RYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLN 1035

Query: 658  DYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFITSLGFQKSSYD 717
              L+E IYM  P+G  +    D    L K+IYGLKQ+ RCW+  F+  +    F  SS D
Sbjct: 1036 GTLKEEIYMRLPQG--ISCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVD 1067


HSP 3 Score: 56.6 bits (135), Expect = 1.8e-06
Identity = 89/430 (20.70%), Postives = 163/430 (37.91%), Query Frame = 1

Query: 33  SHKALLDPSELPTTLTATQKEEMKLNAYGTLILNLS---DNIIRQV--LEEETAYKVWMK 92
           SH  + D  EL + L A   +  +++    L++ L    D II  +  L EE     ++K
Sbjct: 116 SHFHIFD--ELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENLTLAFVK 175

Query: 93  L----ESIVLLNSLPEAYKEVKNALKYGRDSV-----------KTDVIISALRTRELEIH 152
                + I + N   +  K+V NA+ +  ++            K   I       +++ H
Sbjct: 176 NRLLDQEIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRVTKPKKIFKGNSKYKVKCH 235

Query: 153 SSHKENHSGDGLFVRGKSQNNQSKNSNKSFSNEDKKGKQKKKYSKGKQPEVSIVESSFTY 212
              +E H     F   +  NN++K + K        G                       
Sbjct: 236 HCGREGHIKKDCFHYKRILNNKNKENEKQVQTATSHG----------------------- 295

Query: 213 TDALASTLDQANHVNPLGKHDWVVDSGCTYHMTPFRAWFNTYREISGE-SVFIGNNNECN 272
              +A  + + N+ + +    +V+DSG + H+    + +    E+     + +    E  
Sbjct: 296 ---IAFMVKEVNNTSVMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFI 355

Query: 273 IAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYK-GKCGVFQVFMGSK 332
            A    +     D  + L  +V        NL+S++ L   G   +  K GV     G  
Sbjct: 356 YATKRGIVRLRNDHEITL-EDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLM 415

Query: 333 LVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQ-KGLEALSKQGILP 392
           +V     +N++ +I       +A ++ + +     +WH+R  HIS  K LE   K     
Sbjct: 416 VVKNSGMLNNVPVINF-----QAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSD 475

Query: 393 QDICSKLSF----CEHCVLGKTRKQNFTKAQHTT--RGILDYIHLDLWGPASTPSLSGSR 434
           Q + + L      CE C+ GK  +  F + +  T  +  L  +H D+ GP +  +L    
Sbjct: 476 QSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKN 511


HSP 4 Score: 37.4 bits (85), Expect = 1.1e+00
Identity = 38/146 (26.03%), Postives = 65/146 (44.52%), Query Frame = 1

Query: 4   ARVEIEKFDGKGDFALWKAKIKALLGQQKSHKAL--LDPSELPTTLTATQKEEMKLNAYG 63
           A+  I+ FDG+  +A+WK +I+ALL +Q   K +  L P+E+  +    ++      A  
Sbjct: 4   AKRNIKPFDGE-KYAIWKFRIRALLAEQDVLKVVDGLMPNEVDDSWKKAER-----CAKS 63

Query: 64  TLILNLSDNIIRQVLEEETAYKVWMKLESIVLLNSLPEAYKEVKNALKYGRDSVKT---- 123
           T+I  LSD+ +     + TA ++   L+++    SL       K  L     S  +    
Sbjct: 64  TIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLSSEMSLLSH 123

Query: 124 ----DVIISALRTRELEIHSSHKENH 140
               D +IS L     +I    K +H
Sbjct: 124 FHIFDELISELLAAGAKIEEMDKISH 143

BLAST of Cucsa.300520 vs. Swiss-Prot
Match: M300_ARATH (Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana GN=AtMg00300 PE=4 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 2.0e-21
Identity = 50/117 (42.74%), Postives = 71/117 (60.68%), Query Frame = 1

Query: 301 GVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLE 360
           GV +V  G + +L G + + L+I++G     E+N   +    E  +WH RL+H+SQ+G+E
Sbjct: 27  GVLKVLKGCRTILKGNRHDSLYILQGSVETGESNLAETAK-DETRLWHSRLAHMSQRGME 86

Query: 361 ALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTP 418
            L K+G L     S L FCE C+ GKT + NF+  QHTT+  LDY+H DLWG  S P
Sbjct: 87  LLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSDLWGAPSVP 142

BLAST of Cucsa.300520 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 4.3e-16
Identity = 74/321 (23.05%), Postives = 148/321 (46.11%), Query Frame = 1

Query: 700  RRFDDFITSLGFQKSSYDME----------SKKILGIDIIRDRDQSTLSINQTSYCEKVI 759
            +R D+FI  L   KS+++++             ILG+D++ ++   T+ +   S+  ++ 
Sbjct: 1472 QRLDEFINKL---KSNFELKITGTLIDDVLDTDILGMDLVYNKRLGTIDLTLKSFINRMD 1531

Query: 760  RRFN--LTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTR 819
            +++N  L   R  ++P    +K+       +   E + +   +   Q +G L Y+    R
Sbjct: 1532 KKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMS-EEEFRQGVLKLQQLLGELNYVRHKCR 1591

Query: 820  PDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDF 879
             D++++   V++ +     R +     II+YL+  K+  ++Y R    + ++I   D+  
Sbjct: 1592 YDINFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDIGIHYDRDCNKDKKVIAITDASV 1651

Query: 880  AGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMRD 939
              + D + S  G +  YG N+ +  +   +   +S+TEAE  A+ EG  +   LK  +++
Sbjct: 1652 GSEYDAQ-SRIGVILWYGMNIFNVYSNKSTNRCVSSTEAELHAIYEGYADSETLKVTLKE 1711

Query: 940  FG------------IKYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSEN 997
             G             K AI+         + K   IK   I+EKI+   I++LK+    N
Sbjct: 1712 LGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTWIKTEIIKEKIKEKSIKLLKITGKGN 1771


HSP 2 Score: 72.0 bits (175), Expect = 4.1e-11
Identity = 61/264 (23.11%), Postives = 114/264 (43.18%), Query Frame = 1

Query: 459  VENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYA 518
            +E +G  VQ   +   + +E      + + +  ++   L+ Y L RD++R         +
Sbjct: 1197 IEASGSPVQTVNKSAFLNKEFSSLNMKRKRKRHDKNNSLTSYELERDKKR---------S 1256

Query: 519  ESNYISFVLN--ATVVPNDSEPSSFDEAVNSS----NARQWIEAMNEEINSLNVNDTWTL 578
            + N +  + +   TV         ++EA++ +       ++ +A ++E+ +L     + +
Sbjct: 1257 KRNRVKLIPDNMETVSAQKIRAIYYNEAISKNPDLKEKHEYKQAYHKELQNLKDMKVFDV 1316

Query: 579  ASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSI 638
              +      I    I       TK     YKAR+V +G TQ     YS I +  +    I
Sbjct: 1317 -DVKYSRSEIPDNLIVPTNTIFTKKRNGIYKARIVCRGDTQSPDT-YSVITTESLNHNHI 1376

Query: 639  RLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLK 698
            ++ L +    N+ +  LD+   FL+  LEE IY+  P       K      L K++YGLK
Sbjct: 1377 KIFLMIANNRNMFMKTLDINHAFLYAKLEEEIYIPHPHDRRCVVK------LNKALYGLK 1436

Query: 699  QSPRCWYRRFDDFITSLGFQKSSY 717
            QSP+ W      ++  +G + +SY
Sbjct: 1437 QSPKEWNDHLRQYLNGIGLKDNSY 1443

BLAST of Cucsa.300520 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 88.2 bits (217), Expect = 5.6e-16
Identity = 74/321 (23.05%), Postives = 147/321 (45.79%), Query Frame = 1

Query: 700  RRFDDFITSLGFQKSSYDME----------SKKILGIDIIRDRDQSTLSINQTSYCEKVI 759
            +R D+FI  L   KS+++++             ILG+D++ ++   T+ +   S+  ++ 
Sbjct: 1473 QRLDEFINKL---KSNFELKITGTLIDDVLDTDILGMDLVYNKRLGTIDLTLKSFINRMD 1532

Query: 760  RRFN--LTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTR 819
            +++N  L   R  ++P    +K+       +   E + +   +   Q +G L Y+    R
Sbjct: 1533 KKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMS-EEEFRQGVLKLQQLLGELNYVRHKCR 1592

Query: 820  PDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDF 879
             D+ ++   V++ +     R +     II+YL+  K+  ++Y R    + ++I   D+  
Sbjct: 1593 YDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDIGIHYDRDCNKDKKVIAITDASV 1652

Query: 880  AGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMRD 939
              + D + S  G +  YG N+ +  +   +   +S+TEAE  A+ EG  +   LK  +++
Sbjct: 1653 GSEYDAQ-SRIGVILWYGMNIFNVYSNKSTNRCVSSTEAELHAIYEGYADSETLKVTLKE 1712

Query: 940  FG------------IKYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSEN 997
             G             K AI+         + K   IK   I+EKI+   I++LK+    N
Sbjct: 1713 LGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTWIKTEIIKEKIKEKSIKLLKITGKGN 1772


HSP 2 Score: 71.6 bits (174), Expect = 5.4e-11
Identity = 61/264 (23.11%), Postives = 114/264 (43.18%), Query Frame = 1

Query: 459  VENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYA 518
            +E +G  VQ   +   + +E      + + +  ++   L+ Y L RD++R         +
Sbjct: 1198 IEASGSPVQTVNKSAFLNKEFSSLNMKRKRKRHDKNNSLTSYELERDKKR---------S 1257

Query: 519  ESNYISFVLN--ATVVPNDSEPSSFDEAVNSS----NARQWIEAMNEEINSLNVNDTWTL 578
            + N +  + +   TV         ++EA++ +       ++ +A ++E+ +L     + +
Sbjct: 1258 KKNRVKLIPDNMETVSAPKIRAIYYNEAISKNPDLKEKHEYKQAYHKELQNLKDMKVFDV 1317

Query: 579  ASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSI 638
              +      I    I       TK     YKAR+V +G TQ     YS I +  +    I
Sbjct: 1318 -DVKYSRSEIPDNLIVPTNTIFTKKRNGIYKARIVCRGDTQSPDT-YSVITTESLNHNHI 1377

Query: 639  RLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLK 698
            ++ L +    N+ +  LD+   FL+  LEE IY+  P       K      L K++YGLK
Sbjct: 1378 KIFLMIANNRNMFMKTLDINHAFLYAKLEEEIYIPHPHDRRCVVK------LNKALYGLK 1437

Query: 699  QSPRCWYRRFDDFITSLGFQKSSY 717
            QSP+ W      ++  +G + +SY
Sbjct: 1438 QSPKEWNDHLRQYLNGIGLKDNSY 1444

BLAST of Cucsa.300520 vs. TrEMBL
Match: Q75HA9_ORYSJ (Integrase core domain containing protein OS=Oryza sativa subsp. japonica GN=LOC_Os03g46450 PE=4 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 1.5e-132
Identity = 274/616 (44.48%), Postives = 374/616 (60.71%), Query Frame = 1

Query: 445  GNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLAR 504
            G  D+ Q Y + ++VE+        ++ T I    V +  +    +L+ Q +     +A 
Sbjct: 722  GGSDEEQQYVS-VQVEHVD------DQETEIVGNDVNDTVQHSPSVLQPQDE----PIAH 781

Query: 505  DRQRRIIVPPARYAESN---YISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEI 564
             R +R    P R  E     Y +F   A  V N  EP+++ EAV S +  +WI A+ EE+
Sbjct: 782  RRTKRSCGAPVRLIEECDMVYYAFSY-AEQVENTLEPATYTEAVVSGDREKWISAIQEEM 841

Query: 565  NSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSE 624
             SL  N TW L  LPK  KP+  KWIFK KEG++ +  PR+K RLVAKGF+Q  G+DY++
Sbjct: 842  QSLEKNGTWELVHLPKQKKPVRCKWIFKRKEGLSPSEPPRFKVRLVAKGFSQIAGVDYND 901

Query: 625  IFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLY 684
            +FSPVVK +SIR   S+V  ++LEL+QLDVKTTFLH  LEE IYM QP+G+ V GKED  
Sbjct: 902  VFSPVVKHSSIRTFFSIVTMHDLELEQLDVKTTFLHGELEEEIYMDQPEGFIVPGKEDYV 961

Query: 685  YLLKKSIYGLKQSPRCWYRRFDDFITSLGF------------------------------ 744
              LK+S+YGLKQSPR WY+RFD F+ S GF                              
Sbjct: 962  CKLKRSLYGLKQSPRQWYKRFDSFMLSHGFKRSEFDSCVYIKFVNGSPIYLLLYVDDMLI 1021

Query: 745  --------------QKSSYDME----SKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFN 804
                            S +DM+    +KKILG++I RDR+   L ++Q SY +KV++RFN
Sbjct: 1022 AAKSKEQITTLKKQLSSEFDMKDLGAAKKILGMEITRDRNSGLLFLSQQSYIKKVLQRFN 1081

Query: 805  LTNARPVTLPIAHHFKLSAVNSPS-ETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSY 864
            + +A+PV+ PIA HFKLSA+   S + D+E+   M  VPYS AVGSLMY M+ + PDLS+
Sbjct: 1082 MHDAKPVSTPIAPHFKLSALQCASTDEDVEY---MSRVPYSSAVGSLMYAMVCSWPDLSH 1141

Query: 865  SASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSD 924
            + SLVS+YMAN G+ HW+A +WI RYL  + +A L + R   I+  L+GYVDSDFA D D
Sbjct: 1142 AMSLVSRYMANPGKEHWKAVQWIFRYLRGTADACLKFGR---IDKGLVGYVDSDFAADLD 1201

Query: 925  KRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMRDF-GI- 984
            KRRSLTGYVF  G   +SWKA LQ +VA STTEAEY+A++E  KE +WLKGL  +  G+ 
Sbjct: 1202 KRRSLTGYVFTIGSCAVSWKATLQPVVAQSTTEAEYMAIAEACKESVWLKGLFAELCGVD 1261

Query: 985  ---------KYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLT 998
                     + AI L+K+  +H RTKHIDIKYH++R+ +  G+++V K+   +N ADM+T
Sbjct: 1262 SCINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDIVAQGKLKVCKISIHDNPADMMT 1319

BLAST of Cucsa.300520 vs. TrEMBL
Match: Q75HA9_ORYSJ (Integrase core domain containing protein OS=Oryza sativa subsp. japonica GN=LOC_Os03g46450 PE=4 SV=1)

HSP 1 Score: 222.6 bits (566), Expect = 2.1e-54
Identity = 134/410 (32.68%), Postives = 224/410 (54.63%), Query Frame = 1

Query: 56  KLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESI--VLLNSLPEAYKEVKNALKYGRD 115
           KL+  G+++ ++S  + ++++ +  + +V    E +  +LL SLP +Y   ++ +   RD
Sbjct: 112 KLHESGSVLNHIS--VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRD 171

Query: 116 SVKTDVIISALRTRELE--IHSSHKENHSGDGLFVRGKSQN---NQSKNSNKSFSNEDKK 175
            +    +  AL+ RE    +  S+  +  G+ L VRG+S+    N S + +KS S    K
Sbjct: 172 ELTLAEVYEALQNREKMKGMVQSYASSSKGEALQVRGRSEQRTYNDSNDHDKSQSRGRSK 231

Query: 176 GKQKK--KYSKGKQPEVS----IVESSFTYTDALASTLDQANHVNP----------LGKH 235
            + KK  KY K K   +     +       +D  AS +  A + +           +  H
Sbjct: 232 SRGKKFCKYCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASH 291

Query: 236 D-WVVDSGCTYHMTPFRAWFNTYREISGESVF-IGNNNECNIAGIGSVTMKLKDKTVKLL 295
           D W++D+ C++H+   R WF++Y+ +  E V  +G++N   I GIGSV +K  D   + L
Sbjct: 292 DEWILDTACSFHICINRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTL 351

Query: 296 RNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVN-DLFIIKGVEM 355
           ++VRH+P + RNLISL  LD+ G +Y G  GV +V  GS + ++G+  + +L++++G  +
Sbjct: 352 KDVRHIPGMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTL 411

Query: 356 IEE--ANTVLSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKT 415
                A  V      + ++WH RL H+S+ G+  L K+ +L       + FCEHCV GK 
Sbjct: 412 HGSVTAAAVTKDEPSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKH 471

Query: 416 RKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYFLSFTDDFSRKSW 438
           ++  F  + H T+GILDY+H DLWGP+  PSL G+RY L+  DD+SRK W
Sbjct: 472 KRVKFNTSVHRTKGILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKEW 519


HSP 2 Score: 55.5 bits (132), Expect = 4.5e-04
Identity = 32/98 (32.65%), Postives = 57/98 (58.16%), Query Frame = 1

Query: 1  MAIARVEIEKFDGKGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMK-LNA 60
          MA  + ++   D K  F+LW+ K++A+L Q       L+      T   T +E+ K   A
Sbjct: 1  MASMKYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKA 60

Query: 61 YGTLILNLSDNIIRQVLEEETAYKVWMKLESIVLLNSL 98
             + L+LS++I+++VL+++TA ++W+KLESI +   L
Sbjct: 61 LSLIQLHLSNDILQEVLQKKTAAELWLKLESICMSKDL 98


HSP 3 Score: 459.5 bits (1181), Expect = 1.0e-125
Identity = 267/667 (40.03%), Postives = 382/667 (57.27%), Query Frame = 1

Query: 401  GILDYIHLDLWGPASTPSLSGSRYFLSF-TDDFSRKSWICKS-----STEGNFDDTQSYT 460
            G + Y+H+   G       S    F+ +  D+F  + W  ++     S +  F++   Y 
Sbjct: 619  GCVAYVHISDQGRNKLDPKSKKCTFIGYGEDEFGYRLWDDENKKMIRSRDVIFNEGVMYK 678

Query: 461  TQIEVENTGKSVQPTEEPTAIE-QEQVENLSEEQDEMLEE-QPDLSQYSLARD---RQRR 520
             +   +NT  S     EPT +E  + +E+   E  + +E  +PD  Q  +      R  R
Sbjct: 679  DK---QNTSASNSKPIEPTYVEVDDALESPPVESSQSVESIEPDRGQQCVPEPELRRSSR 738

Query: 521  IIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDT 580
            + VP  RY         +N  ++ +  EP  + EA  + +A +W  AM EE+ SL  N T
Sbjct: 739  VPVPNRRY---------MNYMLLTDGGEPEDYSEACQTRDASKWELAMKEEMKSLISNQT 798

Query: 581  WTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQ 640
            W LA LP G K + +KW++++KE    +   RYKARLV KGF Q+EG+DY+EIFSPVVK 
Sbjct: 799  WELAKLPMGKKALHNKWVYRVKE--EHDGSKRYKARLVVKGFQQKEGVDYTEIFSPVVKL 858

Query: 641  TSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIY 700
             +IR +LS+VA   L L+QLDVKT FLH  L+E IYM QP+G+  +GKE++   LKKS+Y
Sbjct: 859  NTIRTVLSIVASEELYLEQLDVKTAFLHGDLDEEIYMHQPEGFSEKGKENMVCRLKKSLY 918

Query: 701  GLKQSPRCWYRRFDDFITSLGF-------------------------------------- 760
            GLKQ+PR WYR+F+ F+   GF                                      
Sbjct: 919  GLKQAPRQWYRKFESFMHKEGFKKCNADHCCFFKRYNSSYIILLLYVDDMLVAGSNMIEI 978

Query: 761  ------QKSSYDME----SKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVT 820
                      +DM+    +KKILG+ I RD+ + TL ++Q  Y  +V++RFN++NA+PV+
Sbjct: 979  RNLKKQLSKEFDMKDLGPAKKILGMQITRDKQKGTLQLSQAEYINRVLQRFNMSNAKPVS 1038

Query: 821  LPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYM 880
             P+A HF+LS   SP     E K  M   PY+ A+GSLMY M+ TRPD+ Y+  +VS++M
Sbjct: 1039 TPLASHFRLSTDQSPKTE--EEKELMAKTPYASAIGSLMYAMVCTRPDIGYAVGVVSRFM 1098

Query: 881  ANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYV 940
            +N G+ HWEA KWI+RYL  +K   L + +    EL++ GYVD+DFAG+ D RRS TGY+
Sbjct: 1099 SNPGKAHWEAVKWILRYLRATKEKCLCFSKG---ELKVQGYVDADFAGEIDHRRSTTGYI 1158

Query: 941  FLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMRDFGI----------- 998
            F  G   +SW + +Q IVALSTTEAEY+A++E  KE +WL+GL+ + G            
Sbjct: 1159 FTVGTTAVSWMSQIQKIVALSTTEAEYVAVTEASKELIWLQGLLTELGFIQERSVLHSDS 1218

BLAST of Cucsa.300520 vs. TrEMBL
Match: A0A151RCT9_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_038328 PE=4 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 1.2e-36
Identity = 132/461 (28.63%), Postives = 220/461 (47.72%), Query Frame = 1

Query: 87  KLESIVLLNSLPEAYKEVKNAL-------KYGRDSVKTDVIISALRTRE---------LE 146
           ++ +++LL+SLPE++     A+       K   D V+  V+   +R RE         L 
Sbjct: 119 EVRALILLSSLPESWNATVTAVSSSSGSNKLKFDDVRDLVLSEEVRRRETGESTTSSVLH 178

Query: 147 IHSSHKENHSGDGLFV---RGKSQNNQSKNSNKSFS--NEDKKGKQKKKY-SKGKQPEVS 206
             S  + +  G G      R KS+N+QS N +K+    N  K G  K +  S  K+ EV 
Sbjct: 179 TESRGRSSTRGTGRGKSKERSKSRNHQSSNKSKTIECWNCGKTGHYKNQCRSASKKQEVK 238

Query: 207 IVESSFTYT---DALASTLDQANHVNPLGKHDWVVDSGCTYHMTPFRAWFNTYREISGES 266
              +  T +   DAL  +L+         +  WV+DSG ++H T  +  F  Y   +   
Sbjct: 239 DEANVATTSGGGDALICSLENK-------EESWVIDSGASFHATSQKELFENYVSGNLGK 298

Query: 267 VFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCG 326
           V++GN   C I G G V +KL   +V  L+NVRH+P L +NLIS+  L S G     +  
Sbjct: 299 VYLGNEQSCEIVGKGVVKIKLNG-SVWELKNVRHIPDLTKNLISVGQLASDGYTTIFQGE 358

Query: 327 VFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEA 386
            +++  G+  +  G+K   L+     +  E  + +        ++WH+RL H+S+KG++ 
Sbjct: 359 QWKISKGAMTIARGKKSGTLY-----KTTEACHLITVAANDNPNLWHQRLGHMSEKGMKI 418

Query: 387 LSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHT-TRGILDYIHLDLWGPASTPSLS 446
           +  +G LP     ++  CE C+ GK ++ +F K   T  +  L+ +H D+WGP +  S+S
Sbjct: 419 MHSKGKLPGLQSMEIEMCEDCIFGKQKRVSFQKGGRTPKKERLELVHSDVWGPTTVSSIS 478

Query: 447 GSRYFLSFTDDFSRKSWI--CKSSTEGNFDDTQSYTTQIEVENTGKSVQP--TEEPTAIE 506
           G +YF++F DD SRK W+   K  +E  F+  + +   +E E TG  ++   T+     E
Sbjct: 479 GKQYFVTFIDDHSRKVWVYFLKHKSE-VFEAFKMWKAMVENE-TGLKIKKLRTDNGGEYE 538

Query: 507 QEQVENLSEEQDEMLEEQ-PDLSQYSLARDRQRRIIVPPAR 517
             + +    E    +E   P   Q +   +R  R +   AR
Sbjct: 539 DTRFKRFCYEHGIRMERTVPGTPQQNGVAERMNRTLTERAR 564


HSP 2 Score: 456.1 bits (1172), Expect = 1.1e-124
Identity = 264/622 (42.44%), Postives = 372/622 (59.81%), Query Frame = 1

Query: 432  FSRKSWICKSSTEGNFDDTQSYTTQIEVENTGKSVQPTE---EPTAIEQEQVENLSEEQD 491
            F  +  I  S  + +  D+++   Q E  N   +++  E   +PT + Q + ++    Q+
Sbjct: 676  FDEREAISISLAKPSVADSEAQVEQNEQGNDEVAIEEPEHQQQPTVMAQVE-QSPQRGQN 735

Query: 492  EMLEEQPDLSQYSLARDRQRRIIVPPARYA--ESNYISFVLNATVVPNDSEPSSFDEAVN 551
              + + P+  + S+A D+ +R   P  R+       +S  L+ +      +P+++++A+ 
Sbjct: 736  SPIPQAPESFKRSIALDKPKRNRKPIQRFGFEPEEDVSRALSIS----QGDPTTYEDAIE 795

Query: 552  SSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARL 611
            S  +  WI AM EE+ SL+ N  W L   PK  K +  KW+F+ KEGI ++    YKARL
Sbjct: 796  SVESAGWIGAMTEEMESLHKNSVWELVPKPKERKLVGCKWVFRKKEGIHEDDAITYKARL 855

Query: 612  VAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYM 671
            VAKG++Q+EG+DY EIFSPVVK TSIRLLLS+ AQ ++E++Q+DVKT FLH  LEE IYM
Sbjct: 856  VAKGYSQKEGVDYDEIFSPVVKHTSIRLLLSIAAQYDMEIEQMDVKTAFLHGDLEEDIYM 915

Query: 672  VQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFIT------------------- 731
             QP+G+   GKE+L   LKKS+YGLKQSPR WY+ FD ++                    
Sbjct: 916  SQPEGFVETGKENLVCRLKKSLYGLKQSPRQWYKPFDTYMLKIGYTRCQYDCCVYYHVFE 975

Query: 732  --------------------SLGFQK------SSYDME----SKKILGIDIIRDRDQSTL 791
                                 L  QK      + +DM+    ++KILGI+I RDR+   +
Sbjct: 976  DGKVILLLLYVDDMLIACRDMLQIQKLKKKLGAEFDMKDLGAAQKILGIEIRRDRNAGKI 1035

Query: 792  SINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVG 851
             ++Q  Y  K++ RFN+  A+ V++P+A HF+LSA   PS  D +    MKNVPY+ AVG
Sbjct: 1036 WLSQEKYIMKILERFNMAEAKVVSIPLAAHFRLSAEQRPS--DQKEIDMMKNVPYASAVG 1095

Query: 852  SLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIEL 911
             LMY MI TRPDL+ + S+VSKYM+N G+RHWEA KWI +YL  ++   + ++R  + E 
Sbjct: 1096 CLMYAMICTRPDLAQAMSVVSKYMSNPGKRHWEAVKWIFKYLKNTRQLGIMFERR-QGEA 1155

Query: 912  ELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKE 971
             + G+VDSDFAGD D+RRS  GYVF  G   +SWKA LQ++ ALSTTEAEY+AL+E  KE
Sbjct: 1156 CVAGFVDSDFAGDLDRRRSTAGYVFTCGGGPVSWKATLQAVTALSTTEAEYMALTEASKE 1215

Query: 972  RLWLKGLMRDFGI-----------KYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQ 989
             +WL GL    G+           + AI L+KN  +H+RTKHID +YH IR+ +EAG I 
Sbjct: 1216 AIWLNGLAGQLGVHQEGVVVKCDSQSAIHLAKNQVFHARTKHIDARYHRIRDWVEAGVII 1275

BLAST of Cucsa.300520 vs. TrEMBL
Match: C6GFP7_FRAAN (Putative gag-pol polyprotein OS=Fragaria ananassa PE=4 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 2.1e-46
Identity = 126/409 (30.81%), Postives = 211/409 (51.59%), Query Frame = 1

Query: 90  SIVLLNSLPEAYKEVKNALKYGRDSVKTDVIISALRTRELEIHSSHKENHSGDGLFVRGK 149
           +++LL+SLP  +K  K  + +      + V  +      LE     +++    GL+VRGK
Sbjct: 119 AVMLLHSLPPLFKHFKTTMIFKELITLSKVCENPKSYIRLE---REEDSSQARGLYVRGK 178

Query: 150 SQN------------NQSKNSNKSFSNED---------------KKGKQKKKYSKGKQPE 209
            +               SK+  K    +D               K+ K+KK    G+  +
Sbjct: 179 ERGRSRNRGGGFQGRGMSKSKGKGKGKKDGCFIYGSPDHWKRNCKQWKEKKAQMSGESSQ 238

Query: 210 VSIVESSFTYTDALASTLDQANHVNPLGKHDWVVDSGCTYHMTPFRAWFNTYREISGESV 269
           ++ V   +   D     +  ++         W +D+ CT+H    R WF+TY+E +  SV
Sbjct: 239 LANVVIGYNDEDGELLAISTSSGA----PRHWTLDTACTFHTCAHRDWFDTYKEGNTRSV 298

Query: 270 FIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGV 329
            +GN++   I GIG V +++ D  V+ L NVRH P L RNLISL  +D +G  +KG+ GV
Sbjct: 299 LMGNDSPSRIMGIGMVKIRMHDGIVRALGNVRHTPGLNRNLISLSTMDRVGFWHKGQNGV 358

Query: 330 FQVFMGSKLVLVGE-KVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEA 389
            +V  G  + + G  + ++++ + G  +  E    +     + ++W +RL H+SQ+GL+ 
Sbjct: 359 LKVGKGQMVYMKGAIQPDNMYKLTGSTV--EGGAGVCTEEDKTELWRRRLGHMSQRGLQE 418

Query: 390 LSKQGILPQDICSKLSFCEHCVLGKTRKQNF--TKAQHTTRGILDYIHLDLWGPASTPSL 449
           L K+  L   + + L FC +C LGK  + +F  + +++ ++G+LDYIH D+WGP++T S 
Sbjct: 419 LHKKEQLDGVMSAALEFCRYCTLGKQTRVSFNLSSSENKSKGVLDYIHTDVWGPSATISK 478

Query: 450 SGSRYFLSFTDDFSRKSWICKSSTEGNFDDTQSYTTQIEVEN-TGKSVQ 468
            G+RYF+SF DDFSRK WI    T+ N   T+    + EV N TG+ ++
Sbjct: 479 GGARYFVSFIDDFSRKVWIFFMKTK-NEVFTKFKEWKAEVGNQTGRKIK 517


HSP 2 Score: 453.8 bits (1166), Expect = 5.6e-124
Identity = 256/562 (45.55%), Postives = 361/562 (64.23%), Query Frame = 1

Query: 471  EPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAE-SNYISFVLN- 530
            E   +  +   N+  E  E    Q    ++S+A+D+ +R   PP RY E +N +++ L+ 
Sbjct: 583  ESVMLHDKPSTNVPIESQEKASVQQS-PKHSIAKDKPKRNTRPPQRYIEEANIVAYALSV 642

Query: 531  ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIF 590
            A  +  ++EPS++ EA+ S +  +WI AM++E++SL  N TW L  LPK  K I  KWIF
Sbjct: 643  AEEIEGNAEPSTYSEAIVSDDCNRWITAMHDEMDSLEKNHTWKLVKLPKEKKLIHCKWIF 702

Query: 591  KLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQ 650
            K KEG++   + RYKA LVAKG++Q  GID++++FSPVVK +SIR LLS+VA ++ ELDQ
Sbjct: 703  KRKEGMSPTDEARYKAMLVAKGYSQIPGIDFNDVFSPVVKHSSIRTLLSIVAMHDYELDQ 762

Query: 651  LDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRC-------WYRR 710
            +DVKT FLH  LEE IYM QP+G+ + GKE+L +  ++S Y     P+            
Sbjct: 763  MDVKTAFLHGELEEDIYMEQPEGFVIPGKENLKF--RRSNYDSCVYPKVVDGSAIYLLLY 822

Query: 711  FDDF---------ITSLGFQKSS-YDME----SKKILGIDIIRDRDQSTLSINQTSYCEK 770
             DD          I  L  Q SS ++M+    +KKILGI+I ++R    L ++Q  Y EK
Sbjct: 823  VDDMLIAAKDKSEIAKLKAQLSSEFEMKDLGAAKKILGIEITKERHSGKLYLSQKGYIEK 882

Query: 771  VIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMIST 830
            V+RRFN+ +A+PV+ P+A HF+LS+   P S+ DIE+   M  VPYS  VGSLMY M+ +
Sbjct: 883  VLRRFNMHDAKPVSTPLAAHFRLSSDLCPQSDYDIEY---MSRVPYSSVVGSLMYAMVCS 942

Query: 831  RPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSD 890
            RPDLS++ S+VS+YMAN G+ HW+A +WI RYL  + +A L + R+ +    L+GYVDSD
Sbjct: 943  RPDLSHALSVVSRYMANPGKEHWKAVQWIFRYLCGTSSACLQFGRSRD---GLVGYVDSD 1002

Query: 891  FAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMR 950
            FAGD D+RRSL GYVF  G   +SWKA LQ+ VALSTTEAEY+A+SE  KE +WL+GL  
Sbjct: 1003 FAGDLDRRRSLAGYVFTIGGCAVSWKANLQATVALSTTEAEYMAISEACKETIWLRGLYT 1062

Query: 951  DF-GI----------KYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSEN 998
            +  G+          + AI L+K+  +H RTKHID++YHFIR  I  G+++V K+ T +N
Sbjct: 1063 ELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDVRYHFIRSVITEGDVKVCKISTHDN 1122

BLAST of Cucsa.300520 vs. TrEMBL
Match: Q2RAY7_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os11g03830 PE=4 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 1.9e-55
Identity = 125/379 (32.98%), Postives = 192/379 (50.66%), Query Frame = 1

Query: 91  IVLLNSLPEAYKEVKNALKYGRDSVKTDVIISALRTREL--EIHSSHKENHSGDGLFVRG 150
           ++LL SLP +Y   ++ + Y RD++    +  AL T+E   ++  S   N   +GL VRG
Sbjct: 87  LILLCSLPSSYANFRDTILYSRDTLTLKEVYDALHTKEKMKKMVPSEGSNSQAEGLVVRG 146

Query: 151 KSQNNQSKNSNKSFSNEDKKGKQK----------------------KKYSKGKQPEVSIV 210
           + Q   +KN ++  S+   +G+ K                      K   K K+    I 
Sbjct: 147 RQQKKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCKRDGHDISECWKLQDKDKRTRKYIP 206

Query: 211 ESSFTYTDALASTLDQANHVNPLGKH--------DWVVDSGCTYHMTPFRAWFNTYREIS 270
           +         A   D+ +    L  +         W++D+ CTYHM P R WF TY  + 
Sbjct: 207 KGKKEEEGKAAIVTDEKSDAELLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQ 266

Query: 271 GESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKG 330
           G +V +G++  C +AGIG V +K+ D  ++ L +VRH P+LKR+LISLR LD  G +Y G
Sbjct: 267 GGTVLMGDDTPCEVAGIGIVQIKMFDGCIRTLSDVRHFPNLKRSLISLRTLDRKGYKYSG 326

Query: 331 KCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKG 390
             G+ ++ +G+ + +V + ++                    N    ++WH RL H+S+ G
Sbjct: 327 GDGILKLILGN-VAVVSDSLS--------------------NSDATNLWHMRLGHMSEIG 386

Query: 391 LEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPS 438
           L  LSK+G+L      KL FCEHC+ GK ++  F  + HTT G LDY+H DLWGPA   S
Sbjct: 387 LAELSKRGLLYGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGTLDYVHSDLWGPARKTS 444


HSP 2 Score: 38.9 bits (89), Expect = 4.3e+01
Identity = 17/33 (51.52%), Postives = 26/33 (78.79%), Query Frame = 1

Query: 65 LNLSDNIIRQVLEEETAYKVWMKLESIVLLNSL 98
          L+LS+NI+++VL+EETA  +W+KLE I +   L
Sbjct: 6  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDL 38


HSP 3 Score: 453.0 bits (1164), Expect = 9.6e-124
Identity = 257/629 (40.86%), Postives = 374/629 (59.46%), Query Frame = 1

Query: 403  LDYIHLDLWGPASTPSLSGSRYFLSFTDD-FSRKSW------ICKS-----STEGNFDDT 462
            + Y+H+D    +   + S   +F+ + D+ F  + W      I +S     + +  + D 
Sbjct: 621  VSYVHIDSDARSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVMYKDR 680

Query: 463  QSYTTQI-EVENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRR 522
             + T+ + E++         +E T   +  V+   E+  E +  Q DLS       R  R
Sbjct: 681  STVTSDVTEIDQKKSEFVNLDELT---ESTVQKGGEKDKENVNSQVDLSTPVXEVRRSSR 740

Query: 523  IIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDT 582
             I PP RY+       VLN  ++ +  EP  ++EA+   N+ +W  AM +E++SL  N T
Sbjct: 741  NIRPPQRYSP------VLNYLLLTDGGEPECYBEALQDENSSKWELAMKDEMDSLLGNQT 800

Query: 583  WTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQ 642
            W L  L  G K + +KW++++K     +   RYKARLV KGF Q+EGIDY+EIFSPVVK 
Sbjct: 801  WELTELXVGKKALHNKWVYRIKN--EHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKM 860

Query: 643  TSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIY 702
            ++IRL+L +VA  NL L+QLDVKT FLH  LEE +YM+QP+G+ VQG+E+L   L+KS+Y
Sbjct: 861  STIRLVLGMVAAENLHLEQLDVKTAFLHGDLEEDLYMIQPEGFIVQGQENLVCKLRKSLY 920

Query: 703  GLKQSPRCWYRRFDDFITSLGFQKSSYDM----------ESKKILGIDIIRDRDQSTLSI 762
            GLKQ+PR WY++FD+F+  +GF++   D            +K+ILG+ IIRD+   TL +
Sbjct: 921  GLKQAPRQWYKKFDNFMHRIGFKRCEADHCCYFAMKDLGXAKQILGMRIIRDKANGTLKL 980

Query: 763  NQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSL 822
            +Q+ Y +KV+ RFN+  A+PV+ P+  HFKLS   SP     E +  M  VPY+ A+GSL
Sbjct: 981  SQSEYVKKVLSRFNMNEAKPVSTPLGSHFKLSKEQSPKTE--EERDHMSKVPYASAIGSL 1040

Query: 823  MYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELEL 882
            MY M+ TRPD++++  +VS++M+  G++HWEA KWI+RYL  S +  L +   T   L+L
Sbjct: 1041 MYTMVCTRPDIAHAVGVVSRFMSRPGKQHWEAVKWILRYLKGSLDTCLCF---TGASLKL 1100

Query: 883  IGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERL 942
             GY D+DF  D D R+S TG+VF      ISW + LQ IV LSTTE EY+A +E  KE +
Sbjct: 1101 QGYGDADFVSDIDSRKSTTGFVFTLSGTAISWASNLQKIVTLSTTEVEYVAATEVGKEMI 1160

Query: 943  WLKGLMRDFGIKY-----------AIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVL 998
            WL G + + G K            AI L+KN  +HS++KHI  KYHFIR  +E   + + 
Sbjct: 1161 WLHGFLDELGKKQEMGILHSDSQSAIFLAKNSAFHSKSKHIQTKYHFIRYLVEDKLVILK 1220

BLAST of Cucsa.300520 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 149.8 bits (377), Expect = 8.8e-36
Identity = 84/200 (42.00%), Postives = 119/200 (59.50%), Query Frame = 1

Query: 522 YISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKP 581
           Y SF++    +    EPS+++EA        W  AM++EI ++    TW + +LP   KP
Sbjct: 73  YHSFLV---CIAKAKEPSTYNEA---KEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKP 132

Query: 582 IASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQ 641
           I  KW++K+K   +  +  RYKARLVAKG+TQ+EGID+ E FSPV K TS++L+L++ A 
Sbjct: 133 IGCKWVYKIKYN-SDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAI 192

Query: 642 NNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDL----YYLLKKSIYGLKQSPRC 701
            N  L QLD+   FL+  L+E IYM  P GY  +  + L       LKKSIYGLKQ+ R 
Sbjct: 193 YNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQ 252

Query: 702 WYRRFDDFITSLGFQKSSYD 718
           W+ +F   +   GF +S  D
Sbjct: 253 WFLKFSVTLIGFGFVQSHSD 265


HSP 2 Score: 108.2 bits (269), Expect = 2.9e-23
Identity = 74/251 (29.48%), Postives = 120/251 (47.81%), Query Frame = 1

Query: 721 KKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETD 780
           K  LG++I R    + ++I Q  Y   ++    L   +P ++P+      SA +     D
Sbjct: 317 KYFLGLEIARSA--AGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVD 376

Query: 781 IEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLI 840
            +         Y + +G LMYL I TR D+S++ + +S++       H +A   I+ Y+ 
Sbjct: 377 AK--------AYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIK 436

Query: 841 WSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVA 900
            +    L Y    E++L++  + D+ F    D RRS  GY    G +LISWK+  Q +V+
Sbjct: 437 GTVGQGLFYSSQAEMQLQV--FSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVS 496

Query: 901 LSTTEAEYIALSEGVKERLWLKGLMRDFGIKY------------AIRLSKNPQYHSRTKH 960
            S+ EAEY ALS    E +WL    R+  +              AI ++ N  +H RTKH
Sbjct: 497 KSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKH 554

BLAST of Cucsa.300520 vs. TAIR10
Match: ATMG00300.1 (ATMG00300.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 106.3 bits (264), Expect = 1.1e-22
Identity = 50/117 (42.74%), Postives = 71/117 (60.68%), Query Frame = 1

Query: 301 GVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLE 360
           GV +V  G + +L G + + L+I++G     E+N   +    E  +WH RL+H+SQ+G+E
Sbjct: 27  GVLKVLKGCRTILKGNRHDSLYILQGSVETGESNLAETAK-DETRLWHSRLAHMSQRGME 86

Query: 361 ALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTP 418
            L K+G L     S L FCE C+ GKT + NF+  QHTT+  LDY+H DLWG  S P
Sbjct: 87  LLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSDLWGAPSVP 142

BLAST of Cucsa.300520 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 87.4 bits (215), Expect = 5.4e-17
Identity = 61/199 (30.65%), Postives = 101/199 (50.75%), Query Frame = 1

Query: 724 LGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVT--LPIAHHFKLSAVNSPSETDI 783
           LGI I      S L ++QT Y E+++    + + +P++  LP+  +  +S    P  +D 
Sbjct: 43  LGIQI--KTHPSGLFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSD- 102

Query: 784 EHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIW 843
                     +   VG+L YL + TRPD+SY+ ++V + M       ++  K ++RY+  
Sbjct: 103 ----------FRSIVGALQYLTL-TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKG 162

Query: 844 SKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVAL 903
           +    L   + ++  L +  + DSD+AG +  RRS TG+    G N+ISW A  Q  V+ 
Sbjct: 163 TIFHGLYIHKNSK--LNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSR 222

Query: 904 STTEAEYIALSEGVKERLW 921
           S+TE EY AL+    E  W
Sbjct: 223 SSTETEYRALALTAAELTW 225

BLAST of Cucsa.300520 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 75.5 bits (184), Expect = 2.1e-13
Identity = 42/96 (43.75%), Postives = 59/96 (61.46%), Query Frame = 1

Query: 553 WIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFT 612
           W +AM EE+++L+ N TW L   P     +  KW+FK K   +  +  R KARLVAKGF 
Sbjct: 40  WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLH-SDGTLDRLKARLVAKGFH 99

Query: 613 QREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQ 649
           Q EGI + E +SPVV+  +IR +L++  Q  LE+ Q
Sbjct: 100 QEEGIYFVETYSPVVRTATIRTILNVAQQ--LEVGQ 132

BLAST of Cucsa.300520 vs. NCBI nr
Match: gi|40538906|gb|AAR87163.1| (putative polyprotein [Oryza sativa Japonica Group])

HSP 1 Score: 482.3 bits (1240), Expect = 2.1e-132
Identity = 274/616 (44.48%), Postives = 374/616 (60.71%), Query Frame = 1

Query: 445  GNFDDTQSYTTQIEVENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLAR 504
            G  D+ Q Y + ++VE+        ++ T I    V +  +    +L+ Q +     +A 
Sbjct: 722  GGSDEEQQYVS-VQVEHVD------DQETEIVGNDVNDTVQHSPSVLQPQDE----PIAH 781

Query: 505  DRQRRIIVPPARYAESN---YISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEI 564
             R +R    P R  E     Y +F   A  V N  EP+++ EAV S +  +WI A+ EE+
Sbjct: 782  RRTKRSCGAPVRLIEECDMVYYAFSY-AEQVENTLEPATYTEAVVSGDREKWISAIQEEM 841

Query: 565  NSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSE 624
             SL  N TW L  LPK  KP+  KWIFK KEG++ +  PR+K RLVAKGF+Q  G+DY++
Sbjct: 842  QSLEKNGTWELVHLPKQKKPVRCKWIFKRKEGLSPSEPPRFKVRLVAKGFSQIAGVDYND 901

Query: 625  IFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLY 684
            +FSPVVK +SIR   S+V  ++LEL+QLDVKTTFLH  LEE IYM QP+G+ V GKED  
Sbjct: 902  VFSPVVKHSSIRTFFSIVTMHDLELEQLDVKTTFLHGELEEEIYMDQPEGFIVPGKEDYV 961

Query: 685  YLLKKSIYGLKQSPRCWYRRFDDFITSLGF------------------------------ 744
              LK+S+YGLKQSPR WY+RFD F+ S GF                              
Sbjct: 962  CKLKRSLYGLKQSPRQWYKRFDSFMLSHGFKRSEFDSCVYIKFVNGSPIYLLLYVDDMLI 1021

Query: 745  --------------QKSSYDME----SKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFN 804
                            S +DM+    +KKILG++I RDR+   L ++Q SY +KV++RFN
Sbjct: 1022 AAKSKEQITTLKKQLSSEFDMKDLGAAKKILGMEITRDRNSGLLFLSQQSYIKKVLQRFN 1081

Query: 805  LTNARPVTLPIAHHFKLSAVNSPS-ETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSY 864
            + +A+PV+ PIA HFKLSA+   S + D+E+   M  VPYS AVGSLMY M+ + PDLS+
Sbjct: 1082 MHDAKPVSTPIAPHFKLSALQCASTDEDVEY---MSRVPYSSAVGSLMYAMVCSWPDLSH 1141

Query: 865  SASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSD 924
            + SLVS+YMAN G+ HW+A +WI RYL  + +A L + R   I+  L+GYVDSDFA D D
Sbjct: 1142 AMSLVSRYMANPGKEHWKAVQWIFRYLRGTADACLKFGR---IDKGLVGYVDSDFAADLD 1201

Query: 925  KRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMRDF-GI- 984
            KRRSLTGYVF  G   +SWKA LQ +VA STTEAEY+A++E  KE +WLKGL  +  G+ 
Sbjct: 1202 KRRSLTGYVFTIGSCAVSWKATLQPVVAQSTTEAEYMAIAEACKESVWLKGLFAELCGVD 1261

Query: 985  ---------KYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSENAADMLT 998
                     + AI L+K+  +H RTKHIDIKYH++R+ +  G+++V K+   +N ADM+T
Sbjct: 1262 SCINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDIVAQGKLKVCKISIHDNPADMMT 1319

BLAST of Cucsa.300520 vs. NCBI nr
Match: gi|40538906|gb|AAR87163.1| (putative polyprotein [Oryza sativa Japonica Group])

HSP 1 Score: 222.6 bits (566), Expect = 3.0e-54
Identity = 134/410 (32.68%), Postives = 224/410 (54.63%), Query Frame = 1

Query: 56  KLNAYGTLILNLSDNIIRQVLEEETAYKVWMKLESI--VLLNSLPEAYKEVKNALKYGRD 115
           KL+  G+++ ++S  + ++++ +  + +V    E +  +LL SLP +Y   ++ +   RD
Sbjct: 112 KLHESGSVLNHIS--VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRD 171

Query: 116 SVKTDVIISALRTRELE--IHSSHKENHSGDGLFVRGKSQN---NQSKNSNKSFSNEDKK 175
            +    +  AL+ RE    +  S+  +  G+ L VRG+S+    N S + +KS S    K
Sbjct: 172 ELTLAEVYEALQNREKMKGMVQSYASSSKGEALQVRGRSEQRTYNDSNDHDKSQSRGRSK 231

Query: 176 GKQKK--KYSKGKQPEVS----IVESSFTYTDALASTLDQANHVNP----------LGKH 235
            + KK  KY K K   +     +       +D  AS +  A + +           +  H
Sbjct: 232 SRGKKFCKYCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASH 291

Query: 236 D-WVVDSGCTYHMTPFRAWFNTYREISGESVF-IGNNNECNIAGIGSVTMKLKDKTVKLL 295
           D W++D+ C++H+   R WF++Y+ +  E V  +G++N   I GIGSV +K  D   + L
Sbjct: 292 DEWILDTACSFHICINRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTL 351

Query: 296 RNVRHVPHLKRNLISLRMLDSLGCEYKGKCGVFQVFMGSKLVLVGEKVN-DLFIIKGVEM 355
           ++VRH+P + RNLISL  LD+ G +Y G  GV +V  GS + ++G+  + +L++++G  +
Sbjct: 352 KDVRHIPGMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTL 411

Query: 356 IEE--ANTVLSLNLIEVDIWHKRLSHISQKGLEALSKQGILPQDICSKLSFCEHCVLGKT 415
                A  V      + ++WH RL H+S+ G+  L K+ +L       + FCEHCV GK 
Sbjct: 412 HGSVTAAAVTKDEPSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKH 471

Query: 416 RKQNFTKAQHTTRGILDYIHLDLWGPASTPSLSGSRYFLSFTDDFSRKSW 438
           ++  F  + H T+GILDY+H DLWGP+  PSL G+RY L+  DD+SRK W
Sbjct: 472 KRVKFNTSVHRTKGILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKEW 519


HSP 2 Score: 55.5 bits (132), Expect = 6.4e-04
Identity = 32/98 (32.65%), Postives = 57/98 (58.16%), Query Frame = 1

Query: 1  MAIARVEIEKFDGKGDFALWKAKIKALLGQQKSHKALLDPSELPTTLTATQKEEMK-LNA 60
          MA  + ++   D K  F+LW+ K++A+L Q       L+      T   T +E+ K   A
Sbjct: 1  MASMKYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKA 60

Query: 61 YGTLILNLSDNIIRQVLEEETAYKVWMKLESIVLLNSL 98
             + L+LS++I+++VL+++TA ++W+KLESI +   L
Sbjct: 61 LSLIQLHLSNDILQEVLQKKTAAELWLKLESICMSKDL 98


HSP 3 Score: 459.5 bits (1181), Expect = 1.5e-125
Identity = 267/667 (40.03%), Postives = 382/667 (57.27%), Query Frame = 1

Query: 401  GILDYIHLDLWGPASTPSLSGSRYFLSF-TDDFSRKSWICKS-----STEGNFDDTQSYT 460
            G + Y+H+   G       S    F+ +  D+F  + W  ++     S +  F++   Y 
Sbjct: 619  GCVAYVHISDQGRNKLDPKSKKCTFIGYGEDEFGYRLWDDENKKMIRSRDVIFNEGVMYK 678

Query: 461  TQIEVENTGKSVQPTEEPTAIE-QEQVENLSEEQDEMLEE-QPDLSQYSLARD---RQRR 520
             +   +NT  S     EPT +E  + +E+   E  + +E  +PD  Q  +      R  R
Sbjct: 679  DK---QNTSASNSKPIEPTYVEVDDALESPPVESSQSVESIEPDRGQQCVPEPELRRSSR 738

Query: 521  IIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDT 580
            + VP  RY         +N  ++ +  EP  + EA  + +A +W  AM EE+ SL  N T
Sbjct: 739  VPVPNRRY---------MNYMLLTDGGEPEDYSEACQTRDASKWELAMKEEMKSLISNQT 798

Query: 581  WTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQ 640
            W LA LP G K + +KW++++KE    +   RYKARLV KGF Q+EG+DY+EIFSPVVK 
Sbjct: 799  WELAKLPMGKKALHNKWVYRVKE--EHDGSKRYKARLVVKGFQQKEGVDYTEIFSPVVKL 858

Query: 641  TSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIY 700
             +IR +LS+VA   L L+QLDVKT FLH  L+E IYM QP+G+  +GKE++   LKKS+Y
Sbjct: 859  NTIRTVLSIVASEELYLEQLDVKTAFLHGDLDEEIYMHQPEGFSEKGKENMVCRLKKSLY 918

Query: 701  GLKQSPRCWYRRFDDFITSLGF-------------------------------------- 760
            GLKQ+PR WYR+F+ F+   GF                                      
Sbjct: 919  GLKQAPRQWYRKFESFMHKEGFKKCNADHCCFFKRYNSSYIILLLYVDDMLVAGSNMIEI 978

Query: 761  ------QKSSYDME----SKKILGIDIIRDRDQSTLSINQTSYCEKVIRRFNLTNARPVT 820
                      +DM+    +KKILG+ I RD+ + TL ++Q  Y  +V++RFN++NA+PV+
Sbjct: 979  RNLKKQLSKEFDMKDLGPAKKILGMQITRDKQKGTLQLSQAEYINRVLQRFNMSNAKPVS 1038

Query: 821  LPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSLMYLMISTRPDLSYSASLVSKYM 880
             P+A HF+LS   SP     E K  M   PY+ A+GSLMY M+ TRPD+ Y+  +VS++M
Sbjct: 1039 TPLASHFRLSTDQSPKTE--EEKELMAKTPYASAIGSLMYAMVCTRPDIGYAVGVVSRFM 1098

Query: 881  ANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSDFAGDSDKRRSLTGYV 940
            +N G+ HWEA KWI+RYL  +K   L + +    EL++ GYVD+DFAG+ D RRS TGY+
Sbjct: 1099 SNPGKAHWEAVKWILRYLRATKEKCLCFSKG---ELKVQGYVDADFAGEIDHRRSTTGYI 1158

Query: 941  FLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMRDFGI----------- 998
            F  G   +SW + +Q IVALSTTEAEY+A++E  KE +WL+GL+ + G            
Sbjct: 1159 FTVGTTAVSWMSQIQKIVALSTTEAEYVAVTEASKELIWLQGLLTELGFIQERSVLHSDS 1218

BLAST of Cucsa.300520 vs. NCBI nr
Match: gi|1012328712|gb|KYP40337.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 163.7 bits (413), Expect = 1.7e-36
Identity = 132/461 (28.63%), Postives = 220/461 (47.72%), Query Frame = 1

Query: 87  KLESIVLLNSLPEAYKEVKNAL-------KYGRDSVKTDVIISALRTRE---------LE 146
           ++ +++LL+SLPE++     A+       K   D V+  V+   +R RE         L 
Sbjct: 119 EVRALILLSSLPESWNATVTAVSSSSGSNKLKFDDVRDLVLSEEVRRRETGESTTSSVLH 178

Query: 147 IHSSHKENHSGDGLFV---RGKSQNNQSKNSNKSFS--NEDKKGKQKKKY-SKGKQPEVS 206
             S  + +  G G      R KS+N+QS N +K+    N  K G  K +  S  K+ EV 
Sbjct: 179 TESRGRSSTRGTGRGKSKERSKSRNHQSSNKSKTIECWNCGKTGHYKNQCRSASKKQEVK 238

Query: 207 IVESSFTYT---DALASTLDQANHVNPLGKHDWVVDSGCTYHMTPFRAWFNTYREISGES 266
              +  T +   DAL  +L+         +  WV+DSG ++H T  +  F  Y   +   
Sbjct: 239 DEANVATTSGGGDALICSLENK-------EESWVIDSGASFHATSQKELFENYVSGNLGK 298

Query: 267 VFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCG 326
           V++GN   C I G G V +KL   +V  L+NVRH+P L +NLIS+  L S G     +  
Sbjct: 299 VYLGNEQSCEIVGKGVVKIKLNG-SVWELKNVRHIPDLTKNLISVGQLASDGYTTIFQGE 358

Query: 327 VFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEA 386
            +++  G+  +  G+K   L+     +  E  + +        ++WH+RL H+S+KG++ 
Sbjct: 359 QWKISKGAMTIARGKKSGTLY-----KTTEACHLITVAANDNPNLWHQRLGHMSEKGMKI 418

Query: 387 LSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHT-TRGILDYIHLDLWGPASTPSLS 446
           +  +G LP     ++  CE C+ GK ++ +F K   T  +  L+ +H D+WGP +  S+S
Sbjct: 419 MHSKGKLPGLQSMEIEMCEDCIFGKQKRVSFQKGGRTPKKERLELVHSDVWGPTTVSSIS 478

Query: 447 GSRYFLSFTDDFSRKSWI--CKSSTEGNFDDTQSYTTQIEVENTGKSVQP--TEEPTAIE 506
           G +YF++F DD SRK W+   K  +E  F+  + +   +E E TG  ++   T+     E
Sbjct: 479 GKQYFVTFIDDHSRKVWVYFLKHKSE-VFEAFKMWKAMVENE-TGLKIKKLRTDNGGEYE 538

Query: 507 QEQVENLSEEQDEMLEEQ-PDLSQYSLARDRQRRIIVPPAR 517
             + +    E    +E   P   Q +   +R  R +   AR
Sbjct: 539 DTRFKRFCYEHGIRMERTVPGTPQQNGVAERMNRTLTERAR 564


HSP 2 Score: 456.1 bits (1172), Expect = 1.6e-124
Identity = 264/622 (42.44%), Postives = 372/622 (59.81%), Query Frame = 1

Query: 432  FSRKSWICKSSTEGNFDDTQSYTTQIEVENTGKSVQPTE---EPTAIEQEQVENLSEEQD 491
            F  +  I  S  + +  D+++   Q E  N   +++  E   +PT + Q + ++    Q+
Sbjct: 676  FDEREAISISLAKPSVADSEAQVEQNEQGNDEVAIEEPEHQQQPTVMAQVE-QSPQRGQN 735

Query: 492  EMLEEQPDLSQYSLARDRQRRIIVPPARYA--ESNYISFVLNATVVPNDSEPSSFDEAVN 551
              + + P+  + S+A D+ +R   P  R+       +S  L+ +      +P+++++A+ 
Sbjct: 736  SPIPQAPESFKRSIALDKPKRNRKPIQRFGFEPEEDVSRALSIS----QGDPTTYEDAIE 795

Query: 552  SSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARL 611
            S  +  WI AM EE+ SL+ N  W L   PK  K +  KW+F+ KEGI ++    YKARL
Sbjct: 796  SVESAGWIGAMTEEMESLHKNSVWELVPKPKERKLVGCKWVFRKKEGIHEDDAITYKARL 855

Query: 612  VAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYM 671
            VAKG++Q+EG+DY EIFSPVVK TSIRLLLS+ AQ ++E++Q+DVKT FLH  LEE IYM
Sbjct: 856  VAKGYSQKEGVDYDEIFSPVVKHTSIRLLLSIAAQYDMEIEQMDVKTAFLHGDLEEDIYM 915

Query: 672  VQPKGYEVQGKEDLYYLLKKSIYGLKQSPRCWYRRFDDFIT------------------- 731
             QP+G+   GKE+L   LKKS+YGLKQSPR WY+ FD ++                    
Sbjct: 916  SQPEGFVETGKENLVCRLKKSLYGLKQSPRQWYKPFDTYMLKIGYTRCQYDCCVYYHVFE 975

Query: 732  --------------------SLGFQK------SSYDME----SKKILGIDIIRDRDQSTL 791
                                 L  QK      + +DM+    ++KILGI+I RDR+   +
Sbjct: 976  DGKVILLLLYVDDMLIACRDMLQIQKLKKKLGAEFDMKDLGAAQKILGIEIRRDRNAGKI 1035

Query: 792  SINQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVG 851
             ++Q  Y  K++ RFN+  A+ V++P+A HF+LSA   PS  D +    MKNVPY+ AVG
Sbjct: 1036 WLSQEKYIMKILERFNMAEAKVVSIPLAAHFRLSAEQRPS--DQKEIDMMKNVPYASAVG 1095

Query: 852  SLMYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIEL 911
             LMY MI TRPDL+ + S+VSKYM+N G+RHWEA KWI +YL  ++   + ++R  + E 
Sbjct: 1096 CLMYAMICTRPDLAQAMSVVSKYMSNPGKRHWEAVKWIFKYLKNTRQLGIMFERR-QGEA 1155

Query: 912  ELIGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKE 971
             + G+VDSDFAGD D+RRS  GYVF  G   +SWKA LQ++ ALSTTEAEY+AL+E  KE
Sbjct: 1156 CVAGFVDSDFAGDLDRRRSTAGYVFTCGGGPVSWKATLQAVTALSTTEAEYMALTEASKE 1215

Query: 972  RLWLKGLMRDFGI-----------KYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQ 989
             +WL GL    G+           + AI L+KN  +H+RTKHID +YH IR+ +EAG I 
Sbjct: 1216 AIWLNGLAGQLGVHQEGVVVKCDSQSAIHLAKNQVFHARTKHIDARYHRIRDWVEAGVII 1275

BLAST of Cucsa.300520 vs. NCBI nr
Match: gi|241993361|gb|ACS74199.1| (putative gag-pol polyprotein [Fragaria x ananassa])

HSP 1 Score: 196.1 bits (497), Expect = 3.0e-46
Identity = 126/409 (30.81%), Postives = 211/409 (51.59%), Query Frame = 1

Query: 90  SIVLLNSLPEAYKEVKNALKYGRDSVKTDVIISALRTRELEIHSSHKENHSGDGLFVRGK 149
           +++LL+SLP  +K  K  + +      + V  +      LE     +++    GL+VRGK
Sbjct: 119 AVMLLHSLPPLFKHFKTTMIFKELITLSKVCENPKSYIRLE---REEDSSQARGLYVRGK 178

Query: 150 SQN------------NQSKNSNKSFSNED---------------KKGKQKKKYSKGKQPE 209
            +               SK+  K    +D               K+ K+KK    G+  +
Sbjct: 179 ERGRSRNRGGGFQGRGMSKSKGKGKGKKDGCFIYGSPDHWKRNCKQWKEKKAQMSGESSQ 238

Query: 210 VSIVESSFTYTDALASTLDQANHVNPLGKHDWVVDSGCTYHMTPFRAWFNTYREISGESV 269
           ++ V   +   D     +  ++         W +D+ CT+H    R WF+TY+E +  SV
Sbjct: 239 LANVVIGYNDEDGELLAISTSSGA----PRHWTLDTACTFHTCAHRDWFDTYKEGNTRSV 298

Query: 270 FIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKGKCGV 329
            +GN++   I GIG V +++ D  V+ L NVRH P L RNLISL  +D +G  +KG+ GV
Sbjct: 299 LMGNDSPSRIMGIGMVKIRMHDGIVRALGNVRHTPGLNRNLISLSTMDRVGFWHKGQNGV 358

Query: 330 FQVFMGSKLVLVGE-KVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKGLEA 389
            +V  G  + + G  + ++++ + G  +  E    +     + ++W +RL H+SQ+GL+ 
Sbjct: 359 LKVGKGQMVYMKGAIQPDNMYKLTGSTV--EGGAGVCTEEDKTELWRRRLGHMSQRGLQE 418

Query: 390 LSKQGILPQDICSKLSFCEHCVLGKTRKQNF--TKAQHTTRGILDYIHLDLWGPASTPSL 449
           L K+  L   + + L FC +C LGK  + +F  + +++ ++G+LDYIH D+WGP++T S 
Sbjct: 419 LHKKEQLDGVMSAALEFCRYCTLGKQTRVSFNLSSSENKSKGVLDYIHTDVWGPSATISK 478

Query: 450 SGSRYFLSFTDDFSRKSWICKSSTEGNFDDTQSYTTQIEVEN-TGKSVQ 468
            G+RYF+SF DDFSRK WI    T+ N   T+    + EV N TG+ ++
Sbjct: 479 GGARYFVSFIDDFSRKVWIFFMKTK-NEVFTKFKEWKAEVGNQTGRKIK 517


HSP 2 Score: 453.8 bits (1166), Expect = 8.1e-124
Identity = 256/562 (45.55%), Postives = 361/562 (64.23%), Query Frame = 1

Query: 471  EPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRRIIVPPARYAE-SNYISFVLN- 530
            E   +  +   N+  E  E    Q    ++S+A+D+ +R   PP RY E +N +++ L+ 
Sbjct: 583  ESVMLHDKPSTNVPIESQEKASVQQS-PKHSIAKDKPKRNTRPPQRYIEEANIVAYALSV 642

Query: 531  ATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPIASKWIF 590
            A  +  ++EPS++ EA+ S +  +WI AM++E++SL  N TW L  LPK  K I  KWIF
Sbjct: 643  AEEIEGNAEPSTYSEAIVSDDCNRWITAMHDEMDSLEKNHTWKLVKLPKEKKLIHCKWIF 702

Query: 591  KLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQTSIRLLLSLVAQNNLELDQ 650
            K KEG++   + RYKA LVAKG++Q  GID++++FSPVVK +SIR LLS+VA ++ ELDQ
Sbjct: 703  KRKEGMSPTDEARYKAMLVAKGYSQIPGIDFNDVFSPVVKHSSIRTLLSIVAMHDYELDQ 762

Query: 651  LDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIYGLKQSPRC-------WYRR 710
            +DVKT FLH  LEE IYM QP+G+ + GKE+L +  ++S Y     P+            
Sbjct: 763  MDVKTAFLHGELEEDIYMEQPEGFVIPGKENLKF--RRSNYDSCVYPKVVDGSAIYLLLY 822

Query: 711  FDDF---------ITSLGFQKSS-YDME----SKKILGIDIIRDRDQSTLSINQTSYCEK 770
             DD          I  L  Q SS ++M+    +KKILGI+I ++R    L ++Q  Y EK
Sbjct: 823  VDDMLIAAKDKSEIAKLKAQLSSEFEMKDLGAAKKILGIEITKERHSGKLYLSQKGYIEK 882

Query: 771  VIRRFNLTNARPVTLPIAHHFKLSAVNSP-SETDIEHKLQMKNVPYSQAVGSLMYLMIST 830
            V+RRFN+ +A+PV+ P+A HF+LS+   P S+ DIE+   M  VPYS  VGSLMY M+ +
Sbjct: 883  VLRRFNMHDAKPVSTPLAAHFRLSSDLCPQSDYDIEY---MSRVPYSSVVGSLMYAMVCS 942

Query: 831  RPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELELIGYVDSD 890
            RPDLS++ S+VS+YMAN G+ HW+A +WI RYL  + +A L + R+ +    L+GYVDSD
Sbjct: 943  RPDLSHALSVVSRYMANPGKEHWKAVQWIFRYLCGTSSACLQFGRSRD---GLVGYVDSD 1002

Query: 891  FAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERLWLKGLMR 950
            FAGD D+RRSL GYVF  G   +SWKA LQ+ VALSTTEAEY+A+SE  KE +WL+GL  
Sbjct: 1003 FAGDLDRRRSLAGYVFTIGGCAVSWKANLQATVALSTTEAEYMAISEACKETIWLRGLYT 1062

Query: 951  DF-GI----------KYAIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVLKVHTSEN 998
            +  G+          + AI L+K+  +H RTKHID++YHFIR  I  G+++V K+ T +N
Sbjct: 1063 ELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDVRYHFIRSVITEGDVKVCKISTHDN 1122

BLAST of Cucsa.300520 vs. NCBI nr
Match: gi|77548583|gb|ABA91380.1| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 226.1 bits (575), Expect = 2.7e-55
Identity = 125/379 (32.98%), Postives = 192/379 (50.66%), Query Frame = 1

Query: 91  IVLLNSLPEAYKEVKNALKYGRDSVKTDVIISALRTREL--EIHSSHKENHSGDGLFVRG 150
           ++LL SLP +Y   ++ + Y RD++    +  AL T+E   ++  S   N   +GL VRG
Sbjct: 87  LILLCSLPSSYANFRDTILYSRDTLTLKEVYDALHTKEKMKKMVPSEGSNSQAEGLVVRG 146

Query: 151 KSQNNQSKNSNKSFSNEDKKGKQK----------------------KKYSKGKQPEVSIV 210
           + Q   +KN ++  S+   +G+ K                      K   K K+    I 
Sbjct: 147 RQQKKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCKRDGHDISECWKLQDKDKRTRKYIP 206

Query: 211 ESSFTYTDALASTLDQANHVNPLGKH--------DWVVDSGCTYHMTPFRAWFNTYREIS 270
           +         A   D+ +    L  +         W++D+ CTYHM P R WF TY  + 
Sbjct: 207 KGKKEEEGKAAIVTDEKSDAELLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQ 266

Query: 271 GESVFIGNNNECNIAGIGSVTMKLKDKTVKLLRNVRHVPHLKRNLISLRMLDSLGCEYKG 330
           G +V +G++  C +AGIG V +K+ D  ++ L +VRH P+LKR+LISLR LD  G +Y G
Sbjct: 267 GGTVLMGDDTPCEVAGIGIVQIKMFDGCIRTLSDVRHFPNLKRSLISLRTLDRKGYKYSG 326

Query: 331 KCGVFQVFMGSKLVLVGEKVNDLFIIKGVEMIEEANTVLSLNLIEVDIWHKRLSHISQKG 390
             G+ ++ +G+ + +V + ++                    N    ++WH RL H+S+ G
Sbjct: 327 GDGILKLILGN-VAVVSDSLS--------------------NSDATNLWHMRLGHMSEIG 386

Query: 391 LEALSKQGILPQDICSKLSFCEHCVLGKTRKQNFTKAQHTTRGILDYIHLDLWGPASTPS 438
           L  LSK+G+L      KL FCEHC+ GK ++  F  + HTT G LDY+H DLWGPA   S
Sbjct: 387 LAELSKRGLLYGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGTLDYVHSDLWGPARKTS 444


HSP 2 Score: 38.9 bits (89), Expect = 6.2e+01
Identity = 17/33 (51.52%), Postives = 26/33 (78.79%), Query Frame = 1

Query: 65 LNLSDNIIRQVLEEETAYKVWMKLESIVLLNSL 98
          L+LS+NI+++VL+EETA  +W+KLE I +   L
Sbjct: 6  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDL 38


HSP 3 Score: 453.0 bits (1164), Expect = 1.4e-123
Identity = 257/629 (40.86%), Postives = 374/629 (59.46%), Query Frame = 1

Query: 403  LDYIHLDLWGPASTPSLSGSRYFLSFTDD-FSRKSW------ICKS-----STEGNFDDT 462
            + Y+H+D    +   + S   +F+ + D+ F  + W      I +S     + +  + D 
Sbjct: 621  VSYVHIDSDARSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVMYKDR 680

Query: 463  QSYTTQI-EVENTGKSVQPTEEPTAIEQEQVENLSEEQDEMLEEQPDLSQYSLARDRQRR 522
             + T+ + E++         +E T   +  V+   E+  E +  Q DLS       R  R
Sbjct: 681  STVTSDVTEIDQKKSEFVNLDELT---ESTVQKGGEKDKENVNSQVDLSTPVXEVRRSSR 740

Query: 523  IIVPPARYAESNYISFVLNATVVPNDSEPSSFDEAVNSSNARQWIEAMNEEINSLNVNDT 582
             I PP RY+       VLN  ++ +  EP  ++EA+   N+ +W  AM +E++SL  N T
Sbjct: 741  NIRPPQRYSP------VLNYLLLTDGGEPECYBEALQDENSSKWELAMKDEMDSLLGNQT 800

Query: 583  WTLASLPKGCKPIASKWIFKLKEGITKNSQPRYKARLVAKGFTQREGIDYSEIFSPVVKQ 642
            W L  L  G K + +KW++++K     +   RYKARLV KGF Q+EGIDY+EIFSPVVK 
Sbjct: 801  WELTELXVGKKALHNKWVYRIKN--EHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKM 860

Query: 643  TSIRLLLSLVAQNNLELDQLDVKTTFLHDYLEETIYMVQPKGYEVQGKEDLYYLLKKSIY 702
            ++IRL+L +VA  NL L+QLDVKT FLH  LEE +YM+QP+G+ VQG+E+L   L+KS+Y
Sbjct: 861  STIRLVLGMVAAENLHLEQLDVKTAFLHGDLEEDLYMIQPEGFIVQGQENLVCKLRKSLY 920

Query: 703  GLKQSPRCWYRRFDDFITSLGFQKSSYDM----------ESKKILGIDIIRDRDQSTLSI 762
            GLKQ+PR WY++FD+F+  +GF++   D            +K+ILG+ IIRD+   TL +
Sbjct: 921  GLKQAPRQWYKKFDNFMHRIGFKRCEADHCCYFAMKDLGXAKQILGMRIIRDKANGTLKL 980

Query: 763  NQTSYCEKVIRRFNLTNARPVTLPIAHHFKLSAVNSPSETDIEHKLQMKNVPYSQAVGSL 822
            +Q+ Y +KV+ RFN+  A+PV+ P+  HFKLS   SP     E +  M  VPY+ A+GSL
Sbjct: 981  SQSEYVKKVLSRFNMNEAKPVSTPLGSHFKLSKEQSPKTE--EERDHMSKVPYASAIGSL 1040

Query: 823  MYLMISTRPDLSYSASLVSKYMANSGRRHWEATKWIIRYLIWSKNARLNYQRTTEIELEL 882
            MY M+ TRPD++++  +VS++M+  G++HWEA KWI+RYL  S +  L +   T   L+L
Sbjct: 1041 MYTMVCTRPDIAHAVGVVSRFMSRPGKQHWEAVKWILRYLKGSLDTCLCF---TGASLKL 1100

Query: 883  IGYVDSDFAGDSDKRRSLTGYVFLYGRNLISWKAILQSIVALSTTEAEYIALSEGVKERL 942
             GY D+DF  D D R+S TG+VF      ISW + LQ IV LSTTE EY+A +E  KE +
Sbjct: 1101 QGYGDADFVSDIDSRKSTTGFVFTLSGTAISWASNLQKIVTLSTTEVEYVAATEVGKEMI 1160

Query: 943  WLKGLMRDFGIKY-----------AIRLSKNPQYHSRTKHIDIKYHFIREKIEAGEIQVL 998
            WL G + + G K            AI L+KN  +HS++KHI  KYHFIR  +E   + + 
Sbjct: 1161 WLHGFLDELGKKQEMGILHSDSQSAIFLAKNSAFHSKSKHIQTKYHFIRYLVEDKLVILK 1220

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC4.4e-12242.90Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
COPIA_DROME4.8e-4434.93Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
M300_ARATH2.0e-2142.74Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana GN=AtMg0... [more]
YH41B_YEAST4.3e-1623.05Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YJ41B_YEAST5.6e-1623.05Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
Q75HA9_ORYSJ1.5e-13244.48Integrase core domain containing protein OS=Oryza sativa subsp. japonica GN=LOC_... [more]
Q75HA9_ORYSJ2.1e-5432.68Integrase core domain containing protein OS=Oryza sativa subsp. japonica GN=LOC_... [more]
A0A151RCT9_CAJCA1.2e-3628.63Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=... [more]
C6GFP7_FRAAN2.1e-4630.81Putative gag-pol polyprotein OS=Fragaria ananassa PE=4 SV=1[more]
Q2RAY7_ORYSJ1.9e-5532.98Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap... [more]
Match NameE-valueIdentityDescription
AT4G23160.18.8e-3642.00 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00300.11.1e-2242.74ATMG00300.1 Gag-Pol-related retrotransposon family protein[more]
ATMG00810.15.4e-1730.65ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
ATMG00820.12.1e-1343.75ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)[more]
Match NameE-valueIdentityDescription
gi|40538906|gb|AAR87163.1|2.1e-13244.48putative polyprotein [Oryza sativa Japonica Group][more]
gi|40538906|gb|AAR87163.1|3.0e-5432.68putative polyprotein [Oryza sativa Japonica Group][more]
gi|1012328712|gb|KYP40337.1|1.7e-3628.63Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
gi|241993361|gb|ACS74199.1|3.0e-4630.81putative gag-pol polyprotein [Fragaria x ananassa][more]
gi|77548583|gb|ABA91380.1|2.7e-5532.98retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR013103RVT_2
IPR025724GAG-pre-integrase_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005488 binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.300520.1Cucsa.300520.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 567..717
score: 3.3
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 321..386
score: 5.0
NoneNo IPR availableunknownCoilCoilcoord: 473..493
scor
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 7..493
score: 4.6E-251coord: 509..934
score: 4.6E
NoneNo IPR availablePANTHERPTHR11439:SF192SUBFAMILY NOT NAMEDcoord: 7..493
score: 4.6E-251coord: 509..934
score: 4.6E
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 782..954
score: 1.56E-8coord: 568..744
score: 1.5

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cucsa.300520Cucumber (Chinese Long) v3cgycucB464
Cucsa.300520Wax gourdcgywgoB601
Cucsa.300520Wild cucumber (PI 183967)cgycpiB451
Cucsa.300520Cucumber (Chinese Long) v2cgycuB428
Cucsa.300520Melon (DHL92) v3.5.1cgymeB494
Cucsa.300520Melon (DHL92) v3.6.1cgymedB493