Cucsa.012630 (gene) Cucumber (Gy14) v1

Name: Cucsa.012630
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: scaffold00164 : 296254 .. 309261 (+)
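
The Location field packs the scaffold name, start, end, and strand into one string. As a minimal sketch, the Python below splits that string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("scaffold00164 : 296254 .. 309261 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 13008 bases on the forward strand of scaffold00164.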
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGATCCTTCCGAAGGCCTGGGTTCAGAATCGGCGGCAATTGGAATCGACTCCACCACCAAGGACGACCTTTGCATGGAGATCGACCCACCTTTCCGCGAGAATTTGGCCACTGCTGATGACTGGAGGAAAGCCCTGAACAAGGTTGTTCCTGCCGTCATTGTGTTGCGCACCACCGCCTGTCGGGCATTCGACACTGAATCGGCTGGTGCGAGCTATGCTACTGGATTCGTTGTTGATAAGCGCCGTGGGATTATCCTCACCAACCGTCATGTTGTTAAGCCAGGTATTGTTGGACCTTCATTTCTGTTGCCTTATTTTTCTTACTTGTGCAATTGCTGAATGGAGTGAAGTTCAATTCTCCTATCTCGATATTGTTTAGCTCTGAAATTCGTTGGGATTAACGTGTAATGGGTTTCAGAAGGTAAATTAAGTCTTCAACTGTCTGATGGGATTTCATTTGGGGTGTTTTATATTTAATTCTTGTTCATCTGGTATTAGAATTCAAGTTATTTCGTATTTTGTTTAAGTGAAACCTGGTTAGTTTGATGAACAAAAGTTTCTTTTtCTTTTTTGATGAAAAATCTGGGAGTAAAAATAAAGAATACAAAAGAGTTAACAAAAATCACAAACCAAAAAGTGGGAGCATAAACACAACGACTAACGACCCAACAAGTTGAGAAAGAGCTCAAATCCAAGAGAATAAGACGGAGGGATAATTAAGAAAGTTTTTAGAGATAGATACCTAGAAAGACACATTAAACTTGGCAAGTGCCCACACCACTTTCCTAGATTTTACATTCCTCTAAAATCTAGTCTAATATTCCATTAAGAAAGAAGAAAGACTAGCATGCCAGAGAGTATATCCTTTATCGTGAAAGGGTGGAATGAAATAGAGCCTTCTCGATCATGTGCCAAAAATCCTTGTTACTAGCCAAGCGATGCGATATGTCTCCAAGATGATATTTCCAAATTGTTACGCCATGTCCCAAGATCACTTCTTGCGATCAAATGGAGACATGAGTCACATGACTGTCTAGACACTCAACGCATATATCCCCGCGAGATAGCATGCTAAACGCCTAGTAAGCGGAATTAACTCAACTTTCCAATCTTAACACAACTTGAATCTTTAGCTTTTATTGCCTAGTTTAAACTTAAATCCTTAACTTATTAAGCGATATGAATATTCACAACGTAACATAACAACAAACTTCTCTTTATTAACTTAATAACATGCGTTCTATAATTCTTTCTCATCTACAAAGAACTATAAGAAAAAGGCTTCCAACACTTAGTGCAACTTTCCCAACTTTCAACTTCTGGCTAACGCCTCAATGTGTAGCAGCTCACCTTACCCTTCTTTAACTTGGCCTCAGCGCCTTGTCACCTGAGGAAGGGAAACTTAACACATAAGTCAAATACTTAGTGAGTGATAAATTTAGAAAACCTTTTGGGGAATACAATTCAATCTCTTAAATTCATTTTCATTTGCATAAATAGGCCTCAACGCCTCATTTCAATAAACCATATAAATCATTCATTAGTCCTACTTATTCCTTTCCACCAAGAACATGAACCATGTCCATGCATAAACTTGGGATTCATTTCACCAGCATCCCTTGGGGAACTCCAGATTCATTTCTCGTTGCTTATGCTATTAATTCAAATGCATTTGGCCAACTTCTCAATGCCTTGTGGTCTACAACACCACCATGGTTTCTTTTACTCATTATGTTGCACATAATAGCTAGCTTATTGATAGGAATAGGGCTAATGGTTTACTCGCATACAAACATCACAATCATCACATAGAAATTTTAGTTTAGAAACTTTCTTTTCAATAAACAACATCAACATCAATATCATTACAATACACATTAAACATAAATACAACATGAGACTCATAATGAATGCTTTCAAAACACTTATTTCTTTTAGAAAACCACTCACTAAAATACTTAGCCTTAGTTTCCTTCCTTTCTAAACGACTACTC
CGAATCCCTTCTTCTCGCCTAATGCTTCCTTAGCAAGCAACAGAATAGTTAGGAAATTCTCAGAATAACAACTCTCTTTAGATCCTTCAATCCTATTGATACCAATCTTAAATTAGATAGTCATCCTTTAGCTTCTCTTAGCTCGCCAGCTCCTACTTAACACTTTAGACGTGCCGGTTTACTTAGCTTCTCAAACCTTGCTTAAGTTTAACTCAACACGTAAACGGCTGTCTAATCAAGGTTAGAAATTTCCTTGAAATACTTAACTTTAAATGACTACTTTCTTCTCGCGTTAAACATCAAAACGCCTAACTTTTTGTGTTGAACACATACTTCTTTATCGCATTGCTCGACTTCAATGCAAACTTCCAAAACGCCTAAGTCTTCAAAACGCCTTCTTCACACTTATTTCAAAATGACCCTTCTATTCCAGACATGGGTCTCACACAAATTGAGGGAGAAACTGGCATTTTCAAAGGATATGATTGAAATCTTCAGATGCTTACCTCTTCCTATAAAGGAAAAaCAAAACTGCAGACGTCTTTCTACAAAGAATACACCTTTGTGACCTCAAGAAGGGAGAAGAATATATTGGAATATGAAACTTCAGACATTATTCTATTGTTGAAGTTTTGATTGCTATACAGAAAATCTTTGTTTGGATGATAAATGGTCTCATAGATGATATGTGAGAGGGGAGGAAAGTTAAGCCCTGTGTCAGAGAAGATTTCAGAAAAAAATTTCAACTGGCTGTACGTGAATTAATGTTATCTTTGTTGAAATGAGAGACAATTTACCCTAAAAAGATCCAAGAAATAAATCTGAGGATGCTGGTAGCGTACACAAATTTTATATTGTTCCATGTCTTTGTGGTCCTTTGCTTCATGATCTCTTCTTGGAACATAACAGAAGAATTTTTGAAGATAAAATCACCCCTTCAGTGTTTTTATGGGATATGTACAACTTTATGCTTCTTGGTGGTGTCACTATTTCAATGATCTATTTGTAACTATCCTCTTGGATTGGAACCCTTTATGTGTGGGTAGTTCTCTTGAGTGGGGATTCTCTCTTCTCCCAGCCCTTAGACTGCATGGATCTTGAAACAGAGACACATCCCTCTCGTTTCTTATATATAAAAAATGTTCAATGTCACTTATGGTTGCATGAATCTTGTAAATTTGACCATTTAATGTTGTTTATTTATTTATTTTTGCACTCTTTGGTTTCAGGACCTGTAGTGGCAGAGGCTATGTTTGTCAATAGGGAAGAAGTTCCTGTACGGCCTATATACAGAGATCCGGTAAGTCTTCTGTGATTTGAAGTGCTTGCAATTCTGGTTCATGCATGTGGCTTTTCGTCGATCAATTTGGTTTACTTCTGAAGCTTTTGTGATGGAATCGTGGGAAAAGGCTGCAGGAAGTTTAGACTTTTGTTAAGAAAAAaCTATTTGTTCAAGAACTAGTATAAATCTATGCGGATGAATTCCAAGGAAAGTCAAAACTTTTGGATAATATTTTAAAAaCCAAGTTTGTTGGATTTGTTATGTTGTCTAAATTTTAAACTTTATGTTCACGAGTGTCTGACACATGATCATCAAACTCAAGTTTTGTTAAATAGTGTCAGCTCATGTTCAACAAATATTTAAGTGTTGAAGTATTTGATGGATGTCGGACATAGACATGAAAATGTATCTGTCTGTGCTTCATAGCAATGAAGGATGAGTTTTCAACATTTTCTTTCATCAATTTGAATAAAAGTGGCGCCGTGTATACCAACATTTTCGTCTTTCGTTTTTTTTtCTAGATGAAAAGTCAAAGAGCTACTTATCTGTTCTGGTTCTTTGAAATGCTAATCCAAATAGCTACCAGTCTTAACAAGTTCAAAAATTCTTCCAAAGTGAAACTCCTCAATATAAACAAACATAGAACAATTGAGTGAGAACATCTGAAGAGCCCAACCAAATTGGAAACTGTTGACATTAAGTGAGATATCAAAAACTC
ACAGAAAGAGAAGAAAGGAGAAACAACACCTAAAACTACAACAACCGTTGGTAGATTCACTAAGGGGTGTTTGGCAACCAACAAATTATTAAGGGGAGTTGAGTTGAGTTGGTATACTTTACAAACTCAATGTTTGGCCCACCAACTTTAAATGTTGGTCGGGTTGAGTTGGCTATTTTACCAACTTAACTCAACTCACCCCAAAGTCTTTCCAGTTCTTTATTCCTCCAATGTTTCTAATAGCTCCACTTATTGACCATTGTCGGCAACTACCTCTACCACACTTTGGTAACATACTCCAACAACCACCTTTGATGACCAATTTCGACAACCACATTCGCATAAGCAACTCCAACGACATACTTTGACTTCCACCTTTGACCAACTTTGACAACCAATTCTGGGCTCTGCCAACCACTCTGATGACAATTTATTGCAACTAACTCCAACGACTACCTTTGCACAAGCAACTTTGGCGAATTATTCTAACTTCTACTTGTGCAACCAACTCTGGGCTTTGCCAATCACCTTCGACGACTACCTTTCTACAAGCAACTTTGGTGACATATTCGGAATTCCACTTTCGCGCGGCTAACTTCGATGGCTAACTCCAGCCTCTGCCAACCACTTTTGACGAAAACCACCTTGGTTGATGAAACTTCAATGACCACCACCTCTAATAACCAACACTAACGTTCACTCAGGGGATTAACTCTGGTGATGCTATGTCTAATGAAACCCTTAATGCCAACCCTAGTTTTGTGCATGCTCAACTCATACGTTCTACAAAACATTAAAGTATAAATAATAGTATTTTGTCTACGTCAAATGCAATATTTGTATGTAGAGAAGGGTGATTATAATAAAAAAaGACTGCAACTTTCTTAACTTTTTTGCTTTTCACTTATGACTTTTATTTTTAAGTTCATATAAAAGTGGTTACTAAAAGAAAGACTGTACTAGAAAATGTACTCTTTTTtCCTTTCTTTTATAATAAATTAGATATTGCATTGATGTGGTTTCATCATCTTCTTTTAGATTACATATGTACCTTAAGTCAAGGATTAGGGGACACACCAGAGCATCTCAACGGGCATCTCAACTAGGTTGACACTCCTTTAACGTCCTCATCATGTCCATACAATACTAGTAGAATATAAGATTCAATAATAAAGTACATTAGGGTAATAGATAGGAGAAAAAGGCAGATAAGAGCTTAATGTACTGATTTGGAGATAAAACACAAATCCAAATAACTTTAAAGGTAAAAGCCGACCAATTCAAACAAAAATCCTGAATAGAGTATGCTCTAAACTCCTTGTTTAGCGTGCACCAAGCGGCTGCATTTCTTTATGCAATGTCCATTCGACCTGACCAAACCATGAAATTCTTACTCCACCATTTCACTTATCAATGAATAATAAATCAACTTAACTCTTTTTTGAAAGGCAAACAAGTTTTGATTGAGTTCTTACTCCACCATTTCTCTTATCAATGAAATTGTCTCTCTTCCAAAAGGGGAAAAaAaTCTAAGCTTTTCATTTGTACTGAAATCCTTTCTTTTTAAGGACCTTATCGTGAAATTTCCAATCCACATGATCATAAACCTTCTCAAACTCAAAGATACCTCCTATTTTTTACCTCTATAATCCTTGATAGCATTGTTAGCAATGAGAACCTGAACTAAGATTTGTTTCCCCCCAATTAAGGTAATCTGAGCCTTAAAGATTGGCGACAGGAGAACCTTTTTCAGTCTTTTAGCCAACACTTAAGATATAATCATATGCACACTCATAACTAGACTGATGGACTTAAAATCCCAACCCTATTCATTTTTCCATTTTGGTAATGAGATGAACAAAAATTTCATTTAGAAAACTTTCCAAGATGTCCCTGTTGAAGAACTCTCTAAAATCCCACTCAAAACCATCCTTAATGTTCTCCTATTTATCCTAGAAAAAGGTCAAAGAGAGTCCATAAGGATCAAGGGGCTTGTTCCT
ATCAAAGTGAACAACTGCATTCTTGATTTCTGCCAACAAAAAAaGGGAGTCCAACTCAGTCTGCCCACTCTAATAGGACATTTGTCACACCATTCTAGATCTTCAACAAAGAGTTTGAGAACCACATCCAGGGCATAGAGGGCCTAGGCAAAAAGATGGAATCTCCTCCTCGACCTCCTTCTCCTCCCTATCTTTCTCCCCTCTATCGTTCTCCAAGGAACCAATGAAGTTCTTGTTTTCTATACCATTCGCCAATCTATGGAAGTAAGTAGAATTGCAGTTCTTGTTCTTATCCCTCTCACTTTAGCTGATATTCTCTATACTTGCTACTTTGCAATGAACAAGATTGTCGAATTGTAGTTATCAAAACCGTTTTCTGCCTTAGATGCTGCCACCAGACAGTCCCTAATCCAAGCAACTGTTACTAGGTCCAACTCCAATCTCTCAGTCCCAAACCTTTAACCCACCACTCTCTTCTAACTACACCACACCTTTCTCCCACACACCATTTCGTCGATGTCCACCATCTTCACGCTTCCCTTTACTGGTTACATTTTTGGAAAGGGGGAGGGCATTTTCCTTATTGAAAAAAaTGAAAATACAGTGTATACCAACTTGGACGAAGTAGTTTTCTTTCAGACTTCTTCTCAAGCATATTTAACCTATTACAAGAAGGTGCCTAGAACCCCCCTCCcTCCCTCCACTCATTTTTCGAATAGTCATTGGTACTTGTTTCTATTTAAAAGTTATGATACCCTAAACCCTTCTGAATATAGTTTTAAAAGTTTGGTGGAAGGGTCTTTTTTGTGATATGACTCCAGACATTATTATAGAGACTTAACTTATTTTTTACTCTGTTTTGCAGTTTTATAGCTTTCTTTTCTTTCAATGGTGAGATGGAAGAGAACACTAACAGTTTGTTGCACATCACTATCATCATTATTAGGTTCATGACTTTGGCTTCTTTCGCTATGACCCTGGTGCAATTCAATTTCTTAATTATGAGGAGATTCCCCTGGCTCCCGAGGCTGCCTGTGTTGGACTTGAGATTAGGGTTGTTGGTAATGATAGTGGGGAGAAGGTACAGTCACATTGTTTTTCCctCTtaCCcTaaaaaaaAAAAAAAAAAGAAAaGAAAGGAAAAAaaGAAaGTTTtGTTTTAGTCCcATTTTATCTTTCTTATTGCTAAACTATGCTGAAGTGATTTTCTCGCATATGTTCCAAATAAAAaTTCAATTCTTCATTATTTAGTATGCAATCTTTGTATTATTTTGAGAGGTTAAATGTGACAGTGGACTTATTAGATTACTTCACTCAAGTTGGCATTTTTGGTGCTCCTATCATGTGCAAATACCAATTTATAGATTCAAACCCAGTAGGGTTAGGAGTTCTTATGCACGGACAAATTTGTGATAAGATCCAAGCATCTTGAAATATATATATATATATTTTGATAAGAAACAGTTGATTAGCATTCTCTTAATTTtTTTTtCGCCATAAAAGTATCTTTTGGGTAAAAAATGTTGAAATATAAAAGATAACAAAATTAAAATATAGTGCTAAGTATTTAAGATACTTCGAAACTCCTCCTTCTTGAGCATACTAACTCAATCCATAGTCTAATTAAACAATTGCGCTCCTATCATTATCTGCACCAATAAAGTACACATCTTATCTCTAAAGTCTGTGTGCATTTGTCTGTTTTTTTtGGATAATAAATGGAGGGAATATTAGAATGACCAAAAAGGCCACAAAGAACAACCTGGGGGCCGGGTAGAGAAAACCTCCTCCTAATAAGCTATATACATTGAGGGCCTTCCAATCATTTAGGATTAGAAGAAAGTTGTGATTATAAAAGAGGATAGTTTCAAACGTACCGAGAAACTTAACCTTAGCCTTGCAGTATTTTGTAATATAAATGTTGTTGGAACGGGGCCTTGGCTTGGGAGGTCTCAATAACAAAAATATGCACTTCTAGCTAAATGGGATTTGAGATTTAT
GGAGGAAGAAGATTCATTTAGGTACCCAACCTAACAATGAAATACTCAAGAACACAGGAAGAACTCAAGAACACAGGAAGAACTCAAGAACAATCAAATATGACAATATATGAAAATATATTGAAATGCAAGCTATGATATCACAAGAACAAGTAACCCCTAGCCTTTCGAGAGGGCTTGACTTTCACCAAATTCCCACTCAAGGAATTTACTACAGAACTCTTTATTTCTTGCTCTCCACAGCCCACCCAAAGCCTCTATTTATAACGAAAACCCTAACCACCTTATTAACTAATTACTAATATGCCCCTTCTAATAACAATACTAATATTCTCCTAATAACTCTAACTAGGTCCCTTACAGATTCTTTATGAAGACAAGTCATAAGAAGCATTCATGGTAAAGAACCTTTTAATTGGCACACTATTGGAAAATTTGGAAATAGTTTGAGAAGTCCTTGGATCAATATTTCAAGAGCTTGGAGAAAGATGGAAGCCTTGGCTTTCTCCAAGCTTGGAAATGGTACTAGAATTGCCTTTTGGACTAACTTGTGGGTCGAATGCCAACGGAGGTTTTGAATTTTAGAACTTCTTTAAATCATTTAAAAAaTTCTTCCTGCTATTCAGTTGTTCTCTAACTTACCCGTTAAATTATTTGGTTGCATCGCTTATGTTCATAATCCCAATCCTATCATGCTAAACTTAACCTTCAGGCTGTTAAAATGCATTCTTGTAGGCTGTCTCCCATAAGAAGGCCTGAAAATGTTTTGACCCCTTGACCAAACAGTATTTTGAGAGTATGAATGTATCTTTTTTtATTTTTTtGGAAAATCAACCTTTTTAGCCAAAATTCTCTCTAGGGGGAGCCACTTAACCTTGAAGATAATTTTTGGGACACCACACTATCCCAAACATCATTGATTCCTATTTGATGCCAAGTATGGAGAGTTCTTCTTCAGGGGGAGAAACATTATAGACTGACTTGACAAGTAAAAAACCTGAACTACGGGTTTATACTATAAGAGACTTGACTCAAAGGAATTGAGAACGGGCAGTTGATCTATCATATAACCAATCTAATACCCTAATAAGTGATTCTGAAGATCCAGGTAATGTACATTCTCCTTTTACTTCTTCTAATTCTCCCACTTCTTCTTCTTATAATCCTTGCCTGATGTTTCTAATCTTGATATTCCAATTGCCCATAGGAAAGGAAATGTGTCAAATATCCCATTATAAACTTTCTTATGATAAATTGTTTGACAGTCATAAAGCCTTCACATCCAAAATAACCAACCTGTTTGTTCCAAGGAATATACATGCGGCATTAAATGATTTGAATTGGAAATTAGCAGTGATGGAAGAGATGAATCACTGAAACAAAATTGCCCATGGGACATAGTTGAACTACCAAAAGAAAAGAAAACAGTTGGATACAAGTTGGTGTTCATGGTAAAATGTAAAGTTGATGGCAGTATTGAAAGGTACAAGGCCAGATTGGTTCCTAAGGGGCTTGCTCAAATCTATGGAGTTGATTATCAAGAAACATTGGCTCTAGTTGCTAAAATCAATTCTATTAGAATTTTGTTGTCTGTCGCAATTAATTTTGATTGACCTCTTTATCAACGGGATGTTAAGAATGCTTTTCTCATTGGAGAAATGGAAGAAGCGCTATTTATGGACTTGTCACCTAGTTTTGAGGTAGATCTCGAGATTAACAAAGTGTGCAAGTTAAAGAAATCACTATACCGTTTTAAATAGTCTCCTACAGCCTGGTTTGAACGCTTTGAAAAAGTAGTCACGAGCTATGGATTTAGTCAATGTCAAGCCGATCATACGATGTTGTACAAACATACATGAAATGACAAGGTTGTTGGTTTGATAGTTTGTGTTGATGATATCATTCTTAAAGGCAGTGATGAGACGTAACTAACTTTTGTGAAGAAAAAGTTAGCACATGATTTCCAAATCAAAGACCTAGGAACTTTAAAGTACTTCCTA
GGCATGGAGTTTGCCAGGTCCAAAAGTGGCATTCTTTCCAACCAAAGTAAGTATGTTCTTGATTTGCTAAAAGAGAGGTATACCTGAGTGTAAGGTAGTAGAAACTTCCGTTGAGCAGAATCTGAAATTGGAAGCTTCAACTAAGAATGAAATAAAGAAAaGAAAaAaGTACCGGAAACTTGTGGGGAGACTCATGTATCTTTCTCACACACGTCTTGATGTCGCTTTTGTAGTTAATATGGTAAGTTAGTTCATGCATGCACCTGGCCCTCTCACTTTGATGCAATTTATAGAATCCTAAGATATTTGAAAGGTACTTCAGGAAAAGGCTTATTGTTTCAAAGAAATGACCATCTCAATGTTGAAGTATATACTGATGCTGATTTGACAGGTAGCACGACTTATAGAATATCCACTTCTGGATACTGCTCCTTTAGTTGAGGAAACCTTGTTACTTGGCAAAGTGAAAAACAAAGTGTGATTGCAAGAAGTAGAGCAGAAGCGGAATTTAGAGCTTTAGTCCATGATATTTGTGAGGGTATATGGATAAGAAGATTGTTGGAAGAATTGAGATTTACTCAAACAATGCCATGCACATTTTCTGTGATAACGAGGCAGCAATTTCCATTGCCCACAATCCGGTCCTTCATGACAGGACAAAACATATTGAAGTCGATAAATACTTTATAAAGAAAAAGATCGATGTGGGAGTAATATGCATACGTATCTTCTAAAAACAAAATAAATTGCAAATGTGTTAACTAAAACCCTTCCAAAGTGGCAATTCAAAAAATTGATTGACAAGCTGGCCATGACTGATTTCTTCAAACTAGCTTAAGGGAGAGTGTTGATTATTTCCTTTCTTGTATTATATTTTATTGTATTATAATTGCCAAATTTATTTTTtCCTTCTATGTAATGGGTTGTTCTTCTATTTAAGACAACCCTTCTCTCTTTGTGAAATATACAAGAAATATATTATTTTGGCATGGACTCTTTTGTAGGTACACATGATCTGTAAATATCATAGGTTCATATTAACCAACATATTACAGAAAAAGAGAGTGGTAGATAGAGGGAGAAACTACCAGAACCATTACATCCATAATGATTCTTTATTCTTAATAGTTAATTATGTTTTAAGCTTTTAATTTAACATATCATTATTTCTGAACTTCATTTATATCCATAATGATCCTTAATGAATTCCCATTGCAAGATAATTCCATTTTTTtGTGTGTTTTTGCCAATTGAGTCTCAATAATTTTACAGGTTTCTATCTTAGCCGGTACCCTTGCTCGGCTGGATCGGGAGGCTCCTCACTATAAGAAGTATGTTGTGCACATCTAATTGTAGTTTTCATAAATTACTTTTTCCAGGCTGTTATATTCACTTCAAATGTTGACATCAAATATATTATCCTGGTATTATTATACATCTTTATTTTTtATTTTATGTTATGAATTTGTAGTAGATCAATCACTTCTATTTTAGATTTTGATAGGAATATAAAAAAAaTTAACATAAACTCAAGCCATATAAAAAaTATATTTACAATTTTAGACATTTTTAATCTTATTTTGTTAATGTAATATTTTTtCCCTAAAAAGATGGGATTTTTAGACTTATTATGGGGAATTAAGTTTCTGGGTTTACTATTTGGTGCATTATTTATAATTTTCTTTGAAAAATAACAATATTCAAAAGGGACACTAATTATTAACTTTGTAACAACGTGTATGTTAGTCCCATTTAATTTGTTCAATGTAGAAAGTTAGATTTCTTATTAAAAAaCAGTGTGTATGTTAATAGGCCTTTATATTTATATCTTTTTCTGCTCCTTTGCCCCATATAATGTCTGGAATATGAACACTAGGCTCTTCAATGACAAACAAAAGATCTTTGACATCTAGCCCCATATGTATGCAAGTGATTGATGTTTTTTTTTtGTAACTCTTTTGATGTATGGGTTTTTTCGCTTCCCTGGTCCTTTGGCACGT
TCTCCTTATGAGCCAATTAAAGTTTTTGTTTCTTTTAAGAAAGAAGTGAAATCTGGAATAGAAGTAGCTCCTGTTCATTTTCCTCTTACGTTCTATTGTTCTTTTTGAAATCTGAAAGAAGCTTTTTTtCTCTTTTCCTTTTCCTTTTtCTCCCCGGATAAATTCATTATTAGCATGATGCTATTTCAATTTTGGACCCAAGTTTATCCTATTTTTTTtATAGTTTTtCTGTTTGTTGTCTGACCTCTAGCCTGCTCTTTGTCAGATCTATTAAATATTTTTTCGTTCTAGTAGCATGACAATTGTTTTTATGCTTGTTGCAGAGATGGATATAATGATTTCAACACATTCTACATGCAAGTGAGTGATTTAACATTTTGTACGTCAATAAATCTTTTTTGTCTTTTATTAGAACCGATTAGAAGATTGCCCCTATGGTTAAAGCTTGTGCAAGTTTTCTTCCCAGATTTTGAAGATTGGTTTTCTGAAGTATGTTTAGTTGGTTTAGAAGTTTATCCTTCGCTATTTTGCCTTCTAGTAAGGCTTTTATCTAAGGTTCAGGCGAGTCAATCATGTTTTGATTTCACTTCCAAGCTTCATGTTTTATTTCATTAGTTTAGTGGAAAAGTCTTGTTTCCGTTTAGAAAAAaCAATTTCCAAGCCTCAAAGCTCTATTTCTTTTGAGGTTCAAGTTTTGCCCTCTGCTGGATGTCTCTTCAGTTGGATAACCTTCTTTTGAAACAGAGACAAGCCTCTTTATTGATAATAAATGAGACTAATGCTCAAATGACAAGAGTATTATACTAAGAGAAGAATAAGAAAGAAGGGAAAGCAAAGTTCTTGTTTCCGTTCCAAAAaAAAAAaaaGAGAAGAATAAGAAAAGACAAAAATCATCCAAATAAACTAAACAATGATGAACCCGGGCAATACAAAAGGGAAAGCGACCAACAAACGATCACAAAGAAGAAACAACGAAAGAACCAAAGACACAAAGAGTTGAGACATTAG

mRNA sequence

ATGGCTGATCCTTCCGAAGGCCTGGGTTCAGAATCGGCGGCAATTGGAATCGACTCCACCACCAAGGACGACCTTTGCATGGAGATCGACCCACCTTTCCGCGAGAATTTGGCCACTGCTGATGACTGGAGGAAAGCCCTGAACAAGGTTGTTCCTGCCGTCATTGTGTTGCGCACCACCGCCTGTCGGGCATTCGACACTGAATCGGCTGGTGCGAGCTATGCTACTGGATTCGTTGTTGATAAGCGCCGTGGGATTATCCTCACCAACCGTCATGTTGTTAAGCCAGatgaatcactgaaacaaaattgcccatgggacatagttgaactaccaaaagaaaagaaaacagttggatacaagttggtgttcatggtaaaatgtaaagttgatggcagtattgaaaggtacaaggccagattggttcctaaggggcttgctcaaatctatggagttgattatcaagaaacattggctctagttgctaaaatcaattctattagaattttgttgtctaaaaagttagcacatgatttccaaatcaaagacctaggaactttaaagtacttcctaggcatggagtttgccaggtccaaaagtggcattctttccaaccaaagtatacctgagtgtaaggtagtagaaacttccgttgagcagaatctgaaattggaagcttcaactaagaatgaaataaagaaaagaaaaaagtaccggaaacttgtggggagactcatgtatctttctcacacacgtcttgatgtcgcttttgtagttaatatgatttactcaaacaatgccatgcacattttctgtgataacgaggcagcaatttccattgcccacaatccggtccttcatgacaggacaaaacatattgaagtcgataaatactttataaagaaaaagatcgatagaagaataagaaaagacaaaaaTCATCCAAATAAACTAAACAATGATGAACCCGGGCAATACAAAAGGGAAAGCGACCAACAAACGATCACAAAGAAGAAACAACGAAAGAACCAAAGACACAAAGAGTTGAGACATTAG

Coding sequence (CDS)

ATGGCTGATCCTTCCGAAGGCCTGGGTTCAGAATCGGCGGCAATTGGAATCGACTCCACCACCAAGGACGACCTTTGCATGGAGATCGACCCACCTTTCCGCGAGAATTTGGCCACTGCTGATGACTGGAGGAAAGCCCTGAACAAGGTTGTTCCTGCCGTCATTGTGTTGCGCACCACCGCCTGTCGGGCATTCGACACTGAATCGGCTGGTGCGAGCTATGCTACTGGATTCGTTGTTGATAAGCGCCGTGGGATTATCCTCACCAACCGTCATGTTGTTAAGCCAGATGAATCACTGAAACAAAATTGCCCATGGGACATAGTTGAACTACCAAAAGAAAAGAAAACAGTTGGATACAAGTTGGTGTTCATGGTAAAATGTAAAGTTGATGGCAGTATTGAAAGGTACAAGGCCAGATTGGTTCCTAAGGGGCTTGCTCAAATCTATGGAGTTGATTATCAAGAAACATTGGCTCTAGTTGCTAAAATCAATTCTATTAGAATTTTGTTGTCTAAAAAGTTAGCACATGATTTCCAAATCAAAGACCTAGGAACTTTAAAGTACTTCCTAGGCATGGAGTTTGCCAGGTCCAAAAGTGGCATTCTTTCCAACCAAAGTATACCTGAGTGTAAGGTAGTAGAAACTTCCGTTGAGCAGAATCTGAAATTGGAAGCTTCAACTAAGAATGAAATAAAGAAAaGAAAaAaGTACCGGAAACTTGTGGGGAGACTCATGTATCTTTCTCACACACGTCTTGATGTCGCTTTTGTAGTTAATATGATTTACTCAAACAATGCCATGCACATTTTCTGTGATAACGAGGCAGCAATTTCCATTGCCCACAATCCGGTCCTTCATGACAGGACAAAACATATTGAAGTCGATAAATACTTTATAAAGAAAAAGATCGATAGAAGAATAAGAAAAGACAAAAATCATCCAAATAAACTAAACAATGATGAACCCGGGCAATACAAAAGGGAAAGCGACCAACAAACGATCACAAAGAAGAAACAACGAAAGAACCAAAGACACAAAGAGTTGAGACATTAG

Protein sequence

MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTTACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKPDESLKQNCPWDIVELPKEKKTVGYKLVFMVKCKVDGSIERYKARLVPKGLAQIYGVDYQETLALVAKINSIRILLSKKLAHDFQIKDLGTLKYFLGMEFARSKSGILSNQSIPECKVVETSVEQNLKLEASTKNEIKKRKKYRKLVGRLMYLSHTRLDVAFVVNMIYSNNAMHIFCDNEAAISIAHNPVLHDRTKHIEVDKYFIKKKIDRRIRKDKNHPNKLNNDEPGQYKRESDQQTITKKKQRKNQRHKELRH*
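
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below shows that relationship in Python, assuming the standard genetic code; `translate` is a hypothetical helper written for illustration. Input is upper-cased first because the CDS on this page mixes upper- and lower-case letters.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon, 'X' an unknown codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))

# First seven codons and last three codons of the CDS listed above.
print(translate("ATGGCTGATCCTTCCGAAGGC"))  # MADPSEG
print(translate("AGACATTAG"))              # RH*
```

The two fragments reproduce the start (MADPSEG) and the end (RH*) of the protein sequence shown on this page.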
BLAST of Cucsa.012630 vs. Swiss-Prot
Match: DEGP7_ARATH (Protease Do-like 7 OS=Arabidopsis thaliana GN=DEGP7 PE=2 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 1.9e-35
Identity = 73/96 (76.04%), Positives = 85/96 (88.54%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          M DP E LGS+ A++  +S  K+DLC+EIDPP  E++ATA+DWR+AL KVVPAV+VLRTT
Sbjct: 1  MGDPLERLGSQ-ASMATESVMKEDLCLEIDPPLTESVATAEDWRRALGKVVPAVVVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGF+VDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFIVDKRRGIILTNRHVVKP 95
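
The Identity and Positives percentages in an HSP header are the reported counts divided by the alignment length. The Python sketch below recomputes them for the HSP above and counts identical positions in its second alignment block. `percent` and `count_identities` are hypothetical helper names, and treating "-" as a gap character is an assumption about the alignment format.

```python
def percent(count, aligned_length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)

def count_identities(query, sbjct):
    """Count columns where both aligned rows carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    # Assumption: '-' denotes a gap and never counts as an identity.
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")

print(percent(73, 96))  # 76.04
print(percent(85, 96))  # 88.54

# Second block of the alignment above: the rows differ at a single position.
query = "ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP"
sbjct = "ACRAFDTESAGASYATGFIVDKRRGIILTNRHVVKP"
print(count_identities(query, sbjct), "of", len(query))
```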

BLAST of Cucsa.012630 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 70.1 bits (170), Expect = 5.5e-11
Identity = 63/198 (31.82%), Positives = 93/198 (46.97%), Query Frame = 1

Query: 3    DPSEGLGSESAA----IGIDSTTKDDLCMEIDPPFRENLATA-----DDWRKALNKVVPA 62
            +P+E   SE+A     IGID+ TK+D  +EI     E L T      ++   +LNKVV  
Sbjct: 824  NPNESRESETAEHLKEIGIDNPTKND-GIEIINRRSERLKTKPQISYNEEDNSLNKVVLN 883

Query: 63   VIVLRTTACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKPDESLKQNCPWDIVELPK 122
               +      +FD           +  DK       N  +     + K N  W I + P+
Sbjct: 884  AHTIFNDVPNSFDEIQ--------YRDDKSSWEEAINTEL----NAHKINNTWTITKRPE 943

Query: 123  EKKTVGYKLVFMVKCKVDGSIERYKARLVPKGLAQIYGVDYQETLALVAKINSIRILLSK 182
             K  V  + VF VK    G+  RYKARLV +G  Q Y +DY+ET A VA+I+S R +LS 
Sbjct: 944  NKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSL 1003

Query: 183  KLAHDFQIKDLGTLKYFL 192
             + ++ ++  +     FL
Sbjct: 1004 VIQYNLKVHQMDVKTAFL 1008


HSP 2 Score: 42.0 bits (97), Expect = 1.6e-02
Identity = 16/43 (37.21%), Positives = 28/43 (65.12%), Query Frame = 1

Query: 266  NAMHIFCDNEAAISIAHNPVLHDRTKHIEVDKYFIKKKIDRRI 309
            N + I+ DN+  ISIA+NP  H R KHI++  +F ++++   +
Sbjct: 1324 NPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNV 1366

BLAST of Cucsa.012630 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 9.4e-11
Identity = 39/94 (41.49%), Positives = 56/94 (59.57%), Query Frame = 1

Query: 98  ESLKQNCPWDIVELPKEKKTVGYKLVFMVKCKVDGSIERYKARLVPKGLAQIYGVDYQET 157
           ESL++N  + +VELPK K+ +  K VF +K   D  + RYKARLV KG  Q  G+D+ E 
Sbjct: 835 ESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEI 894

Query: 158 LALVAKINSIRILLSKKLAHDFQIKDLGTLKYFL 192
            + V K+ SIR +LS   + D +++ L     FL
Sbjct: 895 FSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFL 928


HSP 2 Score: 40.4 bits (93), Expect = 4.7e-02
Identity = 14/36 (38.89%), Positives = 28/36 (77.78%), Query Frame = 1

Query: 270  IFCDNEAAISIAHNPVLHDRTKHIEVDKYFIKKKID 306
            ++CD+++AI ++ N + H RTKHI+V  ++I++ +D
Sbjct: 1253 VYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVD 1288


HSP 3 Score: 39.3 bits (90), Expect = 1.0e-01
Identity = 35/129 (27.13%), Positives = 55/129 (42.64%), Query Frame = 1

Query: 154  YQETLALVAKINSIRILLSKKLAHDFQIKDLGTLKYFLGMEFARSKSG------------ 213
            Y + + +V K   +   L   L+  F +KDLG  +  LGM+  R ++             
Sbjct: 1008 YVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIE 1067

Query: 214  -ILSNQSIPECKVVETSVEQNLKLE----ASTKNEIKKRKK--YRKLVGRLMY-LSHTRL 263
             +L   ++   K V T +  +LKL      +T  E     K  Y   VG LMY +  TR 
Sbjct: 1068 RVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRP 1127

BLAST of Cucsa.012630 vs. Swiss-Prot
Match: NM111_PICGU (Pro-apoptotic serine protease NMA111 OS=Meyerozyma guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC 10279 / NRRL Y-324) GN=NMA111 PE=3 SV=2)

HSP 1 Score: 66.6 bits (161), Expect = 6.1e-10
Identity = 37/85 (43.53%), Positives = 47/85 (55.29%), Query Frame = 1

Query: 14  AIGIDSTTKDDLCME--IDPPFRENLATADDWRKALNKVVPAVIVLRTTACRAFDTESAG 73
           A G D+   DD   E  + P           W+K + KVV +V+ ++ +    FDTES+ 
Sbjct: 37  ANGEDNEDIDDYSSEGEMSPQLENYFPQTTSWQKTITKVVKSVVSIQFSHVSNFDTESSA 96

Query: 74  ASYATGFVVDKRRGIILTNRHVVKP 97
            S ATGFVVD  RG ILTNRHVV P
Sbjct: 97  VSEATGFVVDSERGYILTNRHVVGP 121

BLAST of Cucsa.012630 vs. Swiss-Prot
Match: NM111_VANPO (Pro-apoptotic serine protease NMA111 OS=Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294) GN=NMA111 PE=3 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 8.0e-10
Identity = 40/104 (38.46%), Positives = 60/104 (57.69%), Query Frame = 1

Query: 1   MADPSEGLGSESAAIGIDSTTKDDLCM--EIDPPFRENLATADD------WRKALNKVVP 60
           +++ SEG   E+ A   +S T D++ M  E+D     N+++  D      W+  + KVV 
Sbjct: 12  LSEVSEGSDPEAPAKTRNSYTTDEIVMVEEVDSEIPVNISSYGDAETYLKWQNTIAKVVK 71

Query: 61  AVIVLRTTACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
           +V+ +  +    FD+++A  S ATGFVVD   GIILTNRHVV P
Sbjct: 72  SVVSIHFSQVAPFDSDNAVVSQATGFVVDASLGIILTNRHVVGP 115

BLAST of Cucsa.012630 vs. TrEMBL
Match: A0A0A0L781_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G346140 PE=4 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 2.8e-46
Identity = 96/96 (100.00%), Positives = 96/96 (100.00%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT
Sbjct: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 96

BLAST of Cucsa.012630 vs. TrEMBL
Match: A0A067JIK0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02490 PE=4 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 1.1e-37
Identity = 78/96 (81.25%), Positives = 88/96 (91.67%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          M DP E LGSE+   G+++  K+DLCMEIDPPF+EN+ATA+DWR+ALNKVVPAV+VLRTT
Sbjct: 1  MGDPLERLGSETEVAGLETKLKEDLCMEIDPPFKENVATAEDWRRALNKVVPAVVVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 96

BLAST of Cucsa.012630 vs. TrEMBL
Match: A0A061EG47_THECC (DegP protease 7 isoform 1 OS=Theobroma cacao GN=TCM_019222 PE=4 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 1.8e-37
Identity = 80/96 (83.33%), Positives = 90/96 (93.75%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          M DP E LGSE+A +G++ST K++LCMEIDPPF+EN+ATA+DWRKALNKVVPAV+VLRTT
Sbjct: 1  MGDPLERLGSETA-MGLESTIKEELCMEIDPPFKENVATAEDWRKALNKVVPAVVVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTE AGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTEPAGASYATGFVVDKRRGIILTNRHVVKP 95

BLAST of Cucsa.012630 vs. TrEMBL
Match: B9T3Z6_RICCO (Protein binding protein, putative OS=Ricinus communis GN=RCOM_0336750 PE=4 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 5.3e-37
Identity = 81/96 (84.38%), Positives = 88/96 (91.67%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          M DP E LGSE+A   I+S+ K+DLCMEIDPPF+EN ATA+DWRKALNKVVPAV+VLRTT
Sbjct: 1  MGDPLERLGSETA---IESSMKEDLCMEIDPPFKENAATAEDWRKALNKVVPAVVVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 93

BLAST of Cucsa.012630 vs. TrEMBL
Match: M5X9U1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000531mg PE=4 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 9.1e-37
Identity = 81/96 (84.38%), Positives = 88/96 (91.67%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          M DP E LGSE  AIG++S+ KDDL MEIDPPF+EN ATADDWRKAL+KVVPAV+VLRTT
Sbjct: 1  MGDPLERLGSE--AIGLESSIKDDLSMEIDPPFKENTATADDWRKALSKVVPAVVVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTE+AGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTEAAGASYATGFVVDKRRGIILTNRHVVKP 94

BLAST of Cucsa.012630 vs. TAIR10
Match: AT3G03380.1 (AT3G03380.1 DegP protease 7)

HSP 1 Score: 151.4 bits (381), Expect = 1.1e-36
Identity = 73/96 (76.04%), Positives = 85/96 (88.54%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          M DP E LGS+ A++  +S  K+DLC+EIDPP  E++ATA+DWR+AL KVVPAV+VLRTT
Sbjct: 1  MGDPLERLGSQ-ASMATESVMKEDLCLEIDPPLTESVATAEDWRRALGKVVPAVVVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGF+VDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFIVDKRRGIILTNRHVVKP 95

BLAST of Cucsa.012630 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 75.1 bits (183), Expect = 9.7e-14
Identity = 36/86 (41.86%), Positives = 53/86 (61.63%), Query Frame = 1

Query: 106 WDIVELPKEKKTVGYKLVFMVKCKVDGSIERYKARLVPKGLAQIYGVDYQETLALVAKIN 165
           W+I  LP  KK +G K V+ +K   DG+IERYKARLV KG  Q  G+D+ ET + V K+ 
Sbjct: 115 WEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLT 174

Query: 166 SIRILLSKKLAHDFQIKDLGTLKYFL 192
           S++++L+    ++F +  L     FL
Sbjct: 175 SVKLILAISAIYNFTLHQLDISNAFL 200


HSP 2 Score: 74.7 bits (182), Expect = 1.3e-13
Identity = 42/123 (34.15%), Positives = 63/123 (51.22%), Query Frame = 1

Query: 171 LSKKLAHDFQIKDLGTLKYFLGMEFARSKSGI-----------LSNQSIPECKVVETSVE 230
           L  +L   F+++DLG LKYFLG+E ARS +GI           L    +  CK     ++
Sbjct: 300 LKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMD 359

Query: 231 QNLKLEASTKNEIKKRKKYRKLVGRLMYLSHTRLDVAFVVNMIYSNNAMHIFCDNEAAIS 283
            ++   A +  +    K YR+L+GRLMYL  TRLD++F VN +   +        +A + 
Sbjct: 360 PSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMK 419


HSP 3 Score: 45.1 bits (105), Expect = 1.1e-04
Identity = 19/34 (55.88%), Positives = 26/34 (76.47%), Query Frame = 1

Query: 270 IFCDNEAAISIAHNPVLHDRTKHIEVDKYFIKKK 304
           +FCDN AAI IA N V H+RTKHIE D + ++++
Sbjct: 521 LFCDNTAAIHIATNAVFHERTKHIESDCHSVRER 554

BLAST of Cucsa.012630 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 53.5 bits (127), Expect = 3.0e-07
Identity = 28/75 (37.33%), Positives = 44/75 (58.67%), Query Frame = 1

Query: 98  ESLKQNCPWDIVELPKEKKTVGYKLVFMVKCKVDGSIERYKARLVPKGLAQIYGVDYQET 157
           ++L +N  W +V  P  +  +G K VF  K   DG+++R KARLV KG  Q  G+ + ET
Sbjct: 49  DALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVET 108

Query: 158 LALVAKINSIRILLS 173
            + V +  +IR +L+
Sbjct: 109 YSPVVRTATIRTILN 123

BLAST of Cucsa.012630 vs. NCBI nr
Match: gi|449464156|ref|XP_004149795.1| (PREDICTED: protease Do-like 7 [Cucumis sativus])

HSP 1 Score: 194.1 bits (492), Expect = 4.0e-46
Identity = 96/96 (100.00%), Positives = 96/96 (100.00%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT
Sbjct: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 96

BLAST of Cucsa.012630 vs. NCBI nr
Match: gi|700202707|gb|KGN57840.1| (hypothetical protein Csa_3G346140 [Cucumis sativus])

HSP 1 Score: 194.1 bits (492), Expect = 4.0e-46
Identity = 96/96 (100.00%), Positives = 96/96 (100.00%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT
Sbjct: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 96

BLAST of Cucsa.012630 vs. NCBI nr
Match: gi|659114476|ref|XP_008457070.1| (PREDICTED: protease Do-like 7 isoform X1 [Cucumis melo])

HSP 1 Score: 189.9 bits (481), Expect = 7.6e-45
Identity = 93/96 (96.88%), Positives = 95/96 (98.96%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          MADPSEGLGS+SAAIGI STTKDDLCMEIDPPFRENLATADDWRKALNKVVPAV+VLRTT
Sbjct: 1  MADPSEGLGSDSAAIGIHSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVVVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 96

BLAST of Cucsa.012630 vs. NCBI nr
Match: gi|659114478|ref|XP_008457071.1| (PREDICTED: protease Do-like 7 isoform X2 [Cucumis melo])

HSP 1 Score: 189.9 bits (481), Expect = 7.6e-45
Identity = 93/96 (96.88%), Positives = 95/96 (98.96%), Query Frame = 1

Query: 1  MADPSEGLGSESAAIGIDSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVIVLRTT 60
          MADPSEGLGS+SAAIGI STTKDDLCMEIDPPFRENLATADDWRKALNKVVPAV+VLRTT
Sbjct: 1  MADPSEGLGSDSAAIGIHSTTKDDLCMEIDPPFRENLATADDWRKALNKVVPAVVVLRTT 60

Query: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 97
          ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP
Sbjct: 61 ACRAFDTESAGASYATGFVVDKRRGIILTNRHVVKP 96

BLAST of Cucsa.012630 vs. NCBI nr
Match: gi|703151920|ref|XP_010110245.1| (ABC transporter C family member 2 [Morus notabilis])

HSP 1 Score: 172.6 bits (436), Expect = 1.3e-39
Identity = 100/221 (45.25%), Positives = 133/221 (60.18%), Query Frame = 1

Query: 99  SLKQNCPWDIVELPKEKKTVGYKLVFMVKCKVDGSIERYKARLVPKGLAQIYGVDYQETL 158
           +L++N  W+IVELP +KKTVG K VF VK   DGS+ RYKARLV KG  Q YG+DY+ET 
Sbjct: 75  ALEKNGTWEIVELPSKKKTVGCKWVFTVKFNADGSVNRYKARLVAKGFTQAYGIDYKETF 134

Query: 159 ALVAKINSIRILLSKKLAHDFQIKDLGTLKYFLG--MEFARSKSG------------ILS 218
           A VAK+NS+R            +K+ G  +      + +  SK G            +L 
Sbjct: 135 APVAKLNSVRSPREWFEKFTQAVKEWGFTQAQSDHTIFYKHSKDGKKAILIVKYVLDLLQ 194

Query: 219 NQSIPECKVVETSVEQNLKLEASTKNEIKKRKKYRKLVGRLMYLSHTRLDVAFVVNMIYS 278
              +  CK  ET +E N KL      EI  +++Y +LVG+L+YLSHTR D+AF       
Sbjct: 195 ETDMLGCKPAETPMEPNAKLGLEGGKEI-DQEQYHRLVGKLIYLSHTRPDIAFA------ 254

Query: 279 NNAMHIFCDNEAAISIAHNPVLHDRTKHIEVDKYFIKKKID 306
                ++CDN+AA+SIAHNPV HDR KH+EVD++FIK+KID
Sbjct: 255 -----LYCDNKAAVSIAHNPVHHDRMKHVEVDRHFIKEKID 283

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name   E-value  Identity (%)  Description
DEGP7_ARATH  1.9e-35  76.04         Protease Do-like 7 OS=Arabidopsis thaliana GN=DEGP7 PE=2 SV=1
COPIA_DROME  5.5e-11  31.82         Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
POLX_TOBAC   9.4e-11  41.49         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
NM111_PICGU  6.1e-10  43.53         Pro-apoptotic serine protease NMA111 OS=Meyerozyma guilliermondii (strain ATCC 6...
NM111_VANPO  8.0e-10  38.46         Pro-apoptotic serine protease NMA111 OS=Vanderwaltozyma polyspora (strain ATCC 2...

vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0L781_CUCSA  2.8e-46  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_3G346140 PE=4 SV=1
A0A067JIK0_JATCU  1.1e-37  81.25         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02490 PE=4 SV=1
A0A061EG47_THECC  1.8e-37  83.33         DegP protease 7 isoform 1 OS=Theobroma cacao GN=TCM_019222 PE=4 SV=1
B9T3Z6_RICCO      5.3e-37  84.38         Protein binding protein, putative OS=Ricinus communis GN=RCOM_0336750 PE=4 SV=1
M5X9U1_PRUPE      9.1e-37  84.38         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000531mg PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT3G03380.1  1.1e-36  76.04         DegP protease 7
AT4G23160.1  9.7e-14  41.86         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1  3.0e-07  37.33         Reverse transcriptase (RNA-dependent DNA polymerase)

vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|449464156|ref|XP_004149795.1|  4.0e-46  100.00        PREDICTED: protease Do-like 7 [Cucumis sativus]
gi|700202707|gb|KGN57840.1|       4.0e-46  100.00        hypothetical protein Csa_3G346140 [Cucumis sativus]
gi|659114476|ref|XP_008457070.1|  7.6e-45  96.88         PREDICTED: protease Do-like 7 isoform X1 [Cucumis melo]
gi|659114478|ref|XP_008457071.1|  7.6e-45  96.88         PREDICTED: protease Do-like 7 isoform X2 [Cucumis melo]
gi|703151920|ref|XP_010110245.1|  1.3e-39  45.25         ABC transporter C family member 2 [Morus notabilis]
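
The summary rows above can be filtered and ranked by E-value once they are held as records. The Python sketch below does this for the five Swiss-Prot matches, copying the values from this page. `filter_hits` is a hypothetical helper, and the 1e-20 cutoff is an arbitrary illustration rather than a threshold recommended by the source.

```python
# (match name, E-value, percent identity) for the Swiss-Prot matches listed above.
hits = [
    ("DEGP7_ARATH", 1.9e-35, 76.04),
    ("COPIA_DROME", 5.5e-11, 31.82),
    ("POLX_TOBAC", 9.4e-11, 41.49),
    ("NM111_PICGU", 6.1e-10, 43.53),
    ("NM111_VANPO", 8.0e-10, 38.46),
]

def filter_hits(hits, max_evalue):
    """Keep hits at or below max_evalue, most significant (smallest E-value) first."""
    return sorted((h for h in hits if h[1] <= max_evalue), key=lambda h: h[1])

# Illustrative cutoff only; choose a threshold suited to the analysis at hand.
for name, evalue, identity in filter_hits(hits, 1e-20):
    print(name, evalue, identity)
```

With this cutoff only the DEGP7_ARATH match remains; relaxing it to 1e-10 also keeps the two transposon-related matches.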
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR009003  Peptidase_S1_PA
IPR013103  RVT_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0010205      photoinhibition
biological_process  GO:0006508      proteolysis
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0005829      cytosol
molecular_function  GO:0004252      serine-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.012630.1  Cucsa.012630.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term       Source Description               Alignment
IPR009003  Peptidase S1, PA clan                                unknown  SSF50494          Trypsin-like serine proteases    coord: 42..101; score: 1.2
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727           RVT_2                            coord: 104..172; score: 4.3
None       No IPR available                                     GENE3D   G3DSA:2.40.10.10  -                                coord: 38..101; score: 3.
None       No IPR available                                     PANTHER  PTHR11439         GAG-POL-RELATED RETROTRANSPOSON  coord: 98..262; score: 1.4
None       No IPR available                                     PANTHER  PTHR11439:SF185   SUBFAMILY NOT NAMED              coord: 98..262; score: 1.4
None       No IPR available                                     unknown  SSF56672          DNA/RNA polymerases              coord: 135..294; score: 7.1