Cucsa.178770.1 (mRNA) Cucumber (Gy14) v1

NameCucsa.178770.1
TypemRNA
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionInsulinase (Peptidase family M16) family protein
Locationscaffold01227 : 1118701 .. 1135848 (+)
Sequence length4293
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTTTCTGTTCTCCGCAAGCTCTGTCCTTTCCAGTTTTCTTGGATTTTTAGCCTCTAATGGCCGTTGCAACCTCTTCTACTGTTTCAAATTTGACGCAAAGACGGCCCTTGTTGAGTCTAAAAGACCAAACTACTCCCATTAAGCGGGTGAACTCGGTGCAACTCCCTTCCCGCTCTATTTCCGCTCATCTCTCTCGATTCGATGTAGACTCTCGATTTGTTGTCCCTTTGAGGAGGTAGATTTTTGCTTTTTTCCATGAAGAATTGTGTTTTTGATACCTAGCACTGTTGCTTCTAATGTTGTTAACCTTCCATTTCGTTTTGTTTCACTTTTTCTTGTTTTTGGGGAAATTATTTATCTGGGTTTGATGTGGGTTTTTTTGAGCTGAAGTTTGATCGCTCTTTTCAACCTCTGGGTTTGGTCTATGGTTCACCAATGGAAGAATCTAATTGTTGGAAAAAATTAGTAAATATTTAGGAATTTTATTTTGACTGTACGGCTAAGTGATGTAATCTTTGATGGTCTTTCCCTTCCTCTTGCGAGGCGACGTTTACAAAGTTATTCTTTGCGGTTATTTTTTAAACTCAAAGCTAAAGTTCTTTGGAAAAATCATAAATGCTAACATGGGGTTCTTTTGGAGACTATGGTTTGAACCTTTTAGTATTCCTACTCTAAAATTCAAACGAGTTTGTGGACTTAAATGAGCCCTTCAGGTTTCTCTCTCTAATTTGGATGAGTTAACAGTTGGATTCTCTTTCTTCCTCTTGTTGCACTCTGGGCTCAATATGTAATTGTGACGTTATTTTGATTTATGTCGATTGGTAGCTGTTTCTGTAAGCCCCTTGGGCTTTGAAGTGTTGAAGTCCCCCAGTAGGCTTAAAGGAATACCACTGTTCTGCATCCATAGCATTTAAGTTTCTCTAGCATTGTGGTAACTAACCATCACGATCCCTAATCCATCAGCAAACAATCTTTATTGCTTTAATCGAATCTCATCAAAGAGTGAAGGGACTAGGAAGAGAGAAAGTGAAATTTTTTtGTGCCCTGAATCTCTTCCAACCATATGTAGTATTGCTTCATCGTGGCATGGAGGAGGAGATGGAAAATCTTGTAGAATTGAAGTCTGGGTGAGTTCTCGATGGGGTTAAGATATAGGACAATGAGAAATGTGGTAGAATCCCTTGATGTGGAGGATGGTTTAATGTGAGAAATCTGCCACTATCGTGTTGGTCAGAAATAAATCTGAGTTAAATTGAGACTTGAATGGTTTAGAGTGGTGAAATCTTTATTGTATTATATTAGTGTTGCTCTTCACGAACAATCATTCGAGCGGCTGTCAGGTGCACTTGTACGCCTTTGCCTACTACTAAGATAATTCATATCACAATCTCATGCACAGTTGGTCGTGTGAATACTATGGATAAGCTTCTAAGGAAGAAGTCTTTGCTTAGGGAGCCTTTTTGTTGTATTCTTTGTTGGAAGGCTGAGGGAGACCTGGATCACCTCCACTAGAGTTGTCAGTTTGCGTTGGTGTTGAACTATTTCTTTCAAGAGTTTGATATTGAGCTTGCTTGCTAGATAGATTTTACTTAGTTGGAAACCCTCTCTTTAGATGGGCTTTCTGAGTTTCTTTTTtCTTGCCCTTGTATTCTTTCATTTTTTTTCTCAACGAAAGCAATAGAATAGTATGTTTCATTTGTTTCATCTTTTGTATACAACCTTCACACGGTTTAATTCACCCCACACTTTCGGgATTACAATCTTACTTCTCTTATAACACAATGAAATAGTGTGTTGTAATCTCTCATGCCATATGTGCCCTTCTGTAATTTCATACCATAATGAAATTATTCATATGATTGTCATAAAAAAAaGAAGAAAGCAGTTGTTAAAGAAAAGAAGAAAAGAAAAGAAAGGTAACACATTGTATGTTTTTtCCTTATGTGATCTTGGGTGAAAGCTTTTCTTTTTTGTATTTTCTCTTCCCACCCACCCGACAATTATTCCTCCTCCTTATTTTCATTTTTCTTGATCTGAGCTAAGGTTTTATTCAGTGCTTATATTTACTTCAATGCATATCTTTGTTGATTATTTCCTCTTTGGTTTGGAAGGCACTCTCGTGACGATGGTATTGGTAGACACAAGTTTAGAAGAAATAAGGATAATGCTCGAAGACCATGTGCTTATAAAATTGGGGAGCGTGGGAATGAAACGTTGACTAATTGTATCTCTTGCTTTCTTAACCAAAAAAGAAGATGTCCCAGTATCAAAAGACCAACATCTAGATTCATTCTAGACAAGTCTGCTTTTCAGTTATCCAAGAATGAACGTGATGATAGAGTTGTAAGTGCCTGCTTTTTTGCAGATTGGCTTTTATGTAGTTCTCAAAATTTCTCTCTCTTTCATCGTTGTGCTTATTCCTTTTATTCCTTTCTCTTTTCTTGTGTTGTGGCATTCATCTGCTTAATATATTTTGAGAATCTTTCTTAGAGATCGTATAGAAGGGAAAGATTAGAAATACACCCATGCAACCTTAGGAGCTTATGAAGAACACTTTGTTAGGTTAGTCATGAGAGTTTTGAGAGAAAAACTCAATAATATTGTTCATTGACTTTGTATATGTGGGTGGACAAGGCAATCCTAAATTAGAAAAATGTAAAATAGCAGCAAAGGGAAAATAACTAACGAACATGAATAACTAGGCTGTTTACAAGCAACAAGATGCACTAATGTATCTGTGTTAAGAGACATGTGGAGCATATATGTTAATGGACAAACAGCTAATGAACTTGAATAACTAGGCTAGGCCGTTTACAAACATTGTGATGCACTGATGTACTTGTGATGATAAAGATATTAGTCACGTCTATCTTGTTCTAAAGATAATGCATGCTTCATTTAATGCTTTCTTAGAGATGTCTCGTAATTTCGTTGTAGTCTTAACATCACATGTGGATAACAAATTTTTGGGTCTATCGATTGTTTAACATGGTATCATAGCTTGAGGTTGTGTGTTTGAACCCCTTTCAATCCACAACTAAAGAGGATTGTGTGTTCGAACCCCTTTAAATCCACAACTAAAGAGGTTTTGTGTTCACTTTGTATCTTTGTATCTCGAGCAATGGCCTCTTTCATTATATCAATGAAAAGTTTGTTTCTATTTAGTAAAAAAAAaCCCTTTAATGTTATTCCTATCAAATTAAGGTTGATTTCCACTTGTTGAATATTCTACATATTTTCAAGCTCACAAGTAAGGAGGGTGTTAAaGTGTTTATATATTTATATTTACCTTGACCTGTTAGCTTAAGCTTTTGGGTCAAACCGTGATTTAACATCTTCCAATACAACTAGTGTTAGATGTTTAGAAAGGAATATAAATGGCGTTCTACAATATTTACTATGTACGTGATAGTCCATATATTCTTATATTTATTGTCTCTGTACTTTTATGCAACATCCAGGTGAAGCATGCTCGTATTGTTTGTGGAACTGTAGGTCCAGATGAGCCTCATGCAGCACCTACAGCCTGGCCTGATGGTATTCTGGAGAAACAAGATTTAGATGTTTCATATCCTGAGTTTGGAAGGGCAGAGTTAGAAGCATTTCTTAGTTCAGAACTTCCGTCTCATCCGAAGTTGTATAGAGGGCAGTTGAAAAATGGATTGAAGTATCTAATTTTACCAAATAAAGTTCCACCCAACAGGTATTGACCATGTGGTTCATGAAGAAATAATTTATCAAGTGAATGCTTTTCAATAATTTAACCAGTGAATACAGGTTGACACTAAGAAGATTTTGAAGTATTTATAAATTAAAACTTCATCTATATATATTATTTTCTGAGTCCAATATCATGATAACCACTAGACAGGAATAACCAGCTTAGATTGGTACTTTTTAAATTCTTGCAAAGATAGAATGATTCAACTTTTAATTCAATTCTCAAAAAAAAAAAaaGAAAAGAAAAAaaGGAAGAAAGAAAGTAAGAAAGAAGTAAGAAAAAAGAAAAAaCAATGTTGAGGATTCAAAAGATAGTGGGATGAACGGAATTTTTTtAATAGGAAACAATACTAATGCTAAGAGTACAATAAAACAAAAACATATACAGCCATTACAACCAAATACAACCAATTTAGAACTCAGAAAATTACTGCAACTCGAAGAAACAGTACAAGTGGGATGCATATGGACTATTCCAGTGAAATAAGTTTTGTGGGAGTGTCTGGTTGAAAAGAAACGGTACAACTTCCATATTTTTGCTGAAACTTTTCTTGTTTCTTATAAATATATTTGAAATCTATTTTCAGTATTTTCACGAAATGGAGGATAAAATGGATTTTGTGTTCTACATGGTAGGCAGTTTGCATTTGGAGGAAATAATTGATCGATAAATGTTACATTTTTTGTTGAGCACTTTTTCAATTACTCTACCCATTTAGGTAATTCAAGTTATGGGACCTTAGTGTAAGGATGAGAATAGAATTTGAGATTTTATTCTTGTTAGGTAGGTTTTGTGGGTTCTTTTCTTGTTCTTCTTTCTTACTCTCTATGGATTCACTCTCTCTGTCTTTGTCTCACTTTTCATTGATATAATGAAAAGAGTAATACACACGCACATACACATATTGAAAGAATTGGACCATTCTTCTGCTATCTCATTAAGAAAATGAAGCTTTATTGATGGACTTCAGGAAGGATTCTACACTAACAAAGTTTTGAATTTAGGTTTGAAGCACACATGGAAGTTCATGTGGGATCAATTGATGAAGAAGATGACGAGCAAGGAATTGCGCACATGATTGAGCATGTAGCCTTCCTTGGAAGTAAGAAACGTGAAAAGCTTTTGGGTACAGGCGCACGATCTAATGCCTACACTGATTTCCACCATACTGTGTTTCATATACATTCACCAACTAGCACAAAGGTTGTTCTTGTTGGTTTTTCTATTTTTACAGGCATGCACCGATATCATTTCTCAATCTATTGTCTTTCATGTTTCTATGAGGACTGGGTTAAACATTTCTTTCTCAGAGAACTGGTTCTGTTTCCAGCTGTGTTTTCTTTCTTGCTGATGTTAAATAATGTGTAATTTTGCTACCCATGTATGAGCGCCAACTTACTATAATTAATACATATTCATTCCACCTCATCTTTTTGTTCTCAGCATTACGTAGCCGCTGATTATCTTTTCTTTTTCATTTTTTTCATTTTAAAATGCCTGCTCAAATGTTTCAGCGCCAATGCACTGAGAAGTTAGCTTAATGTTTAGAAGTGCAATTTTTTTAGTGCTTAGGTACTATTACTGAGGTATTTTCAAATCATAATCTCCTTGTGCTAGTTTCTCTTGTATTTTCTTGCATTGTTTGGAGATGCTTTTGGTTGAAAATAATTTTGAAACCGAATCTATTAATGCACACATGATGATCAATCAAAGATCATGATGGATCATTTGGAATTGACCCATATAATAGATGGAAGAAGAATACTGCACCATGTTACCGTTGCTGTTTGCATCAGTTTTGTGTCCGGGTTTGTTATTAGCAAATTGTTTGAATTTTGACCATGTTATCTTTGGCTGTAGGATTCTGATGGAGATCTACTTCCATCTGTTCTTGACGCCTTAAATGAGGTACGAACAATACATGTATTTATTTATGTTGAAGTGCACACTAGTGTGGCTTTGGGTTGTGATGTGATTTTGTTGGTTGATGTACTGTTTTTGTTTTTAGCAAATGGATTTTCTGATTTCAGTGTGATTTTTTTGTAAGAAAACAGTTTCATATAAGAATGTTGGATATTATTTGTCTCATCACTTTATAGTTTATGCATCCTTTCTTTGTGTTTGTCTAAATAGATAGCTTTCCACCCAAAGTTCCTTGCTTCACGAGTTGAAAAAGAGAGGCGTGCCATTCTCTCTGAACTACAGATGATGAATACAATAGAATATCGTGTTGATTGTCAGGTATGCACTGCAGTTGCTTTTATTCATAAAAAATATGTTTCAATAAAGCTTATATATTGGCTAATCCGTGCCCTTTTCTATAATTCCATTTATTTGGAAATATTCTTGCTGTACAGCTGTTACAACATCTGCATTCTGAAAACAAGCTTAGCAAAAGGTTCCCAATCGGATTGGAAGAACAGATTAAGAAATGGGATGCTGATAAAATAAGGAAGTTCCATGAACGATGGTACTTCCCTGCAAATGCAACCTTATACATTGTTGGAGATATTGATAACATCTCAAAGGCCGTTAACCAAATTGAGGTTTGAACAAGCTTTTATTTTGTTCTAATAATTCTTCTTTTGTTGACATTAGAGACTACAAAGTTTATTTTGAGGACTCAGTTCATCAGTACACTGATTATTATCTCTCCAGGCTGTCTTTGGTGAAAGTGGCCTAGAAAATGAGGCTGTTTCTACACCTAATCCCAGTGCATTTGGTGCGATGGCTAGTTTTCTTGTTCCCAAGATCTCAGTAGGATTAGGTGGCAGTTTATCAAACGAGAGATCAAATTCAGTAGATCAATCCAAGATCGTTAAGAAAGAAAGACATGCAATCCGTCCTCCTGTGATGCACAACTGGTCTCTTCCTGGGAGCAATGTACATGCAAATCCCCCACAGATATTTCAGCATGAGTTACTACAAAATTTCTCAATTAATATGTTTTGCAAGGTATGTGGTATTTAATATATTTTTTCTAATACTTTCCCTCTTGAACTTATCTTGCATGAGTTCTTTTATTTAGAGCTCTCATAATCACTAATATATAAGCAGGACGAAGAATTAACCTTTTAGAGATGTAATATTGAGATCATGAAAAGGATATTCCAAATAAATCTAGAGAAATTGATATTTACAAATTTTGAGACAATTCATCAATTTCAGATGACCCAACATACTTCCTCTACATTCTCTTGGGTCATTCTTTTATAAAGGGAAAGAAGATTCTTTGGATGGACTTTGTTAGGGTGGTTTTGTGGTTCACTTGGCTAGAAATGAATTAGAGGATTGTCAAGGAAAATGAGCTGCTCCATTTGATCCCTTTTTTGATGTTGTAGTTTATTTTCTCTTGGAATTACACACACACAAGAGAACAGAGCCAGTAGGTATCTTGCTTAAGAAGTTTTTGAAAGCATTGGTTTGTGTTAAACACCAAACACTGAACACCAACTGCCAACACTTCACTCTCTTTGTAATTTCCAGATCAACATGCCGAACTCTGGTGGTCTTCATACATCTCTTTTGTTTTTCGATGCTCAGAAATATGAGTTCTTTTCTCTTATTCTTCAAGCATATTCTTTTTCTGAAGTCTTGTTTTCCTTTTATTGAATTTCATTGCAGATTCCTGTAAATAAGGTTCGAACATTTAGTGACCTGAGAAATGTTCTTATGAAGAGGATATTTCTTTCTGCCTTGCATTTTCGTATAAATACAAGATACAAGGTACATTTGAAACCATATCCTCTCAATTCTGTCTTTGTGAATACTTGAGATTTCAAGTTTTGAATGAGATCTTTTTTTCTGAATTTGTCAAAATCATTTTTTtATTGGTCTTTTAACATAAATTAATATTTCTATTTGAATATACAGCATCTCTTTCATGGGAGTGGGGATCCTAATTTGGTTTTAATGATGTGTTAGTTAGCCTATTTTACTCATTCTATGGCATAAAACTTTTTATTGTTCCAAAGTTTTCTAATTTATTGCAATTTTCAGAGTTCAAATCCACCATTCACTTCCATTGAGTTGGACCACAGTGATTCTGGAAGGGAAGGTTGCACTGTCACCACACTAACAGTAACGGCTGAACCAAAGAATTGGCAAAGCGCGATTAAAGTTGCTGTTCAAGAGGTTTGTGTCGCTTTTTGTATCTATAATGTCCAAAAAATAAACATTAAAATAGTTTTTGTTAAAAATTTTCTCACCCTTTTACTTACCAGTCTACGAGCTAACTTTGAAATTATAGTTTGAATTGCATCCATTTGAATCCTAAATTGAAAAAATTGATAAACTAAATCCCCTTTTTtGGTTTGTTGACCTGGTAATGACTTGATTTATTAGTTTTCATTTTTCTTTTAATGAGCTCAGTGTTGTTTAAAAACATGAAAAACGGCAAATTTCAATTGGGGTTTTATGTTGTTTTTATCATTTCTTTATTAATTATTATTTTTAATTTTAGTAGGTCTGTTATTTTAACTCAATGAAAAACTAATGTTAAAGCAAAAAGAAAAGAAACTTTGAATCGAAGCATTTATTTTGGTTTGGATCAATTTTCACCTATTTTAATTCCTACATTTGGATATAAATATATGTATCATGCTTTTGTAGCATTGCTGGTACTCAGGTCATTTAAATAGAGAAAGGATTTCAGGCTAATAAAATGCCAAGAAATAAAGGGTTTACACCTATTATTTGTAAAAGAGGGAAGAATGAAGGATACCGGGTGATGCAATAACAACACAACACAACAATTTCAATGGCTTTGCTTTTGTTTTCATAGTTACTGGCCTTGAATTATGTGAGATGCAGATCAATTACTGAGTAGTATATAATTTTGATACTGTTTCATTTTTTTtCTTTTGCTTTTCATATTCGATTTCCAGAAGCACAAGCTCTGGCTGCAGTTGGACTGCTGGACTTGTAATAGTTAATTTAATATGTAGTGTATTTCATTTTGTGCAGGTACGGAGGCTTAAAGAGTTTGGTGTAACTAAGGGTGAACTGACTCGGTATATGGATGCACTTCTAAAAGACAGTGAACACCTAGCAGCAATGATTGATAACGTGTCATCAGTTGATAATTTGGATTTTATCATGGAAAGCGATGCTTTGGGGCATACAGTTATGGACCAAAGACAAGGTCATGAAAGTTTGGTTGCTGTTGCTGGAACAGTTACTCTTGAAGAGGTAAGGTACTTTTTTtCTTAATTATATTCTGAATGAAATTTTGAAGGTGTTCGTGAAAGACTAGTTGCACTTGAGAACAAATGGTGACATATTGGGAGGCAGATTATGTTTGTCTCTGACATAGGCTCTTTTTCAATTACTGGCACTGACTAATTCTTGTAGTTTGGTTGTAGTTTCTCTTGACATTGTCTCTTTCAAGAAATAAGTCAGGGTATTGGTTGGAATCACATTTCATTGTCAATGAATAATTTACTAGATGATACCTTCTGTAAGAATAAGTTGGTAGTTGGATAATTCTCTTCCCTTTTTATTGTTTTCTTAAATATTTTCTTCTCAAAACTTTGAAGGATGTGAAAATTGTATTGATGCAAAAATAGTGCAACAGAGAGATTTGATTCTATTCAAAGATCAAAAGAGATGATGAAAAGGCTTTGCAATTTGTATTCAATTAAAGGAAACTATTCCAAGATTTATCCTCCCATCCTTTGCAATTATTAGTGTATATATATTTTTAGTAGTCACATACATTATATGTGTTAGCGATTGAATTTTTtAAATTTATTTCATCAATCAATGGAATGGTCTCTGTTCCTTTAAAAAACAAGATATACGAGTGTATTCTTACTGATTAAAATCATCGATGGGTGGAGCAAATAGCTATGTGAACGTACATGCCCAGTATGTCTCTGAAACTTTGACAATTAGCTATATGAGTGAATTTAAAAATTATGGAGGGTAAGGGTTGACTAAAGTTCTATAACATTGTTTATTTAGTTTATACCCCTTATTACCAGTTCCTTATAAAAGGTTTGATGTATTTTTGTTTTCAAAATTCTGTACCTTAGGTGAATTCCATTGGTGCTGAAGTGTTAGAGTTCATCTCAGATTATGGAAAGCCTACTGCACCCCTTCCTGCAGCTATTGTTGCATGCGTTCCAAAGAAAGCACATATTGATGGATTGGGGGAGACAGAGTTTAAAATAACTGCAAGTGAGATAACTACTGCTATTGAGGCAGGATTGAGGGAACCAATTGAAGCTGAGCCTGAAGTATCTTATTCTCTTATGCTCTCACTTTCTTTTGACTTTTAGTATAAATATATCTCCATGTTTGTTTTATGGTTTATTTCTTATCGGAAATAAATTTCTATCTACTTCTGCAGCTTGAGGTGCCAAAAGAGTTGATATCATCATCACAGATAGCGGAATTAAGGATCCAACACCAGCCATCATTTATTCGTCTCAACCCAGAGACTAATGTCACCAAATTTCATGATAAAGAAACAGGGATCACTCAGTGTCGTCTGTCAAATGGAATTCCTGTGAACTATAAGGTATAACTTATTAGATAAATACAACCAACTTTAACATAACTCAATGGTTGACAAGTATAATTTTTCCCTTGAGGTTGGATGTTCAAATCCGCATACCACTCATTTGTGGTACTAAAAAAaCTTATAAGATAAATTACATATTCAGAGGTTAGGCTTCACTTTATGCATGGGGGTTGGAGGGTATAAATCTCTGTCTGTAGATTCCATTAATGTTTTTATTTCTTATCTCAAGAGAACAAAAAGTTTTAATTGAATGTCCTCTAAATAACTTAAAATAACTGAGGTTTCAGACATTGCCTGCTCAAATTCAAGAAATAATTCTTTTTTtGGATAAAATTCAAGAAATAAGTAGTGTGGATCCATTTGACGAATGTTATAAAAAAaTTAAATTATTTTTtCATTGCTAGATTGGAAAGTTTATTGTACTTTGTAGTTTTATCTTCTCATTTTCTGATTGTCTGGTAGTACAGTTTTTATTAGTTTTTTACTTCTTTTTGCTAGTATTGGTATATATGTTCAAAAGATATTTGTAGATTACTGCCATGAGTTTTTGTGAGAAGTATAATTTTAGTGGATATCTGACAATAAAGATGCTGTAGTTATATTTCACATTAGTCAAACACAAACTCCAGGGCGAAAAATATCGGATATTCAAGTGATGATTTCTAAATAAGTGTGGGTAGCTCTGAAAAAATGTATGCAGGGGTGGTGATTATTTGTCAACATATTTAACCATTGGTACCATTCTTCATGTAGATTTCAAAAAGTGAAAACAAGGCAGGCGTGATGCGGCTTATAGTTGGTGGGGGACGAGCAGCTGAAAGCCCTGACTCACAAGGAGCTGTTGTAGTCGGTGTTCGAACTCTCAGTGAGGGAGGTCGTGTAGGAACCTTTTCACGGGAGCAGGTATTCTATTAGAATCCTCACACTCTTGACTCCCATCCAATTAATTGTTTGAACTTCAAATTCAGTAAAAGAGTTGAAGGAAGCAATGAACATGAACTTCCTATTGAAACAATAAATAATTGTCCTGGTGGTTTATAAATAGCTGTCTACAAACGAAGTCTATGCACTTGGAAGTAATTTCTGATTTATGTGAAGTGTATTTATCAATTTTGTAATGAATGATGTGAGGTTAAATTGCAAATTTGGTCCTCATGGTTTATAACTAAAATTTTGTCTCCATGGTTTGATAAAACACTAAGGGCAATTTATGAAGATTTCATCAGACTATAGGGACCATTTATGAGGTTCTATCAAACAGTAGGGGCTAAACTCTAATTTTCTCCAAATCATAAGGACCAAATTTGTAATTGAATCATGGCAATACCTTGATAACTGGCATTTTAAGTAACAGCAAGCAGCTTTGTGTTGAAGGTAATTGCTAATGCTGTTATTCCTTTCAAGAACTTGCTTTGGTGTCTCAGACATATGGTTGATGACATTGACATGTAGAAGCTGTATATGAATGGCTTAATGAACGGTTCATTGCCCATATAACTCTTCATTCAAGTTGATCGATGTTGTACATGTAACATTTTGTAGGTGGAACTTTTTTGCGTCAATCACTTGATAAATTGTTCTCTCGAGTCAACCGAAGAGTTCATTGCTATGGAATTTCGTTTCACCTTGAGAGATAATGGGATGCGTGCAGCTTTCCAACTACTTCACATGGTTCTTGAGGTATGTGCAAATGGCTAGTTTTGTTAAAAAAAaCCTCATTTTCCCTACAAAGCACAGTTGAGATCTGGACCTATGACAACGATATATATAAACACAACTCTGCTTACTTTTTTTTTtAACAATGGAAACAACAATTTTCATTGATGAAATGAACAAGAGGCTAATACTTGAATTACAGTGAGATATAACTAATAAGGAAACAAGAATCTAAGAAGATTAACCCTAAATATAACTCTGCTTACTTAAAGTTAGTGCTTGAATATTGATAAAACCCAAGTTTTTAGTATAAATTGAATATTTTATTGAGAAAACTGCTCTATAACTTTTAATTCTTACCAATATTTATTTTtGTTTTATTATTATTATTATTATTATTTTAATTCCAAGTGTTTCATCTTAACCTTAAAGGAGGGTAGACCAGTATTTTGTCTGTGAATTAACTTCTTTTTAGTAAGTCCTATATTTCAATAAATTAACCGACAAGGATGAAGAATCATCTCGAGGAATTTTGCTTAGAACTCTTGAAGACTTGACATTTCCCAGCAATCATTCAGGGTTTAATTCAAATTGTATCTCTGAATAATGTTTGTGCATTTTACTGTTTAAAGATGGGCGATTACTAATTTGCAGCATAGTGTCTGGCTGGAAGATGCATTTGATAGAGCAAAGCAGTTATATATGTCATACTACCGGTCTATTCCTAAAAGCCTGGAACGTTCTACTGCTCACAAACTCATGTTAGCTATGCTGAACGGGGATGAGCGGTTTGTTGAGCCTTCCCCAAAATCACTGCAGAACTTAACATTACAAACCGTGAAGGACGCTGTGATGAATCAATTTGTGGGCAATAACATGGAGGTCTGCTTTTTTtCCTTTTAATTGAAAGTGAAAATGACATTAAGAAGTATCATCATCAGAGAGAGAACGAGATATCCTAATTAATTGAAGAGAAAGTAGAAGAAATATAATCATGCAGAGTACGAAGAAAAAAGCATTTTACATTCTGGACCAATGAAAGGCAATCTGTTCTTTTTTGAGAAGTTACATTTTTTAACAATAATTCCTGATATATCTGTCACGGAATCTTGCCATTTTGTAGGTAAGTCTTGTTGGGGACTTTTCAGAAGAAGAAATTGAGTCTTGTATTCTAGATTACCTTGGTACAGTCACAGCAACTACAACTTCTGAGGCAGCACTTGCTTCAGTTCCCATTGTGTTTCGACCATCTGCATCTGAGTTACAATTTCAGCAGGTGAGGGTGATTTACTTTTTAAAACTGTCTAGTTTACTTTTTAAGATCTTTAACCTCACCCTGCACTGAGTTATTCTTTGCAAAATTTATTATAAAATTAAAAATTAATCCACTCCTTTTGTAGTTTTGTTTCTTAGAAGTGGATATTACCTGATCCACACCAAAAAGATACTGGAAAAGATCTCAATGATTTTCATATAGTTTTTGCAGTTGCCTGCTTTCTGAGATCAATGTGATATTGTGTTCAGATCATTGTAATCTCTGTTACGTTTTTTATATCAAGTTCTTCGTTATATTTGAACAGGTATTTTTAAAGGATACAGACGAAAGAGCATGCGCTTATATTTCAGGTCCTGCACCCAACCGTTGGGGTGTTACATTTGAGGGTTTAGAGTTGTTAGAATCAATTAGTCAGATTTCAAGAACAGGTGGTGAGTTTCTCTGCGAAGAAGTGTGACTTACCATTTCCTGATTCTCTAATATAGCTTATGATTGGTCCTTTTTTtCCTTACTTTCTTTTTTTtAAACTCCAGAAAGTGACGAATCTGATAATGATATCGAGAAGGGGTTGCAGAGAAAACTTCGTAGTCATCCACtCTTTTTTGGAATCACAATGGGGCTTTTGGCCGAGATTATAAATTCTAGGCAAGTCTTCTCTCCCAACTGCATTTTGTAATCTTTAGTTTTCTTTCTCCTTGGTGCACAAGTTAATGTTTGGTTCCCCCTTTTAAGTTTTAGTTTTTTATTATTTTTGTTTACTTCCTTTCTTTTTCTTGTTAATTTGTTGGGGTTTCTTCTTTTTGGTTCTTTGTTATCAAGGCCCACAACTCCCACTACACTCATATCCAATCATCTTTGCTACCATTGCCATCATGTCGACTGTTTCCAGTTAGCCACGTCCTCTCACAAACCCACCCACACCCTTGTCCATTTGTTTGTTGGCCTCTCCTAACATGGTTTGAAAACAGGCTAAATCTCTGTCCTACAAGAGATACGTGAATCATAAAGCACCGGAAGATTGAAGTTACGCACTCGAGTCCGGGTACAAAAATCTCTAGTAGTTGTCTGAATTTCAATTGACCTAACAAGAATGAAACAAAGATGCTGAAAACTGCCCTCCACTTTATCTTATTTTCTCCATCCCTTCAAAAACTGTCCTTCCATTTCACTTAACACTAAAGTAAGAAAATACCTTACCCCTTGCTTAGGAATCGATCTTCAAAAAAGTTCTGTTCTTTTCCTTAAAATTACTCAAATTGGCCCAGTCACCTGCCAGGCCCAAAAAGAACCGACCAAACCTTCAGGTTGCTGTTTGGTTTCGCATGGTTTTTTTAGCAAATGTAGTTCAATGTGGTTTTTCTTTCAAAATAATTTACTTCGTTTTTGGTTCAGCTCCAAAACTAAGACAAACAAAATCATGAACATTCTTTATTTGCAAGAAATAATTAAGGAATAAGCCTTCCAAATAAATGTGCACGATGACATGGCAAAATGAAGAATTAAGGATTTATGAATCCCTTTCCATAGCTTTTGAAGCACAACCATCAATGGGTTTCGCATTAAAGAATCTAATTTGTGTGCAAATGTCAGCTAACAATTAATGTAAACCATGTTATGTCTGATAATGATTAATGAATATCAATTTTCTACTTTCTCTTAATTCTTAAGTTGAAGGCTGGAAGCCATTTTTAAAATTTATTGTGCTAGTTTTGCCCCAAGTCTTCCCACATAACTTCTTCACTACTGTTACTTCATTTGCTAACATTTGCAAAAATATTATTGATGTGGTAATTCCAGGCTTTTCACAAGTGTGAGGGATTCTCTTGGTTTGACATATGACGTATCCTTTGAATTGAGCCTGTTTGATAGGCTTAAGCTCGGATGGTATGTTATATCTGTGACATCAACTCCAGCCAAGGTTTGGACTCAATTTTCATTAAGATCCTTGTTAACTTCATTTATTTCTTTTTTTCTTTCACTAATACAATTCCTCTTTTCAGGTGTACAAAGCTGTTGACGCTTGCAAAAGCGTTCTGAGAGGTTTACATAGCAACAAAATTGCCCAAAGAGAGTTGGACAGGGTAAAGTGTTAAATCAACCTACTGAGCTAGTACTGGCTACTAAAAACTTAATTAGTTTCCATTTCATATTTTTAAACAGTAGCAAGCCACTAAAAGCATTTTAATGATTTTCATTTCAAACTGAAAATAGTTCATTGCTCCCTACCCGTCTCTTCATTTTACATATCTTACTATGAGATTCTTTCTCGTGCGGTTCTGTACTGCCTCTCTCATTCTTAGTCAAGGTTTTAAATAATGTTTTCCAACTGCAGGCAAAGCGTACTCTTCTTATGAGGCATGAAGCTGAAATAAAGTCCAATGCTTATTGGCTTGGCCTATTGGCTCATCTGCAAGCGTCTTCTGTTCCACGGAAGGTAGAACTAATGTGCTTCTACTTCTCTTCATTATCTATTCCTATTCTTTTGCCTTGAAAACAGAGAAAAGATTGCATACAATACAATGATTTTATAATATGTTGCAACTGTGCGTCTTTTGTGTCTCACCTATCTTTTCTCTTCTGACTGTGGAGCAGGACCTATCGTGCATCAAAGATCTTACATCATTGTATGAAGCTGCCACCATTGATGACGTCTACATTGCTTATGATCAaTTGAAAGTAGATGCAGATTCTTTGTATACGTGCATCGGGATAGCTGGAGCTCAAGCTGGGGAAGAAAGTATTGGTGAGAATTCGTGTATTTCAGTATTTCACCAGTCATTAATTGATAAATTCCTTCAAATTTTTTTGGAGGTTTATATTTTTCTTTAAGATAAATTGGGCTGTCCTCCAATGGTATTGGTACCAACCTTAGATTTGCCAAGTGGATTTGCTTTTCTCATTGTATAAATTAAGATATAAATCATAAAAATGTGGGTTTTGAAATTTTTATTCAAGCCTTATTTCATGTTGTGGTTGCAGTTTCTTTTGAAGAGGAAGGATCAGACCAAGATTTTCAAGGTGTTATTCCATCTGGACGCGGCTTATCTACAATGACCAGACCGACAACATGATCCTCCCTTGATAAATCTTTTAAAATTATCGTTAGGGATCGATCAACTAAAGGTTTGTTTTGTTTATAATTTTATTTCACAAAAACTTTGACTCCTTTGCACTCAAGCTATTGGCTAAATTGGCATTTTGATCCAATATATATGTTCAACTTTCAATCTTGTTCGATTTTTTCTTTTTTTCTGTAATTGAACATAATAAAATACCTGTTTGTTATTGGACAGAAAAAACACTTATGGAGCTATTTTGTTATTTGTCGTGAAGTTATAATAAAAAGAACTTTAGAAGAAAACTCTAAATATAGCTCAATTAGTTATAACATAGTGTAGATGTGCAAAATTTTAAATCATTTTGGGATGATCTGAAAGGAAATAAGAAATAAGGTTTCGTTGGTCAGTAAATATTGATTATGAATATAATGTCTATTCTACAAGTTTATAGAAGTATAATCAGTCCTTTGGTTTCAAAGATACCTTACCCTCATGATTGATCATTGTATAGGATGGATACTCACTTCAAAGTACAGCCAATTATTTAGCCGAAGCATACGAAGCTGCATCACTGATCTCTCTCGGTGTAAAGTAGTTTGCTCGTCGTAAGCTGATTGGCAAACACGTGGTTTAATTGGGATGCTGTTGTTCTCGGGTTTCTTTCTTGCCATCAGGCATTAATATTGCTTGAATATACAACGATGCTGATTCATTTTTCATAATGCATCATTTAGATAGCAGCCGTAGAGCTTGATTGTGTTTTTCACAATATCAGAGTAAGATAGAATACTTTGTATGCATCGATTTTTTAGTAATATATGAATAAGTATTAGTCTTTTTTCTTGTGGTATGCCGAGGACAACTATTTTAATTCATACTCAATAATATAAATATCTTATTTTCAGTTT

mRNA sequence

TTTTTTCTGTTCTCCGCAAGCTCTGTCCTTTCCAGTTTTCTTGGATTTTTAGCCTCTAATGGCCGTTGCAACCTCTTCTACTGTTTCAAATTTGACGCAAAGACGGCCCTTGTTGAGTCTAAAAGACCAAACTACTCCCATTAAGCGGGTGAACTCGGTGCAACTCCCTTCCCGCTCTATTTCCGCTCATCTCTCTCGATTCGATGTAGACTCTCGATTTGTTGTCCCTTTGAGGAGGCACTCTCGTGACGATGGTATTGGTAGACACAAGTTTAGAAGAAATAAGGATAATGCTCGAAGACCATGTGCTTATAAAATTGGGGAGCGTGGGAATGAAACGTTGACTAATTGTATCTCTTGCTTTCTTAACCAAAAAAGAAGATGTCCCAGTATCAAAAGACCAACATCTAGATTCATTCTAGACAAGTCTGCTTTTCAGTTATCCAAGAATGAACGTGATGATAGAGTTGTGAAGCATGCTCGTATTGTTTGTGGAACTGTAGGTCCAGATGAGCCTCATGCAGCACCTACAGCCTGGCCTGATGGTATTCTGGAGAAACAAGATTTAGATGTTTCATATCCTGAGTTTGGAAGGGCAGAGTTAGAAGCATTTCTTAGTTCAGAACTTCCGTCTCATCCGAAGTTGTATAGAGGGCAGTTGAAAAATGGATTGAAGTATCTAATTTTACCAAATAAAGTTCCACCCAACAGGTTTGAAGCACACATGGAAGTTCATGTGGGATCAATTGATGAAGAAGATGACGAGCAAGGAATTGCGCACATGATTGAGCATGTAGCCTTCCTTGGAAGTAAGAAACGTGAAAAGCTTTTGGGTACAGGCGCACGATCTAATGCCTACACTGATTTCCACCATACTGTGTTTCATATACATTCACCAACTAGCACAAAGGATTCTGATGGAGATCTACTTCCATCTGTTCTTGACGCCTTAAATGAGATAGCTTTCCACCCAAAGTTCCTTGCTTCACGAGTTGAAAAAGAGAGGCGTGCCATTCTCTCTGAACTACAGATGATGAATACAATAGAATATCGTGTTGATTGTCAGCTGTTACAACATCTGCATTCTGAAAACAAGCTTAGCAAAAGGTTCCCAATCGGATTGGAAGAACAGATTAAGAAATGGGATGCTGATAAAATAAGGAAGTTCCATGAACGATGGTACTTCCCTGCAAATGCAACCTTATACATTGTTGGAGATATTGATAACATCTCAAAGGCCGTTAACCAAATTGAGGCTGTCTTTGGTGAAAGTGGCCTAGAAAATGAGGCTGTTTCTACACCTAATCCCAGTGCATTTGGTGCGATGGCTAGTTTTCTTGTTCCCAAGATCTCAGTAGGATTAGGTGGCAGTTTATCAAACGAGAGATCAAATTCAGTAGATCAATCCAAGATCGTTAAGAAAGAAAGACATGCAATCCGTCCTCCTGTGATGCACAACTGGTCTCTTCCTGGGAGCAATGTACATGCAAATCCCCCACAGATATTTCAGCATGAGTTACTACAAAATTTCTCAATTAATATGTTTTGCAAGATTCCTGTAAATAAGGTTCGAACATTTAGTGACCTGAGAAATGTTCTTATGAAGAGGATATTTCTTTCTGCCTTGCATTTTCGTATAAATACAAGATACAAGAGTTCAAATCCACCATTCACTTCCATTGAGTTGGACCACAGTGATTCTGGAAGGGAAGGTTGCACTGTCACCACACTAACAGTAACGGCTGAACCAAAGAATTGGCAAAGCGCGATTAAAGTTGCTGTTCAAGAGGTACGGAGGCTTAAAGAGTTTGGTGTAACTAAGGGTGAACTGACTCGGTATATGGATGCACTTCTAAAAGACAGTGAACACCTAGCAGCAATGATTGATAACGTGTCATCAGTTGATAATTTGGATTTTATCATGGAAAGCGATGCTTTGGGGCATACAGTTATGGACCAAAGACAAGGTCATGAAAGTTTGGTTGCTGTTGCTGGAACAGTTACTCTTGAAGAGGTGAATTCCATTGGTGCTGAAGTGTTAGAGTTCATCTCAGATTATGGAAAGCCTACTGCACCCCTTCCTGCAGCTATTGTTGCATGCGTTCCAAAGAAAGCACATATTGATGGATTGGGGGAGACAGAGTTTAAAATAACTGCAAGTGAGATAACTACTGCTATTGAGGCAGGATTGAGGGAACCAATTGAAGCTGAGCCTGAACTTGAGGTGCCAAAAGAGTTGATATCATCATCACAGATAGCGGAATTAAGGATCCAACACCAGCCATCATTTATTCGTCTCAACCCAGAGACTAATGTCACCAAATTTCATGATAAAGAAACAGGGATCACTCAGTGTCGTCTGTCAAATGGAATTCCTGTGAACTATAAGATTTCAAAAAGTGAAAACAAGGCAGGCGTGATGCGGCTTATAGTTGGTGGGGGACGAGCAGCTGAAAGCCCTGACTCACAAGGAGCTGTTGTAGTCGGTGTTCGAACTCTCAGTGAGGGAGGTCGTGTAGGAACCTTTTCACGGGAGCAGGTGGAACTTTTTTGCGTCAATCACTTGATAAATTGTTCTCTCGAGTCAACCGAAGAGTTCATTGCTATGGAATTTCGTTTCACCTTGAGAGATAATGGGATGCGTGCAGCTTTCCAACTACTTCACATGGTTCTTGAGCATAGTGTCTGGCTGGAAGATGCATTTGATAGAGCAAAGCAGTTATATATGTCATACTACCGGTCTATTCCTAAAAGCCTGGAACGTTCTACTGCTCACAAACTCATGTTAGCTATGCTGAACGGGGATGAGCGGTTTGTTGAGCCTTCCCCAAAATCACTGCAGAACTTAACATTACAAACCGTGAAGGACGCTGTGATGAATCAATTTGTGGGCAATAACATGGAGGTAAGTCTTGTTGGGGACTTTTCAGAAGAAGAAATTGAGTCTTGTATTCTAGATTACCTTGGTACAGTCACAGCAACTACAACTTCTGAGGCAGCACTTGCTTCAGTTCCCATTGTGTTTCGACCATCTGCATCTGAGTTACAATTTCAGCAGGTATTTTTAAAGGATACAGACGAAAGAGCATGCGCTTATATTTCAGGTCCTGCACCCAACCGTTGGGGTGTTACATTTGAGGGTTTAGAGTTGTTAGAATCAATTAGTCAGATTTCAAGAACAGGTGAAAGTGACGAATCTGATAATGATATCGAGAAGGGGTTGCAGAGAAAACTTCGTAGTCATCCACTCTTTTTTGGAATCACAATGGGGCTTTTGGCCGAGATTATAAATTCTAGGCTTTTCACAAGTGTGAGGGATTCTCTTGGTTTGACATATGACGTATCCTTTGAATTGAGCCTGTTTGATAGGCTTAAGCTCGGATGGTATGTTATATCTGTGACATCAACTCCAGCCAAGGTGTACAAAGCTGTTGACGCTTGCAAAAGCGTTCTGAGAGGTTTACATAGCAACAAAATTGCCCAAAGAGAGTTGGACAGGGCAAAGCGTACTCTTCTTATGAGGCATGAAGCTGAAATAAAGTCCAATGCTTATTGGCTTGGCCTATTGGCTCATCTGCAAGCGTCTTCTGTTCCACGGAAGGACCTATCGTGCATCAAAGATCTTACATCATTGTATGAAGCTGCCACCATTGATGACGTCTACATTGCTTATGATCAATTGAAAGTAGATGCAGATTCTTTGTATACGTGCATCGGGATAGCTGGAGCTCAAGCTGGGGAAGAAAGTATTGTTTCTTTTGAAGAGGAAGGATCAGACCAAGATTTTCAAGGTGTTATTCCATCTGGACGCGGCTTATCTACAATGACCAGACCGACAACATGATCCTCCCTTGATAAATCTTTTAAAATTATCGTTAGGGATCGATCAACTAAAGGATGGATACTCACTTCAAAGTACAGCCAATTATTTAGCCGAAGCATACGAAGCTGCATCACTGATCTCTCTCGGTGTAAAGTAGTTTGCTCGTCGTAAGCTGATTGGCAAACACGTGGTTTAATTGGGATGCTGTTGTTCTCGGGTTTCTTTCTTGCCATCAGGCATTAATATTGCTTGAATATACAACGATGCTGATTCATTTTTCATAATGCATCATTTAGATAGCAGCCGTAGAGCTTGATTGTGTTTTTCACAATATCAGAGTAAGATAGAATACTTTGTATGCATCGATTTTTTAGTAATATATGAATAAGTATTAGTCTTTTTTCTTGTGGTATGCCGAGGACAACTATTTTAATTCATACTCAATAATATAAATATCTTATTTTCAGTTT

Coding sequence (CDS)

ATGGCCGTTGCAACCTCTTCTACTGTTTCAAATTTGACGCAAAGACGGCCCTTGTTGAGTCTAAAAGACCAAACTACTCCCATTAAGCGGGTGAACTCGGTGCAACTCCCTTCCCGCTCTATTTCCGCTCATCTCTCTCGATTCGATGTAGACTCTCGATTTGTTGTCCCTTTGAGGAGGCACTCTCGTGACGATGGTATTGGTAGACACAAGTTTAGAAGAAATAAGGATAATGCTCGAAGACCATGTGCTTATAAAATTGGGGAGCGTGGGAATGAAACGTTGACTAATTGTATCTCTTGCTTTCTTAACCAAAAAAGAAGATGTCCCAGTATCAAAAGACCAACATCTAGATTCATTCTAGACAAGTCTGCTTTTCAGTTATCCAAGAATGAACGTGATGATAGAGTTGTGAAGCATGCTCGTATTGTTTGTGGAACTGTAGGTCCAGATGAGCCTCATGCAGCACCTACAGCCTGGCCTGATGGTATTCTGGAGAAACAAGATTTAGATGTTTCATATCCTGAGTTTGGAAGGGCAGAGTTAGAAGCATTTCTTAGTTCAGAACTTCCGTCTCATCCGAAGTTGTATAGAGGGCAGTTGAAAAATGGATTGAAGTATCTAATTTTACCAAATAAAGTTCCACCCAACAGGTTTGAAGCACACATGGAAGTTCATGTGGGATCAATTGATGAAGAAGATGACGAGCAAGGAATTGCGCACATGATTGAGCATGTAGCCTTCCTTGGAAGTAAGAAACGTGAAAAGCTTTTGGGTACAGGCGCACGATCTAATGCCTACACTGATTTCCACCATACTGTGTTTCATATACATTCACCAACTAGCACAAAGGATTCTGATGGAGATCTACTTCCATCTGTTCTTGACGCCTTAAATGAGATAGCTTTCCACCCAAAGTTCCTTGCTTCACGAGTTGAAAAAGAGAGGCGTGCCATTCTCTCTGAACTACAGATGATGAATACAATAGAATATCGTGTTGATTGTCAGCTGTTACAACATCTGCATTCTGAAAACAAGCTTAGCAAAAGGTTCCCAATCGGATTGGAAGAACAGATTAAGAAATGGGATGCTGATAAAATAAGGAAGTTCCATGAACGATGGTACTTCCCTGCAAATGCAACCTTATACATTGTTGGAGATATTGATAACATCTCAAAGGCCGTTAACCAAATTGAGGCTGTCTTTGGTGAAAGTGGCCTAGAAAATGAGGCTGTTTCTACACCTAATCCCAGTGCATTTGGTGCGATGGCTAGTTTTCTTGTTCCCAAGATCTCAGTAGGATTAGGTGGCAGTTTATCAAACGAGAGATCAAATTCAGTAGATCAATCCAAGATCGTTAAGAAAGAAAGACATGCAATCCGTCCTCCTGTGATGCACAACTGGTCTCTTCCTGGGAGCAATGTACATGCAAATCCCCCACAGATATTTCAGCATGAGTTACTACAAAATTTCTCAATTAATATGTTTTGCAAGATTCCTGTAAATAAGGTTCGAACATTTAGTGACCTGAGAAATGTTCTTATGAAGAGGATATTTCTTTCTGCCTTGCATTTTCGTATAAATACAAGATACAAGAGTTCAAATCCACCATTCACTTCCATTGAGTTGGACCACAGTGATTCTGGAAGGGAAGGTTGCACTGTCACCACACTAACAGTAACGGCTGAACCAAAGAATTGGCAAAGCGCGATTAAAGTTGCTGTTCAAGAGGTACGGAGGCTTAAAGAGTTTGGTGTAACTAAGGGTGAACTGACTCGGTATATGGATGCACTTCTAAAAGACAGTGAACACCTAGCAGCAATGATTGATAACGTGTCATCAGTTGATAATTTGGATTTTATCATGGAAAGCGATGCTTTGGGGCATACAGTTATGGACCAAAGACAAGGTCATGAAAGTTTGGTTGCTGTTGCTGGAACAGTTACTCTTGAAGAGGTGAATTCCATTGGTGCTGAAGTGTTAGAGTTCATCTCAGATTATGGAAAGCCTACTGCACCCCTTCCTGCAGCTATTGTTGCATGCGTTCCAAAGAAAGCACATATTGATGGATTGGGGGAGACAGAGTTTAAAATAACTGCAAGTGAGATAACTACTGCTATTGAGGCAGGATTGAGGGAACCAATTGAAGCTGAGCCTGAACTTGAGGTGCCAAAAGAGTTGATATCATCATCACAGATAGCGGAATTAAGGATCCAACACCAGCCATCATTTATTCGTCTCAACCCAGAGACTAATGTCACCAAATTTCATGATAAAGAAACAGGGATCACTCAGTGTCGTCTGTCAAATGGAATTCCTGTGAACTATAAGATTTCAAAAAGTGAAAACAAGGCAGGCGTGATGCGGCTTATAGTTGGTGGGGGACGAGCAGCTGAAAGCCCTGACTCACAAGGAGCTGTTGTAGTCGGTGTTCGAACTCTCAGTGAGGGAGGTCGTGTAGGAACCTTTTCACGGGAGCAGGTGGAACTTTTTTGCGTCAATCACTTGATAAATTGTTCTCTCGAGTCAACCGAAGAGTTCATTGCTATGGAATTTCGTTTCACCTTGAGAGATAATGGGATGCGTGCAGCTTTCCAACTACTTCACATGGTTCTTGAGCATAGTGTCTGGCTGGAAGATGCATTTGATAGAGCAAAGCAGTTATATATGTCATACTACCGGTCTATTCCTAAAAGCCTGGAACGTTCTACTGCTCACAAACTCATGTTAGCTATGCTGAACGGGGATGAGCGGTTTGTTGAGCCTTCCCCAAAATCACTGCAGAACTTAACATTACAAACCGTGAAGGACGCTGTGATGAATCAATTTGTGGGCAATAACATGGAGGTAAGTCTTGTTGGGGACTTTTCAGAAGAAGAAATTGAGTCTTGTATTCTAGATTACCTTGGTACAGTCACAGCAACTACAACTTCTGAGGCAGCACTTGCTTCAGTTCCCATTGTGTTTCGACCATCTGCATCTGAGTTACAATTTCAGCAGGTATTTTTAAAGGATACAGACGAAAGAGCATGCGCTTATATTTCAGGTCCTGCACCCAACCGTTGGGGTGTTACATTTGAGGGTTTAGAGTTGTTAGAATCAATTAGTCAGATTTCAAGAACAGGTGAAAGTGACGAATCTGATAATGATATCGAGAAGGGGTTGCAGAGAAAACTTCGTAGTCATCCACtCTTTTTTGGAATCACAATGGGGCTTTTGGCCGAGATTATAAATTCTAGGCTTTTCACAAGTGTGAGGGATTCTCTTGGTTTGACATATGACGTATCCTTTGAATTGAGCCTGTTTGATAGGCTTAAGCTCGGATGGTATGTTATATCTGTGACATCAACTCCAGCCAAGGTGTACAAAGCTGTTGACGCTTGCAAAAGCGTTCTGAGAGGTTTACATAGCAACAAAATTGCCCAAAGAGAGTTGGACAGGGCAAAGCGTACTCTTCTTATGAGGCATGAAGCTGAAATAAAGTCCAATGCTTATTGGCTTGGCCTATTGGCTCATCTGCAAGCGTCTTCTGTTCCACGGAAGGACCTATCGTGCATCAAAGATCTTACATCATTGTATGAAGCTGCCACCATTGATGACGTCTACATTGCTTATGATCAaTTGAAAGTAGATGCAGATTCTTTGTATACGTGCATCGGGATAGCTGGAGCTCAAGCTGGGGAAGAAAGTATTGTTTCTTTTGAAGAGGAAGGATCAGACCAAGATTTTCAAGGTGTTATTCCATCTGGACGCGGCTTATCTACAATGACCAGACCGACAACATGA

Protein sequence

MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRRHSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAFGAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPTT*
BLAST of Cucsa.178770.1 vs. Swiss-Prot
Match: SPP_ARATH (Stromal processing peptidase, chloroplastic OS=Arabidopsis thaliana GN=SPP PE=2 SV=1)

HSP 1 Score: 1834.7 bits (4751), Expect = 0.0e+00
Identity = 917/1171 (78.31%), Postives = 1036/1171 (88.47%), Query Frame = 1

Query: 98   CISCFLNQKRRCPSIKRPTSRFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAP 157
            C++C    KR    I+R      +D++AF LS++     + KH++IV  T+GPDEPHAA 
Sbjct: 100  CLTC----KRNQAGIRRALPSAFVDRTAFSLSRSSLTSSLRKHSQIVNATLGPDEPHAAG 159

Query: 158  TAWPDGIL-EKQDLDVSYPEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPP 217
            TAWPDGI+ E+QDLD+  PE   AELEAFL  ELPSHPKL+RGQLKNGL+YLILPNKVPP
Sbjct: 160  TAWPDGIVAERQDLDLLPPEIDSAELEAFLGCELPSHPKLHRGQLKNGLRYLILPNKVPP 219

Query: 218  NRFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFH 277
            NRFEAHMEVHVGSIDEE+DEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFH
Sbjct: 220  NRFEAHMEVHVGSIDEEEDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFH 279

Query: 278  IHSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQ 337
            IHSPT TKDS+ DL PSVLDALNEIAFHPKFL+SRVEKERRAILSELQMMNTIEYRVDCQ
Sbjct: 280  IHSPTHTKDSEDDLFPSVLDALNEIAFHPKFLSSRVEKERRAILSELQMMNTIEYRVDCQ 339

Query: 338  LLQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVN 397
            LLQHLHSENKL +RFPIGLEEQIKKWD DKIRKFHERWYFPANATLYIVGDIDNI + V+
Sbjct: 340  LLQHLHSENKLGRRFPIGLEEQIKKWDVDKIRKFHERWYFPANATLYIVGDIDNIPRIVH 399

Query: 398  QIEAVFGESGLENEAV-STPNPSAFGAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKK 457
             IEAVFG++GL+NE+  S+P+P AFGAMA+FLVPK+  GLGG+ SNE++N+ DQSK++K+
Sbjct: 400  NIEAVFGKNGLDNESTPSSPSPGAFGAMANFLVPKLPAGLGGTFSNEKTNTADQSKMIKR 459

Query: 458  ERHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLM 517
            ERHAIRPPV HNWSLPG++V   PPQIF+HELLQNF+INMFCKIPV+KV+TF DLRNVLM
Sbjct: 460  ERHAIRPPVEHNWSLPGTSVDLKPPQIFKHELLQNFAINMFCKIPVSKVQTFGDLRNVLM 519

Query: 518  KRIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAV 577
            KRIFLSALHFRINTRYKSSNPPFTS+ELDHSDSGREGCTVTTLTVTAEP+NWQ+A+KVAV
Sbjct: 520  KRIFLSALHFRINTRYKSSNPPFTSVELDHSDSGREGCTVTTLTVTAEPQNWQNAVKVAV 579

Query: 578  QEVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQR 637
            QEVRRLKEFGVT+GELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDAL HTVMDQ 
Sbjct: 580  QEVRRLKEFGVTRGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALSHTVMDQT 639

Query: 638  QGHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETE 697
            QGHE+LVAVAGTVTLEEVN++GA+VLEFISD+G+PTAPLPAAIVACVP K H+DG+GE++
Sbjct: 640  QGHETLVAVAGTVTLEEVNTVGAKVLEFISDFGRPTAPLPAAIVACVPTKVHVDGVGESD 699

Query: 698  FKITASEITTAIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTK 757
            F I+  EI  ++++GL  PIEAEPELEVPKELIS SQ+ EL +Q  P F+ + P + +TK
Sbjct: 700  FNISPDEIIESVKSGLLAPIEAEPELEVPKELISQSQLKELTLQRNPCFVPI-PGSGLTK 759

Query: 758  FHDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLS 817
             HDKETGITQ RLSNGI VNYK S +E++AGVMRLIVGGGRAAE+ DS+GAVVVGVRTLS
Sbjct: 760  LHDKETGITQLRLSNGIAVNYKKSTTESRAGVMRLIVGGGRAAETSDSKGAVVVGVRTLS 819

Query: 818  EGGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHS 877
            EGGRVG FSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGM+AAFQLLHMVLE S
Sbjct: 820  EGGRVGDFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMQAAFQLLHMVLERS 879

Query: 878  VWLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTV 937
            VWLEDAFDRA+QLY+SY+RSIPKSLER+TAHKLM+AMLNGDERFVEP+PKSLQ+L L++V
Sbjct: 880  VWLEDAFDRARQLYLSYFRSIPKSLERATAHKLMIAMLNGDERFVEPTPKSLQSLNLESV 939

Query: 938  KDAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASE 997
            KDAVM+ FVG+NMEVS+VGDFSEEEIE CILDYLGTV A+  S     S PI+FR   + 
Sbjct: 940  KDAVMSHFVGDNMEVSIVGDFSEEEIERCILDYLGTVKASHDSAKPPGSEPILFRQPTAG 999

Query: 998  LQFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELLESISQ--ISRTGESDESDNDIEK 1057
            LQFQQVFLKDTDERACAYI+GPAPNRWG T +G +L +S+S+  ++  G     +  +E 
Sbjct: 1000 LQFQQVFLKDTDERACAYIAGPAPNRWGFTVDGDDLFQSVSKLPVAHDGLLKSEEQLLEG 1059

Query: 1058 G---LQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWY 1117
            G   LQ+KLR+HPLFFG+TMGLLAEIINSRLFT+VRDSLGLTYDVSFEL+LFDRL LGWY
Sbjct: 1060 GDRELQKKLRAHPLFFGVTMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLNLGWY 1119

Query: 1118 VISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLL 1177
            VISVTSTP KVYKAVDACKSVLRGLHSN+IA RELDRAKRTLLMRHEAE+KSNAYWL LL
Sbjct: 1120 VISVTSTPGKVYKAVDACKSVLRGLHSNQIAPRELDRAKRTLLMRHEAELKSNAYWLNLL 1179

Query: 1178 AHLQASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES 1237
            AHLQASSVPRK+LSCIK+L SLYEAA+I+D+Y+AY+QL+VD DSLY+CIGIAGAQAGEE 
Sbjct: 1180 AHLQASSVPRKELSCIKELVSLYEAASIEDIYLAYNQLRVDEDSLYSCIGIAGAQAGEEI 1239

Query: 1238 IVSFEEEGSDQDFQGVIPSGRGLSTMTRPTT 1262
             V  EEE  +  F GV+P GRG S  TRPTT
Sbjct: 1240 TVLSEEEEPEDVFSGVVPVGRGSSMTTRPTT 1265

BLAST of Cucsa.178770.1 vs. Swiss-Prot
Match: SPP_PEA (Stromal processing peptidase, chloroplastic OS=Pisum sativum GN=SPP PE=1 SV=2)

HSP 1 Score: 1802.7 bits (4668), Expect = 0.0e+00
Identity = 904/1168 (77.40%), Postives = 1016/1168 (86.99%), Query Frame = 1

Query: 97   NCISCFL-NQKRRCPSIKRPTSRFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHA 156
            +C SC L + K+R  ++ R       D S+F LSK++     VK  ++   TVGPDEPHA
Sbjct: 93   SCTSCCLASAKKRRSNLPRFVPGAFFDSSSFGLSKDKLRHASVKRVQLPHATVGPDEPHA 152

Query: 157  APTAWPDGILEKQDLDVSYPEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVP 216
            A T W +G+ EKQDL +   E  R  LE FL SELPSHPKL+RGQLKNG++YLILPNKVP
Sbjct: 153  ASTTWQEGVAEKQDLSLFDSELER--LEGFLGSELPSHPKLHRGQLKNGIRYLILPNKVP 212

Query: 217  PNRFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVF 276
            P RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVF
Sbjct: 213  PTRFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVF 272

Query: 277  HIHSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDC 336
            HIHSPTSTKDSD DLLPSVLDALNEI FHP FLASR+EKERRAILSELQMMNTIEYRVDC
Sbjct: 273  HIHSPTSTKDSD-DLLPSVLDALNEITFHPNFLASRIEKERRAILSELQMMNTIEYRVDC 332

Query: 337  QLLQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAV 396
            QLLQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDI NI K V
Sbjct: 333  QLLQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIGNIPKTV 392

Query: 397  NQIEAVFGESGLENEAVSTPNPSAFGAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKK 456
            NQIEAVFG++G++NE  S    SAFGAMASFLVPK+SVGLGG+     +N+ DQSK+ KK
Sbjct: 393  NQIEAVFGQTGVDNEKGSVATSSAFGAMASFLVPKLSVGLGGNSIERPTNTTDQSKVFKK 452

Query: 457  ERHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLM 516
            ERHA+RPPV H WSLPGS+ +  PPQIFQHELLQNFSINMFCKIPVNKV+T+ DLR VLM
Sbjct: 453  ERHAVRPPVKHTWSLPGSSANLKPPQIFQHELLQNFSINMFCKIPVNKVQTYRDLRIVLM 512

Query: 517  KRIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAV 576
            KRIFLSALHFRINTRYKSSNPPFTS+ELDHSDSGREGCTVTTLT+TAEPKNWQ+AI+VAV
Sbjct: 513  KRIFLSALHFRINTRYKSSNPPFTSVELDHSDSGREGCTVTTLTITAEPKNWQNAIRVAV 572

Query: 577  QEVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQR 636
             EVRRLKEFGVT+GELTRY+DALL+DSEHLAAMIDNVSSVDNLDFIMESDALGH VMDQ 
Sbjct: 573  HEVRRLKEFGVTQGELTRYLDALLRDSEHLAAMIDNVSSVDNLDFIMESDALGHKVMDQS 632

Query: 637  QGHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETE 696
            QGHESL+AVAGTVTL+EVNS+GA+VLEFI+D+GK +APLPAAIVACVPKK HI+G GETE
Sbjct: 633  QGHESLIAVAGTVTLDEVNSVGAQVLEFIADFGKLSAPLPAAIVACVPKKVHIEGAGETE 692

Query: 697  FKITASEITTAIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTK 756
            FKI+++EIT A++AGL EPIE EPELEVPKEL+ SS + EL+ Q +P+FI ++PE    K
Sbjct: 693  FKISSTEITDAMKAGLDEPIEPEPELEVPKELVQSSTLQELKNQRKPAFIPVSPEIEAKK 752

Query: 757  FHDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLS 816
             HD+ETGIT+ RL+NGIPVNYKISKSE ++GVMRLIVGGGRAAE  DS+G+V+VGVRTLS
Sbjct: 753  LHDEETGITRLRLANGIPVNYKISKSETQSGVMRLIVGGGRAAEGSDSRGSVIVGVRTLS 812

Query: 817  EGGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHS 876
            EGGRVG FSREQVELFCVN+ INCSLESTEEFI++EFRFTLR+NGMRAAFQLLHMVLEHS
Sbjct: 813  EGGRVGNFSREQVELFCVNNQINCSLESTEEFISLEFRFTLRNNGMRAAFQLLHMVLEHS 872

Query: 877  VWLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTV 936
            VW +DA DRA+Q+Y+SYYRSIPKSLERSTAHKLM+AML+GDERF EP+P SL+NLTLQ+V
Sbjct: 873  VWSDDALDRARQVYLSYYRSIPKSLERSTAHKLMVAMLDGDERFTEPTPSSLENLTLQSV 932

Query: 937  KDAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASE 996
            KDAVMNQFVGNNMEVS+VGDF+EEEIESCILDYLGT  AT   +     +P  FR S S 
Sbjct: 933  KDAVMNQFVGNNMEVSIVGDFTEEEIESCILDYLGTAQATGNFKNQQQIIPPTFRLSPSS 992

Query: 997  LQFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELLESISQISRTGESDESDNDI--EK 1056
            LQ Q+VFL DTDERACAYI+GPAPNRWG T +G +LLE+I   S    +    + +  E 
Sbjct: 993  LQSQEVFLNDTDERACAYIAGPAPNRWGFTADGNDLLETIDNASSVNNNGTKSDALQTEG 1052

Query: 1057 GLQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVIS 1116
              +R LRSHPLFFGITMGLL+EIINSRLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYV+S
Sbjct: 1053 APRRSLRSHPLFFGITMGLLSEIINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVVS 1112

Query: 1117 VTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHL 1176
            VTSTP+KV+KAVDACK+VLRGLHSN I  RELDRAKRTLLMRHEAEIKSNAYWLGLLAHL
Sbjct: 1113 VTSTPSKVHKAVDACKNVLRGLHSNGITVRELDRAKRTLLMRHEAEIKSNAYWLGLLAHL 1172

Query: 1177 QASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVS 1236
            Q+SSVPRKDLSCIKDLTSLYEAATI+D  +AY+QLKVD DSLY+CIG++GAQA ++    
Sbjct: 1173 QSSSVPRKDLSCIKDLTSLYEAATIEDTCLAYEQLKVDEDSLYSCIGVSGAQAAQDIAAP 1232

Query: 1237 FEEEGSDQDFQGVIPSGRGLSTMTRPTT 1262
             EEE + + + GV+P GRGLSTMTRPTT
Sbjct: 1233 VEEEEAGEGYPGVLPMGRGLSTMTRPTT 1257

BLAST of Cucsa.178770.1 vs. Swiss-Prot
Match: SPP_ORYSJ (Stromal processing peptidase, chloroplastic OS=Oryza sativa subsp. japonica GN=SPP PE=2 SV=2)

HSP 1 Score: 1658.3 bits (4293), Expect = 0.0e+00
Identity = 850/1191 (71.37%), Postives = 970/1191 (81.44%), Query Frame = 1

Query: 74   RNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRC-PSIKRPTSRFILDKSAFQLSKNE 133
            R +  A    +  IG  G E    C+SCF   +RR  P + R     +          + 
Sbjct: 65   RRRTEAVGAASAAIGSLGEER-EGCLSCFPRGRRRGRPGLARFAPCALPHTYGLSSLHSG 124

Query: 134  RDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRAELEAFLSSELPS 193
                 ++  R V    GPDEPH A   W +  L+K  +D      G+ ELE FL++ LPS
Sbjct: 125  LTGAKIRR-RHVLHAAGPDEPHVASPTWSETALDKHYVD---QPIGKEELEGFLNTPLPS 184

Query: 194  HPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSK 253
            HPKL RGQLKNGL+YLILPNKVP NRFEAHMEVHVGSIDEE+DEQGIAHMIEHVAFLGSK
Sbjct: 185  HPKLVRGQLKNGLRYLILPNKVPANRFEAHMEVHVGSIDEEEDEQGIAHMIEHVAFLGSK 244

Query: 254  KREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRV 313
            KREKLLGTGARSNAYTDFHHTVFHIHSPT TK+   DLLPSVLDALNEIAFHPKF +SRV
Sbjct: 245  KREKLLGTGARSNAYTDFHHTVFHIHSPTKTKEYGEDLLPSVLDALNEIAFHPKFSSSRV 304

Query: 314  EKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHE 373
            EKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLS+RFPIGLEEQI KWD DKIR+FHE
Sbjct: 305  EKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSERFPIGLEEQIHKWDPDKIRRFHE 364

Query: 374  RWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAFGAMASFLVPKIS 433
            RWY+PANATLY+VG+ID+I +A+ +IEAVF  +  E EA      S FGAMAS   PK+ 
Sbjct: 365  RWYYPANATLYLVGEIDDIPRAIREIEAVFEHTLPEGEAAPMSTASPFGAMASLFAPKLP 424

Query: 434  VGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFS 493
             GL  SL+ ERS + D+ K VK+ER AIRPPV H WSLPG    A PP IFQHEL+Q+FS
Sbjct: 425  GGLAASLTGERSPAADKIKPVKRERQAIRPPVEHKWSLPGVAQDAKPPAIFQHELIQSFS 484

Query: 494  INMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREG 553
            INMFCKIPVN+V+T+ DLR+VLMKRIFLSALHFRINTRYKSSNPPFTS+ELDHSDSGREG
Sbjct: 485  INMFCKIPVNQVQTYKDLRSVLMKRIFLSALHFRINTRYKSSNPPFTSVELDHSDSGREG 544

Query: 554  CTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNV 613
            CTVTTLTVTAEP+NW+SAIKVAV EVRRLKEFGVT GE+TRYMDAL+KDSE LA MID+V
Sbjct: 545  CTVTTLTVTAEPQNWRSAIKVAVHEVRRLKEFGVTMGEMTRYMDALIKDSEQLAMMIDSV 604

Query: 614  SSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTA 673
             SVDNLDFIMESDAL HTVMDQ QGHESL+AVA TVTLEEVN++GAEVLEFISDYGKP A
Sbjct: 605  PSVDNLDFIMESDALRHTVMDQLQGHESLLAVAETVTLEEVNTVGAEVLEFISDYGKPDA 664

Query: 674  PLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPELEVPKELISSSQ 733
            PLPAAIVACVPKK H+DG+GET+F+I   EIT +I+AGL EPI  EPELEVPKELI+ S+
Sbjct: 665  PLPAAIVACVPKKVHMDGVGETDFEIHPEEITDSIKAGLEEPIYPEPELEVPKELITRSE 724

Query: 734  IAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIV 793
            + +L++Q +PSF  L+ E NV K  D ETGI Q RLSNGI +NYKI+++E + GVMRLIV
Sbjct: 725  LEDLKLQRKPSFASLSKEENVVKIFDDETGIAQRRLSNGISINYKITQNEARVGVMRLIV 784

Query: 794  GGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEF 853
            GGGRA E  +S+G+V+VGVRTLSEGG VG FSREQVELFCVN+LINCSLES EEFI MEF
Sbjct: 785  GGGRATEDSESKGSVIVGVRTLSEGGCVGNFSREQVELFCVNNLINCSLESNEEFIFMEF 844

Query: 854  RFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAM 913
            RF LRDNGMRAAFQLLHMVLEH+VWLEDAFDRA QLY+SYYRSIPKSLERSTAHKLMLAM
Sbjct: 845  RFALRDNGMRAAFQLLHMVLEHNVWLEDAFDRATQLYLSYYRSIPKSLERSTAHKLMLAM 904

Query: 914  LNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTV 973
            LN DERFVEPSP SLQ LTLQ+VKDAVMNQFVG+NMEVS+VGDF+EEE+ESC+LDYLGTV
Sbjct: 905  LNHDERFVEPSPHSLQKLTLQSVKDAVMNQFVGDNMEVSIVGDFTEEEVESCVLDYLGTV 964

Query: 974  TATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELL 1033
            +A  +S+       I F P  S+L FQQV++KDTDERACAYI+GPAPNRWG   EG +L 
Sbjct: 965  SAPKSSKTQEHIEKISFLPFPSDLHFQQVYIKDTDERACAYIAGPAPNRWGFATEGNDLF 1024

Query: 1034 ESISQISRTGESDESDN-DIEKGLQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGL 1093
              I   S   +  ES N D+ +     +RSH LFFGIT+ LLAEIINSRLFT+VRDS+GL
Sbjct: 1025 NVIRSSSGDAQVSESANTDLTERKHNDVRSHSLFFGITLSLLAEIINSRLFTTVRDSMGL 1084

Query: 1094 TYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRT 1153
            TYDVSFEL+LFD+L LGWYVI+VTSTP+KV+KAVDACK VLRGLHSNKI +RELDRAKRT
Sbjct: 1085 TYDVSFELNLFDKLDLGWYVIAVTSTPSKVHKAVDACKGVLRGLHSNKIVERELDRAKRT 1144

Query: 1154 LLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVD 1213
            LLM+HEAE K+NAYWLGLLAHLQ+SSVPRK++SCIK+LT LYE+ATI+D+Y+AY+ LKVD
Sbjct: 1145 LLMKHEAETKTNAYWLGLLAHLQSSSVPRKEISCIKELTMLYESATIEDLYLAYEHLKVD 1204

Query: 1214 ADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIP-SGRGLSTMTRPTT 1262
              SL+ CIGIAGA++GEE+     ++  D    G+ P  GRGLSTMTRPTT
Sbjct: 1205 ESSLFACIGIAGAESGEET----TDDELDMGLHGMGPIGGRGLSTMTRPTT 1246

BLAST of Cucsa.178770.1 vs. Swiss-Prot
Match: SPP_ORYSI (Stromal processing peptidase, chloroplastic OS=Oryza sativa subsp. indica GN=OsI_23765 PE=1 SV=2)

HSP 1 Score: 1656.7 bits (4289), Expect = 0.0e+00
Identity = 849/1191 (71.28%), Postives = 970/1191 (81.44%), Query Frame = 1

Query: 74   RNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRC-PSIKRPTSRFILDKSAFQLSKNE 133
            R +  A    +  IG  G E    C+SCF   +RR  P + R     +          + 
Sbjct: 65   RRRTEAVGAASAAIGSLGEER-EGCLSCFPRGRRRGRPGLARFAPCALPHTYGLSSLHSG 124

Query: 134  RDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRAELEAFLSSELPS 193
                 ++  R V    GPDEPH A   W +  L+K  +D      G+ ELE FL++ LPS
Sbjct: 125  LTGAKIRR-RHVLHAAGPDEPHVASPTWSETALDKHYVD---QPIGKEELEGFLNTPLPS 184

Query: 194  HPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSK 253
            HPKL RGQLKNGL+YLILPNKVP NRFEAHMEVHVGSIDEE+DEQGIAHMIEHVAFLGSK
Sbjct: 185  HPKLVRGQLKNGLRYLILPNKVPANRFEAHMEVHVGSIDEEEDEQGIAHMIEHVAFLGSK 244

Query: 254  KREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRV 313
            KREKLLGTGARSNAYTDFHHTVFHIHSPT TK+   DLLPSVLDALNEIAFHPKF +SRV
Sbjct: 245  KREKLLGTGARSNAYTDFHHTVFHIHSPTKTKEYGEDLLPSVLDALNEIAFHPKFSSSRV 304

Query: 314  EKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHE 373
            EKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLS+RFPIGLEEQI KWD DKIR+FHE
Sbjct: 305  EKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSERFPIGLEEQIHKWDPDKIRRFHE 364

Query: 374  RWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAFGAMASFLVPKIS 433
            RWY+PANATLY+VG+I++I +A+ +IEAVF  +  E EA      S FGAMAS   PK+ 
Sbjct: 365  RWYYPANATLYLVGEINDIPRAIREIEAVFEHTLPEGEAAPMSTASPFGAMASLFAPKLP 424

Query: 434  VGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFS 493
             GL  SL+ ERS + D+ K VK+ER AIRPPV H WSLPG    A PP IFQHEL+Q+FS
Sbjct: 425  GGLAASLTGERSPAADKIKPVKRERQAIRPPVEHKWSLPGVAQDAKPPAIFQHELIQSFS 484

Query: 494  INMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREG 553
            INMFCKIPVN+V+T+ DLR+VLMKRIFLSALHFRINTRYKSSNPPFTS+ELDHSDSGREG
Sbjct: 485  INMFCKIPVNQVQTYKDLRSVLMKRIFLSALHFRINTRYKSSNPPFTSVELDHSDSGREG 544

Query: 554  CTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNV 613
            CTVTTLTVTAEP+NW+SAIKVAV EVRRLKEFGVT GE+TRYMDAL+KDSE LA MID+V
Sbjct: 545  CTVTTLTVTAEPQNWRSAIKVAVHEVRRLKEFGVTMGEMTRYMDALIKDSEQLAMMIDSV 604

Query: 614  SSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTA 673
             SVDNLDFIMESDAL HTVMDQ QGHESL+AVA TVTLEEVN++GAEVLEFISDYGKP A
Sbjct: 605  PSVDNLDFIMESDALRHTVMDQLQGHESLLAVAETVTLEEVNTVGAEVLEFISDYGKPDA 664

Query: 674  PLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPELEVPKELISSSQ 733
            PLPAAIVACVPKK H+DG+GET+F+I   EIT +I+AGL EPI  EPELEVPKELI+ S+
Sbjct: 665  PLPAAIVACVPKKVHMDGVGETDFEIHPEEITDSIKAGLEEPIYPEPELEVPKELITQSE 724

Query: 734  IAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIV 793
            + +L++Q +PSF  L+ E NV K  D ETGI Q RLSNGI +NYKI+++E + GVMRLIV
Sbjct: 725  LEDLKLQRKPSFASLSKEENVVKIFDDETGIAQRRLSNGISINYKITQNEARVGVMRLIV 784

Query: 794  GGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEF 853
            GGGRA E  +S+G+V+VGVRTLSEGG VG FSREQVELFCVN+LINCSLES EEFI MEF
Sbjct: 785  GGGRATEDSESKGSVIVGVRTLSEGGCVGNFSREQVELFCVNNLINCSLESNEEFIFMEF 844

Query: 854  RFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAM 913
            RF LRDNGMRAAFQLLHMVLEH+VWLEDAFDRA QLY+SYYRSIPKSLERSTAHKLMLAM
Sbjct: 845  RFALRDNGMRAAFQLLHMVLEHNVWLEDAFDRATQLYLSYYRSIPKSLERSTAHKLMLAM 904

Query: 914  LNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTV 973
            LN DERFVEPSP SLQ LTLQ+VKDAVMNQFVG+NMEVS+VGDF+EEE+ESC+LDYLGTV
Sbjct: 905  LNHDERFVEPSPHSLQKLTLQSVKDAVMNQFVGDNMEVSIVGDFTEEEVESCVLDYLGTV 964

Query: 974  TATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELL 1033
            +A  +S+       I F P  S+L FQQV++KDTDERACAYI+GPAPNRWG   EG +L 
Sbjct: 965  SAPKSSKTQEHIEKISFLPFPSDLHFQQVYIKDTDERACAYIAGPAPNRWGFATEGNDLF 1024

Query: 1034 ESISQISRTGESDESDN-DIEKGLQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGL 1093
              I   S   +  ES N D+ +     +RSH LFFGIT+ LLAEIINSRLFT+VRDS+GL
Sbjct: 1025 NVIRSSSGDAQVSESANTDLTERKHNDVRSHSLFFGITLSLLAEIINSRLFTTVRDSMGL 1084

Query: 1094 TYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRT 1153
            TYDVSFEL+LFD+L LGWYVI+VTSTP+KV+KAVDACK VLRGLHSNKI +RELDRAKRT
Sbjct: 1085 TYDVSFELNLFDKLDLGWYVIAVTSTPSKVHKAVDACKGVLRGLHSNKIVERELDRAKRT 1144

Query: 1154 LLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVD 1213
            LLM+HEAE K+NAYWLGLLAHLQ+SSVPRK++SCIK+LT LYE+ATI+D+Y+AY+ LKVD
Sbjct: 1145 LLMKHEAETKTNAYWLGLLAHLQSSSVPRKEISCIKELTMLYESATIEDLYLAYEHLKVD 1204

Query: 1214 ADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIP-SGRGLSTMTRPTT 1262
              SL+ CIGIAGA++GEE+     ++  D    G+ P  GRGLSTMTRPTT
Sbjct: 1205 ESSLFACIGIAGAESGEET----TDDELDMGLHGMGPIGGRGLSTMTRPTT 1246

BLAST of Cucsa.178770.1 vs. Swiss-Prot
Match: PQQL_ECOLI (Probable zinc protease PqqL OS=Escherichia coli (strain K12) GN=pqqL PE=3 SV=2)

HSP 1 Score: 119.8 bits (299), Expect = 2.2e-25
Identity = 70/215 (32.56%), Postives = 112/215 (52.09%), Query Frame = 1

Query: 186 LSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIAHMIEH 245
           +++ LP   KL  GQL NGL+Y+I P+  P ++    +++H GS+ EED+E G+AH +EH
Sbjct: 25  IAAALPQDEKLITGQLDNGLRYMIYPHAHPKDQVNLWLQIHTGSLQEEDNELGVAHFVEH 84

Query: 246 VAFLGSK--KREKLLGT--------GARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVL 305
           + F G+K     K++ T        G   NAYT +  TV+ +  PT+ K +    L  V+
Sbjct: 85  MMFNGTKTWPGNKVIETFESMGLRFGRDVNAYTSYDETVYQVSLPTTQKQN----LQQVM 144

Query: 306 DALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGL 365
              +E +    F    V+ ER  I  E +     ++R        L +  +   R PIGL
Sbjct: 145 AIFSEWSNAATFEKLEVDAERGVITEEWRAHQDAKWRTSQARRPFLLANTRNLDREPIGL 204

Query: 366 EEQIKKWDADKIRKFHERWYFPANATLYIVGDIDN 391
            + +      ++R+F++RWY P N T  +VGDID+
Sbjct: 205 MDTVATVTPAQLRQFYQRWYQPNNMTFIVVGDIDS 235

BLAST of Cucsa.178770.1 vs. TrEMBL
Match: A0A0A0LH02_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G072490 PE=4 SV=1)

HSP 1 Score: 2491.8 bits (6457), Expect = 0.0e+00
Identity = 1260/1261 (99.92%), Postives = 1260/1261 (99.92%), Query Frame = 1

Query: 1    MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR 60
            MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR
Sbjct: 1    MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR 60

Query: 61   HSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120
            HSRDDGIGRHKFRRNKDNARRPCAYKIGE GNETLTNCISCFLNQKRRCPSIKRPTSRFI
Sbjct: 61   HSRDDGIGRHKFRRNKDNARRPCAYKIGEHGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120

Query: 121  LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180
            LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA
Sbjct: 121  LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180

Query: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240
            ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA
Sbjct: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240

Query: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300
            HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE
Sbjct: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300

Query: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360
            IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK
Sbjct: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360

Query: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF 420
            KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF
Sbjct: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF 420

Query: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP 480
            GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP
Sbjct: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP 480

Query: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540
            QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS
Sbjct: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540

Query: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600
            IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK
Sbjct: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600

Query: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660
            DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV
Sbjct: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660

Query: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE 720
            LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE
Sbjct: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE 720

Query: 721  LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780
            LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK
Sbjct: 721  LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780

Query: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840
            SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS
Sbjct: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840

Query: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900
            LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL
Sbjct: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900

Query: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960
            ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE
Sbjct: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960

Query: 961  IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020
            IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN
Sbjct: 961  IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020

Query: 1021 RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080
            RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR
Sbjct: 1021 RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080

Query: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140
            LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI
Sbjct: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140

Query: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200
            AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD
Sbjct: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200

Query: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260
            VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT
Sbjct: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260

Query: 1261 T 1262
            T
Sbjct: 1261 T 1261

BLAST of Cucsa.178770.1 vs. TrEMBL
Match: D7UDI7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g00160 PE=4 SV=1)

HSP 1 Score: 1951.4 bits (5054), Expect = 0.0e+00
Identity = 989/1171 (84.46%), Postives = 1054/1171 (90.01%), Query Frame = 1

Query: 98   CISCFLNQKRRCPSIKRPTSRFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAP 157
            CISCFLN  R C SIKR   R   DKS F L K+  D+  VK  R++  TVGPDEPHAA 
Sbjct: 108  CISCFLNHPRSCSSIKRFVPRVFSDKSTFPLLKHTLDNVSVKRVRVLNATVGPDEPHAAS 167

Query: 158  TAWPDGILEKQDLDVSYPEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPN 217
            TAWPDGILEKQ LD+  PE GRAELE FL SELPSHPKLYRGQLKNGL+YLILPNKVPPN
Sbjct: 168  TAWPDGILEKQGLDLVDPEIGRAELEGFLCSELPSHPKLYRGQLKNGLRYLILPNKVPPN 227

Query: 218  RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI 277
            RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI
Sbjct: 228  RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI 287

Query: 278  HSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQL 337
            HSPTSTKDSDGDLLP VLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQL
Sbjct: 288  HSPTSTKDSDGDLLPFVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQL 347

Query: 338  LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQ 397
            LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISK V Q
Sbjct: 348  LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKTVYQ 407

Query: 398  IEAVFGESGLENEAVSTPNPSAFGAMASFLVPKISVGLGGSLSNERSN-SVDQSKIVKKE 457
            IEA+FG++G+ENE  + P PSAFGAMASFLVPK+SVGL GSLS++RS   VDQSK  KKE
Sbjct: 408  IEAIFGQTGMENETAAAPTPSAFGAMASFLVPKLSVGLAGSLSHDRSPIPVDQSKFTKKE 467

Query: 458  RHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMK 517
            RHA+RPPV HNWSLPGSN     PQIFQHELLQNFSINMFCKIPVNKV+T+ DLRNVLMK
Sbjct: 468  RHAVRPPVKHNWSLPGSNEDMKSPQIFQHELLQNFSINMFCKIPVNKVQTYGDLRNVLMK 527

Query: 518  RIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQ 577
            RIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQ
Sbjct: 528  RIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQ 587

Query: 578  EVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQ 637
            EVRRLKEFGVTKGEL RY+DALLKDSE LAAMIDNVSSVDNLDFIMESDALGH VMDQRQ
Sbjct: 588  EVRRLKEFGVTKGELARYLDALLKDSEQLAAMIDNVSSVDNLDFIMESDALGHMVMDQRQ 647

Query: 638  GHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEF 697
            GHESLVAVAGTVTLEEVNS GA+VLEFISD+GKPTAPLPAAIVACVP K H++G GE EF
Sbjct: 648  GHESLVAVAGTVTLEEVNSTGAKVLEFISDFGKPTAPLPAAIVACVPTKVHVEGSGEIEF 707

Query: 698  KITASEITTAIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKF 757
            KI+  EIT AI+AGL EPIEAEPELEVPKELISSSQ+ +LR++  PSFI L+PE NVTK 
Sbjct: 708  KISPIEITDAIKAGLEEPIEAEPELEVPKELISSSQLQKLRVERSPSFIPLSPEVNVTKV 767

Query: 758  HDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSE 817
            +D ETGITQ RLSNGIPVNYKIS++E + GVMRLIVGGGRAAES +S+GAVVVGVRTLSE
Sbjct: 768  YDNETGITQLRLSNGIPVNYKISRNEARGGVMRLIVGGGRAAESFESRGAVVVGVRTLSE 827

Query: 818  GGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSV 877
            GGRVG FSREQVELFCVNHLINCSLESTEEFI MEFRFTLRDNGMRAAFQLLHMVLEHSV
Sbjct: 828  GGRVGNFSREQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFQLLHMVLEHSV 887

Query: 878  WLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVK 937
            WL+DAFDRA+QLY+SYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQ+VK
Sbjct: 888  WLDDAFDRARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQSVK 947

Query: 938  DAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASEL 997
            DAVMNQFVG+NMEVS+VGDFSEE+IESCILDY+GTV A+  SE    S  I+FR   S+L
Sbjct: 948  DAVMNQFVGDNMEVSVVGDFSEEDIESCILDYMGTVRASRDSEIEQQSSSIMFRSYPSDL 1007

Query: 998  QFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELLESISQIS------RTGESDESDND 1057
            QFQQVFLKDTDERACAYI+GPAPNRWG T EG +L ESI+ IS         ES     D
Sbjct: 1008 QFQQVFLKDTDERACAYIAGPAPNRWGFTIEGKDLFESINNISVDDDEEPQSESLSEMKD 1067

Query: 1058 IEKGLQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWY 1117
              K LQRKLR+HPLFFGITMGLLAEIINSRLFT+VRDSLGLTYDVSFELSLFDRLKLGWY
Sbjct: 1068 CRKDLQRKLRNHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWY 1127

Query: 1118 VISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLL 1177
            VISVTSTP KVYKAVDACK+VLRGLHS+KIAQRELDRAKRTLLMRHEAE K+NAYWLGLL
Sbjct: 1128 VISVTSTPGKVYKAVDACKNVLRGLHSSKIAQRELDRAKRTLLMRHEAETKANAYWLGLL 1187

Query: 1178 AHLQASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES 1237
            AHLQAS+VPRKD+SCIKDLTSLYEAATI+D+Y+AY+QLKVD +SLY+CIGIAGAQA EE 
Sbjct: 1188 AHLQASTVPRKDISCIKDLTSLYEAATIEDIYLAYEQLKVDENSLYSCIGIAGAQAAEE- 1247

Query: 1238 IVSFEEEGSDQDFQGVIPSGRGLSTMTRPTT 1262
             +S EEE SD+  QGVIP+GRGLSTMTRPTT
Sbjct: 1248 -ISVEEEESDEGLQGVIPAGRGLSTMTRPTT 1276

BLAST of Cucsa.178770.1 vs. TrEMBL
Match: B9IAB8_POPTR (Pitrilysin family protein OS=Populus trichocarpa GN=POPTR_0014s14670g PE=4 SV=2)

HSP 1 Score: 1926.0 bits (4988), Expect = 0.0e+00
Identity = 992/1270 (78.11%), Postives = 1088/1270 (85.67%), Query Frame = 1

Query: 2    AVATSSTVSNLTQRR-PLLSLKDQTTPIK-RVNSVQLPSRSISAHLSRFDVDSRFVVPLR 61
            A ++S  + N+ Q R PL   KD T   K R+N +Q P R            S   V  +
Sbjct: 5    AASSSVLMMNVPQTRAPLTPPKDDTLSRKNRINLIQ-PRRLPLIRFHSNHHQSWNSVSSK 64

Query: 62   RHSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETLT----NCISCFLNQKRRCPSIKRP 121
            R S +   G     R K+NA + C+  +GER           C+SC LN+ R   SIK  
Sbjct: 65   RWSHEIATGGSGSLRKKNNAWKQCSSSLGERVVGAYFPEQFKCMSCSLNRLRSRYSIKGS 124

Query: 122  TS---RFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDV 181
            T    R  +DKSAF LS +  D   VKH  + C ++GP+EPHAA    PDGILE+QD D+
Sbjct: 125  TPTIPRAFVDKSAFNLSGHSLDTASVKHVHVPCTSMGPNEPHAASIGCPDGILERQDSDL 184

Query: 182  SYPEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDE 241
               E  RA L  FL SELP HPKL+RGQLKNGL+YLILPNKVPPNRFEAHMEVH GSIDE
Sbjct: 185  LDSELERARLFEFLHSELPCHPKLHRGQLKNGLRYLILPNKVPPNRFEAHMEVHAGSIDE 244

Query: 242  EDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLP 301
            EDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKD+DGDLLP
Sbjct: 245  EDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDADGDLLP 304

Query: 302  SVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFP 361
            SVLDALNEIAFHP FLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFP
Sbjct: 305  SVLDALNEIAFHPSFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFP 364

Query: 362  IGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAV 421
            IGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISK V+QIE VFG++GLE E V
Sbjct: 365  IGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKTVHQIENVFGQTGLETETV 424

Query: 422  STPNPSAFGAMASFLVPKISVGLGGSLSNERSN-SVDQSKIVKKERHAIRPPVMHNWSLP 481
            S P+PSAFGAMASFLVPK+SVGL GS S E+S+ S+DQSKI+KKERHA+RPPV H WSLP
Sbjct: 425  SAPSPSAFGAMASFLVPKLSVGLPGSSSREKSSISLDQSKIIKKERHAVRPPVEHYWSLP 484

Query: 482  GSNVHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRY 541
            GSN +  PPQIFQHE LQNFSINMFCKIPV+KV+T+ DLRNVLMKRIFLSALHFRINTRY
Sbjct: 485  GSNANLKPPQIFQHEFLQNFSINMFCKIPVSKVQTYGDLRNVLMKRIFLSALHFRINTRY 544

Query: 542  KSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGEL 601
            KSSNPPFTS+ELDHSDSGREGCTVTTLTVTAEPKNWQ+AIKVAVQEVRRLKEFGVTKGEL
Sbjct: 545  KSSNPPFTSVELDHSDSGREGCTVTTLTVTAEPKNWQNAIKVAVQEVRRLKEFGVTKGEL 604

Query: 602  TRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLE 661
             RYMDALLKDSEHLAAMIDNVSSVDNL+FIMESDALGHTVMDQRQGHESL  VAGTVTLE
Sbjct: 605  NRYMDALLKDSEHLAAMIDNVSSVDNLEFIMESDALGHTVMDQRQGHESLFGVAGTVTLE 664

Query: 662  EVNSIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGL 721
            EVNSIGA++LEFISD+GKPTAP+PAAIVACVP K + DGLGETEFKI++SEI  AI++GL
Sbjct: 665  EVNSIGAKLLEFISDFGKPTAPIPAAIVACVPSKVYFDGLGETEFKISSSEIIAAIKSGL 724

Query: 722  REPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNG 781
             E IEAEPELEVPKELI+S+Q+ ELR+Q  PSFI L P+ + TK HD ETGITQCRLSNG
Sbjct: 725  EEAIEAEPELEVPKELITSTQLEELRLQLTPSFIPLVPDADYTKLHDPETGITQCRLSNG 784

Query: 782  IPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELF 841
            I VNYKISKSE++ GVMRLIVGGGRAAES +S+GAVVVGVRTLSEGGRVG FSREQVELF
Sbjct: 785  IAVNYKISKSESRGGVMRLIVGGGRAAESSESKGAVVVGVRTLSEGGRVGNFSREQVELF 844

Query: 842  CVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMS 901
            CVNHLINCSLESTEEFI MEFRFTLRDNGMRAAF+LLHMVLEHSVWL+DA DRA+QLY+S
Sbjct: 845  CVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFELLHMVLEHSVWLDDALDRARQLYLS 904

Query: 902  YYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVS 961
            YYRSIPKSLER+TAHKLM AMLNGDERF+EP+P+SLQNLTL++VKDAVMNQFVG NMEVS
Sbjct: 905  YYRSIPKSLERATAHKLMTAMLNGDERFIEPTPQSLQNLTLKSVKDAVMNQFVGGNMEVS 964

Query: 962  LVGDFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERAC 1021
            +VGDFSEEEIESCI+DYLGTV AT  S+      P++FRPS S+LQFQQVFLKDTDERAC
Sbjct: 965  IVGDFSEEEIESCIIDYLGTVRATRDSDREQEFNPVMFRPSPSDLQFQQVFLKDTDERAC 1024

Query: 1022 AYISGPAPNRWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMG 1081
            AYI+GPAPNRWG T +G +L ES S IS+         D++K  Q KLRSHPLFFGITMG
Sbjct: 1025 AYIAGPAPNRWGFTVDGKDLFESTSGISQIDR-----KDVQKDKQGKLRSHPLFFGITMG 1084

Query: 1082 LLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSV 1141
            LLAEIINSRLFT+VRDSLGLTYDVSFELSLFDRLKLGWYV+SVTSTP KV+KAVDACKSV
Sbjct: 1085 LLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWYVVSVTSTPGKVHKAVDACKSV 1144

Query: 1142 LRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTS 1201
            LRGLHSNK+AQRELDRAKRTLLMRHE EIKSNAYWLGLLAHLQASSVPRKD+SCIKDLTS
Sbjct: 1145 LRGLHSNKVAQRELDRAKRTLLMRHETEIKSNAYWLGLLAHLQASSVPRKDVSCIKDLTS 1204

Query: 1202 LYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGR 1261
            LYEAATI+D+Y+AY+QLKVD DSLY+CIG+AGAQAGEE I + EEE +D DFQGVIP GR
Sbjct: 1205 LYEAATIEDIYVAYEQLKVDEDSLYSCIGVAGAQAGEE-INALEEEETDDDFQGVIPVGR 1264

BLAST of Cucsa.178770.1 vs. TrEMBL
Match: A0A061FMU7_THECC (Insulinase (Peptidase family M16) family protein isoform 2 OS=Theobroma cacao GN=TCM_034790 PE=4 SV=1)

HSP 1 Score: 1917.5 bits (4966), Expect = 0.0e+00
Identity = 988/1288 (76.71%), Postives = 1088/1288 (84.47%), Query Frame = 1

Query: 3    VATSSTVSNLTQRRPLLSLKDQTT------PIKRVNSVQLPSRSISAHLSRF-------- 62
            +ATSS+ S++    PL  ++  T+       I RVNS  +P+ S+     RF        
Sbjct: 1    MATSSS-SSVVSGIPLSQIRWDTSRGGGGRSIVRVNS-PVPTASLPLTRFRFHLPPTPAA 60

Query: 63   -----DVDSRFVVPLRRHSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFL 122
                 DV SR  + L   + ++   R K ++ K  A +   +      ++     I C  
Sbjct: 61   ARWQGDVRSRNTIGLEFQNWNNNTIRKK-KKKKSGASKAWKWTSPVSFSQQCKRWIPCCS 120

Query: 123  NQKRRCPSIKRP------TSRFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAP 182
            +  R C     P         F  DKS F LS +  +    KH    C TVGPDEPHAA 
Sbjct: 121  SLNRNCYRKIAPLPLATSVPAFFPDKSCFPLSAHTLNTTSGKHICAPCATVGPDEPHAAS 180

Query: 183  TAWPDGILEKQDLDVSYPEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPN 242
            T WPDG+LEKQD D  YP+F   ELE FLS++LPSHPKL+RGQLKNGL+YLILPNKVPPN
Sbjct: 181  TTWPDGLLEKQDFDSLYPQFQTTELEGFLSTQLPSHPKLHRGQLKNGLRYLILPNKVPPN 240

Query: 243  RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI 302
            RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI
Sbjct: 241  RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI 300

Query: 303  HSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQL 362
            HSPT TK+SD DLLP VLDALNEIAFHPKFL+SRVEKERRAILSELQMMNTIEYRVDCQL
Sbjct: 301  HSPTCTKESDEDLLPLVLDALNEIAFHPKFLSSRVEKERRAILSELQMMNTIEYRVDCQL 360

Query: 363  LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQ 422
            LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFP NATLYIVGDIDNISK + Q
Sbjct: 361  LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPGNATLYIVGDIDNISKTIYQ 420

Query: 423  IEAVFGESGLENEAVSTPNPSAFGAMASFLVPKISVGLGGSLSNER-SNSVDQSKIVKKE 482
            IEAVFG++ LENE    P  SAFGAMASFLVPK+S GL GS S+ER SN  DQ+KI+KKE
Sbjct: 421  IEAVFGQTALENEMPPPPTSSAFGAMASFLVPKLSAGLAGSSSHERLSNPADQAKIIKKE 480

Query: 483  RHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMK 542
            +HA+RPPV H WSLPG N    PPQIFQHELLQNFSINMFCKIPVNKV+TF DLRNVLMK
Sbjct: 481  KHAVRPPVKHIWSLPGHNTDMKPPQIFQHELLQNFSINMFCKIPVNKVQTFGDLRNVLMK 540

Query: 543  RIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQ 602
            RIFLSALHFRINTRYKSSNPPFTS+ELDHSDSGREGCTVTTLTVTAEPKNWQ+AIKVAVQ
Sbjct: 541  RIFLSALHFRINTRYKSSNPPFTSVELDHSDSGREGCTVTTLTVTAEPKNWQNAIKVAVQ 600

Query: 603  EVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQ 662
            EVRRLKEFGVTKGELTRYMDALLKDSE LAAMIDNVSSVDNLDFIMESDALGHTVMDQ Q
Sbjct: 601  EVRRLKEFGVTKGELTRYMDALLKDSEQLAAMIDNVSSVDNLDFIMESDALGHTVMDQTQ 660

Query: 663  GHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEF 722
            GHESL+AVAGTVTL+EVNSIGA+VLEFISD+GKPTAPLPAAIVACVPKK H+DG+GETEF
Sbjct: 661  GHESLMAVAGTVTLDEVNSIGAQVLEFISDFGKPTAPLPAAIVACVPKKVHLDGIGETEF 720

Query: 723  KITASEITTAIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKF 782
            KIT SEIT AI++GL EPIEAEPELEVPKELIS  Q+ ELR+Q  PSFI L+ E NVTK 
Sbjct: 721  KITPSEITAAIKSGLEEPIEAEPELEVPKELISPLQLQELRMQRGPSFIPLSAEMNVTKV 780

Query: 783  HDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSE 842
             DKETGITQ RLSNGIPVNYKISK+E + GVMRLIVGGGRAAE+ DS+GAVVVGVRTLSE
Sbjct: 781  QDKETGITQLRLSNGIPVNYKISKNEARGGVMRLIVGGGRAAETSDSKGAVVVGVRTLSE 840

Query: 843  GGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSV 902
            GGRVG FSREQVELFCVNHLINCSLESTEEFI+MEFRFTLRDNGM AAFQLLHMVLEHSV
Sbjct: 841  GGRVGNFSREQVELFCVNHLINCSLESTEEFISMEFRFTLRDNGMHAAFQLLHMVLEHSV 900

Query: 903  WLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVK 962
            WL+DAFDRA+QLY+SYYRSIPKSLERSTAHKLMLAM+NGDERFVEP+PKSLQNLTL++VK
Sbjct: 901  WLDDAFDRARQLYLSYYRSIPKSLERSTAHKLMLAMMNGDERFVEPTPKSLQNLTLKSVK 960

Query: 963  DAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASEL 1022
            DAVMNQFVG+NMEVS+VGDFSEEEIESC+LDYLGTV A+  SE A    PI+FRPS S+L
Sbjct: 961  DAVMNQFVGDNMEVSIVGDFSEEEIESCVLDYLGTVRASRDSERAHGFSPILFRPSPSDL 1020

Query: 1023 QFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELLESISQISRTGES---DESDNDIEK 1082
            QFQQVFLKDTDERACAYI+GPAPNRWG+T +G +LLES++ I    ++    +   DI+K
Sbjct: 1021 QFQQVFLKDTDERACAYIAGPAPNRWGLTVDGQDLLESVADIPSADDAQPHSDEGKDIQK 1080

Query: 1083 GLQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVIS 1142
             LQ+KLR HPLFFGITMGLLAE+INSRLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYVIS
Sbjct: 1081 DLQKKLRGHPLFFGITMGLLAEVINSRLFTTVRDSLGLTYDVSFELNLFDRLKLGWYVIS 1140

Query: 1143 VTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHL 1202
            VTSTP+KVY+AVDACK+VLRGLH+NKIA REL+RAKRTLLMRHEAEIKSNAYWLGLLAHL
Sbjct: 1141 VTSTPSKVYRAVDACKNVLRGLHTNKIAPRELERAKRTLLMRHEAEIKSNAYWLGLLAHL 1200

Query: 1203 QASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVS 1262
            QASSVPRKD+SC+K+LTSLYEAA+I+D+Y+AYDQLKVD DSLY+CIGIAG  AGE +  S
Sbjct: 1201 QASSVPRKDISCVKELTSLYEAASIEDIYLAYDQLKVDEDSLYSCIGIAGVHAGEGTTAS 1260

BLAST of Cucsa.178770.1 vs. TrEMBL
Match: B9RRK9_RICCO (Pitrilysin, putative OS=Ricinus communis GN=RCOM_1645120 PE=4 SV=1)

HSP 1 Score: 1916.7 bits (4964), Expect = 0.0e+00
Identity = 986/1274 (77.39%), Postives = 1091/1274 (85.64%), Query Frame = 1

Query: 2    AVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRS-ISAHLSRFDVDSRFVVPLRR 61
            A ++SS + ++ Q R  LS  D     +RVN +Q P    +S  L++F   +       +
Sbjct: 11   AASSSSLLMSVPQIRSCLSPSDN----RRVNRLQPPRLPRLSTPLAQFHQKNS------Q 70

Query: 62   HSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETL----TNCISCFLNQKRRC--PSIKR 121
               + G G     R K+NA    +  +GER  E+      NC+SCFLN  RR    S+ R
Sbjct: 71   WQHEVGYGGSGSCRKKNNAWERRSSLLGERVAESSFTKQNNCVSCFLNHSRRGRRTSVTR 130

Query: 122  PTSRFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSY 181
                   DKSAF L         V+   + C +VGP+EPHAA TA PDGILE+QD D+ Y
Sbjct: 131  RIPGAFADKSAFHLPGFAS----VRGVHVPCASVGPNEPHAASTACPDGILERQDSDLLY 190

Query: 182  PEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEED 241
            PE  R  L  FLS+ELP+HPKLYRGQLKNGL+YLILPNKVPPNRFEAHMEVH GSIDEE+
Sbjct: 191  PELVRTGLAEFLSTELPTHPKLYRGQLKNGLRYLILPNKVPPNRFEAHMEVHAGSIDEEE 250

Query: 242  DEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSV 301
            DEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPT+TKD DGDLLPSV
Sbjct: 251  DEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTTTKDGDGDLLPSV 310

Query: 302  LDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIG 361
            LDALNEIAFHPKFL+SRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIG
Sbjct: 311  LDALNEIAFHPKFLSSRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIG 370

Query: 362  LEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVST 421
            LEEQIKKWDADKIRKFHERWYFPANATLYIVGDID ISK V+QIE VFG++GL+ E  S 
Sbjct: 371  LEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDKISKTVHQIETVFGQTGLDIETASA 430

Query: 422  PNPSAFGAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSN 481
            P PSAFGAMASFLVPK+SVGL GS   + S+S DQSK +++ERHA+RPPV HNWSLPGSN
Sbjct: 431  PAPSAFGAMASFLVPKLSVGLPGS-PEKVSSSTDQSKSLRRERHAVRPPVQHNWSLPGSN 490

Query: 482  VHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSS 541
                PPQIFQHELLQ+FS NMFCKIPVNKVRT+ DLRNVLMKRIFLSALHFRINTRYKSS
Sbjct: 491  DCMKPPQIFQHELLQHFSYNMFCKIPVNKVRTYGDLRNVLMKRIFLSALHFRINTRYKSS 550

Query: 542  NPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRY 601
            NPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRY
Sbjct: 551  NPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRY 610

Query: 602  MDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVN 661
            MDALLKDSEHLAAMIDNVSSVDNL+FIMESDALGH VMDQRQGHESLVAVAGTVTLEEVN
Sbjct: 611  MDALLKDSEHLAAMIDNVSSVDNLEFIMESDALGHIVMDQRQGHESLVAVAGTVTLEEVN 670

Query: 662  SIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREP 721
            SIGA+VLEFISD+G+PTAPLPAAIVACVP K HIDG+GE EFKI+ SEITTAI++GL EP
Sbjct: 671  SIGAKVLEFISDFGRPTAPLPAAIVACVPNKVHIDGVGEAEFKISPSEITTAIKSGLEEP 730

Query: 722  IEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPV 781
            IEAEPELEVPKELIS+SQ+ ELR+Q +PSF+ L PE N+ K HD+ETGITQCRLSNGI V
Sbjct: 731  IEAEPELEVPKELISTSQLEELRLQRRPSFVPLLPEVNILKSHDQETGITQCRLSNGIAV 790

Query: 782  NYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVN 841
            NYKIS+SE++ GVMRLIVGGGRAAE+ +S+GAV+VGVRTLSEGGRVG FSREQVELFCVN
Sbjct: 791  NYKISRSESRGGVMRLIVGGGRAAETTESKGAVIVGVRTLSEGGRVGNFSREQVELFCVN 850

Query: 842  HLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYR 901
            HLINCSLESTEEFI MEFRFTLRDNGMRAAF+LLHMVLEHSVWL+DAFDRA+QLY+SYYR
Sbjct: 851  HLINCSLESTEEFICMEFRFTLRDNGMRAAFELLHMVLEHSVWLDDAFDRARQLYLSYYR 910

Query: 902  SIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVG 961
            SIPKSLER+TAHKLM AMLNGDERFVEP+P+SL+NLTL++VKDAVMNQFVG+NMEVS+VG
Sbjct: 911  SIPKSLERATAHKLMTAMLNGDERFVEPTPQSLENLTLKSVKDAVMNQFVGDNMEVSIVG 970

Query: 962  DFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYI 1021
            DFSEEEIESCI+DYLGTV  T  S  A   VPI+FRPS S+LQ QQVFLKDTDERACAYI
Sbjct: 971  DFSEEEIESCIIDYLGTVRETRGSVGAAKFVPILFRPS-SDLQSQQVFLKDTDERACAYI 1030

Query: 1022 SGPAPNRWGVTFEGLELLESISQISRTGESDESD-------NDIEKGLQRKLRSHPLFFG 1081
            +GPAPNRWG T +G +L ESIS I+   ++            D+++  QRKLRSHPLFFG
Sbjct: 1031 AGPAPNRWGFTVDGKDLFESISDIAVVPDAQSKSEQPLMGRKDVQEDWQRKLRSHPLFFG 1090

Query: 1082 ITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDA 1141
            ITMGLLAEIINSRLFT+VRDSLGLTYDVSFELSLFDRL LGWYVISVTSTP+KVYKAVDA
Sbjct: 1091 ITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLNLGWYVISVTSTPSKVYKAVDA 1150

Query: 1142 CKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIK 1201
            CKSVLRGL+SNKIA RELDRAKRTLLMRHEAE+KSNAYWLGLLAHLQASSVPRKD+SCIK
Sbjct: 1151 CKSVLRGLYSNKIAPRELDRAKRTLLMRHEAEVKSNAYWLGLLAHLQASSVPRKDISCIK 1210

Query: 1202 DLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVI 1261
            DLTSLYEAATIDD+Y+AY+QLK+D DSLY+CIG+AG+QAG+E  V  EEE ++  FQGVI
Sbjct: 1211 DLTSLYEAATIDDIYLAYEQLKIDDDSLYSCIGVAGSQAGDEITVPLEEEETENGFQGVI 1268

BLAST of Cucsa.178770.1 vs. TAIR10
Match: AT5G42390.1 (AT5G42390.1 Insulinase (Peptidase family M16) family protein)

HSP 1 Score: 1834.7 bits (4751), Expect = 0.0e+00
Identity = 917/1171 (78.31%), Postives = 1036/1171 (88.47%), Query Frame = 1

Query: 98   CISCFLNQKRRCPSIKRPTSRFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAP 157
            C++C    KR    I+R      +D++AF LS++     + KH++IV  T+GPDEPHAA 
Sbjct: 100  CLTC----KRNQAGIRRALPSAFVDRTAFSLSRSSLTSSLRKHSQIVNATLGPDEPHAAG 159

Query: 158  TAWPDGIL-EKQDLDVSYPEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPP 217
            TAWPDGI+ E+QDLD+  PE   AELEAFL  ELPSHPKL+RGQLKNGL+YLILPNKVPP
Sbjct: 160  TAWPDGIVAERQDLDLLPPEIDSAELEAFLGCELPSHPKLHRGQLKNGLRYLILPNKVPP 219

Query: 218  NRFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFH 277
            NRFEAHMEVHVGSIDEE+DEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFH
Sbjct: 220  NRFEAHMEVHVGSIDEEEDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFH 279

Query: 278  IHSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQ 337
            IHSPT TKDS+ DL PSVLDALNEIAFHPKFL+SRVEKERRAILSELQMMNTIEYRVDCQ
Sbjct: 280  IHSPTHTKDSEDDLFPSVLDALNEIAFHPKFLSSRVEKERRAILSELQMMNTIEYRVDCQ 339

Query: 338  LLQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVN 397
            LLQHLHSENKL +RFPIGLEEQIKKWD DKIRKFHERWYFPANATLYIVGDIDNI + V+
Sbjct: 340  LLQHLHSENKLGRRFPIGLEEQIKKWDVDKIRKFHERWYFPANATLYIVGDIDNIPRIVH 399

Query: 398  QIEAVFGESGLENEAV-STPNPSAFGAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKK 457
             IEAVFG++GL+NE+  S+P+P AFGAMA+FLVPK+  GLGG+ SNE++N+ DQSK++K+
Sbjct: 400  NIEAVFGKNGLDNESTPSSPSPGAFGAMANFLVPKLPAGLGGTFSNEKTNTADQSKMIKR 459

Query: 458  ERHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLM 517
            ERHAIRPPV HNWSLPG++V   PPQIF+HELLQNF+INMFCKIPV+KV+TF DLRNVLM
Sbjct: 460  ERHAIRPPVEHNWSLPGTSVDLKPPQIFKHELLQNFAINMFCKIPVSKVQTFGDLRNVLM 519

Query: 518  KRIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAV 577
            KRIFLSALHFRINTRYKSSNPPFTS+ELDHSDSGREGCTVTTLTVTAEP+NWQ+A+KVAV
Sbjct: 520  KRIFLSALHFRINTRYKSSNPPFTSVELDHSDSGREGCTVTTLTVTAEPQNWQNAVKVAV 579

Query: 578  QEVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQR 637
            QEVRRLKEFGVT+GELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDAL HTVMDQ 
Sbjct: 580  QEVRRLKEFGVTRGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALSHTVMDQT 639

Query: 638  QGHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETE 697
            QGHE+LVAVAGTVTLEEVN++GA+VLEFISD+G+PTAPLPAAIVACVP K H+DG+GE++
Sbjct: 640  QGHETLVAVAGTVTLEEVNTVGAKVLEFISDFGRPTAPLPAAIVACVPTKVHVDGVGESD 699

Query: 698  FKITASEITTAIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTK 757
            F I+  EI  ++++GL  PIEAEPELEVPKELIS SQ+ EL +Q  P F+ + P + +TK
Sbjct: 700  FNISPDEIIESVKSGLLAPIEAEPELEVPKELISQSQLKELTLQRNPCFVPI-PGSGLTK 759

Query: 758  FHDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLS 817
             HDKETGITQ RLSNGI VNYK S +E++AGVMRLIVGGGRAAE+ DS+GAVVVGVRTLS
Sbjct: 760  LHDKETGITQLRLSNGIAVNYKKSTTESRAGVMRLIVGGGRAAETSDSKGAVVVGVRTLS 819

Query: 818  EGGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHS 877
            EGGRVG FSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGM+AAFQLLHMVLE S
Sbjct: 820  EGGRVGDFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMQAAFQLLHMVLERS 879

Query: 878  VWLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTV 937
            VWLEDAFDRA+QLY+SY+RSIPKSLER+TAHKLM+AMLNGDERFVEP+PKSLQ+L L++V
Sbjct: 880  VWLEDAFDRARQLYLSYFRSIPKSLERATAHKLMIAMLNGDERFVEPTPKSLQSLNLESV 939

Query: 938  KDAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASE 997
            KDAVM+ FVG+NMEVS+VGDFSEEEIE CILDYLGTV A+  S     S PI+FR   + 
Sbjct: 940  KDAVMSHFVGDNMEVSIVGDFSEEEIERCILDYLGTVKASHDSAKPPGSEPILFRQPTAG 999

Query: 998  LQFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELLESISQ--ISRTGESDESDNDIEK 1057
            LQFQQVFLKDTDERACAYI+GPAPNRWG T +G +L +S+S+  ++  G     +  +E 
Sbjct: 1000 LQFQQVFLKDTDERACAYIAGPAPNRWGFTVDGDDLFQSVSKLPVAHDGLLKSEEQLLEG 1059

Query: 1058 G---LQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWY 1117
            G   LQ+KLR+HPLFFG+TMGLLAEIINSRLFT+VRDSLGLTYDVSFEL+LFDRL LGWY
Sbjct: 1060 GDRELQKKLRAHPLFFGVTMGLLAEIINSRLFTTVRDSLGLTYDVSFELNLFDRLNLGWY 1119

Query: 1118 VISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLL 1177
            VISVTSTP KVYKAVDACKSVLRGLHSN+IA RELDRAKRTLLMRHEAE+KSNAYWL LL
Sbjct: 1120 VISVTSTPGKVYKAVDACKSVLRGLHSNQIAPRELDRAKRTLLMRHEAELKSNAYWLNLL 1179

Query: 1178 AHLQASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES 1237
            AHLQASSVPRK+LSCIK+L SLYEAA+I+D+Y+AY+QL+VD DSLY+CIGIAGAQAGEE 
Sbjct: 1180 AHLQASSVPRKELSCIKELVSLYEAASIEDIYLAYNQLRVDEDSLYSCIGIAGAQAGEEI 1239

Query: 1238 IVSFEEEGSDQDFQGVIPSGRGLSTMTRPTT 1262
             V  EEE  +  F GV+P GRG S  TRPTT
Sbjct: 1240 TVLSEEEEPEDVFSGVVPVGRGSSMTTRPTT 1265

BLAST of Cucsa.178770.1 vs. TAIR10
Match: AT5G56730.1 (AT5G56730.1 Insulinase (Peptidase family M16) protein)

HSP 1 Score: 99.4 bits (246), Expect = 1.7e-20
Identity = 69/245 (28.16%), Postives = 116/245 (47.35%), Query Frame = 1

Query: 181 ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240
           ++E  L +EL      Y G+L NGL Y +  N  P  R    + V VGS+ EE+D++G+A
Sbjct: 26  DMEQELGNELEPFGADY-GRLDNGLIYYVRRNSKPRMRAALALAVKVGSVLEEEDQRGVA 85

Query: 241 HMIEHVAFLGSKKRE-----KLLGT-----GARSNAYTDFHHTVFHIHSPTSTKDSDGDL 300
           H++EH+AF  + +       K L +     G   NA T    T++ +  P        +L
Sbjct: 86  HIVEHLAFSATTRYTNHDIVKFLESIGAEFGPCQNAMTTADETIYELFVPVDKP----EL 145

Query: 301 LPSVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKR 360
           L   +  L E +   +     +EKER A++ E +       R+     Q +   +K ++R
Sbjct: 146 LSQAISILAEFSSEIRVSKEDLEKERGAVMEEYRGNRNATGRMQDSHWQLMMEGSKYAER 205

Query: 361 FPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENE 416
            PIGLE+ I+   A  +++F+++WY   N  +  VGD  +    V+ I+  F +    +E
Sbjct: 206 LPIGLEKVIRSVPAATVKQFYQKWYHLCNMAVVAVGDFPDTKTVVDLIKTHFEDKRSSSE 265

BLAST of Cucsa.178770.1 vs. NCBI nr
Match: gi|449470360|ref|XP_004152885.1| (PREDICTED: uncharacterized protein LOC101202810 isoform X1 [Cucumis sativus])

HSP 1 Score: 2491.8 bits (6457), Expect = 0.0e+00
Identity = 1260/1261 (99.92%), Postives = 1260/1261 (99.92%), Query Frame = 1

Query: 1    MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR 60
            MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR
Sbjct: 1    MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR 60

Query: 61   HSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120
            HSRDDGIGRHKFRRNKDNARRPCAYKIGE GNETLTNCISCFLNQKRRCPSIKRPTSRFI
Sbjct: 61   HSRDDGIGRHKFRRNKDNARRPCAYKIGEHGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120

Query: 121  LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180
            LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA
Sbjct: 121  LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180

Query: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240
            ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA
Sbjct: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240

Query: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300
            HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE
Sbjct: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300

Query: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360
            IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK
Sbjct: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360

Query: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF 420
            KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF
Sbjct: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF 420

Query: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP 480
            GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP
Sbjct: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP 480

Query: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540
            QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS
Sbjct: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540

Query: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600
            IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK
Sbjct: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600

Query: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660
            DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV
Sbjct: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660

Query: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE 720
            LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE
Sbjct: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE 720

Query: 721  LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780
            LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK
Sbjct: 721  LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780

Query: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840
            SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS
Sbjct: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840

Query: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900
            LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL
Sbjct: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900

Query: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960
            ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE
Sbjct: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960

Query: 961  IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020
            IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN
Sbjct: 961  IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020

Query: 1021 RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080
            RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR
Sbjct: 1021 RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080

Query: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140
            LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI
Sbjct: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140

Query: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200
            AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD
Sbjct: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200

Query: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260
            VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT
Sbjct: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260

Query: 1261 T 1262
            T
Sbjct: 1261 T 1261

BLAST of Cucsa.178770.1 vs. NCBI nr
Match: gi|778667777|ref|XP_011648983.1| (PREDICTED: uncharacterized protein LOC101202810 isoform X2 [Cucumis sativus])

HSP 1 Score: 2484.9 bits (6439), Expect = 0.0e+00
Identity = 1259/1261 (99.84%), Postives = 1259/1261 (99.84%), Query Frame = 1

Query: 1    MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR 60
            MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR
Sbjct: 1    MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR 60

Query: 61   HSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120
            HSRDDGIGRHKFRRNKDNARRPCAYKIGE GNETLTNCISCFLNQKRRCPSIKRPTSRFI
Sbjct: 61   HSRDDGIGRHKFRRNKDNARRPCAYKIGEHGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120

Query: 121  LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180
            LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA
Sbjct: 121  LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180

Query: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240
            ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA
Sbjct: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240

Query: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300
            HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE
Sbjct: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300

Query: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360
            IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK
Sbjct: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360

Query: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF 420
            KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF
Sbjct: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF 420

Query: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP 480
            GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP
Sbjct: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP 480

Query: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540
            QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS
Sbjct: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540

Query: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600
            IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK
Sbjct: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600

Query: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660
            DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV
Sbjct: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660

Query: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE 720
            LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE
Sbjct: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE 720

Query: 721  LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780
            LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK
Sbjct: 721  LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780

Query: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840
            SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS
Sbjct: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840

Query: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900
            LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL
Sbjct: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900

Query: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960
            ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE
Sbjct: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960

Query: 961  IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020
            IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN
Sbjct: 961  IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020

Query: 1021 RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080
            RWGVTFEGLELLESISQISRT ESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR
Sbjct: 1021 RWGVTFEGLELLESISQISRT-ESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080

Query: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140
            LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI
Sbjct: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140

Query: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200
            AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD
Sbjct: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200

Query: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260
            VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT
Sbjct: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260

Query: 1261 T 1262
            T
Sbjct: 1261 T 1260

BLAST of Cucsa.178770.1 vs. NCBI nr
Match: gi|659082574|ref|XP_008441914.1| (PREDICTED: uncharacterized protein LOC103485911 isoform X1 [Cucumis melo])

HSP 1 Score: 2457.2 bits (6367), Expect = 0.0e+00
Identity = 1241/1261 (98.41%), Postives = 1251/1261 (99.21%), Query Frame = 1

Query: 1    MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR 60
            MAVATSSTVSNLT RRPLLSLKDQTTPIKRVNSVQLPSRSI AHLSRFDV+SRFVVPLRR
Sbjct: 1    MAVATSSTVSNLTHRRPLLSLKDQTTPIKRVNSVQLPSRSICAHLSRFDVESRFVVPLRR 60

Query: 61   HSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120
            +SR+DGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI
Sbjct: 61   YSREDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120

Query: 121  LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180
            LDKSAFQLSKNERD +VVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA
Sbjct: 121  LDKSAFQLSKNERDAKVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180

Query: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240
            ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA
Sbjct: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240

Query: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300
            HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE
Sbjct: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300

Query: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360
            IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK
Sbjct: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360

Query: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF 420
            KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGE+GLENEAVSTPNPSAF
Sbjct: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGETGLENEAVSTPNPSAF 420

Query: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP 480
            GAMASFLVPKISVGLGGSLSNERSNSVDQSKI+KKERHAIRPPV H WSLPGSNV ANPP
Sbjct: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIIKKERHAIRPPVKHYWSLPGSNVDANPP 480

Query: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540
            QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS
Sbjct: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540

Query: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600
            IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK
Sbjct: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600

Query: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660
            DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV
Sbjct: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660

Query: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE 720
            LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEI TAIEAGLREPIEAEPE
Sbjct: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEIITAIEAGLREPIEAEPE 720

Query: 721  LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780
            LEVPKELISSSQI ELR+QHQPSF+ LNPETNVTKFHDKETGITQCRLSNGIPVNYKISK
Sbjct: 721  LEVPKELISSSQIVELRMQHQPSFVPLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780

Query: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840
            SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS
Sbjct: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840

Query: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900
            LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL
Sbjct: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900

Query: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960
            ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE
Sbjct: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960

Query: 961  IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020
            IESCILDYLGTVTAT TSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN
Sbjct: 961  IESCILDYLGTVTATKTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020

Query: 1021 RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080
            RWGVTFEGLELLES+SQISRTGESDESD+DIEKGLQRKLRSHPLFFGITMGLLAEIINSR
Sbjct: 1021 RWGVTFEGLELLESVSQISRTGESDESDSDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080

Query: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140
            LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI
Sbjct: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140

Query: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200
            AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD
Sbjct: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200

Query: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260
            VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT
Sbjct: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260

Query: 1261 T 1262
            T
Sbjct: 1261 T 1261

BLAST of Cucsa.178770.1 vs. NCBI nr
Match: gi|659082576|ref|XP_008441915.1| (PREDICTED: uncharacterized protein LOC103485911 isoform X2 [Cucumis melo])

HSP 1 Score: 2450.2 bits (6349), Expect = 0.0e+00
Identity = 1240/1261 (98.33%), Postives = 1250/1261 (99.13%), Query Frame = 1

Query: 1    MAVATSSTVSNLTQRRPLLSLKDQTTPIKRVNSVQLPSRSISAHLSRFDVDSRFVVPLRR 60
            MAVATSSTVSNLT RRPLLSLKDQTTPIKRVNSVQLPSRSI AHLSRFDV+SRFVVPLRR
Sbjct: 1    MAVATSSTVSNLTHRRPLLSLKDQTTPIKRVNSVQLPSRSICAHLSRFDVESRFVVPLRR 60

Query: 61   HSRDDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120
            +SR+DGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI
Sbjct: 61   YSREDGIGRHKFRRNKDNARRPCAYKIGERGNETLTNCISCFLNQKRRCPSIKRPTSRFI 120

Query: 121  LDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180
            LDKSAFQLSKNERD +VVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA
Sbjct: 121  LDKSAFQLSKNERDAKVVKHARIVCGTVGPDEPHAAPTAWPDGILEKQDLDVSYPEFGRA 180

Query: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240
            ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA
Sbjct: 181  ELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPNRFEAHMEVHVGSIDEEDDEQGIA 240

Query: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300
            HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE
Sbjct: 241  HMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHIHSPTSTKDSDGDLLPSVLDALNE 300

Query: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360
            IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK
Sbjct: 301  IAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLSKRFPIGLEEQIK 360

Query: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGESGLENEAVSTPNPSAF 420
            KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGE+GLENEAVSTPNPSAF
Sbjct: 361  KWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQIEAVFGETGLENEAVSTPNPSAF 420

Query: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIVKKERHAIRPPVMHNWSLPGSNVHANPP 480
            GAMASFLVPKISVGLGGSLSNERSNSVDQSKI+KKERHAIRPPV H WSLPGSNV ANPP
Sbjct: 421  GAMASFLVPKISVGLGGSLSNERSNSVDQSKIIKKERHAIRPPVKHYWSLPGSNVDANPP 480

Query: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540
            QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS
Sbjct: 481  QIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMKRIFLSALHFRINTRYKSSNPPFTS 540

Query: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600
            IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK
Sbjct: 541  IELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQEVRRLKEFGVTKGELTRYMDALLK 600

Query: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660
            DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV
Sbjct: 601  DSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQGHESLVAVAGTVTLEEVNSIGAEV 660

Query: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEITTAIEAGLREPIEAEPE 720
            LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEI TAIEAGLREPIEAEPE
Sbjct: 661  LEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEFKITASEIITAIEAGLREPIEAEPE 720

Query: 721  LEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780
            LEVPKELISSSQI ELR+QHQPSF+ LNPETNVTKFHDKETGITQCRLSNGIPVNYKISK
Sbjct: 721  LEVPKELISSSQIVELRMQHQPSFVPLNPETNVTKFHDKETGITQCRLSNGIPVNYKISK 780

Query: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840
            SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS
Sbjct: 781  SENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSEGGRVGTFSREQVELFCVNHLINCS 840

Query: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900
            LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL
Sbjct: 841  LESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSVWLEDAFDRAKQLYMSYYRSIPKSL 900

Query: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960
            ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE
Sbjct: 901  ERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVKDAVMNQFVGNNMEVSLVGDFSEEE 960

Query: 961  IESCILDYLGTVTATTTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020
            IESCILDYLGTVTAT TSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN
Sbjct: 961  IESCILDYLGTVTATKTSEAALASVPIVFRPSASELQFQQVFLKDTDERACAYISGPAPN 1020

Query: 1021 RWGVTFEGLELLESISQISRTGESDESDNDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080
            RWGVTFEGLELLES+SQISRT ESDESD+DIEKGLQRKLRSHPLFFGITMGLLAEIINSR
Sbjct: 1021 RWGVTFEGLELLESVSQISRT-ESDESDSDIEKGLQRKLRSHPLFFGITMGLLAEIINSR 1080

Query: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140
            LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI
Sbjct: 1081 LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKI 1140

Query: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200
            AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD
Sbjct: 1141 AQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDD 1200

Query: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260
            VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT
Sbjct: 1201 VYIAYDQLKVDADSLYTCIGIAGAQAGEESIVSFEEEGSDQDFQGVIPSGRGLSTMTRPT 1260

Query: 1261 T 1262
            T
Sbjct: 1261 T 1260

BLAST of Cucsa.178770.1 vs. NCBI nr
Match: gi|731427631|ref|XP_002277544.3| (PREDICTED: uncharacterized protein LOC100266746 [Vitis vinifera])

HSP 1 Score: 1951.4 bits (5054), Expect = 0.0e+00
Identity = 989/1171 (84.46%), Postives = 1054/1171 (90.01%), Query Frame = 1

Query: 98   CISCFLNQKRRCPSIKRPTSRFILDKSAFQLSKNERDDRVVKHARIVCGTVGPDEPHAAP 157
            CISCFLN  R C SIKR   R   DKS F L K+  D+  VK  R++  TVGPDEPHAA 
Sbjct: 108  CISCFLNHPRSCSSIKRFVPRVFSDKSTFPLLKHTLDNVSVKRVRVLNATVGPDEPHAAS 167

Query: 158  TAWPDGILEKQDLDVSYPEFGRAELEAFLSSELPSHPKLYRGQLKNGLKYLILPNKVPPN 217
            TAWPDGILEKQ LD+  PE GRAELE FL SELPSHPKLYRGQLKNGL+YLILPNKVPPN
Sbjct: 168  TAWPDGILEKQGLDLVDPEIGRAELEGFLCSELPSHPKLYRGQLKNGLRYLILPNKVPPN 227

Query: 218  RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI 277
            RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI
Sbjct: 228  RFEAHMEVHVGSIDEEDDEQGIAHMIEHVAFLGSKKREKLLGTGARSNAYTDFHHTVFHI 287

Query: 278  HSPTSTKDSDGDLLPSVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQL 337
            HSPTSTKDSDGDLLP VLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQL
Sbjct: 288  HSPTSTKDSDGDLLPFVLDALNEIAFHPKFLASRVEKERRAILSELQMMNTIEYRVDCQL 347

Query: 338  LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKAVNQ 397
            LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISK V Q
Sbjct: 348  LQHLHSENKLSKRFPIGLEEQIKKWDADKIRKFHERWYFPANATLYIVGDIDNISKTVYQ 407

Query: 398  IEAVFGESGLENEAVSTPNPSAFGAMASFLVPKISVGLGGSLSNERSN-SVDQSKIVKKE 457
            IEA+FG++G+ENE  + P PSAFGAMASFLVPK+SVGL GSLS++RS   VDQSK  KKE
Sbjct: 408  IEAIFGQTGMENETAAAPTPSAFGAMASFLVPKLSVGLAGSLSHDRSPIPVDQSKFTKKE 467

Query: 458  RHAIRPPVMHNWSLPGSNVHANPPQIFQHELLQNFSINMFCKIPVNKVRTFSDLRNVLMK 517
            RHA+RPPV HNWSLPGSN     PQIFQHELLQNFSINMFCKIPVNKV+T+ DLRNVLMK
Sbjct: 468  RHAVRPPVKHNWSLPGSNEDMKSPQIFQHELLQNFSINMFCKIPVNKVQTYGDLRNVLMK 527

Query: 518  RIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQ 577
            RIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQ
Sbjct: 528  RIFLSALHFRINTRYKSSNPPFTSIELDHSDSGREGCTVTTLTVTAEPKNWQSAIKVAVQ 587

Query: 578  EVRRLKEFGVTKGELTRYMDALLKDSEHLAAMIDNVSSVDNLDFIMESDALGHTVMDQRQ 637
            EVRRLKEFGVTKGEL RY+DALLKDSE LAAMIDNVSSVDNLDFIMESDALGH VMDQRQ
Sbjct: 588  EVRRLKEFGVTKGELARYLDALLKDSEQLAAMIDNVSSVDNLDFIMESDALGHMVMDQRQ 647

Query: 638  GHESLVAVAGTVTLEEVNSIGAEVLEFISDYGKPTAPLPAAIVACVPKKAHIDGLGETEF 697
            GHESLVAVAGTVTLEEVNS GA+VLEFISD+GKPTAPLPAAIVACVP K H++G GE EF
Sbjct: 648  GHESLVAVAGTVTLEEVNSTGAKVLEFISDFGKPTAPLPAAIVACVPTKVHVEGSGEIEF 707

Query: 698  KITASEITTAIEAGLREPIEAEPELEVPKELISSSQIAELRIQHQPSFIRLNPETNVTKF 757
            KI+  EIT AI+AGL EPIEAEPELEVPKELISSSQ+ +LR++  PSFI L+PE NVTK 
Sbjct: 708  KISPIEITDAIKAGLEEPIEAEPELEVPKELISSSQLQKLRVERSPSFIPLSPEVNVTKV 767

Query: 758  HDKETGITQCRLSNGIPVNYKISKSENKAGVMRLIVGGGRAAESPDSQGAVVVGVRTLSE 817
            +D ETGITQ RLSNGIPVNYKIS++E + GVMRLIVGGGRAAES +S+GAVVVGVRTLSE
Sbjct: 768  YDNETGITQLRLSNGIPVNYKISRNEARGGVMRLIVGGGRAAESFESRGAVVVGVRTLSE 827

Query: 818  GGRVGTFSREQVELFCVNHLINCSLESTEEFIAMEFRFTLRDNGMRAAFQLLHMVLEHSV 877
            GGRVG FSREQVELFCVNHLINCSLESTEEFI MEFRFTLRDNGMRAAFQLLHMVLEHSV
Sbjct: 828  GGRVGNFSREQVELFCVNHLINCSLESTEEFICMEFRFTLRDNGMRAAFQLLHMVLEHSV 887

Query: 878  WLEDAFDRAKQLYMSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQTVK 937
            WL+DAFDRA+QLY+SYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQ+VK
Sbjct: 888  WLDDAFDRARQLYLSYYRSIPKSLERSTAHKLMLAMLNGDERFVEPSPKSLQNLTLQSVK 947

Query: 938  DAVMNQFVGNNMEVSLVGDFSEEEIESCILDYLGTVTATTTSEAALASVPIVFRPSASEL 997
            DAVMNQFVG+NMEVS+VGDFSEE+IESCILDY+GTV A+  SE    S  I+FR   S+L
Sbjct: 948  DAVMNQFVGDNMEVSVVGDFSEEDIESCILDYMGTVRASRDSEIEQQSSSIMFRSYPSDL 1007

Query: 998  QFQQVFLKDTDERACAYISGPAPNRWGVTFEGLELLESISQIS------RTGESDESDND 1057
            QFQQVFLKDTDERACAYI+GPAPNRWG T EG +L ESI+ IS         ES     D
Sbjct: 1008 QFQQVFLKDTDERACAYIAGPAPNRWGFTIEGKDLFESINNISVDDDEEPQSESLSEMKD 1067

Query: 1058 IEKGLQRKLRSHPLFFGITMGLLAEIINSRLFTSVRDSLGLTYDVSFELSLFDRLKLGWY 1117
              K LQRKLR+HPLFFGITMGLLAEIINSRLFT+VRDSLGLTYDVSFELSLFDRLKLGWY
Sbjct: 1068 CRKDLQRKLRNHPLFFGITMGLLAEIINSRLFTTVRDSLGLTYDVSFELSLFDRLKLGWY 1127

Query: 1118 VISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLL 1177
            VISVTSTP KVYKAVDACK+VLRGLHS+KIAQRELDRAKRTLLMRHEAE K+NAYWLGLL
Sbjct: 1128 VISVTSTPGKVYKAVDACKNVLRGLHSSKIAQRELDRAKRTLLMRHEAETKANAYWLGLL 1187

Query: 1178 AHLQASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES 1237
            AHLQAS+VPRKD+SCIKDLTSLYEAATI+D+Y+AY+QLKVD +SLY+CIGIAGAQA EE 
Sbjct: 1188 AHLQASTVPRKDISCIKDLTSLYEAATIEDIYLAYEQLKVDENSLYSCIGIAGAQAAEE- 1247

Query: 1238 IVSFEEEGSDQDFQGVIPSGRGLSTMTRPTT 1262
             +S EEE SD+  QGVIP+GRGLSTMTRPTT
Sbjct: 1248 -ISVEEEESDEGLQGVIPAGRGLSTMTRPTT 1276

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
SPP_ARATH0.0e+0078.31Stromal processing peptidase, chloroplastic OS=Arabidopsis thaliana GN=SPP PE=2 ... [more]
SPP_PEA0.0e+0077.40Stromal processing peptidase, chloroplastic OS=Pisum sativum GN=SPP PE=1 SV=2[more]
SPP_ORYSJ0.0e+0071.37Stromal processing peptidase, chloroplastic OS=Oryza sativa subsp. japonica GN=S... [more]
SPP_ORYSI0.0e+0071.28Stromal processing peptidase, chloroplastic OS=Oryza sativa subsp. indica GN=OsI... [more]
PQQL_ECOLI2.2e-2532.56Probable zinc protease PqqL OS=Escherichia coli (strain K12) GN=pqqL PE=3 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0LH02_CUCSA0.0e+0099.92Uncharacterized protein OS=Cucumis sativus GN=Csa_2G072490 PE=4 SV=1[more]
D7UDI7_VITVI0.0e+0084.46Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g00160 PE=4 SV=... [more]
B9IAB8_POPTR0.0e+0078.11Pitrilysin family protein OS=Populus trichocarpa GN=POPTR_0014s14670g PE=4 SV=2[more]
A0A061FMU7_THECC0.0e+0076.71Insulinase (Peptidase family M16) family protein isoform 2 OS=Theobroma cacao GN... [more]
B9RRK9_RICCO0.0e+0077.39Pitrilysin, putative OS=Ricinus communis GN=RCOM_1645120 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G42390.10.0e+0078.31 Insulinase (Peptidase family M16) family protein[more]
AT5G56730.11.7e-2028.16 Insulinase (Peptidase family M16) protein[more]
Match NameE-valueIdentityDescription
gi|449470360|ref|XP_004152885.1|0.0e+0099.92PREDICTED: uncharacterized protein LOC101202810 isoform X1 [Cucumis sativus][more]
gi|778667777|ref|XP_011648983.1|0.0e+0099.84PREDICTED: uncharacterized protein LOC101202810 isoform X2 [Cucumis sativus][more]
gi|659082574|ref|XP_008441914.1|0.0e+0098.41PREDICTED: uncharacterized protein LOC103485911 isoform X1 [Cucumis melo][more]
gi|659082576|ref|XP_008441915.1|0.0e+0098.33PREDICTED: uncharacterized protein LOC103485911 isoform X2 [Cucumis melo][more]
gi|731427631|ref|XP_002277544.3|0.0e+0084.46PREDICTED: uncharacterized protein LOC100266746 [Vitis vinifera][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR007863Peptidase_M16_C
IPR011237Peptidase M16 domain
IPR011249Metalloenz_LuxS/M16
IPR011765Pept_M16_N
Vocabulary: Molecular Function
TermDefinition
GO:0046872metal ion binding
GO:0003824catalytic activity
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0006508 proteolysis
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0005739 mitochondrion
molecular_function GO:0003824 catalytic activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0008233 peptidase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cucsa.178770Cucsa.178770gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cucsa.178770.1Cucsa.178770.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cucsa.178770.1.five_prime_UTR.1Cucsa.178770.1.five_prime_UTR.1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cucsa.178770.1.CDS.1Cucsa.178770.1.CDS.1CDS
Cucsa.178770.1.CDS.2Cucsa.178770.1.CDS.2CDS
Cucsa.178770.1.CDS.3Cucsa.178770.1.CDS.3CDS
Cucsa.178770.1.CDS.4Cucsa.178770.1.CDS.4CDS
Cucsa.178770.1.CDS.5Cucsa.178770.1.CDS.5CDS
Cucsa.178770.1.CDS.6Cucsa.178770.1.CDS.6CDS
Cucsa.178770.1.CDS.7Cucsa.178770.1.CDS.7CDS
Cucsa.178770.1.CDS.8Cucsa.178770.1.CDS.8CDS
Cucsa.178770.1.CDS.9Cucsa.178770.1.CDS.9CDS
Cucsa.178770.1.CDS.10Cucsa.178770.1.CDS.10CDS
Cucsa.178770.1.CDS.11Cucsa.178770.1.CDS.11CDS
Cucsa.178770.1.CDS.12Cucsa.178770.1.CDS.12CDS
Cucsa.178770.1.CDS.13Cucsa.178770.1.CDS.13CDS
Cucsa.178770.1.CDS.14Cucsa.178770.1.CDS.14CDS
Cucsa.178770.1.CDS.15Cucsa.178770.1.CDS.15CDS
Cucsa.178770.1.CDS.16Cucsa.178770.1.CDS.16CDS
Cucsa.178770.1.CDS.17Cucsa.178770.1.CDS.17CDS
Cucsa.178770.1.CDS.18Cucsa.178770.1.CDS.18CDS
Cucsa.178770.1.CDS.19Cucsa.178770.1.CDS.19CDS
Cucsa.178770.1.CDS.20Cucsa.178770.1.CDS.20CDS
Cucsa.178770.1.CDS.21Cucsa.178770.1.CDS.21CDS
Cucsa.178770.1.CDS.22Cucsa.178770.1.CDS.22CDS
Cucsa.178770.1.CDS.23Cucsa.178770.1.CDS.23CDS
Cucsa.178770.1.CDS.24Cucsa.178770.1.CDS.24CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cucsa.178770.1.three_prime_UTR.1Cucsa.178770.1.three_prime_UTR.1three_prime_UTR
Cucsa.178770.1.three_prime_UTR.2Cucsa.178770.1.three_prime_UTR.2three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007863Peptidase M16, C-terminalPFAMPF05193Peptidase_M16_Ccoord: 361..598
score: 7.1E-41coord: 929..1152
score: 3.4
IPR011237Peptidase M16 domainGENE3DG3DSA:3.30.830.10coord: 184..404
score: 5.4E-55coord: 751..979
score: 5.1E-33coord: 1070..1207
score: 5.3
IPR011249Metalloenzyme, LuxS/M16 peptidase-likeunknownSSF63411LuxS/MPP-like metallohydrolasecoord: 753..973
score: 4.33E-20coord: 193..402
score: 1.34E-38coord: 1071..1211
score: 4.67
IPR011765Peptidase M16, N-terminalPFAMPF00675Peptidase_M16coord: 212..343
score: 1.6
NoneNo IPR availablePANTHERPTHR11851METALLOPROTEASEcoord: 449..489
score: 0.0coord: 515..662
score: 0.0coord: 681..1255
score: 0.0coord: 122..420
score:
NoneNo IPR availablePANTHERPTHR11851:SF152PROTEIN UCR-2.3coord: 681..1255
score: 0.0coord: 515..662
score: 0.0coord: 122..420
score: 0.0coord: 449..489
score: