gCucsa.078830.2 (gene) Cucumber (Gy14) v1

Name: gCucsa.078830.2
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Armadillo/beta-catenin-like repeat protein
Location: scaffold00793 : 1321954 .. 1338130 (+)
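
The Location field lists the scaffold, the start and end coordinates, and the strand. The following is a minimal sketch of how the span length could be derived from it. The function name `parse_location` and the regular expression are illustrative assumptions, not part of the database, and the coordinates are assumed to be 1-based and inclusive, as is usual for GFF-style records.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its four parts.

    Hypothetical helper; assumes 1-based inclusive coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = match.groups()
    return scaffold, int(start), int(end), strand

scaffold, start, end, strand = parse_location(
    "scaffold00793 : 1321954 .. 1338130 (+)"
)
# With inclusive coordinates both endpoints count, hence the +1.
span_length = end - start + 1
print(scaffold, strand, span_length)
```

Under the inclusive-coordinate assumption the feature spans 16,177 bp on the forward strand.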
The following sequences are available for this feature:

Gene sequence (with intron)

CAATTTGAAACAGGGCAATACCACCACCCAACTAACGAAGAAGAAGAAGAAACCACACTGCTTCAGCTACAGACAGCATATGGCGACTGAACTCGAAGAATTGATACAGTTTCTTTCCTCTCCTTCCCCTCAAGTAAGTTTCTTCGCTATTCAACCATTCTCCATCTTTTTCGTTGTCGATGAAGGATTTTCGGACAGACATTTTCTATTAAGTCGATTACCTGGTCAGATGCAGCTTATACTGCTTCAATTTTAATCAAAATCCATGATTTTATTGCTTCAATTTCAATTTACATCTTCTTGCTCTGTTTCACTTGGTTAGGATTAGGTGATTCTTTTCATGCTTCAAGTGTTTGTGAAGTTATATTCCACATTTGTGACAGTTTTTCCTTCTTTTTAACGAAAGTTGATTGTGGGAAGTTTCTTTATATGGTGTGGGAATACCGTGTTTGCTTCTGTTGATTGCAAATCTCGTTTCCCTGTGATCACAACTTGGTTTTAGAATTTTACATATTATTGACAGACTTCCATTTTTTGGCTCAATAGTTGAGGAAAGCGGCCATAGACATTGTTCAAGGTCTAACTGGGTCTGAGGATGGGATGCAGTCTCTTGCCAAGTATCCCGACGTCCTGCTGCCATCGTTAGCTCGACTCTTGAAAGAACAAAAGGTGAGCTTTAATCTATCATCTTCTTTATAGACAAAAGGTTATCAACACTGTTTCTATATGTATACTTTGTATTATGTTGGATGAATGATTTTTAATTAACATATCCATTTGTACTAAAGAATGTATTAGCAAACAATTGAGCATCTCGGAGGGTTTTTATTGACCATGCAGTCATAATAAACATCGATTTTATTCTTCCCAACTTCCTGTTAGCATGACACAGTGCTAAGCTACCTGATTATTTGATAGGCAGGCAAAAGTTGTCATAATGTTTTCTGCTAATTATATTTATGGTTTAAATATAACTCATGTTTCACACTAATGTGATTTAGCATTTCAATTTCATTTGCATCATTAAGAATATGGCTTTCTGTTGTAAAATGTATTTGATAATAGATATAATTATAATCAATTATAGGAATTTGGCTATCCATTGGTTCGGTATGGTTCAAAATATCTGGAAAATGTGACTCATAGAACATTTCCAATTCTACAGTGAGATATATTGTCTATTATTTCAGGATGTTGCAGAACGTGCAGCTGAAGCTCTTGTAAATCTTTCACAAAATTCTGATCTGGCTGGAAAGATGATAAACTTGGGACTGATTGGAGAAGCTATGAATCTTTTGTATAATGTGGACTCTAGCATTAGCCAGTTGCTAGTAATGCTTCTGGTTAATCTAACACAGTTGGATGCGGGTATTGCTTCATTACTTCAGGTGTGCCACATTGCTAGAGTGGCCTTTTCTTCTGTTGATGTTCAGATGAGCTATTTTTGTTGAAATTTGTTTGGGAAAGTTTGAATGCTATAGTTTAAGCATTTCCTTTGCTAATAGCTTAGTTAGACTTTTGTGATCTTTATGAAATGAAAAAaTAGTTCTTATGGTTATCAATTCTGCTTCATAGACTGGAGATGACAAGATGCAAGGTCTCTATGTCATGAAGATTGTCAGATCATTTTGTCGTTCAGCAAGTGAATCTAGAGGTATGCcTTGCTTCTTCTCTAATGCAGCTTGCTTTGGACAATATTTATTTTTCAAGAATATTGGGCTCTATATTTATAATTTAAATTTTTAATATCTTTGGTTGCAAGGTTTTTTGGTTATCTTTGCTTCTTAAAGATCTCTATGTGTAGTGGTGTTACCAACTTAATTTACTGTGCTGCATTTCGAAATGAGTATTTAGTCCTTCTAGAAAACAGAAAaaTATAATAATAATAATAATTGCAATGCTTCAAGGCCAAATCCTCTGCAGCCTTAACCTAACATTGGCATCCTGTCATTTACAGTTTATCACATTTCAACATTAATCATTTAAAGTTTTCAGCGTTC
ATTTGACATTTTAAATTGGTTCTCTACTAGATGATCCCTTTGAGCACGTCGCTTCCATACTGGTAAATATATCAAAGAAGGAAGCTGGAAGGAAGCTTTTGCTTGATCCTAATCGGGGACTTTTGAAGCAGATAATTAGGCAGTATGATTCAAATAGTCAACTGCGGAAGAATGGAGTAAGTCCAATGGATATCTATTTTTAACAAGAAACAATTTTTCATTGCTATCGATTAATGTTCAAGAATACAAGCTTCCAATGACGACTAAAAAGACAACCAAAAaTAAAAAGGCCTAAAACTAGGTTACATATTGTTTGATTTACACGATTGGTACTATATTATTACTTTTACATTCTTTGGTAATTTTGGAAACTAAGCCATGACTGTGATTGGTTAATTACTTATTCCAGCTCAGATGCCAAGTATATATGTATTGTTCAATCAGTTACACTTTCGATATAGTCAAAGTATATCTTCAGCCTATGGGATCACCAGGCTCATGTCTTCATGCAATTTTCTTGTTTAAGGTATTTGGAACTCTCAGAAATTGCTGCTTTGAAGCAGAAGATCAACTACAAAATTTACTCTTGATAGCTGAGTTCCTTTGGCCAGCTTTACTTCTCCCTGTTGCTGGTAACAAGGTAATATTTTTTCGTCTTAGTGACCAAAATGAAACAAGTTGGAACAAATTAAAAAGTTCAAAGACCAAAATTGATATTTTCAAAGTTAGAGACGAAAGAAACAAACTTGAATTTTATTATTATTAGTTTTTTAGTTCAACTCATGGGTTGGGGGATTCAAGTCTCTTTTATTTACTTTTTtCTTTATCCTTGAATTTTAATCGATTTTTCGATTCAACAATAATTGTGTAAGTTGGGATACCATGTTAATTCTAGTTTATTTCATTGTCTTTAGTCTTTTTTGATATGTTCCTATTTTTATTATGAAATCAGGGATCAGGAGACCTTACTCTTGTAAAAAAaTCCTCTGCTCTTACCTATCCATTCTCTATGAAGTTTTCAACTTATGTTTGCAGTTTATTATTTTGTTTAAACGTCTCAAATTGGATTTTTTTtCTTCTTTTAACTGGAGCCTGGTAAATTCTTAGAAGTTTCATAGGTCATTCCAACTTCAATTATTAGGTAGCAGCTGAAGAGATGAACCTTTATTTTATTTTATAGTCTTTAAGAGTTCTCTCAACCAGTAGACTACAGTTCAGAAGAAAAATAACTAGTATTGTTCAATATTCTAGACTTCTAAAAGCTGGGTTCATTTTTTACTCAGCTATTAACTGTTGGAGTTGTATTCTAGAAGGAGAGTGATAGACCACCAATTATTCTACAATATAGTAGGAGGAGAAAGGGAGAAGTTGCACATGCACAAGAGAAGGAAAATGAAGGAGGAAGTGGAAACCACCTTAGTGGGGACCACGGTTAGTTACATGGAAGGGAGTTAAAAGGGAAAGATGGAAGGGAAGATAGGGGTGGTTAATCAGTTTCTGGAGAGAAGAAAAACAGAAGTTCAGTTACTTTAATCTTAGAAAGTTGTTTTCCATTACCGTAATTTACTTTTGATGTTTGGATGTTTTATTTTGAACTTGGTTATTGTCTGTAATAGTTCTTGTTTGTTCTGTGTTTTCAAGACTGGTTTGAGTATTGTATTTTGTTATAGTTTAGTTGCTTAATGAACAAATAGAACACACTACTTGGTGTTCTATCAGAGAGGAATTGGAAGATGCTGTCTTATTGAATATTGGATTATCATTAGCAATATGGGCTCTTTCTAGGCTACCCACCCTTTCTAACCAGTATTTCGATTACAACAGCTCTTTGATTATTCCTAATTGGGGGTAATTATGTGTAAGCTTCCTTGGCTCTAACTATTTTGGTTGGTGTTAGGACTAGGTTCCGTTTTTGGAGTTAGGGTGTTCCATTTCAAGGCAGTTGCAGCTAGGGTTTCTTATACAGTAGAAAGGCCATCAACCTAGAATCAAAGGCAATG
TTGTTATCAGTGTAAATAGAACTGCCCCTACTATTTACATAAAATACAGAAAAAGTCCATCTTTTTGGGCTCTAATGAATTGAAGGATCTTCTCATCACGTGTAGCCATAATATGTTAAACGTTTCCTGTTTGGATGTCTGCAACAATAAAAATTCACCATAGAGGCCCACATTGATCTAATAATGAATAAGTAAATAAGTAAATTTCTACTAGTTATGAATGAAACCATTGAAGATTTTGAAAACCACCTGCACATGCTCTTACCCCTTTTTGCCTTCGCGAGGGATCCTTCATATGGAGGATTGTAGTCAGTTTCTAGAAATATTTGCATGTTTTCACAAACTATTTACTCTTTTTGATGGCATGTTTCTATAACTCCCCACATTGAAGTAGAAAATGATTTTATCGACCTCCTTTTCTCATTAACTTTTTACAAAATATATTTGATTACTCACATTTTGAGTAATTTTGTGATGTGCCAAGTTCAAAAGATATCAAATAGTGTTAAGCTTGCTTCCTCACATCTTGAGTAATTTTGTGGGATAACTACTGAAACCTACAACATTGGTTCATTAAGGTGATTTCAGCAAGAAAATATCCAAACAAAAAGACAAGAGAATACCCAGCACCTTATGGAGGGCTGAGAACCTCTCCCCTTAGCTTTAGCAGCTTTTTACACAAAATAAACCCTACCTACAGCAATCCCCATGTCATTCTATTTATCCTCTCTTCTACCCGTGGGCCTTACCTATCAAGATCTTTTTCCCCAATCATTTCCCCAGCATATATATTCTGGTTAATGACCTTTCTGCCCCTCCTTTTACATTTGTGGATGATTGGCGGCCTAACAGTACCCCTGGGTTCCAAATTCACCTTGTCCTCAAGGTGAAACGTCGGAAATTGTTGGTTCATTTGGTATCCCGACTCCCAAGTGGCTTCACTCGCTGGCATGCCCTTCCATTTGTCTAGCCATTCATTAGCCCCCCAACTCTCTGCTCCAGCGTATCCCCATAACTGTCTTGGCCAAAACTCAGAACTGCTACCTTAAACTTATCGGATTCTGGTAAGTCATTAATCTAAAAAAAAaTGTTACAGTCTTGTACACCCAAGACTTTGGGTTTTCTCCCACGAACATAGGCATCTCAAGCTTCTTATACTTGCTTTGGTCCACTGTTACTGTTCCATTGTCCATCACGGTCTCCAGACCACCCATCTTTCTATTCAGACTTCAGTTTCAATACGGAACCATCAGATGTGCACGAATCGTCTTTGCGTCTATGGCTATTGCTTTCTCTCATGTCTTCTGCCATTCTGTCGACCACCATCTTAAGTTCAAGCATCATCTCTTTCAAGCCCAAAACTTCCTTTtCCGTCCCCTCTAATCTCACATCCATCTGTTTGTGAGCCATCAAGCGCATCACCACCCCCAAATTTCCGGGCTCTGATACCAATTTGTTAGGATTCTACCCTTAAATAACTCAATAATCCTAACAAAAGGCTAAAACCAGAGGTGGCACTCTATTAATTATATTACTTAACCAAAATCAGATGTAACAAAATGATTTCAGCAAGAAAATATCCAAACAAAAAGACAAAAACATACCTAGCACCTTAAGGGTAGAATCCTAACAAGAGGGCTGGAAACCTCTCCCCTAAGCTTTAGCAGCTTTTTACACAAAATAAACCCTATCTATTGTAATCCCCATATCATTCTATTTATATCTCTCCTCTACCGGTGGGGCCTCCTCTACCTGTGGGCCCCACCTGTCAAGATCTTTtCCCCAATCATTTTCCCCATCATTTATATGCTGGTTAATGACCTTTTTGCTCCTCCTTTTATATTTCTGAATGATTGGCGGCCTAATAATATCCTAGGTAGGTGGTTTTACAATGAGATTTGAACCCATACTCTCAAGTCCCTTTTAAGGCAATGTGGTTAATTCTTTTATGTTTTTACCATTGTAATATTTTCTTTCATCATATTCTTGGAAGTTGGAAT
AATACAGCCTTAGATTTAAAAGAGAAAAGATTTGTACGAGGAAGGAAATAGAGGCAATAGTAAAAGGACAATACAAGTCAGCTTTGAAGATAGGTATAGAAGGAAAGAGGAACAAAGAGTATTTAATGAATATTTTAAGATATTCTTAAGGGTTAGCAACCTTGATATTCCCCATCAACAATTCTCATGGTTATTGCATTTATATCATAGCATTTCTGTGAGCAGGTTTATAAGGAGGAGGATACATCAAAAATGCCACTTGAGCTTGGAACTGCACTTTCAATTGAGCGTGAGCAAATTGATGATCCTGAAATTCGAGTTCATGCATTGGAAGCCATATACATGATCATATTACAGGTTTCAGAGAATTCCATTTCTTTTGAATTTAATGAAATGCCTTTCCTCTTGAAACTTAAGACTTTTTACCTCTTTAATTATTTAAACAAACAAAAACTGAATCCAAATGAGATAATTTATCTCCTGATATACACTCTATTGGGCTACTATTTTGCTGATGAAAAAGGATTATGTGGATGTATCTTATTTGCTTTCTAATGGATTACTTGTAAGGAAAGAAATAACTGCCTCTTCAATAAAAAAAaCACTTTTGATACTGATTTTTAGTATATATATTATCTATCTTACTATGCATTGTCAAATGTACATATTTGTATAGCTTTTATAATCTCTCGTATCTATTTTGAACTATGGTAGATATCTTTCGTAACCCCTTTGGTACAGTCCATTGCCCcTTTTTTTtGTATATTGTTCTTAGTGAAATTTGTTTCTTATATTAAAAAGAAAATAAATAATTACAAATATTAGCAAAATCAAGACAAAAATTACAGCATAATAGCCTTTTGGGAGGGATATCTTTCTCCGAAGTCACACACAATGGAAACTCACCCCAAGATGATCCACCCCTTCTAGGTCCCTTACACTCAGCCTTATAACCAAGACACCCATATGTAATTACCATGATGTCCTTAGCTTGCTAATAATGTTCTTCTAATCTTGCCCATATACGTCTAACTAGTACTCTCACACATGACATTTTCATGTACTAAGTAAGATGTAACCCAACTTGAGGTTAATTTTGGAGTAGATATGCGATCCTTGTGGCTTAGCTAACAATTCCTCCACAATGAGGATGAAAAATTTGTTGGGAATCGTTGCATTTTTTATTGTTCAAAGCACGAAAATCAACACAAAATCTTCATGCCCCATCTTCCTTACCCAGCACAATAGGTTTTGGAGTAGCACATTGGACTATATGATGTTTGAAGTCAACATTTCATTTACAAGTCATTCAATTTCATGTTTTTTTtGGGTGTATGAGTATTGATAAGGTTGTACATTAGAAGTGTTAGAGTAATGGGCTTGGGCCACAGGAAGGTCTGCCCTAAATTTATGTCCTTTTTTAACTCATTGTATTATTCTATTTATTCCCCCCTCTTGTACCTTTTGTAATTATCAAAAAATAATAAGAGAAGTAAAATCTTGGTTTTTCTCCCAGTACTTGGGTTTCCACATAATTCGGTGTTTTTTTGTTGGTTTATCACTTTCAATATGGTATCAAAGTTAGGTAACAACAAAACCCTAGACATCTATATAGGTGAAGCCCAAATAGAGACGGAAGATGCCGTCGCTGCCATGGAAAAACTCCTCCATTAGATTCAGAGGATGCCGGCAGTCACTTGGGGGCCACCCTCAGAGCAGTCCGCACAACCCAGTTTCGGGAATGAACCTCACGCGCCGCAACCTGTCGCCCACGCATCACAACTACCAGGTGCGTTTGCCCACACGCCGCCATCGAATACCTCCATGAACCAGCCTGTCTTCTCTCCTTTGTCATCCCCAAGTCCAGTCGCCATATGCTTCCGACAATCCACACCCCCATGCACTGCTGCTACCCTTGGTTTATGGACTGCCCGGTTCACCCACCGTTGCCTTTTCAGACTTTTCAGCAGCCACATATTCATGGTTCCAGGATTGATTGA
GGGTATACTCAGTCTGGTTTTTAAGTTGGTGGATCCTCGGTGCAGTCTAAATCAATCTATCTACCGATGTATTTGAAGGACCTGGTAACTTCATTCTCTAATTTGTCTTCGAATTACATATCTGGTTCTATAGCACCCTTTATAGGGAATTTTCCTGAGGAGAAATTAAATGGTCAGAATTATTTTTCTTGGTCCCAATCGATCAAGATGTTCCTTGAGGATCACCACCAATTTGGCTTTTTGAGAAGGGAGACTATTAGTCCTCCACCAGATGATGCCTTGTAACGCTTCTGAAAAGGGGAGGACTCCTTCATTCGGTTTATGCTGATTAATAGCATGGAACCACAAATCGGCAAGCCTTTACTTTACGCAGTCACTTGTAAAGATTTATGGCATACAACTCAAAAACTTTACTTGAAGCATCAAAATGCCTCTCGATTGTATACACTGAAGAAACAGGTCCACCACGATTGCAAGCAAGGAACCCCGGACATGACCTCCTATTTCAATAAGCTCTCCCTTCTCTGGCAAGAGATAGATTTGTGTAGGGAGACAGTTTGGAACACGCCAAATGATGGTATACAATATGCTAAACTTGAGGAAGTTGACCATATTTATGATTTCTTTGCAGGACTTAATCTCAAATTGATATTGTATGTGGTTGTATGCTTAGACAGAGACCACTTCCCTCCTTAATGGAAGTGTGCTATGAAGTCTGTCTTGAAGAAGACCGTACGAATGCTATGAGTATACAGACGGCCTTCAGTGCTCGATCCTTTATCCATGACAGTGAGAAGAATAGCGGGAAACCAATCCCTGTATGTGAGCATTGCAAGAAACAGTGGCACACCAAGGATTAGTGTTGGAAACTCCACGGTCGTCCCCCAGAAGGTAAGAAACGTTCCTCCAACGAGAAACATAACTCAAGGCATGCCTATATAAATGAGACTGCCAACATCTCTCAGCCTACTAGCCCCACTGCAAGCCAAAACAACCCTCCTACTCTGTGAGCCATTTCTCAATCAGCTATGCCTCAGTCCCTTTGCCTTATTAGTGTTGATGGGAAGAATCCTTAGATTTTAGACTCGGGGGCTACAAATCACTTGACAAGTTCTTCAGAGAATTTTGTCTCTTATACCCCTTGTGTCGGTAATGAGAAAATTCAGATAGTTAGTGATTCTTTAGCCATGATTGCTGGCAAAGGACAAATAGTTCTTTTTGAAGGACTCTCTCCAGAATGTTTTGCATGTGCCTAAGCTTTCTTATAATTTGTTATCTATCAGCAAGATCACGTGAGCTGGCATTGCTCGACATAGTAGGGGACTTTACATCTCCAATAGTAGTATCTCTAGAACTAATTTACTGTCTTCCTATTTTAGTATTTCTAAACATGATGCTATGTTGTGGCATTTTCGGTTGCGCCACCCGAACTTTAATTATATGAAATATTTATTTCCCCACCTGTTTTCTAAAATTTATGTCTCCTCTCTATCTTGTGATGAGTGTATTTGAACAAAACGTCGGGTTTCATTTCCCTCACAACCATATAAACCCACACAACCATTTACCTTTATTCATAGTGACATTTGGGGCCCATCTAAGGTCACTACCTCCTATGGAAAACGGTGGTTTGCTACTTTCATTGATGATCATACCCGCCTTACCTTGGTCTTCCTTATCATAGATAAATCTGAAGCTTCCTCTGTTTTCCAAAACTTCTATCACACCATTGAAACACAATTCAATACAAAAATTGCAATTATTCGGAGTGATAATGGTTGGGAATTCCAAAACCATAATTTAGTGAGTTCCTAGCCTCCAAGGAAATTGTTCACTAAAGCTCGTTCACCCACACCCCTTAACAAAATGGGGTGCCTGAGCGAAAAAACTGTCATCTTTTGGAAGTAGCTCATTCTCTTATGCTATCTACTTCCCTTCCTTCATATCTGTGGGGAGATGCTATTCTTGCAGCAACTTATCGAATCAACAGAATGCCTTCT
CGTGTTCTCCACCTTCAGACTCCCTTAGATTGTTTCAAGGAGGCATACCCTTCTACTCGTTTAATTTCTGAGATTCCCCTTCGTGTGTTTGGGTGTACTGCATATGTCCATAATTTCGGCTCTAAACAGACCAAATTTACCTCTTGGGCTTAAGCACGTGTGTTTGTTGGGTATCCCCTTCATCAATGCTATTATAAATGCTTTCATTCTCCGTCCAGGAAATACTTTATCACTATGGATGTTACTTTCTGTGAGGACCGAACCTTCTTTCCCATTAGCCATCTTTAGGGGGAGAGTGTGAGTGAAGAGTTTAACTGTACCTTAAGGTTTATTGAACCTACTCCTAGTACCGTGTCTTACTCTGATCCTCATTCCATTGTCCTACTCACAAACCAAGTTCTCTAGAAAACGTACGAGAGGAGGAATCATACAAAGGAAATTGGGTCCACTACTAGTCTGCCGGCTCCCTGGCCCGAAACTCTGAACCTTCTCAATATCAAGGTCTGGAAAATCCTACCAAACCTTGTCTTGATAATAAGATGAGTGAGAATGACAAGTCCGATGTTGTTGTTCCTGAAAATGTGGAAGAAAAAGACACTAGTGATAAAATTGAGGTCAGAGCATAAACTAGTAGTAATGAAGAGGGACATTCGGGGAAACTTGATGTGTATGATCCTTCCCTAGACATTCCCATTGCACTGAGAAAAGGTATCAAGTCTTGTACAAAATATCTCATTTGTAACTATGTCTCCTATGATAATCTCTCTCTACAGTTTAGAGCTTTCACAGCGATCCTTGATTCTACCATAATATCGAAAAATATCCACACTGCTTTAAAGTGTCCTTAATGGAAGAATGTTGTCAAAGAAGAGATGAAGGTTCTTGAAAAGAATAAAACAGTGGGATGCAAACGAGTGTTCACTCTCAAATACAAAGCAGATGGAACACCTAACAGACACAAGGCAAGGTTAGTTGCAAGTGGTTTACTCAAACCTATGGTGTTGACTAGCAAGGTTAGTTGCAAAGAGGTTTACTCAAAACTATGGTGTTGACTACTAAGAAACTTTTCCCCCAATTTCTAAATTGAATATTGTTAGAGTCATGCTATCTGTTGCTCTGGAAAAAGATTGACCTCTATACCAGCTGGATGTTAAGAATGATTTTCTGAATGGAGATCTACTGGAGGAAGCCTACATGAGCTCACCTGGATTTGAAGCCTAGTTTGGTCATCAAGTGTGTAAACTTCAGAAATCCGTATATGGTATAAAAAAAGTCGCCCAGAGCATGGTTTGACAAGTTCACTACCTTTGTTAAGTCCCGAGGGTACAATCAAGAGCATTCAGATTATTCAGATCATACTTTGTTTACAAAGATCCCCAAGACTGGGAAGATCGCAGTTCTGATTGTGTATGTTGATGACATCGTTTTATCTGGAGATGATCAGGCAGAAGTTAATCAATTGAAACAGAAAATGTACGATGAATTTGAAATCAAGAACTTAGGAAATCAAAATTTCCTTGGAATGGAGGTGGCCAAATCTAAAGAAGGCATCTCTGTATCTCAGGGAAAATACACCCTTGATTTGCTGACCGAGACAGGTATGCTGGGATGTCGGCCTCTGACGCCTCTATTGAATTAAATTGTAAACTAGGAAATTCAAGTGATCATGTTCTAGTTGATAAAAGAACAATATCAGCGCTTTATGGGTAAATTGATTTACTTATCCCATACTCGTCCAAATATTTCATTTGTTGTGAATGTTGTTAGCCAGTTTAAATGGGATATTTCCTTTGATGTTAAAAAAGATAGACAGAAAAATGATTGAGACCTATACTGATTCAAACTGGGCAGAATCTGTTGTTGACAGAAAATCTACCTCTAGTTACTGTACCTTTGTATGGGGCAATCTTGTAACTTGGAGAATTAAGAAGGAAAGTGTTGTGGCTAGGAGCAATGGGGAGACTAAATACTGGCTATGAGTTTGGGGATATGTGAGGAAA
TTTGGCTCCATAAAGTCTTGTCTAATCTTCATCAGGAGTGTGAGACACCAATGAAGCTATTTTGTGATAACAAAGCAACTATTAGTATTGCTATCAGTCTAATTCAACATGATAGAAACAAACATATTGAGATTGATCGGCACTTCATCAAAGAAAGATATGACAGTGGGAGCATATCCATTTTGTATGTTCCTTCGAGCCAACAGGTTGTTTATGTTCTCACCAAAGGGTTTCTCAAACTAAACTTCGACTTTTGTGTTAGCAAGTTGGGCCTCGTTGATGTCTATGTCCCAACTTGAGGGAGAGTATTAGAGTTAGTGGGCTTGGCCCATAGGAAGATTTGTCCCAAATTTCTTTTCTTTTTTAACTCATTATATTATTCTATTTATTCCTCTCTTGTACCTTTTGTAATTATTAGAAAATAATTTGAAAAGTAAACCTGTGGTTTTTCTTTCAGTACTCAAGTTTTCACGTAATTCAGTGTTTTCGTTGGTTTATTGCTTCCAATACGAAGGAAGCCCTCTACTAAATGGATATTGTGGTCAATATCCCAATGTGGAGATAGGGCTTCCGAAAAATCTCAAATGCTTTCATATTTATTCAATCAGTAAGCAATTATTATGCTGTTTATAGAACTAGTGTTTGATTCTCTTAGTGAGTGGCCCTTCCAAAGCTCTGAACTCCACAAGGTACTCTTGTCACTGTCTTCCCACGACTTCGTCATTTTCTTTAATGTGACATTGGCTTTCAGCCTTTCATTAAAGCAGAATCACCCTTCAAAGTCATTTGGCGTGACCCTTGCCAACCTTCATTGTCAATAATTTTCAATCAACTTCCGTCTTGCCCAAAGTATGGAGCTCACCAACTCCTAATTCAAATTTCCTGCTACTGTTAGGTCCTTATTGTTGAGGACTGTGATCTTGCAAACTCCCTCCTTTTCCACGACTAACCGTGCTCCCATGATCACTTCATAATCTGCCATAGTTGTTAGGAGTGCATTCAGCTCTCGGGTGACTTTCTAAGGAATGAAGTTTTATTCCCTCCACACTTGATCAGTTTTATCACCTCATTGTTTGGGCTGGGACATTTGTTCACATAATTGTGCACATGGGGTCATTTCCAGCAGCATCTTACTATTATTTCTCACTGTTGAGTGCGCTTTGGGTTTATCACCTGGCCTTACTATTTGTATTTGGGATATCTGCTTTCTTTAACACAACCCAAATGTGTTCTGTCCTCTACATGTTGGGCTGACTTCATCATGTCTTCTAATCAATGACCTCCAACACTGCCCATTTAAGTGAGAACAACGGCCACCGAGGTTTTAAACATATTCTGGTACTTTTCTAGAGCTTAATTGCTAAGAATCTTCCACACACACTCCCCTCCTTGGATGTGCGAAATCGAGTAACATGATTTTCAAGTTGTTTCGGTCTTTGAAGGATTTTTCTTCCTTTGTACCAATTTAACACCATAGCTTTGAAAGTTACTGCAACCACAATTAACTTTTAAGGATCTGTCAAGTTGTCGATTTGTGAAATTGGATGTACTTGTTGCCGGCTCAAAAGAGTTGTTGCCTCCAAACACTGACATCTCTACCTTAATGGGAGGACAACTTTCTATTGATCCTATAAGAAGTTAACATAAACTTTTGATTGGTTTGGAAAGAAATGAAAAACTGGTAAAAAAAAAaCCAAAAaGAAAAGATTCTGATAAACACCACCAGTCCAATTCAAATATACTGTTGAACGGTAGAAAAGAGAGCCCAAGGTTTACGTAGAAAAGCCTACAACCGGGAGAAAAACCACGATAGAGAGTCTTTTTATTATTTTCACTCACAATGAAAGTTACAAGAGAGGACAACTTTACATGCTGTAGCAAGCCCTAATACAAAAAAAaGAAaTAAAGTTAGGATAAAGTAAAATACTAAACTTCCCTCGAGGCTATAAAGTATCAACAAAGCCCCTTATTTCCAACGGATACAAAAAaTGTAAAACAAA
GCTTTCTCTAAGTGATTGGAAAGAAGCCATCTTAAAATGCAAGGAGATCCGACAACAAAGGAATGGATGACCTGTCAAACTAAGAATCATTGGGTTGAATCAGATCCTATTAAAAGGGTAAGTATGCTCCCTCCTCTCTCTCCTCTTGCCTCTCTCCCCTTTCCTCCTTCATCCACTAGCTCCAGCCCGTTTCCTCATCCTTTCTCTACTCATCCGGCCACCATTTGCTTCAGAAGCTCCAGTTCCTTAAGTCAATGGAAGCAAAAAACTGCAAAATCAGATCCTATTCAGAAGCTTAATACAAACTTGCCCACAAAATATACTTTTGTAGAACAACAAAGAAGACTTTGTTTCCCTGTATATTCTCTTTCACTTTTTCTTAATGAAAGTTGTCATTTTCATCGAAAAGAAAaCCAAGAAGACTATTTTGAAATTTTTAAAAACTAATAATTTGATCCCTGAAGTTTAGAAAATAGTTTTAAATCGTTTTGAGGTTAGAACATAACGATATCAGTTAGTAATCTATTAATGAATTAGAAGGATGAGCAAATTCTCATATTTTGACCCTTTTCATTTTGTTTGTCATCTTACTGTCAAAAGTGTACGTTTTTGGTTCAAATGTATCATAATTTTTTTTTtGCTCTTACCAGGATGCTGGGCGGAGAGCATTCTGGTCTATCAATGGACCAAGAATACTGCAAGTGGGGTATGAAGACGAGGAAAACCCAAAAGTAATGGAAGCATATGAGAGAGTTGGCTCACTAGTAAGTGACATTCGATCTTCACTAAACTATATCTCTAAATCTAGCCAACATTAGCTTAACTCAACTGACACATGTTTGTACTCGAGGACGAGAGTTCTCAGGTTCAAATCCCCCAACCCCAAGTTGAATTACAATACCTTTTCAAAATATATATGTATATATCTAAATCTGGTGTTATTTGTCTGTCTCCTGTAAAAAATCCATACCAGCTATTTTAATATTTATATTAAACTTTTCTTGATTAATCATTCAGCTGGTTAATAGCGGTGGCGACGAGGAACCACACGATTGAGGATGAGATGTTCAGGTATGTGTTAGTTAATCTTTACTTTTCTGGAAAGGGGTATTCTTATGAATCATGTGCTAATTTAATTCATTTTATCAATCTCTCTTGTACAAAGGAGAAAGGAGGAAGTGCTTTTAGTTTTTAAGATTAATTTTCAGTCAATGTTAGAGTTTATCATATAGATTTGTTTTCTGAGTTGGTTATACTATTTATTTAGGGGGTGGTTGGTGAGTCGTAATGGGACATGTTTGTATTGTAGTGTTTCAATTGAGCACTTTGTGTCCAATTTGTAATATAAAACTCATTTCGTTTGGAGAGTCTTCAAAATAATCCTATTTCTCACTTTTTTCACATCTTACAATCTTATTTTCGTTTCATCTTCAATTACTCATGCTTTCCAACACTCCTTTAATTTCTAGTAATCCTGATTGCATTCCAACTCTCCAAACACTCCCTTATTCATAATAAATGACTAAATTTACTTGACTTGTCATTCTTTTCTCATATTTCTTTTCTTACTAGGTAATAGAGAAACAGATTTAATCAGATTTTTTAATGGTTTAAATTAAATTTACATAGAAAATAAACGAATTGATGCTAAAATTGCATATAAGAGCAATGTTTGTGGTATCATAATGTTGAGTTTGAAAGATTGCATCTGGTGAAACTTATGCACTTTGACAGGCAACTTTGTTGGCGATTCGTATCCTGGTTATCAAGTGCTACTTGCGAGAGTCAGAATTGCTATAGATTAACATCTCAAGTGCTTAAAAACAAATATCTTTTTACTTTAAAATATGGGCAGCCAGAGGAGGAATAGGAGAAACAAGATTACGTGCGTGGTACCATAGTACATCTGTTTCGGGTATAATATTTCTTTTAAACACTTAGAAAGTGAGATACTAATCCGTGAGATGATCATGAATGAATAATGTAATTTATTTATAGCC
TAAGGCTTTCAATCATGTTTTTGAGTGTTAGCGTATTGATTTGAGAGAACTATTTTCTTCGAATGTTAATTTTGTTTTTtGGTTTTTtCTTTTTCAGAAGGAAAACATGAGTAGTTTAATATAACTTTTTGGGGGTGAAAGGCAATCCTAGGAGAATTTTAACCATTTGAGAATCAC

mRNA sequence

CAATTTGAAACAGGGCAATACCACCACCCAACTAACGAAGAAGAAGAAGAAACCACACTGCTTCAGCTACAGACAGCATATGGCGACTGAACTCGAAGAATTGATACAGTTTCTTTCCTCTCCTTCCCCTCAATTGAGGAAAGCGGCCATAGACATTGTTCAAGGTCTAACTGGGTCTGAGGATGGGATGCAGTCTCTTGCCAAGTATCCCGACGTCCTGCTGCCATCGTTAGCTCGACTCTTGAAAGAACAAAAGGATGTTGCAGAACGTGCAGCTGAAGCTCTTGTAAATCTTTCACAAAATTCTGATCTGGCTGGAAAGATGATAAACTTGGGACTGATTGGAGAAGCTATGAATCTTTTGTATAATGTGGACTCTAGCATTAGCCAGTTGCTAGTAATGCTTCTGGTTAATCTAACACAGTTGGATGCGGGTATTGCTTCATTACTTCAGACTGGAGATGACAAGATGCAAGGTCTCTATGTCATGAAGATTGTCAGATCATTTTGTCGTTCAGCAAGTGAATCTAGAGATGATCCCTTTGAGCACGTCGCTTCCATACTGGTAAATATATCAAAGAAGGAAGCTGGAAGGAAGCTTTTGCTTGATCCTAATCGGGGACTTTTGAAGCAGATAATTAGGCAGTATGATTCAAATAGTCAACTGCGGAAGAATGGAGTATTTGGAACTCTCAGAAATTGCTGCTTTGAAGCAGAAGATCAACTACAAAATTTACTCTTGATAGCTGAGTTCCTTTGGCCAGCTTTACTTCTCCCTGTTGCTGGTAACAAGGTTTATAAGGAGGAGGATACATCAAAAATGCCACTTGAGCTTGGAACTGCACTTTCAATTGAGCGTGAGCAAATTGATGATCCTGAAATTCGAGTTCATGCATTGGAAGCCATATACATGATCATATTACAGGATGCTGGGCGGAGAGCATTCTGGTCTATCAATGGACCAAGAATACTGCAAGTGGGGTATGAAGACGAGGAAAACCCAAAAGTAATGGAAGCATATGAGAGAGTTGGCTCACTACTGGTTAATAGCGGTGGCGACGAGGAACCACACGATTGAGGATGAGATGTTCAGGCAACTTTGTTGGCGATTCGTATCCTGGTTATCAAGTGCTACTTGCGAGAGTCAGAATTGCTATAGATTAACATCTCAAGTGCTTAAAAACAAATATCTTTTTACTTTAAAATATGGGCAGCCAGAGGAGGAATAGGAGAAACAAGATTACGTGCGTGGTACCATAGTACATCTGTTTCGGGTATAATATTTCTTTTAAACACTTAGAAAGTGAGATACTAATCCGTGAGATGATCATGAATGAATAATGTAATTTATTTATAGCCTAAGGCTTTCAATCATGTTTTTGAGTGTTAGCGTATTGATTTGAGAGAACTATTTTCTTCGAATGTTAATTTTGTTTTTTGGTTTTTTCTTTTTCAGAAGGAAAACATGAGTAGTTTAATATAACTTTTTGGGGGTGAAAGGCAATCCTAGGAGAATTTTAACCATTTGAGAATCAC

Coding sequence (CDS)

ATGGCGACTGAACTCGAAGAATTGATACAGTTTCTTTCCTCTCCTTCCCCTCAATTGAGGAAAGCGGCCATAGACATTGTTCAAGGTCTAACTGGGTCTGAGGATGGGATGCAGTCTCTTGCCAAGTATCCCGACGTCCTGCTGCCATCGTTAGCTCGACTCTTGAAAGAACAAAAGGATGTTGCAGAACGTGCAGCTGAAGCTCTTGTAAATCTTTCACAAAATTCTGATCTGGCTGGAAAGATGATAAACTTGGGACTGATTGGAGAAGCTATGAATCTTTTGTATAATGTGGACTCTAGCATTAGCCAGTTGCTAGTAATGCTTCTGGTTAATCTAACACAGTTGGATGCGGGTATTGCTTCATTACTTCAGACTGGAGATGACAAGATGCAAGGTCTCTATGTCATGAAGATTGTCAGATCATTTTGTCGTTCAGCAAGTGAATCTAGAGATGATCCCTTTGAGCACGTCGCTTCCATACTGGTAAATATATCAAAGAAGGAAGCTGGAAGGAAGCTTTTGCTTGATCCTAATCGGGGACTTTTGAAGCAGATAATTAGGCAGTATGATTCAAATAGTCAACTGCGGAAGAATGGAGTATTTGGAACTCTCAGAAATTGCTGCTTTGAAGCAGAAGATCAACTACAAAATTTACTCTTGATAGCTGAGTTCCTTTGGCCAGCTTTACTTCTCCCTGTTGCTGGTAACAAGGTTTATAAGGAGGAGGATACATCAAAAATGCCACTTGAGCTTGGAACTGCACTTTCAATTGAGCGTGAGCAAATTGATGATCCTGAAATTCGAGTTCATGCATTGGAAGCCATATACATGATCATATTACAGGATGCTGGGCGGAGAGCATTCTGGTCTATCAATGGACCAAGAATACTGCAAGTGGGGTATGAAGACGAGGAAAACCCAAAAGTAATGGAAGCATATGAGAGAGTTGGCTCACTACTGGTTAATAGCGGTGGCGACGAGGAACCACACGATTGA

Protein sequence

MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKDVAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGIASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNRGLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVYKEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQVGYEDEENPKVMEAYERVGSLLVNSGGDEEPHD*
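
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a hedged sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the snippet below translates the first 24 and the last 15 nucleotides of the CDS. The names `CODON_TABLE` and `translate` are illustrative and not taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGGCGACTGAACTCGAAGAATTG"  # first 24 nt of the CDS
cds_end = "GAACCACACGATTGA"             # last 15 nt of the CDS
print(translate(cds_start))  # MATELEEL
print(translate(cds_end))    # EPHD*
```

Both fragments agree with the start (MATELEEL) and end (EPHD*) of the protein sequence listed above.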
BLAST of gCucsa.078830.2 vs. Swiss-Prot
Match: HGH1_XENLA (Protein HGH1 homolog OS=Xenopus laevis GN=hgh1 PE=2 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 2.0e-23
Identity = 88/325 (27.08%), Positives = 158/325 (48.62%), Query Frame = 1

Query: 7   ELIQFLSSPS-PQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKDVAERA 66
           EL+ FL   +   +R  A++ + G++G+ +G QSL   P +L   L    ++   +A+ A
Sbjct: 8   ELLSFLKPETRADVRAQALEYILGVSGTPEGRQSLCAEPRLLQVVLDLTTEQSAHIAQDA 67

Query: 67  AEALVNLSQNSDLAGKMINL--GLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGIASL 126
              LVNL+ +      ++     L+   + LL +     S      L NL++ +    S 
Sbjct: 68  HHVLVNLTSDPTTHKSLLGHVPTLLPSLLTLLQDPTCPFSDSTCTALCNLSREEESCQSF 127

Query: 127 LQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNRGLL 186
           LQT   K +GL   +++   C           +++  ++ N+++   GR  +LD +R ++
Sbjct: 128 LQTL--KQEGL--CQLLHMLCTPKYNGHAS-LDYLGPLVCNLTQLPEGRDFILDRDRCVI 187

Query: 187 KQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVYKEE 246
           ++++    + S +RK G+ GTLRNCCF   D  + LL     L P LLLP+AG + Y +E
Sbjct: 188 QRLLPYVTAGSTVRKGGIVGTLRNCCFNHRDH-EWLLSDQVDLLPFLLLPLAGGEEYTDE 247

Query: 247 DTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQVGYE 306
           +   +P +L   L  ++E+  DP+IR   +E + ++     GRR         I++  + 
Sbjct: 248 EMESLPPDL-QYLPEDKERESDPDIRKMLIETVQLLCATAGGRRIVRQKGTYLIMRELHS 307

Query: 307 DEENPKVMEAYERVGSLLVNSGGDE 329
            E    V  A E++  +L+   GDE
Sbjct: 308 WERESYVSRACEKLIQVLI---GDE 322
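
The percentages in each hit header are the identical and positive-scoring column counts divided by the alignment length (gaps included). A small sketch using the numbers of the HGH1_XENLA hit; the helper name `percent` is an illustrative assumption.

```python
def percent(count, alignment_length):
    """Format count/alignment_length as a percentage with two decimals."""
    return f"{100 * count / alignment_length:.2f}%"

# Values from the HGH1_XENLA hit header: 88 identities and
# 158 positives over an alignment of 325 columns.
identities, positives, alignment_length = 88, 158, 325
print(percent(identities, alignment_length))  # 27.08%
print(percent(positives, alignment_length))   # 48.62%
```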

BLAST of gCucsa.078830.2 vs. Swiss-Prot
Match: HGH1_XENTR (Protein HGH1 homolog OS=Xenopus tropicalis GN=hgh1 PE=2 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 2.9e-22
Identity = 92/330 (27.88%), Positives = 160/330 (48.48%), Query Frame = 1

Query: 7   ELIQFLSSPS-PQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKDVAERA 66
           EL+ FL   +   +R  A++ + G++GS +G QSL   P +L   L    ++   VA+ A
Sbjct: 8   ELLSFLKPETRADVRAQALEYILGVSGSPEGRQSLCAEPRLLCALLDLSTEQSPHVAQDA 67

Query: 67  AEALVNLSQNSDLAGKMINLG----LIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGIA 126
              LVNL+  SD A     L     L+   ++ L +     +  +   L NL++ +    
Sbjct: 68  HHVLVNLT--SDCAAHRALLAHVPTLLPSMLSRLRDPGCPFADSICTALCNLSREEETCQ 127

Query: 127 SLLQT-GDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 186
           S L++   + M  L  M     +   AS       +++  +L N+++   GR  +LD NR
Sbjct: 128 SFLRSLTQEGMCQLLDMLCAPKYNPRAS------LDYLGPLLCNLTQLPEGRHFILDRNR 187

Query: 187 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 246
            ++++++    S S +R+ G+ GTLRNCCF   D    LL     L P LLLP+AG + +
Sbjct: 188 CVVQRLLPYLQSGSTVRRGGIVGTLRNCCFSHRDHAW-LLGDDVDLLPFLLLPLAGGEEF 247

Query: 247 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 306
            EE+   +P +L   L+ ++++  DP+IR   +E + ++     GRR         +++ 
Sbjct: 248 TEEEMETLPPDL-QYLAEDKQREADPDIRKMLIETVLLLCATADGRRLVKQRGTYLVMRE 307

Query: 307 GYEDEENPKVMEAYERVGSLLVNSGGDEEP 331
            +  E  P V  A E++  +L+     EEP
Sbjct: 308 LHSWEREPCVKRACEKLIQMLIG----EEP 323

BLAST of gCucsa.078830.2 vs. Swiss-Prot
Match: HGH1_HUMAN (Protein HGH1 homolog OS=Homo sapiens GN=HGH1 PE=1 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 8.6e-22
Identity = 103/333 (30.93%), Positives = 168/333 (50.45%), Query Frame = 1

Query: 4   ELEELIQFLSSPS-PQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKDVA 63
           E+ +L+ FL+  +   L+ AA+  V  LTG   G   LA     LL +L  L       A
Sbjct: 21  EVVKLLPFLAPGARADLQAAAVRHVLALTGCGPGRALLAGQA-ALLQALMELAPASAP-A 80

Query: 64  ERAAEALVNLSQNSDLAGKMI--NLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 123
             AA ALVNL+ +  L   ++  + GL    M    +     ++     L NL++  A  
Sbjct: 81  RDAARALVNLAADPGLHETLLAADPGLPARLMGRALDPQWPWAEEAAAALANLSREPAPC 140

Query: 124 ASL---LQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLD 183
           A+L   L   +    GL   ++VR+ C     +R  P  ++A +L N+S++ A R  LLD
Sbjct: 141 AALMAALAAAEPADSGLE--RLVRALCTPGYNARA-PLHYLAPLLSNLSQRPAARAFLLD 200

Query: 184 PNRGLLKQII--RQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVA 243
           P+R ++++++   QY  +S +R+ GV GTLRNCCFE     + LL     + P LLLP+A
Sbjct: 201 PDRCVVQRLLPLTQYPDSS-VRRGGVVGTLRNCCFEHRHH-EWLLGPEVDILPFLLLPLA 260

Query: 244 GNKVYKEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGP 303
           G + + EE+  ++P++L   L  ++++  D +IR   +EAI ++     GR+        
Sbjct: 261 GPEDFSEEEMERLPVDL-QYLPPDKQREPDADIRKMLVEAIMLLTATAPGRQQVRDQGAY 320

Query: 304 RILQVGYEDEENPKVMEAYERVGSLLVNSGGDE 329
            IL+  +  E  P V  A E++  +L+   GDE
Sbjct: 321 LILRELHSWEPEPDVRTACEKLIQVLI---GDE 342

BLAST of gCucsa.078830.2 vs. Swiss-Prot
Match: HGH1_MOUSE (Protein HGH1 homolog OS=Mus musculus GN=Hgh1 PE=1 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 1.1e-21
Identity = 94/334 (28.14%), Positives = 170/334 (50.90%), Query Frame = 1

Query: 3   TELEELIQFLS-SPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKDV 62
           TE  EL+ FL       L+ AA   V  LTG+  G   LA  P++L   +   +      
Sbjct: 25  TEAVELLPFLVLGARADLQAAAAQHVLALTGAGSGRTLLAGQPELLRALVDLAVAPAPAP 84

Query: 63  AERAAEALVNLSQNSDLAGKMINLG--LIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAG 122
           +  A+ ALVNL+ + ++  +++     L    +  + +     ++    +L NL++  A 
Sbjct: 85  SRDASRALVNLAADPNVHWQLLAADPELPARLLRCVLDPQWPWAEEAAAVLANLSREPAP 144

Query: 123 IASLLQ---TGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLL 182
            A+L++     + +  GL   ++V + C + S +   P  ++  +L N+S++   R  LL
Sbjct: 145 CAALMEKLMAAEPERLGLE--RLVNALC-TPSYNAAAPLHYLGPLLSNLSQQAEVRAFLL 204

Query: 183 DPNRGLLKQII--RQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPV 242
           DP+R ++++++   QY ++S +R+ GV GTLRNCCFE       L    + L P LLLP+
Sbjct: 205 DPDRCVVQRLLPLTQY-TDSSVRRGGVVGTLRNCCFEHRHHKWLLGAQVDIL-PFLLLPL 264

Query: 243 AGNKVYKEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSING 302
           AG + + EE+  ++P++L   LS ++++  D +IR   +EA+ ++     GR+       
Sbjct: 265 AGPEEFSEEEMDQLPVDL-QYLSPDKQREPDADIRKMLIEAVMLLTATAPGRKQVRDQGA 324

Query: 303 PRILQVGYEDEENPKVMEAYERVGSLLVNSGGDE 329
             IL+  +  E  P V  A E++  +L+   GDE
Sbjct: 325 YLILRELHSWEPEPDVRMACEKLIQVLI---GDE 349

BLAST of gCucsa.078830.2 vs. Swiss-Prot
Match: HGH1_DANRE (Protein HGH1 homolog OS=Danio rerio GN=hgh1 PE=2 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 1.1e-21
Identity = 87/329 (26.44%), Positives = 164/329 (49.85%), Query Frame = 1

Query: 4   ELEELIQFLS-SPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKDVA 63
           E ++L+ FL+      ++  A   + GLTG+ DG + L   PD L   +         + 
Sbjct: 6   EAKDLLSFLTLEMRADVKGQATGYILGLTGNRDGCRYLQSKPDFLKALVTLTSDPSIAIV 65

Query: 64  ERAAEALVNLSQNSDLAGKMIN-LGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGIA 123
           +    AL+NLS +  L   ++    ++ + +  L + +   S  +  +L NL++ +    
Sbjct: 66  KDCFHALINLSADETLHQPLVKETEILSKLIPKLQDPEFVFSDRICTILSNLSRHEQTCR 125

Query: 124 SLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNRG 183
            + +   +   GL   ++V  FC      +     ++A +L N+++    R  +LD +R 
Sbjct: 126 DVFKALQELNVGLD--RLVEIFCTEGFNKKAS-LHYLAPLLSNLTQLPEARHFILDKDRC 185

Query: 184 LLKQII--RQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKV 243
           ++++++   QY+  S  R+ GV GTLRNCCF+     + LL  A  + P LLLP+AG + 
Sbjct: 186 VIQRLLPFTQYEE-SITRRGGVVGTLRNCCFDYVHH-EWLLSDAVDILPFLLLPLAGPEE 245

Query: 244 YKEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQ 303
             EE+   +P++L   L  ++ + +DP+IR   LE + ++     GR+   S N   I++
Sbjct: 246 LSEEENEGLPVDL-QYLPEDKRREEDPDIRKMLLETLMLLTATKVGRQILKSKNVYPIMR 305

Query: 304 VGYEDEENPKVMEAYERVGSLLVNSGGDE 329
             ++ E++P V+ A E++   L+   GDE
Sbjct: 306 EFHKWEKDPHVISACEKLVQALI---GDE 325

BLAST of gCucsa.078830.2 vs. TrEMBL
Match: A0A0A0K4F9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G328330 PE=4 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 2.0e-150
Identity = 282/282 (100.00%), Positives = 282/282 (100.00%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD
Sbjct: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI
Sbjct: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR
Sbjct: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY
Sbjct: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ 283
           KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ
Sbjct: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ 282
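
Each HSP line reports a bit score followed by the raw alignment score in parentheses. The two are related by the Karlin-Altschul normalisation S' = (lambda * S - ln K) / ln 2. The sketch below uses lambda = 0.267 and K = 0.041, the values commonly reported for gapped BLASTP with BLOSUM62 and gap costs 11/1. This is an assumption: the page does not state which scoring parameters were used, and the function name `bit_score` is illustrative.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap costs 11/1);
# the source page does not state the parameters actually used.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a normalised bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Raw scores taken from the HSP lines on this page.
for raw in (277, 1390):
    print(raw, round(bit_score(raw), 1))
```

With these assumed parameters the raw scores 277 and 1390 give 111.3 and 540.0 bits, which matches the values reported for the HGH1_XENLA and A0A0A0K4F9_CUCSA hits.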

BLAST of gCucsa.078830.2 vs. TrEMBL
Match: A0A061EI51_THECC (ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_011758 PE=4 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 1.6e-136
Identity = 248/330 (75.15%), Positives = 292/330 (88.48%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEELI FLS+PSP ++KAA+DIV+GLTGSEDG+ SL+ Y + +LPSL+RLL E K+
Sbjct: 1   MATELEELIDFLSAPSPPVKKAAVDIVRGLTGSEDGLHSLSNYANTVLPSLSRLLSEDKE 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           V+E AAEALVNLSQN++LA KM+ +G++  AM+LLY    SI+++LVMLLVNLTQLD GI
Sbjct: 61  VSEPAAEALVNLSQNAELAAKMVEIGMVKIAMDLLYKPGFSITRVLVMLLVNLTQLDDGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
            SLLQ GD+KMQGLYVMK+VRSFCRS SE+ DDPF+HV SILVNISKKEAGRK+LLDP R
Sbjct: 121 TSLLQIGDEKMQGLYVMKLVRSFCRS-SEAGDDPFDHVGSILVNISKKEAGRKMLLDPKR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIRQ+DS+  LRK GV GT+RNCCFEAE+QLQNLLLI+EFLWPALLLPVAGNK+Y
Sbjct: 181 GLLKQIIRQFDSSGPLRKKGVSGTIRNCCFEAENQLQNLLLISEFLWPALLLPVAGNKIY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300
            E+DTSKMPLEL +ALSIERE + DPEI V ALEAIY+I LQ+AGRRAFWS+NGPRILQV
Sbjct: 241 SEQDTSKMPLELRSALSIEREPVKDPEICVQALEAIYLITLQEAGRRAFWSVNGPRILQV 300

Query: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEP 331
           GYEDEE+PKV+EAYE++GSLLV+  G EEP
Sbjct: 301 GYEDEEDPKVLEAYEQIGSLLVHGSGTEEP 329

BLAST of gCucsa.078830.2 vs. TrEMBL
Match: A0A0B0NYT0_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_09773 PE=4 SV=1)

HSP 1 Score: 490.7 bits (1262), Expect = 1.4e-135
Identity = 249/330 (75.45%), Positives = 291/330 (88.18%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEELI FLS+PS  ++KAA+DIV+ LTGSEDG+ SL+ Y + +LPSL+RLL + K+
Sbjct: 1   MATELEELIGFLSAPSSPVKKAAVDIVRDLTGSEDGLHSLSNYANTVLPSLSRLLSDDKE 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           V+E AAEALVNLSQN+ LA KM+ +GLI  AM++LY   SSI++LLVMLLVNLTQLD GI
Sbjct: 61  VSEPAAEALVNLSQNAGLAAKMVEMGLIKIAMDMLYKPGSSITRLLVMLLVNLTQLDDGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           +SLLQ GDDKMQGLYVMK+VRSFCRS SE+ DDPF+HV SILVNISKKE GRK+LLDP R
Sbjct: 121 SSLLQIGDDKMQGLYVMKLVRSFCRS-SETSDDPFDHVGSILVNISKKEEGRKMLLDPKR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIRQ+DS+S LRK GV GT+RNCCFEAE+QLQNLLLI+EFLWPALLLPVAGNK+Y
Sbjct: 181 GLLKQIIRQFDSSSLLRKKGVSGTIRNCCFEAENQLQNLLLISEFLWPALLLPVAGNKIY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300
            E+DTSKMPLELG+ALSI+RE + DPEIRV ALEAIY+I LQ+AGRRA WS+NGPRILQV
Sbjct: 241 GEQDTSKMPLELGSALSIDREPVKDPEIRVQALEAIYLIALQEAGRRALWSVNGPRILQV 300

Query: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEP 331
           GYEDEE+PKVMEAYE++GSLLV+    EEP
Sbjct: 301 GYEDEEDPKVMEAYEQIGSLLVHGSESEEP 329

BLAST of gCucsa.078830.2 vs. TrEMBL
Match: A9PJP6_9ROSI (Putative uncharacterized protein OS=Populus trichocarpa x Populus deltoides PE=2 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 1.8e-135
Identity = 247/330 (74.85%), Positives = 292/330 (88.48%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEEL+ FLSSPSP ++KAA++IV+ LTGSEDG+ SL+KY   +LPSL++LLKE+K+
Sbjct: 1   MATELEELVGFLSSPSPPVKKAAVEIVRDLTGSEDGLLSLSKYASTVLPSLSQLLKEKKE 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           V+E AAEAL+NLS NS+LA KM+ +G+I  AM++LY  DSSI++LLVMLLVNLTQLD+GI
Sbjct: 61  VSEPAAEALINLSLNSNLAAKMVEMGMIKTAMDVLYKPDSSITRLLVMLLVNLTQLDSGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
            SLLQ  D+KMQGL+VMK+VRSF RS+ E+RDDPF+HV SILVNISKKEAGRK+LLD  R
Sbjct: 121 VSLLQIEDEKMQGLFVMKLVRSFGRSSDETRDDPFDHVGSILVNISKKEAGRKMLLDSKR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQI+RQ+DS S LRK GV GTLRNCCFEAE+QLQN LLI+EFLWPALLLPVAG K+Y
Sbjct: 181 GLLKQILRQFDSTSPLRKKGVSGTLRNCCFEAENQLQNFLLISEFLWPALLLPVAGKKIY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300
            E+DTSKMPLELG+ALSIERE  DDPEIRV ALE+IY+II+Q+AG RA WS+NGPRILQV
Sbjct: 241 SEQDTSKMPLELGSALSIEREPWDDPEIRVEALESIYLIIVQEAGLRALWSVNGPRILQV 300

Query: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEP 331
           GYEDEE+PKVMEAYERVGSLLV+  G EEP
Sbjct: 301 GYEDEEDPKVMEAYERVGSLLVHGCGTEEP 330

BLAST of gCucsa.078830.2 vs. TrEMBL
Match: V7C9K6_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_003G049600g PE=4 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 4.0e-135
Identity = 244/330 (73.94%), Positives = 289/330 (87.58%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATE+EEL+ FL+SPSPQ+ KAA+DIV+GLTGS +G+QSLA Y + LLP+L+RLL   K+
Sbjct: 1   MATEMEELVSFLASPSPQITKAAVDIVRGLTGSAEGLQSLANYSNALLPALSRLLTLPKE 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           V+E AAEALVNLSQNS LA  M+ LGL+   M++LY  +  I+QLLVMLLVNLTQLDAG+
Sbjct: 61  VSEAAAEALVNLSQNSSLAEAMVQLGLVKTTMDVLYKPECVIAQLLVMLLVNLTQLDAGV 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           AS+LQT D+K++GLYVMK+VRSFCR+A ES DD FEHV SILVNISK+  GRKLLLDP R
Sbjct: 121 ASVLQTEDEKVRGLYVMKLVRSFCRTAHESDDDAFEHVGSILVNISKQREGRKLLLDPKR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIRQ+DSNS LRK GV GT+RNCCFEAE++LQNLLL++EFLWPALLLPVAGNK+Y
Sbjct: 181 GLLKQIIRQFDSNSTLRKKGVSGTIRNCCFEAENELQNLLLVSEFLWPALLLPVAGNKIY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300
            E D SKMPLELGTALSIERE ++DPEIR+ ALEAIY+IILQDAGRRAFWS+NGPRI+Q+
Sbjct: 241 NELDRSKMPLELGTALSIEREPVNDPEIRIQALEAIYLIILQDAGRRAFWSVNGPRIVQI 300

Query: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEP 331
           GYEDEE+PKVM AYE++GSLLV+S   EEP
Sbjct: 301 GYEDEEDPKVMGAYEQLGSLLVHSSSAEEP 330

BLAST of gCucsa.078830.2 vs. TAIR10
Match: AT1G14300.2 (AT1G14300.2 ARM repeat superfamily protein)

HSP 1 Score: 430.3 bits (1105), Expect = 1.1e-120
Identity = 224/368 (60.87%), Positives = 282/368 (76.63%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           M TELEEL++FLSSPSP ++KAA++IV GLTGSE+G+QSL+KY ++LLPSL++LL E K+
Sbjct: 1   MVTELEELVEFLSSPSPPVKKAAVEIVSGLTGSEEGLQSLSKYSEILLPSLSQLLNESKE 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           V+E AA+ALVNLSQ  +LA KMI +GLI  AM++LY  +S I++LLVMLLVNLTQLD G+
Sbjct: 61  VSEPAAQALVNLSQKCELAKKMIQMGLIKVAMDMLYKPESCITRLLVMLLVNLTQLDDGV 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           +SLLQ  D+KM GL++MK+VRSFCRS+ E+ DD FEHV SILVNISK E GRKLLL+P R
Sbjct: 121 SSLLQIDDEKMHGLHIMKLVRSFCRSSGETADDQFEHVGSILVNISKTEDGRKLLLEPKR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
            LLKQIIRQ+DS +QLRK GV GT+RNCCFEA++QLQN+LLI+EFLWPALLLPVAG+K Y
Sbjct: 181 RLLKQIIRQFDSTNQLRKKGVAGTIRNCCFEAKNQLQNILLISEFLWPALLLPVAGSKTY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEI-------------------------------- 300
            E+D +KMP ELG+ALSIERE + DP+I                                
Sbjct: 241 SEQDVAKMPPELGSALSIEREPVTDPDIRVQTLEAIYLIILEVKHFSVSLSLLVNSCEGP 300

Query: 301 ------RVHALEAIYMIILQDAGRRAFWSINGPRILQVGYEDEENPKVMEAYERVGSLLV 330
                 +  ++ +  +++LQ+AGRRAFWS+NGPRILQ+GYE EE+PK M AYE+VGSLLV
Sbjct: 301 IKPKTYKRSSVMSKLLLMLQEAGRRAFWSVNGPRILQLGYEYEEDPKAMRAYEQVGSLLV 360

BLAST of gCucsa.078830.2 vs. NCBI nr
Match: gi|778726756|ref|XP_011659154.1| (PREDICTED: protein HGH1 homolog [Cucumis sativus])

HSP 1 Score: 643.3 bits (1658), Expect = 2.4e-181
Identity = 332/332 (100.00%), Positives = 332/332 (100.00%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD
Sbjct: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI
Sbjct: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR
Sbjct: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY
Sbjct: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300
           KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV
Sbjct: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300

Query: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD 333
           GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD
Sbjct: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD 332

BLAST of gCucsa.078830.2 vs. NCBI nr
Match: gi|659124076|ref|XP_008461977.1| (PREDICTED: FAM203 family protein DDB_G0276861 isoform X1 [Cucumis melo])

HSP 1 Score: 636.3 bits (1640), Expect = 2.9e-179
Identity = 327/332 (98.49%), Positives = 331/332 (99.70%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDG+QSLAKYPDVLLPSLARLL+EQKD
Sbjct: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGLQSLAKYPDVLLPSLARLLREQKD 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           VAERAAEALVNLSQNSDLAGKMI+LGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI
Sbjct: 61  VAERAAEALVNLSQNSDLAGKMIDLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           ASLLQTGDDKMQGLYVMKIVRSFCRSASES DDPFEHVASILVNISKKEAGRKLLLDPNR
Sbjct: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESSDDPFEHVASILVNISKKEAGRKLLLDPNR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLP+AGNKVY
Sbjct: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPIAGNKVY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300
           KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV
Sbjct: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300

Query: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD 333
           GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD
Sbjct: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD 332

BLAST of gCucsa.078830.2 vs. NCBI nr
Match: gi|700189312|gb|KGN44545.1| (hypothetical protein Csa_7G328330 [Cucumis sativus])

HSP 1 Score: 540.0 bits (1390), Expect = 2.8e-150
Identity = 282/282 (100.00%), Positives = 282/282 (100.00%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD
Sbjct: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI
Sbjct: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR
Sbjct: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY
Sbjct: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ 283
           KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ
Sbjct: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ 282

BLAST of gCucsa.078830.2 vs. NCBI nr
Match: gi|659124081|ref|XP_008461979.1| (PREDICTED: FAM203 family protein DDB_G0276861 isoform X2 [Cucumis melo])

HSP 1 Score: 533.1 bits (1372), Expect = 3.5e-148
Identity = 277/282 (98.23%), Positives = 281/282 (99.65%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDG+QSLAKYPDVLLPSLARLL+EQKD
Sbjct: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGLQSLAKYPDVLLPSLARLLREQKD 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           VAERAAEALVNLSQNSDLAGKMI+LGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI
Sbjct: 61  VAERAAEALVNLSQNSDLAGKMIDLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           ASLLQTGDDKMQGLYVMKIVRSFCRSASES DDPFEHVASILVNISKKEAGRKLLLDPNR
Sbjct: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESSDDPFEHVASILVNISKKEAGRKLLLDPNR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLP+AGNKVY
Sbjct: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPIAGNKVY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ 283
           KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ
Sbjct: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQ 282

BLAST of gCucsa.078830.2 vs. NCBI nr
Match: gi|659124084|ref|XP_008461980.1| (PREDICTED: protein FAM203A isoform X3 [Cucumis melo])

HSP 1 Score: 518.5 bits (1334), Expect = 8.9e-144
Identity = 281/332 (84.64%), Positives = 288/332 (86.75%), Query Frame = 1

Query: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGMQSLAKYPDVLLPSLARLLKEQKD 60
           MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDG+QSLAKYPDVLLPSLARLL+EQKD
Sbjct: 1   MATELEELIQFLSSPSPQLRKAAIDIVQGLTGSEDGLQSLAKYPDVLLPSLARLLREQKD 60

Query: 61  VAERAAEALVNLSQNSDLAGKMINLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120
           VAERAAEALVNLSQNSDLAGKMI+LGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI
Sbjct: 61  VAERAAEALVNLSQNSDLAGKMIDLGLIGEAMNLLYNVDSSISQLLVMLLVNLTQLDAGI 120

Query: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESRDDPFEHVASILVNISKKEAGRKLLLDPNR 180
           ASLLQTGDDKMQGLYVMKIVRSFCRSASES DDPFEHVASILVNISKKEAGRKLLLDPNR
Sbjct: 121 ASLLQTGDDKMQGLYVMKIVRSFCRSASESSDDPFEHVASILVNISKKEAGRKLLLDPNR 180

Query: 181 GLLKQIIRQYDSNSQLRKNGVFGTLRNCCFEAEDQLQNLLLIAEFLWPALLLPVAGNKVY 240
           GLLKQIIR                     +++  QL+                   N VY
Sbjct: 181 GLLKQIIR--------------------QYDSNSQLRK------------------NGVY 240

Query: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 300
           KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV
Sbjct: 241 KEEDTSKMPLELGTALSIEREQIDDPEIRVHALEAIYMIILQDAGRRAFWSINGPRILQV 294

Query: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD 333
           GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD
Sbjct: 301 GYEDEENPKVMEAYERVGSLLVNSGGDEEPHD 294

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
HGH1_XENLA | 2.0e-23 | 27.08 | Protein HGH1 homolog OS=Xenopus laevis GN=hgh1 PE=2 SV=1
HGH1_XENTR | 2.9e-22 | 27.88 | Protein HGH1 homolog OS=Xenopus tropicalis GN=hgh1 PE=2 SV=1
HGH1_HUMAN | 8.6e-22 | 30.93 | Protein HGH1 homolog OS=Homo sapiens GN=HGH1 PE=1 SV=1
HGH1_MOUSE | 1.1e-21 | 28.14 | Protein HGH1 homolog OS=Mus musculus GN=Hgh1 PE=1 SV=1
HGH1_DANRE | 1.1e-21 | 26.44 | Protein HGH1 homolog OS=Danio rerio GN=hgh1 PE=2 SV=1

Match Name | E-value | Identity (%) | Description
A0A0A0K4F9_CUCSA | 2.0e-150 | 100.00 | Uncharacterized protein OS=Cucumis sativus GN=Csa_7G328330 PE=4 SV=1
A0A061EI51_THECC | 1.6e-136 | 75.15 | ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_011758 PE=4 S...
A0A0B0NYT0_GOSAR | 1.4e-135 | 75.45 | Uncharacterized protein OS=Gossypium arboreum GN=F383_09773 PE=4 SV=1
A9PJP6_9ROSI | 1.8e-135 | 74.85 | Putative uncharacterized protein OS=Populus trichocarpa x Populus deltoides PE=2...
V7C9K6_PHAVU | 4.0e-135 | 73.94 | Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_003G049600g PE=4 SV=1

Match Name | E-value | Identity (%) | Description
AT1G14300.2 | 1.1e-120 | 60.87 | ARM repeat superfamily protein

Match Name | E-value | Identity (%) | Description
gi|778726756|ref|XP_011659154.1| | 2.4e-181 | 100.00 | PREDICTED: protein HGH1 homolog [Cucumis sativus]
gi|659124076|ref|XP_008461977.1| | 2.9e-179 | 98.49 | PREDICTED: FAM203 family protein DDB_G0276861 isoform X1 [Cucumis melo]
gi|700189312|gb|KGN44545.1| | 2.8e-150 | 100.00 | hypothetical protein Csa_7G328330 [Cucumis sativus]
gi|659124081|ref|XP_008461979.1| | 3.5e-148 | 98.23 | PREDICTED: FAM203 family protein DDB_G0276861 isoform X2 [Cucumis melo]
gi|659124084|ref|XP_008461980.1| | 8.9e-144 | 84.64 | PREDICTED: protein FAM203A isoform X3 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR007205 | Protein_HGH1_N
IPR007206 | Protein_HGH1_C
IPR011989 | ARM-like
IPR016024 | ARM-type_fold
Vocabulary: Molecular Function
Term | Definition
GO:0005488 | binding
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0005488 | binding

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cucsa.078830.2 | Cucsa.078830.2 | mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR007205 | Protein HGH1 N-terminal | PFAM | PF04063 | DUF383 | coord: 97..269; score: 8.9
IPR007206 | Protein HGH1 C-terminal | PFAM | PF04064 | DUF384 | coord: 275..323; score: 2.
IPR011989 | Armadillo-like helical | GENE3D | G3DSA:1.25.10.10 | | coord: 8..316; score: 6.2
IPR016024 | Armadillo-type fold | unknown | SSF48371 | ARM repeat | coord: 2..319; score: 2.27
None | No IPR available | unknown | Coil | Coil | coord: 52..75; scor
None | No IPR available | PANTHER | PTHR13387 | FAMILY NOT NAMED | coord: 1..322; score: 2.0E
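The `coord: start..end` values above give the span of each annotated region on the protein. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for protein domain annotations, but not stated on this page), the span length can be computed as follows. The function name `span_length` is hypothetical.

```python
import re

# Pulls the two integers out of an alignment field such as "coord: 97..269".
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_length(alignment_field):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    match = COORD.search(alignment_field)
    if match is None:
        raise ValueError("no coord found in: " + alignment_field)
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


# Spans of the two PFAM hits listed in the table above.
print(span_length("coord: 97..269"))   # PF04063 (DUF383)
print(span_length("coord: 275..323"))  # PF04064 (DUF384)
```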

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene | Organism | Block
gCucsa.078830.2 | Cucurbita maxima (Rimu) | cgycmaB0136
gCucsa.078830.2 | Cucurbita maxima (Rimu) | cgycmaB0143
gCucsa.078830.2 | Cucurbita moschata (Rifu) | cgycmoB0138
gCucsa.078830.2 | Cucurbita moschata (Rifu) | cgycmoB0133
gCucsa.078830.2 | Cucurbita pepo (Zucchini) | cgycpeB0131
gCucsa.078830.2 | Cucurbita pepo (Zucchini) | cgycpeB0133
gCucsa.078830.2 | Silver-seed gourd | carcgyB0092