Cucsa.032420 (gene) Cucumber (Gy14) v1

NameCucsa.032420
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionSequence-specific DNA-binding transcription factor
Locationscaffold00429 : 1719778 .. 1725494 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATAaCGAATTTCATTTCTGTAgACGGGATCTTCAATTCTGATTtCTTTTTCCCGGCACTTTTCTCTTTGTTTCTCTCTCTGCTTCTTCGATAGCGACGCCGAAATCAAAGAAACACACCGAAAATTTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACGGCTTCCGAGGTCCTTTtCCCCCTTCTTATTCTTCTTCTTTTGTTGTTTTtCCCTCTTTtTTTttCCTTTCTATTTTCTTACTGTGAATGATGTGTTCCTGATGCATGTTAGATGCTTGAGTTGAAATTTTTTCTGGGTTTGCTCAATTGCTACGTCTATTTGCTTGATTGGTTTTTTTTTTtAGAAAAAGATTAACTTTACTGATTATTGTGCTtGTTTTTTTTTTTtttCTGTATTGGTGTTTTGGGGAGGTTTCTTATCGATTTCTTTTGAACTGATTAGCTGTTTTGTTATTTTCTAACTTGTATTCTGTTtTTTTTTTtAAAAAAACATTTCCCCCCATTTCTTAATCCACTTGTGATTGTGTTTTtCCCCTTTTGAATTACGATGCCCTGCCCCCAAATTACTACTTCTGTAACTGGTTTGCTTTCATTGCATAGAGTCGAGGTTGTGGTTGGTGTAAGTGTAGGTACCTGCATTTAGGTAAGGATTGCAGAGAGGGAGATTGTTGGGATTTGCTGGATAACGGTAATTTGGAAGAGTGCCATTTTCAATCACGCTTTCCATATTTATGTTTTGGTGAGTGGTGAGAGAATTTGAAGTTGCTGGTAGTTGAACTTTTGGTTTAGAGCTTTTGATAAAATTTGTTCCAGACCCAAATGGAATAGATCTAACCATCTGAATTTGAGTGGATATTGTGTTGACCCCGTTTCGAAATGGAACAGTTGGGTATCTTTTGGCCTTTTTGCTGTTCATTCGTAAGAAGGTAACTAATTTTCCTGCCGGATGTATTTTGTTCTATAAGGAAATACATTCTTGATATGGAGTTGAAAATTGTTTCAGTCAAATGAAGTTTTTCTGTTCTTGCTTTAAGGCTGCTGCTGTAGTTGTTTCTTTTCTGGGCATTATAATCTCATAGGTTACAATTCCTTAAAAGTCAAAACTTCCGTTGAATGTAATATTCAATTCTTATTTTTATCAAGGTCGCGGAGATGGAAGCCATATTGCAAGGACACAACAATACCATGCCAGCTCGGGAAGTTCTTGTTGCACTTGCTGATAAGTTCAGGTGGACATCGAGCAGTAGCCTACTGTGGCTTTTATTATTTTTGAGTTGATCTTTGTATGTTTTTTtAATAGAAGACAGGATTGAATAGTCAAAaGAATAATTTGTTTATGTTGTTATAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTATGGTGGCTTCTGTTTCTATGCTTGACCTTGATATGATGGGTAGAGGTGTTCAGCTAATTATCATTATCATTTCTTAGGTTTGGAATTGGTTCCAGAATAGACGATATGCTATCAGAGCAAAGACATCCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAGTTGTCCAAATTGAGTCAACCCCTGTGAGAAATGTGCCTCAAACTGTAGTTGTTCCTGCTCCCGCACCAGTAGGTATTAAAAAGAATTGTTATGTTATCTTAGTTTTTAAGTATTTTCATTTTCTTATGCTGTTTTTCTTTTTtATTACATGTTTATTACTAGCTTATCATCACAAATCTATTCTAGTAGTAGAATGGCATTCGTCAAATCCTCAGTCTAGGCTTGGATCACCATTTTAAAACTTTTCCTCGCTGGCACTGGGAAAAAGGTGTACGCAGAGGCGTGGTAGCTTTAATGGCCCCAAATCTTCATTTTGATTTGTACATTTTTATGAAATTATTACGAAAAAaCAAATTGCAAATTGAGCTGCTAGCTTTTATTGTCTTCTTGTCCTTCTCAAGGATTTTCACCTTGCTCTCTTTTCCACCTCCTGAGCTTTCCCTCACTTAGATTCTTGGCTTAGGAAGTCTTACATGAGAGAGTTAATACTATGGACCATGTGTAGAGAGACTCTTCCTTCTTTCTTGGGCCTCGGTGGTGTTCCTACTTTAGAGGGAGCTGTATAAGACCTTGTCAAATTCTTTAGAGCTGCCAGTTTCCTTTTTCAATTTGGTCTTGGTTTTTtGGGTGGCCCTTGGTCTGTCTTTGGCTTGGAATATGAATGGCATGGCCATGATATAGGAGGTTCTGCCTTCTCCTACCTTTAGGGTGAAGGGAAAGTTGTGGCAAACTTGCTTCTTTTCTGTTGTTTGGAGGAATTGACTTAGTGAGAATTAATAGATTCTTAAAGGAAACATTACATTTAATACTTTGGTGGTTGAAGTGTTTAACTATTATCAGTTGAGTCTTATTATTTCGAATTGGATATTGTTTCTACCTTAAATCATGGTTGACCATGTTTATATCCTATGAAGAACTATAGTTTTTTAGAATGACATGGTGTGGAAAGCTATGAAGCACGGATGCTTCAGTTTGGATAGTGTGTGAACAAAATATATGTGAGCAAAACTTATTGGACGGGCTGACCACTTGACACATGTTGGATGGCTTGATAGTATGTGTCTGTGTCAAACAATTGTTGGACACTTAAATAAAATATTCAATTTATTGGTGTTCTATTTTTGGTCTGTTTATTTAGTTTTGAAAAAAGTGAAATAAAAATAAAGTTGAGAGGACAATAATAAATTTTGTGAGGAAACACCAATCTTGTTTTTCATGCATATTAATGGATGAGCACCTTTGATTATGGATTTGGTTTTTTAAGATGATTCATATTGAAGCACAGTAAGTTTGTTTTTGGATCATTTATTTTTtATTATTATTTTTTTGAGATAAGGCATTTGTATCAGTCAAACTGAAAAGCAATTCTAAGTACGCAAAATGTGGAGGGGAATTTTGACGTAATTATGTTATTGACCTTAACATATTTTAATTTTTCATTCAATTTCTGTGCCATTCCAGGCTCTGCAAAGGGTGCTCCAGAAAATCCATTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTTAGTATAGCTTGTCTACAAATATTGGAATTTTGGAGTGAATTAGCTGGTTTCCATGCTTTTGGATGATAAACCTCCTCATGATAGGTGTAGAGGAACAATTGCAGTTCCATGCATACACACATGTGTGAATTAATTAACATGCAACAAACAGACTTGCATAATGTGAGACATTACTGATTGACTTTTTTAACTTTTCATGGTCAGGTATGACGTTGCTACCTTTTTATCGCATAGATCTGTGGAAAGTGGTGACCCGgTAATTTTGCGCCTCTTGCATGGAAGTTATTATTTAAGGAAGCATTGCGCTGCCTTTCTTTCCTCATCAACCTATTTGCTGTTTTCTTCTTCATGCATTGTTTTTTtTTTTtAGCTTTTATCATTAAAAATGAACATGGAAGTATTAAGCCAATTAATTCTTGAATGGCCACTATCTCCATTGTTAGGAAAACATTAATCTGCCTTCTATCCTCAACccATGGGCCCAATGTCAAGGGCTTTTCTTTAGTTTGGTGGTGGCTTGAAAACCTTTTGGATCACATTCAATCCTACATTTTCTCTAAGTATTTATTATATATGTGGTTTCAAACTAAAACAAAAGTTAGAATTATAGGGTTTTTGTGTGCTACGGAGAGGGAATTGTAACATTCTCCTTCCCCcTCCCTACCTTTGAGTTTGTATTGACATTATTAGGATTGAGCAACAACTTTTCTGACCAATATTTTTTtATTTTTGGTGATTCCCAAAAATGGTCTGAGATATTTGATTGTAATTCAAAAGTTTGTCTCCTGATTTTTCGTTGAGTACAACAAATGTAGTGGTGAGAATGCAATTCTTTGGACCTCAAAGGAAGGAATAGATCCCTTAATCATTGATGTTCATGATTTTGGTAAAGAAATCTGATGTTTTGATTTGTTTGCTTTAATCTGGCTCCCTATATTACCTAATAATTGTTGTTTGTCAGGAAGTCTAATAGTAATTGTTTCTCAGGAAGTACTAGTAAGATTTTCTGGTTTTGGATCCGAGGAGGACGAGTGGGTTAATATACGAAGGAACATTAGACCTCGTTCTCTACCTTGTGAATCATCAGAATGTGTAGCGGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGTAAATACTTTTACTCCAAACTTTCAGTACTTAAAAGGGAGGCCTTTCCATCACCTTGCAATCAGTTCAGGACAATCTCTTGGATGTAATTTCCGACTACCTTGTAAAACCACTCCCCTAGTTGGAGGACACAATGATCTTTCAAATTTCTAACTAGAAAACTCAACACACGCTCTACCAGTTATAGGGGAAACTTTTTTTAATGACGAATTTCATTATCTTCAAAATTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCACGTGCTTGATACACAACGGAGAAGACATGATGTACGAGGTTGTCGATGCAGGTTTTTGGTTCGTTATGATCACGATCAATCTGAGGTGTGTATCTTCTTCAGGTTCTTTTGGTTGTTCTTATTTGGTATTGCTCTCCTAGCTTGGGGTTGTTGCAATGAACACAAACATCACATCAATATAACCATTGCCGTTAATTTCACATGAGCTGAAGACTTTAGCTTTTGATGTGGGGAAAATTGGAAATCAAGAGCTTGGTTAGTGGTTGGGCTGATGTCATTTTTTTTTCACGTTTCTTCTTTTTaTAAAAAAAaGTAATCCTATTCTCTTTCTTTGGAGTTGTAGAGTGTAGGCTGGATAGTTGGTTTTCTAAGGGGAGTTTGCTTCTCGTTATTGAATTAGCTAACCATATGGCTACGTGGAAGAACCTTTTCTAATCCTCTACGGATTTTCGGAAGTTTTTTtCCTTCCTTTTTGTAAACTTCATCTATGAAATTCTTGTTTCTTCTTCAAGAAAAATAAAAGGAGGTTATTTCTATAACAAGCGAGAAGTGTTCTATTTAATCAAGCATTGAATGCTGTTCAGGATTATTGACTTATCTTGCTCTAAATTATTGTTAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGGCCTGAGACTGATTACCGGTTGCAACAGCTTCACGCCGTAAATGAAGCAGCATCCATTGAGCCCTCAAAGTCAGGCATGGATTCTGTACTGCTCAGTGGTCAGAGGATAAATTTCGAAACATCACAAAATCCACTTAGCAAGGATGCAGCCTTGGTTATACCAAATGCAAATCCTCATATAAATGCCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATACTGAAACTAACACTGCTCCAACCACATTCAACTCTGCTAATCTTGCAGGTAGCTCTGCATTCTCGAGTGGTATCGTGACAAACACTGTTTCTGCTGGGTCAGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGATTATGTGGGAAAAGTAAAATTCTCCATCAGTCTAATTTTAACAGAACCTATCAATTTAAAATTTTGCCTGACTCGTTTATTTAGGATGAGTAAATACCTAGCGAAGTCTGTTCTTTTTGTCCCATGTTTCAAAGTTATAGGTTCTCCATCCACTTGCTGTGAATGCTGAATGTTCTTGACGGATGAAAAATGCAGGA

mRNA sequence

AAAATAACGAATTTCATTTCTGTAGACGGGATCTTCAATTCTGATTTCTTTTTCCCGGCACTTTTCTCTTTGTTTCTCTCTCTGCTTCTTCGATAGCGACGCCGAAATCAAAGAAACACACCGAAAATTTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACGGCTTCCGAGGTCGCGGAGATGGAAGCCATATTGCAAGGACACAACAATACCATGCCAGCTCGGGAAGTTCTTGTTGCACTTGCTGATAAGTTCAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTATCAGAGCAAAGACATCCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAGTTGTCCAAATTGAGTCAACCCCTGTGAGAAATGTGCCTCAAACTGTAGTTGTTCCTGCTCCCGCACCAGTAGGCTCTGCAAAGGGTGCTCCAGAAAATCCATTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCTACCTTTTTATCGCATAGATCTGTGGAAAGTGGTGACCCGGAAGTACTAGTAAGATTTTCTGGTTTTGGATCCGAGGAGGACGAGTGGGTTAATATACGAAGGAACATTAGACCTCGTTCTCTACCTTGTGAATCATCAGAATGTGTAGCGGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCACGTGCTTGATACACAACGGAGAAGACATGATGTACGAGGTTGTCGATGCAGGTTTTTGGTTCGTTATGATCACGATCAATCTGAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGGCCTGAGACTGATTACCGGTTGCAACAGCTTCACGCCGTAAATGAAGCAGCATCCATTGAGCCCTCAAAGTCAGGCATGGATTCTGTACTGCTCAGTGGTCAGAGGATAAATTTCGAAACATCACAAAATCCACTTAGCAAGGATGCAGCCTTGGTTATACCAAATGCAAATCCTCATATAAATGCCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATACTGAAACTAACACTGCTCCAACCACATTCAACTCTGCTAATCTTGCAGGTAGCTCTGCATTCTCGAGTGGTATCGTGACAAACACTGTTTCTGCTGGGTCAGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGATTATGTGGGAAAAGTAAAATTCTCCATCAGTCTAATTTTAACAGAACCTATCAATTTAAAATTTTGCCTGACTCGTTTATTTAGGATGAGTAAATACCTAGCGAAGTCTGTTCTTTTTGTCCCATGTTTCAAAGTTATAGGTTCTCCATCCACTTGCTGTGAATGCTGAATGTTCTTGACGGATGAAAAATGCAGGA

Coding sequence (CDS)

ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACGGCTTCCGAGGTCGCGGAGATGGAAGCCATATTGCAAGGACACAACAATACCATGCCAGCTCGGGAAGTTCTTGTTGCACTTGCTGATAAGTTCAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTATCAGAGCAAAGACATCCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAGTTGTCCAAATTGAGTCAACCCCTGTGAGAAATGTGCCTCAAACTGTAGTTGTTCCTGCTCCCGCACCAGTAGGCTCTGCAAAGGGTGCTCCAGAAAATCCATTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCTACCTTTTTATCGCATAGATCTGTGGAAAGTGGTGACCCGGAAGTACTAGTAAGATTTTCTGGTTTTGGATCCGAGGAGGACGAGTGGGTTAATATACGAAGGAACATTAGACCTCGTTCTCTACCTTGTGAATCATCAGAATGTGTAGCGGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCACGTGCTTGATACACAACGGAGAAGACATGATGTACGAGGTTGTCGATGCAGGTTTTTGGTTCGTTATGATCACGATCAATCTGAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGGCCTGAGACTGATTACCGGTTGCAACAGCTTCACGCCGTAAATGAAGCAGCATCCATTGAGCCCTCAAAGTCAGGCATGGATTCTGTACTGCTCAGTGGTCAGAGGATAAATTTCGAAACATCACAAAATCCACTTAGCAAGGATGCAGCCTTGGTTATACCAAATGCAAATCCTCATATAAATGCCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATACTGAAACTAACACTGCTCCAACCACATTCAACTCTGCTAATCTTGCAGGTAGCTCTGCATTCTCGAGTGGTATCGTGACAAACACTGTTTCTGCTGGGTCAGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGA

Protein sequence

MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSKDAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGSADNVSDGKLLS*
BLAST of Cucsa.032420 vs. Swiss-Prot
Match: SHH2_ARATH (Protein SAWADEE HOMEODOMAIN HOMOLOG 2 OS=Arabidopsis thaliana GN=SHH2 PE=2 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 2.8e-106
Identity = 208/353 (58.92%), Postives = 248/353 (70.25%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALADKFSES ERKGK+ VQ 
Sbjct: 1   MGRPPSNGGPAFRFILPEVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQF 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVRNVPQTVVVP----------- 120
           KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R+V Q + VP           
Sbjct: 61  KQIWNWFQNRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPG 120

Query: 121 -APAPVGSA-----KGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSG 180
             PAP GS      +   +N   EFEAKS RDGAWYDV  FL+HR++E GDPEV VRF+G
Sbjct: 121 MTPAPSGSLVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAG 180

Query: 181 FGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRR 240
           F  EEDEW+N+++++R RSLPCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRR
Sbjct: 181 FEVEEDEWINVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRR 240

Query: 241 HDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGMD 300
           HDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ A S +     +D
Sbjct: 241 HDVRGCRCRFLVRYSHDQSEEIVPLRKICRRPETDYRLQQLHNAVNDLANSNQHQIPALD 300

Query: 301 SVLLSGQRINFETSQNPLSKDAA---LVIPNA-NPHINAHAQTSTQEARNTET 330
           +            ++ PLS   A   +V P + +P ++A   T  Q + N  T
Sbjct: 301 A-----------AAKTPLSLPGATVPIVAPESKDPSLSATPATLVQPSSNAAT 342

BLAST of Cucsa.032420 vs. Swiss-Prot
Match: SHH1_ARATH (Protein SAWADEE HOMEODOMAIN HOMOLOG 1 OS=Arabidopsis thaliana GN=SHH1 PE=1 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 1.2e-45
Identity = 106/248 (42.74%), Postives = 144/248 (58.06%), Query Frame = 1

Query: 14  FTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYA 73
           FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ +   
Sbjct: 14  FTLSEIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKL-- 73

Query: 74  IRAKTSKAPGKLAVSPVVQIE-----STPVRNVPQTVVVPAPAPVGSAKGAPENPLS-EF 133
                S+   K   SP +QI      S+   N      V     V + KG   +     F
Sbjct: 74  --KHQSQPKSKTLPSPPLQIHDLSNPSSYASNASNATFVGNSTFVQTRKGKASDLADLAF 133

Query: 134 EAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESS 193
           EAKS RD AWYDV++FL++R + +G+ EV VRFSGF +  DEWVN++ ++R RS+P E S
Sbjct: 134 EAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEPS 193

Query: 194 ECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQL 253
           EC  V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L
Sbjct: 194 ECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTEESLGL 253

Query: 254 RKICRRPE 256
            +ICRRPE
Sbjct: 254 ERICRRPE 257

BLAST of Cucsa.032420 vs. TrEMBL
Match: A0A0A0LC67_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236580 PE=4 SV=1)

HSP 1 Score: 718.4 bits (1853), Expect = 4.5e-204
Identity = 365/371 (98.38%), Postives = 365/371 (98.38%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120
           K      QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP
Sbjct: 61  K------QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300
           SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS 360
           DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS 360

Query: 361 ADNVSDGKLLS 372
           ADNVSDGKLLS
Sbjct: 361 ADNVSDGKLLS 365

BLAST of Cucsa.032420 vs. TrEMBL
Match: M5WG16_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007389mg PE=4 SV=1)

HSP 1 Score: 496.1 bits (1276), Expect = 3.7e-137
Identity = 256/372 (68.82%), Postives = 290/372 (77.96%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFT SEV+EMEAILQ HNNTMPAREVLVALADKFSES ERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTQSEVSEMEAILQQHNNTMPAREVLVALADKFSESAERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQ-----TVVVPAPAPVGS 120
           KQVWNWFQNRRYAIRAK+SK  GKL VSP+ + +S PVRNVPQ        + AP+  GS
Sbjct: 61  KQVWNWFQNRRYAIRAKSSKVLGKLNVSPMSRDDSNPVRNVPQGPQPIAAPIHAPSAQGS 120

Query: 121 AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRR 180
            KGA EN + EFEAKSGRDGAWYDVA FLSHR +E+GDPEVLVRF+GFG EEDEWVN+R+
Sbjct: 121 GKGASENSIFEFEAKSGRDGAWYDVANFLSHRYLETGDPEVLVRFAGFGPEEDEWVNVRK 180

Query: 181 NIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVR 240
           ++R RSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFLVR
Sbjct: 181 HVRQRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFLVR 240

Query: 241 YDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQ 300
           Y HDQSEEIV LRK+CRRPETDYRLQQLHAVNEAAS E  +  MD  +  G   + E  Q
Sbjct: 241 YVHDQSEEIVPLRKVCRRPETDYRLQQLHAVNEAASAE--QKSMDHFM--GSVTSAEMMQ 300

Query: 301 NPLSKDAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNT 360
              + DAA   P  + + +   Q++T E + +E +T  ++ NS    GS+  +SG  T  
Sbjct: 301 KQQNTDAASAPPVLHANASLATQSTTPEFKGSEVSTVISSGNSNFPPGSAVITSGTATVV 360

Query: 361 VSAGSADNVSDG 368
           V  GS +N+  G
Sbjct: 361 VPGGSVENMPKG 368

BLAST of Cucsa.032420 vs. TrEMBL
Match: W9RI10_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012170 PE=4 SV=1)

HSP 1 Score: 488.0 bits (1255), Expect = 1.0e-134
Identity = 253/372 (68.01%), Postives = 282/372 (75.81%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPP NGGPAFRFTASEVAEMEAILQ HNNTMPARE+LV LADKFSESVERKGKI VQM
Sbjct: 1   MGRPPGNGGPAFRFTASEVAEMEAILQEHNNTMPAREILVDLADKFSESVERKGKIMVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120
           KQVWNWFQNRRYAIRAK S+  G L+VS + + + TPVRNVPQ +  P PAP G+ +GA 
Sbjct: 61  KQVWNWFQNRRYAIRAKLSRNLGMLSVSSMPRDDPTPVRNVPQAITAPIPAPSGTGRGAS 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           EN + EFEAKSGRDGAWYDVA F SHR +ESGDPEVLVRF GFG E+DEWVNIR+++R R
Sbjct: 121 ENSIMEFEAKSGRDGAWYDVANFFSHRYLESGDPEVLVRFVGFGPEDDEWVNIRKHVRQR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSG--QRINFETSQNPL 300
           SEEIV LRK+CRRPETDYRLQQL+AVNEAAS E  KS  D+    G   RI+ ET+    
Sbjct: 241 SEEIVPLRKVCRRPETDYRLQQLYAVNEAASAEQQKSSTDNFGGGGFRARISAETTPKLQ 300

Query: 301 SKDAALVIPNANPHINAHAQTSTQEARNTE-TNTAPTTFNSANL-AGSSAFSSGIVTNTV 360
             DAALV P  +       + S  E +  E  N      NS N+ A  +   SG   +  
Sbjct: 301 HADAALVAPALHATAALATKASILEPKKVEIVNVVVDAGNSNNVTASGNGIMSGSPASNK 360

Query: 361 SAGSADNVSDGK 369
              S + + +GK
Sbjct: 361 PIVSGEKMPEGK 372

BLAST of Cucsa.032420 vs. TrEMBL
Match: G7LDQ2_MEDTR (Sequence-specific DNA-binding transcription factor OS=Medicago truncatula GN=MTR_8g061040 PE=4 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 1.4e-128
Identity = 240/376 (63.83%), Postives = 278/376 (73.94%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFT  EV EMEAIL  HNN MPAR+VL ALADKFSES +RKGKI VQM
Sbjct: 1   MGRPPSNGGPAFRFTQPEVTEMEAILSEHNNAMPARDVLQALADKFSESPDRKGKITVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGS----A 120
           KQVWNWFQN+RYAIRAK+SK P KL ++P+ + + TP R + Q    P PAP  S    A
Sbjct: 61  KQVWNWFQNKRYAIRAKSSKTPAKLNITPMPRTDLTPGRIMTQPTASPIPAPSASVQTTA 120

Query: 121 KGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRN 180
           K APEN + EFEAKSGRDGAWYDVATFLS+R +ES DPEVLVRF+GFGSEEDEW+N+R+N
Sbjct: 121 KAAPENSVMEFEAKSGRDGAWYDVATFLSYRHLESSDPEVLVRFAGFGSEEDEWINVRKN 180

Query: 181 IRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRY 240
           +RPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFLVRY
Sbjct: 181 VRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFLVRY 240

Query: 241 DHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMD-SVLLSGQRINFETSQ 300
           DHDQSEEIV LRKICRRPETDYRL QLHAVN+AA  +  K  +D    + G R+   +  
Sbjct: 241 DHDQSEEIVPLRKICRRPETDYRLHQLHAVNDAAPTDQQKIALDHPANVHGARVTNPSEM 300

Query: 301 NPLSKDAA---LVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIV 360
               +  A   +V P    +++   Q+   +    ET       NS     S+AF+  I 
Sbjct: 301 VQKQQQIANIHIVTPVLQTNVSIPPQSMNVDPMKAETKADVQAGNSVT-PSSAAFTGIIA 360

Query: 361 TNTVSAGSADNVSDGK 369
           T++V   S  N+++GK
Sbjct: 361 TSSVPEVSTQNLAEGK 375

BLAST of Cucsa.032420 vs. TrEMBL
Match: A0A067KMR4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_09823 PE=4 SV=1)

HSP 1 Score: 461.5 bits (1186), Expect = 1.0e-126
Identity = 239/372 (64.25%), Postives = 277/372 (74.46%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRF  +EVAEME ILQ H+N+MPAREVLVALA+KFSES ERKGKI VQM
Sbjct: 1   MGRPPSNGGPAFRFMPNEVAEMEGILQEHHNSMPAREVLVALAEKFSESTERKGKIIVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120
           KQVWNWFQNRRYAIRAK+SK P KL V+P+ + ESTPVR+VPQ V  P PA + +    P
Sbjct: 61  KQVWNWFQNRRYAIRAKSSKTPVKLNVTPMSREESTPVRSVPQAVAAPIPAAIPATMALP 120

Query: 121 ----------ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEW 180
                     EN   EFEAKS RDGAWYDV TFLSHR +++GDPEVLVRF+GFG +EDEW
Sbjct: 121 SVPSAGRTTTENSYMEFEAKSARDGAWYDVGTFLSHRHLDTGDPEVLVRFAGFGPDEDEW 180

Query: 181 VNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRC 240
           VNIR+++R RSLPCE+SECVAVLPGDLILCFQEGK+QALYFDAHVLD QRRRHDVRGCRC
Sbjct: 181 VNIRKHVRQRSLPCEASECVAVLPGDLILCFQEGKDQALYFDAHVLDAQRRRHDVRGCRC 240

Query: 241 RFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRIN 300
           RFLVRYDHD SEEIV LRK+CRRPETDYRLQQLHA N++A+ +  K+  D      QR+ 
Sbjct: 241 RFLVRYDHDLSEEIVPLRKVCRRPETDYRLQQLHAANDSATTDQQKTNTDPSTAFFQRVT 300

Query: 301 F---ETSQNPLSKDAALVIPNANPHINAHAQTSTQEARNTET-NTA----PTTF--NSAN 353
               ET Q   + D A     +N +I+   +T   E ++ ET NTA    P+    N+AN
Sbjct: 301 LSPAETMQRQRNADVATTGAVSNANISLPIKTIIPEPKSVETSNTANVGTPSVLPNNTAN 360

BLAST of Cucsa.032420 vs. TAIR10
Match: AT3G18380.2 (AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding)

HSP 1 Score: 382.1 bits (980), Expect = 3.9e-106
Identity = 208/354 (58.76%), Postives = 248/354 (70.06%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALADKFSES ERKGK+ VQ 
Sbjct: 1   MGRPPSNGGPAFRFILPEVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQF 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVRNVPQTVVVP----------- 120
           KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R+V Q + VP           
Sbjct: 61  KQIWNWFQNRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPG 120

Query: 121 -APAPVGSA-----KGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSG 180
             PAP GS      +   +N   EFEAKS RDGAWYDV  FL+HR++E GDPEV VRF+G
Sbjct: 121 MTPAPSGSLVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAG 180

Query: 181 FGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRR 240
           F  EEDEW+N+++++R RSLPCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRR
Sbjct: 181 FEVEEDEWINVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRR 240

Query: 241 HDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGM 300
           HDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A S +     +
Sbjct: 241 HDVRGCRCRFLVRYSHDQSEQEIVPLRKICRRPETDYRLQQLHNAVNDLANSNQHQIPAL 300

Query: 301 DSVLLSGQRINFETSQNPLSKDAA---LVIPNA-NPHINAHAQTSTQEARNTET 330
           D+            ++ PLS   A   +V P + +P ++A   T  Q + N  T
Sbjct: 301 DA-----------AAKTPLSLPGATVPIVAPESKDPSLSATPATLVQPSSNAAT 343

BLAST of Cucsa.032420 vs. TAIR10
Match: AT1G15215.2 (AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors;sequence-specific DNA binding (TAIR:AT3G18380.1))

HSP 1 Score: 185.3 bits (469), Expect = 7.0e-47
Identity = 106/248 (42.74%), Postives = 144/248 (58.06%), Query Frame = 1

Query: 14  FTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYA 73
           FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ +   
Sbjct: 14  FTLSEIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKL-- 73

Query: 74  IRAKTSKAPGKLAVSPVVQIE-----STPVRNVPQTVVVPAPAPVGSAKGAPENPLS-EF 133
                S+   K   SP +QI      S+   N      V     V + KG   +     F
Sbjct: 74  --KHQSQPKSKTLPSPPLQIHDLSNPSSYASNASNATFVGNSTFVQTRKGKASDLADLAF 133

Query: 134 EAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESS 193
           EAKS RD AWYDV++FL++R + +G+ EV VRFSGF +  DEWVN++ ++R RS+P E S
Sbjct: 134 EAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEPS 193

Query: 194 ECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQL 253
           EC  V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L
Sbjct: 194 ECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTEESLGL 253

Query: 254 RKICRRPE 256
            +ICRRPE
Sbjct: 254 ERICRRPE 257

BLAST of Cucsa.032420 vs. NCBI nr
Match: gi|778680368|ref|XP_011651298.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 741.5 bits (1913), Expect = 7.2e-211
Identity = 371/371 (100.00%), Postives = 371/371 (100.00%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120
           KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP
Sbjct: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300
           SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS 360
           DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS 360

Query: 361 ADNVSDGKLLS 372
           ADNVSDGKLLS
Sbjct: 361 ADNVSDGKLLS 371

BLAST of Cucsa.032420 vs. NCBI nr
Match: gi|659111991|ref|XP_008456010.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo])

HSP 1 Score: 723.0 bits (1865), Expect = 2.6e-205
Identity = 365/382 (95.55%), Postives = 366/382 (95.81%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120
           KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAP PVG+AK AP
Sbjct: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAP 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300
           SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFET QNPLSK
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK 300

Query: 301 DAALVIPNANPHINAHAQTSTQEARNTETNT-----------APTTFNSANLAGSSAFSS 360
           DAALVIPNANPHINAHAQTSTQEARNTETNT           APTTFNSANLAGSSAFSS
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSS 360

Query: 361 GIVTNTVSAGSADNVSDGKLLS 372
           GIVTNTVS GSADNVSDGKLLS
Sbjct: 361 GIVTNTVSGGSADNVSDGKLLS 382

BLAST of Cucsa.032420 vs. NCBI nr
Match: gi|700202508|gb|KGN57641.1| (hypothetical protein Csa_3G236580 [Cucumis sativus])

HSP 1 Score: 718.4 bits (1853), Expect = 6.5e-204
Identity = 365/371 (98.38%), Postives = 365/371 (98.38%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120
           K      QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP
Sbjct: 61  K------QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300
           SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS 360
           DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGS 360

Query: 361 ADNVSDGKLLS 372
           ADNVSDGKLLS
Sbjct: 361 ADNVSDGKLLS 365

BLAST of Cucsa.032420 vs. NCBI nr
Match: gi|778680371|ref|XP_011651299.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 698.7 bits (1802), Expect = 5.3e-198
Identity = 350/350 (100.00%), Postives = 350/350 (100.00%), Query Frame = 1

Query: 22  MEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA 81
           MEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA
Sbjct: 1   MEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA 60

Query: 82  PGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVA 141
           PGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVA
Sbjct: 61  PGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVA 120

Query: 142 TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF 201
           TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF
Sbjct: 121 TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF 180

Query: 202 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 261
           QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ
Sbjct: 181 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 240

Query: 262 QLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSKDAALVIPNANPHINAHAQTST 321
           QLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSKDAALVIPNANPHINAHAQTST
Sbjct: 241 QLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSKDAALVIPNANPHINAHAQTST 300

Query: 322 QEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGSADNVSDGKLLS 372
           QEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGSADNVSDGKLLS
Sbjct: 301 QEARNTETNTAPTTFNSANLAGSSAFSSGIVTNTVSAGSADNVSDGKLLS 350

BLAST of Cucsa.032420 vs. NCBI nr
Match: gi|659111993|ref|XP_008456011.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis melo])

HSP 1 Score: 680.2 bits (1754), Expect = 2.0e-192
Identity = 344/361 (95.29%), Postives = 345/361 (95.57%), Query Frame = 1

Query: 22  MEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA 81
           ME ILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA
Sbjct: 1   METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA 60

Query: 82  PGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVA 141
           PGKLAVSPVVQIESTPVRNVPQTVVVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVA
Sbjct: 61  PGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVA 120

Query: 142 TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF 201
           TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF
Sbjct: 121 TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF 180

Query: 202 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 261
           QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ
Sbjct: 181 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 240

Query: 262 QLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSKDAALVIPNANPHINAHAQTST 321
           QLHAVNEAASIEPSKSGMDSVLLSGQRINFET QNPLSKDAALVIPNANPHINAHAQTST
Sbjct: 241 QLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST 300

Query: 322 QEARNTETNT-----------APTTFNSANLAGSSAFSSGIVTNTVSAGSADNVSDGKLL 372
           QEARNTETNT           APTTFNSANLAGSSAFSSGIVTNTVS GSADNVSDGKLL
Sbjct: 301 QEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLL 360

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
SHH2_ARATH2.8e-10658.92Protein SAWADEE HOMEODOMAIN HOMOLOG 2 OS=Arabidopsis thaliana GN=SHH2 PE=2 SV=1[more]
SHH1_ARATH1.2e-4542.74Protein SAWADEE HOMEODOMAIN HOMOLOG 1 OS=Arabidopsis thaliana GN=SHH1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LC67_CUCSA4.5e-20498.38Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236580 PE=4 SV=1[more]
M5WG16_PRUPE3.7e-13768.82Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007389mg PE=4 SV=1[more]
W9RI10_9ROSA1.0e-13468.01Uncharacterized protein OS=Morus notabilis GN=L484_012170 PE=4 SV=1[more]
G7LDQ2_MEDTR1.4e-12863.83Sequence-specific DNA-binding transcription factor OS=Medicago truncatula GN=MTR... [more]
A0A067KMR4_JATCU1.0e-12664.25Uncharacterized protein OS=Jatropha curcas GN=JCGZ_09823 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G18380.23.9e-10658.76 sequence-specific DNA binding transcription factors;sequence-specifi... [more]
AT1G15215.27.0e-4742.74 BEST Arabidopsis thaliana protein match is: sequence-specific DNA bi... [more]
Match NameE-valueIdentityDescription
gi|778680368|ref|XP_011651298.1|7.2e-211100.00PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis sativus][more]
gi|659111991|ref|XP_008456010.1|2.6e-20595.55PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo][more]
gi|700202508|gb|KGN57641.1|6.5e-20498.38hypothetical protein Csa_3G236580 [Cucumis sativus][more]
gi|778680371|ref|XP_011651299.1|5.3e-198100.00PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis sativus][more]
gi|659111993|ref|XP_008456011.1|2.0e-19295.29PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001356Homeobox_dom
IPR009057Homeobox-like_sf
IPR001356Homeobox_dom
IPR009057Homeobox-like_sf
Vocabulary: Molecular Function
TermDefinition
GO:0003682chromatin binding
GO:0003677DNA binding
GO:0003682chromatin binding
GO:0003677DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0000785 chromatin
cellular_component GO:0005634 nucleus
molecular_function GO:0003682 chromatin binding
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.032420.1Cucsa.032420.1mRNA
Cucsa.032420.2Cucsa.032420.2mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001356Homeobox domainPROFILEPS50071HOMEOBOX_2coord: 1..56
score: 9
IPR009057Homeodomain-likeGENE3DG3DSA:1.10.10.60coord: 1..59
score: 1.
IPR009057Homeodomain-likeunknownSSF46689Homeodomain-likecoord: 9..56
score: 9.4
NoneNo IPR availablePANTHERPTHR33827FAMILY NOT NAMEDcoord: 1..325
score: 7.6E
NoneNo IPR availablePANTHERPTHR33827:SF3PROTEIN SAWADEE HOMEODOMAIN HOMOLOG 2coord: 1..325
score: 7.6E