CsaV3_1G032090 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_1G032090
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Homeobox-leucine zipper protein family
Location: chr1 : 19087128 .. 19089193 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATTAAATTCTCTCTCTCTCTTTTTTTTTAAATTTTTTTTTTTATAAAAAAGTCATTAATTACTCTTTTTTTTTTAAACTCACAAACCTCAATAATTAACTGATTATTATTCCAACTTGAAACCTCTCTCATTCTCTCCCTCTTTACCAATTTTTCTTCTGTCTTAAAATCCCCCACTTTTCCCCTTTCTCTTTGTCTCCACAAGATTACTGCCATGGAAGATCATCATGAAGATGAAATTTGCAATATCAGTTGGCTTAGCCTTGGTTTGGGATTTGGTGATCAATATGTCCCAAAGAAGATTCAAAAAAATCAACAGCAACAACAACAACTCTCTTTCACTCTCATTCCTAAAGAAGAATTGGAAATTACCAACAACAACAATATGGAAATTGATGATGATGAAGCTAATTCAAGTGAAGAAGATGATGATCATCATTTAATGAAGAGGATTAGAAGCAGTAATAATATTGTTAATTACGATCATCATCGTCAAGATTCTTCCTTTGGATCAATTAGAAGATTATCATCAGATCATTACATCAACAACAGCGATATTGTTAATACTACTAATCATAATTATAAAGGGATTAGTAGCAGTGGATCAGAATTAAGGGAGAGGAAAAAACTTAGGCTTTCTAAAGAACAATCCACTTTGTTGGAAGAAAGCTTCAAACTTCACACCACATTGAATCCGGTATGCTTCTTTTTCTTTTTTTTTTTTTTTTTGTCAATACATATTTTTTTTCCATTTTTATCAAATACCCATCAATTTTTTTTTGTTAATTTTTTGAATACCCATTTCACTTTTTTTCACTCTTTTCCATTTTTATCAAATACCCATCAATTTATTTTATGAAAAAAATCGATCTTCTTTTGTTTTTTTAATACCCATTTCACTTTTTTTCATTCTTTCCATTTTTTTTATCAAATACCCATCAATTTATTTTATGGAAAAAAATTCAATACCCATTTCACCTTTTTTTTTTTTTCATTCTTTCTACTTTTATCGAATAACCATAATTTATTTTTTCCGTTTTCTAATGAAAAAAACTGAATCTTTTATTGTTATTATTATTTTTTTCTGCATGGATTTGCAGGCTCAGAAACAGGCACTTGCCCAACAATTAAACCTCAAAACTCGACAAGTGGAAGTTTGGTTTCAAAACAGACGTGCAAGGTATTTTTACAATTTATTTTTTCTTAACATTATATCTATCTCCCTTCTTAACATCAGTTTCTAGTTTTCTGGTTTTCAAAAATTAAGTTATCATTATTCAGTTCGTTCCTTTCATTTAAGAATAATGTCAATAGTGAAATTAAAGTTTTCAAAAACTTTTTCGTTACCATTTTATTTATAATTTTTGAAATTTAGTAACTGTTTTCACCCAATTTTCTTAAAATGATTTCATTAAAATAACCTGAACATACTGGTTTTTCATGAACAGGACGAAACTGAAACAAACAGAAGTGGACTGTGAGTTCTTGAAGAAATGCTGTGAAAGATTAAACGAAGAAAATCGAAGGTTGAAGAAAGAATTAAACGAATTAAGATCGCTTAAACTTGGAGCTTCACAATTGTATATTCAGCTGCCTAAGGCCGCGACGCTCACTATTTGCCCTTCATGCGACAAAATTACGAGGACGCCCGCCGTCGACGCCAATTCTCCGCCACAATAATTTAATATAATGCTCTCTTTTCTTTTTTCTTTTTCCGGCTGTGTGTTCTTAGTACAGCAAAGAAAGAATAATTCAATTCAAATAATAACATACAGCCTTTTTTAGATTATCATTTATGAATCATCTCAAAATTTTATATTCTTGAAAAAAATCAGACCCCAAAAAATGATTTTCTCTCTCTCTCTCCTTGAATTGCAAATTTGATCTCAAAAGGAATTTGCAAAATACTGCAGTTTATTTTAAATATTTTATAAAGATTTTGGATTATATCTTTCTATATTTAGAAATATCTCTAAATTTAATTACGAGGTTTTATTGATT
CCTAAGGTTCAAATGTAAACTCCAAAAATGATAAGAGTTCTAAACATTATAACAATTAAATTCTAG

mRNA sequence

ATGGAAGATCATCATGAAGATGAAATTTGCAATATCAGTTGGCTTAGCCTTGGTTTGGGATTTGGTGATCAATATGTCCCAAAGAAGATTCAAAAAAATCAACAGCAACAACAACAACTCTCTTTCACTCTCATTCCTAAAGAAGAATTGGAAATTACCAACAACAACAATATGGAAATTGATGATGATGAAGCTAATTCAAGTGAAGAAGATGATGATCATCATTTAATGAAGAGGATTAGAAGCAGTAATAATATTGTTAATTACGATCATCATCGTCAAGATTCTTCCTTTGGATCAATTAGAAGATTATCATCAGATCATTACATCAACAACAGCGATATTGTTAATACTACTAATCATAATTATAAAGGGATTAGTAGCAGTGGATCAGAATTAAGGGAGAGGAAAAAACTTAGGCTTTCTAAAGAACAATCCACTTTGTTGGAAGAAAGCTTCAAACTTCACACCACATTGAATCCGGCTCAGAAACAGGCACTTGCCCAACAATTAAACCTCAAAACTCGACAAGTGGAAGTTTGGTTTCAAAACAGACGTGCAAGGACGAAACTGAAACAAACAGAAGTGGACTGTGAGTTCTTGAAGAAATGCTGTGAAAGATTAAACGAAGAAAATCGAAGGTTGAAGAAAGAATTAAACGAATTAAGATCGCTTAAACTTGGAGCTTCACAATTGTATATTCAGCTGCCTAAGGCCGCGACGCTCACTATTTGCCCTTCATGCGACAAAATTACGAGGACGCCCGCCGTCGACGCCAATTCTCCGCCACAATAA

Coding sequence (CDS)

ATGGAAGATCATCATGAAGATGAAATTTGCAATATCAGTTGGCTTAGCCTTGGTTTGGGATTTGGTGATCAATATGTCCCAAAGAAGATTCAAAAAAATCAACAGCAACAACAACAACTCTCTTTCACTCTCATTCCTAAAGAAGAATTGGAAATTACCAACAACAACAATATGGAAATTGATGATGATGAAGCTAATTCAAGTGAAGAAGATGATGATCATCATTTAATGAAGAGGATTAGAAGCAGTAATAATATTGTTAATTACGATCATCATCGTCAAGATTCTTCCTTTGGATCAATTAGAAGATTATCATCAGATCATTACATCAACAACAGCGATATTGTTAATACTACTAATCATAATTATAAAGGGATTAGTAGCAGTGGATCAGAATTAAGGGAGAGGAAAAAACTTAGGCTTTCTAAAGAACAATCCACTTTGTTGGAAGAAAGCTTCAAACTTCACACCACATTGAATCCGGCTCAGAAACAGGCACTTGCCCAACAATTAAACCTCAAAACTCGACAAGTGGAAGTTTGGTTTCAAAACAGACGTGCAAGGACGAAACTGAAACAAACAGAAGTGGACTGTGAGTTCTTGAAGAAATGCTGTGAAAGATTAAACGAAGAAAATCGAAGGTTGAAGAAAGAATTAAACGAATTAAGATCGCTTAAACTTGGAGCTTCACAATTGTATATTCAGCTGCCTAAGGCCGCGACGCTCACTATTTGCCCTTCATGCGACAAAATTACGAGGACGCCCGCCGTCGACGCCAATTCTCCGCCACAATAA

Protein sequence

MEDHHEDEICNISWLSLGLGFGDQYVPKKIQKNQQQQQQLSFTLIPKEELEITNNNNMEIDDDEANSSEEDDDHHLMKRIRSSNNIVNYDHHRQDSSFGSIRRLSSDHYINNSDIVNTTNHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKITRTPAVDANSPPQ
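
The protein sequence above should follow from the CDS under the standard genetic code. The following is a minimal sketch of that check, not the pipeline used by the database. The function name `translate` and the two short CDS fragments (the first 27 and last 12 nucleotides of the CDS listed above) are chosen here only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Illustrative fragments copied from the CDS above (start and end of the record).
cds_start = "ATGGAAGATCATCATGAAGATGAAATT"
cds_end = "CCGCCACAATAA"

print(translate(cds_start))  # first residues of the protein
print(translate(cds_end))    # last residues, stop codon dropped
```

Passing the full CDS string to `translate` in the same way would allow a residue-by-residue comparison with the listed protein.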
BLAST of CsaV3_1G032090 vs. NCBI nr
Match: XP_004150745.1 (PREDICTED: homeobox-leucine zipper protein HOX18-like [Cucumis sativus] >KGN65506.1 hypothetical protein Csa_1G433050 [Cucumis sativus])

HSP 1 Score: 349.7 bits (896), Expect = 8.4e-93
Identity = 264/264 (100.00%), Positives = 264/264 (100.00%), Query Frame = 0

Query: 1   MEDHHEDEICNISWLSLGLGFGDQYVPKKIQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MEDHHEDEICNISWLSLGLGFGDQYVPKKIQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MEDHHEDEICNISWLSLGLGFGDQYVPKKIQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSSDHYINNSDIVNTTN 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSSDHYINNSDIVNTTN
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSSDHYINNSDIVNTTN 120

Query: 121 HNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEV 180
           HNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEV
Sbjct: 121 HNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEV 180

Query: 181 WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAA 240
           WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAA
Sbjct: 181 WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAA 240

Query: 241 TLTICPSCDKITRTPAVDANSPPQ 265
           TLTICPSCDKITRTPAVDANSPPQ
Sbjct: 241 TLTICPSCDKITRTPAVDANSPPQ 264

BLAST of CsaV3_1G032090 vs. NCBI nr
Match: XP_008453538.1 (PREDICTED: homeobox-leucine zipper protein HOX18-like [Cucumis melo])

HSP 1 Score: 304.7 bits (779), Expect = 3.1e-79
Identity = 234/271 (86.35%), Positives = 236/271 (87.08%), Query Frame = 0

Query: 1   MEDHHEDEICNISWLSLGLGFGDQYVPKKIQ-XXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MEDHHEDEICNISWLSLGLGFGDQYVPKKIQ                XXXXXXXXXXXXX
Sbjct: 1   MEDHHEDEICNISWLSLGLGFGDQYVPKKIQKNQHQQQQVSFTLIPKXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSSDHYINNSDIVNTT 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX             N+T
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNST 120

Query: 121 NHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVE 180
           NHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVE
Sbjct: 121 NHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVE 180

Query: 181 VWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKA 240
           VWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKA
Sbjct: 181 VWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKA 240

Query: 241 ATLTICPSCDKITRTP------AVDANSPPQ 265
           ATLTICPSCD+ITRTP      AVDANSPPQ
Sbjct: 241 ATLTICPSCDQITRTPAAVTAAAVDANSPPQ 271

BLAST of CsaV3_1G032090 vs. NCBI nr
Match: XP_022931479.1 (homeobox-leucine zipper protein HAT3-like [Cucurbita moschata])

HSP 1 Score: 234.6 bits (597), Expect = 4.0e-58
Identity = 128/142 (90.14%), Positives = 133/142 (93.66%), Query Frame = 0

Query: 127 SSSGSEL-RERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNR 186
           +S GSE  RERKKLRLSKEQ+TLLEESFKLHTTLNPAQKQALA QLNLKTRQVEVWFQNR
Sbjct: 63  NSLGSERERERKKLRLSKEQATLLEESFKLHTTLNPAQKQALAHQLNLKTRQVEVWFQNR 122

Query: 187 RARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTIC 246
           RARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL+ELRS+KLGASQLYIQLPKAATLTIC
Sbjct: 123 RARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAATLTIC 182

Query: 247 PSCDKITRTPAVDA----NSPP 264
           PSCDKITRTPA +A    NSPP
Sbjct: 183 PSCDKITRTPAANAAAEPNSPP 204

BLAST of CsaV3_1G032090 vs. NCBI nr
Match: XP_023531587.1 (homeobox-leucine zipper protein HAT3-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 234.6 bits (597), Expect = 4.0e-58
Identity = 128/142 (90.14%), Positives = 133/142 (93.66%), Query Frame = 0

Query: 127 SSSGSEL-RERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNR 186
           +S GSE  RERKKLRLSKEQ+TLLEESFKLHTTLNPAQKQALA QLNLKTRQVEVWFQNR
Sbjct: 63  NSLGSERERERKKLRLSKEQATLLEESFKLHTTLNPAQKQALAHQLNLKTRQVEVWFQNR 122

Query: 187 RARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTIC 246
           RARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL+ELRS+KLGASQLYIQLPKAATLTIC
Sbjct: 123 RARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAATLTIC 182

Query: 247 PSCDKITRTPAVDA----NSPP 264
           PSCDKITRTPA +A    NSPP
Sbjct: 183 PSCDKITRTPAANAAAEPNSPP 204

BLAST of CsaV3_1G032090 vs. NCBI nr
Match: XP_022134791.1 (homeobox-leucine zipper protein HAT22-like [Momordica charantia])

HSP 1 Score: 222.6 bits (566), Expect = 1.6e-54
Identity = 116/128 (90.62%), Positives = 121/128 (94.53%), Query Frame = 0

Query: 135 ERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQT 194
           ERKKLRLSKEQS LLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQT
Sbjct: 89  ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQT 148

Query: 195 EVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKITRT 254
           EVDCEFLKKCCERLNEENRRLKKE+ ELRSLK+GASQLYIQLPKAATLTICPSC+K+TR 
Sbjct: 149 EVDCEFLKKCCERLNEENRRLKKEVQELRSLKIGASQLYIQLPKAATLTICPSCNKLTRN 208

Query: 255 PAVDANSP 263
            A  A +P
Sbjct: 209 AAATAAAP 216

BLAST of CsaV3_1G032090 vs. TAIR10
Match: AT4G37790.1 (Homeobox-leucine zipper protein family)

HSP 1 Score: 164.5 bits (415), Expect = 9.1e-41
Identity = 85/116 (73.28%), Positives = 104/116 (89.66%), Query Frame = 0

Query: 136 RKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTE 195
           RKKLRL+K+QS LLE++FKLH+TLNP QKQALA+QLNL+ RQVEVWFQNRRARTKLKQTE
Sbjct: 125 RKKLRLTKQQSALLEDNFKLHSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE 184

Query: 196 VDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKI 252
           VDCEFLKKCCE L +ENRRL+KEL +L++LKL +   Y+ +P AATLT+CPSC+++
Sbjct: 185 VDCEFLKKCCETLTDENRRLQKELQDLKALKL-SQPFYMHMP-AATLTMCPSCERL 238

BLAST of CsaV3_1G032090 vs. TAIR10
Match: AT4G16780.1 (homeobox protein 2)

HSP 1 Score: 162.5 bits (410), Expect = 3.5e-40
Identity = 87/139 (62.59%), Positives = 107/139 (76.98%), Query Frame = 0

Query: 117 NTTNHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTR 176
           +T     +GIS        RKKLRLSK+QS +LEE+FK H+TLNP QKQALA+QL L+ R
Sbjct: 109 DTDPQGSRGISDDEDGDNSRKKLRLSKDQSAILEETFKDHSTLNPKQKQALAKQLGLRAR 168

Query: 177 QVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQL 236
           QVEVWFQNRRARTKLKQTEVDCEFL++CCE L EENRRL+KE+ ELR+LKL + Q Y+ +
Sbjct: 169 QVEVWFQNRRARTKLKQTEVDCEFLRRCCENLTEENRRLQKEVTELRALKL-SPQFYMHM 228

Query: 237 PKAATLTICPSCDKITRTP 256
               TLT+CPSC+ ++  P
Sbjct: 229 SPPTTLTMCPSCEHVSVPP 246

BLAST of CsaV3_1G032090 vs. TAIR10
Match: AT3G60390.1 (homeobox-leucine zipper protein 3)

HSP 1 Score: 161.4 bits (407), Expect = 7.7e-40
Identity = 82/125 (65.60%), Positives = 105/125 (84.00%), Query Frame = 0

Query: 139 LRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDC 198
           LRLSKEQ+ +LEE+FK H+TLNP QK ALA+QLNL+TRQVEVWFQNRRARTKLKQTEVDC
Sbjct: 164 LRLSKEQALVLEETFKEHSTLNPKQKMALAKQLNLRTRQVEVWFQNRRARTKLKQTEVDC 223

Query: 199 EFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKITRTPAVD 258
           E+LK+CCE L +ENRRL+KE++ELR+LKL +  LY+ +    TLT+CPSC+++  T +  
Sbjct: 224 EYLKRCCENLTDENRRLQKEVSELRALKL-SPHLYMHMKPPTTLTMCPSCERVAVTSSSS 283

Query: 259 ANSPP 264
           + +PP
Sbjct: 284 SVAPP 287

BLAST of CsaV3_1G032090 vs. TAIR10
Match: AT5G06710.1 (homeobox from Arabidopsis thaliana)

HSP 1 Score: 161.4 bits (407), Expect = 7.7e-40
Identity = 86/126 (68.25%), Positives = 105/126 (83.33%), Query Frame = 0

Query: 136 RKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTE 195
           RKKLRLSK+QS  LE+SFK H+TLNP QK ALA+QLNL+ RQVEVWFQNRRARTKLKQTE
Sbjct: 189 RKKLRLSKDQSAFLEDSFKEHSTLNPKQKIALAKQLNLRPRQVEVWFQNRRARTKLKQTE 248

Query: 196 VDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKITRTP 255
           VDCE+LK+CCE L EENRRL+KE+ ELR+LK  ++  Y+QLP A TLT+CPSC+++  + 
Sbjct: 249 VDCEYLKRCCESLTEENRRLQKEVKELRTLKT-STPFYMQLP-ATTLTMCPSCERVATSA 308

Query: 256 AVDANS 262
           A  + S
Sbjct: 309 AQPSTS 312

BLAST of CsaV3_1G032090 vs. TAIR10
Match: AT2G22800.1 (Homeobox-leucine zipper protein family)

HSP 1 Score: 156.0 bits (393), Expect = 3.2e-38
Identity = 87/137 (63.50%), Positives = 109/137 (79.56%), Query Frame = 0

Query: 115 IVNTTNHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLK 174
           +++  + + +GIS+       RKKLRL+K+QS LLEESFK H+TLNP QKQ LA+QLNL+
Sbjct: 98  VISDYHEDEEGISA-------RKKLRLTKQQSALLEESFKDHSTLNPKQKQVLARQLNLR 157

Query: 175 TRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYI 234
            RQVEVWFQNRRARTKLKQTEVDCEFLKKCCE L +EN RL+KE+ EL++LKL     Y+
Sbjct: 158 PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLADENIRLQKEIQELKTLKL-TQPFYM 217

Query: 235 QLPKAATLTICPSCDKI 252
            +P A+TLT CPSC++I
Sbjct: 218 HMP-ASTLTKCPSCERI 225

BLAST of CsaV3_1G032090 vs. Swiss-Prot
Match: sp|P46604|HAT22_ARATH (Homeobox-leucine zipper protein HAT22 OS=Arabidopsis thaliana OX=3702 GN=HAT22 PE=1 SV=1)

HSP 1 Score: 164.5 bits (415), Expect = 1.6e-39
Identity = 85/116 (73.28%), Positives = 104/116 (89.66%), Query Frame = 0

Query: 136 RKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTE 195
           RKKLRL+K+QS LLE++FKLH+TLNP QKQALA+QLNL+ RQVEVWFQNRRARTKLKQTE
Sbjct: 125 RKKLRLTKQQSALLEDNFKLHSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE 184

Query: 196 VDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKI 252
           VDCEFLKKCCE L +ENRRL+KEL +L++LKL +   Y+ +P AATLT+CPSC+++
Sbjct: 185 VDCEFLKKCCETLTDENRRLQKELQDLKALKL-SQPFYMHMP-AATLTMCPSCERL 238

BLAST of CsaV3_1G032090 vs. Swiss-Prot
Match: sp|Q05466|HAT4_ARATH (Homeobox-leucine zipper protein HAT4 OS=Arabidopsis thaliana OX=3702 GN=HAT4 PE=1 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 6.2e-39
Identity = 87/139 (62.59%), Positives = 107/139 (76.98%), Query Frame = 0

Query: 117 NTTNHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTR 176
           +T     +GIS        RKKLRLSK+QS +LEE+FK H+TLNP QKQALA+QL L+ R
Sbjct: 109 DTDPQGSRGISDDEDGDNSRKKLRLSKDQSAILEETFKDHSTLNPKQKQALAKQLGLRAR 168

Query: 177 QVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQL 236
           QVEVWFQNRRARTKLKQTEVDCEFL++CCE L EENRRL+KE+ ELR+LKL + Q Y+ +
Sbjct: 169 QVEVWFQNRRARTKLKQTEVDCEFLRRCCENLTEENRRLQKEVTELRALKL-SPQFYMHM 228

Query: 237 PKAATLTICPSCDKITRTP 256
               TLT+CPSC+ ++  P
Sbjct: 229 SPPTTLTMCPSCEHVSVPP 246

BLAST of CsaV3_1G032090 vs. Swiss-Prot
Match: sp|P46665|HAT14_ARATH (Homeobox-leucine zipper protein HAT14 OS=Arabidopsis thaliana OX=3702 GN=HAT14 PE=2 SV=3)

HSP 1 Score: 161.4 bits (407), Expect = 1.4e-38
Identity = 86/126 (68.25%), Positives = 105/126 (83.33%), Query Frame = 0

Query: 136 RKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTE 195
           RKKLRLSK+QS  LE+SFK H+TLNP QK ALA+QLNL+ RQVEVWFQNRRARTKLKQTE
Sbjct: 189 RKKLRLSKDQSAFLEDSFKEHSTLNPKQKIALAKQLNLRPRQVEVWFQNRRARTKLKQTE 248

Query: 196 VDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKITRTP 255
           VDCE+LK+CCE L EENRRL+KE+ ELR+LK  ++  Y+QLP A TLT+CPSC+++  + 
Sbjct: 249 VDCEYLKRCCESLTEENRRLQKEVKELRTLKT-STPFYMQLP-ATTLTMCPSCERVATSA 308

Query: 256 AVDANS 262
           A  + S
Sbjct: 309 AQPSTS 312

BLAST of CsaV3_1G032090 vs. Swiss-Prot
Match: sp|P46602|HAT3_ARATH (Homeobox-leucine zipper protein HAT3 OS=Arabidopsis thaliana OX=3702 GN=HAT3 PE=1 SV=2)

HSP 1 Score: 161.4 bits (407), Expect = 1.4e-38
Identity = 82/125 (65.60%), Positives = 105/125 (84.00%), Query Frame = 0

Query: 139 LRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDC 198
           LRLSKEQ+ +LEE+FK H+TLNP QK ALA+QLNL+TRQVEVWFQNRRARTKLKQTEVDC
Sbjct: 164 LRLSKEQALVLEETFKEHSTLNPKQKMALAKQLNLRTRQVEVWFQNRRARTKLKQTEVDC 223

Query: 199 EFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKITRTPAVD 258
           E+LK+CCE L +ENRRL+KE++ELR+LKL +  LY+ +    TLT+CPSC+++  T +  
Sbjct: 224 EYLKRCCENLTDENRRLQKEVSELRALKL-SPHLYMHMKPPTTLTMCPSCERVAVTSSSS 283

Query: 259 ANSPP 264
           + +PP
Sbjct: 284 SVAPP 287

BLAST of CsaV3_1G032090 vs. Swiss-Prot
Match: sp|Q01I23|HOX17_ORYSI (Homeobox-leucine zipper protein HOX17 OS=Oryza sativa subsp. indica OX=39946 GN=HOX17 PE=2 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 1.4e-38
Identity = 81/117 (69.23%), Positives = 98/117 (83.76%), Query Frame = 0

Query: 136 RKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTE 195
           RKKLRLSK+QS +LE+SF+ H TLNP QK  LAQQL L+ RQVEVWFQNRRARTKLKQTE
Sbjct: 81  RKKLRLSKDQSAVLEDSFREHPTLNPRQKATLAQQLGLRPRQVEVWFQNRRARTKLKQTE 140

Query: 196 VDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKIT 253
           VDCEFLK+CCE L EENRRL+KE+ ELR+LKL +  LY+ +    TLT+CPSC++++
Sbjct: 141 VDCEFLKRCCETLTEENRRLQKEVQELRALKLVSPHLYMNMSPPTTLTMCPSCERVS 197

BLAST of CsaV3_1G032090 vs. TrEMBL
Match: tr|A0A0A0M012|A0A0A0M012_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G433050 PE=4 SV=1)

HSP 1 Score: 349.7 bits (896), Expect = 5.6e-93
Identity = 264/264 (100.00%), Positives = 264/264 (100.00%), Query Frame = 0

Query: 1   MEDHHEDEICNISWLSLGLGFGDQYVPKKIQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MEDHHEDEICNISWLSLGLGFGDQYVPKKIQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MEDHHEDEICNISWLSLGLGFGDQYVPKKIQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSSDHYINNSDIVNTTN 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSSDHYINNSDIVNTTN
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSSDHYINNSDIVNTTN 120

Query: 121 HNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEV 180
           HNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEV
Sbjct: 121 HNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEV 180

Query: 181 WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAA 240
           WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAA
Sbjct: 181 WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAA 240

Query: 241 TLTICPSCDKITRTPAVDANSPPQ 265
           TLTICPSCDKITRTPAVDANSPPQ
Sbjct: 241 TLTICPSCDKITRTPAVDANSPPQ 264

BLAST of CsaV3_1G032090 vs. TrEMBL
Match: tr|A0A1S3BWH2|A0A1S3BWH2_CUCME (homeobox-leucine zipper protein HOX18-like OS=Cucumis melo OX=3656 GN=LOC103494220 PE=4 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 2.1e-79
Identity = 234/271 (86.35%), Positives = 236/271 (87.08%), Query Frame = 0

Query: 1   MEDHHEDEICNISWLSLGLGFGDQYVPKKIQ-XXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MEDHHEDEICNISWLSLGLGFGDQYVPKKIQ                XXXXXXXXXXXXX
Sbjct: 1   MEDHHEDEICNISWLSLGLGFGDQYVPKKIQKNQHQQQQVSFTLIPKXXXXXXXXXXXXX 60

Query: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSSDHYINNSDIVNTT 120
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX             N+T
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNST 120

Query: 121 NHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVE 180
           NHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVE
Sbjct: 121 NHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVE 180

Query: 181 VWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKA 240
           VWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKA
Sbjct: 181 VWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKA 240

Query: 241 ATLTICPSCDKITRTP------AVDANSPPQ 265
           ATLTICPSCD+ITRTP      AVDANSPPQ
Sbjct: 241 ATLTICPSCDQITRTPAAVTAAAVDANSPPQ 271

BLAST of CsaV3_1G032090 vs. TrEMBL
Match: tr|A0A2P4ILY9|A0A2P4ILY9_QUESU (Homeobox-leucine zipper protein hat14 OS=Quercus suber OX=58331 GN=CFP56_40632 PE=4 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 2.1e-44
Identity = 98/125 (78.40%), Positives = 110/125 (88.00%), Query Frame = 0

Query: 136 RKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTE 195
           RKKLRLSKEQS+LLE+SFK+H+TL PAQKQAL+QQLNLK RQVEVWFQNRRARTKLKQTE
Sbjct: 86  RKKLRLSKEQSSLLEDSFKIHSTLTPAQKQALSQQLNLKPRQVEVWFQNRRARTKLKQTE 145

Query: 196 VDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPSCDKITRTP 255
           VDCEFLKKCCE L+EEN+RLKKEL ELRSLK+G S LYIQ+ KAATLT+C SC+K  +  
Sbjct: 146 VDCEFLKKCCESLSEENKRLKKELQELRSLKVGPSPLYIQIQKAATLTMCTSCEKSIKAN 205

Query: 256 AVDAN 261
            V  N
Sbjct: 206 EVKGN 210

BLAST of CsaV3_1G032090 vs. TrEMBL
Match: tr|A0A2N9EV41|A0A2N9EV41_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS10549 PE=4 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 4.8e-44
Identity = 101/139 (72.66%), Positives = 114/139 (82.01%), Query Frame = 0

Query: 112 NSDIVNTTNHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQL 171
           + +  N  N++      SG     RKKLRL+KEQS+ LE+SFKLH+TL PAQKQ LA+QL
Sbjct: 64  DEEFPNKENNSINNKKESG-----RKKLRLTKEQSSFLEDSFKLHSTLTPAQKQVLAEQL 123

Query: 172 NLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQ 231
           NLK RQVEVWFQNRRARTKLKQTEVDCEFLKKCCE L+EENRRLKKEL ELRSLK+G S 
Sbjct: 124 NLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCESLSEENRRLKKELQELRSLKVGPSP 183

Query: 232 LYIQLPKAATLTICPSCDK 251
           LYIQL KAATLT+CPSC+K
Sbjct: 184 LYIQLQKAATLTMCPSCEK 197

BLAST of CsaV3_1G032090 vs. TrEMBL
Match: tr|K7LC40|K7LC40_SOYBN (Uncharacterized protein OS=Glycine max OX=3847 GN=100790522 PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 1.0e-41
Identity = 93/126 (73.81%), Positives = 108/126 (85.71%), Query Frame = 0

Query: 128 SSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRA 187
           S+ S    RKKL+L+KEQS  LE+ FKLH+TLNPAQKQALA+QLNLK RQVEVWFQNRRA
Sbjct: 82  SNNSNNGSRKKLKLTKEQSATLEDIFKLHSTLNPAQKQALAEQLNLKHRQVEVWFQNRRA 141

Query: 188 RTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAATLTICPS 247
           RTKLKQTEVDCEFLKKCCE+L +EN+RLKKEL ELR+ K+G + LYIQL KA TLTIC S
Sbjct: 142 RTKLKQTEVDCEFLKKCCEKLTDENQRLKKELQELRAQKIGPTPLYIQLSKATTLTICSS 201

Query: 248 CDKITR 254
           C+K+ +
Sbjct: 202 CEKLLK 207
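
The HSP header lines in the BLAST blocks above follow a regular pattern, so the score, E-value and identity counts can be pulled out mechanically. The sketch below assumes the two-line header layout shown in this record; the function name `parse_hsp` and the returned dictionary keys are illustrative, not part of any BLAST tool. The pattern accepts both "Positives" and the misspelling "Postives" that some exports of this page contain.

```python
import re

# Patterns for the two HSP header lines as they appear in this record.
HSP_SCORE = re.compile(r"Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_IDENT = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract numeric fields from one HSP's two header lines."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": int(i.group(1)),
        "aligned": int(i.group(2)),
        "positives": int(i.group(4)),
    }


# Header lines copied from the Cucumis melo hit (XP_008453538.1) above.
hsp = parse_hsp(
    "HSP 1 Score: 304.7 bits (779), Expect = 3.1e-79",
    "Identity = 234/271 (86.35%), Postives = 236/271 (87.08%), Query Frame = 0",
)
percent_identity = round(100 * hsp["identical"] / hsp["aligned"], 2)
print(hsp["evalue"], percent_identity)
```

Recomputing the percentage from the counts, as in the last lines, is a quick way to cross-check the identity column of the summary tables.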

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004150745.1 | 8.4e-93 | 100.00 | PREDICTED: homeobox-leucine zipper protein HOX18-like [Cucumis sativus] >KGN6550...
XP_008453538.1 | 3.1e-79 | 86.35 | PREDICTED: homeobox-leucine zipper protein HOX18-like [Cucumis melo]
XP_022931479.1 | 4.0e-58 | 90.14 | homeobox-leucine zipper protein HAT3-like [Cucurbita moschata]
XP_023531587.1 | 4.0e-58 | 90.14 | homeobox-leucine zipper protein HAT3-like [Cucurbita pepo subsp. pepo]
XP_022134791.1 | 1.6e-54 | 90.63 | homeobox-leucine zipper protein HAT22-like [Momordica charantia]

vs. TAIR10
Match Name | E-value | Identity (%) | Description
AT4G37790.1 | 9.1e-41 | 73.28 | Homeobox-leucine zipper protein family
AT4G16780.1 | 3.5e-40 | 62.59 | homeobox protein 2
AT3G60390.1 | 7.7e-40 | 65.60 | homeobox-leucine zipper protein 3
AT5G06710.1 | 7.7e-40 | 68.25 | homeobox from Arabidopsis thaliana
AT2G22800.1 | 3.2e-38 | 63.50 | Homeobox-leucine zipper protein family

vs. Swiss-Prot
Match Name | E-value | Identity (%) | Description
sp|P46604|HAT22_ARATH | 1.6e-39 | 73.28 | Homeobox-leucine zipper protein HAT22 OS=Arabidopsis thaliana OX=3702 GN=HAT22 P...
sp|Q05466|HAT4_ARATH | 6.2e-39 | 62.59 | Homeobox-leucine zipper protein HAT4 OS=Arabidopsis thaliana OX=3702 GN=HAT4 PE=...
sp|P46665|HAT14_ARATH | 1.4e-38 | 68.25 | Homeobox-leucine zipper protein HAT14 OS=Arabidopsis thaliana OX=3702 GN=HAT14 P...
sp|P46602|HAT3_ARATH | 1.4e-38 | 65.60 | Homeobox-leucine zipper protein HAT3 OS=Arabidopsis thaliana OX=3702 GN=HAT3 PE=...
sp|Q01I23|HOX17_ORYSI | 1.4e-38 | 69.23 | Homeobox-leucine zipper protein HOX17 OS=Oryza sativa subsp. indica OX=39946 GN=...

vs. TrEMBL
Match Name | E-value | Identity (%) | Description
tr|A0A0A0M012|A0A0A0M012_CUCSA | 5.6e-93 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G433050 PE=4 SV=1
tr|A0A1S3BWH2|A0A1S3BWH2_CUCME | 2.1e-79 | 86.35 | homeobox-leucine zipper protein HOX18-like OS=Cucumis melo OX=3656 GN=LOC1034942...
tr|A0A2P4ILY9|A0A2P4ILY9_QUESU | 2.1e-44 | 78.40 | Homeobox-leucine zipper protein hat14 OS=Quercus suber OX=58331 GN=CFP56_40632 P...
tr|A0A2N9EV41|A0A2N9EV41_FAGSY | 4.8e-44 | 72.66 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS10549 PE=4 SV=1
tr|K7LC40|K7LC40_SOYBN | 1.0e-41 | 73.81 | Uncharacterized protein OS=Glycine max OX=3847 GN=100790522 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0043565 | sequence-specific DNA binding
GO:0003700 | transcription factor activity, sequence-specific DNA binding
GO:0003677 | DNA binding
Vocabulary: Biological Process
Term | Definition
GO:0006355 | regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term | Definition
IPR009057 | Homeobox-like_sf
IPR017970 | Homeobox_CS
IPR003106 | Leu_zip_homeo
IPR001356 | Homeobox_dom
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005667 | transcription factor complex
cellular_component | GO:0005634 | nucleus
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0003700 | transcription factor activity, sequence-specific DNA binding
molecular_function | GO:0003677 | DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsaV3_1G032090.1 | CsaV3_1G032090.1 | mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 191..228
None | No IPR available | GENE3D | G3DSA:1.10.10.60 |  | coord: 111..191; e-value: 6.1E-17; score: 63.5
None | No IPR available | PANTHER | PTHR24326 | FAMILY NOT NAMED | coord: 61..261
None | No IPR available | PANTHER | PTHR24326:SF327 | SUBFAMILY NOT NAMED | coord: 61..261
IPR001356 | Homeobox domain | SMART | SM00389 | HOX_1 | coord: 134..196; e-value: 3.8E-15; score: 66.3
IPR001356 | Homeobox domain | PFAM | PF00046 | Homeobox | coord: 136..190; e-value: 1.8E-15; score: 56.4
IPR001356 | Homeobox domain | PROSITE | PS50071 | HOMEOBOX_2 | coord: 132..192; score: 17.2
IPR001356 | Homeobox domain | CDD | cd00086 | homeodomain | coord: 136..193; e-value: 8.74034E-15; score: 65.7276
IPR003106 | Leucine zipper, homeobox-associated | SMART | SM00340 | halz | coord: 192..235; e-value: 8.6E-19; score: 78.4
IPR003106 | Leucine zipper, homeobox-associated | PFAM | PF02183 | HALZ | coord: 192..225; e-value: 8.1E-9; score: 35.4
IPR017970 | Homeobox, conserved site | PROSITE | PS00027 | HOMEOBOX_1 | coord: 167..190
IPR009057 | Homeobox-like domain superfamily | SUPERFAMILY | SSF46689 | Homeodomain-like | coord: 119..193
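
The `coord` values in the InterPro annotations are 1-based, inclusive positions on the protein sequence of this record. As a rough illustration of how they map onto the sequence, the sketch below extracts the Pfam Homeobox match (PF00046, coord 136..190). The helper name `slice_feature` is hypothetical; the protein string is copied from the "Protein sequence" section above.

```python
# Protein sequence of CsaV3_1G032090, copied from this record.
PROTEIN = (
    "MEDHHEDEICNISWLSLGLGFGDQYVPKKIQKNQQQQQQLSFTLIPKEELEITNNNNMEI"
    "DDDEANSSEEDDDHHLMKRIRSSNNIVNYDHHRQDSSFGSIRRLSSDHYINNSDIVNTTN"
    "HNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEV"
    "WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQLPKAA"
    "TLTICPSCDKITRTPAVDANSPPQ"
)


def slice_feature(seq: str, start: int, end: int) -> str:
    """Return the residues covered by a 1-based, inclusive coordinate pair."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    # Convert to Python's 0-based, end-exclusive slicing.
    return seq[start - 1:end]


# Pfam Homeobox match listed above: coord 136..190.
homeobox = slice_feature(PROTEIN, 136, 190)
print(len(homeobox), homeobox)
```

The same call with `192, 225` would return the region reported for the Pfam HALZ (leucine zipper) match.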