CmoCh14G006540 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh14G006540
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: homeobox-leucine zipper protein HAT5-like
Location: Cmo_Chr14: 3312300 .. 3313587 (-)
RNA-Seq Expression: CmoCh14G006540
Synteny: CmoCh14G006540
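The Location field packs the sequence ID, start, end, and strand into one string. The following is a minimal sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation). The names `parse_location`, `Location`, and `reverse_complement` are hypothetical helpers and not part of any database API.

```python
import re
from typing import NamedTuple

# Hypothetical pattern for the "seqid: start .. end (strand)" layout of the Location field.
LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, end, and strand."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def reverse_complement(seq: str) -> str:
    """Reverse-complement an unambiguous DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


loc = parse_location("Cmo_Chr14: 3312300 .. 3313587 (-)")
print(loc.seqid, loc.strand, loc.span)
```

For a minus-strand feature such as this one, a slice taken from the forward reference strand would need `reverse_complement` before it reads in transcript orientation. The sequences on this page begin with ATG, which suggests they are already shown in that orientation.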
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptide, exon, CDS, three_prime_UTR
ATGATGAGTATGACTATGCATGAGAGTCCAAAGGGGAGGCCATTTTTTCGATCACCGGACGATCTTTACGATGATGAATATTACGACGAGTTCTACCCTGAGAAGAAGCGTCGTCTCACCCATGATCAGGTTTGTTATATTTGCCTTTACTGCCCCATTTTGCCTGCTTCTGCTGCTGCACTCTCTGTGGGACAAAAATTTAGAAGTTGCGTAAAATTGTTGAAGGTTCAAATGTTGGAGAAGAGCTTTGATGAAGACAACAAACTGGAGCCAGAGAGGAAGTCCCAACTGGCCAAGAAGTTGGGGCTGCAACCAAGGCAGGTGGCTGTGTGGTTCCAGAATCGTCGTGCTCGATGGAAGACAAAGCAGCTCGAAAGGGACTATGATGTTCTTAAAGCTTCATATGATTTGCTTCTGTCTAACTATGACTCAGTTATCAAAGAGAATGCAGATCTTAAATCTCAGGTGAAGTTATATATACATACACATTCTTAAATTATGGTAGACGAGTAAAATTTTCGCCTTATTCACTACTGGAGTATTGAATCCAGCTCGGTATATAGCTTAGTGAAATCCCGGTCAGTTGTTATCTAGTTCATTTCGATCTAGCATGTCTGAAAGGATAAGAAAGAAAATAGGAAAGTTTATAAAAAACTAAGAACAAGAGTTAAAGACTTAAAACAAATTTCATTATATTACTTTCAGGTGGCTTCCTTAACTGAGAAATGTCTGGCTAAAGAGCTGGATGGAGGAGGAGAAGCACCAATTCCATGTGTGACATCAGAGCCTCTTCTAGCAGACATTGGCCATGTCTCCGCCCCACACTCCAGCAGAAAGGCTGAAGATCGTCTCAGTTCAGGGAGCGATGACAGTACGGTGATCGACGATAATTGTCGACAACTCACTGATTGCTGTGATTCTTACTTCCCCAGCAACGAGTATCTGCAATGTGCACCTCTGCCTAATGGGTTGCAAATGGAACATGATGATAGCAATAACAATAGCAACTACTTGTTCTCAGACATGTTTGCAGTAACAGGCCAACAAAATCAGGAGGGACTGGGAGGGCCTCCTGCTTGGTGGACATGGCCTTAGGACAGTCATTCCACTTGTAACAGCCTGTTGCTTATACAAAATGTTAATGAAGTTCTTGTTTGTAATGAATAAAGTTGAGATATTATAGACATTGTTTGAAGTATTGTATGGCTGTGCTTAAAGAATGGTGCAAAATGTAGTGTCCACAAACTTGAACTGTAAACTTATAAGACAAAGCAAGGATATAACATAC

mRNA sequence

ATGATGAGTATGACTATGCATGAGAGTCCAAAGGGGAGGCCATTTTTTCGATCACCGGACGATCTTTACGATGATGAATATTACGACGAGTTCTACCCTGAGAAGAAGCGTCGTCTCACCCATGATCAGGTTTGTTATATTTGCCTTTACTGCCCCATTTTGCCTGCTTCTGCTGCTGCACTCTCTGTGGGACAAAAATTTAGAAGTTGCGTAAAATTGTTGAAGGTTCAAATGTTGGAGAAGAGCTTTGATGAAGACAACAAACTGGAGCCAGAGAGGAAGTCCCAACTGGCCAAGAAGTTGGGGCTGCAACCAAGGCAGGTGGCTGTGTGGTTCCAGAATCGTCGTGCTCGATGGAAGACAAAGCAGCTCGAAAGGGACTATGATGTTCTTAAAGCTTCATATGATTTGCTTCTGTCTAACTATGACTCAGTTATCAAAGAGAATGCAGATCTTAAATCTCAGGTGGCTTCCTTAACTGAGAAATGTCTGGCTAAAGAGCTGGATGGAGGAGGAGAAGCACCAATTCCATGTGTGACATCAGAGCCTCTTCTAGCAGACATTGGCCATGTCTCCGCCCCACACTCCAGCAGAAAGGCTGAAGATCGTCTCAGTTCAGGGAGCGATGACAGTACGGTGATCGACGATAATTGTCGACAACTCACTGATTGCTGTGATTCTTACTTCCCCAGCAACGAGTATCTGCAATGTGCACCTCTGCCTAATGGGTTGCAAATGGAACATGATGATAGCAATAACAATAGCAACTACTTGTTCTCAGACATGTTTGCAGTAACAGGCCAACAAAATCAGGAGGGACTGGGAGGGCCTCCTGCTTGGTGGACATGGCCTTAGGACAGTCATTCCACTTGTAACAGCCTGTTGCTTATACAAAATGTTAATGAAGTTCTTGTTTGTAATGAATAAAGTTGAGATATTATAGACATTGTTTGAAGTATTGTATGGCTGTGCTTAAAGAATGGTGCAAAATGTAGTGTCCACAAACTTGAACTGTAAACTTATAAGACAAAGCAAGGATATAACATAC

Coding sequence (CDS)

ATGATGAGTATGACTATGCATGAGAGTCCAAAGGGGAGGCCATTTTTTCGATCACCGGACGATCTTTACGATGATGAATATTACGACGAGTTCTACCCTGAGAAGAAGCGTCGTCTCACCCATGATCAGGTTTGTTATATTTGCCTTTACTGCCCCATTTTGCCTGCTTCTGCTGCTGCACTCTCTGTGGGACAAAAATTTAGAAGTTGCGTAAAATTGTTGAAGGTTCAAATGTTGGAGAAGAGCTTTGATGAAGACAACAAACTGGAGCCAGAGAGGAAGTCCCAACTGGCCAAGAAGTTGGGGCTGCAACCAAGGCAGGTGGCTGTGTGGTTCCAGAATCGTCGTGCTCGATGGAAGACAAAGCAGCTCGAAAGGGACTATGATGTTCTTAAAGCTTCATATGATTTGCTTCTGTCTAACTATGACTCAGTTATCAAAGAGAATGCAGATCTTAAATCTCAGGTGGCTTCCTTAACTGAGAAATGTCTGGCTAAAGAGCTGGATGGAGGAGGAGAAGCACCAATTCCATGTGTGACATCAGAGCCTCTTCTAGCAGACATTGGCCATGTCTCCGCCCCACACTCCAGCAGAAAGGCTGAAGATCGTCTCAGTTCAGGGAGCGATGACAGTACGGTGATCGACGATAATTGTCGACAACTCACTGATTGCTGTGATTCTTACTTCCCCAGCAACGAGTATCTGCAATGTGCACCTCTGCCTAATGGGTTGCAAATGGAACATGATGATAGCAATAACAATAGCAACTACTTGTTCTCAGACATGTTTGCAGTAACAGGCCAACAAAATCAGGAGGGACTGGGAGGGCCTCCTGCTTGGTGGACATGGCCTTAG

Protein sequence

MMSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAALSVGQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVTSEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP
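The protein sequence is the conceptual translation of the CDS. As a rough illustration, the sketch below translates a nucleotide string with the standard genetic code, which is an assumption here because the page does not state which translation table was used. The helper name `translate` is hypothetical, and only the first and last codons of the CDS are used as inputs to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First 12 codons of the CDS shown above.
print(translate("ATGATGAGTATGACTATGCATGAGAGTCCAAAGGGG"))
# Last 6 codons of the CDS, including the TAG stop.
print(translate("TGGTGGACATGGCCTTAG"))
```

Passing the full CDS string to `translate` should give the protein sequence listed above, provided the standard code applies.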
Homology
BLAST of CmoCh14G006540 vs. ExPASy Swiss-Prot
Match: Q02283 (Homeobox-leucine zipper protein HAT5 OS=Arabidopsis thaliana OX=3702 GN=HAT5 PE=1 SV=1)

HSP 1 Score: 231.1 bits (588), Expect = 1.6e-59
Identity = 139/283 (49.12%), Positives = 174/283 (61.48%), Query Frame = 0

Query: 4   MTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAALSV 63
           M M E+ K RPFF SP+DLYDD++YD+  PEKKRRLT +Q                    
Sbjct: 37  MNMEETSKRRPFFSSPEDLYDDDFYDDQLPEKKRRLTTEQ-------------------- 96

Query: 64  GQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQ 123
                       V +LEKSF+ +NKLEPERK+QLAKKLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 97  ------------VHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRARWKTKQ 156

Query: 124 LERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAP--IPCVTS 183
           LERDYD+LK++YD LLSNYDS++ +N  L+S+V SLTEK   K+ +   E P  +P    
Sbjct: 157 LERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQGKQ-ETANEPPGQVP---- 216

Query: 184 EPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPLP 243
           EP   D  +++A  ++ K EDRLSSGS  S V+DD+  QL D CDSYFPS   +Q     
Sbjct: 217 EPNQLDPVYINA--AAIKTEDRLSSGSVGSAVLDDDAPQLLDSCDSYFPSIVPIQ----D 272

Query: 244 NGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285
           N    +HD    N    F+D+F  T   + +  G   A+W WP
Sbjct: 277 NSNASDHD----NDRSCFADVFVPTTSPSHDHHGESLAFWGWP 272

BLAST of CmoCh14G006540 vs. ExPASy Swiss-Prot
Match: A2X980 (Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. indica OX=39946 GN=HOX16 PE=2 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 9.8e-46
Identity = 132/330 (40.00%), Positives = 171/330 (51.82%), Query Frame = 0

Query: 6   MHESPKG--RPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAALSV 65
           M E  +G  RPFF +PD+L ++EYYDE  PEKKRRLT +Q                    
Sbjct: 48  MEEGGRGVKRPFFTTPDELLEEEYYDEQLPEKKRRLTPEQ-------------------- 107

Query: 66  GQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQ 125
                       V +LE+SF+E+NKLEPERK++LA+KLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 108 ------------VHLLERSFEEENKLEPERKTELARKLGLQPRQVAVWFQNRRARWKTKQ 167

Query: 126 LERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPC-VTSE 185
           LERD+D LKAS+D L +++D+++++N  L SQV SLTEK   KE    G A     V   
Sbjct: 168 LERDFDRLKASFDALRADHDALLQDNHRLHSQVMSLTEKLQEKETTTEGSAGAAVDVPGL 227

Query: 186 PLLADIGHVSAPHSSR------------------KAEDRLSSGSDDSTVIDDNCRQLTDC 245
           P  AD+  V+ P +                    KAEDRLS+GS  S V+D + + +  C
Sbjct: 228 PAAADV-KVAVPDAEEPALEEAAAAFEEQQEQQVKAEDRLSTGSGGSAVVDTDAQLVVGC 287

Query: 246 -----------CDSYFP-SNEYLQCAPLP-----NGLQMEHDD---SNNNSNYLFSD--- 284
                       +SYFP  +EY  C   P      G+Q E DD   S+   +Y   D   
Sbjct: 288 GRQHLAAVDSSVESYFPGGDEYHDCVMGPMDHAAGGIQSEEDDGAGSDEGCSYYADDAGV 344

BLAST of CmoCh14G006540 vs. ExPASy Swiss-Prot
Match: Q6YWR4 (Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX16 PE=2 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 9.8e-46
Identity = 132/330 (40.00%), Positives = 171/330 (51.82%), Query Frame = 0

Query: 6   MHESPKG--RPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAALSV 65
           M E  +G  RPFF +PD+L ++EYYDE  PEKKRRLT +Q                    
Sbjct: 46  MEEGGRGVKRPFFTTPDELLEEEYYDEQLPEKKRRLTPEQ-------------------- 105

Query: 66  GQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQ 125
                       V +LE+SF+E+NKLEPERK++LA+KLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 106 ------------VHLLERSFEEENKLEPERKTELARKLGLQPRQVAVWFQNRRARWKTKQ 165

Query: 126 LERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPC-VTSE 185
           LERD+D LKAS+D L +++D+++++N  L SQV SLTEK   KE    G A     V   
Sbjct: 166 LERDFDRLKASFDALRADHDALLQDNHRLHSQVMSLTEKLQEKETTTEGSAGAAVDVPGL 225

Query: 186 PLLADIGHVSAPHSSR------------------KAEDRLSSGSDDSTVIDDNCRQLTDC 245
           P  AD+  V+ P +                    KAEDRLS+GS  S V+D + + +  C
Sbjct: 226 PAAADV-KVAVPDAEEPALEEAAAAFEEQQEQQVKAEDRLSTGSGGSAVVDTDAQLVVGC 285

Query: 246 -----------CDSYFP-SNEYLQCAPLP-----NGLQMEHDD---SNNNSNYLFSD--- 284
                       +SYFP  +EY  C   P      G+Q E DD   S+   +Y   D   
Sbjct: 286 GRQHLAAVDSSVESYFPGGDEYHDCVMGPMDHAAGGIQSEEDDGAGSDEGCSYYADDAGV 342

BLAST of CmoCh14G006540 vs. ExPASy Swiss-Prot
Match: Q9XH36 (Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. indica OX=39946 GN=HOX5 PE=1 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 1.7e-34
Identity = 79/155 (50.97%), Positives = 104/155 (67.10%), Query Frame = 0

Query: 13  RPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAALSVGQKFRSCVK 72
           RPFF + ++L ++EYYDE  PEKKRRLT +Q                             
Sbjct: 64  RPFFTTHEELLEEEYYDEQAPEKKRRLTAEQ----------------------------- 123

Query: 73  LLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLK 132
              VQMLE+SF+E+NKLEPERK++LA++LG+ PRQVAVWFQNRRARWKTKQLE D+D LK
Sbjct: 124 ---VQMLERSFEEENKLEPERKTELARRLGMAPRQVAVWFQNRRARWKTKQLEHDFDRLK 183

Query: 133 ASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKE 168
           A+YD L +++ +++ +N  L++QV SLTEK   KE
Sbjct: 184 AAYDALAADHHALLSDNDRLRAQVISLTEKLQDKE 186

BLAST of CmoCh14G006540 vs. ExPASy Swiss-Prot
Match: Q6ZA74 (Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX5 PE=1 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 1.7e-34
Identity = 79/155 (50.97%), Positives = 104/155 (67.10%), Query Frame = 0

Query: 13  RPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAALSVGQKFRSCVK 72
           RPFF + ++L ++EYYDE  PEKKRRLT +Q                             
Sbjct: 64  RPFFTTHEELLEEEYYDEQAPEKKRRLTAEQ----------------------------- 123

Query: 73  LLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLK 132
              VQMLE+SF+E+NKLEPERK++LA++LG+ PRQVAVWFQNRRARWKTKQLE D+D LK
Sbjct: 124 ---VQMLERSFEEENKLEPERKTELARRLGMAPRQVAVWFQNRRARWKTKQLEHDFDRLK 183

Query: 133 ASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKE 168
           A+YD L +++ +++ +N  L++QV SLTEK   KE
Sbjct: 184 AAYDALAADHHALLSDNDRLRAQVISLTEKLQDKE 186

BLAST of CmoCh14G006540 vs. ExPASy TrEMBL
Match: A0A6J1F7P1 (homeobox-leucine zipper protein HAT5-like OS=Cucurbita moschata OX=3662 GN=LOC111441616 PE=4 SV=1)

HSP 1 Score: 504.2 bits (1297), Expect = 3.5e-139
Identity = 252/284 (88.73%), Positives = 252/284 (88.73%), Query Frame = 0

Query: 1   MMSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAA 60
           MMSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQ                 
Sbjct: 35  MMSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQ----------------- 94

Query: 61  LSVGQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWK 120
                          VQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWK
Sbjct: 95  ---------------VQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWK 154

Query: 121 TKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVT 180
           TKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVT
Sbjct: 155 TKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVT 214

Query: 181 SEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPL 240
           SEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPL
Sbjct: 215 SEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPL 274

Query: 241 PNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285
           PNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP
Sbjct: 275 PNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 286

BLAST of CmoCh14G006540 vs. ExPASy TrEMBL
Match: A0A6J1J4X1 (homeobox-leucine zipper protein HAT5-like OS=Cucurbita maxima OX=3661 GN=LOC111481400 PE=4 SV=1)

HSP 1 Score: 481.5 bits (1238), Expect = 2.4e-132
Identity = 243/284 (85.56%), Positives = 248/284 (87.32%), Query Frame = 0

Query: 1   MMSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAA 60
           M+SMTMHESPKGRPFF+SPDDLYDDEYYDEFYPEKKRRLTHDQ                 
Sbjct: 35  MISMTMHESPKGRPFFQSPDDLYDDEYYDEFYPEKKRRLTHDQ----------------- 94

Query: 61  LSVGQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWK 120
                          VQMLEKSFDE+NKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWK
Sbjct: 95  ---------------VQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWK 154

Query: 121 TKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVT 180
           TKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELD GGEAPIPCVT
Sbjct: 155 TKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELD-GGEAPIPCVT 214

Query: 181 SEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPL 240
           SEPLLADIG+VS PHSSRKAEDRLSSGSD STVIDDNCRQL DCCDSYFPSNEYLQCAPL
Sbjct: 215 SEPLLADIGNVSTPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCAPL 274

Query: 241 PNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285
           PNGLQMEHD+SNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP
Sbjct: 275 PNGLQMEHDNSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285

BLAST of CmoCh14G006540 vs. ExPASy TrEMBL
Match: A0A6J1EJJ4 (homeobox-leucine zipper protein HAT5 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111435068 PE=4 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 6.5e-109
Identity = 213/286 (74.48%), Positives = 225/286 (78.67%), Query Frame = 0

Query: 2   MSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAAL 61
           M+M+M ESPKGRPFFRSPDDLYDDEYYDE YPEKKRRL ++Q                  
Sbjct: 40  MNMSMQESPKGRPFFRSPDDLYDDEYYDELYPEKKRRLANEQ------------------ 99

Query: 62  SVGQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT 121
                         VQMLEKSF+E+NKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT
Sbjct: 100 --------------VQMLEKSFEEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT 159

Query: 122 KQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVTS 181
           KQLERDYDVLKASYDLL+SNYDS++KENA LKS+VASLTEKC+AKELD GGEAPIP  T 
Sbjct: 160 KQLERDYDVLKASYDLLMSNYDSIVKENAVLKSEVASLTEKCVAKELD-GGEAPIPRTTL 219

Query: 182 EPLLADIGHVSAPH---SSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCA 241
           EPLLAD  HVSAPH   S RKAEDRLSSGSD S VIDDNC QL D  DSYFPSNEY Q A
Sbjct: 220 EPLLADTAHVSAPHSGGSGRKAEDRLSSGSDSSAVIDDNCLQLIDSGDSYFPSNEYPQRA 279

Query: 242 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285
           PLP GLQMEHDD N+NSNYLFSDMFA T QQNQE  GGPPAWW WP
Sbjct: 280 PLPPGLQMEHDDRNDNSNYLFSDMFAETNQQNQE--GGPPAWWAWP 290

BLAST of CmoCh14G006540 vs. ExPASy TrEMBL
Match: A0A6J1EQW9 (homeobox-leucine zipper protein HAT5 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435068 PE=4 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 6.5e-109
Identity = 213/286 (74.48%), Positives = 225/286 (78.67%), Query Frame = 0

Query: 2   MSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAAL 61
           M+M+M ESPKGRPFFRSPDDLYDDEYYDE YPEKKRRL ++Q                  
Sbjct: 41  MNMSMQESPKGRPFFRSPDDLYDDEYYDELYPEKKRRLANEQ------------------ 100

Query: 62  SVGQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT 121
                         VQMLEKSF+E+NKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT
Sbjct: 101 --------------VQMLEKSFEEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT 160

Query: 122 KQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVTS 181
           KQLERDYDVLKASYDLL+SNYDS++KENA LKS+VASLTEKC+AKELD GGEAPIP  T 
Sbjct: 161 KQLERDYDVLKASYDLLMSNYDSIVKENAVLKSEVASLTEKCVAKELD-GGEAPIPRTTL 220

Query: 182 EPLLADIGHVSAPH---SSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCA 241
           EPLLAD  HVSAPH   S RKAEDRLSSGSD S VIDDNC QL D  DSYFPSNEY Q A
Sbjct: 221 EPLLADTAHVSAPHSGGSGRKAEDRLSSGSDSSAVIDDNCLQLIDSGDSYFPSNEYPQRA 280

Query: 242 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285
           PLP GLQMEHDD N+NSNYLFSDMFA T QQNQE  GGPPAWW WP
Sbjct: 281 PLPPGLQMEHDDRNDNSNYLFSDMFAETNQQNQE--GGPPAWWAWP 291

BLAST of CmoCh14G006540 vs. ExPASy TrEMBL
Match: A0A6J1JPM5 (homeobox-leucine zipper protein HAT5-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111486614 PE=4 SV=1)

HSP 1 Score: 396.7 bits (1018), Expect = 7.9e-107
Identity = 211/286 (73.78%), Positives = 222/286 (77.62%), Query Frame = 0

Query: 2   MSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAAL 61
           M+M+M ES KGRPFFRSPDDLYDDEYYDE YPEKKRRL ++Q                  
Sbjct: 40  MNMSMQESSKGRPFFRSPDDLYDDEYYDELYPEKKRRLANEQ------------------ 99

Query: 62  SVGQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT 121
                         VQMLEKSF+E+NKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT
Sbjct: 100 --------------VQMLEKSFEEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKT 159

Query: 122 KQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVTS 181
           KQLERDYDVLKASYDLL+SNYDS++KENA LKS+VASLTEKC+AKELD GGEAPIPC TS
Sbjct: 160 KQLERDYDVLKASYDLLMSNYDSIVKENAVLKSEVASLTEKCVAKELD-GGEAPIPCTTS 219

Query: 182 EPLLADIGHVSAPH---SSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCA 241
           EPL AD  HVSAPH   S RKAEDRLSSGSD S VIDDNC QL D  DSYFPSNEY    
Sbjct: 220 EPLRADTAHVSAPHSGGSGRKAEDRLSSGSDSSAVIDDNCLQLIDSGDSYFPSNEY---- 279

Query: 242 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285
           PLP GLQMEHDD N NSNYLFSDMFA T QQNQE  GGPPAWW WP
Sbjct: 280 PLPPGLQMEHDDRNYNSNYLFSDMFAETNQQNQE--GGPPAWWAWP 286

BLAST of CmoCh14G006540 vs. TAIR 10
Match: AT3G01470.1 (homeobox 1)

HSP 1 Score: 231.1 bits (588), Expect = 1.1e-60
Identity = 139/283 (49.12%), Positives = 174/283 (61.48%), Query Frame = 0

Query: 4   MTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAALSV 63
           M M E+ K RPFF SP+DLYDD++YD+  PEKKRRLT +Q                    
Sbjct: 37  MNMEETSKRRPFFSSPEDLYDDDFYDDQLPEKKRRLTTEQ-------------------- 96

Query: 64  GQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQ 123
                       V +LEKSF+ +NKLEPERK+QLAKKLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 97  ------------VHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRARWKTKQ 156

Query: 124 LERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAP--IPCVTS 183
           LERDYD+LK++YD LLSNYDS++ +N  L+S+V SLTEK   K+ +   E P  +P    
Sbjct: 157 LERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQGKQ-ETANEPPGQVP---- 216

Query: 184 EPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPLP 243
           EP   D  +++A  ++ K EDRLSSGS  S V+DD+  QL D CDSYFPS   +Q     
Sbjct: 217 EPNQLDPVYINA--AAIKTEDRLSSGSVGSAVLDDDAPQLLDSCDSYFPSIVPIQ----D 272

Query: 244 NGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285
           N    +HD    N    F+D+F  T   + +  G   A+W WP
Sbjct: 277 NSNASDHD----NDRSCFADVFVPTTSPSHDHHGESLAFWGWP 272

BLAST of CmoCh14G006540 vs. TAIR 10
Match: AT2G22430.1 (homeobox protein 6)

HSP 1 Score: 116.7 bits (291), Expect = 3.0e-26
Identity = 59/98 (60.20%), Positives = 76/98 (77.55%), Query Frame = 0

Query: 75  KVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLKAS 134
           +V+ LEK+F+ +NKLEPERK +LA++LGLQPRQVAVWFQNRRARWKTKQLE+DY VLK  
Sbjct: 70  QVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLEKDYGVLKTQ 129

Query: 135 YDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGG 173
           YD L  N+DS+ ++N  L  +++ L  K     L+GGG
Sbjct: 130 YDSLRHNFDSLRRDNESLLQEISKLKTK-----LNGGG 162

BLAST of CmoCh14G006540 vs. TAIR 10
Match: AT4G40060.1 (homeobox protein 16)

HSP 1 Score: 115.9 bits (289), Expect = 5.2e-26
Identity = 73/166 (43.98%), Positives = 104/166 (62.65%), Query Frame = 0

Query: 71  VKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDV 130
           +K+ +V+ LEK+F+ +NKLEPERK++LA++LGLQPRQVAVWFQNRRARWKTKQLE+DY V
Sbjct: 63  LKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQLEKDYGV 122

Query: 131 LKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVTSEPLLADIGH 190
           LK  YD L  N+DS+ ++N  L  +++ +  K   +E +   +A    V  E +      
Sbjct: 123 LKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNNNKAITEGVKEEEVHKTDSI 182

Query: 191 VSAP-----HSS----RKAEDRLSSGSDDSTVIDDNCRQLTDCCDS 228
            S+P     HSS    R++   L     +STV++      +D CDS
Sbjct: 183 PSSPLQFLEHSSGFNYRRSFTDLRDLLPNSTVVEAGS---SDSCDS 225

BLAST of CmoCh14G006540 vs. TAIR 10
Match: AT5G15150.1 (homeobox 3)

HSP 1 Score: 114.8 bits (286), Expect = 1.2e-25
Identity = 58/99 (58.59%), Positives = 77/99 (77.78%), Query Frame = 0

Query: 61  LSVGQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWK 120
           + +G+K +  + L +V+ LEKSF+  NKLEPERK QLAK LGLQPRQ+A+WFQNRRARWK
Sbjct: 110 MMLGEK-KKRLNLEQVRALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWK 169

Query: 121 TKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASL 160
           TKQLERDYD LK  +D+L S+ DS++  N  L +++ +L
Sbjct: 170 TKQLERDYDSLKKQFDVLKSDNDSLLAHNKKLHAELVAL 207

BLAST of CmoCh14G006540 vs. TAIR 10
Match: AT5G65310.1 (homeobox protein 5)

HSP 1 Score: 114.0 bits (284), Expect = 2.0e-25
Identity = 57/96 (59.38%), Positives = 74/96 (77.08%), Query Frame = 0

Query: 75  KVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLKAS 134
           +V+ LEK+F+ DNKLEPERK +LA++LGLQPRQVA+WFQNRRARWKTKQLERDY VLK++
Sbjct: 80  QVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNRRARWKTKQLERDYGVLKSN 139

Query: 135 YDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDG 171
           +D L  N DS+ ++N  L  Q+  L  K   + + G
Sbjct: 140 FDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKG 175

The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q02283 | 1.6e-59 | 49.12 | Homeobox-leucine zipper protein HAT5 OS=Arabidopsis thaliana OX=3702 GN=HAT5 PE=...
A2X980 | 9.8e-46 | 40.00 | Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. indica OX=39946 GN=...
Q6YWR4 | 9.8e-46 | 40.00 | Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. japonica OX=39947 G...
Q9XH36 | 1.7e-34 | 50.97 | Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. indica OX=39946 GN=H...
Q6ZA74 | 1.7e-34 | 50.97 | Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. japonica OX=39947 GN...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1F7P1 | 3.5e-139 | 88.73 | homeobox-leucine zipper protein HAT5-like OS=Cucurbita moschata OX=3662 GN=LOC11...
A0A6J1J4X1 | 2.4e-132 | 85.56 | homeobox-leucine zipper protein HAT5-like OS=Cucurbita maxima OX=3661 GN=LOC1114...
A0A6J1EJJ4 | 6.5e-109 | 74.48 | homeobox-leucine zipper protein HAT5 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1EQW9 | 6.5e-109 | 74.48 | homeobox-leucine zipper protein HAT5 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JPM5 | 7.9e-107 | 73.78 | homeobox-leucine zipper protein HAT5-like isoform X2 OS=Cucurbita maxima OX=3661...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G01470.1 | 1.1e-60 | 49.12 | homeobox 1
AT2G22430.1 | 3.0e-26 | 60.20 | homeobox protein 6
AT4G40060.1 | 5.2e-26 | 43.98 | homeobox protein 16
AT5G15150.1 | 1.2e-25 | 58.59 | homeobox 3
AT5G65310.1 | 2.0e-25 | 59.38 | homeobox protein 5
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 142..162
None | No IPR available | GENE3D | 1.10.10.60 | - | coord: 74..120, e-value: 7.9E-18, score: 65.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 194..209
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 192..211
None | No IPR available | PANTHER | PTHR24326:SF497 | HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT5 | coord: 2..44
None | No IPR available | PANTHER | PTHR24326:SF497 | HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT5 | coord: 75..250
None | No IPR available | PANTHER | PTHR24326 | HOMEOBOX-LEUCINE ZIPPER PROTEIN | coord: 2..44
None | No IPR available | PANTHER | PTHR24326 | HOMEOBOX-LEUCINE ZIPPER PROTEIN | coord: 75..250
IPR000047 | Helix-turn-helix motif | PRINTS | PR00031 | HTHREPRESSR | coord: 93..102, score: 48.54; coord: 102..118, score: 59.22
IPR001356 | Homeobox domain | SMART | SM00389 | HOX_1 | coord: 64..126, e-value: 8.6E-16, score: 68.5
IPR001356 | Homeobox domain | PFAM | PF00046 | Homeodomain | coord: 68..120, e-value: 5.4E-16, score: 58.2
IPR001356 | Homeobox domain | PROSITE | PS50071 | HOMEOBOX_2 | coord: 62..122, score: 16.843536
IPR001356 | Homeobox domain | CDD | cd00086 | homeodomain | coord: 76..123, e-value: 3.58246E-16, score: 69.1944
IPR003106 | Leucine zipper, homeobox-associated | PFAM | PF02183 | HALZ | coord: 122..162, e-value: 2.4E-16, score: 59.6
IPR017970 | Homeobox, conserved site | PROSITE | PS00027 | HOMEOBOX_1 | coord: 97..120
IPR009057 | Homeobox-like domain superfamily | SUPERFAMILY | 46689 | Homeodomain-like | coord: 65..132
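The `coord` values above locate each domain on the protein sequence. The sketch below shows how such a range could be turned into a subsequence, assuming the coordinates are 1-based and inclusive residue positions (the page does not say so explicitly). The helper name `region` is hypothetical.

```python
# Protein sequence of CmoCh14G006540, split into 60-residue rows for readability.
PROTEIN = (
    "MMSMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVCYICLYCPILPASAAA"
    "LSVGQKFRSCVKLLKVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWK"
    "TKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPCVT"
    "SEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPL"
    "PNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP"
)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}..{end} is outside 1..{len(seq)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


# PROSITE PS00027 (HOMEOBOX_1) conserved site, coord: 97..120.
print(region(PROTEIN, 97, 120))
# Pfam PF02183 (HALZ) leucine zipper, coord: 122..162.
print(region(PROTEIN, 122, 162))
```

Under that assumption, the homeobox hits (roughly residues 62 to 126) and the adjacent HALZ hit (122..162) sit back to back, which is the domain arrangement expected for a homeobox-leucine zipper protein.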

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh14G006540.1 | CmoCh14G006540.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006357 | regulation of transcription by RNA polymerase II
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0000981 | DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0003677 | DNA binding