HG10023525 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10023525
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 1-like
LocationChr05: 35003719 .. 35007632 (+)
RNA-Seq ExpressionHG10023525
SyntenyHG10023525
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGTATCGAAATTCGTCAAGGGTTTTAGATGACTCTCCATTCGAATTCACACTAGCTGAGGCAAGTTCTTATGTTATGCTTGAGTCATGTCTTTCAATCGATGCTTCCGAGTTTAGTTTCCCTGTTCGAGTTATGTGCAGAAATTCATTACAGCATTTTCTGTTTTGTAATTACCGATTTCGATTCATTCTGCCGTTAATTTTGCAGATTGTAGAGATGGATAATATCTTGAAGGACTCTGGAGATCAAACACTTGGTCAAGAGTTTTTCCAAGATGTCGCACTTCATTTCAGGTTATATAGGGAATTTTTCTAAAAGCTTCGATCGCTTCTTCAAACCAGTATTTGTGCAATCAATCATCGTGTCTTTCTTTTTCATGTTCGCTCTTTCTTGCAGTTGCTCCCCGTGGCGCGCTGGAAAATCTCCCGTCACTGCGGAACAGGTATTCGATTTTTCCTTATCCGCATTCTACTTTGTTCATTGTTTTCGTGATACTTGGTCATTCTTATGTGCCCTAGGTGCATGGCTGGTTTGAGAATCGGAAAAAGGAATTGCGAAGTAGTTGTAAAAAAGCTCGGCCTCCACCTCCACCTCCACTTCCACCTCCGCCATCACCTCCGCCTCCGACTCCGCCACCGAAACTTTTGCTTTATCATTCGGAGAGTAATTTTTTAACTGACGCGCCTTCATCTGAACCACCTGAATTCAAAGGTAAAATTCCGCATTATTAGCTGATGAATTTTAGAAATTCTATCCGAAGTTCTCCTTTGCACTGCCAATGCAACAACCAGTTGCTGCGGCTATAGGTATATACATAGTTTTATATATTTTTGCTTCTACGATGGTTTTCTTTTATTCTGTTGCACGATACTAGGCTGGTAGAGGGAGGGGCTGCACAGTGATTGGTAACTCATTCGTGTCTAGAATTATGTGCTCGATGTTTGGAGATTTAGTCGTGCTTAGATAATCTTGTCTTTTATCAAGAGATATTAAGCAGAAAGATCCATAGGCGATGAAACTTCAAACTCGCAAACCTTCTTGCATCGAATGCCGAGTACATTTAGAATTTCTTCCTTGTGAACTGAAAACTGCAGTGTGTCCTTGCATGATCCAAGCTTTGTCCAAGATACTATAATTTGCAGTTCTGGAGTAATTTTCACTTTCGATACATGTTGGCAAACTTGAAATCTGTGCAGTTCCAAATCAAACTATTAACTTTTTGTAATATCATTTCTTGAAATTTAATGCAGGCAAGGCAACTGATCTTTCAGAATTAGCATTTGAAGCCTTTTCGTCAAGAGACCATGCCTGGTGAGAAAAATATAGAGCAGAGAATACAAAACTGTTAGCTTGATCTGTGATGGAATCATATGTTCATATTCACACAATTTCCATGGATTCTAAACTGTAAATGACATCTTGAATCCACTGGAATACATAGGTAGTCGGTGAGTTCTCCCTAGTCAGCTCAGATGTAGCACCTCAGTTTAGCTATCCATAATCCTAGCTAGCATTGTACAAGTGCCTTAAGTTTATTTCATTTTACACTTGGAATAACTAGTGCAACACAAATCTCCCTATAAACAATGCTAACTTCTCAGACATCTATTACTTGCAGTCATTCAGGATGTTACACCCTTGATAATGATGTGCCAAAGACTGACATTCAGTTCAGATCCTTTAAGAAGGCTTTGCAAAAGTTGGGATTTGGGTCATTTAGCTCTACAGAGTTAGAAACAAAGTTATTGACTGCAATGATATTTACATGCTAACTTCTGTGCAAATGTGTTAATAATCAGTCTTTGATTTTTATCATCTATATGGAGCTTTTATACTCGATTTTCTTACCCTAATGCCAGTGTAAAGTTGATGTTTCATGGACACTGTTGGGTGCTATACATGTCCAAACCATAAAAGCTTTAGCATTATTTCCTGCTTCGTCTAACCCATAATGTTTATGTTGTGTTATCTTGCTCAACGAACAGTCAGTGTCATCATTGAGATGTATCCATTACTTTTGTTGGTGTATGGTATATGGGCAATGACATAGATAATGATATAGATAATGGACCTATCATATATCATTATCTTATGAGAAGATGTTATTAGACGCGTCTCTCTTCAATTATCAACTTGATGGTTTACAGAAAAGCCATTGATATTTGAAGATTCATATTATTTTGCTTTCAGCAAGCACTGTTACTGCATAATCAGTTTTGGTTAAGACTCATGTTTTTCTATTGAATTGAAAAATCATACTAATGTTCTTTTGATTCCTTAGGTATGATGTTGCTTCCTTCCTCAGTTACCGAGTTAATTGCCATGGAGAACTAGTAAGATTTTTCCCACACATTTCATATACTTCTTTCGACAGTTGCTGTAGGTTGCAAATGCACAAAATACGTCATTGTTTTCCACTCACTGGTTTATCATTATGAAACCACCGGAAAAAAGTCAGCAAATGTTTAGACGTTGAAGAGAAAAGCCTCATGATCTGTTGTGCTTTTTATGCTATTGCATTGGAACAAACTTCACAGGATGCTCGAGTTCGTTATGCTGGCTTTGGAAAGGATGAGGATGAGTGGGTTAATGTTGCAAGAGGAGTGCGTGATCGATCCATACCTTTGGAATCTTCAGAGTGTTACAGAGTGAAAGTTGGAGATCTTGTGTTATGTTACCGGGTAATTCTATGTTTCCAGTTTTATTTATTTATTATTATTTTATTGATATATGAAAAGGAAGAAAAAAGTAAAAGAAAGTAAAAACCAAACAGAACTGACGAGAACTGGAAAAGAAGGAACCACACGCACACAAAAAAAGGCAAAAAAAACACCGAAAAGCAAAGGAAGAGAAACAATACAGGATTGGAGCTGCCACAAAACAAAACAGACGAGCCAAATAACCAACAGCCGGCTGGAACATTTTTTTTAAAAAAATGTCTTCATAGGTGACCCTTGGAATTCTATGTTCCCAAATTTTCTCCTCCCAAATTGAGTTTTCTTAGCGAGGAGTGTATGCGAATTTTTCAAGCAACAAATAGTCTGGCTACCAGAATAGCTTTGTACTGAACTATAATTACTTATTATTATTATTATTAGGAAAATTTTAACGGATAAAAAAATGTCAAACTATTTACAGAAAATAGCAAAAAAATACTGATAGATATTGATAGATTTCTATCCGCATTTATCAGTGATAGACTTGTTGACCAAACAAACAATTGTCAGCAAAGTTGTTTCTTTTTTTTTTCTTGAAATCCAAAGAATTTTTCTTACTTTATACATAACATAGGTCATCGATTCAGCATTGGATAAAAAAGAAAAAAATGAGGGTGAAATCCTCATTTTTTTTAGTTTTTTACAAAAAAAATTCTAATATTCTATTAAAATATAAAATAATAAAAAAGGGATTTAGATAGACCCTAATAGACTTCTATCAACGTCTATCACAACTATCTAAAAAATTTTGCTATTTTGTGGAAATAGTTTTCCTTATTTTTCTATTTTTTAAAACTCCTCTTATTGTTATTATAAGTAAACTAAAATGATCATCATAATAAAAATGGGGAAAGAAGTCCTTAGTGCGACAAATGTTTCAGATCATGTAATCTAACCTTGGAAAGTTTTATGGCTTATTTACACACAGGAAGGACAAGATCATGCACTCTACTTCGATGCATATGTTGTGGAAATTCAGAGGAGGCTACACGATATTGGGGGTTGTCGATGCATATTTGTTGTACGCTACGACTATGATCACTATGAGGTAGAAAGCCAACCAATAGAAATTTGTCCAATTTTACTTGAAACGAATACTCAGATTCTTATGCTGCTGCCTTTTTGGCTTAGGAAAAAGTTCATTTGGGGAGATTGTGCTGCAGGCCTTCGGCATACAACTCCGACCAAATTTGA

mRNA sequence

ATGGAGTATCGAAATTCGTCAAGGGTTTTAGATGACTCTCCATTCGAATTCACACTAGCTGAGGCAAGTTCTTATGTTATGCTTGAGTCATGTCTTTCAATCGATGCTTCCGAGTTTAGTTTCCCTGTTCGAGTTATGTGCAGAAATTCATTACAGCATTTTCTGTTTTGTAATTACCGATTTCGATTCATTCTGCCGTTAATTTTGCAGATTGTAGAGATGGATAATATCTTGAAGGACTCTGGAGATCAAACACTTGGTCAAGAGTTTTTCCAAGATGTCGCACTTCATTTCAGTTGCTCCCCGTGGCGCGCTGGAAAATCTCCCGTCACTGCGGAACAGGTGCATGGCTGGTTTGAGAATCGGAAAAAGGAATTGCGAAGTAGTTGTAAAAAAGCTCGGCCTCCACCTCCACCTCCACTTCCACCTCCGCCATCACCTCCGCCTCCGACTCCGCCACCGAAACTTTTGCTTTATCATTCGGAGAGTAATTTTTTAACTGACGCGCCTTCATCTGAACCACCTGAATTCAAAGGCAAGGCAACTGATCTTTCAGAATTAGCATTTGAAGCCTTTTCGTCAAGAGACCATGCCTGGTATGATGTTGCTTCCTTCCTCAGTTACCGAGTTAATTGCCATGGAGAACTAGATGCTCGAGTTCGTTATGCTGGCTTTGGAAAGGATGAGGATGAGTGGGTTAATGTTGCAAGAGGAGTGCGTGATCGATCCATACCTTTGGAATCTTCAGAGTGTTACAGAGTGAAAGTTGGAGATCTTGTGTTATGTTACCGGGAAGGACAAGATCATGCACTCTACTTCGATGCATATGTTGTGGAAATTCAGAGGAGGCTACACGATATTGGGGGTTGTCGATGCATATTTGTTGTACGCTACGACTATGATCACTATGAGGAAAAAGTTCATTTGGGGAGATTGTGCTGCAGGCCTTCGGCATACAACTCCGACCAAATTTGA

Coding sequence (CDS)

ATGGAGTATCGAAATTCGTCAAGGGTTTTAGATGACTCTCCATTCGAATTCACACTAGCTGAGGCAAGTTCTTATGTTATGCTTGAGTCATGTCTTTCAATCGATGCTTCCGAGTTTAGTTTCCCTGTTCGAGTTATGTGCAGAAATTCATTACAGCATTTTCTGTTTTGTAATTACCGATTTCGATTCATTCTGCCGTTAATTTTGCAGATTGTAGAGATGGATAATATCTTGAAGGACTCTGGAGATCAAACACTTGGTCAAGAGTTTTTCCAAGATGTCGCACTTCATTTCAGTTGCTCCCCGTGGCGCGCTGGAAAATCTCCCGTCACTGCGGAACAGGTGCATGGCTGGTTTGAGAATCGGAAAAAGGAATTGCGAAGTAGTTGTAAAAAAGCTCGGCCTCCACCTCCACCTCCACTTCCACCTCCGCCATCACCTCCGCCTCCGACTCCGCCACCGAAACTTTTGCTTTATCATTCGGAGAGTAATTTTTTAACTGACGCGCCTTCATCTGAACCACCTGAATTCAAAGGCAAGGCAACTGATCTTTCAGAATTAGCATTTGAAGCCTTTTCGTCAAGAGACCATGCCTGGTATGATGTTGCTTCCTTCCTCAGTTACCGAGTTAATTGCCATGGAGAACTAGATGCTCGAGTTCGTTATGCTGGCTTTGGAAAGGATGAGGATGAGTGGGTTAATGTTGCAAGAGGAGTGCGTGATCGATCCATACCTTTGGAATCTTCAGAGTGTTACAGAGTGAAAGTTGGAGATCTTGTGTTATGTTACCGGGAAGGACAAGATCATGCACTCTACTTCGATGCATATGTTGTGGAAATTCAGAGGAGGCTACACGATATTGGGGGTTGTCGATGCATATTTGTTGTACGCTACGACTATGATCACTATGAGGAAAAAGTTCATTTGGGGAGATTGTGCTGCAGGCCTTCGGCATACAACTCCGACCAAATTTGA

Protein sequence

MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYRFRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFENRKKELRSSCKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPEFKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVARGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVVRYDYDHYEEKVHLGRLCCRPSAYNSDQI
Homology
BLAST of HG10023525 vs. NCBI nr
Match: XP_038897963.1 (protein SAWADEE HOMEODOMAIN HOMOLOG 1-like [Benincasa hispida])

HSP 1 Score: 532.7 bits (1371), Expect = 2.2e-147
Identity = 267/324 (82.41%), Postives = 271/324 (83.64%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           M+YRNSS VLDDSPFEFTLAE                                       
Sbjct: 1   MDYRNSSSVLDDSPFEFTLAE--------------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE
Sbjct: 61  ----------IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120

Query: 121 NRKKELRSSCKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPEFKGK 180
           NRK EL +SCKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSES+FLTDAPSSEPPEFKGK
Sbjct: 121 NRKMELLTSCKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSESDFLTDAPSSEPPEFKGK 180

Query: 181 ATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVARGVR 240
           ATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVARGVR
Sbjct: 181 ATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVARGVR 240

Query: 241 DRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVVRYDY 300
           DRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVVRYD+
Sbjct: 241 DRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVVRYDH 275

Query: 301 DHYEEKVHLGRLCCRPSAYNSDQI 325
           D YEEKVHLGRLCCRPSAYNSDQI
Sbjct: 301 DDYEEKVHLGRLCCRPSAYNSDQI 275

BLAST of HG10023525 vs. NCBI nr
Match: KAA0051041.1 (protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 501.5 bits (1290), Expect = 5.4e-138
Identity = 254/328 (77.44%), Postives = 267/328 (81.40%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           ME+  SS++LDDS FEFTLAEASSY                                   
Sbjct: 1   MEHPKSSKLLDDSSFEFTLAEASSY----------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRA KSPVTAE VH WFE
Sbjct: 61  ----------IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFE 120

Query: 121 NRKKELRSSCKKARPPPPPP----LPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPE 180
           NR+KELRSS KKARPPPPPP    LPP PS PPPTPPPKLLLYHSES+FLT APSSEPPE
Sbjct: 121 NRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPE 180

Query: 181 FKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVA 240
           F GKATDLSELAFEAFSSRDHAWYDVASFL+YR+NCHGELDARVRYAGFGKDEDEWVNVA
Sbjct: 181 FIGKATDLSELAFEAFSSRDHAWYDVASFLTYRINCHGELDARVRYAGFGKDEDEWVNVA 240

Query: 241 RGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 300
           RGVRDRSIPLESSECYRVKVGDLVLC+RE QDHALYFDAYVVEIQRRLHDIGGCRCIFVV
Sbjct: 241 RGVRDRSIPLESSECYRVKVGDLVLCFRERQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 283

Query: 301 RYDYDHYEEKVHLGRLCCRPSAYNSDQI 325
           RY++DHYEEKVH+GRLCCRPSA+NSD+I
Sbjct: 301 RYEHDHYEEKVHIGRLCCRPSAFNSDRI 283

BLAST of HG10023525 vs. NCBI nr
Match: XP_008462435.1 (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X2 [Cucumis melo])

HSP 1 Score: 492.7 bits (1267), Expect = 2.5e-135
Identity = 250/328 (76.22%), Postives = 263/328 (80.18%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           ME+  SS++LDDS FEFTLAE                                       
Sbjct: 1   MEHPKSSKLLDDSSFEFTLAE--------------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRA KSPVTAE VH WFE
Sbjct: 61  ----------IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFE 120

Query: 121 NRKKELRSSCKKARPPPPPP----LPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPE 180
           NR+KELRSS KKARPPPPPP    LPP PS PPPTPPPKLLLYHSES+FLT APSSEPPE
Sbjct: 121 NRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPE 180

Query: 181 FKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVA 240
           F GKATDLSELAFEAFSSRDHAWYDVASFL+YR+NCHGELDARVRYAGFGKDEDEWVNVA
Sbjct: 181 FIGKATDLSELAFEAFSSRDHAWYDVASFLTYRINCHGELDARVRYAGFGKDEDEWVNVA 240

Query: 241 RGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 300
           RGVRDRSIPLESSECYRVKVGDLVLC+RE QDHALYFDAYVVEIQRRLHDIGGCRCIFVV
Sbjct: 241 RGVRDRSIPLESSECYRVKVGDLVLCFRERQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 279

Query: 301 RYDYDHYEEKVHLGRLCCRPSAYNSDQI 325
           RY++DHYEEKVH+GRLCCRPSA+NSD+I
Sbjct: 301 RYEHDHYEEKVHIGRLCCRPSAFNSDRI 279

BLAST of HG10023525 vs. NCBI nr
Match: XP_023514197.1 (protein SAWADEE HOMEODOMAIN HOMOLOG 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 470.7 bits (1210), Expect = 1.0e-128
Identity = 246/336 (73.21%), Postives = 260/336 (77.38%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           ME+RNSS  LDDS FEFTLAE                                       
Sbjct: 5   MEFRNSSTDLDDSVFEFTLAE--------------------------------------- 64

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMD+IL++SGDQTLGQ+FFQDVALHFSCSPWRA KSPVTAEQV  WFE
Sbjct: 65  ----------IVEMDSILRESGDQTLGQQFFQDVALHFSCSPWRAEKSPVTAEQVQSWFE 124

Query: 121 NRKKELR----SSCKKARPPPPPPLPPPPSP----PPPTPPPKLLLYHSESNFLTDAPSS 180
           NRKKE R    SS KKARPPPPPP PPPP P    PPPTPPPKLLLYHS+S FLTD P+S
Sbjct: 125 NRKKESRSSSTSSSKKARPPPPPPPPPPPPPPLSSPPPTPPPKLLLYHSDSAFLTDIPAS 184

Query: 181 EP----PEFKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKD 240
           EP    PEFKGKATDLSELAFEAFSSRD+AWYDVASFL+YRVN HGELDARVRY GFGKD
Sbjct: 185 EPPDSFPEFKGKATDLSELAFEAFSSRDNAWYDVASFLTYRVNYHGELDARVRYTGFGKD 244

Query: 241 EDEWVNVARGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIG 300
           EDEWVNVARGVR+RSIPLESSEC+RVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHD G
Sbjct: 245 EDEWVNVARGVRERSIPLESSECHRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDTG 291

Query: 301 GCRCIFVVRYDYDHYEEKVHLGRLCCRPSAYNSDQI 325
           GCRCIFVVRY++DHYEEKVHLGRLCCRPSAYNSDQ+
Sbjct: 305 GCRCIFVVRYEHDHYEEKVHLGRLCCRPSAYNSDQL 291

BLAST of HG10023525 vs. NCBI nr
Match: XP_004141662.1 (protein SAWADEE HOMEODOMAIN HOMOLOG 1 [Cucumis sativus] >KGN45586.1 hypothetical protein Csa_016875 [Cucumis sativus])

HSP 1 Score: 470.7 bits (1210), Expect = 1.0e-128
Identity = 241/336 (71.73%), Postives = 257/336 (76.49%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           ME+R SS++LDDS FEFTLAE                                       
Sbjct: 1   MEHRKSSKLLDDSSFEFTLAE--------------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMDNILKDS DQTLGQEFFQDVALHFSCSPWRA KSPVT E VH WFE
Sbjct: 61  ----------IVEMDNILKDSRDQTLGQEFFQDVALHFSCSPWRAAKSPVTTEHVHAWFE 120

Query: 121 NRKKELRSSCKKARPPPPPPLPPPPSP------------PPPTPPPKLLLYHSESNFLTD 180
           NR+KELR+S KKARPPPPPP  PPP P            PPP+PPPKLLLYHSES+FLT 
Sbjct: 121 NRRKELRASSKKARPPPPPPSEPPPPPPSELPPLPTPSSPPPSPPPKLLLYHSESDFLTH 180

Query: 181 APSSEPPEFKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKD 240
           APSS PPEFKGKATDLSELAFEAFSSRDHAWYDVASFL+YRVNCHGELDARVRYAGF KD
Sbjct: 181 APSSGPPEFKGKATDLSELAFEAFSSRDHAWYDVASFLTYRVNCHGELDARVRYAGFRKD 240

Query: 241 EDEWVNVARGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIG 300
           EDEWVNV RGVRDRSIPLESSECYRVKVGDLVLC++E QDHALYFDA+VVEIQRRLHDI 
Sbjct: 241 EDEWVNVGRGVRDRSIPLESSECYRVKVGDLVLCFQERQDHALYFDAHVVEIQRRLHDIS 287

Query: 301 GCRCIFVVRYDYDHYEEKVHLGRLCCRPSAYNSDQI 325
           GCRCIFVVRY++D +EEKVH+GRLCCRPSA+NSDQI
Sbjct: 301 GCRCIFVVRYEHDRHEEKVHIGRLCCRPSAFNSDQI 287

BLAST of HG10023525 vs. ExPASy Swiss-Prot
Match: Q9XI47 (Protein SAWADEE HOMEODOMAIN HOMOLOG 1 OS=Arabidopsis thaliana OX=3702 GN=SHH1 PE=1 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 3.8e-54
Identity = 116/247 (46.96%), Postives = 158/247 (63.97%), Query Frame = 0

Query: 70  QIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFENRKKELRSS 129
           +IV+M+N+ K+ GDQ+L ++F Q VA  FSCS  R GKS +T +QV  WF+ + K     
Sbjct: 18  EIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKLKHQSQP 77

Query: 130 CKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPEFKGKATDLSELAF 189
             K  P PP  +    +P           +   S F+           KGKA+DL++LAF
Sbjct: 78  KSKTLPSPPLQIHDLSNPSSYASNASNATFVGNSTFVQTR--------KGKASDLADLAF 137

Query: 190 EAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVARGVRDRSIPLESS 249
           EA S+RD+AWYDV+SFL+YRV   GEL+ RVR++GF    DEWVNV   VR+RSIP+E S
Sbjct: 138 EAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEPS 197

Query: 250 ECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVVRYDYDHYEEKVHL 309
           EC RV VGDL+LC++E +D ALY D +V+ I+R +HD   C C+F+VRY+ D+ EE + L
Sbjct: 198 ECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTEESLGL 256

Query: 310 GRLCCRP 317
            R+C RP
Sbjct: 258 ERICRRP 256

BLAST of HG10023525 vs. ExPASy Swiss-Prot
Match: Q8RWJ7 (Protein SAWADEE HOMEODOMAIN HOMOLOG 2 OS=Arabidopsis thaliana OX=3702 GN=SHH2 PE=2 SV=1)

HSP 1 Score: 184.5 bits (467), Expect = 1.9e-45
Identity = 118/264 (44.70%), Postives = 154/264 (58.33%), Query Frame = 0

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
           FRFILP   ++ EM+ IL        G+   + +A  FS SP R GK  V  +Q+  WF+
Sbjct: 12  FRFILP---EVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQFKQIWNWFQ 71

Query: 121 NRKKELRSSCKKA------RPPPPPPLP-PPPSPPPPTPPPKLLLYHSESNFLTDAPS-S 180
           NR+  LR+   KA         P   LP    S   P   PK          +T APS S
Sbjct: 72  NRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPGMTPAPSGS 131

Query: 181 EPPEFKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEW 240
             P      +D S L FEA S+RD AWYDV +FL++R    G+ + +VR+AGF  +EDEW
Sbjct: 132 LVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVEEDEW 191

Query: 241 VNVARGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRC 300
           +NV + VR RS+P E+SEC  V  GDLVLC++EG+D ALYFDA V++ QRR HD+ GCRC
Sbjct: 192 INVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVRGCRC 251

Query: 301 IFVVRYDYDHYEEKVHLGRLCCRP 317
            F+VRY +D  EE V L ++C RP
Sbjct: 252 RFLVRYSHDQSEEIVPLRKICRRP 272

BLAST of HG10023525 vs. ExPASy TrEMBL
Match: A0A5A7UBT5 (Protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold481G00190 PE=4 SV=1)

HSP 1 Score: 501.5 bits (1290), Expect = 2.6e-138
Identity = 254/328 (77.44%), Postives = 267/328 (81.40%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           ME+  SS++LDDS FEFTLAEASSY                                   
Sbjct: 1   MEHPKSSKLLDDSSFEFTLAEASSY----------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRA KSPVTAE VH WFE
Sbjct: 61  ----------IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFE 120

Query: 121 NRKKELRSSCKKARPPPPPP----LPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPE 180
           NR+KELRSS KKARPPPPPP    LPP PS PPPTPPPKLLLYHSES+FLT APSSEPPE
Sbjct: 121 NRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPE 180

Query: 181 FKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVA 240
           F GKATDLSELAFEAFSSRDHAWYDVASFL+YR+NCHGELDARVRYAGFGKDEDEWVNVA
Sbjct: 181 FIGKATDLSELAFEAFSSRDHAWYDVASFLTYRINCHGELDARVRYAGFGKDEDEWVNVA 240

Query: 241 RGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 300
           RGVRDRSIPLESSECYRVKVGDLVLC+RE QDHALYFDAYVVEIQRRLHDIGGCRCIFVV
Sbjct: 241 RGVRDRSIPLESSECYRVKVGDLVLCFRERQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 283

Query: 301 RYDYDHYEEKVHLGRLCCRPSAYNSDQI 325
           RY++DHYEEKVH+GRLCCRPSA+NSD+I
Sbjct: 301 RYEHDHYEEKVHIGRLCCRPSAFNSDRI 283

BLAST of HG10023525 vs. ExPASy TrEMBL
Match: A0A1S3CIG7 (protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103500784 PE=4 SV=1)

HSP 1 Score: 492.7 bits (1267), Expect = 1.2e-135
Identity = 250/328 (76.22%), Postives = 263/328 (80.18%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           ME+  SS++LDDS FEFTLAE                                       
Sbjct: 1   MEHPKSSKLLDDSSFEFTLAE--------------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRA KSPVTAE VH WFE
Sbjct: 61  ----------IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFE 120

Query: 121 NRKKELRSSCKKARPPPPPP----LPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPE 180
           NR+KELRSS KKARPPPPPP    LPP PS PPPTPPPKLLLYHSES+FLT APSSEPPE
Sbjct: 121 NRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPE 180

Query: 181 FKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVA 240
           F GKATDLSELAFEAFSSRDHAWYDVASFL+YR+NCHGELDARVRYAGFGKDEDEWVNVA
Sbjct: 181 FIGKATDLSELAFEAFSSRDHAWYDVASFLTYRINCHGELDARVRYAGFGKDEDEWVNVA 240

Query: 241 RGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 300
           RGVRDRSIPLESSECYRVKVGDLVLC+RE QDHALYFDAYVVEIQRRLHDIGGCRCIFVV
Sbjct: 241 RGVRDRSIPLESSECYRVKVGDLVLCFRERQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 279

Query: 301 RYDYDHYEEKVHLGRLCCRPSAYNSDQI 325
           RY++DHYEEKVH+GRLCCRPSA+NSD+I
Sbjct: 301 RYEHDHYEEKVHIGRLCCRPSAFNSDRI 279

BLAST of HG10023525 vs. ExPASy TrEMBL
Match: A0A0A0KCT9 (SAWADEE domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G452900 PE=4 SV=1)

HSP 1 Score: 470.7 bits (1210), Expect = 4.9e-129
Identity = 241/336 (71.73%), Postives = 257/336 (76.49%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           ME+R SS++LDDS FEFTLAE                                       
Sbjct: 1   MEHRKSSKLLDDSSFEFTLAE--------------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMDNILKDS DQTLGQEFFQDVALHFSCSPWRA KSPVT E VH WFE
Sbjct: 61  ----------IVEMDNILKDSRDQTLGQEFFQDVALHFSCSPWRAAKSPVTTEHVHAWFE 120

Query: 121 NRKKELRSSCKKARPPPPPPLPPPPSP------------PPPTPPPKLLLYHSESNFLTD 180
           NR+KELR+S KKARPPPPPP  PPP P            PPP+PPPKLLLYHSES+FLT 
Sbjct: 121 NRRKELRASSKKARPPPPPPSEPPPPPPSELPPLPTPSSPPPSPPPKLLLYHSESDFLTH 180

Query: 181 APSSEPPEFKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKD 240
           APSS PPEFKGKATDLSELAFEAFSSRDHAWYDVASFL+YRVNCHGELDARVRYAGF KD
Sbjct: 181 APSSGPPEFKGKATDLSELAFEAFSSRDHAWYDVASFLTYRVNCHGELDARVRYAGFRKD 240

Query: 241 EDEWVNVARGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIG 300
           EDEWVNV RGVRDRSIPLESSECYRVKVGDLVLC++E QDHALYFDA+VVEIQRRLHDI 
Sbjct: 241 EDEWVNVGRGVRDRSIPLESSECYRVKVGDLVLCFQERQDHALYFDAHVVEIQRRLHDIS 287

Query: 301 GCRCIFVVRYDYDHYEEKVHLGRLCCRPSAYNSDQI 325
           GCRCIFVVRY++D +EEKVH+GRLCCRPSA+NSDQI
Sbjct: 301 GCRCIFVVRYEHDRHEEKVHIGRLCCRPSAFNSDQI 287

BLAST of HG10023525 vs. ExPASy TrEMBL
Match: A0A6J1DBC4 (protein SAWADEE HOMEODOMAIN HOMOLOG 1-like OS=Momordica charantia OX=3673 GN=LOC111018742 PE=4 SV=1)

HSP 1 Score: 469.5 bits (1207), Expect = 1.1e-128
Identity = 241/325 (74.15%), Postives = 255/325 (78.46%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           MEYR+  + LDD PFEFTLAE                                       
Sbjct: 1   MEYRSLPKDLDDYPFEFTLAE--------------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMDNILKD+GDQTLGQEFFQDVALHFSCSPWRAGKS VTAEQV GWFE
Sbjct: 61  ----------IVEMDNILKDTGDQTLGQEFFQDVALHFSCSPWRAGKSSVTAEQVKGWFE 120

Query: 121 NRKKELRSSCKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEP----PE 180
           NR+ ELRSS KKA PPPPPP PPPP      PPPKLLLYHS+S+FLTDAPSSEP    PE
Sbjct: 121 NRQNELRSSSKKAPPPPPPPSPPPP------PPPKLLLYHSDSSFLTDAPSSEPPDSLPE 180

Query: 181 FKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVA 240
            KGKA+DLSELAFEAFSSRD+AWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVA
Sbjct: 181 LKGKASDLSELAFEAFSSRDNAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVA 240

Query: 241 RGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 300
           RGVR+RSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVV
Sbjct: 241 RGVRERSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVV 270

Query: 301 RYDYDHYEEKVHLGRLCCRPSAYNS 322
           RYD+D++EEKVHLGRLCCRP+AYN+
Sbjct: 301 RYDHDNHEEKVHLGRLCCRPAAYNN 270

BLAST of HG10023525 vs. ExPASy TrEMBL
Match: A0A6J1HHW3 (protein SAWADEE HOMEODOMAIN HOMOLOG 1-like OS=Cucurbita moschata OX=3662 GN=LOC111464481 PE=4 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 1.9e-128
Identity = 243/332 (73.19%), Postives = 258/332 (77.71%), Query Frame = 0

Query: 1   MEYRNSSRVLDDSPFEFTLAEASSYVMLESCLSIDASEFSFPVRVMCRNSLQHFLFCNYR 60
           ME+RNSS  +DDS FEFTLAE                                       
Sbjct: 1   MEFRNSSSDIDDSVFEFTLAE--------------------------------------- 60

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
                     IVEMD+IL++SGDQTLGQ+FFQDVALHFSCSPWRA KSPVTAEQV  WFE
Sbjct: 61  ----------IVEMDSILRESGDQTLGQQFFQDVALHFSCSPWRAEKSPVTAEQVQSWFE 120

Query: 121 NRKKELR----SSCKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEP-- 180
           NRKKE R    SS KK RPPPPPP PPP S PPPTPPPKLLLYHS+S FLTD P+SEP  
Sbjct: 121 NRKKESRSSSTSSSKKPRPPPPPPPPPPLSSPPPTPPPKLLLYHSDSAFLTDIPASEPPD 180

Query: 181 --PEFKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEW 240
             PEFKGKATDLSELAFEAFSSRD+AWYDVASFL+YRVN HGELDARVRY GFGKDEDEW
Sbjct: 181 SSPEFKGKATDLSELAFEAFSSRDNAWYDVASFLTYRVNYHGELDARVRYTGFGKDEDEW 240

Query: 241 VNVARGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRC 300
           VNVARGVR+RSIPLESSEC+RVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHD GGCRC
Sbjct: 241 VNVARGVRERSIPLESSECHRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDTGGCRC 283

Query: 301 IFVVRYDYDHYEEKVHLGRLCCRPSAYNSDQI 325
           IFVVRY++DHYEEKVHLGRLCCRPSAYNSDQ+
Sbjct: 301 IFVVRYEHDHYEEKVHLGRLCCRPSAYNSDQL 283

BLAST of HG10023525 vs. TAIR 10
Match: AT1G15215.2 (BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors;sequence-specific DNA binding (TAIR:AT3G18380.1); Has 89 Blast hits to 86 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 89; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 213.4 bits (542), Expect = 2.7e-55
Identity = 116/247 (46.96%), Postives = 158/247 (63.97%), Query Frame = 0

Query: 70  QIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFENRKKELRSS 129
           +IV+M+N+ K+ GDQ+L ++F Q VA  FSCS  R GKS +T +QV  WF+ + K     
Sbjct: 18  EIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKLKHQSQP 77

Query: 130 CKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPEFKGKATDLSELAF 189
             K  P PP  +    +P           +   S F+           KGKA+DL++LAF
Sbjct: 78  KSKTLPSPPLQIHDLSNPSSYASNASNATFVGNSTFVQTR--------KGKASDLADLAF 137

Query: 190 EAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVARGVRDRSIPLESS 249
           EA S+RD+AWYDV+SFL+YRV   GEL+ RVR++GF    DEWVNV   VR+RSIP+E S
Sbjct: 138 EAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEPS 197

Query: 250 ECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVVRYDYDHYEEKVHL 309
           EC RV VGDL+LC++E +D ALY D +V+ I+R +HD   C C+F+VRY+ D+ EE + L
Sbjct: 198 ECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTEESLGL 256

Query: 310 GRLCCRP 317
            R+C RP
Sbjct: 258 ERICRRP 256

BLAST of HG10023525 vs. TAIR 10
Match: AT1G15215.3 (BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors;sequence-specific DNA binding (TAIR:AT3G18380.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 200.3 bits (508), Expect = 2.4e-51
Identity = 110/235 (46.81%), Postives = 150/235 (63.83%), Query Frame = 0

Query: 70  QIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFENRKKELRSS 129
           +IV+M+N+ K+ GDQ+L ++F Q VA  FSCS  R GKS +T +QV  WF+ + K     
Sbjct: 18  EIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKLKHQSQP 77

Query: 130 CKKARPPPPPPLPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPEFKGKATDLSELAF 189
             K  P PP  +    +P           +   S F+           KGKA+DL++LAF
Sbjct: 78  KSKTLPSPPLQIHDLSNPSSYASNASNATFVGNSTFVQTR--------KGKASDLADLAF 137

Query: 190 EAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVARGVRDRSIPLESS 249
           EA S+RD+AWYDV+SFL+YRV   GEL+ RVR++GF    DEWVNV   VR+RSIP+E S
Sbjct: 138 EAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEPS 197

Query: 250 ECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVVRYDYDHYE 305
           EC RV VGDL+LC++E +D ALY D +V+ I+R +HD   C C+F+VRY+ D+ E
Sbjct: 198 ECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTE 244

BLAST of HG10023525 vs. TAIR 10
Match: AT1G15215.1 (BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors;sequence-specific DNA binding (TAIR:AT3G18380.1); Has 89 Blast hits to 86 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 89; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 195.3 bits (495), Expect = 7.7e-50
Identity = 108/231 (46.75%), Postives = 146/231 (63.20%), Query Frame = 0

Query: 74  MDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFENRKKELRSSCKKA 133
           M+N+ K+ GDQ+L ++F Q VA  FSCS  R GKS +T +QV  WF+ + K       K 
Sbjct: 1   MENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKLKHQSQPKSKT 60

Query: 134 RPPPPPPLPPPPSPPPPTPPPKLLLYHSESNFLTDAPSSEPPEFKGKATDLSELAFEAFS 193
            P PP  +    +P           +   S F+           KGKA+DL++LAFEA S
Sbjct: 61  LPSPPLQIHDLSNPSSYASNASNATFVGNSTFVQTR--------KGKASDLADLAFEAKS 120

Query: 194 SRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEWVNVARGVRDRSIPLESSECYR 253
           +RD+AWYDV+SFL+YRV   GEL+ RVR++GF    DEWVNV   VR+RSIP+E SEC R
Sbjct: 121 ARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEPSECGR 180

Query: 254 VKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRCIFVVRYDYDHYE 305
           V VGDL+LC++E +D ALY D +V+ I+R +HD   C C+F+VRY+ D+ E
Sbjct: 181 VNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTE 223

BLAST of HG10023525 vs. TAIR 10
Match: AT3G18380.1 (sequence-specific DNA binding transcription factors;sequence-specific DNA binding )

HSP 1 Score: 184.5 bits (467), Expect = 1.4e-46
Identity = 118/264 (44.70%), Postives = 154/264 (58.33%), Query Frame = 0

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
           FRFILP   ++ EM+ IL        G+   + +A  FS SP R GK  V  +Q+  WF+
Sbjct: 12  FRFILP---EVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQFKQIWNWFQ 71

Query: 121 NRKKELRSSCKKA------RPPPPPPLP-PPPSPPPPTPPPKLLLYHSESNFLTDAPS-S 180
           NR+  LR+   KA         P   LP    S   P   PK          +T APS S
Sbjct: 72  NRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPGMTPAPSGS 131

Query: 181 EPPEFKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEW 240
             P      +D S L FEA S+RD AWYDV +FL++R    G+ + +VR+AGF  +EDEW
Sbjct: 132 LVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVEEDEW 191

Query: 241 VNVARGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRC 300
           +NV + VR RS+P E+SEC  V  GDLVLC++EG+D ALYFDA V++ QRR HD+ GCRC
Sbjct: 192 INVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVRGCRC 251

Query: 301 IFVVRYDYDHYEEKVHLGRLCCRP 317
            F+VRY +D  EE V L ++C RP
Sbjct: 252 RFLVRYSHDQSEEIVPLRKICRRP 272

BLAST of HG10023525 vs. TAIR 10
Match: AT3G18380.2 (sequence-specific DNA binding transcription factors;sequence-specific DNA binding )

HSP 1 Score: 180.3 bits (456), Expect = 2.6e-45
Identity = 117/265 (44.15%), Postives = 155/265 (58.49%), Query Frame = 0

Query: 61  FRFILPLILQIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAGKSPVTAEQVHGWFE 120
           FRFILP   ++ EM+ IL        G+   + +A  FS SP R GK  V  +Q+  WF+
Sbjct: 12  FRFILP---EVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQFKQIWNWFQ 71

Query: 121 NRKKELRSSCKKA------RPPPPPPLP-PPPSPPPPTPPPKLLLYHSESNFLTDAPS-S 180
           NR+  LR+   KA         P   LP    S   P   PK          +T APS S
Sbjct: 72  NRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPGMTPAPSGS 131

Query: 181 EPPEFKGKATDLSELAFEAFSSRDHAWYDVASFLSYRVNCHGELDARVRYAGFGKDEDEW 240
             P      +D S L FEA S+RD AWYDV +FL++R    G+ + +VR+AGF  +EDEW
Sbjct: 132 LVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAGFEVEEDEW 191

Query: 241 VNVARGVRDRSIPLESSECYRVKVGDLVLCYREGQDHALYFDAYVVEIQRRLHDIGGCRC 300
           +NV + VR RS+P E+SEC  V  GDLVLC++EG+D ALYFDA V++ QRR HD+ GCRC
Sbjct: 192 INVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRRHDVRGCRC 251

Query: 301 IFVVRYDYDHYEEK-VHLGRLCCRP 317
            F+VRY +D  E++ V L ++C RP
Sbjct: 252 RFLVRYSHDQSEQEIVPLRKICRRP 273

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038897963.12.2e-14782.41protein SAWADEE HOMEODOMAIN HOMOLOG 1-like [Benincasa hispida][more]
KAA0051041.15.4e-13877.44protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X2 [Cucumis melo var. ma... [more]
XP_008462435.12.5e-13576.22PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X2 [Cucumis melo][more]
XP_023514197.11.0e-12873.21protein SAWADEE HOMEODOMAIN HOMOLOG 1-like [Cucurbita pepo subsp. pepo][more]
XP_004141662.11.0e-12871.73protein SAWADEE HOMEODOMAIN HOMOLOG 1 [Cucumis sativus] >KGN45586.1 hypothetical... [more]
Match NameE-valueIdentityDescription
Q9XI473.8e-5446.96Protein SAWADEE HOMEODOMAIN HOMOLOG 1 OS=Arabidopsis thaliana OX=3702 GN=SHH1 PE... [more]
Q8RWJ71.9e-4544.70Protein SAWADEE HOMEODOMAIN HOMOLOG 2 OS=Arabidopsis thaliana OX=3702 GN=SHH2 PE... [more]
Match NameE-valueIdentityDescription
A0A5A7UBT52.6e-13877.44Protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X2 OS=Cucumis melo var. ... [more]
A0A1S3CIG71.2e-13576.22protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X2 OS=Cucumis melo OX=3656 GN... [more]
A0A0A0KCT94.9e-12971.73SAWADEE domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G452900 PE=... [more]
A0A6J1DBC41.1e-12874.15protein SAWADEE HOMEODOMAIN HOMOLOG 1-like OS=Momordica charantia OX=3673 GN=LOC... [more]
A0A6J1HHW31.9e-12873.19protein SAWADEE HOMEODOMAIN HOMOLOG 1-like OS=Cucurbita moschata OX=3662 GN=LOC1... [more]
Match NameE-valueIdentityDescription
AT1G15215.22.7e-5546.96BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transc... [more]
AT1G15215.32.4e-5146.81BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transc... [more]
AT1G15215.17.7e-5046.75BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transc... [more]
AT3G18380.11.4e-4644.70sequence-specific DNA binding transcription factors;sequence-specific DNA bindin... [more]
AT3G18380.22.6e-4544.15sequence-specific DNA binding transcription factors;sequence-specific DNA bindin... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D2.40.50.40coord: 184..245
e-value: 8.1E-30
score: 104.8
NoneNo IPR availableGENE3D2.30.30.140coord: 246..318
e-value: 9.2E-33
score: 113.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 132..156
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 129..157
NoneNo IPR availablePANTHERPTHR33827:SF12SUBFAMILY NOT NAMEDcoord: 70..324
IPR032001SAWADEE domainPFAMPF16719SAWADEEcoord: 186..313
e-value: 2.7E-36
score: 124.7
IPR039276Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2PANTHERPTHR33827PROTEIN SAWADEE HOMEODOMAIN HOMOLOG 2coord: 70..324

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10023525.1HG10023525.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003682 chromatin binding
molecular_function GO:0003677 DNA binding