Cla97C01G014450 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C01G014450
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: transcription factor HEC3-like
Location: Cla97Chr01: 28182072 .. 28183228 (-)
RNA-Seq Expression: Cla97C01G014450
Synteny: Cla97C01G014450
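
The Location field combines a chromosome name, a start and end coordinate, and a strand. The following is a minimal sketch of parsing such a field, assuming the coordinates are 1-based and inclusive (the page does not state this). The helper names `parse_location` and `reverse_complement` are hypothetical and not part of any database API.

```python
import re

# Pattern for a field such as "Cla97Chr01: 28182072 .. 28183228 (-)".
LOCATION = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand, length."""
    match = LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }


_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


loc = parse_location("Cla97Chr01: 28182072 .. 28183228 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

For a feature on the minus strand, the sequence in transcript orientation is the reverse complement of the reference strand over the same span, which is what `reverse_complement` would be used for.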
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGGTTCAGAAGATGATATAAAGAATAAGGCAAACCCCCTCTGTAAAGCCTTTGATTACTAGGGGCAATTCCTTGGGAGAAAAATCTTCCAAAAAAAGAAATTAGAAAATAATGTTAAACCCTAAGATCACTCCTTTTTTCCTCTTTGCCCATCCAAACATCTTTCCCAATTCTTTTCATTCCTTTCTTTCTAAAACACAAGCACTACCAATCCCCATTCTCTTAAACATACAAACCCTCATATTCTTCACTCTTTTTTTAGATAATCAATACAATAGTCTTGTATATTTTTTTCTTCTTCTTCACAAATTAATTTCCCCCACATTGCTTTCATTTTCATCTCTCCAACCCCATACCCTAGTTTTTCTTCACCCCCATCACCCTCCTCCATGGACCCCCATCACCTCCCAAACCCCTCTTCTCATCTCCATCACCCCGCCATGGAAGACCACCATATCGACCTTCACCATCATCACCACGACCCCGACCTCGATTCCCTTTGGCCGTCGTTGTTGCCATTCCAACTCCCCGACTCTCACGATCAACAACTCCCCTCCTCTTCCACCCATCTGTTGATAGGCTACGGTATATATATATTATATATTTGTAAAAGAACATTATAGGAATTTCAACATAGTATTAATAACTCTACTCAATTTAATTGTTTTAAATCCTTTACAGGTACTCCGAGTTCTGGAACAGGTGATGATGAAGAAGAACCGGAAGAAGAGTTAGGTGCTATGAAGGAGATGATGTATAAGATTGCGGCAATGCAGCCAGTGGACATCGACCCTTCAACTATCCGAAAGCCTAAGCGACGGAACGTGCGGATTAGTGATGACCCACAAAGCATCGCAGCTCGTCTCCGACGGGAGAGGATCAGTGAGAAAATCAGAATTCTTCAAAGGCTTGTACCTGGAGGGACCAAGATGGACACAGCTTCAATGTTGGATGAAGCCATTCGCTATGTCAAGTTCTTGAAGAGACAAATTCGGTTGTTGCAGTCAAGTCAGCCGCCGCAACAGCCACCCACCAGCGGTGGAGCCGCCGCTGCCGGCGGAGGATGGCCTTTTCCTTTCCACAAGGCTAATGGTTCAACCTCCTCATCCACTTCCATGGAGACTACTCCTGCCATTACACCATCAGGATGGTGA

mRNA sequence

ATGGGGTCTTGTATATTTTTTTCTTCTTCTTCACAAATTAATTTCCCCCACATTGCTTTCATTTTCATCTCTCCAACCCCATACCCTAGTTTTTCTTCACCCCCATCACCCTCCTCCATGGACCCCCATCACCTCCCAAACCCCTCTTCTCATCTCCATCACCCCGCCATGGAAGACCACCATATCGACCTTCACCATCATCACCACGACCCCGACCTCGATTCCCTTTGGCCGTCGTTGTTGCCATTCCAACTCCCCGACTCTCACGATCAACAACTCCCCTCCTCTTCCACCCATCTGTTGATAGGCTACGGTACTCCGAGTTCTGGAACAGGTGATGATGAAGAAGAACCGGAAGAAGAGTTAGGTGCTATGAAGGAGATGATGTATAAGATTGCGGCAATGCAGCCAGTGGACATCGACCCTTCAACTATCCGAAAGCCTAAGCGACGGAACGTGCGGATTAGTGATGACCCACAAAGCATCGCAGCTCGTCTCCGACGGGAGAGGATCAGTGAGAAAATCAGAATTCTTCAAAGGCTTGTACCTGGAGGGACCAAGATGGACACAGCTTCAATGTTGGATGAAGCCATTCGCTATGTCAAGTTCTTGAAGAGACAAATTCGGTTGTTGCAGTCAAGTCAGCCGCCGCAACAGCCACCCACCAGCGGTGGAGCCGCCGCTGCCGGCGGAGGATGGCCTTTTCCTTTCCACAAGGCTAATGGTTCAACCTCCTCATCCACTTCCATGGAGACTACTCCTGCCATTACACCATCAGGATGGTGA

Coding sequence (CDS)

ATGGGGTCTTGTATATTTTTTTCTTCTTCTTCACAAATTAATTTCCCCCACATTGCTTTCATTTTCATCTCTCCAACCCCATACCCTAGTTTTTCTTCACCCCCATCACCCTCCTCCATGGACCCCCATCACCTCCCAAACCCCTCTTCTCATCTCCATCACCCCGCCATGGAAGACCACCATATCGACCTTCACCATCATCACCACGACCCCGACCTCGATTCCCTTTGGCCGTCGTTGTTGCCATTCCAACTCCCCGACTCTCACGATCAACAACTCCCCTCCTCTTCCACCCATCTGTTGATAGGCTACGGTACTCCGAGTTCTGGAACAGGTGATGATGAAGAAGAACCGGAAGAAGAGTTAGGTGCTATGAAGGAGATGATGTATAAGATTGCGGCAATGCAGCCAGTGGACATCGACCCTTCAACTATCCGAAAGCCTAAGCGACGGAACGTGCGGATTAGTGATGACCCACAAAGCATCGCAGCTCGTCTCCGACGGGAGAGGATCAGTGAGAAAATCAGAATTCTTCAAAGGCTTGTACCTGGAGGGACCAAGATGGACACAGCTTCAATGTTGGATGAAGCCATTCGCTATGTCAAGTTCTTGAAGAGACAAATTCGGTTGTTGCAGTCAAGTCAGCCGCCGCAACAGCCACCCACCAGCGGTGGAGCCGCCGCTGCCGGCGGAGGATGGCCTTTTCCTTTCCACAAGGCTAATGGTTCAACCTCCTCATCCACTTCCATGGAGACTACTCCTGCCATTACACCATCAGGATGGTGA

Protein sequence

MGSCIFFSSSSQINFPHIAFIFISPTPYPSFSSPPSPSSMDPHHLPNPSSHLHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQQLPSSSTHLLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQPPTSGGAAAAGGGWPFPFHKANGSTSSSTSMETTPAITPSGW
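
The protein sequence above is the translation of the coding sequence. The following sketch shows that relationship, assuming the standard nuclear genetic code (the page does not name the translation table). The function name `translate` is hypothetical, and the example input is only the first 33 nucleotides of the CDS shown on this page.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt of the CDS listed on this page.
cds_prefix = "ATGGGGTCTTGTATATTTTTTTCTTCTTCTTCA"
print(translate(cds_prefix))
```

Translating the full CDS the same way should reproduce the listed protein, with the terminal TGA read as the stop codon.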
Homology
BLAST of Cla97C01G014450 vs. NCBI nr
Match: XP_038882292.1 (transcription factor HEC3-like [Benincasa hispida])

HSP 1 Score: 389.4 bits (999), Expect = 2.4e-104
Identity = 203/220 (92.27%), Positives = 206/220 (93.64%), Query Frame = 0

Query: 45  LPNPSSH--LHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQQLPSSSTHLLI 104
           +P PS H  L   AMEDHHI LHHHHH PDL+S+WPS LPFQLPDSHDQQLPSSSTHLLI
Sbjct: 6   VPTPSFHLDLDRSAMEDHHIGLHHHHHVPDLESIWPSFLPFQLPDSHDQQLPSSSTHLLI 65

Query: 105 GYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSI 164
           GY TPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSI
Sbjct: 66  GYSTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSI 125

Query: 165 AARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQPPT 224
           AARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQP T
Sbjct: 126 AARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQPTT 185

Query: 225 SGGAA-AAGGGWPFPFHKANGSTSSSTSMETTPAITPSGW 262
           SGGAA A GGGWPFPFHKANGSTSSSTSMETTPAITPSGW
Sbjct: 186 SGGAAGAGGGGWPFPFHKANGSTSSSTSMETTPAITPSGW 225
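
Each HSP summary line reports match counts over the alignment length, followed by the same value as a percentage. The following sketch parses such a line and recomputes the percentages. The function name `hsp_fractions` is hypothetical; it reads whatever labels appear on the line rather than expecting fixed names.

```python
import re

# Matches fields such as "Identity = 203/220 (92.27%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def hsp_fractions(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in STAT.findall(line)
    }


line = "Identity = 203/220 (92.27%), Positives = 206/220 (93.64%), Query Frame = 0"
for label, (num, den, reported) in hsp_fractions(line).items():
    recomputed = 100 * num / den
    print(f"{label}: {num}/{den} = {recomputed:.2f}% (reported {reported}%)")
```

The denominator is the alignment length including gap columns, not the length of the query protein, so percentages from different hits are not computed over the same span.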

BLAST of Cla97C01G014450 vs. NCBI nr
Match: XP_004148648.2 (transcription factor HEC3 [Cucumis sativus] >KGN48364.1 hypothetical protein Csa_003305 [Cucumis sativus])

HSP 1 Score: 377.9 bits (969), Expect = 7.2e-101
Identity = 197/224 (87.95%), Positives = 203/224 (90.62%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQQLPSSSTH 99
           MDPHHL NP  HL   AM+DHHI LHHHHH  DLDS+WPS LPFQL D HDQQLP+SSTH
Sbjct: 1   MDPHHLSNPPPHLDRSAMDDHHI-LHHHHHVHDLDSIWPSFLPFQLSDHHDQQLPTSSTH 60

Query: 100 LLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 159
            +IGY TPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP
Sbjct: 61  FVIGYSTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 120

Query: 160 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 219
           QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ
Sbjct: 121 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 180

Query: 220 PPTSGGA--AAAGGGWPFPFHKANGSTSSSTSMETTPAITPSGW 262
           P TSGGA  A  GGGW FPF+KANGSTSSSTSME TPAITP+GW
Sbjct: 181 PSTSGGATTAGGGGGWHFPFNKANGSTSSSTSMENTPAITPTGW 223

BLAST of Cla97C01G014450 vs. NCBI nr
Match: XP_016899193.1 (PREDICTED: transcription factor HEC3-like [Cucumis melo] >KAA0025535.1 transcription factor HEC3-like [Cucumis melo var. makuwa] >TYK25694.1 transcription factor HEC3-like [Cucumis melo var. makuwa])

HSP 1 Score: 367.1 bits (941), Expect = 1.3e-97
Identity = 195/224 (87.05%), Positives = 200/224 (89.29%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQQLPSSSTH 99
           MDPHHL NP  HL   AM+DHHI LHH H   DLDS+WPS LPFQL D HDQQLP+SSTH
Sbjct: 1   MDPHHLSNPPPHLDRSAMDDHHI-LHHVH---DLDSIWPSFLPFQLSDHHDQQLPTSSTH 60

Query: 100 LLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 159
            LIGY TPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP
Sbjct: 61  FLIGYSTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 120

Query: 160 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 219
           QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ
Sbjct: 121 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 180

Query: 220 PPTSGGA--AAAGGGWPFPFHKANGSTSSSTSMETTPAITPSGW 262
           P TSGGA  AA GGGW FP +KANGSTSSSTSME TP ITP+GW
Sbjct: 181 PSTSGGAATAAGGGGWHFPLNKANGSTSSSTSMENTPTITPTGW 220

BLAST of Cla97C01G014450 vs. NCBI nr
Match: KAG6604279.1 (Transcription factor IND, partial [Cucurbita argyrosperma subsp. sororia] >KAG7034439.1 Transcription factor IND, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 346.3 bits (887), Expect = 2.3e-91
Identity = 192/231 (83.12%), Positives = 197/231 (85.28%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHH--------DPDLDSLWPSLLPFQLPDSHDQ 99
           M+PHHL NPS HLHH AMEDHHI LH HHH        DPD DS+W SLLPF LPD H Q
Sbjct: 1   MEPHHLSNPSPHLHHSAMEDHHI-LHRHHHVHVHDPDPDPDPDSIWSSLLPFHLPDPH-Q 60

Query: 100 QLPSSSTHLLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRR 159
           Q PSSSTHLLI Y TPSS   D++EEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRR
Sbjct: 61  QFPSSSTHLLISYATPSSRV-DNDEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRR 120

Query: 160 NVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL 219
           NVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL
Sbjct: 121 NVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL 180

Query: 220 QSSQPPQQPPTSGGAAAAGGGWPFPFHKANGSTSSSTSMETTPA-ITPSGW 262
           QSS  PQQP T GGAAA  GGWP  FHKA+GS SSSTSMETTPA ITPSGW
Sbjct: 181 QSSDQPQQPSTDGGAAA--GGWPLTFHKADGSASSSTSMETTPAIITPSGW 226

BLAST of Cla97C01G014450 vs. NCBI nr
Match: XP_022950638.1 (transcription factor IND-like [Cucurbita moschata])

HSP 1 Score: 344.7 bits (883), Expect = 6.8e-91
Identity = 192/235 (81.70%), Positives = 197/235 (83.83%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHH------------DPDLDSLWPSLLPFQLPD 99
           M+PHHL NPS HLHH AMEDHHI LH HHH            DPD DS+W SLLPF LPD
Sbjct: 1   MEPHHLSNPSPHLHHSAMEDHHI-LHRHHHVHVHDPDPDPDPDPDPDSIWSSLLPFHLPD 60

Query: 100 SHDQQLPSSSTHLLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRK 159
            H QQ PSSSTHLLI Y TPSS   D++EEPEEELGAMKEMMYKIAAMQPVDIDPSTIRK
Sbjct: 61  PH-QQFPSSSTHLLISYATPSSRV-DNDEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRK 120

Query: 160 PKRRNVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQ 219
           PKRRNVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQ
Sbjct: 121 PKRRNVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQ 180

Query: 220 IRLLQSSQPPQQPPTSGGAAAAGGGWPFPFHKANGSTSSSTSMETTPA-ITPSGW 262
           IRLLQSS  PQQP T GGAAA  GGWP  FHKA+GS SSSTSMETTPA ITPSGW
Sbjct: 181 IRLLQSSDQPQQPSTDGGAAA--GGWPLTFHKADGSASSSTSMETTPAIITPSGW 230

BLAST of Cla97C01G014450 vs. ExPASy Swiss-Prot
Match: Q9LXD8 (Transcription factor HEC3 OS=Arabidopsis thaliana OX=3702 GN=HEC3 PE=1 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 8.4e-44
Identity = 112/185 (60.54%), Positives = 127/185 (68.65%), Query Frame = 0

Query: 58  EDHHIDLHHHHHDP---DLDSLWPSLLPFQLPDSHDQQLPSSSTHLLIGYGTPSSGTGDD 117
           +DHH   H H++DP    +D      +      SH   L SS T   +  G       +D
Sbjct: 30  DDHH---HQHNNDPIGMAMDQYTQLHIFNPFSSSHFPPLSSSLTTTTLLSGDQED--DED 89

Query: 118 EEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLRRERISEK 177
           EEEP EELGAMKEMMYKIAAMQ VDIDP+T++KPKRRNVRISDDPQS+AAR RRERISE+
Sbjct: 90  EEEPLEELGAMKEMMYKIAAMQSVDIDPATVKKPKRRNVRISDDPQSVAARHRRERISER 149

Query: 178 IRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSS---QPPQQPPTSGGAAAAGG 237
           IRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL ++    PP  PP    + A   
Sbjct: 150 IRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLNNNTGYTPP--PPQDQASQAVTT 207

BLAST of Cla97C01G014450 vs. ExPASy Swiss-Prot
Match: O81313 (Transcription factor IND OS=Arabidopsis thaliana OX=3702 GN=IND PE=1 SV=3)

HSP 1 Score: 155.2 bits (391), Expect = 1.0e-36
Identity = 76/100 (76.00%), Positives = 89/100 (89.00%), Query Frame = 0

Query: 113 DDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLRRERIS 172
           D++EE +E++ AMKEM Y IA MQPVDIDP+T+ KP RRNVRISDDPQ++ AR RRERIS
Sbjct: 76  DEDEEYDEDMDAMKEMQYMIAVMQPVDIDPATVPKPNRRNVRISDDPQTVVARRRRERIS 135

Query: 173 EKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQ 213
           EKIRIL+R+VPGG KMDTASMLDEAIRY KFLKRQ+R+LQ
Sbjct: 136 EKIRILKRIVPGGAKMDTASMLDEAIRYTKFLKRQVRILQ 175

BLAST of Cla97C01G014450 vs. ExPASy Swiss-Prot
Match: Q9FHA7 (Transcription factor HEC1 OS=Arabidopsis thaliana OX=3702 GN=HEC1 PE=1 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 1.5e-32
Identity = 72/110 (65.45%), Positives = 88/110 (80.00%), Query Frame = 0

Query: 122 LGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLRRERISEKIRILQRL 181
           + AM+EM+++IA MQP+ IDP  ++ PKRRNVRIS DPQS+AAR RRERISE+IRILQRL
Sbjct: 95  MAAMREMIFRIAVMQPIHIDPEAVKPPKRRNVRISKDPQSVAARHRRERISERIRILQRL 154

Query: 182 VPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQPPTSGGAAAAGG 232
           VPGGTKMDTASMLDEAI YVKFLK+Q++ L+     +Q   +GG    GG
Sbjct: 155 VPGGTKMDTASMLDEAIHYVKFLKKQVQSLE-----EQAVVTGGGGGGGG 199

BLAST of Cla97C01G014450 vs. ExPASy Swiss-Prot
Match: Q9SND4 (Transcription factor HEC2 OS=Arabidopsis thaliana OX=3702 GN=HEC2 PE=1 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 1.3e-28
Identity = 90/201 (44.78%), Positives = 118/201 (58.71%), Query Frame = 0

Query: 32  SSPPSPSSMDPHHLPNPSSHLHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQ 91
           +S P+P   +PH++   S    HP   +       H H P     +   +P   P  + +
Sbjct: 24  NSNPNP---NPHNIMMLSESNTHPFFFN-----PTHSHLP-----FDQTMPHHQPGLNFR 83

Query: 92  QLPSSSTHLLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRR 151
             PS S+ L      P    G  +      + AM+EM+++IA MQP+ IDP +++ PKR+
Sbjct: 84  YAPSPSSSL------PEKRGGCSD---NANMAAMREMIFRIAVMQPIHIDPESVKPPKRK 143

Query: 152 NVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL 211
           NVRIS DPQS+AAR RRERISE+IRILQRLVPGGTKMDTASMLDEAI YVKFLK+Q++ L
Sbjct: 144 NVRISKDPQSVAARHRRERISERIRILQRLVPGGTKMDTASMLDEAIHYVKFLKKQVQSL 198

Query: 212 QSSQPPQQPPTSGGAAAAGGG 233
           +           GG  A  GG
Sbjct: 204 EE----HAVVNGGGMTAVAGG 198

BLAST of Cla97C01G014450 vs. ExPASy Swiss-Prot
Match: Q8S3D2 (Transcription factor bHLH87 OS=Arabidopsis thaliana OX=3702 GN=BHLH87 PE=1 SV=1)

HSP 1 Score: 125.9 bits (315), Expect = 6.5e-28
Identity = 63/109 (57.80%), Positives = 83/109 (76.15%), Query Frame = 0

Query: 108 SSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLR 167
           S+   D+ E   E +  MKEM+Y+ AA +PV+     + KPKR+NV+IS DPQ++AAR R
Sbjct: 228 STCLSDNVEPDAEAIAQMKEMIYRAAAFRPVNFGLEIVEKPKRKNVKISTDPQTVAARQR 287

Query: 168 RERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQP 217
           RERISEKIR+LQ LVPGGTKMDTASMLDEA  Y+KFL+ Q++ L++ +P
Sbjct: 288 RERISEKIRVLQTLVPGGTKMDTASMLDEAANYLKFLRAQVKALENLRP 336

BLAST of Cla97C01G014450 vs. ExPASy TrEMBL
Match: A0A0A0KHS6 (BHLH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G483450 PE=4 SV=1)

HSP 1 Score: 377.9 bits (969), Expect = 3.5e-101
Identity = 197/224 (87.95%), Positives = 203/224 (90.62%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQQLPSSSTH 99
           MDPHHL NP  HL   AM+DHHI LHHHHH  DLDS+WPS LPFQL D HDQQLP+SSTH
Sbjct: 1   MDPHHLSNPPPHLDRSAMDDHHI-LHHHHHVHDLDSIWPSFLPFQLSDHHDQQLPTSSTH 60

Query: 100 LLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 159
            +IGY TPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP
Sbjct: 61  FVIGYSTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 120

Query: 160 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 219
           QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ
Sbjct: 121 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 180

Query: 220 PPTSGGA--AAAGGGWPFPFHKANGSTSSSTSMETTPAITPSGW 262
           P TSGGA  A  GGGW FPF+KANGSTSSSTSME TPAITP+GW
Sbjct: 181 PSTSGGATTAGGGGGWHFPFNKANGSTSSSTSMENTPAITPTGW 223

BLAST of Cla97C01G014450 vs. ExPASy TrEMBL
Match: A0A5A7SM52 (Transcription factor HEC3-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1230G00250 PE=4 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 6.2e-98
Identity = 195/224 (87.05%), Positives = 200/224 (89.29%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQQLPSSSTH 99
           MDPHHL NP  HL   AM+DHHI LHH H   DLDS+WPS LPFQL D HDQQLP+SSTH
Sbjct: 1   MDPHHLSNPPPHLDRSAMDDHHI-LHHVH---DLDSIWPSFLPFQLSDHHDQQLPTSSTH 60

Query: 100 LLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 159
            LIGY TPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP
Sbjct: 61  FLIGYSTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 120

Query: 160 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 219
           QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ
Sbjct: 121 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 180

Query: 220 PPTSGGA--AAAGGGWPFPFHKANGSTSSSTSMETTPAITPSGW 262
           P TSGGA  AA GGGW FP +KANGSTSSSTSME TP ITP+GW
Sbjct: 181 PSTSGGAATAAGGGGWHFPLNKANGSTSSSTSMENTPTITPTGW 220

BLAST of Cla97C01G014450 vs. ExPASy TrEMBL
Match: A0A1S4DTA8 (transcription factor HEC3-like OS=Cucumis melo OX=3656 GN=LOC103485271 PE=4 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 6.2e-98
Identity = 195/224 (87.05%), Positives = 200/224 (89.29%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQQLPSSSTH 99
           MDPHHL NP  HL   AM+DHHI LHH H   DLDS+WPS LPFQL D HDQQLP+SSTH
Sbjct: 1   MDPHHLSNPPPHLDRSAMDDHHI-LHHVH---DLDSIWPSFLPFQLSDHHDQQLPTSSTH 60

Query: 100 LLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 159
            LIGY TPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP
Sbjct: 61  FLIGYSTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDP 120

Query: 160 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 219
           QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ
Sbjct: 121 QSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQ 180

Query: 220 PPTSGGA--AAAGGGWPFPFHKANGSTSSSTSMETTPAITPSGW 262
           P TSGGA  AA GGGW FP +KANGSTSSSTSME TP ITP+GW
Sbjct: 181 PSTSGGAATAAGGGGWHFPLNKANGSTSSSTSMENTPTITPTGW 220

BLAST of Cla97C01G014450 vs. ExPASy TrEMBL
Match: A0A6J1GFF7 (transcription factor IND-like OS=Cucurbita moschata OX=3662 GN=LOC111453678 PE=4 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 3.3e-91
Identity = 192/235 (81.70%), Positives = 197/235 (83.83%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHH------------DPDLDSLWPSLLPFQLPD 99
           M+PHHL NPS HLHH AMEDHHI LH HHH            DPD DS+W SLLPF LPD
Sbjct: 1   MEPHHLSNPSPHLHHSAMEDHHI-LHRHHHVHVHDPDPDPDPDPDPDSIWSSLLPFHLPD 60

Query: 100 SHDQQLPSSSTHLLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRK 159
            H QQ PSSSTHLLI Y TPSS   D++EEPEEELGAMKEMMYKIAAMQPVDIDPSTIRK
Sbjct: 61  PH-QQFPSSSTHLLISYATPSSRV-DNDEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRK 120

Query: 160 PKRRNVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQ 219
           PKRRNVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQ
Sbjct: 121 PKRRNVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQ 180

Query: 220 IRLLQSSQPPQQPPTSGGAAAAGGGWPFPFHKANGSTSSSTSMETTPA-ITPSGW 262
           IRLLQSS  PQQP T GGAAA  GGWP  FHKA+GS SSSTSMETTPA ITPSGW
Sbjct: 181 IRLLQSSDQPQQPSTDGGAAA--GGWPLTFHKADGSASSSTSMETTPAIITPSGW 230

BLAST of Cla97C01G014450 vs. ExPASy TrEMBL
Match: A0A6J1ISW8 (transcription factor IND-like OS=Cucurbita maxima OX=3661 GN=LOC111478125 PE=4 SV=1)

HSP 1 Score: 338.6 bits (867), Expect = 2.4e-89
Identity = 189/231 (81.82%), Positives = 194/231 (83.98%), Query Frame = 0

Query: 40  MDPHHLPNPSSHLHHPAMEDHHIDLHHHHH--------DPDLDSLWPSLLPFQLPDSHDQ 99
           M+PHHL  PS HLHH AMEDHHI LH HHH        DPD DS+W SLLPF L D H Q
Sbjct: 1   MEPHHLSKPSPHLHHSAMEDHHI-LHRHHHAHVHDPDPDPDPDSIWSSLLPFHLSDPH-Q 60

Query: 100 QLPSSSTHLLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRR 159
           Q PSSSTHLLI Y TPSS   D +EEPEEELGAMKEMMYK+AAMQPVDIDPSTIRKPKRR
Sbjct: 61  QFPSSSTHLLISYATPSSRI-DTDEEPEEELGAMKEMMYKMAAMQPVDIDPSTIRKPKRR 120

Query: 160 NVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL 219
           NVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL
Sbjct: 121 NVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL 180

Query: 220 QSSQPPQQPPTSGGAAAAGGGWPFPFHKANGSTSSSTSMETTPA-ITPSGW 262
           QSS  PQQP T GGAAA  GGWP  FHKA+GS SSSTSMETTPA ITPSGW
Sbjct: 181 QSSDQPQQPSTDGGAAA--GGWPLTFHKADGSASSSTSMETTPAIITPSGW 226

BLAST of Cla97C01G014450 vs. TAIR 10
Match: AT5G09750.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 178.7 bits (452), Expect = 6.0e-45
Identity = 112/185 (60.54%), Positives = 127/185 (68.65%), Query Frame = 0

Query: 58  EDHHIDLHHHHHDP---DLDSLWPSLLPFQLPDSHDQQLPSSSTHLLIGYGTPSSGTGDD 117
           +DHH   H H++DP    +D      +      SH   L SS T   +  G       +D
Sbjct: 30  DDHH---HQHNNDPIGMAMDQYTQLHIFNPFSSSHFPPLSSSLTTTTLLSGDQED--DED 89

Query: 118 EEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLRRERISEK 177
           EEEP EELGAMKEMMYKIAAMQ VDIDP+T++KPKRRNVRISDDPQS+AAR RRERISE+
Sbjct: 90  EEEPLEELGAMKEMMYKIAAMQSVDIDPATVKKPKRRNVRISDDPQSVAARHRRERISER 149

Query: 178 IRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSS---QPPQQPPTSGGAAAAGG 237
           IRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL ++    PP  PP    + A   
Sbjct: 150 IRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLNNNTGYTPP--PPQDQASQAVTT 207

BLAST of Cla97C01G014450 vs. TAIR 10
Match: AT4G00120.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 155.2 bits (391), Expect = 7.1e-38
Identity = 76/100 (76.00%), Positives = 89/100 (89.00%), Query Frame = 0

Query: 113 DDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLRRERIS 172
           D++EE +E++ AMKEM Y IA MQPVDIDP+T+ KP RRNVRISDDPQ++ AR RRERIS
Sbjct: 76  DEDEEYDEDMDAMKEMQYMIAVMQPVDIDPATVPKPNRRNVRISDDPQTVVARRRRERIS 135

Query: 173 EKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQ 213
           EKIRIL+R+VPGG KMDTASMLDEAIRY KFLKRQ+R+LQ
Sbjct: 136 EKIRILKRIVPGGAKMDTASMLDEAIRYTKFLKRQVRILQ 175

BLAST of Cla97C01G014450 vs. TAIR 10
Match: AT5G67060.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 141.4 bits (355), Expect = 1.1e-33
Identity = 72/110 (65.45%), Positives = 88/110 (80.00%), Query Frame = 0

Query: 122 LGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLRRERISEKIRILQRL 181
           + AM+EM+++IA MQP+ IDP  ++ PKRRNVRIS DPQS+AAR RRERISE+IRILQRL
Sbjct: 95  MAAMREMIFRIAVMQPIHIDPEAVKPPKRRNVRISKDPQSVAARHRRERISERIRILQRL 154

Query: 182 VPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQPPQQPPTSGGAAAAGG 232
           VPGGTKMDTASMLDEAI YVKFLK+Q++ L+     +Q   +GG    GG
Sbjct: 155 VPGGTKMDTASMLDEAIHYVKFLKKQVQSLE-----EQAVVTGGGGGGGG 199

BLAST of Cla97C01G014450 vs. TAIR 10
Match: AT3G50330.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 128.3 bits (321), Expect = 9.3e-30
Identity = 90/201 (44.78%), Positives = 118/201 (58.71%), Query Frame = 0

Query: 32  SSPPSPSSMDPHHLPNPSSHLHHPAMEDHHIDLHHHHHDPDLDSLWPSLLPFQLPDSHDQ 91
           +S P+P   +PH++   S    HP   +       H H P     +   +P   P  + +
Sbjct: 24  NSNPNP---NPHNIMMLSESNTHPFFFN-----PTHSHLP-----FDQTMPHHQPGLNFR 83

Query: 92  QLPSSSTHLLIGYGTPSSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRR 151
             PS S+ L      P    G  +      + AM+EM+++IA MQP+ IDP +++ PKR+
Sbjct: 84  YAPSPSSSL------PEKRGGCSD---NANMAAMREMIFRIAVMQPIHIDPESVKPPKRK 143

Query: 152 NVRISDDPQSIAARLRRERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL 211
           NVRIS DPQS+AAR RRERISE+IRILQRLVPGGTKMDTASMLDEAI YVKFLK+Q++ L
Sbjct: 144 NVRISKDPQSVAARHRRERISERIRILQRLVPGGTKMDTASMLDEAIHYVKFLKKQVQSL 198

Query: 212 QSSQPPQQPPTSGGAAAAGGG 233
           +           GG  A  GG
Sbjct: 204 EE----HAVVNGGGMTAVAGG 198

BLAST of Cla97C01G014450 vs. TAIR 10
Match: AT3G21330.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 125.9 bits (315), Expect = 4.6e-29
Identity = 63/109 (57.80%), Positives = 83/109 (76.15%), Query Frame = 0

Query: 108 SSGTGDDEEEPEEELGAMKEMMYKIAAMQPVDIDPSTIRKPKRRNVRISDDPQSIAARLR 167
           S+   D+ E   E +  MKEM+Y+ AA +PV+     + KPKR+NV+IS DPQ++AAR R
Sbjct: 228 STCLSDNVEPDAEAIAQMKEMIYRAAAFRPVNFGLEIVEKPKRKNVKISTDPQTVAARQR 287

Query: 168 RERISEKIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLLQSSQP 217
           RERISEKIR+LQ LVPGGTKMDTASMLDEA  Y+KFL+ Q++ L++ +P
Sbjct: 288 RERISEKIRVLQTLVPGGTKMDTASMLDEAANYLKFLRAQVKALENLRP 336

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038882292.1 | 2.4e-104 | 92.27 | transcription factor HEC3-like [Benincasa hispida]
XP_004148648.2 | 7.2e-101 | 87.95 | transcription factor HEC3 [Cucumis sativus] >KGN48364.1 hypothetical protein Csa...
XP_016899193.1 | 1.3e-97 | 87.05 | PREDICTED: transcription factor HEC3-like [Cucumis melo] >KAA0025535.1 transcrip...
KAG6604279.1 | 2.3e-91 | 83.12 | Transcription factor IND, partial [Cucurbita argyrosperma subsp. sororia] >KAG70...
XP_022950638.1 | 6.8e-91 | 81.70 | transcription factor IND-like [Cucurbita moschata]
vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9LXD8 | 8.4e-44 | 60.54 | Transcription factor HEC3 OS=Arabidopsis thaliana OX=3702 GN=HEC3 PE=1 SV=1
O81313 | 1.0e-36 | 76.00 | Transcription factor IND OS=Arabidopsis thaliana OX=3702 GN=IND PE=1 SV=3
Q9FHA7 | 1.5e-32 | 65.45 | Transcription factor HEC1 OS=Arabidopsis thaliana OX=3702 GN=HEC1 PE=1 SV=1
Q9SND4 | 1.3e-28 | 44.78 | Transcription factor HEC2 OS=Arabidopsis thaliana OX=3702 GN=HEC2 PE=1 SV=1
Q8S3D2 | 6.5e-28 | 57.80 | Transcription factor bHLH87 OS=Arabidopsis thaliana OX=3702 GN=BHLH87 PE=1 SV=1
vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KHS6 | 3.5e-101 | 87.95 | BHLH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G483450 PE=4 S...
A0A5A7SM52 | 6.2e-98 | 87.05 | Transcription factor HEC3-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A1S4DTA8 | 6.2e-98 | 87.05 | transcription factor HEC3-like OS=Cucumis melo OX=3656 GN=LOC103485271 PE=4 SV=1
A0A6J1GFF7 | 3.3e-91 | 81.70 | transcription factor IND-like OS=Cucurbita moschata OX=3662 GN=LOC111453678 PE=4...
A0A6J1ISW8 | 2.4e-89 | 81.82 | transcription factor IND-like OS=Cucurbita maxima OX=3661 GN=LOC111478125 PE=4 S...
vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G09750.1 | 6.0e-45 | 60.54 | basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G00120.1 | 7.1e-38 | 76.00 | basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G67060.1 | 1.1e-33 | 65.45 | basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G50330.1 | 9.3e-30 | 44.78 | basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G21330.1 | 4.6e-29 | 57.80 | basic helix-loop-helix (bHLH) DNA-binding superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain | SMART | SM00353 | finulus | coord: 161..210; e-value: 6.1E-12; score: 55.7
IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain | PFAM | PF00010 | HLH | coord: 164..204; e-value: 1.7E-6; score: 27.9
IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain | PROSITE | PS50888 | BHLH | coord: 155..204; score: 14.982968
IPR036638 | Helix-loop-helix DNA-binding domain superfamily | GENE3D | 4.10.280.10 | - | coord: 158..219; e-value: 5.9E-11; score: 44.3
IPR036638 | Helix-loop-helix DNA-binding domain superfamily | SUPERFAMILY | 47459 | HLH, helix-loop-helix DNA-binding domain | coord: 160..214
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 31..55
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 88..105
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 213..261
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 239..261
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 88..118
None | No IPR available | PANTHER | PTHR45914 | TRANSCRIPTION FACTOR HEC3-RELATED | coord: 28..255
None | No IPR available | PANTHER | PTHR45914:SF7 | TRANSCRIPTION FACTOR HEC3-RELATED | coord: 28..255
None | No IPR available | CDD | cd11454 | bHLH_AtIND_like | coord: 156..211; e-value: 2.39869E-36; score: 121.73
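
Several of the MobiDB-lite disorder predictions listed above overlap one another. The following sketch collapses them into non-redundant regions, assuming the `coord` values are 1-based inclusive residue positions on the protein (the page does not state this). The helper name `merge_intervals` is hypothetical.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals with inclusive ends."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# MobiDB-lite coordinates as listed in the InterPro table on this page.
disorder = [(31, 55), (88, 105), (213, 261), (239, 261), (88, 118)]
regions = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in regions)
print(regions, covered)
```

Only intervals that share at least one residue are merged; regions that merely touch (for example 55 and 56) are kept separate in this sketch.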

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C01G014450.2 | Cla97C01G014450.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0046983 | protein dimerization activity