Tan0020793 (gene) Snake gourd v1

Overview
Name: Tan0020793
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: transcription factor bHLH52
Location: LG05: 79504350 .. 79505240 (+)
RNA-Seq Expression: Tan0020793
Synteny: Tan0020793
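
The Location field packs a linkage group, a start, an end, and a strand into one string. A minimal Python sketch of how it could be unpacked is shown here. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'LG05: 79504350 .. 79505240 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("LG05: 79504350 .. 79505240 (+)")
print(seqid, strand, span_length(start, end))  # LG05 + 891
```

Under that assumption the feature spans 891 bp on the plus strand of LG05.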
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGCTCTCACCTTTTACTCCAATTTTTACTCTGACCCTTCTTCAATCACTCCCCAATTCCATCATTTCTCCCCTGAAATCTCGCCGGAGCTCTTTTATCTCCCTCCCCTGCAGCTCGTCCCCGACCCCGACTTCGATTACGTCTCCTCCGCGGTTGATGACTCTGTTTTTTTCCCGAATGTCGCTCCGTTCTTTGACGACGCCTCGTCGTTTCTCTTCTCCGATGTTTGCCCGTGCTTCTCCGCTCCGGTCGCCAATGAATTTGTTCCCGTCTCGGCTGAATTCTTCCCTGTTGATGAATTTGAGTTCCATTGCTCCAAACGCCAGAGAGTTGTGCTGGAGCAGAGTTTTTGCTGTGGCGGTGTTGGTGGTGATGGCAATGTGGGTGGCGGAGGAGGCTACTTTCCTCCGCCGGAGTTGTTTTCTGGTATGTGGGATGGCCGAAGGGACAATGCTGAACTGATGAACAATGGTGGTTGCTCGAAGACTAAACCGTCGACGAGTAACAACTTATCGGCACAGACAATTGCCGCCCGGGAACGGCGGAGGAAAATCACGGCAAAGACGCAGGAGCTTGGAGAGCTGGTTCCGGGCGGCAGCAAGATGAACACTGCTGAAATGCTGAATTCTGCGTTCAGGTATGTAAAATTCCTGCAAGCCCAAGTCGCCATTTTGCAACTCAAGCAAGAAACAGAGCAAGAACAAGAACAACAAGAAACAGAGGATCTTCGGATTCTTGAATCCACAATGATTCAAGAGAAATTATACTCAGAAGAGCAGTGTTTAGTCCCAAAAGGGTTCGTCCAAAATCTTGCCAATTTTCCAGAGATTCAATCCCATCCGACCATTTTCAACTCCATCAATCAGATCCTTCAAAAGAGCAGCTAG

mRNA sequence

ATGGCGGCTCTCACCTTTTACTCCAATTTTTACTCTGACCCTTCTTCAATCACTCCCCAATTCCATCATTTCTCCCCTGAAATCTCGCCGGAGCTCTTTTATCTCCCTCCCCTGCAGCTCGTCCCCGACCCCGACTTCGATTACGTCTCCTCCGCGGTTGATGACTCTGTTTTTTTCCCGAATGTCGCTCCGTTCTTTGACGACGCCTCGTCGTTTCTCTTCTCCGATGTTTGCCCGTGCTTCTCCGCTCCGGTCGCCAATGAATTTGTTCCCGTCTCGGCTGAATTCTTCCCTGTTGATGAATTTGAGTTCCATTGCTCCAAACGCCAGAGAGTTGTGCTGGAGCAGAGTTTTTGCTGTGGCGGTGTTGGTGGTGATGGCAATGTGGGTGGCGGAGGAGGCTACTTTCCTCCGCCGGAGTTGTTTTCTGGTATGTGGGATGGCCGAAGGGACAATGCTGAACTGATGAACAATGGTGGTTGCTCGAAGACTAAACCGTCGACGAGTAACAACTTATCGGCACAGACAATTGCCGCCCGGGAACGGCGGAGGAAAATCACGGCAAAGACGCAGGAGCTTGGAGAGCTGGTTCCGGGCGGCAGCAAGATGAACACTGCTGAAATGCTGAATTCTGCGTTCAGGTATGTAAAATTCCTGCAAGCCCAAGTCGCCATTTTGCAACTCAAGCAAGAAACAGAGCAAGAACAAGAACAACAAGAAACAGAGGATCTTCGGATTCTTGAATCCACAATGATTCAAGAGAAATTATACTCAGAAGAGCAGTGTTTAGTCCCAAAAGGGTTCGTCCAAAATCTTGCCAATTTTCCAGAGATTCAATCCCATCCGACCATTTTCAACTCCATCAATCAGATCCTTCAAAAGAGCAGCTAG

Coding sequence (CDS)

ATGGCGGCTCTCACCTTTTACTCCAATTTTTACTCTGACCCTTCTTCAATCACTCCCCAATTCCATCATTTCTCCCCTGAAATCTCGCCGGAGCTCTTTTATCTCCCTCCCCTGCAGCTCGTCCCCGACCCCGACTTCGATTACGTCTCCTCCGCGGTTGATGACTCTGTTTTTTTCCCGAATGTCGCTCCGTTCTTTGACGACGCCTCGTCGTTTCTCTTCTCCGATGTTTGCCCGTGCTTCTCCGCTCCGGTCGCCAATGAATTTGTTCCCGTCTCGGCTGAATTCTTCCCTGTTGATGAATTTGAGTTCCATTGCTCCAAACGCCAGAGAGTTGTGCTGGAGCAGAGTTTTTGCTGTGGCGGTGTTGGTGGTGATGGCAATGTGGGTGGCGGAGGAGGCTACTTTCCTCCGCCGGAGTTGTTTTCTGGTATGTGGGATGGCCGAAGGGACAATGCTGAACTGATGAACAATGGTGGTTGCTCGAAGACTAAACCGTCGACGAGTAACAACTTATCGGCACAGACAATTGCCGCCCGGGAACGGCGGAGGAAAATCACGGCAAAGACGCAGGAGCTTGGAGAGCTGGTTCCGGGCGGCAGCAAGATGAACACTGCTGAAATGCTGAATTCTGCGTTCAGGTATGTAAAATTCCTGCAAGCCCAAGTCGCCATTTTGCAACTCAAGCAAGAAACAGAGCAAGAACAAGAACAACAAGAAACAGAGGATCTTCGGATTCTTGAATCCACAATGATTCAAGAGAAATTATACTCAGAAGAGCAGTGTTTAGTCCCAAAAGGGTTCGTCCAAAATCTTGCCAATTTTCCAGAGATTCAATCCCATCCGACCATTTTCAACTCCATCAATCAGATCCTTCAAAAGAGCAGCTAG

Protein sequence

MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFPNVAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFCCGGVGGDGNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSKTKPSTSNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQLKQETEQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSINQILQKSS
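
The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard nuclear genetic code; `translate` is a hypothetical helper and not part of the database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last three codons of the CDS listed above.
print(translate("ATGGCGGCTCTCACCTTTTACTCC"))  # MAALTFYS
print(translate("AGCAGCTAG"))                 # SS
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above if the annotation is consistent.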
Homology
BLAST of Tan0020793 vs. ExPASy Swiss-Prot
Match: Q84RD0 (Transcription factor bHLH53 OS=Arabidopsis thaliana OX=3702 GN=BHLH53 PE=2 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 1.1e-23
Identity = 67/128 (52.34%), Positives = 85/128 (66.41%), Query Frame = 0

Query: 167 STSNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAIL 226
           S    LS+Q+IAAR RRR+I  KT ELG+L+PGG+K+NTAEM  +A +YVKFLQ+QV IL
Sbjct: 160 SKKPTLSSQSIAARGRRRRIAEKTHELGKLIPGGNKLNTAEMFQAAAKYVKFLQSQVGIL 219

Query: 227 QLKQETEQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFN 286
           QL Q T++     + E   +LES  IQEKL +EE CLVP   VQ+L     I   P I  
Sbjct: 220 QLMQTTKKGSSNVQMETQYLLESQAIQEKLSTEEVCLVPCEMVQDLTTEETICRTPNISR 279

Query: 287 SINQILQK 295
            IN++L K
Sbjct: 280 EINKLLSK 287

BLAST of Tan0020793 vs. ExPASy Swiss-Prot
Match: Q9SA82 (Transcription factor bHLH52 OS=Arabidopsis thaliana OX=3702 GN=BHLH52 PE=2 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 1.0e-21
Identity = 68/135 (50.37%), Positives = 89/135 (65.93%), Query Frame = 0

Query: 160 GCSKTKPSTSNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFL 219
           G ++   +    LSAQ+IAAR+RRR+IT KTQELG+L+PG  K NTAEM N+A +YVKFL
Sbjct: 124 GWTEQGDTKKRELSAQSIAARKRRRRITEKTQELGKLIPGSQKHNTAEMFNAAAKYVKFL 183

Query: 220 QAQVAILQLKQETEQEQEQQET--EDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPE 279
           QAQ+ ILQLKQ   Q  +  +   E   +L S  IQEKL +EE C+VP+  VQ L     
Sbjct: 184 QAQIEILQLKQTKMQTLDSSKVGREMQFLLGSQEIQEKLSTEEVCVVPREMVQVLKAEEC 243

Query: 280 IQSHPTIFNSINQIL 293
           I ++P I   IN++L
Sbjct: 244 ILTNPKISRDINKLL 258

BLAST of Tan0020793 vs. ExPASy Swiss-Prot
Match: Q7XAQ6 (Transcription factor LAX PANICLE 1 OS=Oryza sativa subsp. japonica OX=39947 GN=LAX1 PE=1 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 5.0e-08
Identity = 37/84 (44.05%), Positives = 51/84 (60.71%), Query Frame = 0

Query: 144 GMWDGRRDNAELMNNGGCSKTKPSTSNNLSAQTIAARERRRKITAKTQELGELVPGGSKM 203
           G+ +GR      M  GG  + +P    +   Q++AARERR +I+ + + L  LVPGGSKM
Sbjct: 22  GLGEGR------MRGGG--RRRPGAKLSTDPQSVAARERRHRISDRFRVLRSLVPGGSKM 81

Query: 204 NTAEMLNSAFRYVKFLQAQVAILQ 228
           +T  ML  A  YVKFL+AQV + Q
Sbjct: 82  DTVSMLEQAIHYVKFLKAQVTLHQ 97

BLAST of Tan0020793 vs. ExPASy Swiss-Prot
Match: O81313 (Transcription factor IND OS=Arabidopsis thaliana OX=3702 GN=IND PE=1 SV=3)

HSP 1 Score: 59.7 bits (143), Expect = 6.5e-08
Identity = 30/53 (56.60%), Positives = 39/53 (73.58%), Query Frame = 0

Query: 175 QTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQ 228
           QT+ AR RR +I+ K + L  +VPGG+KM+TA ML+ A RY KFL+ QV ILQ
Sbjct: 123 QTVVARRRRERISEKIRILKRIVPGGAKMDTASMLDEAIRYTKFLKRQVRILQ 175

BLAST of Tan0020793 vs. ExPASy Swiss-Prot
Match: Q8S3D2 (Transcription factor bHLH87 OS=Arabidopsis thaliana OX=3702 GN=BHLH87 PE=1 SV=1)

HSP 1 Score: 58.9 bits (141), Expect = 1.1e-07
Identity = 33/66 (50.00%), Positives = 45/66 (68.18%), Query Frame = 0

Query: 165 KPSTSN---NLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQA 224
           KP   N   +   QT+AAR+RR +I+ K + L  LVPGG+KM+TA ML+ A  Y+KFL+A
Sbjct: 267 KPKRKNVKISTDPQTVAARQRRERISEKIRVLQTLVPGGTKMDTASMLDEAANYLKFLRA 326

Query: 225 QVAILQ 228
           QV  L+
Sbjct: 327 QVKALE 332

BLAST of Tan0020793 vs. NCBI nr
Match: XP_038894685.1 (transcription factor bHLH52 [Benincasa hispida])

HSP 1 Score: 446.8 bits (1148), Expect = 1.4e-121
Identity = 246/312 (78.85%), Positives = 260/312 (83.33%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDP-DFDYVSSAVDDSVFF 60
           MAALTFYSNFYSDPSS   QFH+FSPE SP+L Y PP QL+ DP  FDY    VDDSVF+
Sbjct: 1   MAALTFYSNFYSDPSSYYHQFHYFSPEFSPDLCYFPPPQLLHDPIAFDY----VDDSVFY 60

Query: 61  P-NVAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSF 120
           P   APFFDDA  FLFSD  P FS P  +EF+PVS EFFP DEFEFHC KRQR VLEQSF
Sbjct: 61  PTTAAPFFDDAFPFLFSDAYPYFSTPAVDEFLPVSPEFFPCDEFEFHCPKRQRAVLEQSF 120

Query: 121 CC--GGVGGDGNVGGGGGYFPPP-------ELFSGMWDGRRDNAELMNNGGCSKTKPS-- 180
            C  GG GG GNVGGGGGYFPPP       ELFS  WDGR DNAE+MN+GGC KTKP+  
Sbjct: 121 RCGDGGGGGGGNVGGGGGYFPPPPTMTMTSELFSSAWDGRMDNAEMMNDGGCLKTKPTPV 180

Query: 181 --TSNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAI 240
             +SNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAF+YVKFLQAQVAI
Sbjct: 181 PLSSNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFKYVKFLQAQVAI 240

Query: 241 LQLKQETEQEQEQ-QETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTI 297
           LQLKQETEQEQEQ +ETEDLRILESTMIQEKLYSEE+CLVPKGFVQNLANFPEIQSHP+I
Sbjct: 241 LQLKQETEQEQEQEEETEDLRILESTMIQEKLYSEEKCLVPKGFVQNLANFPEIQSHPSI 300

BLAST of Tan0020793 vs. NCBI nr
Match: XP_023001636.1 (transcription factor bHLH52 [Cucurbita maxima])

HSP 1 Score: 430.3 bits (1105), Expect = 1.4e-116
Identity = 232/308 (75.32%), Positives = 253/308 (82.14%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP 60
           MAAL+FYSNF SDPSS   QFH FSPEISPELF+LPP QL+PDP FDY+ +AVD SVFFP
Sbjct: 1   MAALSFYSNFSSDPSSYHHQFHPFSPEISPELFHLPPPQLLPDPTFDYLPAAVDHSVFFP 60

Query: 61  N-VAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFC 120
              APFFDD S F+FSDV P F  P  +EFVPVSAEFFP DEFEF+C KRQRVV+EQSFC
Sbjct: 61  TATAPFFDDVSPFVFSDVYPYFPTPAVDEFVPVSAEFFPCDEFEFYCPKRQRVVMEQSFC 120

Query: 121 CGGVGGDGNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSKTKPS-----------T 180
            G  GG GNVGG GGYFP PE FS  WDGR  NAE+MN G C K  P+           +
Sbjct: 121 YG--GGGGNVGGEGGYFPLPEFFSDQWDGRLGNAEMMNKGDCLKGTPTLVPLPLPSPSPS 180

Query: 181 SNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQL 240
           SNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEML+SAF+YVKFLQAQVAILQ+
Sbjct: 181 SNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLSSAFKYVKFLQAQVAILQV 240

Query: 241 KQETEQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSI 297
           KQETEQEQEQ ETEDL+ILESTMIQEKLY+EE+CLVPKGFVQNLAN  EIQS+P+I NSI
Sbjct: 241 KQETEQEQEQTETEDLQILESTMIQEKLYAEEKCLVPKGFVQNLANSSEIQSNPSILNSI 300

BLAST of Tan0020793 vs. NCBI nr
Match: XP_022927111.1 (transcription factor bHLH52-like [Cucurbita moschata])

HSP 1 Score: 427.2 bits (1097), Expect = 1.2e-115
Identity = 230/304 (75.66%), Positives = 250/304 (82.24%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP 60
           MAAL+FYSNF SDPSS   QFH FSPEISPELF+LPP QL+PDP FDY+ +AVD SVFFP
Sbjct: 1   MAALSFYSNFSSDPSSYHHQFHPFSPEISPELFHLPPPQLLPDPAFDYLHAAVDHSVFFP 60

Query: 61  N-VAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFC 120
              APFFDD S F+FSDV P F  P  + FVPVSAEFFP DEFEF+C KRQRVV+EQSFC
Sbjct: 61  TATAPFFDDVSPFVFSDVYPYFPTPAVDSFVPVSAEFFPCDEFEFYCPKRQRVVMEQSFC 120

Query: 121 CGGVGGDGNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSK-------TKPSTSNNL 180
            G  GG GNVGG GGYFP PE FS  WDGR  NAELMN G C K           +SNNL
Sbjct: 121 YG--GGGGNVGGDGGYFPLPEFFSDQWDGRLGNAELMNKGDCLKGTLTPAPLPAPSSNNL 180

Query: 181 SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQLKQET 240
           SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEML+SAF+YVKFLQAQVAILQ+KQET
Sbjct: 181 SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLSSAFKYVKFLQAQVAILQVKQET 240

Query: 241 EQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSINQIL 297
           EQEQE+ ETEDL+ILESTMIQEKLY+EE+CLVPKGFVQNL N PEIQS+P+I NSINQIL
Sbjct: 241 EQEQERTETEDLQILESTMIQEKLYAEEKCLVPKGFVQNLVNSPEIQSNPSILNSINQIL 300

BLAST of Tan0020793 vs. NCBI nr
Match: KAG6583823.1 (Transcription factor basic helix-loop-helix 52, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 426.4 bits (1095), Expect = 2.0e-115
Identity = 228/300 (76.00%), Positives = 250/300 (83.33%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP 60
           MAAL+FYSNF SDPSS   QFH FSPEISPELF+LPP QL+PDP FDY+ +AVD SVFFP
Sbjct: 1   MAALSFYSNFSSDPSSYHHQFHPFSPEISPELFHLPPPQLLPDPAFDYLPAAVDHSVFFP 60

Query: 61  N-VAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFC 120
              APFF+D S F+FSDV P    P  + FVPVSAEFFP DEFEF+C KRQRVV+EQSFC
Sbjct: 61  TATAPFFNDVSPFVFSDVYPYLPTPAVDSFVPVSAEFFPCDEFEFYCPKRQRVVMEQSFC 120

Query: 121 CGGVGGDGNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGC---SKTKPSTSNNLSAQT 180
            G  GG GNVGG GGYFP PE FS  WDGR  NAELMN G C   ++T   +SNNLSAQT
Sbjct: 121 YG--GGGGNVGGDGGYFPLPEFFSDQWDGRLGNAELMNKGDCLKGTRTPAPSSNNLSAQT 180

Query: 181 IAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQLKQETEQEQ 240
           IAARERRRKITAKTQELGELVPGGSKMNTAEML+SAF+YVKFLQAQVAILQ+KQETEQEQ
Sbjct: 181 IAARERRRKITAKTQELGELVPGGSKMNTAEMLSSAFKYVKFLQAQVAILQVKQETEQEQ 240

Query: 241 EQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSINQILQKSS 297
           E  ETEDL+ILESTMIQEKLY+EE+CLVPKGFVQNL N PEIQS+P+I NSINQIL  +S
Sbjct: 241 EHTETEDLQILESTMIQEKLYAEEKCLVPKGFVQNLVNSPEIQSNPSILNSINQILHMNS 298

BLAST of Tan0020793 vs. NCBI nr
Match: KAG7019449.1 (Transcription factor bHLH52, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 425.6 bits (1093), Expect = 3.4e-115
Identity = 230/304 (75.66%), Positives = 249/304 (81.91%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP 60
           MAAL+FYSNF SDPSS   QFH FSPEISPELF+LPP QL+PDP  DY+ +AVD SVFFP
Sbjct: 1   MAALSFYSNFSSDPSSYHHQFHPFSPEISPELFHLPPPQLLPDPVLDYLPAAVDHSVFFP 60

Query: 61  NV-APFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFC 120
            V APFFDD S F+FSDV P F  P  + FVPVSAEFFP DEFEF+C KRQRVV+EQSFC
Sbjct: 61  TVTAPFFDDVSPFVFSDVYPYFPTPAVDSFVPVSAEFFPCDEFEFYCPKRQRVVMEQSFC 120

Query: 121 CGGVGGDGNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSK-------TKPSTSNNL 180
            G  GG GNVGG GGYFP PE FS  WDGR  NAELMN G C K           +SNNL
Sbjct: 121 YG--GGGGNVGGDGGYFPLPEFFSDQWDGRLGNAELMNKGDCLKGTLTPAPLPAPSSNNL 180

Query: 181 SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQLKQET 240
           SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEML+SAF+YVKFLQAQVAILQ+KQET
Sbjct: 181 SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLSSAFKYVKFLQAQVAILQVKQET 240

Query: 241 EQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSINQIL 297
           EQEQE  ETEDL+ILESTMIQEKLY+EE+CLVPKGFVQNL N PEIQS+P+I NSINQIL
Sbjct: 241 EQEQEHTETEDLQILESTMIQEKLYAEEKCLVPKGFVQNLVNSPEIQSNPSILNSINQIL 300

BLAST of Tan0020793 vs. ExPASy TrEMBL
Match: A0A6J1KJ69 (transcription factor bHLH52 OS=Cucurbita maxima OX=3661 GN=LOC111495708 PE=4 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 6.7e-117
Identity = 232/308 (75.32%), Positives = 253/308 (82.14%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP 60
           MAAL+FYSNF SDPSS   QFH FSPEISPELF+LPP QL+PDP FDY+ +AVD SVFFP
Sbjct: 1   MAALSFYSNFSSDPSSYHHQFHPFSPEISPELFHLPPPQLLPDPTFDYLPAAVDHSVFFP 60

Query: 61  N-VAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFC 120
              APFFDD S F+FSDV P F  P  +EFVPVSAEFFP DEFEF+C KRQRVV+EQSFC
Sbjct: 61  TATAPFFDDVSPFVFSDVYPYFPTPAVDEFVPVSAEFFPCDEFEFYCPKRQRVVMEQSFC 120

Query: 121 CGGVGGDGNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSKTKPS-----------T 180
            G  GG GNVGG GGYFP PE FS  WDGR  NAE+MN G C K  P+           +
Sbjct: 121 YG--GGGGNVGGEGGYFPLPEFFSDQWDGRLGNAEMMNKGDCLKGTPTLVPLPLPSPSPS 180

Query: 181 SNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQL 240
           SNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEML+SAF+YVKFLQAQVAILQ+
Sbjct: 181 SNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLSSAFKYVKFLQAQVAILQV 240

Query: 241 KQETEQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSI 297
           KQETEQEQEQ ETEDL+ILESTMIQEKLY+EE+CLVPKGFVQNLAN  EIQS+P+I NSI
Sbjct: 241 KQETEQEQEQTETEDLQILESTMIQEKLYAEEKCLVPKGFVQNLANSSEIQSNPSILNSI 300

BLAST of Tan0020793 vs. ExPASy TrEMBL
Match: A0A6J1EGS4 (transcription factor bHLH52-like OS=Cucurbita moschata OX=3662 GN=LOC111434050 PE=4 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 5.7e-116
Identity = 230/304 (75.66%), Positives = 250/304 (82.24%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP 60
           MAAL+FYSNF SDPSS   QFH FSPEISPELF+LPP QL+PDP FDY+ +AVD SVFFP
Sbjct: 1   MAALSFYSNFSSDPSSYHHQFHPFSPEISPELFHLPPPQLLPDPAFDYLHAAVDHSVFFP 60

Query: 61  N-VAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFC 120
              APFFDD S F+FSDV P F  P  + FVPVSAEFFP DEFEF+C KRQRVV+EQSFC
Sbjct: 61  TATAPFFDDVSPFVFSDVYPYFPTPAVDSFVPVSAEFFPCDEFEFYCPKRQRVVMEQSFC 120

Query: 121 CGGVGGDGNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSK-------TKPSTSNNL 180
            G  GG GNVGG GGYFP PE FS  WDGR  NAELMN G C K           +SNNL
Sbjct: 121 YG--GGGGNVGGDGGYFPLPEFFSDQWDGRLGNAELMNKGDCLKGTLTPAPLPAPSSNNL 180

Query: 181 SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQLKQET 240
           SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEML+SAF+YVKFLQAQVAILQ+KQET
Sbjct: 181 SAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLSSAFKYVKFLQAQVAILQVKQET 240

Query: 241 EQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSINQIL 297
           EQEQE+ ETEDL+ILESTMIQEKLY+EE+CLVPKGFVQNL N PEIQS+P+I NSINQIL
Sbjct: 241 EQEQERTETEDLQILESTMIQEKLYAEEKCLVPKGFVQNLVNSPEIQSNPSILNSINQIL 300

BLAST of Tan0020793 vs. ExPASy TrEMBL
Match: A0A6J1GUV4 (transcription factor bHLH52-like OS=Cucurbita moschata OX=3662 GN=LOC111457139 PE=4 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 6.8e-109
Identity = 225/298 (75.50%), Positives = 241/298 (80.87%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP 60
           MAALTF+SNFYSDPSSI PQF +FSPE+SP+LF+LPP      P  DYV +  DDSVF P
Sbjct: 1   MAALTFFSNFYSDPSSINPQFPYFSPEMSPDLFFLPP------PHLDYVPAMPDDSVFIP 60

Query: 61  NVAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFCC 120
            VAPFFDDAS FLFSDV P F           SA FFP DEFEFH SKRQR+VLEQSFC 
Sbjct: 61  TVAPFFDDASPFLFSDVYPYF-----------SAGFFPFDEFEFHYSKRQRIVLEQSFCY 120

Query: 121 GGVGGD-GNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSKTKP--STSNNLSAQTI 180
           GGVGG+ G  GGGGGYFP     SG+ DGR DN E+MNNG CSKTK   S SN LS QTI
Sbjct: 121 GGVGGNVGGGGGGGGYFP-----SGLLDGRVDNVEMMNNGNCSKTKQLVSPSNTLSVQTI 180

Query: 181 AARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQLKQETEQEQE 240
           AARERRRKITAKTQELGEL+PGG KMNTAEMLNSAF+YVKFLQAQVAILQLKQETE E E
Sbjct: 181 AARERRRKITAKTQELGELIPGGVKMNTAEMLNSAFKYVKFLQAQVAILQLKQETEHELE 240

Query: 241 QQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSINQILQKS 296
           +QETEDL+ILES MIQEKLYSEEQCLVPKGFVQNLANFPEIQSHP+I NSINQIL+ S
Sbjct: 241 EQETEDLQILESAMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPSISNSINQILKTS 276

BLAST of Tan0020793 vs. ExPASy TrEMBL
Match: A0A6J1K695 (uncharacterized protein LOC111490459 OS=Cucurbita maxima OX=3661 GN=LOC111490459 PE=4 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 4.1e-106
Identity = 221/301 (73.42%), Positives = 239/301 (79.40%), Query Frame = 0

Query: 1   MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP 60
           MAALTF+SNFYSDPSSI PQF +FSPE+SP+LF+LPP      P  DYV +  DDSV FP
Sbjct: 1   MAALTFFSNFYSDPSSINPQFPYFSPEMSPDLFFLPP------PHLDYVPAMPDDSVLFP 60

Query: 61  NVAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFCC 120
            VAPFFDDAS FLFSDV P F           SAE FP DEFEFH SKRQR+VLEQSFC 
Sbjct: 61  TVAPFFDDASPFLFSDVYPYF-----------SAEIFPFDEFEFHYSKRQRIVLEQSFCY 120

Query: 121 GGVGGD----GNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSKTK--PSTSNNLSA 180
           GGVGG+       GGGGGYFP      G+ DGR DNAE+MNN  CSKTK  PS+SN  S 
Sbjct: 121 GGVGGNVGGGCGGGGGGGYFP-----LGLLDGRMDNAEMMNNSNCSKTKQLPSSSNASSV 180

Query: 181 QTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQLKQETEQ 240
           QTIAARERRR+ITAKTQELGEL+PGG KMNTAEMLNSAF+YVKFLQAQVAILQLKQETE 
Sbjct: 181 QTIAARERRRRITAKTQELGELIPGGVKMNTAEMLNSAFKYVKFLQAQVAILQLKQETEH 240

Query: 241 EQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSINQILQK 296
           E E+QETEDL+IL+S MIQEKLYSEEQCLVPKGFVQNLANFPEIQSHP I NSINQIL+ 
Sbjct: 241 ELEEQETEDLQILDSAMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPFISNSINQILKT 279

BLAST of Tan0020793 vs. ExPASy TrEMBL
Match: A0A0A0LY56 (BHLH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G542450 PE=4 SV=1)

HSP 1 Score: 393.7 bits (1010), Expect = 7.0e-106
Identity = 222/316 (70.25%), Positives = 247/316 (78.16%), Query Frame = 0

Query: 1   MAALTFYSNFYS-DPSSITPQFHHFSPEISPELFYLPPLQLVPDP-DFDYVSSAVDDSVF 60
           MAALT+YSNFYS DPS     FH+FSPE SP+L Y+PP QL+  P  FDY    VDDS+F
Sbjct: 1   MAALTYYSNFYSPDPSPYYHHFHYFSPEFSPDLSYVPPPQLLNHPAAFDY----VDDSLF 60

Query: 61  FPNV-APFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQS 120
           +P      FDDA  FLFSD  PCFSAP  +EF+PVS++FFP DEFEFHC KRQR V E S
Sbjct: 61  YPTADTLLFDDALPFLFSDTYPCFSAPSVDEFLPVSSQFFPFDEFEFHCPKRQRAVFEHS 120

Query: 121 FCCGGVGGDGNVGGGG-----GYFPPP--------ELFSGMWDGRRDNAELMNNGGCSKT 180
           FCCGG  GDGNVGGGG     G+FP P        E+FSG WD R DNAE+ N+  C K+
Sbjct: 121 FCCGGGVGDGNVGGGGSGAGAGFFPSPPPPPPPLAEVFSGPWDSRMDNAEMRND--CLKS 180

Query: 181 K----PSTSNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQ 240
           +    PS+ NNLSAQTIAARERRRKIT KTQELGELVPGGSKMNTAEMLNSAF+YVKFLQ
Sbjct: 181 QPSPAPSSHNNLSAQTIAARERRRKITVKTQELGELVPGGSKMNTAEMLNSAFKYVKFLQ 240

Query: 241 AQVAILQLKQETEQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQS 297
           AQVAILQLKQET  EQE QETE+L ILESTMIQEKLYSEE+CLVPKGF+QNLA+FPEIQS
Sbjct: 241 AQVAILQLKQET--EQEGQETENLEILESTMIQEKLYSEEKCLVPKGFIQNLADFPEIQS 300

BLAST of Tan0020793 vs. TAIR 10
Match: AT2G34820.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 112.1 bits (279), Expect = 7.8e-25
Identity = 67/128 (52.34%), Positives = 85/128 (66.41%), Query Frame = 0

Query: 167 STSNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAIL 226
           S    LS+Q+IAAR RRR+I  KT ELG+L+PGG+K+NTAEM  +A +YVKFLQ+QV IL
Sbjct: 160 SKKPTLSSQSIAARGRRRRIAEKTHELGKLIPGGNKLNTAEMFQAAAKYVKFLQSQVGIL 219

Query: 227 QLKQETEQEQEQQETEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFN 286
           QL Q T++     + E   +LES  IQEKL +EE CLVP   VQ+L     I   P I  
Sbjct: 220 QLMQTTKKGSSNVQMETQYLLESQAIQEKLSTEEVCLVPCEMVQDLTTEETICRTPNISR 279

Query: 287 SINQILQK 295
            IN++L K
Sbjct: 280 EINKLLSK 287

BLAST of Tan0020793 vs. TAIR 10
Match: AT1G30670.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 105.5 bits (262), Expect = 7.3e-23
Identity = 68/135 (50.37%), Positives = 89/135 (65.93%), Query Frame = 0

Query: 160 GCSKTKPSTSNNLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFL 219
           G ++   +    LSAQ+IAAR+RRR+IT KTQELG+L+PG  K NTAEM N+A +YVKFL
Sbjct: 124 GWTEQGDTKKRELSAQSIAARKRRRRITEKTQELGKLIPGSQKHNTAEMFNAAAKYVKFL 183

Query: 220 QAQVAILQLKQETEQEQEQQET--EDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPE 279
           QAQ+ ILQLKQ   Q  +  +   E   +L S  IQEKL +EE C+VP+  VQ L     
Sbjct: 184 QAQIEILQLKQTKMQTLDSSKVGREMQFLLGSQEIQEKLSTEEVCVVPREMVQVLKAEEC 243

Query: 280 IQSHPTIFNSINQIL 293
           I ++P I   IN++L
Sbjct: 244 ILTNPKISRDINKLL 258

BLAST of Tan0020793 vs. TAIR 10
Match: AT4G00120.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 59.7 bits (143), Expect = 4.6e-09
Identity = 30/53 (56.60%), Positives = 39/53 (73.58%), Query Frame = 0

Query: 175 QTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQ 228
           QT+ AR RR +I+ K + L  +VPGG+KM+TA ML+ A RY KFL+ QV ILQ
Sbjct: 123 QTVVARRRRERISEKIRILKRIVPGGAKMDTASMLDEAIRYTKFLKRQVRILQ 175

BLAST of Tan0020793 vs. TAIR 10
Match: AT3G21330.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 58.9 bits (141), Expect = 7.9e-09
Identity = 33/66 (50.00%), Positives = 45/66 (68.18%), Query Frame = 0

Query: 165 KPSTSN---NLSAQTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQA 224
           KP   N   +   QT+AAR+RR +I+ K + L  LVPGG+KM+TA ML+ A  Y+KFL+A
Sbjct: 267 KPKRKNVKISTDPQTVAARQRRERISEKIRVLQTLVPGGTKMDTASMLDEAANYLKFLRA 326

Query: 225 QVAILQ 228
           QV  L+
Sbjct: 327 QVKALE 332

BLAST of Tan0020793 vs. TAIR 10
Match: AT5G09750.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 58.2 bits (139), Expect = 1.3e-08
Identity = 28/52 (53.85%), Positives = 40/52 (76.92%), Query Frame = 0

Query: 175 QTIAARERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAIL 227
           Q++AAR RR +I+ + + L  LVPGG+KM+TA ML+ A RYVKFL+ Q+ +L
Sbjct: 130 QSVAARHRRERISERIRILQRLVPGGTKMDTASMLDEAIRYVKFLKRQIRLL 181

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q84RD0          1.1e-23   52.34         Transcription factor bHLH53 OS=Arabidopsis thaliana OX=3702 GN=BHLH53 PE=2 SV=1
Q9SA82          1.0e-21   50.37         Transcription factor bHLH52 OS=Arabidopsis thaliana OX=3702 GN=BHLH52 PE=2 SV=1
Q7XAQ6          5.0e-08   44.05         Transcription factor LAX PANICLE 1 OS=Oryza sativa subsp. japonica OX=39947 GN=L...
O81313          6.5e-08   56.60         Transcription factor IND OS=Arabidopsis thaliana OX=3702 GN=IND PE=1 SV=3
Q8S3D2          1.1e-07   50.00         Transcription factor bHLH87 OS=Arabidopsis thaliana OX=3702 GN=BHLH87 PE=1 SV=1

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038894685.1  1.4e-121  78.85         transcription factor bHLH52 [Benincasa hispida]
XP_023001636.1  1.4e-116  75.32         transcription factor bHLH52 [Cucurbita maxima]
XP_022927111.1  1.2e-115  75.66         transcription factor bHLH52-like [Cucurbita moschata]
KAG6583823.1    2.0e-115  76.00         Transcription factor basic helix-loop-helix 52, partial [Cucurbita argyrosperma ...
KAG7019449.1    3.4e-115  75.66         Transcription factor bHLH52, partial [Cucurbita argyrosperma subsp. argyrosperma...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1KJ69      6.7e-117  75.32         transcription factor bHLH52 OS=Cucurbita maxima OX=3661 GN=LOC111495708 PE=4 SV=...
A0A6J1EGS4      5.7e-116  75.66         transcription factor bHLH52-like OS=Cucurbita moschata OX=3662 GN=LOC111434050 P...
A0A6J1GUV4      6.8e-109  75.50         transcription factor bHLH52-like OS=Cucurbita moschata OX=3662 GN=LOC111457139 P...
A0A6J1K695      4.1e-106  73.42         uncharacterized protein LOC111490459 OS=Cucurbita maxima OX=3661 GN=LOC111490459...
A0A0A0LY56      7.0e-106  70.25         BHLH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G542450 PE=4 S...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT2G34820.1     7.8e-25   52.34         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G30670.1     7.3e-23   50.37         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G00120.1     4.6e-09   56.60         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G21330.1     7.9e-09   50.00         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G09750.1     1.3e-08   53.85         basic helix-loop-helix (bHLH) DNA-binding superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                  Source       Source Term     Source Description                        Alignment
None       No IPR available                                 COILS        Coil            Coil                                      coord: 216..236
None       No IPR available                                 PANTHER      PTHR16223       TRANSCRIPTION FACTOR BHLH83-RELATED       coord: 15..287
None       No IPR available                                 PANTHER      PTHR16223:SF49  TRANSCRIPTION FACTOR BHLH52-RELATED       coord: 15..287
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain   SMART        SM00353         finulus                                   coord: 176..225; e-value: 2.5E-9; score: 47.0
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain   PFAM         PF00010         HLH                                       coord: 177..220; e-value: 9.9E-6; score: 25.5
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain   PROSITE      PS50888         BHLH                                      coord: 170..219; score: 12.252081
IPR036638  Helix-loop-helix DNA-binding domain superfamily  GENE3D       4.10.280.10     -                                         coord: 172..245; e-value: 1.4E-10; score: 43.1
IPR036638  Helix-loop-helix DNA-binding domain superfamily  SUPERFAMILY  47459           HLH, helix-loop-helix DNA-binding domain  coord: 170..233
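
The `coord` ranges in the InterPro rows can be used to cut the predicted domain out of the protein sequence. The sketch below assumes those coordinates are 1-based and inclusive on the protein; `subsequence` and the `DOMAINS` mapping are hypothetical names introduced for illustration.

```python
# Protein sequence of Tan0020793 as listed on this page.
PROTEIN = (
    "MAALTFYSNFYSDPSSITPQFHHFSPEISPELFYLPPLQLVPDPDFDYVSSAVDDSVFFP"
    "NVAPFFDDASSFLFSDVCPCFSAPVANEFVPVSAEFFPVDEFEFHCSKRQRVVLEQSFCC"
    "GGVGGDGNVGGGGGYFPPPELFSGMWDGRRDNAELMNNGGCSKTKPSTSNNLSAQTIAAR"
    "ERRRKITAKTQELGELVPGGSKMNTAEMLNSAFRYVKFLQAQVAILQLKQETEQEQEQQE"
    "TEDLRILESTMIQEKLYSEEQCLVPKGFVQNLANFPEIQSHPTIFNSINQILQKSS"
)

# bHLH hits (IPR011598) with their reported coordinates.
DOMAINS = {
    "SMART SM00353": (176, 225),
    "PFAM PF00010": (177, 220),
    "PROSITE PS50888": (170, 219),
}

def subsequence(seq, start, end):
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

for name, (start, end) in DOMAINS.items():
    print(name, start, end, subsequence(PROTEIN, start, end))
```

All three ranges overlap the region aligned to the Arabidopsis bHLH proteins in the BLAST results, which is what one would expect for a conserved bHLH domain.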

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0020793.1   Tan0020793.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006357      regulation of transcription by RNA polymerase II
cellular_component  GO:0005634      nucleus
molecular_function  GO:0000981      DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function  GO:0046983      protein dimerization activity
molecular_function  GO:0000978      RNA polymerase II cis-regulatory region sequence-specific DNA binding