Cp4.1LG01g20040 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g20040
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Basic helix-loop-helix transcription factor
Location: Cp4.1LG01 : 17028190 .. 17030384 (-)
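
As an illustration only (not part of the database record), the location string above can be split into sequence ID, coordinates and strand. The function name parse_location is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state; under that assumption the feature spans 2195 bp on the minus strand.

```python
import re


def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG01 : 17028190 .. 17030384 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```
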
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCACTTGCAAACTCATCTGTACATGAAGCCGATGGCATGGACAAAGCTGGTGGATATATCAAGTCTATACCCATTGCTACCCATGACACTCCAAATGCTACCTTTCTCTTTCGAAACACGACGAATTACCAACTCGATCACCATCAGCAGCAGCAGCAACTGCCTCAGTCTCTCATCAACTTCAAGTGTGGGATTGATAACATTTTCCATAAGAACAATGAATCTTTGCTGAGCTTTGAGCCACAAGGGGTTGGTCAAGACACTTATAATTGGGATGATCACCAGCGTTTGATGGAAGATCCCAATTGCTTTCAAACAGCCACCAATGGTTATAGCTCTCCAAAAGAGAACAAAAATGGTGATGGGTCTGTGTATGATTGGCTTTACTCTGAAGCAAGTGGTGTTTCTGATGCCATTCAGGAGGCTGAAGGAGCTCACGAGATTGCCAATCAAAAGCGACCTCATATGGTATTGAATTTCAAAGTTCTTACCTTTTCTTTGCAACAAGTCCTCAATTTTTTTTTCGTTTTTGTTGAGAAATGTGAGATCTCGTATCCGATTGGGGAGGAGAACAAAACATTCTTTATAAGAGTGTGGAAACCTCTTCTTAGTAGACGCGTTTTAAAAACCTTGAAGAAAAGCCCGAAAGGTAAAATCAAAGAAGATAATATCTACTAGCAGTGGGCTTGAGCCGTTACAAATGGTATCAGAGCCAGACAACGGATGATGAGCCAGTGAGGAAGCTAAGCTCCGAAGAGGGGTGGACAAGAGGCGGTGTGCTAGCAAGCACACTGGGCCCTAAAGGAGGTGGATTGGAGTGTCACACATTGATTGGAGAAGGGAACGAGTGTCAGTGAGGACGCTGGCCCCCGAAAGAGGGTGGATCGTAAGATCCTACATCAGTTAGGGAGGAGAACAAAACATTTTTTATAAGAGTGTGGAAACCTCTCACTAACAGACCGTTTTAAAGAAAGACAAAAAAGAACAGGCTTGGGTCGTTACCGAAAACGTTTGAATTCCGAGCTAAATTACAAGTTTTTTTTATTTTGAAAGTTAGCTTTTGATTGATTGATTGACAGGGAGAGAGCAGTGGGAGTGTTAGTAAGAAGCAATGCACAACAGTTGCAGCTAAGAAACAGAAACCCAAATCATCAACAGCAAAGGATCCACAGAGCATTGCAGCCAAGGTTTTAGTTTGTTTTTTTGACAACGTTGGCTTACAATTTAAGGAACAAATATTTTAAGGTTCTGAAAGAATGTCTGCTTTTGTCTTTGTTTTTATGTTTCTTAGAATCGACGAGAGAGGATTAGCGAGCGGCTGAAGATACTTCAAGAATTGGTTCCCAATGGCTCCAAGGTGCTTACTCTTATCTAATATAAAAAACATGTTCATATCTCAATCCGCCTCGGCAGATCTTCTTCTTAACTAGCTAATGTGAAACTCTAAAAATCTCTTCATCTTAAATAGTCATTTATCCGTCTAGCACTATCCCGTGTTAGCTCCTTTTCGTCAATCTAATACATCTCTTCCATGCATATTTCCTATGAACACACCTTTAAGGCTCACATGTCTATTTATTTGACACCCACCCAAGAATTCTATGTGAGATCTCACATCGGTTGTAGAGAGGAACAAAGTATTTTTTACCAGGGTACAAATATCTCCAAACCCAAAGAGAATAATATCTGCTGACGTGGGCTTAGGTGGTTATATTCTATTACTAAGTTATGGGTACGATTGTGATAGGCTAATAATTACTTGATATTGTACACATTGAGCATAAGCTCTCGTGATTTTGTTTTTTGGTTACCCAAAATGTTTTGTTCGAATGAAGATAGTGTCCCTAACTTCTATATCTATGATCTTTCTCGTGAAACGCTAATAATTTGTACTTGTTTGCAGGTTGATTTGGTAACGATGTTGGAAAAGGCAATTAGTTATGTAAAATTTCTTCAACTACAAGTGAAGGTACAAATTCCATATCACTCTGTTTCCA
AATTTTGGATAAAGATCCTTCCCTTGTTGGCTCTGATATCATAGTAAAAATTTTGAATATCAGATCCTAGCTACGGATGAGTTTTGGCCAGTTCAAGGTGGGAAAGCTCCTGATATATCACAAGTTAAGGAAGCCATTGATGCTATTCTTTCATCTCAAAGAGAAAGAAGTTCAAGCTCAAACAGTCAAAAATGA

mRNA sequence

ATGGCACTTGCAAACTCATCTGTACATGAAGCCGATGGCATGGACAAAGCTGGTGGATATATCAAGTCTATACCCATTGCTACCCATGACACTCCAAATGCTACCTTTCTCTTTCGAAACACGACGAATTACCAACTCGATCACCATCAGCAGCAGCAGCAACTGCCTCAGTCTCTCATCAACTTCAAGTGTGGGATTGATAACATTTTCCATAAGAACAATGAATCTTTGCTGAGCTTTGAGCCACAAGGGGTTGGTCAAGACACTTATAATTGGGATGATCACCAGCGTTTGATGGAAGATCCCAATTGCTTTCAAACAGCCACCAATGGTTATAGCTCTCCAAAAGAGAACAAAAATGGTGATGGGTCTGTGTATGATTGGCTTTACTCTGAAGCAAGTGGTGTTTCTGATGCCATTCAGGAGGCTGAAGGAGCTCACGAGATTGCCAATCAAAAGCGACCTCATATGGTATTGAATTTCAAAGGAGAGAGCAGTGGGAGTGTTAGTAAGAAGCAATGCACAACAGTTGCAGCTAAGAAACAGAAACCCAAATCATCAACAGCAAAGGATCCACAGAGCATTGCAGCCAAGAATCGACGAGAGAGGATTAGCGAGCGGCTGAAGATACTTCAAGAATTGGTTCCCAATGGCTCCAAGATCCTAGCTACGGATGAGTTTTGGCCAGTTCAAGGTGGGAAAGCTCCTGATATATCACAAGTTAAGGAAGCCATTGATGCTATTCTTTCATCTCAAAGAGAAAGAAGTTCAAGCTCAAACAGTCAAAAATGA

Coding sequence (CDS)

ATGGCACTTGCAAACTCATCTGTACATGAAGCCGATGGCATGGACAAAGCTGGTGGATATATCAAGTCTATACCCATTGCTACCCATGACACTCCAAATGCTACCTTTCTCTTTCGAAACACGACGAATTACCAACTCGATCACCATCAGCAGCAGCAGCAACTGCCTCAGTCTCTCATCAACTTCAAGTGTGGGATTGATAACATTTTCCATAAGAACAATGAATCTTTGCTGAGCTTTGAGCCACAAGGGGTTGGTCAAGACACTTATAATTGGGATGATCACCAGCGTTTGATGGAAGATCCCAATTGCTTTCAAACAGCCACCAATGGTTATAGCTCTCCAAAAGAGAACAAAAATGGTGATGGGTCTGTGTATGATTGGCTTTACTCTGAAGCAAGTGGTGTTTCTGATGCCATTCAGGAGGCTGAAGGAGCTCACGAGATTGCCAATCAAAAGCGACCTCATATGGTATTGAATTTCAAAGGAGAGAGCAGTGGGAGTGTTAGTAAGAAGCAATGCACAACAGTTGCAGCTAAGAAACAGAAACCCAAATCATCAACAGCAAAGGATCCACAGAGCATTGCAGCCAAGAATCGACGAGAGAGGATTAGCGAGCGGCTGAAGATACTTCAAGAATTGGTTCCCAATGGCTCCAAGATCCTAGCTACGGATGAGTTTTGGCCAGTTCAAGGTGGGAAAGCTCCTGATATATCACAAGTTAAGGAAGCCATTGATGCTATTCTTTCATCTCAAAGAGAAAGAAGTTCAAGCTCAAACAGTCAAAAATGA

Protein sequence

MALANSSVHEADGMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHKNNESLLSFEPQGVGQDTYNWDDHQRLMEDPNCFQTATNGYSSPKENKNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGESSGSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERISERLKILQELVPNGSKILATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSSNSQK
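
The protein sequence corresponds to a translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the helper name translate is hypothetical. The two short inputs are the first 33 and last 15 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in
# T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Unknown codons (e.g. containing N) are reported as 'X'; a
    trailing partial codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCACTTGCAAACTCATCTGTACATGAAGCC"))  # start of the CDS
print(translate("AACAGTCAAAAATGA"))                    # end of the CDS
```

Running translate on the full CDS string and comparing the result with the listed protein is a quick consistency check on the record.
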
BLAST of Cp4.1LG01g20040 vs. Swiss-Prot
Match: BH086_ARATH (Putative transcription factor bHLH086 OS=Arabidopsis thaliana GN=BHLH86 PE=3 SV=2)

HSP 1 Score: 129.4 bits (324), Expect = 5.7e-29
Identity = 113/291 (38.83%), Positives = 153/291 (52.58%), Query Frame = 1

Query: 13  GMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHK 72
           G+D+      S  I +    N  F+F  +     DH+        SL++F       F  
Sbjct: 30  GLDEGASASSSSTINSDHQNNQGFVFYPSGETIEDHN--------SLMDFNASSFFTFD- 89

Query: 73  NNESLLSFEPQGV------GQDTYN---WDDHQ------RLMEDPNCFQTAT------NG 132
           N+ SL+S    G       G  +Y+   W  HQ      R+++ PN F+T +      N 
Sbjct: 90  NHRSLISPVTNGGAFPVVDGNMSYSYDGWSHHQVDSISPRVIKTPNSFETTSSFGLTSNS 149

Query: 133 YSSPKENKNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGESSGSVSK 192
            S P  N +G+G   DWLYS ++ V+   +    + ++A  KRP     F GE++  +SK
Sbjct: 150 MSKPATN-HGNG---DWLYSGSTIVNIGSRHESTSPKLAGNKRP-----FTGENT-QLSK 209

Query: 193 KQCTTVAAKKQKPKSSTA-KDPQSIAAKNRRERISERLKILQELVPNGSKI--------- 252
           K  +    K  KPK++T+ KDPQS+AAKNRRERISERLK+LQELVPNG+K+         
Sbjct: 210 KPSSGTNGKI-KPKATTSPKDPQSLAAKNRRERISERLKVLQELVPNGTKVDLVTMLEKA 269

Query: 253 -------------LATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSS 260
                        LA DEFWP QGGKAPDISQVKEAIDAILSS +  S+S+
Sbjct: 270 IGYVKFLQVQVKVLAADEFWPAQGGKAPDISQVKEAIDAILSSSQRDSNST 300
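
The Identity and Positives figures in each HSP header are match counts over the alignment length, with the percentage in parentheses. The sketch below, assuming the header layout used on this page, parses such a line and recomputes the percentages; parse_hsp_stats is a hypothetical helper, and the pattern also tolerates the misspelling "Postives".

```python
import re

# Matches e.g. "Identity = 113/291 (38.83%)"; "Posi?tives" also accepts
# the misspelt label "Postives".
HSP_STATS = re.compile(
    r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return matched count, alignment length, reported and recomputed
    percentage for the identity and positives fields of an HSP header."""
    stats = {}
    for label, matched, length, reported in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        matched, length = int(matched), int(length)
        stats[key] = {
            "matched": matched,
            "length": length,
            "reported": float(reported),
            "computed": round(100 * matched / length, 2),
        }
    return stats


header = ("Identity = 113/291 (38.83%), Positives = 153/291 (52.58%), "
          "Query Frame = 1")
for field, values in parse_hsp_stats(header).items():
    print(field, values)
```
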

BLAST of Cp4.1LG01g20040 vs. Swiss-Prot
Match: BH083_ARATH (Transcription factor bHLH83 OS=Arabidopsis thaliana GN=BHLH83 PE=2 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 4.5e-26
Identity = 98/252 (38.89%), Positives = 129/252 (51.19%), Query Frame = 1

Query: 30  DTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHKNNESLLSFEPQGVGQDT 89
           D  N+   F  ++    DHH+     P   I+  CG  +            E   +    
Sbjct: 71  DHHNSLMDFNGSSFLNFDHHES---FPPPAIS--CGGSS----GGGGFSFLEGNNMSYGF 130

Query: 90  YNWDDHQRLMEDPNCFQTATNGYSSPKENKNGDGSVYDWLYSEASGVSDAIQEAEGAHEI 149
            NW+ HQ  M+             SP+  +   G   DWLYS+++ V+   +    + + 
Sbjct: 131 TNWN-HQHHMD-----------IISPRSTETPQGQK-DWLYSDSTVVTTGSRNESLSPKS 190

Query: 150 ANQKRPHMVLNFKGESSGSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERISERLK 209
           A  KR H      GES+   SKK  + V  K +   +++ KDPQS+AAKNRRERISERLK
Sbjct: 191 AGNKRSHT-----GEST-QPSKKLSSGVTGKTKPKPTTSPKDPQSLAAKNRRERISERLK 250

Query: 210 ILQELVPNGSKI----------------------LATDEFWPVQGGKAPDISQVKEAIDA 259
           ILQELVPNG+K+                      LATDEFWP QGGKAPDISQVK+AIDA
Sbjct: 251 ILQELVPNGTKVDLVTMLEKAISYVKFLQVQVKVLATDEFWPAQGGKAPDISQVKDAIDA 294

BLAST of Cp4.1LG01g20040 vs. TrEMBL
Match: A0A0A0KN55_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G420280 PE=4 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 8.5e-80
Identity = 196/301 (65.12%), Positives = 212/301 (70.43%), Query Frame = 1

Query: 1   MALANSSVHEADGMDK-AGGYIKSIPIAT---HDTPNATFLFRNTTNYQLDHHQQQQQLP 60
           MALANS++ E DGM+K  GGY+KS+ I     H  PN  F+FR+TTNYQL   QQQ    
Sbjct: 1   MALANSAIQEVDGMEKNGGGYLKSLAIGKGHDHQNPNGDFVFRDTTNYQLGQQQQQ---- 60

Query: 61  QSLINFKCGIDNIFHK-NNESLLSFEPQGVGQD--TYNWDDHQRLMEDPNCFQTATN--G 120
            SLINF        HK NNESLLSFE QG+ Q   TYNWDD QR+MEDPNCFQTATN   
Sbjct: 61  -SLINFG-------HKMNNESLLSFEAQGICQLDLTYNWDDQQRVMEDPNCFQTATNHNN 120

Query: 121 YSSPKE----NKNGD-GSVYDWLYSEAS-GVSDAIQEAEGAHEIA-NQKRPHMVLNFKGE 180
           YS  K+    NKNGD GSVY+WLYSE++   SD+IQEAEG  EI  N KR H      GE
Sbjct: 121 YSPSKDHHHHNKNGDNGSVYEWLYSESTTDFSDSIQEAEGTQEIVPNHKRSHTT----GE 180

Query: 181 SSGSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERISERLKILQELVPNGSK---- 240
           SSGSV KKQCT  A KKQKPKS+TAKDPQSIAAKNRRERISERLKILQELVPNGSK    
Sbjct: 181 SSGSVCKKQCTA-APKKQKPKSATAKDPQSIAAKNRRERISERLKILQELVPNGSKVDLV 240

Query: 241 ------------------ILATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSSNSQ 264
                             ILATDEFWPVQGGKAPDISQVKEAID ILSSQRERSSSSNS+
Sbjct: 241 TMLEKAISYVKFLQLQVKILATDEFWPVQGGKAPDISQVKEAIDVILSSQRERSSSSNSE 284

BLAST of Cp4.1LG01g20040 vs. TrEMBL
Match: V4SXU2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10013420mg PE=4 SV=1)

HSP 1 Score: 194.5 bits (493), Expect = 1.6e-46
Identity = 137/286 (47.90%), Positives = 169/286 (59.09%), Query Frame = 1

Query: 13  GMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHK 72
           G  KA G++KSI I+   +P++     N+  Y+  + ++ +    SLINFK G D+  H 
Sbjct: 22  GSKKAPGFVKSIGISNTSSPSSA----NSNGYEFHNAEEARD---SLINFKSGHDHFMHA 81

Query: 73  NNESLLSFEPQGVGQDTYN-WD------------DHQRLMEDPNCFQTATN-GYSSPKEN 132
           N  +LLSFE      D Y+ W+            DH RLMED NCF TA+N G+      
Sbjct: 82  NASNLLSFEQNETKDDEYSMWEGNFNQINQKYSSDH-RLMEDFNCFDTASNYGFMINNSA 141

Query: 133 KNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGESSGSVSKKQCTTVA 192
           ++  G   DWLY+EA+ V+D I E+      ++ KRP+      GES  +V KKQC   A
Sbjct: 142 RDCHG---DWLYTEATAVTDTILESGSQDASSSLKRPNT-----GESMQAV-KKQCAIAA 201

Query: 193 AKKQKPKSSTA---KDPQSIAAKNRRERISERLKILQELVPNGSKI-------------- 252
            KK   K  T+   KDPQSIAAKNRRERISERLKILQELVPNGSK+              
Sbjct: 202 TKKTNNKQKTSAPPKDPQSIAAKNRRERISERLKILQELVPNGSKVDLVTMLEKAISYVK 261

Query: 253 --------LATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSS 260
                   LATDEFWPVQGGKAPDISQV+EAIDAILSSQR+RSSSS
Sbjct: 262 FLQLQVKVLATDEFWPVQGGKAPDISQVREAIDAILSSQRDRSSSS 290

BLAST of Cp4.1LG01g20040 vs. TrEMBL
Match: A0A067FAC6_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g038918mg PE=4 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 3.6e-46
Identity = 136/286 (47.55%), Positives = 169/286 (59.09%), Query Frame = 1

Query: 13  GMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHK 72
           G  KA G++KSI I+   +P++     N+  Y+  + ++ +    SLINFK G D+  H 
Sbjct: 22  GSKKAPGFVKSIGISNTSSPSSA----NSNGYEFHNAEEARD---SLINFKSGHDHFMHA 81

Query: 73  NNESLLSFEPQGVGQDTYN-WD------------DHQRLMEDPNCFQTATN-GYSSPKEN 132
           N  +LLSFE      D Y+ W+            DH RLMED NCF TA+N G+      
Sbjct: 82  NASNLLSFEQNETKDDEYSMWEGNFNQINQKYSSDH-RLMEDFNCFDTASNYGFMINNSA 141

Query: 133 KNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGESSGSVSKKQCTTVA 192
           ++  G   DWLY+EA+ V+D I E+      ++ KRP+      G+S  +V KKQC   A
Sbjct: 142 RDCHG---DWLYTEATAVTDTILESGSQDASSSLKRPNT-----GDSMQAV-KKQCAIAA 201

Query: 193 AKKQKPKSSTA---KDPQSIAAKNRRERISERLKILQELVPNGSKI-------------- 252
            KK   K  T+   KDPQSIAAKNRRERISERLKILQELVPNGSK+              
Sbjct: 202 TKKTNNKQKTSAPPKDPQSIAAKNRRERISERLKILQELVPNGSKVDLVTMLEKAISYVK 261

Query: 253 --------LATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSS 260
                   LATDEFWPVQGGKAPDISQV+EAIDAILSSQR+RSSSS
Sbjct: 262 FLQLQVKVLATDEFWPVQGGKAPDISQVREAIDAILSSQRDRSSSS 290

BLAST of Cp4.1LG01g20040 vs. TrEMBL
Match: A0A061EH34_THECC (Rhd six-like 1, putative OS=Theobroma cacao GN=TCM_019210 PE=4 SV=1)

HSP 1 Score: 186.0 bits (471), Expect = 5.7e-44
Identity = 137/308 (44.48%), Positives = 174/308 (56.49%), Query Frame = 1

Query: 10  EADGMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQ------LDHHQQQQQLPQSLINFK 69
           E  G++K GG +K+   ++    + +F   N+ N        +++H ++ Q   SLINFK
Sbjct: 34  EGGGLEKLGGVVKNFSTSS----STSFSSPNSVNSDGLAFRAVNYHPEEAQ---SLINFK 93

Query: 70  -CGIDNIFHKNNESLLSFEPQ----------------------GVGQDTYNWDD------ 129
             G DN  H  N SLLSFE                        G     Y W+       
Sbjct: 94  GAGYDNFMHGTNGSLLSFEQNERVLQNTYLKTSSHKDEYSIWAGSLNQNYQWNQVNPKSS 153

Query: 130 -HQRLMEDPNCFQTATNGYSSPKENKNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQK 189
              R++ED +CF+TA+N  S     K   G   DWLYSEA+ V+++IQE+ G+ E +  K
Sbjct: 154 TDPRVVEDFSCFETASNFNSMTSATKENHG---DWLYSEAAVVANSIQES-GSPEASGLK 213

Query: 190 RPHMVLNFKGESSGSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERISERLKILQE 249
           RPH      GES+ ++ KKQC+   AKK K KS  +KDPQSIAAKNRRERISERLKILQE
Sbjct: 214 RPHT-----GESNQAL-KKQCSN-EAKKAKTKSGPSKDPQSIAAKNRRERISERLKILQE 273

Query: 250 LVPNGSKI----------------------LATDEFWPVQGGKAPDISQVKEAIDAILSS 260
           LVPNGSK+                      LATDEFWPVQGGKAPDISQV+EAIDAILSS
Sbjct: 274 LVPNGSKVDLVTMLEKAISYVKFLQLQVKVLATDEFWPVQGGKAPDISQVREAIDAILSS 323

BLAST of Cp4.1LG01g20040 vs. TrEMBL
Match: B9T2K1_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_0285440 PE=4 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 1.8e-42
Identity = 127/260 (48.85%), Positives = 154/260 (59.23%), Query Frame = 1

Query: 51  QQQQLPQSLINFKCGIDNIFHKNNESLLSFEPQ---------GVGQDTYNWD-----DHQ 110
           Q  +   SLINF+ G D   H N  SLLSFE           G+ +D   W+     ++Q
Sbjct: 79  QAHEGAHSLINFRGGYDGFLHGNG-SLLSFEQNNKVSSQTSSGLKEDYSTWEVDFNCNYQ 138

Query: 111 ------------RLMEDPNCFQTATNGYSSPKENKNGDGSVYDWLYSEASGVSDAIQEAE 170
                       RL+E+ NCF TA+N  S     K   G   DWLYSE + ++D   +  
Sbjct: 139 WNQMNPKSSADPRLVENINCFHTASNFNSIDNSEKEDPG---DWLYSEGTIITDHSIQEP 198

Query: 171 GAHEIANQKRPHMVLNFKGESSGSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERI 230
           G  +    KRPHM     GES+ +V KKQC + A KKQKPK+S +KDPQSIAAKNRRERI
Sbjct: 199 GTQDANFHKRPHM-----GESTQAV-KKQCNS-ATKKQKPKTSPSKDPQSIAAKNRRERI 258

Query: 231 SERLKILQELVPNGSKI----------------------LATDEFWPVQGGKAPDISQVK 263
           SERLKILQELVPNGSK+                      LATDEFWPVQGGKAPD+SQVK
Sbjct: 259 SERLKILQELVPNGSKVDLVTMLEKAISYVKFLQLQVKVLATDEFWPVQGGKAPDVSQVK 318

BLAST of Cp4.1LG01g20040 vs. TAIR10
Match: AT5G37800.1 (AT5G37800.1 RHD SIX-LIKE 1)

HSP 1 Score: 129.4 bits (324), Expect = 3.2e-30
Identity = 113/291 (38.83%), Positives = 153/291 (52.58%), Query Frame = 1

Query: 13  GMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHK 72
           G+D+      S  I +    N  F+F  +     DH+        SL++F       F  
Sbjct: 30  GLDEGASASSSSTINSDHQNNQGFVFYPSGETIEDHN--------SLMDFNASSFFTFD- 89

Query: 73  NNESLLSFEPQGV------GQDTYN---WDDHQ------RLMEDPNCFQTAT------NG 132
           N+ SL+S    G       G  +Y+   W  HQ      R+++ PN F+T +      N 
Sbjct: 90  NHRSLISPVTNGGAFPVVDGNMSYSYDGWSHHQVDSISPRVIKTPNSFETTSSFGLTSNS 149

Query: 133 YSSPKENKNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGESSGSVSK 192
            S P  N +G+G   DWLYS ++ V+   +    + ++A  KRP     F GE++  +SK
Sbjct: 150 MSKPATN-HGNG---DWLYSGSTIVNIGSRHESTSPKLAGNKRP-----FTGENT-QLSK 209

Query: 193 KQCTTVAAKKQKPKSSTA-KDPQSIAAKNRRERISERLKILQELVPNGSKI--------- 252
           K  +    K  KPK++T+ KDPQS+AAKNRRERISERLK+LQELVPNG+K+         
Sbjct: 210 KPSSGTNGKI-KPKATTSPKDPQSLAAKNRRERISERLKVLQELVPNGTKVDLVTMLEKA 269

Query: 253 -------------LATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSS 260
                        LA DEFWP QGGKAPDISQVKEAIDAILSS +  S+S+
Sbjct: 270 IGYVKFLQVQVKVLAADEFWPAQGGKAPDISQVKEAIDAILSSSQRDSNST 300

BLAST of Cp4.1LG01g20040 vs. TAIR10
Match: AT1G66470.1 (AT1G66470.1 ROOT HAIR DEFECTIVE6)

HSP 1 Score: 119.8 bits (299), Expect = 2.6e-27
Identity = 98/252 (38.89%), Positives = 129/252 (51.19%), Query Frame = 1

Query: 30  DTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHKNNESLLSFEPQGVGQDT 89
           D  N+   F  ++    DHH+     P   I+  CG  +            E   +    
Sbjct: 71  DHHNSLMDFNGSSFLNFDHHES---FPPPAIS--CGGSS----GGGGFSFLEGNNMSYGF 130

Query: 90  YNWDDHQRLMEDPNCFQTATNGYSSPKENKNGDGSVYDWLYSEASGVSDAIQEAEGAHEI 149
            NW+ HQ  M+             SP+  +   G   DWLYS+++ V+   +    + + 
Sbjct: 131 TNWN-HQHHMD-----------IISPRSTETPQGQK-DWLYSDSTVVTTGSRNESLSPKS 190

Query: 150 ANQKRPHMVLNFKGESSGSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERISERLK 209
           A  KR H      GES+   SKK  + V  K +   +++ KDPQS+AAKNRRERISERLK
Sbjct: 191 AGNKRSHT-----GEST-QPSKKLSSGVTGKTKPKPTTSPKDPQSLAAKNRRERISERLK 250

Query: 210 ILQELVPNGSKI----------------------LATDEFWPVQGGKAPDISQVKEAIDA 259
           ILQELVPNG+K+                      LATDEFWP QGGKAPDISQVK+AIDA
Sbjct: 251 ILQELVPNGTKVDLVTMLEKAISYVKFLQVQVKVLATDEFWPAQGGKAPDISQVKDAIDA 294

BLAST of Cp4.1LG01g20040 vs. TAIR10
Match: AT3G50330.1 (AT3G50330.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 50.1 bits (118), Expect = 2.5e-06
Identity = 23/33 (69.70%), Positives = 31/33 (93.94%), Query Frame = 1

Query: 189 AKDPQSIAAKNRRERISERLKILQELVPNGSKI 222
           +KDPQS+AA++RRERISER++ILQ LVP G+K+
Sbjct: 126 SKDPQSVAARHRRERISERIRILQRLVPGGTKM 158

BLAST of Cp4.1LG01g20040 vs. TAIR10
Match: AT5G67060.1 (AT5G67060.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 50.1 bits (118), Expect = 2.5e-06
Identity = 23/33 (69.70%), Positives = 31/33 (93.94%), Query Frame = 1

Query: 189 AKDPQSIAAKNRRERISERLKILQELVPNGSKI 222
           +KDPQS+AA++RRERISER++ILQ LVP G+K+
Sbjct: 129 SKDPQSVAARHRRERISERIRILQRLVPGGTKM 161

BLAST of Cp4.1LG01g20040 vs. TAIR10
Match: AT4G33880.1 (AT4G33880.1 ROOT HAIR DEFECTIVE 6-LIKE 2)

HSP 1 Score: 49.3 bits (116), Expect = 4.2e-06
Identity = 39/116 (33.62%), Positives = 54/116 (46.55%), Query Frame = 1

Query: 111 GYSSPKENKNG-----DGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGES 170
           G +  K+ KNG       S   +   E S  +D  Q+  G    + +  P   LN  G++
Sbjct: 209 GETKLKKRKNGAMMSRQNSSTTFCTEEESNCAD--QDGGGEDSSSKEDDPSKALNLNGKT 268

Query: 171 SGSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERISERLKILQELVPNGSKI 222
             S                    A DPQS+ A+ RRERI+ERL+ILQ LVPNG+K+
Sbjct: 269 RAS-----------------RGAATDPQSLYARKRRERINERLRILQNLVPNGTKV 305

BLAST of Cp4.1LG01g20040 vs. NCBI nr
Match: gi|449445206|ref|XP_004140364.1| (PREDICTED: putative transcription factor bHLH086 [Cucumis sativus])

HSP 1 Score: 305.1 bits (780), Expect = 1.2e-79
Identity = 196/301 (65.12%), Positives = 212/301 (70.43%), Query Frame = 1

Query: 1   MALANSSVHEADGMDK-AGGYIKSIPIAT---HDTPNATFLFRNTTNYQLDHHQQQQQLP 60
           MALANS++ E DGM+K  GGY+KS+ I     H  PN  F+FR+TTNYQL   QQQ    
Sbjct: 1   MALANSAIQEVDGMEKNGGGYLKSLAIGKGHDHQNPNGDFVFRDTTNYQLGQQQQQ---- 60

Query: 61  QSLINFKCGIDNIFHK-NNESLLSFEPQGVGQD--TYNWDDHQRLMEDPNCFQTATN--G 120
            SLINF        HK NNESLLSFE QG+ Q   TYNWDD QR+MEDPNCFQTATN   
Sbjct: 61  -SLINFG-------HKMNNESLLSFEAQGICQLDLTYNWDDQQRVMEDPNCFQTATNHNN 120

Query: 121 YSSPKE----NKNGD-GSVYDWLYSEAS-GVSDAIQEAEGAHEIA-NQKRPHMVLNFKGE 180
           YS  K+    NKNGD GSVY+WLYSE++   SD+IQEAEG  EI  N KR H      GE
Sbjct: 121 YSPSKDHHHHNKNGDNGSVYEWLYSESTTDFSDSIQEAEGTQEIVPNHKRSHTT----GE 180

Query: 181 SSGSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERISERLKILQELVPNGSK---- 240
           SSGSV KKQCT  A KKQKPKS+TAKDPQSIAAKNRRERISERLKILQELVPNGSK    
Sbjct: 181 SSGSVCKKQCTA-APKKQKPKSATAKDPQSIAAKNRRERISERLKILQELVPNGSKVDLV 240

Query: 241 ------------------ILATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSSNSQ 264
                             ILATDEFWPVQGGKAPDISQVKEAID ILSSQRERSSSSNS+
Sbjct: 241 TMLEKAISYVKFLQLQVKILATDEFWPVQGGKAPDISQVKEAIDVILSSQRERSSSSNSE 284

BLAST of Cp4.1LG01g20040 vs. NCBI nr
Match: gi|659121136|ref|XP_008460515.1| (PREDICTED: putative transcription factor bHLH086 [Cucumis melo])

HSP 1 Score: 292.4 bits (747), Expect = 8.2e-76
Identity = 190/299 (63.55%), Positives = 211/299 (70.57%), Query Frame = 1

Query: 1   MALANSSVHEADGMDK-AGGYIKSIPIAT-HD-TPNATFLFRNTTNYQLDHHQQQQQLPQ 60
           M+LANS++ E DGM+K  GGY+KS+ I   HD  PN  F+FR+ TNYQL   QQQ     
Sbjct: 1   MSLANSAIQEVDGMEKNGGGYLKSLAIGKGHDQNPNGDFVFRDATNYQLGLQQQQ----- 60

Query: 61  SLINFKCGIDNIFHK-NNESLLSFEPQGVGQD--TYNWDDHQRLMEDPNCFQTAT--NGY 120
           SLINF        HK NNESLLSFE QG+ Q   TYNWDD +R+MEDPNCFQTAT  N Y
Sbjct: 61  SLINFG-------HKMNNESLLSFEAQGICQLDLTYNWDDQERVMEDPNCFQTATSHNNY 120

Query: 121 SSPKE---NKNGD-GSVYDWLYSEAS-GVSDAIQEAEGAHEIA-NQKRPHMVLNFKGESS 180
           S  K+   NKNG+ GSVY+WLY E++   SD+IQ+AEG  EI  N KR H      GESS
Sbjct: 121 SPSKDHHHNKNGNSGSVYEWLYCESTTDFSDSIQDAEGTQEIVPNHKRSHST----GESS 180

Query: 181 GSVSKKQCTTVAAKKQKPKSSTAKDPQSIAAKNRRERISERLKILQELVPNGSK------ 240
           GSV KKQCT  A KKQKPKS+TAKDPQSIAAKNRRERISERLKILQELVPNGSK      
Sbjct: 181 GSVCKKQCTA-APKKQKPKSATAKDPQSIAAKNRRERISERLKILQELVPNGSKVDLVTM 240

Query: 241 ----------------ILATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSSNSQK 264
                           ILATDEFWPVQGGKAPDISQVKEAID ILSSQRERSSSSN++K
Sbjct: 241 LEKAISYVKFLQLQVKILATDEFWPVQGGKAPDISQVKEAIDVILSSQRERSSSSNTEK 282

BLAST of Cp4.1LG01g20040 vs. NCBI nr
Match: gi|567876091|ref|XP_006430635.1| (hypothetical protein CICLE_v10013420mg [Citrus clementina])

HSP 1 Score: 194.5 bits (493), Expect = 2.3e-46
Identity = 137/286 (47.90%), Positives = 169/286 (59.09%), Query Frame = 1

Query: 13  GMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHK 72
           G  KA G++KSI I+   +P++     N+  Y+  + ++ +    SLINFK G D+  H 
Sbjct: 22  GSKKAPGFVKSIGISNTSSPSSA----NSNGYEFHNAEEARD---SLINFKSGHDHFMHA 81

Query: 73  NNESLLSFEPQGVGQDTYN-WD------------DHQRLMEDPNCFQTATN-GYSSPKEN 132
           N  +LLSFE      D Y+ W+            DH RLMED NCF TA+N G+      
Sbjct: 82  NASNLLSFEQNETKDDEYSMWEGNFNQINQKYSSDH-RLMEDFNCFDTASNYGFMINNSA 141

Query: 133 KNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGESSGSVSKKQCTTVA 192
           ++  G   DWLY+EA+ V+D I E+      ++ KRP+      GES  +V KKQC   A
Sbjct: 142 RDCHG---DWLYTEATAVTDTILESGSQDASSSLKRPNT-----GESMQAV-KKQCAIAA 201

Query: 193 AKKQKPKSSTA---KDPQSIAAKNRRERISERLKILQELVPNGSKI-------------- 252
            KK   K  T+   KDPQSIAAKNRRERISERLKILQELVPNGSK+              
Sbjct: 202 TKKTNNKQKTSAPPKDPQSIAAKNRRERISERLKILQELVPNGSKVDLVTMLEKAISYVK 261

Query: 253 --------LATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSS 260
                   LATDEFWPVQGGKAPDISQV+EAIDAILSSQR+RSSSS
Sbjct: 262 FLQLQVKVLATDEFWPVQGGKAPDISQVREAIDAILSSQRDRSSSS 290

BLAST of Cp4.1LG01g20040 vs. NCBI nr
Match: gi|641844201|gb|KDO63095.1| (hypothetical protein CISIN_1g038918mg [Citrus sinensis])

HSP 1 Score: 193.4 bits (490), Expect = 5.2e-46
Identity = 136/286 (47.55%), Positives = 169/286 (59.09%), Query Frame = 1

Query: 13  GMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHK 72
           G  KA G++KSI I+   +P++     N+  Y+  + ++ +    SLINFK G D+  H 
Sbjct: 22  GSKKAPGFVKSIGISNTSSPSSA----NSNGYEFHNAEEARD---SLINFKSGHDHFMHA 81

Query: 73  NNESLLSFEPQGVGQDTYN-WD------------DHQRLMEDPNCFQTATN-GYSSPKEN 132
           N  +LLSFE      D Y+ W+            DH RLMED NCF TA+N G+      
Sbjct: 82  NASNLLSFEQNETKDDEYSMWEGNFNQINQKYSSDH-RLMEDFNCFDTASNYGFMINNSA 141

Query: 133 KNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGESSGSVSKKQCTTVA 192
           ++  G   DWLY+EA+ V+D I E+      ++ KRP+      G+S  +V KKQC   A
Sbjct: 142 RDCHG---DWLYTEATAVTDTILESGSQDASSSLKRPNT-----GDSMQAV-KKQCAIAA 201

Query: 193 AKKQKPKSSTA---KDPQSIAAKNRRERISERLKILQELVPNGSKI-------------- 252
            KK   K  T+   KDPQSIAAKNRRERISERLKILQELVPNGSK+              
Sbjct: 202 TKKTNNKQKTSAPPKDPQSIAAKNRRERISERLKILQELVPNGSKVDLVTMLEKAISYVK 261

Query: 253 --------LATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSS 260
                   LATDEFWPVQGGKAPDISQV+EAIDAILSSQR+RSSSS
Sbjct: 262 FLQLQVKVLATDEFWPVQGGKAPDISQVREAIDAILSSQRDRSSSS 290

BLAST of Cp4.1LG01g20040 vs. NCBI nr
Match: gi|568857667|ref|XP_006482386.1| (PREDICTED: transcription factor bHLH83-like [Citrus sinensis])

HSP 1 Score: 192.2 bits (487), Expect = 1.1e-45
Identity = 134/286 (46.85%), Positives = 169/286 (59.09%), Query Frame = 1

Query: 13  GMDKAGGYIKSIPIATHDTPNATFLFRNTTNYQLDHHQQQQQLPQSLINFKCGIDNIFHK 72
           G +KA G++KSI ++   +P++     N+  Y   +    +++  SLINFK G D+  H 
Sbjct: 22  GSEKASGFVKSIGMSNTSSPSSA----NSNGYAFHN---AEEVRDSLINFKSGHDHFMHA 81

Query: 73  NNESLLSFEPQGVGQDTYN-WD------------DHQRLMEDPNCFQTATN-GYSSPKEN 132
           N  +LLSFE      D Y+ W+            DH RLMED NCF +A+N G+      
Sbjct: 82  NASNLLSFEQNETKDDEYSMWEGNFNQINQKYSSDH-RLMEDFNCFDSASNYGFMINNSA 141

Query: 133 KNGDGSVYDWLYSEASGVSDAIQEAEGAHEIANQKRPHMVLNFKGESSGSVSKKQCTTVA 192
           ++  G   DWLY+EA+ V+D I E+      ++ KRP+      G+S  +V KKQC   A
Sbjct: 142 RDCHG---DWLYTEATAVTDTILESGSQDASSSLKRPNT-----GDSMQAV-KKQCAIAA 201

Query: 193 AKKQKPKSSTA---KDPQSIAAKNRRERISERLKILQELVPNGSKI-------------- 252
            KK   K  T+   KDPQSIAAKNRRERISERLKILQELVPNGSK+              
Sbjct: 202 TKKTNNKQKTSAPPKDPQSIAAKNRRERISERLKILQELVPNGSKVDLVTMLEKAISYVK 261

Query: 253 --------LATDEFWPVQGGKAPDISQVKEAIDAILSSQRERSSSS 260
                   LATDEFWPVQGGKAPDISQV+EAIDAILSSQR+RSSSS
Sbjct: 262 FLQLQVKVLATDEFWPVQGGKAPDISQVREAIDAILSSQRDRSSSS 290

The following BLAST results are available for this feature:

Database: Swiss-Prot
Match Name        E-value  Identity (%)  Description
BH086_ARATH       5.7e-29  38.83         Putative transcription factor bHLH086 OS=Arabidopsis thaliana GN=BHLH86 PE=3 SV=...
BH083_ARATH       4.5e-26  38.89         Transcription factor bHLH83 OS=Arabidopsis thaliana GN=BHLH83 PE=2 SV=1

Database: TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KN55_CUCSA  8.5e-80  65.12         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G420280 PE=4 SV=1
V4SXU2_9ROSI      1.6e-46  47.90         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10013420mg PE=4 SV=1
A0A067FAC6_CITSI  3.6e-46  47.55         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g038918mg PE=4 SV=1
A0A061EH34_THECC  5.7e-44  44.48         Rhd six-like 1, putative OS=Theobroma cacao GN=TCM_019210 PE=4 SV=1
B9T2K1_RICCO      1.8e-42  48.85         DNA binding protein, putative OS=Ricinus communis GN=RCOM_0285440 PE=4 SV=1

Database: TAIR10
Match Name        E-value  Identity (%)  Description
AT5G37800.1       3.2e-30  38.83         RHD SIX-LIKE 1
AT1G66470.1       2.6e-27  38.89         ROOT HAIR DEFECTIVE6
AT3G50330.1       2.5e-06  69.70         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G67060.1       2.5e-06  69.70         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G33880.1       4.2e-06  33.62         ROOT HAIR DEFECTIVE 6-LIKE 2

Database: NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|449445206|ref|XP_004140364.1|  1.2e-79  65.12         PREDICTED: putative transcription factor bHLH086 [Cucumis sativus]
gi|659121136|ref|XP_008460515.1|  8.2e-76  63.55         PREDICTED: putative transcription factor bHLH086 [Cucumis melo]
gi|567876091|ref|XP_006430635.1|  2.3e-46  47.90         hypothetical protein CICLE_v10013420mg [Citrus clementina]
gi|641844201|gb|KDO63095.1|       5.2e-46  47.55         hypothetical protein CISIN_1g038918mg [Citrus sinensis]
gi|568857667|ref|XP_006482386.1|  1.1e-45  46.85         PREDICTED: transcription factor bHLH83-like [Citrus sinensis]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity
Vocabulary: INTERPRO
Term        Definition
IPR011598   bHLH_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0046983      protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g20040.1  Cp4.1LG01g20040.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                 Source   Source Term        Source Description                        Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10  -                                         coord: 185..254; score: 1.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PROFILE  PS50888            BHLH                                      coord: 188..251; score: 11
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain  coord: 186..255; score: 2.0
None       No IPR available                                PANTHER  PTHR16223          FAMILY NOT NAMED                          coord: 49..254; score: 1.8
None       No IPR available                                PANTHER  PTHR16223:SF19     TRANSCRIPTION FACTOR BHLH83-RELATED       coord: 49..254; score: 1.8
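
The three IPR011598 (bHLH) hits above report slightly different residue ranges. As a rough sketch, assuming the "coord: start..end" values are 1-based inclusive protein positions (not stated in the record), the region supported by all three sources can be taken as the intersection of the ranges. The helper names parse_coord and common_span are hypothetical.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 185..254'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(m.group(1)), int(m.group(2))


def common_span(spans):
    """Return the interval shared by every span, or None if they
    do not all overlap."""
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None


# Ranges of the GENE3D, PROFILE and SSF47459 hits listed above.
bhlh_hits = [
    parse_coord(c)
    for c in ("coord: 185..254", "coord: 188..251", "coord: 186..255")
]
shared = common_span(bhlh_hits)
print(shared, shared[1] - shared[0] + 1)
```

With these inputs the shared interval is residues 188 to 251, i.e. 64 residues, which matches the PROFILE (PS50888) hit because it is the narrowest of the three.
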