Csa3G116640.1 (mRNA) Cucumber (Chinese Long) v2

Name: Csa3G116640.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Transcription factor; contains IPR011598 (Myc-type, basic helix-loop-helix (bHLH) domain)
Location: Chr3 : 6197532 .. 6200142 (+)
Sequence length: 1077
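
The location field above packs chromosome, start, end and strand into one string. A minimal Python sketch of how it might be split and the genomic span computed is given here. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Pattern for strings shaped like "Chr3 : 6197532 .. 6200142 (+)".
LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand and span.

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "span": end - start + 1,
    }


loc = parse_location("Chr3 : 6197532 .. 6200142 (+)")
print(loc["chrom"], loc["strand"], loc["span"])
```

Under that assumption the genomic span comes out longer than the listed sequence length of 1077, which fits a spliced transcript whose gene region also contains introns and UTRs.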
The following sequences are available for this feature:

Gene sequence (with intron)

CCCACAACTGTCTCTCTCCGCTGCTGGACAAAACTGTCACCGCCAATAATGCTTGGGGCATTCTAATTCAAACTAACACGCCCTTCTTTCCTCTTATCTCTGCTTTCGGATCCATCAACTTGGTTTGAGTTTGAGCTTCAAGTATATGCGAGACTTGTAGTATATCTTCTTGAACTCTCTATCTATTTATTTATCGCCCTTTGTTTGTCTTTACTGCCTATTCCACCTCCTCTTCATCACTGTTTAACCTCTCACCTTCCTCTTCGGAGAACAACATTCTTCTCCTCTAGCCATAGCATTTAATATTATTCTATAAAGAGTATCAACTGCTCCCAAAACGAAGGAATGTGTTTTGGGCTGATATGGGGTCTCCTTCTGCTCATTGGGTTTTTCATTTAATGCTATTGTTTTGTTAGCATCACTGTGGAATGCATGTTGTTACCTCTAGACAGAGAAGTAGAAAGGCTAAATTAACACACTTCAGAACTGCTCTGCCGTTCAAATGGAGAGGCTCCAAGGACCCATTAATCCTTGCGTATGATAAATGCTATCCCTCTCTGTTTCTATTTATTTCCGTTTTCCTTTTTCATTGTCACCCACTTTTAGAATTTTCTTTTCAGAACATGTGCATTACATTCATTTAATTCTTCAAATGTTTTCCATTGAATGGCTTTGTGTAATTTCTCCAGGATGGATATTAGAATATCTACTTTGTACTTACAATGAACATGCTTTGTTTCTTATTATATATTTCTCTTATCACTCTGGATTCTCTTCTTTTCAGTTTTATGGTGAATATTCAGAGACAGGTTGCTCGGAACAAGAATTCACAAACTTGGGATTTGAAGAATCAGAAGAAGTTTGTTTACTAACCTCAAGTTTGGAAGATAAAATACCATTCCTTCAGATGCTGCAGAGTGTGGAATCGCAATCATTCAAGGAACCTAACTTTCAAAGCTTGCTGAAGCTGCAGCACCTAACCAAACCATGGGAAGGGGGGGTTAATAAAATTCAGGAGCTTGTACAGTTGTTTTCTTCACCAATAAACTCAGAAACGAAGGACCAAAATCAACCTCCAAAGTCGGACAGAGTGTTCTCAGAGTGTAACCAAAATCAAGGCATATCCCAGACCCAAATGACAAAGGCTCCTCCAGTCATCAAGGAAAGAAGAAAACGAAAGAGATCCAAACCAACAAAAAACAAGGAAGAAGTAGAGTGCCAAAGAATGACCCACATTGCTGTTGAGCGCAACCGGAGACGGCAAATGAACGACCATCTCAACGTTATCAAGTCTCTCATACCTACGTCCTATGTACAAAGGGTATATACTAAAACAAAATATCGAAATTTTATTTTTGAAAGATCACGTTGTTTTCTCCAAAACTGATTAATTAAGAATGTCACATGTTGATTTGACCCGTTTTTTTTTGTGTACATGACCAGGGTGACCAGGCATCCATAATTGGGGGCGCAATAGACTTCGTGAAGGAATTGGAGCAGCTACTTGAATCTTTGGAAGCACTGAGGAAAGAAAGGAAGGGAGCGGAAGGTGAGTGTAAGGGTGAGCAGTCAGAAGTGCGAGTGGCATCAAATAGGAGAATAGGAGAAGGGGTTTGCGCCGAGCTCAGGTCAGAAGTGGCTGAGATTGAGGTTACAATGATTCAAACCCATGTAAACTTAAAGATAAGGTGCCCCAAAAGGCAAGATCAGTTGTTGAAAGTCATTGTTGCTTTGGAAGATCTTAGGCTCACAGTTTTGCATCTCAACATTACCTCACAAACCGCTGCCACCATGCTTTACTCCTTCAATCTAAAGGTATTGCTCTGCTATTATTTACTATTGCCCTTTTCTTTTCTTCTTTAGCGTTTGGATAGAGGGGTGGAAAAGTAGACTACTACTCTGTGTCCTCCGTCTGTAGTCAACCTCAACTAGTCTGACTTTGGAACTCAAGCACAGTGCTTCTAGTTTCAACTTCCAAGACGCCGGCTCTGTTCCTACTT
CTTTTGCTTTTTGTTTTTTCTAAAAAAAATATTTAACAATATTTCAGTGCAAGACTCATCTTTTTCCCACAGAAAAGAGCATGCATTATTTATAATTACATGTACCATTCCTGTGGGTAAGTTGGCTAAGAAATATGATGATCTTTGATTCGGTTGAGAGTAAGATTGTTTTAACAAAAGGAGGGTCTTACGTTGGTTGCATGTGAAACCAGATAGAAGATGAATGTAAGCTAGAATCAGAGGAGCAGATTGCAGCAACGGTTAATGAAATATTCAGTTTTATCAACAATGGCAGACTGGTCAATGAGGCAAAGGAAAATTTCAGGCAGTACAGTGGCAGTCGCTGACTATGGAAGAGGACCTTCCCTATTTCCCATTCGAAAAAAGGAAAAATAGAAAAGCCAAGCTCAGGAAATAAGGATGAGGTTAAGTTGGAGTGCGTGTGACTGTGGTCACCTCGAATACTGTTCTTTAGTGTACCACAGATGGGTTGGTCAAATGGTTATGGCAGGCTTAAAACAAGTGTTTGTCGTCTCCTGCTTTACTCAAATTCATAAATCTTTTACAAAATAAATTAAACTACACAGCTTTTTTATATTCCTTTTTTTTCT

mRNA sequence

ATGGAGAGGCTCCAAGGACCCATTAATCCTTGCTTTTATGGTGAATATTCAGAGACAGGTTGCTCGGAACAAGAATTCACAAACTTGGGATTTGAAGAATCAGAAGAAGTTTGTTTACTAACCTCAAGTTTGGAAGATAAAATACCATTCCTTCAGATGCTGCAGAGTGTGGAATCGCAATCATTCAAGGAACCTAACTTTCAAAGCTTGCTGAAGCTGCAGCACCTAACCAAACCATGGGAAGGGGGGGTTAATAAAATTCAGGAGCTTGTACAGTTGTTTTCTTCACCAATAAACTCAGAAACGAAGGACCAAAATCAACCTCCAAAGTCGGACAGAGTGTTCTCAGAGTGTAACCAAAATCAAGGCATATCCCAGACCCAAATGACAAAGGCTCCTCCAGTCATCAAGGAAAGAAGAAAACGAAAGAGATCCAAACCAACAAAAAACAAGGAAGAAGTAGAGTGCCAAAGAATGACCCACATTGCTGTTGAGCGCAACCGGAGACGGCAAATGAACGACCATCTCAACGTTATCAAGTCTCTCATACCTACGTCCTATGTACAAAGGGGTGACCAGGCATCCATAATTGGGGGCGCAATAGACTTCGTGAAGGAATTGGAGCAGCTACTTGAATCTTTGGAAGCACTGAGGAAAGAAAGGAAGGGAGCGGAAGGTGAGTGTAAGGGTGAGCAGTCAGAAGTGCGAGTGGCATCAAATAGGAGAATAGGAGAAGGGGTTTGCGCCGAGCTCAGGTCAGAAGTGGCTGAGATTGAGGTTACAATGATTCAAACCCATGTAAACTTAAAGATAAGGTGCCCCAAAAGGCAAGATCAGTTGTTGAAAGTCATTGTTGCTTTGGAAGATCTTAGGCTCACAGTTTTGCATCTCAACATTACCTCACAAACCGCTGCCACCATGCTTTACTCCTTCAATCTAAAGATAGAAGATGAATGTAAGCTAGAATCAGAGGAGCAGATTGCAGCAACGGTTAATGAAATATTCAGTTTTATCAACAATGGCAGACTGGTCAATGAGGCAAAGGAAAATTTCAGGCAGTACAGTGGCAGTCGCTGA

Coding sequence (CDS)

ATGGAGAGGCTCCAAGGACCCATTAATCCTTGCTTTTATGGTGAATATTCAGAGACAGGTTGCTCGGAACAAGAATTCACAAACTTGGGATTTGAAGAATCAGAAGAAGTTTGTTTACTAACCTCAAGTTTGGAAGATAAAATACCATTCCTTCAGATGCTGCAGAGTGTGGAATCGCAATCATTCAAGGAACCTAACTTTCAAAGCTTGCTGAAGCTGCAGCACCTAACCAAACCATGGGAAGGGGGGGTTAATAAAATTCAGGAGCTTGTACAGTTGTTTTCTTCACCAATAAACTCAGAAACGAAGGACCAAAATCAACCTCCAAAGTCGGACAGAGTGTTCTCAGAGTGTAACCAAAATCAAGGCATATCCCAGACCCAAATGACAAAGGCTCCTCCAGTCATCAAGGAAAGAAGAAAACGAAAGAGATCCAAACCAACAAAAAACAAGGAAGAAGTAGAGTGCCAAAGAATGACCCACATTGCTGTTGAGCGCAACCGGAGACGGCAAATGAACGACCATCTCAACGTTATCAAGTCTCTCATACCTACGTCCTATGTACAAAGGGGTGACCAGGCATCCATAATTGGGGGCGCAATAGACTTCGTGAAGGAATTGGAGCAGCTACTTGAATCTTTGGAAGCACTGAGGAAAGAAAGGAAGGGAGCGGAAGGTGAGTGTAAGGGTGAGCAGTCAGAAGTGCGAGTGGCATCAAATAGGAGAATAGGAGAAGGGGTTTGCGCCGAGCTCAGGTCAGAAGTGGCTGAGATTGAGGTTACAATGATTCAAACCCATGTAAACTTAAAGATAAGGTGCCCCAAAAGGCAAGATCAGTTGTTGAAAGTCATTGTTGCTTTGGAAGATCTTAGGCTCACAGTTTTGCATCTCAACATTACCTCACAAACCGCTGCCACCATGCTTTACTCCTTCAATCTAAAGATAGAAGATGAATGTAAGCTAGAATCAGAGGAGCAGATTGCAGCAACGGTTAATGAAATATTCAGTTTTATCAACAATGGCAGACTGGTCAATGAGGCAAAGGAAAATTTCAGGCAGTACAGTGGCAGTCGCTGA

Protein sequence

MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQSFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQNQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFRQYSGSR*
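
The relationship between the coding sequence and the protein sequence above can be checked with a short Python sketch. It assumes the standard genetic code (the record does not name a translation table), and the helper names `translate` and `protein_length` are illustrative only. Only the first 30 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def protein_length(cds_length):
    """Residue count expected from a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1


# First 30 nucleotides of the CDS listed above.
CDS_PREFIX = "ATGGAGAGGCTCCAAGGACCCATTAATCCT"
print(translate(CDS_PREFIX))   # first ten residues of the protein
print(protein_length(1077))    # residues implied by the 1077-nt CDS
```

The translated prefix matches the start of the protein sequence, and the residue count implied by 1077 nucleotides agrees with the 358-residue self-alignment reported in the BLAST results.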
BLAST of Csa3G116640.1 vs. Swiss-Prot
Match: BH067_ARATH (Transcription factor bHLH67 OS=Arabidopsis thaliana GN=BHLH67 PE=2 SV=1)

HSP 1 Score: 270.0 bits (689), Expect = 3.7e-71
Identity = 162/369 (43.90%), Positives = 226/369 (61.25%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSE----QEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQS 60
           MER QG INPCF+    +    E     E  +  F+E EE      SL+D +PFLQMLQS
Sbjct: 1   MERFQGHINPCFFDRKPDVRSLEVQGFAEAQSFAFKEKEE-----ESLQDTVPFLQMLQS 60

Query: 61  VESQSF---KEPNFQSLLKLQHLTKPWEGGVNKIQELVQL----FSSPINSET----KDQ 120
            +  SF   KEPNF +LL LQ L +PWE     ++  + L    F SP+ SET    +  
Sbjct: 61  EDPSSFFSIKEPNFLTLLSLQTLKEPWE-----LERYLSLEDSQFHSPVQSETNRFMEGA 120

Query: 121 NQPPKSDRV-FSECNQNQGISQTQMTKA-------------PPVIKERRKRKRSKPTKNK 180
           NQ   S  + FS+ N     S +    A               + +E+RKR+++KP+KN 
Sbjct: 121 NQAVSSQEIPFSQANMTLPSSTSSPLSAHSRRKRKINHLLPQEMTREKRKRRKTKPSKNN 180

Query: 181 EEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLL 240
           EE+E QR+ HIAVERNRRRQMN+H+N +++L+P SY+QRGDQASI+GGAI++VK LEQ++
Sbjct: 181 EEIENQRINHIAVERNRRRQMNEHINSLRALLPPSYIQRGDQASIVGGAINYVKVLEQII 240

Query: 241 ESLEALRKERKGAEGECKGEQSEVRVASNRRIGEG-----VCAELRSEVAEIEVTMIQTH 300
           +SLE+ ++ ++ +  E       V  A N   G          E ++ + +IE T+IQ H
Sbjct: 241 QSLESQKRTQQQSNSEV------VENALNHLSGISSNDLWTTLEDQTCIPKIEATVIQNH 300

Query: 301 VNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQ 336
           V+LK++C K+Q QLLK I++LE L+LTVLHLNIT+ + +++ YSFNLK+EDEC LES ++
Sbjct: 301 VSLKVQCEKKQGQLLKGIISLEKLKLTVLHLNITTSSHSSVSYSFNLKMEDECDLESADE 353
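
The percentages in each HSP summary line are the match counts divided by the alignment length. The Python sketch below recomputes them from one summary line. The function name `check_hsp_stats` is hypothetical, and the field labels are matched generically, so the exact spelling of each label does not matter.

```python
import re

# Matches fields such as "Identity = 162/369 (43.90%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_stats(line):
    """Recompute each percentage in an HSP summary line.

    Returns {label: (recomputed_percent, agrees_with_reported)}.
    """
    results = {}
    for label, matches, length, reported in FIELD.findall(line):
        recomputed = f"{100 * int(matches) / int(length):.2f}"
        results[label] = (recomputed, recomputed == reported)
    return results


stats = check_hsp_stats(
    "Identity = 162/369 (43.90%), Positives = 226/369 (61.25%), Query Frame = 1"
)
for label, (percent, agrees) in stats.items():
    print(label, percent, agrees)
```

The denominator is the alignment length including gap columns, not the length of either protein, which is why it can exceed the 358 residues of the query.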

BLAST of Csa3G116640.1 vs. Swiss-Prot
Match: BH070_ARATH (Transcription factor bHLH70 OS=Arabidopsis thaliana GN=BHLH70 PE=2 SV=1)

HSP 1 Score: 264.6 bits (675), Expect = 1.6e-69
Identity = 158/330 (47.88%), Positives = 210/330 (63.64%), Query Frame = 1

Query: 24  QEFTNLGFEESEEVCLLTSSLEDK-IPFLQMLQSVESQS----FKEPNFQSLLKLQHLTK 83
           ++  +   EE E+     S L+D  IPFLQMLQ  E  S    FK+P+F +LL LQ L K
Sbjct: 43  EDHQSFALEEEEQQLSTPSLLQDTTIPFLQMLQQSEDPSPFLSFKDPSFLALLSLQTLEK 102

Query: 84  PWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQNQGISQTQMTKAPP---- 143
           PWE   N +   V  F SPI+SET      P  + V +E   NQ +    +  A      
Sbjct: 103 PWELE-NYLPHEVPEFHSPIHSETNHYYHNPSLEGV-NEAISNQELPFNPLENARSRRKR 162

Query: 144 --------VIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTS 203
                   + +E+RKR+R+KPTKN EE+E QRMTHIAVERNRRRQMN HLN ++S+IP+S
Sbjct: 163 KNNNLASLMTREKRKRRRTKPTKNIEEIESQRMTHIAVERNRRRQMNVHLNSLRSIIPSS 222

Query: 204 YVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEG-ECKGEQSEVRVASNRRIGE 263
           Y+QRGDQASI+GGAIDFVK LEQ L+SLEA ++ ++  +  E   E + +R  S+ ++  
Sbjct: 223 YIQRGDQASIVGGAIDFVKILEQQLQSLEAQKRSQQSDDNKEQIPEDNSLRNISSNKL-R 282

Query: 264 GVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAA 323
               E +S   +IE T+I++HVNLKI+C ++Q QLL+ I+ LE LR TVLHLNITS T  
Sbjct: 283 ASNKEEQSSKLKIEATVIESHVNLKIQCTRKQGQLLRSIILLEKLRFTVLHLNITSPTNT 342

Query: 324 TMLYSFNLKIEDECKLESEEQIAATVNEIF 336
           ++ YSFNLK+EDEC L S ++I A + +IF
Sbjct: 343 SVSYSFNLKMEDECNLGSADEITAAIRQIF 369

BLAST of Csa3G116640.1 vs. Swiss-Prot
Match: BH057_ARATH (Transcription factor bHLH57 OS=Arabidopsis thaliana GN=BHLH57 PE=2 SV=1)

HSP 1 Score: 223.8 bits (569), Expect = 3.0e-57
Identity = 139/312 (44.55%), Positives = 189/312 (60.58%), Query Frame = 1

Query: 42  SSLEDKIPFLQMLQSVESQ-SFKEPN--FQSLLKLQHLTKPWEGGVNKIQELVQLFSSPI 101
           +++E+KIPFLQMLQ +E   +  EPN   QSLL++Q L                   S +
Sbjct: 20  TTMEEKIPFLQMLQCIEHPFTTTEPNQFLQSLLQIQTLES----------------KSCL 79

Query: 102 NSETKDQNQPPKSDRVFSECNQNQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQR 161
             ET  +  P ++D    +     G            +KE+RKRKR++  KNK+EVE QR
Sbjct: 80  TLETNIKRDPGQTDDPEKDPRTENGAV---------TVKEKRKRKRTRAPKNKDEVENQR 139

Query: 162 MTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALR 221
           MTHIAVERNRRRQMN+HLN ++SL+P S++QRGDQASI+GGAIDF+KELEQLL+SLEA  
Sbjct: 140 MTHIAVERNRRRQMNEHLNSLRSLMPPSFLQRGDQASIVGGAIDFIKELEQLLQSLEA-E 199

Query: 222 KERKGAEGECKG---EQSEVRVASNRRIG-------EGVCAEL-RSEVAEIEVTMIQTHV 281
           K + G +   K      S     +N  I         G  A     +  E+E T+IQ HV
Sbjct: 200 KRKDGTDETPKTASCSSSSSLACTNSSISSVSTTSENGFTARFGGGDTTEVEATVIQNHV 259

Query: 282 NLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQI 340
           +LK+RC + + Q+LK IV++E+L+L +LHL I+S +   ++YSFNLK+ED CKL S ++I
Sbjct: 260 SLKVRCKRGKRQILKAIVSIEELKLAILHLTISS-SFDFVIYSFNLKMEDGCKLGSADEI 304

BLAST of Csa3G116640.1 vs. Swiss-Prot
Match: BH094_ARATH (Transcription factor bHLH94 OS=Arabidopsis thaliana GN=BHLH94 PE=2 SV=2)

HSP 1 Score: 176.4 bits (446), Expect = 5.6e-43
Identity = 97/218 (44.50%), Positives = 143/218 (65.60%), Query Frame = 1

Query: 123 GISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSL 182
           G++   +   PP  + RRKR+R++  KNKEE+E QRMTHIAVERNRR+QMN++L V++SL
Sbjct: 80  GLTAIDVESHPPP-QHRRKRRRTRNCKNKEEIENQRMTHIAVERNRRKQMNEYLAVLRSL 139

Query: 183 IPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN-- 242
           +P+SY QRGDQASI+GGAI++VKELE +L+S+E  R      +G+     S V   ++  
Sbjct: 140 MPSSYAQRGDQASIVGGAINYVKELEHILQSMEPKRTRTHDPKGDKTSTSSLVGPFTDFF 199

Query: 243 -----RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVL 302
                         E  S  AEIEVT+ ++H N+KI   K+  QLLK+I +L+ LRLT+L
Sbjct: 200 SFPQYSTKSSSDVPESSSSPAEIEVTVAESHANIKIMTKKKPRQLLKLITSLQSLRLTLL 259

Query: 303 HLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNE 334
           HLN+T+    ++LYS ++++E+  +L + + IA  +N+
Sbjct: 260 HLNVTT-LHNSILYSISVRVEEGSQLNTVDDIATALNQ 295

BLAST of Csa3G116640.1 vs. Swiss-Prot
Match: FAMA_ARATH (Transcription factor FAMA OS=Arabidopsis thaliana GN=FAMA PE=1 SV=1)

HSP 1 Score: 176.4 bits (446), Expect = 5.6e-43
Identity = 105/225 (46.67%), Positives = 146/225 (64.89%), Query Frame = 1

Query: 139 RRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIG 198
           + KRKR++ +K  EEVE QRMTHIAVERNRR+QMN+HL V++SL+P SYVQRGDQASIIG
Sbjct: 177 KSKRKRARTSKTSEEVESQRMTHIAVERNRRKQMNEHLRVLRSLMPGSYVQRGDQASIIG 236

Query: 199 GAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASNRRI--------------- 258
           GAI+FV+ELEQLL+ LE+  ++R+   GE   + +    +S+  I               
Sbjct: 237 GAIEFVRELEQLLQCLES--QKRRRILGETGRDMTTTTTSSSSPITTVANQAQPLIITGN 296

Query: 259 ------GEGV---CAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTV 318
                 G G+    AE +S +A++EV ++     +KI   +R  QL+K I ALEDL L++
Sbjct: 297 VTELEGGGGLREETAENKSCLADVEVKLLGFDAMIKILSRRRPGQLIKTIAALEDLHLSI 356

Query: 319 LHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFIN 340
           LH NIT+    T+LYSFN+KI  E +  +E+ IA+++ +IFSFI+
Sbjct: 357 LHTNITTM-EQTVLYSFNVKITSETRFTAED-IASSIQQIFSFIH 397

BLAST of Csa3G116640.1 vs. TrEMBL
Match: A0A0A0L6W2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G116640 PE=4 SV=1)

HSP 1 Score: 706.4 bits (1822), Expect = 1.7e-200
Identity = 358/358 (100.00%), Positives = 358/358 (100.00%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60
           MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ
Sbjct: 1   MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60

Query: 61  SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ 120
           SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ
Sbjct: 61  SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ 120

Query: 121 NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK 180
           NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK
Sbjct: 121 NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK 180

Query: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN 240
           SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN
Sbjct: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN 240

Query: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT 300
           RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT
Sbjct: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT 300

Query: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFRQYSGSR 359
           SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFRQYSGSR
Sbjct: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFRQYSGSR 358

BLAST of Csa3G116640.1 vs. TrEMBL
Match: A0A0B2PFM9_GLYSO (Transcription factor bHLH70 OS=Glycine soja GN=glysoja_021210 PE=4 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 1.4e-93
Identity = 209/386 (54.15%), Positives = 256/386 (66.32%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSEQEFTN---LGFEESEEVCLLTSSLEDKIPFLQMLQSV 60
           MERLQGP+N CF+G+  E  C +Q   +   L  EE E+   L SSLED +PFLQMLQSV
Sbjct: 1   MERLQGPLNSCFFGDPLEVNCLDQVLVDEESLRLEEEEQ--FLISSLEDNMPFLQMLQSV 60

Query: 61  ESQSF---KEPNFQSLLKLQHLTKPWEGGVN------KIQELVQLFS----------SPI 120
           ES  F   KEPNFQ+LL+LQH+ KPWEG         ++Q  ++L S          SP+
Sbjct: 61  ESPQFFPLKEPNFQTLLRLQHMKKPWEGIAYIPRMEAQVQAALELESCVTHDMLEMQSPV 120

Query: 121 NSETKDQNQPPKS---DRVFSECNQN-QGISQTQMTKAPPVIKERRKRKRSKPTKNKEEV 180
            SE+ +   P      ++V  ECNQ  Q +SQT     P   KERRKRKR++P+KNKE+V
Sbjct: 121 KSESNELQHPLSISCFEKVNYECNQEPQKVSQTCPKSQPTATKERRKRKRTRPSKNKEDV 180

Query: 181 ECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESL 240
           E QRMTHIAVERNRRRQMNDHL+V++SL+P SY+QRGDQASIIGGAIDFVKELEQLL+SL
Sbjct: 181 ENQRMTHIAVERNRRRQMNDHLSVLRSLMPPSYIQRGDQASIIGGAIDFVKELEQLLQSL 240

Query: 241 EALRKERKGAEGE----------CK-----------GEQSEVRVASNRRIGEGVCAELRS 300
           EA ++ RK  EG           CK           G       +     G+ V AE +S
Sbjct: 241 EAQKRMRKNEEGGGGSSSSSTMLCKPPPPSSLSSPHGYGMRSSTSDEVNCGDEVKAENKS 300

Query: 301 EVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNL 340
           E A+I+VT+IQTHVNLKI C +R  QLLKVIVALEDLRLT+LHLNITS +  ++LYS NL
Sbjct: 301 EAADIKVTLIQTHVNLKIECQRRPGQLLKVIVALEDLRLTILHLNITS-SETSVLYSLNL 360

BLAST of Csa3G116640.1 vs. TrEMBL
Match: I1KHN8_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_07G049100 PE=4 SV=1)

HSP 1 Score: 350.5 bits (898), Expect = 2.4e-93
Identity = 208/385 (54.03%), Positives = 256/385 (66.49%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSEQEFTN---LGFEESEEVCLLTSSLEDKIPFLQMLQSV 60
           MERLQGP+N CF+G+  E  C +Q   +   L  EE E+   L SSLED +PFLQMLQSV
Sbjct: 1   MERLQGPLNSCFFGDPLEVNCLDQVLVDEESLRLEEEEQ--FLISSLEDNMPFLQMLQSV 60

Query: 61  ESQSF---KEPNFQSLLKLQHLTKPWEGGVN------KIQELVQLFS----------SPI 120
           ES  F   KEPNFQ+LL+LQH+ KPWEG         ++Q  ++L S          SP+
Sbjct: 61  ESPQFFPLKEPNFQTLLRLQHMKKPWEGIAYIPRMEAQVQAALELESCVTHDMLEMQSPV 120

Query: 121 NSETKDQNQPPKS---DRVFSECNQN-QGISQTQMTKAPPVIKERRKRKRSKPTKNKEEV 180
            SE+ +   P      ++V  ECNQ  Q +SQT     P   +ERRKRKR++P+KNKE+V
Sbjct: 121 KSESNELQHPLSISCFEKVNYECNQEPQKVSQTCPKSQPAATRERRKRKRTRPSKNKEDV 180

Query: 181 ECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESL 240
           E QRMTHIAVERNRRRQMNDHL+V++SL+P SY+QRGDQASIIGGAIDFVKELEQLL+SL
Sbjct: 181 ENQRMTHIAVERNRRRQMNDHLSVLRSLMPPSYIQRGDQASIIGGAIDFVKELEQLLQSL 240

Query: 241 EALRKERKGAEGE---------CK-----------GEQSEVRVASNRRIGEGVCAELRSE 300
           EA ++ RK  EG          CK           G       +     G+ V AE +SE
Sbjct: 241 EAQKRMRKNEEGGGGSSSSTMLCKPPPPSSLSSPHGYGMRSSTSDEVNCGDEVKAENKSE 300

Query: 301 VAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLK 340
            A+I+VT+IQTHVNLKI C +R  QLLKVIVALEDLRLT+LHLNITS +  ++LYS NLK
Sbjct: 301 AADIKVTLIQTHVNLKIECQRRPGQLLKVIVALEDLRLTILHLNITS-SETSVLYSLNLK 360

BLAST of Csa3G116640.1 vs. TrEMBL
Match: M5WZZ7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006041mg PE=4 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 2.2e-91
Identity = 211/430 (49.07%), Positives = 269/430 (62.56%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSEQEFT---NLGFEESEEVCLLT-SSLEDKIPFLQMLQS 60
           MERLQGPI+PCF+GE+ +  C EQ  +   +L FEE EE   L+  SLEDK+PFLQMLQ+
Sbjct: 1   MERLQGPIDPCFFGEHLDFQCLEQGLSTTESLRFEEEEEAAHLSIPSLEDKMPFLQMLQT 60

Query: 61  VESQ----SFKEPNFQSLLKLQHLTKPWEGG-------VNKIQELVQLFS---------- 120
           V S       KEP+FQ+LL+L HL  PWE G         ++Q  +++ S          
Sbjct: 61  VNSPPPYFPLKEPSFQALLRLHHLKNPWELGKAYMPEMETQLQTALEIESCVTHDMVELH 120

Query: 121 SPINSETKDQNQPPKS------DRVFSECNQNQ--------------------------- 180
           SP+ SE KD +  P S      + V SEC Q+Q                           
Sbjct: 121 SPVKSEAKDLHNHPHSVSAGNLEAVSSECIQDQEQPNSAEINCCRKGNNSSGSPPPTWAQ 180

Query: 181 ---GISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVI 240
                 QTQ  K+PPV +ERRKRKR++PTKNKEEVE QRMTHIAVERNRRRQMNDHLNV+
Sbjct: 181 AQNEPEQTQYPKSPPVTRERRKRKRTRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVL 240

Query: 241 KSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQ------- 300
           +SL+PTSY+QRGDQASI+GGAIDFVKELEQLL+SLEA ++ R+  +G   G+        
Sbjct: 241 RSLMPTSYIQRGDQASIVGGAIDFVKELEQLLQSLEAQKRMRRADQGS-NGDNSFSSSSS 300

Query: 301 -----------------SEVRVASNR------RIGEGVCAELRSEVAEIEVTMIQTHVNL 340
                            S+ R+ S+        + + V A+ +SE A+I+VT+IQTHVNL
Sbjct: 301 SSSASMAMPSNGMFMSLSQCRIGSHEEGTTTTHLEDEVTAQNKSEAADIDVTVIQTHVNL 360

BLAST of Csa3G116640.1 vs. TrEMBL
Match: A0A061DS78_THECC (Basic helix-loop-helix DNA-binding superfamily protein, putative OS=Theobroma cacao GN=TCM_005028 PE=4 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 5.0e-91
Identity = 209/390 (53.59%), Positives = 256/390 (65.64%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSEQEFTN---LGFEESEEVCLLTSSLEDKIPFLQMLQSV 60
           MERLQGPINPCF  E+ E    EQ F N   L F E EE      SLEDK+PFLQMLQSV
Sbjct: 1   MERLQGPINPCFLEEHLEVEFLEQGFVNSESLRFGEEEEAHFSIPSLEDKMPFLQMLQSV 60

Query: 61  ESQ---SFKEPNFQSLLKLQHLTKPWEGGVN--------KIQELV-------QLFS--SP 120
           +S    +FKEPNFQ+LL+LQHL KPWE   N        +IQ L        ++F   SP
Sbjct: 61  QSPQLFAFKEPNFQTLLRLQHLKKPWEINNNPFIPEMETQIQALELESCVTHEIFDLQSP 120

Query: 121 INSETKDQNQPPKSDRVF----SECNQNQGISQT----------------QMTKAPPVIK 180
           + SETKD  + P S   F    +E NQ+Q  S T                  TK+PP+ +
Sbjct: 121 VQSETKDLKKNPHSISCFEVVSAESNQDQPKSATADNCSREGNSGSSPPKSFTKSPPITR 180

Query: 181 ERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASII 240
           ERRKRKR++P KNKEEVE QRMTHIAVERNRRRQMND+LN ++SL+P SY+QRGDQASII
Sbjct: 181 ERRKRKRTRPAKNKEEVESQRMTHIAVERNRRRQMNDYLNSLRSLMPPSYIQRGDQASII 240

Query: 241 GGAIDFVKELEQLLESLEALRKERK----------GAEGECKGEQSEVRVAS-NRRIGEG 300
           GGAIDFVKELEQLL+SLEA ++ R+           A+   +  Q E  + S +   G+ 
Sbjct: 241 GGAIDFVKELEQLLQSLEAQKRMRRIEESSNSNNSVAKSAMEISQPETGMGSEDGNCGKE 300

Query: 301 VCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAAT 337
           + AE +S  AEIEV +   HVNLKI+C +R  QLL+ IV LE LRLTVLHLNITS + A+
Sbjct: 301 IKAESKSGAAEIEVNVTHNHVNLKIQCSRRPGQLLQAIVTLESLRLTVLHLNITS-SQAS 360

BLAST of Csa3G116640.1 vs. TAIR10
Match: AT3G61950.1 (AT3G61950.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 270.0 bits (689), Expect = 2.1e-72
Identity = 162/369 (43.90%), Positives = 226/369 (61.25%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSE----QEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQS 60
           MER QG INPCF+    +    E     E  +  F+E EE      SL+D +PFLQMLQS
Sbjct: 1   MERFQGHINPCFFDRKPDVRSLEVQGFAEAQSFAFKEKEE-----ESLQDTVPFLQMLQS 60

Query: 61  VESQSF---KEPNFQSLLKLQHLTKPWEGGVNKIQELVQL----FSSPINSET----KDQ 120
            +  SF   KEPNF +LL LQ L +PWE     ++  + L    F SP+ SET    +  
Sbjct: 61  EDPSSFFSIKEPNFLTLLSLQTLKEPWE-----LERYLSLEDSQFHSPVQSETNRFMEGA 120

Query: 121 NQPPKSDRV-FSECNQNQGISQTQMTKA-------------PPVIKERRKRKRSKPTKNK 180
           NQ   S  + FS+ N     S +    A               + +E+RKR+++KP+KN 
Sbjct: 121 NQAVSSQEIPFSQANMTLPSSTSSPLSAHSRRKRKINHLLPQEMTREKRKRRKTKPSKNN 180

Query: 181 EEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLL 240
           EE+E QR+ HIAVERNRRRQMN+H+N +++L+P SY+QRGDQASI+GGAI++VK LEQ++
Sbjct: 181 EEIENQRINHIAVERNRRRQMNEHINSLRALLPPSYIQRGDQASIVGGAINYVKVLEQII 240

Query: 241 ESLEALRKERKGAEGECKGEQSEVRVASNRRIGEG-----VCAELRSEVAEIEVTMIQTH 300
           +SLE+ ++ ++ +  E       V  A N   G          E ++ + +IE T+IQ H
Sbjct: 241 QSLESQKRTQQQSNSEV------VENALNHLSGISSNDLWTTLEDQTCIPKIEATVIQNH 300

Query: 301 VNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQ 336
           V+LK++C K+Q QLLK I++LE L+LTVLHLNIT+ + +++ YSFNLK+EDEC LES ++
Sbjct: 301 VSLKVQCEKKQGQLLKGIISLEKLKLTVLHLNITTSSHSSVSYSFNLKMEDECDLESADE 353

BLAST of Csa3G116640.1 vs. TAIR10
Match: AT2G46810.1 (AT2G46810.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 264.6 bits (675), Expect = 8.8e-71
Identity = 158/330 (47.88%), Positives = 210/330 (63.64%), Query Frame = 1

Query: 24  QEFTNLGFEESEEVCLLTSSLEDK-IPFLQMLQSVESQS----FKEPNFQSLLKLQHLTK 83
           ++  +   EE E+     S L+D  IPFLQMLQ  E  S    FK+P+F +LL LQ L K
Sbjct: 43  EDHQSFALEEEEQQLSTPSLLQDTTIPFLQMLQQSEDPSPFLSFKDPSFLALLSLQTLEK 102

Query: 84  PWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQNQGISQTQMTKAPP---- 143
           PWE   N +   V  F SPI+SET      P  + V +E   NQ +    +  A      
Sbjct: 103 PWELE-NYLPHEVPEFHSPIHSETNHYYHNPSLEGV-NEAISNQELPFNPLENARSRRKR 162

Query: 144 --------VIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTS 203
                   + +E+RKR+R+KPTKN EE+E QRMTHIAVERNRRRQMN HLN ++S+IP+S
Sbjct: 163 KNNNLASLMTREKRKRRRTKPTKNIEEIESQRMTHIAVERNRRRQMNVHLNSLRSIIPSS 222

Query: 204 YVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEG-ECKGEQSEVRVASNRRIGE 263
           Y+QRGDQASI+GGAIDFVK LEQ L+SLEA ++ ++  +  E   E + +R  S+ ++  
Sbjct: 223 YIQRGDQASIVGGAIDFVKILEQQLQSLEAQKRSQQSDDNKEQIPEDNSLRNISSNKL-R 282

Query: 264 GVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAA 323
               E +S   +IE T+I++HVNLKI+C ++Q QLL+ I+ LE LR TVLHLNITS T  
Sbjct: 283 ASNKEEQSSKLKIEATVIESHVNLKIQCTRKQGQLLRSIILLEKLRFTVLHLNITSPTNT 342

Query: 324 TMLYSFNLKIEDECKLESEEQIAATVNEIF 336
           ++ YSFNLK+EDEC L S ++I A + +IF
Sbjct: 343 SVSYSFNLKMEDECNLGSADEITAAIRQIF 369

BLAST of Csa3G116640.1 vs. TAIR10
Match: AT4G01460.1 (AT4G01460.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 223.8 bits (569), Expect = 1.7e-58
Identity = 139/312 (44.55%), Positives = 189/312 (60.58%), Query Frame = 1

Query: 42  SSLEDKIPFLQMLQSVESQ-SFKEPN--FQSLLKLQHLTKPWEGGVNKIQELVQLFSSPI 101
           +++E+KIPFLQMLQ +E   +  EPN   QSLL++Q L                   S +
Sbjct: 20  TTMEEKIPFLQMLQCIEHPFTTTEPNQFLQSLLQIQTLES----------------KSCL 79

Query: 102 NSETKDQNQPPKSDRVFSECNQNQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQR 161
             ET  +  P ++D    +     G            +KE+RKRKR++  KNK+EVE QR
Sbjct: 80  TLETNIKRDPGQTDDPEKDPRTENGAV---------TVKEKRKRKRTRAPKNKDEVENQR 139

Query: 162 MTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALR 221
           MTHIAVERNRRRQMN+HLN ++SL+P S++QRGDQASI+GGAIDF+KELEQLL+SLEA  
Sbjct: 140 MTHIAVERNRRRQMNEHLNSLRSLMPPSFLQRGDQASIVGGAIDFIKELEQLLQSLEA-E 199

Query: 222 KERKGAEGECKG---EQSEVRVASNRRIG-------EGVCAEL-RSEVAEIEVTMIQTHV 281
           K + G +   K      S     +N  I         G  A     +  E+E T+IQ HV
Sbjct: 200 KRKDGTDETPKTASCSSSSSLACTNSSISSVSTTSENGFTARFGGGDTTEVEATVIQNHV 259

Query: 282 NLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNLKIEDECKLESEEQI 340
           +LK+RC + + Q+LK IV++E+L+L +LHL I+S +   ++YSFNLK+ED CKL S ++I
Sbjct: 260 SLKVRCKRGKRQILKAIVSIEELKLAILHLTISS-SFDFVIYSFNLKMEDGCKLGSADEI 304

BLAST of Csa3G116640.1 vs. TAIR10
Match: AT1G22490.1 (AT1G22490.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 176.4 bits (446), Expect = 3.1e-44
Identity = 97/218 (44.50%), Positives = 143/218 (65.60%), Query Frame = 1

Query: 123 GISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSL 182
           G++   +   PP  + RRKR+R++  KNKEE+E QRMTHIAVERNRR+QMN++L V++SL
Sbjct: 80  GLTAIDVESHPPP-QHRRKRRRTRNCKNKEEIENQRMTHIAVERNRRKQMNEYLAVLRSL 139

Query: 183 IPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN-- 242
           +P+SY QRGDQASI+GGAI++VKELE +L+S+E  R      +G+     S V   ++  
Sbjct: 140 MPSSYAQRGDQASIVGGAINYVKELEHILQSMEPKRTRTHDPKGDKTSTSSLVGPFTDFF 199

Query: 243 -----RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVL 302
                         E  S  AEIEVT+ ++H N+KI   K+  QLLK+I +L+ LRLT+L
Sbjct: 200 SFPQYSTKSSSDVPESSSSPAEIEVTVAESHANIKIMTKKKPRQLLKLITSLQSLRLTLL 259

Query: 303 HLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNE 334
           HLN+T+    ++LYS ++++E+  +L + + IA  +N+
Sbjct: 260 HLNVTT-LHNSILYSISVRVEEGSQLNTVDDIATALNQ 295

BLAST of Csa3G116640.1 vs. TAIR10
Match: AT3G24140.1 (AT3G24140.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 176.4 bits (446), Expect = 3.1e-44
Identity = 105/225 (46.67%), Positives = 146/225 (64.89%), Query Frame = 1

Query: 139 RRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIG 198
           + KRKR++ +K  EEVE QRMTHIAVERNRR+QMN+HL V++SL+P SYVQRGDQASIIG
Sbjct: 177 KSKRKRARTSKTSEEVESQRMTHIAVERNRRKQMNEHLRVLRSLMPGSYVQRGDQASIIG 236

Query: 199 GAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASNRRI--------------- 258
           GAI+FV+ELEQLL+ LE+  ++R+   GE   + +    +S+  I               
Sbjct: 237 GAIEFVRELEQLLQCLES--QKRRRILGETGRDMTTTTTSSSSPITTVANQAQPLIITGN 296

Query: 259 ------GEGV---CAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTV 318
                 G G+    AE +S +A++EV ++     +KI   +R  QL+K I ALEDL L++
Sbjct: 297 VTELEGGGGLREETAENKSCLADVEVKLLGFDAMIKILSRRRPGQLIKTIAALEDLHLSI 356

Query: 319 LHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFIN 340
           LH NIT+    T+LYSFN+KI  E +  +E+ IA+++ +IFSFI+
Sbjct: 357 LHTNITTM-EQTVLYSFNVKITSETRFTAED-IASSIQQIFSFIH 397

BLAST of Csa3G116640.1 vs. NCBI nr
Match: gi|449432974|ref|XP_004134273.1| (PREDICTED: transcription factor bHLH67 [Cucumis sativus])

HSP 1 Score: 706.4 bits (1822), Expect = 2.5e-200
Identity = 358/358 (100.00%), Positives = 358/358 (100.00%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60
           MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ
Sbjct: 1   MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60

Query: 61  SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ 120
           SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ
Sbjct: 61  SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ 120

Query: 121 NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK 180
           NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK
Sbjct: 121 NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK 180

Query: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN 240
           SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN
Sbjct: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN 240

Query: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT 300
           RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT
Sbjct: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT 300

Query: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFRQYSGSR 359
           SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFRQYSGSR
Sbjct: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFRQYSGSR 358

BLAST of Csa3G116640.1 vs. NCBI nr
Match: gi|659074697|ref|XP_008437748.1| (PREDICTED: transcription factor bHLH67 isoform X1 [Cucumis melo])

HSP 1 Score: 656.0 bits (1691), Expect = 3.8e-185
Identity = 334/352 (94.89%), Positives = 341/352 (96.88%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSEQEFTNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60
           MERLQGPINPC YGEYSETGCSEQEF+NLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ
Sbjct: 1   MERLQGPINPCLYGEYSETGCSEQEFSNLGFEESEEVCLLTSSLEDKIPFLQMLQSVESQ 60

Query: 61  SFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSDRVFSECNQ 120
           SFKEPNFQSLLKLQHL KPWE GV+KIQELV+LFSSPINSETKDQNQPP SDRVFSECNQ
Sbjct: 61  SFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSDRVFSECNQ 120

Query: 121 NQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIK 180
           NQG+SQTQMTK PPVIKERRKRKRSKPTKNKEEVE QRMTHIAVERNRRRQMNDHLNVIK
Sbjct: 121 NQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQMNDHLNVIK 180

Query: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASN 240
           SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECK EQSEVRVASN
Sbjct: 181 SLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQSEVRVASN 240

Query: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNIT 300
           RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRC KRQ QLLKVIVALEDLRLTVLHLNIT
Sbjct: 241 RRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRLTVLHLNIT 300

Query: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFR 353
           SQTAATMLYSFNLKIEDECKLESEEQIAATVN+IFSF+NNGRLVNEAK  F+
Sbjct: 301 SQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 352

BLAST of Csa3G116640.1 vs. NCBI nr
Match: gi|659074699|ref|XP_008437749.1| (PREDICTED: transcription factor bHLH70 isoform X2 [Cucumis melo])

HSP 1 Score: 552.4 bits (1422), Expect = 6.0e-154
Identity = 284/300 (94.67%), Positives = 290/300 (96.67%), Query Frame = 1

Query: 53  MLQSVESQSFKEPNFQSLLKLQHLTKPWEGGVNKIQELVQLFSSPINSETKDQNQPPKSD 112
           MLQSVESQSFKEPNFQSLLKLQHL KPWE GV+KIQELV+LFSSPINSETKDQNQPP SD
Sbjct: 1   MLQSVESQSFKEPNFQSLLKLQHLNKPWEEGVSKIQELVELFSSPINSETKDQNQPPNSD 60

Query: 113 RVFSECNQNQGISQTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQM 172
           RVFSECNQNQG+SQTQMTK PPVIKERRKRKRSKPTKNKEEVE QRMTHIAVERNRRRQM
Sbjct: 61  RVFSECNQNQGLSQTQMTKFPPVIKERRKRKRSKPTKNKEEVESQRMTHIAVERNRRRQM 120

Query: 173 NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQ 232
           NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECK EQ
Sbjct: 121 NDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKDEQ 180

Query: 233 SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRL 292
           SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRC KRQ QLLKVIVALEDLRL
Sbjct: 181 SEVRVASNRRIGEGVCAELRSEVAEIEVTMIQTHVNLKIRCRKRQGQLLKVIVALEDLRL 240

Query: 293 TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNEIFSFINNGRLVNEAKENFR 352
           TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVN+IFSF+NNGRLVNEAK  F+
Sbjct: 241 TVLHLNITSQTAATMLYSFNLKIEDECKLESEEQIAATVNQIFSFMNNGRLVNEAKGKFQ 300

BLAST of Csa3G116640.1 vs. NCBI nr
Match: gi|359489477|ref|XP_002267819.2| (PREDICTED: transcription factor bHLH67 isoform X1 [Vitis vinifera])

HSP 1 Score: 354.8 bits (909), Expect = 1.8e-94
Identity = 212/422 (50.24%), Positives = 266/422 (63.03%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSET-GCSEQEFTNLGF----------EESEEVCLLTSSLEDKIP 60
           MERLQGP+NPCF+GE+ +  G        LG+           E E+    T SLED++P
Sbjct: 1   MERLQGPMNPCFFGEHLDVEGLEHGSSLELGYGPLLSTESQRSEEEQALFSTPSLEDRMP 60

Query: 61  FLQMLQSVESQSFK---EPNFQSLLKLQHLTKPWEGGVNKIQEL---------------- 120
           FLQMLQSVES  F    EPNFQ+LL+LQH  KPW  G+  + EL                
Sbjct: 61  FLQMLQSVESPPFSPFTEPNFQALLRLQHQKKPW--GMTHLTELDSRIQARELESCITHD 120

Query: 121 VQLFSSPINSETKD----QNQPPKSDRVFSECNQNQGIS--------------------- 180
           +    SP+ SETK+    Q+  P  +   SECNQ+Q  S                     
Sbjct: 121 ILEMHSPVKSETKEPQHHQHSTPCLEGTSSECNQDQPNSAENGCLDTNSGSSPAWVQPQT 180

Query: 181 ---QTQMTKAPPVIKERRKRKRSKPTKNKEEVECQRMTHIAVERNRRRQMNDHLNVIKSL 240
              Q  ++K+PP+ +ERRKRKR++PTKNKEEVE QRMTHIAVERNRRRQMNDHLN ++SL
Sbjct: 181 RQKQAHLSKSPPITRERRKRKRTRPTKNKEEVESQRMTHIAVERNRRRQMNDHLNALRSL 240

Query: 241 IPTSYVQRGDQASIIGGAIDFVKELEQLLESLEALRKERKGAEGECKGEQSEVRVASNRR 300
           +PTSY+QRGDQASIIGGAIDFVKELEQLLESL+A ++ R+  EG   G+ S    +S+ +
Sbjct: 241 MPTSYIQRGDQASIIGGAIDFVKELEQLLESLQAQKRMRRSEEG---GDASTNSSSSSPK 300

Query: 301 I-GEGVC------------------------AELRSEVAEIEVTMIQTHVNLKIRCPKRQ 340
           I  +G+C                        A+ +S  A+IEVT+IQTHVNLKI+CP+R 
Sbjct: 301 IASKGLCTQHRFAPDESNSAEGGRSDEFTFTADNKSAAADIEVTVIQTHVNLKIQCPRRP 360

BLAST of Csa3G116640.1 vs. NCBI nr
Match: gi|734329686|gb|KHN06372.1| (Transcription factor bHLH70 [Glycine soja])

HSP 1 Score: 351.3 bits (900), Expect = 2.0e-93
Identity = 209/386 (54.15%), Positives = 256/386 (66.32%), Query Frame = 1

Query: 1   MERLQGPINPCFYGEYSETGCSEQEFTN---LGFEESEEVCLLTSSLEDKIPFLQMLQSV 60
           MERLQGP+N CF+G+  E  C +Q   +   L  EE E+   L SSLED +PFLQMLQSV
Sbjct: 1   MERLQGPLNSCFFGDPLEVNCLDQVLVDEESLRLEEEEQ--FLISSLEDNMPFLQMLQSV 60

Query: 61  ESQSF---KEPNFQSLLKLQHLTKPWEGGVN------KIQELVQLFS----------SPI 120
           ES  F   KEPNFQ+LL+LQH+ KPWEG         ++Q  ++L S          SP+
Sbjct: 61  ESPQFFPLKEPNFQTLLRLQHMKKPWEGIAYIPRMEAQVQAALELESCVTHDMLEMQSPV 120

Query: 121 NSETKDQNQPPKS---DRVFSECNQN-QGISQTQMTKAPPVIKERRKRKRSKPTKNKEEV 180
            SE+ +   P      ++V  ECNQ  Q +SQT     P   KERRKRKR++P+KNKE+V
Sbjct: 121 KSESNELQHPLSISCFEKVNYECNQEPQKVSQTCPKSQPTATKERRKRKRTRPSKNKEDV 180

Query: 181 ECQRMTHIAVERNRRRQMNDHLNVIKSLIPTSYVQRGDQASIIGGAIDFVKELEQLLESL 240
           E QRMTHIAVERNRRRQMNDHL+V++SL+P SY+QRGDQASIIGGAIDFVKELEQLL+SL
Sbjct: 181 ENQRMTHIAVERNRRRQMNDHLSVLRSLMPPSYIQRGDQASIIGGAIDFVKELEQLLQSL 240

Query: 241 EALRKERKGAEGE----------CK-----------GEQSEVRVASNRRIGEGVCAELRS 300
           EA ++ RK  EG           CK           G       +     G+ V AE +S
Sbjct: 241 EAQKRMRKNEEGGGGSSSSSTMLCKPPPPSSLSSPHGYGMRSSTSDEVNCGDEVKAENKS 300

Query: 301 EVAEIEVTMIQTHVNLKIRCPKRQDQLLKVIVALEDLRLTVLHLNITSQTAATMLYSFNL 340
           E A+I+VT+IQTHVNLKI C +R  QLLKVIVALEDLRLT+LHLNITS +  ++LYS NL
Sbjct: 301 EAADIKVTLIQTHVNLKIECQRRPGQLLKVIVALEDLRLTILHLNITS-SETSVLYSLNL 360
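Each HSP header gives identity and positive counts as a fraction of the alignment length, followed by a percentage. As a rough sanity check, the percentages can be recomputed from the fractions. The helper below is a hypothetical sketch, not part of any BLAST tool; its regular expression is written loosely so that minor spelling variants of the second label still match.

```python
import re

# Loose pattern for the "Identity = a/b (p%), ... = c/d (q%)" line of an HSP header.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line: str) -> dict:
    """Recompute both percentages and compare them with the reported values."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident_calc = f"{100 * int(ident) / int(ident_len):.2f}"
    pos_calc = f"{100 * int(pos) / int(pos_len):.2f}"
    return {
        "identity_pct": ident_calc,
        "positive_pct": pos_calc,
        "consistent": ident_calc == ident_pct and pos_calc == pos_pct,
    }


# Statistics line of the Glycine soja HSP above.
print(check_hsp_stats("Identity = 209/386 (54.15%), Positives = 256/386 (66.32%), Query Frame = 1"))
```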

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
BH067_ARATH   3.7e-71   43.90     Transcription factor bHLH67 OS=Arabidopsis thaliana GN=BHLH67 PE=2 SV=1
BH070_ARATH   1.6e-69   47.88     Transcription factor bHLH70 OS=Arabidopsis thaliana GN=BHLH70 PE=2 SV=1
BH057_ARATH   3.0e-57   44.55     Transcription factor bHLH57 OS=Arabidopsis thaliana GN=BHLH57 PE=2 SV=1
BH094_ARATH   5.6e-43   44.50     Transcription factor bHLH94 OS=Arabidopsis thaliana GN=BHLH94 PE=2 SV=2
FAMA_ARATH    5.6e-43   46.67     Transcription factor FAMA OS=Arabidopsis thaliana GN=FAMA PE=1 SV=1
Match Name         E-value    Identity  Description
A0A0A0L6W2_CUCSA   1.7e-200   100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_3G116640 PE=4 SV=1
A0A0B2PFM9_GLYSO   1.4e-93    54.15     Transcription factor bHLH70 OS=Glycine soja GN=glysoja_021210 PE=4 SV=1
I1KHN8_SOYBN       2.4e-93    54.03     Uncharacterized protein OS=Glycine max GN=GLYMA_07G049100 PE=4 SV=1
M5WZZ7_PRUPE       2.2e-91    49.07     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006041mg PE=4 SV=1
A0A061DS78_THECC   5.0e-91    53.59     Basic helix-loop-helix DNA-binding superfamily protein, putative OS=Theobroma ca...
Match Name    E-value   Identity  Description
AT3G61950.1   2.1e-72   43.90     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT2G46810.1   8.8e-71   47.88     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT4G01460.1   1.7e-58   44.55     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G22490.1   3.1e-44   44.50     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G24140.1   3.1e-44   46.67     basic helix-loop-helix (bHLH) DNA-binding superfamily protein
Match Name                         E-value    Identity  Description
gi|449432974|ref|XP_004134273.1|   2.5e-200   100.00    PREDICTED: transcription factor bHLH67 [Cucumis sativus]
gi|659074697|ref|XP_008437748.1|   3.8e-185   94.89     PREDICTED: transcription factor bHLH67 isoform X1 [Cucumis melo]
gi|659074699|ref|XP_008437749.1|   6.0e-154   94.67     PREDICTED: transcription factor bHLH70 isoform X2 [Cucumis melo]
gi|359489477|ref|XP_002267819.2|   1.8e-94    50.24     PREDICTED: transcription factor bHLH67 isoform X1 [Vitis vinifera]
gi|734329686|gb|KHN06372.1|        2.0e-93    54.15     Transcription factor bHLH70 [Glycine soja]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR011598   bHLH_dom
Vocabulary: Molecular Function
Term         Definition
GO:0046983   protein dimerization activity
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0046983 protein dimerization activity

This mRNA is a part of the following gene feature(s):

Feature Name   Unique Name   Type
Csa3G116640    Csa3G116640   gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name             Type
Csa3G116640.1   Csa3G116640.1-protein   polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
Csa3G116640.1.utr5p1   Csa3G116640.1.utr5p1   five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
Csa3G116640.1.cds1   Csa3G116640.1.cds1   CDS
Csa3G116640.1.cds2   Csa3G116640.1.cds2   CDS
Csa3G116640.1.cds3   Csa3G116640.1.cds3   CDS
Csa3G116640.1.cds4   Csa3G116640.1.cds4   CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
Csa3G116640.1.utr3p1   Csa3G116640.1.utr3p1   three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                                 Source   Source Term        Source Description                        Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10                                            coord: 157..218; score: 5.5
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PFAM     PF00010            HLH                                       coord: 157..208; score: 1.6
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  SMART    SM00353            finulus                                   coord: 162..213; score: 2.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PROFILE  PS50888            BHLH                                      coord: 156..207; score: 14
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459           HLH, helix-loop-helix DNA-binding domain  coord: 148..218; score: 2.09
None       No IPR available                                unknown  Coil               Coil                                      coord: 200..234; scor
None       No IPR available                                PANTHER  PTHR11969          MAX DIMERIZATION, MAD                     coord: 44..339; score: 7.5E
None       No IPR available                                PANTHER  PTHR11969:SF29     TRANSCRIPTION FACTOR BHLH57-RELATED       coord: 44..339; score: 7.5E
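
The coord ranges in the InterPro table can be turned into domain lengths for comparison across sources. This is a minimal sketch with a hypothetical helper name (span_length); it assumes the coordinates are 1-based, inclusive residue positions on the polypeptide, which the page itself does not state.

```python
import re


def span_length(coord: str) -> int:
    """Length in residues of a 'start..end' range, assuming inclusive coordinates."""
    m = re.fullmatch(r"(?:coord:\s*)?(\d+)\.\.(\d+)", coord.strip())
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {coord!r}")
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end precedes start")
    return end - start + 1  # inclusive on both ends (assumption)


# Ranges copied from the table above.
for source, coord in [("GENE3D", "coord: 157..218"),
                      ("PFAM", "coord: 157..208"),
                      ("PANTHER", "coord: 44..339")]:
    print(source, span_length(coord))
```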