CsGy3G044040 (gene) Cucumber (Gy14) v2

Name: CsGy3G044040
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: transcription factor bHLH71
Location: Chr3 : 40836626 .. 40842829 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

AAATAATATTAAGAATTTGGTAATGAAGTTGAGAGCATCTAAAACACAACTTTTGCTAAGTTGTCCCTATACTTATTTCTATATGGATCATTTGGTCCCCTCTCACACACATTTCTATCTCTCTCTCTTCTCTGTCCAAACTGCCCCAATGTTTTGGCACTTTCTATGATCATACCAATGACTAAAAAGAAAATACTATATATCAAAGTAATATATAGTTTCTTACTTTTTTGTTTACATGAAATAAATATAAACAAAATGAAAAGAAAAATGGGGATAAATAGACAATAGAGATATCAATCCAATTCTGCCCCTCTCTCACGTGCAACCCTTCTATATACCTCTTTCCTTTTACATTACCCTTTTCACAATTTTACATTTCTTTCAATCTTCTTCTTCTCATTCATCCTCTTCTCTACATATTACAACATCCAAAGCCTAAGATCCTCAATAACCTTAAAGAAAAAGAGGAAAACATGAGTCTTGATGCTCTTTCTTCCAATGACTTGTTCAACTTCATCATCTACGACACGATCTCCGCAACGCCCAACACCTCCAATAACAACGTCGTCCCTCACCATGACTCCTCCGAGAACACTTTCCTATCCGACGACAAGTGCTCGAAACCCAATTCGCGAAAACGCTGCACGAGTGAGGTAGAGATTAGCAACCGAGTGGTGGTGCCATTGTCGACGACGACGACGACGACCACCACCACCACTCAACATGGGAAGAAGAAAAGAAAGAGAAAAGCAAAGGTTTGTAAGAACAAAGAAGAAGCTGAAACACAAAGAATGACACACATTGCTGTTGAAAGGAACCGCCGTAAACAAATGAATGAACATCTTTCGGTTTTACGTTCACTCATGCCAGAATCCTATGTTCAAAGGGTTTGTTTTCTTCTATTATTTGTCTCTTTCACTTTTCTTTTTAAATTTTTTAAATATATAACTCTTTGTTAAACTAATGCATGAGATTTGAATTAAACTTCATCATTATGTTATCATTAAATTCATAACCATTCTAACATCTTGAGGAGGGAAAAACAAAACTACTTATTCTGTTCATATTTTCGTTATTAATCCTCATTTATTTATCCTAAAAATAGAATCCAATTATTGTGATTTATTTTAAAGTTGAATTATTCATTTTTGTGAAAGGTTTGATTTTTTTCACACCCTTTTGAAAGTAGCTAGAACCCATCACTATTATCAAACATAAAAACCAAAAAAACTATATTAATTATTGCAAAAAAAATTGACCTCTCCTTCTAACCCCATTCATTATTGCATGGCCTTACAAAGAACTTAATTAAGGGTACTATATTATATATAATTAATAGCAAAGCATTTAATTAAACAATAATCATCTTGGTTAATAACAAAACAAAAGGCCAAAATATATATTAAGAAAGAAAAATAGGGTGCTTGGAATTGACAGATTCCATAGGAATCAGAGTTTGTAAAAGTCCTAAAATCATGATAGTTTCCAATGCCTAGATGCTATATTTTTCTTCTACACATAGCTTATTATATATTTATTATCATTATTTTAACCTCTTGGATAAATTTTTTTTTAATAACAATAATATTAAAATATAAGTTTTGAAACTTTTATTTATTTATTTAAATTGTGTCAGTCTAATCTTTAGGCTTAAAAAAAGGTTTCATTTTAAATAATATCCTCCTAGCATGCAATATCAATATGAAGCTAATGATGATGGAAACACAATCAAACTTAAATTAAGTAACAAAACTAAAAAGTTTAAAATAAAATACCATTTTGATCCTAATCATTCCATTGAATCATAAATTTAATTTAATGTTTCCTACTTTTAATATACTGTTGACTTTTATAAGCTTTTTTATTATACAATAATAATGTACTTTCTTCACTATCATTAATATTTAACCAAATATTTAACCAAATTCGACAAACGTAAGAGTACATGAACTAAAATAGAATTAGAACGGTATCAATCTAGAACGAGTGGTTTTTCT
TTTTTCATTTTGAAAGCTTAATATCACTATTTAATATAGTAGTCTCTTGTGTGTTTAACATTATATGAAGGGTGACCAAGCCTCCATAGTAGGTGGTGCCGTTGAGTTTGTGAAGGAATTGGAGCATCTTCTGTCAACTTTAGAAGCCAAGAAGCTACAAATCTTACAACAAGAAGTAGATCAACATCAAGAACAAGAGATGAATGAAGATTCAAGGATAAGAAAAAATGATAATAATGATAATAACAATAAGTTGTTTTCATTTGCAAGTCTATTGATGAATAATTCAGATCAAAATAACTATTCATCACAATATTCAACAAAATACACATCAAAATCCAAAGCCTCTTCAGCTGACATTGAAGTCACTTTGATTGAAACTCATGCAAATCTTAGAATCCTCTCAACAAGAAGCCATAGACAACTCTTGAAGCTCATTGCTGGTTTGCAAGCTCTTCGTCTCACTATTCTTCATCTAAATCTCACTGATTTTCATCCATTGGTTCTTTACTCCATTAGCCTCAAGGTATGTCCTCTCTAATTATCATTTTATAGTCCTTAGTTAACTTTTAAGATGTCAATAGAACTTCCGTCGCTAGTTCTTCTTATTATTCTGTCACCTACTATCTATTTAATCGCTTCTGTATTTGAACAAATACTTTTCAAAATGTGTTAGTGTATTTGCAGATACTCGCGACATTTATTTAAAACAAACGCATCCAAAGTTCAGTGTTAATGGTTCAGGAGGAAGTAAAGAAAGAGTTATAGTCCTTCAAGTTTCCTTCTAAAATCTTCTTCTAGCCCCTTTTACAAAATCTCATGTCTTATCTTTTCTTGGCAATAAGTCAACTTTGGGTCCAGGACGTCATGAATGGGCTATAAGTGTATCAATCTAGTTGAGATACTACCTTCAATTGATCCCTCTCTTTAGACGTTCTTGCATAATTAACATGCAATTTTGGCCCCTAAACTTTAGTAACTTTGGACATGTTAGTAAATTTATTGAAAGTCGTAGTTATAAAAATAGAACATAAGTTTGAAAGTTGAAAGGGGACTAAAACAGAAATTTGGGACGAGAATGAAATTTGAATATATAATACATGACTTTATGACATTTTGAATGAAATGGGGAATTACATAATCTACTTATAATAATGCAGGTTGAAGAAGGATGCCAACTAAGATCAGTAGATGACATAGCAGCAGCAGCACATCATATGGTAAGAATAATTGAGGAAGAAGCCGTTTTATGTTAAAATGTAAACAATTCAAGAAAGAAAAAGAAAAAAAGCTAAGAGGTTTGAAATATGGAATCAAGCATTAAAATGCCAAGTTCTCTTCTCTCTGAGTGTGTGCATGTGGCTTGGTTTGATGCAATTTGTAAATGAATGTGTTTTTATGTAAGTTGATATTGTTTTGTAGCTATCAAAGGTTTTGTCTTTGTATGCCTTTAGTTTCTTTTTTTTCTTGAGACCACAGAAAAGGGAGAGAGGGACAGAGATGATAAGATAAAGTTGTCAAAGGGATTGGCCTTGGTTTCATCTCTCTTTCTTCCAATGCATTCATAAATTTCCCTCTCCCCTCCCTTCCTTTTTCATACAACTTTTCACCTCCCAACTTACTTAGGGTTTTCTTTTTTTTTTTGGGTAATTTTGTTTATGTTTTTCATTGGGTGACCCACTAACCCAAATATTGAGTTGATTTCTTTTTTAGAATTCAAGGTAAAAAGTCAATGTCCAAAGTAAATAAAGCAATTGTAAAGGAAGGATTTGTTTACTTAACAATGAAGTGAATACTGGTAAAATGCAAATACGGTAATTTAAAACATTTGCAATCACACTCCCTACAAAGTACTATTTAATTTGTGCTATATGAACACTGCAAATAGTCCTTCCTAATTCAATAATCGTAACGACACAACATTGGTTATAATGTAACTTAAGCCATCTAGTCATTAGGTACATCATATACTATTCTAGATGTACTTTTTATTAAGTGGT
CACTTCCAATGTCTAATATGACCAGTACTTATTCAATAACTCTTACTTGCGACTTAATCAGGCTATTGAATTGAGAAACCTTCTAGTCTCACTCACACAGCTGGTTTTGTACTAGTATCATCATATCATAACTTTCTAGTCTTTTCTAAACAAGAAAGGGTTCAAGAGGTAAATATTTCAATTAAATTAACACATTCTTTAATACCCAATAGTATACCCTCGTCACACTCCCAAAAACAGATTTATTGAAATGCAAAAGAAAAAAAAAAAGTACCTTGATTGATTGATCCTATCATAACTTTGTTGTCTTAATTGATCAACAATGGGTCAGTATCAGTCCGTTTTTAGTAAAAGAAAATGTGAACCAACCGACTGGGCCGATTATTCTTTACAACTAAAAACCAACCCCGACTGATCAAAGTTAGTTCAGCTGATTTTGGTAGGTCCGATCGGTTTTCAATCTATCTGTGATCACCCCAGTTGATAATATATAATTTTAAAAATCACAACCATGATACTAACTTTTTGAGTTCTTAGCTTTTTCCAAAACTTATTTAGACAGTAATATGAAAAATAGAGAACAAGCTAGGTGTATTCATTTTGAAATAAATCTTGATTCTTGAATATGATGATGAGAATCATGAGAGGAGCAATCCTTTTCCCTCTCATATGGAGCCACAGTGTGCTACAATTATTGGGAAAAAGAAATGACATGCATATATACTTTTTAGCATTGAACAGAGAAAAAGCTGAAGCTAAAAGGGTTTATTTAAATTGTGCCATTCGATGCACAATTTTGTTTTATTATTGCTACTTTCTTCTCATTATCATCCTTTCTTATTTTGTTTCTTTATACACACCATTCTGTATTCTAATTAATTCTCCTGTCTTCTTCTCTACCAATATTATCCCTCAAGTCAGTTGTAATTTACTTGGCAAGTTCAGAGAGAGAAATAGAGAGATATTGTAATAATCTTCATAGAAGCAATGCAAAAGCAGTCAAATAATAGATGGAAGAAGAAACATACTGATGCATCAGACATATTTGTGACAGTTGAATGATAAAGCAAAAGCCATGGGTGATCTTTTCTTTCACTAAAACCAATAACCAACCATAAAGGGAAATCATTGAAAGCAGCTAAAGAACTCACATGGTATATGTTCATTTCGTAAAATCCTCTGTTGTGGTTTTTTTTTCTCATCAAAACTGCAAACCATTCTGATTCTGTGTTGTAAAATTTGGGACAGGGGCAATTCCCCTGTGTTCTCTCCCATGCTTGGTGAGAGAAAGTCTTCCACTTGAAAATGATCTCTTCATTGAAACAGGACTCATCACACAATGCAAGATTCTTTTTCTATGACTTGACCTACTGTCTTCTCCTACAGAATGTTTAGATGATCCGACATCACCTGAACTTCCTGCTTCATCAATACCTCCAAAATCATCTTCATTTAGACGTAGAACATCGGCAATAATAATATGGTTTTGGCTGCAATGATCCATTGAAATTGACCTTCTAATTTGCTGCAATCTTCCATCTCTAAGCTCAATGATAGTATCTCTTTTCTCCAGATTTCCAAGGTCGCTGAAAGCGCGAACCGAGTTCTTTCCATCCTCATTGCTAAAACTTTGTCCAGAATCTAGTGAAACAGCCTCATAGTCATTATCCTGAGAAGTCTCTGGCACGGTCTCGTTAGTTATGGGATATGCAGTGCTTGCAGGGAGTTGAACTGGAGAACCAGCATTGATAGAGATGATATTAGCACGACATAAGGGACAATTGGAGTGTGATTTGAGCCATGTATCAATGCATTGAAGATGAAAAGCATGGCTGCATTTAGGCAACAATCGAAGGCTTTCATCTTCTTGAAACTCACTAAGGCAAACGGAGCAATCAGAGCCTTCAACCAACCCATCTTCTCTCTTGTACTTGCAAACAGTAATGGACTTGATAAGTGCTTCATCCAAGCCAGTTGTGGCAACATGCCATGGCTCATGA
AATGTTGGATTGTGATTATCTTCATATTCCTCACTTGGGTCATGATTTCCGGTTCCAGACAAACGATTTGTGTTGCCACAATACTTTGAAATAATGGTGTAATAGGTTACAAGAAGGAAAGCACTGGCCAGAATTCCAATAATTGCAATAACAAGAGGTGAAAAATGTGGGCCTGAATTATCTGGAAACTCCAAGGGTGGTGGA

mRNA sequence

AAATAATATTAAGAATTTGGTAATGAAGTTGAGAGCATCTAAAACACAACTTTTGCTAAGTTGTCCCTATACTTATTTCTATATGGATCATTTGGTCCCCTCTCACACACATTTCTATCTCTCTCTCTTCTCTGTCCAAACTGCCCCAATGTTTTGGCACTTTCTATGATCATACCAATGACTAAAAAGAAAATACTATATATCAAAGTAATATATAGTTTCTTACTTTTTTGTTTACATGAAATAAATATAAACAAAATGAAAAGAAAAATGGGGATAAATAGACAATAGAGATATCAATCCAATTCTGCCCCTCTCTCACGTGCAACCCTTCTATATACCTCTTTCCTTTTACATTACCCTTTTCACAATTTTACATTTCTTTCAATCTTCTTCTTCTCATTCATCCTCTTCTCTACATATTACAACATCCAAAGCCTAAGATCCTCAATAACCTTAAAGAAAAAGAGGAAAACATGAGTCTTGATGCTCTTTCTTCCAATGACTTGTTCAACTTCATCATCTACGACACGATCTCCGCAACGCCCAACACCTCCAATAACAACGTCGTCCCTCACCATGACTCCTCCGAGAACACTTTCCTATCCGACGACAAGTGCTCGAAACCCAATTCGCGAAAACGCTGCACGAGTGAGGTAGAGATTAGCAACCGAGTGGTGGTGCCATTGTCGACGACGACGACGACGACCACCACCACCACTCAACATGGGAAGAAGAAAAGAAAGAGAAAAGCAAAGGTTTGTAAGAACAAAGAAGAAGCTGAAACACAAAGAATGACACACATTGCTGTTGAAAGGAACCGCCGTAAACAAATGAATGAACATCTTTCGGTTTTACGTTCACTCATGCCAGAATCCTATGTTCAAAGGGGTGACCAAGCCTCCATAGTAGGTGGTGCCGTTGAGTTTGTGAAGGAATTGGAGCATCTTCTGTCAACTTTAGAAGCCAAGAAGCTACAAATCTTACAACAAGAAGTAGATCAACATCAAGAACAAGAGATGAATGAAGATTCAAGGATAAGAAAAAATGATAATAATGATAATAACAATAAGTTGTTTTCATTTGCAAGTCTATTGATGAATAATTCAGATCAAAATAACTATTCATCACAATATTCAACAAAATACACATCAAAATCCAAAGCCTCTTCAGCTGACATTGAAGTCACTTTGATTGAAACTCATGCAAATCTTAGAATCCTCTCAACAAGAAGCCATAGACAACTCTTGAAGCTCATTGCTGGTTTGCAAGCTCTTCGTCTCACTATTCTTCATCTAAATCTCACTGATTTTCATCCATTGGTTCTTTACTCCATTAGCCTCAAGGTTGAAGAAGGATGCCAACTAAGATCAGTAGATGACATAGCAGCAGCAGCACATCATATGACCACAGAAAAGGGAGAGAGGGACAGAGATGATAAGATAAAGTTGTCAAAGGGATTGGCCTTGATTTCCAAGGTCGCTGAAAGCGCGAACCGAATGAAAAGCATGGCTGCATTTAGGCAACAATCGAAGGCTTTCATCTTCTTGAAACTCACTAAGGCAAACGGAGCAATCAGAGCCTTCAACCAACCCATCTTCTCTCTTGTACTTGCAAACAGTAATGGACTTGATAAGTGCTTCATCCAAGCCAGTTGTGGCAACATGCCATGGCTCATGAAATGTTGGATTGTTACAAGAAGGAAAGCACTGGCCAGAATTCCAATAATTGCAATAACAAGAGGTGAAAAATGTGGGCCTGAATTATCTGGAAACTCCAAGGGTGGTGGA

Coding sequence (CDS)

ATGAGTCTTGATGCTCTTTCTTCCAATGACTTGTTCAACTTCATCATCTACGACACGATCTCCGCAACGCCCAACACCTCCAATAACAACGTCGTCCCTCACCATGACTCCTCCGAGAACACTTTCCTATCCGACGACAAGTGCTCGAAACCCAATTCGCGAAAACGCTGCACGAGTGAGGTAGAGATTAGCAACCGAGTGGTGGTGCCATTGTCGACGACGACGACGACGACCACCACCACCACTCAACATGGGAAGAAGAAAAGAAAGAGAAAAGCAAAGGTTTGTAAGAACAAAGAAGAAGCTGAAACACAAAGAATGACACACATTGCTGTTGAAAGGAACCGCCGTAAACAAATGAATGAACATCTTTCGGTTTTACGTTCACTCATGCCAGAATCCTATGTTCAAAGGGGTGACCAAGCCTCCATAGTAGGTGGTGCCGTTGAGTTTGTGAAGGAATTGGAGCATCTTCTGTCAACTTTAGAAGCCAAGAAGCTACAAATCTTACAACAAGAAGTAGATCAACATCAAGAACAAGAGATGAATGAAGATTCAAGGATAAGAAAAAATGATAATAATGATAATAACAATAAGTTGTTTTCATTTGCAAGTCTATTGATGAATAATTCAGATCAAAATAACTATTCATCACAATATTCAACAAAATACACATCAAAATCCAAAGCCTCTTCAGCTGACATTGAAGTCACTTTGATTGAAACTCATGCAAATCTTAGAATCCTCTCAACAAGAAGCCATAGACAACTCTTGAAGCTCATTGCTGGTTTGCAAGCTCTTCGTCTCACTATTCTTCATCTAAATCTCACTGATTTTCATCCATTGGTTCTTTACTCCATTAGCCTCAAGGTTGAAGAAGGATGCCAACTAAGATCAGTAGATGACATAGCAGCAGCAGCACATCATATGACCACAGAAAAGGGAGAGAGGGACAGAGATGATAAGATAAAGTTGTCAAAGGGATTGGCCTTGATTTCCAAGGTCGCTGAAAGCGCGAACCGAATGAAAAGCATGGCTGCATTTAGGCAACAATCGAAGGCTTTCATCTTCTTGAAACTCACTAAGGCAAACGGAGCAATCAGAGCCTTCAACCAACCCATCTTCTCTCTTGTACTTGCAAACAGTAATGGACTTGATAAGTGCTTCATCCAAGCCAGTTGTGGCAACATGCCATGGCTCATGAAATGTTGGATTGTTACAAGAAGGAAAGCACTGGCCAGAATTCCAATAATTGCAATAACAAGAGGTGAAAAATGTGGGCCTGAATTATCTGGAAACTCCAAGGGTGGTGGA

Protein sequence

MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSEVEISNRVVVPLSTTTTTTTTTTQHGKKKRKRKAKVCKNKEEAETQRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEQEMNEDSRIRKNDNNDNNNKLFSFASLLMNNSDQNNYSSQYSTKYTSKSKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRSVDDIAAAAHHMTTEKGERDRDDKIKLSKGLALISKVAESANRMKSMAAFRQQSKAFIFLKLTKANGAIRAFNQPIFSLVLANSNGLDKCFIQASCGNMPWLMKCWIVTRRKALARIPIIAITRGEKCGPELSGNSKGGG
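
The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below is an illustration only and not part of the database record: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical. It translates only the first 30 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, reading frame 0, codon by codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the CDS shown in this record.
CDS_PREFIX = "ATGAGTCTTGATGCTCTTTCTTCCAATGAC"
print(translate(CDS_PREFIX))  # MSLDALSSND
```

The result matches the first ten residues of the protein sequence above. Ambiguous bases such as N are not handled in this sketch and would raise a KeyError.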
BLAST of CsGy3G044040 vs. NCBI nr
Match: XP_004136191.1 (PREDICTED: transcription factor bHLH71 [Cucumis sativus] >KGN60373.1 hypothetical protein Csa_3G901180 [Cucumis sativus])

HSP 1 Score: 423.7 bits (1088), Expect = 7.6e-115
Identity = 310/310 (100.00%), Positives = 310/310 (100.00%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE 60
           MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE
Sbjct: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE 60

Query: 61  VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM 120
           VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM
Sbjct: 61  VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM 120

Query: 121 NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX 180
           NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX
Sbjct: 121 NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI 240
           XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI 240

Query: 241 ETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRSV 300
           ETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRSV
Sbjct: 241 ETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRSV 300

Query: 301 DDIAAAAHHM 311
           DDIAAAAHHM
Sbjct: 301 DDIAAAAHHM 310
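
The percentage on each Identity line appears to be the number of identical positions divided by the alignment length. A minimal sketch under that assumption follows; the function name `percent` is hypothetical, and the counts are copied from the Identity lines of hits in this report.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return f"{100.0 * matches / alignment_length:.2f}"


# (matches, alignment length) pairs taken from Identity lines in this report.
for matches, length in [(310, 310), (279, 311), (219, 323)]:
    print(f"{matches}/{length} = {percent(matches, length)}%")
```

The three computed values agree with the percentages printed for the Cucumis sativus, Cucumis melo and Cucurbita pepo hits.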

BLAST of CsGy3G044040 vs. NCBI nr
Match: XP_008465984.1 (PREDICTED: transcription factor bHLH71 [Cucumis melo])

HSP 1 Score: 395.6 bits (1015), Expect = 2.2e-106
Identity = 279/311 (89.71%), Positives = 279/311 (89.71%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPH-HDSSENTFLSDDKCSKPNSRKRCTS 60
           MSLDALSSNDLFNFIIYDTISATP    NN VPH HDSSENTFLSDDKCSKPNSRKRC S
Sbjct: 1   MSLDALSSNDLFNFIIYDTISATP----NNFVPHGHDSSENTFLSDDKCSKPNSRKRCPS 60

Query: 61  EVEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQ 120
           EVEISNRVV   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX RMTHIAVERNRRKQ
Sbjct: 61  EVEISNRVV--VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRMTHIAVERNRRKQ 120

Query: 121 MNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQE 180
           MNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQE
Sbjct: 121 MNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQE 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTL 240
                                FSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTL
Sbjct: 181 QEMNEDSRIRKNDNSDNNNKLFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTL 240

Query: 241 IETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRS 300
           IETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRS
Sbjct: 241 IETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRS 300

Query: 301 VDDIAAAAHHM 311
           VDDIAAAAHHM
Sbjct: 301 VDDIAAAAHHM 305

BLAST of CsGy3G044040 vs. NCBI nr
Match: XP_023554782.1 (transcription factor bHLH71-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 327.8 bits (839), Expect = 5.7e-86
Identity = 219/323 (67.80%), Positives = 223/323 (69.04%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCS----------- 60
           MSLDALSSND FNFIIYDTISA PNTS       HDSSENTF SDD CS           
Sbjct: 62  MSLDALSSNDFFNFIIYDTISAIPNTS----PVLHDSSENTFFSDDTCSRHQDHQHGLEG 121

Query: 61  --KPNSRKRCTSEVEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRM 120
             KPN RKR  SE E                       XXXXXXXXX          QRM
Sbjct: 122 SLKPN-RKRRPSESE-------------SLRTASTRQGXXXXXXXXXVCKNKEEAETQRM 181

Query: 121 THIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKL 180
           THIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKL
Sbjct: 182 THIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKL 241

Query: 181 QILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXX 240
           Q+LQ+EVD HQE                     FSF SLLMNNSD               
Sbjct: 242 QVLQKEVD-HQE-------------QEINDNKLFSFGSLLMNNSDH-NYCSQYSTKYTSK 301

Query: 241 XKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSI 300
            KASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHL+LTDFHPLVLYSI
Sbjct: 302 SKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLSLTDFHPLVLYSI 351

Query: 301 SLKVEEGCQLRSVDDIAAAAHHM 311
           SLKVEEGCQLRSVDDIAAAAHH+
Sbjct: 362 SLKVEEGCQLRSVDDIAAAAHHL 351

BLAST of CsGy3G044040 vs. NCBI nr
Match: XP_022963757.1 (transcription factor bHLH71-like [Cucurbita moschata])

HSP 1 Score: 327.4 bits (838), Expect = 7.4e-86
Identity = 207/322 (64.29%), Positives = 213/322 (66.15%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPN-------- 60
           MSLDALSSND FNFIIYDTISA PNTS       HDSSENTF SDD CS+          
Sbjct: 1   MSLDALSSNDFFNFIIYDTISAIPNTS----PVLHDSSENTFFSDDTCSRHQDHQHGLEG 60

Query: 61  ----SRKRCTSEVEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMT 120
                RKR  SE+E                                          QRMT
Sbjct: 61  SLKPDRKRRPSELE-------------SLRTASTRQGKKKRKRKPRVCKNKEEAETQRMT 120

Query: 121 HIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQ 180
           HIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQ
Sbjct: 121 HIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQ 180

Query: 181 ILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXX 240
           +LQ+EVD HQE                     FSF SLLMNNSD                
Sbjct: 181 VLQKEVD-HQE-------------QEINDNKLFSFGSLLMNNSDH-NYCSQYSTKYTSKS 240

Query: 241 KASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSIS 300
           KASSADIEVT+IETHANLRILSTRSHRQLLKLIAGLQALRLTILHL+LTDFHPLVLYSIS
Sbjct: 241 KASSADIEVTMIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLSLTDFHPLVLYSIS 290

Query: 301 LKVEEGCQLRSVDDIAAAAHHM 311
           LKVEEGCQLRSVDDIAAAAHHM
Sbjct: 301 LKVEEGCQLRSVDDIAAAAHHM 290

BLAST of CsGy3G044040 vs. NCBI nr
Match: XP_022967282.1 (transcription factor bHLH71-like [Cucurbita maxima])

HSP 1 Score: 327.4 bits (838), Expect = 7.4e-86
Identity = 228/323 (70.59%), Positives = 233/323 (72.14%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCS----------- 60
           MSLDALSSND FNFIIYDTISA PNTS    +  HDSSENTF SDD CS           
Sbjct: 62  MSLDALSSNDFFNFIIYDTISAIPNTS----LVFHDSSENTFFSDDTCSRHQDHQHGFEG 121

Query: 61  --KPNSRKRCTSEVEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRM 120
             KPN RKR  SE E                       XXXXXXXXXXXXXXXXXXX RM
Sbjct: 122 CLKPN-RKRRPSESE-------------SLRTAPTRQGXXXXXXXXXXXXXXXXXXXXRM 181

Query: 121 THIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKL 180
           THIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKL
Sbjct: 182 THIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKL 241

Query: 181 QILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXX 240
           Q+LQ+EVD HQE                     FSF SLLMNNSD               
Sbjct: 242 QVLQKEVD-HQE-------------QEINDNKLFSFGSLLMNNSDH-NYCSQYSTKYTSK 301

Query: 241 XKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSI 300
            KASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHL+LTDFHP+VLYSI
Sbjct: 302 SKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLSLTDFHPVVLYSI 351

Query: 301 SLKVEEGCQLRSVDDIAAAAHHM 311
           SLKVEEGCQLRSVDDIAAAAHHM
Sbjct: 362 SLKVEEGCQLRSVDDIAAAAHHM 351

BLAST of CsGy3G044040 vs. TAIR10
Match: AT1G22490.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 174.5 bits (441), Expect = 1.5e-43
Identity = 101/202 (50.00%), Positives = 129/202 (63.86%), Query Frame = 0

Query: 105 QRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEA 164
           QRMTHIAVERNRRKQMNE+L+VLRSLMP SY QRGDQASIVGGA+ +VKELEH+L ++E 
Sbjct: 113 QRMTHIAVERNRRKQMNEYLAVLRSLMPSSYAQRGDQASIVGGAINYVKELEHILQSMEP 172

Query: 165 KKLQILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXX 224
           K+ +    + D+                        FSF      +S             
Sbjct: 173 KRTRTHDPKGDK-----------TSTSSLVGPFTDFFSFPQYSTKSSSD----------- 232

Query: 225 XXXXKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVL 284
                +S A+IEVT+ E+HAN++I++ +  RQLLKLI  LQ+LRLT+LHLN+T  H  +L
Sbjct: 233 VPESSSSPAEIEVTVAESHANIKIMTKKKPRQLLKLITSLQSLRLTLLHLNVTTLHNSIL 292

Query: 285 YSISLKVEEGCQLRSVDDIAAA 307
           YSIS++VEEG QL +VDDIA A
Sbjct: 293 YSISVRVEEGSQLNTVDDIATA 292

BLAST of CsGy3G044040 vs. TAIR10
Match: AT5G46690.1 (beta HLH protein 71)

HSP 1 Score: 164.9 bits (416), Expect = 1.2e-40
Identity = 152/321 (47.35%), Positives = 190/321 (59.19%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE 60
           M+L+ALSSN L NF++ +T+S TP  S  ++ P  ++  +  +S +  S+          
Sbjct: 1   MTLEALSSNGLLNFLLSETLSPTPFKSLVDLEPLPEN--DVIISKNTISE---------- 60

Query: 61  VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM 120
                  + XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   QRMTHIAVERNRR+QM
Sbjct: 61  -------ISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAENQRMTHIAVERNRRRQM 120

Query: 121 NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX 180
           N+HLSVLRSLMP+ +  +GDQASIVGGA++F+KELEH L +LEA+K          +Q  
Sbjct: 121 NQHLSVLRSLMPQPFAHKGDQASIVGGAIDFIKELEHKLLSLEAQK----HHNAKLNQSV 180

Query: 181 XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI 240
                                S +   +++ D                K    D+EVTLI
Sbjct: 181 TSSTSQDSNGEQENPHQPSSLSLSQFFLHSYD---PSQENRNGSTSSVKTPMEDLEVTLI 240

Query: 241 ETHANLRILS-----------TRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISL 300
           ETHAN+RILS           T    QL KL+A LQ+L L+ILHL++T      +YSIS 
Sbjct: 241 ETHANIRILSRRRGFRWSTLATTKPPQLSKLVASLQSLSLSILHLSVTTLDNYAIYSISA 295

Query: 301 KVEEGCQLRSVDDIAAAAHHM 311
           KVEE CQL SVDDIA A HHM
Sbjct: 301 KVEESCQLSSVDDIAGAVHHM 295

BLAST of CsGy3G044040 vs. TAIR10
Match: AT1G72210.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 159.8 bits (403), Expect = 3.7e-39
Identity = 97/206 (47.09%), Positives = 126/206 (61.17%), Query Frame = 0

Query: 105 QRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEA 164
           QRMTHIAVERNRRKQMNE+L+VLRSLMP  Y QRGDQASIVGGA+ ++KELEH L ++E 
Sbjct: 123 QRMTHIAVERNRRKQMNEYLAVLRSLMPPYYAQRGDQASIVGGAINYLKELEHHLQSMEP 182

Query: 165 KKLQILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXX 224
                 +     H +                      + AS     SD            
Sbjct: 183 PVKTATEDTGAGHDQTKT-------------------TSASSSGPFSDFFAFPQYSNRPT 242

Query: 225 XXXXKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVL 284
                   A+IEVT++E+HA+L+IL+ +  RQLLKL++ +Q+LRLT+LHLN+T     VL
Sbjct: 243 SAAAAEGMAEIEVTMVESHASLKILAKKRPRQLLKLVSSIQSLRLTLLHLNVTTRDDSVL 302

Query: 285 YSISLKVEEGCQLRSVDDIAAAAHHM 311
           YSIS+KVEEG QL +V+DIAAA + +
Sbjct: 303 YSISVKVEEGSQLNTVEDIAAAVNQI 309

BLAST of CsGy3G044040 vs. TAIR10
Match: AT2G46810.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 138.7 bits (348), Expect = 8.8e-33
Identity = 90/203 (44.33%), Positives = 119/203 (58.62%), Query Frame = 0

Query: 105 QRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEA 164
           QRMTHIAVERNRR+QMN HL+ LRS++P SY+QRGDQASIVGGA++FVK LE  L +LEA
Sbjct: 191 QRMTHIAVERNRRRQMNVHLNSLRSIIPSSYIQRGDQASIVGGAIDFVKILEQQLQSLEA 250

Query: 165 KKLQILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXX 224
           +K     Q+ D ++E                      S   L  +N ++           
Sbjct: 251 QK---RSQQSDDNKE-----------QIPEDNSLRNISSNKLRASNKEE----------- 310

Query: 225 XXXXKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTD-FHPLV 284
               ++S   IE T+IE+H NL+I  TR   QLL+ I  L+ LR T+LHLN+T   +  V
Sbjct: 311 ----QSSKLKIEATVIESHVNLKIQCTRKQGQLLRSIILLEKLRFTVLHLNITSPTNTSV 364

Query: 285 LYSISLKVEEGCQLRSVDDIAAA 307
            YS +LK+E+ C L S D+I AA
Sbjct: 371 SYSFNLKMEDECNLGSADEITAA 364

BLAST of CsGy3G044040 vs. TAIR10
Match: AT3G24140.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 133.7 bits (335), Expect = 2.8e-31
Identity = 99/202 (49.01%), Positives = 127/202 (62.87%), Query Frame = 0

Query: 105 QRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEA 164
           QRMTHIAVERNRRKQMNEHL VLRSLMP SYVQRGDQASI+GGA+EFV+ELE LL  LE+
Sbjct: 195 QRMTHIAVERNRRKQMNEHLRVLRSLMPGSYVQRGDQASIIGGAIEFVRELEQLLQCLES 254

Query: 165 KKLQILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXX 224
           +K + +  E  +   XXXXXXXXXXXXX             L++  +             
Sbjct: 255 QKRRRILGETGRDMTXXXXXXXXXXXXXAN-------QAQPLIITGNVTELEGGGGLREE 314

Query: 225 XXXXKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVL 284
               K+  AD+EV L+   A ++ILS R   QL+K IA L+ L L+ILH N+T     VL
Sbjct: 315 TAENKSCLADVEVKLLGFDAMIKILSRRRPGQLIKTIAALEDLHLSILHTNITTMEQTVL 374

Query: 285 YSISLKVEEGCQLRSVDDIAAA 307
           YS ++K+    +  + +DIA++
Sbjct: 375 YSFNVKITSETRF-TAEDIASS 388

BLAST of CsGy3G044040 vs. Swiss-Prot
Match: sp|Q9SK91|BH094_ARATH (Transcription factor bHLH94 OS=Arabidopsis thaliana OX=3702 GN=BHLH94 PE=1 SV=2)

HSP 1 Score: 174.5 bits (441), Expect = 2.6e-42
Identity = 101/202 (50.00%), Positives = 129/202 (63.86%), Query Frame = 0

Query: 105 QRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEA 164
           QRMTHIAVERNRRKQMNE+L+VLRSLMP SY QRGDQASIVGGA+ +VKELEH+L ++E 
Sbjct: 113 QRMTHIAVERNRRKQMNEYLAVLRSLMPSSYAQRGDQASIVGGAINYVKELEHILQSMEP 172

Query: 165 KKLQILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXX 224
           K+ +    + D+                        FSF      +S             
Sbjct: 173 KRTRTHDPKGDK-----------TSTSSLVGPFTDFFSFPQYSTKSSSD----------- 232

Query: 225 XXXXKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVL 284
                +S A+IEVT+ E+HAN++I++ +  RQLLKLI  LQ+LRLT+LHLN+T  H  +L
Sbjct: 233 VPESSSSPAEIEVTVAESHANIKIMTKKKPRQLLKLITSLQSLRLTLLHLNVTTLHNSIL 292

Query: 285 YSISLKVEEGCQLRSVDDIAAA 307
           YSIS++VEEG QL +VDDIA A
Sbjct: 293 YSISVRVEEGSQLNTVDDIATA 292

BLAST of CsGy3G044040 vs. Swiss-Prot
Match: sp|Q56XR0|BH071_ARATH (Transcription factor bHLH71 OS=Arabidopsis thaliana OX=3702 GN=BHLH71 PE=1 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 2.1e-39
Identity = 152/321 (47.35%), Positives = 190/321 (59.19%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE 60
           M+L+ALSSN L NF++ +T+S TP  S  ++ P  ++  +  +S +  S+          
Sbjct: 1   MTLEALSSNGLLNFLLSETLSPTPFKSLVDLEPLPEN--DVIISKNTISE---------- 60

Query: 61  VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM 120
                  + XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   QRMTHIAVERNRR+QM
Sbjct: 61  -------ISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAENQRMTHIAVERNRRRQM 120

Query: 121 NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX 180
           N+HLSVLRSLMP+ +  +GDQASIVGGA++F+KELEH L +LEA+K          +Q  
Sbjct: 121 NQHLSVLRSLMPQPFAHKGDQASIVGGAIDFIKELEHKLLSLEAQK----HHNAKLNQSV 180

Query: 181 XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI 240
                                S +   +++ D                K    D+EVTLI
Sbjct: 181 TSSTSQDSNGEQENPHQPSSLSLSQFFLHSYD---PSQENRNGSTSSVKTPMEDLEVTLI 240

Query: 241 ETHANLRILS-----------TRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISL 300
           ETHAN+RILS           T    QL KL+A LQ+L L+ILHL++T      +YSIS 
Sbjct: 241 ETHANIRILSRRRGFRWSTLATTKPPQLSKLVASLQSLSLSILHLSVTTLDNYAIYSISA 295

Query: 301 KVEEGCQLRSVDDIAAAAHHM 311
           KVEE CQL SVDDIA A HHM
Sbjct: 301 KVEESCQLSSVDDIAGAVHHM 295

BLAST of CsGy3G044040 vs. Swiss-Prot
Match: sp|Q9C7T4|BH096_ARATH (Transcription factor bHLH96 OS=Arabidopsis thaliana OX=3702 GN=BHLH96 PE=1 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 6.7e-38
Identity = 97/206 (47.09%), Positives = 126/206 (61.17%), Query Frame = 0

Query: 105 QRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEA 164
           QRMTHIAVERNRRKQMNE+L+VLRSLMP  Y QRGDQASIVGGA+ ++KELEH L ++E 
Sbjct: 123 QRMTHIAVERNRRKQMNEYLAVLRSLMPPYYAQRGDQASIVGGAINYLKELEHHLQSMEP 182

Query: 165 KKLQILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXX 224
                 +     H +                      + AS     SD            
Sbjct: 183 PVKTATEDTGAGHDQTKT-------------------TSASSSGPFSDFFAFPQYSNRPT 242

Query: 225 XXXXKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVL 284
                   A+IEVT++E+HA+L+IL+ +  RQLLKL++ +Q+LRLT+LHLN+T     VL
Sbjct: 243 SAAAAEGMAEIEVTMVESHASLKILAKKRPRQLLKLVSSIQSLRLTLLHLNVTTRDDSVL 302

Query: 285 YSISLKVEEGCQLRSVDDIAAAAHHM 311
           YSIS+KVEEG QL +V+DIAAA + +
Sbjct: 303 YSISVKVEEGSQLNTVEDIAAAVNQI 309

BLAST of CsGy3G044040 vs. Swiss-Prot
Match: sp|O81037|BH070_ARATH (Transcription factor bHLH70 OS=Arabidopsis thaliana OX=3702 GN=BHLH70 PE=2 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 1.6e-31
Identity = 90/203 (44.33%), Positives = 119/203 (58.62%), Query Frame = 0

Query: 105 QRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEA 164
           QRMTHIAVERNRR+QMN HL+ LRS++P SY+QRGDQASIVGGA++FVK LE  L +LEA
Sbjct: 191 QRMTHIAVERNRRRQMNVHLNSLRSIIPSSYIQRGDQASIVGGAIDFVKILEQQLQSLEA 250

Query: 165 KKLQILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXX 224
           +K     Q+ D ++E                      S   L  +N ++           
Sbjct: 251 QK---RSQQSDDNKE-----------QIPEDNSLRNISSNKLRASNKEE----------- 310

Query: 225 XXXXKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTD-FHPLV 284
               ++S   IE T+IE+H NL+I  TR   QLL+ I  L+ LR T+LHLN+T   +  V
Sbjct: 311 ----QSSKLKIEATVIESHVNLKIQCTRKQGQLLRSIILLEKLRFTVLHLNITSPTNTSV 364

Query: 285 LYSISLKVEEGCQLRSVDDIAAA 307
            YS +LK+E+ C L S D+I AA
Sbjct: 371 SYSFNLKMEDECNLGSADEITAA 364

BLAST of CsGy3G044040 vs. Swiss-Prot
Match: sp|Q56YJ8|FAMA_ARATH (Transcription factor FAMA OS=Arabidopsis thaliana OX=3702 GN=FAMA PE=1 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 5.1e-30
Identity = 99/202 (49.01%), Positives = 127/202 (62.87%), Query Frame = 0

Query: 105 QRMTHIAVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEA 164
           QRMTHIAVERNRRKQMNEHL VLRSLMP SYVQRGDQASI+GGA+EFV+ELE LL  LE+
Sbjct: 195 QRMTHIAVERNRRKQMNEHLRVLRSLMPGSYVQRGDQASIIGGAIEFVRELEQLLQCLES 254

Query: 165 KKLQILQQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXX 224
           +K + +  E  +   XXXXXXXXXXXXX             L++  +             
Sbjct: 255 QKRRRILGETGRDMTXXXXXXXXXXXXXAN-------QAQPLIITGNVTELEGGGGLREE 314

Query: 225 XXXXKASSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVL 284
               K+  AD+EV L+   A ++ILS R   QL+K IA L+ L L+ILH N+T     VL
Sbjct: 315 TAENKSCLADVEVKLLGFDAMIKILSRRRPGQLIKTIAALEDLHLSILHTNITTMEQTVL 374

Query: 285 YSISLKVEEGCQLRSVDDIAAA 307
           YS ++K+    +  + +DIA++
Sbjct: 375 YSFNVKITSETRF-TAEDIASS 388

BLAST of CsGy3G044040 vs. TrEMBL
Match: tr|A0A0A0LF14|A0A0A0LF14_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G901180 PE=4 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 5.0e-115
Identity = 310/310 (100.00%), Positives = 310/310 (100.00%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE 60
           MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE
Sbjct: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE 60

Query: 61  VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM 120
           VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM
Sbjct: 61  VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM 120

Query: 121 NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX 180
           NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX
Sbjct: 121 NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI 240
           XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI 240

Query: 241 ETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRSV 300
           ETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRSV
Sbjct: 241 ETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRSV 300

Query: 301 DDIAAAAHHM 311
           DDIAAAAHHM
Sbjct: 301 DDIAAAAHHM 310

BLAST of CsGy3G044040 vs. TrEMBL
Match: tr|A0A1S3CRM4|A0A1S3CRM4_CUCME (transcription factor bHLH71 OS=Cucumis melo OX=3656 GN=LOC103503552 PE=4 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 1.5e-106
Identity = 279/311 (89.71%), Positives = 279/311 (89.71%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPH-HDSSENTFLSDDKCSKPNSRKRCTS 60
           MSLDALSSNDLFNFIIYDTISATP    NN VPH HDSSENTFLSDDKCSKPNSRKRC S
Sbjct: 1   MSLDALSSNDLFNFIIYDTISATP----NNFVPHGHDSSENTFLSDDKCSKPNSRKRCPS 60

Query: 61  EVEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQ 120
           EVEISNRVV   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX RMTHIAVERNRRKQ
Sbjct: 61  EVEISNRVV--VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRMTHIAVERNRRKQ 120

Query: 121 MNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQE 180
           MNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQE
Sbjct: 121 MNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQE 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTL 240
                                FSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTL
Sbjct: 181 QEMNEDSRIRKNDNSDNNNKLFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTL 240

Query: 241 IETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRS 300
           IETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRS
Sbjct: 241 IETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRS 300

Query: 301 VDDIAAAAHHM 311
           VDDIAAAAHHM
Sbjct: 301 VDDIAAAAHHM 305

BLAST of CsGy3G044040 vs. TrEMBL
Match: tr|A0A2I4DNU5|A0A2I4DNU5_9ROSI (transcription factor bHLH71-like OS=Juglans regia OX=51240 GN=LOC108982006 PE=4 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 9.6e-58
Identity = 149/310 (48.06%), Positives = 181/310 (58.39%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSKPNSRKRCTSE 60
           M+L+ALSSN+LFNFI+YDTISA P +S       HDSSE +FL ++     N      + 
Sbjct: 1   MALEALSSNELFNFIVYDTISAAPYSS-------HDSSETSFLLENAMPPQNLGAILENS 60

Query: 61  VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERNRRKQM 120
             ++ +                                      QRMTHIAVERNRRKQM
Sbjct: 61  SLMTQKRHCGRSQVVERRQNLAVQGRKKRRRKPRVCKNKEEAETQRMTHIAVERNRRKQM 120

Query: 121 NEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVDQHQEX 180
           NEHLSVLRSLMPESY QRGDQASIVGGA+EFVKELEHLL +LEA+KLQ+ Q  V    + 
Sbjct: 121 NEHLSVLRSLMPESYAQRGDQASIVGGAIEFVKELEHLLQSLEAQKLQLPQPGVGALIDE 180

Query: 181 XXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADIEVTLI 240
                               ++++ +    + +               + + ADIEV+LI
Sbjct: 181 DATTCKFRQPPFSQFFVYPQYTWSQIPNKYTTK--------------TEDAIADIEVSLI 240

Query: 241 ETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGCQLRSV 300
           ETHANLRIL+ R+ RQL KL+AG Q LRL+ILHLN+T   PLVLYSIS KVEEGCQL SV
Sbjct: 241 ETHANLRILTRRNPRQLSKLVAGFQTLRLSILHLNVTTMDPLVLYSISAKVEEGCQLTSV 289

Query: 301 DDIAAAAHHM 311
           DDIA A HHM
Sbjct: 301 DDIAGAVHHM 289

BLAST of CsGy3G044040 vs. TrEMBL
Match: tr|A0A061G3I4|A0A061G3I4_THECC (Basic helix-loop-helix DNA-binding superfamily protein OS=Theobroma cacao OX=3641 GN=TCM_015694 PE=4 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 2.8e-57
Identity = 169/315 (53.65%), Positives = 195/315 (61.90%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSNNNVVPHHDSSENTFLSDDKCSK----PNSRKR 60
           M+L+ LSSN+L NFIIYDTISATP +S++++V          +S ++ S     P    +
Sbjct: 1   MALETLSSNELLNFIIYDTISATPYSSHDSLVTDFSLENGFSISQEQASSLNCFPLVTPQ 60

Query: 61  CTSE-VEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHIAVERN 120
           C S  VE ++R                                      QRMTHIAVERN
Sbjct: 61  CRSTGVEAADR-----------RPNLAVQGRKKRRRKPRVCKNKEEAETQRMTHIAVERN 120

Query: 121 RRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQILQQEVD 180
           RRKQMNEHL+VLRSLMPESYVQRGDQASIVGGA+EFVKELEHLL TLEA+KLQ+LQQ   
Sbjct: 121 RRKQMNEHLAVLRSLMPESYVQRGDQASIVGGAIEFVKELEHLLQTLEAQKLQVLQQ--- 180

Query: 181 QHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKASSADI 240
                   XXXXXXXXXXXXXX   F F     +                   KAS ADI
Sbjct: 181 VRPASEGTXXXXXXXXXXXXXXAQFFMFPQYTWSQ---------IPSKFTSKTKASIADI 240

Query: 241 EVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLKVEEGC 300
           EVTLIETHANLRILS +  R L KL+AG Q+L L+ILHL++T  +PLVLYSIS KVEEGC
Sbjct: 241 EVTLIETHANLRILSRKGPRHLSKLVAGFQSLCLSILHLSVTTMYPLVLYSISAKVEEGC 292

Query: 301 QLRSVDDIAAAAHHM 311
           QL SVDDIA A HHM
Sbjct: 301 QLSSVDDIAGAVHHM 292

BLAST of CsGy3G044040 vs. TrEMBL
Match: tr|A0A2N9GA13|A0A2N9GA13_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS24294 PE=4 SV=1)

HSP 1 Score: 230.3 bits (586), Expect = 8.2e-57
Identity = 168/320 (52.50%), Positives = 188/320 (58.75%), Query Frame = 0

Query: 1   MSLDALSSNDLFNFIIYDTISATPNTSN---------NNVVPHHDSSENTFLSDDKCSKP 60
           M+L+ALSSNDLFNFIIYDTISATP +S+         N + P  D  ++  L +      
Sbjct: 1   MALEALSSNDLFNFIIYDTISATPYSSHAHESSFLLENEMKPQDDDHQDGILKNPSLMTK 60

Query: 61  NSRKRC-TSEVEISNRVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMTHI 120
                C +SEV   N+                                      QRMTHI
Sbjct: 61  KRHSACSSSEVADDNK-----------KQNLGVQGRKKRRRKPRVCKNKEEAETQRMTHI 120

Query: 121 AVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAVEFVKELEHLLSTLEAKKLQIL 180
           AVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGA+EFVKELEHLL +LEA+KLQ++
Sbjct: 121 AVERNRRKQMNEHLSVLRSLMPESYVQRGDQASIVGGAIEFVKELEHLLQSLEAQKLQLV 180

Query: 181 QQEVDQHQEXXXXXXXXXXXXXXXXXXXXXFSFASLLMNNSDQXXXXXXXXXXXXXXXKA 240
           Q  V            XXXXXXXXXX      FA   +                    KA
Sbjct: 181 QPGV----VGVGGLNEXXXXXXXXXXLAVQPPFAQFFVY---PQYTWSQIPNKYTSKTKA 240

Query: 241 SSADIEVTLIETHANLRILSTRSHRQLLKLIAGLQALRLTILHLNLTDFHPLVLYSISLK 300
           + ADIEVTLIETHANLRIL  RS RQ  KL+AG Q L L+ILHLN+T    LVLYSIS K
Sbjct: 241 AIADIEVTLIETHANLRILWRRSPRQHSKLVAGFQTLHLSILHLNVTTMDTLVLYSISAK 300

Query: 301 VEEGCQLRSVDDIAAAAHHM 311
           VEEGCQL SVDDIAAA HHM
Sbjct: 301 VEEGCQLTSVDDIAAAVHHM 302

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_004136191.1	7.6e-115	100.00	PREDICTED: transcription factor bHLH71 [Cucumis sativus] >KGN60373.1 hypothetica...
XP_008465984.1	2.2e-106	89.71	PREDICTED: transcription factor bHLH71 [Cucumis melo]
XP_023554782.1	5.7e-86	67.80	transcription factor bHLH71-like [Cucurbita pepo subsp. pepo]
XP_022963757.1	7.4e-86	64.29	transcription factor bHLH71-like [Cucurbita moschata]
XP_022967282.1	7.4e-86	70.59	transcription factor bHLH71-like [Cucurbita maxima]
Match Name	E-value	Identity	Description
AT1G22490.1	1.5e-43	50.00	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G46690.1	1.2e-40	47.35	beta HLH protein 71
AT1G72210.1	3.7e-39	47.09	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT2G46810.1	8.8e-33	44.33	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G24140.1	2.8e-31	49.01	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
Match Name	E-value	Identity	Description
sp|Q9SK91|BH094_ARATH	2.6e-42	50.00	Transcription factor bHLH94 OS=Arabidopsis thaliana OX=3702 GN=BHLH94 PE=1 SV=2
sp|Q56XR0|BH071_ARATH	2.1e-39	47.35	Transcription factor bHLH71 OS=Arabidopsis thaliana OX=3702 GN=BHLH71 PE=1 SV=1
sp|Q9C7T4|BH096_ARATH	6.7e-38	47.09	Transcription factor bHLH96 OS=Arabidopsis thaliana OX=3702 GN=BHLH96 PE=1 SV=1
sp|O81037|BH070_ARATH	1.6e-31	44.33	Transcription factor bHLH70 OS=Arabidopsis thaliana OX=3702 GN=BHLH70 PE=2 SV=1
sp|Q56YJ8|FAMA_ARATH	5.1e-30	49.01	Transcription factor FAMA OS=Arabidopsis thaliana OX=3702 GN=FAMA PE=1 SV=1
Match Name	E-value	Identity	Description
tr|A0A0A0LF14|A0A0A0LF14_CUCSA	5.0e-115	100.00	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G901180 PE=4 SV=1
tr|A0A1S3CRM4|A0A1S3CRM4_CUCME	1.5e-106	89.71	transcription factor bHLH71 OS=Cucumis melo OX=3656 GN=LOC103503552 PE=4 SV=1
tr|A0A2I4DNU5|A0A2I4DNU5_9ROSI	9.6e-58	48.06	transcription factor bHLH71-like OS=Juglans regia OX=51240 GN=LOC108982006 PE=4 ...
tr|A0A061G3I4|A0A061G3I4_THECC	2.8e-57	53.65	Basic helix-loop-helix DNA-binding superfamily protein OS=Theobroma cacao OX=364...
tr|A0A2N9GA13|A0A2N9GA13_FAGSY	8.2e-57	52.50	Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS24294 PE=4 SV=1
The following terms have been associated with this gene:

Vocabulary: Molecular Function
Term	Definition
GO:0046983	protein dimerization activity

Vocabulary: INTERPRO
Term	Definition
IPR036638	HLH_DNA-bd_sf
IPR011598	bHLH_dom
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0046983	protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsGy3G044040.1	CsGy3G044040.1	mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 152..187
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 75..102
None	No IPR available	PANTHER	PTHR11969	MAX DIMERIZATION, MAD	coord: 2..311
None	No IPR available	PANTHER	PTHR11969:SF31	TRANSCRIPTION FACTOR BHLH71	coord: 2..311
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	SMART	SM00353	finulus	coord: 110..161; e-value: 1.2E-9; score: 48.1
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	PFAM	PF00010	HLH	coord: 105..156; e-value: 3.1E-11; score: 43.0
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	PROSITE	PS50888	BHLH	coord: 104..155; score: 15.205
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	CDD	cd00083	HLH	coord: 103..160; e-value: 4.56727E-11; score: 57.6091
IPR036638	Helix-loop-helix DNA-binding domain superfamily	GENE3D	G3DSA:4.10.280.10	-	coord: 96..198; e-value: 5.9E-12; score: 47.4
IPR036638	Helix-loop-helix DNA-binding domain superfamily	SUPERFAMILY	SSF47459	HLH, helix-loop-helix DNA-binding domain	coord: 103..172
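
The four IPR011598 hits above place the bHLH domain at slightly different residue ranges. As a rough illustration, the Python sketch below computes the region shared by all four predictions and the outermost span they cover. The coordinates are copied from the table; the helper names `consensus_core` and `outer_span` are hypothetical, and treating the intersection as a "core" is an assumption made for illustration, not a step of the InterPro analysis.

```python
# Residue ranges (1-based, inclusive) for the IPR011598 hits listed above.
bhlh_hits = {
    "SMART SM00353": (110, 161),
    "PFAM PF00010": (105, 156),
    "PROSITE PS50888": (104, 155),
    "CDD cd00083": (103, 160),
}


def consensus_core(spans):
    """Return the range covered by every span, or None if they do not all overlap."""
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None


def outer_span(spans):
    """Return the smallest range that contains every span."""
    return (min(s for s, _ in spans), max(e for _, e in spans))


spans = list(bhlh_hits.values())
core = consensus_core(spans)
outer = outer_span(spans)
print("core:", core, "outer:", outer)
```

With these inputs the shared region is residues 110..155 and the outer span is 103..161, which lies inside the 96..198 range reported for the IPR036638 superfamily hit from GENE3D.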

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
CsGy3G044040	Carg08845	Silver-seed gourd	carcgybB0774
CsGy3G044040	Carg24527	Silver-seed gourd	carcgybB0956
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None