Cp4.1LG01g15300 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g15300
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: BHLH transcription factor
Location: Cp4.1LG01 : 9174100 .. 9178935 (-)
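
The Location field gives the span on Cp4.1LG01 and the strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), the feature length and the reverse complement needed for a minus-strand feature could be computed as below. The function names are illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string; characters other than ACGT pass through."""
    complement = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(complement)[::-1]


# Span taken from the Location field of this record.
print(feature_length(9174100, 9178935))
# A minus-strand feature is read from the reverse complement of the reference.
print(reverse_complement("ATGC"))
```

Under the stated assumption, the span 9174100 .. 9178935 covers 4836 bases.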
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TTTCTGTTTCTCTTCTCACTGCTCAGTCAAAACTCTCTCTCTCTCTCTCTCTTTCTCTTTCTTCTTAAAGCAGAATCAGATTTCTGTTTCCGAAAACTGACTCTGATTTTCCCCTTTCCGAATCAAATGTTAGTTAGTTACCCTTCTCCTCCTGTACTTCATCAACGTATACACTCACTTCAGTTGAGGAGAACCAGAGATTATGAGAGGCTTTGAGGGAGCTCTGGAGTGTCTTAGGGCTCTTGTGGAGACCAAAGTTTGGGATTATTGCATTGTTTGGAGATCGAGGGATGATTCTTCCAGGTATATATATCTATATATTTGATTACACTCTTCTCTGGTTTGATCGAAAATGGTGTGTTTTAGTTTTACTGGACGGCTGGGCTGCTGTTTGTAGGTTCATTGATTGGGTAGGTTGCTGCTGTAGCGGTGGAAGTTTCGACGACGGTGGGAAAAGGGAAGCCGGTGAGACAATCCCAGCTACCCTTTGTAAGGACACTCGATTTCGACATTTTAGGAGAACAAATGCTTGCCAAGCACTTGCTCAGTTCCCTTCTACTATATCATTGAACTCTGGGTAGCTACTTAGAACTTAGAACTGTTCGTTAATCTTCTAGACTAAGATGATTATGGAAGAACTGCTGGGAAAATTTCTTCAAATTCGTGTTTTATCGCAGGGTTCATGGAGATGTTTTGATTTCGAACCAACCCATGTGGCTTACGAGTGCTGAAGTTTCTTCTACTTCAAGCTTTTCACATGTGGGTTTCTTGTTTTTCTTCCTCTTTGGGGGCTGGTTGGTTTTGACTAACTTCTGTGCAATGCCATCTTATTTGGCAGGAGTTGACTGGAACTCGTGTTCTAATACCAATTTCTGCTGGCATTGTGGAGCTTTTTGCGACAAAACATGTAAGAACTAAGAGCTATGAACTACGAGTAAGAACATGGACTATATTTGTGCTACCAAAACAACTTCTCTTCACATTTTGGGTGCAATCTCTTTCAGATGCCGAGAGACAGAGAGGTTATAGACTTTGTAATGGCTCACTGTAATATCTCTATGGAGCAAGAATTTGATACAGAGAGTGAATTGGATGGTGACCTCAATGAGAAAAGACTTGATTCATGTACCAAGTACTATTCAGTAGACTGGCCTGATCCTCAGCCCTTGCTTGATTTCAAGTCCAAGTTAGAAATTCTTCCTTCCGTGTCCCAGTCCAACTCCTTTCCTAGCTGTGAAGGATCATCCAGTGGCTCAAAGCCTTCCAATGAGCATCACTACTTTGATTCCTATTCAAGTTTGGTTTCACGTGGGTTCTTCAACCAACCAATTCACAGGTCCTTTGAATCCAAGAGGCCAACGCCCCAGGAAGATTTGCTTGAACAACAAAGAGATGTGGTTTCGGATTATTCAAAGTTCTTGCAGAAAGATGAAGCCAAAACTGGAGGAGGGAGACAAGGAAAGGAAATTTACAAATCCAAAAATCTTATCACAGAGAGGAGAAGAAGAAACAAGATTAGGGATCGTCTCTACGCGTTACGTGCCTTAGTCCCTAATATATCCAAGGTTCAACCAAACTATCAAATACCATATGGTGAATTTGTTTAGATTATGTTTATTAATCTTCTGTCTTTGATATGATGATACCCATAGATGGATAGAGCATCAATCATTGTGGATGCTATCAACTACATCAGAGAGTTGGAAGAGAATGTGAAAAACCTCCAGAATGAACTTGTTCAATTGGAGCACAAAGACTGTCAAAAGAACAGACACTTGAAGATTTCTCCATCAGAGAAGAACAAAGATGACACAATATATTGGCCTCTTGTTCAGAACGATCAACCAATGTTTATCCTTGGTGAGGAGAAGCCTATGGAGGTAGAGTTTCAAAGTTTGACCTCATCTTTACAGCTCCTTGCCAAATAACCCTTCCAATCAAACATTAAGGGGAAAGAGTGCTTACTAATGCTGAATCGTATGCCCTATTAAAATGTTTTT
AGGTGGAAGTGGAAGTGATGCAAATTAATGAAAGAGATTTCTTAATCAAGCTATTCTGCAAGCAGACACAAGGTGGTGTGGTGAGCTCGATAGAGGCTATGGATTCATTGGGGTTGCAGGTTGTTGATGTCAATATTACCACCTTTGGGGGTATGGTCCTAAACATTTTCCATGTTGAGGTAAGACTCTTTTGAGTTTATATACCTATTATGAGCTTATACTTCAGATTACCAGTCTAAAAGTTTGTAAAGCCCTCCTTAAATTCTAGGTAAAGGAACTTTCTACTGACTGTTCTTCTATTGCATATATAGGCGAATGAAAATGACATTCAGCCTAAGAGACTGAGGGACTCACTAATCAAGCTAACTAGCAGAGAAATGAGACAATTAAAATAGGCTGTACAAAGGGAAAGTGTTTATTACCCGGGCTCGACGTAAACATCTTAGAAAGTGCAACGGATTATCCAAACGTCGGTTTGTGTAATCGCCTGCACCAGTTACATTAATCAGCTTGAGCTAGGGAACTTGTTATTTCATCAAAATGAGCCAAGTAAGTAACTTTTTTTCTTCGTTCTGCTAACTTCTTCTTTCCCTGCTGGTTATTACTCCAACTGGGGGAAATATAATACACAAACTATGAATGTATAATATAATGGGTAGACATATGGTGTTTAAATGCAATAAATAGAAGACCAGAGCATAACCAGAAGTAGAATATCACTGGTCAACCAATTATATTATATGGTGTTTACTAGTTGGGGCTCAATTGTAAAATTCCAAATCACTATTCCACACCTCGTTAGTCTAGAAGGATGTATAAGTTTATGATGAAAAAAGAGAAAACTAAGTCAAAATGGTGGGGAAAAGAAAGAAGAAAGAAGCTTTGCACGTTATTTACGTCTTGTTTCAGCATCATATTAGAATTGTCGACCGCTAGGGTGACTTTGTGGTGCGCCAGTTCGTCCGTGAAAAGCTCGAGTTTCTGATTCACCGCTTGAGTTGTTTTCGAATCTCCATGATTCCCCACTTGCATTCATTTCTCTGGAACTGTATTCGCTACTGTAAATGTCGAAGTCGGAGCTGTCTCCACCAAGCGCCATTCTCCTGAATTTAGAGATGTCTTGACTGTATTGTCCAGAATCATAGATGGTGCTCTGGCCATATTTGACACCATTGGAGAGGTCAGACATGTCTGTTTCAATATCTAGTGATCTCACCACCTGTTGACCAAAGTAGTATTAGAAGAGAACTACAATTTAAGAACAAGCTTGTTGACGAAATGAAACAGAGTTTTCAATGGAATTAAGTTAACCTGAACCATGCGAGGCCTTTTGGGAGCCGAATGACGAACGCAGGCAGCTGCTGCTTCGATCATTCTGAACATTTCACTTTCCACATACTGTTTTCCAAGGCGTGGATCTATCAATCCGTCAAAAACCCCAGTTTCAAGGGTGTGAAGGAGGTGTGGACGAGCCTGCAACAAACATAGAATATTACTGAATCCCTTCAAGTGCATCTCAAAGAAGATGAAGGCAGATCTTCAACATATAGCATGGCAGAAAGCTTAACGATGAGTAAAAAAGAAAATGGAGGAGAATTTTGAAATGTTTGTTCATTTACTGTCATACCCATTCAACCAAACTCTCATCCCCCAAAGGCTGAGTGGGATCAACAGGCTTACGACCAGTAATAAGCTCAAGAAGCACAACCCCAAACGAGAACACATCTGATCTGTCTGTCAATTTCCCACTTGATGCGTACTCAGGCGCCATGTATCTGCAAGCCATTGAAATACAAACGATAAATTATTAGGAAACACACGGCTCCAAGAATTGTCATCCGATGAGAATATGTCTGAGTTAACCCACCCAAATGTCCCCATGACACGAGTCGAAACGTGGGTGTTTGTATCGTTTGTCAGTTTTGCAAGTCCAAAGTCTGCAACCTGTGTTCGAAGTTGTCCATCGAAATCGGAATCCGAGACTGTAGAAACAAGAAGCA
ATCGAAATTATAAGAAAAGAATTACTCTGGACGTACCTGTGCGTCAAAAGCATCATCCAGCAAAATGTTGGCTGACTTGATGTCTCTGTGAATAATTCTGGGATGGCCTATGAAAAGAGCAGTAAAAGGAGATGGGGAGTTAACAGAGTATGTGGTCTCAGTAACATTGGAACAGTTCAATTCAGTAAAGCCTTTTGAATTTAGAGATATGTACTCACAATCTTCATGAAGATATGCCAAACCCTTTGCAGCTCCTAAAGCGATTTTGAGTCTTTTGGACCAATCCAACACGGGTACTCCATTACCTGCCAGGAGAAATCAAATATTTTAAACATCAACACCAGGTGGGGCTATTTTGAAGTGATGGTGGAGCTTAAGGATAGGTTCCACATACCGCCATGGAGATGATGCTCAAGAGTCTTATTGGGAACAAAATCATAGATGAGCAATCTATGATTCTCAGAGACGCAGTAGCCCACTAAAGACACCAAATGCCGATGATGAACACGACTGATAATCTCAACTTCCGCCTTGAATTCCCTCTCCCCCTGTCCACTTCCTGCCTTGAGTTGCTTCACAGCCACTGACCTCCCTGCCGGAAGCCATCCCTGGTAAACACATCCAAACCCACCTTCTCCAAGAATGTTTTGACGCGAAAATCCAGACGTAATCTCCATCAATTCCTCATAGCTAAAGACGAACTTAGCACTGTTTATCACACCTGAGTCTGCTCCACTGCCACTGGATCCTGTTCCTTTTTGACTCCCAAAGCTATTTCCAACCGGAGTATGTGGGACCTGAGTGTAGAAGCCTTCAGCAGAGCCAGAGTTGCCCAT

mRNA sequence

TTTCTGTTTCTCTTCTCACTGCTCAGTCAAAACTCTCTCTCTCTCTCTCTCTTTCTCTTTCTTCTTAAAGCAGAATCAGATTTCTGTTTCCGAAAACTGACTCTGATTTTCCCCTTTCCGAATCAAATGTTAGTTAGTTACCCTTCTCCTCCTGTACTTCATCAACGTATACACTCACTTCAGTTGAGGAGAACCAGAGATTATGAGAGGCTTTGAGGGAGCTCTGGAGTGTCTTAGGGCTCTTGTGGAGACCAAAGTTTGGGATTATTGCATTGTTTGGAGATCGAGGGATGATTCTTCCAGGTTCATTGATTGGGTAGGTTGCTGCTGTAGCGGTGGAAGTTTCGACGACGGTGGGAAAAGGGAAGCCGGTGAGACAATCCCAGCTACCCTTTGTAAGGACACTCGATTTCGACATTTTAGGAGAACAAATGCTTGCCAAGCACTTGCTCAGTTCCCTTCTACTATATCATTGAACTCTGGGGTTCATGGAGATGTTTTGATTTCGAACCAACCCATGTGGCTTACGAGTGCTGAAGTTTCTTCTACTTCAAGCTTTTCACATGAGTTGACTGGAACTCGTGTTCTAATACCAATTTCTGCTGGCATTGTGGAGCTTTTTGCGACAAAACATATGCCGAGAGACAGAGAGGTTATAGACTTTGTAATGGCTCACTGTAATATCTCTATGGAGCAAGAATTTGATACAGAGAGTGAATTGGATGGTGACCTCAATGAGAAAAGACTTGATTCATGTACCAAGTACTATTCAGTAGACTGGCCTGATCCTCAGCCCTTGCTTGATTTCAAGTCCAAGTTAGAAATTCTTCCTTCCGTGTCCCAGTCCAACTCCTTTCCTAGCTGTGAAGGATCATCCAGTGGCTCAAAGCCTTCCAATGAGCATCACTACTTTGATTCCTATTCAAGTTTGGTTTCACGTGGGTTCTTCAACCAACCAATTCACAGGTCCTTTGAATCCAAGAGGCCAACGCCCCAGGAAGATTTGCTTGAACAACAAAGAGATGTGGTTTCGGATTATTCAAAGTTCTTGCAGAAAGATGAAGCCAAAACTGGAGGAGGGAGACAAGGAAAGGAAATTTACAAATCCAAAAATCTTATCACAGAGAGGAGAAGAAGAAACAAGATTAGGGATCGTCTCTACGCGTTACGTGCCTTAGTCCCTAATATATCCAAGATGGATAGAGCATCAATCATTGTGGATGCTATCAACTACATCAGAGAGTTGGAAGAGAATGTGAAAAACCTCCAGAATGAACTTGTTCAATTGGAGCACAAAGACTGTCAAAAGAACAGACACTTGAAGATTTCTCCATCAGAGAAGAACAAAGATGACACAATATATTGGCCTCTTGTTCAGAACGATCAACCAATGTTTATCCTTGGTGAGGAGAAGCCTATGGAGGTGGAAGTGGAAGTGATGCAAATTAATGAAAGAGATTTCTTAATCAAGCTATTCTGCAAGCAGACACAAGGTGGTGTGGTGAGCTCGATAGAGGCTATGGATTCATTGGGGTTGCAGGTTGTTGATGTCAATATTACCACCTTTGGGGGTATGGTCCTAAACATTTTCCATGTTGAGGCGAATGAAAATGACATTCAGCCTAAGAGACTGAGGGACTCACTAATCAAGCTAACTAGCAGAGAAATGAGACAATTAAAATAGGCTGTACAAAGGGAAAGTGTTTATTACCCGGGCTCGACGTAAACATCTTAGAAAGTGCAACGGATTATCCAAACGTCGGTTTGTGTAATCGCCTGCACCAGTTACATTAATCAGCTTGAGCTAGGGAACTTGTTATTTCATCAAAATGAGCCAAGTAAGTAACTTTTTTTCTTCGTTCTGCTAACTTCTTCTTTCCCTGCTGGTTATTACTCCAACTGGGGGAAATATAATACACAAACTATGAATGTATAATATAATGGGTAGACATATGGTGTTTAAATGCAATAAATAGAAGACCAGAGCATAACCAGAAGTA
GAATATCACTGGTCAACCAATTATATTATATGGTGTTTACTAGTTGGGGCTCAATTGTAAAATTCCAAATCACTATTCCACACCTCGTTAGTCTAGAAGGATGTATAAGTTTATGATGAAAAAAGAGAAAACTAAGTCAAAATGGTGGGGAAAAGAAAGAAGAAAGAAGCTTTGCACGTTATTTACGTCTTGTTTCAGCATCATATTAGAATTGTCGACCGCTAGGGTGACTTTGTGGTGCGCCAGTTCGTCCGTGAAAAGCTCGAGTTTCTGATTCACCGCTTGAGTTGTTTTCGAATCTCCATGATTCCCCACTTGCATTCATTTCTCTGGAACTGTATTCGCTACTGTAAATGTCGAAGTCGGAGCTGTCTCCACCAAGCGCCATTCTCCTGAATTTAGAGATGTCTTGACTGTATTGTCCAGAATCATAGATGGTGCTCTGGCCATATTTGACACCATTGGAGAGGTCAGACATGTCTGTTTCAATATCTAGTGATCTCACCACCTGTTGACCAAAGTAGTATTAGAAGAGAACTACAATTTAAGAACAAGCTTGTTGACGAAATGAAACAGAGTTTTCAATGGAATTAAGTTAACCTGAACCATGCGAGGCCTTTTGGGAGCCGAATGACGAACGCAGGCAGCTGCTGCTTCGATCATTCTGAACATTTCACTTTCCACATACTGTTTTCCAAGGCGTGGATCTATCAATCCGTCAAAAACCCCAGTTTCAAGGGTGTGAAGGAGGTGTGGACGAGCCTGCAACAAACATAGAATATTACTGAATCCCTTCAAGTGCATCTCAAAGAAGATGAAGGCAGATCTTCAACATATAGCATGGCAGAAAGCTTAACGATGAGTAAAAAAGAAAATGGAGGAGAATTTTGAAATGTTTGTTCATTTACTGTCATACCCATTCAACCAAACTCTCATCCCCCAAAGGCTGAGTGGGATCAACAGGCTTACGACCAGTAATAAGCTCAAGAAGCACAACCCCAAACGAGAACACATCTGATCTGTCTGTCAATTTCCCACTTGATGCGTACTCAGGCGCCATGTATCTGCAAGCCATTGAAATACAAACGATAAATTATTAGGAAACACACGGCTCCAAGAATTGTCATCCGATGAGAATATGTCTGAGTTAACCCACCCAAATGTCCCCATGACACGAGTCGAAACGTGGGTGTTTGTATCGTTTGTCAGTTTTGCAAGTCCAAAGTCTGCAACCTGTGTTCGAAGTTGTCCATCGAAATCGGAATCCGAGACTGTAGAAACAAGAAGCAATCGAAATTATAAGAAAAGAATTACTCTGGACGTACCTGTGCGTCAAAAGCATCATCCAGCAAAATGTTGGCTGACTTGATGTCTCTGTGAATAATTCTGGGATGGCCTATGAAAAGAGCAGTAAAAGGAGATGGGGAGTTAACAGAGTATGTGGTCTCAGTAACATTGGAACAGTTCAATTCAGTAAAGCCTTTTGAATTTAGAGATATGTACTCACAATCTTCATGAAGATATGCCAAACCCTTTGCAGCTCCTAAAGCGATTTTGAGTCTTTTGGACCAATCCAACACGGGTACTCCATTACCTGCCAGGAGAAATCAAATATTTTAAACATCAACACCAGGTGGGGCTATTTTGAAGTGATGGTGGAGCTTAAGGATAGGTTCCACATACCGCCATGGAGATGATGCTCAAGAGTCTTATTGGGAACAAAATCATAGATGAGCAATCTATGATTCTCAGAGACGCAGTAGCCCACTAAAGACACCAAATGCCGATGATGAACACGACTGATAATCTCAACTTCCGCCTTGAATTCCCTCTCCCCCTGTCCACTTCCTGCCTTGAGTTGCTTCACAGCCACTGACCTCCCTGCCGGAAGCCATCCCTGGTAAACACATCCAAACCCACCTTCTCCAAGAATGTTTTGACGCGAAAATCCAGACGTAATCTCCATCAATTCCTCATAGCTAAAGACGAACTTAGCACTG
TTTATCACACCTGAGTCTGCTCCACTGCCACTGGATCCTGTTCCTTTTTGACTCCCAAAGCTATTTCCAACCGGAGTATGTGGGACCTGAGTGTAGAAGCCTTCAGCAGAGCCAGAGTTGCCCAT

Coding sequence (CDS)

ATGAGAGGCTTTGAGGGAGCTCTGGAGTGTCTTAGGGCTCTTGTGGAGACCAAAGTTTGGGATTATTGCATTGTTTGGAGATCGAGGGATGATTCTTCCAGGTTCATTGATTGGGTAGGTTGCTGCTGTAGCGGTGGAAGTTTCGACGACGGTGGGAAAAGGGAAGCCGGTGAGACAATCCCAGCTACCCTTTGTAAGGACACTCGATTTCGACATTTTAGGAGAACAAATGCTTGCCAAGCACTTGCTCAGTTCCCTTCTACTATATCATTGAACTCTGGGGTTCATGGAGATGTTTTGATTTCGAACCAACCCATGTGGCTTACGAGTGCTGAAGTTTCTTCTACTTCAAGCTTTTCACATGAGTTGACTGGAACTCGTGTTCTAATACCAATTTCTGCTGGCATTGTGGAGCTTTTTGCGACAAAACATATGCCGAGAGACAGAGAGGTTATAGACTTTGTAATGGCTCACTGTAATATCTCTATGGAGCAAGAATTTGATACAGAGAGTGAATTGGATGGTGACCTCAATGAGAAAAGACTTGATTCATGTACCAAGTACTATTCAGTAGACTGGCCTGATCCTCAGCCCTTGCTTGATTTCAAGTCCAAGTTAGAAATTCTTCCTTCCGTGTCCCAGTCCAACTCCTTTCCTAGCTGTGAAGGATCATCCAGTGGCTCAAAGCCTTCCAATGAGCATCACTACTTTGATTCCTATTCAAGTTTGGTTTCACGTGGGTTCTTCAACCAACCAATTCACAGGTCCTTTGAATCCAAGAGGCCAACGCCCCAGGAAGATTTGCTTGAACAACAAAGAGATGTGGTTTCGGATTATTCAAAGTTCTTGCAGAAAGATGAAGCCAAAACTGGAGGAGGGAGACAAGGAAAGGAAATTTACAAATCCAAAAATCTTATCACAGAGAGGAGAAGAAGAAACAAGATTAGGGATCGTCTCTACGCGTTACGTGCCTTAGTCCCTAATATATCCAAGATGGATAGAGCATCAATCATTGTGGATGCTATCAACTACATCAGAGAGTTGGAAGAGAATGTGAAAAACCTCCAGAATGAACTTGTTCAATTGGAGCACAAAGACTGTCAAAAGAACAGACACTTGAAGATTTCTCCATCAGAGAAGAACAAAGATGACACAATATATTGGCCTCTTGTTCAGAACGATCAACCAATGTTTATCCTTGGTGAGGAGAAGCCTATGGAGGTGGAAGTGGAAGTGATGCAAATTAATGAAAGAGATTTCTTAATCAAGCTATTCTGCAAGCAGACACAAGGTGGTGTGGTGAGCTCGATAGAGGCTATGGATTCATTGGGGTTGCAGGTTGTTGATGTCAATATTACCACCTTTGGGGGTATGGTCCTAAACATTTTCCATGTTGAGGCGAATGAAAATGACATTCAGCCTAAGAGACTGAGGGACTCACTAATCAAGCTAACTAGCAGAGAAATGAGACAATTAAAATAG

Protein sequence

MRGFEGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGGKREAGETIPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSFSHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELDGDLNEKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSCEGSSSGSKPSNEHHYFDSYSSLVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQNELVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANENDIQPKRLRDSLIKLTSREMRQLK
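
The start and end of the protein sequence above match a standard-code translation of the CDS: the first ten codons give MRGFEGALEC, and the last codons before the TAG stop give RQLK. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    cds = cds.upper()
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS shown in this record.
print(translate("ATGAGAGGCTTTGAGGGAGCTCTGGAGTGT"))
```

Passing the full coding sequence to `translate` should allow a residue-by-residue comparison with the protein sequence listed in the record.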
BLAST of Cp4.1LG01g15300 vs. Swiss-Prot
Match: AMS_ARATH (Transcription factor ABORTED MICROSPORES OS=Arabidopsis thaliana GN=AMS PE=1 SV=2)

HSP 1 Score: 246.5 bits (628), Expect = 6.0e-64
Identity = 168/516 (32.56%), Positives = 261/516 (50.58%), Query Frame = 1

Query: 5   EGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGGKREAGETIPATL 64
           +  LE LR LV  + WDYC++WR  +D  RF+ W+GCCC G            E      
Sbjct: 6   QNLLEKLRPLVGARAWDYCVLWRLNEDQ-RFVKWMGCCCGGTELI---AENGTEEFSYGG 65

Query: 65  CKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSFSHELT 124
           C+D  F H  RT +C+ L+  P++I L+SG++ + L++NQ  WL+    SS  SF  E  
Sbjct: 66  CRDVMFHH-PRTKSCEFLSHLPASIPLDSGIYAETLLTNQTGWLSE---SSEPSFMQETI 125

Query: 125 GTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELDGDLNEKRLDS 184
            TRVLIPI  G+VELFAT+H+  D+ V+DFVM HCN+ M+        +  ++  K    
Sbjct: 126 CTRVLIPIPGGLVELFATRHVAEDQNVVDFVMGHCNMLMDDSVTINMMVADEVESKPYGM 185

Query: 185 CTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSCEGSSSGSKPSNEHHYFDSYSSLV 244
            +          + +++  S  +I     + N  P      +        ++  +   L 
Sbjct: 186 LSGDIQQKGSKEEDMMNLPSSYDISADQIRLNFLPQMSDYETQHLKMKSDYHHQALGYLP 245

Query: 245 SRG----FFNQPIHRSFESKRPTPQE-DLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKEI 304
             G        P +   E   P   E  LL  ++ VV+D  K + ++     G     +I
Sbjct: 246 ENGNKEMMGMNPFNTVEEDGIPVIGEPSLLVNEQQVVND--KDMNENGRVDSGSDCSDQI 305

Query: 305 -------YK--------SKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINY 364
                  YK        +KNL+ ERRRR K+ DRLYALR+LVP I+K+DRASI+ DAINY
Sbjct: 306 DDEDDPKYKKKSGKGSQAKNLMAERRRRKKLNDRLYALRSLVPRITKLDRASILGDAINY 365

Query: 365 IRELEENVKNLQNELVQLEHKDCQKNR--------HLKISPSEKNKDDTIYWPLVQNDQP 424
           ++EL+   K LQ+EL +    +   NR           ++            P V+ D  
Sbjct: 366 VKELQNEAKELQDELEENSETEDGSNRPQGGMSLNGTVVTGFHPGLSCNSNVPSVKQDVD 425

Query: 425 MFILGEE-KPMEVEVEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTF 484
           +    ++ + ME +V+V Q++ R+F +K+ C+   GG    +EA+DSLGL+V + N T +
Sbjct: 426 LENSNDKGQEMEPQVDVAQLDGREFFVKVICEYKPGGFTRLMEALDSLGLEVTNANTTRY 485

Query: 485 GGMVLNIFHVEANEND-IQPKRLRDSLIKLTSREMR 491
             +V N+F VE N+N+ +Q + +R+SL+++T    R
Sbjct: 486 LSLVSNVFKVEKNDNEMVQAEHVRNSLLEITRNTSR 511
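
The percentages on the HSP statistics line are the match counts divided by the alignment length, which includes gap columns. A minimal sketch that recomputes them from such a line follows; the parser `hsp_percentages` is a hypothetical helper and assumes the "label = n/m" layout used in this report.

```python
import re


def hsp_percentages(stats_line: str) -> dict:
    """Recompute percentages from 'Label = matches/length' fields of an HSP line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


# Counts taken from the AMS_ARATH hit in this record.
line = "Identity = 168/516 (32.56%), Positives = 261/516 (50.58%), Query Frame = 1"
print(hsp_percentages(line))
```

Fields without a fraction, such as the query frame, are ignored by the pattern.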

BLAST of Cp4.1LG01g15300 vs. Swiss-Prot
Match: BH090_ARATH (Transcription factor bHLH90 OS=Arabidopsis thaliana GN=BHLH90 PE=2 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 4.6e-56
Identity = 163/496 (32.86%), Positives = 245/496 (49.40%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGGKRE----- 60
           MRG E   E LR  V+++ WD C++W+  DD SRFI+WVGCCCSG   D   K E     
Sbjct: 4   MRGGERVKEFLRPFVDSRTWDLCVIWKLGDDPSRFIEWVGCCCSGCYIDKNIKLENSEEG 63

Query: 61  -AGETIPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVS 120
             G    A+ C+D   +H  RT AC+AL++FP  + L  G+HG+V++S  P WL +    
Sbjct: 64  GTGRKKKASFCRDDHNKHRIRTLACEALSRFPLFMPLYPGIHGEVVMSKSPKWLVN---- 123

Query: 121 STSSFSHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELD 180
             S    E+  TRVL+P+S G+VELFA    P D  ++  +M+ C    E          
Sbjct: 124 --SGSKMEMFSTRVLVPVSDGLVELFAFDMRPFDESMVHLIMSRCTTFFE---------- 183

Query: 181 GDLNEKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFP---SCEG--SSSGSK 240
                              P P+  L F+    I+P   +S S     S EG  SSS S 
Sbjct: 184 -------------------PFPEQRLQFR----IIPRAEESMSSGVNLSVEGGGSSSVSN 243

Query: 241 PSNEHHYFDSYSSLVSRGFFNQPIHRSFESKRPTPQEDLL-EQQRDVVSDYSKFLQKDEA 300
           PS+E              F N P     E  R      L+  +++DVV   +     +++
Sbjct: 244 PSSETQNL----------FGNYPNASCVEILREEQTPCLIMNKEKDVVVQNA-----NDS 303

Query: 301 KTGGGRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIREL 360
           K        E +KSKNL +ER+RR +I   +Y LRA+VP I+K+++  I  DA++YI EL
Sbjct: 304 KANKKLLPTENFKSKNLHSERKRRERINQAMYGLRAVVPKITKLNKIGIFSDAVDYINEL 363

Query: 361 EENVKNLQNELVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEV 420
               + L++EL  +   +C+     +I+  E++         V +     +    K  EV
Sbjct: 364 LVEKQKLEDELKGINEMECK-----EIAAEEQSAIADPEAERVSSKSNKRV----KKNEV 423

Query: 421 EVEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEAN 480
           ++EV +  ERDFLI++  +  Q G    IEA+D   L+++DVN T     V+ + +V+AN
Sbjct: 424 KIEVHETGERDFLIRVVQEHKQDGFKRLIEAVDLCELEIIDVNFTRLDLTVMTVLNVKAN 436

Query: 481 ENDIQPKRLRDSLIKL 485
           ++ I    LRD L+K+
Sbjct: 484 KDGIACGILRDLLLKM 436

BLAST of Cp4.1LG01g15300 vs. Swiss-Prot
Match: TDR_ORYSJ (Transcription factor TDR OS=Oryza sativa subsp. japonica GN=TDR PE=1 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 8.0e-24
Identity = 82/232 (35.34%), Positives = 130/232 (56.03%), Query Frame = 1

Query: 287 EAKTGGGRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIR 346
           E ++GG ++     + KNL  ER+RR K+   LY LR+LVPNI+KMDRASI+ DAI+YI 
Sbjct: 273 EGRSGGAKR----QQCKNLEAERKRRKKLNGHLYKLRSLVPNITKMDRASILGDAIDYIV 332

Query: 347 ELEENVKNLQNEL----------------------VQLEHKDC------QKNRHLKISPS 406
            L++ VK LQ+EL                      V L++ D       Q+   L +S S
Sbjct: 333 GLQKQVKELQDELEDNHVHHKPPDVLIDHPPPASLVGLDNDDASPPNSHQQQPPLAVSGS 392

Query: 407 EK---NKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINERDFLIKLFCKQTQGGVVS 466
                NKD     P + +D+     G    ME ++EV Q+   +  +++  +   GG V 
Sbjct: 393 SSRRSNKD-----PAMTDDKVGGGGGGGHRMEPQLEVRQVQGNELFVQVLWEHKPGGFVR 452

Query: 467 SIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANEND--IQPKRLRDSLIKLT 486
            ++AM++LGL+V++VN+TT+  +VLN+F V   +++  +Q  R+RDSL+++T
Sbjct: 453 LMDAMNALGLEVINVNVTTYKTLVLNVFRVMVRDSEVAVQADRVRDSLLEVT 495

BLAST of Cp4.1LG01g15300 vs. Swiss-Prot
Match: SCRM2_ARATH (Transcription factor SCREAM2 OS=Arabidopsis thaliana GN=SCRM2 PE=1 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.8e-20
Identity = 69/229 (30.13%), Positives = 124/229 (54.15%), Query Frame = 1

Query: 256 SFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKEIYKSKNLITERRRRNKI 315
           S E ++ + + ++ +    ++       + D+  T   +  K+   +KNL+ ERRRR K+
Sbjct: 220 SSEMRKSSYEREIDDTSTGIIDISGLNYESDDHNTNNNKGKKKGMPAKNLMAERRRRKKL 279

Query: 316 RDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQNELVQLEHKDCQKNRHLKI 375
            DRLY LR++VP ISKMDRASI+ DAI+Y++EL + + +L  E   LE      +    +
Sbjct: 280 NDRLYMLRSVVPKISKMDRASILGDAIDYLKELLQRINDLHTE---LESTPPSSSSLHPL 339

Query: 376 SPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINERDFLIKLFCKQTQGGVVS 435
           +P+ +    T+ + + +   P   L   K  +  VEV     +   I +FC +  G ++S
Sbjct: 340 TPTPQ----TLSYRVKEELCPSSSLPSPKGQQPRVEVRLREGKAVNIHMFCGRRPGLLLS 399

Query: 436 SIEAMDSLGLQVVDVNITTFGGMVLNIFHVE--ANENDIQPKRLRDSLI 483
           ++ A+D+LGL V    I+ F G  L++F  E    ++D+ P++++  L+
Sbjct: 400 TMRALDNLGLDVQQAVISCFNGFALDVFRAEQCQEDHDVLPEQIKAVLL 441

BLAST of Cp4.1LG01g15300 vs. Swiss-Prot
Match: ICE1_ARATH (Transcription factor ICE1 OS=Arabidopsis thaliana GN=SCRM PE=1 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 4.1e-20
Identity = 66/186 (35.48%), Positives = 97/186 (52.15%), Query Frame = 1

Query: 291 GGGRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEE 350
           GGG+  K+   +KNL+ ERRRR K+ DRLY LR++VP ISKMDRASI+ DAI+Y++EL +
Sbjct: 295 GGGKGKKKGMPAKNLMAERRRRKKLNDRLYMLRSVVPKISKMDRASILGDAIDYLKELLQ 354

Query: 351 NVKNLQNELVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFI----------L 410
            + +L NEL                 P       + + PL    Q +            L
Sbjct: 355 RINDLHNELE-------------STPPGSLPPTSSSFHPLTPTPQTLSCRVKEELCPSSL 414

Query: 411 GEEKPMEVEVEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVL 467
              K  +  VEV     R   I +FC +  G ++++++A+D+LGL V    I+ F G  L
Sbjct: 415 PSPKGQQARVEVRLREGRAVNIHMFCGRRPGLLLATMKALDNLGLDVQQAVISCFNGFAL 467

BLAST of Cp4.1LG01g15300 vs. TrEMBL
Match: E5GBJ2_CUCME (BHLH transcription factor OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 759.2 bits (1959), Expect = 3.1e-216
Identity = 396/489 (80.98%), Positives = 423/489 (86.50%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSS-RFIDWVGCCCSGGSFDDGGKREAGET 60
           MR FEGALE LR LVE K+WDYCIVW+SRDD S RFIDWVGCCCSGG  D GGK EAGET
Sbjct: 1   MRSFEGALEFLRPLVEIKLWDYCIVWKSRDDDSLRFIDWVGCCCSGGVSDAGGKEEAGET 60

Query: 61  IPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSF 120
           IPA LCKDTRFRHFRRTNACQALAQFPS+ISLN+GVHGDVLISNQPMWLTS E S  SSF
Sbjct: 61  IPAALCKDTRFRHFRRTNACQALAQFPSSISLNTGVHGDVLISNQPMWLTSGEASYFSSF 120

Query: 121 SHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESEL-DGDLN 180
           SHELTGTRVLIP+S GIVELFATK MPR+ EVIDFVMAHCNIS+EQEF+TES L D  LN
Sbjct: 121 SHELTGTRVLIPVSGGIVELFATKRMPREGEVIDFVMAHCNISLEQEFETESALLDAGLN 180

Query: 181 EKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSC-EGSSSGSKPSNEHHYF 240
           EK L S TKYYS++WPDPQP L FKSKLEILPSVSQS+SFP C EGSSSGSKP       
Sbjct: 181 EKILSSSTKYYSLNWPDPQPFLGFKSKLEILPSVSQSSSFPGCGEGSSSGSKP------- 240

Query: 241 DSYSSLVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGK 300
                  S G FNQPIH SFESK  T +E+LLEQQ++VVSD+SK LQKDEAKT G +Q K
Sbjct: 241 -------SPGLFNQPIHTSFESKAATHREELLEQQKNVVSDHSKILQKDEAKT-GEKQEK 300

Query: 301 EIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQN 360
           E+YKSKNL+TERRRRNKIRDRLY LRALVPNISKMDRASIIVDAI YIRELEENVK+LQN
Sbjct: 301 EVYKSKNLMTERRRRNKIRDRLYTLRALVPNISKMDRASIIVDAIGYIRELEENVKSLQN 360

Query: 361 ELVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINE 420
           EL+QLEHKDCQKN+HLKISP EK  DD   W  VQ+DQPMFIL EEKPMEVEVEVM+INE
Sbjct: 361 ELIQLEHKDCQKNKHLKISPLEKTNDDINSWSFVQDDQPMFILNEEKPMEVEVEVMRINE 420

Query: 421 RDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANENDIQPKRL 480
           RDFLIKLFCK+ QGGVVSSIEAM SLGLQV+DVNITTFGGMVLNIFHVEANENDIQPKRL
Sbjct: 421 RDFLIKLFCKRKQGGVVSSIEAMYSLGLQVIDVNITTFGGMVLNIFHVEANENDIQPKRL 474

Query: 481 RDSLIKLTS 487
           RDSL+KLTS
Sbjct: 481 RDSLMKLTS 474

BLAST of Cp4.1LG01g15300 vs. TrEMBL
Match: A0A0A0KQ91_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G601530 PE=4 SV=1)

HSP 1 Score: 751.9 bits (1940), Expect = 4.9e-214
Identity = 392/488 (80.33%), Positives = 418/488 (85.66%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSS-RFIDWVGCCCSGGSFDDGGKREAGET 60
           MR FE ALE LR LVE K+WDYCIVW+SRDD S RFIDWVGCCCSGG    GGK EAGET
Sbjct: 1   MRSFEEALEFLRPLVEIKLWDYCIVWKSRDDDSLRFIDWVGCCCSGGVSGAGGKEEAGET 60

Query: 61  IPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSF 120
           IPA LCKDTRFRHFRRTNACQALAQFPS+ISLN+GVHGDV ISNQPMWLTS EVS  SSF
Sbjct: 61  IPAALCKDTRFRHFRRTNACQALAQFPSSISLNTGVHGDVSISNQPMWLTSGEVSYFSSF 120

Query: 121 SHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELDGDLNE 180
           SHELTGTRVLIP+S GIVELFATK MPR+ EVIDFVMAHCN S+ QEF+TES L+  LNE
Sbjct: 121 SHELTGTRVLIPVSGGIVELFATKRMPREGEVIDFVMAHCNFSLGQEFETESALNAGLNE 180

Query: 181 KRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSC-EGSSSGSKPSNEHHYFD 240
           K L+S TKYYS++WPDPQ +L FKSKLE LPSVSQS+SFP C EGSSSGSKP        
Sbjct: 181 KILNSSTKYYSLNWPDPQAILGFKSKLETLPSVSQSSSFPGCGEGSSSGSKP-------- 240

Query: 241 SYSSLVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKE 300
                 S G FNQPI  SFESK    QEDLLEQQR+VV D+SK LQKDEAKT G +Q KE
Sbjct: 241 ------SPGLFNQPIRTSFESKAGMRQEDLLEQQRNVVLDHSKILQKDEAKT-GEKQEKE 300

Query: 301 IYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQNE 360
           +YKSKNL+TERRRRNKIRDRLY LRALVPNISKMDRASIIVDAI YIRELEENVK+LQNE
Sbjct: 301 VYKSKNLMTERRRRNKIRDRLYTLRALVPNISKMDRASIIVDAIGYIRELEENVKSLQNE 360

Query: 361 LVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINER 420
           L+QLEHKDCQKN+HLK+SP EK  DD   WP VQ+DQPMFIL EEKPMEVEVEVMQINER
Sbjct: 361 LIQLEHKDCQKNKHLKVSPLEKTNDDINSWPFVQDDQPMFILDEEKPMEVEVEVMQINER 420

Query: 421 DFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANENDIQPKRLR 480
           DFLIKLFCK+ QGGVVSSIEAMDSLGLQV+DVNITTFGGMVLNIFHVEANENDIQPKRLR
Sbjct: 421 DFLIKLFCKRKQGGVVSSIEAMDSLGLQVIDVNITTFGGMVLNIFHVEANENDIQPKRLR 473

Query: 481 DSLIKLTS 487
           DSLIKLTS
Sbjct: 481 DSLIKLTS 473

BLAST of Cp4.1LG01g15300 vs. TrEMBL
Match: A0A067JRR7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17750 PE=4 SV=1)

HSP 1 Score: 354.4 bits (908), Expect = 2.3e-94
Identity = 207/487 (42.51%), Positives = 304/487 (62.42%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGGK-REAGET 60
           MRG E A+E LR  V++K WD C+VW+  DD SRFI W+GCCCSGG   DGGK +E    
Sbjct: 17  MRGLERAVELLRPFVDSKAWDCCVVWKLGDDPSRFIQWMGCCCSGGGGGDGGKVKEERID 76

Query: 61  IPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSF 120
               +C+D  F+H  RT AC+ALA FPS + L SG+HG+V++S Q  W+T    S  S F
Sbjct: 77  ENVGICRDLYFKHPIRTKACEALACFPSFMPLYSGIHGEVVLSKQSKWVTRVNASD-SKF 136

Query: 121 SHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELDGDLNE 180
           SHE  GTRVLIP+S G++ELFA KH+P+D+++I+F+ +  N+ ++QE        G LNE
Sbjct: 137 SHESIGTRVLIPVSGGLIELFAGKHIPKDQKIIEFITSQFNV-LKQEVMIAHGFTG-LNE 196

Query: 181 KRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSCEGSSSGSKPSNEHHYFDS 240
             LD   +    + P P  LL    + ++L  ++Q+N+  S   SSSGS P N     DS
Sbjct: 197 LCLDPFLEQNMQNLPKPCQLLSLIPQAQVLHPLNQTNTHSSFVVSSSGSSPPNGLPSLDS 256

Query: 241 YSSLVSRGFFN-QPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKE 300
           +SS + +   + Q I +    ++   Q  LL +  + V+  ++  ++D+           
Sbjct: 257 HSSYLPQTVISKQSIGKRSAPRKSKKQAGLLPECNNKVAKVNQRSERDQ----------- 316

Query: 301 IYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQNE 360
            ++SKNL+TER RR++IR  L+ LR+LVP I+KMD+AS + DAINYI ELEE  K LQ+E
Sbjct: 317 -FRSKNLVTERNRRDRIRGGLFTLRSLVPKITKMDKASTLGDAINYIVELEEEAKKLQDE 376

Query: 361 LVQLEHKDCQKN-RHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINE 420
           L + + ++C+ +   L    S+  +D +       N+Q     GE K +E+++E+ QI +
Sbjct: 377 LKETKAEECKSSDAELLTLKSKALQDGSKNMQPPDNNQDFSGFGENKKIELQLEINQIGK 436

Query: 421 RDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANENDIQPKRL 480
           R+FLI++  ++ QGG    ++A+ SLGLQVVD N+TTF G VLNI  VEA+E +IQPK+L
Sbjct: 437 REFLIRVLYEKKQGGFGRLMDAIHSLGLQVVDANMTTFNGKVLNILRVEADEKEIQPKKL 488

Query: 481 RDSLIKL 485
           ++SL+KL
Sbjct: 497 KESLLKL 488

BLAST of Cp4.1LG01g15300 vs. TrEMBL
Match: U5G6A6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0008s19390g PE=4 SV=1)

HSP 1 Score: 348.2 bits (892), Expect = 1.6e-92
Identity = 219/495 (44.24%), Positives = 308/495 (62.22%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGG---KREAG 60
           MRG + A+E LR LV++  WDYC+VW+  DD SRFI+WVGCCC GG    GG   +R+ G
Sbjct: 1   MRGLDRAMERLRPLVDSNAWDYCVVWKLGDDPSRFIEWVGCCCGGGG--GGGYNVERDRG 60

Query: 61  ETIP---ATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVS 120
           E        LCKD  F+H  RT AC+AL++FPS++ L SG+HG+V+IS +P WL  A V+
Sbjct: 61  EDNQFGRGPLCKDVYFKHPVRTKACEALSRFPSSMPLYSGIHGEVVISAEPRWLCHATVT 120

Query: 121 STSSFS-HELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTE-SE 180
           +  S +  E+ GT+VLIP+  G+VELFA KHM +D ++I+ + AHC++ ++QE  TE   
Sbjct: 121 THDSNTLREVAGTQVLIPVIGGLVELFAAKHMKKDEKMIESIRAHCHVPVKQEAVTELGY 180

Query: 181 LDGDLNEKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSCEGSSSGSKPSN 240
            +   N+ RLDS  +    + P    LL    + + L  +SQ  +  S EGSSSGS PSN
Sbjct: 181 SNSSFNDHRLDSLLE---ENLPHSCHLLSLIPRTQFLLPLSQPRNSISFEGSSSGSNPSN 240

Query: 241 EHHYFDSYSSLVSRGFFNQPIHRSFE-SKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTG 300
           E   F S +S +       P H   E S   +  ++ + +QR   +D +K + K      
Sbjct: 241 EAPSFVSNASQL-------PQHGHLELSVGKSNHDEKILKQRAGSADCNKKVPKVMR--- 300

Query: 301 GGRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEEN 360
             R  ++ YKSKNL+TER RR +I+  L+ALRALVP ISKMD+A+I+ DAI+Y+ EL + 
Sbjct: 301 --RSERDDYKSKNLVTERNRRTRIKTGLFALRALVPKISKMDKAAILGDAIDYVGELLKE 360

Query: 361 VKNLQNELVQLEHKDCQ-KNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEV 420
           VKNLQ+E+   E ++ +  N  LK S  E  ++D +    +  D   F+  E+K  EV++
Sbjct: 361 VKNLQDEIKNAEEEERRASNIELKTSKLEIFQEDHVSSSKINQDSSGFV--EKKGAEVQL 420

Query: 421 EVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANEN 480
           EV QI++R FL+K  C+Q QGG    +E + SLGLQ++D NITTF G VLNI  VEA + 
Sbjct: 421 EVDQISKRQFLLKFLCEQRQGGFGRLMETIHSLGLQILDANITTFNGNVLNILKVEA-DK 475

Query: 481 DIQPKRLRDSLIKLT 486
           DI PK L+ SLI+LT
Sbjct: 481 DIHPKTLKKSLIELT 475

BLAST of Cp4.1LG01g15300 vs. TrEMBL
Match: B9SHA3_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0528090 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 4.0e-91
Identity = 201/481 (41.79%), Positives = 297/481 (61.75%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGGK--REAGE 60
           M+G E ALE LR  V++K WDY +VW+  DD SR+I+W+GCCCSGG    GGK   E GE
Sbjct: 6   MKGLERALELLRPFVDSKAWDYSVVWKLGDDPSRYIEWMGCCCSGG----GGKVKMERGE 65

Query: 61  T-IPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTS 120
                +LC+D  F+H   T AC+ALA +PS++ L SG+HG+++ S Q  W+T A  SS S
Sbjct: 66  DKYSVSLCRDVYFKHPISTKACEALAGYPSSMPLYSGIHGEMVTSTQSKWITHANASSDS 125

Query: 121 SFSHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELDGDL 180
           +      GTRVLIP+  G++ELFA +H+ +D+++ID+V AH N+  ++   +        
Sbjct: 126 NSYPVPIGTRVLIPVFGGLIELFAARHIAKDQKIIDYVTAHFNVLKQEAMISHGY--PSF 185

Query: 181 NEKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSCEGSSSGSKPSNEHHYF 240
           +E  +D+  +    +   P  LL    +  ++  + Q N+  S EGSSSGS PSNEH  F
Sbjct: 186 SECCIDTFREQNFQNLTSPSHLLGLIPRTHVIYPLYQPNTHSSLEGSSSGSNPSNEHPPF 245

Query: 241 DSYSS-LVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSD-----YSKFLQKDEAKTG 300
           DS+S  L+  G   Q I +S   ++    E+L++Q+  +  D      SK +QK E    
Sbjct: 246 DSHSGYLLENGLLKQTIEKSSGPRKSKNDENLMKQKAGLFLDRNKKKISKAIQKSE---- 305

Query: 301 GGRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEEN 360
                ++ + SKNL+TER RRN+I+D LY LRALVP I+KMD ASI+ DAI YI EL++ 
Sbjct: 306 -----RDNFPSKNLVTERNRRNRIKDGLYTLRALVPKITKMDIASILGDAIEYIGELQKE 365

Query: 361 VKNLQNELVQLEHKDCQK-NRHLKISPSEKNKDDTIYWPL-VQNDQPMFILGEEKPMEVE 420
            K L++EL  +E ++C+K N  L +   + ++      P+ + N++     GE++ +EV+
Sbjct: 366 KKKLEDELEGIEEEECEKSNAQLPLKLEQLHEGRKPLPPVEIDNNEDSSGFGEKEKIEVQ 425

Query: 421 VEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANE 471
           +EV QI +R+FLIKLFC++ +GG    ++A+ SLGLQVVD N+TTF G VLNI  VE  +
Sbjct: 426 IEVNQIGKREFLIKLFCEKKRGGFGRLMDAIYSLGLQVVDANMTTFNGKVLNILKVEVQQ 471

BLAST of Cp4.1LG01g15300 vs. TAIR10
Match: AT2G16910.1 (AT2G16910.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 246.5 bits (628), Expect = 3.4e-65
Identity = 168/516 (32.56%), Positives = 261/516 (50.58%), Query Frame = 1

Query: 5   EGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGGKREAGETIPATL 64
           +  LE LR LV  + WDYC++WR  +D  RF+ W+GCCC G            E      
Sbjct: 6   QNLLEKLRPLVGARAWDYCVLWRLNEDQ-RFVKWMGCCCGGTELI---AENGTEEFSYGG 65

Query: 65  CKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSFSHELT 124
           C+D  F H  RT +C+ L+  P++I L+SG++ + L++NQ  WL+    SS  SF  E  
Sbjct: 66  CRDVMFHH-PRTKSCEFLSHLPASIPLDSGIYAETLLTNQTGWLSE---SSEPSFMQETI 125

Query: 125 GTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELDGDLNEKRLDS 184
            TRVLIPI  G+VELFAT+H+  D+ V+DFVM HCN+ M+        +  ++  K    
Sbjct: 126 CTRVLIPIPGGLVELFATRHVAEDQNVVDFVMGHCNMLMDDSVTINMMVADEVESKPYGM 185

Query: 185 CTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSCEGSSSGSKPSNEHHYFDSYSSLV 244
            +          + +++  S  +I     + N  P      +        ++  +   L 
Sbjct: 186 LSGDIQQKGSKEEDMMNLPSSYDISADQIRLNFLPQMSDYETQHLKMKSDYHHQALGYLP 245

Query: 245 SRG----FFNQPIHRSFESKRPTPQE-DLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKEI 304
             G        P +   E   P   E  LL  ++ VV+D  K + ++     G     +I
Sbjct: 246 ENGNKEMMGMNPFNTVEEDGIPVIGEPSLLVNEQQVVND--KDMNENGRVDSGSDCSDQI 305

Query: 305 -------YK--------SKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINY 364
                  YK        +KNL+ ERRRR K+ DRLYALR+LVP I+K+DRASI+ DAINY
Sbjct: 306 DDEDDPKYKKKSGKGSQAKNLMAERRRRKKLNDRLYALRSLVPRITKLDRASILGDAINY 365

Query: 365 IRELEENVKNLQNELVQLEHKDCQKNR--------HLKISPSEKNKDDTIYWPLVQNDQP 424
           ++EL+   K LQ+EL +    +   NR           ++            P V+ D  
Sbjct: 366 VKELQNEAKELQDELEENSETEDGSNRPQGGMSLNGTVVTGFHPGLSCNSNVPSVKQDVD 425

Query: 425 MFILGEE-KPMEVEVEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTF 484
           +    ++ + ME +V+V Q++ R+F +K+ C+   GG    +EA+DSLGL+V + N T +
Sbjct: 426 LENSNDKGQEMEPQVDVAQLDGREFFVKVICEYKPGGFTRLMEALDSLGLEVTNANTTRY 485

Query: 485 GGMVLNIFHVEANEND-IQPKRLRDSLIKLTSREMR 491
             +V N+F VE N+N+ +Q + +R+SL+++T    R
Sbjct: 486 LSLVSNVFKVEKNDNEMVQAEHVRNSLLEITRNTSR 511
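
The AMS alignment is reported with the same bit score (246.5) against Swiss-Prot and TAIR10 but with different Expect values. Under the usual Karlin-Altschul relation, E = m·n·2^(-S'), where S' is the bit score and m·n the effective search space, equal bit scores mean the ratio of the two E-values equals the ratio of the effective search spaces. A minimal sketch under that assumption follows; reading the ratio as a difference in database size is an inference, not something this page states, and `space_ratio` is a hypothetical helper.

```python
def search_space(evalue: float, bit_score: float) -> float:
    """Effective search space m*n implied by E = m*n * 2**(-bit_score)."""
    return evalue * 2.0 ** bit_score


def space_ratio(evalue_a: float, evalue_b: float, bit_score: float) -> float:
    """Ratio of effective search spaces for one alignment scored in two databases."""
    return search_space(evalue_a, bit_score) / search_space(evalue_b, bit_score)


# (bit score, Swiss-Prot E-value, TAIR10 E-value) for two hits in this record.
hits = [(246.5, 6.0e-64, 3.4e-65), (220.3, 4.6e-56, 2.6e-57)]
for bits, e_swissprot, e_tair in hits:
    print(bits, round(space_ratio(e_swissprot, e_tair, bits), 1))
```

Both hits give a ratio between 17 and 18, which is consistent with a single constant factor separating the two searches.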

BLAST of Cp4.1LG01g15300 vs. TAIR10
Match: AT1G10610.1 (AT1G10610.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 220.3 bits (560), Expect = 2.6e-57
Identity = 163/496 (32.86%), Positives = 245/496 (49.40%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGGKRE----- 60
           MRG E   E LR  V+++ WD C++W+  DD SRFI+WVGCCCSG   D   K E     
Sbjct: 4   MRGGERVKEFLRPFVDSRTWDLCVIWKLGDDPSRFIEWVGCCCSGCYIDKNIKLENSEEG 63

Query: 61  -AGETIPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVS 120
             G    A+ C+D   +H  RT AC+AL++FP  + L  G+HG+V++S  P WL +    
Sbjct: 64  GTGRKKKASFCRDDHNKHRIRTLACEALSRFPLFMPLYPGIHGEVVMSKSPKWLVN---- 123

Query: 121 STSSFSHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELD 180
             S    E+  TRVL+P+S G+VELFA    P D  ++  +M+ C    E          
Sbjct: 124 --SGSKMEMFSTRVLVPVSDGLVELFAFDMRPFDESMVHLIMSRCTTFFE---------- 183

Query: 181 GDLNEKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFP---SCEG--SSSGSK 240
                              P P+  L F+    I+P   +S S     S EG  SSS S 
Sbjct: 184 -------------------PFPEQRLQFR----IIPRAEESMSSGVNLSVEGGGSSSVSN 243

Query: 241 PSNEHHYFDSYSSLVSRGFFNQPIHRSFESKRPTPQEDLL-EQQRDVVSDYSKFLQKDEA 300
           PS+E              F N P     E  R      L+  +++DVV   +     +++
Sbjct: 244 PSSETQNL----------FGNYPNASCVEILREEQTPCLIMNKEKDVVVQNA-----NDS 303

Query: 301 KTGGGRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIREL 360
           K        E +KSKNL +ER+RR +I   +Y LRA+VP I+K+++  I  DA++YI EL
Sbjct: 304 KANKKLLPTENFKSKNLHSERKRRERINQAMYGLRAVVPKITKLNKIGIFSDAVDYINEL 363

Query: 361 EENVKNLQNELVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEV 420
               + L++EL  +   +C+     +I+  E++         V +     +    K  EV
Sbjct: 364 LVEKQKLEDELKGINEMECK-----EIAAEEQSAIADPEAERVSSKSNKRV----KKNEV 423

Query: 421 EVEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEAN 480
           ++EV +  ERDFLI++  +  Q G    IEA+D   L+++DVN T     V+ + +V+AN
Sbjct: 424 KIEVHETGERDFLIRVVQEHKQDGFKRLIEAVDLCELEIIDVNFTRLDLTVMTVLNVKAN 436

Query: 481 ENDIQPKRLRDSLIKL 485
           ++ I    LRD L+K+
Sbjct: 484 KDGIACGILRDLLLKM 436

BLAST of Cp4.1LG01g15300 vs. TAIR10
Match: AT1G12860.1 (AT1G12860.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 102.1 bits (253), Expect = 1.0e-21
Identity = 69/229 (30.13%), Positives = 124/229 (54.15%), Query Frame = 1

Query: 256 SFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKEIYKSKNLITERRRRNKI 315
           S E ++ + + ++ +    ++       + D+  T   +  K+   +KNL+ ERRRR K+
Sbjct: 220 SSEMRKSSYEREIDDTSTGIIDISGLNYESDDHNTNNNKGKKKGMPAKNLMAERRRRKKL 279

Query: 316 RDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQNELVQLEHKDCQKNRHLKI 375
            DRLY LR++VP ISKMDRASI+ DAI+Y++EL + + +L  E   LE      +    +
Sbjct: 280 NDRLYMLRSVVPKISKMDRASILGDAIDYLKELLQRINDLHTE---LESTPPSSSSLHPL 339

Query: 376 SPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINERDFLIKLFCKQTQGGVVS 435
           +P+ +    T+ + + +   P   L   K  +  VEV     +   I +FC +  G ++S
Sbjct: 340 TPTPQ----TLSYRVKEELCPSSSLPSPKGQQPRVEVRLREGKAVNIHMFCGRRPGLLLS 399

Query: 436 SIEAMDSLGLQVVDVNITTFGGMVLNIFHVE--ANENDIQPKRLRDSLI 483
           ++ A+D+LGL V    I+ F G  L++F  E    ++D+ P++++  L+
Sbjct: 400 TMRALDNLGLDVQQAVISCFNGFALDVFRAEQCQEDHDVLPEQIKAVLL 441

BLAST of Cp4.1LG01g15300 vs. TAIR10
Match: AT3G26744.1 (AT3G26744.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 100.9 bits (250), Expect = 2.3e-21
Identity = 66/186 (35.48%), Positives = 97/186 (52.15%), Query Frame = 1

Query: 291 GGGRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEE 350
           GGG+  K+   +KNL+ ERRRR K+ DRLY LR++VP ISKMDRASI+ DAI+Y++EL +
Sbjct: 295 GGGKGKKKGMPAKNLMAERRRRKKLNDRLYMLRSVVPKISKMDRASILGDAIDYLKELLQ 354

Query: 351 NVKNLQNELVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFI----------L 410
            + +L NEL                 P       + + PL    Q +            L
Sbjct: 355 RINDLHNELE-------------STPPGSLPPTSSSFHPLTPTPQTLSCRVKEELCPSSL 414

Query: 411 GEEKPMEVEVEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVL 467
              K  +  VEV     R   I +FC +  G ++++++A+D+LGL V    I+ F G  L
Sbjct: 415 PSPKGQQARVEVRLREGRAVNIHMFCGRRPGLLLATMKALDNLGLDVQQAVISCFNGFAL 467

BLAST of Cp4.1LG01g15300 vs. TAIR10
Match: AT5G57150.4 (AT5G57150.4 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 92.4 bits (228), Expect = 8.2e-19
Identity = 59/172 (34.30%), Positives = 95/172 (55.23%), Query Frame = 1

Query: 302 SKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQNELVQ 361
           SKN+++ER RR K+  RL+ALR++VPNI+KMD+ASII DAI+YI  L+   K L+ E+ +
Sbjct: 54  SKNIVSERNRRQKLNQRLFALRSVVPNITKMDKASIIKDAISYIEGLQYEEKKLEAEIRE 113

Query: 362 LEHKDCQKNRHLKISPS-EKNKDDTIYWPLVQNDQPMFILGEEKPM--EVEVEVMQINER 421
           LE          K S S  K+ D  +  P+          G    +   +E++V  + ER
Sbjct: 114 LESTP-------KSSLSFSKDFDRDLLVPVTSKKMKQLDSGSSTSLIEVLELKVTFMGER 173

Query: 422 DFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANEN 471
             ++ + C +    +V   E  +SL L+++  N+T+F GM+ +   +E   N
Sbjct: 174 TMVVSVTCNKRTDTMVKLCEVFESLNLKILTSNLTSFSGMIFHTVFIELRPN 218

BLAST of Cp4.1LG01g15300 vs. NCBI nr
Match: gi|659090790|ref|XP_008446203.1| (PREDICTED: transcription factor ABORTED MICROSPORES isoform X1 [Cucumis melo])

HSP 1 Score: 759.2 bits (1959), Expect = 4.4e-216
Identity = 396/489 (80.98%), Positives = 423/489 (86.50%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSS-RFIDWVGCCCSGGSFDDGGKREAGET 60
           MR FEGALE LR LVE K+WDYCIVW+SRDD S RFIDWVGCCCSGG  D GGK EAGET
Sbjct: 1   MRSFEGALEFLRPLVEIKLWDYCIVWKSRDDDSLRFIDWVGCCCSGGVSDAGGKEEAGET 60

Query: 61  IPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSF 120
           IPA LCKDTRFRHFRRTNACQALAQFPS+ISLN+GVHGDVLISNQPMWLTS E S  SSF
Sbjct: 61  IPAALCKDTRFRHFRRTNACQALAQFPSSISLNTGVHGDVLISNQPMWLTSGEASYFSSF 120

Query: 121 SHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESEL-DGDLN 180
           SHELTGTRVLIP+S GIVELFATK MPR+ EVIDFVMAHCNIS+EQEF+TES L D  LN
Sbjct: 121 SHELTGTRVLIPVSGGIVELFATKRMPREGEVIDFVMAHCNISLEQEFETESALLDAGLN 180

Query: 181 EKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSC-EGSSSGSKPSNEHHYF 240
           EK L S TKYYS++WPDPQP L FKSKLEILPSVSQS+SFP C EGSSSGSKP       
Sbjct: 181 EKILSSSTKYYSLNWPDPQPFLGFKSKLEILPSVSQSSSFPGCGEGSSSGSKP------- 240

Query: 241 DSYSSLVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGK 300
                  S G FNQPIH SFESK  T +E+LLEQQ++VVSD+SK LQKDEAKT G +Q K
Sbjct: 241 -------SPGLFNQPIHTSFESKAATHREELLEQQKNVVSDHSKILQKDEAKT-GEKQEK 300

Query: 301 EIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQN 360
           E+YKSKNL+TERRRRNKIRDRLY LRALVPNISKMDRASIIVDAI YIRELEENVK+LQN
Sbjct: 301 EVYKSKNLMTERRRRNKIRDRLYTLRALVPNISKMDRASIIVDAIGYIRELEENVKSLQN 360

Query: 361 ELVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINE 420
           EL+QLEHKDCQKN+HLKISP EK  DD   W  VQ+DQPMFIL EEKPMEVEVEVM+INE
Sbjct: 361 ELIQLEHKDCQKNKHLKISPLEKTNDDINSWSFVQDDQPMFILNEEKPMEVEVEVMRINE 420

Query: 421 RDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANENDIQPKRL 480
           RDFLIKLFCK+ QGGVVSSIEAM SLGLQV+DVNITTFGGMVLNIFHVEANENDIQPKRL
Sbjct: 421 RDFLIKLFCKRKQGGVVSSIEAMYSLGLQVIDVNITTFGGMVLNIFHVEANENDIQPKRL 474

Query: 481 RDSLIKLTS 487
           RDSL+KLTS
Sbjct: 481 RDSLMKLTS 474

BLAST of Cp4.1LG01g15300 vs. NCBI nr
Match: gi|449434929|ref|XP_004135248.1| (PREDICTED: transcription factor ABORTED MICROSPORES isoform X1 [Cucumis sativus])

HSP 1 Score: 751.9 bits (1940), Expect = 7.0e-214
Identity = 392/488 (80.33%), Positives = 418/488 (85.66%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSS-RFIDWVGCCCSGGSFDDGGKREAGET 60
           MR FE ALE LR LVE K+WDYCIVW+SRDD S RFIDWVGCCCSGG    GGK EAGET
Sbjct: 1   MRSFEEALEFLRPLVEIKLWDYCIVWKSRDDDSLRFIDWVGCCCSGGVSGAGGKEEAGET 60

Query: 61  IPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSF 120
           IPA LCKDTRFRHFRRTNACQALAQFPS+ISLN+GVHGDV ISNQPMWLTS EVS  SSF
Sbjct: 61  IPAALCKDTRFRHFRRTNACQALAQFPSSISLNTGVHGDVSISNQPMWLTSGEVSYFSSF 120

Query: 121 SHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELDGDLNE 180
           SHELTGTRVLIP+S GIVELFATK MPR+ EVIDFVMAHCN S+ QEF+TES L+  LNE
Sbjct: 121 SHELTGTRVLIPVSGGIVELFATKRMPREGEVIDFVMAHCNFSLGQEFETESALNAGLNE 180

Query: 181 KRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSC-EGSSSGSKPSNEHHYFD 240
           K L+S TKYYS++WPDPQ +L FKSKLE LPSVSQS+SFP C EGSSSGSKP        
Sbjct: 181 KILNSSTKYYSLNWPDPQAILGFKSKLETLPSVSQSSSFPGCGEGSSSGSKP-------- 240

Query: 241 SYSSLVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKE 300
                 S G FNQPI  SFESK    QEDLLEQQR+VV D+SK LQKDEAKT G +Q KE
Sbjct: 241 ------SPGLFNQPIRTSFESKAGMRQEDLLEQQRNVVLDHSKILQKDEAKT-GEKQEKE 300

Query: 301 IYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQNE 360
           +YKSKNL+TERRRRNKIRDRLY LRALVPNISKMDRASIIVDAI YIRELEENVK+LQNE
Sbjct: 301 VYKSKNLMTERRRRNKIRDRLYTLRALVPNISKMDRASIIVDAIGYIRELEENVKSLQNE 360

Query: 361 LVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINER 420
           L+QLEHKDCQKN+HLK+SP EK  DD   WP VQ+DQPMFIL EEKPMEVEVEVMQINER
Sbjct: 361 LIQLEHKDCQKNKHLKVSPLEKTNDDINSWPFVQDDQPMFILDEEKPMEVEVEVMQINER 420

Query: 421 DFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANENDIQPKRLR 480
           DFLIKLFCK+ QGGVVSSIEAMDSLGLQV+DVNITTFGGMVLNIFHVEANENDIQPKRLR
Sbjct: 421 DFLIKLFCKRKQGGVVSSIEAMDSLGLQVIDVNITTFGGMVLNIFHVEANENDIQPKRLR 473

Query: 481 DSLIKLTS 487
           DSLIKLTS
Sbjct: 481 DSLIKLTS 473

BLAST of Cp4.1LG01g15300 vs. NCBI nr
Match: gi|659090794|ref|XP_008446205.1| (PREDICTED: transcription factor ABORTED MICROSPORES isoform X2 [Cucumis melo])

HSP 1 Score: 705.3 bits (1819), Expect = 7.6e-200
Identity = 368/460 (80.00%), Positives = 394/460 (85.65%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSS-RFIDWVGCCCSGGSFDDGGKREAGET 60
           MR FEGALE LR LVE K+WDYCIVW+SRDD S RFIDWVGCCCSGG  D GGK EAGET
Sbjct: 1   MRSFEGALEFLRPLVEIKLWDYCIVWKSRDDDSLRFIDWVGCCCSGGVSDAGGKEEAGET 60

Query: 61  IPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSF 120
           IPA LCKDTRFRHFRRTNACQALAQFPS+ISLN+GVHGDVLISNQPMWLTS E S  SSF
Sbjct: 61  IPAALCKDTRFRHFRRTNACQALAQFPSSISLNTGVHGDVLISNQPMWLTSGEASYFSSF 120

Query: 121 SHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESEL-DGDLN 180
           SHELTGTRVLIP+S GIVELFATK MPR+ EVIDFVMAHCNIS+EQEF+TES L D  LN
Sbjct: 121 SHELTGTRVLIPVSGGIVELFATKRMPREGEVIDFVMAHCNISLEQEFETESALLDAGLN 180

Query: 181 EKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSC-EGSSSGSKPSNEHHYF 240
           EK L S TKYYS++WPDPQP L FKSKLEILPSVSQS+SFP C EGSSSGSKP       
Sbjct: 181 EKILSSSTKYYSLNWPDPQPFLGFKSKLEILPSVSQSSSFPGCGEGSSSGSKP------- 240

Query: 241 DSYSSLVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGK 300
                  S G FNQPIH SFESK  T +E+LLEQQ++VVSD+SK LQKDEAKT G +Q K
Sbjct: 241 -------SPGLFNQPIHTSFESKAATHREELLEQQKNVVSDHSKILQKDEAKT-GEKQEK 300

Query: 301 EIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQN 360
           E+YKSKNL+TERRRRNKIRDRLY LRALVPNISKMDRASIIVDAI YIRELEENVK+LQN
Sbjct: 301 EVYKSKNLMTERRRRNKIRDRLYTLRALVPNISKMDRASIIVDAIGYIRELEENVKSLQN 360

Query: 361 ELVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINE 420
           EL+QLEHKDCQKN+HLKISP EK  DD   W  VQ+DQPMFIL EEKPMEVEVEVM+INE
Sbjct: 361 ELIQLEHKDCQKNKHLKISPLEKTNDDINSWSFVQDDQPMFILNEEKPMEVEVEVMRINE 420

Query: 421 RDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGG 458
           RDFLIKLFCK+ QGGVVSSIEAM SLGLQV+DVNITTFGG
Sbjct: 421 RDFLIKLFCKRKQGGVVSSIEAMYSLGLQVIDVNITTFGG 445

BLAST of Cp4.1LG01g15300 vs. NCBI nr
Match: gi|778705040|ref|XP_011655624.1| (PREDICTED: transcription factor ABORTED MICROSPORES isoform X2 [Cucumis sativus])

HSP 1 Score: 696.8 bits (1797), Expect = 2.7e-197
Identity = 363/459 (79.08%), Positives = 389/459 (84.75%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSS-RFIDWVGCCCSGGSFDDGGKREAGET 60
           MR FE ALE LR LVE K+WDYCIVW+SRDD S RFIDWVGCCCSGG    GGK EAGET
Sbjct: 1   MRSFEEALEFLRPLVEIKLWDYCIVWKSRDDDSLRFIDWVGCCCSGGVSGAGGKEEAGET 60

Query: 61  IPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVSSTSSF 120
           IPA LCKDTRFRHFRRTNACQALAQFPS+ISLN+GVHGDV ISNQPMWLTS EVS  SSF
Sbjct: 61  IPAALCKDTRFRHFRRTNACQALAQFPSSISLNTGVHGDVSISNQPMWLTSGEVSYFSSF 120

Query: 121 SHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEFDTESELDGDLNE 180
           SHELTGTRVLIP+S GIVELFATK MPR+ EVIDFVMAHCN S+ QEF+TES L+  LNE
Sbjct: 121 SHELTGTRVLIPVSGGIVELFATKRMPREGEVIDFVMAHCNFSLGQEFETESALNAGLNE 180

Query: 181 KRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSC-EGSSSGSKPSNEHHYFD 240
           K L+S TKYYS++WPDPQ +L FKSKLE LPSVSQS+SFP C EGSSSGSKP        
Sbjct: 181 KILNSSTKYYSLNWPDPQAILGFKSKLETLPSVSQSSSFPGCGEGSSSGSKP-------- 240

Query: 241 SYSSLVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGGGRQGKE 300
                 S G FNQPI  SFESK    QEDLLEQQR+VV D+SK LQKDEAKT G +Q KE
Sbjct: 241 ------SPGLFNQPIRTSFESKAGMRQEDLLEQQRNVVLDHSKILQKDEAKT-GEKQEKE 300

Query: 301 IYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENVKNLQNE 360
           +YKSKNL+TERRRRNKIRDRLY LRALVPNISKMDRASIIVDAI YIRELEENVK+LQNE
Sbjct: 301 VYKSKNLMTERRRRNKIRDRLYTLRALVPNISKMDRASIIVDAIGYIRELEENVKSLQNE 360

Query: 361 LVQLEHKDCQKNRHLKISPSEKNKDDTIYWPLVQNDQPMFILGEEKPMEVEVEVMQINER 420
           L+QLEHKDCQKN+HLK+SP EK  DD   WP VQ+DQPMFIL EEKPMEVEVEVMQINER
Sbjct: 361 LIQLEHKDCQKNKHLKVSPLEKTNDDINSWPFVQDDQPMFILDEEKPMEVEVEVMQINER 420

Query: 421 DFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGG 458
           DFLIKLFCK+ QGGVVSSIEAMDSLGLQV+DVNITTFGG
Sbjct: 421 DFLIKLFCKRKQGGVVSSIEAMDSLGLQVIDVNITTFGG 444

BLAST of Cp4.1LG01g15300 vs. NCBI nr
Match: gi|568861660|ref|XP_006484318.1| (PREDICTED: transcription factor bHLH90 [Citrus sinensis])

HSP 1 Score: 378.3 bits (970), Expect = 2.1e-101
Identity = 222/496 (44.76%), Positives = 309/496 (62.30%), Query Frame = 1

Query: 1   MRGFEGALECLRALVETKVWDYCIVWRSRDDSSRFIDWVGCCCSGGSFDDGG------KR 60
           MR  E A+E LR  V++K WDYC+VW+  DD SRFI+W+GCCCSGG    GG      K 
Sbjct: 1   MRDLEKAVEWLRPFVDSKAWDYCVVWKLGDDPSRFIEWLGCCCSGGV--GGGFEYVKVKE 60

Query: 61  EAGETIPATLCKDTRFRHFRRTNACQALAQFPSTISLNSGVHGDVLISNQPMWLTSAEVS 120
           E+GE    + C+D   +H  RT AC+ALAQ PS + L SG+HG+V+I+NQP W++ A  S
Sbjct: 61  ESGEEQKFSFCRDAHLKHSARTKACEALAQLPSFMDLYSGIHGEVVITNQPKWISLAN-S 120

Query: 121 STSSFSHELTGTRVLIPISAGIVELFATKHMPRDREVIDFVMAHCNISMEQEF-DTESEL 180
           S S  SH+   TRVLIP+  G++ELFA KH+ +D+ +I+ V+AHCN S+EQ      S  
Sbjct: 121 SDSIASHQSNSTRVLIPVFGGLIELFAAKHISKDQNIIELVLAHCNTSIEQRVVPAGSSY 180

Query: 181 DGDLNEKRLDSCTKYYSVDWPDPQPLLDFKSKLEILPSVSQSNSFPSCEGSSSGSKPSNE 240
           D  L+EK LD   K    ++P P  LL F    ++L + ++ N+ P  EGSS GS PS E
Sbjct: 181 DVGLDEKCLDILLKENLQNFPSPLQLLTFVPGTQVLSAATRFNTHPYNEGSSRGSNPSIE 240

Query: 241 HHYFDS-YSSLVSRGFFNQPIHRSFESKRPTPQEDLLEQQRDVVSDYSKFLQKDEAKTGG 300
           H  FDS Y  +       QPI  SF +KRP  +  + +++                    
Sbjct: 241 HPSFDSNYGYIAQNAPLMQPIGNSF-AKRPKCKSHVFKEELG------------------ 300

Query: 301 GRQGKEIYKSKNLITERRRRNKIRDRLYALRALVPNISKMDRASIIVDAINYIRELEENV 360
             +   + ++KNLITER RRNK++D L+ALRALVP ISKMDRA+I+ DA  YI+EL + V
Sbjct: 301 --ERHRLGRAKNLITERNRRNKLKDGLFALRALVPKISKMDRAAILGDAAEYIKELLQEV 360

Query: 361 KNLQNELVQLEHKDCQK-NRHLKISPSEKNKD--DTIYWPLVQNDQPMFILGEEKPMEVE 420
             LQ+EL   E++DC+K N  +K    ++  +   T Y P  ++++     GE+   EV 
Sbjct: 361 DKLQDELK--ENEDCEKDNEEMKSFKLDEIHEGTSTTYLPASEHNKSFPACGEKGKSEVR 420

Query: 421 VEVMQINERDFLIKLFCKQTQGGVVSSIEAMDSLGLQVVDVNITTFGGMVLNIFHVEANE 480
           VEV QIN+RDFLIKL C+  +GG V  +EA++SL LQV+D N+TTF G VLNI  V+A++
Sbjct: 421 VEVNQINDRDFLIKLLCEHERGGFVRLMEAINSLELQVIDANVTTFNGKVLNILRVQAHK 470

Query: 481 NDIQPKRLRDSLIKLT 486
            +I+ K+LR++LI+LT
Sbjct: 481 ENIRLKKLRETLIELT 470

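The percentages in the HSP summary lines above follow from the reported counts: identical (or positive-scoring) positions divided by the alignment length. The sketch below is one way to parse such a line and recompute the percentages; it is an illustration only, and the function names `parse_hsp_summary` and `recomputed_pct` are hypothetical, not part of any BLAST tool or of this database.

```python
import re

# Matches lines such as:
#   Identity = 163/496 (32.86%), Positives = 245/496 (49.40%), Query Frame = 1
# "Posi?tives" also tolerates a missing "i" in the label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return the counts and reported percentages from one HSP summary line.

    Returns None when the line does not look like an HSP summary.
    """
    match = HSP_RE.search(line)
    if match is None:
        return None
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    hsp = parse_hsp_summary(
        "Identity = 163/496 (32.86%), Positives = 245/496 (49.40%), Query Frame = 1"
    )
    print(hsp["identity_pct"], recomputed_pct(hsp["identities"], hsp["length"]))
    print(hsp["positive_pct"], recomputed_pct(hsp["positives"], hsp["length"]))
```

Comparing the reported and recomputed values is a quick consistency check when these records are copied or re-exported.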
The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
AMS_ARATH    6.0e-64  32.56         Transcription factor ABORTED MICROSPORES OS=Arabidopsis thaliana GN=AMS PE=1 SV=...
BH090_ARATH  4.6e-56  32.86         Transcription factor bHLH90 OS=Arabidopsis thaliana GN=BHLH90 PE=2 SV=1
TDR_ORYSJ    8.0e-24  35.34         Transcription factor TDR OS=Oryza sativa subsp. japonica GN=TDR PE=1 SV=1
SCRM2_ARATH  1.8e-20  30.13         Transcription factor SCREAM2 OS=Arabidopsis thaliana GN=SCRM2 PE=1 SV=1
ICE1_ARATH   4.1e-20  35.48         Transcription factor ICE1 OS=Arabidopsis thaliana GN=SCRM PE=1 SV=1

Match Name        E-value   Identity (%)  Description
E5GBJ2_CUCME      3.1e-216  80.98         BHLH transcription factor OS=Cucumis melo subsp. melo PE=4 SV=1
A0A0A0KQ91_CUCSA  4.9e-214  80.33         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G601530 PE=4 SV=1
A0A067JRR7_JATCU  2.3e-94   42.51         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17750 PE=4 SV=1
U5G6A6_POPTR      1.6e-92   44.24         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0008s19390g PE=4 SV=1
B9SHA3_RICCO      4.0e-91   41.79         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0528090 PE=4 SV=1

Match Name   E-value  Identity (%)  Description
AT2G16910.1  3.4e-65  32.56         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G10610.1  2.6e-57  32.86         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G12860.1  1.0e-21  30.13         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G26744.1  2.3e-21  35.48         basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT5G57150.4  8.2e-19  34.30         basic helix-loop-helix (bHLH) DNA-binding superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|659090790|ref|XP_008446203.1|  4.4e-216  80.98         PREDICTED: transcription factor ABORTED MICROSPORES isoform X1 [Cucumis melo]
gi|449434929|ref|XP_004135248.1|  7.0e-214  80.33         PREDICTED: transcription factor ABORTED MICROSPORES isoform X1 [Cucumis sativus]
gi|659090794|ref|XP_008446205.1|  7.6e-200  80.00         PREDICTED: transcription factor ABORTED MICROSPORES isoform X2 [Cucumis melo]
gi|778705040|ref|XP_011655624.1|  2.7e-197  79.08         PREDICTED: transcription factor ABORTED MICROSPORES isoform X2 [Cucumis sativus]
gi|568861660|ref|XP_006484318.1|  2.1e-101  44.76         PREDICTED: transcription factor bHLH90 [Citrus sinensis]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0046983  protein dimerization activity

Vocabulary: INTERPRO
Term       Definition
IPR025610  MYC/MYB_N
IPR011598  bHLH_dom

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0046983      protein dimerization activity
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g15300.1  Cp4.1LG01g15300.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                 Source   Source Term         Source Description                        Alignment
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  GENE3D   G3DSA:4.10.280.10   -                                         coord: 301..365, score: 9.6
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PFAM     PF00010             HLH                                       coord: 303..349, score: 5.
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  SMART    SM00353             finulus                                   coord: 305..354, score: 2.4
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  PROFILE  PS50888             BHLH                                      coord: 299..348, score: 15
IPR011598  Myc-type, basic helix-loop-helix (bHLH) domain  unknown  SSF47459            HLH, helix-loop-helix DNA-binding domain  coord: 298..364, score: 1.15
IPR025610  Transcription factor MYC/MYB N-terminal         PFAM     PF14215             bHLH-MYC_N                                coord: 10..156, score: 2.2
None       No IPR available                                unknown  Coil                Coil                                      coord: 338..365, scor
None       No IPR available                                GENE3D   G3DSA:3.30.70.260   -                                         coord: 437..485, score: 5.
None       No IPR available                                PANTHER  PTHR31945           FAMILY NOT NAMED                          coord: 77..490, score: 9.6E-102; coord: 26..44, score: 9.6E
None       No IPR available                                PANTHER  PTHR31945:SF14      TRANSCRIPTION FACTOR BHLH90               coord: 26..44, score: 9.6E-102; coord: 77..490, score: 9.6E

The following gene(s) are paralogous to this gene:

None