Cp4.1LG04g14670 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g14670
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Transcription factor
Location: Cp4.1LG04 : 11738496 .. 11742739 (+)
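The location field packs scaffold, start, end and strand into one string. A minimal sketch of pulling those apart and computing the genomic span follows; the function names are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re

def parse_location(text):
    """Split a 'scaffold : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = match.groups()
    return scaffold, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

scaffold, start, end, strand = parse_location("Cp4.1LG04 : 11738496 .. 11742739 (+)")
print(scaffold, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 4244 bp on the plus strand of Cp4.1LG04.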
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGGAGAGTGGTCCAAACAAGTAAAGAACTGTTTGAGCTATCTCCGTTTCATACTGTGACATCCCTATCAACCCCCAATCTCTCGCATTATATACACATACCCTCACACACAGAGATAGAGATAAAGAAAGAGCTGTCCCTCAATTCCTGCACCTGGGTACCTATGAGCATGAGCCACTGGACAGTCCCGAACTGGAACCCTATGTAAGCACTCAATTATCACTCATTATAACTTTTTTAGTTAGTTGAAGTGAAGGCTTTTCAAACCCACCATCCATAATGTTGTTTTTTTTGGGTGAATTCCTGTGATTCAGGCCCAAGTATGGACGGTCATGTGCGAGAGGGGATGAGCAAGAAGCAGCCATGGATGCCCATGTGGAGGTGCTTTTCCAAACGCCTTTAACTAACCACTCGTGGAGCCAATCTGATGATACCCTCGAGTCCATTGTTCATAGTTCATTGTCAAGGAAACGGACTCGATCCAACCCAGATTGCAAAAATGAAGCTTTGATGAGTTGGGCTTCCATGGAATCTCACGGGAGCTTCAAAACCGATGATTCCGTTGAGGATTTAGCTCAAGATGATGGCTGGGTGATGCACAAGGTCTTTGTTGTTGTTATGAAGTTGTTTTTGTTGGGGTTTTTTTAGATAAACACGACTCTCCACGATAGTATGATACTGTCTACTTTGAGAATAAGCTCTGGTGGCTTTGCTTTGGGCTTTCCAAAAAATGAAGCTTTGATTTGTTGTTTGAATCAGGAGGATGATCATAATATGAAGGGGAAAGCAGATGGGAGCTGCTCAATCGGACGAACTCGAGCGGCTATTAATCACAATCGGTCTGATAGAGTAAGAAACTTTGAACCGAGTTAACGTTTTTGTATCATGCTACTCGAGGGTTGAATTAACTGTTCCCTCATTTGTAGAAACGGAGAGATCGAATAAACCAGAAGATGAAAGATCTGCAAAAGTTGGTGCCAAATGGGAGTAAGGTGAGAAAATGGTTGGAATCATGAATTATTTTGTCATTGTTTTCCGGGGTGCTCGGAAAACTCGTGTAGGGTTTAGTAAGCTTACGTATGTGGAGATTAGAAAATCTATGATCCAAGATAATCGAGCGAATTCTATATAAAACGTATGCTCGGTTTCAAACTCAAGTCACACTAATCTAACTCGTAAAGCAAATATGTCTTTAGGTTCGAATCGTTAATGAGATGTTTAATCGACAATTTCAAAGGAAGAGAGAGAGCTAAAGAGTCGTCTTTCTTTTTACAGATGGATAGAGCTTCACTTTTGGATGACACAATTCAATATTTGAAGCAACTTCAAGCACAAGTTCAGTTCATGTACAGTATCAGATCAATTGTGCCGCAGATGGTCATGCCTCTAGAAATTCAGCAGCAGCAGCAGCAGCAGCTTCAGATGTCACTGCTTGCAGCTCATATGGGACTACTTAACACTGATTCAAAGGCTCCCAGCTCCAGTTCCTTCCCTTGTGCTGCTGCATTCCCTCCTCCTCCGTTTCTACTGTCTTCTATCAACTCAACAACAAAGCCAAAATCTAACCTTTCCACCAGTGCTTTCGTTCCTCCGACCGATCCTTTCTGTACTTTCTTGGCACAAGTAAGTCGAACATCAAACATAAGATGAACTAATTGAGTAATACACACAAATGGGTTCGATTTTGTAGAATGCAAATGGAAAAATAATTGAGATAACTGGTTTTAGTCGACTTTCACGATTCAGTTTCTTAAGGAGTCGTCAGGCCAGACTAAACCGATATCGAGTCATTCTACTTACACTAGCTCCTAACATGAATAAAATGAATGAGGGACCAACTATATATAATAAAAATATAGAAAGAGCTATTCATTGAAGAACAGATTCAAGATGCAAAAGCTTACAGTTTTTTGTATTCAAGTGCTGTTTTTTTGTTGCTGCAGTCGATGGATATGGATTTCTACAGTAAGATGGTGACACTATATTGCCAAGAAGTGAAC
AGGACACCTCGGCAGGCAGGCAAATTTGAAGGAATCAAGGAAGATATGCATTAGAGCTAACCACAAAAAGCTTCAGCAATTGGTTGAATTGAATCCAAGAACAAATAGTCTTAGTTGAATAGCTAAATATGCCTCATTTCAACCAAACGTTCAAGCTTTCTCCAGTGATTTTGGTTGATGGAAATGACATATTCTAATGTTAAGAAGTTCTGCATTTTTTGAATACCATGAATGAGCATATGGCAAGTCCAGAAATAAGTGAACATAATGCGAAAACAAGCAATATTGCTGAGCTGCTGCTCGAGATAGCACTGCCGAAGAATTCGTTCTAACAAATTCAATGGTCACCAGTAGTCTCGGCAAGAACAAACGCCTTCAGAAAAATGGTGGAAACGATTTCGGTCTCTAACGACAATATATCGTTTATAGACAAGTGATACGACCTTGAAAAACTCGTGGCAATCCGAGCAAATTCTCAGATTTTTCACTATCCTTAGGGTGCAATCTAAAGGAGAGTTTAAGAGACCAAATGCCAAAGCAAGTTTCTCACTATGGTACATCACAGAGTGTTCTTTCTCTTCTTTCTCAATATCAAACAAAGACATGTCAGTGCCTGTAACGTAACCATTCTCTTTCAGTTTCTTCATCACATTATCCAACTGCTTATATATTGCATTGAATTCTAGTTTTCTGTCATTTGATGCAGTAAACTCATGAACTGAATTGTTTATTTCAATTGAACTGCAACCAGGGACTTTTTCAATCCCCCTGAGACTCATCATTCCTCGCAACTTTCCTACTTCGATCCACCGCCGTTCCCTCGAGTATAGATTTGATAGCAAGACATAATTCTCCCCATTGTTTGGTTCTAACTCAATAAGTCTCCTGATAGTATATTCACCCAATTTTGTATTCCCATGGACTCGGCAAGCACAAAGCAGAGCCCTCCAAATGATAGGGTCCGGCTCCATGCTCATGGATTCGATCAACTCTAGAGCTTCCTCTAACAATCCTGCTCGACCAAGTAGGTCTACCATACACCCATAATGCTCGATCTTCGGTTGCAGTCCAAACTGTTGTTTCATGCTTATGAATTGTCGACGCCCTTCTGTGACCAGACCTTGGTGACAACATGCACACAAGAGACCTAGAAAGGTAACCGCATCTGGTTTGAAATTTTCCATCAACATCCTAGAAAAAGCCTGCAAAGCAGCATCGCCTTGTCCATTCATGCCATATCCAGAAATCAAGACATTCCATGTATAAACGTTCTTGTCTCTAATTTCTTCAAAGACCTTCTCTGCCTCCTCAACAACCCCACATTTAGCATACATATCAATAAGTGCTGTTCCTACAAACACGTTCAGTCTCAACTTATTCTGATAAATAAAATCATGGATCCACTTTCCCTGATTCAAAGCTCCCAAATGAGAACAAGCAGATAGAACTACCACCACGGTTCTCTCGCTCGGTTCAGCCCCAGCTGCCAGCATCCCTCGGAAAGCATTGATAGCTTCCTTAAATTTCCTATTATGTGTATAACCAGTAATCAGCGCATTCCAAGTAACTGAATTTCTTTCAGGCATTTCGTCGAACAGCTGAGATGCATCGGAAATAGACAGACAAGAACAATACATGTGCACTAGAGCCGTGCTTGTGTAGACATCACGAATGAAACCCATTTGAACAACAGCACCATGTATCATTTTTCCAAGTTTAATATCACATAGCTGTGCAGTCGCTTTAAGAACGGCCGGGAAAGTAGAAGAATCAGGCAGAATGCTAAATTTATGCATGTGAGCAAAAATGATAAGTGAATTCAAATGCTCATTCAAATCCAAATAACCCCTGATCATCGAGTTACAAATCTGAGAATTAATAGAACCACGAAACTTAGAGAAAATAAGAGCAATAGATTCAAACCCATTATTCGAAACAGAGTCCTCAATGAGTTTCATGAGGAAATATCCGTCGCTCCGCATGTCATTGCCTTCCTTGCG
GCGGGCATCCAGACAATCAGGAATCTTTCTCTCAATACCGCCCTGAGACGGAAGATTCATCACCCGGGTATGCACAAATCTCGATATTTGTGATTGGTTCGCTAAAAAATCCAAGTTCGAGATCGTACAAGTATCTTGGAGCCGAGCCCTTCGAATCAAACGAAATGAGAAAGGCAGAAACCTGAGATCCATTTTCATATTCAATTAAAGACAGTCTGGATTAGAGTTTCCAGTTCATAGTTCA

mRNA sequence

ATGGAGAAGATAGAGATAAAGAAAGAGCTGTCCCTCAATTCCTGCACCTGGGTACCTATGAGCATGAGCCACTGGACAGTCCCGAACTGGAACCCTATGCCCAAGTATGGACGGTCATGTGCGAGAGGGGATGAGCAAGAAGCAGCCATGGATGCCCATGTGGAGGTGCTTTTCCAAACGCCTTTAACTAACCACTCGTGGAGCCAATCTGATGATACCCTCGAGTCCATTGTTCATAGTTCATTGTCAAGGAAACGGACTCGATCCAACCCAGATTGCAAAAATGAAGCTTTGATGAGTTGGGCTTCCATGGAATCTCACGGGAGCTTCAAAACCGATGATTCCGTTGAGGATTTAGCTCAAGATGATGGCTGGAAACGGAGAGATCGAATAAACCAGAAGATGAAAGATCTGCAAAAGTTGGTGCCAAATGGGAGTAAGATGGATAGAGCTTCACTTTTGGATGACACAATTCAATATTTGAAGCAACTTCAAGCACAAGTTCAGTTCATGTACAGTATCAGATCAATTGTGCCGCAGATGGTCATGCCTCTAGAAATTCAGCAGCAGCAGCAGCAGCAGCTTCAGATGTCACTGCTTGCAGCTCATATGGGACTACTTAACACTGATTCAAAGGCTCCCAGCTCCAGTTCCTTCCCTTGTGCTGCTGCATTCCCTCCTCCTCCGTTTCTACTGTCTTCTATCAACTCAACAACAAAGCCAAAATCTAACCTTTCCACCAGTGCTTTCGTTCCTCCGACCGATCCTTTCTGTACTTTCTTGGCACAATCGATGGATATGGATTTCTACAGTAAGATGGTGACACTATATTGCCAAGAAGTGAACAGGACACCTCGGCAGGCAGGCAAATTTGAAGGAATCAAGGAAGATATGCATTAGAGCTAACCACAAAAAGCTTCAGCAATTGGTTGAATTGAATCCAAGAACAAATAGTCTTAGTTGAATAGCTAAATATGCCTCATTTCAACCAAACGTTCAAGCTTTCTCCAGTGATTTTGGTTGATGGAAATGACATATTCTAATGTTAAGAAGTTCTGCATTTTTTGAATACCATGAATGAGCATATGGCAAGTCCAGAAATAAGTGAACATAATGCGAAAACAAGCAATATTGCTGAGCTGCTGCTCGAGATAGCACTGCCGAAGAATTCGTTCTAACAAATTCAATGGTCACCAGTAGTCTCGGCAAGAACAAACGCCTTCAGAAAAATGGTGGAAACGATTTCGGTCTCTAACGACAATATATCGTTTATAGACAAGTGATACGACCTTGAAAAACTCGTGGCAATCCGAGCAAATTCTCAGATTTTTCACTATCCTTAGGGTGCAATCTAAAGGAGAGTTTAAGAGACCAAATGCCAAAGCAAGTTTCTCACTATGGTACATCACAGAGTGTTCTTTCTCTTCTTTCTCAATATCAAACAAAGACATGTCAGTGCCTGTAACGTAACCATTCTCTTTCAGTTTCTTCATCACATTATCCAACTGCTTATATATTGCATTGAATTCTAGTTTTCTGTCATTTGATGCAGTAAACTCATGAACTGAATTGTTTATTTCAATTGAACTGCAACCAGGGACTTTTTCAATCCCCCTGAGACTCATCATTCCTCGCAACTTTCCTACTTCGATCCACCGCCGTTCCCTCGAGTATAGATTTGATAGCAAGACATAATTCTCCCCATTGTTTGGTTCTAACTCAATAAGTCTCCTGATAGTATATTCACCCAATTTTGTATTCCCATGGACTCGGCAAGCACAAAGCAGAGCCCTCCAAATGATAGGGTCCGGCTCCATGCTCATGGATTCGATCAACTCTAGAGCTTCCTCTAACAATCCTGCTCGACCAAGTAGGTCTACCATACACCCATAATGCTCGATCTTCGGTTGCAGTCCAAACTGTTGTTTCATGCTTATGAATTGTCGACGCCCTTCTGTGACCAGACCTTGGTGACAACATGCACACAAGAGACCTAGAAA
GGTAACCGCATCTGGTTTGAAATTTTCCATCAACATCCTAGAAAAAGCCTGCAAAGCAGCATCGCCTTGTCCATTCATGCCATATCCAGAAATCAAGACATTCCATGTATAAACGTTCTTGTCTCTAATTTCTTCAAAGACCTTCTCTGCCTCCTCAACAACCCCACATTTAGCATACATATCAATAAGTGCTGTTCCTACAAACACGTTCAGTCTCAACTTATTCTGATAAATAAAATCATGGATCCACTTTCCCTGATTCAAAGCTCCCAAATGAGAACAAGCAGATAGAACTACCACCACGGTTCTCTCGCTCGGTTCAGCCCCAGCTGCCAGCATCCCTCGGAAAGCATTGATAGCTTCCTTAAATTTCCTATTATGTGTATAACCAGTAATCAGCGCATTCCAAGTAACTGAATTTCTTTCAGGCATTTCGTCGAACAGCTGAGATGCATCGGAAATAGACAGACAAGAACAATACATGTGCACTAGAGCCGTGCTTGTGTAGACATCACGAATGAAACCCATTTGAACAACAGCACCATGTATCATTTTTCCAAGTTTAATATCACATAGCTGTGCAGTCGCTTTAAGAACGGCCGGGAAAGTAGAAGAATCAGGCAGAATGCTAAATTTATGCATGTGAGCAAAAATGATAAGTGAATTCAAATGCTCATTCAAATCCAAATAACCCCTGATCATCGAGTTACAAATCTGAGAATTAATAGAACCACGAAACTTAGAGAAAATAAGAGCAATAGATTCAAACCCATTATTCGAAACAGAGTCCTCAATGAGTTTCATGAGGAAATATCCGTCGCTCCGCATGTCATTGCCTTCCTTGCGGCGGGCATCCAGACAATCAGGAATCTTTCTCTCAATACCGCCCTGAGACGGAAGATTCATCACCCGGGTATGCACAAATCTCGATATTTGTGATTGGTTCGCTAAAAAATCCAAGTTCGAGATCGTACAAGTATCTTGGAGCCGAGCCCTTCGAATCAAACGAAATGAGAAAGGCAGAAACCTGAGATCCATTTTCATATTCAATTAAAGACAGTCTGGATTAGAGTTTCCAGTTCATAGTTCA

Coding sequence (CDS)

ATGGAGAAGATAGAGATAAAGAAAGAGCTGTCCCTCAATTCCTGCACCTGGGTACCTATGAGCATGAGCCACTGGACAGTCCCGAACTGGAACCCTATGCCCAAGTATGGACGGTCATGTGCGAGAGGGGATGAGCAAGAAGCAGCCATGGATGCCCATGTGGAGGTGCTTTTCCAAACGCCTTTAACTAACCACTCGTGGAGCCAATCTGATGATACCCTCGAGTCCATTGTTCATAGTTCATTGTCAAGGAAACGGACTCGATCCAACCCAGATTGCAAAAATGAAGCTTTGATGAGTTGGGCTTCCATGGAATCTCACGGGAGCTTCAAAACCGATGATTCCGTTGAGGATTTAGCTCAAGATGATGGCTGGAAACGGAGAGATCGAATAAACCAGAAGATGAAAGATCTGCAAAAGTTGGTGCCAAATGGGAGTAAGATGGATAGAGCTTCACTTTTGGATGACACAATTCAATATTTGAAGCAACTTCAAGCACAAGTTCAGTTCATGTACAGTATCAGATCAATTGTGCCGCAGATGGTCATGCCTCTAGAAATTCAGCAGCAGCAGCAGCAGCAGCTTCAGATGTCACTGCTTGCAGCTCATATGGGACTACTTAACACTGATTCAAAGGCTCCCAGCTCCAGTTCCTTCCCTTGTGCTGCTGCATTCCCTCCTCCTCCGTTTCTACTGTCTTCTATCAACTCAACAACAAAGCCAAAATCTAACCTTTCCACCAGTGCTTTCGTTCCTCCGACCGATCCTTTCTGTACTTTCTTGGCACAATCGATGGATATGGATTTCTACAGTAAGATGGTGACACTATATTGCCAAGAAGTGAACAGGACACCTCGGCAGGCAGGCAAATTTGAAGGAATCAAGGAAGATATGCATTAG

Protein sequence

MEKIEIKKELSLNSCTWVPMSMSHWTVPNWNPMPKYGRSCARGDEQEAAMDAHVEVLFQTPLTNHSWSQSDDTLESIVHSSLSRKRTRSNPDCKNEALMSWASMESHGSFKTDDSVEDLAQDDGWKRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPLEIQQQQQQQLQMSLLAAHMGLLNTDSKAPSSSSFPCAAAFPPPPFLLSSINSTTKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQAGKFEGIKEDMH
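The protein sequence is the conceptual translation of the CDS. As a rough check, the sketch below translates the first 30 and the last 15 nucleotides of the CDS shown above with the standard genetic code; `translate` and the variable names are illustrative, and using the standard nuclear code for this gene is an assumption.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGGAGAAGATAGAGATAAAGAAAGAGCTG"  # first 30 nt of the CDS above
cds_end = "GAAGATATGCATTAG"                   # last 15 nt of the CDS above

print(translate(cds_start))  # MEKIEIKKEL
print(translate(cds_end))    # EDMH*
```

Both fragments agree with the ends of the listed protein (MEKIEIKKEL... and ...EDMH), with the final TAG read as the stop codon.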
BLAST of Cp4.1LG04g14670 vs. Swiss-Prot
Match: PIF7_ARATH (Transcription factor PIF7 OS=Arabidopsis thaliana GN=BHLH72 PE=1 SV=2)

HSP 1 Score: 97.4 bits (241), Expect = 2.7e-19
Identity = 73/185 (39.46%), Positives = 105/185 (56.76%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPL 185
           +RRDRINQ+M+ LQKL+P  SK D+ S+LDD I++LKQLQAQVQFM S+R+ +PQ +M  
Sbjct: 177 RRRDRINQRMRTLQKLLPTASKADKVSILDDVIEHLKQLQAQVQFM-SLRANLPQQMMIP 236

Query: 186 EI---------------------QQQQQQQLQMSLLA--AHMGLLNTDSKAPSSSSFPCA 245
           ++                     QQQQQQQ QMSLLA  A MG+    +        P  
Sbjct: 237 QLPPPQSVLSIQHQQQQQQQQQQQQQQQQQFQMSLLATMARMGMGGGGNGYGGLVPPP-- 296

Query: 246 AAFPPPPFLLSSINSTTKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKM-VTLYCQEV 287
              PPPP ++  + +      + +T      +DP+  F AQ+M+MD Y+KM   +Y Q+ 
Sbjct: 297 ---PPPPMMVPPMGNRDCTNGSSATL-----SDPYSAFFAQTMNMDLYNKMAAAIYRQQS 350
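
The percentages in each HSP header are simply the identical (or positive-scoring) positions divided by the alignment length. A small sketch of reading one header line and recomputing them is given below; the regular expression and the function name are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches e.g. "Identity = 73/185 (39.46%), Positives = 105/185 (56.76%)".
# "Pos\w*" tolerates spelling variants of the second label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_stats(line, tol=0.01):
    """Recompute identity/positive percentages and compare with the reported ones."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("no HSP statistics found in line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = 100 * int(ident) / int(ident_len)
    positives = 100 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(identity, 2),
        "positives_pct": round(positives, 2),
        "consistent": abs(identity - float(ident_pct)) < tol
        and abs(positives - float(pos_pct)) < tol,
    }

stats = check_hsp_stats(
    "Identity = 73/185 (39.46%), Positives = 105/185 (56.76%), Query Frame = 1"
)
print(stats)
```

For the PIF7_ARATH hit this reproduces 39.46% identity and 56.76% positives over a 185-column alignment.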

BLAST of Cp4.1LG04g14670 vs. Swiss-Prot
Match: UNE10_ARATH (Transcription factor UNE10 OS=Arabidopsis thaliana GN=UNE10 PE=2 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 6.1e-19
Identity = 91/259 (35.14%), Positives = 130/259 (50.19%), Query Frame = 1

Query: 43  GDEQEAAMDAHVEVLFQTPLTNHSWSQSDDTLESIVHSSLSRKRTRSNPDCKNEALMSWA 102
           G  Q   MD      +    T+ S    D+T++   H S+   R +   + + +A     
Sbjct: 154 GGSQRLTMDT-----YDVGFTSTSMGSHDNTIDD--HDSVCHSRPQMEDEEEKKA----- 213

Query: 103 SMESHGSFKTDDSVEDLAQDDGWKRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLK 162
             +S  S K   +     Q +  KRRD+INQ+MK LQKLVPN SK D+AS+LD+ I+YLK
Sbjct: 214 GGKSSVSTKRSRAAAIHNQSER-KRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLK 273

Query: 163 QLQAQVQFMYSIRSIVPQMVMPLEIQQQQQQQLQMSLLAAHMGL-------------LNT 222
           QLQAQV  M   R  +P M++P+ +  QQQQQLQMSL++  MGL             LN+
Sbjct: 274 QLQAQVSMM--SRMNMPSMMLPMAM--QQQQQLQMSLMSNPMGLGMGMGMPGLGLLDLNS 333

Query: 223 DSKAPSSSSFPCAAAFPPPPFLLSSINSTTKPKSNLSTSAFVPPTDPFCTFLA---QSMD 282
            ++A +S+    A   P P   ++  +       +   S  +P  DP   FLA   Q   
Sbjct: 334 MNRAAASAPNIHANMMPNPFLPMNCPSWDASSNDSRFQSPLIP--DPMSAFLACSTQPTT 393

Query: 283 MDFYSKMVTLYCQEVNRTP 286
           M+ YS+M TLY Q   + P
Sbjct: 394 MEAYSRMATLYQQMQQQLP 393

BLAST of Cp4.1LG04g14670 vs. Swiss-Prot
Match: APG_ORYSJ (Transcription factor APG OS=Oryza sativa subsp. japonica GN=APG PE=1 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 2.3e-10
Identity = 49/131 (37.40%), Positives = 69/131 (52.67%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFM-YSIRSIVPQMVMP 185
           +RRDRIN+KM+ LQ+L+PN +K+D+AS+L++ I+YLK LQ QVQ M       VP M++P
Sbjct: 346 RRRDRINEKMRALQELIPNCNKIDKASMLEEAIEYLKTLQLQVQMMSMGTGMFVPPMMLP 405

Query: 186 LEIQQQQQQQLQMSLLAA---------HMGLLNTDSKA----PSSSSFPC---------- 231
                 Q   +QM  +A          H+G       A    P+++ FPC          
Sbjct: 406 AAAAAMQHHHMQMQQMAGPMAAAAHFPHLGAAAAMGLAGFGMPAAAQFPCPMFPAAPPMS 465

BLAST of Cp4.1LG04g14670 vs. Swiss-Prot
Match: PIF3_ARATH (Transcription factor PIF3 OS=Arabidopsis thaliana GN=PIF3 PE=1 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 9.8e-09
Identity = 29/46 (63.04%), Positives = 40/46 (86.96%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFM 172
           +RRDRIN+KM+ LQ+L+PN +K+D+AS+LD+ I+YLK LQ QVQ M
Sbjct: 354 RRRDRINEKMRALQELIPNCNKVDKASMLDEAIEYLKSLQLQVQIM 399

BLAST of Cp4.1LG04g14670 vs. Swiss-Prot
Match: ALC_ARATH (Transcription factor ALC OS=Arabidopsis thaliana GN=ALC PE=2 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 3.7e-08
Identity = 38/85 (44.71%), Positives = 54/85 (63.53%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFMYSIRSI------VP 185
           KRR +IN+KMK LQKL+PN +K D+AS+LD+ I+YLKQLQ QVQ +  +  +      +P
Sbjct: 104 KRRSKINEKMKALQKLIPNSNKTDKASMLDEAIEYLKQLQLQVQTLAVMNGLGLNPMRLP 163

Query: 186 QMVMP--LEIQQQQQQQLQMSLLAA 203
           Q+  P    I +  +Q L +  L A
Sbjct: 164 QVPPPTHTRINETLEQDLNLETLLA 188

BLAST of Cp4.1LG04g14670 vs. TrEMBL
Match: A0A0A0LSD5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G011540 PE=4 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 6.3e-63
Identity = 170/316 (53.80%), Positives = 197/316 (62.34%), Query Frame = 1

Query: 31  NPMPKYG---RSCARGDEQEAAMD----AHVEVLFQTPLTNHSWSQSDDTLESIVHSSLS 90
           N  PKY    R+   GD+Q AA++     H +VL Q P T HSWS+S+DTLESIVHSSLS
Sbjct: 2   NFRPKYEGMERTWESGDQQAAAIEDHGHCHGKVLPQMPSTKHSWSESEDTLESIVHSSLS 61

Query: 91  RKRTRSNPDC-KNEALMSWASMESHGSFKTDDSVEDLA-QDDGWKRRDRINQKM------ 150
           RKRTRSNP+C K+E LM+ AS+ESH +FK+ +S++DLA + DG ++   +  K       
Sbjct: 62  RKRTRSNPECWKDETLMTEASLESHRTFKSKNSIQDLALEHDGSEKEYNMKGKTDGSCSN 121

Query: 151 KDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQ-------------------------- 210
           +  +    N ++ +R    D   Q +K LQ  V                           
Sbjct: 122 RRTRTAAINHNQYERRR-RDRINQRMKDLQKLVPNGSKTDRASLLDDTIQYLKQLQAQVQ 181

Query: 211 FMYSIRSIVPQMVMPLEIQQQQQQQLQMSLLAAHMGLLNTDSKAPSSSSFPCAAAFPPPP 270
           FM SIRS VPQMVMPL I   QQQQLQMSLLAA MGLL   S A SSSSFPCAA F  P 
Sbjct: 182 FMDSIRSAVPQMVMPLGI---QQQQLQMSLLAARMGLLGAASMASSSSSFPCAATF--PQ 241

Query: 271 FLLSSINSTTKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQAG 300
             L SI STTKPKS LST AFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTP+Q  
Sbjct: 242 IQLPSIVSTTKPKSKLSTRAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPQQTS 301

BLAST of Cp4.1LG04g14670 vs. TrEMBL
Match: A0A0D2LW15_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G078500 PE=4 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 1.7e-28
Identity = 111/294 (37.76%), Positives = 151/294 (51.36%), Query Frame = 1

Query: 35  KYGRSCA----RGDEQEAAMDAHVEVLFQTPLTNHSWSQSDDTLESIVHSSLSRKRTRSN 94
           K+  SC     + D  E   D   ++      T  +W   D++L+S     L  K T  +
Sbjct: 85  KHNISCGIQKDKADRSECGCDTFYKI--DNDATMVTWGSHDESLQS-----LKTKTTDGD 144

Query: 95  PDCKNEALMSWASMESHGSFKTDDSVEDLAQDDGWKRRDRINQKMKDLQKLVPNGSKMDR 154
             C + +     +  SH + ++  +      +   KRRDRINQKMK LQKLVPN SK D+
Sbjct: 145 SGCHDGSESRDETGRSHPTRRSRAAATHNLSER--KRRDRINQKMKALQKLVPNASKTDK 204

Query: 155 ASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPLEIQQQQQQQLQMSL----------- 214
           AS+LD+ I+YLKQLQAQVQ M S+RSI P M+MPL +  Q  Q LQMSL           
Sbjct: 205 ASMLDEVIEYLKQLQAQVQVM-SMRSIPPMMMMPLGL--QHHQHLQMSLLGRIMAGMGVN 264

Query: 215 --LAAHMGLLNTDSKAP--SSSSFPCAAAFPP-------PPFLLSSINSTTKPKS--NLS 274
             L   MGL++ ++  P  +S S P     PP       PP + S   +T   +S  N S
Sbjct: 265 HALGMGMGLVDINAATPPNASQSLPPLLHLPPPFLATALPPMIPSRATATAAAQSNPNAS 324

Query: 275 TSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQA---GKFEGIKED 298
           +S  +P  DP C FL QSM+M+ YSKM  LY  ++NRT   A    +   IK+D
Sbjct: 325 SSDSIPLPDPSCAFLTQSMNMELYSKMAALYQAQMNRTTETASSPSRSNNIKQD 366

BLAST of Cp4.1LG04g14670 vs. TrEMBL
Match: A0A0D2PVV3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G078500 PE=4 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 1.7e-28
Identity = 111/294 (37.76%), Positives = 151/294 (51.36%), Query Frame = 1

Query: 35  KYGRSCA----RGDEQEAAMDAHVEVLFQTPLTNHSWSQSDDTLESIVHSSLSRKRTRSN 94
           K+  SC     + D  E   D   ++      T  +W   D++L+S     L  K T  +
Sbjct: 150 KHNISCGIQKDKADRSECGCDTFYKI--DNDATMVTWGSHDESLQS-----LKTKTTDGD 209

Query: 95  PDCKNEALMSWASMESHGSFKTDDSVEDLAQDDGWKRRDRINQKMKDLQKLVPNGSKMDR 154
             C + +     +  SH + ++  +      +   KRRDRINQKMK LQKLVPN SK D+
Sbjct: 210 SGCHDGSESRDETGRSHPTRRSRAAATHNLSER--KRRDRINQKMKALQKLVPNASKTDK 269

Query: 155 ASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPLEIQQQQQQQLQMSL----------- 214
           AS+LD+ I+YLKQLQAQVQ M S+RSI P M+MPL +  Q  Q LQMSL           
Sbjct: 270 ASMLDEVIEYLKQLQAQVQVM-SMRSIPPMMMMPLGL--QHHQHLQMSLLGRIMAGMGVN 329

Query: 215 --LAAHMGLLNTDSKAP--SSSSFPCAAAFPP-------PPFLLSSINSTTKPKS--NLS 274
             L   MGL++ ++  P  +S S P     PP       PP + S   +T   +S  N S
Sbjct: 330 HALGMGMGLVDINAATPPNASQSLPPLLHLPPPFLATALPPMIPSRATATAAAQSNPNAS 389

Query: 275 TSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQA---GKFEGIKED 298
           +S  +P  DP C FL QSM+M+ YSKM  LY  ++NRT   A    +   IK+D
Sbjct: 390 SSDSIPLPDPSCAFLTQSMNMELYSKMAALYQAQMNRTTETASSPSRSNNIKQD 431

BLAST of Cp4.1LG04g14670 vs. TrEMBL
Match: F6GT27_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g06930 PE=4 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 5.0e-28
Identity = 92/178 (51.69%), Positives = 118/178 (66.29%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPL 185
           +RRDRINQKMK LQKLVPN SK D+AS+LD+ I+YLKQLQAQVQ M S+R++ PQM+MP+
Sbjct: 234 RRRDRINQKMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVQMM-SVRNM-PQMMMPM 293

Query: 186 EIQQQQQQQLQMSLLAAH---------MGLLNTDS---KAPSS-----SSFPCAAAFP-- 245
            +QQQ    LQMSLLA           MG+L+  +    AP +      + P  AA P  
Sbjct: 294 GMQQQ----LQMSLLARMGMGVGLGMGMGMLDMSAVPRAAPQTLPSLLHANPVVAATPTF 353

Query: 246 -PPPFLLSSIN-STTKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVN 283
            PPPF++  +  S+++PKS+   +A VP  DP+C FLAQSM+MD Y KM  LY Q VN
Sbjct: 354 VPPPFVVPPMMPSSSQPKSDAGANAAVPLQDPYCAFLAQSMNMDLYHKMAALYRQHVN 405

BLAST of Cp4.1LG04g14670 vs. TrEMBL
Match: A0A061G4L1_THECC (DNA binding protein, putative isoform 4 OS=Theobroma cacao GN=TCM_014141 PE=4 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 2.8e-26
Identity = 91/188 (48.40%), Positives = 121/188 (64.36%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPL 185
           +RRDRINQKM+ LQKLVPN SK D+AS+LD+ I+YLKQLQAQVQ M S+RS +PQM++PL
Sbjct: 216 RRRDRINQKMRTLQKLVPNASKTDKASMLDEVIEYLKQLQAQVQMM-SMRS-MPQMMVPL 275

Query: 186 EIQQQQQQQLQMSLLA-AHMGLLNTDSKA--PSSS----SFPCAAAFPP---PPFLLSSI 245
            +    QQ LQMSLLA   MG+L+ +S A  PS S      P     PP   PPF+   +
Sbjct: 276 GM----QQHLQMSLLARMGMGMLDINSMARFPSQSLPPLMHPSPVTVPPTFLPPFVAPPM 335

Query: 246 NST---TKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQA---G 298
             T    +  S+  ++A VP  DP+C  LAQS++MD YSKM  LY  ++N+T + A    
Sbjct: 336 IPTREAAQANSDAISNASVPLPDPYCALLAQSVNMDLYSKMAALYRPQINQTTQTASSPS 395

BLAST of Cp4.1LG04g14670 vs. TAIR10
Match: AT5G61270.1 (AT5G61270.1 phytochrome-interacting factor7)

HSP 1 Score: 97.4 bits (241), Expect = 1.5e-20
Identity = 73/185 (39.46%), Positives = 105/185 (56.76%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPL 185
           +RRDRINQ+M+ LQKL+P  SK D+ S+LDD I++LKQLQAQVQFM S+R+ +PQ +M  
Sbjct: 177 RRRDRINQRMRTLQKLLPTASKADKVSILDDVIEHLKQLQAQVQFM-SLRANLPQQMMIP 236

Query: 186 EI---------------------QQQQQQQLQMSLLA--AHMGLLNTDSKAPSSSSFPCA 245
           ++                     QQQQQQQ QMSLLA  A MG+    +        P  
Sbjct: 237 QLPPPQSVLSIQHQQQQQQQQQQQQQQQQQFQMSLLATMARMGMGGGGNGYGGLVPPP-- 296

Query: 246 AAFPPPPFLLSSINSTTKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKM-VTLYCQEV 287
              PPPP ++  + +      + +T      +DP+  F AQ+M+MD Y+KM   +Y Q+ 
Sbjct: 297 ---PPPPMMVPPMGNRDCTNGSSATL-----SDPYSAFFAQTMNMDLYNKMAAAIYRQQS 350

BLAST of Cp4.1LG04g14670 vs. TAIR10
Match: AT4G00050.1 (AT4G00050.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 96.3 bits (238), Expect = 3.4e-20
Identity = 91/259 (35.14%), Positives = 130/259 (50.19%), Query Frame = 1

Query: 43  GDEQEAAMDAHVEVLFQTPLTNHSWSQSDDTLESIVHSSLSRKRTRSNPDCKNEALMSWA 102
           G  Q   MD      +    T+ S    D+T++   H S+   R +   + + +A     
Sbjct: 154 GGSQRLTMDT-----YDVGFTSTSMGSHDNTIDD--HDSVCHSRPQMEDEEEKKA----- 213

Query: 103 SMESHGSFKTDDSVEDLAQDDGWKRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLK 162
             +S  S K   +     Q +  KRRD+INQ+MK LQKLVPN SK D+AS+LD+ I+YLK
Sbjct: 214 GGKSSVSTKRSRAAAIHNQSER-KRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLK 273

Query: 163 QLQAQVQFMYSIRSIVPQMVMPLEIQQQQQQQLQMSLLAAHMGL-------------LNT 222
           QLQAQV  M   R  +P M++P+ +  QQQQQLQMSL++  MGL             LN+
Sbjct: 274 QLQAQVSMM--SRMNMPSMMLPMAM--QQQQQLQMSLMSNPMGLGMGMGMPGLGLLDLNS 333

Query: 223 DSKAPSSSSFPCAAAFPPPPFLLSSINSTTKPKSNLSTSAFVPPTDPFCTFLA---QSMD 282
            ++A +S+    A   P P   ++  +       +   S  +P  DP   FLA   Q   
Sbjct: 334 MNRAAASAPNIHANMMPNPFLPMNCPSWDASSNDSRFQSPLIP--DPMSAFLACSTQPTT 393

Query: 283 MDFYSKMVTLYCQEVNRTP 286
           M+ YS+M TLY Q   + P
Sbjct: 394 MEAYSRMATLYQQMQQQLP 393

BLAST of Cp4.1LG04g14670 vs. TAIR10
Match: AT1G09530.1 (AT1G09530.1 phytochrome interacting factor 3)

HSP 1 Score: 62.4 bits (150), Expect = 5.5e-10
Identity = 29/46 (63.04%), Positives = 40/46 (86.96%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFM 172
           +RRDRIN+KM+ LQ+L+PN +K+D+AS+LD+ I+YLK LQ QVQ M
Sbjct: 354 RRRDRINEKMRALQELIPNCNKVDKASMLDEAIEYLKSLQLQVQIM 399

BLAST of Cp4.1LG04g14670 vs. TAIR10
Match: AT5G67110.1 (AT5G67110.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein)

HSP 1 Score: 60.5 bits (145), Expect = 2.1e-09
Identity = 38/85 (44.71%), Positives = 54/85 (63.53%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFMYSIRSI------VP 185
           KRR +IN+KMK LQKL+PN +K D+AS+LD+ I+YLKQLQ QVQ +  +  +      +P
Sbjct: 104 KRRSKINEKMKALQKLIPNSNKTDKASMLDEAIEYLKQLQLQVQTLAVMNGLGLNPMRLP 163

Query: 186 QMVMP--LEIQQQQQQQLQMSLLAA 203
           Q+  P    I +  +Q L +  L A
Sbjct: 164 QVPPPTHTRINETLEQDLNLETLLA 188

BLAST of Cp4.1LG04g14670 vs. TAIR10
Match: AT3G59060.2 (AT3G59060.2 phytochrome interacting factor 3-like 6)

HSP 1 Score: 58.2 bits (139), Expect = 1.0e-08
Identity = 27/47 (57.45%), Positives = 39/47 (82.98%), Query Frame = 1

Query: 126 KRRDRINQKMKDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQFMY 173
           +RRDRIN++MK LQ+L+P+ S+ D+AS+LD+ I YLK LQ Q+Q M+
Sbjct: 267 RRRDRINERMKALQELIPHCSRTDKASILDEAIDYLKSLQMQLQVMW 313

BLAST of Cp4.1LG04g14670 vs. NCBI nr
Match: gi|778656018|ref|XP_004138310.2| (PREDICTED: transcription factor UNE10 [Cucumis sativus])

HSP 1 Score: 253.4 bits (646), Expect = 4.8e-64
Identity = 171/315 (54.29%), Positives = 198/315 (62.86%), Query Frame = 1

Query: 32  PMPKYG---RSCARGDEQEAAMD----AHVEVLFQTPLTNHSWSQSDDTLESIVHSSLSR 91
           PMPKY    R+   GD+Q AA++     H +VL Q P T HSWS+S+DTLESIVHSSLSR
Sbjct: 48  PMPKYEGMERTWESGDQQAAAIEDHGHCHGKVLPQMPSTKHSWSESEDTLESIVHSSLSR 107

Query: 92  KRTRSNPDC-KNEALMSWASMESHGSFKTDDSVEDLA-QDDGWKRRDRINQKM------K 151
           KRTRSNP+C K+E LM+ AS+ESH +FK+ +S++DLA + DG ++   +  K       +
Sbjct: 108 KRTRSNPECWKDETLMTEASLESHRTFKSKNSIQDLALEHDGSEKEYNMKGKTDGSCSNR 167

Query: 152 DLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQ--------------------------F 211
             +    N ++ +R    D   Q +K LQ  V                           F
Sbjct: 168 RTRTAAINHNQYERRR-RDRINQRMKDLQKLVPNGSKTDRASLLDDTIQYLKQLQAQVQF 227

Query: 212 MYSIRSIVPQMVMPLEIQQQQQQQLQMSLLAAHMGLLNTDSKAPSSSSFPCAAAFPPPPF 271
           M SIRS VPQMVMPL I   QQQQLQMSLLAA MGLL   S A SSSSFPCAA F  P  
Sbjct: 228 MDSIRSAVPQMVMPLGI---QQQQLQMSLLAARMGLLGAASMASSSSSFPCAATF--PQI 287

Query: 272 LLSSINSTTKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQAGK 300
            L SI STTKPKS LST AFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTP+Q  K
Sbjct: 288 QLPSIVSTTKPKSKLSTRAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPQQTSK 347

BLAST of Cp4.1LG04g14670 vs. NCBI nr
Match: gi|700208606|gb|KGN63702.1| (hypothetical protein Csa_1G011540 [Cucumis sativus])

HSP 1 Score: 249.2 bits (635), Expect = 9.0e-63
Identity = 170/316 (53.80%), Positives = 197/316 (62.34%), Query Frame = 1

Query: 31  NPMPKYG---RSCARGDEQEAAMD----AHVEVLFQTPLTNHSWSQSDDTLESIVHSSLS 90
           N  PKY    R+   GD+Q AA++     H +VL Q P T HSWS+S+DTLESIVHSSLS
Sbjct: 2   NFRPKYEGMERTWESGDQQAAAIEDHGHCHGKVLPQMPSTKHSWSESEDTLESIVHSSLS 61

Query: 91  RKRTRSNPDC-KNEALMSWASMESHGSFKTDDSVEDLA-QDDGWKRRDRINQKM------ 150
           RKRTRSNP+C K+E LM+ AS+ESH +FK+ +S++DLA + DG ++   +  K       
Sbjct: 62  RKRTRSNPECWKDETLMTEASLESHRTFKSKNSIQDLALEHDGSEKEYNMKGKTDGSCSN 121

Query: 151 KDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQ-------------------------- 210
           +  +    N ++ +R    D   Q +K LQ  V                           
Sbjct: 122 RRTRTAAINHNQYERRR-RDRINQRMKDLQKLVPNGSKTDRASLLDDTIQYLKQLQAQVQ 181

Query: 211 FMYSIRSIVPQMVMPLEIQQQQQQQLQMSLLAAHMGLLNTDSKAPSSSSFPCAAAFPPPP 270
           FM SIRS VPQMVMPL I   QQQQLQMSLLAA MGLL   S A SSSSFPCAA F  P 
Sbjct: 182 FMDSIRSAVPQMVMPLGI---QQQQLQMSLLAARMGLLGAASMASSSSSFPCAATF--PQ 241

Query: 271 FLLSSINSTTKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQAG 300
             L SI STTKPKS LST AFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTP+Q  
Sbjct: 242 IQLPSIVSTTKPKSKLSTRAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPQQTS 301

BLAST of Cp4.1LG04g14670 vs. NCBI nr
Match: gi|659106027|ref|XP_008453232.1| (PREDICTED: transcription factor UNE10 [Cucumis melo])

HSP 1 Score: 244.6 bits (623), Expect = 2.2e-61
Identity = 172/318 (54.09%), Positives = 197/318 (61.95%), Query Frame = 1

Query: 29  NWNPMPKYG---RSCARGDEQEAAMDAHV----EVLFQTPLTNHSWSQSDDTLESIVHSS 88
           N  P PKY    R+   GD+QEAAM+ H     EVL Q P T HSWS+S+DTLESIVHSS
Sbjct: 47  NLIPTPKYEGMERTWESGDQQEAAMEGHGHCHGEVLPQMPSTKHSWSESEDTLESIVHSS 106

Query: 89  LSRKRTRSNPD-CKNEALMSWASMESHGSFKTDDSVEDLA-QDDGWKRRDRINQKM---- 148
           LSRKRTRSNP+  K+E LM+ AS+ESH +FK+ +S++DLA + DG +    +  KM    
Sbjct: 107 LSRKRTRSNPEYWKDETLMTEASLESHRTFKSKNSIQDLAVEHDGSEEEYNMKGKMGGSC 166

Query: 149 --KDLQKLVPNGSKMDRASLLDDTIQYLKQLQAQVQ------------------------ 208
             +  +    N ++ +R    D   Q +K LQ  V                         
Sbjct: 167 SNRQTRTAAINHNQYERRR-RDRINQKMKDLQKLVPNGSKTDRASLLDDTIQYLKQLQAQ 226

Query: 209 --FMYSIRSIVPQMVMPLEIQQQQQQQLQMSLLAAHMGLLNTDSKAPSSSSFPCAAAFPP 268
             FM SIRS   QMVMPL I QQQQQQLQMSLLAA  GLL+  S A SSSSFP AA F  
Sbjct: 227 VQFMGSIRS-ASQMVMPLGI-QQQQQQLQMSLLAARTGLLDAASVASSSSSFPWAATF-- 286

Query: 269 PPFLLSSINSTTKPKSNLSTSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQ 300
           P  LL SI STTKPKS  ST AFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTP+Q
Sbjct: 287 PQILLPSIVSTTKPKSKFSTGAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPQQ 346

BLAST of Cp4.1LG04g14670 vs. NCBI nr
Match: gi|823122360|ref|XP_012470587.1| (PREDICTED: transcription factor UNE10-like [Gossypium raimondii])

HSP 1 Score: 134.8 bits (338), Expect = 2.5e-28
Identity = 111/294 (37.76%), Positives = 151/294 (51.36%), Query Frame = 1

Query: 35  KYGRSCA----RGDEQEAAMDAHVEVLFQTPLTNHSWSQSDDTLESIVHSSLSRKRTRSN 94
           K+  SC     + D  E   D   ++      T  +W   D++L+S     L  K T  +
Sbjct: 150 KHNISCGIQKDKADRSECGCDTFYKI--DNDATMVTWGSHDESLQS-----LKTKTTDGD 209

Query: 95  PDCKNEALMSWASMESHGSFKTDDSVEDLAQDDGWKRRDRINQKMKDLQKLVPNGSKMDR 154
             C + +     +  SH + ++  +      +   KRRDRINQKMK LQKLVPN SK D+
Sbjct: 210 SGCHDGSESRDETGRSHPTRRSRAAATHNLSER--KRRDRINQKMKALQKLVPNASKTDK 269

Query: 155 ASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPLEIQQQQQQQLQMSL----------- 214
           AS+LD+ I+YLKQLQAQVQ M S+RSI P M+MPL +  Q  Q LQMSL           
Sbjct: 270 ASMLDEVIEYLKQLQAQVQVM-SMRSIPPMMMMPLGL--QHHQHLQMSLLGRIMAGMGVN 329

Query: 215 --LAAHMGLLNTDSKAP--SSSSFPCAAAFPP-------PPFLLSSINSTTKPKS--NLS 274
             L   MGL++ ++  P  +S S P     PP       PP + S   +T   +S  N S
Sbjct: 330 HALGMGMGLVDINAATPPNASQSLPPLLHLPPPFLATALPPMIPSRATATAAAQSNPNAS 389

Query: 275 TSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQA---GKFEGIKED 298
           +S  +P  DP C FL QSM+M+ YSKM  LY  ++NRT   A    +   IK+D
Sbjct: 390 SSDSIPLPDPSCAFLTQSMNMELYSKMAALYQAQMNRTTETASSPSRSNNIKQD 431

BLAST of Cp4.1LG04g14670 vs. NCBI nr
Match: gi|763740858|gb|KJB08357.1| (hypothetical protein B456_001G078500 [Gossypium raimondii])

HSP 1 Score: 134.8 bits (338), Expect = 2.5e-28
Identity = 111/294 (37.76%), Positives = 151/294 (51.36%), Query Frame = 1

Query: 35  KYGRSCA----RGDEQEAAMDAHVEVLFQTPLTNHSWSQSDDTLESIVHSSLSRKRTRSN 94
           K+  SC     + D  E   D   ++      T  +W   D++L+S     L  K T  +
Sbjct: 85  KHNISCGIQKDKADRSECGCDTFYKI--DNDATMVTWGSHDESLQS-----LKTKTTDGD 144

Query: 95  PDCKNEALMSWASMESHGSFKTDDSVEDLAQDDGWKRRDRINQKMKDLQKLVPNGSKMDR 154
             C + +     +  SH + ++  +      +   KRRDRINQKMK LQKLVPN SK D+
Sbjct: 145 SGCHDGSESRDETGRSHPTRRSRAAATHNLSER--KRRDRINQKMKALQKLVPNASKTDK 204

Query: 155 ASLLDDTIQYLKQLQAQVQFMYSIRSIVPQMVMPLEIQQQQQQQLQMSL----------- 214
           AS+LD+ I+YLKQLQAQVQ M S+RSI P M+MPL +  Q  Q LQMSL           
Sbjct: 205 ASMLDEVIEYLKQLQAQVQVM-SMRSIPPMMMMPLGL--QHHQHLQMSLLGRIMAGMGVN 264

Query: 215 --LAAHMGLLNTDSKAP--SSSSFPCAAAFPP-------PPFLLSSINSTTKPKS--NLS 274
             L   MGL++ ++  P  +S S P     PP       PP + S   +T   +S  N S
Sbjct: 265 HALGMGMGLVDINAATPPNASQSLPPLLHLPPPFLATALPPMIPSRATATAAAQSNPNAS 324

Query: 275 TSAFVPPTDPFCTFLAQSMDMDFYSKMVTLYCQEVNRTPRQA---GKFEGIKED 298
           +S  +P  DP C FL QSM+M+ YSKM  LY  ++NRT   A    +   IK+D
Sbjct: 325 SSDSIPLPDPSCAFLTQSMNMELYSKMAALYQAQMNRTTETASSPSRSNNIKQD 366

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name	E-value	Identity (%)	Description
PIF7_ARATH	2.7e-19	39.46	Transcription factor PIF7 OS=Arabidopsis thaliana GN=BHLH72 PE=1 SV=2
UNE10_ARATH	6.1e-19	35.14	Transcription factor UNE10 OS=Arabidopsis thaliana GN=UNE10 PE=2 SV=1
APG_ORYSJ	2.3e-10	37.40	Transcription factor APG OS=Oryza sativa subsp. japonica GN=APG PE=1 SV=1
PIF3_ARATH	9.8e-09	63.04	Transcription factor PIF3 OS=Arabidopsis thaliana GN=PIF3 PE=1 SV=1
ALC_ARATH	3.7e-08	44.71	Transcription factor ALC OS=Arabidopsis thaliana GN=ALC PE=2 SV=1

vs. TrEMBL
Match Name	E-value	Identity (%)	Description
A0A0A0LSD5_CUCSA	6.3e-63	53.80	Uncharacterized protein OS=Cucumis sativus GN=Csa_1G011540 PE=4 SV=1
A0A0D2LW15_GOSRA	1.7e-28	37.76	Uncharacterized protein OS=Gossypium raimondii GN=B456_001G078500 PE=4 SV=1
A0A0D2PVV3_GOSRA	1.7e-28	37.76	Uncharacterized protein OS=Gossypium raimondii GN=B456_001G078500 PE=4 SV=1
F6GT27_VITVI	5.0e-28	51.69	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g06930 PE=4 SV=...
A0A061G4L1_THECC	2.8e-26	48.40	DNA binding protein, putative isoform 4 OS=Theobroma cacao GN=TCM_014141 PE=4 SV...

vs. TAIR10
Match Name	E-value	Identity (%)	Description
AT5G61270.1	1.5e-20	39.46	phytochrome-interacting factor7
AT4G00050.1	3.4e-20	35.14	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT1G09530.1	5.5e-10	63.04	phytochrome interacting factor 3
AT5G67110.1	2.1e-09	44.71	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
AT3G59060.2	1.0e-08	57.45	phytochrome interacting factor 3-like 6

vs. NCBI nr
Match Name	E-value	Identity (%)	Description
gi|778656018|ref|XP_004138310.2|	4.8e-64	54.29	PREDICTED: transcription factor UNE10 [Cucumis sativus]
gi|700208606|gb|KGN63702.1|	9.0e-63	53.80	hypothetical protein Csa_1G011540 [Cucumis sativus]
gi|659106027|ref|XP_008453232.1|	2.2e-61	54.09	PREDICTED: transcription factor UNE10 [Cucumis melo]
gi|823122360|ref|XP_012470587.1|	2.5e-28	37.76	PREDICTED: transcription factor UNE10-like [Gossypium raimondii]
gi|763740858|gb|KJB08357.1|	2.5e-28	37.76	hypothetical protein B456_001G078500 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0046983	protein dimerization activity
Vocabulary: INTERPRO
Term	Definition
IPR011598	bHLH_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0009416 response to light stimulus
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0005488 binding
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG04g14670.1	Cp4.1LG04g14670.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	GENE3D	G3DSA:4.10.280.10		coord: 126..173; score: 2.6
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	PFAM	PF00010	HLH	coord: 126..165; score: 1.
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	SMART	SM00353	finulus	coord: 126..170; score: 2.
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	PROFILE	PS50888	BHLH	coord: 115..164; score: 12
IPR011598	Myc-type, basic helix-loop-helix (bHLH) domain	unknown	SSF47459	HLH, helix-loop-helix DNA-binding domain	coord: 126..190; score: 1.31
None	No IPR available	PANTHER	PTHR12565	STEROL REGULATORY ELEMENT-BINDING PROTEIN	coord: 126..277; score: 2.4
None	No IPR available	PANTHER	PTHR12565:SF165	TRANSCRIPTION FACTOR PIF7	coord: 126..277; score: 2.4

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG04g14670	Cp4.1LG18g07650	Cucurbita pepo (Zucchini)	cpecpeB361
Cp4.1LG04g14670	Cp4.1LG02g10510	Cucurbita pepo (Zucchini)	cpecpeB452