Cp4.1LG10g10100 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g10100
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myb family transcription factor family protein
Location: Cp4.1LG10 : 4138492 .. 4142462 (-)
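As a rough illustration of how the location field can be read, the sketch below parses a string of the form `seqid : start .. end (strand)` and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical.

```python
import re

# Pattern for "seqid : start .. end (strand)"; the layout is taken from this record.
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG10 : 4138492 .. 4142462 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 3971 bases on the minus strand of Cp4.1LG10.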
The following sequences are available for this feature:

Gene sequence (with intron)

ATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNATCCAAAAATTTGAAAAGGAAGTGTGTGAAGGGAGAAATAAAAAGAGATTGAGATTTGGGGAAAGTCCGAAATGGAGCGGATTTTGCGTTCCTACAAAGCAGGCAAAGGCTGACGATGAGACTGCTCACAGTCGCAGACGCCTTCTTTCCTTTTCAACTTTGAAGGCCAACCCAAGCCAATCCAAGCCATGGGAGAGCACACTGTGACGCACAAACCGAGATACAGTAAGGAATCTGATGCCCTTCAGTTTAACTAACACACCCCAACCACGAGGGAATCCAATTTTTATCTTCTTGCTTCGATTACCCACAACATTGATTCCTTTGAAACTCGTGAGCTACAGAAAGACAGAAATGGCGCCATCCCGATATTCAGGGATGCGGTCTTAGATCTGGGTATCTACTTATCCTCTTTCTCTCTCTAATCCTATGTTTTGATCTTGCATTTACATGGCTGTCTTCTAGTTTATGTTGTTGCTGAGTTTTGAACTGTAAGGCCGACTTCGGTTTCGGATATTGACTTATTGTCCCCCTTTAAGCCCTGGTTGTCGTTTCTGGTTCGGTTCTTATTTGTTGTTCTTCCTTTTCTTCGCTTGTTGCAATGCCTGCTAGTTAATTCTTTCCTAGTTTTTTAGCTCGTTTTAAGTCCTGGGTTTAGATATGTTCTGATGCTGAAGAATCCATTTGCCTGGCTTTAATTCTTATCAAGATTACTTTGGTTTGGTTCATTGGATGTTTGTGGTAAACAATCCTTGTCTTATAGGCATCTGTTTCTGTGTCATGTTTAGAATATCATTAGAAATTGGTCTTTTCAGACTCTAATCTGTTTTTTGTTTTTTGGTGGGGTGAAGAGAAGTGTTACTAGTGAAGACACCTCTCTTGATACTACACTCTTTGCTCGGCCATTGCTAAGTTATTGTCGATTTTCTTATAAACACTCCAGATACCCCGTTTTTCATGTTGGGGAAGAGTGCCCTTTCTAAAGAGCAAATGATTCAAACATCTGTAGGACTCATTAATCCCAATAAACTTCCCAAGTTCTGAGGTTTTCTGTCATTTTTTTGCTTATTGTTTACAGGTAAATGACATGTTTCAGCAACAGAGTACTCATAATTCAAGCTTTCTTCAAAACAACTCATTAGTTCGTGATCAGAACATACCTTTTGATGCCAGCTCGATGGAACCCACAAATGGAAGCAATGATCCCAGCAACACCCCGAATTTGGCCTCAAAACAGAGATTGCGATGGACACATGATCTTCACGAACGATTCGTTAATGCAGTGGCACAACTTGGTGGTCCAGATCGTGAGTAAACTCACCTCTTGTGGCATTGTAGTTTGGCATCTTCTTCATCTATATGTTTATTGTGGCTAAGGATGAAAGAACAAACAAACATGTTACAAATAGACACAATAAGCTGTATGCATCACGCCATTGATCTTCTTCCAAAAAAAGAAGTCATCATGCAAATTTACATTCAGGATGTGGTTTCCTACATAAATAGTCTTCCTTCTTATAGAGACTTTCTATTGATCATACACAGTTTGTCAATCAAGGAAGGAGAAGCATATTTTAGTATAGAGAATAGAAGAGTAAAAGGGTAAAACATGATAAACTGTAGTAAAGTTGCAGGCAGGGTATATTATCTTGAAGTTATGCGCCAACAAACGACGTTTAGCCATGTTTATGTGTGCATTTTTCCATCTCTTTGTTAGGTGCTACACCCAAAGGCGTCCTTCGAGTGATGGGTGTTCAAGGTCTAACGATATACCACGTTAAAAGCCACCTGCAGGTACTGTGATCTTTCACCTATCCTTACACGAGAGGTTTATGTTAGCATGCATAGAATTTATTTTCTAATTTGTTTTCTTGGATTCATCCAGAAATATCGACTTGCAAAATACCTTCCCGACTCATCGTCTGATGGTAGGTGAATCTAGTTCCCAATTGTTCTTTTTCCA
TCTTTATGTGCTCGAACTTTTTGCAGCATGCTGTAAGACCTCTAGTGACATTCACAAAGTCAATTCACCTGTTTGTTGTCTTGTTGATGCAAGGGAAAAAGGCTGACAAGAAGGATTCTATTGACGTTCTATCGAACATTGAGGGCTCGTCGTAAGTGGATCTAATCGCCTTCGCTTTTTATCCAGCTCTGTGTTTTATTGAAAGTTGATGTAAATACAATTTAGAAGCAGGCACCCCTCTTGCTCTTAGTTGGTAGCTGACTGTTTGATGGGCTTAACTCTAGAATTTGTCACCACCCATTTGCAGGGGAATGCAAATTACCGAAGCACTTAAGCTGCAGATGGAGGTCCAGAAGCGACTGCATGAACAATTAGAGGCATGTTCTAGTTGTCCTTTCTATTTGGGAATAAGAATGACTGCTTTTTATAAATAAAGGTTAACAAAAAGCCTGCTTGAAACCTGCTAAAGGTGTACTATAACCTGTGAAATTTTGTCCCTGCAACTTCAAGTAGACATTTTAGATATTCTAGAATTTGGAGCAGGCCTTCACCTAAGAACAGTCGGCTTAACTCATAATGCCACATCGGTTGGTGAGGAGAATGAAACATTCCTTATAAGGGTGTGGAAACTTCTCCCTAGTAGACGCATTTTAGAACCGTGAGGCTAACGACGATACATAACGGGCCAAAGCGGACAATATCTGCTAGTGGTGAGCTTGGACTGTTATAAATTGTATGAGTCAGGCACTGAGCGGTGTGCCAGCGAGAACGCTGGGCTCCAAGGGGGATAGATTGTGAGATCCCACATTGGTTGGAGAGGAGAACGAAACATTTCTTATATAGGTGTGGAAACCTCTCCCTAGTAGACGCATTTTAAAACCGTGAGGCTGACGGTGATACGTAATGGGCCAAAGTGGACAATATCTACTAACAGTGGACTTGTGGGCTTGGGCTATCTCTTCAACAATCAATCATAAGGAATCATGATTGCTTGTGCATTAGTTCAACTTTTGTTTCAAGGCATATGAGATGCCTCTGACTGATGCAGGACAGGCTTTGAAACAGCCTAACTTTTCCCAAGCATTGAATCGTTTCATCTTATACATATCTAATACAGGTACAGAGACAGCTACAGTTACGGATTGAAGCCCAAGGCAAGTACTTAAAGAAGATAATTGAAGAGCAACAAAAACTTAGTGGGGTTCTTTCAGGAGCAGCTCCAGTTGCCTCTTCCTTCTCAGCTCCTGCTTCTGGTGAGAACTGCCCAGAAACCGACAAGAACGACCCACCAACACCTGCGCCCACATCGGAGTTTCCTCGACAAGAGAAAGCATCAAAGGAACGTGCCCAAGCCAAGAGTGTCTCTATTGATGATTCTTTCTCATCTCGTCACGAACCATTGACGCCCGATTCTGGTTGTCATAGCTCCCCAAATGAGAGCCCAAAGCCAGTGAAGAAGCAAAGACAGTACCAGGATGGTGCATTTGCCAAATCAGAGATGATACTCGCACATCAGATACTGGAGTCAAGTTTAAACTCCACTCACAAGGGATCACGCTCTGTTTTTCCAGCCAGAGAAACGTTGGATCCTTCATCTGGACTATCTATAGGGGACGATGAGCAGTTTGACTAAACCACAGGGTCTCCTAGTCCTAGTGTTCCTTGTTTTCACAAGTTTATCTTCCACGGTTAGTTCCTGTTGTAGTTTTGCTAGTCTCTACATATTGTGGTGTATCATATCATAGCACATCTCGGCACATAATTTTTTTTCCTGAGTAGGACCAATATTTGATGAACCTGATGATTATGTCCACAATCCAAGCAGGGCCCATATTTGATCTTATTAGAGAATTATCTGGCAGTAGAAACCAATCTTTGTATACACCTGCATCAAATAGTTAGAGATACATGTTCTGGTCTTTTCTGATCCATTGAATCAAGTTGGGGTTGATTTCAACATTTGTTTTGTTA

mRNA sequence

ATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNATCCAAAAATTTGAAAAGGAAGTGTGTGAAGGGAGAAATAAAAAGAGATTGAGATTTGGGGAAAGTCCGAAATGGAGCGGATTTTGCGTTCCTACAAAGCAGGCAAAGGCTGACGATGAGACTGCTCACAGTCGCAGACGCCTTCTTTCCTTTTCAACTTTGAAGGCCAACCCAAGCCAATCCAAGCCATGGGAGAGCACACTGTGACGCACAAACCGAGATACAAAAGACAGAAATGGCGCCATCCCGATATTCAGGGATGCGGTCTTAGATCTGGGTAAATGACATGTTTCAGCAACAGAGTACTCATAATTCAAGCTTTCTTCAAAACAACTCATTAGTTCGTGATCAGAACATACCTTTTGATGCCAGCTCGATGGAACCCACAAATGGAAGCAATGATCCCAGCAACACCCCGAATTTGGCCTCAAAACAGAGATTGCGATGGACACATGATCTTCACGAACGATTCGTTAATGCAGTGGCACAACTTGGTGGTCCAGATCGTGCTACACCCAAAGGCGTCCTTCGAGTGATGGGTGTTCAAGGTCTAACGATATACCACGTTAAAAGCCACCTGCAGAAATATCGACTTGCAAAATACCTTCCCGACTCATCGTCTGATGGGAAAAAGGCTGACAAGAAGGATTCTATTGACGTTCTATCGAACATTGAGGGCTCGTCGGGAATGCAAATTACCGAAGCACTTAAGCTGCAGATGGAGGTACAGAGACAGCTACAGTTACGGATTGAAGCCCAAGGCAAGTACTTAAAGAAGATAATTGAAGAGCAACAAAAACTTAGTGGGGTTCTTTCAGGAGCAGCTCCAGTTGCCTCTTCCTTCTCAGCTCCTGCTTCTGGTGAGAACTGCCCAGAAACCGACAAGAACGACCCACCAACACCTGCGCCCACATCGGAGTTTCCTCGACAAGAGAAAGCATCAAAGGAACGTGCCCAAGCCAAGAGTGTCTCTATTGATGATTCTTTCTCATCTCGTCACGAACCATTGACGCCCGATTCTGGTTGTCATAGCTCCCCAAATGAGAGCCCAAAGCCAGTGAAGAAGCAAAGACAGTACCAGGATGGTGCATTTGCCAAATCAGAGATGATACTCGCACATCAGATACTGGAGTCAAGTTTAAACTCCACTCACAAGGGATCACGCTCTGTTTTTCCAGCCAGAGAAACGTTGGATCCTTCATCTGGACTATCTATAGGGGACGATGAGCAGTTTGACTAAACCACAGGGTCTCCTAGTCCTAGTGTTCCTTGTTTTCACAAGTTTATCTTCCACGGTTAGTTCCTGTTGTAGTTTTGCTAGTCTCTACATATTGTGGTGTATCATATCATAGCACATCTCGGCACATAATTTTTTTTCCTGAGTAGGACCAATATTTGATGAACCTGATGATTATGTCCACAATCCAAGCAGGGCCCATATTTGATCTTATTAGAGAATTATCTGGCAGTAGAAACCAATCTTTGTATACACCTGCATCAAATAGTTAGAGATACATGTTCTGGTCTTTTCTGATCCATTGAATCAAGTTGGGGTTGATTTCAACATTTGTTTTGTTA

Coding sequence (CDS)

ATGTTTCAGCAACAGAGTACTCATAATTCAAGCTTTCTTCAAAACAACTCATTAGTTCGTGATCAGAACATACCTTTTGATGCCAGCTCGATGGAACCCACAAATGGAAGCAATGATCCCAGCAACACCCCGAATTTGGCCTCAAAACAGAGATTGCGATGGACACATGATCTTCACGAACGATTCGTTAATGCAGTGGCACAACTTGGTGGTCCAGATCGTGCTACACCCAAAGGCGTCCTTCGAGTGATGGGTGTTCAAGGTCTAACGATATACCACGTTAAAAGCCACCTGCAGAAATATCGACTTGCAAAATACCTTCCCGACTCATCGTCTGATGGGAAAAAGGCTGACAAGAAGGATTCTATTGACGTTCTATCGAACATTGAGGGCTCGTCGGGAATGCAAATTACCGAAGCACTTAAGCTGCAGATGGAGGTACAGAGACAGCTACAGTTACGGATTGAAGCCCAAGGCAAGTACTTAAAGAAGATAATTGAAGAGCAACAAAAACTTAGTGGGGTTCTTTCAGGAGCAGCTCCAGTTGCCTCTTCCTTCTCAGCTCCTGCTTCTGGTGAGAACTGCCCAGAAACCGACAAGAACGACCCACCAACACCTGCGCCCACATCGGAGTTTCCTCGACAAGAGAAAGCATCAAAGGAACGTGCCCAAGCCAAGAGTGTCTCTATTGATGATTCTTTCTCATCTCGTCACGAACCATTGACGCCCGATTCTGGTTGTCATAGCTCCCCAAATGAGAGCCCAAAGCCAGTGAAGAAGCAAAGACAGTACCAGGATGGTGCATTTGCCAAATCAGAGATGATACTCGCACATCAGATACTGGAGTCAAGTTTAAACTCCACTCACAAGGGATCACGCTCTGTTTTTCCAGCCAGAGAAACGTTGGATCCTTCATCTGGACTATCTATAGGGGACGATGAGCAGTTTGACTAA
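When the coding sequence occurs as one contiguous substring of the mRNA, the untranslated regions can be recovered by locating the CDS inside the mRNA. The sketch below shows the idea on a toy sequence; `split_utrs` is a hypothetical helper, and the contiguity of the CDS within the mRNA is an assumption that should be checked for any real record.

```python
def split_utrs(mrna, cds):
    """Return (five_prime_utr, three_prime_utr) by locating cds inside mrna.

    Assumes the CDS is present as a single contiguous substring of the mRNA.
    """
    offset = mrna.find(cds)
    if offset < 0:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    five_prime_utr = mrna[:offset]
    three_prime_utr = mrna[offset + len(cds):]
    return five_prime_utr, three_prime_utr


# Toy example (not data from this record): 3 nt 5' UTR, 9 nt CDS, 3 nt 3' UTR.
five, three = split_utrs("GGGATGAAATAACCC", "ATGAAATAA")
print(len(five), len(three))
```

Note that `str.find` returns the first occurrence only, so a CDS whose start also appears earlier in the 5′ UTR would still be placed correctly only if the whole CDS string matches at that position.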

Protein sequence

MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQMEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETDKNDPPTPAPTSEFPRQEKASKERAQAKSVSIDDSFSSRHEPLTPDSGCHSSPNESPKPVKKQRQYQDGAFAKSEMILAHQILESSLNSTHKGSRSVFPARETLDPSSGLSIGDDEQFD
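The protein sequence is the conceptual translation of the coding sequence. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (the page does not name the table used); `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGTTTCAGCAACAGAGTACTCATAATTCA"))
```

Applied to the first ten codons of the CDS, this yields `MFQQQSTHNS`, which agrees with the start of the protein sequence shown above.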
BLAST of Cp4.1LG10g10100 vs. Swiss-Prot
Match: PHL7_ARATH (Myb family transcription factor PHL7 OS=Arabidopsis thaliana GN=PHL7 PE=2 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 2.3e-77
Identity = 180/301 (59.80%), Positives = 212/301 (70.43%), Query Frame = 1

Query: 31  MEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT 90
           ME  NG  + S+    ASKQRLRWTH+LHERFV+AVAQLGGPDRATPKGVLRVMGVQGLT
Sbjct: 1   MEADNGGPNSSH----ASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLT 60

Query: 91  IYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQM----- 150
           IYHVKSHLQKYRLAKYLPDSSS+GKK DKK+S D+LS ++GSSGMQITEALKLQM     
Sbjct: 61  IYHVKSHLQKYRLAKYLPDSSSEGKKTDKKESGDMLSGLDGSSGMQITEALKLQMEVQKR 120

Query: 151 -----EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETDK 210
                EVQRQLQLRIEAQGKYLKKIIEEQQ+LSGVL          SAP +G+       
Sbjct: 121 LHEQLEVQRQLQLRIEAQGKYLKKIIEEQQRLSGVLGEP-------SAPVTGD------- 180

Query: 211 NDPPTPAPTSEFPRQEKASKERAQAKSVSIDDSFSSRHEPLTPDSGCH-SSPNES---PK 270
           +DP TPAPTSE P Q+K+ K+    KS+S+D+S SS  EPLTPDSGC+  SP+ES    +
Sbjct: 181 SDPATPAPTSESPLQDKSGKDCGPDKSLSVDESLSSYREPLTPDSGCNIGSPDESTGEER 240

Query: 271 PVKKQRQYQDGAFAKSEMILAHQILESSLNSTHKGSRSVFPARETLDPSSGLSIGDDEQF 318
             KK R  +  A    ++++ H ILES LN+++  S  V       D  S   +G +EQ 
Sbjct: 241 LSKKPRLVRGAAGYTPDIVVGHPILESGLNTSYHQSDHVL----AFDQPSTSLLGAEEQL 279
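The percentages reported for each HSP are the counts of identical or similar positions divided by the alignment length. The sketch below extracts the `matches/length (percent%)` fractions from an HSP summary line and recomputes the percentages; `hsp_fractions` and `percent` are hypothetical helper names.

```python
import re


def hsp_fractions(line):
    """Return (matches, length, reported_percent) for each fraction in the line."""
    found = re.findall(r"(\d+)/(\d+) \((\d+\.\d+)%\)", line)
    return [(int(m), int(n), float(p)) for m, n, p in found]


def percent(matches, length):
    """Percentage of matching positions, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


line = "Identity = 180/301 (59.80%), 212/301 (70.43%)"
for matches, length, reported in hsp_fractions(line):
    print(matches, length, reported, percent(matches, length))
```

For the PHL7_ARATH hit, 180 of 301 aligned positions are identical and 212 of 301 are scored as similar, which reproduces the reported 59.80% and 70.43%.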

BLAST of Cp4.1LG10g10100 vs. Swiss-Prot
Match: PHR1_ORYSI (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. indica GN=PHR1 PE=3 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 2.3e-32
Identity = 87/173 (50.29%), Positives = 116/173 (67.05%), Query Frame = 1

Query: 37  SNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKS 96
           S  P+N+   ASKQR+RWT +LHE FV+AV +LGG ++ATPKGVL++M V GLTIYHVKS
Sbjct: 204 SPPPNNSNASASKQRMRWTPELHESFVHAVNKLGGSEKATPKGVLKLMKVDGLTIYHVKS 263

Query: 97  HLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQMEV--------- 156
           HLQKYR A+Y PD S +GK  + K + ++  +++ S  M +TEAL+LQMEV         
Sbjct: 264 HLQKYRTARYKPDLS-EGKTQEGKTTDELSLDLKAS--MDLTEALRLQMEVQKRLHEQLE 323

Query: 157 -QRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETD 200
            QR+LQLRIE QGKYL+K+ E+Q K S   S   P +   + P+   N  + D
Sbjct: 324 IQRKLQLRIEEQGKYLQKMFEKQCK-SSTQSVQDPSSGDTATPSEPSNSVDKD 372

BLAST of Cp4.1LG10g10100 vs. Swiss-Prot
Match: PHR1_ORYSJ (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. japonica GN=PHR1 PE=2 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 2.3e-32
Identity = 87/173 (50.29%), Positives = 116/173 (67.05%), Query Frame = 1

Query: 37  SNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKS 96
           S  P+N+   ASKQR+RWT +LHE FV+AV +LGG ++ATPKGVL++M V GLTIYHVKS
Sbjct: 204 SPPPNNSNASASKQRMRWTPELHESFVHAVNKLGGSEKATPKGVLKLMKVDGLTIYHVKS 263

Query: 97  HLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQMEV--------- 156
           HLQKYR A+Y PD S +GK  + K + ++  +++ S  M +TEAL+LQMEV         
Sbjct: 264 HLQKYRTARYKPDLS-EGKTQEGKTTDELSLDLKAS--MDLTEALRLQMEVQKRLHEQLE 323

Query: 157 -QRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETD 200
            QR+LQLRIE QGKYL+K+ E+Q K S   S   P +   + P+   N  + D
Sbjct: 324 IQRKLQLRIEEQGKYLQKMFEKQCK-SSTQSVQDPSSGDTATPSEPSNSVDKD 372

BLAST of Cp4.1LG10g10100 vs. Swiss-Prot
Match: PHR1_ARATH (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Arabidopsis thaliana GN=PHR1 PE=1 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 8.7e-32
Identity = 89/206 (43.20%), Positives = 125/206 (60.68%), Query Frame = 1

Query: 30  SMEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGL 89
           S+E    S   SN+ N   K R+RWT +LHE FV AV  LGG +RATPKGVL++M V+GL
Sbjct: 206 SVELRPVSTTSSNSNNGTGKARMRWTPELHEAFVEAVNSLGGSERATPKGVLKIMKVEGL 265

Query: 90  TIYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQMEVQR 149
           TIYHVKSHLQKYR A+Y P+ S  G    K   ++ +++++   G+ ITEAL+LQMEVQ+
Sbjct: 266 TIYHVKSHLQKYRTARYRPEPSETGSPERKLTPLEHITSLDLKGGIGITEALRLQMEVQK 325

Query: 150 Q----------LQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETD 209
           Q          LQLRIE QGKYL+ + E+Q   SG+  G A         ++ ++  +++
Sbjct: 326 QLHEQLEIQRNLQLRIEEQGKYLQMMFEKQN--SGLTKGTA---------STSDSAAKSE 385

Query: 210 KNDPPTPAPTSEFPRQEKASKERAQA 226
           + D  T A + E P +E    E  ++
Sbjct: 386 QEDKKT-ADSKEVPEEETRKCEELES 399

BLAST of Cp4.1LG10g10100 vs. Swiss-Prot
Match: PHR3_ORYSI (Protein PHOSPHATE STARVATION RESPONSE 3 OS=Oryza sativa subsp. indica GN=PHR3 PE=3 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 1.3e-30
Identity = 104/250 (41.60%), Positives = 137/250 (54.80%), Query Frame = 1

Query: 21  DQNIPFDASSMEPTNGSNDPSNTPNLA-SKQRLRWTHDLHERFVNAVAQLGGPDRATPKG 80
           DQ    DA S      S+  S++   + +K RLRWT +LHERFV+AV +L GP++ATPKG
Sbjct: 236 DQEDLQDARSPAKVQLSSSRSSSGTASCNKPRLRWTPELHERFVDAVNKLEGPEKATPKG 295

Query: 81  VLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEG-SSGMQIT 140
           VL++M V+GLTIYH+KSHLQKYRLAKYLP++  D K+ +KK       N        Q+ 
Sbjct: 296 VLKLMKVEGLTIYHIKSHLQKYRLAKYLPETKEDKKQEEKKTKSVANGNDHAKKKSAQMA 355

Query: 141 EALKLQM----------EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSA 200
           EAL++QM          EVQRQLQLRIE   +YL+KI+EEQQK       A    SS ++
Sbjct: 356 EALRMQMEVQKQLHEQLEVQRQLQLRIEEHARYLQKILEEQQK-------ARESISSMTS 415

Query: 201 PASGENCPETDKNDPPTPAPTSEFPRQEKASKERAQAKSVSIDDSFSSRHEPLTPDSGCH 258
              GE+         P  AP  +   ++KA    A      I D+          D+ CH
Sbjct: 416 TTEGES---------PEFAPMEK--TEDKAETSSAPLSKCRITDT----------DAECH 457

BLAST of Cp4.1LG10g10100 vs. TrEMBL
Match: B3FNK5_CUCSA (Putative myb family transcription factor OS=Cucumis sativus GN=Csa_6G510280 PE=2 SV=1)

HSP 1 Score: 417.9 bits (1073), Expect = 1.1e-113
Identity = 227/270 (84.07%), Positives = 238/270 (88.15%), Query Frame = 1

Query: 31  MEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT 90
           M+PTNG+N  S +PNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT
Sbjct: 1   MDPTNGNNATSKSPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT 60

Query: 91  IYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQM----- 150
           IYHVKSHLQKYRLAKYLPDSSSDGKK DKKDS D+LSNI+GSSGMQITEALKLQM     
Sbjct: 61  IYHVKSHLQKYRLAKYLPDSSSDGKKTDKKDSSDILSNIDGSSGMQITEALKLQMEVQKR 120

Query: 151 -----EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETDK 210
                EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG AP AS+F+APASG+NCPE DK
Sbjct: 121 LHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG-APAASAFTAPASGDNCPEVDK 180

Query: 211 NDPPTPAPTSEFPRQEKASKERAQAKSVSIDDSFSSRHEPLTPDSGCHSSPNESPKPVKK 270
           NDP TPA TSEFPRQEK SKERAQ KSVSIDDSFSS HEPLTPDSGCHSSP+ESP+PVKK
Sbjct: 181 NDPSTPASTSEFPRQEKVSKERAQGKSVSIDDSFSSHHEPLTPDSGCHSSPSESPRPVKK 240

Query: 271 QRQYQDGAFAKSEMILAHQILESSLNSTHK 291
           Q Q        S+MILAHQILESSLNSTHK
Sbjct: 241 QIQ--------SKMILAHQILESSLNSTHK 261

BLAST of Cp4.1LG10g10100 vs. TrEMBL
Match: A0A061GGR1_THECC (Myb-like HTH transcriptional regulator family protein OS=Theobroma cacao GN=TCM_030480 PE=4 SV=1)

HSP 1 Score: 393.7 bits (1010), Expect = 2.2e-106
Identity = 225/332 (67.77%), Positives = 263/332 (79.22%), Query Frame = 1

Query: 1   MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHE 60
           M+Q ++   SS ++NNS+V  Q++   AS M+P +G N  +N PNLASKQRLRWTH+LHE
Sbjct: 60  MYQPKTVPGSSLVRNNSIVHGQHLDCGASQMDPISGGNSLTNNPNLASKQRLRWTHELHE 119

Query: 61  RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120
           RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK
Sbjct: 120 RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 179

Query: 121 DSIDVLSNIEGSSGMQITEALKLQM----------EVQRQLQLRIEAQGKYLKKIIEEQQ 180
           ++ D+LSN++GSSGMQITEALKLQM          EVQRQLQLRIEAQGKYLKKIIEEQQ
Sbjct: 180 ETGDMLSNLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ 239

Query: 181 KLSGVLSGAAPVASSFSAPASGENCPETD-KNDPPTPAPTSEFPRQEKASKERAQAKSVS 240
           +LSGVL+ A    S  S PA G+N  E+D K DP TPAPTSE P Q+KA+KERA AKS S
Sbjct: 240 RLSGVLAEAP--GSGASVPALGDNGLESDKKTDPATPAPTSESPLQDKAAKERAPAKSHS 299

Query: 241 IDDSFSSRHEPLTPDSGCH-SSPNESPKP---VKKQRQYQDGAFAKSEMILAHQILESSL 300
           ID+SFSS HEPLTPDSGCH  SP  SPK    +KKQR     AFAK E++L HQILESS+
Sbjct: 300 IDESFSSHHEPLTPDSGCHVGSPAGSPKGERLMKKQRVSMAAAFAKPEVVLPHQILESSI 359

Query: 301 NSTHKGSRSVFPARETLDPSSGLSIGDDEQFD 318
           +S+ + S SVF  RE  DPSSG+S+G+++Q +
Sbjct: 360 SSSFQQSHSVFMTREQFDPSSGISMGNEDQLE 389

BLAST of Cp4.1LG10g10100 vs. TrEMBL
Match: F6HD66_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0475g00040 PE=4 SV=1)

HSP 1 Score: 391.7 bits (1005), Expect = 8.3e-106
Identity = 220/331 (66.47%), Positives = 260/331 (78.55%), Query Frame = 1

Query: 1   MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHE 60
           M+Q ++  + S + NNSLV  Q+    A++M+P NG N  +N P+LASKQRLRWTH+LHE
Sbjct: 1   MYQPKAVPSPSLVHNNSLVHGQHSDCGANTMDPINGGNSLNNNPSLASKQRLRWTHELHE 60

Query: 61  RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120
           RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK
Sbjct: 61  RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120

Query: 121 DSIDVLSNIEGSSGMQITEALKLQM----------EVQRQLQLRIEAQGKYLKKIIEEQQ 180
           +S D+LS+++GSSGMQITEALKLQM          EVQRQLQLRIEAQGKYLKKIIEEQQ
Sbjct: 121 ESGDMLSSLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ 180

Query: 181 KLSGVLSGAAPVASSFSAPASGENCPETDKNDPPTPAPTSEFPRQEKASKERAQAKSVSI 240
           +LSGV++      S  S P SG+NC E+DK DP TPAPTSE P  +KA+KE A AKS+SI
Sbjct: 181 RLSGVITEVP--GSGVSVPVSGDNCLESDKTDPATPAPTSEGPLLDKAAKETAPAKSLSI 240

Query: 241 DDSFSSRHEPLTPDSGCH-SSPNESPK---PVKKQRQYQDGAFAKSEMILAHQILESSLN 300
           D+SFSS HEPLTPDSGCH +SP+ESPK    VKKQR     A+AK EM+L HQILESSL+
Sbjct: 241 DESFSSHHEPLTPDSGCHVNSPDESPKGERSVKKQRVSIGAAYAKQEMVLTHQILESSLS 300

Query: 301 STHKGSRSVFPARETLDPSSGLSIGDDEQFD 318
           S+     SVF  R+  DP +G+SI +++Q +
Sbjct: 301 SSFHQPHSVFLNRDQFDPQAGISISNEDQLE 329

BLAST of Cp4.1LG10g10100 vs. TrEMBL
Match: U5G8Q8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s00290g PE=4 SV=1)

HSP 1 Score: 391.7 bits (1005), Expect = 8.3e-106
Identity = 223/331 (67.37%), Positives = 260/331 (78.55%), Query Frame = 1

Query: 1   MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHE 60
           M+Q +S  +SS +  NSLV DQ +  D  +M+P NG N+ +N PNLASKQRLRWTH+LHE
Sbjct: 1   MYQLESVPSSSSVHKNSLVNDQYLDCDDMTMDPINGGNNLNNNPNLASKQRLRWTHELHE 60

Query: 61  RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120
           RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK
Sbjct: 61  RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120

Query: 121 DSIDVLSNIEGSSGMQITEALKLQM----------EVQRQLQLRIEAQGKYLKKIIEEQQ 180
           ++ D++SN++GSSGMQITEALKLQM          EVQRQLQLRIEAQGKYLKKIIEEQQ
Sbjct: 121 ETGDMISNLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ 180

Query: 181 KLSGVLSGAAPVASSFSAPASGENCPETDKNDPPTPAPTSEFPRQEKASKERAQAKSVSI 240
           +LSGVL       S  +AP SG+NCPE+DK DP TPAPTSE P Q+KA+KERA AKS+SI
Sbjct: 181 RLSGVLEDVP--GSGVTAPVSGDNCPESDKTDPATPAPTSESPLQDKAAKERAPAKSLSI 240

Query: 241 DDSFSSRHEPLTPDSGCHS-SPNESP---KPVKKQRQYQDGAFAKSEMILAHQILESSLN 300
           D+SFSS+ EPLTPDS C++ SP ESP   + +KKQR      + K EM+L HQILESSLN
Sbjct: 241 DESFSSQPEPLTPDSRCNAGSPAESPRGERSMKKQRVSIGVTYGKQEMVLTHQILESSLN 300

Query: 301 STHKGSRSVFPARETLDPSSGLSIGDDEQFD 318
           S +    S F  RE  DPSSGLS+G ++Q +
Sbjct: 301 S-YPRPHSAFLGREQFDPSSGLSMGIEDQME 328

BLAST of Cp4.1LG10g10100 vs. TrEMBL
Match: B9SGW9_RICCO (Transcription factor, putative OS=Ricinus communis GN=RCOM_0820580 PE=4 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 1.8e-105
Identity = 227/332 (68.37%), Positives = 264/332 (79.52%), Query Frame = 1

Query: 1   MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHE 60
           M+Q +S  +SS +  +SLV  Q++   AS M+  NG N  +N P+LASKQRLRWTH+LHE
Sbjct: 1   MYQLKSVPSSSLVHKSSLVHGQHLDCGASRMDAINGENSLNNNPSLASKQRLRWTHELHE 60

Query: 61  RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120
           RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK
Sbjct: 61  RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120

Query: 121 DSIDVLSNIEGSSGMQITEALKLQM----------EVQRQLQLRIEAQGKYLKKIIEEQQ 180
           ++ D+LSN++GSSGMQITEALKLQM          EVQRQLQLRIEAQGKYLKKIIEEQQ
Sbjct: 121 ETGDMLSNLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ 180

Query: 181 KLSGVLSGAAPVASSFSAPASGENCPETD-KNDPPTPAPTSEFPRQEKASKERAQAKSVS 240
           +LSGVL G  P A + +AP SG+NCPE+D K DP TPAPTSE P Q+KA+KERA AKS+S
Sbjct: 181 RLSGVL-GEVPGAVA-AAPVSGDNCPESDNKTDPATPAPTSESPIQDKAAKERAPAKSLS 240

Query: 241 IDDSFSSRHEPLTPDSGCH-SSPNESPK---PVKKQRQYQDGAFAKSEMILAHQILESSL 300
           ID+SFSSRHEPLTPDS C+  SP ESPK    +KKQR     ++ KSEM+L HQILESSL
Sbjct: 241 IDESFSSRHEPLTPDSRCNVGSPAESPKGERSMKKQRVCMGTSYGKSEMVLTHQILESSL 300

Query: 301 NSTHKGSRSVFPARETLDPSSGLSIGDDEQFD 318
           NS +    S+F +RE  DPSSGLS G+D+  +
Sbjct: 301 NS-YPQPHSLFLSREQFDPSSGLSTGNDDHIE 329

BLAST of Cp4.1LG10g10100 vs. TAIR10
Match: AT2G01060.1 (AT2G01060.1 myb-like HTH transcriptional regulator family protein)

HSP 1 Score: 290.4 bits (742), Expect = 1.3e-78
Identity = 180/301 (59.80%), Positives = 212/301 (70.43%), Query Frame = 1

Query: 31  MEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT 90
           ME  NG  + S+    ASKQRLRWTH+LHERFV+AVAQLGGPDRATPKGVLRVMGVQGLT
Sbjct: 1   MEADNGGPNSSH----ASKQRLRWTHELHERFVDAVAQLGGPDRATPKGVLRVMGVQGLT 60

Query: 91  IYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQM----- 150
           IYHVKSHLQKYRLAKYLPDSSS+GKK DKK+S D+LS ++GSSGMQITEALKLQM     
Sbjct: 61  IYHVKSHLQKYRLAKYLPDSSSEGKKTDKKESGDMLSGLDGSSGMQITEALKLQMEVQKR 120

Query: 151 -----EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETDK 210
                EVQRQLQLRIEAQGKYLKKIIEEQQ+LSGVL          SAP +G+       
Sbjct: 121 LHEQLEVQRQLQLRIEAQGKYLKKIIEEQQRLSGVLGEP-------SAPVTGD------- 180

Query: 211 NDPPTPAPTSEFPRQEKASKERAQAKSVSIDDSFSSRHEPLTPDSGCH-SSPNES---PK 270
           +DP TPAPTSE P Q+K+ K+    KS+S+D+S SS  EPLTPDSGC+  SP+ES    +
Sbjct: 181 SDPATPAPTSESPLQDKSGKDCGPDKSLSVDESLSSYREPLTPDSGCNIGSPDESTGEER 240

Query: 271 PVKKQRQYQDGAFAKSEMILAHQILESSLNSTHKGSRSVFPARETLDPSSGLSIGDDEQF 318
             KK R  +  A    ++++ H ILES LN+++  S  V       D  S   +G +EQ 
Sbjct: 241 LSKKPRLVRGAAGYTPDIVVGHPILESGLNTSYHQSDHVL----AFDQPSTSLLGAEEQL 279

BLAST of Cp4.1LG10g10100 vs. TAIR10
Match: AT4G28610.1 (AT4G28610.1 phosphate starvation response 1)

HSP 1 Score: 139.0 bits (349), Expect = 4.9e-33
Identity = 89/206 (43.20%), Positives = 125/206 (60.68%), Query Frame = 1

Query: 30  SMEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGL 89
           S+E    S   SN+ N   K R+RWT +LHE FV AV  LGG +RATPKGVL++M V+GL
Sbjct: 206 SVELRPVSTTSSNSNNGTGKARMRWTPELHEAFVEAVNSLGGSERATPKGVLKIMKVEGL 265

Query: 90  TIYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQMEVQR 149
           TIYHVKSHLQKYR A+Y P+ S  G    K   ++ +++++   G+ ITEAL+LQMEVQ+
Sbjct: 266 TIYHVKSHLQKYRTARYRPEPSETGSPERKLTPLEHITSLDLKGGIGITEALRLQMEVQK 325

Query: 150 Q----------LQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETD 209
           Q          LQLRIE QGKYL+ + E+Q   SG+  G A         ++ ++  +++
Sbjct: 326 QLHEQLEIQRNLQLRIEEQGKYLQMMFEKQN--SGLTKGTA---------STSDSAAKSE 385

Query: 210 KNDPPTPAPTSEFPRQEKASKERAQA 226
           + D  T A + E P +E    E  ++
Sbjct: 386 QEDKKT-ADSKEVPEEETRKCEELES 399

BLAST of Cp4.1LG10g10100 vs. TAIR10
Match: AT5G06800.1 (AT5G06800.1 myb-like HTH transcriptional regulator family protein)

HSP 1 Score: 135.2 bits (339), Expect = 7.1e-32
Identity = 80/178 (44.94%), Positives = 113/178 (63.48%), Query Frame = 1

Query: 7   THNSSFLQNNSLVRDQNIPFDAS---SMEPTNGSNDPSNTPNLASKQRLRWTHDLHERFV 66
           T+++S + + +    Q+ P  +    S  P+   +  S  PN  +K R+RWT DLHE+FV
Sbjct: 147 TYSNSNVTHLNFTSSQHQPKQSHPRFSSPPSFSIHGGSMAPNCVNKTRIRWTQDLHEKFV 206

Query: 67  NAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSI 126
             V +LGG D+ATPK +L+ M   GLTI+HVKSHLQKYR+AKY+P+S     K +K+   
Sbjct: 207 ECVNRLGGADKATPKAILKRMDSDGLTIFHVKSHLQKYRIAKYMPESQEG--KFEKRACA 266

Query: 127 DVLSNIEGSSGMQITEALKL----------QMEVQRQLQLRIEAQGKYLKKIIEEQQK 172
             LS ++  +G+QI EAL+L          Q+E+QR LQLRIE QGK LK ++E+QQK
Sbjct: 267 KELSQLDTRTGVQIKEALQLQLDVQRHLHEQLEIQRNLQLRIEEQGKQLKMMMEQQQK 322

BLAST of Cp4.1LG10g10100 vs. TAIR10
Match: AT5G29000.2 (AT5G29000.2 Homeodomain-like superfamily protein)

HSP 1 Score: 133.3 bits (334), Expect = 2.7e-31
Identity = 78/156 (50.00%), Positives = 108/156 (69.23%), Query Frame = 1

Query: 29  SSMEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQG 88
           SS +  +G N  S+     SKQR+RWT +LHE FV AV QLGG +RATPK VL+++   G
Sbjct: 213 SSEDQLSGRNSSSSVAT--SKQRMRWTPELHEAFVEAVNQLGGSERATPKAVLKLLNNPG 272

Query: 89  LTIYHVKSHLQKYRLAKYLPDSSS-DGKKADKK-DSIDVLSNIEGSSGMQITEALKLQME 148
           LTIYHVKSHLQKYR A+Y P++S   G+  +KK  SI+ + +++  + ++IT+AL+LQME
Sbjct: 273 LTIYHVKSHLQKYRTARYKPETSEVTGEPQEKKMTSIEDIKSLDMKTSVEITQALRLQME 332

Query: 149 V----------QRQLQLRIEAQGKYLKKIIEEQQKL 173
           V          QR LQL+IE QG+YL+ + E+QQK+
Sbjct: 333 VQKRLHEQLEIQRSLQLQIEKQGRYLQMMFEKQQKI 366

BLAST of Cp4.1LG10g10100 vs. TAIR10
Match: AT3G04450.1 (AT3G04450.1 Homeodomain-like superfamily protein)

HSP 1 Score: 132.9 bits (333), Expect = 3.5e-31
Identity = 93/248 (37.50%), Positives = 147/248 (59.27%), Query Frame = 1

Query: 1   MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHE 60
           ++ +  T +S   +   + R+Q+      SMEP N  + P+++  + SKQR+RWT +LHE
Sbjct: 194 LYSKIETQSSDIARQEIVFRNQHQV--DPSMEPFNAKSPPASS--MTSKQRMRWTPELHE 253

Query: 61  RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120
            FV A+ QLGG +RATPK VL+++   GLT+YHVKSHLQKYR A+Y P+ S D ++   K
Sbjct: 254 AFVEAINQLGGSERATPKAVLKLINSPGLTVYHVKSHLQKYRTARYKPELSKDTEEPLVK 313

Query: 121 D--SIDVLSNIEGSSGMQITEALKLQM----------EVQRQLQLRIEAQGKYLKKIIEE 180
           +  +I+ + +++  + ++ITEAL+LQM          E+QR LQL+IE QG+YL+ +IE+
Sbjct: 314 NLKTIEDIKSLDLKTSIEITEALRLQMKVQKQLHEQLEIQRSLQLQIEEQGRYLQMMIEK 373

Query: 181 QQKLSGVLSGAAPVASSFSAPASGENCPETDKNDPPTPAPTSEFPRQEKASKERAQAKSV 237
           QQK+           SS S P +  + P  + + P     T+     E +  ++ Q  S 
Sbjct: 374 QQKMQ---ENKKDSTSSSSMPEADPSAPSPNLSQPFLHKATN----SEPSITQKLQNGSS 430

BLAST of Cp4.1LG10g10100 vs. NCBI nr
Match: gi|659080718|ref|XP_008440942.1| (PREDICTED: protein PHR1-LIKE 1-like [Cucumis melo])

HSP 1 Score: 440.3 bits (1131), Expect = 2.9e-120
Identity = 235/270 (87.04%), Positives = 244/270 (90.37%), Query Frame = 1

Query: 31  MEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT 90
           M+PTNG+N  S  PNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT
Sbjct: 1   MDPTNGNNATSKNPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT 60

Query: 91  IYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQM----- 150
           IYHVKSHLQKYRLAKYLPDSSSDGKK DKKDS D+LSNI+GSSGMQITEALKLQM     
Sbjct: 61  IYHVKSHLQKYRLAKYLPDSSSDGKKTDKKDSSDILSNIDGSSGMQITEALKLQMEVQKR 120

Query: 151 -----EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETDK 210
                EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG AP AS+F+APASG+NCPE DK
Sbjct: 121 LHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG-APAASAFTAPASGDNCPEADK 180

Query: 211 NDPPTPAPTSEFPRQEKASKERAQAKSVSIDDSFSSRHEPLTPDSGCHSSPNESPKPVKK 270
           NDPPTPA TSEFPRQEK SKERAQ KSVSIDDSFSS HEPLTPDSGCHSSP+ES +PVKK
Sbjct: 181 NDPPTPASTSEFPRQEKVSKERAQGKSVSIDDSFSSHHEPLTPDSGCHSSPSESARPVKK 240

Query: 271 QRQYQDGAFAKSEMILAHQILESSLNSTHK 291
           QRQ  DGAFAKSEMILAHQILESSLNSTHK
Sbjct: 241 QRQDMDGAFAKSEMILAHQILESSLNSTHK 269

BLAST of Cp4.1LG10g10100 vs. NCBI nr
Match: gi|778720003|ref|XP_011658093.1| (PREDICTED: protein PHR1-LIKE 1 [Cucumis sativus])

HSP 1 Score: 417.9 bits (1073), Expect = 1.6e-113
Identity = 227/270 (84.07%), Positives = 238/270 (88.15%), Query Frame = 1

Query: 31  MEPTNGSNDPSNTPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT 90
           M+PTNG+N  S +PNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT
Sbjct: 1   MDPTNGNNATSKSPNLASKQRLRWTHDLHERFVNAVAQLGGPDRATPKGVLRVMGVQGLT 60

Query: 91  IYHVKSHLQKYRLAKYLPDSSSDGKKADKKDSIDVLSNIEGSSGMQITEALKLQM----- 150
           IYHVKSHLQKYRLAKYLPDSSSDGKK DKKDS D+LSNI+GSSGMQITEALKLQM     
Sbjct: 61  IYHVKSHLQKYRLAKYLPDSSSDGKKTDKKDSSDILSNIDGSSGMQITEALKLQMEVQKR 120

Query: 151 -----EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAAPVASSFSAPASGENCPETDK 210
                EVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG AP AS+F+APASG+NCPE DK
Sbjct: 121 LHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG-APAASAFTAPASGDNCPEVDK 180

Query: 211 NDPPTPAPTSEFPRQEKASKERAQAKSVSIDDSFSSRHEPLTPDSGCHSSPNESPKPVKK 270
           NDP TPA TSEFPRQEK SKERAQ KSVSIDDSFSS HEPLTPDSGCHSSP+ESP+PVKK
Sbjct: 181 NDPSTPASTSEFPRQEKVSKERAQGKSVSIDDSFSSHHEPLTPDSGCHSSPSESPRPVKK 240

Query: 271 QRQYQDGAFAKSEMILAHQILESSLNSTHK 291
           Q Q        S+MILAHQILESSLNSTHK
Sbjct: 241 QIQ--------SKMILAHQILESSLNSTHK 261

BLAST of Cp4.1LG10g10100 vs. NCBI nr
Match: gi|743923392|ref|XP_011005787.1| (PREDICTED: protein PHR1-LIKE 1-like isoform X1 [Populus euphratica])

HSP 1 Score: 394.4 bits (1012), Expect = 1.8e-106
Identity = 225/326 (69.02%), Positives = 257/326 (78.83%), Query Frame = 1

Query: 1   MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHE 60
           M+Q +S  +SS +  NSLV DQ +  D  +M+P NG N+ +N PNLASKQRLRWTH+LHE
Sbjct: 1   MYQLESVPSSSSVHKNSLVNDQYLDCDDMAMDPINGGNNLNNNPNLASKQRLRWTHELHE 60

Query: 61  RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120
           RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK
Sbjct: 61  RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120

Query: 121 DSIDVLSNIEGSSGMQITEALKLQM----------EVQRQLQLRIEAQGKYLKKIIEEQQ 180
           ++ DV+SN++GSSGMQITEALKLQM          EVQRQLQLRIEAQGKYLKKIIEEQQ
Sbjct: 121 ETGDVISNLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ 180

Query: 181 KLSGVLSGAAPVASSFSAPASGENCPETDKNDPPTPAPTSEFPRQEKASKERAQAKSVSI 240
           +LSGVL       S  +APASG+NCPE+DK DP TPAPTSE P Q+KA+KERA AKS+SI
Sbjct: 181 RLSGVLEDVP--GSGVAAPASGDNCPESDKTDPATPAPTSESPLQDKAAKERAPAKSLSI 240

Query: 241 DDSFSSRHEPLTPDSGCHS-SPNESP---KPVKKQRQYQDGAFAKSEMILAHQILESSLN 300
           D+SFSS+ EPLTPDS C++ SP ESP   + +KKQR      + K EM+L HQILESSLN
Sbjct: 241 DESFSSQPEPLTPDSRCNAGSPAESPRGERSIKKQRVSMGVTYGKQEMVLTHQILESSLN 300

Query: 301 STHKGSRSVFPARETLDPSSGLSIGD 313
           S +    S F  RE  DPSSGLSI D
Sbjct: 301 S-YPRPHSAFLGREQFDPSSGLSIED 323

BLAST of Cp4.1LG10g10100 vs. NCBI nr
Match: gi|590627399|ref|XP_007026438.1| (Myb-like HTH transcriptional regulator family protein [Theobroma cacao])

HSP 1 Score: 393.7 bits (1010), Expect = 3.1e-106
Identity = 225/332 (67.77%), Positives = 263/332 (79.22%), Query Frame = 1

Query: 1   MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHE 60
           M+Q ++   SS ++NNS+V  Q++   AS M+P +G N  +N PNLASKQRLRWTH+LHE
Sbjct: 60  MYQPKTVPGSSLVRNNSIVHGQHLDCGASQMDPISGGNSLTNNPNLASKQRLRWTHELHE 119

Query: 61  RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120
           RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK
Sbjct: 120 RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 179

Query: 121 DSIDVLSNIEGSSGMQITEALKLQM----------EVQRQLQLRIEAQGKYLKKIIEEQQ 180
           ++ D+LSN++GSSGMQITEALKLQM          EVQRQLQLRIEAQGKYLKKIIEEQQ
Sbjct: 180 ETGDMLSNLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ 239

Query: 181 KLSGVLSGAAPVASSFSAPASGENCPETD-KNDPPTPAPTSEFPRQEKASKERAQAKSVS 240
           +LSGVL+ A    S  S PA G+N  E+D K DP TPAPTSE P Q+KA+KERA AKS S
Sbjct: 240 RLSGVLAEAP--GSGASVPALGDNGLESDKKTDPATPAPTSESPLQDKAAKERAPAKSHS 299

Query: 241 IDDSFSSRHEPLTPDSGCH-SSPNESPKP---VKKQRQYQDGAFAKSEMILAHQILESSL 300
           ID+SFSS HEPLTPDSGCH  SP  SPK    +KKQR     AFAK E++L HQILESS+
Sbjct: 300 IDESFSSHHEPLTPDSGCHVGSPAGSPKGERLMKKQRVSMAAAFAKPEVVLPHQILESSI 359

Query: 301 NSTHKGSRSVFPARETLDPSSGLSIGDDEQFD 318
           +S+ + S SVF  RE  DPSSG+S+G+++Q +
Sbjct: 360 SSSFQQSHSVFMTREQFDPSSGISMGNEDQLE 389

BLAST of Cp4.1LG10g10100 vs. NCBI nr
Match: gi|566173713|ref|XP_006380857.1| (hypothetical protein POPTR_0006s00290g [Populus trichocarpa])

HSP 1 Score: 391.7 bits (1005), Expect = 1.2e-105
Identity = 223/331 (67.37%), Positives = 260/331 (78.55%), Query Frame = 1

Query: 1   MFQQQSTHNSSFLQNNSLVRDQNIPFDASSMEPTNGSNDPSNTPNLASKQRLRWTHDLHE 60
           M+Q +S  +SS +  NSLV DQ +  D  +M+P NG N+ +N PNLASKQRLRWTH+LHE
Sbjct: 1   MYQLESVPSSSSVHKNSLVNDQYLDCDDMTMDPINGGNNLNNNPNLASKQRLRWTHELHE 60

Query: 61  RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120
           RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK
Sbjct: 61  RFVDAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDSSSDGKKADKK 120

Query: 121 DSIDVLSNIEGSSGMQITEALKLQM----------EVQRQLQLRIEAQGKYLKKIIEEQQ 180
           ++ D++SN++GSSGMQITEALKLQM          EVQRQLQLRIEAQGKYLKKIIEEQQ
Sbjct: 121 ETGDMISNLDGSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ 180

Query: 181 KLSGVLSGAAPVASSFSAPASGENCPETDKNDPPTPAPTSEFPRQEKASKERAQAKSVSI 240
           +LSGVL       S  +AP SG+NCPE+DK DP TPAPTSE P Q+KA+KERA AKS+SI
Sbjct: 181 RLSGVLEDVP--GSGVTAPVSGDNCPESDKTDPATPAPTSESPLQDKAAKERAPAKSLSI 240

Query: 241 DDSFSSRHEPLTPDSGCHS-SPNESP---KPVKKQRQYQDGAFAKSEMILAHQILESSLN 300
           D+SFSS+ EPLTPDS C++ SP ESP   + +KKQR      + K EM+L HQILESSLN
Sbjct: 241 DESFSSQPEPLTPDSRCNAGSPAESPRGERSMKKQRVSIGVTYGKQEMVLTHQILESSLN 300

Query: 301 STHKGSRSVFPARETLDPSSGLSIGDDEQFD 318
           S +    S F  RE  DPSSGLS+G ++Q +
Sbjct: 301 S-YPRPHSAFLGREQFDPSSGLSMGIEDQME 328
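
The identity and positives percentages in the HSP header for this hit are plain ratios of matching columns to alignment length. A minimal Python sketch of that arithmetic, assuming the counts reported for the Populus trichocarpa hit (223 identical and 260 positive columns out of 331); the helper name `percent` is hypothetical and is not part of any BLAST tool:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP header of the Populus trichocarpa hit (assumed inputs).
identity = percent(223, 331)
positives = percent(260, 331)

print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same ratio applies to any HSP on this page; only the two counts and the alignment length change.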

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
PHL7_ARATH  2.3e-77  59.80         Myb family transcription factor PHL7 OS=Arabidopsis thaliana GN=PHL7 PE=2 SV=1
PHR1_ORYSI  2.3e-32  50.29         Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. indica GN=PHR1 PE...
PHR1_ORYSJ  2.3e-32  50.29         Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. japonica GN=PHR1 ...
PHR1_ARATH  8.7e-32  43.20         Protein PHOSPHATE STARVATION RESPONSE 1 OS=Arabidopsis thaliana GN=PHR1 PE=1 SV=...
PHR3_ORYSI  1.3e-30  41.60         Protein PHOSPHATE STARVATION RESPONSE 3 OS=Oryza sativa subsp. indica GN=PHR3 PE...
Match Name        E-value   Identity (%)  Description
B3FNK5_CUCSA      1.1e-113  84.07         Putative myb family transcription factor OS=Cucumis sativus GN=Csa_6G510280 PE=2...
A0A061GGR1_THECC  2.2e-106  67.77         Myb-like HTH transcriptional regulator family protein OS=Theobroma cacao GN=TCM_...
F6HD66_VITVI      8.3e-106  66.47         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0475g00040 PE=4 SV=...
U5G8Q8_POPTR      8.3e-106  67.37         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s00290g PE=4 SV=1
B9SGW9_RICCO      1.8e-105  68.37         Transcription factor, putative OS=Ricinus communis GN=RCOM_0820580 PE=4 SV=1
Match Name   E-value  Identity (%)  Description
AT2G01060.1  1.3e-78  59.80         myb-like HTH transcriptional regulator family protein
AT4G28610.1  4.9e-33  43.20         phosphate starvation response 1
AT5G06800.1  7.1e-32  44.94         myb-like HTH transcriptional regulator family protein
AT5G29000.2  2.7e-31  50.00         Homeodomain-like superfamily protein
AT3G04450.1  3.5e-31  37.50         Homeodomain-like superfamily protein
Match Name                        E-value   Identity (%)  Description
gi|659080718|ref|XP_008440942.1|  2.9e-120  87.04         PREDICTED: protein PHR1-LIKE 1-like [Cucumis melo]
gi|778720003|ref|XP_011658093.1|  1.6e-113  84.07         PREDICTED: protein PHR1-LIKE 1 [Cucumis sativus]
gi|743923392|ref|XP_011005787.1|  1.8e-106  69.02         PREDICTED: protein PHR1-LIKE 1-like isoform X1 [Populus euphratica]
gi|590627399|ref|XP_007026438.1|  3.1e-106  67.77         Myb-like HTH transcriptional regulator family protein [Theobroma cacao]
gi|566173713|ref|XP_006380857.1|  1.2e-105  67.37         hypothetical protein POPTR_0006s00290g [Populus trichocarpa]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
Vocabulary: INTERPRO
Term       Definition
IPR025756  Myb_CC_LHEQLE
IPR017930  Myb_dom
IPR009057  Homeobox-like_sf
IPR006447  Myb_dom_plants
IPR001005  SANT/Myb
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0044237 cellular metabolic process
biological_process GO:0006499 N-terminal protein myristoylation
biological_process GO:1902582 single-organism intracellular transport
biological_process GO:0044763 single-organism cellular process
biological_process GO:0034645 cellular macromolecule biosynthetic process
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0071704 organic substance metabolic process
biological_process GO:0016558 protein import into peroxisome matrix
biological_process GO:0006891 intra-Golgi vesicle-mediated transport
biological_process GO:0006635 fatty acid beta-oxidation
biological_process GO:0044699 single-organism process
biological_process GO:0044238 primary metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g10100.1  Cp4.1LG10g10100.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                             Source    Source Term       Source Description                                             Alignment
IPR001005  SANT/Myb domain                                             PFAM      PF00249           Myb_DNA-binding                                                coord: 51..102; score: 5.
IPR006447  Myb domain, plants                                          TIGRFAMs  TIGR01557         TIGR01557                                                      coord: 49..104; score: 5.5
IPR009057  Homeodomain-like                                            GENE3D    G3DSA:1.10.10.60  -                                                              coord: 48..104; score: 5.2
IPR009057  Homeodomain-like                                            unknown   SSF46689          Homeodomain-like                                               coord: 48..103; score: 3.58
IPR017930  Myb domain                                                  PROFILE   PS51294           HTH_MYB                                                        coord: 46..106; score: 13
IPR025756  MYB-CC type transcription factor, LHEQLE-containing domain  PFAM      PF14379           Myb_CC_LHEQLE                                                  coord: 135..171; score: 8.7
None       No IPR available                                            PANTHER   PTHR31314         FAMILY NOT NAMED                                               coord: 3..317; score: 2.1E
None       No IPR available                                            PANTHER   PTHR31314:SF5     MYB-LIKE HTH TRANSCRIPTIONAL REGULATOR FAMILY PROTEIN-RELATED  coord: 3..317; score: 2.1E