Cp4.1LG05g08380 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG05g08380
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myb family transcription factor family protein
Location: Cp4.1LG05 : 5326885 .. 5332070 (-)
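The location field packs the linkage group, start, end, and strand into one string. A minimal sketch of splitting it apart and computing the span length follows; the function name `parse_location` and the assumption that the coordinates are 1-based and inclusive are illustrative and not stated on this page.

```python
import re

# Pattern for a location string of the form "<seqid> : <start> .. <end> (<strand>)".
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into its parts and compute the span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


location = parse_location("Cp4.1LG05 : 5326885 .. 5332070 (-)")
print(location)
```

The `(-)` strand means the gene is read from the reverse complement of the reference, so sequences shown for the feature run from the higher coordinate toward the lower one.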
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TATTGTGGTTCGGTTCGGTGTTCGGTTCAACGTTTTTCTCTGAATTGGCCGTTCACTATCCAATTAAGCCAAGAAGAAGTGTGGCAGAGGCCAATTTTCTTTTTCGAAAAATTCCTTTGCCCAATCATTTGTGTGTATGGGGAAATCGGATGTGGAATCAGCGGTGTTGAATCAGCGCCCAGAAAGGATCAGAGGGTCGAAATTGGATTCGTGAATTGTTCATCATGGAAGAGAGAGAGACACCCCGATACGTTTCTCTCTGAGACAAGAATCCGTGTGAGTGTCTGTTTAATCTCGTTGACGGCAAATCGACAAAGCCATCGAGCGCATCGGAACCCAAATCTTTGCTACTGCCTCTGAAATTTCGAACCCTCCTTGTGTTTTGATCTCTAGGGTAAGTGATTCTGTTTTAACCCTTTTGTGATTTTGTGATTTTGTGATTTTGAACTGTTCTTTTGCTGATTTCATTTCTGGGTTGTAATTTTATTGATTGTTTTGCGTTTTTATCAGTTTCCCCCTTTTAATTTTGTACTGGATTTGTGTTCTTGATATATTTGTGTTGAGTTCTAGCGATATAGAACGAGCTTTCAATGATTGTATGATTGATTAGGTCATGAAATTCTGTGTAATTGTGTTTTTGTTTTTGGATATATGTTTTTCTCTACTACAGAGTGAAGGGAAGAGGGTGGCTTCTGAAGTTATGTCATCATCTTATCGAGTTCTTCCTAAGCCATTTGAAGAAAAGTATCCCAAATTGCCTGTTTCTTGTCAGGGTTCTTCACATAGTGAAGCCATGAGACATCCAATTCCCAGACAGGCCCCTCCATTAGTTTCTGGCACTGTTGGGCATTTGTTTTCATCATCTTCTGGATTACGAAACGGTTTTCCGTTGATTCAACCTTTGTCTCAAGAGAGAAATGCCGAGTTTTCTCCGTTCATTTCGTCGTCTGCTAATGATGAACCGTTGTTGGCTCCTCGAGATTCTTCCCATTCAGAAGTGGTGACTAATGACTTGAATGAGAATAATGCATCTTGGAGTGATGATACCCTTCAGGGTTTGCTTGATTTTTCTGAGAATGTTCCTGACCAGAATGGTCAAGCGCAAAGCATTGCTGGTGTGTTAATGGCAGATGAACAAGATAAGAGGAATGATTGGCCTGATTGGGCTGATCAGTTTATTTCTGTGGATGATGCTTTGGAGCCAAATTGGAATGAGATTTTTTCTGATGGTAATGGAGGAGATCCTAAACCAGAGGTTGGTTCGTTCTTTCGTTTTTTTATGCCCGATCTTACTCTTCCGTAGTGTATTTCTCTTCGTGGGTATGTTTGCTCAACAAAAAACAGGTTAGATTCTGCCTATAAAATGTATGGGACTTAAAAAGGAAGGATGAAAGGGAATAAACAAAGGAAAGAAAGAGAGAGGGAACTAAATCAATGTAACGGCCTAAGCCCACCGCTAGTAGATATTGACCTCTTTGAGCTTTCCCTTTCGGGCTTCCCTTCAAGGTTTTTAAAACGTGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGCTTCGTTCCTCTCTCCAACCGATGTGGGAATCCTCCCCCTTTCGGGGTGTAGTGTCCTTGTTGGCACTTGTTCCTCTCTCCAATCGATATGGGATCTCACAATCAAAATGACATTAGAAGAGTACCTATACCCGAAAACCCGAAATTAATGAAAATGTGGCTAAACTGGTTCTGATCATTCAATGAGATGTAATGGAAGAGAGGAGGGTATTCCTAAACAAAGAAAGGATCAATTACTAATTACGAGCCATATGACGTGAAAATCTGATGTACGGTTCAATCAAGCCCTTGACGTTTCTTTGAAAAACTTGGTCTCTGGTTAAACTACACAGCCACTGGAGTTATGAATGCTAGAGTGCTTGGCCTTTTTAAGATAATGTGTGTAGTACTCGTTTTTTAATGCTCGAGTGTCTTCTTGGTGAATGATGCCTATGAAATGCTT
TGCTTTTTTGTACAAGATGTGTATGTCATTGTTTAAAGGCAGGCTAACTTATAGTTTATAATGTTAGTTAATGGTGTTAACGACATGGACGTTCGTTCTACCCATTTTCGATCTTCTTTCTTCAAATTCTTATTCTGTTGAAAAAAATGTAGAATCATCAACTTTTTACCTAAAGTTCTTGGTGCAAATTTTACAGTTGCTCAAATCATCATCAGCTGGTTTCCATGCTCCACAGAACCAAACAAATCAAGTTGACTCTGTACCTACAGCAGAGTTTCACTCGGTTTCTTCTAATTCACTCTCAACATCAACTAACCGACCTCGGATGCGATGGACACCAGAGCTTCACGAGGCATTTGTAGAAGCTGTCAATAAACTAGGTGGCAGCGAAAGTATATTACTTTCTTTCATCTGTGTTTACCGTTGTTTAAGAACGCTTGAAAAGTTGCAAATTTTTCTCAATCATTGACCCTATAGTTTAATGGACAGATGCAACTCCTAAGGGTGTCTTAAAGCTTATGAATGTTGAAGGCCTCACCATTTATCATGTCAAAAGCCACCTACAGGTACGCATCATTTTTTTTCGATTGTTATACGGTTTCACTCGGCATGCTTATTTCTAAGCTTTATTATGATTTAATGTCAGAAATATAGAACCGCCAGATATAGACCGGAGTCATCAGAAGGTATCGGTGACTGTTTCCCTTCTCTTTGTTCGTTTACTTCATTACCTTCTTTATCATCGATGTCGTTATCTTTGTTGAACATAAAACCGCCTGATCGATGGTTGAATTCATTTTTCCTCTGTTCAGGATCTTCAGGGAAAAAAATAAATCATATTGAGGAAATGAAGTCTCTTGACTTGAAAACGTAAGCAATTTTTTCCCTTCTAATATATATGTTTTCTTGTGTGGCTGCATTTTCTTCTGGCTTATTCGTTTTTACAGTGGGAGTGCGGAGTTTGAATCATAAAATTTGAAACATTTAGCATGTTTTGATGTTTTTTTCTTTGCTACTTTTGAAAGGCATGTTTTCTAATAGAGATGTCCATTTTAACCCACGGGGCGGGGATTCCCCGTTTAGATGGAGGATGAGATTATATTCACTGTTTCCGTTTTCCCCCAATTCCCGCCCTCTAAATCCCCGCGAGGAACCCCATTCCCGTGAATTCCCTGCAGGAAATCCCCACCCCGCCTCATGGACATCTCTACCATCCCCTAGGAAAAATACTTGGAAGAAAAAGTAACTAGTCTAAAGCGGTGGAAGCATCCCAAATTCTTTATATTTCTCAGTTTTCAGATCATTTCCATGAAACCTGTTTGTAAAAGTCCTCTTCCTTCTCTTGAGCCGTGATGCCATTAAGGCTGGAAGTCATTTGTAATTAATTTCAGTGGCTCTACTTTTGTATCTTAGATGTAATCAAGACAAATAATATGTAACCGTCAAGCCCACCGCTAGCAGATATTGTCCTCTTTGGGCTTTTCCTTCCGGGCTTCCCCTCAAAATTTTAAAACGCGTCAGCTAGGGAGAGGTTTTTACACCCTTATAAGGAATGCTTAGTTCCCCTCTCCAACCAATGTGGGATCTCACAATCCACCCCCCTTGGGGGCTTAACGTCCTCGTTGGCACACCGCCCAGTGTCTAGTTCTGATACCATTTGTAACAGCACAAGCCCACCGCTAACAGATACTGCCTCTTTGGGCTTTCCCTTCCGGGCTTCCCCTCAAGGTTTTAAAACACGTCTGCTAGGGAGAAGTTTCCACACCCTTATTAAGGAATGCTTTGTGCCCCTCTTCAACCAAAGTGAGATCTCACATAACAATAGGGAATTTTTCAAGTTCATCCTCGTCTAAGCTTGAATAAGTGGTAAAATCAGGCACCCAACTTAAAAGTAACTAGCATCGGACAAATTAACTGAGGCAATTTCTTTATTTTTTCGAAAGTTTAAATCGACTCTCTGAGCTAGCTATGACATTTTGTTGCTAAATTTGCTGATATCA
TAAGATTGCATACTGTATGCTACTAAAGAAACAAGCTTCAGTGGTTATTTTTCATTTCTTCTACACAAAAAATGATGTTACTGTGTAGTAAATACTCAAATCCCAGGCTTATTAGGTTTAGAAACTGAAATTGTGAACATAAATTCTTGGGAATGGTCTTAGTGCTGAACTCCTTTGTGCACAATTCAGGAGTATGGGAATAACTGAAGCTTTGCGTTTGCAAATGGAGGTTCAGAAGCGGCTCCATGAACAACTCGAGGTCTGCATTTTGTTTGCTTTTGAAGGTTCATTACGTTTGAGTCGAAGTCGGTTTCTCGTTCAGTCGAAGTCGGTTTTTGCCTGGAGTTTTCTAAATATTCTTATTCTGTAAACTTATGCAGATCCAAAGGAATTTACAATTACGAATTGAAGAACAAGGTAAGTACCTACAGGAGATGTTTGAGCAACAACGAAAGATGGAAAGCAAGCTAAAGACCTCATCGTCGATCCTCGAGAATATGCCTCGTCCTAATGATCAACCCGAAAATTCAGAACAAGGTCATGTTGCAGCTGGTATGACCACGGAGAACGCTGAGGGCGCTCGAGAAGATGGTTTGCCCGCTGCAAGTAGGAAGCAGAAGGCACATGAAAGAGAAGCGATTGAATCGAGTGAAGGTAACTCTTCTTCCCCGGACACAAAACGAGCTAAATCAGATGCAACAGCTTCATAGCAGTCTGTTTCGGAGTCGAGATGTTCATTGATGGAGCTCTGAAACCAGGGAGAATACCAATCCTTGCAGTACAATTGGAAACAATTGGCTTCTTCCTTTATTTAAGGTGGGAAGCCATTTGGCAATTGACTTAGGAGCTCTCCCAACTAATTGTGTATAAGATTATTGGCCTTAGTGGATTGTGATCTTCACACTCTCACCATTTTATTTTATGCATTGTCTCGAGTCTCGATCTTGGTTCGGGTTTCGGGTTTGACTTGCTATCAAACTTCGGACAAAATATTGTTGTTTTTCCTTTTTTCTCATTTGTTAGGAAATAAATATTTGTAAGTGGGAAAAATATATACTCCACTTATCAGTTTAATTCAGAGTCATGAGTCGACCCTCTCTACTCACTCGGACATACTTATTAAAAACTGACTCGTGAGTCGATTTTATCATTTGGATACCTTGAACATCCATCCAAATACTATGGA

mRNA sequence

TATTGTGGTTCGGTTCGGTGTTCGGTTCAACGTTTTTCTCTGAATTGGCCGTTCACTATCCAATTAAGCCAAGAAGAAGTGTGGCAGAGGCCAATTTTCTTTTTCGAAAAATTCCTTTGCCCAATCATTTGTGTGTATGGGGAAATCGGATGTGGAATCAGCGGTGTTGAATCAGCGCCCAGAAAGGATCAGAGGGTCGAAATTGGATTCGTGAATTGTTCATCATGGAAGAGAGAGAGACACCCCGATACGTTTCTCTCTGAGACAAGAATCCGTGTGAGTGTCTGTTTAATCTCGTTGACGGCAAATCGACAAAGCCATCGAGCGCATCGGAACCCAAATCTTTGCTACTGCCTCTGAAATTTCGAACCCTCCTTGTGTTTTGATCTCTAGGAGTGAAGGGAAGAGGGTGGCTTCTGAAGTTATGTCATCATCTTATCGAGTTCTTCCTAAGCCATTTGAAGAAAAGTATCCCAAATTGCCTGTTTCTTGTCAGGGTTCTTCACATAGTGAAGCCATGAGACATCCAATTCCCAGACAGGCCCCTCCATTAGTTTCTGGCACTGTTGGGCATTTGTTTTCATCATCTTCTGGATTACGAAACGGTTTTCCGTTGATTCAACCTTTGTCTCAAGAGAGAAATGCCGAGTTTTCTCCGTTCATTTCGTCGTCTGCTAATGATGAACCGTTGTTGGCTCCTCGAGATTCTTCCCATTCAGAAGTGGTGACTAATGACTTGAATGAGAATAATGCATCTTGGAGTGATGATACCCTTCAGGGTTTGCTTGATTTTTCTGAGAATGTTCCTGACCAGAATGGTCAAGCGCAAAGCATTGCTGGTGTGTTAATGGCAGATGAACAAGATAAGAGGAATGATTGGCCTGATTGGGCTGATCAGTTTATTTCTGTGGATGATGCTTTGGAGCCAAATTGGAATGAGATTTTTTCTGATGGTAATGGAGGAGATCCTAAACCAGAGTTGCTCAAATCATCATCAGCTGGTTTCCATGCTCCACAGAACCAAACAAATCAAGTTGACTCTGTACCTACAGCAGAGTTTCACTCGGTTTCTTCTAATTCACTCTCAACATCAACTAACCGACCTCGGATGCGATGGACACCAGAGCTTCACGAGGCATTTGTAGAAGCTGTCAATAAACTAGGTGGCAGCGAAAATGCAACTCCTAAGGGTGTCTTAAAGCTTATGAATGTTGAAGGCCTCACCATTTATCATGTCAAAAGCCACCTACAGAAATATAGAACCGCCAGATATAGACCGGAGTCATCAGAAGGATCTTCAGGGAAAAAAATAAATCATATTGAGGAAATGAAGTCTCTTGACTTGAAAACGAGTATGGGAATAACTGAAGCTTTGCGTTTGCAAATGGAGGTTCAGAAGCGGCTCCATGAACAACTCGAGATCCAAAGGAATTTACAATTACGAATTGAAGAACAAGGTAAGTACCTACAGGAGATGTTTGAGCAACAACGAAAGATGGAAAGCAAGCTAAAGACCTCATCGTCGATCCTCGAGAATATGCCTCGTCCTAATGATCAACCCGAAAATTCAGAACAAGGTCATGTTGCAGCTGGTATGACCACGGAGAACGCTGAGGGCGCTCGAGAAGATGGTTTGCCCGCTGCAAGTAGGAAGCAGAAGGCACATGAAAGAGAAGCGATTGAATCGAGTGAAGGTAACTCTTCTTCCCCGGACACAAAACGAGCTAAATCAGATGCAACAGCTTCATAGCAGTCTGTTTCGGAGTCGAGATGTTCATTGATGGAGCTCTGAAACCAGGGAGAATACCAATCCTTGCAGTACAATTGGAAACAATTGGCTTCTTCCTTTATTTAAGGTGGGAAGCCATTTGGCAATTGACTTAGGAGCTCTCCCAACTAATTGTGTATAAGATTATTGGCCTTAGTGGATTGTGATCTTCACACTCTCACCATTTTATTTTATGCATTGTCTCGAGTCTCGATCTTGGTTCGGGTTTCGG
GTTTGACTTGCTATCAAACTTCGGACAAAATATTGTTGTTTTTCCTTTTTTCTCATTTGTTAGGAAATAAATATTTGTAAGTGGGAAAAATATATACTCCACTTATCAGTTTAATTCAGAGTCATGAGTCGACCCTCTCTACTCACTCGGACATACTTATTAAAAACTGACTCGTGAGTCGATTTTATCATTTGGATACCTTGAACATCCATCCAAATACTATGGA

Coding sequence (CDS)

ATGTCATCATCTTATCGAGTTCTTCCTAAGCCATTTGAAGAAAAGTATCCCAAATTGCCTGTTTCTTGTCAGGGTTCTTCACATAGTGAAGCCATGAGACATCCAATTCCCAGACAGGCCCCTCCATTAGTTTCTGGCACTGTTGGGCATTTGTTTTCATCATCTTCTGGATTACGAAACGGTTTTCCGTTGATTCAACCTTTGTCTCAAGAGAGAAATGCCGAGTTTTCTCCGTTCATTTCGTCGTCTGCTAATGATGAACCGTTGTTGGCTCCTCGAGATTCTTCCCATTCAGAAGTGGTGACTAATGACTTGAATGAGAATAATGCATCTTGGAGTGATGATACCCTTCAGGGTTTGCTTGATTTTTCTGAGAATGTTCCTGACCAGAATGGTCAAGCGCAAAGCATTGCTGGTGTGTTAATGGCAGATGAACAAGATAAGAGGAATGATTGGCCTGATTGGGCTGATCAGTTTATTTCTGTGGATGATGCTTTGGAGCCAAATTGGAATGAGATTTTTTCTGATGGTAATGGAGGAGATCCTAAACCAGAGTTGCTCAAATCATCATCAGCTGGTTTCCATGCTCCACAGAACCAAACAAATCAAGTTGACTCTGTACCTACAGCAGAGTTTCACTCGGTTTCTTCTAATTCACTCTCAACATCAACTAACCGACCTCGGATGCGATGGACACCAGAGCTTCACGAGGCATTTGTAGAAGCTGTCAATAAACTAGGTGGCAGCGAAAATGCAACTCCTAAGGGTGTCTTAAAGCTTATGAATGTTGAAGGCCTCACCATTTATCATGTCAAAAGCCACCTACAGAAATATAGAACCGCCAGATATAGACCGGAGTCATCAGAAGGATCTTCAGGGAAAAAAATAAATCATATTGAGGAAATGAAGTCTCTTGACTTGAAAACGAGTATGGGAATAACTGAAGCTTTGCGTTTGCAAATGGAGGTTCAGAAGCGGCTCCATGAACAACTCGAGATCCAAAGGAATTTACAATTACGAATTGAAGAACAAGGTAAGTACCTACAGGAGATGTTTGAGCAACAACGAAAGATGGAAAGCAAGCTAAAGACCTCATCGTCGATCCTCGAGAATATGCCTCGTCCTAATGATCAACCCGAAAATTCAGAACAAGGTCATGTTGCAGCTGGTATGACCACGGAGAACGCTGAGGGCGCTCGAGAAGATGGTTTGCCCGCTGCAAGTAGGAAGCAGAAGGCACATGAAAGAGAAGCGATTGAATCGAGTGAAGGTAACTCTTCTTCCCCGGACACAAAACGAGCTAAATCAGATGCAACAGCTTCATAG

Protein sequence

MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVSGTVGHLFSSSSGLRNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEVVTNDLNENNASWSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNWNEIFSDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGSSGKKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQQRKMESKLKTSSSILENMPRPNDQPENSEQGHVAAGMTTENAEGAREDGLPAASRKQKAHEREAIESSEGNSSSPDTKRAKSDATAS
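The protein sequence is the translation of the coding sequence, codon by codon. The sketch below illustrates this with the standard genetic code, which is an assumption here since the page does not name a translation table; the helper `translate` is hypothetical. It is applied only to the first and last few codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any incomplete codon at the end of the sequence.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
n_terminus = translate("ATGTCATCATCTTATCGAGTT")
c_terminus = translate("GCAACAGCTTCATAG")
print(n_terminus, c_terminus)
```

The two fragments correspond to the start (`MSSSYRV`) and end (`ATAS`) of the protein sequence, with the final `TAG` acting as the stop codon.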
BLAST of Cp4.1LG05g08380 vs. Swiss-Prot
Match: PHL1_ARATH (Protein PHR1-LIKE 1 OS=Arabidopsis thaliana GN=PHL1 PE=1 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 1.4e-75
Identity = 185/373 (49.60%), Positives = 241/373 (64.61%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVSG--TVGHLFSSSSGL 60
           MSSSY  L    E++Y KLP S   SS  E M +P+P Q+   VSG  + G+LF SSSG 
Sbjct: 13  MSSSYSALHTSVEDRYHKLPNSFWVSSGQELMNNPVPCQS---VSGGNSGGYLFPSSSGY 72

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEVVTNDLNENNASWSDDTLQ 120
            N   +   L   RN +  P +S+   D   LA +D     +  + L  ++     D L 
Sbjct: 73  CN---VSAVLPHGRNLQNQPPVSTVPRDR--LAMQDCPL--IAQSSLINHHPQEFIDPLH 132

Query: 121 GLLDFSENVPDQNGQAQSIAGVLMAD--EQDKRNDWPDWADQFISVDDALEPNWNEIFSD 180
              DFS++VP QN QA+S +GV +    E  K+++W DWADQ ISVDD  EPNW+E+  D
Sbjct: 133 EFFDFSDHVPVQNLQAES-SGVRVDSSVELHKKSEWQDWADQLISVDDGSEPNWSELLGD 192

Query: 181 GNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPELH 240
            +  +P  E+                Q   V + +  S  ++S S +T++ RMRWTPELH
Sbjct: 193 SSSHNPNSEIPTPFLDVPRLDITANQQQQMVSSEDQLSGRNSSSSVATSKQRMRWTPELH 252

Query: 241 EAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGS---SG 300
           EAFVEAVN+LGGSE ATPK VLKL+N  GLTIYHVKSHLQKYRTARY+PE+SE +     
Sbjct: 253 EAFVEAVNQLGGSERATPKAVLKLLNNPGLTIYHVKSHLQKYRTARYKPETSEVTGEPQE 312

Query: 301 KKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFE 360
           KK+  IE++KSLD+KTS+ IT+ALRLQMEVQKRLHEQLEIQR+LQL+IE+QG+YLQ MFE
Sbjct: 313 KKMTSIEDIKSLDMKTSVEITQALRLQMEVQKRLHEQLEIQRSLQLQIEKQGRYLQMMFE 372

Query: 361 QQRKMESKLKTSS 367
           +Q+K++    +SS
Sbjct: 373 KQQKIQDNKSSSS 374
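The percentages in each HSP summary line are the match counts divided by the alignment length. A small sketch, using the counts reported for the PHL1_ARATH hit above (the function name `percent` is illustrative):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the PHL1_ARATH HSP summary: 185 identical and
# 241 positive-scoring positions over an alignment of 373 columns.
identity = percent(185, 373)
positives = percent(241, 373)
print(f"{identity:.2f}% identity, {positives:.2f}% positives")
```

The alignment length (373) counts gap columns as well, which is why it can exceed the number of query residues covered by the hit.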

BLAST of Cp4.1LG05g08380 vs. Swiss-Prot
Match: PHLD_ARATH (Myb family transcription factor PHL13 OS=Arabidopsis thaliana GN=PHL13 PE=2 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 3.9e-70
Identity = 193/457 (42.23%), Positives = 265/457 (57.99%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVSG--TVGHLFSSSSGL 60
           +SSS+ +L + +   +P     C  S       +P+P Q  PLVSG  + G+LFSSSSG 
Sbjct: 13  ISSSFTILEERYHNNFPN--TLCVSSGQESMNNNPVPCQVFPLVSGGSSGGNLFSSSSGF 72

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLA-----------PRDSSHSEVVTNDLNE 120
            NG   +   SQ R     P +S+   D   +A           P ++   +++     +
Sbjct: 73  CNGV-YVSSSSQAR-----PSVSTVPRDRITVAHVSGEGQRQECPVETHSLQLINQPQEQ 132

Query: 121 NNASWSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALE 180
              +WS D ++G  DF   VPD   QA S   ++ + E   + +WPDWADQ IS DD+LE
Sbjct: 133 KIMTWSSDQIRGFFDFP--VPDP--QAASSRTMVSSKEVLSKCEWPDWADQLIS-DDSLE 192

Query: 181 PNWNEIFSDGNGGD--PKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTN 240
           PNW+E+  D N  +   K E   S  A         +QVD  P+ E  +  S   S+ T+
Sbjct: 193 PNWSELLGDPNVLNLYSKIETQSSDIARQEIVFRNQHQVD--PSMEPFNAKSPPASSMTS 252

Query: 241 RPRMRWTPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRP 300
           + RMRWTPELHEAFVEA+N+LGGSE ATPK VLKL+N  GLT+YHVKSHLQKYRTARY+P
Sbjct: 253 KQRMRWTPELHEAFVEAINQLGGSERATPKAVLKLINSPGLTVYHVKSHLQKYRTARYKP 312

Query: 301 ESSEGSSG---KKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIE 360
           E S+ +     K +  IE++KSLDLKTS+ ITEALRLQM+VQK+LHEQLEIQR+LQL+IE
Sbjct: 313 ELSKDTEEPLVKNLKTIEDIKSLDLKTSIEITEALRLQMKVQKQLHEQLEIQRSLQLQIE 372

Query: 361 EQGKYLQEMFEQQRKMESKLKTSSSILENMPR--PNDQPENSEQGHVAAGMTTENAEGAR 420
           EQG+YLQ M E+Q+KM+   K S+S   +MP   P+    N  Q  +     +E      
Sbjct: 373 EQGRYLQMMIEKQQKMQENKKDSTS-SSSMPEADPSAPSPNLSQPFLHKATNSE------ 432

Query: 421 EDGLPAASRKQKAHEREAIESSEGNSSSPDTKRAKSD 438
               P+ ++K + +    ++ SE  S + + KR + D
Sbjct: 433 ----PSITQKLQ-NGSSTMDQSESTSGTSNRKRVRED 442

BLAST of Cp4.1LG05g08380 vs. Swiss-Prot
Match: PHR1_ARATH (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Arabidopsis thaliana GN=PHR1 PE=1 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 6.4e-65
Identity = 186/440 (42.27%), Positives = 235/440 (53.41%), Query Frame = 1

Query: 9   PKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS---GTVGHLFSSSS-GLRNGFPL 68
           P P E+ + +        ++S+ M  P+ +    L S   G VGH+ SSSS G       
Sbjct: 26  PSPVEDSFMR------SDNNSQLMSRPLGQTYHLLSSSNGGAVGHICSSSSSGFATNLHY 85

Query: 69  IQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEVVTNDLNENNASWSDDTLQG-LLDF 128
              +S E+   ++   S++A   P                 + N+++W  D+L G  LDF
Sbjct: 86  STMVSHEKQQHYTGSSSNNAVQTP-----------------SNNDSAWCHDSLPGGFLDF 145

Query: 129 SENVPDQNGQAQSIAGVLMA--DEQDKRNDWPDWADQFISVDDAL-EPNWNEIFSDGNGG 188
            E  P      Q   G + A  D+  KR+DW +WAD  I+ DD L   NWN++  + N  
Sbjct: 146 HETNPAIQNNCQIEDGGIAAAFDDIQKRSDWHEWADHLITDDDPLMSTNWNDLLLETNSN 205

Query: 189 DPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTS--TNRPRMRWTPELHEA 248
               +           PQ Q  Q    P+ E   VS+ S +++  T + RMRWTPELHEA
Sbjct: 206 SDSKD-----QKTLQIPQPQIVQQQPSPSVELRPVSTTSSNSNNGTGKARMRWTPELHEA 265

Query: 249 FVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSE-GSSGKKIN 308
           FVEAVN LGGSE ATPKGVLK+M VEGLTIYHVKSHLQKYRTARYRPE SE GS  +K+ 
Sbjct: 266 FVEAVNSLGGSERATPKGVLKIMKVEGLTIYHVKSHLQKYRTARYRPEPSETGSPERKLT 325

Query: 309 HIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQQRK 368
            +E + SLDLK  +GITEALRLQMEVQK+LHEQLEIQRNLQLRIEEQGKYLQ MFE+Q  
Sbjct: 326 PLEHITSLDLKGGIGITEALRLQMEVQKQLHEQLEIQRNLQLRIEEQGKYLQMMFEKQ-- 385

Query: 369 MESKLKTSSSILENMPRPNDQPENSEQGHVAAGMTTENAEGAREDGLPAASRKQKAHERE 428
                  +S + +     +D    SEQ                ED   A S++    E  
Sbjct: 386 -------NSGLTKGTASTSDSAAKSEQ----------------EDKKTADSKEVPEEETR 408

Query: 429 AIESSEGNSSSPDTKRAKSD 438
             E  E    SP  KR K D
Sbjct: 446 KCEELE----SPQPKRPKID 408

BLAST of Cp4.1LG05g08380 vs. Swiss-Prot
Match: PHR1_ORYSI (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. indica GN=PHR1 PE=3 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 4.3e-53
Identity = 161/376 (42.82%), Positives = 211/376 (56.12%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPL----VSGTVGH----LF 60
           MSSS  +LPK  ++    +P S   ++ +  M   +P  + PL       ++ H    + 
Sbjct: 1   MSSSLPILPKSLKD----IPRS--HNTQNILMPGQLPNDSMPLHQSATQSSISHPRASVV 60

Query: 61  SSSSGLRNGF---PLIQPLSQERNAEFSPFISSSANDEPLLAP-RDSSHSEVVTNDLNEN 120
            SS     G+   P+    S E +   +PFIS S+N E L +   +++H           
Sbjct: 61  RSSYSAMLGYAANPIDSVSSHEGHFMAAPFISQSSNAEMLQSLCNNNTHGGHTVPTFFPA 120

Query: 121 NASWSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEP 180
            A  + D +  +      VPD + Q+ S     +  +  K+N+W  WAD  I  DD    
Sbjct: 121 PACGAPDYMDTI-----TVPDNHTQSGSST---VTSDAAKQNEW--WAD--IMNDD---- 180

Query: 181 NWNEIF-SDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRP 240
            W +I  +       K     S+SA      NQ+    S       S   N+ + S ++ 
Sbjct: 181 -WKDILDATATDSQSKSMAQPSNSAASQPAFNQSTSSHSGDICPVTSPPPNNSNASASKQ 240

Query: 241 RMRWTPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPES 300
           RMRWTPELHE+FV AVNKLGGSE ATPKGVLKLM V+GLTIYHVKSHLQKYRTARY+P+ 
Sbjct: 241 RMRWTPELHESFVHAVNKLGGSEKATPKGVLKLMKVDGLTIYHVKSHLQKYRTARYKPDL 300

Query: 301 SEGSS--GKKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG 360
           SEG +  GK  + +    SLDLK SM +TEALRLQMEVQKRLHEQLEIQR LQLRIEEQG
Sbjct: 301 SEGKTQEGKTTDEL----SLDLKASMDLTEALRLQMEVQKRLHEQLEIQRKLQLRIEEQG 349

Query: 361 KYLQEMFEQQRKMESK 362
           KYLQ+MFE+Q K  ++
Sbjct: 361 KYLQKMFEKQCKSSTQ 349

BLAST of Cp4.1LG05g08380 vs. Swiss-Prot
Match: PHR1_ORYSJ (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. japonica GN=PHR1 PE=2 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 5.6e-53
Identity = 161/376 (42.82%), Positives = 210/376 (55.85%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPL----VSGTVGH----LF 60
           MSSS  +LPK  ++    +P S   ++ +  M   +P  + PL       ++ H    + 
Sbjct: 1   MSSSLPILPKSLKD----IPRS--HNTQNILMPGQLPNDSMPLHQSATQSSISHPRASVV 60

Query: 61  SSSSGLRNGF---PLIQPLSQERNAEFSPFISSSANDEPL-LAPRDSSHSEVVTNDLNEN 120
            SS     G+   P+    S E +   +PFIS S+N E L     +++H           
Sbjct: 61  RSSYSAMLGYAANPIDSVSSHEGHFMAAPFISQSSNAEMLQYLCNNNTHGGHTVPTFFPA 120

Query: 121 NASWSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEP 180
            A  + D +  +      VPD + Q+ S     +  +  K+N+W  WAD  I  DD    
Sbjct: 121 PACGAPDYMDTI-----TVPDNHTQSGSST---VTSDAAKQNEW--WAD--IMNDD---- 180

Query: 181 NWNEIF-SDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRP 240
            W +I  +       K     S+SA      NQ+    S       S   N+ + S ++ 
Sbjct: 181 -WKDILDATATDSQSKSMAQPSNSAASQPAFNQSTSSHSGDICPVTSPPPNNSNASASKQ 240

Query: 241 RMRWTPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPES 300
           RMRWTPELHE+FV AVNKLGGSE ATPKGVLKLM V+GLTIYHVKSHLQKYRTARY+P+ 
Sbjct: 241 RMRWTPELHESFVHAVNKLGGSEKATPKGVLKLMKVDGLTIYHVKSHLQKYRTARYKPDL 300

Query: 301 SEGSS--GKKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG 360
           SEG +  GK  + +    SLDLK SM +TEALRLQMEVQKRLHEQLEIQR LQLRIEEQG
Sbjct: 301 SEGKTQEGKTTDEL----SLDLKASMDLTEALRLQMEVQKRLHEQLEIQRKLQLRIEEQG 349

Query: 361 KYLQEMFEQQRKMESK 362
           KYLQ+MFE+Q K  ++
Sbjct: 361 KYLQKMFEKQCKSSTQ 349

BLAST of Cp4.1LG05g08380 vs. TrEMBL
Match: A0A0A0LY82_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G666970 PE=4 SV=1)

HSP 1 Score: 698.4 bits (1801), Expect = 5.8e-198
Identity = 377/445 (84.72%), Positives = 399/445 (89.66%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS--GTVGHLFSSSSGL 60
           MSSSY VL KPFE+KYPKLP+S QGSS SEAMRHPIPRQAPPLVS  GTVGHLFSSSSG 
Sbjct: 1   MSSSYPVLSKPFEDKYPKLPLSFQGSSQSEAMRHPIPRQAPPLVSNSGTVGHLFSSSSGF 60

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEV----VTNDLNENNASWSD 120
           RN FPL+QPLSQERNA+FSPFIS SAND  LL    SSHSEV    VT +LNEN+ASWS 
Sbjct: 61  RNDFPLMQPLSQERNAQFSPFISRSANDGSLLPSHGSSHSEVQSTMVTGNLNENSASWST 120

Query: 121 DTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNWNEIF 180
           DTLQ LLDFSEN+PDQNGQ Q++A VLM+D+Q KRNDWPDWADQFISVDDALEPNW+EIF
Sbjct: 121 DTLQDLLDFSENIPDQNGQDQNVASVLMSDDQAKRNDWPDWADQFISVDDALEPNWSEIF 180

Query: 181 SDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPE 240
           SD N GDPKPE+LKSSSA F+AP NQTNQVDS+PT EFHSVS NSLSTST RPRMRWTPE
Sbjct: 181 SDANAGDPKPEVLKSSSANFNAPPNQTNQVDSLPTVEFHSVS-NSLSTST-RPRMRWTPE 240

Query: 241 LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGSSGK 300
           LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARY+PESSEGSSGK
Sbjct: 241 LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYKPESSEGSSGK 300

Query: 301 KINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ 360
           KINHIEEMK+LDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ
Sbjct: 301 KINHIEEMKTLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ 360

Query: 361 QRKMESKLKTSSSILENMPRPNDQPENSEQGHVAAGMTTENAEGAREDGLPAASRKQKAH 420
           QRKME+KLKTSSSILENMP  +DQP+N EQGH AAGM+TENAE AREDGL AASRK K H
Sbjct: 361 QRKMENKLKTSSSILENMPCADDQPKNLEQGHDAAGMSTENAEDAREDGLLAASRKHKGH 420

Query: 421 EREAIESSEGNSSSPDTKRAKSDAT 440
           E E +E  EGN SSPD KRAKSDAT
Sbjct: 421 EGEEVEPDEGN-SSPDAKRAKSDAT 442

BLAST of Cp4.1LG05g08380 vs. TrEMBL
Match: A0A067JIQ6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23560 PE=4 SV=1)

HSP 1 Score: 399.8 bits (1026), Expect = 4.2e-108
Identity = 251/462 (54.33%), Positives = 308/462 (66.67%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS--GTVGHLFSSSSGL 60
           MS+S+ VLP P EEKYPKLP S Q SS  E  + P+ +QA  L    GT GHL SSS   
Sbjct: 1   MSASFPVLPTPLEEKYPKLPDSFQVSSERELTKGPVSQQASSLGPNIGTTGHLSSSSLRF 60

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDE-PLL-APRDSSHSEV-----VTNDLNENNAS 120
            N         +   ++  PFI+ S++D  PLL AP  SSHS+V     +T+    NN S
Sbjct: 61  LNQVHSSFSSPRRVQSQNYPFIAKSSSDGGPLLGAPTGSSHSDVQPTALMTDPEENNNMS 120

Query: 121 WSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNWN 180
           WS D L  LLDF EN+P QNGQ +S  GV+ +++  K+ DW DWADQ ISVDD LEPNW+
Sbjct: 121 WSIDPLHDLLDFPENIPVQNGQVESNIGVISSEDISKKADWQDWADQLISVDDDLEPNWS 180

Query: 181 EIFSDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRW 240
           EI +D N  D K ++LKS S     PQ   +QV  V + E ++ ++ + +    + RMRW
Sbjct: 181 EILNDANATDAKQKVLKSPSGISVQPQIHQHQV--VSSGETYTAANPTSAAPAAKTRMRW 240

Query: 241 TPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGS 300
           TPELH+AFVEAVNKLGGSE ATPKGVLKLMNVEGLTIYHVKSHLQKYRTARY+PESSEGS
Sbjct: 241 TPELHDAFVEAVNKLGGSERATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYKPESSEGS 300

Query: 301 SGKKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEM 360
           S KK+N I+EMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG+YLQ M
Sbjct: 301 SEKKLNPIDEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGRYLQMM 360

Query: 361 FEQQRKMESKLK--TSSSILENMPRP---------NDQPENSEQGHVAAGMTTENAEGAR 420
           FE+QRKME +    +SSS +++   P         N++ E S+Q +   G    +A  A 
Sbjct: 361 FEKQRKMEEEKSKDSSSSPVDDPSLPQSNLVQQPGNNKSEVSQQDNAKTGFDGNDAGSAL 420

Query: 421 EDGLPAASRKQKAHEREAIE--SSEGNSSSP-DTKRAKSDAT 440
           E+   + S+KQK  E +  +    E N  SP   KR ++D T
Sbjct: 421 EESFQSVSKKQKVQENKTCKGFDPEDNECSPASAKRPRTDET 460

BLAST of Cp4.1LG05g08380 vs. TrEMBL
Match: B9S7M4_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_0609760 PE=4 SV=1)

HSP 1 Score: 382.5 bits (981), Expect = 7.0e-103
Identity = 253/466 (54.29%), Positives = 306/466 (65.67%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS--GTVGHLFSSS--S 60
           MSS + VLP P E +YPKLP S Q SS  E MR+PI +Q  PL S  GTVGHLFSSS  S
Sbjct: 1   MSSPFPVLPTPLEGQYPKLPDSFQVSSERELMRNPILQQTSPLCSNSGTVGHLFSSSMRS 60

Query: 61  GLRNGFPLIQPLSQERNAEFSPFISSSANDE---PLLAPRDSSHSEV----VTNDLNEN- 120
            +      + P  Q   ++ SPFIS S  D+   P+L    SS+SEV    + N   EN 
Sbjct: 61  SIEAQASSLAP--QGGQSQNSPFISQSLRDKGSLPVLTTH-SSNSEVQSTALINHSEENK 120

Query: 121 NASWSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEP 180
           + SW+ D L  LLDF ENV  QNGQ +S  GV+ +++  KR DW +WADQ ISVDD LEP
Sbjct: 121 DMSWTIDPLHDLLDFPENVAVQNGQVESTIGVITSEDFSKRTDWQEWADQLISVDDDLEP 180

Query: 181 NWNEIFSDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPR 240
           NW+E+ +D N  D K +++KSSS    + Q   +Q   V   E +S ++   +    + R
Sbjct: 181 NWSELLNDANNADRKQKVVKSSSQ--ISVQPTVHQPQPVHNGEPYSAANPMSAIPAAKHR 240

Query: 241 MRWTPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESS 300
           MRWTPELHEAFVEAVNKLGGSE ATPKGVLKLMNVEGLTIYHVKSHLQKYRTARY+PES+
Sbjct: 241 MRWTPELHEAFVEAVNKLGGSERATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYKPESA 300

Query: 301 EGSSGKKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYL 360
           EG+S KK++ I+EMKSLDLK SMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG++L
Sbjct: 301 EGTSEKKLSPIDEMKSLDLKASMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGRHL 360

Query: 361 QEMFEQQRKME-SKLKTSSSILENMPRP---------NDQPENSEQGHVAAGMTTENAEG 420
           Q MFEQQRKME  + K SSS L++   P         N++ E SE  H  A     +  G
Sbjct: 361 QMMFEQQRKMEDDRSKASSSSLDDPSLPQSNIVQSPGNNKLEVSELDH--ARTEISSGGG 420

Query: 421 AREDGLPAASRKQKAHER---EAIESSEGNSSSPDTKRAKSDATAS 442
           A E      SRKQKA E    E ++  +  S     KR ++D  A+
Sbjct: 421 ALEGSSQNGSRKQKAPENRTGEDLDPEDDESGPASAKRPRADEIAA 459

BLAST of Cp4.1LG05g08380 vs. TrEMBL
Match: A0A061F0K2_THECC (Homeodomain-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_026171 PE=4 SV=1)

HSP 1 Score: 369.0 bits (946), Expect = 8.0e-99
Identity = 232/457 (50.77%), Positives = 288/457 (63.02%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVSG--TVGHLFSSSSGL 60
           MSSS+  LP PF+EKYPKLP S Q SS  + M++ I  Q   L     T+G+ FSS S +
Sbjct: 1   MSSSFPALPTPFKEKYPKLPDSFQVSSERKVMKNSISPQESSLAPSNRTLGNSFSSPS-I 60

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEVVTNDLN----ENNASWSD 120
            N       L+ +R+++   FIS  + D   L   DSS+S+  T   N    + + SW  
Sbjct: 61  ANNDMCASALAHDRHSQSPAFISQRSRDLASLPSIDSSYSDQSTALFNHPQEKKDVSWCI 120

Query: 121 DTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNWNEIF 180
           D LQ  LD  ENVPD NG  +S  GV+ +++  KR DW +WADQ ISVDD L+ +W E  
Sbjct: 121 DRLQDFLDLPENVPDPNGLLESSTGVMASEDHSKRTDWQEWADQLISVDDPLDTDWREFL 180

Query: 181 SDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPE 240
            D N  DPK ++L +SS      Q Q +Q    P  EF S +         RPRMRWTPE
Sbjct: 181 DDTNASDPKVKVL-NSSGDISKQQPQFHQNQPAPHGEFSSDAYPLSPAPPTRPRMRWTPE 240

Query: 241 LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGSSGK 300
           LHEAFV+AVN LGGSE ATPKG+LKLM VEGLTIYHVKSHLQKYRTARY+PESSEG+   
Sbjct: 241 LHEAFVDAVNILGGSERATPKGILKLMKVEGLTIYHVKSHLQKYRTARYKPESSEGTLEN 300

Query: 301 KINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ 360
           K+  I EMKSLDLK  MGITEALRLQMEVQK+LHEQLEIQRNLQLRIEEQG+YLQ MFE+
Sbjct: 301 KMASIGEMKSLDLKAGMGITEALRLQMEVQKQLHEQLEIQRNLQLRIEEQGRYLQMMFEK 360

Query: 361 QRKMESK--------LKTSSSILENMPRP---NDQPENSEQGHVAAGMTTENAEGAREDG 420
           Q++ME +        L  +S+ L  +  P   ND+ E  EQ H   G+ T NA    +  
Sbjct: 361 QKRMEDERTGAPSFNLDDASASLPGLTCPSCANDKSEALEQVHTKTGIDTRNASTTEDKS 420

Query: 421 LPAASRKQKAHE---REAIESSEGNSSSPDTKRAKSD 438
               SRKQKA E    + IE+++  S SP +KRA+++
Sbjct: 421 SQDVSRKQKALETNTADHIETNDNESGSPLSKRARTE 455

BLAST of Cp4.1LG05g08380 vs. TrEMBL
Match: F6HZP9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g04120 PE=4 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 2.3e-98
Identity = 238/459 (51.85%), Positives = 286/459 (62.31%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS--GTVGHLFSSSSGL 60
           +SSS  VLP   EE +PKLP S Q     E M  P    A PL S  G VGH+FSSSSG 
Sbjct: 26  LSSSLSVLPTSLEETHPKLPDSQQVYVGREIMTRPQAMHASPLPSNSGAVGHIFSSSSGY 85

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEVVTNDLNE------NNASW 120
                       ER++  +PFIS S+++   L    SSHS  + +  +       NNASW
Sbjct: 86  STDLHFSSVSPHERHSRSAPFISQSSSNGTSLPLAHSSHSGQLQSTASSHYIEENNNASW 145

Query: 121 SDDTLQGLLDFSENVPDQNGQ--AQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNW 180
             D+L G LDF  N P Q+ Q  ++S +GV+ +++  KR+DW +WADQ I+ DDAL  NW
Sbjct: 146 CTDSLSGFLDFPVNTPVQSSQIESRSASGVIASEDLSKRHDWQEWADQLITDDDALNSNW 205

Query: 181 NEIFSDGNGGDPKPEL---LKSSSAGFHAPQNQTNQVDSVPTAEFHSV--SSNSLSTSTN 240
           NE   D N  D +P++   +   S+ F A Q Q +   S P+ E H+V   S+S++T+  
Sbjct: 206 NEFLVDTNVADVEPKMAYQVPKPSSNFSANQPQVHPQLSAPSGEVHNVVTPSSSVNTAPT 265

Query: 241 RPRMRWTPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRP 300
           +PRMRWTPELHEAFVEAVN+LGGSE ATPKGVLKLM VEGLTIYHVKSHLQKYRTARYRP
Sbjct: 266 KPRMRWTPELHEAFVEAVNQLGGSERATPKGVLKLMKVEGLTIYHVKSHLQKYRTARYRP 325

Query: 301 ESSEGSSGKKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG 360
           ESSEGSS K++  IEEM SLDLKT + ITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG
Sbjct: 326 ESSEGSSEKRLTSIEEMSSLDLKTGIEITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG 385

Query: 361 KYLQEMFEQQRKME-SKLKTSSSILENMPR------PN----DQPENSEQGHVAAGMTTE 420
           +YLQ MFE+Q K    KLKTSSS LEN         PN     + E S   H   G    
Sbjct: 386 RYLQMMFEKQCKSGIDKLKTSSSALENPSSLSSDTIPNSPAKSEMEASHDEHDKTGTDLV 445

Query: 421 NAEGAREDGLPAASRKQKAHEREAIESSEGNSSSPDTKR 434
           N            SR+Q A E EA+  SE  +  P+  R
Sbjct: 446 NDSKTSSGNPQKLSREQNAIETEALLGSEQIAIEPEAPR 484

BLAST of Cp4.1LG05g08380 vs. TAIR10
Match: AT5G29000.2 (AT5G29000.2 Homeodomain-like superfamily protein)

HSP 1 Score: 285.0 bits (728), Expect = 7.7e-77
Identity = 185/373 (49.60%), Positives = 241/373 (64.61%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVSG--TVGHLFSSSSGL 60
           MSSSY  L    E++Y KLP S   SS  E M +P+P Q+   VSG  + G+LF SSSG 
Sbjct: 13  MSSSYSALHTSVEDRYHKLPNSFWVSSGQELMNNPVPCQS---VSGGNSGGYLFPSSSGY 72

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEVVTNDLNENNASWSDDTLQ 120
            N   +   L   RN +  P +S+   D   LA +D     +  + L  ++     D L 
Sbjct: 73  CN---VSAVLPHGRNLQNQPPVSTVPRDR--LAMQDCPL--IAQSSLINHHPQEFIDPLH 132

Query: 121 GLLDFSENVPDQNGQAQSIAGVLMAD--EQDKRNDWPDWADQFISVDDALEPNWNEIFSD 180
              DFS++VP QN QA+S +GV +    E  K+++W DWADQ ISVDD  EPNW+E+  D
Sbjct: 133 EFFDFSDHVPVQNLQAES-SGVRVDSSVELHKKSEWQDWADQLISVDDGSEPNWSELLGD 192

Query: 181 GNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPELH 240
            +  +P  E+                Q   V + +  S  ++S S +T++ RMRWTPELH
Sbjct: 193 SSSHNPNSEIPTPFLDVPRLDITANQQQQMVSSEDQLSGRNSSSSVATSKQRMRWTPELH 252

Query: 241 EAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGS---SG 300
           EAFVEAVN+LGGSE ATPK VLKL+N  GLTIYHVKSHLQKYRTARY+PE+SE +     
Sbjct: 253 EAFVEAVNQLGGSERATPKAVLKLLNNPGLTIYHVKSHLQKYRTARYKPETSEVTGEPQE 312

Query: 301 KKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFE 360
           KK+  IE++KSLD+KTS+ IT+ALRLQMEVQKRLHEQLEIQR+LQL+IE+QG+YLQ MFE
Sbjct: 313 KKMTSIEDIKSLDMKTSVEITQALRLQMEVQKRLHEQLEIQRSLQLQIEKQGRYLQMMFE 372

Query: 361 QQRKMESKLKTSS 367
           +Q+K++    +SS
Sbjct: 373 KQQKIQDNKSSSS 374

BLAST of Cp4.1LG05g08380 vs. TAIR10
Match: AT3G04450.1 (AT3G04450.1 Homeodomain-like superfamily protein)

HSP 1 Score: 266.9 bits (681), Expect = 2.2e-71
Identity = 193/457 (42.23%), Positives = 265/457 (57.99%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVSG--TVGHLFSSSSGL 60
           +SSS+ +L + +   +P     C  S       +P+P Q  PLVSG  + G+LFSSSSG 
Sbjct: 13  ISSSFTILEERYHNNFPN--TLCVSSGQESMNNNPVPCQVFPLVSGGSSGGNLFSSSSGF 72

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLA-----------PRDSSHSEVVTNDLNE 120
            NG   +   SQ R     P +S+   D   +A           P ++   +++     +
Sbjct: 73  CNGV-YVSSSSQAR-----PSVSTVPRDRITVAHVSGEGQRQECPVETHSLQLINQPQEQ 132

Query: 121 NNASWSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALE 180
              +WS D ++G  DF   VPD   QA S   ++ + E   + +WPDWADQ IS DD+LE
Sbjct: 133 KIMTWSSDQIRGFFDFP--VPDP--QAASSRTMVSSKEVLSKCEWPDWADQLIS-DDSLE 192

Query: 181 PNWNEIFSDGNGGD--PKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTN 240
           PNW+E+  D N  +   K E   S  A         +QVD  P+ E  +  S   S+ T+
Sbjct: 193 PNWSELLGDPNVLNLYSKIETQSSDIARQEIVFRNQHQVD--PSMEPFNAKSPPASSMTS 252

Query: 241 RPRMRWTPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRP 300
           + RMRWTPELHEAFVEA+N+LGGSE ATPK VLKL+N  GLT+YHVKSHLQKYRTARY+P
Sbjct: 253 KQRMRWTPELHEAFVEAINQLGGSERATPKAVLKLINSPGLTVYHVKSHLQKYRTARYKP 312

Query: 301 ESSEGSSG---KKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIE 360
           E S+ +     K +  IE++KSLDLKTS+ ITEALRLQM+VQK+LHEQLEIQR+LQL+IE
Sbjct: 313 ELSKDTEEPLVKNLKTIEDIKSLDLKTSIEITEALRLQMKVQKQLHEQLEIQRSLQLQIE 372

Query: 361 EQGKYLQEMFEQQRKMESKLKTSSSILENMPR--PNDQPENSEQGHVAAGMTTENAEGAR 420
           EQG+YLQ M E+Q+KM+   K S+S   +MP   P+    N  Q  +     +E      
Sbjct: 373 EQGRYLQMMIEKQQKMQENKKDSTS-SSSMPEADPSAPSPNLSQPFLHKATNSE------ 432

Query: 421 EDGLPAASRKQKAHEREAIESSEGNSSSPDTKRAKSD 438
               P+ ++K + +    ++ SE  S + + KR + D
Sbjct: 433 ----PSITQKLQ-NGSSTMDQSESTSGTSNRKRVRED 442
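
The percentages on each HSP summary line are plain ratios of the reported counts to the alignment length (for this hit, 193/457 identical and 265/457 positive positions). A minimal Python sketch, assuming the summary-line layout used in these reports, recomputes them as a sanity check. The function name `hsp_percentages` is illustrative only and is not part of any BLAST tool.

```python
import re

# Matches the counts on an HSP summary line. The pattern also tolerates the
# "Postives" misspelling that appears in some BLAST report dumps.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


if __name__ == "__main__":
    summary = "Identity = 193/457 (42.23%), Positives = 265/457 (57.99%), Query Frame = 1"
    print(hsp_percentages(summary))  # (42.23, 57.99)
```

The denominator is the alignment length including gap columns, which is why it can exceed the length of either sequence.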

BLAST of Cp4.1LG05g08380 vs. TAIR10
Match: AT4G28610.1 (AT4G28610.1 phosphate starvation response 1)

HSP 1 Score: 249.6 bits (636), Expect = 3.6e-66
Identity = 186/440 (42.27%), Positives = 235/440 (53.41%), Query Frame = 1

Query: 9   PKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS---GTVGHLFSSSS-GLRNGFPL 68
           P P E+ + +        ++S+ M  P+ +    L S   G VGH+ SSSS G       
Sbjct: 26  PSPVEDSFMR------SDNNSQLMSRPLGQTYHLLSSSNGGAVGHICSSSSSGFATNLHY 85

Query: 69  IQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEVVTNDLNENNASWSDDTLQG-LLDF 128
              +S E+   ++   S++A   P                 + N+++W  D+L G  LDF
Sbjct: 86  STMVSHEKQQHYTGSSSNNAVQTP-----------------SNNDSAWCHDSLPGGFLDF 145

Query: 129 SENVPDQNGQAQSIAGVLMA--DEQDKRNDWPDWADQFISVDDAL-EPNWNEIFSDGNGG 188
            E  P      Q   G + A  D+  KR+DW +WAD  I+ DD L   NWN++  + N  
Sbjct: 146 HETNPAIQNNCQIEDGGIAAAFDDIQKRSDWHEWADHLITDDDPLMSTNWNDLLLETNSN 205

Query: 189 DPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTS--TNRPRMRWTPELHEA 248
               +           PQ Q  Q    P+ E   VS+ S +++  T + RMRWTPELHEA
Sbjct: 206 SDSKD-----QKTLQIPQPQIVQQQPSPSVELRPVSTTSSNSNNGTGKARMRWTPELHEA 265

Query: 249 FVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSE-GSSGKKIN 308
           FVEAVN LGGSE ATPKGVLK+M VEGLTIYHVKSHLQKYRTARYRPE SE GS  +K+ 
Sbjct: 266 FVEAVNSLGGSERATPKGVLKIMKVEGLTIYHVKSHLQKYRTARYRPEPSETGSPERKLT 325

Query: 309 HIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQQRK 368
            +E + SLDLK  +GITEALRLQMEVQK+LHEQLEIQRNLQLRIEEQGKYLQ MFE+Q  
Sbjct: 326 PLEHITSLDLKGGIGITEALRLQMEVQKQLHEQLEIQRNLQLRIEEQGKYLQMMFEKQ-- 385

Query: 369 MESKLKTSSSILENMPRPNDQPENSEQGHVAAGMTTENAEGAREDGLPAASRKQKAHERE 428
                  +S + +     +D    SEQ                ED   A S++    E  
Sbjct: 386 -------NSGLTKGTASTSDSAAKSEQ----------------EDKKTADSKEVPEEETR 408

Query: 429 AIESSEGNSSSPDTKRAKSD 438
             E  E    SP  KR K D
Sbjct: 446 KCEELE----SPQPKRPKID 408

BLAST of Cp4.1LG05g08380 vs. TAIR10
Match: AT2G20400.1 (AT2G20400.1 myb-like HTH transcriptional regulator family protein)

HSP 1 Score: 171.8 bits (434), Expect = 9.5e-43
Identity = 121/261 (46.36%), Positives = 148/261 (56.70%), Query Frame = 1

Query: 144 DEQDKRNDWPDWADQFISVDD--ALEPNWNEIFSDGNGGDPKPELLKSSSAGFHAPQNQT 203
           DE  K++D P W D  I+ D+   +     ++  D N          S  +    PQ   
Sbjct: 138 DEIHKQSDLPLWYDDLITTDEDPLMSSILGDLLLDTNFNSASKVQQPSMQSQIQQPQAVL 197

Query: 204 NQVDSV----PTAEFHSVSSNSLSTSTN-----RPRMRWTPELHEAFVEAVNKLGGSENA 263
            Q  S     P     S +SN+ S S N     + RMRWTPELHE FV+AVN+LGGS  A
Sbjct: 198 QQPSSCVELRPLDRTVSSNSNNNSNSNNAAAAAKGRMRWTPELHEVFVDAVNQLGGSNEA 257

Query: 264 TPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGSSGKKINHIEEMKSLDLKTSMG 323
           TPKGVLK M VEGLTI+HVKSHLQKYRTA+Y P  SEGS   ++  +E++ S D K  + 
Sbjct: 258 TPKGVLKHMKVEGLTIFHVKSHLQKYRTAKYIPVPSEGSPEARLTPLEQITSDDTKRGID 317

Query: 324 ITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQQRK----MESKLKTSSSI 383
           ITE LR+QME QK+LHEQLE  R +QLRIEEQGK L  M E+Q       E   KTS+  
Sbjct: 318 ITETLRIQMEHQKKLHEQLESLRTMQLRIEEQGKALLMMIEKQNMGFGGPEQGEKTSAKT 377

BLAST of Cp4.1LG05g08380 vs. TAIR10
Match: AT5G06800.1 (AT5G06800.1 myb-like HTH transcriptional regulator family protein)

HSP 1 Score: 161.8 bits (408), Expect = 9.8e-40
Identity = 90/168 (53.57%), Positives = 116/168 (69.05%), Query Frame = 1

Query: 195 HAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPELHEAFVEAVNKLGGSENATP 254
           H P+    +  S P+   H  S        N+ R+RWT +LHE FVE VN+LGG++ ATP
Sbjct: 163 HQPKQSHPRFSSPPSFSIHGGSM--APNCVNKTRIRWTQDLHEKFVECVNRLGGADKATP 222

Query: 255 KGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGSSGKKINHIEEMKSLDLKTSMGIT 314
           K +LK M+ +GLTI+HVKSHLQKYR A+Y PES EG   K+    +E+  LD +T + I 
Sbjct: 223 KAILKRMDSDGLTIFHVKSHLQKYRIAKYMPESQEGKFEKRA-CAKELSQLDTRTGVQIK 282

Query: 315 EALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQQRKMESKL 363
           EAL+LQ++VQ+ LHEQLEIQRNLQLRIEEQGK L+ M EQQ+K +  L
Sbjct: 283 EALQLQLDVQRHLHEQLEIQRNLQLRIEEQGKQLKMMMEQQQKNKESL 327

BLAST of Cp4.1LG05g08380 vs. NCBI nr
Match: gi|449449583|ref|XP_004142544.1| (PREDICTED: protein PHR1-LIKE 1-like [Cucumis sativus])

HSP 1 Score: 698.4 bits (1801), Expect = 8.3e-198
Identity = 377/445 (84.72%), Positives = 399/445 (89.66%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS--GTVGHLFSSSSGL 60
           MSSSY VL KPFE+KYPKLP+S QGSS SEAMRHPIPRQAPPLVS  GTVGHLFSSSSG 
Sbjct: 1   MSSSYPVLSKPFEDKYPKLPLSFQGSSQSEAMRHPIPRQAPPLVSNSGTVGHLFSSSSGF 60

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEV----VTNDLNENNASWSD 120
           RN FPL+QPLSQERNA+FSPFIS SAND  LL    SSHSEV    VT +LNEN+ASWS 
Sbjct: 61  RNDFPLMQPLSQERNAQFSPFISRSANDGSLLPSHGSSHSEVQSTMVTGNLNENSASWST 120

Query: 121 DTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNWNEIF 180
           DTLQ LLDFSEN+PDQNGQ Q++A VLM+D+Q KRNDWPDWADQFISVDDALEPNW+EIF
Sbjct: 121 DTLQDLLDFSENIPDQNGQDQNVASVLMSDDQAKRNDWPDWADQFISVDDALEPNWSEIF 180

Query: 181 SDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPE 240
           SD N GDPKPE+LKSSSA F+AP NQTNQVDS+PT EFHSVS NSLSTST RPRMRWTPE
Sbjct: 181 SDANAGDPKPEVLKSSSANFNAPPNQTNQVDSLPTVEFHSVS-NSLSTST-RPRMRWTPE 240

Query: 241 LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGSSGK 300
           LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARY+PESSEGSSGK
Sbjct: 241 LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYKPESSEGSSGK 300

Query: 301 KINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ 360
           KINHIEEMK+LDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ
Sbjct: 301 KINHIEEMKTLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ 360

Query: 361 QRKMESKLKTSSSILENMPRPNDQPENSEQGHVAAGMTTENAEGAREDGLPAASRKQKAH 420
           QRKME+KLKTSSSILENMP  +DQP+N EQGH AAGM+TENAE AREDGL AASRK K H
Sbjct: 361 QRKMENKLKTSSSILENMPCADDQPKNLEQGHDAAGMSTENAEDAREDGLLAASRKHKGH 420

Query: 421 EREAIESSEGNSSSPDTKRAKSDAT 440
           E E +E  EGN SSPD KRAKSDAT
Sbjct: 421 EGEEVEPDEGN-SSPDAKRAKSDAT 442

BLAST of Cp4.1LG05g08380 vs. NCBI nr
Match: gi|659086105|ref|XP_008443767.1| (PREDICTED: protein PHR1-LIKE 1-like [Cucumis melo])

HSP 1 Score: 685.3 bits (1767), Expect = 7.2e-194
Identity = 371/446 (83.18%), Positives = 395/446 (88.57%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS--GTVGHLFSSSSGL 60
           MSSSY VLPKPFE+KYPKLP+S QGSS SE MRHPIPRQAPPLVS  GTVGHLFSSSSG 
Sbjct: 1   MSSSYPVLPKPFEDKYPKLPLSFQGSSQSETMRHPIPRQAPPLVSNSGTVGHLFSSSSGF 60

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEV----VTNDLNENNASWSD 120
           RN FPL+QPLSQERNA+FSPFIS SAND  LL    SSH EV    VT +LNEN+ASWS 
Sbjct: 61  RNDFPLMQPLSQERNAQFSPFISRSANDGSLLPSHGSSHPEVQSAMVTGNLNENSASWST 120

Query: 121 DTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNWNEIF 180
           DTLQ LLDFSEN+PDQNGQ QS+AGVLM+D+Q KRNDWPDWADQFISVDDALEPNW+EIF
Sbjct: 121 DTLQDLLDFSENIPDQNGQDQSMAGVLMSDDQAKRNDWPDWADQFISVDDALEPNWSEIF 180

Query: 181 SDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPE 240
           SD N GDPK E+LK SS  F+AP N+TNQVDS+PT EFHSVS NSLSTST RPRMRWTPE
Sbjct: 181 SDANAGDPKSEVLKPSSTNFNAPPNETNQVDSLPTVEFHSVS-NSLSTSTTRPRMRWTPE 240

Query: 241 LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGSSGK 300
           LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARY+PESSEGS  +
Sbjct: 241 LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYKPESSEGSGEE 300

Query: 301 KINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ 360
           KIN IEEMK+LDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ
Sbjct: 301 KINPIEEMKTLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ 360

Query: 361 QRKMESKLKTSSSILENMPRPNDQPENSEQGHVAAGMTTENAEGAREDGLPAASRKQKAH 420
           QRKME+KLKTSSSILENMPR +DQPEN EQ H AAGM+TENA+ ARED L AASRK K H
Sbjct: 361 QRKMENKLKTSSSILENMPR-DDQPENLEQDHDAAGMSTENAKAAREDDLLAASRKHKGH 420

Query: 421 EREAIESSEGNSSSPDTKRAKSDATA 441
           E +A+ES EGN SSPD KRAKSDATA
Sbjct: 421 EGDAVESGEGN-SSPDAKRAKSDATA 443

BLAST of Cp4.1LG05g08380 vs. NCBI nr
Match: gi|802759369|ref|XP_012089362.1| (PREDICTED: protein PHR1-LIKE 1 [Jatropha curcas])

HSP 1 Score: 399.8 bits (1026), Expect = 6.1e-108
Identity = 251/462 (54.33%), Positives = 308/462 (66.67%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS--GTVGHLFSSSSGL 60
           MS+S+ VLP P EEKYPKLP S Q SS  E  + P+ +QA  L    GT GHL SSS   
Sbjct: 1   MSASFPVLPTPLEEKYPKLPDSFQVSSERELTKGPVSQQASSLGPNIGTTGHLSSSSLRF 60

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDE-PLL-APRDSSHSEV-----VTNDLNENNAS 120
            N         +   ++  PFI+ S++D  PLL AP  SSHS+V     +T+    NN S
Sbjct: 61  LNQVHSSFSSPRRVQSQNYPFIAKSSSDGGPLLGAPTGSSHSDVQPTALMTDPEENNNMS 120

Query: 121 WSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNWN 180
           WS D L  LLDF EN+P QNGQ +S  GV+ +++  K+ DW DWADQ ISVDD LEPNW+
Sbjct: 121 WSIDPLHDLLDFPENIPVQNGQVESNIGVISSEDISKKADWQDWADQLISVDDDLEPNWS 180

Query: 181 EIFSDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRW 240
           EI +D N  D K ++LKS S     PQ   +QV  V + E ++ ++ + +    + RMRW
Sbjct: 181 EILNDANATDAKQKVLKSPSGISVQPQIHQHQV--VSSGETYTAANPTSAAPAAKTRMRW 240

Query: 241 TPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGS 300
           TPELH+AFVEAVNKLGGSE ATPKGVLKLMNVEGLTIYHVKSHLQKYRTARY+PESSEGS
Sbjct: 241 TPELHDAFVEAVNKLGGSERATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYKPESSEGS 300

Query: 301 SGKKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEM 360
           S KK+N I+EMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG+YLQ M
Sbjct: 301 SEKKLNPIDEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGRYLQMM 360

Query: 361 FEQQRKMESKLK--TSSSILENMPRP---------NDQPENSEQGHVAAGMTTENAEGAR 420
           FE+QRKME +    +SSS +++   P         N++ E S+Q +   G    +A  A 
Sbjct: 361 FEKQRKMEEEKSKDSSSSPVDDPSLPQSNLVQQPGNNKSEVSQQDNAKTGFDGNDAGSAL 420

Query: 421 EDGLPAASRKQKAHEREAIE--SSEGNSSSP-DTKRAKSDAT 440
           E+   + S+KQK  E +  +    E N  SP   KR ++D T
Sbjct: 421 EESFQSVSKKQKVQENKTCKGFDPEDNECSPASAKRPRTDET 460

BLAST of Cp4.1LG05g08380 vs. NCBI nr
Match: gi|255561969|ref|XP_002521993.1| (PREDICTED: protein PHR1-LIKE 1 isoform X1 [Ricinus communis])

HSP 1 Score: 382.5 bits (981), Expect = 1.0e-102
Identity = 253/466 (54.29%), Positives = 306/466 (65.67%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVS--GTVGHLFSSS--S 60
           MSS + VLP P E +YPKLP S Q SS  E MR+PI +Q  PL S  GTVGHLFSSS  S
Sbjct: 1   MSSPFPVLPTPLEGQYPKLPDSFQVSSERELMRNPILQQTSPLCSNSGTVGHLFSSSMRS 60

Query: 61  GLRNGFPLIQPLSQERNAEFSPFISSSANDE---PLLAPRDSSHSEV----VTNDLNEN- 120
            +      + P  Q   ++ SPFIS S  D+   P+L    SS+SEV    + N   EN 
Sbjct: 61  SIEAQASSLAP--QGGQSQNSPFISQSLRDKGSLPVLTTH-SSNSEVQSTALINHSEENK 120

Query: 121 NASWSDDTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEP 180
           + SW+ D L  LLDF ENV  QNGQ +S  GV+ +++  KR DW +WADQ ISVDD LEP
Sbjct: 121 DMSWTIDPLHDLLDFPENVAVQNGQVESTIGVITSEDFSKRTDWQEWADQLISVDDDLEP 180

Query: 181 NWNEIFSDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPR 240
           NW+E+ +D N  D K +++KSSS    + Q   +Q   V   E +S ++   +    + R
Sbjct: 181 NWSELLNDANNADRKQKVVKSSSQ--ISVQPTVHQPQPVHNGEPYSAANPMSAIPAAKHR 240

Query: 241 MRWTPELHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESS 300
           MRWTPELHEAFVEAVNKLGGSE ATPKGVLKLMNVEGLTIYHVKSHLQKYRTARY+PES+
Sbjct: 241 MRWTPELHEAFVEAVNKLGGSERATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYKPESA 300

Query: 301 EGSSGKKINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYL 360
           EG+S KK++ I+EMKSLDLK SMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQG++L
Sbjct: 301 EGTSEKKLSPIDEMKSLDLKASMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGRHL 360

Query: 361 QEMFEQQRKME-SKLKTSSSILENMPRP---------NDQPENSEQGHVAAGMTTENAEG 420
           Q MFEQQRKME  + K SSS L++   P         N++ E SE  H  A     +  G
Sbjct: 361 QMMFEQQRKMEDDRSKASSSSLDDPSLPQSNIVQSPGNNKLEVSELDH--ARTEISSGGG 420

Query: 421 AREDGLPAASRKQKAHER---EAIESSEGNSSSPDTKRAKSDATAS 442
           A E      SRKQKA E    E ++  +  S     KR ++D  A+
Sbjct: 421 ALEGSSQNGSRKQKAPENRTGEDLDPEDDESGPASAKRPRADEIAA 459

BLAST of Cp4.1LG05g08380 vs. NCBI nr
Match: gi|590641941|ref|XP_007030373.1| (Homeodomain-like superfamily protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 369.0 bits (946), Expect = 1.1e-98
Identity = 232/457 (50.77%), Positives = 288/457 (63.02%), Query Frame = 1

Query: 1   MSSSYRVLPKPFEEKYPKLPVSCQGSSHSEAMRHPIPRQAPPLVSG--TVGHLFSSSSGL 60
           MSSS+  LP PF+EKYPKLP S Q SS  + M++ I  Q   L     T+G+ FSS S +
Sbjct: 1   MSSSFPALPTPFKEKYPKLPDSFQVSSERKVMKNSISPQESSLAPSNRTLGNSFSSPS-I 60

Query: 61  RNGFPLIQPLSQERNAEFSPFISSSANDEPLLAPRDSSHSEVVTNDLN----ENNASWSD 120
            N       L+ +R+++   FIS  + D   L   DSS+S+  T   N    + + SW  
Sbjct: 61  ANNDMCASALAHDRHSQSPAFISQRSRDLASLPSIDSSYSDQSTALFNHPQEKKDVSWCI 120

Query: 121 DTLQGLLDFSENVPDQNGQAQSIAGVLMADEQDKRNDWPDWADQFISVDDALEPNWNEIF 180
           D LQ  LD  ENVPD NG  +S  GV+ +++  KR DW +WADQ ISVDD L+ +W E  
Sbjct: 121 DRLQDFLDLPENVPDPNGLLESSTGVMASEDHSKRTDWQEWADQLISVDDPLDTDWREFL 180

Query: 181 SDGNGGDPKPELLKSSSAGFHAPQNQTNQVDSVPTAEFHSVSSNSLSTSTNRPRMRWTPE 240
            D N  DPK ++L +SS      Q Q +Q    P  EF S +         RPRMRWTPE
Sbjct: 181 DDTNASDPKVKVL-NSSGDISKQQPQFHQNQPAPHGEFSSDAYPLSPAPPTRPRMRWTPE 240

Query: 241 LHEAFVEAVNKLGGSENATPKGVLKLMNVEGLTIYHVKSHLQKYRTARYRPESSEGSSGK 300
           LHEAFV+AVN LGGSE ATPKG+LKLM VEGLTIYHVKSHLQKYRTARY+PESSEG+   
Sbjct: 241 LHEAFVDAVNILGGSERATPKGILKLMKVEGLTIYHVKSHLQKYRTARYKPESSEGTLEN 300

Query: 301 KINHIEEMKSLDLKTSMGITEALRLQMEVQKRLHEQLEIQRNLQLRIEEQGKYLQEMFEQ 360
           K+  I EMKSLDLK  MGITEALRLQMEVQK+LHEQLEIQRNLQLRIEEQG+YLQ MFE+
Sbjct: 301 KMASIGEMKSLDLKAGMGITEALRLQMEVQKQLHEQLEIQRNLQLRIEEQGRYLQMMFEK 360

Query: 361 QRKMESK--------LKTSSSILENMPRP---NDQPENSEQGHVAAGMTTENAEGAREDG 420
           Q++ME +        L  +S+ L  +  P   ND+ E  EQ H   G+ T NA    +  
Sbjct: 361 QKRMEDERTGAPSFNLDDASASLPGLTCPSCANDKSEALEQVHTKTGIDTRNASTTEDKS 420

Query: 421 LPAASRKQKAHE---REAIESSEGNSSSPDTKRAKSD 438
               SRKQKA E    + IE+++  S SP +KRA+++
Sbjct: 421 SQDVSRKQKALETNTADHIETNDNESGSPLSKRARTE 455

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PHL1_ARATH  1.4e-75  49.60     Protein PHR1-LIKE 1 OS=Arabidopsis thaliana GN=PHL1 PE=1 SV=1
PHLD_ARATH  3.9e-70  42.23     Myb family transcription factor PHL13 OS=Arabidopsis thaliana GN=PHL13 PE=2 SV=1
PHR1_ARATH  6.4e-65  42.27     Protein PHOSPHATE STARVATION RESPONSE 1 OS=Arabidopsis thaliana GN=PHR1 PE=1 SV=...
PHR1_ORYSI  4.3e-53  42.82     Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. indica GN=PHR1 PE...
PHR1_ORYSJ  5.6e-53  42.82     Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. japonica GN=PHR1 ...

Match Name        E-value   Identity  Description
A0A0A0LY82_CUCSA  5.8e-198  84.72     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G666970 PE=4 SV=1
A0A067JIQ6_JATCU  4.2e-108  54.33     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23560 PE=4 SV=1
B9S7M4_RICCO      7.0e-103  54.29     DNA binding protein, putative OS=Ricinus communis GN=RCOM_0609760 PE=4 SV=1
A0A061F0K2_THECC  8.0e-99   50.77     Homeodomain-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=T...
F6HZP9_VITVI      2.3e-98   51.85     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g04120 PE=4 SV=...

Match Name   E-value  Identity  Description
AT5G29000.2  7.7e-77  49.60     Homeodomain-like superfamily protein
AT3G04450.1  2.2e-71  42.23     Homeodomain-like superfamily protein
AT4G28610.1  3.6e-66  42.27     phosphate starvation response 1
AT2G20400.1  9.5e-43  46.36     myb-like HTH transcriptional regulator family protein
AT5G06800.1  9.8e-40  53.57     myb-like HTH transcriptional regulator family protein

Match Name                        E-value   Identity  Description
gi|449449583|ref|XP_004142544.1|  8.3e-198  84.72     PREDICTED: protein PHR1-LIKE 1-like [Cucumis sativus]
gi|659086105|ref|XP_008443767.1|  7.2e-194  83.18     PREDICTED: protein PHR1-LIKE 1-like [Cucumis melo]
gi|802759369|ref|XP_012089362.1|  6.1e-108  54.33     PREDICTED: protein PHR1-LIKE 1 [Jatropha curcas]
gi|255561969|ref|XP_002521993.1|  1.0e-102  54.29     PREDICTED: protein PHR1-LIKE 1 isoform X1 [Ricinus communis]
gi|590641941|ref|XP_007030373.1|  1.1e-98   50.77     Homeodomain-like superfamily protein, putative isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding

Vocabulary: INTERPRO
Term       Definition
IPR025756  Myb_CC_LHEQLE
IPR017930  Myb_dom
IPR009057  Homeobox-like_sf
IPR006447  Myb_dom_plants
IPR001005  SANT/Myb

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0009987      cellular process
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003677      DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG05g08380.1  Cp4.1LG05g08380.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                             Source    Source Term       Source Description                       Alignment
IPR001005  SANT/Myb domain                                             PFAM      PF00249           Myb_DNA-binding                          coord: 228..279, score: 1.2
IPR006447  Myb domain, plants                                          TIGRFAMs  TIGR01557         TIGR01557                                coord: 227..280, score: 2.8
IPR009057  Homeodomain-like                                            GENE3D    G3DSA:1.10.10.60  -                                        coord: 224..282, score: 1.4
IPR009057  Homeodomain-like                                            unknown   SSF46689          Homeodomain-like                         coord: 224..280, score: 5.2
IPR017930  Myb domain                                                  PROFILE   PS51294           HTH_MYB                                  coord: 223..283, score: 12
IPR025756  MYB-CC type transcription factor, LHEQLE-containing domain  PFAM      PF14379           Myb_CC_LHEQLE                            coord: 311..358, score: 6.9
None       No IPR available                                            unknown   Coil              Coil                                     coord: 314..334, scor
None       No IPR available                                            PANTHER   PTHR31314         FAMILY NOT NAMED                         coord: 1..440, score: 6.5E
None       No IPR available                                            PANTHER   PTHR31314:SF12    MYB FAMILY TRANSCRIPTION FACTOR-RELATED  coord: 1..440, score: 6.5E
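
The `coord:` fields above give the residue range each signature covers on the protein. The following Python sketch, which assumes these ranges are 1-based and inclusive (the page does not state this), parses a range, reports its length, and tests whether two signatures overlap. The helper names `parse_coord`, `span_length`, and `overlaps` are hypothetical.

```python
import re

# "coord: 228..279" -> (228, 279); assumes 1-based, inclusive residue positions.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from an alignment field such as 'coord: 228..279'."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coord field in %r" % text)
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Number of residues covered by an inclusive (start, end) range."""
    start, end = span
    return end - start + 1


def overlaps(a, b):
    """True if two inclusive ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


if __name__ == "__main__":
    pfam_myb = parse_coord("coord: 228..279")     # PF00249 Myb_DNA-binding
    prosite_myb = parse_coord("coord: 223..283")  # PS51294 HTH_MYB
    myb_cc = parse_coord("coord: 311..358")       # PF14379 Myb_CC_LHEQLE
    print(span_length(pfam_myb))            # 52
    print(overlaps(pfam_myb, prosite_myb))  # True
    print(overlaps(pfam_myb, myb_cc))       # False
```

Under that assumption, the Pfam and PROSITE Myb signatures cover the same region, while the Myb_CC_LHEQLE signature lies downstream of it.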