Cp4.1LG18g03560 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG18g03560
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionWD-40 repeat-containing protein MSI4
LocationCp4.1LG18: 4975117 .. 4982850 (-)
RNA-Seq ExpressionCp4.1LG18g03560
SyntenyCp4.1LG18g03560
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAGAGTATATTACTGCTGCTATTTCACGGCACGGCAATAACACTACGCGCCACTGCATTAGGAAGACCTACCGAGATTTTGACACGTGTCCCAATATCAACCGTTGGTGGTGGATCATCATCATCATTTCCCAAATTGGACCCAAAATATATTTTTGTCTGGGTTTACTGTATGTAACTGGTTTTGATACTTTGGTGGGGTCGCCTTGAGTCTGGAGAGTCAAAGAGAGCGCGAAAAGTATCGTACTGAAAAATCGCGGACCGAAGCTCCACTGAGCTATGGATTCTCCTCAGTCGCAGCAGCAGCAGCAGCAACAGCTTCAACAACAGCAGCAGCAGCAGCCTGTTGTGAAGAAGAAGGAGACCAGAGGCCGGAAGCCCAAGCCTAAGGAGGAGAAGAAGGACGAGCAGCAAGCTAAGAAGATGAAGGCTCACCAACAGCCCTCCGTCGATGAGCGTTATACTCAGTGGAAGTCTCTTGTTCCTGTTCTCTATGACTGGTTCGCTAACCACAATCTCGTTTGGCCTTCTCTCTCTTGCCGGTATTCCACTTACTCTTCCTTCTGTTTTCAACTATCTACTCTACTCTGCTTGCTTCTCTCGTTTCTAGGGTTTTCTGTTTGCTGTTTTTTTCACTTGTGTTTCTTGTGCTTCTTCTTGTTCGTTGGAGATAAATCACCTCCACTGGATTACTGGATCTGTCGCGTGTTAATTTTTAGGGTTTTTTTTTTCTTACTATTTGTCACATTTCTGCAAACTTGAGCTTCTTATTCGTATCATACATTCCCTCTCCCTACCTCTACGGCTATTTGGCGGGGCCTTTATATGACGTTTATTTGATCTGAGTCTGACTATTTTCTGGGTCTAGGTGGGGTCCTCAGCTCGAGCAAGCAACGTATAAGAATCGACAGCGGCTTTATCTTTCTGAACAGGTGACTTCCGTCGCAGTAAAATCTTGTGTTTTGTTGAAATGGCTACGTTGTTTTATTTCTTGCTGTAGTTGTGCTGAATTGCGACCTAAAATCCCCTTAGGGGAGAGAAACTAGCTTGTACTGAATTTATCACATTGAAGATCGATCTGTAATTGTTGTTTATTGTTTATTTTTCTTCCTTGTTACCAGACTGATGGCAGTGTTCCGAATACACTGGTCATTGCAAATTGTGAAGTTGTGAAGCCGAGGGTTGCAGCCGCAGAGCACATTTCTCAGGTAATTGGTCAATTTCAAAGTTTAAAGTTCTGTGGACCTTTTTCGTGCTACTCAATCTAGGTGTCGTATGTTGTTATTGTATTTTCAGTTCAATGAAGAAGCACGCTCACCATTTGTAAAGAAGTACAAGACTATTATACACCCTGGTGAGGTATGAAACGTATGAAAATATGATCTACAATGACATTCATATAAGTGGTTGATCTTCAGGCATTCTTTCCATGCTTTCATGTGGTTACGGTTACATTATTATTTTTTTCCCATTTGAGTTTCAAGGTCTTTCTCCGTCTTGAACATTTATATTTCTTTCTTTGGATAGTTGTTACCGTTTGGTTGGAAGTTGAAAATGATGGAAGATGAAGAGTTGTTTTAAAATTGTTGTCTTTCTAATGGTCTATGTTGAATATTTTCCCAGGTTAACAGAATTAGGGAACTTCCCCAGAATTCTAGAATTGTTGCCACACACACTGACAGTCCAGATGTATGTGCTATTAATTTTAGTTCTTGTTTTAATCCATTTGAAAATCTTATGCCTACAGTACATTTATTCAACAAACATTGATTTAAGTCTCTTTGGTATTCAACATCTAGGTCCTCATTTGGGATGTTGAGGCACAACCTAACCGCCATGCTGTCCTTGGTGCCACAAATTCTCGCCCAGATTTGGTAATTCTATTAAAAACTTGAATTGCCTACACGATCTTTGTTTTCCTATAAATGTTGCTATATCTTAGATCTCTGGTGTATATTTCTAGTTGATGTTTCTCTATCTACCTTCTGGTTACAACCACGGTGGAATACATAACATTGATGTTGAAGTATGCCTTGCATTAAAGCAAAACCAACGTTATCACTTTTCTTTTTAGGGCCGTGCATCATTAGAATGCCTTTCGAGAATAGCTGATAAGATGGAAGATGTCCTTTTTACAACGTTCTTTTCTCACTCATTTAAGCACTAAATTTGAATATGGTTTTAAGAGAGAATGACTCAATTCTTCTGTACTTAGATTTCGTGAGAGTCAGAAATTCTTGAGTCTCTTGGATGTAGATTATGGGCTTGGCTGGTTGAGGTTAAGTTTAATCCTGCTATAGATGCTTATTGAGATCTCGATATGCCCAACTGAGTTTAGTTCACTTTCGTACTGTTAATGTTGCGTCATGGTTGAACAATAGGCTGCTTAATTTATTTTCTTATGAATTTAAAAGGTTCAAGCATTTCATAGTTTTAATCTGATAGGAGATATTTTACAAAATCATTATCTAGAGCTGAGGTGGGTTCACAGTCTGATTGGAAGACCATGGTAGGTCATCGATGAGTGTGTAAAACTTCTTTGGTCTCATTTACACATCTTAACAATTGTTCAACCTGGGACCATATAAAGATAAATTCTTCTGGTGTTTGTAACATGGACATTCTAAATCCCTTGCTTGTTTAAAGCCTTCATTGTGCAATTTTATCTTAACTTACTGGTCACTTGCAGATTCTGACTGGTCATCAAGAAAATGCCGAGTTTGCTCTAGCTATGTGCCCCACTGAACCTTATGTCCTATCTGGAGGTTGAATGTTTTCCAATCTTTTTCTCTCCTTTAAGTTGGTCCTGCTAATTAGGTTCAATCTGGAAAGTTCTACTCAACTCTTGCACTTTTTCAGATTTAGATGTCAAATTATTTTGTTGTTTTTTTCAATACCTATCAATTTCTTTGCTGTATGGGATTTTGGTCTATCTTATGGTTTGGGGTGTTACAAATGGGGTTCCATTTCTCTGATGTTACTTCATCTTATGGGCAGGGAAGGACAAGTTAGTTGTTTTATGGAGTATCCAGGACCATATAACAACTTCTGCCACAGACCCTGCTGCTTCAAAATCACCAGGATCTGGTGGATCTATCATAAAGAAGGCGGGAGAGGGAACTGATAAAACTTCTGATGGTCCTTCTATTGGGCCACGAGGAGTTTACCATGGCCATGAGGATACTGTTGAAGATGTGACCTTCTGTCCATCCAAGTAATATCTTAGTTCTGTGTTACTGGTCATTCAGTTTTTTATTTTGAGAATTATTGATAAATGGACATGCGGATGGGGCATTTAAAACCATTTTTATTTAAAATATATTTTGTTTATTACGTAGTACAGGTGATTTCTAGCTGAGGTTTGGTTTGACCAAGATGAGAGGTTTTGAAGGGTAGGGTTTTGCGTGGGGGATGAGGGTCGATTTCTGGATCATAGTGCTCTTGATTAATGTTACAAATAAACTATCGAGCATTGAATAATTTTTACTGTATATTATCTTCACAAAACTGAAAATAGTAAAAAGTAGTAATAAAATTGTACCATTTTTTAATCTTCCAAGGTAAGGAACTCACGTATAGTAGATTTTCCTAACGTTATAACTAATTTGGTTGGGACTGGGCATAAATGATCTCTTTTTCACTACCATAACTTTACTTCCCTCCTTCCTAGTTGAAGAGGCCTTTTGTAATCTCTCTGTTGGGGGTCTTCCCCTCTTTTGTAAATTCATGCCATCAATGAAATTGTTTCTTAACCCTCCCCAAAAAATAATGCTATTTTAAAGTATTTTAATTTGTGTCTTTGTGTTCATGTACACCAGATTTTATTTCGCTTGAGTGGATGGAAGTCTAATTTTTAGTTGTTTCTTTTCCAAGCGTAACCAGCAACGTCCTTTTCTTTTATTTTCCCTCTTAAGTGCACAGGAGTTCTGCAGTGTAGGAGATGACTCTTGCCTAATATTATGGGATGCCCGTACTGGCTCTAGCCCAGCTGTCAAGGTCTTAATTCTCTTGTTTAGTTCTCTGTAACAAGGGGCTCTCCTTGTCGTATAATGTGGAATTTGAAATATGTGTTATAGAGCATAAATTCGATGAAACTAGAAACTTGTCATTTTCCTATTATGAGAAATATTTTTTGATTTGTTATACTGGCTTATGTCGATAGGACAAGAATTCCATGGTGTTGTTTTGTAGTAGGGAATTTTTTTCGTTGTTCATACCAAGTATTCTGTGCAGGAGATTTATCCTGCAAGATAAAATTGATATCATGTAATTTAATATTGCTTTTTATGTTTTTATCCTCAGGGTGCATATTTTAGGTGCTTCTTGTTATTCATCTTAGTCTGTCTTTCTTTCTGGTCTGCAGGTTGAAAAGGCACACAATGCTGATCTTCATTGTGTTGATTGGAATCCCCATGATGATAATCTTATCATAACAGGGTAAATTTTAATGTTTTATTTTCTTTTCAATTGCTAATGCTTATGGGAAAATTGGGTCTCATGTATTTATACAAAAAAGAGAATAAATATACAGACAGGGGAGAGAGAAGAAATAATAAATAGAATCTCAGAAGCACAAACACTTTAGTTTAGCCAGCAGGTCCGTGTCTGACACGAGTTAGACACTTGGACATTTTAACACTTGTTGGTTACCTACTAAACACTTGTTAGGGCAGTAGATGTGTTAACATTAGTTCTACAAAGTCAAAGTAGGTCCAACATTTGTTAGAAATAAATCGAACCCTTGTTAAGTATACTAAATTGCACATATGACAAAATTAATAAATTTGAGAGAAAATACACAAACTAATTTTTAAGCATATGAACTCATTGACTTCGAAATTCATTATGGTATAAAAATGACATTTTAAAAAATGTATATTTTAATGAACGTGTCAGTTGCTGTGTTCTTGTCCTCGATTTTAGAAAATTAGGTGTTGTGGTGTCTGTGCTTCTTAAAATAGCTCAATGGTCTTACTACTATTTGCCCTGTCTTAGTTAGGCTGTTGTGTGTCATTTGGTTTGGTCTCATCATTTTATCGATAGTGCATTTAGGATGATTTAGAATTTAGGAGAATGAGATTTCTGAGGTAATGACTATAAACATACGCGGTCACATCGGCACAAATTTCTCTTTAGGTCAGCAGATAATTCTATTCGCATGTTTGATCGTCGGAATCTCACTTCTAATGGAGTTGGCTCGCCCATCTATAAATTTGAGGGCCACAAAGCAGCGGTTCTTTGTGTTCAGGTATTCTTTTTGTCTTATTACTACTAATTTTATTTGAAAATGCTCTTGCTATTAGTTGTAAAATGTACTGATTATACGTACTGGTCCATATATGTTCTTATTCAGTGGTCCCCAGATAAATCGTCTGTCTTTGGAAGTTCTGCTGAGGATGGGCTGTTGAATATTTGGGATTACGATAAGGTACATTATTTTGAGATTTTAACCATGTTTTATCTTTTCATCATCTTGCTTCAGGATATCAGGGTTCAGTTTAATGCCGGAACTCAGGATAATAATATGTTATACTATGAAGTTCTTTTGCCCTTTAGCTTAGAGATCTGATTCCACAAACTTCCACGACAACTGGCTGATATTTTATAGCATATCTTATATGTTTATAGTCCATGTTAACTAATTTGTCCTAGTATTTCTATTGAATTTTTTGTAGTATAGACTATAATTTCTTTGTGGTCTCAACATATAATATACATTCCAGGTTGGTAAAAAGACAGAGCGAGCTACAAGGACACCTGCTGCTCCTCCTGGTTTATTTTTCCAGCATGCTGGGCACAGGTTCCACATCGCTGCCTTTCTTGATTAAAATGCATGAACATGACATAAAAAANCCCCCCCCCCAAAAAAAAAAAAGAAAAAAAAGAAAAGAGGAGTTTTTTAGTTTGCTGTTGGTTCATGGTTTTTGTTATGTACTTGTAGGTAGGCAGAAAATTATTTTGTTATTAGATTTGGCTTCAATGGTAATGGTCCTTCTCATCTAATCTTTTGCCCCAGGGATAAAGTCGTTGACTTCCATTGGAATGCAGCTGATCCATGGACTGTTGTGAGTGTGTCTGATGATTGTGATACGACTGGTGGAGGAGGGACGTTGCAGGTTGCTATCTCTCTCCCTTTCTCTCTCACTCGCCTCCCACACATGCACAAAAGTAAAACAAATAGATTTGTACGGTTTCTGAACTTGGAGGAGTGTGCTAATTTTCTCTTGAATTTGTTATGAGTTGCAGATATGGCGCATGAGTGACCTAATTTATCGTCCAGAAGATGAGGTGTTAGCTGAGCTTGAAAAATTCAAATCTCACGTAATTGAATGTGCTGCAAAGCCTTGAGAGCTTCATAAACCTTTCTACCTGTGTTTTCTTCGGACATAACGCAGTAGGTTGTAGATTTTGTCTGTTTTCTTAGGATTGAGATGTTAGGTGTGCAAGTAAATCCGGCGCAGGCTCTCTCTACCATTTATAGAGTTTAGTAGAATGAACATGATCCACAGTGATACCATCAAGGTATTGAACGCTCTTGTTTCCCTGGATTACTTGGAGGTTAGGTTCTTGAAATTTGGTGTTATGTTGAATGAAACCCGTTATGATTCTGTGTTTGGGGACCGTCTAGGAAGGCCTAATGGATGTCTTTTCACTCTTATTATTTTGAAGCGTGGTGGATGATATTATATGATATTACAGGTGAGACTAATGGATGTCTTTTTTCATGATACAAAGTGTTCTATACATTGATTGAAAAGAAAAAAAAAAATTGACTCTGAATCTTACAGCATTTTCTTCACTCTAATGCTTTATTCTGCAACAGCAAGGAAGAAAAAAAACGTCGAGGATAACGAGAAGGTTAACCGTTCTTGTATCCACCCCAGTTCATCATCACCTTATCCTCTACAATACCATCAATTTGTTGTGTATAGCACTGCAAAAGCTATTTTGGAAGCCAGTTAGTTGAAATTTGTATGTATGTTCGTTCGTCTTAGGGCGTATTGGATCAAAATCGGTCAACATCGAAATTGAAGTATGTGGAAGGACTTGCTTGTTTGTTAGGTAGAACCGGACCATAATCTAGGCATTCCATGTTCACTGATGCTACTTCCAGAGTACCTTCTCATTCTTTGCCAGAAGTGTCTCTTACCTTTCCCGTTTCGAAGAGGACTAGAACTTAGGCATCTTTCGAGTGATCTAGATTCTAGCATTCCGATTCCAATTGAAGTATTTAACATGCCCGACATTGAGAGACGTGTATCGATGACATCAAGACCTGACATTGAGCCAGATACTAAATATTCGCTATGAAAAGAGTCTAGCATGTATCTTGAACTCAATGTCTCGTCATTCATCTCCACAGACGAATTATCTAGGCTTGAACCAGCTTGTGAGATTGCTTCTCGAGCATCGTTGCATTGGTTGCATACTAGTTTCAAAGCTTGAACGACCTCGCCCATGAAAGGTCGATGTGACACCTCTGGTTGCACACACATTGAAGCAATGGCAGCCACTTTGGCAATATTTTCAAAAGGAACACTAGAATCGAGTGATTTGTCAATTATAACATCTAAACCTTCTTTGCTCGTGAGTAGTGGACGAGCCCAAGCAACAAGATTTTCTTCACCAGGCGGTTGCGACATGTCAACTGGTTTCCTCCCAGTTAATAGCTCGAGAAGAACGACCCCGTAGCTATAAACATCACTCTTCACAAGTAAATGCCCCGTCATCGC

mRNA sequence

AAAAGAGTATATTACTGCTGCTATTTCACGGCACGGCAATAACACTACGCGCCACTGCATTAGGAAGACCTACCGAGATTTTGACACGTGTCCCAATATCAACCGTTGGTGGTGGATCATCATCATCATTTCCCAAATTGGACCCAAAATATATTTTTGTCTGGGTTTACTGTATGTAACTGGTTTTGATACTTTGGTGGGGTCGCCTTGAGTCTGGAGAGTCAAAGAGAGCGCGAAAAGTATCGTACTGAAAAATCGCGGACCGAAGCTCCACTGAGCTATGGATTCTCCTCAGTCGCAGCAGCAGCAGCAGCAACAGCTTCAACAACAGCAGCAGCAGCAGCCTGTTGTGAAGAAGAAGGAGACCAGAGGCCGGAAGCCCAAGCCTAAGGAGGAGAAGAAGGACGAGCAGCAAGCTAAGAAGATGAAGGCTCACCAACAGCCCTCCGTCGATGAGCGTTATACTCAGTGGAAGTCTCTTGTTCCTGTTCTCTATGACTGGTTCGCTAACCACAATCTCGTTTGGCCTTCTCTCTCTTGCCGGTGGGGTCCTCAGCTCGAGCAAGCAACGTATAAGAATCGACAGCGGCTTTATCTTTCTGAACAGACTGATGGCAGTGTTCCGAATACACTGGTCATTGCAAATTGTGAAGTTGTGAAGCCGAGGGTTGCAGCCGCAGAGCACATTTCTCAGTTCAATGAAGAAGCACGCTCACCATTTGTAAAGAAGTACAAGACTATTATACACCCTGGTGAGGTTAACAGAATTAGGGAACTTCCCCAGAATTCTAGAATTGTTGCCACACACACTGACAGTCCAGATGTCCTCATTTGGGATGTTGAGGCACAACCTAACCGCCATGCTGTCCTTGGTGCCACAAATTCTCGCCCAGATTTGTTGATGTTTCTCTATCTACCTTCTGGTTACAACCACGGTGGAATACATAACATTGATGTTGAAATTCTGACTGGTCATCAAGAAAATGCCGAGTTTGCTCTAGCTATGTGCCCCACTGAACCTTATGTCCTATCTGGAGGGAAGGACAAGTTAGTTGTTTTATGGAGTATCCAGGACCATATAACAACTTCTGCCACAGACCCTGCTGCTTCAAAATCACCAGGATCTGGTGGATCTATCATAAAGAAGGCGGGAGAGGGAACTGATAAAACTTCTGATGGTCCTTCTATTGGGCCACGAGGAGTTTACCATGGCCATGAGGATACTGTTGAAGATGTGACCTTCTGTCCATCCAATGCACAGGAGTTCTGCAGTGTAGGAGATGACTCTTGCCTAATATTATGGGATGCCCGTACTGGCTCTAGCCCAGCTGTCAAGGTTGAAAAGGCACACAATGCTGATCTTCATTGTGTTGATTGGAATCCCCATGATGATAATCTTATCATAACAGGGTCAGCAGATAATTCTATTCGCATGTTTGATCGTCGGAATCTCACTTCTAATGGAGTTGGCTCGCCCATCTATAAATTTGAGGGCCACAAAGCAGCGGTTCTTTGTGTTCAGTGGTCCCCAGATAAATCGTCTGTCTTTGGAAGTTCTGCTGAGGATGGGCTGTTGAATATTTGGGATTACGATAAGGTTGGTAAAAAGACAGAGCGAGCTACAAGGACACCTGCTGCTCCTCCTGGTTTATTTTTCCAGCATGCTGGGCACAGGGATAAAGTCGTTGACTTCCATTGGAATGCAGCTGATCCATGGACTGTTGTGAGTGTGTCTGATGATTGTGATACGACTGGTGGAGGAGGGACGTTGCAGATATGGCGCATGAGTGACCTAATTTATCGTCCAGAAGATGAGGTGTTAGCTGAGCTTGAAAAATTCAAATCTCACGTAATTGAATGTGCTGCAAAGCCTTGAGAGCTTCATAAACCTTTCTACCTGTGTTTTCTTCGGACATAACGCAGTAGGTTGTAGATTTTGTCTGTTTTCTTAGGATTGAGATGTTAGGTGTGCAAGTAAATCCGGCGCAGGCTCTCTCTACCATTTATAGAGTTTAGTAGAATGAACATGATCCACAGTGATACCATCAAGGTATTGAACGCTCTTGTTTCCCTGGATTACTTGGAGGTTAGGTTCTTGAAATTTGGTGTTATGTTGAATGAAACCCGTTATGATTCTGTGTTTGGGGACCGTCTAGGAAGGCCTAATGGATGTCTTTTCACTCTTATTATTTTGAAGCGTGGTGGATGATATTATATGATATTACAGGTGAGACTAATGGATGTCTTTTTTCATGATACAAAGTGTTCTATACATTGATTGAAAAGAAAAAAAAAAATTGACTCTGAATCTTACAGCATTTTCTTCACTCTAATGCTTTATTCTGCAACAGCAAGGAAGAAAAAAAACGTCGAGGATAACGAGAAGGTTAACCGTTCTTGTATCCACCCCAGTTCATCATCACCTTATCCTCTACAATACCATCAATTTGTTGTGTATAGCACTGCAAAAGCTATTTTGGAAGCCAGTTAGTTGAAATTTGTATGTATGTTCGTTCGTCTTAGGGCGTATTGGATCAAAATCGGTCAACATCGAAATTGAAGTATGTGGAAGGACTTGCTTGTTTGTTAGGTAGAACCGGACCATAATCTAGGCATTCCATGTTCACTGATGCTACTTCCAGAGTACCTTCTCATTCTTTGCCAGAAGTGTCTCTTACCTTTCCCGTTTCGAAGAGGACTAGAACTTAGGCATCTTTCGAGTGATCTAGATTCTAGCATTCCGATTCCAATTGAAGTATTTAACATGCCCGACATTGAGAGACGTGTATCGATGACATCAAGACCTGACATTGAGCCAGATACTAAATATTCGCTATGAAAAGAGTCTAGCATGTATCTTGAACTCAATGTCTCGTCATTCATCTCCACAGACGAATTATCTAGGCTTGAACCAGCTTGTGAGATTGCTTCTCGAGCATCGTTGCATTGGTTGCATACTAGTTTCAAAGCTTGAACGACCTCGCCCATGAAAGGTCGATGTGACACCTCTGGTTGCACACACATTGAAGCAATGGCAGCCACTTTGGCAATATTTTCAAAAGGAACACTAGAATCGAGTGATTTGTCAATTATAACATCTAAACCTTCTTTGCTCGTGAGTAGTGGACGAGCCCAAGCAACAAGATTTTCTTCACCAGGCGGTTGCGACATGTCAACTGGTTTCCTCCCAGTTAATAGCTCGAGAAGAACGACCCCGTAGCTATAAACATCACTCTTCACAAGTAAATGCCCCGTCATCGC

Coding sequence (CDS)

ATGGATTCTCCTCAGTCGCAGCAGCAGCAGCAGCAACAGCTTCAACAACAGCAGCAGCAGCAGCCTGTTGTGAAGAAGAAGGAGACCAGAGGCCGGAAGCCCAAGCCTAAGGAGGAGAAGAAGGACGAGCAGCAAGCTAAGAAGATGAAGGCTCACCAACAGCCCTCCGTCGATGAGCGTTATACTCAGTGGAAGTCTCTTGTTCCTGTTCTCTATGACTGGTTCGCTAACCACAATCTCGTTTGGCCTTCTCTCTCTTGCCGGTGGGGTCCTCAGCTCGAGCAAGCAACGTATAAGAATCGACAGCGGCTTTATCTTTCTGAACAGACTGATGGCAGTGTTCCGAATACACTGGTCATTGCAAATTGTGAAGTTGTGAAGCCGAGGGTTGCAGCCGCAGAGCACATTTCTCAGTTCAATGAAGAAGCACGCTCACCATTTGTAAAGAAGTACAAGACTATTATACACCCTGGTGAGGTTAACAGAATTAGGGAACTTCCCCAGAATTCTAGAATTGTTGCCACACACACTGACAGTCCAGATGTCCTCATTTGGGATGTTGAGGCACAACCTAACCGCCATGCTGTCCTTGGTGCCACAAATTCTCGCCCAGATTTGTTGATGTTTCTCTATCTACCTTCTGGTTACAACCACGGTGGAATACATAACATTGATGTTGAAATTCTGACTGGTCATCAAGAAAATGCCGAGTTTGCTCTAGCTATGTGCCCCACTGAACCTTATGTCCTATCTGGAGGGAAGGACAAGTTAGTTGTTTTATGGAGTATCCAGGACCATATAACAACTTCTGCCACAGACCCTGCTGCTTCAAAATCACCAGGATCTGGTGGATCTATCATAAAGAAGGCGGGAGAGGGAACTGATAAAACTTCTGATGGTCCTTCTATTGGGCCACGAGGAGTTTACCATGGCCATGAGGATACTGTTGAAGATGTGACCTTCTGTCCATCCAATGCACAGGAGTTCTGCAGTGTAGGAGATGACTCTTGCCTAATATTATGGGATGCCCGTACTGGCTCTAGCCCAGCTGTCAAGGTTGAAAAGGCACACAATGCTGATCTTCATTGTGTTGATTGGAATCCCCATGATGATAATCTTATCATAACAGGGTCAGCAGATAATTCTATTCGCATGTTTGATCGTCGGAATCTCACTTCTAATGGAGTTGGCTCGCCCATCTATAAATTTGAGGGCCACAAAGCAGCGGTTCTTTGTGTTCAGTGGTCCCCAGATAAATCGTCTGTCTTTGGAAGTTCTGCTGAGGATGGGCTGTTGAATATTTGGGATTACGATAAGGTTGGTAAAAAGACAGAGCGAGCTACAAGGACACCTGCTGCTCCTCCTGGTTTATTTTTCCAGCATGCTGGGCACAGGGATAAAGTCGTTGACTTCCATTGGAATGCAGCTGATCCATGGACTGTTGTGAGTGTGTCTGATGATTGTGATACGACTGGTGGAGGAGGGACGTTGCAGATATGGCGCATGAGTGACCTAATTTATCGTCCAGAAGATGAGGTGTTAGCTGAGCTTGAAAAATTCAAATCTCACGTAATTGAATGTGCTGCAAAGCCTTGA

Protein sequence

MDSPQSQQQQQQQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Homology
BLAST of Cp4.1LG18g03560 vs. ExPASy Swiss-Prot
Match: O22607 (WD-40 repeat-containing protein MSI4 OS=Arabidopsis thaliana OX=3702 GN=MSI4 PE=1 SV=3)

HSP 1 Score: 837.8 bits (2163), Expect = 6.8e-242
Identity = 406/511 (79.45%), Postives = 443/511 (86.69%), Query Frame = 0

Query: 30  RGRKPKPKEEKK----DEQQAKKM-----KAHQQPSVDERYTQWKSLVPVLYDWFANHNL 89
           RGRKPK KE+ +     +Q   KM     K  Q PSVDE+Y+QWK LVP+LYDW ANHNL
Sbjct: 28  RGRKPKTKEDSQTPSSQQQSDVKMKESGKKTQQSPSVDEKYSQWKGLVPILYDWLANHNL 87

Query: 90  VWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN 149
           VWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN
Sbjct: 88  VWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN 147

Query: 150 EEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSPDVLIWDVEAQPNRHAVLGAT 209
           EEARSPFVKKYKTIIHPGEVNRIRELPQNS+IVATHTDSPDVLIWDVE QPNRHAVLGA 
Sbjct: 148 EEARSPFVKKYKTIIHPGEVNRIRELPQNSKIVATHTDSPDVLIWDVETQPNRHAVLGAA 207

Query: 210 NSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFALAMCPTEPYVLSGGKDKLVVL 269
           NSRPDL                     ILTGHQ+NAEFALAMCPTEP+VLSGGKDK VVL
Sbjct: 208 NSRPDL---------------------ILTGHQDNAEFALAMCPTEPFVLSGGKDKSVVL 267

Query: 270 WSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVT 329
           WSIQDHITT  TD  +S      GSIIK+ GEGTDK ++ P++GPRGVYHGHEDTVEDV 
Sbjct: 268 WSIQDHITTIGTDSKSS------GSIIKQTGEGTDK-NESPTVGPRGVYHGHEDTVEDVA 327

Query: 330 FCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNADLHCVDWNPHDDNLIITGSAD 389
           F P++AQEFCSVGDDSCLILWDARTG++P  KVEKAH+ADLHCVDWNPHDDNLI+TGSAD
Sbjct: 328 FSPTSAQEFCSVGDDSCLILWDARTGTNPVTKVEKAHDADLHCVDWNPHDDNLILTGSAD 387

Query: 390 NSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKV 449
           N++R+FDRR LT+NGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYD+V
Sbjct: 388 NTVRLFDRRKLTANGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDRV 447

Query: 450 GKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIW 509
            KK++RA ++PA   GLFFQHAGHRDKVVDFHWNA+DPWT+VSVSDDC+TTGGGGTLQIW
Sbjct: 448 SKKSDRAAKSPA---GLFFQHAGHRDKVVDFHWNASDPWTIVSVSDDCETTGGGGTLQIW 507

Query: 510 RMSDLIYRPEDEVLAELEKFKSHVIECAAKP 532
           RMSDLIYRPE+EV+AELEKFKSHV+ CA+KP
Sbjct: 508 RMSDLIYRPEEEVVAELEKFKSHVMTCASKP 507

BLAST of Cp4.1LG18g03560 vs. ExPASy Swiss-Prot
Match: Q9SU78 (WD-40 repeat-containing protein MSI5 OS=Arabidopsis thaliana OX=3702 GN=MSI5 PE=1 SV=2)

HSP 1 Score: 735.3 bits (1897), Expect = 4.8e-211
Identity = 360/503 (71.57%), Postives = 413/503 (82.11%), Query Frame = 0

Query: 28  ETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSC 87
           + R RKPK   E    Q    ++  Q+ +VD+ Y+QWK+L+P+LYD F NH LVWPSLSC
Sbjct: 28  DKRRRKPKSNNE---SQLPFLLQQSQKATVDDTYSQWKTLLPILYDSFVNHTLVWPSLSC 87

Query: 88  RWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPF 147
           RWGPQLEQA  K  QRLYLSEQT+GSVPNTLVIANCE V           Q NE+A SPF
Sbjct: 88  RWGPQLEQAGSKT-QRLYLSEQTNGSVPNTLVIANCETVN---------RQLNEKAHSPF 147

Query: 148 VKKYKTIIHPGEVNRIRELPQNSRIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLL 207
           VKKYKTIIHPGEVNRIRELPQNS+IVATHTDSPD+LIW+ E QP+R+AVLGA +SRPDLL
Sbjct: 148 VKKYKTIIHPGEVNRIRELPQNSKIVATHTDSPDILIWNTETQPDRYAVLGAPDSRPDLL 207

Query: 208 MFLYLPSGYNHGGIHNIDVEILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHI 267
                                L GHQ++AEFALAMCPTEP+VLSGGKDK V+LW+IQDHI
Sbjct: 208 ---------------------LIGHQDDAEFALAMCPTEPFVLSGGKDKSVILWNIQDHI 267

Query: 268 TTSATDPAASKSPGSGGSIIKKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQ 327
           T + +D   SKSPGS     K+ GEG+DKT  GPS+GPRG+Y+GH+DTVEDV FCPS+AQ
Sbjct: 268 TMAGSD---SKSPGSS---FKQTGEGSDKTG-GPSVGPRGIYNGHKDTVEDVAFCPSSAQ 327

Query: 328 EFCSVGDDSCLILWDARTGSSPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFD 387
           EFCSVGDDSCL+LWDARTG+SPA+KVEKAH+ADLHCVDWNPHD+NLI+TGSADN++R+FD
Sbjct: 328 EFCSVGDDSCLMLWDARTGTSPAMKVEKAHDADLHCVDWNPHDNNLILTGSADNTVRVFD 387

Query: 388 RRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERA 447
           RRNLTSNGVGSP+YKFEGH+AAVLCVQWSPDKSSVFGSSAEDGLLNIWD D+VGKK+ERA
Sbjct: 388 RRNLTSNGVGSPVYKFEGHRAAVLCVQWSPDKSSVFGSSAEDGLLNIWDCDRVGKKSERA 447

Query: 448 TRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIY 507
           T+T   P GLFFQHAGHRDKVVDFHW+  +PWT+VSVSD+C++ GGGGTLQIWRMSDLIY
Sbjct: 448 TKT---PDGLFFQHAGHRDKVVDFHWSLLNPWTIVSVSDNCESIGGGGTLQIWRMSDLIY 486

Query: 508 RPEDEVLAELEKFKSHVIECAAK 531
           RPEDEVL ELEKFKSHV  C +K
Sbjct: 508 RPEDEVLTELEKFKSHVFTCTSK 486

BLAST of Cp4.1LG18g03560 vs. ExPASy Swiss-Prot
Match: Q9I8G9 (Histone-binding protein RBBP7 OS=Gallus gallus OX=9031 GN=RBBP7 PE=1 SV=1)

HSP 1 Score: 223.4 bits (568), Expect = 6.1e-57
Identity = 150/470 (31.91%), Postives = 219/470 (46.60%), Query Frame = 0

Query: 57  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGS-VP 116
           + E Y  WK   P LYD    H L WPSL+ +W P + +   K+    +L   T  S   
Sbjct: 16  ISEEYKIWKKNTPFLYDLVMTHALEWPSLTVQWLPDVSRPEGKDYALHWLVLGTHTSDEQ 75

Query: 117 NTLVIANCEVVK-PRVAAAEHISQFNEEARSPFVK-KYKT---IIHPGEVNRIRELPQNS 176
           N LV+A  ++    +   +++ S+  E      V  K +T   I H GEVNR R +PQN 
Sbjct: 76  NHLVVARVQIPNDDQFDTSQYDSEKGEFGGFGSVTGKIETEIKINHEGEVNRARYMPQNP 135

Query: 177 RIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILT 236
            I+AT T S DVL++D    P++    G  N  PDL +  +   GY      N     L 
Sbjct: 136 YIIATKTPSADVLVFDYTKHPSKPDPSGECN--PDLRLRGHQKEGYGLSWNSN-----LK 195

Query: 237 GHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKA 296
           GH                +LS   D  V LW I             S  P          
Sbjct: 196 GH----------------LLSASDDHTVCLWDI-------------SAGP---------- 255

Query: 297 GEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDAR--TGSS 356
                   +G  +  + ++ GH   VEDV +   +   F SV DD  L++WD R  T S 
Sbjct: 256 -------KEGKIVDAKAIFTGHSAVVEDVAWHLLHESLFGSVADDQKLMIWDTRSNTTSK 315

Query: 357 PAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKA 416
           P+  V+ AH A+++C+ +NP+ + ++ TGSAD ++ ++D RNL        ++ FE HK 
Sbjct: 316 PSHSVD-AHTAEVNCLSFNPYSEFILATGSADKTVALWDLRNLKLK-----LHSFESHKD 375

Query: 417 AVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKV 476
            +  V WSP   ++  SS  D  LN+WD  K+G++ + A      PP L F H GH  K+
Sbjct: 376 EIFQVHWSPHNETILASSGTDRRLNVWDLSKIGEE-QSAEDAEDGPPELLFIHGGHTAKI 418

Query: 477 VDFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELE 519
            DF WN  +PW + SVS+D         +QIW+M++ IY  E+  +A  E
Sbjct: 436 SDFSWNPNEPWVICSVSED-------NIMQIWQMAENIYNDEEPDIAAAE 418

BLAST of Cp4.1LG18g03560 vs. ExPASy Swiss-Prot
Match: Q8AVH1 (Histone-binding protein RBBP7 OS=Xenopus laevis OX=8355 GN=rbbp7 PE=2 SV=1)

HSP 1 Score: 223.4 bits (568), Expect = 6.1e-57
Identity = 149/466 (31.97%), Postives = 217/466 (46.57%), Query Frame = 0

Query: 57  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGS-VP 116
           ++E Y  WK   P LYD    H L WPSL+ +W P + +   K+    +L   T  S   
Sbjct: 16  INEEYKIWKKNTPFLYDLVMTHALEWPSLTVQWLPDVTRPEGKDYALHWLVLGTHTSDEQ 75

Query: 117 NTLVIANCEVVKPRVAAAEHISQFNEE--------ARSPFVKKYKTIIHPGEVNRIRELP 176
           N LV+A  +V  P   A    S ++ E        + S  ++    I H GEVNR R +P
Sbjct: 76  NHLVVARVQV--PNDDAQFDASHYDSEKGEFGGFGSVSGKIETEIKINHEGEVNRARYMP 135

Query: 177 QNSRIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVE 236
           QN  I+AT T S DVL++D    P++    G  +  PDL +  +   GY      N    
Sbjct: 136 QNPCIIATKTPSADVLVFDYTKHPSKPDPSGECS--PDLRLRGHQKEGYGLSWNSN---- 195

Query: 237 ILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSII 296
            L+GH                +LS   D  V LW I             S  P       
Sbjct: 196 -LSGH----------------LLSASDDHTVCLWDI-------------SAGP------- 255

Query: 297 KKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDAR--T 356
                      +G  +  + V+ GH   VEDV +   +   F SV DD  L++WD R  T
Sbjct: 256 ----------KEGKVVDAKAVFTGHSAVVEDVAWHLLHESLFGSVADDQKLMIWDTRSNT 315

Query: 357 GSSPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEG 416
            S P+  V+ AH A+++C+ +NP+ + ++ TGSAD ++ ++D RNL        ++ FE 
Sbjct: 316 TSKPSHSVD-AHTAEVNCLSFNPYSEFILATGSADKTVALWDLRNLKLK-----LHSFES 375

Query: 417 HKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHR 476
           HK  +  V WSP   ++  SS  D  LN+WD  K+G++ + A      PP L F H GH 
Sbjct: 376 HKDEIFQVHWSPHNETILASSGTDRRLNVWDLSKIGEE-QSAEDAEDGPPELLFIHGGHT 412

Query: 477 DKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPED 512
            K+ DF WN  +PW + SVS+D         +QIW+M++ IY  E+
Sbjct: 436 AKISDFSWNPNEPWVICSVSED-------NIMQIWQMAENIYNDEE 412

BLAST of Cp4.1LG18g03560 vs. ExPASy Swiss-Prot
Match: Q7ZTY4 (Histone-binding protein RBBP7 OS=Danio rerio OX=7955 GN=rbbp7 PE=2 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 7.9e-57
Identity = 153/484 (31.61%), Postives = 228/484 (47.11%), Query Frame = 0

Query: 51  AHQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNR--QRLYLSE 110
           A ++  ++E Y  WK   P LYD    H L WPSL+ +W P + +   K+    RL L  
Sbjct: 10  AVEERVINEEYKIWKKNTPFLYDLVMTHALEWPSLTVQWLPDVNRPEGKDYVVHRLVLGT 69

Query: 111 QTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPF---------VKKYKTIIHPGE 170
            T     N LVIA+ ++  P   A    S ++ E  + F         ++    I H GE
Sbjct: 70  HTSDE-QNHLVIASAQI--PNDDAQFDASHYDSEKGAEFGGFGSVSGKIEIEIKINHEGE 129

Query: 171 VNRIRELPQNSRIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHG 230
           VNR R +PQN  I+AT T + DVL +D    P          S+PD       PSG    
Sbjct: 130 VNRARYMPQNPCIIATKTPTSDVLAFDYTKHP----------SKPD-------PSGDCSP 189

Query: 231 GIHNIDVEILTGHQENAEFALAMCPT-EPYVLSGGKDKLVVLWSIQDHITTSATDPAASK 290
            +       L GHQ+   + L+  P     +LS   D  + LW I             S 
Sbjct: 190 DLR------LRGHQKEG-YGLSWNPNLSGNLLSASDDHTICLWDI-------------SG 249

Query: 291 SPGSGGSIIKKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCL 350
           +P                  +G  +  + ++ GH   VEDV++   +   F SV DD  L
Sbjct: 250 AP-----------------KEGKIVDAKTIFTGHTAVVEDVSWHLLHESLFGSVADDQKL 309

Query: 351 ILWDARTG--SSPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGV 410
           ++WD R+   S P+  V+ AH A+++C+ +NP+ + ++ TGSAD ++ ++D RNL     
Sbjct: 310 MIWDTRSNNTSKPSHSVD-AHTAEVNCLSFNPYSEFILATGSADKTVALWDLRNLKLK-- 369

Query: 411 GSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPG 470
              ++ FE HK  +  VQWSP   ++  SS  D  LN+WD  K+G++ + A      PP 
Sbjct: 370 ---LHSFESHKDEIFQVQWSPHNETILASSGTDRRLNVWDLSKIGEE-QSAEDAEDGPPE 422

Query: 471 LFFQHAGHRDKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPE--DEVL 519
           L F H GH  K+ DF WN  +PW + SVS+D         +Q+W+M++ IY  E  D   
Sbjct: 430 LLFIHGGHTAKISDFSWNPNEPWVICSVSED-------NIMQVWQMAENIYNDEEPDTPA 422

BLAST of Cp4.1LG18g03560 vs. NCBI nr
Match: XP_023516978.1 (WD-40 repeat-containing protein MSI4 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1018 bits (2633), Expect = 0.0
Identity = 510/531 (96.05%), Postives = 510/531 (96.05%), Query Frame = 0

Query: 1   MDSPQSQQQQQQQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER 60
           MDSPQSQQQQQQQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER
Sbjct: 1   MDSPQSQQQQQQQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER 60

Query: 61  YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI 120
           YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI
Sbjct: 61  YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI 120

Query: 121 ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP 180
           ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP
Sbjct: 121 ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP 180

Query: 181 DVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFAL 240
           DVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAEFAL
Sbjct: 181 DVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAEFAL 240

Query: 241 AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDG 300
           AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDG
Sbjct: 241 AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDG 300

Query: 301 PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD 360
           PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD
Sbjct: 301 PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD 360

Query: 361 LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS 420
           LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS
Sbjct: 361 LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS 420

Query: 421 SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT 480
           SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT
Sbjct: 421 SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT 480

Query: 481 VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 510

BLAST of Cp4.1LG18g03560 vs. NCBI nr
Match: XP_022960697.1 (WD-40 repeat-containing protein MSI4 [Cucurbita moschata])

HSP 1 Score: 1012 bits (2616), Expect = 0.0
Identity = 507/531 (95.48%), Postives = 507/531 (95.48%), Query Frame = 0

Query: 1   MDSPQSQQQQQQQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER 60
           MDSPQSQQQQQ Q QQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER
Sbjct: 1   MDSPQSQQQQQLQQQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER 60

Query: 61  YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI 120
           YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI
Sbjct: 61  YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI 120

Query: 121 ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP 180
           ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP
Sbjct: 121 ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP 180

Query: 181 DVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFAL 240
           DVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAEFAL
Sbjct: 181 DVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAEFAL 240

Query: 241 AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDG 300
           AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPA SKSPGSGGSIIKKAGEGTDKTSDG
Sbjct: 241 AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAVSKSPGSGGSIIKKAGEGTDKTSDG 300

Query: 301 PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD 360
           PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD
Sbjct: 301 PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD 360

Query: 361 LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS 420
           LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS
Sbjct: 361 LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS 420

Query: 421 SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT 480
           SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT
Sbjct: 421 SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT 480

Query: 481 VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 510

BLAST of Cp4.1LG18g03560 vs. NCBI nr
Match: XP_022987695.1 (WD-40 repeat-containing protein MSI4 [Cucurbita maxima])

HSP 1 Score: 1008 bits (2606), Expect = 0.0
Identity = 507/532 (95.30%), Postives = 507/532 (95.30%), Query Frame = 0

Query: 1   MDSPQSQQQQQ-QQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDE 60
           MDSPQSQQQQQ QQ QQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQ PSVDE
Sbjct: 1   MDSPQSQQQQQLQQQQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQHPSVDE 60

Query: 61  RYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLV 120
           RYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLV
Sbjct: 61  RYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLV 120

Query: 121 IANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDS 180
           IANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDS
Sbjct: 121 IANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDS 180

Query: 181 PDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFA 240
           PDVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAEFA
Sbjct: 181 PDVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAEFA 240

Query: 241 LAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSD 300
           LAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPA SKSPGSGGSIIKKAGEGTDKTSD
Sbjct: 241 LAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAVSKSPGSGGSIIKKAGEGTDKTSD 300

Query: 301 GPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNA 360
           GPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNA
Sbjct: 301 GPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNA 360

Query: 361 DLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDK 420
           DLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDK
Sbjct: 361 DLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDK 420

Query: 421 SSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPW 480
           SSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPW
Sbjct: 421 SSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPW 480

Query: 481 TVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           TVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 TVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 511

BLAST of Cp4.1LG18g03560 vs. NCBI nr
Match: KAG6589995.1 (WD-40 repeat-containing protein MSI4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1007 bits (2604), Expect = 0.0
Identity = 507/533 (95.12%), Postives = 507/533 (95.12%), Query Frame = 0

Query: 1   MDSPQSQQQQQ--QQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVD 60
           MDSPQSQQQQQ  QQ QQQQQ QPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVD
Sbjct: 1   MDSPQSQQQQQLQQQQQQQQQPQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVD 60

Query: 61  ERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTL 120
           ERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTL
Sbjct: 61  ERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTL 120

Query: 121 VIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTD 180
           VIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTD
Sbjct: 121 VIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTD 180

Query: 181 SPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEF 240
           SPDVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAEF
Sbjct: 181 SPDVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAEF 240

Query: 241 ALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTS 300
           ALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPA SKSPGSGGSIIKKAGEGTDKTS
Sbjct: 241 ALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAVSKSPGSGGSIIKKAGEGTDKTS 300

Query: 301 DGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHN 360
           DGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHN
Sbjct: 301 DGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHN 360

Query: 361 ADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD 420
           ADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD
Sbjct: 361 ADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPD 420

Query: 421 KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADP 480
           KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADP
Sbjct: 421 KSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADP 480

Query: 481 WTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           WTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 WTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 512

BLAST of Cp4.1LG18g03560 vs. NCBI nr
Match: KAG7023659.1 (WD-40 repeat-containing protein MSI4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1007 bits (2603), Expect = 0.0
Identity = 507/534 (94.94%), Postives = 507/534 (94.94%), Query Frame = 0

Query: 1   MDSPQSQQQQQ---QQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSV 60
           MDSPQSQQQQQ   QQ QQQQQ QPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSV
Sbjct: 1   MDSPQSQQQQQLQQQQQQQQQQPQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSV 60

Query: 61  DERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNT 120
           DERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNT
Sbjct: 61  DERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNT 120

Query: 121 LVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHT 180
           LVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHT
Sbjct: 121 LVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHT 180

Query: 181 DSPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAE 240
           DSPDVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAE
Sbjct: 181 DSPDVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAE 240

Query: 241 FALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKT 300
           FALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPA SKSPGSGGSIIKKAGEGTDKT
Sbjct: 241 FALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAVSKSPGSGGSIIKKAGEGTDKT 300

Query: 301 SDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAH 360
           SDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAH
Sbjct: 301 SDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAH 360

Query: 361 NADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP 420
           NADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP
Sbjct: 361 NADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP 420

Query: 421 DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAAD 480
           DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAAD
Sbjct: 421 DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAAD 480

Query: 481 PWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           PWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 PWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 513

BLAST of Cp4.1LG18g03560 vs. ExPASy TrEMBL
Match: A0A6J1H857 (WD-40 repeat-containing protein MSI4 OS=Cucurbita moschata OX=3662 GN=LOC111461417 PE=3 SV=1)

HSP 1 Score: 1012 bits (2616), Expect = 0.0
Identity = 507/531 (95.48%), Postives = 507/531 (95.48%), Query Frame = 0

Query: 1   MDSPQSQQQQQQQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER 60
           MDSPQSQQQQQ Q QQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER
Sbjct: 1   MDSPQSQQQQQLQQQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER 60

Query: 61  YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI 120
           YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI
Sbjct: 61  YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI 120

Query: 121 ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP 180
           ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP
Sbjct: 121 ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP 180

Query: 181 DVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFAL 240
           DVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAEFAL
Sbjct: 181 DVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAEFAL 240

Query: 241 AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDG 300
           AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPA SKSPGSGGSIIKKAGEGTDKTSDG
Sbjct: 241 AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAVSKSPGSGGSIIKKAGEGTDKTSDG 300

Query: 301 PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD 360
           PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD
Sbjct: 301 PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD 360

Query: 361 LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS 420
           LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS
Sbjct: 361 LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS 420

Query: 421 SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT 480
           SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT
Sbjct: 421 SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT 480

Query: 481 VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 510

BLAST of Cp4.1LG18g03560 vs. ExPASy TrEMBL
Match: A0A6J1JHL6 (WD-40 repeat-containing protein MSI4 OS=Cucurbita maxima OX=3661 GN=LOC111485173 PE=3 SV=1)

HSP 1 Score: 1008 bits (2606), Expect = 0.0
Identity = 507/532 (95.30%), Postives = 507/532 (95.30%), Query Frame = 0

Query: 1   MDSPQSQQQQQ-QQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDE 60
           MDSPQSQQQQQ QQ QQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQ PSVDE
Sbjct: 1   MDSPQSQQQQQLQQQQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQHPSVDE 60

Query: 61  RYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLV 120
           RYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLV
Sbjct: 61  RYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLV 120

Query: 121 IANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDS 180
           IANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDS
Sbjct: 121 IANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDS 180

Query: 181 PDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFA 240
           PDVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAEFA
Sbjct: 181 PDVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAEFA 240

Query: 241 LAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSD 300
           LAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPA SKSPGSGGSIIKKAGEGTDKTSD
Sbjct: 241 LAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAVSKSPGSGGSIIKKAGEGTDKTSD 300

Query: 301 GPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNA 360
           GPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNA
Sbjct: 301 GPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNA 360

Query: 361 DLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDK 420
           DLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDK
Sbjct: 361 DLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDK 420

Query: 421 SSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPW 480
           SSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPW
Sbjct: 421 SSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPW 480

Query: 481 TVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           TVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 TVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 511

BLAST of Cp4.1LG18g03560 vs. ExPASy TrEMBL
Match: A0A6J1HUE9 (WD-40 repeat-containing protein MSI4 OS=Cucurbita maxima OX=3661 GN=LOC111466808 PE=3 SV=1)

HSP 1 Score: 986 bits (2548), Expect = 0.0
Identity = 495/534 (92.70%), Postives = 504/534 (94.38%), Query Frame = 0

Query: 1   MDSPQSQQQQQQQLQQQQ---QQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSV 60
           MDS QSQQQQQQQ QQQQ   QQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKA QQPSV
Sbjct: 1   MDSSQSQQQQQQQPQQQQPQLQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAQQQPSV 60

Query: 61  DERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNT 120
           DERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNT
Sbjct: 61  DERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNT 120

Query: 121 LVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHT 180
           LVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQN+RIVATHT
Sbjct: 121 LVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHT 180

Query: 181 DSPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAE 240
           DSPDVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAE
Sbjct: 181 DSPDVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAE 240

Query: 241 FALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKT 300
           FALAMCPTEPYVLSGGKDKLVVLWSIQDHITT+A+D AASKSPGSGGSIIKKAGE  DKT
Sbjct: 241 FALAMCPTEPYVLSGGKDKLVVLWSIQDHITTAASDAAASKSPGSGGSIIKKAGEANDKT 300

Query: 301 SDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAH 360
           ++GPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTG+SPAVKVEKAH
Sbjct: 301 AEGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGTSPAVKVEKAH 360

Query: 361 NADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP 420
           NADLHCVDWNPHDDNLIITGSADNSIR+FDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP
Sbjct: 361 NADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSP 420

Query: 421 DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAAD 480
           DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNA+D
Sbjct: 421 DKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASD 480

Query: 481 PWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           PWT+VSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 PWTLVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 513

BLAST of Cp4.1LG18g03560 vs. ExPASy TrEMBL
Match: A0A6J1D211 (WD-40 repeat-containing protein MSI4-like OS=Momordica charantia OX=3673 GN=LOC111016569 PE=3 SV=1)

HSP 1 Score: 979 bits (2530), Expect = 0.0
Identity = 490/531 (92.28%), Postives = 495/531 (93.22%), Query Frame = 0

Query: 1   MDSPQSQQQQQQQLQQQQQQQPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDER 60
           MDS Q QQQQQQ    QQQ QPVVKKKETRGRKPKPK+EKKDEQ AKKMKA  QPSVDER
Sbjct: 1   MDSSQPQQQQQQPQPPQQQPQPVVKKKETRGRKPKPKDEKKDEQLAKKMKAQHQPSVDER 60

Query: 61  YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI 120
           YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI
Sbjct: 61  YTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVI 120

Query: 121 ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSP 180
           ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQN+RIVATHTDSP
Sbjct: 121 ANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATHTDSP 180

Query: 181 DVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFAL 240
           DVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENAEFAL
Sbjct: 181 DVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENAEFAL 240

Query: 241 AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDG 300
           AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGE  DK SDG
Sbjct: 241 AMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGETNDKASDG 300

Query: 301 PSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNAD 360
           PS+GPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDAR GS PAVKVEKAHNAD
Sbjct: 301 PSVGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARAGSGPAVKVEKAHNAD 360

Query: 361 LHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS 420
           LHCVDWNPHDDNLIITGSADNSIR+FDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS
Sbjct: 361 LHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKS 420

Query: 421 SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWT 480
           SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNA+DPWT
Sbjct: 421 SVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNASDPWT 480

Query: 481 VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 VVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 510

BLAST of Cp4.1LG18g03560 vs. ExPASy TrEMBL
Match: A0A6J1F4J3 (WD-40 repeat-containing protein MSI4 OS=Cucurbita moschata OX=3662 GN=LOC111440528 PE=3 SV=1)

HSP 1 Score: 977 bits (2525), Expect = 0.0
Identity = 491/535 (91.78%), Postives = 501/535 (93.64%), Query Frame = 0

Query: 1   MDSPQSQQQQQQQLQQQQQQ----QPVVKKKETRGRKPKPKEEKKDEQQAKKMKAHQQPS 60
           MDS QSQQQQQQQ Q QQQQ    Q  VKKKETRGRKPKPK+EKKDEQQAKKMKA QQPS
Sbjct: 1   MDSSQSQQQQQQQQQPQQQQPQLQQQPVKKKETRGRKPKPKDEKKDEQQAKKMKAQQQPS 60

Query: 61  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPN 120
           VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPN
Sbjct: 61  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPN 120

Query: 121 TLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATH 180
           TLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQN+RIVATH
Sbjct: 121 TLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNARIVATH 180

Query: 181 TDSPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENA 240
           TDSPDVLIWDVEAQPNRHAVLGATNSRPDL                     ILTGHQENA
Sbjct: 181 TDSPDVLIWDVEAQPNRHAVLGATNSRPDL---------------------ILTGHQENA 240

Query: 241 EFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDK 300
           EFALAMCPTEPYVLSGGKDKLVVLWSIQDHITT+A+D AASKSPGSGGSIIKKAGE  DK
Sbjct: 241 EFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTAASDAAASKSPGSGGSIIKKAGEANDK 300

Query: 301 TSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKA 360
           T++GPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTG+SPAVKVEKA
Sbjct: 301 TAEGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGTSPAVKVEKA 360

Query: 361 HNADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWS 420
           HNADLHCVDWNPHDDNLIITGSADNSIR+FDRRNLTSNGVGSPIYKFEGHKAAVLCVQWS
Sbjct: 361 HNADLHCVDWNPHDDNLIITGSADNSIRLFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWS 420

Query: 421 PDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAA 480
           PDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNA+
Sbjct: 421 PDKSSVFGSSAEDGLLNIWDYDKVGKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAS 480

Query: 481 DPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 531
           DPWT+VSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP
Sbjct: 481 DPWTLVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDEVLAELEKFKSHVIECAAKP 514

BLAST of Cp4.1LG18g03560 vs. TAIR 10
Match: AT2G19520.1 (Transducin family protein / WD-40 repeat family protein )

HSP 1 Score: 837.8 bits (2163), Expect = 4.8e-243
Identity = 406/511 (79.45%), Postives = 443/511 (86.69%), Query Frame = 0

Query: 30  RGRKPKPKEEKK----DEQQAKKM-----KAHQQPSVDERYTQWKSLVPVLYDWFANHNL 89
           RGRKPK KE+ +     +Q   KM     K  Q PSVDE+Y+QWK LVP+LYDW ANHNL
Sbjct: 28  RGRKPKTKEDSQTPSSQQQSDVKMKESGKKTQQSPSVDEKYSQWKGLVPILYDWLANHNL 87

Query: 90  VWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN 149
           VWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN
Sbjct: 88  VWPSLSCRWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFN 147

Query: 150 EEARSPFVKKYKTIIHPGEVNRIRELPQNSRIVATHTDSPDVLIWDVEAQPNRHAVLGAT 209
           EEARSPFVKKYKTIIHPGEVNRIRELPQNS+IVATHTDSPDVLIWDVE QPNRHAVLGA 
Sbjct: 148 EEARSPFVKKYKTIIHPGEVNRIRELPQNSKIVATHTDSPDVLIWDVETQPNRHAVLGAA 207

Query: 210 NSRPDLLMFLYLPSGYNHGGIHNIDVEILTGHQENAEFALAMCPTEPYVLSGGKDKLVVL 269
           NSRPDL                     ILTGHQ+NAEFALAMCPTEP+VLSGGKDK VVL
Sbjct: 208 NSRPDL---------------------ILTGHQDNAEFALAMCPTEPFVLSGGKDKSVVL 267

Query: 270 WSIQDHITTSATDPAASKSPGSGGSIIKKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVT 329
           WSIQDHITT  TD  +S      GSIIK+ GEGTDK ++ P++GPRGVYHGHEDTVEDV 
Sbjct: 268 WSIQDHITTIGTDSKSS------GSIIKQTGEGTDK-NESPTVGPRGVYHGHEDTVEDVA 327

Query: 330 FCPSNAQEFCSVGDDSCLILWDARTGSSPAVKVEKAHNADLHCVDWNPHDDNLIITGSAD 389
           F P++AQEFCSVGDDSCLILWDARTG++P  KVEKAH+ADLHCVDWNPHDDNLI+TGSAD
Sbjct: 328 FSPTSAQEFCSVGDDSCLILWDARTGTNPVTKVEKAHDADLHCVDWNPHDDNLILTGSAD 387

Query: 390 NSIRMFDRRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKV 449
           N++R+FDRR LT+NGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYD+V
Sbjct: 388 NTVRLFDRRKLTANGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDRV 447

Query: 450 GKKTERATRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIW 509
            KK++RA ++PA   GLFFQHAGHRDKVVDFHWNA+DPWT+VSVSDDC+TTGGGGTLQIW
Sbjct: 448 SKKSDRAAKSPA---GLFFQHAGHRDKVVDFHWNASDPWTIVSVSDDCETTGGGGTLQIW 507

Query: 510 RMSDLIYRPEDEVLAELEKFKSHVIECAAKP 532
           RMSDLIYRPE+EV+AELEKFKSHV+ CA+KP
Sbjct: 508 RMSDLIYRPEEEVVAELEKFKSHVMTCASKP 507

BLAST of Cp4.1LG18g03560 vs. TAIR 10
Match: AT4G29730.1 (nucleosome/chromatin assembly factor group C5 )

HSP 1 Score: 735.3 bits (1897), Expect = 3.4e-212
Identity = 360/503 (71.57%), Postives = 413/503 (82.11%), Query Frame = 0

Query: 28  ETRGRKPKPKEEKKDEQQAKKMKAHQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSC 87
           + R RKPK   E    Q    ++  Q+ +VD+ Y+QWK+L+P+LYD F NH LVWPSLSC
Sbjct: 28  DKRRRKPKSNNE---SQLPFLLQQSQKATVDDTYSQWKTLLPILYDSFVNHTLVWPSLSC 87

Query: 88  RWGPQLEQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPF 147
           RWGPQLEQA  K  QRLYLSEQT+GSVPNTLVIANCE V           Q NE+A SPF
Sbjct: 88  RWGPQLEQAGSKT-QRLYLSEQTNGSVPNTLVIANCETVN---------RQLNEKAHSPF 147

Query: 148 VKKYKTIIHPGEVNRIRELPQNSRIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLL 207
           VKKYKTIIHPGEVNRIRELPQNS+IVATHTDSPD+LIW+ E QP+R+AVLGA +SRPDLL
Sbjct: 148 VKKYKTIIHPGEVNRIRELPQNSKIVATHTDSPDILIWNTETQPDRYAVLGAPDSRPDLL 207

Query: 208 MFLYLPSGYNHGGIHNIDVEILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHI 267
                                L GHQ++AEFALAMCPTEP+VLSGGKDK V+LW+IQDHI
Sbjct: 208 ---------------------LIGHQDDAEFALAMCPTEPFVLSGGKDKSVILWNIQDHI 267

Query: 268 TTSATDPAASKSPGSGGSIIKKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQ 327
           T + +D   SKSPGS     K+ GEG+DKT  GPS+GPRG+Y+GH+DTVEDV FCPS+AQ
Sbjct: 268 TMAGSD---SKSPGSS---FKQTGEGSDKTG-GPSVGPRGIYNGHKDTVEDVAFCPSSAQ 327

Query: 328 EFCSVGDDSCLILWDARTGSSPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFD 387
           EFCSVGDDSCL+LWDARTG+SPA+KVEKAH+ADLHCVDWNPHD+NLI+TGSADN++R+FD
Sbjct: 328 EFCSVGDDSCLMLWDARTGTSPAMKVEKAHDADLHCVDWNPHDNNLILTGSADNTVRVFD 387

Query: 388 RRNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERA 447
           RRNLTSNGVGSP+YKFEGH+AAVLCVQWSPDKSSVFGSSAEDGLLNIWD D+VGKK+ERA
Sbjct: 388 RRNLTSNGVGSPVYKFEGHRAAVLCVQWSPDKSSVFGSSAEDGLLNIWDCDRVGKKSERA 447

Query: 448 TRTPAAPPGLFFQHAGHRDKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIY 507
           T+T   P GLFFQHAGHRDKVVDFHW+  +PWT+VSVSD+C++ GGGGTLQIWRMSDLIY
Sbjct: 448 TKT---PDGLFFQHAGHRDKVVDFHWSLLNPWTIVSVSDNCESIGGGGTLQIWRMSDLIY 486

Query: 508 RPEDEVLAELEKFKSHVIECAAK 531
           RPEDEVL ELEKFKSHV  C +K
Sbjct: 508 RPEDEVLTELEKFKSHVFTCTSK 486

BLAST of Cp4.1LG18g03560 vs. TAIR 10
Match: AT2G16780.1 (Transducin family protein / WD-40 repeat family protein )

HSP 1 Score: 216.5 bits (550), Expect = 5.3e-56
Identity = 139/463 (30.02%), Postives = 215/463 (46.44%), Query Frame = 0

Query: 57  VDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQA----TYKNRQRLYLSEQTDG 116
           V+E ++ WK   P LYD   +H L WPSL+  W P         +Y    +L L   T G
Sbjct: 14  VEEDFSVWKKNTPFLYDLLISHPLEWPSLTVHWVPSTPNPYVADSYFGVHKLILGTHTSG 73

Query: 117 SVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKTIIHPGEVNRIRELPQNSRI 176
           S  + L++A  +VV P   A   I   N++   P V+  + I   GEVNR R +PQ   +
Sbjct: 74  SAQDFLMVA--DVVTPTPNAEPGIGGANQDPFIPKVEIRQRIRVDGEVNRARCMPQKPTL 133

Query: 177 VATHTDSPDVLIWDVEAQPNRHAVLGATNS-RPDLLMFLYLPSGYNHGGIHNIDVEILTG 236
           V   T   +V ++D      +HA    T+   PDL                      L G
Sbjct: 134 VGAKTSGCEVFLFDYA----KHAAKSQTSECDPDLR---------------------LVG 193

Query: 237 HQENAEFALAMCP-TEPYVLSGGKDKLVVLWSIQDHITTSATDPAASKSPGSGGSIIKKA 296
           H +   + L+  P  E Y+LSG +D+ + LW +      SAT                  
Sbjct: 194 HDKEG-YGLSWSPFKEGYLLSGSQDQKICLWDV------SATP----------------- 253

Query: 297 GEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVGDDSCLILWDARTGSSPA 356
               DK      +    VY GHE  + DV++   N   F S G+D  L++WD RT     
Sbjct: 254 ---QDKV-----LNAMFVYEGHESAIADVSWHMKNENLFGSAGEDGRLVIWDTRTNQMQ- 313

Query: 357 VKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTSNGVGSPIYKFEGHKAAV 416
               K H  +++ + +NP ++ ++ T S+D+++ +FD R L      +P++    H+  V
Sbjct: 314 -HQVKVHEREVNYLSFNPFNEWVLATASSDSTVALFDLRKL-----NAPLHVMSSHEGEV 373

Query: 417 LCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG-KKTERATRTPAAPPGLFFQHAGHRDKVV 476
             V+W P+  +V  SS ED  L +WD ++VG ++ E        PP L F H GH+ K+ 
Sbjct: 374 FQVEWDPNHETVLASSGEDRRLMVWDLNRVGEEQLEIELDAEDGPPELLFSHGGHKAKIS 403

Query: 477 DFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDE 513
           DF WN  +PW + SV++D        +LQ+W+M++ IYR E++
Sbjct: 434 DFAWNKNEPWVIASVAED-------NSLQVWQMAESIYRDEED 403

BLAST of Cp4.1LG18g03560 vs. TAIR 10
Match: AT4G35050.1 (Transducin family protein / WD-40 repeat family protein )

HSP 1 Score: 211.8 bits (538), Expect = 1.3e-54
Identity = 142/480 (29.58%), Postives = 217/480 (45.21%), Query Frame = 0

Query: 38  EEKKDEQQAKKMKAHQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQ----L 97
           EE KDE    +        V+E ++ WK   P LYD   +H L WPSL+  W P      
Sbjct: 4   EEGKDEAGLDQ--------VEEEFSIWKRNTPFLYDLMISHPLEWPSLTLHWVPSTPIPY 63

Query: 98  EQATYKNRQRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPFVKKYKT 157
            +  Y    +L L   T G   + L++A  +VV P   A   +   ++E   P V+  + 
Sbjct: 64  SKDPYFAVHKLILGTHTSGGAQDFLMVA--DVVIPTPDAEPGLGGRDQEPIVPKVEIKQK 123

Query: 158 IIHPGEVNRIRELPQNSRIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLLMFLYLP 217
           I   GEVNR R +PQ   +V   T   +V ++D      +      +   PDL       
Sbjct: 124 IRVDGEVNRARCMPQKPTLVGAKTSGSEVFLFDYARLSGKPQ---TSECDPDLR------ 183

Query: 218 SGYNHGGIHNIDVEILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITTSATD 277
                          L GH++           E Y+LSG +D+ + LW +      SAT 
Sbjct: 184 ---------------LMGHEQEGYGLAWSSFKEGYLLSGSQDQRICLWDV------SAT- 243

Query: 278 PAASKSPGSGGSIIKKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEFCSVG 337
                               TDK      + P  VY GH+  +EDV +   N   F S G
Sbjct: 244 -------------------ATDKV-----LNPMHVYEGHQSIIEDVAWHMKNENIFGSAG 303

Query: 338 DDSCLILWDARTGSSPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFDRRNLTS 397
           DD  L++WD RT         K H  +++ + +NP ++ ++ T S+D+++ +FD R LT 
Sbjct: 304 DDCQLVIWDLRTNQMQ--HQVKVHEREINYLSFNPFNEWVLATASSDSTVALFDLRKLT- 363

Query: 398 NGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVG-KKTERATRTPA 457
               +P++    H+  V  V+W P+  +V  SS ED  L +WD ++VG ++ E       
Sbjct: 364 ----APLHVLSKHEGEVFQVEWDPNHETVLASSGEDRRLMVWDINRVGDEQLEIELDAED 404

Query: 458 APPGLFFQHAGHRDKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYRPEDE 513
            PP L F H GH+ K+ DF WN  +PW + SV++D        +LQ+W+M++ IYR +DE
Sbjct: 424 GPPELLFSHGGHKAKISDFAWNKDEPWVISSVAED-------NSLQVWQMAESIYREDDE 404

BLAST of Cp4.1LG18g03560 vs. TAIR 10
Match: AT5G58230.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 211.5 bits (537), Expect = 1.7e-54
Identity = 142/488 (29.10%), Postives = 226/488 (46.31%), Query Frame = 0

Query: 41  KDEQQAKKMKAHQQPSVDERYTQWKSLVPVLYDWFANHNLVWPSLSCRWGPQLEQATYKN 100
           KDE++ +     ++  ++E Y  WK   P LYD    H L WPSL+  W P  E+ + K+
Sbjct: 3   KDEEEMR--GEIEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPSGKD 62

Query: 101 R--QRLYLSEQTDGSVPNTLVIANCEVVKPRVAAAEHISQFNEEARSPF---------VK 160
              Q++ L   T  S PN L++A  +V  P         Q++++ RS F         V+
Sbjct: 63  YSVQKMILGTHTSESEPNYLMLA--QVQLPLDDTESEARQYDDD-RSEFGGFGCATGKVQ 122

Query: 161 KYKTIIHPGEVNRIRELPQNSRIVATHTDSPDVLIWDVEAQPNRHAVLGATNSRPDLLMF 220
             + I H GEVNR R +PQN  I+AT T + +V ++D    P++  + GA N  PDL + 
Sbjct: 123 IIQQINHDGEVNRARYMPQNPFIIATKTVNAEVYVFDYSKHPSKPPLDGACN--PDLKLR 182

Query: 221 LYLPSGYNHGGIHNIDVEILTGHQENAEFALAMCPTEPYVLSGGKDKLVVLWSIQDHITT 280
            +   GY          +   GH                +LSG  D  + LW I      
Sbjct: 183 GHSSEGYGLSW-----SKFKQGH----------------LLSGSDDAQICLWDI------ 242

Query: 281 SATDPAASKSPGSGGSIIKKAGEGTDKTSDGPSIGPRGVYHGHEDTVEDVTFCPSNAQEF 340
                                    + T    S+  + ++  HE  VEDV +   +   F
Sbjct: 243 -------------------------NATPKNKSLDAQQIFKAHEGVVEDVAWHLRHEYLF 302

Query: 341 CSVGDDSCLILWDARTGS-SPAVKVEKAHNADLHCVDWNPHDDNLIITGSADNSIRMFDR 400
            SVGDD  L++WD R+ S S  V+   AH+ +++C+ +NP ++ ++ TGS D ++++FD 
Sbjct: 303 GSVGDDQYLLIWDLRSPSASKPVQSVVAHSMEVNCLAFNPFNEWVVATGSTDKTVKLFDL 362

Query: 401 RNLTSNGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWDYDKVGKKTERAT 460
           R L+     + ++ F+ HK  V  V W+P   ++  S      L +WD  ++ ++ +   
Sbjct: 363 RKLS-----TALHTFDSHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEE-QTVE 418

Query: 461 RTPAAPPGLFFQHAGHRDKVVDFHWNAADPWTVVSVSDDCDTTGGGGTLQIWRMSDLIYR 517
                PP L F H GH  K+ DF WN  + W + SV++D         LQIW+M++ IY 
Sbjct: 423 DAEDGPPELLFIHGGHTSKISDFSWNPCEDWVISSVAED-------NILQIWQMAENIYH 418

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
O226076.8e-24279.45WD-40 repeat-containing protein MSI4 OS=Arabidopsis thaliana OX=3702 GN=MSI4 PE=... [more]
Q9SU784.8e-21171.57WD-40 repeat-containing protein MSI5 OS=Arabidopsis thaliana OX=3702 GN=MSI5 PE=... [more]
Q9I8G96.1e-5731.91Histone-binding protein RBBP7 OS=Gallus gallus OX=9031 GN=RBBP7 PE=1 SV=1[more]
Q8AVH16.1e-5731.97Histone-binding protein RBBP7 OS=Xenopus laevis OX=8355 GN=rbbp7 PE=2 SV=1[more]
Q7ZTY47.9e-5731.61Histone-binding protein RBBP7 OS=Danio rerio OX=7955 GN=rbbp7 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
XP_023516978.10.096.05WD-40 repeat-containing protein MSI4 [Cucurbita pepo subsp. pepo][more]
XP_022960697.10.095.48WD-40 repeat-containing protein MSI4 [Cucurbita moschata][more]
XP_022987695.10.095.30WD-40 repeat-containing protein MSI4 [Cucurbita maxima][more]
KAG6589995.10.095.12WD-40 repeat-containing protein MSI4, partial [Cucurbita argyrosperma subsp. sor... [more]
KAG7023659.10.094.94WD-40 repeat-containing protein MSI4 [Cucurbita argyrosperma subsp. argyrosperma... [more]
Match NameE-valueIdentityDescription
A0A6J1H8570.095.48WD-40 repeat-containing protein MSI4 OS=Cucurbita moschata OX=3662 GN=LOC1114614... [more]
A0A6J1JHL60.095.30WD-40 repeat-containing protein MSI4 OS=Cucurbita maxima OX=3661 GN=LOC111485173... [more]
A0A6J1HUE90.092.70WD-40 repeat-containing protein MSI4 OS=Cucurbita maxima OX=3661 GN=LOC111466808... [more]
A0A6J1D2110.092.28WD-40 repeat-containing protein MSI4-like OS=Momordica charantia OX=3673 GN=LOC1... [more]
A0A6J1F4J30.091.78WD-40 repeat-containing protein MSI4 OS=Cucurbita moschata OX=3662 GN=LOC1114405... [more]
Match NameE-valueIdentityDescription
AT2G19520.14.8e-24379.45Transducin family protein / WD-40 repeat family protein [more]
AT4G29730.13.4e-21271.57nucleosome/chromatin assembly factor group C5 [more]
AT2G16780.15.3e-5630.02Transducin family protein / WD-40 repeat family protein [more]
AT4G35050.11.3e-5429.58Transducin family protein / WD-40 repeat family protein [more]
AT5G58230.11.7e-5429.10Transducin/WD40 repeat-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 454..501
e-value: 10.0
score: 10.6
coord: 347..387
e-value: 1.8E-6
score: 37.5
coord: 147..186
e-value: 370.0
score: 0.8
coord: 222..262
e-value: 0.75
score: 17.8
coord: 302..342
e-value: 2.0E-5
score: 34.0
coord: 396..436
e-value: 4.5E-8
score: 42.8
IPR001680WD40 repeatPFAMPF00400WD40coord: 307..342
e-value: 0.069
score: 14.1
coord: 397..436
e-value: 1.2E-5
score: 25.9
coord: 228..262
e-value: 0.19
score: 12.7
coord: 355..387
e-value: 0.0014
score: 19.4
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 309..351
score: 10.742378
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 403..436
score: 12.814306
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 354..387
score: 10.441614
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 33..519
e-value: 8.5E-113
score: 379.6
IPR022052Histone-binding protein RBBP4, N-terminalPFAMPF12265CAF1C_H4-bdcoord: 59..124
e-value: 5.5E-17
score: 61.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..58
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..24
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 270..310
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 32..55
NoneNo IPR availablePANTHERPTHR22850WD40 REPEAT FAMILYcoord: 228..531
coord: 38..208
NoneNo IPR availablePANTHERPTHR22850:SF202WD-40 REPEAT-CONTAINING PROTEIN MSI4coord: 228..531
coord: 38..208
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 403..436
score: 10.653776
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 354..387
score: 8.781968
IPR036322WD40-repeat-containing domain superfamilySUPERFAMILY50978WD40 repeat-likecoord: 146..519

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG18g03560.1Cp4.1LG18g03560.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016573 histone acetylation
biological_process GO:0010468 regulation of gene expression
cellular_component GO:0005634 nucleus
molecular_function GO:0004402 histone acetyltransferase activity
molecular_function GO:0042393 histone binding
molecular_function GO:0005515 protein binding