Cp4.1LG20g05030 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG20g05030
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionSET and MYND domain-containing protein 4 isoform X1
LocationCp4.1LG20: 2967726 .. 2974131 (+)
RNA-Seq ExpressionCp4.1LG20g05030
SyntenyCp4.1LG20g05030
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAGACACAAGGAAAGGTCGACTCCGTGACAGTGTTCATTCTTGGGAGCGTACATTATGGAGAAGCTGAAGTCACTGGTGCCGGAGAACTTGAAGCAGACGGTGGGTTCAAGCACCGTCGATGATCTTCCCTCATCGTGTTCTTTCTTATTACGCCTTTTTCAGCAATCCCAGCTCTTCTTCCAAGTAAGCTTGGTTGCAAACGGCCTCGTATTTCTCTTCCATTTCTCTCTGGAAAAGTAGGTAAATTAGCATATGGGTTACGCTTCCAATGTAACTCTGGAGGAATATGGGCAAATTTTAGATGATGATGGGTTTTGTCTCCAACGTCACAGGTCATCGGGGATTTGGCAATGGACCCTGAAAATGCTCTCTGTGGTAAGAAAAAGGACGCTGCTCTGGAGTTGAAGCGCCAGGGAAATCAATGCTTCTTGAAGGGGGATTATGCTCCTGCGTTGGTTTATTATTCCCAGGTATCCAAGTTTCATTTATTTTCCAGTTTGTGCGGAAAACTGGTTTCTCTTGTGCCGATTCACAGTGGGTTAGTATTAGTTTTATAGATTTCAGGATTGACCTACAACTTCAACTTACAATGAAGGATGAAAATTAGTATTATTCAAATGCTTTATTCCAGGATTGTTTTAGATTACAACAGTACACATCACTAAGTTTCCCTTATAACGTCCACAGGCACTGCAAGTGGCTCCGATGAATGCTGTTGACATGGATAAGAATTTGGTTGCAACCTTATATGTGAATCGAGCATCAGTTTTGCTTGTGAGTTCGGTCCTTCGGTTGATTTATATTTAATGTAAAACTATAATAAAAGAGAAAAAGTAATAAATTATTGCAAATTGATGTATGAACTGAGTGATTTAGGTTCCTCCATTGACACTAATCCATCTGCTTTATGCTTCCTTCAGAAAATGGATCTGCAATTGGAGTGTTTACGTGATTGCAATAGAGCACTTCAAATTTCATCAAACTATGCAAAGGTGAAATTAATTTCATATCATATTATGTTTGTGGATACTCCTTATAATCCACTTGTTGTTCGATCTTCCCTCATTGCCATGAGCTATGTCTTGTTTTTTTTTTTTTTTCTTGTTCTATAGGCATGGTATAGAAGAGGTAAAGCAAATGCTAGTATGGGAAATTTTCATGATGCTATCCGTGACTTTCAAATGTCTAAGAGTGTGGAGGTATCATTCAATGGAAAGAAACAGGTAGACGACGAGTTGAAGATCATCCAACGTCAGCACAAGAGGTCAAATACAGTACTGGAACATAGCAACAACAACAAATTAGACGATTTTGGTATGTCTCATGTTTAATTTTATTGAATAACAACTCAAGACTTTCATGTTATATTCTCTGGGCTTGGAGGATCTCCTCTTTTTCATCTTTAATTGCTAAAAGGGTTTGGTGGGAAAAGCTTAATGTTTCTTCTCTTCTAGTCTGTTCTTAATATTTTGATAGTCAATGAGTTCATTTCCCAAAGAAATGATACAAGCTATGTAGTAGTTTAAATACCCTAAAATTGTTAGTGGTTTTCTCTAGTAGGTTTGTATTGTATTGGTTAGCCGATATTGTCCTCTTTGGACTTTTCCTTCTAGGCTTCCCCTCAAGGTTTTATAATGCGTCTATTAGGGTGAGGTTTCTACACCCTTATAAGGAATGCTTTGTTCTCCTCTCCAACCAATGTGGGATCTCACAATCCATCTCCCTTGGAGGCCCAACATCCTCGCTAGCACACCGTCCGGTGTTTGTCTCTGATACCATTTGTAACAGCTCAAGCCCACCACTAGCAGATATTGTCCTCTTTGGGCTTTCTCTCAAGGTTTTGAAACACGTCTACTAGGGAGAGGTTTCCACACCCTTTTATGGAATGCTTTATTCTGCTCTCCAACCAACGTGGGATCTCACAATTTGGGTTATTTTGGGATATATGGGGTGATTAGAGAGCTCCCTTCCTCTCCCTTTCTCATTTAGAGAACAAGTACAATTTTCCAAAACGATGCCTCCTGCTAGTCTCTATCTTCCTTCTATATTCCCTCTCTGTCAACTCCCTAACTAGCTAATATGGTTGGGATCTGCATAAGTCATTTCATCATCCACACCCTCAATCTTTCTCCCTTCTTATAACAGTCCAAGCCTACCGCTAGCAAATATTGTCCGCTTTGACCCGTTACGTATTGCCGTCAACCTCACGGTTTTAAAACGTGTTTGTTAGGGAGAGGTTTCCACACCCTTATAAAGAATGCTTCGTTCCCCTCTCCAACCGATGTGGGATCTCACAATAGTCATTTCATCATCCACATCCTCAATCTTTCTCCCTTCCTCTGGCTACAAGTACATGCACGTCAGCGGAGGCTTATCAAACTTCCAACAGGAAAACCAAATAAATAAAATGTTTTGATGTCTATTGGAGCTGGTACTGGGATGCGGGAAGAGAGCAGATTTTTAAAAGATGACATATGAACAATGTTTTGTTTAATTAATTTTTCGTTTTAAAAACATAAAATAGTCTTGAAAAACAACCTCCAAGCAGCCCCCTAATGTTCTGGTTACAGGATCTTTTTTGGTTGGGGGGATCATTTATCAGAGTGCAATAGGAAGAGGATTTTCAGGCTGTTTACGAAGAGCCCACAAAATACAGGCTTAAATTACTTCTTAAAAACATAGGTATTCGTTACTCGGTTACTTGTTATAAACTAGCATTATATGTGTTACAGATGAGCCAATTCAAGTAAAATTACATGTCACCACGTCGAATAAAGGTAGAGGAATGGTTTCACCCATTGAGATACCTCCATCATCCTTGGTCCATGTTGAAGAACCTTATGCCTTGGTGAGTACATATTTATCTTCTAAATCATGGGGAACGCATTCTATTGTCCATACTGTTTCTTTCACCTTTTTCTTTGTCAAATTCTCTCAACTTTACACTTTATACTGCCTCCAAGCCTGGACACTATATTTTCACGAGAAGTTATTCTGTATACGGTCTAAAGGGTTTGCAACCCAGGGCTGGTTGACAACAATCTAATCAGTTTTTCTGACTTAAAGTCTTATAATATTTACTTTGCTTTTGACTGGCAGGTAATATTGAAGCATTGTAGAGAAACTCACTGCCATTACTGCTTGAATGAGCTACCAGCAGATAAAGTACCCTGTCCATCATGCTCGATTCCTCTGTACTGCTCACAACGTTGCCAAATACAAGCCGGGGGACGAATGTTACAAAACGTTCCAGATAATAAAGAGATTTTAAAAGATCTATCTGATGACCTCAGAAAGTATGTTCAAGAAATAACTTTGCCCAGTTTTGCTGACTTAAGGACTGATGATGTTCCTGAACATAAACATGAATGTGATGGTGTGCACTGGCCTGTAATTTTGCCATCTGAAATAGTTTTGGCTGGGCGAATAGTGGCTAAATTTGTAGGACAGGGAGGTGTCTTTGCAGATGCTTCTAACCTTGTGGATATGTTGGTACTTCTGATTACTATATATCCCTACTAATTGTATTTGATTTTTTTACCTTGAAAACTACCTCTATTCTTGGCTTCGCTCTCACGACATGGAGTTGAACCATGATCGACTGTGATCGTGAGAATTGCCTAGAGCTTCTTGGCCCACGTGTCCCGAACGAACTCAGTACTAAGCTTATTCCCAATGGTTTCCTGTAGAATCTTTCACACCATTTTTCGGAAATGCACGCTGACAGCAAGCTGGAGTGTATCATCTATTCCATTATATTATCAAGTTGTCTTCGGCAATTTTTCCCCTCTCAACTTCCAGTAAATGAGAACACTATCTCGCAGGTTGGTTGATTCTCTTACACTTCATCATCTGTTCTAGAAAGAATGAAAAAGGCCCAATTTACCTCAAAATAATTATGAAATGGATTATTTGCTTTATCAATGCAGATTGTCATACTTATATCCCAAATCAGAACAAATTCTATATCTATTGTCCGTATGAAATCCTTCGATGCACCGGGGTCACGAGATCAGTCTGGAAGATTATCTAGCGTGGTTCCTTTTACTTGTAATATGGAACAAGTAAGCTCATCAGCCTGTATTTCTCTATGGTTTCAATTTTTCAATTGATTTAAATAGTAAATCCTTTGTAGTTGGCACTTCCTCATTGTGGTTTTTAATAGGTCAGAGTAGGTCAAGCTATTTATACAACTGGAAGCTTGTTTAACCATTCATGCAAACCGAACATCCATGCATATTTCAATTCACGTACCCTCTTTATTCGGACAACTGCGTCCGTGACAGTGGGGTGCCCCCTAGAGTTGTCATACGGTCCACAGGTTTGCCTATTCTCCAAACTTGGGTCCACATTTTCTAAACACGACCTTTGCTATCAATGTTCCTTTTAACTTTTCCTTCTAAAATATATATATGTAATAGTCCAAGCCCACCACTAGCCGATATTGTCCTCTTTCTAGCTAAATGTGCTTCTACTTCTCTGACAGGTTGGTCAATTGGACTGCAAAGACCGTCTTAAGTTGCTAGAGGATGAGTACTCTTTCAAATGTCAGTGTAGTGGTTGCTCAATGGTGCATATACCTGACCTTGTCCTCAATGCATTTTGTTGCATTAATTCAAGCTGCTGTGGCGTAGTCTTGGATAGATCCATCTTCAACTGTGAAAACAAGAAAACCAAGGACTATCTTACGGTCGACGAACAAAGTAGGCTTGAGCCTTTCATGCTGGTACTGTATCATTTTTTAATGCTTTTTAAGCACCTCTACCCAATTCTTAGTCCTTTCCTTCAACACAATGGTTTGAGATGTACAAAAATTGTATGCTTTTACCAAGTTTAAGAACGGTTTGAGCAAGGGACTAAATGGTGCTATTGCATTTCTATAGACTGACAGCTTCCTTCATGCTGGTCCTAGCCATTGTTTGAAGTGCGGATCTTATCGTAATATAAAATCATCTCGTTCGACTGTGGACGAGGCCTGGATTCACTTTACGAGGTAATTGCCTGATGCAATGTTGCCCAAGTCCGTTAGTGTTGGACCGTTGGTCGTATACTCGTTCTATAATTGCTCATAACAAGAGAATTCTTTCTTTCACTTGGGTTTGGATAACTGGCTGACAAATGTACACAGGTTGCAGCAGGAGATAAATTCAAATAGGGTATCCGAGACGACAGTCTCAGATGCTTTGAGAGCCCTGTGCTCACTGAAGTCTACATTGCATGCATATAATAAGCGTATAGCAGAAGTGAGTTCTCGTTTCGATATTGAACACTATATGCTGTCGTTTCCTTTTATGTGCTCAAATGGGTAGAGATACCTATGTCGAAACGATAGTACTGTGGACAATGCTTGTATCTTAGCTAAGCGATGCTTGTTCGGGTTAAGTTGGCTACAAATAATGCTATATGATTCACTTCTTGTAGGCTGAAGACAATCTGTCACAGGCCTTCTGTTTGCTTGGAAAACTCGAGCTGGCAGCGGACCATTGTAAAGCATCAATTCGGGTATGCTTTCTCTATGGACATATAATTGTACTTATAGGAAGTAAAACTTGTTTTAATATGCTCCACTTTTTTGTGACAACCATTCTAAAGATATAAAGTTCCATTTGGAATCTTAAAATTACATTATAAACGACTGCAACATACAACGCGCTACATCAGAGTGATGAGCCACCACGAGGGGTTACCTTTCGAACCAAATAGAAACTGAATTCTCACTGAATCTGCATGGTTTGTTTTGTTTGGAAAGACTTCGTTTTTGTCCGAATCATGAACGTTTCTTTTTTTCCTTCTCTGTTTTTGTGAGATCCCACATCGGTTGGAGAGGAGAACGAAACATTCTTTATAAGGATGTGAAAACCTCTCCCTAACCGACGCGTTTGAAAAACTTTGAGGGTAAGTCCAAAGAGACAATATCTGCTAGCAGTGGGCTTGAGTTGTTATAGTATTTGATTTGAGATAACTCCATCTCAACTATCTTCCTTGATTTTGTTCGGAGTAAATGAAACCTGAGCCGTACTTTCCCGCTTTTTGCAGATTCTAGAGAAGTTGTATGGCGAAAACCATATCACCATTGGCAACGAACTCTTGAAGCTGTCTTCCATTCTGTTGTCTGTGGGTGACTGCAATGGTGTGGAGTGCATTAAACGATTGAGTGAAATTTTCAGGTGTCATTATGGATGGCATGCCAACGCAATGTTCCCATTTTTGAACATCTTGGAGGAAGAAACTCACAAATTTGTCAGCACAGATGTTTGATCACAACGTGCATGGAACCCAAATTTGTCAACTGTTCGATCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAGACAGTTT

mRNA sequence

CAAGACACAAGGAAAGGTCGACTCCGTGACAGTGTTCATTCTTGGGAGCGTACATTATGGAGAAGCTGAAGTCACTGGTGCCGGAGAACTTGAAGCAGACGGTGGGTTCAAGCACCGTCGATGATCTTCCCTCATCGTGTTCTTTCTTATTACGCCTTTTTCAGCAATCCCAGCTCTTCTTCCAAGTCATCGGGGATTTGGCAATGGACCCTGAAAATGCTCTCTGTGGTAAGAAAAAGGACGCTGCTCTGGAGTTGAAGCGCCAGGGAAATCAATGCTTCTTGAAGGGGGATTATGCTCCTGCGTTGGTTTATTATTCCCAGGCACTGCAAGTGGCTCCGATGAATGCTGTTGACATGGATAAGAATTTGGTTGCAACCTTATATGTGAATCGAGCATCAGTTTTGCTTAAAATGGATCTGCAATTGGAGTGTTTACGTGATTGCAATAGAGCACTTCAAATTTCATCAAACTATGCAAAGGCATGGTATAGAAGAGGTAAAGCAAATGCTAGTATGGGAAATTTTCATGATGCTATCCGTGACTTTCAAATGTCTAAGAGTGTGGAGGTATCATTCAATGGAAAGAAACAGGTAGACGACGAGTTGAAGATCATCCAACGTCAGCACAAGAGGTCAAATACAGTACTGGAACATAGCAACAACAACAAATTAGACGATTTTGATGAGCCAATTCAAGTAAAATTACATGTCACCACGTCGAATAAAGGTAGAGGAATGGTTTCACCCATTGAGATACCTCCATCATCCTTGGTCCATGTTGAAGAACCTTATGCCTTGGTAATATTGAAGCATTGTAGAGAAACTCACTGCCATTACTGCTTGAATGAGCTACCAGCAGATAAAGTACCCTGTCCATCATGCTCGATTCCTCTGTACTGCTCACAACGTTGCCAAATACAAGCCGGGGGACGAATGTTACAAAACGTTCCAGATAATAAAGAGATTTTAAAAGATCTATCTGATGACCTCAGAAAGTATGTTCAAGAAATAACTTTGCCCAGTTTTGCTGACTTAAGGACTGATGATGTTCCTGAACATAAACATGAATGTGATGGTGTGCACTGGCCTGTAATTTTGCCATCTGAAATAGTTTTGGCTGGGCGAATAGTGGCTAAATTTGTAGGACAGGGAGGTGTCTTTGCAGATGCTTCTAACCTTGTGGATATTTGTCTTCGGCAATTTTTCCCCTCTCAACTTCCAGTAAATGAGAACACTATCTCGCAGATTGTCATACTTATATCCCAAATCAGAACAAATTCTATATCTATTGTCCGTATGAAATCCTTCGATGCACCGGGGTCACGAGATCAGTCTGGAAGATTATCTAGCGTGGTTCCTTTTACTTGTAATATGGAACAAGTCAGAGTAGGTCAAGCTATTTATACAACTGGAAGCTTGTTTAACCATTCATGCAAACCGAACATCCATGCATATTTCAATTCACGTACCCTCTTTATTCGGACAACTGCGTCCGTGACAGTGGGGTGCCCCCTAGAGTTGTCATACGGTCCACAGTGTAGTGGTTGCTCAATGGTGCATATACCTGACCTTGTCCTCAATGCATTTTGTTGCATTAATTCAAGCTGCTGTGGCGTAGTCTTGGATAGATCCATCTTCAACTGTGAAAACAAGAAAACCAAGGACTATCTTACGGTCGACGAACAAAGTAGGCTTGAGCCTTTCATGCTGACTGACAGCTTCCTTCATGCTGGTCCTAGCCATTGTTTGAAGTGCGGATCTTATCGTAATATAAAATCATCTCGTTCGACTGTGGACGAGGCCTGGATTCACTTTACGAGGTTGCAGCAGGAGATAAATTCAAATAGGGTATCCGAGACGACAGTCTCAGATGCTTTGAGAGCCCTGTGCTCACTGAAGTCTACATTGCATGCATATAATAAGCGTATAGCAGAAGCTGAAGACAATCTGTCACAGGCCTTCTGTTTGCTTGGAAAACTCGAGCTGGCAGCGGACCATTGTAAAGCATCAATTCGGATTCTAGAGAAGTTGTATGGCGAAAACCATATCACCATTGGCAACGAACTCTTGAAGCTGTCTTCCATTCTGTTGTCTGTGGGTGACTGCAATGGTGTGGAGTGCATTAAACGATTGAGTGAAATTTTCAGGTGTCATTATGGATGGCATGCCAACGCAATGTTCCCATTTTTGAACATCTTGGAGGAAGAAACTCACAAATTTGTCAGCACAGATGTTTGATCACAACGTGCATGGAACCCAAATTTGTCAACTGTTCGATCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAGACAGTTT

Coding sequence (CDS)

ATGGAGAAGCTGAAGTCACTGGTGCCGGAGAACTTGAAGCAGACGGTGGGTTCAAGCACCGTCGATGATCTTCCCTCATCGTGTTCTTTCTTATTACGCCTTTTTCAGCAATCCCAGCTCTTCTTCCAAGTCATCGGGGATTTGGCAATGGACCCTGAAAATGCTCTCTGTGGTAAGAAAAAGGACGCTGCTCTGGAGTTGAAGCGCCAGGGAAATCAATGCTTCTTGAAGGGGGATTATGCTCCTGCGTTGGTTTATTATTCCCAGGCACTGCAAGTGGCTCCGATGAATGCTGTTGACATGGATAAGAATTTGGTTGCAACCTTATATGTGAATCGAGCATCAGTTTTGCTTAAAATGGATCTGCAATTGGAGTGTTTACGTGATTGCAATAGAGCACTTCAAATTTCATCAAACTATGCAAAGGCATGGTATAGAAGAGGTAAAGCAAATGCTAGTATGGGAAATTTTCATGATGCTATCCGTGACTTTCAAATGTCTAAGAGTGTGGAGGTATCATTCAATGGAAAGAAACAGGTAGACGACGAGTTGAAGATCATCCAACGTCAGCACAAGAGGTCAAATACAGTACTGGAACATAGCAACAACAACAAATTAGACGATTTTGATGAGCCAATTCAAGTAAAATTACATGTCACCACGTCGAATAAAGGTAGAGGAATGGTTTCACCCATTGAGATACCTCCATCATCCTTGGTCCATGTTGAAGAACCTTATGCCTTGGTAATATTGAAGCATTGTAGAGAAACTCACTGCCATTACTGCTTGAATGAGCTACCAGCAGATAAAGTACCCTGTCCATCATGCTCGATTCCTCTGTACTGCTCACAACGTTGCCAAATACAAGCCGGGGGACGAATGTTACAAAACGTTCCAGATAATAAAGAGATTTTAAAAGATCTATCTGATGACCTCAGAAAGTATGTTCAAGAAATAACTTTGCCCAGTTTTGCTGACTTAAGGACTGATGATGTTCCTGAACATAAACATGAATGTGATGGTGTGCACTGGCCTGTAATTTTGCCATCTGAAATAGTTTTGGCTGGGCGAATAGTGGCTAAATTTGTAGGACAGGGAGGTGTCTTTGCAGATGCTTCTAACCTTGTGGATATTTGTCTTCGGCAATTTTTCCCCTCTCAACTTCCAGTAAATGAGAACACTATCTCGCAGATTGTCATACTTATATCCCAAATCAGAACAAATTCTATATCTATTGTCCGTATGAAATCCTTCGATGCACCGGGGTCACGAGATCAGTCTGGAAGATTATCTAGCGTGGTTCCTTTTACTTGTAATATGGAACAAGTCAGAGTAGGTCAAGCTATTTATACAACTGGAAGCTTGTTTAACCATTCATGCAAACCGAACATCCATGCATATTTCAATTCACGTACCCTCTTTATTCGGACAACTGCGTCCGTGACAGTGGGGTGCCCCCTAGAGTTGTCATACGGTCCACAGTGTAGTGGTTGCTCAATGGTGCATATACCTGACCTTGTCCTCAATGCATTTTGTTGCATTAATTCAAGCTGCTGTGGCGTAGTCTTGGATAGATCCATCTTCAACTGTGAAAACAAGAAAACCAAGGACTATCTTACGGTCGACGAACAAAGTAGGCTTGAGCCTTTCATGCTGACTGACAGCTTCCTTCATGCTGGTCCTAGCCATTGTTTGAAGTGCGGATCTTATCGTAATATAAAATCATCTCGTTCGACTGTGGACGAGGCCTGGATTCACTTTACGAGGTTGCAGCAGGAGATAAATTCAAATAGGGTATCCGAGACGACAGTCTCAGATGCTTTGAGAGCCCTGTGCTCACTGAAGTCTACATTGCATGCATATAATAAGCGTATAGCAGAAGCTGAAGACAATCTGTCACAGGCCTTCTGTTTGCTTGGAAAACTCGAGCTGGCAGCGGACCATTGTAAAGCATCAATTCGGATTCTAGAGAAGTTGTATGGCGAAAACCATATCACCATTGGCAACGAACTCTTGAAGCTGTCTTCCATTCTGTTGTCTGTGGGTGACTGCAATGGTGTGGAGTGCATTAAACGATTGAGTGAAATTTTCAGGTGTCATTATGGATGGCATGCCAACGCAATGTTCCCATTTTTGAACATCTTGGAGGAAGAAACTCACAAATTTGTCAGCACAGATGTTTGA

Protein sequence

MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKKKDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKMDLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQVDDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPDNKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVAKFVGQGGVFADASNLVDICLRQFFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQCSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV
Homology
BLAST of Cp4.1LG20g05030 vs. ExPASy Swiss-Prot
Match: Q8BTK5 (SET and MYND domain-containing protein 4 OS=Mus musculus OX=10090 GN=Smyd4 PE=2 SV=2)

HSP 1 Score: 91.7 bits (226), Expect = 3.8e-17
Identity = 169/798 (21.18%), Postives = 292/798 (36.59%), Query Frame = 0

Query: 8   VPENLKQTVGSS-TVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKKKDAALE 67
           +P++++ T+ ++ T+ D+    S LL+   + ++F + +        +    K  DA L 
Sbjct: 19  LPKSVQDTISTAETLSDIFLPSSSLLQ--PEDEMFLKELS------SSYSVEKDNDAPLF 78

Query: 68  LKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKMDLQLEC 127
            + +GN+ F + +Y  A V YS+ +  +  N  D     ++  Y NR++ L  +     C
Sbjct: 79  YREEGNRKFQEKEYTDAAVLYSKGVSHSRPNTED-----ISLCYANRSAALFHLGQYEAC 138

Query: 128 LRDCNRALQ---ISSNYAKAWYRRGKANASMGNFHDA---IRDFQMSKS-----VEVSFN 187
           L+D   A           K   R+ +   ++G   +A   I D + S +     V  S+ 
Sbjct: 139 LKDIVEAGMHGYPERLQPKMMVRKTECLVNLGRLQEARQTISDLESSLTAKPTLVLSSYQ 198

Query: 188 GKKQVDDELKIIQRQHKRSNTVLEHSNNNKLDDF---DEPIQV---KLHV---TTSNKGR 247
             ++    LKI  ++ +     +  +  N  +D    +E  Q+    L V   T   KGR
Sbjct: 199 ILQRNVQHLKIKIQEKETLPEPIPAALTNAFEDIALGEENTQISGASLSVSLCTHPLKGR 258

Query: 248 GMVSPIEIPPSSLVHVEEPYALVIL-------KHCRET-----------HCHYCLNELPA 307
            +V+  +I P  L+  E+ +  V++        HC E            +CH CL    A
Sbjct: 259 HLVATKDILPGELLVKEDAFVSVLIPGEMPRPHHCLENKWDTRVTSGDLYCHRCLKHTLA 318

Query: 308 DKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPDNKEILKDLSDDLRKYVQEITLPSFADL- 367
             VPC SCS   YCSQ C  QA             +L  L       ++   L  F D+ 
Sbjct: 319 -TVPCGSCSYAKYCSQECMQQAWDLYHSTECSLGGLLLTLGVFCHVALRMTLLARFEDVD 378

Query: 368 -----------RTDD-VPEHKHECDGVHWPVILPS-EIVLAGRIVAKFVGQGGVFADASN 427
                       TD  +PE K+      +     S E    G          G +    N
Sbjct: 379 RVVRMLCDEVGSTDTCLPESKNLVKAFDYTSQGESEEKSKIGEPPIPGCNVNGKYGSNYN 438

Query: 428 LVDICLRQFFPSQLPVNENTISQIVILISQIRTNSISIVRMKS----FDAPGSRDQ---- 487
            +   L                 +  L  Q++ +S+    +KS       PG        
Sbjct: 439 AIFSLLPHTEKHSPEHRFICAISVSALCRQLKADSVQAQTLKSPKLKAVTPGLCADLTVW 498

Query: 488 -SGRLSSVVPFTCNME-------------------QVRVGQAIYTTGSLFNHSCKPNIHA 547
            +  L  ++   CN +                   Q+R+   I+   SL NHSC+PN   
Sbjct: 499 GAAMLRHMLQLQCNAQAITSICHTGSNESIITNSRQIRLATGIFPVVSLLNHSCRPNTSV 558

Query: 548 YFNSRTLFIRTTASVTVGCPLELSYGPQCS-------------------GCSMVHIPDLV 607
            F      +R    +  G  +   YGP  S                    C   H   L 
Sbjct: 559 SFTGTVATVRAAQRIAKGQEILHCYGPHESRMGVAERQQRLSSQYFFDCRCGACHAETLR 618

Query: 608 L------NAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDE-QSRLEPFMLTDSFLHAG 667
                   AFCC   +C  ++    + +C N+   + ++ D+  SRL+            
Sbjct: 619 AAAAPRWEAFCC--KTCRALMQGNDVLSCSNESCTNSVSRDQLVSRLQDLQQQ------- 678

Query: 668 PSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRALCSLKSTLHA 699
                 C + + +++ +                       E  +   LR   + +S L A
Sbjct: 679 -----VCMAQKLLRTGK----------------------PEQAIQQLLRCREAAESFLSA 738

BLAST of Cp4.1LG20g05030 vs. ExPASy Swiss-Prot
Match: Q9HGM9 (DnaJ homolog subfamily C member 7 homolog OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC543.02c PE=4 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 4.8e-12
Identity = 49/127 (38.58%), Postives = 73/127 (57.48%), Query Frame = 0

Query: 68  KRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKMDLQLECL 127
           K QGN  F +G+Y  A   YS+ALQ+ P N     K  VA LY+NRA+VLL++    E L
Sbjct: 227 KNQGNDLFRQGNYQDAYEKYSEALQIDPDN-----KETVAKLYMNRATVLLRLKRPEEAL 286

Query: 128 RDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQVDDELKII 187
            D + AL I S+Y K    R KA+ ++  + +A+RD Q +  ++ S    +Q   EL+ +
Sbjct: 287 SDSDNALAIDSSYLKGLKVRAKAHEALEKWEEAVRDVQSAIELDASDANLRQ---ELRRL 345

Query: 188 QRQHKRS 195
           Q + K+S
Sbjct: 347 QLELKKS 345

BLAST of Cp4.1LG20g05030 vs. ExPASy Swiss-Prot
Match: Q9CWR2 (Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 1.5e-10
Identity = 67/279 (24.01%), Postives = 105/279 (37.63%), Query Frame = 0

Query: 220 TTSNKGRGMVSPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADK-VPCPSCSI 279
           TT+N+G G+ +   + P  L+   +P A  + K  R   C  CL  L  +K + C  C I
Sbjct: 10  TTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCL--LGKEKLMRCSQCRI 69

Query: 280 PLYCSQRCQIQAGGRMLQNVPDNKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHE 339
             YCS +CQ +A                                          P+H+ E
Sbjct: 70  AKYCSAKCQKKAW-----------------------------------------PDHRRE 129

Query: 340 CDGVH--WPVILPSEIVLAGRIVAKFVGQGGVFADASNLVDICLRQFFPSQLPVNENT-I 399
           C  +    P   P  + L GR++ K + +    +++  L      +   S+L  ++   +
Sbjct: 130 CSCLKSCKPRYPPDSVRLLGRVIVKLMDEKP--SESEKLYSFYDLESNISKLTEDKKEGL 189

Query: 400 SQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVV--PFT-CNMEQVRVGQAIYTT 459
            Q+ +         I      +   P S D     + V+   FT CN E   VG  +Y +
Sbjct: 190 RQLAMTFQHFMREEI----QDASQLPPSFDLFEAFAKVICNSFTICNAEMQEVGVGLYPS 239

Query: 460 GSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSY 492
            SL NHSC PN    FN   L +R    +  G  L + Y
Sbjct: 250 MSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICY 239

BLAST of Cp4.1LG20g05030 vs. ExPASy Swiss-Prot
Match: P53042 (Serine/threonine-protein phosphatase 5 OS=Rattus norvegicus OX=10116 GN=Ppp5c PE=1 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 1.3e-09
Identity = 41/127 (32.28%), Postives = 63/127 (49.61%), Query Frame = 0

Query: 64  ALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKMDLQ 123
           A ELK Q N  F   DY  A+ +YSQA+++ P NA+          Y NR+   L+ +  
Sbjct: 28  AEELKTQANDYFKAKDYENAIKFYSQAIELNPSNAI---------YYGNRSLAYLRTECY 87

Query: 124 LECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQVDDE 183
              L D  RA+++   Y K +YRR  +N ++G F  A+RD++    V+ +    K    E
Sbjct: 88  GYALGDATRAIELDKKYIKGYYRRAASNMALGKFRAALRDYETVVKVKPNDKDAKMKYQE 145

Query: 184 LKIIQRQ 191
              I +Q
Sbjct: 148 CSKIVKQ 145

BLAST of Cp4.1LG20g05030 vs. ExPASy Swiss-Prot
Match: Q8IWX7 (Protein unc-45 homolog B OS=Homo sapiens OX=9606 GN=UNC45B PE=1 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 1.7e-09
Identity = 42/108 (38.89%), Postives = 62/108 (57.41%), Query Frame = 0

Query: 64  ALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKMDLQ 123
           A++LK +GN+ F   DY  A   YSQAL++        DK L+ATLY NRA+  LK +  
Sbjct: 6   AVQLKEEGNRHFQLQDYKAATNSYSQALKLT------KDKALLATLYRNRAACGLKTESY 65

Query: 124 LECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVE 172
           ++   D +RA+ I+S+  KA YRR +A   +G    A +D Q   ++E
Sbjct: 66  VQAASDASRAIDINSSDIKALYRRCQALEHLGKLDQAFKDVQRCATLE 107

BLAST of Cp4.1LG20g05030 vs. NCBI nr
Match: XP_023520329.1 (SET and MYND domain-containing protein 4 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1439 bits (3724), Expect = 0.0
Identity = 726/776 (93.56%), Postives = 727/776 (93.69%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60
           MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK
Sbjct: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60

Query: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120
           KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM
Sbjct: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120

Query: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180
           DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV
Sbjct: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180

Query: 181 DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240
           DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV
Sbjct: 181 DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240

Query: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300
           HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD
Sbjct: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300

Query: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA 360
           NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA
Sbjct: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA 360

Query: 361 KFVGQGGVFADASNLVDI---------------------------CLRQFFPSQLPVNEN 420
           KFVGQGGVFADASNLVD+                           CLRQFFPSQLPVNEN
Sbjct: 361 KFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLRQFFPSQLPVNEN 420

Query: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG 480
           TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG
Sbjct: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG 480

Query: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ------------------- 540
           SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ                   
Sbjct: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQVGQLDCKDRLKLLEDEYSF 540

Query: 541 ---CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM 600
              CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM
Sbjct: 541 KCQCSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM 600

Query: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660
           LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL
Sbjct: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660

Query: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL 720
           CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL
Sbjct: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL 720

Query: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV 727
           LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV
Sbjct: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV 776

BLAST of Cp4.1LG20g05030 vs. NCBI nr
Match: XP_022927244.1 (SET and MYND domain-containing protein 4 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1403 bits (3632), Expect = 0.0
Identity = 710/776 (91.49%), Postives = 715/776 (92.14%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60
           MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK
Sbjct: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60

Query: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120
           KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM
Sbjct: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120

Query: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180
           DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAI DFQ+SK+VEVSFNGKKQV
Sbjct: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIHDFQISKNVEVSFNGKKQV 180

Query: 181 DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240
           DDELKIIQRQHKRSNTV EHSNN KLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV
Sbjct: 181 DDELKIIQRQHKRSNTVQEHSNN-KLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240

Query: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300
           HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGG+MLQNVPD
Sbjct: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGQMLQNVPD 300

Query: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA 360
           NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWP ILPSEIVLAGRIVA
Sbjct: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPAILPSEIVLAGRIVA 360

Query: 361 KFVGQGGVFADASNLVDI---------------------------CLRQFFPSQLPVNEN 420
           KFVGQGGVFADASNLVD+                           CLRQFFPSQLPVNEN
Sbjct: 361 KFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLRQFFPSQLPVNEN 420

Query: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG 480
           TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSV PFTCNMEQVRVGQAIYTTG
Sbjct: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVAPFTCNMEQVRVGQAIYTTG 480

Query: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ------------------- 540
           SLFNHSCKPNIHAYFNSRTLFIRTTA VTVGCPLELSYGPQ                   
Sbjct: 481 SLFNHSCKPNIHAYFNSRTLFIRTTAFVTVGCPLELSYGPQVGQLDCKDRLKLLEDEYSF 540

Query: 541 ---CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM 600
              CSGCSMVHIPDLVLNAFCCIN SCCGVVLDRSIFNCENKKTKD LTVDEQSRLEPFM
Sbjct: 541 KCQCSGCSMVHIPDLVLNAFCCINPSCCGVVLDRSIFNCENKKTKDSLTVDEQSRLEPFM 600

Query: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660
           LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQE+NSN VSETTVSDALRAL
Sbjct: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEMNSNMVSETTVSDALRAL 660

Query: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL 720
           CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLE AADHCKASIRILEKLYGENHI IGNEL
Sbjct: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLEHAADHCKASIRILEKLYGENHIAIGNEL 720

Query: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV 727
           LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLNILEEETHKFVSTDV
Sbjct: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLNILEEETHKFVSTDV 775

BLAST of Cp4.1LG20g05030 vs. NCBI nr
Match: KAG7019525.1 (SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1402 bits (3629), Expect = 0.0
Identity = 710/776 (91.49%), Postives = 715/776 (92.14%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60
           MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK
Sbjct: 32  MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 91

Query: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120
           KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM
Sbjct: 92  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 151

Query: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180
           DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQ+SK+VEVSFNGKKQV
Sbjct: 152 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQISKNVEVSFNGKKQV 211

Query: 181 DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240
           DDELKIIQRQHKRSNTV EHSNN KLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV
Sbjct: 212 DDELKIIQRQHKRSNTVQEHSNN-KLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 271

Query: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300
           HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGG+MLQNVPD
Sbjct: 272 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGQMLQNVPD 331

Query: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA 360
           NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWP ILPSEIVLAGRIVA
Sbjct: 332 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPAILPSEIVLAGRIVA 391

Query: 361 KFVGQGGVFADASNLVDI---------------------------CLRQFFPSQLPVNEN 420
           KFVGQGGVFADASNLVD+                           CLRQFFPSQLPVNEN
Sbjct: 392 KFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLRQFFPSQLPVNEN 451

Query: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG 480
           TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSV PFTCNMEQVRVGQAIYTTG
Sbjct: 452 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVAPFTCNMEQVRVGQAIYTTG 511

Query: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ------------------- 540
           SLFNHSCKPNIHAYFNSRTLFIRTTA VTVGCPLELSYGPQ                   
Sbjct: 512 SLFNHSCKPNIHAYFNSRTLFIRTTAFVTVGCPLELSYGPQVGQLDCKDRLKLLEDEYSF 571

Query: 541 ---CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM 600
              CSGCSMVHIPDLVLNAFCCIN SCCGVVLDRSIFNCENKKTKD LTVDEQSRLEPFM
Sbjct: 572 KCQCSGCSMVHIPDLVLNAFCCINPSCCGVVLDRSIFNCENKKTKDSLTVDEQSRLEPFM 631

Query: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660
           LTDSFLHAGPSHCLKCGSYRNIKSS STVDEAWIHFTRLQQE+NSN VSETTVSDALRAL
Sbjct: 632 LTDSFLHAGPSHCLKCGSYRNIKSSCSTVDEAWIHFTRLQQEMNSNMVSETTVSDALRAL 691

Query: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL 720
           CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLE AADHCKASIRILEKLYGENHI IGNEL
Sbjct: 692 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLEHAADHCKASIRILEKLYGENHIAIGNEL 751

Query: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV 727
           LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLNILEEETHKFVSTDV
Sbjct: 752 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLNILEEETHKFVSTDV 806

BLAST of Cp4.1LG20g05030 vs. NCBI nr
Match: KAG6583908.1 (SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1395 bits (3610), Expect = 0.0
Identity = 705/776 (90.85%), Postives = 712/776 (91.75%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60
           MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK
Sbjct: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60

Query: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120
           KDAALELKRQGNQCFLKGDYA ALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM
Sbjct: 61  KDAALELKRQGNQCFLKGDYARALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120

Query: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180
           DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQ+SK+VEVS NGKKQV
Sbjct: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQISKNVEVSLNGKKQV 180

Query: 181 DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240
           DDELKIIQRQHKRSNTV EHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV
Sbjct: 181 DDELKIIQRQHKRSNTVHEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240

Query: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300
           HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGG+MLQNVPD
Sbjct: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGQMLQNVPD 300

Query: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA 360
           NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWP ILPSEIVLAGRIVA
Sbjct: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPAILPSEIVLAGRIVA 360

Query: 361 KFVGQGGVFADASNLVDI---------------------------CLRQFFPSQLPVNEN 420
           KFVGQG VFADASNLVD+                           CL+QFFPSQLPVNEN
Sbjct: 361 KFVGQGDVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLQQFFPSQLPVNEN 420

Query: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG 480
           TISQIVILISQIRTNSISIVRMKSFDAPGSRDQ GRLSSV PFTCNMEQVRVGQAIYTTG
Sbjct: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQFGRLSSVAPFTCNMEQVRVGQAIYTTG 480

Query: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ------------------- 540
           SLFNHSCKPNIHAYFNSRTLFIRTTA VTVGCPLELSYGPQ                   
Sbjct: 481 SLFNHSCKPNIHAYFNSRTLFIRTTAFVTVGCPLELSYGPQVGQLDCKDRLKLLEDEYSF 540

Query: 541 ---CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM 600
              CSGCS+VHIPDLVLNAFCCIN SCCGVVLDRSIFNCENKKTKD LTVDEQSRLEPFM
Sbjct: 541 KCQCSGCSLVHIPDLVLNAFCCINPSCCGVVLDRSIFNCENKKTKDSLTVDEQSRLEPFM 600

Query: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660
           LTDSFLHAGPSHCLKCGSYRNIKSS STVDEAWIHFTRLQQE+NSN VSETTVSDALRAL
Sbjct: 601 LTDSFLHAGPSHCLKCGSYRNIKSSCSTVDEAWIHFTRLQQEMNSNMVSETTVSDALRAL 660

Query: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL 720
           CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLE AADHCKASIRILEKLYGENHI IGNEL
Sbjct: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLEHAADHCKASIRILEKLYGENHIAIGNEL 720

Query: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV 727
           LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLNILEEETHKFVSTDV
Sbjct: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLNILEEETHKFVSTDV 776

BLAST of Cp4.1LG20g05030 vs. NCBI nr
Match: XP_023001396.1 (SET and MYND domain-containing protein 4 isoform X2 [Cucurbita maxima])

HSP 1 Score: 1391 bits (3600), Expect = 0.0
Identity = 702/776 (90.46%), Postives = 713/776 (91.88%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60
           MEKLKSLVP+NL+QTVGSSTVDDLPSSCSFLLRLFQQSQLFFQ+IGDL MDPENALCGKK
Sbjct: 1   MEKLKSLVPKNLEQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQLIGDLTMDPENALCGKK 60

Query: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120
           KDAALELKRQGNQCFLKGDYA ALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM
Sbjct: 61  KDAALELKRQGNQCFLKGDYATALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120

Query: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180
           DLQLECLRDCNR LQISSNYAKAWYRRGKANASMGNFHDAIRDFQ+SK+VEVSFNGKKQV
Sbjct: 121 DLQLECLRDCNRTLQISSNYAKAWYRRGKANASMGNFHDAIRDFQISKNVEVSFNGKKQV 180

Query: 181 DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240
           DDELKIIQRQ+KRSNTV EHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV
Sbjct: 181 DDELKIIQRQYKRSNTVQEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240

Query: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300
           HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD
Sbjct: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300

Query: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA 360
           NKEILKDLSDDLRKYVQEIT PSFADLRTDDVPEHKHECDGVHWP ILPSEIVLAGRI+A
Sbjct: 301 NKEILKDLSDDLRKYVQEITSPSFADLRTDDVPEHKHECDGVHWPAILPSEIVLAGRILA 360

Query: 361 KFVGQGGVFADASNLVDI---------------------------CLRQFFPSQLPVNEN 420
           KFVGQGGVFADASNLVD+                           CL+QFFPSQLPVNEN
Sbjct: 361 KFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLKQFFPSQLPVNEN 420

Query: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG 480
           TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSV PFTCNMEQVRVGQAIYTTG
Sbjct: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVAPFTCNMEQVRVGQAIYTTG 480

Query: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ------------------- 540
           SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ                   
Sbjct: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQVGQLDCKDRLKLLEDEYSF 540

Query: 541 ---CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM 600
              CSGCS+VHI DLVL+AFCCIN SC GVVLDRSIFNCENKKTKD LTVDEQSRLEPFM
Sbjct: 541 KCQCSGCSLVHISDLVLDAFCCINPSCFGVVLDRSIFNCENKKTKDSLTVDEQSRLEPFM 600

Query: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660
           LTDSFLHAGPSHCLKCGSYRNIKSS STVDEAWIHFTRLQQEINSNRVSETTVSDALRAL
Sbjct: 601 LTDSFLHAGPSHCLKCGSYRNIKSSCSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660

Query: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL 720
           CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHI IGNEL
Sbjct: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHIAIGNEL 720

Query: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV 727
           LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLNILEEETHKFVSTDV
Sbjct: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLNILEEETHKFVSTDV 776

BLAST of Cp4.1LG20g05030 vs. ExPASy TrEMBL
Match: A0A6J1EHH4 (SET and MYND domain-containing protein 4 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111434150 PE=4 SV=1)

HSP 1 Score: 1403 bits (3632), Expect = 0.0
Identity = 710/776 (91.49%), Postives = 715/776 (92.14%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60
           MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK
Sbjct: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60

Query: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120
           KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM
Sbjct: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120

Query: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180
           DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAI DFQ+SK+VEVSFNGKKQV
Sbjct: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIHDFQISKNVEVSFNGKKQV 180

Query: 181 DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240
           DDELKIIQRQHKRSNTV EHSNN KLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV
Sbjct: 181 DDELKIIQRQHKRSNTVQEHSNN-KLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240

Query: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300
           HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGG+MLQNVPD
Sbjct: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGQMLQNVPD 300

Query: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA 360
           NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWP ILPSEIVLAGRIVA
Sbjct: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPAILPSEIVLAGRIVA 360

Query: 361 KFVGQGGVFADASNLVDI---------------------------CLRQFFPSQLPVNEN 420
           KFVGQGGVFADASNLVD+                           CLRQFFPSQLPVNEN
Sbjct: 361 KFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLRQFFPSQLPVNEN 420

Query: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG 480
           TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSV PFTCNMEQVRVGQAIYTTG
Sbjct: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVAPFTCNMEQVRVGQAIYTTG 480

Query: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ------------------- 540
           SLFNHSCKPNIHAYFNSRTLFIRTTA VTVGCPLELSYGPQ                   
Sbjct: 481 SLFNHSCKPNIHAYFNSRTLFIRTTAFVTVGCPLELSYGPQVGQLDCKDRLKLLEDEYSF 540

Query: 541 ---CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM 600
              CSGCSMVHIPDLVLNAFCCIN SCCGVVLDRSIFNCENKKTKD LTVDEQSRLEPFM
Sbjct: 541 KCQCSGCSMVHIPDLVLNAFCCINPSCCGVVLDRSIFNCENKKTKDSLTVDEQSRLEPFM 600

Query: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660
           LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQE+NSN VSETTVSDALRAL
Sbjct: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEMNSNMVSETTVSDALRAL 660

Query: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL 720
           CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLE AADHCKASIRILEKLYGENHI IGNEL
Sbjct: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLEHAADHCKASIRILEKLYGENHIAIGNEL 720

Query: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV 727
           LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLNILEEETHKFVSTDV
Sbjct: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLNILEEETHKFVSTDV 775

BLAST of Cp4.1LG20g05030 vs. ExPASy TrEMBL
Match: A0A6J1KIH9 (SET and MYND domain-containing protein 4 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111495544 PE=4 SV=1)

HSP 1 Score: 1391 bits (3600), Expect = 0.0
Identity = 702/776 (90.46%), Postives = 713/776 (91.88%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60
           MEKLKSLVP+NL+QTVGSSTVDDLPSSCSFLLRLFQQSQLFFQ+IGDL MDPENALCGKK
Sbjct: 1   MEKLKSLVPKNLEQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQLIGDLTMDPENALCGKK 60

Query: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120
           KDAALELKRQGNQCFLKGDYA ALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM
Sbjct: 61  KDAALELKRQGNQCFLKGDYATALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120

Query: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180
           DLQLECLRDCNR LQISSNYAKAWYRRGKANASMGNFHDAIRDFQ+SK+VEVSFNGKKQV
Sbjct: 121 DLQLECLRDCNRTLQISSNYAKAWYRRGKANASMGNFHDAIRDFQISKNVEVSFNGKKQV 180

Query: 181 DDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240
           DDELKIIQRQ+KRSNTV EHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV
Sbjct: 181 DDELKIIQRQYKRSNTVQEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMVSPIEIPPSSLV 240

Query: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300
           HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD
Sbjct: 241 HVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGRMLQNVPD 300

Query: 301 NKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIVLAGRIVA 360
           NKEILKDLSDDLRKYVQEIT PSFADLRTDDVPEHKHECDGVHWP ILPSEIVLAGRI+A
Sbjct: 301 NKEILKDLSDDLRKYVQEITSPSFADLRTDDVPEHKHECDGVHWPAILPSEIVLAGRILA 360

Query: 361 KFVGQGGVFADASNLVDI---------------------------CLRQFFPSQLPVNEN 420
           KFVGQGGVFADASNLVD+                           CL+QFFPSQLPVNEN
Sbjct: 361 KFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLKQFFPSQLPVNEN 420

Query: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVGQAIYTTG 480
           TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSV PFTCNMEQVRVGQAIYTTG
Sbjct: 421 TISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVAPFTCNMEQVRVGQAIYTTG 480

Query: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ------------------- 540
           SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ                   
Sbjct: 481 SLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQVGQLDCKDRLKLLEDEYSF 540

Query: 541 ---CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLTVDEQSRLEPFM 600
              CSGCS+VHI DLVL+AFCCIN SC GVVLDRSIFNCENKKTKD LTVDEQSRLEPFM
Sbjct: 541 KCQCSGCSLVHISDLVLDAFCCINPSCFGVVLDRSIFNCENKKTKDSLTVDEQSRLEPFM 600

Query: 601 LTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660
           LTDSFLHAGPSHCLKCGSYRNIKSS STVDEAWIHFTRLQQEINSNRVSETTVSDALRAL
Sbjct: 601 LTDSFLHAGPSHCLKCGSYRNIKSSCSTVDEAWIHFTRLQQEINSNRVSETTVSDALRAL 660

Query: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHITIGNEL 720
           CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHI IGNEL
Sbjct: 661 CSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYGENHIAIGNEL 720

Query: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETHKFVSTDV 727
           LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLNILEEETHKFVSTDV
Sbjct: 721 LKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLNILEEETHKFVSTDV 776

BLAST of Cp4.1LG20g05030 vs. ExPASy TrEMBL
Match: A0A6J1KL29 (SET and MYND domain-containing protein 4 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495544 PE=4 SV=1)

HSP 1 Score: 1381 bits (3574), Expect = 0.0
Identity = 703/794 (88.54%), Postives = 713/794 (89.80%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQV---------------- 60
           MEKLKSLVP+NL+QTVGSSTVDDLPSSCSFLLRLFQQSQLFFQV                
Sbjct: 1   MEKLKSLVPKNLEQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVSFGCKRPRIFLPFLSG 60

Query: 61  --IGDLAMDPENALCGKKKDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMD 120
             IGDL MDPENALCGKKKDAALELKRQGNQCFLKGDYA ALVYYSQALQVAPMNAVDMD
Sbjct: 61  KLIGDLTMDPENALCGKKKDAALELKRQGNQCFLKGDYATALVYYSQALQVAPMNAVDMD 120

Query: 121 KNLVATLYVNRASVLLKMDLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIR 180
           KNLVATLYVNRASVLLKMDLQLECLRDCNR LQISSNYAKAWYRRGKANASMGNFHDAIR
Sbjct: 121 KNLVATLYVNRASVLLKMDLQLECLRDCNRTLQISSNYAKAWYRRGKANASMGNFHDAIR 180

Query: 181 DFQMSKSVEVSFNGKKQVDDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTS 240
           DFQ+SK+VEVSFNGKKQVDDELKIIQRQ+KRSNTV EHSNNNKLDDFDEPIQVKLHVTTS
Sbjct: 181 DFQISKNVEVSFNGKKQVDDELKIIQRQYKRSNTVQEHSNNNKLDDFDEPIQVKLHVTTS 240

Query: 241 NKGRGMVSPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYC 300
           NKGRGMVSPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYC
Sbjct: 241 NKGRGMVSPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYC 300

Query: 301 SQRCQIQAGGRMLQNVPDNKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGV 360
           SQRCQIQAGGRMLQNVPDNKEILKDLSDDLRKYVQEIT PSFADLRTDDVPEHKHECDGV
Sbjct: 301 SQRCQIQAGGRMLQNVPDNKEILKDLSDDLRKYVQEITSPSFADLRTDDVPEHKHECDGV 360

Query: 361 HWPVILPSEIVLAGRIVAKFVGQGGVFADASNLVDI------------------------ 420
           HWP ILPSEIVLAGRI+AKFVGQGGVFADASNLVD+                        
Sbjct: 361 HWPAILPSEIVLAGRILAKFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSII 420

Query: 421 ---CLRQFFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVP 480
              CL+QFFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSV P
Sbjct: 421 LSSCLKQFFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVAP 480

Query: 481 FTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ- 540
           FTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ 
Sbjct: 481 FTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQV 540

Query: 541 ---------------------CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENK 600
                                CSGCS+VHI DLVL+AFCCIN SC GVVLDRSIFNCENK
Sbjct: 541 GQLDCKDRLKLLEDEYSFKCQCSGCSLVHISDLVLDAFCCINPSCFGVVLDRSIFNCENK 600

Query: 601 KTKDYLTVDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQE 660
           KTKD LTVDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSS STVDEAWIHFTRLQQE
Sbjct: 601 KTKDSLTVDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSSCSTVDEAWIHFTRLQQE 660

Query: 661 INSNRVSETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASI 720
           INSNRVSETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASI
Sbjct: 661 INSNRVSETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASI 720

Query: 721 RILEKLYGENHITIGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLN 727
           RILEKLYGENHI IGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLN
Sbjct: 721 RILEKLYGENHIAIGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLN 780

BLAST of Cp4.1LG20g05030 vs. ExPASy TrEMBL
Match: A0A6J1EGM3 (SET and MYND domain-containing protein 4 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111434150 PE=4 SV=1)

HSP 1 Score: 1310 bits (3390), Expect = 0.0
Identity = 661/727 (90.92%), Postives = 666/727 (91.61%), Query Frame = 0

Query: 50  MDPENALCGKKKDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATL 109
           MDPENALCGKKKDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATL
Sbjct: 1   MDPENALCGKKKDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATL 60

Query: 110 YVNRASVLLKMDLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKS 169
           YVNRASVLLKMDLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAI DFQ+SK+
Sbjct: 61  YVNRASVLLKMDLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIHDFQISKN 120

Query: 170 VEVSFNGKKQVDDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMV 229
           VEVSFNGKKQVDDELKIIQRQHKRSNTV EHSNN KLDDFDEPIQVKLHVTTSNKGRGMV
Sbjct: 121 VEVSFNGKKQVDDELKIIQRQHKRSNTVQEHSNN-KLDDFDEPIQVKLHVTTSNKGRGMV 180

Query: 230 SPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQ 289
           SPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQ
Sbjct: 181 SPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQ 240

Query: 290 AGGRMLQNVPDNKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILP 349
           AGG+MLQNVPDNKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWP ILP
Sbjct: 241 AGGQMLQNVPDNKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPAILP 300

Query: 350 SEIVLAGRIVAKFVGQGGVFADASNLVDI---------------------------CLRQ 409
           SEIVLAGRIVAKFVGQGGVFADASNLVD+                           CLRQ
Sbjct: 301 SEIVLAGRIVAKFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLRQ 360

Query: 410 FFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQ 469
           FFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSV PFTCNMEQ
Sbjct: 361 FFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVAPFTCNMEQ 420

Query: 470 VRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ-------- 529
           VRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTA VTVGCPLELSYGPQ        
Sbjct: 421 VRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTAFVTVGCPLELSYGPQVGQLDCKD 480

Query: 530 --------------CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLT 589
                         CSGCSMVHIPDLVLNAFCCIN SCCGVVLDRSIFNCENKKTKD LT
Sbjct: 481 RLKLLEDEYSFKCQCSGCSMVHIPDLVLNAFCCINPSCCGVVLDRSIFNCENKKTKDSLT 540

Query: 590 VDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVS 649
           VDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQE+NSN VS
Sbjct: 541 VDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEMNSNMVS 600

Query: 650 ETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLY 709
           ETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLE AADHCKASIRILEKLY
Sbjct: 601 ETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLEHAADHCKASIRILEKLY 660

Query: 710 GENHITIGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETH 727
           GENHI IGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLNILEEETH
Sbjct: 661 GENHIAIGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLNILEEETH 720

BLAST of Cp4.1LG20g05030 vs. ExPASy TrEMBL
Match: A0A6J1KMM0 (SET and MYND domain-containing protein 4 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111495544 PE=4 SV=1)

HSP 1 Score: 1303 bits (3373), Expect = 0.0
Identity = 657/727 (90.37%), Postives = 665/727 (91.47%), Query Frame = 0

Query: 50  MDPENALCGKKKDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATL 109
           MDPENALCGKKKDAALELKRQGNQCFLKGDYA ALVYYSQALQVAPMNAVDMDKNLVATL
Sbjct: 1   MDPENALCGKKKDAALELKRQGNQCFLKGDYATALVYYSQALQVAPMNAVDMDKNLVATL 60

Query: 110 YVNRASVLLKMDLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKS 169
           YVNRASVLLKMDLQLECLRDCNR LQISSNYAKAWYRRGKANASMGNFHDAIRDFQ+SK+
Sbjct: 61  YVNRASVLLKMDLQLECLRDCNRTLQISSNYAKAWYRRGKANASMGNFHDAIRDFQISKN 120

Query: 170 VEVSFNGKKQVDDELKIIQRQHKRSNTVLEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMV 229
           VEVSFNGKKQVDDELKIIQRQ+KRSNTV EHSNNNKLDDFDEPIQVKLHVTTSNKGRGMV
Sbjct: 121 VEVSFNGKKQVDDELKIIQRQYKRSNTVQEHSNNNKLDDFDEPIQVKLHVTTSNKGRGMV 180

Query: 230 SPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQ 289
           SPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQ
Sbjct: 181 SPIEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQ 240

Query: 290 AGGRMLQNVPDNKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILP 349
           AGGRMLQNVPDNKEILKDLSDDLRKYVQEIT PSFADLRTDDVPEHKHECDGVHWP ILP
Sbjct: 241 AGGRMLQNVPDNKEILKDLSDDLRKYVQEITSPSFADLRTDDVPEHKHECDGVHWPAILP 300

Query: 350 SEIVLAGRIVAKFVGQGGVFADASNLVDI---------------------------CLRQ 409
           SEIVLAGRI+AKFVGQGGVFADASNLVD+                           CL+Q
Sbjct: 301 SEIVLAGRILAKFVGQGGVFADASNLVDMLNLSHHFSEMHADSKLECIIYSIILSSCLKQ 360

Query: 410 FFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQ 469
           FFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSV PFTCNMEQ
Sbjct: 361 FFPSQLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVAPFTCNMEQ 420

Query: 470 VRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ-------- 529
           VRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ        
Sbjct: 421 VRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQVGQLDCKD 480

Query: 530 --------------CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLT 589
                         CSGCS+VHI DLVL+AFCCIN SC GVVLDRSIFNCENKKTKD LT
Sbjct: 481 RLKLLEDEYSFKCQCSGCSLVHISDLVLDAFCCINPSCFGVVLDRSIFNCENKKTKDSLT 540

Query: 590 VDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHFTRLQQEINSNRVS 649
           VDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSS STVDEAWIHFTRLQQEINSNRVS
Sbjct: 541 VDEQSRLEPFMLTDSFLHAGPSHCLKCGSYRNIKSSCSTVDEAWIHFTRLQQEINSNRVS 600

Query: 650 ETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLY 709
           ETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLY
Sbjct: 601 ETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLY 660

Query: 710 GENHITIGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANAMFPFLNILEEETH 727
           GENHI IGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHAN MFPFLNILEEETH
Sbjct: 661 GENHIAIGNELLKLSSILLSVGDCNGVECIKRLSEIFRCHYGWHANTMFPFLNILEEETH 720

BLAST of Cp4.1LG20g05030 vs. TAIR 10
Match: AT1G33400.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 661.4 bits (1705), Expect = 8.5e-190
Identity = 363/798 (45.49%), Postives = 505/798 (63.28%), Query Frame = 0

Query: 1   MEKLKSLVPENLKQTVGSSTVDDLPSSCSFLLRLFQQSQLFFQVIGDLAMDPENALCGKK 60
           MEKLKSL+PE+L QTV SS+VDDL S+ S LLRLF     F Q + +LA +PE   CGK 
Sbjct: 1   MEKLKSLIPEDLLQTVKSSSVDDLLSTSSSLLRLFLGLPQFHQAVSELA-NPELGCCGKN 60

Query: 61  KDAALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKM 120
           ++ +L+LKR+GN CF   D+  AL  YS+AL+VAP++A+D DK+L+A+L++NRA+VL  +
Sbjct: 61  EETSLDLKRRGNHCFRSRDFDEALRLYSKALRVAPLDAIDGDKSLLASLFLNRANVLHNL 120

Query: 121 DLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQV 180
            L  E LRDC+RAL+I   YAKAWYRRGK N  +GN+ DA RD  +S S+E S  GKKQ+
Sbjct: 121 GLLKESLRDCHRALRIDPYYAKAWYRRGKLNTLLGNYKDAFRDITVSMSLESSLVGKKQL 180

Query: 181 DDELKIIQRQHKRSNTVLEH-----SNNNKLDDFDE-PIQVKLH-VTTSNKGRGMVSPIE 240
            +ELK I     ++N  LEH     SN+  +D      ++VKL  V+T  KGRGMVS  +
Sbjct: 181 QNELKAI--PDYQNNQTLEHDEYRPSNDAGVDHLPSVQMEVKLRCVSTKEKGRGMVSECD 240

Query: 241 IPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQRCQIQAGGR 300
           I  +S++HVEEP+++VI K CRETHCH+CLNELPAD VPCPSCSIP+YCS+ CQIQ+GG 
Sbjct: 241 IEEASVIHVEEPFSVVISKSCRETHCHFCLNELPADTVPCPSCSIPVYCSESCQIQSGGM 300

Query: 301 MLQNVPDNKEILKDLSDDLRKYVQEITLPSFADLRTDDVPEHKHECDGVHWPVILPSEIV 360
           +  N  D   I + L DD+ ++++ +T        TD + EH+HEC G +WP +LPS+ V
Sbjct: 301 LSTNEMDKHHIFQKLPDDIVEHIKGVTSADIYYFATDLIQEHQHECRGANWPAVLPSDAV 360

Query: 361 LAGRIVAKFVGQGGVFADASNLVDI---------------------------CLRQFFPS 420
           LAGRI+ K + QG    D SNL +I                           CL +    
Sbjct: 361 LAGRIIMKLINQGKAATDLSNLQEILELSHTYSKMNPENKLELHLLSIVLIWCLSKSSCP 420

Query: 421 QLPVNENTISQIVILISQIRTNSISIVRMKSFDAPGSRDQSGRLSSVVPFTCNMEQVRVG 480
            L V E +++Q +IL+SQI+ NSI++ RMKS         SG +S+  P   ++EQ+RVG
Sbjct: 421 NLSVCEASVTQTIILLSQIKVNSIAVARMKSSGDSFKCLPSGNISTKEPIQ-SLEQIRVG 480

Query: 481 QAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTASVTVGCPLELSYGPQ------------ 540
           QA+Y TGSLFNHSCKPNIH YF SR L ++TT  V  GCPLELSYGP+            
Sbjct: 481 QALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRIRF 540

Query: 541 ----------CSGCSMVHIPDLVLNAFCCINSSCCGVVLDRSIFNCENKKTKDYLT---- 600
                     C GC+ ++I DLV+N + C+N++C GVVLD ++  CE++K   + T    
Sbjct: 541 LEEEYFFHCRCRGCAQINISDLVINGYGCVNTNCTGVVLDSNVATCESEKLNHFFTAPRN 600

Query: 601 VDEQSRLEPFMLTD-------------SFLHAGPSHCLKCGSYRNIKSSRSTVDEAWIHF 660
           VD+Q ++   +  D               LH  P  CLKCGS  +I++S + V++AW H 
Sbjct: 601 VDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSHAEVNKAWNHM 660

Query: 661 TRLQQEINSNRVSETTVSDALRALCSLKSTLHAYNKRIAEAEDNLSQAFCLLGKLELAAD 720
            R+++ +NS R + + +SD  R++  L++ LH YNK IA+AED ++QA  L G+L  A  
Sbjct: 661 RRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACYLAGELVDARK 720

Query: 721 HCKASIRILEKLYGENHITIGNELLKLSSILLSVGDCNGV-ECIKRLSEIFRCHYGWHAN 725
           HC+ASI+IL++LY + H+ IGNE++KL+SI L+ GD +G  +  KR S+IF  +YG HA 
Sbjct: 721 HCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQIFSKYYGSHAE 780

BLAST of Cp4.1LG20g05030 vs. TAIR 10
Match: AT2G42810.1 (protein phosphatase 5.2 )

HSP 1 Score: 65.5 bits (158), Expect = 2.1e-10
Identity = 36/105 (34.29%), Postives = 59/105 (56.19%), Query Frame = 0

Query: 64  ALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKMDLQ 123
           A E K Q N+ F    Y+ A+  Y++A+++   NAV          + NRA    K++  
Sbjct: 13  AEEFKSQANEAFKGHKYSSAIDLYTKAIELNSNNAV---------YWANRAFAHTKLEEY 72

Query: 124 LECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSK 169
              ++D ++A+++ S Y+K +YRRG A  +MG F DA++DFQ  K
Sbjct: 73  GSAIQDASKAIEVDSRYSKGYYRRGAAYLAMGKFKDALKDFQQVK 108

BLAST of Cp4.1LG20g05030 vs. TAIR 10
Match: AT2G42810.2 (protein phosphatase 5.2 )

HSP 1 Score: 65.5 bits (158), Expect = 2.1e-10
Identity = 36/105 (34.29%), Postives = 59/105 (56.19%), Query Frame = 0

Query: 64  ALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKMDLQ 123
           A E K Q N+ F    Y+ A+  Y++A+++   NAV          + NRA    K++  
Sbjct: 13  AEEFKSQANEAFKGHKYSSAIDLYTKAIELNSNNAV---------YWANRAFAHTKLEEY 72

Query: 124 LECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSK 169
              ++D ++A+++ S Y+K +YRRG A  +MG F DA++DFQ  K
Sbjct: 73  GSAIQDASKAIEVDSRYSKGYYRRGAAYLAMGKFKDALKDFQQVK 108

BLAST of Cp4.1LG20g05030 vs. TAIR 10
Match: AT4G30480.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 58.2 bits (139), Expect = 3.3e-08
Identity = 36/112 (32.14%), Postives = 61/112 (54.46%), Query Frame = 0

Query: 58  GKKKDAAL----ELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNR 117
           G  K+ AL    E K +GN+ F+ G Y  AL  Y+ AL++  +  +     L +  Y+NR
Sbjct: 95  GSNKEKALAEANEAKAEGNKLFVNGLYEEALSKYAFALEL--VQELPESIELRSICYLNR 154

Query: 118 ASVLLKMDLQLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQ 166
               LK+    E +++C +AL+++  Y KA  RR +A+  + +F DA+ D +
Sbjct: 155 GVCFLKLGKCEETIKECTKALELNPTYNKALVRRAEAHEKLEHFEDAVTDLK 204

BLAST of Cp4.1LG20g05030 vs. TAIR 10
Match: AT4G32070.1 (Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide repeat (TPR)-containing protein )

HSP 1 Score: 57.0 bits (136), Expect = 7.3e-08
Identity = 38/124 (30.65%), Postives = 66/124 (53.23%), Query Frame = 0

Query: 64  ALELKRQGNQCFLKGDYAPALVYYSQALQVAPMNAVDMDKNLVATLYVNRASVLLKMDL- 123
           ALELK +GN+ F K D+  A++ + +AL++ P + +D     VA L  + AS  ++M L 
Sbjct: 51  ALELKEEGNKLFQKRDHEGAMLSFDKALKLLPKDHID-----VAYLRTSMASCYMQMGLG 110

Query: 124 -QLECLRDCNRALQISSNYAKAWYRRGKANASMGNFHDAIRDFQMSKSVEVSFNGKKQVD 183
                + +CN AL+ S  Y+KA  RR +   ++     A RD ++  ++E       ++ 
Sbjct: 111 EYPNAISECNLALEASPRYSKALVRRSRCYEALNKLDYAFRDARIVLNMEPGNVSANEIF 169

Query: 184 DELK 186
           D +K
Sbjct: 171 DRVK 169

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8BTK53.8e-1721.18SET and MYND domain-containing protein 4 OS=Mus musculus OX=10090 GN=Smyd4 PE=2 ... [more]
Q9HGM94.8e-1238.58DnaJ homolog subfamily C member 7 homolog OS=Schizosaccharomyces pombe (strain 9... [more]
Q9CWR21.5e-1024.01Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 ... [more]
P530421.3e-0932.28Serine/threonine-protein phosphatase 5 OS=Rattus norvegicus OX=10116 GN=Ppp5c PE... [more]
Q8IWX71.7e-0938.89Protein unc-45 homolog B OS=Homo sapiens OX=9606 GN=UNC45B PE=1 SV=1[more]
Match NameE-valueIdentityDescription
XP_023520329.10.093.56SET and MYND domain-containing protein 4 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022927244.10.091.49SET and MYND domain-containing protein 4 isoform X1 [Cucurbita moschata][more]
KAG7019525.10.091.49SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp.... [more]
KAG6583908.10.090.85SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp.... [more]
XP_023001396.10.090.46SET and MYND domain-containing protein 4 isoform X2 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1EHH40.091.49SET and MYND domain-containing protein 4 isoform X1 OS=Cucurbita moschata OX=366... [more]
A0A6J1KIH90.090.46SET and MYND domain-containing protein 4 isoform X2 OS=Cucurbita maxima OX=3661 ... [more]
A0A6J1KL290.088.54SET and MYND domain-containing protein 4 isoform X1 OS=Cucurbita maxima OX=3661 ... [more]
A0A6J1EGM30.090.92SET and MYND domain-containing protein 4 isoform X2 OS=Cucurbita moschata OX=366... [more]
A0A6J1KMM00.090.37SET and MYND domain-containing protein 4 isoform X4 OS=Cucurbita maxima OX=3661 ... [more]
Match NameE-valueIdentityDescription
AT1G33400.18.5e-19045.49Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G42810.12.1e-1034.29protein phosphatase 5.2 [more]
AT2G42810.22.1e-1034.29protein phosphatase 5.2 [more]
AT4G30480.23.3e-0832.14Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G32070.17.3e-0830.65Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide r... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR019734Tetratricopeptide repeatSMARTSM00028tpr_5coord: 626..659
e-value: 28.0
score: 11.0
coord: 141..174
e-value: 0.018
score: 24.2
coord: 107..140
e-value: 1.3
score: 18.0
coord: 64..97
e-value: 6.7E-4
score: 29.0
IPR019734Tetratricopeptide repeatPROSITEPS50005TPRcoord: 64..97
score: 8.1424
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 47..206
e-value: 5.7E-30
score: 106.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 494..718
e-value: 3.5E-20
score: 74.7
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 64..656
NoneNo IPR availableGENE3D2.170.270.10SET domaincoord: 392..491
e-value: 1.2E-11
score: 46.4
NoneNo IPR availablePANTHERPTHR47337TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEINcoord: 1..718
NoneNo IPR availableSUPERFAMILY82199SET domaincoord: 220..496
IPR001214SET domainPROSITEPS50280SETcoord: 211..492
score: 9.538796

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g05030.1Cp4.1LG20g05030.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding