CmaCh20G010670 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G010670
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Histone-lysine N-methyltransferase
Location: Cma_Chr20 : 8047687 .. 8051349 (-)
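
The location field gives a chromosome, a start and end coordinate, and a strand. The following is a minimal sketch of how such a string could be parsed and the span measured. It assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse 'chrom : start .. end (strand)' into a small dictionary."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr20 : 8047687 .. 8051349 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 3663 bases on the minus strand of Cma_Chr20.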
The following sequences are available for this feature:

Gene sequence (with intron)

TTTCGCCGTAGAGGCCATTTCGGGAAGACAGAGAGCTAACAAATTCCGCCATTATCGTCTTCCTCCATGCACCTTTCTGCACACCCACACCAGAGCTGCCTACACCAATCCGTTTTCCTCCCTTCCACCTTTTATCCTCTCCCAATTCTGCCTATTGCGAATCTCTCCTCTTCCTCATGACTCCGGCTTTCTCCTCCTCCTCCGCCACCGCCGCCTCCTCCCAGCGCCTTATCCGCTGCAGCGCCTCGCTCCGTCGGACACACGCTCCGCACCGACCTTCCTCGACATCCCCTCCGCCGAGGAAGCTGAAGCCAATGGCGGAGGTAATGGCCAAGGCCAAGCATGTGGTTCTCGAACGGGACGATTACGATGACGTCAGGTGCGAGGAATGTGGCTCCGGCGACCGGGATGATGAGCTGCTGTTGTGCGACAAATGCGACAAGGGGTTTCATATGAAATGTGTTAGTCCGATCGTCGTTAGGGTCCCGATTGGATCCTGGCTTTGCCCCAAATGCTGTGGCCAAAGAAGAGTAAGAAGTAAGGATGCTCACGAATTTGTTGATTCTCCCATTTTCTGTGAATTTCTTGGAGTTTGTGATTTTCGGAATGTCATTGTGTTCCGATTCCTTTCTTCATTCTTTTTGTTTCGCTTGTAGGCTTTTCTCAGAAGAAGATTATCGATTTTTTCGGAATTCAGAAATGTAATGACGAAGAAGTCCCATATCTGTCTGCTCAAGGTAAATTCTTCACTACAGTATTCAAAATTGATATCATATTATGTCTGCATCATAATCTTGGTCATGATCGTGGCCCTTCTGGTATTCATTCTTATATTTTATGCTATTGTCCTCAGAAAATGCTTTAGAATTTCAAGCATTATATAACATAGTGGAACTACTGGCAAATGAGTTTTTATTTTCAATAATGTGGGGCAATGTTATGATATGCTTTATACAAATCTGCTCAATTATCATGCTTGATGTATACATTATTCCTTCTCAACCAATTTTAGCTGTTTAGTTTACTGCTACAGATGTGATTCTTGAAGTGAAATTCTATGATATCCTGATGAAATCGAAAGAGAAGAAAAATCATCTCATATTCTTCCTTGTACTTCATTAAATGATCGAGTTGTGAGGCATAAAAATTCTTAGTGGACATCAATGAGCCTTTACCTATCAAGGAGTTAACATGATTTTTGTGGTTTCTTGAAATCTAGTTGCTTGATAAGGATACATGCTAAATGTTATGCTCTATAAGATGTGTGTGAGAGCTAGCATTTCAGATAAGATAGATACGGAAAGTACAAATAAAATCTTATGACATATATTTGGGATGTCATACTCATATCATATATTTGACAACCATACTTGATTTTGTAAGAACATTTATAGATATTACTGAGCATATATGTGACAACATACTGTAATAGTCCAAGCTCACCGCTAGCAGATATTGTCTTATTTGGGTTTTCCCTTTCAGGCTTCCCCTCAAGGTTTTTAAAACGCATCTACTAGGATGGAGGTTTCTACGCCCTTATAAAGAATATTTCATTCTTCTCCCGGACCGATGTGGGATTTCACAATCCGCTTTCCTTTAGGACCTTGCATTCTCGCTGACACTCGTTCCCTTTGCCAATCAATATGGGACTCCCCCAATCTACCCCCTTCGAGGCCAGCGTCCTTGCTGGCACATCGTCTTGTGTCCACCACTCTTTGGGGCTTAGCCTCCTCGTTGACACATTGCCCCGTGTCTGGCTCTGTTGCCATTTATAACAACCCAAGCTCACCACTGGCAGATATTGTCCTCCTTGGGCTTTTCCTTTCGAGCGTCCCCTCAAGGTTTTTAAAACGCATCTGCTAGGGGGAAGTTTCCACACCCTTATAAAGAATATTTCGTTCTCCTCCCCAACCGAGGTTGGATCTCACACATATCTTATGACATATGTTTAGGCATTACAATTTCAACCAAAAAGTCAGAGATTTGAATCCTACCCTCAA
GTATGGTCAAAAAAAAAAAAAAAAAAATCCTGAATTATTTGTAATTTCACTAATCTCCAACGATTTTTTCTTTTTTATGAAGACCATTTGACAGCAGTAAATCGTCTAGAGGTTTTCTCTTGAAACTTCTCTATACCAAATGTGTTCTTGAGGTGATCATTTCATGTTGGTATTCTATGAGATTTTCTGGATTTGAGACCATTTGGCATTAAAATTTAGTTCTTCCATGTTCTCTGCTGAAACTTTTCTATACCATAGGTGTTTGTTCCTGAACTGATCAATTCATGTTGGACGATTTTCTTTTAATATTCCTTGCATCAAAAGTTATTCATGGATTCATAATTTTAATATTATACAAATGGCATCATGAACTGAAGCTATAAAGCGTAGGAGACGATTACGGCCATTGGTGTGGCAGAAGAAAAGGAGAAGATTACTACCATTTCGTCCAAGTGAAGATCCTGATCGCAGATTGAAACAGATGGGTTCACTTGCTACTGCTTTAACAACATTGAAAATGGAGTTTAGTGATGATTTGACTTACGTGCCGGGCATGGCATCAAGATCTGCTAACCAGGCAGAGTTTGAAGATGGTGGAATGCAGGTTTAAAAACTTGGAACCATTTGAAAACATTTTCTAGTTGGAATCTTTTTTGTTCTTAATAGTGGAATGGTCATAGATTATGTTACCTGCCTTCTATTTTAGGTTCTTTCCAAAGAGGACACTGAGACCTTGGAACTCTGCAGAGCCATGAGCAGAAGAGGCGAATGCCCTCCCCTTTTGGTTGTTTTTGATTCATGTGAAGGGTAACTTTATCTTTAACTCCTTGACCATATTGTCATTTGCTTCCAACATCTATATTGAGTAAGTCCACACATAATTTATGATGTAGATTTACGGTAGAAGCGGATGATCAAATTAAGGATATGACGTTTATTGCCGAATATACTGGTGACGTGGATTATCTTAAAAACCGTGAACACGACGATTGTGATAGTATGATGACCCTTCTTTCAGCAAAAGATCCATCTAGGAGTCTTGTCATCTGCCCCGACAAACGTGGCAACATCGCTCGGTTTATTAATGGAATAAATAATCATTCTCCGTAAGTTGGCCAAGTCTATCACATTTTAAAAATGTATCTAATCGCTCAATCATATCAGCTTTGTCTACATGCACGTATCTAAGTAGTTCACCTCATGGCCATATCTTGATAGCTTACTAAGAACTAGTTTCTTAAAACTTTAATCGTATGTTTAGTTTGTTTCCTCTTTTACTTTTTCCAATATTTCCGAGAATCAAATTCATGATGCTCAAAACATAAATTCCATTTTTGTGCAGAGAAGGTAAGAAGAAACAGAACTGTAAATGTGTGAGATACAATATTAACGGCGAATGCCGGGTCATTTTGGTTGCTATTCAGGATATTGCTAAAGGGGAGAGGCTCTATTATGACTACAATGGATATGAGTATGATTACCCTACTCATCATTTCGTTTGAGAATATCTTAGGATACTTCTAACAATTTTGGGATTTGATTTGTCCCATACAGTTTTTCTCCATTTTCTTATGCTGAATTTTGTTGCTAAGTCAACAGTTTTTGCTGATTCTATCAGAGTAATTTGTGTTACATTTCCTCGGCTGAGGACAGGAGGGAGGGA

mRNA sequence

TTTCGCCGTAGAGGCCATTTCGGGAAGACAGAGAGCTAACAAATTCCGCCATTATCGTCTTCCTCCATGCACCTTTCTGCACACCCACACCAGAGCTGCCTACACCAATCCGTTTTCCTCCCTTCCACCTTTTATCCTCTCCCAATTCTGCCTATTGCGAATCTCTCCTCTTCCTCATGACTCCGGCTTTCTCCTCCTCCTCCGCCACCGCCGCCTCCTCCCAGCGCCTTATCCGCTGCAGCGCCTCGCTCCGTCGGACACACGCTCCGCACCGACCTTCCTCGACATCCCCTCCGCCGAGGAAGCTGAAGCCAATGGCGGAGGTAATGGCCAAGGCCAAGCATGTGGTTCTCGAACGGGACGATTACGATGACGTCAGGTGCGAGGAATGTGGCTCCGGCGACCGGGATGATGAGCTGCTGTTGTGCGACAAATGCGACAAGGGGTTTCATATGAAATGTGTTAGTCCGATCGTCGTTAGGGTCCCGATTGGATCCTGGCTTTGCCCCAAATGCTGTGGCCAAAGAAGAGTAAGAAGCTTTTCTCAGAAGAAGATTATCGATTTTTTCGGAATTCAGAAATGTAATGACGAAGAAGTCCCATATCTGTCTGCTCAAGCTATAAAGCGTAGGAGACGATTACGGCCATTGGTGTGGCAGAAGAAAAGGAGAAGATTACTACCATTTCGTCCAAGTGAAGATCCTGATCGCAGATTGAAACAGATGGGTTCACTTGCTACTGCTTTAACAACATTGAAAATGGAGTTTAGTGATGATTTGACTTACGTGCCGGGCATGGCATCAAGATCTGCTAACCAGGCAGAGTTTGAAGATGGTGGAATGCAGGTTCTTTCCAAAGAGGACACTGAGACCTTGGAACTCTGCAGAGCCATGAGCAGAAGAGGCGAATGCCCTCCCCTTTTGGTTGTTTTTGATTCATGTGAAGGATTTACGGTAGAAGCGGATGATCAAATTAAGGATATGACGTTTATTGCCGAATATACTGGTGACGTGGATTATCTTAAAAACCGTGAACACGACGATTGTGATAGTATGATGACCCTTCTTTCAGCAAAAGATCCATCTAGGAGTCTTGTCATCTGCCCCGACAAACGTGGCAACATCGCTCGGTTTATTAATGGAATAAATAATCATTCTCCAGAAGGTAAGAAGAAACAGAACTGTAAATGTGTGAGATACAATATTAACGGCGAATGCCGGGTCATTTTGGTTGCTATTCAGGATATTGCTAAAGGGGAGAGGCTCTATTATGACTACAATGGATATGAGTATGATTACCCTACTCATCATTTCGTTTGAGAATATCTTAGGATACTTCTAACAATTTTGGGATTTGATTTGTCCCATACAGTTTTTCTCCATTTTCTTATGCTGAATTTTGTTGCTAAGTCAACAGTTTTTGCTGATTCTATCAGAGTAATTTGTGTTACATTTCCTCGGCTGAGGACAGGAGGGAGGGA

Coding sequence (CDS)

ATGACTCCGGCTTTCTCCTCCTCCTCCGCCACCGCCGCCTCCTCCCAGCGCCTTATCCGCTGCAGCGCCTCGCTCCGTCGGACACACGCTCCGCACCGACCTTCCTCGACATCCCCTCCGCCGAGGAAGCTGAAGCCAATGGCGGAGGTAATGGCCAAGGCCAAGCATGTGGTTCTCGAACGGGACGATTACGATGACGTCAGGTGCGAGGAATGTGGCTCCGGCGACCGGGATGATGAGCTGCTGTTGTGCGACAAATGCGACAAGGGGTTTCATATGAAATGTGTTAGTCCGATCGTCGTTAGGGTCCCGATTGGATCCTGGCTTTGCCCCAAATGCTGTGGCCAAAGAAGAGTAAGAAGCTTTTCTCAGAAGAAGATTATCGATTTTTTCGGAATTCAGAAATGTAATGACGAAGAAGTCCCATATCTGTCTGCTCAAGCTATAAAGCGTAGGAGACGATTACGGCCATTGGTGTGGCAGAAGAAAAGGAGAAGATTACTACCATTTCGTCCAAGTGAAGATCCTGATCGCAGATTGAAACAGATGGGTTCACTTGCTACTGCTTTAACAACATTGAAAATGGAGTTTAGTGATGATTTGACTTACGTGCCGGGCATGGCATCAAGATCTGCTAACCAGGCAGAGTTTGAAGATGGTGGAATGCAGGTTCTTTCCAAAGAGGACACTGAGACCTTGGAACTCTGCAGAGCCATGAGCAGAAGAGGCGAATGCCCTCCCCTTTTGGTTGTTTTTGATTCATGTGAAGGATTTACGGTAGAAGCGGATGATCAAATTAAGGATATGACGTTTATTGCCGAATATACTGGTGACGTGGATTATCTTAAAAACCGTGAACACGACGATTGTGATAGTATGATGACCCTTCTTTCAGCAAAAGATCCATCTAGGAGTCTTGTCATCTGCCCCGACAAACGTGGCAACATCGCTCGGTTTATTAATGGAATAAATAATCATTCTCCAGAAGGTAAGAAGAAACAGAACTGTAAATGTGTGAGATACAATATTAACGGCGAATGCCGGGTCATTTTGGTTGCTATTCAGGATATTGCTAAAGGGGAGAGGCTCTATTATGACTACAATGGATATGAGTATGATTACCCTACTCATCATTTCGTTTGA
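
The coding sequence appears to be a contiguous stretch of the mRNA, flanked by untranslated regions. As a rough sketch, the UTR lengths could be derived by locating the CDS inside the mRNA. The helper `utr_lengths` is hypothetical, and the example uses short fragments copied from the sequences above rather than the full strings.

```python
def utr_lengths(mrna, cds):
    """Return (five_prime_utr_len, three_prime_utr_len) for a CDS found in an mRNA.

    Uses the first occurrence of the CDS; this is only meaningful when the
    CDS string occurs exactly once in the mRNA.
    """
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)

# Short fragments taken from the record, not the full sequences.
mrna_fragment = "CTCTTCCTCATGACTCCGGCTTTC"
cds_fragment = "ATGACTCCGGC"
print(utr_lengths(mrna_fragment, cds_fragment))
```

Passing the complete mRNA and CDS strings instead of the fragments would give the full 5' and 3' UTR lengths for this transcript.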

Protein sequence

MTPAFSSSSATAASSQRLIRCSASLRRTHAPHRPSSTSPPPRKLKPMAEVMAKAKHVVLERDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVRSFSQKKIIDFFGIQKCNDEEVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRRLKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAMSRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSAKDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAKGERLYYDYNGYEYDYPTHHFV
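
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and stops at the first stop codon; the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last four codons of the CDS shown above.
print(translate("ATGACTCCGGCTTTCTCC"))
print(translate("CATTTCGTTTGA"))
```

The two fragments translate to the start (MTPAFS) and end (HFV) of the protein sequence listed above.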
BLAST of CmaCh20G010670 vs. Swiss-Prot
Match: ATXR5_RICCO (Probable Histone-lysine N-methyltransferase ATXR5 OS=Ricinus communis GN=ATXR5 PE=1 SV=1)

HSP 1 Score: 555.1 bits (1429), Expect = 6.1e-157
Identity = 280/381 (73.49%), Positives = 317/381 (83.20%), Query Frame = 1

Query: 1   MTPAFSSSSATAASSQRLIRCSASLRRTHAPHRPSSTSPPPRKLKPMAEVMAKAKHVVLE 60
           M PA  +++ T A      R   S RRT A   P S  PPP+KLKP++E++AKA++ V+E
Sbjct: 1   MAPASITTTTTVAR-----RIVGSRRRTKATSPPDS--PPPKKLKPISEILAKAQYAVVE 60

Query: 61  RDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVR 120
           R DY DV C +CGSG+R +ELLLCDKCDKGFHMKCV PIVVRVPIGSWLCPKC GQRRVR
Sbjct: 61  RADYGDVSCMQCGSGERAEELLLCDKCDKGFHMKCVRPIVVRVPIGSWLCPKCSGQRRVR 120

Query: 121 SFSQKKIIDFFGIQKCNDEEVPYLSAQAI-KRRRRLRPLVWQKKRRRLLPFRPSEDPDRR 180
             SQ+KIIDFF IQKCN +     S Q I K RRR   LV+QK+RRRLLPF  SEDP +R
Sbjct: 121 RLSQRKIIDFFRIQKCNHKTDKCSSPQDIRKHRRRSGSLVYQKRRRRLLPFVSSEDPAQR 180

Query: 181 LKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM 240
           LKQMG+LA+ALT L+MEFSDDLTY  GMA RSANQA FE+GGMQVL+KED ETLE CRAM
Sbjct: 181 LKQMGTLASALTELQMEFSDDLTYSSGMAPRSANQARFEEGGMQVLTKEDIETLEQCRAM 240

Query: 241 SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSA 300
            +RG+CPPLLVVFDS EGFTVEAD QIKDMTFIAEYTGDVDY++NREHDDCDSMMTLL A
Sbjct: 241 CKRGDCPPLLVVFDSREGFTVEADGQIKDMTFIAEYTGDVDYIRNREHDDCDSMMTLLLA 300

Query: 301 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 360
           KDPS+SLVICPDKRGNIARFI+GINNH+ +GKKKQNCKCVRY++NGECRV LVA +DIAK
Sbjct: 301 KDPSKSLVICPDKRGNIARFISGINNHTLDGKKKQNCKCVRYSVNGECRVFLVATRDIAK 360

Query: 361 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYE++YPT HFV
Sbjct: 361 GERLYYDYNGYEHEYPTQHFV 374
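
In each alignment block the middle line marks identical residues with the residue letter and conservative substitutions with `+`; blanks are mismatches or gaps. As a small sketch of how the Identity and Positives figures relate to that line, the hypothetical helpers below count the symbols in one midline and format the percentages reported in the HSP header.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for one BLAST midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

def percent(n, total):
    """Format n/total as a percentage with two decimals."""
    return f"{100 * n / total:.2f}%"

# Midline of the final block of the ATXR5_RICCO alignment above.
print(midline_stats("GERLYYDYNGYE++YPT HFV"))
# Totals from the HSP header: 280 identities and 317 positives over 381 columns.
print(percent(280, 381), percent(317, 381))
```

Summing the per-block counts over a whole alignment should reproduce the header totals, provided the midlines are copied with their spacing intact.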

BLAST of CmaCh20G010670 vs. Swiss-Prot
Match: ATXR5_ARATH (Histone-lysine N-methyltransferase ATXR5 OS=Arabidopsis thaliana GN=ATXR5 PE=1 SV=2)

HSP 1 Score: 504.2 bits (1297), Expect = 1.2e-141
Identity = 254/375 (67.73%), Positives = 301/375 (80.27%), Query Frame = 1

Query: 13  ASSQRLIRCSASLRRTHAP-HRPSSTSPPPRKLKPMAEVMAKAKHVVLER-----DDYDD 72
           ASS     CS S RRT AP  RPSS SPPPRK+K MAE+MAK+  VV +      D Y +
Sbjct: 6   ASSPAASPCS-SRRRTKAPARRPSSESPPPRKMKSMAEIMAKSVPVVEQEEEEDEDSYSN 65

Query: 73  VRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVRSFSQKK 132
           V CE+CGSG+ DDELLLCDKCD+GFHMKC+ PIVVRVPIG+WLC  C  QR VR  SQKK
Sbjct: 66  VTCEKCGSGEGDDELLLCDKCDRGFHMKCLRPIVVRVPIGTWLCVDCSDQRPVRRLSQKK 125

Query: 133 IIDFFGIQK-CNDEEVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRRLKQMGS 192
           I+ FF I+K  +  +   LS +  ++RRR   L  +K+RR+LLP  PSEDPD+RL QMG+
Sbjct: 126 ILHFFRIEKHTHQTDKLELSQEETRKRRRSCSLTVKKRRRKLLPLVPSEDPDQRLAQMGT 185

Query: 193 LATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAMSRRGEC 252
           LA+ALT L +++SD L YVPGMA RSANQ++ E GGMQVL KED ETLE C++M RRGEC
Sbjct: 186 LASALTALGIKYSDGLNYVPGMAPRSANQSKLEKGGMQVLCKEDLETLEQCQSMYRRGEC 245

Query: 253 PPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSAKDPSRS 312
           PPL+VVFD  EG+TVEAD  IKD+TFIAEYTGDVDYLKNRE DDCDS+MTLL ++DPS++
Sbjct: 246 PPLVVVFDPLEGYTVEADGPIKDLTFIAEYTGDVDYLKNREKDDCDSIMTLLLSEDPSKT 305

Query: 313 LVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAKGERLYY 372
           LVICPDK GNI+RFINGINNH+P  KKKQNCKCVRY+INGECRV+LVA +DI+KGERLYY
Sbjct: 306 LVICPDKFGNISRFINGINNHNPVAKKKQNCKCVRYSINGECRVLLVATRDISKGERLYY 365

Query: 373 DYNGYEYDYPTHHFV 381
           DYNGYE++YPTHHF+
Sbjct: 366 DYNGYEHEYPTHHFL 379

BLAST of CmaCh20G010670 vs. Swiss-Prot
Match: ATXR6_ARATH (Histone-lysine N-methyltransferase ATXR6 OS=Arabidopsis thaliana GN=ATXR6 PE=1 SV=1)

HSP 1 Score: 414.8 bits (1065), Expect = 9.9e-115
Identity = 210/321 (65.42%), Positives = 243/321 (75.70%), Query Frame = 1

Query: 63  DYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVRSF 122
           D+D V CEEC SG +  +LLLCDKCDKGFH+ C+ PI+V VP GSW CP C   +  +SF
Sbjct: 30  DWDTV-CEECSSGKQPAKLLLCDKCDKGFHLFCLRPILVSVPKGSWFCPSCSKHQIPKSF 89

Query: 123 S--QKKIIDFFGIQKCNDEEVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRRL 182
              Q KIIDFF I++  D      S+ +I ++R+   LV  KK+RRLLP+ PS DP RRL
Sbjct: 90  PLIQTKIIDFFRIKRSPDSSQISSSSDSIGKKRKKTSLVMSKKKRRLLPYNPSNDPQRRL 149

Query: 183 KQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAMS 242
           +QM SLATAL     +FS++LTYV G A RSANQA FE GGMQVLSKE  ETL LC+ M 
Sbjct: 150 EQMASLATALRASNTKFSNELTYVSGKAPRSANQAAFEKGGMQVLSKEGVETLALCKKMM 209

Query: 243 RRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHD-DCDSMMTLLSA 302
             GECPPL+VVFD  EGFTVEAD  IKD T I EY GDVDYL NRE D D DSMMTLL A
Sbjct: 210 DLGECPPLMVVFDPYEGFTVEADRFIKDWTIITEYVGDVDYLSNREDDYDGDSMMTLLHA 269

Query: 303 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 362
            DPS+ LVICPD+R NIARFI+GINNHSPEG+KKQN KCVR+NINGE RV+LVA +DI+K
Sbjct: 270 SDPSQCLVICPDRRSNIARFISGINNHSPEGRKKQNLKCVRFNINGEARVLLVANRDISK 329

Query: 363 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYE++YPT HFV
Sbjct: 330 GERLYYDYNGYEHEYPTEHFV 349

BLAST of CmaCh20G010670 vs. Swiss-Prot
Match: PHRF1_RAT (PHD and RING finger domain-containing protein 1 OS=Rattus norvegicus GN=Phrf1 PE=1 SV=2)

HSP 1 Score: 78.6 bits (192), Expect = 1.7e-13
Identity = 38/100 (38.00%), Positives = 51/100 (51.00%), Query Frame = 1

Query: 60  ERDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRV 119
           E ++ D   CE CG  DR+D LLLCD CD G+HM+C+ P +  VP+  W CP+C      
Sbjct: 182 EAEEEDPTFCEVCGRSDREDRLLLCDGCDAGYHMECLDPPLQEVPVDEWFCPEC------ 241

Query: 120 RSFSQKKIIDFFGIQKCNDEEVPYLSAQAIKRRRRLRPLV 160
              +   +         +DEEV  L A  +    RLRP V
Sbjct: 242 ---AVPGVDPTHDAAPVSDEEVSLLLADVVPTTSRLRPRV 272

BLAST of CmaCh20G010670 vs. Swiss-Prot
Match: PHRF1_MOUSE (PHD and RING finger domain-containing protein 1 OS=Mus musculus GN=Phrf1 PE=1 SV=2)

HSP 1 Score: 78.6 bits (192), Expect = 1.7e-13
Identity = 38/100 (38.00%), Positives = 51/100 (51.00%), Query Frame = 1

Query: 60  ERDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRV 119
           E ++ D   CE CG  DR+D LLLCD CD G+HM+C+ P +  VP+  W CP+C      
Sbjct: 179 EAEEEDPTFCEVCGRSDREDRLLLCDGCDAGYHMECLDPPLQEVPVDEWFCPEC------ 238

Query: 120 RSFSQKKIIDFFGIQKCNDEEVPYLSAQAIKRRRRLRPLV 160
              +   +         +DEEV  L A  +    RLRP V
Sbjct: 239 ---TVPGVDPTHDAAPVSDEEVSLLLADVVPTTSRLRPRV 269

BLAST of CmaCh20G010670 vs. TrEMBL
Match: A0A0A0LM82_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G302110 PE=4 SV=1)

HSP 1 Score: 708.0 bits (1826), Expect = 6.3e-201
Identity = 351/381 (92.13%), Positives = 361/381 (94.75%), Query Frame = 1

Query: 1   MTPAFSSSSATAASSQRLIRCSASLRRTHAPHRPSSTSPPPRKLKPMAEVMAKAKHVVLE 60
           MTPAFSSSS   A+SQRLIRCSAS RRTHAPHRPSS SPP RKLK M E+MAKAKHVVLE
Sbjct: 1   MTPAFSSSS---AASQRLIRCSASPRRTHAPHRPSSMSPPLRKLKSMTEIMAKAKHVVLE 60

Query: 61  RDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVR 120
           R+DYDDV CEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKC GQRRVR
Sbjct: 61  REDYDDVSCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCSGQRRVR 120

Query: 121 SFSQKKIIDFFGIQKCNDE-EVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRR 180
           SFSQKKIIDFF IQKC D+ +V YLSAQAIKRRRRLR LVWQKKRRRLLPF PSEDPDRR
Sbjct: 121 SFSQKKIIDFFRIQKCKDDGDVLYLSAQAIKRRRRLRSLVWQKKRRRLLPFLPSEDPDRR 180

Query: 181 LKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM 240
           LKQMGSLATALTTL+MEFSDDLTY PGMASRSANQAEFEDGGMQVLSKED ETLELCRAM
Sbjct: 181 LKQMGSLATALTTLQMEFSDDLTYGPGMASRSANQAEFEDGGMQVLSKEDAETLELCRAM 240

Query: 241 SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSA 300
           +RRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLS 
Sbjct: 241 NRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSV 300

Query: 301 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 360
           KDPSRSLVICPD RGNIARFINGINNHSPEGKKKQNCKCVRYN+NGECRVILVAI+DIAK
Sbjct: 301 KDPSRSLVICPDTRGNIARFINGINNHSPEGKKKQNCKCVRYNVNGECRVILVAIRDIAK 360

Query: 361 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYEY+YPTHHFV
Sbjct: 361 GERLYYDYNGYEYEYPTHHFV 378

BLAST of CmaCh20G010670 vs. TrEMBL
Match: A0A067JQG3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22238 PE=4 SV=1)

HSP 1 Score: 586.6 bits (1511), Expect = 2.1e-164
Identity = 292/382 (76.44%), Positives = 328/382 (85.86%), Query Frame = 1

Query: 1   MTPAFSSSSATAASSQRLIRCSASLRRTHAPHRPSST-SPPPRKLKPMAEVMAKAKHVVL 60
           M PA ++SS  AA+++R+I    S RRT A   PS   SPPP+KLKP++E++AKAK+ V+
Sbjct: 1   MAPASTTSSPGAAAARRII---GSRRRTKATPLPSPPESPPPKKLKPISEILAKAKYAVV 60

Query: 61  ERDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRV 120
           ER DY DV CE+CGSG+R DELLLCDKCDKGFHMKCV PIV RVPIGSW CPKC GQRRV
Sbjct: 61  ERADYSDVSCEQCGSGERADELLLCDKCDKGFHMKCVRPIVARVPIGSWFCPKCSGQRRV 120

Query: 121 RSFSQKKIIDFFGIQKCNDEEVPYLSAQAI-KRRRRLRPLVWQKKRRRLLPFRPSEDPDR 180
           R  SQKKIIDFF IQKCN ++    S Q   KRRRR  PLV+QKKRRRLLPF PSED   
Sbjct: 121 RRLSQKKIIDFFRIQKCNRKKDKCSSPQDTRKRRRRSGPLVYQKKRRRLLPFIPSEDAAE 180

Query: 181 RLKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRA 240
           RLKQMG+LA+ALT L+MEFSD+LTY+P MA R+ANQAEFE+GGMQVLSKED ETLE CRA
Sbjct: 181 RLKQMGTLASALTALQMEFSDELTYLPDMAPRAANQAEFEEGGMQVLSKEDIETLEQCRA 240

Query: 241 MSRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLS 300
           MSRRGECPPLLVVFDSCEGFTV+AD QIKDMT IAEYTGDVDY++NREHDDCDSMMTLL 
Sbjct: 241 MSRRGECPPLLVVFDSCEGFTVKADSQIKDMTLIAEYTGDVDYIRNREHDDCDSMMTLLL 300

Query: 301 AKDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIA 360
           AKDP++SLVICPDKRGNIARFINGINNH+P+GKKKQNCKCVRY++NGECRV LVA +DIA
Sbjct: 301 AKDPTKSLVICPDKRGNIARFINGINNHTPDGKKKQNCKCVRYSVNGECRVFLVATRDIA 360

Query: 361 KGERLYYDYNGYEYDYPTHHFV 381
           KGERLYYDYNGYE +YPTHHFV
Sbjct: 361 KGERLYYDYNGYEQEYPTHHFV 379

BLAST of CmaCh20G010670 vs. TrEMBL
Match: A0A061DIV3_THECC (Trithorax-related protein 5 isoform 1 OS=Theobroma cacao GN=TCM_001425 PE=4 SV=1)

HSP 1 Score: 571.6 bits (1472), Expect = 7.0e-160
Identity = 286/381 (75.07%), Positives = 325/381 (85.30%), Query Frame = 1

Query: 9   SATAASSQRLIRCSASLRRTHAP-HRPS-------STSPPPRKLKPMAEVMAKAKHVVLE 68
           + T A+++RL+      RRT AP  RPS       S SPP RKL+P+AE+MA+A++ V+E
Sbjct: 4   ATTVAAARRLVGLR---RRTEAPPRRPSPSPPRRPSPSPPQRKLRPVAEIMARARYAVVE 63

Query: 69  RDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVR 128
           R DY DV CE+CGSG+R DELLLCDKCDKGFHMKC+ PI+ RVPIGSWLCPKC G RRVR
Sbjct: 64  RADYSDVGCEQCGSGERPDELLLCDKCDKGFHMKCLRPIMARVPIGSWLCPKCSGHRRVR 123

Query: 129 SFSQKKIIDFFGIQKCNDEEVPYLSAQAI-KRRRRLRPLVWQKKRRRLLPFRPSEDPDRR 188
           SFSQKKIIDFF IQK  D +  + S Q   KRRRR R LV  KKRRRLLPF PSEDP++R
Sbjct: 124 SFSQKKIIDFFRIQKSCDGKKKFTSNQDTRKRRRRSRSLVLLKKRRRLLPFIPSEDPNQR 183

Query: 189 LKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM 248
           L QMG+LA+ALT L+MEFSDDLTY PGMA RSANQA+FE+GGMQVLSKED ETLELCRAM
Sbjct: 184 LNQMGTLASALTALQMEFSDDLTYSPGMAPRSANQAKFENGGMQVLSKEDMETLELCRAM 243

Query: 249 SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSA 308
           +RRGECPPL+VVFDSCEG+TVEAD QIKDMTFIAEYTGDVDY+KNRE+DDCDS+MTLL A
Sbjct: 244 NRRGECPPLIVVFDSCEGYTVEADGQIKDMTFIAEYTGDVDYIKNRENDDCDSLMTLLLA 303

Query: 309 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 368
            D S+SLVICPDKRGNIARFINGINNH+ EGKKKQNCKCVRY++NGECRV+LVA +DIAK
Sbjct: 304 TDSSKSLVICPDKRGNIARFINGINNHTLEGKKKQNCKCVRYSVNGECRVLLVATRDIAK 363

Query: 369 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYE++YPTHHFV
Sbjct: 364 GERLYYDYNGYEHEYPTHHFV 381

BLAST of CmaCh20G010670 vs. TrEMBL
Match: A0A0D2NXD1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G068200 PE=4 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 3.5e-159
Identity = 276/368 (75.00%), Positives = 320/368 (86.96%), Query Frame = 1

Query: 13  ASSQRLIRCSASLRRTHAPHRPSSTSPPPRKLKPMAEVMAKAKHVVLERDDYDDVRCEEC 72
           A ++RL+   +S RRT AP R  S S PP+KL+PM+E+MA+AK+ V+ER DY D+ CE+C
Sbjct: 8   AGARRLV---SSRRRTEAPRRRPSPSTPPKKLRPMSEIMARAKYAVVERADYSDIICEQC 67

Query: 73  GSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVRSFSQKKIIDFFG 132
           GSG+R  ELLLCDKCDKGFHM+C+ PIVVR+PIGSWLCPKC G RRVR+FSQK+IIDFF 
Sbjct: 68  GSGERPGELLLCDKCDKGFHMRCLRPIVVRIPIGSWLCPKCSGHRRVRTFSQKRIIDFFK 127

Query: 133 IQKCNDEEVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRRLKQMGSLATALTT 192
           IQK  D +     +Q  ++RRR RPLV  KKRRRLLPF PSEDP++RLKQMGSLA+ALT 
Sbjct: 128 IQKSGDGKKKCNLSQDTRKRRR-RPLVLLKKRRRLLPFIPSEDPNQRLKQMGSLASALTA 187

Query: 193 LKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAMSRRGECPPLLVVF 252
           ++MEFSDDLTY   MA RSANQA+FE+GGMQVLS+ED ETLELCR+MSRRGECPP +VVF
Sbjct: 188 MQMEFSDDLTYSSDMAPRSANQAKFENGGMQVLSREDMETLELCRSMSRRGECPPFIVVF 247

Query: 253 DSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSAKDPSRSLVICPDK 312
           DSCEG+TVEAD QIKDMTFIAEYTGDVDY+KNRE+DDCDSMMTLL A +PS SLVICPDK
Sbjct: 248 DSCEGYTVEADAQIKDMTFIAEYTGDVDYIKNRENDDCDSMMTLLLATNPSESLVICPDK 307

Query: 313 RGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAKGERLYYDYNGYEY 372
            GNIARFINGINNH+PEGKKKQNCKCVRY++NGECRV+LVA +DIAKGERLYYDYNGYE+
Sbjct: 308 CGNIARFINGINNHTPEGKKKQNCKCVRYSVNGECRVLLVATRDIAKGERLYYDYNGYEH 367

Query: 373 DYPTHHFV 381
           +YPTHHFV
Sbjct: 368 EYPTHHFV 371

BLAST of CmaCh20G010670 vs. TrEMBL
Match: V4SV45_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025879mg PE=4 SV=1)

HSP 1 Score: 568.9 bits (1465), Expect = 4.5e-159
Identity = 287/381 (75.33%), Positives = 321/381 (84.25%), Query Frame = 1

Query: 1   MTPAFSSSSATAASSQRLIRCSASLRRTHAPHRPSSTSPPPRKLKPMAEVMAKAKHVVLE 60
           M PA +SS    A ++RLI    S RRT AP R  S SPPP+K+K M E++AKA + V+E
Sbjct: 1   MAPATTSS----AEARRLI---GSRRRTEAPRRMLSPSPPPKKVKSMEEILAKAHYAVVE 60

Query: 61  RDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVR 120
           R DY DV CE+CGSG+R +ELLLCDKCDKGFHMKC+ PIVVRVPIG+WLCPKC GQRRVR
Sbjct: 61  RGDYGDVGCEQCGSGERAEELLLCDKCDKGFHMKCLRPIVVRVPIGTWLCPKCSGQRRVR 120

Query: 121 SFSQKKIIDFFGIQKCNDEEVPYLSAQAI-KRRRRLRPLVWQKKRRRLLPFRPSEDPDRR 180
           SFSQ+KIIDFF I+K N  E    S Q   KRRRR   LV QKKRRRLLPF PSED  +R
Sbjct: 121 SFSQRKIIDFFKIKKPNLPEEKCDSPQDTRKRRRRSASLVLQKKRRRLLPFTPSEDRSQR 180

Query: 181 LKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM 240
           L QMGSLA ALT L+MEFSDDLTY+PGMA RSANQAEFE+GGMQVLSKEDTETLE CRAM
Sbjct: 181 LSQMGSLAHALTALQMEFSDDLTYMPGMAPRSANQAEFEEGGMQVLSKEDTETLEQCRAM 240

Query: 241 SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSA 300
            +RGECPPL+VV+DSCEGFTVEAD QIKDMTFIAEY GDVD+++NREHDDCDSMMTLL A
Sbjct: 241 CKRGECPPLVVVYDSCEGFTVEADGQIKDMTFIAEYIGDVDFIRNREHDDCDSMMTLLLA 300

Query: 301 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 360
            DPS+SLVICPDKRGNIARFINGINN++ EG+KKQNCKCVRY++NGECRV LVA +DIAK
Sbjct: 301 TDPSKSLVICPDKRGNIARFINGINNYTLEGRKKQNCKCVRYSVNGECRVFLVATRDIAK 360

Query: 361 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYE +YPTHHFV
Sbjct: 361 GERLYYDYNGYEQEYPTHHFV 374

BLAST of CmaCh20G010670 vs. TAIR10
Match: AT5G09790.2 (AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5)

HSP 1 Score: 504.2 bits (1297), Expect = 7.0e-143
Identity = 254/375 (67.73%), Positives = 301/375 (80.27%), Query Frame = 1

Query: 13  ASSQRLIRCSASLRRTHAP-HRPSSTSPPPRKLKPMAEVMAKAKHVVLER-----DDYDD 72
           ASS     CS S RRT AP  RPSS SPPPRK+K MAE+MAK+  VV +      D Y +
Sbjct: 6   ASSPAASPCS-SRRRTKAPARRPSSESPPPRKMKSMAEIMAKSVPVVEQEEEEDEDSYSN 65

Query: 73  VRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVRSFSQKK 132
           V CE+CGSG+ DDELLLCDKCD+GFHMKC+ PIVVRVPIG+WLC  C  QR VR  SQKK
Sbjct: 66  VTCEKCGSGEGDDELLLCDKCDRGFHMKCLRPIVVRVPIGTWLCVDCSDQRPVRRLSQKK 125

Query: 133 IIDFFGIQK-CNDEEVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRRLKQMGS 192
           I+ FF I+K  +  +   LS +  ++RRR   L  +K+RR+LLP  PSEDPD+RL QMG+
Sbjct: 126 ILHFFRIEKHTHQTDKLELSQEETRKRRRSCSLTVKKRRRKLLPLVPSEDPDQRLAQMGT 185

Query: 193 LATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAMSRRGEC 252
           LA+ALT L +++SD L YVPGMA RSANQ++ E GGMQVL KED ETLE C++M RRGEC
Sbjct: 186 LASALTALGIKYSDGLNYVPGMAPRSANQSKLEKGGMQVLCKEDLETLEQCQSMYRRGEC 245

Query: 253 PPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSAKDPSRS 312
           PPL+VVFD  EG+TVEAD  IKD+TFIAEYTGDVDYLKNRE DDCDS+MTLL ++DPS++
Sbjct: 246 PPLVVVFDPLEGYTVEADGPIKDLTFIAEYTGDVDYLKNREKDDCDSIMTLLLSEDPSKT 305

Query: 313 LVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAKGERLYY 372
           LVICPDK GNI+RFINGINNH+P  KKKQNCKCVRY+INGECRV+LVA +DI+KGERLYY
Sbjct: 306 LVICPDKFGNISRFINGINNHNPVAKKKQNCKCVRYSINGECRVLLVATRDISKGERLYY 365

Query: 373 DYNGYEYDYPTHHFV 381
           DYNGYE++YPTHHF+
Sbjct: 366 DYNGYEHEYPTHHFL 379

BLAST of CmaCh20G010670 vs. TAIR10
Match: AT5G24330.1 (AT5G24330.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6)

HSP 1 Score: 414.8 bits (1065), Expect = 5.6e-116
Identity = 210/321 (65.42%), Positives = 243/321 (75.70%), Query Frame = 1

Query: 63  DYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVRSF 122
           D+D V CEEC SG +  +LLLCDKCDKGFH+ C+ PI+V VP GSW CP C   +  +SF
Sbjct: 30  DWDTV-CEECSSGKQPAKLLLCDKCDKGFHLFCLRPILVSVPKGSWFCPSCSKHQIPKSF 89

Query: 123 S--QKKIIDFFGIQKCNDEEVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRRL 182
              Q KIIDFF I++  D      S+ +I ++R+   LV  KK+RRLLP+ PS DP RRL
Sbjct: 90  PLIQTKIIDFFRIKRSPDSSQISSSSDSIGKKRKKTSLVMSKKKRRLLPYNPSNDPQRRL 149

Query: 183 KQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAMS 242
           +QM SLATAL     +FS++LTYV G A RSANQA FE GGMQVLSKE  ETL LC+ M 
Sbjct: 150 EQMASLATALRASNTKFSNELTYVSGKAPRSANQAAFEKGGMQVLSKEGVETLALCKKMM 209

Query: 243 RRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHD-DCDSMMTLLSA 302
             GECPPL+VVFD  EGFTVEAD  IKD T I EY GDVDYL NRE D D DSMMTLL A
Sbjct: 210 DLGECPPLMVVFDPYEGFTVEADRFIKDWTIITEYVGDVDYLSNREDDYDGDSMMTLLHA 269

Query: 303 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 362
            DPS+ LVICPD+R NIARFI+GINNHSPEG+KKQN KCVR+NINGE RV+LVA +DI+K
Sbjct: 270 SDPSQCLVICPDRRSNIARFISGINNHSPEGRKKQNLKCVRFNINGEARVLLVANRDISK 329

Query: 363 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYE++YPT HFV
Sbjct: 330 GERLYYDYNGYEHEYPTEHFV 349

BLAST of CmaCh20G010670 vs. TAIR10
Match: AT3G01460.1 (AT3G01460.1 methyl-CPG-binding domain 9)

HSP 1 Score: 65.1 bits (157), Expect = 1.1e-10
Identity = 31/84 (36.90%), Positives = 50/84 (59.52%), Query Frame = 1

Query: 42   RKLKPM-AEVMAKAKHVV-----LERDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKC 101
            RKL+ + AE+  + K +V     L +  +D+  C+ CG    DD +LLCD CD  +H  C
Sbjct: 1257 RKLECLSAEMKKEIKDIVVSVNKLPKAPWDEGVCKVCGVDKDDDSVLLCDTCDAEYHTYC 1316

Query: 102  VSPIVVRVPIGSWLCPKCCGQRRV 120
            ++P ++R+P G+W CP C   +R+
Sbjct: 1317 LNPPLIRIPDGNWYCPSCVIAKRM 1340

BLAST of CmaCh20G010670 vs. TAIR10
Match: AT1G77300.1 (AT1G77300.1 histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 specific))

HSP 1 Score: 50.4 bits (119), Expect = 2.7e-06
Identity = 32/116 (27.59%), Positives = 58/116 (50.00%), Query Frame = 1

Query: 256  EGFTVEADDQIKDMTFIAEYTG---DVDYLKNREHDDCDSMMTLLSAKDPSRSLVICPDK 315
            +G+ +   + +++  F+ EY G   D+   + R+ +              + + VI    
Sbjct: 1036 KGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGA 1095

Query: 316  RGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAKGERLYYDYN 369
            +GN+ RFIN    HS E     NC+  ++ +NGE  V + ++QD+ KG+ L +DYN
Sbjct: 1096 KGNLGRFIN----HSCE----PNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYN 1143

BLAST of CmaCh20G010670 vs. TAIR10
Match: AT1G77250.1 (AT1G77250.1 RING/FYVE/PHD-type zinc finger family protein)

HSP 1 Score: 50.1 bits (118), Expect = 3.6e-06
Identity = 18/45 (40.00%), Positives = 25/45 (55.56%), Query Frame = 1

Query: 69  CEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKC 114
           C  C +   DD+++LCD CD  +H+ C+ P    VP G W C  C
Sbjct: 405 CRNCLTDKDDDKIVLCDGCDDAYHIYCMRPPCESVPNGEWFCTAC 449

BLAST of CmaCh20G010670 vs. NCBI nr
Match: gi|659121970|ref|XP_008460909.1| (PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Cucumis melo])

HSP 1 Score: 717.6 bits (1851), Expect = 1.1e-203
Identity = 354/381 (92.91%), Positives = 365/381 (95.80%), Query Frame = 1

Query: 1   MTPAFSSSSATAASSQRLIRCSASLRRTHAPHRPSSTSPPPRKLKPMAEVMAKAKHVVLE 60
           MTPAFSSSS   A+SQRLIRCSAS RRTHAPHRPSS SPP RKLK M E+MAKAKHVVLE
Sbjct: 1   MTPAFSSSS---AASQRLIRCSASPRRTHAPHRPSSMSPPLRKLKSMTEIMAKAKHVVLE 60

Query: 61  RDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVR 120
           R+DYDDV CEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKC GQRRVR
Sbjct: 61  REDYDDVSCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCSGQRRVR 120

Query: 121 SFSQKKIIDFFGIQKCNDE-EVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRR 180
           SFSQKKIIDFF IQKC D+ +VPYLSAQAIKRRRRLR LVWQKKRRRLLPF PSEDPDRR
Sbjct: 121 SFSQKKIIDFFRIQKCKDDGDVPYLSAQAIKRRRRLRSLVWQKKRRRLLPFLPSEDPDRR 180

Query: 181 LKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM 240
           LKQMGSLATALTTL+MEFSDDLTY+PGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM
Sbjct: 181 LKQMGSLATALTTLQMEFSDDLTYMPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM 240

Query: 241 SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSA 300
           SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLS 
Sbjct: 241 SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSV 300

Query: 301 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 360
           KDPSRSLVICPD+RGNIARFINGINNHSPEGKKKQNCKCVRYN+NGECRVILVAI+DIAK
Sbjct: 301 KDPSRSLVICPDRRGNIARFINGINNHSPEGKKKQNCKCVRYNVNGECRVILVAIRDIAK 360

Query: 361 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYEY+YPTHHFV
Sbjct: 361 GERLYYDYNGYEYEYPTHHFV 378

BLAST of CmaCh20G010670 vs. NCBI nr
Match: gi|778670199|ref|XP_011649397.1| (PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Cucumis sativus])

HSP 1 Score: 708.0 bits (1826), Expect = 9.0e-201
Identity = 351/381 (92.13%), Positives = 361/381 (94.75%), Query Frame = 1

Query: 1   MTPAFSSSSATAASSQRLIRCSASLRRTHAPHRPSSTSPPPRKLKPMAEVMAKAKHVVLE 60
           MTPAFSSSS   A+SQRLIRCSAS RRTHAPHRPSS SPP RKLK M E+MAKAKHVVLE
Sbjct: 1   MTPAFSSSS---AASQRLIRCSASPRRTHAPHRPSSMSPPLRKLKSMTEIMAKAKHVVLE 60

Query: 61  RDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVR 120
           R+DYDDV CEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKC GQRRVR
Sbjct: 61  REDYDDVSCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCSGQRRVR 120

Query: 121 SFSQKKIIDFFGIQKCNDE-EVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRR 180
           SFSQKKIIDFF IQKC D+ +V YLSAQAIKRRRRLR LVWQKKRRRLLPF PSEDPDRR
Sbjct: 121 SFSQKKIIDFFRIQKCKDDGDVLYLSAQAIKRRRRLRSLVWQKKRRRLLPFLPSEDPDRR 180

Query: 181 LKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM 240
           LKQMGSLATALTTL+MEFSDDLTY PGMASRSANQAEFEDGGMQVLSKED ETLELCRAM
Sbjct: 181 LKQMGSLATALTTLQMEFSDDLTYGPGMASRSANQAEFEDGGMQVLSKEDAETLELCRAM 240

Query: 241 SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSA 300
           +RRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLS 
Sbjct: 241 NRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSV 300

Query: 301 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 360
           KDPSRSLVICPD RGNIARFINGINNHSPEGKKKQNCKCVRYN+NGECRVILVAI+DIAK
Sbjct: 301 KDPSRSLVICPDTRGNIARFINGINNHSPEGKKKQNCKCVRYNVNGECRVILVAIRDIAK 360

Query: 361 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYEY+YPTHHFV
Sbjct: 361 GERLYYDYNGYEYEYPTHHFV 378

BLAST of CmaCh20G010670 vs. NCBI nr
Match: gi|802726157|ref|XP_012086084.1| (PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Jatropha curcas])

HSP 1 Score: 586.6 bits (1511), Expect = 3.0e-164
Identity = 292/382 (76.44%), Positives = 328/382 (85.86%), Query Frame = 1

Query: 1   MTPAFSSSSATAASSQRLIRCSASLRRTHAPHRPSST-SPPPRKLKPMAEVMAKAKHVVL 60
           M PA ++SS  AA+++R+I    S RRT A   PS   SPPP+KLKP++E++AKAK+ V+
Sbjct: 1   MAPASTTSSPGAAAARRII---GSRRRTKATPLPSPPESPPPKKLKPISEILAKAKYAVV 60

Query: 61  ERDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRV 120
           ER DY DV CE+CGSG+R DELLLCDKCDKGFHMKCV PIV RVPIGSW CPKC GQRRV
Sbjct: 61  ERADYSDVSCEQCGSGERADELLLCDKCDKGFHMKCVRPIVARVPIGSWFCPKCSGQRRV 120

Query: 121 RSFSQKKIIDFFGIQKCNDEEVPYLSAQAI-KRRRRLRPLVWQKKRRRLLPFRPSEDPDR 180
           R  SQKKIIDFF IQKCN ++    S Q   KRRRR  PLV+QKKRRRLLPF PSED   
Sbjct: 121 RRLSQKKIIDFFRIQKCNRKKDKCSSPQDTRKRRRRSGPLVYQKKRRRLLPFIPSEDAAE 180

Query: 181 RLKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRA 240
           RLKQMG+LA+ALT L+MEFSD+LTY+P MA R+ANQAEFE+GGMQVLSKED ETLE CRA
Sbjct: 181 RLKQMGTLASALTALQMEFSDELTYLPDMAPRAANQAEFEEGGMQVLSKEDIETLEQCRA 240

Query: 241 MSRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLS 300
           MSRRGECPPLLVVFDSCEGFTV+AD QIKDMT IAEYTGDVDY++NREHDDCDSMMTLL 
Sbjct: 241 MSRRGECPPLLVVFDSCEGFTVKADSQIKDMTLIAEYTGDVDYIRNREHDDCDSMMTLLL 300

Query: 301 AKDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIA 360
           AKDP++SLVICPDKRGNIARFINGINNH+P+GKKKQNCKCVRY++NGECRV LVA +DIA
Sbjct: 301 AKDPTKSLVICPDKRGNIARFINGINNHTPDGKKKQNCKCVRYSVNGECRVFLVATRDIA 360

Query: 361 KGERLYYDYNGYEYDYPTHHFV 381
           KGERLYYDYNGYE +YPTHHFV
Sbjct: 361 KGERLYYDYNGYEQEYPTHHFV 379

BLAST of CmaCh20G010670 vs. NCBI nr
Match: gi|590708583|ref|XP_007048318.1| (Trithorax-related protein 5 isoform 1 [Theobroma cacao])

HSP 1 Score: 571.6 bits (1472), Expect = 1.0e-159
Identity = 286/381 (75.07%), Positives = 325/381 (85.30%), Query Frame = 1

Query: 9   SATAASSQRLIRCSASLRRTHAP-HRPS-------STSPPPRKLKPMAEVMAKAKHVVLE 68
           + T A+++RL+      RRT AP  RPS       S SPP RKL+P+AE+MA+A++ V+E
Sbjct: 4   ATTVAAARRLVGLR---RRTEAPPRRPSPSPPRRPSPSPPQRKLRPVAEIMARARYAVVE 63

Query: 69  RDDYDDVRCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVR 128
           R DY DV CE+CGSG+R DELLLCDKCDKGFHMKC+ PI+ RVPIGSWLCPKC G RRVR
Sbjct: 64  RADYSDVGCEQCGSGERPDELLLCDKCDKGFHMKCLRPIMARVPIGSWLCPKCSGHRRVR 123

Query: 129 SFSQKKIIDFFGIQKCNDEEVPYLSAQAI-KRRRRLRPLVWQKKRRRLLPFRPSEDPDRR 188
           SFSQKKIIDFF IQK  D +  + S Q   KRRRR R LV  KKRRRLLPF PSEDP++R
Sbjct: 124 SFSQKKIIDFFRIQKSCDGKKKFTSNQDTRKRRRRSRSLVLLKKRRRLLPFIPSEDPNQR 183

Query: 189 LKQMGSLATALTTLKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAM 248
           L QMG+LA+ALT L+MEFSDDLTY PGMA RSANQA+FE+GGMQVLSKED ETLELCRAM
Sbjct: 184 LNQMGTLASALTALQMEFSDDLTYSPGMAPRSANQAKFENGGMQVLSKEDMETLELCRAM 243

Query: 249 SRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSA 308
           +RRGECPPL+VVFDSCEG+TVEAD QIKDMTFIAEYTGDVDY+KNRE+DDCDS+MTLL A
Sbjct: 244 NRRGECPPLIVVFDSCEGYTVEADGQIKDMTFIAEYTGDVDYIKNRENDDCDSLMTLLLA 303

Query: 309 KDPSRSLVICPDKRGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAK 368
            D S+SLVICPDKRGNIARFINGINNH+ EGKKKQNCKCVRY++NGECRV+LVA +DIAK
Sbjct: 304 TDSSKSLVICPDKRGNIARFINGINNHTLEGKKKQNCKCVRYSVNGECRVLLVATRDIAK 363

Query: 369 GERLYYDYNGYEYDYPTHHFV 381
           GERLYYDYNGYE++YPTHHFV
Sbjct: 364 GERLYYDYNGYEHEYPTHHFV 381

BLAST of CmaCh20G010670 vs. NCBI nr
Match: gi|823140887|ref|XP_012470268.1| (PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 isoform X3 [Gossypium raimondii])

HSP 1 Score: 569.3 bits (1466), Expect = 5.0e-159
Identity = 276/368 (75.00%), Positives = 320/368 (86.96%), Query Frame = 1

Query: 13  ASSQRLIRCSASLRRTHAPHRPSSTSPPPRKLKPMAEVMAKAKHVVLERDDYDDVRCEEC 72
           A ++RL+   +S RRT AP R  S S PP+KL+PM+E+MA+AK+ V+ER DY D+ CE+C
Sbjct: 8   AGARRLV---SSRRRTEAPRRRPSPSTPPKKLRPMSEIMARAKYAVVERADYSDIICEQC 67

Query: 73  GSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCCGQRRVRSFSQKKIIDFFG 132
           GSG+R  ELLLCDKCDKGFHM+C+ PIVVR+PIGSWLCPKC G RRVR+FSQK+IIDFF 
Sbjct: 68  GSGERPGELLLCDKCDKGFHMRCLRPIVVRIPIGSWLCPKCSGHRRVRTFSQKRIIDFFK 127

Query: 133 IQKCNDEEVPYLSAQAIKRRRRLRPLVWQKKRRRLLPFRPSEDPDRRLKQMGSLATALTT 192
           IQK  D +     +Q  ++RRR RPLV  KKRRRLLPF PSEDP++RLKQMGSLA+ALT 
Sbjct: 128 IQKSGDGKKKCNLSQDTRKRRR-RPLVLLKKRRRLLPFIPSEDPNQRLKQMGSLASALTA 187

Query: 193 LKMEFSDDLTYVPGMASRSANQAEFEDGGMQVLSKEDTETLELCRAMSRRGECPPLLVVF 252
           ++MEFSDDLTY   MA RSANQA+FE+GGMQVLS+ED ETLELCR+MSRRGECPP +VVF
Sbjct: 188 MQMEFSDDLTYSSDMAPRSANQAKFENGGMQVLSREDMETLELCRSMSRRGECPPFIVVF 247

Query: 253 DSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMMTLLSAKDPSRSLVICPDK 312
           DSCEG+TVEAD QIKDMTFIAEYTGDVDY+KNRE+DDCDSMMTLL A +PS SLVICPDK
Sbjct: 248 DSCEGYTVEADAQIKDMTFIAEYTGDVDYIKNRENDDCDSMMTLLLATNPSESLVICPDK 307

Query: 313 RGNIARFINGINNHSPEGKKKQNCKCVRYNINGECRVILVAIQDIAKGERLYYDYNGYEY 372
            GNIARFINGINNH+PEGKKKQNCKCVRY++NGECRV+LVA +DIAKGERLYYDYNGYE+
Sbjct: 308 CGNIARFINGINNHTPEGKKKQNCKCVRYSVNGECRVLLVATRDIAKGERLYYDYNGYEH 367

Query: 373 DYPTHHFV 381
           +YPTHHFV
Sbjct: 368 EYPTHHFV 371
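
The percentages in each HSP header are the matched-residue counts divided by the alignment length (for example, 292/382 gives 76.44%). The following is a minimal Python sketch for re-deriving them from a header line, assuming the line format used in this report. The function name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool.

```python
import re

# Matches "Label = 292/382 (76.44%)" for any label, so the spelling of the label does not matter.
_FRACTION = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, num, den, pct in _FRACTION.findall(line):
        num, den = int(num), int(den)
        # Recompute the percentage from the fraction, rounded like the report.
        stats[label] = (num, den, float(pct), round(100.0 * num / den, 2))
    return stats

# Sample header line copied from the Jatropha curcas hit.
line = "Identity = 292/382 (76.44%), Positives = 328/382 (85.86%), Query Frame = 1"
stats = parse_hsp_stats(line)
for label, (num, den, reported, recomputed) in stats.items():
    print(label, num, den, reported, recomputed)
```

Comparing the reported and recomputed values is a quick consistency check when these headers are copied by hand into other tables.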

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
ATXR5_RICCO  6.1e-157  73.49         Probable Histone-lysine N-methyltransferase ATXR5 OS=Ricinus communis GN=ATXR5 P...
ATXR5_ARATH  1.2e-141  67.73         Histone-lysine N-methyltransferase ATXR5 OS=Arabidopsis thaliana GN=ATXR5 PE=1 S...
ATXR6_ARATH  9.9e-115  65.42         Histone-lysine N-methyltransferase ATXR6 OS=Arabidopsis thaliana GN=ATXR6 PE=1 S...
PHRF1_RAT    1.7e-13   38.00         PHD and RING finger domain-containing protein 1 OS=Rattus norvegicus GN=Phrf1 PE...
PHRF1_MOUSE  1.7e-13   38.00         PHD and RING finger domain-containing protein 1 OS=Mus musculus GN=Phrf1 PE=1 SV...

Match Name        E-value   Identity (%)  Description
A0A0A0LM82_CUCSA  6.3e-201  92.13         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G302110 PE=4 SV=1
A0A067JQG3_JATCU  2.1e-164  76.44         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22238 PE=4 SV=1
A0A061DIV3_THECC  7.0e-160  75.07         Trithorax-related protein 5 isoform 1 OS=Theobroma cacao GN=TCM_001425 PE=4 SV=1
A0A0D2NXD1_GOSRA  3.5e-159  75.00         Uncharacterized protein OS=Gossypium raimondii GN=B456_003G068200 PE=4 SV=1
V4SV45_9ROSI      4.5e-159  75.33         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025879mg PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT5G09790.2  7.0e-143  67.73         ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5
AT5G24330.1  5.6e-116  65.42         ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6
AT3G01460.1  1.1e-10   36.90         methyl-CPG-binding domain 9
AT1G77300.1  2.7e-06   27.59         histone methyltransferases(H3-K4 specific);histone methyltransferase...
AT1G77250.1  3.6e-06   40.00         RING/FYVE/PHD-type zinc finger family protein

Match Name                        E-value   Identity (%)  Description
gi|659121970|ref|XP_008460909.1|  1.1e-203  92.91         PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Cucumis melo]
gi|778670199|ref|XP_011649397.1|  9.0e-201  92.13         PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Cucumis sativus]
gi|802726157|ref|XP_012086084.1|  3.0e-164  76.44         PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Jatropha curcas]
gi|590708583|ref|XP_007048318.1|  1.0e-159  75.07         Trithorax-related protein 5 isoform 1 [Theobroma cacao]
gi|823140887|ref|XP_012470268.1|  5.0e-159  75.00         PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 isoform X3 [Gossypi...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001214  SET_dom
IPR001965  Znf_PHD
IPR011011  Znf_FYVE_PHD
IPR013083  Znf_RING/FYVE/PHD
IPR019786  Zinc_finger_PHD-type_CS
IPR019787  Znf_PHD-finger
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO:0008270  zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009294 DNA mediated transformation
biological_process GO:0070734 histone H3-K27 methylation
biological_process GO:0009555 pollen development
biological_process GO:0006275 regulation of DNA replication
biological_process GO:0032259 methylation
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005634 nucleus
cellular_component GO:0005575 cellular_component
molecular_function GO:0046976 histone methyltransferase activity (H3-K27 specific)
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0046872 metal ion binding
molecular_function GO:0008168 methyltransferase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G010670.1  CmaCh20G010670.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                        Source   Source Term         Source Description                        Alignment
IPR001214  SET domain                             PFAM     PF00856             SET                                       coord: 286..368; score: 3.
IPR001214  SET domain                             SMART    SM00317             set_7                                     coord: 246..374; score: 1.
IPR001214  SET domain                             PROFILE  PS50280             SET                                       coord: 246..368; score: 14
IPR001965  Zinc finger, PHD-type                  SMART    SM00249             PHD_3                                     coord: 68..114; score: 6.9
IPR011011  Zinc finger, FYVE/PHD-type             unknown  SSF57903            FYVE/PHD zinc finger                      coord: 35..114; score: 1.68
IPR013083  Zinc finger, RING/FYVE/PHD-type        GENE3D   G3DSA:3.30.40.10    -                                         coord: 57..120; score: 2.2
IPR019786  Zinc finger, PHD-type, conserved site  PROSITE  PS01359             ZF_PHD_1                                  coord: 69..113; scor
IPR019787  Zinc finger, PHD-finger                PFAM     PF00628             PHD                                       coord: 68..114; score: 4.0
IPR019787  Zinc finger, PHD-finger                PROFILE  PS50016             ZF_PHD_2                                  coord: 66..116; score: 10
None       No IPR available                       GENE3D   G3DSA:2.170.270.10  -                                         coord: 240..371; score: 9.9
None       No IPR available                       PANTHER  PTHR10615           HISTONE ACETYLTRANSFERASE                 coord: 161..380; score: 1.4E-207
                                                                                                                         coord: 57..137; score: 1.4E
None       No IPR available                       PANTHER  PTHR10615:SF112     HISTONE-LYSINE N-METHYLTRANSFERASE ATXR5  coord: 57..137; score: 1.4E-207
                                                                                                                         coord: 161..380; score: 1.4E
None       No IPR available                       unknown  SSF82199            SET domain                                coord: 240..369; score: 5.49
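
Each `coord: start..end` entry in the alignment column marks where a domain signature matches the protein. The following is a minimal Python sketch for turning those entries into match lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly, and the helper name `coord_spans` is hypothetical.

```python
import re

def coord_spans(text):
    """Return (start, end, length) for every 'coord: start..end' found in text."""
    spans = []
    for start, end in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text):
        start, end = int(start), int(end)
        # Assumption: both endpoints are included in the match.
        spans.append((start, end, end - start + 1))
    return spans

# Two entries copied from the annotations: the PFAM SET and PFAM PHD matches.
spans = coord_spans("coord: 286..368 coord: 68..114")
for start, end, length in spans:
    print(start, end, length)
```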