Cp4.1LG05g10640 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG05g10640
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Histone-lysine N-methyltransferase
Location: Cp4.1LG05 : 7047728 .. 7050936 (-)
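
The location string above can be read as sequence ID, start, end, and strand. The following is a minimal sketch of parsing it, assuming the coordinates are 1-based and inclusive (a common convention for pages like this, but not stated here); the names `Location` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cp4.1LG05 : 7047728 .. 7050936 (-)'."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG05 : 7047728 .. 7050936 (-)")
print(loc.seqid, loc.strand, loc.length)  # span is 3209 bp under the stated assumption
```

A strand of "-" means the gene is read from the reverse complement of the reference sequence, so the sequences listed below run opposite to the chromosome coordinates.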
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGACTCCGGCTTTGTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCGCCGCCGCCGCCTCCCAGCGGCTTTTCCGCTGCACTGCTTCGCTGCGGCGAACTCACGCGCCTCACCGCTCTCCTTCCATGTCTCCTCCGCCGAGGAAGCTGAAACTAATGGCGGAGATAATGGTCAAGGCTAAGTATGCGGTTATCGAACGGGACGATTACGACGACGTCAGGTGTGAGAAATGTCGCTCCGGCGACCGGGACGATGAGCTGCTGTTGTGTGACAAATGCGATAAGGGATTTCATATGAAATGTGTGAGTCCGATCGTTGTTAGGGTCCCGATTGGATCCTGGCTTTGCCCCAAATGCTGTGGCCAAAAAAGAGTCAGAAGTAAGAATATTCTCGAACTTGTTGATTCTTGANTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTCAACTTCGTAAGGTGTATAATCTTTGGACTGTCAACGTGTTCGAATTCCATTCTTCATTCTTTTCAAAATTTTGGTTGTAGGTTTTTCTCAGAAGAAGATTATCGATTTTTTCCGAATTCAGAAATGTAAAGACGCAGAATCTCCATATCTATCCGCTCAAGGTAAATTCTTGATTACAGTGTTCAAAATTCATATGATCTTATGTTTGTTTTATAATCTTGATCACGATCGTGCTCCTGGTATTCAATAATATGGGATAGTTATGATATACTTTCTATATATCTGCTCAAATTGTTATTCCTTTCGTAGACATTTCCCTTTACAGTCATTTGAATTGTTTAAGAGCTAGAATATGGTTCTTGAAGTGACATCCCAGATGAATTCCAGAGAAGAAGATCATCTCTTATTCTTCCTTATACTTAATTGAATATGTGAGGTGTAAGTCATGGAAATTCTTTAAACAACAAGGAGTTCACCTGATTCTTGTGGTTACTTTAAATTTAGTTGCTTAGTAAGGAGACATACTGAATGTTTATGCTTTATAACACCTGCTGTAAGACCATTAGACGTTAAAATATAAATCTCCTAAGAAGTACTCTGCTGAAACTTTTCTACACCATAGGTGTTCCTGAACTGTTCATTTGTTGGTCGGTTTTCTTTTAATATTCCCTTTGTTGTACCTTTAAATGCCACAACAACAATGTTCTGATACCAATTGTTGGAGCGTAATGCCACAACAAAAAATGTTCTGATACCACTTATTCAGCGTTGGAAGCCACAACCAAAAAGCTTTGATTTGCAATTGTGTGGAGTCATTGTCATTTTGTATTTATCTAGAGACAATTAGGAGGAACTTTGTTGAGAGACTTTGTAGCGTGAGAGTGAGAGATAACTTGTAAACACTTTGGTTATAGTGATTGGCTACATGGATGTAGAGGAATTTCTTCTCTCCGAACCACGTAAATTTGTGTGTCTCTTTGTTTAATATATATTTGTGGGTGTGTAAATGCGGTCGATCTTGCTTCTGCTTTTGCTACAACAATTGGTATCAGAGCATTTCGTGGCATTTAAGGGTACAACACCTTTCATTTAGACTTAGTCATGGATTCATTGTTTTAATATTATACAAATGGTATCGTGAACTGAAGCTATAAAGCATAGGAGACGACTACGGTCATTGGTGTGGCAGAAGAAAAGGAGAAGATTACTATCATTTCTTCCAAGTGAAGATCCTGATCGCAGATTGAAACAGATGGGTTCACTTGCTACAGCTCTAACAACATTGCAAATGGAGTTCAGTGATGATCTGACGTATTTGCCGGGCATGGCTTCAAGATCTGCTAACCAGGCAGAGCTTGAAGATGGTGGAATGCAGGTTTAAATACTCATCTGTTGTTAATGACTGAGTTAAGAGTGGAATGGTCATAAATTATGACATCTATCTGCCTTCTGTTGTAGGTTCTTTCCAAAGAGGATATTGAGACCTTTGAACTCTGCAGAACCATGAGCAGAAGAGGCGAATGTGCTCCCCTTTTGGTA
GTTTTTGATACATGTGAAGGGTAACTTTATTTTGAGCTTCTTAATCATATTCCCATATGATTCTAACCTCTCTACTGAGTAGGTCCATGCTCAATCTATGATGTAGTTTTACTGTACAAGCCGATGATCAAATTAAGGATATGACTTTTATTGCTGAATATACTGGTGATGTCGATTATCTTACGAACCGCGAACACGATGATTGTGATAGTATGATGACCCTTCTTTCAGCCAAACATCCATCTAGAAGTCTTGTCATCTGCCCTGACACACATGGAAACATTGCTCGCTTCATCAATGGAATCAATAATCATACTCCGTAAGTTGAACAAGGCTACCTCATTGTCTTTTATACGTTTATATACTTGTGTTCATTCATATCAGCCTTCATGTCTGTGGAGATAGAGACTGAGTTATGCACATATTGATATCTTCTTTTAGTTTATGTAATATTTGAGAGAATGAAATTGTTGATATTCCATTTTTGTGCAGAGAAGGTAAGAAGAAACAGAACTGTAAATGTGTGAGATACAATGTTAAGGGCGAATGCCGTGTGATTTTGGTTGCTATTCGGGATATTGCTAAAGGAGAGAGGCTTTATTATGACTACAATGGATATGAGTATGAATACCCCACTCATCATTTTCTCTGAGAATTTCTTAGGATACGTCTAACAACTTTGGGATTTAATTTGTCTGTCCCATACAGTTTTTTCACCATTTTTTCTTCTTCTTCTGCTGAATCTTGTTGCAAAGTCAACAGTTGTTGCTGATTCTACCAAAGTAATTTGTATTGCATTCCCTCGGCCGAGGTCGAGGCTCGGACCAACCAGGCCCGGGAGGAGGAGGGGGGAGGAATGCCAGATTTTTCCCAAGATTTGGTAGTGAAGGATGAAGTTCTGTTAGGTATATTACTGTAATACAATTGTGAGATAAATGTGCTTTCATTTCTTGTGAAAAGTGATTGCTTGTTACTCCATTCCTACTTTTCTCCCATTTAAGTTTCTTTTATCCCTTGCCTGACGTTTCACATTGGTTTTAGTAAACTTTGAGCAGACTTTTAGCTCGAAATCTTCCTTATTAAGATCTCGTTTGTTGTGTTGTGAAATGATATACGAAATCAACGAGTTGGAAGAATATATAAGTGAAGATTGAAAACCATGAAGTTTGAAATAGTTGTCTTGTTCATATGGTATGTTCGAGCCCGGTC

mRNA sequence

ATGACTCCGGCTTTGTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCGCCGCCGCCGCCTCCCAGCGGCTTTTCCGCTGCACTGCTTCGCTGCGGCGAACTCACGCGCCTCACCGCTCTCCTTCCATGTCTCCTCCGCCGAGGAAGCTGAAACTAATGGCGGAGATAATGGTCAAGGCTAAGTATGCGGTTATCGAACGGGACGATTACGACGACGTCAGGTGTGAGAAATGTCGCTCCGGCGACCGGGACGATGAGCTGCTGTTGTGTGACAAATGCGATAAGGGATTTCATATGAAATGTTTTTCTCAGAAGAAGATTATCGATTTTTTCCGAATTCAGAAATGTAAAGACGCAGAATCTCCATATCTATCCGCTCAAGCTATAAAGCATAGGAGACGACTACGGTCATTGGTGTGGCAGAAGAAAAGGAGAAGATTACTATCATTTCTTCCAAGTGAAGATCCTGATCGCAGATTGAAACAGATGGGTTCACTTGCTACAGCTCTAACAACATTGCAAATGGAGTTCAGTGATGATCTGACGTATTTGCCGGGCATGGCTTCAAGATCTGCTAACCAGGCAGAGCTTGAAGATGGTGGAATGCAGGTTCTTTCCAAAGAGGATATTGAGACCTTTGAACTCTGCAGAACCATGAGCAGAAGAGGCGAATGTGCTCCCCTTTTGGTAGTTTTTGATACATGTGAAGGTTTTACTGTACAAGCCGATGATCAAATTAAGGATATGACTTTTATTGCTGAATATACTGGTGATGTCGATTATCTTACGAACCGCGAACACGATGATTGTGATAGTATGATGACCCTTCTTTCAGCCAAACATCCATCTAGAAGTCTTGTCATCTGCCCTGACACACATGGAAACATTGCTCGCTTCATCAATGGAATCAATAATCATACTCCAGAAGGTAAGAAGAAACAGAACTGTAAATGTGTGAGATACAATGTTAAGGGCGAATGCCGTGTGATTTTGGTTGCTATTCGGGATATTGCTAAAGGAGAGAGGCTTTATTATGACTACAATGGATATGATTGTTGCTGATTCTACCAAAGTAATTTGTATTGCATTCCCTCGGCCGAGGTCGAGGCTCGGACCAACCAGGCCCGGGAGGAGGAGGGGGGAGGAATGCCAGATTTTTCCCAAGATTTGGTAGTGAAGGATGAAGTTCTGTTAGGTATATTACTGTAATACAATTGTGAGATAAATGTGCTTTCATTTCTTGTGAAAAGTGATTGCTTGTTACTCCATTCCTACTTTTCTCCCATTTAAGTTTCTTTTATCCCTTGCCTGACGTTTCACATTGGTTTTAGTAAACTTTGAGCAGACTTTTAGCTCGAAATCTTCCTTATTAAGATCTCGTTTGTTGTGTTGTGAAATGATATACGAAATCAACGAGTTGGAAGAATATATAAGTGAAGATTGAAAACCATGAAGTTTGAAATAGTTGTCTTGTTCATATGGTATGTTCGAGCCCGGTC

Coding sequence (CDS)

ATGACTCCGGCTTTGTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCGCCGCCGCCGCCTCCCAGCGGCTTTTCCGCTGCACTGCTTCGCTGCGGCGAACTCACGCGCCTCACCGCTCTCCTTCCATGTCTCCTCCGCCGAGGAAGCTGAAACTAATGGCGGAGATAATGGTCAAGGCTAAGTATGCGGTTATCGAACGGGACGATTACGACGACGTCAGGTGTGAGAAATGTCGCTCCGGCGACCGGGACGATGAGCTGCTGTTGTGTGACAAATGCGATAAGGGATTTCATATGAAATGTTTTTCTCAGAAGAAGATTATCGATTTTTTCCGAATTCAGAAATGTAAAGACGCAGAATCTCCATATCTATCCGCTCAAGCTATAAAGCATAGGAGACGACTACGGTCATTGGTGTGGCAGAAGAAAAGGAGAAGATTACTATCATTTCTTCCAAGTGAAGATCCTGATCGCAGATTGAAACAGATGGGTTCACTTGCTACAGCTCTAACAACATTGCAAATGGAGTTCAGTGATGATCTGACGTATTTGCCGGGCATGGCTTCAAGATCTGCTAACCAGGCAGAGCTTGAAGATGGTGGAATGCAGGTTCTTTCCAAAGAGGATATTGAGACCTTTGAACTCTGCAGAACCATGAGCAGAAGAGGCGAATGTGCTCCCCTTTTGGTAGTTTTTGATACATGTGAAGGTTTTACTGTACAAGCCGATGATCAAATTAAGGATATGACTTTTATTGCTGAATATACTGGTGATGTCGATTATCTTACGAACCGCGAACACGATGATTGTGATAGTATGATGACCCTTCTTTCAGCCAAACATCCATCTAGAAGTCTTGTCATCTGCCCTGACACACATGGAAACATTGCTCGCTTCATCAATGGAATCAATAATCATACTCCAGAAGGTAAGAAGAAACAGAACTGTAAATGTGTGAGATACAATGTTAAGGGCGAATGCCGTGTGATTTTGGTTGCTATTCGGGATATTGCTAAAGGAGAGAGGCTTTATTATGACTACAATGGATATGATTGTTGCTGA

Protein sequence

MTPALSSSSSSSSSSAAAASQRLFRCTASLRRTHAPHRSPSMSPPPRKLKLMAEIMVKAKYAVIERDDYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKCFSQKKIIDFFRIQKCKDAESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLKQMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSRRGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKHPSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGERLYYDYNGYDCC
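
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a rough illustration, the sketch below translates the first and last 18 bases of the CDS listed above with the standard genetic code; the `translate` function and the compact codon-table construction are assumptions made for this example, not part of the database's own pipeline.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons containing ambiguous bases (e.g. N) become 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGACTCCGGCTTTGTCC"  # first 18 bases of the CDS above
cds_end = "GGATATGATTGTTGCTGA"    # last 18 bases, ending in the TGA stop codon

print(translate(cds_start))  # MTPALS
print(translate(cds_end))    # GYDCC
```

Both fragments agree with the start (MTPALS...) and end (...GYDCC) of the protein sequence shown above.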
BLAST of Cp4.1LG05g10640 vs. Swiss-Prot
Match: ATXR5_RICCO (Probable Histone-lysine N-methyltransferase ATXR5 OS=Ricinus communis GN=ATXR5 PE=1 SV=1)

HSP 1 Score: 466.1 bits (1198), Expect = 3.5e-130
Identity = 242/353 (68.56%), Positives = 270/353 (76.49%), Query Frame = 1

Query: 25  RCTASLRRTHAPHRSPSMSPPPRKLKLMAEIMVKAKYAVIERDDYDDVRCEKCRSGDRDD 84
           R   S RRT A   SP  SPPP+KLK ++EI+ KA+YAV+ER DY DV C +C SG+R +
Sbjct: 15  RIVGSRRRTKAT--SPPDSPPPKKLKPISEILAKAQYAVVERADYGDVSCMQCGSGERAE 74

Query: 85  ELLLCDKCDKGFHMKC--------------------------FSQKKIIDFFRIQKCKDA 144
           ELLLCDKCDKGFHMKC                           SQ+KIIDFFRIQKC   
Sbjct: 75  ELLLCDKCDKGFHMKCVRPIVVRVPIGSWLCPKCSGQRRVRRLSQRKIIDFFRIQKCNHK 134

Query: 145 ESPYLSAQAI-KHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLKQMGSLATALTTLQMEFS 204
                S Q I KHRRR  SLV+QK+RRRLL F+ SEDP +RLKQMG+LA+ALT LQMEFS
Sbjct: 135 TDKCSSPQDIRKHRRRSGSLVYQKRRRRLLPFVSSEDPAQRLKQMGTLASALTELQMEFS 194

Query: 205 DDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSRRGECAPLLVVFDTCEGF 264
           DDLTY  GMA RSANQA  E+GGMQVL+KEDIET E CR M +RG+C PLLVVFD+ EGF
Sbjct: 195 DDLTYSSGMAPRSANQARFEEGGMQVLTKEDIETLEQCRAMCKRGDCPPLLVVFDSREGF 254

Query: 265 TVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKHPSRSLVICPDTHGNIAR 324
           TV+AD QIKDMTFIAEYTGDVDY+ NREHDDCDSMMTLL AK PS+SLVICPD  GNIAR
Sbjct: 255 TVEADGQIKDMTFIAEYTGDVDYIRNREHDDCDSMMTLLLAKDPSKSLVICPDKRGNIAR 314

Query: 325 FINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGERLYYDYNGYD 351
           FI+GINNHT +GKKKQNCKCVRY+V GECRV LVA RDIAKGERLYYDYNGY+
Sbjct: 315 FISGINNHTLDGKKKQNCKCVRYSVNGECRVFLVATRDIAKGERLYYDYNGYE 365
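
The identity and positives percentages in each HSP header are simply the match counts divided by the alignment length. A minimal sketch of extracting and recomputing them follows; `parse_hsp_stats` is a hypothetical helper, and the pattern accepts both "Positives" and the "Postives" spelling that some report pages emit.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract match counts from an HSP summary line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        # Recomputed from the counts, rounded as in the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 242/353 (68.56%), Postives = 270/353 (76.49%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])  # 68.56 76.49
```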

BLAST of Cp4.1LG05g10640 vs. Swiss-Prot
Match: ATXR5_ARATH (Histone-lysine N-methyltransferase ATXR5 OS=Arabidopsis thaliana GN=ATXR5 PE=1 SV=2)

HSP 1 Score: 414.5 bits (1064), Expect = 1.2e-114
Identity = 221/374 (59.09%), Positives = 267/374 (71.39%), Query Frame = 1

Query: 10  SSSSSSAAAASQRLFRCTASLRRTHAPHRSPSM-SPPPRKLKLMAEIMVKAKYAVIER-- 69
           ++SS +A+  S R        RRT AP R PS  SPPPRK+K MAEIM K+   V +   
Sbjct: 5   NASSPAASPCSSR--------RRTKAPARRPSSESPPPRKMKSMAEIMAKSVPVVEQEEE 64

Query: 70  ---DDYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKCF---------------------- 129
              D Y +V CEKC SG+ DDELLLCDKCD+GFHMKC                       
Sbjct: 65  EDEDSYSNVTCEKCGSGEGDDELLLCDKCDRGFHMKCLRPIVVRVPIGTWLCVDCSDQRP 124

Query: 130 ----SQKKIIDFFRIQK-CKDAESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSEDPD 189
               SQKKI+ FFRI+K     +   LS +  + RRR  SL  +K+RR+LL  +PSEDPD
Sbjct: 125 VRRLSQKKILHFFRIEKHTHQTDKLELSQEETRKRRRSCSLTVKKRRRKLLPLVPSEDPD 184

Query: 190 RRLKQMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCR 249
           +RL QMG+LA+ALT L +++SD L Y+PGMA RSANQ++LE GGMQVL KED+ET E C+
Sbjct: 185 QRLAQMGTLASALTALGIKYSDGLNYVPGMAPRSANQSKLEKGGMQVLCKEDLETLEQCQ 244

Query: 250 TMSRRGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLL 309
           +M RRGEC PL+VVFD  EG+TV+AD  IKD+TFIAEYTGDVDYL NRE DDCDS+MTLL
Sbjct: 245 SMYRRGECPPLVVVFDPLEGYTVEADGPIKDLTFIAEYTGDVDYLKNREKDDCDSIMTLL 304

Query: 310 SAKHPSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDI 351
            ++ PS++LVICPD  GNI+RFINGINNH P  KKKQNCKCVRY++ GECRV+LVA RDI
Sbjct: 305 LSEDPSKTLVICPDKFGNISRFINGINNHNPVAKKKQNCKCVRYSINGECRVLLVATRDI 364

BLAST of Cp4.1LG05g10640 vs. Swiss-Prot
Match: ATXR6_ARATH (Histone-lysine N-methyltransferase ATXR6 OS=Arabidopsis thaliana GN=ATXR6 PE=1 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 4.4e-93
Identity = 181/312 (58.01%), Positives = 214/312 (68.59%), Query Frame = 1

Query: 68  DYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKCFS------------------------- 127
           D+D V CE+C SG +  +LLLCDKCDKGFH+ C                           
Sbjct: 30  DWDTV-CEECSSGKQPAKLLLCDKCDKGFHLFCLRPILVSVPKGSWFCPSCSKHQIPKSF 89

Query: 128 ---QKKIIDFFRIQKCKDAESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSEDPDRRL 187
              Q KIIDFFRI++  D+     S+ +I  +R+  SLV  KK+RRLL + PS DP RRL
Sbjct: 90  PLIQTKIIDFFRIKRSPDSSQISSSSDSIGKKRKKTSLVMSKKKRRLLPYNPSNDPQRRL 149

Query: 188 KQMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMS 247
           +QM SLATAL     +FS++LTY+ G A RSANQA  E GGMQVLSKE +ET  LC+ M 
Sbjct: 150 EQMASLATALRASNTKFSNELTYVSGKAPRSANQAAFEKGGMQVLSKEGVETLALCKKMM 209

Query: 248 RRGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHD-DCDSMMTLLSA 307
             GEC PL+VVFD  EGFTV+AD  IKD T I EY GDVDYL+NRE D D DSMMTLL A
Sbjct: 210 DLGECPPLMVVFDPYEGFTVEADRFIKDWTIITEYVGDVDYLSNREDDYDGDSMMTLLHA 269

Query: 308 KHPSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAK 351
             PS+ LVICPD   NIARFI+GINNH+PEG+KKQN KCVR+N+ GE RV+LVA RDI+K
Sbjct: 270 SDPSQCLVICPDRRSNIARFISGINNHSPEGRKKQNLKCVRFNINGEARVLLVANRDISK 329

BLAST of Cp4.1LG05g10640 vs. TrEMBL
Match: A0A0A0LM82_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G302110 PE=4 SV=1)

HSP 1 Score: 585.1 bits (1507), Expect = 5.7e-164
Identity = 303/377 (80.37%), Positives = 314/377 (83.29%), Query Frame = 1

Query: 1   MTPALSSSSSSSSSSAAAASQRLFRCTASLRRTHAPHRSPSMSPPPRKLKLMAEIMVKAK 60
           MTPA SSSS        AASQRL RC+AS RRTHAPHR  SMSPP RKLK M EIM KAK
Sbjct: 1   MTPAFSSSS--------AASQRLIRCSASPRRTHAPHRPSSMSPPLRKLKSMTEIMAKAK 60

Query: 61  YAVIERDDYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKC-------------------- 120
           + V+ER+DYDDV CE+C SGDRDDELLLCDKCDKGFHMKC                    
Sbjct: 61  HVVLEREDYDDVSCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCSG 120

Query: 121 ------FSQKKIIDFFRIQKCKD-AESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSE 180
                 FSQKKIIDFFRIQKCKD  +  YLSAQAIK RRRLRSLVWQKKRRRLL FLPSE
Sbjct: 121 QRRVRSFSQKKIIDFFRIQKCKDDGDVLYLSAQAIKRRRRLRSLVWQKKRRRLLPFLPSE 180

Query: 181 DPDRRLKQMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFE 240
           DPDRRLKQMGSLATALTTLQMEFSDDLTY PGMASRSANQAE EDGGMQVLSKED ET E
Sbjct: 181 DPDRRLKQMGSLATALTTLQMEFSDDLTYGPGMASRSANQAEFEDGGMQVLSKEDAETLE 240

Query: 241 LCRTMSRRGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMM 300
           LCR M+RRGEC PLLVVFD+CEGFTV+ADDQIKDMTFIAEYTGDVDYL NREHDDCDSMM
Sbjct: 241 LCRAMNRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMM 300

Query: 301 TLLSAKHPSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAI 351
           TLLS K PSRSLVICPDT GNIARFINGINNH+PEGKKKQNCKCVRYNV GECRVILVAI
Sbjct: 301 TLLSVKDPSRSLVICPDTRGNIARFINGINNHSPEGKKKQNCKCVRYNVNGECRVILVAI 360

BLAST of Cp4.1LG05g10640 vs. TrEMBL
Match: A0A067JQG3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22238 PE=4 SV=1)

HSP 1 Score: 485.3 bits (1248), Expect = 6.1e-134
Identity = 256/370 (69.19%), Positives = 285/370 (77.03%), Query Frame = 1

Query: 9   SSSSSSSAAAASQRLFRCTASLRRTHA-PHRSPSMSPPPRKLKLMAEIMVKAKYAVIERD 68
           +S++SS  AAA++R+     S RRT A P  SP  SPPP+KLK ++EI+ KAKYAV+ER 
Sbjct: 4   ASTTSSPGAAAARRII---GSRRRTKATPLPSPPESPPPKKLKPISEILAKAKYAVVERA 63

Query: 69  DYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKC--------------------------F 128
           DY DV CE+C SG+R DELLLCDKCDKGFHMKC                           
Sbjct: 64  DYSDVSCEQCGSGERADELLLCDKCDKGFHMKCVRPIVARVPIGSWFCPKCSGQRRVRRL 123

Query: 129 SQKKIIDFFRIQKCKDAESPYLSAQAI-KHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLK 188
           SQKKIIDFFRIQKC   +    S Q   K RRR   LV+QKKRRRLL F+PSED   RLK
Sbjct: 124 SQKKIIDFFRIQKCNRKKDKCSSPQDTRKRRRRSGPLVYQKKRRRLLPFIPSEDAAERLK 183

Query: 189 QMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSR 248
           QMG+LA+ALT LQMEFSD+LTYLP MA R+ANQAE E+GGMQVLSKEDIET E CR MSR
Sbjct: 184 QMGTLASALTALQMEFSDELTYLPDMAPRAANQAEFEEGGMQVLSKEDIETLEQCRAMSR 243

Query: 249 RGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKH 308
           RGEC PLLVVFD+CEGFTV+AD QIKDMT IAEYTGDVDY+ NREHDDCDSMMTLL AK 
Sbjct: 244 RGECPPLLVVFDSCEGFTVKADSQIKDMTLIAEYTGDVDYIRNREHDDCDSMMTLLLAKD 303

Query: 309 PSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGE 351
           P++SLVICPD  GNIARFINGINNHTP+GKKKQNCKCVRY+V GECRV LVA RDIAKGE
Sbjct: 304 PTKSLVICPDKRGNIARFINGINNHTPDGKKKQNCKCVRYSVNGECRVFLVATRDIAKGE 363

BLAST of Cp4.1LG05g10640 vs. TrEMBL
Match: A0A0D2NXD1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G068200 PE=4 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 2.2e-131
Identity = 241/352 (68.47%), Positives = 276/352 (78.41%), Query Frame = 1

Query: 25  RCTASLRRTHAPHRSPSMSPPPRKLKLMAEIMVKAKYAVIERDDYDDVRCEKCRSGDRDD 84
           R  +S RRT AP R PS S PP+KL+ M+EIM +AKYAV+ER DY D+ CE+C SG+R  
Sbjct: 12  RLVSSRRRTEAPRRRPSPSTPPKKLRPMSEIMARAKYAVVERADYSDIICEQCGSGERPG 71

Query: 85  ELLLCDKCDKGFHMKC--------------------------FSQKKIIDFFRIQKCKDA 144
           ELLLCDKCDKGFHM+C                          FSQK+IIDFF+IQK  D 
Sbjct: 72  ELLLCDKCDKGFHMRCLRPIVVRIPIGSWLCPKCSGHRRVRTFSQKRIIDFFKIQKSGDG 131

Query: 145 ESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLKQMGSLATALTTLQMEFSD 204
           +     +Q  + RRR R LV  KKRRRLL F+PSEDP++RLKQMGSLA+ALT +QMEFSD
Sbjct: 132 KKKCNLSQDTRKRRR-RPLVLLKKRRRLLPFIPSEDPNQRLKQMGSLASALTAMQMEFSD 191

Query: 205 DLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSRRGECAPLLVVFDTCEGFT 264
           DLTY   MA RSANQA+ E+GGMQVLS+ED+ET ELCR+MSRRGEC P +VVFD+CEG+T
Sbjct: 192 DLTYSSDMAPRSANQAKFENGGMQVLSREDMETLELCRSMSRRGECPPFIVVFDSCEGYT 251

Query: 265 VQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKHPSRSLVICPDTHGNIARF 324
           V+AD QIKDMTFIAEYTGDVDY+ NRE+DDCDSMMTLL A +PS SLVICPD  GNIARF
Sbjct: 252 VEADAQIKDMTFIAEYTGDVDYIKNRENDDCDSMMTLLLATNPSESLVICPDKCGNIARF 311

Query: 325 INGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGERLYYDYNGYD 351
           INGINNHTPEGKKKQNCKCVRY+V GECRV+LVA RDIAKGERLYYDYNGY+
Sbjct: 312 INGINNHTPEGKKKQNCKCVRYSVNGECRVLLVATRDIAKGERLYYDYNGYE 362

BLAST of Cp4.1LG05g10640 vs. TrEMBL
Match: A0A061DIV3_THECC (Trithorax-related protein 5 isoform 1 OS=Theobroma cacao GN=TCM_001425 PE=4 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 2.8e-131
Identity = 250/369 (67.75%), Positives = 285/369 (77.24%), Query Frame = 1

Query: 14  SSAAAASQRLF----RCTASLRR-THAPHRSPSMSPPPRKLKLMAEIMVKAKYAVIERDD 73
           ++  AA++RL     R  A  RR + +P R PS SPP RKL+ +AEIM +A+YAV+ER D
Sbjct: 4   ATTVAAARRLVGLRRRTEAPPRRPSPSPPRRPSPSPPQRKLRPVAEIMARARYAVVERAD 63

Query: 74  YDDVRCEKCRSGDRDDELLLCDKCDKGFHMKC--------------------------FS 133
           Y DV CE+C SG+R DELLLCDKCDKGFHMKC                          FS
Sbjct: 64  YSDVGCEQCGSGERPDELLLCDKCDKGFHMKCLRPIMARVPIGSWLCPKCSGHRRVRSFS 123

Query: 134 QKKIIDFFRIQKCKDAESPYLSAQAI-KHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLKQ 193
           QKKIIDFFRIQK  D +  + S Q   K RRR RSLV  KKRRRLL F+PSEDP++RL Q
Sbjct: 124 QKKIIDFFRIQKSCDGKKKFTSNQDTRKRRRRSRSLVLLKKRRRLLPFIPSEDPNQRLNQ 183

Query: 194 MGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSRR 253
           MG+LA+ALT LQMEFSDDLTY PGMA RSANQA+ E+GGMQVLSKED+ET ELCR M+RR
Sbjct: 184 MGTLASALTALQMEFSDDLTYSPGMAPRSANQAKFENGGMQVLSKEDMETLELCRAMNRR 243

Query: 254 GECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKHP 313
           GEC PL+VVFD+CEG+TV+AD QIKDMTFIAEYTGDVDY+ NRE+DDCDS+MTLL A   
Sbjct: 244 GECPPLIVVFDSCEGYTVEADGQIKDMTFIAEYTGDVDYIKNRENDDCDSLMTLLLATDS 303

Query: 314 SRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGER 351
           S+SLVICPD  GNIARFINGINNHT EGKKKQNCKCVRY+V GECRV+LVA RDIAKGER
Sbjct: 304 SKSLVICPDKRGNIARFINGINNHTLEGKKKQNCKCVRYSVNGECRVLLVATRDIAKGER 363

BLAST of Cp4.1LG05g10640 vs. TrEMBL
Match: V4SV45_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025879mg PE=4 SV=1)

HSP 1 Score: 465.3 bits (1196), Expect = 6.6e-128
Identity = 243/362 (67.13%), Positives = 269/362 (74.31%), Query Frame = 1

Query: 16  AAAASQRLFRCTASLRRTHAPHRSPSMSPPPRKLKLMAEIMVKAKYAVIERDDYDDVRCE 75
           A  +S    R   S RRT AP R  S SPPP+K+K M EI+ KA YAV+ER DY DV CE
Sbjct: 4   ATTSSAEARRLIGSRRRTEAPRRMLSPSPPPKKVKSMEEILAKAHYAVVERGDYGDVGCE 63

Query: 76  KCRSGDRDDELLLCDKCDKGFHMKC--------------------------FSQKKIIDF 135
           +C SG+R +ELLLCDKCDKGFHMKC                          FSQ+KIIDF
Sbjct: 64  QCGSGERAEELLLCDKCDKGFHMKCLRPIVVRVPIGTWLCPKCSGQRRVRSFSQRKIIDF 123

Query: 136 FRIQKCKDAESPYLSAQAI-KHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLKQMGSLATA 195
           F+I+K    E    S Q   K RRR  SLV QKKRRRLL F PSED  +RL QMGSLA A
Sbjct: 124 FKIKKPNLPEEKCDSPQDTRKRRRRSASLVLQKKRRRLLPFTPSEDRSQRLSQMGSLAHA 183

Query: 196 LTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSRRGECAPLL 255
           LT LQMEFSDDLTY+PGMA RSANQAE E+GGMQVLSKED ET E CR M +RGEC PL+
Sbjct: 184 LTALQMEFSDDLTYMPGMAPRSANQAEFEEGGMQVLSKEDTETLEQCRAMCKRGECPPLV 243

Query: 256 VVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKHPSRSLVIC 315
           VV+D+CEGFTV+AD QIKDMTFIAEY GDVD++ NREHDDCDSMMTLL A  PS+SLVIC
Sbjct: 244 VVYDSCEGFTVEADGQIKDMTFIAEYIGDVDFIRNREHDDCDSMMTLLLATDPSKSLVIC 303

Query: 316 PDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGERLYYDYNG 351
           PD  GNIARFINGINN+T EG+KKQNCKCVRY+V GECRV LVA RDIAKGERLYYDYNG
Sbjct: 304 PDKRGNIARFINGINNYTLEGRKKQNCKCVRYSVNGECRVFLVATRDIAKGERLYYDYNG 363

BLAST of Cp4.1LG05g10640 vs. TAIR10
Match: AT5G09790.2 (AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5)

HSP 1 Score: 414.5 bits (1064), Expect = 6.7e-116
Identity = 221/374 (59.09%), Positives = 267/374 (71.39%), Query Frame = 1

Query: 10  SSSSSSAAAASQRLFRCTASLRRTHAPHRSPSM-SPPPRKLKLMAEIMVKAKYAVIER-- 69
           ++SS +A+  S R        RRT AP R PS  SPPPRK+K MAEIM K+   V +   
Sbjct: 5   NASSPAASPCSSR--------RRTKAPARRPSSESPPPRKMKSMAEIMAKSVPVVEQEEE 64

Query: 70  ---DDYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKCF---------------------- 129
              D Y +V CEKC SG+ DDELLLCDKCD+GFHMKC                       
Sbjct: 65  EDEDSYSNVTCEKCGSGEGDDELLLCDKCDRGFHMKCLRPIVVRVPIGTWLCVDCSDQRP 124

Query: 130 ----SQKKIIDFFRIQK-CKDAESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSEDPD 189
               SQKKI+ FFRI+K     +   LS +  + RRR  SL  +K+RR+LL  +PSEDPD
Sbjct: 125 VRRLSQKKILHFFRIEKHTHQTDKLELSQEETRKRRRSCSLTVKKRRRKLLPLVPSEDPD 184

Query: 190 RRLKQMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCR 249
           +RL QMG+LA+ALT L +++SD L Y+PGMA RSANQ++LE GGMQVL KED+ET E C+
Sbjct: 185 QRLAQMGTLASALTALGIKYSDGLNYVPGMAPRSANQSKLEKGGMQVLCKEDLETLEQCQ 244

Query: 250 TMSRRGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLL 309
           +M RRGEC PL+VVFD  EG+TV+AD  IKD+TFIAEYTGDVDYL NRE DDCDS+MTLL
Sbjct: 245 SMYRRGECPPLVVVFDPLEGYTVEADGPIKDLTFIAEYTGDVDYLKNREKDDCDSIMTLL 304

Query: 310 SAKHPSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDI 351
            ++ PS++LVICPD  GNI+RFINGINNH P  KKKQNCKCVRY++ GECRV+LVA RDI
Sbjct: 305 LSEDPSKTLVICPDKFGNISRFINGINNHNPVAKKKQNCKCVRYSINGECRVLLVATRDI 364

BLAST of Cp4.1LG05g10640 vs. TAIR10
Match: AT5G24330.1 (AT5G24330.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6)

HSP 1 Score: 342.8 bits (878), Expect = 2.5e-94
Identity = 181/312 (58.01%), Positives = 214/312 (68.59%), Query Frame = 1

Query: 68  DYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKCFS------------------------- 127
           D+D V CE+C SG +  +LLLCDKCDKGFH+ C                           
Sbjct: 30  DWDTV-CEECSSGKQPAKLLLCDKCDKGFHLFCLRPILVSVPKGSWFCPSCSKHQIPKSF 89

Query: 128 ---QKKIIDFFRIQKCKDAESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSEDPDRRL 187
              Q KIIDFFRI++  D+     S+ +I  +R+  SLV  KK+RRLL + PS DP RRL
Sbjct: 90  PLIQTKIIDFFRIKRSPDSSQISSSSDSIGKKRKKTSLVMSKKKRRLLPYNPSNDPQRRL 149

Query: 188 KQMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMS 247
           +QM SLATAL     +FS++LTY+ G A RSANQA  E GGMQVLSKE +ET  LC+ M 
Sbjct: 150 EQMASLATALRASNTKFSNELTYVSGKAPRSANQAAFEKGGMQVLSKEGVETLALCKKMM 209

Query: 248 RRGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHD-DCDSMMTLLSA 307
             GEC PL+VVFD  EGFTV+AD  IKD T I EY GDVDYL+NRE D D DSMMTLL A
Sbjct: 210 DLGECPPLMVVFDPYEGFTVEADRFIKDWTIITEYVGDVDYLSNREDDYDGDSMMTLLHA 269

Query: 308 KHPSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAK 351
             PS+ LVICPD   NIARFI+GINNH+PEG+KKQN KCVR+N+ GE RV+LVA RDI+K
Sbjct: 270 SDPSQCLVICPDRRSNIARFISGINNHSPEGRKKQNLKCVRFNINGEARVLLVANRDISK 329

BLAST of Cp4.1LG05g10640 vs. TAIR10
Match: AT1G77300.1 (AT1G77300.1 histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 specific))

HSP 1 Score: 48.9 bits (115), Expect = 7.4e-06
Identity = 32/118 (27.12%), Positives = 59/118 (50.00%), Query Frame = 1

Query: 235  EGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKH-----PSRSLVICP 294
            +G+ ++  + +++  F+ EY G+V  L  + ++           KH      + + VI  
Sbjct: 1036 KGYGLRLLEDVREGQFLIEYVGEV--LDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDA 1095

Query: 295  DTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGERLYYDYN 348
               GN+ RFIN    H+ E     NC+  ++ V GE  V + +++D+ KG+ L +DYN
Sbjct: 1096 GAKGNLGRFIN----HSCE----PNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYN 1143

BLAST of Cp4.1LG05g10640 vs. NCBI nr
Match: gi|659121970|ref|XP_008460909.1| (PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Cucumis melo])

HSP 1 Score: 590.1 bits (1520), Expect = 2.5e-165
Identity = 304/377 (80.64%), Positives = 315/377 (83.55%), Query Frame = 1

Query: 1   MTPALSSSSSSSSSSAAAASQRLFRCTASLRRTHAPHRSPSMSPPPRKLKLMAEIMVKAK 60
           MTPA SSSS        AASQRL RC+AS RRTHAPHR  SMSPP RKLK M EIM KAK
Sbjct: 1   MTPAFSSSS--------AASQRLIRCSASPRRTHAPHRPSSMSPPLRKLKSMTEIMAKAK 60

Query: 61  YAVIERDDYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKC-------------------- 120
           + V+ER+DYDDV CE+C SGDRDDELLLCDKCDKGFHMKC                    
Sbjct: 61  HVVLEREDYDDVSCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCSG 120

Query: 121 ------FSQKKIIDFFRIQKCKD-AESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSE 180
                 FSQKKIIDFFRIQKCKD  + PYLSAQAIK RRRLRSLVWQKKRRRLL FLPSE
Sbjct: 121 QRRVRSFSQKKIIDFFRIQKCKDDGDVPYLSAQAIKRRRRLRSLVWQKKRRRLLPFLPSE 180

Query: 181 DPDRRLKQMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFE 240
           DPDRRLKQMGSLATALTTLQMEFSDDLTY+PGMASRSANQAE EDGGMQVLSKED ET E
Sbjct: 181 DPDRRLKQMGSLATALTTLQMEFSDDLTYMPGMASRSANQAEFEDGGMQVLSKEDTETLE 240

Query: 241 LCRTMSRRGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMM 300
           LCR MSRRGEC PLLVVFD+CEGFTV+ADDQIKDMTFIAEYTGDVDYL NREHDDCDSMM
Sbjct: 241 LCRAMSRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMM 300

Query: 301 TLLSAKHPSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAI 351
           TLLS K PSRSLVICPD  GNIARFINGINNH+PEGKKKQNCKCVRYNV GECRVILVAI
Sbjct: 301 TLLSVKDPSRSLVICPDRRGNIARFINGINNHSPEGKKKQNCKCVRYNVNGECRVILVAI 360

BLAST of Cp4.1LG05g10640 vs. NCBI nr
Match: gi|778670199|ref|XP_011649397.1| (PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Cucumis sativus])

HSP 1 Score: 585.1 bits (1507), Expect = 8.1e-164
Identity = 303/377 (80.37%), Positives = 314/377 (83.29%), Query Frame = 1

Query: 1   MTPALSSSSSSSSSSAAAASQRLFRCTASLRRTHAPHRSPSMSPPPRKLKLMAEIMVKAK 60
           MTPA SSSS        AASQRL RC+AS RRTHAPHR  SMSPP RKLK M EIM KAK
Sbjct: 1   MTPAFSSSS--------AASQRLIRCSASPRRTHAPHRPSSMSPPLRKLKSMTEIMAKAK 60

Query: 61  YAVIERDDYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKC-------------------- 120
           + V+ER+DYDDV CE+C SGDRDDELLLCDKCDKGFHMKC                    
Sbjct: 61  HVVLEREDYDDVSCEECGSGDRDDELLLCDKCDKGFHMKCVSPIVVRVPIGSWLCPKCSG 120

Query: 121 ------FSQKKIIDFFRIQKCKD-AESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSE 180
                 FSQKKIIDFFRIQKCKD  +  YLSAQAIK RRRLRSLVWQKKRRRLL FLPSE
Sbjct: 121 QRRVRSFSQKKIIDFFRIQKCKDDGDVLYLSAQAIKRRRRLRSLVWQKKRRRLLPFLPSE 180

Query: 181 DPDRRLKQMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFE 240
           DPDRRLKQMGSLATALTTLQMEFSDDLTY PGMASRSANQAE EDGGMQVLSKED ET E
Sbjct: 181 DPDRRLKQMGSLATALTTLQMEFSDDLTYGPGMASRSANQAEFEDGGMQVLSKEDAETLE 240

Query: 241 LCRTMSRRGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMM 300
           LCR M+RRGEC PLLVVFD+CEGFTV+ADDQIKDMTFIAEYTGDVDYL NREHDDCDSMM
Sbjct: 241 LCRAMNRRGECPPLLVVFDSCEGFTVEADDQIKDMTFIAEYTGDVDYLKNREHDDCDSMM 300

Query: 301 TLLSAKHPSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAI 351
           TLLS K PSRSLVICPDT GNIARFINGINNH+PEGKKKQNCKCVRYNV GECRVILVAI
Sbjct: 301 TLLSVKDPSRSLVICPDTRGNIARFINGINNHSPEGKKKQNCKCVRYNVNGECRVILVAI 360

BLAST of Cp4.1LG05g10640 vs. NCBI nr
Match: gi|802726157|ref|XP_012086084.1| (PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Jatropha curcas])

HSP 1 Score: 485.3 bits (1248), Expect = 8.8e-134
Identity = 256/370 (69.19%), Positives = 285/370 (77.03%), Query Frame = 1

Query: 9   SSSSSSSAAAASQRLFRCTASLRRTHA-PHRSPSMSPPPRKLKLMAEIMVKAKYAVIERD 68
           +S++SS  AAA++R+     S RRT A P  SP  SPPP+KLK ++EI+ KAKYAV+ER 
Sbjct: 4   ASTTSSPGAAAARRII---GSRRRTKATPLPSPPESPPPKKLKPISEILAKAKYAVVERA 63

Query: 69  DYDDVRCEKCRSGDRDDELLLCDKCDKGFHMKC--------------------------F 128
           DY DV CE+C SG+R DELLLCDKCDKGFHMKC                           
Sbjct: 64  DYSDVSCEQCGSGERADELLLCDKCDKGFHMKCVRPIVARVPIGSWFCPKCSGQRRVRRL 123

Query: 129 SQKKIIDFFRIQKCKDAESPYLSAQAI-KHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLK 188
           SQKKIIDFFRIQKC   +    S Q   K RRR   LV+QKKRRRLL F+PSED   RLK
Sbjct: 124 SQKKIIDFFRIQKCNRKKDKCSSPQDTRKRRRRSGPLVYQKKRRRLLPFIPSEDAAERLK 183

Query: 189 QMGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSR 248
           QMG+LA+ALT LQMEFSD+LTYLP MA R+ANQAE E+GGMQVLSKEDIET E CR MSR
Sbjct: 184 QMGTLASALTALQMEFSDELTYLPDMAPRAANQAEFEEGGMQVLSKEDIETLEQCRAMSR 243

Query: 249 RGECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKH 308
           RGEC PLLVVFD+CEGFTV+AD QIKDMT IAEYTGDVDY+ NREHDDCDSMMTLL AK 
Sbjct: 244 RGECPPLLVVFDSCEGFTVKADSQIKDMTLIAEYTGDVDYIRNREHDDCDSMMTLLLAKD 303

Query: 309 PSRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGE 351
           P++SLVICPD  GNIARFINGINNHTP+GKKKQNCKCVRY+V GECRV LVA RDIAKGE
Sbjct: 304 PTKSLVICPDKRGNIARFINGINNHTPDGKKKQNCKCVRYSVNGECRVFLVATRDIAKGE 363

BLAST of Cp4.1LG05g10640 vs. NCBI nr
Match: gi|823140887|ref|XP_012470268.1| (PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 isoform X3 [Gossypium raimondii])

HSP 1 Score: 476.9 bits (1226), Expect = 3.1e-131
Identity = 241/352 (68.47%), Positives = 276/352 (78.41%), Query Frame = 1

Query: 25  RCTASLRRTHAPHRSPSMSPPPRKLKLMAEIMVKAKYAVIERDDYDDVRCEKCRSGDRDD 84
           R  +S RRT AP R PS S PP+KL+ M+EIM +AKYAV+ER DY D+ CE+C SG+R  
Sbjct: 12  RLVSSRRRTEAPRRRPSPSTPPKKLRPMSEIMARAKYAVVERADYSDIICEQCGSGERPG 71

Query: 85  ELLLCDKCDKGFHMKC--------------------------FSQKKIIDFFRIQKCKDA 144
           ELLLCDKCDKGFHM+C                          FSQK+IIDFF+IQK  D 
Sbjct: 72  ELLLCDKCDKGFHMRCLRPIVVRIPIGSWLCPKCSGHRRVRTFSQKRIIDFFKIQKSGDG 131

Query: 145 ESPYLSAQAIKHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLKQMGSLATALTTLQMEFSD 204
           +     +Q  + RRR R LV  KKRRRLL F+PSEDP++RLKQMGSLA+ALT +QMEFSD
Sbjct: 132 KKKCNLSQDTRKRRR-RPLVLLKKRRRLLPFIPSEDPNQRLKQMGSLASALTAMQMEFSD 191

Query: 205 DLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSRRGECAPLLVVFDTCEGFT 264
           DLTY   MA RSANQA+ E+GGMQVLS+ED+ET ELCR+MSRRGEC P +VVFD+CEG+T
Sbjct: 192 DLTYSSDMAPRSANQAKFENGGMQVLSREDMETLELCRSMSRRGECPPFIVVFDSCEGYT 251

Query: 265 VQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKHPSRSLVICPDTHGNIARF 324
           V+AD QIKDMTFIAEYTGDVDY+ NRE+DDCDSMMTLL A +PS SLVICPD  GNIARF
Sbjct: 252 VEADAQIKDMTFIAEYTGDVDYIKNRENDDCDSMMTLLLATNPSESLVICPDKCGNIARF 311

Query: 325 INGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGERLYYDYNGYD 351
           INGINNHTPEGKKKQNCKCVRY+V GECRV+LVA RDIAKGERLYYDYNGY+
Sbjct: 312 INGINNHTPEGKKKQNCKCVRYSVNGECRVLLVATRDIAKGERLYYDYNGYE 362

BLAST of Cp4.1LG05g10640 vs. NCBI nr
Match: gi|590708583|ref|XP_007048318.1| (Trithorax-related protein 5 isoform 1 [Theobroma cacao])

HSP 1 Score: 476.5 bits (1225), Expect = 4.1e-131
Identity = 250/369 (67.75%), Positives = 285/369 (77.24%), Query Frame = 1

Query: 14  SSAAAASQRLF----RCTASLRR-THAPHRSPSMSPPPRKLKLMAEIMVKAKYAVIERDD 73
           ++  AA++RL     R  A  RR + +P R PS SPP RKL+ +AEIM +A+YAV+ER D
Sbjct: 4   ATTVAAARRLVGLRRRTEAPPRRPSPSPPRRPSPSPPQRKLRPVAEIMARARYAVVERAD 63

Query: 74  YDDVRCEKCRSGDRDDELLLCDKCDKGFHMKC--------------------------FS 133
           Y DV CE+C SG+R DELLLCDKCDKGFHMKC                          FS
Sbjct: 64  YSDVGCEQCGSGERPDELLLCDKCDKGFHMKCLRPIMARVPIGSWLCPKCSGHRRVRSFS 123

Query: 134 QKKIIDFFRIQKCKDAESPYLSAQAI-KHRRRLRSLVWQKKRRRLLSFLPSEDPDRRLKQ 193
           QKKIIDFFRIQK  D +  + S Q   K RRR RSLV  KKRRRLL F+PSEDP++RL Q
Sbjct: 124 QKKIIDFFRIQKSCDGKKKFTSNQDTRKRRRRSRSLVLLKKRRRLLPFIPSEDPNQRLNQ 183

Query: 194 MGSLATALTTLQMEFSDDLTYLPGMASRSANQAELEDGGMQVLSKEDIETFELCRTMSRR 253
           MG+LA+ALT LQMEFSDDLTY PGMA RSANQA+ E+GGMQVLSKED+ET ELCR M+RR
Sbjct: 184 MGTLASALTALQMEFSDDLTYSPGMAPRSANQAKFENGGMQVLSKEDMETLELCRAMNRR 243

Query: 254 GECAPLLVVFDTCEGFTVQADDQIKDMTFIAEYTGDVDYLTNREHDDCDSMMTLLSAKHP 313
           GEC PL+VVFD+CEG+TV+AD QIKDMTFIAEYTGDVDY+ NRE+DDCDS+MTLL A   
Sbjct: 244 GECPPLIVVFDSCEGYTVEADGQIKDMTFIAEYTGDVDYIKNRENDDCDSLMTLLLATDS 303

Query: 314 SRSLVICPDTHGNIARFINGINNHTPEGKKKQNCKCVRYNVKGECRVILVAIRDIAKGER 351
           S+SLVICPD  GNIARFINGINNHT EGKKKQNCKCVRY+V GECRV+LVA RDIAKGER
Sbjct: 304 SKSLVICPDKRGNIARFINGINNHTLEGKKKQNCKCVRYSVNGECRVLLVATRDIAKGER 363

The following BLAST results are available for this feature:

Swiss-Prot:
Match Name   E-value   Identity (%)  Description
ATXR5_RICCO  3.5e-130  68.56         Probable Histone-lysine N-methyltransferase ATXR5 OS=Ricinus communis GN=ATXR5 P...
ATXR5_ARATH  1.2e-114  59.09         Histone-lysine N-methyltransferase ATXR5 OS=Arabidopsis thaliana GN=ATXR5 PE=1 S...
ATXR6_ARATH  4.4e-93   58.01         Histone-lysine N-methyltransferase ATXR6 OS=Arabidopsis thaliana GN=ATXR6 PE=1 S...

TrEMBL:
Match Name        E-value   Identity (%)  Description
A0A0A0LM82_CUCSA  5.7e-164  80.37         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G302110 PE=4 SV=1
A0A067JQG3_JATCU  6.1e-134  69.19         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22238 PE=4 SV=1
A0A0D2NXD1_GOSRA  2.2e-131  68.47         Uncharacterized protein OS=Gossypium raimondii GN=B456_003G068200 PE=4 SV=1
A0A061DIV3_THECC  2.8e-131  67.75         Trithorax-related protein 5 isoform 1 OS=Theobroma cacao GN=TCM_001425 PE=4 SV=1
V4SV45_9ROSI      6.6e-128  67.13         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025879mg PE=4 SV=1

TAIR10:
Match Name   E-value   Identity (%)  Description
AT5G09790.2  6.7e-116  59.09         ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5
AT5G24330.1  2.5e-94   58.01         ARABIDOPSIS TRITHORAX-RELATED PROTEIN 6
AT1G77300.1  7.4e-06   27.12         histone methyltransferases(H3-K4 specific);histone methyltransferase...

NCBI nr:
Match Name                        E-value   Identity (%)  Description
gi|659121970|ref|XP_008460909.1|  2.5e-165  80.64         PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Cucumis melo]
gi|778670199|ref|XP_011649397.1|  8.1e-164  80.37         PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Cucumis sativus]
gi|802726157|ref|XP_012086084.1|  8.8e-134  69.19         PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 [Jatropha curcas]
gi|823140887|ref|XP_012470268.1|  3.1e-131  68.47         PREDICTED: probable Histone-lysine N-methyltransferase ATXR5 isoform X3 [Gossypi...
gi|590708583|ref|XP_007048318.1|  4.1e-131  67.75         Trithorax-related protein 5 isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:

Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding

Vocabulary: INTERPRO
Term       Definition
IPR019787  Znf_PHD-finger
IPR013083  Znf_RING/FYVE/PHD
IPR011011  Znf_FYVE_PHD
IPR001214  SET_dom

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006325 chromatin organization
biological_process GO:0032259 methylation
biological_process GO:0008150 biological_process
biological_process GO:0006554 lysine catabolic process
biological_process GO:0034968 histone lysine methylation
biological_process GO:0009294 DNA mediated transformation
biological_process GO:0070734 histone H3-K27 methylation
biological_process GO:0009555 pollen development
biological_process GO:0006275 regulation of DNA replication
cellular_component GO:0005575 cellular_component
cellular_component GO:0009507 chloroplast
cellular_component GO:0005634 nucleus
cellular_component GO:0009536 plastid
molecular_function GO:0046976 histone methyltransferase activity (H3-K27 specific)
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding
molecular_function GO:0018024 histone-lysine N-methyltransferase activity
molecular_function GO:0008168 methyltransferase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG05g10640.1  Cp4.1LG05g10640.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                  Source   Source Term         Source Description                        Alignment
IPR001214  SET domain                       PFAM     PF00856             SET                                       coord: 265..347, score: 3.
IPR001214  SET domain                       SMART    SM00317             set_7                                     coord: 225..351, score: 0.
IPR001214  SET domain                       PROFILE  PS50280             SET                                       coord: 210..347, score: 14
IPR011011  Zinc finger, FYVE/PHD-type       unknown  SSF57903            FYVE/PHD zinc finger                      coord: 44..103, score: 2.67
IPR013083  Zinc finger, RING/FYVE/PHD-type  GENE3D   G3DSA:3.30.40.10                                              coord: 65..97, score: 1.
IPR019787  Zinc finger, PHD-finger          PFAM     PF00628             PHD                                       coord: 73..104, score: 4.
None       No IPR available                 GENE3D   G3DSA:2.170.270.10                                            coord: 213..347, score: 2.7E-18; coord: 98..122, score: 2.7
None       No IPR available                 PANTHER  PTHR10615           HISTONE ACETYLTRANSFERASE                 coord: 52..116, score: 5.0E-171; coord: 140..350, score: 5.0E
None       No IPR available                 PANTHER  PTHR10615:SF112     HISTONE-LYSINE N-METHYLTRANSFERASE ATXR5  coord: 52..116, score: 5.0E-171; coord: 140..350, score: 5.0E
None       No IPR available                 unknown  SSF82199            SET domain                                coord: 219..347, score: 8.67