Cp4.1LG17g00020 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG17g00020
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DNA methyltransferase, putative
Location: Cp4.1LG17 : 488169 .. 493746 (-)
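
The location field can be read as a sequence identifier, a start, an end, and a strand. The following is a minimal sketch of parsing it, assuming the `SEQID : START .. END (STRAND)` layout shown on this page and assuming the coordinates are 1-based and inclusive (neither is stated here). The function names are hypothetical.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts.

    The layout is assumed from this page, not taken from a format spec.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG17 : 488169 .. 493746 (-)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG17 - 5578
```

Under those assumptions the gene spans 5578 bases on the minus strand of Cp4.1LG17.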
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTGACTATCCGAACAAATTGTTGTTTACAGAGCCACAGAAAACTCAGCTTACTCTCCTGAAGGTGAAGCTTAAAGCTACCTCAGCCCTAGTCCTAGCAACCACTACTTCAAAACTTGATGGTATGTACACTAATGAGATACTATGCTGAATCATTGGAAACTTAAATGCATGTGCATCAACATAAACGCGCTCTTATTTCTGTTCCTTAGGCCAGCAATAAGCCTATTGTACCAAAGGAAGAGGTCTTGGATTTCAGGTTGCCACCTGATGGGCTGTACTCAAGGCATGTCGGGGTACGTACAAGTTATAACTCTAACCTTATTGTTAAAATAAATTGCCGAAGTATCTGTACAAGGTCTTCACACTTGTGTTTACAATATATCAAGTTCTGTTATTGTCATTTTGGACCTACCATCCATTTGGAAAAAAATAAAAAATAAAAAAAAAATACAATTTTCATTGTGAAAAGTAAGATGTTATTGCCTAGGAGTATGGATTGAATATTTGGAGTATGGCATTTATTATATTTCCTAGTGTGGAAAAAGTTAAAAGTAACGGATTTGCTCAAATTTTCAGTTCTTATAAATTTCAGGACAGTGGTGCAAGCTCATCTGGAAACAATATAAGAACTTTCTTTGTAGATATGGGGTTCCTGCCATCTCTTGTTGATAAAGTGATCGAAGAAAAAGGTACATTTATTCCTTTTAATTTTTAGGATACAATGCGTTCTTCATGAATGAGGGCACTGAATTATGTTATTTACCTTTTCTCTTTAGAAAACTGAGAAGAGAGAACCTTATCCTTTAAAGTTGGCCTTCGAATTTTCTCCATCACAAGGAAAAGTTTCATTAGAAATGTAATTATAACCTTGAAATGTCCAATTTGTATAGAATATCCTAATAATCGCCGCTAATATTGTGTCTCATCTTAAATCCAGGTGAAGATGATGTAGAATTGCTATTAAATACCCTGACTACATTTTCGGTAAGTAGCCCAGATAGCCTGCCTGCGTGCCTGTCTCATCGTAAAAAGAATAATGGTCTTATTGAAATTTCCAGTCTTAATAAACTGGTTTTGACTGCCAGCATTGTAATCAATTGTTGCTGTTGTTAACAACTTCTTATATTTGTCAACTTTTTGTGGTTACTGGTTTTATGTCTTAGCAATAAAGCTTCATTGGAAAAGTGAATGAGTTGCTTTAATTGACTAGGCAGAACAAAAATCACTTCCTGAGTCAACAGGCTCTCTTCACAGTGTACGAAGTGATAAAAAGGGTACTAATCCTATTGCTGCATTCTGTTATAAGCAGGTATGCTTTCGTTCGTTGAAGTGTTGTAAATTTTCATTTTTGACCTTGATATATAAAGCTGCCTTTGGATTGGCTGGACAGCCACGGGGTGTACCATAAAAAATTAAAAGAAAAATCTGTTGCATTTGGCTCTTCTCTCGTTTAGAAATTAACTATATTTGAATTGTATGCTTTGGAAGTTCGTGTAATTTATTTATTGTAATAGGCTGGCCGGACATCAAAACCCGAATCATCAGATTCTCTAGATAGCTTATTTGATGACAAGGAGGATGCAAGCAATGAAATCTCTTCAGTTGTTATACCGAAGGAGGTAACCTAATTCCCACTCCCACGTGGAATTCAATGGATTAAATAGGACAAGAGGATCAAAATATCTTCTTCCCTCATAGATTACTATTTCTTCTTCTCTTGTTTATGTGTGTGTGCATGTGTGCACGTGTGTAAGCGTTTGTTTTTCAGTATAGAAGCTTCAAGTAATTTCTTTTACCGCCCAGCTGATCGTGTCCATTGCAGGAGGCTGATGATGATTATATTAGTACTCGTACCAATAAAGCTTCCTTATTAATGATGAACTTCAGTGTTGATGAAGTAAACTTTGCAATTGATAAGCTTGGTTAGTACATTTCTGGAATAGTGAACTTGATTGAAGTTTTGCCCAATATTTATGCAGCCATGAGGATTTATTAT
AACTCATTTTCAGGTGGAGATGCTCCCATTAACGAGTTGGTGGATTTTATCGTCGCCGCACAGATTGCTGAAAACTTGGAGAAGGAAACAAATGAAACAATCTGTAGAAATGAATTGAAAACGGAGGTTTGTTTTTCATAATTCAAGTTCATGTGTCAATCTTATTTTAGTAGTGTTGTATGTTTACTTCAGCGATTTTAGTTAACGAGTTGACCAAGTTGATTACCATTTTACATTATCCAACTTTTCTTGGTCGGTCTACTTTTCGTATTGTGCTCACCATATATAGTACTGGCATAATGCCTAGTCTTGTATTTGTTGTTATTGATGTTAAGTCTAAAATGTTCCTCAGGAAAATGATGAAACCTTGTTTGTGACTATGGAGAAAACACTTCGCTTGCTTGAAATGGGTTTCTCTGAGAATGAGGTTTCTTTGGCAATTGAGAAGTTCGGTAAGCAAACCGCTCCTAACTCCTATTAGCAGCCGACATAGAAAGGGATTAGTGTAGTCATCTCATTGTCTAATCATTATAAATTTTTTGTTTGACCGATTGGTTTATTCTGCTCTAGGAGGCTAATTTTTCATCCAATTTCATCTTTGATTTGGTGTGAGAACTTCTTTAAGCTTTTTCATGGGTAATGTGGAGACCTTAACTAGCATCGATCCTTCAGGTTCGGACACTCAAGTTTCAGAGCTTGCTGATTCCATTGTTACGGGTCGAATAGCTGGTGACTACCCTGGGAATGATAAGGTTCTTACACTTGAACTTTGCTCGTCGAACTATTTGTACTAAGCCTGGTTATGTGTTCACATCAATTGTCCAAAATTATTCAAATACTTGCACCTTTTCTTTTTCTAGGAAATAAAATTTACACTAATGGATTAAAGAAGACAAATGCATCCTCTCAACTCTATCCCAAACTTTGTTGACCTCTTCAGCATCCCAACATGAGTCAAACCATACATTTCCCAAGAGCTTGCAATGAGTACCAAACTTTAGAAAGAAGATATTTGATTGACTTTTTTTAAAAAAATATATATATTGTTCCTTCCCTATAAAAAAATCGAAACCTCAAGTCGGTGCCACACCTTTCTAAAATAGCGTTTGTTAACTTTCTTTGAATTCAATAGTTCTATTCAACACTTTGATAAAATATTATTGGGGAGAATCTGCTAGTAGAGACTAACTTTTTTATCTTAATTTTTTTTGGATTCTATGTTCTTGTGCCCCTTCTTTTGTGATTGATTCTTGAATTTTGTGGTCAAGCATCTGGTACTACCAACGTCTATTTATTTTATTTTCATGGCTTTGTGCAGTGTTCCTCGAACTCATTCTATATTGGTGGTTTACATAATCCAAAAGTTAAAGCTGAGGATTCGAGTTCTCCTGGAGTTTCTCTCTCAAGGAATGTTAACGTTGAGGAAATATTGAAGGGGAAAAGGCCGAAAGAAGAATATATGGATGACCTTCCAAATCCGATTCCTCGTTTCGATGCTAAACACAAAGGGAAACGGCCAAAGCCAGAATATGCCGATGACGACTTGAGTTCTCTCTACGGTCCTGAATGGTTGGAAGCAAAAGTAAATCCAAAAATCGTCGGACTTGAGATGCCCACATCTTCAGAACTAAATCTCTCAAGAAGTCTTGATAAGATGGTGGCTAGACCTCCATCCCCACCTTCAAAGTTTAATCCTTGTAGAAGTCTTGATAAGGTGGTGGCTAAACCTCCATTTTTCTTGTATGGAAATGTTTTGGATGTATCTCGTGATTCTTGGGGAAAAGTTTCCAAGTTCCTTTATACCATTGAGCCCGAATTTGTGGACACTCAGTCGTTCTCAGCATTGAGTAGAAGGGAAGGCTATGTGCACAATCTTCCATGTGAAAACAGGTCTCACATCCTCCCAAAGCCTGCAATGACTATCGAAGATGCTATACCACATACAAAGAAATGGTGGCCTTCCTGGGATACAAGGAAGCATTTGAGCTGCATCAATTCTG
AAACTAGAGGAGTACCTCAGCTGTGTGAAAGGCTTACAAAGATGATGACCGATTCTCATGGTCAGCTCTCGTCTCAACAGCAGAGAGATATTCTTCATCATTGCATAGCTTTGAACCTTATCTGGGTTGGCCAGTTCAAACTGGCTCCTTTAGAGCCTGAACAGTTGGAGTATGTGCTTGGGTACCCCGTAAATCACACTCAAGATGCTGAAAGTAGCTCAATGGAAAGGCTTCAAATTCTCAAATATTGTTTTCAGATAGACACTTTGGGTTATCATCTCTCTGTTTTGAAGTCTATGTTCCCAGAAGGGTTGATTGTGTTGTCCATTTTCAGTGGAATTGGTGGGGCGGAAATTGCTTTGCACCGTCTTGGCATTCATCTGAAAGTTGTGATATCAGTCGAGAGTTCGGCAGCAAAGAGGAGGATTTTGCAGAAATGGTGGCGCAGTAGTGGACAAACTGGGGAGCTGGAGCTGATAGAGGACATACAGAAACTAACAAGCAACAAGATTCACAATTTGATTAAGAAATATGGCGGTTTTGATTTGGTCGTTTGTCAGAATCCTTGTTCTCGTTCTTTGTCGTCTTCAAAACTGAGCAAAGACACCGAAGGTATAGCAAGTTTCGATTTCTCTATATTCTACGAGTTTGTTCGTGTTCTTCAGGGTGTAAGGAACACCATGGAAAGAAAGAAGTGACAATGTTCACTTCATCTGCTTGTTCTGTCTGATAATATGAAATTCACAGCTCAATGTCGAAACTATATTATATTGTAATATTGCCTTGACTCACTCTTGTGCTTGTTGATTACACATCCTACTTCTCTAACATGAGACATAAAGGTGCATATGGTTGTATAAATTACTCAAGGTACATAGTAGACGGCCAAAAAGAGAGGCCATCATCTATATTCTACAGTCTACAATAGGGCATCACATCGCTCAATGTGGGAGGATTAAAAATGAACCCAGAAAGGGATGGAGTTGGAGCCTAAGCGACAATAATCAATACCCAGATACAGAAAATGGCACTTGGCTGGTGTGCTGTGGTGGCTGTATACGGTATTATTATTGTACTTCAGCAGCTATATTCATAGGTCTCTGCGACATTCTGAGGTGCTCCTTCTTGCAAGCAATCAATGGGAGGAATGGTCAAAGTCTCCTGAATCCCAAATGCATTCCAACAGAAAGGATCTTCCACCAATTCACCCTCATGTGGTAGCAATTCTACTTCAGACATTGTAATATTCGTGCAGGCCACTGTGTCGCTGCACGCGAAGTGAATTGGAGTGTTCCTAACGTCGTAAGTTCCCTTGATGTTCTTGTACAAGACTTGGTTCACAAACACAGCCGATGTCTGGTTACGACAATCCTTGGACAAGCAGTAGTACTGGTCAACTATGATGCAGTTTCTAACATTCTCCATCTGTATGTTCTCGAATAAGATGTCGCTCACTGTACCCGACCCTCCTTGCCATGTCTTTATCCTCACTCCATTGTCTGATTCCCTAATCACTGCATTTCTCACTGTTATGTTGGAAACGCATGCCTGTGAATTGTGGACTCCCAGACTCCCAAT

mRNA sequence

TTTTTGACTATCCGAACAAATTGTTGTTTACAGAGCCACAGAAAACTCAGCTTACTCTCCTGAAGGTGAAGCTTAAAGCTACCTCAGCCCTAGTCCTAGCAACCACTACTTCAAAACTTGATGGCCAGCAATAAGCCTATTGTACCAAAGGAAGAGGTCTTGGATTTCAGGTTGCCACCTGATGGGCTGTACTCAAGGCATGTCGGGGACAGTGGTGCAAGCTCATCTGGAAACAATATAAGAACTTTCTTTGTAGATATGGGGTTCCTGCCATCTCTTGTTGATAAAGTGATCGAAGAAAAAGGTGAAGATGATGTAGAATTGCTATTAAATACCCTGACTACATTTTCGGCAGAACAAAAATCACTTCCTGAGTCAACAGGCTCTCTTCACAGTGTACGAAGTGATAAAAAGGGTACTAATCCTATTGCTGCATTCTGTTATAAGCAGGCTGGCCGGACATCAAAACCCGAATCATCAGATTCTCTAGATAGCTTATTTGATGACAAGGAGGATGCAAGCAATGAAATCTCTTCAGTTGTTATACCGAAGGAGGAGGCTGATGATGATTATATTAGTACTCGTACCAATAAAGCTTCCTTATTAATGATGAACTTCAGTGTTGATGAAGTAAACTTTGCAATTGATAAGCTTGGTGGAGATGCTCCCATTAACGAGTTGGTGGATTTTATCGTCGCCGCACAGATTGCTGAAAACTTGGAGAAGGAAACAAATGAAACAATCTGTAGAAATGAATTGAAAACGGAGGAAAATGATGAAACCTTGTTTGTGACTATGGAGAAAACACTTCGCTTGCTTGAAATGGGTTTCTCTGAGAATGAGGTTTCTTTGGCAATTGAGAAGTTCGGTTCGGACACTCAAGTTTCAGAGCTTGCTGATTCCATTGTTACGGGTCGAATAGCTGGTGACTACCCTGGGAATGATAAGTGTTCCTCGAACTCATTCTATATTGGTGGTTTACATAATCCAAAAGTTAAAGCTGAGGATTCGAGTTCTCCTGGAGTTTCTCTCTCAAGGAATGTTAACGTTGAGGAAATATTGAAGGGGAAAAGGCCGAAAGAAGAATATATGGATGACCTTCCAAATCCGATTCCTCGTTTCGATGCTAAACACAAAGGGAAACGGCCAAAGCCAGAATATGCCGATGACGACTTGAGTTCTCTCTACGGTCCTGAATGGTTGGAAGCAAAAGTAAATCCAAAAATCGTCGGACTTGAGATGCCCACATCTTCAGAACTAAATCTCTCAAGAAGTCTTGATAAGATGGTGGCTAGACCTCCATCCCCACCTTCAAAGTTTAATCCTTGTAGAAGTCTTGATAAGGTGGTGGCTAAACCTCCATTTTTCTTGTATGGAAATGTTTTGGATGTATCTCGTGATTCTTGGGGAAAAGTTTCCAAGTTCCTTTATACCATTGAGCCCGAATTTGTGGACACTCAGTCGTTCTCAGCATTGAGTAGAAGGGAAGGCTATGTGCACAATCTTCCATGTGAAAACAGGTCTCACATCCTCCCAAAGCCTGCAATGACTATCGAAGATGCTATACCACATACAAAGAAATGGTGGCCTTCCTGGGATACAAGGAAGCATTTGAGCTGCATCAATTCTGAAACTAGAGGAGTACCTCAGCTGTGTGAAAGGCTTACAAAGATGATGACCGATTCTCATGGTCAGCTCTCGTCTCAACAGCAGAGAGATATTCTTCATCATTGCATAGCTTTGAACCTTATCTGGGTTGGCCAGTTCAAACTGGCTCCTTTAGAGCCTGAACAGTTGGAGTATGTGCTTGGGTACCCCGTAAATCACACTCAAGATGCTGAAAGTAGCTCAATGGAAAGGCTTCAAATTCTCAAATATTGTTTTCAGATAGACACTTTGGGTTATCATCTCTCTGTTTTGAAGTCTATGTTCCCAGAAGGGTTGATTGTGTTGTCCATTTTCAGTGGAATTGGTGGGGCGGAAATTGCTTTGCACCGTCT
TGGCATTCATCTGAAAGTTGTGATATCAGTCGAGAGTTCGGCAGCAAAGAGGAGGATTTTGCAGAAATGGTGGCGCAGTAGTGGACAAACTGGGGAGCTGGAGCTGATAGAGGACATACAGAAACTAACAAGCAACAAGATTCACAATTTGATTAAGAAATATGGCGGTTTTGATTTGGTCGTTTGTCAGAATCCTTGTTCTCGTTCTTTGTCGTCTTCAAAACTGAGCAAAGACACCGAAGGTATAGCAAGTTTCGATTTCTCTATATTCTACGAGTTTGTTCGTGTTCTTCAGGGTGTAAGGAACACCATGGAAAGAAAGAAGTGACAATGTTCACTTCATCTGCTTGTTCTGTCTGATAATATGAAATTCACAGCTCAATGTCGAAACTATATTATATTGTAATATTGCCTTGACTCACTCTTGTGCTTGTTGATTACACATCCTACTTCTCTAACATGAGACATAAAGGTGCATATGGTTGTATAAATTACTCAAGGTACATAGTAGACGGCCAAAAAGAGAGGCCATCATCTATATTCTACAGTCTACAATAGGGCATCACATCGCTCAATGTGGGAGGATTAAAAATGAACCCAGAAAGGGATGGAGTTGGAGCCTAAGCGACAATAATCAATACCCAGATACAGAAAATGGCACTTGGCTGGTGTGCTGTGGTGGCTGTATACGGTATTATTATTGTACTTCAGCAGCTATATTCATAGGTCTCTGCGACATTCTGAGGTGCTCCTTCTTGCAAGCAATCAATGGGAGGAATGGTCAAAGTCTCCTGAATCCCAAATGCATTCCAACAGAAAGGATCTTCCACCAATTCACCCTCATGTGGTAGCAATTCTACTTCAGACATTGTAATATTCGTGCAGGCCACTGTGTCGCTGCACGCGAAGTGAATTGGAGTGTTCCTAACGTCGTAAGTTCCCTTGATGTTCTTGTACAAGACTTGGTTCACAAACACAGCCGATGTCTGGTTACGACAATCCTTGGACAAGCAGTAGTACTGGTCAACTATGATGCAGTTTCTAACATTCTCCATCTGTATGTTCTCGAATAAGATGTCGCTCACTGTACCCGACCCTCCTTGCCATGTCTTTATCCTCACTCCATTGTCTGATTCCCTAATCACTGCATTTCTCACTGTTATGTTGGAAACGCATGCCTGTGAATTGTGGACTCCCAGACTCCCAAT

Coding sequence (CDS)

ATGGCCAGCAATAAGCCTATTGTACCAAAGGAAGAGGTCTTGGATTTCAGGTTGCCACCTGATGGGCTGTACTCAAGGCATGTCGGGGACAGTGGTGCAAGCTCATCTGGAAACAATATAAGAACTTTCTTTGTAGATATGGGGTTCCTGCCATCTCTTGTTGATAAAGTGATCGAAGAAAAAGGTGAAGATGATGTAGAATTGCTATTAAATACCCTGACTACATTTTCGGCAGAACAAAAATCACTTCCTGAGTCAACAGGCTCTCTTCACAGTGTACGAAGTGATAAAAAGGGTACTAATCCTATTGCTGCATTCTGTTATAAGCAGGCTGGCCGGACATCAAAACCCGAATCATCAGATTCTCTAGATAGCTTATTTGATGACAAGGAGGATGCAAGCAATGAAATCTCTTCAGTTGTTATACCGAAGGAGGAGGCTGATGATGATTATATTAGTACTCGTACCAATAAAGCTTCCTTATTAATGATGAACTTCAGTGTTGATGAAGTAAACTTTGCAATTGATAAGCTTGGTGGAGATGCTCCCATTAACGAGTTGGTGGATTTTATCGTCGCCGCACAGATTGCTGAAAACTTGGAGAAGGAAACAAATGAAACAATCTGTAGAAATGAATTGAAAACGGAGGAAAATGATGAAACCTTGTTTGTGACTATGGAGAAAACACTTCGCTTGCTTGAAATGGGTTTCTCTGAGAATGAGGTTTCTTTGGCAATTGAGAAGTTCGGTTCGGACACTCAAGTTTCAGAGCTTGCTGATTCCATTGTTACGGGTCGAATAGCTGGTGACTACCCTGGGAATGATAAGTGTTCCTCGAACTCATTCTATATTGGTGGTTTACATAATCCAAAAGTTAAAGCTGAGGATTCGAGTTCTCCTGGAGTTTCTCTCTCAAGGAATGTTAACGTTGAGGAAATATTGAAGGGGAAAAGGCCGAAAGAAGAATATATGGATGACCTTCCAAATCCGATTCCTCGTTTCGATGCTAAACACAAAGGGAAACGGCCAAAGCCAGAATATGCCGATGACGACTTGAGTTCTCTCTACGGTCCTGAATGGTTGGAAGCAAAAGTAAATCCAAAAATCGTCGGACTTGAGATGCCCACATCTTCAGAACTAAATCTCTCAAGAAGTCTTGATAAGATGGTGGCTAGACCTCCATCCCCACCTTCAAAGTTTAATCCTTGTAGAAGTCTTGATAAGGTGGTGGCTAAACCTCCATTTTTCTTGTATGGAAATGTTTTGGATGTATCTCGTGATTCTTGGGGAAAAGTTTCCAAGTTCCTTTATACCATTGAGCCCGAATTTGTGGACACTCAGTCGTTCTCAGCATTGAGTAGAAGGGAAGGCTATGTGCACAATCTTCCATGTGAAAACAGGTCTCACATCCTCCCAAAGCCTGCAATGACTATCGAAGATGCTATACCACATACAAAGAAATGGTGGCCTTCCTGGGATACAAGGAAGCATTTGAGCTGCATCAATTCTGAAACTAGAGGAGTACCTCAGCTGTGTGAAAGGCTTACAAAGATGATGACCGATTCTCATGGTCAGCTCTCGTCTCAACAGCAGAGAGATATTCTTCATCATTGCATAGCTTTGAACCTTATCTGGGTTGGCCAGTTCAAACTGGCTCCTTTAGAGCCTGAACAGTTGGAGTATGTGCTTGGGTACCCCGTAAATCACACTCAAGATGCTGAAAGTAGCTCAATGGAAAGGCTTCAAATTCTCAAATATTGTTTTCAGATAGACACTTTGGGTTATCATCTCTCTGTTTTGAAGTCTATGTTCCCAGAAGGGTTGATTGTGTTGTCCATTTTCAGTGGAATTGGTGGGGCGGAAATTGCTTTGCACCGTCTTGGCATTCATCTGAAAGTTGTGATATCAGTCGAGAGTTCGGCAGCAAAGAGGAGGATTTTGCAGAAATGGTGGCGCAGTAGTGGACAAACTGGGGAGCTGGAGCTGATAGAGGACATACA
GAAACTAACAAGCAACAAGATTCACAATTTGATTAAGAAATATGGCGGTTTTGATTTGGTCGTTTGTCAGAATCCTTGTTCTCGTTCTTTGTCGTCTTCAAAACTGAGCAAAGACACCGAAGGTATAGCAAGTTTCGATTTCTCTATATTCTACGAGTTTGTTCGTGTTCTTCAGGGTGTAAGGAACACCATGGAAAGAAAGAAGTGA

Protein sequence

MASNKPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEEKGEDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNPIAAFCYKQAGRTSKPESSDSLDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDKLGGDAPINELVDFIVAAQIAENLEKETNETICRNELKTEENDETLFVTMEKTLRLLEMGFSENEVSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHNPKVKAEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRFDAKHKGKRPKPEYADDDLSSLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVAKPPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILPKPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQRDILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDTLGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWRSSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKLSKDTEGIASFDFSIFYEFVRVLQGVRNTMERKK
BLAST of Cp4.1LG17g00020 vs. Swiss-Prot
Match: DRM2_ARATH (DNA (cytosine-5)-methyltransferase DRM2 OS=Arabidopsis thaliana GN=DRM2 PE=1 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 2.6e-79
Identity = 148/330 (44.85%), Positives = 208/330 (63.03%), Query Frame = 1

Query: 404 RSLDKVVAKPPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLP 463
           RSL ++   PPFF Y NV    +  W  +S+ L+ I PEFVD++ F   +R+ GY+HNLP
Sbjct: 297 RSLPELARGPPFFYYENVALTPKGVWETISRHLFEIPPEFVDSKYFCVAARKRGYIHNLP 356

Query: 464 CENRSHILPKPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMT--D 523
             NR  I P P  TI DA P +K+WWP WD R  L+CI + T G  QL  R+   +   +
Sbjct: 357 INNRFQIQPPPKYTIHDAFPLSKRWWPEWDKRTKLNCILTCT-GSAQLTNRIRVALEPYN 416

Query: 524 SHGQLSSQQQRDILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERL 583
              +     QR ++  C   NL+WVG+ K APLEP+++E +LG+P NHT+    S  ER 
Sbjct: 417 EEPEPPKHVQRYVIDQCKKWNLVWVGKNKAAPLEPDEMESILGFPKNHTRGGGMSRTERF 476

Query: 584 QILKYCFQIDTLGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSA 643
           + L   FQ+DT+ YHLSVLK +FP G+ VLS+F+GIGG E+ALHRL I +K+V+SVE S 
Sbjct: 477 KSLGNSFQVDTVAYHLSVLKPIFPHGINVLSLFTGIGGGEVALHRLQIKMKLVVSVEISK 536

Query: 644 AKRRILQKWWRSSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSK 703
             R IL+ +W  + QTGEL    DIQ LT++ I  L++KYGGFDLV+  +PC+     ++
Sbjct: 537 VNRNILKDFWEQTNQTGELIEFSDIQHLTNDTIEGLMEKYGGFDLVIGGSPCNNLAGGNR 596

Query: 704 LSKDTEGIASFDFSIFYEFVRVLQGVRNTM 732
           +S+   G+     S+F+E+ R+L+ VR  M
Sbjct: 597 VSR--VGLEGDQSSLFFEYCRILEVVRARM 623

BLAST of Cp4.1LG17g00020 vs. Swiss-Prot
Match: DRM1L_ARATH (DNA (cytosine-5)-methyltransferase DRM1 OS=Arabidopsis thaliana GN=DRM1 PE=3 SV=2)

HSP 1 Score: 283.5 bits (724), Expect = 6.6e-75
Identity = 136/316 (43.04%), Positives = 202/316 (63.92%), Query Frame = 1

Query: 413 PPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILP 472
           PPFF Y NV    +  W K+S  LY I PEFVD++ F A +R+ GY+HNLP +NR  I P
Sbjct: 304 PPFFYYENVAMTPKGVWAKISSHLYDIVPEFVDSKHFCAAARKRGYIHNLPIQNRFQIQP 363

Query: 473 KPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQR 532
               TI++A P TK+WWPSWD R  L+C+ +      +L E++ + +    G+     Q+
Sbjct: 364 PQHNTIQEAFPLTKRWWPSWDGRTKLNCLLTCIAS-SRLTEKIREALERYDGETPLDVQK 423

Query: 533 DILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDT 592
            +++ C   NL+WVG+ KLAPL+ +++E +LG+P +HT+    S+ +R + L   FQ+DT
Sbjct: 424 WVMYECKKWNLVWVGKNKLAPLDADEMEKLLGFPRDHTRGGGISTTDRYKSLGNSFQVDT 483

Query: 593 LGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWR 652
           + YHLSVLK +FP G+ VLS+F+GIGG E+ALHRL I + VV+SVE S A R IL+ +W 
Sbjct: 484 VAYHLSVLKPLFPNGINVLSLFTGIGGGEVALHRLQIKMNVVVSVEISDANRNILRSFWE 543

Query: 653 SSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKLSKDTEGIASF 712
            + Q G L   +D+QKL  N I  L+ +YGGFDLV+  +PC+     ++  +   G+   
Sbjct: 544 QTNQKGILREFKDVQKLDDNTIERLMDEYGGFDLVIGGSPCNNLAGGNRHHR--VGLGGE 603

Query: 713 DFSIFYEFVRVLQGVR 729
             S+F+++ R+L+ VR
Sbjct: 604 HSSLFFDYCRILEAVR 616

BLAST of Cp4.1LG17g00020 vs. TrEMBL
Match: A0A0A0K816_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G000130 PE=4 SV=1)

HSP 1 Score: 1135.9 bits (2937), Expect = 0.0e+00
Identity = 590/745 (79.19%), Positives = 645/745 (86.58%), Query Frame = 1

Query: 2   ASNKPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEEK 61
           ASNKPIVPKEEV DFRLPPD +YSRHVGDSGASSSG+N+RTFF+DMGFLPSLVD VIE+ 
Sbjct: 6   ASNKPIVPKEEVFDFRLPPDRMYSRHVGDSGASSSGSNVRTFFIDMGFLPSLVDSVIEKN 65

Query: 62  GEDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNP--IAAFCYKQ--AGRTSKP 121
           GEDDVELLLNTLTT+SAEQKS+P+S+ SL  ++S K G+NP  ++  C+KQ  A +TSK 
Sbjct: 66  GEDDVELLLNTLTTYSAEQKSIPQSSDSLEGLQSGKMGSNPPHVSTVCHKQVQAAQTSKS 125

Query: 122 ESSDSLDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDK 181
           ESSDSLDSLFDDK DA NEISSV IPKEEADD Y  + TNKASLL+MNFS DEV+FAIDK
Sbjct: 126 ESSDSLDSLFDDK-DAHNEISSV-IPKEEADDYYHISDTNKASLLVMNFSADEVDFAIDK 185

Query: 182 LGGDAPINELVDFIVAAQIAENLEKETNETICRNELKTEENDETLFVTMEKTLRLLEMGF 241
           LGGDAP+NELVDFI+AAQIA  LEKET++  CRNELK EENDETLFVTMEKTLRLLEMGF
Sbjct: 186 LGGDAPLNELVDFIIAAQIAIKLEKETDDAFCRNELKKEENDETLFVTMEKTLRLLEMGF 245

Query: 242 SENEVSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHN-----PKV 301
           SENEVSLAIEKFGS+TQVSELADSIVTGRIA DYPG+ KCS +SF IGGL+       KV
Sbjct: 246 SENEVSLAIEKFGSETQVSELADSIVTGRIASDYPGDVKCSPSSFGIGGLYTREDYVTKV 305

Query: 302 KAEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRFDAKHKGKRPKPEYADDDL 361
           KAE+SSS    L RNVN+E I KGKRPKEE MDDL NP  R + KHKGKRPK EYADD L
Sbjct: 306 KAEESSSAVGPLPRNVNIEAIQKGKRPKEENMDDLLNPTTRLN-KHKGKRPKQEYADD-L 365

Query: 362 SSLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVAK 421
            SLYGP W+E+KVNP I   ++P SS LNLSRSLDK+VA+PP PP K NP R+L+KVV K
Sbjct: 366 GSLYGPGWVESKVNPDITSFDIPPSSRLNLSRSLDKLVAKPPCPPLKSNPSRALEKVVTK 425

Query: 422 PPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILP 481
           PPFFLYGNVLD+SRDSW KVSKFLY +EPEFVDT+SFSALSR EGYVHNLPCENR HI+P
Sbjct: 426 PPFFLYGNVLDISRDSWAKVSKFLYAVEPEFVDTRSFSALSRTEGYVHNLPCENRFHIIP 485

Query: 482 KPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQR 541
            P MTI+DA   TKKWWPSWDTRK+LSCINSETRGVPQLC+RLTK +TDS G  SS ++R
Sbjct: 486 LPPMTIQDAT-RTKKWWPSWDTRKYLSCINSETRGVPQLCDRLTKTLTDSGGHPSSHEER 545

Query: 542 DILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDT 601
           DILHHCIALNLIWV QFKLAP+EPEQLE VLGYPVNHTQDAESSS+ERLQ LKYCFQ D 
Sbjct: 546 DILHHCIALNLIWVSQFKLAPVEPEQLECVLGYPVNHTQDAESSSIERLQYLKYCFQTDA 605

Query: 602 LGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWR 661
           LGYHLSVLKSMFPEGL+VLSIFSGIGGAEIALHRLGIHLKVV+SVESSAAKRRIL+KWW 
Sbjct: 606 LGYHLSVLKSMFPEGLVVLSIFSGIGGAEIALHRLGIHLKVVVSVESSAAKRRILKKWWH 665

Query: 662 SSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKL--SKDTEGIA 721
           SSGQTGELE IEDIQKLTS KI+N I KYGGFDLV+CQNPCSR LSSSKL  S D EGIA
Sbjct: 666 SSGQTGELEQIEDIQKLTSIKINNWITKYGGFDLVICQNPCSRCLSSSKLNQSGDAEGIA 725

Query: 722 SFDFSIFYEFVRVLQGVRNTMERKK 736
           SFDFSIFYEFVRVLQ VRNTM RKK
Sbjct: 726 SFDFSIFYEFVRVLQSVRNTMHRKK 745

BLAST of Cp4.1LG17g00020 vs. TrEMBL
Match: A0A061EPS9_THECC (S-adenosyl-L-methionine-dependent methyltransferases superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_021455 PE=4 SV=1)

HSP 1 Score: 747.7 bits (1929), Expect = 1.4e-212
Identity = 416/764 (54.45%), Positives = 505/764 (66.10%), Query Frame = 1

Query: 5   KPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEEKGED 64
           K IVPK E+LDF LP D LYSRHVGD+ ASSSG+N+R+FF+ MGF PSLV+KVIEEKGED
Sbjct: 17  KAIVPKPEMLDFDLPEDALYSRHVGDNVASSSGSNVRSFFIGMGFSPSLVEKVIEEKGED 76

Query: 65  DVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNPIAAFCYKQAGRTSKPESSDSLD 124
           + +LLL TL  +S  +K    S+ SLHS+ +DK G               S PES     
Sbjct: 77  NADLLLETLVEYSEVRKVNTHSSASLHSLFADKDG--------------GSCPES----- 136

Query: 125 SLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDKLGGDAPI 184
                        S+ + PKEE D         +ASLLMM+FSV+EV FA+DKLG  AP+
Sbjct: 137 -------------STYIQPKEEPDVFDEVHVDKRASLLMMSFSVNEVEFALDKLGEHAPL 196

Query: 185 NELVDFIVAAQIAENLEKETNETICRNELKTEE-NDETLFVTMEKTLRLLEMGFSENEVS 244
           NELVDFI AAQIAE  EKE+  ++  +E K +   +E LF TMEKTL LLEMGFSENEVS
Sbjct: 197 NELVDFIAAAQIAEEFEKESEGSLSSDEEKDQNVTNEFLFGTMEKTLHLLEMGFSENEVS 256

Query: 245 LAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYI------------------- 304
           +AIEKFGS+  ++ELADSI TG+IAG+Y  + K    SF                     
Sbjct: 257 IAIEKFGSEVPIAELADSIFTGQIAGNYTESKKERVISFCFIIARFTIKKLLHCHVQFTS 316

Query: 305 ----GGL-HNP----KVKAEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRF- 364
               GGL HN     K++ ED SS  V  SRN+N  E  KGKRPKEE  DD P  +P+F 
Sbjct: 317 AALRGGLIHNSCDTVKIEPEDCSSSAVPQSRNINTGESCKGKRPKEESFDDFPVSLPQFK 376

Query: 365 ----DAKHKGKRPKPEYADDDLSSLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMV 424
               + KHKGKRP+ +Y  D+ SS   P WLE K++P I+  EMP               
Sbjct: 377 QSSYEKKHKGKRPRQDYV-DNTSSFLDPAWLEEKIDPNIIRFEMPR-------------- 436

Query: 425 ARPPSPPSKFNPCRSLDKVVAKPPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFS 484
                 P K N C+S+DK+VAKPP+F YGNV+++S D W KVS+FLY IEPEFV+TQ FS
Sbjct: 437 ------PFKSNSCKSVDKMVAKPPYFFYGNVVNMSPDCWAKVSQFLYGIEPEFVNTQFFS 496

Query: 485 ALSRREGYVHNLPCENRSHILPKPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQ 544
           ALSR EGYVHNLP  NR +ILPK  +TI+DA+PHTKKWWPSWDTRK LSC+  E  GV +
Sbjct: 497 ALSRIEGYVHNLPAGNRFNILPKSPLTIQDALPHTKKWWPSWDTRKQLSCMGCEVNGVSK 556

Query: 545 LCERLTKMMTDSHGQLSSQQQRDILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHT 604
           LC+RL K++ DS G LS  QQ+DI  HC   NLIWVG +KL   EP   E +LGYP+NHT
Sbjct: 557 LCDRLGKIVADSRGILSPDQQKDIFRHCKTSNLIWVGPYKLGLAEPGHWELILGYPLNHT 616

Query: 605 QDAESSSMERLQILKYCFQIDTLGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIH 664
           +  E+ S  RLQ+L+  FQ DTLGYHLSVLKSM P GL +LS+FSGIGGA + LHRLGIH
Sbjct: 617 KALENDSSRRLQLLEQSFQTDTLGYHLSVLKSMCPGGLTMLSVFSGIGGAVVTLHRLGIH 676

Query: 665 LKVVISVESSAAKRRILQKWWRSSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQ 724
           LK V+SVE+S A++ IL+ WW+S+GQTGEL LIEDIQKLTS K+ NLI K GGFD V+CQ
Sbjct: 677 LKGVVSVETSEARQSILRNWWQSTGQTGELVLIEDIQKLTSKKLENLIDKLGGFDFVICQ 722

Query: 725 NPCSRSLSSSKLSKDTEGIASFDFSIFYEFVRVLQGVRNTMERK 735
           N      SSS    D + +  FDFS+FYEFVRVLQ VR+ MER+
Sbjct: 737 NS-----SSSMTGPDDDRLPGFDFSLFYEFVRVLQRVRSMMERR 722

BLAST of Cp4.1LG17g00020 vs. TrEMBL
Match: W9RGA9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_026137 PE=4 SV=1)

HSP 1 Score: 717.6 bits (1851), Expect = 1.5e-203
Identity = 401/751 (53.40%), Positives = 499/751 (66.44%), Query Frame = 1

Query: 5   KPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEEKGED 64
           KP+VPKEEVLDF  PP  + S   G++ ASSSG+N+R+  + MGFLP+LVDKVIEEKG D
Sbjct: 19  KPLVPKEEVLDFDFPPQTMLSGQFGENFASSSGSNLRSSLIGMGFLPALVDKVIEEKGVD 78

Query: 65  DVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNP--IAAFCYKQAGRTSKPESSDS 124
           DV+LL++ L  +SA Q+S  ES+ SL S+  DK  ++P  I  F           E  D 
Sbjct: 79  DVDLLVDALVAYSAPQRSNSESSDSLDSLFDDKDESSPPEIPTF-------IELKEEPDI 138

Query: 125 LDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDKLGGDA 184
           LD + D K                           +ASL MMNFS+DE+NFAIDKLG + 
Sbjct: 139 LDDVDDGK---------------------------RASLQMMNFSLDEINFAIDKLGENT 198

Query: 185 PINELVDFIVAAQIAENLEKETNETICRNELKTEE-NDETLFVTMEKTLRLLEMGFSENE 244
           PI+ELVDFIVAAQIA+ LE+E ++ +   E + E+ N+ETLF TM+KTL LLEMGFSE +
Sbjct: 199 PIDELVDFIVAAQIAKRLEEERHDIVSDAEERNEDTNNETLFGTMDKTLHLLEMGFSEAQ 258

Query: 245 VSLAIEKFGSDTQVSELADSIVTGRI-----AGDYPGNDKCSSN--SFYIGG-------- 304
           VS AIE  GS+  +SELADSI TGR              + +SN  SF +G         
Sbjct: 259 VSWAIENVGSEAPISELADSIFTGRTPVKPSISTSTSELRTASNCRSFALGAEGIRRDPL 318

Query: 305 LHNPKVKAEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRFDAK--HKGKRPK 364
           L + KV+  D     VS S + N E+ L GKRPK+E   D   P    D +   KGKRPK
Sbjct: 319 LGSAKVETSDLYPDTVSQSMDFNAEQRLLGKRPKQELSFDTVPPFGHIDYEDNQKGKRPK 378

Query: 365 PEYADDDLSSLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCR 424
            EY DDD SSL GP WL+ K+ P+    EMP                     P  +NP R
Sbjct: 379 EEY-DDDSSSLSGPTWLDEKMYPEFTECEMPR--------------------PFDYNPRR 438

Query: 425 SLDKVVAKPPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPC 484
           SL  VVA+PP+F YGNV  V  +SW K+SKFLY +EPEFV+T+ FSALSR+EGY+HNLP 
Sbjct: 439 SLSGVVARPPYFFYGNVGTVCHESWVKISKFLYNLEPEFVNTRFFSALSRKEGYIHNLPT 498

Query: 485 ENRSHILPKPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHG 544
           ENRS ILPKP MTIE+ +P TKKWWP+WD RK LSCI+SE  G+PQLC RL   +  SHG
Sbjct: 499 ENRSQILPKPPMTIEEVMPRTKKWWPAWDARKQLSCISSEVNGIPQLCARLQNTIASSHG 558

Query: 545 QLSSQQQRDILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQIL 604
            LSS+QQR+ILHHC +LNL+WVG  KLAPL+PE LE +LGYP NHTQ A  S +ERLQ L
Sbjct: 559 LLSSEQQRNILHHCRSLNLVWVGPNKLAPLDPEYLEIILGYPSNHTQ-AFISLIERLQSL 618

Query: 605 KYCFQIDTLGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKR 664
           +YCFQ DTLGY+LSVLKS++P GL VLSIFSGIGGAE+A HRLGIHLK V+SVE+S  KR
Sbjct: 619 RYCFQTDTLGYYLSVLKSIYPNGLTVLSIFSGIGGAEVAFHRLGIHLKAVVSVETSETKR 678

Query: 665 RILQKWWRSSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKLSK 724
            IL+KWW+S+GQTGEL  IEDIQKLTS+K+ + +  +GGFD ++CQN  + S  +SK+  
Sbjct: 679 GILKKWWQSTGQTGELIQIEDIQKLTSSKLSSFMNNFGGFDFIICQNSFTHS-PNSKVPA 712

Query: 725 DTEGIASFDFSIFYEFVRVLQGVRNTMERKK 736
           + + I+ FDFS+F EFVR+LQ VR   E+K+
Sbjct: 739 NVDSISGFDFSLFCEFVRILQRVRTMSEKKR 712

BLAST of Cp4.1LG17g00020 vs. TrEMBL
Match: V4TZT4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004428mg PE=4 SV=1)

HSP 1 Score: 712.2 bits (1837), Expect = 6.4e-202
Identity = 401/741 (54.12%), Positives = 499/741 (67.34%), Query Frame = 1

Query: 5   KPIVPKEEVLDFRLPPDGLY--SRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEEKG 64
           K IVPK E+LDF LP D     S HVG++ ASSSG+N+R+ F+ MGF PSLVDKVIEEKG
Sbjct: 47  KAIVPKPEILDFELPADRACASSMHVGENIASSSGSNLRSSFIGMGFSPSLVDKVIEEKG 106

Query: 65  EDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNPIAAFCYKQAGRTSKPESSDS 124
           +D+V+LLL TL  ++A Q+S  +S+ SL ++  DK   +P              PE    
Sbjct: 107 QDNVDLLLETLIEYNALQESNSQSSDSLDTLFGDKDANSP--------------PE---- 166

Query: 125 LDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDKLGGDA 184
           + ++   KE+ +     + I K             +ASLLMMNFSV+EV+FA+DKLG DA
Sbjct: 167 ISTMVQPKEEPNVMDEGLHIEK-------------RASLLMMNFSVNEVDFALDKLGKDA 226

Query: 185 PINELVDFIVAAQIAENLEKETNETICRNELKTEE-NDETLFVTMEKTLRLLEMGFSENE 244
           P+ ELVDFI AAQI+EN EKET++    N+   E+ +DETL+ TME TL+LLEMGFSEN+
Sbjct: 227 PVYELVDFITAAQISENFEKETDDAPHDNDGTNEDKSDETLYGTMEITLQLLEMGFSENQ 286

Query: 245 VSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHNPKVKAEDSSSPG 304
           VSLAIEKFGS T +SELAD I +G+I  D P      S S+        KVK E  S   
Sbjct: 287 VSLAIEKFGSKTPISELADKIFSGQIFLDTP-----RSRSY-----DTVKVKTEYCSPDV 346

Query: 305 VSLSRNVNVEEILKGKRPKEEYMDDLPNPIPR-----FDAKHKGKRPKPEYADDDLSSLY 364
           VS SR +N  E  +GKRPKEEY DD  N   +     F    KGKRPK E  DD  S LY
Sbjct: 347 VSQSRKMNTSETSRGKRPKEEYFDDFSNSTSQFQHVDFQENRKGKRPKQESLDDSSSFLY 406

Query: 365 GPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVAKPPFF 424
              W E KV         P SS   + ++ +             NPCRS++KVVA+PP+F
Sbjct: 407 S-SW-EEKVK--------PNSSRYGMQQAFNS------------NPCRSINKVVAQPPYF 466

Query: 425 LYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILPKPAM 484
            YGNV+DVS D W K+S FLY++EPEFV++Q FSALSRREGY+HNLP  NR HI P+P M
Sbjct: 467 FYGNVVDVSIDCWVKMSHFLYSLEPEFVNSQYFSALSRREGYLHNLPTTNRFHIPPEPPM 526

Query: 485 TIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQRDILH 544
           TI+DAIPHTKKWWPSWDTRKHLSCINS T G+ QLCER  K++ DS G LSSQQQRDILH
Sbjct: 527 TIQDAIPHTKKWWPSWDTRKHLSCINSGTGGISQLCERFEKLLRDSRGVLSSQQQRDILH 586

Query: 545 HCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDTLGYH 604
               LNL+WVG +KL P++PE +E +LGYP NHTQ A +S   RL+ L++CFQ DTLGYH
Sbjct: 587 RSEKLNLVWVGAYKLGPVDPEHIELILGYPSNHTQAAGNSLTARLESLRHCFQTDTLGYH 646

Query: 605 LSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWRSSGQ 664
           LSVLKSMFP GL +LS+FSGIGGAE+ LHRLGI LK VIS+E+S   RRIL++WW SSGQ
Sbjct: 647 LSVLKSMFPGGLTMLSVFSGIGGAEVTLHRLGIKLKGVISIETSETNRRILKRWWESSGQ 706

Query: 665 TGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQN-----PCSRSLSSS---KLSKDTEG 724
           TGEL  IEDIQ LT+ K  +LI K G  D V+CQN     P S+ +S+S   K++ +++ 
Sbjct: 707 TGELVQIEDIQALTTKKFESLIHKLGSIDFVICQNSVPQIPNSKQISNSKDPKMAAESDN 724

Query: 725 IASFDFSIFYEFVRVLQGVRN 730
           +  FDFS++YEFVRV+Q VR+
Sbjct: 767 LPDFDFSLYYEFVRVVQRVRS 724

BLAST of Cp4.1LG17g00020 vs. TrEMBL
Match: A0A067FXI4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g005421mg PE=4 SV=1)

HSP 1 Score: 709.9 bits (1831), Expect = 3.2e-201
Identity = 400/741 (53.98%), Positives = 498/741 (67.21%), Query Frame = 1

Query: 5   KPIVPKEEVLDFRLPPDGLY--SRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEEKG 64
           K IVPK E+LDF LP D     S HVG++ ASSSG+N+R+ F+ MGF PSLVDKVIEEKG
Sbjct: 17  KAIVPKPEILDFELPADRACASSMHVGENIASSSGSNLRSSFIGMGFSPSLVDKVIEEKG 76

Query: 65  EDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNPIAAFCYKQAGRTSKPESSDS 124
           +D+V+LLL TL  ++A Q+S  +S+ SL ++  DK   +P              PE    
Sbjct: 77  QDNVDLLLETLIEYNALQESNSQSSDSLDTLFGDKDANSP--------------PE---- 136

Query: 125 LDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDKLGGDA 184
           + ++   KE+ +     + I K             +ASLLMMNFSV+EV+FA+DKLG DA
Sbjct: 137 ISTMVQPKEEPNVMDEGLHIEK-------------RASLLMMNFSVNEVDFALDKLGKDA 196

Query: 185 PINELVDFIVAAQIAENLEKETNETICRNELKTEE-NDETLFVTMEKTLRLLEMGFSENE 244
           P+ ELVDFI AAQI+EN EKET++    N+   E+ +DETL+ TME TL+LLEMGFSEN+
Sbjct: 197 PVYELVDFITAAQISENFEKETDDAPHDNDGTNEDKSDETLYGTMEITLQLLEMGFSENQ 256

Query: 245 VSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHNPKVKAEDSSSPG 304
           VSLAIEKFGS T +SELAD I +G+I  D P      S S+        KVK E  S   
Sbjct: 257 VSLAIEKFGSKTPISELADKIFSGQIFLDTP-----RSRSY-----DTVKVKTEYCSPDV 316

Query: 305 VSLSRNVNVEEILKGKRPKEEYMDDLPNPIPR-----FDAKHKGKRPKPEYADDDLSSLY 364
           VS SR +N  E  +GKRPKEEY DD  N   +     F    KGKRPK E  DD  SS  
Sbjct: 317 VSQSRKMNTSETSRGKRPKEEYFDDFSNSTSQFQHVDFQENRKGKRPKQESLDDS-SSFL 376

Query: 365 GPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVAKPPFF 424
              W E KV         P SS   + ++ +             NPCRS++KVVA+PP+F
Sbjct: 377 DSSW-EEKVK--------PNSSRYGMQQAFNS------------NPCRSINKVVAQPPYF 436

Query: 425 LYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILPKPAM 484
            YGNV+DVS D W K+S FLY++EPEFV++Q FSALSRREGY+HNLP  NR HI P+P M
Sbjct: 437 FYGNVVDVSIDCWVKMSHFLYSLEPEFVNSQYFSALSRREGYLHNLPTTNRFHIPPEPPM 496

Query: 485 TIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQRDILH 544
           TI+DAIPHTKKWWPSWDTRKHLSCINS T G+ QLCER  K++ DS G LSSQQQRDILH
Sbjct: 497 TIQDAIPHTKKWWPSWDTRKHLSCINSGTSGISQLCERFEKLLRDSRGVLSSQQQRDILH 556

Query: 545 HCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDTLGYH 604
               LNL+WVG +KL P++PE +E +LGYP NHTQ A +S   RL+ L++CFQ DTLGYH
Sbjct: 557 RSEKLNLVWVGAYKLGPVDPEHIELILGYPSNHTQAAGNSLTARLESLRHCFQTDTLGYH 616

Query: 605 LSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWRSSGQ 664
           LSVLKSMFP GL +LS+FSGIGGAE+ LHRLGI LK VIS+E+S   RRIL++WW SSGQ
Sbjct: 617 LSVLKSMFPGGLTMLSVFSGIGGAEVTLHRLGIKLKGVISIETSETNRRILKRWWESSGQ 676

Query: 665 TGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQN-----PCSRSLSSS---KLSKDTEG 724
           TGEL  IEDIQ LT+ K  +LI K G  D V+CQN     P S+ +S+S   K++ +++ 
Sbjct: 677 TGELVQIEDIQALTTKKFESLIHKLGSIDFVICQNSVPQIPNSKQISNSKDPKMAAESDN 694

Query: 725 IASFDFSIFYEFVRVLQGVRN 730
           +  FDFS++YEFVRV+Q VR+
Sbjct: 737 LPDFDFSLYYEFVRVVQRVRS 694

BLAST of Cp4.1LG17g00020 vs. TAIR10
Match: AT3G17310.2 (AT3G17310.2 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein)

HSP 1 Score: 459.9 bits (1182), Expect = 2.9e-129
Identity = 285/700 (40.71%), Positives = 389/700 (55.57%), Query Frame = 1

Query: 82  SLPESTG--SLHSVRSDKKGTNPIAAFCYKQAGRTSKPESSDSLDSLFD------DKEDA 141
           S P+  G  +  S  S+ K       FC     +       D  + L +      + E  
Sbjct: 39  SFPQQIGDNAASSSGSNVKSLLIEMGFCPTLVQKAIDENGQDDFELLLEILTKSTETEPP 98

Query: 142 SNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDKLGGDAPINELVDFIVA 201
                 ++ PK E D +Y + R  + +LL M F  + V+FA+D+LG D PI+E+VDFIVA
Sbjct: 99  GPSFHGLMEPKPEPDIEYETDRI-RIALLTMKFPENLVDFALDRLGKDTPIDEMVDFIVA 158

Query: 202 AQIAENLEKETNETICRNELKTEEND-------------ETLFVTMEKTLRLLEMGFSEN 261
           AQ+AE   +E+ +++   E+  E+ D             E LF TM+KTLRLLEMGFS +
Sbjct: 159 AQLAEKYAEESEDSLDGAEINEEDEDVTPVTARGPEVPNEQLFETMDKTLRLLEMGFSND 218

Query: 262 EVSLAIEKFGSDTQVSELADSIVTG---------------RIAGDYPG-NDKCSSNSFYI 321
           E+S+AIEK G+  Q+S LA+SIVTG               +++   P  N  C S S+  
Sbjct: 219 EISMAIEKIGTKGQISVLAESIVTGEFPAECHDDLEDIEKKVSAAAPAVNRTCLSKSWRF 278

Query: 322 GGLHNPKVKAEDSSSPGVS------------LSRNVNVEEILKGKRPKEEYMDDLPNPIP 381
            G+   K      SS G +                 NV E  +GKRPK+E  +  P    
Sbjct: 279 VGVGAQKEDGGGGSSSGTANIKPDPGIESFPFPATDNVGETSRGKRPKDEDENAYPEEYT 338

Query: 382 RFDAKHKGKRPKPEYADDDLSSLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVAR 441
            +D   +GKR +PE   D  S +  P W++ +        E P+  +  LS+SL   VAR
Sbjct: 339 GYD--DRGKRLRPEDMGDSSSFMETP-WMQDEWKDNTY--EFPSVMQPRLSQSLGPKVAR 398

Query: 442 PPSPPSKFNPCRSLDKVVAKPPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSAL 501
            P                    +F YG + ++S   W K+S FL+ I PE VDT+  SAL
Sbjct: 399 RP--------------------YFFYGQLGELSPSWWSKISGFLFGIHPEHVDTRLCSAL 458

Query: 502 SRREGYVHNLPCENRSHILPKPAMTIEDAIPHTKKWWPSWDTRKHL-SCINSETRGVPQL 561
            R EGY+HNLP  NR + LP P +TI+DA+PH + WWP WD RKH  S   S  +    L
Sbjct: 459 RRTEGYLHNLPTVNRFNTLPNPRLTIQDAMPHMRSWWPQWDIRKHFNSGTCSNMKDATLL 518

Query: 562 CERLTKMMTDSHGQLSSQQQRDILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQ 621
           CER+ + + +  G+ + Q Q  IL HC   NLIW+    L+PLEPE LE ++GYP+NHT 
Sbjct: 519 CERIGRRIAECKGKPTQQDQTLILRHCHTSNLIWIAPNILSPLEPEHLECIMGYPMNHTN 578

Query: 622 DAESSSMERLQILKYCFQIDTLGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHL 681
                  ERL++  YCFQ DTLGYHLSVLKSMFP+GL VLS+FSGIGGAEIAL RLGIHL
Sbjct: 579 IGGGRLAERLKLFDYCFQTDTLGYHLSVLKSMFPQGLTVLSLFSGIGGAEIALDRLGIHL 638

Query: 682 KVVISVESSAAKRRILQKWWRSSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQN 732
           K V+SVES    R IL++WW++SGQTGEL  IE+I+ LT+ ++  L++++GGFD V+CQN
Sbjct: 639 KGVVSVESCGLSRNILKRWWQTSGQTGELVQIEEIKSLTAKRLETLMQRFGGFDFVICQN 698

BLAST of Cp4.1LG17g00020 vs. TAIR10
Match: AT5G14620.1 (AT5G14620.1 domains rearranged methyltransferase 2)

HSP 1 Score: 298.1 bits (762), Expect = 1.5e-80
Identity = 148/330 (44.85%), Positives = 208/330 (63.03%), Query Frame = 1

Query: 404 RSLDKVVAKPPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLP 463
           RSL ++   PPFF Y NV    +  W  +S+ L+ I PEFVD++ F   +R+ GY+HNLP
Sbjct: 297 RSLPELARGPPFFYYENVALTPKGVWETISRHLFEIPPEFVDSKYFCVAARKRGYIHNLP 356

Query: 464 CENRSHILPKPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMT--D 523
             NR  I P P  TI DA P +K+WWP WD R  L+CI + T G  QL  R+   +   +
Sbjct: 357 INNRFQIQPPPKYTIHDAFPLSKRWWPEWDKRTKLNCILTCT-GSAQLTNRIRVALEPYN 416

Query: 524 SHGQLSSQQQRDILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERL 583
              +     QR ++  C   NL+WVG+ K APLEP+++E +LG+P NHT+    S  ER 
Sbjct: 417 EEPEPPKHVQRYVIDQCKKWNLVWVGKNKAAPLEPDEMESILGFPKNHTRGGGMSRTERF 476

Query: 584 QILKYCFQIDTLGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSA 643
           + L   FQ+DT+ YHLSVLK +FP G+ VLS+F+GIGG E+ALHRL I +K+V+SVE S 
Sbjct: 477 KSLGNSFQVDTVAYHLSVLKPIFPHGINVLSLFTGIGGGEVALHRLQIKMKLVVSVEISK 536

Query: 644 AKRRILQKWWRSSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSK 703
             R IL+ +W  + QTGEL    DIQ LT++ I  L++KYGGFDLV+  +PC+     ++
Sbjct: 537 VNRNILKDFWEQTNQTGELIEFSDIQHLTNDTIEGLMEKYGGFDLVIGGSPCNNLAGGNR 596

Query: 704 LSKDTEGIASFDFSIFYEFVRVLQGVRNTM 732
           +S+   G+     S+F+E+ R+L+ VR  M
Sbjct: 597 VSR--VGLEGDQSSLFFEYCRILEVVRARM 623

BLAST of Cp4.1LG17g00020 vs. TAIR10
Match: AT5G15380.1 (AT5G15380.1 domains rearranged methylase 1)

HSP 1 Score: 283.5 bits (724), Expect = 3.7e-76
Identity = 136/316 (43.04%), Positives = 202/316 (63.92%), Query Frame = 1

Query: 413 PPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILP 472
           PPFF Y NV    +  W K+S  LY I PEFVD++ F A +R+ GY+HNLP +NR  I P
Sbjct: 304 PPFFYYENVAMTPKGVWAKISSHLYDIVPEFVDSKHFCAAARKRGYIHNLPIQNRFQIQP 363

Query: 473 KPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQR 532
               TI++A P TK+WWPSWD R  L+C+ +      +L E++ + +    G+     Q+
Sbjct: 364 PQHNTIQEAFPLTKRWWPSWDGRTKLNCLLTCIAS-SRLTEKIREALERYDGETPLDVQK 423

Query: 533 DILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDT 592
            +++ C   NL+WVG+ KLAPL+ +++E +LG+P +HT+    S+ +R + L   FQ+DT
Sbjct: 424 WVMYECKKWNLVWVGKNKLAPLDADEMEKLLGFPRDHTRGGGISTTDRYKSLGNSFQVDT 483

Query: 593 LGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWR 652
           + YHLSVLK +FP G+ VLS+F+GIGG E+ALHRL I + VV+SVE S A R IL+ +W 
Sbjct: 484 VAYHLSVLKPLFPNGINVLSLFTGIGGGEVALHRLQIKMNVVVSVEISDANRNILRSFWE 543

Query: 653 SSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKLSKDTEGIASF 712
            + Q G L   +D+QKL  N I  L+ +YGGFDLV+  +PC+     ++  +   G+   
Sbjct: 544 QTNQKGILREFKDVQKLDDNTIERLMDEYGGFDLVIGGSPCNNLAGGNRHHR--VGLGGE 603

Query: 713 DFSIFYEFVRVLQGVR 729
             S+F+++ R+L+ VR
Sbjct: 604 HSSLFFDYCRILEAVR 616

BLAST of Cp4.1LG17g00020 vs. NCBI nr
Match: gi|778708928|ref|XP_011656315.1| (PREDICTED: uncharacterized protein LOC101206985 isoform X2 [Cucumis sativus])

HSP 1 Score: 1142.9 bits (2955), Expect = 0.0e+00
Identity = 591/744 (79.44%), Positives = 646/744 (86.83%), Query Frame = 1

Query: 1   MASNKPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEE 60
           MASNKPIVPKEEV DFRLPPD +YSRHVGDSGASSSG+N+RTFF+DMGFLPSLVD VIE+
Sbjct: 1   MASNKPIVPKEEVFDFRLPPDRMYSRHVGDSGASSSGSNVRTFFIDMGFLPSLVDSVIEK 60

Query: 61  KGEDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNP--IAAFCYKQAGRTSKPE 120
            GEDDVELLLNTLTT+SAEQKS+P+S+ SL  ++S K G+NP  ++  C+KQA +TSK E
Sbjct: 61  NGEDDVELLLNTLTTYSAEQKSIPQSSDSLEGLQSGKMGSNPPHVSTVCHKQAAQTSKSE 120

Query: 121 SSDSLDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDKL 180
           SSDSLDSLFDDK DA NEISSV IPKEEADD Y  + TNKASLL+MNFS DEV+FAIDKL
Sbjct: 121 SSDSLDSLFDDK-DAHNEISSV-IPKEEADDYYHISDTNKASLLVMNFSADEVDFAIDKL 180

Query: 181 GGDAPINELVDFIVAAQIAENLEKETNETICRNELKTEENDETLFVTMEKTLRLLEMGFS 240
           GGDAP+NELVDFI+AAQIA  LEKET++  CRNELK EENDETLFVTMEKTLRLLEMGFS
Sbjct: 181 GGDAPLNELVDFIIAAQIAIKLEKETDDAFCRNELKKEENDETLFVTMEKTLRLLEMGFS 240

Query: 241 ENEVSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHN-----PKVK 300
           ENEVSLAIEKFGS+TQVSELADSIVTGRIA DYPG+ KCS +SF IGGL+       KVK
Sbjct: 241 ENEVSLAIEKFGSETQVSELADSIVTGRIASDYPGDVKCSPSSFGIGGLYTREDYVTKVK 300

Query: 301 AEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRFDAKHKGKRPKPEYADDDLS 360
           AE+SSS    L RNVN+E I KGKRPKEE MDDL NP  R + KHKGKRPK EYADD L 
Sbjct: 301 AEESSSAVGPLPRNVNIEAIQKGKRPKEENMDDLLNPTTRLN-KHKGKRPKQEYADD-LG 360

Query: 361 SLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVAKP 420
           SLYGP W+E+KVNP I   ++P SS LNLSRSLDK+VA+PP PP K NP R+L+KVV KP
Sbjct: 361 SLYGPGWVESKVNPDITSFDIPPSSRLNLSRSLDKLVAKPPCPPLKSNPSRALEKVVTKP 420

Query: 421 PFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILPK 480
           PFFLYGNVLD+SRDSW KVSKFLY +EPEFVDT+SFSALSR EGYVHNLPCENR HI+P 
Sbjct: 421 PFFLYGNVLDISRDSWAKVSKFLYAVEPEFVDTRSFSALSRTEGYVHNLPCENRFHIIPL 480

Query: 481 PAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQRD 540
           P MTI+DA   TKKWWPSWDTRK+LSCINSETRGVPQLC+RLTK +TDS G  SS ++RD
Sbjct: 481 PPMTIQDAT-RTKKWWPSWDTRKYLSCINSETRGVPQLCDRLTKTLTDSGGHPSSHEERD 540

Query: 541 ILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDTL 600
           ILHHCIALNLIWV QFKLAP+EPEQLE VLGYPVNHTQDAESSS+ERLQ LKYCFQ D L
Sbjct: 541 ILHHCIALNLIWVSQFKLAPVEPEQLECVLGYPVNHTQDAESSSIERLQYLKYCFQTDAL 600

Query: 601 GYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWRS 660
           GYHLSVLKSMFPEGL+VLSIFSGIGGAEIALHRLGIHLKVV+SVESSAAKRRIL+KWW S
Sbjct: 601 GYHLSVLKSMFPEGLVVLSIFSGIGGAEIALHRLGIHLKVVVSVESSAAKRRILKKWWHS 660

Query: 661 SGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKL--SKDTEGIAS 720
           SGQTGELE IEDIQKLTS KI+N I KYGGFDLV+CQNPCSR LSSSKL  S D EGIAS
Sbjct: 661 SGQTGELEQIEDIQKLTSIKINNWITKYGGFDLVICQNPCSRCLSSSKLNQSGDAEGIAS 720

Query: 721 FDFSIFYEFVRVLQGVRNTMERKK 736
           FDFSIFYEFVRVLQ VRNTM RKK
Sbjct: 721 FDFSIFYEFVRVLQSVRNTMHRKK 739

BLAST of Cp4.1LG17g00020 vs. NCBI nr
Match: gi|778708917|ref|XP_011656312.1| (PREDICTED: uncharacterized protein LOC101206985 isoform X1 [Cucumis sativus])

HSP 1 Score: 1137.9 bits (2942), Expect = 0.0e+00
Identity = 591/746 (79.22%), Positives = 646/746 (86.60%), Query Frame = 1

Query: 1   MASNKPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEE 60
           MASNKPIVPKEEV DFRLPPD +YSRHVGDSGASSSG+N+RTFF+DMGFLPSLVD VIE+
Sbjct: 1   MASNKPIVPKEEVFDFRLPPDRMYSRHVGDSGASSSGSNVRTFFIDMGFLPSLVDSVIEK 60

Query: 61  KGEDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNP--IAAFCYKQ--AGRTSK 120
            GEDDVELLLNTLTT+SAEQKS+P+S+ SL  ++S K G+NP  ++  C+KQ  A +TSK
Sbjct: 61  NGEDDVELLLNTLTTYSAEQKSIPQSSDSLEGLQSGKMGSNPPHVSTVCHKQVQAAQTSK 120

Query: 121 PESSDSLDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAID 180
            ESSDSLDSLFDDK DA NEISSV IPKEEADD Y  + TNKASLL+MNFS DEV+FAID
Sbjct: 121 SESSDSLDSLFDDK-DAHNEISSV-IPKEEADDYYHISDTNKASLLVMNFSADEVDFAID 180

Query: 181 KLGGDAPINELVDFIVAAQIAENLEKETNETICRNELKTEENDETLFVTMEKTLRLLEMG 240
           KLGGDAP+NELVDFI+AAQIA  LEKET++  CRNELK EENDETLFVTMEKTLRLLEMG
Sbjct: 181 KLGGDAPLNELVDFIIAAQIAIKLEKETDDAFCRNELKKEENDETLFVTMEKTLRLLEMG 240

Query: 241 FSENEVSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHN-----PK 300
           FSENEVSLAIEKFGS+TQVSELADSIVTGRIA DYPG+ KCS +SF IGGL+       K
Sbjct: 241 FSENEVSLAIEKFGSETQVSELADSIVTGRIASDYPGDVKCSPSSFGIGGLYTREDYVTK 300

Query: 301 VKAEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRFDAKHKGKRPKPEYADDD 360
           VKAE+SSS    L RNVN+E I KGKRPKEE MDDL NP  R + KHKGKRPK EYADD 
Sbjct: 301 VKAEESSSAVGPLPRNVNIEAIQKGKRPKEENMDDLLNPTTRLN-KHKGKRPKQEYADD- 360

Query: 361 LSSLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVA 420
           L SLYGP W+E+KVNP I   ++P SS LNLSRSLDK+VA+PP PP K NP R+L+KVV 
Sbjct: 361 LGSLYGPGWVESKVNPDITSFDIPPSSRLNLSRSLDKLVAKPPCPPLKSNPSRALEKVVT 420

Query: 421 KPPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHIL 480
           KPPFFLYGNVLD+SRDSW KVSKFLY +EPEFVDT+SFSALSR EGYVHNLPCENR HI+
Sbjct: 421 KPPFFLYGNVLDISRDSWAKVSKFLYAVEPEFVDTRSFSALSRTEGYVHNLPCENRFHII 480

Query: 481 PKPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQ 540
           P P MTI+DA   TKKWWPSWDTRK+LSCINSETRGVPQLC+RLTK +TDS G  SS ++
Sbjct: 481 PLPPMTIQDAT-RTKKWWPSWDTRKYLSCINSETRGVPQLCDRLTKTLTDSGGHPSSHEE 540

Query: 541 RDILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQID 600
           RDILHHCIALNLIWV QFKLAP+EPEQLE VLGYPVNHTQDAESSS+ERLQ LKYCFQ D
Sbjct: 541 RDILHHCIALNLIWVSQFKLAPVEPEQLECVLGYPVNHTQDAESSSIERLQYLKYCFQTD 600

Query: 601 TLGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWW 660
            LGYHLSVLKSMFPEGL+VLSIFSGIGGAEIALHRLGIHLKVV+SVESSAAKRRIL+KWW
Sbjct: 601 ALGYHLSVLKSMFPEGLVVLSIFSGIGGAEIALHRLGIHLKVVVSVESSAAKRRILKKWW 660

Query: 661 RSSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKL--SKDTEGI 720
            SSGQTGELE IEDIQKLTS KI+N I KYGGFDLV+CQNPCSR LSSSKL  S D EGI
Sbjct: 661 HSSGQTGELEQIEDIQKLTSIKINNWITKYGGFDLVICQNPCSRCLSSSKLNQSGDAEGI 720

Query: 721 ASFDFSIFYEFVRVLQGVRNTMERKK 736
           ASFDFSIFYEFVRVLQ VRNTM RKK
Sbjct: 721 ASFDFSIFYEFVRVLQSVRNTMHRKK 741

BLAST of Cp4.1LG17g00020 vs. NCBI nr
Match: gi|700190409|gb|KGN45613.1| (hypothetical protein Csa_6G000130 [Cucumis sativus])

HSP 1 Score: 1135.9 bits (2937), Expect = 0.0e+00
Identity = 590/745 (79.19%), Positives = 645/745 (86.58%), Query Frame = 1

Query: 2   ASNKPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEEK 61
           ASNKPIVPKEEV DFRLPPD +YSRHVGDSGASSSG+N+RTFF+DMGFLPSLVD VIE+ 
Sbjct: 6   ASNKPIVPKEEVFDFRLPPDRMYSRHVGDSGASSSGSNVRTFFIDMGFLPSLVDSVIEKN 65

Query: 62  GEDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNP--IAAFCYKQ--AGRTSKP 121
           GEDDVELLLNTLTT+SAEQKS+P+S+ SL  ++S K G+NP  ++  C+KQ  A +TSK 
Sbjct: 66  GEDDVELLLNTLTTYSAEQKSIPQSSDSLEGLQSGKMGSNPPHVSTVCHKQVQAAQTSKS 125

Query: 122 ESSDSLDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDK 181
           ESSDSLDSLFDDK DA NEISSV IPKEEADD Y  + TNKASLL+MNFS DEV+FAIDK
Sbjct: 126 ESSDSLDSLFDDK-DAHNEISSV-IPKEEADDYYHISDTNKASLLVMNFSADEVDFAIDK 185

Query: 182 LGGDAPINELVDFIVAAQIAENLEKETNETICRNELKTEENDETLFVTMEKTLRLLEMGF 241
           LGGDAP+NELVDFI+AAQIA  LEKET++  CRNELK EENDETLFVTMEKTLRLLEMGF
Sbjct: 186 LGGDAPLNELVDFIIAAQIAIKLEKETDDAFCRNELKKEENDETLFVTMEKTLRLLEMGF 245

Query: 242 SENEVSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHN-----PKV 301
           SENEVSLAIEKFGS+TQVSELADSIVTGRIA DYPG+ KCS +SF IGGL+       KV
Sbjct: 246 SENEVSLAIEKFGSETQVSELADSIVTGRIASDYPGDVKCSPSSFGIGGLYTREDYVTKV 305

Query: 302 KAEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRFDAKHKGKRPKPEYADDDL 361
           KAE+SSS    L RNVN+E I KGKRPKEE MDDL NP  R + KHKGKRPK EYADD L
Sbjct: 306 KAEESSSAVGPLPRNVNIEAIQKGKRPKEENMDDLLNPTTRLN-KHKGKRPKQEYADD-L 365

Query: 362 SSLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVAK 421
            SLYGP W+E+KVNP I   ++P SS LNLSRSLDK+VA+PP PP K NP R+L+KVV K
Sbjct: 366 GSLYGPGWVESKVNPDITSFDIPPSSRLNLSRSLDKLVAKPPCPPLKSNPSRALEKVVTK 425

Query: 422 PPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILP 481
           PPFFLYGNVLD+SRDSW KVSKFLY +EPEFVDT+SFSALSR EGYVHNLPCENR HI+P
Sbjct: 426 PPFFLYGNVLDISRDSWAKVSKFLYAVEPEFVDTRSFSALSRTEGYVHNLPCENRFHIIP 485

Query: 482 KPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQR 541
            P MTI+DA   TKKWWPSWDTRK+LSCINSETRGVPQLC+RLTK +TDS G  SS ++R
Sbjct: 486 LPPMTIQDAT-RTKKWWPSWDTRKYLSCINSETRGVPQLCDRLTKTLTDSGGHPSSHEER 545

Query: 542 DILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDT 601
           DILHHCIALNLIWV QFKLAP+EPEQLE VLGYPVNHTQDAESSS+ERLQ LKYCFQ D 
Sbjct: 546 DILHHCIALNLIWVSQFKLAPVEPEQLECVLGYPVNHTQDAESSSIERLQYLKYCFQTDA 605

Query: 602 LGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWR 661
           LGYHLSVLKSMFPEGL+VLSIFSGIGGAEIALHRLGIHLKVV+SVESSAAKRRIL+KWW 
Sbjct: 606 LGYHLSVLKSMFPEGLVVLSIFSGIGGAEIALHRLGIHLKVVVSVESSAAKRRILKKWWH 665

Query: 662 SSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKL--SKDTEGIA 721
           SSGQTGELE IEDIQKLTS KI+N I KYGGFDLV+CQNPCSR LSSSKL  S D EGIA
Sbjct: 666 SSGQTGELEQIEDIQKLTSIKINNWITKYGGFDLVICQNPCSRCLSSSKLNQSGDAEGIA 725

Query: 722 SFDFSIFYEFVRVLQGVRNTMERKK 736
           SFDFSIFYEFVRVLQ VRNTM RKK
Sbjct: 726 SFDFSIFYEFVRVLQSVRNTMHRKK 745

BLAST of Cp4.1LG17g00020 vs. NCBI nr
Match: gi|659116878|ref|XP_008458306.1| (PREDICTED: DNA (cytosine-5)-methyltransferase DRM1 isoform X1 [Cucumis melo])

HSP 1 Score: 1135.9 bits (2937), Expect = 0.0e+00
Identity = 592/744 (79.57%), Positives = 642/744 (86.29%), Query Frame = 1

Query: 1   MASNKPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEE 60
           MASNKP+VPKEEV DFR   D +YSRH GDSGASSSG++IRTFFVDMGFLPSLVD VIE+
Sbjct: 1   MASNKPVVPKEEVFDFR---DRMYSRHAGDSGASSSGSDIRTFFVDMGFLPSLVDTVIEK 60

Query: 61  KGEDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNP--IAAFCYKQAGRTSKPE 120
            GEDDVELLLNTLTT+SAEQKS+P+S+ SL  +RS K G+NP   +  C+KQA RTSK E
Sbjct: 61  NGEDDVELLLNTLTTYSAEQKSIPQSSDSLEGLRSGKMGSNPPHFSTVCHKQAVRTSKSE 120

Query: 121 SSDSLDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAIDKL 180
           SSDSLDSLFDDK DA NEISSVVIPKEEADD Y  + TNKASLL+MNFS DEV+FAIDKL
Sbjct: 121 SSDSLDSLFDDK-DAHNEISSVVIPKEEADDYYHISDTNKASLLVMNFSADEVDFAIDKL 180

Query: 181 GGDAPINELVDFIVAAQIAENLEKETNETICRNELKTEENDETLFVTMEKTLRLLEMGFS 240
           GGDAP+NELVDFI+AAQIA  LEKET++  CRNEL  EENDETLFVTMEKTLRLLEMGFS
Sbjct: 181 GGDAPLNELVDFIIAAQIAIKLEKETDDAFCRNELNKEENDETLFVTMEKTLRLLEMGFS 240

Query: 241 ENEVSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHNP-----KVK 300
           ENEVSLAIEKFGS+TQ++ELADSIVTGRIA DYPG+ KCS +SF IGGL+ P     KVK
Sbjct: 241 ENEVSLAIEKFGSETQIAELADSIVTGRIASDYPGDVKCSPSSFGIGGLYTPEDYMTKVK 300

Query: 301 AEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRFDAKHKGKRPKPEYADDDLS 360
           AE+SSS  V L RNVN+E ILKGKRPKEE  DD  NP PR D KHKGKRPK EYADD LS
Sbjct: 301 AEESSSAVVPLPRNVNIEAILKGKRPKEENTDDHLNPTPRLD-KHKGKRPKQEYADD-LS 360

Query: 361 SLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVAKP 420
           SLYGP W+E+KVNP I   ++P SS LNLSRSLDK+VA+PP PP K NP R+L+KVV KP
Sbjct: 361 SLYGPGWVESKVNPDITSFDIPPSSRLNLSRSLDKLVAKPPFPPLKSNPSRALEKVVTKP 420

Query: 421 PFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHILPK 480
           PFFLYGNVLD+SRDSW KVSKFLY +EPE+VDT+SFSALSR EGYVHNLPCENR HI P 
Sbjct: 421 PFFLYGNVLDISRDSWEKVSKFLYAVEPEYVDTRSFSALSRTEGYVHNLPCENRFHITPL 480

Query: 481 PAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQRD 540
           P MTI+DA   TKKWWPSWDTRK+LSCINSETRGV QLC+RLTK +TDS G LSS Q+RD
Sbjct: 481 PPMTIQDAT-RTKKWWPSWDTRKYLSCINSETRGVSQLCDRLTKTLTDSCGHLSSHQERD 540

Query: 541 ILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQIDTL 600
           ILHHCIALNLIWV QFKLAP+EPEQLE VLGYPVNHTQDAESSSMERLQ LKYCFQ D L
Sbjct: 541 ILHHCIALNLIWVSQFKLAPVEPEQLECVLGYPVNHTQDAESSSMERLQYLKYCFQTDAL 600

Query: 601 GYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWWRS 660
           GYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGI LKVV+SVESSAAKRRIL+KWW S
Sbjct: 601 GYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIRLKVVVSVESSAAKRRILKKWWHS 660

Query: 661 SGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKL--SKDTEGIAS 720
           SGQTGELE IEDIQKLTS KI+N I KYGGFDLV+CQNPCSR LSSSKL  S D EGIAS
Sbjct: 661 SGQTGELEQIEDIQKLTSIKINNWITKYGGFDLVICQNPCSRCLSSSKLNQSGDAEGIAS 720

Query: 721 FDFSIFYEFVRVLQGVRNTMERKK 736
           FDFSIFYEFVRVLQ VRNTM RKK
Sbjct: 721 FDFSIFYEFVRVLQSVRNTMHRKK 737

BLAST of Cp4.1LG17g00020 vs. NCBI nr
Match: gi|449441506|ref|XP_004138523.1| (PREDICTED: uncharacterized protein LOC101206985 isoform X3 [Cucumis sativus])

HSP 1 Score: 1096.6 bits (2835), Expect = 0.0e+00
Identity = 575/746 (77.08%), Positives = 629/746 (84.32%), Query Frame = 1

Query: 1   MASNKPIVPKEEVLDFRLPPDGLYSRHVGDSGASSSGNNIRTFFVDMGFLPSLVDKVIEE 60
           MASNKPIVPKEEV DFRLPPD +YSRHVGDSGASSSG+N+RTFF+DMGFLPSLVD VIE+
Sbjct: 1   MASNKPIVPKEEVFDFRLPPDRMYSRHVGDSGASSSGSNVRTFFIDMGFLPSLVDSVIEK 60

Query: 61  KGEDDVELLLNTLTTFSAEQKSLPESTGSLHSVRSDKKGTNP--IAAFCYKQ--AGRTSK 120
                             EQKS+P+S+ SL  ++S K G+NP  ++  C+KQ  A +TSK
Sbjct: 61  N-----------------EQKSIPQSSDSLEGLQSGKMGSNPPHVSTVCHKQVQAAQTSK 120

Query: 121 PESSDSLDSLFDDKEDASNEISSVVIPKEEADDDYISTRTNKASLLMMNFSVDEVNFAID 180
            ESSDSLDSLFDDK DA NEISSV IPKEEADD Y  + TNKASLL+MNFS DEV+FAID
Sbjct: 121 SESSDSLDSLFDDK-DAHNEISSV-IPKEEADDYYHISDTNKASLLVMNFSADEVDFAID 180

Query: 181 KLGGDAPINELVDFIVAAQIAENLEKETNETICRNELKTEENDETLFVTMEKTLRLLEMG 240
           KLGGDAP+NELVDFI+AAQIA  LEKET++  CRNELK EENDETLFVTMEKTLRLLEMG
Sbjct: 181 KLGGDAPLNELVDFIIAAQIAIKLEKETDDAFCRNELKKEENDETLFVTMEKTLRLLEMG 240

Query: 241 FSENEVSLAIEKFGSDTQVSELADSIVTGRIAGDYPGNDKCSSNSFYIGGLHN-----PK 300
           FSENEVSLAIEKFGS+TQVSELADSIVTGRIA DYPG+ KCS +SF IGGL+       K
Sbjct: 241 FSENEVSLAIEKFGSETQVSELADSIVTGRIASDYPGDVKCSPSSFGIGGLYTREDYVTK 300

Query: 301 VKAEDSSSPGVSLSRNVNVEEILKGKRPKEEYMDDLPNPIPRFDAKHKGKRPKPEYADDD 360
           VKAE+SSS    L RNVN+E I KGKRPKEE MDDL NP  R + KHKGKRPK EYADD 
Sbjct: 301 VKAEESSSAVGPLPRNVNIEAIQKGKRPKEENMDDLLNPTTRLN-KHKGKRPKQEYADD- 360

Query: 361 LSSLYGPEWLEAKVNPKIVGLEMPTSSELNLSRSLDKMVARPPSPPSKFNPCRSLDKVVA 420
           L SLYGP W+E+KVNP I   ++P SS LNLSRSLDK+VA+PP PP K NP R+L+KVV 
Sbjct: 361 LGSLYGPGWVESKVNPDITSFDIPPSSRLNLSRSLDKLVAKPPCPPLKSNPSRALEKVVT 420

Query: 421 KPPFFLYGNVLDVSRDSWGKVSKFLYTIEPEFVDTQSFSALSRREGYVHNLPCENRSHIL 480
           KPPFFLYGNVLD+SRDSW KVSKFLY +EPEFVDT+SFSALSR EGYVHNLPCENR HI+
Sbjct: 421 KPPFFLYGNVLDISRDSWAKVSKFLYAVEPEFVDTRSFSALSRTEGYVHNLPCENRFHII 480

Query: 481 PKPAMTIEDAIPHTKKWWPSWDTRKHLSCINSETRGVPQLCERLTKMMTDSHGQLSSQQQ 540
           P P MTI+DA   TKKWWPSWDTRK+LSCINSETRGVPQLC+RLTK +TDS G  SS ++
Sbjct: 481 PLPPMTIQDAT-RTKKWWPSWDTRKYLSCINSETRGVPQLCDRLTKTLTDSGGHPSSHEE 540

Query: 541 RDILHHCIALNLIWVGQFKLAPLEPEQLEYVLGYPVNHTQDAESSSMERLQILKYCFQID 600
           RDILHHCIALNLIWV QFKLAP+EPEQLE VLGYPVNHTQDAESSS+ERLQ LKYCFQ D
Sbjct: 541 RDILHHCIALNLIWVSQFKLAPVEPEQLECVLGYPVNHTQDAESSSIERLQYLKYCFQTD 600

Query: 601 TLGYHLSVLKSMFPEGLIVLSIFSGIGGAEIALHRLGIHLKVVISVESSAAKRRILQKWW 660
            LGYHLSVLKSMFPEGL+VLSIFSGIGGAEIALHRLGIHLKVV+SVESSAAKRRIL+KWW
Sbjct: 601 ALGYHLSVLKSMFPEGLVVLSIFSGIGGAEIALHRLGIHLKVVVSVESSAAKRRILKKWW 660

Query: 661 RSSGQTGELELIEDIQKLTSNKIHNLIKKYGGFDLVVCQNPCSRSLSSSKL--SKDTEGI 720
            SSGQTGELE IEDIQKLTS KI+N I KYGGFDLV+CQNPCSR LSSSKL  S D EGI
Sbjct: 661 HSSGQTGELEQIEDIQKLTSIKINNWITKYGGFDLVICQNPCSRCLSSSKLNQSGDAEGI 720

Query: 721 ASFDFSIFYEFVRVLQGVRNTMERKK 736
           ASFDFSIFYEFVRVLQ VRNTM RKK
Sbjct: 721 ASFDFSIFYEFVRVLQSVRNTMHRKK 724

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
DRM2_ARATH   2.6e-79  44.85     DNA (cytosine-5)-methyltransferase DRM2 OS=Arabidopsis thaliana GN=DRM2 PE=1 SV=...
DRM1L_ARATH  6.6e-75  43.04     DNA (cytosine-5)-methyltransferase DRM1 OS=Arabidopsis thaliana GN=DRM1 PE=3 SV=...

Match Name        E-value   Identity  Description
A0A0A0K816_CUCSA  0.0e+00   79.19     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G000130 PE=4 SV=1
A0A061EPS9_THECC  1.4e-212  54.45     S-adenosyl-L-methionine-dependent methyltransferases superfamily protein, putati...
W9RGA9_9ROSA      1.5e-203  53.40     Uncharacterized protein OS=Morus notabilis GN=L484_026137 PE=4 SV=1
V4TZT4_9ROSI      6.4e-202  54.12     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004428mg PE=4 SV=1
A0A067FXI4_CITSI  3.2e-201  53.98     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g005421mg PE=4 SV=1

Match Name   E-value   Identity  Description
AT3G17310.2  2.9e-129  40.71     S-adenosyl-L-methionine-dependent methyltransferases superfamily pro...
AT5G14620.1  1.5e-80   44.85     domains rearranged methyltransferase 2
AT5G15380.1  3.7e-76   43.04     domains rearranged methylase 1

Match Name                        E-value  Identity  Description
gi|778708928|ref|XP_011656315.1|  0.0e+00  79.44     PREDICTED: uncharacterized protein LOC101206985 isoform X2 [Cucumis sativus]
gi|778708917|ref|XP_011656312.1|  0.0e+00  79.22     PREDICTED: uncharacterized protein LOC101206985 isoform X1 [Cucumis sativus]
gi|700190409|gb|KGN45613.1|       0.0e+00  79.19     hypothetical protein Csa_6G000130 [Cucumis sativus]
gi|659116878|ref|XP_008458306.1|  0.0e+00  79.57     PREDICTED: DNA (cytosine-5)-methyltransferase DRM1 isoform X1 [Cucumis melo]
gi|449441506|ref|XP_004138523.1|  0.0e+00  77.08     PREDICTED: uncharacterized protein LOC101206985 isoform X3 [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006306  DNA methylation
Vocabulary: INTERPRO
Term        Definition
IPR015940   UBA
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006306      DNA methylation
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG17g00020.1  Cp4.1LG17g00020.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description              Source   Source Term     Source Description                            Alignment
IPR015940  Ubiquitin-associated domain  PROFILE  PS50030         UBA                                           coord: 217..265
                                                                                                               score:
None       No IPR available             PANTHER  PTHR23068       DNA CYTOSINE-5- -METHYLTRANSFERASE 3-RELATED  coord: 7..735
                                                                                                               score: 5.8E
None       No IPR available             PANTHER  PTHR23068:SF11  SUBFAMILY NOT NAMED                           coord: 7..735
                                                                                                               score: 5.8E