Cp4.1LG01g04820 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g04820
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: XH/XS domain-containing family protein
Location: Cp4.1LG01 : 1016355 .. 1020847 (+)
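The Location field gives a scaffold name, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common GFF-style convention that this page does not state explicitly), the genomic span of the feature can be computed as shown below. The names `parse_location` and `span_length` are illustrative helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'scaffold : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01 : 1016355 .. 1020847 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 4493 bp on the plus strand. If the coordinates were 0-based and half-open instead, the `+ 1` would have to be dropped.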
The following sequences are available for this feature:

Gene sequence (with intron)

CATGGCTCGCCGCCACTTGGGGAGGCTCAGAAGTCAATAAGCTAAATTTGTTGGTGAAGCTCACAAGCTTCAGCCATGGAGATCAACGCTTTCGGAGTGCATGAGCCTCTAATGATTCCATCAACGCGCTCACACGCGCTACGATTTTCGTCAGGTTTTGGTCTGAGCCTTAGCGAGCGTACCCAGCGCAGGCCCCAATTCGACTTTCTCGCATTTCCACGGCGACTCACTTTTCCATTGTCGAGAACAGACTCGGTCCTCACTCCAACCCTGAGTCTTTGATTTGCTCTCAATTCGTGAATTCACTTCCCCTTTCTCTCATCAGACGCCTTTGGATTCTGCGATTCTTGTCAGGTGATCTATGTTGTGTTTTTGCTTCGTACCGGTGTTCTTTTCCGTTTGAATTGGGGTAGTCCAGATTTATCTGGATCATTTCATGCATGATTTTGGTTCTGTTGCATGAATTCAACTGCATTGCATTTCTCTTTTGAGCTGAGGCCTTGCGATTCTTGTGTACTCCGATGTTTGACATTTGGCTTGGAATTCATCGTTCTTGTTCTTTTTGACTGATCGGACTCTACTGTGCATGTTTAGATGTTATTCATGGGTTTTCTTTTCCTCCTTTCATGTGGAAGTGTAGGCTTAGTTAAGGAGCTACGGATTTTTGAGCAGCTGTTTGTTAGATTTCATTACTTATCTATGGGTTTTAGGATGGCTTTCGTTGTGTTAGGGGTTGGTGCTGGACACTTGAATCCTATAATCTGTAAAGCATCAATTACATATAAATATGAATCTAACTTGAGGTATTCTTCTCTAGGTGCTCTACATCTAGTTTTATGGGAAGTTCCTCATCTGACGATTCTGATGTGGACACTGATATCAGTGAATCTGAATTGGATGAGCGGGAGAGCAAGTCCTATGAAGAACTTAAAAATGGAAAACGCATTGTAAAACTCTCGCATGAGACATTTACTTGCCCCTACTGCTCGAGAAAAAGAAAGAGGGATTTCTTATATAAGGATCTCCTGCAGCATGCTTCTGGGGTAGGCAACAGCCCTTCAAATAAACGGAGTGCCAAAGAAAAAGCTAATCATTTAGCTTTAGTGAAATATTTGGAAAAAGATCTAGCTGATGCTGTTGGTCCATCAAAACCTGCAAGCAATAATGATCCTGTTATGGATTGCGATCACGATGAAAAGTTTGTGTGGCCATGGAGAGGAATTGTGGTAAACATTCCGACTAGGCGTACAGATGATGGGCGATATGTGGGAGAGAGTGGATCAAAGTTTAGGGATGAGTTGAAAGAAAGAGGATTTAATCCCACAAGAGTTACTCCCTTGTGGAATTACCGGGGTCACTCCGGTTGTGCTATCGTGGAATTTAATAAAGATTGGCCCGGTTTGCATAATGCTATTTCTTTTGAGAGGGCTTATGAGGCAGATCACCATGGAAAAAAGGATTGGCTGGCTAATGGTACTGAGAAACTAGGAGTTTATGCCTGGGTTGCTCGTGCTGATGATTACAACTCGAGTAATATAATCGGGGAACATTTGCGCAAGATTGGAGACCTGAAGACCGTATCTGAAATTATTGAGGAGGAAGCACGGAAGCAGGATAGACTTGTGTCCAATCTTACAAGTATCATCGAGCTCAAGAACAAACATTTGAGAGAGATGGAGGAAAGATGTAGTGAAACTGCCACCACTCTTAACAATTTGATGGTGGAGAGAGATAAATTACTTCAAGCTTATAACGAAGGTTTCTTATATAGAGCTTGAAAATTTTCTTTCTTGCACTTCAAAACCAGGAAATCTGTTCCATGTCATTGCAGATTACTTTTTCTTTCATGGACTTTCATGTTGTATCGATAAATTCATATAAGGTTTGTCACATCTACAGAGATAAAAAAAATTCAATTGGGTGCAAGGGATCACCTTAAGAAGATCTTCAGTGATCATGAAAAACTAAAGTTGCAACTAGAATCTCAGAAAAAAGAGTTC
GAGTTAAGAGGAAGAGAACTGGAGATGCGTGAAGCACAAAATGAACATGAGAGCAAGTATTTGGCTGAAGAAATTGAGAAGGTACACTTTTCTTTTCTTCTTTCCCAGGAATTAAGAGATGCTGAAATTACGAAAAGATAAGTTGCATTCGACTTCACTTGAACTTTGGTTTTCAAAACAGCATTTTTTTTTAAAAAAAAGATCAATACCTGTCACCTTTTTCTTTTATTGTTTGCAAGTTTATTTTTTTAAGAATTGTTTTCAATCACACTATAGTAAAACAAAGTTGTAAGGCATAATCTTTGTTGCACACTATAGTCTTTAGCATTTTGGGATGTGTTGCACTTAAATCTTCGTGGAAATATTTCAACTGGGTTTTTTCCTTGCTCTTACTTCTTTCTTCACCTCTTCGACCTTTAAAGGCATTTGGCTCACGAGTTGGATTTGTATCGTTTGGAGTTTGAAATTTGGTAGGTAGGTTTTTCTTATTCTAGGCCACAAGTCTGCTAATCTCTTGGGTCATATGCTTTTAGGCATCTAATGCCCCTCTCTTGTGCTACATGCTTACTCAGAAATCACAAACTCACGAATCAGATGCAAACAACGTAAATTTAGGTTCATTTGATTTATATTCTTTTTTGCCTGTCAGTATGAGGTGAGAAATAGTTCTCTTCAATTGGCTGAGTTAGAGCAACAGAAGGCTGATGAAGATTTTATGAAGCTGGCAGATGATCAGAAGGTTTGTTGGCACATCTGCTTTATTAATTATCCAAATATTTCTTTATCAAATCTTGAGAGCAGGGTGTACTTATTTGTGATTTCTAATGATTCAGAAACAAAAGGAGGACCTCCATAATAGAATAATCCGACTGGAAAAGCAACTGGATACCAAGCAAGCATTAGAGTTGGAAATTGAGCGTCTACGTGGGTCGTTGAATGTTATGAAGCACATGGGAGATGATGAGGATGTGGAAGTCCTCCAGAAGGCAGAGACAATACTAAAAAGTTTGAGTGAAAAGGAAGGAGATCTTGAAGCTCTTGATGAACTTAACCAAACATTGATAGTAAAGCAGCGCAAGAGTAACGACGAACTCCAAGAAGCCCGTAAAGAGATAGTGAATGTAAGAATATTTTTTTATTGCAAATAACTTTCTTAGTCATTTAAACAAGATAAAAATTCAAGATTTTGTTCTAGTCTAAAACAAACTCATACTGTTTAATTCTCTCATTTCTTAGGCTTTTAAAGATTTGCCTGGTCGTTCCCACTTGCGTGTTAAGAGAATGGGTGAACTAGATACAAAACCATTCCATGAAGCAGCGAAGAAAAGATATAATGAGGATGAAGCAGATGAAAGAGCTTCAGAGTTGTGCTCATTATGGGCAGAATATCTCAAGGACCCAGATTGGCATCCTTTCAAAGTAATTAAGGAAGAAGGAAGGGATAACGAGGAAGGAAAGGTAATTCTTTTGCTACCTCCTTTCATCAGTTTAGCTTCTTGCAACTAAGAAATGTTGGCATTTTTTGTCTTTAGAAGATGAGATTATGTTTAAGAACTTCAAGTTTCTTGCCCCATTACTAAATTGTTTCTTGTTCGTAAACATTCTTGAGTTTGTATTTTTATTTTTGGCTCACTGGGACGATTCATACTCTTTTGAATTTTAGAGGCATTCTCGAGCTACATAAGTTTTCGTTACCTGAAGGCCCTTTAGATATTTATATTTAACTCTTATGGTGATGATTTAGGAAATTGAAGTTTTGGATGATGAAGATGAGAAACTGCAAGATCTGAAAAATGAGTGGGGGGAGGAAGTGTTCAAGGCTGTGACAGCAGCTTTAAGGGAGATAAATGAATATAATCCAAGTGGAAGGTATATAGTATCGGAGCTATGGAACTACCAAGAGGACAGGAAAGCAACGTTGCGAGAGGGAGTGAAATTCTTACTGGACAAGTTGAAAAGAAGCAACTAGGAAAGGGAACTCCCAAAGGTTGGTATTCGTT
GTTGTGGTTGATAAACTTTTCTGGCTTTTTGACTGGAAATAAGAACAAAAATTCTTATTAGAGCTTGAGCTTGAGCTTGAATCTTCTTGAAGTAACTTGCTTATCTATTGATTTGATCATTTTGTCAATAACATTTGATTTGTTGAACTGCTGCAGCTGCCATGATGATCAAACCAACATATCATTGTCTTGAAACAAAGCGCAACAATGTTCTACACTTTAGATGAAGTATCACCAGGAGATGGAATGTGAGCCATTACGTTATGTCGGTGTATGTTTTCAAGTCAGTCATTTCTTTCCTTCGTACCAATTATTTCCAGATTATGAAGATTCAGATACTGCTGACTTGCTTATCTATTATCATTTACCCATCTTACTATTTAATTTGTTTGAAGATAAACCTATGATACTTTATCGTCTCCGATAACCTGCATGTCATGTGACGTCCAGTCTAGATACCGAATGTTTCTATCTTATTTATCTTCCAATAATC

mRNA sequence

CATGGCTCGCCGCCACTTGGGGAGGCTCAGAAGTCAATAAGCTAAATTTGTTGGTGAAGCTCACAAGCTTCAGCCATGGAGATCAACGCTTTCGGAGTGCATGAGCCTCTAATGATTCCATCAACGCGCTCACACGCGCTACGATTTTCGTCAGGTTTTGGTCTGAGCCTTAGCGAGCGTACCCAGCGCAGGCCCCAATTCGACTTTCTCGCATTTCCACGGCGACTCACTTTTCCATTGTCGAGAACAGACTCGGTCCTCACTCCAACCCTGAGTCTTTGATTTGCTCTCAATTCGTGAATTCACTTCCCCTTTCTCTCATCAGACGCCTTTGGATTCTGCGATTCTTGTCAGGTGCTCTACATCTAGTTTTATGGGAAGTTCCTCATCTGACGATTCTGATGTGGACACTGATATCAGTGAATCTGAATTGGATGAGCGGGAGAGCAAGTCCTATGAAGAACTTAAAAATGGAAAACGCATTGTAAAACTCTCGCATGAGACATTTACTTGCCCCTACTGCTCGAGAAAAAGAAAGAGGGATTTCTTATATAAGGATCTCCTGCAGCATGCTTCTGGGGTAGGCAACAGCCCTTCAAATAAACGGAGTGCCAAAGAAAAAGCTAATCATTTAGCTTTAGTGAAATATTTGGAAAAAGATCTAGCTGATGCTGTTGGTCCATCAAAACCTGCAAGCAATAATGATCCTGTTATGGATTGCGATCACGATGAAAAGTTTGTGTGGCCATGGAGAGGAATTGTGGTAAACATTCCGACTAGGCGTACAGATGATGGGCGATATGTGGGAGAGAGTGGATCAAAGTTTAGGGATGAGTTGAAAGAAAGAGGATTTAATCCCACAAGAGTTACTCCCTTGTGGAATTACCGGGGTCACTCCGGTTGTGCTATCGTGGAATTTAATAAAGATTGGCCCGGTTTGCATAATGCTATTTCTTTTGAGAGGGCTTATGAGGCAGATCACCATGGAAAAAAGGATTGGCTGGCTAATGGTACTGAGAAACTAGGAGTTTATGCCTGGGTTGCTCGTGCTGATGATTACAACTCGAGTAATATAATCGGGGAACATTTGCGCAAGATTGGAGACCTGAAGACCGTATCTGAAATTATTGAGGAGGAAGCACGGAAGCAGGATAGACTTGTGTCCAATCTTACAAGTATCATCGAGCTCAAGAACAAACATTTGAGAGAGATGGAGGAAAGATGTAGTGAAACTGCCACCACTCTTAACAATTTGATGGTGGAGAGAGATAAATTACTTCAAGCTTATAACGAAGAGATAAAAAAAATTCAATTGGGTGCAAGGGATCACCTTAAGAAGATCTTCAGTGATCATGAAAAACTAAAGTTGCAACTAGAATCTCAGAAAAAAGAGTTCGAGTTAAGAGGAAGAGAACTGGAGATGCGTGAAGCACAAAATGAACATGAGAGCAAGTATTTGGCTGAAGAAATTGAGAAGTATGAGGTGAGAAATAGTTCTCTTCAATTGGCTGAGTTAGAGCAACAGAAGGCTGATGAAGATTTTATGAAGCTGGCAGATGATCAGAAGAAACAAAAGGAGGACCTCCATAATAGAATAATCCGACTGGAAAAGCAACTGGATACCAAGCAAGCATTAGAGTTGGAAATTGAGCGTCTACGTGGGTCGTTGAATGTTATGAAGCACATGGGAGATGATGAGGATGTGGAAGTCCTCCAGAAGGCAGAGACAATACTAAAAAGTTTGAGTGAAAAGGAAGGAGATCTTGAAGCTCTTGATGAACTTAACCAAACATTGATAGTAAAGCAGCGCAAGAGTAACGACGAACTCCAAGAAGCCCGTAAAGAGATAGTGAATGCTTTTAAAGATTTGCCTGGTCGTTCCCACTTGCGTGTTAAGAGAATGGGTGAACTAGATACAAAACCATTCCATGAAGCAGCGAAGAAAAGATATAATGAGGATGAAGCAGATGAAAGAGCTTCAGAGTTGTGCTCATTATGGG
CAGAATATCTCAAGGACCCAGATTGGCATCCTTTCAAAGTAATTAAGGAAGAAGGAAGGGATAACGAGGAAGGAAAGGAAATTGAAGTTTTGGATGATGAAGATGAGAAACTGCAAGATCTGAAAAATGAGTGGGGGGAGGAAGTGTTCAAGGCTGTGACAGCAGCTTTAAGGGAGATAAATGAATATAATCCAAGTGGAAGGTATATAGTATCGGAGCTATGGAACTACCAAGAGGACAGGAAAGCAACGTTGCGAGAGGGAGTGAAATTCTTACTGGACAAGTTGAAAAGAAGCAACTAGGAAAGGGAACTCCCAAAGCTGCCATGATGATCAAACCAACATATCATTGTCTTGAAACAAAGCGCAACAATGTTCTACACTTTAGATGAAGTATCACCAGGAGATGGAATGTGAGCCATTACGTTATGTCGGTGTATGTTTTCAAGTCAGTCATTTCTTTCCTTCGTACCAATTATTTCCAGATTATGAAGATTCAGATACTGCTGACTTGCTTATCTATTATCATTTACCCATCTTACTATTTAATTTGTTTGAAGATAAACCTATGATACTTTATCGTCTCCGATAACCTGCATGTCATGTGACGTCCAGTCTAGATACCGAATGTTTCTATCTTATTTATCTTCCAATAATC

Coding sequence (CDS)

ATGGGAAGTTCCTCATCTGACGATTCTGATGTGGACACTGATATCAGTGAATCTGAATTGGATGAGCGGGAGAGCAAGTCCTATGAAGAACTTAAAAATGGAAAACGCATTGTAAAACTCTCGCATGAGACATTTACTTGCCCCTACTGCTCGAGAAAAAGAAAGAGGGATTTCTTATATAAGGATCTCCTGCAGCATGCTTCTGGGGTAGGCAACAGCCCTTCAAATAAACGGAGTGCCAAAGAAAAAGCTAATCATTTAGCTTTAGTGAAATATTTGGAAAAAGATCTAGCTGATGCTGTTGGTCCATCAAAACCTGCAAGCAATAATGATCCTGTTATGGATTGCGATCACGATGAAAAGTTTGTGTGGCCATGGAGAGGAATTGTGGTAAACATTCCGACTAGGCGTACAGATGATGGGCGATATGTGGGAGAGAGTGGATCAAAGTTTAGGGATGAGTTGAAAGAAAGAGGATTTAATCCCACAAGAGTTACTCCCTTGTGGAATTACCGGGGTCACTCCGGTTGTGCTATCGTGGAATTTAATAAAGATTGGCCCGGTTTGCATAATGCTATTTCTTTTGAGAGGGCTTATGAGGCAGATCACCATGGAAAAAAGGATTGGCTGGCTAATGGTACTGAGAAACTAGGAGTTTATGCCTGGGTTGCTCGTGCTGATGATTACAACTCGAGTAATATAATCGGGGAACATTTGCGCAAGATTGGAGACCTGAAGACCGTATCTGAAATTATTGAGGAGGAAGCACGGAAGCAGGATAGACTTGTGTCCAATCTTACAAGTATCATCGAGCTCAAGAACAAACATTTGAGAGAGATGGAGGAAAGATGTAGTGAAACTGCCACCACTCTTAACAATTTGATGGTGGAGAGAGATAAATTACTTCAAGCTTATAACGAAGAGATAAAAAAAATTCAATTGGGTGCAAGGGATCACCTTAAGAAGATCTTCAGTGATCATGAAAAACTAAAGTTGCAACTAGAATCTCAGAAAAAAGAGTTCGAGTTAAGAGGAAGAGAACTGGAGATGCGTGAAGCACAAAATGAACATGAGAGCAAGTATTTGGCTGAAGAAATTGAGAAGTATGAGGTGAGAAATAGTTCTCTTCAATTGGCTGAGTTAGAGCAACAGAAGGCTGATGAAGATTTTATGAAGCTGGCAGATGATCAGAAGAAACAAAAGGAGGACCTCCATAATAGAATAATCCGACTGGAAAAGCAACTGGATACCAAGCAAGCATTAGAGTTGGAAATTGAGCGTCTACGTGGGTCGTTGAATGTTATGAAGCACATGGGAGATGATGAGGATGTGGAAGTCCTCCAGAAGGCAGAGACAATACTAAAAAGTTTGAGTGAAAAGGAAGGAGATCTTGAAGCTCTTGATGAACTTAACCAAACATTGATAGTAAAGCAGCGCAAGAGTAACGACGAACTCCAAGAAGCCCGTAAAGAGATAGTGAATGCTTTTAAAGATTTGCCTGGTCGTTCCCACTTGCGTGTTAAGAGAATGGGTGAACTAGATACAAAACCATTCCATGAAGCAGCGAAGAAAAGATATAATGAGGATGAAGCAGATGAAAGAGCTTCAGAGTTGTGCTCATTATGGGCAGAATATCTCAAGGACCCAGATTGGCATCCTTTCAAAGTAATTAAGGAAGAAGGAAGGGATAACGAGGAAGGAAAGGAAATTGAAGTTTTGGATGATGAAGATGAGAAACTGCAAGATCTGAAAAATGAGTGGGGGGAGGAAGTGTTCAAGGCTGTGACAGCAGCTTTAAGGGAGATAAATGAATATAATCCAAGTGGAAGGTATATAGTATCGGAGCTATGGAACTACCAAGAGGACAGGAAAGCAACGTTGCGAGAGGGAGTGAAATTCTTACTGGACAAGTTGAAAAGAAGCAACTAG

Protein sequence

MGSSSSDDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEKFVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKDWPGLHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLRKIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSNDELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSLWAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALREINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLKRSN
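The protein sequence above is what one would expect from translating the CDS codon by codon. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and it is applied here only to the first 36 and the last 15 bases of the CDS rather than to the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGAAGTTCCTCATCTGACGATTCTGATGTGGAC"  # first 12 codons of the CDS
cds_end = "AAAAGAAGCAACTAG"                          # last 5 codons of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MGSSSSDDSDVD` and `KRSN`, which match the first twelve and last four residues of the listed protein. The sketch does not handle ambiguous bases such as `N`; a dedicated library would be a better choice for anything beyond a quick consistency check.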
BLAST of Cp4.1LG01g04820 vs. Swiss-Prot
Match: IDN2_ARATH (Protein INVOLVED IN DE NOVO 2 OS=Arabidopsis thaliana GN=IDN2 PE=1 SV=1)

HSP 1 Score: 713.0 bits (1839), Expect = 3.0e-204
Identity = 373/648 (57.56%), Positives = 479/648 (73.92%), Query Frame = 1

Query: 1   MGSS---SSDDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRD 60
           MGS+   SSDD D  +DISESE+DE   K Y  LK GK  V+LS + F CPYC  K+K  
Sbjct: 1   MGSTVILSSDDED--SDISESEMDEYGDKMYLNLKGGKLKVRLSPQAFICPYCPNKKKTS 60

Query: 61  FLYKDLLQHASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPAS----NNDPV 120
           F YKDLLQHASGVGNS S+KRSAKEKA+HLALVKYL++DLAD+   ++P+S    N +P+
Sbjct: 61  FQYKDLLQHASGVGNSNSDKRSAKEKASHLALVKYLQQDLADSASEAEPSSKRQKNGNPI 120

Query: 121 MDCDHDEKFVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRG 180
            DCDHDEK V+PW+GIVVNIPT +  DGR  GESGSK RDE   RGFNPTRV PLWNY G
Sbjct: 121 QDCDHDEKLVYPWKGIVVNIPTTKAQDGRSAGESGSKLRDEYILRGFNPTRVRPLWNYLG 180

Query: 181 HSGCAIVEFNKDWPGLHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSN 240
           HSG AIVEFNKDW GLHN + F++AY  D HGKKDWL     KLG+Y W+ARADDYN +N
Sbjct: 181 HSGTAIVEFNKDWNGLHNGLLFDKAYTVDGHGKKDWLKKDGPKLGLYGWIARADDYNGNN 240

Query: 241 IIGEHLRKIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNN 300
           IIGE+LRK GDLKT++E+ EEEARKQ+ LV NL  ++E K K ++E+EE CS  +  LN 
Sbjct: 241 IIGENLRKTGDLKTIAELTEEEARKQELLVQNLRQLVEEKKKDMKEIEELCSVKSEELNQ 300

Query: 301 LMVERDKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREA 360
           LM E++K  Q +  E+  IQ     H++KI  DHEKLK  LES++K+ E++  EL  RE 
Sbjct: 301 LMEEKEKNQQKHYRELNAIQERTMSHIQKIVDDHEKLKRLLESERKKLEIKCNELAKREV 360

Query: 361 QNEHESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEK 420
            N  E   L+E++E+   +NSSL+LA +EQQKADE+  KLA+DQ++QKE+LH +IIRLE+
Sbjct: 361 HNGTERMKLSEDLEQNASKNSSLELAAMEQQKADEEVKKLAEDQRRQKEELHEKIIRLER 420

Query: 421 QLDTKQALELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQT 480
           Q D KQA+ELE+E+L+G LNVMKHM  D D EV+++ + I K L EKE  L  LD+ NQT
Sbjct: 421 QRDQKQAIELEVEQLKGQLNVMKHMASDGDAEVVKEVDIIFKDLGEKEAQLADLDKFNQT 480

Query: 481 LIVKQRKSNDELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADE 540
           LI+++R++NDELQEA KE+VN  K+    +++ VKRMGEL TKPF +A +++Y + + ++
Sbjct: 481 LILRERRTNDELQEAHKELVNIMKE--WNTNIGVKRMGELVTKPFVDAMQQKYCQQDVED 540

Query: 541 RASELCSLWAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFK 600
           RA E+  LW  YLKD DWHPFK +K E  D    +E+EV+DD DEKL++LK + G+  + 
Sbjct: 541 RAVEVLQLWEHYLKDSDWHPFKRVKLENED----REVEVIDDRDEKLRELKADLGDGPYN 600

Query: 601 AVTAALREINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLKRS 642
           AVT AL EINEYNPSGRYI +ELWN++ D+KATL EGV  LLD+ +++
Sbjct: 601 AVTKALLEINEYNPSGRYITTELWNFKADKKATLEEGVTCLLDQWEKA 640

BLAST of Cp4.1LG01g04820 vs. Swiss-Prot
Match: FDM3_ARATH (Factor of DNA methylation 3 OS=Arabidopsis thaliana GN=FDM3 PE=2 SV=1)

HSP 1 Score: 590.5 bits (1521), Expect = 2.2e-167
Identity = 322/634 (50.79%), Positives = 436/634 (68.77%), Query Frame = 1

Query: 18  SELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHASGVGNSPSNK 77
           ++L + E   Y++LK+GK  VK+S+ TF CPYC   +K+  LY D+LQHASGVGNS S K
Sbjct: 3   NKLSDFEKNLYKKLKSGKLEVKVSYRTFLCPYCPDNKKKVGLYVDILQHASGVGNSQSKK 62

Query: 78  RSAKEKANHLALVKYLEKDLADAVGPSK-----------PASNNDP--VMDCDHDEKFVW 137
           RS  EKA+H AL KYL KDLA     +            PA   D   + D    EK VW
Sbjct: 63  RSLTEKASHRALAKYLIKDLAHYATSTISKRLKARTSFIPAETGDAPIIYDDAQFEKLVW 122

Query: 138 PWRGIVVNIPTRRTDDGRY-VGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 197
           PW+G++VNIPT  T+DGR   GESG K +DEL  RGFNP RV  +W+  GHSG  IVEFN
Sbjct: 123 PWKGVLVNIPTTSTEDGRSCTGESGPKLKDELIRRGFNPIRVRTVWDRFGHSGTGIVEFN 182

Query: 198 KDWPGLHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLRKIG 257
           +DW GL +A+ F++AYE D HGKKDWL   T+   +YAW+A ADDY  +NI+GE+LRK+G
Sbjct: 183 RDWNGLQDALVFKKAYEGDGHGKKDWLCGATDS-SLYAWLANADDYYRANILGENLRKMG 242

Query: 258 DLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLLQ 317
           DLK++    EEEARK  +L+  L  ++E K   L++++ + S+ +  L     E++K+L+
Sbjct: 243 DLKSIYRFAEEEARKDQKLLQRLNFMVENKQYRLKKLQIKYSQDSVKLKYETEEKEKILR 302

Query: 318 AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYLA 377
           AY+E++   Q  + DH  +IF+DHEK K+QLESQ KE E+R  EL  REA+NE + K +A
Sbjct: 303 AYSEDLTGRQQKSTDHFNRIFADHEKQKVQLESQIKELEIRKLELAKREAENETQRKIVA 362

Query: 378 EEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALEL 437
           +E+E+    NS +QL+ LEQQK  E   +LA D K QKE LH RI  LE+QLD KQ LEL
Sbjct: 363 KELEQNAAINSYVQLSALEQQKTREKAQRLAVDHKMQKEKLHKRIAALERQLDQKQELEL 422

Query: 438 EIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSND 497
           E+++L+  L+VM+ +  D   E++ K ET L+ LSE EG+L  L++ NQ L+V++RKSND
Sbjct: 423 EVQQLKSQLSVMRLVELDSGSEIVNKVETFLRDLSETEGELAHLNQFNQDLVVQERKSND 482

Query: 498 ELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSLWA 557
           ELQEAR+ +++  +D+    H+ VKRMGELDTKPF +A + +Y +++ ++ A E+  LW 
Sbjct: 483 ELQEARRALISNLRDM--GLHIGVKRMGELDTKPFMKAMRIKYCQEDLEDWAVEVIQLWE 542

Query: 558 EYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALREIN 617
           EYLKDPDWHPFK IK E  +      +EV+D++DEKL+ LKNE G++ ++AV  AL EIN
Sbjct: 543 EYLKDPDWHPFKRIKLETAET----IVEVIDEDDEKLRTLKNELGDDAYQAVANALLEIN 602

Query: 618 EYNPSGRYIVSELWNYQEDRKATLREGVKFLLDK 638
           EYNPSGRYI SELWN++EDRKATL EGV  LL++
Sbjct: 603 EYNPSGRYISSELWNFREDRKATLEEGVNSLLEQ 629

BLAST of Cp4.1LG01g04820 vs. Swiss-Prot
Match: FDM5_ARATH (Factor of DNA methylation 5 OS=Arabidopsis thaliana GN=FDM5 PE=2 SV=1)

HSP 1 Score: 448.4 bits (1152), Expect = 1.4e-124
Identity = 263/639 (41.16%), Positives = 404/639 (63.22%), Query Frame = 1

Query: 7   DDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQH 66
           + SD +++ISESE+D    K YE+L NG   VK+  +TF CP+C+ K+K+ + YK+LL H
Sbjct: 3   NSSDEESEISESEIDVYYEKPYEKLMNGDYKVKVK-DTFRCPFCAGKKKQHYKYKELLAH 62

Query: 67  ASGVGNSPSNKRSAKEKANHLALVKYLEKDLA---DAVGPSKPASNNDPVMDCDHDEKFV 126
           ASGV    S  RSAK+KANH AL KY+E +LA   D   P  P+S+ +       D+ +V
Sbjct: 63  ASGVAKG-SASRSAKQKANHFALAKYMENELAGDADVPRPQIPSSSTEQ-SQAVVDDIYV 122

Query: 127 WPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 186
           WPW GIV+N P RRTD+   + +S    +   K   FNP  V  LW  +      I +FN
Sbjct: 123 WPWMGIVIN-PVRRTDNKNVLLDSAYWLK---KLARFNPLEVKTLWLDQESVVAVIPQFN 182

Query: 187 KDWPGLHNAISFERAYEADHHGKKDWL-ANGTEKLGVYAWVARADDYNSSNIIGEHLRKI 246
             W G  +    E+ YE    G+KDW+   G  +   Y W ARADDYNS   I E+L K+
Sbjct: 183 SGWSGFKSVTELEKEYEIRGCGRKDWIDKRGDWRSKAYGWCARADDYNSQGSIAEYLSKV 242

Query: 247 GDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLL 306
           G L++ S+I +EE + +  +V +L + I + N+ L +++   +E   +L  +++E+D+L 
Sbjct: 243 GKLRSFSDITKEEIQNKSIVVDDLANKIAMTNEDLNKLQYMNNEKTLSLRRVLIEKDELD 302

Query: 307 QAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYL 366
           + Y +E KK+Q  +R+ + +IF + E+L  +LE++    ++  ++L+ ++A  E E + L
Sbjct: 303 RVYKQETKKMQELSREKINRIFREKERLTNELEAKMNNLKIWSKQLDKKQALTELERQKL 362

Query: 367 AEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALE 426
            E+ +K +V NSSLQLA LEQ+K D+  ++L D+ K++KE+  N+I++LEK+LD+KQ L+
Sbjct: 363 DEDKKKSDVMNSSLQLASLEQKKTDDRVLRLVDEHKRKKEETLNKILQLEKELDSKQKLQ 422

Query: 427 LEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSN 486
           +EI+ L+G L VMKH  D++D  + +K + + + L EK  +L+ L++ N  L+VK+RKSN
Sbjct: 423 MEIQELKGKLKVMKH-EDEDDEGIKKKMKKMKEELEEKCSELQDLEDTNSALMVKERKSN 482

Query: 487 DELQEARKEIVNAFKDL-PGRSHLRVKRMGELDTKPFHEAAKKRYN-EDEADERASELCS 546
           DE+ EARK ++   ++L   R+ +RVKRMGEL+ KPF  A ++R   E+EA  + + LCS
Sbjct: 483 DEIVEARKFLITELRELVSDRNIIRVKRMGELEEKPFMTACRQRCTVEEEAQVQYAMLCS 542

Query: 547 LWAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALR 606
            W E +KD  W PFK +    R        EV+D+EDE+++ L+ EWGEEV  AV  AL 
Sbjct: 543 KWQEKVKDSAWQPFKHVGTGDRKK------EVVDEEDEEIKKLREEWGEEVKNAVKTALE 602

Query: 607 EINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
           E+NE+NPSGRY V ELWN ++ RKATL+E + ++  ++K
Sbjct: 603 ELNEFNPSGRYSVPELWNSKQGRKATLKEVIDYITQQVK 627

BLAST of Cp4.1LG01g04820 vs. Swiss-Prot
Match: FDM1_ARATH (Factor of DNA methylation 1 OS=Arabidopsis thaliana GN=FDM1 PE=1 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 2.2e-122
Identity = 265/640 (41.41%), Positives = 391/640 (61.09%), Query Frame = 1

Query: 9   SDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHAS 68
           SD + +ISESE+++     Y  L++G   VK++ +   CP+C+ K+K+D+ YK+L  HA+
Sbjct: 4   SDEEAEISESEIEDYSETPYRLLRDGTYKVKVNGQ-LRCPFCAGKKKQDYKYKELYAHAT 63

Query: 69  GVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEK------- 128
           GV    S  RSA +KANHLAL  +LE +LA   G ++P     PV+    DE        
Sbjct: 64  GVSKG-SATRSALQKANHLALAMFLENELA---GYAEPVPR-PPVVPPQLDETEPNPHNV 123

Query: 129 FVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 188
           +VWPW GIVVN P +  DD   + +S    +   K   F P  V   W  +      I +
Sbjct: 124 YVWPWMGIVVN-PLKEADDKELLLDSAYWLQTLSK---FKPIEVNAFWVEQDSIVGVIAK 183

Query: 189 FNKDWPGLHNAISFERAYEADHHGKKDWLA-NGTEKLGVYAWVARADDYNSSNIIGEHLR 248
           FN DW G   A   E+ +E     KK+W   +G  +   Y W ARADD+ S   IGE+L 
Sbjct: 184 FNGDWSGFAGATELEKEFETQGSSKKEWTERSGDSESKAYGWCARADDFESQGPIGEYLS 243

Query: 249 KIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDK 308
           K G L+TVS+I ++  + ++ ++  L+ +I + N+ L +++   + TA +L  ++ E+  
Sbjct: 244 KEGQLRTVSDISQKNVQDRNTVLEELSDMIAMTNEDLNKVQYSYNRTAMSLQRVLDEKKN 303

Query: 309 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESK 368
           L QA+ +E KK+Q  +  H++KI  D EKL  +L+ + ++ E R ++LE  EA  E + +
Sbjct: 304 LHQAFADETKKMQQMSLRHIQKILYDKEKLSNELDRKMRDLESRAKQLEKHEALTELDRQ 363

Query: 369 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQA 428
            L E+  K +  N SLQLA  EQ+KADE  ++L ++ ++QKED  N+I+ LEKQLDTKQ 
Sbjct: 364 KLDEDKRKSDAMNKSLQLASREQKKADESVLRLVEEHQRQKEDALNKILLLEKQLDTKQT 423

Query: 429 LELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRK 488
           LE+EI+ L+G L VMKH+GDD+D  V +K + +   L +K+ +LE L+ +N  L+ K+R+
Sbjct: 424 LEMEIQELKGKLQVMKHLGDDDDEAVQKKMKEMNDELDDKKAELEGLESMNSVLMTKERQ 483

Query: 489 SNDELQEARKEIVNAFKDLPG-RSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELC 548
           SNDE+Q ARK+++     L G  + + VKRMGELD KPF +  K RY+ +EA   A+ LC
Sbjct: 484 SNDEIQAARKKLIAGLTGLLGAETDIGVKRMGELDEKPFLDVCKLRYSANEAAVEAATLC 543

Query: 549 SLWAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAAL 608
           S W E LK+P W PF   K EG    +G E EV+D++DE+L+ LK EWG+EV  AV  AL
Sbjct: 544 STWQENLKNPSWQPF---KHEG--TGDGAE-EVVDEDDEQLKKLKREWGKEVHNAVKTAL 603

Query: 609 REINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
            E+NEYN SGRY   ELWN++E RKATL+E + F+ + +K
Sbjct: 604 VEMNEYNASGRYTTPELWNFKEGRKATLKEVITFISNDIK 627

BLAST of Cp4.1LG01g04820 vs. Swiss-Prot
Match: FDM2_ARATH (Factor of DNA methylation 2 OS=Arabidopsis thaliana GN=FDM2 PE=1 SV=1)

HSP 1 Score: 440.3 bits (1131), Expect = 3.7e-122
Identity = 265/638 (41.54%), Positives = 386/638 (60.50%), Query Frame = 1

Query: 7   DDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQH 66
           D SD +++ISESE++E     Y  L++        +    CP+C  K+K+D+ YK+L  H
Sbjct: 2   DISDEESEISESEIEEYSKTPYHLLRSETYYKVKVNGRLRCPFCVGKKKQDYKYKELHAH 61

Query: 67  ASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEK---FV 126
           A+GV    S  RSA +K+NHLAL K+LE DLA    P        P++D         +V
Sbjct: 62  ATGVSKG-SATRSALQKSNHLALAKFLENDLAGYAEPLPRPPVVPPLLDETEPNPHNVYV 121

Query: 127 WPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 186
           WPW GIVVN P + TDD   + +S    +   K   F P  V   W  +      I +F+
Sbjct: 122 WPWMGIVVN-PLKETDDKELLLDSVYWLQTLSK---FKPVEVNAFWVEQDSIVGVIAKFD 181

Query: 187 KDWPGLHNAISFERAYEADHHGKKDWLA-NGTEKLGVYAWVARADDYNSSNIIGEHLRKI 246
            DW G   A   E+ +E     KK+W   +G  +   Y W ARADD+ S   IGE+L K 
Sbjct: 182 SDWSGFAAATELEKEFETQGSCKKEWTERSGDSESKAYGWCARADDFQSQGPIGEYLSKE 241

Query: 247 GDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLL 306
           G L+TVS+I++   + ++ L+  L+++I++ N+ L + +   + TA +L  ++ E+  L 
Sbjct: 242 GTLRTVSDILQNNVQDRNTLLDVLSNMIDMTNEDLNKAQHSYNRTAMSLQRVLDEKKNLH 301

Query: 307 QAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYL 366
           QA+ EE KK+Q  +  H+++I  D EKL+ +L+ + ++ E R ++LE  EA  E E + L
Sbjct: 302 QAFAEETKKMQQMSLRHIQRILYDKEKLRNELDRKMRDLESRAKQLEKHEALTELERQKL 361

Query: 367 AEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALE 426
            E+  K +  N SLQLA  EQ+KADE  ++L ++ ++QKED  N+I+ LEKQLDTKQ LE
Sbjct: 362 DEDKRKSDAMNKSLQLASREQKKADESVLRLVEEHQRQKEDALNKILLLEKQLDTKQTLE 421

Query: 427 LEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSN 486
           +EI+ L+G L VMKH+GDD+D  V  K + +   L +K+ +LE L+ +N  L+ K+R+SN
Sbjct: 422 MEIQELKGKLQVMKHLGDDDDEAVQTKMKEMNDELDDKKAELEDLESMNSVLMTKERQSN 481

Query: 487 DELQEARKEIVNAFKDLPG-RSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSL 546
           DE+Q AR++++     L G  S + VKRMGELD KPF +  K RY+ +EA   A+ LCS 
Sbjct: 482 DEIQAARQKMIAGLTGLLGAESDIGVKRMGELDEKPFLDVCKLRYSANEARVEAATLCST 541

Query: 547 WAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALRE 606
           W E LK+P W PFK  +E   D  E    EV+D++DE+L+ LK EWG+EV  AV AAL E
Sbjct: 542 WKENLKNPSWQPFK--REGTGDGAE----EVVDEDDEQLKKLKREWGKEVHNAVKAALVE 601

Query: 607 INEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
           +NEYN SGRY  SELWN++E RKATL+E + F+   +K
Sbjct: 602 MNEYNASGRYPTSELWNFKEGRKATLKEVITFISTDIK 628

BLAST of Cp4.1LG01g04820 vs. TrEMBL
Match: A0A0A0KNW6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G182100 PE=4 SV=1)

HSP 1 Score: 1104.0 bits (2854), Expect = 0.0e+00
Identity = 569/644 (88.35%), Positives = 607/644 (94.25%), Query Frame = 1

Query: 4   SSSDDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDL 63
           SS+DDSDVDTD+SESE+DERESKSY+ELKNGKRIVKLSHETFTCPYC++KRKRDFLYKDL
Sbjct: 141 SSTDDSDVDTDVSESEMDERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDL 200

Query: 64  LQHASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPA--SNNDPVMDCDHDEK 123
           LQHASGVG SPSNKRS KEKANHLAL+KYLEKDLADAVGPSKPA  SNNDPVMDC+HDEK
Sbjct: 201 LQHASGVGKSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPATASNNDPVMDCNHDEK 260

Query: 124 FVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 183
           FVWPWRGIVVNIPTRRTDDGR+VG SGSKFRDELKERGFNP+RVTPLWNYRGHSGCAIVE
Sbjct: 261 FVWPWRGIVVNIPTRRTDDGRFVGGSGSKFRDELKERGFNPSRVTPLWNYRGHSGCAIVE 320

Query: 184 FNKDWPGLHNAISFERAYEADHHGKKDWLANGT-EKLGVYAWVARADDYNSSNIIGEHLR 243
           FNKDWPGLHNAISFERAYEAD HGKKDWLANGT EKLGVYAWVARADDYNS+NIIGEHLR
Sbjct: 321 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTEKLGVYAWVARADDYNSNNIIGEHLR 380

Query: 244 KIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDK 303
           KIGDLKT+SEII+EEARKQDRLVSNLTSIIELKNKHL EME+RC+ET+ T+++LM E +K
Sbjct: 381 KIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLTEMEKRCNETSATVDSLMREIEK 440

Query: 304 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESK 363
           LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKL+LESQKKEFELRGRELE REAQNE+ESK
Sbjct: 441 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLKLESQKKEFELRGRELEKREAQNENESK 500

Query: 364 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQA 423
           YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLH+RIIRLEKQLD KQA
Sbjct: 501 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHDRIIRLEKQLDAKQA 560

Query: 424 LELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRK 483
           LELEIERLRG+LNVMKHM D EDV   QKAE+ILK LSEKE DLE LD+LNQ LIVKQRK
Sbjct: 561 LELEIERLRGTLNVMKHMEDAEDV---QKAESILKDLSEKERDLEELDDLNQALIVKQRK 620

Query: 484 SNDELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCS 543
           SNDELQEARKEI+NAFKDLPGRSHLR+KRMGELDTKPFHEA KK YNEDEADERASELCS
Sbjct: 621 SNDELQEARKEIINAFKDLPGRSHLRIKRMGELDTKPFHEAMKKIYNEDEADERASELCS 680

Query: 544 LWAEYLKDPDWHPFKVIKEEGRDNEEG--KEIEVLDDEDEKLQDLKNEWGEEVFKAVTAA 603
           LWAEYLKDPDWHPFKVIK EG+D  +G  KEIE+LDDEDEKL+ LK ++GEEV KAV +A
Sbjct: 681 LWAEYLKDPDWHPFKVIKVEGKDAPDGKEKEIEILDDEDEKLKGLKKDYGEEVCKAVISA 740

Query: 604 LREINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLKRSN 643
           L EINEYNPSGRYI SELWNYQE ++ATLREGV+FLLDKL RSN
Sbjct: 741 LVEINEYNPSGRYITSELWNYQEGKRATLREGVRFLLDKLNRSN 781

BLAST of Cp4.1LG01g04820 vs. TrEMBL
Match: V4SLZ7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10030937mg PE=4 SV=1)

HSP 1 Score: 912.5 bits (2357), Expect = 2.8e-262
Identity = 458/631 (72.58%), Positives = 538/631 (85.26%), Query Frame = 1

Query: 9   SDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHAS 68
           S+ D+DISESE+ + E KSY++LK+G   VK+S E FTCPYC +KRK+++LYKDLLQHAS
Sbjct: 5   SEEDSDISESEMLKYEDKSYKKLKSGNHSVKISDEAFTCPYCPKKRKQEYLYKDLLQHAS 64

Query: 69  GVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEKFVWPWRG 128
           GVGNS SNKRSAKEKANHLAL KYLEKDL DA  PSKP +  DP+  C HDEKFVWPW G
Sbjct: 65  GVGNSTSNKRSAKEKANHLALAKYLEKDLRDAGSPSKPVNEGDPLTGCSHDEKFVWPWTG 124

Query: 129 IVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKDWPG 188
           IVVNIPTRR +DGR VGESGSK RDEL  RGFNPTRV PLWN+RGHSGCA+VEF+KDWPG
Sbjct: 125 IVVNIPTRRAEDGRSVGESGSKLRDELIRRGFNPTRVHPLWNFRGHSGCAVVEFHKDWPG 184

Query: 189 LHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLRKIGDLKTV 248
           LHNA+SFE+AYEADHHGKKDW A+  EK G+YAWVAR+DDYN  NIIG+HLRKIGDLKT+
Sbjct: 185 LHNAMSFEKAYEADHHGKKDWYASNQEKSGLYAWVARSDDYNLKNIIGDHLRKIGDLKTI 244

Query: 249 SEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLLQAYNEE 308
           SE++EEEARKQ+ LVSNLT++IE+K+KHL EM+ER +ET+ ++  LM E+D+LLQ+YNEE
Sbjct: 245 SEMMEEEARKQNLLVSNLTNMIEVKDKHLEEMKERFTETSNSVEKLMEEKDRLLQSYNEE 304

Query: 309 IKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYLAEEIEK 368
           IKKIQL ARDH ++IF+DHEKLKLQLESQKKE ELRG ELE RE QNE++ K LAEEIEK
Sbjct: 305 IKKIQLSARDHFQRIFTDHEKLKLQLESQKKELELRGEELEKRETQNENDRKILAEEIEK 364

Query: 369 YEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALELEIERL 428
             +RN+SLQLA L QQKADE+  KLA+DQKKQKEDLHNRII+LEKQLD KQAL LEIERL
Sbjct: 365 NAMRNNSLQLASLVQQKADENVRKLAEDQKKQKEDLHNRIIQLEKQLDAKQALALEIERL 424

Query: 429 RGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSNDELQEA 488
           +GSLNVMKHMGDD D+EVLQK ET+LK L EKEG+L+ L+ LNQTLI+++RKSNDELQ+A
Sbjct: 425 KGSLNVMKHMGDDGDIEVLQKMETVLKDLREKEGELDDLEALNQTLIIRERKSNDELQDA 484

Query: 489 RKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSLWAEYLKD 548
           RKE++NA K+L GR+H+ +KRMGELD KPF E   ++YNE+EA+ERASELCSLW EYLKD
Sbjct: 485 RKELINALKELAGRAHIGLKRMGELDNKPFLEVMNRKYNEEEAEERASELCSLWEEYLKD 544

Query: 549 PDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALREINEYNPS 608
           PDWHPFKVI        EGK  E++++EDEKL+ LK E GEEV+ AVT AL EINEYNPS
Sbjct: 545 PDWHPFKVI------TAEGKHKEIINEEDEKLKGLKKEMGEEVYIAVTTALVEINEYNPS 604

Query: 609 GRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
           GRYI SELWNY+E RKATL+EGV FL+ + K
Sbjct: 605 GRYITSELWNYKEGRKATLQEGVAFLMKQWK 629

BLAST of Cp4.1LG01g04820 vs. TrEMBL
Match: A0A067G0R1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g0065972mg PE=4 SV=1)

HSP 1 Score: 910.2 bits (2351), Expect = 1.4e-261
Identity = 457/631 (72.42%), Positives = 538/631 (85.26%), Query Frame = 1

Query: 9   SDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHAS 68
           S+ D+DISESE+ + E KSY++LK+G   VK+S E FTCPYC +KRK+++LYKDLLQHAS
Sbjct: 5   SEEDSDISESEMLKYEDKSYKKLKSGNHSVKISDEAFTCPYCPKKRKQEYLYKDLLQHAS 64

Query: 69  GVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEKFVWPWRG 128
           GVGNS SNKRSAKEKANHLAL KYLEKDL DA  PSKP +  DP+  C HDEKFVWPW G
Sbjct: 65  GVGNSTSNKRSAKEKANHLALAKYLEKDLRDAGSPSKPVNEGDPLTGCSHDEKFVWPWTG 124

Query: 129 IVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKDWPG 188
           IVVNIPTRR +DGR VGESGSK RDEL  RGFNPTRV PLWN+RGHSGCA+VEF+KDWPG
Sbjct: 125 IVVNIPTRRAEDGRSVGESGSKLRDELIRRGFNPTRVHPLWNFRGHSGCAVVEFHKDWPG 184

Query: 189 LHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLRKIGDLKTV 248
           LHNA+SFE+AYEADH+GKKDW A+  EK G+YAWVAR+DDYN  NIIG+HLRKIGDLKT+
Sbjct: 185 LHNAMSFEKAYEADHYGKKDWYASNQEKSGLYAWVARSDDYNLKNIIGDHLRKIGDLKTI 244

Query: 249 SEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLLQAYNEE 308
           SE++EEEARKQ+ LVSNLT++IE+K+KHL EM+ER +ET+ ++  LM E+D+LLQ+YNEE
Sbjct: 245 SEMMEEEARKQNLLVSNLTNMIEVKDKHLEEMKERFTETSNSVEKLMEEKDRLLQSYNEE 304

Query: 309 IKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYLAEEIEK 368
           IKKIQL ARDH ++IF+DHEKLKLQLESQKKE ELRG ELE RE QNE++ K LAEEIEK
Sbjct: 305 IKKIQLSARDHFQRIFTDHEKLKLQLESQKKELELRGEELEKRETQNENDRKILAEEIEK 364

Query: 369 YEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALELEIERL 428
             +RN+SLQLA L QQKADE+  KLA+DQKKQKEDLHNRII+LEKQLD KQAL LEIERL
Sbjct: 365 NAMRNNSLQLASLVQQKADENVRKLAEDQKKQKEDLHNRIIQLEKQLDAKQALALEIERL 424

Query: 429 RGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSNDELQEA 488
           +GSLNVMKHMGDD D+EVLQK ET+LK L EKEG+L+ L+ LNQTLI+++RKSNDELQ+A
Sbjct: 425 KGSLNVMKHMGDDGDIEVLQKMETVLKDLREKEGELDDLEALNQTLIIRERKSNDELQDA 484

Query: 489 RKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSLWAEYLKD 548
           RKE++NA K+L GR+H+ +KRMGELD KPF E   ++YNE+EA+ERASELCSLW EYLKD
Sbjct: 485 RKELINALKELSGRAHIGLKRMGELDNKPFLEVMNRKYNEEEAEERASELCSLWEEYLKD 544

Query: 549 PDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALREINEYNPS 608
           PDWHPFKVI        EGK  E++++EDEKL+ LK E GEEV+ AVT AL EINEYNPS
Sbjct: 545 PDWHPFKVI------TAEGKHKEIINEEDEKLKGLKKEMGEEVYIAVTTALVEINEYNPS 604

Query: 609 GRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
           GRYI SELWNY+E RKATL+EGV FL+ + K
Sbjct: 605 GRYITSELWNYKEGRKATLQEGVAFLMKQWK 629

BLAST of Cp4.1LG01g04820 vs. TrEMBL
Match: A0A067LJU2_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01445 PE=4 SV=1)

HSP 1 Score: 907.1 bits (2343), Expect = 1.2e-260
Identity = 455/627 (72.57%), Positives = 533/627 (85.01%), Query Frame = 1

Query: 9   SDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHAS 68
           SD DTD+S+SE++E E++SYEELKNG R VK+S E F+CPYC +KRKRD+LYKDLLQHA 
Sbjct: 5   SDEDTDVSDSEMEEYEAQSYEELKNGTRSVKISDEIFSCPYCPKKRKRDYLYKDLLQHAV 64

Query: 69  GVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEKFVWPWRG 128
           GVG SPSNKRSAKEKANHLALVKYLEKDL     PS+P S+ DP+ +CDH EK VWPW G
Sbjct: 65  GVGKSPSNKRSAKEKANHLALVKYLEKDLGATGSPSEPKSDTDPLSECDHYEKLVWPWTG 124

Query: 129 IVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKDWPG 188
           IVVN+PT RTDDGR+VG SGSKFRDEL  RGFNPTRV PLWNYRGHSG A+VEF KDWPG
Sbjct: 125 IVVNLPTTRTDDGRFVGASGSKFRDELISRGFNPTRVHPLWNYRGHSGSAVVEFRKDWPG 184

Query: 189 LHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLRKIGDLKTV 248
           LHNA+SFE+AYEADHHGKK+W   G EK GVY WVARADDY + NIIGEHLRKIGDLKTV
Sbjct: 185 LHNALSFEKAYEADHHGKKEWFTGG-EKSGVYCWVARADDYKADNIIGEHLRKIGDLKTV 244

Query: 249 SEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLLQAYNEE 308
           SEI+EEEARKQD+L+SNL +IIE+KNKHL+EMEE+CSET  +L  LM E+D+LLQAYNEE
Sbjct: 245 SEIMEEEARKQDKLISNLNNIIEIKNKHLQEMEEKCSETTVSLQKLMGEKDRLLQAYNEE 304

Query: 309 IKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYLAEEIEK 368
           IKKIQ+ AR+H +KIF+DHEKLKLQLESQK+E E+RG ELE REA+NE + + L+EEIEK
Sbjct: 305 IKKIQMSAREHFQKIFNDHEKLKLQLESQKRELEMRGSELEQREARNESDRRLLSEEIEK 364

Query: 369 YEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALELEIERL 428
             +RNSSLQLA LEQQKADE  +KLA+DQK+QKE+LHNRII+LEKQLD KQALELEIERL
Sbjct: 365 NAIRNSSLQLASLEQQKADESVLKLAEDQKRQKEELHNRIIQLEKQLDAKQALELEIERL 424

Query: 429 RGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSNDELQEA 488
           RGSLNV+KHMGDD D EVL+K +TI+++L EKEG+LE L+ LNQ LIV++RKSNDELQEA
Sbjct: 425 RGSLNVIKHMGDDGDAEVLKKMDTIIQNLREKEGELEELETLNQALIVRERKSNDELQEA 484

Query: 489 RKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSLWAEYLKD 548
           RKE++   K++  R+ + VKRMGELD+KPF EA KK++ EDEA+ RASELCSLW EYLKD
Sbjct: 485 RKELITGLKEISNRASIGVKRMGELDSKPFLEAMKKKFVEDEAEVRASELCSLWMEYLKD 544

Query: 549 PDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALREINEYNPS 608
           PDWHPFK +        +GK  EV++DEDEKL+ L+ E   EV+KAVT AL EINEYNPS
Sbjct: 545 PDWHPFKFVM------VDGKHKEVINDEDEKLKGLRKEMSNEVYKAVTDALMEINEYNPS 604

Query: 609 GRYIVSELWNYQEDRKATLREGVKFLL 636
           GRYI+SELWNY+E +KATL+EGV FLL
Sbjct: 605 GRYIISELWNYKEGKKATLKEGVSFLL 624

BLAST of Cp4.1LG01g04820 vs. TrEMBL
Match: B9T4I5_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0001380 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 3.6e-257
Identity = 447/635 (70.39%), Positives = 539/635 (84.88%), Query Frame = 1

Query: 1   MGSSSSDDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLY 60
           MGSS    SD DTD+SESELDE E++ YEELKNG   VK+S ETFTCPYC +KRKR++LY
Sbjct: 1   MGSSVDHSSDEDTDMSESELDEYEAQCYEELKNGTHHVKISDETFTCPYCPKKRKREYLY 60

Query: 61  KDLLQHASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDE 120
           +DLLQHASGVG S S KRS KEKANHLALVKYLEKD+AD   PSKP   +DP+  C+HDE
Sbjct: 61  RDLLQHASGVGRSASKKRSTKEKANHLALVKYLEKDIADLGSPSKPKGESDPLDSCNHDE 120

Query: 121 KFVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIV 180
           K VWPW GIV+NIPT +  DGR+VG SGSKFRDEL  RGFNPTRV PLWNYRGHSG A+V
Sbjct: 121 KIVWPWTGIVINIPTTKAPDGRFVGASGSKFRDELISRGFNPTRVHPLWNYRGHSGSAVV 180

Query: 181 EFNKDWPGLHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLR 240
           EF+KDWPGLHNA+SFE+AYEADHHGKKD+   G EK GVY WVARADDY + NIIG+HLR
Sbjct: 181 EFHKDWPGLHNALSFEKAYEADHHGKKDYFTTG-EKSGVYCWVARADDYKADNIIGDHLR 240

Query: 241 KIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDK 300
           K GDLKT+SEI+EEEARKQD+L+SNL +IIE+KNKH++EM+++ SET+ +LN LM E+D+
Sbjct: 241 KTGDLKTISEIMEEEARKQDKLISNLNNIIEIKNKHIQEMQDKFSETSVSLNKLMEEKDR 300

Query: 301 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESK 360
           LLQAYNEEI+KIQ+ AR+H +KIF+DHEKLKLQ++SQK+E E+RG ELE REA+NE++ +
Sbjct: 301 LLQAYNEEIRKIQMSAREHFQKIFNDHEKLKLQVDSQKRELEMRGSELEKREAKNENDRR 360

Query: 361 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQA 420
            L+EEIEK  +RNSSLQLA  EQQKADE+ +KLA+DQK+QKE+LHNRII+L+KQLD KQA
Sbjct: 361 KLSEEIEKNAIRNSSLQLAAFEQQKADENVLKLAEDQKRQKEELHNRIIQLQKQLDAKQA 420

Query: 421 LELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRK 480
           LELEIERLRG+LNVMKHMGDD DVEVLQK ETI+++L EKEG+LE L+ LNQ LIV +RK
Sbjct: 421 LELEIERLRGTLNVMKHMGDDGDVEVLQKMETIIQNLREKEGELEDLETLNQALIVSERK 480

Query: 481 SNDELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCS 540
           SNDELQEARKE++N  K++  R+ + VKRMGELD+KPF EA K++Y E+EA+ RASELCS
Sbjct: 481 SNDELQEARKELINGLKEISNRAQIGVKRMGELDSKPFLEAMKRKYTEEEAEVRASELCS 540

Query: 541 LWAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALR 600
           LW EYLKDP WHPFKV   +G++       EV+DD+DEKL  LK+E G+EV+KAVT A++
Sbjct: 541 LWVEYLKDPGWHPFKVAMVDGKNK------EVIDDKDEKLNGLKDELGDEVYKAVTDAVK 600

Query: 601 EINEYNPSGRYIVSELWNYQEDRKATLREGVKFLL 636
           EIN+YNPSGRYI SELWNY+E++KATL+EGV FLL
Sbjct: 601 EINDYNPSGRYITSELWNYKEEKKATLKEGVSFLL 628

BLAST of Cp4.1LG01g04820 vs. TAIR10
Match: AT3G48670.1 (AT3G48670.1 XH/XS domain-containing protein)

HSP 1 Score: 713.0 bits (1839), Expect = 1.7e-205
Identity = 373/648 (57.56%), Positives = 479/648 (73.92%), Query Frame = 1

Query: 1   MGSS---SSDDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRD 60
           MGS+   SSDD D  +DISESE+DE   K Y  LK GK  V+LS + F CPYC  K+K  
Sbjct: 1   MGSTVILSSDDED--SDISESEMDEYGDKMYLNLKGGKLKVRLSPQAFICPYCPNKKKTS 60

Query: 61  FLYKDLLQHASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPAS----NNDPV 120
           F YKDLLQHASGVGNS S+KRSAKEKA+HLALVKYL++DLAD+   ++P+S    N +P+
Sbjct: 61  FQYKDLLQHASGVGNSNSDKRSAKEKASHLALVKYLQQDLADSASEAEPSSKRQKNGNPI 120

Query: 121 MDCDHDEKFVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRG 180
            DCDHDEK V+PW+GIVVNIPT +  DGR  GESGSK RDE   RGFNPTRV PLWNY G
Sbjct: 121 QDCDHDEKLVYPWKGIVVNIPTTKAQDGRSAGESGSKLRDEYILRGFNPTRVRPLWNYLG 180

Query: 181 HSGCAIVEFNKDWPGLHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSN 240
           HSG AIVEFNKDW GLHN + F++AY  D HGKKDWL     KLG+Y W+ARADDYN +N
Sbjct: 181 HSGTAIVEFNKDWNGLHNGLLFDKAYTVDGHGKKDWLKKDGPKLGLYGWIARADDYNGNN 240

Query: 241 IIGEHLRKIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNN 300
           IIGE+LRK GDLKT++E+ EEEARKQ+ LV NL  ++E K K ++E+EE CS  +  LN 
Sbjct: 241 IIGENLRKTGDLKTIAELTEEEARKQELLVQNLRQLVEEKKKDMKEIEELCSVKSEELNQ 300

Query: 301 LMVERDKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREA 360
           LM E++K  Q +  E+  IQ     H++KI  DHEKLK  LES++K+ E++  EL  RE 
Sbjct: 301 LMEEKEKNQQKHYRELNAIQERTMSHIQKIVDDHEKLKRLLESERKKLEIKCNELAKREV 360

Query: 361 QNEHESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEK 420
            N  E   L+E++E+   +NSSL+LA +EQQKADE+  KLA+DQ++QKE+LH +IIRLE+
Sbjct: 361 HNGTERMKLSEDLEQNASKNSSLELAAMEQQKADEEVKKLAEDQRRQKEELHEKIIRLER 420

Query: 421 QLDTKQALELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQT 480
           Q D KQA+ELE+E+L+G LNVMKHM  D D EV+++ + I K L EKE  L  LD+ NQT
Sbjct: 421 QRDQKQAIELEVEQLKGQLNVMKHMASDGDAEVVKEVDIIFKDLGEKEAQLADLDKFNQT 480

Query: 481 LIVKQRKSNDELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADE 540
           LI+++R++NDELQEA KE+VN  K+    +++ VKRMGEL TKPF +A +++Y + + ++
Sbjct: 481 LILRERRTNDELQEAHKELVNIMKE--WNTNIGVKRMGELVTKPFVDAMQQKYCQQDVED 540

Query: 541 RASELCSLWAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFK 600
           RA E+  LW  YLKD DWHPFK +K E  D    +E+EV+DD DEKL++LK + G+  + 
Sbjct: 541 RAVEVLQLWEHYLKDSDWHPFKRVKLENED----REVEVIDDRDEKLRELKADLGDGPYN 600

Query: 601 AVTAALREINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLKRS 642
           AVT AL EINEYNPSGRYI +ELWN++ D+KATL EGV  LLD+ +++
Sbjct: 601 AVTKALLEINEYNPSGRYITTELWNFKADKKATLEEGVTCLLDQWEKA 640

BLAST of Cp4.1LG01g04820 vs. TAIR10
Match: AT3G12550.1 (AT3G12550.1 XH/XS domain-containing protein)

HSP 1 Score: 590.5 bits (1521), Expect = 1.2e-168
Identity = 322/634 (50.79%), Positives = 436/634 (68.77%), Query Frame = 1

Query: 18  SELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHASGVGNSPSNK 77
           ++L + E   Y++LK+GK  VK+S+ TF CPYC   +K+  LY D+LQHASGVGNS S K
Sbjct: 3   NKLSDFEKNLYKKLKSGKLEVKVSYRTFLCPYCPDNKKKVGLYVDILQHASGVGNSQSKK 62

Query: 78  RSAKEKANHLALVKYLEKDLADAVGPSK-----------PASNNDP--VMDCDHDEKFVW 137
           RS  EKA+H AL KYL KDLA     +            PA   D   + D    EK VW
Sbjct: 63  RSLTEKASHRALAKYLIKDLAHYATSTISKRLKARTSFIPAETGDAPIIYDDAQFEKLVW 122

Query: 138 PWRGIVVNIPTRRTDDGRY-VGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 197
           PW+G++VNIPT  T+DGR   GESG K +DEL  RGFNP RV  +W+  GHSG  IVEFN
Sbjct: 123 PWKGVLVNIPTTSTEDGRSCTGESGPKLKDELIRRGFNPIRVRTVWDRFGHSGTGIVEFN 182

Query: 198 KDWPGLHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLRKIG 257
           +DW GL +A+ F++AYE D HGKKDWL   T+   +YAW+A ADDY  +NI+GE+LRK+G
Sbjct: 183 RDWNGLQDALVFKKAYEGDGHGKKDWLCGATDS-SLYAWLANADDYYRANILGENLRKMG 242

Query: 258 DLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLLQ 317
           DLK++    EEEARK  +L+  L  ++E K   L++++ + S+ +  L     E++K+L+
Sbjct: 243 DLKSIYRFAEEEARKDQKLLQRLNFMVENKQYRLKKLQIKYSQDSVKLKYETEEKEKILR 302

Query: 318 AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYLA 377
           AY+E++   Q  + DH  +IF+DHEK K+QLESQ KE E+R  EL  REA+NE + K +A
Sbjct: 303 AYSEDLTGRQQKSTDHFNRIFADHEKQKVQLESQIKELEIRKLELAKREAENETQRKIVA 362

Query: 378 EEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALEL 437
           +E+E+    NS +QL+ LEQQK  E   +LA D K QKE LH RI  LE+QLD KQ LEL
Sbjct: 363 KELEQNAAINSYVQLSALEQQKTREKAQRLAVDHKMQKEKLHKRIAALERQLDQKQELEL 422

Query: 438 EIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSND 497
           E+++L+  L+VM+ +  D   E++ K ET L+ LSE EG+L  L++ NQ L+V++RKSND
Sbjct: 423 EVQQLKSQLSVMRLVELDSGSEIVNKVETFLRDLSETEGELAHLNQFNQDLVVQERKSND 482

Query: 498 ELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSLWA 557
           ELQEAR+ +++  +D+    H+ VKRMGELDTKPF +A + +Y +++ ++ A E+  LW 
Sbjct: 483 ELQEARRALISNLRDM--GLHIGVKRMGELDTKPFMKAMRIKYCQEDLEDWAVEVIQLWE 542

Query: 558 EYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALREIN 617
           EYLKDPDWHPFK IK E  +      +EV+D++DEKL+ LKNE G++ ++AV  AL EIN
Sbjct: 543 EYLKDPDWHPFKRIKLETAET----IVEVIDEDDEKLRTLKNELGDDAYQAVANALLEIN 602

Query: 618 EYNPSGRYIVSELWNYQEDRKATLREGVKFLLDK 638
           EYNPSGRYI SELWN++EDRKATL EGV  LL++
Sbjct: 603 EYNPSGRYISSELWNFREDRKATLEEGVNSLLEQ 629

BLAST of Cp4.1LG01g04820 vs. TAIR10
Match: AT1G80790.1 (AT1G80790.1 XH/XS domain-containing protein)

HSP 1 Score: 448.4 bits (1152), Expect = 7.7e-126
Identity = 263/639 (41.16%), Positives = 404/639 (63.22%), Query Frame = 1

Query: 7   DDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQH 66
           + SD +++ISESE+D    K YE+L NG   VK+  +TF CP+C+ K+K+ + YK+LL H
Sbjct: 3   NSSDEESEISESEIDVYYEKPYEKLMNGDYKVKVK-DTFRCPFCAGKKKQHYKYKELLAH 62

Query: 67  ASGVGNSPSNKRSAKEKANHLALVKYLEKDLA---DAVGPSKPASNNDPVMDCDHDEKFV 126
           ASGV    S  RSAK+KANH AL KY+E +LA   D   P  P+S+ +       D+ +V
Sbjct: 63  ASGVAKG-SASRSAKQKANHFALAKYMENELAGDADVPRPQIPSSSTEQ-SQAVVDDIYV 122

Query: 127 WPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 186
           WPW GIV+N P RRTD+   + +S    +   K   FNP  V  LW  +      I +FN
Sbjct: 123 WPWMGIVIN-PVRRTDNKNVLLDSAYWLK---KLARFNPLEVKTLWLDQESVVAVIPQFN 182

Query: 187 KDWPGLHNAISFERAYEADHHGKKDWL-ANGTEKLGVYAWVARADDYNSSNIIGEHLRKI 246
             W G  +    E+ YE    G+KDW+   G  +   Y W ARADDYNS   I E+L K+
Sbjct: 183 SGWSGFKSVTELEKEYEIRGCGRKDWIDKRGDWRSKAYGWCARADDYNSQGSIAEYLSKV 242

Query: 247 GDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLL 306
           G L++ S+I +EE + +  +V +L + I + N+ L +++   +E   +L  +++E+D+L 
Sbjct: 243 GKLRSFSDITKEEIQNKSIVVDDLANKIAMTNEDLNKLQYMNNEKTLSLRRVLIEKDELD 302

Query: 307 QAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYL 366
           + Y +E KK+Q  +R+ + +IF + E+L  +LE++    ++  ++L+ ++A  E E + L
Sbjct: 303 RVYKQETKKMQELSREKINRIFREKERLTNELEAKMNNLKIWSKQLDKKQALTELERQKL 362

Query: 367 AEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALE 426
            E+ +K +V NSSLQLA LEQ+K D+  ++L D+ K++KE+  N+I++LEK+LD+KQ L+
Sbjct: 363 DEDKKKSDVMNSSLQLASLEQKKTDDRVLRLVDEHKRKKEETLNKILQLEKELDSKQKLQ 422

Query: 427 LEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSN 486
           +EI+ L+G L VMKH  D++D  + +K + + + L EK  +L+ L++ N  L+VK+RKSN
Sbjct: 423 MEIQELKGKLKVMKH-EDEDDEGIKKKMKKMKEELEEKCSELQDLEDTNSALMVKERKSN 482

Query: 487 DELQEARKEIVNAFKDL-PGRSHLRVKRMGELDTKPFHEAAKKRYN-EDEADERASELCS 546
           DE+ EARK ++   ++L   R+ +RVKRMGEL+ KPF  A ++R   E+EA  + + LCS
Sbjct: 483 DEIVEARKFLITELRELVSDRNIIRVKRMGELEEKPFMTACRQRCTVEEEAQVQYAMLCS 542

Query: 547 LWAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALR 606
            W E +KD  W PFK +    R        EV+D+EDE+++ L+ EWGEEV  AV  AL 
Sbjct: 543 KWQEKVKDSAWQPFKHVGTGDRKK------EVVDEEDEEIKKLREEWGEEVKNAVKTALE 602

Query: 607 EINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
           E+NE+NPSGRY V ELWN ++ RKATL+E + ++  ++K
Sbjct: 603 ELNEFNPSGRYSVPELWNSKQGRKATLKEVIDYITQQVK 627

BLAST of Cp4.1LG01g04820 vs. TAIR10
Match: AT1G15910.1 (AT1G15910.1 XH/XS domain-containing protein)

HSP 1 Score: 441.0 bits (1133), Expect = 1.2e-123
Identity = 265/640 (41.41%), Positives = 391/640 (61.09%), Query Frame = 1

Query: 9   SDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHAS 68
           SD + +ISESE+++     Y  L++G   VK++ +   CP+C+ K+K+D+ YK+L  HA+
Sbjct: 4   SDEEAEISESEIEDYSETPYRLLRDGTYKVKVNGQ-LRCPFCAGKKKQDYKYKELYAHAT 63

Query: 69  GVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEK------- 128
           GV    S  RSA +KANHLAL  +LE +LA   G ++P     PV+    DE        
Sbjct: 64  GVSKG-SATRSALQKANHLALAMFLENELA---GYAEPVPR-PPVVPPQLDETEPNPHNV 123

Query: 129 FVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 188
           +VWPW GIVVN P +  DD   + +S    +   K   F P  V   W  +      I +
Sbjct: 124 YVWPWMGIVVN-PLKEADDKELLLDSAYWLQTLSK---FKPIEVNAFWVEQDSIVGVIAK 183

Query: 189 FNKDWPGLHNAISFERAYEADHHGKKDWLA-NGTEKLGVYAWVARADDYNSSNIIGEHLR 248
           FN DW G   A   E+ +E     KK+W   +G  +   Y W ARADD+ S   IGE+L 
Sbjct: 184 FNGDWSGFAGATELEKEFETQGSSKKEWTERSGDSESKAYGWCARADDFESQGPIGEYLS 243

Query: 249 KIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDK 308
           K G L+TVS+I ++  + ++ ++  L+ +I + N+ L +++   + TA +L  ++ E+  
Sbjct: 244 KEGQLRTVSDISQKNVQDRNTVLEELSDMIAMTNEDLNKVQYSYNRTAMSLQRVLDEKKN 303

Query: 309 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESK 368
           L QA+ +E KK+Q  +  H++KI  D EKL  +L+ + ++ E R ++LE  EA  E + +
Sbjct: 304 LHQAFADETKKMQQMSLRHIQKILYDKEKLSNELDRKMRDLESRAKQLEKHEALTELDRQ 363

Query: 369 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQA 428
            L E+  K +  N SLQLA  EQ+KADE  ++L ++ ++QKED  N+I+ LEKQLDTKQ 
Sbjct: 364 KLDEDKRKSDAMNKSLQLASREQKKADESVLRLVEEHQRQKEDALNKILLLEKQLDTKQT 423

Query: 429 LELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRK 488
           LE+EI+ L+G L VMKH+GDD+D  V +K + +   L +K+ +LE L+ +N  L+ K+R+
Sbjct: 424 LEMEIQELKGKLQVMKHLGDDDDEAVQKKMKEMNDELDDKKAELEGLESMNSVLMTKERQ 483

Query: 489 SNDELQEARKEIVNAFKDLPG-RSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELC 548
           SNDE+Q ARK+++     L G  + + VKRMGELD KPF +  K RY+ +EA   A+ LC
Sbjct: 484 SNDEIQAARKKLIAGLTGLLGAETDIGVKRMGELDEKPFLDVCKLRYSANEAAVEAATLC 543

Query: 549 SLWAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAAL 608
           S W E LK+P W PF   K EG    +G E EV+D++DE+L+ LK EWG+EV  AV  AL
Sbjct: 544 STWQENLKNPSWQPF---KHEG--TGDGAE-EVVDEDDEQLKKLKREWGKEVHNAVKTAL 603

Query: 609 REINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
            E+NEYN SGRY   ELWN++E RKATL+E + F+ + +K
Sbjct: 604 VEMNEYNASGRYTTPELWNFKEGRKATLKEVITFISNDIK 627

BLAST of Cp4.1LG01g04820 vs. TAIR10
Match: AT4G00380.1 (AT4G00380.1 XH/XS domain-containing protein)

HSP 1 Score: 440.3 bits (1131), Expect = 2.1e-123
Identity = 265/638 (41.54%), Positives = 386/638 (60.50%), Query Frame = 1

Query: 7   DDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQH 66
           D SD +++ISESE++E     Y  L++        +    CP+C  K+K+D+ YK+L  H
Sbjct: 2   DISDEESEISESEIEEYSKTPYHLLRSETYYKVKVNGRLRCPFCVGKKKQDYKYKELHAH 61

Query: 67  ASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEK---FV 126
           A+GV    S  RSA +K+NHLAL K+LE DLA    P        P++D         +V
Sbjct: 62  ATGVSKG-SATRSALQKSNHLALAKFLENDLAGYAEPLPRPPVVPPLLDETEPNPHNVYV 121

Query: 127 WPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 186
           WPW GIVVN P + TDD   + +S    +   K   F P  V   W  +      I +F+
Sbjct: 122 WPWMGIVVN-PLKETDDKELLLDSVYWLQTLSK---FKPVEVNAFWVEQDSIVGVIAKFD 181

Query: 187 KDWPGLHNAISFERAYEADHHGKKDWLA-NGTEKLGVYAWVARADDYNSSNIIGEHLRKI 246
            DW G   A   E+ +E     KK+W   +G  +   Y W ARADD+ S   IGE+L K 
Sbjct: 182 SDWSGFAAATELEKEFETQGSCKKEWTERSGDSESKAYGWCARADDFQSQGPIGEYLSKE 241

Query: 247 GDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLL 306
           G L+TVS+I++   + ++ L+  L+++I++ N+ L + +   + TA +L  ++ E+  L 
Sbjct: 242 GTLRTVSDILQNNVQDRNTLLDVLSNMIDMTNEDLNKAQHSYNRTAMSLQRVLDEKKNLH 301

Query: 307 QAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYL 366
           QA+ EE KK+Q  +  H+++I  D EKL+ +L+ + ++ E R ++LE  EA  E E + L
Sbjct: 302 QAFAEETKKMQQMSLRHIQRILYDKEKLRNELDRKMRDLESRAKQLEKHEALTELERQKL 361

Query: 367 AEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALE 426
            E+  K +  N SLQLA  EQ+KADE  ++L ++ ++QKED  N+I+ LEKQLDTKQ LE
Sbjct: 362 DEDKRKSDAMNKSLQLASREQKKADESVLRLVEEHQRQKEDALNKILLLEKQLDTKQTLE 421

Query: 427 LEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSN 486
           +EI+ L+G L VMKH+GDD+D  V  K + +   L +K+ +LE L+ +N  L+ K+R+SN
Sbjct: 422 MEIQELKGKLQVMKHLGDDDDEAVQTKMKEMNDELDDKKAELEDLESMNSVLMTKERQSN 481

Query: 487 DELQEARKEIVNAFKDLPG-RSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSL 546
           DE+Q AR++++     L G  S + VKRMGELD KPF +  K RY+ +EA   A+ LCS 
Sbjct: 482 DEIQAARQKMIAGLTGLLGAESDIGVKRMGELDEKPFLDVCKLRYSANEARVEAATLCST 541

Query: 547 WAEYLKDPDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALRE 606
           W E LK+P W PFK  +E   D  E    EV+D++DE+L+ LK EWG+EV  AV AAL E
Sbjct: 542 WKENLKNPSWQPFK--REGTGDGAE----EVVDEDDEQLKKLKREWGKEVHNAVKAALVE 601

Query: 607 INEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
           +NEYN SGRY  SELWN++E RKATL+E + F+   +K
Sbjct: 602 MNEYNASGRYPTSELWNFKEGRKATLKEVITFISTDIK 628

BLAST of Cp4.1LG01g04820 vs. NCBI nr
Match: gi|659123460|ref|XP_008461675.1| (PREDICTED: flagellar attachment zone protein 1 [Cucumis melo])

HSP 1 Score: 1114.0 bits (2880), Expect = 0.0e+00
Identity = 574/644 (89.13%), Positives = 607/644 (94.25%), Query Frame = 1

Query: 4   SSSDDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDL 63
           SS+DDSDVDTD+SESE+DERESKSY+ELKNGKRIVKLSHETFTCPYC++KRKRDFLYKDL
Sbjct: 3   SSTDDSDVDTDVSESEMDERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDL 62

Query: 64  LQHASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPA--SNNDPVMDCDHDEK 123
           LQHASGVGNSPSNKRS KEKANHLAL+KYLEKDLAD VGPSKPA  SN DPVMDC+HDEK
Sbjct: 63  LQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADTVGPSKPATASNKDPVMDCNHDEK 122

Query: 124 FVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 183
           FVWPWRGIVVNIPTRRTDDGR+VG SGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE
Sbjct: 123 FVWPWRGIVVNIPTRRTDDGRFVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 182

Query: 184 FNKDWPGLHNAISFERAYEADHHGKKDWLANGT-EKLGVYAWVARADDYNSSNIIGEHLR 243
           FNKDWPGLHNAISFERAYEAD HGKKDWLANGT EKLG+YAWVARADDYN++NI+GEHLR
Sbjct: 183 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTEKLGIYAWVARADDYNTNNIVGEHLR 242

Query: 244 KIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDK 303
           KIGDLKT+SEII+EEARKQDRLVSNLTSIIELKNKHL EME+RC+ETATTLNNLM ER+K
Sbjct: 243 KIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCTETATTLNNLMGEREK 302

Query: 304 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESK 363
           LL AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELE REAQNE+ESK
Sbjct: 303 LLHAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESK 362

Query: 364 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQA 423
           YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLD KQA
Sbjct: 363 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDAKQA 422

Query: 424 LELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRK 483
           LELEIERLRG+LNVMKHM   EDVE +QKAE+ILK LSEKE DLE LD+LNQ LIVKQRK
Sbjct: 423 LELEIERLRGTLNVMKHM---EDVEDVQKAESILKELSEKERDLEELDDLNQALIVKQRK 482

Query: 484 SNDELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCS 543
           SNDELQEARKEI+NAFKDLPGRSHLRVKRMGELDTKPFHEA KK YNEDEADERASELCS
Sbjct: 483 SNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCS 542

Query: 544 LWAEYLKDPDWHPFKVIKEEGRDNEEG--KEIEVLDDEDEKLQDLKNEWGEEVFKAVTAA 603
           LWAEYLKDPDWHPF+VIK E +D  +G  KEIE+LDDEDEKL+ LK ++GEEV KAV +A
Sbjct: 543 LWAEYLKDPDWHPFRVIKVEAKDAPDGKEKEIEILDDEDEKLKGLKKDYGEEVCKAVISA 602

Query: 604 LREINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLKRSN 643
           L EINEYNPSGRYI SELWNYQE RKATLREGV+FLLDKL RSN
Sbjct: 603 LMEINEYNPSGRYITSELWNYQEGRKATLREGVRFLLDKLNRSN 643

BLAST of Cp4.1LG01g04820 vs. NCBI nr
Match: gi|449459906|ref|XP_004147687.1| (PREDICTED: protein INVOLVED IN DE NOVO 2 [Cucumis sativus])

HSP 1 Score: 1104.0 bits (2854), Expect = 0.0e+00
Identity = 569/644 (88.35%), Positives = 607/644 (94.25%), Query Frame = 1

Query: 4   SSSDDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDL 63
           SS+DDSDVDTD+SESE+DERESKSY+ELKNGKRIVKLSHETFTCPYC++KRKRDFLYKDL
Sbjct: 3   SSTDDSDVDTDVSESEMDERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDL 62

Query: 64  LQHASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPA--SNNDPVMDCDHDEK 123
           LQHASGVG SPSNKRS KEKANHLAL+KYLEKDLADAVGPSKPA  SNNDPVMDC+HDEK
Sbjct: 63  LQHASGVGKSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPATASNNDPVMDCNHDEK 122

Query: 124 FVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 183
           FVWPWRGIVVNIPTRRTDDGR+VG SGSKFRDELKERGFNP+RVTPLWNYRGHSGCAIVE
Sbjct: 123 FVWPWRGIVVNIPTRRTDDGRFVGGSGSKFRDELKERGFNPSRVTPLWNYRGHSGCAIVE 182

Query: 184 FNKDWPGLHNAISFERAYEADHHGKKDWLANGT-EKLGVYAWVARADDYNSSNIIGEHLR 243
           FNKDWPGLHNAISFERAYEAD HGKKDWLANGT EKLGVYAWVARADDYNS+NIIGEHLR
Sbjct: 183 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTEKLGVYAWVARADDYNSNNIIGEHLR 242

Query: 244 KIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDK 303
           KIGDLKT+SEII+EEARKQDRLVSNLTSIIELKNKHL EME+RC+ET+ T+++LM E +K
Sbjct: 243 KIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLTEMEKRCNETSATVDSLMREIEK 302

Query: 304 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESK 363
           LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKL+LESQKKEFELRGRELE REAQNE+ESK
Sbjct: 303 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLKLESQKKEFELRGRELEKREAQNENESK 362

Query: 364 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQA 423
           YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLH+RIIRLEKQLD KQA
Sbjct: 363 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHDRIIRLEKQLDAKQA 422

Query: 424 LELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRK 483
           LELEIERLRG+LNVMKHM D EDV   QKAE+ILK LSEKE DLE LD+LNQ LIVKQRK
Sbjct: 423 LELEIERLRGTLNVMKHMEDAEDV---QKAESILKDLSEKERDLEELDDLNQALIVKQRK 482

Query: 484 SNDELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCS 543
           SNDELQEARKEI+NAFKDLPGRSHLR+KRMGELDTKPFHEA KK YNEDEADERASELCS
Sbjct: 483 SNDELQEARKEIINAFKDLPGRSHLRIKRMGELDTKPFHEAMKKIYNEDEADERASELCS 542

Query: 544 LWAEYLKDPDWHPFKVIKEEGRDNEEG--KEIEVLDDEDEKLQDLKNEWGEEVFKAVTAA 603
           LWAEYLKDPDWHPFKVIK EG+D  +G  KEIE+LDDEDEKL+ LK ++GEEV KAV +A
Sbjct: 543 LWAEYLKDPDWHPFKVIKVEGKDAPDGKEKEIEILDDEDEKLKGLKKDYGEEVCKAVISA 602

Query: 604 LREINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLKRSN 643
           L EINEYNPSGRYI SELWNYQE ++ATLREGV+FLLDKL RSN
Sbjct: 603 LVEINEYNPSGRYITSELWNYQEGKRATLREGVRFLLDKLNRSN 643

BLAST of Cp4.1LG01g04820 vs. NCBI nr
Match: gi|700195377|gb|KGN50554.1| (hypothetical protein Csa_5G182100 [Cucumis sativus])

HSP 1 Score: 1104.0 bits (2854), Expect = 0.0e+00
Identity = 569/644 (88.35%), Positives = 607/644 (94.25%), Query Frame = 1

Query: 4   SSSDDSDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDL 63
           SS+DDSDVDTD+SESE+DERESKSY+ELKNGKRIVKLSHETFTCPYC++KRKRDFLYKDL
Sbjct: 141 SSTDDSDVDTDVSESEMDERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDL 200

Query: 64  LQHASGVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPA--SNNDPVMDCDHDEK 123
           LQHASGVG SPSNKRS KEKANHLAL+KYLEKDLADAVGPSKPA  SNNDPVMDC+HDEK
Sbjct: 201 LQHASGVGKSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPATASNNDPVMDCNHDEK 260

Query: 124 FVWPWRGIVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 183
           FVWPWRGIVVNIPTRRTDDGR+VG SGSKFRDELKERGFNP+RVTPLWNYRGHSGCAIVE
Sbjct: 261 FVWPWRGIVVNIPTRRTDDGRFVGGSGSKFRDELKERGFNPSRVTPLWNYRGHSGCAIVE 320

Query: 184 FNKDWPGLHNAISFERAYEADHHGKKDWLANGT-EKLGVYAWVARADDYNSSNIIGEHLR 243
           FNKDWPGLHNAISFERAYEAD HGKKDWLANGT EKLGVYAWVARADDYNS+NIIGEHLR
Sbjct: 321 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTEKLGVYAWVARADDYNSNNIIGEHLR 380

Query: 244 KIGDLKTVSEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDK 303
           KIGDLKT+SEII+EEARKQDRLVSNLTSIIELKNKHL EME+RC+ET+ T+++LM E +K
Sbjct: 381 KIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLTEMEKRCNETSATVDSLMREIEK 440

Query: 304 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESK 363
           LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKL+LESQKKEFELRGRELE REAQNE+ESK
Sbjct: 441 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLKLESQKKEFELRGRELEKREAQNENESK 500

Query: 364 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQA 423
           YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLH+RIIRLEKQLD KQA
Sbjct: 501 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHDRIIRLEKQLDAKQA 560

Query: 424 LELEIERLRGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRK 483
           LELEIERLRG+LNVMKHM D EDV   QKAE+ILK LSEKE DLE LD+LNQ LIVKQRK
Sbjct: 561 LELEIERLRGTLNVMKHMEDAEDV---QKAESILKDLSEKERDLEELDDLNQALIVKQRK 620

Query: 484 SNDELQEARKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCS 543
           SNDELQEARKEI+NAFKDLPGRSHLR+KRMGELDTKPFHEA KK YNEDEADERASELCS
Sbjct: 621 SNDELQEARKEIINAFKDLPGRSHLRIKRMGELDTKPFHEAMKKIYNEDEADERASELCS 680

Query: 544 LWAEYLKDPDWHPFKVIKEEGRDNEEG--KEIEVLDDEDEKLQDLKNEWGEEVFKAVTAA 603
           LWAEYLKDPDWHPFKVIK EG+D  +G  KEIE+LDDEDEKL+ LK ++GEEV KAV +A
Sbjct: 681 LWAEYLKDPDWHPFKVIKVEGKDAPDGKEKEIEILDDEDEKLKGLKKDYGEEVCKAVISA 740

Query: 604 LREINEYNPSGRYIVSELWNYQEDRKATLREGVKFLLDKLKRSN 643
           L EINEYNPSGRYI SELWNYQE ++ATLREGV+FLLDKL RSN
Sbjct: 741 LVEINEYNPSGRYITSELWNYQEGKRATLREGVRFLLDKLNRSN 781

BLAST of Cp4.1LG01g04820 vs. NCBI nr
Match: gi|567886052|ref|XP_006435548.1| (hypothetical protein CICLE_v10030937mg [Citrus clementina])

HSP 1 Score: 912.5 bits (2357), Expect = 4.1e-262
Identity = 458/631 (72.58%), Positives = 538/631 (85.26%), Query Frame = 1

Query: 9   SDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHAS 68
           S+ D+DISESE+ + E KSY++LK+G   VK+S E FTCPYC +KRK+++LYKDLLQHAS
Sbjct: 5   SEEDSDISESEMLKYEDKSYKKLKSGNHSVKISDEAFTCPYCPKKRKQEYLYKDLLQHAS 64

Query: 69  GVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEKFVWPWRG 128
           GVGNS SNKRSAKEKANHLAL KYLEKDL DA  PSKP +  DP+  C HDEKFVWPW G
Sbjct: 65  GVGNSTSNKRSAKEKANHLALAKYLEKDLRDAGSPSKPVNEGDPLTGCSHDEKFVWPWTG 124

Query: 129 IVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKDWPG 188
           IVVNIPTRR +DGR VGESGSK RDEL  RGFNPTRV PLWN+RGHSGCA+VEF+KDWPG
Sbjct: 125 IVVNIPTRRAEDGRSVGESGSKLRDELIRRGFNPTRVHPLWNFRGHSGCAVVEFHKDWPG 184

Query: 189 LHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLRKIGDLKTV 248
           LHNA+SFE+AYEADHHGKKDW A+  EK G+YAWVAR+DDYN  NIIG+HLRKIGDLKT+
Sbjct: 185 LHNAMSFEKAYEADHHGKKDWYASNQEKSGLYAWVARSDDYNLKNIIGDHLRKIGDLKTI 244

Query: 249 SEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLLQAYNEE 308
           SE++EEEARKQ+ LVSNLT++IE+K+KHL EM+ER +ET+ ++  LM E+D+LLQ+YNEE
Sbjct: 245 SEMMEEEARKQNLLVSNLTNMIEVKDKHLEEMKERFTETSNSVEKLMEEKDRLLQSYNEE 304

Query: 309 IKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYLAEEIEK 368
           IKKIQL ARDH ++IF+DHEKLKLQLESQKKE ELRG ELE RE QNE++ K LAEEIEK
Sbjct: 305 IKKIQLSARDHFQRIFTDHEKLKLQLESQKKELELRGEELEKRETQNENDRKILAEEIEK 364

Query: 369 YEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALELEIERL 428
             +RN+SLQLA L QQKADE+  KLA+DQKKQKEDLHNRII+LEKQLD KQAL LEIERL
Sbjct: 365 NAMRNNSLQLASLVQQKADENVRKLAEDQKKQKEDLHNRIIQLEKQLDAKQALALEIERL 424

Query: 429 RGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSNDELQEA 488
           +GSLNVMKHMGDD D+EVLQK ET+LK L EKEG+L+ L+ LNQTLI+++RKSNDELQ+A
Sbjct: 425 KGSLNVMKHMGDDGDIEVLQKMETVLKDLREKEGELDDLEALNQTLIIRERKSNDELQDA 484

Query: 489 RKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSLWAEYLKD 548
           RKE++NA K+L GR+H+ +KRMGELD KPF E   ++YNE+EA+ERASELCSLW EYLKD
Sbjct: 485 RKELINALKELAGRAHIGLKRMGELDNKPFLEVMNRKYNEEEAEERASELCSLWEEYLKD 544

Query: 549 PDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALREINEYNPS 608
           PDWHPFKVI        EGK  E++++EDEKL+ LK E GEEV+ AVT AL EINEYNPS
Sbjct: 545 PDWHPFKVI------TAEGKHKEIINEEDEKLKGLKKEMGEEVYIAVTTALVEINEYNPS 604

Query: 609 GRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
           GRYI SELWNY+E RKATL+EGV FL+ + K
Sbjct: 605 GRYITSELWNYKEGRKATLQEGVAFLMKQWK 629

BLAST of Cp4.1LG01g04820 vs. NCBI nr
Match: gi|641850418|gb|KDO69291.1| (hypothetical protein CISIN_1g0065972mg [Citrus sinensis])

HSP 1 Score: 910.2 bits (2351), Expect = 2.0e-261
Identity = 457/631 (72.42%), Positives = 538/631 (85.26%), Query Frame = 1

Query: 9   SDVDTDISESELDERESKSYEELKNGKRIVKLSHETFTCPYCSRKRKRDFLYKDLLQHAS 68
           S+ D+DISESE+ + E KSY++LK+G   VK+S E FTCPYC +KRK+++LYKDLLQHAS
Sbjct: 5   SEEDSDISESEMLKYEDKSYKKLKSGNHSVKISDEAFTCPYCPKKRKQEYLYKDLLQHAS 64

Query: 69  GVGNSPSNKRSAKEKANHLALVKYLEKDLADAVGPSKPASNNDPVMDCDHDEKFVWPWRG 128
           GVGNS SNKRSAKEKANHLAL KYLEKDL DA  PSKP +  DP+  C HDEKFVWPW G
Sbjct: 65  GVGNSTSNKRSAKEKANHLALAKYLEKDLRDAGSPSKPVNEGDPLTGCSHDEKFVWPWTG 124

Query: 129 IVVNIPTRRTDDGRYVGESGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKDWPG 188
           IVVNIPTRR +DGR VGESGSK RDEL  RGFNPTRV PLWN+RGHSGCA+VEF+KDWPG
Sbjct: 125 IVVNIPTRRAEDGRSVGESGSKLRDELIRRGFNPTRVHPLWNFRGHSGCAVVEFHKDWPG 184

Query: 189 LHNAISFERAYEADHHGKKDWLANGTEKLGVYAWVARADDYNSSNIIGEHLRKIGDLKTV 248
           LHNA+SFE+AYEADH+GKKDW A+  EK G+YAWVAR+DDYN  NIIG+HLRKIGDLKT+
Sbjct: 185 LHNAMSFEKAYEADHYGKKDWYASNQEKSGLYAWVARSDDYNLKNIIGDHLRKIGDLKTI 244

Query: 249 SEIIEEEARKQDRLVSNLTSIIELKNKHLREMEERCSETATTLNNLMVERDKLLQAYNEE 308
           SE++EEEARKQ+ LVSNLT++IE+K+KHL EM+ER +ET+ ++  LM E+D+LLQ+YNEE
Sbjct: 245 SEMMEEEARKQNLLVSNLTNMIEVKDKHLEEMKERFTETSNSVEKLMEEKDRLLQSYNEE 304

Query: 309 IKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMREAQNEHESKYLAEEIEK 368
           IKKIQL ARDH ++IF+DHEKLKLQLESQKKE ELRG ELE RE QNE++ K LAEEIEK
Sbjct: 305 IKKIQLSARDHFQRIFTDHEKLKLQLESQKKELELRGEELEKRETQNENDRKILAEEIEK 364

Query: 369 YEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDTKQALELEIERL 428
             +RN+SLQLA L QQKADE+  KLA+DQKKQKEDLHNRII+LEKQLD KQAL LEIERL
Sbjct: 365 NAMRNNSLQLASLVQQKADENVRKLAEDQKKQKEDLHNRIIQLEKQLDAKQALALEIERL 424

Query: 429 RGSLNVMKHMGDDEDVEVLQKAETILKSLSEKEGDLEALDELNQTLIVKQRKSNDELQEA 488
           +GSLNVMKHMGDD D+EVLQK ET+LK L EKEG+L+ L+ LNQTLI+++RKSNDELQ+A
Sbjct: 425 KGSLNVMKHMGDDGDIEVLQKMETVLKDLREKEGELDDLEALNQTLIIRERKSNDELQDA 484

Query: 489 RKEIVNAFKDLPGRSHLRVKRMGELDTKPFHEAAKKRYNEDEADERASELCSLWAEYLKD 548
           RKE++NA K+L GR+H+ +KRMGELD KPF E   ++YNE+EA+ERASELCSLW EYLKD
Sbjct: 485 RKELINALKELSGRAHIGLKRMGELDNKPFLEVMNRKYNEEEAEERASELCSLWEEYLKD 544

Query: 549 PDWHPFKVIKEEGRDNEEGKEIEVLDDEDEKLQDLKNEWGEEVFKAVTAALREINEYNPS 608
           PDWHPFKVI        EGK  E++++EDEKL+ LK E GEEV+ AVT AL EINEYNPS
Sbjct: 545 PDWHPFKVI------TAEGKHKEIINEEDEKLKGLKKEMGEEVYIAVTTALVEINEYNPS 604

Query: 609 GRYIVSELWNYQEDRKATLREGVKFLLDKLK 640
           GRYI SELWNY+E RKATL+EGV FL+ + K
Sbjct: 605 GRYITSELWNYKEGRKATLQEGVAFLMKQWK 629

The following BLAST results are available for this feature:
Match Name    E-value     Identity    Description
IDN2_ARATH    3.0e-204    57.56       Protein INVOLVED IN DE NOVO 2 OS=Arabidopsis thaliana GN=IDN2 PE=1 SV=1
FDM3_ARATH    2.2e-167    50.79       Factor of DNA methylation 3 OS=Arabidopsis thaliana GN=FDM3 PE=2 SV=1
FDM5_ARATH    1.4e-124    41.16       Factor of DNA methylation 5 OS=Arabidopsis thaliana GN=FDM5 PE=2 SV=1
FDM1_ARATH    2.2e-122    41.41       Factor of DNA methylation 1 OS=Arabidopsis thaliana GN=FDM1 PE=1 SV=1
FDM2_ARATH    3.7e-122    41.54       Factor of DNA methylation 2 OS=Arabidopsis thaliana GN=FDM2 PE=1 SV=1
Match Name          E-value     Identity    Description
A0A0A0KNW6_CUCSA    0.0e+00     88.35       Uncharacterized protein OS=Cucumis sativus GN=Csa_5G182100 PE=4 SV=1
V4SLZ7_9ROSI        2.8e-262    72.58       Uncharacterized protein OS=Citrus clementina GN=CICLE_v10030937mg PE=4 SV=1
A0A067G0R1_CITSI    1.4e-261    72.42       Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g0065972mg PE=4 SV=1
A0A067LJU2_JATCU    1.2e-260    72.57       Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01445 PE=4 SV=1
B9T4I5_RICCO        3.6e-257    70.39       Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0001380 PE=4 SV=1
Match Name   E-value   Identity  Description
AT3G48670.1  1.7e-205  57.56     XH/XS domain-containing protein
AT3G12550.1  1.2e-168  50.79     XH/XS domain-containing protein
AT1G80790.1  7.7e-126  41.16     XH/XS domain-containing protein
AT1G15910.1  1.2e-123  41.41     XH/XS domain-containing protein
AT4G00380.1  2.1e-123  41.54     XH/XS domain-containing protein
Match Name                        E-value   Identity  Description
gi|659123460|ref|XP_008461675.1|  0.0e+00   89.13     PREDICTED: flagellar attachment zone protein 1 [Cucumis melo]
gi|449459906|ref|XP_004147687.1|  0.0e+00   88.35     PREDICTED: protein INVOLVED IN DE NOVO 2 [Cucumis sativus]
gi|700195377|gb|KGN50554.1|       0.0e+00   88.35     hypothetical protein Csa_5G182100 [Cucumis sativus]
gi|567886052|ref|XP_006435548.1|  4.1e-262  72.58     hypothetical protein CICLE_v10030937mg [Citrus clementina]
gi|641850418|gb|KDO69291.1|       2.0e-261  72.42     hypothetical protein CISIN_1g0065972mg [Citrus sinensis]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0031047  gene silencing by RNA
Vocabulary: INTERPRO
Term       Definition
IPR005381  Znf-XS_domain
IPR005380  XS_domain
IPR005379  Uncharacterised_XH
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0031047 gene silencing by RNA
cellular_component GO:0005575 cellular_component
cellular_component GO:0005655 nucleolar ribonuclease P complex
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g04820.1  Cp4.1LG01g04820.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description            Source   Source Term     Source Description                    Alignment
IPR005379  Uncharacterised domain XH  PFAM     PF03469         XH                                    coord: 508..640  score: 1.7
IPR005380  XS domain                  PFAM     PF03468         XS                                    coord: 119..231  score: 5.7
IPR005381  Zinc finger-XS domain      PFAM     PF03470         zf-XS                                 coord: 47..90    score: 9.3
None       No IPR available           unknown  Coil            Coil                                  coord: 320..351  score: -
None       No IPR available           unknown  Coil            Coil                                  coord: 471..498  score: -
None       No IPR available           unknown  Coil            Coil                                  coord: 394..438  score: -
None       No IPR available           unknown  Coil            Coil                                  coord: 277..311  score: -
None       No IPR available           unknown  Coil            Coil                                  coord: 447..467  score:
None       No IPR available           PANTHER  PTHR21596       RIBONUCLEASE P SUBUNIT P38            coord: 19..639   score:
None       No IPR available           PANTHER  PTHR21596:SF25  TRANSCRIPTION REGULATOR-LIKE-RELATED  coord: 19..639   score:

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG01g04820  Cp4.1LG01g10830  Cucurbita pepo (Zucchini)  cpecpeB374
Cp4.1LG01g04820  Cp4.1LG20g08410  Cucurbita pepo (Zucchini)  cpecpeB386
The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG01g04820  Cucumber (Chinese Long) v3  cpecucB0485
Cp4.1LG01g04820  Cucumber (Chinese Long) v3  cpecucB0508
Cp4.1LG01g04820  Cucumber (Chinese Long) v3  cpecucB0550
Cp4.1LG01g04820  Wax gourd                   cpewgoB0487
Cp4.1LG01g04820  Wax gourd                   cpewgoB0543
Cp4.1LG01g04820  Cucurbita pepo (Zucchini)   cpecpeB072
Cp4.1LG01g04820  Cucurbita pepo (Zucchini)   cpecpeB381
Cp4.1LG01g04820  Cucurbita pepo (Zucchini)   cpecpeB401
Cp4.1LG01g04820  Cucumber (Gy14) v2          cgybcpeB364
Cp4.1LG01g04820  Cucumber (Gy14) v2          cgybcpeB510
Cp4.1LG01g04820  Melon (DHL92) v3.6.1        cpemedB476
Cp4.1LG01g04820  Melon (DHL92) v3.6.1        cpemedB506
Cp4.1LG01g04820  Silver-seed gourd           carcpeB0424