Cucsa.050020 (gene) Cucumber (Gy14) v1

Name: Cucsa.050020
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Emsy N Terminus/ plant Tudor-like domains-containing protein isoform 1
Location: scaffold00551 : 166801 .. 172708 (-)
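
The location string can be read programmatically. The sketch below splits it into scaffold, start, end and strand, and derives the feature length. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` is a hypothetical helper name rather than part of any database API.

```python
import re

def parse_location(text):
    """Split a 'scaffold : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive on both ends.
        "length": end - start + 1,
    }

loc = parse_location("scaffold00551 : 166801 .. 172708 (-)")
print(loc)
```

Under that assumption, a feature on the minus strand corresponds to the reverse complement of the scaffold interval.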
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GTCAGGAGTTGAGTGTTCTCATTGTACATCGCCGTCGATTCCCATTTGTTCCTGATTCACTTCAATTTCCTCTACTTCAATCTCAAATCTCAATCCCAACTTCTCCAACACTTCGCCGGAGATTCCTCTTCTTCCCGGCAGATTTCAGCTCCAAAACTCGACATTTCTCGCCGTTTCTACTTTCCCCTCTCTCCGATTTCCTTTTCTACATTATTCCTCTAGCTGTTTTTCTCACTGTATTCAAGGGTTTCCGCCCGGTTGCGAGCTGGCTATGGACTACGAGCCCTATGATAGTAGCGGTAAAGTTTACTCTCACGATGCTTTCATTTTCTGTTATTAGGGTTTTTCTTCGAATTTTAGGTTGCCGGTTTTGTTTTTAGAATTGGGTTTCTAATCTTGCTTTATGGGGGCGTAGGGAAGTTGTAGGTTGCTTGGAGGAACACCCGAGTTTTTCAACCTTTTACACTAGTTCTTTTCTTTGAAGGGAATTCTTTATATTTTGCATCCTTCTTGGGAAATTTGTTTAACTACATAGGGGTTGCTGGCTGTTTCCTTGAGGATGTGTTGCATATGAAAATTTGGATCCTTTCTTCTGTGTATGATCTTTAGCCTGATAATTGTTGCAATACCTACCCAGAGCGTGACAAGCTGTTGAAATTGAAATATTGTTCGAGTTATCTTTAGTTGTATTTTCTCACGGCATTGTTGTTCCTTTTCTTTTTCTCCAAATTTCTTCACTTAATATCAACTCAAAGTCTGTTCGTATAAAAAATAGGCAAGAAACACCTGATTTTCTCATTATATGTGATTTCTCAAGCCCGAGTCATGACGAGGAGGCTAAGAACAGAGCAAGTATGAAAATAGTTGTAGTTGTTTGGATAATTGGCTATTTGGTTTCTGAGCATTTAAAAATATCTATGAAGGAACTAGTTTATCAGTTTTATTCAATTTACGCAGTTCATGTCTAAGCTATTGCCGCACTTGCACACATGTTAACGATTCTCCTTTGAATGCAAGACTTAGACTGACCTTCTGCCTTTTTGAAGGTATCCAGTACTTCCTTAGCCTCAAACAAATCGTTTTTTAACCTGTTTATATTAGTAGCCAGTTCTTGATTATTGATCCGCGGGTTTAGTTCTTCAACATTTTTGTTTCAGAAAAAGAATTACTACACTTGTTCCTGGCTCCTATTATTTGAATTCAGCTGTTTAATTTTTTTGTGGTGCTATCCGTTTCAGGAACTGACGATGATCTCCCTCCATCTCACCAGAATAGAATTGCAAGAGGAGGGGGGCGTGTTTCAGGGAATGGAAGATCAGTCATGGGTTCTGTTCCTTATCCCAGGATGTATGCTGAAACTGACATGGAAGCTCAAATTCATCAACTTGAGAAGGAAGCTTACAGTTCTGTTCTTAGAGCCTTTAAAGCTCAAGCTGATGCCATTACTTGGGTACTTTTTGTTTCTATTTCATGTATCTAATATTGGTAAATTTGTTTTATCAGGTTTAAATGATTGCTACATGAAATGTATTTTCTTCAGGAGAAAGAAAGTTTGATCACAGAACTCAGAAAAGAGTTAAGATTGTCCAATGAGGAGCACAGGGAGCTTCTAGGACGGGTTAATGCAGATGACATGATTAGAAGTATAAGGTTTGATAACTTGATAAAAAGAATATTAACCAATGTTTTAGGAGGTTGTAGTGTTCAGAGCTCTTCTAGATTTTGATGGATACTTTTTTGGTTCATGTCATAGGGAATGGAGACAGGCAGGTGGGCATCAGCCTGGCAAGCTCAGTACTAGCCAAGCCATTCATGATCCAATTCCCAGCCCTACTGTCTCTGCATCACGCAAGAAACAGAAACTAACTCATTTGGTACCTCCACAGTCATTTACTGGCCCATCTCCATCCTTTCACCAACAAAATGTTCCTCCCCCTCATCAGCCATCCTCATCTGTTGCAAAAAGAGGAGCTATCCCTCCCACCAAGGGCAAAAAACAA
AAATCTGTAAGTTTCAGAGGCACTAGTTCTGATTGGACTTATTTGTTCTTCATTTTGTTTTGTTTATATTGTTCCTTTTCCTCAGATATTACCTGGGGCATCTTCAGCTAAACAGTACCAAACTTCAGTTCCTAGTGGTAGGAATCAAGTAGGAAATAGAGTTTCTTCTGGAGCCCTTGAGCCAGCTGAAGGAGCAACATTGAATCCACTTATTGGGAGGAAAGTAAGAACTAGATGGCCAGATGATAATAACTTTTATGAAGCTGTGATAACTGATTACAACGCTGCTGAGGTAAACTATCTTCTTCCAGCAGTGTACTAATTCACATTTCAGTAAGGAAAGTTGTTGAAAATGCATTGCTGTCTTCCAGGGACGGCATGCTTTGGTTTATGATATCGGTTCTGCTAATGAGACGTGGGAGTGGGTTAATCTCTCAGAGGTATATCCATCTAATACATTTTATTTGAACTTAAGATACTTGCATCCTCATTTTTATAATCCTCACCGTGTAGCCATTTGCATCACCTGGTTATATGTTTCCACATCTGGATTTGTGTGAAAATTGTTGAGACCTTCAATTCTGAATTAAAAAATTCCTCCATATTAAAAAAATATCCACGAGTGTTTGAGCGAGCTTGTGTGCATATCAACTAATCTTACGAGACACTCTGTCTAACCTTACATGATCTCTTCTATGGCCTGAATGGTTACACCTTAGATAGCTCTTGTGATATTTATTGAAGCAGCACTCTAGTCCTTAACATTTTGCTGTCATATTAATGTGATATAAAAAATGAAATTATAAGAAAAAAAAAATGGAATGTAAATTACCTTAATTTCCTTTCAGTGTTGGGTAATTAAATTAAAAATTTTTTAAATTGATTCTACTCATAGCCCTTAAATTGTTTTGACAATATATTCTACATAATTTTAGTTTCCGTGTATAGAACTAATGAAAACAATAGGTTGCTGTAGGCATGTTTTTTGGAAATCTCAGAAAGTCATCTCACAGTCTTCATGTTTTAAAAAAGTTTGTTAAGCACTTTGTTGGGGATGGTTGATTGGTTTCATATCCTTTTTTAGAGGATCACTTGGCAGTGACACCTTCCATATCCTCTTTATGTGCCTACTGCCTATTTCGTTATGCAAAGGTACTATAGTTAATCTTGTCCTGCTGTGCTATGGCGGTCTTCTGTCTCTCTTTCTTATTACATCCATTTTAGTAGATCTCTTTCTGATTTCATGTTTTTGGATCAACTTCACTTCTCTCTTTGTTGCATGAGTCTTTATTCAAATTCTAGTGACTTTCATACTCGAACCCAAGATTTCTTAAAGATATCCTTAAAATCTATTTCTATTCTTTGATTGCTCAACCCTCCTTTTTCTATCTTCCTCTTCCCTCATCTTCCCCAGATTCAAGAAGTTGAAAATTTAGAGATGTAATTCTCTCTTAGCTGGTAGCATTTGGGAAGATTAACATCATGGTTTCCATCCAAACTCAAAAAAGGTTTTTTAGTGGTTTCCTTTTGTGTTGGTGGTTGGGGTTGAAAAATCTAGATCATATTTTAGGGAAAGATCAATTTTCTTTGGGTGCTAGACTGAATTGTCAGGATATGGTGGATAAGGTCATCCAGTCTTTGACTGAGATCTTCTTTTGGGCATACAGTGTTCTTTGCTATTGATTAGTATATTTCACTTGACCACCTTTTCAAAGATCAACACATATGGCAACTCAGCTGTACCGAAAATTTTCTGAATAATTTAGAGGAAGATCATGATTATCATTGCCCATAATTCTTTGTGGACCTCTTCCTTTACAGTTAAGTTTCTTACTCAGGCTAAGCTGCTACACAGTGGCTTACTAACAATTTGGATTAAATCAAGTGTTAGATTAGAGTTACCTTAGAAGATTAAATATTGCTTTAAATGTTATAAAGTTAGAAAGTAGTCACATTTGAACTTGATGGGATCAAGTTATCTGGGGTTTTCGTTGCTAAAA
GTATGCATGTTGGAGCCAAACTTTAGGAAGGTGTCCTAACTTCCATATTTCTAAGGACTGGAAAAGGTAAACAACATTGAATTTAACGAAATATAAAGGTCAGTGGGTTTGGTGGAAGTGCTTGGTCGCTGGGGTTTCTTTTCCTAGTATCAAATTTATAGTTGTATCAAGCATGCCTTTTCCCCTTCTTTACATATGTTAGATGATATGGTCAGGCCTTATGTGAATATCCAAATTCAAGATGGTATAGTTGCATGCGTACATATATTCTGTGATTTGTTGGATTAATGATTTAGGGTGTGTTTGGAAGACATTCACAAATGTTTAATTTAAAAATAAGTTATTTTGGAAAAAACTTGAGTGTTTGGAAACTACTCAAAGTAGCTTTTGAAGTATATTTAAACCATCTTTATCAAAAGTGTTTAAATAAAAATAAGTTTTTTGAAAAATACTTTTTTCTTAAGTCAATCCAAGCGGACCCAAAACATGTTTAAGGATTTTTCTTCAGCATGTTTTTCACCCGTTGATATATGGTTGTGTCAATAGTGTCTAATAGTGATGGGATATTCCCTAATTTTATATTTTTTGAATTCCATACTACCTATTTGAAATGTTACTAGATTTAAAGATCCAGAGTTTTTAGAAATGGAGTACAATGGTTGCTAGTGCTTATCTTAAATATTCTGCCCAGATATCTCCTGAGGACATCCAATGGGTTGATGAAGATCCGGGAATCCCACATAGAGGTGGTTACGGTGGATCAGGTCATGGAATGAACCGATCTGTAGGCCGTGATGGTTCTGGTGCAGGTAGAGGTAGAGGAGTCCCCAAGTCCCAATCCAGAAAAGATTTCTTGCCATCTCAAAATGGTATTGGCAAGAAGACATCCGACGATATACGGATACTCCACACAGAAACTCTAATTAGAGAGGTACAAACACTGTGGATTTATTATTTTGACCTTGATGAATTTGCTGTCTTTCTAGTGTCCATGAACTCTTTTTCCTTCATTTGGTTAGCTGGAGAGTTTTTGGTTCAAGCACTGTGGATTTATTCTTTTGAACTTTATGAATTTACTGTCTTTCTAGTGTCCATAAACTTTTTTTCCTTTCTACTGGTTAGGTGGAGAGAGTCTTTGGTTCAAATCATCCTGATCCCGTTGAAATTGAAAGGGCAAAGAAAGTTCTTAATGTAAGAAGCATTAGATGCACCTGCAGTCTTTCATTCTATTCCATTACCTTTTTAATCAATATACTAACTTACTCTGTCCAGGACCATGAACAATCACTTATTGATGCTATTAATAAGCTTGGAGATATTTCTGAAGTAGGAAGCGGTAAATTTCATATCAATCTCAGCCAATGCTTTTCGTTTTTTTATTTCCCATGCTTGTCAACTCAGTCACTTTAGAACAGTCCACAATAATACCAGCCTTAAAATCTTAGGAATTCTGACTATTGGGCACCTGAATTTGTTCATGCTCTCTTGGTGGTTTACATTCTTGAGTTCAACACATTCATGAATTGTAACTCTACATTCCTTGATACCCTATCTGGTTTGGTTGCTTTTATAACAGACGAAGGTGGCCACCGGTTCTCGCATGGGCAATCTATGGACCGGGAATAACAACAGTTTCAAAACGAAGGTGGTTCACTCAGTAAGAGCATCCTTTTGGTAAAAAAAAAAGACGACGGGATGTGGTAAGCAACTTTTGAGTTAAATGGGGAGTTGACCCTAAAACTCTCAAGGGCCTTTTCCTGTAAAATATACGAGAATTGAGATGAATGTTCCCTCGCTTAGGTTAACTATGGGGTTTTTGTTCTATTAGCATAGAGATGGTTCTTAGGAGGATGGGGTGGGGTTGGTTGTTTTCTTCTTTTCTTCTTATTATTCTCCTTTTTTTTGTTG

mRNA sequence

GTCAGGAGTTGAGTGTTCTCATTGTACATCGCCGTCGATTCCCATTTGTTCCTGATTCACTTCAATTTCCTCTACTTCAATCTCAAATCTCAATCCCAACTTCTCCAACACTTCGCCGGAGATTCCTCTTCTTCCCGGCAGATTTCAGCTCCAAAACTCGACATTTCTCGCCGTTTCTACTTTCCCCTCTCTCCGATTTCCTTTTCTACATTATTCCTCTAGCTGTTTTTCTCACTGTATTCAAGGGTTTCCGCCCGGTTGCGAGCTGGCTATGGACTACGAGCCCTATGATAGTAGCGGAACTGACGATGATCTCCCTCCATCTCACCAGAATAGAATTGCAAGAGGAGGGGGGCGTGTTTCAGGGAATGGAAGATCAGTCATGGGTTCTGTTCCTTATCCCAGGATGTATGCTGAAACTGACATGGAAGCTCAAATTCATCAACTTGAGAAGGAAGCTTACAGTTCTGTTCTTAGAGCCTTTAAAGCTCAAGCTGATGCCATTACTTGGGAGAAAGAAAGTTTGATCACAGAACTCAGAAAAGAGTTAAGATTGTCCAATGAGGAGCACAGGGAGCTTCTAGGACGGGTTAATGCAGATGACATGATTAGAAGTATAAGGGAATGGAGACAGGCAGGTGGGCATCAGCCTGGCAAGCTCAGTACTAGCCAAGCCATTCATGATCCAATTCCCAGCCCTACTGTCTCTGCATCACGCAAGAAACAGAAACTAACTCATTTGGTACCTCCACAGTCATTTACTGGCCCATCTCCATCCTTTCACCAACAAAATGTTCCTCCCCCTCATCAGCCATCCTCATCTGTTGCAAAAAGAGGAGCTATCCCTCCCACCAAGGGCAAAAAACAAAAATCTATATTACCTGGGGCATCTTCAGCTAAACAGTACCAAACTTCAGTTCCTAGTGGTAGGAATCAAGTAGGAAATAGAGTTTCTTCTGGAGCCCTTGAGCCAGCTGAAGGAGCAACATTGAATCCACTTATTGGGAGGAAAGTAAGAACTAGATGGCCAGATGATAATAACTTTTATGAAGCTGTGATAACTGATTACAACGCTGCTGAGGGACGGCATGCTTTGGTTTATGATATCGGTTCTGCTAATGAGACGTGGGAGTGGGTTAATCTCTCAGAGATATCTCCTGAGGACATCCAATGGGTTGATGAAGATCCGGGAATCCCACATAGAGGTGGTTACGGTGGATCAGGTCATGGAATGAACCGATCTGTAGGCCGTGATGGTTCTGGTGCAGGTAGAGGTAGAGGAGTCCCCAAGTCCCAATCCAGAAAAGATTTCTTGCCATCTCAAAATGGTATTGGCAAGAAGACATCCGACGATATACGGATACTCCACACAGAAACTCTAATTAGAGAGACGAAGGTGGCCACCGGTTCTCGCATGGGCAATCTATGGACCGGGAATAACAACAGTTTCAAAACGAAGGTGGTTCACTCAGTAAGAGCATCCTTTTGGTAAAAAAAAAAGACGACGGGATGTGGTAAGCAACTTTTGAGTTAAATGGGGAGTTGACCCTAAAACTCTCAAGGGCCTTTTCCTGTAAAATATACGAGAATTGAGATGAATGTTCCCTCGCTTAGGTTAACTATGGGGTTTTTGTTCTATTAGCATAGAGATGGTTCTTAGGAGGATGGGGTGGGGTTGGTTGTTTTCTTCTTTTCTTCTTATTATTCTCCTTTTTTTTGTTG

Coding sequence (CDS)

ATGGACTACGAGCCCTATGATAGTAGCGGAACTGACGATGATCTCCCTCCATCTCACCAGAATAGAATTGCAAGAGGAGGGGGGCGTGTTTCAGGGAATGGAAGATCAGTCATGGGTTCTGTTCCTTATCCCAGGATGTATGCTGAAACTGACATGGAAGCTCAAATTCATCAACTTGAGAAGGAAGCTTACAGTTCTGTTCTTAGAGCCTTTAAAGCTCAAGCTGATGCCATTACTTGGGAGAAAGAAAGTTTGATCACAGAACTCAGAAAAGAGTTAAGATTGTCCAATGAGGAGCACAGGGAGCTTCTAGGACGGGTTAATGCAGATGACATGATTAGAAGTATAAGGGAATGGAGACAGGCAGGTGGGCATCAGCCTGGCAAGCTCAGTACTAGCCAAGCCATTCATGATCCAATTCCCAGCCCTACTGTCTCTGCATCACGCAAGAAACAGAAACTAACTCATTTGGTACCTCCACAGTCATTTACTGGCCCATCTCCATCCTTTCACCAACAAAATGTTCCTCCCCCTCATCAGCCATCCTCATCTGTTGCAAAAAGAGGAGCTATCCCTCCCACCAAGGGCAAAAAACAAAAATCTATATTACCTGGGGCATCTTCAGCTAAACAGTACCAAACTTCAGTTCCTAGTGGTAGGAATCAAGTAGGAAATAGAGTTTCTTCTGGAGCCCTTGAGCCAGCTGAAGGAGCAACATTGAATCCACTTATTGGGAGGAAAGTAAGAACTAGATGGCCAGATGATAATAACTTTTATGAAGCTGTGATAACTGATTACAACGCTGCTGAGGGACGGCATGCTTTGGTTTATGATATCGGTTCTGCTAATGAGACGTGGGAGTGGGTTAATCTCTCAGAGATATCTCCTGAGGACATCCAATGGGTTGATGAAGATCCGGGAATCCCACATAGAGGTGGTTACGGTGGATCAGGTCATGGAATGAACCGATCTGTAGGCCGTGATGGTTCTGGTGCAGGTAGAGGTAGAGGAGTCCCCAAGTCCCAATCCAGAAAAGATTTCTTGCCATCTCAAAATGGTATTGGCAAGAAGACATCCGACGATATACGGATACTCCACACAGAAACTCTAATTAGAGAGACGAAGGTGGCCACCGGTTCTCGCATGGGCAATCTATGGACCGGGAATAACAACAGTTTCAAAACGAAGGTGGTTCACTCAGTAAGAGCATCCTTTTGGTAA

Protein sequence

MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLEKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWRQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQPSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQWVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSDDIRILHTETLIRETKVATGSRMGNLWTGNNNSFKTKVVHSVRASFW*
BLAST of Cucsa.050020 vs. Swiss-Prot
Match: EML4_ARATH (Protein EMSY-LIKE 4 OS=Arabidopsis thaliana GN=EML4 PE=2 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 1.2e-132
Identity = 254/435 (58.39%), Positives = 303/435 (69.66%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGG-----RVSGNGRSVMGSVPYPRMYAE--TDME 60
           MD +  DSSGTDDDLPPSH  R+ RGGG     RV+GNGR +     YP+MY +   DME
Sbjct: 1   MDCKSSDSSGTDDDLPPSH--RVPRGGGGGRGGRVAGNGRPLNLPPSYPKMYDDLAADME 60

Query: 61  AQIHQLEKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMI 120
           AQIHQ+EKEAY SVLRAFKAQ DAI+WEKES+ITELRKEL LSNEEHRELLGRVN+DD I
Sbjct: 61  AQIHQIEKEAYISVLRAFKAQGDAISWEKESVITELRKELSLSNEEHRELLGRVNSDDTI 120

Query: 121 RSIREWRQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQ 180
           R IREWRQ+GG QP   + +Q +HD +PSP+VSAS K  K    +P Q F   SPSFH Q
Sbjct: 121 RRIREWRQSGGMQPSMRNAAQVVHDTLPSPSVSASMKTHKPNQPIPSQPFASSSPSFHPQ 180

Query: 181 NVPPPHQPSSSVAKRGAIPPTKGKKQKSILPGASSAKQ--YQTSVPSGRNQVGNR---VS 240
              P H  +SS AKRG +P  KGKK K + PG+SS K   Y  S    R QV NR   V 
Sbjct: 181 -ADPTHPFASSTAKRGPVPIVKGKKHKPVFPGSSSTKHAPYHPSDQPPRGQVMNRLPSVP 240

Query: 241 SGALEPAEGATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEW 300
           + + EP  G      +GR+VRT+WP+DN FY+A+IT YN  EGRHALVYDI + +ETWEW
Sbjct: 241 ASSSEPTNGIDPESFLGRRVRTKWPEDNTFYDAIITQYNPVEGRHALVYDIATPSETWEW 300

Query: 301 VNLSEISPEDIQWVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFL 360
           V LSEISP DI+W+ EDPG+ +R  Y G GHG+NR+ G       RG G+ K+  RK F 
Sbjct: 301 VRLSEISPGDIEWIGEDPGLGNR--YNGQGHGLNRTTG-PNCVPQRGSGLEKNTIRKGFR 360

Query: 361 PSQNGIGKKTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINK 420
            SQNG GKK   DIRI  T+ LIREVERV  S++PDP E+ERAK+VL +HE +L+ AI K
Sbjct: 361 TSQNGTGKKKHLDIRIRQTDVLIREVERVLRSHNPDPYEVERAKRVLEEHEHALVGAIAK 420

Query: 421 LGDISEVGSDEGGHR 424
           LGDIS+ G +EG  R
Sbjct: 421 LGDISD-GENEGAFR 428
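
The percentages in each HSP header are the matched counts divided by the alignment length. A minimal sketch of reading such a header line follows. The regular expression is written against the layout shown in this record only, not against any official BLAST format, and it tolerates both spellings of the "Positives" label; `parse_hsp_stats` is a hypothetical helper name.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" layout used in this record.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positives, _ = (int(g) for g in m.groups())
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 254/435 (58.39%), Positives = 303/435 (69.66%), Query Frame = 1"
)
print(stats)
```

The recomputed values reproduce the 58.39% identity and 69.66% positives reported for the EML4_ARATH hit.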

BLAST of Cucsa.050020 vs. Swiss-Prot
Match: EML3_ARATH (Protein EMSY-LIKE 3 OS=Arabidopsis thaliana GN=EML3 PE=1 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 1.6e-121
Identity = 243/419 (58.00%), Positives = 293/419 (69.93%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGR-SVMGSVPYPRMYAETDMEAQIHQL 60
           MDY P DSSGTDDDLPPSHQ R  R   R +GNGR SV+ S P  R++ E  ME QIH +
Sbjct: 1   MDYRPSDSSGTDDDLPPSHQGRYQRNA-RPTGNGRPSVLNSAPLSRVHNE--METQIHLI 60

Query: 61  EKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREW 120
           E+EAYSS+LRAFKAQ+DAITWEKESLITELRKELR+S+EEHRELL RVNAD+MIR IREW
Sbjct: 61  EQEAYSSILRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADEMIRRIREW 120

Query: 121 RQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPH 180
           R+A   Q    S  Q +HD  PSP VS SRKKQK +  +   +   PSPS H     P  
Sbjct: 121 RKANSLQS---SVPQLVHDA-PSPAVSGSRKKQKTSQSIASLAMGPPSPSLH-----PSM 180

Query: 181 QPSSSVAKRGAIPP-TKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGA 240
           QPSSS  +RG  PP  K KK K+ +       QY ++  +GR Q G   +    EP E  
Sbjct: 181 QPSSSALRRGGPPPGPKTKKPKTSM-------QYPSTGIAGRPQAGALTN----EPGESG 240

Query: 241 TLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPED 300
           + +PL+GRKV T+WPDDN +YEAVITDYN  EGRHALVYDI SANETWEWVNL EISP D
Sbjct: 241 SYDPLVGRKVWTKWPDDNQYYEAVITDYNPVEGRHALVYDINSANETWEWVNLKEISPGD 300

Query: 301 IQWVDEDPGIPHRGGYGGSGHGMNRSVGRDG---SGAGRGRGVPKSQSRKDFLPSQNGIG 360
           I+W  EDPGI  +GG+ G G G  +++ R G   +  GRGRG  + Q  K    +QNGIG
Sbjct: 301 IRWEGEDPGISRKGGHPGQGRG-TKTMARGGPASNAGGRGRGSMRMQQPK----TQNGIG 360

Query: 361 KKTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISE 415
           KK   +I ILHTETL++EVE+VFGS +P+P E+E+AK+VL DHE +L+DAI KL +IS+
Sbjct: 361 KKALGEIEILHTETLLKEVEKVFGSVNPNPAEVEKAKRVLRDHELALMDAIAKLEEISD 391

BLAST of Cucsa.050020 vs. Swiss-Prot
Match: EML1_ARATH (Protein EMSY-LIKE 1 OS=Arabidopsis thaliana GN=EML1 PE=1 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 1.0e-51
Identity = 114/240 (47.50%), Positives = 153/240 (63.75%), Query Frame = 1

Query: 204 PGASSAKQYQTSVPS--------GRNQVGNRVSSGALEPAEGATLNPLIGRKVRTRWPDD 263
           P  S+A++ Q + PS        G     NR+ S  +   E A    LIGRKV T+WP+D
Sbjct: 91  PTFSAARKKQKTFPSYNPSIGATGNRSFNNRLVSSGISGNESA--EALIGRKVWTKWPED 150

Query: 264 NNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQWVDEDPGIPHRGGYG 323
           N+FYEA+IT YNA EGRHALVYDI +ANETWEWV+L EI PEDI+W  E+ G+    G+G
Sbjct: 151 NHFYEAIITQYNADEGRHALVYDIHAANETWEWVDLKEIPPEDIRWDGEESGVALNIGHG 210

Query: 324 GSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLP---SQNGIG--KKTSDDIRILHTETL 383
            +    NR   R     GRGRG    Q R++ +P    QNG G  + +SDDI + +T++L
Sbjct: 211 SASFRGNR---RGQIHGGRGRGPRIHQPRRELVPPPTQQNGSGGRRTSSDDIELFNTDSL 270

Query: 384 IREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEGGHRFSHGQSM 431
           ++EVERVF S HPDP+E+++AKK+L +HEQ+LI AI +L D S+ G  +G   +SH   M
Sbjct: 271 VKEVERVFDSTHPDPLELDKAKKMLKEHEQALIAAIARLADTSD-GEMDGDPPYSHDHPM 324


HSP 2 Score: 139.8 bits (351), Expect = 7.0e-32
Identity = 71/102 (69.61%), Positives = 85/102 (83.33%), Query Frame = 1

Query: 52  MEAQIHQLEKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADD 111
           ME QIHQLE+EAY++VLRAFKAQ+DAI+WEKESLITELRKELR+S++EHRELL RVN DD
Sbjct: 1   METQIHQLEQEAYTAVLRAFKAQSDAISWEKESLITELRKELRVSDDEHRELLSRVNKDD 60

Query: 112 MIRSIREWRQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQK 154
            I+ IR+WRQ G  Q  + +T Q   D +PSPT SA+RKKQK
Sbjct: 61  TIQRIRDWRQGGASQITRHATIQPF-DVLPSPTFSAARKKQK 101

BLAST of Cucsa.050020 vs. Swiss-Prot
Match: EML2_ARATH (Protein EMSY-LIKE 2 OS=Arabidopsis thaliana GN=EML2 PE=1 SV=2)

HSP 1 Score: 202.2 bits (513), Expect = 1.1e-50
Identity = 113/230 (49.13%), Positives = 152/230 (66.09%), Query Frame = 1

Query: 197 KKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATLNPLIGRKVRTRWPDDN 256
           KKQK+         Q   S+ S R++  N     A EPAE      LIGRKV T+WP+DN
Sbjct: 95  KKQKTF--------QSYPSIGSTRSKSFNNRVVSANEPAEA-----LIGRKVWTKWPEDN 154

Query: 257 NFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQWVDEDPGIPHRGGYGG 316
           +FYEAV+T YNA EGRHALVYDI + NETWEWV+L+EI  +DI+W  E+ G+    G+GG
Sbjct: 155 SFYEAVVTQYNANEGRHALVYDINTVNETWEWVDLNEIPTKDIRWDGEEDGVTLNVGHGG 214

Query: 317 SGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSDDIRILHTETLIREVER 376
              G  R   R  S  GRGRG P++Q R++ L ++NG G+K   +I + +T++L++EVER
Sbjct: 215 ---GTTRGNRRTLSHGGRGRG-PRTQPRREHLATENGGGRKFFGEIELFNTDSLVKEVER 274

Query: 377 VFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEGGHRFSH 427
           VF SN PDP E+++AKK+L +HEQ+LI AI +L D S+  SD G   +SH
Sbjct: 275 VFDSNLPDPHELDKAKKLLKEHEQALIAAIARLTDASDYESD-GEEPYSH 306


HSP 2 Score: 112.1 bits (279), Expect = 1.6e-23
Identity = 67/137 (48.91%), Positives = 95/137 (69.34%), Query Frame = 1

Query: 52  MEAQIHQLEKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADD 111
           MEAQIH LE+EAYS+VLRAF+AQAD  +W+K +++T LRKELR+S++E+R+LL  V+ DD
Sbjct: 1   MEAQIHILEQEAYSAVLRAFQAQADEFSWDKATVMTNLRKELRISDDENRQLLNNVHNDD 60

Query: 112 MIRSIREWRQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFH 171
           +I+ IR+ R  GG+Q   +   Q++ D  PSPT SASRKKQK     P    T  S SF+
Sbjct: 61  LIKRIRDSRPRGGNQ---VVRHQSL-DVHPSPTFSASRKKQKTFQSYPSIGST-RSKSFN 120

Query: 172 QQNVPPPHQPSSSVAKR 189
            + V   ++P+ ++  R
Sbjct: 121 NR-VVSANEPAEALIGR 131

BLAST of Cucsa.050020 vs. Swiss-Prot
Match: MSH6_ARATH (DNA mismatch repair protein MSH6 OS=Arabidopsis thaliana GN=MSH6 PE=1 SV=2)

HSP 1 Score: 55.1 bits (131), Expect = 2.3e-06
Identity = 47/174 (27.01%), Positives = 76/174 (43.68%), Query Frame = 1

Query: 128 GKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQPSSSVAK 187
           GK ++S +   P PSP+ S S KK   ++   P+S   PSPS  ++       PSS++  
Sbjct: 26  GKSASSSS--SPSPSPSPSLSNKKTPKSNNPNPKS-PSPSPSPPKKTPKLNPNPSSNLPA 85

Query: 188 RGAIP------PTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATLN 247
           R   P      P + K +K +L    +    Q+ V +  ++V                  
Sbjct: 86  RSPSPGPDTPSPVQSKFKKPLLVIGQTPSPPQSVVITYGDEV------------------ 145

Query: 248 PLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGS------ANETWEWV 290
             +G++VR  WP D  +Y+  +T Y+  EG+H + Y+ G         E  EWV
Sbjct: 146 --VGKQVRVYWPLDKKWYDGSVTFYDKGEGKHVVEYEDGEEESLDLGKEKTEWV 176

BLAST of Cucsa.050020 vs. TrEMBL
Match: A0A0A0K8F3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G429500 PE=4 SV=1)

HSP 1 Score: 883.6 bits (2282), Expect = 9.5e-254
Identity = 433/433 (100.00%), Positives = 433/433 (100.00%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60
           MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE
Sbjct: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60

Query: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120
           KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR
Sbjct: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120

Query: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180
           QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ
Sbjct: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180

Query: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL 240
           PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL
Sbjct: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL 240

Query: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300
           NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ
Sbjct: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300

Query: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360
           WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD
Sbjct: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360

Query: 361 DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG 420
           DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG
Sbjct: 361 DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG 420

Query: 421 GHRFSHGQSMDRE 434
           GHRFSHGQSMDRE
Sbjct: 421 GHRFSHGQSMDRE 433

BLAST of Cucsa.050020 vs. TrEMBL
Match: A0A0B2QHM2_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_026038 PE=4 SV=1)

HSP 1 Score: 686.8 bits (1771), Expect = 1.7e-194
Identity = 341/438 (77.85%), Positives = 374/438 (85.39%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGG-RVSGNGRSVMGSVPYPRMYAETDMEAQIHQL 60
           MDYEPYDSSGTDDDLPP+HQNRI+RGGG R++GNGRS + S+PYPRMY E DME QIHQL
Sbjct: 1   MDYEPYDSSGTDDDLPPTHQNRISRGGGGRLAGNGRSAVASIPYPRMYGEIDMETQIHQL 60

Query: 61  EKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREW 120
           E+EAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADD+IR IREW
Sbjct: 61  EQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDVIRRIREW 120

Query: 121 RQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPP-QSFTGPSPSFHQQNVPPP 180
           RQAGGHQPG LST Q +HD IPSPTVSASRKKQK+T  VPP +SF GPSP FH Q V  P
Sbjct: 121 RQAGGHQPGVLSTGQGLHDSIPSPTVSASRKKQKITPSVPPSRSFGGPSPPFHPQTVTAP 180

Query: 181 HQPSSSVAKRGAIPPTKGKKQK--SILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAE 240
           HQPSSS AKRG+ P +KGKK K   ILPG SS KQY +S P GRNQV NR   G  E AE
Sbjct: 181 HQPSSSAAKRGSAPGSKGKKHKPGQILPGVSSMKQYPSSGPGGRNQVSNRAVMG--EHAE 240

Query: 241 GATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISP 300
           GA+ + L+GR+VRTRWPDDNNFYEAVIT+YN A+GRH LVYD+GSANETWEWVNLSEISP
Sbjct: 241 GASFDSLVGRRVRTRWPDDNNFYEAVITNYNPADGRHNLVYDMGSANETWEWVNLSEISP 300

Query: 301 EDIQWVDEDPGIPHRGGYGGSGHGMNRSVGRDG-SGAGRGRGVPKSQSRKDFLPSQNGIG 360
           EDIQWV EDPGI HRGG+GG GHGMNRSVGRDG  GAGRGRG  K QSRKDFL SQNG+G
Sbjct: 301 EDIQWVGEDPGINHRGGFGGPGHGMNRSVGRDGVPGAGRGRGAAKGQSRKDFLSSQNGLG 360

Query: 361 KKTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEV 420
           KK  DDI+ILHT+TLI+EVERVF +NHPDP+E+E+AKKVL DHEQ+LIDAI KL D+S+ 
Sbjct: 361 KKVHDDIQILHTDTLIKEVERVFSANHPDPLEVEKAKKVLKDHEQALIDAIAKLNDLSDG 420

Query: 421 GSDEGGHRFSHGQSMDRE 434
            SD  GH FSH QSMDRE
Sbjct: 421 ESDGAGHHFSHAQSMDRE 436

BLAST of Cucsa.050020 vs. TrEMBL
Match: I1LVG3_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_12G240400 PE=4 SV=1)

HSP 1 Score: 686.8 bits (1771), Expect = 1.7e-194
Identity = 341/438 (77.85%), Positives = 374/438 (85.39%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGG-RVSGNGRSVMGSVPYPRMYAETDMEAQIHQL 60
           MDYEPYDSSGTDDDLPP+HQNRI+RGGG R++GNGRS + S+PYPRMY E DME QIHQL
Sbjct: 1   MDYEPYDSSGTDDDLPPTHQNRISRGGGGRLAGNGRSAVASIPYPRMYGEIDMETQIHQL 60

Query: 61  EKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREW 120
           E+EAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADD+IR IREW
Sbjct: 61  EQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDVIRRIREW 120

Query: 121 RQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPP-QSFTGPSPSFHQQNVPPP 180
           RQAGGHQPG LST Q +HD IPSPTVSASRKKQK+T  VPP +SF GPSP FH Q V  P
Sbjct: 121 RQAGGHQPGVLSTGQGLHDSIPSPTVSASRKKQKITPSVPPSRSFGGPSPPFHPQTVTAP 180

Query: 181 HQPSSSVAKRGAIPPTKGKKQK--SILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAE 240
           HQPSSS AKRG+ P +KGKK K   ILPG SS KQY +S P GRNQV NR   G  E AE
Sbjct: 181 HQPSSSAAKRGSAPGSKGKKHKPGQILPGVSSMKQYPSSGPGGRNQVSNRAVMG--EHAE 240

Query: 241 GATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISP 300
           GA+ + L+GR+VRTRWPDDNNFYEAVIT+YN A+GRH LVYD+GSANETWEWVNLSEISP
Sbjct: 241 GASFDSLVGRRVRTRWPDDNNFYEAVITNYNPADGRHNLVYDMGSANETWEWVNLSEISP 300

Query: 301 EDIQWVDEDPGIPHRGGYGGSGHGMNRSVGRDG-SGAGRGRGVPKSQSRKDFLPSQNGIG 360
           EDIQWV EDPGI HRGG+GG GHGMNRSVGRDG  GAGRGRG  K QSRKDFL SQNG+G
Sbjct: 301 EDIQWVGEDPGINHRGGFGGPGHGMNRSVGRDGVPGAGRGRGAAKGQSRKDFLSSQNGLG 360

Query: 361 KKTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEV 420
           KK  DDI+ILHT+TLI+EVERVF +NHPDP+E+E+AKKVL DHEQ+LIDAI KL D+S+ 
Sbjct: 361 KKVHDDIQILHTDTLIKEVERVFSANHPDPLEVEKAKKVLKDHEQALIDAIAKLNDLSDG 420

Query: 421 GSDEGGHRFSHGQSMDRE 434
            SD  GH FSH QSMDRE
Sbjct: 421 ESDGAGHHFSHAQSMDRE 436

BLAST of Cucsa.050020 vs. TrEMBL
Match: D7TNV2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0156g00360 PE=4 SV=1)

HSP 1 Score: 685.3 bits (1767), Expect = 5.0e-194
Identity = 343/437 (78.49%), Positives = 376/437 (86.04%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRS-VMGSVPYPRMYAETDMEAQIHQL 60
           MDYEP+DSSGTDDDLPP HQNRI R GGRV+GNGRS V+GS+PY RMY ETDME QIHQL
Sbjct: 1   MDYEPFDSSGTDDDLPPPHQNRIPR-GGRVAGNGRSAVVGSIPYSRMYGETDMETQIHQL 60

Query: 61  EKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREW 120
           E+EAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADD+IR IREW
Sbjct: 61  EQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDVIRRIREW 120

Query: 121 RQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPH 180
           RQAGG QPG L+T QA+HDPIPSPTVSASRKKQK+T  +P QSF GPS SFH Q +   +
Sbjct: 121 RQAGGLQPGMLTTGQAVHDPIPSPTVSASRKKQKITQSIPSQSFGGPSQSFHPQAIAASN 180

Query: 181 QPSSSVAKRGAIPPTKGKKQKSILPGASSAK--QYQTSVPSGRNQVGNRVSSGALEPAEG 240
           QPSSS AKRG I   KGKK KS+LPGASS K  QY +S P+GR QV NR   G  EPAE 
Sbjct: 181 QPSSSAAKRGPILGPKGKKHKSVLPGASSMKSMQYASSGPTGRGQVANR---GVNEPAEA 240

Query: 241 ATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPE 300
           AT +PLIGRKVRTRWPDDNNFYEAVITDYN  EGRHALVYD+GSANETWEWVNLSEISPE
Sbjct: 241 ATFDPLIGRKVRTRWPDDNNFYEAVITDYNPVEGRHALVYDMGSANETWEWVNLSEISPE 300

Query: 301 DIQWVDEDPGIPHRGGYGGSGHGMNRSVGRDG-SGAGRGRGVPKSQSRKDFLPSQNGIGK 360
           DIQW  EDPGI  RGGY GSGHGMNR+VGRD   GAGRGRG+PK QS+KDFLPSQNGIGK
Sbjct: 301 DIQWDGEDPGISRRGGYDGSGHGMNRAVGRDSVQGAGRGRGLPKGQSKKDFLPSQNGIGK 360

Query: 361 KTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVG 420
           K  DDI+ILHT+TLI+EVERVF +NHPDP+EI++AKKVL +HEQ+L+DAI +L DIS+  
Sbjct: 361 KAPDDIQILHTDTLIKEVERVFTANHPDPLEIDKAKKVLKEHEQALLDAIGRLADISDGE 420

Query: 421 SDEGGHRFSHGQSMDRE 434
           SDEGGH+FSHG SMDRE
Sbjct: 421 SDEGGHQFSHGHSMDRE 433

BLAST of Cucsa.050020 vs. TrEMBL
Match: A0A061DVP8_THECC (Emsy N Terminus/ plant Tudor-like domains-containing protein isoform 1 OS=Theobroma cacao GN=TCM_005943 PE=4 SV=1)

HSP 1 Score: 676.0 bits (1743), Expect = 3.0e-191
Identity = 343/438 (78.31%), Positives = 370/438 (84.47%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60
           MDYEPYDSSGTDDDLPPSHQNRI R G R++GNGRS + SVPYPR+Y ETDMEAQIHQLE
Sbjct: 1   MDYEPYDSSGTDDDLPPSHQNRIPRTG-RIAGNGRSAVASVPYPRIYGETDMEAQIHQLE 60

Query: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120
           +EAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADD+IR IREWR
Sbjct: 61  QEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDVIRRIREWR 120

Query: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180
           Q G  QP  L+  QA+HDP+PSPTVSAS KKQK+T  VP QSF GPSP FH Q V P HQ
Sbjct: 121 QTGALQPSMLNAGQAVHDPVPSPTVSASHKKQKITQSVPSQSFGGPSPPFHPQAVAPSHQ 180

Query: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAK--QYQTSVPSGRNQVGNRVSSGAL---EPA 240
           PSSS AKRG I  +KGKK K  LPGA S K  QY ++ P+GR QV NRVSSG     EPA
Sbjct: 181 PSSSAAKRGPITGSKGKKHKPSLPGAPSMKSMQYPSTGPAGRGQVVNRVSSGTALVSEPA 240

Query: 241 EGATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEIS 300
           EGAT +PLIG+KVRTRWPDDNNFYEAVITDYN+ EGRHALVYDIG+ANETWEWVNLSEIS
Sbjct: 241 EGATFDPLIGKKVRTRWPDDNNFYEAVITDYNSVEGRHALVYDIGTANETWEWVNLSEIS 300

Query: 301 PEDIQWVDEDPGIPHRGGYGGSGHGMNRSVGRDG-SGAGRGRGVPKSQSRKDFLPSQNGI 360
           PEDIQW  E PGIPHRG YGG GHGMNRSVGRDG  GAGRGRG  K  SRKDFLPSQNGI
Sbjct: 301 PEDIQWESEVPGIPHRGVYGGPGHGMNRSVGRDGVPGAGRGRGFAKGPSRKDFLPSQNGI 360

Query: 361 GKKTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISE 420
           GKK  DDI+ILHT+TLI+EVERVFG+NHPDP+EIE+AKKVL +HEQSLIDAI KL DIS+
Sbjct: 361 GKKALDDIQILHTDTLIKEVERVFGTNHPDPLEIEKAKKVLKEHEQSLIDAIAKLTDISD 420

Query: 421 VGSDEGGHRFSHGQSMDR 433
             SDEGG +F  GQ MDR
Sbjct: 421 GESDEGGLQF--GQPMDR 435

BLAST of Cucsa.050020 vs. TAIR10
Match: AT2G44440.1 (AT2G44440.1 Emsy N Terminus (ENT) domain-containing protein)

HSP 1 Score: 474.6 bits (1220), Expect = 6.7e-134
Identity = 254/435 (58.39%), Positives = 303/435 (69.66%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGG-----RVSGNGRSVMGSVPYPRMYAE--TDME 60
           MD +  DSSGTDDDLPPSH  R+ RGGG     RV+GNGR +     YP+MY +   DME
Sbjct: 1   MDCKSSDSSGTDDDLPPSH--RVPRGGGGGRGGRVAGNGRPLNLPPSYPKMYDDLAADME 60

Query: 61  AQIHQLEKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMI 120
           AQIHQ+EKEAY SVLRAFKAQ DAI+WEKES+ITELRKEL LSNEEHRELLGRVN+DD I
Sbjct: 61  AQIHQIEKEAYISVLRAFKAQGDAISWEKESVITELRKELSLSNEEHRELLGRVNSDDTI 120

Query: 121 RSIREWRQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQ 180
           R IREWRQ+GG QP   + +Q +HD +PSP+VSAS K  K    +P Q F   SPSFH Q
Sbjct: 121 RRIREWRQSGGMQPSMRNAAQVVHDTLPSPSVSASMKTHKPNQPIPSQPFASSSPSFHPQ 180

Query: 181 NVPPPHQPSSSVAKRGAIPPTKGKKQKSILPGASSAKQ--YQTSVPSGRNQVGNR---VS 240
              P H  +SS AKRG +P  KGKK K + PG+SS K   Y  S    R QV NR   V 
Sbjct: 181 -ADPTHPFASSTAKRGPVPIVKGKKHKPVFPGSSSTKHAPYHPSDQPPRGQVMNRLPSVP 240

Query: 241 SGALEPAEGATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEW 300
           + + EP  G      +GR+VRT+WP+DN FY+A+IT YN  EGRHALVYDI + +ETWEW
Sbjct: 241 ASSSEPTNGIDPESFLGRRVRTKWPEDNTFYDAIITQYNPVEGRHALVYDIATPSETWEW 300

Query: 301 VNLSEISPEDIQWVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFL 360
           V LSEISP DI+W+ EDPG+ +R  Y G GHG+NR+ G       RG G+ K+  RK F 
Sbjct: 301 VRLSEISPGDIEWIGEDPGLGNR--YNGQGHGLNRTTG-PNCVPQRGSGLEKNTIRKGFR 360

Query: 361 PSQNGIGKKTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINK 420
            SQNG GKK   DIRI  T+ LIREVERV  S++PDP E+ERAK+VL +HE +L+ AI K
Sbjct: 361 TSQNGTGKKKHLDIRIRQTDVLIREVERVLRSHNPDPYEVERAKRVLEEHEHALVGAIAK 420

Query: 421 LGDISEVGSDEGGHR 424
           LGDIS+ G +EG  R
Sbjct: 421 LGDISD-GENEGAFR 428

BLAST of Cucsa.050020 vs. TAIR10
Match: AT5G13020.1 (AT5G13020.1 Emsy N Terminus (ENT)/ plant Tudor-like domains-containing protein)

HSP 1 Score: 437.6 bits (1124), Expect = 9.1e-123
Identity = 243/419 (58.00%), Positives = 293/419 (69.93%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGR-SVMGSVPYPRMYAETDMEAQIHQL 60
           MDY P DSSGTDDDLPPSHQ R  R   R +GNGR SV+ S P  R++ E  ME QIH +
Sbjct: 1   MDYRPSDSSGTDDDLPPSHQGRYQRNA-RPTGNGRPSVLNSAPLSRVHNE--METQIHLI 60

Query: 61  EKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREW 120
           E+EAYSS+LRAFKAQ+DAITWEKESLITELRKELR+S+EEHRELL RVNAD+MIR IREW
Sbjct: 61  EQEAYSSILRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADEMIRRIREW 120

Query: 121 RQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPH 180
           R+A   Q    S  Q +HD  PSP VS SRKKQK +  +   +   PSPS H     P  
Sbjct: 121 RKANSLQS---SVPQLVHDA-PSPAVSGSRKKQKTSQSIASLAMGPPSPSLH-----PSM 180

Query: 181 QPSSSVAKRGAIPP-TKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGA 240
           QPSSS  +RG  PP  K KK K+ +       QY ++  +GR Q G   +    EP E  
Sbjct: 181 QPSSSALRRGGPPPGPKTKKPKTSM-------QYPSTGIAGRPQAGALTN----EPGESG 240

Query: 241 TLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPED 300
           + +PL+GRKV T+WPDDN +YEAVITDYN  EGRHALVYDI SANETWEWVNL EISP D
Sbjct: 241 SYDPLVGRKVWTKWPDDNQYYEAVITDYNPVEGRHALVYDINSANETWEWVNLKEISPGD 300

Query: 301 IQWVDEDPGIPHRGGYGGSGHGMNRSVGRDG---SGAGRGRGVPKSQSRKDFLPSQNGIG 360
           I+W  EDPGI  +GG+ G G G  +++ R G   +  GRGRG  + Q  K    +QNGIG
Sbjct: 301 IRWEGEDPGISRKGGHPGQGRG-TKTMARGGPASNAGGRGRGSMRMQQPK----TQNGIG 360

Query: 361 KKTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISE 415
           KK   +I ILHTETL++EVE+VFGS +P+P E+E+AK+VL DHE +L+DAI KL +IS+
Sbjct: 361 KKALGEIEILHTETLLKEVEKVFGSVNPNPAEVEKAKRVLRDHELALMDAIAKLEEISD 391

BLAST of Cucsa.050020 vs. TAIR10
Match: AT5G06780.1 (AT5G06780.1 Emsy N Terminus (ENT)/ plant Tudor-like domains-containing protein)

HSP 1 Score: 202.2 bits (513), Expect = 6.5e-52
Identity = 113/230 (49.13%), Positives = 152/230 (66.09%), Query Frame = 1

Query: 197 KKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATLNPLIGRKVRTRWPDDN 256
           KKQK+         Q   S+ S R++  N     A EPAE      LIGRKV T+WP+DN
Sbjct: 102 KKQKTF--------QSYPSIGSTRSKSFNNRVVSANEPAEA-----LIGRKVWTKWPEDN 161

Query: 257 NFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQWVDEDPGIPHRGGYGG 316
           +FYEAV+T YNA EGRHALVYDI + NETWEWV+L+EI  +DI+W  E+ G+    G+GG
Sbjct: 162 SFYEAVVTQYNANEGRHALVYDINTVNETWEWVDLNEIPTKDIRWDGEEDGVTLNVGHGG 221

Query: 317 SGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSDDIRILHTETLIREVER 376
              G  R   R  S  GRGRG P++Q R++ L ++NG G+K   +I + +T++L++EVER
Sbjct: 222 ---GTTRGNRRTLSHGGRGRG-PRTQPRREHLATENGGGRKFFGEIELFNTDSLVKEVER 281

Query: 377 VFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEGGHRFSH 427
           VF SN PDP E+++AKK+L +HEQ+LI AI +L D S+  SD G   +SH
Sbjct: 282 VFDSNLPDPHELDKAKKLLKEHEQALIAAIARLTDASDYESD-GEEPYSH 313


HSP 2 Score: 112.5 bits (280), Expect = 6.7e-25
Identity = 67/138 (48.55%), Positives = 96/138 (69.57%), Query Frame = 1

Query: 51  DMEAQIHQLEKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNAD 110
           +MEAQIH LE+EAYS+VLRAF+AQAD  +W+K +++T LRKELR+S++E+R+LL  V+ D
Sbjct: 7   NMEAQIHILEQEAYSAVLRAFQAQADEFSWDKATVMTNLRKELRISDDENRQLLNNVHND 66

Query: 111 DMIRSIREWRQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSF 170
           D+I+ IR+ R  GG+Q   +   Q++ D  PSPT SASRKKQK     P    T  S SF
Sbjct: 67  DLIKRIRDSRPRGGNQ---VVRHQSL-DVHPSPTFSASRKKQKTFQSYPSIGST-RSKSF 126

Query: 171 HQQNVPPPHQPSSSVAKR 189
           + + V   ++P+ ++  R
Sbjct: 127 NNR-VVSANEPAEALIGR 138

BLAST of Cucsa.050020 vs. TAIR10
Match: AT3G12140.3 (AT3G12140.3 Emsy N Terminus (ENT)/ plant Tudor-like domains-containing protein)

HSP 1 Score: 201.1 bits (510), Expect = 1.4e-51
Identity = 109/224 (48.66%), Positives = 146/224 (65.18%), Query Frame = 1

Query: 204 PGASSAKQYQTSVPS--------GRNQVGNRVSSGALEPAEGATLNPLIGRKVRTRWPDD 263
           P  S+A++ Q + PS        G     NR+ S  +   E A    LIGRKV T+WP+D
Sbjct: 91  PTFSAARKKQKTFPSYNPSIGATGNRSFNNRLVSSGISGNESA--EALIGRKVWTKWPED 150

Query: 264 NNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQWVDEDPGIPHRGGYG 323
           N+FYEA+IT YNA EGRHALVYDI +ANETWEWV+L EI PEDI+W  E+ G+    G+G
Sbjct: 151 NHFYEAIITQYNADEGRHALVYDIHAANETWEWVDLKEIPPEDIRWDGEESGVALNIGHG 210

Query: 324 GSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLP---SQNGIG--KKTSDDIRILHTETL 383
            +    NR   R     GRGRG    Q R++ +P    QNG G  + +SDDI + +T++L
Sbjct: 211 SASFRGNR---RGQIHGGRGRGPRIHQPRRELVPPPTQQNGSGGRRTSSDDIELFNTDSL 270

Query: 384 IREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISE 415
           ++EVERVF S HPDP+E+++AKK+L +HEQ+LI AI +L D S+
Sbjct: 271 VKEVERVFDSTHPDPLELDKAKKMLKEHEQALIAAIARLADTSD 309


HSP 2 Score: 139.8 bits (351), Expect = 3.9e-33
Identity = 71/102 (69.61%), Positives = 85/102 (83.33%), Query Frame = 1

Query: 52  MEAQIHQLEKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADD 111
           ME QIHQLE+EAY++VLRAFKAQ+DAI+WEKESLITELRKELR+S++EHRELL RVN DD
Sbjct: 1   METQIHQLEQEAYTAVLRAFKAQSDAISWEKESLITELRKELRVSDDEHRELLSRVNKDD 60

Query: 112 MIRSIREWRQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQK 154
            I+ IR+WRQ G  Q  + +T Q   D +PSPT SA+RKKQK
Sbjct: 61  TIQRIRDWRQGGASQITRHATIQPF-DVLPSPTFSAARKKQK 101

BLAST of Cucsa.050020 vs. TAIR10
Match: AT3G57970.1 (AT3G57970.1 Emsy N Terminus (ENT)/ plant Tudor-like domains-containing protein)

HSP 1 Score: 79.3 bits (194), Expect = 6.3e-15
Identity = 65/266 (24.44%), Positives = 117/266 (43.98%), Query Frame = 1

Query: 46  MYAETDMEAQIHQLEKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLG 105
           +++  D + ++  L+  AY  VL AFKA++ AI+  +  ++ EL KEL++  + H+    
Sbjct: 38  LFSPLDKKVKLSMLQDRAYYYVLHAFKAESPAISSSRIVIMQELLKELKIEYKTHQTYHN 97

Query: 106 RVNADDMIRSIREWRQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTG 165
            + AD M+  +R    A                       S  ++ QKLT    P+  TG
Sbjct: 98  AIEADPMVLQLRNVSLA-----------------------SDEKEMQKLTVAEEPE-ITG 157

Query: 166 PSPSFHQQNVPPPHQPSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGN 225
                    +      ++ +  R  + P K K  K + P   +  Q + + PS  +    
Sbjct: 158 VDGQPLIIRIKLNKDDNAKIQDR-VLAPAKIKDGKVLAP---AKIQDEEAKPSSTSDDSP 217

Query: 226 RVSSGALEPAEGATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSAN-- 285
             S G + P        L+GR+V  + PD++ + E +IT Y+A    H L+    + +  
Sbjct: 218 DSSWGQVSPGS------LVGRRVHIQMPDEDEYIEFLITKYDANTETHHLLSAFSNKDYE 269

Query: 286 ETWEWVNLSEISPEDIQWVDEDPGIP 310
           +   WV+L  +  ED++W + DP +P
Sbjct: 278 DPCNWVDLRHVQAEDMKWPEGDPDLP 269

BLAST of Cucsa.050020 vs. NCBI nr
Match: gi|449436251|ref|XP_004135906.1| (PREDICTED: protein EMSY-LIKE 3 isoform X2 [Cucumis sativus])

HSP 1 Score: 883.6 bits (2282), Expect = 1.4e-253
Identity = 433/433 (100.00%), Positives = 433/433 (100.00%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60
           MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE
Sbjct: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60

Query: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120
           KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR
Sbjct: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120

Query: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180
           QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ
Sbjct: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180

Query: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL 240
           PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL
Sbjct: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL 240

Query: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300
           NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ
Sbjct: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300

Query: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360
           WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD
Sbjct: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360

Query: 361 DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG 420
           DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG
Sbjct: 361 DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG 420

Query: 421 GHRFSHGQSMDRE 434
           GHRFSHGQSMDRE
Sbjct: 421 GHRFSHGQSMDRE 433

BLAST of Cucsa.050020 vs. NCBI nr
Match: gi|659122643|ref|XP_008461251.1| (PREDICTED: uncharacterized protein LOC103499891 isoform X2 [Cucumis melo])

HSP 1 Score: 879.4 bits (2271), Expect = 2.6e-252
Identity = 430/433 (99.31%), Positives = 432/433 (99.77%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60
           MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE
Sbjct: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60

Query: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120
           KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR
Sbjct: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120

Query: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180
           QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQ VPPPHQ
Sbjct: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQTVPPPHQ 180

Query: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL 240
           PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGA+EPAEGATL
Sbjct: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGAIEPAEGATL 240

Query: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300
           NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ
Sbjct: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300

Query: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360
           WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD
Sbjct: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360

Query: 361 DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG 420
           DIRILHTETLIREVERVFGSNHPDP+EIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG
Sbjct: 361 DIRILHTETLIREVERVFGSNHPDPLEIERAKKVLNDHEQSLIDAINKLGDISEVGSDEG 420

Query: 421 GHRFSHGQSMDRE 434
           GHRFSHGQSMDRE
Sbjct: 421 GHRFSHGQSMDRE 433

BLAST of Cucsa.050020 vs. NCBI nr
Match: gi|778728730|ref|XP_011659468.1| (PREDICTED: protein EMSY-LIKE 3 isoform X1 [Cucumis sativus])

HSP 1 Score: 869.0 bits (2244), Expect = 3.5e-249
Identity = 433/460 (94.13%), Positives = 433/460 (94.13%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60
           MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE
Sbjct: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60

Query: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120
           KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR
Sbjct: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120

Query: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180
           QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ
Sbjct: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180

Query: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL 240
           PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL
Sbjct: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL 240

Query: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300
           NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ
Sbjct: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300

Query: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360
           WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD
Sbjct: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360

Query: 361 DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLN------------------------ 420
           DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLN                        
Sbjct: 361 DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNVRSIRCTCSLSFYSITFLINILTY 420

Query: 421 ---DHEQSLIDAINKLGDISEVGSDEGGHRFSHGQSMDRE 434
              DHEQSLIDAINKLGDISEVGSDEGGHRFSHGQSMDRE
Sbjct: 421 SVQDHEQSLIDAINKLGDISEVGSDEGGHRFSHGQSMDRE 460

BLAST of Cucsa.050020 vs. NCBI nr
Match: gi|659122641|ref|XP_008461250.1| (PREDICTED: uncharacterized protein LOC103499891 isoform X1 [Cucumis melo])

HSP 1 Score: 864.8 bits (2233), Expect = 6.6e-248
Identity = 430/460 (93.48%), Positives = 432/460 (93.91%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60
           MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE
Sbjct: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGGRVSGNGRSVMGSVPYPRMYAETDMEAQIHQLE 60

Query: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120
           KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR
Sbjct: 61  KEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREWR 120

Query: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQNVPPPHQ 180
           QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQ VPPPHQ
Sbjct: 121 QAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPPQSFTGPSPSFHQQTVPPPHQ 180

Query: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAEGATL 240
           PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGA+EPAEGATL
Sbjct: 181 PSSSVAKRGAIPPTKGKKQKSILPGASSAKQYQTSVPSGRNQVGNRVSSGAIEPAEGATL 240

Query: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300
           NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ
Sbjct: 241 NPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISPEDIQ 300

Query: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360
           WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD
Sbjct: 301 WVDEDPGIPHRGGYGGSGHGMNRSVGRDGSGAGRGRGVPKSQSRKDFLPSQNGIGKKTSD 360

Query: 361 DIRILHTETLIREVERVFGSNHPDPVEIERAKKVLN------------------------ 420
           DIRILHTETLIREVERVFGSNHPDP+EIERAKKVLN                        
Sbjct: 361 DIRILHTETLIREVERVFGSNHPDPLEIERAKKVLNVRSIRCSCSLSFYSITFLINILTY 420

Query: 421 ---DHEQSLIDAINKLGDISEVGSDEGGHRFSHGQSMDRE 434
              DHEQSLIDAINKLGDISEVGSDEGGHRFSHGQSMDRE
Sbjct: 421 SVQDHEQSLIDAINKLGDISEVGSDEGGHRFSHGQSMDRE 460

BLAST of Cucsa.050020 vs. NCBI nr
Match: gi|356544200|ref|XP_003540542.1| (PREDICTED: protein EMSY-LIKE 3 [Glycine max])

HSP 1 Score: 686.8 bits (1771), Expect = 2.5e-194
Identity = 341/438 (77.85%), Positives = 374/438 (85.39%), Query Frame = 1

Query: 1   MDYEPYDSSGTDDDLPPSHQNRIARGGG-RVSGNGRSVMGSVPYPRMYAETDMEAQIHQL 60
           MDYEPYDSSGTDDDLPP+HQNRI+RGGG R++GNGRS + S+PYPRMY E DME QIHQL
Sbjct: 1   MDYEPYDSSGTDDDLPPTHQNRISRGGGGRLAGNGRSAVASIPYPRMYGEIDMETQIHQL 60

Query: 61  EKEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDMIRSIREW 120
           E+EAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADD+IR IREW
Sbjct: 61  EQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNADDVIRRIREW 120

Query: 121 RQAGGHQPGKLSTSQAIHDPIPSPTVSASRKKQKLTHLVPP-QSFTGPSPSFHQQNVPPP 180
           RQAGGHQPG LST Q +HD IPSPTVSASRKKQK+T  VPP +SF GPSP FH Q V  P
Sbjct: 121 RQAGGHQPGVLSTGQGLHDSIPSPTVSASRKKQKITPSVPPSRSFGGPSPPFHPQTVTAP 180

Query: 181 HQPSSSVAKRGAIPPTKGKKQK--SILPGASSAKQYQTSVPSGRNQVGNRVSSGALEPAE 240
           HQPSSS AKRG+ P +KGKK K   ILPG SS KQY +S P GRNQV NR   G  E AE
Sbjct: 181 HQPSSSAAKRGSAPGSKGKKHKPGQILPGVSSMKQYPSSGPGGRNQVSNRAVMG--EHAE 240

Query: 241 GATLNPLIGRKVRTRWPDDNNFYEAVITDYNAAEGRHALVYDIGSANETWEWVNLSEISP 300
           GA+ + L+GR+VRTRWPDDNNFYEAVIT+YN A+GRH LVYD+GSANETWEWVNLSEISP
Sbjct: 241 GASFDSLVGRRVRTRWPDDNNFYEAVITNYNPADGRHNLVYDMGSANETWEWVNLSEISP 300

Query: 301 EDIQWVDEDPGIPHRGGYGGSGHGMNRSVGRDG-SGAGRGRGVPKSQSRKDFLPSQNGIG 360
           EDIQWV EDPGI HRGG+GG GHGMNRSVGRDG  GAGRGRG  K QSRKDFL SQNG+G
Sbjct: 301 EDIQWVGEDPGINHRGGFGGPGHGMNRSVGRDGVPGAGRGRGAAKGQSRKDFLSSQNGLG 360

Query: 361 KKTSDDIRILHTETLIREVERVFGSNHPDPVEIERAKKVLNDHEQSLIDAINKLGDISEV 420
           KK  DDI+ILHT+TLI+EVERVF +NHPDP+E+E+AKKVL DHEQ+LIDAI KL D+S+ 
Sbjct: 361 KKVHDDIQILHTDTLIKEVERVFSANHPDPLEVEKAKKVLKDHEQALIDAIAKLNDLSDG 420

Query: 421 GSDEGGHRFSHGQSMDRE 434
            SD  GH FSH QSMDRE
Sbjct: 421 ESDGAGHHFSHAQSMDRE 436

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
EML4_ARATH  1.2e-132  58.39     Protein EMSY-LIKE 4 OS=Arabidopsis thaliana GN=EML4 PE=2 SV=1
EML3_ARATH  1.6e-121  58.00     Protein EMSY-LIKE 3 OS=Arabidopsis thaliana GN=EML3 PE=1 SV=1
EML1_ARATH  1.0e-51   47.50     Protein EMSY-LIKE 1 OS=Arabidopsis thaliana GN=EML1 PE=1 SV=1
EML2_ARATH  1.1e-50   49.13     Protein EMSY-LIKE 2 OS=Arabidopsis thaliana GN=EML2 PE=1 SV=2
MSH6_ARATH  2.3e-06   27.01     DNA mismatch repair protein MSH6 OS=Arabidopsis thaliana GN=MSH6 PE=1 SV=2
Match Name        E-value   Identity  Description
A0A0A0K8F3_CUCSA  9.5e-254  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_7G429500 PE=4 SV=1
A0A0B2QHM2_GLYSO  1.7e-194  77.85     Uncharacterized protein OS=Glycine soja GN=glysoja_026038 PE=4 SV=1
I1LVG3_SOYBN      1.7e-194  77.85     Uncharacterized protein OS=Glycine max GN=GLYMA_12G240400 PE=4 SV=1
D7TNV2_VITVI      5.0e-194  78.49     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0156g00360 PE=4 SV=...
A0A061DVP8_THECC  3.0e-191  78.31     Emsy N Terminus/ plant Tudor-like domains-containing protein isoform 1 OS=Theobr...
Match Name   E-value   Identity  Description
AT2G44440.1  6.7e-134  58.39     Emsy N Terminus (ENT) domain-containing protein
AT5G13020.1  9.1e-123  58.00     Emsy N Terminus (ENT)/ plant Tudor-like domains-containing protein
AT5G06780.1  6.5e-52   49.13     Emsy N Terminus (ENT)/ plant Tudor-like domains-containing protein
AT3G12140.3  1.4e-51   48.66     Emsy N Terminus (ENT)/ plant Tudor-like domains-containing protein
AT3G57970.1  6.3e-15   24.44     Emsy N Terminus (ENT)/ plant Tudor-like domains-containing protein
Match Name                        E-value   Identity  Description
gi|449436251|ref|XP_004135906.1|  1.4e-253  100.00    PREDICTED: protein EMSY-LIKE 3 isoform X2 [Cucumis sativus]
gi|659122643|ref|XP_008461251.1|  2.6e-252  99.31     PREDICTED: uncharacterized protein LOC103499891 isoform X2 [Cucumis melo]
gi|778728730|ref|XP_011659468.1|  3.5e-249  94.13     PREDICTED: protein EMSY-LIKE 3 isoform X1 [Cucumis sativus]
gi|659122641|ref|XP_008461250.1|  6.6e-248  93.48     PREDICTED: uncharacterized protein LOC103499891 isoform X1 [Cucumis melo]
gi|356544200|ref|XP_003540542.1|  2.5e-194  77.85     PREDICTED: protein EMSY-LIKE 3 [Glycine max]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005491  ENT_dom
Vocabulary: Biological Process
Term        Definition
GO:0050832  defense response to fungus
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0050832 defense response to fungus
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.050020.2  Cucsa.050020.2  mRNA
Cucsa.050020.1  Cucsa.050020.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description   Source   Source Term        Source Description                                       Alignment
IPR005491  ENT domain        PFAM     PF03735            ENT                                                      coord: 54..122; score: 2.2
IPR005491  ENT domain        SMART    SM01191            ENT_2                                                    coord: 52..125; score: 3.7
IPR005491  ENT domain        PROFILE  PS51138            ENT                                                      coord: 52..139; score: 24
IPR005491  ENT domain        unknown  SSF158639          ENT-like                                                 coord: 47..163; score: 1.31
None       No IPR available  unknown  Coil               Coil                                                     coord: 86..106; scor
None       No IPR available  GENE3D   G3DSA:2.30.30.140                                                           coord: 242..277; score: 9.
None       No IPR available  PANTHER  PTHR33432:SF5      EMSY N TERMINUS (ENT) DOMAIN-CONTAINING PROTEIN-RELATED  coord: 2..373; score: 2.4E
None       No IPR available  unknown  SSF63748           Tudor/PWWP/MBT                                           coord: 242..282; score: 8.2

The following gene(s) are orthologous to this gene:
Gene          Orthologue       Organism                      Block
Cucsa.050020  Cla015274        Watermelon (97103) v1         cgywmB031
Cucsa.050020  Csa7G429500      Cucumber (Chinese Long) v2    cgycuB031
Cucsa.050020  MELO3C023516     Melon (DHL92) v3.5.1          cgymeB029
Cucsa.050020  ClCG09G003820    Watermelon (Charleston Gray)  cgywcgB033
Cucsa.050020  CSPI07G19590     Wild cucumber (PI 183967)     cgycpiB032
Cucsa.050020  CmaCh08G005540   Cucurbita maxima (Rimu)       cgycmaB0052
Cucsa.050020  CmaCh17G009320   Cucurbita maxima (Rimu)       cgycmaB0050
Cucsa.050020  CmoCh17G009040   Cucurbita moschata (Rifu)     cgycmoB0046
Cucsa.050020  CmoCh10G010100   Cucurbita moschata (Rifu)     cgycmoB0045
Cucsa.050020  CmoCh08G005460   Cucurbita moschata (Rifu)     cgycmoB0047
Cucsa.050020  Lsi02G026100     Bottle gourd (USVL1VR-Ls)     cgylsiB030
Cucsa.050020  Lsi06G004760     Bottle gourd (USVL1VR-Ls)     cgylsiB032
Cucsa.050020  Lsi03G017280     Bottle gourd (USVL1VR-Ls)     cgylsiB031
Cucsa.050020  Cp4.1LG17g08350  Cucurbita pepo (Zucchini)     cgycpeB0048
Cucsa.050020  Cp4.1LG12g06900  Cucurbita pepo (Zucchini)     cgycpeB0046
Cucsa.050020  Cp4.1LG18g01450  Cucurbita pepo (Zucchini)     cgycpeB0049
Cucsa.050020  MELO3C009636.2   Melon (DHL92) v3.6.1          cgymedB028
Cucsa.050020  MELO3C023516.2   Melon (DHL92) v3.6.1          cgymedB027
Cucsa.050020  CsaV3_1G042390   Cucumber (Chinese Long) v3    cgycucB031
Cucsa.050020  CsaV3_3G038920   Cucumber (Chinese Long) v3    cgycucB032
Cucsa.050020  CsaV3_7G031410   Cucumber (Chinese Long) v3    cgycucB033
Cucsa.050020  Cla97C09G166070  Watermelon (97103) v2         cgywmbB030
Cucsa.050020  Cla97C03G063950  Watermelon (97103) v2         cgywmbB029
Cucsa.050020  Bhi09G000492     Wax gourd                     cgywgoB034
Cucsa.050020  Carg18700        Silver-seed gourd             carcgyB0413
Cucsa.050020  Carg13468        Silver-seed gourd             carcgyB0519
Cucsa.050020  Carg16958        Silver-seed gourd             carcgyB0537
The following gene(s) are paralogous to this gene:
Gene          Paralogue     Organism            Block
Cucsa.050020  Cucsa.143030  Cucumber (Gy14) v1  cgycgyB010
Cucsa.050020  Cucsa.242760  Cucumber (Gy14) v1  cgycgyB011
The following block(s) are covering this gene:
Gene          Organism                    Block
Cucsa.050020  Cucurbita maxima (Rimu)     cgycmaB0051
Cucsa.050020  Wild cucumber (PI 183967)   cgycpiB031
Cucsa.050020  Cucumber (Chinese Long) v2  cgycuB030
Cucsa.050020  Watermelon (97103) v1       cgywmB033
Cucsa.050020  Cucurbita pepo (Zucchini)   cgycpeB0047
Cucsa.050020  Silver-seed gourd           carcgyB0593