Csa3G119510 (gene) Cucumber (Chinese Long) v2

Name: Csa3G119510
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Retrotransposon protein, putative, Ty1-copia subclass; contains IPR001878 (Zinc finger, CCHC-type), IPR009044 (ssDNA-binding transcriptional regulator), IPR009057 (Homeodomain-like), IPR014876 (DEK, C-terminal)
Location: Chr3 : 6744339 .. 6747280 (-)
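The location string above can be turned into a genomic span with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr3 : 6744339 .. 6747280 (-)'.

    Returns (chromosome, start, end, strand).
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3 : 6744339 .. 6747280 (-)")
print(chrom, strand, span_length(start, end))  # Chr3 - 2942
```

The `(-)` strand flag means the gene is annotated on the reverse strand of Chr3.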
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TGAGGTCTTCTTTTCTTCTACTTTCAGTTTTGTTTCTTAGAAAGATGAACGACGAGACCCGACGGAGAATCGAGGAAAACGTGATTGAGGTATTGAAGAAATCGAGCATGGAAGATACGACGGAGTTTAAAGTTAGAAGCCAGGTCGAAGAACGGCTTGGAATCGATCTTTCTAATAAACAATGCAAGTTGCTGGTGAGGAACGTGGTAGAGAGCTTTTTGCTTTCAATGTCGGAGCGGGTGTGTATGGGGAAAGAGGATGAGCCAGGACCTAGTGTTCGTTACGAGAATAAAGCGGTGGAGCAGAAGATAGTACCAAAGAAGGAGTTCAACGATGATGGAGACCTTTTGATTTGCCGGGTAAAGTAAAATCATACCCTTCTTGATTCTTATGGTGTAGTGATTTTGAAAGAAAGGAGAAAGATTAGGGTTTTATCAACTTAGTTTTATGGAAAGATTCCGAACTGGATTCTTCTGTAGCTAAGCAATTGAATAGTTCTTTATGTTTCTGTGGCTAGATTGTTTTGTATTCTTTGTTCCTAGATCATTCTGTACCAAAAAGGTTAGAAACATAAACAACCCTGCATTTGTTGTCTATGTTTCTATCTGTTGTTTATGTTCCTATGGCGATTGTGTACCAAGATGTTAGATTGTAGAAGAAATTCGGGTGACCATTTTCTGTCATTTATGCATTATGATAGAATTGTCTATGCTTTATGAGTGTAGAAATCGATCATTTAGACAATAACTTAATAACATTGCTCCTCCTGCTGTTAAAATTGTCGATTCCTTTGTACCCTATTAGTAATTCTTTTCACTTATTAGTTAGGGATTTTTTTGGTTTGTTTCAGCTATCTAATAACAGGAGTGTGACAATTCATAAATTTAAAGGGGCACCTATGGTATCAGTTAGGCAGTATTATGAAAAAGATGGAAAACAGCTTCCAACTCTAAAAGGTGGGCATTATGCATTGGTCTCTTCCTTTTCTCTTATGGTGAATCTTTTCTTTTATTATGTCAGTTCTGTTAACTTCACACCTTCCTCCCTCCCCTTCCCCTTTTTTTCTTTTTTCACCTTGTTTTTAGCCACAACGGTTGGGAGTTTCTTTTGCTTGGGTCTGAACTAGTAACCCATCCTATAGGACTATTTTAATTATGAAAATTTTCTCTTTTAGGAATCAGCATGCCAACTGAGCAATGGTCAGTCTTTAAGAGTAATATTCCTGCTATAGCGGAAGCTATTTTGCAGATGAAAAGAAATAAAAGGTGAGGGAAGCATGATGATACTGTTGTTTCTTAACTTTCGAGTTACAGTTCATCAAATTTATTTTATTGAAAACTCTGAAGTTCTATGCTAGGTTTTCAGGTGTTAGTATCCAATCTAGATCCCAAATAGAATCTGTATCTAATATTTTTTGTTTTTAAGGCTAGGTTGATAACCATTTTGTATGTTTTTAGTTTTTGGTTTATGAAAATTGTGCTTAGTTTCTCACCGGTTTTGCATCTTTTTAAGGAAACATTTGAATTTTTAGTCTAATTCCAATAACTAAAAGTCTAAAACAAAATTTTAGAAACTACTTTAAAAACTCAGCTTGGATTTTGGAAACAATTGTTAGATTTAGATAACATCTTAAAGAAATTTAGTGTTTATAAACTTAATTTTCAAAACAAGCTATTTGTTTTGTTTGGTTGTAGCATTTAAAATCTGGACCATAGATGTGTGTTTGAACTATTTGATTGTATTATATCATGTAGATCTGAACATGATGCTGAAAAAATTGGTGCCTTCTCAAATCCAACAACTAGAGTAACTTCTCCAAAATATCCAATTGAAACTATTCGATTTGATGGAAAAAACTACAATGCATGGGCACATCAGATGGAGCTTTTGCTGCAGGACTTAAAGATTGCTTATGTTCTTTCTAATCAATGTCCGACTGCCGTGCTTGGGGAAGAATCAAGCTCTGGAAATGCTGCGCAATCCAAGGCTGCTGAACAGAA
ATGGATGAGGGATGACCACATGTGTCGCCGCAACATTCTTAACTCCCTCTCCGACAGGCTTTTTAATGAATATTCGAAAAAAACAATGAGTGCTAGTGAACTTTGGAAGGAGCTAAAATTGCTTTATCTTTTGGAGGAGTTTGGGACCAAGAGATCTCAAGTCAAAAAGTATCTAGAATTCAAAATGGTTGAGGAGAAGTCAATATTGGAGCAAGTTGAAGAACTTAATCACATTGCTGATTCTATTGGTTCTTCTGGAACGGTTATCGATGAGGATTTTCATGTTAGCGCAATTATCTCAAAGCTTCCACTTTCTTGGAAGAATGTCTGGGTGAACTTAATGCATGAGCAGTATCTTCCCCTTCGGAAATTGACAGATCGATTGAGAATTGAAGAACAATTACGTACACAAAAAAATTCACGTCTCTCAGGAGTGTCTTCTTCTCCTACTCCAAGAGGCCAACATCATGCTGCAAATCACCCATCAAAGATGGGAGACCCAAAGCCTGTAACCGTACCACTGAGAAAAAAGGAATGTCAAAAGGAGGTCAAAACTTTACTCTGCTTGGATTGCGGCAAGGAAGGGCACACATCTCCAAATTGTCCAACTAAGAAAGTCAACAATGAAGTACCTCGGCAAAGAACATAGCACATTCTTACCCAGGTAAATAGATCTGACTTCTGAAGGTGAAAATAATAAATTCACCATTTAGATGCTATAGTTCTTGATTCATATATCCTTTCAAAATATGGAACCCAATAGTCATTTTGAAATTTTGAACGATCTTAAAAGACTCGGAAACTATCAATGAGCATAGTTGAGGCTCTATTCCATGTATATGTGTATAACTGATATTCTTGCTGAGTGTAACTCAATTCCTTAGAATAGTATATGTTCTCATACGCCTTCTTGAAGCTCAAGTGAATTGTAACATGTTCAGC

mRNA sequence

ATGAACGACGAGACCCGACGGAGAATCGAGGAAAACGTGATTGAGGTATTGAAGAAATCGAGCATGGAAGATACGACGGAGTTTAAAGTTAGAAGCCAGGTCGAAGAACGGCTTGGAATCGATCTTTCTAATAAACAATGCAAGTTGCTGGTGAGGAACGTGGTAGAGAGCTTTTTGCTTTCAATGTCGGAGCGGGTGTGTATGGGGAAAGAGGATGAGCCAGGACCTAGTGTTCGTTACGAGAATAAAGCGGTGGAGCAGAAGATAGTACCAAAGAAGGAGTTCAACGATGATGGAGACCTTTTGATTTGCCGGCTATCTAATAACAGGAGTGTGACAATTCATAAATTTAAAGGGGCACCTATGGTATCAGTTAGGCAGTATTATGAAAAAGATGGAAAACAGCTTCCAACTCTAAAAGGAATCAGCATGCCAACTGAGCAATGGTCAGTCTTTAAGAGTAATATTCCTGCTATAGCGGAAGCTATTTTGCAGATGAAAAGAAATAAAAGATCTGAACATGATGCTGAAAAAATTGGTGCCTTCTCAAATCCAACAACTAGAGTAACTTCTCCAAAATATCCAATTGAAACTATTCGATTTGATGGAAAAAACTACAATGCATGGGCACATCAGATGGAGCTTTTGCTGCAGGACTTAAAGATTGCTTATGTTCTTTCTAATCAATGTCCGACTGCCGTGCTTGGGGAAGAATCAAGCTCTGGAAATGCTGCGCAATCCAAGGCTGCTGAACAGAAATGGATGAGGGATGACCACATGTGTCGCCGCAACATTCTTAACTCCCTCTCCGACAGGCTTTTTAATGAATATTCGAAAAAAACAATGAGTGCTAGTGAACTTTGGAAGGAGCTAAAATTGCTTTATCTTTTGGAGGAGTTTGGGACCAAGAGATCTCAAGTCAAAAAGTATCTAGAATTCAAAATGGTTGAGGAGAAGTCAATATTGGAGCAAGTTGAAGAACTTAATCACATTGCTGATTCTATTGGTTCTTCTGGAACGGTTATCGATGAGGATTTTCATGTTAGCGCAATTATCTCAAAGCTTCCACTTTCTTGGAAGAATGTCTGGGTGAACTTAATGCATGAGCAGTATCTTCCCCTTCGGAAATTGACAGATCGATTGAGAATTGAAGAACAATTACGTACACAAAAAAATTCACGTCTCTCAGGAGTGTCTTCTTCTCCTACTCCAAGAGGCCAACATCATGCTGCAAATCACCCATCAAAGATGGGAGACCCAAAGCCTGTAACCGTACCACTGAGAAAAAAGGAATGTCAAAAGGAGGTCAAAACTTTACTCTGCTTGGATTGCGGCAAGGAAGGGCACACATCTCCAAATTGTCCAACTAAGAAAGTCAACAATGAAGTACCTCGGCAAAGAACATAG

Coding sequence (CDS)

ATGAACGACGAGACCCGACGGAGAATCGAGGAAAACGTGATTGAGGTATTGAAGAAATCGAGCATGGAAGATACGACGGAGTTTAAAGTTAGAAGCCAGGTCGAAGAACGGCTTGGAATCGATCTTTCTAATAAACAATGCAAGTTGCTGGTGAGGAACGTGGTAGAGAGCTTTTTGCTTTCAATGTCGGAGCGGGTGTGTATGGGGAAAGAGGATGAGCCAGGACCTAGTGTTCGTTACGAGAATAAAGCGGTGGAGCAGAAGATAGTACCAAAGAAGGAGTTCAACGATGATGGAGACCTTTTGATTTGCCGGCTATCTAATAACAGGAGTGTGACAATTCATAAATTTAAAGGGGCACCTATGGTATCAGTTAGGCAGTATTATGAAAAAGATGGAAAACAGCTTCCAACTCTAAAAGGAATCAGCATGCCAACTGAGCAATGGTCAGTCTTTAAGAGTAATATTCCTGCTATAGCGGAAGCTATTTTGCAGATGAAAAGAAATAAAAGATCTGAACATGATGCTGAAAAAATTGGTGCCTTCTCAAATCCAACAACTAGAGTAACTTCTCCAAAATATCCAATTGAAACTATTCGATTTGATGGAAAAAACTACAATGCATGGGCACATCAGATGGAGCTTTTGCTGCAGGACTTAAAGATTGCTTATGTTCTTTCTAATCAATGTCCGACTGCCGTGCTTGGGGAAGAATCAAGCTCTGGAAATGCTGCGCAATCCAAGGCTGCTGAACAGAAATGGATGAGGGATGACCACATGTGTCGCCGCAACATTCTTAACTCCCTCTCCGACAGGCTTTTTAATGAATATTCGAAAAAAACAATGAGTGCTAGTGAACTTTGGAAGGAGCTAAAATTGCTTTATCTTTTGGAGGAGTTTGGGACCAAGAGATCTCAAGTCAAAAAGTATCTAGAATTCAAAATGGTTGAGGAGAAGTCAATATTGGAGCAAGTTGAAGAACTTAATCACATTGCTGATTCTATTGGTTCTTCTGGAACGGTTATCGATGAGGATTTTCATGTTAGCGCAATTATCTCAAAGCTTCCACTTTCTTGGAAGAATGTCTGGGTGAACTTAATGCATGAGCAGTATCTTCCCCTTCGGAAATTGACAGATCGATTGAGAATTGAAGAACAATTACGTACACAAAAAAATTCACGTCTCTCAGGAGTGTCTTCTTCTCCTACTCCAAGAGGCCAACATCATGCTGCAAATCACCCATCAAAGATGGGAGACCCAAAGCCTGTAACCGTACCACTGAGAAAAAAGGAATGTCAAAAGGAGGTCAAAACTTTACTCTGCTTGGATTGCGGCAAGGAAGGGCACACATCTCCAAATTGTCCAACTAAGAAAGTCAACAATGAAGTACCTCGGCAAAGAACATAG

Protein sequence

MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLLSMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGAPMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIGAFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESSSGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEFGTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDPKPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQRT*
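The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: it assumes the standard nuclear code applies, translates just the first eight and last four codons of the CDS shown above, and uses a hypothetical helper name `translate`.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First eight codons and last four codons of the CDS listed above.
print(translate("ATGAACGACGAGACCCGACGGAGA"))  # MNDETRRR
print(translate("CAAAGAACATAG"))              # QRT*
```

Both fragments agree with the start (`MNDETRRR...`) and end (`...QRT*`) of the protein sequence listed above.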
BLAST of Csa3G119510 vs. Swiss-Prot
Match: KELP_ARATH (RNA polymerase II transcriptional coactivator KELP OS=Arabidopsis thaliana GN=KELP PE=1 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 3.5e-37
Identity = 85/171 (49.71%), Positives = 109/171 (63.74%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFL- 60
           M  ET+ +IE+ VIE+L +S M++ TEFKVR    E+L IDLS K  K  VR+VVE FL 
Sbjct: 1   MEKETKEKIEKTVIEILSESDMKEITEFKVRKLASEKLAIDLSEKSHKAFVRSVVEKFLD 60

Query: 61  ---LSMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHK 120
                  E   + KE+E G     +           KEF+DDGDL+ICRLS+ R VTI +
Sbjct: 61  EERAREYENSQVNKEEEDGDKDCGKGN---------KEFDDDGDLIICRLSDKRRVTIQE 120

Query: 121 FKGAPMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMK 168
           FKG  +VS+R+YY+KDGK+LPT KGIS+  EQWS FK N+PAI  A+ +M+
Sbjct: 121 FKGKSLVSIREYYKKDGKELPTSKGISLTDEQWSTFKKNMPAIENAVKKME 162
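The percentages on an HSP summary line are the matched counts divided by the alignment length. As a small sketch of that arithmetic (the helper name `hsp_percentages` is hypothetical, and the regular expression is an assumption about this report's layout that accepts either the "Positives" or the "Postives" spelling):

```python
import re

# Matches e.g. "Identity = 85/171 (49.71%), Positives = 109/171 (63.74%)".
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, len_a, positive, len_b = map(int, m.groups())
    return round(100 * identical / len_a, 2), round(100 * positive / len_b, 2)

line = "Identity = 85/171 (49.71%), Positives = 109/171 (63.74%), Query Frame = 1"
print(hsp_percentages(line))  # (49.71, 63.74)
```

Here 85 identical and 109 similar positions over a 171-column alignment give 49.71% and 63.74%, matching the reported values.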

BLAST of Csa3G119510 vs. TrEMBL
Match: A0A0A0L3U5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G119510 PE=4 SV=1)

HSP 1 Score: 939.1 bits (2426), Expect = 2.1e-270
Identity = 468/468 (100.00%), Positives = 468/468 (100.00%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL
Sbjct: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120
           SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA
Sbjct: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120

Query: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180
           PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG
Sbjct: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180

Query: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240
           AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS
Sbjct: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240

Query: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300
           SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF
Sbjct: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300

Query: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360
           GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK
Sbjct: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360

Query: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420
           NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP
Sbjct: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420

Query: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQRT 469
           KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQRT
Sbjct: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQRT 468

BLAST of Csa3G119510 vs. TrEMBL
Match: A0A061DUH2_THECC (Zinc knuckle family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_005132 PE=4 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 1.2e-116
Identity = 226/463 (48.81%), Positives = 310/463 (66.95%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           M  ETR++IEE V E+L K+ ME+ TEFKVR    ERLGIDLS+   K  VR V+ESFLL
Sbjct: 1   MEKETRQKIEETVREILSKADMEEMTEFKVRVAASERLGIDLSDFNHKKFVREVIESFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120
           S  E    G  +E    +R E    E KI  KKE + DGD LIC+L++ R+V +H+F+G 
Sbjct: 61  STVEE--NGDVEELNSKLREE----EAKIKIKKEIDGDGDRLICKLADKRNVVVHEFRGK 120

Query: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180
             VS+R++Y KDGK+LP+ +G+S+ +E WS  K++ PAI  A+ +M+    ++ D E+ G
Sbjct: 121 TYVSIREFYVKDGKELPSARGVSLTSEIWSALKNSFPAIDAAVKKMQSKLSTKLDGEQNG 180

Query: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240
             SN  T  +    PIET RFDGKNY+ WA QMEL L+ L+IAYVL++ CP+  L  E+S
Sbjct: 181 DVSNSVTAFSHEFSPIETTRFDGKNYHCWAEQMELFLKQLQIAYVLTDPCPSLTLSPEAS 240

Query: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300
           S  +AQ+KA E+KWM DD++CR +IL+SLSD L+ ++SKKT SA ELW+ELKL+YL EEF
Sbjct: 241 SEESAQAKATEKKWMNDDYLCRHSILSSLSDNLYYQFSKKTKSAKELWEELKLVYLYEEF 300

Query: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360
           GTKRSQV+KY+EF++V+ + IL+Q++ELN IADSI ++G +IDE+FHVS IISKLP SWK
Sbjct: 301 GTKRSQVRKYIEFQIVDGRPILKQMQELNSIADSIVAAGMMIDENFHVSTIISKLPPSWK 360

Query: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420
           +  V LM E+YLP R L D +R+EE+ R +          S  P     A N   ++ D 
Sbjct: 361 DFCVKLMREEYLPFRMLMDHIRVEEESRNRVKQAEHSKYESFYP-----ANNLGPRIRDM 420

Query: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEV 464
           K   VP +++E +      +C  CG++GH S  C  ++   EV
Sbjct: 421 KKPGVPWKRRESEMHGSPPICNYCGRKGHLSKFCRNRRCEKEV 452

BLAST of Csa3G119510 vs. TrEMBL
Match: A0A061DTK4_THECC (Zinc knuckle family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_005132 PE=4 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 1.2e-116
Identity = 226/463 (48.81%), Positives = 310/463 (66.95%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           M  ETR++IEE V E+L K+ ME+ TEFKVR    ERLGIDLS+   K  VR V+ESFLL
Sbjct: 1   MEKETRQKIEETVREILSKADMEEMTEFKVRVAASERLGIDLSDFNHKKFVREVIESFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120
           S  E    G  +E    +R E    E KI  KKE + DGD LIC+L++ R+V +H+F+G 
Sbjct: 61  STVEE--NGDVEELNSKLREE----EAKIKIKKEIDGDGDRLICKLADKRNVVVHEFRGK 120

Query: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180
             VS+R++Y KDGK+LP+ +G+S+ +E WS  K++ PAI  A+ +M+    ++ D E+ G
Sbjct: 121 TYVSIREFYVKDGKELPSARGVSLTSEIWSALKNSFPAIDAAVKKMQSKLSTKLDGEQNG 180

Query: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240
             SN  T  +    PIET RFDGKNY+ WA QMEL L+ L+IAYVL++ CP+  L  E+S
Sbjct: 181 DVSNSVTAFSHEFSPIETTRFDGKNYHCWAEQMELFLKQLQIAYVLTDPCPSLTLSPEAS 240

Query: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300
           S  +AQ+KA E+KWM DD++CR +IL+SLSD L+ ++SKKT SA ELW+ELKL+YL EEF
Sbjct: 241 SEESAQAKATEKKWMNDDYLCRHSILSSLSDNLYYQFSKKTKSAKELWEELKLVYLYEEF 300

Query: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360
           GTKRSQV+KY+EF++V+ + IL+Q++ELN IADSI ++G +IDE+FHVS IISKLP SWK
Sbjct: 301 GTKRSQVRKYIEFQIVDGRPILKQMQELNSIADSIVAAGMMIDENFHVSTIISKLPPSWK 360

Query: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420
           +  V LM E+YLP R L D +R+EE+ R +          S  P     A N   ++ D 
Sbjct: 361 DFCVKLMREEYLPFRMLMDHIRVEEESRNRVKQAEHSKYESFYP-----ANNLGPRIRDM 420

Query: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEV 464
           K   VP +++E +      +C  CG++GH S  C  ++   EV
Sbjct: 421 KKPGVPWKRRESEMHGSPPICNYCGRKGHLSKFCRNRRCEKEV 452

BLAST of Csa3G119510 vs. TrEMBL
Match: A0A151TXZ1_CAJCA (RNA polymerase II transcriptional coactivator KELP (Fragment) OS=Cajanus cajan GN=KK1_011225 PE=4 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 1.0e-112
Identity = 230/491 (46.84%), Positives = 316/491 (64.36%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           M  E+RR++E+ V+++LKKS++E+ TEF +R    ERLGIDLS+   K LVRN+VES+L+
Sbjct: 2   MEAESRRKVEDMVLDILKKSNIEEATEFSIRVAASERLGIDLSHSTGKQLVRNIVESYLV 61

Query: 61  SM----SERVCMGKEDEPGPSVRYENKAVEQK---IVPKKEFND-------DGDLLICRL 120
           S+      +    KED P       N  V++K   +VP KE          D + +IC+L
Sbjct: 62  SVVANEKSKNTEKKEDIPA------NDDVQKKHDVVVPAKEGTQVARVTRKDSERVICQL 121

Query: 121 SNNRSVTIHKFKGAPMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQM 180
           SN R+V +  FKG  +VS+R++Y KDGK LP  KGIS+ +EQWS FK ++PAI EAI +M
Sbjct: 122 SNRRNVAVKAFKGTTLVSIREFYVKDGKLLPGSKGISLSSEQWSTFKKSVPAIEEAITKM 181

Query: 181 KRNKRSEHDAEKIGAFSNPT----------TRVTSPKYPIETIRFDGKNYNAWAHQMELL 240
           +   R EH+ ++ G  SN            T V   + PIE IRFDGKNY  WA QMELL
Sbjct: 182 EGRIRLEHNGKQNGDASNAVVDAAPVEPVPTEVVHFEVPIEVIRFDGKNYQFWAQQMELL 241

Query: 241 LQDLKIAYVLSNQCPTAVLGEESSSGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNE 300
           L+ LKI YVL+  CP   + E +S+   A +KAAE++W+ DD MC RNILN LSD LFN 
Sbjct: 242 LKQLKIEYVLTEPCPNLTVEEGASAEKIAAAKAAERRWLNDDLMCHRNILNHLSDHLFNL 301

Query: 301 YSKKTMSASELWKELKLLYLLEEFGTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIG 360
           ++ + MSA ELW+ELKL+YL EEFGTKRSQVKKY+EF+MV+EK+++EQ+ ELN IADSI 
Sbjct: 302 HANRKMSAKELWEELKLVYLYEEFGTKRSQVKKYIEFQMVDEKAVMEQIRELNGIADSIV 361

Query: 361 SSGTVIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLS 420
           ++G  +D++FHVS IISKLP SWK+  + LM E+YLP  KL +R++IEE+ R    S + 
Sbjct: 362 AAGMFLDDNFHVSLIISKLPPSWKDFSIKLMREEYLPFWKLMERIQIEEESR----SGVK 421

Query: 421 GVSSSPTPRGQHHAANHPSKMGDPKPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPT 468
            V       G H A     +  D +P+ +P  + E     K++ C  CGK+GH S NC  
Sbjct: 422 QVGEHYDSVGFHLANRGGQRRVDNRPLRMPRNRSEI--NTKSIPCSVCGKKGHLSKNC-- 478

BLAST of Csa3G119510 vs. TrEMBL
Match: M5XT05_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppb013833mg PE=4 SV=1)

HSP 1 Score: 407.1 bits (1045), Expect = 2.8e-110
Identity = 219/473 (46.30%), Positives = 307/473 (64.90%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           M+ E+RR+IE+ V+++L+K+++E+ TEFKVR    E+LGID S+ + K  VR+V+E FLL
Sbjct: 1   MDSESRRKIEDTVLDILRKTNLEEMTEFKVRKVTSEQLGIDFSDTEHKSFVRSVIERFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120
           S  E     +E+     +   N   E     KKE  +DG  +IC+LSN ++V I+ FK  
Sbjct: 61  SSPEPKVNAREE-----LMETNVQEEPGTRSKKEVTEDGHRVICKLSNRKTVVINDFKEK 120

Query: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180
             VS R++Y+KDGKQLPT KGIS+P+EQW+ FK ++PAI EA+ +M+   RSE D+++  
Sbjct: 121 TYVSFREFYQKDGKQLPTAKGISLPSEQWAAFKKSVPAIEEAVKKMESKIRSELDSKRTE 180

Query: 181 ------------------AFSNPTTRVTSPKY-PIETIRFDGKNYNAWAHQMELLLQDLK 240
                               SN    +   +   IET RFDGKNY  W  QMEL L+ LK
Sbjct: 181 NGKQTEDGKQTGDGVQTEIMSNSLNGIAPQQLVTIETSRFDGKNYPFWVEQMELQLKQLK 240

Query: 241 IAYVLSNQCPTAVLGEESSSGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKT 300
           IAYVL   CP+++LG E+SS   A SKAA++KW+ DD +CRR ILN+LSD LF  YSKKT
Sbjct: 241 IAYVLFEPCPSSMLGPEASSEEIAHSKAADRKWVNDDSVCRRGILNALSDDLFYLYSKKT 300

Query: 301 MSASELWKELKLLYLLEEFGTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTV 360
           M+A ELW++LKL+YL E+FGT R++VKKY+EF M+E KSI+EQVE  N +ADSI  SG +
Sbjct: 301 MTAKELWEDLKLIYLFEQFGTDRTRVKKYIEFVMLEGKSIVEQVENFNRLADSIVGSGMM 360

Query: 361 IDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSS 420
           I+E FHVS IISKLP SWK+V + LM E++LP   L +RLR+EE++R ++N        +
Sbjct: 361 IEEKFHVSVIISKLPPSWKDVCIKLMREEHLPFAMLMERLRVEEEMRVREN------QGA 420

Query: 421 PTPRGQHHAANHPSKMGDPKPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNC 455
           P       A  +  +  D KP ++  +++E +   K ++C  CGK+GH S +C
Sbjct: 421 PFNLVGDLARKYAPRQRDMKPRSMQWKRQELETNGK-VICQVCGKKGHISQHC 461

BLAST of Csa3G119510 vs. TAIR10
Match: AT4G00980.1 (AT4G00980.1 zinc knuckle (CCHC-type) family protein)

HSP 1 Score: 290.0 bits (741), Expect = 2.5e-78
Identity = 174/453 (38.41%), Positives = 260/453 (57.40%), Query Frame = 1

Query: 7   RRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLLSMSERV 66
           ++IEE V  +L +S M+  TEFK+R     +LGIDLS    K LVR+V+E FLLS     
Sbjct: 18  QKIEETVKSILSESDMDQMTEFKLRLDASAKLGIDLSGTNHKKLVRDVLEVFLLSTPGEA 77

Query: 67  CMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGAPMVSVR 126
            + +   P      +N+ V    V       + +  IC+LS  ++ T+ +++G P +S+ 
Sbjct: 78  LVPETVAPA-----KNETVS---VAAASVGGEDERFICKLSEKQNATVQRYRGQPFLSIG 137

Query: 127 QYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIGAFSNPT 186
              ++ GK     +G  + T QWSV K N  AI + I Q +   +SE  A + G  S   
Sbjct: 138 S--QEHGK---AFRGAHLSTNQWSVIKKNFAAIEDGIKQCQSKLKSE--AARNGDTSEAV 197

Query: 187 TRVTSPKYPIETI-RFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPT--AVLGEESSSGN 246
            + +S  + +  I RFDGK+Y  WA QMEL L+ LK+ YVLS  CP+  +  G E++   
Sbjct: 198 DKDSSHGFSVIKISRFDGKSYLYWASQMELFLKQLKLTYVLSEPCPSIGSSQGPETNPRE 257

Query: 247 AAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEFGTK 306
             ++ A  +KW+RDD++C  +++NSLSD L+  YS+K   A ELW ELK +Y  +E  +K
Sbjct: 258 ITRADATGKKWLRDDYLCYTHLMNSLSDHLYRRYSQKFKHAKELWDELKWVYQCDESKSK 317

Query: 307 RSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWKNVW 366
           RSQV+KY+EF+MVEE+ ILEQV+  N IADSI S+G  +DE FHVS IISK P SW+   
Sbjct: 318 RSQVRKYIEFRMVEERPILEQVQVFNKIADSIVSAGMFLDEAFHVSTIISKFPPSWRGFC 377

Query: 367 VNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDPK-P 426
             LM E+YLP+  L +R++ EE+L     +   GV+  P   G       PS     +  
Sbjct: 378 TRLMEEEYLPVWMLMERVKAEEEL---LRNGAKGVTYRPA-TGSSQMERTPSLGTTHRGS 437

Query: 427 VTVPLRKKECQKEVKTLL-CLDCGKEGHTSPNC 455
            +V  ++KE +++ + ++ C +CG++GH + +C
Sbjct: 438 QSVGWKRKEPERDERVIIVCDNCGRKGHLAKHC 451

BLAST of Csa3G119510 vs. TAIR10
Match: AT4G10920.1 (AT4G10920.1 transcriptional coactivator p15 (PC4) family protein (KELP))

HSP 1 Score: 157.5 bits (397), Expect = 2.0e-38
Identity = 85/171 (49.71%), Positives = 109/171 (63.74%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFL- 60
           M  ET+ +IE+ VIE+L +S M++ TEFKVR    E+L IDLS K  K  VR+VVE FL 
Sbjct: 1   MEKETKEKIEKTVIEILSESDMKEITEFKVRKLASEKLAIDLSEKSHKAFVRSVVEKFLD 60

Query: 61  ---LSMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHK 120
                  E   + KE+E G     +           KEF+DDGDL+ICRLS+ R VTI +
Sbjct: 61  EERAREYENSQVNKEEEDGDKDCGKGN---------KEFDDDGDLIICRLSDKRRVTIQE 120

Query: 121 FKGAPMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMK 168
           FKG  +VS+R+YY+KDGK+LPT KGIS+  EQWS FK N+PAI  A+ +M+
Sbjct: 121 FKGKSLVSIREYYKKDGKELPTSKGISLTDEQWSTFKKNMPAIENAVKKME 162

BLAST of Csa3G119510 vs. NCBI nr
Match: gi|449433026|ref|XP_004134299.1| (PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis sativus])

HSP 1 Score: 939.1 bits (2426), Expect = 3.0e-270
Identity = 468/468 (100.00%), Positives = 468/468 (100.00%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL
Sbjct: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120
           SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA
Sbjct: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120

Query: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180
           PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG
Sbjct: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180

Query: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240
           AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS
Sbjct: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240

Query: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300
           SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF
Sbjct: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300

Query: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360
           GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK
Sbjct: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360

Query: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420
           NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP
Sbjct: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420

Query: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQRT 469
           KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQRT
Sbjct: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQRT 468

BLAST of Csa3G119510 vs. NCBI nr
Match: gi|659074945|ref|XP_008437880.1| (PREDICTED: uncharacterized protein LOC103483179 [Cucumis melo])

HSP 1 Score: 853.6 bits (2204), Expect = 1.6e-244
Identity = 424/461 (91.97%), Positives = 440/461 (95.44%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           MNDETRR+IEENVIEVLK+S++EDTTEFKVRSQVEER+GIDLSNKQCKLLVRNVVESFLL
Sbjct: 1   MNDETRRKIEENVIEVLKQSNIEDTTEFKVRSQVEERIGIDLSNKQCKLLVRNVVESFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120
           SMSERVCMGKEDEPGPSVRYEN+AVEQKI+PKKEFNDDGDLLICRLSNNRSVTIHKFKG 
Sbjct: 61  SMSERVCMGKEDEPGPSVRYENRAVEQKIIPKKEFNDDGDLLICRLSNNRSVTIHKFKGE 120

Query: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180
            MVS+RQYY KDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDA+KIG
Sbjct: 121 RMVSIRQYYAKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDADKIG 180

Query: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240
           A SNPT RVT PK+PIETIRFDGKNY+AWAHQMELLLQDLKIAYVLSNQCPTAVLG ESS
Sbjct: 181 AISNPT-RVTYPKFPIETIRFDGKNYHAWAHQMELLLQDLKIAYVLSNQCPTAVLGAESS 240

Query: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300
           SGNAAQSK AEQKWM DDHMC RNILNSLSDRLFNEYSKK MSASELWKELKLLY LEEF
Sbjct: 241 SGNAAQSKVAEQKWMSDDHMCHRNILNSLSDRLFNEYSKKPMSASELWKELKLLYFLEEF 300

Query: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360
           GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGS+GT+IDEDFHVSAIISKLPLSWK
Sbjct: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSAGTIIDEDFHVSAIISKLPLSWK 360

Query: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420
           NVW++LM E YLPL KLTDRLRIEEQLRTQKNSRLS VS  P  RGQHHAANHPSKMGDP
Sbjct: 361 NVWMSLMQEHYLPLSKLTDRLRIEEQLRTQKNSRLSRVSIGPNTRGQHHAANHPSKMGDP 420

Query: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNN 462
            PVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKV+N
Sbjct: 421 MPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVDN 460

BLAST of Csa3G119510 vs. NCBI nr
Match: gi|1009142372|ref|XP_015888688.1| (PREDICTED: uncharacterized protein LOC107423619 [Ziziphus jujuba])

HSP 1 Score: 430.6 bits (1106), Expect = 3.4e-117
Identity = 236/477 (49.48%), Positives = 312/477 (65.41%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           M  E + +IEE V++VL+ + +E+ TEFKVR    ERLGIDLS K+C+  VRN+VE+FLL
Sbjct: 1   MKPEIKGKIEETVLDVLRNADLEEMTEFKVREAASERLGIDLSGKECRSFVRNLVENFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAV---EQKI-VPKKEFNDDGDLLICRLSNNRSVTIHK 120
           S ++      E +   SVR E K V   +Q++ V  K+  +DGD  IC LS  RSVT+  
Sbjct: 61  STAD------EAQQSQSVREETKEVVREQQEVRVVNKDVYEDGDRHICELSKKRSVTVQD 120

Query: 121 FKGAPMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDA 180
           F+G  MVS+R++Y KDGK++ T KGIS+P EQWS FK ++PAI EAI +M+   RS+ D 
Sbjct: 121 FRGKTMVSIREFYLKDGKRVHTAKGISLPAEQWSNFKKSVPAIEEAIRKMESRSRSKLDD 180

Query: 181 EKIGAFSNPTTRV----TSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPT 240
           +K    SNP T +    T P  P E +R DGKNY  WA  +EL+L+ L IAYVL   CP+
Sbjct: 181 KKTEDTSNPVTSLPPCETFP--PAEIVRLDGKNYQCWAQNIELMLKQLNIAYVLFEPCPS 240

Query: 241 AVLGEESSSGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELK 300
            +LG E+S+    ++KAAEQKWM DD MCR  IL+SLSD L N YSKK  +A ELW+ELK
Sbjct: 241 IMLGREASTEEITRAKAAEQKWMNDDRMCRHYILSSLSDYLLNLYSKKPTTAKELWEELK 300

Query: 301 LLYLLEEFGTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAII 360
           LL+L EEFGTKRSQVKKY+EF+MVEEK ILEQV+ELN IAD I ++G +I+E+FHVS II
Sbjct: 301 LLHLYEEFGTKRSQVKKYIEFQMVEEKPILEQVQELNCIADKIAAAGMLIEENFHVSVII 360

Query: 361 SKLPLSWKNVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAAN 420
           SKLPLSWK++ + LM E++LP   L  RL++EE+LR Q                      
Sbjct: 361 SKLPLSWKDISIKLMFEEHLPFWMLMRRLKVEEELRMQ---------------------- 420

Query: 421 HPSKMGDPKPVTVPLRKKEC--QKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQR 468
              K G P PV+  L  K     ++ +  +C  CGK+GH+   C +KKV+ E+  +R
Sbjct: 421 --DKQGMPNPVSNNLAGKAVPRMRDARPQVCYTCGKKGHSYRQCYSKKVDKEINGKR 445

BLAST of Csa3G119510 vs. NCBI nr
Match: gi|590721161|ref|XP_007051530.1| (Zinc knuckle family protein, putative isoform 2 [Theobroma cacao])

HSP 1 Score: 428.3 bits (1100), Expect = 1.7e-116
Identity = 226/463 (48.81%), Positives = 310/463 (66.95%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           M  ETR++IEE V E+L K+ ME+ TEFKVR    ERLGIDLS+   K  VR V+ESFLL
Sbjct: 1   MEKETRQKIEETVREILSKADMEEMTEFKVRVAASERLGIDLSDFNHKKFVREVIESFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120
           S  E    G  +E    +R E    E KI  KKE + DGD LIC+L++ R+V +H+F+G 
Sbjct: 61  STVEE--NGDVEELNSKLREE----EAKIKIKKEIDGDGDRLICKLADKRNVVVHEFRGK 120

Query: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180
             VS+R++Y KDGK+LP+ +G+S+ +E WS  K++ PAI  A+ +M+    ++ D E+ G
Sbjct: 121 TYVSIREFYVKDGKELPSARGVSLTSEIWSALKNSFPAIDAAVKKMQSKLSTKLDGEQNG 180

Query: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240
             SN  T  +    PIET RFDGKNY+ WA QMEL L+ L+IAYVL++ CP+  L  E+S
Sbjct: 181 DVSNSVTAFSHEFSPIETTRFDGKNYHCWAEQMELFLKQLQIAYVLTDPCPSLTLSPEAS 240

Query: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300
           S  +AQ+KA E+KWM DD++CR +IL+SLSD L+ ++SKKT SA ELW+ELKL+YL EEF
Sbjct: 241 SEESAQAKATEKKWMNDDYLCRHSILSSLSDNLYYQFSKKTKSAKELWEELKLVYLYEEF 300

Query: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360
           GTKRSQV+KY+EF++V+ + IL+Q++ELN IADSI ++G +IDE+FHVS IISKLP SWK
Sbjct: 301 GTKRSQVRKYIEFQIVDGRPILKQMQELNSIADSIVAAGMMIDENFHVSTIISKLPPSWK 360

Query: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420
           +  V LM E+YLP R L D +R+EE+ R +          S  P     A N   ++ D 
Sbjct: 361 DFCVKLMREEYLPFRMLMDHIRVEEESRNRVKQAEHSKYESFYP-----ANNLGPRIRDM 420

Query: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEV 464
           K   VP +++E +      +C  CG++GH S  C  ++   EV
Sbjct: 421 KKPGVPWKRRESEMHGSPPICNYCGRKGHLSKFCRNRRCEKEV 452

BLAST of Csa3G119510 vs. NCBI nr
Match: gi|590721157|ref|XP_007051529.1| (Zinc knuckle family protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 428.3 bits (1100), Expect = 1.7e-116
Identity = 226/463 (48.81%), Positives = 310/463 (66.95%), Query Frame = 1

Query: 1   MNDETRRRIEENVIEVLKKSSMEDTTEFKVRSQVEERLGIDLSNKQCKLLVRNVVESFLL 60
           M  ETR++IEE V E+L K+ ME+ TEFKVR    ERLGIDLS+   K  VR V+ESFLL
Sbjct: 1   MEKETRQKIEETVREILSKADMEEMTEFKVRVAASERLGIDLSDFNHKKFVREVIESFLL 60

Query: 61  SMSERVCMGKEDEPGPSVRYENKAVEQKIVPKKEFNDDGDLLICRLSNNRSVTIHKFKGA 120
           S  E    G  +E    +R E    E KI  KKE + DGD LIC+L++ R+V +H+F+G 
Sbjct: 61  STVEE--NGDVEELNSKLREE----EAKIKIKKEIDGDGDRLICKLADKRNVVVHEFRGK 120

Query: 121 PMVSVRQYYEKDGKQLPTLKGISMPTEQWSVFKSNIPAIAEAILQMKRNKRSEHDAEKIG 180
             VS+R++Y KDGK+LP+ +G+S+ +E WS  K++ PAI  A+ +M+    ++ D E+ G
Sbjct: 121 TYVSIREFYVKDGKELPSARGVSLTSEIWSALKNSFPAIDAAVKKMQSKLSTKLDGEQNG 180

Query: 181 AFSNPTTRVTSPKYPIETIRFDGKNYNAWAHQMELLLQDLKIAYVLSNQCPTAVLGEESS 240
             SN  T  +    PIET RFDGKNY+ WA QMEL L+ L+IAYVL++ CP+  L  E+S
Sbjct: 181 DVSNSVTAFSHEFSPIETTRFDGKNYHCWAEQMELFLKQLQIAYVLTDPCPSLTLSPEAS 240

Query: 241 SGNAAQSKAAEQKWMRDDHMCRRNILNSLSDRLFNEYSKKTMSASELWKELKLLYLLEEF 300
           S  +AQ+KA E+KWM DD++CR +IL+SLSD L+ ++SKKT SA ELW+ELKL+YL EEF
Sbjct: 241 SEESAQAKATEKKWMNDDYLCRHSILSSLSDNLYYQFSKKTKSAKELWEELKLVYLYEEF 300

Query: 301 GTKRSQVKKYLEFKMVEEKSILEQVEELNHIADSIGSSGTVIDEDFHVSAIISKLPLSWK 360
           GTKRSQV+KY+EF++V+ + IL+Q++ELN IADSI ++G +IDE+FHVS IISKLP SWK
Sbjct: 301 GTKRSQVRKYIEFQIVDGRPILKQMQELNSIADSIVAAGMMIDENFHVSTIISKLPPSWK 360

Query: 361 NVWVNLMHEQYLPLRKLTDRLRIEEQLRTQKNSRLSGVSSSPTPRGQHHAANHPSKMGDP 420
           +  V LM E+YLP R L D +R+EE+ R +          S  P     A N   ++ D 
Sbjct: 361 DFCVKLMREEYLPFRMLMDHIRVEEESRNRVKQAEHSKYESFYP-----ANNLGPRIRDM 420

Query: 421 KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEV 464
           K   VP +++E +      +C  CG++GH S  C  ++   EV
Sbjct: 421 KKPGVPWKRRESEMHGSPPICNYCGRKGHLSKFCRNRRCEKEV 452

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name  E-value  Identity  Description
KELP_ARATH  3.5e-37  49.71     RNA polymerase II transcriptional coactivator KELP OS=Arabidopsis thaliana GN=KE...

vs. TrEMBL
Match Name        E-value   Identity  Description
A0A0A0L3U5_CUCSA  2.1e-270  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_3G119510 PE=4 SV=1
A0A061DUH2_THECC  1.2e-116  48.81     Zinc knuckle family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_005132...
A0A061DTK4_THECC  1.2e-116  48.81     Zinc knuckle family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_005132...
A0A151TXZ1_CAJCA  1.0e-112  46.84     RNA polymerase II transcriptional coactivator KELP (Fragment) OS=Cajanus cajan G...
M5XT05_PRUPE      2.8e-110  46.30     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppb013833mg PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity  Description
AT4G00980.1  2.5e-78  38.41     zinc knuckle (CCHC-type) family protein
AT4G10920.1  2.0e-38  49.71     transcriptional coactivator p15 (PC4) family protein (KELP)

vs. NCBI nr
Match Name                         E-value   Identity  Description
gi|449433026|ref|XP_004134299.1|   3.0e-270  100.00    PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis ...
gi|659074945|ref|XP_008437880.1|   1.6e-244  91.97     PREDICTED: uncharacterized protein LOC103483179 [Cucumis melo]
gi|1009142372|ref|XP_015888688.1|  3.4e-117  49.48     PREDICTED: uncharacterized protein LOC107423619 [Ziziphus jujuba]
gi|590721161|ref|XP_007051530.1|   1.7e-116  48.81     Zinc knuckle family protein, putative isoform 2 [Theobroma cacao]
gi|590721157|ref|XP_007051529.1|   1.7e-116  48.81     Zinc knuckle family protein, putative isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR001878  Znf_CCHC
IPR003173  PC4
IPR009044  ssDNA-bd_transcriptional_reg
IPR009057  Homeobox-like_sf
IPR014876  DEK_C

Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO:0008270  zinc ion binding
GO:0003677  DNA binding
GO:0003713  transcription coactivator activity

Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0045893      positive regulation of transcription, DNA-templated
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003713      transcription coactivator activity
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0046872      metal ion binding

This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU133468      cucumber EST collection version 3.0  transcribed_cluster
CU154149      cucumber EST collection version 3.0  transcribed_cluster
CU166424      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa3G119510.1  Csa3G119510.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU166424      CU166424     transcribed_cluster
CU154149      CU154149     transcribed_cluster
CU133468      CU133468     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28

IPR Term   IPR Description                          Source   Source Term       Source Description                                  Alignment
IPR001878  Zinc finger, CCHC-type                   GENE3D   G3DSA:4.10.60.10                                                      coord: 433..458; score: 7.
IPR001878  Zinc finger, CCHC-type                   PROFILE  PS50158           ZF_CCHC                                             coord: 441..455; score: 9
IPR001878  Zinc finger, CCHC-type                   unknown  SSF57756          Retrovirus zinc finger-like domains                 coord: 433..458; score: 4.8
IPR003173  Transcriptional coactivator p15 (PC4)    PFAM     PF02229           PC4                                                 coord: 104..154; score: 2.9
IPR009044  ssDNA-binding transcriptional regulator  GENE3D   G3DSA:2.30.31.10                                                      coord: 103..163; score: 1.8
IPR009044  ssDNA-binding transcriptional regulator  unknown  SSF54447          ssDNA-binding transcriptional regulator domain      coord: 103..163; score: 8.63
IPR009057  Homeodomain-like                         GENE3D   G3DSA:1.10.10.60                                                      coord: 4..62; score: 7.
IPR014876  DEK, C-terminal                          PFAM     PF08766           DEK_C                                               coord: 5..59; score: 3.
None       No IPR available                         PANTHER  PTHR13215         RNA POLYMERASE II TRANSCRIPTIONAL COACTIVATOR       coord: 1..239; score: 5.0
None       No IPR available                         PANTHER  PTHR13215:SF6     RNA POLYMERASE II TRANSCRIPTIONAL COACTIVATOR KELP  coord: 1..239; score: 5.0
None       No IPR available                         PFAM     PF14223           Retrotran_gag_2                                     coord: 254..390; score: 6.6
None       No IPR available                         unknown  SSF109715         DEK C-terminal domain                               coord: 5..57; score: 5.4
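The `coord` ranges in the InterPro table index into the protein sequence, so each domain hit can be cut out and inspected. The following is a rough sketch, assuming the coordinates are 1-based and inclusive; it uses only the C-terminal residues 421 to 468 as they appear in the BLAST alignments above, and the helper name `slice_protein` is hypothetical. The regular expression is the textbook CCHC zinc-knuckle spacing (C-X2-C-X4-H-X4-C), used here for illustration and not the PROSITE PS50158 profile itself.

```python
import re

# Residues 421..468 of the protein, copied from the BLAST alignments above.
C_TERMINAL_TAIL = "KPVTVPLRKKECQKEVKTLLCLDCGKEGHTSPNCPTKKVNNEVPRQRT"
TAIL_START = 421  # 1-based position of the first residue in the fragment

def slice_protein(fragment, fragment_start, first, last):
    """Return residues first..last (ASSUMED 1-based, inclusive) of a fragment."""
    lo = first - fragment_start
    hi = last - fragment_start + 1
    if lo < 0 or hi > len(fragment) or lo >= hi:
        raise ValueError("requested range lies outside the fragment")
    return fragment[lo:hi]

# Textbook CCHC zinc-knuckle spacing; an illustrative pattern only.
CCHC_RE = re.compile(r"C.{2}C.{4}H.{4}C")

segment = slice_protein(C_TERMINAL_TAIL, TAIL_START, 441, 455)  # PS50158 hit
print(segment)                        # CLDCGKEGHTSPNCP
print(bool(CCHC_RE.search(segment)))  # True
```

The extracted segment contains the three cysteines and one histidine in the expected spacing, consistent with the Zinc finger, CCHC-type annotation for residues 441..455.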

The following gene(s) are paralogous to this gene:
Gene         Paralogue    Organism                    Block
Csa3G119510  Csa4G052710  Cucumber (Chinese Long) v2  cucuB093