Cp4.1LG17g09540 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG17g09540
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Myb/SANT-like DNA-binding domain protein
Location: Cp4.1LG17 : 7242980 .. 7245727 (-)
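
The location line packs the scaffold name, start and end coordinates, and strand into one string. A minimal Python sketch of splitting it apart, assuming the `scaffold : start .. end (strand)` layout shown above and assuming both coordinates are inclusive (as in GFF-style annotation); the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Span length, assuming both coordinates are inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG17 : 7242980 .. 7245727 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])
```

The `(-)` strand means the gene is read from the reverse strand of the scaffold, so the sequences listed for this feature run in the gene's own 5' to 3' direction rather than in scaffold order.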
The following sequences are available for this feature:

Gene sequence (with intron)

TTTCTTTTCTATATTTTTAATTTTTAAAGTAAAACCAAACCAATTTTGAAATAAAGAACCCAATCACGACGCCATTAAAGGCAAATCCCTCCCTTCTTCTTCTGCTTCTCTTTCTCTTTCTCTTTCTCTTTCTCTTTCTCTTTCTGCTCCCCTTTTCTACTTCTCTCTCTATATTACGGAGAGAGATGATCTGACAAACTGAATCCCAAACGCCGTAACGGACACTCCCTAAACATCCCCGTTTACGTCGTCCCAAAACCATTCTCCAATCTCCATTAAGTCACGTGCTCGTCTTCTTAACAATATTCTTATCATTTTTCTTTTTCTTTTTTTTTTTTTTTTTTCTCTCTTTATATTTATATGGCTTTGCAGCAGCTTAGCCTTGGCCCAACTCCCGTTGACGCCGTCACGNTTTTTTTTTTTTTTTGTCCTCTCTTTATATTTATATGGCTTTGCAGCAGCTTAGCCTTGGCCCAACTCCCGTTGACGCCGTCACGAATGGCCTTGACGCCCGCCCCATCTCATCTGACGGCGGAGACAACGGCAGCAAAACCCCCAGGCTCCCCCGTTGGACCCGCCAGGAGATTCTCGTCCTCATACAGGGAAAGAAGGTTGCTGAGACCAGGGTTCGCGGAGGCCGGGCTGCTACTCTCGCCTTTGGTTCCGCCCAGCTCGAGCCCAAGTGGGCCTCTGTCTCCTCCTACTGTAAGAGACATGGGGTTAATCGGGGTCCGGTTCAGTGCCGCAAGAGATGGAGCAATTTGGCTGGCGACTTCAAGAAGATCAAGGAGTGGGAGTCCCAAATTAGGGATGACTCTGAATCCTTTTGGCTCATGAGGAACGATTTGAGGCGCGAGAGGAAATTGCCAGGGTTTTTCGACAGAGAGGTTTATGATATCTTGGAATCCGGCTCGGCTCCTTCGCCGTCGCCGGCTCTGGCTCTCGCCCTCGCTCCTGCTCCTGCTCCTCCTCCGGTCCCCCCTCTCACGTCAGCCCTAAATTCCGACGACGCCGAGCCGGAGCACGTCTTCGATAGTTTTAAAACTGCGGCAGCGGACGACGGCTTGTTTTCCGATTTCGAGCAGGACGAAACCAGCCAGAGCCCCATCAAGGAGGTCCCGGGGAAGGAGGCTCCGGTCCCGACGCCCGACGGGGAAATCCCTGCTCCGGCACCTTTATCAGGTAATTGGAGAAGAAGGGTCAAATTCAAATCTCTCTCTATTTGTTTAATTCTCATTCTACAACCTGCGTGGTCTCCAAAACATCCGGTTTCAAAAAATTAAATGATAAATCTTCCTGGTTGTTTCGAGATTCTTGCAGCTTTATGTGGCAATTATTCTTTTCTCCTTCCACCCTTCTTAAAAAAACCATTTTATATGTTCTGTCGGCTTCGAAGAAGATTTGAAGCTCCATTTATTTCTAGTTTCAATTTCTTTCCCCTTGATTTACAGAGAAGCCGTACCAACCTGCTAGTCAAGGTTGTCCTGATCAAGGTAAGCTCAAAGTACAATTCGAATACATATTCATTCTGGGACTCTGTTGTAAACCCTTCCTTCAGACACAAGTATTTGTTCATAATCATTGTGGTCGTTTGATGGTTCAACCTTTCATTTAGAGGATGAATGAAAATCTCTGGTGCTGTTGCACGCCTATGTTCATTAGCTGCAAAGATTAGTTCTTTTTTGTCTTCTCAATTCCGGATGAATAATGAAGTCAATAAAGCACTATGTTCTTATATTGTTGATTTGATATGTTTTATCTTGTGCGCTTCAAATTTTCAGTCAGAATCCCTTTTTTGGTACCCACTAAGCTTTTGTGTCTGTACGTTCGTGAATCCTATTGTTTTTACCCTTCCCCCTCCCCCTCCCCCTCCTTTCCACTACGAGCGTCGCAGTTTCAAAATTCAGCTGGCTCTGCGAATCATGTTTTAGCTGTAATCTGTAAGTGTGGTCAAAGTTTGAGTGTCCGTTCTGTGAAGGCTCAAAAACTTGGATTAAAAT
CTTGTTAGAGGTGTTGTTTAATTGGGTTTGGCAAAGCTTCAGAGTCCAACGTTATGAAGACGGATGCCTATTTTATTTTTTATTTTTTTTTTTGTTGGTGTTCTGAATCCCCAAATTGTTGGGAAGAGTGGAGACAATAATAACTGATTGTGAACCCAATCTGGTGTGCAACAGGCACGTCAAATGAGAGAGAAGCAGCGGCAAATCCGGAGATAGGATCGAGTTTGCAAGAGGGACGGAAGCGGAAGCGCATTGCAAAAGAGGGGGAGGAGGAAAGTATGATGATGCAGGATGAGTTGATTGGCATCCTGGAAAAGAATGGGAAACTGCTGACTGCCCAACTTGAAGCTCAAAACATGAACTTCCAGCTGGACCGGGAGCAGCGAAGATACCACGCAGATGGGTTAGTGACGGTTCTGAACAAGCTCGCAGATGCTCTAGGCAGGATCGCGGACAAGCTGTAGAATAAGAAGCAGGCAGTGCAAATGTGTACATAAGCAACAACGTTTTTGTGTCTTTTTATGACGACGACCAAATTAGTGGAGAGTAGAATTTACATGGATTTCCCCATACACAGATAAGAGGTTCATATTTGACCCCATTTCATTTTCTGCTATATTTGTAAATCTTTATTATTATTTTATTTTCATTAACACGCATTTGCCTCGTTCCATTTGTTGTTTACTCGTTTTTGGAATGCTTTTTAGTAAAGGATTCAAATACGTATTATTGATATTGAGATATAAAA

mRNA sequence

TTTCTTTTCTATATTTTTAATTTTTAAAGTAAAACCAAACCAATTTTGAAATAAAGAACCCAATCACGACGCCATTAAAGGCAAATCCCTCCCTTCTTCTTCTGCTTCTCTTTCTCTTTCTCTTTCTCTTTCTCTTTCTCTTTCTGCTCCCCTTTTCTACTTCTCTCTCTATATTACGGAGAGAGATGATCTGACAAACTGAATCCCAAACGCCGTAACGGACACTCCCTAAACATCCCCGTTTACGTCGTCCCAAAACCATTCTCCAATCTCCATTAAGTCACGTGCTCGTCTTCTTAACAATATTCTTATCATTTTTCTTTTTCTTTTTTTTTTTTTTTTTTCTCTCTTTATATTTATATGGCTTTGCAGCAGCTTAGCCTTGGCCCAACTCCCGTTGACGCCGTCACGNTTTTTTTTTTTTTTTGTCCTCTCTTTATATTTATATGGCTTTGCAGCAGCTTAGCCTTGGCCCAACTCCCGTTGACGCCGTCACGAATGGCCTTGACGCCCGCCCCATCTCATCTGACGGCGGAGACAACGGCAGCAAAACCCCCAGGCTCCCCCGTTGGACCCGCCAGGAGATTCTCGTCCTCATACAGGGAAAGAAGGTTGCTGAGACCAGGGTTCGCGGAGGCCGGGCTGCTACTCTCGCCTTTGGTTCCGCCCAGCTCGAGCCCAAGTGGGCCTCTGTCTCCTCCTACTGTAAGAGACATGGGGTTAATCGGGGTCCGGTTCAGTGCCGCAAGAGATGGAGCAATTTGGCTGGCGACTTCAAGAAGATCAAGGAGTGGGAGTCCCAAATTAGGGATGACTCTGAATCCTTTTGGCTCATGAGGAACGATTTGAGGCGCGAGAGGAAATTGCCAGGGTTTTTCGACAGAGAGGTTTATGATATCTTGGAATCCGGCTCGGCTCCTTCGCCGTCGCCGGCTCTGGCTCTCGCCCTCGCTCCTGCTCCTGCTCCTCCTCCGGTCCCCCCTCTCACGTCAGCCCTAAATTCCGACGACGCCGAGCCGGAGCACGTCTTCGATAGTTTTAAAACTGCGGCAGCGGACGACGGCTTGTTTTCCGATTTCGAGCAGGACGAAACCAGCCAGAGCCCCATCAAGGAGGTCCCGGGGAAGGAGGCTCCGGTCCCGACGCCCGACGGGGAAATCCCTGCTCCGGCACCTTTATCAGAGAAGCCGTACCAACCTGCTAGTCAAGGTTGTCCTGATCAAGGCACGTCAAATGAGAGAGAAGCAGCGGCAAATCCGGAGATAGGATCGAGTTTGCAAGAGGGACGGAAGCGGAAGCGCATTGCAAAAGAGGGGGAGGAGGAAAGTATGATGATGCAGGATGAGTTGATTGGCATCCTGGAAAAGAATGGGAAACTGCTGACTGCCCAACTTGAAGCTCAAAACATGAACTTCCAGCTGGACCGGGAGCAGCGAAGATACCACGCAGATGGGTTAGTGACGGTTCTGAACAAGCTCGCAGATGCTCTAGGCAGGATCGCGGACAAGCTGTAGAATAAGAAGCAGGCAGTGCAAATGTGTACATAAGCAACAACGTTTTTGTGTCTTTTTATGACGACGACCAAATTAGTGGAGAGTAGAATTTACATGGATTTCCCCATACACAGATAAGAGGTTCATATTTGACCCCATTTCATTTTCTGCTATATTTGTAAATCTTTATTATTATTTTATTTTCATTAACACGCATTTGCCTCGTTCCATTTGTTGTTTACTCGTTTTTGGAATGCTTTTTAGTAAAGGATTCAAATACGTATTATTGATATTGAGATATAAAA

Coding sequence (CDS)

ATGGCTTTGCAGCAGCTTAGCCTTGGCCCAACTCCCGTTGACGCCGTCACGAATGGCCTTGACGCCCGCCCCATCTCATCTGACGGCGGAGACAACGGCAGCAAAACCCCCAGGCTCCCCCGTTGGACCCGCCAGGAGATTCTCGTCCTCATACAGGGAAAGAAGGTTGCTGAGACCAGGGTTCGCGGAGGCCGGGCTGCTACTCTCGCCTTTGGTTCCGCCCAGCTCGAGCCCAAGTGGGCCTCTGTCTCCTCCTACTGTAAGAGACATGGGGTTAATCGGGGTCCGGTTCAGTGCCGCAAGAGATGGAGCAATTTGGCTGGCGACTTCAAGAAGATCAAGGAGTGGGAGTCCCAAATTAGGGATGACTCTGAATCCTTTTGGCTCATGAGGAACGATTTGAGGCGCGAGAGGAAATTGCCAGGGTTTTTCGACAGAGAGGTTTATGATATCTTGGAATCCGGCTCGGCTCCTTCGCCGTCGCCGGCTCTGGCTCTCGCCCTCGCTCCTGCTCCTGCTCCTCCTCCGGTCCCCCCTCTCACGTCAGCCCTAAATTCCGACGACGCCGAGCCGGAGCACGTCTTCGATAGTTTTAAAACTGCGGCAGCGGACGACGGCTTGTTTTCCGATTTCGAGCAGGACGAAACCAGCCAGAGCCCCATCAAGGAGGTCCCGGGGAAGGAGGCTCCGGTCCCGACGCCCGACGGGGAAATCCCTGCTCCGGCACCTTTATCAGAGAAGCCGTACCAACCTGCTAGTCAAGGTTGTCCTGATCAAGGCACGTCAAATGAGAGAGAAGCAGCGGCAAATCCGGAGATAGGATCGAGTTTGCAAGAGGGACGGAAGCGGAAGCGCATTGCAAAAGAGGGGGAGGAGGAAAGTATGATGATGCAGGATGAGTTGATTGGCATCCTGGAAAAGAATGGGAAACTGCTGACTGCCCAACTTGAAGCTCAAAACATGAACTTCCAGCTGGACCGGGAGCAGCGAAGATACCACGCAGATGGGTTAGTGACGGTTCTGAACAAGCTCGCAGATGCTCTAGGCAGGATCGCGGACAAGCTGTAG

Protein sequence

MALQQLSLGPTPVDAVTNGLDARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETRVRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIRDDSESFWLMRNDLRRERKLPGFFDREVYDILESGSAPSPSPALALALAPAPAPPPVPPLTSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPAPAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGSSLQEGRKRKRIAKEGEEESMMMQDELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIADKL
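
The protein sequence is the codon-by-codon translation of the CDS. A minimal sketch of that relationship on the first 24 bases of the CDS, using a deliberately partial codon table that covers only the codons in this illustration (a full analysis would use the complete standard genetic code); the names `CODONS` and `translate` are hypothetical.

```python
# Partial codon table: only the codons needed for this illustration.
CODONS = {
    "ATG": "M", "GCT": "A", "TTG": "L", "CAG": "Q",
    "CTT": "L", "AGC": "S", "TAG": "*",
}

def translate(dna):
    """Translate a DNA string codon by codon, stopping at a stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODONS[dna[i:i + 3]]  # KeyError for codons outside the partial table
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCTTTGCAGCAGCTTAGCCTT"  # first 24 bases of the CDS listed for this gene
print(translate(cds_start))
```

The result can be compared with the first eight residues of the protein sequence listed for this gene.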
BLAST of Cp4.1LG17g09540 vs. Swiss-Prot
Match: ASR3_ARATH (Trihelix transcription factor ASR3 OS=Arabidopsis thaliana GN=ASR3 PE=1 SV=1)

HSP 1 Score: 331.3 bits (848), Expect = 1.3e-89
Identity = 194/363 (53.44%), Positives = 241/363 (66.39%), Query Frame = 1

Query: 1   MALQQLSLGPTPVDAVTNGLDARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETR 60
           MAL+QL LG   V AV  G ++   S+DGGD+G KT RLPRWTRQEILVLIQGK+VAE R
Sbjct: 1   MALEQLGLG---VSAVDGGENSSAPSNDGGDDGVKTARLPRWTRQEILVLIQGKRVAENR 60

Query: 61  VRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI 120
           VR GRAA +A GS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGD+KKIKEWESQI
Sbjct: 61  VRRGRAAGMALGSGQMEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIKEWESQI 120

Query: 121 RDDSESFWLMRNDLRRERKLPGFFDREVYDILESGSAPSPSPALALALAPAPAPPPVPPL 180
           ++++ES+W+MRND+RRE+KLPGFFD+EVYDI++ G  P   P L+L LAPA         
Sbjct: 121 KEETESYWVMRNDVRREKKLPGFFDKEVYDIVDGGVIPPAVPVLSLGLAPA--------- 180

Query: 181 TSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPA 240
                                 +D+GL SD ++ E+ +  +   P  ++     D E   
Sbjct: 181 ----------------------SDEGLLSDLDRRESPEK-LNSTPVAKSVTDVIDKE--- 240

Query: 241 PAPLSEKPYQPASQGC-PDQGTSNEREA-AANPEIGSSLQEGRKRKRIA------KEGEE 300
                        + C  DQG   E++  AAN E GS+ QE RKRKR +      +E E 
Sbjct: 241 -----------KQEACVADQGRVKEKQPEAANVEGGSTSQEERKRKRTSFGEKEEEEEEG 300

Query: 301 ESMMMQDELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIA 356
           E+  MQ++LI ILE+NG+LL AQLE QN+N +LDREQR+ H D LV VLNKLADA+ +IA
Sbjct: 301 ETKKMQNQLIEILERNGQLLAAQLEVQNLNLKLDREQRKDHGDSLVAVLNKLADAVAKIA 314
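
The percentages in each HSP header are the identical (or positive-scoring) column counts divided by the alignment length, gaps included. A small sketch reproducing the figures reported for this hit; the function name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage with two decimals."""
    return round(100 * matches / aligned_length, 2)

# Counts taken from the HSP header of the ASR3_ARATH hit.
identity = percent(194, 363)
positives = percent(241, 363)
print(identity, positives)
```

Because the denominator is the alignment length rather than the query length, the same query can show different denominators (for example 356 versus 363) against different subjects.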

BLAST of Cp4.1LG17g09540 vs. TrEMBL
Match: A0A0A0KDF5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G091850 PE=4 SV=1)

HSP 1 Score: 578.6 bits (1490), Expect = 5.4e-162
Identity = 299/356 (83.99%), Positives = 323/356 (90.73%), Query Frame = 1

Query: 1   MALQQLSLGPTPVDAVTNGLDARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETR 60
           MALQQLSLGPTPVD VTNG+D RP+S+DGGD+GSKTPRLPRWTRQEILVLIQGKKVAETR
Sbjct: 1   MALQQLSLGPTPVDGVTNGVDVRPMSTDGGDDGSKTPRLPRWTRQEILVLIQGKKVAETR 60

Query: 61  VRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI 120
           VRGGRAA+LAFGS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI
Sbjct: 61  VRGGRAASLAFGSGQVEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI 120

Query: 121 RDDSESFWLMRNDLRRERKLPGFFDREVYDILESGSAPSPSPALALALAPAPAPPPVPPL 180
           R+D+ESFW+MRNDLRRERKLPGFFDREVYDIL+SGSAPSPSPALALAL P P P P P L
Sbjct: 121 REDTESFWVMRNDLRRERKLPGFFDREVYDILDSGSAPSPSPALALALTPLPIPVPPPAL 180

Query: 181 TSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPA 240
            S     DAEPEHVFDS KTAAADDGLFSDFEQDET +SP+KEV GK+ P PT DG IPA
Sbjct: 181 NSDDGKPDAEPEHVFDSSKTAAADDGLFSDFEQDETCRSPLKEVAGKDVPPPTADGGIPA 240

Query: 241 PAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGS-SLQEGRKRKRIAKEGEEESMMMQD 300
           P PLSEK Y+P    CPDQGT+NE+EAAANPEIGS S QEGRKRKR+A +G+EE+ ++QD
Sbjct: 241 PTPLSEKLYRPPGHDCPDQGTTNEKEAAANPEIGSTSSQEGRKRKRVALDGDEET-ILQD 300

Query: 301 ELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIADKL 356
           ELIGILEKNGKLLTAQLEAQNMNFQLDREQR++HADGLV VLNKLADALGRIADKL
Sbjct: 301 ELIGILEKNGKLLTAQLEAQNMNFQLDREQRKHHADGLVAVLNKLADALGRIADKL 355

BLAST of Cp4.1LG17g09540 vs. TrEMBL
Match: A0A067JH78_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23099 PE=4 SV=1)

HSP 1 Score: 427.9 bits (1099), Expect = 1.2e-116
Identity = 233/356 (65.45%), Positives = 273/356 (76.69%), Query Frame = 1

Query: 4   QQLSLGPTPVDAVT-NGLDARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETRVR 63
           QQL+L P  VD    NG+D R  S DGGD+GSK PRLPRWTRQEILVLIQGKKVAE RVR
Sbjct: 5   QQLNLTPITVDGEQINGVDTRLTSIDGGDDGSKAPRLPRWTRQEILVLIQGKKVAENRVR 64

Query: 64  GGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIRD 123
            GR A +AFGS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGD+KKIKEWESQIR+
Sbjct: 65  RGRTAGMAFGSGQVEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIKEWESQIRE 124

Query: 124 DSESFWLMRNDLRRERKLPGFFDREVYDILES-GSAPSPSPALALALAPAPAPPPVPPLT 183
           ++ESFW+MRNDLRRERKLPGFFDREVYDIL+  G   + +P LALAL PAP P       
Sbjct: 125 ETESFWVMRNDLRRERKLPGFFDREVYDILDGVGGVSATAPGLALALTPAPEP------- 184

Query: 184 SALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKE--VPGKEAPVPTPDGEIP 243
               +DDAE   +FDS ++AAA+DGLFSDFEQDE   SP KE  V  +  P+ T    + 
Sbjct: 185 ----ADDAEA--IFDSGRSAAAEDGLFSDFEQDEAGGSPEKEAAVAKEVPPIKTAAAGVA 244

Query: 244 APAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGSSLQEGRKRKRIAKEGEEESMMMQD 303
           AP P+SEK YQP+      QG +NE++ A+NPE+GS+  + RKRKR   + +EE+  + +
Sbjct: 245 APLPISEKQYQPSHLADQAQGGTNEKQPASNPEVGSASHDSRKRKRFTADVDEETANLHN 304

Query: 304 ELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIADKL 356
            L+G+LEKN K+LTAQLEAQN NFQLDREQR+ HAD LV VLNKLADALG+IADKL
Sbjct: 305 HLVGVLEKNSKMLTAQLEAQNNNFQLDREQRKDHADSLVAVLNKLADALGKIADKL 347

BLAST of Cp4.1LG17g09540 vs. TrEMBL
Match: V4SAN3_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10005329mg PE=4 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 3.8e-115
Identity = 235/363 (64.74%), Positives = 273/363 (75.21%), Query Frame = 1

Query: 1   MALQQLSLGPTPVDA----VTNGLDARPISS--DGGDNGSKTPRLPRWTRQEILVLIQGK 60
           MAL+QLSL  TPVD     V NG++ R  ++  DGGD+G K PRLPRWTRQEILVLIQGK
Sbjct: 1   MALEQLSLARTPVDGETDGVNNGVEQRTTTASIDGGDDGCKAPRLPRWTRQEILVLIQGK 60

Query: 61  KVAETRVRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIK 120
           +VAE RVR GRAA + FGS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIK
Sbjct: 61  RVAENRVRRGRAAGMGFGSGQIEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIK 120

Query: 121 EWESQIRDDSESFWLMRNDLRRERKLPGFFDREVYDILESGS--APSPSPALALALAPAP 180
           EWES ++D +ESFW+MRNDLRRERKLPGFFDREVYDIL+  +  A S SP L LALAPA 
Sbjct: 121 EWESHVKDGTESFWVMRNDLRRERKLPGFFDREVYDILDGAATVASSASPGLGLALAPA- 180

Query: 181 APPPVPPLTSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVP 240
                         +    E VFDS ++AAADDGLFSDFE +ET+ +P+K+    EA  P
Sbjct: 181 -------------EETTTDEAVFDSGRSAAADDGLFSDFEPEETTGTPVKDDAPAEA-AP 240

Query: 241 TPDGEIPAPAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGSSLQEGRKRKRIAKEGEE 300
                I A  P+ EK YQP  +GC  QGT+ E++ A  PEIGS+ Q+GRKRKR   +G+E
Sbjct: 241 AAAKPISATMPIPEKQYQPNLRGCHGQGTTTEKQPA--PEIGSTSQDGRKRKRFTVDGDE 300

Query: 301 ESMMMQDELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIA 356
           E   MQ +LI +LE+NGK+LTAQLEAQN +FQLDREQR+ HAD LV VLNKLADALGRIA
Sbjct: 301 EMSNMQYQLIDVLERNGKMLTAQLEAQNNSFQLDREQRKDHADSLVAVLNKLADALGRIA 346

BLAST of Cp4.1LG17g09540 vs. TrEMBL
Match: A0A067F3Y6_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g019057mg PE=4 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 3.8e-115
Identity = 235/363 (64.74%), Positives = 273/363 (75.21%), Query Frame = 1

Query: 1   MALQQLSLGPTPVDA----VTNGLDARPISS--DGGDNGSKTPRLPRWTRQEILVLIQGK 60
           MAL+QLSL  TPVD     V NG++ R  ++  DGGD+G K PRLPRWTRQEILVLIQGK
Sbjct: 1   MALEQLSLARTPVDGETDGVNNGVEQRTTTASIDGGDDGCKAPRLPRWTRQEILVLIQGK 60

Query: 61  KVAETRVRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIK 120
           +VAE RVR GRAA + FGS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIK
Sbjct: 61  RVAENRVRRGRAAGMGFGSGQIEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIK 120

Query: 121 EWESQIRDDSESFWLMRNDLRRERKLPGFFDREVYDILESGS--APSPSPALALALAPAP 180
           EWES ++D +ESFW+MRNDLRRERKLPGFFDREVYDIL+  +  A S SP L LALAPA 
Sbjct: 121 EWESHVKDGTESFWVMRNDLRRERKLPGFFDREVYDILDGAATVASSASPGLGLALAPA- 180

Query: 181 APPPVPPLTSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVP 240
                         +    E VFDS ++AAADDGLFSDFE +ET+ +P+K+    EA  P
Sbjct: 181 -------------EETTTDEAVFDSGRSAAADDGLFSDFEPEETTGTPVKDDAPAEA-AP 240

Query: 241 TPDGEIPAPAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGSSLQEGRKRKRIAKEGEE 300
                I A  P+ EK YQP  +GC  QGT+ E++ A  PEIGS+ Q+GRKRKR   +G+E
Sbjct: 241 AAAKPISATMPIPEKQYQPNLRGCHGQGTTTEKQPA--PEIGSTSQDGRKRKRFTVDGDE 300

Query: 301 ESMMMQDELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIA 356
           E   MQ +LI +LE+NGK+LTAQLEAQN +FQLDREQR+ HAD LV VLNKLADALGRIA
Sbjct: 301 EMSNMQYQLIDVLERNGKMLTAQLEAQNNSFQLDREQRKDHADSLVAVLNKLADALGRIA 346

BLAST of Cp4.1LG17g09540 vs. TrEMBL
Match: B9RFZ6_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1438230 PE=4 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 7.8e-113
Identity = 233/363 (64.19%), Positives = 276/363 (76.03%), Query Frame = 1

Query: 1   MALQQ-LSLGPTPVDA-VTNGLD-ARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVA 60
           MALQQ  +L    VD   TNG+D  RP S DG D+GSKTPRLPRWTRQEILVLIQGKKVA
Sbjct: 1   MALQQQFNLTSNTVDGDTTNGVDNTRPASIDGADDGSKTPRLPRWTRQEILVLIQGKKVA 60

Query: 61  ETRVRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWE 120
           E RVR GR A +AFGS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGD+KKIKEWE
Sbjct: 61  ENRVRRGRTAGMAFGSGQVEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIKEWE 120

Query: 121 SQIRDDSESFWLMRNDLRRERKLPGFFDREVYDILESG---SAPSPSPALALALAPAPAP 180
           + IR+++ESFW+MRNDLRRERKLPGFFDREV+DIL+     SA   +P LALALAPA   
Sbjct: 121 NHIREETESFWVMRNDLRRERKLPGFFDREVFDILDGAGGVSAAPATPGLALALAPA--- 180

Query: 181 PPVPPLTSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPV-PT 240
                      ++D+E   VFDS +TAAA+DGLFSDFEQ++   SP KE   +  P+   
Sbjct: 181 -----------TEDSEA--VFDSGRTAAAEDGLFSDFEQEDAGGSPEKEAVKEAPPIKAA 240

Query: 241 PDGEIPAPAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGSSLQEGRKRKRI-AKEGEE 300
             G I AP P+SEK YQPA +    QG +NE++  +NPE+GS L E RKRKR    +G+E
Sbjct: 241 ATGGIAAPVPISEKQYQPAVRTDQSQGATNEKQPPSNPEMGSGLHESRKRKRFGTTDGDE 300

Query: 301 ESMMMQDELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIA 356
           E+  +Q++LIG+LE+NG++LTAQLEAQN NFQLDREQR+  A+ LV VLNKLADALG+IA
Sbjct: 301 ETTTLQNQLIGVLERNGEMLTAQLEAQNTNFQLDREQRKDQANSLVAVLNKLADALGKIA 347

BLAST of Cp4.1LG17g09540 vs. TAIR10
Match: AT2G33550.1 (AT2G33550.1 Homeodomain-like superfamily protein)

HSP 1 Score: 331.3 bits (848), Expect = 7.5e-91
Identity = 194/363 (53.44%), Positives = 241/363 (66.39%), Query Frame = 1

Query: 1   MALQQLSLGPTPVDAVTNGLDARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETR 60
           MAL+QL LG   V AV  G ++   S+DGGD+G KT RLPRWTRQEILVLIQGK+VAE R
Sbjct: 1   MALEQLGLG---VSAVDGGENSSAPSNDGGDDGVKTARLPRWTRQEILVLIQGKRVAENR 60

Query: 61  VRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI 120
           VR GRAA +A GS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGD+KKIKEWESQI
Sbjct: 61  VRRGRAAGMALGSGQMEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIKEWESQI 120

Query: 121 RDDSESFWLMRNDLRRERKLPGFFDREVYDILESGSAPSPSPALALALAPAPAPPPVPPL 180
           ++++ES+W+MRND+RRE+KLPGFFD+EVYDI++ G  P   P L+L LAPA         
Sbjct: 121 KEETESYWVMRNDVRREKKLPGFFDKEVYDIVDGGVIPPAVPVLSLGLAPA--------- 180

Query: 181 TSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPA 240
                                 +D+GL SD ++ E+ +  +   P  ++     D E   
Sbjct: 181 ----------------------SDEGLLSDLDRRESPEK-LNSTPVAKSVTDVIDKE--- 240

Query: 241 PAPLSEKPYQPASQGC-PDQGTSNEREA-AANPEIGSSLQEGRKRKRIA------KEGEE 300
                        + C  DQG   E++  AAN E GS+ QE RKRKR +      +E E 
Sbjct: 241 -----------KQEACVADQGRVKEKQPEAANVEGGSTSQEERKRKRTSFGEKEEEEEEG 300

Query: 301 ESMMMQDELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIA 356
           E+  MQ++LI ILE+NG+LL AQLE QN+N +LDREQR+ H D LV VLNKLADA+ +IA
Sbjct: 301 ETKKMQNQLIEILERNGQLLAAQLEVQNLNLKLDREQRKDHGDSLVAVLNKLADAVAKIA 314

BLAST of Cp4.1LG17g09540 vs. TAIR10
Match: AT4G31270.1 (AT4G31270.1 sequence-specific DNA binding transcription factors)

HSP 1 Score: 73.2 bits (178), Expect = 3.7e-13
Identity = 40/128 (31.25%), Positives = 63/128 (49.22%), Query Frame = 1

Query: 30  GDNGSKTPR---LPRWTRQEILVLIQGKKVAETRVRGGRAATLAFGSAQLEPKWASVSSY 89
           G +GS+  R    P W  ++ LVL+      E           A  S Q   KW  ++  
Sbjct: 4   GTSGSRRTRSQVAPEWAVKDCLVLVNEIAAVEADCSN------ALSSFQ---KWTMITEN 63

Query: 90  CKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIRDDSESFWLMRNDLRRERKLPGFFDR 149
           C    V+R   QCR++W +L  D+ +IK+WESQ R    S+W + +D R+   LPG  D 
Sbjct: 64  CNALDVSRNLNQCRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPGDIDI 122

Query: 150 EVYDILES 155
           E+++ + +
Sbjct: 124 ELFEAINA 122

BLAST of Cp4.1LG17g09540 vs. TAIR10
Match: AT2G35640.1 (AT2G35640.1 Homeodomain-like superfamily protein)

HSP 1 Score: 72.0 bits (175), Expect = 8.2e-13
Identity = 54/156 (34.62%), Positives = 76/156 (48.72%), Query Frame = 1

Query: 21  DARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETRVRGGRAATLAFG-SAQLEPK 80
           DA P  S G     +  R   WT  E LVLI+ KK+ + R R  R+     G +   E +
Sbjct: 3   DADP--SSGEQIVMRECRKGNWTVSETLVLIEAKKMDDQR-RVRRSEKQPEGRNKPAELR 62

Query: 81  WASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWE-SQIRDD-----SESFWLMRND 140
           W  +  YC R G  R   QC  +W NL  D+KKI+E+E S++        S S+W M   
Sbjct: 63  WKWIEEYCWRRGCYRNQNQCNDKWDNLMRDYKKIREYERSRVESSFNTVTSSSYWKMDKT 122

Query: 141 LRRERKLPGFFDREVYDIL----ESGSAPSPSPALA 166
            R+E+ LP     ++YD+L    +  + PS S A A
Sbjct: 123 ERKEKNLPSNMLPQIYDVLSELVDRKTLPSSSSAAA 155

BLAST of Cp4.1LG17g09540 vs. TAIR10
Match: AT5G51800.1 (AT5G51800.1 Protein kinase superfamily protein)

HSP 1 Score: 52.0 bits (123), Expect = 8.8e-07
Identity = 45/159 (28.30%), Positives = 70/159 (44.03%), Query Frame = 1

Query: 40  PRWTRQEILVLIQGKKVA-ETRVRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQ 99
           P W   E+L L +  +   +T+  G  + ++         K   V+ Y  RHG+NR    
Sbjct: 149 PVWKPNEMLWLARAWRAQYQTQGTGSGSGSVEGRGKTRAEKDREVAEYLNRHGINRDSKI 208

Query: 100 CRKRWSNLAGDFKKIKEWESQIRDD--SESFWLMRNDLRRERKLPGFFDREVYDILESGS 159
              +W N+ G+F+K+ EWE     D   +S++ +    R++ +LP  FD EVY  L    
Sbjct: 209 AGTKWDNMLGEFRKVYEWEKCGDQDKYGKSYFRLSPYERKQHRLPASFDEEVYQELALFM 268

Query: 160 APS-PSPAL------ALALAPAPAPPPV----PPLTSAL 185
            P   +P +         +  A  PP V    PPL  AL
Sbjct: 269 GPRVRAPTINRGGGGGATVTVASTPPSVEALPPPLYPAL 307

BLAST of Cp4.1LG17g09540 vs. TAIR10
Match: AT1G31310.1 (AT1G31310.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 50.8 bits (120), Expect = 2.0e-06
Identity = 87/373 (23.32%), Positives = 138/373 (37.00%), Query Frame = 1

Query: 38  RLPRWTRQEILVLIQGKKVAETRVRGGRAATLAFGSAQ--------LEPKWASVSSYCKR 97
           R   WT  E +VLI+ K++ + R R  R+  L     Q         E +W  +  YC R
Sbjct: 15  RKGNWTLNETMVLIEAKRMDDER-RMRRSIGLPPPEQQQDIRSNKPAELRWKWIEDYCWR 74

Query: 98  HGVNRGPVQCRKRWSNLAGDFKKIKEWE----------------SQIRDDSESFWLMRND 157
            G  R   QC  +W NL  D+KK++E+E                S    ++ S+W M   
Sbjct: 75  KGCMRSQNQCNDKWDNLMRDYKKVREYERRRVESSITAGESSSSSAPAGETASYWKMEKS 134

Query: 158 LRRERKLPGFFDREVY----DILESGSAPSPSPALALALAPAPAPPPVPPLTSALNSDDA 217
            R+ER LP     + Y    +++ES + PS +   A+  A A A      ++S   S   
Sbjct: 135 ERKERSLPSNMLPQTYQALFEVVESKTLPSSTAVTAVTAAVAAA---AAAISSGNGSGGG 194

Query: 218 EPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPAPAPL----- 277
           + + V           G           Q P+  +P +  P P P   +P P  L     
Sbjct: 195 QIQKVIQQ------GLGFVVPKVHQIIQQQPVL-LPLQPPPPPPPSQPLPRPLLLPPPPP 254

Query: 278 ---SEKPYQPASQGCPDQGTSNEREAA-------------ANPEIGS-SLQEGRKRKRIA 337
                +P  P      D  TS   + +             A P  G   ++E  + KR  
Sbjct: 255 PSFHAQPILPTKDSSTDSDTSEYSDTSPAKRRRTMPTTTTAGPSGGGVDVEEVGRSKRDE 314

Query: 338 KEGEEESMMMQDELIGILEKNGKLLTAQLEAQNMNFQLDR--------EQRRYHADGLVT 353
           +     ++     +I    +  +    +   + MN Q  R        E  R   +GLV 
Sbjct: 315 ETTVAAALSRSVSVIANAIRESEERQDRRHKEVMNVQERRLKIEESNVEMNREGMNGLVE 374

BLAST of Cp4.1LG17g09540 vs. NCBI nr
Match: gi|449445594|ref|XP_004140557.1| (PREDICTED: uncharacterized protein LOC101222632 isoform X1 [Cucumis sativus])

HSP 1 Score: 578.6 bits (1490), Expect = 7.7e-162
Identity = 299/356 (83.99%), Positives = 323/356 (90.73%), Query Frame = 1

Query: 1   MALQQLSLGPTPVDAVTNGLDARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETR 60
           MALQQLSLGPTPVD VTNG+D RP+S+DGGD+GSKTPRLPRWTRQEILVLIQGKKVAETR
Sbjct: 1   MALQQLSLGPTPVDGVTNGVDVRPMSTDGGDDGSKTPRLPRWTRQEILVLIQGKKVAETR 60

Query: 61  VRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI 120
           VRGGRAA+LAFGS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI
Sbjct: 61  VRGGRAASLAFGSGQVEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI 120

Query: 121 RDDSESFWLMRNDLRRERKLPGFFDREVYDILESGSAPSPSPALALALAPAPAPPPVPPL 180
           R+D+ESFW+MRNDLRRERKLPGFFDREVYDIL+SGSAPSPSPALALAL P P P P P L
Sbjct: 121 REDTESFWVMRNDLRRERKLPGFFDREVYDILDSGSAPSPSPALALALTPLPIPVPPPAL 180

Query: 181 TSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPA 240
            S     DAEPEHVFDS KTAAADDGLFSDFEQDET +SP+KEV GK+ P PT DG IPA
Sbjct: 181 NSDDGKPDAEPEHVFDSSKTAAADDGLFSDFEQDETCRSPLKEVAGKDVPPPTADGGIPA 240

Query: 241 PAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGS-SLQEGRKRKRIAKEGEEESMMMQD 300
           P PLSEK Y+P    CPDQGT+NE+EAAANPEIGS S QEGRKRKR+A +G+EE+ ++QD
Sbjct: 241 PTPLSEKLYRPPGHDCPDQGTTNEKEAAANPEIGSTSSQEGRKRKRVALDGDEET-ILQD 300

Query: 301 ELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIADKL 356
           ELIGILEKNGKLLTAQLEAQNMNFQLDREQR++HADGLV VLNKLADALGRIADKL
Sbjct: 301 ELIGILEKNGKLLTAQLEAQNMNFQLDREQRKHHADGLVAVLNKLADALGRIADKL 355

BLAST of Cp4.1LG17g09540 vs. NCBI nr
Match: gi|659119916|ref|XP_008459912.1| (PREDICTED: uncharacterized protein LOC103498891 isoform X1 [Cucumis melo])

HSP 1 Score: 573.2 bits (1476), Expect = 3.2e-160
Identity = 297/356 (83.43%), Positives = 321/356 (90.17%), Query Frame = 1

Query: 1   MALQQLSLGPTPVDAVTNGLDARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETR 60
           MALQQLSLGPTPVD VTNG+D RP+S+DGGD+GSKTPRLPRWTRQEILVLIQGKKVAETR
Sbjct: 1   MALQQLSLGPTPVDGVTNGVDVRPMSTDGGDDGSKTPRLPRWTRQEILVLIQGKKVAETR 60

Query: 61  VRGGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI 120
           VRGGRAA+LAFGS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI
Sbjct: 61  VRGGRAASLAFGSGQVEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQI 120

Query: 121 RDDSESFWLMRNDLRRERKLPGFFDREVYDILESGSAPSPSPALALALAPAPAPPPVPPL 180
           R+D+ESFW+MRNDLRRERKLPGFFDREVYDIL+SGSAPSPSP LALAL P P P P   L
Sbjct: 121 REDTESFWVMRNDLRRERKLPGFFDREVYDILDSGSAPSPSPPLALALTPVPIPVPPHAL 180

Query: 181 TSALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPA 240
            S     DAEPEHVFDS KTAAADDGLFSDFEQDETS+SP+KEV GK+ P PT DG IPA
Sbjct: 181 NSDDGKPDAEPEHVFDSSKTAAADDGLFSDFEQDETSRSPLKEVAGKDVPPPTADGGIPA 240

Query: 241 PAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGS-SLQEGRKRKRIAKEGEEESMMMQD 300
           P PLSE  Y+P    CPDQGT+NE+EAAANPEIGS S QEGRKRKR+A +G+EE+ ++QD
Sbjct: 241 PTPLSETLYRPPGHDCPDQGTTNEKEAAANPEIGSTSSQEGRKRKRVALDGDEET-ILQD 300

Query: 301 ELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIADKL 356
           ELIGILEKNGKLLTAQLEAQNMNFQLDREQR++HADGLV VLNKLADALGRIADKL
Sbjct: 301 ELIGILEKNGKLLTAQLEAQNMNFQLDREQRKHHADGLVAVLNKLADALGRIADKL 355

BLAST of Cp4.1LG17g09540 vs. NCBI nr
Match: gi|778711738|ref|XP_011656788.1| (PREDICTED: uncharacterized protein LOC101222632 isoform X2 [Cucumis sativus])

HSP 1 Score: 536.6 bits (1381), Expect = 3.4e-149
Identity = 278/332 (83.73%), Positives = 301/332 (90.66%), Query Frame = 1

Query: 25  ISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETRVRGGRAATLAFGSAQLEPKWASVS 84
           +S+DGGD+GSKTPRLPRWTRQEILVLIQGKKVAETRVRGGRAA+LAFGS Q+EPKWASVS
Sbjct: 1   MSTDGGDDGSKTPRLPRWTRQEILVLIQGKKVAETRVRGGRAASLAFGSGQVEPKWASVS 60

Query: 85  SYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIRDDSESFWLMRNDLRRERKLPGFF 144
           SYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIR+D+ESFW+MRNDLRRERKLPGFF
Sbjct: 61  SYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIREDTESFWVMRNDLRRERKLPGFF 120

Query: 145 DREVYDILESGSAPSPSPALALALAPAPAPPPVPPLTSALNSDDAEPEHVFDSFKTAAAD 204
           DREVYDIL+SGSAPSPSPALALAL P P P P P L S     DAEPEHVFDS KTAAAD
Sbjct: 121 DREVYDILDSGSAPSPSPALALALTPLPIPVPPPALNSDDGKPDAEPEHVFDSSKTAAAD 180

Query: 205 DGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPAPAPLSEKPYQPASQGCPDQGTSNE 264
           DGLFSDFEQDET +SP+KEV GK+ P PT DG IPAP PLSEK Y+P    CPDQGT+NE
Sbjct: 181 DGLFSDFEQDETCRSPLKEVAGKDVPPPTADGGIPAPTPLSEKLYRPPGHDCPDQGTTNE 240

Query: 265 REAAANPEIGS-SLQEGRKRKRIAKEGEEESMMMQDELIGILEKNGKLLTAQLEAQNMNF 324
           +EAAANPEIGS S QEGRKRKR+A +G+EE+ ++QDELIGILEKNGKLLTAQLEAQNMNF
Sbjct: 241 KEAAANPEIGSTSSQEGRKRKRVALDGDEET-ILQDELIGILEKNGKLLTAQLEAQNMNF 300

Query: 325 QLDREQRRYHADGLVTVLNKLADALGRIADKL 356
           QLDREQR++HADGLV VLNKLADALGRIADKL
Sbjct: 301 QLDREQRKHHADGLVAVLNKLADALGRIADKL 331

BLAST of Cp4.1LG17g09540 vs. NCBI nr
Match: gi|659119918|ref|XP_008459913.1| (PREDICTED: uncharacterized protein LOC103498891 isoform X2 [Cucumis melo])

HSP 1 Score: 531.2 bits (1367), Expect = 1.4e-147
Identity = 276/332 (83.13%), Positives = 299/332 (90.06%), Query Frame = 1

Query: 25  ISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETRVRGGRAATLAFGSAQLEPKWASVS 84
           +S+DGGD+GSKTPRLPRWTRQEILVLIQGKKVAETRVRGGRAA+LAFGS Q+EPKWASVS
Sbjct: 1   MSTDGGDDGSKTPRLPRWTRQEILVLIQGKKVAETRVRGGRAASLAFGSGQVEPKWASVS 60

Query: 85  SYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIRDDSESFWLMRNDLRRERKLPGFF 144
           SYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIR+D+ESFW+MRNDLRRERKLPGFF
Sbjct: 61  SYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIREDTESFWVMRNDLRRERKLPGFF 120

Query: 145 DREVYDILESGSAPSPSPALALALAPAPAPPPVPPLTSALNSDDAEPEHVFDSFKTAAAD 204
           DREVYDIL+SGSAPSPSP LALAL P P P P   L S     DAEPEHVFDS KTAAAD
Sbjct: 121 DREVYDILDSGSAPSPSPPLALALTPVPIPVPPHALNSDDGKPDAEPEHVFDSSKTAAAD 180

Query: 205 DGLFSDFEQDETSQSPIKEVPGKEAPVPTPDGEIPAPAPLSEKPYQPASQGCPDQGTSNE 264
           DGLFSDFEQDETS+SP+KEV GK+ P PT DG IPAP PLSE  Y+P    CPDQGT+NE
Sbjct: 181 DGLFSDFEQDETSRSPLKEVAGKDVPPPTADGGIPAPTPLSETLYRPPGHDCPDQGTTNE 240

Query: 265 REAAANPEIGS-SLQEGRKRKRIAKEGEEESMMMQDELIGILEKNGKLLTAQLEAQNMNF 324
           +EAAANPEIGS S QEGRKRKR+A +G+EE+ ++QDELIGILEKNGKLLTAQLEAQNMNF
Sbjct: 241 KEAAANPEIGSTSSQEGRKRKRVALDGDEET-ILQDELIGILEKNGKLLTAQLEAQNMNF 300

Query: 325 QLDREQRRYHADGLVTVLNKLADALGRIADKL 356
           QLDREQR++HADGLV VLNKLADALGRIADKL
Sbjct: 301 QLDREQRKHHADGLVAVLNKLADALGRIADKL 331

BLAST of Cp4.1LG17g09540 vs. NCBI nr
Match: gi|802754529|ref|XP_012088721.1| (PREDICTED: uncharacterized protein LOC105647306 [Jatropha curcas])

HSP 1 Score: 427.9 bits (1099), Expect = 1.7e-116
Identity = 233/356 (65.45%), Positives = 273/356 (76.69%), Query Frame = 1

Query: 4   QQLSLGPTPVDAVT-NGLDARPISSDGGDNGSKTPRLPRWTRQEILVLIQGKKVAETRVR 63
           QQL+L P  VD    NG+D R  S DGGD+GSK PRLPRWTRQEILVLIQGKKVAE RVR
Sbjct: 5   QQLNLTPITVDGEQINGVDTRLTSIDGGDDGSKAPRLPRWTRQEILVLIQGKKVAENRVR 64

Query: 64  GGRAATLAFGSAQLEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESQIRD 123
            GR A +AFGS Q+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGD+KKIKEWESQIR+
Sbjct: 65  RGRTAGMAFGSGQVEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIKEWESQIRE 124

Query: 124 DSESFWLMRNDLRRERKLPGFFDREVYDILES-GSAPSPSPALALALAPAPAPPPVPPLT 183
           ++ESFW+MRNDLRRERKLPGFFDREVYDIL+  G   + +P LALAL PAP P       
Sbjct: 125 ETESFWVMRNDLRRERKLPGFFDREVYDILDGVGGVSATAPGLALALTPAPEP------- 184

Query: 184 SALNSDDAEPEHVFDSFKTAAADDGLFSDFEQDETSQSPIKE--VPGKEAPVPTPDGEIP 243
               +DDAE   +FDS ++AAA+DGLFSDFEQDE   SP KE  V  +  P+ T    + 
Sbjct: 185 ----ADDAEA--IFDSGRSAAAEDGLFSDFEQDEAGGSPEKEAAVAKEVPPIKTAAAGVA 244

Query: 244 APAPLSEKPYQPASQGCPDQGTSNEREAAANPEIGSSLQEGRKRKRIAKEGEEESMMMQD 303
           AP P+SEK YQP+      QG +NE++ A+NPE+GS+  + RKRKR   + +EE+  + +
Sbjct: 245 APLPISEKQYQPSHLADQAQGGTNEKQPASNPEVGSASHDSRKRKRFTADVDEETANLHN 304

Query: 304 ELIGILEKNGKLLTAQLEAQNMNFQLDREQRRYHADGLVTVLNKLADALGRIADKL 356
            L+G+LEKN K+LTAQLEAQN NFQLDREQR+ HAD LV VLNKLADALG+IADKL
Sbjct: 305 HLVGVLEKNSKMLTAQLEAQNNNFQLDREQRKDHADSLVAVLNKLADALGKIADKL 347

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
ASR3_ARATH | 1.3e-89 | 53.44 | Trihelix transcription factor ASR3 OS=Arabidopsis thaliana GN=ASR3 PE=1 SV=1

Match Name | E-value | Identity | Description
A0A0A0KDF5_CUCSA | 5.4e-162 | 83.99 | Uncharacterized protein OS=Cucumis sativus GN=Csa_6G091850 PE=4 SV=1
A0A067JH78_JATCU | 1.2e-116 | 65.45 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23099 PE=4 SV=1
V4SAN3_9ROSI | 3.8e-115 | 64.74 | Uncharacterized protein OS=Citrus clementina GN=CICLE_v10005329mg PE=4 SV=1
A0A067F3Y6_CITSI | 3.8e-115 | 64.74 | Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g019057mg PE=4 SV=1
B9RFZ6_RICCO | 7.8e-113 | 64.19 | Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1438230 PE=4 SV=1

Match Name | E-value | Identity | Description
AT2G33550.1 | 7.5e-91 | 53.44 | Homeodomain-like superfamily protein
AT4G31270.1 | 3.7e-13 | 31.25 | sequence-specific DNA binding transcription factors
AT2G35640.1 | 8.2e-13 | 34.62 | Homeodomain-like superfamily protein
AT5G51800.1 | 8.8e-07 | 28.30 | Protein kinase superfamily protein
AT1G31310.1 | 2.0e-06 | 23.32 | hydroxyproline-rich glycoprotein family protein

Match Name | E-value | Identity | Description
gi|449445594|ref|XP_004140557.1| | 7.7e-162 | 83.99 | PREDICTED: uncharacterized protein LOC101222632 isoform X1 [Cucumis sativus]
gi|659119916|ref|XP_008459912.1| | 3.2e-160 | 83.43 | PREDICTED: uncharacterized protein LOC103498891 isoform X1 [Cucumis melo]
gi|778711738|ref|XP_011656788.1| | 3.4e-149 | 83.73 | PREDICTED: uncharacterized protein LOC101222632 isoform X2 [Cucumis sativus]
gi|659119918|ref|XP_008459913.1| | 1.4e-147 | 83.13 | PREDICTED: uncharacterized protein LOC103498891 isoform X2 [Cucumis melo]
gi|802754529|ref|XP_012088721.1| | 1.7e-116 | 65.45 | PREDICTED: uncharacterized protein LOC105647306 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0003677 | DNA binding

Vocabulary: INTERPRO
Term | Definition
IPR017877 | Myb-like_dom
IPR009057 | Homeobox-like_sf

GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
biological_process | GO:0071219 | cellular response to molecule of bacterial origin
biological_process | GO:0050777 | negative regulation of immune response
biological_process | GO:0045892 | negative regulation of transcription, DNA-templated
cellular_component | GO:0005575 | cellular_component
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0042803 | protein homodimerization activity
molecular_function | GO:0003674 | molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG17g09540.1 | Cp4.1LG17g09540.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR009057 | Homeodomain-like | GENE3D | G3DSA:1.10.10.60 | | coord: 41..106, score: 5.
IPR017877 | Myb-like domain | PROFILE | PS50090 | MYB_LIKE | coord: 41..107, score:
None | No IPR available | PANTHER | PTHR21654 | FAMILY NOT NAMED | coord: 196..355, score: 3.5E-128; coord: 1..156, score: 3.5E
None | No IPR available | PANTHER | PTHR21654:SF10 | GT-2-RELATED PROTEIN | coord: 196..355, score: 3.5E-128; coord: 1..156, score: 3.5E
None | No IPR available | PFAM | PF13837 | Myb_DNA-bind_4 | coord: 41..130, score: 2.1