Cp4.1LG05g12030 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG05g12030
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Two-component response regulator
Location: Cp4.1LG05 : 8388262 .. 8390858 (+)
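The Location field can be read programmatically to recover the feature's span. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive (GFF-style), which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its four parts.

    Hypothetical helper; the field layout is taken from the Location line above.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Cp4.1LG05 : 8388262 .. 8390858 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 2597 bp on the plus strand of Cp4.1LG05.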
The following sequences are available for this feature:

Gene sequence (with intron)

CAAATACGCGAGGACACGAACATGAGTTTGATAACTCAGTCAAAACTCGGCGATGTAATGGAGCACTCGCCGATCAGAATCTCCGCATATCACACCATTTTTATTCTTTTTGAAATTCCCTATTTCTGTTCCTCTCTCTATATATATATTTATATTTACATGTTGAAATTTATTTTTCATATTTATTTTTCATTTATCCTCCGATTTTTAATCCAAAAAATCGAAACCAGGAGAAGCGGAGATCAATAATGGGTTCGCTCTCTCCTGAACTCAGTTTGGATTTTCGATCCACTTTTGTGCCGAAATCGATCTCCGATTTCTTCAAACAGGTCTCGATGATCGATAATGTGTCCGATAGGGTCATCAAACTTAACGATTTCGTCAAAAGTTTGGAGGATGAAATGAGGAAGATCGATGCCTTCAAGCGGGAGCTTCCGCTCTGTATGATTCTGTTGAAAGATGGTGAGTTCTTCTTCTCTAATTACAGCTTTTGATTCGATGTCCTATTGTTTGTTTCTGAGATTTGGATTTTGGGGAATCTCTTTGAATATCATTCAAATTGAAACGTTTNACCCTAATTTGCACTCATGGATTGTTCATATTTGTTCTTCAGCCATTTTAGCTGTGAAAGAAGAGGAGTCAATGCAATGTTCCGTTTCTAGAACTCAACCGGTTTTGGAAGAGTTCAAATCTCTGAAAAAAGAATTTGATGAACAAGATTTTAGAAATGGAAAAAATTATAGAGATCAGAAAAATTGGATGAGGTCGGTTCAGCTTTGGAACTCGGATGATAGTAAACGAGATTCCAAATTCGAAACTAAGGTATAAGAATTTCTTCGAAGAATAAGAACAGAGGAATTTGTGCATTTTGTGTTCTTGGTAATTAGATGATAATTAGATTGGAATTTGCTGTCACAGATAAACGAGAAAGGAGGCCCTGTAGTGACTCAAGTTTCATTGCAGTTCTGTAGAAATAAGAATAGTGAGAGGAGCCAAGTGCCATTGAAGGCATATTCTGTGTTCCCTTCAGCCATGGCTGTGAGGAAGGAGGATAAGGAGGAATTTCCAATTCATGGGCTTTCTCTTTGCACTCCGGGAATTAAGAATCCGATAGAGGAGTCAGCTTCTAGTGGTTCGAGATCGAGCGGGACTAGAGCTGTTTCTTCTTCAGCTATGACTGCTCCGGTGAGTTTACAAACCGGAACACAGCAACAGCAACAGCAGCTGTCTCGTAAGCAAAGGAGATGCTGGTCACCAGAGTTGCATCGACGTTTTGTTAGCGCCTTGCAGCAACTTGGAGGCTCACAGGGTAAGAAATGAAGCTCAATGTTAAATATAGATTACAAATCTGTTAATTGATGAGGTTTCTTGTGAGATCCCACATCGGTTGGAGAGGAGAACAAAACATTCTCTATAAATGTGTGAAACCTCTTCCTAGCTGACAAGTTTTAAAAACCTTGAGGGAAAACTCAAGGAAAAGCCCAAAGAGAGCAATATCTGCTAGTGATGAGTTTGGGCTGTTACAAATAGTATTAGAGCTAGATACGGGCCGATGTGTCAGCAAGGAGACTGAGTCCCGAAGGGGGGTGGCACGAGGCGGTGTGCTAGCAAGTACGTTGGGTCCCAAAGGGGAGTAGATTGTGAGATCCCAACCGGGGGGAGAACAAAACATTTTTTATAAGGGTGTGGAAACTTAGCCGGCGCGTTTTAAAACCTTGAGGGAAAGCGGACAATATCTCCTAGCGGTGGGCTTGGGTCGTTAGATTCCTCGTGTTGGCTTGAAATCATGTAAAACTGTAGACTTTTTTCATCATGGTTTAAGCTCAAGAGCTATGAACTTGATTGTTAATATCAATTTTCTGATTCAAACTTTAATTACTTGTTTTAGCAGTGGCCACACCGAAGCAGATTAGAGAGCTTATGCAGGTGGATGGCCTCACCAACGATGAAGTGAAGAGTCATCTGCAGGCAAGCTCCAAAGCTTACTTACTTACTTTTGA
TGTCTGTTTAGCATTTTAATGTTTCGATGGCTCGTTTACAACGATGAAGTTAATGATGTTTTTCGCTTGATCCAGAAATTCCGACTTCACACGAGAAGACTTCCAGCATCTACAGTTACCCCTACGAACCAATCCATTGTCGTTCTTGGGGGATTGTTGGTGGCCGAAGATCAGTTCCGCGAGTCATCAAAAGCTTGCAGTTCTCAATCTGGGTCACCCCAAGGTCCCCTTCAGTTAGGCAGGACTGGCGGGACGTCGACCACAGGGGGAGACAGCACGGAGGACGACGACAACGACGTAAAATCAGAGAGCTACAGCTGGAAGAGCCGAAGTAAAAGAACAGGAAAGGAAGATGCATAAGAAGTTAGAAAAAAAAGTATTTAAGGTACATAAAAGAAAGGTAAAGGGTGTTGAGAGAGAGAGAGATAGAGAGCAAGAGGGTGTGAAATTTTGCAGGATTAGAGTCCCTTCCCCTTTAGTAGAAAGACTACTAATGTCTACCACTGTTAAAATGGTACAAATTGTTTGTAGAAACCTATGTTTATGTGAACATATCAAGTTCATCTTCGATAAACAAAAAAAAAAAAAAAAAACAAA

mRNA sequence

CAAATACGCGAGGACACGAACATGAGTTTGATAACTCAGTCAAAACTCGGCGATGTAATGGAGCACTCGCCGATCAGAATCTCCGCATATCACACCATTTTTATTCTTTTTGAAATTCCCTATTTCTGTTCCTCTCTCTATATATATATTTATATTTACATGTTGAAATTTATTTTTCATATTTATTTTTCATTTATCCTCCGATTTTTAATCCAAAAAATCGAAACCAGGAGAAGCGGAGATCAATAATGGGTTCGCTCTCTCCTGAACTCAGTTTGGATTTTCGATCCACTTTTGTGCCGAAATCGATCTCCGATTTCTTCAAACAGGTCTCGATGATCGATAATGTGTCCGATAGGGTCATCAAACTTAACGATTTCGTCAAAAGTTTGGAGGATGAAATGAGGAAGATCGATGCCTTCAAGCGGGAGCTTCCGCTCTGTATGATTCTGTTGAAAGATGCCATTTTAGCTGTGAAAGAAGAGGAGTCAATGCAATGTTCCGTTTCTAGAACTCAACCGGTTTTGGAAGAGTTCAAATCTCTGAAAAAAGAATTTGATGAACAAGATTTTAGAAATGGAAAAAATTATAGAGATCAGAAAAATTGGATGAGGTCGGTTCAGCTTTGGAACTCGGATGATAGTAAACGAGATTCCAAATTCGAAACTAAGATAAACGAGAAAGGAGGCCCTGTAGTGACTCAAGTTTCATTGCAGTTCTGTAGAAATAAGAATAGTGAGAGGAGCCAAGTGCCATTGAAGGCATATTCTGTGTTCCCTTCAGCCATGGCTGTGAGGAAGGAGGATAAGGAGGAATTTCCAATTCATGGGCTTTCTCTTTGCACTCCGGGAATTAAGAATCCGATAGAGGAGTCAGCTTCTAGTGGTTCGAGATCGAGCGGGACTAGAGCTGTTTCTTCTTCAGCTATGACTGCTCCGGTGAGTTTACAAACCGGAACACAGCAACAGCAACAGCAGCTGTCTCGTAAGCAAAGGAGATGCTGGTCACCAGAGTTGCATCGACGTTTTGTTAGCGCCTTGCAGCAACTTGGAGGCTCACAGGTGGCCACACCGAAGCAGATTAGAGAGCTTATGCAGGTGGATGGCCTCACCAACGATGAAAAATTCCGACTTCACACGAGAAGACTTCCAGCATCTACAGTTACCCCTACGAACCAATCCATTGTCGTTCTTGGGGGATTGTTGGTGGCCGAAGATCAGTTCCGCGAGTCATCAAAAGCTTGCAGTTCTCAATCTGGGTCACCCCAAGGTCCCCTTCAGTTAGGCAGGACTGGCGGGACGTCGACCACAGGGGGAGACAGCACGGAGGACGACGACAACGACGTAAAATCAGAGAGCTACAGCTGGAAGAGCCGAAGTAAAAGAACAGGAAAGGAAGATGCATAAGAAGTTAGAAAAAAAAGTATTTAAGGTACATAAAAGAAAGGTAAAGGGTGTTGAGAGAGAGAGAGATAGAGAGCAAGAGGGTGTGAAATTTTGCAGGATTAGAGTCCCTTCCCCTTTAGTAGAAAGACTACTAATGTCTACCACTGTTAAAATGGTACAAATTGTTTGTAGAAACCTATGTTTATGTGAACATATCAAGTTCATCTTCGATAAACAAAAAAAAAAAAAAAAAACAAA

Coding sequence (CDS)

ATGGGTTCGCTCTCTCCTGAACTCAGTTTGGATTTTCGATCCACTTTTGTGCCGAAATCGATCTCCGATTTCTTCAAACAGGTCTCGATGATCGATAATGTGTCCGATAGGGTCATCAAACTTAACGATTTCGTCAAAAGTTTGGAGGATGAAATGAGGAAGATCGATGCCTTCAAGCGGGAGCTTCCGCTCTGTATGATTCTGTTGAAAGATGCCATTTTAGCTGTGAAAGAAGAGGAGTCAATGCAATGTTCCGTTTCTAGAACTCAACCGGTTTTGGAAGAGTTCAAATCTCTGAAAAAAGAATTTGATGAACAAGATTTTAGAAATGGAAAAAATTATAGAGATCAGAAAAATTGGATGAGGTCGGTTCAGCTTTGGAACTCGGATGATAGTAAACGAGATTCCAAATTCGAAACTAAGATAAACGAGAAAGGAGGCCCTGTAGTGACTCAAGTTTCATTGCAGTTCTGTAGAAATAAGAATAGTGAGAGGAGCCAAGTGCCATTGAAGGCATATTCTGTGTTCCCTTCAGCCATGGCTGTGAGGAAGGAGGATAAGGAGGAATTTCCAATTCATGGGCTTTCTCTTTGCACTCCGGGAATTAAGAATCCGATAGAGGAGTCAGCTTCTAGTGGTTCGAGATCGAGCGGGACTAGAGCTGTTTCTTCTTCAGCTATGACTGCTCCGGTGAGTTTACAAACCGGAACACAGCAACAGCAACAGCAGCTGTCTCGTAAGCAAAGGAGATGCTGGTCACCAGAGTTGCATCGACGTTTTGTTAGCGCCTTGCAGCAACTTGGAGGCTCACAGGTGGCCACACCGAAGCAGATTAGAGAGCTTATGCAGGTGGATGGCCTCACCAACGATGAAAAATTCCGACTTCACACGAGAAGACTTCCAGCATCTACAGTTACCCCTACGAACCAATCCATTGTCGTTCTTGGGGGATTGTTGGTGGCCGAAGATCAGTTCCGCGAGTCATCAAAAGCTTGCAGTTCTCAATCTGGGTCACCCCAAGGTCCCCTTCAGTTAGGCAGGACTGGCGGGACGTCGACCACAGGGGGAGACAGCACGGAGGACGACGACAACGACGTAAAATCAGAGAGCTACAGCTGGAAGAGCCGAAGTAAAAGAACAGGAAAGGAAGATGCATAA

Protein sequence

MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKRELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQDFRNGKNYRDQKNWMRSVQLWNSDDSKRDSKFETKINEKGGPVVTQVSLQFCRNKNSERSQVPLKAYSVFPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAMTAPVSLQTGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDEKFRLHTRRLPASTVTPTNQSIVVLGGLLVAEDQFRESSKACSSQSGSPQGPLQLGRTGGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKEDA
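The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code applies (the page does not name the translation table), the first and last codons of the CDS above can be translated as follows; the `translate` helper is hypothetical and written only for illustration.

```python
# Standard genetic code, built in TCAG order (an assumption: the page does
# not say which translation table was used for this gene model).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X = ambiguous codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGGTTCGCTCTCTCCTGAACTCAGTTTG"))  # MGSLSPELSL
print(translate("GAAGATGCATAA"))                    # EDA
```

The two fragments reproduce the start (MGSLSPELSL) and the end (EDA) of the protein sequence shown above; passing the full CDS string to the same function would be the natural consistency check.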
BLAST of Cp4.1LG05g12030 vs. Swiss-Prot
Match: EFM_ARATH (Myb family transcription factor EFM OS=Arabidopsis thaliana GN=EFM PE=1 SV=2)

HSP 1 Score: 143.7 bits (361), Expect = 4.3e-33
Identity = 118/334 (35.33%), Positives = 168/334 (50.30%), Query Frame = 1

Query: 5   SPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKRELPL 64
           S ELSLD +    P+S S   K             KL D +  LE E  KIDAFKRELPL
Sbjct: 4   SSELSLDCK----PQSYSMLLKSFGDNFQSDPTTHKLEDLLSRLEQERLKIDAFKRELPL 63

Query: 65  CMILLKDAILAVKEE-ESMQCSVSR------TQPVLEEFKSLKKEFDEQDFRNGKNYRDQ 124
           CM LL +A+   K++ E+ + + +       T+PVLEEF  L+ + ++ + +        
Sbjct: 64  CMQLLNNAVEVYKQQLEAYRANSNNNNQSVGTRPVLEEFIPLRNQPEKTNNKGS------ 123

Query: 125 KNWMRSVQLWNSDDSKR---DSKFETKINEKGGPVVTQVSLQFCRNKNSERSQVPLKAYS 184
            NWM + QLW+  ++K    DS  +  + +       ++     + +N   + +P     
Sbjct: 124 -NWMTTAQLWSQSETKPKNIDSTTDQSLPKDEINSSPKLGHFDAKQRNGSGAFLPFSKEQ 183

Query: 185 VFPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAMTAPVSLQ 244
             P  +A+  E K   P +  +    G    +  + ++ + ++   + S+          
Sbjct: 184 SLPE-LALSTEVKRVSPTNEHTNGQDGNDESMINNDNNYNNNNNNNSNSN---------- 243

Query: 245 TGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE--- 304
            G      Q +RK RRCWSP+LHRRFV ALQ LGGSQVATPKQIRELM+VDGLTNDE   
Sbjct: 244 -GVSSTTSQSNRKARRCWSPDLHRRFVQALQMLGGSQVATPKQIRELMKVDGLTNDEVKS 303

Query: 305 ---KFRLHTRRLPASTVTP--TNQSIVVLGGLLV 321
              K+RLHTRR   S  T       +VVLGG+ V
Sbjct: 304 HLQKYRLHTRRPSPSPQTSGGPGPHLVVLGGIWV 314
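The percentages in each HSP summary line are the residue counts divided by the alignment length. The sketch below recomputes them for the EFM_ARATH hit; the regular expression and function name are hypothetical, and the pattern accepts the label spelled either "Positives" or "Postives".

```python
import re

# Matches the HSP summary line format used in the hit reports on this page.
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


line = "Identity = 118/334 (35.33%), Positives = 168/334 (50.30%), Query Frame = 1"
print(hsp_percentages(line))  # (35.33, 50.3)
```

Both values agree with the figures reported for this hit: 118 identical and 168 positive-scoring positions over a 334-column alignment.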

BLAST of Cp4.1LG05g12030 vs. Swiss-Prot
Match: ORR23_ORYSI (Two-component response regulator ORR23 OS=Oryza sativa subsp. indica GN=RR23 PE=3 SV=1)

HSP 1 Score: 52.8 bits (125), Expect = 1.0e-05
Identity = 37/79 (46.84%), Positives = 49/79 (62.03%), Query Frame = 1

Query: 246 RKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTND------EKFRLHTRR 305
           +K R  WS ELHR+FV+A+ QLG  + A PK+I ELM V+ LT +      +K+RL+ +R
Sbjct: 213 KKPRVVWSVELHRKFVAAVNQLGIDK-AVPKRILELMNVEKLTRENVASHLQKYRLYLKR 272

Query: 306 LPASTVTPTNQSIV-VLGG 318
           L  S V     SIV  LGG
Sbjct: 273 L--SAVASQQVSIVAALGG 288

BLAST of Cp4.1LG05g12030 vs. Swiss-Prot
Match: ORR23_ORYSJ (Two-component response regulator ORR23 OS=Oryza sativa subsp. japonica GN=RR23 PE=2 SV=1)

HSP 1 Score: 52.8 bits (125), Expect = 1.0e-05
Identity = 37/79 (46.84%), Positives = 49/79 (62.03%), Query Frame = 1

Query: 246 RKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTND------EKFRLHTRR 305
           +K R  WS ELHR+FV+A+ QLG  + A PK+I ELM V+ LT +      +K+RL+ +R
Sbjct: 213 KKPRVVWSVELHRKFVAAVNQLGIDK-AVPKRILELMNVEKLTRENVASHLQKYRLYLKR 272

Query: 306 LPASTVTPTNQSIV-VLGG 318
           L  S V     SIV  LGG
Sbjct: 273 L--SAVASQQVSIVAALGG 288

BLAST of Cp4.1LG05g12030 vs. TrEMBL
Match: A0A0A0LIJ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G236120 PE=4 SV=1)

HSP 1 Score: 523.9 bits (1348), Expect = 1.7e-145
Identity = 291/396 (73.48%), Positives = 330/396 (83.33%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGSL  EL+LDFR +FVPK+I+DFFK+VSMI NVSDRV KLNDF+K+LEDE+RKIDAFKR
Sbjct: 1   MGSLPSELTLDFRPSFVPKTITDFFKEVSMIGNVSDRVSKLNDFIKTLEDEVRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQD---FRNGKNYRDQ 120
           ELPLCM+LLKDAILAVK+E+ MQC+V +T+PVLEEF  LKKE +E D    + G +YRDQ
Sbjct: 61  ELPLCMVLLKDAILAVKDEK-MQCAVPKTKPVLEEFIPLKKEQEEDDGDDSKKGNDYRDQ 120

Query: 121 KNWMRSVQLWNSDDSKRDS-KFETKINEKGGPVVTQVSLQFCRNKNSERSQVPLK-AYSV 180
           KNWM SVQLWNSDD+   + K ETK NEKGGPVVTQVS+Q CR KN ER QVP K +Y +
Sbjct: 121 KNWMSSVQLWNSDDNHHSNYKLETKRNEKGGPVVTQVSMQSCRTKNGERIQVPFKPSYPI 180

Query: 181 FPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAMTAPVSLQT 240
           F SAM  RKEDKEEFPIHGLSLCTPGIK+P+EESAS+GSRSSGTRAVSSS +TA V+L+T
Sbjct: 181 FSSAMVARKEDKEEFPIHGLSLCTPGIKSPMEESASTGSRSSGTRAVSSSTLTASVNLRT 240

Query: 241 GTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE---- 300
           G QQQ+QQ SRKQRRCWS ELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE    
Sbjct: 241 GMQQQKQQCSRKQRRCWSKELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDEVKSH 300

Query: 301 --KFRLHTRRLPASTVTPTNQ-SIVVLGGLLVAEDQFRESSKACSSQSGSPQGPLQLGRT 360
             KFRLH RRLPAS V P NQ S+VVLGGLLV +D + +SSKACSSQSGSPQGPLQL   
Sbjct: 301 LQKFRLHARRLPASAVPPANQSSVVVLGGLLVPQDPYADSSKACSSQSGSPQGPLQL--- 360

Query: 361 GGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
              + TGGDS E+++ DVKSESY WKSR ++ G ED
Sbjct: 361 ---AGTGGDSMEEEE-DVKSESYCWKSRIQKPGNED 388

BLAST of Cp4.1LG05g12030 vs. TrEMBL
Match: A0A067L2E9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04882 PE=4 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 5.1e-118
Identity = 248/410 (60.49%), Positives = 302/410 (73.66%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGS+ PELSLDFR T+VPK+ISDF K+VSMI +VS++V KL+ FVK LE+EMRKIDAFKR
Sbjct: 1   MGSIPPELSLDFRPTYVPKTISDFLKEVSMIGDVSEKVSKLDGFVKGLEEEMRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQD-------FRNGKN 120
           ELPLCM+LL DAIL +K E SMQC+ S  +PVLEEF  LKK  +E++        +  K+
Sbjct: 61  ELPLCMLLLNDAILFLKTE-SMQCTTSNNRPVLEEFIPLKKSDEEEEEEEEESPIKKEKD 120

Query: 121 YRDQKNWMRSVQLWNSD---------DSKRDSKFETKINEKGGPVVTQVSLQFCRNKNSE 180
           Y+D+KNWM SVQLWNS+         D K++ K E K  +KG     + + Q C+++N+ 
Sbjct: 121 YKDKKNWMSSVQLWNSNEHSSTDYIFDPKQNLKLEYKSTKKGNQYANEDTFQACKSRNAA 180

Query: 181 RSQVPLKAYSVFPSAMAVRK---EDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRA 240
           R+ +P K Y+         K   ++ EE PI GLSL TPGIKN  EES S+GSR+S +RA
Sbjct: 181 RAFMPFKTYTGLSRKEDSNKNHNQNSEELPIPGLSLLTPGIKNLREESVSTGSRTSCSRA 240

Query: 241 VSSSAMTAPVSLQTGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIREL 300
           VSSSA    ++L+ G Q  QQQ +RKQRRCWSPELHRRFV+ALQQLGGSQ ATPKQIREL
Sbjct: 241 VSSSAPNPQLNLRNGLQPSQQQTARKQRRCWSPELHRRFVNALQQLGGSQAATPKQIREL 300

Query: 301 MQVDGLTNDE------KFRLHTRRLPASTVTPTNQSIVVLGGLLVAEDQFRESSKACSSQ 360
           MQVDGLTNDE      K+RLHTRR+P +T  P NQS+VVLGGL V++DQ+ +SSKA SSQ
Sbjct: 301 MQVDGLTNDEVKSHLQKYRLHTRRMPTATAAPANQSLVVLGGLWVSQDQYGDSSKATSSQ 360

Query: 361 SGSPQGPLQL-GRTGGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
           SGSPQGPLQL G TGGTSTTGGDS EDD+ D KSE YSWKS   R+GK+D
Sbjct: 361 SGSPQGPLQLAGNTGGTSTTGGDSMEDDE-DAKSEGYSWKSHIHRSGKDD 408

BLAST of Cp4.1LG05g12030 vs. TrEMBL
Match: M5VM72_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006812mg PE=4 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 2.4e-115
Identity = 244/406 (60.10%), Positives = 295/406 (72.66%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGS+ PELSLDFR TFVPK+ISDF K+VSMI +VS+R+ KL+DFVK LE+EMRKIDAFKR
Sbjct: 1   MGSVPPELSLDFRPTFVPKTISDFLKEVSMIGSVSERLSKLDDFVKRLEEEMRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDE-----QDFRNGKNYR 120
           ELPLCM LL DAILA+K E+++QC+    QPVLEEF  LKK+ ++      + +  K+ R
Sbjct: 61  ELPLCMFLLNDAILALK-EDAVQCAAPNVQPVLEEFIPLKKDCEKNNGGTNNNKKEKDCR 120

Query: 121 DQKNWMRSVQLWNSD-----------DSKRDSKFETKINEKGGPVVTQVSLQFCRNKNSE 180
           D+KNWM SVQLWN+D           D K+ S+ ++K NE    +  +   Q CRN+ + 
Sbjct: 121 DKKNWMSSVQLWNTDNYQHPSSDFPYDRKQVSEIDSKRNEAENGLANEDPFQTCRNRTAG 180

Query: 181 RSQVPLKAYSVFPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSS 240
           R+ +P K Y  F S  A RKEDKEE P+HGLSL TPGIKNP EESASSGSRS+  RAVS 
Sbjct: 181 RAFMPFKTYPAF-SVTAARKEDKEELPVHGLSLLTPGIKNPKEESASSGSRSTCGRAVSF 240

Query: 241 SAMTAPVSLQTGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQV 300
           S   A  +++T     QQ  SRKQRRCWSPELHRRFV+ALQQLGGSQVATPKQIRELMQV
Sbjct: 241 STANAQSNMRT---PPQQPTSRKQRRCWSPELHRRFVNALQQLGGSQVATPKQIRELMQV 300

Query: 301 DGLTNDE------KFRLHTRRLPASTVTPTNQSIVVLGGLLVAEDQFRESSKACSSQSGS 360
           DGLTNDE      K+RLHTRR+PA+T  P NQS+VVLGGL +++DQ+ +SSKA SSQSGS
Sbjct: 301 DGLTNDEVKSHLQKYRLHTRRVPAATAAPDNQSVVVLGGLWMSQDQYADSSKASSSQSGS 360

Query: 361 PQGPLQLGRTGGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
           PQGPL L  TGG         ++DD D KSESYSWK   +++GK D
Sbjct: 361 PQGPLHLTGTGG--------DDEDDEDAKSESYSWKGHIQKSGKND 393

BLAST of Cp4.1LG05g12030 vs. TrEMBL
Match: A5BS05_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_035944 PE=4 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 4.1e-115
Identity = 249/404 (61.63%), Positives = 301/404 (74.50%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGS+ PELSLD R ++VPK+I+DF   +S I +VS+RV KL++F+K LE+EMRKIDAFKR
Sbjct: 1   MGSIPPELSLDLRPSYVPKTINDFLSGISTIGDVSERVTKLDEFLKRLEEEMRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQD-FRNGKNYRDQKN 120
           ELPLCMILL DAILA+KEE  ++   S  QPVLEEF  LKK+ DE+   +  K+ +D+KN
Sbjct: 61  ELPLCMILLSDAILALKEE-LLRSKASNVQPVLEEFIPLKKDCDEEGGAKEEKDCKDKKN 120

Query: 121 WMRSVQLWNSDDS---------KRDSKFETK--INEKGGPVVTQVSLQFCRNKNSERSQV 180
           WM SVQLWNSDDS         K+DSK E K  ++E+  P  T+   Q C+++   R+ +
Sbjct: 121 WMSSVQLWNSDDSASTDHIYDKKQDSKLEIKQRVDEENRPA-TEDPFQPCKSRTGGRAFL 180

Query: 181 PLKAYSVFPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAMT 240
           P K YS FP A A RKEDK+E P+HGLSL TPGIKNP EES SSGS++S +R VSSSA  
Sbjct: 181 PFKGYSGFPVATA-RKEDKDELPVHGLSLLTPGIKNPREESGSSGSKTSCSRGVSSSAPN 240

Query: 241 APVSLQTGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLT 300
              +L+TG Q  QQQ +RKQRRCWSPELHRRFV+ALQQLGGSQ ATPKQIRELMQVDGLT
Sbjct: 241 LQPNLRTGPQPPQQQTARKQRRCWSPELHRRFVNALQQLGGSQAATPKQIRELMQVDGLT 300

Query: 301 NDE------KFRLHTRRLP-ASTVTPTNQSIVVLGGLLVAEDQFRESSKACSSQSGSPQG 360
           NDE      K+RLHTRR+P  S   P+NQ +VVLG L + +DQ+ +SSKA SSQS SPQG
Sbjct: 301 NDEVKSHLQKYRLHTRRMPTTSAAPPSNQPVVVLGSLWMPQDQYGDSSKASSSQSASPQG 360

Query: 361 PLQL-GRTGGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
           PLQL G  GGTSTTGGDS EDD+ D KSESY WKS+  ++ KED
Sbjct: 361 PLQLGGAAGGTSTTGGDSMEDDE-DEKSESYGWKSQVHKSVKED 400

BLAST of Cp4.1LG05g12030 vs. TrEMBL
Match: A0A061E0R0_THECC (Homeodomain-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_006998 PE=4 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 7.0e-115
Identity = 258/422 (61.14%), Positives = 304/422 (72.04%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGS+ PELSLDFR TFVPK+IS+F K+VSM+ NVSD+V K++ FVK LE+EMRKIDAFKR
Sbjct: 1   MGSVPPELSLDFRPTFVPKTISNFLKEVSMVGNVSDKVSKVDAFVKGLEEEMRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEF---KSLKKE--FDEQDFR------ 120
           ELPLCM+LL DAI+A+K EESMQC     +PVLEEF   K+ KKE    E+D        
Sbjct: 61  ELPLCMLLLNDAIVALK-EESMQCVTRNVEPVLEEFIPLKNNKKETKHSEEDGASITTKK 120

Query: 121 ----NGKNY---RDQKNWMRSVQLWNSDDS---KRDSKFETKINEKGGPVVTQVSLQFCR 180
               N  NY   +D+KNWM SVQLWN+DD      D K +TK N++          Q C+
Sbjct: 121 DKDPNNNNYNINKDKKNWMSSVQLWNTDDDDYRSTDHKLDTKRNDED-------PFQGCK 180

Query: 181 NKNSERSQVPLKAYSVFPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGT 240
           N+ S R+ +P K        +AVRKE+KEE P+HGL+L TPGIKN  EES S+GSR+S +
Sbjct: 181 NRGSARAFMPFKP----NLGLAVRKEEKEEIPVHGLTLLTPGIKNLKEESGSTGSRTSCS 240

Query: 241 RAVSSSAMTAPVSLQTG---------TQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGS 300
           RAVSSSA  A  + ++G          QQQQQQ +RKQRRCWSPELHRRFV+ALQQLGGS
Sbjct: 241 RAVSSSAPNAQSNFRSGPQPLAHHLQQQQQQQQTARKQRRCWSPELHRRFVNALQQLGGS 300

Query: 301 QVATPKQIRELMQVDGLTNDE------KFRLHTRRLPASTVTPTNQSIVVLG-GLLVAED 360
           QVATPKQIRELMQVDGLTNDE      K+RLHTRRLP ST TP NQS+VVLG GL +++D
Sbjct: 301 QVATPKQIRELMQVDGLTNDEVKSHLQKYRLHTRRLPPSTTTPANQSVVVLGSGLWISQD 360

Query: 361 QFRESSKACSSQSGSPQGPLQL-GRTGGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGK 385
           Q+ ESSK  SSQSGSPQGPLQL   TGGTSTTGGDS EDD+ D KSESYSWKS   + GK
Sbjct: 361 QYGESSKGSSSQSGSPQGPLQLAANTGGTSTTGGDSMEDDE-DAKSESYSWKSHIHKPGK 409

BLAST of Cp4.1LG05g12030 vs. TAIR10
Match: AT1G49560.1 (AT1G49560.1 Homeodomain-like superfamily protein)

HSP 1 Score: 151.0 bits (380), Expect = 1.5e-36
Identity = 151/392 (38.52%), Positives = 204/392 (52.04%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGSL  ELSL           S F + VSM  NV   V K+++ VK LE+E RK+++ + 
Sbjct: 1   MGSLGDELSLG----------SIFGRGVSM--NVV-AVEKVDEHVKKLEEEKRKLESCQL 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQDFRNGKNYRDQKNW 120
           ELPL + +L DAIL +K++   +CS   TQP+L++F S+ K                   
Sbjct: 61  ELPLSLQILNDAILYLKDK---RCSEMETQPLLKDFISVNKPIQG--------------- 120

Query: 121 MRSVQLWNSDDSKRDSKFET-KINEKGGPVVTQVSLQFCRNKNSERSQVPLKAYSVFPSA 180
            R ++L   ++  R+ KF+  K N+               + +  +S++ +K        
Sbjct: 121 ERGIELLKREELMREKKFQQWKANDD--------------HTSKIKSKLEIK-------- 180

Query: 181 MAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGS-RSSGTRA---VSSSAMTAPVSLQT 240
              R E+K         L  P ++  +    SS S R  G  A    +S++M  P +   
Sbjct: 181 ---RNEEKSPM------LLIPKVETGLGLGLSSSSIRRKGIVASCGFTSNSMPQPPTPAV 240

Query: 241 GTQQQ--QQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE-- 300
             Q    +QQ  RKQRRCW+PELHRRFV ALQQLGG  VATPKQIRE MQ +GLTNDE  
Sbjct: 241 PQQPAFLKQQALRKQRRCWNPELHRRFVDALQQLGGPGVATPKQIREHMQEEGLTNDEVK 300

Query: 301 ----KFRLHTRRLPASTVTPTNQSIVVLGGLL---VAEDQFR-----ESSKACSSQSGSP 360
               K+RLH R+ P S      QS VVLG  L    A+D+       ES K  ++QS SP
Sbjct: 301 SHLQKYRLHIRK-PNSNA--EKQSAVVLGFNLWNSSAQDEEETCEGGESLKRSNAQSDSP 325

Query: 361 QGPLQLGRTGGTSTTGGDSTEDDDNDVKSESY 372
           QGPLQL  T  T+TTGGDS+ +D  D KSES+
Sbjct: 361 QGPLQLPST--TTTTGGDSSMEDVEDAKSESF 325

BLAST of Cp4.1LG05g12030 vs. TAIR10
Match: AT2G03500.1 (AT2G03500.1 Homeodomain-like superfamily protein)

HSP 1 Score: 143.7 bits (361), Expect = 2.4e-34
Identity = 118/334 (35.33%), Positives = 168/334 (50.30%), Query Frame = 1

Query: 5   SPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKRELPL 64
           S ELSLD +    P+S S   K             KL D +  LE E  KIDAFKRELPL
Sbjct: 4   SSELSLDCK----PQSYSMLLKSFGDNFQSDPTTHKLEDLLSRLEQERLKIDAFKRELPL 63

Query: 65  CMILLKDAILAVKEE-ESMQCSVSR------TQPVLEEFKSLKKEFDEQDFRNGKNYRDQ 124
           CM LL +A+   K++ E+ + + +       T+PVLEEF  L+ + ++ + +        
Sbjct: 64  CMQLLNNAVEVYKQQLEAYRANSNNNNQSVGTRPVLEEFIPLRNQPEKTNNKGS------ 123

Query: 125 KNWMRSVQLWNSDDSKR---DSKFETKINEKGGPVVTQVSLQFCRNKNSERSQVPLKAYS 184
            NWM + QLW+  ++K    DS  +  + +       ++     + +N   + +P     
Sbjct: 124 -NWMTTAQLWSQSETKPKNIDSTTDQSLPKDEINSSPKLGHFDAKQRNGSGAFLPFSKEQ 183

Query: 185 VFPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAMTAPVSLQ 244
             P  +A+  E K   P +  +    G    +  + ++ + ++   + S+          
Sbjct: 184 SLPE-LALSTEVKRVSPTNEHTNGQDGNDESMINNDNNYNNNNNNNSNSN---------- 243

Query: 245 TGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE--- 304
            G      Q +RK RRCWSP+LHRRFV ALQ LGGSQVATPKQIRELM+VDGLTNDE   
Sbjct: 244 -GVSSTTSQSNRKARRCWSPDLHRRFVQALQMLGGSQVATPKQIRELMKVDGLTNDEVKS 303

Query: 305 ---KFRLHTRRLPASTVTP--TNQSIVVLGGLLV 321
              K+RLHTRR   S  T       +VVLGG+ V
Sbjct: 304 HLQKYRLHTRRPSPSPQTSGGPGPHLVVLGGIWV 314

BLAST of Cp4.1LG05g12030 vs. TAIR10
Match: AT1G68670.1 (AT1G68670.1 myb-like transcription factor family protein)

HSP 1 Score: 137.9 bits (346), Expect = 1.3e-32
Identity = 116/322 (36.02%), Positives = 157/322 (48.76%), Query Frame = 1

Query: 40  KLNDFVKSLEDEMRKIDAFKRELPLCMILLKDAILAVKEE-------ESMQCSVSRTQ-- 99
           K +++V++LE+E +KI  F+RELPLC+ L+  AI A ++E        S QCS   T   
Sbjct: 13  KCHEYVEALEEEQKKIQVFQRELPLCLELVTQAIEACRKELSGTTTTTSEQCSEQTTSVC 72

Query: 100 --PVLEEFKSLKK------EFDEQDFRNGKNYRD-------QKNWMRSVQLWNSDDSKRD 159
             PV EEF  +KK      E  E++  +G++          + +W+RSVQLWN       
Sbjct: 73  GGPVFEEFIPIKKISSLCEEVQEEEEEDGEHESSPELVNNKKSDWLRSVQLWNHSPDL-- 132

Query: 160 SKFETKINEKGGPVVTQVSLQFCRNKNSERSQVPLKAYSVFPSAMAVRKEDKEEFPIHGL 219
                                   N   ER     K   V P + A +   K        
Sbjct: 133 ------------------------NPKEERVAKKAKVVEVKPKSGAFQPFQKRVLETD-- 192

Query: 220 SLCTPGIKNPIEESASSGSRSSGTRAVSSSAMTAPVSLQTGTQQQQQQLS-RKQRRCWSP 279
               P +K      A++ S ++ T    S  + A    +   QQQ Q  + RKQRRCWSP
Sbjct: 193 --LQPAVKVASSMPATTTSSTTETCGGKSDLIKAGDEERRIEQQQSQSHTHRKQRRCWSP 252

Query: 280 ELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE------KFRLHTRRLPASTVTPT 325
           ELHRRF++ALQQLGGS VATPKQIR+ M+VDGLTNDE      K+RLHTRR  A++V   
Sbjct: 253 ELHRRFLNALQQLGGSHVATPKQIRDHMKVDGLTNDEVKSHLQKYRLHTRRPAATSVAAQ 304

BLAST of Cp4.1LG05g12030 vs. TAIR10
Match: AT4G37180.2 (AT4G37180.2 Homeodomain-like superfamily protein)

HSP 1 Score: 132.5 bits (332), Expect = 5.6e-31
Identity = 127/381 (33.33%), Positives = 182/381 (47.77%), Query Frame = 1

Query: 8   LSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKRELPLCMI 67
           L+L+     +PK +S F  +VS I +   ++ +++ +V  LE+E  KID FKRELPLCM+
Sbjct: 12  LNLNLSIYSLPKPLSQFLDEVSRIKDNHSKLSEIDGYVGKLEEERNKIDVFKRELPLCML 71

Query: 68  LLKD-------AILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQDFRNGKNYRDQKNW 127
           LL +       AI A+K+E     S+  +    ++ +  K E             D+K+W
Sbjct: 72  LLNEEIVFLCVAIGALKDEARKGLSLMASNGKFDDVERAKPE------------TDKKSW 131

Query: 128 MRSVQLWNSDDSKRDSKFETKINEKGGPVVTQVSLQFCRNKNSERSQVPLKAYSVFPSAM 187
           M S QLW S+    +S+F +   E+    V+Q   Q C   N     +P       P   
Sbjct: 132 MSSAQLWISNP---NSQFRSTNEEEEDRCVSQNPFQTCNYPNQGGVFMPFNRPPPPP--- 191

Query: 188 AVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAMTAPVSLQTGTQQQ 247
                     P   LSL TP  +  ++ S    S         SS             Q 
Sbjct: 192 ----------PPAPLSLMTPTSEMMMDYSRIEQSHHHHQFNKPSS-------------QS 251

Query: 248 QQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE------KFR 307
                ++QRR WS ELHR+FV AL +LGG QVATPKQIR+LM+VDGLTNDE      K+R
Sbjct: 252 HHIQKKEQRRRWSQELHRKFVDALHRLGGPQVATPKQIRDLMKVDGLTNDEVKSHLQKYR 311

Query: 308 LHTRR---LPASTVTPTNQSIVVLGGLLVAEDQFRESSKACSSQSGSPQGPLQLGRTGGT 367
           +H R+    P  T++ ++Q      G+L  E Q    S    S+S SPQ PL + R   +
Sbjct: 312 MHIRKHPLHPTKTLSSSDQP-----GVLERESQ----SLISLSRSDSPQSPL-VARGLFS 341

Query: 368 STTGGDSTEDDDNDVKSESYS 373
           S  G  S ED++ + + E  S
Sbjct: 372 SNVGHSSEEDEEEEDEEEEKS 341

BLAST of Cp4.1LG05g12030 vs. TAIR10
Match: AT1G25550.1 (AT1G25550.1 myb-like transcription factor family protein)

HSP 1 Score: 110.5 bits (275), Expect = 2.3e-24
Identity = 101/320 (31.56%), Positives = 151/320 (47.19%), Query Frame = 1

Query: 35  SDRVIKLNDFVKSLEDEMRKIDAFKRELPLCMILLKDAILAVKEEES---------MQCS 94
           + ++ + +++V++LE+E +KI  F+RELPLC+ L+  AI + ++E S          +CS
Sbjct: 12  TQKMKRCHEYVEALEEEQKKIQVFQRELPLCLELVTQAIESCRKELSESSEHVGGQSECS 71

Query: 95  VSRTQP----VLEEFKSLKKEFDEQDFRNGKNYRDQKNWMRSVQLWNSDDSKRDSKFETK 154
              T      V EEF  +K      D                     +D  +   K E  
Sbjct: 72  ERTTSECGGAVFEEFMPIKWSSASSD--------------------ETDKDEEAEKTEMM 131

Query: 155 INEKGGPVVTQVSLQFCRNKNSERSQVPLKAYSVFPSAMAVRKEDKEEFPIHGLSLCTPG 214
            NE               N   ++    L++  ++  +   +  +K+   I      + G
Sbjct: 132 TNEN--------------NDGDKKKSDWLRSVQLWNQSPDPQPNNKKPMVIEVKR--SAG 191

Query: 215 IKNPIEESASSGSRS---------SGTRAVSSSAMTAPVSLQTGTQQQQQQLSRKQRRCW 274
              P ++     + S         + T   SS+A T     +   +Q+Q   +RKQRRCW
Sbjct: 192 AFQPFQKEKPKAADSQPLIKAITPTSTTTTSSTAETVGGGKEF-EEQKQSHSNRKQRRCW 251

Query: 275 SPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE------KFRLHTRRLPASTV- 321
           SPELHRRF+ ALQQLGGS VATPKQIR+LM+VDGLTNDE      K+RLHTRR PA+ V 
Sbjct: 252 SPELHRRFLHALQQLGGSHVATPKQIRDLMKVDGLTNDEVKSHLQKYRLHTRR-PATPVV 293

BLAST of Cp4.1LG05g12030 vs. NCBI nr
Match: gi|659118629|ref|XP_008459219.1| (PREDICTED: uncharacterized protein LOC103498410 [Cucumis melo])

HSP 1 Score: 526.6 bits (1355), Expect = 3.8e-146
Identity = 296/396 (74.75%), Positives = 328/396 (82.83%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGSL  ELSLDFR TFVPK+I+DFFK+VSMI NVSDRV KLNDF+KSLEDEMRKIDAFKR
Sbjct: 43  MGSLPSELSLDFRPTFVPKTITDFFKEVSMIGNVSDRVSKLNDFIKSLEDEMRKIDAFKR 102

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEF---DEQDFRNGKNYRDQ 120
           ELPLCMILL+DAILAVK+E+ MQC+V +T+PVLEEF  LKKE    D+ D + G + RDQ
Sbjct: 103 ELPLCMILLQDAILAVKDEK-MQCAVLKTKPVLEEFIPLKKEQEEDDDDDSKKGNDCRDQ 162

Query: 121 KNWMRSVQLWNSDDSKRDS-KFETKINEKGGPVVTQVSLQFCRNKNSERSQVPLK-AYSV 180
           KNWM SVQLWNSDD+   + K ETK NEKGGPVVTQVSLQ CR KN ER  VP K +Y +
Sbjct: 163 KNWMSSVQLWNSDDNHHSNYKLETKRNEKGGPVVTQVSLQSCRTKNGERMHVPFKPSYPM 222

Query: 181 FPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAMTAPVSLQT 240
           FPSAM V KEDKEEFPIHGLSLCTPGIKNP+EESAS+GSRSSGTRAVSSS +TA V+L+T
Sbjct: 223 FPSAMVVTKEDKEEFPIHGLSLCTPGIKNPMEESASTGSRSSGTRAVSSSTLTASVNLRT 282

Query: 241 GTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE---- 300
             QQQ+QQ SRKQRRCWS ELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE    
Sbjct: 283 AMQQQKQQCSRKQRRCWSKELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDEVKSH 342

Query: 301 --KFRLHTRRLPASTVTPTNQ-SIVVLGGLLVAEDQFRESSKACSSQSGSPQGPLQLGRT 360
             KFRLH RRLPAS V P NQ S+VVLGGLLV +D + +SSKACSSQSGSPQGPLQL   
Sbjct: 343 LQKFRLHARRLPASAVPPANQSSVVVLGGLLVPQDAYADSSKACSSQSGSPQGPLQL--- 402

Query: 361 GGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
              + TGGDS E++D DVKSESY WKSR ++ G ED
Sbjct: 403 ---AGTGGDSMEEED-DVKSESYCWKSRIQKPGNED 430

BLAST of Cp4.1LG05g12030 vs. NCBI nr
Match: gi|778669153|ref|XP_011649201.1| (PREDICTED: uncharacterized protein LOC101214647 [Cucumis sativus])

HSP 1 Score: 523.9 bits (1348), Expect = 2.4e-145
Identity = 291/396 (73.48%), Positives = 330/396 (83.33%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGSL  EL+LDFR +FVPK+I+DFFK+VSMI NVSDRV KLNDF+K+LEDE+RKIDAFKR
Sbjct: 1   MGSLPSELTLDFRPSFVPKTITDFFKEVSMIGNVSDRVSKLNDFIKTLEDEVRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQD---FRNGKNYRDQ 120
           ELPLCM+LLKDAILAVK+E+ MQC+V +T+PVLEEF  LKKE +E D    + G +YRDQ
Sbjct: 61  ELPLCMVLLKDAILAVKDEK-MQCAVPKTKPVLEEFIPLKKEQEEDDGDDSKKGNDYRDQ 120

Query: 121 KNWMRSVQLWNSDDSKRDS-KFETKINEKGGPVVTQVSLQFCRNKNSERSQVPLK-AYSV 180
           KNWM SVQLWNSDD+   + K ETK NEKGGPVVTQVS+Q CR KN ER QVP K +Y +
Sbjct: 121 KNWMSSVQLWNSDDNHHSNYKLETKRNEKGGPVVTQVSMQSCRTKNGERIQVPFKPSYPI 180

Query: 181 FPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAMTAPVSLQT 240
           F SAM  RKEDKEEFPIHGLSLCTPGIK+P+EESAS+GSRSSGTRAVSSS +TA V+L+T
Sbjct: 181 FSSAMVARKEDKEEFPIHGLSLCTPGIKSPMEESASTGSRSSGTRAVSSSTLTASVNLRT 240

Query: 241 GTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE---- 300
           G QQQ+QQ SRKQRRCWS ELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDE    
Sbjct: 241 GMQQQKQQCSRKQRRCWSKELHRRFVSALQQLGGSQVATPKQIRELMQVDGLTNDEVKSH 300

Query: 301 --KFRLHTRRLPASTVTPTNQ-SIVVLGGLLVAEDQFRESSKACSSQSGSPQGPLQLGRT 360
             KFRLH RRLPAS V P NQ S+VVLGGLLV +D + +SSKACSSQSGSPQGPLQL   
Sbjct: 301 LQKFRLHARRLPASAVPPANQSSVVVLGGLLVPQDPYADSSKACSSQSGSPQGPLQL--- 360

Query: 361 GGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
              + TGGDS E+++ DVKSESY WKSR ++ G ED
Sbjct: 361 ---AGTGGDSMEEEE-DVKSESYCWKSRIQKPGNED 388

BLAST of Cp4.1LG05g12030 vs. NCBI nr
Match: gi|1009115569|ref|XP_015874298.1| (PREDICTED: uncharacterized protein LOC107411267 [Ziziphus jujuba])

HSP 1 Score: 447.6 bits (1150), Expect = 2.2e-122
Identity = 259/408 (63.48%), Positives = 301/408 (73.77%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGS+  ELSLDFR TFVPK+I DF K+VSMI NV +RV K +DF+K LE+EMRKIDAFKR
Sbjct: 1   MGSVLAELSLDFRPTFVPKTIRDFLKEVSMIRNVPERVAKFDDFLKRLEEEMRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQDFRNG----KNYRD 120
           ELPLCM+LL DAIL +KEE + QC+V   +PVLEEF  LKK+ DE + +N     K+ +D
Sbjct: 61  ELPLCMLLLNDAILTLKEELT-QCAVPNAEPVLEEFIPLKKDCDENEEQNNDNKEKDCKD 120

Query: 121 QKNWMRSVQLWNSDD---------SKRDSKFETKINEKGGPVVTQVSLQFCRNKNSERSQ 180
           +KNWM SVQLWN+DD          K  SK E K NEK     T+  +   RN+   RS 
Sbjct: 121 KKNWMSSVQLWNTDDYPTTDFKIDPKLTSKTEIKRNEKEYGFTTEDPVHCSRNRTGGRSF 180

Query: 181 VPLKAYSVFPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSSSAM 240
           +P KAYS FP    +RKEDK+E P+HGLSL TPG+KNP EES SSGSRS+ +RAVSSSA 
Sbjct: 181 IPFKAYSAFPG---IRKEDKDELPVHGLSLLTPGVKNPKEESGSSGSRSTSSRAVSSSAA 240

Query: 241 TAPV-----SLQTGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELM 300
            A       +L+ G QQ QQQ SRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELM
Sbjct: 241 AAAAVNVQSNLRAGQQQPQQQTSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELM 300

Query: 301 QVDGLTNDE------KFRLHTRRLPASTVTPTNQSIVVLGGLLVAEDQFRESSKACSSQS 360
           QVDGLTNDE      K+RLHTRRLPAS+  P NQSIVVL GL +++DQ+ +SSKA SSQS
Sbjct: 301 QVDGLTNDEVKSHLQKYRLHTRRLPASSAAPANQSIVVLSGLWMSQDQYGDSSKASSSQS 360

Query: 361 GSPQGPLQLGRTGGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
           GSPQGPLQL  TGGTSTTG DS EDD+ D +SE YSWKS   + G +D
Sbjct: 361 GSPQGPLQLTGTGGTSTTGCDSMEDDE-DARSEGYSWKSHVHKPGNDD 403

BLAST of Cp4.1LG05g12030 vs. NCBI nr
Match: gi|802598810|ref|XP_012072452.1| (PREDICTED: probable transcription factor KAN3 [Jatropha curcas])

HSP 1 Score: 432.6 bits (1111), Expect = 7.4e-118
Identity = 248/410 (60.49%), Positives = 302/410 (73.66%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGS+ PELSLDFR T+VPK+ISDF K+VSMI +VS++V KL+ FVK LE+EMRKIDAFKR
Sbjct: 1   MGSIPPELSLDFRPTYVPKTISDFLKEVSMIGDVSEKVSKLDGFVKGLEEEMRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDEQD-------FRNGKN 120
           ELPLCM+LL DAIL +K E SMQC+ S  +PVLEEF  LKK  +E++        +  K+
Sbjct: 61  ELPLCMLLLNDAILFLKTE-SMQCTTSNNRPVLEEFIPLKKSDEEEEEEEEESPIKKEKD 120

Query: 121 YRDQKNWMRSVQLWNSD---------DSKRDSKFETKINEKGGPVVTQVSLQFCRNKNSE 180
           Y+D+KNWM SVQLWNS+         D K++ K E K  +KG     + + Q C+++N+ 
Sbjct: 121 YKDKKNWMSSVQLWNSNEHSSTDYIFDPKQNLKLEYKSTKKGNQYANEDTFQACKSRNAA 180

Query: 181 RSQVPLKAYSVFPSAMAVRK---EDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRA 240
           R+ +P K Y+         K   ++ EE PI GLSL TPGIKN  EES S+GSR+S +RA
Sbjct: 181 RAFMPFKTYTGLSRKEDSNKNHNQNSEELPIPGLSLLTPGIKNLREESVSTGSRTSCSRA 240

Query: 241 VSSSAMTAPVSLQTGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIREL 300
           VSSSA    ++L+ G Q  QQQ +RKQRRCWSPELHRRFV+ALQQLGGSQ ATPKQIREL
Sbjct: 241 VSSSAPNPQLNLRNGLQPSQQQTARKQRRCWSPELHRRFVNALQQLGGSQAATPKQIREL 300

Query: 301 MQVDGLTNDE------KFRLHTRRLPASTVTPTNQSIVVLGGLLVAEDQFRESSKACSSQ 360
           MQVDGLTNDE      K+RLHTRR+P +T  P NQS+VVLGGL V++DQ+ +SSKA SSQ
Sbjct: 301 MQVDGLTNDEVKSHLQKYRLHTRRMPTATAAPANQSLVVLGGLWVSQDQYGDSSKATSSQ 360

Query: 361 SGSPQGPLQL-GRTGGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
           SGSPQGPLQL G TGGTSTTGGDS EDD+ D KSE YSWKS   R+GK+D
Sbjct: 361 SGSPQGPLQLAGNTGGTSTTGGDSMEDDE-DAKSEGYSWKSHIHRSGKDD 408
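
The HSP summary lines in these reports give identity and positives as counts over the alignment length, followed by a percentage. The following is a minimal Python sketch, assuming the summary-line layout shown on this page, that parses such a line and recomputes the percentages as a consistency check. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool.

```python
import re

# Pattern for the HSP summary line as it appears on this page.
# The misspelling "Postives" is accepted alongside "Positives".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper: returns the raw counts, the percentages
    reported in the line, and the percentages recomputed from the counts.
    """
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 248/410 (60.49%), Positives = 302/410 (73.66%), Query Frame = 1"
    )
    print(stats)
```

For the Jatropha curcas hit, 248/410 and 302/410 recompute to the reported 60.49% and 73.66%.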

BLAST of Cp4.1LG05g12030 vs. NCBI nr
Match: gi|595795870|ref|XP_007201075.1| (hypothetical protein PRUPE_ppa006812mg [Prunus persica])

HSP 1 Score: 423.7 bits (1088), Expect = 3.4e-115
Identity = 244/406 (60.10%), Positives = 295/406 (72.66%), Query Frame = 1

Query: 1   MGSLSPELSLDFRSTFVPKSISDFFKQVSMIDNVSDRVIKLNDFVKSLEDEMRKIDAFKR 60
           MGS+ PELSLDFR TFVPK+ISDF K+VSMI +VS+R+ KL+DFVK LE+EMRKIDAFKR
Sbjct: 1   MGSVPPELSLDFRPTFVPKTISDFLKEVSMIGSVSERLSKLDDFVKRLEEEMRKIDAFKR 60

Query: 61  ELPLCMILLKDAILAVKEEESMQCSVSRTQPVLEEFKSLKKEFDE-----QDFRNGKNYR 120
           ELPLCM LL DAILA+K E+++QC+    QPVLEEF  LKK+ ++      + +  K+ R
Sbjct: 61  ELPLCMFLLNDAILALK-EDAVQCAAPNVQPVLEEFIPLKKDCEKNNGGTNNNKKEKDCR 120

Query: 121 DQKNWMRSVQLWNSD-----------DSKRDSKFETKINEKGGPVVTQVSLQFCRNKNSE 180
           D+KNWM SVQLWN+D           D K+ S+ ++K NE    +  +   Q CRN+ + 
Sbjct: 121 DKKNWMSSVQLWNTDNYQHPSSDFPYDRKQVSEIDSKRNEAENGLANEDPFQTCRNRTAG 180

Query: 181 RSQVPLKAYSVFPSAMAVRKEDKEEFPIHGLSLCTPGIKNPIEESASSGSRSSGTRAVSS 240
           R+ +P K Y  F S  A RKEDKEE P+HGLSL TPGIKNP EESASSGSRS+  RAVS 
Sbjct: 181 RAFMPFKTYPAF-SVTAARKEDKEELPVHGLSLLTPGIKNPKEESASSGSRSTCGRAVSF 240

Query: 241 SAMTAPVSLQTGTQQQQQQLSRKQRRCWSPELHRRFVSALQQLGGSQVATPKQIRELMQV 300
           S   A  +++T     QQ  SRKQRRCWSPELHRRFV+ALQQLGGSQVATPKQIRELMQV
Sbjct: 241 STANAQSNMRT---PPQQPTSRKQRRCWSPELHRRFVNALQQLGGSQVATPKQIRELMQV 300

Query: 301 DGLTNDE------KFRLHTRRLPASTVTPTNQSIVVLGGLLVAEDQFRESSKACSSQSGS 360
           DGLTNDE      K+RLHTRR+PA+T  P NQS+VVLGGL +++DQ+ +SSKA SSQSGS
Sbjct: 301 DGLTNDEVKSHLQKYRLHTRRVPAATAAPDNQSVVVLGGLWMSQDQYADSSKASSSQSGS 360

Query: 361 PQGPLQLGRTGGTSTTGGDSTEDDDNDVKSESYSWKSRSKRTGKED 385
           PQGPL L  TGG         ++DD D KSESYSWK   +++GK D
Sbjct: 361 PQGPLHLTGTGG--------DDEDDEDAKSESYSWKGHIQKSGKND 393

The following BLAST results are available for this feature:
Match Name    E-value   Identity   Description
EFM_ARATH     4.3e-33   35.33      Myb family transcription factor EFM OS=Arabidopsis thaliana GN=EFM PE=1 SV=2
ORR23_ORYSI   1.0e-05   46.84      Two-component response regulator ORR23 OS=Oryza sativa subsp. indica GN=RR23 PE=...
ORR23_ORYSJ   1.0e-05   46.84      Two-component response regulator ORR23 OS=Oryza sativa subsp. japonica GN=RR23 P...

Match Name         E-value    Identity   Description
A0A0A0LIJ1_CUCSA   1.7e-145   73.48      Uncharacterized protein OS=Cucumis sativus GN=Csa_2G236120 PE=4 SV=1
A0A067L2E9_JATCU   5.1e-118   60.49      Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04882 PE=4 SV=1
M5VM72_PRUPE       2.4e-115   60.10      Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006812mg PE=4 SV=1
A5BS05_VITVI       4.1e-115   61.63      Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_035944 PE=4 SV=1
A0A061E0R0_THECC   7.0e-115   61.14      Homeodomain-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=T...

Match Name    E-value   Identity   Description
AT1G49560.1   1.5e-36   38.52      Homeodomain-like superfamily protein
AT2G03500.1   2.4e-34   35.33      Homeodomain-like superfamily protein
AT1G68670.1   1.3e-32   36.02      myb-like transcription factor family protein
AT4G37180.2   5.6e-31   33.33      Homeodomain-like superfamily protein
AT1G25550.1   2.3e-24   31.56      myb-like transcription factor family protein

Match Name                          E-value    Identity   Description
gi|659118629|ref|XP_008459219.1|    3.8e-146   74.75      PREDICTED: uncharacterized protein LOC103498410 [Cucumis melo]
gi|778669153|ref|XP_011649201.1|    2.4e-145   73.48      PREDICTED: uncharacterized protein LOC101214647 [Cucumis sativus]
gi|1009115569|ref|XP_015874298.1|   2.2e-122   63.48      PREDICTED: uncharacterized protein LOC107411267 [Ziziphus jujuba]
gi|802598810|ref|XP_012072452.1|    7.4e-118   60.49      PREDICTED: probable transcription factor KAN3 [Jatropha curcas]
gi|595795870|ref|XP_007201075.1|    3.4e-115   60.10      hypothetical protein PRUPE_ppa006812mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0003677   DNA binding
Vocabulary: INTERPRO
Term        Definition
IPR017930   Myb_dom
IPR009057   Homeobox-like_sf
IPR006447   Myb_dom_plants
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003677       DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG05g12030.1   Cp4.1LG05g12030.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description      Source     Source Term        Source Description                Alignment
IPR006447   Myb domain, plants   TIGRFAMs   TIGR01557          TIGR01557                         coord: 247..290, score: 7.1
IPR009057   Homeodomain-like     GENE3D     G3DSA:1.10.10.60   -                                 coord: 245..291, score: 1.9
IPR009057   Homeodomain-like     unknown    SSF46689           Homeodomain-like                  coord: 245..297, score: 4.6
IPR017930   Myb domain           PROFILE    PS51294            HTH_MYB                           coord: 244..303, score: 9
None        No IPR available     PANTHER    PTHR31003          MYB FAMILY TRANSCRIPTION FACTOR   coord: 1..384, score: 2.3E
None        No IPR available     PANTHER    PTHR31003:SF7      F14J22.20 PROTEIN                 coord: 1..384, score: 2.3E
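
The domain hits above are reported as `coord: start..end` ranges. The following is a small Python sketch, assuming these are 1-based, inclusive positions on the protein (an assumption, since the page does not state the convention), that parses the ranges and finds the region shared by all Myb and homeodomain-like hits. The helper names `parse_coord`, `span_length`, and `common_region` are hypothetical.

```python
import re

# Matches the "coord: start..end" fragment used in the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) from a 'coord: start..end' fragment."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def common_region(ranges):
    """Intersection of all ranges, or None if they do not all overlap."""
    start = max(s for s, _ in ranges)
    end = min(e for _, e in ranges)
    return (start, end) if start <= end else None


if __name__ == "__main__":
    # The four domain-level hits listed in the table (PANTHER rows omitted
    # because they cover the whole protein).
    hits = [
        "coord: 247..290",  # TIGR01557
        "coord: 245..291",  # G3DSA:1.10.10.60
        "coord: 245..297",  # SSF46689
        "coord: 244..303",  # PS51294
    ]
    ranges = [parse_coord(h) for h in hits]
    for start, end in ranges:
        print(start, end, span_length(start, end))
    print("shared region:", common_region(ranges))
```

Under that assumption, all four hits overlap on residues 247..290, the range of the TIGR01557 hit.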

The following gene(s) are orthologous to this gene:
Gene              Orthologue       Organism                   Block
Cp4.1LG05g12030   Cucsa.110080     Cucumber (Gy14) v1         cgycpeB0247
Cp4.1LG05g12030   CmaCh02G004590   Cucurbita maxima (Rimu)    cmacpeB646
Cp4.1LG05g12030   CmoCh02G004670   Cucurbita moschata (Rifu)  cmocpeB595
Cp4.1LG05g12030   Bhi10G001685     Wax gourd                  cpewgoB0957
Cp4.1LG05g12030   Carg24043        Silver-seed gourd          carcpeB1469

The following gene(s) are paralogous to this gene:

None