Cp4.1LG13g04050 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG13g04050
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionTranscription factor, putative
LocationCp4.1LG13 : 6381999 .. 6383327 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGGTAATTTATCACAAGGAGGGTTGATTCCAGGAGGAACCTCTTATGGAGGTCTCGATTTGCAAGGACCCTTTAAGGTTCATAGCCAAGCACAACACTCTCACGCCTTACACCAACAACATCATCCTCATACTCGTCAGGGATCTGCGGCCAATCCTTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATGCCATGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCGGCTAGTGACGAAGAGCCGAGCTTTACTGAGGATGGTATTGATGGTCATAATGAGACGAGTAAGGGGAAGAAGGGATCGGTATGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTAATTCAGATTTTGATGGGGCTGGAAGAAGGAAATGCCATATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAAGTCATGGCTGAAAGGGGCTATCAAGTTTCACCTCAGCAGTGTGAGGATAAGTTTAATGATCTCAATAAGAGGTATAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCAAGGTTGTTGAGAACCCTGCACTTCTTGATGTTCTTGAATATTTAACAGATAAAGAGAAGGATGATGTGAGAAAGATTTTAAACTCGAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCAAATCGACTTCATCTGCCCCATGATCCTGCTTTGCAGCGTTCTTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGGAGACACCAAAATGATGATTTTGATGAAAACGAACATGATGAAACTGATGAACGTGATGATTTTGAGGAGAATTTTGCACCTCATGGGGACAGTAGACGATCATTCGGGGTATTAGGAGGCTCAGTGAAGAGGCTAAGACGAGACCAAGACCATGATGATACTCATGCCTGTGGCAAATCCTTGAGTTCTCATGCTCACGCACAAGCACAGTTTGCTCAAGCTGATACAGCTCACTTAGAAACTGAAGGTATGAAGGGTTCAACATCGCAAAAACAGTGGATGGAGCTTCGTTTACTTCAATTGGAAGATCAAAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAGTGGGAGAGATTCAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAATGAGAGGATGAAGCTTGAAAACGAGCGCATTGCACTCGACTTAAAGCAAAAGGAAATTGGCTCGGGATTTCATTGA

mRNA sequence

ATGGAAGGTAATTTATCACAAGGAGGGTTGATTCCAGGAGGAACCTCTTATGGAGGTCTCGATTTGCAAGGACCCTTTAAGGTTCATAGCCAAGCACAACACTCTCACGCCTTACACCAACAACATCATCCTCATACTCGTCAGGGATCTGCGGCCAATCCTTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATGCCATGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCGGCTAGTGACGAAGAGCCGAGCTTTACTGAGGATGGTATTGATGGTCATAATGAGACGAGTAAGGGGAAGAAGGGATCGGTATGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTAATTCAGATTTTGATGGGGCTGGAAGAAGGAAATGCCATATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAAGTCATGGCTGAAAGGGGCTATCAAGTTTCACCTCAGCAGTGTGAGGATAAGTTTAATGATCTCAATAAGAGGTATAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCAAGGTTGTTGAGAACCCTGCACTTCTTGATGTTCTTGAATATTTAACAGATAAAGAGAAGGATGATGTGAGAAAGATTTTAAACTCGAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCAAATCGACTTCATCTGCCCCATGATCCTGCTTTGCAGCGTTCTTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGGAGACACCAAAATGATGATTTTGATGAAAACGAACATGATGAAACTGATGAACGTGATGATTTTGAGGAGAATTTTGCACCTCATGGGGACAGTAGACGATCATTCGGGGTATTAGGAGGCTCAGTGAAGAGGCTAAGACGAGACCAAGACCATGATGATACTCATGCCTGTGGCAAATCCTTGAGTTCTCATGCTCACGCACAAGCACAGTTTGCTCAAGCTGATACAGCTCACTTAGAAACTGAAGGTATGAAGGGTTCAACATCGCAAAAACAGTGGATGGAGCTTCGTTTACTTCAATTGGAAGATCAAAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAGTGGGAGAGATTCAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAATGAGAGGATGAAGCTTGAAAACGAGCGCATTGCACTCGACTTAAAGCAAAAGGAAATTGGCTCGGGATTTCATTGA

Coding sequence (CDS)

ATGGAAGGTAATTTATCACAAGGAGGGTTGATTCCAGGAGGAACCTCTTATGGAGGTCTCGATTTGCAAGGACCCTTTAAGGTTCATAGCCAAGCACAACACTCTCACGCCTTACACCAACAACATCATCCTCATACTCGTCAGGGATCTGCGGCCAATCCTTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATGCCATGTCTTTGGTAGATTATAACAAGGGAGAAAGGTGTAAAAACTCGGCTAGTGACGAAGAGCCGAGCTTTACTGAGGATGGTATTGATGGTCATAATGAGACGAGTAAGGGGAAGAAGGGATCGGTATGGCATCGCGTGAAATGGACGGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTAATTCAGATTTTGATGGGGCTGGAAGAAGGAAATGCCATATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAAGTCATGGCTGAAAGGGGCTATCAAGTTTCACCTCAGCAGTGTGAGGATAAGTTTAATGATCTCAATAAGAGGTATAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGCAAGGTTGTTGAGAACCCTGCACTTCTTGATGTTCTTGAATATTTAACAGATAAAGAGAAGGATGATGTGAGAAAGATTTTAAACTCGAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAATTCAAATCGACTTCATCTGCCCCATGATCCTGCTTTGCAGCGTTCTTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGGAGACACCAAAATGATGATTTTGATGAAAACGAACATGATGAAACTGATGAACGTGATGATTTTGAGGAGAATTTTGCACCTCATGGGGACAGTAGACGATCATTCGGGGTATTAGGAGGCTCAGTGAAGAGGCTAAGACGAGACCAAGACCATGATGATACTCATGCCTGTGGCAAATCCTTGAGTTCTCATGCTCACGCACAAGCACAGTTTGCTCAAGCTGATACAGCTCACTTAGAAACTGAAGGTATGAAGGGTTCAACATCGCAAAAACAGTGGATGGAGCTTCGTTTACTTCAATTGGAAGATCAAAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAGTGGGAGAGATTCAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTAAATGAGAGGATGAAGCTTGAAAACGAGCGCATTGCACTCGACTTAAAGCAAAAGGAAATTGGCTCGGGATTTCATTGA

Protein sequence

MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGFSLSMGVVQNCDHAMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSVWHRVKWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHDETDERDDFEENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSLSSHAHAQAQFAQADTAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEIGSGFH
BLAST of Cp4.1LG13g04050 vs. TrEMBL
Match: A0A0A0KBC2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G057100 PE=4 SV=1)

HSP 1 Score: 797.0 bits (2057), Expect = 1.2e-227
Identity = 401/449 (89.31%), Postives = 420/449 (93.54%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEGNLSQGGLIPGG+SYGGLDLQGPFKVH+Q Q SHALHQQHHPHTRQGS+ANPSIQEGF
Sbjct: 1   MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQLSHALHQQHHPHTRQGSSANPSIQEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSVWHRVK 120
           SLSMGVVQNCDH MSLV+YNKGERCKNSASDE+PSF ED IDGHNE SKGKKGS+WHRVK
Sbjct: 61  SLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDSIDGHNENSKGKKGSMWHRVK 120

Query: 121 WTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQC 180
           WTDKMVKLLITAVSYIGDDI SD DG GRRKC IIQKKGKWKLISKV+AERGYQVSPQQC
Sbjct: 121 WTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQC 180

Query: 181 EDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYEE 240
           EDKFNDLNKRYKRLNDIIGRGTSC+VVENPALLDV++YLT+K+KDDVRKILNSKQLFYEE
Sbjct: 181 EDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEE 240

Query: 241 MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHDETDERDDFE 300
           MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E DETDE DD+E
Sbjct: 241 MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPDETDEHDDYE 300

Query: 301 ENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSL-------SSHAHAQAQFAQAD 360
           ENF PH D+RRS GVLGGSVKRL+R QDHDD HACG SL       SSH H+QAQF QAD
Sbjct: 301 ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQAD 360

Query: 361 TAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK 420
           TAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK
Sbjct: 361 TAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK 420

Query: 421 MRMVNERMKLENERIALDLKQKEIGSGFH 443
           MRMVNERMKLENER+ALDLKQK+IGSGFH
Sbjct: 421 MRMVNERMKLENERLALDLKQKQIGSGFH 449

BLAST of Cp4.1LG13g04050 vs. TrEMBL
Match: A0A061FHC2_THECC (Sequence-specific DNA binding transcription factors OS=Theobroma cacao GN=TCM_035562 PE=4 SV=1)

HSP 1 Score: 587.0 bits (1512), Expect = 1.9e-164
Identity = 304/447 (68.01%), Postives = 366/447 (81.88%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEGNLSQGG+I GG S+GGLD+QG  +VH  AQH H +HQ HH + RQG++ +PSI EGF
Sbjct: 1   MEGNLSQGGMISGGGSFGGLDVQGSMRVHHHAQHPHNIHQHHHSNPRQGASIHPSIHEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDE-EPSFTEDGIDGHNETSKGKKGSVWHRV 120
            L+MG +QNCD  +++ DYNKGER K+S SDE EPSFTE+G+DGHN+ +KGKKGS W RV
Sbjct: 61  PLTMGTMQNCDQTIAMTDYNKGERRKSSVSDEDEPSFTEEGVDGHNDGTKGKKGSPWQRV 120

Query: 121 KWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQ 180
           KWTDKMV+LLITAVSYIG+D   D  G  RRK  ++QKKGKWK +SKVMAERGY VSPQQ
Sbjct: 121 KWTDKMVRLLITAVSYIGEDAAGDCGGGMRRKFAVLQKKGKWKSVSKVMAERGYHVSPQQ 180

Query: 181 CEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYE 240
           CEDKFNDLNKRYK+LND++GRGTSC+VVENPALLDV++YLT+KEKDDVRKIL+SK LFYE
Sbjct: 181 CEDKFNDLNKRYKKLNDMLGRGTSCQVVENPALLDVIDYLTEKEKDDVRKILSSKHLFYE 240

Query: 241 EMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHD-ETDERDD 300
           EMCSYHN NRLHLPHDP LQRSLQLA R+RDDH+ND+ RRHQ+DD D+++HD ETD+ D+
Sbjct: 241 EMCSYHNGNRLHLPHDPQLQRSLQLALRSRDDHENDDARRHQHDDLDDDDHDMETDDHDE 300

Query: 301 FEENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHAC-GKSLSSHAHAQAQFA-----QA 360
           FEEN A HGDSR  +GVLGGS KR R+ Q H+D  AC   SL+S    ++ F+     QA
Sbjct: 301 FEENHALHGDSRGMYGVLGGSAKRSRQGQVHED--ACFQNSLNSQDCNKSSFSYSPINQA 360

Query: 361 DTAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELE 420
           D   +  +  + +  QKQW+E R LQLE+QKLQIQVEMLELEKQ+FKW+RF+KK+DRELE
Sbjct: 361 DMNQVLPDNTRAAWLQKQWIESRSLQLEEQKLQIQVEMLELEKQRFKWQRFSKKRDRELE 420

Query: 421 KMRMVNERMKLENERIALDLKQKEIGS 440
           KMRM NERMKLENER+AL+LK+KE  +
Sbjct: 421 KMRMENERMKLENERMALELKRKEFAA 445

BLAST of Cp4.1LG13g04050 vs. TrEMBL
Match: M5VYL5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa1027145mg PE=4 SV=1)

HSP 1 Score: 570.1 bits (1468), Expect = 2.4e-159
Identity = 301/448 (67.19%), Postives = 361/448 (80.58%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEG+LSQGG++PGG SYGGLDL+G  +V  Q QH H +HQ HHPH RQGS A+PSI EGF
Sbjct: 1   MEGHLSQGGMVPGGASYGGLDLEGSMRVQHQTQHPHTIHQ-HHPHPRQGSLAHPSIHEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDE-EPSFTEDGIDGHNETSKGKKGSVWHRV 120
            L MG + NCD  +S++DYNKGER KNSASDE EPS+TE+G DGH E ++GKKGS W RV
Sbjct: 61  PLKMGTMHNCDQTISMMDYNKGERSKNSASDEDEPSYTEEGTDGHAEGNRGKKGSPWQRV 120

Query: 121 KWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQ 180
           KWTD+MVKLLITAVSYIG+D +SD    GRRK   +QKKGKWK +SKVMAERG+ VSPQQ
Sbjct: 121 KWTDQMVKLLITAVSYIGEDTSSDCGSGGRRKYSTLQKKGKWKSVSKVMAERGFHVSPQQ 180

Query: 181 CEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYE 240
           CEDKFNDLNKRYK+LND++GRGTSC+VVEN ALLDV++YLT+KEKDDVRKIL+SK LFYE
Sbjct: 181 CEDKFNDLNKRYKKLNDMLGRGTSCQVVENQALLDVIDYLTEKEKDDVRKILSSKHLFYE 240

Query: 241 EMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHD-ETDERDD 300
           EMCSYHN NRLHLPHDPALQ+SLQ A R RD+HDND+PRRH +DD DE++ D ETDE +D
Sbjct: 241 EMCSYHNGNRLHLPHDPALQKSLQRALR-RDEHDNDDPRRHHHDDLDEDDQDMETDEHED 300

Query: 301 FEENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSL-----SSHAHAQAQFAQAD 360
           FEEN A H D+R  + VL  SVKRLR+ Q  ++ +  G SL     +  ++       AD
Sbjct: 301 FEENNASHVDNRGIY-VLEDSVKRLRQGQGREEFN-YGSSLNPQDCNKSSYCHPPIPPAD 360

Query: 361 TAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK 420
              +  +G K +  QKQW+E R LQLE+QKLQIQVEMLELEKQ+FKW+RF+KK+DRELEK
Sbjct: 361 MNQVLPDGTKAAWLQKQWIESRSLQLEEQKLQIQVEMLELEKQRFKWQRFSKKRDRELEK 420

Query: 421 MRMVNERMKLENERIALDLKQKEIGSGF 442
           +RM NERMKLENER+AL+LK+KE+G+GF
Sbjct: 421 LRMENERMKLENERMALELKRKEMGAGF 444

BLAST of Cp4.1LG13g04050 vs. TrEMBL
Match: A0A059BYD0_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F04240 PE=4 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 1.2e-158
Identity = 298/450 (66.22%), Postives = 356/450 (79.11%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEGN+SQGG+IPG  +YGGLDLQ   ++H QAQ SH LHQ HH H RQ    +PSI EGF
Sbjct: 1   MEGNMSQGGMIPGNPAYGGLDLQRSMQIHGQAQQSHTLHQPHHSHPRQTPGVHPSIHEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASD-EEPSFTEDGIDGHNETSKGKKGSVWHRV 120
            L+MG +  CD  +S+ D+NKGERCKNSASD ++PSFTEDGID HN+ SKGKKGS W RV
Sbjct: 61  PLTMGALPGCDQMISVGDFNKGERCKNSASDDDDPSFTEDGIDNHNDVSKGKKGSPWQRV 120

Query: 121 KWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQ 180
           KWTDKMV+LLITAVSYIG+D  SD  G GRRK  ++QKKGKWK ISKV+AERGY VSPQQ
Sbjct: 121 KWTDKMVRLLITAVSYIGEDSGSDCGGGGRRKFAVLQKKGKWKSISKVLAERGYHVSPQQ 180

Query: 181 CEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYE 240
           CEDKFNDLNKRYK+LND++GRGTSC+VVENPALLD++E+L++KEKDDVRKIL+SK LFYE
Sbjct: 181 CEDKFNDLNKRYKKLNDMLGRGTSCQVVENPALLDMIEFLSEKEKDDVRKILSSKHLFYE 240

Query: 241 EMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHD-ETDERDD 300
           EMCSYHNSNRLHLPHDPALQRSLQ A R+RDDH+N++ RR+  DDFDE++ D + D+ D+
Sbjct: 241 EMCSYHNSNRLHLPHDPALQRSLQQALRSRDDHENNDVRRNNPDDFDEDDQDVDNDDHDE 300

Query: 301 FEENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSL-------SSHAHAQAQFAQ 360
            EEN+APHGD R   GVL GS KR+++ Q H+D  + G SL       SSH+H+      
Sbjct: 301 HEENYAPHGDGRTMCGVLKGSAKRMKQAQGHED-FSFGSSLNVQEFNKSSHSHSHMVHPD 360

Query: 361 ADTAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDREL 420
            +    E      + SQKQ  E R LQLE+QKLQIQ EMLELEKQ+FKWERF KK+DREL
Sbjct: 361 MNQTFSENR----AWSQKQSNESRSLQLEEQKLQIQYEMLELEKQRFKWERFCKKRDREL 420

Query: 421 EKMRMVNERMKLENERIALDLKQKEIGSGF 442
           EKM+M NE+MKLENER+AL+L+QKE+G  F
Sbjct: 421 EKMKMENEKMKLENERMALELRQKEMGVDF 445

BLAST of Cp4.1LG13g04050 vs. TrEMBL
Match: A0A067GYJ6_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013261mg PE=4 SV=1)

HSP 1 Score: 567.0 bits (1460), Expect = 2.0e-158
Identity = 294/447 (65.77%), Postives = 359/447 (80.31%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHP-HTRQGSAANPSIQEG 60
           MEGNLSQGG+IPGG S+GG+DLQG  +VH QAQH HA+H  HH  + RQG + +PSI EG
Sbjct: 1   MEGNLSQGGMIPGGASFGGIDLQGSMRVHHQAQHPHAMHPHHHTINPRQGPSVHPSIHEG 60

Query: 61  FSLSMGVVQNCDHAMSLVDYNKGERCKNSASDEE-PSFTEDGIDGHNETSKGKKGSVWHR 120
           F L++G +Q+ D  +S+ DYNKGER KNS SD++ PS TE+G +GHN+  KGKKGS W R
Sbjct: 61  FPLTIGTMQSSDQTISMTDYNKGERGKNSFSDDDDPSLTEEGGEGHNDAGKGKKGSPWQR 120

Query: 121 VKWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQ 180
           VKW DKMVKLLITAVSY+G+D +SD  G  RRK  ++QKKGKWK ISKVMAERG+ VSPQ
Sbjct: 121 VKWADKMVKLLITAVSYVGEDTSSDCGGGARRKFAVLQKKGKWKAISKVMAERGFHVSPQ 180

Query: 181 QCEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFY 240
           QCEDKFNDLNKRYK+LND++GRGTSC+VVENP+LLDV+++LT+KEKDDVRKIL+SK LFY
Sbjct: 181 QCEDKFNDLNKRYKKLNDMLGRGTSCQVVENPSLLDVIDFLTEKEKDDVRKILSSKHLFY 240

Query: 241 EEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHDETDERDD 300
           EEMCSYHN NRLHLPHD  LQRSLQLA R+RDDHDND+ RRH +D  ++++  ETD+ D+
Sbjct: 241 EEMCSYHNGNRLHLPHDLPLQRSLQLALRSRDDHDNDDVRRHIHDPDEDDQDMETDDHDE 300

Query: 301 FEENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSLS----SHAHAQAQFAQADT 360
           FE+N A HGDSR ++G+ G S KR R+ Q H+D    G S++    + +    Q AQAD 
Sbjct: 301 FEDNHALHGDSRGTYGIAGCSAKRQRQGQGHEDV-CLGNSMNPLDFNKSSYPPQVAQADM 360

Query: 361 AHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKM 420
                E MK S  QKQW+E R LQLE+QKLQIQVEMLELEKQ+FKW+RF+KK+DRELEK+
Sbjct: 361 NQALPESMKSSWLQKQWIESRSLQLEEQKLQIQVEMLELEKQRFKWQRFSKKRDRELEKL 420

Query: 421 RMVNERMKLENERIALDLKQKEIGSGF 442
           RM NERMKLENER+AL+LKQKE+G+ F
Sbjct: 421 RMENERMKLENERMALELKQKEMGADF 446

BLAST of Cp4.1LG13g04050 vs. TAIR10
Match: AT1G21200.1 (AT1G21200.1 sequence-specific DNA binding transcription factors)

HSP 1 Score: 478.8 bits (1231), Expect = 3.6e-135
Identity = 267/453 (58.94%), Postives = 332/453 (73.29%), Query Frame = 1

Query: 1   MEGNLSQGGLI-PGGTSYGGLDLQGPFKVHSQAQHSHALHQQH--HPHTRQGSAANPSIQ 60
           M+GN  QGG++  G +SYGG DLQG  +VH    H  +++QQH  +P++R        + 
Sbjct: 1   MDGNFPQGGVVRSGASSYGGFDLQGSMRVH----HQDSMNQQHRHNPNSRP-------LH 60

Query: 61  EGFSLSMGVVQNCDHA----MSLVDYNKGERCKNSASDE-EPSFTEDGIDG-HNETSKGK 120
           EG   +M   Q CDH     MS+ +  K ER KNS SD+ EPSFTE+G DG HNE ++  
Sbjct: 61  EGLPFTMVTGQTCDHHQNQNMSMSEQQKAEREKNSVSDDDEPSFTEEGGDGVHNEANRST 120

Query: 121 KGSVWHRVKWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAER 180
           KGS W RVKWTDKMVKLLITAVSYIGDD  S  D + RRK  ++QKKGKWK +SKVMAER
Sbjct: 121 KGSPWQRVKWTDKMVKLLITAVSYIGDD--SSIDSSSRRKFAVLQKKGKWKSVSKVMAER 180

Query: 181 GYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKIL 240
           GY VSPQQCEDKFNDLNKRYK+LND++GRGTSC+VVENPALLD + YL DKEKDDVRKI+
Sbjct: 181 GYHVSPQQCEDKFNDLNKRYKKLNDMLGRGTSCQVVENPALLDSIGYLNDKEKDDVRKIM 240

Query: 241 NSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHD 300
           +SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R+RDDHDND+ R+HQ +D D+ +HD
Sbjct: 241 SSKHLFYEEMCSYHNGNRLHLPHDLALQRSLQLALRSRDDHDNDDSRKHQMEDLDDEDHD 300

Query: 301 -ETDERDDFEENFAPHGDSR-RSFGVLGGSVKRLRRDQDHDD----THACGKSLSSHAHA 360
            + DE D++EE    +GD R   +G  GG +K++R    H+D    +H      +  +  
Sbjct: 301 GDGDEHDEYEEQHYAYGDCRVNHYGGGGGPLKKIRPSLSHEDGDHPSHVNSLECNKVSLP 360

Query: 361 QAQFAQADTAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNK 420
           Q  F+QAD      E  +  + QKQWME R LQLE+QKLQIQVE+LELEKQ+F+W+RF+K
Sbjct: 361 QIPFSQADVNQGGAESGRAGSVQKQWMESRTLQLEEQKLQIQVELLELEKQRFRWQRFSK 420

Query: 421 KKDRELEKMRMVNERMKLENERIALDLKQKEIG 439
           K+D+ELE+MRM NERMKLEN+R+ L+LKQ+E+G
Sbjct: 421 KRDQELERMRMENERMKLENDRMGLELKQRELG 440

BLAST of Cp4.1LG13g04050 vs. TAIR10
Match: AT1G76870.1 (AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1))

HSP 1 Score: 335.5 bits (859), Expect = 5.0e-92
Identity = 209/441 (47.39%), Postives = 280/441 (63.49%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEGN SQG      +S   L    P  ++          +QHHP++RQ S  N ++    
Sbjct: 1   MEGNCSQGRFDSQVSSMRDLR---PNAINQN-------QKQHHPNSRQDSGFNNTMD--- 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSVWHRVK 120
                             +N  +R K S S+++        DG N   K K+ S W RVK
Sbjct: 61  ----------------TRHNNVDRGKKSMSEDDELCLLSS-DGQN---KSKENSPWQRVK 120

Query: 121 WTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQC 180
           W DKMVKL+ITA+SYIG+D  SD      +K  ++QKKGKW+ +SKVM ERGY VSPQQC
Sbjct: 121 WMDKMVKLMITALSYIGEDSGSD------KKFAVLQKKGKWRSVSKVMDERGYHVSPQQC 180

Query: 181 EDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYEE 240
           EDKFNDLNKRYK+LN+++GRGTSC+VVENP+LLD ++YL +KEKD+VR+I++SK LFYEE
Sbjct: 181 EDKFNDLNKRYKKLNEMLGRGTSCEVVENPSLLDKIDYLNEKEKDEVRRIMSSKHLFYEE 240

Query: 241 MCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDENEHDETDERDDF 300
           MCSYHN NRLHLPHDPA+QRSL L    +RDDHDNDE  +HQN+D D++        DD+
Sbjct: 241 MCSYHNGNRLHLPHDPAVQRSLHLITLGSRDDHDNDEHGKHQNEDLDDD--------DDY 300

Query: 301 EENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSLSSHAHAQAQFAQADTAH-LE 360
           EE+       R         +KRLR+ Q H+D     K        +   +QAD    + 
Sbjct: 301 EEDHDGALSDR--------PLKRLRQSQSHEDVGHPNKGYDVPCLPR---SQADVNRGIS 360

Query: 361 TEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVN 420
            +  K +  Q+Q +E + L+LE +KLQIQ EM+ELE+Q+FKWE F+K+++++L KMRM N
Sbjct: 361 LDSRKAAGLQRQQIESKSLELEGRKLQIQAEMMELERQQFKWEVFSKRREQKLAKMRMEN 383

Query: 421 ERMKLENERIALDLKQKEIGS 440
           ERMKLENER++L+LK+ E+G+
Sbjct: 421 ERMKLENERMSLELKRIELGA 383

BLAST of Cp4.1LG13g04050 vs. TAIR10
Match: AT3G10040.1 (AT3G10040.1 sequence-specific DNA binding transcription factors)

HSP 1 Score: 203.4 bits (516), Expect = 3.0e-52
Identity = 160/463 (34.56%), Postives = 241/463 (52.05%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDL-QGPFKVHSQAQHSHALHQQHHPHTRQGSA-ANPSIQE 60
           ME N+   G  P   S   L++ Q P    +  Q  H      HP+T  G     P I+ 
Sbjct: 1   MESNVMFSGFSPRMLS---LEMPQNPPNPQNSIQFQHP-----HPYTTSGDQQTQPPIKS 60

Query: 61  GFSLSMGVVQ-------NCDHAMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGK 120
            +  +    Q        CD      D ++G    +  + E+ +    G DG  + S+  
Sbjct: 61  LYPYASKPKQMSPISGGGCD------DEDRGSGSGSGCNPEDSA----GTDGKRKLSQ-- 120

Query: 121 KGSVWHRVKWTDKMVKLLITAVSYIGDD--INSDFD--------GAGRRKCHIIQKKGKW 180
               WHR+KWTD MV+LLI AV YIGD+  +N   D        G G     ++QKKGKW
Sbjct: 121 ----WHRMKWTDTMVRLLIMAVFYIGDEAGLNDPVDAKKKTGGGGGGGGGGGMLQKKGKW 180

Query: 181 KLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTD 240
           K +S+ M E+G+ VSPQQCEDKFNDLNKRYKR+NDI+G+G +C+VVEN  LL+ +++LT 
Sbjct: 181 KSVSRAMVEKGFSVSPQQCEDKFNDLNKRYKRVNDILGKGIACRVVENQGLLESMDHLTP 240

Query: 241 KEKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHD--PALQRSLQLAFRARDD---HDNDE 300
           K KD+V+K+LNSK LF+ EMC+YHNS      HD  P  Q  + +   ++     H  + 
Sbjct: 241 KLKDEVKKLLNSKHLFFREMCAYHNSCGHLGGHDQQPPQQNPISIPIPSQQQNCFHAAEA 300

Query: 301 PRRHQNDDFDENEHD-ETDERDDFE-ENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHA 360
            +  +  +  E E + E+D  +D E E      +  R    +  +VKRLR          
Sbjct: 301 GKMARIAERVEVEEEVESDMAEDSESEMEESEEEETRKKRRISTAVKRLRE--------- 360

Query: 361 CGKSLSSHAHAQAQFAQADTAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELE 420
                             + A +  +  K    +K+W+  ++L++E++K+  + E +E+E
Sbjct: 361 ------------------EAASVVEDVGKSVWEKKEWIRRKMLEIEEKKIGYEWEGVEME 412

Query: 421 KQKFKWERFNKKKDRELEKMRMVNERMKLENERIALDLKQKEI 438
           KQ+ KW R+  KK+RE+EK ++ N+R +LE ER+ L L++ EI
Sbjct: 421 KQRVKWMRYRSKKEREMEKAKLDNQRRRLETERMILMLRRSEI 412

BLAST of Cp4.1LG13g04050 vs. NCBI nr
Match: gi|449446415|ref|XP_004140967.1| (PREDICTED: uncharacterized protein LOC101203313 [Cucumis sativus])

HSP 1 Score: 797.0 bits (2057), Expect = 1.7e-227
Identity = 401/449 (89.31%), Postives = 420/449 (93.54%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEGNLSQGGLIPGG+SYGGLDLQGPFKVH+Q Q SHALHQQHHPHTRQGS+ANPSIQEGF
Sbjct: 1   MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQLSHALHQQHHPHTRQGSSANPSIQEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSVWHRVK 120
           SLSMGVVQNCDH MSLV+YNKGERCKNSASDE+PSF ED IDGHNE SKGKKGS+WHRVK
Sbjct: 61  SLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDSIDGHNENSKGKKGSMWHRVK 120

Query: 121 WTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQC 180
           WTDKMVKLLITAVSYIGDDI SD DG GRRKC IIQKKGKWKLISKV+AERGYQVSPQQC
Sbjct: 121 WTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQC 180

Query: 181 EDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYEE 240
           EDKFNDLNKRYKRLNDIIGRGTSC+VVENPALLDV++YLT+K+KDDVRKILNSKQLFYEE
Sbjct: 181 EDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEE 240

Query: 241 MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHDETDERDDFE 300
           MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E DETDE DD+E
Sbjct: 241 MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPDETDEHDDYE 300

Query: 301 ENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSL-------SSHAHAQAQFAQAD 360
           ENF PH D+RRS GVLGGSVKRL+R QDHDD HACG SL       SSH H+QAQF QAD
Sbjct: 301 ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQAD 360

Query: 361 TAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK 420
           TAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK
Sbjct: 361 TAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK 420

Query: 421 MRMVNERMKLENERIALDLKQKEIGSGFH 443
           MRMVNERMKLENER+ALDLKQK+IGSGFH
Sbjct: 421 MRMVNERMKLENERLALDLKQKQIGSGFH 449

BLAST of Cp4.1LG13g04050 vs. NCBI nr
Match: gi|659081798|ref|XP_008441519.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo])

HSP 1 Score: 797.0 bits (2057), Expect = 1.7e-227
Identity = 400/449 (89.09%), Postives = 421/449 (93.76%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEGNLSQGGLIPGG+SYGGLDLQGPFKVH+Q QHSHALHQQHHPHTRQGS+ANPSIQEGF
Sbjct: 1   MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDEEPSFTEDGIDGHNETSKGKKGSVWHRVK 120
           SLSMGVVQNCDH MSLV+YNKGERCKNSASDE+PSF ED IDGHNE SKGKKGS+WHRVK
Sbjct: 61  SLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDSIDGHNENSKGKKGSMWHRVK 120

Query: 121 WTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQC 180
           WTDKMVKLLITAVSYIGDDI SD DG+GRRKC IIQKKGKWKLISKV+AERGYQVSPQQC
Sbjct: 121 WTDKMVKLLITAVSYIGDDIASDIDGSGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQC 180

Query: 181 EDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYEE 240
           EDKFNDLNKRYKRLNDIIGRGTSC+VVENPALLDV++YLT+K+KDDVRKILNSKQLFYEE
Sbjct: 181 EDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEE 240

Query: 241 MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHDETDERDDFE 300
           MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDE DD+E
Sbjct: 241 MCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE 300

Query: 301 ENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSL-------SSHAHAQAQFAQAD 360
           ENF PH D+RRS GVLGGSVKRL+R QDHDD HACG SL       SSH H+QAQFAQAD
Sbjct: 301 ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFAQAD 360

Query: 361 TAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK 420
           TAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKK DRELEK
Sbjct: 361 TAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKXDRELEK 420

Query: 421 MRMVNERMKLENERIALDLKQKEIGSGFH 443
           MRMVNE+MKLENER+ALDLKQK+IGSGFH
Sbjct: 421 MRMVNEKMKLENERLALDLKQKQIGSGFH 449

BLAST of Cp4.1LG13g04050 vs. NCBI nr
Match: gi|590600527|ref|XP_007019480.1| (Sequence-specific DNA binding transcription factors [Theobroma cacao])

HSP 1 Score: 587.0 bits (1512), Expect = 2.7e-164
Identity = 304/447 (68.01%), Postives = 366/447 (81.88%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEGNLSQGG+I GG S+GGLD+QG  +VH  AQH H +HQ HH + RQG++ +PSI EGF
Sbjct: 1   MEGNLSQGGMISGGGSFGGLDVQGSMRVHHHAQHPHNIHQHHHSNPRQGASIHPSIHEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDE-EPSFTEDGIDGHNETSKGKKGSVWHRV 120
            L+MG +QNCD  +++ DYNKGER K+S SDE EPSFTE+G+DGHN+ +KGKKGS W RV
Sbjct: 61  PLTMGTMQNCDQTIAMTDYNKGERRKSSVSDEDEPSFTEEGVDGHNDGTKGKKGSPWQRV 120

Query: 121 KWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQ 180
           KWTDKMV+LLITAVSYIG+D   D  G  RRK  ++QKKGKWK +SKVMAERGY VSPQQ
Sbjct: 121 KWTDKMVRLLITAVSYIGEDAAGDCGGGMRRKFAVLQKKGKWKSVSKVMAERGYHVSPQQ 180

Query: 181 CEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYE 240
           CEDKFNDLNKRYK+LND++GRGTSC+VVENPALLDV++YLT+KEKDDVRKIL+SK LFYE
Sbjct: 181 CEDKFNDLNKRYKKLNDMLGRGTSCQVVENPALLDVIDYLTEKEKDDVRKILSSKHLFYE 240

Query: 241 EMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHD-ETDERDD 300
           EMCSYHN NRLHLPHDP LQRSLQLA R+RDDH+ND+ RRHQ+DD D+++HD ETD+ D+
Sbjct: 241 EMCSYHNGNRLHLPHDPQLQRSLQLALRSRDDHENDDARRHQHDDLDDDDHDMETDDHDE 300

Query: 301 FEENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHAC-GKSLSSHAHAQAQFA-----QA 360
           FEEN A HGDSR  +GVLGGS KR R+ Q H+D  AC   SL+S    ++ F+     QA
Sbjct: 301 FEENHALHGDSRGMYGVLGGSAKRSRQGQVHED--ACFQNSLNSQDCNKSSFSYSPINQA 360

Query: 361 DTAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELE 420
           D   +  +  + +  QKQW+E R LQLE+QKLQIQVEMLELEKQ+FKW+RF+KK+DRELE
Sbjct: 361 DMNQVLPDNTRAAWLQKQWIESRSLQLEEQKLQIQVEMLELEKQRFKWQRFSKKRDRELE 420

Query: 421 KMRMVNERMKLENERIALDLKQKEIGS 440
           KMRM NERMKLENER+AL+LK+KE  +
Sbjct: 421 KMRMENERMKLENERMALELKRKEFAA 445

BLAST of Cp4.1LG13g04050 vs. NCBI nr
Match: gi|645264216|ref|XP_008237585.1| (PREDICTED: uncharacterized protein LOC103336322 [Prunus mume])

HSP 1 Score: 573.9 bits (1478), Expect = 2.4e-160
Identity = 303/448 (67.63%), Postives = 361/448 (80.58%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEG+LSQGG++PGG SYGGLDL+G  +V  Q QH H +HQ HHPH RQGS A+PSI EGF
Sbjct: 1   MEGHLSQGGMVPGGASYGGLDLEGSMRVQHQTQHPHTIHQ-HHPHPRQGSLAHPSIHEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDE-EPSFTEDGIDGHNETSKGKKGSVWHRV 120
            L MG +  CD  +S++DYNKGER KNSASDE EPS+TEDG DGH E ++GKKGS W RV
Sbjct: 61  PLKMGTMHTCDQTISMMDYNKGERSKNSASDEDEPSYTEDGTDGHAEGNRGKKGSPWQRV 120

Query: 121 KWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQ 180
           KWTD+MVKLLITAVSYIG+D +SD    GRRK   +QKKGKWK +SKVMAERG+ VSPQQ
Sbjct: 121 KWTDQMVKLLITAVSYIGEDASSDCGSGGRRKYSTLQKKGKWKSVSKVMAERGFHVSPQQ 180

Query: 181 CEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYE 240
           CEDKFNDLNKRYK+LND++GRGTSC+VVEN ALLDV++YLT+KEKDDVRKIL+SK LFYE
Sbjct: 181 CEDKFNDLNKRYKKLNDMLGRGTSCQVVENQALLDVIDYLTEKEKDDVRKILSSKHLFYE 240

Query: 241 EMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHD-ETDERDD 300
           EMCSYHN NRLHLPHDPALQ+SLQ A R RD+HDND+PRRH +DD DE++ D ETDE +D
Sbjct: 241 EMCSYHNGNRLHLPHDPALQKSLQRALR-RDEHDNDDPRRHHHDDLDEDDQDMETDEHED 300

Query: 301 FEENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHACGKSL-----SSHAHAQAQFAQAD 360
           FEEN A HGD+R  + VL  SVKRLR+ Q  ++ +  G SL     +  ++      QAD
Sbjct: 301 FEENNASHGDNRGIY-VLEDSVKRLRQGQGREEFN-YGSSLNPQDCNKSSYCHPPIPQAD 360

Query: 361 TAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEK 420
              +  +G K +  QKQW+E R LQLE+QKLQIQVEMLELEKQ FKW+RF+KK+DRELEK
Sbjct: 361 MNQVLPDGTKAAWLQKQWIESRSLQLEEQKLQIQVEMLELEKQHFKWQRFSKKRDRELEK 420

Query: 421 MRMVNERMKLENERIALDLKQKEIGSGF 442
           +RM NERMKLENER+AL+LK+KE+G+GF
Sbjct: 421 LRMENERMKLENERMALELKRKEMGAGF 444

BLAST of Cp4.1LG13g04050 vs. NCBI nr
Match: gi|743899114|ref|XP_011042849.1| (PREDICTED: uncharacterized protein LOC105138468 [Populus euphratica])

HSP 1 Score: 572.8 bits (1475), Expect = 5.3e-160
Identity = 302/451 (66.96%), Postives = 359/451 (79.60%), Query Frame = 1

Query: 1   MEGNLSQGGLIPGGTSYGGLDLQGPFKVHSQAQHSHALHQQHHPHTRQGSAANPSIQEGF 60
           MEGNLSQGG++PGG  +GGLDLQG  +VH QAQH H +H  HH   RQGS+   S++EGF
Sbjct: 1   MEGNLSQGGMVPGGAPFGGLDLQGSMRVHHQAQHPHTMHHHHHHLHRQGSSTLTSVEEGF 60

Query: 61  SLSMGVVQNCDHAMSLVDYNKGERCKNSASDE-EPSFTEDGIDGHNETSKGKKGSVWHRV 120
            L+MG + N D  +S+ DYNKG+R KNS SDE EPS+TE+G DGHN+   GKKG+ W RV
Sbjct: 61  PLTMGFMHNSDQNISMTDYNKGDRGKNSVSDEDEPSYTEEGADGHNDAITGKKGTPWQRV 120

Query: 121 KWTDKMVKLLITAVSYIGDDINSDFDGAGRRKCHIIQKKGKWKLISKVMAERGYQVSPQQ 180
           KWTDKMV+LLITAVSYIG+D  SD  G  RRK  ++QKKGKWK ISKVMAERG+ VSPQQ
Sbjct: 121 KWTDKMVRLLITAVSYIGEDGTSDCGGGMRRKFTVLQKKGKWKSISKVMAERGFHVSPQQ 180

Query: 181 CEDKFNDLNKRYKRLNDIIGRGTSCKVVENPALLDVLEYLTDKEKDDVRKILNSKQLFYE 240
           CEDKFNDLNKRYKRLND++GRGTSC+VVENPALLDV++YLT+KEKDDVRKILNSK LFYE
Sbjct: 181 CEDKFNDLNKRYKRLNDMLGRGTSCQVVENPALLDVIDYLTEKEKDDVRKILNSKHLFYE 240

Query: 241 EMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDENEHD-ETDERDD 300
           EMCSYHN NRLHLPHDPALQRSLQLA R+RDDHDND+ RRHQ+DD DE++ + ETD+ D+
Sbjct: 241 EMCSYHNGNRLHLPHDPALQRSLQLALRSRDDHDNDDARRHQHDDLDEDDQEIETDDHDE 300

Query: 301 FEENFAPHGDSRRSFGVLGGSVKRLRRDQDHDDTHAC-GKSL------SSHAHAQAQFAQ 360
           FE+N A HGD R   GVLGGS KR R+ Q H+D  AC G S       SS  H  A  AQ
Sbjct: 301 FEDNHASHGDCRGIHGVLGGSAKRPRQGQGHED--ACFGNSSREPNKGSSSYHPLA--AQ 360

Query: 361 ADTAHLETEGMKGSTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDREL 420
            D   + +E  +    QKQWME R LQLE++KLQIQ EMLELEKQ+FKW+RF+KK+DREL
Sbjct: 361 VDVNQVSSESARAVWLQKQWMESRTLQLEERKLQIQQEMLELEKQRFKWQRFSKKRDREL 420

Query: 421 EKMRMVNERMKLENERIALDLKQKEIGSGFH 443
           EK+RM NERMKLEN+++AL+LK+KE+G+ F+
Sbjct: 421 EKLRMENERMKLENDQMALELKRKEMGADFN 447

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KBC2_CUCSA1.2e-22789.31Uncharacterized protein OS=Cucumis sativus GN=Csa_6G057100 PE=4 SV=1[more]
A0A061FHC2_THECC1.9e-16468.01Sequence-specific DNA binding transcription factors OS=Theobroma cacao GN=TCM_03... [more]
M5VYL5_PRUPE2.4e-15967.19Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa1027145mg PE=4 SV=1[more]
A0A059BYD0_EUCGR1.2e-15866.22Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F04240 PE=4 SV=1[more]
A0A067GYJ6_CITSI2.0e-15865.77Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013261mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G21200.13.6e-13558.94 sequence-specific DNA binding transcription factors[more]
AT1G76870.15.0e-9247.39 BEST Arabidopsis thaliana protein match is: sequence-specific DNA bi... [more]
AT3G10040.13.0e-5234.56 sequence-specific DNA binding transcription factors[more]
Match NameE-valueIdentityDescription
gi|449446415|ref|XP_004140967.1|1.7e-22789.31PREDICTED: uncharacterized protein LOC101203313 [Cucumis sativus][more]
gi|659081798|ref|XP_008441519.1|1.7e-22789.09PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis me... [more]
gi|590600527|ref|XP_007019480.1|2.7e-16468.01Sequence-specific DNA binding transcription factors [Theobroma cacao][more]
gi|645264216|ref|XP_008237585.1|2.4e-16067.63PREDICTED: uncharacterized protein LOC103336322 [Prunus mume][more]
gi|743899114|ref|XP_011042849.1|5.3e-16066.96PREDICTED: uncharacterized protein LOC105138468 [Populus euphratica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG13g04050.1Cp4.1LG13g04050.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 374..396
score: -coord: 404..426
scor
NoneNo IPR availablePANTHERPTHR21654FAMILY NOT NAMEDcoord: 363..434
score: 1.3E-199coord: 2..309
score: 1.3E
NoneNo IPR availablePANTHERPTHR21654:SF11F16F4.11 PROTEINcoord: 363..434
score: 1.3E-199coord: 2..309
score: 1.3E
NoneNo IPR availablePFAMPF13837Myb_DNA-bind_4coord: 118..243
score: 3.8