Cp4.1LG20g04740 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g04740
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: protein DOG1-like 1
Location: Cp4.1LG20: 2700081 .. 2700905 (-)
RNA-Seq Expression: Cp4.1LG20g04740
Synteny: Cp4.1LG20g04740
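
The Location field can be cross-checked against the protein length reported in the alignments further down (274 residues). The sketch below assumes the coordinates are 1-based and inclusive, as in GFF-style annotation; that convention is an assumption here, not something this record states. The helper name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Values copied from the Location field: Cp4.1LG20: 2700081 .. 2700905 (-)
start, end, strand = 2700081, 2700905, "-"

length = feature_length(start, end)
codons, remainder = divmod(length, 3)

print(f"feature length: {length} bp on strand {strand}")
print(f"codons: {codons}, leftover bases: {remainder}")
# If the whole span is coding, one codon is the stop codon,
# so the encoded protein would have codons - 1 residues.
print(f"residues if fully coding: {codons - 1}")
```

Under that assumption the span is a whole number of codons, which is consistent with a gene model whose coding sequence covers the entire feature.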
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAATGGATACCAGTAGTGGTACCACACACCAAGAACGTTCCCGGTGCTGTTTCCAGGAATGGATGCAACTTCAGCGCCAAGACCTGACCCTCCTCCAATCGCTTCACAAAAAACCAAATCACAACGACCACGAAGAAATCAAAACCCTCATCTGCAGTTGCATCGCTCATTTCGAGGATTACATCTCCACTCGCAGCCGCTTGGCTCAGGAAGACCCATCCACCTTCTTCGCGCCCACATGGTGCACGTCCTTGGAGAATTCCCTGTTGTGGATCGCAGGGTGTCGCCCCACCATTTTCATACGCCTGGTGTACGCGCTGAGTGGGTGCGAGGTGGAGGCGCGCCTACGGAAGTGGATGGAAGGCATGAGGAATAATAATGATGATCACAATAATAATTTTAATAGCATTAGCAGTGGTAGTGATAGTGTGGGGGAGCTGTCCCCGACCCAGATGAAGCGGGTGGATGAGCTGCACATGAGGACCCTAAGAGCAGAGGAGAAACTGACGAGCGAGATGGCGAGCTTACAGGAAGAGTTAGCGGACCAGCCGATTGCGGTGATCGCGAATGAGATGGAGGGGACTGGGGAGATGAACGAGGAGGCGGAGATGGCTCTGAAGGAGCATGAGACGGCAATGAGAGGGATTATGGAGAAGGCAGATGAGTTGAGGCTCAACACTCTTAAGGAGTTAGCTTTGGAGATTCTGAAGCCACCACAGGCCCTCCAGTTTCTGGCTGCCAGCAAGAAGCTTCATCTCTGCCTACATCAATGGGGCAAATGGAGAGATGAGAAGCATGGAAGAAGATCATATTGTGACTAA

mRNA sequence

ATGGCAATGGATACCAGTAGTGGTACCACACACCAAGAACGTTCCCGGTGCTGTTTCCAGGAATGGATGCAACTTCAGCGCCAAGACCTGACCCTCCTCCAATCGCTTCACAAAAAACCAAATCACAACGACCACGAAGAAATCAAAACCCTCATCTGCAGTTGCATCGCTCATTTCGAGGATTACATCTCCACTCGCAGCCGCTTGGCTCAGGAAGACCCATCCACCTTCTTCGCGCCCACATGGTGCACGTCCTTGGAGAATTCCCTGTTGTGGATCGCAGGGTGTCGCCCCACCATTTTCATACGCCTGGTGTACGCGCTGAGTGGGTGCGAGGTGGAGGCGCGCCTACGGAAGTGGATGGAAGGCATGAGGAATAATAATGATGATCACAATAATAATTTTAATAGCATTAGCAGTGGTAGTGATAGTGTGGGGGAGCTGTCCCCGACCCAGATGAAGCGGGTGGATGAGCTGCACATGAGGACCCTAAGAGCAGAGGAGAAACTGACGAGCGAGATGGCGAGCTTACAGGAAGAGTTAGCGGACCAGCCGATTGCGGTGATCGCGAATGAGATGGAGGGGACTGGGGAGATGAACGAGGAGGCGGAGATGGCTCTGAAGGAGCATGAGACGGCAATGAGAGGGATTATGGAGAAGGCAGATGAGTTGAGGCTCAACACTCTTAAGGAGTTAGCTTTGGAGATTCTGAAGCCACCACAGGCCCTCCAGTTTCTGGCTGCCAGCAAGAAGCTTCATCTCTGCCTACATCAATGGGGCAAATGGAGAGATGAGAAGCATGGAAGAAGATCATATTGTGACTAA

Coding sequence (CDS)

ATGGCAATGGATACCAGTAGTGGTACCACACACCAAGAACGTTCCCGGTGCTGTTTCCAGGAATGGATGCAACTTCAGCGCCAAGACCTGACCCTCCTCCAATCGCTTCACAAAAAACCAAATCACAACGACCACGAAGAAATCAAAACCCTCATCTGCAGTTGCATCGCTCATTTCGAGGATTACATCTCCACTCGCAGCCGCTTGGCTCAGGAAGACCCATCCACCTTCTTCGCGCCCACATGGTGCACGTCCTTGGAGAATTCCCTGTTGTGGATCGCAGGGTGTCGCCCCACCATTTTCATACGCCTGGTGTACGCGCTGAGTGGGTGCGAGGTGGAGGCGCGCCTACGGAAGTGGATGGAAGGCATGAGGAATAATAATGATGATCACAATAATAATTTTAATAGCATTAGCAGTGGTAGTGATAGTGTGGGGGAGCTGTCCCCGACCCAGATGAAGCGGGTGGATGAGCTGCACATGAGGACCCTAAGAGCAGAGGAGAAACTGACGAGCGAGATGGCGAGCTTACAGGAAGAGTTAGCGGACCAGCCGATTGCGGTGATCGCGAATGAGATGGAGGGGACTGGGGAGATGAACGAGGAGGCGGAGATGGCTCTGAAGGAGCATGAGACGGCAATGAGAGGGATTATGGAGAAGGCAGATGAGTTGAGGCTCAACACTCTTAAGGAGTTAGCTTTGGAGATTCTGAAGCCACCACAGGCCCTCCAGTTTCTGGCTGCCAGCAAGAAGCTTCATCTCTGCCTACATCAATGGGGCAAATGGAGAGATGAGAAGCATGGAAGAAGATCATATTGTGACTAA

Protein sequence

MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFEDYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWMEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD
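
The relationship between the coding sequence and the protein sequence can be spot-checked by translating the two ends of the CDS. This is a minimal sketch that assumes the standard genetic code; the function `translate` and the two short fragments (the first 30 and last 15 bases of the CDS above) are chosen for illustration, not taken from any pipeline used by this database.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCAATGGATACCAGTAGTGGTACCACA"  # first 30 bases of the CDS
cds_end = "TCATATTGTGACTAA"                   # last 15 bases of the CDS

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS string to `translate` should reproduce the full protein sequence if the standard code applies.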
Homology
BLAST of Cp4.1LG20g04740 vs. ExPASy Swiss-Prot
Match: Q9SN47 (Protein DOG1-like 1 OS=Arabidopsis thaliana OX=3702 GN=DOGL1 PE=2 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 4.0e-44
Identity = 106/256 (41.41%), Positives = 153/256 (59.77%), Query Frame = 0

Query: 18  CFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFEDYISTRSRLAQEDPSTF 77
           C+ EWM LQ + +T L+         D  ++  LI + I  F DY   RS  ++   S +
Sbjct: 15  CYNEWMSLQAKRITELKEA-ISTGEKDDNKLLDLIRTAIRDFGDYARKRSEHSRRFSSNY 74

Query: 78  FAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWMEGMRNNNDDHNNNFNS 137
           FAPTW T LEN+LLW+ GCRP+ FIRLVYA+ G + E RL  +     N N D ++N + 
Sbjct: 75  FAPTWNTCLENALLWMGGCRPSSFIRLVYAMCGSQTEHRLTNF---FNNTNHDIDSNLSM 134

Query: 138 I-------SSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELADQPIAVIA 197
                     G +S+ +L+  Q+ +++ELH++T+ AE KLT   ASLQE+ AD PIAV A
Sbjct: 135 ALGETRGGIGGGESMSDLTAEQLFKINELHLKTVEAENKLTKVSASLQEDTADTPIAVAA 194

Query: 198 NEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQALQFLAASK 257
              E  G+ +   E AL +HE  M G++ +AD+LR+ TL ++ ++IL   QA  FL A K
Sbjct: 195 FYKEVIGQADVVVERALDKHEEDMGGLLVEADKLRMTTLTKI-VDILTAVQAADFLLAGK 254

Query: 258 KLHLCLHQWGKWRDEK 267
           KLHL +H+WGK R+ +
Sbjct: 255 KLHLAMHEWGKSREHR 265
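
The percentages on each HSP statistics line are the ratio of matching (or positively scoring) columns to the alignment length. The following sketch parses such a line and recomputes both values; the regular expression and the function name `recompute` are illustrative assumptions, and the pattern accepts either spelling of "Positives" that may appear in this kind of output.

```python
import re

# Matches lines such as:
# "Identity = 106/256 (41.41%), Postives = 153/256 (59.77%), Query Frame = 0"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute(line: str) -> tuple:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


line = "Identity = 106/256 (41.41%), Postives = 153/256 (59.77%), Query Frame = 0"
print(recompute(line))
```

The denominator is the alignment length including gap columns, so it can differ from the length of either sequence.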

BLAST of Cp4.1LG20g04740 vs. ExPASy Swiss-Prot
Match: A0SVK0 (Protein DELAY OF GERMINATION 1 OS=Arabidopsis thaliana OX=3702 GN=DOG1 PE=1 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 7.5e-43
Identity = 103/264 (39.02%), Positives = 162/264 (61.36%), Query Frame = 0

Query: 7   SGTTHQERSRCCFQEWMQLQRQDL-TLLQSLHKKPNHNDHE---EIKTLICSCIAHFEDY 66
           S + + E+++  + EWM LQ Q +  L Q L ++ +H D +   +++ L    I  F++Y
Sbjct: 3   SSSKNIEQAQDSYLEWMSLQSQRIPELKQLLAQRRSHGDEDNDNKLRKLTGKIIGDFKNY 62

Query: 67  ISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWME 126
            + R+ LA    S ++APTW + LEN+L+W+ GCRP+ F RLVYAL G + E R+ +++ 
Sbjct: 63  AAKRADLAHRCSSNYYAPTWNSPLENALIWMGGCRPSSFFRLVYALCGSQTEIRVTQFL- 122

Query: 127 GMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELA 186
               N D +    +S   G  S+ +LS  Q+ +++ LH++ +  EEK+T +++SLQE+ A
Sbjct: 123 ---RNIDGYE---SSGGGGGASLSDLSAEQLAKINVLHVKIIDEEEKMTKKVSSLQEDAA 182

Query: 187 DQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQA 246
           D PIA +A EME  GE N   + AL + E AM  ++ +AD LR++TL ++ L IL P Q 
Sbjct: 183 DIPIATVAYEMENVGEPNVVVDQALDKQEEAMARLLVEADNLRVDTLAKI-LGILSPVQG 242

Query: 247 LQFLAASKKLHLCLHQWGKWRDEK 267
             FL A KKLHL +H+WG  RD +
Sbjct: 243 ADFLLAGKKLHLSMHEWGTMRDRR 258

BLAST of Cp4.1LG20g04740 vs. ExPASy Swiss-Prot
Match: Q58FV0 (Protein DOG1-like 3 OS=Arabidopsis thaliana OX=3702 GN=DOGL3 PE=2 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 9.5e-38
Identity = 86/264 (32.58%), Positives = 160/264 (60.61%), Query Frame = 0

Query: 5   TSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFEDYIS 64
           +SS    ++  + C+ EWM +Q + +  L+         +  +++ L+   +  F+ Y  
Sbjct: 4   SSSSYGIEQLQKGCYYEWMSVQAKHIVDLKEALMSHRSKEDHKLEELVGKIVNDFQKYTE 63

Query: 65  TRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWMEGM 124
            RS L++   S++FAP+W + LEN LLW+ GCRP+ FIR++Y+L G + E +L +++  +
Sbjct: 64  KRSELSRRSCSSYFAPSWNSPLENGLLWMGGCRPSSFIRVIYSLCGSQAETQLSQYLLKI 123

Query: 125 RNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELADQ 184
             N + ++           S+ +L+ +Q+ ++++LH++ +  E+K+T + A+LQE +AD 
Sbjct: 124 DENVEVNHGG---------SMSDLNASQLAKINDLHIKVIEKEDKITKKSANLQENVADM 183

Query: 185 PIAVIANEMEGTGEMNEE--AEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQA 244
           PIA+ A     T  MN +   E AL ++E  M  +M +AD+LR  TL+++ ++++ P QA
Sbjct: 184 PIAIAA---YATDLMNGDVVVEDALDKYEEGMAVLMVEADKLRFETLRKI-VDVVTPVQA 243

Query: 245 LQFLAASKKLHLCLHQWGKWRDEK 267
            +FL A K+LH+ LH+WG+ R+E+
Sbjct: 244 AEFLLAGKRLHISLHEWGRVREEQ 254

BLAST of Cp4.1LG20g04740 vs. ExPASy Swiss-Prot
Match: Q9SN45 (Protein DOG1-like 2 OS=Arabidopsis thaliana OX=3702 GN=DOGL2 PE=4 SV=2)

HSP 1 Score: 145.2 bits (365), Expect = 1.1e-33
Identity = 85/266 (31.95%), Positives = 156/266 (58.65%), Query Frame = 0

Query: 3   MDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFEDY 62
           M+ SS    ++  + C+ EWM LQ + +  L+       +ND ++++ L+   +  +  Y
Sbjct: 1   MERSSSYGVEKLQKRCYHEWMSLQTKHIDDLKEALMCQRNND-DKLEDLVGKIVNDYHTY 60

Query: 63  ISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWME 122
              RS L+    + +FAP+W T +ENS+LW+ GCRP+ FIRL+YAL G + E +L +++ 
Sbjct: 61  AGKRSELSYRCCAHYFAPSWNTPIENSMLWMGGCRPSSFIRLIYALCGSQAETQLSQYLL 120

Query: 123 GMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELA 182
            + ++ D ++  F S         +L+ TQ+ ++++LH+  ++ E+K+T   A+ Q+++A
Sbjct: 121 KIDDDFDINHGGFMS---------DLTATQLGKLNDLHLEVIKKEDKITKTSANFQDDVA 180

Query: 183 DQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQA 242
           D PIA + +        +   E AL +HE  M  ++ +AD+LR  TL+++ ++++ P QA
Sbjct: 181 DLPIADVVH-------ADVAVEDALDKHEEGMAVLLAEADKLRFETLRKI-VDVVTPLQA 240

Query: 243 LQFLAASKKLHLCLHQWGKWRDEKHG 269
           ++FL A K+L L LH  G+ R +  G
Sbjct: 241 VEFLLAGKRLQLSLHDRGRVRADVCG 248

BLAST of Cp4.1LG20g04740 vs. ExPASy Swiss-Prot
Match: Q84JC2 (Protein DOG1-like 4 OS=Arabidopsis thaliana OX=3702 GN=DOGL4 PE=1 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 4.7e-13
Identity = 63/224 (28.12%), Positives = 101/224 (45.09%), Query Frame = 0

Query: 41  NHNDHEEIKTLICSCIAHFEDYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTI 100
           N     E++ LI     H + Y + +    +ED   FF   W   LEN+  W+ G +P++
Sbjct: 37  NTMSETELRHLISKLTTHHKAYYTAKWAAIREDVLAFFGSVWLNPLENACSWLTGWKPSM 96

Query: 101 FIRLVYALSGCEVEARLRKWMEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELH 160
             R+V          RLRK                        S   L   Q+K+++EL 
Sbjct: 97  VFRMV---------DRLRK------------------------SRVVLVEAQVKKLEELR 156

Query: 161 MRTLRAEEKLTSEMASLQEELADQPIAVIAN-EMEGTGEMNEEAEMALKEHETAMRGIME 220
           ++T   E+K+  EM   Q  +AD+ +  +A       GE     E A++     +  +++
Sbjct: 157 VKTKFDEQKIEREMERYQVAMADRKMVELARLGCHVGGESVMVVEAAVRGLSMGLEKMVK 216

Query: 221 KADELRLNTLKELALEILKPPQALQFLAASKKLHLCLHQWGKWR 264
            AD +RL TLK + L+IL PPQ ++FLAA+    + L +WG  R
Sbjct: 217 AADCVRLKTLKGI-LDILTPPQCVEFLAAAATFQVQLRRWGNRR 226

BLAST of Cp4.1LG20g04740 vs. NCBI nr
Match: XP_023520222.1 (protein DOG1-like 1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 551 bits (1421), Expect = 4.66e-198
Identity = 274/274 (100.00%), Positives = 274/274 (100.00%), Query Frame = 0

Query: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE 60
           MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE
Sbjct: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE 60

Query: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120
           DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW
Sbjct: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120

Query: 121 MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180
           MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE
Sbjct: 121 MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180

Query: 181 LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240
           LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP
Sbjct: 181 LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240

Query: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274
           QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD
Sbjct: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274

BLAST of Cp4.1LG20g04740 vs. NCBI nr
Match: XP_022927121.1 (protein DOG1-like 1 [Cucurbita moschata] >KAG7019574.1 Protein DOG1-like 1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 537 bits (1384), Expect = 2.04e-192
Identity = 267/274 (97.45%), Positives = 271/274 (98.91%), Query Frame = 0

Query: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE 60
           MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKP HNDHEEIKTLICSCIAHFE
Sbjct: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPTHNDHEEIKTLICSCIAHFE 60

Query: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120
           DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIF+RLVYALSGCEVEARLRKW
Sbjct: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFVRLVYALSGCEVEARLRKW 120

Query: 121 MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180
           MEGMRN+NDD++NNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE
Sbjct: 121 MEGMRNSNDDNSNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180

Query: 181 LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240
           LADQPIAVIANEMEG G MNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP
Sbjct: 181 LADQPIAVIANEMEGIGVMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240

Query: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274
           QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD
Sbjct: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274

BLAST of Cp4.1LG20g04740 vs. NCBI nr
Match: KAG6583954.1 (Protein DOG1-like 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 536 bits (1380), Expect = 8.30e-192
Identity = 267/274 (97.45%), Positives = 270/274 (98.54%), Query Frame = 0

Query: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE 60
           MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQS HKKP HNDHEEIKTLICSCIAHFE
Sbjct: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSFHKKPTHNDHEEIKTLICSCIAHFE 60

Query: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120
           DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW
Sbjct: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120

Query: 121 MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180
           MEGMRN+NDD++NNFNSISSGSDSVGELS TQMKRVDELHMRTLRAEEKLTSEMASLQEE
Sbjct: 121 MEGMRNSNDDNSNNFNSISSGSDSVGELSSTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180

Query: 181 LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240
           LADQPIAVIANEMEG GEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP
Sbjct: 181 LADQPIAVIANEMEGIGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240

Query: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274
           QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD
Sbjct: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274

BLAST of Cp4.1LG20g04740 vs. NCBI nr
Match: XP_023001640.1 (protein DOG1-like 1 [Cucurbita maxima])

HSP 1 Score: 520 bits (1340), Expect = 1.04e-185
Identity = 262/274 (95.62%), Positives = 265/274 (96.72%), Query Frame = 0

Query: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE 60
           MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHND EEIKTLI SCIAHFE
Sbjct: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDQEEIKTLIRSCIAHFE 60

Query: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120
           DYISTRS LAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW
Sbjct: 61  DYISTRSCLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120

Query: 121 MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180
           +EG RNNNDD+NNNFNSISSGSDSVGELSPTQM+RVDELHMRTLRAEEKLTSEMASLQEE
Sbjct: 121 LEGTRNNNDDNNNNFNSISSGSDSVGELSPTQMRRVDELHMRTLRAEEKLTSEMASLQEE 180

Query: 181 LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240
           LADQPIAVIANEMEG  EMNEEAEMALKEHETAMRGIMEKADELRLNTLKEL LEILKPP
Sbjct: 181 LADQPIAVIANEMEGIEEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELVLEILKPP 240

Query: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274
           QAL FLAASKKLHLCLHQWGKWRDEKHGRRSY D
Sbjct: 241 QALHFLAASKKLHLCLHQWGKWRDEKHGRRSYYD 274

BLAST of Cp4.1LG20g04740 vs. NCBI nr
Match: XP_038895667.1 (protein DELAY OF GERMINATION 1 [Benincasa hispida])

HSP 1 Score: 348 bits (894), Expect = 9.08e-118
Identity = 189/272 (69.49%), Positives = 214/272 (78.68%), Query Frame = 0

Query: 4   DTSSGTTHQERSRCCFQEWMQLQRQDLT-LLQSLHKKPNHNDHEEIKTLICSCIAHFEDY 63
           +    T+H+ERSRCCFQEWMQLQR+DLT LL SLH    HN  E  K LI +CI HFEDY
Sbjct: 15  EDDGSTSHEERSRCCFQEWMQLQREDLTHLLLSLH---THNHEETNKNLIRNCITHFEDY 74

Query: 64  ISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWME 123
           I++R  LAQED S FFAPTWCTSLENSLLWIAGCRP+IFIRLVYAL G E +ARL +W+E
Sbjct: 75  ITSRRHLAQEDVSPFFAPTWCTSLENSLLWIAGCRPSIFIRLVYALCGSEADARLGEWLE 134

Query: 124 GMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELA 183
           G+RNNN  H             +GELSPTQM RV+ LHMRT++AEEKLTSEMAS QE++A
Sbjct: 135 GVRNNNSCH----------VGGIGELSPTQMVRVNGLHMRTIKAEEKLTSEMASSQEDVA 194

Query: 184 DQPIAVI-ANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQ 243
           DQPIAVI A EMEG GE +EE +MAL+++E  MR +MEKADELRLNTLKEL LEILKP Q
Sbjct: 195 DQPIAVIVAKEMEGVGEESEETKMALEQYEKVMREVMEKADELRLNTLKELVLEILKPIQ 254

Query: 244 ALQFLAASKKLHLCLHQWGKWRDEKHGR-RSY 272
           AL+FL ASKKLHL LHQWGK RDEK GR RSY
Sbjct: 255 ALEFLVASKKLHLSLHQWGKRRDEKQGRLRSY 273

BLAST of Cp4.1LG20g04740 vs. ExPASy TrEMBL
Match: A0A6J1EGT4 (protein DOG1-like 1 OS=Cucurbita moschata OX=3662 GN=LOC111434061 PE=4 SV=1)

HSP 1 Score: 537 bits (1384), Expect = 9.86e-193
Identity = 267/274 (97.45%), Positives = 271/274 (98.91%), Query Frame = 0

Query: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE 60
           MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKP HNDHEEIKTLICSCIAHFE
Sbjct: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPTHNDHEEIKTLICSCIAHFE 60

Query: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120
           DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIF+RLVYALSGCEVEARLRKW
Sbjct: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFVRLVYALSGCEVEARLRKW 120

Query: 121 MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180
           MEGMRN+NDD++NNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE
Sbjct: 121 MEGMRNSNDDNSNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180

Query: 181 LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240
           LADQPIAVIANEMEG G MNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP
Sbjct: 181 LADQPIAVIANEMEGIGVMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240

Query: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274
           QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD
Sbjct: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274

BLAST of Cp4.1LG20g04740 vs. ExPASy TrEMBL
Match: A0A6J1KLR3 (protein DOG1-like 1 OS=Cucurbita maxima OX=3661 GN=LOC111495712 PE=4 SV=1)

HSP 1 Score: 520 bits (1340), Expect = 5.03e-186
Identity = 262/274 (95.62%), Positives = 265/274 (96.72%), Query Frame = 0

Query: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE 60
           MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHND EEIKTLI SCIAHFE
Sbjct: 1   MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDQEEIKTLIRSCIAHFE 60

Query: 61  DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120
           DYISTRS LAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW
Sbjct: 61  DYISTRSCLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW 120

Query: 121 MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE 180
           +EG RNNNDD+NNNFNSISSGSDSVGELSPTQM+RVDELHMRTLRAEEKLTSEMASLQEE
Sbjct: 121 LEGTRNNNDDNNNNFNSISSGSDSVGELSPTQMRRVDELHMRTLRAEEKLTSEMASLQEE 180

Query: 181 LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP 240
           LADQPIAVIANEMEG  EMNEEAEMALKEHETAMRGIMEKADELRLNTLKEL LEILKPP
Sbjct: 181 LADQPIAVIANEMEGIEEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELVLEILKPP 240

Query: 241 QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD 274
           QAL FLAASKKLHLCLHQWGKWRDEKHGRRSY D
Sbjct: 241 QALHFLAASKKLHLCLHQWGKWRDEKHGRRSYYD 274

BLAST of Cp4.1LG20g04740 vs. ExPASy TrEMBL
Match: A0A6J1GT06 (protein DOG1-like 1 OS=Cucurbita moschata OX=3662 GN=LOC111456709 PE=4 SV=1)

HSP 1 Score: 322 bits (825), Expect = 7.77e-108
Identity = 174/264 (65.91%), Positives = 208/264 (78.79%), Query Frame = 0

Query: 10  THQERSRCCFQEWMQLQRQDLTLL-QSLHKKPNHNDHEEIKTLICSCIAHFEDYISTRSR 69
           THQ RSRCCFQEWMQLQRQDLTLL +S+H    HND ++ +TLI +C+AHFEDYI+ R R
Sbjct: 10  THQHRSRCCFQEWMQLQRQDLTLLLESIHN--THND-DQSRTLIRNCLAHFEDYITNRRR 69

Query: 70  LAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWMEGMRNNN 129
            A+ED   FFAPTWCTSLENSLLWIAGCRP+IFIRL+YALSGCEV+ARL +W+EG+R  +
Sbjct: 70  FAEEDAFPFFAPTWCTSLENSLLWIAGCRPSIFIRLLYALSGCEVDARLGEWLEGIRGES 129

Query: 130 DDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELADQPIAV 189
            +H+ +          VG LS +Q+ RV+ LHMRT+RAEE+   EMAS QEE AD+PIAV
Sbjct: 130 GNHHLH----------VGNLSASQLVRVNGLHMRTVRAEERQGREMASCQEEPADEPIAV 189

Query: 190 IANEMEGT--GEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQALQFL 249
           I NEMEG    E+ E AE ALKEHE  MR +MEKADELRL T+KE+ +EIL+P QA++FL
Sbjct: 190 IVNEMEGVVGEEVGEAAERALKEHEREMRRMMEKADELRLKTIKEV-MEILEPVQAVEFL 249

Query: 250 AASKKLHLCLHQWGKWRDEKHGRR 270
            ASKKLHL LHQWGK RDE+HGRR
Sbjct: 250 VASKKLHLSLHQWGKRRDERHGRR 259

BLAST of Cp4.1LG20g04740 vs. ExPASy TrEMBL
Match: A0A0A0LXS6 (DOG1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G470430 PE=4 SV=1)

HSP 1 Score: 313 bits (802), Expect = 7.41e-104
Identity = 170/267 (63.67%), Positives = 204/267 (76.40%), Query Frame = 0

Query: 10  THQERSRCCFQEWMQLQRQDLT-LLQSLHKKPNHNDHEEIKTLICSCIAHFEDYISTRSR 69
           TH+ERSRCCF+EWMQLQR+DLT LL+SLH+  N++      T+I +CI+HFE YIS R+ 
Sbjct: 30  THEERSRCCFEEWMQLQREDLTHLLKSLHQPTNNDTTTTTTTVIRNCISHFEHYISNRTL 89

Query: 70  LAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWMEGMRNNN 129
           LAQE PS  FAPTWCTSLENSLLW+AGCRP+IFIRL+YAL+ C  E  +         N+
Sbjct: 90  LAQEHPSPLFAPTWCTSLENSLLWMAGCRPSIFIRLIYALTSCSSEPLIT--------ND 149

Query: 130 DDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELADQPIAV 189
           DD+ N  N+++S    +GELSP+QM RV+ LHMRT++AEEKLTSE+AS QEELAD+PIA+
Sbjct: 150 DDNKNGNNTVTS----IGELSPSQMTRVNGLHMRTIKAEEKLTSELASWQEELADEPIAL 209

Query: 190 IA------NEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQA 249
           IA      +E+     MNEEAEMALKEHE  M  ++ KADELRLNT+KEL LEILKP QA
Sbjct: 210 IAAKGDCGDEVVLNNMMNEEAEMALKEHEKVMGKVIGKADELRLNTMKELVLEILKPTQA 269

Query: 250 LQFLAASKKLHLCLHQWGKWRDEKHGR 269
           LQFL ASKKLHL LHQWGK RDEK  R
Sbjct: 270 LQFLVASKKLHLSLHQWGKRRDEKQRR 284

BLAST of Cp4.1LG20g04740 vs. ExPASy TrEMBL
Match: A0A5A7UU20 (Transcription factor HBP-1b(C38)-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold96G002330 PE=4 SV=1)

HSP 1 Score: 290 bits (741), Expect = 1.24e-94
Identity = 166/277 (59.93%), Positives = 197/277 (71.12%), Query Frame = 0

Query: 4   DTSSGTTHQERSRCCFQEWMQLQRQDLT-LLQSLHKKPNHNDHEE---IKTLICSCIAHF 63
           ++   TTH+ER R CF+EWMQLQR+DLT LL+SLH+  N+N  E      T+I +CI+HF
Sbjct: 24  ESGLSTTHEERCRRCFEEWMQLQREDLTHLLESLHQPTNNNSQETNNTTSTVIRNCISHF 83

Query: 64  EDYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRK 123
           + YIS R+ LAQE PS  FAPTWCTSLE SLLW+AGCRP+IFIRL YAL+ C  E     
Sbjct: 84  DHYISKRNLLAQEYPSPLFAPTWCTSLEKSLLWMAGCRPSIFIRLTYALTSCSTEPLT-- 143

Query: 124 WMEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQE 183
                   ND  N N +S       +GELSP+QM RV+ LHMRT++AE+KLT E+AS QE
Sbjct: 144 --------NDGDNKNSDSFIG----IGELSPSQMTRVNGLHMRTIKAEQKLTDELASWQE 203

Query: 184 ELADQPIAVIANEMEGTGEM---NEEAEMALKEHETAMRGIMEKADELRLNTLKELALEI 243
           ELAD PIAVI  + +   E+   NEEAEMALKEHE  M  ++ KAD+LRLNT+KEL LEI
Sbjct: 204 ELADDPIAVIVAKGDCGDEVVMNNEEAEMALKEHEKVMGEVIGKADKLRLNTMKELVLEI 263

Query: 244 LKPPQALQFLAASKKLHLCLHQWGKWRDEKHGR-RSY 272
           LKP QALQFL ASKKL L LHQWGK RDEK  R RSY
Sbjct: 264 LKPTQALQFLVASKKLQLSLHQWGKRRDEKQRRIRSY 286

BLAST of Cp4.1LG20g04740 vs. TAIR 10
Match: AT5G45830.1 (delay of germination 1 )

HSP 1 Score: 175.6 bits (444), Expect = 5.3e-44
Identity = 103/264 (39.02%), Positives = 162/264 (61.36%), Query Frame = 0

Query: 7   SGTTHQERSRCCFQEWMQLQRQDL-TLLQSLHKKPNHNDHE---EIKTLICSCIAHFEDY 66
           S + + E+++  + EWM LQ Q +  L Q L ++ +H D +   +++ L    I  F++Y
Sbjct: 3   SSSKNIEQAQDSYLEWMSLQSQRIPELKQLLAQRRSHGDEDNDNKLRKLTGKIIGDFKNY 62

Query: 67  ISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWME 126
            + R+ LA    S ++APTW + LEN+L+W+ GCRP+ F RLVYAL G + E R+ +++ 
Sbjct: 63  AAKRADLAHRCSSNYYAPTWNSPLENALIWMGGCRPSSFFRLVYALCGSQTEIRVTQFL- 122

Query: 127 GMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELA 186
               N D +    +S   G  S+ +LS  Q+ +++ LH++ +  EEK+T +++SLQE+ A
Sbjct: 123 ---RNIDGYE---SSGGGGGASLSDLSAEQLAKINVLHVKIIDEEEKMTKKVSSLQEDAA 182

Query: 187 DQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQA 246
           D PIA +A EME  GE N   + AL + E AM  ++ +AD LR++TL ++ L IL P Q 
Sbjct: 183 DIPIATVAYEMENVGEPNVVVDQALDKQEEAMARLLVEADNLRVDTLAKI-LGILSPVQG 242

Query: 247 LQFLAASKKLHLCLHQWGKWRDEK 267
             FL A KKLHL +H+WG  RD +
Sbjct: 243 ADFLLAGKKLHLSMHEWGTMRDRR 258

BLAST of Cp4.1LG20g04740 vs. TAIR 10
Match: AT4G18690.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G18680.1); Has 522 Blast hits to 522 proteins in 39 species: Archae - 0; Bacteria - 0; Metazoa - 9; Fungi - 0; Plants - 513; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 158.7 bits (400), Expect = 6.7e-39
Identity = 86/264 (32.58%), Positives = 160/264 (60.61%), Query Frame = 0

Query: 5   TSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFEDYIS 64
           +SS    ++  + C+ EWM +Q + +  L+         +  +++ L+   +  F+ Y  
Sbjct: 4   SSSSYGIEQLQKGCYYEWMSVQAKHIVDLKEALMSHRSKEDHKLEELVGKIVNDFQKYTE 63

Query: 65  TRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWMEGM 124
            RS L++   S++FAP+W + LEN LLW+ GCRP+ FIR++Y+L G + E +L +++  +
Sbjct: 64  KRSELSRRSCSSYFAPSWNSPLENGLLWMGGCRPSSFIRVIYSLCGSQAETQLSQYLLKI 123

Query: 125 RNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELADQ 184
             N + ++           S+ +L+ +Q+ ++++LH++ +  E+K+T + A+LQE +AD 
Sbjct: 124 DENVEVNHGG---------SMSDLNASQLAKINDLHIKVIEKEDKITKKSANLQENVADM 183

Query: 185 PIAVIANEMEGTGEMNEE--AEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQA 244
           PIA+ A     T  MN +   E AL ++E  M  +M +AD+LR  TL+++ ++++ P QA
Sbjct: 184 PIAIAA---YATDLMNGDVVVEDALDKYEEGMAVLMVEADKLRFETLRKI-VDVVTPVQA 243

Query: 245 LQFLAASKKLHLCLHQWGKWRDEK 267
            +FL A K+LH+ LH+WG+ R+E+
Sbjct: 244 AEFLLAGKRLHISLHEWGRVREEQ 254

BLAST of Cp4.1LG20g04740 vs. TAIR 10
Match: AT4G18680.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G18690.1); Has 361 Blast hits to 361 proteins in 33 species: Archae - 0; Bacteria - 0; Metazoa - 8; Fungi - 0; Plants - 353; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 132.5 bits (332), Expect = 5.2e-31
Identity = 79/246 (32.11%), Positives = 145/246 (58.94%), Query Frame = 0

Query: 23  MQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFEDYISTRSRLAQEDPSTFFAPTW 82
           M LQ + +  L+       +ND ++++ L+   +  +  Y   RS L+    + +FAP+W
Sbjct: 1   MSLQTKHIDDLKEALMCQRNND-DKLEDLVGKIVNDYHTYAGKRSELSYRCCAHYFAPSW 60

Query: 83  CTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKWMEGMRNNNDDHNNNFNSISSGS 142
            T +ENS+LW+ GCRP+ FIRL+YAL G + E +L +++  + ++ D ++  F S     
Sbjct: 61  NTPIENSMLWMGGCRPSSFIRLIYALCGSQAETQLSQYLLKIDDDFDINHGGFMS----- 120

Query: 143 DSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELADQPIAVIANEMEGTGEMNEE 202
               +L+ TQ+ ++++LH+  ++ E+K+T   A+ Q+++AD PIA + +        +  
Sbjct: 121 ----DLTATQLGKLNDLHLEVIKKEDKITKTSANFQDDVADLPIADVVH-------ADVA 180

Query: 203 AEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQALQFLAASKKLHLCLHQWGKW 262
            E AL +HE  M  ++ +AD+LR  TL+++ ++++ P QA++FL A K+L L LH  G+ 
Sbjct: 181 VEDALDKHEEGMAVLLAEADKLRFETLRKI-VDVVTPLQAVEFLLAGKRLQLSLHDRGRV 228

Query: 263 RDEKHG 269
           R +  G
Sbjct: 241 RADVCG 228

BLAST of Cp4.1LG20g04740 vs. TAIR 10
Match: AT4G18660.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G18690.1); Has 115 Blast hits to 115 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 115; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 102.8 bits (255), Expect = 4.4e-22
Identity = 57/126 (45.24%), Positives = 85/126 (67.46%), Query Frame = 0

Query: 141 GSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEELADQPIAVIANEMEGTGEMN 200
           G +S+ +L+  Q+ +++ELH++T+ AE KLT   ASLQE+ AD PIAV A   E  G+ +
Sbjct: 12  GGESMSDLTAEQLFKINELHLKTVEAENKLTKVSASLQEDTADTPIAVAAFYKEVIGQAD 71

Query: 201 EEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPPQALQFLAASKKLHLCLHQWG 260
              E AL +HE  M G++ +AD+LR+ TL ++ ++IL   QA  FL A KKLHL +H+WG
Sbjct: 72  VVVERALDKHEEDMGGLLVEADKLRMTTLTKI-VDILTAVQAADFLLAGKKLHLAMHEWG 131

Query: 261 KWRDEK 267
           K R+ +
Sbjct: 132 KSREHR 136

BLAST of Cp4.1LG20g04740 vs. TAIR 10
Match: AT4G18650.1 (transcription factor-related )

HSP 1 Score: 76.6 bits (187), Expect = 3.4e-14
Identity = 63/224 (28.12%), Positives = 101/224 (45.09%), Query Frame = 0

Query: 41  NHNDHEEIKTLICSCIAHFEDYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTI 100
           N     E++ LI     H + Y + +    +ED   FF   W   LEN+  W+ G +P++
Sbjct: 37  NTMSETELRHLISKLTTHHKAYYTAKWAAIREDVLAFFGSVWLNPLENACSWLTGWKPSM 96

Query: 101 FIRLVYALSGCEVEARLRKWMEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELH 160
             R+V          RLRK                        S   L   Q+K+++EL 
Sbjct: 97  VFRMV---------DRLRK------------------------SRVVLVEAQVKKLEELR 156

Query: 161 MRTLRAEEKLTSEMASLQEELADQPIAVIAN-EMEGTGEMNEEAEMALKEHETAMRGIME 220
           ++T   E+K+  EM   Q  +AD+ +  +A       GE     E A++     +  +++
Sbjct: 157 VKTKFDEQKIEREMERYQVAMADRKMVELARLGCHVGGESVMVVEAAVRGLSMGLEKMVK 216

Query: 221 KADELRLNTLKELALEILKPPQALQFLAASKKLHLCLHQWGKWR 264
            AD +RL TLK + L+IL PPQ ++FLAA+    + L +WG  R
Sbjct: 217 AADCVRLKTLKGI-LDILTPPQCVEFLAAAATFQVQLRRWGNRR 226

The following BLAST results are available for this feature:

Cp4.1LG20g04740 vs. ExPASy Swiss-Prot
Match Name        E-value      Identity   Description
Q9SN47            4.0e-44      41.41      Protein DOG1-like 1 OS=Arabidopsis thaliana OX=3702 GN=DOGL1 PE=2 SV=1
A0SVK0            7.5e-43      39.02      Protein DELAY OF GERMINATION 1 OS=Arabidopsis thaliana OX=3702 GN=DOG1 PE=1 SV=1
Q58FV0            9.5e-38      32.58      Protein DOG1-like 3 OS=Arabidopsis thaliana OX=3702 GN=DOGL3 PE=2 SV=1
Q9SN45            1.1e-33      31.95      Protein DOG1-like 2 OS=Arabidopsis thaliana OX=3702 GN=DOGL2 PE=4 SV=2
Q84JC2            4.7e-13      28.13      Protein DOG1-like 4 OS=Arabidopsis thaliana OX=3702 GN=DOGL4 PE=1 SV=1

Cp4.1LG20g04740 vs. NCBI nr
Match Name        E-value      Identity   Description
XP_023520222.1    4.66e-198    100.00     protein DOG1-like 1 [Cucurbita pepo subsp. pepo]
XP_022927121.1    2.04e-192    97.45      protein DOG1-like 1 [Cucurbita moschata] >KAG7019574.1 Protein DOG1-like 1, part...
KAG6583954.1      8.30e-192    97.45      Protein DOG1-like 1, partial [Cucurbita argyrosperma subsp. sororia]
XP_023001640.1    1.04e-185    95.62      protein DOG1-like 1 [Cucurbita maxima]
XP_038895667.1    9.08e-118    69.49      protein DELAY OF GERMINATION 1 [Benincasa hispida]

Cp4.1LG20g04740 vs. ExPASy TrEMBL
Match Name        E-value      Identity   Description
A0A6J1EGT4        9.86e-193    97.45      protein DOG1-like 1 OS=Cucurbita moschata OX=3662 GN=LOC111434061 PE=4 SV=1
A0A6J1KLR3        5.03e-186    95.62      protein DOG1-like 1 OS=Cucurbita maxima OX=3661 GN=LOC111495712 PE=4 SV=1
A0A6J1GT06        7.77e-108    65.91      protein DOG1-like 1 OS=Cucurbita moschata OX=3662 GN=LOC111456709 PE=4 SV=1
A0A0A0LXS6        7.41e-104    63.67      DOG1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G470430 PE=4 S...
A0A5A7UU20        1.24e-94     59.93      Transcription factor HBP-1b(C38)-like OS=Cucumis melo var. makuwa OX=1194695 GN=...

Cp4.1LG20g04740 vs. TAIR 10
Match Name        E-value      Identity   Description
AT5G45830.1       5.3e-44      39.02      delay of germination 1
AT4G18690.1       6.7e-39      32.58      unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G18680.1       5.2e-31      32.11      unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G18660.1       4.4e-22      45.24      unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G18650.1       3.4e-14      28.13      transcription factor-related

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term    IPR Description                        Source    Source Term     Source Description    Alignment
None        No IPR available                       COILS     Coil            Coil                  coord: 200..227
None        No IPR available                       PANTHER   PTHR46354:SF7   PROTEIN DOG1-LIKE 1   coord: 10..268
None        No IPR available                       PANTHER   PTHR46354       FAMILY NOT NAMED      coord: 10..268
IPR025422   Transcription factor TGA like domain   PFAM      PF14144         DOG1                  coord: 33..106; e-value: 7.2E-24; score: 83.8
IPR025422   Transcription factor TGA like domain   PROSITE   PS51806         DOG1                  coord: 12..266; score: 31.837282
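
The `coord` values in the Alignment column can be used to pull the annotated regions out of the protein sequence. The sketch below assumes those coordinates are 1-based and inclusive residue positions, which is the usual convention but is not stated in this record; the helper name `subsequence` is illustrative. The protein string is the one listed under "Protein sequence", split into 60-residue chunks for readability.

```python
# Protein sequence of Cp4.1LG20g04740, copied from the record in 60-residue chunks.
PROTEIN = (
    "MAMDTSSGTTHQERSRCCFQEWMQLQRQDLTLLQSLHKKPNHNDHEEIKTLICSCIAHFE"
    "DYISTRSRLAQEDPSTFFAPTWCTSLENSLLWIAGCRPTIFIRLVYALSGCEVEARLRKW"
    "MEGMRNNNDDHNNNFNSISSGSDSVGELSPTQMKRVDELHMRTLRAEEKLTSEMASLQEE"
    "LADQPIAVIANEMEGTGEMNEEAEMALKEHETAMRGIMEKADELRLNTLKELALEILKPP"
    "QALQFLAASKKLHLCLHQWGKWRDEKHGRRSYCD"
)


def subsequence(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive (assumed)."""
    return protein[start - 1:end]


# Coordinates copied from the InterPro table above.
regions = {
    "PFAM PF14144 (DOG1)": (33, 106),
    "COILS Coil": (200, 227),
}

for name, (start, end) in regions.items():
    fragment = subsequence(PROTEIN, start, end)
    print(f"{name}: {start}..{end}, {len(fragment)} residues")
    print(fragment)
```

The same helper applies to the PANTHER and PROSITE rows by adding their coordinates to `regions`.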

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG20g04740.1   Cp4.1LG20g04740.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0006351       transcription, DNA-templated
molecular_function   GO:0043565       sequence-specific DNA binding