Cp4.1LG05g00390 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG05g00390
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: G-patch domain-containing protein
Location: Cp4.1LG05 : 714675 .. 717052 (-)
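
The location field gives the scaffold, a start and end coordinate, and the strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, though this page does not state it), the genomic span of the gene could be computed as follows. `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'scaffold : start .. end (strand)' into a dict (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG05 : 714675 .. 717052 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2378 bp on the minus strand of Cp4.1LG05.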
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGGCGGTATAGTTAAGGAAAACAGAGATCGAGAAAGCAATGGATTGGCTATTGGGAAGCATTGTTAGGATTGTTGGTGGAAGATATGCAGGTTCGGAAGGTAAAATTGTAGAGAAATTGGATCTCAAATGGCTTGTCCTTAAGCTTTCTAACAGAGACGAAGTTAAAGTACATGCAACTGATGTTGCTGAATTGGGTTCCAAAGAAGAGGAAAGGTTCCTGAAGAAATTGGAGGAAGTGAAAGTTCAGGACGAAATCAAAGGCTAGAGACGGAGAAGAGAAGTTGAACGAGTAGAGGAAAAGCGTGAAAATGGTCAGAGGGACGAAGAGAAGAGGACTAGATTGTGTTGGCTTACCAGTCACATTCGTGTTAGGATCATCAGCAAAGATTTCAAGAGAGGAAAGTTTTATCTAAAAAAAGGAGAGATAGTTGACGTAGTTGGACCCACCATTTGTGATATATCCATTGATGAAAGTAGAGAGCTTGTTCAAGGGGTCTCTCAAGAGCTTCTTGAGACAGCACTTCCCAGACGTGGGGGACCCGTTCTTGTTCTGTATGGTAAGCACAAGGGCGTTTATGGGAGTTTAGTTGAGAGGGATCTTGACAAAGAGACAGGGGTAGTGCGTGATGCTGATAGCCATGAATTATGTAATGTGCGTTTAGAACAGGTTGCTGAATATCTTGGTGACCCAACTTACTTGAGGTATTGATTCCTTGTAGCCAAGATGACGCTGGACCTGTCTTAGTGCATACTGTGTGTACACCAGGTACTTCTGGTCCTGTTCATATTTCTTATTCATACTTAGTCGGCCTATCTATTACTTTGTTCAAAGATGTTTAAATTAGGAACATATTTTAACCTTAAAGACAAACTGAAATTTTAAAAACTTTGAGACCAAATTAAAGATAGGGTAATTTAACTATCCAAGGCACCAAAATGGCAAAACTATGTTTTAACTTTAAATTCTCTAGCTTGTTTCAGCGTGTCTACATGTCTATAGCTGTTGTCTCCATCATTTTAGTCGATCTGTCTAGCTAATAGCTTGGAGGATTAAGATTTTATTCTTCTCCGATATGATGATATTGGGGATTAGGCATTCTGTTGTTACGTTCTGATCGATGATATCATGTCTTTCAGAGTTTGGGATAAAAACCATGGTGACACTTAGCTGTGACGTTTTCTTAACAAGTCTCTTGCATGAATGAGGCGCCTATGTGCTTAAGTGCGTGATAAGTTGTCTTAGAGTGCTCAGTGCATGGAGTGTACGCAGAAAACGCCTGCGGCTGTCAAATGTAGTGGTGATACTCATTCTCTGTTACCCTCAGGATGCTCCTGAATACTGAACTTTGGTAATTTTCATTTCATTTCTTTGCAAATTAGCTTCTTTGCAGTACTACTACATTCATTTGAAGAAAGTGTGACAGTTTGCATATTGAGTATTGAGTCGACGGATATAATAGGTTCAGAATTTTAGATAATGCTCTCTTTATCAGTGGATTGGTCGCCATCTTCATTTCTCGGTCTCGGTCGATTTAAACATCACATTACCTTCATTTCTCGGTCTCGGTCGATTTAAACATCTTTATCAGTGGATTGGTCGCCATCTTCTTTTTGGGTAATACTCCTATTGCTTGGTGTATGTAAGAAAATATGGCTATCTCGGCTAGTGACTGGGTTCATTGGCCACACCTTTCATTTCAAAGCATGAAGAACAGACTGTGAACTTGTGCCCTACCAAACATGCAGGCGGTTGGGAGATATGCATCTCTGATTGATAAGAAGAAAACCCTGATTGATTGTGATAGCATGGTCGTCAGAGACTGGAAAATCCTGGATATCAATATGCTGATTCATGCAATGAATTGGGCTTAAGGGCTGTGGGAGATATCAGCCATGTAATGAATCCACAACCTCATCCTTTTTTTATGTATTGTCTTATAGTATGCTTATCTGAATGACTTCGACATTGCCTTCCTCCTCCACAGCTCGCCAGCCGGA
GACACAGGGTCAATGGGAAGAGTAACGAGCACAGAAGACTCGTCTCAATTCTTCGAGTTTCTCCGGTGTCTGCGGTTTGGGTGTTCTGATCAATGGCGATGTCCAATACGAGCTCCAGCAAGGGTGCCTACAAAAATTATTTGAGAATAAACTAGAAATGTTCATTTAACTCGTGTGATGGGAGATTAAATGGAGAAGGAAGGAGTGAGTGATGAGAGATTATTTGCTGTAAGCTAAATGAAATTAGAGTCGGAGCTGGGCTCTTGTGTTTCTCTAGGTTTCAAGGAATTTTAAGGATCACACCTTTAACGATTCAAACCTACCGCTAGCAGATATTTTCTTCTTTGGATTTTTCCTTTCATGCTTCTCCTCAAGATT

mRNA sequence

ATGGCGTTAAGGAAAACAGAGATCGAGAAAGCAATGGATTGGCTATTGGGAAGCATTGTTAGGATTGTTGGTGGAAGATATGCAGGTTCGGAAGGTAAAATTGTAGAGAAATTGGATCTCAAATGGCTTGTCCTTAAGCTTTCTAACAGAGACGAAGTTAAAGTACATGCAACTGATGTTGCTGAATTGGGTTCCAAAGAAGAGGAAAGGTTCCTGAAGAAATTGGAGGAAGTGAAAAGACGGAGAAGAGAAGTTGAACGAGTAGAGGAAAAGCGTGAAAATGGTCAGAGGGACGAAGAGAAGAGGACTAGATTGTGTTGGCTTACCAGTCACATTCGTGTTAGGATCATCAGCAAAGATTTCAAGAGAGGAAAGTTTTATCTAAAAAAAGGAGAGATAGTTGACGTAGTTGGACCCACCATTTGTGATATATCCATTGATGAAAGTAGAGAGCTTGTTCAAGGGGTCTCTCAAGAGCTTCTTGAGACAGCACTTCCCAGACGTGGGGGACCCGTTCTTGTTCTGTATGGTAAGCACAAGGGCGTTTATGGGAGTTTAGTTGAGAGGGATCTTGACAAAGAGACAGGGGTAGTGCGTGATGCTGATAGCCATGAATTATGTAATGTGCGTTTAGAACAGGTTGCTGAATATCTTGGTGACCCAACTTACTTGAGAGTGCTCAGTGCATGGAGTGTACGCAGAAAACGCCTGCGGCTGTCAAATGTAGTGTGGATTGGTCGCCATCTTCATTTCTCGGTCTCGGTCGATTTAAACATCACATTACCTTCATTTCTCGGTCTCGGTCGATTTAAACATCTTTATCAGTGGATTGGTCGCCATCTTCTTTTTGGCTCGCCAGCCGGAGACACAGGGTCAATGGGAAGAGTAACGAGCACAGAAGACTCGTCTCAATTCTTCGAGTTTCTCCGGTGTCTGCGGTTTGGGTGTTCTGATCAATGGCGATGTCCAATACGAGCTCCAGCAAGGGTGCCTACAAAAATTATTTGAGAATAAACTAGAAATGTTCATTTAACTCGTGTGATGGGAGATTAAATGGAGAAGGAAGGAGTGAGTGATGAGAGATTATTTGCTGTAAGCTAAATGAAATTAGAGTCGGAGCTGGGCTCTTGTGTTTCTCTAGGTTTCAAGGAATTTTAAGGATCACACCTTTAACGATTCAAACCTACCGCTAGCAGATATTTTCTTCTTTGGATTTTTCCTTTCATGCTTCTCCTCAAGATT

Coding sequence (CDS)

ATGGCGTTAAGGAAAACAGAGATCGAGAAAGCAATGGATTGGCTATTGGGAAGCATTGTTAGGATTGTTGGTGGAAGATATGCAGGTTCGGAAGGTAAAATTGTAGAGAAATTGGATCTCAAATGGCTTGTCCTTAAGCTTTCTAACAGAGACGAAGTTAAAGTACATGCAACTGATGTTGCTGAATTGGGTTCCAAAGAAGAGGAAAGGTTCCTGAAGAAATTGGAGGAAGTGAAAAGACGGAGAAGAGAAGTTGAACGAGTAGAGGAAAAGCGTGAAAATGGTCAGAGGGACGAAGAGAAGAGGACTAGATTGTGTTGGCTTACCAGTCACATTCGTGTTAGGATCATCAGCAAAGATTTCAAGAGAGGAAAGTTTTATCTAAAAAAAGGAGAGATAGTTGACGTAGTTGGACCCACCATTTGTGATATATCCATTGATGAAAGTAGAGAGCTTGTTCAAGGGGTCTCTCAAGAGCTTCTTGAGACAGCACTTCCCAGACGTGGGGGACCCGTTCTTGTTCTGTATGGTAAGCACAAGGGCGTTTATGGGAGTTTAGTTGAGAGGGATCTTGACAAAGAGACAGGGGTAGTGCGTGATGCTGATAGCCATGAATTATGTAATGTGCGTTTAGAACAGGTTGCTGAATATCTTGGTGACCCAACTTACTTGAGAGTGCTCAGTGCATGGAGTGTACGCAGAAAACGCCTGCGGCTGTCAAATGTAGTGTGGATTGGTCGCCATCTTCATTTCTCGGTCTCGGTCGATTTAAACATCACATTACCTTCATTTCTCGGTCTCGGTCGATTTAAACATCTTTATCAGTGGATTGGTCGCCATCTTCTTTTTGGCTCGCCAGCCGGAGACACAGGGTCAATGGGAAGAGTAACGAGCACAGAAGACTCGTCTCAATTCTTCGAGTTTCTCCGGTGTCTGCGGTTTGGGTGTTCTGATCAATGGCGATGTCCAATACGAGCTCCAGCAAGGGTGCCTACAAAAATTATTTGA

Protein sequence

MALRKTEIEKAMDWLLGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDEVKVHATDVAELGSKEEERFLKKLEEVKRRRREVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKRGKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYLRVLSAWSVRRKRLRLSNVVWIGRHLHFSVSVDLNITLPSFLGLGRFKHLYQWIGRHLLFGSPAGDTGSMGRVTSTEDSSQFFEFLRCLRFGCSDQWRCPIRAPARVPTKII
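
The protein sequence is the conceptual translation of the CDS. The sketch below, assuming the standard nuclear genetic code, translates the first 30 nucleotides of the CDS shown above and reproduces the first ten residues of the protein. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS listed on this page.
cds_prefix = "ATGGCGTTAAGGAAAACAGAGATCGAGAAA"
print(translate(cds_prefix))  # MALRKTEIEK
```

Passing the full CDS string in place of `cds_prefix` would translate the whole reading frame up to its stop codon.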
BLAST of Cp4.1LG05g00390 vs. Swiss-Prot
Match: MOS2_ARATH (Protein MOS2 OS=Arabidopsis thaliana GN=MOS2 PE=2 SV=1)

HSP 1 Score: 240.4 bits (612), Expect = 2.9e-62
Identity = 129/226 (57.08%), Positives = 169/226 (74.78%), Query Frame = 1

Query: 14  WLLGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLS-NRDEVKVHATDVAELGSKEEERFL 73
           + +G  VRI+ GR  G +GKIVEK    + V+K+S + +EVKV   +VA+LGSKEEE+ L
Sbjct: 232 FFVGKEVRIIAGRDVGLKGKIVEKPGSDFFVIKISGSEEEVKVGVNEVADLGSKEEEKCL 291

Query: 74  KKLEEVKRRRREVE------------------RVEEKRENGQRDEEKRTRLCWLTSHIRV 133
           KKL++++   RE +                  R  EK++ GQ   E++ +  WL SHI+V
Sbjct: 292 KKLKDLQLNDREKDKKTSGRGRGAERGSRSEVRASEKQDRGQ-TRERKVKPSWLRSHIKV 351

Query: 134 RIISKDFKRGKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLV 193
           RI+SKD+K G+ YLKKG++VDVVGPT CDI++DE++ELVQGV QELLETALPRRGGPVLV
Sbjct: 352 RIVSKDWKGGRLYLKKGKVVDVVGPTTCDITMDETQELVQGVDQELLETALPRRGGPVLV 411

Query: 194 LYGKHKGVYGSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGD 221
           L GKHKGVYG+LVE+DLDKETGVVRD D+H++ +VRL+QVAEY+GD
Sbjct: 412 LSGKHKGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLDQVAEYMGD 456
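
The percentages on each HSP statistics line are the reported counts divided by the reported alignment length. A minimal sketch of recomputing them from the line itself, using only the figures printed above; `hsp_fractions` is a hypothetical helper.

```python
import re

def hsp_fractions(stats_line):
    """Recompute 'label = n/m' percentages from an HSP statistics line."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        out[label] = round(100 * int(num) / int(den), 2)
    return out

line = "Identity = 129/226 (57.08%), Positives = 169/226 (74.78%), Query Frame = 1"
print(hsp_fractions(line))
```

For the MOS2_ARATH hit this gives 57.08 for identity and 74.78 for positives, matching the values printed in parentheses.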

BLAST of Cp4.1LG05g00390 vs. Swiss-Prot
Match: GPKOW_XENLA (G patch domain and KOW motifs-containing protein OS=Xenopus laevis GN=gpkow PE=2 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 8.1e-12
Identity = 44/139 (31.65%), Positives = 76/139 (54.68%), Query Frame = 1

Query: 86  ERVEEKRENGQRDEEKRTRL---CWLTSHIRVRIISKDFKRGKFYLKKGEIVDVVGPTIC 145
           E+   +R   Q  E+K+ R     WL   IRVR I K++K GK+Y  K  + DV+ PT C
Sbjct: 346 EKRHRQRSPEQEKEKKKIRPEPHGWLRRDIRVRFIDKNYKGGKYYNSKMLVEDVLSPTRC 405

Query: 146 DISIDESRELVQGVSQELLETALPRRGGP-VLVLYGKHKGVYGSLVERDLDKETGVVRDA 205
            +   E+  +++ + Q++LET +P+  G  V+V+ GK++G+ G ++ RD  K   +V+  
Sbjct: 406 -VCRTENGCILEDIRQDMLETIIPKEEGEHVMVVLGKYRGMVGKILHRDKQKSRALVQLQ 465

Query: 206 DSHELC-NVRLEQVAEYLG 220
             H+    +  + +  Y G
Sbjct: 466 GEHDSAETLSYDAICHYTG 483

BLAST of Cp4.1LG05g00390 vs. Swiss-Prot
Match: MOS2_CAEEL (Protein mos-2 homolog OS=Caenorhabditis elegans GN=mos-2 PE=1 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 1.3e-09
Identity = 67/251 (26.69%), Positives = 109/251 (43.43%), Query Frame = 1

Query: 5   KTEIEKAMDWLLGSIVRIVGGRYAGSEGKIVEKLD--LKWLVLKLSNRDEVKVHATDVAE 64
           K E EK  +  +GS +++V GR  G  GK+  + D      +        +KV       
Sbjct: 213 KAEEEKLEEIKVGSFIKVVDGRNKGVYGKVEGRDDDSNSLFIRTAIGGKTMKVSQIVAVA 272

Query: 65  LGSKEEERFLKKLE----EVKRRRREVER------------------VEEKRENGQRD-- 124
           + +KE ER  K L     + ++ R E ER                   + K  + + D  
Sbjct: 273 VSAKEYERDSKCLNKSEYDKEKDRLETERKKLESQPPSTSTSQSSKDYKSKSSSSKHDKN 332

Query: 125 --EEKRTRLCWLTSHIRVRIISKDFKRGKFYLKKGEIVDVVGPTICDISIDESR-ELVQG 184
             E +R    W  + + VR I +DFKRG  Y +K  IVDV G    D++I++ R      
Sbjct: 333 SSEYERNDKMWARTDLLVRFIDEDFKRGSLYEQKVRIVDVAGDN--DVTIEDDRGNTHYN 392

Query: 185 VSQELLETALPRR-GGPVLVLYGKHKGVYGSLVERDLDKE--------TGVVRDADSHEL 218
           + Q  LET +PR  G  ++++ GK  G    ++++D  KE        T  V  A   ++
Sbjct: 393 IRQSWLETVIPREIGEKLMIVAGKRSGQLAVMLDKDKRKEKVTARLVATNDVVTAYFEDV 452

BLAST of Cp4.1LG05g00390 vs. TrEMBL
Match: A0A0A0LCH0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G636410 PE=4 SV=1)

HSP 1 Score: 329.7 bits (844), Expect = 4.1e-87
Identity = 177/221 (80.09%), Positives = 194/221 (87.78%), Query Frame = 1

Query: 16  LGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDE---VKVHATDVAELGSKEEERFL 75
           +G  VRIV GR AG +G+++EKLD  WLVLKLS RDE   +KV ATD+AELGSKEEE+FL
Sbjct: 278 IGKHVRIVRGRDAGLKGRVLEKLDSDWLVLKLSKRDEHVKLKVRATDIAELGSKEEEKFL 337

Query: 76  KKLEEVK--------RRRREVERVEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKR 135
           KKLEE+K        +RRREVE+V EKRENG RD+EKRT RL WLTSHIRVRIISK+FK 
Sbjct: 338 KKLEELKVKNENTGQKRRREVEQVVEKRENGSRDKEKRTGRLSWLTSHIRVRIISKEFKG 397

Query: 136 GKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY 195
           GKFYLKKGEIVDVVGP+ICDISID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY
Sbjct: 398 GKFYLKKGEIVDVVGPSICDISIDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY 457

Query: 196 GSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           GSLVERDLDKETGVVRDADSHEL NVRLEQ+AEY+GDP+YL
Sbjct: 458 GSLVERDLDKETGVVRDADSHELLNVRLEQIAEYIGDPSYL 498

BLAST of Cp4.1LG05g00390 vs. TrEMBL
Match: A0A0D2TVX7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G267300 PE=4 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 2.2e-64
Identity = 136/239 (56.90%), Positives = 175/239 (73.22%), Query Frame = 1

Query: 4   RKTEIEKAMDWLLGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDE-VKVHATDVAE 63
           +K   E    + +G  VR++ GR  GS+G I+EKL   W+VLKL NRDE VKV  +++A+
Sbjct: 222 KKEREEDEDGFFVGKDVRVIEGRGMGSKGTIMEKLGDSWVVLKLKNRDEEVKVRISEIAD 281

Query: 64  LGSKEEERFLKKLEEVK----------------RRRREVERVEEKRENGQRDEEKRTR-L 123
           LGS+EEE+ L++L+E+K                +R R  E++ E + N +R      R +
Sbjct: 282 LGSREEEKCLRRLKELKIRDEKMSKHKDERKYSKRSRNTEKISETQVNVERTRTNGDRGV 341

Query: 124 CWLTSHIRVRIISKDFKRGKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETAL 183
            WL SHIRVRIISK    G+ YLKKG++VDVVGP +CDI++DES+EL+QGV QELLETAL
Sbjct: 342 SWLKSHIRVRIISKSLAGGRLYLKKGQVVDVVGPYMCDIAMDESKELIQGVEQELLETAL 401

Query: 184 PRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           PRRGGPVLVLYG+HKGVYG+LVERDLD+E GVVRDADS EL +V+LEQVAEY+GDP+YL
Sbjct: 402 PRRGGPVLVLYGRHKGVYGNLVERDLDREMGVVRDADSQELLDVKLEQVAEYMGDPSYL 460

BLAST of Cp4.1LG05g00390 vs. TrEMBL
Match: W9RZ88_9ROSA (Protein MOS2 OS=Morus notabilis GN=L484_018670 PE=4 SV=1)

HSP 1 Score: 253.1 bits (645), Expect = 4.9e-64
Identity = 135/218 (61.93%), Positives = 175/218 (80.28%), Query Frame = 1

Query: 15  LLGSIVRIVGGRYAGSEGKIVEKL-DLKWLVLKLSNRDE-VKVHATDVAELGSKEEERFL 74
           L+G  VRIV GR  G +G+++EKL D   LV++LS   E VKV+  DVAELGS+E+E  L
Sbjct: 257 LIGKEVRIVRGRELGLKGRVLEKLSDDNRLVVRLSRSQETVKVNIQDVAELGSEEDEACL 316

Query: 75  KKLEEVKRRRREV--ERVEEKRENGQRD----EEKRTRLCWLTSHIRVRIISKDFKRGKF 134
           K+L+E++ R  E   E+  ++REN  RD    +++  R  WL SHIRVRIIS++ K G+ 
Sbjct: 317 KRLKELRIREEEEKKEKKSKRRENKSRDSDGEKQQPPRKSWLRSHIRVRIISRELKGGRL 376

Query: 135 YLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL 194
           YLKKGE+VDVVGP +CD+S+D+ REL+QGVSQ++LE+ALPRRGGPVLVL+GKH+GVYGSL
Sbjct: 377 YLKKGEVVDVVGPKVCDVSMDDGRELIQGVSQDVLESALPRRGGPVLVLFGKHEGVYGSL 436

Query: 195 VERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           VERDLD+ETGVVRDAD+H+L NVRLEQ+AEY+GDP+YL
Sbjct: 437 VERDLDRETGVVRDADTHDLINVRLEQIAEYIGDPSYL 474

BLAST of Cp4.1LG05g00390 vs. TrEMBL
Match: A0A151RDK5_CAJCA (Protein MOS2 OS=Cajanus cajan GN=KK1_037960 PE=4 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 1.8e-63
Identity = 134/211 (63.51%), Positives = 169/211 (80.09%), Query Frame = 1

Query: 19  IVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDE-VKVHATDVAELGSKEEERFLKKLEE 78
           +VRIVGGR AG +G +V  +   +LVL+LS   + VKV   DVAELGS+EEER L+KL++
Sbjct: 239 VVRIVGGRDAGLKGSVVSSIGDGYLVLRLSGSGQKVKVKVGDVAELGSREEERCLRKLKD 298

Query: 79  VKRRRREVERVEEKRENGQRDEEKRT----RLCWLTSHIRVRIISKDFKRGKFYLKKGEI 138
            K RR      EEKR++  R EE+R     ++ WLTSHIRVR+IS+D K G+ YLKKGE+
Sbjct: 299 SKIRR------EEKRQDVGRREERRVVDHRKVSWLTSHIRVRVISRDLKGGRLYLKKGEV 358

Query: 139 VDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDK 198
           +DVVGPT CDIS+DESRE+VQGVSQ++LETA+PR GGPVLVL GK+KGVYGSLVERDLD+
Sbjct: 359 LDVVGPTTCDISMDESREIVQGVSQDVLETAIPRCGGPVLVLAGKYKGVYGSLVERDLDR 418

Query: 199 ETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           E+ VVRDAD+HE+ NV+LEQ+AEY+GDP+ L
Sbjct: 419 ESAVVRDADTHEMFNVKLEQIAEYIGDPSLL 443

BLAST of Cp4.1LG05g00390 vs. TrEMBL
Match: A0A061ENN8_THECC (MOS2, putative isoform 1 OS=Theobroma cacao GN=TCM_021051 PE=4 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 3.2e-63
Identity = 134/230 (58.26%), Positives = 171/230 (74.35%), Query Frame = 1

Query: 14  WLLGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDE-VKVHATDVAELGSKEEERFL 73
           + +G  VR++ GR  G +G I+EKL   W+VL+L   +E VKV   ++A+LGS+EEE+ L
Sbjct: 234 FFVGKDVRVIEGREMGLKGTIMEKLGGGWIVLRLKKSEEKVKVRLFEIADLGSREEEKCL 293

Query: 74  KKLEEVK-----------------RRRREVERVEEKRENGQRDEEKRTR-LCWLTSHIRV 133
           +KL E+K                 +R RE E+  E + N +R      R + WL SHIRV
Sbjct: 294 RKLTELKIREAKDLKTKGDERKVSKRSRESEKRSETKVNVERVRTNGDRGVSWLRSHIRV 353

Query: 134 RIISKDFKRGKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLV 193
           RIISK+ + G+ YLKKG++VDVVGP +CDIS+DESREL+QGV QELLETALPRRGGPVL+
Sbjct: 354 RIISKNLEGGRLYLKKGQVVDVVGPYMCDISMDESRELIQGVEQELLETALPRRGGPVLI 413

Query: 194 LYGKHKGVYGSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           LYG+HKGVYGSLVERD+D+ETGVVRDADSHEL NV+LEQ+AEY+GDP+YL
Sbjct: 414 LYGRHKGVYGSLVERDVDRETGVVRDADSHELLNVKLEQIAEYMGDPSYL 463

BLAST of Cp4.1LG05g00390 vs. TAIR10
Match: AT1G33520.1 (AT1G33520.1 D111/G-patch domain-containing protein)

HSP 1 Score: 240.4 bits (612), Expect = 1.7e-63
Identity = 129/226 (57.08%), Positives = 169/226 (74.78%), Query Frame = 1

Query: 14  WLLGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLS-NRDEVKVHATDVAELGSKEEERFL 73
           + +G  VRI+ GR  G +GKIVEK    + V+K+S + +EVKV   +VA+LGSKEEE+ L
Sbjct: 232 FFVGKEVRIIAGRDVGLKGKIVEKPGSDFFVIKISGSEEEVKVGVNEVADLGSKEEEKCL 291

Query: 74  KKLEEVKRRRREVE------------------RVEEKRENGQRDEEKRTRLCWLTSHIRV 133
           KKL++++   RE +                  R  EK++ GQ   E++ +  WL SHI+V
Sbjct: 292 KKLKDLQLNDREKDKKTSGRGRGAERGSRSEVRASEKQDRGQ-TRERKVKPSWLRSHIKV 351

Query: 134 RIISKDFKRGKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLV 193
           RI+SKD+K G+ YLKKG++VDVVGPT CDI++DE++ELVQGV QELLETALPRRGGPVLV
Sbjct: 352 RIVSKDWKGGRLYLKKGKVVDVVGPTTCDITMDETQELVQGVDQELLETALPRRGGPVLV 411

Query: 194 LYGKHKGVYGSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGD 221
           L GKHKGVYG+LVE+DLDKETGVVRD D+H++ +VRL+QVAEY+GD
Sbjct: 412 LSGKHKGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLDQVAEYMGD 456

BLAST of Cp4.1LG05g00390 vs. TAIR10
Match: AT4G25020.1 (AT4G25020.1 D111/G-patch domain-containing protein)

HSP 1 Score: 181.8 bits (460), Expect = 7.0e-46
Identity = 104/201 (51.74%), Positives = 138/201 (68.66%), Query Frame = 1

Query: 26  RYAGSEGKIVEKLD--LKWLVLKL---SNRDEVKVHATDVAELGSKEEERFLKKLEEVKR 85
           +++G+EG    K D  +K +  KL    + +EVKV    + ++ + E++R ++K      
Sbjct: 173 KWSGNEGFGFGKSDKAMKMIDNKLVGSGSHEEVKV---GINKIENMEKDRVVRKRNRETE 232

Query: 86  RRREVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKRGKFYLKKGEIVDVVGPT 145
                E    K+    +  E R +  WL SHI+VRIISKD K G+ YLKK  + DVVGPT
Sbjct: 233 GESRTEVKACKQNYRGQTRETREKTSWLRSHIKVRIISKDVKGGRLYLKKAVVTDVVGPT 292

Query: 146 ICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRD 205
            CDI++DE++ELVQG+ QELLETALPRRGG VLVL G+HKGVYG LVE+DLDKETGVV D
Sbjct: 293 SCDIAMDETQELVQGIDQELLETALPRRGGSVLVLSGRHKGVYGRLVEKDLDKETGVVCD 352

Query: 206 ADSHELCNVRLEQVAEYLGDP 222
           ADS E+ +V+L+QVAEY+GDP
Sbjct: 353 ADSQEMLHVKLDQVAEYIGDP 370

BLAST of Cp4.1LG05g00390 vs. NCBI nr
Match: gi|700203259|gb|KGN58392.1| (hypothetical protein Csa_3G636410 [Cucumis sativus])

HSP 1 Score: 329.7 bits (844), Expect = 5.9e-87
Identity = 177/221 (80.09%), Positives = 194/221 (87.78%), Query Frame = 1

Query: 16  LGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDE---VKVHATDVAELGSKEEERFL 75
           +G  VRIV GR AG +G+++EKLD  WLVLKLS RDE   +KV ATD+AELGSKEEE+FL
Sbjct: 278 IGKHVRIVRGRDAGLKGRVLEKLDSDWLVLKLSKRDEHVKLKVRATDIAELGSKEEEKFL 337

Query: 76  KKLEEVK--------RRRREVERVEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKR 135
           KKLEE+K        +RRREVE+V EKRENG RD+EKRT RL WLTSHIRVRIISK+FK 
Sbjct: 338 KKLEELKVKNENTGQKRRREVEQVVEKRENGSRDKEKRTGRLSWLTSHIRVRIISKEFKG 397

Query: 136 GKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY 195
           GKFYLKKGEIVDVVGP+ICDISID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY
Sbjct: 398 GKFYLKKGEIVDVVGPSICDISIDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY 457

Query: 196 GSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           GSLVERDLDKETGVVRDADSHEL NVRLEQ+AEY+GDP+YL
Sbjct: 458 GSLVERDLDKETGVVRDADSHELLNVRLEQIAEYIGDPSYL 498

BLAST of Cp4.1LG05g00390 vs. NCBI nr
Match: gi|778682238|ref|XP_004144463.2| (PREDICTED: protein MOS2 [Cucumis sativus])

HSP 1 Score: 329.7 bits (844), Expect = 5.9e-87
Identity = 177/221 (80.09%), Positives = 194/221 (87.78%), Query Frame = 1

Query: 16  LGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDE---VKVHATDVAELGSKEEERFL 75
           +G  VRIV GR AG +G+++EKLD  WLVLKLS RDE   +KV ATD+AELGSKEEE+FL
Sbjct: 256 IGKHVRIVRGRDAGLKGRVLEKLDSDWLVLKLSKRDEHVKLKVRATDIAELGSKEEEKFL 315

Query: 76  KKLEEVK--------RRRREVERVEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKR 135
           KKLEE+K        +RRREVE+V EKRENG RD+EKRT RL WLTSHIRVRIISK+FK 
Sbjct: 316 KKLEELKVKNENTGQKRRREVEQVVEKRENGSRDKEKRTGRLSWLTSHIRVRIISKEFKG 375

Query: 136 GKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY 195
           GKFYLKKGEIVDVVGP+ICDISID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY
Sbjct: 376 GKFYLKKGEIVDVVGPSICDISIDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVY 435

Query: 196 GSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           GSLVERDLDKETGVVRDADSHEL NVRLEQ+AEY+GDP+YL
Sbjct: 436 GSLVERDLDKETGVVRDADSHELLNVRLEQIAEYIGDPSYL 476

BLAST of Cp4.1LG05g00390 vs. NCBI nr
Match: gi|659120907|ref|XP_008460410.1| (PREDICTED: protein MOS2 [Cucumis melo])

HSP 1 Score: 325.5 bits (833), Expect = 1.1e-85
Identity = 174/217 (80.18%), Positives = 192/217 (88.48%), Query Frame = 1

Query: 20  VRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDE---VKVHATDVAELGSKEEERFLKKLE 79
           VRI+ GR AG +G+++EKLD  WLVLKLS RDE   +KV ATD+AELGSKEEERFLKKLE
Sbjct: 312 VRIIRGRDAGLKGRVLEKLDSDWLVLKLSKRDEHVKLKVRATDIAELGSKEEERFLKKLE 371

Query: 80  EVK--------RRRREVERVEEKRENGQRDEEKR-TRLCWLTSHIRVRIISKDFKRGKFY 139
           E+K        +RRREVERV EKRENG RD+EKR +RL WLTSHIRVRIISK+FK GKFY
Sbjct: 372 ELKVKDENTGQKRRREVERVVEKRENGTRDKEKRNSRLSWLTSHIRVRIISKEFKGGKFY 431

Query: 140 LKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLV 199
           LKKGEIVDVVGP+ICDISID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLV
Sbjct: 432 LKKGEIVDVVGPSICDISIDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLV 491

Query: 200 ERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           ERDLD+ETGVVRDADSH+L NVRLEQ+AEY+GDP+YL
Sbjct: 492 ERDLDEETGVVRDADSHQLLNVRLEQIAEYIGDPSYL 528

BLAST of Cp4.1LG05g00390 vs. NCBI nr
Match: gi|1009139935|ref|XP_015887378.1| (PREDICTED: protein MOS2 [Ziziphus jujuba])

HSP 1 Score: 270.0 bits (689), Expect = 5.5e-69
Identity = 149/232 (64.22%), Positives = 179/232 (77.16%), Query Frame = 1

Query: 4   RKTEIEKAMD-WLLGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDE-VKVHATDVA 63
           R  +I++  D  L G  VRIVGGR AG +GKI+EKLD    VLKLS  ++ VKV A D+A
Sbjct: 257 RDKDIDRGRDSGLSGKEVRIVGGRNAGLKGKIIEKLDHDKFVLKLSRSEQSVKVSANDIA 316

Query: 64  ELGSKEEERFLKKLEEVKRR----RREVERVEEKRENGQRDEEKRTR-----LCWLTSHI 123
           ELGSKEEER+LKKL+E+K +    R+E +R  E+     RD +K  +       WLTSHI
Sbjct: 317 ELGSKEEERYLKKLKELKIQEEVGRKESKRTREEGRRESRDSQKENQRNMKQASWLTSHI 376

Query: 124 RVRIISKDFKRGKFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPV 183
           RVRIISKD K G+ +LKKGE+VDVVGP +CDIS+DESRELVQGVSQ+LLETALPRRGGPV
Sbjct: 377 RVRIISKDLKGGRLHLKKGEVVDVVGPKMCDISMDESRELVQGVSQDLLETALPRRGGPV 436

Query: 184 LVLYGKHKGVYGSLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           LVL GKHKGVYG+LVERDLD+E GVVRDAD+H L NV+ EQ+AEY+GDP+ L
Sbjct: 437 LVLSGKHKGVYGNLVERDLDREIGVVRDADTHSLLNVQFEQIAEYIGDPSLL 488

BLAST of Cp4.1LG05g00390 vs. NCBI nr
Match: gi|502123464|ref|XP_004498120.1| (PREDICTED: protein MOS2 [Cicer arietinum])

HSP 1 Score: 255.4 bits (651), Expect = 1.4e-64
Identity = 134/220 (60.91%), Positives = 172/220 (78.18%), Query Frame = 1

Query: 19  IVRIVGGRYAGSEGKIVEKLDLKWLVLK-LSNRDEVKVHATDVAELGSKEEERFLKKLEE 78
           IVRIV GR  G +  +V++    +L+LK L + +EVKV   DVAELGSKEE+R L+KL++
Sbjct: 239 IVRIVRGRDVGLKASVVDRFGDDFLILKVLRSGEEVKVKIEDVAELGSKEEDRCLRKLQD 298

Query: 79  VKRRRREVER----------VEEKRENGQ---RDEEKRTRLCWLTSHIRVRIISKDFKRG 138
            K R RE E           VEE+R NG    R+E+ + ++ WLTSHIRVR+IS+ FK G
Sbjct: 299 SKTRGREEENGSRSKRGRDEVEERRVNGNGGGREEKGKKQISWLTSHIRVRVISRSFKAG 358

Query: 139 KFYLKKGEIVDVVGPTICDISIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYG 198
           + YLKKGE++DV+GPT CDIS+DESRE++QGVSQ++LETA+P+RGGPVLVLYGKHKGV+G
Sbjct: 359 RLYLKKGEVLDVIGPTTCDISLDESREIIQGVSQDMLETAIPKRGGPVLVLYGKHKGVFG 418

Query: 199 SLVERDLDKETGVVRDADSHELCNVRLEQVAEYLGDPTYL 225
           SLVERDLD+E GVVRDAD+HEL NV+LE +AEY+GDP+ L
Sbjct: 419 SLVERDLDREIGVVRDADTHELLNVKLEHMAEYIGDPSLL 458

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value    Identity (%)    Description
MOS2_ARATH    2.9e-62    57.08    Protein MOS2 OS=Arabidopsis thaliana GN=MOS2 PE=2 SV=1
GPKOW_XENLA    8.1e-12    31.65    G patch domain and KOW motifs-containing protein OS=Xenopus laevis GN=gpkow PE=2...
MOS2_CAEEL    1.3e-09    26.69    Protein mos-2 homolog OS=Caenorhabditis elegans GN=mos-2 PE=1 SV=1

TrEMBL
Match Name    E-value    Identity (%)    Description
A0A0A0LCH0_CUCSA    4.1e-87    80.09    Uncharacterized protein OS=Cucumis sativus GN=Csa_3G636410 PE=4 SV=1
A0A0D2TVX7_GOSRA    2.2e-64    56.90    Uncharacterized protein OS=Gossypium raimondii GN=B456_009G267300 PE=4 SV=1
W9RZ88_9ROSA    4.9e-64    61.93    Protein MOS2 OS=Morus notabilis GN=L484_018670 PE=4 SV=1
A0A151RDK5_CAJCA    1.8e-63    63.51    Protein MOS2 OS=Cajanus cajan GN=KK1_037960 PE=4 SV=1
A0A061ENN8_THECC    3.2e-63    58.26    MOS2, putative isoform 1 OS=Theobroma cacao GN=TCM_021051 PE=4 SV=1

TAIR10
Match Name    E-value    Identity (%)    Description
AT1G33520.1    1.7e-63    57.08    D111/G-patch domain-containing protein
AT4G25020.1    7.0e-46    51.74    D111/G-patch domain-containing protein

NCBI nr
Match Name    E-value    Identity (%)    Description
gi|700203259|gb|KGN58392.1|    5.9e-87    80.09    hypothetical protein Csa_3G636410 [Cucumis sativus]
gi|778682238|ref|XP_004144463.2|    5.9e-87    80.09    PREDICTED: protein MOS2 [Cucumis sativus]
gi|659120907|ref|XP_008460410.1|    1.1e-85    80.18    PREDICTED: protein MOS2 [Cucumis melo]
gi|1009139935|ref|XP_015887378.1|    5.5e-69    64.22    PREDICTED: protein MOS2 [Ziziphus jujuba]
gi|502123464|ref|XP_004498120.1|    1.4e-64    60.91    PREDICTED: protein MOS2 [Cicer arietinum]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
IPR005824    KOW
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Cp4.1LG05g00390.1    Cp4.1LG05g00390.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR005824    KOW    PFAM    PF00467    KOW    coord: 17..55, score: 1.
IPR005824    KOW    SMART    SM00739    kow_9    coord: 13..40, score: 0.73; coord: 165..192, score:
None    No IPR available    unknown    Coil    Coil    coord: 68..98, scor
None    No IPR available    PANTHER    PTHR15818    G PATCH AND KOW-CONTAINING    coord: 4..224, score: 1.7
None    No IPR available    PANTHER    PTHR15818:SF2    G PATCH DOMAIN AND KOW MOTIFS-CONTAINING PROTEIN    coord: 4..224, score: 1.7
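
The `coord` values in the alignment column give residue ranges on the protein. As a rough sketch, assuming these coordinates are 1-based and inclusive (not stated on this page), the residues covered by the Pfam KOW hit (17..55) can be extracted from the protein sequence listed above. `domain_slice` is a hypothetical helper, and only the first 60 residues of the protein are included here.

```python
def domain_slice(protein, start, end):
    """Return residues start..end, treating coordinates as 1-based inclusive (assumption)."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("coordinates fall outside the sequence")
    return protein[start - 1:end]

# First 60 residues of the protein sequence listed on this page.
protein_prefix = "MALRKTEIEKAMDWLLGSIVRIVGGRYAGSEGKIVEKLDLKWLVLKLSNRDEVKVHATDV"

kow = domain_slice(protein_prefix, 17, 55)
print(len(kow), kow)
```

The same helper applies to the other coordinate ranges in the table if the full protein sequence is supplied.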