CmoCh04G000070 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G000070
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: (Sh4 homologue protein) (Shattering protein)
Location: Cmo_Chr04 : 36569 .. 38481 (-)
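As a quick sanity check on the location line, the length of the genomic span can be computed from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the page does not state the convention); `span_length` is a hypothetical helper name.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1

# Coordinates taken from the Location line above.
gene_length = span_length(36569, 38481)
print(gene_length)
```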
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGGAGCCCCCGACGACATCAACGGAGCCACAGCGTCAGCATCAGCATGAGCAGCAGCAACATCCCCACCATCTCCTACATTTACCCCTAATCCACGGTGGCGCATCCACGGGCACCACTGCCCGAATCAACACTGCAGTAGCAACCTCACCCTCGACAGTAATAGTCCGAGAGTACCGCAAAGGGAACTGGAGTCTCCAAGAGACGATGATTCTGATAACCGCGAAAAAGCTGGACGAGGAACGGCGGAACAAGGCGGAACAAGGAACGGCCAGAAAGGGCAGCGAGCTGCGGTGGAAGTGGGTGGAAAACTACTGCTGGAGCCAGGGGTGCGAGCGGAGCCAAAATCAGTGCAACGACAAGTGGGATAACCTACTCCGCGACTACAAAAAAGTCCGGGAGCATGAATCCCGCGCGTGTGATCAATCCCAAATTCCGTCTTACTGGAAAATGGAAAAGCATGAGCGTAAGGACAACAATCTTCCTTCTAACATGGCCTTTGAGGTGTATCAGGCCTTAAACGACGTGGTTCAGAGGAAGCTCAGAGGTGTGTCTGTTGCTGTTGTTGCTGGTCCTCCGCCTTCTCCTTCCCCCGCCGAGGGCGAGGCTGCGGCGGGGACTAGTTCCCCGGCGGCTTCAGGTGAGTGTGTTTTTGTTTGTGTGTGTGTGTGTGTGGGGTACTATTAAATTTGATTTTATTATATTATTTTCGGCCTGGACCTTTCATTTTCGGCCTGGTTCGAGTTAATAGTGTGGGTCAAAGTTGCAGGGTTTGACTCTGCCACAATTTTGAATAGGACCCTCCTTCCTCCTTTTCGTTTCGTTATTCGCTTTCTAATCCTCTACGGACACCTCTACCATCTGCAGATGGGGCCTCCAAATCTTATCCCTCCGATTACGTTGCGAATTTAATATAGTGGGATTAATTTTGCAATCATGATTAATACAAATCTAAGCATGCAATCCCATAATTAAATATTTATTGTTTACTTACTAATTTTGTCTTCTTCTTCTTTTTAAAATCTCTCTATTTTACAGCTTCTTTAAAATTTATGGATGGATATATTTACCAAATGCAGATTCGTGAGCTTTATGTGTGTGTGTGTATATATATATATATATTTGAAATTTGATTGTGTGGGCATATGTGTATATATATATACAGAGTCGTCGTCGTCGTCGTCGTCAGGGAGGGAGTGGGGGCAGAAGAAAGAGAAGAGGGAGAGAAAGAGGAGAAGAGTGGGAAGAAGCATTGAGAGAAGCGCGTCGGCGGTGGCTCAAACGCTGCGGACCTGCGAGGAGCAGAGGGAGATCCGACACCAACAACTGATGGAGCTTAAGAAACGGCGCCTTCAAATCCAAGAAGCCCGCAACCACATTCAGGGTCAAGGCATCGCCGACCTCGTGGCCGCGGTTGCCAACCTCTCCGGTACATGTGTACATATATATATATATATATATATATATATAAATAATAAAAAGAAAGGAATTAGGGTTTATGTTGACTTGGTGGGGGGATGTTGTGAGTGTAGGCATAAACAATAGAAGAAGAAGAAGAAGAAGATCAGAAGAGTATGAATGTTTATACAGTGGAGAAGAGGTGAGAATGTTGAAAGAACAAAACGAGGCAATGCAGGCTGAGCTTTCGAGCGTCAAGACTGAGCTTTCTCAACTCCGAGACCAAATGCCCTCTCTCGTGCGAACCATGATGCACAATGTGATGCACAACATCCCTCCTCCTCCTCCTCCTTCCACGGTACTCTCTCCCTCTCCCTCTCCCTCTCCCTCTCCCTCTCCCTCTCCCTCCCTATGCATCCATCCATATTATTCTAAATAATATATATAATATTGACTATATGTTCTGTACATCTTTGTTTGTATAGGACCCAGGTGGAGATGCTTACAAATAA

mRNA sequence

ATGTCGGAGCCCCCGACGACATCAACGGAGCCACAGCGTCAGCATCAGCATGAGCAGCAGCAACATCCCCACCATCTCCTACATTTACCCCTAATCCACGGTGGCGCATCCACGGGCACCACTGCCCGAATCAACACTGCAGTAGCAACCTCACCCTCGACAGTAATAGTCCGAGAGTACCGCAAAGGGAACTGGAGTCTCCAAGAGACGATGATTCTGATAACCGCGAAAAAGCTGGACGAGGAACGGCGGAACAAGGCGGAACAAGGAACGGCCAGAAAGGGCAGCGAGCTGCGGTGGAAGTGGGTGGAAAACTACTGCTGGAGCCAGGGGTGCGAGCGGAGCCAAAATCAGTGCAACGACAAGTGGGATAACCTACTCCGCGACTACAAAAAAGTCCGGGAGCATGAATCCCGCGCGTGTGATCAATCCCAAATTCCGTCTTACTGGAAAATGGAAAAGCATGAGCGTAAGGACAACAATCTTCCTTCTAACATGGCCTTTGAGGTGTATCAGGCCTTAAACGACGTGGTTCAGAGGAAGCTCAGAGGTGTGTCTGTTGCTGTTGTTGCTGGTCCTCCGCCTTCTCCTTCCCCCGCCGAGGGCGAGGCTGCGGCGGGGACTAGTTCCCCGGCGGCTTCAGAGTCGTCGTCGTCGTCGTCGTCAGGGAGGGAGTGGGGGCAGAAGAAAGAGAAGAGGGAGAGAAAGAGGAGAAGAGTGGGAAGAAGCATTGAGAGAAGCGCGTCGGCGGTGGCTCAAACGCTGCGGACCTGCGAGGAGCAGAGGGAGATCCGACACCAACAACTGATGGAGCTTAAGAAACGGCGCCTTCAAATCCAAGAAGCCCGCAACCACATTCAGGGTCAAGGCATCGCCGACCTCGTGGCCGCGGTTGCCAACCTCTCCGGCATAAACAATAGAAGAAGAAGAAGAAGAAGATCAGAAGAGTATGAATGTTTATACAGTGGAGAAGAGGTGAGAATGTTGAAAGAACAAAACGAGGCAATGCAGGCTGAGCTTTCGAGCGTCAAGACTGAGCTTTCTCAACTCCGAGACCAAATGCCCTCTCTCGTGCGAACCATGATGCACAATGTGATGCACAACATCCCTCCTCCTCCTCCTCCTTCCACGGACCCAGGTGGAGATGCTTACAAATAA

Coding sequence (CDS)

ATGTCGGAGCCCCCGACGACATCAACGGAGCCACAGCGTCAGCATCAGCATGAGCAGCAGCAACATCCCCACCATCTCCTACATTTACCCCTAATCCACGGTGGCGCATCCACGGGCACCACTGCCCGAATCAACACTGCAGTAGCAACCTCACCCTCGACAGTAATAGTCCGAGAGTACCGCAAAGGGAACTGGAGTCTCCAAGAGACGATGATTCTGATAACCGCGAAAAAGCTGGACGAGGAACGGCGGAACAAGGCGGAACAAGGAACGGCCAGAAAGGGCAGCGAGCTGCGGTGGAAGTGGGTGGAAAACTACTGCTGGAGCCAGGGGTGCGAGCGGAGCCAAAATCAGTGCAACGACAAGTGGGATAACCTACTCCGCGACTACAAAAAAGTCCGGGAGCATGAATCCCGCGCGTGTGATCAATCCCAAATTCCGTCTTACTGGAAAATGGAAAAGCATGAGCGTAAGGACAACAATCTTCCTTCTAACATGGCCTTTGAGGTGTATCAGGCCTTAAACGACGTGGTTCAGAGGAAGCTCAGAGGTGTGTCTGTTGCTGTTGTTGCTGGTCCTCCGCCTTCTCCTTCCCCCGCCGAGGGCGAGGCTGCGGCGGGGACTAGTTCCCCGGCGGCTTCAGAGTCGTCGTCGTCGTCGTCGTCAGGGAGGGAGTGGGGGCAGAAGAAAGAGAAGAGGGAGAGAAAGAGGAGAAGAGTGGGAAGAAGCATTGAGAGAAGCGCGTCGGCGGTGGCTCAAACGCTGCGGACCTGCGAGGAGCAGAGGGAGATCCGACACCAACAACTGATGGAGCTTAAGAAACGGCGCCTTCAAATCCAAGAAGCCCGCAACCACATTCAGGGTCAAGGCATCGCCGACCTCGTGGCCGCGGTTGCCAACCTCTCCGGCATAAACAATAGAAGAAGAAGAAGAAGAAGATCAGAAGAGTATGAATGTTTATACAGTGGAGAAGAGGTGAGAATGTTGAAAGAACAAAACGAGGCAATGCAGGCTGAGCTTTCGAGCGTCAAGACTGAGCTTTCTCAACTCCGAGACCAAATGCCCTCTCTCGTGCGAACCATGATGCACAATGTGATGCACAACATCCCTCCTCCTCCTCCTCCTTCCACGGACCCAGGTGGAGATGCTTACAAATAA
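The coding sequence can be related to the protein used as the BLAST query by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; `translate` and the table construction are illustrative helpers, not part of the database. Only the first 42 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 42 nt of the CDS listed above.
cds_start = "ATGTCGGAGCCCCCGACGACATCAACGGAGCCACAGCGTCAG"
print(translate(cds_start))
```

The translated prefix can be compared by eye with the start of the `Query:` lines in the BLAST alignments on this page.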
BLAST of CmoCh04G000070 vs. TrEMBL
Match: A0A0A0KME8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G261720 PE=4 SV=1)

HSP 1 Score: 376.3 bits (965), Expect = 4.4e-101
Identity = 222/324 (68.52%), Positives = 242/324 (74.69%), Query Frame = 1

Query: 1   MSEPPTTSTEPQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVREY 60
           MS+PPTTS+EP   H      H  HL  LP+IH GA+ GT  R+NTA ATS S VIVREY
Sbjct: 1   MSDPPTTSSEPPHHH------HQQHLPRLPVIHSGATGGT--RMNTAAATSSSAVIVREY 60

Query: 61  RKGNWSLQETMILITAKKLDEERRNKAEQG------TARKGSELRWKWVENYCWSQGCER 120
           RKGNW+LQETMILITAKKLD+ERRNKA  G       ARKG ELRWKWVENYCWS GC+R
Sbjct: 61  RKGNWTLQETMILITAKKLDDERRNKANLGPSTVDPAARKGGELRWKWVENYCWSHGCQR 120

Query: 121 SQNQCNDKWDNLLRDYKKVREHESRACDQSQIPSYWKMEKHERKDNNLPSNMAFEVYQAL 180
           SQNQCNDKWDNLLRDYKKVRE+ESRACDQ QIPSYWKMEKHERKD NLPSNMAFEVYQAL
Sbjct: 121 SQNQCNDKWDNLLRDYKKVREYESRACDQ-QIPSYWKMEKHERKDKNLPSNMAFEVYQAL 180

Query: 181 NDVVQRKL---------RGVSVAVVAGPPPS---PSPAEGEAAAGTSSPAASESSSSSSS 240
           NDVVQRK           G+ +  +  PPPS   P P        T+SP  SE   SSSS
Sbjct: 181 NDVVQRKFSQKPSNSSNTGILLLPLPAPPPSALLPPP------TATNSPQLSE---SSSS 240

Query: 241 GREWGQKKEKRERKRRR----VGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKRRLQ 300
           G E  +KKEK E KRR+    +GR IERS SA+ QTL +CEEQREIRHQQLMEL+KRRLQ
Sbjct: 241 GTESSEKKEKVEAKRRKMEDNIGRRIERSVSALGQTLHSCEEQREIRHQQLMELRKRRLQ 300

Query: 301 IQEARNHIQGQGIADLVAAVANLS 303
           I+E RNHI  QGIADLVAAVANLS
Sbjct: 301 IEETRNHIHRQGIADLVAAVANLS 306
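The percentages in an HSP summary line are simply the matched count divided by the alignment length. The sketch below recomputes them from the text of the summary line; the regular expression and the function name `recompute_percentages` are assumptions made for illustration, and the pattern tolerates both the "Positives" and "Postives" spellings seen in BLAST text output.

```python
import re

# Matches e.g. "Identity = 222/324 (68.52%)" and "Positives = 242/324 (74.69%)".
HSP_PATTERN = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def recompute_percentages(summary_line: str) -> dict:
    """Return {label: (recomputed_percent, reported_percent)} for one HSP line."""
    result = {}
    for label, matched, length, reported in HSP_PATTERN.findall(summary_line):
        key = "positives" if label.lower().startswith("pos") else "identity"
        recomputed = round(100 * int(matched) / int(length), 2)
        result[key] = (recomputed, float(reported))
    return result

line = "Identity = 222/324 (68.52%), Positives = 242/324 (74.69%), Query Frame = 1"
print(recompute_percentages(line))
```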

BLAST of CmoCh04G000070 vs. TrEMBL
Match: A0A067KQ45_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04860 PE=4 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 1.9e-64
Identity = 158/321 (49.22%), Positives = 200/321 (62.31%), Query Frame = 1

Query: 1   MSEPPTTSTEPQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVREY 60
           MS+PPT+   P      + Q  P    HLPL+   A+T TT   +            REY
Sbjct: 1   MSQPPTSIPPPP-----QPQPQPQPPSHLPLLPFSATTTTTPTSSN-----------REY 60

Query: 61  RKGNWSLQETMILITAKKLDEERRNKAEQGTARKGSELRWKWVENYCWSQGCERSQNQCN 120
           RKGNW++QET+ LITAKKLD+ERR+K    +  K  ELRWKWVENYCW+ GC RSQNQCN
Sbjct: 61  RKGNWTIQETLTLITAKKLDDERRSKPTVPSTSKPGELRWKWVENYCWAHGCYRSQNQCN 120

Query: 121 DKWDNLLRDYKKVREHESRACDQSQIPSYWKMEKHERKDNNLPSNMAFEVYQALNDVVQR 180
           DKWDNLLRDYKKVRE++SR+      PSYW ME+H+RK  NLPSNM+ EV++ALN VVQR
Sbjct: 121 DKWDNLLRDYKKVREYQSRSDGSDSFPSYWTMERHQRKYYNLPSNMSLEVFEALNQVVQR 180

Query: 181 KLRGVS---------------VAVVAGPPPSP-SPAEGEAAAGTSSPAASESSSSS---S 240
           +   ++               V VVA  P SP +  E    A    PA SE S SS   S
Sbjct: 181 RYTNITQQNVVVSPQQQQQQQVTVVADVPVSPVTLREVVPEALMDRPALSEGSESSATES 240

Query: 241 SGREWGQKKEKRERKRRRVGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKRRLQIQE 300
           S +      ++R  K   +G SI+ SAS +AQT+R CEE++E RHQ+LME ++RRLQ++E
Sbjct: 241 SDKHDSGGSKRRRMKNNNIGASIKHSASILAQTIRNCEEKKEKRHQELMEFEQRRLQLEE 300

Query: 301 ARNHIQGQGIADLVAAVANLS 303
            RN +  QG+A+L  AV NLS
Sbjct: 301 TRNEVNRQGMANLAMAVTNLS 305

BLAST of CmoCh04G000070 vs. TrEMBL
Match: K7N4B1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_20G184500 PE=4 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 3.3e-64
Identity = 156/330 (47.27%), Positives = 203/330 (61.52%), Query Frame = 1

Query: 22  HPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVREYRKGNWSLQETMILITAKKLDE 81
           H HH  H+PLI GGA+            +S ST + REYRKGNW++QET+ILITAKKLD+
Sbjct: 12  HHHHNHHVPLIQGGATA----------PSSSSTTLAREYRKGNWTIQETLILITAKKLDD 71

Query: 82  ERRNKAEQG------TARKGSELRWKWVENYCWSQGCERSQNQCNDKWDNLLRDYKKVRE 141
           ERR K          T R   ELRWKWVENYCWS GC RSQNQCNDKWDNLLRDYKKVR+
Sbjct: 72  ERRLKTPAACSTSTTTTRTSGELRWKWVENYCWSHGCLRSQNQCNDKWDNLLRDYKKVRD 131

Query: 142 HESRACD-----QSQIPSYWKMEKHERKDNNLPSNMAFEVYQALNDVVQRK--------L 201
           +ES++ D         PSYW + K +RK+ NLPSNM FEVYQ + DV+QRK         
Sbjct: 132 YESKSNDNDNNNNKHFPSYWTLNKQQRKEQNLPSNMVFEVYQTIADVLQRKQTQSQRQHQ 191

Query: 202 RGVSVAVVAGP------------PPSPSPAEGEAAAGTSSPAASESSSSSSS--GREWGQ 261
           + +++ +V               PP P P        +++P  SE S SS +    +   
Sbjct: 192 QPLAIPLVTSSPSPLQTLPPPPLPPPPPPPPPPPPVSSTTPVGSERSESSGTEHSEDDDD 251

Query: 262 KKEKRERKRRRVGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKRRLQIQEARNHIQG 314
             E + RK + +G  I +SAS +A+ LR+CEE++E RH++++EL++RR+Q++EARN +  
Sbjct: 252 GSESKRRKVKNLGSRIMQSASVLARALRSCEEKKEKRHREMIELEQRRIQMEEARNEVHR 311

BLAST of CmoCh04G000070 vs. TrEMBL
Match: V7BD78_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_007G100500g PE=4 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 3.6e-63
Identity = 168/360 (46.67%), Positives = 212/360 (58.89%), Query Frame = 1

Query: 1   MSEPPTTSTE--PQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVR 60
           MS+P TT     P     H Q  H H L   PLI G           TA A S S+ + R
Sbjct: 1   MSDPSTTPLPHPPLLPEAHRQPLHHHPL---PLIQGA----------TAAAPSSSSSLAR 60

Query: 61  EYRKGNWSLQETMILITAKKLDEERRNKAEQ------------GTARKGSELRWKWVENY 120
           EYRKGNW++QET+ILITAKKLD+ERR K                +AR   ELRWKWVENY
Sbjct: 61  EYRKGNWTIQETLILITAKKLDDERRLKTPHDPTRPACSSTTSSSARTSGELRWKWVENY 120

Query: 121 CWSQGCERSQNQCNDKWDNLLRDYKKVREHESRACDQ--------SQIPSYWKMEKHERK 180
           CWS GC RSQNQCNDKWDNLLRDYKKVR++ES++  Q           PSYW + K +RK
Sbjct: 121 CWSHGCLRSQNQCNDKWDNLLRDYKKVRDYESKSQQQQHQQSHEIKHFPSYWTLNKQQRK 180

Query: 181 DNNLPSNMAFEVYQALNDVVQRKLR------------------GVSVAVVAGPPPSPSPA 240
           + NLPSNM +EVY A+ +V+QRK                     V++  V+ PPP P P 
Sbjct: 181 EQNLPSNMVYEVYHAITEVLQRKQTQPQLQSQTQTQRQPQQQPPVALITVSSPPPPPPP- 240

Query: 241 EGEAAAGTSSPAASESSSSSSSGREWGQKKEKRERKRRRV---GRSIERSASAVAQTLRT 300
                  +++PA SE S SS +        +  E KRR+V   G SI RSAS +A+ LR+
Sbjct: 241 ----PVSSTTPAVSERSESSGTEHSEDDADDGSESKRRKVKNLGSSIMRSASVLARALRS 300

Query: 301 CEEQREIRHQQLMELKKRRLQIQEARNHIQGQGIADLVAAVANLSG-----INNRRRRRR 313
           CEE++E RH++L+EL++RRLQ++EAR+ +  QGIA LVAAV NLSG     IN+ R  +R
Sbjct: 301 CEEKKEKRHRELIELEQRRLQMEEARDEVHRQGIATLVAAVTNLSGAIQSLINSERHGQR 342

BLAST of CmoCh04G000070 vs. TrEMBL
Match: B9S7A3_RICCO (Transcription factor, putative OS=Ricinus communis GN=RCOM_0775080 PE=4 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 6.2e-63
Identity = 160/319 (50.16%), Positives = 207/319 (64.89%), Query Frame = 1

Query: 1   MSEPPTTSTEPQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVREY 60
           MSEPP    +P           P   LHLPL+     T TT    T +++S +    REY
Sbjct: 1   MSEPPPPLQQPPPL--------PPPPLHLPLLPFSTITTTT----TTISSSSN----REY 60

Query: 61  RKGNWSLQETMILITAKKLDEERRNKAEQ-GTARKGSELRWKWVENYCWSQGCERSQNQC 120
           RKGNW++QET+ LITAKKLD+ERR+K     +  K  ELRWKWVENYCW+ GC RSQNQC
Sbjct: 61  RKGNWTIQETLTLITAKKLDDERRSKPSTVASTSKPGELRWKWVENYCWAHGCFRSQNQC 120

Query: 121 NDKWDNLLRDYKKVREHESRA--CDQSQIPSYWKMEKHERKDNNLPSNMAFEVYQALNDV 180
           NDKWDNLLRD+KKVR++++R+   D S  PSYW ME+H+RK  NLPSNM+ EV++ALN+V
Sbjct: 121 NDKWDNLLRDFKKVRDYQARSNDSDSSSFPSYWTMERHQRKFYNLPSNMSLEVFEALNEV 180

Query: 181 VQRKL--------RGVSVAVVAGPPPSPSPAEGEAAAGT------SSPAASESSSSSSSG 240
           VQR+         +   V+ VA PPP P  +  EA   T      + P  SESS++ SS 
Sbjct: 181 VQRRYNTNITTTPQQQHVSAVA-PPPVPVTSVREAMPETVVMDAPAVPERSESSATESSD 240

Query: 241 REWGQKKEKRERKRRRVGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKRRLQIQEAR 300
           +  G    KR RK R +G SI+RSAS +AQT+R CEE++  RHQ+L+E ++RRLQ++E R
Sbjct: 241 KHDGNTGPKR-RKVRNIGASIKRSASILAQTIRNCEEKKHKRHQELLEFEQRRLQLEETR 300

Query: 301 NHIQGQGIADLVAAVANLS 303
           N +  QG+ DLV AV  LS
Sbjct: 301 NEVNKQGMTDLVLAVTKLS 301

BLAST of CmoCh04G000070 vs. TAIR10
Match: AT1G31310.1 (AT1G31310.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 160.6 bits (405), Expect = 1.9e-39
Identity = 93/197 (47.21%), Positives = 119/197 (60.41%), Query Frame = 1

Query: 51  SPSTVIVREYRKGNWSLQETMILITAKKLDEERRNKAEQGT----------ARKGSELRW 110
           S   V++REYRKGNW+L ETM+LI AK++D+ERR +   G           + K +ELRW
Sbjct: 5   SGGLVMMREYRKGNWTLNETMVLIEAKRMDDERRMRRSIGLPPPEQQQDIRSNKPAELRW 64

Query: 111 KWVENYCWSQGCERSQNQCNDKWDNLLRDYKKVREHESRACDQS--------------QI 170
           KW+E+YCW +GC RSQNQCNDKWDNL+RDYKKVRE+E R  + S              + 
Sbjct: 65  KWIEDYCWRKGCMRSQNQCNDKWDNLMRDYKKVREYERRRVESSITAGESSSSSAPAGET 124

Query: 171 PSYWKMEKHERKDNNLPSNMAFEVYQALNDVVQRKLRGVSVAVVAGPPPSPSPAEGEAAA 224
            SYWKMEK ERK+ +LPSNM  + YQAL +VV+ K    S AV A              A
Sbjct: 125 ASYWKMEKSERKERSLPSNMLPQTYQALFEVVESKTLPSSTAVTA------------VTA 184

BLAST of CmoCh04G000070 vs. TAIR10
Match: AT2G35640.1 (AT2G35640.1 Homeodomain-like superfamily protein)

HSP 1 Score: 156.8 bits (395), Expect = 2.8e-38
Identity = 72/139 (51.80%), Positives = 101/139 (72.66%), Query Frame = 1

Query: 50  TSPSTVIVREYRKGNWSLQETMILITAKKLDEERR---NKAEQGTARKGSELRWKWVENY 109
           +S   +++RE RKGNW++ ET++LI AKK+D++RR   ++ +     K +ELRWKW+E Y
Sbjct: 7   SSGEQIVMRECRKGNWTVSETLVLIEAKKMDDQRRVRRSEKQPEGRNKPAELRWKWIEEY 66

Query: 110 CWSQGCERSQNQCNDKWDNLLRDYKKVREHESRACDQS----QIPSYWKMEKHERKDNNL 169
           CW +GC R+QNQCNDKWDNL+RDYKK+RE+E    + S       SYWKM+K ERK+ NL
Sbjct: 67  CWRRGCYRNQNQCNDKWDNLMRDYKKIREYERSRVESSFNTVTSSSYWKMDKTERKEKNL 126

Query: 170 PSNMAFEVYQALNDVVQRK 182
           PSNM  ++Y  L+++V RK
Sbjct: 127 PSNMLPQIYDVLSELVDRK 145

BLAST of CmoCh04G000070 vs. TAIR10
Match: AT2G33550.1 (AT2G33550.1 Homeodomain-like superfamily protein)

HSP 1 Score: 52.0 bits (123), Expect = 9.6e-07
Identity = 77/289 (26.64%), Positives = 118/289 (40.83%), Query Frame = 1

Query: 57  VREYRKGNWSLQETMILITAKKLDEERRNKAEQGTARKGS---ELRWKWVENYCWSQGCE 116
           V+  R   W+ QE ++LI  K++ E R  +        GS   E +W  V +YC   G  
Sbjct: 31  VKTARLPRWTRQEILVLIQGKRVAENRVRRGRAAGMALGSGQMEPKWASVSSYCKRHGVN 90

Query: 117 RSQNQCNDKWDNLLRDYKKVREHESRACDQSQIPSYWKMEKHERKDNNLPSNMAFEVYQA 176
           R   QC  +W NL  DYKK++E ES+  ++++  SYW M    R++  LP     EVY  
Sbjct: 91  RGPVQCRKRWSNLAGDYKKIKEWESQIKEETE--SYWVMRNDVRREKKLPGFFDKEVYDI 150

Query: 177 LNDVVQRKLRGVSVAVVAGPPPSPSPAEGEAAAGTSSPAASESSSSSSSGREWGQK---- 236
           ++  V              PP  P  + G A      PA+ E   S    RE  +K    
Sbjct: 151 VDGGVI-------------PPAVPVLSLGLA------PASDEGLLSDLDRRESPEKLNST 210

Query: 237 -----------KEKRERKRRRVGRSIERSASAV-AQTLRTCEEQREIRHQQLMELKKRRL 296
                      KEK+E      GR  E+   A   +   T +E+R+ +     E  K   
Sbjct: 211 PVAKSVTDVIDKEKQEACVADQGRVKEKQPEAANVEGGSTSQEERKRKRTSFGE--KEEE 270

Query: 297 QIQEARNHIQGQGI------ADLVAAVANLSGINNRRRRRRRSEEYECL 321
           + +     +Q Q I        L+AA   +  +N +  R +R +  + L
Sbjct: 271 EEEGETKKMQNQLIEILERNGQLLAAQLEVQNLNLKLDREQRKDHGDSL 296

BLAST of CmoCh04G000070 vs. NCBI nr
Match: gi|659114054|ref|XP_008456886.1| (PREDICTED: uncharacterized protein LOC103496697 [Cucumis melo])

HSP 1 Score: 479.2 bits (1232), Expect = 6.9e-132
Identity = 288/408 (70.59%), Positives = 316/408 (77.45%), Query Frame = 1

Query: 1   MSEPPTTSTEPQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVREY 60
           MS+PPTTS+EP   HQ +QQ    HL  LP+IHGGAS  T  R+NTA ATS S VIVREY
Sbjct: 1   MSDPPTTSSEPPH-HQQQQQ----HLPRLPVIHGGASGAT--RMNTAAATSSSAVIVREY 60

Query: 61  RKGNWSLQETMILITAKKLDEERRNKAEQG------TARKGSELRWKWVENYCWSQGCER 120
           RKGNW+LQETMILITAKKLD+ERRNKA  G       ARKG ELRWKWVENYCWS GC+R
Sbjct: 61  RKGNWTLQETMILITAKKLDDERRNKANLGPSTVDPAARKGGELRWKWVENYCWSHGCQR 120

Query: 121 SQNQCNDKWDNLLRDYKKVREHESRACDQSQIPSYWKMEKHERKDNNLPSNMAFEVYQAL 180
           SQNQCNDKWDNLLRDYKKVRE+ESRACDQ QIPSYWKMEKHERKD NLPSNMAFEVYQAL
Sbjct: 121 SQNQCNDKWDNLLRDYKKVREYESRACDQ-QIPSYWKMEKHERKDKNLPSNMAFEVYQAL 180

Query: 181 NDVVQRKL---------RGVSVAVVAGPPPS---PSPAEGEAAAGTSSPAASESSSSSSS 240
           NDVVQRK           G+ +  +  PPPS   P P        T+SP  SE   SSSS
Sbjct: 181 NDVVQRKFSQKPSNSSNTGILLLPLPAPPPSTLLPPP------TATNSPQLSE---SSSS 240

Query: 241 GREWGQKKEKRERKRRR----VGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKRRLQ 300
           G E  +KKEK E KRR+    +GR IERS SA+ QTL +CEEQREIRHQQLMEL+KRRLQ
Sbjct: 241 GTESSEKKEKMEAKRRKMEDNIGRRIERSVSALGQTLHSCEEQREIRHQQLMELRKRRLQ 300

Query: 301 IQEARNHIQGQGIADLVAAVANLS-GINNRRRRRRRSEEYE-CLYSGEEVRMLKEQNEAM 360
           I+E RNHI  QGIADLVAAVANLS GI+N   RR RSE YE CLYSGEEVR+LKEQNEAM
Sbjct: 301 IEETRNHIHRQGIADLVAAVANLSAGIDN--NRRGRSEGYESCLYSGEEVRILKEQNEAM 360

Query: 361 QAELSSVKTELSQLRDQMPSLVRTMMHNVMHNIPPPPPPST---DPGG 382
           QAEL +VK ELSQLRDQMPSL++TMMH+++HNIPPPPPPST   DP G
Sbjct: 361 QAELMNVKNELSQLRDQMPSLMQTMMHSMIHNIPPPPPPSTSSMDPSG 389

BLAST of CmoCh04G000070 vs. NCBI nr
Match: gi|778701746|ref|XP_004140413.2| (PREDICTED: trihelix transcription factor PTL-like [Cucumis sativus])

HSP 1 Score: 474.9 bits (1221), Expect = 1.3e-130
Identity = 287/412 (69.66%), Positives = 314/412 (76.21%), Query Frame = 1

Query: 1   MSEPPTTSTEPQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVREY 60
           MS+PPTTS+EP   H      H  HL  LP+IH GA+ GT  R+NTA ATS S VIVREY
Sbjct: 1   MSDPPTTSSEPPHHH------HQQHLPRLPVIHSGATGGT--RMNTAAATSSSAVIVREY 60

Query: 61  RKGNWSLQETMILITAKKLDEERRNKAEQG------TARKGSELRWKWVENYCWSQGCER 120
           RKGNW+LQETMILITAKKLD+ERRNKA  G       ARKG ELRWKWVENYCWS GC+R
Sbjct: 61  RKGNWTLQETMILITAKKLDDERRNKANLGPSTVDPAARKGGELRWKWVENYCWSHGCQR 120

Query: 121 SQNQCNDKWDNLLRDYKKVREHESRACDQSQIPSYWKMEKHERKDNNLPSNMAFEVYQAL 180
           SQNQCNDKWDNLLRDYKKVRE+ESRACDQ QIPSYWKMEKHERKD NLPSNMAFEVYQAL
Sbjct: 121 SQNQCNDKWDNLLRDYKKVREYESRACDQ-QIPSYWKMEKHERKDKNLPSNMAFEVYQAL 180

Query: 181 NDVVQRKL---------RGVSVAVVAGPPPS---PSPAEGEAAAGTSSPAASESSSSSSS 240
           NDVVQRK           G+ +  +  PPPS   P P        T+SP  SE   SSSS
Sbjct: 181 NDVVQRKFSQKPSNSSNTGILLLPLPAPPPSALLPPP------TATNSPQLSE---SSSS 240

Query: 241 GREWGQKKEKRERKRRR----VGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKRRLQ 300
           G E  +KKEK E KRR+    +GR IERS SA+ QTL +CEEQREIRHQQLMEL+KRRLQ
Sbjct: 241 GTESSEKKEKVEAKRRKMEDNIGRRIERSVSALGQTLHSCEEQREIRHQQLMELRKRRLQ 300

Query: 301 IQEARNHIQGQGIADLVAAVANLS-GINNRRRRRRRSEEYE-CLYSGEEVRMLKEQNEAM 360
           I+E RNHI  QGIADLVAAVANLS GI+N   RR RSE YE CLYSGEEVR+LKEQNEAM
Sbjct: 301 IEETRNHIHRQGIADLVAAVANLSAGIDN--DRRGRSEGYESCLYSGEEVRILKEQNEAM 360

Query: 361 QAELSSVKTELSQLRDQMPSLVRTMMHNVMHNIPPPPP--PSTDP---GGDA 384
           QAEL +VK ELSQLRDQMPSL++TMMHN++HNIPPPPP   S DP   GGDA
Sbjct: 361 QAELMNVKNELSQLRDQMPSLMQTMMHNMLHNIPPPPPSTSSMDPSGSGGDA 392

BLAST of CmoCh04G000070 vs. NCBI nr
Match: gi|700195606|gb|KGN50783.1| (hypothetical protein Csa_5G261720 [Cucumis sativus])

HSP 1 Score: 376.3 bits (965), Expect = 6.3e-101
Identity = 222/324 (68.52%), Positives = 242/324 (74.69%), Query Frame = 1

Query: 1   MSEPPTTSTEPQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVREY 60
           MS+PPTTS+EP   H      H  HL  LP+IH GA+ GT  R+NTA ATS S VIVREY
Sbjct: 1   MSDPPTTSSEPPHHH------HQQHLPRLPVIHSGATGGT--RMNTAAATSSSAVIVREY 60

Query: 61  RKGNWSLQETMILITAKKLDEERRNKAEQG------TARKGSELRWKWVENYCWSQGCER 120
           RKGNW+LQETMILITAKKLD+ERRNKA  G       ARKG ELRWKWVENYCWS GC+R
Sbjct: 61  RKGNWTLQETMILITAKKLDDERRNKANLGPSTVDPAARKGGELRWKWVENYCWSHGCQR 120

Query: 121 SQNQCNDKWDNLLRDYKKVREHESRACDQSQIPSYWKMEKHERKDNNLPSNMAFEVYQAL 180
           SQNQCNDKWDNLLRDYKKVRE+ESRACDQ QIPSYWKMEKHERKD NLPSNMAFEVYQAL
Sbjct: 121 SQNQCNDKWDNLLRDYKKVREYESRACDQ-QIPSYWKMEKHERKDKNLPSNMAFEVYQAL 180

Query: 181 NDVVQRKL---------RGVSVAVVAGPPPS---PSPAEGEAAAGTSSPAASESSSSSSS 240
           NDVVQRK           G+ +  +  PPPS   P P        T+SP  SE   SSSS
Sbjct: 181 NDVVQRKFSQKPSNSSNTGILLLPLPAPPPSALLPPP------TATNSPQLSE---SSSS 240

Query: 241 GREWGQKKEKRERKRRR----VGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKRRLQ 300
           G E  +KKEK E KRR+    +GR IERS SA+ QTL +CEEQREIRHQQLMEL+KRRLQ
Sbjct: 241 GTESSEKKEKVEAKRRKMEDNIGRRIERSVSALGQTLHSCEEQREIRHQQLMELRKRRLQ 300

Query: 301 IQEARNHIQGQGIADLVAAVANLS 303
           I+E RNHI  QGIADLVAAVANLS
Sbjct: 301 IEETRNHIHRQGIADLVAAVANLS 306

BLAST of CmoCh04G000070 vs. NCBI nr
Match: gi|1009152822|ref|XP_015894304.1| (PREDICTED: trihelix transcription factor ASR3-like [Ziziphus jujuba])

HSP 1 Score: 267.7 bits (683), Expect = 3.1e-68
Identity = 168/327 (51.38%), Positives = 214/327 (65.44%), Query Frame = 1

Query: 1   MSEPPTTS--------TEPQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSP 60
           MSEP TTS          PQ+Q Q  QQQ PHH       H  +   + A +    ++S 
Sbjct: 1   MSEPTTTSPAATISPPVNPQQQQQ--QQQQPHH-------HRKSPHFSAASVGPTTSSST 60

Query: 61  STVIVREYRKGNWSLQETMILITAKKLDEERRNKAEQG------TARKGSELRWKWVENY 120
           ST I REYRKGNW++QET+ILITAKKLDEERR KA         T+    ELRWKWVENY
Sbjct: 61  STPIEREYRKGNWTIQETLILITAKKLDEERRYKARSAPPDPTSTSTTKGELRWKWVENY 120

Query: 121 CWSQGCERSQNQCNDKWDNLLRDYKKVREHESRACDQSQIPSYWKMEKHERKDNNLPSNM 180
           CWSQGC RS NQCNDKWDNLLRDYKKVRE+ES A  +  +PSYW MEK +RK  NLPSNM
Sbjct: 121 CWSQGCLRSSNQCNDKWDNLLRDYKKVREYESNAQSKPDLPSYWNMEKQDRKLRNLPSNM 180

Query: 181 AFEVYQALNDVVQRKLRGVSVAV----VAGPPPSPSPAEGE---AAAGTSSPAASESSSS 240
           A EV+QALN+V+QRK    + A+         PSP+P        A  TS+PA SE   S
Sbjct: 181 ALEVFQALNEVLQRKYSTQTTALRDPQTLSVSPSPAPLAARPLLPAPTTSAPAPSE--RS 240

Query: 241 SSSGREWGQKKEK----RERKRRRVGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKR 300
            SSG E  +K E+    + +K  ++  SI+RSAS +A+TL+ CEE++E RH+++ME++++
Sbjct: 241 DSSGTEASEKDEETSDTKRKKHGKISSSIKRSASLLAKTLQNCEEKKEKRHREIMEMERK 300

Query: 301 RLQIQEARNHIQGQGIADLVAAVANLS 303
           +L+I+EA N +  QG+ +LV AVANLS
Sbjct: 301 KLEIEEAHNEVNRQGMVNLVGAVANLS 316

BLAST of CmoCh04G000070 vs. NCBI nr
Match: gi|802598104|ref|XP_012072429.1| (PREDICTED: uncharacterized protein LOC105634212 [Jatropha curcas])

HSP 1 Score: 254.6 bits (649), Expect = 2.8e-64
Identity = 158/321 (49.22%), Positives = 200/321 (62.31%), Query Frame = 1

Query: 1   MSEPPTTSTEPQRQHQHEQQQHPHHLLHLPLIHGGASTGTTARINTAVATSPSTVIVREY 60
           MS+PPT+   P      + Q  P    HLPL+   A+T TT   +            REY
Sbjct: 1   MSQPPTSIPPPP-----QPQPQPQPPSHLPLLPFSATTTTTPTSSN-----------REY 60

Query: 61  RKGNWSLQETMILITAKKLDEERRNKAEQGTARKGSELRWKWVENYCWSQGCERSQNQCN 120
           RKGNW++QET+ LITAKKLD+ERR+K    +  K  ELRWKWVENYCW+ GC RSQNQCN
Sbjct: 61  RKGNWTIQETLTLITAKKLDDERRSKPTVPSTSKPGELRWKWVENYCWAHGCYRSQNQCN 120

Query: 121 DKWDNLLRDYKKVREHESRACDQSQIPSYWKMEKHERKDNNLPSNMAFEVYQALNDVVQR 180
           DKWDNLLRDYKKVRE++SR+      PSYW ME+H+RK  NLPSNM+ EV++ALN VVQR
Sbjct: 121 DKWDNLLRDYKKVREYQSRSDGSDSFPSYWTMERHQRKYYNLPSNMSLEVFEALNQVVQR 180

Query: 181 KLRGVS---------------VAVVAGPPPSP-SPAEGEAAAGTSSPAASESSSSS---S 240
           +   ++               V VVA  P SP +  E    A    PA SE S SS   S
Sbjct: 181 RYTNITQQNVVVSPQQQQQQQVTVVADVPVSPVTLREVVPEALMDRPALSEGSESSATES 240

Query: 241 SGREWGQKKEKRERKRRRVGRSIERSASAVAQTLRTCEEQREIRHQQLMELKKRRLQIQE 300
           S +      ++R  K   +G SI+ SAS +AQT+R CEE++E RHQ+LME ++RRLQ++E
Sbjct: 241 SDKHDSGGSKRRRMKNNNIGASIKHSASILAQTIRNCEEKKEKRHQELMEFEQRRLQLEE 300

Query: 301 ARNHIQGQGIADLVAAVANLS 303
            RN +  QG+A+L  AV NLS
Sbjct: 301 TRNEVNRQGMANLAMAVTNLS 305

The following BLAST results are available for this feature:
TrEMBL:
Match Name        E-value   Identity  Description
A0A0A0KME8_CUCSA  4.4e-101  68.52     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G261720 PE=4 SV=1
A0A067KQ45_JATCU  1.9e-64   49.22     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04860 PE=4 SV=1
K7N4B1_SOYBN      3.3e-64   47.27     Uncharacterized protein OS=Glycine max GN=GLYMA_20G184500 PE=4 SV=1
V7BD78_PHAVU      3.6e-63   46.67     Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_007G100500g PE=4 SV=1
B9S7A3_RICCO      6.2e-63   50.16     Transcription factor, putative OS=Ricinus communis GN=RCOM_0775080 PE=4 SV=1

TAIR10:
Match Name   E-value  Identity  Description
AT1G31310.1  1.9e-39  47.21     hydroxyproline-rich glycoprotein family protein
AT2G35640.1  2.8e-38  51.80     Homeodomain-like superfamily protein
AT2G33550.1  9.6e-07  26.64     Homeodomain-like superfamily protein

NCBI nr:
Match Name                         E-value   Identity  Description
gi|659114054|ref|XP_008456886.1|   6.9e-132  70.59     PREDICTED: uncharacterized protein LOC103496697 [Cucumis melo]
gi|778701746|ref|XP_004140413.2|   1.3e-130  69.66     PREDICTED: trihelix transcription factor PTL-like [Cucumis sativus]
gi|700195606|gb|KGN50783.1|        6.3e-101  68.52     hypothetical protein Csa_5G261720 [Cucumis sativus]
gi|1009152822|ref|XP_015894304.1|  3.1e-68   51.38     PREDICTED: trihelix transcription factor ASR3-like [Ziziphus jujuba]
gi|802598104|ref|XP_012072429.1|   2.8e-64   49.22     PREDICTED: uncharacterized protein LOC105634212 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR017877  Myb-like_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G000070.1  CmoCh04G000070.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description   Source   Source Term       Source Description   Alignment
IPR017877  Myb-like domain   PROFILE  PS50090           MYB_LIKE             coord: 57..127; score: 7
None       No IPR available  unknown  Coil              Coil                 coord: 326..353; scor
None       No IPR available  GENE3D   G3DSA:1.20.5.170                       coord: 307..355; score: 1.
None       No IPR available  PANTHER  PTHR33492         FAMILY NOT NAMED     coord: 26..323; score: 3.5
None       No IPR available  PANTHER  PTHR33492:SF5     SUBFAMILY NOT NAMED  coord: 26..323; score: 3.5
None       No IPR available  PFAM     PF13837           Myb_DNA-bind_4       coord: 64..154; score: 2.4
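The `coord` values in the InterPro annotations are positions on the protein, so locating a domain on the coding sequence requires converting residue numbers to nucleotide numbers. A minimal sketch follows, assuming 1-based inclusive residue coordinates and a CDS that starts at the first base of codon 1; `protein_to_cds` is a hypothetical helper.

```python
def protein_to_cds(start_aa: int, end_aa: int) -> tuple:
    """Map a 1-based inclusive residue range to a 1-based inclusive CDS range."""
    start_nt = (start_aa - 1) * 3 + 1  # first base of the first codon
    end_nt = end_aa * 3                # last base of the last codon
    return start_nt, end_nt

# Myb-like domain (PROFILE PS50090), coord: 57..127
print(protein_to_cds(57, 127))
```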

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CmoCh04G000070  Cucumber (Chinese Long) v3    cmocucB0893
CmoCh04G000070  Watermelon (97103) v2         cmowmbB703
CmoCh04G000070  Wax gourd                     cmowgoB0898
CmoCh04G000070  Cucumber (Chinese Long) v2    cmocuB734
CmoCh04G000070  Watermelon (Charleston Gray)  cmowcgB627