CmaCh16G007680 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh16G007680
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: KH domain-containing protein
Location: Cma_Chr16: 4214724 .. 4219757 (-)
RNA-Seq Expression: CmaCh16G007680
Synteny: CmaCh16G007680
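
The Location field in the overview can be split into its parts programmatically. The sketch below is a minimal illustration: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location such as 'Cma_Chr16: 4214724 .. 4219757 (-)' into fields.

    Assumption: coordinates are 1-based and inclusive, so the span length
    is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr16: 4214724 .. 4219757 (-)")
print(loc)
```

Under that assumption the gene spans 5034 bp on the minus strand of Cma_Chr16.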
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTAAAATCATAAATACAGTGATAGAAAAAAAAATAAATAAATAAAGGTTTTTACTTCGGGTAGAAAAAGACGCATCCACCAGAAGATTGCCAATTTCCCATTCCTCATCTCGTTCTCCATTGCCACTGACTCATTGATTGTGCGAAGAACAATACAGTAAAATTCAAAGAACAGCAAAAAGCACTCACACAACGCCACGATCTTCCCGTTCTTCTCCTTCCCCATAGGGCTTTCCTTTTTCCGCCACCGCCTCTCTTCATCGGACTTCCCTTTACCTCTCTGCCTCAGATTCATCTCCTTCTTCGCCCCCTTTCTTTCTGCAAAGGTACGCTTCTTTCTCTATCTGGGTCGCCCTGTTCTTCGTTGGATATGTTAATTTCACTCTTCTTCTCCACTTATTAACCCAATTCCACTTCTGGGTTTCTTCTATTTTGTTGTGTTCTTTAGTTTATGCGTTATGGGGGAGAGAACCCCACCTGGGAGTTACTTCCATTACCCTCCCCCTTCTGCCCATGCATCCCCTCACAGGACTCCTTCCATTCCCTTAGATCGGGAGAGGTAATTTCCCTCTCTGTTTCTCAATTCTTCATCATGATTTGCTTTTTGTTTTTGTTTTTTTTTTTTTTTTTTGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGACTGCTTGATTCAGATTCCTTGTTTGATTGATCATGTAATTCTGTTTTTAACCCACTTTTTAACTCTCTTGTGTTGGTTTGATCATCTTTGTCATTGCATTTCAATGCTGACTAGAAGCCCTATTGAGCTTTTAGTGATGTTGTGTTGTTTGACAATGGCAGATATCTAGCTGAATTACTGTCAGAGAGACAGAAGTTGGGTCCCTTTGTGCAAGTTTTACCTCATTGTAGCAGACTTCTGAATCAAGGTGGGGTTATAAGATCTCTACTATTGCATTATGTTGTTGCAATACTTGTTATTGTTTCTTATAGTTTGGATATTTCTTTCTTTGAAGTTTATTATGGGTAGACACTGTTTTCTTAATCATTTGAAAAAATTTAATTACAACCGAGCATACGATTAGAGAAGCAAACGAGTAACGATCAATTGGGAAGGAGTTTTTTACTCCCTCAGACTCATAGAAAGAAATTTGCTTCCTGTTAAAGAGAATGGACTGTTATGTGAATCATATTTCGATAATACATGAGATTAGATGATCGGAACTGAAGTTTCGTGGAGGGGCTACGAACAAGGGGACATTTTGTGGCTATGTATTGCAATGGGAATGTTGTCTGCAGTCTTTGTAAAAGAAAAATTATACTCGTCTTACTATGTTTGCCTATTACATGCAGTTAAGCGTTCTCGTTGTTTGCTGGCAGCTATTAAATACTAGCCTTATCTTATATCATTGAGTTGTAATGTGGGTCGGGTCAATCTAAGTTTAGG
CTTAGCCTGTTATCAGAGCTAGACACTGGGTGGTGGTGTGTCAGCGAGGACGATGGACCTCCAAGGAGGGTGAATTGTGAAATCCCACATGGGTTGGAGAGGAGAACGAAACATTCCTTATAAGGGTGTGGAAAACTCTTCTTAGTAGACTTGTTTTAAAAACCTTAAGGGAAATCTCGAAGGGAAACCCCAAAGAGGACAATATCTGCTAGCAGTGGGCTTGAGCTGTTACACTTAGCCTGCTATGTCCAAGAGAGTTTCATTTTCTCTGTGTGGATGGAGCCAAAGATGAAACCTATTGGCCTACTCGATACTTTGTCAAGTCGTAATTAATTAGACAAAATCAATTACCATTTTCCCTACTGTCTTGTGGCCTATAAAAGATGACGTTAACTTGTGATGATGTCATGAATCTGAGACATGAGACATGATCTTGAGAAAAGTTCCCAATGACTGGTAATTTTTCATGAATAACTAGCCATTTATCTTCAAAGTCGTCATCTTACACGAGTATTTGCTTAGTCAGTTGCACGTGCACTCCTTATATAAGACGAGTATTTGCTTAGTCAGTTGCACGTGCACTCCTTATATAAGACGAGTATTTTCTTAGTCAGTTGCATGTGCACCCCTATTTATTACATCTCATTTCAGAGATCAAACGTCTATCGGGCCTTAATCAAACTTCTGTGGATCATGAGAGATTTGAGCACGGGAGTCCTTACCGTTCACTAGGTCAGCTCTCGAATGGAAGACCAATGGACTTGGAAGGTTGGCCTCCAATGCAAATGGAGGTAAATATATATTTGTCCTTGATCTACTTCTTTTCCGTGTTGCTAGTTTTTCGATATGAATGCTTTCTCTTTAATCCCTATTTTCTTTGTATAAGGATATTTTTCGAAAAATGGAAAACTTAACTTTCTTTTAAATGAATGAAAGAATACAATACAGAAAGACAATAGCCTAGATTCGAGATACAAGTGTTTCGAGTTATTAAGTTTTCATCTCGGGGGCTTGAGTTCAGAACTTCGTAGGGTATCTCCAAACATCCCGAGCCATTACTACTGGGGAGTTCCTTGGTTCTAACCTGTGTATCCTCAAAATAGAATAAGTAGTTCTTCGTGTTTTAAAACATAACCATAGAATAAGTAACATACCTATGAGCAACATGTGTGCAACACTTGGACACTCTATGTAACAGCCTAAGCCCACCGCTAGCAGCAGATATTGTCCTTTTGGTCTTTCTCTTTTGGGTTTCCCTTTCAAGGTTTGGACACTCTATGTAACAGCATAAGCCCACCGCTAGCAGCAGATATTATCCTTTTGGTCTTTCTCTAACGAGTTTCCCTTCAAGGTTTAGACACTCTATGTAACAACCTAAGCTCACCGCTAGCAGATATTGTCCTTTTGGTCTTTCCCTTTCGGGTTTCCCTTCAAGGTTTGGACACTCTATGTAACAACCTAAGCCCACCACTAGCAGATATTGTCCTTTTGGTCTTTCCCTTTCGGGTTTCCCTCCAAGGTTTGGACACTTTATGTAACAGCCTAAGCCCACCACCCCTTTAAGGTTTTAGACGCGTTTTAAAAAGCTTGAAGGGAAACCCGAAAAGGAAAGACCAAAAGGACAATATCTGCTAGCAGTGGGCTTGGACTGTTTCACTCTAAAACTTTATTGACGTGTATCACACACTTAGCACAATAGACATGCATTAGACACTAGTTGTAAACAATGAATAAGGACCAAACATTTGTTAGACACATATACGACACTTGTTTAAACATAATAAATGTAACCGCCCAAGCCCACTGCTAACAGATATTGTCCTCTTTGGGCTTTTCCTTTCAAGCTTCCCCTCAAAGTTTTTAAAACGCGTTTGCTAGGGAGAAATTTCCACACCCTCATAAAGAATGGTTCGGTCTCCTTCCCAATCGATGTGGGATCTCATAATAAATAGACACGAATGGTAATAGAACAAAAATTAATAGATTTTTGGAGTGAAATA
CATCAAACTCATACATGCATACATTTCAATAAATGTGTACTTATCGTATCCGTGTCCTTGATTTTTTTAAAAAACCCATGCTTCTTATTGACCTGTTTACTGCAGGGAAGTGGACATGTCCATGGTATGGGTTCGCTTCAAGCTCACTCGATGGGTTGGCCAAGGGTGCAAGGAGGAATTCCTGCTACGCCGGTAGTTAAAAGAGTCGTTAGACTTGATGTACCTGTTGACAAATATCCAAACGTGAGTCGATATTCTTACGTTAGGTGGTGATATTACTGACCTTTTGCCTCTCTGATGTTCGTAGTTTGTTGCTTAAATCATGTTAGTATAACTTCGTTGGACGACTTCTCGGACCGCGCGGAAACTCCTTGAAAAGGGTCGAAGCCTTAACGGAATGTCGGGTGTACATAAGAGGCAAGGGCTCCATCAAGGATGCTTTGGAGGTACTAAAAATTTTCAATCTTAGTCCTAACTTAAATCTTCAATATTACTTACAATGGATCTGGATGTTGTGCTTTAGGAAGAGAAACTAAAGGACAAGCCTGGATATGAGCATCTTAATGAGCCCTTGCATCTGTTGGTTGAGGCAGAATTCCCGGAGGATACGATAAACTCGCGCTTGGATCATGCCGTAGCCGTTTTAGAAAGCCTATTGAAGCCTGTGGTATGAACTAAACAACGCCCACATCAACTAAACCTCTGATCGTTTTATTATGAAGTCGTCTTCTTATGGAGTTGTTGTGTATTCCAGGATGAATTGCTTGATCAATATAAGAAGCAACAACTAAGAGAACTCGCATTACTAAATGGCACCCTAAGGGAGGAAAGTCCGAGTATGAGCCCGAGCATGTCACCGTTTAACAGCACGGGACTGAAACGGGCCAAGACAGGGAGGTAAGGTAAGGTAAGGTAAAGGAGGAGTGATGACGAGCTTTGTGGTTGACACTGTCTGTTGAAACCATCTTGAGAGCTTTTGGTGCCTTCTTGTCTTCAGCAGATGGTCAAGAATTTTGCTGATTTTGCTGTGCTAA

mRNA sequence

ATGGGCTTTCCTTTTTCCGCCACCGCCTCTCTTCATCGGACTTCCCTTTACCTCTCTGCCTCAGATTCATCTCCTTCTTCGCCCCCTTTCTTTCTGCAAAGTTTATGCGTTATGGGGGAGAGAACCCCACCTGGGAGTTACTTCCATTACCCTCCCCCTTCTGCCCATGCATCCCCTCACAGGACTCCTTCCATTCCCTTAGATCGGGAGAGATATCTAGCTGAATTACTGTCAGAGAGACAGAAGTTGGGTCCCTTTGTGCAAGTTTTACCTCATTGTAGCAGACTTCTGAATCAAGAGATCAAACGTCTATCGGGCCTTAATCAAACTTCTGTGGATCATGAGAGATTTGAGCACGGGAGTCCTTACCGTTCACTAGGTCAGCTCTCGAATGGAAGACCAATGGACTTGGAAGGTTGGCCTCCAATGCAAATGGAGGGAAGTGGACATGTCCATGGTATGGGTTCGCTTCAAGCTCACTCGATGGGTTGGCCAAGGGTGCAAGGAGGAATTCCTGCTACGCCGGTAGTTAAAAGAGTCGTTAGACTTGATGTACCTGTTGACAAATATCCAAACTATAACTTCGTTGGACGACTTCTCGGACCGCGCGGAAACTCCTTGAAAAGGGTCGAAGCCTTAACGGAATGTCGGGTGTACATAAGAGGCAAGGGCTCCATCAAGGATGCTTTGGAGGAAGAGAAACTAAAGGACAAGCCTGGATATGAGCATCTTAATGAGCCCTTGCATCTGTTGGTTGAGGCAGAATTCCCGGAGGATACGATAAACTCGCGCTTGGATCATGCCGTAGCCGTTTTAGAAAGCCTATTGAAGCCTGTGGATGAATTGCTTGATCAATATAAGAAGCAACAACTAAGAGAACTCGCATTACTAAATGGCACCCTAAGGGAGGAAAGTCCGAGTATGAGCCCGAGCATGTCACCGTTTAACAGCACGGGACTGAAACGGGCCAAGACAGGGAGCAGATGGTCAAGAATTTTGCTGATTTTGCTGTGCTAA

Coding sequence (CDS)

ATGGGCTTTCCTTTTTCCGCCACCGCCTCTCTTCATCGGACTTCCCTTTACCTCTCTGCCTCAGATTCATCTCCTTCTTCGCCCCCTTTCTTTCTGCAAAGTTTATGCGTTATGGGGGAGAGAACCCCACCTGGGAGTTACTTCCATTACCCTCCCCCTTCTGCCCATGCATCCCCTCACAGGACTCCTTCCATTCCCTTAGATCGGGAGAGATATCTAGCTGAATTACTGTCAGAGAGACAGAAGTTGGGTCCCTTTGTGCAAGTTTTACCTCATTGTAGCAGACTTCTGAATCAAGAGATCAAACGTCTATCGGGCCTTAATCAAACTTCTGTGGATCATGAGAGATTTGAGCACGGGAGTCCTTACCGTTCACTAGGTCAGCTCTCGAATGGAAGACCAATGGACTTGGAAGGTTGGCCTCCAATGCAAATGGAGGGAAGTGGACATGTCCATGGTATGGGTTCGCTTCAAGCTCACTCGATGGGTTGGCCAAGGGTGCAAGGAGGAATTCCTGCTACGCCGGTAGTTAAAAGAGTCGTTAGACTTGATGTACCTGTTGACAAATATCCAAACTATAACTTCGTTGGACGACTTCTCGGACCGCGCGGAAACTCCTTGAAAAGGGTCGAAGCCTTAACGGAATGTCGGGTGTACATAAGAGGCAAGGGCTCCATCAAGGATGCTTTGGAGGAAGAGAAACTAAAGGACAAGCCTGGATATGAGCATCTTAATGAGCCCTTGCATCTGTTGGTTGAGGCAGAATTCCCGGAGGATACGATAAACTCGCGCTTGGATCATGCCGTAGCCGTTTTAGAAAGCCTATTGAAGCCTGTGGATGAATTGCTTGATCAATATAAGAAGCAACAACTAAGAGAACTCGCATTACTAAATGGCACCCTAAGGGAGGAAAGTCCGAGTATGAGCCCGAGCATGTCACCGTTTAACAGCACGGGACTGAAACGGGCCAAGACAGGGAGCAGATGGTCAAGAATTTTGCTGATTTTGCTGTGCTAA

Protein sequence

MGFPFSATASLHRTSLYLSASDSSPSSPPFFLQSLCVMGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIKRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSLQAHSMGWPRVQGGIPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGSRWSRILLILLC
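
The relationship between the CDS and the protein sequence above can be checked by translating codons with the standard genetic code. The following is a minimal sketch rather than the pipeline that produced this record: `translate` and `CODON_TABLE` are illustrative names, the standard code (NCBI table 1) is assumed, and only the first 27 and last 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons containing ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_head = "ATGGGCTTTCCTTTTTCCGCCACCGCC"   # first 27 nt of the CDS
cds_tail = "ATTTTGCTGATTTTGCTGTGCTAA"      # last 24 nt of the CDS, including the stop codon
print(translate(cds_head))  # MGFPFSATA
print(translate(cds_tail))  # ILLILLC
```

Both fragments agree with the start (MGFPFSATA...) and end (...ILLILLC) of the protein sequence listed for this gene.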
Homology
BLAST of CmaCh16G007680 vs. ExPASy Swiss-Prot
Match: Q8GWR3 (KH domain-containing protein At1g09660/At1g09670 OS=Arabidopsis thaliana OX=3702 GN=At1g09660/At1g09670 PE=2 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 2.6e-98
Identity = 188/295 (63.73%), Positives = 227/295 (76.95%), Query Frame = 0

Query: 38  MGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLL 97
           M ER  PGS+F YP     ASP+R+P  P DRERYL ELL ERQKLGPF+QV+P+C RLL
Sbjct: 11  MEERISPGSFFQYPLSGFRASPNRSPCPPSDRERYLTELLQERQKLGPFLQVMPNCCRLL 70

Query: 98  NQEIKRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSL 157
           N EI+R+S       D +R+EHGSP+RSLGQ +NG+ +DLEGW  MQ E + H+      
Sbjct: 71  NHEIRRVSSF----PDLDRYEHGSPFRSLGQPTNGK-LDLEGWSMMQAEENCHLQRASPF 130

Query: 158 QAHS-MGWPRVQGGIPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 217
           +  S +GW  +  G+P  P+VK+V+RLDVPVDKYP+YNFVGR+LGPRGNSLKRVE  T C
Sbjct: 131 RGPSPVGWIGMP-GLPNPPIVKKVIRLDVPVDKYPSYNFVGRILGPRGNSLKRVELATHC 190

Query: 218 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLL 277
           RV+IRG+GS+KD ++EEKLK KPGYEHL EPLH+L+EAE PED INSRL+HAV  LESLL
Sbjct: 191 RVFIRGRGSVKDTVKEEKLKGKPGYEHLCEPLHVLIEAELPEDIINSRLEHAVHFLESLL 250

Query: 278 KPVDELLDQYKKQQLRELALLNGTLREESPS------MSPSMSPFNSTGLKRAKT 326
           KP+DE +D YK++QL+ELA LNGTLREESPS      +SPSMSPFNS   KRAKT
Sbjct: 251 KPMDESMDHYKREQLKELAALNGTLREESPSPSLSPCLSPSMSPFNS---KRAKT 296
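
The percentages in each HSP summary line follow directly from the counts: identities (or positives) divided by the alignment length. A minimal sketch for parsing such a line is shown here; `parse_hsp_stats` is a hypothetical helper, and the pattern tolerates both "Positives" and the "Postives" spelling that appears in this report.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identities, align_len = int(m.group(1)), int(m.group(2))
    positives = int(m.group(4))
    return {
        "identities": identities,
        "positives": positives,
        "align_len": align_len,
        "pct_identity": round(100 * identities / align_len, 2),
        "pct_positive": round(100 * positives / align_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 188/295 (63.73%), Postives = 227/295 (76.95%), Query Frame = 0"
)
print(stats)
```

For the top Swiss-Prot hit this reproduces the reported 63.73% identity and 76.95% positives over 295 aligned positions.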

BLAST of CmaCh16G007680 vs. ExPASy Swiss-Prot
Match: Q8GYR4 (KH domain-containing protein At3g08620 OS=Arabidopsis thaliana OX=3702 GN=At3g08620 PE=2 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 1.1e-75
Identity = 161/288 (55.90%), Positives = 197/288 (68.40%), Query Frame = 0

Query: 46  SYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIKRLS 105
           +Y ++ P  A +   RTPS  +D  +Y+++LL+E QKLGPF+QVLP CSRLLNQEI R++
Sbjct: 6   NYNNFSPSRAASPQIRTPSSDVD-SQYISQLLAEHQKLGPFMQVLPICSRLLNQEIFRIT 65

Query: 106 GL--NQTSVDHERFEH--GSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSLQAHS 165
           G+  NQ   D +R  H   SP  S   +SN     L GW  +  E  G  HGM      +
Sbjct: 66  GMMPNQGFTDFDRLRHRSPSPMASPNLMSNVSGGGLGGWNGLPPERIGGPHGM------A 125

Query: 166 MGWPRVQGGIPATP---VVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRV 225
           M W     G PA+P    VKR++RLD+PVD YPN+NFVGRLLGPRGNSLKRVEA T CRV
Sbjct: 126 MEWQ----GAPASPSSYPVKRILRLDLPVDTYPNFNFVGRLLGPRGNSLKRVEATTGCRV 185

Query: 226 YIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKP 285
           YIRGKGSIKD  +EEKLK KPGYEHLNE LH+L+EA+ P D ++ +L  A  ++E L+KP
Sbjct: 186 YIRGKGSIKDPEKEEKLKGKPGYEHLNEQLHILIEADLPIDIVDIKLRQAQEIIEELVKP 245

Query: 286 VDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTG 327
           VDE  D  K+QQLRELALLN  LRE SP  S S+SPFNS  +KR KTG
Sbjct: 246 VDESQDYIKRQQLRELALLNSNLRENSPGPSGSVSPFNSNAMKRPKTG 282

BLAST of CmaCh16G007680 vs. ExPASy Swiss-Prot
Match: Q0WLR1 (KH domain-containing protein At4g26480 OS=Arabidopsis thaliana OX=3702 GN=At4g26480 PE=2 SV=1)

HSP 1 Score: 282.0 bits (720), Expect = 9.2e-75
Identity = 156/292 (53.42%), Positives = 199/292 (68.15%), Query Frame = 0

Query: 45  GSYFHYP-----PPSAHASPH------RTPSIPLDRERYLAELLSERQKLGPFVQVLPHC 104
           G +  YP     PPSA  SP+        PS  +++E+YL+ELL+ER KL PF+ VLPH 
Sbjct: 23  GRFVTYPPPLSVPPSAPQSPNFSGGLRSQPSFLVEQEKYLSELLAERHKLTPFLPVLPHV 82

Query: 105 SRLLNQEIKRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHG 164
            RL+NQEI R++ L + ++   RF+H SP  S G   N R  D+ GW   Q      V  
Sbjct: 83  CRLMNQEILRVTTLLENALSQSRFDHPSPLASGGIFQNSR-ADMNGW-ASQFPSERSV-- 142

Query: 165 MGSLQAHSMGWPRVQGGIPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEAL 224
             S   + +  P    G+    +VKR +R+D+PVDKYPNYNFVGRLLGPRGNSLKRVEA 
Sbjct: 143 SSSPAPNWLNSPGSSSGL----IVKRTIRVDIPVDKYPNYNFVGRLLGPRGNSLKRVEAS 202

Query: 225 TECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLE 284
           T+CRV IRG+GSIKD ++E+ ++ KPGYEHLNEPLH+LVEAE P + +++RL  A  +L+
Sbjct: 203 TDCRVLIRGRGSIKDPIKEDMMRGKPGYEHLNEPLHILVEAELPIEIVDARLMQAREILD 262

Query: 285 SLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKT 326
            LL PV+E  D YKKQQLRELALLNG+LREE   MS S+SP+NS G+KRAKT
Sbjct: 263 DLLTPVEETHDFYKKQQLRELALLNGSLREEGSPMSGSISPYNSLGMKRAKT 306

BLAST of CmaCh16G007680 vs. ExPASy Swiss-Prot
Match: Q75GR5 (KH domain-containing protein SPIN1 OS=Oryza sativa subsp. japonica OX=39947 GN=SPIN1 PE=1 SV=1)

HSP 1 Score: 273.9 bits (699), Expect = 2.5e-72
Identity = 156/281 (55.52%), Positives = 194/281 (69.04%), Query Frame = 0

Query: 53  PSAHASPHRTPSIPLDRE-RYLAELLSERQKLGPFVQVLPHCSRLLNQEIKRLSGLNQT- 112
           P+ + SP +  S P D + +YLAELL+E QKLGPF+QVLP CS+LL+QEI R+S +    
Sbjct: 11  PARNLSP-QIRSNPTDVDSQYLAELLAEHQKLGPFMQVLPICSKLLSQEIMRVSSIVHNH 70

Query: 113 ---SVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSLQAHSMGWPRV 172
                D  RF   SP  S    SN        W  +      H   +G  Q  SM W   
Sbjct: 71  GFGDFDRHRFRSPSPMSSPNPRSNRSGNGFSPWNGL------HQERLGFPQGTSMDW--- 130

Query: 173 QGG--IPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGKGS 232
           QG    P++ VVK+++RLDVPVD YPN+NFVGR+LGPRGNSLKRVEA T CRV+IRGKGS
Sbjct: 131 QGAPPSPSSHVVKKILRLDVPVDSYPNFNFVGRILGPRGNSLKRVEASTGCRVFIRGKGS 190

Query: 233 IKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELLDQ 292
           IKD  +E+KL+ KPGYEHL++PLH+L+EAEFP   I++RL HA  V+E LLKPVDE  D 
Sbjct: 191 IKDPGKEDKLRGKPGYEHLSDPLHILIEAEFPASIIDARLRHAQEVIEELLKPVDESQDF 250

Query: 293 YKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTG 327
           YK+QQLRELA+LN TLRE+SP    S+SPF++ G+KRAKTG
Sbjct: 251 YKRQQLRELAMLNSTLREDSPHPG-SVSPFSNGGMKRAKTG 280

BLAST of CmaCh16G007680 vs. ExPASy Swiss-Prot
Match: Q9FKT4 (KH domain-containing protein At5g56140 OS=Arabidopsis thaliana OX=3702 GN=At5g56140 PE=2 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 2.8e-71
Identity = 150/282 (53.19%), Positives = 190/282 (67.38%), Query Frame = 0

Query: 52  PPSAHASPHRT------PSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIKRLS 111
           PPSA  SP+ +       S+ +++E+YL+ELL+ER KL PF+ VLPH  RLLNQEI R++
Sbjct: 39  PPSAPQSPNYSGGLRSQSSVFVEQEKYLSELLAERHKLTPFLPVLPHAFRLLNQEILRVT 98

Query: 112 GL--NQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSLQAHSMG 171
            L  N T +     +H SP  S G   N R  D+ GW              G    +S  
Sbjct: 99  TLLENATVLSQSGLDHPSPLASGGIFQNAR-ADMNGWASQFPSERSVPSSPGPNWLNS-- 158

Query: 172 WPRVQGGIPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGK 231
            P    G+    + KR +R+D+PVD YPN+NFVGRLLGPRGNSLKRVEA T+CRV IRG+
Sbjct: 159 -PGSSSGL----IAKRTIRVDIPVDNYPNFNFVGRLLGPRGNSLKRVEASTDCRVLIRGR 218

Query: 232 GSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELL 291
           GSIKD ++EE ++ KPGYEHLNEPLH+LVEAE P + +++RL  A  +L+ LL P++E  
Sbjct: 219 GSIKDPIKEEMMRGKPGYEHLNEPLHILVEAELPIEIVDARLMQAREILDDLLTPMEETH 278

Query: 292 DQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKT 326
           D YKKQQLRELALLNGTLREE   MS S+SP+NS G+KRAKT
Sbjct: 279 DMYKKQQLRELALLNGTLREEGSPMSGSVSPYNSLGMKRAKT 312

BLAST of CmaCh16G007680 vs. TAIR 10
Match: AT1G09660.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 360.1 bits (923), Expect = 1.9e-99
Identity = 188/295 (63.73%), Positives = 227/295 (76.95%), Query Frame = 0

Query: 38  MGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLL 97
           M ER  PGS+F YP     ASP+R+P  P DRERYL ELL ERQKLGPF+QV+P+C RLL
Sbjct: 11  MEERISPGSFFQYPLSGFRASPNRSPCPPSDRERYLTELLQERQKLGPFLQVMPNCCRLL 70

Query: 98  NQEIKRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSL 157
           N EI+R+S       D +R+EHGSP+RSLGQ +NG+ +DLEGW  MQ E + H+      
Sbjct: 71  NHEIRRVSSF----PDLDRYEHGSPFRSLGQPTNGK-LDLEGWSMMQAEENCHLQRASPF 130

Query: 158 QAHS-MGWPRVQGGIPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 217
           +  S +GW  +  G+P  P+VK+V+RLDVPVDKYP+YNFVGR+LGPRGNSLKRVE  T C
Sbjct: 131 RGPSPVGWIGMP-GLPNPPIVKKVIRLDVPVDKYPSYNFVGRILGPRGNSLKRVELATHC 190

Query: 218 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLL 277
           RV+IRG+GS+KD ++EEKLK KPGYEHL EPLH+L+EAE PED INSRL+HAV  LESLL
Sbjct: 191 RVFIRGRGSVKDTVKEEKLKGKPGYEHLCEPLHVLIEAELPEDIINSRLEHAVHFLESLL 250

Query: 278 KPVDELLDQYKKQQLRELALLNGTLREESPS------MSPSMSPFNSTGLKRAKT 326
           KP+DE +D YK++QL+ELA LNGTLREESPS      +SPSMSPFNS   KRAKT
Sbjct: 251 KPMDESMDHYKREQLKELAALNGTLREESPSPSLSPCLSPSMSPFNS---KRAKT 296

BLAST of CmaCh16G007680 vs. TAIR 10
Match: AT1G09660.2 (RNA-binding KH domain-containing protein )

HSP 1 Score: 303.5 bits (776), Expect = 2.1e-82
Identity = 153/243 (62.96%), Positives = 187/243 (76.95%), Query Frame = 0

Query: 38  MGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLL 97
           M ER  PGS+F YP     ASP+R+P  P DRERYL ELL ERQKLGPF+QV+P+C RLL
Sbjct: 11  MEERISPGSFFQYPLSGFRASPNRSPCPPSDRERYLTELLQERQKLGPFLQVMPNCCRLL 70

Query: 98  NQEIKRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSL 157
           N EI+R+S       D +R+EHGSP+RSLGQ +NG+ +DLEGW  MQ E + H+      
Sbjct: 71  NHEIRRVSSF----PDLDRYEHGSPFRSLGQPTNGK-LDLEGWSMMQAEENCHLQRASPF 130

Query: 158 QAHS-MGWPRVQGGIPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 217
           +  S +GW  +  G+P  P+VK+V+RLDVPVDKYP+YNFVGR+LGPRGNSLKRVE  T C
Sbjct: 131 RGPSPVGWIGMP-GLPNPPIVKKVIRLDVPVDKYPSYNFVGRILGPRGNSLKRVELATHC 190

Query: 218 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLL 277
           RV+IRG+GS+KD ++EEKLK KPGYEHL EPLH+L+EAE PED INSRL+HAV  LESLL
Sbjct: 191 RVFIRGRGSVKDTVKEEKLKGKPGYEHLCEPLHVLIEAELPEDIINSRLEHAVHFLESLL 247

Query: 278 KPV 280
           KP+
Sbjct: 251 KPM 247

BLAST of CmaCh16G007680 vs. TAIR 10
Match: AT3G08620.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 285.0 bits (728), Expect = 7.7e-77
Identity = 161/288 (55.90%), Positives = 197/288 (68.40%), Query Frame = 0

Query: 46  SYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIKRLS 105
           +Y ++ P  A +   RTPS  +D  +Y+++LL+E QKLGPF+QVLP CSRLLNQEI R++
Sbjct: 6   NYNNFSPSRAASPQIRTPSSDVD-SQYISQLLAEHQKLGPFMQVLPICSRLLNQEIFRIT 65

Query: 106 GL--NQTSVDHERFEH--GSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSLQAHS 165
           G+  NQ   D +R  H   SP  S   +SN     L GW  +  E  G  HGM      +
Sbjct: 66  GMMPNQGFTDFDRLRHRSPSPMASPNLMSNVSGGGLGGWNGLPPERIGGPHGM------A 125

Query: 166 MGWPRVQGGIPATP---VVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRV 225
           M W     G PA+P    VKR++RLD+PVD YPN+NFVGRLLGPRGNSLKRVEA T CRV
Sbjct: 126 MEWQ----GAPASPSSYPVKRILRLDLPVDTYPNFNFVGRLLGPRGNSLKRVEATTGCRV 185

Query: 226 YIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKP 285
           YIRGKGSIKD  +EEKLK KPGYEHLNE LH+L+EA+ P D ++ +L  A  ++E L+KP
Sbjct: 186 YIRGKGSIKDPEKEEKLKGKPGYEHLNEQLHILIEADLPIDIVDIKLRQAQEIIEELVKP 245

Query: 286 VDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTG 327
           VDE  D  K+QQLRELALLN  LRE SP  S S+SPFNS  +KR KTG
Sbjct: 246 VDESQDYIKRQQLRELALLNSNLRENSPGPSGSVSPFNSNAMKRPKTG 282

BLAST of CmaCh16G007680 vs. TAIR 10
Match: AT4G26480.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 282.0 bits (720), Expect = 6.5e-76
Identity = 156/292 (53.42%), Positives = 199/292 (68.15%), Query Frame = 0

Query: 45  GSYFHYP-----PPSAHASPH------RTPSIPLDRERYLAELLSERQKLGPFVQVLPHC 104
           G +  YP     PPSA  SP+        PS  +++E+YL+ELL+ER KL PF+ VLPH 
Sbjct: 23  GRFVTYPPPLSVPPSAPQSPNFSGGLRSQPSFLVEQEKYLSELLAERHKLTPFLPVLPHV 82

Query: 105 SRLLNQEIKRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHG 164
            RL+NQEI R++ L + ++   RF+H SP  S G   N R  D+ GW   Q      V  
Sbjct: 83  CRLMNQEILRVTTLLENALSQSRFDHPSPLASGGIFQNSR-ADMNGW-ASQFPSERSV-- 142

Query: 165 MGSLQAHSMGWPRVQGGIPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEAL 224
             S   + +  P    G+    +VKR +R+D+PVDKYPNYNFVGRLLGPRGNSLKRVEA 
Sbjct: 143 SSSPAPNWLNSPGSSSGL----IVKRTIRVDIPVDKYPNYNFVGRLLGPRGNSLKRVEAS 202

Query: 225 TECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLE 284
           T+CRV IRG+GSIKD ++E+ ++ KPGYEHLNEPLH+LVEAE P + +++RL  A  +L+
Sbjct: 203 TDCRVLIRGRGSIKDPIKEDMMRGKPGYEHLNEPLHILVEAELPIEIVDARLMQAREILD 262

Query: 285 SLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKT 326
            LL PV+E  D YKKQQLRELALLNG+LREE   MS S+SP+NS G+KRAKT
Sbjct: 263 DLLTPVEETHDFYKKQQLRELALLNGSLREEGSPMSGSISPYNSLGMKRAKT 306

BLAST of CmaCh16G007680 vs. TAIR 10
Match: AT5G56140.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 270.4 bits (690), Expect = 2.0e-72
Identity = 150/282 (53.19%), Positives = 190/282 (67.38%), Query Frame = 0

Query: 52  PPSAHASPHRT------PSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIKRLS 111
           PPSA  SP+ +       S+ +++E+YL+ELL+ER KL PF+ VLPH  RLLNQEI R++
Sbjct: 39  PPSAPQSPNYSGGLRSQSSVFVEQEKYLSELLAERHKLTPFLPVLPHAFRLLNQEILRVT 98

Query: 112 GL--NQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSLQAHSMG 171
            L  N T +     +H SP  S G   N R  D+ GW              G    +S  
Sbjct: 99  TLLENATVLSQSGLDHPSPLASGGIFQNAR-ADMNGWASQFPSERSVPSSPGPNWLNS-- 158

Query: 172 WPRVQGGIPATPVVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGK 231
            P    G+    + KR +R+D+PVD YPN+NFVGRLLGPRGNSLKRVEA T+CRV IRG+
Sbjct: 159 -PGSSSGL----IAKRTIRVDIPVDNYPNFNFVGRLLGPRGNSLKRVEASTDCRVLIRGR 218

Query: 232 GSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELL 291
           GSIKD ++EE ++ KPGYEHLNEPLH+LVEAE P + +++RL  A  +L+ LL P++E  
Sbjct: 219 GSIKDPIKEEMMRGKPGYEHLNEPLHILVEAELPIEIVDARLMQAREILDDLLTPMEETH 278

Query: 292 DQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKT 326
           D YKKQQLRELALLNGTLREE   MS S+SP+NS G+KRAKT
Sbjct: 279 DMYKKQQLRELALLNGTLREEGSPMSGSVSPYNSLGMKRAKT 312

The following BLAST results are available for this feature:
vs. ExPASy Swiss-Prot:
Match Name   E-value   Identity  Description
Q8GWR3       2.6e-98   63.73     KH domain-containing protein At1g09660/At1g09670 OS=Arabidopsis thaliana OX=3702...
Q8GYR4       1.1e-75   55.90     KH domain-containing protein At3g08620 OS=Arabidopsis thaliana OX=3702 GN=At3g08...
Q0WLR1       9.2e-75   53.42     KH domain-containing protein At4g26480 OS=Arabidopsis thaliana OX=3702 GN=At4g26...
Q75GR5       2.5e-72   55.52     KH domain-containing protein SPIN1 OS=Oryza sativa subsp. japonica OX=39947 GN=S...
Q9FKT4       2.8e-71   53.19     KH domain-containing protein At5g56140 OS=Arabidopsis thaliana OX=3702 GN=At5g56...

vs. TAIR 10:
Match Name   E-value   Identity  Description
AT1G09660.1  1.9e-99   63.73     RNA-binding KH domain-containing protein
AT1G09660.2  2.1e-82   62.96     RNA-binding KH domain-containing protein
AT3G08620.1  7.7e-77   55.90     RNA-binding KH domain-containing protein
AT4G26480.1  6.5e-76   53.42     RNA-binding KH domain-containing protein
AT5G56140.1  2.0e-72   53.19     RNA-binding KH domain-containing protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term      Source Description                            Alignment
IPR004087  K Homology domain                      SMART        SM00322          kh_6                                          coord: 177..277; e-value: 0.0017; score: 27.6
IPR036612  K Homology domain, type 1 superfamily  GENE3D       3.30.1370.10     K Homology domain, type 1                     coord: 66..309; e-value: 3.6E-68; score: 231.6
IPR036612  K Homology domain, type 1 superfamily  SUPERFAMILY  54791            Eukaryotic type KH-domain (KH-domain type I)  coord: 183..302
IPR032377  STAR protein, homodimerisation region  PFAM         PF16544          STAR_dimer                                    coord: 71..105; e-value: 3.2E-10; score: 39.5
IPR045071  KH domain-containing BBP-like          PANTHER      PTHR11208        RNA-BINDING PROTEIN RELATED                   coord: 44..326
None       No IPR available                       PANTHER      PTHR11208:SF101  OS01G0886300 PROTEIN                          coord: 44..326
None       No IPR available                       CDD          cd02395          SF1_like-KH                                   coord: 182..302; e-value: 8.66235E-52; score: 165.495
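
The `coord` values in the InterPro annotations can be used to cut the corresponding region out of the protein sequence. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the protein listed for this gene; that convention is consistent with the BLAST query numbering on this page but is not stated explicitly. `domain_slice` is a hypothetical helper, and only the first 105 residues of the protein are embedded to keep the example short.

```python
def domain_slice(protein, coord):
    """Return the residues covered by a 'start..end' coordinate string.

    Assumption: start and end are 1-based and inclusive.
    """
    start, end = (int(part) for part in coord.split(".."))
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"coordinates {coord} fall outside the sequence")
    return protein[start - 1:end]

# First 105 residues of the CmaCh16G007680 protein sequence.
N_TERM = (
    "MGFPFSATASLHRTSLYLSASDSSPSSPPFFLQSLCV"
    "MGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLL"
    "NQEIKRLS"
)

# PFAM PF16544 (STAR_dimer), coord: 71..105
star_dimer = domain_slice(N_TERM, "71..105")
print(star_dimer, len(star_dimer))
```

Under the stated assumption, the STAR_dimer match covers 35 residues beginning at the RYLAELL motif.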

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh16G007680.1  CmaCh16G007680.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0048024 regulation of mRNA splicing, via spliceosome
cellular_component GO:0005634 nucleus
molecular_function GO:0003729 mRNA binding
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003723 RNA binding
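
For downstream use, the GO assignments above can be grouped by category. This is a minimal sketch that assumes each row has the form "category accession term name" separated by whitespace, as in the listing above; `group_go_terms` is an illustrative name, not part of any database API.

```python
from collections import defaultdict

GO_LINES = """\
biological_process GO:0048024 regulation of mRNA splicing, via spliceosome
cellular_component GO:0005634 nucleus
molecular_function GO:0003729 mRNA binding
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003723 RNA binding
"""

def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split only twice so the term name keeps its internal spaces.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)

go = group_go_terms(GO_LINES)
for category, terms in go.items():
    print(category, terms)
```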