Sgr029862 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029862
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: KH domain-containing protein
Location: tig00153552: 1267261 .. 1272504 (+)
RNA-Seq Expression: Sgr029862
Synteny: Sgr029862
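
The Location field can be unpacked mechanically. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention for genome feature locations but is not stated on this page. The names `parse_location` and `span_length` are illustrative and do not belong to any database API.

```python
import re

# Location string as shown in the Overview.
LOCATION = "tig00153552: 1267261 .. 1272504 (+)"


def parse_location(text):
    """Split 'contig: start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location(LOCATION)
print(contig, strand, span_length(start, end))
```

Under that assumption the gene spans 5244 bp on the plus strand of tig00153552.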
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAGAAAAAGGTAAGGTTTTTACTTTGGGTAGAAAAAGACGCATCCCCCATTGCCAATTGCCATTCCATTGCCACATACTGACTGATTGCGAGAAGAAGAATACAGTAAAAGCACACCCCACGACACGATCTTCTCTTCCCCTCTTCCTTCCTCTTCAGCGTAGGGCTTTCCCTCCTCTGCTTTCCATTTTCCTCTCGAGCTACCACCGCCACCGCCACCGCCACCCCCACCGCCAATACAGGGAGCCCATCAAACGGACTATAATCTCTCTGCCTCCTGATTCGTCATCTACCTCGCTGTTTCTGGAAAGGTATGCGTCCATCTATCTATTTCTCTTTCTTGGTCGCCCCCTGCAATCTCTGTGTTTGTTGGAAATGTATCTCACTCTTCTTCTGGGTTCTGACAATTGTACTTGTGCCGTGTTCTGTAGTTTATGCGTGATGGGGGAGAGAACCCCACCTGGGAGTTACTTCCATTACCCTCCCCCTTCAGCCCACGCATCTCCACACAGGACTCCTTCCATCCCCTCAGATCGGGAAAGGTAATCTTCCTTTGTCTATCTTGTCAATTCTTGATAATGGTTTCTTTTTATGTGGTACTGCTTGATTCAGATTCCTTGTTTGATTGATCATACAAAGCGTGCTGTTTTCCAACTTTTTAATCTTTTGTGTTGCTTGGATAATCCTTATAATCGCATTCAAATACTGCTTCTGCTACTAAGAAAGTAGTTTTATTACTTTTAGAAATGATGAGTTACATGCGACTGTGATTTGTGTTGTATGACAATGGCAGATATCTAGCTGAATTACTATCAGAGAGACAAAAGTTGGGTCCCTTCGTGCAAGTTCTACCTCATTGTAGCAGACTTCTGAATCAGGGTGGGTTCACTACCTGTTATGCGTTTGATGCGTTGCAAGATTTCAACTATTGCATTTTGTTGTTGAAATAATAATGATTATTTCCTATGCTTGCATTAATGGAGTTATAGCTTCCGTCTTAAACTACTGTTGGAACATTTTCATCCTTTGAAATGTATTATAGGTAGTCACTGTTTTCTTTATCACATGAAAATTTTTAATTAGAACTAAGCACGTGAGTTTCCATATGTTTTCATGATTTTATTTAGTCTTTTAAACATCTTCATTTCTATTTACAACGTTTGAAGTACTTGGAATTGTAGCTATAATGCTATATTTTTCCTATTATATATGGCCATTGGTTGCTAACATTTATTGAATTATTACTTCCTTTTGAAACATCCTAGAATCAGAGCATCAATTCTAACTATTAATTTGTTGGCTGAAAGAAATGCTGGACGCACAAGTGGATTTGTTGGAGAAGAAAATTTAAGATACTGAAAGAAGTTGGTGACAAGAGTTTGTAGATTCAAGGGGTATGCTAGCTCAATATTGGCAAAGAAGATGCAAATCATGCTTGATAAGATGGAATCAATGTTGAAAAAGGAGATATTGCCACTTGCTATTAAGGAAGAAAACCTTAAACGAGTGATTGAGATGGACTTATTGACTCCCTCAAAGGTGTAAATGGATTTCCCACTCGAGAATGGAAAGTCACATGTGTTATATGACAAGAATCCATGAGATTAGAGGATATGAAATGGAACTTGGTGGAGAGGCTAAGAACAAGAGTTATACATCGTGTGACTAGATTTTGCAAGGGGAATATTGTCTAAGGAAGAGTTGGTAGTCAATCTTTGTGAAAGAAAAATTCTTGGAGTGGGTTTTCAAGGTTGTGGGTAATTTCTCTTTAAACAGGATGATTATATCATAAATGTGGAATTCATTCACATTAGCAAAAATGAGAACACACTAATTCACCTTCAATGGGAGTTTGTTAATGCTGGGAGTGATGTTATACCCTTTTGGTCGGGCCCTTCTTTTTGCAGGGCTATCTTTGTATTTTTGTCTTTCATCTTCTTCCCATGATAAAATAAAATAATTTTTTAAAAATTTCATTTGGTTATCAGGGATTTGGA
AGGAGATAGTTTCAGTCAAATACCATTCCTTTAACCACACCTTAAGGACAAGGTGTGATTTTGGGGAGGTACGATAGGACTTATAGCAAGGGGTAGTATAGCCATTTCCATTGGTAAGGTAGTTTAAGAGGTGTGAGTTTGTGTTTGTTTGGGACCACCAGTTGTAATGTAAATGGAGGCTGGGTGGTTGAGGAGACAGAAGTTTGTTAATTGTTATTGTTTAGCTTGAGCTTTCTGAAAAATGAGGCATCTATATTACCTTGAAATACTTGAGATTATACCTAGCTTGCTATGTTTTTCCTATTACATACAATTAAGAGTTCTCCTTACTTGCTGGTACCTATTAGATACCCATGCCTTATCCTATATCACCAAAATCTAATCTGGATCCAGGTCGAACTATGTTAAGGCTAAGCCTTTTATGTCCAAGAGTGACATTCTATGTCCAAAAATGTCCTACTTGGTACTTTGTCAACTTCTGTAATTAATTAGACATAGTCAACAACAATTTTTCCTACTGTCCTAACCCATCAAAAATGATGTAAACGTTCAATGATATCATTAATCTGGTACATCATTCTTAGTATCCCCACATGAATAGGCATGTGGTGATAATTTGCTGAGGTCTTGAGAAAAGTTCCCAATGACTGGTAATTTTTATGAAAAACAAGCCATTTATCTCCATTGTCCTCATCTTAAACACGAGTATTTGCTATGTCAATTGCACATGCACCACTTTATTTATTTCATCTCATATGCAGAAATCAGACGTCTATCAGGCCTTAATCAAACTTCTGTGGATCATGAGAGATTTGAGCACGGGAGTCCATACAGGTCACTAGGCCAGCTATCAAATGGAAGACCAATGGACTTGGAAGGTTGGCCTCCAATGCAAATGGAGGTAAATGTATTAATATCCTACTAAGCTGGAAAGATTATGTGATAGTGGATTTAAATTTTTGCCTTACTATGCATTGACTGTATCAATTGTCCTTGAGCTATTTTGTTAGTTCTTTAGATGCTAGTTTCTCGGTATGAAGACTTTCTCTTTAATCACAAATTTCTTTAGATGAGAATTATTTTAAATGAAAACCAAGCTTTCATTAAGAAGAATAAAAGAATACATACGGGCAATACAATAAAAGGAAGCCCACTACAAAAAGAGCCTGCCTATACAAAACATACACCCTAACTAGCTTCAGTTAAATTACAGGTAATAAGAGAAAAAAGGATAGTTACAAAACTTTTTTTAAGTCCGTTTATGATTACTAAATATGCTAGATTGAATTGCATAAATAGTATGTTAGGGTACTAGATATTGTAATTGTGTATCTCACAGTTTGATTAAGGATGACTATAGGTGAATTGAATCAGTGAAGTTAAATTCCTATCTTGTTTTAGGTTAATTTTATTTTTCTCTATGTAATTTCTAAAATAGTGGAAACCATGTGAGATGTGCAATTTGAATGGATGGCTTGTGAAGCTTGGATGGGTAGCTAACCCTGAGCATGCTCGAAATAGAAAAAGTAATTCATATTTGTGATTCTGGTGTGGTTTAGTTACGTGATATATTTACTGCAGGGAAGTGGACACGTCCACGGAATGGCTCCCCTTCAAGCTCATTCAATTGGGGGTTGGCCCAGAGTGCAAGGAATTCCTACTACGCCTATAGTGAAGAGAGTTGTTAGACTTGACGTACCTGTTGACAAATATCCAAATGTAAGTCAAAATATTTTTATGTTAGGTGGTGATGGTACTGACCCTTTTGCCTCTCTGATGTTAATAGTTTCTTGCTCCAATCACGGCAGTATAATTTTGTTGGCCGACTTCTGGGACCACGTGGAAACTCCTTGAAAAGGGTTGAAGCCTTAACAGAATGTAGGGTGTACATAAGAGGCAAGGGCTCTATCAAAGATGCTTTAGAGGTACTGTGAAAATTTTCCGTAGTGCAGCTAAATGTATTATTTACTGCATCGGTTAGACTTCATTGACATGCTTTT
CTGGTCGATCTTTTTCTAGTACTCGCCCTTTTTCTTATGTGTTCTCCCAACTTTGAAATAAGAGCTTGATTATAGATTTGTCAGTATTGCCATGAATAGCTTTTTGCTAATACGTCTTTCACTTGTAGTGGTCTGTGTATCATTTGTATGTTCTGAATATAGATGTGGAAGCGGTTGCATTTTATGTCTTAACTTTCTATCTTGATTCCAAGTACTGGAAAGTACTTTATCAGTTACATTGTCATGCTGCAAACAAGTTTACCCTCATCCTTCACGAACTAGATACATTTTATTATCATATGCTTTCTCAGTGTGACCTTCGCCTTTGTAATATTCCTTTTTTTCATTCGATAAGAACATTATGAACCTTTTATCTTTATTCATTAATCAAACACCTGAGAATTTGCTAGGCTTCGTAGCCTCTCGATTACTTGTATATAAGACTGACAAAATTTTGCCTCTTTAAATAAGTTGATTGATGCCAAAGATTTTCATCCCTCTCAATGTTTCTCACACAAAATGTTTGTGATGTTTTCATTCTCTGCAACGAATGTAGTAATATTTCTTTACCATTTTTTTGGGTGGCTTTTTGGGTTAGAAAGAAACCAGAATCTTTACTGACAAAAGGAGGACTATAGTGATTTAGTTCATTTTTGGGCCACTATTTGGAGCACCTTACACAATTCTTCGGCCGGAGATTACACCACTTCTTTTATCAACGTAGATTGGAAAGGTTTTTTGTAATAGTTTTCTTTGGCGGCCTCTAAGAGGTTGTTTTCTTTTTGTTTTGTGAAGATACCCTCTCCTATACATATATATATATACACCTTTTAATATTACTTACAATGGATCTGAATATTCTGCTTTAGGAAGAGAAACTAAAGGACAAGCCTGGATATGAGCATCTTAATGAGCCCTTGCATCTGTTGGTTGAGGCAGAATTCCCAGAGGATACAATAAACTCACGCTTGGATCATGCAGTGGCAGTTTTAGAAAGCCTTTTGAAGCCTGTGGTATGAACAAAAGAGCTCAAAATCATCTACAAAAGCTCTGATTGGTTTCTATAAAGTCTTATGAAGTTTCTGTTATTATTCCAGGATGAATTGCTGGATCAATATAAGAAGCAACAACTAAGAGAATTGGCATTACTAAATGGCACACTAAGGGAGGAAAGTCCGAGCATGAGCCCTAGCATGTCGCCGTTTAACAGCACGGGGCTCAAACGGGCCAAGACGGGAAGGTAG

mRNA sequence

ATGAGAGAAAAAGGTAAGGTTTTTACTTTGGGTAGAAAAAGACGCATCCCCCATTGCCAATTGCCATTCCATTGCCACATACTGACTGATTGCGAGAAGAAGAATACAGTAAAAGCACACCCCACGACACGATCTTCTCTTCCCCTCTTCCTTCCTCTTCAGCGTAGGGCTTTCCCTCCTCTGCTTTCCATTTTCCTCTCGAGCTACCACCGCCACCGCCACCGCCACCCCCACCGCCAATACAGGGAGCCCATCAAACGGACTATAATCTCTCTGCCTCCTGATTCGTCATCTACCTCGCTGTTTCTGGAAAGTTTATGCGTGATGGGGGAGAGAACCCCACCTGGGAGTTACTTCCATTACCCTCCCCCTTCAGCCCACGCATCTCCACACAGGACTCCTTCCATCCCCTCAGATCGGGAAAGATATCTAGCTGAATTACTATCAGAGAGACAAAAGTTGGGTCCCTTCGTGCAAGTTCTACCTCATTGTAGCAGACTTCTGAATCAGGAAATCAGACGTCTATCAGGCCTTAATCAAACTTCTGTGGATCATGAGAGATTTGAGCACGGGAGTCCATACAGGTCACTAGGCCAGCTATCAAATGGAAGACCAATGGACTTGGAAGGTTGGCCTCCAATGCAAATGGAGGGAAGTGGACACGTCCACGGAATGGCTCCCCTTCAAGCTCATTCAATTGGGGGTTGGCCCAGAGTGCAAGGAATTCCTACTACGCCTATAGTGAAGAGAGTTGTTAGACTTGACGTACCTGTTGACAAATATCCAAATTATAATTTTGTTGGCCGACTTCTGGGACCACGTGGAAACTCCTTGAAAAGGGTTGAAGCCTTAACAGAATGTAGGGTGTACATAAGAGGCAAGGGCTCTATCAAAGATGCTTTAGAGGAAGAGAAACTAAAGGACAAGCCTGGATATGAGCATCTTAATGAGCCCTTGCATCTGTTGGTTGAGGCAGAATTCCCAGAGGATACAATAAACTCACGCTTGGATCATGCAGTGGCAGTTTTAGAAAGCCTTTTGAAGCCTGTGGATGAATTGCTGGATCAATATAAGAAGCAACAACTAAGAGAATTGGCATTACTAAATGGCACACTAAGGGAGGAAAGTCCGAGCATGAGCCCTAGCATGTCGCCGTTTAACAGCACGGGGCTCAAACGGGCCAAGACGGGAAGGTAG

Coding sequence (CDS)

ATGAGAGAAAAAGGTAAGGTTTTTACTTTGGGTAGAAAAAGACGCATCCCCCATTGCCAATTGCCATTCCATTGCCACATACTGACTGATTGCGAGAAGAAGAATACAGTAAAAGCACACCCCACGACACGATCTTCTCTTCCCCTCTTCCTTCCTCTTCAGCGTAGGGCTTTCCCTCCTCTGCTTTCCATTTTCCTCTCGAGCTACCACCGCCACCGCCACCGCCACCCCCACCGCCAATACAGGGAGCCCATCAAACGGACTATAATCTCTCTGCCTCCTGATTCGTCATCTACCTCGCTGTTTCTGGAAAGTTTATGCGTGATGGGGGAGAGAACCCCACCTGGGAGTTACTTCCATTACCCTCCCCCTTCAGCCCACGCATCTCCACACAGGACTCCTTCCATCCCCTCAGATCGGGAAAGATATCTAGCTGAATTACTATCAGAGAGACAAAAGTTGGGTCCCTTCGTGCAAGTTCTACCTCATTGTAGCAGACTTCTGAATCAGGAAATCAGACGTCTATCAGGCCTTAATCAAACTTCTGTGGATCATGAGAGATTTGAGCACGGGAGTCCATACAGGTCACTAGGCCAGCTATCAAATGGAAGACCAATGGACTTGGAAGGTTGGCCTCCAATGCAAATGGAGGGAAGTGGACACGTCCACGGAATGGCTCCCCTTCAAGCTCATTCAATTGGGGGTTGGCCCAGAGTGCAAGGAATTCCTACTACGCCTATAGTGAAGAGAGTTGTTAGACTTGACGTACCTGTTGACAAATATCCAAATTATAATTTTGTTGGCCGACTTCTGGGACCACGTGGAAACTCCTTGAAAAGGGTTGAAGCCTTAACAGAATGTAGGGTGTACATAAGAGGCAAGGGCTCTATCAAAGATGCTTTAGAGGAAGAGAAACTAAAGGACAAGCCTGGATATGAGCATCTTAATGAGCCCTTGCATCTGTTGGTTGAGGCAGAATTCCCAGAGGATACAATAAACTCACGCTTGGATCATGCAGTGGCAGTTTTAGAAAGCCTTTTGAAGCCTGTGGATGAATTGCTGGATCAATATAAGAAGCAACAACTAAGAGAATTGGCATTACTAAATGGCACACTAAGGGAGGAAAGTCCGAGCATGAGCCCTAGCATGTCGCCGTTTAACAGCACGGGGCTCAAACGGGCCAAGACGGGAAGGTAG

Protein sequence

MREKGKVFTLGRKRRIPHCQLPFHCHILTDCEKKNTVKAHPTTRSSLPLFLPLQRRAFPPLLSIFLSSYHRHRHRHPHRQYREPIKRTIISLPPDSSSTSLFLESLCVMGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPLQAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR
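
As a quick consistency check between the coding sequence and the protein sequence, the first codons of the CDS can be translated by hand. The sketch below assumes the standard genetic code (the page does not name a translation table) and uses only the first 30 nucleotides of the CDS above; `translate` is an illustrative helper, not a library function.

```python
# Standard genetic code, built in the usual T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First 30 nt of the coding sequence listed above.
CDS_START = "ATGAGAGAAAAAGGTAAGGTTTTTACTTTG"


def translate(dna):
    """Translate a DNA string in frame 0; stop codons are shown as '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate(CDS_START))
```

The ten residues produced this way match the start of the protein sequence above (MREKGKVFTL).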
Homology
BLAST of Sgr029862 vs. NCBI nr
Match: KAG6600718.1 (KH domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 596.3 bits (1536), Expect = 2.0e-166
Identity = 294/305 (96.39%), Positives = 299/305 (98.03%), Query Frame = 0

Query: 94  PDSSSTSLFLESLCVMGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQK 153
           PDSSS+SLFL SLC+MGERTPPGSYFHYPPPSAHASPHRTPSIP DRERYLAELLSERQK
Sbjct: 24  PDSSSSSLFLRSLCMMGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQK 83

Query: 154 LGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPP 213
           LGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPP
Sbjct: 84  LGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPP 143

Query: 214 MQMEGSGHVHGMAPLQAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGP 273
           MQMEGSGHVH + PLQAHS+  WPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGP
Sbjct: 144 MQMEGSGHVHSLGPLQAHSM-AWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGP 203

Query: 274 RGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTIN 333
           RGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTIN
Sbjct: 204 RGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTIN 263

Query: 334 SRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKR 393
           SRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFN+TGLKR
Sbjct: 264 SRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNTTGLKR 323

Query: 394 AKTGR 399
           AKTGR
Sbjct: 324 AKTGR 327
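
The percentages in an HSP summary line follow directly from the match counts and the alignment length. A minimal sketch, using the figures reported for the hit above (294 identical and 299 positive positions over 305 aligned columns); `percent` is an illustrative helper name.

```python
def percent(matches, alignment_length):
    """Share of aligned columns that match, as a percentage with 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP summary of the first NCBI nr hit.
identity = percent(294, 305)
positives = percent(299, 305)
print(identity, positives)
```

The same calculation applies to every hit in this report, since each summary line lists its counts in the same `matches/length` form.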

BLAST of Sgr029862 vs. NCBI nr
Match: KAG6577363.1 (Ras-related protein RABA2a, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 584.3 bits (1505), Expect = 7.8e-163
Identity = 296/327 (90.52%), Positives = 300/327 (91.74%), Query Frame = 0

Query: 73  RHRHPHRQYREPIKRTIISLPPDSSSTSLFLESLCVMGERTPPGSYFHYPPPSAHASPHR 132
           RHR     +  P+    IS          FL+SLCVMGERTPPGSYFHYPPPSAHASPHR
Sbjct: 202 RHRLSSSDFPLPLCLRFISFFAPPPPPPFFLQSLCVMGERTPPGSYFHYPPPSAHASPHR 261

Query: 133 TPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGS 192
           TPSIP DRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGS
Sbjct: 262 TPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGS 321

Query: 193 PYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPLQAHSIGGWPRVQ-GIPTTPIVKRV 252
           PYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGM  LQAHS+ GWPRVQ GIP TPIVKRV
Sbjct: 322 PYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMGSLQAHSM-GWPRVQGGIPATPIVKRV 381

Query: 253 VRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPG 312
           VRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPG
Sbjct: 382 VRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPG 441

Query: 313 YEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGT 372
           YEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGT
Sbjct: 442 YEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGT 501

Query: 373 LREESPSMSPSMSPFNSTGLKRAKTGR 399
           LREESPSMSPSMSPFNSTGLKRAKTGR
Sbjct: 502 LREESPSMSPSMSPFNSTGLKRAKTGR 527

BLAST of Sgr029862 vs. NCBI nr
Match: KAG7015451.1 (KH domain-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 578.2 bits (1489), Expect = 5.6e-161
Identity = 288/298 (96.64%), Positives = 291/298 (97.65%), Query Frame = 0

Query: 102 FLESLCVMGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVL 161
           FL+SLCVMGERTPPGSYFHYPPPS+HASPHRTPSIP DRERYLAELLSERQKLGPFVQVL
Sbjct: 5   FLQSLCVMGERTPPGSYFHYPPPSSHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVL 64

Query: 162 PHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGH 221
           PHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGR MDLEGWPPMQMEGSGH
Sbjct: 65  PHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRQMDLEGWPPMQMEGSGH 124

Query: 222 VHGMAPLQAHSIGGWPRVQ-GIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKR 281
           VHGM  LQAHS+ GWPRVQ GIP TPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKR
Sbjct: 125 VHGMGSLQAHSM-GWPRVQGGIPATPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKR 184

Query: 282 VEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAV 341
           VEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAV
Sbjct: 185 VEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAV 244

Query: 342 AVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
           AVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR
Sbjct: 245 AVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 301

BLAST of Sgr029862 vs. NCBI nr
Match: XP_038904738.1 (KH domain-containing protein At1g09660/At1g09670 isoform X1 [Benincasa hispida])

HSP 1 Score: 577.8 bits (1488), Expect = 7.3e-161
Identity = 285/290 (98.28%), Positives = 287/290 (98.97%), Query Frame = 0

Query: 109 MGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLL 168
           MGERTPPGSYFHYPPPSAHASPHRTPSIP DRERYLAELLSERQKLGPFVQVLPHCSRLL
Sbjct: 1   MGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRLL 60

Query: 169 NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPL 228
           NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEG+GHVHGM PL
Sbjct: 61  NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGTGHVHGMGPL 120

Query: 229 QAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 288
           QAHS+ GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR
Sbjct: 121 QAHSM-GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 180

Query: 289 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLK 348
           VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLK
Sbjct: 181 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLK 240

Query: 349 PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
           PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR
Sbjct: 241 PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 289

BLAST of Sgr029862 vs. NCBI nr
Match: XP_022136599.1 (KH domain-containing protein At1g09660/At1g09670 [Momordica charantia])

HSP 1 Score: 573.9 bits (1478), Expect = 1.0e-159
Identity = 286/291 (98.28%), Positives = 288/291 (98.97%), Query Frame = 0

Query: 109 MGERTPPGSYFHYPPPSAHASPHRTPSIP-SDRERYLAELLSERQKLGPFVQVLPHCSRL 168
           MGERTPPGSYFHYPPPSAHASPHRTPSIP SDRERYL ELLSERQKLGPFVQVLPHCSRL
Sbjct: 1   MGERTPPGSYFHYPPPSAHASPHRTPSIPSSDRERYLTELLSERQKLGPFVQVLPHCSRL 60

Query: 169 LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAP 228
           LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAP
Sbjct: 61  LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAP 120

Query: 229 LQAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 288
           LQAHS+ GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC
Sbjct: 121 LQAHSM-GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 180

Query: 289 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLL 348
           RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLD+AVAVLESLL
Sbjct: 181 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDNAVAVLESLL 240

Query: 349 KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
           KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR
Sbjct: 241 KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 290

BLAST of Sgr029862 vs. ExPASy Swiss-Prot
Match: Q8GWR3 (KH domain-containing protein At1g09660/At1g09670 OS=Arabidopsis thaliana OX=3702 GN=At1g09660/At1g09670 PE=2 SV=1)

HSP 1 Score: 374.4 bits (960), Expect = 1.6e-102
Identity = 192/294 (65.31%), Positives = 229/294 (77.89%), Query Frame = 0

Query: 109 MGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLL 168
           M ER  PGS+F YP     ASP+R+P  PSDRERYL ELL ERQKLGPF+QV+P+C RLL
Sbjct: 11  MEERISPGSFFQYPLSGFRASPNRSPCPPSDRERYLTELLQERQKLGPFLQVMPNCCRLL 70

Query: 169 NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPL 228
           N EIRR+S       D +R+EHGSP+RSLGQ +NG+ +DLEGW  MQ E + H+   +P 
Sbjct: 71  NHEIRRVSSF----PDLDRYEHGSPFRSLGQPTNGK-LDLEGWSMMQAEENCHLQRASPF 130

Query: 229 QAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 288
           +  S  GW  + G+P  PIVK+V+RLDVPVDKYP+YNFVGR+LGPRGNSLKRVE  T CR
Sbjct: 131 RGPSPVGWIGMPGLPNPPIVKKVIRLDVPVDKYPSYNFVGRILGPRGNSLKRVELATHCR 190

Query: 289 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLK 348
           V+IRG+GS+KD ++EEKLK KPGYEHL EPLH+L+EAE PED INSRL+HAV  LESLLK
Sbjct: 191 VFIRGRGSVKDTVKEEKLKGKPGYEHLCEPLHVLIEAELPEDIINSRLEHAVHFLESLLK 250

Query: 349 PVDELLDQYKKQQLRELALLNGTLREESPS------MSPSMSPFNSTGLKRAKT 397
           P+DE +D YK++QL+ELA LNGTLREESPS      +SPSMSPFNS   KRAKT
Sbjct: 251 PMDESMDHYKREQLKELAALNGTLREESPSPSLSPCLSPSMSPFNS---KRAKT 296

BLAST of Sgr029862 vs. ExPASy Swiss-Prot
Match: Q8GYR4 (KH domain-containing protein At3g08620 OS=Arabidopsis thaliana OX=3702 GN=At3g08620 PE=2 SV=1)

HSP 1 Score: 284.6 bits (727), Expect = 1.7e-75
Identity = 159/286 (55.59%), Positives = 194/286 (67.83%), Query Frame = 0

Query: 117 SYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLS 176
           +Y ++ P  A +   RTPS   D  +Y+++LL+E QKLGPF+QVLP CSRLLNQEI R++
Sbjct: 6   NYNNFSPSRAASPQIRTPSSDVD-SQYISQLLAEHQKLGPFMQVLPICSRLLNQEIFRIT 65

Query: 177 GL--NQTSVDHERFEH--GSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPLQAHS 236
           G+  NQ   D +R  H   SP  S   +SN     L GW  +  E  G  HGMA      
Sbjct: 66  GMMPNQGFTDFDRLRHRSPSPMASPNLMSNVSGGGLGGWNGLPPERIGGPHGMAM----- 125

Query: 237 IGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIR 296
              W      P++  VKR++RLD+PVD YPN+NFVGRLLGPRGNSLKRVEA T CRVYIR
Sbjct: 126 --EWQGAPASPSSYPVKRILRLDLPVDTYPNFNFVGRLLGPRGNSLKRVEATTGCRVYIR 185

Query: 297 GKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDE 356
           GKGSIKD  +EEKLK KPGYEHLNE LH+L+EA+ P D ++ +L  A  ++E L+KPVDE
Sbjct: 186 GKGSIKDPEKEEKLKGKPGYEHLNEQLHILIEADLPIDIVDIKLRQAQEIIEELVKPVDE 245

Query: 357 LLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
             D  K+QQLRELALLN  LRE SP  S S+SPFNS  +KR KTGR
Sbjct: 246 SQDYIKRQQLRELALLNSNLRENSPGPSGSVSPFNSNAMKRPKTGR 283

BLAST of Sgr029862 vs. ExPASy Swiss-Prot
Match: Q0WLR1 (KH domain-containing protein At4g26480 OS=Arabidopsis thaliana OX=3702 GN=At4g26480 PE=2 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 3.7e-75
Identity = 156/292 (53.42%), Positives = 195/292 (66.78%), Query Frame = 0

Query: 116 GSYFHYP-----PPSAHASPH------RTPSIPSDRERYLAELLSERQKLGPFVQVLPHC 175
           G +  YP     PPSA  SP+        PS   ++E+YL+ELL+ER KL PF+ VLPH 
Sbjct: 23  GRFVTYPPPLSVPPSAPQSPNFSGGLRSQPSFLVEQEKYLSELLAERHKLTPFLPVLPHV 82

Query: 176 SRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHG 235
            RL+NQEI R++ L + ++   RF+H SP  S G   N R  D+ GW             
Sbjct: 83  CRLMNQEILRVTTLLENALSQSRFDHPSPLASGGIFQNSR-ADMNGWASQFPSERSVSSS 142

Query: 236 MAPLQAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEAL 295
            AP        W    G  +  IVKR +R+D+PVDKYPNYNFVGRLLGPRGNSLKRVEA 
Sbjct: 143 PAP-------NWLNSPGSSSGLIVKRTIRVDIPVDKYPNYNFVGRLLGPRGNSLKRVEAS 202

Query: 296 TECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLE 355
           T+CRV IRG+GSIKD ++E+ ++ KPGYEHLNEPLH+LVEAE P + +++RL  A  +L+
Sbjct: 203 TDCRVLIRGRGSIKDPIKEDMMRGKPGYEHLNEPLHILVEAELPIEIVDARLMQAREILD 262

Query: 356 SLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKT 397
            LL PV+E  D YKKQQLRELALLNG+LREE   MS S+SP+NS G+KRAKT
Sbjct: 263 DLLTPVEETHDFYKKQQLRELALLNGSLREEGSPMSGSISPYNSLGMKRAKT 306

BLAST of Sgr029862 vs. ExPASy Swiss-Prot
Match: Q75GR5 (KH domain-containing protein SPIN1 OS=Oryza sativa subsp. japonica OX=39947 GN=SPIN1 PE=1 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 8.6e-72
Identity = 151/280 (53.93%), Positives = 192/280 (68.57%), Query Frame = 0

Query: 124 PSAHASPHRTPSIPSDRE-RYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLSGLNQT- 183
           P+ + SP +  S P+D + +YLAELL+E QKLGPF+QVLP CS+LL+QEI R+S +    
Sbjct: 11  PARNLSP-QIRSNPTDVDSQYLAELLAEHQKLGPFMQVLPICSKLLSQEIMRVSSIVHNH 70

Query: 184 ---SVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPLQAHSIGGWPR 243
                D  RF   SP  S    SN        W  +  E  G   G +         W  
Sbjct: 71  GFGDFDRHRFRSPSPMSSPNPRSNRSGNGFSPWNGLHQERLGFPQGTSM-------DWQG 130

Query: 244 VQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGKGSIK 303
               P++ +VK+++RLDVPVD YPN+NFVGR+LGPRGNSLKRVEA T CRV+IRGKGSIK
Sbjct: 131 APPSPSSHVVKKILRLDVPVDSYPNFNFVGRILGPRGNSLKRVEASTGCRVFIRGKGSIK 190

Query: 304 DALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELLDQYK 363
           D  +E+KL+ KPGYEHL++PLH+L+EAEFP   I++RL HA  V+E LLKPVDE  D YK
Sbjct: 191 DPGKEDKLRGKPGYEHLSDPLHILIEAEFPASIIDARLRHAQEVIEELLKPVDESQDFYK 250

Query: 364 KQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
           +QQLRELA+LN TLRE+SP    S+SPF++ G+KRAKTG+
Sbjct: 251 RQQLRELAMLNSTLREDSPHPG-SVSPFSNGGMKRAKTGQ 281

BLAST of Sgr029862 vs. ExPASy Swiss-Prot
Match: Q9FKT4 (KH domain-containing protein At5g56140 OS=Arabidopsis thaliana OX=3702 GN=At5g56140 PE=2 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 1.1e-71
Identity = 150/282 (53.19%), Positives = 187/282 (66.31%), Query Frame = 0

Query: 123 PPSAHASPHRTPSIPS------DRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLS 182
           PPSA  SP+ +  + S      ++E+YL+ELL+ER KL PF+ VLPH  RLLNQEI R++
Sbjct: 39  PPSAPQSPNYSGGLRSQSSVFVEQEKYLSELLAERHKLTPFLPVLPHAFRLLNQEILRVT 98

Query: 183 GL--NQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPLQAHSIG 242
            L  N T +     +H SP  S G   N R  D+ GW               P       
Sbjct: 99  TLLENATVLSQSGLDHPSPLASGGIFQNAR-ADMNGWASQFPSERSVPSSPGP------- 158

Query: 243 GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGK 302
            W    G  +  I KR +R+D+PVD YPN+NFVGRLLGPRGNSLKRVEA T+CRV IRG+
Sbjct: 159 NWLNSPGSSSGLIAKRTIRVDIPVDNYPNFNFVGRLLGPRGNSLKRVEASTDCRVLIRGR 218

Query: 303 GSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELL 362
           GSIKD ++EE ++ KPGYEHLNEPLH+LVEAE P + +++RL  A  +L+ LL P++E  
Sbjct: 219 GSIKDPIKEEMMRGKPGYEHLNEPLHILVEAELPIEIVDARLMQAREILDDLLTPMEETH 278

Query: 363 DQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKT 397
           D YKKQQLRELALLNGTLREE   MS S+SP+NS G+KRAKT
Sbjct: 279 DMYKKQQLRELALLNGTLREEGSPMSGSVSPYNSLGMKRAKT 312

BLAST of Sgr029862 vs. ExPASy TrEMBL
Match: A0A0A0L3B7 (KH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G110600 PE=4 SV=1)

HSP 1 Score: 596.3 bits (1536), Expect = 9.5e-167
Identity = 295/306 (96.41%), Positives = 299/306 (97.71%), Query Frame = 0

Query: 93  PPDSSSTSLFLESLCVMGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQ 152
           P DSS +SLFL SLCVMGERTPPGSYFHYPPPSAHASPHRTPSIP DRER LAELLSERQ
Sbjct: 95  PRDSSPSSLFLRSLCVMGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERCLAELLSERQ 154

Query: 153 KLGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWP 212
           KLGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMD+EGWP
Sbjct: 155 KLGPFVQVLPHCSRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDMEGWP 214

Query: 213 PMQMEGSGHVHGMAPLQAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLG 272
           PMQMEGSGHVHGM PLQAHS+ GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLG
Sbjct: 215 PMQMEGSGHVHGMGPLQAHSM-GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLG 274

Query: 273 PRGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTI 332
           PRGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTI
Sbjct: 275 PRGNSLKRVEALTECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTI 334

Query: 333 NSRLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLK 392
           N+RLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLK
Sbjct: 335 NARLDHAVAVLESLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLK 394

Query: 393 RAKTGR 399
           RAKTGR
Sbjct: 395 RAKTGR 399

BLAST of Sgr029862 vs. ExPASy TrEMBL
Match: A0A6J1C4S3 (KH domain-containing protein At1g09660/At1g09670 OS=Momordica charantia OX=3673 GN=LOC111008264 PE=4 SV=1)

HSP 1 Score: 573.9 bits (1478), Expect = 5.1e-160
Identity = 286/291 (98.28%), Positives = 288/291 (98.97%), Query Frame = 0

Query: 109 MGERTPPGSYFHYPPPSAHASPHRTPSIP-SDRERYLAELLSERQKLGPFVQVLPHCSRL 168
           MGERTPPGSYFHYPPPSAHASPHRTPSIP SDRERYL ELLSERQKLGPFVQVLPHCSRL
Sbjct: 1   MGERTPPGSYFHYPPPSAHASPHRTPSIPSSDRERYLTELLSERQKLGPFVQVLPHCSRL 60

Query: 169 LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAP 228
           LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAP
Sbjct: 61  LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAP 120

Query: 229 LQAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 288
           LQAHS+ GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC
Sbjct: 121 LQAHSM-GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 180

Query: 289 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLL 348
           RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLD+AVAVLESLL
Sbjct: 181 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDNAVAVLESLL 240

Query: 349 KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
           KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR
Sbjct: 241 KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 290

BLAST of Sgr029862 vs. ExPASy TrEMBL
Match: A0A5D3BT43 (KH domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold18G00980 PE=4 SV=1)

HSP 1 Score: 573.5 bits (1477), Expect = 6.6e-160
Identity = 283/290 (97.59%), Positives = 286/290 (98.62%), Query Frame = 0

Query: 109 MGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLL 168
           MGERTPPGSYFHYPPPSAHASPHRTPSIP DRER LAELLSERQKLGPFVQVLPHCSRLL
Sbjct: 1   MGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERCLAELLSERQKLGPFVQVLPHCSRLL 60

Query: 169 NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPL 228
           NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMD+EGWPPMQMEGSGHVHGM PL
Sbjct: 61  NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDMEGWPPMQMEGSGHVHGMGPL 120

Query: 229 QAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 288
           QAHS+ GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR
Sbjct: 121 QAHSM-GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 180

Query: 289 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLK 348
           VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTIN+RLDHAVAVLESLLK
Sbjct: 181 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINARLDHAVAVLESLLK 240

Query: 349 PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
           PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR
Sbjct: 241 PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 289

BLAST of Sgr029862 vs. ExPASy TrEMBL
Match: A0A1S3BST0 (KH domain-containing protein At1g09660/At1g09670 OS=Cucumis melo OX=3656 GN=LOC103493316 PE=4 SV=1)

HSP 1 Score: 573.5 bits (1477), Expect = 6.6e-160
Identity = 283/290 (97.59%), Positives = 286/290 (98.62%), Query Frame = 0

Query: 109 MGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLL 168
           MGERTPPGSYFHYPPPSAHASPHRTPSIP DRER LAELLSERQKLGPFVQVLPHCSRLL
Sbjct: 1   MGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERCLAELLSERQKLGPFVQVLPHCSRLL 60

Query: 169 NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPL 228
           NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMD+EGWPPMQMEGSGHVHGM PL
Sbjct: 61  NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDMEGWPPMQMEGSGHVHGMGPL 120

Query: 229 QAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 288
           QAHS+ GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR
Sbjct: 121 QAHSM-GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 180

Query: 289 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLK 348
           VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTIN+RLDHAVAVLESLLK
Sbjct: 181 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINARLDHAVAVLESLLK 240

Query: 349 PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
           PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR
Sbjct: 241 PVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 289

BLAST of Sgr029862 vs. ExPASy TrEMBL
Match: A0A6J1FX42 (KH domain-containing protein At1g09660/At1g09670-like OS=Cucurbita moschata OX=3662 GN=LOC111447793 PE=4 SV=1)

HSP 1 Score: 572.4 bits (1474), Expect = 1.5e-159
Identity = 282/291 (96.91%), Positives = 286/291 (98.28%), Query Frame = 0

Query: 108 VMGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRL 167
           +MGERTPPGSYFHYPPPSAHASPHRTPSIP DRERYLAELLSERQKLGPFVQVLPHCSRL
Sbjct: 1   MMGERTPPGSYFHYPPPSAHASPHRTPSIPLDRERYLAELLSERQKLGPFVQVLPHCSRL 60

Query: 168 LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAP 227
           LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVH + P
Sbjct: 61  LNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHSLGP 120

Query: 228 LQAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 287
           LQAHS+  WPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC
Sbjct: 121 LQAHSM-AWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTEC 180

Query: 288 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLL 347
           RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLL
Sbjct: 181 RVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLL 240

Query: 348 KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
           KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFN+TGLKRAKTGR
Sbjct: 241 KPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNTTGLKRAKTGR 290

BLAST of Sgr029862 vs. TAIR 10
Match: AT1G09660.1 (RNA-binding KH domain-containing protein)

HSP 1 Score: 374.4 bits (960), Expect = 1.1e-103
Identity = 192/294 (65.31%), Positives = 229/294 (77.89%), Query Frame = 0

Query: 109 MGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLL 168
           M ER  PGS+F YP     ASP+R+P  PSDRERYL ELL ERQKLGPF+QV+P+C RLL
Sbjct: 11  MEERISPGSFFQYPLSGFRASPNRSPCPPSDRERYLTELLQERQKLGPFLQVMPNCCRLL 70

Query: 169 NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPL 228
           N EIRR+S       D +R+EHGSP+RSLGQ +NG+ +DLEGW  MQ E + H+   +P 
Sbjct: 71  NHEIRRVSSF----PDLDRYEHGSPFRSLGQPTNGK-LDLEGWSMMQAEENCHLQRASPF 130

Query: 229 QAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 288
           +  S  GW  + G+P  PIVK+V+RLDVPVDKYP+YNFVGR+LGPRGNSLKRVE  T CR
Sbjct: 131 RGPSPVGWIGMPGLPNPPIVKKVIRLDVPVDKYPSYNFVGRILGPRGNSLKRVELATHCR 190

Query: 289 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLK 348
           V+IRG+GS+KD ++EEKLK KPGYEHL EPLH+L+EAE PED INSRL+HAV  LESLLK
Sbjct: 191 VFIRGRGSVKDTVKEEKLKGKPGYEHLCEPLHVLIEAELPEDIINSRLEHAVHFLESLLK 250

Query: 349 PVDELLDQYKKQQLRELALLNGTLREESPS------MSPSMSPFNSTGLKRAKT 397
           P+DE +D YK++QL+ELA LNGTLREESPS      +SPSMSPFNS   KRAKT
Sbjct: 251 PMDESMDHYKREQLKELAALNGTLREESPSPSLSPCLSPSMSPFNS---KRAKT 296

BLAST of Sgr029862 vs. TAIR 10
Match: AT1G09660.2 (RNA-binding KH domain-containing protein)

HSP 1 Score: 318.5 bits (815), Expect = 7.4e-87
Identity = 157/242 (64.88%), Positives = 189/242 (78.10%), Query Frame = 0

Query: 109 MGERTPPGSYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLL 168
           M ER  PGS+F YP     ASP+R+P  PSDRERYL ELL ERQKLGPF+QV+P+C RLL
Sbjct: 11  MEERISPGSFFQYPLSGFRASPNRSPCPPSDRERYLTELLQERQKLGPFLQVMPNCCRLL 70

Query: 169 NQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPL 228
           N EIRR+S       D +R+EHGSP+RSLGQ +NG+ +DLEGW  MQ E + H+   +P 
Sbjct: 71  NHEIRRVSSF----PDLDRYEHGSPFRSLGQPTNGK-LDLEGWSMMQAEENCHLQRASPF 130

Query: 229 QAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECR 288
           +  S  GW  + G+P  PIVK+V+RLDVPVDKYP+YNFVGR+LGPRGNSLKRVE  T CR
Sbjct: 131 RGPSPVGWIGMPGLPNPPIVKKVIRLDVPVDKYPSYNFVGRILGPRGNSLKRVELATHCR 190

Query: 289 VYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLK 348
           V+IRG+GS+KD ++EEKLK KPGYEHL EPLH+L+EAE PED INSRL+HAV  LESLLK
Sbjct: 191 VFIRGRGSVKDTVKEEKLKGKPGYEHLCEPLHVLIEAELPEDIINSRLEHAVHFLESLLK 247

Query: 349 PV 351
           P+
Sbjct: 251 PM 247

BLAST of Sgr029862 vs. TAIR 10
Match: AT3G08620.1 (RNA-binding KH domain-containing protein)

HSP 1 Score: 284.6 bits (727), Expect = 1.2e-76
Identity = 159/286 (55.59%), Positives = 194/286 (67.83%), Query Frame = 0

Query: 117 SYFHYPPPSAHASPHRTPSIPSDRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLS 176
           +Y ++ P  A +   RTPS   D  +Y+++LL+E QKLGPF+QVLP CSRLLNQEI R++
Sbjct: 6   NYNNFSPSRAASPQIRTPSSDVD-SQYISQLLAEHQKLGPFMQVLPICSRLLNQEIFRIT 65

Query: 177 GL--NQTSVDHERFEH--GSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPLQAHS 236
           G+  NQ   D +R  H   SP  S   +SN     L GW  +  E  G  HGMA      
Sbjct: 66  GMMPNQGFTDFDRLRHRSPSPMASPNLMSNVSGGGLGGWNGLPPERIGGPHGMAM----- 125

Query: 237 IGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIR 296
              W      P++  VKR++RLD+PVD YPN+NFVGRLLGPRGNSLKRVEA T CRVYIR
Sbjct: 126 --EWQGAPASPSSYPVKRILRLDLPVDTYPNFNFVGRLLGPRGNSLKRVEATTGCRVYIR 185

Query: 297 GKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDE 356
           GKGSIKD  +EEKLK KPGYEHLNE LH+L+EA+ P D ++ +L  A  ++E L+KPVDE
Sbjct: 186 GKGSIKDPEKEEKLKGKPGYEHLNEQLHILIEADLPIDIVDIKLRQAQEIIEELVKPVDE 245

Query: 357 LLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKTGR 399
             D  K+QQLRELALLN  LRE SP  S S+SPFNS  +KR KTGR
Sbjct: 246 SQDYIKRQQLRELALLNSNLRENSPGPSGSVSPFNSNAMKRPKTGR 283

BLAST of Sgr029862 vs. TAIR 10
Match: AT4G26480.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 283.5 bits (724), Expect = 2.6e-76
Identity = 156/292 (53.42%), Positives = 195/292 (66.78%), Query Frame = 0

Query: 116 GSYFHYP-----PPSAHASPH------RTPSIPSDRERYLAELLSERQKLGPFVQVLPHC 175
           G +  YP     PPSA  SP+        PS   ++E+YL+ELL+ER KL PF+ VLPH 
Sbjct: 23  GRFVTYPPPLSVPPSAPQSPNFSGGLRSQPSFLVEQEKYLSELLAERHKLTPFLPVLPHV 82

Query: 176 SRLLNQEIRRLSGLNQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHG 235
            RL+NQEI R++ L + ++   RF+H SP  S G   N R  D+ GW             
Sbjct: 83  CRLMNQEILRVTTLLENALSQSRFDHPSPLASGGIFQNSR-ADMNGWASQFPSERSVSSS 142

Query: 236 MAPLQAHSIGGWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEAL 295
            AP        W    G  +  IVKR +R+D+PVDKYPNYNFVGRLLGPRGNSLKRVEA 
Sbjct: 143 PAP-------NWLNSPGSSSGLIVKRTIRVDIPVDKYPNYNFVGRLLGPRGNSLKRVEAS 202

Query: 296 TECRVYIRGKGSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLE 355
           T+CRV IRG+GSIKD ++E+ ++ KPGYEHLNEPLH+LVEAE P + +++RL  A  +L+
Sbjct: 203 TDCRVLIRGRGSIKDPIKEDMMRGKPGYEHLNEPLHILVEAELPIEIVDARLMQAREILD 262

Query: 356 SLLKPVDELLDQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKT 397
            LL PV+E  D YKKQQLRELALLNG+LREE   MS S+SP+NS G+KRAKT
Sbjct: 263 DLLTPVEETHDFYKKQQLRELALLNGSLREEGSPMSGSISPYNSLGMKRAKT 306

BLAST of Sgr029862 vs. TAIR 10
Match: AT5G56140.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 271.9 bits (694), Expect = 7.9e-73
Identity = 150/282 (53.19%), Positives = 187/282 (66.31%), Query Frame = 0

Query: 123 PPSAHASPHRTPSIPS------DRERYLAELLSERQKLGPFVQVLPHCSRLLNQEIRRLS 182
           PPSA  SP+ +  + S      ++E+YL+ELL+ER KL PF+ VLPH  RLLNQEI R++
Sbjct: 39  PPSAPQSPNYSGGLRSQSSVFVEQEKYLSELLAERHKLTPFLPVLPHAFRLLNQEILRVT 98

Query: 183 GL--NQTSVDHERFEHGSPYRSLGQLSNGRPMDLEGWPPMQMEGSGHVHGMAPLQAHSIG 242
            L  N T +     +H SP  S G   N R  D+ GW               P       
Sbjct: 99  TLLENATVLSQSGLDHPSPLASGGIFQNAR-ADMNGWASQFPSERSVPSSPGP------- 158

Query: 243 GWPRVQGIPTTPIVKRVVRLDVPVDKYPNYNFVGRLLGPRGNSLKRVEALTECRVYIRGK 302
            W    G  +  I KR +R+D+PVD YPN+NFVGRLLGPRGNSLKRVEA T+CRV IRG+
Sbjct: 159 NWLNSPGSSSGLIAKRTIRVDIPVDNYPNFNFVGRLLGPRGNSLKRVEASTDCRVLIRGR 218

Query: 303 GSIKDALEEEKLKDKPGYEHLNEPLHLLVEAEFPEDTINSRLDHAVAVLESLLKPVDELL 362
           GSIKD ++EE ++ KPGYEHLNEPLH+LVEAE P + +++RL  A  +L+ LL P++E  
Sbjct: 219 GSIKDPIKEEMMRGKPGYEHLNEPLHILVEAELPIEIVDARLMQAREILDDLLTPMEETH 278

Query: 363 DQYKKQQLRELALLNGTLREESPSMSPSMSPFNSTGLKRAKT 397
           D YKKQQLRELALLNGTLREE   MS S+SP+NS G+KRAKT
Sbjct: 279 DMYKKQQLRELALLNGTLREEGSPMSGSVSPYNSLGMKRAKT 312
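
The percentages in each HSP summary line follow directly from the raw counts (matches divided by alignment length). The Python sketch below recomputes them from a summary line copied from this page. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern only covers the line layout shown here.

```python
import re

# Pattern for an HSP summary line as printed in the alignments on this page.
# "Posi?tives" tolerates both spellings of the label (assumption: no other
# variants occur in the export).
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_SUMMARY.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


# Summary line of the AT1G09660.2 hit.
line = "Identity = 157/242 (64.88%), Positives = 189/242 (78.10%), Query Frame = 0"
print(hsp_percentages(line))
```

Comparing the recomputed values with the percentages printed in parentheses is a quick sanity check when these lines are scraped or re-entered by hand.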

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
KAG6600718.1    2.0e-166  96.39     KH domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]
KAG6577363.1    7.8e-163  90.52     Ras-related protein RABA2a, partial [Cucurbita argyrosperma subsp. sororia]
KAG7015451.1    5.6e-161  96.64     KH domain-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperm...
XP_038904738.1  7.3e-161  98.28     KH domain-containing protein At1g09660/At1g09670 isoform X1 [Benincasa hispida]
XP_022136599.1  1.0e-159  98.28     KH domain-containing protein At1g09660/At1g09670 [Momordica charantia]
Match Name  E-value   Identity  Description
Q8GWR3      1.6e-102  65.31     KH domain-containing protein At1g09660/At1g09670 OS=Arabidopsis thaliana OX=3702...
Q8GYR4      1.7e-75   55.59     KH domain-containing protein At3g08620 OS=Arabidopsis thaliana OX=3702 GN=At3g08...
Q0WLR1      3.7e-75   53.42     KH domain-containing protein At4g26480 OS=Arabidopsis thaliana OX=3702 GN=At4g26...
Q75GR5      8.6e-72   53.93     KH domain-containing protein SPIN1 OS=Oryza sativa subsp. japonica OX=39947 GN=S...
Q9FKT4      1.1e-71   53.19     KH domain-containing protein At5g56140 OS=Arabidopsis thaliana OX=3702 GN=At5g56...
Match Name  E-value   Identity  Description
A0A0A0L3B7  9.5e-167  96.41     KH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G110600 PE=4 SV=...
A0A6J1C4S3  5.1e-160  98.28     KH domain-containing protein At1g09660/At1g09670 OS=Momordica charantia OX=3673 ...
A0A5D3BT43  6.6e-160  97.59     KH domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A1S3BST0  6.6e-160  97.59     KH domain-containing protein At1g09660/At1g09670 OS=Cucumis melo OX=3656 GN=LOC1...
A0A6J1FX42  1.5e-159  96.91     KH domain-containing protein At1g09660/At1g09670-like OS=Cucurbita moschata OX=3...
Match Name   E-value   Identity  Description
AT1G09660.1  1.1e-103  65.31     RNA-binding KH domain-containing protein
AT1G09660.2  7.4e-87   64.88     RNA-binding KH domain-containing protein
AT3G08620.1  1.2e-76   55.59     RNA-binding KH domain-containing protein
AT4G26480.1  2.6e-76   53.42     RNA-binding KH domain-containing protein
AT5G56140.1  7.9e-73   53.19     RNA-binding KH domain-containing protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                        Source       Source Term      Source Description                            Alignment
IPR004087  K Homology domain                      SMART        SM00322          kh_6                                          coord: 248..348; e-value: 0.0017; score: 27.6
IPR036612  K Homology domain, type 1 superfamily  GENE3D       3.30.1370.10     K Homology domain, type 1                     coord: 135..380; e-value: 2.6E-68; score: 232.1
IPR036612  K Homology domain, type 1 superfamily  SUPERFAMILY  54791            Eukaryotic type KH-domain (KH-domain type I)  coord: 254..373
IPR032377  STAR protein, homodimerisation region  PFAM         PF16544          STAR_dimer                                    coord: 141..176; e-value: 4.3E-10; score: 39.1
None       No IPR available                       MOBIDB_LITE  mobidb-lite      disorder_prediction                           coord: 373..398
None       No IPR available                       PANTHER      PTHR11208:SF101  OS01G0886300 PROTEIN                          coord: 115..398
None       No IPR available                       CDD          cd02395          SF1_like-KH                                   coord: 253..373; e-value: 2.02549E-50; score: 163.569
IPR045071  KH domain-containing BBP-like          PANTHER      PTHR11208        RNA-BINDING PROTEIN RELATED                   coord: 115..398

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr029862.1   Sgr029862.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0048024      regulation of mRNA splicing, via spliceosome
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0003723      RNA binding