CcUC02G022320 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC02G022320
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: CRM domain-containing protein
Location: CicolChr02: 4675261 .. 4677597 (-)
RNA-Seq Expression: CcUC02G022320
Synteny: CcUC02G022320
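The location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the names `Location`, `parse_location`, and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed feature location."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'CicolChr02: 4675261 .. 4677597 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CicolChr02: 4675261 .. 4677597 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the gene spans 2337 bases on the minus strand of CicolChr02.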
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAGTTTCTTTTAGCTTCAGGGAAGCCGAACCAAGAAGCAAGCAAAAATGGGGGCTTCGATAATCTCCTCTCACTCTCTTCTTCTCCGGTCAGTTCCTTCTTCTTTCTCCACCTTCTTCTCTAAACCCCCCGCCTCCGCCACCGCCGCCGCCGCCATCTCCACCGTTTGCTTCCATCAATTTAAGCCCAACAGTTCAAACTTCCTCACTCGCGAATCTCCTCACCATTCCCGGTCTTTCACTCGCTCCGCATCCCTTTCCTTCACTCGCTCCTTTTCCTTCACCATTTCCGTCCCCTCCGAGGTCAGATTTGACGAAGACGACGATGGAATTGAAGATGAAGATGAAAGCGATTACGAAGATGAAGGTGAAATCGAAGATTTTGAAAGGGACGAGGCCGTTGGGAGTGGAACGGAAACTGGTGTGGCTTCGGAGTTGTCTTCGTCAATGGCGAGTAGAAATGAAGTGAAGGATATTCCGGGCTTGACAATTAAGGAGAAAAAAGAGCTGGCATCGTATGCTCATAGTTTAGGGAAGAAGCTGAAAAGTCAGTTGGTGGGAAAGTCTGGCGTCACTCCTGGCTTGGCTACTTCTTTCATCGAGACACTTGAAGCCAACGAGCTTCTCAAGGTTTGTTGGTTCTTTCAACGTTCACTTCTCTGTTTGATTGAGTTCTGATACTATCCATTAAGTCTATATGGCTTTAGGATTAGGGTTTATAAAGCAACCATTGCTGTGGACATCAATTGTCTTTGCAAAATTGTCGCCTCTGCTTCTTGTAGAATTTGAATAGCAAGAACATTCTGGATATAAATTGCTTGTATAATCGTATGTAGTTAGGAGGACAGCAATTACATTGGTTCTTGGAAAAAAGAAATTTGTCTGTTGGAAAGTAAATTTCAATTTCTATTATAGAATATGATTGATTAACTGCTAGGGAATAAAAAATGTTTGTATTTAGAACTGACTTCCATCATATGATTCATTGATCATGCATTGAATTTGTTGATAGATAAGGAAGGGTTCTTTCCATTAATTTTGAGGTCCAAGTTTATCAGTGAAGGGAGAATCTTAGAAGTCATATGGTTAAGCATTTGCACTATATATAATTCTCTTCCCTTACGTTGTAAGCTTACAAAAGCTTATGTTACAACCTTACTTGAACAGGATCTGTTCTACCGTGTCCTTGAGTTCTAAAAAAGCTTAACTTTGTCTTCTTGCACTTCATTCATTTTAAGTACCTCGAATTTATATTTAATGATTATTAGGGGCATGCTAATCCATGTTGTTATGAAGCAGCTACATTAAGTGTTTAACCTAAATACCGACGTTTCTTGTTGGTTGAGTTTTCTTTTTAGTTATTCTCTAGGCTTATTTTGATTGATTGAAGCCCTTTCTTAGTTTTAGTTTGGCTCCCTTCGTGCATGAGTTTATTTACACCCTTTCTGTTCTTTATTCTTTCATTTTTTCTCAGCAAGGGTTCTCATTGGAAAAAAGTGATATTTTGGGTTGATTGCTTTTGTTTATGAAGAGATATATAGCATCTGGAACTCAATTGCGATTCCTCTCTGTAAACTTGACTAGTGACTTCATTTCTTTGTTCTTTCTCATTTAAATGTTGAATATTAGAAACTCCTTGGAACAAACTAGATTAAAAGAAAATTGAACCACTCGATACAGATCAAAATTCTTGGAAACTGTCCAGAGGAGCTGGAGGATGCGGTGAGGAAATTGGAGGAATCAACTGGTTCTGTGGTGGTGAATCAAATTGGCAGGACTGTTATTATTTACAGGCCCAGCATCACAAAAATGAAAGCAGAGGAAGAGAAACGACGAGCCCGAAAAATTTATGTGAGAAAAGAGCCTGATCGAGTCAAATCCATTTTGCAGGTAAAGTAAATGCATAATGCATTGGCATTCTTGATCATTTCTCCAACATGTTTAGTATTCTATTGCAAGTTTCCTCTTAAATTTGAATACTTGTATAACTGCAGAATAAAA
TACAAACACCCCGGAGCTCGAACCGTGGTCGTCGTGGAAGCAGCAGGTATTGATCTGAAATAGGATTCACATCTTGCAGGTTTATGTTGGAGGAACAGATTTTGAATTTTCATCCCAATGTAAGTTTTGTTCATACCATTCTTGTTCTTGTAGTTCCACTAGTTTTGATTTAATTCATTAATGTAGAGATTTTTACTCTGCTTACACTTCACAGGTTTAGAGATTAAACAAGATTATAGGTTGTTTGACAAGAGTTAGTAGGAGGAGCTCAAACTCTACTTCCGTGTATCTCATGTTTTGTTGTTTGCTTTCTACTCATGCTTTTGAAACTTAAGCC

mRNA sequence

CAAGTTTCTTTTAGCTTCAGGGAAGCCGAACCAAGAAGCAAGCAAAAATGGGGGCTTCGATAATCTCCTCTCACTCTCTTCTTCTCCGGTCAGTTCCTTCTTCTTTCTCCACCTTCTTCTCTAAACCCCCCGCCTCCGCCACCGCCGCCGCCGCCATCTCCACCGTTTGCTTCCATCAATTTAAGCCCAACAGTTCAAACTTCCTCACTCGCGAATCTCCTCACCATTCCCGGTCTTTCACTCGCTCCGCATCCCTTTCCTTCACTCGCTCCTTTTCCTTCACCATTTCCGTCCCCTCCGAGGTCAGATTTGACGAAGACGACGATGGAATTGAAGATGAAGATGAAAGCGATTACGAAGATGAAGGTGAAATCGAAGATTTTGAAAGGGACGAGGCCGTTGGGAGTGGAACGGAAACTGGTGTGGCTTCGGAGTTGTCTTCGTCAATGGCGAGTAGAAATGAAGTGAAGGATATTCCGGGCTTGACAATTAAGGAGAAAAAAGAGCTGGCATCGTATGCTCATAGTTTAGGGAAGAAGCTGAAAAGTCAGTTGGTGGGAAAGTCTGGCGTCACTCCTGGCTTGGCTACTTCTTTCATCGAGACACTTGAAGCCAACGAGCTTCTCAAGATCAAAATTCTTGGAAACTGTCCAGAGGAGCTGGAGGATGCGGTGAGGAAATTGGAGGAATCAACTGGTTCTGTGGTGGTGAATCAAATTGGCAGGACTGTTATTATTTACAGGCCCAGCATCACAAAAATGAAAGCAGAGGAAGAGAAACGACGAGCCCGAAAAATTTATGTGAGAAAAGAGCCTGATCGAGTCAAATCCATTTTGCAGGTAAAAATAAAATACAAACACCCCGGAGCTCGAACCGTGGTCGTCGTGGAAGCAGCAGGTATTGATCTGAAATAGGATTCACATCTTGCAGGTTTATGTTGGAGGAACAGATTTTGAATTTTCATCCCAATGTTTAGAGATTAAACAAGATTATAGGTTGTTTGACAAGAGTTAGTAGGAGGAGCTCAAACTCTACTTCCGTGTATCTCATGTTTTGTTGTTTGCTTTCTACTCATGCTTTTGAAACTTAAGCC

Coding sequence (CDS)

ATGGGGGCTTCGATAATCTCCTCTCACTCTCTTCTTCTCCGGTCAGTTCCTTCTTCTTTCTCCACCTTCTTCTCTAAACCCCCCGCCTCCGCCACCGCCGCCGCCGCCATCTCCACCGTTTGCTTCCATCAATTTAAGCCCAACAGTTCAAACTTCCTCACTCGCGAATCTCCTCACCATTCCCGGTCTTTCACTCGCTCCGCATCCCTTTCCTTCACTCGCTCCTTTTCCTTCACCATTTCCGTCCCCTCCGAGGTCAGATTTGACGAAGACGACGATGGAATTGAAGATGAAGATGAAAGCGATTACGAAGATGAAGGTGAAATCGAAGATTTTGAAAGGGACGAGGCCGTTGGGAGTGGAACGGAAACTGGTGTGGCTTCGGAGTTGTCTTCGTCAATGGCGAGTAGAAATGAAGTGAAGGATATTCCGGGCTTGACAATTAAGGAGAAAAAAGAGCTGGCATCGTATGCTCATAGTTTAGGGAAGAAGCTGAAAAGTCAGTTGGTGGGAAAGTCTGGCGTCACTCCTGGCTTGGCTACTTCTTTCATCGAGACACTTGAAGCCAACGAGCTTCTCAAGATCAAAATTCTTGGAAACTGTCCAGAGGAGCTGGAGGATGCGGTGAGGAAATTGGAGGAATCAACTGGTTCTGTGGTGGTGAATCAAATTGGCAGGACTGTTATTATTTACAGGCCCAGCATCACAAAAATGAAAGCAGAGGAAGAGAAACGACGAGCCCGAAAAATTTATGTGAGAAAAGAGCCTGATCGAGTCAAATCCATTTTGCAGGTAAAAATAAAATACAAACACCCCGGAGCTCGAACCGTGGTCGTCGTGGAAGCAGCAGGTATTGATCTGAAATAG

Protein sequence

MGASIISSHSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFLTRESPHHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFERDEAVGSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGKSGVTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPSITKMKAEEEKRRARKIYVRKEPDRVKSILQVKIKYKHPGARTVVVVEAAGIDLK
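The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is an illustrative sketch that assumes the standard nuclear genetic code; the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGGGGCTTCGATAATCTCC"))
print(translate("GATCTGAAATAG"))
```

The two fragments translate to the first seven residues and the last three residues of the protein sequence shown above, with the final TAG acting as the stop codon.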
Homology
BLAST of CcUC02G022320 vs. NCBI nr
Match: XP_004148536.1 (uncharacterized protein LOC101213060 [Cucumis sativus] >KGN43169.1 hypothetical protein Csa_020174 [Cucumis sativus])

HSP 1 Score: 390.6 bits (1002), Expect = 1.2e-104
Identity = 227/270 (84.07%), Positives = 238/270 (88.15%), Query Frame = 0

Query: 1   MGASIIS--SHSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFLTRESP 60
           MGASIIS  SHSLLLRS+PSSFSTFFSKP    +AAAAISTVCFHQFKP+SS  LTRESP
Sbjct: 1   MGASIISSHSHSLLLRSLPSSFSTFFSKP----SAAAAISTVCFHQFKPSSSTLLTRESP 60

Query: 61  HHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFERDEAV 120
           HH R FT SA LSF+RSFS TIS PSEV FDE DD IED+DESDYEDE E+ED      V
Sbjct: 61  HHYRPFTPSAPLSFSRSFSSTISDPSEVIFDEADDEIEDKDESDYEDEVEMED-----EV 120

Query: 121 GSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGKSGVTPG 180
           G GTETGVASELSS + SRNEVK+IP LTIKEKKELASYAH LGKKLKSQLVGKSGVTPG
Sbjct: 121 GDGTETGVASELSSPIMSRNEVKNIPSLTIKEKKELASYAHGLGKKLKSQLVGKSGVTPG 180

Query: 181 LATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPSITKM 240
           LATSFIETLEANELLKIKILGNCPEELED VRKL ESTGSVVVNQIGRTVIIYRPSITKM
Sbjct: 181 LATSFIETLEANELLKIKILGNCPEELEDVVRKLAESTGSVVVNQIGRTVIIYRPSITKM 240

Query: 241 KAEEEKRRARKIYVRKEPDRVKSILQVKIK 269
           KAEEEKRRARK+Y+RKEPDRVKSILQ KI+
Sbjct: 241 KAEEEKRRARKVYMRKEPDRVKSILQKKIE 261
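The percentages in an HSP summary line are the identical or positive-scoring column counts divided by the alignment length. The sketch below parses such a line and recomputes both percentages. The name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern accepts both "Positives" and the variant spelling "Postives" as a robustness assumption.

```python
import re

# Matches e.g. "Identity = 227/270 (84.07%), Positives = 238/270 (88.15%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, length, positives, _ = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        "pct_identity": round(100 * identities / length, 2),
        "pct_positives": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 227/270 (84.07%), Positives = 238/270 (88.15%), Query Frame = 0"
)
print(stats)
```

Recomputing the percentages from the raw counts is a cheap consistency check when hit lists are copied between tools.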

BLAST of CcUC02G022320 vs. NCBI nr
Match: XP_038888218.1 (uncharacterized protein LOC120078082 [Benincasa hispida])

HSP 1 Score: 388.7 bits (997), Expect = 4.5e-104
Identity = 229/270 (84.81%), Positives = 239/270 (88.52%), Query Frame = 0

Query: 1   MGASIIS--SHSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFLTRESP 60
           MGASIIS  SHSLLLRSVPSSFSTFFSKP A+A AAA    VCF+QFKP SSNF TRES 
Sbjct: 1   MGASIISFHSHSLLLRSVPSSFSTFFSKPSAAAAAAA----VCFNQFKPVSSNFFTRESS 60

Query: 61  HHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFERDEAV 120
           HHSRSF  S SLSF RSF+ TIS PSE RFDEDDD IEDEDESDYEDE EI   ER+EAV
Sbjct: 61  HHSRSFIPSPSLSFPRSFASTISGPSEFRFDEDDDEIEDEDESDYEDEVEI---EREEAV 120

Query: 121 GSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGKSGVTPG 180
             G ETGVASELSS M +RNEVK+IP LTIKEKKELASYAHSLGKKLKSQLVGKSGVTPG
Sbjct: 121 EDGMETGVASELSSLMVNRNEVKNIPSLTIKEKKELASYAHSLGKKLKSQLVGKSGVTPG 180

Query: 181 LATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPSITKM 240
           LATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPSITKM
Sbjct: 181 LATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPSITKM 240

Query: 241 KAEEEKRRARKIYVRKEPDRVKSILQVKIK 269
           KAE++KR+ARKIYVRKE DRVKSILQ KI+
Sbjct: 241 KAEKQKRQARKIYVRKELDRVKSILQNKIQ 263

BLAST of CcUC02G022320 vs. NCBI nr
Match: XP_008448058.1 (PREDICTED: uncharacterized protein LOC103490350 [Cucumis melo])

HSP 1 Score: 384.4 bits (986), Expect = 8.5e-103
Identity = 221/274 (80.66%), Positives = 238/274 (86.86%), Query Frame = 0

Query: 1   MGASIIS------SHSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFLT 60
           M ASIIS      SHSLLLRSVPSSFSTFFSKP    +A AA STVCFHQFKP+SS FLT
Sbjct: 1   MRASIISSHSHSHSHSLLLRSVPSSFSTFFSKP----SATAAFSTVCFHQFKPSSSTFLT 60

Query: 61  RESPHHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFER 120
           R+SPH+SR FT S SLSF+R FS T+S PSEVRFDE DD IED+DESDYEDE E+ED   
Sbjct: 61  RKSPHYSRPFTPSTSLSFSRFFSSTVSGPSEVRFDEGDDEIEDKDESDYEDEVEMED--- 120

Query: 121 DEAVGSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGKSG 180
             +VG GTE GVASELSS + +RNEVK+IP LTIKEKKELASYAH LGKKLKSQLVGKSG
Sbjct: 121 --SVGDGTENGVASELSSPVMNRNEVKNIPSLTIKEKKELASYAHGLGKKLKSQLVGKSG 180

Query: 181 VTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPS 240
           VTPGLATSFIETLEANELLKIKILGNCPE+LED VRKLEESTGSVVVNQIGRTVIIYRPS
Sbjct: 181 VTPGLATSFIETLEANELLKIKILGNCPEDLEDVVRKLEESTGSVVVNQIGRTVIIYRPS 240

Query: 241 ITKMKAEEEKRRARKIYVRKEPDRVKSILQVKIK 269
           ITKMKAEEEKRR+RKIY+RKEPDRVKSILQ K++
Sbjct: 241 ITKMKAEEEKRRSRKIYIRKEPDRVKSILQKKVE 265

BLAST of CcUC02G022320 vs. NCBI nr
Match: KAG7022267.1 (yqeI [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 375.9 bits (964), Expect = 3.0e-100
Identity = 223/296 (75.34%), Positives = 244/296 (82.43%), Query Frame = 0

Query: 1   MGASIISS-------HSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFL 60
           MGASI+SS       HSLLLRSVPSS STF  +P      +A IS  C HQFKPNSS FL
Sbjct: 1   MGASIMSSPSHSHLIHSLLLRSVPSSLSTFLCRP------SAVISAACSHQFKPNSSIFL 60

Query: 61  TRESPHHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFE 120
            R+SPH SRSFT SAS+SF+RSFS TIS  S V F EDDD I+D++ESDYEDE EIEDFE
Sbjct: 61  IRQSPHRSRSFTPSASVSFSRSFSSTISDTSAVGF-EDDDVIDDDEESDYEDEVEIEDFE 120

Query: 121 -RDEAVGSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGK 180
             D  V  G ET VAS+ SS++ S +EVK+IP LT+KEKKELASYAHSLGKKLKSQLVGK
Sbjct: 121 LEDNTVEDGLETAVASDSSSAVVSISEVKNIPSLTVKEKKELASYAHSLGKKLKSQLVGK 180

Query: 181 SGVTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYR 240
           SGVTPGLATSFIETLEANELLKIK+LGNCP ELED VR+LEESTGSVVV++IGRTVIIYR
Sbjct: 181 SGVTPGLATSFIETLEANELLKIKVLGNCPGELEDVVRQLEESTGSVVVSKIGRTVIIYR 240

Query: 241 PSITKMKAEEEKRRARKIYVRKEPDRVKSILQVKIKYKHPGARTVVVVEAAGIDLK 289
           PSI+KMKAEEEKRRARK+YVRKEPDRVK  LQVKIKYKH  ART VVVEAAG DLK
Sbjct: 241 PSISKMKAEEEKRRARKMYVRKEPDRVKLFLQVKIKYKHREARTEVVVEAAGFDLK 289

BLAST of CcUC02G022320 vs. NCBI nr
Match: XP_022971306.1 (uncharacterized protein LOC111470060 [Cucurbita maxima])

HSP 1 Score: 345.9 bits (886), Expect = 3.4e-91
Identity = 206/276 (74.64%), Positives = 226/276 (81.88%), Query Frame = 0

Query: 1   MGASIISS-------HSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFL 60
           MGASI+SS       HSLLLRSVPSS STF  +P      +A IS  C HQFKPNSS FL
Sbjct: 1   MGASIMSSPSHSHLIHSLLLRSVPSSLSTFLCRP------SAVISAACSHQFKPNSSIFL 60

Query: 61  TRESPHHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFE 120
            R+SPH SRSFT SAS+SF+RSFS  IS  S V F EDDD I+D+DESDYEDE EIEDFE
Sbjct: 61  IRQSPHRSRSFTPSASVSFSRSFSSIISGTSAVGF-EDDDVIDDDDESDYEDEVEIEDFE 120

Query: 121 -RDEAVGSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGK 180
             D  V  G ET VAS+ SS++ SR+EVK IP LT+KEKKELASYAHSLGKKLKSQLVGK
Sbjct: 121 LEDNTVEDGLETAVASDSSSAVVSRSEVKSIPSLTVKEKKELASYAHSLGKKLKSQLVGK 180

Query: 181 SGVTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYR 240
           SGVTPGLATSFIETLEANELLKIK+LGNCP ELED VR+LEESTGSVVV++IGRTVIIYR
Sbjct: 181 SGVTPGLATSFIETLEANELLKIKVLGNCPGELEDVVRQLEESTGSVVVSKIGRTVIIYR 240

Query: 241 PSITKMKAEEEKRRARKIYVRKEPDRVKSILQVKIK 269
           PSI+KMKAEEEKRRARKIYVRKEPDRVK  LQ K++
Sbjct: 241 PSISKMKAEEEKRRARKIYVRKEPDRVKLFLQNKVQ 269

BLAST of CcUC02G022320 vs. ExPASy Swiss-Prot
Match: P54454 (Probable RNA-binding protein YqeI OS=Bacillus subtilis (strain 168) OX=224308 GN=yqeI PE=4 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 2.8e-08
Identity = 34/89 (38.20%), Positives = 49/89 (55.06%), Query Frame = 0

Query: 146 LTIKEKKELASYAHSLGKKLKSQLVGKSGVTPGLATSFIETLEANELLKIKILGNCPEEL 205
           LT K+K+ L S AH L    +   VGK GV   +     E LEA EL+K+ +L NC E+ 
Sbjct: 2   LTGKQKRFLRSKAHHLTPIFQ---VGKGGVNDNMIKQIAEALEARELIKVSVLQNCEEDK 61

Query: 206 EDAVRKLEESTGSVVVNQIGRTVIIYRPS 235
            D    L + + S +V  IG T+++Y+ S
Sbjct: 62  NDVAEALVKGSRSQLVQTIGNTIVLYKES 87

BLAST of CcUC02G022320 vs. ExPASy Swiss-Prot
Match: P0AGK6 (RNA-binding protein YhbY OS=Escherichia coli O157:H7 OX=83334 GN=yhbY PE=4 SV=1)

HSP 1 Score: 51.6 bits (122), Expect = 1.7e-05
Identity = 30/89 (33.71%), Positives = 49/89 (55.06%), Query Frame = 0

Query: 146 LTIKEKKELASYAHSLGKKLKSQLVGKSGVTPGLATSFIETLEANELLKIKILGNCPEEL 205
           L+ K+K+ L   AH L   +   L+G +G+T G+     + LE +EL+K+KI     E  
Sbjct: 3   LSTKQKQHLKGLAHPLKPVV---LLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETK 62

Query: 206 EDAVRKLEESTGSVVVNQIGRTVIIYRPS 235
              V  +   TG+  V  IG+T+++YRP+
Sbjct: 63  TLIVEAIVRETGACNVQVIGKTLVLYRPT 88

BLAST of CcUC02G022320 vs. ExPASy Swiss-Prot
Match: P0AGK5 (RNA-binding protein YhbY OS=Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UPEC) OX=199310 GN=yhbY PE=4 SV=1)

HSP 1 Score: 51.6 bits (122), Expect = 1.7e-05
Identity = 30/89 (33.71%), Positives = 49/89 (55.06%), Query Frame = 0

Query: 146 LTIKEKKELASYAHSLGKKLKSQLVGKSGVTPGLATSFIETLEANELLKIKILGNCPEEL 205
           L+ K+K+ L   AH L   +   L+G +G+T G+     + LE +EL+K+KI     E  
Sbjct: 3   LSTKQKQHLKGLAHPLKPVV---LLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETK 62

Query: 206 EDAVRKLEESTGSVVVNQIGRTVIIYRPS 235
              V  +   TG+  V  IG+T+++YRP+
Sbjct: 63  TLIVEAIVRETGACNVQVIGKTLVLYRPT 88

BLAST of CcUC02G022320 vs. ExPASy Swiss-Prot
Match: P0AGK4 (RNA-binding protein YhbY OS=Escherichia coli (strain K12) OX=83333 GN=yhbY PE=1 SV=1)

HSP 1 Score: 51.6 bits (122), Expect = 1.7e-05
Identity = 30/89 (33.71%), Positives = 49/89 (55.06%), Query Frame = 0

Query: 146 LTIKEKKELASYAHSLGKKLKSQLVGKSGVTPGLATSFIETLEANELLKIKILGNCPEEL 205
           L+ K+K+ L   AH L   +   L+G +G+T G+     + LE +EL+K+KI     E  
Sbjct: 3   LSTKQKQHLKGLAHPLKPVV---LLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETK 62

Query: 206 EDAVRKLEESTGSVVVNQIGRTVIIYRPS 235
              V  +   TG+  V  IG+T+++YRP+
Sbjct: 63  TLIVEAIVRETGACNVQVIGKTLVLYRPT 88

BLAST of CcUC02G022320 vs. ExPASy Swiss-Prot
Match: P71376 (RNA-binding protein HI_1333 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=HI_1333 PE=1 SV=1)

HSP 1 Score: 45.8 bits (107), Expect = 9.4e-04
Identity = 26/89 (29.21%), Positives = 46/89 (51.69%), Query Frame = 0

Query: 146 LTIKEKKELASYAHSLGKKLKSQLVGKSGVTPGLATSFIETLEANELLKIKILGNCPEEL 205
           L+ K+K+ L   AH L   +   ++G +G+T G+       L  +EL+K+K+ G   E  
Sbjct: 4   LSTKQKQFLKGLAHHLNPVV---MLGGNGLTEGVLAEIENALNHHELIKVKVAGADRETK 63

Query: 206 EDAVRKLEESTGSVVVNQIGRTVIIYRPS 235
           +  +  +   T +  V  IG  +++YRPS
Sbjct: 64  QLIINAIVRETKAAQVQTIGHILVLYRPS 89

BLAST of CcUC02G022320 vs. ExPASy TrEMBL
Match: A0A0A0K2C3 (CRM domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G004710 PE=4 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 5.8e-105
Identity = 227/270 (84.07%), Positives = 238/270 (88.15%), Query Frame = 0

Query: 1   MGASIIS--SHSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFLTRESP 60
           MGASIIS  SHSLLLRS+PSSFSTFFSKP    +AAAAISTVCFHQFKP+SS  LTRESP
Sbjct: 1   MGASIISSHSHSLLLRSLPSSFSTFFSKP----SAAAAISTVCFHQFKPSSSTLLTRESP 60

Query: 61  HHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFERDEAV 120
           HH R FT SA LSF+RSFS TIS PSEV FDE DD IED+DESDYEDE E+ED      V
Sbjct: 61  HHYRPFTPSAPLSFSRSFSSTISDPSEVIFDEADDEIEDKDESDYEDEVEMED-----EV 120

Query: 121 GSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGKSGVTPG 180
           G GTETGVASELSS + SRNEVK+IP LTIKEKKELASYAH LGKKLKSQLVGKSGVTPG
Sbjct: 121 GDGTETGVASELSSPIMSRNEVKNIPSLTIKEKKELASYAHGLGKKLKSQLVGKSGVTPG 180

Query: 181 LATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPSITKM 240
           LATSFIETLEANELLKIKILGNCPEELED VRKL ESTGSVVVNQIGRTVIIYRPSITKM
Sbjct: 181 LATSFIETLEANELLKIKILGNCPEELEDVVRKLAESTGSVVVNQIGRTVIIYRPSITKM 240

Query: 241 KAEEEKRRARKIYVRKEPDRVKSILQVKIK 269
           KAEEEKRRARK+Y+RKEPDRVKSILQ KI+
Sbjct: 241 KAEEEKRRARKVYMRKEPDRVKSILQKKIE 261

BLAST of CcUC02G022320 vs. ExPASy TrEMBL
Match: A0A1S3BJF7 (uncharacterized protein LOC103490350 OS=Cucumis melo OX=3656 GN=LOC103490350 PE=4 SV=1)

HSP 1 Score: 384.4 bits (986), Expect = 4.1e-103
Identity = 221/274 (80.66%), Positives = 238/274 (86.86%), Query Frame = 0

Query: 1   MGASIIS------SHSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFLT 60
           M ASIIS      SHSLLLRSVPSSFSTFFSKP    +A AA STVCFHQFKP+SS FLT
Sbjct: 1   MRASIISSHSHSHSHSLLLRSVPSSFSTFFSKP----SATAAFSTVCFHQFKPSSSTFLT 60

Query: 61  RESPHHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFER 120
           R+SPH+SR FT S SLSF+R FS T+S PSEVRFDE DD IED+DESDYEDE E+ED   
Sbjct: 61  RKSPHYSRPFTPSTSLSFSRFFSSTVSGPSEVRFDEGDDEIEDKDESDYEDEVEMED--- 120

Query: 121 DEAVGSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGKSG 180
             +VG GTE GVASELSS + +RNEVK+IP LTIKEKKELASYAH LGKKLKSQLVGKSG
Sbjct: 121 --SVGDGTENGVASELSSPVMNRNEVKNIPSLTIKEKKELASYAHGLGKKLKSQLVGKSG 180

Query: 181 VTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPS 240
           VTPGLATSFIETLEANELLKIKILGNCPE+LED VRKLEESTGSVVVNQIGRTVIIYRPS
Sbjct: 181 VTPGLATSFIETLEANELLKIKILGNCPEDLEDVVRKLEESTGSVVVNQIGRTVIIYRPS 240

Query: 241 ITKMKAEEEKRRARKIYVRKEPDRVKSILQVKIK 269
           ITKMKAEEEKRR+RKIY+RKEPDRVKSILQ K++
Sbjct: 241 ITKMKAEEEKRRSRKIYIRKEPDRVKSILQKKVE 265

BLAST of CcUC02G022320 vs. ExPASy TrEMBL
Match: A0A6J1I2Z3 (uncharacterized protein LOC111470060 OS=Cucurbita maxima OX=3661 GN=LOC111470060 PE=4 SV=1)

HSP 1 Score: 345.9 bits (886), Expect = 1.6e-91
Identity = 206/276 (74.64%), Positives = 226/276 (81.88%), Query Frame = 0

Query: 1   MGASIISS-------HSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFL 60
           MGASI+SS       HSLLLRSVPSS STF  +P      +A IS  C HQFKPNSS FL
Sbjct: 1   MGASIMSSPSHSHLIHSLLLRSVPSSLSTFLCRP------SAVISAACSHQFKPNSSIFL 60

Query: 61  TRESPHHSRSFTRSASLSFTRSFSFTISVPSEVRFDEDDDGIEDEDESDYEDEGEIEDFE 120
            R+SPH SRSFT SAS+SF+RSFS  IS  S V F EDDD I+D+DESDYEDE EIEDFE
Sbjct: 61  IRQSPHRSRSFTPSASVSFSRSFSSIISGTSAVGF-EDDDVIDDDDESDYEDEVEIEDFE 120

Query: 121 -RDEAVGSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGK 180
             D  V  G ET VAS+ SS++ SR+EVK IP LT+KEKKELASYAHSLGKKLKSQLVGK
Sbjct: 121 LEDNTVEDGLETAVASDSSSAVVSRSEVKSIPSLTVKEKKELASYAHSLGKKLKSQLVGK 180

Query: 181 SGVTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYR 240
           SGVTPGLATSFIETLEANELLKIK+LGNCP ELED VR+LEESTGSVVV++IGRTVIIYR
Sbjct: 181 SGVTPGLATSFIETLEANELLKIKVLGNCPGELEDVVRQLEESTGSVVVSKIGRTVIIYR 240

Query: 241 PSITKMKAEEEKRRARKIYVRKEPDRVKSILQVKIK 269
           PSI+KMKAEEEKRRARKIYVRKEPDRVK  LQ K++
Sbjct: 241 PSISKMKAEEEKRRARKIYVRKEPDRVKLFLQNKVQ 269

BLAST of CcUC02G022320 vs. ExPASy TrEMBL
Match: A0A5D3DFJ1 (Putative RNA-binding CRS1 / YhbY domain protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1428G00840 PE=4 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 4.2e-63
Identity = 133/146 (91.10%), Positives = 140/146 (95.89%), Query Frame = 0

Query: 123 ETGVASELSSSMASRNEVKDIPGLTIKEKKELASYAHSLGKKLKSQLVGKSGVTPGLATS 182
           ETGVASELSS + +RNEVK+IP LTIKEKKELASYAH LGKKLKSQLVGKSGVTPGLATS
Sbjct: 2   ETGVASELSSPVMNRNEVKNIPSLTIKEKKELASYAHGLGKKLKSQLVGKSGVTPGLATS 61

Query: 183 FIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRPSITKMKAEE 242
           FIETLEANELLKIKILGNCPEELED VRKLEESTGSVVVNQIGRTVIIYRPSITKMKAEE
Sbjct: 62  FIETLEANELLKIKILGNCPEELEDVVRKLEESTGSVVVNQIGRTVIIYRPSITKMKAEE 121

Query: 243 EKRRARKIYVRKEPDRVKSILQVKIK 269
           EKRR+RKIY+RKEPDRVKSILQ K++
Sbjct: 122 EKRRSRKIYIRKEPDRVKSILQKKVE 147

BLAST of CcUC02G022320 vs. ExPASy TrEMBL
Match: A0A6J1C409 (uncharacterized protein LOC111007723 OS=Momordica charantia OX=3673 GN=LOC111007723 PE=4 SV=1)

HSP 1 Score: 235.0 bits (598), Expect = 4.0e-58
Identity = 128/170 (75.29%), Positives = 151/170 (88.82%), Query Frame = 0

Query: 100 ESDYEDEGEIEDFE-RDEAVGSGTETGVASELSSSMASRNEVKDIPGLTIKEKKELASYA 159
           +SDYE E  I DFE  +E++G  ++    S+LSSS+ + +EVK++P LT+KEKKELASYA
Sbjct: 9   DSDYEGEVRIVDFELEEESIGGESQADAVSQLSSSVVNGSEVKNLPSLTVKEKKELASYA 68

Query: 160 HSLGKKLKSQLVGKSGVTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGS 219
           HSLGKKLK+QLVGKSGVTPGLA SF+ETLEANELLKIKIL NCP EL+DAVR+LEESTGS
Sbjct: 69  HSLGKKLKTQLVGKSGVTPGLAISFVETLEANELLKIKILRNCPVELDDAVRQLEESTGS 128

Query: 220 VVVNQIGRTVIIYRPSITKMKAEEEKRRARKIYVRKEPDRVKSILQVKIK 269
           VVV+QIGRTVIIYRPSI+KMKAEE+KR+ARKIYVRKEPDRVKSILQ K++
Sbjct: 129 VVVSQIGRTVIIYRPSISKMKAEEKKRQARKIYVRKEPDRVKSILQNKVQ 178

BLAST of CcUC02G022320 vs. TAIR 10
Match: AT4G39040.1 (RNA-binding CRS1 / YhbY (CRM) domain protein )

HSP 1 Score: 145.6 bits (366), Expect = 6.2e-35
Identity = 116/261 (44.44%), Positives = 150/261 (57.47%), Query Frame = 0

Query: 9   HSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFLTRESPHHSRSFTRSA 68
           H+LL    P S    F +P   + + +  +    H+ KP   +F ++  P HS S     
Sbjct: 17  HNLLRHPKPPSPVCLFLRPFCFSASVSQSN----HRNKPRFQSFSSKPLPCHSASLV--- 76

Query: 69  SLSFTRSFSFTISVPSEVRFDEDDDGIEDEDES--DYEDEGEIEDFERDEAV---GSGTE 128
                +SFS +I  P      E+D+  EDED S  +YE E E E+ E+D  V     G E
Sbjct: 77  ----VKSFS-SIDEPDL----EEDEESEDEDFSAEEYEYEDEEEEDEQDSGVVVSERGIE 136

Query: 129 TGVASELSSSMASRNEVKD----------IPGLTIKEKKELASYAHSLGKKLKSQLVGKS 188
              ASE  S +  + E  +             L+IKEKKELASYAHSLG KLK QLVGKS
Sbjct: 137 DSEASEEVSEIGDKEEKTENTKKKKSRGSALKLSIKEKKELASYAHSLGDKLKCQLVGKS 196

Query: 189 GVTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRP 248
           GVT  +  SF+ETLE NELLK+KI    P+ELEDAV+ LEE+TGSV V QIGRTVI+YRP
Sbjct: 197 GVTDSVVFSFLETLEKNELLKVKIRKTSPDELEDAVQHLEEATGSVAVGQIGRTVILYRP 256

Query: 249 SITKMKAEEEKRRARKIYVRK 255
           S TKMKAE +K+   ++ + +
Sbjct: 257 SPTKMKAEAKKKEVERMSITR 261

BLAST of CcUC02G022320 vs. TAIR 10
Match: AT4G39040.2 (RNA-binding CRS1 / YhbY (CRM) domain protein )

HSP 1 Score: 145.6 bits (366), Expect = 6.2e-35
Identity = 116/261 (44.44%), Positives = 150/261 (57.47%), Query Frame = 0

Query: 9   HSLLLRSVPSSFSTFFSKPPASATAAAAISTVCFHQFKPNSSNFLTRESPHHSRSFTRSA 68
           H+LL    P S    F +P   + + +  +    H+ KP   +F ++  P HS S     
Sbjct: 17  HNLLRHPKPPSPVCLFLRPFCFSASVSQSN----HRNKPRFQSFSSKPLPCHSASLV--- 76

Query: 69  SLSFTRSFSFTISVPSEVRFDEDDDGIEDEDES--DYEDEGEIEDFERDEAV---GSGTE 128
                +SFS +I  P      E+D+  EDED S  +YE E E E+ E+D  V     G E
Sbjct: 77  ----VKSFS-SIDEPDL----EEDEESEDEDFSAEEYEYEDEEEEDEQDSGVVVSERGIE 136

Query: 129 TGVASELSSSMASRNEVKD----------IPGLTIKEKKELASYAHSLGKKLKSQLVGKS 188
              ASE  S +  + E  +             L+IKEKKELASYAHSLG KLK QLVGKS
Sbjct: 137 DSEASEEVSEIGDKEEKTENTKKKKSRGSALKLSIKEKKELASYAHSLGDKLKCQLVGKS 196

Query: 189 GVTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEESTGSVVVNQIGRTVIIYRP 248
           GVT  +  SF+ETLE NELLK+KI    P+ELEDAV+ LEE+TGSV V QIGRTVI+YRP
Sbjct: 197 GVTDSVVFSFLETLEKNELLKVKIRKTSPDELEDAVQHLEEATGSVAVGQIGRTVILYRP 256

Query: 249 SITKMKAEEEKRRARKIYVRK 255
           S TKMKAE +K+   ++ + +
Sbjct: 257 SPTKMKAEAKKKEVERMSITR 261

BLAST of CcUC02G022320 vs. TAIR 10
Match: AT2G21350.1 (RNA-binding CRS1 / YhbY (CRM) domain protein )

HSP 1 Score: 132.5 bits (332), Expect = 5.4e-31
Identity = 103/212 (48.58%), Positives = 128/212 (60.38%), Query Frame = 0

Query: 38  STVCFHQFKPNSSNFLTRESPHHSRSFTRSASLSFTRSF-SFT-ISVPSEVRFDEDDDGI 97
           S VC    +P S + L R+S HH+     S+SL   R + SF+ + VP    F   D+  
Sbjct: 24  SPVCLF-LRPFSVSLLGRKSSHHA---WISSSLPLPRPYISFSPLLVPKS--FSSVDESE 83

Query: 98  EDEDE-SDYEDEGEIEDFERDEAVGSGTETGVASELSSSMASRNEVKDIPGLTIKEKKEL 157
           +DED  S      EI D         G E G  SEL+      N       L+ KEK++L
Sbjct: 84  QDEDNVSLVASLNEIND--------DGQEDG--SELTMVSMRENRRSSALELSAKEKRKL 143

Query: 158 ASYAHSLGKKLKSQLVGKSGVTPGLATSFIETLEANELLKIKILGNCPEELEDAVRKLEE 217
           ASYAH LG KLKSQLVGKSGVT  +  SF+ETLE NELLK+KI   CP ELED + +LEE
Sbjct: 144 ASYAHHLGDKLKSQLVGKSGVTDSVVLSFVETLEKNELLKVKIHRTCPGELEDMILRLEE 203

Query: 218 STGSVVVNQIGRTVIIYRPSITKMKAEEEKRR 247
           +TGSV V QI RTVI+YRPS TK+KA+E+K+R
Sbjct: 204 ATGSVSVGQIARTVILYRPSPTKLKADEDKKR 219

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004148536.1 | 1.2e-104 | 84.07 | uncharacterized protein LOC101213060 [Cucumis sativus] >KGN43169.1 hypothetical ...
XP_038888218.1 | 4.5e-104 | 84.81 | uncharacterized protein LOC120078082 [Benincasa hispida]
XP_008448058.1 | 8.5e-103 | 80.66 | PREDICTED: uncharacterized protein LOC103490350 [Cucumis melo]
KAG7022267.1 | 3.0e-100 | 75.34 | yqeI [Cucurbita argyrosperma subsp. argyrosperma]
XP_022971306.1 | 3.4e-91 | 74.64 | uncharacterized protein LOC111470060 [Cucurbita maxima]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P54454 | 2.8e-08 | 38.20 | Probable RNA-binding protein YqeI OS=Bacillus subtilis (strain 168) OX=224308 GN...
P0AGK6 | 1.7e-05 | 33.71 | RNA-binding protein YhbY OS=Escherichia coli O157:H7 OX=83334 GN=yhbY PE=4 SV=1
P0AGK5 | 1.7e-05 | 33.71 | RNA-binding protein YhbY OS=Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 ...
P0AGK4 | 1.7e-05 | 33.71 | RNA-binding protein YhbY OS=Escherichia coli (strain K12) OX=83333 GN=yhbY PE=1 ...
P71376 | 9.4e-04 | 29.21 | RNA-binding protein HI_1333 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 1...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0K2C3 | 5.8e-105 | 84.07 | CRM domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G004710 PE=4 SV...
A0A1S3BJF7 | 4.1e-103 | 80.66 | uncharacterized protein LOC103490350 OS=Cucumis melo OX=3656 GN=LOC103490350 PE=...
A0A6J1I2Z3 | 1.6e-91 | 74.64 | uncharacterized protein LOC111470060 OS=Cucurbita maxima OX=3661 GN=LOC111470060...
A0A5D3DFJ1 | 4.2e-63 | 91.10 | Putative RNA-binding CRS1 / YhbY domain protein OS=Cucumis melo var. makuwa OX=1...
A0A6J1C409 | 4.0e-58 | 75.29 | uncharacterized protein LOC111007723 OS=Momordica charantia OX=3673 GN=LOC111007...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G39040.1 | 6.2e-35 | 44.44 | RNA-binding CRS1 / YhbY (CRM) domain protein
AT4G39040.2 | 6.2e-35 | 44.44 | RNA-binding CRS1 / YhbY (CRM) domain protein
AT2G21350.1 | 5.4e-31 | 48.58 | RNA-binding CRS1 / YhbY (CRM) domain protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001890 | RNA-binding, CRM domain | SMART | SM01103 | CRS1_YhbY_2 | coord: 146..232; e-value: 4.7E-24; score: 95.9
IPR001890 | RNA-binding, CRM domain | PFAM | PF01985 | CRS1_YhbY | coord: 146..232; e-value: 6.0E-16; score: 58.4
IPR001890 | RNA-binding, CRM domain | PROSITE | PS51295 | CRM | coord: 144..243; score: 17.014265
IPR035920 | YhbY-like superfamily | GENE3D | 3.30.110.60 | | coord: 145..244; e-value: 3.8E-27; score: 96.1
IPR035920 | YhbY-like superfamily | SUPERFAMILY | 75471 | YhbY-like | coord: 146..234
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 88..109
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 89..109
None | No IPR available | PANTHER | PTHR47714 | CRS1/YHBY DOMAIN CONTAINING PROTEIN, EXPRESSED | coord: 6..268
None | No IPR available | PANTHER | PTHR47714:SF1 | CRS1/YHBY DOMAIN CONTAINING PROTEIN, EXPRESSED | coord: 6..268
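The five CRM / YhbY-like domain predictions above report slightly different boundaries. One simple way to summarize them is the region that every prediction covers. The sketch below computes that intersection; the helper names `span_length` and `intersection` are hypothetical, the coordinates are copied from the table, and inclusive residue numbering is an assumption.

```python
from typing import Iterable, Optional, Tuple

Span = Tuple[int, int]  # (start, end), assumed 1-based and inclusive

# Domain coordinates copied from the InterPro annotations above.
crm_hits = {
    "SMART SM01103": (146, 232),
    "PFAM PF01985": (146, 232),
    "PROSITE PS51295": (144, 243),
    "GENE3D 3.30.110.60": (145, 244),
    "SUPERFAMILY 75471": (146, 234),
}


def span_length(span: Span) -> int:
    """Number of residues in an inclusive span."""
    start, end = span
    return end - start + 1


def intersection(spans: Iterable[Span]) -> Optional[Span]:
    """Region shared by all spans, or None if they do not all overlap."""
    spans = list(spans)
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None


core = intersection(crm_hits.values())
print(core, span_length(core))
for name, span in crm_hits.items():
    print(name, span, span_length(span))
```

With these inputs the shared core is residues 146..232 (87 residues), which coincides with the SMART and Pfam boundaries.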

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CcUC02G022320.1 | CcUC02G022320.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003723 | RNA binding