Cla97C03G055920 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C03G055920
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: DUF4228 domain-containing protein
Location: Cla97Chr03: 4815392 .. 4815742 (+)
RNA-Seq Expression: Cla97C03G055920
Synteny: Cla97C03G055920
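
The Location field can be turned into a feature length with a little parsing. The sketch below is illustrative only: the function names and the regular expression are assumptions, not part of the database, and it assumes the coordinates are 1-based and inclusive (which agrees with the 351-nucleotide sequences listed for this gene).

```python
import re

# Hypothetical pattern for a location string such as
# "Cla97Chr03: 4815392 .. 4815742 (+)"
LOCATION_RE = re.compile(
    r"^(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)$"
)


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return (
        match["seqid"],
        int(match["start"]),
        int(match["end"]),
        match["strand"],
    )


def feature_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cla97Chr03: 4815392 .. 4815742 (+)")
length = feature_length(start, end)
print(seqid, strand, length)
```

Under that assumption the length is divisible by three, as expected for a gene whose coding sequence spans the whole feature.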
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGCAACTGTTGTAGAAGCCAATCATCTACAATGATCTTCGCCGGTGACGATTGGGGCTCCTTCAGTTCCAAGCACAGCCGTGGCTGCGGCGGAAACGCCACAACCTCCGGCGGCGATCCAGAGAAGCATAGACTACTTGGAGGGAAGGAAGCAGCGTCGTGTGCGCAGCCGCAAGTGAAAATCAAGATGACGAAGAGAGAGCTTGAGGATCTGGTGAAGAAGCTGGAGATGCAAGGATTGAGTCTAGAACAAGTTATTGGTCGATTGGTGAACGACGAAGATGAATTTGAAGTTGAGCATCATCGATCATGGAGGCCTTCCCTTCAAAGCATTCCAGAGGATTACTGA

mRNA sequence

ATGGGCAACTGTTGTAGAAGCCAATCATCTACAATGATCTTCGCCGGTGACGATTGGGGCTCCTTCAGTTCCAAGCACAGCCGTGGCTGCGGCGGAAACGCCACAACCTCCGGCGGCGATCCAGAGAAGCATAGACTACTTGGAGGGAAGGAAGCAGCGTCGTGTGCGCAGCCGCAAGTGAAAATCAAGATGACGAAGAGAGAGCTTGAGGATCTGGTGAAGAAGCTGGAGATGCAAGGATTGAGTCTAGAACAAGTTATTGGTCGATTGGTGAACGACGAAGATGAATTTGAAGTTGAGCATCATCGATCATGGAGGCCTTCCCTTCAAAGCATTCCAGAGGATTACTGA

Coding sequence (CDS)

ATGGGCAACTGTTGTAGAAGCCAATCATCTACAATGATCTTCGCCGGTGACGATTGGGGCTCCTTCAGTTCCAAGCACAGCCGTGGCTGCGGCGGAAACGCCACAACCTCCGGCGGCGATCCAGAGAAGCATAGACTACTTGGAGGGAAGGAAGCAGCGTCGTGTGCGCAGCCGCAAGTGAAAATCAAGATGACGAAGAGAGAGCTTGAGGATCTGGTGAAGAAGCTGGAGATGCAAGGATTGAGTCTAGAACAAGTTATTGGTCGATTGGTGAACGACGAAGATGAATTTGAAGTTGAGCATCATCGATCATGGAGGCCTTCCCTTCAAAGCATTCCAGAGGATTACTGA

Protein sequence

MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGGKEAASCAQPQVKIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPEDY
Homology
BLAST of Cla97C03G055920 vs. NCBI nr
Match: XP_038895954.1 (uncharacterized protein LOC120084128 [Benincasa hispida])

HSP 1 Score: 211.1 bits (536), Expect = 5.2e-51
Identity = 105/117 (89.74%), Positives = 111/117 (94.87%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGGKEAASCAQPQV 60
           MGNCCRSQSSTM+F GDDWGSFSSKH RG GGNATTSGGDPEK+RLLG KEAASC+QPQV
Sbjct: 1   MGNCCRSQSSTMVFVGDDWGSFSSKHGRGRGGNATTSGGDPEKYRLLGEKEAASCSQPQV 60

Query: 61  KIKMTKRELEDLVKKLEMQGLSLEQVIGRLV-NDEDEFEVEHHRSWRPSLQSIPEDY 117
           KIKM+KRELEDLVKKLEMQG+SLEQVIGR+V N EDEFEVEHHRSWRPSLQSIPEDY
Sbjct: 61  KIKMSKRELEDLVKKLEMQGMSLEQVIGRMVMNGEDEFEVEHHRSWRPSLQSIPEDY 117
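
The Identity figure in each HSP header is the number of identical columns divided by the alignment length, gap columns included. The sketch below shows that arithmetic; the helper names are hypothetical, and the short aligned strings in the usage line are toy data rather than rows from this report.

```python
def percent(count, total):
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * count / total, 2)


def alignment_identity(query, subject):
    """Count identical columns in two aligned strings of equal length.

    Gap characters ('-') never count as matches, but gap columns do count
    towards the alignment length.
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have the same length")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return matches, len(query), percent(matches, len(query))


# Toy example: four identical columns out of six
print(alignment_identity("MGNC-R", "MGNCCK"))

# Reproducing the header values of the HSP above from its counts
print(percent(105, 117), percent(111, 117))
```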

BLAST of Cla97C03G055920 vs. NCBI nr
Match: KAA0056859.1 (DUF4228 domain-containing protein [Cucumis melo var. makuwa] >TYJ99362.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 208.4 bits (529), Expect = 3.4e-50
Identity = 102/117 (87.18%), Positives = 110/117 (94.02%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGG-KEAASCAQPQ 60
           MGNCCRSQSSTM+FAGDDWGSFSSKH RGCGGNATTSGGDPEKHRLLG  KEAASC+QP 
Sbjct: 1   MGNCCRSQSSTMVFAGDDWGSFSSKHGRGCGGNATTSGGDPEKHRLLGDQKEAASCSQPL 60

Query: 61  VKIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPEDY 117
           VKIKMTKRELE LVKKLEMQGL+LEQVIGR++  E+E+E+EHHRSWRPSLQSIPEDY
Sbjct: 61  VKIKMTKRELEVLVKKLEMQGLTLEQVIGRMMKGEEEYEIEHHRSWRPSLQSIPEDY 117

BLAST of Cla97C03G055920 vs. NCBI nr
Match: XP_022933211.1 (uncharacterized protein LOC111440057 [Cucurbita moschata])

HSP 1 Score: 199.9 bits (507), Expect = 1.2e-47
Identity = 97/114 (85.09%), Positives = 106/114 (92.98%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGGKEAASCAQPQV 60
           MGNCCRSQSSTMIFAGDDWGSFSSKH RG GGNA+TS GDPEK+RLLG K+ +SC QPQV
Sbjct: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHGRGRGGNASTSSGDPEKYRLLGEKQTSSCPQPQV 60

Query: 61  KIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPE 115
           KIKMTKRELEDLVK LE++GLSLEQVIGR++NDEDEF+ EHHRSWRPSLQSIPE
Sbjct: 61  KIKMTKRELEDLVKNLEIEGLSLEQVIGRMMNDEDEFQGEHHRSWRPSLQSIPE 114

BLAST of Cla97C03G055920 vs. NCBI nr
Match: KAG6584086.1 (hypothetical protein SDJN03_20018, partial [Cucurbita argyrosperma subsp. sororia] >KAG7019687.1 hypothetical protein SDJN02_18650, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 199.5 bits (506), Expect = 1.6e-47
Identity = 97/114 (85.09%), Positives = 105/114 (92.11%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGGKEAASCAQPQV 60
           MGNCCRSQSSTMIFAGDDWGSFSSKH RG GGNA+TS GDPEK+RLLG K+ +SC QPQV
Sbjct: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHGRGRGGNASTSSGDPEKYRLLGEKQTSSCPQPQV 60

Query: 61  KIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPE 115
           KIKMTKRELEDLVK LE++GLSLEQVIGR+ NDEDEF+ EHHRSWRPSLQSIPE
Sbjct: 61  KIKMTKRELEDLVKNLEIEGLSLEQVIGRMTNDEDEFQGEHHRSWRPSLQSIPE 114

BLAST of Cla97C03G055920 vs. NCBI nr
Match: XP_023001321.1 (uncharacterized protein LOC111495485 [Cucurbita maxima])

HSP 1 Score: 196.8 bits (499), Expect = 1.0e-46
Identity = 95/114 (83.33%), Positives = 105/114 (92.11%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGGKEAASCAQPQV 60
           MGNCCRSQSSTM+FAGDDWGSFSSKH RG GGNA+ S GDPEK+RLLG K+ +SC QPQV
Sbjct: 1   MGNCCRSQSSTMVFAGDDWGSFSSKHGRGRGGNASASRGDPEKYRLLGEKQTSSCPQPQV 60

Query: 61  KIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPE 115
           KIKMTKRELEDLVK LE++GLSLEQVIGR++NDEDEF+ EHHRSWRPSLQSIPE
Sbjct: 61  KIKMTKRELEDLVKNLEIEGLSLEQVIGRMMNDEDEFQGEHHRSWRPSLQSIPE 114

BLAST of Cla97C03G055920 vs. ExPASy TrEMBL
Match: A0A5A7UTK3 (DUF4228 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005800 PE=4 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 1.6e-50
Identity = 102/117 (87.18%), Positives = 110/117 (94.02%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGG-KEAASCAQPQ 60
           MGNCCRSQSSTM+FAGDDWGSFSSKH RGCGGNATTSGGDPEKHRLLG  KEAASC+QP 
Sbjct: 1   MGNCCRSQSSTMVFAGDDWGSFSSKHGRGCGGNATTSGGDPEKHRLLGDQKEAASCSQPL 60

Query: 61  VKIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPEDY 117
           VKIKMTKRELE LVKKLEMQGL+LEQVIGR++  E+E+E+EHHRSWRPSLQSIPEDY
Sbjct: 61  VKIKMTKRELEVLVKKLEMQGLTLEQVIGRMMKGEEEYEIEHHRSWRPSLQSIPEDY 117

BLAST of Cla97C03G055920 vs. ExPASy TrEMBL
Match: A0A0A0LRL1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G063490 PE=4 SV=1)

HSP 1 Score: 203.4 bits (516), Expect = 5.3e-49
Identity = 101/118 (85.59%), Positives = 107/118 (90.68%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGN--ATTSGGDPEKHRLLGGKEAASCAQP 60
           MGNCCRSQSSTM+FAGDDWGSFSSKH RGCG N  A TSGGDPEKHRLLG KEA SC+QP
Sbjct: 1   MGNCCRSQSSTMVFAGDDWGSFSSKHGRGCGRNVSAATSGGDPEKHRLLGEKEAGSCSQP 60

Query: 61  QVKIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPEDY 117
            VKIKMTKRELE LVKKLEMQGLSLEQVIGR++  E+EFE+EHHRSWRPSLQSIPEDY
Sbjct: 61  LVKIKMTKRELEVLVKKLEMQGLSLEQVIGRMMKGEEEFEIEHHRSWRPSLQSIPEDY 118

BLAST of Cla97C03G055920 vs. ExPASy TrEMBL
Match: A0A6J1EYG4 (uncharacterized protein LOC111440057 OS=Cucurbita moschata OX=3662 GN=LOC111440057 PE=4 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 5.8e-48
Identity = 97/114 (85.09%), Positives = 106/114 (92.98%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGGKEAASCAQPQV 60
           MGNCCRSQSSTMIFAGDDWGSFSSKH RG GGNA+TS GDPEK+RLLG K+ +SC QPQV
Sbjct: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHGRGRGGNASTSSGDPEKYRLLGEKQTSSCPQPQV 60

Query: 61  KIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPE 115
           KIKMTKRELEDLVK LE++GLSLEQVIGR++NDEDEF+ EHHRSWRPSLQSIPE
Sbjct: 61  KIKMTKRELEDLVKNLEIEGLSLEQVIGRMMNDEDEFQGEHHRSWRPSLQSIPE 114

BLAST of Cla97C03G055920 vs. ExPASy TrEMBL
Match: A0A6J1KIA5 (uncharacterized protein LOC111495485 OS=Cucurbita maxima OX=3661 GN=LOC111495485 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 4.9e-47
Identity = 95/114 (83.33%), Positives = 105/114 (92.11%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGGKEAASCAQPQV 60
           MGNCCRSQSSTM+FAGDDWGSFSSKH RG GGNA+ S GDPEK+RLLG K+ +SC QPQV
Sbjct: 1   MGNCCRSQSSTMVFAGDDWGSFSSKHGRGRGGNASASRGDPEKYRLLGEKQTSSCPQPQV 60

Query: 61  KIKMTKRELEDLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPE 115
           KIKMTKRELEDLVK LE++GLSLEQVIGR++NDEDEF+ EHHRSWRPSLQSIPE
Sbjct: 61  KIKMTKRELEDLVKNLEIEGLSLEQVIGRMMNDEDEFQGEHHRSWRPSLQSIPE 114

BLAST of Cla97C03G055920 vs. ExPASy TrEMBL
Match: A0A1S3BEX3 (uncharacterized protein LOC103489132 OS=Cucumis melo OX=3656 GN=LOC103489132 PE=4 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 1.5e-43
Identity = 91/106 (85.85%), Positives = 99/106 (93.40%), Query Frame = 0

Query: 12  MIFAGDDWGSFSSKHSRGCGGNATTSGGDPEKHRLLGG-KEAASCAQPQVKIKMTKRELE 71
           M+FAGDDWGSFSSKH RGCGGNATTSGGDPEKHRLLG  KEAASC+QP VKIKMTKRELE
Sbjct: 1   MVFAGDDWGSFSSKHGRGCGGNATTSGGDPEKHRLLGDQKEAASCSQPLVKIKMTKRELE 60

Query: 72  DLVKKLEMQGLSLEQVIGRLVNDEDEFEVEHHRSWRPSLQSIPEDY 117
            LVKKLEMQGL+LEQVIGR++  E+E+E+EHHRSWRPSLQSIPEDY
Sbjct: 61  VLVKKLEMQGLTLEQVIGRMMKGEEEYEIEHHRSWRPSLQSIPEDY 106

BLAST of Cla97C03G055920 vs. TAIR 10
Match: AT4G21920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20340.1); Has 40 Blast hits to 40 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 64.7 bits (156), Expect = 5.6e-11
Identity = 40/127 (31.50%), Positives = 71/127 (55.91%), Query Frame = 0

Query: 1   MGNC-CRSQSSTMIFAGDDWGSFSSKHSRGCGGNATTSGGDPEK-----HRLLGGKEAAS 60
           MGNC C ++ +T  ++GDD GS++ +  R           D EK       +     ++S
Sbjct: 1   MGNCICVTEKTTTSWSGDDNGSYNKRRRRRRSTVVHDDNDDGEKLLGETSNVTSTSSSSS 60

Query: 61  CAQPQVKIKMTKRELEDLVKKLEMQGLSLEQVIGRLV---NDEDEFE----VEHHRSWRP 115
             + ++KI++TK+ELEDL++ + ++ L+ E+++ +L+    D+  F       HH+ W+P
Sbjct: 61  SERREIKIRITKKELEDLMRNIGLKSLTAEEILSKLIFEGGDQIGFSAVDVTNHHQPWKP 120

BLAST of Cla97C03G055920 vs. TAIR 10
Match: AT3G20340.1 (Expression of the gene is downregulated in the presence of paraquat, an inducer of photoxidative stress. )

HSP 1 Score: 55.8 bits (133), Expect = 2.6e-08
Identity = 39/117 (33.33%), Positives = 62/117 (52.99%), Query Frame = 0

Query: 1   MGNCCRSQSSTMIFAGDDWGSFSSKHSRGCG-GNATTSGGDPEKHRLLGGKEAASCAQPQ 60
           MGNC R +S  M +AG+DW  F ++        + TT  G P    ++     +S    +
Sbjct: 1   MGNCLRHESE-MHWAGEDWDEFITEDEEDHHYSSKTTRDGKPV---IVTRDSKSSVPSHE 60

Query: 61  VKIKMTKRELEDLVKKLEMQGLSLEQVIGR--LVNDEDEFEVEHHRSWRPSLQSIPE 115
           +KI++TK++L DL+ K+ +  L+ +Q      ++N+    E    R WRP LQSIPE
Sbjct: 61  IKIRLTKKQLHDLLSKVNVHDLTFQQQTFSCPILNNRGYEEANQQRLWRPVLQSIPE 113

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_038895954.1  5.2e-51  89.74     uncharacterized protein LOC120084128 [Benincasa hispida]
KAA0056859.1    3.4e-50  87.18     DUF4228 domain-containing protein [Cucumis melo var. makuwa] >TYJ99362.1 DUF4228...
XP_022933211.1  1.2e-47  85.09     uncharacterized protein LOC111440057 [Cucurbita moschata]
KAG6584086.1    1.6e-47  85.09     hypothetical protein SDJN03_20018, partial [Cucurbita argyrosperma subsp. sorori...
XP_023001321.1  1.0e-46  83.33     uncharacterized protein LOC111495485 [Cucurbita maxima]

Match Name      E-value  Identity  Description
A0A5A7UTK3      1.6e-50  87.18     DUF4228 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A0A0LRL1      5.3e-49  85.59     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G063490 PE=4 SV=1
A0A6J1EYG4      5.8e-48  85.09     uncharacterized protein LOC111440057 OS=Cucurbita moschata OX=3662 GN=LOC1114400...
A0A6J1KIA5      4.9e-47  83.33     uncharacterized protein LOC111495485 OS=Cucurbita maxima OX=3661 GN=LOC111495485...
A0A1S3BEX3      1.5e-43  85.85     uncharacterized protein LOC103489132 OS=Cucumis melo OX=3656 GN=LOC103489132 PE=...

Match Name      E-value  Identity  Description
AT4G21920.1     5.6e-11  31.50     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT3G20340.1     2.6e-08  33.33     Expression of the gene is downregulated in the presence of paraquat, an inducer ...
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term     Source Description                Alignment
None      No IPR available  COILS        Coil            Coil                              coord: 66..86
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction               coord: 25..52
None      No IPR available  PANTHER      PTHR33647       OS01G0793900 PROTEIN              coord: 1..114
None      No IPR available  PANTHER      PTHR33647:SF10  DOMAIN PROTEIN, PUTATIVE-RELATED  coord: 1..114
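
The alignment coordinates in the InterPro rows can be used to slice the corresponding region out of the protein sequence. This is a rough sketch only: the helper names are hypothetical, and it assumes the `coord:` values are 1-based, inclusive residue positions. The ten-residue string in the usage lines is the start of the protein sequence listed for this gene.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 66..86'."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def extract_region(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive positions."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("range falls outside the sequence")
    return sequence[start - 1:end]


start, end = parse_coord("coord: 66..86")
print(start, end, end - start + 1)  # span of the predicted coil, in residues

# Slicing the first three residues of the protein's N-terminus
print(extract_region("MGNCCRSQSS", 1, 3))
```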

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C03G055920.1  Cla97C03G055920.1  mRNA