Cla97C01G018540 (gene) Watermelon (97103) v2.5

Overview
NameCla97C01G018540
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionUnknown protein
LocationCla97Chr01: 31744272 .. 31744523 (-)
RNA-Seq ExpressionCla97C01G018540
SyntenyCla97C01G018540
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGATATTGCAATTCTTGTAGCAGAGGAATACGAGAGGAGGATGAAGAATCCAAGGCAAGATCTGGAGAAGCAACAGCTTCAAGATTTGGCTTCTGGGGTTTCCATCTCTGCAACTGCTACTTCGATTAGGATGAAGAAGATGATGGAAATGGCAAAACTGAATTCAGAAATGTCGGAATTCAAATGGGTTTTTGAACCCAAATCTCAAATTGGGCGAGCTGCTTCCACCGGATTCTTCTCGGCTTGA

mRNA sequence

ATGGCGGATATTGCAATTCTTGTAGCAGAGGAATACGAGAGGAGGATGAAGAATCCAAGGCAAGATCTGGAGAAGCAACAGCTTCAAGATTTGGCTTCTGGGGTTTCCATCTCTGCAACTGCTACTTCGATTAGGATGAAGAAGATGATGGAAATGGCAAAACTGAATTCAGAAATGTCGGAATTCAAATGGGTTTTTGAACCCAAATCTCAAATTGGGCGAGCTGCTTCCACCGGATTCTTCTCGGCTTGA

Coding sequence (CDS)

ATGGCGGATATTGCAATTCTTGTAGCAGAGGAATACGAGAGGAGGATGAAGAATCCAAGGCAAGATCTGGAGAAGCAACAGCTTCAAGATTTGGCTTCTGGGGTTTCCATCTCTGCAACTGCTACTTCGATTAGGATGAAGAAGATGATGGAAATGGCAAAACTGAATTCAGAAATGTCGGAATTCAAATGGGTTTTTGAACCCAAATCTCAAATTGGGCGAGCTGCTTCCACCGGATTCTTCTCGGCTTGA

Protein sequence

MADIAILVAEEYERRMKNPRQDLEKQQLQDLASGVSISATATSIRMKKMMEMAKLNSEMSEFKWVFEPKSQIGRAASTGFFSA
Homology
BLAST of Cla97C01G018540 vs. NCBI nr
Match: XP_038882836.1 (uncharacterized protein LOC120073972 [Benincasa hispida])

HSP 1 Score: 143.3 bits (360), Expect = 9.5e-31
Identity = 76/83 (91.57%), Postives = 79/83 (95.18%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLEKQQLQDLASGVSISATATSIRMKKMMEMAKLNSEMS 60
          MADIAILVAEEYERRMKNP Q  EKQ+LQDLASGVSISATATSIRMKKMMEMAK NS++S
Sbjct: 1  MADIAILVAEEYERRMKNPTQVQEKQELQDLASGVSISATATSIRMKKMMEMAKQNSQIS 60

Query: 61 EFKWVFEPKSQIGRAASTGFFSA 84
          EFKWVFEPKSQIGRAASTGFFSA
Sbjct: 61 EFKWVFEPKSQIGRAASTGFFSA 83

BLAST of Cla97C01G018540 vs. NCBI nr
Match: KAG6604012.1 (hypothetical protein SDJN03_04621, partial [Cucurbita argyrosperma subsp. sororia] >KAG7034176.1 hypothetical protein SDJN02_03903, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 134.0 bits (336), Expect = 5.8e-28
Identity = 70/83 (84.34%), Postives = 78/83 (93.98%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLEKQQLQDLASGVSISATATSIRMKKMMEMAKLNSEMS 60
          MADIAILVAEEYERRMKNPRQ  E Q+L+DL+SGVSISATA++IRMKKMME+A  N+EM+
Sbjct: 1  MADIAILVAEEYERRMKNPRQAQETQKLEDLSSGVSISATASAIRMKKMMEIAIQNAEMT 60

Query: 61 EFKWVFEPKSQIGRAASTGFFSA 84
          EFKWVFEPKSQIGRAASTGFFSA
Sbjct: 61 EFKWVFEPKSQIGRAASTGFFSA 83

BLAST of Cla97C01G018540 vs. NCBI nr
Match: XP_011657992.1 (uncharacterized protein LOC105435930 [Cucumis sativus] >KGN48782.1 hypothetical protein Csa_003490 [Cucumis sativus])

HSP 1 Score: 122.9 bits (307), Expect = 1.3e-24
Identity = 71/89 (79.78%), Postives = 79/89 (88.76%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPR--QDLEKQQLQDL-ASGVSISATATSIRMKKMMEMAK--- 60
          MADIAILVAEEYERR KNPR  Q++ +Q+LQ+L +SGVSISATATSIRMKKMMEMAK   
Sbjct: 1  MADIAILVAEEYERRTKNPRRGQEMIRQELQELGSSGVSISATATSIRMKKMMEMAKKHN 60

Query: 61 LNSEMSEFKWVFEPKSQIGRAASTGFFSA 84
           ++EMSEF WVFEPKSQIGRAASTGFFSA
Sbjct: 61 YSAEMSEFNWVFEPKSQIGRAASTGFFSA 89

BLAST of Cla97C01G018540 vs. NCBI nr
Match: XP_016899241.1 (PREDICTED: uncharacterized protein LOC103485003 [Cucumis melo] >KAA0036271.1 uncharacterized protein E6C27_scaffold18G00700 [Cucumis melo var. makuwa] >TYK12665.1 uncharacterized protein E5676_scaffold255G002530 [Cucumis melo var. makuwa])

HSP 1 Score: 120.6 bits (301), Expect = 6.6e-24
Identity = 70/88 (79.55%), Postives = 76/88 (86.36%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLE--KQQLQDLASGVSISATATSIRMKKMMEMAKLN-- 60
          MADIAILVAEEYERR K+PR+D E  +Q+LQDL SGVSISATATSI MKKMMEMAK N  
Sbjct: 1  MADIAILVAEEYERRTKHPRRDQEMVRQELQDLGSGVSISATATSITMKKMMEMAKRNNY 60

Query: 61 -SEMSEFKWVFEPKSQIGRAASTGFFSA 84
           +E+SEF  VFEPKSQIGRAASTGFFSA
Sbjct: 61 SAEISEFNRVFEPKSQIGRAASTGFFSA 88

BLAST of Cla97C01G018540 vs. NCBI nr
Match: XP_023003555.1 (uncharacterized protein LOC111497122 [Cucurbita maxima])

HSP 1 Score: 110.9 bits (276), Expect = 5.2e-21
Identity = 63/84 (75.00%), Postives = 72/84 (85.71%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLE-KQQLQDLASGVSISATATSIRMKKMMEMAKLNSEM 60
          MADIAILVAEEYERRMKNPRQ+ +   ++QDL SGVSISATA+  R+KKMME+    +E 
Sbjct: 1  MADIAILVAEEYERRMKNPRQEEKAAAEVQDLVSGVSISATAS--RVKKMMELELELAER 60

Query: 61 SEFKWVFEPKSQIGRAASTGFFSA 84
          +EFKWVFEPKSQIGRAASTGFFSA
Sbjct: 61 NEFKWVFEPKSQIGRAASTGFFSA 82

BLAST of Cla97C01G018540 vs. ExPASy TrEMBL
Match: A0A0A0KK75 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G501230 PE=4 SV=1)

HSP 1 Score: 122.9 bits (307), Expect = 6.5e-25
Identity = 71/89 (79.78%), Postives = 79/89 (88.76%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPR--QDLEKQQLQDL-ASGVSISATATSIRMKKMMEMAK--- 60
          MADIAILVAEEYERR KNPR  Q++ +Q+LQ+L +SGVSISATATSIRMKKMMEMAK   
Sbjct: 1  MADIAILVAEEYERRTKNPRRGQEMIRQELQELGSSGVSISATATSIRMKKMMEMAKKHN 60

Query: 61 LNSEMSEFKWVFEPKSQIGRAASTGFFSA 84
           ++EMSEF WVFEPKSQIGRAASTGFFSA
Sbjct: 61 YSAEMSEFNWVFEPKSQIGRAASTGFFSA 89

BLAST of Cla97C01G018540 vs. ExPASy TrEMBL
Match: A0A5A7T3Y6 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G002530 PE=4 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 3.2e-24
Identity = 70/88 (79.55%), Postives = 76/88 (86.36%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLE--KQQLQDLASGVSISATATSIRMKKMMEMAKLN-- 60
          MADIAILVAEEYERR K+PR+D E  +Q+LQDL SGVSISATATSI MKKMMEMAK N  
Sbjct: 1  MADIAILVAEEYERRTKHPRRDQEMVRQELQDLGSGVSISATATSITMKKMMEMAKRNNY 60

Query: 61 -SEMSEFKWVFEPKSQIGRAASTGFFSA 84
           +E+SEF  VFEPKSQIGRAASTGFFSA
Sbjct: 61 SAEISEFNRVFEPKSQIGRAASTGFFSA 88

BLAST of Cla97C01G018540 vs. ExPASy TrEMBL
Match: A0A1S4DTC5 (uncharacterized protein LOC103485003 OS=Cucumis melo OX=3656 GN=LOC103485003 PE=4 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 3.2e-24
Identity = 70/88 (79.55%), Postives = 76/88 (86.36%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLE--KQQLQDLASGVSISATATSIRMKKMMEMAKLN-- 60
          MADIAILVAEEYERR K+PR+D E  +Q+LQDL SGVSISATATSI MKKMMEMAK N  
Sbjct: 1  MADIAILVAEEYERRTKHPRRDQEMVRQELQDLGSGVSISATATSITMKKMMEMAKRNNY 60

Query: 61 -SEMSEFKWVFEPKSQIGRAASTGFFSA 84
           +E+SEF  VFEPKSQIGRAASTGFFSA
Sbjct: 61 SAEISEFNRVFEPKSQIGRAASTGFFSA 88

BLAST of Cla97C01G018540 vs. ExPASy TrEMBL
Match: A0A6J1KS29 (uncharacterized protein LOC111497122 OS=Cucurbita maxima OX=3661 GN=LOC111497122 PE=4 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 2.5e-21
Identity = 63/84 (75.00%), Postives = 72/84 (85.71%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLE-KQQLQDLASGVSISATATSIRMKKMMEMAKLNSEM 60
          MADIAILVAEEYERRMKNPRQ+ +   ++QDL SGVSISATA+  R+KKMME+    +E 
Sbjct: 1  MADIAILVAEEYERRMKNPRQEEKAAAEVQDLVSGVSISATAS--RVKKMMELELELAER 60

Query: 61 SEFKWVFEPKSQIGRAASTGFFSA 84
          +EFKWVFEPKSQIGRAASTGFFSA
Sbjct: 61 NEFKWVFEPKSQIGRAASTGFFSA 82

BLAST of Cla97C01G018540 vs. ExPASy TrEMBL
Match: A0A6J1HEA8 (uncharacterized protein LOC111463202 OS=Cucurbita moschata OX=3662 GN=LOC111463202 PE=4 SV=1)

HSP 1 Score: 109.8 bits (273), Expect = 5.7e-21
Identity = 62/84 (73.81%), Postives = 72/84 (85.71%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLE-KQQLQDLASGVSISATATSIRMKKMMEMAKLNSEM 60
          MADIAILVAEEYERRMKNPRQ+ +   ++QDL SGVSISATA+  R+KKMME+    ++ 
Sbjct: 1  MADIAILVAEEYERRMKNPRQEQKAAAEVQDLVSGVSISATAS--RVKKMMELELELAQP 60

Query: 61 SEFKWVFEPKSQIGRAASTGFFSA 84
          +EFKWVFEPKSQIGRAASTGFFSA
Sbjct: 61 NEFKWVFEPKSQIGRAASTGFFSA 82

BLAST of Cla97C01G018540 vs. TAIR 10
Match: AT5G65207.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G10040.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 41.2 bits (95), Expect = 4.8e-04
Identity = 35/84 (41.67%), Postives = 42/84 (50.00%), Query Frame = 0

Query: 1  MADIAILVAEEYERRMKNPRQDLEKQQLQ-DLASGVSISATATSIRMKKMMEMAKLNSEM 60
          MADIAILVAEEYERRMK+         ++ D    +    T    +MK  +E  K N   
Sbjct: 1  MADIAILVAEEYERRMKHTTVGSRSSPVEFDWWKIIPAKMTIAFDKMK--IESLKKN--- 60

Query: 61 SEFKWVFEPKSQIGRAASTGFFSA 84
                FE KS+   A S GFFSA
Sbjct: 61 ------FEAKSEFALAISHGFFSA 73

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038882836.19.5e-3191.57uncharacterized protein LOC120073972 [Benincasa hispida][more]
KAG6604012.15.8e-2884.34hypothetical protein SDJN03_04621, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_011657992.11.3e-2479.78uncharacterized protein LOC105435930 [Cucumis sativus] >KGN48782.1 hypothetical ... [more]
XP_016899241.16.6e-2479.55PREDICTED: uncharacterized protein LOC103485003 [Cucumis melo] >KAA0036271.1 unc... [more]
XP_023003555.15.2e-2175.00uncharacterized protein LOC111497122 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KK756.5e-2579.78Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G501230 PE=4 SV=1[more]
A0A5A7T3Y63.2e-2479.55Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S4DTC53.2e-2479.55uncharacterized protein LOC103485003 OS=Cucumis melo OX=3656 GN=LOC103485003 PE=... [more]
A0A6J1KS292.5e-2175.00uncharacterized protein LOC111497122 OS=Cucurbita maxima OX=3661 GN=LOC111497122... [more]
A0A6J1HEA85.7e-2173.81uncharacterized protein LOC111463202 OS=Cucurbita moschata OX=3662 GN=LOC1114632... [more]
Match NameE-valueIdentityDescription
AT5G65207.14.8e-0441.67unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 9..29
NoneNo IPR availablePANTHERPTHR36067EXPRESSED PROTEINcoord: 1..83

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G018540.1Cla97C01G018540.1mRNA