Cla97C07G141420 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C07G141420
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Unknown protein
Location: Cla97Chr07: 29041581 .. 29041952 (+)
RNA-Seq Expression: Cla97C07G141420
Synteny: Cla97C07G141420
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCAAAGCAATGAGCAGGAGGAAGGGCAGTCAGCGGCGGGATCGCGGTCGGGGATGCGTCGGGGGATTCAGTTTCGGAGGCGACGAATGCCGGTGGCACGACTCGGGGGGAAGAGATCGGGGCGGATCGTGGCATGGGGGAGGATGCTGAGGAAGATAAGACTGAGATGGGTGAAATTGAAATGCATGGCCATGGTGAAGAAGATTAAAAAATACTATACAAAGTTGATGAAGGATATAGTGGAAGGAGGAGTATATGGCTATCCTGACTCATATCAACATAGGCTTCTCTTGGAGACTTCTTTTGCAATTCCAATCCTGGGTGTCTCTCTCTCTACTCATTCCAGTATTCTTGCCACAACTTCTTAA

mRNA sequence

ATGGCTCAAAGCAATGAGCAGGAGGAAGGGCAGTCAGCGGCGGGATCGCGGTCGGGGATGCGTCGGGGGATTCAGTTTCGGAGGCGACGAATGCCGGTGGCACGACTCGGGGGGAAGAGATCGGGGCGGATCGTGGCATGGGGGAGGATGCTGAGGAAGATAAGACTGAGATGGGTGAAATTGAAATGCATGGCCATGGTGAAGAAGATTAAAAAATACTATACAAAGTTGATGAAGGATATAGTGGAAGGAGGAGTATATGGCTATCCTGACTCATATCAACATAGGCTTCTCTTGGAGACTTCTTTTGCAATTCCAATCCTGGGTGTCTCTCTCTCTACTCATTCCAGTATTCTTGCCACAACTTCTTAA

Coding sequence (CDS)

ATGGCTCAAAGCAATGAGCAGGAGGAAGGGCAGTCAGCGGCGGGATCGCGGTCGGGGATGCGTCGGGGGATTCAGTTTCGGAGGCGACGAATGCCGGTGGCACGACTCGGGGGGAAGAGATCGGGGCGGATCGTGGCATGGGGGAGGATGCTGAGGAAGATAAGACTGAGATGGGTGAAATTGAAATGCATGGCCATGGTGAAGAAGATTAAAAAATACTATACAAAGTTGATGAAGGATATAGTGGAAGGAGGAGTATATGGCTATCCTGACTCATATCAACATAGGCTTCTCTTGGAGACTTCTTTTGCAATTCCAATCCTGGGTGTCTCTCTCTCTACTCATTCCAGTATTCTTGCCACAACTTCTTAA

Protein sequence

MAQSNEQEEGQSAAGSRSGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWVKLKCMAMVKKIKKYYTKLMKDIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHSSILATTS
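
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene, the relationship can be checked in plain Python. The name `translate` and the way the codon table is built are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this table is the right one for a nuclear watermelon gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGCTCAAAGCAATGAGCAGGAGGAAGGG"))  # MAQSNEQEEG
print(translate("ACAACTTCTTAA"))  # TTS
```

The feature spans 29041952 - 29041581 + 1 = 372 bp, which is 124 codons: 123 residues plus the terminal TAA stop codon. This is consistent with the gene, mRNA and CDS sequences listed above appearing identical, as expected for a single exon with no intron.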
Homology
BLAST of Cla97C07G141420 vs. NCBI nr
Match: XP_038891072.1 (uncharacterized protein LOC120080455 [Benincasa hispida])

HSP 1 Score: 204.1 bits (518), Expect = 6.7e-49
Identity = 111/124 (89.52%), Positives = 115/124 (92.74%), Query Frame = 0

Query: 1   MAQSNEQEEGQSAAGSRSGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWVK 60
           MAQS EQ EGQ AAG R GMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWVK
Sbjct: 1   MAQSYEQGEGQPAAGLRWGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWVK 60

Query: 61  LKCMAMVKKIKKYYTKLMKDIVEGGVYGY-PDSYQHRLLLETSFAIPILGVSLSTHSSIL 120
           +KC+AMVKKIKKYYT LMKDI+EGG YGY  DSYQHRLL+ETSFAIPILGVSLSTHSSIL
Sbjct: 61  VKCIAMVKKIKKYYTSLMKDIMEGGAYGYAADSYQHRLLMETSFAIPILGVSLSTHSSIL 120

Query: 121 ATTS 124
           ATTS
Sbjct: 121 ATTS 124

BLAST of Cla97C07G141420 vs. NCBI nr
Match: KAG6581768.1 (hypothetical protein SDJN03_21770, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 182.6 bits (462), Expect = 2.1e-42
Identity = 97/123 (78.86%), Positives = 108/123 (87.80%), Query Frame = 0

Query: 1   MAQSNEQEEGQSAAGSRSGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWVK 60
           M Q  E EEGQ  AG R G RRG++FRRRRMPVARLGGKRSGR+VAWGRMLRKIRLRWVK
Sbjct: 1   MGQPYEPEEGQPGAGLRRGTRRGMRFRRRRMPVARLGGKRSGRLVAWGRMLRKIRLRWVK 60

Query: 61  LKCMAMVKKIKKYYTKLMKDIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHSSILA 120
           LKC+AMVKK+K+YYT+LMKDI+E G Y +  SYQHR+LLETSFAIPILGVSLSTHSS+LA
Sbjct: 61  LKCIAMVKKMKEYYTRLMKDIIEAGAYAH--SYQHRVLLETSFAIPILGVSLSTHSSVLA 120

Query: 121 TTS 124
            TS
Sbjct: 121 ATS 121

BLAST of Cla97C07G141420 vs. NCBI nr
Match: XP_022956317.1 (uncharacterized protein LOC111458050 [Cucurbita moschata])

HSP 1 Score: 177.9 bits (450), Expect = 5.2e-41
Identity = 98/127 (77.17%), Positives = 108/127 (85.04%), Query Frame = 0

Query: 1   MAQSNEQEEGQSAA----GSRSGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRL 60
           M Q  E EEGQ  A    G+R G RRG++FRRRRMPVARLGGKRSGR+VAWGRMLRKIRL
Sbjct: 1   MGQPYEPEEGQPGAGLRRGTRRGTRRGMRFRRRRMPVARLGGKRSGRLVAWGRMLRKIRL 60

Query: 61  RWVKLKCMAMVKKIKKYYTKLMKDIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHS 120
           RWVKLKC+AMVKK+K+YYT+LMKDI+E G Y    SYQHRLLLETSFAIPILGVSLSTHS
Sbjct: 61  RWVKLKCIAMVKKMKEYYTRLMKDIIEAGAYA--RSYQHRLLLETSFAIPILGVSLSTHS 120

Query: 121 SILATTS 124
           S+LA TS
Sbjct: 121 SVLAATS 125

BLAST of Cla97C07G141420 vs. NCBI nr
Match: KAG7018218.1 (hypothetical protein SDJN02_20086, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 175.6 bits (444), Expect = 2.6e-40
Identity = 95/123 (77.24%), Positives = 107/123 (86.99%), Query Frame = 0

Query: 1   MAQSNEQEEGQSAAGSRSGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWVK 60
           M Q  E EEGQ  A    G+RRG++FRRRRMPVARLGGKRSGR+VAWGRMLRKIRLRWVK
Sbjct: 1   MGQPYEPEEGQPGA----GLRRGMRFRRRRMPVARLGGKRSGRLVAWGRMLRKIRLRWVK 60

Query: 61  LKCMAMVKKIKKYYTKLMKDIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHSSILA 120
           LKC+AMVKK+K+YYT+LMKDI+E G Y +  SYQHR+LLETSFAIPILGVSLSTHSS+LA
Sbjct: 61  LKCIAMVKKMKEYYTRLMKDIIEAGAYAH--SYQHRVLLETSFAIPILGVSLSTHSSVLA 117

Query: 121 TTS 124
            TS
Sbjct: 121 ATS 117

BLAST of Cla97C07G141420 vs. NCBI nr
Match: XP_022948735.1 (uncharacterized protein LOC111452314 [Cucurbita moschata] >XP_023523769.1 uncharacterized protein LOC111787905 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 167.5 bits (423), Expect = 7.0e-38
Identity = 94/124 (75.81%), Positives = 104/124 (83.87%), Query Frame = 0

Query: 1   MAQSNEQEEGQSA-AGSRSGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWV 60
           M QS+EQEEGQ A AG R   R+GIQFRRRRM VARLGGKRSGR+VAWGR LRKIRLRWV
Sbjct: 1   MPQSSEQEEGQQAGAGLRWATRQGIQFRRRRMAVARLGGKRSGRLVAWGRRLRKIRLRWV 60

Query: 61  KLKCMAMVKKIKKYYTKLMKDIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHSSIL 120
           KLKC+AMVKK+KKY   LMKD++E G YG  DSYQHRLLLETSFAIPILGVS STHS+  
Sbjct: 61  KLKCIAMVKKVKKYCASLMKDMMEAGAYG--DSYQHRLLLETSFAIPILGVSFSTHSTHA 120

Query: 121 ATTS 124
           A ++
Sbjct: 121 AASA 122

BLAST of Cla97C07G141420 vs. ExPASy TrEMBL
Match: A0A6J1GVZ8 (uncharacterized protein LOC111458050 OS=Cucurbita moschata OX=3662 GN=LOC111458050 PE=4 SV=1)

HSP 1 Score: 177.9 bits (450), Expect = 2.5e-41
Identity = 98/127 (77.17%), Positives = 108/127 (85.04%), Query Frame = 0

Query: 1   MAQSNEQEEGQSAA----GSRSGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRL 60
           M Q  E EEGQ  A    G+R G RRG++FRRRRMPVARLGGKRSGR+VAWGRMLRKIRL
Sbjct: 1   MGQPYEPEEGQPGAGLRRGTRRGTRRGMRFRRRRMPVARLGGKRSGRLVAWGRMLRKIRL 60

Query: 61  RWVKLKCMAMVKKIKKYYTKLMKDIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHS 120
           RWVKLKC+AMVKK+K+YYT+LMKDI+E G Y    SYQHRLLLETSFAIPILGVSLSTHS
Sbjct: 61  RWVKLKCIAMVKKMKEYYTRLMKDIIEAGAYA--RSYQHRLLLETSFAIPILGVSLSTHS 120

Query: 121 SILATTS 124
           S+LA TS
Sbjct: 121 SVLAATS 125

BLAST of Cla97C07G141420 vs. ExPASy TrEMBL
Match: A0A6J1GAR0 (uncharacterized protein LOC111452314 OS=Cucurbita moschata OX=3662 GN=LOC111452314 PE=4 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 3.4e-38
Identity = 94/124 (75.81%), Positives = 104/124 (83.87%), Query Frame = 0

Query: 1   MAQSNEQEEGQSA-AGSRSGMRRGIQFRRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWV 60
           M QS+EQEEGQ A AG R   R+GIQFRRRRM VARLGGKRSGR+VAWGR LRKIRLRWV
Sbjct: 1   MPQSSEQEEGQQAGAGLRWATRQGIQFRRRRMAVARLGGKRSGRLVAWGRRLRKIRLRWV 60

Query: 61  KLKCMAMVKKIKKYYTKLMKDIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHSSIL 120
           KLKC+AMVKK+KKY   LMKD++E G YG  DSYQHRLLLETSFAIPILGVS STHS+  
Sbjct: 61  KLKCIAMVKKVKKYCASLMKDMMEAGAYG--DSYQHRLLLETSFAIPILGVSFSTHSTHA 120

Query: 121 ATTS 124
           A ++
Sbjct: 121 AASA 122

BLAST of Cla97C07G141420 vs. ExPASy TrEMBL
Match: A0A5D3DCV1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold45855G00010 PE=4 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 2.4e-36
Identity = 83/100 (83.00%), Positives = 92/100 (92.00%), Query Frame = 0

Query: 21  RRGIQFRRRR-MPVARLGGKRSGRIVAWGRMLRKIRLRWVKLKCMAMVKKIKKYYTKLMK 80
           R GIQFRRRR M VARLGGKRSGRI+AWGR++RKIRL+WVK+KC+ MVKK+KKYY KLMK
Sbjct: 14  RHGIQFRRRRKMAVARLGGKRSGRIMAWGRIVRKIRLKWVKMKCIEMVKKMKKYYKKLMK 73

Query: 81  DIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHSSIL 120
           DI+E G YGY DSYQHRLLLETSFAIPILGVSLSTHSSI+
Sbjct: 74  DIMEAGAYGYADSYQHRLLLETSFAIPILGVSLSTHSSII 113

BLAST of Cla97C07G141420 vs. ExPASy TrEMBL
Match: A0A0A0LBW7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G644780 PE=4 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 9.2e-36
Identity = 84/104 (80.77%), Positives = 93/104 (89.42%), Query Frame = 0

Query: 21  RRGIQF-RRRRMPVARLGGKRSGRIVAWGRMLRKIRLRWVKLKCMAMVKKIKKYYTKLMK 80
           R GIQF RRRRM VARLGGKRSGRIV WGR++RKIRL+WVK+KC+ MVKK+KKYY +LMK
Sbjct: 14  RCGIQFRRRRRMAVARLGGKRSGRIVGWGRIVRKIRLKWVKMKCIEMVKKMKKYYKELMK 73

Query: 81  DIVEGGVYGYPDSYQHRLLLETSFAIPILGVSLSTHSSILATTS 124
           DI+E G YGY DSYQHRLLLETSFAIPILGVSLSTHSSI A T+
Sbjct: 74  DIMEAGAYGYADSYQHRLLLETSFAIPILGVSLSTHSSIHAATA 117

BLAST of Cla97C07G141420 vs. ExPASy TrEMBL
Match: A0A6J1D3L4 (uncharacterized protein LOC111016963 OS=Momordica charantia OX=3673 GN=LOC111016963 PE=4 SV=1)

HSP 1 Score: 121.7 bits (304), Expect = 2.1e-24
Identity = 72/117 (61.54%), Positives = 88/117 (75.21%), Query Frame = 0

Query: 2   AQSNEQEEGQSAAGSRSGMRRGIQF-RRRRMPVARLGGKRSGRIVA-WGRMLRKIRLRWV 61
           +Q  E  E ++  G R   RRGIQF RRRRMPVARLGG  SG      GRM+RKIR+RWV
Sbjct: 26  SQMGEGYETENGPGLRCAARRGIQFRRRRRMPVARLGGPGSGSGPGRLGRMVRKIRVRWV 85

Query: 62  KLKCMAMVKKIKKYYTKLMKDIVEG-GVYGYPDSYQHRLLLETSFAIPILGVSLSTH 116
           KL+C+AMVKK+KK+Y K++KD++E      YP+SYQHR+LLETSFAIPIL  SLST+
Sbjct: 86  KLRCLAMVKKMKKWYRKVLKDVIEAQAAVAYPESYQHRVLLETSFAIPILPRSLSTY 142

BLAST of Cla97C07G141420 vs. TAIR 10
Match: AT1G54120.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G14060.1); Has 23 Blast hits to 23 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 63.2 bits (152), Expect = 1.7e-10
Identity = 44/105 (41.90%), Positives = 66/105 (62.86%), Query Frame = 0

Query: 18  SGMRRGIQFRRRRMPVARLGGKRSGRIVAWG-----RMLRKIRLRWVKLKCMAMVKKIKK 77
           S  RR I  RRR+  V RLGGK S   V+ G     +M+R+++LRW+KL  + +VKKIK 
Sbjct: 14  SRSRRKIHIRRRKSQVVRLGGKNSA--VSRGGFSLKKMVRRMKLRWLKLHYVRVVKKIKV 73

Query: 78  YYTKLMKDIVEGGVYGYPDSYQHRLLLE-TSFAIPILGVSLSTHS 117
           +Y  L+K+ V+ G     ++ Q R+ +E  +FA+P LG+S S+ S
Sbjct: 74  FYRNLVKEFVDAG--AELEAIQQRMAVEAAAFAVPGLGLSFSSFS 114

BLAST of Cla97C07G141420 vs. TAIR 10
Match: AT3G14060.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54120.1); Has 30 Blast hits to 30 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 30; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 53.5 bits (127), Expect = 1.4e-07
Identity = 37/94 (39.36%), Positives = 58/94 (61.70%), Query Frame = 0

Query: 21  RRGIQFRRRRMPVARLGGKRSGRIVAWG--RMLRKIRLRWVKLKCMAMVKKIKKYYTKLM 80
           RR I  RRR+  V RLGGK S    ++   +++ ++RL+W++L  + +VKKIK YY  ++
Sbjct: 20  RRKIYLRRRKPQVVRLGGKNSTPRGSFSLKKVVTRMRLKWLRLYYVRLVKKIKAYYRTIV 79

Query: 81  KDIVEGGVYGYPDSYQHRLLLET-SFAIPILGVS 112
           K+  E G      + Q R+ +ET +FA P LG+S
Sbjct: 80  KEFEEAGA----ATIQQRMTVETAAFAAPGLGLS 109

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038891072.1  6.7e-49  89.52         uncharacterized protein LOC120080455 [Benincasa hispida]
KAG6581768.1    2.1e-42  78.86         hypothetical protein SDJN03_21770, partial [Cucurbita argyrosperma subsp. sorori...
XP_022956317.1  5.2e-41  77.17         uncharacterized protein LOC111458050 [Cucurbita moschata]
KAG7018218.1    2.6e-40  77.24         hypothetical protein SDJN02_20086, partial [Cucurbita argyrosperma subsp. argyro...
XP_022948735.1  7.0e-38  75.81         uncharacterized protein LOC111452314 [Cucurbita moschata] >XP_023523769.1 unchar...

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1GVZ8      2.5e-41  77.17         uncharacterized protein LOC111458050 OS=Cucurbita moschata OX=3662 GN=LOC1114580...
A0A6J1GAR0      3.4e-38  75.81         uncharacterized protein LOC111452314 OS=Cucurbita moschata OX=3662 GN=LOC1114523...
A0A5D3DCV1      2.4e-36  83.00         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0LBW7      9.2e-36  80.77         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G644780 PE=4 SV=1
A0A6J1D3L4      2.1e-24  61.54         uncharacterized protein LOC111016963 OS=Momordica charantia OX=3673 GN=LOC111016...

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT1G54120.1     1.7e-10  41.90         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G14060.1     1.4e-07  39.36         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term    Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 1..24
None      No IPR available  PANTHER      PTHR34788:SF4  F15I1.22             coord: 1..118
None      No IPR available  PANTHER      PTHR34788      F15I1.22             coord: 1..118

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C07G141420.1  Cla97C07G141420.1  mRNA