ClCG03G001160 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG03G001160
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Encodes a protein involved in salt tolerance, named SIS (Salt Induced Serine rich).
Location: CG_Chr03: 1131342 .. 1137123 (-)
RNA-Seq Expression: ClCG03G001160
Synteny: ClCG03G001160
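The location field above can be read programmatically. The following is a minimal sketch, not part of the database's own tooling: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into its parts plus the span length."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("CG_Chr03: 1131342 .. 1137123 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 5782 bp on the minus strand of CG_Chr03.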
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCCACCTAGTGATGATGGCACATGGGTGACAAAGCAGCAGCTATTGCCTTCTTCTTATAAATAGTGAGCTGAGGAGTCAATTTAGACATATCTCATTCTTAGCAGGTTAGCTGATTTAGTAGATCTTACAAGCATACTTTCTTGTTTTCATTTCTTACTCCGCTGATATCTTTCAATAGGGTGTGGTGGAAAAGGGGAGTCATTTTTTTGGGTAAGAAGGGAAGATTGGGAGAAAATGGAGGGAAAGTCTAAAGGGTATCAAGCCTCCTCTTTCGTTGCCGATCTTTTCGATGTCAAGGAGCCGCCATTGTCATCCACATCCGGAGTCTTTGCAGCAATCTTTCCATCTCCACAGAAGGTAATGCTGCTGTTTAACTGCTCGTTTCTATGTAGAACTTTCTTTCTCCTTTCCAAATAGGTTCAGATTTTGTCATAGAATTCCAAATGACTTGGAAGATAACACATACATCGAGGTGTTATGTGCTGAGGATTTGGAATTTTTGCTGCACAATGTTAAAAGGATGATTTTTGAAACTTATGTTGCCTTTTCTAAAGTTCATTGCTTAAGGTTCCCCTTATTGGCAATGATATATTTTGTCCTGAATTCCCTACTTTGACCAGTATTGGAAGTTTTGTTGCAGGAATCTCATGTAAAGTTTCTATCGCAAACCTTCACATATCCTTCATTTTAGTTCTTATATTTCCAAATATTTGGTTCTGATCATTAAACTTCAATAAATCTTCACCAATTGTTACAATTATATTTAGCAGAGATCTTTGTCATCCGTCACCATTTTTGGTATCAAACTGGATTTATGTGACACTAAATTTGAGTCCGCATGTGTACTCAAATGTAGTATGAAAATTTATTGTTAAGGACTAAAATGAACTTTTACTTCAAAAATAAAATAGGACGTTCAAGCTACAGGGAGGGACCACCATGAAGACTAAAATAAGATTTATACATGTTATTGCTTCCATTTCATACTCTTCCTTCTTTATTTTACTTTTACATTTTTGAAAAATGTTAATTATATGGACCTGGATGGTTCAGGTGCCACAGAGCCATTGTCAAAGCATCATTTTTATAACACTATTATTTTTCTAATTTTCAGGAGGGAGGCAGGAATTCTTCTAGCTCTGGGGATTGGCTAAAACAAACCAATGGAAATCAACCACGCTACACCAGACAAGGAAATTCAGGTGTGTGAATTGAAATATCAATCCCGTTAAAACTGATGAACAGAAGGGTATGATATGCAAATTCACAAACCACAGGAATGGATCTTTTGAAATCATTTAAAAAATATATATGAGTATATATATCATTTACAAAATTCCGAAACTGCAAGGGTATAATTTATAACTATTTTTAACAATAAAGAAGATAAGAGCAAAAGGAAGGAAAACCACAGGAGTATAATTCTCAATTTTACACTGTTCCCAAGAAACAAGAAATCAAGTTATACCTAACAAAAATCAGATCTGATAATTTCAAGGAGGGAGCTTGGAGCCTTGTCATCTGAGTTCATCTCTATATTATGGAGGACAAGATGGCTACTCCCAGGCCCCATCAGCTGGACCATCCCCACCCCCACCCCCACCCCCCACTGTGAGTATACTTTGTTGATCAATTTGGATCATTGTTAAAGATATGTGCTATCTGTTGTAACTTATATGATTTCATCAAACTGATCAAAAACTGAAAGTAAAATTGATTGTTCTTCTGCTGTTGCCAATTCATTGATGACCCAGATGAAGAAAAGTGGGGGAGAAGATGATCCAAATGGAAGCAACTCTCAACCTGCTTCTAGGGGAAATTGGTGGCAAGGTACTAATAATGTAATGGACGACGTTGTAGGAATCTGTTGACTCATCCGTTTTAGTCTCTGATTTCAAATACTTTTGTAGGTTCTCTTTATTATTAGAACGTCTGCCTTGCCTCGGTGGCACATGAATTCCCACCTATTTTCTAAGGTATGTTGGTGCCATTGATTACT
TATCTTATGATGTCATCTTGTCGGCATAAATGTATATTTCAAAGCCTTTTTTGTTAGTTATCATAACTCAAAGTGAATTTGAGAGGTTTATTCGAAGCTGTGTTTACTTACGGCTCATGATTAAAATGTCCTTGAAATATTAGACTCCACAATCAGTGACATTGGTTGTTTGAAAGTGAAATTTCAAACTGAGCTTGATTTGACTAAGAATAAGGTCGACCAAGAAAGGTACTTGTCTTTCTCAGTTATAAATTGATTCACAATTACTTTATTTTTATTAATTAATGATTCCATGAGCTGGGAGCATAAAAGTTAATTGAACCATAGAAAATTCAATTTGAACCAAGGTTGTTAAACTTCCATCGCCCATGTCTTTCCAGTTTTCACCCACTTGATATTATGAAACCTAGAAGCTGCTTCCTTGTGGACCCTCTAATATACCAAATAAATAGGAAAATTTCAAACATTTAGTCAAATTATCATTTAATCTATTGTTTTTATTTACTAAATCCCATAAGACTTTTCTTTGTCACTGTTGTGCGAAGATTTAGGAGTTGTTTGGTGACTTTGTTGTTTCTTATATCTGATGTTTTAGATATATGTGCATTTAATGATGTTTCCTATTTTTCTTTTCTTTTAGATGAGAAACAAAGTTTTCCTTTGTTTGTGAAAAATTACGAACAAATATATAAAATACAAAATTATTTCGCAAAAGCTGTACCTCATACGTTTCTCCTTGATGCTAACATGGGCATAGCACAACTAGCATGAATTTGTACTATCAACCTCCAAGTCTAAGGTCCAATCCCCAACCTTACATATTGTCAAAAAAAAAAAAAAAAAAAAAAAAAAAATAAAAAAAAAAAAAAAAAAAAAAAATAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAGAAAAAAAAAAAACAACGTACCTCCCTGATTTGTGTTGATAACAAATAGGTATGACTACAAGACTCTTCTTAATCCGAGTATAGCTTAGTGAATGTGGCATTAATGACTATCCTTGGTCGATGGTTCGATCCTTCACATCTGGACTAGTTGAACTTAGAGAAGAGATAAAAAGATAGATACAAAGGCTATTTATTTGTCTCGATATCAAATAAAAGCATTTAATTTTGCCCATTCCTCCCACTCCCACTATCAATCTCTTGCTTTTGTGGAAAAAAAAATCTGTCATTTCTTTCCAACCATATATTCTTTAGTGTTTGGTTTAATAGCATTGACCAATAGAATCTGCTTTCTAGTTTAAAGGATGGATGCAAAGCATTTGAACCATATTTCCACCAAAAGGGAAGTGGAAGCCCAATCCAAACATTTATATGAGAGAGACCTAACTCTTCATGCTATAAGTTAAAAGGAATAAAAGATGATCCAATTCTTATATTTGAAGAAATGTTTCTACAATTATATATAGATTTCAGGGTCAAAAGTAGAATTCTCTTCCTGCTTTTCAATTCTTAGAGATATTACTAAATATATATATGATATATCATATCATTTCTTAGAACTTTCAAAATGTGCAACAGATAAAAGAAAAAAGCTTAGATGACAGTAAAACCTTCATCTTTTAGCTTTAAGAGTTTTTATTAATCTTGAAATCATGGGTTGGCACATCCGTTGGTTTGGTCGGTTGGTTTTTATTAATGTGATGTGATGGTTTGAGGGTGAAGGGAGTGACGTAATCTGTATCGGATACTAACTTTGGTTTTGGTCGGTTTGG
TCGGTTTACTTTGTTCCCCTATCATATACATTTGTATTCAACACATTTTGTTTATGTGGTAAAGGGATAAGGCTTCTCACCTTGAAATGCTTTCTCTTTGGGTTCAGGAATTCATAAAGCAGCAGAGCAGATGAAGCTTTGAAGTTGGATTTACATTTTACATTACATATTGTGTATACTATTTCAAATTTCTTGTTTGGTTTTTTGGTTTTTACATTTGAAAATGGTGTGGAGTTTTCTGTATTTTCTTTTCTCTGATTTCTCTTTTATAAAAAGCTTTGCTGTTTCTTATATTTGCTGTTATGATGGAAAATCCACTTGAACAGTCCTTCCAAAGCATTCCCTAAAAATGATATTCAGGGATAAATGTAAATTCAGTCCTTTGCCTGAAAATAGAACTCCTTGATGTTTCAAATAGTCATAATTAAATCATGAATACTGTTTTTATACTGATTTAAGTTTCTCAAGATCGAGAATTTGGTTTTATTTTAGTGCATGCAAATAAAAGTAAAATAAGATTCTAATTTATGTAATAGTCTTTTACATTGTTTAATCTTAATTGAATAGTTATAACTATTTGAAGAGAAGAACAAAATGAGAGTTTATGAAATTATAAAGGACAATATTTTAAAGAATTGTAAATATAGCAAAATTTACTATGAAGTAGCAAGTTTAGATATTAGGCCAAAGGTCAAATCTTTGGCAAACAAAGCTTCTATGTTCAAATGGACAATCCAAAGCTTTCTATTACAACACTAAACTAGGTATTTATAATATTGCAAGAAAATAAGACAATTAACTAATTAAACATTAATTAATCTAAATTAATAACTAAACTAAATATTAATTAAAATATACTAATTATTCTACTATATCTACTAATAATTAGGCCATGATTCTTAAATTTGCAACTACCTATTTTCCTAAAATCTCTCACAAACTCACAACCTCATTTTAAATGCTAAAAAACATTATCCTCTTTTAACTTTTATTTATTTAAATGAACAAAAGAAAAAGAGATTCATAATACTTTTACAAATGAATTTCCTACTACAATCACGAACAACTTAAAAAACTCTTACGATTTACAATATATATATATATATATAAAAGTTATAACCACAATAAATTAATGAACTAAATTCTTAAAAAAGAAAAACTAAGTATTGCTATAAAAATCTTAAGAAAAATTGTAAACGGGAGAGATGGAAAATGCTAGCGCGTGAAGGCGTGAATAACAAGCGCGATGTAAATGAAAAGCGCCAAAGGAACCAAACTCACCAACGGCACGGCGGCCGCCACCGTCTCCCACCGCCGGCAACCGAAGATCCCCAATTTTATCTGAACCAAATTCAGCAACGCCGCCACCAAAAACCCACACCCGAGCACCGACCCGACTGCAGATGCCAACATCCCCACGCGCAGAGCCGGCGCGTGGGCCACACCCTGCCCTCCGTTCCCGCCGGCGATGATTCTGATTGCTTGCTTCAGTGCCGACGCGACCAGGCTGGAGAAGAGGAAGCAGCTGAAGGAGTACACATGGCAGGCAACGAGGCTCTGGGCGACAGAATCGGCGGCGGCGCAGGGGCCGTCGTCGTCGTCGGGGAGGAGGTTGGCGGCGGGGTTGGCGGTTGGGTACCAGGCGACACCGAGGAAGACGGCGAAGGTGAAAAGGGAGTTGACGTTGACGATGCCGTCGAGGGCCAAGATGTGGATTCTGGTGGAGGTGGATCGCCGCCGTGGAACGACGGACATGGCGGTGGAGAGGGAGGTAGTAAATGA

mRNA sequence

CCCACCTAGTGATGATGGCACATGGGTGACAAAGCAGCAGCTATTGCCTTCTTCTTATAAATAGTGAGCTGAGGAGTCAATTTAGACATATCTCATTCTTAGCAGGGTGTGGTGGAAAAGGGGAGTCATTTTTTTGGGTAAGAAGGGAAGATTGGGAGAAAATGGAGGGAAAGTCTAAAGGGTATCAAGCCTCCTCTTTCGTTGCCGATCTTTTCGATGTCAAGGAGCCGCCATTGTCATCCACATCCGGAGTCTTTGCAGCAATCTTTCCATCTCCACAGAAGGAGGGAGGCAGGAATTCTTCTAGCTCTGGGGATTGGCTAAAACAAACCAATGGAAATCAACCACGCTACACCAGACAAGGAAATTCAGGAGGGAGCTTGGAGCCTTGTCATCTGAGTTCATCTCTATATTATGGAGGACAAGATGGCTACTCCCAGGCCCCATCAGCTGGACCATCCCCACCCCCACCCCCACCCCCCACTATGAAGAAAAGTGGGGGAGAAGATGATCCAAATGGAAGCAACTCTCAACCTGCTTCTAGGGGAAATTGGTGGCAAGCGCGTGAAGGCGTGAATAACAAGCGCGATGTAAATGAAAAGCGCCAAAGGAACCAAACTCACCAACGGCACGGCGGCCGCCACCGTCTCCCACCGCCGGCAACCGAAGATCCCCAATTTTATCTGAACCAAATTCAGCAACGCCGCCACCAAAAACCCACACCCGAGCACCGACCCGACTGCAGATGCCAACATCCCCACGCGCAGAGCCGGCGCGTGGGCCACACCCTGCCCTCCGTTCCCGCCGGCGATGATTCTGATTGCTTGCTTCAGTGCCGACGCGACCAGGCTGGAGAAGAGGAAGCAGCTGAAGGAGTACACATGGCAGGCAACGAGGCTCTGGGCGACAGAATCGGCGGCGGCGCAGGGGCCGTCGTCGTCGTCGGGGAGGAGGTTGGCGGCGGGGTTGGCGGTTGGGTACCAGGCGACACCGAGGAAGACGGCGAAGGTGAAAAGGGAGTTGACGTTGACGATGCCGTCGAGGGCCAAGATGTGGATTCTGGTGGAGGTGGATCGCCGCCGTGGAACGACGGACATGGCGGTGGAGAGGGAGGTAGTAAATGA

Coding sequence (CDS)

ATGGAGGGAAAGTCTAAAGGGTATCAAGCCTCCTCTTTCGTTGCCGATCTTTTCGATGTCAAGGAGCCGCCATTGTCATCCACATCCGGAGTCTTTGCAGCAATCTTTCCATCTCCACAGAAGGAGGGAGGCAGGAATTCTTCTAGCTCTGGGGATTGGCTAAAACAAACCAATGGAAATCAACCACGCTACACCAGACAAGGAAATTCAGGAGGGAGCTTGGAGCCTTGTCATCTGAGTTCATCTCTATATTATGGAGGACAAGATGGCTACTCCCAGGCCCCATCAGCTGGACCATCCCCACCCCCACCCCCACCCCCCACTATGAAGAAAAGTGGGGGAGAAGATGATCCAAATGGAAGCAACTCTCAACCTGCTTCTAGGGGAAATTGGTGGCAAGCGCGTGAAGGCGTGAATAACAAGCGCGATGTAAATGAAAAGCGCCAAAGGAACCAAACTCACCAACGGCACGGCGGCCGCCACCGTCTCCCACCGCCGGCAACCGAAGATCCCCAATTTTATCTGAACCAAATTCAGCAACGCCGCCACCAAAAACCCACACCCGAGCACCGACCCGACTGCAGATGCCAACATCCCCACGCGCAGAGCCGGCGCGTGGGCCACACCCTGCCCTCCGTTCCCGCCGGCGATGATTCTGATTGCTTGCTTCAGTGCCGACGCGACCAGGCTGGAGAAGAGGAAGCAGCTGAAGGAGTACACATGGCAGGCAACGAGGCTCTGGGCGACAGAATCGGCGGCGGCGCAGGGGCCGTCGTCGTCGTCGGGGAGGAGGTTGGCGGCGGGGTTGGCGGTTGGGTACCAGGCGACACCGAGGAAGACGGCGAAGGTGAAAAGGGAGTTGACGTTGACGATGCCGTCGAGGGCCAAGATGTGGATTCTGGTGGAGGTGGATCGCCGCCGTGGAACGACGGACATGGCGGTGGAGAGGGAGGTAGTAAATGA

Protein sequence

MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGNQPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGGEDDPNGSNSQPASRGNWWQAREGVNNKRDVNEKRQRNQTHQRHGGRHRLPPPATEDPQFYLNQIQQRRHQKPTPEHRPDCRCQHPHAQSRRVGHTLPSVPAGDDSDCLLQCRRDQAGEEEAAEGVHMAGNEALGDRIGGGAGAVVVVGEEVGGGVGGWVPGDTEEDGEGEKGVDVDDAVEGQDVDSGGGGSPPWNDGHGGGEGGSK
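The protein sequence corresponds to a translation of the CDS. As a rough illustration, the sketch below translates a nucleotide string with the standard genetic code and stops at the first stop codon. The helper name `translate` is hypothetical, and the choice of the standard code (NCBI translation table 1) is an assumption; the page does not say which table was used.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 nt and last 12 nt of the CDS listed above.
print(translate("ATGGAGGGAAAGTCTAAAGGG"))
print(translate("GGTAGTAAATGA"))
```

The two fragments translate to MEGKSKG and GSK, which agree with the start and end of the protein sequence shown above.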
Homology
BLAST of ClCG03G001160 vs. NCBI nr
Match: XP_038894691.1 (uncharacterized protein LOC120083160 [Benincasa hispida] >XP_038894692.1 uncharacterized protein LOC120083160 [Benincasa hispida] >XP_038894693.1 uncharacterized protein LOC120083160 [Benincasa hispida])

HSP 1 Score: 238.4 bits (607), Expect = 8.4e-59
Identity = 124/133 (93.23%), Positives = 126/133 (94.74%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIF SPQK  GRNSSSSGDWLKQTNGN
Sbjct: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFSSPQKGRGRNSSSSGDWLKQTNGN 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGGEDDPNG 120
           QPR+TRQGNS GSLEPCHLSSSLYYGGQDGYSQAPSAGPS  PP PPTMKKSGGEDDPNG
Sbjct: 61  QPRHTRQGNS-GSLEPCHLSSSLYYGGQDGYSQAPSAGPS--PPSPPTMKKSGGEDDPNG 120

Query: 121 SNSQPASRGNWWQ 134
           +NSQPASRGNWWQ
Sbjct: 121 NNSQPASRGNWWQ 130
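The percentages on the HSP statistics line follow directly from the reported fractions (matching positions over alignment length). A small sketch, assuming the line layout shown above; the function name `hsp_fractions` is hypothetical and this is not a parser for BLAST output in general.

```python
import re

def hsp_fractions(line):
    """Return {label: percent} for every 'Label = n/m' pair on an HSP line."""
    out = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        out[label] = round(100 * int(num) / int(den), 2)
    return out

stats = hsp_fractions(
    "Identity = 124/133 (93.23%), Positives = 126/133 (94.74%), Query Frame = 0"
)
print(stats)
```

For the first hit this reproduces 93.23% identity and 94.74% positives; the `Query Frame = 0` field is skipped because it is not a fraction.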

BLAST of ClCG03G001160 vs. NCBI nr
Match: XP_008457434.1 (PREDICTED: uncharacterized protein LOC103497123 [Cucumis melo] >KAA0031766.1 uncharacterized protein E6C27_scaffold848G00050 [Cucumis melo var. makuwa] >TYJ97366.1 uncharacterized protein E5676_scaffold194G001770 [Cucumis melo var. makuwa])

HSP 1 Score: 233.0 bits (593), Expect = 3.5e-57
Identity = 119/134 (88.81%), Positives = 122/134 (91.04%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQ SSFVADLFDVKE PLSS SG FA IFPSPQK  GRNSSSS DWLKQTNGN
Sbjct: 1   MEGKSKGYQTSSFVADLFDVKEAPLSSASGAFATIFPSPQKGAGRNSSSSVDWLKQTNGN 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGG-EDDPN 120
           QP +TRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSP PPPP TMKKSGG +DDPN
Sbjct: 61  QPHHTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPLPPPPHTMKKSGGQQDDPN 120

Query: 121 GSNSQPASRGNWWQ 134
           G+NSQPASRGNWWQ
Sbjct: 121 GNNSQPASRGNWWQ 134

BLAST of ClCG03G001160 vs. NCBI nr
Match: XP_022970583.1 (uncharacterized protein LOC111469517 [Cucurbita maxima] >XP_023519303.1 uncharacterized protein LOC111782742 [Cucurbita pepo subsp. pepo] >XP_023519304.1 uncharacterized protein LOC111782742 [Cucurbita pepo subsp. pepo] >XP_023519305.1 uncharacterized protein LOC111782742 [Cucurbita pepo subsp. pepo] >XP_023521040.1 uncharacterized protein LOC111784637 [Cucurbita pepo subsp. pepo] >XP_023521041.1 uncharacterized protein LOC111784637 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 230.3 bits (586), Expect = 2.3e-56
Identity = 117/133 (87.97%), Positives = 122/133 (91.73%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQASSFVADLFDVKEPP +S+S VFAAIFPSPQK GGRNSSSSGDWLKQ NGN
Sbjct: 1   MEGKSKGYQASSFVADLFDVKEPPSTSSSEVFAAIFPSPQKGGGRNSSSSGDWLKQANGN 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGGEDDPNG 120
           QP + RQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS  P P PT+KKSGGEDDPNG
Sbjct: 61  QPSHARQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS--PSPAPTLKKSGGEDDPNG 120

Query: 121 SNSQPASRGNWWQ 134
           +N QPASRGNWWQ
Sbjct: 121 NNFQPASRGNWWQ 131

BLAST of ClCG03G001160 vs. NCBI nr
Match: XP_004153966.1 (uncharacterized protein LOC101213402 [Cucumis sativus] >XP_031741189.1 uncharacterized protein LOC116403771 [Cucumis sativus] >XP_031741191.1 uncharacterized protein LOC116403771 [Cucumis sativus] >XP_031741196.1 uncharacterized protein LOC101213402 [Cucumis sativus] >KGN65759.1 hypothetical protein Csa_023354 [Cucumis sativus])

HSP 1 Score: 229.6 bits (584), Expect = 3.9e-56
Identity = 118/134 (88.06%), Positives = 122/134 (91.04%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQASSFVADLFDVKE PLSS SG FA IFPSPQK  GRNSSSS DWLKQTNG+
Sbjct: 1   MEGKSKGYQASSFVADLFDVKEAPLSSASGAFATIFPSPQKGAGRNSSSSVDWLKQTNGS 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGG-EDDPN 120
           QP +TRQGNSGGSLEPCHLSSSLYYGGQDGYSQA SAGPSP PPPP TMKKSGG +DDPN
Sbjct: 61  QPHHTRQGNSGGSLEPCHLSSSLYYGGQDGYSQATSAGPSPLPPPPHTMKKSGGQQDDPN 120

Query: 121 GSNSQPASRGNWWQ 134
           G+NSQPASRGNWWQ
Sbjct: 121 GNNSQPASRGNWWQ 134

BLAST of ClCG03G001160 vs. NCBI nr
Match: XP_022964980.1 (uncharacterized protein LOC111464929 [Cucurbita moschata] >XP_022964981.1 uncharacterized protein LOC111464929 [Cucurbita moschata] >XP_022964982.1 uncharacterized protein LOC111464929 [Cucurbita moschata])

HSP 1 Score: 226.5 bits (576), Expect = 3.3e-55
Identity = 116/133 (87.22%), Positives = 121/133 (90.98%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQASSFVADLFDVKEPP +S+S VFAAIFPSPQK GGRNSSSSGDWLKQ NGN
Sbjct: 1   MEGKSKGYQASSFVADLFDVKEPPSTSSSEVFAAIFPSPQKGGGRNSSSSGDWLKQANGN 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGGEDDPNG 120
           QP + RQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAG S  P P PT+KKSGGEDDPNG
Sbjct: 61  QPSHARQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGLS--PSPAPTLKKSGGEDDPNG 120

Query: 121 SNSQPASRGNWWQ 134
           +N QPASRGNWWQ
Sbjct: 121 NNFQPASRGNWWQ 131

BLAST of ClCG03G001160 vs. ExPASy TrEMBL
Match: A0A5D3BEB7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001770 PE=4 SV=1)

HSP 1 Score: 233.0 bits (593), Expect = 1.7e-57
Identity = 119/134 (88.81%), Positives = 122/134 (91.04%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQ SSFVADLFDVKE PLSS SG FA IFPSPQK  GRNSSSS DWLKQTNGN
Sbjct: 1   MEGKSKGYQTSSFVADLFDVKEAPLSSASGAFATIFPSPQKGAGRNSSSSVDWLKQTNGN 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGG-EDDPN 120
           QP +TRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSP PPPP TMKKSGG +DDPN
Sbjct: 61  QPHHTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPLPPPPHTMKKSGGQQDDPN 120

Query: 121 GSNSQPASRGNWWQ 134
           G+NSQPASRGNWWQ
Sbjct: 121 GNNSQPASRGNWWQ 134

BLAST of ClCG03G001160 vs. ExPASy TrEMBL
Match: A0A1S3C5M6 (uncharacterized protein LOC103497123 OS=Cucumis melo OX=3656 GN=LOC103497123 PE=4 SV=1)

HSP 1 Score: 233.0 bits (593), Expect = 1.7e-57
Identity = 119/134 (88.81%), Positives = 122/134 (91.04%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQ SSFVADLFDVKE PLSS SG FA IFPSPQK  GRNSSSS DWLKQTNGN
Sbjct: 1   MEGKSKGYQTSSFVADLFDVKEAPLSSASGAFATIFPSPQKGAGRNSSSSVDWLKQTNGN 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGG-EDDPN 120
           QP +TRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSP PPPP TMKKSGG +DDPN
Sbjct: 61  QPHHTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPLPPPPHTMKKSGGQQDDPN 120

Query: 121 GSNSQPASRGNWWQ 134
           G+NSQPASRGNWWQ
Sbjct: 121 GNNSQPASRGNWWQ 134

BLAST of ClCG03G001160 vs. ExPASy TrEMBL
Match: A0A6J1I4B7 (uncharacterized protein LOC111469517 OS=Cucurbita maxima OX=3661 GN=LOC111469517 PE=4 SV=1)

HSP 1 Score: 230.3 bits (586), Expect = 1.1e-56
Identity = 117/133 (87.97%), Positives = 122/133 (91.73%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQASSFVADLFDVKEPP +S+S VFAAIFPSPQK GGRNSSSSGDWLKQ NGN
Sbjct: 1   MEGKSKGYQASSFVADLFDVKEPPSTSSSEVFAAIFPSPQKGGGRNSSSSGDWLKQANGN 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGGEDDPNG 120
           QP + RQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS  P P PT+KKSGGEDDPNG
Sbjct: 61  QPSHARQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPS--PSPAPTLKKSGGEDDPNG 120

Query: 121 SNSQPASRGNWWQ 134
           +N QPASRGNWWQ
Sbjct: 121 NNFQPASRGNWWQ 131

BLAST of ClCG03G001160 vs. ExPASy TrEMBL
Match: A0A0A0LXG9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G525300 PE=4 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 1.9e-56
Identity = 118/134 (88.06%), Positives = 122/134 (91.04%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQASSFVADLFDVKE PLSS SG FA IFPSPQK  GRNSSSS DWLKQTNG+
Sbjct: 1   MEGKSKGYQASSFVADLFDVKEAPLSSASGAFATIFPSPQKGAGRNSSSSVDWLKQTNGS 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGG-EDDPN 120
           QP +TRQGNSGGSLEPCHLSSSLYYGGQDGYSQA SAGPSP PPPP TMKKSGG +DDPN
Sbjct: 61  QPHHTRQGNSGGSLEPCHLSSSLYYGGQDGYSQATSAGPSPLPPPPHTMKKSGGQQDDPN 120

Query: 121 GSNSQPASRGNWWQ 134
           G+NSQPASRGNWWQ
Sbjct: 121 GNNSQPASRGNWWQ 134

BLAST of ClCG03G001160 vs. ExPASy TrEMBL
Match: A0A6J1HJ44 (uncharacterized protein LOC111464929 OS=Cucurbita moschata OX=3662 GN=LOC111464929 PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 1.6e-55
Identity = 116/133 (87.22%), Positives = 121/133 (90.98%), Query Frame = 0

Query: 1   MEGKSKGYQASSFVADLFDVKEPPLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLKQTNGN 60
           MEGKSKGYQASSFVADLFDVKEPP +S+S VFAAIFPSPQK GGRNSSSSGDWLKQ NGN
Sbjct: 1   MEGKSKGYQASSFVADLFDVKEPPSTSSSEVFAAIFPSPQKGGGRNSSSSGDWLKQANGN 60

Query: 61  QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGGEDDPNG 120
           QP + RQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAG S  P P PT+KKSGGEDDPNG
Sbjct: 61  QPSHARQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGLS--PSPAPTLKKSGGEDDPNG 120

Query: 121 SNSQPASRGNWWQ 134
           +N QPASRGNWWQ
Sbjct: 121 NNFQPASRGNWWQ 131

BLAST of ClCG03G001160 vs. TAIR 10
Match: AT5G59080.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to oxidative stress; LOCATED IN: chloroplast; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G46880.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 91.7 bits (226), Expect = 1.2e-18
Identity = 62/139 (44.60%), Positives = 83/139 (59.71%), Query Frame = 0

Query: 1   MEGK----SKGYQASSFVADLFDVKEP-PLSSTSGVFAAIFPSPQKEGGRNSSSSGDWLK 60
           MEGK    S    +SSF A+LF  K+P P SS+SG+F+ +FP P K   R+ S+S     
Sbjct: 1   MEGKGRVGSSSSTSSSFTAELFGSKDPSPPSSSSGIFSTMFPHPSKGSARDGSNS----- 60

Query: 61  QTNGNQPRYTRQGNS-GGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGG 120
             +G+Q +     N+    +EPCHLSSSLYYGGQD Y+++ +   +   PP    ++  G
Sbjct: 61  -KHGSQAQRRESLNAQEDRVEPCHLSSSLYYGGQDVYARSTT---NQTYPPVKNDRRRSG 120

Query: 121 EDDPNGSNSQPASRGNWWQ 134
           EDD NG N Q  SRGNWWQ
Sbjct: 121 EDDANGQNPQDVSRGNWWQ 130

BLAST of ClCG03G001160 vs. TAIR 10
Match: AT5G02020.1 (Encodes a protein involved in salt tolerance, named SIS (Salt Induced Serine rich).)

HSP 1 Score: 76.6 bits (187), Expect = 3.9e-14
Identity = 62/155 (40.00%), Positives = 79/155 (50.97%), Query Frame = 0

Query: 1   MEGKSK------GYQASSFVADLFDVKEPPLS-STSGVFAAIFPSPQKEGGRNS-----S 60
           MEG+ K         +SS  ++LF  +E P S S+SG+  +IFP P K  GR S      
Sbjct: 1   MEGRKKKASSSSPCSSSSLTSELFGSRENPSSPSSSGILGSIFPPPSKVLGRESVRQETV 60

Query: 61  SSGDW---LKQTNGNQPRYTRQGNSGGS-------LEPCHLSSSLYYGGQDGYSQAPSAG 120
           + G W     +T GN  R   Q  + GS       ++PCHLSSS+YYGG D Y Q  ++ 
Sbjct: 61  TGGCWNEKTSKTGGNVDRNREQQENHGSGYQQDQRVQPCHLSSSIYYGGPDVYFQPQNST 120

Query: 121 PSPPPPPPPTMKKSGGEDDPNGSNSQPASRGNWWQ 134
            +       T KK GGEDD     S  ASRGNWWQ
Sbjct: 121 SN------STNKKDGGEDD-----SGSASRGNWWQ 144

BLAST of ClCG03G001160 vs. TAIR 10
Match: AT5G02020.2 (Encodes a protein involved in salt tolerance, named SIS (Salt Induced Serine rich).)

HSP 1 Score: 59.3 bits (142), Expect = 6.5e-09
Identity = 45/115 (39.13%), Positives = 59/115 (51.30%), Query Frame = 0

Query: 1   MEGKSK------GYQASSFVADLFDVKEPPLS-STSGVFAAIFPSPQKEGGRNS-----S 60
           MEG+ K         +SS  ++LF  +E P S S+SG+  +IFP P K  GR S      
Sbjct: 1   MEGRKKKASSSSPCSSSSLTSELFGSRENPSSPSSSGILGSIFPPPSKVLGRESVRQETV 60

Query: 61  SSGDW---LKQTNGNQPRYTRQGNSGGS-------LEPCHLSSSLYYGGQDGYSQ 94
           + G W     +T GN  R   Q  + GS       ++PCHLSSS+YYGG D Y Q
Sbjct: 61  TGGCWNEKTSKTGGNVDRNREQQENHGSGYQQDQRVQPCHLSSSIYYGGPDVYFQ 115

BLAST of ClCG03G001160 vs. TAIR 10
Match: AT2G39855.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55646.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 56.2 bits (134), Expect = 5.5e-08
Identity = 46/119 (38.66%), Positives = 59/119 (49.58%), Query Frame = 0

Query: 26  SSTSGVFAAIFPSPQ--KEGGRNSSSSGDWLKQTNGNQP--RYTRQGN-------SGGSL 85
           SST+G+F +IFP P    +G   S +     + TN   P  R  R  N       S  + 
Sbjct: 32  SSTTGLFKSIFPPPSAVTQGNLTSRNGAAKYQPTNFETPNERGERSKNKERKSYQSEETQ 91

Query: 86  EPCHLSSSLYYGGQDGYSQAPSAGPSPPPPPPPTMKKSGGEDDPNGSNSQPASRGNWWQ 134
            PC+LSSS+YYGGQD YS + +         P   KK G E D     S+ ASRGNWW+
Sbjct: 92  PPCNLSSSIYYGGQDNYSSSTT--------NPDAYKKDGEEGD-----SESASRGNWWE 137

BLAST of ClCG03G001160 vs. TAIR 10
Match: AT3G55646.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G39855.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 48.1 bits (113), Expect = 1.5e-05
Identity = 53/149 (35.57%), Positives = 70/149 (46.98%), Query Frame = 0

Query: 2   EGKSKGYQASSFVADL--FD------VKEPPLSSTSGVFAAIFPSPQKEG-GR--NSSSS 61
           + K K   ASS  + L  FD      V     SS +G+F +IFP P  +  GR  + +S 
Sbjct: 5   KNKKKIVSASSSSSSLSSFDHIFGPRVSSSSSSSATGLFKSIFPPPSADQLGRQVDFASQ 64

Query: 62  GDWLKQTNGN------QPRYTRQGNSGGSLEPCHLSSSLYYGGQDGYSQAPSAGPSPPPP 121
           G  +K  + N        +  +   +  +  PCHLSSSLYYGGQ+ YS       S    
Sbjct: 65  GGHVKYQSPNAKGERSNKKEKKSYYNEETEPPCHLSSSLYYGGQETYS-------STTTT 124

Query: 122 PPPTMKKSGGEDDPNGSNSQPASRGNWWQ 134
              T KK G E D     S+ ASRGNWW+
Sbjct: 125 THDTYKKDGEEGD-----SKRASRGNWWE 141

The following BLAST results are available for this feature:
Match Name        E-value    Identity (%)   Description
XP_038894691.1    8.4e-59    93.23          uncharacterized protein LOC120083160 [Benincasa hispida] >XP_038894692.1 unchara...
XP_008457434.1    3.5e-57    88.81          PREDICTED: uncharacterized protein LOC103497123 [Cucumis melo] >KAA0031766.1 unc...
XP_022970583.1    2.3e-56    87.97          uncharacterized protein LOC111469517 [Cucurbita maxima] >XP_023519303.1 uncharac...
XP_004153966.1    3.9e-56    88.06          uncharacterized protein LOC101213402 [Cucumis sativus] >XP_031741189.1 uncharact...
XP_022964980.1    3.3e-55    87.22          uncharacterized protein LOC111464929 [Cucurbita moschata] >XP_022964981.1 unchar...

Match Name        E-value    Identity (%)   Description
A0A5D3BEB7        1.7e-57    88.81          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3C5M6        1.7e-57    88.81          uncharacterized protein LOC103497123 OS=Cucumis melo OX=3656 GN=LOC103497123 PE=...
A0A6J1I4B7        1.1e-56    87.97          uncharacterized protein LOC111469517 OS=Cucurbita maxima OX=3661 GN=LOC111469517...
A0A0A0LXG9        1.9e-56    88.06          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G525300 PE=4 SV=1
A0A6J1HJ44        1.6e-55    87.22          uncharacterized protein LOC111464929 OS=Cucurbita moschata OX=3662 GN=LOC1114649...

Match Name        E-value    Identity (%)   Description
AT5G59080.1       1.2e-18    44.60          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response...
AT5G02020.1       3.9e-14    40.00          Encodes a protein involved in salt tolerance, named SIS (Salt Induced Serine ric...
AT5G02020.2       6.5e-09    39.13          Encodes a protein involved in salt tolerance, named SIS (Salt Induced Serine ric...
AT2G39855.2       5.5e-08    38.66          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G55646.1       1.5e-05    35.57          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 36..218
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 41..77
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 115..134
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 183..200
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 96..111
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 266..320
None      No IPR available  PANTHER      PTHR33738    EMB|CAB82975.1       coord: 1..134
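Several of the mobidb-lite disorder predictions overlap, so the number of residues they cover is less than the sum of the individual ranges. A minimal sketch of merging them, assuming the coordinates are 1-based inclusive residue positions; the function name `merge_intervals` is hypothetical.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# mobidb-lite coordinates from the InterPro table above.
disorder = [(36, 218), (41, 77), (115, 134), (183, 200), (96, 111), (266, 320)]
merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

With these inputs the six ranges collapse to two regions, 36..218 and 266..320, covering 238 residues in total.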

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG03G001160.2    ClCG03G001160.2    mRNA