ClCG01G021530 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G021530
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: DUF506 family protein
Location: CG_Chr01: 35308086 .. 35309045 (+)
RNA-Seq Expression: ClCG01G021530
Synteny: ClCG01G021530
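
As a quick sanity check on the Location field, the span of the feature follows directly from its coordinates. The sketch below is a minimal illustration, not part of the database record: the function name `parse_location` is hypothetical, and it assumes the `seqid: start .. end (strand)` layout shown above with 1-based, inclusive coordinates.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into its parts plus the span length."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("CG_Chr01: 35308086 .. 35309045 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # CG_Chr01 + 960
```

Under that assumption the coordinates imply a 960 bp genomic span on the forward strand.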
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATAGATTAGCTGCAGTCTTCCGCCACGGTACTGATGCTGCTCTATGGGACAGCGGCAGCGACCATTCGCCGGAAAACTCCACTGCCGACCTCTTCGACCTCGTTAAGTCGTTCATTGAAAGTGATGATATGGAAATTAAGGAAGGGGAGAAGGAAGATAGCTATAAAGAAAAATCAGACGGTTTCTCCTTTGATTCAGATGCAGACACAATCAAGCTACGAAATCTGTTTTGTTCCTTGGAAAATAAGAACGAGGAAATAAGATTTGGAGCAGAACAAGCTCTGAAGCTCGTTGGTGGAAGATCGTTCCCGGGGATTAAACGACAATTGATGGCACATTTGCGCAGAAAAGGCTTCGATGCTGGTGAGATTTTTTTTGTTTCTTTCACTTTGCTTTTGAGCCCATGTAGAGAAATTCAAACATCAGTTTAATTCATTATTCAATTGACTCGTTGAATTCAAGCAGGTTTATGCAAATCGAAGATGGAGAATCTCCGGTCATTTCCGGCAGGTGATCATGAGTATATCGACGTCAATTTTGGTGGAAGTCGATACATTGTAGAAATCTTTTTAGCTAGAGAATTTGAAATTGCCCGTCCAACCAGTAAATACAGATCATTACTCAACATATTTCCAGAGATTTTCGTCGGAAATTTGGAAGAGTTGAAGGGGGTGGTGAAACTGATGTGCTCTGCCATGAAAGAGTCCATGAAGAAGAGGAACATGCATGTGCCTCCATGGAGAAGAAACGGGTACATGCAGGCAAAATGGTTTGGTTCTTACAAGCGAACCACGAACCACAAAGTCTCAGGGTCAGCTGAAGCAGACACTTCCCCGACGGAAATGAGTTCACCCTGTTTTAAAGCCTACCATTGCAGGGGAGATGTCGGCCGAAACGCGAGTATCAGAGTTGGGAATTTAACGGCTGTTTTTGGTAGCAATGAGTTGCTTCTGTAG

mRNA sequence

ATGGATAGATTAGCTGCAGTCTTCCGCCACGGTACTGATGCTGCTCTATGGGACAGCGGCAGCGACCATTCGCCGGAAAACTCCACTGCCGACCTCTTCGACCTCGTTAAGTCGTTCATTGAAAGTGATGATATGGAAATTAAGGAAGGGGAGAAGGAAGATAGCTATAAAGAAAAATCAGACGGTTTCTCCTTTGATTCAGATGCAGACACAATCAAGCTACGAAATCTGTTTTGTTCCTTGGAAAATAAGAACGAGGAAATAAGATTTGGAGCAGAACAAGCTCTGAAGCTCGTTGGTGGAAGATCGTTCCCGGGGATTAAACGACAATTGATGGCACATTTGCGCAGAAAAGGCTTCGATGCTGGTTTATGCAAATCGAAGATGGAGAATCTCCGGTCATTTCCGGCAGGTGATCATGAGTATATCGACGTCAATTTTGGTGGAAGTCGATACATTGTAGAAATCTTTTTAGCTAGAGAATTTGAAATTGCCCGTCCAACCAGTAAATACAGATCATTACTCAACATATTTCCAGAGATTTTCGTCGGAAATTTGGAAGAGTTGAAGGGGGTGGTGAAACTGATGTGCTCTGCCATGAAAGAGTCCATGAAGAAGAGGAACATGCATGTGCCTCCATGGAGAAGAAACGGGTACATGCAGGCAAAATGGTTTGGTTCTTACAAGCGAACCACGAACCACAAAGTCTCAGGGTCAGCTGAAGCAGACACTTCCCCGACGGAAATGAGTTCACCCTGTTTTAAAGCCTACCATTGCAGGGGAGATGTCGGCCGAAACGCGAGTATCAGAGTTGGGAATTTAACGGCTGTTTTTGGTAGCAATGAGTTGCTTCTGTAG

Coding sequence (CDS)

ATGGATAGATTAGCTGCAGTCTTCCGCCACGGTACTGATGCTGCTCTATGGGACAGCGGCAGCGACCATTCGCCGGAAAACTCCACTGCCGACCTCTTCGACCTCGTTAAGTCGTTCATTGAAAGTGATGATATGGAAATTAAGGAAGGGGAGAAGGAAGATAGCTATAAAGAAAAATCAGACGGTTTCTCCTTTGATTCAGATGCAGACACAATCAAGCTACGAAATCTGTTTTGTTCCTTGGAAAATAAGAACGAGGAAATAAGATTTGGAGCAGAACAAGCTCTGAAGCTCGTTGGTGGAAGATCGTTCCCGGGGATTAAACGACAATTGATGGCACATTTGCGCAGAAAAGGCTTCGATGCTGGTTTATGCAAATCGAAGATGGAGAATCTCCGGTCATTTCCGGCAGGTGATCATGAGTATATCGACGTCAATTTTGGTGGAAGTCGATACATTGTAGAAATCTTTTTAGCTAGAGAATTTGAAATTGCCCGTCCAACCAGTAAATACAGATCATTACTCAACATATTTCCAGAGATTTTCGTCGGAAATTTGGAAGAGTTGAAGGGGGTGGTGAAACTGATGTGCTCTGCCATGAAAGAGTCCATGAAGAAGAGGAACATGCATGTGCCTCCATGGAGAAGAAACGGGTACATGCAGGCAAAATGGTTTGGTTCTTACAAGCGAACCACGAACCACAAAGTCTCAGGGTCAGCTGAAGCAGACACTTCCCCGACGGAAATGAGTTCACCCTGTTTTAAAGCCTACCATTGCAGGGGAGATGTCGGCCGAAACGCGAGTATCAGAGTTGGGAATTTAACGGCTGTTTTTGGTAGCAATGAGTTGCTTCTGTAG

Protein sequence

MDRLAAVFRHGTDAALWDSGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKSDGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGFDAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNIFPEIFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSAEADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL
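
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that relationship follows, assuming the standard genetic code; the helper `translate` and the table construction are illustrative and not taken from the database. Only short fragments from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon: end of the polypeptide
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGATAGATTAGCTGCAGTC"))  # MDRLAAV (first seven codons of the CDS)
print(translate("CTTCTGTAG"))              # LL (last two codons, then the TAG stop)
```

Passing the full CDS string to `translate` would be the natural way to compare against the listed protein sequence.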
Homology
BLAST of ClCG01G021530 vs. NCBI nr
Match: XP_038881327.1 (uncharacterized protein LOC120072874 [Benincasa hispida])

HSP 1 Score: 502.3 bits (1292), Expect = 2.8e-138
Identity = 251/285 (88.07%), Positives = 261/285 (91.58%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWDSGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKS 60
           MDRLAAVFR G D ALWDSGSDHS ENSTADLFDLVKSFIE  D++IKEGEKEDSY E+S
Sbjct: 1   MDRLAAVFRRGADTALWDSGSDHSSENSTADLFDLVKSFIEKGDIKIKEGEKEDSYTEES 60

Query: 61  DGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGF 120
           DGFSFDSDA+ IKLRNLF SL++KN EIR  AEQALK VGGRSFPGIKRQLMAHLRRKGF
Sbjct: 61  DGFSFDSDAEAIKLRNLFGSLDDKNGEIRIEAEQALKPVGGRSFPGIKRQLMAHLRRKGF 120

Query: 121 DAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNIFPE 180
           DAGLCKSKME LRSFPAGDHEYIDVNFGG+RYIVEIFLAREFEIARPTSKY SLLNIFPE
Sbjct: 121 DAGLCKSKMEKLRSFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISLLNIFPE 180

Query: 181 IFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240
           IFVGNLEELK VVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTN KVSGSA
Sbjct: 181 IFVGNLEELKQVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNQKVSGSA 240

Query: 241 EADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           E +TSP E+S PCFKAYHCRGD  +NA IRVGNLTAVFG NELLL
Sbjct: 241 EEETSPPEISLPCFKAYHCRGDFDQNAGIRVGNLTAVFGGNELLL 285
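
The percentages in each HSP header are ratios over the alignment length: identical positions for Identity, and identical plus conservatively substituted positions (the `+` marks in the midline of standard BLAST output, not shown here) for Positives. A minimal sketch of the arithmetic, using the counts from the first hit; the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header of the Benincasa hispida hit.
print(percent(251, 285))  # 88.07 (identity)
print(percent(261, 285))  # 91.58 (positives)
```

Alignments that contain gaps, such as the Cucurbita hits, use the gapped alignment length (for example 288) as the denominator.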

BLAST of ClCG01G021530 vs. NCBI nr
Match: XP_008440059.1 (PREDICTED: uncharacterized protein LOC103484651 [Cucumis melo] >TYK13008.1 DUF506 family protein [Cucumis melo var. makuwa])

HSP 1 Score: 494.2 bits (1271), Expect = 7.6e-136
Identity = 243/285 (85.26%), Positives = 258/285 (90.53%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWDSGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKS 60
           MDRLAAVFR+G D+++W+SGSDHSPE  TADLFDLVKSFIE  D E KEGE EDS  E+S
Sbjct: 1   MDRLAAVFRYGADSSVWESGSDHSPEKPTADLFDLVKSFIEKGDFEFKEGETEDSCTEES 60

Query: 61  DGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGF 120
           DGFSFDSDA  +KLRNLF SLENKNEEIR   EQALKLVGGRS PGI RQLMAHLRRKGF
Sbjct: 61  DGFSFDSDAGVVKLRNLFGSLENKNEEIRIETEQALKLVGGRSVPGINRQLMAHLRRKGF 120

Query: 121 DAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNIFPE 180
           DAGLCKSKME LR+FPAGDHEYIDVNFGG+RYIVEIFLAREFEIARPTSKY SLLN FPE
Sbjct: 121 DAGLCKSKMEKLRAFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYVSLLNTFPE 180

Query: 181 IFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240
           IFVG L+ELK VVKLMCSAMKESMKKRNMH+PPWRRNGYMQAKWFGSYKRTTNHKVSGSA
Sbjct: 181 IFVGTLDELKQVVKLMCSAMKESMKKRNMHIPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240

Query: 241 EADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           EA+TSP+EMS PCFK+Y+CRGD GRNA IRVGNLTAVFG NELLL
Sbjct: 241 EAETSPSEMSLPCFKSYYCRGDFGRNAGIRVGNLTAVFGGNELLL 285

BLAST of ClCG01G021530 vs. NCBI nr
Match: XP_004134790.1 (uncharacterized protein LOC101205314 [Cucumis sativus] >KAE8647647.1 hypothetical protein Csa_003601 [Cucumis sativus])

HSP 1 Score: 478.0 bits (1229), Expect = 5.6e-131
Identity = 235/285 (82.46%), Positives = 253/285 (88.77%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWDSGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKS 60
           MDRLAA+FRH  D++  +SGSDHSPE  TADLFDLVKSFIE  D+E KEGE+ED   E+S
Sbjct: 1   MDRLAALFRHRADSSFSESGSDHSPEKPTADLFDLVKSFIEKGDLEFKEGEREDCCTEES 60

Query: 61  DGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGF 120
           DGFSFDSDA  +KLRNLF S+ENKNEEIR   EQALKLVGGRS PGI RQLMAHLRR+GF
Sbjct: 61  DGFSFDSDAGVVKLRNLFGSVENKNEEIRIETEQALKLVGGRSLPGINRQLMAHLRREGF 120

Query: 121 DAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNIFPE 180
           DAGLCKSKME  R+FPAGDHEYIDVNFGG+RYIVEIFLAREFEIARPTSKY SLLN FPE
Sbjct: 121 DAGLCKSKMEKPRAFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYVSLLNTFPE 180

Query: 181 IFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240
           IFVG L+ELK VVKLMCSAMKESMKK NMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGS+
Sbjct: 181 IFVGTLDELKHVVKLMCSAMKESMKKMNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSS 240

Query: 241 EADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           EA+TSP E+S PCFK+YHCRGD GRNA IRVGNLTAVFG NELL+
Sbjct: 241 EAETSPAEISLPCFKSYHCRGDFGRNAGIRVGNLTAVFGGNELLM 285

BLAST of ClCG01G021530 vs. NCBI nr
Match: XP_022926904.1 (uncharacterized protein LOC111433882 [Cucurbita moschata])

HSP 1 Score: 443.0 bits (1138), Expect = 2.0e-120
Identity = 226/288 (78.47%), Positives = 246/288 (85.42%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWD--SGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKE 60
           MDR A +FRHG +AA+WD  SGSDHSPENS ADLFDLVKSF+E DD+EI EGE+ED  KE
Sbjct: 5   MDRFAEIFRHGAEAAVWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDGGKE 64

Query: 61  KSDGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQAL-KLVGGRSFPGIKRQLMAHLRR 120
           +SD FS DSDA  IKL+NLF S +N+++EIR  AEQAL KLVGGRSF GIKR+LMAHLRR
Sbjct: 65  ESDSFSCDSDAGVIKLKNLFGSRDNESDEIRIEAEQALKKLVGGRSFQGIKRKLMAHLRR 124

Query: 121 KGFDAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNI 180
           KGFDAGLCKSK E L+SFPAGDHEYIDVNFGG+RYIVE+FLAREFEIARPT KY SLLN 
Sbjct: 125 KGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEVFLAREFEIARPTRKYTSLLNT 184

Query: 181 FPEIFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVS 240
           FPEIFVGNLEELK VVKLMCSAMK+SM  RNMHVPPWRR GYMQ KWFGSYKRTTNHK S
Sbjct: 185 FPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNHKGS 244

Query: 241 GSAEADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           GSAEA+TSP  MSS CFK  HCRGD GRN  I VGNLTA FG++ LLL
Sbjct: 245 GSAEAETSP-GMSSACFKTSHCRGDFGRNRGIMVGNLTAAFGADGLLL 291

BLAST of ClCG01G021530 vs. NCBI nr
Match: KAG6594690.1 (hypothetical protein SDJN03_11243, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 441.8 bits (1135), Expect = 4.5e-120
Identity = 228/288 (79.17%), Positives = 246/288 (85.42%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWD--SGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKE 60
           MDR A +FRHG +AALWD  SGSDHSPENS ADLFDLVKSF+E DD+EI EGE+ED   E
Sbjct: 5   MDRFAEMFRHGAEAALWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDRGTE 64

Query: 61  KSDGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQAL-KLVGGRSFPGIKRQLMAHLRR 120
           +SDGFS DSDA  IKL+NLF S +NK++EIR  AEQAL KLVGGRSF GIKR+LMAHLRR
Sbjct: 65  ESDGFSCDSDAGVIKLKNLFGSRDNKSDEIRIEAEQALKKLVGGRSFQGIKRKLMAHLRR 124

Query: 121 KGFDAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNI 180
           KGFDAGLCKSK E L+SFPAGDHEYIDVNFGG+RYIVE+FLAREFEIARPT KY SLLN 
Sbjct: 125 KGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEVFLAREFEIARPTRKYTSLLNT 184

Query: 181 FPEIFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVS 240
           FPEIFVGNLEELK VVKLMCSAMK+SM  RNMHVPPWRR GYMQ KWFGSYKRTTN K S
Sbjct: 185 FPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNLKGS 244

Query: 241 GSAEADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           GSAEA+TSP  MSS CFKA HCRGD GRN  I VGNLTA FG++ LLL
Sbjct: 245 GSAEAETSP-GMSSACFKASHCRGDFGRNRGIMVGNLTAAFGADGLLL 291

BLAST of ClCG01G021530 vs. ExPASy TrEMBL
Match: A0A5D3CPA4 (DUF506 family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G006060 PE=4 SV=1)

HSP 1 Score: 494.2 bits (1271), Expect = 3.7e-136
Identity = 243/285 (85.26%), Positives = 258/285 (90.53%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWDSGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKS 60
           MDRLAAVFR+G D+++W+SGSDHSPE  TADLFDLVKSFIE  D E KEGE EDS  E+S
Sbjct: 1   MDRLAAVFRYGADSSVWESGSDHSPEKPTADLFDLVKSFIEKGDFEFKEGETEDSCTEES 60

Query: 61  DGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGF 120
           DGFSFDSDA  +KLRNLF SLENKNEEIR   EQALKLVGGRS PGI RQLMAHLRRKGF
Sbjct: 61  DGFSFDSDAGVVKLRNLFGSLENKNEEIRIETEQALKLVGGRSVPGINRQLMAHLRRKGF 120

Query: 121 DAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNIFPE 180
           DAGLCKSKME LR+FPAGDHEYIDVNFGG+RYIVEIFLAREFEIARPTSKY SLLN FPE
Sbjct: 121 DAGLCKSKMEKLRAFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYVSLLNTFPE 180

Query: 181 IFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240
           IFVG L+ELK VVKLMCSAMKESMKKRNMH+PPWRRNGYMQAKWFGSYKRTTNHKVSGSA
Sbjct: 181 IFVGTLDELKQVVKLMCSAMKESMKKRNMHIPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240

Query: 241 EADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           EA+TSP+EMS PCFK+Y+CRGD GRNA IRVGNLTAVFG NELLL
Sbjct: 241 EAETSPSEMSLPCFKSYYCRGDFGRNAGIRVGNLTAVFGGNELLL 285

BLAST of ClCG01G021530 vs. ExPASy TrEMBL
Match: A0A1S3B074 (uncharacterized protein LOC103484651 OS=Cucumis melo OX=3656 GN=LOC103484651 PE=4 SV=1)

HSP 1 Score: 494.2 bits (1271), Expect = 3.7e-136
Identity = 243/285 (85.26%), Positives = 258/285 (90.53%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWDSGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKS 60
           MDRLAAVFR+G D+++W+SGSDHSPE  TADLFDLVKSFIE  D E KEGE EDS  E+S
Sbjct: 1   MDRLAAVFRYGADSSVWESGSDHSPEKPTADLFDLVKSFIEKGDFEFKEGETEDSCTEES 60

Query: 61  DGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGF 120
           DGFSFDSDA  +KLRNLF SLENKNEEIR   EQALKLVGGRS PGI RQLMAHLRRKGF
Sbjct: 61  DGFSFDSDAGVVKLRNLFGSLENKNEEIRIETEQALKLVGGRSVPGINRQLMAHLRRKGF 120

Query: 121 DAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNIFPE 180
           DAGLCKSKME LR+FPAGDHEYIDVNFGG+RYIVEIFLAREFEIARPTSKY SLLN FPE
Sbjct: 121 DAGLCKSKMEKLRAFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYVSLLNTFPE 180

Query: 181 IFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240
           IFVG L+ELK VVKLMCSAMKESMKKRNMH+PPWRRNGYMQAKWFGSYKRTTNHKVSGSA
Sbjct: 181 IFVGTLDELKQVVKLMCSAMKESMKKRNMHIPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240

Query: 241 EADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           EA+TSP+EMS PCFK+Y+CRGD GRNA IRVGNLTAVFG NELLL
Sbjct: 241 EAETSPSEMSLPCFKSYYCRGDFGRNAGIRVGNLTAVFGGNELLL 285

BLAST of ClCG01G021530 vs. ExPASy TrEMBL
Match: A0A6J1EGH2 (uncharacterized protein LOC111433882 OS=Cucurbita moschata OX=3662 GN=LOC111433882 PE=4 SV=1)

HSP 1 Score: 443.0 bits (1138), Expect = 9.7e-121
Identity = 226/288 (78.47%), Positives = 246/288 (85.42%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWD--SGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKE 60
           MDR A +FRHG +AA+WD  SGSDHSPENS ADLFDLVKSF+E DD+EI EGE+ED  KE
Sbjct: 5   MDRFAEIFRHGAEAAVWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDGGKE 64

Query: 61  KSDGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQAL-KLVGGRSFPGIKRQLMAHLRR 120
           +SD FS DSDA  IKL+NLF S +N+++EIR  AEQAL KLVGGRSF GIKR+LMAHLRR
Sbjct: 65  ESDSFSCDSDAGVIKLKNLFGSRDNESDEIRIEAEQALKKLVGGRSFQGIKRKLMAHLRR 124

Query: 121 KGFDAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNI 180
           KGFDAGLCKSK E L+SFPAGDHEYIDVNFGG+RYIVE+FLAREFEIARPT KY SLLN 
Sbjct: 125 KGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEVFLAREFEIARPTRKYTSLLNT 184

Query: 181 FPEIFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVS 240
           FPEIFVGNLEELK VVKLMCSAMK+SM  RNMHVPPWRR GYMQ KWFGSYKRTTNHK S
Sbjct: 185 FPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNHKGS 244

Query: 241 GSAEADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           GSAEA+TSP  MSS CFK  HCRGD GRN  I VGNLTA FG++ LLL
Sbjct: 245 GSAEAETSP-GMSSACFKTSHCRGDFGRNRGIMVGNLTAAFGADGLLL 291

BLAST of ClCG01G021530 vs. ExPASy TrEMBL
Match: A0A6J1KUE8 (uncharacterized protein LOC111497287 OS=Cucurbita maxima OX=3661 GN=LOC111497287 PE=4 SV=1)

HSP 1 Score: 439.1 bits (1128), Expect = 1.4e-119
Identity = 228/289 (78.89%), Positives = 245/289 (84.78%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWD--SGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKE 60
           MDR A +FRHG +AALWD  SGSDHSPENS ADLFDLVKSF+E DD+EI EGE+ED   E
Sbjct: 5   MDRFAEIFRHGAEAALWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDGSTE 64

Query: 61  KSD-GFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQAL-KLVGGRSFPGIKRQLMAHLR 120
           +SD GFS DSDA  IKL+NLF S +NK++EIR  AEQAL KLVGGRSF GIKR+LMAHLR
Sbjct: 65  ESDGGFSCDSDAGVIKLKNLFGSRDNKSDEIRIEAEQALKKLVGGRSFQGIKRKLMAHLR 124

Query: 121 RKGFDAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLN 180
           RKGFDAGLCKSK E L+SFPAGDHEYIDVNFGG+RYIVEIFLAREFEIARPT KY SLLN
Sbjct: 125 RKGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTRKYTSLLN 184

Query: 181 IFPEIFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKV 240
            FPEIFVGNLEELK VVKLMCSAMK+SM  RNMHVPPWRR GYMQ KWFGSYKRTTNHK 
Sbjct: 185 TFPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNHKG 244

Query: 241 SGSAEADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           SGSAEA+TSP  MSS CFK  HCRGD GRN  I VGNLTA FG+ + LL
Sbjct: 245 SGSAEAETSP-GMSSACFKTSHCRGDFGRNRGIMVGNLTAAFGAADGLL 292

BLAST of ClCG01G021530 vs. ExPASy TrEMBL
Match: A0A0A0KL43 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511780 PE=4 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 4.7e-115
Identity = 215/285 (75.44%), Positives = 232/285 (81.40%), Query Frame = 0

Query: 1   MDRLAAVFRHGTDAALWDSGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKS 60
           MDRLAA+FRH  D++  +SGSDHSPE  TADLFDLVKSFIE  D+E KEGE+ED   E+S
Sbjct: 1   MDRLAALFRHRADSSFSESGSDHSPEKPTADLFDLVKSFIEKGDLEFKEGEREDCCTEES 60

Query: 61  DGFSFDSDADTIKLRNLFCSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGF 120
           DGFSFDSDA  +KLRNLF S+ENKNEEIR   EQALKLV                     
Sbjct: 61  DGFSFDSDAGVVKLRNLFGSVENKNEEIRIETEQALKLV--------------------- 120

Query: 121 DAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNIFPE 180
             GLCKSKME  R+FPAGDHEYIDVNFGG+RYIVEIFLAREFEIARPTSKY SLLN FPE
Sbjct: 121 --GLCKSKMEKPRAFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYVSLLNTFPE 180

Query: 181 IFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSA 240
           IFVG L+ELK VVKLMCSAMKESMKK NMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGS+
Sbjct: 181 IFVGTLDELKHVVKLMCSAMKESMKKMNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSS 240

Query: 241 EADTSPTEMSSPCFKAYHCRGDVGRNASIRVGNLTAVFGSNELLL 286
           EA+TSP E+S PCFK+YHCRGD GRNA IRVGNLTAVFG NELL+
Sbjct: 241 EAETSPAEISLPCFKSYHCRGDFGRNAGIRVGNLTAVFGGNELLM 262

BLAST of ClCG01G021530 vs. TAIR 10
Match: AT1G12030.1 (Protein of unknown function (DUF506))

HSP 1 Score: 198.0 bits (502), Expect = 1.0e-50
Identity = 123/269 (45.72%), Positives = 162/269 (60.22%), Query Frame = 0

Query: 19  SGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKSDGFSFDSDADTIKLRNLF 78
           SGSDHSP++ T DL+DLV+SFI   D E+ E   ED+++E+ D  S D D + +K R   
Sbjct: 30  SGSDHSPDD-TEDLWDLVESFI---DREV-ETLPEDAFQEEEDDKS-DEDYEDVKERLRE 89

Query: 79  CSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGFDAGLCKSKMENLRSFPAG 138
               +  EE +   ++A+     R F G KR  MA+LR KGFDAGLCKS+ E      AG
Sbjct: 90  ILENHGGEERQRIMDEAVN--ASRVFAGEKRHFMAYLRNKGFDAGLCKSRWEKFGKNTAG 149

Query: 139 DHEYIDVNFGG-SRYIVEIFLAREFEIARPTSKYRSLLNIFPEIFVGNLEELKGVVKLMC 198
            +EY+DV  G  +RYIVE  LA EFEIARPT++Y S+L   P +FVG  EELK +V++MC
Sbjct: 150 KYEYVDVKAGDKNRYIVETNLAGEFEIARPTTRYLSVLAQVPRVFVGTPEELKQLVRIMC 209

Query: 199 SAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSGSAEADTSPTEMSSPCFKAY 258
             ++ SMK+ ++ VPPWRRNGYMQAKWFG YKRT+N  VS        P        K  
Sbjct: 210 FEIRRSMKRADIFVPPWRRNGYMQAKWFGHYKRTSNEVVSRVKSCGCGPRVGFEESVKMT 269

Query: 259 HCRG---DVGRNASIRVGNLTAVFGSNEL 284
              G      R + ++VG LT  F  +E+
Sbjct: 270 TFNGFKDGEMRRSGLKVGQLTVAFNGSEV 290

BLAST of ClCG01G021530 vs. TAIR 10
Match: AT1G62420.1 (Protein of unknown function (DUF506))

HSP 1 Score: 176.0 bits (445), Expect = 4.2e-44
Identity = 110/222 (49.55%), Positives = 132/222 (59.46%), Query Frame = 0

Query: 19  SGSDHSPENSTADLFDLVKSFIESDDMEIKEGEKEDSYKEKSDGFSFDSDADTI--KLRN 78
           SGSDHSP     DL DLV SFIE +   +   E+E S        S D++ + +  +LR 
Sbjct: 30  SGSDHSP-----DLSDLVASFIEKEGQIVLREEEETS--------SDDNNLEDVNERLRK 89

Query: 79  LFCSLENKNEEIRF---GAEQALKLVGGRSFPGIKRQLMAHLRRKGFDAGLCKSKMENLR 138
           L   L    E +R      E A   VG  S    KR LMA LR KGFDAGLCKS  E   
Sbjct: 90  LLEGLSCGEERMRILSATMEVAGTFVGDIS--SSKRHLMAFLRNKGFDAGLCKSSWERFG 149

Query: 139 SFPAGDHEYIDVNFGG---SRYIVEIFLAREFEIARPTSKYRSLLNIFPEIFVGNLEELK 198
               G +EY+DV  GG   +RY VE  LA EFEIARPT +Y S+L+  P +FVG  EELK
Sbjct: 150 KNTGGKYEYVDVRCGGDYNNRYFVETNLAGEFEIARPTKRYLSILSQVPRVFVGTSEELK 209

Query: 199 GVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKRTT 233
            +V++MC  M+ SMK   +HVPPWRRNGYMQAKWFG YKRT+
Sbjct: 210 LLVRIMCHEMRRSMKHVGIHVPPWRRNGYMQAKWFGFYKRTS 236

BLAST of ClCG01G021530 vs. TAIR 10
Match: AT3G07350.1 (Protein of unknown function (DUF506))

HSP 1 Score: 129.4 bits (324), Expect = 4.6e-30
Identity = 84/260 (32.31%), Positives = 139/260 (53.46%), Query Frame = 0

Query: 2   DRLAAVFRHGTDAALWDSGSDHS-------PENSTADLFDLVKSFIESDDMEIKEGEKED 61
           D LA   R       + SGS+H+        ++ +  L DLV+ F+E    E+   + E 
Sbjct: 11  DPLAEEVRARLVGCSFSSGSEHTGDGIEDYEDDDSPCLSDLVQGFLED---EVDTVDDES 70

Query: 62  SYKEKSDGFSFDSDADTIKL----RNLFCSLENKNEEIRFG---------AEQALKLVGG 121
            + ++  G   DSD++  +L     ++   L N   E  +G         A + L  +G 
Sbjct: 71  CWCDQDSGSDSDSDSELGELPDFADDIAKLLRNSLREDSYGRTVLVHVARAMEMLSSLGS 130

Query: 122 R--SFPGIKRQLMAHLRRKGFDAGLCKSKMENLRSFPAGDHEYIDVNFGGS------RYI 181
           +       +R++M+ LR  G +A +CK+K ++     AG+HE+IDV +  S      R+I
Sbjct: 131 QPEQRAVFQRKVMSLLRELGHNAAICKTKWKSSGGLTAGNHEFIDVVYTPSASSQSVRFI 190

Query: 182 VEIFLAREFEIARPTSKYRSLLNIFPEIFVGNLEELKGVVKLMCSAMKESMKKRNMHVPP 234
           V++  +  F+IARPTS+Y  +L   P +FVG  ++LK +++L+C A + S++ R + +PP
Sbjct: 191 VDLDFSSRFQIARPTSQYARVLQSLPAVFVGKGDDLKRILRLVCDAARISLRNRGLTLPP 250

BLAST of ClCG01G021530 vs. TAIR 10
Match: AT4G14620.1 (Protein of unknown function (DUF506))

HSP 1 Score: 129.0 bits (323), Expect = 5.9e-30
Identity = 63/148 (42.57%), Positives = 92/148 (62.16%), Query Frame = 0

Query: 107 IKRQLMAHLRRKGFDAGLCKSKMENLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIAR 166
           +++ ++  L   G+D+ +CKSK +  RS PAG++EYIDV   G R I++I    EFEIAR
Sbjct: 143 LRKIVVDELSSLGYDSSICKSKWDKTRSIPAGEYEYIDVIVNGERLIIDIDFRSEFEIAR 202

Query: 167 PTSKYRSLLNIFPEIFVGNLEELKGVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFG 226
            TS Y+ LL   P IFVG  + ++ +V ++  A K+S+KK+ MH PPWR+  YM+AKW  
Sbjct: 203 QTSGYKELLQSLPLIFVGKSDRIRQIVSIVSEASKQSLKKKGMHFPPWRKADYMRAKWLS 262

Query: 227 SYKRTTNHK---VSGSAEADTSPTEMSS 252
           SY R +  K   V+ +A+    P   SS
Sbjct: 263 SYTRNSGEKKPTVTSAAKVVAEPELDSS 290

BLAST of ClCG01G021530 vs. TAIR 10
Match: AT2G38820.2 (Protein of unknown function (DUF506))

HSP 1 Score: 123.6 bits (309), Expect = 2.5e-28
Identity = 62/160 (38.75%), Positives = 94/160 (58.75%), Query Frame = 0

Query: 71  TIKLRNLFCSLENKNEEIRFGAEQALKLVGGRSFPGIKRQLMAHLRRKGFDAGLCKSKME 130
           +I++RNL   +    E       +  KL  G     +   L++     G+DA LCKS+ E
Sbjct: 132 SIRVRNLLTDVTKIAE-----TSKNCKLKDGSCLKSVANGLVS----LGYDAALCKSRWE 191

Query: 131 NLRSFPAGDHEYIDVNFGGSRYIVEIFLAREFEIARPTSKYRSLLNIFPEIFVGNLEELK 190
              S PAG++EY+DV   G R +++I    +FEIAR T  Y+S+L   P IFVG  + L+
Sbjct: 192 KSPSCPAGEYEYVDVIMKGERLLIDIDFKSKFEIARATKTYKSMLQTLPYIFVGKADRLQ 251

Query: 191 GVVKLMCSAMKESMKKRNMHVPPWRRNGYMQAKWFGSYKR 231
            ++ L+C A K+S+KK+ +HVPPWRR  Y+++KW  S+ R
Sbjct: 252 KIIVLICKAAKQSLKKKGLHVPPWRRAEYVKSKWLSSHVR 282

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name       E-value    Identity (%)  Description
XP_038881327.1   2.8e-138   88.07         uncharacterized protein LOC120072874 [Benincasa hispida]
XP_008440059.1   7.6e-136   85.26         PREDICTED: uncharacterized protein LOC103484651 [Cucumis melo] >TYK13008.1 DUF50...
XP_004134790.1   5.6e-131   82.46         uncharacterized protein LOC101205314 [Cucumis sativus] >KAE8647647.1 hypothetica...
XP_022926904.1   2.0e-120   78.47         uncharacterized protein LOC111433882 [Cucurbita moschata]
KAG6594690.1     4.5e-120   79.17         hypothetical protein SDJN03_11243, partial [Cucurbita argyrosperma subsp. sorori...

vs. ExPASy TrEMBL
Match Name       E-value    Identity (%)  Description
A0A5D3CPA4       3.7e-136   85.26         DUF506 family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25...
A0A1S3B074       3.7e-136   85.26         uncharacterized protein LOC103484651 OS=Cucumis melo OX=3656 GN=LOC103484651 PE=...
A0A6J1EGH2       9.7e-121   78.47         uncharacterized protein LOC111433882 OS=Cucurbita moschata OX=3662 GN=LOC1114338...
A0A6J1KUE8       1.4e-119   78.89         uncharacterized protein LOC111497287 OS=Cucurbita maxima OX=3661 GN=LOC111497287...
A0A0A0KL43       4.7e-115   75.44         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511780 PE=4 SV=1

vs. TAIR 10
Match Name       E-value    Identity (%)  Description
AT1G12030.1      1.0e-50    45.72         Protein of unknown function (DUF506)
AT1G62420.1      4.2e-44    49.55         Protein of unknown function (DUF506)
AT3G07350.1      4.6e-30    32.31         Protein of unknown function (DUF506)
AT4G14620.1      5.9e-30    42.57         Protein of unknown function (DUF506)
AT2G38820.2      2.5e-28    38.75         Protein of unknown function (DUF506)

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term    IPR Description                          Source   Source Term     Source Description              Alignment
IPR006502   Protein of unknown function PDDEXK-like  TIGRFAM  TIGR01615       TIGR01615                       coord: 108..231; e-value: 1.2E-46; score: 155.7
IPR006502   Protein of unknown function PDDEXK-like  PFAM     PF04720         PDDEXK_6                        coord: 34..230; e-value: 7.6E-58; score: 196.1
IPR006502   Protein of unknown function PDDEXK-like  PANTHER  PTHR31579       OS03G0796600 PROTEIN            coord: 3..279
None        No IPR available                         PANTHER  PTHR31579:SF42  DUF506 FAMILY PROTEIN (DUF506)  coord: 3..279

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
ClCG01G021530.1   ClCG01G021530.1   mRNA