Cla97C02G049420 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C02G049420
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Sequence-specific DNA binding transcription factors
Location: Cla97Chr02: 36831747 .. 36833021 (-)
RNA-Seq Expression: Cla97C02G049420
Synteny: Cla97C02G049420
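
The Location field above can be read programmatically. The following is a minimal sketch, assuming the field always has the form `seqid: start .. end (strand)` and that the coordinates are 1-based and inclusive (an assumption, not stated on this page); `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cla97Chr02: 36831747 .. 36833021 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumes 1-based inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("Cla97Chr02: 36831747 .. 36833021 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the span is 1275 nt on the minus strand, a multiple of three.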
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAATGGATAGTTCAGGTTTGGGAGGTGGATTTCTCTCAGCAAATGGGGGGCTGTTAGATCTGGAGTCTCCTATCCGAAGACCTCAAAAAACCCAATTGTTCAATCCCTCGTTGACACAACGCCATCACTTGAACATGATGAGTACTTTTGAAGGCGATCACCAGTCCATTGGGATTGTGGACACGAAAAGCGTGGGGCAGAAGGATTTATTGATGGCGTTCAGTAAAAGGAAAGCTATTGCCTCTGGTTGCAACACAAACAACTACACGAGTGAAGAAGATGAGCCAAGTTTTACCGAGGATGGCGAGTGCTCTGAGTTTTTGAAGGGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAACAGTGGTTGCTTGTGTGGGTGATGACGGGGAGGCTGGAGTGGGTTCGAAGAGAAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAACAGTGTCAAAGATTATGCTAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAATTTAATGACTTGAACAAAAGATACAAGAGATTGAACGATATAATTGGGAGGGGAACCAGTTGTAGAGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTTCGAAAAATATTAAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATATCCAAGGTAAAATTTTGCCTGTTGCTAATTTCTCCAAAGGAAATAATGAGTCAGAAGAGGCTGAGGACAGTGACAGTGACAGTGAAAGTGGTGAATCAGCTAGTGAAGACGATCACTCTCCTGTGGAAAATAGTTTATGGCCATCTGAATCTCGTGGCAGGGATAAAGCGAGTGCAGATGATGGTCCTCTTTGGTCAAACTCTGTTGCAAAAAATGAATTTGAAGGTCAAATTGATGTTTTTCTTTCGGATCCGACAAAGTCCCAATGGGAGCGCAGAGATTGGGTTAAAATACAGATGCTACAACTTCAGGAGCAATGTATGACCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGGTATTGTAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAACTAGATAATGAGCGGAGAGTACTGCAACTAAAGCAGAAGGAAATGGAACTAGAATTAAAAAAGGTCTGA

mRNA sequence

ATGAAAATGGATAGTTCAGGTTTGGGAGGTGGATTTCTCTCAGCAAATGGGGGGCTGTTAGATCTGGAGTCTCCTATCCGAAGACCTCAAAAAACCCAATTGTTCAATCCCTCGTTGACACAACGCCATCACTTGAACATGATGAGTACTTTTGAAGGCGATCACCAGTCCATTGGGATTGTGGACACGAAAAGCGTGGGGCAGAAGGATTTATTGATGGCGTTCAGTAAAAGGAAAGCTATTGCCTCTGGTTGCAACACAAACAACTACACGAGTGAAGAAGATGAGCCAAGTTTTACCGAGGATGGCGAGTGCTCTGAGTTTTTGAAGGGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAACAGTGGTTGCTTGTGTGGGTGATGACGGGGAGGCTGGAGTGGGTTCGAAGAGAAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAACAGTGTCAAAGATTATGCTAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAATTTAATGACTTGAACAAAAGATACAAGAGATTGAACGATATAATTGGGAGGGGAACCAGTTGTAGAGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTTCGAAAAATATTAAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATATCCAAGGTAAAATTTTGCCTGTTGCTAATTTCTCCAAAGGAAATAATGAGTCAGAAGAGGCTGAGGACAGTGACAGTGACAGTGAAAGTGGTGAATCAGCTAGTGAAGACGATCACTCTCCTGTGGAAAATAGTTTATGGCCATCTGAATCTCGTGGCAGGGATAAAGCGAGTGCAGATGATGGTCCTCTTTGGTCAAACTCTGTTGCAAAAAATGAATTTGAAGGTCAAATTGATGTTTTTCTTTCGGATCCGACAAAGTCCCAATGGGAGCGCAGAGATTGGGTTAAAATACAGATGCTACAACTTCAGGAGCAATGTATGACCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGGTATTGTAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAACTAGATAATGAGCGGAGAGTACTGCAACTAAAGCAGAAGGAAATGGAACTAGAATTAAAAAAGGTCTGA

Coding sequence (CDS)

ATGAAAATGGATAGTTCAGGTTTGGGAGGTGGATTTCTCTCAGCAAATGGGGGGCTGTTAGATCTGGAGTCTCCTATCCGAAGACCTCAAAAAACCCAATTGTTCAATCCCTCGTTGACACAACGCCATCACTTGAACATGATGAGTACTTTTGAAGGCGATCACCAGTCCATTGGGATTGTGGACACGAAAAGCGTGGGGCAGAAGGATTTATTGATGGCGTTCAGTAAAAGGAAAGCTATTGCCTCTGGTTGCAACACAAACAACTACACGAGTGAAGAAGATGAGCCAAGTTTTACCGAGGATGGCGAGTGCTCTGAGTTTTTGAAGGGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAACAGTGGTTGCTTGTGTGGGTGATGACGGGGAGGCTGGAGTGGGTTCGAAGAGAAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAACAGTGTCAAAGATTATGCTAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAATTTAATGACTTGAACAAAAGATACAAGAGATTGAACGATATAATTGGGAGGGGAACCAGTTGTAGAGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTTCGAAAAATATTAAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATATCCAAGGTAAAATTTTGCCTGTTGCTAATTTCTCCAAAGGAAATAATGAGTCAGAAGAGGCTGAGGACAGTGACAGTGACAGTGAAAGTGGTGAATCAGCTAGTGAAGACGATCACTCTCCTGTGGAAAATAGTTTATGGCCATCTGAATCTCGTGGCAGGGATAAAGCGAGTGCAGATGATGGTCCTCTTTGGTCAAACTCTGTTGCAAAAAATGAATTTGAAGGTCAAATTGATGTTTTTCTTTCGGATCCGACAAAGTCCCAATGGGAGCGCAGAGATTGGGTTAAAATACAGATGCTACAACTTCAGGAGCAATGTATGACCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGGTATTGTAGTAAGAAAAATAGGGATTTGGAGAGAGTGAGGCTTGAAAATGAGAGGATGAAACTAGATAATGAGCGGAGAGTACTGCAACTAAAGCAGAAGGAAATGGAACTAGAATTAAAAAAGGTCTGA

Protein sequence

MKMDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVDTKSVGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELELKKV
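
The protein sequence follows from the CDS by translation with the standard genetic code. As a small sketch (the helper name `translate` is hypothetical, and only the first 21 and last 9 nucleotides of the CDS are used here rather than the full sequence), the start and end of the protein can be reproduced like this:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

head = translate("ATGAAAATGGATAGTTCAGGT")  # first 7 codons of the CDS
tail = translate("AAGGTCTGA")              # last 3 codons of the CDS
print(head, tail)
```

The first seven residues come out as MKMDSSG and the last codons as K, V and a stop, matching the start and end of the protein sequence listed above.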
Homology
BLAST of Cla97C02G049420 vs. NCBI nr
Match: XP_038901508.1 (uncharacterized protein LOC120088355 [Benincasa hispida])

HSP 1 Score: 778.5 bits (2009), Expect = 3.0e-221
Identity = 391/424 (92.22%), Positives = 403/424 (95.05%), Query Frame = 0

Query: 1   MKMDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGI 60
           MKMDSSGLGGGFLS NGGL+DLESPIRRPQKTQL NPSLT RHHLNMMSTFEGDH S+G 
Sbjct: 1   MKMDSSGLGGGFLSGNGGLIDLESPIRRPQKTQLVNPSLTHRHHLNMMSTFEGDHWSLGT 60

Query: 61  VDTKSVGQKDLLMAFSKRKAIAS-GCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQR 120
           VDTKS+GQKDLLMAF+K KAIAS G   NNYTSEEDEPSFTEDGEC EFLKGKKGSPWQR
Sbjct: 61  VDTKSLGQKDLLMAFNKGKAIASGGITNNNYTSEEDEPSFTEDGECPEFLKGKKGSPWQR 120

Query: 121 MKWTDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQC 180
           MKWTDEIVRLLI VVACVGDDGEAG GSKRKSGILQKKGKWKT+SKIMLSKGCHVSPQQC
Sbjct: 121 MKWTDEIVRLLIAVVACVGDDGEAGTGSKRKSGILQKKGKWKTISKIMLSKGCHVSPQQC 180

Query: 181 EDKFNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKE 240
           EDKFNDLNKRYKRLNDIIG+GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKE
Sbjct: 181 EDKFNDLNKRYKRLNDIIGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKE 240

Query: 241 MCAYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVE 300
           MCAYHNGQTIPGCQDVD QGKILPVANFSKGNNES+EAEDSDSDS+SGES +EDDHSPVE
Sbjct: 241 MCAYHNGQTIPGCQDVDFQGKILPVANFSKGNNESDEAEDSDSDSDSGESDNEDDHSPVE 300

Query: 301 NSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQL 360
           N LWPSESRGRDK SADDGPLWSNSVAKNEFEG+IDVFLSDPTKSQWERRDWV+ QMLQL
Sbjct: 301 NRLWPSESRGRDKVSADDGPLWSNSVAKNEFEGRIDVFLSDPTKSQWERRDWVEKQMLQL 360

Query: 361 QEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMEL 420
           QEQC  FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMKLDNERRVLQLKQKEMEL
Sbjct: 361 QEQCNNFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNERRVLQLKQKEMEL 420

Query: 421 ELKK 424
           ELK+
Sbjct: 421 ELKR 424
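
The percentages in each HSP header are the match counts divided by the alignment length, which counts gap columns as well as residues. A minimal sketch that recomputes them from the header line of the first hit (the function name `hsp_stats` is hypothetical; the pattern accepts both the "Postives" and "Positives" spellings):

```python
import re

# Matches the statistics line of an HSP header.
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)

def hsp_stats(line):
    """Recompute identity and positives percentages from an HSP header line."""
    m = STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found")
    identical, length, positives, _ = map(int, m.groups())
    return {
        "aligned_columns": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

stats = hsp_stats(
    "Identity = 391/424 (92.22%), Postives = 403/424 (95.05%), Query Frame = 0"
)
print(stats)
```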

BLAST of Cla97C02G049420 vs. NCBI nr
Match: TYJ96146.1 (stress response protein nst1 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 763.5 bits (1970), Expect = 9.9e-217
Identity = 381/421 (90.50%), Positives = 398/421 (94.54%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRRPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKW 122
           +KS+GQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TEDGECSEFLKGKKGSPWQRMKW
Sbjct: 61  SKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKW 120

Query: 123 TDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDK 182
           TDEIVRLLI VVACVGDDGEAG+GSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDK
Sbjct: 121 TDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDK 180

Query: 183 FNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 242
           FNDLNKRYKRLNDI+G+GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA
Sbjct: 181 FNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 240

Query: 243 YHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL 302
           YHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Sbjct: 241 YHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL 300

Query: 303 WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQ 362
           W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQ
Sbjct: 301 WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQ 360

Query: 363 CMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELELK 422
           C +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMKLDNE+RVLQLK+KEMELE K
Sbjct: 361 CNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELESK 420

Query: 423 K 424
           +
Sbjct: 421 R 421

BLAST of Cla97C02G049420 vs. NCBI nr
Match: XP_008449727.1 (PREDICTED: uncharacterized protein LOC103491522 [Cucumis melo] >KAA0041403.1 stress response protein nst1 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 762.3 bits (1967), Expect = 2.2e-216
Identity = 380/421 (90.26%), Positives = 398/421 (94.54%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRRPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKW 122
           +KS+GQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TEDGECSEFLKGKKGSPWQRMKW
Sbjct: 61  SKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKW 120

Query: 123 TDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDK 182
           TDEIVRLLI VVACVGDDGEAG+GSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDK
Sbjct: 121 TDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDK 180

Query: 183 FNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 242
           FNDLNKRYKRLNDI+G+GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA
Sbjct: 181 FNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 240

Query: 243 YHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL 302
           YHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Sbjct: 241 YHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL 300

Query: 303 WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQ 362
           W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQ
Sbjct: 301 WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQ 360

Query: 363 CMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELELK 422
           C +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMKLDNE+RVLQLK+KEMELE K
Sbjct: 361 CNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELESK 420

Query: 423 K 424
           +
Sbjct: 421 R 421

BLAST of Cla97C02G049420 vs. NCBI nr
Match: XP_004142119.1 (uncharacterized protein LOC101205501 [Cucumis sativus] >KGN54170.1 hypothetical protein Csa_017893 [Cucumis sativus])

HSP 1 Score: 758.1 bits (1956), Expect = 4.2e-215
Identity = 380/421 (90.26%), Positives = 398/421 (94.54%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRRPQKTQL NPSLTQRH LNMM+ FEGDHQSIGI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMNNFEGDHQSIGILD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKW 122
           +KS+GQKDLLMAF++ KAIASGC TNNYTSEEDEPS+TEDGECSEFLKGKKGSPWQRMKW
Sbjct: 61  SKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKW 120

Query: 123 TDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDK 182
           TDEIVRLLI VVACVGDDGEAG+GSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDK
Sbjct: 121 TDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDK 180

Query: 183 FNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 242
           FNDLNKRYKRLNDI+G+GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA
Sbjct: 181 FNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 240

Query: 243 YHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL 302
           YHNGQTIPGCQDVD QGKILPVANFSKGNNES   EDSDSDS+SGES +EDDHSPVEN L
Sbjct: 241 YHNGQTIPGCQDVDFQGKILPVANFSKGNNES---EDSDSDSDSGESDNEDDHSPVENRL 300

Query: 303 WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQ 362
           W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKS WER+ W+K QMLQLQEQ
Sbjct: 301 WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSHWERKVWIKKQMLQLQEQ 360

Query: 363 CMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELELK 422
           C +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMKLDNE+RVLQLK+KEMELELK
Sbjct: 361 CNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELELK 418

Query: 423 K 424
           +
Sbjct: 421 R 418

BLAST of Cla97C02G049420 vs. NCBI nr
Match: XP_022957960.1 (uncharacterized protein LOC111459338 [Cucurbita moschata] >XP_022957961.1 uncharacterized protein LOC111459338 [Cucurbita moschata] >XP_022957962.1 uncharacterized protein LOC111459338 [Cucurbita moschata] >XP_022957963.1 uncharacterized protein LOC111459338 [Cucurbita moschata])

HSP 1 Score: 724.5 bits (1869), Expect = 5.1e-205
Identity = 365/422 (86.49%), Positives = 389/422 (92.18%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRR Q+TQL N SLT RHHL MM+T EGDHQS+GI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRHQQTQLINTSLTHRHHLKMMNTLEGDHQSVGIMD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNN-YTSEEDEPSFTEDGECSEFLKGKKGSPWQRMK 122
           TK +G KDL M F+K KAIASG  TNN  TSEEDEPSFTEDGEC+EFLKGKKGSPWQRMK
Sbjct: 61  TKRLGHKDLSMTFTKGKAIASGGVTNNSNTSEEDEPSFTEDGECTEFLKGKKGSPWQRMK 120

Query: 123 WTDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCED 182
           WTD+IVRLLI VVACVGDDGEAG+GSKRKSGILQKKGKWK VSKIM+SKGCHVSPQQCED
Sbjct: 121 WTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCED 180

Query: 183 KFNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMC 242
           KFNDLNKRYKRLNDI+GRGTSCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMC
Sbjct: 181 KFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSSKVKDDVRKILSSKHLFYKEMC 240

Query: 243 AYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENS 302
           AYHNGQTIPGCQDVD QGKILPV NFSKGNNESEEA+DSDSDS+  ES +EDDH P EN 
Sbjct: 241 AYHNGQTIPGCQDVDFQGKILPVVNFSKGNNESEEADDSDSDSD--ESDNEDDHYPEENR 300

Query: 303 LWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQE 362
           LWP+ESRGRDKASADDGPLWS + A+NEFEGQIDVFLSDPTK QWERRDW+K QMLQLQE
Sbjct: 301 LWPAESRGRDKASADDGPLWSITSAQNEFEGQIDVFLSDPTKPQWERRDWIKKQMLQLQE 360

Query: 363 QCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELEL 422
           QC++FQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERMK+DNERRVLQLKQKEMELE 
Sbjct: 361 QCVSFQAQSFELEKQRFKWLRYCSKKSRDLERMRLENERMKIDNERRVLQLKQKEMELEF 420

Query: 423 KK 424
           K+
Sbjct: 421 KR 420

BLAST of Cla97C02G049420 vs. ExPASy TrEMBL
Match: A0A5D3BB81 (Stress response protein nst1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold182G001150 PE=4 SV=1)

HSP 1 Score: 763.5 bits (1970), Expect = 4.8e-217
Identity = 381/421 (90.50%), Positives = 398/421 (94.54%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRRPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKW 122
           +KS+GQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TEDGECSEFLKGKKGSPWQRMKW
Sbjct: 61  SKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKW 120

Query: 123 TDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDK 182
           TDEIVRLLI VVACVGDDGEAG+GSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDK
Sbjct: 121 TDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDK 180

Query: 183 FNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 242
           FNDLNKRYKRLNDI+G+GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA
Sbjct: 181 FNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 240

Query: 243 YHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL 302
           YHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Sbjct: 241 YHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL 300

Query: 303 WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQ 362
           W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQ
Sbjct: 301 WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQ 360

Query: 363 CMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELELK 422
           C +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMKLDNE+RVLQLK+KEMELE K
Sbjct: 361 CNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELESK 420

Query: 423 K 424
           +
Sbjct: 421 R 421

BLAST of Cla97C02G049420 vs. ExPASy TrEMBL
Match: A0A5A7TE21 (Stress response protein nst1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold206G00500 PE=4 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 1.1e-216
Identity = 380/421 (90.26%), Positives = 398/421 (94.54%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRRPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKW 122
           +KS+GQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TEDGECSEFLKGKKGSPWQRMKW
Sbjct: 61  SKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKW 120

Query: 123 TDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDK 182
           TDEIVRLLI VVACVGDDGEAG+GSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDK
Sbjct: 121 TDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDK 180

Query: 183 FNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 242
           FNDLNKRYKRLNDI+G+GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA
Sbjct: 181 FNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 240

Query: 243 YHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL 302
           YHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Sbjct: 241 YHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL 300

Query: 303 WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQ 362
           W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQ
Sbjct: 301 WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQ 360

Query: 363 CMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELELK 422
           C +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMKLDNE+RVLQLK+KEMELE K
Sbjct: 361 CNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELESK 420

Query: 423 K 424
           +
Sbjct: 421 R 421

BLAST of Cla97C02G049420 vs. ExPASy TrEMBL
Match: A0A1S3BM36 (uncharacterized protein LOC103491522 OS=Cucumis melo OX=3656 GN=LOC103491522 PE=4 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 1.1e-216
Identity = 380/421 (90.26%), Positives = 398/421 (94.54%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRRPQKTQL NPSLTQRH LNMMS FEGDHQSIGI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKW 122
           +KS+GQKDLLMAF++ KAIAS C TNNYTSEEDEPS+TEDGECSEFLKGKKGSPWQRMKW
Sbjct: 61  SKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKW 120

Query: 123 TDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDK 182
           TDEIVRLLI VVACVGDDGEAG+GSKRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDK
Sbjct: 121 TDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDK 180

Query: 183 FNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 242
           FNDLNKRYKRLNDI+G+GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA
Sbjct: 181 FNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 240

Query: 243 YHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL 302
           YHNGQTIPGCQDVD QGKILP ANFSKGNNESEEAEDSDSDS+SGES +EDDHSP EN L
Sbjct: 241 YHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL 300

Query: 303 WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQ 362
           W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKSQWER+ W+K QMLQLQEQ
Sbjct: 301 WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQ 360

Query: 363 CMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELELK 422
           C +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMKLDNE+RVLQLK+KEMELE K
Sbjct: 361 CNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELESK 420

Query: 423 K 424
           +
Sbjct: 421 R 421

BLAST of Cla97C02G049420 vs. ExPASy TrEMBL
Match: A0A0A0KX12 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G290850 PE=4 SV=1)

HSP 1 Score: 758.1 bits (1956), Expect = 2.0e-215
Identity = 380/421 (90.26%), Positives = 398/421 (94.54%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRRPQKTQL NPSLTQRH LNMM+ FEGDHQSIGI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMNNFEGDHQSIGILD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQRMKW 122
           +KS+GQKDLLMAF++ KAIASGC TNNYTSEEDEPS+TEDGECSEFLKGKKGSPWQRMKW
Sbjct: 61  SKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKW 120

Query: 123 TDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDK 182
           TDEIVRLLI VVACVGDDGEAG+GSKRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDK
Sbjct: 121 TDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDK 180

Query: 183 FNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 242
           FNDLNKRYKRLNDI+G+GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA
Sbjct: 181 FNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCA 240

Query: 243 YHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENSL 302
           YHNGQTIPGCQDVD QGKILPVANFSKGNNES   EDSDSDS+SGES +EDDHSPVEN L
Sbjct: 241 YHNGQTIPGCQDVDFQGKILPVANFSKGNNES---EDSDSDSDSGESDNEDDHSPVENRL 300

Query: 303 WPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQEQ 362
           W SESRGRDK SADDGPLWSNSV KNEFEGQIDVFLSDPTKS WER+ W+K QMLQLQEQ
Sbjct: 301 WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSHWERKVWIKKQMLQLQEQ 360

Query: 363 CMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELELK 422
           C +FQAQSVELEKQRFKWLRYCSKKNRDLER RLENERMKLDNE+RVLQLK+KEMELELK
Sbjct: 361 CNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELELK 418

Query: 423 K 424
           +
Sbjct: 421 R 418

BLAST of Cla97C02G049420 vs. ExPASy TrEMBL
Match: A0A6J1H0P0 (uncharacterized protein LOC111459338 OS=Cucurbita moschata OX=3662 GN=LOC111459338 PE=4 SV=1)

HSP 1 Score: 724.5 bits (1869), Expect = 2.5e-205
Identity = 365/422 (86.49%), Positives = 389/422 (92.18%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           MDSSGLGGGFLS NGGLLDLESPIRR Q+TQL N SLT RHHL MM+T EGDHQS+GI+D
Sbjct: 1   MDSSGLGGGFLSGNGGLLDLESPIRRHQQTQLINTSLTHRHHLKMMNTLEGDHQSVGIMD 60

Query: 63  TKSVGQKDLLMAFSKRKAIASGCNTNN-YTSEEDEPSFTEDGECSEFLKGKKGSPWQRMK 122
           TK +G KDL M F+K KAIASG  TNN  TSEEDEPSFTEDGEC+EFLKGKKGSPWQRMK
Sbjct: 61  TKRLGHKDLSMTFTKGKAIASGGVTNNSNTSEEDEPSFTEDGECTEFLKGKKGSPWQRMK 120

Query: 123 WTDEIVRLLITVVACVGDDGEAGVGSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCED 182
           WTD+IVRLLI VVACVGDDGEAG+GSKRKSGILQKKGKWK VSKIM+SKGCHVSPQQCED
Sbjct: 121 WTDDIVRLLIAVVACVGDDGEAGMGSKRKSGILQKKGKWKMVSKIMISKGCHVSPQQCED 180

Query: 183 KFNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMC 242
           KFNDLNKRYKRLNDI+GRGTSCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMC
Sbjct: 181 KFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSSKVKDDVRKILSSKHLFYKEMC 240

Query: 243 AYHNGQTIPGCQDVDIQGKILPVANFSKGNNESEEAEDSDSDSESGESASEDDHSPVENS 302
           AYHNGQTIPGCQDVD QGKILPV NFSKGNNESEEA+DSDSDS+  ES +EDDH P EN 
Sbjct: 241 AYHNGQTIPGCQDVDFQGKILPVVNFSKGNNESEEADDSDSDSD--ESDNEDDHYPEENR 300

Query: 303 LWPSESRGRDKASADDGPLWSNSVAKNEFEGQIDVFLSDPTKSQWERRDWVKIQMLQLQE 362
           LWP+ESRGRDKASADDGPLWS + A+NEFEGQIDVFLSDPTK QWERRDW+K QMLQLQE
Sbjct: 301 LWPAESRGRDKASADDGPLWSITSAQNEFEGQIDVFLSDPTKPQWERRDWIKKQMLQLQE 360

Query: 363 QCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMELEL 422
           QC++FQAQS ELEKQRFKWLRYCSKK+RDLER+RLENERMK+DNERRVLQLKQKEMELE 
Sbjct: 361 QCVSFQAQSFELEKQRFKWLRYCSKKSRDLERMRLENERMKIDNERRVLQLKQKEMELEF 420

Query: 423 KK 424
           K+
Sbjct: 421 KR 420

BLAST of Cla97C02G049420 vs. TAIR 10
Match: AT1G21200.1 (sequence-specific DNA binding transcription factors )

HSP 1 Score: 290.0 bits (741), Expect = 3.0e-78
Identity = 167/362 (46.13%), Positives = 240/362 (66.30%), Query Frame = 0

Query: 89  NYTSEEDEPSFTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGV 148
           N  S++DEPSFTE   DG  +E  +  KGSPWQR+KWTD++V+LLIT V+ +GDD     
Sbjct: 83  NSVSDDDEPSFTEEGGDGVHNEANRSTKGSPWQRVKWTDKMVKLLITAVSYIGDDSSIDS 142

Query: 149 GSKRKSGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCRV 208
            S+RK  +LQKKGKWK+VSK+M  +G HVSPQQCEDKFNDLNKRYK+LND++GRGTSC+V
Sbjct: 143 SSRRKFAVLQKKGKWKSVSKVMAERGYHVSPQQCEDKFNDLNKRYKKLNDMLGRGTSCQV 202

Query: 209 VENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVA 268
           VENPAL+DS+ +L+ K KDDVRKI+SSKHLFY+EMC+YHNG  +    D+ +Q + L +A
Sbjct: 203 VENPALLDSIGYLNDKEKDDVRKIMSSKHLFYEEMCSYHNGNRLHLPHDLALQ-RSLQLA 262

Query: 269 NFSKGNNESEEA-----EDSDSDSESGESASEDDHSPVENSLWPSE-----------SRG 328
             S+ +++++++     ED D +   G+    D++     +                 + 
Sbjct: 263 LRSRDDHDNDDSRKHQMEDLDDEDHDGDGDEHDEYEEQHYAYGDCRVNHYGGGGGPLKKI 322

Query: 329 RDKASADDG--PLWSNSVAKNEFE-GQIDVFLSDPTKSQWE-------RRDWVKIQMLQL 388
           R   S +DG  P   NS+  N+    QI    +D  +   E       ++ W++ + LQL
Sbjct: 323 RPSLSHEDGDHPSHVNSLECNKVSLPQIPFSQADVNQGGAESGRAGSVQKQWMESRTLQL 382

Query: 389 QEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERVRLENERMKLDNERRVLQLKQKEMEL 422
           +EQ +  Q + +ELEKQRF+W R+  K++++LER+R+ENERMKL+N+R  L+LKQ+E+ +
Sbjct: 383 EEQKLQIQVELLELEKQRFRWQRFSKKRDQELERMRMENERMKLENDRMGLELKQRELGV 442

BLAST of Cla97C02G049420 vs. TAIR 10
Match: AT3G10040.1 (sequence-specific DNA binding transcription factors )

HSP 1 Score: 235.7 bits (600), Expect = 6.7e-62
Identity = 166/445 (37.30%), Positives = 244/445 (54.83%), Query Frame = 0

Query: 3   MDSSGLGGGFLSANGGLLDLESPIRRPQKTQLFNPSLTQRHHLNMMSTFEGDHQSIGIVD 62
           M+S+ +  GF   +  +L LE P   P      NP  + +       T  GD Q+   + 
Sbjct: 1   MESNVMFSGF---SPRMLSLEMPQNPP------NPQNSIQFQHPHPYTTSGDQQTQPPI- 60

Query: 63  TKSVGQKDLLMAFSKRKAIA----SGCNTNNYTSEEDEPSFTEDGECSEFLKGKKGSPWQ 122
                 K L    SK K ++     GC+  +  S        ED   ++    +K S W 
Sbjct: 61  ------KSLYPYASKPKQMSPISGGGCDDEDRGSGSGSGCNPEDSAGTD--GKRKLSQWH 120

Query: 123 RMKWTDEIVRLLITVVACVGDDGEAG----VGSKRKS----------GILQKKGKWKTVS 182
           RMKWTD +VRLLI  V  +GD  EAG    V +K+K+          G+LQKKGKWK+VS
Sbjct: 121 RMKWTDTMVRLLIMAVFYIGD--EAGLNDPVDAKKKTGGGGGGGGGGGMLQKKGKWKSVS 180

Query: 183 KIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKD 242
           + M+ KG  VSPQQCEDKFNDLNKRYKR+NDI+G+G +CRVVEN  L++SM HL+ K KD
Sbjct: 181 RAMVEKGFSVSPQQCEDKFNDLNKRYKRVNDILGKGIACRVVENQGLLESMDHLTPKLKD 240

Query: 243 DVRKILSSKHLFYKEMCAYHNGQTIPGCQD----------VDIQGKILPVANFSKGNNES 302
           +V+K+L+SKHLF++EMCAYHN     G  D          + I  +     + ++    +
Sbjct: 241 EVKKLLNSKHLFFREMCAYHNSCGHLGGHDQQPPQQNPISIPIPSQQQNCFHAAEAGKMA 300

Query: 303 EEAEDSDSDSESGESASEDDHSPVENSLWPSESRGRDKASADDGPLWSNSVAKNEFEGQI 362
             AE  + + E     +ED  S +E S    E   R K           S A      + 
Sbjct: 301 RIAERVEVEEEVESDMAEDSESEMEES---EEEETRKKRRI--------STAVKRLREEA 360

Query: 363 DVFLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKKNRDLERV 420
              + D  KS WE+++W++ +ML+++E+ + ++ + VE+EKQR KW+RY SKK R++E+ 
Sbjct: 361 ASVVEDVGKSVWEKKEWIRRKMLEIEEKKIGYEWEGVEMEKQRVKWMRYRSKKEREMEKA 414

BLAST of Cla97C02G049420 vs. TAIR 10
Match: AT1G76870.1 (BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1); Has 406 Blast hits to 351 proteins in 76 species: Archae - 0; Bacteria - 2; Metazoa - 137; Fungi - 14; Plants - 127; Viruses - 0; Other Eukaryotes - 126 (source: NCBI BLink). )

HSP 1 Score: 232.6 bits (592), Expect = 5.7e-61
Identity = 136/334 (40.72%), Positives = 213/334 (63.77%), Query Frame = 0

Query: 92  SEEDEPS-FTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLITVVACVGDDGEAGVGSKRK 151
           SE+DE    + DG+     K K+ SPWQR+KW D++V+L+IT ++ +G+D     GS +K
Sbjct: 61  SEDDELCLLSSDGQ----NKSKENSPWQRVKWMDKMVKLMITALSYIGEDS----GSDKK 120

Query: 152 SGILQKKGKWKTVSKIMLSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCRVVENPA 211
             +LQKKGKW++VSK+M  +G HVSPQQCEDKFNDLNKRYK+LN+++GRGTSC VVENP+
Sbjct: 121 FAVLQKKGKWRSVSKVMDERGYHVSPQQCEDKFNDLNKRYKKLNEMLGRGTSCEVVENPS 180

Query: 212 LMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDIQGKILPVANFSKG 271
           L+D + +L+ K KD+VR+I+SSKHLFY+EMC+YHNG  +    D  +Q  +  +   S+ 
Sbjct: 181 LLDKIDYLNEKEKDEVRRIMSSKHLFYEEMCSYHNGNRLHLPHDPAVQRSLHLITLGSRD 240

Query: 272 NNESEEAEDSDSDSESGESASEDDHSPVENSLWPSESRGRDKASADDGPLWSNSVAKNEF 331
           +++++E     ++    +   E+DH    +   P +   + ++  D G            
Sbjct: 241 DHDNDEHGKHQNEDLDDDDDYEEDHDGALSDR-PLKRLRQSQSHEDVGHPNKGYDVPCLP 300

Query: 332 EGQIDV---FLSDPTKSQWERRDWVKIQMLQLQEQCMTFQAQSVELEKQRFKWLRYCSKK 391
             Q DV      D  K+   +R  ++ + L+L+ + +  QA+ +ELE+Q+FKW  +  ++
Sbjct: 301 RSQADVNRGISLDSRKAAGLQRQQIESKSLELEGRKLQIQAEMMELERQQFKWEVFSKRR 360

Query: 392 NRDLERVRLENERMKLDNERRVLQLKQKEMELEL 422
            + L ++R+ENERMKL+NER  L+LK+ E+  +L
Sbjct: 361 EQKLAKMRMENERMKLENERMSLELKRIELGAKL 385

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038901508.1  3.0e-221  92.22         uncharacterized protein LOC120088355 [Benincasa hispida]
TYJ96146.1      9.9e-217  90.50         stress response protein nst1 isoform X1 [Cucumis melo var. makuwa]
XP_008449727.1  2.2e-216  90.26         PREDICTED: uncharacterized protein LOC103491522 [Cucumis melo] >KAA0041403.1 str...
XP_004142119.1  4.2e-215  90.26         uncharacterized protein LOC101205501 [Cucumis sativus] >KGN54170.1 hypothetical ...
XP_022957960.1  5.1e-205  86.49         uncharacterized protein LOC111459338 [Cucurbita moschata] >XP_022957961.1 unchar...

vs. ExPASy TrEMBL
Match Name  E-value   Identity (%)  Description
A0A5D3BB81  4.8e-217  90.50         Stress response protein nst1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7TE21  1.1e-216  90.26         Stress response protein nst1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 G...
A0A1S3BM36  1.1e-216  90.26         uncharacterized protein LOC103491522 OS=Cucumis melo OX=3656 GN=LOC103491522 PE=...
A0A0A0KX12  2.0e-215  90.26         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G290850 PE=4 SV=1
A0A6J1H0P0  2.5e-205  86.49         uncharacterized protein LOC111459338 OS=Cucurbita moschata OX=3662 GN=LOC1114593...

vs. TAIR 10
Match Name   E-value  Identity (%)  Description
AT1G21200.1  3.0e-78  46.13         sequence-specific DNA binding transcription factors
AT3G10040.1  6.7e-62  37.30         sequence-specific DNA binding transcription factors
AT1G76870.1  5.7e-61  40.72         BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transc...

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                     Source       Source Term    Source Description                            Alignment
None       No IPR available                    COILS        Coil           Coil                                          coord: 384..420
None       No IPR available                    COILS        Coil           Coil                                          coord: 356..376
None       No IPR available                    GENE3D       1.10.10.60     -                                             coord: 122..192; e-value: 1.0E-12; score: 49.9
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction                           coord: 267..320
None       No IPR available                    PANTHER      PTHR46327      F16F4.11 PROTEIN-RELATED                      coord: 14..421
None       No IPR available                    PANTHER      PTHR46327:SF9  TRANSCRIPTION FACTOR TRIHELIX FAMILY-RELATED  coord: 14..421
IPR044822  Myb/SANT-like DNA-binding domain 4  PFAM         PF13837        Myb_DNA-bind_4                                coord: 119..242; e-value: 2.9E-19; score: 69.2
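
The `coord: start..end` entries above give residue ranges on the protein. A minimal sketch for pulling them out of the table text and computing feature lengths, assuming the ranges are 1-based and inclusive (an assumption); `feature_spans`, `span_length` and the abbreviated `rows` text are illustrative only:

```python
import re

def feature_spans(text):
    """Return (start, end) pairs for every 'coord: start..end' entry in text."""
    return [(int(a), int(b)) for a, b in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)]

def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Abbreviated copies of three rows from the InterPro table.
rows = (
    "PFAM PF13837 Myb_DNA-bind_4 coord: 119..242\n"
    "COILS Coil Coil coord: 384..420\n"
    "COILS Coil Coil coord: 356..376\n"
)

spans = feature_spans(rows)
for start, end in spans:
    print(start, end, span_length(start, end))
```

Under that assumption the Pfam Myb_DNA-bind_4 match covers 124 residues and the two predicted coiled-coil segments cover 37 and 21 residues.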

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C02G049420.1  Cla97C02G049420.1  mRNA