Clc10G08740 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc10G08740
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Indole-3-glycerol phosphate synthase
Location: ClcChr10: 19330028 .. 19334240 (+)
RNA-Seq Expression: Clc10G08740
Synteny: Clc10G08740
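
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("ClcChr10: 19330028 .. 19334240 (+)")
print(chrom, strand, span_length(start, end))
```

If the coordinates turned out to be 0-based or half-open, the `+ 1` in `span_length` would need to be dropped.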
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCATACCCAATATATGAACTCAGCATTTCAAATACCAAAAATACAAACTCAACTCAAGAGCCCAAAGAATCCCTAACCAATTTCCAATAAAACTAGGAAAAGAGATATGGGTTTAAAAGGTTATATAGAAATTAGAAAATAGAAAAAACCAAAAGGGTAAAGAAAACAGAGGAAGCGGTTTTCCGTTGCCGTTGGATTACCGTCTTCTCTCCTTTTTCGCTCTGTTTAAAAACAGAGCAATCCTTCAATGGAACTTTACACAGCCGACTGCCATGGGTATAGCCGAAGCTTCTCCGACCACCATGGCTCCTCTTCTTCTACGGAATCTAGCCACTTCTCTCTTCGTCTTTTTCGATAAATTCCTCATCAATTTGTCAAAGAAATACAAACTTCTCGAGATTATTCACACCTTACTAATCTCCTCCTTCCTCTTTTTCCTCCGTTTACTTCCTTCCTTGTTCCCTTCCATCCATCCCGTTTCCGATGATCGGTATCCTCTAAAACCCCCAAAAACTGCGAGTTACGTTAGCGGCGGAATTGGAATCGGAAGCAGTAGCGGAAGCGGCGATTTAGGGATTTCTCGTGCCCTAACGCAATTGCTGTCGATTATTAGCCACGTTCCGGTCAGTTCTCGCAAGTATGAAGTGGTTCGATCGTTGGCGGAGAAGCTGATTGATGAGAATCACTGGGAAGGGATTGAGGAACTTCGCGAGGTCAATCGTGCGGTTCTTTCCGCGGCTTTCGATCGGACTATTGGTCAGATTGAGGCCAGGATGATAGAGCGAGGGTTTTTTCAGGACGATAACGAAGGCGGGGACGGTGCTGGTGGTGGTGGGTCGGTGGCTGGACCGATGGAGTTCCGGTTGGGTCGGGTTGTAAGGGCGGTTCGGTTGTTTGGAGAGTCGGCTTGTAGCCGGTTCGGGAGGGTGAAGGAGGGGGCGAACCAGACCGGAAGCTCGGTGGAGAAACTGGCGGCGGAGGTGCTTTGGTTGGCTCAAAAAATGGCGAGTTGCGGTTGTCGGAATGAGGCTTGTAGGCGGTGGGCTTCGGCGGCTCAGTTGGGAAGACTCTCTCTCTCGGCGGAGCCGCGGCTGCAGGCGTCATTGGTTAAAGTCGCAGGTATGACTCCATTTTCTATATATATATTTTTGAGATTTTTTCTTCTTTTTTAATGGCTCGTAAATTTAGCTGCATTATTATATTTTATTTTAATTTTGTTAAAATTAAAAAGAAAAGAAGATTAGTCTAATAAATTCTCATAAATTAAATTTATGAGTGCCAAAGGTTTTATTTACTGCACCTTTGATGGTTTAGACAAATCAAAAAGCATGATTTAGTGGATGCGATATCAATTACTATCTTTTTGATCATGCGGTATCGAATATATGCTAGTTGAATTATGCTTACGTTAATGTAGGTGTGTTTTCTATATATCTTAATATATATTTGAACTTTAATGTGTATTTACCAAAAAAAAAAAGAAAAAAAAAAAGTCATCCGTGTGTATCACGCTATTTAGTCGCAAAAATGCGCATTCAAGGAAGGATCCCCTCCCAATTTGTTTGAGACACAATGTGTCCTAGATTTAGAATGAAATTGTAACTAATTAGTTTAGGATTTGTTTTCTCTTTTCTCAATTTAAAACTGAAGCAACATTGTATTTAGGGGAAAAAAAAACCTTTTACCTTGGAGTAGGCTTATAATTAGAGTGTTTAGCCTTGCGGTTGTGCCTAAACTTTATAAGTAGGGCGTCTCTCAACATTGAATTGGTGGAAAATATTTGGTCATATATATTAGATTTGTGTAACTAGGAGTGGAGTTTATATATGATAGGTGAGTTTTCTTACCTATAAAATCCACCTTTTATATTGCATAGTAGATTTTATAAAAGTGCCTTATAATTAACTTGGATCACAAAGTTAGGCATTTGAATCTGTTAGAATGTTTATAATATTTGTAATATATTTGAATTTCCAAATTATCGTGGCCCACAAGTAGA
GCTGCCTTCCTTTTAAATGAAAGTGAAAGCCTGGTGTGAAGTAATTGTCCCAAATTCCATGGAATATGCGATTTGAGAATTAGTTATCTAAATTGAATAATTCTACAACCTTTTCACTAATTTTTTTTTTTGTCTCTTCTTATCGTGTTTTACTTTTTATTATTTTTTCTTTTTAAAGAAAATTTATATTTTCATATAAGCTTGAACCAACTTGACGGCAAGATTTTTGTCTCTTTTTCCAATAAATAAATAAATAAAAAGTTATTTTTGTCTTTTAATGTTTGTCTCAATACTTTAAGTAAGCCTTTGGAATTGTGTAGCTTGTGTTTGGAATATACTGGTGAATGAATTACGTGGACTATCAGAAGTCCCGTTTTGATAACAGATTTTTGGGCAAGTTTTTGAAAATTAAATGTTTTTTTTTTTAAAATCTATTGTTTAGGAATGTTTTAAAAAACTAACTCTATTTTTTTTATGTATTATTTATTTATTAGTATTTTGACTTAGAATTCAAATGTTTAGTTGTTGTATTTTCAAATGTCCAATATTAATTCCTATATTTTCAATAAGTTTTAAATTGGGTCTCTATTAGTTTGTTTTTTTACTTTTTAAAAAAATTTAAATATTTATTAACATTTTCCTATAAATTTTGGAAATATATACTTATATTGTATTTTATTGCATGAAAATTATTGTTATTATTTAGCAAATTTCGACAAAAATTTAACTTCAAGAAATTAAACTTAAGATTTATTTAAAGTAAAAAAACAAGGTGAACAATCCCAAAATAAATATAAAGTTGAAAATGATATTTGAAAAAGTAATCTAATAATTGCCAAACAGCCTAGATAAGAGAATAAGTCACTGTTGAATGGCTTCCTATCTGTCTCACTTCCACCCAGGAATAAAATTATACATTTTGGGATTGGAGTAAAAGTGGGGTTGAGGGCAATTTGATGTATATTTTTTCAAAAACTTCTTGGTCATACTGTTTTGGTTCAGTTACCATTTATTTTATGAGTAAAAGACAGAAATAGAATCAATTCTTGGTTAAGCCTCCTAATGCAGAAATCCATAGGGAAATCATTGAGACTTCTAATGATAAGCATCAAAAACAATATTTACATTAAATTTAGGAATAAGAATAAAGTATTTGTAAAGTGAGAAAACTCAGTATGAAAATTGAAAGCTTAAAAAAGAGAAGTCCAATTTGTTTCTATTATTATTTATTTTAGAGAAAGGAAAGGGCTATGGTAGAAAAATAACCATTCAAATTATTTTTGTTTAAAAGTTTATGTTTACCATGTAGTAAATTTTAATTGGACAATTAGGTGGATTGTACAAAAATAGGTGCCATCTGAGAAGAACCAAAGTATTTTTCTTAAGGAAAAAGATCAAAAAAATGAAGAGACCTTTCTATATCTAATCATTGTAATATATGATTGTATGTGTAGCATTCTTGTTGAAGCAATGCCGAGAAATGGGAAAGGATAAAGATGGAGACGAGAGTGAGAAACAGCAGCAGATGCAGACGAAGTTGAAGATGCTGATTTCATGGCTTCCATTGCTATGCAGAGGCAGCAATGGGACCGATGCGCCCGTCCTAAGCATTGGTGAACGGCGAGAGCTGGAGTTGGTGCTAGAGGAGATGATAGGGACATTGCAACAAGACCAACAAGAGCAAGTTTTGGCTTTGTGGCTCCATCATTTCACTTACTCCTCCTCCTCCGACTGGCCGAACCTCCACGCTTCGTATGCTCGGTGGTACAGTGCCTCTCGTAAGCTCTTGATCCACCAGGATCAGTAATCTAACCTTTAAAACAAAAACATATAACCTTTTGTTAGAAAAGCTGTTAGTATTTGGTAGAAGTTTCTAATGCCATTTGAAAGTCATATATTGTATTTCACTTGCTAGGAGGTGTGATAAATGTGGCCAAACCTAAGTATAGTTCAGCTCGCTAAGACATATATCATTCATCAAAAGGTGAATCTCCTATCAC
CCCTCCCTTAACTCATTGGGTAACAAATTTTCGGTAGTTGTTTTTTGTTTTTGAATATGATTGAAGTTATTCCTATGACCTTATGGACTTAGGAAATTGTGACTTCAAGTGCCAAGATAATCTTAATGAAGTTACATGTACGATGATGCCTTAGGCAGCCGAGCACAACTTTCTGATTATATAGGCATGTTTGAGAGTGATTTTGAAATGATG

mRNA sequence

CTCATACCCAATATATGAACTCAGCATTTCAAATACCAAAAATACAAACTCAACTCAAGAGCCCAAAGAATCCCTAACCAATTTCCAATAAAACTAGGAAAAGAGATATGGGTTTAAAAGGTTATATAGAAATTAGAAAATAGAAAAAACCAAAAGGGTAAAGAAAACAGAGGAAGCGGTTTTCCGTTGCCGTTGGATTACCGTCTTCTCTCCTTTTTCGCTCTGTTTAAAAACAGAGCAATCCTTCAATGGAACTTTACACAGCCGACTGCCATGGGTATAGCCGAAGCTTCTCCGACCACCATGGCTCCTCTTCTTCTACGGAATCTAGCCACTTCTCTCTTCGTCTTTTTCGATAAATTCCTCATCAATTTGTCAAAGAAATACAAACTTCTCGAGATTATTCACACCTTACTAATCTCCTCCTTCCTCTTTTTCCTCCGTTTACTTCCTTCCTTGTTCCCTTCCATCCATCCCGTTTCCGATGATCGGTATCCTCTAAAACCCCCAAAAACTGCGAGTTACGTTAGCGGCGGAATTGGAATCGGAAGCAGTAGCGGAAGCGGCGATTTAGGGATTTCTCGTGCCCTAACGCAATTGCTGTCGATTATTAGCCACGTTCCGGTCAGTTCTCGCAAGTATGAAGTGGTTCGATCGTTGGCGGAGAAGCTGATTGATGAGAATCACTGGGAAGGGATTGAGGAACTTCGCGAGGTCAATCGTGCGGTTCTTTCCGCGGCTTTCGATCGGACTATTGGTCAGATTGAGGCCAGGATGATAGAGCGAGGGTTTTTTCAGGACGATAACGAAGGCGGGGACGGTGCTGGTGGTGGTGGGTCGGTGGCTGGACCGATGGAGTTCCGGTTGGGTCGGGTTGTAAGGGCGGTTCGGTTGTTTGGAGAGTCGGCTTGTAGCCGGTTCGGGAGGGTGAAGGAGGGGGCGAACCAGACCGGAAGCTCGGTGGAGAAACTGGCGGCGGAGGTGCTTTGGTTGGCTCAAAAAATGGCGAGTTGCGGTTGTCGGAATGAGGCTTGTAGGCGGTGGGCTTCGGCGGCTCAGTTGGGAAGACTCTCTCTCTCGGCGGAGCCGCGGCTGCAGGCGTCATTGGTTAAAGTCGCAGCATTCTTGTTGAAGCAATGCCGAGAAATGGGAAAGGATAAAGATGGAGACGAGAGTGAGAAACAGCAGCAGATGCAGACGAAGTTGAAGATGCTGATTTCATGGCTTCCATTGCTATGCAGAGGCAGCAATGGGACCGATGCGCCCGTCCTAAGCATTGGTGAACGGCGAGAGCTGGAGTTGGTGCTAGAGGAGATGATAGGGACATTGCAACAAGACCAACAAGAGCAAGTTTTGGCTTTGTGGCTCCATCATTTCACTTACTCCTCCTCCTCCGACTGGCCGAACCTCCACGCTTCGTATGCTCGGTGGTACAGTGCCTCTCGTAAGCTCTTGATCCACCAGGATCAGTAATCTAACCTTTAAAACAAAAACATATAACCTTTTGTTAGAAAAGCTGTTAGTATTTGGTAGAAGTTTCTAATGCCATTTGAAAGTCATATATTGTATTTCACTTGCTAGGAGGTGTGATAAATGTGGCCAAACCTAAGTATAGTTCAGCTCGCTAAGACATATATCATTCATCAAAAGGTGAATCTCCTATCACCCCTCCCTTAACTCATTGGGTAACAAATTTTCGGTAGTTGTTTTTTGTTTTTGAATATGATTGAAGTTATTCCTATGACCTTATGGACTTAGGAAATTGTGACTTCAAGTGCCAAGATAATCTTAATGAAGTTACATGTACGATGATGCCTTAGGCAGCCGAGCACAACTTTCTGATTATATAGGCATGTTTGAGAGTGATTTTGAAATGATG

Coding sequence (CDS)

ATGGGTATAGCCGAAGCTTCTCCGACCACCATGGCTCCTCTTCTTCTACGGAATCTAGCCACTTCTCTCTTCGTCTTTTTCGATAAATTCCTCATCAATTTGTCAAAGAAATACAAACTTCTCGAGATTATTCACACCTTACTAATCTCCTCCTTCCTCTTTTTCCTCCGTTTACTTCCTTCCTTGTTCCCTTCCATCCATCCCGTTTCCGATGATCGGTATCCTCTAAAACCCCCAAAAACTGCGAGTTACGTTAGCGGCGGAATTGGAATCGGAAGCAGTAGCGGAAGCGGCGATTTAGGGATTTCTCGTGCCCTAACGCAATTGCTGTCGATTATTAGCCACGTTCCGGTCAGTTCTCGCAAGTATGAAGTGGTTCGATCGTTGGCGGAGAAGCTGATTGATGAGAATCACTGGGAAGGGATTGAGGAACTTCGCGAGGTCAATCGTGCGGTTCTTTCCGCGGCTTTCGATCGGACTATTGGTCAGATTGAGGCCAGGATGATAGAGCGAGGGTTTTTTCAGGACGATAACGAAGGCGGGGACGGTGCTGGTGGTGGTGGGTCGGTGGCTGGACCGATGGAGTTCCGGTTGGGTCGGGTTGTAAGGGCGGTTCGGTTGTTTGGAGAGTCGGCTTGTAGCCGGTTCGGGAGGGTGAAGGAGGGGGCGAACCAGACCGGAAGCTCGGTGGAGAAACTGGCGGCGGAGGTGCTTTGGTTGGCTCAAAAAATGGCGAGTTGCGGTTGTCGGAATGAGGCTTGTAGGCGGTGGGCTTCGGCGGCTCAGTTGGGAAGACTCTCTCTCTCGGCGGAGCCGCGGCTGCAGGCGTCATTGGTTAAAGTCGCAGCATTCTTGTTGAAGCAATGCCGAGAAATGGGAAAGGATAAAGATGGAGACGAGAGTGAGAAACAGCAGCAGATGCAGACGAAGTTGAAGATGCTGATTTCATGGCTTCCATTGCTATGCAGAGGCAGCAATGGGACCGATGCGCCCGTCCTAAGCATTGGTGAACGGCGAGAGCTGGAGTTGGTGCTAGAGGAGATGATAGGGACATTGCAACAAGACCAACAAGAGCAAGTTTTGGCTTTGTGGCTCCATCATTTCACTTACTCCTCCTCCTCCGACTGGCCGAACCTCCACGCTTCGTATGCTCGGTGGTACAGTGCCTCTCGTAAGCTCTTGATCCACCAGGATCAGTAA

Protein sequence

MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLPSLFPSIHPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSSRKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEGGDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWLAQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDESEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQVLALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQDQ
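
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), which the page does not name explicitly, and the function name `translate` is hypothetical. Only the first and last codons of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGGTATAGCCGAAGCTTCTCCGACCACC"))
print(translate("CACCAGGATCAGTAA"))
```

Passing the full CDS string to `translate` should reproduce the full protein sequence if the standard-code assumption holds for this annotation.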
Homology
BLAST of Clc10G08740 vs. NCBI nr
Match: XP_038876704.1 (uncharacterized protein LOC120069090 [Benincasa hispida])

HSP 1 Score: 709.5 bits (1830), Expect = 1.6e-200
Identity = 371/399 (92.98%), Positives = 380/399 (95.24%), Query Frame = 0

Query: 1   MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60
           MGIAEASPTTMAPLLLRNLATSLFVF DKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP
Sbjct: 1   MGIAEASPTTMAPLLLRNLATSLFVFADKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60

Query: 61  SLFPSIHPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSS 120
           SLFPSIHPVSDDRYPLKPPKT SY SGGI +G+ SGSGDLGISRALTQLLSIISHVPVSS
Sbjct: 61  SLFPSIHPVSDDRYPLKPPKTGSYGSGGIAVGNGSGSGDLGISRALTQLLSIISHVPVSS 120

Query: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEG 180
           RKYEVVRSLAEKLIDENH EGIEELREVNR VLSAAF RTIGQIEA MIERGF QDDN+G
Sbjct: 121 RKYEVVRSLAEKLIDENHREGIEELREVNRVVLSAAFGRTIGQIEAGMIERGFCQDDNDG 180

Query: 181 GDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWL 240
           G   GGGGSV GP+EF LG+VVRAVRL GESACSRFGRVKEGANQTGSS+EKLAAEVLWL
Sbjct: 181 G---GGGGSVGGPVEFGLGQVVRAVRLLGESACSRFGRVKEGANQTGSSMEKLAAEVLWL 240

Query: 241 AQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDE 300
           AQKMASCGCRNE CRRWASAAQLGRLSLSAEPRLQASLVKVAAFL KQCREMGKD+DG+E
Sbjct: 241 AQKMASCGCRNEVCRRWASAAQLGRLSLSAEPRLQASLVKVAAFLFKQCREMGKDEDGEE 300

Query: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQV 360
           SEKQQQMQTKLKMLISWLPLLCRGSNGTD P+LSIGERRELELVLEEMIGTLQQDQQEQV
Sbjct: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDVPILSIGERRELELVLEEMIGTLQQDQQEQV 360

Query: 361 LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQDQ 400
           LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQDQ
Sbjct: 361 LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQDQ 396
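
The percentages on the Identity line are the ratio of matching columns to aligned columns. The sketch below illustrates that arithmetic; the helper names `percent` and `count_identities` are hypothetical, and the gap handling (columns containing `-` are never counted as identical but do count toward alignment length) is an assumption about how the figures were produced.

```python
def percent(matches, aligned_length):
    """Matches as a percentage of aligned columns, rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)

def count_identities(query, subject):
    """Count columns where both aligned strings carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

# Figures reported for the XP_038876704.1 hit above.
print(percent(371, 399))

# Short excerpts from the same alignment: one substitution, then a 3-column gap.
print(count_identities("FVFFDKF", "FVFADKF"))
print(count_identities("GDGAGGGG", "G---GGGG"))
```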

BLAST of Clc10G08740 vs. NCBI nr
Match: XP_008462861.1 (PREDICTED: uncharacterized protein LOC103501143 [Cucumis melo])

HSP 1 Score: 661.4 bits (1705), Expect = 5.0e-186
Identity = 349/399 (87.47%), Positives = 368/399 (92.23%), Query Frame = 0

Query: 1   MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60
           MGIAEASPTT+APLLLRNLATSLFVF DK LINL+KKYK+L+IIH L+ISSFLFFLRLLP
Sbjct: 1   MGIAEASPTTLAPLLLRNLATSLFVFADKSLINLAKKYKILQIIHALIISSFLFFLRLLP 60

Query: 61  SLFPSIHPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSS 120
           SLFPSIH VSDDRYPLKPPK  SY +GGIG    SGSGDLG+SRALTQLLSIISHVPVSS
Sbjct: 61  SLFPSIHSVSDDRYPLKPPKGGSYGTGGIG----SGSGDLGVSRALTQLLSIISHVPVSS 120

Query: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEG 180
           RKYEVVRSLAEKLIDENHWEGIEELREVNR VLSAAFDRTIG IEA MIERGF Q+DN+G
Sbjct: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRVVLSAAFDRTIGLIEAGMIERGFCQEDNDG 180

Query: 181 GDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWL 240
           GDG GGGGS+ GP+EF LGRVVRAVRL GESACSRFGR KE  NQ+GSSVEKLAAE+LWL
Sbjct: 181 GDG-GGGGSLGGPVEFGLGRVVRAVRLLGESACSRFGREKEVGNQSGSSVEKLAAEMLWL 240

Query: 241 AQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDE 300
           AQKMASCG  NE C RWASAAQLGRLSLSAEPRLQASLVKVA FL KQCREMGKD+DG+E
Sbjct: 241 AQKMASCGYWNEVCGRWASAAQLGRLSLSAEPRLQASLVKVAVFLFKQCREMGKDEDGEE 300

Query: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQV 360
           SEKQQQMQTKLKMLISWLPLLCRGSNGTDAP+LSIGERRELEL LEEMIGTLQQD+QEQV
Sbjct: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPILSIGERRELELGLEEMIGTLQQDEQEQV 360

Query: 361 LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQDQ 400
           LALWLH+FTYSS SDWPNLHASYARWYSASRKLLI +DQ
Sbjct: 361 LALWLHNFTYSSLSDWPNLHASYARWYSASRKLLIDRDQ 394

BLAST of Clc10G08740 vs. NCBI nr
Match: XP_004137277.1 (uncharacterized protein LOC101222931 [Cucumis sativus] >KGN53714.1 hypothetical protein Csa_014677 [Cucumis sativus])

HSP 1 Score: 634.0 bits (1634), Expect = 8.6e-178
Identity = 336/399 (84.21%), Positives = 356/399 (89.22%), Query Frame = 0

Query: 1   MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60
           MGIAEASPTT+APLLLRNLATSLFVF DK LINLSKKYKLL++IH L+ISSFLFFLRLLP
Sbjct: 1   MGIAEASPTTLAPLLLRNLATSLFVFADKSLINLSKKYKLLQLIHALIISSFLFFLRLLP 60

Query: 61  SLFPSIHPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSS 120
           SLFPSIH VSDD YPLK PK  SY +G        GSGDLG+SRALTQLLSIISH+PVSS
Sbjct: 61  SLFPSIHTVSDDCYPLKSPKDGSYGTG--------GSGDLGVSRALTQLLSIISHIPVSS 120

Query: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEG 180
           RKYEVVRSLAEKLIDENHWEGIEELREVNR VLS AFDR+IG IEA MIERGF Q+DN+G
Sbjct: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRVVLSTAFDRSIGLIEAGMIERGFCQEDNDG 180

Query: 181 GDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWL 240
            +G GGGGSV GP+EF LGRVVRAVR  GESACSRFGRV+E  NQ+GSSVEKLAAEVLWL
Sbjct: 181 ENG-GGGGSVGGPVEFGLGRVVRAVRFLGESACSRFGRVREVGNQSGSSVEKLAAEVLWL 240

Query: 241 AQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDE 300
           AQKM SCG  NE C RWASA QLGRLSLSAEPRLQASLVKVA FL KQCREMGKD+D +E
Sbjct: 241 AQKMVSCGFGNEVCGRWASATQLGRLSLSAEPRLQASLVKVAVFLFKQCREMGKDEDEEE 300

Query: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQV 360
           S KQQQMQ KLKMLISWLPLLCRGS+GTDAP+LSIGERRELEL LEEMIGTLQQD+QEQV
Sbjct: 301 SVKQQQMQMKLKMLISWLPLLCRGSSGTDAPILSIGERRELELGLEEMIGTLQQDEQEQV 360

Query: 361 LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQDQ 400
           LALWLH+FTY SSSDWPNLHASYARWYSASRKLLIHQDQ
Sbjct: 361 LALWLHNFTYLSSSDWPNLHASYARWYSASRKLLIHQDQ 390

BLAST of Clc10G08740 vs. NCBI nr
Match: XP_023551926.1 (uncharacterized protein LOC111809753 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 576.6 bits (1485), Expect = 1.6e-160
Identity = 303/387 (78.29%), Positives = 333/387 (86.05%), Query Frame = 0

Query: 11  MAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLPSLFPSIHPVS 70
           MAP+LLR+LA+SLFV  DK  INLSKKYKLLEIIHTLL+S FLFFLRLLPS F SIH V 
Sbjct: 1   MAPVLLRSLASSLFVCADKSFINLSKKYKLLEIIHTLLVSFFLFFLRLLPSFFSSIHLVP 60

Query: 71  DDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSSRKYEVVRSLA 130
            DR+PLKP K+  Y  GG G    SG GDLGISRALTQLLSIIS +PVSSRKYEVVRSLA
Sbjct: 61  GDRFPLKPSKSVCYGGGGNG----SGGGDLGISRALTQLLSIISQIPVSSRKYEVVRSLA 120

Query: 131 EKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEGGDGAGGGGSV 190
           EKLIDENHWEGIEELR+VNRAVLS AFDRTIGQIEARM++ GF QDD+  G   GGGGSV
Sbjct: 121 EKLIDENHWEGIEELRDVNRAVLSTAFDRTIGQIEARMLDLGFLQDDDVEG---GGGGSV 180

Query: 191 AGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWLAQKMASCGCR 250
            GP EFRLG++VRAVRL GESA SR GRVKE ANQT  S EKLAAE LWLA+KMASCGCR
Sbjct: 181 NGPAEFRLGQIVRAVRLLGESAYSRLGRVKEAANQTRISAEKLAAEALWLAEKMASCGCR 240

Query: 251 NEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDESEKQQQMQTK 310
           +EACRRWASA QLGRLSL+AEPRLQ SLVK+AAF+ KQCREMGK+++ +   +++QMQTK
Sbjct: 241 SEACRRWASAGQLGRLSLAAEPRLQGSLVKLAAFMFKQCREMGKEEEAESERRRRQMQTK 300

Query: 311 LKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQVLALWLHHFTY 370
           LKML SWLPLLCRG NGTDAP+LSIGERRELE  LEEMI TLQQD QEQVLALWLHHFTY
Sbjct: 301 LKMLNSWLPLLCRGINGTDAPILSIGERRELESALEEMIATLQQDDQEQVLALWLHHFTY 360

Query: 371 SSSSDWPNLHASYARWYSASRKLLIHQ 398
           SSSSDWP+LHASYARWY+ASRKL +H+
Sbjct: 361 SSSSDWPDLHASYARWYTASRKLFVHR 380

BLAST of Clc10G08740 vs. NCBI nr
Match: KAG7014876.1 (hypothetical protein SDJN02_22506, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 574.7 bits (1480), Expect = 6.2e-160
Identity = 300/387 (77.52%), Positives = 333/387 (86.05%), Query Frame = 0

Query: 11  MAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLPSLFPSIHPVS 70
           MAPLL+R+LA+SLFVF DK  INLSKKY LLEIIHTLL+SSFLFFLRLLPS F SIH VS
Sbjct: 1   MAPLLVRSLASSLFVFADKSFINLSKKYILLEIIHTLLVSSFLFFLRLLPSFFSSIHLVS 60

Query: 71  DDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSSRKYEVVRSLA 130
            DR+PLKP K+  Y  GG G    SG GDLGISRALTQLLSIISH+PVSSRKYEVVRSLA
Sbjct: 61  GDRFPLKPSKSVCYGGGGNG----SGGGDLGISRALTQLLSIISHIPVSSRKYEVVRSLA 120

Query: 131 EKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEGGDGAGGGGSV 190
           EKLIDENHWEGIEELREVNR VLS AF+RTIGQIEARM++ GF QDD++ G   GGGGSV
Sbjct: 121 EKLIDENHWEGIEELREVNRTVLSTAFNRTIGQIEARMLDLGFLQDDDDEG---GGGGSV 180

Query: 191 AGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWLAQKMASCGCR 250
            GP+EFRLG++VRAVR  GESA SR GRVKEGANQT  S EKLAAE LWLA+KMASCGCR
Sbjct: 181 NGPVEFRLGQIVRAVRFLGESAYSRLGRVKEGANQTRISAEKLAAEALWLAEKMASCGCR 240

Query: 251 NEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDESEKQQQMQTK 310
           +EACRRWASA QLGRLSL+AEP+LQ SLVK+AAF+ KQCREMGK+++ +   +++QMQTK
Sbjct: 241 SEACRRWASADQLGRLSLAAEPQLQGSLVKLAAFMFKQCREMGKEEEAESERRRRQMQTK 300

Query: 311 LKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQVLALWLHHFTY 370
           LKML SWLPLLCRG NGTDAP+LS GERRE+E  LEEMI TLQQD QEQVLALWLHHFTY
Sbjct: 301 LKMLNSWLPLLCRGINGTDAPILSTGERREVESALEEMIATLQQDDQEQVLALWLHHFTY 360

Query: 371 SSSSDWPNLHASYARWYSASRKLLIHQ 398
           S SSDWP LHASY RWY+ASRKL +H+
Sbjct: 361 SPSSDWPELHASYTRWYTASRKLSVHR 380

BLAST of Clc10G08740 vs. ExPASy TrEMBL
Match: A0A1S3CHW5 (uncharacterized protein LOC103501143 OS=Cucumis melo OX=3656 GN=LOC103501143 PE=4 SV=1)

HSP 1 Score: 661.4 bits (1705), Expect = 2.4e-186
Identity = 349/399 (87.47%), Positives = 368/399 (92.23%), Query Frame = 0

Query: 1   MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60
           MGIAEASPTT+APLLLRNLATSLFVF DK LINL+KKYK+L+IIH L+ISSFLFFLRLLP
Sbjct: 1   MGIAEASPTTLAPLLLRNLATSLFVFADKSLINLAKKYKILQIIHALIISSFLFFLRLLP 60

Query: 61  SLFPSIHPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSS 120
           SLFPSIH VSDDRYPLKPPK  SY +GGIG    SGSGDLG+SRALTQLLSIISHVPVSS
Sbjct: 61  SLFPSIHSVSDDRYPLKPPKGGSYGTGGIG----SGSGDLGVSRALTQLLSIISHVPVSS 120

Query: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEG 180
           RKYEVVRSLAEKLIDENHWEGIEELREVNR VLSAAFDRTIG IEA MIERGF Q+DN+G
Sbjct: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRVVLSAAFDRTIGLIEAGMIERGFCQEDNDG 180

Query: 181 GDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWL 240
           GDG GGGGS+ GP+EF LGRVVRAVRL GESACSRFGR KE  NQ+GSSVEKLAAE+LWL
Sbjct: 181 GDG-GGGGSLGGPVEFGLGRVVRAVRLLGESACSRFGREKEVGNQSGSSVEKLAAEMLWL 240

Query: 241 AQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDE 300
           AQKMASCG  NE C RWASAAQLGRLSLSAEPRLQASLVKVA FL KQCREMGKD+DG+E
Sbjct: 241 AQKMASCGYWNEVCGRWASAAQLGRLSLSAEPRLQASLVKVAVFLFKQCREMGKDEDGEE 300

Query: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQV 360
           SEKQQQMQTKLKMLISWLPLLCRGSNGTDAP+LSIGERRELEL LEEMIGTLQQD+QEQV
Sbjct: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPILSIGERRELELGLEEMIGTLQQDEQEQV 360

Query: 361 LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQDQ 400
           LALWLH+FTYSS SDWPNLHASYARWYSASRKLLI +DQ
Sbjct: 361 LALWLHNFTYSSLSDWPNLHASYARWYSASRKLLIDRDQ 394

BLAST of Clc10G08740 vs. ExPASy TrEMBL
Match: A0A0A0KXY7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G109030 PE=4 SV=1)

HSP 1 Score: 634.0 bits (1634), Expect = 4.1e-178
Identity = 336/399 (84.21%), Positives = 356/399 (89.22%), Query Frame = 0

Query: 1   MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60
           MGIAEASPTT+APLLLRNLATSLFVF DK LINLSKKYKLL++IH L+ISSFLFFLRLLP
Sbjct: 1   MGIAEASPTTLAPLLLRNLATSLFVFADKSLINLSKKYKLLQLIHALIISSFLFFLRLLP 60

Query: 61  SLFPSIHPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSS 120
           SLFPSIH VSDD YPLK PK  SY +G        GSGDLG+SRALTQLLSIISH+PVSS
Sbjct: 61  SLFPSIHTVSDDCYPLKSPKDGSYGTG--------GSGDLGVSRALTQLLSIISHIPVSS 120

Query: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEG 180
           RKYEVVRSLAEKLIDENHWEGIEELREVNR VLS AFDR+IG IEA MIERGF Q+DN+G
Sbjct: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRVVLSTAFDRSIGLIEAGMIERGFCQEDNDG 180

Query: 181 GDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWL 240
            +G GGGGSV GP+EF LGRVVRAVR  GESACSRFGRV+E  NQ+GSSVEKLAAEVLWL
Sbjct: 181 ENG-GGGGSVGGPVEFGLGRVVRAVRFLGESACSRFGRVREVGNQSGSSVEKLAAEVLWL 240

Query: 241 AQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDE 300
           AQKM SCG  NE C RWASA QLGRLSLSAEPRLQASLVKVA FL KQCREMGKD+D +E
Sbjct: 241 AQKMVSCGFGNEVCGRWASATQLGRLSLSAEPRLQASLVKVAVFLFKQCREMGKDEDEEE 300

Query: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQV 360
           S KQQQMQ KLKMLISWLPLLCRGS+GTDAP+LSIGERRELEL LEEMIGTLQQD+QEQV
Sbjct: 301 SVKQQQMQMKLKMLISWLPLLCRGSSGTDAPILSIGERRELELGLEEMIGTLQQDEQEQV 360

Query: 361 LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQDQ 400
           LALWLH+FTY SSSDWPNLHASYARWYSASRKLLIHQDQ
Sbjct: 361 LALWLHNFTYLSSSDWPNLHASYARWYSASRKLLIHQDQ 390

BLAST of Clc10G08740 vs. ExPASy TrEMBL
Match: A0A6J1FQP8 (uncharacterized protein LOC111447583 OS=Cucurbita moschata OX=3662 GN=LOC111447583 PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 9.9e-156
Identity = 305/397 (76.83%), Positives = 332/397 (83.63%), Query Frame = 0

Query: 1   MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60
           MGIAEASP TMAPLLLRNL TSLF F DKFLINLSKK+KLLE+IH L +S F FFLR LP
Sbjct: 1   MGIAEASPITMAPLLLRNLLTSLFGFADKFLINLSKKHKLLEVIHCLSVSFFHFFLRWLP 60

Query: 61  SLFPSIHPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSS 120
           SLFPSIH VSDDRY LKPPK  SY + G      SGSGDLG+SRALTQLLSIISHV VSS
Sbjct: 61  SLFPSIHQVSDDRYSLKPPKGGSYGTSG------SGSGDLGVSRALTQLLSIISHVQVSS 120

Query: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEG 180
           RKYEVVRSLAEKLIDENH EGIEEL EVNRAVLS AFDRTI QIEA M+ +GF  DD+E 
Sbjct: 121 RKYEVVRSLAEKLIDENHREGIEELHEVNRAVLSTAFDRTITQIEAAMLHQGFHNDDDED 180

Query: 181 GDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWL 240
            DG     + +GP+EF L RVVRAV       CSR G VK+GAN+TGSS EKLAAE+LWL
Sbjct: 181 EDGE----TSSGPVEFWLARVVRAV-------CSRLGSVKDGANRTGSSAEKLAAELLWL 240

Query: 241 AQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDE 300
           A KMASCGC  EAC+RWASAAQLGRLSLSAEPRLQ SLV+VAAF+ KQ REMGKD++ DE
Sbjct: 241 AGKMASCGCGIEACQRWASAAQLGRLSLSAEPRLQGSLVRVAAFMFKQSREMGKDEE-DE 300

Query: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQV 360
            E ++  QTKL+MLISWLPLLCRGSNGTDAPVLSIGERRE+ELVL EMIGTLQ D+QEQV
Sbjct: 301 EESEKHAQTKLQMLISWLPLLCRGSNGTDAPVLSIGERREVELVLGEMIGTLQGDEQEQV 360

Query: 361 LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQ 398
           LA+WLHHFTYS+SSDWPNLHASYA WYSASR L+IHQ
Sbjct: 361 LAIWLHHFTYSASSDWPNLHASYAHWYSASRNLIIHQ 379

BLAST of Clc10G08740 vs. ExPASy TrEMBL
Match: A0A6J1ISQ7 (uncharacterized protein LOC111478071 OS=Cucurbita maxima OX=3661 GN=LOC111478071 PE=4 SV=1)

HSP 1 Score: 551.2 bits (1419), Expect = 3.5e-153
Identity = 301/397 (75.82%), Positives = 329/397 (82.87%), Query Frame = 0

Query: 1   MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60
           MGIAEASP TMAPLLLRNL TSLF F DKFLI+LSKK+KLLE+IH L +S FLFFLR LP
Sbjct: 1   MGIAEASPITMAPLLLRNLLTSLFGFADKFLISLSKKHKLLEVIHCLSVSFFLFFLRWLP 60

Query: 61  SLFPSIHPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSS 120
             FP+IH VSDDRYPLK PK  SY +     GS SGSGDLGISRALTQLLSIISHV +SS
Sbjct: 61  PFFPTIHQVSDDRYPLKSPKGGSYGTS----GSGSGSGDLGISRALTQLLSIISHVQISS 120

Query: 121 RKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEG 180
           RKYEVVRSLAEKLIDENH EGIEELREVNRAVLS AFDRTI QIEA M+ +GF  DD+E 
Sbjct: 121 RKYEVVRSLAEKLIDENHREGIEELREVNRAVLSTAFDRTIAQIEAAMLHQGFRNDDDED 180

Query: 181 GDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWL 240
            DG     + +GP+EF L RVVRAV       CSR G VK+GAN+TGSS EKLAAE+LWL
Sbjct: 181 EDGE----TSSGPVEFWLARVVRAV-------CSRLGSVKDGANRTGSSAEKLAAELLWL 240

Query: 241 AQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDE 300
           A KMASCGC  EAC+RWASAAQLGRLSLSAEPRLQ SLV+VAAF+ KQ REMGK      
Sbjct: 241 AGKMASCGCGIEACQRWASAAQLGRLSLSAEPRLQGSLVRVAAFMFKQSREMGK------ 300

Query: 301 SEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQV 360
            E ++  QTKL+MLISWLPLLCRGSNGTDAPVLSIGERRE+ELVL EMIGTLQ+D+QEQV
Sbjct: 301 -ESEKHAQTKLQMLISWLPLLCRGSNGTDAPVLSIGERREVELVLGEMIGTLQRDEQEQV 360

Query: 361 LALWLHHFTYSSSSDWPNLHASYARWYSASRKLLIHQ 398
           LA+WLHHFTYS+SSDWPNLHASYA WYSASR L+IHQ
Sbjct: 361 LAMWLHHFTYSASSDWPNLHASYAHWYSASRNLIIHQ 375

BLAST of Clc10G08740 vs. ExPASy TrEMBL
Match: B9IJF8 (Uncharacterized protein OS=Populus trichocarpa OX=3694 GN=POPTR_017G053200 PE=4 SV=1)

HSP 1 Score: 439.5 bits (1129), Expect = 1.5e-119
Identity = 242/403 (60.05%), Positives = 309/403 (76.67%), Query Frame = 0

Query: 1   MGIAEASPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLP 60
           MG+ E SP T+AP+L+RN+AT++F+F DK L+ L++K+KLLE I  LL++SFLFFLRLLP
Sbjct: 1   MGMMETSPITIAPMLIRNIATAMFIFADKSLVALAQKHKLLEHIRYLLVTSFLFFLRLLP 60

Query: 61  SLFPSIHPVSD-------DRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSII 120
           SLFPS+ P SD         + LKP KTA+Y+        SSG GD GI+RALTQLLSI+
Sbjct: 61  SLFPSLSPSSDLQDHDNIQYHHLKPLKTANYL-------PSSGYGDSGIARALTQLLSIV 120

Query: 121 SHVPVSSRKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGF 180
           + +PVSSRKYE+VRSLAEK++D+NH E  E LREVNR VLSAAF RT+ Q+EA M+E   
Sbjct: 121 NDIPVSSRKYEIVRSLAEKVVDDNHGENNEALREVNRGVLSAAFSRTLSQLEAAMMEIAH 180

Query: 181 FQDDNEGGDGAGGGGSVAGPMEFRLGRVVRAVRLFGESACSRFGRVKEGANQTGSSVEKL 240
                   DG+  GGS  GP++ RL ++++A R  G+ + +R GR +EG ++   S EKL
Sbjct: 181 --------DGSENGGSRTGPVKRRLNQILKAARAVGDVSWARSGRGREGGDR---SEEKL 240

Query: 241 AAEVLWLAQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMG 300
           AAE+LWL QK+++CGC  EA RRWASA+ L RL+LSAE RLQ SLVKV+AFLLKQ RE+G
Sbjct: 241 AAELLWLGQKLSACGCGEEAVRRWASASNLARLALSAEARLQGSLVKVSAFLLKQARELG 300

Query: 301 KDKDGDESEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQ 360
            D+ G E +++ Q QTK+KML+SWLPLLCR SNGTDAPVLS+ ER ELE+VLEEMI  L+
Sbjct: 301 LDEAG-EGQREPQRQTKMKMLLSWLPLLCRASNGTDAPVLSMRERAELEIVLEEMIDMLE 360

Query: 361 -QDQQEQVLALWLHHFTYSSSSDWPNLHASYARWYSASRKLLI 396
            +++QEQVL+LWLHHFTY+ SSDWPNL ASYARW +ASR+LL+
Sbjct: 361 HEEEQEQVLSLWLHHFTYTPSSDWPNLRASYARWCTASRQLLL 384

BLAST of Clc10G08740 vs. TAIR 10
Match: AT5G64230.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G19920.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 283.1 bits (723), Expect = 3.5e-76
Identity = 171/390 (43.85%), Positives = 246/390 (63.08%), Query Frame = 0

Query: 7   SPTTMAPLLLRNLATSLFVFFDKFLINLSKKYKLLEIIHTLLISSFLFFLRLLPSLFPSI 66
           S   + P LLRN+  ++ VF D+ L+ +S   KLLE +   L++ FLFFLR LPS+    
Sbjct: 5   SSAAVVPQLLRNIIVAVVVFADESLLQISGNSKLLEKLRVFLVTCFLFFLRSLPSIVSFA 64

Query: 67  HPVSDDRYPLKPPKTASYVSGGIGIGSSSGSGDLGISRALTQLLSIISHVPVSSRKYEVV 126
           +P S       P  + +     I +   +   + GI RA+ QLLS ++ +PVSSRKY+VV
Sbjct: 65  NPNSSVVSFANPYSSKTKKKNKILV--INHCEESGIGRAIWQLLSAMNEIPVSSRKYQVV 124

Query: 127 RSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEGGDGAGG 186
           RSLAE+LI++N  E    L ++NR VL+A+F  T+ ++E  +      +D +E       
Sbjct: 125 RSLAERLINDNQGENSVALLDLNRRVLNASFRTTLSRLETAVERNPNRRDIDE------- 184

Query: 187 GGSVAGPMEFRLGRVVRA-VRLFGESACSRFGRVKEGANQTGSSVEKLAAEVLWLAQKMA 246
                 P+   L RVVRA VR  G+      G  +E A+QT  + EKLAAE+LWLA+KMA
Sbjct: 185 ------PVRRGLNRVVRAVVRAVGDGFIGWGG--EETADQTAETSEKLAAELLWLAEKMA 244

Query: 247 SCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKVAAFLLKQCREMGKDKDGDESEKQQ 306
             G  +EA  +WASA+ L  L+LS EPRLQ SL++++A L K+ +++ K  + +E E+ +
Sbjct: 245 VYGFVDEAVEKWASASNLAWLALSCEPRLQCSLIQISALLFKEAKDIKKGSEEEEGEEAK 304

Query: 307 QMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERRELELVLEEMIGTLQQDQQEQVLALWL 366
             + K KMLISW+PLLCR SNG D PVL   ER  LE VLE+MI  L++++QE+VL+LWL
Sbjct: 305 LREIKKKMLISWIPLLCRASNGADKPVLRSAERAYLEKVLEKMISELKEEEQERVLSLWL 364

Query: 367 HHFTYSSSSDWPNLHASYARWYSASRKLLI 396
           HH+T+ +SSDWP+L+ SY RW  +SR+LL+
Sbjct: 365 HHYTHCASSDWPDLNGSYVRWCHSSRQLLL 377

BLAST of Clc10G08740 vs. TAIR 10
Match: AT3G19920.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G64230.1); Has 217 Blast hits to 217 proteins in 16 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 215; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 150.2 bits (378), Expect = 3.5e-36
Identity = 117/354 (33.05%), Positives = 181/354 (51.13%), Query Frame = 0

Query: 76  LKPPKTASYVS-GGIGIGSSSGSGDLG--------------ISRALTQLLSIISHVPVSS 135
           L P K +S  S     +   SGS DLG              + RAL   L++++ +PV+S
Sbjct: 77  LVPAKASSSSSTSSTALVKYSGSSDLGMMICDGVDEPSVNSLGRALCHALALMNEIPVTS 136

Query: 136 RKYEVVRSLAEKLIDENHWEGIEELREVNRAVLSAAFDRTIGQIEARMIERGFFQDDNEG 195
           RKY+    +AEK++++N   G  +L +VNRA L+++F RT  +++   ++R    D+  G
Sbjct: 137 RKYQFAMGMAEKIMEDNAQSGHVDLLDVNRAALASSFARTTARLQ-DCLKRSRTADEPFG 196

Query: 196 G------DGAGGGGSVAGPMEFRLGRVVRAVRLFG-------------ESACSRFGRVKE 255
           G           GG VA  +   L   +  VR                ESA  R G ++E
Sbjct: 197 GLPLRVVSALPLGGYVASYVR-GLSACINTVRSLADMTGNLLSQTRRRESAVVRAGGIQE 256

Query: 256 GANQTGSSVEKLAAEVLWLAQKMASCGCRNEACRRWASAAQLGRLSLSAEPRLQASLVKV 315
             N+   +VEKLA E+LW+ +K+   G   E  +RW+ A+ L  LSL+A PR+Q  +VK+
Sbjct: 257 --NEAELAVEKLAEELLWMTEKLRRYGAVAEGIKRWSYASGLASLSLTAAPRVQGLMVKI 316

Query: 316 AAFLLKQCREMGKDKDGDESEKQQQMQTKLKMLISWLPLLCRGSNGTDAPVLSIGERREL 375
           +A L+    E+ +D        Q   Q   ++L +WLPL     NG   PVL+  ER E+
Sbjct: 317 SALLI---GELARD------STQVPGQVTFRLLANWLPLFSHARNGLAFPVLTGYERVEV 376

Query: 376 ELVLEEMIGTLQQDQQEQVLALWLHHFTYSSSSDWPNLHASYARWYSASRKLLI 396
           E  +++ I TL    QE +L  WL  F+  S+S+WPNL  +Y RW  ++R+L +
Sbjct: 377 ERAIDKAISTLPALDQEILLTNWLQDFSV-SASEWPNLQPAYDRWCHSTRQLFM 416

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038876704.1  1.6e-200  92.98         uncharacterized protein LOC120069090 [Benincasa hispida]
XP_008462861.1  5.0e-186  87.47         PREDICTED: uncharacterized protein LOC103501143 [Cucumis melo]
XP_004137277.1  8.6e-178  84.21         uncharacterized protein LOC101222931 [Cucumis sativus] >KGN53714.1 hypothetical ...
XP_023551926.1  1.6e-160  78.29         uncharacterized protein LOC111809753 [Cucurbita pepo subsp. pepo]
KAG7014876.1    6.2e-160  77.52         hypothetical protein SDJN02_22506, partial [Cucurbita argyrosperma subsp. argyro...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A1S3CHW5      2.4e-186  87.47         uncharacterized protein LOC103501143 OS=Cucumis melo OX=3656 GN=LOC103501143 PE=...
A0A0A0KXY7      4.1e-178  84.21         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G109030 PE=4 SV=1
A0A6J1FQP8      9.9e-156  76.83         uncharacterized protein LOC111447583 OS=Cucurbita moschata OX=3662 GN=LOC1114475...
A0A6J1ISQ7      3.5e-153  75.82         uncharacterized protein LOC111478071 OS=Cucurbita maxima OX=3661 GN=LOC111478071...
B9IJF8          1.5e-119  60.05         Uncharacterized protein OS=Populus trichocarpa OX=3694 GN=POPTR_017G053200 PE=4 ...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G64230.1     3.5e-76   43.85         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G19920.1     3.5e-36   33.05         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                    Source   Source Term    Source Description                Alignment
None       No IPR available                   COILS    Coil           Coil                              coord: 338..358
None       No IPR available                   PANTHER  PTHR31060:SF4  1,8-CINEOLE SYNTHASE              coord: 5..395
IPR038920  BTB/POZ domain-containing protein  PANTHER  PTHR31060      OSJNBA0011J08.25 PROTEIN-RELATED  coord: 5..395

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc10G08740.1  Clc10G08740.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016567      protein ubiquitination
cellular_component  GO:0016021      integral component of membrane