ClCG10G014030 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG10G014030
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Unknown protein
Location: CG_Chr10: 28339534 .. 28342891 (-)
RNA-Seq Expression: ClCG10G014030
Synteny: ClCG10G014030
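
The Location field packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it and derive the span length. The function name `parse_location` and the regular expression are illustrative assumptions, not part of the database, and the length calculation assumes 1-based inclusive coordinates (the GFF convention), which this page does not state.

```python
import re

# Pattern for the page's "CHROM: START .. END (STRAND)" layout. This is an
# assumption based on this one record, not a documented format.
LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into chromosome, start, end, strand and span length."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumes 1-based inclusive coordinates, so both endpoints are counted.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("CG_Chr10: 28339534 .. 28342891 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # CG_Chr10 - 3358
```

Under that coordinate assumption the gene spans 3,358 bp on the minus strand of CG_Chr10.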
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTGTTCGAAGCTACTGGTGATGCAACGACCCTCGGTTACTGGTTAAACTGGAGGGTATTAATCTCTGCAATTTGGGTATTTGCTTCTTTTACCTTTGCACTATGGATGATATGGAAATATGAAGTTAAGGATAGATTAGGACACAGCAGACAAGCAACTCAGCAAGATAAAAACAAACTTCGGAGTTGTGAAGCTTGGACACCATGTCTCAAACAAATTCATCCAATTTGGTTATTGGCTTTTCGAGTTTGTGCGTTTGGTTTGATGCTGGCTTCACTAATTGTAAAGGCTTTGGCTAACGGGGCTTCCATGTTTTATTACTATACCCAGTAAGCATTTTCTTATTGCTGGGATTATCATTTGGCTATCTTAAAGTAAAGTAGAGCAAATCAAACCTATCGGGTTTCCTCTCTGGATGAAGTGTTCTAGTCATTACTTCTTTCCGTTTAATTGTTCCTTGGGAAAATGAGTTGTCTTACTTAATCTCATTCGATTTTAGTTAAAATAAGGAATTTAAATTATGGATTCAACATCACTGATGGAGGCGATTTGGGGTGTTCTGGATATTTACCGGGAAGATATATCAGTTTGCACAATTTCAGGAGGTTGGAAATTTTCAGACATTGGCTTTCAGCAGCAGATGTCTCAATCCATGTATCATCAATTGTGTTCTTATAGATGATATACTGTGCAATTTACCACCAGCGTTTCCAGTGTTACGAACCTTTCCCTTTATAATCTTCTGACTTCTGTTTTGTTGTATGGATTGTTTGGCAGGTGGACATTTACTTTACTTACAATATATTTTCTGGTATGTTGTCCTTTTTTTCTTCATTTCAGTACTGATTAATAGCACAAACTTCAACATACTTCAGTTTCTTCAAATGTTTCACTAGCTAATTATCTAAATTTCTTCTTTTCAGTGTGGGTCACTGATTTCCATATATGGAGTTTTTCTCTGTAACAGAAAAAGAACCGAAGGTCTTTATGCTCATCTTAATGAGAATGGTATGGAGGAAGGGCAACATGTTCCTCTTCTTTCTGGGAAGCCCTCAAATTTAACTGGGGGAAATATAGTTTCGTATTCCAAGGAACAAAGCTTTTCTTCGATGGCTGTCAACATCTGGAGTTATATTTTTGAAGTTTTATTCCAGGTTAATTTTCAGTTAACTTATAAAAGATCAATTCTCAAATTTGTTTGAATCACTATCAACTTTATCGACTTTGTTTTTTTTGTTTTCTCCAAACAGATTAATGCAGGGGCAGTTGTTCTCACTGATTGCACTTATTGGTTTGTCATATTTCCATTTCTTACCATCAAAGATTACAATTTGAGCTTTGTAAGTACTGTGGCTTTAATCAAAACGTCTAGCCTTTATTATGTTGTTTGCTTGGATGCTGGTAGATCTAAAATGCATAACTCTGAGAATTGTTGCATTTTGCATTTTGTTGTGTTAATGGAAATCTCTCCTCCCCCACCCCACAAGAAAGAAGAAGAAGAAGAAGAAAGGCTATAATCGGTTAAGACCTTGGAGACTAGATGCATCAAATGATAGCTCTTATTATAATTAGTATCGTTGTAATCACTTGAGAATATTTGTTGAATATTAGTTCTTCATAGTTCATTTACGACCATTTCTATTTCTCCTTTGCTTAAGGTTTGCTCCATCTCAATCAGTTATCTCCATAATGTCAGATGACGATCAATATGCACACGCTGAATTTGGTTTTACTTCTCGGTGAAACTGCTCTTAATTGCTTGGTAATGTTCTCTTATATTTCACTATTTAGTTAAAAAAAAAAAACATTTTTGATGTCTAAACTTTCATAAAAGTATCAACTAGTCCTATATTTTGAAAATTGTAACAATTTAGTCACTCTACTTTAAATTTTGTAATTTATAAAGTCTTTTGGTTTCTAATCAATATTTTGATATACATATGTAAGAGGTTCCATTAAATGTATTTTTCATGATAGAGACAAAATTGCTACATATG
TTCAAATCACACGAATTAAATTGTTACATATGTTAGTTCAAGGAGTACGAAATTGCTATAAAATTAAAAACACAATGAGAATTGTTATTGCCATGTTTTTTAAGCTCATTTTTTCTCTTCCTTGTACTCTTTTTTTAATAAGCCGTTTGATGTCGATCCTCTGCAGACACTTCCAAGGTTCCGAATTTCATTCTTCTTTTTATGGACGGGTATTTATGTCATTTCTCAGTGGATTGTCCATGCATTTGTTTCGATTGGGTAACTTACACTTTCCCTTCATCCTGCTTTCCAACTCTTACCTATAAGAAAATCACCAAGAAAATAGCAAGAGTTGCCTTTCTTTTCTGCAGGTGGCCATATCCATTCCTTGATCTGTCAGCTCCTTATTCTCCATTATGGTACGAACTTAGATTTTCAATTCAGTCAAAACATTATTTGACATGAACTTTCACATAACTTAAGTATCTATCACTGCAACTCCATCTCAGTCTAGAAATTATGCTCCAATTAACATACAAGTGACTATTAATTTGTACATATTCATTTGCGTTTATATCGTATAAATTCATTAAAAAAAAAGTTTTCTTTACGTGTTCTAGGTTTAATTCCCATTTTGGTTCATAAACTTTGTACTTGTATCCTAAACTCTATTCAAAAAGCAATTATAGTCCTTGTTGTTAATTCTTCACTAATCATTTTACAAAAGAAGTTCAAATCTTCTTTGAAACATAACTCCTACACTACTAGATGTGATTTCAAACTAAACATTGTGTATAGATCTGTTCACCGAATAAATGAAAATAAAACGTGTTATGTTAGGTTGTTATAACGTCAAAACTCTGAACGTTTGTTCCATAAGTTTTTAATCCATGCTATTTTCATTCTTTCTTTCAGCATTTCACTAGCAAGTTGCTCTTTAAATGTTACTTCCAAACTACAAGCTTCTTCAATTTTTAGCGTCATGTTTGATACTGATTTTCAAATCAGTAATTCTAGTGAAAATAATTTATTCATGGTTTTTTTTTTAATCATATAATCAGTTTTATAATATCTTAGAGTGTTTGTGTCAGCTTGCGCACATTTCAACTAATTTTACAGGGCAACTTGCTTGGTTTTACAATAGTTGGGTGTCAAAGAAATATTAATCATCAAAATCGATTTTGAATGATTAAAGACATTTCAAAATCACTTTCAAATATGCTCTTACATGATTGAATCTAAATCCTTTTTCTTTGATGAATCTCAGGTATCTACTGATGGGATTGATTCATATTCCAAGCTATGGTATTTTCATGTTGATTATAACACTGAAACATAAACTGATGACGAAGTGGTTCCCTCAGTCATACCAGTGCTAG

mRNA sequence

ATGCTGTTCGAAGCTACTGGTGATGCAACGACCCTCGGTTACTGGTTAAACTGGAGGGTATTAATCTCTGCAATTTGGGTATTTGCTTCTTTTACCTTTGCACTATGGATGATATGGAAATATGAAGTTAAGGATAGATTAGGACACAGCAGACAAGCAACTCAGCAAGATAAAAACAAACTTCGGAGTTGTGAAGCTTGGACACCATGTCTCAAACAAATTCATCCAATTTGGTTATTGGCTTTTCGAGTTTGTGCGTTTGGTTTGATGCTGGCTTCACTAATTGTAAAGGCTTTGGCTAACGGGGCTTCCATGTTTTATTACTATACCCAGTGGACATTTACTTTACTTACAATATATTTTCTGTGTGGGTCACTGATTTCCATATATGGAGTTTTTCTCTGTAACAGAAAAAGAACCGAAGGTCTTTATGCTCATCTTAATGAGAATGGTATGGAGGAAGGGCAACATGTTCCTCTTCTTTCTGGGAAGCCCTCAAATTTAACTGGGGGAAATATAGTTTCGTATTCCAAGGAACAAAGCTTTTCTTCGATGGCTGTCAACATCTGGAGTTATATTTTTGAAGTTTTATTCCAGATTAATGCAGGGGCAGTTGTTCTCACTGATTGCACTTATTGGTTTGTCATATTTCCATTTCTTACCATCAAAGATTACAATTTGAGCTTTATGACGATCAATATGCACACGCTGAATTTGGTTTTACTTCTCGGTGAAACTGCTCTTAATTGCTTGACACTTCCAAGGTTCCGAATTTCATTCTTCTTTTTATGGACGGGTATTTATGTCATTTCTCAGTGGATTGTCCATGCATTTGTTTCGATTGGGTGGCCATATCCATTCCTTGATCTGTCAGCTCCTTATTCTCCATTATGGTATCTACTGATGGGATTGATTCATATTCCAAGCTATGGTATTTTCATGTTGATTATAACACTGAAACATAAACTGATGACGAAGTGGTTCCCTCAGTCATACCAGTGCTAG

Coding sequence (CDS)

ATGCTGTTCGAAGCTACTGGTGATGCAACGACCCTCGGTTACTGGTTAAACTGGAGGGTATTAATCTCTGCAATTTGGGTATTTGCTTCTTTTACCTTTGCACTATGGATGATATGGAAATATGAAGTTAAGGATAGATTAGGACACAGCAGACAAGCAACTCAGCAAGATAAAAACAAACTTCGGAGTTGTGAAGCTTGGACACCATGTCTCAAACAAATTCATCCAATTTGGTTATTGGCTTTTCGAGTTTGTGCGTTTGGTTTGATGCTGGCTTCACTAATTGTAAAGGCTTTGGCTAACGGGGCTTCCATGTTTTATTACTATACCCAGTGGACATTTACTTTACTTACAATATATTTTCTGTGTGGGTCACTGATTTCCATATATGGAGTTTTTCTCTGTAACAGAAAAAGAACCGAAGGTCTTTATGCTCATCTTAATGAGAATGGTATGGAGGAAGGGCAACATGTTCCTCTTCTTTCTGGGAAGCCCTCAAATTTAACTGGGGGAAATATAGTTTCGTATTCCAAGGAACAAAGCTTTTCTTCGATGGCTGTCAACATCTGGAGTTATATTTTTGAAGTTTTATTCCAGATTAATGCAGGGGCAGTTGTTCTCACTGATTGCACTTATTGGTTTGTCATATTTCCATTTCTTACCATCAAAGATTACAATTTGAGCTTTATGACGATCAATATGCACACGCTGAATTTGGTTTTACTTCTCGGTGAAACTGCTCTTAATTGCTTGACACTTCCAAGGTTCCGAATTTCATTCTTCTTTTTATGGACGGGTATTTATGTCATTTCTCAGTGGATTGTCCATGCATTTGTTTCGATTGGGTGGCCATATCCATTCCTTGATCTGTCAGCTCCTTATTCTCCATTATGGTATCTACTGATGGGATTGATTCATATTCCAAGCTATGGTATTTTCATGTTGATTATAACACTGAAACATAAACTGATGACGAAGTGGTTCCCTCAGTCATACCAGTGCTAG

Protein sequence

MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNKLRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIYFLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQSFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLVLLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYLLMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC
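
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch of that relationship, assuming the standard nuclear genetic code (the page does not name the translation table), the function below translates a DNA string and stops at the first stop codon. The name `translate` and the compact codon-table construction are illustrative choices.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
# Assumption: the CDS uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 11 codons and last 6 codons of the CDS listed above.
print(translate("ATGCTGTTCGAAGCTACTGGTGATGCAACGACC"))  # MLFEATGDATT
print(translate("CAGTCATACCAGTGCTAG"))                 # QSYQC
```

The two fragments reproduce the start (MLFEATGDATT) and end (QSYQC) of the listed protein, with the final TAG acting as the stop codon.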
Homology
BLAST of ClCG10G014030 vs. NCBI nr
Match: XP_038903193.1 (uncharacterized protein LOC120089852 [Benincasa hispida])

HSP 1 Score: 630.9 bits (1626), Expect = 6.1e-177
Identity = 311/334 (93.11%), Positives = 314/334 (94.01%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFEATGDATTL YWLNWRVL+ AIWV ASFTFALWMIWKYEVKDRLGHSRQ TQQDKNK
Sbjct: 1   MLFEATGDATTLSYWLNWRVLLCAIWVLASFTFALWMIWKYEVKDRLGHSRQETQQDKNK 60

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LRSCE WTPCLKQIHPIWLLAFRVCAFGLMLASLIVKAL NG SMFYYYTQWTFTLLTIY
Sbjct: 61  LRSCEVWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALVNGTSMFYYYTQWTFTLLTIY 120

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           FLCGS+ISIYGVFLCNRKRTE       ENGMEEGQHVPLLSGKPSNL GGNIVSYSKE+
Sbjct: 121 FLCGSVISIYGVFLCNRKRTEEA-----ENGMEEGQHVPLLSGKPSNLIGGNIVSYSKEK 180

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           SFS  AVNIWSYI EVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV
Sbjct: 181 SFSLTAVNIWSYILEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALNCLTLPR RISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL
Sbjct: 241 LLLGETALNCLTLPRCRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMGLIHIPSYGIFMLII LKHKLMTKWFPQSYQC
Sbjct: 301 LMGLIHIPSYGIFMLIIKLKHKLMTKWFPQSYQC 329
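
The identity and positives percentages in an HSP header are the match count divided by the alignment length. The sketch below recomputes the figures reported for this hit and counts identical columns in the first 60-column block of the alignment. The helper names `percent` and `count_identities` are illustrative, and rounding to two decimals is an assumption that happens to reproduce the values shown here.

```python
def percent(matches, aligned_length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

def count_identities(query, subject):
    """Count columns where two equal-length aligned strings share a residue."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    # Gap characters never count as identities.
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

print(percent(311, 334))  # 93.11
print(percent(314, 334))  # 94.01

# Columns 1-60 of the query and subject rows shown above.
query_block = "MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK"
sbjct_block = "MLFEATGDATTLSYWLNWRVLLCAIWVLASFTFALWMIWKYEVKDRLGHSRQETQQDKNK"
print(count_identities(query_block, sbjct_block))  # 55
```

Summing such per-block counts over every block of an alignment should give the numerator of the Identity field.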

BLAST of ClCG10G014030 vs. NCBI nr
Match: XP_008443544.1 (PREDICTED: uncharacterized protein LOC103487109 [Cucumis melo])

HSP 1 Score: 620.9 bits (1600), Expect = 6.3e-174
Identity = 303/334 (90.72%), Positives = 310/334 (92.81%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFEATGDATTL YWLNW  LI  IWVFASFTFALWMIW YEVKDRLGHSRQ TQQDKNK
Sbjct: 1   MLFEATGDATTLSYWLNWWALICEIWVFASFTFALWMIWNYEVKDRLGHSRQGTQQDKNK 60

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LR CEAWTPCL QIHPI+LLAFRVC+FGLMLASL+VKAL NGASMFYYYTQWTFTLLTIY
Sbjct: 61  LRGCEAWTPCLIQIHPIFLLAFRVCSFGLMLASLVVKALVNGASMFYYYTQWTFTLLTIY 120

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGSLISIYGVFLCNRKRT+GLYA LNENGMEEGQHVPLLSGKPSNL GGNIVSYSK+Q
Sbjct: 121 FACGSLISIYGVFLCNRKRTQGLYAQLNENGMEEGQHVPLLSGKPSNLIGGNIVSYSKDQ 180

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           S SS AVNIWSY FEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFM INMHTLNL+
Sbjct: 181 SLSSTAVNIWSYTFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMMINMHTLNLI 240

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLP FRISFFFLWTGIYVI QWIVHAFVSIGWPYPFLDLSAPYSPLWYL
Sbjct: 241 LLLGETALNSLTLPTFRISFFFLWTGIYVIFQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMG IHIPSYGIFMLII LKHKL+ KWFPQ YQC
Sbjct: 301 LMGSIHIPSYGIFMLIIKLKHKLIMKWFPQPYQC 334

BLAST of ClCG10G014030 vs. NCBI nr
Match: XP_004141192.2 (uncharacterized protein LOC101218542 isoform X1 [Cucumis sativus] >KGN59688.1 hypothetical protein Csa_000897 [Cucumis sativus])

HSP 1 Score: 611.3 bits (1575), Expect = 5.0e-171
Identity = 300/334 (89.82%), Positives = 306/334 (91.62%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFE TGDATTL YWLNW  L   IWVFASFTFALWMIW YEVKDRLGHSR+ TQQDKNK
Sbjct: 1   MLFEGTGDATTLSYWLNWWALSCEIWVFASFTFALWMIWNYEVKDRLGHSRRGTQQDKNK 60

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LR CEAWTPCL QIHPI LLAFRVCAFG+MLASLIVKAL NGASMFYYYTQW FTLLTIY
Sbjct: 61  LRGCEAWTPCLIQIHPICLLAFRVCAFGMMLASLIVKALVNGASMFYYYTQWAFTLLTIY 120

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGSLISIYGVFLCNRKRTEGL A +NENGMEEGQ VPLLSGKPSNL GGNIVSYSK+Q
Sbjct: 121 FACGSLISIYGVFLCNRKRTEGLCAQVNENGMEEGQQVPLLSGKPSNLIGGNIVSYSKDQ 180

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           SFSS AVNIW YIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV
Sbjct: 181 SFSSTAVNIWCYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLP FRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWY+
Sbjct: 241 LLLGETALNSLTLPTFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYM 300

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMG IHIPSYGIFMLII LKHKL+ KWFPQ Y C
Sbjct: 301 LMGSIHIPSYGIFMLIIKLKHKLIMKWFPQPYHC 334

BLAST of ClCG10G014030 vs. NCBI nr
Match: XP_022983923.1 (uncharacterized protein LOC111482400 isoform X1 [Cucurbita maxima])

HSP 1 Score: 603.2 bits (1554), Expect = 1.4e-168
Identity = 292/334 (87.43%), Positives = 306/334 (91.62%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFEAT DATTLGYWLNWRVLI AIWVFASFT ++WMIW+YE+KDRLGHS Q TQQDKNK
Sbjct: 7   MLFEATADATTLGYWLNWRVLICAIWVFASFTLSIWMIWEYEIKDRLGHSTQETQQDKNK 66

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LRSCEAW PCL+QIHPIW+LAFRVCAFG MLASLIVK LANGAS FYYYTQWTFTLLTIY
Sbjct: 67  LRSCEAWRPCLRQIHPIWMLAFRVCAFGSMLASLIVKTLANGASTFYYYTQWTFTLLTIY 126

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGS+ISIYGVF+CNRKRTEGL  HLNEN MEEGQHVPLLSGKPSNL GGNIVSYSKEQ
Sbjct: 127 FACGSVISIYGVFICNRKRTEGLNEHLNENNMEEGQHVPLLSGKPSNLIGGNIVSYSKEQ 186

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           SFS    +IWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYN SFMTINMHTLNL 
Sbjct: 187 SFS--PADIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNFSFMTINMHTLNLA 246

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLP FRISFFFLWTGIYVI QWI++A VSIGWPYPFLDLS PY+PLWY 
Sbjct: 247 LLLGETALNSLTLPSFRISFFFLWTGIYVIFQWIINAVVSIGWPYPFLDLSVPYAPLWYA 306

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMGLIHIPSYG+FMLII LKH+LM KWFPQSYQC
Sbjct: 307 LMGLIHIPSYGVFMLIIKLKHELMMKWFPQSYQC 338

BLAST of ClCG10G014030 vs. NCBI nr
Match: XP_022983924.1 (uncharacterized protein LOC111482400 isoform X2 [Cucurbita maxima])

HSP 1 Score: 603.2 bits (1554), Expect = 1.4e-168
Identity = 292/334 (87.43%), Positives = 306/334 (91.62%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFEAT DATTLGYWLNWRVLI AIWVFASFT ++WMIW+YE+KDRLGHS Q TQQDKNK
Sbjct: 4   MLFEATADATTLGYWLNWRVLICAIWVFASFTLSIWMIWEYEIKDRLGHSTQETQQDKNK 63

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LRSCEAW PCL+QIHPIW+LAFRVCAFG MLASLIVK LANGAS FYYYTQWTFTLLTIY
Sbjct: 64  LRSCEAWRPCLRQIHPIWMLAFRVCAFGSMLASLIVKTLANGASTFYYYTQWTFTLLTIY 123

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGS+ISIYGVF+CNRKRTEGL  HLNEN MEEGQHVPLLSGKPSNL GGNIVSYSKEQ
Sbjct: 124 FACGSVISIYGVFICNRKRTEGLNEHLNENNMEEGQHVPLLSGKPSNLIGGNIVSYSKEQ 183

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           SFS    +IWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYN SFMTINMHTLNL 
Sbjct: 184 SFS--PADIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNFSFMTINMHTLNLA 243

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLP FRISFFFLWTGIYVI QWI++A VSIGWPYPFLDLS PY+PLWY 
Sbjct: 244 LLLGETALNSLTLPSFRISFFFLWTGIYVIFQWIINAVVSIGWPYPFLDLSVPYAPLWYA 303

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMGLIHIPSYG+FMLII LKH+LM KWFPQSYQC
Sbjct: 304 LMGLIHIPSYGVFMLIIKLKHELMMKWFPQSYQC 335

BLAST of ClCG10G014030 vs. ExPASy TrEMBL
Match: A0A1S3B8A1 (uncharacterized protein LOC103487109 OS=Cucumis melo OX=3656 GN=LOC103487109 PE=4 SV=1)

HSP 1 Score: 620.9 bits (1600), Expect = 3.0e-174
Identity = 303/334 (90.72%), Positives = 310/334 (92.81%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFEATGDATTL YWLNW  LI  IWVFASFTFALWMIW YEVKDRLGHSRQ TQQDKNK
Sbjct: 1   MLFEATGDATTLSYWLNWWALICEIWVFASFTFALWMIWNYEVKDRLGHSRQGTQQDKNK 60

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LR CEAWTPCL QIHPI+LLAFRVC+FGLMLASL+VKAL NGASMFYYYTQWTFTLLTIY
Sbjct: 61  LRGCEAWTPCLIQIHPIFLLAFRVCSFGLMLASLVVKALVNGASMFYYYTQWTFTLLTIY 120

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGSLISIYGVFLCNRKRT+GLYA LNENGMEEGQHVPLLSGKPSNL GGNIVSYSK+Q
Sbjct: 121 FACGSLISIYGVFLCNRKRTQGLYAQLNENGMEEGQHVPLLSGKPSNLIGGNIVSYSKDQ 180

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           S SS AVNIWSY FEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFM INMHTLNL+
Sbjct: 181 SLSSTAVNIWSYTFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMMINMHTLNLI 240

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLP FRISFFFLWTGIYVI QWIVHAFVSIGWPYPFLDLSAPYSPLWYL
Sbjct: 241 LLLGETALNSLTLPTFRISFFFLWTGIYVIFQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMG IHIPSYGIFMLII LKHKL+ KWFPQ YQC
Sbjct: 301 LMGSIHIPSYGIFMLIIKLKHKLIMKWFPQPYQC 334

BLAST of ClCG10G014030 vs. ExPASy TrEMBL
Match: A0A0A0LD20 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G838670 PE=4 SV=1)

HSP 1 Score: 611.3 bits (1575), Expect = 2.4e-171
Identity = 300/334 (89.82%), Positives = 306/334 (91.62%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFE TGDATTL YWLNW  L   IWVFASFTFALWMIW YEVKDRLGHSR+ TQQDKNK
Sbjct: 1   MLFEGTGDATTLSYWLNWWALSCEIWVFASFTFALWMIWNYEVKDRLGHSRRGTQQDKNK 60

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LR CEAWTPCL QIHPI LLAFRVCAFG+MLASLIVKAL NGASMFYYYTQW FTLLTIY
Sbjct: 61  LRGCEAWTPCLIQIHPICLLAFRVCAFGMMLASLIVKALVNGASMFYYYTQWAFTLLTIY 120

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGSLISIYGVFLCNRKRTEGL A +NENGMEEGQ VPLLSGKPSNL GGNIVSYSK+Q
Sbjct: 121 FACGSLISIYGVFLCNRKRTEGLCAQVNENGMEEGQQVPLLSGKPSNLIGGNIVSYSKDQ 180

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           SFSS AVNIW YIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV
Sbjct: 181 SFSSTAVNIWCYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLP FRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWY+
Sbjct: 241 LLLGETALNSLTLPTFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYM 300

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMG IHIPSYGIFMLII LKHKL+ KWFPQ Y C
Sbjct: 301 LMGSIHIPSYGIFMLIIKLKHKLIMKWFPQPYHC 334

BLAST of ClCG10G014030 vs. ExPASy TrEMBL
Match: A0A6J1J8Z5 (uncharacterized protein LOC111482400 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482400 PE=4 SV=1)

HSP 1 Score: 603.2 bits (1554), Expect = 6.6e-169
Identity = 292/334 (87.43%), Positives = 306/334 (91.62%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFEAT DATTLGYWLNWRVLI AIWVFASFT ++WMIW+YE+KDRLGHS Q TQQDKNK
Sbjct: 4   MLFEATADATTLGYWLNWRVLICAIWVFASFTLSIWMIWEYEIKDRLGHSTQETQQDKNK 63

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LRSCEAW PCL+QIHPIW+LAFRVCAFG MLASLIVK LANGAS FYYYTQWTFTLLTIY
Sbjct: 64  LRSCEAWRPCLRQIHPIWMLAFRVCAFGSMLASLIVKTLANGASTFYYYTQWTFTLLTIY 123

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGS+ISIYGVF+CNRKRTEGL  HLNEN MEEGQHVPLLSGKPSNL GGNIVSYSKEQ
Sbjct: 124 FACGSVISIYGVFICNRKRTEGLNEHLNENNMEEGQHVPLLSGKPSNLIGGNIVSYSKEQ 183

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           SFS    +IWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYN SFMTINMHTLNL 
Sbjct: 184 SFS--PADIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNFSFMTINMHTLNLA 243

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLP FRISFFFLWTGIYVI QWI++A VSIGWPYPFLDLS PY+PLWY 
Sbjct: 244 LLLGETALNSLTLPSFRISFFFLWTGIYVIFQWIINAVVSIGWPYPFLDLSVPYAPLWYA 303

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMGLIHIPSYG+FMLII LKH+LM KWFPQSYQC
Sbjct: 304 LMGLIHIPSYGVFMLIIKLKHELMMKWFPQSYQC 335

BLAST of ClCG10G014030 vs. ExPASy TrEMBL
Match: A0A6J1J931 (uncharacterized protein LOC111482400 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482400 PE=4 SV=1)

HSP 1 Score: 603.2 bits (1554), Expect = 6.6e-169
Identity = 292/334 (87.43%), Positives = 306/334 (91.62%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFEAT DATTLGYWLNWRVLI AIWVFASFT ++WMIW+YE+KDRLGHS Q TQQDKNK
Sbjct: 7   MLFEATADATTLGYWLNWRVLICAIWVFASFTLSIWMIWEYEIKDRLGHSTQETQQDKNK 66

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LRSCEAW PCL+QIHPIW+LAFRVCAFG MLASLIVK LANGAS FYYYTQWTFTLLTIY
Sbjct: 67  LRSCEAWRPCLRQIHPIWMLAFRVCAFGSMLASLIVKTLANGASTFYYYTQWTFTLLTIY 126

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGS+ISIYGVF+CNRKRTEGL  HLNEN MEEGQHVPLLSGKPSNL GGNIVSYSKEQ
Sbjct: 127 FACGSVISIYGVFICNRKRTEGLNEHLNENNMEEGQHVPLLSGKPSNLIGGNIVSYSKEQ 186

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           SFS    +IWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYN SFMTINMHTLNL 
Sbjct: 187 SFS--PADIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNFSFMTINMHTLNLA 246

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLP FRISFFFLWTGIYVI QWI++A VSIGWPYPFLDLS PY+PLWY 
Sbjct: 247 LLLGETALNSLTLPSFRISFFFLWTGIYVIFQWIINAVVSIGWPYPFLDLSVPYAPLWYA 306

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQC 335
           LMGLIHIPSYG+FMLII LKH+LM KWFPQSYQC
Sbjct: 307 LMGLIHIPSYGVFMLIIKLKHELMMKWFPQSYQC 338

BLAST of ClCG10G014030 vs. ExPASy TrEMBL
Match: A0A6J1F1Y9 (uncharacterized protein LOC111441456 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111441456 PE=4 SV=1)

HSP 1 Score: 595.9 bits (1535), Expect = 1.0e-166
Identity = 290/333 (87.09%), Positives = 303/333 (90.99%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           MLFE T DATTLGYWLNWRVLI AIWVFASFT A+WMIW+YE+KDRLGHS Q TQQDKNK
Sbjct: 4   MLFETTADATTLGYWLNWRVLICAIWVFASFTLAIWMIWEYEIKDRLGHSTQETQQDKNK 63

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           LRS EAW PCL+QIHPIW+LAFRVCAF  MLASLIVK LANGAS FYYYTQWTFTLLTIY
Sbjct: 64  LRSFEAWRPCLRQIHPIWMLAFRVCAFSSMLASLIVKTLANGASTFYYYTQWTFTLLTIY 123

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F CGS+ISIYGVF+CNRKRTEGL  HLNEN MEEGQHVPLLSGKPSNL GGNIVSYSKEQ
Sbjct: 124 FACGSVISIYGVFICNRKRTEGLNEHLNENDMEEGQHVPLLSGKPSNLIGGNIVSYSKEQ 183

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
            FS    +IWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYN SFMT NMHTLNL 
Sbjct: 184 RFS--PADIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNFSFMTTNMHTLNLA 243

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLLGETALN LTLPRFRISFFFLWTGIYVI QWI+HA VSIGWPYPFLDLSAPY+PLWY+
Sbjct: 244 LLLGETALNSLTLPRFRISFFFLWTGIYVIFQWIIHAVVSIGWPYPFLDLSAPYAPLWYV 303

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQ 334
           LMGLIHIPSYG+FMLII LKH+LM KWFPQSYQ
Sbjct: 304 LMGLIHIPSYGVFMLIIKLKHELMMKWFPQSYQ 334

BLAST of ClCG10G014030 vs. TAIR 10
Match: AT5G62960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G10660.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 300.1 bits (767), Expect = 2.3e-81
Identity = 148/336 (44.05%), Positives = 213/336 (63.39%), Query Frame = 0

Query: 6   TGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYE----VKDRLGHSRQATQQDKNKL 65
           T + T   YW NWRV+I  IW+  +     ++I+KYE     +  +G      ++    +
Sbjct: 26  TANTTESSYWFNWRVMICCIWMAIATVITAFLIFKYEGFRRKRSDVGEVDGGEKEWSGNV 85

Query: 66  RSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIYF 125
              E W PCL+ IHP WLLAFRV AF ++L  LIV  L +G ++F+YYTQWTF L+T+YF
Sbjct: 86  YEDETWRPCLRNIHPAWLLAFRVVAFFVLLVMLIVIGLVDGPTIFFYYTQWTFGLITLYF 145

Query: 126 LCGSLISIYGVFLCNRK----RTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYS 185
             GSL+S++G +  N++    R + + A  +E    +G                N +  S
Sbjct: 146 GLGSLLSLHGCYQYNKRAAGDRVDSIEAIDSERARSKG--------------ADNTIQQS 205

Query: 186 KEQSFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTL 245
           +   +SS     W Y+F+++FQ+NAGAV+LTDC +WF+I PFL I DY+L+ + INMH+L
Sbjct: 206 Q---YSSNPAGFWGYVFQIIFQMNAGAVLLTDCVFWFIIVPFLEIHDYSLNVLVINMHSL 265

Query: 246 NLVLLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPL 305
           N + LLG+ ALN L+ P FRI++FF WT  YVI QW +H+ V I WPYPFLDLS+ Y+PL
Sbjct: 266 NAIFLLGDAALNSLSFPCFRIAYFFFWTIAYVIFQWALHSLVHIWWPYPFLDLSSHYAPL 325

Query: 306 WYLLMGLIHIPSYGIFMLIITLKHKLMTKWFPQSYQ 334
           WY  + ++H+P YG F L++ LKH+L+ +WFP+SYQ
Sbjct: 326 WYFSVAVMHLPCYGAFALLVKLKHRLLQRWFPESYQ 344

BLAST of ClCG10G014030 vs. TAIR 10
Match: AT3G27770.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G62960.1); Has 158 Blast hits to 157 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 13; Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 295.0 bits (754), Expect = 7.3e-80
Identity = 150/325 (46.15%), Positives = 209/325 (64.31%), Query Frame = 0

Query: 1   MLFEATGDATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNK 60
           M  E   + T+  YW NWRVL+ AIWV      +L ++WKYE         Q +    + 
Sbjct: 1   MNLENVLEFTSFDYWFNWRVLLCAIWVIVPMIVSLLVLWKYEDS---SVQTQPSLNGNDV 60

Query: 61  LRSCEAWTPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIY 120
           L   + W PC ++IHP WLL FRV  F  +LA+ I +    G  ++YYYTQWTFTL+ IY
Sbjct: 61  LCIDDVWRPCFERIHPGWLLGFRVLGFCFLLANNIARFANRGWRIYYYYTQWTFTLIAIY 120

Query: 121 FLCGSLISIYGVFLCNRKRTEGLYAHLNENGMEEGQHVPLLSGKPSNLTGGNIVSYSKEQ 180
           F  GSL+SIYG     ++   GL A       E G   PL+        G N+VS+ K +
Sbjct: 121 FGMGSLLSIYGCLQYKKQGNTGLIADQVGIDAENGFRSPLID-------GDNMVSFEKRK 180

Query: 181 SFSSMAVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLV 240
           +  S A+  + ++F++++Q+ AGA VLTD  YW VIFPFL+++DY +SFMT+N+HT NLV
Sbjct: 181 TSGSEALKSYVHLFQIIYQMGAGAAVLTDSIYWTVIFPFLSLQDYEMSFMTVNLHTSNLV 240

Query: 241 LLLGETALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYL 300
           LLL +T LN L  P FR S+F LWTG +V+ QWI+H F+S+GWPYPFL+LS   +P+WYL
Sbjct: 241 LLLIDTFLNRLKFPLFRFSYFILWTGCFVLFQWILHMFISVGWPYPFLNLSLDMAPVWYL 300

Query: 301 LMGLIHIPSYGIFMLIITLKHKLMT 326
           L+ L+H+PSYG+F LI+ +K+KL++
Sbjct: 301 LVALLHLPSYGLFALIVKIKYKLIS 315

BLAST of ClCG10G014030 vs. TAIR 10
Match: AT1G10660.2 (unknown protein; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G62960.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 274.6 bits (701), Expect = 1.0e-73
Identity = 144/319 (45.14%), Positives = 206/319 (64.58%), Query Frame = 0

Query: 8   DATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNKLRSCEAW 67
           D T   YWLNWRVL+ A+ + A    A  +IWKYE K R    R++ ++    L   EAW
Sbjct: 4   DTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRR--RQRESQRELPGTLFQDEAW 63

Query: 68  TPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIYFLCGSLI 127
           T C K+IHP+WLLAFRV +F  ML  LI   + +GA +FY+YTQWTFTL+T+YF   S++
Sbjct: 64  TTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFGYASVL 123

Query: 128 SIYGVFLCNRKRTEGLYAHLNENGMEEGQHVP--LLSGKPSNLTGGNIVSYSKEQSFSSM 187
           S+YG  + N++ +  + ++ +    E+G + P   L G+ +     N  S    ++ +  
Sbjct: 124 SVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASNRPS----EAPARK 183

Query: 188 AVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLVLLLGE 247
               W YIF++LFQ  AGAVVLTD  +W +I+PF   K Y LSF+ + MH+LN V LLG+
Sbjct: 184 TAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFLLGD 243

Query: 248 TALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYLLMGLI 307
           T+LN L  P FRI++F LW+ I+V  QWI+HA  ++ WPY FLDLS+PY+PLWYL + ++
Sbjct: 244 TSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGVAVM 303

Query: 308 HIPSYGIFMLIITLKHKLM 325
           HIP + +F L+I LK+ L+
Sbjct: 304 HIPCFAVFALVIKLKNYLL 314

BLAST of ClCG10G014030 vs. TAIR 10
Match: AT1G10660.3 (unknown protein; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G62960.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 274.6 bits (701), Expect = 1.0e-73
Identity = 144/319 (45.14%), Positives = 206/319 (64.58%), Query Frame = 0

Query: 8   DATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNKLRSCEAW 67
           D T   YWLNWRVL+ A+ + A    A  +IWKYE K R    R++ ++    L   EAW
Sbjct: 4   DTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRR--RQRESQRELPGTLFQDEAW 63

Query: 68  TPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIYFLCGSLI 127
           T C K+IHP+WLLAFRV +F  ML  LI   + +GA +FY+YTQWTFTL+T+YF   S++
Sbjct: 64  TTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFGYASVL 123

Query: 128 SIYGVFLCNRKRTEGLYAHLNENGMEEGQHVP--LLSGKPSNLTGGNIVSYSKEQSFSSM 187
           S+YG  + N++ +  + ++ +    E+G + P   L G+ +     N  S    ++ +  
Sbjct: 124 SVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASNRPS----EAPARK 183

Query: 188 AVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLVLLLGE 247
               W YIF++LFQ  AGAVVLTD  +W +I+PF   K Y LSF+ + MH+LN V LLG+
Sbjct: 184 TAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFLLGD 243

Query: 248 TALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYLLMGLI 307
           T+LN L  P FRI++F LW+ I+V  QWI+HA  ++ WPY FLDLS+PY+PLWYL + ++
Sbjct: 244 TSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGVAVM 303

Query: 308 HIPSYGIFMLIITLKHKLM 325
           HIP + +F L+I LK+ L+
Sbjct: 304 HIPCFAVFALVIKLKNYLL 314

BLAST of ClCG10G014030 vs. TAIR 10
Match: AT1G10660.4 (unknown protein; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G62960.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 274.6 bits (701), Expect = 1.0e-73
Identity = 144/319 (45.14%), Positives = 206/319 (64.58%), Query Frame = 0

Query: 8   DATTLGYWLNWRVLISAIWVFASFTFALWMIWKYEVKDRLGHSRQATQQDKNKLRSCEAW 67
           D T   YWLNWRVL+ A+ + A    A  +IWKYE K R    R++ ++    L   EAW
Sbjct: 4   DTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRR--RQRESQRELPGTLFQDEAW 63

Query: 68  TPCLKQIHPIWLLAFRVCAFGLMLASLIVKALANGASMFYYYTQWTFTLLTIYFLCGSLI 127
           T C K+IHP+WLLAFRV +F  ML  LI   + +GA +FY+YTQWTFTL+T+YF   S++
Sbjct: 64  TTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFGYASVL 123

Query: 128 SIYGVFLCNRKRTEGLYAHLNENGMEEGQHVP--LLSGKPSNLTGGNIVSYSKEQSFSSM 187
           S+YG  + N++ +  + ++ +    E+G + P   L G+ +     N  S    ++ +  
Sbjct: 124 SVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASNRPS----EAPARK 183

Query: 188 AVNIWSYIFEVLFQINAGAVVLTDCTYWFVIFPFLTIKDYNLSFMTINMHTLNLVLLLGE 247
               W YIF++LFQ  AGAVVLTD  +W +I+PF   K Y LSF+ + MH+LN V LLG+
Sbjct: 184 TAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFLLGD 243

Query: 248 TALNCLTLPRFRISFFFLWTGIYVISQWIVHAFVSIGWPYPFLDLSAPYSPLWYLLMGLI 307
           T+LN L  P FRI++F LW+ I+V  QWI+HA  ++ WPY FLDLS+PY+PLWYL + ++
Sbjct: 244 TSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGVAVM 303

Query: 308 HIPSYGIFMLIITLKHKLM 325
           HIP + +F L+I LK+ L+
Sbjct: 304 HIPCFAVFALVIKLKNYLL 314

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038903193.1  6.1e-177  93.11         uncharacterized protein LOC120089852 [Benincasa hispida]
XP_008443544.1  6.3e-174  90.72         PREDICTED: uncharacterized protein LOC103487109 [Cucumis melo]
XP_004141192.2  5.0e-171  89.82         uncharacterized protein LOC101218542 isoform X1 [Cucumis sativus] >KGN59688.1 hy...
XP_022983923.1  1.4e-168  87.43         uncharacterized protein LOC111482400 isoform X1 [Cucurbita maxima]
XP_022983924.1  1.4e-168  87.43         uncharacterized protein LOC111482400 isoform X2 [Cucurbita maxima]

Match Name  E-value   Identity (%)  Description
A0A1S3B8A1  3.0e-174  90.72         uncharacterized protein LOC103487109 OS=Cucumis melo OX=3656 GN=LOC103487109 PE=...
A0A0A0LD20  2.4e-171  89.82         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G838670 PE=4 SV=1
A0A6J1J8Z5  6.6e-169  87.43         uncharacterized protein LOC111482400 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1J931  6.6e-169  87.43         uncharacterized protein LOC111482400 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1F1Y9  1.0e-166  87.09         uncharacterized protein LOC111441456 isoform X2 OS=Cucurbita moschata OX=3662 GN...

Match Name   E-value  Identity (%)  Description
AT5G62960.1  2.3e-81  44.05         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G27770.1  7.3e-80  46.15         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G10660.2  1.0e-73  45.14         unknown protein; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structu...
AT1G10660.3  1.0e-73  45.14         unknown protein; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structu...
AT1G10660.4  1.0e-73  45.14         unknown protein; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structu...
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source   Source Term     Source Description      Alignment
None      No IPR available  PANTHER  PTHR12242       UNCHARACTERIZED         coord: 4..331
None      No IPR available  PANTHER  PTHR12242:SF29  PLANT/MGF10-16 PROTEIN  coord: 4..331

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG10G014030.2  ClCG10G014030.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane