Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAGCATCAAATTTCCACCTAAAATCCATGGCTTCTCTCAAATATTTCCTTCTCTGTCCCTTTGTTTTCCTCTGCGGAAGCTACACCTTTGCCAATGGAGTCTTCAACTCCAATGATGGATCTCATTCCGGTCCTAGGGCTTGGCCATTACCCGACCCAAGTGCTGGTCCAGGAGTCGATAGAGGGGTAAAGAATGTTGGGGTTGGCCCAAGAGCCGGACCGACAGCTGGTCCAAGAGTCAAGGGAGGAGTAACTAATGTCATTGCCGGTCCGAGAGCCGAACCAAAAGCTGACCTGGAAGTCGAGGGAGGGGTAACTAATGTCAATGCTGGCCCAAGAGCCGGACCGAGAGCTAGCCTAGGAGTCGAGGGAGGGGCAAGTAATGTCAATGCTGGTCCGAGAGCAGGACCTAAAGCTAGCCTGGGAGTCGAGGGAGGGATAAACAATATCGGTGCTAGTCCGAGAGTTGGACCAAAAGCTGGCCTAGGAGTTGAGGGAGGGGTAAGCAATATTGGTGCTAGTTCGAGAGGTGGACCGAAAGCTGGCCCGGGAGCCGAGGAAGGCGTAAGCAATGTCGGTGCTGGTCCAAGAGCCGGACCCAAATCTGGCTCAGAACCTAAGGTAGGGGTAAGTGGTATTAGAGCTGGTCCAAGAGCGAGGCCAAAAGGTGTTAATTCAATTGTTAACGGAGTCGGAGTCGGAGTCGGAGTCGGAGTTGGGTACAAGCCAGGATTTGGACCTCCAGGATTTTTGCCTCCTGGATTTGGGTCAAGGCCAGGGTATTGGCCTAGGCCAGGATTTGAACCGTACGATGATTGCATATTGGGCTATGTTTGTCCAGCAAATGAAGCTAGGGAATGCAGCAAATTTGAGTATGGAACTTGCCATTCTTATAACTTTCATCCATTGACGGCTTCTACGGACCTACACGAAGTTGACATCAATTGGGCCAGAAGCAAGCCTTTTGCAACGGCCCAAAATGGTGGATCTGGACCAGTTATTCAAATCGACTCAGCCCACTAAGAAGGCCCAAAAAGGTGTCTAATTAAATGTTTTCATGGCCATCTCCTTGCTATCATGCAATATTAAATAATCTTTCACCTCTTAAATAAAAAAGAAGCTTTAGAAAAACATAAAATAAAGTTGCAATATTTAGTACAATCCTATTTCATATTTTGTCTAGTCCACTACTCTATTCGCTTCTATCATTTATCTACCACGCTTTTTCAATAAACATTTCACATCTAGGAGTAGATCCA
mRNA sequence
AAAGCATCAAATTTCCACCTAAAATCCATGGCTTCTCTCAAATATTTCCTTCTCTGTCCCTTTGTTTTCCTCTGCGGAAGCTACACCTTTGCCAATGGAGTCTTCAACTCCAATGATGGATCTCATTCCGGTCCTAGGGCTTGGCCATTACCCGACCCAAGTGCTGGTCCAGGAGTCGATAGAGGGGTAAAGAATGTTGGGGTTGGCCCAAGAGCCGGACCGACAGCTGGTCCAAGAGTCAAGGGAGGAGTAACTAATGTCATTGCCGGTCCGAGAGCCGAACCAAAAGCTGACCTGGAAGTCGAGGGAGGGGTAACTAATGTCAATGCTGGCCCAAGAGCCGGACCGAGAGCTAGCCTAGGAGTCGAGGGAGGGGCAAGTAATGTCAATGCTGGTCCGAGAGCAGGACCTAAAGCTAGCCTGGGAGTCGAGGGAGGGATAAACAATATCGGTGCTAGTCCGAGAGTTGGACCAAAAGCTGGCCTAGGAGTTGAGGGAGGGGTAAGCAATATTGGTGCTAGTTCGAGAGGTGGACCGAAAGCTGGCCCGGGAGCCGAGGAAGGCGTAAGCAATGTCGGTGCTGGTCCAAGAGCCGGACCCAAATCTGGCTCAGAACCTAAGGTAGGGGTAAGTGGTATTAGAGCTGGTCCAAGAGCGAGGCCAAAAGGTGTTAATTCAATTGTTAACGGAGTCGGAGTCGGAGTCGGAGTCGGAGTTGGGTACAAGCCAGGATTTGGACCTCCAGGATTTTTGCCTCCTGGATTTGGGTCAAGGCCAGGGTATTGGCCTAGGCCAGGATTTGAACCGTACGATGATTGCATATTGGGCTATGTTTGTCCAGCAAATGAAGCTAGGGAATGCAGCAAATTTGAGTATGGAACTTGCCATTCTTATAACTTTCATCCATTGACGGCTTCTACGGACCTACACGAAGTTGACATCAATTGGGCCAGAAGCAAGCCTTTTGCAACGGCCCAAAATGGTGGATCTGGACCAGTTATTCAAATCGACTCAGCCCACTAAGAAGGCCCAAAAAGGTGTCTAATTAAATGTTTTCATGGCCATCTCCTTGCTATCATGCAATATTAAATAATCTTTCACCTCTTAAATAAAAAAGAAGCTTTAGAAAAACATAAAATAAAGTTGCAATATTTAGTACAATCCTATTTCATATTTTGTCTAGTCCACTACTCTATTCGCTTCTATCATTTATCTACCACGCTTTTTCAATAAACATTTCACATCTAGGAGTAGATCCA
Coding sequence (CDS)
ATGGCTTCTCTCAAATATTTCCTTCTCTGTCCCTTTGTTTTCCTCTGCGGAAGCTACACCTTTGCCAATGGAGTCTTCAACTCCAATGATGGATCTCATTCCGGTCCTAGGGCTTGGCCATTACCCGACCCAAGTGCTGGTCCAGGAGTCGATAGAGGGGTAAAGAATGTTGGGGTTGGCCCAAGAGCCGGACCGACAGCTGGTCCAAGAGTCAAGGGAGGAGTAACTAATGTCATTGCCGGTCCGAGAGCCGAACCAAAAGCTGACCTGGAAGTCGAGGGAGGGGTAACTAATGTCAATGCTGGCCCAAGAGCCGGACCGAGAGCTAGCCTAGGAGTCGAGGGAGGGGCAAGTAATGTCAATGCTGGTCCGAGAGCAGGACCTAAAGCTAGCCTGGGAGTCGAGGGAGGGATAAACAATATCGGTGCTAGTCCGAGAGTTGGACCAAAAGCTGGCCTAGGAGTTGAGGGAGGGGTAAGCAATATTGGTGCTAGTTCGAGAGGTGGACCGAAAGCTGGCCCGGGAGCCGAGGAAGGCGTAAGCAATGTCGGTGCTGGTCCAAGAGCCGGACCCAAATCTGGCTCAGAACCTAAGGTAGGGGTAAGTGGTATTAGAGCTGGTCCAAGAGCGAGGCCAAAAGGTGTTAATTCAATTGTTAACGGAGTCGGAGTCGGAGTCGGAGTCGGAGTTGGGTACAAGCCAGGATTTGGACCTCCAGGATTTTTGCCTCCTGGATTTGGGTCAAGGCCAGGGTATTGGCCTAGGCCAGGATTTGAACCGTACGATGATTGCATATTGGGCTATGTTTGTCCAGCAAATGAAGCTAGGGAATGCAGCAAATTTGAGTATGGAACTTGCCATTCTTATAACTTTCATCCATTGACGGCTTCTACGGACCTACACGAAGTTGACATCAATTGGGCCAGAAGCAAGCCTTTTGCAACGGCCCAAAATGGTGGATCTGGACCAGTTATTCAAATCGACTCAGCCCACTAA
Protein sequence
MASLKYFLLCPFVFLCGSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVGPRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNVNAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGVSNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGFGPPGFLPPGFGSRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLTASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH
Homology
BLAST of Clc10G06480 vs. NCBI nr
Match:
KAG6577377.1 (hypothetical protein SDJN03_24951, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 239.6 bits (610), Expect = 3.9e-59
Identity = 160/329 (48.63%), Postives = 193/329 (58.66%), Query Frame = 0
Query: 1 MASLKYFLLCPFVFLCGSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVG 60
M SLKYFLL PFVFLC S TFAN V NS+DGS G D VG G
Sbjct: 1 MGSLKYFLLSPFVFLCLSCTFANRVPNSDDGS----------------GFD-----VGAG 60
Query: 61 PRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNV 120
P A PTAGP V+ GV+NV AGP A EG V +V AG +AGP+A G EG S+V
Sbjct: 61 PGAIPTAGPGVEKGVSNVRAGPAA--------EGWVNDVRAGLKAGPKAGPGAEGWVSDV 120
Query: 121 ----NAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGA 180
AGP+AGPKA G E ++++ A PR GPKAG G EG VS++ A R GPKAGPGA
Sbjct: 121 KAGLRAGPKAGPKAGPGAEEWVSDVKAGPRAGPKAGPGAEGWVSDVKAGLRAGPKAGPGA 180
Query: 181 EEGVSNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGF 240
E VSNV AGP GP++ + GVS G R + V+ ++NG+G+G+GV +GY+ GF
Sbjct: 181 EGWVSNVKAGPTVGPRAWPGTEGGVSSSEGGVR---RDVDPMINGLGLGLGVDIGYRSGF 240
Query: 241 -----GPPGFLPPGFGSRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSYNF 300
G + PG G G ++C LGYVCP R C KF YG C +Y F
Sbjct: 241 RAGLGGGEHWFGPGIGIGGG-------GVSNECTLGYVCPTYGRRGCDKFSYGNCDTYGF 290
Query: 301 HPLTASTDLHEVDINWAR-SKPFATAQNG 320
HPL AS LHEV++ WA+ SKP AT QNG
Sbjct: 301 HPLMASMHLHEVEMKWAKGSKPAATPQNG 290
BLAST of Clc10G06480 vs. NCBI nr
Match:
KGN56231.1 (hypothetical protein Csa_011503 [Cucumis sativus])
HSP 1 Score: 236.1 bits (601), Expect = 4.3e-58
Identity = 155/336 (46.13%), Postives = 177/336 (52.68%), Query Frame = 0
Query: 1 MASLKYFLLCPFVFLCGSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVG 60
MASLKYFLL PF+FLC SYTFANGVFN +DG G + P PDPSAGP VDRGV N G+G
Sbjct: 1 MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIG 60
Query: 61 PRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNV 120
P+A
Sbjct: 61 PKA--------------------------------------------------------- 120
Query: 121 NAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGV 180
GP+AGLGV GG+SN+ S GPKAGPG +E +
Sbjct: 121 ---------------------------GPRAGLGV-GGISNVDDGSDPGPKAGPGVKEEM 180
Query: 181 SNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGFGPPG 240
SNVGAGPR PK+GVS I AGPRA PKGV+ IV G+GVGVGV + PP
Sbjct: 181 SNVGAGPRV-------PKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNL-------PPI 236
Query: 241 FLPPGFGSR--PGYWPRPG---FEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLT 300
F P G R PG W PG EPY++C+LGYVCP N C K YG C SYNF PL+
Sbjct: 241 FGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLS 236
Query: 301 ASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH 332
AST+LH+V INWA+SK TAQ+G SGP I IDSAH
Sbjct: 301 ASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH 236
BLAST of Clc10G06480 vs. NCBI nr
Match:
KAA0060661.1 (hypothetical protein E6C27_scaffold22G005540 [Cucumis melo var. makuwa] >TYK02214.1 hypothetical protein E5676_scaffold18G00010 [Cucumis melo var. makuwa])
HSP 1 Score: 221.9 bits (564), Expect = 8.4e-54
Identity = 147/335 (43.88%), Postives = 167/335 (49.85%), Query Frame = 0
Query: 1 MASLKYFLLCPFVFLCGSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVG 60
MASLKYFLL PF+FLC SYTFA+GVFN + G G + P PDPSAGPGVD GV N+G+G
Sbjct: 1 MASLKYFLLSPFLFLCLSYTFADGVFNYDHGLDFGSMSSPTPDPSAGPGVDIGVSNIGIG 60
Query: 61 PRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNV 120
P+AGP AG
Sbjct: 61 PKAGPRAG---------------------------------------------------- 120
Query: 121 NAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGV 180
LG+ GGI+++ P
Sbjct: 121 -----------LGIGGGISDVDDEP----------------------------------- 180
Query: 181 SNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGFGPPG 240
GP+AGPK+ K+GVSGI AGPRA PKGVN G GVGVGV P FG P
Sbjct: 181 -----GPKAGPKASGGHKLGVSGIEAGPRAGPKGVN------GFGVGVGVDLPPVFGGPK 221
Query: 241 F-LPPGFGSRPGYWPRPG---FEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLTA 300
L PG PG W RPG EPY +C+LGYVCP N CSKF YG C SYNFHPL+A
Sbjct: 241 IGLKPG----PGGWYRPGPIIQEPYGNCMLGYVCP-NRPWACSKFAYGLCDSYNFHPLSA 221
Query: 301 STDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH 332
STDLHEV INWA+SKP ATAQ+G SGP +DSAH
Sbjct: 301 STDLHEVKINWAKSKPDATAQHGESGPATHVDSAH 221
BLAST of Clc10G06480 vs. NCBI nr
Match:
XP_023551823.1 (fibroin heavy chain-like isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 208.0 bits (528), Expect = 1.3e-49
Identity = 130/265 (49.06%), Postives = 161/265 (60.75%), Query Frame = 0
Query: 61 PRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNV 120
P A PTAGP V+ GV+NV AGP A EG V +V AG +AGP+A G EG S+V
Sbjct: 9 PGAVPTAGPGVEKGVSNVRAGPAA--------EGWVNDVWAGTKAGPKAGPGAEGWVSDV 68
Query: 121 NAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGV 180
AGPRAG KA G E ++++ A PR GPKAG G EG VS++ A R GPKAGPGAE V
Sbjct: 69 KAGPRAGLKAGPGAEEWVSDVKAGPRAGPKAGPGAEGWVSDVKAGPRAGPKAGPGAEGWV 128
Query: 181 SNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGF---- 240
+NV AGP GP++ + GVS G R + V+ ++NG+G+G+GV +GY+ GF
Sbjct: 129 NNVKAGPTVGPRAWPGTEGGVSSSEGGVR---RDVDPMINGLGLGLGVDIGYRSGFRAGV 188
Query: 241 -GPPGFLPPGFGSRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLT 300
G + PG G R ++C LGYVCP R C KF YG C SY FHPL
Sbjct: 189 GGGEHWFGPGIGGR---------GVSNECTLGYVCPTYGRRGCDKFSYGNCDSYGFHPLM 248
Query: 301 ASTDLHEVDINWAR-SKPFATAQNG 320
AS LHEV++ WA+ SKP AT QNG
Sbjct: 249 ASMQLHEVEMKWAKGSKPAATPQNG 253
BLAST of Clc10G06480 vs. NCBI nr
Match:
XP_022929340.1 (fibroin heavy chain-like [Cucurbita moschata])
HSP 1 Score: 194.9 bits (494), Expect = 1.1e-45
Identity = 140/325 (43.08%), Postives = 168/325 (51.69%), Query Frame = 0
Query: 1 MASLKYFLLCPFVFLCGSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVG 60
M SLKYFLL PFVFLC S TFAN V NS+DGS G D VG G
Sbjct: 1 MGSLKYFLLSPFVFLCLSCTFANRVPNSDDGS----------------GFD-----VGAG 60
Query: 61 PRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNV 120
P A PTAGP V+ GV+NV AGP A EG V +V AG +AGP+A G EG S+V
Sbjct: 61 PGAIPTAGPGVEKGVSNVRAGPAA--------EGWVNDVWAGLKAGPKAGPGAEGWVSDV 120
Query: 121 NAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGV 180
AG RAGPKA G EG ++N+ A P VGP+A G EGGVS SS GG +
Sbjct: 121 KAGLRAGPKAGPGAEGWVSNVKAGPTVGPRAWPGTEGGVS----SSEGGVR--------- 180
Query: 181 SNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGF---- 240
+ V+ ++NG+G+G+GV +GY+ GF
Sbjct: 181 --------------------------------RDVDPMINGLGLGLGVDIGYRSGFRAGL 240
Query: 241 -GPPGFLPPGFGSRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLT 300
G + PG G G ++C LGYVCP R C KF YG C +Y FHPL
Sbjct: 241 GGGEHWFGPGIGIGGG-------GVSNECTLGYVCPTYGRRGCDKFSYGNCDTYGFHPLM 244
Query: 301 ASTDLHEVDINWAR-SKPFATAQNG 320
AS LHEV++ WA+ SKP AT QNG
Sbjct: 301 ASMHLHEVEMKWAKGSKPAATPQNG 244
BLAST of Clc10G06480 vs. ExPASy TrEMBL
Match:
A0A0A0L7X7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G104850 PE=4 SV=1)
HSP 1 Score: 236.1 bits (601), Expect = 2.1e-58
Identity = 155/336 (46.13%), Postives = 177/336 (52.68%), Query Frame = 0
Query: 1 MASLKYFLLCPFVFLCGSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVG 60
MASLKYFLL PF+FLC SYTFANGVFN +DG G + P PDPSAGP VDRGV N G+G
Sbjct: 1 MASLKYFLLSPFLFLCLSYTFANGVFNYDDGLGFGSMSSPTPDPSAGP-VDRGVSNFGIG 60
Query: 61 PRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNV 120
P+A
Sbjct: 61 PKA--------------------------------------------------------- 120
Query: 121 NAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGV 180
GP+AGLGV GG+SN+ S GPKAGPG +E +
Sbjct: 121 ---------------------------GPRAGLGV-GGISNVDDGSDPGPKAGPGVKEEM 180
Query: 181 SNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGFGPPG 240
SNVGAGPR PK+GVS I AGPRA PKGV+ IV G+GVGVGV + PP
Sbjct: 181 SNVGAGPRV-------PKLGVSSIEAGPRAGPKGVDPIVTGLGVGVGVNL-------PPI 236
Query: 241 FLPPGFGSR--PGYWPRPG---FEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLT 300
F P G R PG W PG EPY++C+LGYVCP N C K YG C SYNF PL+
Sbjct: 241 FGGPKMGIRPGPGGWYGPGPIIQEPYNNCMLGYVCPTNRPWACGKVGYGLCESYNFRPLS 236
Query: 301 ASTDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH 332
AST+LH+V INWA+SK TAQ+G SGP I IDSAH
Sbjct: 301 ASTELHDVKINWAKSKSVETAQHGESGPGIHIDSAH 236
BLAST of Clc10G06480 vs. ExPASy TrEMBL
Match:
A0A5A7V4J6 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold18G00010 PE=4 SV=1)
HSP 1 Score: 221.9 bits (564), Expect = 4.1e-54
Identity = 147/335 (43.88%), Postives = 167/335 (49.85%), Query Frame = 0
Query: 1 MASLKYFLLCPFVFLCGSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVG 60
MASLKYFLL PF+FLC SYTFA+GVFN + G G + P PDPSAGPGVD GV N+G+G
Sbjct: 1 MASLKYFLLSPFLFLCLSYTFADGVFNYDHGLDFGSMSSPTPDPSAGPGVDIGVSNIGIG 60
Query: 61 PRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNV 120
P+AGP AG
Sbjct: 61 PKAGPRAG---------------------------------------------------- 120
Query: 121 NAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGV 180
LG+ GGI+++ P
Sbjct: 121 -----------LGIGGGISDVDDEP----------------------------------- 180
Query: 181 SNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGFGPPG 240
GP+AGPK+ K+GVSGI AGPRA PKGVN G GVGVGV P FG P
Sbjct: 181 -----GPKAGPKASGGHKLGVSGIEAGPRAGPKGVN------GFGVGVGVDLPPVFGGPK 221
Query: 241 F-LPPGFGSRPGYWPRPG---FEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLTA 300
L PG PG W RPG EPY +C+LGYVCP N CSKF YG C SYNFHPL+A
Sbjct: 241 IGLKPG----PGGWYRPGPIIQEPYGNCMLGYVCP-NRPWACSKFAYGLCDSYNFHPLSA 221
Query: 301 STDLHEVDINWARSKPFATAQNGGSGPVIQIDSAH 332
STDLHEV INWA+SKP ATAQ+G SGP +DSAH
Sbjct: 301 STDLHEVKINWAKSKPDATAQHGESGPATHVDSAH 221
BLAST of Clc10G06480 vs. ExPASy TrEMBL
Match:
A0A6J1EU53 (fibroin heavy chain-like OS=Cucurbita moschata OX=3662 GN=LOC111435943 PE=4 SV=1)
HSP 1 Score: 194.9 bits (494), Expect = 5.3e-46
Identity = 140/325 (43.08%), Postives = 168/325 (51.69%), Query Frame = 0
Query: 1 MASLKYFLLCPFVFLCGSYTFANGVFNSNDGSHSGPRAWPLPDPSAGPGVDRGVKNVGVG 60
M SLKYFLL PFVFLC S TFAN V NS+DGS G D VG G
Sbjct: 1 MGSLKYFLLSPFVFLCLSCTFANRVPNSDDGS----------------GFD-----VGAG 60
Query: 61 PRAGPTAGPRVKGGVTNVIAGPRAEPKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNV 120
P A PTAGP V+ GV+NV AGP A EG V +V AG +AGP+A G EG S+V
Sbjct: 61 PGAIPTAGPGVEKGVSNVRAGPAA--------EGWVNDVWAGLKAGPKAGPGAEGWVSDV 120
Query: 121 NAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGV 180
AG RAGPKA G EG ++N+ A P VGP+A G EGGVS SS GG +
Sbjct: 121 KAGLRAGPKAGPGAEGWVSNVKAGPTVGPRAWPGTEGGVS----SSEGGVR--------- 180
Query: 181 SNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNGVGVGVGVGVGYKPGF---- 240
+ V+ ++NG+G+G+GV +GY+ GF
Sbjct: 181 --------------------------------RDVDPMINGLGLGLGVDIGYRSGFRAGL 240
Query: 241 -GPPGFLPPGFGSRPGYWPRPGFEPYDDCILGYVCPANEARECSKFEYGTCHSYNFHPLT 300
G + PG G G ++C LGYVCP R C KF YG C +Y FHPL
Sbjct: 241 GGGEHWFGPGIGIGGG-------GVSNECTLGYVCPTYGRRGCDKFSYGNCDTYGFHPLM 244
Query: 301 ASTDLHEVDINWAR-SKPFATAQNG 320
AS LHEV++ WA+ SKP AT QNG
Sbjct: 301 ASMHLHEVEMKWAKGSKPAATPQNG 244
BLAST of Clc10G06480 vs. ExPASy TrEMBL
Match:
A0A7R9G286 (Hypothetical protein OS=Timema shepardi OX=629360 GN=TSIB3V08_LOCUS8372 PE=4 SV=1)
HSP 1 Score: 79.7 bits (195), Expect = 2.5e-11
Identity = 93/237 (39.24%), Postives = 99/237 (41.77%), Query Frame = 0
Query: 31 GSHSGPRAWPL--PDPSAGPGVDRGVKNVGVGPRAGPTAGPRVKGGV---TNVIAGPRAE 90
G GP P+ P GPG G G G GP GP V GG V GP
Sbjct: 533 GVGGGPGYGPVVGGGPGYGPGGAGGGPGYGPGVGGGPGYGPGVGGGPGYGPGVGGGPGYG 592
Query: 91 PKADLEVEGGVTNVNAGPRAGPRASLGVEGGASNVNAGPRAGPKASLGVEGGINNIGASP 150
P V G + V GP GP GV GG S + G GP G GG G
Sbjct: 593 PGGVGSVPGYGSGVGGGPGYGPG---GVGGGPSYGSGGVGGGPGYGPGARGG-PGYGPGV 652
Query: 151 RVGPKAGLGVEGGVSNIGASSRGGPKAGPGAEEGVSNVGAGPRAGPKSGSEPKVGVSGIR 210
GP G GV GG + GGP GPG VG GP GP G P G G+
Sbjct: 653 GGGPGYGPGVGGGPGYGPGGAGGGPGYGPG-------VGVGPGYGPGVGGGPGYG-PGVG 712
Query: 211 AGPRARPKGVNSIVNGVGVGVGVGVGYKPG--FGPPGFLPPGFGSRPGYWPRPGFEP 261
GP P GV S V G G GVG G GY PG G PG+ P G G PGY P G P
Sbjct: 713 GGPGYGPGGVGS-VPGYGSGVGGGPGYGPGGVGGGPGYGPGGVGGGPGYGPGVGGGP 756
BLAST of Clc10G06480 vs. ExPASy TrEMBL
Match:
A0A0M9A612 (Uncharacterized protein OS=Melipona quadrifasciata OX=166423 GN=WN51_05843 PE=4 SV=1)
HSP 1 Score: 67.8 bits (164), Expect = 9.8e-08
Identity = 76/212 (35.85%), Postives = 86/212 (40.57%), Query Frame = 0
Query: 49 GVDRGVKNVGVGP------RAGPTAGP-RVKGGVTNVIAGPRAEPKADLEVEGGVTNVNA 108
GV+ G VG G G AG V+ G V AG +EV G V A
Sbjct: 464 GVEAGSLGVGAGSVGVDAGSVGVDAGSLGVEAGSLGVGAGSVEVGAGSVEVGAGSLGVGA 523
Query: 109 GPRAGPRASLGVEGGASNVNAGPRAGPKASLGVEGGINNIGASPRVGPKAGLGVEGGVSN 168
G G SLGVE G+ V AG G SLGVE G +GA G LGVE G
Sbjct: 524 GSAGGDAGSLGVEAGSLGVGAGSAGGDAGSLGVEAGSLGVGAGSAGGDAGSLGVEAGSLG 583
Query: 169 IGASSRGGPKAGPGAEEGVSNVGAGPRAGPKSGSEPKVGVSGIRAGPRARPKGVNSIVNG 228
+GA S GG G E G VGAG G + G G+ AG G + G
Sbjct: 584 VGAGSAGGDAGSLGVEAGSLGVGAGSAGGDAGSLGVEAGSLGVGAGSAGGDAGSLGVEAG 643
Query: 229 -VGVGVGVGVGYKPGFG-PPGFLPPGFGSRPG 252
+GVG G G G G L G GS G
Sbjct: 644 SLGVGAGSAGGDAGSLGVEAGSLGVGAGSAGG 675
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAG6577377.1 | 3.9e-59 | 48.63 | hypothetical protein SDJN03_24951, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KGN56231.1 | 4.3e-58 | 46.13 | hypothetical protein Csa_011503 [Cucumis sativus] | [more] |
KAA0060661.1 | 8.4e-54 | 43.88 | hypothetical protein E6C27_scaffold22G005540 [Cucumis melo var. makuwa] >TYK0221... | [more] |
XP_023551823.1 | 1.3e-49 | 49.06 | fibroin heavy chain-like isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022929340.1 | 1.1e-45 | 43.08 | fibroin heavy chain-like [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0L7X7 | 2.1e-58 | 46.13 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G104850 PE=4 SV=1 | [more] |
A0A5A7V4J6 | 4.1e-54 | 43.88 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A6J1EU53 | 5.3e-46 | 43.08 | fibroin heavy chain-like OS=Cucurbita moschata OX=3662 GN=LOC111435943 PE=4 SV=1 | [more] |
A0A7R9G286 | 2.5e-11 | 39.24 | Hypothetical protein OS=Timema shepardi OX=629360 GN=TSIB3V08_LOCUS8372 PE=4 SV=... | [more] |
A0A0M9A612 | 9.8e-08 | 35.85 | Uncharacterized protein OS=Melipona quadrifasciata OX=166423 GN=WN51_05843 PE=4 ... | [more] |
Match Name | E-value | Identity | Description | |