Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, CDS, polypeptide
GTCCCCTTTCATTTCACTTCAACCATTCACAAATGGCTTCTCATAAGTTTTCTTCTTTTGTATTGTTCTTTCTTTTGTTAGGTATTGGGGTTTCATATGCAGCTAGAACTCTCTTAACTTATGGTGGAGGAGGACCAGTGAATATACCTGCATTTGCTTATGGTGCAGGCAATGGTGGAGGTGGTGGAAGCGGTGGTGGATATGGTCCACTTGGTGGTGGTGGTGGAGGTTATGGAAGTGGAGGTGGTGGTAGTTATAGTTCTGTAGGAGTACAATATGGTGTTGGAGGCTATGGAAGTGGAGGTGGAGGTGGAAGTGGTGGTGGTGAAGGATACGGTCCTAGTGGTGGCTATGGTGGAGGAGGAGGTGGTGGGAGTGGCGGTGGTTCTGCTTATGGTCATGGTGGTTCTGCTTATGGAGGTGGTGGAGGAAGTGGCGGAGGAGGTGGTTATGGTCCTGGTGGTGGTGGATATGGAGGGGGTGGTGGAAATGGTGGTGGAGCGAGTTATGGTCCTGGAGAAGGTAGTGGATATGGAGGTGTAGGATATGGTGGAGGTGGTGGTAGTGGAGGTGGTGTGGGGTATGGCCCGGGAGGTGGAGGGTATGGAGGAGGTGGTGGGAATGGAGGTGGGGCTGGATATGGTCTTGGGGGTGCAGGATATGGAGGAGGTGGTGGAAATGGCGGTGGGGCAGGATATGGTCATGAAGGTGCAGGTTATGGAGGGGGTGGTGGAAATGGTGGTGGAGGAGGATATGGCCATGAAGGTGCAGGGTATGGAGGGGGTGGTGGAAATGGCGGTGGAGGAGGATACGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGAAAATAGTGGCGGTCATGAAAGTGGAGGATATGGAGGAAGTGGAGGAAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGTAGTGGAGGAAATGGAGGCGGCAGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATGGAAACAGTGGGGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGGTGGAAATAGCGGTGGGGCAGGATATGGTGGAGAACATGGAGCAGGGTATGGTGGTGGTGGTGGAAATGGTGGGGGTGGAGGTGGAGGAGCAGGAAGTGGCTCAGGAGGAGAATATGGCTCTGGTTATGGAAGCGGAGCAGGTGGGGGAC
ATGGTGCAGGAGGAGGAAATAATGGTGGGGGAAGCGGAGGAGGCGGTGGAGGAGGATCGGGATATGGTGGAGGAAGTGCACATGGTGGTGCATACGGTGGACATGAAGAAGGCAATGGGTATGGTGGAGGAGGTGGTAGCGGGCAAGGTGGTGGACATGGTGGATATGCACCTTGAAATGGACATCATAATTGTTACCTTATATATATATATATTTATATTATATACAATTATAAAAGAAACAAATAGGGAAAATACATATTATCATTTATCCTTTCTTAATTCCCTATGTAATCAAAATATATTTACGTGTATCAAAGATTGGGCAAAGGGATTAGTTATATGTAGTAAGAAAAGTGTGTGATATATATGTCAAAATGGGAACACTTGCCAAGTTGTAACTTCTTCATCTCCATATTTTGTTGAAGTAAAAGATTAAGGGGACTCTCCTATTTTTGACAAACACCTTATAGTTCTCTTTCAAAATATATAGGACTTAATTTGAAAATTAATGATGGTAGTTTTATTTGAAGTGAGACTCAATATAAATTACTTTTATTTCTTTCACCCAACTATGTTCTTTTTTTCTTAAAAGAAAAAATACATCACTCTCTATTTTATAATTATATATCAAATTGTCTCATATATTTTCTTTTCATATTTTTACTAATATTAAATTTAATTTAAAAATATTACTGATTTAAACATTTTATTTATAAACTAATAAATATTATATACAAGTACAAAAATATTACCTTTTATTTTTGAACAAATAACACGAGAGTAAAGGATGTAAATAGTGTTTAGTGACAATAGTTTCACCTAAGAAACGAGGAGAACACTAAATGAAAATAGTAGAGACACACAATATTGTTAACCCAATTCGATGACATAACATCTACATTTGGGAGTCCTTAACTCATAATAAAATTCGATGAGATTTCTTATTACTTTGATAAAAATTCAATCATAATCAACATATTATGATTTTAGTACATAAAATTTCACTGACTTATTCACTTTGTACTAATTACCATGATATAAATTGAATTATCACTACAATCAAAGATAGATTTTGATAACCAAAACCTTCGACCTTAATCATGCAACATTCTTATCTTATCACCAAAATAAGATGTTAATCAAAAGTGTCTTCCCATGAACTTTTGACCTTCATTAAATTTCAACATATGTAACGATTATGTACACAAAACTCTTAACCAACAAAAGTTGGATCAAGATAACACATCATCTTTTTCTGAAGACAATTTCATTGCAAAGTGATTGCTTTTAGCACTAATTTTCTTTTTCTTTCAGCGGGATATACTAGGAGAATGA
mRNA sequence
GTCCCCTTTCATTTCACTTCAACCATTCACAAATGGCTTCTCATAAGTTTTCTTCTTTTGTATTGTTCTTTCTTTTGTTAGGTATTGGGGTTTCATATGCAGCTAGAACTCTCTTAACTTATGGTGGAGGAGGACCAGTGAATATACCTGCATTTGCTTATGGTGCAGGCAATGGTGGAGGTGGTGGAAGCGGTGGTGGATATGGTCCACTTGGTGGTGGTGGTGGAGGTTATGGAAGTGGAGGTGGTGGTAGTTATAGTTCTGTAGGAGTACAATATGGTGTTGGAGGCTATGGAAGTGGAGGTGGAGGTGGAAGTGGTGGTGGTGAAGGATACGGTCCTAGTGGTGGCTATGGTGGAGGAGGAGGTGGTGGGAGTGGCGGTGGTTCTGCTTATGGTCATGGTGGTTCTGCTTATGGAGGTGGTGGAGGAAGTGGCGGAGGAGGTGGTTATGGTCCTGGTGGTGGTGGATATGGAGGGGGTGGTGGAAATGGTGGTGGAGCGAGTTATGGTCCTGGAGAAGGTAGTGGATATGGAGGTGTAGGATATGGTGGAGGTGGTGGTAGTGGAGGTGGTGTGGGGTATGGCCCGGGAGGTGGAGGGTATGGAGGAGGTGGTGGGAATGGAGGTGGGGCTGGATATGGTCTTGGGGGTGCAGGATATGGAGGAGGTGGTGGAAATGGCGGTGGGGCAGGATATGGTCATGAAGGTGCAGGTTATGGAGGGGGTGGTGGAAATGGTGGTGGAGGAGGATATGGCCATGAAGGTGCAGGGTATGGAGGGGGTGGTGGAAATGGCGGTGGAGGAGGATACGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGAAAATAGTGGCGGTCATGAAAGTGGAGGATATGGAGGAAGTGGAGGAAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGTAGTGGAGGAAATGGAGGCGGCAGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATGGAAACAGTGGGGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGGTGGAAATAGCGGTGGGGCAGGATATGGTGGAGAACATGGAGCAGGGTATGGTGGTGGTGGTGGAAATGGTGGGGGTGGAGGTGGAGGAGCAGGAAGTGGCTCAGGAGGAGAATATGGCTCTGGTTATGGAAGCGGAGCAGGTGGGGGAC
ATGGTGCAGGAGGAGGAAATAATGGTGGGGGAAGCGGAGGAGGCGGTGGAGGAGGATCGGGATATGGTGGAGGAAGTGCACATGGTGGTGCATACGGTGGACATGAAGAAGGCAATGGGTATGGTGGAGGAGGTGGTAGCGGGCAAGCGGGATATACTAGGAGAATGA
Coding sequence (CDS)
ATGGCTTCTCATAAGTTTTCTTCTTTTGTATTGTTCTTTCTTTTGTTAGGTATTGGGGTTTCATATGCAGCTAGAACTCTCTTAACTTATGGTGGAGGAGGACCAGTGAATATACCTGCATTTGCTTATGGTGCAGGCAATGGTGGAGGTGGTGGAAGCGGTGGTGGATATGGTCCACTTGGTGGTGGTGGTGGAGGTTATGGAAGTGGAGGTGGTGGTAGTTATAGTTCTGTAGGAGTACAATATGGTGTTGGAGGCTATGGAAGTGGAGGTGGAGGTGGAAGTGGTGGTGGTGAAGGATACGGTCCTAGTGGTGGCTATGGTGGAGGAGGAGGTGGTGGGAGTGGCGGTGGTTCTGCTTATGGTCATGGTGGTTCTGCTTATGGAGGTGGTGGAGGAAGTGGCGGAGGAGGTGGTTATGGTCCTGGTGGTGGTGGATATGGAGGGGGTGGTGGAAATGGTGGTGGAGCGAGTTATGGTCCTGGAGAAGGTAGTGGATATGGAGGTGTAGGATATGGTGGAGGTGGTGGTAGTGGAGGTGGTGTGGGGTATGGCCCGGGAGGTGGAGGGTATGGAGGAGGTGGTGGGAATGGAGGTGGGGCTGGATATGGTCTTGGGGGTGCAGGATATGGAGGAGGTGGTGGAAATGGCGGTGGGGCAGGATATGGTCATGAAGGTGCAGGTTATGGAGGGGGTGGTGGAAATGGTGGTGGAGGAGGATATGGCCATGAAGGTGCAGGGTATGGAGGGGGTGGTGGAAATGGCGGTGGAGGAGGATACGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGAAAATAGTGGCGGTCATGAAAGTGGAGGATATGGAGGAAGTGGAGGAAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGTAGTGGAGGAAATGGAGGCGGCAGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATGGAAACAGTGGGGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGGTGGAAATAGCGGTGGGGCAGGATATGGTGGAGAACATGGAGCAGGGTATGGTGGTGGTGGTGGAAATGGTGGGGGTGGAGGTGGAGGAGCAGGAAGTGGCTCAGGAGGAGAATATGGCTCTGGTTATGGAAGCGGAGCAGGTGGGGGACATGGTGCAGGAGGAGGAAATAATGGTGGGGGA
AGCGGAGGAGGCGGTGGAGGAGGATCGGGATATGGTGGAGGAAGTGCACATGGTGGTGCATACGGTGGACATGAAGAAGGCAATGGGTATGGTGGAGGAGGTGGTAGCGGGCAAGCGGGATATACTAGGAGAATGA
Protein sequence
MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPLGGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSAYGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGGYGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMETVGEIAVVMKVEDMEAVVEIAVGQDMVENMEQGMVVVVEMVGVEVEEQEVAQEENMALVMEAEQVGDMVQEEEIMVGEAEEAVEEDRDMVEEVHMVVHTVDMKKAMGMVEEVVAGKRDILGE*
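The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of the source database. Any codon containing an ambiguous base such as `N` is reported as `X`, which is consistent with the run of `X` in the protein sequence, and a stop codon is reported as `*`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0.

    Codons not found in the table (for example those containing N)
    become 'X'; a trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )
```

For example, `translate("ATGGCTTCTCATAAGTTT")` reproduces the first six residues of the protein sequence, `MASHKF`.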
Homology
BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match:
A0A0A0LH77 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G902220 PE=4 SV=1)
HSP 1 Score: 297.4 bits (760), Expect = 1.6e-76
Identity = 334/424 (78.77%), Positives = 340/424 (80.19%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGAS 240
Query: 241 YG-HEGAGYGG-GGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYG 300
YG EG+GYGG GGGNGG GHESGGYGG+GGNGG HESGGYG
Sbjct: 241 YGPGEGSGYGGVGGGNGG-------------------GHESGGYGGNGGNGGSHESGGYG 300
Query: 301 GSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESG 360
GSG NSGG GYGG G+G GGG GG GG G+G G G YG G+G G G
Sbjct: 301 GSGGNSGG---AGYGGEHGAGYGGGGGNGGGGGGGAGSGSG---GEYGSGYGSGAG---G 360
Query: 361 GYGGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESG-GYGGSEGHGSGHE 420
G+G GGN GG GG GGSG GG G Y GGHE G GYGG G G G
Sbjct: 361 GHGAGGGNNGGGSGGGGGGGSGYGGGSAHGGAY-------GGHEEGNGYGGGGGSGQGGG 388
Query: 421 SGGY 422
GGY
Sbjct: 421 HGGY 388
BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match:
A0A6J1HQX0 (fibroin heavy chain-like isoform X21 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 139.4 bits (350), Expect = 5.7e-29
Identity = 339/562 (60.32%), Positives = 367/562 (65.30%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHK S +FFLLLGIGVS AAR LLTYG G PVNIPAFAYGAG G G GSGGGYG L
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GG GGG G+GGG Y SVG +YGVGGYGSGGGGGSG G GYGP GG G GGGGGSGGG
Sbjct: 61 GGYGGGGGNGGGSGYGSVG-EYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGG 120
Query: 121 Y---------GHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVG 180
Y G+G G GGGSGGG GYGPGGGGYGGGGGNGGGA YGP GG G
Sbjct: 121 YGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGP------GGSG 180
Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE-GAGYG 240
YGGGGG+GGG GYG GGGYGGGGGNGGGAGYG G GYGGGGGNGGGAGYGHE G GYG
Sbjct: 181 YGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYG 240
Query: 241 GGGGNGGGGGYGHE-GAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGGYGGS 300
GGGGNGGG GYGHE G GYGGGGGNGGG GYGHE GGYGG G+G GHE GGYGG
Sbjct: 241 GGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGG 300
Query: 301 GGNGGGHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
GGNGGG GY GHE GGYGG GG+GG G G GG+G GG E GG
Sbjct: 301 GGNGGG---AGY--------GHEGGGYGGGGGNGGGAG----YGSGGAGSGAGGGEGGGS 360
Query: 361 GGSGGNGGGHESGGYGGSGGNGGGSHESGGYGGSG---GNGGGHESGGYGNSGGNSGGHE 420
G G +G G+ SGG GG+GG GG + GG GSG G GGGH GG G+SGG SGG
Sbjct: 361 GYGGEHGAGYGSGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGGSSGGGSGGGG 420
Query: 421 SGGYG-GSEGHGSGHESGGYGGSGGNGGGHESGGYGNSG-GNSGGHESGGYGG-SGGNGG 480
GG G G H + G GGN GG++ G + G GN G+ G Y +GGN G
Sbjct: 421 GGGSGYGLNKHEEYDKDKHEGYDGGNYGGYDGGKHEKYGRGNYRGYGRGKYEEYNGGNYG 480
Query: 481 GHESGGY---------GGSGGNGGGHESGGYGG-SGGNGGGHESGGY---------GGSG 523
G++ G + G GGN GG++ G YG GGN GG++ G + G G
Sbjct: 481 GYDGGKHEEYDKDKHEGYDGGNYGGYDGGNYGRYDGGNYGGYDGGKHEEYDKDKHEGYDG 538
BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match:
A0A6J1HWR4 (fibroin heavy chain-like isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 137.9 bits (346), Expect = 1.7e-28
Identity = 352/596 (59.06%), Positives = 371/596 (62.25%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHK S +FFLLLGIGVS AAR LLTYG G PVNIPAFAYGAG G G GSGGGYG L
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GG GGG G+GGG Y SVG +YGVGGYGSGGGGGSG G GYGP GG G GGGGGSGGG
Sbjct: 61 GGYGGGGGNGGGSGYGSVG-EYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGG 120
Query: 121 Y---------GHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVG 180
Y G+G G GGGSGGG GYGPGGGGYGGGGGNGGGA YGP GG G
Sbjct: 121 YGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGP------GGSG 180
Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE-GAGYG 240
YGGGGG+GGG GYG GGGYGGGGGNGGGAGYG G GYGGGGGNGGGAGYGHE G GYG
Sbjct: 181 YGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYG 240
Query: 241 GGGGNGGGGGYGHE-GAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGGYGGS 300
GGGGNGGG GYGHE G GYGGGGGNGGG GYGHE GGYGG G+G GHE GGYGG
Sbjct: 241 GGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGG 300
Query: 301 GGNGG----GHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHE 360
GGNGG GHE GGYGG G N GG GY GSGG+G GG E GG G G +G G+
Sbjct: 301 GGNGGGAGYGHEGGGYGGGGGNGGG---AGY-GSGGAGSGAGGGEGGGSGYGGEHGAGYG 360
Query: 361 SGGYGGSGGNGG-----------GHESGGYGGSGGNGGGSHESGGYGGSGGNGGG----- 420
SGG GG+GG GG G+ SG GG GG GGGS GG GG GG G G
Sbjct: 361 SGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGGS-SGGGSGGGGGGGSGYGLNK 420
Query: 421 --------HESGGYGNSGGNSGG-HESGGYGGSEGHGSG----HESGGYGGSGGNGGGHE 480
HE GN GG GG HE G G G+G G + G YGG +GG HE
Sbjct: 421 HEEYDKDKHEGYDGGNYGGYDGGKHEKYGRGNYRGYGRGKYEEYNGGNYGGY--DGGKHE 480
Query: 481 ----SGGYGNSGGNSGGHESG-----------GYGG---SGGNGGGHE----SGGYGGSG 523
G GGN GG++ G GY G G NGG HE G G
Sbjct: 481 EYDKDKHEGYDGGNYGGYDGGKHEEYEKDKPEGYDGGKYGGYNGGKHEEYDKDKHEGYDG 540
BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match:
A0A6J1HS53 (fibroin heavy chain-like isoform X20 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 128.6 bits (322), Expect = 1.0e-25
Identity = 353/610 (57.87%), Positives = 373/610 (61.15%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHK S +FFLLLGIGVS AAR LLTYG G PVNIPAFAYGAG G G GSGGGYG L
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GG GGG G+GGG Y SVG +YGVGGYGSGGGGGSG G GYGP GG G GGGGGSGGG
Sbjct: 61 GGYGGGGGNGGGSGYGSVG-EYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGG 120
Query: 121 Y---------GHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVG 180
Y G+G G GGGSGGG GYGPGGGGYGGGGGNGGGA YGP GG G
Sbjct: 121 YGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGP------GGSG 180
Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE-GAGYG 240
YGGGGG+GGG GYG GGGYGGGGGNGGGAGYG G GYGGGGGNGGGAGYGHE G GYG
Sbjct: 181 YGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYG 240
Query: 241 GGGGNGGGGGYGHE-GAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGGYGGS 300
GGGGNGGG GYGHE G GYGGGGGNGGG GYGHE GGYGG G+G GHE GGYGG
Sbjct: 241 GGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGG 300
Query: 301 GGNGG----GHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHE 360
GGNGG GHE GGYGG G N GG GY GSGG+G GG E GG G G +G G+
Sbjct: 301 GGNGGGAGYGHEGGGYGGGGGNGGG---AGY-GSGGAGSGAGGGEGGGSGYGGEHGAGYG 360
Query: 361 SGGYGGSGGNGG-----------GHESGGYGGSGGNGGGSHESGGYGGSGGNGGG----- 420
SGG GG+GG GG G+ SG GG GG GGGS GG GG GG G G
Sbjct: 361 SGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGGS-SGGGSGGGGGGGSGYGLNK 420
Query: 421 --------HESGGYGNSGGNSGG-HESGGYGGSEGHGSG----HESGGYGGSGGNGGGHE 480
HE GN GG GG HE G G G+G G + G YGG +GG HE
Sbjct: 421 HEEYDKDKHEGYDGGNYGGYDGGKHEKYGRGNYRGYGRGKYEEYNGGNYGGY--DGGKHE 480
Query: 481 ----SGGYGNSGGNSGGHESG-----------GYGG---SGGNGGGHES------GGYGG 523
G GGN GG++ G GY G G NGG HE GY G
Sbjct: 481 EYDKDKHEGYDGGNYGGYDGGKHEEYEKDKPEGYDGGKYGGYNGGKHEEYDKDKHEGYDG 540
BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match:
A0A6J1HV40 (fibroin heavy chain-like isoform X8 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 128.6 bits (322), Expect = 1.0e-25
Identity = 353/610 (57.87%), Positives = 373/610 (61.15%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHK S +FFLLLGIGVS AAR LLTYG G PVNIPAFAYGAG G G GSGGGYG L
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GG GGG G+GGG Y SVG +YGVGGYGSGGGGGSG G GYGP GG G GGGGGSGGG
Sbjct: 61 GGYGGGGGNGGGSGYGSVG-EYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGG 120
Query: 121 Y---------GHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVG 180
Y G+G G GGGSGGG GYGPGGGGYGGGGGNGGGA YGP GG G
Sbjct: 121 YGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGP------GGSG 180
Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE-GAGYG 240
YGGGGG+GGG GYG GGGYGGGGGNGGGAGYG G GYGGGGGNGGGAGYGHE G GYG
Sbjct: 181 YGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYG 240
Query: 241 GGGGNGGGGGYGHE-GAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGGYGGS 300
GGGGNGGG GYGHE G GYGGGGGNGGG GYGHE GGYGG G+G GHE GGYGG
Sbjct: 241 GGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGG 300
Query: 301 GGNGG----GHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHE 360
GGNGG GHE GGYGG G N GG GY GSGG+G GG E GG G G +G G+
Sbjct: 301 GGNGGGAGYGHEGGGYGGGGGNGGG---AGY-GSGGAGSGAGGGEGGGSGYGGEHGAGYG 360
Query: 361 SGGYGGSGGNGG-----------GHESGGYGGSGGNGGGSHESGGYGGSGGNGGG----- 420
SGG GG+GG GG G+ SG GG GG GGGS GG GG GG G G
Sbjct: 361 SGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGGS-SGGGSGGGGGGGSGYGLNK 420
Query: 421 --------HESGGYGNSGGNSGG-HESGGYGGSEGHGSG----HESGGYGGSGGNGGGHE 480
HE GN GG GG HE G G G+G G + G YGG +GG HE
Sbjct: 421 HEEYDKDKHEGYDGGNYGGYDGGKHEKYGRGNYRGYGRGKYEEYNGGNYGGY--DGGKHE 480
Query: 481 ----SGGYGNSGGNSGGHESG-----------GYGG---SGGNGGGHES------GGYGG 523
G GGN GG++ G GY G G NGG HE GY G
Sbjct: 481 EYDKDKHEGYDGGNYGGYDGGKHEEYEKDKPEGYDGGKYGGYNGGKHEEYDKDKHEGYDG 540
BLAST of CSPI03G46640 vs. NCBI nr
Match:
XP_031738892.1 (glycine-rich cell wall structural protein 1.8 isoform X1 [Cucumis sativus] >KAE8651439.1 hypothetical protein Csa_001535 [Cucumis sativus])
HSP 1 Score: 506.5 bits (1303), Expect = 3.7e-139
Identity = 471/543 (86.74%), Positives = 480/543 (88.40%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
Query: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGS 300
YGHEGAGYGGGGGNGGGGGYGHESGGYGG+ G+G GHESGGYGGS GHESGGYGGS
Sbjct: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGNGGNGGGHESGGYGGS-----GHESGGYGGS 300
Query: 301 GENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
G N+GGHESGGYGGSGG+GGNGG HESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY
Sbjct: 301 GGNNGGHESGGYGGSGGNGGNGGSHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
Query: 361 GGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHESGG 420
GGSGGN GG HESGGYGGS G+GGGHESGGYG SGGN GGHESGGYGGS G+G GHESGG
Sbjct: 361 GGSGGNSGG-HESGGYGGSEGHGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGG 420
Query: 421 YGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNGGGHESGGYGGSGGNG-----GG 480
YG GG HESGGYG +GGN GGHESGGYG SGGN GGHESGGYGGSGGN GG
Sbjct: 421 YG-----GGSHESGGYGGNGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNSGGAGYGG 480
Query: 481 HESGGYGGSGGNGGGHESGGYGGSGGN---------GGGHESGGYGGSGGNGGGHESG-G 529
GYGG GGNGGG G GSGG GGGH +GG GG+GGG G G
Sbjct: 481 EHGAGYGGGGGNGGGGGGGAGSGSGGEYGSGYGSGAGGGHGAGGGNNGGGSGGGGGGGSG 531
BLAST of CSPI03G46640 vs. NCBI nr
Match:
XP_031738894.1 (glycine-rich cell wall structural protein 1.8 isoform X3 [Cucumis sativus])
HSP 1 Score: 500.4 bits (1287), Expect = 2.6e-137
Identity = 472/538 (87.73%), Positives = 480/538 (89.22%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
Query: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGS 300
YGHEGAGYGGGGGNGGGGGYGHESGGYGG+ G+G GHESGGYGGS GHESGGYGGS
Sbjct: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGNGGNGGGHESGGYGGS-----GHESGGYGGS 300
Query: 301 GENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
G N+GGHESGGYGGSGG+GGNGG HESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY
Sbjct: 301 GGNNGGHESGGYGGSGGNGGNGGSHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
Query: 361 GGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHESGG 420
GGSGGN GG HESGGYGGS G+GGGHESGGYG SGGN GGHESGGYGGS G HESGG
Sbjct: 361 GGSGGNSGG-HESGGYGGSEGHGGGHESGGYGGSGGNGGGHESGGYGGS--GGGSHESGG 420
Query: 421 YGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNG-----GGHESGGYGGSGGNGGG 480
YGG+GGNGGGHESGGYGNSGGNSGGHESGGYGGSGGN GG GYGG GGNGGG
Sbjct: 421 YGGNGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNSGGAGYGGEHGAGYGGGGGNGGG 480
Query: 481 HESGGYGGSGGN-GGGHES---GGYGGSGGNGGGHESGGYGGSGGNGGGHESGG-YGG 529
G GSGG G G+ S GG+G GGN GG GG GG G GGG GG YGG
Sbjct: 481 GGGGAGSGSGGEYGSGYGSGAGGGHGAGGGNNGGGSGGGGGGGSGYGGGSAHGGAYGG 529
BLAST of CSPI03G46640 vs. NCBI nr
Match:
XP_031738893.1 (glycine-rich cell wall structural protein 1.8 isoform X2 [Cucumis sativus])
HSP 1 Score: 495.0 bits (1273), Expect = 1.1e-135
Identity = 472/541 (87.25%), Positives = 478/541 (88.35%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
Query: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGS 300
YGHEGAGYGGGGGNGGGGGYGHESGGYGG+ G+G GHESGGYGGSGGN GGHESGGYGGS
Sbjct: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGNGGNGGGHESGGYGGSGGNNGGHESGGYGGS 300
Query: 301 ---GENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHES 360
G N G HESGGY GGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGN GGHES
Sbjct: 301 GGNGGNGGSHESGGY---GGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNSGGHES 360
Query: 361 GGYGGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHE 420
GGYGGS G+GGG HESGGYGGSGGNGGGHESGGYG SGGN GGHESGGYG G HE
Sbjct: 361 GGYGGSEGHGGG-HESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYG-----GGSHE 420
Query: 421 SGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNG-----GGHESGGYGGSGGN 480
SGGYGG+GGNGGGHESGGYGNSGGNSGGHESGGYGGSGGN GG GYGG GGN
Sbjct: 421 SGGYGGNGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNSGGAGYGGEHGAGYGGGGGN 480
Query: 481 GGGHESGGYGGSGGN-GGGHES---GGYGGSGGNGGGHESGGYGGSGGNGGGHESGG-YG 529
GGG G GSGG G G+ S GG+G GGN GG GG GG G GGG GG YG
Sbjct: 481 GGGGGGGAGSGSGGEYGSGYGSGAGGGHGAGGGNNGGGSGGGGGGGSGYGGGSAHGGAYG 531
BLAST of CSPI03G46640 vs. NCBI nr
Match:
XP_031738895.1 (glycine-rich cell wall structural protein 1 isoform X4 [Cucumis sativus])
HSP 1 Score: 372.9 bits (956), Expect = 6.3e-99
Identity = 401/498 (80.52%), Positives = 407/498 (81.73%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
Query: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61 GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
Query: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGS 300
YGHEGAGYGGGGGNGGGGGY GHESGGYGG+GGNGGGHESGGYGGS
Sbjct: 241 YGHEGAGYGGGGGNGGGGGY---------------GHESGGYGGNGGNGGGHESGGYGGS 300
Query: 301 GENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
GHESGGYGGSG GG HESGGYGG+GGNGGGHESGGYG SGGN GGHESGGY
Sbjct: 301 -----GHESGGYGGSG-----GGSHESGGYGGNGGNGGGHESGGYGNSGGNSGGHESGGY 360
Query: 361 GGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHESG- 420
GGSGGN GG+ GYGG G GYG GGN GG GG G GSG E G
Sbjct: 361 GGSGGNSGGA----GYGGEHG-------AGYGGGGGNGGGG-----GGGAGSGSGGEYGS 420
Query: 421 GYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESG 480
GYG G GGGH +GG GN+GG SGG GG GGSG GG G Y GGHE G
Sbjct: 421 GYG--SGAGGGHGAGG-GNNGGGSGG---GGGGGSGYGGGSAHGGAY-------GGHEEG 443
Query: 481 -GYGGSGGNGGGHESGGY 497
GYGG GG+G G GGY
Sbjct: 481 NGYGGGGGSGQGGGHGGY 443
BLAST of CSPI03G46640 vs. NCBI nr
Match:
XP_038887343.1 (glycine-rich cell wall structural protein-like isoform X1 [Benincasa hispida])
HSP 1 Score: 246.1 bits (627), Expect = 9.0e-61
Identity = 345/488 (70.70%), Positives = 358/488 (73.36%), Query Frame = 0
Query: 1 MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
MASHK SSFV FFLLLGIGVS AARTLLTY G VNIPAFAYGAGNG GGGSGGGYGPL
Sbjct: 1 MASHKLSSFV-FFLLLGIGVSSAARTLLTYADGESVNIPAFAYGAGNGAGGGSGGGYGPL 60
Query: 61 G--GGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGG 120
G GGGGGYG GGG SY S G QYGVGGYGSGGGGGSGGG GYGPSGGYGGGGGGGSGGG
Sbjct: 61 GGHGGGGGYGGGGGSSYGSEG-QYGVGGYGSGGGGGSGGGGGYGPSGGYGGGGGGGSGGG 120
Query: 121 SAYGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGA-------SYGPGEGSGYGGVG 180
SAYGHGGSAYGGGGG G G GYGPGGGGYGGGGGNGGGA YGPG G+GYGGVG
Sbjct: 121 SAYGHGGSAYGGGGGGGEGVGYGPGGGGYGGGGGNGGGAGYGPSGGGYGPGGGNGYGGVG 180
Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE------ 240
YGGGGGSGGGVGYG GGGGYGGGGGNGGGAGYG GG+GYGGGGGNGGGAGYGHE
Sbjct: 181 YGGGGGSGGGVGYGLGGGGYGGGGGNGGGAGYGPGGSGYGGGGGNGGGAGYGHEXGYGPG 240
Query: 241 GAGYGGGGGNGGGGGYGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGG 300
G+GYGGGGGNGGG GYGHEGAGYGGGGGNGGG GYGHESGGYGG G G GHE G
Sbjct: 241 GSGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGAGYGHESGGYGGGGGSGGGAGYGHEGAG 300
Query: 301 YGGSGGNGGGHESGGYGGSGENSGGHESGGYGGSGGSGGNGG-GHESGGYGGSGGNGGGH 360
YGG GGNGGG GY GHE GYGG GG+GG G GHE GYGG GGNGGG
Sbjct: 301 YGGGGGNGGG---AGY--------GHEGAGYGGGGGNGGGTGYGHEGAGYGGGGGNGGGA 360
Query: 361 ESGGYGGSGGNGGGHESGGYGGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGG 420
G GG G G G+ SGG G+G GG H GYGG GG+GGG G +G SGG
Sbjct: 361 GYGHEGGEYGGGAGYGSGGGEGNGSGYGGEH-GKGYGGGGGSGGG------GGAGNGSGG 420
Query: 421 HESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNGGG 469
GYG G G GH +GG GG+GGG GG GG GGHE G G+GG GG
Sbjct: 421 EYGSGYG--SGAGGGHGAGGGSSGGGSGGGSGYGGGSAPGGAYGGHEEG--SGNGGGGGS 464
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0LH77 | 1.6e-76 | 78.77 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G902220 PE=4 SV=1
A0A6J1HQX0 | 5.7e-29 | 60.32 | fibroin heavy chain-like isoform X21 OS=Cucurbita maxima OX=3661 GN=LOC111466985...
A0A6J1HWR4 | 1.7e-28 | 59.06 | fibroin heavy chain-like isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111466985 ...
A0A6J1HS53 | 1.0e-25 | 57.87 | fibroin heavy chain-like isoform X20 OS=Cucurbita maxima OX=3661 GN=LOC111466985...
A0A6J1HV40 | 1.0e-25 | 57.87 | fibroin heavy chain-like isoform X8 OS=Cucurbita maxima OX=3661 GN=LOC111466985 ...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_031738892.1 | 3.7e-139 | 86.74 | glycine-rich cell wall structural protein 1.8 isoform X1 [Cucumis sativus] >KAE8...
XP_031738894.1 | 2.6e-137 | 87.73 | glycine-rich cell wall structural protein 1.8 isoform X3 [Cucumis sativus]
XP_031738893.1 | 1.1e-135 | 87.25 | glycine-rich cell wall structural protein 1.8 isoform X2 [Cucumis sativus]
XP_031738895.1 | 6.3e-99 | 80.52 | glycine-rich cell wall structural protein 1 isoform X4 [Cucumis sativus]
XP_038887343.1 | 9.0e-61 | 70.70 | glycine-rich cell wall structural protein-like isoform X1 [Benincasa hispida]
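The E-value and identity columns can be recovered from the HSP header lines of each hit, for example `HSP 1 Score: 297.4 bits (760), Expect = 1.6e-76` followed by `Identity = 334/424 (78.77%), ...`. The following is a minimal sketch of that extraction, assuming the header lines keep exactly this layout; the function name `parse_hsp` and the dictionary keys are illustrative, not part of any BLAST tool. The identity percentage is recomputed from the match counts rather than read from the text.

```python
import re

# Patterns for the two HSP header lines as they appear on this page.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
HSP_IDENTITY = re.compile(r"Identity\s*=\s*(\d+)/(\d+)")


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract bit score, raw score, E-value and percent identity."""
    score = HSP_SCORE.search(score_line)
    ident = HSP_IDENTITY.search(identity_line)
    if score is None or ident is None:
        raise ValueError("unrecognised HSP header layout")
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        # Identical positions divided by alignment length, as a percentage.
        "identity_pct": round(100 * matches / length, 2),
    }
```

Applied to the first TrEMBL hit, this gives an identity of 334/424, or 78.77%, matching the value in the summary table.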