CSPI03G46640 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI03G46640
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionfibroin heavy chain-like isoform X21
LocationChr3: 39831996 .. 39835332 (+)
RNA-Seq ExpressionCSPI03G46640
SyntenyCSPI03G46640
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GTCCCCTTTCATTTCACTTCAACCATTCACAAATGGCTTCTCATAAGTTTTCTTCTTTTGTATTGTTCTTTCTTTTGTTAGGTATTGGGGTTTCATATGCAGCTAGAACTCTCTTAACTTATGGTGGAGGAGGACCAGTGAATATACCTGCATTTGCTTATGGTGCAGGCAATGGTGGAGGTGGTGGAAGCGGTGGTGGATATGGTCCACTTGGTGGTGGTGGTGGAGGTTATGGAAGTGGAGGTGGTGGTAGTTATAGTTCTGTAGGAGTACAATATGGTGTTGGAGGCTATGGAAGTGGAGGTGGAGGTGGAAGTGGTGGTGGTGAAGGATACGGTCCTAGTGGTGGCTATGGTGGAGGAGGAGGTGGTGGGAGTGGCGGTGGTTCTGCTTATGGTCATGGTGGTTCTGCTTATGGAGGTGGTGGAGGAAGTGGCGGAGGAGGTGGTTATGGTCCTGGTGGTGGTGGATATGGAGGGGGTGGTGGAAATGGTGGTGGAGCGAGTTATGGTCCTGGAGAAGGTAGTGGATATGGAGGTGTAGGATATGGTGGAGGTGGTGGTAGTGGAGGTGGTGTGGGGTATGGCCCGGGAGGTGGAGGGTATGGAGGAGGTGGTGGGAATGGAGGTGGGGCTGGATATGGTCTTGGGGGTGCAGGATATGGAGGAGGTGGTGGAAATGGCGGTGGGGCAGGATATGGTCATGAAGGTGCAGGTTATGGAGGGGGTGGTGGAAATGGTGGTGGAGGAGGATATGGCCATGAAGGTGCAGGGTATGGAGGGGGTGGTGGAAATGGCGGTGGAGGAGGATACGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGAAAATAGTGGCGGTCATGAAAGTGGAGGATATGGAGGAAGTGGAGGAAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGTAGTGGAGGAAATGGAGGCGGCAGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATGGAAACAGTGGGGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGGTGGAAATAGCGGTGGGGCAGGATATGGTGGAGAACATGGAGCAGGGTATGGTGGTGGTGGTGGAAATGGTGGGGGTGGAGGTGGAGGAGCAGGAAGTGGCTCAGGAGGAGAATATGGCTCTGGTTATGGAAGCGGAGCAGGTGGGGGACATGGTGCAGGAGGAGGAAATAATGGTGGGGGAAGCGGAGGAGGCGGTGGAGGAGGATCGGGATATGGTGGAGGAAGTGCACATGGTGGTGCATACGGTGGACATGAAGAAGGCAATGGGTATGGTGGAGGAGGTGGTAGCGGGCAAGGTGGTGGACATGGTGGATATGCACCTTGAAATGGACATCATAATTGTTACCTTATATATATATATATTTATATTATATACAATTATAAAAGAAACAAATAGGGAAAATACATATTATCATTTATCCTTTCTTAATTCCCTATGTAATCAAAATATATTTACGTGTATCAAAGATTGGGCAAAGGGATTAGTTATATGTAGTAAGAAAAGTGTGTGATATATATGTCAAAATGGGAACACTTGCCAAGTTGTAACTTCTTCATCTCCATATTTTGTTGAAGTAAAAGATTAAGGGGACTCTCCTATTTTTGACAAACACCTTATAGTTCTCTTTCAAAATATATAGGACTTAATTTGAAAATTAATGATGGTAGTTTTATTTGAAGTGAGACTCAATATAAATTACTTTTATTTCTTTCACCCAACTATGTTCTTTTTTTCTTAAAAGAAAAAATACATCACTCTCTATTTTATAATTATATATCAAATTGTCTCATATATTTTCTTTTCATATTTTTACTAATATTAAATTTAATTTAAAAATATTACTGATTTAAACATTTTATTTATAAACTAATAAATATTATATACAAGTACAAAAATATTACCTTTTATTTTTGAACAAATAACACGAGAGTAAAGGATGTAAATAGTGTTTAGTGACAATAGTTTCACCTAAGAAACGAGGAGAACACTAAATGAAAATAGTAGAGACACACAATATTGTTAACCCAATTCGATGACATAACATCTACATTTGGGAGTCCTTAACTCATAATAAAATTCGATGAGATTTCTTATTACTTTGATAAAAATTCAATCATAATCAACATATTATGATTTTAGTACATAAAATTTCACTGACTTATTCACTTTGTACTAATTACCATGATATAAATTGAATTATCACTACAATCAAAGATAGATTTTGATAACCAAAACCTTCGACCTTAATCATGCAACATTCTTATCTTATCACCAAAATAAGATGTTAATCAAAAGTGTCTTCCCATGAACTTTTGACCTTCATTAAATTTCAACATATGTAACGATTATGTACACAAAACTCTTAACCAACAAAAGTTGGATCAAGATAACACATCATCTTTTTCTGAAGACAATTTCATTGCAAAGTGATTGCTTTTAGCACTAATTTTCTTTTTCTTTCAGCGGGATATACTAGGAGAATGA

mRNA sequence

GTCCCCTTTCATTTCACTTCAACCATTCACAAATGGCTTCTCATAAGTTTTCTTCTTTTGTATTGTTCTTTCTTTTGTTAGGTATTGGGGTTTCATATGCAGCTAGAACTCTCTTAACTTATGGTGGAGGAGGACCAGTGAATATACCTGCATTTGCTTATGGTGCAGGCAATGGTGGAGGTGGTGGAAGCGGTGGTGGATATGGTCCACTTGGTGGTGGTGGTGGAGGTTATGGAAGTGGAGGTGGTGGTAGTTATAGTTCTGTAGGAGTACAATATGGTGTTGGAGGCTATGGAAGTGGAGGTGGAGGTGGAAGTGGTGGTGGTGAAGGATACGGTCCTAGTGGTGGCTATGGTGGAGGAGGAGGTGGTGGGAGTGGCGGTGGTTCTGCTTATGGTCATGGTGGTTCTGCTTATGGAGGTGGTGGAGGAAGTGGCGGAGGAGGTGGTTATGGTCCTGGTGGTGGTGGATATGGAGGGGGTGGTGGAAATGGTGGTGGAGCGAGTTATGGTCCTGGAGAAGGTAGTGGATATGGAGGTGTAGGATATGGTGGAGGTGGTGGTAGTGGAGGTGGTGTGGGGTATGGCCCGGGAGGTGGAGGGTATGGAGGAGGTGGTGGGAATGGAGGTGGGGCTGGATATGGTCTTGGGGGTGCAGGATATGGAGGAGGTGGTGGAAATGGCGGTGGGGCAGGATATGGTCATGAAGGTGCAGGTTATGGAGGGGGTGGTGGAAATGGTGGTGGAGGAGGATATGGCCATGAAGGTGCAGGGTATGGAGGGGGTGGTGGAAATGGCGGTGGAGGAGGATACGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGAAAATAGTGGCGGTCATGAAAGTGGAGGATATGGAGGAAGTGGAGGAAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGTAGTGGAGGAAATGGAGGCGGCAGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATGGAAACAGTGGGGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGGTGGAAATAGCGGTGGGGCAGGATATGGTGGAGAACATGGAGCAGGGTATGGTGGTGGTGGTGGAAATGGTGGGGGTGGAGGTGGAGGAGCAGGAAGTGGCTCAGGAGGAGAATATGGCTCTGGTTATGGAAGCGGAGCAGGTGGGGGACATGGTGCAGGAGGAGGAAATAATGGTGGGGGAAGCGGAGGAGGCGGTGGAGGAGGATCGGGATATGGTGGAGGAAGTGCACATGGTGGTGCATACGGTGGACATGAAGAAGGCAATGGGTATGGTGGAGGAGGTGGTAGCGGGCAAGCGGGATATACTAGGAGAATGA

Coding sequence (CDS)

ATGGCTTCTCATAAGTTTTCTTCTTTTGTATTGTTCTTTCTTTTGTTAGGTATTGGGGTTTCATATGCAGCTAGAACTCTCTTAACTTATGGTGGAGGAGGACCAGTGAATATACCTGCATTTGCTTATGGTGCAGGCAATGGTGGAGGTGGTGGAAGCGGTGGTGGATATGGTCCACTTGGTGGTGGTGGTGGAGGTTATGGAAGTGGAGGTGGTGGTAGTTATAGTTCTGTAGGAGTACAATATGGTGTTGGAGGCTATGGAAGTGGAGGTGGAGGTGGAAGTGGTGGTGGTGAAGGATACGGTCCTAGTGGTGGCTATGGTGGAGGAGGAGGTGGTGGGAGTGGCGGTGGTTCTGCTTATGGTCATGGTGGTTCTGCTTATGGAGGTGGTGGAGGAAGTGGCGGAGGAGGTGGTTATGGTCCTGGTGGTGGTGGATATGGAGGGGGTGGTGGAAATGGTGGTGGAGCGAGTTATGGTCCTGGAGAAGGTAGTGGATATGGAGGTGTAGGATATGGTGGAGGTGGTGGTAGTGGAGGTGGTGTGGGGTATGGCCCGGGAGGTGGAGGGTATGGAGGAGGTGGTGGGAATGGAGGTGGGGCTGGATATGGTCTTGGGGGTGCAGGATATGGAGGAGGTGGTGGAAATGGCGGTGGGGCAGGATATGGTCATGAAGGTGCAGGTTATGGAGGGGGTGGTGGAAATGGTGGTGGAGGAGGATATGGCCATGAAGGTGCAGGGTATGGAGGGGGTGGTGGAAATGGCGGTGGAGGAGGATACGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGAAAATAGTGGCGGTCATGAAAGTGGAGGATATGGAGGAAGTGGAGGAAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGCAGTGGAGGAAATGGAGGGGGTCATGAAAGTGGAGGATATGGAGGTAGTGGAGGAAATGGAGGCGGCAGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGAAGGACATGGCAGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGCGGCGGTCATGAAAGTGGAGGATATGGAAACAGTGGAGGAAATAGCGGTGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGCAGTGGAGGAAATGGTGGCGGTCATGAAAGTGGAGGATACGGAGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATGGAAACAGTGGGGGAAATAGCGGTGGTCATGAAAGTGGAGGATATGGAGGCAGTGGTGGAAATAGCGGTGGGGCAGGATATGGTGGAGAACATGGAGCAGGGTATGGTGGTGGTGGTGGAAATGGTGGGGGTGGAGGTGGAGGAGCAGGAAGTGGCTCAGGAGGAGAATATGGCTCTGGTTATGGAAGCGGAGCAGGTGGGGGACATGGTGCAGGAGGAGGAAATAATGGTGGGGGAAGCGGAGGAGGCGGTGGAGGAGGATCGGGATATGGTGGAGGAAGTGCACATGGTGGTGCATACGGTGGACATGAAGAAGGCAATGGGTATGGTGGAGGAGGTGGTAGCGGGCAAGCGGGATATACTAGGAGAATGA

Protein sequence

MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPLGGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSAYGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGGYGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMETVGEIAVVMKVEDMEAVVEIAVGQDMVENMEQGMVVVVEMVGVEVEEQEVAQEENMALVMEAEQVGDMVQEEEIMVGEAEEAVEEDRDMVEEVHMVVHTVDMKKAMGMVEEVVAGKRDILGE*
Homology
BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match: A0A0A0LH77 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G902220 PE=4 SV=1)

HSP 1 Score: 297.4 bits (760), Expect = 1.6e-76
Identity = 334/424 (78.77%), Postives = 340/424 (80.19%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120

Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
           YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180

Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
           GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGG  
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGAS 240

Query: 241 YG-HEGAGYGG-GGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYG 300
           YG  EG+GYGG GGGNGG                   GHESGGYGG+GGNGG HESGGYG
Sbjct: 241 YGPGEGSGYGGVGGGNGG-------------------GHESGGYGGNGGNGGSHESGGYG 300

Query: 301 GSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESG 360
           GSG NSGG    GYGG  G+G  GGG   GG GG  G+G G   G YG   G+G G   G
Sbjct: 301 GSGGNSGG---AGYGGEHGAGYGGGGGNGGGGGGGAGSGSG---GEYGSGYGSGAG---G 360

Query: 361 GYGGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESG-GYGGSEGHGSGHE 420
           G+G  GGN GG    GG GGSG  GG    G Y       GGHE G GYGG  G G G  
Sbjct: 361 GHGAGGGNNGGGSGGGGGGGSGYGGGSAHGGAY-------GGHEEGNGYGGGGGSGQGGG 388

Query: 421 SGGY 422
            GGY
Sbjct: 421 HGGY 388

BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match: A0A6J1HQX0 (fibroin heavy chain-like isoform X21 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 5.7e-29
Identity = 339/562 (60.32%), Postives = 367/562 (65.30%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHK  S  +FFLLLGIGVS AAR LLTYG G PVNIPAFAYGAG G G GSGGGYG L
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GG GGG G+GGG  Y SVG +YGVGGYGSGGGGGSG G GYGP GG G GGGGGSGGG  
Sbjct: 61  GGYGGGGGNGGGSGYGSVG-EYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGG 120

Query: 121 Y---------GHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVG 180
           Y         G+G    G GGGSGGG GYGPGGGGYGGGGGNGGGA YGP      GG G
Sbjct: 121 YGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGP------GGSG 180

Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE-GAGYG 240
           YGGGGG+GGG GYG  GGGYGGGGGNGGGAGYG  G GYGGGGGNGGGAGYGHE G GYG
Sbjct: 181 YGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYG 240

Query: 241 GGGGNGGGGGYGHE-GAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGGYGGS 300
           GGGGNGGG GYGHE G GYGGGGGNGGG GYGHE GGYGG  G+G     GHE GGYGG 
Sbjct: 241 GGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGG 300

Query: 301 GGNGGGHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
           GGNGGG    GY        GHE GGYGG GG+GG  G     G GG+G   GG E GG 
Sbjct: 301 GGNGGG---AGY--------GHEGGGYGGGGGNGGGAG----YGSGGAGSGAGGGEGGGS 360

Query: 361 GGSGGNGGGHESGGYGGSGGNGGGSHESGGYGGSG---GNGGGHESGGYGNSGGNSGGHE 420
           G  G +G G+ SGG GG+GG GG  +  GG  GSG   G GGGH  GG G+SGG SGG  
Sbjct: 361 GYGGEHGAGYGSGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGGSSGGGSGGGG 420

Query: 421 SGGYG-GSEGHGSGHESGGYGGSGGNGGGHESGGYGNSG-GNSGGHESGGYGG-SGGNGG 480
            GG G G   H    +    G  GGN GG++ G +   G GN  G+  G Y   +GGN G
Sbjct: 421 GGGSGYGLNKHEEYDKDKHEGYDGGNYGGYDGGKHEKYGRGNYRGYGRGKYEEYNGGNYG 480

Query: 481 GHESGGY---------GGSGGNGGGHESGGYGG-SGGNGGGHESGGY---------GGSG 523
           G++ G +         G  GGN GG++ G YG   GGN GG++ G +         G  G
Sbjct: 481 GYDGGKHEEYDKDKHEGYDGGNYGGYDGGNYGRYDGGNYGGYDGGKHEEYDKDKHEGYDG 538

BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match: A0A6J1HWR4 (fibroin heavy chain-like isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 137.9 bits (346), Expect = 1.7e-28
Identity = 352/596 (59.06%), Postives = 371/596 (62.25%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHK  S  +FFLLLGIGVS AAR LLTYG G PVNIPAFAYGAG G G GSGGGYG L
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GG GGG G+GGG  Y SVG +YGVGGYGSGGGGGSG G GYGP GG G GGGGGSGGG  
Sbjct: 61  GGYGGGGGNGGGSGYGSVG-EYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGG 120

Query: 121 Y---------GHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVG 180
           Y         G+G    G GGGSGGG GYGPGGGGYGGGGGNGGGA YGP      GG G
Sbjct: 121 YGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGP------GGSG 180

Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE-GAGYG 240
           YGGGGG+GGG GYG  GGGYGGGGGNGGGAGYG  G GYGGGGGNGGGAGYGHE G GYG
Sbjct: 181 YGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYG 240

Query: 241 GGGGNGGGGGYGHE-GAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGGYGGS 300
           GGGGNGGG GYGHE G GYGGGGGNGGG GYGHE GGYGG  G+G     GHE GGYGG 
Sbjct: 241 GGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGG 300

Query: 301 GGNGG----GHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHE 360
           GGNGG    GHE GGYGG G N GG    GY GSGG+G   GG E GG G  G +G G+ 
Sbjct: 301 GGNGGGAGYGHEGGGYGGGGGNGGG---AGY-GSGGAGSGAGGGEGGGSGYGGEHGAGYG 360

Query: 361 SGGYGGSGGNGG-----------GHESGGYGGSGGNGGGSHESGGYGGSGGNGGG----- 420
           SGG GG+GG GG           G+ SG  GG GG GGGS   GG GG GG G G     
Sbjct: 361 SGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGGS-SGGGSGGGGGGGSGYGLNK 420

Query: 421 --------HESGGYGNSGGNSGG-HESGGYGGSEGHGSG----HESGGYGGSGGNGGGHE 480
                   HE    GN GG  GG HE  G G   G+G G    +  G YGG   +GG HE
Sbjct: 421 HEEYDKDKHEGYDGGNYGGYDGGKHEKYGRGNYRGYGRGKYEEYNGGNYGGY--DGGKHE 480

Query: 481 ----SGGYGNSGGNSGGHESG-----------GYGG---SGGNGGGHE----SGGYGGSG 523
                   G  GGN GG++ G           GY G    G NGG HE        G  G
Sbjct: 481 EYDKDKHEGYDGGNYGGYDGGKHEEYEKDKPEGYDGGKYGGYNGGKHEEYDKDKHEGYDG 540

BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match: A0A6J1HS53 (fibroin heavy chain-like isoform X20 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 1.0e-25
Identity = 353/610 (57.87%), Postives = 373/610 (61.15%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHK  S  +FFLLLGIGVS AAR LLTYG G PVNIPAFAYGAG G G GSGGGYG L
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GG GGG G+GGG  Y SVG +YGVGGYGSGGGGGSG G GYGP GG G GGGGGSGGG  
Sbjct: 61  GGYGGGGGNGGGSGYGSVG-EYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGG 120

Query: 121 Y---------GHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVG 180
           Y         G+G    G GGGSGGG GYGPGGGGYGGGGGNGGGA YGP      GG G
Sbjct: 121 YGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGP------GGSG 180

Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE-GAGYG 240
           YGGGGG+GGG GYG  GGGYGGGGGNGGGAGYG  G GYGGGGGNGGGAGYGHE G GYG
Sbjct: 181 YGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYG 240

Query: 241 GGGGNGGGGGYGHE-GAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGGYGGS 300
           GGGGNGGG GYGHE G GYGGGGGNGGG GYGHE GGYGG  G+G     GHE GGYGG 
Sbjct: 241 GGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGG 300

Query: 301 GGNGG----GHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHE 360
           GGNGG    GHE GGYGG G N GG    GY GSGG+G   GG E GG G  G +G G+ 
Sbjct: 301 GGNGGGAGYGHEGGGYGGGGGNGGG---AGY-GSGGAGSGAGGGEGGGSGYGGEHGAGYG 360

Query: 361 SGGYGGSGGNGG-----------GHESGGYGGSGGNGGGSHESGGYGGSGGNGGG----- 420
           SGG GG+GG GG           G+ SG  GG GG GGGS   GG GG GG G G     
Sbjct: 361 SGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGGS-SGGGSGGGGGGGSGYGLNK 420

Query: 421 --------HESGGYGNSGGNSGG-HESGGYGGSEGHGSG----HESGGYGGSGGNGGGHE 480
                   HE    GN GG  GG HE  G G   G+G G    +  G YGG   +GG HE
Sbjct: 421 HEEYDKDKHEGYDGGNYGGYDGGKHEKYGRGNYRGYGRGKYEEYNGGNYGGY--DGGKHE 480

Query: 481 ----SGGYGNSGGNSGGHESG-----------GYGG---SGGNGGGHES------GGYGG 523
                   G  GGN GG++ G           GY G    G NGG HE        GY G
Sbjct: 481 EYDKDKHEGYDGGNYGGYDGGKHEEYEKDKPEGYDGGKYGGYNGGKHEEYDKDKHEGYDG 540

BLAST of CSPI03G46640 vs. ExPASy TrEMBL
Match: A0A6J1HV40 (fibroin heavy chain-like isoform X8 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 1.0e-25
Identity = 353/610 (57.87%), Postives = 373/610 (61.15%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHK  S  +FFLLLGIGVS AAR LLTYG G PVNIPAFAYGAG G G GSGGGYG L
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GG GGG G+GGG  Y SVG +YGVGGYGSGGGGGSG G GYGP GG G GGGGGSGGG  
Sbjct: 61  GGYGGGGGNGGGSGYGSVG-EYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGG 120

Query: 121 Y---------GHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVG 180
           Y         G+G    G GGGSGGG GYGPGGGGYGGGGGNGGGA YGP      GG G
Sbjct: 121 YGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGP------GGSG 180

Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE-GAGYG 240
           YGGGGG+GGG GYG  GGGYGGGGGNGGGAGYG  G GYGGGGGNGGGAGYGHE G GYG
Sbjct: 181 YGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYG 240

Query: 241 GGGGNGGGGGYGHE-GAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGGYGGS 300
           GGGGNGGG GYGHE G GYGGGGGNGGG GYGHE GGYGG  G+G     GHE GGYGG 
Sbjct: 241 GGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGG 300

Query: 301 GGNGG----GHESGGYGGSGENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHE 360
           GGNGG    GHE GGYGG G N GG    GY GSGG+G   GG E GG G  G +G G+ 
Sbjct: 301 GGNGGGAGYGHEGGGYGGGGGNGGG---AGY-GSGGAGSGAGGGEGGGSGYGGEHGAGYG 360

Query: 361 SGGYGGSGGNGG-----------GHESGGYGGSGGNGGGSHESGGYGGSGGNGGG----- 420
           SGG GG+GG GG           G+ SG  GG GG GGGS   GG GG GG G G     
Sbjct: 361 SGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGGS-SGGGSGGGGGGGSGYGLNK 420

Query: 421 --------HESGGYGNSGGNSGG-HESGGYGGSEGHGSG----HESGGYGGSGGNGGGHE 480
                   HE    GN GG  GG HE  G G   G+G G    +  G YGG   +GG HE
Sbjct: 421 HEEYDKDKHEGYDGGNYGGYDGGKHEKYGRGNYRGYGRGKYEEYNGGNYGGY--DGGKHE 480

Query: 481 ----SGGYGNSGGNSGGHESG-----------GYGG---SGGNGGGHES------GGYGG 523
                   G  GGN GG++ G           GY G    G NGG HE        GY G
Sbjct: 481 EYDKDKHEGYDGGNYGGYDGGKHEEYEKDKPEGYDGGKYGGYNGGKHEEYDKDKHEGYDG 540

BLAST of CSPI03G46640 vs. NCBI nr
Match: XP_031738892.1 (glycine-rich cell wall structural protein 1.8 isoform X1 [Cucumis sativus] >KAE8651439.1 hypothetical protein Csa_001535 [Cucumis sativus])

HSP 1 Score: 506.5 bits (1303), Expect = 3.7e-139
Identity = 471/543 (86.74%), Postives = 480/543 (88.40%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120

Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
           YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180

Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
           GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240

Query: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGS 300
           YGHEGAGYGGGGGNGGGGGYGHESGGYGG+ G+G GHESGGYGGS     GHESGGYGGS
Sbjct: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGNGGNGGGHESGGYGGS-----GHESGGYGGS 300

Query: 301 GENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
           G N+GGHESGGYGGSGG+GGNGG HESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY
Sbjct: 301 GGNNGGHESGGYGGSGGNGGNGGSHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360

Query: 361 GGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHESGG 420
           GGSGGN GG HESGGYGGS G+GGGHESGGYG SGGN GGHESGGYGGS G+G GHESGG
Sbjct: 361 GGSGGNSGG-HESGGYGGSEGHGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGG 420

Query: 421 YGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNGGGHESGGYGGSGGNG-----GG 480
           YG     GG HESGGYG +GGN GGHESGGYG SGGN GGHESGGYGGSGGN      GG
Sbjct: 421 YG-----GGSHESGGYGGNGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNSGGAGYGG 480

Query: 481 HESGGYGGSGGNGGGHESGGYGGSGGN---------GGGHESGGYGGSGGNGGGHESG-G 529
               GYGG GGNGGG   G   GSGG          GGGH +GG    GG+GGG   G G
Sbjct: 481 EHGAGYGGGGGNGGGGGGGAGSGSGGEYGSGYGSGAGGGHGAGGGNNGGGSGGGGGGGSG 531

BLAST of CSPI03G46640 vs. NCBI nr
Match: XP_031738894.1 (glycine-rich cell wall structural protein 1.8 isoform X3 [Cucumis sativus])

HSP 1 Score: 500.4 bits (1287), Expect = 2.6e-137
Identity = 472/538 (87.73%), Postives = 480/538 (89.22%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120

Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
           YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180

Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
           GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240

Query: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGS 300
           YGHEGAGYGGGGGNGGGGGYGHESGGYGG+ G+G GHESGGYGGS     GHESGGYGGS
Sbjct: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGNGGNGGGHESGGYGGS-----GHESGGYGGS 300

Query: 301 GENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
           G N+GGHESGGYGGSGG+GGNGG HESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY
Sbjct: 301 GGNNGGHESGGYGGSGGNGGNGGSHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360

Query: 361 GGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHESGG 420
           GGSGGN GG HESGGYGGS G+GGGHESGGYG SGGN GGHESGGYGGS   G  HESGG
Sbjct: 361 GGSGGNSGG-HESGGYGGSEGHGGGHESGGYGGSGGNGGGHESGGYGGS--GGGSHESGG 420

Query: 421 YGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNG-----GGHESGGYGGSGGNGGG 480
           YGG+GGNGGGHESGGYGNSGGNSGGHESGGYGGSGGN      GG    GYGG GGNGGG
Sbjct: 421 YGGNGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNSGGAGYGGEHGAGYGGGGGNGGG 480

Query: 481 HESGGYGGSGGN-GGGHES---GGYGGSGGNGGGHESGGYGGSGGNGGGHESGG-YGG 529
              G   GSGG  G G+ S   GG+G  GGN GG   GG GG  G GGG   GG YGG
Sbjct: 481 GGGGAGSGSGGEYGSGYGSGAGGGHGAGGGNNGGGSGGGGGGGSGYGGGSAHGGAYGG 529

BLAST of CSPI03G46640 vs. NCBI nr
Match: XP_031738893.1 (glycine-rich cell wall structural protein 1.8 isoform X2 [Cucumis sativus])

HSP 1 Score: 495.0 bits (1273), Expect = 1.1e-135
Identity = 472/541 (87.25%), Postives = 478/541 (88.35%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120

Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
           YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180

Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
           GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240

Query: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGS 300
           YGHEGAGYGGGGGNGGGGGYGHESGGYGG+ G+G GHESGGYGGSGGN GGHESGGYGGS
Sbjct: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGNGGNGGGHESGGYGGSGGNNGGHESGGYGGS 300

Query: 301 ---GENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHES 360
              G N G HESGGY   GGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGN GGHES
Sbjct: 301 GGNGGNGGSHESGGY---GGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNSGGHES 360

Query: 361 GGYGGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHE 420
           GGYGGS G+GGG HESGGYGGSGGNGGGHESGGYG SGGN GGHESGGYG     G  HE
Sbjct: 361 GGYGGSEGHGGG-HESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGYG-----GGSHE 420

Query: 421 SGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNG-----GGHESGGYGGSGGN 480
           SGGYGG+GGNGGGHESGGYGNSGGNSGGHESGGYGGSGGN      GG    GYGG GGN
Sbjct: 421 SGGYGGNGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNSGGAGYGGEHGAGYGGGGGN 480

Query: 481 GGGHESGGYGGSGGN-GGGHES---GGYGGSGGNGGGHESGGYGGSGGNGGGHESGG-YG 529
           GGG   G   GSGG  G G+ S   GG+G  GGN GG   GG GG  G GGG   GG YG
Sbjct: 481 GGGGGGGAGSGSGGEYGSGYGSGAGGGHGAGGGNNGGGSGGGGGGGSGYGGGSAHGGAYG 531

BLAST of CSPI03G46640 vs. NCBI nr
Match: XP_031738895.1 (glycine-rich cell wall structural protein 1 isoform X4 [Cucumis sativus])

HSP 1 Score: 372.9 bits (956), Expect = 6.3e-99
Identity = 401/498 (80.52%), Postives = 407/498 (81.73%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHKFSSFVLFFLLLGIGVSYAARTLLTY GGGPVNIPAFAYGAGNGGGGGSGGGYGPL
Sbjct: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTY-GGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60

Query: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120
           GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA
Sbjct: 61  GGGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGGSA 120

Query: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180
           YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG
Sbjct: 121 YGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGASYGPGEGSGYGGVGYGGGGGSGG 180

Query: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240
           GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG
Sbjct: 181 GVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGGG 240

Query: 241 YGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGGS 300
           YGHEGAGYGGGGGNGGGGGY               GHESGGYGG+GGNGGGHESGGYGGS
Sbjct: 241 YGHEGAGYGGGGGNGGGGGY---------------GHESGGYGGNGGNGGGHESGGYGGS 300

Query: 301 GENSGGHESGGYGGSGGSGGNGGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESGGY 360
                GHESGGYGGSG     GG HESGGYGG+GGNGGGHESGGYG SGGN GGHESGGY
Sbjct: 301 -----GHESGGYGGSG-----GGSHESGGYGGNGGNGGGHESGGYGNSGGNSGGHESGGY 360

Query: 361 GGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSEGHGSGHESG- 420
           GGSGGN GG+    GYGG  G        GYG  GGN GG      GG  G GSG E G 
Sbjct: 361 GGSGGNSGGA----GYGGEHG-------AGYGGGGGNGGGG-----GGGAGSGSGGEYGS 420

Query: 421 GYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNGGGHESGGYGGSGGNGGGHESG 480
           GYG   G GGGH +GG GN+GG SGG   GG GGSG  GG    G Y       GGHE G
Sbjct: 421 GYG--SGAGGGHGAGG-GNNGGGSGG---GGGGGSGYGGGSAHGGAY-------GGHEEG 443

Query: 481 -GYGGSGGNGGGHESGGY 497
            GYGG GG+G G   GGY
Sbjct: 481 NGYGGGGGSGQGGGHGGY 443

BLAST of CSPI03G46640 vs. NCBI nr
Match: XP_038887343.1 (glycine-rich cell wall structural protein-like isoform X1 [Benincasa hispida])

HSP 1 Score: 246.1 bits (627), Expect = 9.0e-61
Identity = 345/488 (70.70%), Postives = 358/488 (73.36%), Query Frame = 0

Query: 1   MASHKFSSFVLFFLLLGIGVSYAARTLLTYGGGGPVNIPAFAYGAGNGGGGGSGGGYGPL 60
           MASHK SSFV FFLLLGIGVS AARTLLTY  G  VNIPAFAYGAGNG GGGSGGGYGPL
Sbjct: 1   MASHKLSSFV-FFLLLGIGVSSAARTLLTYADGESVNIPAFAYGAGNGAGGGSGGGYGPL 60

Query: 61  G--GGGGGYGSGGGGSYSSVGVQYGVGGYGSGGGGGSGGGEGYGPSGGYGGGGGGGSGGG 120
           G  GGGGGYG GGG SY S G QYGVGGYGSGGGGGSGGG GYGPSGGYGGGGGGGSGGG
Sbjct: 61  GGHGGGGGYGGGGGSSYGSEG-QYGVGGYGSGGGGGSGGGGGYGPSGGYGGGGGGGSGGG 120

Query: 121 SAYGHGGSAYGGGGGSGGGGGYGPGGGGYGGGGGNGGGA-------SYGPGEGSGYGGVG 180
           SAYGHGGSAYGGGGG G G GYGPGGGGYGGGGGNGGGA        YGPG G+GYGGVG
Sbjct: 121 SAYGHGGSAYGGGGGGGEGVGYGPGGGGYGGGGGNGGGAGYGPSGGGYGPGGGNGYGGVG 180

Query: 181 YGGGGGSGGGVGYGPGGGGYGGGGGNGGGAGYGLGGAGYGGGGGNGGGAGYGHE------ 240
           YGGGGGSGGGVGYG GGGGYGGGGGNGGGAGYG GG+GYGGGGGNGGGAGYGHE      
Sbjct: 181 YGGGGGSGGGVGYGLGGGGYGGGGGNGGGAGYGPGGSGYGGGGGNGGGAGYGHEXGYGPG 240

Query: 241 GAGYGGGGGNGGGGGYGHEGAGYGGGGGNGGGGGYGHESGGYGGSEGHGS----GHESGG 300
           G+GYGGGGGNGGG GYGHEGAGYGGGGGNGGG GYGHESGGYGG  G G     GHE  G
Sbjct: 241 GSGYGGGGGNGGGAGYGHEGAGYGGGGGNGGGAGYGHESGGYGGGGGSGGGAGYGHEGAG 300

Query: 301 YGGSGGNGGGHESGGYGGSGENSGGHESGGYGGSGGSGGNGG-GHESGGYGGSGGNGGGH 360
           YGG GGNGGG    GY        GHE  GYGG GG+GG  G GHE  GYGG GGNGGG 
Sbjct: 301 YGGGGGNGGG---AGY--------GHEGAGYGGGGGNGGGTGYGHEGAGYGGGGGNGGGA 360

Query: 361 ESGGYGGSGGNGGGHESGGYGGSGGNGGGSHESGGYGGSGGNGGGHESGGYGNSGGNSGG 420
             G  GG  G G G+ SGG  G+G   GG H   GYGG GG+GGG      G +G  SGG
Sbjct: 361 GYGHEGGEYGGGAGYGSGGGEGNGSGYGGEH-GKGYGGGGGSGGG------GGAGNGSGG 420

Query: 421 HESGGYGGSEGHGSGHESGGYGGSGGNGGGHESGGYGNSGGNSGGHESGGYGGSGGNGGG 469
               GYG   G G GH +GG    GG+GGG   GG    GG  GGHE G   G+GG GG 
Sbjct: 421 EYGSGYG--SGAGGGHGAGGGSSGGGSGGGSGYGGGSAPGGAYGGHEEG--SGNGGGGGS 464

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LH771.6e-7678.77Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G902220 PE=4 SV=1[more]
A0A6J1HQX05.7e-2960.32fibroin heavy chain-like isoform X21 OS=Cucurbita maxima OX=3661 GN=LOC111466985... [more]
A0A6J1HWR41.7e-2859.06fibroin heavy chain-like isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111466985 ... [more]
A0A6J1HS531.0e-2557.87fibroin heavy chain-like isoform X20 OS=Cucurbita maxima OX=3661 GN=LOC111466985... [more]
A0A6J1HV401.0e-2557.87fibroin heavy chain-like isoform X8 OS=Cucurbita maxima OX=3661 GN=LOC111466985 ... [more]
Match NameE-valueIdentityDescription
XP_031738892.13.7e-13986.74glycine-rich cell wall structural protein 1.8 isoform X1 [Cucumis sativus] >KAE8... [more]
XP_031738894.12.6e-13787.73glycine-rich cell wall structural protein 1.8 isoform X3 [Cucumis sativus][more]
XP_031738893.11.1e-13587.25glycine-rich cell wall structural protein 1.8 isoform X2 [Cucumis sativus][more]
XP_031738895.16.3e-9980.52glycine-rich cell wall structural protein 1 isoform X4 [Cucumis sativus][more]
XP_038887343.19.0e-6170.70glycine-rich cell wall structural protein-like isoform X1 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 573..593
NoneNo IPR availablePRINTSPR01228EGGSHELLcoord: 140..150
score: 52.27
coord: 109..124
score: 52.34
coord: 5..21
score: 25.0
coord: 185..203
score: 45.39
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 356..457
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 266..336

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G46640.1CSPI03G46640.1mRNA