Cp4.1LG15g02020 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG15g02020
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionfibroin heavy chain-like isoform X21
LocationCp4.1LG15: 1740091 .. 1742085 (+)
RNA-Seq ExpressionCp4.1LG15g02020
SyntenyCp4.1LG15g02020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCGTTTCATTCAGTTGTTGACTCGGTTCATTCATCTTCTTTACTCGGGTGTCGTACACGAACAAAACTCGGAATGTGAGTTGAAATGGAGGGAGTGGAGGACAGCTTAATGACACTGATTTTGTGAAATGGAGTGAAAGAGGGAGTGGGTAGGTGTAGATTTAGGGCCACCCACTTTCTCCCTTTTGCATGTGCCTCAACCACTCTCTCCACACAGCTTCCCTTATATAAACCCTTCACTCACTCCCTTTTCACTTCAACCATTCCCAAATGGCTTCTCATAAACGCCTTTCTTCTTTTCTCTTTTTTCTTTTATTAGGAATTGGGGTTTCTTCTGCAGCCAGAAATCTCTTAACTTATGGTGAAGGAAAGCCGGTGAACATTCCGGCATTTGCTTATGGTGCAGGCGGTGGTGCAGGTGGTGGTAGTGGTGGTGGATATGGTTCTCTTGGTGGTTATGGCGGTGGTGGAGGAAATGGAGGTGGTAGTGGCTATGGTTCTGTAGGAGAATATGGTGTTGGTGGCTATGGAAGTGGTGGCGGTGGAGGAAGTGGTGAGGGAGGTGGGTATGGCCCGGGAGGTGGAAATGGATATGGTGGAGGTGGAGGAAGTGGCGGTGGAGGTGGTTATGGTTCTGGGGTTAGATATGGTGAAGTTGGATATGGCTCTGGAGGTTCAGGGTATGGTGGAGGGAGTGGAGGCGGTGCTGGGTATGGTCCTGGAGGTGGAGGGTATGGAGGTGGCGGTGGAAATGGAGGTGGGGCTGGGTATGGTCCTGGAGGTTCTGGATATGGAGGAGGTGGTGGAAATGGGGGTGGTGCAGGGTATGGTCATGAGGGTGGTGGTTATGGTGGGGGTGGCGGAAATGGCGGAGGGGCAGGGTATGGTCATGAGGGTGGTGGATATGGAGGGGGCGGTGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGAGGTGGAGGTGGAAATGGCGGTGGCGCAGGGTACGGTCATGACGGTGGTGGATACGGTGGGGGTGGTGGAAATGGTGGTGGCGCAGGGTATGGTCATGAAGGTAGTGGATACGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTTCTGGTGGTGCTGGGTCTGGCGCTGGAGGAGGAGAAGGGGGAGGTTCAGGGTATGGTGGAGAACATGGAGCAGGCTACGGTGGTGGTGGTGGAGGTGGAAATGGCGGCGGTGGAGGTGTAGGATATGGGCCAGGAGGAGAATATGGGTCAGGTTATGGAAGCGGAGCAGGTGGGGGTCATGGTGGAGGAGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGTGGTGGAGGAGGGTCGGGATACGGTGGAGGAAGTGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGTGGTGGTCATGAGGGAGGATATGGTGGAGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGAGGTGGTGGTCATGACGGAGGATATGGTGGTGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAGGAGGTGGTAGCAGCCAAGGCGGTGACCACGGTGGATATGCACCTTGAGAAAGACCTCACCATTACCTTATATATTTAGTATTATTATAAAAAGGGCATATACATATGGTCATTCATCCTTTTTACCCCAAGTAATTAATAGAAAATATAAATCCATGATTGGGAAAAGGGAATAATTAATTATATAATATATCGTAAGAAAAGTGTGATGTTAAAATGGGAAACTCTTGTATCTTCACCTCTCCAAATTTTCTTCATATATAAAATTATTCTCTCTTATTTCCACAAAACCACACCAATTGCATGCATGCTCCCT

mRNA sequence

TCGTTTCATTCAGTTGTTGACTCGGTTCATTCATCTTCTTTACTCGGGTGTCGTACACGAACAAAACTCGGAATGTGAGTTGAAATGGAGGGAGTGGAGGACAGCTTAATGACACTGATTTTGTGAAATGGAGTGAAAGAGGGAGTGGGTAGGTGTAGATTTAGGGCCACCCACTTTCTCCCTTTTGCATGTGCCTCAACCACTCTCTCCACACAGCTTCCCTTATATAAACCCTTCACTCACTCCCTTTTCACTTCAACCATTCCCAAATGGCTTCTCATAAACGCCTTTCTTCTTTTCTCTTTTTTCTTTTATTAGGAATTGGGGTTTCTTCTGCAGCCAGAAATCTCTTAACTTATGGTGAAGGAAAGCCGGTGAACATTCCGGCATTTGCTTATGGTGCAGGCGGTGGTGCAGGTGGTGGTAGTGGTGGTGGATATGGTTCTCTTGGTGGTTATGGCGGTGGTGGAGGAAATGGAGGTGGTAGTGGCTATGGTTCTGTAGGAGAATATGGTGTTGGTGGCTATGGAAGTGGTGGCGGTGGAGGAAGTGGTGAGGGAGGTGGGTATGGCCCGGGAGGTGGAAATGGATATGGTGGAGGTGGAGGAAGTGGCGGTGGAGGTGGTTATGGTTCTGGGGTTAGATATGGTGAAGTTGGATATGGCTCTGGAGGTTCAGGGTATGGTGGAGGGAGTGGAGGCGGTGCTGGGTATGGTCCTGGAGGTGGAGGGTATGGAGGTGGCGGTGGAAATGGAGGTGGGGCTGGGTATGGTCCTGGAGGTTCTGGATATGGAGGAGGTGGTGGAAATGGGGGTGGTGCAGGGTATGGTCATGAGGGTGGTGGTTATGGTGGGGGTGGCGGAAATGGCGGAGGGGCAGGGTATGGTCATGAGGGTGGTGGATATGGAGGGGGCGGTGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGAGGTAGTGGATACGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTTCTGGTGGTGCTGGGTCTGGCGCTGGAGGAGGAGAAGGGGGAGGTTCAGGGTATGGTGGAGAACATGGAGCAGGCTACGGTGGTGGTGGTGGAGGTGGAAATGGCGGCGGTGGAGGTGTAGGATATGGGCCAGGAGGAGAATATGGGTCAGGTTATGGAAGCGGAGCAGGTGGGGGTCATGGTGGAGGAGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGTGGTGGAGGAGGGTCGGGATACGGTGGAGGAAGTGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGTGGTGGTCATGAGGGAGGATATGGTGGAGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGAGGTGGTGGTCATGACGGAGGATATGGTGGTGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAGGAGGTGGTAGCAGCCAAGGCGGTGACCACGGTGGATATGCACCTTGAGAAAGACCTCACCATTACCTTATATATTTAGTATTATTATAAAAAGGGCATATACATATGGTCATTCATCCTTTTTACCCCAAGTAATTAATAGAAAATATAAATCCATGATTGGGAAAAGGGAATAATTAATTATATAATATATCGTAAGAAAAGTGTGATGTTAAAATGGGAAACTCTTGTATCTTCACCTCTCCAAATTTTCTTCATATATAAAATTATTCTCTCTTATTTCCACAAAACCACACCAATTGCATGCATGCTCCCT

Coding sequence (CDS)

ATGGCTTCTCATAAACGCCTTTCTTCTTTTCTCTTTTTTCTTTTATTAGGAATTGGGGTTTCTTCTGCAGCCAGAAATCTCTTAACTTATGGTGAAGGAAAGCCGGTGAACATTCCGGCATTTGCTTATGGTGCAGGCGGTGGTGCAGGTGGTGGTAGTGGTGGTGGATATGGTTCTCTTGGTGGTTATGGCGGTGGTGGAGGAAATGGAGGTGGTAGTGGCTATGGTTCTGTAGGAGAATATGGTGTTGGTGGCTATGGAAGTGGTGGCGGTGGAGGAAGTGGTGAGGGAGGTGGGTATGGCCCGGGAGGTGGAAATGGATATGGTGGAGGTGGAGGAAGTGGCGGTGGAGGTGGTTATGGTTCTGGGGTTAGATATGGTGAAGTTGGATATGGCTCTGGAGGTTCAGGGTATGGTGGAGGGAGTGGAGGCGGTGCTGGGTATGGTCCTGGAGGTGGAGGGTATGGAGGTGGCGGTGGAAATGGAGGTGGGGCTGGGTATGGTCCTGGAGGTTCTGGATATGGAGGAGGTGGTGGAAATGGGGGTGGTGCAGGGTATGGTCATGAGGGTGGTGGTTATGGTGGGGGTGGCGGAAATGGCGGAGGGGCAGGGTATGGTCATGAGGGTGGTGGATATGGAGGGGGCGGTGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGAGGTAGTGGATACGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTTCTGGTGGTGCTGGGTCTGGCGCTGGAGGAGGAGAAGGGGGAGGTTCAGGGTATGGTGGAGAACATGGAGCAGGCTACGGTGGTGGTGGTGGAGGTGGAAATGGCGGCGGTGGAGGTGTAGGATATGGGCCAGGAGGAGAATATGGGTCAGGTTATGGAAGCGGAGCAGGTGGGGGTCATGGTGGAGGAGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGTGGTGGAGGAGGGTCGGGATACGGTGGAGGAAGTGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGTGGTGGTCATGAGGGAGGATATGGTGGAGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGAGGTGGTGGTCATGACGGAGGATATGGTGGTGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAGGAGGTGGTAGCAGCCAAGGCGGTGACCACGGTGGATATGCACCTTGA

Protein sequence

MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSLGGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGYGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGGYGGSGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGSSGGGSGGGGGGGSGYGGGSAHGGASGGHEGGYGGSSGGGGGHEGGYGGGSAHGGASGGHEGGYGGSSGGGSGGGGGGHDGGYGGGSAHGGASGGHEGGYGGGGGSSQGGDHGGYAP
Homology
BLAST of Cp4.1LG15g02020 vs. NCBI nr
Match: XP_023553671.1 (glycine-rich cell wall structural protein-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 512 bits (1318), Expect = 4.37e-176
Identity = 452/485 (93.20%), Postives = 452/485 (93.20%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL
Sbjct: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGG---------------------------------SGYGGGGGNGGGA 300
           GAGYGHEGGGGYGG                                 SGYGGGGGNGGGA
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHDGGGYGGGGGNGGGAGYGHEGSGYGGGGGNGGGA 300

Query: 301 GYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGGGVGYGPGGEYGSGYGSGAG 360
           GYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGGGVGYGPGGEYGSGYGSGAG
Sbjct: 301 GYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGGGVGYGPGGEYGSGYGSGAG 360

Query: 361 GGHGGGGGSSGGGSGGGGGGGSGYGGGSAHGGASGGHEGGYGGSSGGGGGHEGGYGGGSA 420
           GGHGGGGGSSGGGSGGGGGGGSGYGGGSAHGGASGGHEGGYGGSSGGGGGHEGGYGGGSA
Sbjct: 361 GGHGGGGGSSGGGSGGGGGGGSGYGGGSAHGGASGGHEGGYGGSSGGGGGHEGGYGGGSA 420

Query: 421 HGGASGGHEGGYGGSSGGGSGGGGGGHDGGYGGGSAHGGASGGHEGGYGGGGGSSQGGDH 452
           HGGASGGHEGGYGGSSGGGSGGGGGGHDGGYGGGSAHGGASGGHEGGYGGGGGSSQGGDH
Sbjct: 421 HGGASGGHEGGYGGSSGGGSGGGGGGHDGGYGGGSAHGGASGGHEGGYGGGGGSSQGGDH 480

BLAST of Cp4.1LG15g02020 vs. NCBI nr
Match: XP_022967498.1 (glycine-rich cell wall structural protein 1.8-like isoform X50 [Cucurbita maxima])

HSP 1 Score: 370 bits (949), Expect = 1.20e-115
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

BLAST of Cp4.1LG15g02020 vs. NCBI nr
Match: XP_022967497.1 (PE-PGRS family protein PE_PGRS16-like isoform X49 [Cucurbita maxima])

HSP 1 Score: 370 bits (949), Expect = 2.14e-115
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

BLAST of Cp4.1LG15g02020 vs. NCBI nr
Match: XP_022967496.1 (fibroin heavy chain-like isoform X48 [Cucurbita maxima])

HSP 1 Score: 370 bits (949), Expect = 2.10e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

BLAST of Cp4.1LG15g02020 vs. NCBI nr
Match: XP_022967495.1 (fibroin heavy chain-like isoform X47 [Cucurbita maxima])

HSP 1 Score: 370 bits (949), Expect = 2.45e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

BLAST of Cp4.1LG15g02020 vs. ExPASy TrEMBL
Match: A0A6J1HV81 (glycine-rich cell wall structural protein 1.8-like isoform X50 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 370 bits (949), Expect = 5.81e-116
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

BLAST of Cp4.1LG15g02020 vs. ExPASy TrEMBL
Match: A0A6J1HR00 (PE-PGRS family protein PE_PGRS16-like isoform X49 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 370 bits (949), Expect = 1.04e-115
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

BLAST of Cp4.1LG15g02020 vs. ExPASy TrEMBL
Match: A0A6J1HS93 (fibroin heavy chain-like isoform X48 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 370 bits (949), Expect = 1.02e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

BLAST of Cp4.1LG15g02020 vs. ExPASy TrEMBL
Match: A0A6J1HUM8 (fibroin heavy chain-like isoform X47 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 370 bits (949), Expect = 1.19e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

BLAST of Cp4.1LG15g02020 vs. ExPASy TrEMBL
Match: A0A6J1HWV8 (fibroin heavy chain-like isoform X46 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)

HSP 1 Score: 370 bits (949), Expect = 1.38e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0

Query: 1   MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
           MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1   MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60

Query: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
           GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61  GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120

Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
           GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180

Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
           GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240

Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
           GAGYGHEGGGGYGG G                                            
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300

Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
                   YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023553671.14.37e-17693.20glycine-rich cell wall structural protein-like [Cucurbita pepo subsp. pepo][more]
XP_022967498.11.20e-11585.93glycine-rich cell wall structural protein 1.8-like isoform X50 [Cucurbita maxima... [more]
XP_022967497.12.14e-11585.93PE-PGRS family protein PE_PGRS16-like isoform X49 [Cucurbita maxima][more]
XP_022967496.12.10e-11485.93fibroin heavy chain-like isoform X48 [Cucurbita maxima][more]
XP_022967495.12.45e-11485.93fibroin heavy chain-like isoform X47 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1HV815.81e-11685.93glycine-rich cell wall structural protein 1.8-like isoform X50 OS=Cucurbita maxi... [more]
A0A6J1HR001.04e-11585.93PE-PGRS family protein PE_PGRS16-like isoform X49 OS=Cucurbita maxima OX=3661 GN... [more]
A0A6J1HS931.02e-11485.93fibroin heavy chain-like isoform X48 OS=Cucurbita maxima OX=3661 GN=LOC111466985... [more]
A0A6J1HUM81.19e-11485.93fibroin heavy chain-like isoform X47 OS=Cucurbita maxima OX=3661 GN=LOC111466985... [more]
A0A6J1HWV81.38e-11485.93fibroin heavy chain-like isoform X46 OS=Cucurbita maxima OX=3661 GN=LOC111466985... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePRINTSPR01228EGGSHELLcoord: 131..141
score: 52.27
coord: 171..189
score: 42.76
coord: 7..23
score: 32.35
coord: 103..118
score: 47.66
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 379..452

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG15g02020.1Cp4.1LG15g02020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane