Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCGTTTCATTCAGTTGTTGACTCGGTTCATTCATCTTCTTTACTCGGGTGTCGTACACGAACAAAACTCGGAATGTGAGTTGAAATGGAGGGAGTGGAGGACAGCTTAATGACACTGATTTTGTGAAATGGAGTGAAAGAGGGAGTGGGTAGGTGTAGATTTAGGGCCACCCACTTTCTCCCTTTTGCATGTGCCTCAACCACTCTCTCCACACAGCTTCCCTTATATAAACCCTTCACTCACTCCCTTTTCACTTCAACCATTCCCAAATGGCTTCTCATAAACGCCTTTCTTCTTTTCTCTTTTTTCTTTTATTAGGAATTGGGGTTTCTTCTGCAGCCAGAAATCTCTTAACTTATGGTGAAGGAAAGCCGGTGAACATTCCGGCATTTGCTTATGGTGCAGGCGGTGGTGCAGGTGGTGGTAGTGGTGGTGGATATGGTTCTCTTGGTGGTTATGGCGGTGGTGGAGGAAATGGAGGTGGTAGTGGCTATGGTTCTGTAGGAGAATATGGTGTTGGTGGCTATGGAAGTGGTGGCGGTGGAGGAAGTGGTGAGGGAGGTGGGTATGGCCCGGGAGGTGGAAATGGATATGGTGGAGGTGGAGGAAGTGGCGGTGGAGGTGGTTATGGTTCTGGGGTTAGATATGGTGAAGTTGGATATGGCTCTGGAGGTTCAGGGTATGGTGGAGGGAGTGGAGGCGGTGCTGGGTATGGTCCTGGAGGTGGAGGGTATGGAGGTGGCGGTGGAAATGGAGGTGGGGCTGGGTATGGTCCTGGAGGTTCTGGATATGGAGGAGGTGGTGGAAATGGGGGTGGTGCAGGGTATGGTCATGAGGGTGGTGGTTATGGTGGGGGTGGCGGAAATGGCGGAGGGGCAGGGTATGGTCATGAGGGTGGTGGATATGGAGGGGGCGGTGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGAGGTGGAGGTGGAAATGGCGGTGGCGCAGGGTACGGTCATGACGGTGGTGGATACGGTGGGGGTGGTGGAAATGGTGGTGGCGCAGGGTATGGTCATGAAGGTAGTGGATACGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTTCTGGTGGTGCTGGGTCTGGCGCTGGAGGAGGAGAAGGGGGAGGTTCAGGGTATGGTGGAGAACATGGAGCAGGCTACGGTGGTGGTGGTGGAGGTGGAAATGGCGGCGGTGGAGGTGTAGGATATGGGCCAGGAGGAGAATATGGGTCAGGTTATGGAAGCGGAGCAGGTGGGGGTCATGGTGGAGGAGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGTGGTGGAGGAGGGTCGGGATACGGTGGAGGAAGTGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGTGGTGGTCATGAGGGAGGATATGGTGGAGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGAGGTGGTGGTCATGACGGAGGATATGGTGGTGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAGGAGGTGGTAGCAGCCAAGGCGGTGACCACGGTGGATATGCACCTTGAGAAAGACCTCACCATTACCTTATATATTTAGTATTATTATAAAAAGGGCATATACATATGGTCATTCATCCTTTTTACCCCAAGTAATTAATAGAAAATATAAATCCATGATTGGGAAAAGGGAATAATTAATTATATAATATATCGTAAGAAAAGTGTGATGTTAAAATGGGAAACTCTTGTATCTTCACCTCTCCAAATTTTCTTCATATATAAAATTATTCTCTCTTATTTCCACAAAACCACACCAATTGCATGCATGCTCCCT
mRNA sequence
TCGTTTCATTCAGTTGTTGACTCGGTTCATTCATCTTCTTTACTCGGGTGTCGTACACGAACAAAACTCGGAATGTGAGTTGAAATGGAGGGAGTGGAGGACAGCTTAATGACACTGATTTTGTGAAATGGAGTGAAAGAGGGAGTGGGTAGGTGTAGATTTAGGGCCACCCACTTTCTCCCTTTTGCATGTGCCTCAACCACTCTCTCCACACAGCTTCCCTTATATAAACCCTTCACTCACTCCCTTTTCACTTCAACCATTCCCAAATGGCTTCTCATAAACGCCTTTCTTCTTTTCTCTTTTTTCTTTTATTAGGAATTGGGGTTTCTTCTGCAGCCAGAAATCTCTTAACTTATGGTGAAGGAAAGCCGGTGAACATTCCGGCATTTGCTTATGGTGCAGGCGGTGGTGCAGGTGGTGGTAGTGGTGGTGGATATGGTTCTCTTGGTGGTTATGGCGGTGGTGGAGGAAATGGAGGTGGTAGTGGCTATGGTTCTGTAGGAGAATATGGTGTTGGTGGCTATGGAAGTGGTGGCGGTGGAGGAAGTGGTGAGGGAGGTGGGTATGGCCCGGGAGGTGGAAATGGATATGGTGGAGGTGGAGGAAGTGGCGGTGGAGGTGGTTATGGTTCTGGGGTTAGATATGGTGAAGTTGGATATGGCTCTGGAGGTTCAGGGTATGGTGGAGGGAGTGGAGGCGGTGCTGGGTATGGTCCTGGAGGTGGAGGGTATGGAGGTGGCGGTGGAAATGGAGGTGGGGCTGGGTATGGTCCTGGAGGTTCTGGATATGGAGGAGGTGGTGGAAATGGGGGTGGTGCAGGGTATGGTCATGAGGGTGGTGGTTATGGTGGGGGTGGCGGAAATGGCGGAGGGGCAGGGTATGGTCATGAGGGTGGTGGATATGGAGGGGGCGGTGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGAGGTAGTGGATACGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTTCTGGTGGTGCTGGGTCTGGCGCTGGAGGAGGAGAAGGGGGAGGTTCAGGGTATGGTGGAGAACATGGAGCAGGCTACGGTGGTGGTGGTGGAGGTGGAAATGGCGGCGGTGGAGGTGTAGGATATGGGCCAGGAGGAGAATATGGGTCAGGTTATGGAAGCGGAGCAGGTGGGGGTCATGGTGGAGGAGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGTGGTGGAGGAGGGTCGGGATACGGTGGAGGAAGTGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGTGGTGGTCATGAGGGAGGATATGGTGGAGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGAGGTGGTGGTCATGACGGAGGATATGGTGGTGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAGGAGGTGGTAGCAGCCAAGGCGGTGACCACGGTGGATATGCACCTTGAGAAAGACCTCACCATTACCTTATATATTTAGTATTATTATAAAAAGGGCATATACATATGGTCATTCATCCTTTTTACCCCAAGTAATTAATAGAAAATATAAATCCATGATTGGGAAAAGGGAATAATTAATTATATAATATATCGTAAGAAAAGTGTGATGTTAAAATGGGAAACTCTTGTATCTTCACCTCTCCAAATTTTCTTCATATATAAAATTATTCTCTCTTATTTCCACAAAACCACACCAATTGCATGCATGCTCCCT
Coding sequence (CDS)
ATGGCTTCTCATAAACGCCTTTCTTCTTTTCTCTTTTTTCTTTTATTAGGAATTGGGGTTTCTTCTGCAGCCAGAAATCTCTTAACTTATGGTGAAGGAAAGCCGGTGAACATTCCGGCATTTGCTTATGGTGCAGGCGGTGGTGCAGGTGGTGGTAGTGGTGGTGGATATGGTTCTCTTGGTGGTTATGGCGGTGGTGGAGGAAATGGAGGTGGTAGTGGCTATGGTTCTGTAGGAGAATATGGTGTTGGTGGCTATGGAAGTGGTGGCGGTGGAGGAAGTGGTGAGGGAGGTGGGTATGGCCCGGGAGGTGGAAATGGATATGGTGGAGGTGGAGGAAGTGGCGGTGGAGGTGGTTATGGTTCTGGGGTTAGATATGGTGAAGTTGGATATGGCTCTGGAGGTTCAGGGTATGGTGGAGGGAGTGGAGGCGGTGCTGGGTATGGTCCTGGAGGTGGAGGGTATGGAGGTGGCGGTGGAAATGGAGGTGGGGCTGGGTATGGTCCTGGAGGTTCTGGATATGGAGGAGGTGGTGGAAATGGGGGTGGTGCAGGGTATGGTCATGAGGGTGGTGGTTATGGTGGGGGTGGCGGAAATGGCGGAGGGGCAGGGTATGGTCATGAGGGTGGTGGATATGGAGGGGGCGGTGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTCATGAAGGTGGTGGTGGATATGGAGGTAGTGGATACGGTGGGGGTGGCGGAAATGGCGGTGGGGCAGGGTATGGTTCTGGTGGTGCTGGGTCTGGCGCTGGAGGAGGAGAAGGGGGAGGTTCAGGGTATGGTGGAGAACATGGAGCAGGCTACGGTGGTGGTGGTGGAGGTGGAAATGGCGGCGGTGGAGGTGTAGGATATGGGCCAGGAGGAGAATATGGGTCAGGTTATGGAAGCGGAGCAGGTGGGGGTCATGGTGGAGGAGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGTGGTGGAGGAGGGTCGGGATACGGTGGAGGAAGTGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGTGGTGGTCATGAGGGAGGATATGGTGGAGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAAGTAGTGGTGGAGGAAGCGGTGGAGGAGGTGGTGGTCATGACGGAGGATATGGTGGTGGAAGCGCACATGGAGGTGCATCTGGTGGTCATGAAGGAGGATATGGAGGAGGAGGTGGTAGCAGCCAAGGCGGTGACCACGGTGGATATGCACCTTGA
Protein sequence
MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSLGGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGYGSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGGGAGYGHEGGGGYGGSGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGGGVGYGPGGEYGSGYGSGAGGGHGGGGGSSGGGSGGGGGGGSGYGGGSAHGGASGGHEGGYGGSSGGGGGHEGGYGGGSAHGGASGGHEGGYGGSSGGGSGGGGGGHDGGYGGGSAHGGASGGHEGGYGGGGGSSQGGDHGGYAP
Homology
BLAST of Cp4.1LG15g02020.1 vs. NCBI nr
Match:
XP_023553671.1 (glycine-rich cell wall structural protein-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 512 bits (1318), Expect = 4.37e-176
Identity = 452/485 (93.20%), Postives = 452/485 (93.20%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL
Sbjct: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGG---------------------------------SGYGGGGGNGGGA 300
GAGYGHEGGGGYGG SGYGGGGGNGGGA
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHDGGGYGGGGGNGGGAGYGHEGSGYGGGGGNGGGA 300
Query: 301 GYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGGGVGYGPGGEYGSGYGSGAG 360
GYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGGGVGYGPGGEYGSGYGSGAG
Sbjct: 301 GYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGGGVGYGPGGEYGSGYGSGAG 360
Query: 361 GGHGGGGGSSGGGSGGGGGGGSGYGGGSAHGGASGGHEGGYGGSSGGGGGHEGGYGGGSA 420
GGHGGGGGSSGGGSGGGGGGGSGYGGGSAHGGASGGHEGGYGGSSGGGGGHEGGYGGGSA
Sbjct: 361 GGHGGGGGSSGGGSGGGGGGGSGYGGGSAHGGASGGHEGGYGGSSGGGGGHEGGYGGGSA 420
Query: 421 HGGASGGHEGGYGGSSGGGSGGGGGGHDGGYGGGSAHGGASGGHEGGYGGGGGSSQGGDH 452
HGGASGGHEGGYGGSSGGGSGGGGGGHDGGYGGGSAHGGASGGHEGGYGGGGGSSQGGDH
Sbjct: 421 HGGASGGHEGGYGGSSGGGSGGGGGGHDGGYGGGSAHGGASGGHEGGYGGGGGSSQGGDH 480
BLAST of Cp4.1LG15g02020.1 vs. NCBI nr
Match:
XP_022967498.1 (glycine-rich cell wall structural protein 1.8-like isoform X50 [Cucurbita maxima])
HSP 1 Score: 370 bits (949), Expect = 1.20e-115
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
BLAST of Cp4.1LG15g02020.1 vs. NCBI nr
Match:
XP_022967497.1 (PE-PGRS family protein PE_PGRS16-like isoform X49 [Cucurbita maxima])
HSP 1 Score: 370 bits (949), Expect = 2.14e-115
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
BLAST of Cp4.1LG15g02020.1 vs. NCBI nr
Match:
XP_022967496.1 (fibroin heavy chain-like isoform X48 [Cucurbita maxima])
HSP 1 Score: 370 bits (949), Expect = 2.10e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
BLAST of Cp4.1LG15g02020.1 vs. NCBI nr
Match:
XP_022967495.1 (fibroin heavy chain-like isoform X47 [Cucurbita maxima])
HSP 1 Score: 370 bits (949), Expect = 2.45e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
BLAST of Cp4.1LG15g02020.1 vs. ExPASy TrEMBL
Match:
A0A6J1HV81 (glycine-rich cell wall structural protein 1.8-like isoform X50 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 370 bits (949), Expect = 5.81e-116
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
BLAST of Cp4.1LG15g02020.1 vs. ExPASy TrEMBL
Match:
A0A6J1HR00 (PE-PGRS family protein PE_PGRS16-like isoform X49 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 370 bits (949), Expect = 1.04e-115
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
BLAST of Cp4.1LG15g02020.1 vs. ExPASy TrEMBL
Match:
A0A6J1HS93 (fibroin heavy chain-like isoform X48 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 370 bits (949), Expect = 1.02e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
BLAST of Cp4.1LG15g02020.1 vs. ExPASy TrEMBL
Match:
A0A6J1HUM8 (fibroin heavy chain-like isoform X47 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 370 bits (949), Expect = 1.19e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
BLAST of Cp4.1LG15g02020.1 vs. ExPASy TrEMBL
Match:
A0A6J1HWV8 (fibroin heavy chain-like isoform X46 OS=Cucurbita maxima OX=3661 GN=LOC111466985 PE=4 SV=1)
HSP 1 Score: 370 bits (949), Expect = 1.38e-114
Identity = 348/405 (85.93%), Postives = 349/405 (86.17%), Query Frame = 0
Query: 1 MASHKRLSSFLFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGGGSGGGYGSL 60
MASHKRLSSF+FFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAG GSGGGYGSL
Sbjct: 1 MASHKRLSSFIFFLLLGIGVSSAARNLLTYGEGKPVNIPAFAYGAGGGAGAGSGGGYGSL 60
Query: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY
Sbjct: 61 GGYGGGGGNGGGSGYGSVGEYGVGGYGSGGGGGSGEGGGYGPGGGNGYGGGGGSGGGGGY 120
Query: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN
Sbjct: 121 GSGVRYGEVGYGSGGSGYGGGSGGGAGYGPGGGGYGGGGGNGGGAGYGPGGSGYGGGGGN 180
Query: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG
Sbjct: 181 GGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGGYGGGGGNGG 240
Query: 241 GAGYGHEGGGGYGGSG-------------------------------------------- 300
GAGYGHEGGGGYGG G
Sbjct: 241 GAGYGHEGGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGAGYGHEGGGYGGGGGNGGGA 300
Query: 301 --------YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGGGGGGGNGGGG 352
YGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYG GGGGGNGGGG
Sbjct: 301 GYGHEGGGYGGGGGNGGGAGYGSGGAGSGAGGGEGGGSGYGGEHGAGYGSGGGGGNGGGG 360
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023553671.1 | 4.37e-176 | 93.20 | glycine-rich cell wall structural protein-like [Cucurbita pepo subsp. pepo] | [more] |
XP_022967498.1 | 1.20e-115 | 85.93 | glycine-rich cell wall structural protein 1.8-like isoform X50 [Cucurbita maxima... | [more] |
XP_022967497.1 | 2.14e-115 | 85.93 | PE-PGRS family protein PE_PGRS16-like isoform X49 [Cucurbita maxima] | [more] |
XP_022967496.1 | 2.10e-114 | 85.93 | fibroin heavy chain-like isoform X48 [Cucurbita maxima] | [more] |
XP_022967495.1 | 2.45e-114 | 85.93 | fibroin heavy chain-like isoform X47 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1HV81 | 5.81e-116 | 85.93 | glycine-rich cell wall structural protein 1.8-like isoform X50 OS=Cucurbita maxi... | [more] |
A0A6J1HR00 | 1.04e-115 | 85.93 | PE-PGRS family protein PE_PGRS16-like isoform X49 OS=Cucurbita maxima OX=3661 GN... | [more] |
A0A6J1HS93 | 1.02e-114 | 85.93 | fibroin heavy chain-like isoform X48 OS=Cucurbita maxima OX=3661 GN=LOC111466985... | [more] |
A0A6J1HUM8 | 1.19e-114 | 85.93 | fibroin heavy chain-like isoform X47 OS=Cucurbita maxima OX=3661 GN=LOC111466985... | [more] |
A0A6J1HWV8 | 1.38e-114 | 85.93 | fibroin heavy chain-like isoform X46 OS=Cucurbita maxima OX=3661 GN=LOC111466985... | [more] |
Match Name | E-value | Identity | Description | |
Relationships
This mRNA is a part of the following gene feature(s):
The following five_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG15g02020.1:five_prime_utr:001 | Cp4.1LG15g02020.1:five_prime_utr:001 | five_prime_UTR |
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG15g02020.1:exon:001 | Cp4.1LG15g02020.1:exon:001 | exon |
Cp4.1LG15g02020.1:exon:002 | Cp4.1LG15g02020.1:exon:002 | exon |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG15g02020.1:cds:001 | Cp4.1LG15g02020.1:cds:001 | CDS |
Cp4.1LG15g02020.1:cds:002 | Cp4.1LG15g02020.1:cds:002 | CDS |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG15g02020.1:three_prime_utr:001 | Cp4.1LG15g02020.1:three_prime_utr:001 | three_prime_UTR |
The following polypeptide feature(s) derives from this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG15g02020.1 | Cp4.1LG15g02020.1-protein | polypeptide |