Cp4.1LG03g01920 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG03g01920
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Protamine P1 family protein
Location: Cp4.1LG03: 842368 .. 843399 (+)
RNA-Seq Expression: Cp4.1LG03g01920
Synteny: Cp4.1LG03g01920
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGCCATCGGCGAAGCCGATTTCGAGTCCAGGCAGGACGGAGAAATTTCCGCCGCCATTGATGAGGTTTTTGAGGAGCAATGTGGGAAGTAAAAGCAGAGGAAGGTCGCGTGCGAGTCCGATGATGTTCATGAGAAAGAAAAACAACGCCACCGCCATTGAAACCCAAGAGCCTTCGTCCCCTAAAGTCACTTGCATCGGCCAGGTCCGAGTTCGCCGCTCCTCCACGCGCCGTAGCAGACGTTCTGGTGCGCCTACGCGCCGCCGTTGCCGCTGGTTCCGAGCTGCTCTGTTATGCCCCTGTTTTCGAAAAAAAATCAAGCCCAATTCTTCTCAAATATTTCAGAGATGGGTCTCATTTTTCCCAGTTGGGTTCCGCCGAAAATCGAGAAGTAGAAAAAAATCGCCGCCGCATGAAACCCCCGTCCACGGCGGATTTGAAATTCCGAACTCAGAAACACAAGTTGTCGCCCCCATCGACGATGACGAAGGAGAAGAAACAGTCGAAACGTTCATTTCTTCTCATTCTTCGCCGCCCAAAAATGCATTGTTACTAACAAGATGCAGATCTGCTCCATATCGATCGACATCATTAGCGAGTAGATTTTGGGGCTCTCCTCTGAGAACCGAGAAAAATCACGAAGAAGAGGAAGAAGAAGAAGAAGAAGAAGAACAGAGCACAAAGCCCAACAATGGCGGCAAAACGGTCGAGATTGAGAAACCCACATCCCAAAGAGCCTCAGTTTCTGATCAAGATCCGAATGGAGGCTTAGAATTCGAGGAGGCCAAGAAATTTATGAAGAACATCGACGACGATTCTACCACAGAAAGAATCATCAAATCGGGCAATATCGAACGAGAAAAAACAGGGGAAGAAGAAGAAGGACTGGGAAGCTCGTCTCGGCCCTTGATTTTGACACGTTGCAAATCTGCGCCGTCGAGAACGGCGGAGAAGATGAACCCAGAACTGGGATTCTGGAAAAAGAGAAGGTTGGGGATTACTGATTCAAGCTCGCCAAACAATTCATGA

mRNA sequence

ATGAAGCCATCGGCGAAGCCGATTTCGAGTCCAGGCAGGACGGAGAAATTTCCGCCGCCATTGATGAGGTTTTTGAGGAGCAATGTGGGAAGTAAAAGCAGAGGAAGGTCGCGTGCGAGTCCGATGATGTTCATGAGAAAGAAAAACAACGCCACCGCCATTGAAACCCAAGAGCCTTCGTCCCCTAAAGTCACTTGCATCGGCCAGGTCCGAGTTCGCCGCTCCTCCACGCGCCGTAGCAGACGTTCTGGTGCGCCTACGCGCCGCCGTTGCCGCTGGTTCCGAGCTGCTCTGTTATGCCCCTGTTTTCGAAAAAAAATCAAGCCCAATTCTTCTCAAATATTTCAGAGATGGGTCTCATTTTTCCCAGTTGGGTTCCGCCGAAAATCGAGAAGTAGAAAAAAATCGCCGCCGCATGAAACCCCCGTCCACGGCGGATTTGAAATTCCGAACTCAGAAACACAAGTTGTCGCCCCCATCGACGATGACGAAGGAGAAGAAACAGTCGAAACGTTCATTTCTTCTCATTCTTCGCCGCCCAAAAATGCATTGTTACTAACAAGATGCAGATCTGCTCCATATCGATCGACATCATTAGCGAGTAGATTTTGGGGCTCTCCTCTGAGAACCGAGAAAAATCACGAAGAAGAGGAAGAAGAAGAAGAAGAAGAAGAACAGAGCACAAAGCCCAACAATGGCGGCAAAACGGTCGAGATTGAGAAACCCACATCCCAAAGAGCCTCAGTTTCTGATCAAGATCCGAATGGAGGCTTAGAATTCGAGGAGGCCAAGAAATTTATGAAGAACATCGACGACGATTCTACCACAGAAAGAATCATCAAATCGGGCAATATCGAACGAGAAAAAACAGGGGAAGAAGAAGAAGGACTGGGAAGCTCGTCTCGGCCCTTGATTTTGACACGTTGCAAATCTGCGCCGTCGAGAACGGCGGAGAAGATGAACCCAGAACTGGGATTCTGGAAAAAGAGAAGGTTGGGGATTACTGATTCAAGCTCGCCAAACAATTCATGA

Coding sequence (CDS)

ATGAAGCCATCGGCGAAGCCGATTTCGAGTCCAGGCAGGACGGAGAAATTTCCGCCGCCATTGATGAGGTTTTTGAGGAGCAATGTGGGAAGTAAAAGCAGAGGAAGGTCGCGTGCGAGTCCGATGATGTTCATGAGAAAGAAAAACAACGCCACCGCCATTGAAACCCAAGAGCCTTCGTCCCCTAAAGTCACTTGCATCGGCCAGGTCCGAGTTCGCCGCTCCTCCACGCGCCGTAGCAGACGTTCTGGTGCGCCTACGCGCCGCCGTTGCCGCTGGTTCCGAGCTGCTCTGTTATGCCCCTGTTTTCGAAAAAAAATCAAGCCCAATTCTTCTCAAATATTTCAGAGATGGGTCTCATTTTTCCCAGTTGGGTTCCGCCGAAAATCGAGAAGTAGAAAAAAATCGCCGCCGCATGAAACCCCCGTCCACGGCGGATTTGAAATTCCGAACTCAGAAACACAAGTTGTCGCCCCCATCGACGATGACGAAGGAGAAGAAACAGTCGAAACGTTCATTTCTTCTCATTCTTCGCCGCCCAAAAATGCATTGTTACTAACAAGATGCAGATCTGCTCCATATCGATCGACATCATTAGCGAGTAGATTTTGGGGCTCTCCTCTGAGAACCGAGAAAAATCACGAAGAAGAGGAAGAAGAAGAAGAAGAAGAAGAACAGAGCACAAAGCCCAACAATGGCGGCAAAACGGTCGAGATTGAGAAACCCACATCCCAAAGAGCCTCAGTTTCTGATCAAGATCCGAATGGAGGCTTAGAATTCGAGGAGGCCAAGAAATTTATGAAGAACATCGACGACGATTCTACCACAGAAAGAATCATCAAATCGGGCAATATCGAACGAGAAAAAACAGGGGAAGAAGAAGAAGGACTGGGAAGCTCGTCTCGGCCCTTGATTTTGACACGTTGCAAATCTGCGCCGTCGAGAACGGCGGAGAAGATGAACCCAGAACTGGGATTCTGGAAAAAGAGAAGGTTGGGGATTACTGATTCAAGCTCGCCAAACAATTCATGA

Protein sequence

MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPSSPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVSFFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPPKNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIEKPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLGSSSRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS
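
As a quick consistency check, the coding sequence can be translated and compared with the protein sequence listed above. The following is a minimal sketch in Python using the standard genetic code. The function name `translate` and the choice to test only the first 30 nucleotides are illustrative assumptions made here, not part of the database record.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGAAGCCATCGGCGAAGCCGATTTCGAGT"
protein_prefix = translate(cds_prefix)

# Gene span from the Location field (1-based, inclusive coordinates).
gene_length = 843399 - 842368 + 1
```

Here `protein_prefix` evaluates to `MKPSAKPISS`, the first ten residues of the protein sequence, and `gene_length` is 1032, which equals 3 × (343 + 1): 343 codons plus one stop codon.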
Homology
BLAST of Cp4.1LG03g01920 vs. NCBI nr
Match: XP_023526734.1 (uncharacterized protein LOC111790135 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 671 bits (1732), Expect = 3.87e-243
Identity = 343/343 (100.00%), Positives = 343/343 (100.00%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS
Sbjct: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120
           SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS
Sbjct: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120

Query: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP 180
           FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP
Sbjct: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP 180

Query: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE 240
           KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE
Sbjct: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE 240

Query: 241 KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLGSS 300
           KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLGSS
Sbjct: 241 KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLGSS 300

Query: 301 SRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS 343
           SRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS
Sbjct: 301 SRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS 343

BLAST of Cp4.1LG03g01920 vs. NCBI nr
Match: KAG7017350.1 (hypothetical protein SDJN02_19215, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 635 bits (1638), Expect = 7.34e-229
Identity = 328/344 (95.35%), Positives = 332/344 (96.51%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS
Sbjct: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120
           SPKVTCIGQVRVRRSSTRRSRRSG PTRRRCRW RAALLCPCFRKKIKPNSSQIFQRWVS
Sbjct: 61  SPKVTCIGQVRVRRSSTRRSRRSGTPTRRRCRWIRAALLCPCFRKKIKPNSSQIFQRWVS 120

Query: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP 180
           FF VGFRRKSRSRKKSPPHETPVHGG EIPN+ETQVVAPIDDDEGEETVE FISSHSSPP
Sbjct: 121 FFQVGFRRKSRSRKKSPPHETPVHGGIEIPNAETQVVAPIDDDEGEETVEAFISSHSSPP 180

Query: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE 240
           KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEE+E    QSTKPNNGGKTVEIE
Sbjct: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEKE----QSTKPNNGGKTVEIE 240

Query: 241 KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEE-GLGS 300
           KPTSQRASVSDQDP+GGLEFEE KKFMKNIDDDSTTERIIKS NIEREKTGEEEE GLGS
Sbjct: 241 KPTSQRASVSDQDPSGGLEFEEDKKFMKNIDDDSTTERIIKSANIEREKTGEEEEEGLGS 300

Query: 301 SSRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS 343
           SSRPLILTRCKSAPSRTAEKMNPE+GFWKKRRLGITDSSSPNNS
Sbjct: 301 SSRPLILTRCKSAPSRTAEKMNPEMGFWKKRRLGITDSSSPNNS 340
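
The Identity figure in an HSP header is the number of aligned columns in which query and subject carry the same residue, divided by the alignment length including gap columns. The sketch below, in Python, counts identities for a single alignment block. The helper name `count_identities` is hypothetical, and the two rows are copied from the final block of the KAG7017350.1 hit above.

```python
def count_identities(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned rows.

    Gap characters ('-') count toward the alignment length but are never
    counted as identities.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    identical = sum(
        1 for q, s in zip(query_row, subject_row) if q == s and q != "-"
    )
    return identical, len(query_row)


# Final alignment block of the KAG7017350.1 hit.
query_row = "SSRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS"
subject_row = "SSRPLILTRCKSAPSRTAEKMNPEMGFWKKRRLGITDSSSPNNS"

identical, length = count_identities(query_row, subject_row)
percent_identity = 100.0 * identical / length
```

For this block the single L/M substitution gives 43 identical columns out of 44. Summing the counts over every block of an HSP is expected to reproduce the header fraction (328/344 for this hit).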

BLAST of Cp4.1LG03g01920 vs. NCBI nr
Match: XP_022983984.1 (uncharacterized protein LOC111482438 [Cucurbita maxima])

HSP 1 Score: 630 bits (1625), Expect = 7.30e-227
Identity = 325/343 (94.75%), Positives = 329/343 (95.92%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSP RTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS
Sbjct: 1   MKPSAKPISSPSRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120
           SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRW RAALLCPCFRKKIKPNSSQIFQRWVS
Sbjct: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWIRAALLCPCFRKKIKPNSSQIFQRWVS 120

Query: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP 180
           FF VGFRRKSRSRKKSPPHETPVHGG EIP SET VVAPIDDDEGEETVE FISSHSSPP
Sbjct: 121 FFQVGFRRKSRSRKKSPPHETPVHGGIEIPKSETLVVAPIDDDEGEETVEAFISSHSSPP 180

Query: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE 240
           KNALLLTRCRSAPYRSTSLASRFWGSPLR EKNH+EEEEEEEE  QSTKPNNGGKTVEIE
Sbjct: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRNEKNHKEEEEEEEE--QSTKPNNGGKTVEIE 240

Query: 241 KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLGSS 300
           KPTSQRASVSDQDPNGGLEF E KKFMKNI+DDSTTERIIKS NIEREKTGEEEEGLGSS
Sbjct: 241 KPTSQRASVSDQDPNGGLEFVEDKKFMKNINDDSTTERIIKSANIEREKTGEEEEGLGSS 300

Query: 301 SRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS 343
           SRPLILTRCKSAPSRTAEKMNPE+GFWK+RRLGITDS SPNNS
Sbjct: 301 SRPLILTRCKSAPSRTAEKMNPEMGFWKRRRLGITDSISPNNS 341

BLAST of Cp4.1LG03g01920 vs. NCBI nr
Match: XP_022934304.1 (uncharacterized protein LOC111441509 [Cucurbita moschata])

HSP 1 Score: 627 bits (1618), Expect = 7.90e-226
Identity = 324/340 (95.29%), Positives = 327/340 (96.18%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS
Sbjct: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120
           SPKVTCIGQVRVRRSSTRRSRRSG PTRRRCRW RAALLCPCFRKKIKPNSSQIFQRWVS
Sbjct: 61  SPKVTCIGQVRVRRSSTRRSRRSGTPTRRRCRWIRAALLCPCFRKKIKPNSSQIFQRWVS 120

Query: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP 180
           FF VGFRRKSRSRKKSPPHETPVHGG EIPNSETQVVAPIDDDEGEETVE FISSHSSPP
Sbjct: 121 FFQVGFRRKSRSRKKSPPHETPVHGGIEIPNSETQVVAPIDDDEGEETVEAFISSHSSPP 180

Query: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE 240
           KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEE     QSTKPNNGGKTVEIE
Sbjct: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEE-----QSTKPNNGGKTVEIE 240

Query: 241 KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLGSS 300
           KPTSQRASVSDQDP+GGLEFEE KKFMKNIDDDSTTE IIKSGNIEREKTGEEEE LGSS
Sbjct: 241 KPTSQRASVSDQDPSGGLEFEEDKKFMKNIDDDSTTE-IIKSGNIEREKTGEEEEALGSS 300

Query: 301 SRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSP 340
           SRPLILTRCKSAPSRTAEKMNPE+GFWK+RRLGITDSSSP
Sbjct: 301 SRPLILTRCKSAPSRTAEKMNPEMGFWKRRRLGITDSSSP 334

BLAST of Cp4.1LG03g01920 vs. NCBI nr
Match: KAG6580594.1 (hypothetical protein SDJN03_20596, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 639 bits (1647), Expect = 2.06e-222
Identity = 331/344 (96.22%), Positives = 334/344 (97.09%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS
Sbjct: 516 MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 575

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120
           SPKVTCIGQVRVRRSSTRRSRRSG PTRRRCRW RAALLCPCFRKKIKPNSSQIFQRWVS
Sbjct: 576 SPKVTCIGQVRVRRSSTRRSRRSGTPTRRRCRWIRAALLCPCFRKKIKPNSSQIFQRWVS 635

Query: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP 180
           FF VGFRRKSRSRKKSPPHETPVHGG EIPN+ETQVVAPIDDDEGEETV  FISSHSSPP
Sbjct: 636 FFQVGFRRKSRSRKKSPPHETPVHGGIEIPNAETQVVAPIDDDEGEETVGAFISSHSSPP 695

Query: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE 240
           KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEE QSTKPNNGGKTVEIE
Sbjct: 696 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEE-QSTKPNNGGKTVEIE 755

Query: 241 KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEE-GLGS 300
           KPTSQRASVSDQDP+GGLEFEE KKFMKNIDDDSTTERIIKS NIEREKTGEEEE GLGS
Sbjct: 756 KPTSQRASVSDQDPSGGLEFEEDKKFMKNIDDDSTTERIIKSANIEREKTGEEEEEGLGS 815

Query: 301 SSRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS 343
           SSRPLILTRCKSAPSRTAEKMNPE+GFWKKRRLGITDSSSPNNS
Sbjct: 816 SSRPLILTRCKSAPSRTAEKMNPEMGFWKKRRLGITDSSSPNNS 858

BLAST of Cp4.1LG03g01920 vs. ExPASy TrEMBL
Match: A0A6J1J954 (uncharacterized protein LOC111482438 OS=Cucurbita maxima OX=3661 GN=LOC111482438 PE=4 SV=1)

HSP 1 Score: 630 bits (1625), Expect = 3.53e-227
Identity = 325/343 (94.75%), Positives = 329/343 (95.92%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSP RTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS
Sbjct: 1   MKPSAKPISSPSRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120
           SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRW RAALLCPCFRKKIKPNSSQIFQRWVS
Sbjct: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWIRAALLCPCFRKKIKPNSSQIFQRWVS 120

Query: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP 180
           FF VGFRRKSRSRKKSPPHETPVHGG EIP SET VVAPIDDDEGEETVE FISSHSSPP
Sbjct: 121 FFQVGFRRKSRSRKKSPPHETPVHGGIEIPKSETLVVAPIDDDEGEETVEAFISSHSSPP 180

Query: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE 240
           KNALLLTRCRSAPYRSTSLASRFWGSPLR EKNH+EEEEEEEE  QSTKPNNGGKTVEIE
Sbjct: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRNEKNHKEEEEEEEE--QSTKPNNGGKTVEIE 240

Query: 241 KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLGSS 300
           KPTSQRASVSDQDPNGGLEF E KKFMKNI+DDSTTERIIKS NIEREKTGEEEEGLGSS
Sbjct: 241 KPTSQRASVSDQDPNGGLEFVEDKKFMKNINDDSTTERIIKSANIEREKTGEEEEGLGSS 300

Query: 301 SRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS 343
           SRPLILTRCKSAPSRTAEKMNPE+GFWK+RRLGITDS SPNNS
Sbjct: 301 SRPLILTRCKSAPSRTAEKMNPEMGFWKRRRLGITDSISPNNS 341

BLAST of Cp4.1LG03g01920 vs. ExPASy TrEMBL
Match: A0A6J1F274 (uncharacterized protein LOC111441509 OS=Cucurbita moschata OX=3662 GN=LOC111441509 PE=4 SV=1)

HSP 1 Score: 627 bits (1618), Expect = 3.82e-226
Identity = 324/340 (95.29%), Positives = 327/340 (96.18%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS
Sbjct: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120
           SPKVTCIGQVRVRRSSTRRSRRSG PTRRRCRW RAALLCPCFRKKIKPNSSQIFQRWVS
Sbjct: 61  SPKVTCIGQVRVRRSSTRRSRRSGTPTRRRCRWIRAALLCPCFRKKIKPNSSQIFQRWVS 120

Query: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFISSHSSPP 180
           FF VGFRRKSRSRKKSPPHETPVHGG EIPNSETQVVAPIDDDEGEETVE FISSHSSPP
Sbjct: 121 FFQVGFRRKSRSRKKSPPHETPVHGGIEIPNSETQVVAPIDDDEGEETVEAFISSHSSPP 180

Query: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVEIE 240
           KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEE     QSTKPNNGGKTVEIE
Sbjct: 181 KNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEE-----QSTKPNNGGKTVEIE 240

Query: 241 KPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLGSS 300
           KPTSQRASVSDQDP+GGLEFEE KKFMKNIDDDSTTE IIKSGNIEREKTGEEEE LGSS
Sbjct: 241 KPTSQRASVSDQDPSGGLEFEEDKKFMKNIDDDSTTE-IIKSGNIEREKTGEEEEALGSS 300

Query: 301 SRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSP 340
           SRPLILTRCKSAPSRTAEKMNPE+GFWK+RRLGITDSSSP
Sbjct: 301 SRPLILTRCKSAPSRTAEKMNPEMGFWKRRRLGITDSSSP 334

BLAST of Cp4.1LG03g01920 vs. ExPASy TrEMBL
Match: A0A5D3DNC1 (Chromo domain-containing protein cec-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G002420 PE=4 SV=1)

HSP 1 Score: 537 bits (1384), Expect = 1.88e-190
Identity = 283/345 (82.03%), Positives = 302/345 (87.54%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMR+K NATAIETQEPS
Sbjct: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRRKTNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRR-CRWFRAALLCPCFRKKIKPNSSQIFQRWV 120
           SPKVTCIGQVRVRR++TRR +RSG PTRRR CRW RAAL CPCFRKK KPNSS IFQR V
Sbjct: 61  SPKVTCIGQVRVRRATTRRRKRSGTPTRRRRCRWIRAALFCPCFRKKFKPNSSPIFQRLV 120

Query: 121 SFFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPID-DDEGEETVETFISSHSS 180
           SFF  GFRRK + R  SPP ETP HGG EI N+E QVVA +D D+E EET E  ISS+SS
Sbjct: 121 SFFQCGFRRKPKVRTNSPPRETPFHGGVEISNTEIQVVAAVDNDEEEEETAEALISSNSS 180

Query: 181 PPKNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVE 240
           PPKNALLLTRCRSAPYRSTSLASRFWGSPL   KN E +EE+EEE+EQSTKPNNGGKTVE
Sbjct: 181 PPKNALLLTRCRSAPYRSTSLASRFWGSPL---KNEENQEEKEEEKEQSTKPNNGGKTVE 240

Query: 241 IEKPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLG 300
           IEKPTSQRASVSDQDP+GGLEFEE ++F KNID+ S  ERI+ S NI+REKTGEEEE LG
Sbjct: 241 IEKPTSQRASVSDQDPSGGLEFEEDEEFAKNIDEHSIPERIVTSANIKREKTGEEEEALG 300

Query: 301 SSSRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS 343
           SSSRPLILTRCKS PSRTAEKMNPE+GFWKKRRLGI DS  PNNS
Sbjct: 301 SSSRPLILTRCKSEPSRTAEKMNPEVGFWKKRRLGIPDSCLPNNS 342

BLAST of Cp4.1LG03g01920 vs. ExPASy TrEMBL
Match: A0A1S3B5Q8 (uncharacterized protein LOC103486466 OS=Cucumis melo OX=3656 GN=LOC103486466 PE=4 SV=1)

HSP 1 Score: 537 bits (1384), Expect = 1.88e-190
Identity = 283/345 (82.03%), Positives = 302/345 (87.54%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMR+K NATAIETQEPS
Sbjct: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRRKTNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRR-CRWFRAALLCPCFRKKIKPNSSQIFQRWV 120
           SPKVTCIGQVRVRR++TRR +RSG PTRRR CRW RAAL CPCFRKK KPNSS IFQR V
Sbjct: 61  SPKVTCIGQVRVRRATTRRRKRSGTPTRRRRCRWIRAALFCPCFRKKFKPNSSPIFQRLV 120

Query: 121 SFFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPID-DDEGEETVETFISSHSS 180
           SFF  GFRRK + R  SPP ETP HGG EI N+E QVVA +D D+E EET E  ISS+SS
Sbjct: 121 SFFQCGFRRKPKVRTNSPPRETPFHGGVEISNTEIQVVAAVDNDEEEEETAEALISSNSS 180

Query: 181 PPKNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTVE 240
           PPKNALLLTRCRSAPYRSTSLASRFWGSPL   KN E +EE+EEE+EQSTKPNNGGKTVE
Sbjct: 181 PPKNALLLTRCRSAPYRSTSLASRFWGSPL---KNEENQEEKEEEKEQSTKPNNGGKTVE 240

Query: 241 IEKPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEGLG 300
           IEKPTSQRASVSDQDP+GGLEFEE ++F KNID+ S  ERI+ S NI+REKTGEEEE LG
Sbjct: 241 IEKPTSQRASVSDQDPSGGLEFEEDEEFAKNIDEHSIPERIVTSANIKREKTGEEEEALG 300

Query: 301 SSSRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGITDSSSPNNS 343
           SSSRPLILTRCKS PSRTAEKMNPE+GFWKKRRLGI DS  PNNS
Sbjct: 301 SSSRPLILTRCKSEPSRTAEKMNPEVGFWKKRRLGIPDSCLPNNS 342

BLAST of Cp4.1LG03g01920 vs. ExPASy TrEMBL
Match: A0A0A0LDY5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736940 PE=4 SV=1)

HSP 1 Score: 533 bits (1372), Expect = 1.57e-188
Identity = 284/348 (81.61%), Positives = 301/348 (86.49%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPS 60
           MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMR+KNNATAIETQEPS
Sbjct: 1   MKPSAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRRKNNATAIETQEPS 60

Query: 61  SPKVTCIGQVRVRRSSTRRSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNSSQIFQRWVS 120
           SPKVTCIGQVRVRRSSTRR +RSG  TRRRCRW RAA  CPCFRKK KPNSS IFQR VS
Sbjct: 61  SPKVTCIGQVRVRRSSTRRRKRSGTRTRRRCRWIRAARCCPCFRKKFKPNSSPIFQRLVS 120

Query: 121 FFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDD---EGEETVETFISSHS 180
           FF  GFRRK + R  SPP E P  GG EI N E QVVA +DDD   E EET E  ISS+S
Sbjct: 121 FFQCGFRRKPKVRTNSPPREPPFRGGVEISNKEIQVVAAVDDDDAEEEEETAEALISSNS 180

Query: 181 SPPKNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGGKTV 240
           SPPKNALLLTRCRSAPYRSTSLASRFWGSPL+ E+N EE EEEE+E+EQSTKPNNGGKTV
Sbjct: 181 SPPKNALLLTRCRSAPYRSTSLASRFWGSPLKNEENQEETEEEEKEKEQSTKPNNGGKTV 240

Query: 241 EIEKPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEEEG- 300
           EIEKPTSQRASVSDQDP+GGLEFEE ++F KNID+ S  ERI+KS NI++EKTGEEEE  
Sbjct: 241 EIEKPTSQRASVSDQDPSGGLEFEENEEFAKNIDEHSVPERIVKSANIKQEKTGEEEEEV 300

Query: 301 LGSSSRPLILTRCKSAPSRTAEKMNPELG-FWKKRRLGITDSSSPNNS 343
           LGSSSRPLILTRCKS PSRTAEKMNPE+G FWKKRRLGI DS  PNNS
Sbjct: 301 LGSSSRPLILTRCKSEPSRTAEKMNPEVGLFWKKRRLGIPDSCLPNNS 348

BLAST of Cp4.1LG03g01920 vs. TAIR 10
Match: AT2G37100.1 (protamine P1 family protein)

HSP 1 Score: 155.2 bits (391), Expect = 9.3e-38
Identity = 130/344 (37.79%), Positives = 175/344 (50.87%), Query Frame = 0

Query: 4   SAKPISSPGRTEKFPPPLMRFLRSNVGSKSRGRSRASPMMFMRKKNNATAIETQEPSSPK 63
           S++P+SSPGRTE  PP LMRFLR+   S+SR RSR+   +F R+KN + A ETQEP+SPK
Sbjct: 5   SSRPVSSPGRTEN-PPLLMRFLRTK--SRSRSRSRSRRPIFFRRKNASAAAETQEPTSPK 64

Query: 64  VTCIGQVRVRRSSTRR---SRRSGAPTR-----RRCRWFRAALLCPCFRKKIKPNS-SQI 123
           VTC+GQVR+ RS   +   +R SG  T      RRC W + A  C  F   IKP   S +
Sbjct: 65  VTCMGQVRINRSKKPKPETARVSGGATERRRQSRRCGWVKNAFPCHSFTGIIKPTCFSPV 124

Query: 124 FQRWVSFFPVGFRRKSRSRKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVETFIS 183
           +++W SF    F +KS  R  S   E P+ G   +   E +      ++  EE   +  S
Sbjct: 125 WRKWKSFSHASFSKKSEKRSSSSRSE-PIFGRSTVEPEEPEETR--KEENQEEEASSCKS 184

Query: 184 SHSSPPKNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKPNNGG 243
             ++PP+NA LLTRCRSAPYRS S A+  +                E++EE +  P    
Sbjct: 185 FTATPPRNAFLLTRCRSAPYRSPSSANSLF----------------EDQEETNKAPFQ-- 244

Query: 244 KTVEIEKPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKTGEEE 303
                   +S+  SVS++      E               TTER+  S    RE    EE
Sbjct: 245 -----RHASSENVSVSEEPKTSVTE---------------TTERLEDS---RRESAASEE 297

Query: 304 ---EGLGSSSRPLILTRCKSAPSRTAEKMNPELGFWKKRRLGIT 336
                LGS  + LILTRC S P+R    + PE+G+ +  RLG T
Sbjct: 305 PKRSVLGSPRQCLILTRCNSEPAR----LVPEMGYRQNPRLGFT 297

BLAST of Cp4.1LG03g01920 vs. TAIR 10
Match: AT5G03110.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; BEST Arabidopsis thaliana protein match is: protamine P1 family protein (TAIR:AT2G37100.1); Has 81 Blast hits to 73 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 81; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 139.0 bits (349), Expect = 6.9e-33
Identity = 125/330 (37.88%), Positives = 164/330 (49.70%), Query Frame = 0

Query: 1   MKPSAKPISSPGRTEKFPPPLMRFLR--SNVG------SKSRGRSRASPMMFMRKKNNAT 60
           M  S +P+SSPGR EK+PPP M FLR  SN G      S+SRGRSRASP +F+R+  +A 
Sbjct: 1   MMISRRPVSSPGRVEKYPPPFMGFLRSKSNGGSTSRSRSRSRGRSRASP-LFVRRNKSAA 60

Query: 61  AIETQEPSSPKVTCIGQVRVRRSSTR-RSRRSGAPTRRRCRWFRAALLCPCFRKKIKPNS 120
           A+  QEPSSPKVTC+GQVRV RS  + +      PTRRRC W R A     F  KIK  +
Sbjct: 61  AV-AQEPSSPKVTCMGQVRVNRSKPKIKPESRDNPTRRRCEWLRNASFYNKFAGKIKTMT 120

Query: 121 SQIFQRWVSFFPVGFRRKSRS-RKKSPPHETPVHGGFEIPNSETQVVAPIDDDEGEETVE 180
                 W  +    F    R+ ++K  P     H   E   S  ++   I+ +E  E  +
Sbjct: 121 F-----WPKWRLCSFSCACRNLKEKDSPRSQLDHPTTE---SVREIKEEIEGEENFEIPK 180

Query: 181 TFISSHSSPPKNALLLTRCRSAPYRSTSLASRFWGSPLRTEKNHEEEEEEEEEEEQSTKP 240
            F+S  ++PP NALLLTR RSAPYRS+SLA RFW          EE  + E E +Q+ + 
Sbjct: 181 LFVSPATTPPINALLLTRSRSAPYRSSSLAFRFW----------EENNQREVESQQNVRS 240

Query: 241 NNGGKTVEIEKPTSQRASVSDQDPNGGLEFEEAKKFMKNIDDDSTTERIIKSGNIEREKT 300
                 V IEK             NG   F E  +++   +++ T  R +          
Sbjct: 241 EKTDSEVPIEK------------ING---FHE-PEYVDRDEEEFTKLRFV---------- 273

Query: 301 GEEEEGLGSSSRPLILTRCKSAPSRTAEKM 321
                      R  +LTR KS P+R  EKM
Sbjct: 301 -----------RQPVLTRSKSEPARIGEKM 273

The following BLAST results are available for this feature:
BLAST vs. NCBI nr
Match Name      E-value    Identity (%)  Description
XP_023526734.1  3.87e-243  100.00        uncharacterized protein LOC111790135 [Cucurbita pepo subsp. pepo]
KAG7017350.1    7.34e-229  95.35         hypothetical protein SDJN02_19215, partial [Cucurbita argyrosperma subsp. argyro...
XP_022983984.1  7.30e-227  94.75         uncharacterized protein LOC111482438 [Cucurbita maxima]
XP_022934304.1  7.90e-226  95.29         uncharacterized protein LOC111441509 [Cucurbita moschata]
KAG6580594.1    2.06e-222  96.22         hypothetical protein SDJN03_20596, partial [Cucurbita argyrosperma subsp. sorori...

BLAST vs. ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1J954      3.53e-227  94.75         uncharacterized protein LOC111482438 OS=Cucurbita maxima OX=3661 GN=LOC111482438...
A0A6J1F274      3.82e-226  95.29         uncharacterized protein LOC111441509 OS=Cucurbita moschata OX=3662 GN=LOC1114415...
A0A5D3DNC1      1.88e-190  82.03         Chromo domain-containing protein cec-1 OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A1S3B5Q8      1.88e-190  82.03         uncharacterized protein LOC103486466 OS=Cucumis melo OX=3656 GN=LOC103486466 PE=...
A0A0A0LDY5      1.57e-188  81.61         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736940 PE=4 SV=1

BLAST vs. TAIR 10
Match Name      E-value    Identity (%)  Description
AT2G37100.1     9.3e-38    37.79         protamine P1 family protein
AT5G03110.1     6.9e-33    37.88         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term     Source Description                  Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 257..296
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 129..155
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 230..252
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 205..304
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 1..39
None      No IPR available  PANTHER      PTHR33448       CHLOROPLAST PROTEIN HCF243-RELATED  coord: 1..337
None      No IPR available  PANTHER      PTHR33448:SF10  PROTAMINE P1 FAMILY PROTEIN         coord: 1..337
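
Several of the mobidb-lite disorder predictions above overlap, so the number of distinct disordered residues is smaller than the sum of the individual ranges. The sketch below, in Python, merges the five coordinate ranges and counts the residues they cover. The function name `merge_intervals` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption about the annotation format.

```python
def merge_intervals(intervals):
    """Merge overlapping intervals given as (start, end), ends inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(interval) for interval in merged]


# mobidb-lite disorder_prediction coordinates from the InterPro table.
disorder = [(257, 296), (129, 155), (230, 252), (205, 304), (1, 39)]

merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)
fraction = covered / 343  # 343 = length of the protein sequence
```

Under these assumptions the ranges collapse to 1..39, 129..155, and 205..304, covering 166 of the 343 residues.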

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG03g01920.1  Cp4.1LG03g01920.1  mRNA