Cp4.1LG13g00330 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG13g00330
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionDUF2232 domain-containing protein
LocationCp4.1LG13: 193934 .. 198890 (-)
RNA-Seq ExpressionCp4.1LG13g00330
SyntenyCp4.1LG13g00330
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCTCTCGAGCCACGGGTAGAACCCCCACGATGGTAGTGCGATGTAGATCGGTTTAGAAGGGTAGCGTGAAGGTTTGATGAGTGTTTGATATTTCCTCCAACGTAGAACGAAGCCTACAACGGGGGAGTAAGGCTAAGGCCCAACCCCCGTCCCGGAAAAGCCCAGAGGGAAGACTGCAGTTCGCGTAAATCTTCAACTTCTTGAGGTTTTTGATTCTCCTAAGGGGCTGTGTCCGATAACGAAGCGCGTGCGGCTAACAACGTCTGTTCTGTGCGCCTCTCTGCAATTTCACTTCTTTCATTTCTTGGCTTTTTCTTAATTGAACATTCATGCCCATAATAATTATTTTTAAAAAAACGAAGATATTTGCTAGCGGATATTGTCCTATTTGGCCTTTTCTTTCGGCTTCCACTCAAGGTTTTAAAACGCTTCTGAGGGGAAGGTTTTCATACCCTTATAAATGGTGTTTTGTTCTCCGCCCCAACCAATGTGGGACATCATAATCCACCCCCTTCGGGGCCAGCATCCTCGCTGGGCTGGCACTCTTTCCTTCCTCCAATCGATGTGGGACCACCACCAAATCCACCCCCTTCGGGGCCCAGCGTCCTTACTGGTACACTGCCTCGTGTCTACCCCCTTCAGGGAACAGTGAGAAAGTTGGCGCATCGTCCGGTGTCTGGCTCTGATACCATTTGTAACAGTCCAGATCTTACGGCTTAGAAAGCTGGGTAGGGTTTACGACGAGTTGTGAGCTACATTACATATGAGCTAAAAGGGTGAGCGTCATGCCTAATTGCTTATTGAACTTCGTGTGTAGGAACAAATTTTAGCCAATAACCGACTTAGGCTTTCGAAAGAGCACCAGTAGACGTGCATAGGTGGGTCTCCCTTTATATCTGGATTGTCGAACAAACATGGTTTACATCATCCTTAGGTTTCCAATTCTAGTAGTCATGTTTTCACTCATACTAGGTCTTGTCGCATTAGTAACTTTTCAGTGCTCTGGAGCTAGAAGAATGGGCTCTTGAGCTTTCTCGAGAGAATAACACGATGTTATGTCCTAAAATTGAATGACCTCTCATGGAGTCAAAGTACAAATATGGCCAAACCCTAAACCCTATAATTACATTCACGTGTACGATAGGATAATTTATGTCTAAGATTATAACCACTACATGCAAGCTGATTTTTTCCGAGATGTTCATAATTATAATTGCATCTTATTCATTTATGATCTAACCAATGATAAAACTAAGTCCCTCTCGAACCACGGGTAGAACCGGGGCTTTGGTGGGAACGATGGTAGTGCTATGTACATCGGTTTAGAAGGGTAGCGTGAAGGTTTGATGAGTGTTTGATATTTCCTCCAACGTAGAACGAAGCCTACAACGGGGGAGTAAGGCTAAGGCCCAACCTCCGTCCCGGAAAAGCCCAGAGGGAAGAGTGACGGTTCGCGTAAATCTTCAACTTCTTGAGGTTTTTGATTCTCCATTCTTGTTCCTCATGATTTCTGGAAAGCTTTATCCATCCGTCTCCACATCATGTATTTTCCCACCAAAACCGACCTCGACCTCGACCCCAGTTCATGTTCATCTTCCTCTTCTTAAAATCTCTTCCAAACTCAGATTAATTAGCTTCCAATCCGTCTCCCTCTCTTTTCCGACCTTCATTGCTTCTAAATCCAGTGTCAAGTCCACTAGATTTTCGAATTCGGTGGCAAAAGTTTATAGCTTCGAGGGCCAAAACCCCACTTCTTTGTCGGATTTGGAAGACTTGTCTGAGAATGGAGTTGTCTATAAGAAGACGCTGGCGATGGTGGAATGCTCCATGTTCGCTGCACTTAATGGTTTGGTCTACTTCTTGAGCAATTCACTTGCTCTTGAGGTTTGTATCTCTCAGTTTCTGCGAATTCTATTCTTTCCCCCTGTGCTGCTGTTTCTATGGTTTCCGTCTCCTGTTCTTCATACATAATGTTTGTCTGGTTATTTTTCTTCAATAAGACGGATACCCCAAGTCCATCCGTGTGGTGGTTGGTTCTTCTTTGATGTCAGTTTGAAATGTCTTTTTTCCATTGAAACAAGAAACTACTTTATCCACTTCAAATTTATCGCAATTGCCTAATCGTATGTGATTACTTAATACTTCTAATGTGATTTTTTTTGAAAGAAAAATTCAATATGATAGTTTTCTGAAACATGACATCATGGGGCTTGTTTAGGGTACTCTTAAATCTACCCCCAATAAATTGAAGAGAATTCCCATGATGAACAACAAGCTTGTAAATGGTCTTGGGTCTAGCAACAAACTGACTTTGACGATGAACAACAAAAGTAGTATCCGTGGGAAATTTGAAAATGTAACAGGTTCCCAATAAACTCTTGGTCTTTATATGATGATGAATGAGGATGTCTGATGTATATGCCATTTTGAAGTGATCTTTTGCTCTAGGAAACCAACTAAAGTAATCTCATATTTTGAAGTGCCCAAGAAACTTGTCTTCTAGTTTTCTTAACATGGATGCCAATTTTATGAAAGTTCTCATAATATAAACATGCTTCTTTCTCATAAATTTTCACTTGTTTAAAGAAATTATGTTTTTGATTAACTTAGATGAAGTGATCTCTTAGATTTCTCCATTCTTTTAGTAGCAGTTAAGTTTCTCGAATTTTAATTTCGTTCTGTTTTTACTTGATATTGGACAGAATTACTTCGGCTGTTTCTTCTGTCTACCGATAGTAATCTCTTCGATGAGATGGGGCATAGCAGCAGGGAGAAAAACCATGGTCTGTAGCTTTCAAGGACTCGGTTGTTCGACAATTGATTTTATGTTGTCTTGTTATTCGTACATTCTATCGTTGTAGTAATGTATATATCTAAGTTGTCGCATCTGATCTTCTTTCAAGACAGGTGGCAACATTCTTGCTGTTGCTGGTTTTGTCTGGTCCAGTGAAAGCTTTAACATTTCTGGTGAGCATGCTTTCAATTGCCTGTGGATTTATTTTCATGTTTCAGAAATAATAATATCATATAAAAGAGGTTGCCTTTTAACCTATCAACCTTGTAAATTCTATCTTATGTAGAACAAGAAGAGAAAAAAATCAGGTCAAGGGCCTTCTTATACCAGCTTTTCTTGAGCTTCTCTCCCCTATGATTAAATAAGTGGTAATAAAAACTATACAAAATACATGTTAAAGTACAAAAATAATCACCTTAATCAATTGGAACGTTCGAAAATAACAATCCTAAAGGCATTTTGTTCGATGATTTTTTGAAGTTAATATTATGGAGTATTGCAAATTATGATTTGAATCGTAGAATTGTTGATATTGTCTCTGCAAATACAAGTTTAAATAAAATCAACTATACATACCTTTTTAACATTGCTGCATTCCTAGTGGCATAGATAGGCTAAGTGTTAATAGATCATTAGACAAAATCTGATACATGATAGATTTTGTGGCTTCAGCTTGTAGTGCCTTATCTAGATTTTTTATGTAATTGGGGCTGCTACCATTTGCTCCAGAGTGTTTTTGTAACTTTTTCGGCTTAATGTTTTTCCTTTTTGCGTATTTCCATATATCAGGGAAACAGTATTTAATAGTTTACCTTCTTCACCTGAATGGCTGACGTTGTTGTATGCTTCCTTGTGCAGCTTAGGCATGGTTTAGTGGGGTTGACAATGGGCTCCTTGTGGAGGTAGGTGATTGACTACGCATATGACGATCCATAAATACATAATTAACCTCAATTTAGGCATAGGCCATTAAGGATTCATAAAGTTTGTGAAATCTCAGCATGCTAGTTGGTCGAGTTTCTTTTGTTAGTTTGGCGATGATGCACTGACCAAAGGCATTAATGGGGAGACATTTACTTCACTTCTGCGCTTTGAATTTATCCTAATAAGAACAGAAAAATAGTAAAAATGACTGCTTTCCTGTTTTGTATCTTCTCAGGCTTGGAGCAAATTGGAGTACCTCAATCTTTCTGTGCACAATCGTATTGTTTCCTCACTTTTCAAACGCCCATCTGAACAAATGATGTCTGCTAGTTTTTTTTTAATCCATGCTACTAACCAAGTACATGACTGTGCGTTTAGGTTCGGGCACTCGGGGCAGTGGGTTATGTCTTAATATCTTCATTCTTGATAAGAGAGAACATACTAGCTCTGGTAACAAGCTATTCTGAATCTCTGCTGCTATACTTCAAAGAAATCACCTCTCAAGTCTCGAAATACCTTTCGTGTTCATGCAGATCACTATAAATATTCACGCTTCGCTCACCCTTATCTTCACTGCCTCGGGTGTAAACTTGATTCCATCGATGAATGCAATATACGCTATTTTTGGGACACTGGTATGTTTTGTATTCTTCAATGAAACTGGTGCAATGTAGATTGTACTTGTATGACATGCATTGTAGAGATTTCATTAAACTTTGATTGATTTCTCCTCTCTGTGTTCTGAAAGGTAATGCTGAACTGTGGATGCTTCATGTTTTTGCTTCATCTTTTGTACTCCATATTCCTTACTAGACTTGGCCTGAAGACTTCATTGACATTGCCAAGGTGGCTGGAGAAGGCAATGTAAACATTCGACAGGTATTAAATCGTTCACGATGATAAATACGCCATTTATTTTTTCCTTTTCTGAAGGGAGGAAGAAATTGGTTAGGGTCACTGGTACGTTTACACCTGATATTTTCGCACGCATTGGCGAGCAGATACAGATTGTTTTTTGATATGTTATCATGTTCTAGGTTCAATAAATATGCTTGTCTGAATGTGGAAAATGCTTGGATTTGCAAAGAATGTATGTCATTGATTTTGTCCAATTGGATAAAATGGAAGGAAGTAACTGTTAGGTTAGGGAGATTCGAATTTCTGCTACTGCCCATCTGGACCATCTGGAGCAAGCTTGTATTAATAGAGCTTTGTGGACTGGAATATAGGATTTAGGGC

mRNA sequence

CCTCTCGAGCCACGGGTAGAACCCCCACGATGGTAGTGCGATGTAGATCGGTTTAGAAGGGTAGCGTGAAGGTTTGATGAGTGTTTGATATTTCCTCCAACGTAGAACGAAGCCTACAACGGGGGAGTAAGGCTAAGGCCCAACCCCCGTCCCGGAAAAGCCCAGAGGGAAGACTGCAGTTCGCGTAAATCTTCAACTTCTTGAGGTTTTTGATTCTCCATTCTTGTTCCTCATGATTTCTGGAAAGCTTTATCCATCCGTCTCCACATCATGTATTTTCCCACCAAAACCGACCTCGACCTCGACCCCAGTTCATGTTCATCTTCCTCTTCTTAAAATCTCTTCCAAACTCAGATTAATTAGCTTCCAATCCGTCTCCCTCTCTTTTCCGACCTTCATTGCTTCTAAATCCAGTGTCAAGTCCACTAGATTTTCGAATTCGGTGGCAAAAGTTTATAGCTTCGAGGGCCAAAACCCCACTTCTTTGTCGGATTTGGAAGACTTGTCTGAGAATGGAGTTGTCTATAAGAAGACGCTGGCGATGGTGGAATGCTCCATGTTCGCTGCACTTAATGGTTTGGTCTACTTCTTGAGCAATTCACTTGCTCTTGAGAATTACTTCGGCTGTTTCTTCTGTCTACCGATAGTAATCTCTTCGATGAGATGGGGCATAGCAGCAGGGAGAAAAACCATGGTGGCAACATTCTTGCTGTTGCTGGTTTTGTCTGGTCCAGTGAAAGCTTTAACATTTCTGCTTAGGCATGGTTTAGTGGGGTTGACAATGGGCTCCTTGTGGAGGCTTGGAGCAAATTGGAGTACCTCAATCTTTCTGTGCACAATCGTTCGGGCACTCGGGGCAGTGGGTTATGTCTTAATATCTTCATTCTTGATAAGAGAGAACATACTAGCTCTGATCACTATAAATATTCACGCTTCGCTCACCCTTATCTTCACTGCCTCGGGTGTAAACTTGATTCCATCGATGAATGCAATATACGCTATTTTTGGGACACTGGTAATGCTGAACTGTGGATGCTTCATGTTTTTGCTTCATCTTTTGTACTCCATATTCCTTACTAGACTTGGCCTGAAGACTTCATTGACATTGCCAAGGTGGCTGGAGAAGGCAATGTAAACATTCGACAGGTTCAATAAATATGCTTGTCTGAATGTGGAAAATGCTTGGATTTGCAAAGAATGTATGTCATTGATTTTGTCCAATTGGATAAAATGGAAGGAAGTAACTGTTAGGTTAGGGAGATTCGAATTTCTGCTACTGCCCATCTGGACCATCTGGAGCAAGCTTGTATTAATAGAGCTTTGTGGACTGGAATATAGGATTTAGGGC

Coding sequence (CDS)

ATGATTTCTGGAAAGCTTTATCCATCCGTCTCCACATCATGTATTTTCCCACCAAAACCGACCTCGACCTCGACCCCAGTTCATGTTCATCTTCCTCTTCTTAAAATCTCTTCCAAACTCAGATTAATTAGCTTCCAATCCGTCTCCCTCTCTTTTCCGACCTTCATTGCTTCTAAATCCAGTGTCAAGTCCACTAGATTTTCGAATTCGGTGGCAAAAGTTTATAGCTTCGAGGGCCAAAACCCCACTTCTTTGTCGGATTTGGAAGACTTGTCTGAGAATGGAGTTGTCTATAAGAAGACGCTGGCGATGGTGGAATGCTCCATGTTCGCTGCACTTAATGGTTTGGTCTACTTCTTGAGCAATTCACTTGCTCTTGAGAATTACTTCGGCTGTTTCTTCTGTCTACCGATAGTAATCTCTTCGATGAGATGGGGCATAGCAGCAGGGAGAAAAACCATGGTGGCAACATTCTTGCTGTTGCTGGTTTTGTCTGGTCCAGTGAAAGCTTTAACATTTCTGCTTAGGCATGGTTTAGTGGGGTTGACAATGGGCTCCTTGTGGAGGCTTGGAGCAAATTGGAGTACCTCAATCTTTCTGTGCACAATCGTTCGGGCACTCGGGGCAGTGGGTTATGTCTTAATATCTTCATTCTTGATAAGAGAGAACATACTAGCTCTGATCACTATAAATATTCACGCTTCGCTCACCCTTATCTTCACTGCCTCGGGTGTAAACTTGATTCCATCGATGAATGCAATATACGCTATTTTTGGGACACTGGTAATGCTGAACTGTGGATGCTTCATGTTTTTGCTTCATCTTTTGTACTCCATATTCCTTACTAGACTTGGCCTGAAGACTTCATTGACATTGCCAAGGTGGCTGGAGAAGGCAATGTAA

Protein sequence

MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIASKSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM
Homology
BLAST of Cp4.1LG13g00330 vs. NCBI nr
Match: XP_023549519.1 (uncharacterized protein LOC111807999 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 562 bits (1448), Expect = 2.67e-201
Identity = 300/300 (100.00%), Postives = 300/300 (100.00%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIASKS 60
           MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIASKS
Sbjct: 1   MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIASKS 60

Query: 61  SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120
           SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL
Sbjct: 61  SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120

Query: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180
           SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV
Sbjct: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180

Query: 181 GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF 240
           GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF
Sbjct: 181 GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF 240

Query: 241 TASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 300
           TASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM
Sbjct: 241 TASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 300

BLAST of Cp4.1LG13g00330 vs. NCBI nr
Match: KAG6579706.1 (hypothetical protein SDJN03_24154, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 550 bits (1417), Expect = 1.42e-196
Identity = 295/300 (98.33%), Postives = 295/300 (98.33%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIASKS 60
           MISG LYPSVSTSCIFPPKPTSTST  HVHLPLLKISSKLRLISFQSVSLSFPTFIASKS
Sbjct: 1   MISGNLYPSVSTSCIFPPKPTSTSTTAHVHLPLLKISSKLRLISFQSVSLSFPTFIASKS 60

Query: 61  SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120
           SVKSTR  NSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL
Sbjct: 61  SVKSTRCLNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120

Query: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180
           SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV
Sbjct: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180

Query: 181 GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF 240
           GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF
Sbjct: 181 GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF 240

Query: 241 TASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 300
           TASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM
Sbjct: 241 TASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 300

BLAST of Cp4.1LG13g00330 vs. NCBI nr
Match: XP_022929150.1 (uncharacterized protein LOC111435817 [Cucurbita moschata])

HSP 1 Score: 549 bits (1415), Expect = 3.08e-196
Identity = 296/302 (98.01%), Postives = 297/302 (98.34%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPL--LKISSKLRLISFQSVSLSFPTFIAS 60
           MISG LYPSVSTSCIFPPKPTSTST  HVHLPL  LKISSKLRLISF+SVSLSFPTFIAS
Sbjct: 1   MISGNLYPSVSTSCIFPPKPTSTSTTAHVHLPLPLLKISSKLRLISFESVSLSFPTFIAS 60

Query: 61  KSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY 120
           KSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY
Sbjct: 61  KSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY 120

Query: 121 FLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHG 180
           FLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHG
Sbjct: 121 FLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHG 180

Query: 181 LVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTL 240
           LVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTL
Sbjct: 181 LVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTL 240

Query: 241 IFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK 300
           IFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK
Sbjct: 241 IFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK 300

BLAST of Cp4.1LG13g00330 vs. NCBI nr
Match: XP_022969819.1 (uncharacterized protein LOC111468904 [Cucurbita maxima] >XP_022969820.1 uncharacterized protein LOC111468904 [Cucurbita maxima])

HSP 1 Score: 543 bits (1399), Expect = 7.28e-194
Identity = 292/300 (97.33%), Postives = 295/300 (98.33%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIASKS 60
           MISG LYPSVSTSCIFPPK   TSTP HVHLPLLKIS+KLRLISF+SVSLSFPTFIASKS
Sbjct: 1   MISGNLYPSVSTSCIFPPK--RTSTPAHVHLPLLKISAKLRLISFESVSLSFPTFIASKS 60

Query: 61  SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120
           SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL
Sbjct: 61  SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120

Query: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180
           SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV
Sbjct: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180

Query: 181 GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF 240
           GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF
Sbjct: 181 GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF 240

Query: 241 TASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 300
           TASGVNLIPSM+AIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM
Sbjct: 241 TASGVNLIPSMSAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 298

BLAST of Cp4.1LG13g00330 vs. NCBI nr
Match: KAG7017148.1 (hypothetical protein SDJN02_22260 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 512 bits (1319), Expect = 1.75e-181
Identity = 284/310 (91.61%), Postives = 287/310 (92.58%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIASKS 60
           MISG LYPSVSTSCIFPPKPTSTST  HVHLPLLKISSKLRLISF+SVSLSFPTFIASKS
Sbjct: 1   MISGNLYPSVSTSCIFPPKPTSTSTTAHVHLPLLKISSKLRLISFESVSLSFPTFIASKS 60

Query: 61  SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120
           SVKSTR  NSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL
Sbjct: 61  SVKSTRCLNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120

Query: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180
           SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV
Sbjct: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180

Query: 181 GLTMGSLW-----RLGANWSTSIFLCT-----IVRALGAVGYVLISSFLIRENILALITI 240
           GLTMGSLW     R+   WS   +L       IVRALGAVGYVLISSFLIRENILALITI
Sbjct: 181 GLTMGSLWSMLVGRVSFAWSKLEYLNLSVHNRIVRALGAVGYVLISSFLIRENILALITI 240

Query: 241 NIHASLTLIFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSL 300
           NIHASLTLIFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSL
Sbjct: 241 NIHASLTLIFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSL 300

BLAST of Cp4.1LG13g00330 vs. ExPASy TrEMBL
Match: A0A6J1ETG3 (uncharacterized protein LOC111435817 OS=Cucurbita moschata OX=3662 GN=LOC111435817 PE=4 SV=1)

HSP 1 Score: 549 bits (1415), Expect = 1.49e-196
Identity = 296/302 (98.01%), Postives = 297/302 (98.34%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPL--LKISSKLRLISFQSVSLSFPTFIAS 60
           MISG LYPSVSTSCIFPPKPTSTST  HVHLPL  LKISSKLRLISF+SVSLSFPTFIAS
Sbjct: 1   MISGNLYPSVSTSCIFPPKPTSTSTTAHVHLPLPLLKISSKLRLISFESVSLSFPTFIAS 60

Query: 61  KSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY 120
           KSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY
Sbjct: 61  KSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY 120

Query: 121 FLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHG 180
           FLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHG
Sbjct: 121 FLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHG 180

Query: 181 LVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTL 240
           LVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTL
Sbjct: 181 LVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTL 240

Query: 241 IFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK 300
           IFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK
Sbjct: 241 IFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK 300

BLAST of Cp4.1LG13g00330 vs. ExPASy TrEMBL
Match: A0A6J1I3S1 (uncharacterized protein LOC111468904 OS=Cucurbita maxima OX=3661 GN=LOC111468904 PE=4 SV=1)

HSP 1 Score: 543 bits (1399), Expect = 3.53e-194
Identity = 292/300 (97.33%), Postives = 295/300 (98.33%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIASKS 60
           MISG LYPSVSTSCIFPPK   TSTP HVHLPLLKIS+KLRLISF+SVSLSFPTFIASKS
Sbjct: 1   MISGNLYPSVSTSCIFPPK--RTSTPAHVHLPLLKISAKLRLISFESVSLSFPTFIASKS 60

Query: 61  SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120
           SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL
Sbjct: 61  SVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYFL 120

Query: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180
           SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV
Sbjct: 121 SNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGLV 180

Query: 181 GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF 240
           GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF
Sbjct: 181 GLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIF 240

Query: 241 TASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 300
           TASGVNLIPSM+AIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM
Sbjct: 241 TASGVNLIPSMSAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 298

BLAST of Cp4.1LG13g00330 vs. ExPASy TrEMBL
Match: A0A1S4DWP5 (uncharacterized protein LOC103488678 isoform X6 OS=Cucumis melo OX=3656 GN=LOC103488678 PE=4 SV=1)

HSP 1 Score: 496 bits (1278), Expect = 1.07e-175
Identity = 266/301 (88.37%), Postives = 280/301 (93.02%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPV-HVHLPLLKISSKLRLISFQSVSLSFPTFIASK 60
           MISGKLY S S+SCIFPP PT T TP  ++HL  LKISS LRLISFQSVSLS P+  ASK
Sbjct: 1   MISGKLYSSSSSSCIFPPTPTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSSFASK 60

Query: 61  SSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYF 120
           SS KSTRFSNS+ +VYS+EGQN  +LSDL+DLSENGVVYKKTLAMVECSMFAALNGLVYF
Sbjct: 61  SSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNGLVYF 120

Query: 121 LSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHGL 180
           LSNSLALENYFGCFFCLPIVISSMRWGI+AGRKTMVATFLLLLVLSGPVKALT+LLRHGL
Sbjct: 121 LSNSLALENYFGCFFCLPIVISSMRWGISAGRKTMVATFLLLLVLSGPVKALTYLLRHGL 180

Query: 181 VGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLI 240
           VG TMGSLWRLGANWSTSIFLCTIVRA GAVGYVL+SSFLIRENIL+LITINIHASLTLI
Sbjct: 181 VGFTMGSLWRLGANWSTSIFLCTIVRAFGAVGYVLVSSFLIRENILSLITINIHASLTLI 240

Query: 241 FTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKA 300
           FTA GVNLIPSMNAIYAIFGTLV LNCGCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEKA
Sbjct: 241 FTAWGVNLIPSMNAIYAIFGTLVFLNCGCFMFLLHLLYSVFLTRLGLKTSLTLPRWLEKA 300

BLAST of Cp4.1LG13g00330 vs. ExPASy TrEMBL
Match: A0A1S4DVX7 (uncharacterized protein LOC103488678 isoform X5 OS=Cucumis melo OX=3656 GN=LOC103488678 PE=4 SV=1)

HSP 1 Score: 489 bits (1258), Expect = 1.65e-172
Identity = 266/310 (85.81%), Postives = 280/310 (90.32%), Query Frame = 0

Query: 1   MISGKLYPSVSTSCIFPPKPTSTSTPV-HVHLPLLKISSKLRLISFQSVSLSFPTFIASK 60
           MISGKLY S S+SCIFPP PT T TP  ++HL  LKISS LRLISFQSVSLS P+  ASK
Sbjct: 1   MISGKLYSSSSSSCIFPPTPTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSSFASK 60

Query: 61  SSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVYF 120
           SS KSTRFSNS+ +VYS+EGQN  +LSDL+DLSENGVVYKKTLAMVECSMFAALNGLVYF
Sbjct: 61  SSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNGLVYF 120

Query: 121 LSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMV---------ATFLLLLVLSGPVKA 180
           LSNSLALENYFGCFFCLPIVISSMRWGI+AGRKTMV         ATFLLLLVLSGPVKA
Sbjct: 121 LSNSLALENYFGCFFCLPIVISSMRWGISAGRKTMVCSFPGLKQVATFLLLLVLSGPVKA 180

Query: 181 LTFLLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITI 240
           LT+LLRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GAVGYVL+SSFLIRENIL+LITI
Sbjct: 181 LTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGAVGYVLVSSFLIRENILSLITI 240

Query: 241 NIHASLTLIFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSL 300
           NIHASLTLIFTA GVNLIPSMNAIYAIFGTLV LNCGCFMFLLHLLYS+FLTRLGLKTSL
Sbjct: 241 NIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVFLNCGCFMFLLHLLYSVFLTRLGLKTSL 300

BLAST of Cp4.1LG13g00330 vs. ExPASy TrEMBL
Match: A0A0A0K8H0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G042440 PE=4 SV=1)

HSP 1 Score: 486 bits (1252), Expect = 9.77e-172
Identity = 265/302 (87.75%), Postives = 278/302 (92.05%), Query Frame = 0

Query: 1   MISGKLYPSVSTS--CIFPPKPTSTSTPVHVHLPLLKISSKLRLISFQSVSLSFPTFIAS 60
           MISGKLY S S+S  CIFPP PTST    ++HL  LKISS LRLISFQS SLSFP+   S
Sbjct: 1   MISGKLYSSYSSSSSCIFPPTPTSTPRG-NLHLSFLKISSTLRLISFQSPSLSFPSSFVS 60

Query: 61  KSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY 120
           KSS KSTRFS+S+ +VYS+EGQN  +LSDL+DLSENGVVYKKTLAMVECSMFAALNGLVY
Sbjct: 61  KSSAKSTRFSSSLVQVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNGLVY 120

Query: 121 FLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTFLLRHG 180
           FLSNSLALENYFGCFFCLPIVISSMRWGI+AGRKTMVATFLLLLVLSGPVKALT+LLRHG
Sbjct: 121 FLSNSLALENYFGCFFCLPIVISSMRWGISAGRKTMVATFLLLLVLSGPVKALTYLLRHG 180

Query: 181 LVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTL 240
           LVG TMGSLWRLGANWSTSIFLCTIVRA GAVGYVL+SSFLIRENILALITINIHASLTL
Sbjct: 181 LVGFTMGSLWRLGANWSTSIFLCTIVRAFGAVGYVLVSSFLIRENILALITINIHASLTL 240

Query: 241 IFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK 300
           IFTA GVNLIPSMNAIYAIFGTLV LNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK
Sbjct: 241 IFTAWGVNLIPSMNAIYAIFGTLVFLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEK 300

BLAST of Cp4.1LG13g00330 vs. TAIR 10
Match: AT1G26180.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2232, membrane (InterPro:IPR018710); Has 285 Blast hits to 285 proteins in 90 species: Archae - 0; Bacteria - 140; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 105 (source: NCBI BLink). )

HSP 1 Score: 294.3 bits (752), Expect = 1.1e-79
Identity = 168/262 (64.12%), Postives = 206/262 (78.63%), Query Frame = 0

Query: 47  SVSLSFPTFIASKSSVKST----RFSN-SVAKVYS--FEGQNPTSLSDLEDLSE-NGVVY 106
           SVSLS    I S  S+ +      FS+ S A +Y+   +G+   S  +     E + VVY
Sbjct: 28  SVSLSPQRHIISLVSISNRGRCFAFSSVSGASLYNNQEDGKKEESERNYASTKEGDEVVY 87

Query: 107 KKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATF 166
           +KTL +VEC+MFAA+ GLVYFLSNSLA+ENYFGCFF LPIVISS+RW IA GRKTMVAT 
Sbjct: 88  QKTLRLVECAMFAAVTGLVYFLSNSLAIENYFGCFFSLPIVISSIRWNIAGGRKTMVATV 147

Query: 167 LLLLVLSGPVKALTFLLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSF 226
           +LL +LSGPVKALT+ L HGLVGL +GSLW +GA+W  SIFLCT+VRALG +GYVL SSF
Sbjct: 148 MLLFILSGPVKALTYFLTHGLVGLALGSLWSMGASWRLSIFLCTMVRALGLIGYVLTSSF 207

Query: 227 LIRENILALITINIHASLTLIFTASGVNLIPSMNAIYAIFGTLVMLNCGCFMFLLHLLYS 286
           LIRENILA+ITINIHASL+ +FTA G+N++PSM+ IY IFGT+++LN G F+ LLHLLYS
Sbjct: 208 LIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIFGTVLLLNSGFFVLLLHLLYS 267

Query: 287 IFLTRLGLKTSLTLPRWLEKAM 301
           IFLTRLG+K+SL LP WL+KA+
Sbjct: 268 IFLTRLGMKSSLRLPAWLDKAI 289

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023549519.12.67e-201100.00uncharacterized protein LOC111807999 [Cucurbita pepo subsp. pepo][more]
KAG6579706.11.42e-19698.33hypothetical protein SDJN03_24154, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022929150.13.08e-19698.01uncharacterized protein LOC111435817 [Cucurbita moschata][more]
XP_022969819.17.28e-19497.33uncharacterized protein LOC111468904 [Cucurbita maxima] >XP_022969820.1 uncharac... [more]
KAG7017148.11.75e-18191.61hypothetical protein SDJN02_22260 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A0A6J1ETG31.49e-19698.01uncharacterized protein LOC111435817 OS=Cucurbita moschata OX=3662 GN=LOC1114358... [more]
A0A6J1I3S13.53e-19497.33uncharacterized protein LOC111468904 OS=Cucurbita maxima OX=3661 GN=LOC111468904... [more]
A0A1S4DWP51.07e-17588.37uncharacterized protein LOC103488678 isoform X6 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S4DVX71.65e-17285.81uncharacterized protein LOC103488678 isoform X5 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A0A0K8H09.77e-17287.75Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G042440 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G26180.11.1e-7964.12unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2232... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR018710Protein of unknown function DUF2232PFAMPF09991DUF2232coord: 132..287
e-value: 2.8E-7
score: 30.0
NoneNo IPR availablePANTHERPTHR37185FAMILY NOT NAMEDcoord: 30..300
NoneNo IPR availablePANTHERPTHR37185:SF3MEMBRANE PROTEINcoord: 30..300

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG13g00330.1Cp4.1LG13g00330.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane