Cp4.1LG18g00230 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG18g00230
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionUnknown protein
LocationCp4.1LG18: 410249 .. 414727 (+)
RNA-Seq ExpressionCp4.1LG18g00230
SyntenyCp4.1LG18g00230
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAAAAAGAGTGGAGAAAGAATCAAAGAGGATCACAATTTGAAATGGATATGAAAAAAGGAGAAGAGAGTGAGAAGAAAATGTATTTTACTTCAGAGAATGGTGAATCTTCAAGCTTTCCTATAATCTCCGGCGCCGACAAGCTTGATCATTTCTCCAAAACTGGTACCCTCCCTGAGGTAATTTCTTATTCATTCATTCGTTCATTCTCATCGATAGTTTCAATGTTCATATTTTTGTTCATGTATTCATTGGCGATCGTGTGATTTTCCTTTGATTTCTTGTTTAATTTCTTTAAATTTCTCTTCAAAACCAAAACCCTAAACTTAATAATCACTACTTCAACATTCTTGGGTTTGAATTTCACTGAAAAATCGAAAACCCTTGAAATGATTTATGAGAATGAACAAATTTTGAAAAATTTCAAGTAATTTATTTTTAACCTAATTGAGATGAACTAGGGCTAACTTGACGTTCATCTCAGATCGAATCAAAGAAAACCCTAAACTTAATTATCACAATTTAAAGTTCTATTAATCATTACTTCATCATTTTTGGATTTTTTTATCTTCTCTTTGAATTTCAGTGAAAAATTGAAAACCCTTGAAATGATTTATGAGAATGAACAAATTTTGAAAAATTTTGAGCATTTTGAGTCATTTTTAACCTAATTGAGACGAACTAGGGTTACCTTGGCGTTCATCTTGTTTTCCTCAGTCAGATCGAATCAAAGAAAACCCTAAACTTAATTATCACAATTTAAAGTTCTACTAATCACTACTTCATCATTTTTGGATTTATTTTTATTTTTATTTTTATTTAGAATTTGAGAGAAAAATTGAAAACCCTTGAAATGATTTATGAGAATGAACGAATTTTGAAAATTTTCAAGTAGTTTATTTTTAACCTAATTGAGATGAACTAGGGCTAACTTGGCGTTCATCTTGTTTTCCTCGTCAGATCGAATCAAAGAAAACCCTAAATTTAATTATCACAATCTAAAGTTCTATTAATCTCTACTTCATCATTTTTGGATTTTTTTTATCTTCTCTCTTTGAATTTGAGTGAAAAATTGAAAACCCTTGAAATGATTTATGAGAATCAACAAATTTTGAAAAATTTAGAGTCATTTTGGGTCCTTTGTAACCTAATTGAGACGAACTAGGGCTAACTTGGCGTTCATCTTGTTTTCTTCGTCCGATCAAATCAAAGAAAACCCTAAACTTAATTATCACAATTTAAAGTTCTGCTAATCACTACTTCATCATTCTTGGATTTTTTTTTNTCTCTTTGAATTTCAGTGAAATTTCGAAAACCCTTGAAATGATTTATGAGAATGAGCAAACTTTGAAAGATTTGAGTCATTTTTAACCTAATTGAGATGAACTAGGGCTAACTTGGCATTCATCTTATTTTCTTCGTTCGATCGAATCAAAGAAAACCCTAAACTTAATTATCACAATTTAAAGTTCTGCTAATCACTACTTCATCATTCTTGAATTTTTTTTTTTTTTTTTTTTTTTTTTTAATCTTCTCTCTTTGAATTTCAGTGAAATTTCGAAAACCCTTGAAATGATTTATGAGAATGAGCAAACTTTGAAAGATTTGAGTCATTTTTAACCTAATTGAGATGAACTAGGGCTAACTTGGCATTCATCTTATTTTCTTCGTTCGATCGAATCAAAGAAAACCCTAAACTTAATTATCACAATTTAAAGTTCTGCTAATCACTACTTCATCATTCTTGAATTTTTTTTTTTTTTTTAATCTTCTCTCTTTGAATTTCAGTGAAATTTCGAAAACCCTTGAAATGATTTATGAGAATGAGCAAATTTTGAAAGATTTGAGTCATTTTTAACCTAATTGAGATGAACTAGGGCTAACTTGGCATTCATCTTGTTTTCTTTGTCGGATCGAATCAAAGAAAACCCTAAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAGGGTGAAGAAACAATTGAAGTAGTTCTAAAACTTCCAGCACTTCGGCTATCTTCTCTACTTGCAGCAATGGAATTAGAAAAGGATACAAAAATTTCATTGCCACCACCATTTACAAATCTCCCAAGTATCTTAACGGGCCATTCATTAATCCCTCTCCCACCCACAAGTGAAGAACAACAATACAATAATCCATCACATGAAAGAACAATCACCTCTGGACCAAGTAGTTCTTCTTCTTCTGAGCTTCCCCAGCTCGTGTTTAACGGAGTTCTGGCTCTTTTCCTTCGAGTTGGATCATGGCAGGTATGATACAAGATTTTAATTTGAATATGAATCCATGGGTTTGGTACTCATTTTTCATTTTTGTTCTAGGTGGTGCCCAAGAATGATGGTGATTTGGTGCTGAAATTTGATTATAGAAACAAGAAGGTAACTTGGGAGATTGTGAGGGAGGGGCCTTCCAAGCACAAGATTGAAATTGATTGGTCTAATATCATAGGGATTGAAGCTGCCATTGAAGATCATAGACAAGGAATCCTCCAACTTGAGGTTAGTTTAATTTTCTTCTTTTGATTCATAAAAATTTGTATCCATTGGGAGTTTGATGTTTGGATAACCCATTTCAAAATGGGACTTTTTTGCTTACATATGAATCTATATTGAGATATCAATTTCTTTTAGGGTTTCTTTTTCAATGTTGTTTTGTATGTAAGAACATTGTGTAAATTGATTTATTCATAACACATTTGGGTTTTTAGGAACCCTTTTTAATGAATGATTTGATTTGTAGAACAAATTATGGGTTTGTTAGTATTGAATTCATTCATAAATATTTTGTATTAGTTGCTTTTTAGATTATCAAGTTCTTGTTTGGAACCCTTTTTGAATACTAATTTATATATGAAAGTATGTTGAGTTATCACTTTCTTTTAGGGTTTTTTTTTTTTTTTTTCCATGAACAAATGAGTTGTTCTTCTTTTGCATGTAAGAACTTTGTATCCATTGATTGATTCATAACCCATTTAGTTTTTTGTTATTAAGATTAAGATTTAGATGTTTGATGAGTTGTTCTTCTCAAAGTAGTAATTTCGATGAAAATGTGTTTAGATAACACTTTCTTTAGGGTTTTCTTTTCCTTCTTTTAATGAATGTTTCAATTTGCGAATAGTAATACTAATAGTATTGAACTCATTTATATATCGGTTCCAAGGAAGTCGTACAATTGAGAAATCGTTAATTTGTCCAAGATTAGAATGTTCAAAGTTGAGATTTGTTGTTGTTGAGTGGTAGCAAATGAGAAATCTTGTAGATAGTTTGATTTTTCGGCTCTTATACATGCTGTTACAATGGTAGGGGAAGTTTTCTTGTCGCTATACAGTACTATTTCTTTAGCTCTTGCTCAAAAAGAGATTAAAAGAAGTTTAGCCAGCCTATTCTACAATGTCTCAATTGGTTATATGATGTTAGCTGCTTTGGGTATAGGGTCTTATCGAGCTGCTCTTTATTTCATTGGATTACTCATGAGAAAACTCTAAGCTAAGTTTATTGTCTTGAACTCATTCTTATGAACGATATAGTATTGAATTCATTTATACATTAGTTGTCTTGAATCCTCTTTAAAATAATAATATCTTACTAGAAGTCGCTCAATCGAGGAATCGTTAATTTGTCCAAGATTAGAACAAATTTATACAAATGTGTAGTATTGAGACTTGCTATTGTTGAATGGCAGCAAATGTGAAATCTCATGGATTGTTTGATTTTTAGAACGAATGGTTTGATTCTTCTTCTTTACCGATTTAAGCGTTTGGAACTAATTTGTCTGAGCCATTTTACGTAGCAATTTTGAAACCCTAAATTTAGTATCATATATATATATATATGTGTTTTTAGTTCTTAAATCTTTGTTATAATCCTTTTGAAGCTGCAAAAACCACCAAGATTTTACAAGGAGATTGAGTCCAAACCACATAAGCAGCCGAAATGGGCAGATGAATCGGATTTTACAGATGGGAGAGCTTCTCTAAACAGGTAAATCATCTCAAAAAGGAATGTTGTTATGTACTTAACATTATAGCTTTAAATGTCATATGTCACCATTTAATTCATTATTACTCAATTTCGATTGAAATTCGATTCATTTACAAACATAAAAGAAAATTATTCGAGTTCGATATTCAAACTTCGGATCGAGAGTTATATTCAGGTTGATGATATGCTTCGTGTTTGTATATACTAATATCGTGAATTGGGACGCTATCGGTTTTTGTAGGAGATACTTTGCTGTGTTTTCACCGGGAGTGCTCGGTACACATTATAAGCGACTAATGAAAAACAAGCATTTGTTAGAAGTAAGCCAAAAGTCATTTCCAACGACTCAATCCCCTTATTTTCCCCAGCTTTAAACTCTTACTTTAAGGATGTAAAGTTGATAGTTTATTGAAACTCATCATGTGAACTTGTTTTTTTTTTTTTTTTTTACTTCACTATGGTGTTGAGTAGA

mRNA sequence

TAAAAAGAGTGGAGAAAGAATCAAAGAGGATCACAATTTGAAATGGATATGAAAAAAGGAGAAGAGAGTGAGAAGAAAATGTATTTTACTTCAGAGAATGGTGAATCTTCAAGCTTTCCTATAATCTCCGGCGCCGACAAGCTTGATCATTTCTCCAAAACTGGTACCCTCCCTGAGGGTGAAGAAACAATTGAAGTAGTTCTAAAACTTCCAGCACTTCGGCTATCTTCTCTACTTGCAGCAATGGAATTAGAAAAGGATACAAAAATTTCATTGCCACCACCATTTACAAATCTCCCAAGTATCTTAACGGGCCATTCATTAATCCCTCTCCCACCCACAAGTGAAGAACAACAATACAATAATCCATCACATGAAAGAACAATCACCTCTGGACCAAGTAGTTCTTCTTCTTCTGAGCTTCCCCAGCTCGTGTTTAACGGAGTTCTGGCTCTTTTCCTTCGAGTTGGATCATGGCAGGTGGTGCCCAAGAATGATGGTGATTTGGTGCTGAAATTTGATTATAGAAACAAGAAGGTAACTTGGGAGATTGTGAGGGAGGGGCCTTCCAAGCACAAGATTGAAATTGATTGGTCTAATATCATAGGGATTGAAGCTGCCATTGAAGATCATAGACAAGGAATCCTCCAACTTGAGCTGCAAAAACCACCAAGATTTTACAAGGAGATTGAGTCCAAACCACATAAGCAGCCGAAATGGGCAGATGAATCGGATTTTACAGATGGGAGAGCTTCTCTAAACAGGAGATACTTTGCTGTGTTTTCACCGGGAGTGCTCGGTACACATTATAAGCGACTAATGAAAAACAAGCATTTGTTAGAAGTAAGCCAAAAGTCATTTCCAACGACTCAATCCCCTTATTTTCCCCAGCTTTAAACTCTTACTTTAAGGATGTAAAGTTGATAGTTTATTGAAACTCATCATGTGAACTTGTTTTTTTTTTTTTTTTTTACTTCACTATGGTGTTGAGTAGA

Coding sequence (CDS)

ATGGATATGAAAAAAGGAGAAGAGAGTGAGAAGAAAATGTATTTTACTTCAGAGAATGGTGAATCTTCAAGCTTTCCTATAATCTCCGGCGCCGACAAGCTTGATCATTTCTCCAAAACTGGTACCCTCCCTGAGGGTGAAGAAACAATTGAAGTAGTTCTAAAACTTCCAGCACTTCGGCTATCTTCTCTACTTGCAGCAATGGAATTAGAAAAGGATACAAAAATTTCATTGCCACCACCATTTACAAATCTCCCAAGTATCTTAACGGGCCATTCATTAATCCCTCTCCCACCCACAAGTGAAGAACAACAATACAATAATCCATCACATGAAAGAACAATCACCTCTGGACCAAGTAGTTCTTCTTCTTCTGAGCTTCCCCAGCTCGTGTTTAACGGAGTTCTGGCTCTTTTCCTTCGAGTTGGATCATGGCAGGTGGTGCCCAAGAATGATGGTGATTTGGTGCTGAAATTTGATTATAGAAACAAGAAGGTAACTTGGGAGATTGTGAGGGAGGGGCCTTCCAAGCACAAGATTGAAATTGATTGGTCTAATATCATAGGGATTGAAGCTGCCATTGAAGATCATAGACAAGGAATCCTCCAACTTGAGCTGCAAAAACCACCAAGATTTTACAAGGAGATTGAGTCCAAACCACATAAGCAGCCGAAATGGGCAGATGAATCGGATTTTACAGATGGGAGAGCTTCTCTAAACAGGAGATACTTTGCTGTGTTTTCACCGGGAGTGCTCGGTACACATTATAAGCGACTAATGAAAAACAAGCATTTGTTAGAAGTAAGCCAAAAGTCATTTCCAACGACTCAATCCCCTTATTTTCCCCAGCTTTAA

Protein sequence

MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALRLSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPSSSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLNRRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL
Homology
BLAST of Cp4.1LG18g00230 vs. NCBI nr
Match: XP_022961562.1 (uncharacterized protein LOC111462107 [Cucurbita moschata] >XP_023516882.1 uncharacterized protein LOC111780650 [Cucurbita pepo subsp. pepo] >KAG6590487.1 hypothetical protein SDJN03_15910, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 566 bits (1459), Expect = 1.63e-203
Identity = 284/284 (100.00%), Postives = 284/284 (100.00%), Query Frame = 0

Query: 1   MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALR 60
           MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALR
Sbjct: 1   MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALR 60

Query: 61  LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS 120
           LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS
Sbjct: 61  LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS 120

Query: 121 SSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI 180
           SSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI
Sbjct: 121 SSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI 180

Query: 181 EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN 240
           EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN
Sbjct: 181 EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN 240

Query: 241 RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284
           RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL
Sbjct: 241 RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284

BLAST of Cp4.1LG18g00230 vs. NCBI nr
Match: XP_022968779.1 (uncharacterized protein LOC111467912 [Cucurbita maxima])

HSP 1 Score: 564 bits (1453), Expect = 1.34e-202
Identity = 282/284 (99.30%), Postives = 283/284 (99.65%), Query Frame = 0

Query: 1   MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALR 60
           MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTG LPEGEETIEVVLKLPALR
Sbjct: 1   MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGALPEGEETIEVVLKLPALR 60

Query: 61  LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS 120
           LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS
Sbjct: 61  LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS 120

Query: 121 SSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI 180
           SSSSSELPQLVFNG+LALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI
Sbjct: 121 SSSSSELPQLVFNGILALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI 180

Query: 181 EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN 240
           EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN
Sbjct: 181 EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN 240

Query: 241 RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284
           RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL
Sbjct: 241 RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284

BLAST of Cp4.1LG18g00230 vs. NCBI nr
Match: KAG7024019.1 (hypothetical protein SDJN02_15048, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 545 bits (1405), Expect = 1.78e-195
Identity = 272/272 (100.00%), Postives = 272/272 (100.00%), Query Frame = 0

Query: 13  MYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALRLSSLLAAMELEK 72
           MYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALRLSSLLAAMELEK
Sbjct: 1   MYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALRLSSLLAAMELEK 60

Query: 73  DTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPSSSSSSELPQLVF 132
           DTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPSSSSSSELPQLVF
Sbjct: 61  DTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPSSSSSSELPQLVF 120

Query: 133 NGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKIEIDWSNIIGIEA 192
           NGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKIEIDWSNIIGIEA
Sbjct: 121 NGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKIEIDWSNIIGIEA 180

Query: 193 AIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLNRRYFAVFSPGVL 252
           AIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLNRRYFAVFSPGVL
Sbjct: 181 AIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLNRRYFAVFSPGVL 240

Query: 253 GTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284
           GTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL
Sbjct: 241 GTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 272

BLAST of Cp4.1LG18g00230 vs. NCBI nr
Match: XP_022158816.1 (uncharacterized protein LOC111025282 [Momordica charantia])

HSP 1 Score: 450 bits (1157), Expect = 1.83e-157
Identity = 230/286 (80.42%), Postives = 253/286 (88.46%), Query Frame = 0

Query: 1   MDMKK--GEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPA 60
           MD KK   E  EKK+Y TSENGESSSF IISGA++LDHFS+TGTLP+G ETIEVVLKLPA
Sbjct: 1   MDKKKEESESDEKKLYITSENGESSSFSIISGAEQLDHFSQTGTLPQGRETIEVVLKLPA 60

Query: 61  LRLSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSG 120
           LRLSSLLAAMELE+DTKIS+P PF+NLP ILTG SLIPLPP + EQ  N+ S+ERTITSG
Sbjct: 61  LRLSSLLAAMELEEDTKISIPAPFSNLPKILTGRSLIPLPPNNGEQDQNSSSNERTITSG 120

Query: 121 PSSSSS-SELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSK 180
           PSSSSS SELP  + N   ALFLR+GSWQVVP+N+GDLVL+FDYR KK++WEIVREGPSK
Sbjct: 121 PSSSSSPSELPH-ISNAAPALFLRIGSWQVVPQNEGDLVLRFDYRTKKISWEIVREGPSK 180

Query: 181 HKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRA 240
           HKIEIDWSNIIGIEAA EDHRQGILQLELQKPPRFYKEIE+K  KQ KW DESDFTDGRA
Sbjct: 181 HKIEIDWSNIIGIEAATEDHRQGILQLELQKPPRFYKEIEAKLQKQSKWIDESDFTDGRA 240

Query: 241 SLNRRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQ 283
           SLNRRYF+VFSPGVLG HYKR+MKNKH+LEVSQKSFP T SPYFP+
Sbjct: 241 SLNRRYFSVFSPGVLGAHYKRMMKNKHVLEVSQKSFPATYSPYFPK 285

BLAST of Cp4.1LG18g00230 vs. NCBI nr
Match: XP_016900954.1 (PREDICTED: uncharacterized protein LOC107991120 [Cucumis melo])

HSP 1 Score: 311 bits (796), Expect = 1.61e-103
Identity = 151/220 (68.64%), Postives = 179/220 (81.36%), Query Frame = 0

Query: 68  MELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQY----NNPSHERTITSGPSSSS 127
           M+L KDT+I++ PPFTNL  I  G +L+PLP  +EE+Q     N  S++RTI SGPSSSS
Sbjct: 1   MDLRKDTEIAMKPPFTNLQKIFNGRTLVPLPQINEEEQQHEYTNTQSNQRTIFSGPSSSS 60

Query: 128 SSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKIEID 187
            +E P   FN   ALFLR+GSWQVV  N+ DLVLKFDYRNKK++WE+VREGPSKHKIEID
Sbjct: 61  FAEPPNTPFNAAPALFLRIGSWQVVANNESDLVLKFDYRNKKISWEVVREGPSKHKIEID 120

Query: 188 WSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLNRRY 247
           WSNIIGI+AAIEDHRQGILQLELQ PPRFYKEIE++P K  KW +E DFT GR S++R++
Sbjct: 121 WSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRVSMHRKH 180

Query: 248 FAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQ 283
           F+VF+PG+LGT+YKRLMKNK LLEVSQK FPT  SPYF Q
Sbjct: 181 FSVFAPGILGTYYKRLMKNKDLLEVSQKPFPTADSPYFHQ 220

BLAST of Cp4.1LG18g00230 vs. ExPASy TrEMBL
Match: A0A6J1HAI6 (uncharacterized protein LOC111462107 OS=Cucurbita moschata OX=3662 GN=LOC111462107 PE=4 SV=1)

HSP 1 Score: 566 bits (1459), Expect = 7.88e-204
Identity = 284/284 (100.00%), Postives = 284/284 (100.00%), Query Frame = 0

Query: 1   MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALR 60
           MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALR
Sbjct: 1   MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALR 60

Query: 61  LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS 120
           LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS
Sbjct: 61  LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS 120

Query: 121 SSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI 180
           SSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI
Sbjct: 121 SSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI 180

Query: 181 EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN 240
           EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN
Sbjct: 181 EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN 240

Query: 241 RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284
           RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL
Sbjct: 241 RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284

BLAST of Cp4.1LG18g00230 vs. ExPASy TrEMBL
Match: A0A6J1I0N3 (uncharacterized protein LOC111467912 OS=Cucurbita maxima OX=3661 GN=LOC111467912 PE=4 SV=1)

HSP 1 Score: 564 bits (1453), Expect = 6.48e-203
Identity = 282/284 (99.30%), Postives = 283/284 (99.65%), Query Frame = 0

Query: 1   MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPALR 60
           MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTG LPEGEETIEVVLKLPALR
Sbjct: 1   MDMKKGEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGALPEGEETIEVVLKLPALR 60

Query: 61  LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS 120
           LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS
Sbjct: 61  LSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSGPS 120

Query: 121 SSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI 180
           SSSSSELPQLVFNG+LALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI
Sbjct: 121 SSSSSELPQLVFNGILALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKI 180

Query: 181 EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN 240
           EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN
Sbjct: 181 EIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLN 240

Query: 241 RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284
           RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL
Sbjct: 241 RRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQL 284

BLAST of Cp4.1LG18g00230 vs. ExPASy TrEMBL
Match: A0A6J1E232 (uncharacterized protein LOC111025282 OS=Momordica charantia OX=3673 GN=LOC111025282 PE=4 SV=1)

HSP 1 Score: 450 bits (1157), Expect = 8.85e-158
Identity = 230/286 (80.42%), Postives = 253/286 (88.46%), Query Frame = 0

Query: 1   MDMKK--GEESEKKMYFTSENGESSSFPIISGADKLDHFSKTGTLPEGEETIEVVLKLPA 60
           MD KK   E  EKK+Y TSENGESSSF IISGA++LDHFS+TGTLP+G ETIEVVLKLPA
Sbjct: 1   MDKKKEESESDEKKLYITSENGESSSFSIISGAEQLDHFSQTGTLPQGRETIEVVLKLPA 60

Query: 61  LRLSSLLAAMELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQYNNPSHERTITSG 120
           LRLSSLLAAMELE+DTKIS+P PF+NLP ILTG SLIPLPP + EQ  N+ S+ERTITSG
Sbjct: 61  LRLSSLLAAMELEEDTKISIPAPFSNLPKILTGRSLIPLPPNNGEQDQNSSSNERTITSG 120

Query: 121 PSSSSS-SELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSK 180
           PSSSSS SELP  + N   ALFLR+GSWQVVP+N+GDLVL+FDYR KK++WEIVREGPSK
Sbjct: 121 PSSSSSPSELPH-ISNAAPALFLRIGSWQVVPQNEGDLVLRFDYRTKKISWEIVREGPSK 180

Query: 181 HKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRA 240
           HKIEIDWSNIIGIEAA EDHRQGILQLELQKPPRFYKEIE+K  KQ KW DESDFTDGRA
Sbjct: 181 HKIEIDWSNIIGIEAATEDHRQGILQLELQKPPRFYKEIEAKLQKQSKWIDESDFTDGRA 240

Query: 241 SLNRRYFAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQ 283
           SLNRRYF+VFSPGVLG HYKR+MKNKH+LEVSQKSFP T SPYFP+
Sbjct: 241 SLNRRYFSVFSPGVLGAHYKRMMKNKHVLEVSQKSFPATYSPYFPK 285

BLAST of Cp4.1LG18g00230 vs. ExPASy TrEMBL
Match: A0A1S4DY92 (uncharacterized protein LOC107991120 OS=Cucumis melo OX=3656 GN=LOC107991120 PE=4 SV=1)

HSP 1 Score: 311 bits (796), Expect = 7.79e-104
Identity = 151/220 (68.64%), Postives = 179/220 (81.36%), Query Frame = 0

Query: 68  MELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQY----NNPSHERTITSGPSSSS 127
           M+L KDT+I++ PPFTNL  I  G +L+PLP  +EE+Q     N  S++RTI SGPSSSS
Sbjct: 1   MDLRKDTEIAMKPPFTNLQKIFNGRTLVPLPQINEEEQQHEYTNTQSNQRTIFSGPSSSS 60

Query: 128 SSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKIEID 187
            +E P   FN   ALFLR+GSWQVV  N+ DLVLKFDYRNKK++WE+VREGPSKHKIEID
Sbjct: 61  FAEPPNTPFNAAPALFLRIGSWQVVANNESDLVLKFDYRNKKISWEVVREGPSKHKIEID 120

Query: 188 WSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLNRRY 247
           WSNIIGI+AAIEDHRQGILQLELQ PPRFYKEIE++P K  KW +E DFT GR S++R++
Sbjct: 121 WSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRVSMHRKH 180

Query: 248 FAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQ 283
           F+VF+PG+LGT+YKRLMKNK LLEVSQK FPT  SPYF Q
Sbjct: 181 FSVFAPGILGTYYKRLMKNKDLLEVSQKPFPTADSPYFHQ 220

BLAST of Cp4.1LG18g00230 vs. ExPASy TrEMBL
Match: A0A0A0M1F9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G561960 PE=4 SV=1)

HSP 1 Score: 304 bits (778), Expect = 4.23e-101
Identity = 147/220 (66.82%), Postives = 178/220 (80.91%), Query Frame = 0

Query: 68  MELEKDTKISLPPPFTNLPSILTGHSLIPLPPTSEEQQ----YNNPSHERTITSGPSSSS 127
           M+L KDT+I + PPFTNL  I  G +L+PLP  +EE+Q    +N  S++RT  SGPSSSS
Sbjct: 1   MDLRKDTEIPMKPPFTNLQRIFNGRTLVPLPQINEEEQQHEYHNTQSNQRTTFSGPSSSS 60

Query: 128 SSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKIEID 187
            +E P   FN   ALFLR+GSWQVV  ++ DLVLKFDYRNKK++WE+V EGPSKHKIEI+
Sbjct: 61  FAEPPNTPFNAAPALFLRIGSWQVVANSESDLVLKFDYRNKKLSWEVVLEGPSKHKIEIE 120

Query: 188 WSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLNRRY 247
           WSNIIGI+AAIEDHRQGILQLELQ PPRFYKEIE++P K  KW +E DFT GRAS+NR++
Sbjct: 121 WSNIIGIQAAIEDHRQGILQLELQNPPRFYKEIETRPLKLFKWEEEYDFTQGRASMNRKH 180

Query: 248 FAVFSPGVLGTHYKRLMKNKHLLEVSQKSFPTTQSPYFPQ 283
           F+VF+PG+LGT+YKRLMKNK ++EVSQK FPT  SPYF Q
Sbjct: 181 FSVFAPGILGTYYKRLMKNKEMVEVSQKPFPTANSPYFHQ 220

BLAST of Cp4.1LG18g00230 vs. TAIR 10
Match: AT4G30780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24100.1); Has 109 Blast hits to 109 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 0; Plants - 95; Viruses - 0; Other Eukaryotes - 13 (source: NCBI BLink). )

HSP 1 Score: 110.9 bits (276), Expect = 1.7e-24
Identity = 57/165 (34.55%), Postives = 96/165 (58.18%), Query Frame = 0

Query: 118 GPSSSSSSELPQLVFNGVLALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSK 177
           GP+ +  S + +L  +   A  L++G W+   + +GDLV K  +   K+ WE++ +G  K
Sbjct: 125 GPTLAPGS-IEKLKASNFPASLLKIGQWEYKSRYEGDLVAKCYFAKHKLVWEVLEQG-LK 184

Query: 178 HKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRA 237
            KIEI WS+I+ ++A   +   G L L L + P F++E   +P K   W   SDFTDG+A
Sbjct: 185 SKIEIQWSDIMALKANCPEDGPGTLTLVLARQPLFFRETNPQPRKHTLWQATSDFTDGQA 244

Query: 238 SLNRRYFAVFSPGVLGTHYKRLMKNKH-LLEVSQKSFPTTQSPYF 282
           S+NR++F   + G++  H+++L++  H L  +S++      SPYF
Sbjct: 245 SMNRQHFLQCAQGIMNKHFEKLVQCDHRLFHLSRQPEIAIDSPYF 287

BLAST of Cp4.1LG18g00230 vs. TAIR 10
Match: AT2G24100.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G30780.1); Has 101 Blast hits to 101 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 95; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 110.5 bits (275), Expect = 2.2e-24
Identity = 52/146 (35.62%), Postives = 88/146 (60.27%), Query Frame = 0

Query: 137 ALFLRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPSKHKIEIDWSNIIGIEAAIED 196
           A  LR+G W+   + +GDLV K  +   K+ WE++ +G  K KIEI WS+I+ ++A + +
Sbjct: 116 ATILRIGQWEYKSRYEGDLVAKCYFAKHKLVWEVLEQG-LKSKIEIQWSDIMALKANLPE 175

Query: 197 HRQGILQLELQKPPRFYKEIESKPHKQPKWADESDFTDGRASLNRRYFAVFSPGVLGTHY 256
              G L + L + P F++E   +P K   W   SDFTDG+AS+NR++F    PG++  H+
Sbjct: 176 DEPGTLTIVLARRPLFFRETNPQPRKHTLWQATSDFTDGQASMNRQHFLQCPPGIMNKHF 235

Query: 257 KRLMKNKH-LLEVSQKSFPTTQSPYF 282
           ++L++  H L  +S++      +P+F
Sbjct: 236 EKLVQCDHRLFCLSRQPEINLAAPFF 260

BLAST of Cp4.1LG18g00230 vs. TAIR 10
Match: AT1G54300.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G05770.1); Has 107 Blast hits to 107 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 13 (source: NCBI BLink). )

HSP 1 Score: 95.5 bits (236), Expect = 7.3e-20
Identity = 53/150 (35.33%), Postives = 81/150 (54.00%), Query Frame = 0

Query: 140 LRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGPS------KHKIEIDWSNIIGIEAA 199
           +R+G W VV KN  D+V KF +  KK+ WE +   P       K KIEI W+++   E +
Sbjct: 8   IRIGGWVVVAKNPDDIVAKFYFAKKKLIWEFLFGEPETNTLRLKRKIEIQWNDVSSFEES 67

Query: 200 IEDHRQ-GILQLELQKPPRFYKEIESKPHKQPKWAD-ESDFTDGRASLNRRYFAVFSPGV 259
           I    + GIL++EL+K P F+ E   +  K  +W   + DFT   AS  RR+   F PGV
Sbjct: 68  ISSRDETGILKIELKKRPTFFIETNPQAGKHTQWKQLDHDFTGDHASNYRRHTLHFPPGV 127

Query: 260 LGTHYKRLMKNKHLLEVSQKSFPTTQSPYF 282
           L  + ++L+ +    ++ +  FP  +S YF
Sbjct: 128 LQKNLEKLVTDSFWSKLYEVPFPVHESRYF 157

BLAST of Cp4.1LG18g00230 vs. TAIR 10
Match: AT3G05770.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54300.1); Has 105 Blast hits to 105 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 99; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 91.3 bits (225), Expect = 1.4e-18
Identity = 62/210 (29.52%), Postives = 98/210 (46.67%), Query Frame = 0

Query: 81  PFTNLPSILTG-HSLIPLPPTSEEQQYNNPSHERTITSGPSSSSSSELPQLVFNGVLALF 140
           P T  P ++    S + +  T   QQ  N S   T+   P    +   P           
Sbjct: 29  PLTKTPELINKIESYLKVHYTCPHQQTENSSKTSTLPKSPEKLKAMNFP--------IST 88

Query: 141 LRVGSWQVVPKNDGDLVLKFDYRNKKVTWEIVREGP------SKHKIEIDWSNIIGIEAA 200
           +++G    V KN  D+V KF +  KK+ WE +   P       K KIEI W+++   E +
Sbjct: 89  IKIGDCVFVAKNPDDIVAKFYFAKKKLLWEFLFGEPVANMPRLKSKIEIQWNDVSSFEES 148

Query: 201 IEDHRQ-GILQLELQKPPRFYKEIESKPHKQPKWAD-ESDFTDGRASLNRRYFAVFSPGV 260
           I    + GIL++EL+K P F+ E   +  K  +W   + DFT  +AS  RR+   F PGV
Sbjct: 149 INSRDETGILKIELKKRPTFFTETNPQAGKHTQWKQLDYDFTGDQASYYRRHTLHFPPGV 208

Query: 261 LGTHYKRLMKNKHLLEVSQKSFPTTQSPYF 282
           L  + ++L+ +    ++ +  FP  +S YF
Sbjct: 209 LQKNLEKLLTDSFWSKLYKVPFPVHESLYF 230

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022961562.11.63e-203100.00uncharacterized protein LOC111462107 [Cucurbita moschata] >XP_023516882.1 unchar... [more]
XP_022968779.11.34e-20299.30uncharacterized protein LOC111467912 [Cucurbita maxima][more]
KAG7024019.11.78e-195100.00hypothetical protein SDJN02_15048, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022158816.11.83e-15780.42uncharacterized protein LOC111025282 [Momordica charantia][more]
XP_016900954.11.61e-10368.64PREDICTED: uncharacterized protein LOC107991120 [Cucumis melo][more]
Match NameE-valueIdentityDescription
A0A6J1HAI67.88e-204100.00uncharacterized protein LOC111462107 OS=Cucurbita moschata OX=3662 GN=LOC1114621... [more]
A0A6J1I0N36.48e-20399.30uncharacterized protein LOC111467912 OS=Cucurbita maxima OX=3661 GN=LOC111467912... [more]
A0A6J1E2328.85e-15880.42uncharacterized protein LOC111025282 OS=Momordica charantia OX=3673 GN=LOC111025... [more]
A0A1S4DY927.79e-10468.64uncharacterized protein LOC107991120 OS=Cucumis melo OX=3656 GN=LOC107991120 PE=... [more]
A0A0A0M1F94.23e-10166.82Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G561960 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G30780.11.7e-2434.55unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G24100.12.2e-2435.62unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G54300.17.3e-2035.33unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G05770.11.4e-1829.52unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 97..125
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..26
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..15
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 93..125
NoneNo IPR availablePANTHERPTHR33494OS02G0793800 PROTEINcoord: 108..283
NoneNo IPR availablePANTHERPTHR33494:SF5F10A16.6 PROTEINcoord: 108..283

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG18g00230.1Cp4.1LG18g00230.1mRNA