CmoCh03G002850 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh03G002850
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionActin cytoskeleton-regulatory complex protein pan1
LocationCmo_Chr03: 4086487 .. 4089577 (+)
RNA-Seq ExpressionCmoCh03G002850
SyntenyCmoCh03G002850
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTATAGGAGGAGACAAGCAATCACGAGGGCTTCCACTTTCAAAGAAGAGATCCATCATCCTGCCGACGATGTTGATCATAATTCCATCTCTTCTTCCTCTTCTTCTCTTGCTGCTCAGGCGATCCGAGCTTCTGCTGCTCATCGAGATTCTTCGAATTCCTCTGCCTTTTCCGGTTCCTCCTCCGTTTTCTCCCCTGGACATATGCGATCCAAGGTTCTCTCTCTCTCTAACCTCTACAATGTTGAGTTGATCGCTTTGAGTGTTTGTTTGATTGACTTGGATAATGATAAGAAATAAGAAATTCATCGAAATCTCAGATTTGATTGTGTCTATTGTTTCGTCTCCATCGTTGAATTTAGATATTTCTTATCAATACGATTTGTTTAGCATCGAATTCTTCCGAGGAAGTGATTCATTAATTGCCGTGACCTCCTCTGTTTTTGGCAAATGTTGCGATTTCTGATCTAATCGGAATTTAGGACGGCCTTTTCTTTTTTATTTGTTTAATTGCCTAACCTATCGGAAGAATGATGGCGTCTGTTGATGATTCATTGCCGTGTTCGAATTCTTCAGCAAGCTCTATTGTTTTTGTTGGTCATTCAAATATGCCTTTATAGCGGCCATGTAATCTGTATCTGATGCTATGCCTCTATTTCCAGAGTTTTGCTGGCGAGAATTTGACATTCAAAAGTGAGTCGAAATCCGGGTTTTGGGGCGTACTGGCTCGGAAAGCTAAGGCAATTCTCGAAGAAGATGACATAGCCATAGAAGAGGAAACTAGTGGTTTCCAGCCCATTAATACCTCAACCCTTAGCCAGGTGAGTATCTCTTAGTGTGATTTAGCCATTATCGACAATGCCTTGTCCACTGAGCTATCTGGCGTCAATACATTACATAAATATTGACACAATTTGTTGTGATAATGATTTATCTTAGGTATATGCAATCTTTCTAATTTGAAACTCCCTTGTTTATGTATAAGTTACATTATCAAAAGGTAAGGAATTAGAGTAACTCCTGGAAATGTGATGTTGAATTTTCAACAACACAACAGCAAAATCTCATATCAGCACAGAGATTTTAACTTATATGCAATTCGTTTATCCCTACTGCTCTGTTTTCTGATTTTCATATGAAATGGTATCAGGAGCCATGTCAGTCTACTGATCACGACTCGAAGAAATCGGACAATCCAGCAAAACCGAAGGGTTTGGATGCAATCACAACTTCGTTTAATCAACTTGGGGACACAAATCAAAAGGCATCTAAGGTAAGAACAGAACTTGGTTTTATTGTATTTTCATATCTTGTTTTTTTCTTGTCTCCAAGTTAGCTTACTAATTAGCTGTCCATTTCTCAGATTAATCGACCTTTTTGTACTTGAAAATATAGATTAATAGTATCTCTGCCTGATTTCAATTTGCAAGATGGTAAAATACATTGGAATATCTAGAGAACATCTCTCTCGATCATATATTTTCATGAAAATTATCACCTCACACTTAGGAATTTTAAAAATTCAAGGCCTCCTACTCTAATATACTATGATAAATAAATGTTACAAAATCCAAAAACTTGACTGCATAATCGAGTCTGATTTAAAAGTTGGAAACTCTTTATGAAAGGAGTAGTCCCACGTTGGCTAATTAAGAGGGAAGGTCATAGGTTTATAAGTAAGGAACACTATTTTCATTGGTACGAGGCATTTTGAGAAAACTAATAGAGCTTATGCTCAAAGTGGACAATATCATACCATTATGTAGGTTCGTGATTCCCAACCAACTTCATTCAAGAATATCTTTGGTATGGGTTCTTAGTGAACGGATATTCTAAATTAGATTTGGTCATACCATTTTCAACACCTCGTCTTGGTTTGCAGGAAGGTCGCCCAATTGTGGGAGCTAAGACTACAGACATCATTCATGAAACTCGCAAACTGCAGATTAGGAGAAAGGGCAACAAATCTGAAGGATTATTTCCTACTGTGAATAATCCATGGCAGCAACCAAATCTACTGTCTCCAGAACCACGGATGCAAGCACATCATGAAACTCAACTGAAGGCATCTCGAGACGTAAGGTGCCAACTTTGTAATTATGTATCGTAATGCAACTCAATGAATTTTTCTCTTTCTCAAATTAAAGTCAAAATGATTGGAGTTGGATTCATCGTCTTAGCTAAGTTTAATCTGACAAAGTAAGTTGAAACTGTTTCCTAAGCTTGAGATTTCCAATGTGAATTTCATCAATTGAATTTAGATGGTTAATTTTACTCGTACTTGCTCGAACTTCAGTAAAAAATACCTTTGGTGATGAGTGCATCAATGGCATAACAATGTCAAAAGTACCTGTGGAATGTAGGTGGCGATGGCTACAGCTGCCAAAGCCAAACTATTACTCCGGGAGCTGAAAACCATTAAAGCAGAACTGGCTTTTGCTAAAGAAAGATGTGCACAATTAGAGGAAGAAAACAAAATTCTCAGGGAAAATCGAGAGAAGGGAGACAATCGTGCAGATGATGACTTGGTATGTATTTACTTTCCTCTCTCAGTTCTTGCCAGATCTTTAAGACTAGAACATCAATTCAAAATCATGGTTTGTATGTTCTAATGTTCATTTGGATCATGAAAAGATTTTTGAACTGTTTCTAAATCCAGATTCGGCTTCAACTTGAGACTCTTTTGGCTGAGAAGGCTCGTTTGGCGCATGAGAATTCGATATATGCACGCGAGAACCGATTCCTAAGGGAAATTGTGGAGTACCACCAACTGACAATGCAGGATGTGGTTTACCTAGACGAAGGAATGGAAGAAGTGACAGAATATTATCCAAGATCCACCTCTCCAGAGATCACCAGAATGCTTTTTAATTATGGTTCCCCACATTCCTCAACTTCTCCAACCTCACCTTCAGCAGCAGCCATGGAAGATCTTCCTGTTCCTCGTGTTCATCCAAAAGAAGGGAAGGACGACGACGACGATGACGATGATAAAGAAGACAGTGAGGGAGACCCATCTCCCACTACAGTTTCTGAGGAAGAAGAAGGTACAGAAAACTCCCCATTGCCTTCAAGTACTTCTTCCTAA

mRNA sequence

ATGGCGTATAGGAGGAGACAAGCAATCACGAGGGCTTCCACTTTCAAAGAAGAGATCCATCATCCTGCCGACGATGTTGATCATAATTCCATCTCTTCTTCCTCTTCTTCTCTTGCTGCTCAGGCGATCCGAGCTTCTGCTGCTCATCGAGATTCTTCGAATTCCTCTGCCTTTTCCGGTTCCTCCTCCGTTTTCTCCCCTGGACATATGCGATCCAAGAGTTTTGCTGGCGAGAATTTGACATTCAAAAGTGAGTCGAAATCCGGGTTTTGGGGCGTACTGGCTCGGAAAGCTAAGGCAATTCTCGAAGAAGATGACATAGCCATAGAAGAGGAAACTAGTGGTTTCCAGCCCATTAATACCTCAACCCTTAGCCAGGAGCCATGTCAGTCTACTGATCACGACTCGAAGAAATCGGACAATCCAGCAAAACCGAAGGGTTTGGATGCAATCACAACTTCGTTTAATCAACTTGGGGACACAAATCAAAAGGCATCTAAGGAAGGTCGCCCAATTGTGGGAGCTAAGACTACAGACATCATTCATGAAACTCGCAAACTGCAGATTAGGAGAAAGGGCAACAAATCTGAAGGATTATTTCCTACTGTGAATAATCCATGGCAGCAACCAAATCTACTGTCTCCAGAACCACGGATGCAAGCACATCATGAAACTCAACTGAAGGCATCTCGAGACGTGGCGATGGCTACAGCTGCCAAAGCCAAACTATTACTCCGGGAGCTGAAAACCATTAAAGCAGAACTGGCTTTTGCTAAAGAAAGATGTGCACAATTAGAGGAAGAAAACAAAATTCTCAGGGAAAATCGAGAGAAGGGAGACAATCGTGCAGATGATGACTTGATTCGGCTTCAACTTGAGACTCTTTTGGCTGAGAAGGCTCGTTTGGCGCATGAGAATTCGATATATGCACGCGAGAACCGATTCCTAAGGGAAATTGTGGAGTACCACCAACTGACAATGCAGGATGTGGTTTACCTAGACGAAGGAATGGAAGAAGTGACAGAATATTATCCAAGATCCACCTCTCCAGAGATCACCAGAATGCTTTTTAATTATGGTTCCCCACATTCCTCAACTTCTCCAACCTCACCTTCAGCAGCAGCCATGGAAGATCTTCCTGTTCCTCGTGTTCATCCAAAAGAAGGGAAGGACGACGACGACGATGACGATGATAAAGAAGACAGTGAGGGAGACCCATCTCCCACTACAGTTTCTGAGGAAGAAGAAGGTACAGAAAACTCCCCATTGCCTTCAAGTACTTCTTCCTAA

Coding sequence (CDS)

ATGGCGTATAGGAGGAGACAAGCAATCACGAGGGCTTCCACTTTCAAAGAAGAGATCCATCATCCTGCCGACGATGTTGATCATAATTCCATCTCTTCTTCCTCTTCTTCTCTTGCTGCTCAGGCGATCCGAGCTTCTGCTGCTCATCGAGATTCTTCGAATTCCTCTGCCTTTTCCGGTTCCTCCTCCGTTTTCTCCCCTGGACATATGCGATCCAAGAGTTTTGCTGGCGAGAATTTGACATTCAAAAGTGAGTCGAAATCCGGGTTTTGGGGCGTACTGGCTCGGAAAGCTAAGGCAATTCTCGAAGAAGATGACATAGCCATAGAAGAGGAAACTAGTGGTTTCCAGCCCATTAATACCTCAACCCTTAGCCAGGAGCCATGTCAGTCTACTGATCACGACTCGAAGAAATCGGACAATCCAGCAAAACCGAAGGGTTTGGATGCAATCACAACTTCGTTTAATCAACTTGGGGACACAAATCAAAAGGCATCTAAGGAAGGTCGCCCAATTGTGGGAGCTAAGACTACAGACATCATTCATGAAACTCGCAAACTGCAGATTAGGAGAAAGGGCAACAAATCTGAAGGATTATTTCCTACTGTGAATAATCCATGGCAGCAACCAAATCTACTGTCTCCAGAACCACGGATGCAAGCACATCATGAAACTCAACTGAAGGCATCTCGAGACGTGGCGATGGCTACAGCTGCCAAAGCCAAACTATTACTCCGGGAGCTGAAAACCATTAAAGCAGAACTGGCTTTTGCTAAAGAAAGATGTGCACAATTAGAGGAAGAAAACAAAATTCTCAGGGAAAATCGAGAGAAGGGAGACAATCGTGCAGATGATGACTTGATTCGGCTTCAACTTGAGACTCTTTTGGCTGAGAAGGCTCGTTTGGCGCATGAGAATTCGATATATGCACGCGAGAACCGATTCCTAAGGGAAATTGTGGAGTACCACCAACTGACAATGCAGGATGTGGTTTACCTAGACGAAGGAATGGAAGAAGTGACAGAATATTATCCAAGATCCACCTCTCCAGAGATCACCAGAATGCTTTTTAATTATGGTTCCCCACATTCCTCAACTTCTCCAACCTCACCTTCAGCAGCAGCCATGGAAGATCTTCCTGTTCCTCGTGTTCATCCAAAAGAAGGGAAGGACGACGACGACGATGACGATGATAAAGAAGACAGTGAGGGAGACCCATCTCCCACTACAGTTTCTGAGGAAGAAGAAGGTACAGAAAACTCCCCATTGCCTTCAAGTACTTCTTCCTAA

Protein sequence

MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSLAAQAIRASAAHRDSSNSSAFSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQPINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKTTDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMATAAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLFNYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEGTENSPLPSSTSS
Homology
BLAST of CmoCh03G002850 vs. ExPASy TrEMBL
Match: A0A6J1GFX8 (uncharacterized protein LOC111453596 OS=Cucurbita moschata OX=3662 GN=LOC111453596 PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 2.8e-228
Identity = 429/429 (100.00%), Postives = 429/429 (100.00%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSLAAQAIRASAAHRDSSNSSAFSG 60
           MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSLAAQAIRASAAHRDSSNSSAFSG
Sbjct: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSLAAQAIRASAAHRDSSNSSAFSG 60

Query: 61  SSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQPIN 120
           SSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQPIN
Sbjct: 61  SSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQPIN 120

Query: 121 TSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKTTDI 180
           TSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKTTDI
Sbjct: 121 TSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKTTDI 180

Query: 181 IHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMATAAK 240
           IHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMATAAK
Sbjct: 181 IHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMATAAK 240

Query: 241 AKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKA 300
           AKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKA
Sbjct: 241 AKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKA 300

Query: 301 RLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLFNYG 360
           RLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLFNYG
Sbjct: 301 RLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLFNYG 360

Query: 361 SPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEGTEN 420
           SPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEGTEN
Sbjct: 361 SPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEGTEN 420

Query: 421 SPLPSSTSS 430
           SPLPSSTSS
Sbjct: 421 SPLPSSTSS 429

BLAST of CmoCh03G002850 vs. ExPASy TrEMBL
Match: A0A6J1IM16 (uncharacterized protein LOC111478244 OS=Cucurbita maxima OX=3661 GN=LOC111478244 PE=4 SV=1)

HSP 1 Score: 775.4 bits (2001), Expect = 1.2e-220
Identity = 421/432 (97.45%), Postives = 422/432 (97.69%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSI---SSSSSSLAAQAIRASAAHRDSSNSSA 60
           MAYRRRQAITRASTFKEEIH PADDVDHNSI   SSSSSSLA QAIRASAAHRDSSNSSA
Sbjct: 1   MAYRRRQAITRASTFKEEIHRPADDVDHNSISSSSSSSSSLAVQAIRASAAHRDSSNSSA 60

Query: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120
           FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ
Sbjct: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120

Query: 121 PINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT 180
           PINTSTLSQE CQ TDHDSKKSDNPAKPKGLDAITTS NQLGDTNQKASKEGRPIV AKT
Sbjct: 121 PINTSTLSQEACQFTDHDSKKSDNPAKPKGLDAITTSLNQLGDTNQKASKEGRPIVEAKT 180

Query: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240
           TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEP+MQAHHETQLKASRDVAMAT
Sbjct: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPQMQAHHETQLKASRDVAMAT 240

Query: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300
           AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA
Sbjct: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300

Query: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360
           EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF
Sbjct: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360

Query: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEG 420
           NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKE KDDDDDDDDKEDSEGDPSPTTVSEEEEG
Sbjct: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEEKDDDDDDDDKEDSEGDPSPTTVSEEEEG 420

Query: 421 TENSPLPSSTSS 430
           TENSPLPSSTSS
Sbjct: 421 TENSPLPSSTSS 432

BLAST of CmoCh03G002850 vs. ExPASy TrEMBL
Match: A0A5D3C4F3 (Actin cytoskeleton-regulatory complex protein pan1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold287G00700 PE=4 SV=1)

HSP 1 Score: 591.3 bits (1523), Expect = 3.3e-165
Identity = 346/437 (79.18%), Postives = 368/437 (84.21%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSI---SSSSSSLAAQAIRASAAHRDSSNSSA 60
           MAYRRRQAITRASTFKEEIH+P+D+ DHNSI   SSSSSSLAAQAIRASA H DSS+SSA
Sbjct: 1   MAYRRRQAITRASTFKEEIHYPSDEFDHNSISSSSSSSSSLAAQAIRASATHHDSSHSSA 60

Query: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120
           F+G+SS FSPGHMRSKSFAGEN+ FKSESKSGFWGVLARKAKAILEEDDIAIEEE S FQ
Sbjct: 61  FAGASSNFSPGHMRSKSFAGENV-FKSESKSGFWGVLARKAKAILEEDDIAIEEEPSRFQ 120

Query: 121 PINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT 180
           PINTS  SQEPCQST+ DSKK+DNPA  KGLDAI+TS NQLGDT +KA  EGR IV  KT
Sbjct: 121 PINTSNRSQEPCQSTNCDSKKTDNPAIRKGLDAISTSLNQLGDTFEKAYGEGRTIVENKT 180

Query: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240
            DII ETRKLQIR+KGN +EGL+P VNN WQQPN+ SPEP MQ HHETQLKASRDVAMAT
Sbjct: 181 ADIIQETRKLQIRKKGNNTEGLYPAVNNQWQQPNIQSPEPNMQTHHETQLKASRDVAMAT 240

Query: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300
           AAKAKLLLRELKTIKA+LAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA
Sbjct: 241 AAKAKLLLRELKTIKADLAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300

Query: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360
           EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT+ L 
Sbjct: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKTLS 360

Query: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVS----- 420
           N  SP S TSP+SP    ME LP     P + K  D D  DK+  E   S TT S     
Sbjct: 361 NSVSPRSPTSPSSP----MEVLPAVPPPPIQSK-QDKDHHDKDSDEHPTSSTTFSEEEEE 420

Query: 421 EEEEGTENSPLPSSTSS 430
           EEEEGT+  PLPS+TSS
Sbjct: 421 EEEEGTKTKPLPSTTSS 431

BLAST of CmoCh03G002850 vs. ExPASy TrEMBL
Match: A0A0A0KXX4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G337320 PE=4 SV=1)

HSP 1 Score: 591.3 bits (1523), Expect = 3.3e-165
Identity = 338/433 (78.06%), Postives = 367/433 (84.76%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSI---SSSSSSLAAQAIRASAAHRDSSNSSA 60
           MAYRRRQAITRASTFKEEIH+P+D++DHNSI   SSSSSSLAAQAIRASA  RDSS+SSA
Sbjct: 1   MAYRRRQAITRASTFKEEIHYPSDELDHNSISSSSSSSSSLAAQAIRASATQRDSSHSSA 60

Query: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120
           F+G+SS FSPGH+RSKSFAGEN+ FKS+SKSGFWGVLARKAKAILEEDDIAIE+E S FQ
Sbjct: 61  FAGASSNFSPGHLRSKSFAGENI-FKSDSKSGFWGVLARKAKAILEEDDIAIEDEPSRFQ 120

Query: 121 PINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT 180
           PIN S  SQEPCQST+ DSKK+DNPA  KGLDAI+TS NQLGDT +KA +EGR IV  KT
Sbjct: 121 PINNSNRSQEPCQSTNCDSKKTDNPAIRKGLDAISTSLNQLGDTFEKAYEEGRTIVENKT 180

Query: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240
            DII ETRKLQIR+KGN +EGL+P VNN WQQPN+ SPEP MQ HHETQLKASRDVAMAT
Sbjct: 181 ADIIQETRKLQIRKKGNNTEGLYPAVNNQWQQPNIQSPEPNMQTHHETQLKASRDVAMAT 240

Query: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300
           AAKAKLLLRELKTIKA+LAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA
Sbjct: 241 AAKAKLLLRELKTIKADLAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300

Query: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360
           EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT+ML 
Sbjct: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKMLS 360

Query: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSP-TTVSEEEE 420
           N  SP S TSP+SP        P P +  K+ KD  D D D+  +     P     EEEE
Sbjct: 361 NSVSPRSPTSPSSPMEVLPVVPPPPPIQSKQDKDHHDKDSDEHPTSSTTFPEEEEEEEEE 420

Query: 421 GTENSPLPSSTSS 430
           GT+ +PLPSSTSS
Sbjct: 421 GTKKNPLPSSTSS 432

BLAST of CmoCh03G002850 vs. ExPASy TrEMBL
Match: A0A1S3CJA7 (uncharacterized protein LOC103501490 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501490 PE=4 SV=1)

HSP 1 Score: 591.3 bits (1523), Expect = 3.3e-165
Identity = 346/437 (79.18%), Postives = 368/437 (84.21%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSI---SSSSSSLAAQAIRASAAHRDSSNSSA 60
           MAYRRRQAITRASTFKEEIH+P+D+ DHNSI   SSSSSSLAAQAIRASA H DSS+SSA
Sbjct: 1   MAYRRRQAITRASTFKEEIHYPSDEFDHNSISSSSSSSSSLAAQAIRASATHHDSSHSSA 60

Query: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120
           F+G+SS FSPGHMRSKSFAGEN+ FKSESKSGFWGVLARKAKAILEEDDIAIEEE S FQ
Sbjct: 61  FAGASSNFSPGHMRSKSFAGENV-FKSESKSGFWGVLARKAKAILEEDDIAIEEEPSRFQ 120

Query: 121 PINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT 180
           PINTS  SQEPCQST+ DSKK+DNPA  KGLDAI+TS NQLGDT +KA  EGR IV  KT
Sbjct: 121 PINTSNRSQEPCQSTNCDSKKTDNPAIRKGLDAISTSLNQLGDTFEKAYGEGRTIVENKT 180

Query: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240
            DII ETRKLQIR+KGN +EGL+P VNN WQQPN+ SPEP MQ HHETQLKASRDVAMAT
Sbjct: 181 ADIIQETRKLQIRKKGNNTEGLYPAVNNQWQQPNIQSPEPNMQTHHETQLKASRDVAMAT 240

Query: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300
           AAKAKLLLRELKTIKA+LAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA
Sbjct: 241 AAKAKLLLRELKTIKADLAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300

Query: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360
           EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT+ L 
Sbjct: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPISTSPEITKTLS 360

Query: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVS----- 420
           N  SP S TSP+SP    ME LP     P + K  D D  DK+  E   S TT S     
Sbjct: 361 NSVSPRSPTSPSSP----MEVLPAVPPPPIQSK-QDKDHHDKDSDEHPTSSTTFSEEEEE 420

Query: 421 EEEEGTENSPLPSSTSS 430
           EEEEGT+  PLPS+TSS
Sbjct: 421 EEEEGTKTKPLPSTTSS 431

BLAST of CmoCh03G002850 vs. NCBI nr
Match: XP_022950505.1 (uncharacterized protein LOC111453596 [Cucurbita moschata])

HSP 1 Score: 800.8 bits (2067), Expect = 5.7e-228
Identity = 429/429 (100.00%), Postives = 429/429 (100.00%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSLAAQAIRASAAHRDSSNSSAFSG 60
           MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSLAAQAIRASAAHRDSSNSSAFSG
Sbjct: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSLAAQAIRASAAHRDSSNSSAFSG 60

Query: 61  SSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQPIN 120
           SSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQPIN
Sbjct: 61  SSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQPIN 120

Query: 121 TSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKTTDI 180
           TSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKTTDI
Sbjct: 121 TSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKTTDI 180

Query: 181 IHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMATAAK 240
           IHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMATAAK
Sbjct: 181 IHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMATAAK 240

Query: 241 AKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKA 300
           AKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKA
Sbjct: 241 AKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKA 300

Query: 301 RLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLFNYG 360
           RLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLFNYG
Sbjct: 301 RLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLFNYG 360

Query: 361 SPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEGTEN 420
           SPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEGTEN
Sbjct: 361 SPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEGTEN 420

Query: 421 SPLPSSTSS 430
           SPLPSSTSS
Sbjct: 421 SPLPSSTSS 429

BLAST of CmoCh03G002850 vs. NCBI nr
Match: KAG7033696.1 (hypothetical protein SDJN02_03421, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 784.6 bits (2025), Expect = 4.2e-223
Identity = 427/438 (97.49%), Postives = 428/438 (97.72%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSI---SSSSSSLAAQAIRASAAHRDSSNSSA 60
           MAYRRRQAITRASTFKEEIHHPADDVDHNSI   SSSSSSLAAQAIRASAAHRDSSNSSA
Sbjct: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSSSSLAAQAIRASAAHRDSSNSSA 60

Query: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120
           FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ
Sbjct: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120

Query: 121 PINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT 180
           PI+TSTLSQEPCQSTDHDSKK DNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT
Sbjct: 121 PISTSTLSQEPCQSTDHDSKKLDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT 180

Query: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240
           TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT
Sbjct: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240

Query: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300
           AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA
Sbjct: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300

Query: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360
           EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF
Sbjct: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360

Query: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGK------DDDDDDDDKEDSEGDPSPTTV 420
           NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGK      DDDDDDDDKEDSEGDPSPTTV
Sbjct: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDDDDDDDKEDSEGDPSPTTV 420

Query: 421 SEEEEGTENSPLPSSTSS 430
           SEEEEGTENSPLPSSTSS
Sbjct: 421 SEEEEGTENSPLPSSTSS 438

BLAST of CmoCh03G002850 vs. NCBI nr
Match: XP_022978186.1 (uncharacterized protein LOC111478244 [Cucurbita maxima])

HSP 1 Score: 775.4 bits (2001), Expect = 2.6e-220
Identity = 421/432 (97.45%), Postives = 422/432 (97.69%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSI---SSSSSSLAAQAIRASAAHRDSSNSSA 60
           MAYRRRQAITRASTFKEEIH PADDVDHNSI   SSSSSSLA QAIRASAAHRDSSNSSA
Sbjct: 1   MAYRRRQAITRASTFKEEIHRPADDVDHNSISSSSSSSSSLAVQAIRASAAHRDSSNSSA 60

Query: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120
           FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ
Sbjct: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120

Query: 121 PINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT 180
           PINTSTLSQE CQ TDHDSKKSDNPAKPKGLDAITTS NQLGDTNQKASKEGRPIV AKT
Sbjct: 121 PINTSTLSQEACQFTDHDSKKSDNPAKPKGLDAITTSLNQLGDTNQKASKEGRPIVEAKT 180

Query: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240
           TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEP+MQAHHETQLKASRDVAMAT
Sbjct: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPQMQAHHETQLKASRDVAMAT 240

Query: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300
           AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA
Sbjct: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300

Query: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360
           EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF
Sbjct: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360

Query: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDKEDSEGDPSPTTVSEEEEG 420
           NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKE KDDDDDDDDKEDSEGDPSPTTVSEEEEG
Sbjct: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEEKDDDDDDDDKEDSEGDPSPTTVSEEEEG 420

Query: 421 TENSPLPSSTSS 430
           TENSPLPSSTSS
Sbjct: 421 TENSPLPSSTSS 432

BLAST of CmoCh03G002850 vs. NCBI nr
Match: XP_023544827.1 (uncharacterized protein LOC111804305 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 774.2 bits (1998), Expect = 5.7e-220
Identity = 420/436 (96.33%), Postives = 425/436 (97.48%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSI---SSSSSSLAAQAIRASAAHRDSSNSSA 60
           MAYRRRQAITRASTFKEEIHHPADDVDHNSI   SSSSSSLA QAIRASAAHRDSSNSSA
Sbjct: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSSSSLAVQAIRASAAHRDSSNSSA 60

Query: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120
           FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ
Sbjct: 61  FSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQ 120

Query: 121 PINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKT 180
           PINTSTLSQEPCQSTDHDSKKSDNPAKPK LDAITTSFNQLGDTN+KAS+EG PIVGAKT
Sbjct: 121 PINTSTLSQEPCQSTDHDSKKSDNPAKPKRLDAITTSFNQLGDTNKKASEEGHPIVGAKT 180

Query: 181 TDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240
           TDIIHE RKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT
Sbjct: 181 TDIIHENRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSPEPRMQAHHETQLKASRDVAMAT 240

Query: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300
           AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA
Sbjct: 241 AAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLA 300

Query: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360
           EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF
Sbjct: 301 EKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEITRMLF 360

Query: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGK----DDDDDDDDKEDSEGDPSPTTVSE 420
           NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGK    DDDDDDDDKEDSEGDPSPTTVSE
Sbjct: 361 NYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDDDDDDDDDKEDSEGDPSPTTVSE 420

Query: 421 EEEGTENSPLPSSTSS 430
           EEEGTENSPLPS+++S
Sbjct: 421 EEEGTENSPLPSTSTS 436

BLAST of CmoCh03G002850 vs. NCBI nr
Match: KAG6603516.1 (hypothetical protein SDJN03_04125, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 726.9 bits (1875), Expect = 1.0e-205
Identity = 393/399 (98.50%), Postives = 394/399 (98.75%), Query Frame = 0

Query: 36  SSLAAQAIRASAAHRDSSNSSAFSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLA 95
           SSLAAQAIRASAAHRDSSNSSAFSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLA
Sbjct: 12  SSLAAQAIRASAAHRDSSNSSAFSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLA 71

Query: 96  RKAKAILEEDDIAIEEETSGFQPINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSF 155
           RKAKAILEEDDIAIEEETSGFQPI+TSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSF
Sbjct: 72  RKAKAILEEDDIAIEEETSGFQPISTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSF 131

Query: 156 NQLGDTNQKASKEGRPIVGAKTTDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSP 215
           NQLGDTNQKASKEGRPIVGAKTTDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSP
Sbjct: 132 NQLGDTNQKASKEGRPIVGAKTTDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQPNLLSP 191

Query: 216 EPRMQAHHETQLKASRDVAMATAAKAKLLLRELKTIKAELAFAKERCAQLEEENKILREN 275
           EPRMQAHHETQLKASRDVAMATAAKAKLLLRELKTIKAELAFAKERCAQLEEENKILREN
Sbjct: 192 EPRMQAHHETQLKASRDVAMATAAKAKLLLRELKTIKAELAFAKERCAQLEEENKILREN 251

Query: 276 REKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDE 335
           REKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDE
Sbjct: 252 REKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDE 311

Query: 336 GMEEVTEYYPRSTSPEITRMLFNYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGK----- 395
           GMEEVTEYYPRSTSPEITRMLFNYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGK     
Sbjct: 312 GMEEVTEYYPRSTSPEITRMLFNYGSPHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDDDD 371

Query: 396 DDDDDDDDKEDSEGDPSPTTVSEEEEGTENSPLPSSTSS 430
           DDDDDDDDKEDSEGDPSPTTVSEEEEGTENSPLPSSTSS
Sbjct: 372 DDDDDDDDKEDSEGDPSPTTVSEEEEGTENSPLPSSTSS 410

BLAST of CmoCh03G002850 vs. TAIR 10
Match: AT5G01970.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G30050.1); Has 240 Blast hits to 236 proteins in 72 species: Archae - 0; Bacteria - 15; Metazoa - 51; Fungi - 19; Plants - 119; Viruses - 0; Other Eukaryotes - 36 (source: NCBI BLink). )

HSP 1 Score: 283.9 bits (725), Expect = 2.2e-76
Identity = 197/377 (52.25%), Postives = 247/377 (65.52%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSISSSSSSLAAQAIRASAAHRDSSNSSAFSG 60
           MAYRRRQ I + +TFKEE+    D +   SI+               A +D S  +    
Sbjct: 1   MAYRRRQGIGKFATFKEEV----DRLPPESIT---------------AVKDRSPPAR--- 60

Query: 61  SSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSGFQPIN 120
           SS  F     RSK+F  E          G WGV+A+KAK+++ EDD + +  T+  Q   
Sbjct: 61  SSPAFD--QPRSKNFTTE--------PKGLWGVIAQKAKSVI-EDDKSSDRSTTASQS-R 120

Query: 121 TSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKASKEGRPIVGAKTTDI 180
            S LS       D   KK DNP   +GLD +T+S NQ+GDT +KA ++GR +V  KT DI
Sbjct: 121 FSYLS-------DEGFKKMDNPKLRRGLDKLTSSLNQIGDTFEKAFEDGRTLVENKTADI 180

Query: 181 IHETRKLQIRRKG----NKSEGLFPTVNNPWQQPNLLSPEPRMQAH---HETQLKASRDV 240
           I ETRKLQ RR+G    ++++     V++ W++    SPE  MQ +   HETQLKASRDV
Sbjct: 181 IQETRKLQTRRRGTGGEDENQNQSYGVSSSWKK----SPEQPMQLNHIEHETQLKASRDV 240

Query: 241 AMATAAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRE-NREKGDNRADDDLIRLQL 300
           AMATAAKAKLLLRELKT+KA+LAFAKERCAQLEEENK LRE +REKG N AD+DLIRLQL
Sbjct: 241 AMATAAKAKLLLRELKTVKADLAFAKERCAQLEEENKHLRESHREKGSNPADEDLIRLQL 300

Query: 301 ETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEYYPRSTSPEI 360
           E+LLAEKARLAHENS+YARENRFLREIVEYHQLTMQDVVY+DEG EEVT+      SP +
Sbjct: 301 ESLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDVVYIDEGSEEVTQ-----VSPFV 327

Query: 361 TRMLFNYGSPHSSTSPT 370
           + ++ +  +  S + P+
Sbjct: 361 STLMTSSPADRSQSPPS 327

BLAST of CmoCh03G002850 vs. TAIR 10
Match: AT1G30050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 246 Blast hits to 244 proteins in 61 species: Archae - 0; Bacteria - 8; Metazoa - 78; Fungi - 10; Plants - 117; Viruses - 0; Other Eukaryotes - 33 (source: NCBI BLink). )

HSP 1 Score: 268.9 bits (686), Expect = 7.2e-72
Identity = 218/456 (47.81%), Postives = 266/456 (58.33%), Query Frame = 0

Query: 1   MAYRRRQAITRASTFKEEIHHPADDVDHNSIS--------------SSSSSLAAQAIRAS 60
           MAYRRRQ ITRASTFKE+I+H   D DH  +               SS SSLAAQAIRA 
Sbjct: 1   MAYRRRQGITRASTFKEDIYHQPPDPDHGDLKGHSNGGSFRSSQSFSSHSSLAAQAIRA- 60

Query: 61  AAHRDSSNSSAFSGSSSVFSPGHMRSKSFAGENLTFKSESKSGFWGVLARKAKAILEEDD 120
                SS +  F+                       KSES+ GFWG+LA+KAK+ILE+  
Sbjct: 61  -----SSQAQGFTAYED-------------------KSESR-GFWGILAQKAKSILED-- 120

Query: 121 IAIEEETSGFQPINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKAS 180
              EEE             Q+  Q  +    +  NP   K +D ITTS N +GD+ +KA 
Sbjct: 121 ---EEE------------QQQQQQQQNDVIFEPSNPTIRKSIDKITTSLNHIGDSFEKAF 180

Query: 181 KEGRPIVGAKTTDIIHETRKLQIRRKG-------NKSEGLFPTVNNPWQQPNLLSPEPRM 240
           +EGR IV +            QIRRKG       N +       ++PWQ   L  P PR 
Sbjct: 181 EEGRTIVAS------------QIRRKGSDLIDSDNNNYHQSSGSSSPWQP--LTQPNPR- 240

Query: 241 QAHHETQLKASRDVAMATAAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENREKG 300
               E+QLKASRDVAMATAAKAKLLLRELKT+KA+LAFAKERC+QLEEENK LR+NR+KG
Sbjct: 241 ----ESQLKASRDVAMATAAKAKLLLRELKTVKADLAFAKERCSQLEEENKRLRDNRDKG 300

Query: 301 DNR-ADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGME 360
           +N  ADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVY+DEG+E
Sbjct: 301 NNNPADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYIDEGIE 360

Query: 361 EVTEYYPRSTSPEITRML--FNYGS---PHSSTSPTSPSAAAMEDLPVPRVHPKEGKDDD 420
           EV E      +P ITR L   +Y +   P  S SP+SP++ +   +    ++P   +   
Sbjct: 361 EVAE-----VNPSITRTLSMASYSASELPSISPSPSSPASPSRLSVSTD-IYPILVQQSS 388

Query: 421 DDDDDKEDSEGDPSPTTVSEEEEGTENSPLPSSTSS 430
            +D   E  +    P+    ++    +S LPSS  S
Sbjct: 421 TNDITVESPKPVRPPSLGYTDDGKRPSSQLPSSQLS 388

BLAST of CmoCh03G002850 vs. TAIR 10
Match: AT2G30530.1 (unknown protein; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 5513 Blast hits to 872 proteins in 154 species: Archae - 0; Bacteria - 30; Metazoa - 615; Fungi - 144; Plants - 149; Viruses - 12; Other Eukaryotes - 4563 (source: NCBI BLink). )

HSP 1 Score: 261.5 bits (667), Expect = 1.2e-69
Identity = 173/320 (54.06%), Postives = 211/320 (65.94%), Query Frame = 0

Query: 32  SSSSSSLAAQAIRASAAHRDSSNSSAFSGSSSVFSPGHMRSKSFAGENLTFKS--ESKSG 91
           SS++SSLAA+AIRAS+AHRDSS SSA+S  SS   P   +  + A E  + KS  E K G
Sbjct: 48  SSAASSLAAKAIRASSAHRDSSLSSAYSSPSSAPVPTPPKEVNKAYEYTSMKSLNEPKRG 107

Query: 92  FWGVLARKAKAILEEDDIAIEEETSGFQPINTSTLSQEPCQSTDHDSKKSDNPAKPKGLD 151
           FWG LA KAKA L+EDD     ++      +  + +    +      +KS+NP+  + LD
Sbjct: 108 FWGSLASKAKAFLDEDDPNQLPQSPKRMEQSIPSATTSGTKEAGQTGRKSENPSLQRRLD 167

Query: 152 AITTSFNQLGDTNQKASKEGRPIVGAKTTDIIHETRKLQIRRKGNKSEGLFPTVNNPWQQ 211
           AIT+S N +G T     +EG   V  +T  II ETRK +I++K        P++    Q 
Sbjct: 168 AITSSLNYIGGTIGTVVEEGITAVENRTAGIIQETRK-KIKKK--------PSLTRNQQN 227

Query: 212 PNLLSPEPRMQAHHETQLKASRDVAMATAAKAKLLLRELKTIKAELAFAKERCAQLEEEN 271
           P +       QA  E QLKASRDVAMA AAKAKLLLRELK +K++LAFAK+RCAQLEEEN
Sbjct: 228 PEI-------QADLEIQLKASRDVAMAMAAKAKLLLRELKMVKSDLAFAKQRCAQLEEEN 287

Query: 272 KILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQD 331
           K+LRENR       DDDL+RLQLETLLAEKARLAHENSIY REN +LR +VEYHQLTMQD
Sbjct: 288 KVLRENRSGDSQTDDDDLVRLQLETLLAEKARLAHENSIYTRENLYLRGVVEYHQLTMQD 347

Query: 332 VVYLDEGMEEVTEYYPRSTS 350
           VVY DE  EEVTE YP + S
Sbjct: 348 VVYFDEKTEEVTEVYPINVS 351

BLAST of CmoCh03G002850 vs. TAIR 10
Match: AT4G02800.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 3209 Blast hits to 2720 proteins in 308 species: Archae - 13; Bacteria - 213; Metazoa - 1207; Fungi - 247; Plants - 183; Viruses - 21; Other Eukaryotes - 1325 (source: NCBI BLink). )

HSP 1 Score: 119.0 bits (297), Expect = 9.3e-27
Identity = 87/234 (37.18%), Postives = 137/234 (58.55%), Query Frame = 0

Query: 106 DIAIEEETSGFQPINTSTLSQEPCQSTDHDSKKSDNPAKPKGLDAITTSFNQLGDTNQKA 165
           D+ +EE +   +P+ T+T++ E  +S      K+D+    KG D+++ + + L      A
Sbjct: 52  DVLLEENSKNHKPV-TNTIAIESERSKRF---KNDSMLLLKGFDSVSHTLSLLSSNLDNA 111

Query: 166 SKEGRPIVGAKT-TDIIHETRKL-QIRR-------KGNKSEGLFPTVNNPWQQPNLLSPE 225
            +  R +    + ++I+H   K  QI+R       +  +S+G      +  +Q    S E
Sbjct: 112 LQGVRELAKPPSYSEILHSNLKADQIQRQQKEEDEEEEESKGKKRKHESDVEQTEDSSNE 171

Query: 226 PRMQAHHETQLKASRDVAMATAAKAKLLLRELKTIKAELAFAKERCAQLEEENKILRENR 285
              +      +K ++++A++ AAKA  L RELKTIK++L+F +ERC  LEEENK LR+  
Sbjct: 172 EEKRPKERKIMKKAKNIAISMAAKANSLARELKTIKSDLSFIQERCGLLEEENKRLRDGF 231

Query: 286 EKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDV 331
            KG    +DDL+RLQLE LLAEKARLA+EN+   REN+ L ++VEYHQ+T QD+
Sbjct: 232 VKGVRPEEDDLVRLQLEVLLAEKARLANENANLVRENQCLHQMVEYHQITSQDL 281

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1GFX82.8e-228100.00uncharacterized protein LOC111453596 OS=Cucurbita moschata OX=3662 GN=LOC1114535... [more]
A0A6J1IM161.2e-22097.45uncharacterized protein LOC111478244 OS=Cucurbita maxima OX=3661 GN=LOC111478244... [more]
A0A5D3C4F33.3e-16579.18Actin cytoskeleton-regulatory complex protein pan1 OS=Cucumis melo var. makuwa O... [more]
A0A0A0KXX43.3e-16578.06Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G337320 PE=4 SV=1[more]
A0A1S3CJA73.3e-16579.18uncharacterized protein LOC103501490 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
XP_022950505.15.7e-228100.00uncharacterized protein LOC111453596 [Cucurbita moschata][more]
KAG7033696.14.2e-22397.49hypothetical protein SDJN02_03421, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022978186.12.6e-22097.45uncharacterized protein LOC111478244 [Cucurbita maxima][more]
XP_023544827.15.7e-22096.33uncharacterized protein LOC111804305 [Cucurbita pepo subsp. pepo][more]
KAG6603516.11.0e-20598.50hypothetical protein SDJN03_04125, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
AT5G01970.12.2e-7652.25unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G30050.17.2e-7247.81unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G30530.11.2e-6954.06unknown protein; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant ... [more]
AT4G02800.19.3e-2737.18unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 241..282
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..66
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 393..412
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 28..66
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 357..373
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 114..130
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 10..27
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 413..429
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 357..429
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 113..150
NoneNo IPR availablePANTHERPTHR31016UNCHARACTERIZEDcoord: 1..375
NoneNo IPR availablePANTHERPTHR31016:SF12HEAT-INDUCIBLE TRANSCRIPTION REPRESSOR-RELATEDcoord: 1..375

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh03G002850.1CmoCh03G002850.1mRNA