Cp4.1LG02g11230 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g11230
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: histone-lysine N-methyltransferase SETD1B-like
Location: Cp4.1LG02: 10290352 .. 10290954 (+)
RNA-Seq Expression: Cp4.1LG02g11230
Synteny: Cp4.1LG02g11230
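
As a quick consistency check on the Location field, the sketch below computes the feature length from the listed coordinates. It assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page; the helper name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Coordinates taken from the Location field of this record.
start, end = 10290352, 10290954

length = feature_length(start, end)
codons, remainder = divmod(length, 3)

print(f"length = {length} bp, codons = {codons}, remainder = {remainder}")
```

Under that assumption the span is 603 bp, which is 201 codons: 200 amino acids plus one stop codon, in line with the 200-residue protein listed under Sequences.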
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCAACAAATTCAGGGGATTTTGGAGGCAAATTTGGGAAAATGAGGCTTAAAAGCCTCAAAACAAGGGATGATATATTAACGGTGGATGAAGATTACCACACAGGAGTATCAGCCACAGTGCCATTCATATGGGAGTCCGAGCCAGGCACTCCCAAGGCCAATTTCAGGGACCATGGCTCACTCCTTTCACCTCTCACACCCCCACCATCCTATTTCTCCTCACCCTCACACACACCCTCCAAGCATAAATATAACACCAACCCACATTCCTCTCGAGCCAATTTCCTCACCTCGGTTTTCAAGAAGCTTTCTATGAATAAGGCTCCTCTCCATCCGCTCTCTCCCGGCTCGTCGTCGTCGTCGACGACTTCGTCTACGACACTGACAAGCTCTGAGGGAGAGAGAGGAAGAAGCCCGAGGAGATTGTCGTTTGATTCGAGAGTGGACGAGAACGAGCACGAGGACGAAGAAGAGGAAGAGGAGAATTTTGACTCGCCTGTTTCGACTTTGTTCTTTGGACGTGGCGGAGGATCGAGTGATAAAGGATGCTATTGCTATCCAAAGTTGGTGAAGGTATTCTCCAAGCATTGTTCCAAATGA

mRNA sequence

ATGTCAACAAATTCAGGGGATTTTGGAGGCAAATTTGGGAAAATGAGGCTTAAAAGCCTCAAAACAAGGGATGATATATTAACGGTGGATGAAGATTACCACACAGGAGTATCAGCCACAGTGCCATTCATATGGGAGTCCGAGCCAGGCACTCCCAAGGCCAATTTCAGGGACCATGGCTCACTCCTTTCACCTCTCACACCCCCACCATCCTATTTCTCCTCACCCTCACACACACCCTCCAAGCATAAATATAACACCAACCCACATTCCTCTCGAGCCAATTTCCTCACCTCGGTTTTCAAGAAGCTTTCTATGAATAAGGCTCCTCTCCATCCGCTCTCTCCCGGCTCGTCGTCGTCGTCGACGACTTCGTCTACGACACTGACAAGCTCTGAGGGAGAGAGAGGAAGAAGCCCGAGGAGATTGTCGTTTGATTCGAGAGTGGACGAGAACGAGCACGAGGACGAAGAAGAGGAAGAGGAGAATTTTGACTCGCCTGTTTCGACTTTGTTCTTTGGACGTGGCGGAGGATCGAGTGATAAAGGATGCTATTGCTATCCAAAGTTGGTGAAGGTATTCTCCAAGCATTGTTCCAAATGA

Coding sequence (CDS)

ATGTCAACAAATTCAGGGGATTTTGGAGGCAAATTTGGGAAAATGAGGCTTAAAAGCCTCAAAACAAGGGATGATATATTAACGGTGGATGAAGATTACCACACAGGAGTATCAGCCACAGTGCCATTCATATGGGAGTCCGAGCCAGGCACTCCCAAGGCCAATTTCAGGGACCATGGCTCACTCCTTTCACCTCTCACACCCCCACCATCCTATTTCTCCTCACCCTCACACACACCCTCCAAGCATAAATATAACACCAACCCACATTCCTCTCGAGCCAATTTCCTCACCTCGGTTTTCAAGAAGCTTTCTATGAATAAGGCTCCTCTCCATCCGCTCTCTCCCGGCTCGTCGTCGTCGTCGACGACTTCGTCTACGACACTGACAAGCTCTGAGGGAGAGAGAGGAAGAAGCCCGAGGAGATTGTCGTTTGATTCGAGAGTGGACGAGAACGAGCACGAGGACGAAGAAGAGGAAGAGGAGAATTTTGACTCGCCTGTTTCGACTTTGTTCTTTGGACGTGGCGGAGGATCGAGTGATAAAGGATGCTATTGCTATCCAAAGTTGGTGAAGGTATTCTCCAAGCATTGTTCCAAATGA

Protein sequence

MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHGSLLSPLTPPPSYFSSPSHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSPGSSSSSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSSDKGCYCYPKLVKVFSKHCSK
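
The protein sequence can be related back to the CDS with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1); the record does not say which table the annotation pipeline used, and the names `CODON_TABLE` and `translate` are illustrative. Only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI table 1), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown above.
cds_start = "ATGTCAACAAATTCAGGGGATTTTGGAGGC"
cds_end = "CATTGTTCCAAATGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MSTNSGDFGG) and the last four residues (HCSK) of the protein sequence listed above, with the final TGA read as the stop codon.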
Homology
BLAST of Cp4.1LG02g11230 vs. NCBI nr
Match: XP_022949317.1 (uncharacterized protein LOC111452704 [Cucurbita moschata])

HSP 1 Score: 389 bits (998), Expect = 4.68e-136
Identity = 197/200 (98.50%), Positives = 198/200 (99.00%), Query Frame = 0

Query: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60
           MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG
Sbjct: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60

Query: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSPGSSS 120
           SLLSPLTPPPSYFSSPSHTPSKHKYNTNPH SRANFLTSVFKKLS+NKAPLHPLSPGSSS
Sbjct: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHPSRANFLTSVFKKLSVNKAPLHPLSPGSSS 120

Query: 121 SSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180
           SST SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS
Sbjct: 121 SST-SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180

Query: 181 DKGCYCYPKLVKVFSKHCSK 200
           DKGCYCYPKLVKVFSKHCSK
Sbjct: 181 DKGCYCYPKLVKVFSKHCSK 199
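
The Identity and Positives figures in an HSP header can be recovered from the alignment's middle line. In BLAST pairwise output a letter on the middle line marks an identical residue, `+` marks a substitution with a positive score, and a space marks a mismatch or gap. The sketch below applies that reading to the four middle-line segments of the HSP above; the function name `score_midline` is illustrative, and this is not how BLAST itself computes the values.

```python
def score_midline(midline: str) -> tuple[int, int]:
    """Return (identities, positives) counted from a BLAST alignment middle line."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# Middle lines of the HSP against XP_022949317.1, copied from the alignment above.
MIDLINES = [
    "MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG",
    "SLLSPLTPPPSYFSSPSHTPSKHKYNTNPH SRANFLTSVFKKLS+NKAPLHPLSPGSSS",
    "SST SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS",
    "DKGCYCYPKLVKVFSKHCSK",
]

identities, positives = score_midline("".join(MIDLINES))
aligned_length = sum(len(line) for line in MIDLINES)

print(f"Identity = {identities}/{aligned_length} ({100 * identities / aligned_length:.2f}%)")
print(f"Positives = {positives}/{aligned_length} ({100 * positives / aligned_length:.2f}%)")
```

The denominator is the alignment length including gap columns, which is why the subject sequence ends at position 199 while the percentages are computed over 200 columns.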

BLAST of Cp4.1LG02g11230 vs. NCBI nr
Match: KAG6607361.1 (hypothetical protein SDJN03_00703, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 386 bits (992), Expect = 3.85e-135
Identity = 196/200 (98.00%), Positives = 197/200 (98.50%), Query Frame = 0

Query: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60
           MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG
Sbjct: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60

Query: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSPGSSS 120
           SLLSPLTPPPSYFSSPSHTPSKHKYNTNPH SRANFLTSVFKKLS+NKAPLHPLSPGSSS
Sbjct: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHPSRANFLTSVFKKLSVNKAPLHPLSPGSSS 120

Query: 121 SSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180
           SST SST LTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS
Sbjct: 121 SST-SSTILTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180

Query: 181 DKGCYCYPKLVKVFSKHCSK 200
           DKGCYCYPKLVKVFSKHCSK
Sbjct: 181 DKGCYCYPKLVKVFSKHCSK 199

BLAST of Cp4.1LG02g11230 vs. NCBI nr
Match: XP_022997732.1 (uncharacterized protein LOC111492604 [Cucurbita maxima])

HSP 1 Score: 384 bits (987), Expect = 2.23e-134
Identity = 195/200 (97.50%), Positives = 197/200 (98.50%), Query Frame = 0

Query: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60
           MSTNSG+FGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG
Sbjct: 1   MSTNSGNFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60

Query: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSPGSSS 120
           SLLSPLTPPPSYFSSPSHTP KHKYNTNPH SRANFLTSVFKKLS+NKAPLHPLSPGSSS
Sbjct: 61  SLLSPLTPPPSYFSSPSHTPFKHKYNTNPHPSRANFLTSVFKKLSVNKAPLHPLSPGSSS 120

Query: 121 SSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180
           SST SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS
Sbjct: 121 SST-SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180

Query: 181 DKGCYCYPKLVKVFSKHCSK 200
           DKGCYCYPKLVKVFSKHCSK
Sbjct: 181 DKGCYCYPKLVKVFSKHCSK 199

BLAST of Cp4.1LG02g11230 vs. NCBI nr
Match: KAG7037033.1 (hypothetical protein SDJN02_00654, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 384 bits (985), Expect = 4.19e-134
Identity = 194/200 (97.00%), Positives = 196/200 (98.00%), Query Frame = 0

Query: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60
           MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG
Sbjct: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60

Query: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSPGSSS 120
           SLLSPLTPPPSYFSSPSHTPSKHKYNTNPH SRANFLTSVFKKLS+NKAPLHPLSPGSSS
Sbjct: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHPSRANFLTSVFKKLSVNKAPLHPLSPGSSS 120

Query: 121 SSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180
           SS   S+TLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS
Sbjct: 121 SS---SSTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180

Query: 181 DKGCYCYPKLVKVFSKHCSK 200
           DKGCYCYPKLVKVFSKHCSK
Sbjct: 181 DKGCYCYPKLVKVFSKHCSK 197

BLAST of Cp4.1LG02g11230 vs. NCBI nr
Match: XP_038894794.1 (uncharacterized protein LOC120083212 [Benincasa hispida] >XP_038894795.1 uncharacterized protein LOC120083212 [Benincasa hispida])

HSP 1 Score: 198 bits (504), Expect = 5.78e-61
Identity = 120/200 (60.00%), Positives = 152/200 (76.00%), Query Frame = 0

Query: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDE-DYHTGVSATVPFIWESEPGTPKANFRDH 60
           +  NS D GGK  KMRLK L+TRDD+  V+E DYHTGVSA+VPF WESEPGTPKANF ++
Sbjct: 3   LGRNSEDLGGKSRKMRLKKLRTRDDMSFVEEEDYHTGVSASVPFKWESEPGTPKANFHEN 62

Query: 61  GSLLSPLTPPPSYFSSP---SHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSP 120
           G++LSPLTPPPSYFS+    +++P  H +++ P SS++NFL SVF+KLS+ K  L P SP
Sbjct: 63  GAILSPLTPPPSYFSNDHNITNSPLTH-FSSKPISSKSNFLNSVFRKLSV-KPTLQPPSP 122

Query: 121 GSSSSSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRG 180
             S SS++SST+  SSE  R  SPRRLSFDSRVD+++++D+     N +SPVSTLFFG G
Sbjct: 123 AGSLSSSSSSTS--SSERRRSGSPRRLSFDSRVDDDDNDDDG----NVESPVSTLFFGHG 182

Query: 181 GGSSDKGCYCYPKLVKVFSK 196
              SDKGCY  PKLVKVF++
Sbjct: 183 ---SDKGCY--PKLVKVFTR 189

BLAST of Cp4.1LG02g11230 vs. ExPASy TrEMBL
Match: A0A6J1GBP3 (uncharacterized protein LOC111452704 OS=Cucurbita moschata OX=3662 GN=LOC111452704 PE=4 SV=1)

HSP 1 Score: 389 bits (998), Expect = 2.27e-136
Identity = 197/200 (98.50%), Positives = 198/200 (99.00%), Query Frame = 0

Query: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60
           MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG
Sbjct: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60

Query: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSPGSSS 120
           SLLSPLTPPPSYFSSPSHTPSKHKYNTNPH SRANFLTSVFKKLS+NKAPLHPLSPGSSS
Sbjct: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHPSRANFLTSVFKKLSVNKAPLHPLSPGSSS 120

Query: 121 SSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180
           SST SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS
Sbjct: 121 SST-SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180

Query: 181 DKGCYCYPKLVKVFSKHCSK 200
           DKGCYCYPKLVKVFSKHCSK
Sbjct: 181 DKGCYCYPKLVKVFSKHCSK 199

BLAST of Cp4.1LG02g11230 vs. ExPASy TrEMBL
Match: A0A6J1K5W9 (uncharacterized protein LOC111492604 OS=Cucurbita maxima OX=3661 GN=LOC111492604 PE=4 SV=1)

HSP 1 Score: 384 bits (987), Expect = 1.08e-134
Identity = 195/200 (97.50%), Positives = 197/200 (98.50%), Query Frame = 0

Query: 1   MSTNSGDFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60
           MSTNSG+FGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG
Sbjct: 1   MSTNSGNFGGKFGKMRLKSLKTRDDILTVDEDYHTGVSATVPFIWESEPGTPKANFRDHG 60

Query: 61  SLLSPLTPPPSYFSSPSHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSPGSSS 120
           SLLSPLTPPPSYFSSPSHTP KHKYNTNPH SRANFLTSVFKKLS+NKAPLHPLSPGSSS
Sbjct: 61  SLLSPLTPPPSYFSSPSHTPFKHKYNTNPHPSRANFLTSVFKKLSVNKAPLHPLSPGSSS 120

Query: 121 SSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180
           SST SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS
Sbjct: 121 SST-SSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSS 180

Query: 181 DKGCYCYPKLVKVFSKHCSK 200
           DKGCYCYPKLVKVFSKHCSK
Sbjct: 181 DKGCYCYPKLVKVFSKHCSK 199

BLAST of Cp4.1LG02g11230 vs. ExPASy TrEMBL
Match: A0A5D3BF70 (NADPH oxidase activator OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G00550 PE=4 SV=1)

HSP 1 Score: 179 bits (455), Expect = 8.75e-54
Identity = 118/203 (58.13%), Positives = 146/203 (71.92%), Query Frame = 0

Query: 4   NSGDFGG-KFGKMRLKSLKTRDD---ILTVDEDYHTGVSATVPFIWESEPGTPKANFRDH 63
           NSG+ GG K GKMRLK L+TRDD   I+  +EDYHTG+SA+VPF WESEPGTPKAN  D+
Sbjct: 7   NSGELGGGKSGKMRLKKLRTRDDMSRIMVEEEDYHTGLSASVPFKWESEPGTPKANLHDN 66

Query: 64  --GSLLSPLTPPPSYFSSP---SHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPL 123
             GSLLSPLTPPPSYFS+    + +P  H  ++ P  ++ + L +VF+ LS+ K  L P 
Sbjct: 67  NNGSLLSPLTPPPSYFSNHLIINSSPIIH-LSSKPSFNKPSCLNTVFRMLSV-KPTLQPP 126

Query: 124 SPGSSSSSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFG 183
           SP SSSSS   S T+   E  R  SPRRLSFDSRVD+++ ED+E E+ N +SPVSTLFFG
Sbjct: 127 SPASSSSS---SPTM---ERRRSGSPRRLSFDSRVDDDD-EDDENEDGNVESPVSTLFFG 186

Query: 184 RGGGSSDKGCYCYPKLVKVFSKH 197
           RG   S+KGCY  P LVKVF++H
Sbjct: 187 RG---SEKGCY--PNLVKVFTRH 195

BLAST of Cp4.1LG02g11230 vs. ExPASy TrEMBL
Match: A0A0A0LYC9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G533640 PE=4 SV=1)

HSP 1 Score: 179 bits (455), Expect = 8.75e-54
Identity = 115/203 (56.65%), Positives = 146/203 (71.92%), Query Frame = 0

Query: 4   NSGDFG-GKFGKMRLKSLKTRDDILTV---DEDYHTGVSATVPFIWESEPGTPKANFRDH 63
           NSG+ G GK GKMRLK L+ R+D+ T+   +EDYHTG+SA+VPF WESEPGTPKAN  D 
Sbjct: 7   NSGELGSGKSGKMRLKKLRKREDMSTIMVEEEDYHTGLSASVPFQWESEPGTPKANLNDR 66

Query: 64  GS--LLSPLTPPPSYFSSP---SHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPL 123
            S  LLSPLTPPPSYFS+    + +P  H  ++ P  ++ +FL +VF+KLS+  + L P 
Sbjct: 67  NSRSLLSPLTPPPSYFSNHLIINSSPMIH-LSSKPSFNKPSFLNTVFRKLSVKPSTLQPP 126

Query: 124 SPGSSSSSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFG 183
           SP SSSSS   S+T+   E  R  SPRRLSFDSRVD+++  D+E E+ N +SPVSTLFFG
Sbjct: 127 SPASSSSS---SSTM---ERRRYASPRRLSFDSRVDDDD--DDENEDGNLESPVSTLFFG 186

Query: 184 RGGGSSDKGCYCYPKLVKVFSKH 197
           R    SDKGCY  PKLVKVF+++
Sbjct: 187 R---RSDKGCY--PKLVKVFTRY 195

BLAST of Cp4.1LG02g11230 vs. ExPASy TrEMBL
Match: A0A1S3C5A7 (uncharacterized protein At4g00950 OS=Cucumis melo OX=3656 GN=LOC103497034 PE=4 SV=1)

HSP 1 Score: 179 bits (455), Expect = 8.75e-54
Identity = 118/203 (58.13%), Positives = 146/203 (71.92%), Query Frame = 0

Query: 4   NSGDFGG-KFGKMRLKSLKTRDD---ILTVDEDYHTGVSATVPFIWESEPGTPKANFRDH 63
           NSG+ GG K GKMRLK L+TRDD   I+  +EDYHTG+SA+VPF WESEPGTPKAN  D+
Sbjct: 7   NSGELGGGKSGKMRLKKLRTRDDMSRIMVEEEDYHTGLSASVPFKWESEPGTPKANLHDN 66

Query: 64  --GSLLSPLTPPPSYFSSP---SHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPL 123
             GSLLSPLTPPPSYFS+    + +P  H  ++ P  ++ + L +VF+ LS+ K  L P 
Sbjct: 67  NNGSLLSPLTPPPSYFSNHLIINSSPIIH-LSSKPSFNKPSCLNTVFRMLSV-KPTLQPP 126

Query: 124 SPGSSSSSTTSSTTLTSSEGERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFG 183
           SP SSSSS   S T+   E  R  SPRRLSFDSRVD+++ ED+E E+ N +SPVSTLFFG
Sbjct: 127 SPASSSSS---SPTM---ERRRSGSPRRLSFDSRVDDDD-EDDENEDGNVESPVSTLFFG 186

Query: 184 RGGGSSDKGCYCYPKLVKVFSKH 197
           RG   S+KGCY  P LVKVF++H
Sbjct: 187 RG---SEKGCY--PNLVKVFTRH 195

BLAST of Cp4.1LG02g11230 vs. TAIR 10
Match: AT1G06930.1 (unknown protein; Has 4478 Blast hits to 355 proteins in 82 species: Archae - 0; Bacteria - 29; Metazoa - 379; Fungi - 125; Plants - 120; Viruses - 25; Other Eukaryotes - 3800 (source: NCBI BLink). )

HSP 1 Score: 63.2 bits (152), Expect = 2.8e-10
Identity = 63/185 (34.05%), Positives = 87/185 (47.03%), Query Frame = 0

Query: 27  LTVDEDYHTGVSATVPFIWESEPGTPKANFRDHGS------------LLSPLTPPPSYF- 86
           + V+ DY+ G SA VPF WES+PGTP+   +   S            + +PLTPPPSYF 
Sbjct: 18  VAVEGDYYGGSSAAVPFKWESQPGTPRRLSKRSSSSGFNSDSDFNFPVSAPLTPPPSYFY 77

Query: 87  SSPSHTPSKHKYNTNPHSSRANFLTSVFKKLSMNKAPLHPLSPGSSSSSTTSSTTLTSSE 146
           +SPS T  KH    +P  +   F + + K  S+   P  P S  SSSSS+  S+ L +S+
Sbjct: 78  ASPSST--KH---VSPKKTNTLFSSLLSKNRSV---PSSPASSSSSSSSSVPSSPLRTSD 137

Query: 147 GERGRSPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGGGSSDKGCYCYPKLVKV 199
               R  R + F+S                     S+L +   G +S K   CY  LVKV
Sbjct: 138 LSNRR--RSMWFESG--------------------SSLDY---GSNSAKSSGCYASLVKV 169

BLAST of Cp4.1LG02g11230 vs. TAIR 10
Match: AT2G40475.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G56260.2); Has 477 Blast hits to 219 proteins in 41 species: Archae - 0; Bacteria - 4; Metazoa - 91; Fungi - 61; Plants - 144; Viruses - 0; Other Eukaryotes - 177 (source: NCBI BLink). )

HSP 1 Score: 58.9 bits (141), Expect = 5.3e-09
Identity = 54/148 (36.49%), Positives = 75/148 (50.68%), Query Frame = 0

Query: 33  YHTGVSATVPFIWESEPGTPKANFRDHGSLLSPLTPPPSYFSSPSHTPSKHKYNTNPHSS 92
           Y+ G  A+VPF+WE+ PGTPK         L PLTPPPSY+SS S + +K         +
Sbjct: 32  YYYG-GASVPFLWETRPGTPKHALFSESLRLPPLTPPPSYYSSSSSSGNKLS------KA 91

Query: 93  RANFLTSVFKKLSMNKAPLHPLSPGSSSSSTTSSTTLTSSEGERGRSPRRLSFDSRVDEN 152
           R    T   K L          S  S+SSS++SS + +S   +    PR+    SR   +
Sbjct: 92  RTIKQTRFVKTLLSRHVSRPSFSWSSASSSSSSSYSSSSPPSKVEHRPRKCYSCSR---S 151

Query: 153 EHEDEEEEEENFDSPVSTLFFGRGGGSS 181
             ++++EEE    SP STL + RG  SS
Sbjct: 152 YVKEDDEEEIVSSSPTSTLCYKRGFSSS 169

BLAST of Cp4.1LG02g11230 vs. TAIR 10
Match: AT3G56260.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 8 plant structures; EXPRESSED DURING: F mature embryo stage, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G40475.1). )

HSP 1 Score: 56.6 bits (135), Expect = 2.6e-08
Identity = 59/159 (37.11%), Positives = 82/159 (51.57%), Query Frame = 0

Query: 24  DDILTVDED---YHTGVSATVPFIWESEPGTPKANFRDHGSLLSPLTPPPSYFSSPSHTP 83
           +D   +DE+   Y+ G  A++PFIWES PGTPK +     SL  PLTPPPSY+S  S   
Sbjct: 6   EDDFAIDEERSSYYGG--ASIPFIWESRPGTPKNHLCSDSSLPLPLTPPPSYYS--SGIL 65

Query: 84  SKHKYNTNPHSSRANFLT-SVFKKL-SMNKAPLHPLSPGSSSSSTTSSTTLTSSEGERGR 143
           S  +  +   S  + FL+ S+F  L   N       S   S SST+SS++ +SS     +
Sbjct: 66  STPRTQSKVRSKLSKFLSFSMFSDLRRSNHGSKKTASSSFSWSSTSSSSSFSSSPPHSLK 125

Query: 144 SPRRLSFDSRVDENEHEDEEEEEENFDSPVSTLFFGRGG 178
             +R+S D +        EE+E  +  SP STL    GG
Sbjct: 126 --KRMSHDKKSPLLYANYEEDELRS--SPTSTLCCSNGG 156

BLAST of Cp4.1LG02g11230 vs. TAIR 10
Match: AT5G01790.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 13 growth stages; Has 121 Blast hits to 121 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 48.1 bits (113), Expect = 9.4e-06
Identity = 23/42 (54.76%), Positives = 28/42 (66.67%), Query Frame = 0

Query: 33  YHTGVSATVPFIWESEPGTPKANFRDHGSLLSPLTPPPSYFS 75
           Y+ G +  VPF WES PGTPK    +  + L PLTPPPS+FS
Sbjct: 67  YYDGAAGAVPFEWESHPGTPKHPSSELPT-LPPLTPPPSHFS 107

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
XP_022949317.1  4.68e-136  98.50     uncharacterized protein LOC111452704 [Cucurbita moschata]
KAG6607361.1    3.85e-135  98.00     hypothetical protein SDJN03_00703, partial [Cucurbita argyrosperma subsp. sorori...
XP_022997732.1  2.23e-134  97.50     uncharacterized protein LOC111492604 [Cucurbita maxima]
KAG7037033.1    4.19e-134  97.00     hypothetical protein SDJN02_00654, partial [Cucurbita argyrosperma subsp. argyro...
XP_038894794.1  5.78e-61   60.00     uncharacterized protein LOC120083212 [Benincasa hispida] >XP_038894795.1 unchara...

Match Name      E-value    Identity  Description
A0A6J1GBP3      2.27e-136  98.50     uncharacterized protein LOC111452704 OS=Cucurbita moschata OX=3662 GN=LOC1114527...
A0A6J1K5W9      1.08e-134  97.50     uncharacterized protein LOC111492604 OS=Cucurbita maxima OX=3661 GN=LOC111492604...
A0A5D3BF70      8.75e-54   58.13     NADPH oxidase activator OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0LYC9      8.75e-54   56.65     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G533640 PE=4 SV=1
A0A1S3C5A7      8.75e-54   58.13     uncharacterized protein At4g00950 OS=Cucumis melo OX=3656 GN=LOC103497034 PE=4 S...

Match Name      E-value    Identity  Description
AT1G06930.1     2.8e-10    34.05     unknown protein; Has 4478 Blast hits to 355 proteins in 82 species: Archae - 0; ...
AT2G40475.1     5.3e-09    36.49     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G56260.2     2.6e-08    37.11     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G01790.1     9.4e-06    54.76     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source       Source Term     Source Description    Alignment
IPR007789  Protein of unknown function DUF688  PFAM         PF05097         DUF688                coord: 39..166; e-value: 8.4E-6; score: 25.2
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 48..91
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 135..150
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 76..91
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 151..165
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 112..134
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 108..172
None       No IPR available                    PANTHER      PTHR33257:SF24  EXPRESSED PROTEIN     coord: 33..185
None       No IPR available                    PANTHER      PTHR33257       OS05G0165500 PROTEIN  coord: 33..185
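
Several of the mobidb-lite coordinate ranges listed above overlap or abut one another. A minimal sketch for collapsing them into non-redundant regions follows; it assumes the coordinates are 1-based, inclusive protein positions, and the helper name `merge_intervals` is illustrative rather than part of any InterPro tooling.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # Extend the previous region if this one overlaps it or starts right after it.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# mobidb-lite coordinates from the InterPro table above.
disorder = [(48, 91), (135, 150), (76, 91), (151, 165), (112, 134), (108, 172)]

regions = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in regions)

print(regions)
print(f"{covered} residues in predicted disordered regions")
```

Under these assumptions the six entries collapse to two regions, 48..91 and 108..172, covering 109 of the 200 residues.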

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g11230.1  Cp4.1LG02g11230.1  mRNA