CsGy3G021122 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy3G021122
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: ULP_PROTEASE domain-containing protein
Location: Gy14Chr3: 19008267 .. 19009725 (-)
RNA-Seq Expression: CsGy3G021122
Synteny: CsGy3G021122
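
The location field above can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, as is conventional for GFF-style annotations; this page does not state the convention, so treat that as an assumption. `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Gy14Chr3: 19008267 .. 19009725 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 1,459 bp on the minus strand of Gy14Chr3.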
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTGCGTTTGAAACTAAAGATACTATCATTGCACATGGGACAATATTTGATGCTGAAGGTGACGGTGAGAACATCAAGGTATTAGTGGATGTTGTCCTCGATGGCCAATGTGTGATTCTGAAACCGAAAAAGGAAGGAGTCACCAAACTGACGCATGAAGTCGGATCACATTTAATGTGGCCACGACATCTTGTTCTTACTAGGAATGATAAGGTAATTGTCTGTCTTATTGTATTGGAGTTTTTATGCATCAAATTATACTAACAGTAGTTGAACTTTTTGTGATAGAAGGAAACTGTGGGCTTCAATACGGATCTGTCCACATTCTCTAGTGCAACATTTCTACGTGCTCCAGTGGTGCTCTGGTGTTTGGTTAGACTCGCTGAACATATGGGGTCATCGATTCAACTAAACACGCCTTCAGAGGTATTCAGTGTGAAGAGAAAATGTTGCATTATGGTTGAGTCACTGAGGGATTTCTCGTCAATGCAACCAATATGCACATCATGTCTAGATGCTTACATGATGTAAGATTTCCTTTTAATATATATACGTCATTGATGTTTGCGGTCTCCTTTTAATTGTAAGTCCATTTCATTGTGTAGGTACCTTCACACAATTATGCTAGAAGGACGAAGTTTAAGCTTGTTCAAGTTCATGGATGTCAGATCTGTAAGTTACTCGAGTTACAAACAATCGCGTCTGCAGTTGTTGAATGCTCGATTTCTCGGAGCCGAGTATGACCAAGTAGTATTGTTCCCATACAACTCTGGGTAAGCGCATGATAATGACATTTTCATTAACATTCACAATTTGAATTCACTATTAAAATATTAATTTATTATAATGGTGGAAATGTGTTCTTAGCAATCACTGGACATTGGTCGTTGTTAATCCTACAAAGGGTGCTGCATATTGGATTGACCCGTTGAAGAATCAGATTGACGGAGACATGAGTGAAGTGCTTCAAATGTAGGACATATGATTGTCGTTAATGTCTTTATTATGTGTGCAACTATTTTAATTATGTAACAATCTTATATTATGTGGTAGGTCATTCAATATATCAAAGAAGAAAAAACCAAATTGGAAGGTTGTTAAGGTATGTGGCATTGACCAGTAGGATAGTCGTAAATAAGTTATTAATTTAAACGTTTCACACGTATTTTTTTTATTCTACATCCAGTATCCCAAACAAAATGGGGTGGTAGAATGCGAGTACTATGGTATGCGATTCATGCGGGACATAATTTCTGCGATGAGTACTTCGATTGTAGATGTCGTAAGTATTAGAATTTTAAAACCAATTTATGTCATTGTTTGGAATGAAAATAATTCTTAACCACTTTATTATCTACAGATGAAAAGTTTACCTCCTACGTACTCTCAAGATGAAATTGATGAAGTTAGATCAGAATTGGCAGAGTTCCTTTCTAAGCACGTACATCGTGCTTAG

mRNA sequence

ATGCTTGCGTTTGAAACTAAAGATACTATCATTGCACATGGGACAATATTTGATGCTGAAGGTGACGGTGAGAACATCAAGGTATTAGTGGATGTTGTCCTCGATGGCCAATGTGTGATTCTGAAACCGAAAAAGGAAGGAGTCACCAAACTGACGCATGAAGTCGGATCACATTTAATGTGGCCACGACATCTTGTTCTTACTAGGAATGATAAGGAAACTGTGGGCTTCAATACGGATCTGTCCACATTCTCTAGTGCAACATTTCTACGTGCTCCAGTGGTGCTCTGGTGTTTGGTTAGACTCGCTGAACATATGGGGTCATCGATTCAACTAAACACGCCTTCAGAGGTATTCAGTGTGAAGAGAAAATGTTGCATTATGGTTGAGTCACTGAGGGATTTCTCGTCAATGCAACCAATATGCACATCATGTCTAGATGCTTACATGATGTACCTTCACACAATTATGCTAGAAGGACGAAGTTTAAGCTTGTTCAAGTTCATGGATGTCAGATCTGTAAGTTACTCGAGTTACAAACAATCGCGTCTGCAGTTGTTGAATGCTCGATTTCTCGGAGCCGAGTATGACCAAGTAGTATTGTTCCCATACAACTCTGGCAATCACTGGACATTGGTCGTTGTTAATCCTACAAAGGGTGCTGCATATTGGATTGACCCGTTGAAGAATCAGATTGACGGAGACATGAGTGAAGTGCTTCAAATGTCATTCAATATATCAAAGAAGAAAAAACCAAATTGGAAGGTTGTTAAGTATCCCAAACAAAATGGGGTGGTAGAATGCGAGTACTATGGTATGCGATTCATGCGGGACATAATTTCTGCGATGAGTACTTCGATTGTAGATGTCATGAAAAGTTTACCTCCTACGTACTCTCAAGATGAAATTGATGAAGTTAGATCAGAATTGGCAGAGTTCCTTTCTAAGCACGTACATCGTGCTTAG

Coding sequence (CDS)

ATGCTTGCGTTTGAAACTAAAGATACTATCATTGCACATGGGACAATATTTGATGCTGAAGGTGACGGTGAGAACATCAAGGTATTAGTGGATGTTGTCCTCGATGGCCAATGTGTGATTCTGAAACCGAAAAAGGAAGGAGTCACCAAACTGACGCATGAAGTCGGATCACATTTAATGTGGCCACGACATCTTGTTCTTACTAGGAATGATAAGGAAACTGTGGGCTTCAATACGGATCTGTCCACATTCTCTAGTGCAACATTTCTACGTGCTCCAGTGGTGCTCTGGTGTTTGGTTAGACTCGCTGAACATATGGGGTCATCGATTCAACTAAACACGCCTTCAGAGGTATTCAGTGTGAAGAGAAAATGTTGCATTATGGTTGAGTCACTGAGGGATTTCTCGTCAATGCAACCAATATGCACATCATGTCTAGATGCTTACATGATGTACCTTCACACAATTATGCTAGAAGGACGAAGTTTAAGCTTGTTCAAGTTCATGGATGTCAGATCTGTAAGTTACTCGAGTTACAAACAATCGCGTCTGCAGTTGTTGAATGCTCGATTTCTCGGAGCCGAGTATGACCAAGTAGTATTGTTCCCATACAACTCTGGCAATCACTGGACATTGGTCGTTGTTAATCCTACAAAGGGTGCTGCATATTGGATTGACCCGTTGAAGAATCAGATTGACGGAGACATGAGTGAAGTGCTTCAAATGTCATTCAATATATCAAAGAAGAAAAAACCAAATTGGAAGGTTGTTAAGTATCCCAAACAAAATGGGGTGGTAGAATGCGAGTACTATGGTATGCGATTCATGCGGGACATAATTTCTGCGATGAGTACTTCGATTGTAGATGTCATGAAAAGTTTACCTCCTACGTACTCTCAAGATGAAATTGATGAAGTTAGATCAGAATTGGCAGAGTTCCTTTCTAAGCACGTACATCGTGCTTAG

Protein sequence

MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLMWPRHLVLTRNDKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFSVKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYKQSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVLQMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYSQDEIDEVRSELAEFLSKHVHRA*
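
The protein sequence should follow from the CDS by codon-wise translation. The sketch below assumes the standard nuclear genetic code and checks only the first and last few codons of the CDS shown above; `translate` is a hypothetical helper written for illustration, and a library routine such as Biopython's could be used instead.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First nine codons and last four codons of the CDS listed above.
cds_start = "ATGCTTGCGTTTGAAACTAAAGATACT"
cds_end = "CATCGTGCTTAG"
print(translate(cds_start))
print(translate(cds_end))
```

These fragments translate to the beginning (MLAFETKDT) and end (HRA*) of the protein sequence above; passing the full CDS string to `translate` would be the complete check.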
Homology
BLAST of CsGy3G021122 vs. NCBI nr
Match: XP_031739281.1 (uncharacterized protein LOC105434889 isoform X2 [Cucumis sativus])

HSP 1 Score: 621 bits (1601), Expect = 6.76e-224
Identity = 310/321 (96.57%), Positives = 312/321 (97.20%), Query Frame = 0

Query: 1   MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM 60
           MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM
Sbjct: 1   MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM 60

Query: 61  WPRHLVLTRNDKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 120
           WPRHLVLTRNDKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS
Sbjct: 61  WPRHLVLTRNDKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 120

Query: 121 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYK 180
           VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIM++GRS SLFKFMD  SVSYSSYK
Sbjct: 121 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMVQGRSSSLFKFMDAGSVSYSSYK 180

Query: 181 QSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 240
           QSR QLLNAR LGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL
Sbjct: 181 QSRAQLLNARLLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 240

Query: 241 QMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYSQ 300
           QMSFNISKKKKPNWKVVK PKQNGVVEC YY MRFMRDIISA STSIVDVMKSLPPTYSQ
Sbjct: 241 QMSFNISKKKKPNWKVVKCPKQNGVVECRYYVMRFMRDIISARSTSIVDVMKSLPPTYSQ 300

Query: 301 DEIDEVRSELAEFLSKHVHRA 321
           DEIDEVRSELAEFLSKHVHRA
Sbjct: 301 DEIDEVRSELAEFLSKHVHRA 321
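
The percentages on each HSP statistics line are the matched columns divided by the alignment length. As a sanity check, the sketch below re-derives them from the fractions; `check_stats` is a hypothetical helper, and the pattern tolerates both spellings of the positives label that appear in BLAST-derived pages.

```python
import re

# Matches e.g. "Identity = 310/321 (96.57%)"; "Posi?tives" accepts either spelling.
STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def check_stats(line):
    """Return {field: (recomputed_percent, reported_percent)} for one stats line."""
    result = {}
    for name, matched, length, reported in STATS.findall(line):
        recomputed = round(100 * int(matched) / int(length), 2)
        result[name] = (recomputed, float(reported))
    return result

line = "Identity = 310/321 (96.57%), Positives = 312/321 (97.20%), Query Frame = 0"
stats = check_stats(line)
for field, (recomputed, reported) in stats.items():
    print(f"{field}: recomputed {recomputed:.2f}%, reported {reported:.2f}%")
```

For the top hit, both recomputed values agree with the reported ones to two decimal places.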

BLAST of CsGy3G021122 vs. NCBI nr
Match: XP_031739269.1 (uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739270.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739271.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739272.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739273.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739274.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739275.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739276.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739277.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739278.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739279.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739280.1 uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus])

HSP 1 Score: 616 bits (1589), Expect = 4.74e-222
Identity = 310/322 (96.27%), Positives = 312/322 (96.89%), Query Frame = 0

Query: 1   MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM 60
           MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM
Sbjct: 1   MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM 60

Query: 61  WPRHLVLTRNDK-ETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVF 120
           WPRHLVLTRNDK ETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVF
Sbjct: 61  WPRHLVLTRNDKKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVF 120

Query: 121 SVKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSY 180
           SVKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIM++GRS SLFKFMD  SVSYSSY
Sbjct: 121 SVKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMVQGRSSSLFKFMDAGSVSYSSY 180

Query: 181 KQSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEV 240
           KQSR QLLNAR LGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEV
Sbjct: 181 KQSRAQLLNARLLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEV 240

Query: 241 LQMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYS 300
           LQMSFNISKKKKPNWKVVK PKQNGVVEC YY MRFMRDIISA STSIVDVMKSLPPTYS
Sbjct: 241 LQMSFNISKKKKPNWKVVKCPKQNGVVECRYYVMRFMRDIISARSTSIVDVMKSLPPTYS 300

Query: 301 QDEIDEVRSELAEFLSKHVHRA 321
           QDEIDEVRSELAEFLSKHVHRA
Sbjct: 301 QDEIDEVRSELAEFLSKHVHRA 322

BLAST of CsGy3G021122 vs. NCBI nr
Match: XP_031738710.1 (uncharacterized protein LOC116402757 isoform X2 [Cucumis sativus])

HSP 1 Score: 568 bits (1465), Expect = 8.66e-202
Identity = 286/321 (89.10%), Positives = 296/321 (92.21%), Query Frame = 0

Query: 1   MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM 60
           MLAFET+DTIIAHGTIFDAEGDGENIKV VDVVLDG CVI    KEGVTKLTHEVGSHLM
Sbjct: 88  MLAFETEDTIIAHGTIFDAEGDGENIKVSVDVVLDGDCVIPNQTKEGVTKLTHEVGSHLM 147

Query: 61  WPRHLVLTRNDKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 120
           WPRHLVLTRND ETVGFNTD STFSSA +LRAPV L CLVRL EHMGSSIQLNTP E+F 
Sbjct: 148 WPRHLVLTRNDMETVGFNTDPSTFSSAAYLRAPVALRCLVRLVEHMGSSIQLNTPLELFG 207

Query: 121 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYK 180
           VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIM++GRS SLFKFMD  SVSYSSYK
Sbjct: 208 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMVQGRSSSLFKFMDAGSVSYSSYK 267

Query: 181 QSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 240
           QSR QLLNAR LGAEYDQVVLFPYNS NHWTLVVVNPTKGAAYWIDPLKN+IDGDMSEVL
Sbjct: 268 QSRAQLLNARLLGAEYDQVVLFPYNSVNHWTLVVVNPTKGAAYWIDPLKNRIDGDMSEVL 327

Query: 241 QMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYSQ 300
           QMSF+ISKKKKP+WKVVK PKQNGVVEC YY MRFMRDIISA STSIVDV+KSLPPTY Q
Sbjct: 328 QMSFDISKKKKPSWKVVKCPKQNGVVECGYYVMRFMRDIISARSTSIVDVIKSLPPTYCQ 387

Query: 301 DEIDEVRSELAEFLSKHVHRA 321
           DEI+EVRSELAEFLSKHVHRA
Sbjct: 388 DEINEVRSELAEFLSKHVHRA 408

BLAST of CsGy3G021122 vs. NCBI nr
Match: XP_031738701.1 (uncharacterized protein LOC116402757 isoform X1 [Cucumis sativus] >XP_031738704.1 uncharacterized protein LOC116402757 isoform X1 [Cucumis sativus])

HSP 1 Score: 566 bits (1459), Expect = 7.35e-201
Identity = 287/322 (89.13%), Positives = 297/322 (92.24%), Query Frame = 0

Query: 1   MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM 60
           MLAFET+DTIIAHGTIFDAEGDGENIKV VDVVLDG CVI    KEGVTKLTHEVGSHLM
Sbjct: 88  MLAFETEDTIIAHGTIFDAEGDGENIKVSVDVVLDGDCVIPNQTKEGVTKLTHEVGSHLM 147

Query: 61  WPRHLVLTRND-KETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVF 120
           WPRHLVLTRND KETVGFNTD STFSSA +LRAPV L CLVRL EHMGSSIQLNTP E+F
Sbjct: 148 WPRHLVLTRNDMKETVGFNTDPSTFSSAAYLRAPVALRCLVRLVEHMGSSIQLNTPLELF 207

Query: 121 SVKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSY 180
            VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIM++GRS SLFKFMD  SVSYSSY
Sbjct: 208 GVKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMVQGRSSSLFKFMDAGSVSYSSY 267

Query: 181 KQSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEV 240
           KQSR QLLNAR LGAEYDQVVLFPYNS NHWTLVVVNPTKGAAYWIDPLKN+IDGDMSEV
Sbjct: 268 KQSRAQLLNARLLGAEYDQVVLFPYNSVNHWTLVVVNPTKGAAYWIDPLKNRIDGDMSEV 327

Query: 241 LQMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYS 300
           LQMSF+ISKKKKP+WKVVK PKQNGVVEC YY MRFMRDIISA STSIVDV+KSLPPTY 
Sbjct: 328 LQMSFDISKKKKPSWKVVKCPKQNGVVECGYYVMRFMRDIISARSTSIVDVIKSLPPTYC 387

Query: 301 QDEIDEVRSELAEFLSKHVHRA 321
           QDEI+EVRSELAEFLSKHVHRA
Sbjct: 388 QDEINEVRSELAEFLSKHVHRA 409

BLAST of CsGy3G021122 vs. NCBI nr
Match: KAE8650633.1 (hypothetical protein Csa_011794, partial [Cucumis sativus])

HSP 1 Score: 507 bits (1305), Expect = 1.02e-179
Identity = 251/258 (97.29%), Positives = 253/258 (98.06%), Query Frame = 0

Query: 1   MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM 60
           MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM
Sbjct: 6   MLAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLM 65

Query: 61  WPRHLVLTRNDKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 120
           WPRHLVLTRNDKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS
Sbjct: 66  WPRHLVLTRNDKETVGFNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 125

Query: 121 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYK 180
           VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIM++GRS SLFKFMD  SVSYSSYK
Sbjct: 126 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMVQGRSSSLFKFMDAGSVSYSSYK 185

Query: 181 QSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 240
           QSR QLLNAR LGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL
Sbjct: 186 QSRAQLLNARLLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 245

Query: 241 QMSFNISKKKKPNWKVVK 258
           QMSFNISKKKKPNWKVVK
Sbjct: 246 QMSFNISKKKKPNWKVVK 263

BLAST of CsGy3G021122 vs. ExPASy TrEMBL
Match: A0A5D3D5Q6 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold204G00960 PE=3 SV=1)

HSP 1 Score: 406 bits (1043), Expect = 2.09e-133
Identity = 193/320 (60.31%), Positives = 252/320 (78.75%), Query Frame = 0

Query: 2   LAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLMW 61
           LAFETKD ++A GTI D++ +G+N+KV +DVV+DG C I  P ++G+ K++ EVGSH++W
Sbjct: 399 LAFETKDHVVAWGTIIDSDAEGDNVKVAIDVVVDGDCAIPIPSEQGMYKMSQEVGSHILW 458

Query: 62  PRHLVLTRNDKETVG-FNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 121
           PR LV+T N K   G F  D+STF+      APV L  L+R+ EHMGS+IQ+ TP +VF 
Sbjct: 459 PRDLVITNNIKMDYGEFTKDMSTFAPTPIQNAPVALRFLLRMVEHMGSAIQITTPHDVFG 518

Query: 122 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYK 181
           V+RKCCIM+ESL+DF+SM+PI T+CLDAY+MYL+T M   R+L+L+KF+D  S+S  S K
Sbjct: 519 VRRKCCIMIESLKDFTSMRPIATACLDAYIMYLYTRMESTRTLNLYKFLDAGSISCGSSK 578

Query: 182 QSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 241
           + R+QLL AR LG +YDQ++LFPYNSGNHWTLVV+N TKGAA+WIDPLKN+ID D++EV+
Sbjct: 579 EERVQLLTARLLGTDYDQLLLFPYNSGNHWTLVVINLTKGAAFWIDPLKNRIDPDVTEVV 638

Query: 242 QMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYSQ 301
           + SFNI  KKKP W+VVK PKQ+GVVEC YY MRFMRDII + STSI+ +MK  P  Y+Q
Sbjct: 639 ERSFNIMNKKKPAWRVVKCPKQSGVVECGYYVMRFMRDIIMSTSTSIIQIMKDSPRAYTQ 698

Query: 302 DEIDEVRSELAEFLSKHVHR 320
           D+ID +RSE AEF+ KHV +
Sbjct: 699 DDIDCIRSEWAEFVGKHVDK 718

BLAST of CsGy3G021122 vs. ExPASy TrEMBL
Match: A0A5D3CDJ5 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold654G00340 PE=3 SV=1)

HSP 1 Score: 406 bits (1043), Expect = 2.09e-133
Identity = 193/320 (60.31%), Positives = 252/320 (78.75%), Query Frame = 0

Query: 2   LAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLMW 61
           LAFETKD ++A GTI D++ +G+N+KV +DVV+DG C I  P ++G+ K++ EVGSH++W
Sbjct: 399 LAFETKDHVVAWGTIIDSDAEGDNVKVAIDVVVDGDCAIPIPSEQGMYKMSQEVGSHILW 458

Query: 62  PRHLVLTRNDKETVG-FNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 121
           PR LV+T N K   G F  D+STF+      APV L  L+R+ EHMGS+IQ+ TP +VF 
Sbjct: 459 PRDLVITNNIKMDYGEFTKDMSTFAPTPIQNAPVALRFLLRMVEHMGSAIQITTPHDVFG 518

Query: 122 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYK 181
           V+RKCCIM+ESL+DF+SM+PI T+CLDAY+MYL+T M   R+L+L+KF+D  S+S  S K
Sbjct: 519 VRRKCCIMIESLKDFTSMRPIATACLDAYIMYLYTRMESTRTLNLYKFLDAGSISCGSSK 578

Query: 182 QSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 241
           + R+QLL AR LG +YDQ++LFPYNSGNHWTLVV+N TKGAA+WIDPLKN+ID D++EV+
Sbjct: 579 EERVQLLTARLLGTDYDQLLLFPYNSGNHWTLVVINLTKGAAFWIDPLKNRIDPDVTEVV 638

Query: 242 QMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYSQ 301
           + SFNI  KKKP W+VVK PKQ+GVVEC YY MRFMRDII + STSI+ +MK  P  Y+Q
Sbjct: 639 ERSFNIMNKKKPAWRVVKCPKQSGVVECGYYVMRFMRDIIMSTSTSIIQIMKDSPRAYTQ 698

Query: 302 DEIDEVRSELAEFLSKHVHR 320
           D+ID +RSE AEF+ KHV +
Sbjct: 699 DDIDCIRSEWAEFVGKHVDK 718

BLAST of CsGy3G021122 vs. ExPASy TrEMBL
Match: A0A5A7U7F1 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold986G00680 PE=3 SV=1)

HSP 1 Score: 390 bits (1001), Expect = 2.55e-126
Identity = 188/321 (58.57%), Positives = 245/321 (76.32%), Query Frame = 0

Query: 2   LAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLMW 61
           LAFE KD ++A  TI D++ +G+N+KV V VV+DG C I  P ++G+ K++ EVGSH++W
Sbjct: 502 LAFEMKDHVVAWETIIDSDVEGDNVKVAVYVVVDGDCSIPIPSEQGIYKISQEVGSHILW 561

Query: 62  PRHLVLTRNDKETVG-FNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 121
           PR LV+T N K   G F  D+STF+      APV L  L+R+ EHMGS+IQ+ TP +VF 
Sbjct: 562 PRDLVITNNIKMDYGEFTKDMSTFTPTPIQNAPVALRFLLRMVEHMGSTIQITTPYDVFG 621

Query: 122 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYK 181
           V+RKCCIM+ESL+DF+SMQPI T+CLDAY+MYL+T M   R+L+L+KF+D  S+SY S K
Sbjct: 622 VRRKCCIMIESLKDFTSMQPIATACLDAYIMYLYTRMESSRTLNLYKFVDAGSISYGSSK 681

Query: 182 QSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 241
           + R QLL AR LG +YDQ++L PYNSGN WTLVV+N TKGAA+WIDPLKN++D D++EV+
Sbjct: 682 EERAQLLTARLLGIDYDQLLLIPYNSGNQWTLVVINLTKGAAFWIDPLKNRMDPDVTEVV 741

Query: 242 QMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYSQ 301
           + SFN+  KKKPNW+VVK PKQ+GVVEC YY MRFM DII + STSI+ +MK  P  Y+Q
Sbjct: 742 ERSFNLMNKKKPNWRVVKCPKQSGVVECGYYVMRFMGDIIMSTSTSIIQIMKDSPRAYTQ 801

Query: 302 DEIDEVRSELAEFLSKHVHRA 321
            +ID +RSE  EF+ KH H A
Sbjct: 802 YDIDCIRSEWTEFVGKHAHCA 822

BLAST of CsGy3G021122 vs. ExPASy TrEMBL
Match: A0A5A7UJF8 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold323G00380 PE=3 SV=1)

HSP 1 Score: 390 bits (1001), Expect = 1.67e-125
Identity = 189/320 (59.06%), Positives = 247/320 (77.19%), Query Frame = 0

Query: 2   LAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLMW 61
           LAFETKD I+A GTI D++  G+N+KV VDVV+DG C I  P ++G+ K++ EVGSH++W
Sbjct: 553 LAFETKDHIVAWGTIIDSDAKGDNVKVAVDVVVDGDCAIPIPSEQGMYKMSQEVGSHILW 612

Query: 62  PRHLVLTRNDKETVG-FNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 121
           P  LV+T N K   G F  D+S F+      A V L  L+R+ EHMGS+IQ+ TP +VF 
Sbjct: 613 PCDLVITNNIKMDYGEFTKDMSIFAPTPIQNASVALRFLLRMVEHMGSAIQITTPHDVFG 672

Query: 122 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYK 181
           V+RKCCIM+ESL+DF+SM+PI T+CLDAY+MYL+T M   R+L+L+KF+DV S+S  S K
Sbjct: 673 VRRKCCIMIESLKDFTSMRPIATACLDAYIMYLYTRMESSRTLNLYKFVDVGSISCGSSK 732

Query: 182 QSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 241
           + R QLL AR LG +YDQ++LFPYNSGNHWTLVV+N TKGAA+WIDPLKN+ID D++EV+
Sbjct: 733 EERAQLLTARLLGTDYDQLLLFPYNSGNHWTLVVINLTKGAAFWIDPLKNRIDPDVTEVV 792

Query: 242 QMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYSQ 301
           + SFNI  KKKP+W+VVK PKQ+GVVE  YY M FMR++I + STSI+ +MK  P  Y+Q
Sbjct: 793 ERSFNIMNKKKPDWRVVKCPKQSGVVEYGYYVMWFMRNVIMSTSTSIIQIMKDSPRAYTQ 852

Query: 302 DEIDEVRSELAEFLSKHVHR 320
           D+ID +RSE AEF+ KHV +
Sbjct: 853 DDIDCIRSEWAEFVGKHVDK 872

BLAST of CsGy3G021122 vs. ExPASy TrEMBL
Match: A0A5A7TBT8 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold84G001790 PE=3 SV=1)

HSP 1 Score: 365 bits (938), Expect = 8.25e-122
Identity = 180/320 (56.25%), Positives = 231/320 (72.19%), Query Frame = 0

Query: 2   LAFETKDTIIAHGTIFDAEGDGENIKVLVDVVLDGQCVILKPKKEGVTKLTHEVGSHLMW 61
           LAFE KD ++A GTI D++ +G+N+K                          EVG H++W
Sbjct: 98  LAFEMKDHVVAWGTIIDSDAEGDNVK--------------------------EVGLHILW 157

Query: 62  PRHLVLTRNDKETVG-FNTDLSTFSSATFLRAPVVLWCLVRLAEHMGSSIQLNTPSEVFS 121
           PR LV+  N K   G F  D+STF+      APV LW L+R+ EHMGS+IQ+ TP +VF 
Sbjct: 158 PRDLVIMNNIKMDYGEFTKDMSTFAPTPIQNAPVALWFLLRMVEHMGSAIQITTPHDVFG 217

Query: 122 VKRKCCIMVESLRDFSSMQPICTSCLDAYMMYLHTIMLEGRSLSLFKFMDVRSVSYSSYK 181
           V+RKCCIM+ESL+DF+SM+PI T+CLDAY+MYL+T M   R+L+L+KF+D  S+S  S K
Sbjct: 218 VRRKCCIMIESLKDFTSMRPIATACLDAYIMYLYTRMESSRTLNLYKFVDAGSISCGSSK 277

Query: 182 QSRLQLLNARFLGAEYDQVVLFPYNSGNHWTLVVVNPTKGAAYWIDPLKNQIDGDMSEVL 241
           + R QLL AR LG +YDQ++LFPYNSGNHWTLVV+N TKGAA+WIDPLKN+ID D++EV+
Sbjct: 278 EERAQLLTARLLGTDYDQLLLFPYNSGNHWTLVVINLTKGAAFWIDPLKNRIDPDVTEVV 337

Query: 242 QMSFNISKKKKPNWKVVKYPKQNGVVECEYYGMRFMRDIISAMSTSIVDVMKSLPPTYSQ 301
           + SFNI  KKKP W+VVK PKQ+GVVEC YY MRFMRDII + STSI+ +MK  P  Y+Q
Sbjct: 338 ERSFNIMNKKKPAWRVVKCPKQSGVVECGYYVMRFMRDIIMSTSTSIIQIMKDSPRAYTQ 391

Query: 302 DEIDEVRSELAEFLSKHVHR 320
           D+ID +RSE AEF+ KHV +
Sbjct: 398 DDIDCIRSEWAEFVGKHVDK 391

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value    Identity (%)  Description
XP_031739281.1  6.76e-224  96.57         uncharacterized protein LOC105434889 isoform X2 [Cucumis sativus]
XP_031739269.1  4.74e-222  96.27         uncharacterized protein LOC105434889 isoform X1 [Cucumis sativus] >XP_031739270....
XP_031738710.1  8.66e-202  89.10         uncharacterized protein LOC116402757 isoform X2 [Cucumis sativus]
XP_031738701.1  7.35e-201  89.13         uncharacterized protein LOC116402757 isoform X1 [Cucumis sativus] >XP_031738704....
KAE8650633.1    1.02e-179  97.29         hypothetical protein Csa_011794, partial [Cucumis sativus]

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A5D3D5Q6      2.09e-133  60.31         ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A5D3CDJ5      2.09e-133  60.31         ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A5A7U7F1      2.55e-126  58.57         ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A5A7UJF8      1.67e-125  59.06         ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A5A7TBT8      8.25e-122  56.25         ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN...
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                    Source       Source Term  Source Description              Alignment
None       No IPR available                                   GENE3D       3.40.395.10  Adenoviral Proteinase; Chain A  coord: 101..317; e-value: 7.3E-16; score: 60.2
IPR003653  Ulp1 protease family, C-terminal catalytic domain  PFAM         PF02902      Peptidase_C48                   coord: 199..292; e-value: 1.2E-4; score: 22.0
IPR003653  Ulp1 protease family, C-terminal catalytic domain  PROSITE      PS50600      ULP_PROTEASE                    coord: 128..279; score: 8.805888
IPR038765  Papain-like cysteine peptidase superfamily         SUPERFAMILY  54001        Cysteine proteinases            coord: 115..317
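
The `coord` values in the InterPro alignments can be compared directly to see how the domain predictions overlap on the 321-residue protein. The sketch below is illustrative only: it assumes the coordinates are 1-based inclusive residue positions (not stated on this page), and `parse_coord` is a hypothetical helper.

```python
import re

# Source and source term mapped to the coord string reported for each hit.
hits = {
    "GENE3D 3.40.395.10": "coord: 101..317",
    "PFAM PF02902": "coord: 199..292",
    "PROSITE PS50600": "coord: 128..279",
    "SUPERFAMILY 54001": "coord: 115..317",
}

def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinates in {text!r}")
    return int(m.group(1)), int(m.group(2))

spans = {name: parse_coord(text) for name, text in hits.items()}

# Assumption: inclusive coordinates, so length is end - start + 1.
lengths = {name: end - start + 1 for name, (start, end) in spans.items()}

# Region covered by every hit: latest start to earliest end.
core_start = max(start for start, _ in spans.values())
core_end = min(end for _, end in spans.values())

for name, (start, end) in spans.items():
    print(f"{name}: {start}..{end} ({lengths[name]} residues)")
if core_start <= core_end:
    print(f"shared by all hits: {core_start}..{core_end}")
```

With these four hits, the region common to all predictions is residues 199..279, which lies inside the Pfam Peptidase_C48 match.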

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy3G021122.1  CsGy3G021122.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
molecular_function  GO:0008234      cysteine-type peptidase activity