Clc10G21300 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc10G21300
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Ubiquitinyl hydrolase 1
Location: ClcChr10: 34538400 .. 34541247 (-)
RNA-Seq Expression: Clc10G21300
Synteny: Clc10G21300
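
The Location field in the overview can be turned into structured coordinates with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

# Pattern for a location string such as "ClcChr10: 34538400 .. 34541247 (-)".
LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (seqid, start, end, strand) parsed from a location string."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("ClcChr10: 34538400 .. 34541247 (-)")
print(seqid, strand, span_length(start, end))  # ClcChr10 - 2848
```

Under that assumption the gene spans 2848 bp on the minus strand of ClcChr10.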
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACGGAGCCTGCAATGGAGGCATGTTGTATCACGAGGTTCAAGAATCCAAGCTCTGCGCTGTGCATTGCGTCAACACCGTCTTGCAAGGTCCCTTTTTCTCCGAATTCGATTTGGCTGCTCTCGCTTCCGATCTTGACCGCAAAGAGCGCCAGATGATGCTTTCTGGTTCCACCACCGGTGATTTCCTCTCCGAGGAGTCTCACAATGTCTCCCTGGACGGTGATTTTAGCATCCAGGTTTCTTTCTTCTCCTTCTACTCAACCCACTTCTTTCAATCTTCTCCTCACGCTTCCCTCTCAACAACTATCATCGCTTTACTCTAATTACGCTTCTTCCTCCCTGGATAATCTTTATGTGATTGGAATTCCATTTATCATGCGAGTTGACTACTCGAATTCCAGGTGCTAATTGCTATTGAATATTGCAATTGCATTCAGGAATATTGGTTTACGGATATATCCAATTTCTTACGGATATCAATTGAATTGCGATCCTGATGTCCTAGTTACGTATTCTTTAACAGTAGAAATAGACCAACCGTTGTAAGCCACTATATATGCAGTAGTGACAGTCATTACGACCCATTTCCGTAATGGATATAGTAATTACTGTTTTGAATTTAAAATAACTGATACTATGACTTGTTGTAATGACATGGGAATTCACACGTTCATAACAGTACCCATAATGACAATTTCTCAAGACAATGTTTGTTTGCCAAAACCCATAGCTTCTATTCTCCCTTATGTCCCAATGTCATCTACCGGAGGTAGCAGTGTTGAGTAGTGTGGTGATGCAAATTTTGAATAACATTGCAATGCATGACCTTTGTGGCAATGCATACTGTTCGGCCGTACTTGATAATATAGTGCAAGTTTGCTGTGTTTCCTTCAAGTGATTTGCGTTTCTTGCTGCATGCTCTGCTTCAAATGTGAAATTCTAAAGAATAGGATTATATGTAACTTATAACTGTGAATGAGAATCGACCACATGGTTGAAATTTGTTTCTTAGATAAATAATAAAGAGAATTGACCACATGGAGTGTCTGAGGGGAGGGATCTTGGCATACCTATTGTAGAACAGATATACTTCTGATCTCTTAGGTCTCATTTGAATAACCATTTGGTTTTTTGGTTTTGAAAATTGTACTTTTTTTCTCATAGTTTTTTTATTTTCATATTTCCCAAGGAAATATTTGTAAAAAGTGTAGATAACAAAGCAAAATAGTCTATAGGTGGTAGTGGTGTTAGCTTAGTTCTCAGAACCAGAAACCATATCAAATGGGGTCTTAATTCTTAAATTTAACTTAGAATAAGATATGGGATTGTTAGAGATCGCTAGAAAGATTTGCGATCAGCAAGGGGTTATAAAGAAATGGGGGAAATGTGGGAGTAGACCAAGATTTAATGAGTACTCAAAGAAGATTGAAAGTGGGCAAGTAAGGACGGGGCTCTTGACTTTCTTCCCAAATCTTGTAATATTTTCTTTATCTTAATGTTAAAAAGCCTTGGTTCCCATCAACTACACCAGACAAGGCACGTTGCTTATGAATTTTGTGGATGACCAATGAAACTCAGTATCTTATTTTGGTACTCAAATTTAGCACACTATCTCTCTCTATCTTTCTCATTTTTCTTTTCTTTTCTTTTCTTTTCATGGGTAAAGTTTTGATGTAAAACAGTGTAATATTCAGAGGGATCGGAGTTCATTTTTTATAGACCAGTTTCTCTTATAAAGCAAGCAATACATAGAACTTGTTGGGATGTTCTTATATGTTTAAACCATTATATATTTGCTTGCAGGTCTTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTCCTCTCAACTCACCAGTTGCTGAACCTGCGCAGATTGATCCTGAACTGGAGAATGCATTTATATGCCACTTGCAAGACCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTACAATTTTGATAGTCTATATGCAGCCCCACAGCATC
TTTCTAAATTTTACCTTTCAGCTTACTTGGACTCTCTAAAGGGCTTCGGTTGGAGCATTTTTATTGTTAGGGGTAACTTCCCTAAGGATTTTCCCATCTCATCCTCTGAAGCATCCAACGGTTATGGTCAGTGGCTTTCCCCTGAGGATGCTGAGAGGATAACCAAATCGTGCAACTCTACCCAAGCCCCCCCTCCTCCTCAAAGAGTAAACTGGACAGAGCAGCAAGATACATTTCTTTCATCTGGAGAAACAGAAATGCTAATAGACATGGAGGATGAGGACTTGAAGGCTGCAATAGCTGCTAGCCTTATGGATTCCTCAGCAGTCATGGCAGCAGGAGTTGCTAACCCACAAAACGAACCTGCAGTTTCCTCCACCCAAGCTGCCTCCCCCAAGAATATACCTGTTGTTTCCCTTGAAACTGCCAAGACTGAAGATGTACCCGTAGTTTCCCGGAAAGCTTCCACCCTCCAAGATGTTCCTGCAGTTTCTCCAAAAGCTGACACTCTCCAAGATTTACCTGTAGTTTCCAACAAACCTGCCTCCCCTCAAAATGTACCTTTTGTTTCCCCTGAAGCTTCCTCCTCCCAGGATGTATGTGCACTTTCCCCCGATGCTGCTGCTACCCCCCAAGATTTACATGCTGTTTCCACCGCCAAAGCTGCCAACCCCAATAATGAATCTATGGTCTGCACAGAAGTTGCTGTTCATCAAAACGAGTCTGGAAATGGATCTGTAGGCAATGCGGATGCTGCCTTCTGTGAAAGTGGATCTGCAGATAATGCAGAATGTGGCATTTCCAGCCCTCGAAAGAAAATTAGTCGTACGAACAAGGGAACTGCGTGA

mRNA sequence

ATGGACGGAGCCTGCAATGGAGGCATGTTGTATCACGAGGTTCAAGAATCCAAGCTCTGCGCTGTGCATTGCGTCAACACCGTCTTGCAAGGTCCCTTTTTCTCCGAATTCGATTTGGCTGCTCTCGCTTCCGATCTTGACCGCAAAGAGCGCCAGATGATGCTTTCTGGTTCCACCACCGGTGATTTCCTCTCCGAGGAGTCTCACAATGTCTCCCTGGACGGTGATTTTAGCATCCAGGTCTTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTCCTCTCAACTCACCAGTTGCTGAACCTGCGCAGATTGATCCTGAACTGGAGAATGCATTTATATGCCACTTGCAAGACCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTACAATTTTGATAGTCTATATGCAGCCCCACAGCATCTTTCTAAATTTTACCTTTCAGCTTACTTGGACTCTCTAAAGGGCTTCGGTTGGAGCATTTTTATTGTTAGGGGTAACTTCCCTAAGGATTTTCCCATCTCATCCTCTGAAGCATCCAACGGTTATGGTCAGTGGCTTTCCCCTGAGGATGCTGAGAGGATAACCAAATCGTGCAACTCTACCCAAGCCCCCCCTCCTCCTCAAAGAGTAAACTGGACAGAGCAGCAAGATACATTTCTTTCATCTGGAGAAACAGAAATGCTAATAGACATGGAGGATGAGGACTTGAAGGCTGCAATAGCTGCTAGCCTTATGGATTCCTCAGCAGTCATGGCAGCAGGAGTTGCTAACCCACAAAACGAACCTGCAGTTTCCTCCACCCAAGCTGCCTCCCCCAAGAATATACCTGTTGTTTCCCTTGAAACTGCCAAGACTGAAGATGTACCCGTAGTTTCCCGGAAAGCTTCCACCCTCCAAGATGTTCCTGCAGTTTCTCCAAAAGCTGACACTCTCCAAGATTTACCTGTAGTTTCCAACAAACCTGCCTCCCCTCAAAATGTACCTTTTGTTTCCCCTGAAGCTTCCTCCTCCCAGGATGTATGTGCACTTTCCCCCGATGCTGCTGCTACCCCCCAAGATTTACATGCTGTTTCCACCGCCAAAGCTGCCAACCCCAATAATGAATCTATGGTCTGCACAGAAGTTGCTGTTCATCAAAACGAGTCTGGAAATGGATCTGTAGGCAATGCGGATGCTGCCTTCTGTGAAAGTGGATCTGCAGATAATGCAGAATGTGGCATTTCCAGCCCTCGAAAGAAAATTAGTCGTACGAACAAGGGAACTGCGTGA

Coding sequence (CDS)

ATGGACGGAGCCTGCAATGGAGGCATGTTGTATCACGAGGTTCAAGAATCCAAGCTCTGCGCTGTGCATTGCGTCAACACCGTCTTGCAAGGTCCCTTTTTCTCCGAATTCGATTTGGCTGCTCTCGCTTCCGATCTTGACCGCAAAGAGCGCCAGATGATGCTTTCTGGTTCCACCACCGGTGATTTCCTCTCCGAGGAGTCTCACAATGTCTCCCTGGACGGTGATTTTAGCATCCAGGTCTTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTCCTCTCAACTCACCAGTTGCTGAACCTGCGCAGATTGATCCTGAACTGGAGAATGCATTTATATGCCACTTGCAAGACCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTACAATTTTGATAGTCTATATGCAGCCCCACAGCATCTTTCTAAATTTTACCTTTCAGCTTACTTGGACTCTCTAAAGGGCTTCGGTTGGAGCATTTTTATTGTTAGGGGTAACTTCCCTAAGGATTTTCCCATCTCATCCTCTGAAGCATCCAACGGTTATGGTCAGTGGCTTTCCCCTGAGGATGCTGAGAGGATAACCAAATCGTGCAACTCTACCCAAGCCCCCCCTCCTCCTCAAAGAGTAAACTGGACAGAGCAGCAAGATACATTTCTTTCATCTGGAGAAACAGAAATGCTAATAGACATGGAGGATGAGGACTTGAAGGCTGCAATAGCTGCTAGCCTTATGGATTCCTCAGCAGTCATGGCAGCAGGAGTTGCTAACCCACAAAACGAACCTGCAGTTTCCTCCACCCAAGCTGCCTCCCCCAAGAATATACCTGTTGTTTCCCTTGAAACTGCCAAGACTGAAGATGTACCCGTAGTTTCCCGGAAAGCTTCCACCCTCCAAGATGTTCCTGCAGTTTCTCCAAAAGCTGACACTCTCCAAGATTTACCTGTAGTTTCCAACAAACCTGCCTCCCCTCAAAATGTACCTTTTGTTTCCCCTGAAGCTTCCTCCTCCCAGGATGTATGTGCACTTTCCCCCGATGCTGCTGCTACCCCCCAAGATTTACATGCTGTTTCCACCGCCAAAGCTGCCAACCCCAATAATGAATCTATGGTCTGCACAGAAGTTGCTGTTCATCAAAACGAGTCTGGAAATGGATCTGTAGGCAATGCGGATGCTGCCTTCTGTGAAAGTGGATCTGCAGATAATGCAGAATGTGGCATTTCCAGCCCTCGAAAGAAAATTAGTCGTACGAACAAGGGAACTGCGTGA

Protein sequence

MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKASTLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHAVSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISRTNKGTA
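
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not name explicitly; `translate` is a hypothetical helper, not a function of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0.

    Unknown codons (for example those containing N) become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGACGGAGCCTGCAATGGAGGC"))  # MDGACNGG
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check between the two records.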
Homology
BLAST of Clc10G21300 vs. NCBI nr
Match: XP_038904645.1 (ataxin-3 homolog [Benincasa hispida])

HSP 1 Score: 775.0 bits (2000), Expect = 3.3e-220
Identity = 394/426 (92.49%), Positives = 408/426 (95.77%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQ DTFLSSGETEMLIDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQHDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASL+DSSAVMAAG AN QNEPAVSSTQAASP+N+PVVSLETAKTED P+VS KAS
Sbjct: 241 KAAIAASLLDSSAVMAAGAANSQNEPAVSSTQAASPQNVPVVSLETAKTEDAPIVSLKAS 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
           TLQDVPAV PKA TLQD+PVVSNKP+SPQNVPFVSP+AS+SQDV  LSPDA ATPQDLHA
Sbjct: 301 TLQDVPAVFPKAATLQDVPVVSNKPSSPQNVPFVSPKASTSQDVRPLSPDATATPQDLHA 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
           VST K A PNN+  VCTEV+VHQNESGN SVGNA+AAF ESG ADNAECG+SSPRKKISR
Sbjct: 361 VSTTKTATPNNKPAVCTEVSVHQNESGNESVGNAEAAFRESGPADNAECGVSSPRKKISR 420

Query: 421 TNKGTA 427
           T++GTA
Sbjct: 421 TDEGTA 426
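
The percentages in an HSP summary line follow directly from the match counts and the alignment length. As a rough sketch (the helper name `hsp_percentages` is hypothetical, and the pattern accepts any label beginning with "Pos" so that spelling variants of the positives field are tolerated), they can be recomputed like this:

```python
import re

# Captures identities/length and positives/length from an HSP summary line.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 394/426 (92.49%), Positives = 408/426 (95.77%), Query Frame = 0"
print(hsp_percentages(line))  # (92.49, 95.77)
```

For the Benincasa hispida hit, 394 identical residues over an alignment length of 426 gives 92.49% identity, matching the reported value.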

BLAST of Clc10G21300 vs. NCBI nr
Match: XP_004137739.1 (ataxin-3 homolog [Cucumis sativus] >KGN58800.1 hypothetical protein Csa_000861 [Cucumis sativus])

HSP 1 Score: 753.4 bits (1944), Expect = 1.0e-213
Identity = 382/426 (89.67%), Positives = 398/426 (93.43%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASLMDSSAVMAAGVANP NEP VSSTQA SP+N+P V+LETA T+DV  VS  AS
Sbjct: 241 KAAIAASLMDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVALETANTQDVLAVSPNAS 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
            L+DVPAVSP+A TLQD+P +S K ASPQN P VSPEAS+SQDVC LSP+AA  PQDLH 
Sbjct: 301 ILEDVPAVSPEAATLQDVPAISAKAASPQNAPNVSPEASTSQDVCELSPNAADIPQDLHT 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
           VSTAKAA P N+S VCTEV+VHQNESGN SVGNAD AFC+SGSADN EC +SSPRKKISR
Sbjct: 361 VSTAKAAIPKNKSAVCTEVSVHQNESGNESVGNADTAFCDSGSADNTECAVSSPRKKISR 420

Query: 421 TNKGTA 427
           TN+GTA
Sbjct: 421 TNEGTA 426

BLAST of Clc10G21300 vs. NCBI nr
Match: XP_008442472.1 (PREDICTED: ataxin-3 homolog [Cucumis melo] >KAA0044145.1 ataxin-3-like protein [Cucumis melo var. makuwa] >TYK24992.1 ataxin-3-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 748.0 bits (1930), Expect = 4.3e-212
Identity = 383/426 (89.91%), Positives = 394/426 (92.49%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASL+DSSAVMAAGVANP NEP VSSTQA SP+N+P VSLETA T+DV  VS  AS
Sbjct: 241 KAAIAASLLDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVSLETANTQDVLAVSPNAS 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
            LQDV AVS KA TLQD+P +S K A PQNVP VSPEAS+SQDV AL PDAA  PQDLHA
Sbjct: 301 ILQDVTAVSTKAATLQDVPAISAKAACPQNVPNVSPEASTSQDVRALPPDAADAPQDLHA 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
           VST KAA PN++S VCTEV VHQNESGN SVGNAD AFCESGSADN EC ISSPRKKISR
Sbjct: 361 VSTTKAATPNDKSAVCTEVVVHQNESGNESVGNADTAFCESGSADNTECAISSPRKKISR 420

Query: 421 TNKGTA 427
           TN+G A
Sbjct: 421 TNEGAA 426

BLAST of Clc10G21300 vs. NCBI nr
Match: XP_022983320.1 (ataxin-3 homolog [Cucurbita maxima])

HSP 1 Score: 647.5 bits (1669), Expect = 8.0e-182
Identity = 344/421 (81.71%), Positives = 369/421 (87.65%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTT
Sbjct: 1   MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLTGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSE+SHNVSLDGDFSIQVLQKALEVWDLQVI LNSP AE AQIDPELE+AFICHLQ+
Sbjct: 61  GDFLSEDSHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQIDPELESAFICHLQN 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPIS S
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISCS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWL+PEDA+RITKSCNSTQA  PPQ +NWT+ QDTFLS G+ EML+D+EDED 
Sbjct: 181 EASNGYGQWLTPEDADRITKSCNSTQA-RPPQGINWTKPQDTFLSYGDAEMLMDVEDEDF 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASLMDS AVMAAGVANP NEPAVS TQAASP+N            ++P VS KA 
Sbjct: 241 KAAIAASLMDSPAVMAAGVANPHNEPAVSFTQAASPQN------------NIPAVSPKAF 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
           T QDVPAV+PKA TL+D+PVVS KPASPQ++P V+PEAS SQ+V ALSPDAAAT   LHA
Sbjct: 301 THQDVPAVAPKAATLKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAAT---LHA 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
           VS AKA  PNN + VCTEVAVH+NE  N SVGNADAAF ESG ADNAEC +SSPRKKISR
Sbjct: 361 VSAAKATTPNNLT-VCTEVAVHKNEPANESVGNADAAFDESGLADNAECAVSSPRKKISR 404

Query: 421 T 422
           T
Sbjct: 421 T 404

BLAST of Clc10G21300 vs. NCBI nr
Match: XP_023527477.1 (ataxin-3 homolog [Cucurbita pepo subsp. pepo])

HSP 1 Score: 647.5 bits (1669), Expect = 8.0e-182
Identity = 345/426 (80.99%), Positives = 370/426 (86.85%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTT
Sbjct: 1   MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLS+ESHNVSLDGDFSIQVLQKALEVWDLQVI LNSP AE AQIDPELE+AFICHLQ+
Sbjct: 61  GDFLSDESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQIDPELESAFICHLQN 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPIS S
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWL+PEDA+RITKSCNSTQA  PPQ +NWT+ QDTFLS G+ EML+D+EDED 
Sbjct: 181 EASNGYGQWLTPEDADRITKSCNSTQA-RPPQGINWTKPQDTFLSYGDAEMLMDVEDEDF 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASLMDS AVMAAGVANP NEPAVSSTQAASP+N            ++P V  KA 
Sbjct: 241 KAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQN------------NIPAVFPKAF 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
           T QDVPAV+PKA TL+D+PVVS KPASPQ++P V+PEAS SQ+V ALSPDAAAT   LHA
Sbjct: 301 THQDVPAVAPKAATLKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAAT---LHA 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
            S AKA  PNN + VCTEVAVH+NE  N SVGNADAAF ESG ADNAEC +SSPRKKISR
Sbjct: 361 DSAAKATTPNNLT-VCTEVAVHKNEPANESVGNADAAFGESGLADNAECAVSSPRKKISR 409

Query: 421 TNKGTA 427
           TN G A
Sbjct: 421 TNVGAA 409

BLAST of Clc10G21300 vs. ExPASy Swiss-Prot
Match: Q9M391 (Ataxin-3 homolog OS=Arabidopsis thaliana OX=3702 GN=At3g54130 PE=1 SV=1)

HSP 1 Score: 375.2 bits (962), Expect = 1.0e-102
Identity = 193/264 (73.11%), Positives = 217/264 (82.20%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           M+   NGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A+DLD KERQ+ML G+  
Sbjct: 1   MERTSNGGMLYHEVQESNLCAVHCVNTVLQGPFFSEFDLAAVAADLDGKERQVMLEGAAV 60

Query: 61  -----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFI 120
                GDFL+EESHNVSL GDFSIQVLQKALEVWDLQVIPLN P AEPAQIDPELE+AFI
Sbjct: 61  GGFAPGDFLAEESHNVSLGGDFSIQVLQKALEVWDLQVIPLNCPDAEPAQIDPELESAFI 120

Query: 121 CHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDF 180
           CHL DHWFCIRKVNGEWYNFDSL AAPQHLSKFYLSA+LDSLKG GWSIFIV+GNFP++ 
Sbjct: 121 CHLHDHWFCIRKVNGEWYNFDSLLAAPQHLSKFYLSAFLDSLKGAGWSIFIVKGNFPQEC 180

Query: 181 PI-SSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQ--DTFLSSGETEML 240
           P+ SSSEASN +GQWLSPEDAERI K+ +S  +    +  +   QQ  +  LS  E +  
Sbjct: 181 PMSSSSEASNSFGQWLSPEDAERIRKNTSSGSSARNKRSNDNVNQQRRNQALSREEVQAF 240

Query: 241 IDMEDEDLKAAIAASLMDSSAVMA 257
            +MED+DLKAAIAASL+D+SA  A
Sbjct: 241 SEMEDDDLKAAIAASLLDASAAEA 264

BLAST of Clc10G21300 vs. ExPASy Swiss-Prot
Match: Q8LQ36 (Putative ataxin-3 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=Os01g0851400 PE=3 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 1.2e-90
Identity = 184/324 (56.79%), Positives = 222/324 (68.52%), Query Frame = 0

Query: 4   ACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSG------ 63
           A NGG+LYHEVQE KLCAVHCVNT LQGPFFSEFDL+ALA DLD++ERQ+M  G      
Sbjct: 7   ASNGGLLYHEVQEGKLCAVHCVNTTLQGPFFSEFDLSALAVDLDQRERQVMSEGAAGAAT 66

Query: 64  STTGDFLS--EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFI 123
           +  GDFL+  E SHNVSL GDFSIQVLQKALEVWDLQVIPL+SP       DPELE AFI
Sbjct: 67  TAAGDFLAEGEGSHNVSLGGDFSIQVLQKALEVWDLQVIPLDSPDVGSCLFDPELETAFI 126

Query: 124 CHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDF 183
           CHLQDHWFCIRKVNGEWYNF+SLY AP+HLSKFYLSA++D+LKG GWSIF VRGNFPK+ 
Sbjct: 127 CHLQDHWFCIRKVNGEWYNFNSLYPAPEHLSKFYLSAFIDTLKGSGWSIFAVRGNFPKEC 186

Query: 184 PISSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDM 243
           P+ ++E SNG+GQWL+P+DA RIT SCN  Q P     V+    Q   +S  E +M+   
Sbjct: 187 PM-ATEGSNGFGQWLTPDDARRITSSCNQVQTPTQQAGVSLVADQSEEMS--EMDMIAAQ 246

Query: 244 EDE-DLKAAIAASLMDSSAVMAAGVANPQNEP----AVSSTQAASPKNIPVVSLETAKTE 303
           ++E DL AAIAASLMD+    A   A+ ++      A+ ST     K+    +LE     
Sbjct: 247 QEEADLNAAIAASLMDTGGPFANYAAHEESRSQDAFAIESTSGEMSKD---GNLEEQGAN 306

Query: 304 DVPVVSRKASTLQDVPAVSPKADT 315
                   +  ++     +PK +T
Sbjct: 307 KSETSEPNSDNIESASGSNPKQNT 324

BLAST of Clc10G21300 vs. ExPASy Swiss-Prot
Match: P54252 (Ataxin-3 OS=Homo sapiens OX=9606 GN=ATXN3 PE=1 SV=5)

HSP 1 Score: 158.7 bits (400), Expect = 1.5e-37
Identity = 90/242 (37.19%), Positives = 140/242 (57.85%), Query Frame = 0

Query: 10  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLS 69
           ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL 
Sbjct: 4   IFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRTFLQ 63

Query: 70  EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQDHWFCI 129
           + S N+   G FSIQV+  AL+VW L++I  NSP  +  +IDP  E +FIC+ ++HWF +
Sbjct: 64  QPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHWFTV 123

Query: 130 RKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNG 189
           RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P        EA   
Sbjct: 124 RKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLP------DCEADQ- 183

Query: 190 YGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIA 248
             Q +  +   R  K      A    QRV+ T+  +  L + +   ++D ++EDL+ A+A
Sbjct: 184 LLQMIRVQQMHR-PKLIGEELAQLKEQRVHKTD-LERVLEANDGSGMLDEDEEDLQRALA 236

BLAST of Clc10G21300 vs. ExPASy Swiss-Prot
Match: Q9CVD2 (Ataxin-3 OS=Mus musculus OX=10090 GN=Atxn3 PE=1 SV=2)

HSP 1 Score: 156.8 bits (395), Expect = 5.6e-37
Identity = 97/288 (33.68%), Positives = 152/288 (52.78%), Query Frame = 0

Query: 10  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLS 69
           ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL 
Sbjct: 4   IFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERLRMAEGGVTSEDYRTFLQ 63

Query: 70  EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQDHWFCI 129
           + S N+   G FSIQV+  AL+VW L++I  NSP  +  +IDP  E +FIC+ ++HWF +
Sbjct: 64  QPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHWFTV 123

Query: 130 RKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFP--------KDFPI 189
           RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P        +   +
Sbjct: 124 RKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQMIKV 183

Query: 190 SSSEASNGYGQWLS--------PEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGET 249
                    G+ L+          D ER+ ++ + +        +   ++ D   +   +
Sbjct: 184 QQMHRPKLIGEELAHLKEQSALKADLERVLEAADGS-------GIFDEDEDDLQRALAIS 243

Query: 250 EMLIDMEDE--DLKAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAAS 276
              IDMEDE  DL+ AI  S+  SS  M       +N P  SS   +S
Sbjct: 244 RQEIDMEDEEADLRRAIQLSMQGSSRSMC------ENSPQTSSPDLSS 278

BLAST of Clc10G21300 vs. ExPASy Swiss-Prot
Match: O35815 (Ataxin-3 OS=Rattus norvegicus OX=10116 GN=Atxn3 PE=1 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 2.1e-36
Identity = 97/282 (34.40%), Positives = 151/282 (53.55%), Query Frame = 0

Query: 10  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLS 69
           ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL 
Sbjct: 4   IFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERLRMAEGGVTSEDYRTFLQ 63

Query: 70  EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQDHWFCI 129
           + S N+   G FSIQV+  AL+VW L++I  NSP  +  +IDP  E +FIC+ ++HWF +
Sbjct: 64  QPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHWFTV 123

Query: 130 RKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFP---KDFPISSSEA 189
           RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P    D  +   + 
Sbjct: 124 RKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQMIKV 183

Query: 190 SNGYGQWLSPEDAERITKSC-------NSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDM 249
              +   L  E+   + +            +A   P   +  ++ D   +   +   IDM
Sbjct: 184 QQMHRPKLIGEELAHLKEQSALKADLERVLEAADGPGMFD-DDEDDLQRALAMSRQEIDM 243

Query: 250 EDE--DLKAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAAS 276
           EDE  DL+ AI  S+  SS  M       ++ P  SST  +S
Sbjct: 244 EDEEADLRRAIQLSMQGSSRGMC------EDSPQTSSTDLSS 278

BLAST of Clc10G21300 vs. ExPASy TrEMBL
Match: A0A0A0L9Z8 (Ubiquitinyl hydrolase 1 OS=Cucumis sativus OX=3659 GN=Csa_3G732570 PE=4 SV=1)

HSP 1 Score: 753.4 bits (1944), Expect = 5.0e-214
Identity = 382/426 (89.67%), Positives = 398/426 (93.43%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASLMDSSAVMAAGVANP NEP VSSTQA SP+N+P V+LETA T+DV  VS  AS
Sbjct: 241 KAAIAASLMDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVALETANTQDVLAVSPNAS 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
            L+DVPAVSP+A TLQD+P +S K ASPQN P VSPEAS+SQDVC LSP+AA  PQDLH 
Sbjct: 301 ILEDVPAVSPEAATLQDVPAISAKAASPQNAPNVSPEASTSQDVCELSPNAADIPQDLHT 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
           VSTAKAA P N+S VCTEV+VHQNESGN SVGNAD AFC+SGSADN EC +SSPRKKISR
Sbjct: 361 VSTAKAAIPKNKSAVCTEVSVHQNESGNESVGNADTAFCDSGSADNTECAVSSPRKKISR 420

Query: 421 TNKGTA 427
           TN+GTA
Sbjct: 421 TNEGTA 426

BLAST of Clc10G21300 vs. ExPASy TrEMBL
Match: A0A5D3DN96 (Ubiquitinyl hydrolase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G001100 PE=4 SV=1)

HSP 1 Score: 748.0 bits (1930), Expect = 2.1e-212
Identity = 383/426 (89.91%), Positives = 394/426 (92.49%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASL+DSSAVMAAGVANP NEP VSSTQA SP+N+P VSLETA T+DV  VS  AS
Sbjct: 241 KAAIAASLLDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVSLETANTQDVLAVSPNAS 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
            LQDV AVS KA TLQD+P +S K A PQNVP VSPEAS+SQDV AL PDAA  PQDLHA
Sbjct: 301 ILQDVTAVSTKAATLQDVPAISAKAACPQNVPNVSPEASTSQDVRALPPDAADAPQDLHA 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
           VST KAA PN++S VCTEV VHQNESGN SVGNAD AFCESGSADN EC ISSPRKKISR
Sbjct: 361 VSTTKAATPNDKSAVCTEVVVHQNESGNESVGNADTAFCESGSADNTECAISSPRKKISR 420

Query: 421 TNKGTA 427
           TN+G A
Sbjct: 421 TNEGAA 426

BLAST of Clc10G21300 vs. ExPASy TrEMBL
Match: A0A1S3B6I4 (Ubiquitinyl hydrolase 1 OS=Cucumis melo OX=3656 GN=LOC103486329 PE=4 SV=1)

HSP 1 Score: 748.0 bits (1930), Expect = 2.1e-212
Identity = 383/426 (89.91%), Positives = 394/426 (92.49%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASL+DSSAVMAAGVANP NEP VSSTQA SP+N+P VSLETA T+DV  VS  AS
Sbjct: 241 KAAIAASLLDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVSLETANTQDVLAVSPNAS 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
            LQDV AVS KA TLQD+P +S K A PQNVP VSPEAS+SQDV AL PDAA  PQDLHA
Sbjct: 301 ILQDVTAVSTKAATLQDVPAISAKAACPQNVPNVSPEASTSQDVRALPPDAADAPQDLHA 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
           VST KAA PN++S VCTEV VHQNESGN SVGNAD AFCESGSADN EC ISSPRKKISR
Sbjct: 361 VSTTKAATPNDKSAVCTEVVVHQNESGNESVGNADTAFCESGSADNTECAISSPRKKISR 420

Query: 421 TNKGTA 427
           TN+G A
Sbjct: 421 TNEGAA 426

BLAST of Clc10G21300 vs. ExPASy TrEMBL
Match: A0A6J1J5J2 (Ubiquitinyl hydrolase 1 OS=Cucurbita maxima OX=3661 GN=LOC111481936 PE=4 SV=1)

HSP 1 Score: 647.5 bits (1669), Expect = 3.9e-182
Identity = 344/421 (81.71%), Positives = 369/421 (87.65%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTT
Sbjct: 1   MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLTGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSE+SHNVSLDGDFSIQVLQKALEVWDLQVI LNSP AE AQIDPELE+AFICHLQ+
Sbjct: 61  GDFLSEDSHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQIDPELESAFICHLQN 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPIS S
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISCS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWL+PEDA+RITKSCNSTQA  PPQ +NWT+ QDTFLS G+ EML+D+EDED 
Sbjct: 181 EASNGYGQWLTPEDADRITKSCNSTQA-RPPQGINWTKPQDTFLSYGDAEMLMDVEDEDF 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASLMDS AVMAAGVANP NEPAVS TQAASP+N            ++P VS KA 
Sbjct: 241 KAAIAASLMDSPAVMAAGVANPHNEPAVSFTQAASPQN------------NIPAVSPKAF 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
           T QDVPAV+PKA TL+D+PVVS KPASPQ++P V+PEAS SQ+V ALSPDAAAT   LHA
Sbjct: 301 THQDVPAVAPKAATLKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAAT---LHA 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
           VS AKA  PNN + VCTEVAVH+NE  N SVGNADAAF ESG ADNAEC +SSPRKKISR
Sbjct: 361 VSAAKATTPNNLT-VCTEVAVHKNEPANESVGNADAAFDESGLADNAECAVSSPRKKISR 404

Query: 421 T 422
           T
Sbjct: 421 T 404

BLAST of Clc10G21300 vs. ExPASy TrEMBL
Match: A0A6J1F1U3 (Ubiquitinyl hydrolase 1 OS=Cucurbita moschata OX=3662 GN=LOC111441397 PE=4 SV=1)

HSP 1 Score: 641.7 bits (1654), Expect = 2.1e-180
Identity = 342/426 (80.28%), Positives = 367/426 (86.15%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTT
Sbjct: 1   MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVI LNSP AE AQIDPELE+AFICHLQ+
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQIDPELESAFICHLQN 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPIS S
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDL 240
           EASNGYGQWL+PEDA+RITKSCNSTQA  PPQ +NWT+  DTFLS G+ EML+D+EDED 
Sbjct: 181 EASNGYGQWLTPEDADRITKSCNSTQA-RPPQGINWTKPHDTFLSYGDAEMLMDVEDEDF 240

Query: 241 KAAIAASLMDSSAVMAAGVANPQNEPAVSSTQAASPKNIPVVSLETAKTEDVPVVSRKAS 300
           KAAIAASLMDS AVMAAGVANP NEPAVSSTQAASP+N            ++P V  KA 
Sbjct: 241 KAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQN------------NIPAVFPKAF 300

Query: 301 TLQDVPAVSPKADTLQDLPVVSNKPASPQNVPFVSPEASSSQDVCALSPDAAATPQDLHA 360
           T QDVPAV+ KA TL+D+PV+S KPASP+++P V+PEAS SQ+V ALSPDAAAT   LH 
Sbjct: 301 THQDVPAVAAKAATLKDVPVISTKPASPKDMPVVTPEASISQNVRALSPDAAAT---LHV 360

Query: 361 VSTAKAANPNNESMVCTEVAVHQNESGNGSVGNADAAFCESGSADNAECGISSPRKKISR 420
            S AKA  PNN + VCTEVAVHQNE  N SVGNADAAF ESG ADNAEC +SSPRKKISR
Sbjct: 361 DSAAKATTPNNLT-VCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKKISR 409

Query: 421 TNKGTA 427
           TN G A
Sbjct: 421 TNVGAA 409

BLAST of Clc10G21300 vs. TAIR 10
Match: AT3G54130.1 (Josephin family protein )

HSP 1 Score: 375.2 bits (962), Expect = 7.1e-104
Identity = 193/264 (73.11%), Positives = 217/264 (82.20%), Query Frame = 0

Query: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60
           M+   NGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A+DLD KERQ+ML G+  
Sbjct: 1   MERTSNGGMLYHEVQESNLCAVHCVNTVLQGPFFSEFDLAAVAADLDGKERQVMLEGAAV 60

Query: 61  -----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFI 120
                GDFL+EESHNVSL GDFSIQVLQKALEVWDLQVIPLN P AEPAQIDPELE+AFI
Sbjct: 61  GGFAPGDFLAEESHNVSLGGDFSIQVLQKALEVWDLQVIPLNCPDAEPAQIDPELESAFI 120

Query: 121 CHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDF 180
           CHL DHWFCIRKVNGEWYNFDSL AAPQHLSKFYLSA+LDSLKG GWSIFIV+GNFP++ 
Sbjct: 121 CHLHDHWFCIRKVNGEWYNFDSLLAAPQHLSKFYLSAFLDSLKGAGWSIFIVKGNFPQEC 180

Query: 181 PI-SSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQ--DTFLSSGETEML 240
           P+ SSSEASN +GQWLSPEDAERI K+ +S  +    +  +   QQ  +  LS  E +  
Sbjct: 181 PMSSSSEASNSFGQWLSPEDAERIRKNTSSGSSARNKRSNDNVNQQRRNQALSREEVQAF 240

Query: 241 IDMEDEDLKAAIAASLMDSSAVMA 257
            +MED+DLKAAIAASL+D+SA  A
Sbjct: 241 SEMEDDDLKAAIAASLLDASAAEA 264

BLAST of Clc10G21300 vs. TAIR 10
Match: AT2G29640.1 (JOSEPHIN-like protein)

HSP 1 Score: 57.0 bits (136), Expect = 4.3e-08
Identity = 40/147 (27.21%), Positives = 68/147 (46.26%), Query Frame = 0

Query: 10  LYHEVQESKLCAVHCVNTVLQG-PFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEES 69
           +YHE Q  + C +HC+N + Q    F++  L ++A  L+  +        T   F+ +  
Sbjct: 8   IYHERQRLQFCLLHCLNNLFQDKDAFTKESLNSIAEKLETNDPNK--ETWTPLSFVLKPH 67

Query: 70  HNVSLDGDFSIQVLQKALE------VWDLQVIPLNSPVAEPAQ------IDPELENAFIC 129
           HN ++ G++ + V+  ALE      VW  + I  +S   + A       ++  ++     
Sbjct: 68  HN-TITGNYDVNVMITALEGKGKSVVWHDKRIGASSIDLDDADTLMGIVLNVPVKRYGGL 127

Query: 130 HLQDHWFCIRKVNGEWYNFDSLYAAPQ 144
               HW  +RK+NG WYN DS    PQ
Sbjct: 128 WRSRHWVVVRKINGVWYNLDSDLVVPQ 151

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038904645.1  3.3e-220  92.49         ataxin-3 homolog [Benincasa hispida]
XP_004137739.1  1.0e-213  89.67         ataxin-3 homolog [Cucumis sativus] >KGN58800.1 hypothetical protein Csa_000861 [...
XP_008442472.1  4.3e-212  89.91         PREDICTED: ataxin-3 homolog [Cucumis melo] >KAA0044145.1 ataxin-3-like protein [...
XP_022983320.1  8.0e-182  81.71         ataxin-3 homolog [Cucurbita maxima]
XP_023527477.1  8.0e-182  80.99         ataxin-3 homolog [Cucurbita pepo subsp. pepo]

Match Name  E-value   Identity (%)  Description
Q9M391      1.0e-102  73.11         Ataxin-3 homolog OS=Arabidopsis thaliana OX=3702 GN=At3g54130 PE=1 SV=1
Q8LQ36      1.2e-90   56.79         Putative ataxin-3 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=Os01g08514...
P54252      1.5e-37   37.19         Ataxin-3 OS=Homo sapiens OX=9606 GN=ATXN3 PE=1 SV=5
Q9CVD2      5.6e-37   33.68         Ataxin-3 OS=Mus musculus OX=10090 GN=Atxn3 PE=1 SV=2
O35815      2.1e-36   34.40         Ataxin-3 OS=Rattus norvegicus OX=10116 GN=Atxn3 PE=1 SV=1

Match Name  E-value   Identity (%)  Description
A0A0A0L9Z8  5.0e-214  89.67         Ubiquitinyl hydrolase 1 OS=Cucumis sativus OX=3659 GN=Csa_3G732570 PE=4 SV=1
A0A5D3DN96  2.1e-212  89.91         Ubiquitinyl hydrolase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B6I4  2.1e-212  89.91         Ubiquitinyl hydrolase 1 OS=Cucumis melo OX=3656 GN=LOC103486329 PE=4 SV=1
A0A6J1J5J2  3.9e-182  81.71         Ubiquitinyl hydrolase 1 OS=Cucurbita maxima OX=3661 GN=LOC111481936 PE=4 SV=1
A0A6J1F1U3  2.1e-180  80.28         Ubiquitinyl hydrolase 1 OS=Cucurbita moschata OX=3662 GN=LOC111441397 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT3G54130.1  7.1e-104  73.11         Josephin family protein
AT2G29640.1  4.3e-08   27.21         JOSEPHIN-like protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                 Source       Source Term    Source Description   Alignment
None       No IPR available                PRINTS       PR01233        JOSEPHIN             coord: 11..34, score: 58.33
                                                                                            coord: 129..149, score: 33.33
                                                                                            coord: 108..127, score: 50.0
                                                                                            coord: 153..172, score: 45.0
None       No IPR available                GENE3D       1.10.287.10    -                    coord: 33..74, e-value: 4.4E-62, score: 210.4
None       No IPR available                GENE3D       3.90.70.40     -                    coord: 19..172, e-value: 4.4E-62, score: 210.4
None       No IPR available                MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 404..426
None       No IPR available                PANTHER      PTHR14159:SF0  ATAXIN-3             coord: 1..354
IPR006155  Josephin domain                 SMART        SM01246        Josephin_2           coord: 14..170, e-value: 6.3E-79, score: 278.2
IPR006155  Josephin domain                 PFAM         PF02099        Josephin             coord: 15..168, e-value: 3.5E-49, score: 167.0
IPR006155  Josephin domain                 PROSITE      PS50957        JOSEPHIN             coord: 7..182, score: 23.493376
IPR033865  Machado-Joseph disease protein  PANTHER      PTHR14159      ATAXIN-3-RELATED     coord: 1..354
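
The coordinates in the InterPro table above show that several independent signatures place the Josephin domain in the same N-terminal region. As a rough illustration, the Python sketch below checks which hits fall entirely within the PROSITE profile span (7..182). The hit list is copied from the table; the helper name `contained_in` is a hypothetical choice for this example.

```python
# Domain hits copied from the InterPro table: (source, source term, start, end).
hits = [
    ("PROSITE", "PS50957", 7, 182),
    ("SMART", "SM01246", 14, 170),
    ("PFAM", "PF02099", 15, 168),
    ("GENE3D", "3.90.70.40", 19, 172),
    ("MOBIDB_LITE", "mobidb-lite", 404, 426),
]


def contained_in(inner, outer):
    """True if the inclusive range `inner` lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Reference span: the PROSITE Josephin profile hit (PS50957).
prosite_span = (7, 182)

inside = [
    term for _source, term, start, end in hits
    if contained_in((start, end), prosite_span)
]
print(inside)
```

Under these coordinates the SMART, Pfam and Gene3D (3.90.70.40) hits all nest inside the PROSITE span, while the MobiDB-lite disorder prediction lies at the C-terminus, outside the domain.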

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc10G21300.1  Clc10G21300.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016579 protein deubiquitination
cellular_component GO:0005634 nucleus
molecular_function GO:0004843 thiol-dependent deubiquitinase