Tan0006652 (gene) Snake gourd v1

Overview
Name: Tan0006652
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ubiquitinyl hydrolase 1
Location: LG08: 72808892 .. 72811956 (-)
RNA-Seq Expression: Tan0006652
Synteny: Tan0006652
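
The location string can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'LG08: 72808892 .. 72811956 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("LG08: 72808892 .. 72811956 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3065 bp on the minus strand of LG08.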
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGCATTTTCTTTGAGAGAGAATAAGAAGCATAGACCAAAAGTTACACGCATAGCTTGGCTGATTCTAGGGTTTATTTCTCTCTTCTTCTTCTTCCTTTCTTCTCTCTCTTTCCATTTTCTCTTTGATTAGATTATGGAGGGAGACAGCAATGGAGGTATGTTGTATCATGAGGTACAAGAATCCAAGCTTTGCGCTGTGCATTGTGTCAATACCGTGTTGCAGGGTCCCTTCTTCTCCGAATTCGATTTGGCTGCTTTGGCATCCGATCTTGACCGGAAAGAGCGCCAGGTGATGCTTGGTGGATCAACTACCGGTGATTTTCTCTCCGAGGAGTCTCACAATGTCTCCCTGGACGGTGATTTTAGTATCCAGGTTTGATTCCCTGTCTGTTTTGTTTGTTATTCTTTTCTCGTTCTTCAAGATAATTGAGGAAAAGTGAAGGACTGCTTTGGATGTCTGTTGATTTCGTCTGATAATGGACTAAAGTCCTGCCTCGTGTTATCAACAGCTTTGATCGATTTAGTGTAATTACGTAGCTTCCTCCGTGGATAGTCTTATATAACGCGTCTTCAATCTACTCGAATTCCAAACGTTATTGAATTTTTTGAAGTACATTCAAGAACAGTGGTTAATGAATATACCGAACATCTTACGGAAAGCGGTTGAATTGCGATCATTGTGTCTGAGTTGTGTATTATTTTAACGGTAGAAATAGATAATGTAAAGTGACAAACCGTTGTAAGGGCACAGTTACACTCATTATGAGCCATTTGCGTAATGGTTATAGCTGTTAATGTTTTGAATTTTAAAAGAACTGATACAGTGACTTGTATTATATGGGAATTCACACGTACATAACAGTACAGCAATGAACATTTTTGAAGACAATGTTACTACTTGCAAAAACCCCTAAGCTACTATTCTCCTTTGTCCCAGCCTCATCTGCCTGAGGTAGCAGTGATGAGTAGTGTAGTGATGCAAAATTTGAATAACATTGGGATGCATGACCTTTGTGATAATACTTGCCACACTGTACGGCTGTACCTGATAATATAGTGACAAGTTTATTGTGATCCATTCAAGGGATGTACGTTTCTTGCTTGAATATATGGTATGGTTAAAGACCGTTAGAAAGATTGTTACTAGCAAGTGGTGGTTAATAAAGAGGGGATGAGTGAGGGAGTAGACATTGATTTATTGAGTATACAAAGTAGATTGAAAGTGGGCAGGTAAGGACAGGGCTCTTGATCTTTCTTCCCAAATCTTGTTATTTTTTTTCTGATCTCAATATTAATATCCTTTAGTTCCCATTAACTACACCTGGGAAGGCACATTGCTTAATGAATTTTGGTCTTGAGAATATGGTGACCATGTGAACCATAATTACTTTATGCGTAGAAGTATGAAAACAGGAAGTAGGGAGGTAAAGTTATACCAAATGTGCCTCTGAAGCCATCTATTGTTCTGGAATTAGGTGTTGAAGCTGGCTCAGCTCAGGTTTTGTGTGGAGATTGACCTTATTTTGGATACTCAAATTTAGCACAGTGAATATCAGATCTCTTTTCTCTTTCATTTTTTGGGTATAGTAAAGTTAAACAATGAAGTGTTCAGAGTGATCGGAGTTAGTTTTTTATAAACCAGTTTCTCTTCTATGTGAATATAAGCAATACATAGAACCTGTGGGATGTTCTGATTTATTTAAACCATTATATTTTCTTGCAGGTTTTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTCCTCTCAACTCGCCAGTTGCTGAACCTGCGCAGATTGATCCTGAACTGGAGAATGCATTTATATGCCACTTGCAAGATCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTATAATTTTGATAGTCTTTATGCAGCCCCACAACATCTTTCTAAGTTTTACCTTTCAGCCTACTTAGACTCTCTAAAGGGCTTTGGTTGGAGCATCTTTATAGTGAGGGGTAACTTCCCCA
AGGATTTTCCCATCTCATCCTCCGAAGCATCTAATGGTTATGGTCAGTGGCTTTCCCCTGAGGATGCCGAGAGGATAACCAAATCGTGCAACTCTACCCAAGCCCCCCCTCCTCCACAAAGAGTAAACTGGACAGAGCAGCAAGATACAATTTTTTCAACTGGAGAAGCAGAAATGCTTATGGACATGGAGGATGAAGACTTGAAGGCTGCAATAGCAGCTAGCCTTTTGGATTCGTCAGCAGCACTGGCAACAGGAGTTGCTAACCCCCAAAATGAACCTGCAGTTTCCTCCACCCAAGCTGCCTCCCCCCAGAATATAGCTGCTGTTTCCCTCGAAGCTGCCAACACCCAAGATGTACCTGCAGTTTCCCCAAAAGCTGCAACTCTTCAAGATGTACCTGAAGTTTCCACCAAACCTGCCTGCCCCCAAAATGTACCTGTTGTTTCCCCTGAAGCTTCCACGTCCCAGGATGTACGTGCAGTTCCCCCCGAAGCTGTTGCTACCCCCCAAGATGTTCATGCAGTTTCTGCCAAATCTGCCACTCCCAATAATGAACCTGCAATCTGCACCGAAGTTGCTATGCATCAAAACGAGTCTGCAAACAAATCTACAGGCAATGCAGACGCTGACTTTCATGAAAGTGGACCTGCAGATAATGCAGAATGTGCCATTTCCAGCCCTCGAAAGAAAATTAGTCGTACGAATGAGGGATCTTCTGCGTGAGGAAGAGTGTTTTCTGCCTTGAATTTGATTTTTGGTTCAAGATTGATGAGAACGGGATTGACACATAGACAAAAGAAGAGAATAATTTGCTGGAGAATGTGGAACACCCAAATCCATGTTGAGTTGAGCCTGAGGTTAGCTTAGCTAGTTGAAGTTGAATCTATGCTTTACAGTCGTTCTGTGCAGTGTAACTATTTTCACCTCACAATTTCAATTTTCAGATTGATACTGTATAAACTTTGAAAAAAATAAGCCGTCGGCTTTGGCCACTGGATTGTCACAAATGTTCGATTACAAGTTCACCAAATCTGATGATCAGAATTCATTACAAGTTCATTTG

mRNA sequence

CGCATTTTCTTTGAGAGAGAATAAGAAGCATAGACCAAAAGTTACACGCATAGCTTGGCTGATTCTAGGGTTTATTTCTCTCTTCTTCTTCTTCCTTTCTTCTCTCTCTTTCCATTTTCTCTTTGATTAGATTATGGAGGGAGACAGCAATGGAGGTATGTTGTATCATGAGGTACAAGAATCCAAGCTTTGCGCTGTGCATTGTGTCAATACCGTGTTGCAGGGTCCCTTCTTCTCCGAATTCGATTTGGCTGCTTTGGCATCCGATCTTGACCGGAAAGAGCGCCAGGTGATGCTTGGTGGATCAACTACCGGTGATTTTCTCTCCGAGGAGTCTCACAATGTCTCCCTGGACGGTGATTTTAGTATCCAGGTTTTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTCCTCTCAACTCGCCAGTTGCTGAACCTGCGCAGATTGATCCTGAACTGGAGAATGCATTTATATGCCACTTGCAAGATCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTATAATTTTGATAGTCTTTATGCAGCCCCACAACATCTTTCTAAGTTTTACCTTTCAGCCTACTTAGACTCTCTAAAGGGCTTTGGTTGGAGCATCTTTATAGTGAGGGGTAACTTCCCCAAGGATTTTCCCATCTCATCCTCCGAAGCATCTAATGGTTATGGTCAGTGGCTTTCCCCTGAGGATGCCGAGAGGATAACCAAATCGTGCAACTCTACCCAAGCCCCCCCTCCTCCACAAAGAGTAAACTGGACAGAGCAGCAAGATACAATTTTTTCAACTGGAGAAGCAGAAATGCTTATGGACATGGAGGATGAAGACTTGAAGGCTGCAATAGCAGCTAGCCTTTTGGATTCGTCAGCAGCACTGGCAACAGGAGTTGCTAACCCCCAAAATGAACCTGCAGTTTCCTCCACCCAAGCTGCCTCCCCCCAGAATATAGCTGCTGTTTCCCTCGAAGCTGCCAACACCCAAGATGTACCTGCAGTTTCCCCAAAAGCTGCAACTCTTCAAGATGTACCTGAAGTTTCCACCAAACCTGCCTGCCCCCAAAATGTACCTGTTGTTTCCCCTGAAGCTTCCACGTCCCAGGATGTACGTGCAGTTCCCCCCGAAGCTGTTGCTACCCCCCAAGATGTTCATGCAGTTTCTGCCAAATCTGCCACTCCCAATAATGAACCTGCAATCTGCACCGAAGTTGCTATGCATCAAAACGAGTCTGCAAACAAATCTACAGGCAATGCAGACGCTGACTTTCATGAAAGTGGACCTGCAGATAATGCAGAATGTGCCATTTCCAGCCCTCGAAAGAAAATTAGTCGTACGAATGAGGGATCTTCTGCGTGAGGAAGAGTGTTTTCTGCCTTGAATTTGATTTTTGGTTCAAGATTGATGAGAACGGGATTGACACATAGACAAAAGAAGAGAATAATTTGCTGGAGAATGTGGAACACCCAAATCCATGTTGAGTTGAGCCTGAGGTTAGCTTAGCTAGTTGAAGTTGAATCTATGCTTTACAGTCGTTCTGTGCAGTGTAACTATTTTCACCTCACAATTTCAATTTTCAGATTGATACTGTATAAACTTTGAAAAAAATAAGCCGTCGGCTTTGGCCACTGGATTGTCACAAATGTTCGATTACAAGTTCACCAAATCTGATGATCAGAATTCATTACAAGTTCATTTG

Coding sequence (CDS)

ATGGAGGGAGACAGCAATGGAGGTATGTTGTATCATGAGGTACAAGAATCCAAGCTTTGCGCTGTGCATTGTGTCAATACCGTGTTGCAGGGTCCCTTCTTCTCCGAATTCGATTTGGCTGCTTTGGCATCCGATCTTGACCGGAAAGAGCGCCAGGTGATGCTTGGTGGATCAACTACCGGTGATTTTCTCTCCGAGGAGTCTCACAATGTCTCCCTGGACGGTGATTTTAGTATCCAGGTTTTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTCCTCTCAACTCGCCAGTTGCTGAACCTGCGCAGATTGATCCTGAACTGGAGAATGCATTTATATGCCACTTGCAAGATCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTATAATTTTGATAGTCTTTATGCAGCCCCACAACATCTTTCTAAGTTTTACCTTTCAGCCTACTTAGACTCTCTAAAGGGCTTTGGTTGGAGCATCTTTATAGTGAGGGGTAACTTCCCCAAGGATTTTCCCATCTCATCCTCCGAAGCATCTAATGGTTATGGTCAGTGGCTTTCCCCTGAGGATGCCGAGAGGATAACCAAATCGTGCAACTCTACCCAAGCCCCCCCTCCTCCACAAAGAGTAAACTGGACAGAGCAGCAAGATACAATTTTTTCAACTGGAGAAGCAGAAATGCTTATGGACATGGAGGATGAAGACTTGAAGGCTGCAATAGCAGCTAGCCTTTTGGATTCGTCAGCAGCACTGGCAACAGGAGTTGCTAACCCCCAAAATGAACCTGCAGTTTCCTCCACCCAAGCTGCCTCCCCCCAGAATATAGCTGCTGTTTCCCTCGAAGCTGCCAACACCCAAGATGTACCTGCAGTTTCCCCAAAAGCTGCAACTCTTCAAGATGTACCTGAAGTTTCCACCAAACCTGCCTGCCCCCAAAATGTACCTGTTGTTTCCCCTGAAGCTTCCACGTCCCAGGATGTACGTGCAGTTCCCCCCGAAGCTGTTGCTACCCCCCAAGATGTTCATGCAGTTTCTGCCAAATCTGCCACTCCCAATAATGAACCTGCAATCTGCACCGAAGTTGCTATGCATCAAAACGAGTCTGCAAACAAATCTACAGGCAATGCAGACGCTGACTTTCATGAAAGTGGACCTGCAGATAATGCAGAATGTGCCATTTCCAGCCCTCGAAAGAAAATTAGTCGTACGAATGAGGGATCTTCTGCGTGA

Protein sequence

MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDLKAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQDVPAVSPKAATLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHAVSAKSATPNNEPAICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISRTNEGSSA
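
The relationship between the coding sequence and the protein sequence above can be checked by translation. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGGAGGGAGACAGCAATGGAGGT"))
# Last four codons of the CDS, ending in the TGA stop codon.
print(translate("TCTTCTGCGTGA"))
```

The two fragments translate to the start (MEGDSNGG) and the end (SSA) of the listed protein sequence; passing the full CDS string to `translate` would reproduce the whole protein in the same way.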
Homology
BLAST of Tan0006652 vs. ExPASy Swiss-Prot
Match: Q9M391 (Ataxin-3 homolog OS=Arabidopsis thaliana OX=3702 GN=At3g54130 PE=1 SV=1)

HSP 1 Score: 381.7 bits (979), Expect = 1.0e-104
Identity = 198/264 (75.00%), Positives = 218/264 (82.58%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVML----- 60
           ME  SNGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A+DLD KERQVML     
Sbjct: 1   MERTSNGGMLYHEVQESNLCAVHCVNTVLQGPFFSEFDLAAVAADLDGKERQVMLEGAAV 60

Query: 61  GGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFI 120
           GG   GDFL+EESHNVSL GDFSIQVLQKALEVWDLQVIPLN P AEPAQIDPELE+AFI
Sbjct: 61  GGFAPGDFLAEESHNVSLGGDFSIQVLQKALEVWDLQVIPLNCPDAEPAQIDPELESAFI 120

Query: 121 CHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDF 180
           CHL DHWFCIRKVNGEWYNFDSL AAPQHLSKFYLSA+LDSLKG GWSIFIV+GNFP++ 
Sbjct: 121 CHLHDHWFCIRKVNGEWYNFDSLLAAPQHLSKFYLSAFLDSLKGAGWSIFIVKGNFPQEC 180

Query: 181 PI-SSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQ--DTIFSTGEAEML 240
           P+ SSSEASN +GQWLSPEDAERI K+ +S  +    +  +   QQ  +   S  E +  
Sbjct: 181 PMSSSSEASNSFGQWLSPEDAERIRKNTSSGSSARNKRSNDNVNQQRRNQALSREEVQAF 240

Query: 241 MDMEDEDLKAAIAASLLDSSAALA 257
            +MED+DLKAAIAASLLD+SAA A
Sbjct: 241 SEMEDDDLKAAIAASLLDASAAEA 264

BLAST of Tan0006652 vs. ExPASy Swiss-Prot
Match: Q8LQ36 (Putative ataxin-3 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=Os01g0851400 PE=3 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 1.6e-92
Identity = 177/269 (65.80%), Positives = 206/269 (76.58%), Query Frame = 0

Query: 5   SNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVM----LGGSTT 64
           SNGG+LYHEVQE KLCAVHCVNT LQGPFFSEFDL+ALA DLD++ERQVM     G +TT
Sbjct: 8   SNGGLLYHEVQEGKLCAVHCVNTTLQGPFFSEFDLSALAVDLDQRERQVMSEGAAGAATT 67

Query: 65  --GDFLS--EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFIC 124
             GDFL+  E SHNVSL GDFSIQVLQKALEVWDLQVIPL+SP       DPELE AFIC
Sbjct: 68  AAGDFLAEGEGSHNVSLGGDFSIQVLQKALEVWDLQVIPLDSPDVGSCLFDPELETAFIC 127

Query: 125 HLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFP 184
           HLQDHWFCIRKVNGEWYNF+SLY AP+HLSKFYLSA++D+LKG GWSIF VRGNFPK+ P
Sbjct: 128 HLQDHWFCIRKVNGEWYNFNSLYPAPEHLSKFYLSAFIDTLKGSGWSIFAVRGNFPKECP 187

Query: 185 ISSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDME 244
           + ++E SNG+GQWL+P+DA RIT SCN  Q P     V+    Q    S  E +M+   +
Sbjct: 188 M-ATEGSNGFGQWLTPDDARRITSSCNQVQTPTQQAGVSLVADQSEEMS--EMDMIAAQQ 247

Query: 245 DE-DLKAAIAASLLDSSAALATGVANPQN 265
           +E DL AAIAASL+D+    A   A+ ++
Sbjct: 248 EEADLNAAIAASLMDTGGPFANYAAHEES 273

BLAST of Tan0006652 vs. ExPASy Swiss-Prot
Match: P54252 (Ataxin-3 OS=Homo sapiens OX=9606 GN=ATXN3 PE=1 SV=5)

HSP 1 Score: 158.3 bits (399), Expect = 1.9e-37
Identity = 89/242 (36.78%), Positives = 139/242 (57.44%), Query Frame = 0

Query: 10  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTTGD----FLS 69
           ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL 
Sbjct: 4   IFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRTFLQ 63

Query: 70  EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQDHWFCI 129
           + S N+   G FSIQV+  AL+VW L++I  NSP  +  +IDP  E +FIC+ ++HWF +
Sbjct: 64  QPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHWFTV 123

Query: 130 RKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNG 189
           RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P        EA   
Sbjct: 124 RKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLP------DCEADQ- 183

Query: 190 YGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDLKAAIA 248
             Q +  +   R  K      A    QRV+ T+  + +    +   ++D ++EDL+ A+A
Sbjct: 184 LLQMIRVQQMHR-PKLIGEELAQLKEQRVHKTD-LERVLEANDGSGMLDEDEEDLQRALA 236

BLAST of Tan0006652 vs. ExPASy Swiss-Prot
Match: Q9CVD2 (Ataxin-3 OS=Mus musculus OX=10090 GN=Atxn3 PE=1 SV=2)

HSP 1 Score: 153.3 bits (386), Expect = 6.0e-36
Identity = 94/291 (32.30%), Positives = 154/291 (52.92%), Query Frame = 0

Query: 10  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTTGD----FLS 69
           ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL 
Sbjct: 4   IFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERLRMAEGGVTSEDYRTFLQ 63

Query: 70  EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQDHWFCI 129
           + S N+   G FSIQV+  AL+VW L++I  NSP  +  +IDP  E +FIC+ ++HWF +
Sbjct: 64  QPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHWFTV 123

Query: 130 RKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFP--------KDFPI 189
           RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P        +   +
Sbjct: 124 RKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQMIKV 183

Query: 190 SSSEASNGYGQWLS--------PEDAERITKSCNSTQAPPPPQRVNWTEQQDTI---FST 249
                    G+ L+          D ER+ ++ + +          + E +D +    + 
Sbjct: 184 QQMHRPKLIGEELAHLKEQSALKADLERVLEAADGSGI--------FDEDEDDLQRALAI 243

Query: 250 GEAEMLMDMEDEDLKAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQ 278
              E+ M+ E+ DL+ AI  S+  SS ++       +N P  SS   +S +
Sbjct: 244 SRQEIDMEDEEADLRRAIQLSMQGSSRSMC------ENSPQTSSPDLSSEE 280

BLAST of Tan0006652 vs. ExPASy Swiss-Prot
Match: O35815 (Ataxin-3 OS=Rattus norvegicus OX=10116 GN=Atxn3 PE=1 SV=1)

HSP 1 Score: 151.8 bits (382), Expect = 1.7e-35
Identity = 94/288 (32.64%), Positives = 151/288 (52.43%), Query Frame = 0

Query: 10  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTTGD----FLS 69
           ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL 
Sbjct: 4   IFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERLRMAEGGVTSEDYRTFLQ 63

Query: 70  EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQDHWFCI 129
           + S N+   G FSIQV+  AL+VW L++I  NSP  +  +IDP  E +FIC+ ++HWF +
Sbjct: 64  QPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHWFTV 123

Query: 130 RKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFP--------KDFPI 189
           RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P        +   +
Sbjct: 124 RKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQMIKV 183

Query: 190 SSSEASNGYGQWLS--------PEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEA 249
                    G+ L+          D ER+ ++     A  P    +  +      +    
Sbjct: 184 QQMHRPKLIGEELAHLKEQSALKADLERVLEA-----ADGPGMFDDDEDDLQRALAMSRQ 243

Query: 250 EMLMDMEDEDLKAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQ 278
           E+ M+ E+ DL+ AI  S+  SS  +       ++ P  SST  +S +
Sbjct: 244 EIDMEDEEADLRRAIQLSMQGSSRGMC------EDSPQTSSTDLSSEE 280

BLAST of Tan0006652 vs. NCBI nr
Match: XP_038904645.1 (ataxin-3 homolog [Benincasa hispida])

HSP 1 Score: 705.3 bits (1819), Expect = 3.1e-199
Identity = 361/426 (84.74%), Positives = 383/426 (89.91%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           M+G  NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+ML GSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQ DT  S+GE EML+DMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQHDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANT----------- 300
           KAAIAASLLDSSA +A G AN QNEPAVSSTQAASPQN+  VSLE A T           
Sbjct: 241 KAAIAASLLDSSAVMAAGAANSQNEPAVSSTQAASPQNVPVVSLETAKTEDAPIVSLKAS 300

Query: 301 --QDVPAVSPKAATLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHA 360
             QDVPAV PKAATLQDVP VS KP+ PQNVP VSP+ASTSQDVR + P+A ATPQD+HA
Sbjct: 301 TLQDVPAVFPKAATLQDVPVVSNKPSSPQNVPFVSPKASTSQDVRPLSPDATATPQDLHA 360

Query: 361 VS-AKSATPNNEPAICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISR 413
           VS  K+ATPNN+PA+CTEV++HQNES N+S GNA+A F ESGPADNAEC +SSPRKKISR
Sbjct: 361 VSTTKTATPNNKPAVCTEVSVHQNESGNESVGNAEAAFRESGPADNAECGVSSPRKKISR 420

BLAST of Tan0006652 vs. NCBI nr
Match: XP_008442472.1 (PREDICTED: ataxin-3 homolog [Cucumis melo] >KAA0044145.1 ataxin-3-like protein [Cucumis melo var. makuwa] >TYK24992.1 ataxin-3-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 702.2 bits (1811), Expect = 2.6e-198
Identity = 361/426 (84.74%), Positives = 380/426 (89.20%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           M+G  NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+ML GSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDT  S+GE EML+DMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQDVPAVSP--- 300
           KAAIAASLLDSSA +A GVANP NEP VSSTQA SPQN+ AVSLE ANTQDV AVSP   
Sbjct: 241 KAAIAASLLDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVSLETANTQDVLAVSPNAS 300

Query: 301 ----------KAATLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHA 360
                     KAATLQDVP +S K ACPQNVP VSPEASTSQDVRA+PP+A   PQD+HA
Sbjct: 301 ILQDVTAVSTKAATLQDVPAISAKAACPQNVPNVSPEASTSQDVRALPPDAADAPQDLHA 360

Query: 361 VS-AKSATPNNEPAICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISR 413
           VS  K+ATPN++ A+CTEV +HQNES N+S GNAD  F ESG ADN ECAISSPRKKISR
Sbjct: 361 VSTTKAATPNDKSAVCTEVVVHQNESGNESVGNADTAFCESGSADNTECAISSPRKKISR 420

BLAST of Tan0006652 vs. NCBI nr
Match: XP_004137739.1 (ataxin-3 homolog [Cucumis sativus] >KGN58800.1 hypothetical protein Csa_000861 [Cucumis sativus])

HSP 1 Score: 684.1 bits (1764), Expect = 7.5e-193
Identity = 351/426 (82.39%), Positives = 374/426 (87.79%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           M+G  NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+ML GSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDT  S+GE EML+DMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQ---------- 300
           KAAIAASL+DSSA +A GVANP NEP VSSTQA SPQN+ AV+LE ANTQ          
Sbjct: 241 KAAIAASLMDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVALETANTQDVLAVSPNAS 300

Query: 301 ---DVPAVSPKAATLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHA 360
              DVPAVSP+AATLQDVP +S K A PQN P VSPEASTSQDV  + P A   PQD+H 
Sbjct: 301 ILEDVPAVSPEAATLQDVPAISAKAASPQNAPNVSPEASTSQDVCELSPNAADIPQDLHT 360

Query: 361 VS-AKSATPNNEPAICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISR 413
           VS AK+A P N+ A+CTEV++HQNES N+S GNAD  F +SG ADN ECA+SSPRKKISR
Sbjct: 361 VSTAKAAIPKNKSAVCTEVSVHQNESGNESVGNADTAFCDSGSADNTECAVSSPRKKISR 420

BLAST of Tan0006652 vs. NCBI nr
Match: KAG6596364.1 (Ataxin-3-like protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7027914.1 Ataxin-3-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 666.0 bits (1717), Expect = 2.1e-187
Identity = 348/411 (84.67%), Positives = 361/411 (87.83%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           MEG SNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+MLGGSTT
Sbjct: 1   MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLGGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPI S+
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPILST 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQ PPPPQRV+WTEQQDT  STGEAEMLMDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQNPPPPQRVSWTEQQDTFLSTGEAEMLMDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQDVPAVSPKAA 300
           KAAIAASL+DSSA +A GV NPQ EPA SSTQAASP N+  VSLEAA   DVPAVS K A
Sbjct: 241 KAAIAASLMDSSAVMAEGVGNPQKEPAASSTQAASPHNVPVVSLEAAANTDVPAVSNKPA 300

Query: 301 TLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHAVSAKSATPNNEPA 360
           +LQ+   V           VVSPEASTSQDV        AT QD HAVSAK+A+PNNEP 
Sbjct: 301 SLQNDVAV-----------VVSPEASTSQDVH-------ATIQDAHAVSAKAASPNNEPT 360

Query: 361 ICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISRTNEGS 412
              EVAMHQNES NKSTGNADA FHES PADNA+C ISSPRKKISRT+EG+
Sbjct: 361 TSPEVAMHQNESENKSTGNADAAFHESEPADNADCTISSPRKKISRTDEGT 393

BLAST of Tan0006652 vs. NCBI nr
Match: XP_022971387.1 (ataxin-3 homolog [Cucurbita maxima])

HSP 1 Score: 665.2 bits (1715), Expect = 3.6e-187
Identity = 346/411 (84.18%), Positives = 362/411 (88.08%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           MEG SNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+MLGGSTT
Sbjct: 1   MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLGGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPI S+
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPILST 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQ PPPPQRV+WTEQQDT  STGEAEMLMDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQNPPPPQRVSWTEQQDTFLSTGEAEMLMDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQDVPAVSPKAA 300
           KAAIAASL+DSSA +A GV NPQ EPA SSTQAASP N+  VSLEAA   DVPAV+ K A
Sbjct: 241 KAAIAASLMDSSAVMAAGVGNPQKEPAASSTQAASPDNVPVVSLEAAANTDVPAVANKPA 300

Query: 301 TLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHAVSAKSATPNNEPA 360
           +LQ+            +V VVSPEASTSQ+V        AT QD HAVSAK+A+PNNEP 
Sbjct: 301 SLQN------------DVAVVSPEASTSQNVH-------ATVQDAHAVSAKAASPNNEPT 360

Query: 361 ICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISRTNEGS 412
           I  EVAMHQNES NKST NADA FHES PADNA+C ISSPRKKISRT+EG+
Sbjct: 361 ISPEVAMHQNESENKSTDNADAAFHESEPADNADCTISSPRKKISRTDEGT 392

BLAST of Tan0006652 vs. ExPASy TrEMBL
Match: A0A5D3DN96 (Ubiquitinyl hydrolase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G001100 PE=4 SV=1)

HSP 1 Score: 702.2 bits (1811), Expect = 1.3e-198
Identity = 361/426 (84.74%), Positives = 380/426 (89.20%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           M+G  NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+ML GSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDT  S+GE EML+DMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQDVPAVSP--- 300
           KAAIAASLLDSSA +A GVANP NEP VSSTQA SPQN+ AVSLE ANTQDV AVSP   
Sbjct: 241 KAAIAASLLDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVSLETANTQDVLAVSPNAS 300

Query: 301 ----------KAATLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHA 360
                     KAATLQDVP +S K ACPQNVP VSPEASTSQDVRA+PP+A   PQD+HA
Sbjct: 301 ILQDVTAVSTKAATLQDVPAISAKAACPQNVPNVSPEASTSQDVRALPPDAADAPQDLHA 360

Query: 361 VS-AKSATPNNEPAICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISR 413
           VS  K+ATPN++ A+CTEV +HQNES N+S GNAD  F ESG ADN ECAISSPRKKISR
Sbjct: 361 VSTTKAATPNDKSAVCTEVVVHQNESGNESVGNADTAFCESGSADNTECAISSPRKKISR 420

BLAST of Tan0006652 vs. ExPASy TrEMBL
Match: A0A1S3B6I4 (Ubiquitinyl hydrolase 1 OS=Cucumis melo OX=3656 GN=LOC103486329 PE=4 SV=1)

HSP 1 Score: 702.2 bits (1811), Expect = 1.3e-198
Identity = 361/426 (84.74%), Positives = 380/426 (89.20%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           M+G  NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+ML GSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDT  S+GE EML+DMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQDVPAVSP--- 300
           KAAIAASLLDSSA +A GVANP NEP VSSTQA SPQN+ AVSLE ANTQDV AVSP   
Sbjct: 241 KAAIAASLLDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVSLETANTQDVLAVSPNAS 300

Query: 301 ----------KAATLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHA 360
                     KAATLQDVP +S K ACPQNVP VSPEASTSQDVRA+PP+A   PQD+HA
Sbjct: 301 ILQDVTAVSTKAATLQDVPAISAKAACPQNVPNVSPEASTSQDVRALPPDAADAPQDLHA 360

Query: 361 VS-AKSATPNNEPAICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISR 413
           VS  K+ATPN++ A+CTEV +HQNES N+S GNAD  F ESG ADN ECAISSPRKKISR
Sbjct: 361 VSTTKAATPNDKSAVCTEVVVHQNESGNESVGNADTAFCESGSADNTECAISSPRKKISR 420

BLAST of Tan0006652 vs. ExPASy TrEMBL
Match: A0A0A0L9Z8 (Ubiquitinyl hydrolase 1 OS=Cucumis sativus OX=3659 GN=Csa_3G732570 PE=4 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 3.6e-193
Identity = 351/426 (82.39%), Positives = 374/426 (87.79%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           M+G  NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+ML GSTT
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDP+LENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQAPPPPQR NWTEQQDT  S+GE EML+DMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRANWTEQQDTFLSSGETEMLIDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQ---------- 300
           KAAIAASL+DSSA +A GVANP NEP VSSTQA SPQN+ AV+LE ANTQ          
Sbjct: 241 KAAIAASLMDSSAVMAAGVANPPNEPVVSSTQAGSPQNVPAVALETANTQDVLAVSPNAS 300

Query: 301 ---DVPAVSPKAATLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHA 360
              DVPAVSP+AATLQDVP +S K A PQN P VSPEASTSQDV  + P A   PQD+H 
Sbjct: 301 ILEDVPAVSPEAATLQDVPAISAKAASPQNAPNVSPEASTSQDVCELSPNAADIPQDLHT 360

Query: 361 VS-AKSATPNNEPAICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISR 413
           VS AK+A P N+ A+CTEV++HQNES N+S GNAD  F +SG ADN ECA+SSPRKKISR
Sbjct: 361 VSTAKAAIPKNKSAVCTEVSVHQNESGNESVGNADTAFCDSGSADNTECAVSSPRKKISR 420

BLAST of Tan0006652 vs. ExPASy TrEMBL
Match: A0A6J1I1T6 (Ubiquitinyl hydrolase 1 OS=Cucurbita maxima OX=3661 GN=LOC111470125 PE=4 SV=1)

HSP 1 Score: 665.2 bits (1715), Expect = 1.7e-187
Identity = 346/411 (84.18%), Positives = 362/411 (88.08%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           MEG SNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+MLGGSTT
Sbjct: 1   MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLGGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPI S+
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPILST 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQ PPPPQRV+WTEQQDT  STGEAEMLMDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQNPPPPQRVSWTEQQDTFLSTGEAEMLMDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQDVPAVSPKAA 300
           KAAIAASL+DSSA +A GV NPQ EPA SSTQAASP N+  VSLEAA   DVPAV+ K A
Sbjct: 241 KAAIAASLMDSSAVMAAGVGNPQKEPAASSTQAASPDNVPVVSLEAAANTDVPAVANKPA 300

Query: 301 TLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHAVSAKSATPNNEPA 360
           +LQ+            +V VVSPEASTSQ+V        AT QD HAVSAK+A+PNNEP 
Sbjct: 301 SLQN------------DVAVVSPEASTSQNVH-------ATVQDAHAVSAKAASPNNEPT 360

Query: 361 ICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISRTNEGS 412
           I  EVAMHQNES NKST NADA FHES PADNA+C ISSPRKKISRT+EG+
Sbjct: 361 ISPEVAMHQNESENKSTDNADAAFHESEPADNADCTISSPRKKISRTDEGT 392

BLAST of Tan0006652 vs. ExPASy TrEMBL
Match: A0A6J1FRR9 (Ubiquitinyl hydrolase 1 OS=Cucurbita moschata OX=3662 GN=LOC111446649 PE=4 SV=1)

HSP 1 Score: 652.5 bits (1682), Expect = 1.2e-183
Identity = 343/411 (83.45%), Positives = 356/411 (86.62%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVMLGGSTT 60
           MEG SNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQ+MLGGSTT
Sbjct: 1   MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLGGSTT 60

Query: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120
           GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD
Sbjct: 61  GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFICHLQD 120

Query: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSS 180
           HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPI S+
Sbjct: 121 HWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPILST 180

Query: 181 EASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQDTIFSTGEAEMLMDMEDEDL 240
           EASNGYGQWLSPEDAERITKSCNSTQ PPPPQRV+WTEQQDT  STGEAEMLMDMEDEDL
Sbjct: 181 EASNGYGQWLSPEDAERITKSCNSTQNPPPPQRVSWTEQQDTFLSTGEAEMLMDMEDEDL 240

Query: 241 KAAIAASLLDSSAALATGVANPQNEPAVSSTQAASPQNIAAVSLEAANTQDVPAVSPKAA 300
           KAAIAASL+DSSA +A GV NPQ EPA SST      N+  VSLEAA   DVPAVS K A
Sbjct: 241 KAAIAASLMDSSAVMAAGVGNPQKEPAASST-----HNVPVVSLEAAANTDVPAVSNKPA 300

Query: 301 TLQDVPEVSTKPACPQNVPVVSPEASTSQDVRAVPPEAVATPQDVHAVSAKSATPNNEPA 360
           +LQ+   V           VVSPEASTSQDV        AT QD HAVSAK+A+PNNEP 
Sbjct: 301 SLQNDVAV-----------VVSPEASTSQDVH-------ATIQDAHAVSAKAASPNNEPT 360

Query: 361 ICTEVAMHQNESANKSTGNADADFHESGPADNAECAISSPRKKISRTNEGS 412
              EVAMHQNES NKSTGNADA FHES PADNA+C ISSPRKKISRT+EG+
Sbjct: 361 TSPEVAMHQNESENKSTGNADAAFHESEPADNADCTISSPRKKISRTDEGT 388

BLAST of Tan0006652 vs. TAIR 10
Match: AT3G54130.1 (Josephin family protein )

HSP 1 Score: 381.7 bits (979), Expect = 7.4e-106
Identity = 198/264 (75.00%), Positives = 218/264 (82.58%), Query Frame = 0

Query: 1   MEGDSNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQVML----- 60
           ME  SNGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A+DLD KERQVML     
Sbjct: 1   MERTSNGGMLYHEVQESNLCAVHCVNTVLQGPFFSEFDLAAVAADLDGKERQVMLEGAAV 60

Query: 61  GGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPELENAFI 120
           GG   GDFL+EESHNVSL GDFSIQVLQKALEVWDLQVIPLN P AEPAQIDPELE+AFI
Sbjct: 61  GGFAPGDFLAEESHNVSLGGDFSIQVLQKALEVWDLQVIPLNCPDAEPAQIDPELESAFI 120

Query: 121 CHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDF 180
           CHL DHWFCIRKVNGEWYNFDSL AAPQHLSKFYLSA+LDSLKG GWSIFIV+GNFP++ 
Sbjct: 121 CHLHDHWFCIRKVNGEWYNFDSLLAAPQHLSKFYLSAFLDSLKGAGWSIFIVKGNFPQEC 180

Query: 181 PI-SSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQ--DTIFSTGEAEML 240
           P+ SSSEASN +GQWLSPEDAERI K+ +S  +    +  +   QQ  +   S  E +  
Sbjct: 181 PMSSSSEASNSFGQWLSPEDAERIRKNTSSGSSARNKRSNDNVNQQRRNQALSREEVQAF 240

Query: 241 MDMEDEDLKAAIAASLLDSSAALA 257
            +MED+DLKAAIAASLLD+SAA A
Sbjct: 241 SEMEDDDLKAAIAASLLDASAAEA 264

BLAST of Tan0006652 vs. TAIR 10
Match: AT2G29640.1 (JOSEPHIN-like protein)

HSP 1 Score: 57.8 bits (138), Expect = 2.4e-08
Identity = 40/147 (27.21%), Positives = 68/147 (46.26%), Query Frame = 0

Query: 10  LYHEVQESKLCAVHCVNTVLQG-PFFSEFDLAALASDLDRKERQVMLGGSTTGDFLSEES 69
           +YHE Q  + C +HC+N + Q    F++  L ++A  L+  +        T   F+ +  
Sbjct: 8   IYHERQRLQFCLLHCLNNLFQDKDAFTKESLNSIAEKLETNDPNKETW--TPLSFVLKPH 67

Query: 70  HNVSLDGDFSIQVLQKALE------VWDLQVIPLNSPVAEPAQ------IDPELENAFIC 129
           HN ++ G++ + V+  ALE      VW  + I  +S   + A       ++  ++     
Sbjct: 68  HN-TITGNYDVNVMITALEGKGKSVVWHDKRIGASSIDLDDADTLMGIVLNVPVKRYGGL 127

Query: 130 HLQDHWFCIRKVNGEWYNFDSLYAAPQ 144
               HW  +RK+NG WYN DS    PQ
Sbjct: 128 WRSRHWVVVRKINGVWYNLDSDLVVPQ 151
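
Each HSP above opens with a score line followed by a statistics line, and the percentage in the Identity field is simply the identical-residue count divided by the aligned length. The following is a minimal sketch of reading those two lines and recomputing the percentage as a consistency check. The function `parse_hsp` and the regular expressions are assumptions written for the layout shown on this page, not an official BLAST parser.

```python
import re

# Patterns tailored to the header layout shown on this page (an assumption).
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_IDENT = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp(score_line: str, stats_line: str) -> dict:
    """Extract bit score, E-value and identity counts from two HSP header lines."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(stats_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP header")
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "identical": matches,
        "aligned_length": length,
        "reported_pct": float(i.group(3)),
        # Recomputed from the counts, rounded like the reported value.
        "computed_pct": round(100 * matches / length, 2),
    }


# Header of the AT3G54130.1 hit; only the Identity field is passed in.
hsp = parse_hsp(
    "HSP 1 Score: 381.7 bits (979), Expect = 7.4e-106",
    "Identity = 198/264 (75.00%)",
)
print(hsp["computed_pct"], hsp["expect"])
```

Comparing `computed_pct` with `reported_pct` is a cheap way to catch a mis-split line when the page text has been scraped.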

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
Q9M391          1.0e-104  75.00         Ataxin-3 homolog OS=Arabidopsis thaliana OX=3702 GN=At3g54130 PE=1 SV=1
Q8LQ36          1.6e-92   65.80         Putative ataxin-3 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=Os01g08514...
P54252          1.9e-37   36.78         Ataxin-3 OS=Homo sapiens OX=9606 GN=ATXN3 PE=1 SV=5
Q9CVD2          6.0e-36   32.30         Ataxin-3 OS=Mus musculus OX=10090 GN=Atxn3 PE=1 SV=2
O35815          1.7e-35   32.64         Ataxin-3 OS=Rattus norvegicus OX=10116 GN=Atxn3 PE=1 SV=1

Match Name      E-value   Identity (%)  Description
XP_038904645.1  3.1e-199  84.74         ataxin-3 homolog [Benincasa hispida]
XP_008442472.1  2.6e-198  84.74         PREDICTED: ataxin-3 homolog [Cucumis melo] >KAA0044145.1 ataxin-3-like protein [...
XP_004137739.1  7.5e-193  82.39         ataxin-3 homolog [Cucumis sativus] >KGN58800.1 hypothetical protein Csa_000861 [...
KAG6596364.1    2.1e-187  84.67         Ataxin-3-like protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG70279...
XP_022971387.1  3.6e-187  84.18         ataxin-3 homolog [Cucurbita maxima]

Match Name      E-value   Identity (%)  Description
A0A5D3DN96      1.3e-198  84.74         Ubiquitinyl hydrolase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B6I4      1.3e-198  84.74         Ubiquitinyl hydrolase 1 OS=Cucumis melo OX=3656 GN=LOC103486329 PE=4 SV=1
A0A0A0L9Z8      3.6e-193  82.39         Ubiquitinyl hydrolase 1 OS=Cucumis sativus OX=3659 GN=Csa_3G732570 PE=4 SV=1
A0A6J1I1T6      1.7e-187  84.18         Ubiquitinyl hydrolase 1 OS=Cucurbita maxima OX=3661 GN=LOC111470125 PE=4 SV=1
A0A6J1FRR9      1.2e-183  83.45         Ubiquitinyl hydrolase 1 OS=Cucurbita moschata OX=3662 GN=LOC111446649 PE=4 SV=1

Match Name      E-value   Identity (%)  Description
AT3G54130.1     7.4e-106  75.00         Josephin family protein
AT2G29640.1     2.4e-08   27.21         JOSEPHIN-like protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                 Source       Source Term    Source Description   Alignment
None       No IPR available                PRINTS       PR01233        JOSEPHIN             coord: 153..172, score: 45.0
                                                                                            coord: 11..34, score: 58.33
                                                                                            coord: 129..149, score: 33.33
                                                                                            coord: 108..127, score: 50.0
None       No IPR available                GENE3D       1.10.287.10                         coord: 33..74, e-value: 9.3E-62, score: 209.4
None       No IPR available                GENE3D       3.90.70.40                          coord: 19..172, e-value: 9.3E-62, score: 209.4
None       No IPR available                MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 302..332
None       No IPR available                MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 369..413
None       No IPR available                PANTHER      PTHR14159:SF0  ATAXIN-3             coord: 1..403
IPR006155  Josephin domain                 SMART        SM01246        Josephin_2           coord: 14..170, e-value: 2.7E-78, score: 276.1
IPR006155  Josephin domain                 PFAM         PF02099        Josephin             coord: 15..168, e-value: 7.2E-49, score: 166.0
IPR006155  Josephin domain                 PROSITE      PS50957        JOSEPHIN             coord: 7..182, score: 24.48288
IPR033865  Machado-Joseph disease protein  PANTHER      PTHR14159      ATAXIN-3-RELATED     coord: 1..403
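
The three Josephin-domain hits in the InterPro table above report slightly different boundaries for the same region. The sketch below parses the `coord: a..b` strings and checks whether the SMART and Pfam ranges fall inside the PROSITE range. The helpers `parse_coord` and `contains` are hypothetical, and the coordinates are assumed to be 1-based and inclusive protein positions.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple[int, int]:
    """Return (start, end) from a 'coord: a..b' string (assumed 1-based, inclusive)."""
    m = COORD.search(text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(m.group(1)), int(m.group(2))


def contains(outer: tuple[int, int], inner: tuple[int, int]) -> bool:
    """True when the inner range lies entirely inside the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Josephin-domain hits as listed in the InterPro table above.
hits = {
    "PROSITE PS50957": parse_coord("coord: 7..182"),
    "SMART SM01246": parse_coord("coord: 14..170"),
    "PFAM PF02099": parse_coord("coord: 15..168"),
}

prosite = hits["PROSITE PS50957"]
for name, span in hits.items():
    length = span[1] - span[0] + 1  # inclusive length in residues
    print(name, span, length, contains(prosite, span))
```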

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0006652.1  Tan0006652.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016579      protein deubiquitination
cellular_component  GO:0005634      nucleus
molecular_function  GO:0004843      thiol-dependent deubiquitinase