Cp4.1LG03g06470 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG03g06470
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionOTU domain-containing protein
LocationCp4.1LG03 : 4033110 .. 4040781 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGCGCCAAATCGTAAAGATTGGTGGAAGAAGGTGTTGAAGTGGGACCCACGCTCATCTGCGGCCATTGACTGTTACTTCCGGAGAAGGGAGAAGCCGCGAAAGCTACAGGCTGTGGGGGTCTGGGTCTACGAGGCGTTCCAATGAAGCCCCCCATTTCTTTCCGTCCCACGAATTCTTAACCCCTTCCCTGAAACATCCTAATCCCGTTCCACCTCAAACCATCCTCCACACCATCCCCGAAGTCAAATCGGACGCAGTTGCAGGTATAATTGATCTGTTCCTTACTTCTTACCCGCAGATTTCAGGTATGAATCGCCTGGTTCTTCCATATCCCTGCCTTTTTTATCTGTTTGAATTCTTCCTTGTTGTTCCTTTCTTTTAGTTTCTTCTTCAATCCTGGTTGCAATAATATGCCCAACCATGGAAGATACACCATGCGCCTTTTACATTTCACCTTCTTTTTATCGATTGATTTGATTATGTACTTCTAATTGGATAAATGTCTGTCTATGCGCGTTTAGTGGTGTTCTTCATTTGATTTTGCAGGATTTCTCATATTATGATTTTGTCTCCTCTAATAAATATAGCACTTTCATTGTCATGTTATGTTTGTTCTTGATCTACCATGTATTTTTCTCACTCCTCGAATAGTTTGAGCTCCAACTACACATTCGTGGAACAGGCGTTATTTTTTTTATTTTTTTAATGTTTCTGCAGCATGATAGCATGACTTTTTCTTGAATGTAATGGTGAGTTATTGCTAAGTGGCAGTATGAGGCCGGGAACTCTGAGCGTTGGAGAATGTTCGAGTTCTACTTCATTGAGCTCTCATCAAGACATTGACGATGATCGCATGATTGCTGTTGCATTAACGGAAGAGTACGCTAAGCTAGACGGTGGTGTTGCCAGACGCCTCTCCAACCTTGCCCCTATTGCTGTATGAATCTTTGCTCTTTTCCTTTTGTTCTAGCTTAGGTTTTTTTATTTTGTTTTGTTTTGTTTTTTCTTTTCATAATTTTCAAGCCACCGCTTTACCCTTCAAAGAGATTATTAGTTATCACTTCTCATATGAAAATATGAAAGTGTTCTCTCTTTTTAGGACGTTGTCCTGAAACTTTATTCAAATAGAATGTTCTGAGATGATTGATTTATTGGTTTTCTGATTATTCTCCTCATATCTTGGCTTATGAACTATTGTTGCTCCATCTTTTGAAAAAAGATATTACCTGAACTCATATGTTAAACTTAACTCTTTGCGGTATTCCAGCATACTCCAAGGGTAAATTTGTACATCCCCACCCAAAGTGATGCGAGTTTGGAGTATCATAGGCTTCTTCAGAGGTACGTATCCCTCATTTTCTAAAGTAAAAGATAGCTGGGGCTTGGTCAACAGATCTATATGTATGAAAAATACCACCTTTTCTCTCTGTTGAATAATGATGAAATTTAGCATAAAGAACAACTGTTAAAGAATTTTTACAAGCGACACATGCACATTCCCTATCTAGAGAGAGAGAGAGAGAGAGAGAGAGAGACATGCACATTCATGAGCAAGCATTGGTGGATTGAAAATTACTTCCAGGGAGTTGAAACGGTGCATAAATTCTCAAGTTTACTAGGTGGGTGATTTTGTAATCATGATCCGGTTGAAGGTCACAAAACCATTATGTCCAATTGATGACCTTATTGTTACTACTATTATGTTATATTCTTTATCGTGATTTAGGCTGTCTTAAAGGGATGTCATCTTCTGTCCCATAGCTGGTTCCTTGATTGAGGAATAGTTCAAACGAATCTAAAATTTTCATTACAAAATTAACAAATATAATAGCAAGTCATCACGATCCCTAACTTCGCCAACTTGAGGCCATTTGAGACAGCTTATTAGCCTACAGTCCAGTTTGAGCCCTCCAAATCCTAGCATATATTCTTTCAGGGTGCAGTTCTTCTCTGTATCTCCCTTTACAACATGAACGAAGTTCATTATGGGTTTTCTTTCTTGCTTACTTTCAGTCCTCTTATTCAGTTGCTCCTGCAACTTCAATTTAATCCATGCCTCTAACTTCTTAAATTATGCTTTGGATAAGAGCTTAGCTTTTCAAAATGAATATGGCATCATAGGAATAATCCAAAAAAAAAGTAGCTATTGTGTTTTTCCCTCCTTGGATGTTAATTGTTATTACTGTGATAATATCTAAACCCCAATAAGGATCGCTTGGATTTTCATTGCATCCATGTGTTGTCAAGAGGGAATTCATGAAAATTACTCAGGATAAAAACTGATGGTTGTATAAGTCTATGTTTAATGCCATTAAAATTTACAATCTCCCTTGGCTTCTCTTCTCCCTCCTACCACCTGTGATGAAGTAGTTTTCATCTCATTGTGTTTCAGGCTAAATGTCTATGGTTTGCATGAAGTGAAGGTCTCTGGAGATGGAAATTGTCAGGTATGTTGATTTTTTAAGTAAGAAGTAGAAGCTTCGGTTTTATCTACAATTTACGTATAGTTTCTTCTTATTTGGCAGTTTCGAGCACTTTCAGATCAGCTGTACAAATCACCTGAGTATCACAAGCACGTGCGGAAAGACATTGTAAAGCAGGTAGACTGTCATATCTCGAGGTGCTTGAACTAGGGAGAATAGTGGCTCCTTTTCTTTTGGGATGAAAAGCTGAGCTTTTGTGTAAGGAGATTGAAAGAGAAAGTAGTAGGGAAGGTTTTATGTAGCATGTGGCCGGAGTAGTTGGAAATGGGAGAATAAGTGGTAACTTTTCAGTTATATTTGGTTTTAGAATCTGCAACTTTGGTTTGTATTTTATCAGATCAGAGGCGTTCTCTTATACGGATTTTAATCAATATTTAAATCTGATCTGTTAGACTTGTTAGTAGTGGCTGGTAATGTCTTACTCATGTTTTGGTGTCAAACTGAAGTGAATAGAGTTTTCGAGTAAGGGTGAATATCTTGCTTGTGTATTACTCACCTGGGTTTATTTTCATATTCCAGCTAAAGGACTACCGTTCTCAATATGAAGGTTACGTTCCAATGAAGTTCAGTCGTTATTACAAGAAAATGGCGAAGTATGTGAAATTGTTCACGATCATACTCTCTCTTTCTCTCTTATTGAATCGATGGTCATCTGTGGATTAACTGATCTTAAACATGTTCTTGAGGAAAGGTCTCATTTTTTTATTTCTTAATATATGATGCAGATCTGGTGAATGGGGGGACCATGTTACCCTTCAAGCAGCAGCTGATAAGGTATATCATATGGCAATATTAACTGTCACCTTTTTATTTGCTACCTTATTGATTGAGTGTTGTATGGCATTATCGATTTAAGATTTTGTATAAAATACGCCCAAAGGAGAATGAGCAGCAAGTTATCATATCCTTGGAAATGTAAGCAATGGATGAAAATTGCTGAGACACTGGATTACAGTATTTGCTACCTTACTGAAGTCAGACTCTTGGATTAGGTGTTTTTCCGACTACCATTTAAACTTGGAGTTTAAGATAAACTAGAGAGATGGAGAGAGAGAGAGAGTGGAAGAGATGGATGACAGTACGTCAATGAAAGAACAACTATTTCAAAGAATTTGTTTATCCTGTTCCATGAATGAATTATATTCCAAAAAATCTGGAAGATTTTCCATTAGAACATTTACCTGCCTTCTCTGTTTGTTTCTTGATCTTGCTGCTTGATTTTGTGCAGTTTGCTGCAAAGATTTGCCTCCTTACGTCATTTAGAGATACTTGTTTCATTGAAATTGTTCCACAATCACAGACTCCTAAGCGTGGTAAGTTAAGAGGCATGGATATGGATACAAGACATAAATATGATACGGACGTAGAGACATGTCATTTTATATGGATAAGATATGTTTATTATAAAATACCTTTTTAAAAAATATATATCATTTCTATGCCCGATGAATTTGAAGTCAACAAGTTTATGCCCTTATATACTTAAAAAAATGAGTGTGATATGTTTTGGCTTAAAATTTATTACTTTTTTACTGTCTTAACAAGTGTCTTATGCATGTCTAATAAGTAGAAGGCTTGTTTTAATTTTACAACACGAAAATGCTTTCAAAGTGTTTGTGCTTCTTAAGTCGTAAGATAGTGGATATTTCTTGAAATTGTCATGTCATGATTTTTCATAGTGGAAATAGATTTACTTTATTGGTTATATGAAATATATTACAAAAGGGGTGGAAAGCCAGACCCCTGGCTGAGGAAGATTACAAGATGTTGCGATGTAGTGTAGAAACATAATCATGAGCAACTGAATTGGCAGTTGAATATAGTTTTATATTTTAAAATACCCTAAAAGTCCTTTCCAAAACCTTCATAAATCCAAATTTCATTGCATATCTTCTTTTATTATTCATTTTCCTCGAGAGTCTTAGCTATATGAAATATTGGAGAGTGATAATTTGTGCTGCTTAGATTAAGATATTTTTTTACCGCAAGTTTAAGGTGGAACCTGTGATAATAAAATATCTTGCAAGATATTTGATGTTGATGAAATTCAATAGATGCATTAACTACATAAGACGGGACTTTTTATCAGGGCTGTCTTCAAACGAACATACAATTGTGATGTACATTTCCTCTGTACCTCAGTTTCTCAATATGTAGAAACGTTTTGAATGTCGACACTGTTGCATTTACACACCAAATGCTTTGCTGTAGTTTTGGAATACTAGGGTTCTTACAGCTCCTTTACCTTTGCAATAATGCAGAGCTCTGGTTAAGTTTCTGGTCTGAGGTTCACTACAATTCACTTTACGAAATTCAAGGTTTGGTTATACTGCAAAATCCATTTTCTGGTTCTCAGAGTAGCAAGGTTTAATTAGAGTGAATGCGATTATATGGGAACCTTGGTTTTTAATGGTGATAATGTGTTTGTCTTTCTTGCAGATGTTCCAGTTCAACAAAAACCAGGAAAAAAACATTGGTTGTTCTAGTTAAACAGATTGATGTGAGTATAGATAGCATCCTAGATATGTGAGTACTACACTACAAAGTTCTTGTGTTTTTTTTTTTTTTGGAAAAAAAGAACGGAAAACCATATGATTGATCTTTATATATATAGTTTTATGTAAATAGTTCGTCATACACATACATTGCCTTGTATTAGTGCACTATAGATCCAATGAACTCTATGATAGAATTATAGTCACATTATGAACTCAAAAACTCTGGTGCACCTGTCCTAGTAGCATAACTGAATTCAGTTTAGCATATCCTACACCTACAGAGCAAGAAATAATTCATTTGTATTCATGAAATATCAAATTGACTTGACTCTTGGAACTTTGTGACTCTGCATAGTTTTGTGGCATATATATTTATTGGGCTGATGGGTAATGGGTAAAGGAAAATGGTTTTTGGTGTTCTCATTACAAATTAATAAGGGAAAATGAAGCAGAAATGCAGAAAACAAGCTAGAAAGGAGGTTAGCTCGGTGAGAGCAAACCACTCATGCTCATTTCCAGTTGTGGGCAACAATGTCGGCCAAATCAACAACCCGTTGAGAGTAACCCCACTCGTTATCATACCAAGCAATAACCTTCACCAAGTCATCACCCATAACCATAGTCAAGGAAGAGTCGACGGTTGAGGAGACATCAGAGCACCTAAAATCGACCGAAACAAGGGGCTCGTCACAAACAGAGAGGATGCCGTTGAGCTCCTTCTCAGCACTTTCCCGGAATGCTGCATTCACCTCTTCAGCAAAGGTCTTCTTAGAAACCTGGACAACAAGGTCTACAACAGACACATTTGGAGTGGGCACACGAAGTGCAATCCCATTGAGTTTTCCTTTAAGAGAAGGGAGGACTAGAGCAACAGCTTTGGCTGCTCCTGTGGAGGTAGGAACAATGTTGAGTGCAGCAGCTCTTGCCCTCCTGAGGTCACGGTGGCTGGCATCAAGTAGCCTCTGGTCACCAGTGTAGGAGTGAGTGGTAGTCATTGTTCCCTTGATGATACCTGCCATAGATTTATACCAAAACCTTCACAAACCTGAATTTTAGCACCTGAATTTGCAAGGGCCTAGAAATGTAAGTGCTATTGATGAATCTATGCTGGGATTCAGTATGTATTACCGAACTTCTGGTCAAGGACCTTGACAAAAGGAGCTAGGCAATTGGTAGTGCAAGAAGCATTGCTGATGATAGGCTCATCAGGGCTGTAAGCGTCGGCATTAACCCCAACGACGTAAGTCGGAATGTCACCCTTCCCAGGTGCTGTAATTAGGACCTTCTTTGCTCCAGCCTGAATATGCTTCCCTGCACCCTCTCTATCAACAAACACCCCAGTTCCTTCAATCACCAAGTCTATTCCCAAGTCCCTGCATTGTCATTTATCTAATCATTCAATTTTTTTTATCTCATACAGATAACGAATGAAAAGTTAGAAGCATAGAAGTTGAACACCCACTTCCAGGGAAGGTTGAGAGGGTTGCGGTTGGAGACCACCTGGATGATCTTGCCATCAACTGAAATAGCTTCATCTCCGGCAGGTTTGACATCGGCATCAAAGATGCCGAGGGTGGAGTCGTACTTGAGGAGGTGAGAAGCTTGCTTGACGCCGCCGGAATCGTTGATGGCAATGACATCAAGTGGGGAATCCTTGCGACCATGCCAGCACCTCAAGAAATTCCTGCCAATCCTTCCAAACCCATTGATGGCTACCTTAAGCTTTGCTTCCACAATCCCTTTCTTATACCCTCCGCTACTTCCCACCTTCAAGAATTTAATGGATGTAGGTAAGGCACATGAAGAATCATTAGTGTAGTATGAATATTCACTTCTATATAACCCCCTCATGCTATCTCAGATATGAAACCAGAGACCATAGTACGTGCTAGAAGAAGCTTAGACGAAAAATAAAATGAAACGGAGTATATTCAGTAGTCCAAATAGAACTTTCCAAAGAAATAGCTTAGTAGCCACATTAGCCGCACTTTTGGGATGATTTTAGGGGCAGCGTATTCCCTTTCGCTATATAATCGTGTGGTTTCTGGAAATTTATGATAAATCAAATTGAAGCCAAATTAGGCTTTGCTGTTGATCAATGAACAGCCTAAATCTACTACCTCTCTCCCAAAAACAGACCTAACCATCAATCAGATTTTGTTCCTTAAGATGAACACAATTGACAGAAGCCATGAATGTTATTTCTACATGTTCACTCATGTAACCATAGTACAACTTTTGTTACATCCCATGACTCAATGTAGATTGCTGCCGCATCATGATCCAAACAAAAGATTACTACACTGGGAAGAAACTATGATAAACTTTTAAGAATAGAGATAACTCACAGCAGAGGTCTGGAAGGCAACGACGGAAAGGAAGTCATCGGAGGTTCGCCTAGCGAACGGGAGGCAGGTCGACGAGTTGCGGAGGCCAGAGAATTCTCCAAATCCCTTTCCAGTAGCCTGCAATGCAAGAGAATGCAACAATGTTCTCAGATGAAAAACAGGAAGAGATTAAAGAAGGGGCGAGAAAGGAAGAGTGCAAGCAATGGAGGAACCTGGAGAGATGGTTTGGCTACTGAGAGAGTAGCCGTAGCCATGGTTGAGGGGCGATGAGAAGAGCAAAATGGGGGAAAGCAGAAGCTTGAAGCAGTAGAAGAGAGATGGGAAATTGAGAAGGGTGGAGTG

mRNA sequence

CGCGCCAAATCGTAAAGATTGGTGGAAGAAGGTGTTGAAGTGGGACCCACGCTCATCTGCGGCCATTGACTGTTACTTCCGGAGAAGGGAGAAGCCGCGAAAGCTACAGGCTGTGGGGGTCTGGGTCTACGAGGCGTTCCAATGAAGCCCCCCATTTCTTTCCGTCCCACGAATTCTTAACCCCTTCCCTGAAACATCCTAATCCCGTTCCACCTCAAACCATCCTCCACACCATCCCCGAAGTCAAATCGGACGCAGTTGCAGGTATAATTGATCTGTTCCTTACTTCTTACCCGCAGATTTCAGTATGAGGCCGGGAACTCTGAGCGTTGGAGAATGTTCGAGTTCTACTTCATTGAGCTCTCATCAAGACATTGACGATGATCGCATGATTGCTGTTGCATTAACGGAAGAGTACGCTAAGCTAGACGGTGGTGTTGCCAGACGCCTCTCCAACCTTGCCCCTATTGCTCATACTCCAAGGGTAAATTTGTACATCCCCACCCAAAGTGATGCGAGTTTGGAGTATCATAGGCTTCTTCAGAGGCTAAATGTCTATGGTTTGCATGAAGTGAAGGTCTCTGGAGATGGAAATTGTCAGTTTCGAGCACTTTCAGATCAGCTGTACAAATCACCTGAGTATCACAAGCACGTGCGGAAAGACATTGTAAAGCAGCTAAAGGACTACCGTTCTCAATATGAAGGTTACGTTCCAATGAAGTTCAGTCGTTATTACAAGAAAATGGCGAAATCTGGTGAATGGGGGGACCATGTTACCCTTCAAGCAGCAGCTGATAAGATTTTGTATAAAATACGCCCAAAGGAGAATGAGCAGCAAGTTATCATATCCTTGGAAATTTTCTCAATATGTAGAAACGTTTTGAATGTCGACACTGTTGCATTTACACACCAAATGCTTTGCTGTAGTTTTGGAATACTAGGGTTCTTACAGCTCCTTTACCTTTGCAATAATGCAGAGCTCTGGTTAAGTTTCTGGTCTGAGGTTCACTACAATTCACTTTACGAAATTCAAGGTTTGGTTATACTGCAAAATCCATTTTCTGGTTCTCAGAGTAGCAAGATGTTCCAGTTCAACAAAAACCAGGAAAAAAACATTGGTTGTTCTAGTTAAACAGATTGATGTGAGTATAGATAGCATCCTAGATATGTGAGTACTACACTACAAAGTTCTTGTGTTTTTTTTTTTTTTGGAAAAAAAGAACGGAAAACCATATGATTGATCTTTATATATATAGTTTTATGTAAATAGTTCGTCATACACATACATTGCCTTGTATTAGTGCACTATAGATCCAATGAACTCTATGATAGAATTATAGTCACATTATGAACTCAAAAACTCTGGTGCACCTGTCCTAGTAGCATAACTGAATTCAGTTTAGCATATCCTACACCTACAGAGCAAGAAATAATTCATTTGTATTCATGAAATATCAAATTGACTTGACTCTTGGAACTTTGTGACTCTGCATAGTTTTGTGGCATATATATTTATTGGGCTGATGGGTAATGGGTAAAGGAAAATGGTTTTTGGTGTTCTCATTACAAATTAATAAGGGAAAATGAAGCAGAAATGCAGAAAACAAGCTAGAAAGGAGGTTAGCTCGGTGAGAGCAAACCACTCATGCTCATTTCCAGTTGTGGGCAACAATGTCGGCCAAATCAACAACCCGTTGAGAGTAACCCCACTCGTTATCATACCAAGCAATAACCTTCACCAAGTCATCACCCATAACCATAGTCAAGGAAGAGTCGACGGTTGAGGAGACATCAGAGCACCTAAAATCGACCGAAACAAGGGGCTCGTCACAAACAGAGAGGATGCCGTTGAGCTCCTTCTCAGCACTTTCCCGGAATGCTGCATTCACCTCTTCAGCAAAGGTCTTCTTAGAAACCTGGACAACAAGGTCTACAACAGACACATTTGGAGTGGGCACACGAAGTGCAATCCCATTGAGTTTTCCTTTAAGAGAAGGGAGGACTAGAGCAACAGCTTTGGCTGCTCCTGTGGAGGTAGGAACAATGTTGAGTGCAGCAGCTCTTGCCCTCCTGAGGTCACGGTGGCTGGCATCAAGTAGCCTCTGGTCACCAGTGTAGGAGTGAGTGGTAGTCATTGTTCCCTTGATGATACCTGCCATAGATTTATACCAAAACCTTCACAAACCTGAATTTTAGCACCTGAATTTGCAAGGGCCTAGAAATGTAAGTGCTATTGATGAATCTATGCTGGGATTCAGTATGTATTACCGAACTTCTGGTCAAGGACCTTGACAAAAGGAGCTAGGCAATTGGTAGTGCAAGAAGCATTGCTGATGATAGGCTCATCAGGGCTGTAAGCGTCGGCATTAACCCCAACGACGTAAGTCGGAATGTCACCCTTCCCAGGTGCTGTAATTAGGACCTTCTTTGCTCCAGCCTGAATATGCTTCCCTGCACCCTCTCTATCAACAAACACCCCAGTTCCTTCAATCACCAAGTCTATTCCCAAGTCCCTGCATTGTCATTTATCTAATCATTCAATTTTTTTTATCTCATACAGATAACGAATGAAAAGTTAGAAGCATAGAAGTTGAACACCCACTTCCAGGGAAGGTTGAGAGGGTTGCGGTTGGAGACCACCTGGATGATCTTGCCATCAACTGAAATAGCTTCATCTCCGGCAGGTTTGACATCGGCATCAAAGATGCCGAGGGTGGAGTCGTACTTGAGGAGGTGAGAAGCTTGCTTGACGCCGCCGGAATCGTTGATGGCAATGACATCAAGTGGGGAATCCTTGCGACCATGCCAGCACCTCAAGAAATTCCTGCCAATCCTTCCAAACCCATTGATGGCTACCTTAAGCTTTGCTTCCACAATCCCTTTCTTATACCCTCCGCTACTTCCCACCTTCAAGAATTTAATGGATGTAGGTAAGGCACATGAAGAATCATTAGTGTAGTATGAATATTCACTTCTATATAACCCCCTCATGCTATCTCAGATATGAAACCAGAGACCATAGTACGTGCTAGAAGAAGCTTAGACGAAAAATAAAATGAAACGGAGTATATTCAGTAGTCCAAATAGAACTTTCCAAAGAAATAGCTTAGTAGCCACATTAGCCGCACTTTTGGGATGATTTTAGGGGCAGCGTATTCCCTTTCGCTATATAATCGTGTGGTTTCTGGAAATTTATGATAAATCAAATTGAAGCCAAATTAGGCTTTGCTGTTGATCAATGAACAGCCTAAATCTACTACCTCTCTCCCAAAAACAGACCTAACCATCAATCAGATTTTGTTCCTTAAGATGAACACAATTGACAGAAGCCATGAATGTTATTTCTACATGTTCACTCATGTAACCATAGTACAACTTTTGTTACATCCCATGACTCAATGTAGATTGCTGCCGCATCATGATCCAAACAAAAGATTACTACACTGGGAAGAAACTATGATAAACTTTTAAGAATAGAGATAACTCACAGCAGAGGTCTGGAAGGCAACGACGGAAAGGAAGTCATCGGAGGTTCGCCTAGCGAACGGGAGGCAGGTCGACGAGTTGCGGAGGCCAGAGAATTCTCCAAATCCCTTTCCAGTAGCCTGCAATGCAAGAGAATGCAACAATGTTCTCAGATGAAAAACAGGAAGAGATTAAAGAAGGGGCGAGAAAGGAAGAGTGCAAGCAATGGAGGAACCTGGAGAGATGGTTTGGCTACTGAGAGAGTAGCCGTAGCCATGGTTGAGGGGCGATGAGAAGAGCAAAATGGGGGAAAGCAGAAGCTTGAAGCAGTAGAAGAGAGATGGGAAATTGAGAAGGGTGGAGTG

Coding sequence (CDS)

ATGAGGCCGGGAACTCTGAGCGTTGGAGAATGTTCGAGTTCTACTTCATTGAGCTCTCATCAAGACATTGACGATGATCGCATGATTGCTGTTGCATTAACGGAAGAGTACGCTAAGCTAGACGGTGGTGTTGCCAGACGCCTCTCCAACCTTGCCCCTATTGCTCATACTCCAAGGGTAAATTTGTACATCCCCACCCAAAGTGATGCGAGTTTGGAGTATCATAGGCTTCTTCAGAGGCTAAATGTCTATGGTTTGCATGAAGTGAAGGTCTCTGGAGATGGAAATTGTCAGTTTCGAGCACTTTCAGATCAGCTGTACAAATCACCTGAGTATCACAAGCACGTGCGGAAAGACATTGTAAAGCAGCTAAAGGACTACCGTTCTCAATATGAAGGTTACGTTCCAATGAAGTTCAGTCGTTATTACAAGAAAATGGCGAAATCTGGTGAATGGGGGGACCATGTTACCCTTCAAGCAGCAGCTGATAAGATTTTGTATAAAATACGCCCAAAGGAGAATGAGCAGCAAGTTATCATATCCTTGGAAATTTTCTCAATATGTAGAAACGTTTTGAATGTCGACACTGTTGCATTTACACACCAAATGCTTTGCTGTAGTTTTGGAATACTAGGGTTCTTACAGCTCCTTTACCTTTGCAATAATGCAGAGCTCTGGTTAAGTTTCTGGTCTGAGGTTCACTACAATTCACTTTACGAAATTCAAGGTTTGGTTATACTGCAAAATCCATTTTCTGGTTCTCAGAGTAGCAAGATGTTCCAGTTCAACAAAAACCAGGAAAAAAACATTGGTTGTTCTAGTTAA

Protein sequence

MRPGTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRVNLYIPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVIISLEIFSICRNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLYEIQGLVILQNPFSGSQSSKMFQFNKNQEKNIGCSS
BLAST of Cp4.1LG03g06470 vs. Swiss-Prot
Match: Y4757_DICDI (OTU domain-containing protein DDB_G0284757 OS=Dictyostelium discoideum GN=DDB_G0284757 PE=3 SV=2)

HSP 1 Score: 72.4 bits (176), Expect = 8.6e-12
Identity = 59/174 (33.91%), Postives = 88/174 (50.57%), Query Frame = 1

Query: 40  LDGGVARRLSNLAP-IAHTPRVNLY-IPTQSDASLEYHRLLQRLNVYGLHEVK-VSGDGN 99
           L+G V + +++ A  I+    +NL+ +P   +  +   RL +RL +Y L   K + GDGN
Sbjct: 582 LEGLVLKNMNHDASLISSNVLLNLHPLPQSKEVQIAQQRLNERLELYMLKNSKEIPGDGN 641

Query: 100 CQFRALSDQLYKSPEYHKHVRKDIVKQL---KDYRSQYEGYV-----PMKFSRYYKKMAK 159
           CQ  ALSDQLY    + + VRK IV  L   KD++      +        +  Y   M+K
Sbjct: 642 CQMHALSDQLYGDLSHSQEVRKTIVDWLRKNKDFQLPNGATICQFVNTNNWDDYCNDMSK 701

Query: 160 SGEWGDHVTLQAAADKILYKIRPKEN-EQQVIISLEIFSICRNVLNVDTVAFTH 202
           +G WGDH+TL AAA+    KI    + E Q    +EI  I   +LN   +  +H
Sbjct: 702 NGNWGDHLTLLAAAEHFGSKISIISSVESQSNFFIEI--IPSKILNDKVLLLSH 753

BLAST of Cp4.1LG03g06470 vs. Swiss-Prot
Match: OTUD4_HUMAN (OTU domain-containing protein 4 OS=Homo sapiens GN=OTUD4 PE=1 SV=4)

HSP 1 Score: 55.5 bits (132), Expect = 1.1e-06
Identity = 31/103 (30.10%), Postives = 52/103 (50.49%), Query Frame = 1

Query: 81  LNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYRSQYEGYVPMKFS 140
           L   GL+   V+ DG+C FRA+++Q+  S   H  VR   +  L++ R ++E ++   F 
Sbjct: 29  LRKLGLYRKLVAKDGSCLFRAVAEQVLHSQSRHVEVRMACIHYLRENREKFEAFIEGSFE 88

Query: 141 RYYKKMAKSGEWGDHVTLQAAA-----DKILYKIRPKENEQQV 179
            Y K++    EW   V + A +     D I+Y+  P  +  QV
Sbjct: 89  EYLKRLENPQEWVGQVEISALSLMYRKDFIIYR-EPNVSPSQV 130

BLAST of Cp4.1LG03g06470 vs. Swiss-Prot
Match: OTUD4_MOUSE (OTU domain-containing protein 4 OS=Mus musculus GN=Otud4 PE=1 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 1.9e-06
Identity = 25/80 (31.25%), Postives = 43/80 (53.75%), Query Frame = 1

Query: 81  LNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYRSQYEGYVPMKFS 140
           L   GL+   V+ DG+C FRA+++Q+  S   H  VR   ++ L++ R ++E ++   F 
Sbjct: 29  LRKLGLYRKLVAKDGSCLFRAVAEQVLHSQSRHVEVRMACIRYLRENREKFEAFIEGSFE 88

Query: 141 RYYKKMAKSGEWGDHVTLQA 161
            Y K++    EW   V + A
Sbjct: 89  EYLKRLENPQEWVGQVEISA 108

BLAST of Cp4.1LG03g06470 vs. Swiss-Prot
Match: ALG13_MOUSE (Putative bifunctional UDP-N-acetylglucosamine transferase and deubiquitinase ALG13 OS=Mus musculus GN=Alg13 PE=1 SV=2)

HSP 1 Score: 53.1 bits (126), Expect = 5.4e-06
Identity = 32/111 (28.83%), Postives = 55/111 (49.55%), Query Frame = 1

Query: 71  SLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYRSQ 130
           SL    + + L   GL    V+ D +C FRA+S+QL+ S  +H  +R+  V  +K+ +  
Sbjct: 210 SLNEASMDEYLGSLGLFRKVVAKDASCLFRAISEQLFHSQIHHLQIRRACVSYMKENQQA 269

Query: 131 YEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAA-----DKILYKIRPKENEQ 177
           +E YV   F +Y +++    E    + L+A +     D I+Y+   K   Q
Sbjct: 270 FESYVEGSFEKYLERLGDPKESAGQLELKALSLIYNRDFIIYRYPGKPPTQ 320

BLAST of Cp4.1LG03g06470 vs. Swiss-Prot
Match: OTU5A_DANRE (OTU domain-containing protein 5-A OS=Danio rerio GN=otud5a PE=2 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 9.3e-06
Identity = 25/74 (33.78%), Postives = 40/74 (54.05%), Query Frame = 1

Query: 90  KVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYRSQYEGYVPMKFSRYYKKMAKS 149
           K+  DG C FRA++DQ+Y   + H+ VRK  +  L      +  YV   F+ Y  +  K+
Sbjct: 215 KMKEDGACLFRAVADQVYGDQDMHEVVRKHCMDYLMKNADYFSNYVTEDFTTYINRKRKN 274

Query: 150 GEWGDHVTLQAAAD 164
              G+H+ +QA A+
Sbjct: 275 NCHGNHIEMQAMAE 288

BLAST of Cp4.1LG03g06470 vs. TrEMBL
Match: A0A061EEF8_THECC (Cysteine proteinases superfamily protein OS=Theobroma cacao GN=TCM_017429 PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 9.4e-74
Identity = 152/243 (62.55%), Postives = 170/243 (69.96%), Query Frame = 1

Query: 1   MRPGTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRV 60
           MR G   VGECSSSTS SS QD +DD+MIAV L+EEYAKLDG VARRLS LAP+ H PR+
Sbjct: 1   MRNGVQHVGECSSSTSWSSQQDTEDDQMIAVVLSEEYAKLDGAVARRLSGLAPVPHVPRI 60

Query: 61  NLYIPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDI 120
           N +IP  SDASL++ RLLQRL VYGL+EVKVSGDGNCQFRALSDQ+YKSPEYHKHVRKDI
Sbjct: 61  NSFIPNVSDASLDHQRLLQRLQVYGLYEVKVSGDGNCQFRALSDQMYKSPEYHKHVRKDI 120

Query: 121 VKQLKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVII 180
           VKQLKD+R+ YEGYVPMK+ RY KKMAKSGEWGDHVTLQAA+DK   KI          +
Sbjct: 121 VKQLKDHRNLYEGYVPMKYKRYCKKMAKSGEWGDHVTLQAASDKFAAKI---------CL 180

Query: 181 SLEIFSICRNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLYE 240
                  C   +     A  H++                       LSFWSEVHYNSLYE
Sbjct: 181 LTSFRDTCFVEIMPQYQAPKHELW----------------------LSFWSEVHYNSLYE 212

Query: 241 IQG 244
           IQG
Sbjct: 241 IQG 212

BLAST of Cp4.1LG03g06470 vs. TrEMBL
Match: A0A059APN3_EUCGR (Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_I01566 PE=4 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 6.1e-73
Identity = 135/166 (81.33%), Postives = 152/166 (91.57%), Query Frame = 1

Query: 4   GTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRVNLY 63
           GT SVGECSSSTSLSS QD++DDRMIA+ L+EE+AK+DGGVARRLSNLAP+ H PR+N Y
Sbjct: 111 GTHSVGECSSSTSLSSQQDLEDDRMIALVLSEEFAKVDGGVARRLSNLAPVRHVPRINTY 170

Query: 64  IPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQ 123
           IP  SDASL++ RLLQRLN+YGL+EVKVSGDGNCQFRALSDQ+YKSPEYHK+VRK+IVKQ
Sbjct: 171 IPDLSDASLDHQRLLQRLNIYGLYEVKVSGDGNCQFRALSDQMYKSPEYHKNVRKEIVKQ 230

Query: 124 LKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKI 170
           LKDYRS YEGYVPMK+ RYYKKMAK GEWGDHVTLQAAADK + KI
Sbjct: 231 LKDYRSLYEGYVPMKYKRYYKKMAKLGEWGDHVTLQAAADKFVAKI 276

BLAST of Cp4.1LG03g06470 vs. TrEMBL
Match: M5X0D5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011031mg PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 1.0e-72
Identity = 138/169 (81.66%), Postives = 150/169 (88.76%), Query Frame = 1

Query: 1   MRPGTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRV 60
           M  GT SVGECSSSTSLSS QD++DD MIAV L+EEYAKLDG VARRLSNLAP+ H PR+
Sbjct: 1   MMNGTHSVGECSSSTSLSSQQDVEDDCMIAVVLSEEYAKLDGAVARRLSNLAPVPHIPRI 60

Query: 61  NLYIPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDI 120
           N YIP  SDASL++ RLLQRL+VYGL+EVKVSGDGNCQFRALSDQ+YKSPEYHKHVRK+I
Sbjct: 61  NSYIPNISDASLDHQRLLQRLHVYGLYEVKVSGDGNCQFRALSDQMYKSPEYHKHVRKEI 120

Query: 121 VKQLKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKI 170
           VKQLKDY S YEGYVPMK+ RYYKKMAKSGEWGDHVTLQAAADK   KI
Sbjct: 121 VKQLKDYHSLYEGYVPMKYKRYYKKMAKSGEWGDHVTLQAAADKFEAKI 169

BLAST of Cp4.1LG03g06470 vs. TrEMBL
Match: I3S995_LOTJA (Uncharacterized protein OS=Lotus japonicus PE=2 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 1.0e-72
Identity = 149/242 (61.57%), Postives = 170/242 (70.25%), Query Frame = 1

Query: 9   GECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRVNLYIPTQS 68
           GE S STSLSS QDI+DDRMIA+ L+EEYAKLDGGV RRLS L P+AH PR+N  IPT S
Sbjct: 11  GESSGSTSLSSQQDIEDDRMIALVLSEEYAKLDGGVGRRLSKLEPVAHVPRINSSIPTIS 70

Query: 69  DASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYR 128
           DAS+++ RLLQRLN+YGL EV+VSGDGNCQFRALSDQLY+SPE+HKHVRK+IV+QLKD+R
Sbjct: 71  DASMDHQRLLQRLNIYGLREVRVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVRQLKDHR 130

Query: 129 SQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVIISLEIFSIC 188
           S YE YVPMK+ RYYKKMAK GEWGDHVTLQAA+DK   K                  IC
Sbjct: 131 SLYECYVPMKYKRYYKKMAKLGEWGDHVTLQAASDKFAAK------------------IC 190

Query: 189 RNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLYEIQGLVILQ 248
                 DT         C   I+     LY     E+WLSFWSEVHYNSLYE++   I  
Sbjct: 191 LLTSFRDT---------CFIEIMP----LYQAPQREIWLSFWSEVHYNSLYEVRDAPIQH 221

Query: 249 NP 251
            P
Sbjct: 251 KP 221

BLAST of Cp4.1LG03g06470 vs. TrEMBL
Match: A0A0S3TA43_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.11G108000 PE=4 SV=1)

HSP 1 Score: 279.6 bits (714), Expect = 4.0e-72
Identity = 149/242 (61.57%), Postives = 171/242 (70.66%), Query Frame = 1

Query: 9   GECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRVNLYIPTQS 68
           GE S STSLSS QD++DD+MIA+ L+EEYAKLDG VARRL+NL P+ H PR+N +IPT +
Sbjct: 11  GESSRSTSLSSQQDVEDDQMIALVLSEEYAKLDGAVARRLTNLEPVPHVPRINSFIPTVN 70

Query: 69  DASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYR 128
           DAS+++ RLLQRLNVYGL EVKVSGDGNCQFRALSDQLY+SPE+HKHVRK+IVKQLKD+R
Sbjct: 71  DASMDHQRLLQRLNVYGLCEVKVSGDGNCQFRALSDQLYRSPEHHKHVRKEIVKQLKDHR 130

Query: 129 SQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVIISLEIFSIC 188
           S YE YVPMK+ +Y+KKMAKSGEWGDHVTLQAAAD    K                  IC
Sbjct: 131 SLYECYVPMKYKKYHKKMAKSGEWGDHVTLQAAADNFAAK------------------IC 190

Query: 189 RNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLYEIQGLVILQ 248
                 DT         C   I+     LY     ELWLSFWSEVHYNSLYEI+   I  
Sbjct: 191 LLTSFRDT---------CFIEIIP----LYQAPQRELWLSFWSEVHYNSLYEIRDAPIQP 221

Query: 249 NP 251
            P
Sbjct: 251 KP 221

BLAST of Cp4.1LG03g06470 vs. TAIR10
Match: AT3G02070.1 (AT3G02070.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 245.4 bits (625), Expect = 4.2e-65
Identity = 116/162 (71.60%), Postives = 137/162 (84.57%), Query Frame = 1

Query: 8   VGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRVNLYIPTQ 67
           +G+ SSSTS SS +D +DDRMIA  L+EEY+KLDG V RRLSNLAP+ H PR+N YIP  
Sbjct: 1   MGDSSSSTSWSSKKDTEDDRMIAFMLSEEYSKLDGAVGRRLSNLAPVPHVPRINCYIPNL 60

Query: 68  SDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDY 127
           +DA+L++ RLLQRLNVYGL E+KVSGDGNCQFRALSDQLY+SPEYHK VR+++VKQLK+ 
Sbjct: 61  NDATLDHQRLLQRLNVYGLCELKVSGDGNCQFRALSDQLYRSPEYHKQVRREVVKQLKEC 120

Query: 128 RSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKI 170
           RS YE YVPMK+ RYYKKM K GEWGDH+TLQAAAD+   KI
Sbjct: 121 RSMYESYVPMKYKRYYKKMGKFGEWGDHITLQAAADRFAAKI 162

BLAST of Cp4.1LG03g06470 vs. TAIR10
Match: AT3G22260.2 (AT3G22260.2 Cysteine proteinases superfamily protein)

HSP 1 Score: 187.6 bits (475), Expect = 1.0e-47
Identity = 112/236 (47.46%), Postives = 138/236 (58.47%), Query Frame = 1

Query: 7   SVGECSSSTSLSSHQDIDDDRMIAVALTE-EYAKLDGGVARRLSNLAPIAHTPRVNLYIP 66
           S    S+S+  SS  D DDD+ IA  L E E  + +G + +RLS+L  I HTPRVN  IP
Sbjct: 21  STSASSNSSFSSSVADTDDDQTIARILAEDESLRREGKLGKRLSHLDSIPHTPRVNREIP 80

Query: 67  TQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLK 126
             +DA+L++  L  RL  YGL E+++ GDGNCQFRAL+DQL+++ +YHKHVRK +VKQLK
Sbjct: 81  DINDATLDHELLSGRLATYGLAELQMEGDGNCQFRALADQLFRNADYHKHVRKHVVKQLK 140

Query: 127 DYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKI--RPKENEQQVIISLE 186
             R  YE YVPMK+  Y +KM K GEWGDHVTLQAAAD+   KI       +Q  I   E
Sbjct: 141 QQRKLYEEYVPMKYRHYTRKMKKHGEWGDHVTLQAAADRFEAKICLVTSFRDQSYI---E 200

Query: 187 IFSICRNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLY 240
           I    +N L                               E WLSFWSEVHYNSLY
Sbjct: 201 ILPHNKNPLR------------------------------EAWLSFWSEVHYNSLY 223

BLAST of Cp4.1LG03g06470 vs. TAIR10
Match: AT5G03330.1 (AT5G03330.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 164.9 bits (416), Expect = 7.2e-41
Identity = 97/224 (43.30%), Postives = 129/224 (57.59%), Query Frame = 1

Query: 16  SLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRVNLYIPTQSDASLEYH 75
           S SS  D D+      +   +    DG   RRL+ + PI + P++N  IP + +A  ++ 
Sbjct: 146 SCSSPSDTDE---YVYSWESDQCDADGEFGRRLNQMVPIPYIPKINGEIPPEEEAVSDHE 205

Query: 76  RLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYRSQYEGYV 135
           RL  RL ++   EVKV GDGNCQFRAL+DQLYK+ + HKHVR+ IVKQLK     Y+GYV
Sbjct: 206 RLRNRLEMFDFTEVKVPGDGNCQFRALADQLYKTADRHKHVRRQIVKQLKSRPDSYQGYV 265

Query: 136 PMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVIISLEIFSICRNVLNVD 195
           PM FS Y +KM++SGEWGDHVTLQAAAD   Y+++        I+ L  F         D
Sbjct: 266 PMDFSDYLRKMSRSGEWGDHVTLQAAADA--YRVK--------IVVLTSFK--------D 325

Query: 196 TVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLY 240
           T         C   IL   Q     +   ++LSFW+EVHYN++Y
Sbjct: 326 T---------CYIEILPTSQE----SKGVIFLSFWAEVHYNAIY 335

BLAST of Cp4.1LG03g06470 vs. TAIR10
Match: AT5G04250.1 (AT5G04250.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 161.0 bits (406), Expect = 1.0e-39
Identity = 95/216 (43.98%), Postives = 127/216 (58.80%), Query Frame = 1

Query: 24  DDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRVNLYIPTQSDASLEYHRLLQRLNV 83
           DDD + +V + EE       V +RL+ + PIAH P++N  +P++ +   ++ RL QRL +
Sbjct: 145 DDDSVCSVEIEEESWS---EVGKRLNQMIPIAHVPKINGELPSEDEQISDHERLFQRLQL 204

Query: 84  YGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYRSQYEGYVPMKFSRYY 143
           YGL E K+ GDGNCQFR+LSDQLY+SPE+H  VR+ +V QL   R  YEGYVPM ++ Y 
Sbjct: 205 YGLVENKIEGDGNCQFRSLSDQLYRSPEHHNFVREQVVNQLAYNREIYEGYVPMAYNDYL 264

Query: 144 KKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVIISLEIFSICRNVLNVDTVAFTHQM 203
           K M ++GEWGDHVTLQAA    L+ +R       VI S +           DT       
Sbjct: 265 KAMKRNGEWGDHVTLQAA--ADLFGVR-----MFVITSFK-----------DT------- 324

Query: 204 LCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLY 240
             C   IL   Q     +N  + LSFW+EVHYNS+Y
Sbjct: 325 --CYIEILPHFQK----SNRLICLSFWAEVHYNSIY 326

BLAST of Cp4.1LG03g06470 vs. TAIR10
Match: AT2G39320.1 (AT2G39320.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 57.8 bits (138), Expect = 1.2e-08
Identity = 31/79 (39.24%), Postives = 44/79 (55.70%), Query Frame = 1

Query: 91  VSGDGNCQFRALSDQLYKSPEYHKHVRKDIVKQLKDYRSQYEGYVPMKFSRYYKKMAKSG 150
           +  DGNCQFRAL+DQLY++ + H+ VR++IVKQ                      ++ + 
Sbjct: 2   MKSDGNCQFRALADQLYQNSDCHELVRQEIVKQ-------------------NMSLSTNS 61

Query: 151 EWGDHVTLQAAADKILYKI 170
           +WGD VTL+ AAD    KI
Sbjct: 62  QWGDEVTLRVAADVYQVKI 61

BLAST of Cp4.1LG03g06470 vs. NCBI nr
Match: gi|659098078|ref|XP_008449968.1| (PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X2 [Cucumis melo])

HSP 1 Score: 338.2 bits (866), Expect = 1.3e-89
Identity = 178/250 (71.20%), Postives = 190/250 (76.00%), Query Frame = 1

Query: 1   MRPGTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRV 60
           MRP T S+GECSSSTSLSSHQD+DDD MIAVAL+EEYAKLDG VARRLSNLAPIAHTPR+
Sbjct: 1   MRPETQSIGECSSSTSLSSHQDVDDDCMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60

Query: 61  NLYIPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDI 120
           NLYIP QSDASLEYHRLLQRL+VYGLHEVKVSGDGNCQFRALSDQLY+SPEYHKHVRKD+
Sbjct: 61  NLYIPNQSDASLEYHRLLQRLSVYGLHEVKVSGDGNCQFRALSDQLYRSPEYHKHVRKDV 120

Query: 121 VKQLKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVII 180
           VKQLKD+RS YEGYVPMK+SRYYKKMAKSGEWGDHVTLQAAADK   K            
Sbjct: 121 VKQLKDHRSLYEGYVPMKYSRYYKKMAKSGEWGDHVTLQAAADKFAAK------------ 180

Query: 181 SLEIFSICRNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLYE 240
                 IC      DT         C   I+   Q        ELWLSFWSEVHYNSLYE
Sbjct: 181 ------ICLLTSFRDT---------CFIEIVPLSQ----TPKRELWLSFWSEVHYNSLYE 219

Query: 241 IQGLVILQNP 251
           IQ + + Q P
Sbjct: 241 IQDVPVQQKP 219

BLAST of Cp4.1LG03g06470 vs. NCBI nr
Match: gi|659098074|ref|XP_008449966.1| (PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X1 [Cucumis melo])

HSP 1 Score: 338.2 bits (866), Expect = 1.3e-89
Identity = 178/250 (71.20%), Postives = 190/250 (76.00%), Query Frame = 1

Query: 1   MRPGTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRV 60
           MRP T S+GECSSSTSLSSHQD+DDD MIAVAL+EEYAKLDG VARRLSNLAPIAHTPR+
Sbjct: 23  MRPETQSIGECSSSTSLSSHQDVDDDCMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 82

Query: 61  NLYIPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDI 120
           NLYIP QSDASLEYHRLLQRL+VYGLHEVKVSGDGNCQFRALSDQLY+SPEYHKHVRKD+
Sbjct: 83  NLYIPNQSDASLEYHRLLQRLSVYGLHEVKVSGDGNCQFRALSDQLYRSPEYHKHVRKDV 142

Query: 121 VKQLKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVII 180
           VKQLKD+RS YEGYVPMK+SRYYKKMAKSGEWGDHVTLQAAADK   K            
Sbjct: 143 VKQLKDHRSLYEGYVPMKYSRYYKKMAKSGEWGDHVTLQAAADKFAAK------------ 202

Query: 181 SLEIFSICRNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLYE 240
                 IC      DT         C   I+   Q        ELWLSFWSEVHYNSLYE
Sbjct: 203 ------ICLLTSFRDT---------CFIEIVPLSQ----TPKRELWLSFWSEVHYNSLYE 241

Query: 241 IQGLVILQNP 251
           IQ + + Q P
Sbjct: 263 IQDVPVQQKP 241

BLAST of Cp4.1LG03g06470 vs. NCBI nr
Match: gi|449455768|ref|XP_004145623.1| (PREDICTED: OTU domain-containing protein DDB_G0284757 [Cucumis sativus])

HSP 1 Score: 310.5 bits (794), Expect = 3.0e-81
Identity = 149/169 (88.17%), Postives = 160/169 (94.67%), Query Frame = 1

Query: 1   MRPGTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRV 60
           M P T S+GECSSSTSLSSHQD++DDRMIAVAL+EEYAKLDG VARRLSNLAPIAHTPR+
Sbjct: 1   MTPETQSIGECSSSTSLSSHQDVEDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60

Query: 61  NLYIPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDI 120
           NLYIP QSDASLEYHRLLQRL+VYGLHEVKVSGDGNCQFRALSDQ+Y+SPEYHKHVRKD+
Sbjct: 61  NLYIPNQSDASLEYHRLLQRLSVYGLHEVKVSGDGNCQFRALSDQMYRSPEYHKHVRKDV 120

Query: 121 VKQLKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKI 170
           VKQLKD+RS YEGYVPMK+SRYYKKMAKSGEWGDHVTLQAAADK   KI
Sbjct: 121 VKQLKDHRSLYEGYVPMKYSRYYKKMAKSGEWGDHVTLQAAADKFAAKI 169

BLAST of Cp4.1LG03g06470 vs. NCBI nr
Match: gi|823129175|ref|XP_012446665.1| (PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X1 [Gossypium raimondii])

HSP 1 Score: 294.3 bits (752), Expect = 2.2e-76
Identity = 153/248 (61.69%), Postives = 179/248 (72.18%), Query Frame = 1

Query: 1   MRPGTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRV 60
           MR G   VGECSSSTS SSHQD +DD+MIAV L+EE++KLDG VARRLS LAP+ H P +
Sbjct: 1   MRNGAQHVGECSSSTSWSSHQDTEDDQMIAVVLSEEFSKLDGAVARRLSGLAPVPHVPHI 60

Query: 61  NLYIPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDI 120
           N YIP+  DASL++ RLL+RL+VYGL+EVKVSGDGNCQFRALSDQ+Y+SPEYHKHVRKDI
Sbjct: 61  NSYIPSLHDASLDHQRLLERLHVYGLYEVKVSGDGNCQFRALSDQMYRSPEYHKHVRKDI 120

Query: 121 VKQLKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVII 180
           VKQLKD R+ YEGYVPMK+ RY KKMAKSGEWGDHVTLQAA+DK+++             
Sbjct: 121 VKQLKDNRNLYEGYVPMKYKRYCKKMAKSGEWGDHVTLQAASDKVVF------------- 180

Query: 181 SLEIFSICRNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCN--NAELWLSFWSEVHYNSL 240
                        V T+      L  SF    F++++        ELWLSFWSEVHYNSL
Sbjct: 181 -------------VKTIFAAKICLLTSFRDTCFIEIMPQSQPPKHELWLSFWSEVHYNSL 222

Query: 241 YEIQGLVI 247
           YEIQG  I
Sbjct: 241 YEIQGAPI 222

BLAST of Cp4.1LG03g06470 vs. NCBI nr
Match: gi|747101690|ref|XP_011098983.1| (PREDICTED: OTU domain-containing protein DDB_G0284757 [Sesamum indicum])

HSP 1 Score: 290.0 bits (741), Expect = 4.2e-75
Identity = 151/250 (60.40%), Postives = 178/250 (71.20%), Query Frame = 1

Query: 1   MRPGTLSVGECSSSTSLSSHQDIDDDRMIAVALTEEYAKLDGGVARRLSNLAPIAHTPRV 60
           MR G++S   CSSSTSLSS QD++DDRMIAV L+EEYAKLDG V RR+S+LAP+ H PR+
Sbjct: 1   MRNGSISNDGCSSSTSLSSQQDVEDDRMIAVVLSEEYAKLDGSVGRRISSLAPVPHIPRI 60

Query: 61  NLYIPTQSDASLEYHRLLQRLNVYGLHEVKVSGDGNCQFRALSDQLYKSPEYHKHVRKDI 120
           NL+ P+ SD++L+Y RLLQRL VYGL+EVKVSGDGNCQFRALSDQ+Y+SPEYHKHVRK++
Sbjct: 61  NLHFPSSSDSNLDYQRLLQRLKVYGLYEVKVSGDGNCQFRALSDQIYRSPEYHKHVRKEV 120

Query: 121 VKQLKDYRSQYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKILYKIRPKENEQQVII 180
           VKQLKD  + YEGYVPMKF  YYKKMAKSGEWGDHVTLQAAADK   KI           
Sbjct: 121 VKQLKDNHALYEGYVPMKFKSYYKKMAKSGEWGDHVTLQAAADKFAAKIC---------- 180

Query: 181 SLEIFSICRNVLNVDTVAFTHQMLCCSFGILGFLQLLYLCNNAELWLSFWSEVHYNSLYE 240
              + +  R+   V+ V   HQ+                    ELWLSFWSEVHYNSLYE
Sbjct: 181 ---LLTSFRDTCFVEIVP-RHQVPV-----------------RELWLSFWSEVHYNSLYE 219

Query: 241 IQGLVILQNP 251
           +Q   +   P
Sbjct: 241 LQDAPLHHKP 219

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y4757_DICDI8.6e-1233.91OTU domain-containing protein DDB_G0284757 OS=Dictyostelium discoideum GN=DDB_G0... [more]
OTUD4_HUMAN1.1e-0630.10OTU domain-containing protein 4 OS=Homo sapiens GN=OTUD4 PE=1 SV=4[more]
OTUD4_MOUSE1.9e-0631.25OTU domain-containing protein 4 OS=Mus musculus GN=Otud4 PE=1 SV=1[more]
ALG13_MOUSE5.4e-0628.83Putative bifunctional UDP-N-acetylglucosamine transferase and deubiquitinase ALG... [more]
OTU5A_DANRE9.3e-0633.78OTU domain-containing protein 5-A OS=Danio rerio GN=otud5a PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A061EEF8_THECC9.4e-7462.55Cysteine proteinases superfamily protein OS=Theobroma cacao GN=TCM_017429 PE=4 S... [more]
A0A059APN3_EUCGR6.1e-7381.33Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_I01566 PE=4 ... [more]
M5X0D5_PRUPE1.0e-7281.66Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011031mg PE=4 SV=1[more]
I3S995_LOTJA1.0e-7261.57Uncharacterized protein OS=Lotus japonicus PE=2 SV=1[more]
A0A0S3TA43_PHAAN4.0e-7261.57Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.11G108000 PE=... [more]
Match NameE-valueIdentityDescription
AT3G02070.14.2e-6571.60 Cysteine proteinases superfamily protein[more]
AT3G22260.21.0e-4747.46 Cysteine proteinases superfamily protein[more]
AT5G03330.17.2e-4143.30 Cysteine proteinases superfamily protein[more]
AT5G04250.11.0e-3943.98 Cysteine proteinases superfamily protein[more]
AT2G39320.11.2e-0839.24 Cysteine proteinases superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659098078|ref|XP_008449968.1|1.3e-8971.20PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X2 [Cucumis melo][more]
gi|659098074|ref|XP_008449966.1|1.3e-8971.20PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X1 [Cucumis melo][more]
gi|449455768|ref|XP_004145623.1|3.0e-8188.17PREDICTED: OTU domain-containing protein DDB_G0284757 [Cucumis sativus][more]
gi|823129175|ref|XP_012446665.1|2.2e-7661.69PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X1 [Gossypium raim... [more]
gi|747101690|ref|XP_011098983.1|4.2e-7560.40PREDICTED: OTU domain-containing protein DDB_G0284757 [Sesamum indicum][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR003323OTU_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006508 proteolysis
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0008233 peptidase activity
molecular_function GO:0005524 ATP binding
molecular_function GO:0004386 helicase activity
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g06470.1Cp4.1LG03g06470.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003323OTU domainPFAMPF02338OTUcoord: 93..167
score: 2.
IPR003323OTU domainPROFILEPS50802OTUcoord: 86..241
score: 1
NoneNo IPR availablePANTHERPTHR12419OTU DOMAIN CONTAINING PROTEINcoord: 1..169
score: 3.4E-126coord: 188..250
score: 3.4E
NoneNo IPR availablePANTHERPTHR12419:SF3SUBFAMILY NOT NAMEDcoord: 188..250
score: 3.4E-126coord: 1..169
score: 3.4E
NoneNo IPR availableunknownSSF54001Cysteine proteinasescoord: 68..194
score: 1.18E-22coord: 222..241
score: 1.18