Lsi05G004240 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi05G004240
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionCysteine proteinases superfamily protein
Locationchr05 : 5411334 .. 5417799 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCACATCCATCTCCCCTGCGGGTATTGACTGTTACTTCCTCACAAGGAGAAACCCGCGAAAGCAACAGCCTGGCTCTGGCCTCTGGGCGTCGTTCCATTGAAGCTTCCATTTTTTTCCTTTTGCTTGCGATTTCTCAACCCCTTTCCCCAAATATCCTAATCCCGTCCCACTTCAAACATTCGCCGCTGAAGTCAAACCCGGACGTTTTTTTCAACCCCCACATTTCCCCTTTTTCCTAATTTCATTCATCTTCCTCTTTTCAGTCTCATCGCCTCCTTCCAGTTGCAATTCCAGGTATTAATTGATCTAATCCTTACTTTTTACTCACAGATTTCAGGTATCAATCATCGGTTTCGTTTAGTTTTTTCTTGAATACTTGTTGCAATCTGTCCAACCATTGAATGATTGAAGATTGACCAAACCCCTTTTTCATCTTACCTTTTTAGCGATTGGTTTTATTGTGTAGTTCGAATTAGATCAATGCCTGTCTGTTTATGCGTGTTCGATGATGTTTTCGTGTAATTAGGATTCCTTCATTTGATTTTGAACTCCGTTCTTGTTGACTTCCTTCTTTATCATCTTGATATATTTTTATCCAGTTATGATGGTTGCTCGCACATTTTCAGGATTTCCCATATTATGCTTTTGTCTTCTCCAATTAATATAGCACTTTTATTGTTAAGTAGTGTTTGTTCTCGATCTACTATGTTTCTTTCTTGCTCTTCGAATAGATTGAGCTCCAACTTCACATTTGTGGATCGGGCGTTATTCGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTAATGTATCTGCATCATGAAAGCATTAATTTTTCTTACATTCTTTTTTAAGGGATGGTGAGTAGTTGCTAAGTGGTAGCATGAGGCCGGGAACTCAGAGCATGGGAGAATGTTCGAGTTCTACTTCATTGAGCTCTCATCAGGACACTGACGATGATCGCATGATTGCTGTTGCATTATCGGAAGAGTATGCTAAGCTAGACGGTGCTGTTGCCAGACGCCTCTCCAACCTTGCCCCTATTGCTGTATGAATCTTTGCTCTTTTCTTTATCACTTCTCATATGAAATTGCGAAAGTTTTTTTTTTTTCTTCTTCTTAGATCGTTGCCCTGAAATTGTGCTCTAATGGAATCTTCTGAAGTGCTTGATTTATTGGTTTTCTGATTATTCTCATCTCTTGGCTTATGGACAATTTTGCTCTGTCTTTTGAGAAAAAGATATTACCTAAACTCATGTGTTAAACTTCACTCTTTGTGGTATTCCAGCATACTCCAAGGATAAATTTGTATATCCCCAACCAAAGTGATGCCAGTTTGGAATATCATAGGCTTCTTCAGAGGTACGTAACCCACATTTTCTGGAAGTAAAAGATAGAATGCCCGCTTGGTCAACAGATGGATCTCTCTCTATATATATATAATTCCACCTTTTCTCTCTGTAGAATAACGATGAAATTTAACATAAATAACAACTGCTAAGGAATTTTTTCCAAGCAGCGCATGCATATTCCCTATTAAAAAAAACAAAAGACAAAAGACACGTGCACATTCATGAACAAGCATCAGTAGAATGAAAATTACTTCCAGGGAGTTAAGACGGTGCATTAAGTTCTTAAATTTACTTGGTGGATGATTTTGTAATCATGATCCCATTGAAGGTCACGAAACCACTGTCCAATTTCTGGCATTATTGTTATTACTATTTTGGTATATTCTTGGTTGTGGTTTAAGCTTAGTCTTAAAGGGAAGTCATCTTCTGTGCCATAGCTAATTACTTGATTGAGGAATAGTTCAAACAAATCTAAAAAATTGCATTAGAAAACTAACAAGCATAATGAAGAATATCAGAGCATCATGATCACCGACTTCGCCAACATAAAGCCATTTGAGACAGCCTATTAGCCTACAGTCCAGTTTAAGCCACTCAAATTCTAGTCTACTTTTTTATTCTTTCAAGATGCAGTTTGATTCTGTATCTCCCTAACAACATGAACTAAGTTAATAACGGGCTTTCTTTCTTTCTTTCTTAATTTCAGCCTTCTTATTATGTTGCTTCTGCGACTTCAATTTAATCCTCGCTCCTAATTTCTTTAAAATTATGCATTGGATAAGAGCTTAGCTTTTCAAGATGAATATGACATCATAGAACTAATCCAATAAAAACGTAGCTATTATGTTTTTTTCCCCTTTGGATGTTAATTGTTATTGCTGTGATAATATCTAAAGTCCAATAAGGATCGCTTGGTTATTCATTGCATCCATGTGTTGTCAAGAGGGAATTGCTGAAAATTACTCAGATAAAAACTGATGGTGGTATAAGTCCAGTGTATGTTTAATGTCATTCAAATTTACCATCTCCCTAGGCTTCTCTTCTTGCCCCTACCACTTATGATGAAGTAGTTTTCATCTCATTGTGTTTCAGGCTAAATGTCTATGGTTTGCATGAAGTGAAGGTCTCTGGTGATGGAAATTGTCAGGTATATAGATTTCTGAAGTAAGAAGTAGAAGCTTCTGGTTTATCTACAATCTACCTATAGTTTCTTCTTATATGGCAGTTTCGAGCACTTTCAGATCAGATGTACAAATCACCTGAGTATCACAAGCACGTGCGGAAAGACATTGTAAAGCAGGTAAACTGTCATATCTCAAGGTGCTTGAACTAGGGAAAGTAGTGGCTACTTTTCTTTTGGGATGAAAAGCTCAGCATTGGTCAAGGCAATTGAAAGAGAGAGTAGGAAGGTCTATGTAGAATGTGGTCTAAGTAGTTCGGAATGGGAAAACAAGTAGCAAAATTTCACTTTTACTTGGTTTTAGAATCTGAAACTTTGATTTGTATTTTGTCAGATCAGAGGTGCTCCTATACGAATTTTAATCGATATGTATAAGTTAAATCTGATCTGTTAGACTTGTCAGTTGTGGCTGGTAATGTCTTACCCATGTTTTGGTGCCAAATTGAAGTGAACATAGTTTTCAAGTAAGGGTGAATGTCATGATTGTGTATTACTCATCTGGGTTTATTATAATGTTCCAGCTAAAGGACCACCGTTCTCTATATGAAGGTTATGTTCCAATGAAGTTCAGTCGTTATTACAAGAAAATGGCAAAGTATGTGAAATTGTTCAGGATCGTATTCTCTCTTTCTCTCTTATTGAATTGATGGTCATATGTTTATTGAATTGATGGTCATCTGTGGATTAACTGATATTAAACATGTTCTTGAGGAAAAGGCTTCCTTTTTTATTTCTTTAGCATATGATGCAGATCTGGTGAATGGGGTGACCATGTAACCCTACAAGCAGCAGCTGATAAGGTATAACATATAGCTAAAATTAGAAGGCGCCTTTGTATTTGCTACCTTACTGATTGATTGTTGATAATTTAAGATTTTGTATAATATATGCACTAAGCGGTATGAGCAGCAAGTTTTCACATCCTTGGAAATGTAGGCAATCGATGAAAATCAGTGAGACACTGGATTACTGTATTTGCTACCTCACTGAAGTCAGACGCTTGGATTGGGTGTTTTTCCAACTACCATTAAACTTGGAGTTATAAACTAGAAAGAGTGAGAGAGAGTGGAGAGATGGGTGATATACGTCAATAAATGACCAACTATTTCAAAGAATTTGTTTATCCAGTTCCATGAGTGAATTATATTCCAAAAATTCTGGAAGATTTTCCATTAGAACATTGAGTCCTGCCTTCTCTGTTTGTTTCTTGATACAGCTGCTTGATTATGTGCAGTTTGCAGCAAAGATTTGCCTCCTGACATCATTCAGAGATACTTGTTTCATTGAAATTGTTCCACAATCTCAAACTCCCAAACGTGGTAAGTTCAGAAACATGGATACAGATACGAGACATAAATATGATATGGATGTCGAGACATGTCATTTTATATAAATACGATAGGTTTATTAAAATATACCTTTTCCAACTATATATATCATTTTTGTATTTCAAGGAATTTAAAGTGAATAAGTTTATTATACATTTATCAACTTAAAAAATAAGCTTGATGTGTTTGACATACAACTAGTGTCTGATACGTGTTATTGTCCTAACGAGTTTCGGAGTGTCTGACATGTGTCAGATACAGAAACACGGACATATTTTTCAAACTAAAGTTGTATGTCTTAGGTGGTAAGATAGTGTATATTTCTTGAAATTCTCATATATTGATTTTTTATACTGGAAATAATTATTTTATTGACTATATAAAGTGGTACCGAAAGAGATGGAAAGCCAAACCCCTTCAAGAAACTGAATTGGCAGTCGAGTATAGTTTTATATTTTAAAATACCCTAAAAATGTATTTCCAAAAACCTTTGTAAATCCAGCTTTTGTTGCATATCTTTCCTTTATAGTCCTTTTTCCTCCAAGAGTTTTGGCTATGAATATGGAAGAAATGATCATTTTTGCTGCTTAGGTTAAAATCTTTTTGACCGCAAGTTTAAGGTGGAACCTGTAATTATAAAACATTTTGCAAGATATCTGATGTTGATGAAATTCAATATATGCTAACTACACAGGACGGGACTTGGTATCAGGTCTTTCTCCAAACTAACATGTAATTTTGATGTATATTTCCTCGGTACCTCAATTTCTTAATGTGTGGGGGTGAGACGTTTGAATTTCGACACTGTTGCATTTATACACTAAATGCTTTCCTCTAGTTTTGGAACACTAAGGTTCTTACAGCTACTTTACCTTTGCAATAATGCAGAGCTTTGGTTAAGTTTCTGGTCTGAGGTTCACTACAATTCACTTTATGAAATCCAAGGTTTGGTTATACTGCAAAATCCATTTTCTGGTTCTCAGAGTAGCAAGGTTTTCTTTAGAGTGAATGGGCTTATTTGGGAATTGTGTTTTTTATGGTAATAATATGATTATCTTTCTTGCAGATGTTCCAGTTCAACAAAAGCCAAGAAGAAAACATTGGTTGTTCTAGGAAAACGGATTGTTGTGAGTATAGATAGCATCCTAGATATTTGAGTACACTACATCGTAAAATTGTTGTTTTTTTTTTAAAGGGAAAAAAAATGAAAGGAAAAAGAAAAAGAAAACATATGATTGATATTTATATTTCTTTTAATTGTATGTAAATAGTTCATCATACATATATCGCCTTGTATTACTGCACTTATGGATCCAGTGAACAACTCTATAATAGAACTATAGTCTCATTATGAACTCAAAGACGAGACTCTTGTGCAACTGTTTAATTAGCATAACTGAATTCAGTTTAGCATATCCTACAGAGCAAGAAATATTTTATTTGTATTCATGAAATTTCAAACTTGACTGACTTTTGGAACTTTATCACTCTCATAGTTTTTGTGGCATATATATTTATTGGGTGATGGGTAATGGGTAATGGGTAATGGGTAAAGGAAATTGGTTTTGGTGTTCTCATTACAAATTAATAAGGGAAAAATTAAGCAGAAATGCAGAAAACAACCTAGAGATGAGGTTAGCTCAGAGAGAGCAAACCAATCGTGCTCATTTCCAGTTGTTGGCAACAATGTCAGCCAAATCAACAACCCGTTGAGAGTAACCCCACTCGTTGTCATACCAAGCAATAACCTTCACCAAGTCATCCCCCATAACCATAGTCAAGGAAGAGTCAACGGTTGAGGAGACATCAGAGCACCTAAAATCGACCGAAACAAGGGGCTCGTCACAAACAGAGAGGATACCATTGAGCTCCTTTTCAGCACTTTCTCGGAATGCAGCATTCACCTCTTCAGCAAACGTCTTCTTAGAAACCTG

mRNA sequence

CCACATCCATCTCCCCTGCGGGTATTGACTGTTACTTCCTCACAAGGAGAAACCCGCGAAAGCAACAGCCTGGCTCTGGCCTCTGGGCGTCGTTCCATTGAAGCTTCCATTTTTTTCCTTTTGCTTGCGATTTCTCAACCCCTTTCCCCAAATATCCTAATCCCGTCCCACTTCAAACATTCGCCGCTGAAGTCAAACCCGGACGTTTTTTTCAACCCCCACATTTCCCCTTTTTCCTAATTTCATTCATCTTCCTCTTTTCAGTCTCATCGCCTCCTTCCAGTTGCAATTCCAGGGATGGTGAGTAGTTGCTAAGTGGTAGCATGAGGCCGGGAACTCAGAGCATGGGAGAATGTTCGAGTTCTACTTCATTGAGCTCTCATCAGGACACTGACGATGATCGCATGATTGCTGTTGCATTATCGGAAGAGTATGCTAAGCTAGACGGTGCTGTTGCCAGACGCCTCTCCAACCTTGCCCCTATTGCTCATACTCCAAGGATAAATTTGTATATCCCCAACCAAAGTGATGCCAGTTTGGAATATCATAGGCTTCTTCAGAGGTACGTAACCCACATTTTCTGGAAGCTAAATGTCTATGGTTTGCATGAAGTGAAGGTCTCTGGTGATGGAAATTGTCAGTTTCGAGCACTTTCAGATCAGATGTACAAATCACCTGAGTATCACAAGCACGTGCGGAAAGACATTGTAAAGCAGACTTGTCAGTTGTGGCTGCTAAAGGACCACCGTTCTCTATATGAAGGTTATGTTCCAATGAAGTTCAGTCGTTATTACAAGAAAATGGCAAAATCTGGTGAATGGGGTGACCATGTAACCCTACAAGCAGCAGCTGATAAGCTACTTTACCTTTGCAATAATGCAGAGCTTTGGTTAAGTTTCTGGTCTGAGGTTCACTACAATTCACTTTATGAAATCCAAGGTTTGGTTATACTGCAAAATCCATTTTCTGGTTCTCAGAGTAGCAAGATGTTCCAGTTCAACAAAAGCCAAGAAGAAAACATTGGTTGTTCTAGGAAAACGGATTGTTGTGAGTATAGATAGCATCCTAGATATTTGAGTACACTACATCGTAAAATTGTTGTTTTTTTTTTAAAGGGAAAAAAAATGAAAGGAAAAAGAAAAAGAAAACATATGATTGATATTTATATTTCTTTTAATTGTATGTAAATAGTTCATCATACATATATCGCCTTGTATTACTGCACTTATGGATCCAGTGAACAACTCTATAATAGAACTATAGTCTCATTATGAACTCAAAGACGAGACTCTTGTGCAACTGTTTAATTAGCATAACTGAATTCAGTTTAGCATATCCTACAGAGCAAGAAATATTTTATTTGTATTCATGAAATTTCAAACTTGACTGACTTTTGGAACTTTATCACTCTCATAGTTTTTGTGGCATATATATTTATTGGGTGATGGGTAATGGGTAATGGGTAATGGGTAAAGGAAATTGGTTTTGGTGTTCTCATTACAAATTAATAAGGGAAAAATTAAGCAGAAATGCAGAAAACAACCTAGAGATGAGGTTAGCTCAGAGAGAGCAAACCAATCGTGCTCATTTCCAGTTGTTGGCAACAATGTCAGCCAAATCAACAACCCGTTGAGAGTAACCCCACTCGTTGTCATACCAAGCAATAACCTTCACCAAGTCATCCCCCATAACCATAGTCAAGGAAGAGTCAACGGTTGAGGAGACATCAGAGCACCTAAAATCGACCGAAACAAGGGGCTCGTCACAAACAGAGAGGATACCATTGAGCTCCTTTTCAGCACTTTCTCGGAATGCAGCATTCACCTCTTCAGCAAACGTCTTCTTAGAAACCTG

Coding sequence (CDS)

ATGAGGCCGGGAACTCAGAGCATGGGAGAATGTTCGAGTTCTACTTCATTGAGCTCTCATCAGGACACTGACGATGATCGCATGATTGCTGTTGCATTATCGGAAGAGTATGCTAAGCTAGACGGTGCTGTTGCCAGACGCCTCTCCAACCTTGCCCCTATTGCTCATACTCCAAGGATAAATTTGTATATCCCCAACCAAAGTGATGCCAGTTTGGAATATCATAGGCTTCTTCAGAGGTACGTAACCCACATTTTCTGGAAGCTAAATGTCTATGGTTTGCATGAAGTGAAGGTCTCTGGTGATGGAAATTGTCAGTTTCGAGCACTTTCAGATCAGATGTACAAATCACCTGAGTATCACAAGCACGTGCGGAAAGACATTGTAAAGCAGACTTGTCAGTTGTGGCTGCTAAAGGACCACCGTTCTCTATATGAAGGTTATGTTCCAATGAAGTTCAGTCGTTATTACAAGAAAATGGCAAAATCTGGTGAATGGGGTGACCATGTAACCCTACAAGCAGCAGCTGATAAGCTACTTTACCTTTGCAATAATGCAGAGCTTTGGTTAAGTTTCTGGTCTGAGGTTCACTACAATTCACTTTATGAAATCCAAGGTTTGGTTATACTGCAAAATCCATTTTCTGGTTCTCAGAGTAGCAAGATGTTCCAGTTCAACAAAAGCCAAGAAGAAAACATTGGTTGTTCTAGGAAAACGGATTGTTGTGAGTATAGATAG

Protein sequence

MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRINLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEYHKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKLLYLCNNAELWLSFWSEVHYNSLYEIQGLVILQNPFSGSQSSKMFQFNKSQEENIGCSRKTDCCEYR
BLAST of Lsi05G004240 vs. Swiss-Prot
Match: Y4757_DICDI (OTU domain-containing protein DDB_G0284757 OS=Dictyostelium discoideum GN=DDB_G0284757 PE=3 SV=2)

HSP 1 Score: 69.3 bits (168), Expect = 6.5e-11
Identity = 60/194 (30.93%), Postives = 87/194 (44.85%), Query Frame = 1

Query: 40  LDGAVARRLSNLAP-IAHTPRINLY-IPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEV 99
           L+G V + +++ A  I+    +NL+ +P   +  +   RL +R        L +Y L   
Sbjct: 582 LEGLVLKNMNHDASLISSNVLLNLHPLPQSKEVQIAQQRLNER--------LELYMLKNS 641

Query: 100 K-VSGDGNCQFRALSDQMYKSPEYHKHVRKDIVKQTCQLWLLKDHR-SLYEGYVPMKFSR 159
           K + GDGNCQ  ALSDQ+Y    + + VRK IV      WL K+    L  G    +F  
Sbjct: 642 KEIPGDGNCQMHALSDQLYGDLSHSQEVRKTIVD-----WLRKNKDFQLPNGATICQFVN 701

Query: 160 ------YYKKMAKSGEWGDHVTLQAAADKL----------------------LYLCNNAE 202
                 Y   M+K+G WGDH+TL AAA+                          + N+  
Sbjct: 702 TNNWDDYCNDMSKNGNWGDHLTLLAAAEHFGSKISIISSVESQSNFFIEIIPSKILNDKV 761

BLAST of Lsi05G004240 vs. TrEMBL
Match: M5X0D5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011031mg PE=4 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 8.7e-79
Identity = 162/233 (69.53%), Postives = 173/233 (74.25%), Query Frame = 1

Query: 1   MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60
           M  GT S+GECSSSTSLSS QD +DD MIAV LSEEYAKLDGAVARRLSNLAP+ H PRI
Sbjct: 1   MMNGTHSVGECSSSTSLSSQQDVEDDCMIAVVLSEEYAKLDGAVARRLSNLAPVPHIPRI 60

Query: 61  NLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEY 120
           N YIPN SDASL++ RLLQR        L+VYGL+EVKVSGDGNCQFRALSDQMYKSPEY
Sbjct: 61  NSYIPNISDASLDHQRLLQR--------LHVYGLYEVKVSGDGNCQFRALSDQMYKSPEY 120

Query: 121 HKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKL- 180
           HKHVRK+IVKQ      LKD+ SLYEGYVPMK+ RYYKKMAKSGEWGDHVTLQAAADK  
Sbjct: 121 HKHVRKEIVKQ------LKDYHSLYEGYVPMKYKRYYKKMAKSGEWGDHVTLQAAADKFE 180

Query: 181 -------------------LYLCNNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
                               Y     ELWLSFWSEVHYNSLYEI+   I Q P
Sbjct: 181 AKICLLTSFRDTCFIEIMPQYQPPKRELWLSFWSEVHYNSLYEIRDAPIQQKP 219

BLAST of Lsi05G004240 vs. TrEMBL
Match: A0A061EEF8_THECC (Cysteine proteinases superfamily protein OS=Theobroma cacao GN=TCM_017429 PE=4 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 1.5e-78
Identity = 159/226 (70.35%), Postives = 170/226 (75.22%), Query Frame = 1

Query: 1   MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60
           MR G Q +GECSSSTS SS QDT+DD+MIAV LSEEYAKLDGAVARRLS LAP+ H PRI
Sbjct: 1   MRNGVQHVGECSSSTSWSSQQDTEDDQMIAVVLSEEYAKLDGAVARRLSGLAPVPHVPRI 60

Query: 61  NLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEY 120
           N +IPN SDASL++ RLLQR        L VYGL+EVKVSGDGNCQFRALSDQMYKSPEY
Sbjct: 61  NSFIPNVSDASLDHQRLLQR--------LQVYGLYEVKVSGDGNCQFRALSDQMYKSPEY 120

Query: 121 HKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKLL 180
           HKHVRKDIVKQ      LKDHR+LYEGYVPMK+ RY KKMAKSGEWGDHVTLQAA+DK  
Sbjct: 121 HKHVRKDIVKQ------LKDHRNLYEGYVPMKYKRYCKKMAKSGEWGDHVTLQAASDKFA 180

Query: 181 --------------------YLCNNAELWLSFWSEVHYNSLYEIQG 207
                               Y     ELWLSFWSEVHYNSLYEIQG
Sbjct: 181 AKICLLTSFRDTCFVEIMPQYQAPKHELWLSFWSEVHYNSLYEIQG 212

BLAST of Lsi05G004240 vs. TrEMBL
Match: D7SVD8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0068g00690 PE=4 SV=1)

HSP 1 Score: 295.8 bits (756), Expect = 4.8e-77
Identity = 156/233 (66.95%), Postives = 171/233 (73.39%), Query Frame = 1

Query: 1   MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60
           M  G QS+GECSSSTSLSS QD +DDRMIAV LSEE+AKLDGAV RRL++L P+ H PRI
Sbjct: 37  MTNGMQSVGECSSSTSLSSQQDLEDDRMIAVVLSEEFAKLDGAVGRRLASLEPVRHVPRI 96

Query: 61  NLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEY 120
           N YIPN SDASL++ RL QR        LNVY L+EVKVSGDGNCQFRALSDQMYKSPEY
Sbjct: 97  NFYIPNLSDASLDHQRLQQR--------LNVYRLYEVKVSGDGNCQFRALSDQMYKSPEY 156

Query: 121 HKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKLL 180
           HKHVRK+IVKQ      LKD+RSLYEGYVPMK+ RYYKKMAKSGEWGDH+TLQAAAD+  
Sbjct: 157 HKHVRKEIVKQ------LKDYRSLYEGYVPMKYKRYYKKMAKSGEWGDHITLQAAADRFA 216

Query: 181 --------------------YLCNNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
                               Y     ELWLSFWSEVHYNSLYEI+   I Q P
Sbjct: 217 AKICLLTSFRDTCFIEIIPQYQAPKRELWLSFWSEVHYNSLYEIKDAPIRQKP 255

BLAST of Lsi05G004240 vs. TrEMBL
Match: A0A067L3T3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26911 PE=4 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 1.8e-76
Identity = 154/232 (66.38%), Postives = 171/232 (73.71%), Query Frame = 1

Query: 2   RPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRIN 61
           R GT S+GE SSSTS SS QDT+DDRMIA+ LSEEYA LDGAVARRL+NLAP+ H PRIN
Sbjct: 9   RQGTWSVGESSSSTSWSSQQDTEDDRMIALVLSEEYANLDGAVARRLANLAPVPHVPRIN 68

Query: 62  LYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEYH 121
            YIPN SDAS+++ RL+QR        LNVYGL+EV+VSGDGNCQFRALSDQMYKSPE+H
Sbjct: 69  TYIPNLSDASMDHQRLIQR--------LNVYGLYEVRVSGDGNCQFRALSDQMYKSPEHH 128

Query: 122 KHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKLL- 181
           KH+RK++VKQ      LKD+ SLYEGYVPMK+ RYYKKMAKSGEWGDHVTLQAAADK   
Sbjct: 129 KHIRKEVVKQ------LKDNHSLYEGYVPMKYKRYYKKMAKSGEWGDHVTLQAAADKFAA 188

Query: 182 -------------------YLCNNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
                              Y     ELWLSFWSEVHYNSLYEIQ   I   P
Sbjct: 189 KICLLTSFRDTCFIEIMPQYQSPQRELWLSFWSEVHYNSLYEIQDAPIPHKP 226

BLAST of Lsi05G004240 vs. TrEMBL
Match: A0A059APN3_EUCGR (Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_I01566 PE=4 SV=1)

HSP 1 Score: 293.5 bits (750), Expect = 2.4e-76
Identity = 155/230 (67.39%), Postives = 173/230 (75.22%), Query Frame = 1

Query: 4   GTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRINLY 63
           GT S+GECSSSTSLSS QD +DDRMIA+ LSEE+AK+DG VARRLSNLAP+ H PRIN Y
Sbjct: 111 GTHSVGECSSSTSLSSQQDLEDDRMIALVLSEEFAKVDGGVARRLSNLAPVRHVPRINTY 170

Query: 64  IPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEYHKH 123
           IP+ SDASL++ RLLQR        LN+YGL+EVKVSGDGNCQFRALSDQMYKSPEYHK+
Sbjct: 171 IPDLSDASLDHQRLLQR--------LNIYGLYEVKVSGDGNCQFRALSDQMYKSPEYHKN 230

Query: 124 VRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKLL-YL 183
           VRK+IVKQ      LKD+RSLYEGYVPMK+ RYYKKMAK GEWGDHVTLQAAADK +  +
Sbjct: 231 VRKEIVKQ------LKDYRSLYEGYVPMKYKRYYKKMAKLGEWGDHVTLQAAADKFVAKI 290

Query: 184 C-------------------NNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
           C                      E WLSFWSEVHYNSLYEI+   I Q P
Sbjct: 291 CLLTSFRDTCFIEIMPRSEAPQREFWLSFWSEVHYNSLYEIRDAPIPQKP 326

BLAST of Lsi05G004240 vs. TAIR10
Match: AT3G02070.1 (AT3G02070.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 266.5 bits (680), Expect = 1.6e-71
Identity = 138/226 (61.06%), Postives = 160/226 (70.80%), Query Frame = 1

Query: 8   MGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRINLYIPNQ 67
           MG+ SSSTS SS +DT+DDRMIA  LSEEY+KLDGAV RRLSNLAP+ H PRIN YIPN 
Sbjct: 1   MGDSSSSTSWSSKKDTEDDRMIAFMLSEEYSKLDGAVGRRLSNLAPVPHVPRINCYIPNL 60

Query: 68  SDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEYHKHVRKD 127
           +DA+L++ RLLQR        LNVYGL E+KVSGDGNCQFRALSDQ+Y+SPEYHK VR++
Sbjct: 61  NDATLDHQRLLQR--------LNVYGLCELKVSGDGNCQFRALSDQLYRSPEYHKQVRRE 120

Query: 128 IVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKLL------- 187
           +VKQ      LK+ RS+YE YVPMK+ RYYKKM K GEWGDH+TLQAAAD+         
Sbjct: 121 VVKQ------LKECRSMYESYVPMKYKRYYKKMGKFGEWGDHITLQAAADRFAAKICLLT 180

Query: 188 -------------YLCNNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
                        Y      LWLSFWSEVHYNSLY+IQ   +   P
Sbjct: 181 SFRDTCFIEIIPQYQAPKGVLWLSFWSEVHYNSLYDIQAAPVQHKP 212

BLAST of Lsi05G004240 vs. TAIR10
Match: AT3G22260.2 (AT3G22260.2 Cysteine proteinases superfamily protein)

HSP 1 Score: 184.1 bits (466), Expect = 1.0e-46
Identity = 106/217 (48.85%), Postives = 136/217 (62.67%), Query Frame = 1

Query: 7   SMGECSSSTSLSSHQDTDDDRMIAVALSE-EYAKLDGAVARRLSNLAPIAHTPRINLYIP 66
           S    S+S+  SS  DTDDD+ IA  L+E E  + +G + +RLS+L  I HTPR+N  IP
Sbjct: 21  STSASSNSSFSSSVADTDDDQTIARILAEDESLRREGKLGKRLSHLDSIPHTPRVNREIP 80

Query: 67  NQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEYHKHVR 126
           + +DA+L++  L  R  T        YGL E+++ GDGNCQFRAL+DQ++++ +YHKHVR
Sbjct: 81  DINDATLDHELLSGRLAT--------YGLAELQMEGDGNCQFRALADQLFRNADYHKHVR 140

Query: 127 KDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADK------- 186
           K +VKQ      LK  R LYE YVPMK+  Y +KM K GEWGDHVTLQAAAD+       
Sbjct: 141 KHVVKQ------LKQQRKLYEEYVPMKYRHYTRKMKKHGEWGDHVTLQAAADRFEAKICL 200

Query: 187 ---------LLYLCNN----AELWLSFWSEVHYNSLY 203
                    +  L +N     E WLSFWSEVHYNSLY
Sbjct: 201 VTSFRDQSYIEILPHNKNPLREAWLSFWSEVHYNSLY 223

BLAST of Lsi05G004240 vs. TAIR10
Match: AT5G04250.1 (AT5G04250.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 158.3 bits (399), Expect = 6.0e-39
Identity = 88/204 (43.14%), Postives = 122/204 (59.80%), Query Frame = 1

Query: 19  SHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRINLYIPNQSDASLEYHRLL 78
           S    DDD + +V + EE       V +RL+ + PIAH P+IN  +P++ +   ++ RL 
Sbjct: 140 SSPSRDDDSVCSVEIEEESWS---EVGKRLNQMIPIAHVPKINGELPSEDEQISDHERLF 199

Query: 79  QRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEYHKHVRKDIVKQTCQLWLL 138
           QR        L +YGL E K+ GDGNCQFR+LSDQ+Y+SPE+H  VR+ +V Q      L
Sbjct: 200 QR--------LQLYGLVENKIEGDGNCQFRSLSDQLYRSPEHHNFVREQVVNQ------L 259

Query: 139 KDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAAD------------------KLL 198
             +R +YEGYVPM ++ Y K M ++GEWGDHVTLQAAAD                  ++L
Sbjct: 260 AYNREIYEGYVPMAYNDYLKAMKRNGEWGDHVTLQAAADLFGVRMFVITSFKDTCYIEIL 319

Query: 199 --YLCNNAELWLSFWSEVHYNSLY 203
             +  +N  + LSFW+EVHYNS+Y
Sbjct: 320 PHFQKSNRLICLSFWAEVHYNSIY 326

BLAST of Lsi05G004240 vs. TAIR10
Match: AT5G03330.1 (AT5G03330.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 157.9 bits (398), Expect = 7.9e-39
Identity = 90/207 (43.48%), Postives = 120/207 (57.97%), Query Frame = 1

Query: 16  SLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRINLYIPNQSDASLEYH 75
           S SS  DTD+      +   +    DG   RRL+ + PI + P+IN  IP + +A  ++ 
Sbjct: 146 SCSSPSDTDE---YVYSWESDQCDADGEFGRRLNQMVPIPYIPKINGEIPPEEEAVSDHE 205

Query: 76  RLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEYHKHVRKDIVKQTCQL 135
           RL  R        L ++   EVKV GDGNCQFRAL+DQ+YK+ + HKHVR+ IVKQ    
Sbjct: 206 RLRNR--------LEMFDFTEVKVPGDGNCQFRALADQLYKTADRHKHVRRQIVKQ---- 265

Query: 136 WLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAAD----KLLYLC-------- 195
             LK     Y+GYVPM FS Y +KM++SGEWGDHVTLQAAAD    K++ L         
Sbjct: 266 --LKSRPDSYQGYVPMDFSDYLRKMSRSGEWGDHVTLQAAADAYRVKIVVLTSFKDTCYI 325

Query: 196 --------NNAELWLSFWSEVHYNSLY 203
                   +   ++LSFW+EVHYN++Y
Sbjct: 326 EILPTSQESKGVIFLSFWAEVHYNAIY 335

BLAST of Lsi05G004240 vs. TAIR10
Match: AT2G39320.1 (AT2G39320.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 59.7 bits (143), Expect = 2.9e-09
Identity = 40/143 (27.97%), Postives = 68/143 (47.55%), Query Frame = 1

Query: 99  VSGDGNCQFRALSDQMYKSPEYHKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYK 158
           +  DGNCQFRAL+DQ+Y++ + H+ VR++IVKQ   L                       
Sbjct: 2   MKSDGNCQFRALADQLYQNSDCHELVRQEIVKQNMSL----------------------- 61

Query: 159 KMAKSGEWGDHVTLQAAAD----KLLYLCN-----------------NAELWLSFWSEVH 218
             + + +WGD VTL+ AAD    K++ + +                 +  + +S+ + +H
Sbjct: 62  --STNSQWGDEVTLRVAADVYQVKIILITSIKLIPFMEFLPKSQKEPDKVIHMSYLAGIH 112

Query: 219 YNSLYEIQGLVILQNPFSGSQSS 221
           +NS+Y+       +N   GS+SS
Sbjct: 122 FNSIYK-------KNKEKGSRSS 112

BLAST of Lsi05G004240 vs. NCBI nr
Match: gi|659098078|ref|XP_008449968.1| (PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X2 [Cucumis melo])

HSP 1 Score: 337.0 bits (863), Expect = 2.7e-89
Identity = 177/233 (75.97%), Postives = 185/233 (79.40%), Query Frame = 1

Query: 1   MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60
           MRP TQS+GECSSSTSLSSHQD DDD MIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI
Sbjct: 1   MRPETQSIGECSSSTSLSSHQDVDDDCMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60

Query: 61  NLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEY 120
           NLYIPNQSDASLEYHRLLQR        L+VYGLHEVKVSGDGNCQFRALSDQ+Y+SPEY
Sbjct: 61  NLYIPNQSDASLEYHRLLQR--------LSVYGLHEVKVSGDGNCQFRALSDQLYRSPEY 120

Query: 121 HKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKL- 180
           HKHVRKD+VKQ      LKDHRSLYEGYVPMK+SRYYKKMAKSGEWGDHVTLQAAADK  
Sbjct: 121 HKHVRKDVVKQ------LKDHRSLYEGYVPMKYSRYYKKMAKSGEWGDHVTLQAAADKFA 180

Query: 181 -------------------LYLCNNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
                              L      ELWLSFWSEVHYNSLYEIQ + + Q P
Sbjct: 181 AKICLLTSFRDTCFIEIVPLSQTPKRELWLSFWSEVHYNSLYEIQDVPVQQKP 219

BLAST of Lsi05G004240 vs. NCBI nr
Match: gi|659098074|ref|XP_008449966.1| (PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X1 [Cucumis melo])

HSP 1 Score: 337.0 bits (863), Expect = 2.7e-89
Identity = 177/233 (75.97%), Postives = 185/233 (79.40%), Query Frame = 1

Query: 1   MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60
           MRP TQS+GECSSSTSLSSHQD DDD MIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI
Sbjct: 23  MRPETQSIGECSSSTSLSSHQDVDDDCMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 82

Query: 61  NLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEY 120
           NLYIPNQSDASLEYHRLLQR        L+VYGLHEVKVSGDGNCQFRALSDQ+Y+SPEY
Sbjct: 83  NLYIPNQSDASLEYHRLLQR--------LSVYGLHEVKVSGDGNCQFRALSDQLYRSPEY 142

Query: 121 HKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKL- 180
           HKHVRKD+VKQ      LKDHRSLYEGYVPMK+SRYYKKMAKSGEWGDHVTLQAAADK  
Sbjct: 143 HKHVRKDVVKQ------LKDHRSLYEGYVPMKYSRYYKKMAKSGEWGDHVTLQAAADKFA 202

Query: 181 -------------------LYLCNNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
                              L      ELWLSFWSEVHYNSLYEIQ + + Q P
Sbjct: 203 AKICLLTSFRDTCFIEIVPLSQTPKRELWLSFWSEVHYNSLYEIQDVPVQQKP 241

BLAST of Lsi05G004240 vs. NCBI nr
Match: gi|449455768|ref|XP_004145623.1| (PREDICTED: OTU domain-containing protein DDB_G0284757 [Cucumis sativus])

HSP 1 Score: 334.0 bits (855), Expect = 2.3e-88
Identity = 175/233 (75.11%), Postives = 186/233 (79.83%), Query Frame = 1

Query: 1   MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60
           M P TQS+GECSSSTSLSSHQD +DDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI
Sbjct: 1   MTPETQSIGECSSSTSLSSHQDVEDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60

Query: 61  NLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEY 120
           NLYIPNQSDASLEYHRLLQR        L+VYGLHEVKVSGDGNCQFRALSDQMY+SPEY
Sbjct: 61  NLYIPNQSDASLEYHRLLQR--------LSVYGLHEVKVSGDGNCQFRALSDQMYRSPEY 120

Query: 121 HKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKL- 180
           HKHVRKD+VKQ      LKDHRSLYEGYVPMK+SRYYKKMAKSGEWGDHVTLQAAADK  
Sbjct: 121 HKHVRKDVVKQ------LKDHRSLYEGYVPMKYSRYYKKMAKSGEWGDHVTLQAAADKFA 180

Query: 181 LYLC-------------------NNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
             +C                      ELWLSFWSEVHYNSLYEI+ + + + P
Sbjct: 181 AKICLLTSFRDTCFIEIVPQSQTPKRELWLSFWSEVHYNSLYEIKDVPVQEKP 219

BLAST of Lsi05G004240 vs. NCBI nr
Match: gi|595974200|ref|XP_007217756.1| (hypothetical protein PRUPE_ppa011031mg [Prunus persica])

HSP 1 Score: 301.6 bits (771), Expect = 1.3e-78
Identity = 162/233 (69.53%), Postives = 173/233 (74.25%), Query Frame = 1

Query: 1   MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60
           M  GT S+GECSSSTSLSS QD +DD MIAV LSEEYAKLDGAVARRLSNLAP+ H PRI
Sbjct: 1   MMNGTHSVGECSSSTSLSSQQDVEDDCMIAVVLSEEYAKLDGAVARRLSNLAPVPHIPRI 60

Query: 61  NLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEY 120
           N YIPN SDASL++ RLLQR        L+VYGL+EVKVSGDGNCQFRALSDQMYKSPEY
Sbjct: 61  NSYIPNISDASLDHQRLLQR--------LHVYGLYEVKVSGDGNCQFRALSDQMYKSPEY 120

Query: 121 HKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKL- 180
           HKHVRK+IVKQ      LKD+ SLYEGYVPMK+ RYYKKMAKSGEWGDHVTLQAAADK  
Sbjct: 121 HKHVRKEIVKQ------LKDYHSLYEGYVPMKYKRYYKKMAKSGEWGDHVTLQAAADKFE 180

Query: 181 -------------------LYLCNNAELWLSFWSEVHYNSLYEIQGLVILQNP 214
                               Y     ELWLSFWSEVHYNSLYEI+   I Q P
Sbjct: 181 AKICLLTSFRDTCFIEIMPQYQPPKRELWLSFWSEVHYNSLYEIRDAPIQQKP 219

BLAST of Lsi05G004240 vs. NCBI nr
Match: gi|590648140|ref|XP_007032092.1| (Cysteine proteinases superfamily protein [Theobroma cacao])

HSP 1 Score: 300.8 bits (769), Expect = 2.1e-78
Identity = 159/226 (70.35%), Postives = 170/226 (75.22%), Query Frame = 1

Query: 1   MRPGTQSMGECSSSTSLSSHQDTDDDRMIAVALSEEYAKLDGAVARRLSNLAPIAHTPRI 60
           MR G Q +GECSSSTS SS QDT+DD+MIAV LSEEYAKLDGAVARRLS LAP+ H PRI
Sbjct: 1   MRNGVQHVGECSSSTSWSSQQDTEDDQMIAVVLSEEYAKLDGAVARRLSGLAPVPHVPRI 60

Query: 61  NLYIPNQSDASLEYHRLLQRYVTHIFWKLNVYGLHEVKVSGDGNCQFRALSDQMYKSPEY 120
           N +IPN SDASL++ RLLQR        L VYGL+EVKVSGDGNCQFRALSDQMYKSPEY
Sbjct: 61  NSFIPNVSDASLDHQRLLQR--------LQVYGLYEVKVSGDGNCQFRALSDQMYKSPEY 120

Query: 121 HKHVRKDIVKQTCQLWLLKDHRSLYEGYVPMKFSRYYKKMAKSGEWGDHVTLQAAADKLL 180
           HKHVRKDIVKQ      LKDHR+LYEGYVPMK+ RY KKMAKSGEWGDHVTLQAA+DK  
Sbjct: 121 HKHVRKDIVKQ------LKDHRNLYEGYVPMKYKRYCKKMAKSGEWGDHVTLQAASDKFA 180

Query: 181 --------------------YLCNNAELWLSFWSEVHYNSLYEIQG 207
                               Y     ELWLSFWSEVHYNSLYEIQG
Sbjct: 181 AKICLLTSFRDTCFVEIMPQYQAPKHELWLSFWSEVHYNSLYEIQG 212

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y4757_DICDI6.5e-1130.93OTU domain-containing protein DDB_G0284757 OS=Dictyostelium discoideum GN=DDB_G0... [more]
Match NameE-valueIdentityDescription
M5X0D5_PRUPE8.7e-7969.53Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011031mg PE=4 SV=1[more]
A0A061EEF8_THECC1.5e-7870.35Cysteine proteinases superfamily protein OS=Theobroma cacao GN=TCM_017429 PE=4 S... [more]
D7SVD8_VITVI4.8e-7766.95Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0068g00690 PE=4 SV=... [more]
A0A067L3T3_JATCU1.8e-7666.38Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26911 PE=4 SV=1[more]
A0A059APN3_EUCGR2.4e-7667.39Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_I01566 PE=4 ... [more]
Match NameE-valueIdentityDescription
AT3G02070.11.6e-7161.06 Cysteine proteinases superfamily protein[more]
AT3G22260.21.0e-4648.85 Cysteine proteinases superfamily protein[more]
AT5G04250.16.0e-3943.14 Cysteine proteinases superfamily protein[more]
AT5G03330.17.9e-3943.48 Cysteine proteinases superfamily protein[more]
AT2G39320.12.9e-0927.97 Cysteine proteinases superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659098078|ref|XP_008449968.1|2.7e-8975.97PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X2 [Cucumis melo][more]
gi|659098074|ref|XP_008449966.1|2.7e-8975.97PREDICTED: OTU domain-containing protein DDB_G0284757 isoform X1 [Cucumis melo][more]
gi|449455768|ref|XP_004145623.1|2.3e-8875.11PREDICTED: OTU domain-containing protein DDB_G0284757 [Cucumis sativus][more]
gi|595974200|ref|XP_007217756.1|1.3e-7869.53hypothetical protein PRUPE_ppa011031mg [Prunus persica][more]
gi|590648140|ref|XP_007032092.1|2.1e-7870.35Cysteine proteinases superfamily protein [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR003323OTU_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006508 proteolysis
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0008233 peptidase activity
molecular_function GO:0005524 ATP binding
molecular_function GO:0004386 helicase activity
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi05G004240.1Lsi05G004240.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003323OTU domainPFAMPF02338OTUcoord: 101..179
score: 3.
IPR003323OTU domainPROFILEPS50802OTUcoord: 94..204
score: 10
NoneNo IPR availablePANTHERPTHR12419OTU DOMAIN CONTAINING PROTEINcoord: 1..213
score: 2.9E
NoneNo IPR availablePANTHERPTHR12419:SF3SUBFAMILY NOT NAMEDcoord: 1..213
score: 2.9E
NoneNo IPR availableunknownSSF54001Cysteine proteinasescoord: 79..204
score: 1.96