Lsi07G012670.1 (mRNA) Bottle gourd (USVL1VR-Ls)

NameLsi07G012670.1
TypemRNA
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionATP-dependent caseinolytic protease/crotonase family protein
Locationchr07 : 18495064 .. 18499659 (+)
Sequence length1897
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTCGATGGAAGACAAACAAAAATTTGGCGCCAACTGATCCCGCGAATTTCAAATCAAAACCAACAAAAATTTTAAGAAGTCAGAGAAAATTTTTCTCCATGAATCAAATGTTGGTTGTGTTATGAATCTTGCGAAATCGGAGCCATGGAGAATTCGAGCTTCACTGATTCTCAGGTATAGTTTTCTTCTCCACGCTGTGGTTATCGATTTTTCATTTTCTGTTTGGATCCTTCGGACATATGAGGAATTAAAAGCGTCTTTTGCTGGAAAATTTTGTGTTACTGATATGCTAATTCCAATGACTTGTTTAATTCGACTTAATGAATCTGTAGCAGCTTCGTAGAGCAATCTTTGCCCCAAATTTCTATTTCTAATTCTTTCATTTGCTTCGTTTTTTGTAGGTTATGCATTCTTCGTTCTTGGATTCGCCGTCTTCAGGTTCGTACGATTCTCCCTGATTCACCGTCTTGATATCTGTTTGGATGTTGAGAAAATGTAGAAAAGAAAATTATGGCATAGGAAGATAGTGATCTTAATTAGGTTTTTGAATATGTTTTTGTGTGTTTTTGAGTTAGGTCATAGTTGGACTCACTTTTATTCGTTTGTTTAATTTCCATCAGAGCCGCCTGATATCAGGAACTGGTTTTCGAGCTATGAATATGAATCTCCCGAACTTGATTCAAACGATAATTTTGGAGATTCAATTTCGAGAGAGAAAGAATTTGAGGTCAAGGAAGATGAGCAGACTGCGGGGGAGGTTAGGGATGTCACTAAGATTGAAGAAGAAGAAGCTGCTGAGAAGCTTCCCGGAACTTGGATTCCTTTGAATTGCACTTCAAGGGGAGATCATCGTGAAAGCCAGCCACTAGGCATGGTATAGAAAAACTGTCATTCCTTTCAACGGTTTTTTTCCTTCAATTAATTGCAAAATAAGAATTTGTTTAAGAATTTGGTGGCTAAATCTCCCACCAAATGCTTGAACTTCAGTCACTTATTGAAATGCTATTTCTTTCTTTTGGAACCTTTGGGGCGACCGGAACGAGCGTCTTTCAAAGGCTCCTTTTCATCTTTGGATCGTGTTATAGAATTAGTGCTTTCTACGCTTTCTTTGGTGCAAAACTAAGTATCATTTTTCTCCTTTCAGCTTATATTATTTATGCTCTAATTAGAGGCTTTTATTGTAAACATTTATTGGTGTTGGGTTTATCCCTTTATTTCATTCCTTAATGAAATATTTCTATTTTCAACAAATTAAAAGAATTAAAAAATTAAAAAAATTAAAAAAATATGAGATTCTAACAAGAAAAATCAATGGTTAATTAACCAAAGTTTAAATATCTTAATTGGCTATATAGAAATAGTTTAAGGGATCACGTAATGGAGAAACTTAGACTTTGTTGTATAATGATTTTCTGTTTGGTTTTTCGTTTGTTCTAAAAAAGTTAAGGTTATTTACAAATAACTTGTGTTCTGTTTACCCTTTTACTAAGCAAATCTCCGTCAAATTTAAAAGGCAAACAAGTTCTTCCTTCTTCTTCCTTCTTATTCTTTAAAGAAAACATTGTTGGAAATTAGGTTATTGATTGTAAAATTATTAGTAAACAGGTTTAATTCTCAAACTAGAAAACTAACAACAAAATTATTTTTGAATTAACTACAAAGTACTTCTAGATTTGATATTTTTATGTTATTTGAACTGTAGGCAAGCAGGTGTAAGAAAGCTCAAAATTGCTAACTTCTAACAGTTAATTTCTCTTTTTGTATTTTTATGTTAGAATCAAGATTCTTGGTGCTCCCACTCGCTTCTGTCAGGTTTGTACTATTCACCCTGGAATTCAATTTTCCAATAAAAATTAATATTCTATCCTCACCATCTATTTGATTTGGCATTTTTTTTCCCTTTTTCATCAGAGCCTCCTGATATTGGGAACTGGTTTTCAAGTTATGTATACGAATCCCCTACATTGAATCCAAGCCAGGAATTTGGATATTGTGAGAGCAAGAAAACTACGTTGGTCCATAAAATAGAAGAGACTTCGGATAATGTTAGAAAAACCAAAAATGGAGGTGCAGGAGTACAGTTGAACCCGTTAAAATCCAATGGCAATTCCACGGGCAACAGACAGGATAATCAGTCCCCAAGCGAGGTACAAGTTTTGCCTTTTTTTTTTTCCTCGAATGGAAATAGAACTTTTCATTAAAGGCTAATTAATATAAACATTAAATACCTACAAATAGCACTTATAAGATAAACAAAGATCTTCTAACTTCTAATTTGGACTTCAACACCCTTCCCCCTAAGCCCAGCTTAGATGGTGCGGGCTCATAGAAAGTTTAGGAGAGTTTTTCTGAATCCTAAGTTTGAGGCTTCAAGCTAAATGCTTAGTATAAGAAAATGAAAATCTTTCTATCTTCTAGATATAACTCACTTGCCTAATATGTGGGGACAATACTCCTTTTTATTGGCAAGAGATCCCTGGCCAAAGTAAGATACATTTTCTATCCCATTCTTGAATTCTATGTATCTTAGATATTTCTTTGGACATCAACCATATCTAAGAGGAAATTCATTGGAACCTAGAATTATTATTTAATGTTAGAATATTTCTAGAGCCCTTTTAGATATTTATGTATACTCTTTAAGATATTTCTAGAGAAAAATAAAATATACATTTAAGATATTAATATGAAAACTCCAAGAAGTTTCTAGAAAAATTAAGCTCCCATTTGAAAATTTCATTTTTAATTTTCTGATTTTGAAATTTATGTTGTTTTCCCAAATTTCTTTTTATGGCTTTCTCATGTCTTAATAAAACGCTTGAATTTTTAGTCATGTTCTAAAAACAAAAATTGGCTTAATTTTTTAGAACATTGATAGAAAGTTTATAACGAAATAAAGAAACTTATAGGTGGAAGTAGTGTTTATAAGCTTAATTTTTAGAAACAAGAAATCACAAACTAAATGGTCATCAAATGGGCGTGAATGTTTATATTTTATTTTAACAACGCGGTAATGGAACTGATAATTTATCTCATTCAAGTTAAAGATCATCCCCATGTCAAAATTTAATCAATGTTACAACGGTAGAGTTGGATGACCCTTTTTCGAGTGCTTTTATTTCAAGTCATTTCCATTCTCTATTAGTTTGAGAGACTCAATTGTTCATAGGAAGTAAAAGTCTTTTACCCTGTTCTCAGCAGAATTTGTTTTCTGAAAGAACCTCGGAGCAAGATCCAAAAGAGAAAACTGCGGGAACCAACGAAATTAGTCCAACGAAAGAAGTTCCAAACTCAAGCCTAACTACTGAAGATCCTCAATGTAAGTTACAAGAGAGAGTTTTGCAAGAGAATGGTTCTGTGCCACTGCATATAAATGGAAGTTTCAATGATGATAATAAGAAGCCTGCAACTCATACAAATTTGATCCATAAGATTGATCTCATGCTGGAAAGTTCTGAAACTAAAAGTGAGGTCCAACCACAACTTAATGGCTCATCAGCTGGAAATGATGTGCCTTCTGTCTTTACCAATGAACAGTCAATAAGAGAACCAATTGATCTAAGCCACAACAAACTCAAAGAAAACAAAGAGAAAGGAGTTTCAAATGTTGGTTTCATTACAGCAAACAAGCGTGGGTTTTCTGGAGCAAGTTGCAAGAAATCAGTGGAAATGCAAGAAAACTACAATAATAAAGAGGCAGGAAGTGCTTTTGCTTGTTTAAGAAGAAAACCATTGTCAGACAAAACCAATACTGAGCACTCTAATATCTTAGAGATAGTTGGGAAATGGAGTTGTCCTCAGAAGAGTAAGCCGAATCTTGGACCGCCATTGAAGCAGCTTCGACTTGAGCGATGGGTTCACAAGAAATAAAACTTGATATTAATGGAGGGACTGCTCCCAATATGATTATGATATGCAAATCTTCTCATTGTGTTAGCATCTCCACTGTATATACTTTTGTTGTAAGGTATAAGTAGAAGAAGAAACACTGGCAAGAAAGTGATGGAGGCTTTTGCAGAAAAGGTTTCCCCCTGATGTTTCCTTTCCATGAAATTCAATCATATCAAGCAGAGAGTTTCAAACTCAAGACAAAAGGATGAAATCTGTATAATAGCTTCAAATGGTAAGTCCTTCAACTTTGCACTTCCAACCTTTGGGTTTCTTCTATTGTAAATGGTTTCTGGCTTGTACATTTTTGTTCTTTTTTTATTTTTAGTTTAACAAGTGGGGTGTAGAGATTCAAACTTTTTTTACCTAATGATATATACTTTAACCAGTTGAATTATATTTATATGAATCATTCTTATCTTTTAAATTTTGATCTCCCTCTAACTTTTATATATTTTTTGCTTTATTCAGGCTAATGAGCAGATAGAAGTAAGTTCACCTCAAGTAGGTGGGTGCCTAGGCTTTCTTTTCTCCCATGATACGTGAGGTGAGGGTTCGAACTTCTAATCTTTCGAATGAAGATAAAATGTCTTAACTAGTTATGTTATATGCAAGTTGACAAATTATTTATTAATTATTTTATGTGGACAAATTTTACATCAATTAGATTATTTATTAACGATTAGATGTGGATGAACTTTCAAGTT

mRNA sequence

CTTCGATGGAAGACAAACAAAAATTTGGCGCCAACTGATCCCGCGAATTTCAAATCAAAACCAACAAAAATTTTAAGAAGTCAGAGAAAATTTTTCTCCATGAATCAAATGTTGGTTGTGTTATGAATCTTGCGAAATCGGAGCCATGGAGAATTCGAGCTTCACTGATTCTCAGGTTATGCATTCTTCGTTCTTGGATTCGCCGTCTTCAGAGCCGCCTGATATCAGGAACTGGTTTTCGAGCTATGAATATGAATCTCCCGAACTTGATTCAAACGATAATTTTGGAGATTCAATTTCGAGAGAGAAAGAATTTGAGGTCAAGGAAGATGAGCAGACTGCGGGGGAGGTTAGGGATGTCACTAAGATTGAAGAAGAAGAAGCTGCTGAGAAGCTTCCCGGAACTTGGATTCCTTTGAATTGCACTTCAAGGGGAGATCATCGTGAAAGCCAGCCACTAGGCATGAATCAAGATTCTTGGTGCTCCCACTCGCTTCTGTCAGAGCCTCCTGATATTGGGAACTGGTTTTCAAGTTATGTATACGAATCCCCTACATTGAATCCAAGCCAGGAATTTGGATATTGTGAGAGCAAGAAAACTACGTTGGTCCATAAAATAGAAGAGACTTCGGATAATGTTAGAAAAACCAAAAATGGAGGTGCAGGAGTACAGTTGAACCCGTTAAAATCCAATGGCAATTCCACGGGCAACAGACAGGATAATCAGTCCCCAAGCGAGAATTTGTTTTCTGAAAGAACCTCGGAGCAAGATCCAAAAGAGAAAACTGCGGGAACCAACGAAATTAGTCCAACGAAAGAAGTTCCAAACTCAAGCCTAACTACTGAAGATCCTCAATGTAAGTTACAAGAGAGAGTTTTGCAAGAGAATGGTTCTGTGCCACTGCATATAAATGGAAGTTTCAATGATGATAATAAGAAGCCTGCAACTCATACAAATTTGATCCATAAGATTGATCTCATGCTGGAAAGTTCTGAAACTAAAAGTGAGGTCCAACCACAACTTAATGGCTCATCAGCTGGAAATGATGTGCCTTCTGTCTTTACCAATGAACAGTCAATAAGAGAACCAATTGATCTAAGCCACAACAAACTCAAAGAAAACAAAGAGAAAGGAGTTTCAAATGTTGGTTTCATTACAGCAAACAAGCGTGGGTTTTCTGGAGCAAGTTGCAAGAAATCAGTGGAAATGCAAGAAAACTACAATAATAAAGAGGCAGGAAGTGCTTTTGCTTGTTTAAGAAGAAAACCATTGTCAGACAAAACCAATACTGAGCACTCTAATATCTTAGAGATAGTTGGGAAATGGAGTTGTCCTCAGAAGAGTAAGCCGAATCTTGGACCGCCATTGAAGCAGCTTCGACTTGAGCGATGGGTTCACAAGAAATAAAACTTGATATTAATGGAGGGACTGCTCCCAATATGATTATGATATGCAAATCTTCTCATTGTGTTAGCATCTCCACTGTATATACTTTTGTTGTAAGGTATAAGTAGAAGAAGAAACACTGGCAAGAAAGTGATGGAGGCTTTTGCAGAAAAGGTTTCCCCCTGATGTTTCCTTTCCATGAAATTCAATCATATCAAGCAGAGAGTTTCAAACTCAAGACAAAAGGATGAAATCTGTATAATAGCTTCAAATGGCTAATGAGCAGATAGAAGTAAGTTCACCTCAAGTAGGTGGGTGCCTAGGCTTTCTTTTCTCCCATGATACGTGAGGTGAGGGTTCGAACTTCTAATCTTTCGAATGAAGATAAAATGTCTTAACTAGTTATGTTATATGCAAGTTGACAAATTATTTATTAATTATTTTATGTGGACAAATTTTACATCAATTAGATTATTTATTAACGATTAGATGTGGATGAACTTTCAAGTT

Coding sequence (CDS)

ATGGAGAATTCGAGCTTCACTGATTCTCAGGTTATGCATTCTTCGTTCTTGGATTCGCCGTCTTCAGAGCCGCCTGATATCAGGAACTGGTTTTCGAGCTATGAATATGAATCTCCCGAACTTGATTCAAACGATAATTTTGGAGATTCAATTTCGAGAGAGAAAGAATTTGAGGTCAAGGAAGATGAGCAGACTGCGGGGGAGGTTAGGGATGTCACTAAGATTGAAGAAGAAGAAGCTGCTGAGAAGCTTCCCGGAACTTGGATTCCTTTGAATTGCACTTCAAGGGGAGATCATCGTGAAAGCCAGCCACTAGGCATGAATCAAGATTCTTGGTGCTCCCACTCGCTTCTGTCAGAGCCTCCTGATATTGGGAACTGGTTTTCAAGTTATGTATACGAATCCCCTACATTGAATCCAAGCCAGGAATTTGGATATTGTGAGAGCAAGAAAACTACGTTGGTCCATAAAATAGAAGAGACTTCGGATAATGTTAGAAAAACCAAAAATGGAGGTGCAGGAGTACAGTTGAACCCGTTAAAATCCAATGGCAATTCCACGGGCAACAGACAGGATAATCAGTCCCCAAGCGAGAATTTGTTTTCTGAAAGAACCTCGGAGCAAGATCCAAAAGAGAAAACTGCGGGAACCAACGAAATTAGTCCAACGAAAGAAGTTCCAAACTCAAGCCTAACTACTGAAGATCCTCAATGTAAGTTACAAGAGAGAGTTTTGCAAGAGAATGGTTCTGTGCCACTGCATATAAATGGAAGTTTCAATGATGATAATAAGAAGCCTGCAACTCATACAAATTTGATCCATAAGATTGATCTCATGCTGGAAAGTTCTGAAACTAAAAGTGAGGTCCAACCACAACTTAATGGCTCATCAGCTGGAAATGATGTGCCTTCTGTCTTTACCAATGAACAGTCAATAAGAGAACCAATTGATCTAAGCCACAACAAACTCAAAGAAAACAAAGAGAAAGGAGTTTCAAATGTTGGTTTCATTACAGCAAACAAGCGTGGGTTTTCTGGAGCAAGTTGCAAGAAATCAGTGGAAATGCAAGAAAACTACAATAATAAAGAGGCAGGAAGTGCTTTTGCTTGTTTAAGAAGAAAACCATTGTCAGACAAAACCAATACTGAGCACTCTAATATCTTAGAGATAGTTGGGAAATGGAGTTGTCCTCAGAAGAGTAAGCCGAATCTTGGACCGCCATTGAAGCAGCTTCGACTTGAGCGATGGGTTCACAAGAAATAA

Protein sequence

MENSSFTDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVKEDEQTAGEVRDVTKIEEEEAAEKLPGTWIPLNCTSRGDHRESQPLGMNQDSWCSHSLLSEPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTTLVHKIEETSDNVRKTKNGGAGVQLNPLKSNGNSTGNRQDNQSPSENLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDPQCKLQERVLQENGSVPLHINGSFNDDNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSSAGNDVPSVFTNEQSIREPIDLSHNKLKENKEKGVSNVGFITANKRGFSGASCKKSVEMQENYNNKEAGSAFACLRRKPLSDKTNTEHSNILEIVGKWSCPQKSKPNLGPPLKQLRLERWVHKK
BLAST of Lsi07G012670.1 vs. TrEMBL
Match: A0A0A0L1G5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G017290 PE=4 SV=1)

HSP 1 Score: 650.6 bits (1677), Expect = 1.3e-183
Identity = 347/423 (82.03%), Postives = 368/423 (87.00%), Query Frame = 1

Query: 1   MENSSFTDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVK 60
           MENSSFTDSQV HSSFL SPSSEPPDI NWFSSYEYESPELDSNDNFGDS+SRE+EFEV 
Sbjct: 1   MENSSFTDSQVPHSSFLHSPSSEPPDIGNWFSSYEYESPELDSNDNFGDSVSREREFEVG 60

Query: 61  EDEQTAGEVRD-VTKIEEEEAAEKLPGTWIPLNCTSRGDHRESQPLGMNQDSWCSHSLLS 120
           EDEQT GE+RD VTKIEEEEAA++LPGTWIPL C SR DHRESQ LGMNQDSWCS SLLS
Sbjct: 61  EDEQTVGELRDNVTKIEEEEAAKELPGTWIPLKCNSREDHRESQLLGMNQDSWCSQSLLS 120

Query: 121 EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTTLVHKIEETSDNVRKTKNGGAGVQLNP 180
           EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKT L H+IEET DN      GG GVQLN 
Sbjct: 121 EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTGLGHEIEETLDN------GGEGVQLNL 180

Query: 181 LK-SNGNSTGNRQDNQSPS-ENLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDPQ 240
            + SNG+STGNRQDNQ PS +NLFSERTSEQDPKEKT GTN+ISPTKEVP S+LTTED Q
Sbjct: 181 FEISNGDSTGNRQDNQPPSKQNLFSERTSEQDPKEKTMGTNDISPTKEVPISNLTTEDLQ 240

Query: 241 CKLQERVLQENGSVPLHINGSFNDDNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSS 300
           CK QERVLQENG VPLH N S ND N KP THTNLIHKID +LE+SETKSEVQPQL GS 
Sbjct: 241 CKFQERVLQENGLVPLHKNRSSNDGNSKPPTHTNLIHKIDPILENSETKSEVQPQLKGS- 300

Query: 301 AGNDVPSVFTNEQSIREPIDLSHNKLKENKEKGVSNVGFITANKRGFSGASCKKSVEMQE 360
             N +P VF N   +REP DLSHNK  ENKE+GVSNVGFITANKRGFS A+CKKSVEMQE
Sbjct: 301 --NHMPCVFPN---VREPNDLSHNK--ENKERGVSNVGFITANKRGFSEATCKKSVEMQE 360

Query: 361 NYNNKEAGSAFACLRRKPLSDKTNTEHSNILEIVGKWSCPQKSKPNLGPPLKQLRLERWV 420
           NYNNKEAG AFACLRRK LSDKTNTEHSNI+E++GKWSCPQKSKPNLGPPLKQLRLERWV
Sbjct: 361 NYNNKEAGRAFACLRRKGLSDKTNTEHSNIIEVIGKWSCPQKSKPNLGPPLKQLRLERWV 409

BLAST of Lsi07G012670.1 vs. TrEMBL
Match: W9RPJ9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_024020 PE=4 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 6.8e-39
Identity = 153/458 (33.41%), Postives = 225/458 (49.13%), Query Frame = 1

Query: 7   TDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREK---EFEVKEDE 66
           +DSQ+ + S+  S  SEPP++ NWF SY+YESP LDS+DNF +S+  EK   + ++  D+
Sbjct: 19  SDSQIQNPSYSLSIPSEPPELGNWFPSYKYESPVLDSDDNFEESVFEEKVVRKEKILIDD 78

Query: 67  QTAGEVRDVTKI-EEEEAAEKLPGTWIPLNCTSRGDHRESQPLG-------MNQDSWCSH 126
              G+   + +  E+ E  E L G     N +S   + +  P         +  DS    
Sbjct: 79  CERGKEESLREFGEKGEKDEVLVGDDKGGNQSSSECNLDLSPFSCDLNGRELKGDSVVCL 138

Query: 127 SLLSEPPDIGNWFSSYVYESPTLNPSQEF--------------------------GYCES 186
             L EP D+ NWF SYVYESP L+ +  F                          GY + 
Sbjct: 139 PDLLEPIDVKNWFPSYVYESPALDTNDGFKDSLNKESECEEDRFIVDESNREKAEGYVKL 198

Query: 187 KKTTLVHKIEETSDNVRKTKN--GGAGVQLNPLKSNGNSTGNRQDNQSPSENLFSERTSE 246
           K+TT    + E+S+ +    N  G   ++   L  NG  +G  +   S   N+    T E
Sbjct: 199 KRTTKRDGV-ESSNGLANCGNYCGNNQLEKQLLNKNGRGSGEVKHILSDKSNMCFGSTLE 258

Query: 247 QDPKEKTAGTNEISPTKEVPNSSLTTEDPQCKLQERVLQENGSVPLHINGSFNDDNKKPA 306
           Q    +T     ++P KEV  SSL  E+PQC       +E G  PLH+ G    ++++  
Sbjct: 259 QCSSNETMRNPVLNPAKEVELSSLDQENPQC--VHSFSREIGIRPLHMQGRTCSNDRE-- 318

Query: 307 THTNLIHKIDLMLESSETKSEVQPQLNGS-SAGNDVPSVFTNEQSIREPIDLSHN-KLKE 366
               L  K+   +E+ +       Q+ G  S+ ND  S   NE + RE    +H  K KE
Sbjct: 319 ----LSQKLMCRMENIKDSEAKGRQVAGHLSSKNDRKSDLVNE-ATRES---THGFKDKE 378

Query: 367 NKEKGVSNVGFITANKRGFSGASCKKSVEMQENYNNK---EAGSAFACLRRKPLSDKTNT 421
           N  K +S  GF+T  K  +S A+ +   ++ +    +   ++G   + + RK LS++TN 
Sbjct: 379 NDGKEISKDGFVTTRKGRYSKANAENLQKIPKETGRQTVSQSGGEDSVVERKALSERTNF 438

BLAST of Lsi07G012670.1 vs. TrEMBL
Match: A0A061G4B1_THECC (ATP-dependent caseinolytic protease/crotonase family protein OS=Theobroma cacao GN=TCM_015894 PE=3 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 4.0e-31
Identity = 148/469 (31.56%), Postives = 220/469 (46.91%), Query Frame = 1

Query: 2   ENSSFTDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVKE 61
           E+ S   ++V  +    S  SEPPDIRNW+SSY YESP LD++D F   +SRE E E  +
Sbjct: 105 EHGSLNSNEVQDTLHSPSILSEPPDIRNWYSSYVYESPLLDTSDGFRSYVSRESECE--K 164

Query: 62  DEQTAGE-VRD-VTKIEEE-------EAAEKLPGTWIPLNCTSRGDHRESQPLGMNQDSW 121
           DE   GE ++D    + +E       +A+EK+  T + + C+S    R+++         
Sbjct: 165 DELAIGESIKDEAANLGQETKSSCKPDASEKICSTKL-VKCSSSLVDRKNE--------- 224

Query: 122 CSHSLLSEPPDIGNWFSSYVYESPTLNPSQEFG--------------YCESKKTTLVHKI 181
            SHSL S PPD+G WF+ YVYESP L+ S EF                 E +K     K+
Sbjct: 225 -SHSLFSGPPDLGFWFADYVYESPVLDTSDEFRDTLSEEREPNEDEFAVEERKREKQEKV 284

Query: 182 EETSDNVRKTKNGGAGVQLNP--LKSNGNSTGNRQDNQSPSENLFSERTSEQ-------- 241
             T+    + + G           K N +   + Q+N S S++L      E         
Sbjct: 285 NTTTKTRHRNEVGVVKKMCANEFRKCNSSLRNDEQENMSISKDLHCAGGKENLTWKGDLC 344

Query: 242 -----DP--KEKTAGTNEISPTKEVPNSSLTTEDPQCKLQERVLQENGSVPLHINGSFND 301
                DP  + K    + I+  K V NS     D   KL++   Q        I+ S   
Sbjct: 345 FEKILDPILEVKQVRGSTINSNKGVENSGFNGGDFLSKLEKADSQSTD-----ISRSAGK 404

Query: 302 DNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSSAGNDVPSVFTNEQS--IREPIDLS 361
            ++K +    LI+  D +  S ETK ++        A +D    F        R+P   S
Sbjct: 405 TDRKSSK--KLINTRDSIERSPETKVDL--------ASHDQSQDFDQVSGGYWRKPTHGS 464

Query: 362 HNKLKENKEKGVSNVGFITANKRGFSGASCKKS------VEMQENYN---NKEAGSAFAC 420
           ++K  EN+ K ++  GF+T +K  F+  + + S      V +Q + N   N   G   A 
Sbjct: 465 NDK--ENEGKDIAKNGFVTTSKNKFTRRNGENSLGGRREVVLQCSRNKSSNITGGQRGAV 524

BLAST of Lsi07G012670.1 vs. TrEMBL
Match: A0A067DBQ0_CITSI (Uncharacterized protein (Fragment) OS=Citrus sinensis GN=CISIN_1g0095801mg PE=4 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 4.0e-31
Identity = 142/481 (29.52%), Postives = 209/481 (43.45%), Query Frame = 1

Query: 2   ENSSFTDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVKE 61
           EN   T S+++ SS L S  SEP DI NWFSSY YES  LD+ND+  DS+S   E E + 
Sbjct: 58  ENHLIT-SEILDSSCLPSQVSEPADIGNWFSSYAYESFVLDTNDDVQDSVSEGSECEKEG 117

Query: 62  D---EQTAGEVRDVTKIEEEEAAEKLPGTWIP------------------LNCTSRGDHR 121
               E+  G+   +   E +  ++   G   P                   N   R    
Sbjct: 118 SLVGERHKGQENMIATREIDVKSDSSLGDDKPDEKPSIKTLLIYFHNYSDKNQLQRAKFF 177

Query: 122 ESQPLGMNQ--DSWCSHSLLSEPPDIGNWFSSYVYESPTLNPSQEFGYC----------- 181
            S  L + Q  DS    ++ +EPPD+ NWFSSY Y SP L+ S +F              
Sbjct: 178 ISNTLMLIQILDSMGPSTVFTEPPDVKNWFSSYAYGSPVLDTSDQFEDSLHLEMEPEKFV 237

Query: 182 -------ESKKTTLVHKIEETSDNVRKTKNGGAGVQLNP-------------LKSNGNST 241
                    +K T++ K+    + V + K   +  + N              +   G   
Sbjct: 238 VEDSDAETEEKLTIIRKVRSGDEKVDEEKERYSFARHNSSIEVCEQKQVCKRIDRIGGKR 297

Query: 242 GNRQDNQSPSENLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDPQCKLQERVLQE 301
            +   N+  S+ +F +       + K   +N  SP+K++   S   E    KL+ ++ QE
Sbjct: 298 SSSSQNKIQSDKIFKDIL-----EGKAQQSNVTSPSKDIRELSFDDEGSISKLELKLSQE 357

Query: 302 NGSVPLHINGSFNDDNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSSAGNDVPSVFT 361
             SV  ++N   +    KP   TN  H  D   +  E  S + P  N   A +   S   
Sbjct: 358 ACSVSWNLNRISSSKKDKP--QTNSFHTTDFKEKFMEEVSPLSPTSNSKLAQDCGAS--- 417

Query: 362 NEQSIREPIDLSHNKL-KENKEKGVSNVGFITANKRGFSGASCKKSVEMQENY------N 420
                  P   +H +  KEN +KG++  GF+   K G +  +   S++M          N
Sbjct: 418 -------PRKQTHRRNDKENYQKGIAENGFVATRKYGSTRTANADSLKMSPGILLECSRN 477

BLAST of Lsi07G012670.1 vs. TrEMBL
Match: A0A067D504_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g043652mg PE=4 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 2.6e-30
Identity = 131/424 (30.90%), Postives = 199/424 (46.93%), Query Frame = 1

Query: 23  EPPDIRNWFSSYEYESPELDSNDNFGDSISR----EKEFEVKEDEQTAGE---------V 82
           EP DI NWFSSY YES  LD+ND+  DS+S     EKE  +  +     E         V
Sbjct: 6   EPADIGNWFSSYAYESFVLDTNDDVQDSVSEGSECEKEGSLVGERHNGQENMAATREIDV 65

Query: 83  RDVTKIEEEEAAEKLPGTWIPLNCTS---RGDHRESQPLGMNQ--DSWCSHSLLSEPPDI 142
           +  + + ++E  EK   + I  +C S   R     S  L + Q  DS    ++ +EPPD+
Sbjct: 66  KSDSSLGDDEPDEK---SSIKNSCKSQLQRAKFLTSNTLMLIQILDSPDPSTVFTEPPDV 125

Query: 143 GNWFSSYVYESPTLNPSQEFGYCESKKTTLVHKIEETSDNVRKTKNGGAGVQLNPLKSNG 202
            NWFSSY +ESP L  S +F      +      + E SD   + K       +  ++S  
Sbjct: 126 KNWFSSYAHESPVLGTSDQFEDSLHLEMEPEKFVVEDSDLETEEKL----TIIRKVRSGD 185

Query: 203 NSTGNRQDNQSPSENLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDPQCKLQERV 262
                 +++ + + +  S    EQ  +    G    S ++   +S  T +D    L+ ++
Sbjct: 186 EKVDEEKEHYNFTRHNSSIEVREQKQRIDRIGGKRSSSSQNKIHSDKTFKD---ILEAKI 245

Query: 263 LQENGSVPLHINGSFNDDNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSSAGNDVPS 322
           + E  SV  ++NG  +    KP   TN  H  D   +  E  S + P  N   A      
Sbjct: 246 VPEACSVSWNLNGISSSKKDKP--QTNSFHTTDFKEKFMEEVSPLSPTSNSKLA------ 305

Query: 323 VFTNEQSIREPIDLSHNKL-KENKEKGVSNVGFITANKRGFSGASCKKSVEMQE------ 382
               +     P   +H +  KEN +KG++  GF+   K G +  +   S++M +      
Sbjct: 306 ----QDCGASPRKQTHRRNDKENYQKGIAENGFVATRKNGSTRTANADSLKMSQGILLEC 365

Query: 383 NYNNKEAGSAFACL--RRKPLSDKTNTEHSNILEIVGKWSCPQKSKPNLGPPLKQLRLER 420
           + NN+    A   +  RRK L + TN ++S+ LE+ GKW CPQK KP+LGPPLKQLRLER
Sbjct: 366 SRNNRSVSPAGEKVVDRRKALIETTNYQNSDALEVTGKWRCPQKGKPSLGPPLKQLRLER 407

BLAST of Lsi07G012670.1 vs. TAIR10
Match: AT4G16807.1 (AT4G16807.1 unknown protein)

HSP 1 Score: 118.6 bits (296), Expect = 9.1e-27
Identity = 129/420 (30.71%), Postives = 190/420 (45.24%), Query Frame = 1

Query: 18  DSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVKEDEQTAGEVRDVTKIEE 77
           +S  SEPPD+  WFSSY YESP LD++D  G  +S   E E  ++ QT  E     KIE 
Sbjct: 64  ESLPSEPPDLGKWFSSYVYESPLLDTSD--GLELSVPGESECVKETQTENES---PKIEG 123

Query: 78  EEAAEKLPGTWIPLNCTSRGDHRESQPLGMNQDSWCSHSLLSEPPDIGNWFSSYVYESPT 137
            +   +L    + ++ T   D  ESQ            S+LSEPPD+ NWFSSY Y+SP 
Sbjct: 124 NDVCPRLLEQEL-VSSTKVTDFSESQ------------SVLSEPPDLRNWFSSYEYQSPQ 183

Query: 138 LNPSQEFGYCESKKTTLVHKIEETSDNV-----RKTKNGGAGVQLNPLKSNGNSTGNRQD 197
           L+  QEFG+  S+K  L+ +  +T + +     RK K+    V L  L SN        D
Sbjct: 184 LSDIQEFGFLYSEKDELIIEESDTEEGISSGIFRKIKSRQETVGLGRLDSNDYKENIATD 243

Query: 198 NQSPSENLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDP-QCKLQERVLQENGSV 257
             +  E       S+Q  +++++     +  K+V   S   ++P  C+LQE   + +  V
Sbjct: 244 --TAKEVSLDNAVSDQKMEKRSSVRLLNASKKDVKQESSFKQEPLLCELQEEA-RFSPRV 303

Query: 258 PLHINGSFNDDNKKPATHTNLIHKI-------DLMLESSETKS-----EVQPQLNGSSAG 317
                  +N   K P+ +   +H++        + + S+  KS     E   + N     
Sbjct: 304 -----SRYNPKLKSPSKNDTSLHELRPIHIQESISMNSNRQKSPPIEQESYDKENVHGQS 363

Query: 318 NDVPSVFTNEQSIREPIDLSHNKLKENKEKGVSNVGFITANKRGFSGASCKKSVEMQENY 377
           N    V   + S RE  D    K K N E  V                S  K ++ +   
Sbjct: 364 NQTGFVTMKKASFREARDQCSLK-KPNTEVLVK--------------CSSSKELKTKAGE 423

Query: 378 NNKEAGSAFACLRRKPLSDKTNTEHSNILEIVGKWSCPQKSKPNLGPPLKQLRLERWVHK 420
           +N E        +R+ L + +N   S   EI GKW CPQK+K  + PPLKQLRL+ W+HK
Sbjct: 424 DNTEEREK----KRRVLGEMSNQLSSVAKEIAGKWRCPQKNKRKIVPPLKQLRLDAWIHK 438

BLAST of Lsi07G012670.1 vs. NCBI nr
Match: gi|778675280|ref|XP_011650381.1| (PREDICTED: uncharacterized protein LOC101212491 isoform X2 [Cucumis sativus])

HSP 1 Score: 654.8 bits (1688), Expect = 1.0e-184
Identity = 347/422 (82.23%), Postives = 368/422 (87.20%), Query Frame = 1

Query: 1   MENSSFTDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVK 60
           MENSSFTDSQV HSSFL SPSSEPPDI NWFSSYEYESPELDSNDNFGDS+SRE+EFEV 
Sbjct: 1   MENSSFTDSQVPHSSFLHSPSSEPPDIGNWFSSYEYESPELDSNDNFGDSVSREREFEVG 60

Query: 61  EDEQTAGEVRD-VTKIEEEEAAEKLPGTWIPLNCTSRGDHRESQPLGMNQDSWCSHSLLS 120
           EDEQT GE+RD VTKIEEEEAA++LPGTWIPL C SR DHRESQ LGMNQDSWCS SLLS
Sbjct: 61  EDEQTVGELRDNVTKIEEEEAAKELPGTWIPLKCNSREDHRESQLLGMNQDSWCSQSLLS 120

Query: 121 EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTTLVHKIEETSDNVRKTKNGGAGVQLNP 180
           EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKT L H+IEET DN      GG GVQLN 
Sbjct: 121 EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTGLGHEIEETLDN------GGEGVQLNL 180

Query: 181 LK-SNGNSTGNRQDNQSPSENLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDPQC 240
            + SNG+STGNRQDNQ PS+NLFSERTSEQDPKEKT GTN+ISPTKEVP S+LTTED QC
Sbjct: 181 FEISNGDSTGNRQDNQPPSKNLFSERTSEQDPKEKTMGTNDISPTKEVPISNLTTEDLQC 240

Query: 241 KLQERVLQENGSVPLHINGSFNDDNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSSA 300
           K QERVLQENG VPLH N S ND N KP THTNLIHKID +LE+SETKSEVQPQL GS  
Sbjct: 241 KFQERVLQENGLVPLHKNRSSNDGNSKPPTHTNLIHKIDPILENSETKSEVQPQLKGS-- 300

Query: 301 GNDVPSVFTNEQSIREPIDLSHNKLKENKEKGVSNVGFITANKRGFSGASCKKSVEMQEN 360
            N +P VF N   +REP DLSHN  KENKE+GVSNVGFITANKRGFS A+CKKSVEMQEN
Sbjct: 301 -NHMPCVFPN---VREPNDLSHN--KENKERGVSNVGFITANKRGFSEATCKKSVEMQEN 360

Query: 361 YNNKEAGSAFACLRRKPLSDKTNTEHSNILEIVGKWSCPQKSKPNLGPPLKQLRLERWVH 420
           YNNKEAG AFACLRRK LSDKTNTEHSNI+E++GKWSCPQKSKPNLGPPLKQLRLERWV 
Sbjct: 361 YNNKEAGRAFACLRRKGLSDKTNTEHSNIIEVIGKWSCPQKSKPNLGPPLKQLRLERWVC 408

BLAST of Lsi07G012670.1 vs. NCBI nr
Match: gi|449459052|ref|XP_004147260.1| (PREDICTED: uncharacterized protein LOC101212491 isoform X1 [Cucumis sativus])

HSP 1 Score: 650.6 bits (1677), Expect = 1.9e-183
Identity = 347/423 (82.03%), Postives = 368/423 (87.00%), Query Frame = 1

Query: 1   MENSSFTDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVK 60
           MENSSFTDSQV HSSFL SPSSEPPDI NWFSSYEYESPELDSNDNFGDS+SRE+EFEV 
Sbjct: 1   MENSSFTDSQVPHSSFLHSPSSEPPDIGNWFSSYEYESPELDSNDNFGDSVSREREFEVG 60

Query: 61  EDEQTAGEVRD-VTKIEEEEAAEKLPGTWIPLNCTSRGDHRESQPLGMNQDSWCSHSLLS 120
           EDEQT GE+RD VTKIEEEEAA++LPGTWIPL C SR DHRESQ LGMNQDSWCS SLLS
Sbjct: 61  EDEQTVGELRDNVTKIEEEEAAKELPGTWIPLKCNSREDHRESQLLGMNQDSWCSQSLLS 120

Query: 121 EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTTLVHKIEETSDNVRKTKNGGAGVQLNP 180
           EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKT L H+IEET DN      GG GVQLN 
Sbjct: 121 EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTGLGHEIEETLDN------GGEGVQLNL 180

Query: 181 LK-SNGNSTGNRQDNQSPS-ENLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDPQ 240
            + SNG+STGNRQDNQ PS +NLFSERTSEQDPKEKT GTN+ISPTKEVP S+LTTED Q
Sbjct: 181 FEISNGDSTGNRQDNQPPSKQNLFSERTSEQDPKEKTMGTNDISPTKEVPISNLTTEDLQ 240

Query: 241 CKLQERVLQENGSVPLHINGSFNDDNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSS 300
           CK QERVLQENG VPLH N S ND N KP THTNLIHKID +LE+SETKSEVQPQL GS 
Sbjct: 241 CKFQERVLQENGLVPLHKNRSSNDGNSKPPTHTNLIHKIDPILENSETKSEVQPQLKGS- 300

Query: 301 AGNDVPSVFTNEQSIREPIDLSHNKLKENKEKGVSNVGFITANKRGFSGASCKKSVEMQE 360
             N +P VF N   +REP DLSHNK  ENKE+GVSNVGFITANKRGFS A+CKKSVEMQE
Sbjct: 301 --NHMPCVFPN---VREPNDLSHNK--ENKERGVSNVGFITANKRGFSEATCKKSVEMQE 360

Query: 361 NYNNKEAGSAFACLRRKPLSDKTNTEHSNILEIVGKWSCPQKSKPNLGPPLKQLRLERWV 420
           NYNNKEAG AFACLRRK LSDKTNTEHSNI+E++GKWSCPQKSKPNLGPPLKQLRLERWV
Sbjct: 361 NYNNKEAGRAFACLRRKGLSDKTNTEHSNIIEVIGKWSCPQKSKPNLGPPLKQLRLERWV 409

BLAST of Lsi07G012670.1 vs. NCBI nr
Match: gi|659095856|ref|XP_008448799.1| (PREDICTED: uncharacterized protein LOC103490858 isoform X2 [Cucumis melo])

HSP 1 Score: 637.5 bits (1643), Expect = 1.6e-179
Identity = 342/422 (81.04%), Postives = 360/422 (85.31%), Query Frame = 1

Query: 1   MENSSFTDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVK 60
           MENSSFTDSQV+HSS   SPSSEPPDIRNWFSSYEYESPELDSNDNFGDS+SRE+EFEV+
Sbjct: 1   MENSSFTDSQVLHSSLSHSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSVSREREFEVE 60

Query: 61  EDEQTAGEVRD-VTKIEEEEAAEKLPGTWIPLNCTSRGDHRESQPLGMNQDSWCSHSLLS 120
           EDEQT GE RD VTKIEEEEAA++LPGTWIPL C S  D RESQ L MNQDSWCS SLLS
Sbjct: 61  EDEQTVGEFRDNVTKIEEEEAAKELPGTWIPLKCNSIEDRRESQLLSMNQDSWCSQSLLS 120

Query: 121 EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTTLVHKIEETSDNVRKTKNGGAGVQLNP 180
           EPPDIGNWFSSYVY SPTL+PSQEFGYCESKKT L H++EET DN      GG GVQLN 
Sbjct: 121 EPPDIGNWFSSYVYASPTLDPSQEFGYCESKKTGLGHEVEETLDN------GGEGVQLNL 180

Query: 181 LK-SNGNSTGNRQDNQSPSENLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDPQC 240
            + SN NSTGNRQDNQ PSENLFSERTSEQDPKEKT GTN ISPTKEVP SSLTTED QC
Sbjct: 181 FEISNDNSTGNRQDNQPPSENLFSERTSEQDPKEKTMGTNAISPTKEVPISSLTTEDLQC 240

Query: 241 KLQERVLQENGSVPLHINGSFNDDNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSSA 300
           KLQERVLQENGSVPLH N S ND N  P THTNLIHKID +LESSETKSEVQ QLNGS  
Sbjct: 241 KLQERVLQENGSVPLHKNRSSNDGNNNPPTHTNLIHKIDPILESSETKSEVQLQLNGS-- 300

Query: 301 GNDVPSVFTNEQSIREPIDLSHNKLKENKEKGVSNVGFITANKRGFSGASCKKSVEMQEN 360
            N +P VF N   +REP DLS N  KENKE+GVSN GFITANKRGFS A+ KKSVEMQEN
Sbjct: 301 -NHMPRVFPN---VREPNDLSDN--KENKERGVSNAGFITANKRGFSEATYKKSVEMQEN 360

Query: 361 YNNKEAGSAFACLRRKPLSDKTNTEHSNILEIVGKWSCPQKSKPNLGPPLKQLRLERWVH 420
           YNNKEAGSAFAC RRK LSDKTNTEHSNI E++GKWSCPQKSKPNLGPPLKQLRLERWVH
Sbjct: 361 YNNKEAGSAFACSRRKGLSDKTNTEHSNIYEVIGKWSCPQKSKPNLGPPLKQLRLERWVH 408

BLAST of Lsi07G012670.1 vs. NCBI nr
Match: gi|659095847|ref|XP_008448796.1| (PREDICTED: uncharacterized protein LOC103490858 isoform X1 [Cucumis melo])

HSP 1 Score: 632.9 bits (1631), Expect = 4.1e-178
Identity = 342/423 (80.85%), Postives = 360/423 (85.11%), Query Frame = 1

Query: 1   MENSSFTDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREKEFEVK 60
           MENSSFTDSQV+HSS   SPSSEPPDIRNWFSSYEYESPELDSNDNFGDS+SRE+EFEV+
Sbjct: 1   MENSSFTDSQVLHSSLSHSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSVSREREFEVE 60

Query: 61  EDEQTAGEVRD-VTKIEEEEAAEKLPGTWIPLNCTSRGDHRESQPLGMNQDSWCSHSLLS 120
           EDEQT GE RD VTKIEEEEAA++LPGTWIPL C S  D RESQ L MNQDSWCS SLLS
Sbjct: 61  EDEQTVGEFRDNVTKIEEEEAAKELPGTWIPLKCNSIEDRRESQLLSMNQDSWCSQSLLS 120

Query: 121 EPPDIGNWFSSYVYESPTLNPSQEFGYCESKKTTLVHKIEETSDNVRKTKNGGAGVQLNP 180
           EPPDIGNWFSSYVY SPTL+PSQEFGYCESKKT L H++EET DN      GG GVQLN 
Sbjct: 121 EPPDIGNWFSSYVYASPTLDPSQEFGYCESKKTGLGHEVEETLDN------GGEGVQLNL 180

Query: 181 LK-SNGNSTGNRQDNQSPSE-NLFSERTSEQDPKEKTAGTNEISPTKEVPNSSLTTEDPQ 240
            + SN NSTGNRQDNQ PSE NLFSERTSEQDPKEKT GTN ISPTKEVP SSLTTED Q
Sbjct: 181 FEISNDNSTGNRQDNQPPSEQNLFSERTSEQDPKEKTMGTNAISPTKEVPISSLTTEDLQ 240

Query: 241 CKLQERVLQENGSVPLHINGSFNDDNKKPATHTNLIHKIDLMLESSETKSEVQPQLNGSS 300
           CKLQERVLQENGSVPLH N S ND N  P THTNLIHKID +LESSETKSEVQ QLNGS 
Sbjct: 241 CKLQERVLQENGSVPLHKNRSSNDGNNNPPTHTNLIHKIDPILESSETKSEVQLQLNGS- 300

Query: 301 AGNDVPSVFTNEQSIREPIDLSHNKLKENKEKGVSNVGFITANKRGFSGASCKKSVEMQE 360
             N +P VF N   +REP DLS NK  ENKE+GVSN GFITANKRGFS A+ KKSVEMQE
Sbjct: 301 --NHMPRVFPN---VREPNDLSDNK--ENKERGVSNAGFITANKRGFSEATYKKSVEMQE 360

Query: 361 NYNNKEAGSAFACLRRKPLSDKTNTEHSNILEIVGKWSCPQKSKPNLGPPLKQLRLERWV 420
           NYNNKEAGSAFAC RRK LSDKTNTEHSNI E++GKWSCPQKSKPNLGPPLKQLRLERWV
Sbjct: 361 NYNNKEAGSAFACSRRKGLSDKTNTEHSNIYEVIGKWSCPQKSKPNLGPPLKQLRLERWV 409

BLAST of Lsi07G012670.1 vs. NCBI nr
Match: gi|703131185|ref|XP_010104820.1| (hypothetical protein L484_024020 [Morus notabilis])

HSP 1 Score: 169.9 bits (429), Expect = 9.7e-39
Identity = 153/458 (33.41%), Postives = 225/458 (49.13%), Query Frame = 1

Query: 7   TDSQVMHSSFLDSPSSEPPDIRNWFSSYEYESPELDSNDNFGDSISREK---EFEVKEDE 66
           +DSQ+ + S+  S  SEPP++ NWF SY+YESP LDS+DNF +S+  EK   + ++  D+
Sbjct: 19  SDSQIQNPSYSLSIPSEPPELGNWFPSYKYESPVLDSDDNFEESVFEEKVVRKEKILIDD 78

Query: 67  QTAGEVRDVTKI-EEEEAAEKLPGTWIPLNCTSRGDHRESQPLG-------MNQDSWCSH 126
              G+   + +  E+ E  E L G     N +S   + +  P         +  DS    
Sbjct: 79  CERGKEESLREFGEKGEKDEVLVGDDKGGNQSSSECNLDLSPFSCDLNGRELKGDSVVCL 138

Query: 127 SLLSEPPDIGNWFSSYVYESPTLNPSQEF--------------------------GYCES 186
             L EP D+ NWF SYVYESP L+ +  F                          GY + 
Sbjct: 139 PDLLEPIDVKNWFPSYVYESPALDTNDGFKDSLNKESECEEDRFIVDESNREKAEGYVKL 198

Query: 187 KKTTLVHKIEETSDNVRKTKN--GGAGVQLNPLKSNGNSTGNRQDNQSPSENLFSERTSE 246
           K+TT    + E+S+ +    N  G   ++   L  NG  +G  +   S   N+    T E
Sbjct: 199 KRTTKRDGV-ESSNGLANCGNYCGNNQLEKQLLNKNGRGSGEVKHILSDKSNMCFGSTLE 258

Query: 247 QDPKEKTAGTNEISPTKEVPNSSLTTEDPQCKLQERVLQENGSVPLHINGSFNDDNKKPA 306
           Q    +T     ++P KEV  SSL  E+PQC       +E G  PLH+ G    ++++  
Sbjct: 259 QCSSNETMRNPVLNPAKEVELSSLDQENPQC--VHSFSREIGIRPLHMQGRTCSNDRE-- 318

Query: 307 THTNLIHKIDLMLESSETKSEVQPQLNGS-SAGNDVPSVFTNEQSIREPIDLSHN-KLKE 366
               L  K+   +E+ +       Q+ G  S+ ND  S   NE + RE    +H  K KE
Sbjct: 319 ----LSQKLMCRMENIKDSEAKGRQVAGHLSSKNDRKSDLVNE-ATRES---THGFKDKE 378

Query: 367 NKEKGVSNVGFITANKRGFSGASCKKSVEMQENYNNK---EAGSAFACLRRKPLSDKTNT 421
           N  K +S  GF+T  K  +S A+ +   ++ +    +   ++G   + + RK LS++TN 
Sbjct: 379 NDGKEISKDGFVTTRKGRYSKANAENLQKIPKETGRQTVSQSGGEDSVVERKALSERTNF 438

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L1G5_CUCSA1.3e-18382.03Uncharacterized protein OS=Cucumis sativus GN=Csa_3G017290 PE=4 SV=1[more]
W9RPJ9_9ROSA6.8e-3933.41Uncharacterized protein OS=Morus notabilis GN=L484_024020 PE=4 SV=1[more]
A0A061G4B1_THECC4.0e-3131.56ATP-dependent caseinolytic protease/crotonase family protein OS=Theobroma cacao ... [more]
A0A067DBQ0_CITSI4.0e-3129.52Uncharacterized protein (Fragment) OS=Citrus sinensis GN=CISIN_1g0095801mg PE=4 ... [more]
A0A067D504_CITSI2.6e-3030.90Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g043652mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G16807.19.1e-2730.71 unknown protein[more]
Match NameE-valueIdentityDescription
gi|778675280|ref|XP_011650381.1|1.0e-18482.23PREDICTED: uncharacterized protein LOC101212491 isoform X2 [Cucumis sativus][more]
gi|449459052|ref|XP_004147260.1|1.9e-18382.03PREDICTED: uncharacterized protein LOC101212491 isoform X1 [Cucumis sativus][more]
gi|659095856|ref|XP_008448799.1|1.6e-17981.04PREDICTED: uncharacterized protein LOC103490858 isoform X2 [Cucumis melo][more]
gi|659095847|ref|XP_008448796.1|4.1e-17880.85PREDICTED: uncharacterized protein LOC103490858 isoform X1 [Cucumis melo][more]
gi|703131185|ref|XP_010104820.1|9.7e-3933.41hypothetical protein L484_024020 [Morus notabilis][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Lsi07G012670Lsi07G012670gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Lsi07G012670.1Lsi07G012670.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi07G012670.1.five_prime_UTR.1Lsi07G012670.1.five_prime_UTR.1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi07G012670.1.CDS.1Lsi07G012670.1.CDS.1CDS
Lsi07G012670.1.CDS.2Lsi07G012670.1.CDS.2CDS
Lsi07G012670.1.CDS.3Lsi07G012670.1.CDS.3CDS
Lsi07G012670.1.CDS.4Lsi07G012670.1.CDS.4CDS
Lsi07G012670.1.CDS.5Lsi07G012670.1.CDS.5CDS
Lsi07G012670.1.CDS.6Lsi07G012670.1.CDS.6CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lsi07G012670.1.three_prime_UTR.1Lsi07G012670.1.three_prime_UTR.1three_prime_UTR
Lsi07G012670.1.three_prime_UTR.2Lsi07G012670.1.three_prime_UTR.2three_prime_UTR


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR36368FAMILY NOT NAMEDcoord: 8..420
score: 3.8
NoneNo IPR availablePANTHERPTHR36368:SF1GENOMIC DNA, CHROMOSOME 3, TAC CLONE:K17E7coord: 8..420
score: 3.8