ClCG03G010490.1 (mRNA) Watermelon (Charleston Gray)

Name: ClCG03G010490.1
Type: mRNA
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Matrix metalloproteinase, putative
Location: CG_Chr03 : 18954307 .. 18955245 (+)
Sequence length: 939
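The sequence length can be cross-checked against the location coordinates. The sketch below assumes the interval is 1-based and inclusive at both ends, which the page does not state explicitly; the function name `feature_length` is illustrative only.

```python
# Coordinates copied from the Location field of this record.
# Assumption: the interval is 1-based and inclusive (GFF-style).
START, END = 18954307, 18955245


def feature_length(start: int, end: int) -> int:
    """Length of an inclusive genomic interval."""
    return end - start + 1


length = feature_length(START, END)
print(length)       # 939, matching the reported sequence length
print(length % 3)   # 0, so the span is a whole number of codons
```

Under that assumption the computed length agrees with the reported 939 nt.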
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAAATAAACTTTTGGTTCTTGTCTTCATCTCCATCTTCCTTCCTCTTTGCTTTTCACTTCCATTGCTCCAAGTTTCTCCATTTGCATTTCTCAATGACCTCCAAGGATGCAAGAAAGGTGATAATGTCAAAGGCATATCCAAGCTTAAGAACTTCTTCCGTTACTACGGTTATTTAAACCATCGAACCAATGCCACCGGTCATTTGATTGACGATGATGGGGACAACACCTTCGACGATCTCCTCGAGTCTGCCATTAAAACCTACCAACAATATTTCCATCTCAACCCCACTGGATCCCTGAATGCCGAAACGATATCCCAACTCGCAACACCTCGATGCGGTGTTCCAGATATCATCAATGGAACTATTGGTCGAATGTTATCAGAACACAGTGACAACGACGGTCACAATCACCATCACCACGTTCCCCATGCTGTCTCGCACTATGCTTTCTTCCCAGGAAGGCGTAGATGGCCATCTACCAAATACCGTCTCACATATGCATTTCTTCCTGGCACTCGTGCAGACGCAAAAGCACCGGTGGCTCGAGCATTCGCAACATGGGGTCGAAACACTCAGTTTAAGTTTTCTTTGACCACAAACTATAGAAGAGCTGACTTGAAGATTGGTTTCTATAGAGGGAACCATGGAGATGGGTACCCATTTGATGGGCCAGGGGGGACTTTGGCACATGCTTTTGCTCCAACTGATGGGAGATTTCATTATGATTCAACAGAGAAGTGGGGAGTTGGGGCAGTGAGAGGGCGATATGATTTGCAAACGGTGGCTTTGCATGAGATTGGACATCTTCTTGGGCTTGGGCATAGCACAGTTAAGAATGCTATAATGTATCCTTATATCAATCCTGGGACTACAAAGGGTTTGAATGCAGATGACATTAAAGGAATCAAGGTTTTATACAATAGACGCTAG

mRNA sequence

ATGGCAAATAAACTTTTGGTTCTTGTCTTCATCTCCATCTTCCTTCCTCTTTGCTTTTCACTTCCATTGCTCCAAGTTTCTCCATTTGCATTTCTCAATGACCTCCAAGGATGCAAGAAAGGTGATAATGTCAAAGGCATATCCAAGCTTAAGAACTTCTTCCGTTACTACGGTTATTTAAACCATCGAACCAATGCCACCGGTCATTTGATTGACGATGATGGGGACAACACCTTCGACGATCTCCTCGAGTCTGCCATTAAAACCTACCAACAATATTTCCATCTCAACCCCACTGGATCCCTGAATGCCGAAACGATATCCCAACTCGCAACACCTCGATGCGGTGTTCCAGATATCATCAATGGAACTATTGGTCGAATGTTATCAGAACACAGTGACAACGACGGTCACAATCACCATCACCACGTTCCCCATGCTGTCTCGCACTATGCTTTCTTCCCAGGAAGGCGTAGATGGCCATCTACCAAATACCGTCTCACATATGCATTTCTTCCTGGCACTCGTGCAGACGCAAAAGCACCGGTGGCTCGAGCATTCGCAACATGGGGTCGAAACACTCAGTTTAAGTTTTCTTTGACCACAAACTATAGAAGAGCTGACTTGAAGATTGGTTTCTATAGAGGGAACCATGGAGATGGGTACCCATTTGATGGGCCAGGGGGGACTTTGGCACATGCTTTTGCTCCAACTGATGGGAGATTTCATTATGATTCAACAGAGAAGTGGGGAGTTGGGGCAGTGAGAGGGCGATATGATTTGCAAACGGTGGCTTTGCATGAGATTGGACATCTTCTTGGGCTTGGGCATAGCACAGTTAAGAATGCTATAATGTATCCTTATATCAATCCTGGGACTACAAAGGGTTTGAATGCAGATGACATTAAAGGAATCAAGGTTTTATACAATAGACGCTAG

Coding sequence (CDS)

ATGGCAAATAAACTTTTGGTTCTTGTCTTCATCTCCATCTTCCTTCCTCTTTGCTTTTCACTTCCATTGCTCCAAGTTTCTCCATTTGCATTTCTCAATGACCTCCAAGGATGCAAGAAAGGTGATAATGTCAAAGGCATATCCAAGCTTAAGAACTTCTTCCGTTACTACGGTTATTTAAACCATCGAACCAATGCCACCGGTCATTTGATTGACGATGATGGGGACAACACCTTCGACGATCTCCTCGAGTCTGCCATTAAAACCTACCAACAATATTTCCATCTCAACCCCACTGGATCCCTGAATGCCGAAACGATATCCCAACTCGCAACACCTCGATGCGGTGTTCCAGATATCATCAATGGAACTATTGGTCGAATGTTATCAGAACACAGTGACAACGACGGTCACAATCACCATCACCACGTTCCCCATGCTGTCTCGCACTATGCTTTCTTCCCAGGAAGGCGTAGATGGCCATCTACCAAATACCGTCTCACATATGCATTTCTTCCTGGCACTCGTGCAGACGCAAAAGCACCGGTGGCTCGAGCATTCGCAACATGGGGTCGAAACACTCAGTTTAAGTTTTCTTTGACCACAAACTATAGAAGAGCTGACTTGAAGATTGGTTTCTATAGAGGGAACCATGGAGATGGGTACCCATTTGATGGGCCAGGGGGGACTTTGGCACATGCTTTTGCTCCAACTGATGGGAGATTTCATTATGATTCAACAGAGAAGTGGGGAGTTGGGGCAGTGAGAGGGCGATATGATTTGCAAACGGTGGCTTTGCATGAGATTGGACATCTTCTTGGGCTTGGGCATAGCACAGTTAAGAATGCTATAATGTATCCTTATATCAATCCTGGGACTACAAAGGGTTTGAATGCAGATGACATTAAAGGAATCAAGGTTTTATACAATAGACGCTAG

Protein sequence

MANKLLVLVFISIFLPLCFSLPLLQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADAKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYINPGTTKGLNADDIKGIKVLYNRR
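The protein sequence corresponds to the CDS translated in frame 1. As a minimal sketch, assuming the standard genetic code applies, the snippet below translates the first 30 and the last 12 nucleotides of the CDS shown above; the helper `translate` and the table construction are illustrative and are not the database's own pipeline.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGCAAATAAACTTTTGGTTCTTGTCTTC"  # first 30 nt of the CDS
cds_end = "AATAGACGCTAG"                      # last 12 nt of the CDS

print(translate(cds_start))  # MANKLLVLVF
print(translate(cds_end))    # NRR*
```

The two fragments reproduce the start (MANKLLVLVF) and the end (NRR, followed by the TAG stop codon) of the protein sequence listed above.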
BLAST of ClCG03G010490.1 vs. Swiss-Prot
Match: 2MMP_ARATH (Metalloendoproteinase 2-MMP OS=Arabidopsis thaliana GN=2MMP PE=1 SV=1)

HSP 1 Score: 234.2 bits (596), Expect = 2.0e-60
Identity = 124/287 (43.21%), Positives = 170/287 (59.23%), Query Frame = 1

Query: 33  NDLQGCKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQ 92
           ++  GC  G NV G+ ++K +F+ +GY+      +G+  DD     FDD+L++A++ YQ 
Sbjct: 45  SNFTGCHHGQNVDGLYRIKKYFQRFGYIPE--TFSGNFTDD-----FDDILKAAVELYQT 104

Query: 93  YFHLNPTGSLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYA 152
            F+LN TG L+A TI  +  PRCG PD++NGT           + +    H+ HAV  Y 
Sbjct: 105 NFNLNVTGELDALTIQHIVIPRCGNPDVVNGTSLMHGGRRKTFEVNFSRTHL-HAVKRYT 164

Query: 153 FFPGRRRWPSTKYRLTYAFLPGT--RADAKAPVARAFATWGRNTQFKFSLTTNYRRADLK 212
            FPG  RWP  +  LTYAF P      + K+  +RAF  W   T   F+L+ ++  +D+ 
Sbjct: 165 LFPGEPRWPRNRRDLTYAFDPKNPLTEEVKSVFSRAFGRWSDVTALNFTLSESFSTSDIT 224

Query: 213 IGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGVG-------AVRGRYDLQT 272
           IGFY G+HGDG PFDG  GTLAHAF+P  G+FH D+ E W V        +V    DL++
Sbjct: 225 IGFYTGDHGDGEPFDGVLGTLAHAFSPPSGKFHLDADENWVVSGDLDSFLSVTAAVDLES 284

Query: 273 VALHEIGHLLGLGHSTVKNAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           VA+HEIGHLLGLGHS+V+ +IMYP I  G  K  L  DD++GI+ LY
Sbjct: 285 VAVHEIGHLLGLGHSSVEESIMYPTITTGKRKVDLTNDDVEGIQYLY 323
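The percentages in each HSP header are the match counts divided by the alignment length (gaps included). A small sketch that reproduces the figures reported for the 2MMP_ARATH hit; the helper name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns, as a percentage rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)


# Counts taken from the HSP header of the 2MMP_ARATH match.
identity = percent(124, 287)   # identical residues / alignment length
positives = percent(170, 287)  # identical or conservatively substituted residues

print(identity)   # 43.21
print(positives)  # 59.23
```

The same calculation applies to every HSP header in this report.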

BLAST of ClCG03G010490.1 vs. Swiss-Prot
Match: 3MMP_ARATH (Metalloendoproteinase 3-MMP OS=Arabidopsis thaliana GN=3MMP PE=1 SV=1)

HSP 1 Score: 234.2 bits (596), Expect = 2.0e-60
Identity = 125/291 (42.96%), Positives = 165/291 (56.70%), Query Frame = 1

Query: 30  AFLNDLQGCKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKT 89
           +FLN   GC  G    G+  LK +F+++GY+   TN +G+  DD     FDD+L++A++ 
Sbjct: 45  SFLN-FTGCHAGKKYDGLYMLKQYFQHFGYITE-TNLSGNFTDD-----FDDILKNAVEM 104

Query: 90  YQQYFHLNPTGSLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVS 149
           YQ+ F LN TG L+  T+  +  PRCG PD++NGT        +            HAV 
Sbjct: 105 YQRNFQLNVTGVLDELTLKHVVIPRCGNPDVVNGTSTMHSGRKTFEVSFAGRGQRFHAVK 164

Query: 150 HYAFFPGRRRWPSTKYRLTYAFLP--GTRADAKAPVARAFATWGRNTQFKFSLTTNYRRA 209
           HY+FFPG  RWP  +  LTYAF P      + K+  +RAF  W   T   F+    +  +
Sbjct: 165 HYSFFPGEPRWPRNRRDLTYAFDPRNALTEEVKSVFSRAFTRWEEVTPLTFTRVERFSTS 224

Query: 210 DLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKW--------GVGAVRGRY 269
           D+ IGFY G HGDG PFDGP  TLAHAF+P  G FH D  E W        G  +V    
Sbjct: 225 DISIGFYSGEHGDGEPFDGPMRTLAHAFSPPTGHFHLDGEENWIVSGEGGDGFISVSEAV 284

Query: 270 DLQTVALHEIGHLLGLGHSTVKNAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           DL++VA+HEIGHLLGLGHS+V+ +IMYP I  G  K  L  DD++G++ LY
Sbjct: 285 DLESVAVHEIGHLLGLGHSSVEGSIMYPTIRTGRRKVDLTTDDVEGVQYLY 328

BLAST of ClCG03G010490.1 vs. Swiss-Prot
Match: 5MMP_ARATH (Metalloendoproteinase 5-MMP OS=Arabidopsis thaliana GN=5MMP PE=1 SV=1)

HSP 1 Score: 230.7 bits (587), Expect = 2.2e-59
Identity = 137/329 (41.64%), Positives = 182/329 (55.32%), Query Frame = 1

Query: 5   LLVLVFISIFLPLC--FSLPLLQVSPFAFLNDLQ----------GCKKGDNVKGISKLKN 64
           L +L+F     P+   F   +  + P  FLN  Q          GC  G+N+ G+SKLK 
Sbjct: 6   LTILIFFFTVNPISAKFYTNVSSIPPLQFLNATQNAWETFSKLAGCHIGENINGLSKLKQ 65

Query: 65  FFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLAT 124
           +FR +GY+      TG+  DD     FDD+L+SAI TYQ+ F+L  TG L++ T+ Q+  
Sbjct: 66  YFRRFGYIT----TTGNCTDD-----FDDVLQSAINTYQKNFNLKVTGKLDSSTLRQIVK 125

Query: 125 PRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRWPSTKYRLTYAFL 184
           PRCG PD+I+G         S+ +G      +      Y+FFPG+ RWP  K  LTYAF 
Sbjct: 126 PRCGNPDLIDGV--------SEMNGGK----ILRTTEKYSFFPGKPRWPKRKRDLTYAFA 185

Query: 185 PGTRA--DAKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGT 244
           P      + K   +RAF  W   T   F+ + +  RAD+ IGF+ G HGDG PFDG  GT
Sbjct: 186 PQNNLTDEVKRVFSRAFTRWAEVTPLNFTRSESILRADIVIGFFSGEHGDGEPFDGAMGT 245

Query: 245 LAHAFAPTDGRFHYDSTEKWGV--GAVRGR-------YDLQTVALHEIGHLLGLGHSTVK 304
           LAHA +P  G  H D  E W +  G +  R        DL++VA+HEIGHLLGLGHS+V+
Sbjct: 246 LAHASSPPTGMLHLDGDEDWLISNGEISRRILPVTTVVDLESVAVHEIGHLLGLGHSSVE 305

Query: 305 NAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           +AIM+P I+ G  K  L  DDI+GI+ LY
Sbjct: 306 DAIMFPAISGGDRKVELAKDDIEGIQHLY 313

BLAST of ClCG03G010490.1 vs. Swiss-Prot
Match: MEP1_SOYBN (Metalloendoproteinase 1 OS=Glycine max PE=1 SV=2)

HSP 1 Score: 224.9 bits (572), Expect = 1.2e-57
Identity = 128/278 (46.04%), Positives = 159/278 (57.19%), Query Frame = 1

Query: 41  GDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTG 100
           G N KG+S +KN+F + GY+ +  +          D+ FDD L SAIKTYQ+ ++LN TG
Sbjct: 48  GQNYKGLSNVKNYFHHLGYIPNAPHF---------DDNFDDTLVSAIKTYQKNYNLNVTG 107

Query: 101 SLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRW 160
             +  T+ Q+ TPRCGVPDII  T        + + G          +S Y FF    RW
Sbjct: 108 KFDINTLKQIMTPRCGVPDIIINT------NKTTSFG---------MISDYTFFKDMPRW 167

Query: 161 PSTKYRLTYAFLPGTRADA--KAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNH 220
            +   +LTYAF P  R D   K+ +ARAF+ W       F  TT+Y  A++KI F   NH
Sbjct: 168 QAGTTQLTYAFSPEPRLDDTFKSAIARAFSKWTPVVNIAFQETTSYETANIKILFASKNH 227

Query: 221 GDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGVGA------VRGRYDLQTVALHEIGHL 280
           GD YPFDGPGG L HAFAPTDGR H+D+ E W          V   +DL++VA+HEIGHL
Sbjct: 228 GDPYPFDGPGGILGHAFAPTDGRCHFDADEYWVASGDVTKSPVTSAFDLESVAVHEIGHL 287

Query: 281 LGLGHSTVKNAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           LGLGHS+   AIMYP I P T K  L  DDI GI+ LY
Sbjct: 288 LGLGHSSDLRAIMYPSIPPRTRKVNLAQDDIDGIRKLY 301

BLAST of ClCG03G010490.1 vs. Swiss-Prot
Match: 1MMP_ARATH (Metalloendoproteinase 1-MMP OS=Arabidopsis thaliana GN=1MMP PE=1 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 9.4e-47
Identity = 117/281 (41.64%), Positives = 150/281 (53.38%), Query Frame = 1

Query: 41  GDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTG 100
           G +V G+S+LK +   +GY+N      G  I  D    FD  LESAI  YQ+   L  TG
Sbjct: 64  GSHVSGVSELKRYLHRFGYVND-----GSEIFSD---VFDGPLESAISLYQENLGLPITG 123

Query: 101 SLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRW 160
            L+  T++ ++ PRCGV D  + TI        +ND         H  +HY +F G+ +W
Sbjct: 124 RLDTSTVTLMSLPRCGVSDT-HMTI--------NND-------FLHTTAHYTYFNGKPKW 183

Query: 161 PSTKYRLTYAFLPG------TRADAKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFY 220
              +  LTYA          T  D K    RAF+ W       F    ++  ADLKIGFY
Sbjct: 184 --NRDTLTYAISKTHKLDYLTSEDVKTVFRRAFSQWSSVIPVSFEEVDDFTTADLKIGFY 243

Query: 221 RGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGV-----GAVRGRYDLQTVALHEI 280
            G+HGDG PFDG  GTLAHAFAP +GR H D+ E W V     G+     DL++VA HEI
Sbjct: 244 AGDHGDGLPFDGVLGTLAHAFAPENGRLHLDAAETWIVDDDLKGSSEVAVDLESVATHEI 303

Query: 281 GHLLGLGHSTVKNAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           GHLLGLGHS+ ++A+MYP + P T K  L  DD+ G+  LY
Sbjct: 304 GHLLGLGHSSQESAVMYPSLRPRTKKVDLTVDDVAGVLKLY 318

BLAST of ClCG03G010490.1 vs. TrEMBL
Match: A0A0A0M0J6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G654860 PE=3 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 5.2e-161
Identity = 275/314 (87.58%), Positives = 287/314 (91.40%), Query Frame = 1

Query: 1   MANK--LLVLVFISIFLPLCFSLPLLQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYG 60
           M NK   ++L+ ISIFLPLCFSLPL+QVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYG
Sbjct: 1   MGNKPLAILLLIISIFLPLCFSLPLIQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYG 60

Query: 61  YLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVP 120
           YLNH+ NATGHLID D ++ FDD LESAIKTYQQYFHLNPTGSLNAET+SQLATPRCG P
Sbjct: 61  YLNHQINATGHLIDIDANDIFDDRLESAIKTYQQYFHLNPTGSLNAETLSQLATPRCGNP 120

Query: 121 DIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRAD 180
           DIIN T GRMLSE  DN   + HHH+PHAVSHYAFFPGR RWPSTKYRLTYAFLPGTRAD
Sbjct: 121 DIINETTGRMLSEDIDNVSSHDHHHLPHAVSHYAFFPGRLRWPSTKYRLTYAFLPGTRAD 180

Query: 181 AKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPT 240
           AKAPVARAFATW RNT FKF+L TNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPT
Sbjct: 181 AKAPVARAFATWARNTHFKFTLVTNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPT 240

Query: 241 DGRFHYDSTEKWGVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYINPGTTKGLN 300
           DGRFHYDSTEKW VGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYI  G+TKGLN
Sbjct: 241 DGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGSTKGLN 300

Query: 301 ADDIKGIKVLYNRR 313
            DDIKGIKVLYNRR
Sbjct: 301 VDDIKGIKVLYNRR 314

BLAST of ClCG03G010490.1 vs. TrEMBL
Match: A0A0A0M032_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G654840 PE=3 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 2.5e-139
Identity = 242/307 (78.83%), Positives = 264/307 (85.99%), Query Frame = 1

Query: 5   LLVLVFISIFLPLCFSLPLLQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGYLNHRT 64
           L +L+FISI LPLCFSLPL QVS  +FLNDLQG KKGDNVKGISKLKNFFR YGYLNH+ 
Sbjct: 6   LAILLFISI-LPLCFSLPLRQVSRLSFLNDLQGSKKGDNVKGISKLKNFFRRYGYLNHQI 65

Query: 65  NATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIINGT 124
           N TGHLID D D+TFDD  ESA+KTYQQYFHLN TGSLNAET+SQLATPRCG PDI+N  
Sbjct: 66  NVTGHLIDHDADDTFDDRFESAVKTYQQYFHLNSTGSLNAETLSQLATPRCGNPDILNEA 125

Query: 125 IGRMLSEHSDNDG-HNHHHHVPHAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADAKAPV 184
            GRML E+++ND  H+H+H + HAV HY+FFPGR RWP TKY LTY FLP T ADAKAPV
Sbjct: 126 TGRMLLENNNNDSSHDHYHQLSHAVPHYSFFPGRPRWPPTKYHLTYEFLPNTHADAKAPV 185

Query: 185 ARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFH 244
            RAFATW R+T FKFSL TN RRADLKIGFYRGNHGDGYPFDG GGTLAHAF PTDGR H
Sbjct: 186 TRAFATWARHTHFKFSLATNSRRADLKIGFYRGNHGDGYPFDGSGGTLAHAFTPTDGRVH 245

Query: 245 YDSTEKWGVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYINPGTTKGLNADDIK 304
           +DSTEKW VGAVRGR+DL+TVALHEIGHLLGLGHS VKNAIMYP I  G+TKGLNADDI+
Sbjct: 246 FDSTEKWVVGAVRGRFDLETVALHEIGHLLGLGHSRVKNAIMYPTIESGSTKGLNADDIE 305

Query: 305 GIKVLYN 311
           GI+VLYN
Sbjct: 306 GIEVLYN 311

BLAST of ClCG03G010490.1 vs. TrEMBL
Match: S1SMU3_THECC (Matrixin family protein OS=Theobroma cacao GN=TCM_044196 PE=3 SV=1)

HSP 1 Score: 348.6 bits (893), Expect = 7.9e-93
Identity = 187/320 (58.44%), Positives = 210/320 (65.62%), Query Frame = 1

Query: 3   NKLLVLVFISIF-LPLCFSLPLLQVS-----PFAFLNDLQGCKKGDNVKGISKLKNFFRY 62
           N +  L F ++  LPL F   L         PF FL  LQGC KGD VK I KLK +   
Sbjct: 4   NAISFLSFCTLLVLPLLFQATLADSKDKKPYPFDFLKHLQGCHKGDKVKDIRKLKKYLEQ 63

Query: 63  YGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCG 122
           +GYL++  N T H  DDD    FDDLLESAIKTYQ  FHLN  G+L+ ET+S++  PRCG
Sbjct: 64  FGYLSYSKNKT-HANDDD----FDDLLESAIKTYQLNFHLNSNGALDTETVSKMMMPRCG 123

Query: 123 VPDIINGTIGRMLSEHSDNDGHNHHHHVP-----HAVSHYAFFPGRRRWPSTKYRLTYAF 182
           V DIINGT G          G    H        H VSHYAFFP   RWP +K  LTYAF
Sbjct: 124 VADIINGTSGM-------RSGKKKPHRAAGSKSIHEVSHYAFFPRSPRWPPSKSHLTYAF 183

Query: 183 LPGTRADAKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGTL 242
           LPGTRADA  PVA AF TW  NT F+FS   NYR AD+ IGF R +HGDG PFDGPGGTL
Sbjct: 184 LPGTRADAVNPVAGAFQTWAANTHFRFSRIDNYRDADITIGFQRRDHGDGNPFDGPGGTL 243

Query: 243 AHAFAPTDGRFHYDSTEKWGVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYINP 302
           AHAFAPT GRFHYD+ E W V A  G   L+TVALHEIGHLLGLGHS+++NAIMYP I  
Sbjct: 244 AHAFAPTLGRFHYDADETWSVSARPGTMHLETVALHEIGHLLGLGHSSIENAIMYPSITA 303

Query: 303 GTTKGLNADDIKGIKVLYNR 312
           GT+KGL  DDI+GIK LYNR
Sbjct: 304 GTSKGLARDDIEGIKALYNR 311

BLAST of ClCG03G010490.1 vs. TrEMBL
Match: B9RUG6_RICCO (Metalloendoproteinase 1, putative OS=Ricinus communis GN=RCOM_0852690 PE=3 SV=1)

HSP 1 Score: 347.4 bits (890), Expect = 1.8e-92
Identity = 170/284 (59.86%), Positives = 208/284 (73.24%), Query Frame = 1

Query: 27  SPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESA 86
           S F FL  LQGC KGDN+KGI  LK +   +GYL+++  +  H  DDD    FDDLLE A
Sbjct: 36  SAFDFLKHLQGCHKGDNLKGIHDLKKYLENFGYLSYKNQS--HSNDDD----FDDLLEYA 95

Query: 87  IKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPH 146
           +KTYQ  +HLN TG L++ET++++  PRCGV DIINGT  RM S  S+ + H+H     H
Sbjct: 96  LKTYQFNYHLNVTGFLDSETVTKMMMPRCGVADIINGTT-RMQS--SNKNPHHHSSTSFH 155

Query: 147 AVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADAKAPVARAFATWGRNTQFKFSLTTNYRR 206
            VSHY FFPG  RWP++KY LTY FLPGT   A  PVA+AF TW  NT F+F+   +YR 
Sbjct: 156 TVSHYEFFPGNPRWPASKYHLTYGFLPGTPNQAMEPVAKAFQTWAANTHFRFTRVQDYRA 215

Query: 207 ADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGVGAVRGRYDLQTVAL 266
           AD+ IGF+RG+HGDG PFDG GGTLAHAFAP DGRFHYD  E W VGA +G +D++TVAL
Sbjct: 216 ADITIGFHRGDHGDGSPFDGRGGTLAHAFAPQDGRFHYDGDEHWAVGATQGAFDVETVAL 275

Query: 267 HEIGHLLGLGHSTVKNAIMYPYINPGTTKGLNADDIKGIKVLYN 311
           HEIGHLLGLGHS+V+ AIM+P I  G TKGL++DDI+GI+ LYN
Sbjct: 276 HEIGHLLGLGHSSVEGAIMHPSIQSGATKGLHSDDIQGIRALYN 310

BLAST of ClCG03G010490.1 vs. TrEMBL
Match: A0A061FPL5_THECC (Matrixin family protein OS=Theobroma cacao GN=TCM_044201 PE=3 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 6.7e-92
Identity = 184/320 (57.50%), Positives = 207/320 (64.69%), Query Frame = 1

Query: 3   NKLLVLVFISIF-LPLCFSLPLLQVS-----PFAFLNDLQGCKKGDNVKGISKLKNFFRY 62
           N +  L F ++  LPL F   L         PF FL  LQGC KGD VK I KLK +   
Sbjct: 4   NAISFLSFCTLLVLPLLFQATLADSKDKKPYPFDFLKHLQGCHKGDKVKDIRKLKKYLEQ 63

Query: 63  YGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCG 122
           +GYL++  N T        DN FDDLLESAIKTYQ  FHLN  G+L+ ET+S++  PRCG
Sbjct: 64  FGYLSYSKNKT-----HANDNDFDDLLESAIKTYQLNFHLNSNGALDTETVSKMMMPRCG 123

Query: 123 VPDIINGTIGRMLSEHSDNDGHNHHHHVP-----HAVSHYAFFPGRRRWPSTKYRLTYAF 182
           V DIINGT G          G    H        H VSHYAFFP   RWP +K  LTYAF
Sbjct: 124 VADIINGTSGM-------RSGKKKPHRAAGSKSIHEVSHYAFFPRSPRWPPSKSHLTYAF 183

Query: 183 LPGTRADAKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGTL 242
           LPGTRADA  PVA AF TW  NT F+FS   NYR AD+ IGF R +HGDG PFDGPGGTL
Sbjct: 184 LPGTRADAVNPVAGAFQTWAANTHFRFSRIDNYRDADITIGFQRRDHGDGNPFDGPGGTL 243

Query: 243 AHAFAPTDGRFHYDSTEKWGVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYINP 302
           AHAFAPT GRFHYD+ E W V A  G   L+TVALHEIGHLLGL HS+++NAIMYP I  
Sbjct: 244 AHAFAPTIGRFHYDADETWSVSARPGTMHLETVALHEIGHLLGLSHSSIENAIMYPSITA 303

Query: 303 GTTKGLNADDIKGIKVLYNR 312
           GT+KGL  DDI+GIK LYNR
Sbjct: 304 GTSKGLARDDIEGIKALYNR 311

BLAST of ClCG03G010490.1 vs. TAIR10
Match: AT1G70170.1 (AT1G70170.1 matrix metalloproteinase)

HSP 1 Score: 234.2 bits (596), Expect = 1.1e-61
Identity = 124/287 (43.21%), Positives = 170/287 (59.23%), Query Frame = 1

Query: 33  NDLQGCKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQ 92
           ++  GC  G NV G+ ++K +F+ +GY+      +G+  DD     FDD+L++A++ YQ 
Sbjct: 45  SNFTGCHHGQNVDGLYRIKKYFQRFGYIPE--TFSGNFTDD-----FDDILKAAVELYQT 104

Query: 93  YFHLNPTGSLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYA 152
            F+LN TG L+A TI  +  PRCG PD++NGT           + +    H+ HAV  Y 
Sbjct: 105 NFNLNVTGELDALTIQHIVIPRCGNPDVVNGTSLMHGGRRKTFEVNFSRTHL-HAVKRYT 164

Query: 153 FFPGRRRWPSTKYRLTYAFLPGT--RADAKAPVARAFATWGRNTQFKFSLTTNYRRADLK 212
            FPG  RWP  +  LTYAF P      + K+  +RAF  W   T   F+L+ ++  +D+ 
Sbjct: 165 LFPGEPRWPRNRRDLTYAFDPKNPLTEEVKSVFSRAFGRWSDVTALNFTLSESFSTSDIT 224

Query: 213 IGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGVG-------AVRGRYDLQT 272
           IGFY G+HGDG PFDG  GTLAHAF+P  G+FH D+ E W V        +V    DL++
Sbjct: 225 IGFYTGDHGDGEPFDGVLGTLAHAFSPPSGKFHLDADENWVVSGDLDSFLSVTAAVDLES 284

Query: 273 VALHEIGHLLGLGHSTVKNAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           VA+HEIGHLLGLGHS+V+ +IMYP I  G  K  L  DD++GI+ LY
Sbjct: 285 VAVHEIGHLLGLGHSSVEESIMYPTITTGKRKVDLTNDDVEGIQYLY 323

BLAST of ClCG03G010490.1 vs. TAIR10
Match: AT1G24140.1 (AT1G24140.1 Matrixin family protein)

HSP 1 Score: 234.2 bits (596), Expect = 1.1e-61
Identity = 125/291 (42.96%), Positives = 165/291 (56.70%), Query Frame = 1

Query: 30  AFLNDLQGCKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKT 89
           +FLN   GC  G    G+  LK +F+++GY+   TN +G+  DD     FDD+L++A++ 
Sbjct: 45  SFLN-FTGCHAGKKYDGLYMLKQYFQHFGYITE-TNLSGNFTDD-----FDDILKNAVEM 104

Query: 90  YQQYFHLNPTGSLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVS 149
           YQ+ F LN TG L+  T+  +  PRCG PD++NGT        +            HAV 
Sbjct: 105 YQRNFQLNVTGVLDELTLKHVVIPRCGNPDVVNGTSTMHSGRKTFEVSFAGRGQRFHAVK 164

Query: 150 HYAFFPGRRRWPSTKYRLTYAFLP--GTRADAKAPVARAFATWGRNTQFKFSLTTNYRRA 209
           HY+FFPG  RWP  +  LTYAF P      + K+  +RAF  W   T   F+    +  +
Sbjct: 165 HYSFFPGEPRWPRNRRDLTYAFDPRNALTEEVKSVFSRAFTRWEEVTPLTFTRVERFSTS 224

Query: 210 DLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKW--------GVGAVRGRY 269
           D+ IGFY G HGDG PFDGP  TLAHAF+P  G FH D  E W        G  +V    
Sbjct: 225 DISIGFYSGEHGDGEPFDGPMRTLAHAFSPPTGHFHLDGEENWIVSGEGGDGFISVSEAV 284

Query: 270 DLQTVALHEIGHLLGLGHSTVKNAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           DL++VA+HEIGHLLGLGHS+V+ +IMYP I  G  K  L  DD++G++ LY
Sbjct: 285 DLESVAVHEIGHLLGLGHSSVEGSIMYPTIRTGRRKVDLTTDDVEGVQYLY 328

BLAST of ClCG03G010490.1 vs. TAIR10
Match: AT1G59970.1 (AT1G59970.1 Matrixin family protein)

HSP 1 Score: 230.7 bits (587), Expect = 1.2e-60
Identity = 137/329 (41.64%), Positives = 182/329 (55.32%), Query Frame = 1

Query: 5   LLVLVFISIFLPLC--FSLPLLQVSPFAFLNDLQ----------GCKKGDNVKGISKLKN 64
           L +L+F     P+   F   +  + P  FLN  Q          GC  G+N+ G+SKLK 
Sbjct: 6   LTILIFFFTVNPISAKFYTNVSSIPPLQFLNATQNAWETFSKLAGCHIGENINGLSKLKQ 65

Query: 65  FFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLAT 124
           +FR +GY+      TG+  DD     FDD+L+SAI TYQ+ F+L  TG L++ T+ Q+  
Sbjct: 66  YFRRFGYIT----TTGNCTDD-----FDDVLQSAINTYQKNFNLKVTGKLDSSTLRQIVK 125

Query: 125 PRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRWPSTKYRLTYAFL 184
           PRCG PD+I+G         S+ +G      +      Y+FFPG+ RWP  K  LTYAF 
Sbjct: 126 PRCGNPDLIDGV--------SEMNGGK----ILRTTEKYSFFPGKPRWPKRKRDLTYAFA 185

Query: 185 PGTRA--DAKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGT 244
           P      + K   +RAF  W   T   F+ + +  RAD+ IGF+ G HGDG PFDG  GT
Sbjct: 186 PQNNLTDEVKRVFSRAFTRWAEVTPLNFTRSESILRADIVIGFFSGEHGDGEPFDGAMGT 245

Query: 245 LAHAFAPTDGRFHYDSTEKWGV--GAVRGR-------YDLQTVALHEIGHLLGLGHSTVK 304
           LAHA +P  G  H D  E W +  G +  R        DL++VA+HEIGHLLGLGHS+V+
Sbjct: 246 LAHASSPPTGMLHLDGDEDWLISNGEISRRILPVTTVVDLESVAVHEIGHLLGLGHSSVE 305

Query: 305 NAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           +AIM+P I+ G  K  L  DDI+GI+ LY
Sbjct: 306 DAIMFPAISGGDRKVELAKDDIEGIQHLY 313

BLAST of ClCG03G010490.1 vs. TAIR10
Match: AT4G16640.1 (AT4G16640.1 Matrixin family protein)

HSP 1 Score: 188.7 bits (478), Expect = 5.3e-48
Identity = 117/281 (41.64%), Positives = 150/281 (53.38%), Query Frame = 1

Query: 41  GDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTG 100
           G +V G+S+LK +   +GY+N      G  I  D    FD  LESAI  YQ+   L  TG
Sbjct: 64  GSHVSGVSELKRYLHRFGYVND-----GSEIFSD---VFDGPLESAISLYQENLGLPITG 123

Query: 101 SLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRW 160
            L+  T++ ++ PRCGV D  + TI        +ND         H  +HY +F G+ +W
Sbjct: 124 RLDTSTVTLMSLPRCGVSDT-HMTI--------NND-------FLHTTAHYTYFNGKPKW 183

Query: 161 PSTKYRLTYAFLPG------TRADAKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFY 220
              +  LTYA          T  D K    RAF+ W       F    ++  ADLKIGFY
Sbjct: 184 --NRDTLTYAISKTHKLDYLTSEDVKTVFRRAFSQWSSVIPVSFEEVDDFTTADLKIGFY 243

Query: 221 RGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGV-----GAVRGRYDLQTVALHEI 280
            G+HGDG PFDG  GTLAHAFAP +GR H D+ E W V     G+     DL++VA HEI
Sbjct: 244 AGDHGDGLPFDGVLGTLAHAFAPENGRLHLDAAETWIVDDDLKGSSEVAVDLESVATHEI 303

Query: 281 GHLLGLGHSTVKNAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           GHLLGLGHS+ ++A+MYP + P T K  L  DD+ G+  LY
Sbjct: 304 GHLLGLGHSSQESAVMYPSLRPRTKKVDLTVDDVAGVLKLY 318

BLAST of ClCG03G010490.1 vs. TAIR10
Match: AT2G45040.1 (AT2G45040.1 Matrixin family protein)

HSP 1 Score: 170.2 bits (430), Expect = 2.0e-42
Identity = 103/275 (37.45%), Positives = 133/275 (48.36%), Query Frame = 1

Query: 47  ISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAET 106
           I ++K   + YGYL     +             D   E A+  YQ+   L  TG  +++T
Sbjct: 50  IPEIKRHLQQYGYLPQNKESD------------DVSFEQALVRYQKNLGLPITGKPDSDT 109

Query: 107 ISQLATPRCGVPDIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRWP-STKY 166
           +SQ+  PRCG PD +                        H    Y +FPGR RW      
Sbjct: 110 LSQILLPRCGFPDDVEPKTAPF-----------------HTGKKYVYFPGRPRWTRDVPL 169

Query: 167 RLTYAFLPGTRADAKAPV------ARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHG 226
           +LTYAF         AP        RAF  W       F  T +Y  AD+KIGF+ G+HG
Sbjct: 170 KLTYAFSQENLTPYLAPTDIRRVFRRAFGKWASVIPVSFIETEDYVIADIKIGFFNGDHG 229

Query: 227 DGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGVGAVRGR----YDLQTVALHEIGHLLGL 286
           DG PFDG  G LAH F+P +GR H D  E W V     +     DL++VA+HEIGH+LGL
Sbjct: 230 DGEPFDGVLGVLAHTFSPENGRLHLDKAETWAVDFDEEKSSVAVDLESVAVHEIGHVLGL 289

Query: 287 GHSTVKNAIMYPYINPGTTK-GLNADDIKGIKVLY 310
           GHS+VK+A MYP + P + K  LN DD+ G++ LY
Sbjct: 290 GHSSVKDAAMYPTLKPRSKKVNLNMDDVVGVQSLY 295

BLAST of ClCG03G010490.1 vs. NCBI nr
Match: gi|659085906|ref|XP_008443669.1| (PREDICTED: metalloendoproteinase 1-like [Cucumis melo])

HSP 1 Score: 589.0 bits (1517), Expect = 5.0e-165
Identity = 279/313 (89.14%), Positives = 292/313 (93.29%), Query Frame = 1

Query: 1   MANKLL-VLVFISIFLPLCFSLPLLQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGY 60
           M NK L +L+FIS+FLPLCFSLPLLQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGY
Sbjct: 1   MGNKTLAILLFISVFLPLCFSLPLLQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGY 60

Query: 61  LNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPD 120
           LNH+ NATGHLID D D+TFDD LE AIKTYQQYFHLNPTGSLNAETISQLATPRCG PD
Sbjct: 61  LNHQINATGHLIDTDADDTFDDRLEFAIKTYQQYFHLNPTGSLNAETISQLATPRCGNPD 120

Query: 121 IINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADA 180
           IIN + GRMLSEH++ND  + HHH+PHAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADA
Sbjct: 121 IINESTGRMLSEHNNNDSSHDHHHLPHAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADA 180

Query: 181 KAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTD 240
           KAPV RAFATW RNT FKFSL TNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTD
Sbjct: 181 KAPVTRAFATWARNTHFKFSLITNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTD 240

Query: 241 GRFHYDSTEKWGVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYINPGTTKGLNA 300
           GRFHYDSTEKW VGAV+GRYDLQTVALHEIGHLLGLGHSTV+NAIMYPYI  G+TKGLNA
Sbjct: 241 GRFHYDSTEKWAVGAVKGRYDLQTVALHEIGHLLGLGHSTVRNAIMYPYIRSGSTKGLNA 300

Query: 301 DDIKGIKVLYNRR 313
           DDIKGIKVLYNRR
Sbjct: 301 DDIKGIKVLYNRR 313

BLAST of ClCG03G010490.1 vs. NCBI nr
Match: gi|449442791|ref|XP_004139164.1| (PREDICTED: metalloendoproteinase 1-like [Cucumis sativus])

HSP 1 Score: 575.1 bits (1481), Expect = 7.5e-161
Identity = 275/314 (87.58%), Positives = 287/314 (91.40%), Query Frame = 1

Query: 1   MANK--LLVLVFISIFLPLCFSLPLLQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYG 60
           M NK   ++L+ ISIFLPLCFSLPL+QVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYG
Sbjct: 1   MGNKPLAILLLIISIFLPLCFSLPLIQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYG 60

Query: 61  YLNHRTNATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVP 120
           YLNH+ NATGHLID D ++ FDD LESAIKTYQQYFHLNPTGSLNAET+SQLATPRCG P
Sbjct: 61  YLNHQINATGHLIDIDANDIFDDRLESAIKTYQQYFHLNPTGSLNAETLSQLATPRCGNP 120

Query: 121 DIINGTIGRMLSEHSDNDGHNHHHHVPHAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRAD 180
           DIIN T GRMLSE  DN   + HHH+PHAVSHYAFFPGR RWPSTKYRLTYAFLPGTRAD
Sbjct: 121 DIINETTGRMLSEDIDNVSSHDHHHLPHAVSHYAFFPGRLRWPSTKYRLTYAFLPGTRAD 180

Query: 181 AKAPVARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPT 240
           AKAPVARAFATW RNT FKF+L TNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPT
Sbjct: 181 AKAPVARAFATWARNTHFKFTLVTNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPT 240

Query: 241 DGRFHYDSTEKWGVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYINPGTTKGLN 300
           DGRFHYDSTEKW VGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYI  G+TKGLN
Sbjct: 241 DGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGSTKGLN 300

Query: 301 ADDIKGIKVLYNRR 313
            DDIKGIKVLYNRR
Sbjct: 301 VDDIKGIKVLYNRR 314

BLAST of ClCG03G010490.1 vs. NCBI nr
Match: gi|449442789|ref|XP_004139163.1| (PREDICTED: metalloendoproteinase 1-like [Cucumis sativus])

HSP 1 Score: 503.1 bits (1294), Expect = 3.6e-139
Identity = 242/307 (78.83%), Positives = 264/307 (85.99%), Query Frame = 1

Query: 5   LLVLVFISIFLPLCFSLPLLQVSPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGYLNHRT 64
           L +L+FISI LPLCFSLPL QVS  +FLNDLQG KKGDNVKGISKLKNFFR YGYLNH+ 
Sbjct: 6   LAILLFISI-LPLCFSLPLRQVSRLSFLNDLQGSKKGDNVKGISKLKNFFRRYGYLNHQI 65

Query: 65  NATGHLIDDDGDNTFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIINGT 124
           N TGHLID D D+TFDD  ESA+KTYQQYFHLN TGSLNAET+SQLATPRCG PDI+N  
Sbjct: 66  NVTGHLIDHDADDTFDDRFESAVKTYQQYFHLNSTGSLNAETLSQLATPRCGNPDILNEA 125

Query: 125 IGRMLSEHSDNDG-HNHHHHVPHAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADAKAPV 184
            GRML E+++ND  H+H+H + HAV HY+FFPGR RWP TKY LTY FLP T ADAKAPV
Sbjct: 126 TGRMLLENNNNDSSHDHYHQLSHAVPHYSFFPGRPRWPPTKYHLTYEFLPNTHADAKAPV 185

Query: 185 ARAFATWGRNTQFKFSLTTNYRRADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFH 244
            RAFATW R+T FKFSL TN RRADLKIGFYRGNHGDGYPFDG GGTLAHAF PTDGR H
Sbjct: 186 TRAFATWARHTHFKFSLATNSRRADLKIGFYRGNHGDGYPFDGSGGTLAHAFTPTDGRVH 245

Query: 245 YDSTEKWGVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYINPGTTKGLNADDIK 304
           +DSTEKW VGAVRGR+DL+TVALHEIGHLLGLGHS VKNAIMYP I  G+TKGLNADDI+
Sbjct: 246 FDSTEKWVVGAVRGRFDLETVALHEIGHLLGLGHSRVKNAIMYPTIESGSTKGLNADDIE 305

Query: 305 GIKVLYN 311
           GI+VLYN
Sbjct: 306 GIEVLYN 311

BLAST of ClCG03G010490.1 vs. NCBI nr
Match: gi|645249909|ref|XP_008230960.1| (PREDICTED: metalloendoproteinase 1-like [Prunus mume])

HSP 1 Score: 353.2 bits (905), Expect = 4.6e-94
Identity = 176/285 (61.75%), Positives = 206/285 (72.28%), Query Frame = 1

Query: 27  SPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESA 86
           SPF FL  L+GC KGD V+GI  LK +   +GYL+   N  GH  DDD    FDD LESA
Sbjct: 40  SPFEFLEHLKGCHKGDKVQGIQDLKKYLGKFGYLSSNNN--GHFNDDD----FDDQLESA 99

Query: 87  IKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGH-NHHHHVP 146
           IKTYQ  +HL  TG+L+AET+S +  PRCGV DIINGT     S  S    H +HHHH  
Sbjct: 100 IKTYQLNYHLKATGTLDAETVSNMMMPRCGVADIINGTS----SMRSGKQRHPHHHHHGG 159

Query: 147 HAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADAKAPVARAFATWGRNTQFKFSLTTNYR 206
           H V+HY FF G  +WP++KY LTYAFL GT A+A  PVARAF TW  NT F FS +   +
Sbjct: 160 HTVAHYTFFRGNPKWPASKYHLTYAFLQGTPAEATGPVARAFQTWAANTHFTFSQSN--Q 219

Query: 207 RADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGVGAVRGRYDLQTVA 266
             DL + F+RGNHGDG PFDGPGGT+AHAFAPT+GRFHYD+ E++ VGAV G YDL+TVA
Sbjct: 220 NPDLTVSFHRGNHGDGSPFDGPGGTIAHAFAPTNGRFHYDADERFSVGAVSGAYDLETVA 279

Query: 267 LHEIGHLLGLGHSTVKNAIMYPYINPGTTKGLNADDIKGIKVLYN 311
           LHEIGHLLGLGHS+V  AIMYP I+PG TKGL+ DDI+GIK LYN
Sbjct: 280 LHEIGHLLGLGHSSVLGAIMYPTISPGATKGLHGDDIQGIKALYN 312

BLAST of ClCG03G010490.1 vs. NCBI nr
Match: gi|645249271|ref|XP_008230675.1| (PREDICTED: metalloendoproteinase 1-like [Prunus mume])

HSP 1 Score: 353.2 bits (905), Expect = 4.6e-94
Identity = 176/285 (61.75%), Positives = 206/285 (72.28%), Query Frame = 1

Query: 27  SPFAFLNDLQGCKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLIDDDGDNTFDDLLESA 86
           SPF FL  L+GC KGD V+GI  LK +   +GYL+   N  GH  DDD    FDD LESA
Sbjct: 32  SPFEFLEHLKGCHKGDKVQGIQDLKKYLGKFGYLSSNNN--GHFNDDD----FDDQLESA 91

Query: 87  IKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIINGTIGRMLSEHSDNDGH-NHHHHVP 146
           IKTYQ  +HL  TG+L+AET+S +  PRCGV DIINGT     S  S    H +HHHH  
Sbjct: 92  IKTYQLNYHLKATGTLDAETVSNMMMPRCGVADIINGTS----SMRSGKQRHPHHHHHGG 151

Query: 147 HAVSHYAFFPGRRRWPSTKYRLTYAFLPGTRADAKAPVARAFATWGRNTQFKFSLTTNYR 206
           H V+HY FF G  +WP++KY LTYAFL GT A+A  PVARAF TW  NT F FS +   +
Sbjct: 152 HTVAHYTFFRGNPKWPASKYHLTYAFLQGTPAEATGPVARAFQTWAANTHFTFSQSN--Q 211

Query: 207 RADLKIGFYRGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWGVGAVRGRYDLQTVA 266
             DL + F+RGNHGDG PFDGPGGT+AHAFAPT+GRFHYD+ E++ VGAV G YDL+TVA
Sbjct: 212 NPDLTVSFHRGNHGDGSPFDGPGGTIAHAFAPTNGRFHYDADERFSVGAVSGAYDLETVA 271

Query: 267 LHEIGHLLGLGHSTVKNAIMYPYINPGTTKGLNADDIKGIKVLYN 311
           LHEIGHLLGLGHS+V  AIMYP I+PG TKGL+ DDI+GIK LYN
Sbjct: 272 LHEIGHLLGLGHSSVLGAIMYPTISPGATKGLHGDDIQGIKALYN 304

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name | E-value | Identity (%) | Description
2MMP_ARATH | 2.0e-60 | 43.21 | Metalloendoproteinase 2-MMP OS=Arabidopsis thaliana GN=2MMP PE=1 SV=1
3MMP_ARATH | 2.0e-60 | 42.96 | Metalloendoproteinase 3-MMP OS=Arabidopsis thaliana GN=3MMP PE=1 SV=1
5MMP_ARATH | 2.2e-59 | 41.64 | Metalloendoproteinase 5-MMP OS=Arabidopsis thaliana GN=5MMP PE=1 SV=1
MEP1_SOYBN | 1.2e-57 | 46.04 | Metalloendoproteinase 1 OS=Glycine max PE=1 SV=2
1MMP_ARATH | 9.4e-47 | 41.64 | Metalloendoproteinase 1-MMP OS=Arabidopsis thaliana GN=1MMP PE=1 SV=1

vs. TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0M0J6_CUCSA | 5.2e-161 | 87.58 | Uncharacterized protein OS=Cucumis sativus GN=Csa_1G654860 PE=3 SV=1
A0A0A0M032_CUCSA | 2.5e-139 | 78.83 | Uncharacterized protein OS=Cucumis sativus GN=Csa_1G654840 PE=3 SV=1
S1SMU3_THECC | 7.9e-93 | 58.44 | Matrixin family protein OS=Theobroma cacao GN=TCM_044196 PE=3 SV=1
B9RUG6_RICCO | 1.8e-92 | 59.86 | Metalloendoproteinase 1, putative OS=Ricinus communis GN=RCOM_0852690 PE=3 SV=1
A0A061FPL5_THECC | 6.7e-92 | 57.50 | Matrixin family protein OS=Theobroma cacao GN=TCM_044201 PE=3 SV=1

vs. TAIR10
Match Name | E-value | Identity (%) | Description
AT1G70170.1 | 1.1e-61 | 43.21 | matrix metalloproteinase
AT1G24140.1 | 1.1e-61 | 42.96 | Matrixin family protein
AT1G59970.1 | 1.2e-60 | 41.64 | Matrixin family protein
AT4G16640.1 | 5.3e-48 | 41.64 | Matrixin family protein
AT2G45040.1 | 2.0e-42 | 37.45 | Matrixin family protein

vs. NCBI nr
Match Name | E-value | Identity (%) | Description
gi|659085906|ref|XP_008443669.1| | 5.0e-165 | 89.14 | PREDICTED: metalloendoproteinase 1-like [Cucumis melo]
gi|449442791|ref|XP_004139164.1| | 7.5e-161 | 87.58 | PREDICTED: metalloendoproteinase 1-like [Cucumis sativus]
gi|449442789|ref|XP_004139163.1| | 3.6e-139 | 78.83 | PREDICTED: metalloendoproteinase 1-like [Cucumis sativus]
gi|645249909|ref|XP_008230960.1| | 4.6e-94 | 61.75 | PREDICTED: metalloendoproteinase 1-like [Prunus mume]
gi|645249271|ref|XP_008230675.1| | 4.6e-94 | 61.75 | PREDICTED: metalloendoproteinase 1-like [Prunus mume]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term | Definition
IPR001818 | Pept_M10_metallopeptidase
IPR002477 | Peptidoglycan-bd-like
IPR006026 | Peptidase_Metallo
IPR021158 | Pept_M10A_Zn_BS
IPR021190 | Pept_M10A
IPR024079 | MetalloPept_cat_dom_sf
Vocabulary: Molecular Function
Term | Definition
GO:0004222 | metalloendopeptidase activity
GO:0008270 | zinc ion binding
GO:0008237 | metallopeptidase activity
Vocabulary: Biological Process
Term | Definition
GO:0006508 | proteolysis
Vocabulary: Cellular Component
Term | Definition
GO:0031012 | extracellular matrix
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0031012 extracellular matrix
molecular_function GO:0004222 metalloendopeptidase activity
molecular_function GO:0008237 metallopeptidase activity
molecular_function GO:0008270 zinc ion binding

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
ClCG03G010490 | ClCG03G010490 | gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name | Unique Name | Type
ClCG03G010490.1 | ClCG03G010490.1-protein | polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
ClCG03G010490.1.cds1 | ClCG03G010490.1.cds1 | CDS


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001818 | Peptidase M10, metallopeptidase | PFAM | PF00413 | Peptidase_M10 | coord: 164..309, score: 1.0
IPR002477 | Peptidoglycan binding-like | GENE3D | G3DSA:1.10.101.10 | | coord: 47..112, score: 1.0
IPR002477 | Peptidoglycan binding-like | PFAM | PF01471 | PG_binding_1 | coord: 48..110, score: 2.
IPR002477 | Peptidoglycan binding-like | unknown | SSF47090 | PGBD-like | coord: 38..120, score: 7.38
IPR006026 | Peptidase, metallopeptidase | SMART | SM00235 | col_5 | coord: 156..311, score: 1.3
IPR021158 | Peptidase M10A, cysteine switch, zinc binding site | PROSITE | PS00546 | CYSTEINE_SWITCH | coord: 113..120, scor
IPR021190 | Peptidase M10A | PRINTS | PR00138 | MATRIXIN | coord: 110..123, score: 1.6E-36; coord: 183..198, score: 1.6E-36; coord: 207..235, score: 1.6E-36; coord: 264..289, score: 1.6E-36; coord: 297..310, score: 1.6
IPR024079 | Metallopeptidase, catalytic domain | GENE3D | G3DSA:3.40.390.10 | | coord: 113..310, score: 5.8
None | No IPR available | PANTHER | PTHR10201 | MATRIX METALLOPROTEINASE | coord: 146..309, score: 1.6E-124; coord: 1..123, score: 1.6E
None | No IPR available | PANTHER | PTHR10201:SF140 | F20P5.11 PROTEIN | coord: 1..123, score: 1.6E-124; coord: 146..309, score: 1.6E
None | No IPR available | unknown | SSF55486 | Metalloproteases ("zincins"), catalytic domain | coord: 153..310, score: 1.39