Cp4.1LG07g08910 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG07g08910
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionSerine protease htra2, putative
LocationCp4.1LG07 : 8030896 .. 8036747 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGACCCAATACTGTTTGCCTCCTCGTCGGCGTTAACCGGAATTTAAGGGTTCAGGCGGCGATTTGGAAAACAAGTTTCGGACAAACAACTACGAGGGAATTTCCGAGCAAAAGACAACGCCAATGATCCCGCTTCTTGTTGTATTTAGTTGAATCACTCTTCAATCCTCAATCGATATCAGTTTTTTTGTTCTTTTTAATTGTTCATTTTATGTATTATTGTTTATTGATGAATCTGATTTTTGCAGAGAAAGGTTGCGAGCTCACGCAATACGCTCGGACGGATCGCCGCAATTGCTGCTGCTGGTTCTTGTATCTGGTATGCCGGAAGTAAATTAGATTCTGGTGAGTTTTTGAGCTCTTTTAATGGCTTTCAAGATATTGATGAATGAAAATGAATACTCAGGAGAGTTTTTTTTGCTGTATGTCTGATGAATCGAAAACGTAGATGAATGTAAATTCTGTACTGGTCAATATCTTGAGAATTTTCTATTCATTGGTCAACACACTGCCATTCTTTCAGCTTACTTTTGCCAGTCTTTTGTATGTTCTTCACTGAAGGATCCTCTGTAGTGTTGTCAATTCCTGCTGCTTTGAGTGAGCCACTGTTCCTTCCATGGCAGACCGCGCACGGCTTCACGATTCATCCTTCGGGTGCATTTGATCACCAGAAATTGGGTCTGTCTGTTTGTTTATTACTTTCTTGATTGTAGGCAAGTTTTTTTATACATAGATTCTGTTTCAACTGGAGGACTTGTGTGAAATTGTAGGTCTTTCATTTTGTTCTTCAAGAGTCAGTCCTGCTCCACCATCTGGTGTGGAGAAGGAAAAGCCTGGAGATACGCAGAAGCCTTGTCCAAGATGTTTGGATAGAGATACAATTGCAAATGCTGCAGCAGATGTTGGCCCTGCTGTTGTAAATATTTCTGTTTCACATGGTATTGGTAATTGTTTTGTTATTTTTTCTGTGAGTTCTGTGAAAAAAGTTGTAAGGGATTGTTTTCTTAGGTATTTATGGAATTGCTACTGCTAAAAGCATGGGATCCGGAACAATTATTGACAAGGATGGTACTATTTTAACATGTGCCCATGTCGTGACGGATTTTCATGGTCCACGAGCTGCATCCAAAGGAAAGGCAAGTAGATGTATCAGCCTAGTTTTTAACGCATATGTATTGTGTGTGTCCAACACATGTTGGAACAGGAACCCTCTTAACAAACTAATGTGTCCCTTTCTTAGATTTATGTTGATGCGTTGTTTGCAATTATGTGTTTGATACTACTTTTCCAATGGCAAATATCTTTCTTGCTGTTCAAGCTATTAATATGCTACGAATGTTCTTATACAGCTCTTTTCAAGTTTCATATTTGGTCCATAGTTTCCAGTTTTTAAGATTGCTTTTGAGATGTCAAGATCTATTAGTTGGTTTAAATGTCCCTTCCATTATTTATTTATTTGTTTATTTCATGCGTGGAATGGCACAGGTAGAGGTTACTTTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCCATGGCAAAGCTTGGTTCTTCAAGCAAGCTCCGACCAGGGGATTGGGTTGTAGCAATTGGGTGTCCACTTTCGCTTCAGAATACTGTCACAGCTGGTATAGTAAGGTGGTCAATTATAGGAAGTTTCTCTGTCCTTGATGTTTCCATGGAAGAGATAGGAACATATAATTCCTTCATATTTTTGATACTGAAGAGATGAATGATACAAAGTCTTTTCCTCAATTACTTACTTTAAGTGTTATCTATAGTTGTTGACTAAGAGGCACGAATACAGACATTTGACATGGTTAGATATGATATAGACACGGTGCCATGTCATACTTCTAAAAATCTAGGACTTGAAACGACAAGTAGTAAACCATATTTTTTCTAAAAATGTCATTTTTATATAAGAATTTTGTTTTGAAGTCAATATGTTTATGCATTTATTTGCTTAAAAAATGAGTTTGATGTATTTCACGTATCAGGCATTTCTATTTTTATCTATTTAGTATGCTCAAACAAGTGTCCTATATGTGCCTAATAGATGTTTGGTCCTTATTCACTTTATACCACAAGTGTTTCACACATGCATTTGTACTACCATACATGCAATACATGTTTAAGAAGTGTCCTTGCTTAGCATTAATCATGGTCCACGAGCTGCATCCAAAGGAAAGGCAAGTAGATGTATCAGCCTAGTTTTTAACGCATATGTATTGTGTGTGTCCAACACATGTTGGAACAGGAACCCTCTTAACAAACTAATGTGTCCCTTTCTTAGATTTATGTTGATGCGTTGTTTGCAATTATGTGTTTGATACTACTTTTCCAATGGCAAATATCTTTCTTGCTGTTCAAGCTATTAATATGCTACGAATGTTCTTATACAGCTCTTTTCAAGTTTCATATTTGGTCCATAGTTTCCAGTTTTTAAGATTGCTTTTGAGATGTCAAGATCTATTAGTTGGTTTAAATGTCCCTTCCATTATTTATTTATTTGTTTATTTCATGCGTGGAATGGCACAGGTAGAGGTTACTTTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCCATGGCAAAGCTTGGTTCTTCAAGCAAGCTCCGACCAGGGGATTGGGTTGTAGCAATTGGGTGTCCACTTTCGCTTCAGAATACTGTCACAGCTGGTATAGTAAGGTGGTCAATTATAGGAAGTTTCTCTGTCCTTGATGTTTCCATGGAAGAGATAGGAACATATAATTCCTTCATATTTTTGATACTGAAGAGATGAATGATACAAAGTCTTTTCCTCAATTACTTACTTTAAGTGTTATCTATAGTTGTTGACTAAGAGGCACGAATACAGACATTTGACATGGTTAGATATGATATAGACACGGTGCCATGTCATACTTCTAAAAATCTAGGACTTGAAACGACAAGTAGTAAACCATATTTTTTCTAAAAATGTCATTTTTATATAAGAATTTTGTTTTGAAGTCAATATGTTTATGCATTTATTTGCTTAAAAAATGAGTTTGATGTATTTCACGTATCAGGCATTTCTATTTTTATCTATTTAGTATGCTCAAACAAGTGTCCTATATGTGCCTAATAGATGTTTGGTCCTTATTCACTTTATACCACAAGTGTTTCACACATGCATTTGTACTACCATACATGCAATACATGTTTAAGAAGTGTCCTTGCTTAGCATTAACTATATATTTGATTGTCTACTTTTCGTCTTTGATTCAATTCAGTTGTGTTGACCGTAAGAGTAGTGATTTGGGTCTTGGTGGAATGCGAAGGGAATATCTACAAACAGATTGTGCAATTAACGTGGTATGTATCTCATTTTTTACCCATCTCCCTTTGCAATATCGAACGTGCTTCATTGTTGTTAAAATGAATACGAAACACTGGCATTCTGTAATCAGTATTTGTCATGTAGGGAAATTCTGGGGGTCCTCTTGTTAATGTGGATGGAGAAGTTATTGGTGTAAATATTATGAAAGTGGATGATGCTGTTGGATTAAGTTTCGCTGTACCAATTGATTCAGTCTCCAAAATTACAGAGCAATTCAAGAAAAGAGGGTATTGTAACACATTTAGAGACTAATATTGTTTTACTAGTGAATTGAATTGAATATACTCCATTTGTAACAACCCATGCCCACCACTAGCAGATATTGTCCTCTTTAGGCTTTCCCTTTCGGGCTTCCCCTCAAGGCTTTAAAACGCGTCTGCTAGGGGAAGGTCTCCACACCCTTATAAATGGTGGTTTGTTCTCCTCTCCAACCAATGTGGGACATCACAATCCACCCCCCTTCGGGGCCCAGCGTCCTCGCTGGCACTCTTTCCTTCCTCCAATCGATGTGGGACCGCCCCCAAATCTACCCCCCTTTGTGGCCCAGCGTCCTTACCGGCACACTGCCTCGTGTCTACCCCCTTCGGGGAACAGCGAGAAGGCTGGCACATCGTCCGGTGTCTGGCTCTGATACCATTTGTAACAACCCAGACCCACCGCTAGCAGATATGGTCCTCTTTGGGCTTTCCCTTTCGGGCTTCCCCTCAAGGCTTTAAAACGCGTATGCTACGGGAAGGTTCCCACACCCTTATAAATGGTGGTTTGTTCTCCTCCCTAACCAACGTGGGACATCATACCATTCATGTGTAAAATAGATTTATTTCACCATTGTTTTTATTTTGCATTCAAGGGTTGAATGAAATGGCTTTATTTTTATTTTCCAATGATGGTTATAATACAAAGTGTGATGATGCAGGAGAGTTATTCGGCCTTGGCTTGGATTGAAAATGATCGATCTCAATGAAATGATAATCGAACAACTTAAAGAAAGAGATGCATCTTTTCCAGACGTTACTAAAGGGGTTCTTGTAGCTATGGTAATAATAATACCTTTTTGCATTACATTCTTCCTTTTCTATCTTTTTCTTCTTATTTTATGCATTAAATTTGGCATGAGCGAATAGATTTAGCAAATACTTCAATAGGTAACTCCTGGATCCCCTGCTGGTCGTGCTGGGTTCCGTCCTGGTGATGTCGTCATCGAGTTCGATAAGCAACCTGTTGCCAGTATCCAAGAGGTATTTGGATCTACTTTTAGTTCTTAATTCTTACCAATGGTATTGCCAGCATACTGTTATCTTGTGAATTCTTTGGATAGAAAAGTTCATTCTTGCTTGAATCCTAATGTTTGTAAATAAAAACCACTTCTAGAGTTCTTAGAATTGTACTCTTAAATTTGTGCAAATACCTCAAAATTATGTGCTTAGATTCATTGGATTGCAACTATTTCTAATATTCGAGGGAAAAAAACTCCCAAACTTGGTGATTTTTTTATAATTTTTTTTGCAATTTCTTGTTATTACATGGTGTGTAAATTTCTGTTTTCTGGAGTTCGAGTAACCTTCTCTACTGTTGCGAGACTAGGGATCGTTTTTCCCCCCTCAACAGGAGTCCACTCCTGTCGAGATGGGGAAGGGGAGGGCACGGGGGGAGTTTTCTCCGTCGTCTAAATAAGGATGGGGCAGGGAATGTTGTTGTCGTTGTCGTTGTCGTTGTCGTTGTCATTGTCGTCTTTATCACCCTAGGCAAGGATTATGTATCTGACAGTTCGAGTTTTTGGCAGATCATTGAAATTATGGGAGATAGAGTTGGGATTCCATTGAAGGCAGTTGTGAAACGATCTCTTAATAGCATCATCACTTTGACTGTTCTTCCTGAGGAGTCCAATCCAGATATGTGATACTATACACTCACTACACAATCAATTCAATTCTTTCATATTTCCCATCATTTGGTTATGGCTTAGGGTTAGAACTCGATACCTTAACCTTCTCGTGTAAAGTTATATAGGATGTTTAAGTTTAGTTTGTGAACTTTTAAGCTTGTCTATTTGGTCGTTGAACTTGAATTTGACTAGAAATAGTCGTAGTTTTAAACTTTTTTTTTTTTTTTCATTTAAATTTTTTATCTTAAGAAGACCAAAAAATGTGAGAAATTTTGGAGATCGAAAAAATGAAAAAAGACTTTCTTAGTTAGAATCCTGCCTAGGGAAGTAACTGCGTATTAGACGTATACATAGGTAAAGCCGTGTGCAATGAAAGATGCAAGCCCGGGTTGGGGAGAGATTTTTTTTACTTAGTTTTTTATAAAAATAAGTTATCTACTCCATCTGACTAGTTTTGGGTTCGAATCCCGGACAACCCATAT

mRNA sequence

TGGACCCAATACTGTTTGCCTCCTCGTCGGCGTTAACCGGAATTTAAGGGTTCAGGCGGCGATTTGGAAAACAAGTTTCGGACAAACAACTACGAGGGAATTTCCGAGCAAAAGACAACGCCAATGATCCCGCTTCTTGTTAGAAAGGTTGCGAGCTCACGCAATACGCTCGGACGGATCGCCGCAATTGCTGCTGCTGGTTCTTGTATCTGGTATGCCGGAAGTAAATTAGATTCTGGATCCTCTGTAGTGTTGTCAATTCCTGCTGCTTTGAGTGAGCCACTGTTCCTTCCATGGCAGACCGCGCACGGCTTCACGATTCATCCTTCGGGTGCATTTGATCACCAGAAATTGGGTCTTTCATTTTGTTCTTCAAGAGTCAGTCCTGCTCCACCATCTGGTGTGGAGAAGGAAAAGCCTGGAGATACGCAGAAGCCTTGTCCAAGATGTTTGGATAGAGATACAATTGCAAATGCTGCAGCAGATGTTGGCCCTGCTGTTGTAAATATTTCTGTTTCACATGGTATTTATGGAATTGCTACTGCTAAAAGCATGGGATCCGGAACAATTATTGACAAGGATGGTACTATTTTAACATGTGCCCATGTCGTGACGGATTTTCATGGTCCACGAGCTGCATCCAAAGGAAAGGCAAGTAGATGTATCAGCCTAGTTTTTAACGCATATGTAGAGGTTACTTTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCCATGGCAAAGCTTGGTTCTTCAAGCAAGCTCCGACCAGGGGATTGGGTTGTAGCAATTGGGTGTCCACTTTCGCTTCAGAATACTGTCACAGCTGGTATAGTAGAGGTTACTTTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCCATGGCAAAGCTTGGTTCTTCAAGCAAGCTCCGACCAGGGGATTGGGTTGTAGCAATTGGGTGTCCACTTTCGCTTCAGAATACTGTCACAGCTGTTGTGTTGACCGTAAGAGTAGTGATTTGGGTCTTGGTGGAATGCGAAGGGAATATCTACAAACAGATTGTGCAATTAACGTGGAGAGTTATTCGGCCTTGGCTTGGATTGAAAATGATCGATCTCAATGAAATGATAATCGAACAACTTAAAGAAAGAGATGCATCTTTTCCAGACGTTACTAAAGGGGTTCTTGTAGCTATGGTAACTCCTGGATCCCCTGCTGGTCGTGCTGGGTTCCGTCCTGGTGATGTCGTCATCGAGTTCGATAAGCAACCTGTTGCCAGTATCCAAGAGATCATTGAAATTATGGGAGATAGAGTTGGGATTCCATTGAAGGCAGTTGTGAAACGATCTCTTAATAGCATCATCACTTTGACTGTTCTTCCTGAGGAGTCCAATCCAGATATGTGATACTATACACTCACTACACAATCAATTCAATTCTTTCATATTTCCCATCATTTGGTTATGGCTTAGGGTTAGAACTCGATACCTTAACCTTCTCGTGTAAAGTTATATAGGATGTTTAAGTTTAGTTTGTGAACTTTTAAGCTTGTCTATTTGGTCGTTGAACTTGAATTTGACTAGAAATAGTCGTAGTTTTAAACTTTTTTTTTTTTTTTCATTTAAATTTTTTATCTTAAGAAGACCAAAAAATGTGAGAAATTTTGGAGATCGAAAAAATGAAAAAAGACTTTCTTAGTTAGAATCCTGCCTAGGGAAGTAACTGCGTATTAGACGTATACATAGGTAAAGCCGTGTGCAATGAAAGATGCAAGCCCGGGTTGGGGAGAGATTTTTTTTACTTAGTTTTTTATAAAAATAAGTTATCTACTCCATCTGACTAGTTTTGGGTTCGAATCCCGGACAACCCATAT

Coding sequence (CDS)

ATGATCCCGCTTCTTGTTAGAAAGGTTGCGAGCTCACGCAATACGCTCGGACGGATCGCCGCAATTGCTGCTGCTGGTTCTTGTATCTGGTATGCCGGAAGTAAATTAGATTCTGGATCCTCTGTAGTGTTGTCAATTCCTGCTGCTTTGAGTGAGCCACTGTTCCTTCCATGGCAGACCGCGCACGGCTTCACGATTCATCCTTCGGGTGCATTTGATCACCAGAAATTGGGTCTTTCATTTTGTTCTTCAAGAGTCAGTCCTGCTCCACCATCTGGTGTGGAGAAGGAAAAGCCTGGAGATACGCAGAAGCCTTGTCCAAGATGTTTGGATAGAGATACAATTGCAAATGCTGCAGCAGATGTTGGCCCTGCTGTTGTAAATATTTCTGTTTCACATGGTATTTATGGAATTGCTACTGCTAAAAGCATGGGATCCGGAACAATTATTGACAAGGATGGTACTATTTTAACATGTGCCCATGTCGTGACGGATTTTCATGGTCCACGAGCTGCATCCAAAGGAAAGGCAAGTAGATGTATCAGCCTAGTTTTTAACGCATATGTAGAGGTTACTTTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCCATGGCAAAGCTTGGTTCTTCAAGCAAGCTCCGACCAGGGGATTGGGTTGTAGCAATTGGGTGTCCACTTTCGCTTCAGAATACTGTCACAGCTGGTATAGTAGAGGTTACTTTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCCATGGCAAAGCTTGGTTCTTCAAGCAAGCTCCGACCAGGGGATTGGGTTGTAGCAATTGGGTGTCCACTTTCGCTTCAGAATACTGTCACAGCTGTTGTGTTGACCGTAAGAGTAGTGATTTGGGTCTTGGTGGAATGCGAAGGGAATATCTACAAACAGATTGTGCAATTAACGTGGAGAGTTATTCGGCCTTGGCTTGGATTGAAAATGATCGATCTCAATGAAATGATAATCGAACAACTTAAAGAAAGAGATGCATCTTTTCCAGACGTTACTAAAGGGGTTCTTGTAGCTATGGTAACTCCTGGATCCCCTGCTGGTCGTGCTGGGTTCCGTCCTGGTGATGTCGTCATCGAGTTCGATAAGCAACCTGTTGCCAGTATCCAAGAGATCATTGAAATTATGGGAGATAGAGTTGGGATTCCATTGAAGGCAGTTGTGAAACGATCTCTTAATAGCATCATCACTTTGACTGTTCTTCCTGAGGAGTCCAATCCAGATATGTGA

Protein sequence

MIPLLVRKVASSRNTLGRIAAIAAAGSCIWYAGSKLDSGSSVVLSIPAALSEPLFLPWQTAHGFTIHPSGAFDHQKLGLSFCSSRVSPAPPSGVEKEKPGDTQKPCPRCLDRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIEFDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM
BLAST of Cp4.1LG07g08910 vs. Swiss-Prot
Match: DGP14_ARATH (Putative protease Do-like 14 OS=Arabidopsis thaliana GN=DEGP14 PE=3 SV=2)

HSP 1 Score: 352.8 bits (904), Expect = 5.6e-96
Identity = 228/470 (48.51%), Postives = 284/470 (60.43%), Query Frame = 1

Query: 1   MIPLLVRKVASS-RNTLGRIAAIAAAGSCIWYAGSKLDSGSSVVLSIPAALSEPL-FLPW 60
           M+  L R V+SS R+ L RI ++A A S I YA +  D+ + V L+IP ++ E L  LPW
Sbjct: 1   MMNFLRRAVSSSKRSELIRIISVATATSGILYASTNPDARTRVSLAIPESVRESLSLLPW 60

Query: 61  QTAHGFTIHPSGAFDHQKLGLSFCSSRVSP---AP---PSGVEKEKPGDTQKPCPRCLDR 120
           Q + G    P    +    G    SSRVSP   AP     GV  E    + KP    L R
Sbjct: 61  QISPGLIHRP----EQSLFGNFVFSSRVSPKSEAPINDEKGVSVEASDSSSKPSNGYLGR 120

Query: 121 DTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAA 180
           DTIANAAA +GPAVVN+SV  G +GI+  KS+GSGTIID DGTILTCAHVV DF   R +
Sbjct: 121 DTIANAAARIGPAVVNLSVPQGFHGISMGKSIGSGTIIDADGTILTCAHVVVDFQNIRHS 180

Query: 181 SKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSS 240
           SKG+            V+VTLQDGRTFEG V+NAD  SDIA+VKI SK+PLP AKLG SS
Sbjct: 181 SKGR------------VDVTLQDGRTFEGVVVNADLQSDIALVKIKSKTPLPTAKLGFSS 240

Query: 241 KLRPGDWVVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSP 300
           KLRPGDWV+A+GCPLSLQNTVTAGIV  V  +      G        +D +I   NS  P
Sbjct: 241 KLRPGDWVIAVGCPLSLQNTVTAGIVSCVDRKSSDLGLGGKHREYLQTDCSINAGNSGGP 300

Query: 301 LPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLT 360
           L          +     V+ +         +  V+    +   V ++    I +   + +
Sbjct: 301 L----------VNLDGEVIGVN--------IMKVLAADGLGFSVPIDSVSKIIEHFKK-S 360

Query: 361 WRVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVI 420
            RVIRPW+GLKM++LN +I+ QLKERD  FPDV +GVLV  V PGSPA RAGF+PGDVV+
Sbjct: 361 GRVIRPWIGLKMVELNNLIVAQLKERDPMFPDVERGVLVPTVIPGSPADRAGFKPGDVVV 420

Query: 421 EFDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
            FD +PV      IEIM DRVG  ++ VV+RS    +TL V+PEE+NPDM
Sbjct: 421 RFDGKPV------IEIMDDRVGKRMQVVVERSNKERVTLEVIPEEANPDM 429

BLAST of Cp4.1LG07g08910 vs. Swiss-Prot
Match: HTRA1_BOVIN (Serine protease HTRA1 OS=Bos taurus GN=HTRA1 PE=2 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 5.7e-32
Identity = 109/356 (30.62%), Postives = 173/356 (48.60%), Query Frame = 1

Query: 115 IANAAADVGPAVVNISVSHGI--YGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAA 174
           IA+    + PAVV+I +   +         + GSG I+ +DG I+T AHVVT+ H     
Sbjct: 179 IADVVEKIAPAVVHIELFRKLPFSKREVPVASGSGFIVSEDGLIVTNAHVVTNKHR---- 238

Query: 175 SKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSS 234
                           V+V L++G T+E  + + D  +DIA++KI+ +  LP+  LG SS
Sbjct: 239 ----------------VKVELKNGATYEAKIKDVDEKADIALIKIDHQGKLPVLLLGRSS 298

Query: 235 KLRPGDWVVAIGCPLSLQNTVTAGIVEVTLQDGRTFEGTVMNADFHSDIAIVKI-NSKSP 294
           +LRPG++VVAIG P SLQNTVT GIV  T + G+       + D+    AI+   NS  P
Sbjct: 299 ELRPGEFVVAIGSPFSLQNTVTTGIVSTTQRGGKELGLRNSDMDYIQTDAIINYGNSGGP 358

Query: 295 LPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIV--- 354
           L                       ++L   V   + T++V   +      +  K+ +   
Sbjct: 359 L-----------------------VNLDGEVIG-INTLKVTAGISFAIPSDKIKKFLTES 418

Query: 355 ---QLTWRVI--RPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAG 414
              Q   + I  + ++G++M+ L     ++LK+R   FPDV  G  +  V P +PA   G
Sbjct: 419 HDRQAKGKAITKKKYIGIRMMSLTPSKAKELKDRHRDFPDVLSGAYIIEVIPDTPAEAGG 478

Query: 415 FRPGDVVIEFDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNP 460
            +  DV+I  + Q V S  ++ +++  +    L  VV+R  N  I +TV+PEE +P
Sbjct: 479 LKENDVIISINGQSVVSANDVSDVI--KKESTLNMVVRRG-NEDIMITVIPEEIDP 487

BLAST of Cp4.1LG07g08910 vs. Swiss-Prot
Match: HTRA1_HUMAN (Serine protease HTRA1 OS=Homo sapiens GN=HTRA1 PE=1 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 5.7e-32
Identity = 109/356 (30.62%), Postives = 173/356 (48.60%), Query Frame = 1

Query: 115 IANAAADVGPAVVNISVSHGI--YGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAA 174
           IA+    + PAVV+I +   +         + GSG I+ +DG I+T AHVVT+ H     
Sbjct: 172 IADVVEKIAPAVVHIELFRKLPFSKREVPVASGSGFIVSEDGLIVTNAHVVTNKHR---- 231

Query: 175 SKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSS 234
                           V+V L++G T+E  + + D  +DIA++KI+ +  LP+  LG SS
Sbjct: 232 ----------------VKVELKNGATYEAKIKDVDEKADIALIKIDHQGKLPVLLLGRSS 291

Query: 235 KLRPGDWVVAIGCPLSLQNTVTAGIVEVTLQDGRTFEGTVMNADFHSDIAIVKI-NSKSP 294
           +LRPG++VVAIG P SLQNTVT GIV  T + G+       + D+    AI+   NS  P
Sbjct: 292 ELRPGEFVVAIGSPFSLQNTVTTGIVSTTQRGGKELGLRNSDMDYIQTDAIINYGNSGGP 351

Query: 295 LPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIV--- 354
           L                       ++L   V   + T++V   +      +  K+ +   
Sbjct: 352 L-----------------------VNLDGEVIG-INTLKVTAGISFAIPSDKIKKFLTES 411

Query: 355 ---QLTWRVI--RPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAG 414
              Q   + I  + ++G++M+ L     ++LK+R   FPDV  G  +  V P +PA   G
Sbjct: 412 HDRQAKGKAITKKKYIGIRMMSLTSSKAKELKDRHRDFPDVISGAYIIEVIPDTPAEAGG 471

Query: 415 FRPGDVVIEFDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNP 460
            +  DV+I  + Q V S  ++ +++  +    L  VV+R  N  I +TV+PEE +P
Sbjct: 472 LKENDVIISINGQSVVSANDVSDVI--KRESTLNMVVRRG-NEDIMITVIPEEIDP 480

BLAST of Cp4.1LG07g08910 vs. Swiss-Prot
Match: HTRA3_HUMAN (Serine protease HTRA3 OS=Homo sapiens GN=HTRA3 PE=1 SV=2)

HSP 1 Score: 140.2 bits (352), Expect = 5.7e-32
Identity = 106/343 (30.90%), Postives = 179/343 (52.19%), Query Frame = 1

Query: 115 IANAAADVGPAVVNISV--SHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAA 174
           IA+    + PAVV+I +   H ++G     S GSG I+ + G I+T AHVV+      +A
Sbjct: 143 IADVVEKIAPAVVHIELFLRHPLFGRNVPLSSGSGFIMSEAGLIITNAHVVSS----NSA 202

Query: 175 SKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSS 234
           + G+            ++V LQ+G ++E T+ + D  SDIA +KI+ K  LP+  LG S+
Sbjct: 203 APGRQQ----------LKVQLQNGDSYEATIKDIDKKSDIATIKIHPKKKLPVLLLGHSA 262

Query: 235 KLRPGDWVVAIGCPLSLQNTVTAGIVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPL 294
            LRPG++VVAIG P +LQNTVT GIV    ++GR       + D+    AI+   + S  
Sbjct: 263 DLRPGEFVVAIGSPFALQNTVTTGIVSTAQREGRELGLRDSDMDYIQTDAIINYGN-SGG 322

Query: 295 PMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTW 354
           P+  L           V+ I   L +   ++  + + R+  + L E +    KQI    W
Sbjct: 323 PLVNLDGE--------VIGIN-TLKVTAGISFAIPSDRITRF-LTEFQD---KQIKD--W 382

Query: 355 RVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIE 414
           +  + ++G++M  +   ++++LK  +  FP+V+ G+ V  V P SP+ R G + GD++++
Sbjct: 383 K--KRFIGIRMRTITPSLVDELKASNPDFPEVSSGIYVQEVAPNSPSQRGGIQDGDIIVK 442

Query: 415 FDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPE 456
            + +P+    E+ E +      PL   V+R  N  +  ++ PE
Sbjct: 443 VNGRPLVDSSELQEAV--LTESPLLLEVRRG-NDDLLFSIAPE 450

BLAST of Cp4.1LG07g08910 vs. Swiss-Prot
Match: HTRA3_MOUSE (Serine protease HTRA3 OS=Mus musculus GN=Htra3 PE=1 SV=3)

HSP 1 Score: 138.7 bits (348), Expect = 1.7e-31
Identity = 102/343 (29.74%), Postives = 175/343 (51.02%), Query Frame = 1

Query: 115 IANAAADVGPAVVNISV--SHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAA 174
           IA+    + PAVV+I +   H ++G     S GSG I+ + G I+T AHVV+      + 
Sbjct: 149 IADVVEKIAPAVVHIELFLRHPLFGRNVPLSSGSGFIMSEAGLIVTNAHVVSS----SST 208

Query: 175 SKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSS 234
           + G+            ++V LQ+G  +E T+ + D  SDIA + I+ K  LP+  LG S+
Sbjct: 209 ASGRQQ----------LKVQLQNGDAYEATIQDIDKKSDIATIVIHPKKKLPVLLLGHSA 268

Query: 235 KLRPGDWVVAIGCPLSLQNTVTAGIVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPL 294
            LRPG++VVAIG P +LQNTVT GIV    +DG+       + D+    AI+   + S  
Sbjct: 269 DLRPGEFVVAIGSPFALQNTVTTGIVSTAQRDGKELGLRDSDMDYIQTDAIINYGN-SGG 328

Query: 295 PMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTW 354
           P+  L           V+ I   L +   ++  + + R+  + L E +    K      W
Sbjct: 329 PLVNLDGE--------VIGIN-TLKVAAGISFAIPSDRITRF-LSEFQNKHVKD-----W 388

Query: 355 RVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIE 414
           +  + ++G++M  +   ++E+LK  +  FP V+ G+ V  V P SP+ R G + GD++++
Sbjct: 389 K--KRFIGIRMRTITPSLVEELKAANPDFPAVSSGIYVQEVVPNSPSQRGGIQDGDIIVK 448

Query: 415 FDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPE 456
            + +P+A   E+ E + +   + L+    R  N  +  +++PE
Sbjct: 449 VNGRPLADSSELQEAVLNESSLLLEV---RRGNDDLLFSIIPE 456

BLAST of Cp4.1LG07g08910 vs. TrEMBL
Match: A0A0A0LFW5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G030600 PE=4 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 3.1e-146
Identity = 310/463 (66.95%), Postives = 335/463 (72.35%), Query Frame = 1

Query: 1   MIPLLVRKVASSRNTLGRIAAIAAAGSCIWYAGSKLD-SGSSVVLSIPAALSEPLFLPWQ 60
           MIP L R V+SS  T  R AA+AAAGSC  YA S LD S  S+VLSIPAA S+PLFLPWQ
Sbjct: 1   MIPFL-RNVSSSYKTFRRFAAVAAAGSCYLYARSDLDYSKPSIVLSIPAAWSDPLFLPWQ 60

Query: 61  TAHGFTIHPSGAFDHQKLGLSFCSSRVSPAPPSGVEKEKPGDTQKPCPRCLDRDTIANAA 120
           T HG    P G FDH+ L +S CSSRVSP            D +K  P CL RDTIANAA
Sbjct: 61  TTHGVRPRPLGTFDHRLLDISLCSSRVSP------------DDKKETP-CLGRDTIANAA 120

Query: 121 ADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASR 180
           ADVGPAVVNISVS+GIYGIA+AKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGK   
Sbjct: 121 ADVGPAVVNISVSYGIYGIASAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGK--- 180

Query: 181 CISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDW 240
                    VEVTLQDGRTFEGTVMNADFHSDIAIVKINSK+PLP AKLGSSSKLRPGDW
Sbjct: 181 ---------VEVTLQDGRTFEGTVMNADFHSDIAIVKINSKTPLPKAKLGSSSKLRPGDW 240

Query: 241 VVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLG 300
           VVAIGCPLSLQNTVTAGIV  V  +      G +      +D AI   NS  PL      
Sbjct: 241 VVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINMGNSGGPL------ 300

Query: 301 SSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPW 360
               +     VV +    ++     A  L+  V I    +    I +Q  +   RVIRPW
Sbjct: 301 ----VNVDGEVVGV----NIMKVDDAAGLSFAVPI----DSVSKITEQF-KKRGRVIRPW 360

Query: 361 LGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIEFDKQPV 420
           LGLKMIDLNEMIIEQLKERDA+FPDVTKGVLVAMVTPGSPA  AGFRPGDVVIE DKQPV
Sbjct: 361 LGLKMIDLNEMIIEQLKERDATFPDVTKGVLVAMVTPGSPASHAGFRPGDVVIELDKQPV 418

Query: 421 ASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           ASI+EIIEIMGDR G+PL AVVKRSLN+IITLTVLPEESNPDM
Sbjct: 421 ASIKEIIEIMGDRAGVPLNAVVKRSLNTIITLTVLPEESNPDM 418

BLAST of Cp4.1LG07g08910 vs. TrEMBL
Match: A0A067JKY4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23321 PE=4 SV=1)

HSP 1 Score: 426.0 bits (1094), Expect = 5.8e-116
Identity = 260/459 (56.64%), Postives = 308/459 (67.10%), Query Frame = 1

Query: 9   VASSRNTLGRIAAIAAAGSCIWYA-GSKLDSGSSVVLSIPAALSEPLFLPWQTAHGF-TI 68
           V+S R +L R  AIAAAGS + YA  S  DS  ++ LSIPA LSE L      +    ++
Sbjct: 10  VSSGRPSLIRALAIAAAGSGLLYALSSYSDSNGTISLSIPAPLSESLLPSCHLSRQLISL 69

Query: 69  HPSGAFDHQKLG-LSFCSSRVSPAPPSGVEK--EKPGDTQKPCPRCLDRDTIANAAADVG 128
            P  + +H   G LS  SS VSP PP+ ++K     GD  KPC  CL RDTIANAAA VG
Sbjct: 70  PPFISAEHWDFGNLSLFSSGVSPVPPADIKKGCSVVGDDPKPCCGCLGRDTIANAAARVG 129

Query: 129 PAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASRCISL 188
           PAVVN+SV  G +GI T KS+GSGTIID DGTILTCAHVV DF G +A+SKGK       
Sbjct: 130 PAVVNLSVPQGFFGITTGKSIGSGTIIDSDGTILTCAHVVVDFQGLKASSKGK------- 189

Query: 189 VFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAI 248
                V+VTLQDGRTFEGTV+NAD HSDIAIVKI SK+PLP AKLG SS+LRPGDWV+A+
Sbjct: 190 -----VDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPNAKLGVSSRLRPGDWVIAM 249

Query: 249 GCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSK 308
           GCPLSLQNTVTAGIV  V  +      G +      +D AI + NS  PL          
Sbjct: 250 GCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINEGNSGGPL---------- 309

Query: 309 LRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPWLGLK 368
           +     VV +    ++   + A  L+  V I  + +   +  K     + RV+RPWLGLK
Sbjct: 310 VNIDGEVVGV----NIMKVLAADGLSFAVPIDSVAKIIEHFKK-----SGRVVRPWLGLK 369

Query: 369 MIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIEFDKQPVASIQ 428
           MIDLNEMII QLKERDA FP+V +GVLV MVTPGSPA RAGF PGDVVIEFD +PV SI+
Sbjct: 370 MIDLNEMIIAQLKERDARFPNVDRGVLVPMVTPGSPADRAGFHPGDVVIEFDGKPVESIK 429

Query: 429 EIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           EIIEIMGDRVG+PLKAVVKRS + ++TLTV PEE+NPDM
Sbjct: 430 EIIEIMGDRVGVPLKAVVKRSNDILVTLTVTPEEANPDM 437

BLAST of Cp4.1LG07g08910 vs. TrEMBL
Match: A0A067EZ89_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012318mg PE=4 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 4.7e-110
Identity = 247/472 (52.33%), Postives = 302/472 (63.98%), Query Frame = 1

Query: 13  RNTLGRIAAIAAAGSCIWYAGSKLDSGSSVVLSIPAALSEPLFLPWQTAHGFTIH-PSGA 72
           RN+L R+ AIAAAGS ++Y  S  DS + + LSIPA L E + +  Q +  FT H P  +
Sbjct: 8   RNSLSRVVAIAAAGSGLFYGSSNPDSKTRISLSIPATLHESVLVRRQMSQSFTPHSPFIS 67

Query: 73  FDHQKLG-LSFCSSRVSPAPPSGVEKEKPGDTQKP---------------CPRCLDRDTI 132
            D  + G +S  SSRV+PA    ++KE P   + P               C RCL RDTI
Sbjct: 68  SDRWQFGNVSLVSSRVNPASAGSIKKEYPVTKEAPVKEETTGDVKDGKDSCCRCLGRDTI 127

Query: 133 ANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKG 192
           ANAAA V PAVVN+S      GI + + +GSG I+D DGTILTCAHVV DFHG RA  KG
Sbjct: 128 ANAAARVCPAVVNLSAPREFLGILSGRGIGSGAIVDADGTILTCAHVVVDFHGSRALPKG 187

Query: 193 KASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLR 252
           K            V+VTLQDGRTFEGTV+NADFHSDIAIVKINSK+PLP AKLG+SSKL 
Sbjct: 188 K------------VDVTLQDGRTFEGTVLNADFHSDIAIVKINSKTPLPAAKLGTSSKLC 247

Query: 253 PGDWVVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPM 312
           PGDWVVA+GCP SLQNTVTAGIV  V  +      G +      +D AI   NS  PL  
Sbjct: 248 PGDWVVAMGCPHSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINAGNSGGPLVN 307

Query: 313 AKLGSSSKLRPGDWVVAIGCPLSLQ-NTVTAVVLTVRVVIWVLVECEGNIY----KQIVQ 372
              G    +       A G   ++  ++   ++   +   W+ VE +  +     KQ+V 
Sbjct: 308 ID-GEIVGINIMKVAAADGLSFAVPIDSAAKIIEQFKKNGWMHVEQKVPLLWSTCKQVVI 367

Query: 373 LTWRVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDV 432
           L  RV+RPWLGLKM+DLN+MII QLKERD SFP+V  GVLV +VTPGSPA  AGF P DV
Sbjct: 368 LCRRVVRPWLGLKMLDLNDMIIAQLKERDPSFPNVKSGVLVPVVTPGSPAHLAGFLPSDV 427

Query: 433 VIEFDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           VI+FD +PV SI EIIEIMGDRVG PLK VV+R+ + ++TLTV+PEE+NPDM
Sbjct: 428 VIKFDGKPVQSITEIIEIMGDRVGEPLKVVVQRANDQLVTLTVIPEEANPDM 466

BLAST of Cp4.1LG07g08910 vs. TrEMBL
Match: B9RF97_RICCO (Serine protease htra2, putative OS=Ricinus communis GN=RCOM_1433060 PE=4 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 6.2e-110
Identity = 254/463 (54.86%), Postives = 305/463 (65.87%), Query Frame = 1

Query: 5   LVRKVASSRNTLGRIAAIAAAGSCIWYAGSKLDSGSSVVLSIPAALSEPLFLPWQTAHGF 64
           L+RK A  RN++ R  A AA+GS I YA    DS ++V LS PA L E L     +    
Sbjct: 3   LMRK-APLRNSIIRTLAYAASGSGILYANINSDSDAAVSLSFPAHLRESL-----SEALI 62

Query: 65  TIHPSG-AFDHQKLG-LSFCSSRVSPAPPSGVEKEKPG---DTQKPCPRCLDRDTIANAA 124
           +++PS    D+   G L   SSR SP P + +++E  G   + +KP   CL RDTIA+AA
Sbjct: 63  SLNPSFICADNWHFGNLPLFSSRASPVPAADIDRESSGFAGEDKKPSCGCLGRDTIADAA 122

Query: 125 ADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASR 184
           A V PAVVN+SV  G YGI+T +S+GSGTIID DGTILTCAHVV D  G RA SKGK   
Sbjct: 123 AKVAPAVVNLSVPLGFYGISTGESIGSGTIIDSDGTILTCAHVVVDSQGRRALSKGK--- 182

Query: 185 CISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDW 244
                    V VTLQDGRTFEGTV+NAD HSDIA+VKI SK+PLP AKLGSSSKLRPGDW
Sbjct: 183 ---------VHVTLQDGRTFEGTVVNADLHSDIAMVKIKSKTPLPTAKLGSSSKLRPGDW 242

Query: 245 VVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLG 304
           V+A+GCPLSLQNTVTAGIV  V  +      G +      +D A    NS  PL      
Sbjct: 243 VIAMGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCATNGGNSGGPL------ 302

Query: 305 SSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPW 364
               +     VV +    ++   V A  L+  V I  + +   ++ K     + RVIRPW
Sbjct: 303 ----VNVDGEVVGV----NIMKVVAADGLSFSVPIDSVTKIIEHLKK-----SGRVIRPW 362

Query: 365 LGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIEFDKQPV 424
           LGLKMIDLNEMII QLKERD+ FP+V +G+LV MVTPGSPA RAGFRPGDVVIEFD++PV
Sbjct: 363 LGLKMIDLNEMIIAQLKERDSRFPNVNRGILVPMVTPGSPADRAGFRPGDVVIEFDRKPV 422

Query: 425 ASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
            SI+EIIEIMGDRV IPLK VVKRS + + TLTV+PEE+NPDM
Sbjct: 423 ESIKEIIEIMGDRVRIPLKVVVKRSNDILATLTVIPEEANPDM 428

BLAST of Cp4.1LG07g08910 vs. TrEMBL
Match: A0A061F3I9_THECC (Protease Do-like 14, putative isoform 1 OS=Theobroma cacao GN=TCM_026639 PE=4 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 6.8e-109
Identity = 243/458 (53.06%), Postives = 292/458 (63.76%), Query Frame = 1

Query: 9   VASSRNTLGRIAAIAAAGSCIWYAGSKLDSGSSVVLSIPAALSEPLFLPWQTAHGFTIHP 68
           V+ SR++L RI AI  AGS + Y  +  DS ++V LSIP  L E L   W+        P
Sbjct: 10  VSCSRSSLIRIVAIGTAGSGLLYWNTNPDSETTVKLSIPVPLREHLSFQWR-------RP 69

Query: 69  SGAFDHQKLG-LSFCSSRVSPAPPSGVEKEKP---GDTQKPCPRCLDRDTIANAAADVGP 128
             +  H ++G L   SSRVS AP     KE P    D +KPC  CL RD+IANAAA VGP
Sbjct: 70  FLSSYHWEIGNLPLFSSRVSAAPAGDTTKEAPVAVWDDKKPCCGCLSRDSIANAAAKVGP 129

Query: 129 AVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASRCISLV 188
           AVVN+SV  GIYGI T +S+GSGTIID DGTILTCAHVV +F G R+  KGK        
Sbjct: 130 AVVNLSVPQGIYGITTGRSIGSGTIIDADGTILTCAHVVVEFQGMRSTIKGK-------- 189

Query: 189 FNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIG 248
               V+VTLQDGRTFEGTV+NAD HSDIAIVKI SK+PLP AK GSSS LRPGDWV+A+G
Sbjct: 190 ----VDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPTAKFGSSSNLRPGDWVIAMG 249

Query: 249 CPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKL 308
           CPLSLQNT+TAGIV  V  +      G +      +D AI   NS  PL          +
Sbjct: 250 CPLSLQNTITAGIVSCVDRKSSDLGLGGMRREYLQTDCAINAGNSGGPL----------V 309

Query: 309 RPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPWLGLKM 368
                +V +         +  VV    +   V V+    I +     + RVIRPWLGLKM
Sbjct: 310 NIDGEIVGVN--------IMKVVAADGLSFAVPVDSVSKIIEHFKN-SGRVIRPWLGLKM 369

Query: 369 IDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIEFDKQPVASIQE 428
           +DLNEMII QL+ERDA FP + KG+LV MVTPGSPA  AGFRP DVV+EFD +PV SI+E
Sbjct: 370 LDLNEMIIAQLRERDAKFPKIEKGILVPMVTPGSPADLAGFRPSDVVVEFDGKPVESIKE 429

Query: 429 IIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           I+EIM DR+G PLK VVKR+ +  + LTV+PEE+NPDM
Sbjct: 430 IVEIMDDRIGKPLKVVVKRANDEEVMLTVIPEEANPDM 429

BLAST of Cp4.1LG07g08910 vs. TAIR10
Match: AT5G27660.1 (AT5G27660.1 Trypsin family protein with PDZ domain)

HSP 1 Score: 327.4 bits (838), Expect = 1.4e-89
Identity = 214/442 (48.42%), Postives = 264/442 (59.73%), Query Frame = 1

Query: 1   MIPLLVRKVASS-RNTLGRIAAIAAAGSCIWYAGSKLDSGSSVVLSIPAALSEPL-FLPW 60
           M+  L R V+SS R+ L RI ++A A S I YA +  D+ + V L+IP ++ E L  LPW
Sbjct: 1   MMNFLRRAVSSSKRSELIRIISVATATSGILYASTNPDARTRVSLAIPESVRESLSLLPW 60

Query: 61  QTAHGFTIHPSGAFDHQKLGLSFCSSRVSP---AP---PSGVEKEKPGDTQKPCPRCLDR 120
           Q + G    P    +    G    SSRVSP   AP     GV  E    + KP    L R
Sbjct: 61  QISPGLIHRP----EQSLFGNFVFSSRVSPKSEAPINDEKGVSVEASDSSSKPSNGYLGR 120

Query: 121 DTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAA 180
           DTIANAAA +GPAVVN+SV  G +GI+  KS+GSGTIID DGTILTCAHVV DF   R +
Sbjct: 121 DTIANAAARIGPAVVNLSVPQGFHGISMGKSIGSGTIIDADGTILTCAHVVVDFQNIRHS 180

Query: 181 SKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSS 240
           SKG+            V+VTLQDGRTFEG V+NAD  SDIA+VKI SK+PLP AKLG SS
Sbjct: 181 SKGR------------VDVTLQDGRTFEGVVVNADLQSDIALVKIKSKTPLPTAKLGFSS 240

Query: 241 KLRPGDWVVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSP 300
           KLRPGDWV+A+GCPLSLQNTVTAGIV  V  +      G        +D +I   NS  P
Sbjct: 241 KLRPGDWVIAVGCPLSLQNTVTAGIVSCVDRKSSDLGLGGKHREYLQTDCSINAGNSGGP 300

Query: 301 LPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLT 360
           L          +     V+ +         +  V+    +   V ++    I +   + +
Sbjct: 301 L----------VNLDGEVIGVN--------IMKVLAADGLGFSVPIDSVSKIIEHFKK-S 360

Query: 361 WRVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVI 420
            RVIRPW+GLKM++LN +I+ QLKERD  FPDV +GVLV  V PGSPA RAGF+PGDVV+
Sbjct: 361 GRVIRPWIGLKMVELNNLIVAQLKERDPMFPDVERGVLVPTVIPGSPADRAGFKPGDVVV 401

Query: 421 EFDKQPVASIQEIIEIMGDRVG 434
            FD +PV      IEIM DRVG
Sbjct: 421 RFDGKPV------IEIMDDRVG 401

BLAST of Cp4.1LG07g08910 vs. TAIR10
Match: AT3G27925.1 (AT3G27925.1 DegP protease 1)

HSP 1 Score: 75.5 bits (184), Expect = 9.7e-14
Identity = 102/378 (26.98%), Postives = 163/378 (43.12%), Query Frame = 1

Query: 107 PRCLDRDTIANAAA--DVGPAVV---NISVSHGIYGI---ATAKSMGSGTIIDKDGTILT 166
           P+ L  D +A      +  P+VV   N++V    + +      +  GSG + DK G    
Sbjct: 111 PKKLQTDELATVRLFQENTPSVVYITNLAVRQDAFTLDVLEVPQGSGSGFVWDKQG---- 170

Query: 167 CAHVVTDFHGPRAASKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKIN 226
             H+VT++H  R AS               + VTL D  TF+  V+  D   D+A+++I+
Sbjct: 171 --HIVTNYHVIRGASD--------------LRVTLADQTTFDAKVVGFDQDKDVAVLRID 230

Query: 227 S-KSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNAD 286
           + K+ L    +G S+ L  G  V AIG P  L +T+T G++  +  +      G  +   
Sbjct: 231 APKNKLRPIPVGVSADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDV 290

Query: 287 FHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVL 346
             +D AI   NS  PL    L SS  L        IG   ++ +   A   +  V   + 
Sbjct: 291 IQTDAAINPGNSGGPL----LDSSGTL--------IGINTAIYSPSGA---SSGVGFSIP 350

Query: 347 VECEGNIYKQIVQLTWRVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPG 406
           V+  G I  Q+V+   +V RP LG+K     +  +EQL            GVLV    P 
Sbjct: 351 VDTVGGIVDQLVRF-GKVTRPILGIKFAP--DQSVEQLG---------VSGVLVLDAPPS 410

Query: 407 SPAGRAGFRP-----------GDVVIEFDKQPVASIQEIIEIM-----GDRVGIP-LKAV 458
            PAG+AG +            GD++   +   V++  ++  I+     GD V +  L+  
Sbjct: 411 GPAGKAGLQSTKRDGYGRLVLGDIITSVNGTKVSNGSDLYRILDQCKVGDEVTVEVLRGD 439

BLAST of Cp4.1LG07g08910 vs. TAIR10
Match: AT5G39830.1 (AT5G39830.1 Trypsin family protein with PDZ domain)

HSP 1 Score: 73.2 bits (178), Expect = 4.8e-13
Identity = 80/312 (25.64%), Postives = 137/312 (43.91%), Query Frame = 1

Query: 145 GSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASRCISLVFNAYVEVTLQDG--RTFEGT 204
           GSG + D  G I+T  HV+ +      +      R         V +   DG  + FEG 
Sbjct: 155 GSGVVWDGQGYIVTNYHVIGNALSRNPSPGDVVGR---------VNILASDGVQKNFEGK 214

Query: 205 VMNADFHSDIAIVKINS-KSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVEVT 264
           ++ AD   D+A++K+++ ++ L   K+G S+ L+ G   +AIG P    +T+T G++   
Sbjct: 215 LVGADRAKDLAVLKVDAPETLLKPIKVGQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGL 274

Query: 265 LQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNT 324
            +D  +  G  +     +D AI   NS  PL    L S   L      + I   +  Q  
Sbjct: 275 NRDIFSQTGVTIGGGIQTDAAINPGNSGGPL----LDSKGNL------IGINTAIFTQTG 334

Query: 325 VTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPWLGLKMIDLNEMIIEQLKERDASF 384
            +A V        VL      I  Q++Q + +V+R  + +++    + +  QL       
Sbjct: 335 TSAGVGFAIPSSTVL-----KIVPQLIQFS-KVLRAGINIELAP--DPVANQL------- 394

Query: 385 PDVTKGVLVAMVTPGSPAGRAGFRP-----------GDVVIEFDKQPVASIQEIIEIM-- 438
            +V  G LV  V   S A +AG  P           GD+++  D +PV +  E+++I+  
Sbjct: 395 -NVRNGALVLQVPGKSLAEKAGLHPTSRGFAGNIVLGDIIVAVDDKPVKNKAELMKILDE 431

BLAST of Cp4.1LG07g08910 vs. TAIR10
Match: AT4G18370.1 (AT4G18370.1 DEGP protease 5)

HSP 1 Score: 54.7 bits (130), Expect = 1.8e-07
Identity = 46/153 (30.07%), Postives = 73/153 (47.71%), Query Frame = 1

Query: 145 GSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASRCISLVFNAYVEVTLQDGR----TFE 204
           GSG + DK G I+T  HV+       A  +    RC         +V+L D +    + E
Sbjct: 131 GSGFVWDKLGHIVTNYHVIAKL----ATDQFGLQRC---------KVSLVDAKGTRFSKE 190

Query: 205 GTVMNADFHSDIAIVKINSKS-PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVE 264
           G ++  D  +D+A++KI ++   L    LG+S+ LR G    AIG P   +NT+T G+V 
Sbjct: 191 GKIVGLDPDNDLAVLKIETEGRELNPVVLGTSNDLRVGQSCFAIGNPYGYENTLTIGVVS 250

Query: 265 VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPL 293
              ++  +  G  ++    +D  I   NS  PL
Sbjct: 251 GLGREIPSPNGKSISEAIQTDADINSGNSGGPL 270

BLAST of Cp4.1LG07g08910 vs. NCBI nr
Match: gi|449454081|ref|XP_004144784.1| (PREDICTED: putative protease Do-like 14 [Cucumis sativus])

HSP 1 Score: 526.6 bits (1355), Expect = 4.5e-146
Identity = 310/463 (66.95%), Postives = 335/463 (72.35%), Query Frame = 1

Query: 1   MIPLLVRKVASSRNTLGRIAAIAAAGSCIWYAGSKLD-SGSSVVLSIPAALSEPLFLPWQ 60
           MIP L R V+SS  T  R AA+AAAGSC  YA S LD S  S+VLSIPAA S+PLFLPWQ
Sbjct: 1   MIPFL-RNVSSSYKTFRRFAAVAAAGSCYLYARSDLDYSKPSIVLSIPAAWSDPLFLPWQ 60

Query: 61  TAHGFTIHPSGAFDHQKLGLSFCSSRVSPAPPSGVEKEKPGDTQKPCPRCLDRDTIANAA 120
           T HG    P G FDH+ L +S CSSRVSP            D +K  P CL RDTIANAA
Sbjct: 61  TTHGVRPRPLGTFDHRLLDISLCSSRVSP------------DDKKETP-CLGRDTIANAA 120

Query: 121 ADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASR 180
           ADVGPAVVNISVS+GIYGIA+AKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGK   
Sbjct: 121 ADVGPAVVNISVSYGIYGIASAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGK--- 180

Query: 181 CISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDW 240
                    VEVTLQDGRTFEGTVMNADFHSDIAIVKINSK+PLP AKLGSSSKLRPGDW
Sbjct: 181 ---------VEVTLQDGRTFEGTVMNADFHSDIAIVKINSKTPLPKAKLGSSSKLRPGDW 240

Query: 241 VVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLG 300
           VVAIGCPLSLQNTVTAGIV  V  +      G +      +D AI   NS  PL      
Sbjct: 241 VVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINMGNSGGPL------ 300

Query: 301 SSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPW 360
               +     VV +    ++     A  L+  V I    +    I +Q  +   RVIRPW
Sbjct: 301 ----VNVDGEVVGV----NIMKVDDAAGLSFAVPI----DSVSKITEQF-KKRGRVIRPW 360

Query: 361 LGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIEFDKQPV 420
           LGLKMIDLNEMIIEQLKERDA+FPDVTKGVLVAMVTPGSPA  AGFRPGDVVIE DKQPV
Sbjct: 361 LGLKMIDLNEMIIEQLKERDATFPDVTKGVLVAMVTPGSPASHAGFRPGDVVIELDKQPV 418

Query: 421 ASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           ASI+EIIEIMGDR G+PL AVVKRSLN+IITLTVLPEESNPDM
Sbjct: 421 ASIKEIIEIMGDRAGVPLNAVVKRSLNTIITLTVLPEESNPDM 418

BLAST of Cp4.1LG07g08910 vs. NCBI nr
Match: gi|659070278|ref|XP_008454302.1| (PREDICTED: putative protease Do-like 14 [Cucumis melo])

HSP 1 Score: 523.9 bits (1348), Expect = 2.9e-145
Identity = 309/462 (66.88%), Postives = 338/462 (73.16%), Query Frame = 1

Query: 1   MIPLLVRKVASSRNTLGRIAAIAAAGSCIWYAGSKLDSGSSVVLSIPAALSEPLFLPWQT 60
           MIP L R V+SS NT  RIAA+AAA S   YA S +DS  S+VLSIPAA S+PLFLPWQT
Sbjct: 1   MIPFL-RNVSSSHNTFRRIAAVAAAASYYLYARSDVDSKPSIVLSIPAAWSDPLFLPWQT 60

Query: 61  AHGFTIHPSGAFDHQKLGLSFCSSRVSPAPPSGVEKEKPGDTQKPCPRCLDRDTIANAAA 120
            H   +   G+FDHQ L +S CSSRVSP    G +KE P         CL RDTIANAAA
Sbjct: 61  THRARL--LGSFDHQLLDISLCSSRVSP----GEKKEAP---------CLGRDTIANAAA 120

Query: 121 DVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASRC 180
           DVGPAVVNISVS GIYGIA+AKS+GSGTIIDKDGTILTCAHVVTDFHGPRAASKGK    
Sbjct: 121 DVGPAVVNISVSRGIYGIASAKSVGSGTIIDKDGTILTCAHVVTDFHGPRAASKGK---- 180

Query: 181 ISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWV 240
                   VEVTLQDGRTFEGTVMNADFHSDIAIVKINSK+PLPMAKLGSSSKLRPGDWV
Sbjct: 181 --------VEVTLQDGRTFEGTVMNADFHSDIAIVKINSKTPLPMAKLGSSSKLRPGDWV 240

Query: 241 VAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGS 300
           +AIGCPLSLQNTVTAGIV  V  +      G +      +D AI   NS  PL       
Sbjct: 241 LAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPL------- 300

Query: 301 SSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPWL 360
              +     VV +    ++     AV L+  V I    +    I +Q  +   RVIRPWL
Sbjct: 301 ---VNVDGEVVGV----NIMKVDDAVGLSFAVPI----DSVSKITEQF-KKRGRVIRPWL 360

Query: 361 GLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIEFDKQPVA 420
           GLKMIDLN+MIIEQLKERDA+FPDVTKGVLVAMVTPGSPA RAGFRPGDVVIE DKQPV 
Sbjct: 361 GLKMIDLNKMIIEQLKERDATFPDVTKGVLVAMVTPGSPASRAGFRPGDVVIELDKQPVT 415

Query: 421 SIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           SI+EIIEIMGDRVG+PL AVVKRSLN+IITLTVLPEESNPDM
Sbjct: 421 SIKEIIEIMGDRVGVPLNAVVKRSLNTIITLTVLPEESNPDM 415

BLAST of Cp4.1LG07g08910 vs. NCBI nr
Match: gi|802756413|ref|XP_012089022.1| (PREDICTED: putative protease Do-like 14 [Jatropha curcas])

HSP 1 Score: 426.0 bits (1094), Expect = 8.3e-116
Identity = 260/459 (56.64%), Postives = 308/459 (67.10%), Query Frame = 1

Query: 9   VASSRNTLGRIAAIAAAGSCIWYA-GSKLDSGSSVVLSIPAALSEPLFLPWQTAHGF-TI 68
           V+S R +L R  AIAAAGS + YA  S  DS  ++ LSIPA LSE L      +    ++
Sbjct: 10  VSSGRPSLIRALAIAAAGSGLLYALSSYSDSNGTISLSIPAPLSESLLPSCHLSRQLISL 69

Query: 69  HPSGAFDHQKLG-LSFCSSRVSPAPPSGVEK--EKPGDTQKPCPRCLDRDTIANAAADVG 128
            P  + +H   G LS  SS VSP PP+ ++K     GD  KPC  CL RDTIANAAA VG
Sbjct: 70  PPFISAEHWDFGNLSLFSSGVSPVPPADIKKGCSVVGDDPKPCCGCLGRDTIANAAARVG 129

Query: 129 PAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKASRCISL 188
           PAVVN+SV  G +GI T KS+GSGTIID DGTILTCAHVV DF G +A+SKGK       
Sbjct: 130 PAVVNLSVPQGFFGITTGKSIGSGTIIDSDGTILTCAHVVVDFQGLKASSKGK------- 189

Query: 189 VFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAI 248
                V+VTLQDGRTFEGTV+NAD HSDIAIVKI SK+PLP AKLG SS+LRPGDWV+A+
Sbjct: 190 -----VDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPNAKLGVSSRLRPGDWVIAM 249

Query: 249 GCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSK 308
           GCPLSLQNTVTAGIV  V  +      G +      +D AI + NS  PL          
Sbjct: 250 GCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINEGNSGGPL---------- 309

Query: 309 LRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQLTWRVIRPWLGLK 368
           +     VV +    ++   + A  L+  V I  + +   +  K     + RV+RPWLGLK
Sbjct: 310 VNIDGEVVGV----NIMKVLAADGLSFAVPIDSVAKIIEHFKK-----SGRVVRPWLGLK 369

Query: 369 MIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVVIEFDKQPVASIQ 428
           MIDLNEMII QLKERDA FP+V +GVLV MVTPGSPA RAGF PGDVVIEFD +PV SI+
Sbjct: 370 MIDLNEMIIAQLKERDARFPNVDRGVLVPMVTPGSPADRAGFHPGDVVIEFDGKPVESIK 429

Query: 429 EIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           EIIEIMGDRVG+PLKAVVKRS + ++TLTV PEE+NPDM
Sbjct: 430 EIIEIMGDRVGVPLKAVVKRSNDILVTLTVTPEEANPDM 437

BLAST of Cp4.1LG07g08910 vs. NCBI nr
Match: gi|743830012|ref|XP_011023671.1| (PREDICTED: putative protease Do-like 14 [Populus euphratica])

HSP 1 Score: 408.7 bits (1049), Expect = 1.4e-110
Identity = 256/471 (54.35%), Postives = 306/471 (64.97%), Query Frame = 1

Query: 1   MIPLLVRKVASSRNTLGRIAAIA----AAGSCIWYAGSK-LDSGSSVVLSIPA-ALSEPL 60
           M+  L+RKV++  +   RI  IA    A GS + YA SK  DS + + LS  A +L E L
Sbjct: 1   MMDYLLRKVSTCSSKYIRIPVIAIAAAAGGSGLLYANSKHRDSDTRISLSFRAESLHESL 60

Query: 61  FLPWQTAHGFTIHPSGAFDHQKLGLSFCSSRVSPAPPSGVEKEKPG---DTQKPCPRCLD 120
            LPW+T    T H S  F +    L   SSR+SP P   ++ E PG   ++ KP   CL 
Sbjct: 61  LLPWRTPLDLTPH-SWHFGN----LPLFSSRISPVPSGDIKNESPGVVGESPKPSCGCLG 120

Query: 121 RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRA 180
           RDTIANAAA VGPAVVN+SV  G YGI T KS+GSGTIID +GTILTCAHVV DF   RA
Sbjct: 121 RDTIANAAARVGPAVVNLSVPKGFYGITTGKSIGSGTIIDSNGTILTCAHVVVDFQDMRA 180

Query: 181 ASKGKASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSS 240
           +SKGK            V+VTLQDGRTFEGTV+NAD HSDIAIVKI SK+PLP AKLGSS
Sbjct: 181 SSKGK------------VDVTLQDGRTFEGTVVNADLHSDIAIVKIKSKTPLPTAKLGSS 240

Query: 241 SKLRPGDWVVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKS 300
           SKLRPGDWVVA+GCPLSLQNTVTAGIV  V  +      G +      +D AI   NS  
Sbjct: 241 SKLRPGDWVVAMGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINMGNSGG 300

Query: 301 PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAVVLTVRVVIWVLVECEGNIYKQIVQL 360
           PL          +     VV +    ++   + A  L+  V I  + +   +  +     
Sbjct: 301 PL----------INVDGEVVGV----NIMKVLAADGLSFAVPIDSIAKIMEHFKR----- 360

Query: 361 TWRVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDVV 420
           + RVIRPWLGLKMIDLNEMII QLKERD  FP+V +GVLV MVTPGSPA RAGF PGDVV
Sbjct: 361 SGRVIRPWLGLKMIDLNEMIITQLKERDPKFPNVKEGVLVPMVTPGSPADRAGFHPGDVV 420

Query: 421 IEFDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           I+FD +PV SI+EIIEIMGDRVG PL+ V+KR  + ++ LTV+PEE+NPDM
Sbjct: 421 IKFDGKPVRSIKEIIEIMGDRVGKPLEVVLKRPNDVVVNLTVIPEEANPDM 435

BLAST of Cp4.1LG07g08910 vs. NCBI nr
Match: gi|641837286|gb|KDO56241.1| (hypothetical protein CISIN_1g012318mg [Citrus sinensis])

HSP 1 Score: 406.4 bits (1043), Expect = 6.8e-110
Identity = 247/472 (52.33%), Postives = 302/472 (63.98%), Query Frame = 1

Query: 13  RNTLGRIAAIAAAGSCIWYAGSKLDSGSSVVLSIPAALSEPLFLPWQTAHGFTIH-PSGA 72
           RN+L R+ AIAAAGS ++Y  S  DS + + LSIPA L E + +  Q +  FT H P  +
Sbjct: 8   RNSLSRVVAIAAAGSGLFYGSSNPDSKTRISLSIPATLHESVLVRRQMSQSFTPHSPFIS 67

Query: 73  FDHQKLG-LSFCSSRVSPAPPSGVEKEKPGDTQKP---------------CPRCLDRDTI 132
            D  + G +S  SSRV+PA    ++KE P   + P               C RCL RDTI
Sbjct: 68  SDRWQFGNVSLVSSRVNPASAGSIKKEYPVTKEAPVKEETTGDVKDGKDSCCRCLGRDTI 127

Query: 133 ANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKG 192
           ANAAA V PAVVN+S      GI + + +GSG I+D DGTILTCAHVV DFHG RA  KG
Sbjct: 128 ANAAARVCPAVVNLSAPREFLGILSGRGIGSGAIVDADGTILTCAHVVVDFHGSRALPKG 187

Query: 193 KASRCISLVFNAYVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLR 252
           K            V+VTLQDGRTFEGTV+NADFHSDIAIVKINSK+PLP AKLG+SSKL 
Sbjct: 188 K------------VDVTLQDGRTFEGTVLNADFHSDIAIVKINSKTPLPAAKLGTSSKLC 247

Query: 253 PGDWVVAIGCPLSLQNTVTAGIVE-VTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPM 312
           PGDWVVA+GCP SLQNTVTAGIV  V  +      G +      +D AI   NS  PL  
Sbjct: 248 PGDWVVAMGCPHSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINAGNSGGPLVN 307

Query: 313 AKLGSSSKLRPGDWVVAIGCPLSLQ-NTVTAVVLTVRVVIWVLVECEGNIY----KQIVQ 372
              G    +       A G   ++  ++   ++   +   W+ VE +  +     KQ+V 
Sbjct: 308 ID-GEIVGINIMKVAAADGLSFAVPIDSAAKIIEQFKKNGWMHVEQKVPLLWSTCKQVVI 367

Query: 373 LTWRVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPAGRAGFRPGDV 432
           L  RV+RPWLGLKM+DLN+MII QLKERD SFP+V  GVLV +VTPGSPA  AGF P DV
Sbjct: 368 LCRRVVRPWLGLKMLDLNDMIIAQLKERDPSFPNVKSGVLVPVVTPGSPAHLAGFLPSDV 427

Query: 433 VIEFDKQPVASIQEIIEIMGDRVGIPLKAVVKRSLNSIITLTVLPEESNPDM 462
           VI+FD +PV SI EIIEIMGDRVG PLK VV+R+ + ++TLTV+PEE+NPDM
Sbjct: 428 VIKFDGKPVQSITEIIEIMGDRVGEPLKVVVQRANDQLVTLTVIPEEANPDM 466

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
DGP14_ARATH5.6e-9648.51Putative protease Do-like 14 OS=Arabidopsis thaliana GN=DEGP14 PE=3 SV=2[more]
HTRA1_BOVIN5.7e-3230.62Serine protease HTRA1 OS=Bos taurus GN=HTRA1 PE=2 SV=1[more]
HTRA1_HUMAN5.7e-3230.62Serine protease HTRA1 OS=Homo sapiens GN=HTRA1 PE=1 SV=1[more]
HTRA3_HUMAN5.7e-3230.90Serine protease HTRA3 OS=Homo sapiens GN=HTRA3 PE=1 SV=2[more]
HTRA3_MOUSE1.7e-3129.74Serine protease HTRA3 OS=Mus musculus GN=Htra3 PE=1 SV=3[more]
Match NameE-valueIdentityDescription
A0A0A0LFW5_CUCSA3.1e-14666.95Uncharacterized protein OS=Cucumis sativus GN=Csa_2G030600 PE=4 SV=1[more]
A0A067JKY4_JATCU5.8e-11656.64Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23321 PE=4 SV=1[more]
A0A067EZ89_CITSI4.7e-11052.33Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012318mg PE=4 SV=1[more]
B9RF97_RICCO6.2e-11054.86Serine protease htra2, putative OS=Ricinus communis GN=RCOM_1433060 PE=4 SV=1[more]
A0A061F3I9_THECC6.8e-10953.06Protease Do-like 14, putative isoform 1 OS=Theobroma cacao GN=TCM_026639 PE=4 SV... [more]
Match NameE-valueIdentityDescription
AT5G27660.11.4e-8948.42 Trypsin family protein with PDZ domain[more]
AT3G27925.19.7e-1426.98 DegP protease 1[more]
AT5G39830.14.8e-1325.64 Trypsin family protein with PDZ domain[more]
AT4G18370.11.8e-0730.07 DEGP protease 5[more]
Match NameE-valueIdentityDescription
gi|449454081|ref|XP_004144784.1|4.5e-14666.95PREDICTED: putative protease Do-like 14 [Cucumis sativus][more]
gi|659070278|ref|XP_008454302.1|2.9e-14566.88PREDICTED: putative protease Do-like 14 [Cucumis melo][more]
gi|802756413|ref|XP_012089022.1|8.3e-11656.64PREDICTED: putative protease Do-like 14 [Jatropha curcas][more]
gi|743830012|ref|XP_011023671.1|1.4e-11054.35PREDICTED: putative protease Do-like 14 [Populus euphratica][more]
gi|641837286|gb|KDO56241.1|6.8e-11052.33hypothetical protein CISIN_1g012318mg [Citrus sinensis][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0004252serine-type endopeptidase activity
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR009003Peptidase_S1_PA
IPR001940Peptidase_S1C
IPR001478PDZ
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0004252 serine-type endopeptidase activity
molecular_function GO:0016491 oxidoreductase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG07g08910.1Cp4.1LG07g08910.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001478PDZ domainGENE3DG3DSA:2.30.42.10coord: 356..456
score: 1.6
IPR001478PDZ domainPFAMPF13180PDZ_2coord: 358..426
score: 6.
IPR001478PDZ domainSMARTSM00228pdz_newcoord: 357..444
score: 8.
IPR001478PDZ domainPROFILEPS50106PDZcoord: 347..428
score: 9
IPR001478PDZ domainunknownSSF50156PDZ domain-likecoord: 356..453
score: 1.71
IPR001940Peptidase S1CPRINTSPR00834PROTEASES2Ccoord: 236..260
score: 1.2E-12coord: 154..166
score: 1.2E-12coord: 403..415
score: 1.2
IPR009003Peptidase S1, PA clanunknownSSF50494Trypsin-like serine proteasescoord: 111..259
score: 3.32E-30coord: 258..334
score: 2.07
NoneNo IPR availableGENE3DG3DSA:2.40.10.10coord: 292..324
score: 1.7E-7coord: 113..229
score: 9.2E-25coord: 264..291
score: 8.1E-9coord: 230..263
score: 1.
NoneNo IPR availablePANTHERPTHR22939SERINE PROTEASE FAMILY S1C HTRA-RELATEDcoord: 354..458
score: 1.6E-119coord: 189..321
score: 1.6E-119coord: 101..164
score: 1.6E
NoneNo IPR availablePANTHERPTHR22939:SF95SERINE PROTEASE HTRA2, MITOCHONDRIALcoord: 101..164
score: 1.6E-119coord: 189..321
score: 1.6E-119coord: 354..458
score: 1.6E
NoneNo IPR availablePFAMPF13365Trypsin_2coord: 145..259
score: 1.1

The following gene(s) are paralogous to this gene:

None