Cp4.1LG07g10850.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG07g10850.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: Protease 2
Location: Cp4.1LG07 : 10130941 .. 10137034 (-)
Sequence length: 2474
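
The location field above can be turned into a genomic span with a few lines of Python. This is only a sketch: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG07 : 10130941 .. 10137034 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 6094 bp on the minus strand of Cp4.1LG07, which is longer than the 2474-nt mRNA because the genomic interval also covers the introns.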
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
CAACTCCGGCAATGAACCAATTTCGTGCAGCCCTCCGCCACTGCCGCTCTAATCTTCACGGTGCTCTCCGGCGATGCCTCCACTACAAGGCACCAAAGACTCCACAGCCACCGTCACCACCGGGGCCGCCAAAGCCTCCAAAGAAGCCACAGAGTTTCACTATGCACGATATCACTTGGGAGGATCCGTACAGTTGGATGTCGAGATTGAACGACAAGGTGGCGATGCGGCATATGGACGTTTACATGGAGCAGGAGGAGAAGTATGTGGAGGCTGTAATGGCTGACACAGAACGACTCCAGAGTAAGCTTCAATCTGAAATGGCTTCTCGCTTGGCTTTTGACCTCTCGACTCCTTCACTTCGTTTGGGTCCTTGGTAGGGTTTTTTTGGTTATGAACTCTTTGCATTGTTGTTAATCGATTAGAATCAATCTCTTTGCCTCAATTATTGTTGATCTCTTTGAAGGCGAATGTCAATCGCCGCAGGTTGTATTATCGAAGAGTCGAAGAAGGAAAGCAGTATCAAGTTCTCTGTCGCAGATTAGCGAGCTTACATGAAAAGTTCATTTCTAATAAATCTCCTTCAGCTGGATTTGATTATGTTTCCGGCAAGAAAATTGAGCAAAAGTTGCTTGATTATAATCAAGAAGCTGAGAGATTCGGAGGTATTGTTATCTTTTCCCTGATCCTCCTCTGTTTCCCTTTGGTTAAAAATGTTCTTTTGTGGGATTCACTTTAAAACAAATCATTTTGAGAGAATTCTATTTCGTGCTGCTTCCTGATAAACTTTGTTTTGCCTCATAACATGAAATGAAGCAATCATCTACAAGAGGAATGTTGAGGATTGTTGGGAGGGAGTCTCACAGGTTGAGATATGATTCAAAGTATTCAAAATGAAGATATGATTTGAAGTAGTTGAGATATGATTCAAATATTCAAACTACCAAGATTTTAGGAGATTTTAGGAGATTGTTAGTAGATTTGAATTTATCTAGATTTGCATGATTTACTATTTCTGGTCCAGTGTATTAGCTATAGATACTGGAATTGCTTAACCCTTTAAGCATCGAGTCAGTATCTATTTTATCCATTCCTATCTCTTTCACTATCCCCTTTCCAATCTCACATTGGCTAATTTAGGGAATGATCATGAGTTTATAAGTAAGGAAAGGAAGCCAAAAGCAAAGCCATGAGAGCTTATGATCAAAGTGGACAATATCATACCATTGTGGAGAGTCGTTATTTCTAACATGGTATCGGAGCCATGCCCTTAACTTAGCTCAGTCAATAGGATCCTCAAATGTCGAATAAAGAAGTTGTGACCCTCAAAGGTGTATTTCAAAGTGACTCAAGTGTCGAACAAGGATGTACTTTGTTCGAGGAATCCAGAGAAGAAGTCGAGCCCCGATTAAGGGGAGGTTGTTCGAGGACTCCATAGGCCTCAGGAAAGGCTCTGTAGTGTACTTTGTTGGAGGGGAGGATTGTTGGAAGGGAGTCCCACATTGACTAATTTAGGGAACGATCATGGGTTTATAAGTAAGGAATACATCTCCATTGGTATGAGGCCTTTTGGGAAGCCCAAAGCAAAACCATGAGAGCTTATGCTCAAAGTGGACAATATCATACCATTGTGGAGAGTCCTTATTCCTGACAAGGAAGAGCTTGGTTTCTGATTATAGTTTCCGCATGCCATTACTTATTCCCCCTTTATATTTGAAGCATGTTGAATCTCACCCTCTTAATAATTATTGTAGGTTATGCCTATGAGGAGCTATCAGAAGTATCTCCTGATCATCGCTTTCTTGCATACACTATGTATGACAAGGACAATGACTACTTCAGATTGTCTGTAAAGAATTTGAGTTCTGGTTCTTTATGTAGTAAGCCTCAAGTCGATCGAGTTTCTAATTTGGCATGGGCCAAAGGCGGCCAGGCATTGCTCTATGTTGTTACTGATCAAAATAAAAGACCATGTAGGTTGGTAAAATAGTTTCTCTGCG
CAGTTACTCCTCATCAGTGTATTCTTCAGTTTTTTTCCATATTGACGACTATGGTTGTGAAAATGAGATGGAATATGCTTTTTTTTATAGATTATATTGTAGCATGATTGGATCAACTGATGAAGATACTTTGCTTCTGGAAGAACCAGATGATGATGTTCATGTTTATATTAGACACACAAAAGACTTTAATTTTGTTACTGTTAATCGATTCACTCCTACATCTTCCAAGGTATGTATGACATCTGAAGAACTTCATAGTTGGTACTAAGGTCGGTTGCAATTATATATTTATTGTGAACCCTACATTTTAGGTCTTTCTGATTGATGCTGCCAATCCGTTATCTGGTATGGAGTTAATTTGGGAGTGTGAAGGATTAGCTCATTGCATAATGGAACATCATCTAGGAGTGCTTTACTTGTTTACGAATGCTAATAAAGGTCACGAAGCAGTAGATTCTCATTATCTTCTTCGTAGCCCACTTAGTGTTGAATCTACTTCAAGAACATGGGAGGTTGGTTTTGTTCATTGGGACTTCTTGTCCTTTTTTATCCCCTTTTACTCTGAATGATCCAAAACCTTCAACATTTTCATGAACATGGTTTTTGTTTTCTCCATCTTGATGATTCTCAACCAATTTTCTAATGTGGTGCAGAATGTATTTGTTGATGATCCAGACTTGGTGATTGTGGATGTCGATTTCAGTCACACGCATTTGGTTCTTATTCTTAGGGAAGGACAGAAACTTAGACTCTGTGCTGTTCGGCTACCCTTGCCTGTTGGTGGAAAGGTCTAATTTCTGGAATTTTAGGACTATCTATGCCTCTGGTTGCATATAATGTGCAGCAACACTAGCAGTTACCATATAATTAAGCAGGCCCTTGACTACCATTATTTAGTCCATCTGTCATTTTTTTTATGTGCCAGGGATCAATCAATCTCAAAGAACTAGAACCACATTTTCTGCCTCTTCCTAAGCACGTATCGCAGATTTCTTCAGGACCAAATTACGACTTTTATTCATCGACAATGCGATTTACCATTTCATCACCTGTGGTATGTGAATTTCTATTGGTCGTATATTCTTAAGATTGTTTTAACGCACGTGGTTGCTCGTAACTGTGGCTGAGGCACAAAACTTTACGGAAATGAATTGCAGATTGCTTAACGGTCAAAGTTATGTCGATTTTCATATCTTATGGAGTGTTTGTGGGCAGTTTGATTCGTCTCTATTGTTAAAACTAGAACTGCTGGTTGCTAGAATCAGTAATATGAACTAGAATGATATCTATCCTTTGGAATGTAATGATTATAAGAACTTAACCTACTTAGACTTTAGGACAAATGTAATAATGTCCGACTGTTTGCTGCCCATTAATTGTTATTTAACTTTCAACAGATGCCTGATGCTGTGGTTGATTATAACCTATCAGATGGAAAGTGGAATATCATTCAACAGCAAAGCATTCTTCATGAACGAACACGAATTCTTTATGGAACGACTTCCTCTGCAGAAGCATCAGGAAAAATATCTAATGAGTCGGAGATTTCTACGGGTGAAGCCAACTTCGATGATGATCAGATGTGGAACACCCTCTCTGAATTTTATGCTTGTGAACACTTCAATGTCTCATCACATGATGAAGTTTTGATTCCTTTAACGGTCGTATACTCTTACAAGAGTAAAAGAGAAAATGAAAACCCTGGATTACTTCATGTACATGGAGCTTTTGGTGAGCCACTCGACAAACGGTGGCGCAGCGAGTTGAAAAGCCTTCTTGATCGTGGCTGGGTCATTGCATATGCTGATGTTAGGTTCGTAAGCATTTATACTTCCTTGAATTCTGAATGACTTCATGGCTTGAACCTTTTATTGATTTAGCTATGATAAACAGCTTACTTTGGAAATGGTTTTAATTCATCTTATGAACGTTAAACTTATTTCAGATTCCATCAAAAGTTCTATATCTCTAATTTGGCATCTCATTTCCTGTTGTTT
TGTTTGCTTCCAGAGGTGGAGGTGGTGGGGGTAAGAAGTGGCATCATGATGGTAGGCGTACAAAGAAGTTTAATTCAGTTCAAGATTATATTTCGTGTGCTAAATTCCTTGTTGAAAGAAAGATTGTAAATGAGGAGAAGCTTGCTGGTTGGGGCTATAGTGCTGGAGGACTTTTGGTTGCTTCTGCTATCAATCAATGCCCAGAATTGTTTCGATCTGCTATTTTGAAAGTATGCCTCTATCTTCTGCATTATTTCTCTTGCATTTTATATGTTTGTTCTAAAGTTGGATGGAAAGATTCCGTGTTATAAGATTTTATTCTTCTGCTATAATCAGTTTGTTTTAAAAGAAACTTTTCTTCAAAAGGGTTTTGATCCATTTTATCTTTACTAAATCTTTCATTTTGTTACAGGTTCCATTTCTAGATCCAATAAACACACTCCTTCATCCCATTATACCACTAACACCAGCTGACTATGAAGAATTTGGATACCCTGAGGAGGATATAGATGATTTTCATGCAGTTCGCAGATACTCTCCGTATGATAACATACAGAAGGATGTCGCCTACCCAGCTGTTTTGATAACCTCGTCCTTTAATACCCGGTAATTTTCTTTTCATATGCTAGCCGAAANCTTCTGCTATAATCAGTTTGTTTTAAAAGAAACTTTTCTTCAAAAGGGTTTTGATCCATTTTATCTTTACTAAATCTTTCATTTTGTTACAGGTTCCATTTCTAGATCCAATAAACACACTCCTTCATCCCATTATACCACTAACACCAGCTGACTATGAAGAATTTGGATACCCTGAGGAGGATATAGATGATTTTCATGCAGTTCGCAGATACTCTCCGTATGATAACATACAGAAGGATGTCGCCTACCCAGCTGTTTTGATAACCTCGTCCTTTAATACCCGGTAATTTTCTTTTCATATGCTAGCCGAAACATTCTTTATAAGGGTGTGAAAACCTTTCCTTAGTAGATACGTTTTAAAACCTTGAAAGGAAGGCTGGAAGGGGAAGCTCAAAGAGGACAATATTTACTAGCGGTGGGCTTGGGCCATTAGAAATGGTATAAGAGCTAGACACTGGATGATGTGCTAGTAAGGAGGCTGAGTCCCAAAGAGGTTGGACACGAGGCGGTGTGCCACAAGGAGGTAGCCTCGAAGGAGGGTGGACACGAGGCAGTGTGCCAACGATGAGGCTGAGCCCCAAAGGGGGGTGGACATGAGACGGTGTGCTAGCAAGGACGTTGGGTCCCAATTGGGGGATCCCGTATCGATTGGAGAAGAGATGAGTGCTAGTGAGGATGTTGGGCCCCGAAGGGGGGTAAATTGTGAGATCCCACATCGGTTGAGGAGGAGAACGAAATATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGTATACGCATTTTAAAATCTTGAGGAGAAACCTGGAAGGGAAAGCTCAAAGAGAACAATATCTATTAGCGGTAGGCTTAGGCCGTTACATACTATATCGAGTAAAATGTCAGGATTGGACATTGATATTACTCGAACATGCTCTGAACTTGTTATTTGGAGTATCCAATTATTCTTAGTTATTACTTCATTAAAATCTTTGGCAAGTAGGGACACACTTGGAAGAAAATTCATTGTTGAATACGTCTATAACGCATCCTTGTCCATGTTTCTTCCTCTAAAAAGAAGAATGGTTTGCAAATCATTGTTCAAAACACTGCTTTTGTCTGAATTATCTGTGTTAGAATTCATCTAACTCTGACATTACATAAAAGGCCTTGTACCAATGGAGGGAGATGTATTCCTTATAAACTCATGATCAACTCATTGTTTCATGCATTCATCGAGTAACATTGTTCATAGTCCAGCCATTCTGTGATGTTCAATCCACTTATCTCTCTGTGTTGTGCAGATTTGGGGTATGGGAAGCTGCAAAATGGATTGCTCGAGTGCGGGATTACAGTATTTATGATCCAAAACGTCCGGTAATTCT
CAATATAACAACAGACATAGTGGAGGAAAACAGGTATTTGCACTGTAAAGAATCAGCTTTAGAGACTGCATTTCTTTTGAAGTTTATAGGATCG

mRNA sequence

CAACTCCGGCAATGAACCAATTTCGTGCAGCCCTCCGCCACTGCCGCTCTAATCTTCACGGTGCTCTCCGGCGATGCCTCCACTACAAGGCACCAAAGACTCCACAGCCACCGTCACCACCGGGGCCGCCAAAGCCTCCAAAGAAGCCACAGAGTTTCACTATGCACGATATCACTTGGGAGGATCCGTACAGTTGGATGTCGAGATTGAACGACAAGGTGGCGATGCGGCATATGGACGTTTACATGGAGCAGGAGGAGAAGTATGTGGAGGCTGTAATGGCTGACACAGAACGACTCCAGAGTAAGCTTCAATCTGAAATGGCTTCTCGCTTGGCTTTTGACCTCTCGACTCCTTCACTTCGTTTGGGTCCTTGGTTGTATTATCGAAGAGTCGAAGAAGGAAAGCAGTATCAAGTTCTCTGTCGCAGATTAGCGAGCTTACATGAAAAGTTCATTTCTAATAAATCTCCTTCAGCTGGATTTGATTATGTTTCCGGCAAGAAAATTGAGCAAAAGTTGCTTGATTATAATCAAGAAGCTGAGAGATTCGGAGGTTATGCCTATGAGGAGCTATCAGAAGTATCTCCTGATCATCGCTTTCTTGCATACACTATGTATGACAAGGACAATGACTACTTCAGATTGTCTGTAAAGAATTTGAGTTCTGGTTCTTTATGTAGTAAGCCTCAAGTCGATCGAGTTTCTAATTTGGCATGGGCCAAAGGCGGCCAGGCATTGCTCTATGTTGTTACTGATCAAAATAAAAGACCATGTAGCATGATTGGATCAACTGATGAAGATACTTTGCTTCTGGAAGAACCAGATGATGATGTTCATGTTTATATTAGACACACAAAAGACTTTAATTTTGTTACTGTTAATCGATTCACTCCTACATCTTCCAAGGTCTTTCTGATTGATGCTGCCAATCCGTTATCTGGTATGGAGTTAATTTGGGAGTGTGAAGGATTAGCTCATTGCATAATGGAACATCATCTAGGAGTGCTTTACTTGTTTACGAATGCTAATAAAGGTCACGAAGCAGTAGATTCTCATTATCTTCTTCGTAGCCCACTTAGTGTTGAATCTACTTCAAGAACATGGGAGAATGTATTTGTTGATGATCCAGACTTGGTGATTGTGGATGTCGATTTCAGTCACACGCATTTGGTTCTTATTCTTAGGGAAGGACAGAAACTTAGACTCTGTGCTGTTCGGCTACCCTTGCCTGTTGGTGGAAAGGGATCAATCAATCTCAAAGAACTAGAACCACATTTTCTGCCTCTTCCTAAGCACGTATCGCAGATTTCTTCAGGACCAAATTACGACTTTTATTCATCGACAATGCGATTTACCATTTCATCACCTGTGATGCCTGATGCTGTGGTTGATTATAACCTATCAGATGGAAAGTGGAATATCATTCAACAGCAAAGCATTCTTCATGAACGAACACGAATTCTTTATGGAACGACTTCCTCTGCAGAAGCATCAGGAAAAATATCTAATGAGTCGGAGATTTCTACGGGTGAAGCCAACTTCGATGATGATCAGATGTGGAACACCCTCTCTGAATTTTATGCTTGTGAACACTTCAATGTCTCATCACATGATGAAGTTTTGATTCCTTTAACGGTCGTATACTCTTACAAGAGTAAAAGAGAAAATGAAAACCCTGGATTACTTCATGTACATGGAGCTTTTGGTGAGCCACTCGACAAACGGTGGCGCAGCGAGTTGAAAAGCCTTCTTGATCGTGGCTGGGTCATTGCATATGCTGATGTTAGAGGTGGAGGTGGTGGGGGTAAGAAGTGGCATCATGATGGTAGGCGTACAAAGAAGTTTAATTCAGTTCAAGATTATATTTCGTGTGCTAAATTCCTTGTTGAAAGAAAGATTGTAAATGAGGAGAAGCTTGCTGGTTGGGGCTATAGTGCTGGAGGACTTTTGGTTGCTTCTGCTATCAATCAATGCCCAGAATTGTTTCGATCTGCTATT
TTGAAAGTTCCATTTCTAGATCCAATAAACACACTCCTTCATCCCATTATACCACTAACACCAGCTGACTATGAAGAATTTGGATACCCTGAGGAGGATATAGATGATTTTCATGCAGTTCGCAGATACTCTCCGTATGATAACATACAGAAGGATGTCGCCTACCCAGCTGTTCCATTTCTAGATCCAATAAACACACTCCTTCATCCCATTATACCACTAACACCAGCTGACTATGAAGAATTTGGATACCCTGAGGAGGATATAGATGATTTTCATGCAGTTCGCAGATACTCTCCATTTGGGGTATGGGAAGCTGCAAAATGGATTGCTCGAGTGCGGGATTACAGTATTTATGATCCAAAACGTCCGGTAATTCTCAATATAACAACAGACATAGTGGAGGAAAACAGGTATTTGCACTGTAAAGAATCAGCTTTAGAGACTGCATTTCTTTTGAAGTTTATAGGATCG

Coding sequence (CDS)

ATGAACCAATTTCGTGCAGCCCTCCGCCACTGCCGCTCTAATCTTCACGGTGCTCTCCGGCGATGCCTCCACTACAAGGCACCAAAGACTCCACAGCCACCGTCACCACCGGGGCCGCCAAAGCCTCCAAAGAAGCCACAGAGTTTCACTATGCACGATATCACTTGGGAGGATCCGTACAGTTGGATGTCGAGATTGAACGACAAGGTGGCGATGCGGCATATGGACGTTTACATGGAGCAGGAGGAGAAGTATGTGGAGGCTGTAATGGCTGACACAGAACGACTCCAGAGTAAGCTTCAATCTGAAATGGCTTCTCGCTTGGCTTTTGACCTCTCGACTCCTTCACTTCGTTTGGGTCCTTGGTTGTATTATCGAAGAGTCGAAGAAGGAAAGCAGTATCAAGTTCTCTGTCGCAGATTAGCGAGCTTACATGAAAAGTTCATTTCTAATAAATCTCCTTCAGCTGGATTTGATTATGTTTCCGGCAAGAAAATTGAGCAAAAGTTGCTTGATTATAATCAAGAAGCTGAGAGATTCGGAGGTTATGCCTATGAGGAGCTATCAGAAGTATCTCCTGATCATCGCTTTCTTGCATACACTATGTATGACAAGGACAATGACTACTTCAGATTGTCTGTAAAGAATTTGAGTTCTGGTTCTTTATGTAGTAAGCCTCAAGTCGATCGAGTTTCTAATTTGGCATGGGCCAAAGGCGGCCAGGCATTGCTCTATGTTGTTACTGATCAAAATAAAAGACCATGTAGCATGATTGGATCAACTGATGAAGATACTTTGCTTCTGGAAGAACCAGATGATGATGTTCATGTTTATATTAGACACACAAAAGACTTTAATTTTGTTACTGTTAATCGATTCACTCCTACATCTTCCAAGGTCTTTCTGATTGATGCTGCCAATCCGTTATCTGGTATGGAGTTAATTTGGGAGTGTGAAGGATTAGCTCATTGCATAATGGAACATCATCTAGGAGTGCTTTACTTGTTTACGAATGCTAATAAAGGTCACGAAGCAGTAGATTCTCATTATCTTCTTCGTAGCCCACTTAGTGTTGAATCTACTTCAAGAACATGGGAGAATGTATTTGTTGATGATCCAGACTTGGTGATTGTGGATGTCGATTTCAGTCACACGCATTTGGTTCTTATTCTTAGGGAAGGACAGAAACTTAGACTCTGTGCTGTTCGGCTACCCTTGCCTGTTGGTGGAAAGGGATCAATCAATCTCAAAGAACTAGAACCACATTTTCTGCCTCTTCCTAAGCACGTATCGCAGATTTCTTCAGGACCAAATTACGACTTTTATTCATCGACAATGCGATTTACCATTTCATCACCTGTGATGCCTGATGCTGTGGTTGATTATAACCTATCAGATGGAAAGTGGAATATCATTCAACAGCAAAGCATTCTTCATGAACGAACACGAATTCTTTATGGAACGACTTCCTCTGCAGAAGCATCAGGAAAAATATCTAATGAGTCGGAGATTTCTACGGGTGAAGCCAACTTCGATGATGATCAGATGTGGAACACCCTCTCTGAATTTTATGCTTGTGAACACTTCAATGTCTCATCACATGATGAAGTTTTGATTCCTTTAACGGTCGTATACTCTTACAAGAGTAAAAGAGAAAATGAAAACCCTGGATTACTTCATGTACATGGAGCTTTTGGTGAGCCACTCGACAAACGGTGGCGCAGCGAGTTGAAAAGCCTTCTTGATCGTGGCTGGGTCATTGCATATGCTGATGTTAGAGGTGGAGGTGGTGGGGGTAAGAAGTGGCATCATGATGGTAGGCGTACAAAGAAGTTTAATTCAGTTCAAGATTATATTTCGTGTGCTAAATTCCTTGTTGAAAGAAAGATTGTAAATGAGGAGAAGCTTGCTGGTTGGGGCTATAGTGCTGGAGGACTTTTGGTTGCTTCTGCTATCAATCAATGCCCAGAATTGTTTCGATCTGCTATTTTGAAAGTTCC
ATTTCTAGATCCAATAAACACACTCCTTCATCCCATTATACCACTAACACCAGCTGACTATGAAGAATTTGGATACCCTGAGGAGGATATAGATGATTTTCATGCAGTTCGCAGATACTCTCCGTATGATAACATACAGAAGGATGTCGCCTACCCAGCTGTTCCATTTCTAGATCCAATAAACACACTCCTTCATCCCATTATACCACTAACACCAGCTGACTATGAAGAATTTGGATACCCTGAGGAGGATATAGATGATTTTCATGCAGTTCGCAGATACTCTCCATTTGGGGTATGGGAAGCTGCAAAATGGATTGCTCGAGTGCGGGATTACAGTATTTATGATCCAAAACGTCCGGTAATTCTCAATATAACAACAGACATAGTGGAGGAAAACAGGTATTTGCACTGTAAAGAATCAGCTTTAGAGACTGCATTTCTTTTGAAGTTTATAGGATCG

Protein sequence

MNQFRAALRHCRSNLHGALRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDITWEDPYSWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLGPWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERFGGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGGQALLYVVTDQNKRPCSMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPTSSKVFLIDAANPLSGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPLSVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINLKELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQSILHERTRILYGTTSSAEASGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNVSSHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCPELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDVAYPAVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARVRDYSIYDPKRPVILNITTDIVEENRYLHCKESALETAFLLKFIGS
BLAST of Cp4.1LG07g10850.1 vs. Swiss-Prot
Match: PPCEL_CHICK (Prolyl endopeptidase-like OS=Gallus gallus GN=PREPL PE=2 SV=1)

HSP 1 Score: 162.2 bits (409), Expect = 2.5e-38
Identity = 143/554 (25.81%), Positives = 248/554 (44.77%), Query Frame = 1

Query: 173 YNQEAERFGGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVS 232
           ++ E   F G AY +   +SPD R+LA ++  ++++     +  L       +  +  V 
Sbjct: 164 FSTEDMGFSG-AYIKRIRISPDERYLATSLQSENSEEATCVIMKLGDVPFVEEV-IPNVF 223

Query: 233 NLAWAKGGQALLYVVTDQNKRPCSMIGSTDEDT----LLLEEPDDDVHVYIRHTKDFNFV 292
           +  WA     +LY  + +N +  ++  +T  +     L+  E D    V I  TKD  F+
Sbjct: 224 SFEWATND--VLYYTSQKNLKCQNVFMTTFTNEKYTKLVYTEQDARFFVDIYCTKDRRFL 283

Query: 293 TVNRFTPTSSKVFLIDAANPLSGMELIW-ECEGLAHCIMEHHLGVLYLFTNANKGHEAVD 352
           T+N  + T+S+V+LID  +P     L+    +G+ + + EH    LY+ T+  +  E   
Sbjct: 284 TINSNSKTTSEVWLIDCRHPFKLPVLVQARTKGVIYHV-EHRNNELYILTSYGEPAE--- 343

Query: 353 SHYLLRSPLSVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLP 412
             Y L       +    W+ V+  +    ++D++    H ++ L++   L L  +     
Sbjct: 344 --YKLMKASVASTGMENWQLVYALEEKTKLIDLEMFRDHCIMFLQKAGYLYLNVIAFV-- 403

Query: 413 VGGKGSINLKELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDG 472
                S +++ ++   LP      ++ S P +   SST  F ++SPV P     Y+  + 
Sbjct: 404 -----SHSVQSIQ---LPTWACAFELESHPEH--ASSTCYFQLTSPVHPPRRFAYSFKEN 463

Query: 473 KWNIIQQQSILHERTRILYGTTSSAEASGKISNESEISTGEANFDDDQMWNTLSEFYACE 532
             N+I+Q +   E   I+   T+   A  K                D+    ++ F+   
Sbjct: 464 --NLIEQAA--EEVPIIMNCHTTRLLAKSK----------------DETLVPITVFH--- 523

Query: 533 HFNVSSHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVI 592
             NV+S +    PL V                 V+GA+G  L+  ++ E   L++ GW++
Sbjct: 524 --NVNSKELHRKPLLVH----------------VYGAYGIDLNMSFKEEKLMLIEEGWIL 583

Query: 593 AYADVRGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLL 652
           AY  VRGGG  G +WH DG +  K   + D  +C   L E      +  A    SAGG+L
Sbjct: 584 AYCHVRGGGELGLRWHKDGCQQNKLKGLHDLKACIMLLHELGFSQPKYTALTAVSAGGVL 643

Query: 653 VASAINQCPELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYS 712
             +  N  PEL R+ +L+ PF+D +NT++   +PL+  + EE+G P  D      ++ Y 
Sbjct: 644 AGAICNSDPELIRAVVLQAPFVDVLNTMMKTHLPLSIEEQEEWGNPLADEKCMKYIKNYC 653

Query: 713 PYDNIQKDVAYPAV 722
           PY NI K   YP+V
Sbjct: 704 PYHNI-KPQCYPSV 653
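
The percentages on each HSP summary line follow directly from the residue counts. The sketch below recomputes them from the line text; the function name and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern accepts the summary label spelled with or without its first "i".

```python
import re

# Matches an HSP summary line such as:
#   Identity = 143/554 (25.81%), Positives = 248/554 (44.77%), Query Frame = 1
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from counts."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


print(hsp_percentages(
    "Identity = 143/554 (25.81%), Positives = 248/554 (44.77%), Query Frame = 1"
))
```

Both values are taken over the alignment length (554 columns here, gaps included), not over the full length of the query protein.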

BLAST of Cp4.1LG07g10850.1 vs. Swiss-Prot
Match: PTRB_ECOLI (Protease 2 OS=Escherichia coli (strain K12) GN=ptrB PE=1 SV=2)

HSP 1 Score: 156.0 bits (393), Expect = 1.8e-36
Identity = 108/343 (31.49%), Positives = 167/343 (48.69%), Query Frame = 1

Query: 480 ERTRILYGTTSSAEASGKISNESEISTGEAN-FDDDQMWNTLSEFYACEHFNVSSHDEVL 539
           E  R+ YG +S          E ++ TGE       ++    +  Y  EH  + + D V 
Sbjct: 371 ETARLRYGYSSMTTPDTLF--ELDMDTGERRVLKQTEVPGFYAANYRSEHLWIVARDGVE 430

Query: 540 IPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGGGG 599
           +P+++VY  K  R+  NP L++ +G++G  +D  +     SLLDRG+V A   VRGGG  
Sbjct: 431 VPVSLVYHRKHFRKGHNPLLVYGYGSYGASIDADFSFSRLSLLDRGFVYAIVHVRGGGEL 490

Query: 600 GKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCPEL 659
           G++W+ DG+  KK N+  DY+     L++    +       G SAGG+L+  AINQ PEL
Sbjct: 491 GQQWYEDGKFLKKKNTFNDYLDACDALLKLGYGSPSLCYAMGGSAGGMLMGVAINQRPEL 550

Query: 660 FRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDVAY 719
           F   I +VPF+D + T+L   IPLT  ++EE+G P +D   +  ++ YSPYDN+    AY
Sbjct: 551 FHGVIAQVPFVDVVTTMLDESIPLTTGEFEEWGNP-QDPQYYEYMKSYSPYDNVTAQ-AY 610

Query: 720 PAVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARVRD 779
           P +     + T LH                             S    WE AKW+A++R+
Sbjct: 611 PHL----LVTTGLHD----------------------------SQVQYWEPAKWVAKLRE 670

Query: 780 YSIYDPKRPVILNITTDI-----VEENRYLHCKESALETAFLL 817
               D     +L + TD+      +  R+   +  A+E AFL+
Sbjct: 671 LKTDDH----LLLLCTDMDSGHGGKSGRFKSYEGVAMEYAFLV 673

BLAST of Cp4.1LG07g10850.1 vs. Swiss-Prot
Match: PPCEL_MOUSE (Prolyl endopeptidase-like OS=Mus musculus GN=Prepl PE=1 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 4.4e-35
Identity = 78/189 (41.27%), Positives = 109/189 (57.67%), Query Frame = 1

Query: 533 SHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADV 592
           S D  L+P+TV +   S+     P L+HV+GA+G  L   +R E + L+D GW++AY  V
Sbjct: 448 SKDGKLVPMTVFHKTDSEDLQRKPLLVHVYGAYGMDLKMNFRPEKRVLVDDGWILAYCHV 507

Query: 593 RGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAI 652
           RGGG  G +WH DGR TKK N + D ++C K L  +            +SAGG+LV +  
Sbjct: 508 RGGGELGLQWHADGRLTKKLNGLADLVACIKTLHSQGFSQPSLTTLSAFSAGGVLVGALC 567

Query: 653 NQCPELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNI 712
           N  PEL R+  L+ PFLD +NT+L   +PLT  + EE+G P  D    + ++RY P  NI
Sbjct: 568 NSKPELLRAVTLEAPFLDVLNTMLDTTLPLTLEELEEWGNPSSDEKHKNYIKRYCPCQNI 627

Query: 713 QKDVAYPAV 722
            K   YP+V
Sbjct: 628 -KPQHYPSV 635

BLAST of Cp4.1LG07g10850.1 vs. Swiss-Prot
Match: PPCEL_HUMAN (Prolyl endopeptidase-like OS=Homo sapiens GN=PREPL PE=1 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 1.7e-34
Identity = 76/189 (40.21%), Positives = 109/189 (57.67%), Query Frame = 1

Query: 533 SHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADV 592
           S D  L+P+TV +   S+   + P L+HV+GA+G  L   +R E + L+D GW++AY  V
Sbjct: 450 SKDGKLVPMTVFHKTDSEDLQKKPLLVHVYGAYGMDLKMNFRPERRVLVDDGWILAYCHV 509

Query: 593 RGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAI 652
           RGGG  G +WH DGR TKK N + D  +C K L  +            +SAGG+L  +  
Sbjct: 510 RGGGELGLQWHADGRLTKKLNGLADLEACIKTLHGQGFSQPSLTTLTAFSAGGVLAGALC 569

Query: 653 NQCPELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNI 712
           N  PEL R+  L+ PFLD +NT++   +PLT  + EE+G P  D    + ++RY PY NI
Sbjct: 570 NSNPELVRAVTLEAPFLDVLNTMMDTTLPLTLEELEEWGNPSSDEKHKNYIKRYCPYQNI 629

Query: 713 QKDVAYPAV 722
            K   YP++
Sbjct: 630 -KPQHYPSI 637

BLAST of Cp4.1LG07g10850.1 vs. Swiss-Prot
Match: PPCEL_MACFA (Prolyl endopeptidase-like OS=Macaca fascicularis GN=PREPL PE=2 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 1.7e-34
Identity = 77/189 (40.74%), Positives = 109/189 (57.67%), Query Frame = 1

Query: 533 SHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADV 592
           S D  L+P+TV +   S+   + P L+HV+GA+G  L   +R E + L+D GW++AY  V
Sbjct: 450 SKDGKLVPMTVFHKTDSEDLQKKPLLIHVYGAYGMDLKMNFRPERRVLVDDGWILAYCHV 509

Query: 593 RGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAI 652
           RGGG  G +WH DGR TKK N + D  +C K L  +            +SAGG+L  +  
Sbjct: 510 RGGGELGLQWHADGRLTKKLNGLADLEACIKTLHGQGFSQPSLTTLTAFSAGGVLAGALC 569

Query: 653 NQCPELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNI 712
           N  PEL R+  L+ PFLD +NT++   +PLT  + EE+G P  D    + ++RY PY NI
Sbjct: 570 NCNPELLRAVTLEAPFLDVLNTMMDTTLPLTLEELEEWGNPSSDEKHKNYIKRYCPYQNI 629

Query: 713 QKDVAYPAV 722
            K   YP+V
Sbjct: 630 -KPQHYPSV 637

BLAST of Cp4.1LG07g10850.1 vs. TrEMBL
Match: A0A0A0LL77_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009530 PE=4 SV=1)

HSP 1 Score: 1380.2 bits (3571), Expect = 0.0e+00
Identity = 685/825 (83.03%), Positives = 731/825 (88.61%), Query Frame = 1

Query: 1   MNQFRAALRHCRSNLHGALRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDITWEDPY 60
           MN+ RA LRH R++LHG   RCLHYK PKTPQPP+PP PPKPPKKPQSFT+H+ITWEDPY
Sbjct: 1   MNRLRAVLRHRRTHLHGDFGRCLHYKVPKTPQPPAPPAPPKPPKKPQSFTLHEITWEDPY 60

Query: 61  SWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLG 120
           SWMS LNDKVAMRHMDVYMEQEEKY EAVM  TERLQSKLQSEMASRLAF+LSTP LR G
Sbjct: 61  SWMSSLNDKVAMRHMDVYMEQEEKYTEAVMGGTERLQSKLQSEMASRLAFELSTPPLRWG 120

Query: 121 PWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERF 180
           PWLYYRRVEEGKQY VLCRRLASLHE+FISNKSPSAGFDYVSG+KIEQKL+DYNQEAERF
Sbjct: 121 PWLYYRRVEEGKQYPVLCRRLASLHEEFISNKSPSAGFDYVSGQKIEQKLIDYNQEAERF 180

Query: 181 GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG 240
           GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG
Sbjct: 181 GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG 240

Query: 241 QALLYVVTDQNKRPC----SMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPT 300
           Q+LLYVVTDQNKRPC    S IGS DEDTLLLEE DDDVHVYIRHTKDF FVTVNRF+PT
Sbjct: 241 QSLLYVVTDQNKRPCRLYCSTIGSIDEDTLLLEEKDDDVHVYIRHTKDFRFVTVNRFSPT 300

Query: 301 SSKVFLIDAANPLSGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPL 360
           SSKVFLIDAA+PLSGM+LIWECE LAHCI+EHHLG LYLFT+A+KGHE VDSHYLLRSPL
Sbjct: 301 SSKVFLIDAADPLSGMKLIWECEELAHCIVEHHLGDLYLFTDASKGHERVDSHYLLRSPL 360

Query: 361 SVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINL 420
            V+ST RTWE+VFVDDPD VIVDVDF HTHLVLILREG+K  LCAVRLPLPVGGKG I+L
Sbjct: 361 KVDSTLRTWEHVFVDDPDFVIVDVDFCHTHLVLILREGRKFSLCAVRLPLPVGGKGPISL 420

Query: 421 KELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS 480
           KELE  +LPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS
Sbjct: 421 KELELQYLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS 480

Query: 481 ILHERTRILYGTTSSAEASGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNVSSHDE 540
           ILHERTRILYGTTSSA  S +ISN  E S GEANFD+ QMWN+LSE+YACEH+NVSS D 
Sbjct: 481 ILHERTRILYGTTSSAGGSREISNALENSVGEANFDE-QMWNSLSEYYACEHYNVSSDDG 540

Query: 541 VLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGG 600
           VL+PLTVVYSYK K+ENENPGLLHVHGA+GE LDKRWRSELKSLLDRGWVIAYADVRGGG
Sbjct: 541 VLVPLTVVYSYKCKKENENPGLLHVHGAYGELLDKRWRSELKSLLDRGWVIAYADVRGGG 600

Query: 601 GGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCP 660
           GGGKKWH DGRR KKFNSVQDYISCAKFL ER+IVNE+KLAGWGYSAGGLLVASAINQCP
Sbjct: 601 GGGKKWHQDGRRIKKFNSVQDYISCAKFLAERQIVNEDKLAGWGYSAGGLLVASAINQCP 660

Query: 661 ELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDV 720
           ELFR+AILKVPFLDPI+TLL+PIIPLTPADYEEFGYP  + DDFHA+RRYSPYDNIQKD 
Sbjct: 661 ELFRAAILKVPFLDPISTLLNPIIPLTPADYEEFGYPGNE-DDFHAIRRYSPYDNIQKDA 720

Query: 721 AYPAVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARV 780
           A              +P + +T +              F+     + FGVWEAAKWIARV
Sbjct: 721 A--------------YPAVLITSS--------------FN-----TRFGVWEAAKWIARV 780

Query: 781 RDYSIYDPKRPVILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           RDYSIYDPKRPVILN+T DIVEENRYLHCKESALETAFL+K + S
Sbjct: 781 RDYSIYDPKRPVILNLTIDIVEENRYLHCKESALETAFLMKAMES 790

BLAST of Cp4.1LG07g10850.1 vs. TrEMBL
Match: V4UI75_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10024926mg PE=4 SV=1)

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 580/825 (70.30%), Positives = 677/825 (82.06%), Query Frame = 1

Query: 1   MNQFRAALRHCRSNLHGALRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDITWEDPY 60
           M     A+R C  N HG+L +  HYK PKT +PP+ P PPKPPKKPQ FT HD TWEDPY
Sbjct: 1   MRHLLTAVR-CFYNNHGSLTQVRHYKPPKTSRPPAAPSPPKPPKKPQRFTFHDHTWEDPY 60

Query: 61  SWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLG 120
           SWMS LNDKVAMRHMD+Y+EQEEKY EAVM+DTERLQSKLQSEMASRLAF+LSTP LR G
Sbjct: 61  SWMSSLNDKVAMRHMDMYIEQEEKYAEAVMSDTERLQSKLQSEMASRLAFELSTPPLRWG 120

Query: 121 PWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERF 180
           PWLYYRRVEEGKQY VLCRRL SL+E+FIS+KSP+AGFD+ SGKKIEQKLLDYNQEAERF
Sbjct: 121 PWLYYRRVEEGKQYLVLCRRLVSLNEEFISHKSPAAGFDFTSGKKIEQKLLDYNQEAERF 180

Query: 181 GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG 240
           GGYAYEELSEVSPDH+FLAYTMYDKDNDYF LSV+NL+SG+LCSKPQ  RVSN+AWAK G
Sbjct: 181 GGYAYEELSEVSPDHKFLAYTMYDKDNDYFTLSVRNLNSGALCSKPQAVRVSNIAWAKDG 240

Query: 241 QALLYVVTDQNKRP----CSMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPT 300
           QAL+YVV+DQNKRP    CS+IGSTDED LLLEE +++V+V IRHTKDF+FV V+ F+ T
Sbjct: 241 QALIYVVSDQNKRPYQIYCSIIGSTDEDALLLEESNENVYVNIRHTKDFHFVCVHTFSTT 300

Query: 301 SSKVFLIDAANPLSGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPL 360
           SSKVFLI+AA+P SG+ L+WECEGLAHCI+EHH G LYLFT+A K  +  D+HYLLR P+
Sbjct: 301 SSKVFLINAADPFSGLTLVWECEGLAHCIVEHHEGFLYLFTDAAKEGQEADNHYLLRCPV 360

Query: 361 SVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINL 420
                SRTWE+VF+DD  LV+ DVDF  TH+ LILREG+  RLC+V LPLP G KG ++L
Sbjct: 361 DASFPSRTWESVFIDDQGLVVEDVDFCKTHMALILREGRTYRLCSVSLPLPAG-KGVVHL 420

Query: 421 KELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS 480
           KEL PHFLPLPK+VSQI+ GPNYD+YSSTMRFTISSPVMPDAVVDY+LS GKWNIIQQQ+
Sbjct: 421 KELHPHFLPLPKYVSQIAPGPNYDYYSSTMRFTISSPVMPDAVVDYDLSYGKWNIIQQQN 480

Query: 481 ILHERTRILYGTTSSAEASGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNVSSHDE 540
           +L ERTRILYGT SSA  S  ++ +S  S  E   D D +WN LSEFY+CE ++V SHD 
Sbjct: 481 MLRERTRILYGTASSATIS--LNAKSGESVNELKSDSDNLWNDLSEFYSCEQYDVPSHDG 540

Query: 541 VLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGG 600
           + +PLT++YS K K+EN+NPGLLH HGA+GE LDKRWRSELKSLLDRGWV+A+ADVRGGG
Sbjct: 541 ISVPLTIIYSPKYKKENQNPGLLHGHGAYGELLDKRWRSELKSLLDRGWVVAFADVRGGG 600

Query: 601 GGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCP 660
           GGGKKWHHDGRRTKK NS++D+ISCA+FL+E++IV E KLAGWGYSAGGLLVA+AIN CP
Sbjct: 601 GGGKKWHHDGRRTKKLNSIKDFISCARFLIEKEIVKEHKLAGWGYSAGGLLVAAAINCCP 660

Query: 661 ELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDV 720
           +LFR+ +L+VPFLD  NTLL+PI+PL  ADYEEFGYP  DIDDFHA+R YSPYDNIQKDV
Sbjct: 661 DLFRAVVLEVPFLDATNTLLYPILPLIAADYEEFGYPG-DIDDFHAIRNYSPYDNIQKDV 720

Query: 721 AYPAVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARV 780
                         L+P + +T +              F+     + FGVWEAAKW+ARV
Sbjct: 721 --------------LYPAVLVTSS--------------FN-----TRFGVWEAAKWVARV 780

Query: 781 RDYSIYDPKRPVILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           R+ +IYDPKRP++LN+TTDIVEENRYL CKESALETAFL+K + S
Sbjct: 781 RESTIYDPKRPILLNLTTDIVEENRYLQCKESALETAFLIKMMES 787

BLAST of Cp4.1LG07g10850.1 vs. TrEMBL
Match: A0A061DG58_THECC (Prolyl oligopeptidase family protein isoform 1 OS=Theobroma cacao GN=TCM_000521 PE=4 SV=1)

HSP 1 Score: 1164.1 bits (3010), Expect = 0.0e+00
Identity = 566/807 (70.14%), Positives = 660/807 (81.78%), Query Frame = 1

Query: 19  LRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDITWEDPYSWMSRLNDKVAMRHMDVY 78
           L R   YK PKTP PPSPP PPK P+KPQ+FT HD+TWEDPYSWMS L DKVAMRHMD+Y
Sbjct: 25  LYRSASYKHPKTPTPPSPPKPPKAPQKPQTFTFHDVTWEDPYSWMSSLQDKVAMRHMDMY 84

Query: 79  MEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLGPWLYYRRVEEGKQYQVLC 138
           MEQEEKY EAVM+DTERLQ+KLQSEMASRL FDLSTP LR GPWLYYRRVEEGKQY VLC
Sbjct: 85  MEQEEKYTEAVMSDTERLQTKLQSEMASRLDFDLSTPPLRWGPWLYYRRVEEGKQYPVLC 144

Query: 139 RRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERFGGYAYEELSEVSPDHRFL 198
           RRLASL+++FIS+KSPSAGFD+ SGK+IEQKLLDYNQEAERFGGYAYEELSE+SPDH+FL
Sbjct: 145 RRLASLNDEFISHKSPSAGFDFTSGKRIEQKLLDYNQEAERFGGYAYEELSEISPDHKFL 204

Query: 199 AYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGGQALLYVVTDQNKRP---- 258
           AYTMYDKDNDYF+LSV+NL+SG+LCSKP  +RVSNLAW K GQALLYV+TD+N+RP    
Sbjct: 205 AYTMYDKDNDYFKLSVRNLNSGALCSKPNANRVSNLAWIKDGQALLYVITDENRRPHRIY 264

Query: 259 CSMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPTSSKVFLIDAANPLSGMEL 318
           CSMIGST+ED LLLEE D+ V+V IRHTKDF+FVTVN F+PTSSKVFLI+AA+P SGM L
Sbjct: 265 CSMIGSTEEDVLLLEEQDETVYVNIRHTKDFHFVTVNTFSPTSSKVFLINAADPFSGMTL 324

Query: 319 IWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPLSVESTSRTWENVFVDDPD 378
           +WE EG+ HCI+EHH G LYLFT+A K    VDSHYLL SP+   S  R WE+VF+DD D
Sbjct: 325 VWESEGIVHCILEHHQGYLYLFTDAAKDGHVVDSHYLLCSPVDCPSNPRIWESVFIDDQD 384

Query: 379 LVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINLKELEPHFLPLPKHVSQIS 438
           L+I DVDFS++ LVLI REG+   +C+V LPL +G K ++ L+EL PHFLPLPK+V +IS
Sbjct: 385 LIIEDVDFSNSRLVLITREGRNFGICSVALPL-LGRKQAVYLRELNPHFLPLPKNVCKIS 444

Query: 439 SGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQSILHERTRILYGTTSSAEA 498
            GPNYDFYS+TMRFTISSPVMPDAVVDY+LS+GKWNI+QQQ+ILHERTRILYGT  S+  
Sbjct: 445 PGPNYDFYSTTMRFTISSPVMPDAVVDYDLSNGKWNIVQQQNILHERTRILYGTALSSAI 504

Query: 499 SGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNVSSHDEVLIPLTVVYSYKSKRENE 558
           + K +N    ST +   +DD +WN LSEFYACEH++VSS+D  ++PLT+VYS K++++ +
Sbjct: 505 AEKSTNVKNSSTNDVKSEDDNLWNDLSEFYACEHYDVSSYDGTVVPLTIVYSCKNRKDKQ 564

Query: 559 NPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGGGGGKKWHHDGRRTKKFNS 618
           +PGLLH HGAFGE LDK+WRSELKSLLDRGW++AYADVRGGGGGGKKWHHDGR TKK NS
Sbjct: 565 SPGLLHGHGAFGEILDKQWRSELKSLLDRGWIVAYADVRGGGGGGKKWHHDGRGTKKQNS 624

Query: 619 VQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCPELFRSAILKVPFLDPINT 678
           ++DYISCAK+LVE++IV E KLA WGYSAGGLLVASAIN  PELFR+A+LKVPFLD  NT
Sbjct: 625 IRDYISCAKYLVEKEIVQENKLAAWGYSAGGLLVASAINCSPELFRAAVLKVPFLDATNT 684

Query: 679 LLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDVAYPAVPFLDPINTLLHPI 738
           LL+PI+PLT  DYEEFGYP  DIDDFHA+R++SPYDNIQKDV                  
Sbjct: 685 LLYPILPLTAVDYEEFGYPG-DIDDFHAIRKFSPYDNIQKDVL----------------- 744

Query: 739 IPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARVRDYSIYDPKRPVILNITT 798
                       YP   +      R    FGVWEAAKW+ARVR+ +IYDPK P++LN+ T
Sbjct: 745 ------------YPSVLVSSSFNTR----FGVWEAAKWVARVREQTIYDPKHPILLNLMT 796

Query: 799 DIVEENRYLHCKESALETAFLLKFIGS 822
           DIVEENRYL CKESALETAFLLK + S
Sbjct: 805 DIVEENRYLQCKESALETAFLLKAMES 796

BLAST of Cp4.1LG07g10850.1 vs. TrEMBL
Match: W9R2Z1_9ROSA (Protease 2 OS=Morus notabilis GN=L484_020287 PE=4 SV=1)

HSP 1 Score: 1162.5 bits (3006), Expect = 0.0e+00
Identity = 572/813 (70.36%), Positives = 665/813 (81.80%), Query Frame = 1

Query: 19  LRRCLHYKAP----KTPQPPSPPGPPKPPKKPQSFTMHDITWEDPYSWMSRLNDKVAMRH 78
           L R  HYKAP    K P PP+PP PPKPP+KPQSF+ HD TWEDPYSWMS LNDKVAMRH
Sbjct: 22  LVRRAHYKAPQKAAKPPSPPTPPKPPKPPQKPQSFSFHDQTWEDPYSWMSSLNDKVAMRH 81

Query: 79  MDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLGPWLYYRRVEEGKQY 138
           MD+YMEQEEKY EAVMADTERLQSKLQSEMA RLA+DLSTP LR GPWLYYRR EEGKQY
Sbjct: 82  MDIYMEQEEKYAEAVMADTERLQSKLQSEMAFRLAYDLSTPPLRWGPWLYYRRAEEGKQY 141

Query: 139 QVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERFGGYAYEELSEVSPD 198
            VLCRRLASL+E+FIS+KSPSAGFD+ SGK+IEQKL+DYNQEAERFGGYAYEELSEVSPD
Sbjct: 142 PVLCRRLASLNEEFISHKSPSAGFDFASGKRIEQKLIDYNQEAERFGGYAYEELSEVSPD 201

Query: 199 HRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGGQALLYVVTDQNKRP 258
           HRFLAYTMYDKDND+FRLSV+NL+SG+LC KPQ D +SNLAWAK GQALLYVVTDQ KRP
Sbjct: 202 HRFLAYTMYDKDNDFFRLSVRNLNSGALCGKPQADCISNLAWAKDGQALLYVVTDQKKRP 261

Query: 259 C-----SMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPTSSKVFLIDAANPL 318
                 SMIGSTD+D LLLEE D++V+V IRHTKDF FVTVN F+PTSSKVFLI+AA+PL
Sbjct: 262 YRWIYYSMIGSTDDDVLLLEELDENVYVNIRHTKDFRFVTVNTFSPTSSKVFLINAADPL 321

Query: 319 SGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPLSVESTSRTWENVF 378
           SG+ LIWEC+G+AHCI+EHH G LYLFT+A K  + VD HYLLRSP+   +  R WENVF
Sbjct: 322 SGLNLIWECDGVAHCIVEHHQGFLYLFTDAAKAGQPVDFHYLLRSPVDTSTGPRIWENVF 381

Query: 379 VDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINLKELEPHFLPLPKH 438
           +DDP LV+ DVDF +THL+LILREG++ RL +V LPLP G +G ++LKEL PH+LPLPK+
Sbjct: 382 IDDPHLVVEDVDFCNTHLLLILREGRQFRLGSVTLPLPAG-RGPVSLKELHPHYLPLPKY 441

Query: 439 VSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQSILHERTRILYGTT 498
           VSQIS G  YD++SSTMRFTISSPVMPDA+VDY+LS+GKWNI+QQQ+ILHERT++LYGT+
Sbjct: 442 VSQISPGMIYDYFSSTMRFTISSPVMPDAIVDYDLSNGKWNIVQQQNILHERTKVLYGTS 501

Query: 499 SSAEASGKISNESEI-STGEANFDDDQMWNTLSEFYACEHFNVSSHDEVLIPLTVVYSYK 558
           S +  S    N   + +T E   DD  +WN LSEFYACEH NVSS+D V +PLT++YS K
Sbjct: 502 SLSSISKHTLNSKTVDTTDEVRSDDANLWNDLSEFYACEHRNVSSYDGVEVPLTIIYSRK 561

Query: 559 SKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGGGGGKKWHHDGRR 618
           +++E + PGLLH HGA+GE LDKRWRSELKSLLDRGW++AYADVRGGGGGGKKWH+DGRR
Sbjct: 562 NEKEGQYPGLLHGHGAYGELLDKRWRSELKSLLDRGWIVAYADVRGGGGGGKKWHYDGRR 621

Query: 619 TKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCPELFRSAILKVPF 678
           TKK NS++DYISCAK+L+ER+IV++ KLAGWGYSAGGLLVASAIN CP+LFR+A   VPF
Sbjct: 622 TKKINSIKDYISCAKYLIEREIVHQNKLAGWGYSAGGLLVASAINSCPDLFRAA---VPF 681

Query: 679 LDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDVAYPAVPFLDPIN 738
           LD  NTLL+P++P+T ADYEEFGYP  DI+DFHA+R YSPYDNIQKDV            
Sbjct: 682 LDATNTLLYPVLPVTAADYEEFGYPW-DINDFHAIREYSPYDNIQKDVP----------- 741

Query: 739 TLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARVRDYSIYDPKRPV 798
                             YP   I      R    FG+WEAAKW+ARVR+++IYDPKRPV
Sbjct: 742 ------------------YPALLISSSFNTR----FGIWEAAKWVARVREHTIYDPKRPV 796

Query: 799 ILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           +LN+TTDIVEENRYL CKESALE AFL+K + S
Sbjct: 802 LLNLTTDIVEENRYLQCKESALEAAFLMKVMES 796

BLAST of Cp4.1LG07g10850.1 vs. TrEMBL
Match: A0A0D2S7A8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G052400 PE=4 SV=1)

HSP 1 Score: 1161.7 bits (3004), Expect = 0.0e+00
Identity = 565/830 (68.07%), Positives = 671/830 (80.84%), Query Frame = 1

Query: 1   MNQFRAALRHCRSNL-----HGALRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDIT 60
           +   R  +RH RS +     H    R   Y  PKT  PPSPP PPK P+KPQ+FT HD+T
Sbjct: 2   LRHLRTTVRH-RSTIILWHHHNHYHRSAKYNPPKTANPPSPPKPPKAPQKPQTFTFHDVT 61

Query: 61  WEDPYSWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTP 120
           WEDPYSWMS L DKVAMRHMD+YMEQEEKY EAVM+DTERLQ+KLQSEMASRL FDLSTP
Sbjct: 62  WEDPYSWMSSLQDKVAMRHMDMYMEQEEKYTEAVMSDTERLQTKLQSEMASRLNFDLSTP 121

Query: 121 SLRLGPWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQ 180
            LR GPWLYYRRVEEGKQY VLCRRLASL+E+FIS KSPS+GFD+ SGK+IEQKLLDYNQ
Sbjct: 122 PLRWGPWLYYRRVEEGKQYPVLCRRLASLNEEFISLKSPSSGFDFTSGKRIEQKLLDYNQ 181

Query: 181 EAERFGGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLA 240
           EAERFGGYAYEELSE+SPDH+FLAYTMYDKDNDYF+LSV+NL+SG+LCSKP  +RVSNLA
Sbjct: 182 EAERFGGYAYEELSEISPDHKFLAYTMYDKDNDYFKLSVRNLNSGALCSKPHANRVSNLA 241

Query: 241 WAKGGQALLYVVTDQNKRP----CSMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVN 300
           W K GQALLYVVTD+NKRP    CSMIGSTDED LLLEE D++V+V IRHTKDF+FVT N
Sbjct: 242 WVKDGQALLYVVTDENKRPYRIYCSMIGSTDEDVLLLEEQDENVYVNIRHTKDFHFVTAN 301

Query: 301 RFTPTSSKVFLIDAANPLSGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYL 360
            F+PT SKVFLI+AA+P SGM L+WE EG+ HC++EHH G LYLFT+A K  + VDSHYL
Sbjct: 302 TFSPTFSKVFLINAADPFSGMNLVWESEGIVHCVLEHHQGYLYLFTDAPKDGQIVDSHYL 361

Query: 361 LRSPLSVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGK 420
           LRSP+   S  R WENVF+ D +LVI D DF ++HLVL+ REG+K  +C+V LPLP G K
Sbjct: 362 LRSPVDSSSNPRIWENVFIGDQNLVIEDGDFCNSHLVLLTREGRKYGICSVALPLP-GWK 421

Query: 421 GSINLKELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNI 480
            +++L+EL+PHFLPLPKHV  IS GPNYD+YS TMRFTIS+PVMPDAVVDY+LS+GKWNI
Sbjct: 422 QAVHLRELQPHFLPLPKHVCNISPGPNYDYYSKTMRFTISAPVMPDAVVDYDLSNGKWNI 481

Query: 481 IQQQSILHERTRILYGTTSSAEASGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNV 540
           +QQQ++LHERTRILYGT  S+  + K +N    S  +   +D  +WN LSEFYACEH  V
Sbjct: 482 VQQQNMLHERTRILYGTALSSAIAEKTTNVKFSSMNDVKSEDRNLWNDLSEFYACEHHYV 541

Query: 541 SSHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYAD 600
           SS+D  ++PLT+VYS K+++++++PGLLH HGA+GE LDKRWRSELKSLLDRGW++AYAD
Sbjct: 542 SSYDGAMVPLTIVYSRKNRKDSQSPGLLHGHGAYGEILDKRWRSELKSLLDRGWIVAYAD 601

Query: 601 VRGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASA 660
           VRGGGGGGKKWHHDGRRTKK NS++DYISCAK+LVE++IV E KLAGWGYS GGLLVASA
Sbjct: 602 VRGGGGGGKKWHHDGRRTKKQNSIKDYISCAKYLVEKEIVQENKLAGWGYSVGGLLVASA 661

Query: 661 INQCPELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDN 720
           IN CP+LFR+A+LKVPFLD  NTLL+PI+PLT ADYEEFGYP  DID+FHA+R++SPYDN
Sbjct: 662 INCCPDLFRAAVLKVPFLDATNTLLYPILPLTAADYEEFGYPG-DIDEFHAIRKFSPYDN 721

Query: 721 IQKDVAYPAVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAK 780
           IQK+               L+P + ++ +              F+     + FGVWEAAK
Sbjct: 722 IQKNA--------------LYPAVLVSTS--------------FN-----TRFGVWEAAK 781

Query: 781 WIARVRDYSIYDPKRPVILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           W+ARVR+ +IYDPK P++LN+T D+VEENRYL CKESALETAFLLK +GS
Sbjct: 782 WVARVREQTIYDPKHPILLNLTIDVVEENRYLQCKESALETAFLLKTVGS 795

BLAST of Cp4.1LG07g10850.1 vs. TAIR10
Match: AT5G66960.1 (AT5G66960.1 Prolyl oligopeptidase family protein)

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 540/817 (66.10%), Positives = 641/817 (78.46%), Query Frame = 1

Query: 12  RSNLHGALRRCLHYKAPKTPQPPSPPGP-PKPPKKPQSFTMHDITWEDPYSWMSRLNDKV 71
           R N     ++C  YK PK+P PP PP   PKPPKKPQSFT HD TWEDPYSWMS+L DKV
Sbjct: 11  RHNCRFRRQQCRCYKPPKSPPPPPPPPALPKPPKKPQSFTFHDATWEDPYSWMSKLEDKV 70

Query: 72  AMRHMDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLGPWLYYRRVEE 131
           AMRHMD+YMEQEEKY EAV+ADT+R+Q+KLQSEMASRL+F+LSTP LR GPWLYYRRVEE
Sbjct: 71  AMRHMDIYMEQEEKYTEAVLADTDRIQTKLQSEMASRLSFELSTPPLRWGPWLYYRRVEE 130

Query: 132 GKQYQVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERFGGYAYEELSE 191
           GKQY VLCRRLASLHE+FIS+KSP+AGFDY SGK+IEQKLLDYNQEAERFGGYAYEE+SE
Sbjct: 131 GKQYPVLCRRLASLHEEFISHKSPAAGFDYTSGKRIEQKLLDYNQEAERFGGYAYEEMSE 190

Query: 192 VSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGGQALLYVVTDQ 251
           +SPDH+FLAYTMYDKDNDYF+L V+NL+SG+LCSKP  DRVSN+AWAK GQALLYVVTDQ
Sbjct: 191 ISPDHKFLAYTMYDKDNDYFKLCVRNLNSGALCSKPHADRVSNIAWAKNGQALLYVVTDQ 250

Query: 252 NKRPC----SMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPTSSKVFLIDAA 311
            KRPC    S IGSTDED LL EE + +VHV IRHTKDF+FVTVN F+ T SKVFLI+AA
Sbjct: 251 KKRPCRIYCSTIGSTDEDVLLHEEFEGNVHVNIRHTKDFHFVTVNTFSTTFSKVFLINAA 310

Query: 312 NPLSGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPLSVESTSRTWE 371
           +P SG+ L+WE    AHCI+EHH G LYLFTNA+     VD HYLLRSP+   S  R WE
Sbjct: 311 DPFSGLALVWEHNAPAHCIIEHHQGFLYLFTNASNDGGTVDHHYLLRSPVHFSSCQRIWE 370

Query: 372 NVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINLKELEPHFLPL 431
            VF++DP+L+I DVDF   HL LI++E Q  ++C V LPL    +  ++L++++P +LPL
Sbjct: 371 TVFINDPELIIEDVDFCKKHLSLIVKEMQSFKICVVDLPLKTK-RVPVHLRDIKPRYLPL 430

Query: 432 PKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQSILHERTRILY 491
           PKHVSQI  G NYDF S TMRFTISS VMPDAVVDY+L +GKWNI+QQQ++LHERTR+LY
Sbjct: 431 PKHVSQIFPGTNYDFNSPTMRFTISSLVMPDAVVDYDLLNGKWNIVQQQNMLHERTRVLY 490

Query: 492 GTTSSAEASGKISNESEIS--TGEANFDDDQMWNTLSEFYACEHFNVSSHDEVLIPLTVV 551
           GT +S E+    S    +S  T +   ++D +WN L+EFYAC++  VSSHD  ++PL++V
Sbjct: 491 GTANSTESPNIPSGTRTVSFDTEDTTAENDNLWNDLTEFYACDYHEVSSHDGAMVPLSIV 550

Query: 552 YSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGGGGGKKWHH 611
           YS   K EN+ PGLLHVHGA+GE LDKRWRSELKSLLDRGWV+AYADVRGGGG GKKWH 
Sbjct: 551 YSRAQKEENQKPGLLHVHGAYGEMLDKRWRSELKSLLDRGWVLAYADVRGGGGKGKKWHQ 610

Query: 612 DGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCPELFRSAIL 671
           DGR  KK NS++DYI CAK+LVE  IV E KLAGWGYSAGGL+VASAIN CP+LF++A+L
Sbjct: 611 DGRGAKKLNSIKDYIQCAKYLVENNIVEENKLAGWGYSAGGLVVASAINHCPDLFQAAVL 670

Query: 672 KVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDVAYPAVPFL 731
           KVPFLDP +TL++PI+PLT  DYEEFGYP  DI+DFHA+R YSPYDNI KDV        
Sbjct: 671 KVPFLDPTHTLIYPILPLTAEDYEEFGYPG-DINDFHAIREYSPYDNIPKDV-------- 730

Query: 732 DPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARVRDYSIYDP 791
                 L+P + +T +              F+     + FGVWEAAKW+ARVRD +  DP
Sbjct: 731 ------LYPAVLVTSS--------------FN-----TRFGVWEAAKWVARVRDNTFNDP 790

Query: 792 KRPVILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           +RPV+LN+TTDIVEENR+L  KESALE AFL+K + S
Sbjct: 791 ERPVLLNLTTDIVEENRFLQTKESALEIAFLIKMMES 792

BLAST of Cp4.1LG07g10850.1 vs. TAIR10
Match: AT1G69020.1 (AT1G69020.1 Prolyl oligopeptidase family protein)

HSP 1 Score: 491.5 bits (1264), Expect = 1.0e-138
Identity = 299/797 (37.52%), Positives = 440/797 (55.21%), Query Frame = 1

Query: 39  PPKPPKKPQSFTMHDITWEDPYSWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQS 98
           PP P K P + + H IT +DP+ WM   +D   +     ++++E  Y +A MADTE L+ 
Sbjct: 40  PPVPKKIPFAISSHGITRQDPFHWMKNTDDTDFVD----FLKRENSYSQAFMADTETLRR 99

Query: 99  KLQSEMASRLAFDLSTPSLRLGPWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGF 158
            L SEM +R+  ++ TP  R G WLY + + +GK+Y +LCRRL      ++S        
Sbjct: 100 DLFSEMKTRIPEEIFTPPERWGQWLYRQYIPKGKEYPLLCRRLEKGKTNWLSG------- 159

Query: 159 DYVSGKKIEQKLLDYNQEAERFGGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLS 218
               G++ E+ +LD+NQ AE+F GY +  +  VSPDH +LAYT+ D + D          
Sbjct: 160 -LFRGEE-EEVVLDWNQIAEQF-GYVHVGVCRVSPDHNYLAYTV-DPEGD---------- 219

Query: 219 SGSLCSKPQVDRVSNLAWAKGGQALLYVVTDQNKRPCSMIGSTDE-----DTLLLEEPDD 278
                                G  L Y VTD+N+RP  ++ +  E     D ++  E D 
Sbjct: 220 ---------------------GITLFYTVTDENQRPHRVVVTNVESDGRDDAVVFTERDS 279

Query: 279 DVHVYIRHTKDFNFVTVNRFTPTSSKVFLIDAANPLSGMELIWECEGLAHCIMEHHLGVL 338
              V I  TKD  FVT+N  + TSS+V++++A  P++G++   E      C +EHH G  
Sbjct: 280 SFCVDITTTKDGKFVTINSNSRTSSEVYIVNADKPMAGLQRTRERVPGVQCFLEHHNGFF 339

Query: 339 YLFTN--ANKGHEAVDSHYLLRSPLSVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLIL 398
           Y+ TN  +N   E     Y L   L  E  +  W+ VF  D D+VI D+D  + +LVL L
Sbjct: 340 YILTNSPSNAISEWSGEGYYLTRCLVEEIEASDWQTVFRPDDDVVIQDMDMFNDYLVLYL 399

Query: 399 REGQKLRLCAVRLPLPVGGKGSINLKELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTIS 458
            +     LC++ +P+    K   ++ +L P + PLP     ++ G N+DF SS  R  +S
Sbjct: 400 NKKGLPMLCSIDMPIKANTK---HMDDLVPWYFPLPVDSCSVAPGSNHDFQSSIYRVVLS 459

Query: 459 SPVMPDAVVDYNLSDGKWNIIQQQSIL---HERTRILYGTTSSAEASGKISNESEISTGE 518
           SPV+PD +VDY++S   ++I+QQ+  +    + ++  Y    S E +G++++ +  S GE
Sbjct: 460 SPVIPDTIVDYDVSRRLFSIVQQEGGVVDNSDSSKPWYTADRSTENNGQLNDRT--SEGE 519

Query: 519 ANFDDDQM--WNTLSEFYACEHFNVSSHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFG 578
               D +M  W  LS+ Y CE   VSSHD V +PLT++YS ++ +++E+PG+L  +GA+G
Sbjct: 520 DGQLDSRMPKWEDLSDTYVCERQEVSSHDGVEVPLTILYSREAWKKSESPGMLIGYGAYG 579

Query: 579 EPLDKRWRSELKSLLDRGWVIAYADVRGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLV 638
           E LDK W +   S+LDRGWVIA+ADVRGGG G   WH  G R+ K NS+QD+I  AK+LV
Sbjct: 580 EVLDKSWCTNRLSMLDRGWVIAFADVRGGGSGEFSWHKSGTRSLKQNSIQDFIYSAKYLV 639

Query: 639 ERKIVNEEKLAGWGYSAGGLLVASAINQCPELFRSAILKVPFLDPINTLLHPIIPLTPAD 698
           E+  V+   LA  GYSAG +L A+A+N  P LF++ ILKVPF+D +NTL  P +PLT  D
Sbjct: 640 EKGYVHRHHLAAVGYSAGAILPAAAMNMHPSLFQAVILKVPFVDVLNTLSDPNLPLTLLD 699

Query: 699 YEEFGYPEEDIDDFHAVRRYSPYDNIQKDVAYPAVPFLDPINTLLHPIIPLTPADYEEFG 758
           +EEFG P+    DF ++  YSPYD I+KDV YP++     + T  H              
Sbjct: 700 HEEFGNPDNQ-TDFGSILSYSPYDKIRKDVCYPSM----LVTTSFHD------------- 752

Query: 759 YPEEDIDDFHAVRRYSPFGVWEAAKWIARVRDYSIYDPKRPVILNITTD---IVEENRYL 818
                          S  GVWE AKW+A++RD + +D  R VIL    +     E  RY 
Sbjct: 760 ---------------SRVGVWEGAKWVAKIRDSTCHDCSRAVILKTNMNGGHFGEGGRYA 752

Query: 819 HCKESALETAFLLKFIG 821
            C+E+A + AFLLK +G
Sbjct: 820 QCEETAFDYAFLLKVMG 752

BLAST of Cp4.1LG07g10850.1 vs. TAIR10
Match: AT1G50380.1 (AT1G50380.1 Prolyl oligopeptidase family protein)

HSP 1 Score: 227.3 bits (578), Expect = 3.5e-59
Identity = 186/688 (27.03%), Positives = 326/688 (47.38%), Query Frame = 1

Query: 39  PPKPPKKPQSFTMHDITWEDPYSWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQS 98
           PP   K      M      D Y W+   +D      M  Y+ +E  Y + VM+ T++ ++
Sbjct: 7   PPVAKKVEHVMEMFGDVRVDNYYWLR--DDSRTNPDMLSYLREENHYTDFVMSGTKQFEN 66

Query: 99  KLQSEMASRLAFDLSTPSLRLGPWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGF 158
           +L +E+  R+  D  +  LR GP+ YY +  +GK+Y   CRRL +       NK+  + +
Sbjct: 67  QLFAEIRGRIKEDDISAPLRKGPYYYYEKNLQGKEYIQHCRRLIT------DNKAEPSVY 126

Query: 159 DYV---SGKKIEQKLLDYNQEAERFGGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVK 218
           D +        E  +LD N +A+    Y      + SPDH+ +AY    K ++ + ++V 
Sbjct: 127 DTMPTGPDAPPEHVILDENTKAQEHDYYRIGAF-KASPDHKLVAYAEDTKGDEIYTVNVI 186

Query: 219 NLSSGSLCSKPQVDRVSNLAWAKGGQALLYVVTDQNKRPCSM----IGS-TDEDTLLLEE 278
           +  +     +      S L WA G  ALLY+  D+  RP  +    +G+    D  L  E
Sbjct: 187 DSEALKPVGQQLKGLTSYLEWA-GNDALLYITMDEILRPDKVWLHKLGTEQSSDVCLYHE 246

Query: 279 PDDDVHVYIRHTKDFNFVTVNRFTPTSSKVFLIDAANPLSGMELIWECEGLAHCIMEHHL 338
            DD   + +  ++   ++ V   + T+  VF +D +    G+ ++          + H  
Sbjct: 247 KDDMFSLELHASESHKYLFVASESKTTRFVFSLDVSKTQDGLRVLTPRVDGIDSSVSHRG 306

Query: 339 GVLYLFTNANKGHEAVDSHYLLRSPLSVESTSRTWENVFVDDPDLV-IVDVDFSHTHLVL 398
              ++   + + + +     L+  P  V+ TS+T   V +   + V I ++     HL +
Sbjct: 307 NHFFIQRRSTEFYNS----ELIACP--VDDTSKT--TVLLPHRESVKIQEIQLFRDHLAV 366

Query: 399 ILREGQKLRLCAVRLPL---PVGG-KGSINLKELEPHFLPLPKHVSQISSGPNYDFYSST 458
             RE    ++   RLP    P+ G +G  N+  ++P        V  + S  + +F S  
Sbjct: 367 FERENGLQKITVHRLPAEGQPLEGLQGGRNVSFVDP--------VYSVDSTES-EFSSRV 426

Query: 459 MRFTISSPVMPDAVVDYNLSDGKWNIIQQQSILHERTRILYGTTSSAEASGKISNESEIS 518
           +RF   S   P +V DY++  G        S++ +   +L G  +S              
Sbjct: 427 LRFKYCSMKTPPSVYDYDMDSG-------TSVVKKIDTVLGGFDAS-------------- 486

Query: 519 TGEANFDDDQMWNTLSEFYACEHFNVSSHDEVLIPLTVVYSYK-SKRENENPGLLHVHGA 578
               N+  ++ W   S             D   IP+++VY+ K +K +  +P LL+ +G+
Sbjct: 487 ----NYVTERKWVAAS-------------DGTQIPMSIVYNKKLAKLDGSDPLLLYGYGS 546

Query: 579 FGEPLDKRWRSELKSLLDRGWVIAYADVRGGGGGGKKWHHDGRRTKKFNSVQDYISCAKF 638
           +   +D  +++   SLLDRG+    A VRGGG  G++W+ +G+  KK N+  D+I+CA+ 
Sbjct: 547 YEISVDPYFKASRLSLLDRGFTFVIAHVRGGGEMGRQWYENGKLLKKKNTFTDFIACAER 606

Query: 639 LVERKIVNEEKLAGWGYSAGGLLVASAINQCPELFRSAILKVPFLDPINTLLHPIIPLTP 698
           L+E K  ++EKL   G SAGGLL+ + +N  P+LF+  I  VPF+D + T+L P IPLT 
Sbjct: 607 LIELKYCSKEKLCMEGRSAGGLLMGAVVNMRPDLFKVVIAGVPFVDVLTTMLDPTIPLTT 628

Query: 699 ADYEEFGYPEEDIDDFHAVRRYSPYDNI 713
           +++EE+G P ++ + +  ++ YSP DN+
Sbjct: 667 SEWEEWGDPRKE-EFYFYMKSYSPVDNV 628

BLAST of Cp4.1LG07g10850.1 vs. TAIR10
Match: AT1G76140.1 (AT1G76140.1 Prolyl oligopeptidase family protein)

HSP 1 Score: 94.7 bits (234), Expect = 2.7e-19
Identity = 61/184 (33.15%), Positives = 95/184 (51.63%), Query Frame = 1

Query: 533 SHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDR--GWVIAYA 592
           S D   IP+ +V     K +  +P LL+ +G F   +   + S  + +L +  G V  +A
Sbjct: 522 SKDGTKIPMFIVAKKDIKLDGSHPCLLYAYGGFNISITPSF-SASRIVLSKHLGVVFCFA 581

Query: 593 DVRGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVAS 652
           ++RGGG  G++WH  G   KK N   D+IS A++LV        KL   G S GGLLV +
Sbjct: 582 NIRGGGEYGEEWHKAGSLAKKQNCFDDFISGAEYLVSAGYTQPSKLCIEGGSNGGLLVGA 641

Query: 653 AINQCPELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYD 712
            INQ P+L+  A+  V  +D +      I     +DY   G  E + ++FH + +YSP  
Sbjct: 642 CINQRPDLYGCALAHVGVMDMLRFHKFTIGHAWTSDY---GCSENE-EEFHWLIKYSPLH 700

Query: 713 NIQK 715
           N+++
Sbjct: 702 NVKR 700

BLAST of Cp4.1LG07g10850.1 vs. TAIR10
Match: AT1G20380.1 (AT1G20380.1 Prolyl oligopeptidase family protein)

HSP 1 Score: 94.0 bits (232), Expect = 4.7e-19
Identity = 61/184 (33.15%), Positives = 96/184 (52.17%), Query Frame = 1

Query: 533 SHDEVLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDR--GWVIAYA 592
           S D   IP+ +V     K +  +P LL+ +G F   +   + S  + +L R  G V  +A
Sbjct: 458 SKDGTDIPMFIVARKDIKLDGSHPCLLYAYGGFSISMTPFF-SATRIVLGRHLGTVFCFA 517

Query: 593 DVRGGGGGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVAS 652
           ++RGGG  G++WH  G    K N   D+IS A++LV        KL   G S GG+LV +
Sbjct: 518 NIRGGGEYGEEWHKSGALANKQNCFDDFISGAEYLVSAGYTQPRKLCIEGGSNGGILVGA 577

Query: 653 AINQCPELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYD 712
            INQ P+LF  A+  V  +D +    H    +  A   EFG  +++ ++FH + +YSP  
Sbjct: 578 CINQRPDLFGCALAHVGVMDMLR--FHK-FTIGHAWTSEFGCSDKE-EEFHWLIKYSPLH 636

Query: 713 NIQK 715
           N+++
Sbjct: 638 NVKR 636

BLAST of Cp4.1LG07g10850.1 vs. NCBI nr
Match: gi|659070747|ref|XP_008456457.1| (PREDICTED: prolyl endopeptidase-like [Cucumis melo])

HSP 1 Score: 1394.0 bits (3607), Expect = 0.0e+00
Identity = 691/825 (83.76%), Positives = 734/825 (88.97%), Query Frame = 1

Query: 1   MNQFRAALRHCRSNLHGALRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDITWEDPY 60
           MN+ RAALRH R+N+H ALRRCLHYK PKTP PPSPP PPKPPKKPQSFTMH ITWEDPY
Sbjct: 1   MNRLRAALRHRRTNIHFALRRCLHYKVPKTPAPPSPPAPPKPPKKPQSFTMHGITWEDPY 60

Query: 61  SWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLG 120
           SWMS LNDKVAMRHMDVYMEQEEKY EAVM  TERLQSKLQSEMASRLAF+LSTP LR G
Sbjct: 61  SWMSSLNDKVAMRHMDVYMEQEEKYTEAVMGGTERLQSKLQSEMASRLAFELSTPPLRWG 120

Query: 121 PWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERF 180
           PWLYYRRVEE KQY VLCRRLASLH++FISNKSPSAGFDYVSG+KIEQKL+DYNQEAERF
Sbjct: 121 PWLYYRRVEEEKQYPVLCRRLASLHDEFISNKSPSAGFDYVSGQKIEQKLIDYNQEAERF 180

Query: 181 GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG 240
           GGYAYEELSEVSPDHRF+AYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG
Sbjct: 181 GGYAYEELSEVSPDHRFIAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG 240

Query: 241 QALLYVVTDQNKRPC----SMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPT 300
           Q+LLYVVTDQNKRPC    SMIGS DEDTLLLEE DDDVHVYIRHTKDF FVTVNRF+PT
Sbjct: 241 QSLLYVVTDQNKRPCRLYCSMIGSIDEDTLLLEEQDDDVHVYIRHTKDFRFVTVNRFSPT 300

Query: 301 SSKVFLIDAANPLSGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPL 360
           SSKVFLIDAA+PLSGMELIWECE L HCI+EHHLG LYLFT+A+KGHE VDSHYLLRSPL
Sbjct: 301 SSKVFLIDAADPLSGMELIWECEKLTHCIVEHHLGDLYLFTDASKGHEPVDSHYLLRSPL 360

Query: 361 SVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINL 420
            V+STSRTWE+VFVDDPDLVIVDVDFSHTHLVLILREG+K RLCAVRLPLPVG KG INL
Sbjct: 361 KVDSTSRTWEHVFVDDPDLVIVDVDFSHTHLVLILREGRKFRLCAVRLPLPVGRKGPINL 420

Query: 421 KELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS 480
           KELE  +LPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS
Sbjct: 421 KELELQYLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS 480

Query: 481 ILHERTRILYGTTSSAEASGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNVSSHDE 540
           ILHERTRILYGTT SA  S +ISN  E S GEANFDD+QMWN+LSE+YACEH+NVSS D 
Sbjct: 481 ILHERTRILYGTTFSAGGSREISNALENSMGEANFDDEQMWNSLSEYYACEHYNVSSDDG 540

Query: 541 VLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGG 600
           VL+PLTV+YSYK K+ENENPGLLHVHGA+GE LDKRWRSELKSLLDRGWVIAYADVRGGG
Sbjct: 541 VLVPLTVIYSYKCKKENENPGLLHVHGAYGELLDKRWRSELKSLLDRGWVIAYADVRGGG 600

Query: 601 GGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCP 660
           GGGKKWH DGRR KKFNSVQDYISCAKFL ERKIVNEEKLAGWGYSAGGLLVASAINQCP
Sbjct: 601 GGGKKWHQDGRRIKKFNSVQDYISCAKFLAERKIVNEEKLAGWGYSAGGLLVASAINQCP 660

Query: 661 ELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDV 720
           ELFR+A+LKVPFLDPI+TL +PIIPLTPADYEEFGYP  + DDFHA+RRYSPYDNIQKDV
Sbjct: 661 ELFRAAVLKVPFLDPISTLRNPIIPLTPADYEEFGYPGNE-DDFHAIRRYSPYDNIQKDV 720

Query: 721 AYPAVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARV 780
           A              +P + +T +              F+     + FGVWEAAKWIARV
Sbjct: 721 A--------------YPAVLITSS--------------FN-----TRFGVWEAAKWIARV 780

Query: 781 RDYSIYDPKRPVILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           RDYSIYDPKRPVILN+T DIVEENRYLHCKESALETAFL+K + S
Sbjct: 781 RDYSIYDPKRPVILNLTIDIVEENRYLHCKESALETAFLMKAMES 791

BLAST of Cp4.1LG07g10850.1 vs. NCBI nr
Match: gi|449442973|ref|XP_004139255.1| (PREDICTED: prolyl endopeptidase-like [Cucumis sativus])

HSP 1 Score: 1380.2 bits (3571), Expect = 0.0e+00
Identity = 685/825 (83.03%), Positives = 731/825 (88.61%), Query Frame = 1

Query: 1   MNQFRAALRHCRSNLHGALRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDITWEDPY 60
           MN+ RA LRH R++LHG   RCLHYK PKTPQPP+PP PPKPPKKPQSFT+H+ITWEDPY
Sbjct: 1   MNRLRAVLRHRRTHLHGDFGRCLHYKVPKTPQPPAPPAPPKPPKKPQSFTLHEITWEDPY 60

Query: 61  SWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLG 120
           SWMS LNDKVAMRHMDVYMEQEEKY EAVM  TERLQSKLQSEMASRLAF+LSTP LR G
Sbjct: 61  SWMSSLNDKVAMRHMDVYMEQEEKYTEAVMGGTERLQSKLQSEMASRLAFELSTPPLRWG 120

Query: 121 PWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERF 180
           PWLYYRRVEEGKQY VLCRRLASLHE+FISNKSPSAGFDYVSG+KIEQKL+DYNQEAERF
Sbjct: 121 PWLYYRRVEEGKQYPVLCRRLASLHEEFISNKSPSAGFDYVSGQKIEQKLIDYNQEAERF 180

Query: 181 GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG 240
           GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG
Sbjct: 181 GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG 240

Query: 241 QALLYVVTDQNKRPC----SMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPT 300
           Q+LLYVVTDQNKRPC    S IGS DEDTLLLEE DDDVHVYIRHTKDF FVTVNRF+PT
Sbjct: 241 QSLLYVVTDQNKRPCRLYCSTIGSIDEDTLLLEEKDDDVHVYIRHTKDFRFVTVNRFSPT 300

Query: 301 SSKVFLIDAANPLSGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPL 360
           SSKVFLIDAA+PLSGM+LIWECE LAHCI+EHHLG LYLFT+A+KGHE VDSHYLLRSPL
Sbjct: 301 SSKVFLIDAADPLSGMKLIWECEELAHCIVEHHLGDLYLFTDASKGHERVDSHYLLRSPL 360

Query: 361 SVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINL 420
            V+ST RTWE+VFVDDPD VIVDVDF HTHLVLILREG+K  LCAVRLPLPVGGKG I+L
Sbjct: 361 KVDSTLRTWEHVFVDDPDFVIVDVDFCHTHLVLILREGRKFSLCAVRLPLPVGGKGPISL 420

Query: 421 KELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS 480
           KELE  +LPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS
Sbjct: 421 KELELQYLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS 480

Query: 481 ILHERTRILYGTTSSAEASGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNVSSHDE 540
           ILHERTRILYGTTSSA  S +ISN  E S GEANFD+ QMWN+LSE+YACEH+NVSS D 
Sbjct: 481 ILHERTRILYGTTSSAGGSREISNALENSVGEANFDE-QMWNSLSEYYACEHYNVSSDDG 540

Query: 541 VLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGG 600
           VL+PLTVVYSYK K+ENENPGLLHVHGA+GE LDKRWRSELKSLLDRGWVIAYADVRGGG
Sbjct: 541 VLVPLTVVYSYKCKKENENPGLLHVHGAYGELLDKRWRSELKSLLDRGWVIAYADVRGGG 600

Query: 601 GGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCP 660
           GGGKKWH DGRR KKFNSVQDYISCAKFL ER+IVNE+KLAGWGYSAGGLLVASAINQCP
Sbjct: 601 GGGKKWHQDGRRIKKFNSVQDYISCAKFLAERQIVNEDKLAGWGYSAGGLLVASAINQCP 660

Query: 661 ELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDV 720
           ELFR+AILKVPFLDPI+TLL+PIIPLTPADYEEFGYP  + DDFHA+RRYSPYDNIQKD 
Sbjct: 661 ELFRAAILKVPFLDPISTLLNPIIPLTPADYEEFGYPGNE-DDFHAIRRYSPYDNIQKDA 720

Query: 721 AYPAVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARV 780
           A              +P + +T +              F+     + FGVWEAAKWIARV
Sbjct: 721 A--------------YPAVLITSS--------------FN-----TRFGVWEAAKWIARV 780

Query: 781 RDYSIYDPKRPVILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           RDYSIYDPKRPVILN+T DIVEENRYLHCKESALETAFL+K + S
Sbjct: 781 RDYSIYDPKRPVILNLTIDIVEENRYLHCKESALETAFLMKAMES 790

BLAST of Cp4.1LG07g10850.1 vs. NCBI nr
Match: gi|567866363|ref|XP_006425804.1| (hypothetical protein CICLE_v10024926mg [Citrus clementina])

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 580/825 (70.30%), Positives = 677/825 (82.06%), Query Frame = 1

Query: 1   MNQFRAALRHCRSNLHGALRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDITWEDPY 60
           M     A+R C  N HG+L +  HYK PKT +PP+ P PPKPPKKPQ FT HD TWEDPY
Sbjct: 1   MRHLLTAVR-CFYNNHGSLTQVRHYKPPKTSRPPAAPSPPKPPKKPQRFTFHDHTWEDPY 60

Query: 61  SWMSRLNDKVAMRHMDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLG 120
           SWMS LNDKVAMRHMD+Y+EQEEKY EAVM+DTERLQSKLQSEMASRLAF+LSTP LR G
Sbjct: 61  SWMSSLNDKVAMRHMDMYIEQEEKYAEAVMSDTERLQSKLQSEMASRLAFELSTPPLRWG 120

Query: 121 PWLYYRRVEEGKQYQVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERF 180
           PWLYYRRVEEGKQY VLCRRL SL+E+FIS+KSP+AGFD+ SGKKIEQKLLDYNQEAERF
Sbjct: 121 PWLYYRRVEEGKQYLVLCRRLVSLNEEFISHKSPAAGFDFTSGKKIEQKLLDYNQEAERF 180

Query: 181 GGYAYEELSEVSPDHRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGG 240
           GGYAYEELSEVSPDH+FLAYTMYDKDNDYF LSV+NL+SG+LCSKPQ  RVSN+AWAK G
Sbjct: 181 GGYAYEELSEVSPDHKFLAYTMYDKDNDYFTLSVRNLNSGALCSKPQAVRVSNIAWAKDG 240

Query: 241 QALLYVVTDQNKRP----CSMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPT 300
           QAL+YVV+DQNKRP    CS+IGSTDED LLLEE +++V+V IRHTKDF+FV V+ F+ T
Sbjct: 241 QALIYVVSDQNKRPYQIYCSIIGSTDEDALLLEESNENVYVNIRHTKDFHFVCVHTFSTT 300

Query: 301 SSKVFLIDAANPLSGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPL 360
           SSKVFLI+AA+P SG+ L+WECEGLAHCI+EHH G LYLFT+A K  +  D+HYLLR P+
Sbjct: 301 SSKVFLINAADPFSGLTLVWECEGLAHCIVEHHEGFLYLFTDAAKEGQEADNHYLLRCPV 360

Query: 361 SVESTSRTWENVFVDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINL 420
                SRTWE+VF+DD  LV+ DVDF  TH+ LILREG+  RLC+V LPLP G KG ++L
Sbjct: 361 DASFPSRTWESVFIDDQGLVVEDVDFCKTHMALILREGRTYRLCSVSLPLPAG-KGVVHL 420

Query: 421 KELEPHFLPLPKHVSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQS 480
           KEL PHFLPLPK+VSQI+ GPNYD+YSSTMRFTISSPVMPDAVVDY+LS GKWNIIQQQ+
Sbjct: 421 KELHPHFLPLPKYVSQIAPGPNYDYYSSTMRFTISSPVMPDAVVDYDLSYGKWNIIQQQN 480

Query: 481 ILHERTRILYGTTSSAEASGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNVSSHDE 540
           +L ERTRILYGT SSA  S  ++ +S  S  E   D D +WN LSEFY+CE ++V SHD 
Sbjct: 481 MLRERTRILYGTASSATIS--LNAKSGESVNELKSDSDNLWNDLSEFYSCEQYDVPSHDG 540

Query: 541 VLIPLTVVYSYKSKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGG 600
           + +PLT++YS K K+EN+NPGLLH HGA+GE LDKRWRSELKSLLDRGWV+A+ADVRGGG
Sbjct: 541 ISVPLTIIYSPKYKKENQNPGLLHGHGAYGELLDKRWRSELKSLLDRGWVVAFADVRGGG 600

Query: 601 GGGKKWHHDGRRTKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCP 660
           GGGKKWHHDGRRTKK NS++D+ISCA+FL+E++IV E KLAGWGYSAGGLLVA+AIN CP
Sbjct: 601 GGGKKWHHDGRRTKKLNSIKDFISCARFLIEKEIVKEHKLAGWGYSAGGLLVAAAINCCP 660

Query: 661 ELFRSAILKVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDV 720
           +LFR+ +L+VPFLD  NTLL+PI+PL  ADYEEFGYP  DIDDFHA+R YSPYDNIQKDV
Sbjct: 661 DLFRAVVLEVPFLDATNTLLYPILPLIAADYEEFGYPG-DIDDFHAIRNYSPYDNIQKDV 720

Query: 721 AYPAVPFLDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARV 780
                         L+P + +T +              F+     + FGVWEAAKW+ARV
Sbjct: 721 --------------LYPAVLVTSS--------------FN-----TRFGVWEAAKWVARV 780

Query: 781 RDYSIYDPKRPVILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           R+ +IYDPKRP++LN+TTDIVEENRYL CKESALETAFL+K + S
Sbjct: 781 RESTIYDPKRPILLNLTTDIVEENRYLQCKESALETAFLIKMMES 787

BLAST of Cp4.1LG07g10850.1 vs. NCBI nr
Match: gi|590704291|ref|XP_007047118.1| (Prolyl oligopeptidase family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 1164.1 bits (3010), Expect = 0.0e+00
Identity = 566/807 (70.14%), Positives = 660/807 (81.78%), Query Frame = 1

Query: 19  LRRCLHYKAPKTPQPPSPPGPPKPPKKPQSFTMHDITWEDPYSWMSRLNDKVAMRHMDVY 78
           L R   YK PKTP PPSPP PPK P+KPQ+FT HD+TWEDPYSWMS L DKVAMRHMD+Y
Sbjct: 25  LYRSASYKHPKTPTPPSPPKPPKAPQKPQTFTFHDVTWEDPYSWMSSLQDKVAMRHMDMY 84

Query: 79  MEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLGPWLYYRRVEEGKQYQVLC 138
           MEQEEKY EAVM+DTERLQ+KLQSEMASRL FDLSTP LR GPWLYYRRVEEGKQY VLC
Sbjct: 85  MEQEEKYTEAVMSDTERLQTKLQSEMASRLDFDLSTPPLRWGPWLYYRRVEEGKQYPVLC 144

Query: 139 RRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERFGGYAYEELSEVSPDHRFL 198
           RRLASL+++FIS+KSPSAGFD+ SGK+IEQKLLDYNQEAERFGGYAYEELSE+SPDH+FL
Sbjct: 145 RRLASLNDEFISHKSPSAGFDFTSGKRIEQKLLDYNQEAERFGGYAYEELSEISPDHKFL 204

Query: 199 AYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGGQALLYVVTDQNKRP---- 258
           AYTMYDKDNDYF+LSV+NL+SG+LCSKP  +RVSNLAW K GQALLYV+TD+N+RP    
Sbjct: 205 AYTMYDKDNDYFKLSVRNLNSGALCSKPNANRVSNLAWIKDGQALLYVITDENRRPHRIY 264

Query: 259 CSMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPTSSKVFLIDAANPLSGMEL 318
           CSMIGST+ED LLLEE D+ V+V IRHTKDF+FVTVN F+PTSSKVFLI+AA+P SGM L
Sbjct: 265 CSMIGSTEEDVLLLEEQDETVYVNIRHTKDFHFVTVNTFSPTSSKVFLINAADPFSGMTL 324

Query: 319 IWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPLSVESTSRTWENVFVDDPD 378
           +WE EG+ HCI+EHH G LYLFT+A K    VDSHYLL SP+   S  R WE+VF+DD D
Sbjct: 325 VWESEGIVHCILEHHQGYLYLFTDAAKDGHVVDSHYLLCSPVDCPSNPRIWESVFIDDQD 384

Query: 379 LVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINLKELEPHFLPLPKHVSQIS 438
           L+I DVDFS++ LVLI REG+   +C+V LPL +G K ++ L+EL PHFLPLPK+V +IS
Sbjct: 385 LIIEDVDFSNSRLVLITREGRNFGICSVALPL-LGRKQAVYLRELNPHFLPLPKNVCKIS 444

Query: 439 SGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQSILHERTRILYGTTSSAEA 498
            GPNYDFYS+TMRFTISSPVMPDAVVDY+LS+GKWNI+QQQ+ILHERTRILYGT  S+  
Sbjct: 445 PGPNYDFYSTTMRFTISSPVMPDAVVDYDLSNGKWNIVQQQNILHERTRILYGTALSSAI 504

Query: 499 SGKISNESEISTGEANFDDDQMWNTLSEFYACEHFNVSSHDEVLIPLTVVYSYKSKRENE 558
           + K +N    ST +   +DD +WN LSEFYACEH++VSS+D  ++PLT+VYS K++++ +
Sbjct: 505 AEKSTNVKNSSTNDVKSEDDNLWNDLSEFYACEHYDVSSYDGTVVPLTIVYSCKNRKDKQ 564

Query: 559 NPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGGGGGKKWHHDGRRTKKFNS 618
           +PGLLH HGAFGE LDK+WRSELKSLLDRGW++AYADVRGGGGGGKKWHHDGR TKK NS
Sbjct: 565 SPGLLHGHGAFGEILDKQWRSELKSLLDRGWIVAYADVRGGGGGGKKWHHDGRGTKKQNS 624

Query: 619 VQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCPELFRSAILKVPFLDPINT 678
           ++DYISCAK+LVE++IV E KLA WGYSAGGLLVASAIN  PELFR+A+LKVPFLD  NT
Sbjct: 625 IRDYISCAKYLVEKEIVQENKLAAWGYSAGGLLVASAINCSPELFRAAVLKVPFLDATNT 684

Query: 679 LLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDVAYPAVPFLDPINTLLHPI 738
           LL+PI+PLT  DYEEFGYP  DIDDFHA+R++SPYDNIQKDV                  
Sbjct: 685 LLYPILPLTAVDYEEFGYPG-DIDDFHAIRKFSPYDNIQKDVL----------------- 744

Query: 739 IPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARVRDYSIYDPKRPVILNITT 798
                       YP   +      R    FGVWEAAKW+ARVR+ +IYDPK P++LN+ T
Sbjct: 745 ------------YPSVLVSSSFNTR----FGVWEAAKWVARVREQTIYDPKHPILLNLMT 796

Query: 799 DIVEENRYLHCKESALETAFLLKFIGS 822
           DIVEENRYL CKESALETAFLLK + S
Sbjct: 805 DIVEENRYLQCKESALETAFLLKAMES 796

BLAST of Cp4.1LG07g10850.1 vs. NCBI nr
Match: gi|703071321|ref|XP_010089001.1| (Protease 2 [Morus notabilis])

HSP 1 Score: 1162.5 bits (3006), Expect = 0.0e+00
Identity = 572/813 (70.36%), Positives = 665/813 (81.80%), Query Frame = 1

Query: 19  LRRCLHYKAP----KTPQPPSPPGPPKPPKKPQSFTMHDITWEDPYSWMSRLNDKVAMRH 78
           L R  HYKAP    K P PP+PP PPKPP+KPQSF+ HD TWEDPYSWMS LNDKVAMRH
Sbjct: 22  LVRRAHYKAPQKAAKPPSPPTPPKPPKPPQKPQSFSFHDQTWEDPYSWMSSLNDKVAMRH 81

Query: 79  MDVYMEQEEKYVEAVMADTERLQSKLQSEMASRLAFDLSTPSLRLGPWLYYRRVEEGKQY 138
           MD+YMEQEEKY EAVMADTERLQSKLQSEMA RLA+DLSTP LR GPWLYYRR EEGKQY
Sbjct: 82  MDIYMEQEEKYAEAVMADTERLQSKLQSEMAFRLAYDLSTPPLRWGPWLYYRRAEEGKQY 141

Query: 139 QVLCRRLASLHEKFISNKSPSAGFDYVSGKKIEQKLLDYNQEAERFGGYAYEELSEVSPD 198
            VLCRRLASL+E+FIS+KSPSAGFD+ SGK+IEQKL+DYNQEAERFGGYAYEELSEVSPD
Sbjct: 142 PVLCRRLASLNEEFISHKSPSAGFDFASGKRIEQKLIDYNQEAERFGGYAYEELSEVSPD 201

Query: 199 HRFLAYTMYDKDNDYFRLSVKNLSSGSLCSKPQVDRVSNLAWAKGGQALLYVVTDQNKRP 258
           HRFLAYTMYDKDND+FRLSV+NL+SG+LC KPQ D +SNLAWAK GQALLYVVTDQ KRP
Sbjct: 202 HRFLAYTMYDKDNDFFRLSVRNLNSGALCGKPQADCISNLAWAKDGQALLYVVTDQKKRP 261

Query: 259 C-----SMIGSTDEDTLLLEEPDDDVHVYIRHTKDFNFVTVNRFTPTSSKVFLIDAANPL 318
                 SMIGSTD+D LLLEE D++V+V IRHTKDF FVTVN F+PTSSKVFLI+AA+PL
Sbjct: 262 YRWIYYSMIGSTDDDVLLLEELDENVYVNIRHTKDFRFVTVNTFSPTSSKVFLINAADPL 321

Query: 319 SGMELIWECEGLAHCIMEHHLGVLYLFTNANKGHEAVDSHYLLRSPLSVESTSRTWENVF 378
           SG+ LIWEC+G+AHCI+EHH G LYLFT+A K  + VD HYLLRSP+   +  R WENVF
Sbjct: 322 SGLNLIWECDGVAHCIVEHHQGFLYLFTDAAKAGQPVDFHYLLRSPVDTSTGPRIWENVF 381

Query: 379 VDDPDLVIVDVDFSHTHLVLILREGQKLRLCAVRLPLPVGGKGSINLKELEPHFLPLPKH 438
           +DDP LV+ DVDF +THL+LILREG++ RL +V LPLP G +G ++LKEL PH+LPLPK+
Sbjct: 382 IDDPHLVVEDVDFCNTHLLLILREGRQFRLGSVTLPLPAG-RGPVSLKELHPHYLPLPKY 441

Query: 439 VSQISSGPNYDFYSSTMRFTISSPVMPDAVVDYNLSDGKWNIIQQQSILHERTRILYGTT 498
           VSQIS G  YD++SSTMRFTISSPVMPDA+VDY+LS+GKWNI+QQQ+ILHERT++LYGT+
Sbjct: 442 VSQISPGMIYDYFSSTMRFTISSPVMPDAIVDYDLSNGKWNIVQQQNILHERTKVLYGTS 501

Query: 499 SSAEASGKISNESEI-STGEANFDDDQMWNTLSEFYACEHFNVSSHDEVLIPLTVVYSYK 558
           S +  S    N   + +T E   DD  +WN LSEFYACEH NVSS+D V +PLT++YS K
Sbjct: 502 SLSSISKHTLNSKTVDTTDEVRSDDANLWNDLSEFYACEHRNVSSYDGVEVPLTIIYSRK 561

Query: 559 SKRENENPGLLHVHGAFGEPLDKRWRSELKSLLDRGWVIAYADVRGGGGGGKKWHHDGRR 618
           +++E + PGLLH HGA+GE LDKRWRSELKSLLDRGW++AYADVRGGGGGGKKWH+DGRR
Sbjct: 562 NEKEGQYPGLLHGHGAYGELLDKRWRSELKSLLDRGWIVAYADVRGGGGGGKKWHYDGRR 621

Query: 619 TKKFNSVQDYISCAKFLVERKIVNEEKLAGWGYSAGGLLVASAINQCPELFRSAILKVPF 678
           TKK NS++DYISCAK+L+ER+IV++ KLAGWGYSAGGLLVASAIN CP+LFR+A   VPF
Sbjct: 622 TKKINSIKDYISCAKYLIEREIVHQNKLAGWGYSAGGLLVASAINSCPDLFRAA---VPF 681

Query: 679 LDPINTLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPYDNIQKDVAYPAVPFLDPIN 738
           LD  NTLL+P++P+T ADYEEFGYP  DI+DFHA+R YSPYDNIQKDV            
Sbjct: 682 LDATNTLLYPVLPVTAADYEEFGYPW-DINDFHAIREYSPYDNIQKDVP----------- 741

Query: 739 TLLHPIIPLTPADYEEFGYPEEDIDDFHAVRRYSPFGVWEAAKWIARVRDYSIYDPKRPV 798
                             YP   I      R    FG+WEAAKW+ARVR+++IYDPKRPV
Sbjct: 742 ------------------YPALLISSSFNTR----FGIWEAAKWVARVREHTIYDPKRPV 796

Query: 799 ILNITTDIVEENRYLHCKESALETAFLLKFIGS 822
           +LN+TTDIVEENRYL CKESALE AFL+K + S
Sbjct: 802 LLNLTTDIVEENRYLQCKESALEAAFLMKVMES 796

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PPCEL_CHICK	2.5e-38	25.81	Prolyl endopeptidase-like OS=Gallus gallus GN=PREPL PE=2 SV=1
PTRB_ECOLI	1.8e-36	31.49	Protease 2 OS=Escherichia coli (strain K12) GN=ptrB PE=1 SV=2
PPCEL_MOUSE	4.4e-35	41.27	Prolyl endopeptidase-like OS=Mus musculus GN=Prepl PE=1 SV=1
PPCEL_HUMAN	1.7e-34	40.21	Prolyl endopeptidase-like OS=Homo sapiens GN=PREPL PE=1 SV=1
PPCEL_MACFA	1.7e-34	40.74	Prolyl endopeptidase-like OS=Macaca fascicularis GN=PREPL PE=2 SV=1
Match Name	E-value	Identity	Description
A0A0A0LL77_CUCSA	0.0e+00	83.03	Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009530 PE=4 SV=1
V4UI75_9ROSI	0.0e+00	70.30	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10024926mg PE=4 SV=1
A0A061DG58_THECC	0.0e+00	70.14	Prolyl oligopeptidase family protein isoform 1 OS=Theobroma cacao GN=TCM_000521 ...
W9R2Z1_9ROSA	0.0e+00	70.36	Protease 2 OS=Morus notabilis GN=L484_020287 PE=4 SV=1
A0A0D2S7A8_GOSRA	0.0e+00	68.07	Uncharacterized protein OS=Gossypium raimondii GN=B456_007G052400 PE=4 SV=1
Match Name	E-value	Identity	Description
AT5G66960.1	0.0e+00	66.10	Prolyl oligopeptidase family protein
AT1G69020.1	1.0e-138	37.52	Prolyl oligopeptidase family protein
AT1G50380.1	3.5e-59	27.03	Prolyl oligopeptidase family protein
AT1G76140.1	2.7e-19	33.15	Prolyl oligopeptidase family protein
AT1G20380.1	4.7e-19	33.15	Prolyl oligopeptidase family protein
Match Name	E-value	Identity	Description
gi|659070747|ref|XP_008456457.1|	0.0e+00	83.76	PREDICTED: prolyl endopeptidase-like [Cucumis melo]
gi|449442973|ref|XP_004139255.1|	0.0e+00	83.03	PREDICTED: prolyl endopeptidase-like [Cucumis sativus]
gi|567866363|ref|XP_006425804.1|	0.0e+00	70.30	hypothetical protein CICLE_v10024926mg [Citrus clementina]
gi|590704291|ref|XP_007047118.1|	0.0e+00	70.14	Prolyl oligopeptidase family protein isoform 1 [Theobroma cacao]
gi|703071321|ref|XP_010089001.1|	0.0e+00	70.36	Protease 2 [Morus notabilis]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term	Definition
GO:0070008	serine-type exopeptidase activity
GO:0004252	serine-type endopeptidase activity
GO:0008236	serine-type peptidase activity
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
Vocabulary: INTERPRO
Term	Definition
IPR023302	Pept_S9A_N
IPR011042	6-blade_b-propeller_TolB-like
IPR002470	Peptidase_S9A
IPR001375	Peptidase_S9
GO Assignments
This mRNA is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
cellular_component	GO:0016020	membrane
molecular_function	GO:0004252	serine-type endopeptidase activity
molecular_function	GO:0070008	serine-type exopeptidase activity
molecular_function	GO:0008236	serine-type peptidase activity

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
Cp4.1LG07g10850	Cp4.1LG07g10850	gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG07g10850.1	Cp4.1LG07g10850.1-protein	polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG07g10850.1:five_prime_utr:001	Cp4.1LG07g10850.1:five_prime_utr:001	five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG07g10850.1:cds:001	Cp4.1LG07g10850.1:cds:001	CDS
Cp4.1LG07g10850.1:cds:002	Cp4.1LG07g10850.1:cds:002	CDS
Cp4.1LG07g10850.1:cds:003	Cp4.1LG07g10850.1:cds:003	CDS
Cp4.1LG07g10850.1:cds:004	Cp4.1LG07g10850.1:cds:004	CDS
Cp4.1LG07g10850.1:cds:005	Cp4.1LG07g10850.1:cds:005	CDS
Cp4.1LG07g10850.1:cds:006	Cp4.1LG07g10850.1:cds:006	CDS
Cp4.1LG07g10850.1:cds:007	Cp4.1LG07g10850.1:cds:007	CDS
Cp4.1LG07g10850.1:cds:008	Cp4.1LG07g10850.1:cds:008	CDS
Cp4.1LG07g10850.1:cds:009	Cp4.1LG07g10850.1:cds:009	CDS
Cp4.1LG07g10850.1:cds:010	Cp4.1LG07g10850.1:cds:010	CDS
Cp4.1LG07g10850.1:cds:011	Cp4.1LG07g10850.1:cds:011	CDS
Cp4.1LG07g10850.1:cds:012	Cp4.1LG07g10850.1:cds:012	CDS


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001375	Peptidase S9, prolyl oligopeptidase, catalytic domain	PFAM	PF00326	Peptidase_S9	coord: 575..714 score: 5.2
IPR002470	Peptidase S9A, prolyl oligopeptidase	PRINTS	PR00862	PROLIGOPTASE	coord: 582..606 score: 5.2E-24; coord: 610..629 score: 5.2E-24; coord: 640..660 score: 5.2E-24; coord: 556..574 score: 5.2E-24; coord: 699..714 score: 5.2
IPR002470	Peptidase S9A, prolyl oligopeptidase	PANTHER	PTHR11757	PROTEASE FAMILY S9A OLIGOPEPTIDASE	coord: 521..729 score: 0.0; coord: 6..495 score: 0.0; coord: 763..820 score:
IPR011042	Six-bladed beta-propeller, TolB-like	GENE3D	G3DSA:2.120.10.30		coord: 190..262 score: 9.
IPR023302	Peptidase S9A, N-terminal domain	PFAM	PF02897	Peptidase_S9_N	coord: 43..474 score: 1.1
None	No IPR available	PANTHER	PTHR11757:SF6	PROLYL OLIGOPEPTIDASE FAMILY PROTEIN	coord: 521..729 score: 0.0; coord: 763..820 score: 0.0; coord: 6..495 score:
None	No IPR available	unknown	SSF50993	Peptidase/esterase 'gauge' domain	coord: 36..477 score: 7.59
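
The `coord: start..end` entries in the Alignment column give the span of each domain match on the protein. As a rough illustration, the sketch below pulls those spans out of a line of text and reports their lengths. The function name `coord_spans` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Matches entries such as "coord: 575..714"; tolerant of text fused around them.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_spans(text):
    """Return a list of (start, end, length) tuples for each coord entry.

    Length is end - start + 1, ASSUMING 1-based inclusive coordinates.
    """
    spans = []
    for start, end in COORD_RE.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


# Sample taken from the PRINTS (PR00862) row of the table above.
sample = "coord: 582..606 score: 5.2E-24; coord: 610..629 score: 5.2E-24"
for start, end, length in coord_spans(sample):
    print("%d..%d spans %d residues" % (start, end, length))
```

Scores are deliberately left unparsed here, since several of them are cut off in this record.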