CmaCh20G009810 (gene) Cucurbita maxima (Rimu)

NameCmaCh20G009810
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionEukaryotic aspartyl protease family protein
LocationCma_Chr20 : 5217563 .. 5223142 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCTCGCTTTCTTTAAAATGAATCATCCTTCACCCAACTTTAACTGCAATTGTTTATTTTGAATCAATCAATCAGCTCTCACCTTGATTTCGCATCCCTTCAATCCATTCTCCTTCTTCCCTCTTCGATCTGCCTCCTTCCGTTGACCTTTCCCATATCCAATCGCCAATGGCACGAACACCCAATCTACTCCTCGCTGTTCTCCTCCATTTCTTGCACCTCACGCATTTCACACTCTCCGCCGATCCCATCTCCTCCAATCCTCTCCTTACGCCTTCGCATCGCGCAATGGTGTTGCCTCTTTATCGCTCTTCTCCCAATTCCTCCAAATTGATCTCCAAGCCTCACCGTCGTCTCCGCGGATTCCCCAATTCGAATAATCGTTCCAACGCTCGAATGCGGCTCTACGACGATCTCCTTCTCAATGGGTATGATCCACCTCCTTTAATTTTTGGATGGTTATGATTTTCGTTTGATGAAATTTGGTTTTTGGTTCAGGTATTATACGACACGGCTTTGGATCGGTACTCCGCCGCAGAAATTCGCGCTTATTGTTGATACGGGAAGTACGGTTACTTATGTTCCTTGCTCAACTTGCGAACTTTGTGGGAAGCACCAGGTTAAAATGTTTAGAATTTGATGCTTGCCAGATCATTTATAAGGCCTTCATATTCATATCCATATTCAATCATTTGCTCTGTTTCCTTTATCTGCTTTTTATGATATTGATTTGGAAATTGGTAATTTATCAGTCTGGTAAGGAAATAGATCAAATGATATGGATCCTAGAATGATATATGTTTATACTGCACCAAAGGGTAAATGATAAGGTCTCCTTCGATGGACAAATTGTTTTTTTCTGGGAAGAAGTGATCTCGGAATGCTGTCTGACTCTGGACGCTAGAATGTTGTTTCCCTCCAATTTAGAGAAATGTGGATAACTAAAAGGCATTATCTATTTAATGAATGTGACAGAATTGTATTGTCGGTATTCTATATGGGTCTTAAAGCTTGTTCCTACGATGGTTTTATGGATAAAATTTGTGCAGTCATTTATTACATTTATATTAAAGCATTTCTAGTTTTTTCTTCATAGGAACCTTGCCAGCTGTTCTTTCTACTTAAACTTTTATTTCTATCAGGACCCAAAGTTTGACCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATTCTGATTGCACTTGTGACAATGACGGAGTGCAGTGTGTCTATGAGAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGATGATGTTATATCCTTTGGAAATCAGAGTGCACTCGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGAGGAAACTGGTGATCTTTACAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGATCTTAGTATTGTCGACCAACTAGTTGAAAAAGGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCTCCTCCATCAGAGATGATTTTTAGCTACTCAGACCCTGTGAGAAGGTGTGTACTAAAGTCGTTTAACCTTAAAAAAATTGTACTTCATGCTCTTTCATAACAATTACAATTGATACATCTCACACATTTACTTTAGTTTCGGGATGGCAACAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAGTTGCCTCTCGAGCCAAGCGTTTTTGACGGAAGATATGGATCTGTCTTGGATAGTGGTACAACTTATTCTTACCTACCACAGGAAGCCTTTGGACCTTTCAAGAATGCTGTAAGCTTTTATTTCCCAGAATTTAACATTCTCAGTCAGGAGTTTCTTTAGTAATGTCTGAATTTGAAGATTGTTGCATGCTTTGCCAAGGACATAGTAAGACAGTCTGATTCTTACTTTTCCAGAGTTATCTTTCTTCTATTTCTGTTTCAGTCTGATAATTGTTTGCATCGTTTTTTTATTTTCTTTTGTAAACTTGTTCTAGTTGCTTTACATTTTAACGATTTGATATGAATCAATTTACATTGTTTCCTTGTCAATCTTAATAAATGAGAACATTTGTCATATTTGTGGAGGCTTTCAATTCAGTGTTAAGCCAAGTTCTTGTGTTGCCTGAAAAAACTTCGTCTATACAATTAAATGGAGAAAGGAAGAGAGAGAAACGAAAAACTTTTGGCTCCTTATTAGAAAGGATCGATCCCTGTAGTAGTCAAACCATGACCAGGCATATATCCTGTCTCTTTTCTCTTGTTTTCTCTAATAGTGTTCCCTTGATTACAAAATTACAAATTTGATCTTGAGTCGTTAGCAGTTTGAAGTCCTGTTTTTTGCATCTTTGTTTTAGAATGCATAATGAATTTAAAAAGAAAAAATCTTTCAGATTATGAATGCGCTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAAAGATACATGTTTTTCTGGTGCTGGAAGGTATGGTTAGGTAATACGTTGTCAGTTTTGAAAGTGATTTACAACAATAAGTAATTATGATTCGTTACTTTATGTTAGTGATGCTGCTGAATTATCAAAAACATTTCCGACAGTTGACTTGATATTTGACAATGGCCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTCCGGGTAAGGAACATCCCTTGATTACTAAAGATTTTTGCGTTACTTCTTTGTATCTGTGTCTGTTTACATGGGATTAAGTGTGGGGTTGGGCAGTATAAGCCTCCATAACAAAAATAAGGGCCGTCAATTGATCTTTGTATTCAATTGAAATAGTGGTCATAAAAGTTTGAGCTAACGGATTCTTCTTTACTGTGAATTTTCTGAATCATTATTTCAGCACTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTCGAGAATGGAAATAATGATCAAACTACTCTTCTAGGAGGTACATTTTTTTCATGGCACAGACGATTTTCGCTCGTGAATGCTTATTCTATAGGAATTTGGAATTTCTGACTGTTCATTTTGTGAATTCGCCTTTTTTTTTTTTTTTTTTTGTAATTTAATCACATATCACATTTTAGCATGACTGTTTTCTGTTACAGGGATCATTGTCCGCAACACTTTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACTAATTGTTCCGAGTTATGGGAAAGGCTTCACATTTCGGATGAAAATGCTCATGCTCCTTCAGTTTCAAATACATCACACGATACTGATACGGCACCTGCATCAGCTCCAAGCGAATCACCACATGATATGATTCCGGGTATGGTTAATTGCAAGGTTTATTTTTGCTTTTATGCTAATACGGAAGCTTGACAGCATGTCAGTTGGTTCTTTAGTAATATTCTTTTCCTTATTGTCGATATGGTCGTTGAGCTTAATTGATTTGGTAAGTTAAAACAAGTTATACCATCCTTCATATCATGATGACAGTTGAGGTTCTACCATTTTAGTCTTCTTTGATATCATGCTAATGTATATGATATAATATGTGAATTTCAAACGATGATTGTTTTAACAACCCCTTTTGTAATTTCATACTATCGATGAAATTATTTCTTATAAAAAAAAAAAAAAAAAGAAGAGTATTTTAACAAATTTATTATCTGTTGCAAGCTCTACCTTAAAGGTCTGTTTGTTAAAGTGTTTTCTTTTGTGCTACGTGCAGAAGATATCCAGATTGGACGTATCACATTTGATATCTTGTTGAACATAAGCTACAAACATCTGGAGCCTCATATTACACACCTTTCCGATCATATTGCTCAAGAGTTAAATGTTAGTCATTCACAGGTTAGTACACTTGACACATTGGGGATTGAAATTTATTTTTCAGTATTTAACAATAATTAATGTAATAACAAATAGGTTTTATTATAAATGTATTCCATTGTTATGGCTATTTGATCCCTAAGATTTATAAGTCATTAAACATTGACGTTAGTTAGTTAAAAGTCATTAATTATGGTTCTAAATAGTTCCTATATTGAGTTAATTTTTCACTAAATATTGACATGCCAAATGAGCTAACACTGGTTCAGTGAGCATATCATGTCATAGCTTAGTTTATAACATAAGTGGGAGGTGAAATCTATTAAAGTTTAAGGAGAATAAATTGATGCAACGATCAAAAGTCACGGGCAAAACTTATAATTTACCCTGAGAGACATCTCTCTCAGCTTTTCTTACTAATTGAGTTGTGACTGTATCCAACTTTTTACTGCATATCATTTTTAACCATTATTTCAGGTCCGTTTATTGAACTTTACCATGAGAGGAAATCATTCACTTATTCAGTTGGCCATACTCCCTAATGGATCCTCAGAATTTTTCTCACATGCGACTGCTACTGTAAGTGAATATACCTGCAACTGTAAACTGCACAACAACCGAAATTCACAAGCAACCCTTTAAAATATTGTTTCTCTGGCAGACGATAATTTCCCTGATCGTCGAGCATCACATGAAGCTACCTCCTAGGTATGGAAGTTACCAGGTCATTCGATGGAATGTCGAACCTCTAATGGATAGGTAAGTATGTAAATAGGTTTAAATGTTAAATTATGAAGTTAGTCAATAAATCTTCTTTAGTTGTGTCTATTTAGTTTTTAAACTTCAAAAACTATGAATTATGTTCTTAAAATTTTGGTTTTTATTCTAAGACGTTCCAGTCATGACATAGTGATAGAACTAACGCACTCACGTACTTGAGGGATTAAGCTAACTAGAAAAAAATAAGTGGACTAGATTTTGCTTGATTGTTGAGCCTTCTTGTGGGTCAAATGTCACGAGCCTTCTTTGCATCATTTACTAAAATATCAACCTGATTGTTACCTATTTAAATTCAAGGAAGTGATCTAAGTGTGATAAAATTTAAACTGAATTTTTTTACATTTTGTTTTATCAACTAGCCTACTCCCTCATGCGCCACCCGTAATCACATCACGTCAAATATAGTCCAGTTGAAACTTTTAAAAATTTATGGATTAAATAGACATATTGTCACGTCTTAAGGACTAAATGGTAATGTAGTCATACAAATGTTATGTTGTTACTACATTCCTAGTTAGTTAGAGTCTTGTATTCATGGATGGCTAAATGTTGCATAGCATTAATGTAATCCAATCTGTTTTATATTGTTTAGGTCATTGTGGAAGCGACTTTATGTTTTGGTGGGTTTAGCCATTATGGTCACGCTTATTCTTGGGTTGTCAGCAGTGGGAGTGTGGTTTATTTGGAGGAGGAGACAGCAAGCATTCCATTCATATAAGCCTGTCAATGCAGCAGCTCCAGAGCAGGAACTCCAGACCCTGTAGTAAGAGTCCCGCCAACAAATTTTTGTTTTGTTGTCACTGTTTGTATCATTTGCTTTGAATTTTTATAGGTTCAATGTTCAGTGGGAGACATTGAAAGAATATTATTATTTTCCACATTTTGTATTCATTTTATGCTCAAAGGTTCTAACCTTAAACAAATCGGAAGTTCCCTTTTTTCAC

mRNA sequence

CCTCGCTTTCTTTAAAATGAATCATCCTTCACCCAACTTTAACTGCAATTGTTTATTTTGAATCAATCAATCAGCTCTCACCTTGATTTCGCATCCCTTCAATCCATTCTCCTTCTTCCCTCTTCGATCTGCCTCCTTCCGTTGACCTTTCCCATATCCAATCGCCAATGGCACGAACACCCAATCTACTCCTCGCTGTTCTCCTCCATTTCTTGCACCTCACGCATTTCACACTCTCCGCCGATCCCATCTCCTCCAATCCTCTCCTTACGCCTTCGCATCGCGCAATGGTGTTGCCTCTTTATCGCTCTTCTCCCAATTCCTCCAAATTGATCTCCAAGCCTCACCGTCGTCTCCGCGGATTCCCCAATTCGAATAATCGTTCCAACGCTCGAATGCGGCTCTACGACGATCTCCTTCTCAATGGGTATTATACGACACGGCTTTGGATCGGTACTCCGCCGCAGAAATTCGCGCTTATTGTTGATACGGGAAGTACGGTTACTTATGTTCCTTGCTCAACTTGCGAACTTTGTGGGAAGCACCAGGACCCAAAGTTTGACCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATTCTGATTGCACTTGTGACAATGACGGAGTGCAGTGTGTCTATGAGAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGATGATGTTATATCCTTTGGAAATCAGAGTGCACTCGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGAGGAAACTGGTGATCTTTACAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGATCTTAGTATTGTCGACCAACTAGTTGAAAAAGGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCTCCTCCATCAGAGATGATTTTTAGCTACTCAGACCCTGTGAGAAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAGTTGCCTCTCGAGCCAAGCGTTTTTGACGGAAGATATGGATCTGTCTTGGATAGTGGTACAACTTATTCTTACCTACCACAGGAAGCCTTTGGACCTTTCAAGAATGCTATTATGAATGCGCTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAAAGATACATGTTTTTCTGGTGCTGGAAGTGATGCTGCTGAATTATCAAAAACATTTCCGACAGTTGACTTGATATTTGACAATGGCCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTCCGGCACTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTCGAGAATGGAAATAATGATCAAACTACTCTTCTAGGAGGGATCATTGTCCGCAACACTTTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACTAATTGTTCCGAGTTATGGGAAAGGCTTCACATTTCGGATGAAAATGCTCATGCTCCTTCAGTTTCAAATACATCACACGATACTGATACGGCACCTGCATCAGCTCCAAGCGAATCACCACATGATATGATTCCGGAAGATATCCAGATTGGACGTATCACATTTGATATCTTGTTGAACATAAGCTACAAACATCTGGAGCCTCATATTACACACCTTTCCGATCATATTGCTCAAGAGTTAAATGTTAGTCATTCACAGGTCCGTTTATTGAACTTTACCATGAGAGGAAATCATTCACTTATTCAGTTGGCCATACTCCCTAATGGATCCTCAGAATTTTTCTCACATGCGACTGCTACTACGATAATTTCCCTGATCGTCGAGCATCACATGAAGCTACCTCCTAGGTATGGAAGTTACCAGGTCATTCGATGGAATGTCGAACCTCTAATGGATAGGTCATTGTGGAAGCGACTTTATGTTTTGGTGGGTTTAGCCATTATGGTCACGCTTATTCTTGGGTTGTCAGCAGTGGGAGTGTGGTTTATTTGGAGGAGGAGACAGCAAGCATTCCATTCATATAAGCCTGTCAATGCAGCAGCTCCAGAGCAGGAACTCCAGACCCTGTAGTAAGAGTCCCGCCAACAAATTTTTGTTTTGTTGTCACTGTTTGTATCATTTGCTTTGAATTTTTATAGGTTCAATGTTCAGTGGGAGACATTGAAAGAATATTATTATTTTCCACATTTTGTATTCATTTTATGCTCAAAGGTTCTAACCTTAAACAAATCGGAAGTTCCCTTTTTTCAC

Coding sequence (CDS)

ATGGCACGAACACCCAATCTACTCCTCGCTGTTCTCCTCCATTTCTTGCACCTCACGCATTTCACACTCTCCGCCGATCCCATCTCCTCCAATCCTCTCCTTACGCCTTCGCATCGCGCAATGGTGTTGCCTCTTTATCGCTCTTCTCCCAATTCCTCCAAATTGATCTCCAAGCCTCACCGTCGTCTCCGCGGATTCCCCAATTCGAATAATCGTTCCAACGCTCGAATGCGGCTCTACGACGATCTCCTTCTCAATGGGTATTATACGACACGGCTTTGGATCGGTACTCCGCCGCAGAAATTCGCGCTTATTGTTGATACGGGAAGTACGGTTACTTATGTTCCTTGCTCAACTTGCGAACTTTGTGGGAAGCACCAGGACCCAAAGTTTGACCCAGAATTGTCAAGCACTTACCAACCTGTCAAATGCAATTCTGATTGCACTTGTGACAATGACGGAGTGCAGTGTGTCTATGAGAGGCAGTATGCTGAAATGAGTACTAGCAGTGGTGTCCTTGGTGATGATGTTATATCCTTTGGAAATCAGAGTGCACTCGTACCCCAGCGTGCTGTGTTTGGTTGTGAGAATGAGGAAACTGGTGATCTTTACAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGATCTTAGTATTGTCGACCAACTAGTTGAAAAAGGTGTGATTAATGATTCTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGAATCTCTCCTCCATCAGAGATGATTTTTAGCTACTCAGACCCTGTGAGAAGTCCATATTACAATGTTGATTTGAAGGAGATACATGTTGCGGGTAAAAAGTTGCCTCTCGAGCCAAGCGTTTTTGACGGAAGATATGGATCTGTCTTGGATAGTGGTACAACTTATTCTTACCTACCACAGGAAGCCTTTGGACCTTTCAAGAATGCTATTATGAATGCGCTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAAAGATACATGTTTTTCTGGTGCTGGAAGTGATGCTGCTGAATTATCAAAAACATTTCCGACAGTTGACTTGATATTTGACAATGGCCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTCCGGCACTCAAAGGTACATGGTGCATATTGTCTGGGAATTTTCGAGAATGGAAATAATGATCAAACTACTCTTCTAGGAGGGATCATTGTCCGCAACACTTTAGTGATGTATGACAGAGAGCATTCAAAAATTGGATTTTGGAAAACTAATTGTTCCGAGTTATGGGAAAGGCTTCACATTTCGGATGAAAATGCTCATGCTCCTTCAGTTTCAAATACATCACACGATACTGATACGGCACCTGCATCAGCTCCAAGCGAATCACCACATGATATGATTCCGGAAGATATCCAGATTGGACGTATCACATTTGATATCTTGTTGAACATAAGCTACAAACATCTGGAGCCTCATATTACACACCTTTCCGATCATATTGCTCAAGAGTTAAATGTTAGTCATTCACAGGTCCGTTTATTGAACTTTACCATGAGAGGAAATCATTCACTTATTCAGTTGGCCATACTCCCTAATGGATCCTCAGAATTTTTCTCACATGCGACTGCTACTACGATAATTTCCCTGATCGTCGAGCATCACATGAAGCTACCTCCTAGGTATGGAAGTTACCAGGTCATTCGATGGAATGTCGAACCTCTAATGGATAGGTCATTGTGGAAGCGACTTTATGTTTTGGTGGGTTTAGCCATTATGGTCACGCTTATTCTTGGGTTGTCAGCAGTGGGAGTGTGGTTTATTTGGAGGAGGAGACAGCAAGCATTCCATTCATATAAGCCTGTCAATGCAGCAGCTCCAGAGCAGGAACTCCAGACCCTGTAG

Protein sequence

MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL
BLAST of CmaCh20G009810 vs. Swiss-Prot
Match: ASPL2_ARATH (Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=2)

HSP 1 Score: 149.1 bits (375), Expect = 1.7e-34
Identity = 116/404 (28.71%), Postives = 188/404 (46.53%), Query Frame = 1

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNG--------YYTTRLWIGTPPQKFALIVDTGSTV 120
           + L  F + + R ++RM    DL L G         Y T++ +G+PP+++ + VDTGS +
Sbjct: 38  KNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDI 97

Query: 121 TYVPCSTCELCGKHQD-----PKFDPELSSTYQPVKCNSD-CT--CDNDGVQ----CVYE 180
            ++ C  C  C    +       FD   SST + V C+ D C+    +D  Q    C Y 
Sbjct: 98  LWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYH 157

Query: 181 RQYAEMSTSSGVLGDDVISFGN-----QSALVPQRAVFGCENEETGDLYS--QRADGIMG 240
             YA+ STS G    D+++        ++  + Q  VFGC ++++G L +     DG+MG
Sbjct: 158 IVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMG 217

Query: 241 LGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYY 300
            G  + S++ QL   G     FS C   +  GGG   +G +  P   + +        +Y
Sbjct: 218 FGQSNTSVLSQLAATGDAKRVFSHCLDNVK-GGGIFAVGVVDSPK--VKTTPMVPNQMHY 277

Query: 301 NVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKI 360
           NV L  + V G  L L  S+     G+++DSGTT +Y P+  +      I+       K+
Sbjct: 278 NVMLMGMDVDGTSLDLPRSIVRNG-GTIVDSGTTLAYFPKVLYDSLIETIL--ARQPVKL 337

Query: 361 GGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLG 420
              +  F+  CF    S +  + + FP V   F++  KL++ P +YLF  +     YC G
Sbjct: 338 HIVEETFQ--CF----SFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLF--TLEEELYCFG 397

Query: 421 IFENG----NNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCS 434
               G       +  LLG +++ N LV+YD ++  IG+   NCS
Sbjct: 398 WQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427

BLAST of CmaCh20G009810 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 3.8e-34
Identity = 107/361 (29.64%), Postives = 165/361 (45.71%), Query Frame = 1

Query: 86  NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
           +G Y  R+ +G+PP+   +++D+GS + +V C  C+LC K  DP FDP  S +Y  V C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 146 SDCTCD---NDGVQ---CVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEE 205
           S   CD   N G     C YE  Y + S + G L  + ++F   +  V +    GC +  
Sbjct: 188 SS-VCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF---AKTVVRNVAMGCGHRN 247

Query: 206 TGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCY--GGMDIGGGAMVLGGISPP 265
            G      A G++G+G G +S V QL   G    +F  C    G D   G++V G  + P
Sbjct: 248 RGMFIG--AAGLLGIGGGSMSFVGQL--SGQTGGAFGYCLVSRGTD-STGSLVFGREALP 307

Query: 266 --SEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD----GRYGSVLDSGTTYSYL 325
             +  +    +P    +Y V LK + V G ++PL   VFD    G  G V+D+GT  + L
Sbjct: 308 VGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRL 367

Query: 326 PQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQK 385
           P  A+  F++   +   +L +  G   +  DTC+  +G     +S   PTV   F  G  
Sbjct: 368 PTAAYVAFRDGFKSQTANLPRASG--VSIFDTCYDLSGF----VSVRVPTVSFYFTEGPV 427

Query: 386 LSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTN 433
           L+L   N+L       G YC     +      +++G I      V +D  +  +GF    
Sbjct: 428 LTLPARNFLMPVDD-SGTYCFAFAASPTG--LSIIGNIQQEGIQVSFDGANGFVGFGPNV 470

BLAST of CmaCh20G009810 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 3.8e-34
Identity = 105/363 (28.93%), Postives = 172/363 (47.38%), Query Frame = 1

Query: 86  NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
           +G Y +R+ +GTP ++  L++DTGS V ++ C  C  C +  DP F+P  SSTY+ + C+
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCS 218

Query: 146 S-DCTCDNDGV----QCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEET 205
           +  C+          +C+Y+  Y + S + G L  D ++FGN   +       GC ++  
Sbjct: 219 APQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKI--NNVALGCGHDNE 278

Query: 206 GDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMV------LGGI 265
           G L++  A G++GLG G LSI +Q+        SFS C    D G  + +      LGG 
Sbjct: 279 G-LFTGAA-GLLGLGGGVLSITNQMKA-----TSFSYCLVDRDSGKSSSLDFNSVQLGGG 338

Query: 266 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD----GRYGSVLDSGTTYSY 325
              + ++    +     +Y V L    V G+K+ L  ++FD    G  G +LD GT  + 
Sbjct: 339 DATAPLL---RNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 398

Query: 326 LPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKT-FPTVDLIFDNG 385
           L  +A+   ++A +    +LKK G    +  DTC+     D + LS    PTV   F  G
Sbjct: 399 LQTQAYNSLRDAFLKLTVNLKK-GSSSISLFDTCY-----DFSSLSTVKVPTVAFHFTGG 458

Query: 386 QKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWK 433
           + L L  +NYL       G +C       ++   +++G +  + T + YD   + IG   
Sbjct: 459 KSLDLPAKNYLIPVDD-SGTFCFAFAPTSSS--LSIIGNVQQQGTRITYDLSKNVIGLSG 500

BLAST of CmaCh20G009810 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 142.5 bits (358), Expect = 1.6e-32
Identity = 111/369 (30.08%), Postives = 169/369 (45.80%), Query Frame = 1

Query: 86  NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
           +G Y   L IGTP Q F+ I+DTGS + +  C  C  C     P F+P+ SS++  + C+
Sbjct: 92  DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCS 151

Query: 146 SDC-------TCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENE 205
           S         TC N+   C Y   Y + S + G +G + ++FG+ S  +P    FGC   
Sbjct: 152 SQLCQALSSPTCSNN--FCQYTYGYGDGSETQGSMGTETLTFGSVS--IP-NITFGCGEN 211

Query: 206 ETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGA---MVLGGI- 265
             G      A G++G+G G LS+  QL       D     Y    IG      ++LG + 
Sbjct: 212 NQGFGQGNGA-GLVGMGRGPLSLPSQL-------DVTKFSYCMTPIGSSTPSNLLLGSLA 271

Query: 266 ------SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVF-----DGRYGSVLD 325
                 SP + +I S   P    +Y + L  + V   +LP++PS F     +G  G ++D
Sbjct: 272 NSVTAGSPNTTLIQSSQIPT---FYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIID 331

Query: 326 SGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVD 385
           SGTT +Y    A+   +   ++ + +L  + G    F D CF    SD + L    PT  
Sbjct: 332 SGTTLTYFVNNAYQSVRQEFISQI-NLPVVNGSSSGF-DLCFQ-TPSDPSNLQ--IPTFV 391

Query: 386 LIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHS 433
           + FD G  L L  ENY    S  +G  CL +    ++   ++ G I  +N LV+YD  +S
Sbjct: 392 MHFDGGD-LELPSENYFI--SPSNGLICLAM--GSSSQGMSIFGNIQQQNMLVVYDTGNS 434

BLAST of CmaCh20G009810 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 2.1e-32
Identity = 117/370 (31.62%), Postives = 177/370 (47.84%), Query Frame = 1

Query: 86  NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
           +G Y   + IGTP   F+ I+DTGS + +  C  C  C     P F+P+ SS++  + C 
Sbjct: 93  DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCE 152

Query: 146 SD-C------TCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENE 205
           S  C      TC+N+  +C Y   Y + ST+ G +  +  +F  +++ VP  A FGC  +
Sbjct: 153 SQYCQDLPSETCNNN--ECQYTYGYGDGSTTQGYMATETFTF--ETSSVPNIA-FGCGED 212

Query: 206 ETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLC---YGG-----MDIGGGAMV 265
             G      A G++G+G G LS+  QL   GV    FS C   YG      + +G  A  
Sbjct: 213 NQGFGQGNGA-GLIGMGWGPLSLPSQL---GV--GQFSYCMTSYGSSSPSTLALGSAASG 272

Query: 266 LGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVF----DGRYGSVLDSGT 325
           +   SP + +I S  +P    YY + L+ I V G  L +  S F    DG  G ++DSGT
Sbjct: 273 VPEGSPSTTLIHSSLNPT---YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGT 332

Query: 326 TYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFK--DTCFSGAGSDAAELSKTFPTVDL 385
           T +YLPQ+A+    NA+  A      +   D +     TCF    SD + +    P + +
Sbjct: 333 TLTYLPQDAY----NAVAQAFTDQINLPTVDESSSGLSTCFQ-QPSDGSTVQ--VPEISM 392

Query: 386 IFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQ--TTLLGGIIVRNTLVMYDREH 433
            FD G  L+L  +N L   S   G  CL +   G++ Q   ++ G I  + T V+YD ++
Sbjct: 393 QFDGG-VLNLGEQNILI--SPAEGVICLAM---GSSSQLGISIFGNIQQQETQVLYDLQN 435

BLAST of CmaCh20G009810 vs. TrEMBL
Match: A0A0A0LJB9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G277070 PE=3 SV=1)

HSP 1 Score: 1073.5 bits (2775), Expect = 6.9e-311
Identity = 526/640 (82.19%), Postives = 569/640 (88.91%), Query Frame = 1

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MA++P L+ A+LLH        LSADPIS NPLL+PSHRAMVLPLY SSPNSSK IS PH
Sbjct: 1   MAKSPFLVAAILLHIF------LSADPISPNPLLSPSHRAMVLPLYLSSPNSSKFISNPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLR FP S+N SNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           E CG+HQDPKFDPE SSTY+P+KCN DC CD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL   +FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGA 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTY+YLP EAF  FK+AIM+ +HSLKKI GPDPNFKD CFSGAGSDAAELS  FP
Sbjct: 301 VLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVD++F+NGQKLSL PENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
            +SKIGFWKTNCSELWERL ISD+NA  PSVS  SHD+D APASAPSE PH  IP ++QI
Sbjct: 421 ANSKIGFWKTNCSELWERLRISDDNADGPSVSTKSHDSDIAPASAPSERPHYTIPGELQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITF ILLN SY  LEPHIT LSDHIAQELNVSHSQV +LNFTMRGN SLIQLAILP G
Sbjct: 481 GRITFAILLNKSYTDLEPHITELSDHIAQELNVSHSQVIILNFTMRGNDSLIQLAILPYG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSE FSHATA TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGL I+V
Sbjct: 541 SSEIFSHATANTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLVIVV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
             ILGLSA+G WF+ R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFVLRSRQQAINSYKPVNAAVPEQELQPL 633

BLAST of CmaCh20G009810 vs. TrEMBL
Match: A0A061DHD4_THECC (Aspartyl protease family protein OS=Theobroma cacao GN=TCM_000732 PE=3 SV=1)

HSP 1 Score: 850.9 bits (2197), Expect = 1.0e-243
Identity = 420/628 (66.88%), Postives = 501/628 (79.78%), Query Frame = 1

Query: 21  FTLS-ADPISSNPLLTP-----SHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSN 80
           F LS ++P +S PLL P     +  AM+LPL+    NSS+  S   R L    + ++  N
Sbjct: 19  FLLSRSNPSTSTPLLLPPPHHGARPAMILPLFPFPKNSSRTFSHSGRHLLRSDSHSSHPN 78

Query: 81  ARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPE 140
           ARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPC+TCE CG+HQDPKF P+
Sbjct: 79  ARMRLYDDLLLNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCATCEQCGRHQDPKFQPD 138

Query: 141 LSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFG 200
           LSSTYQPVKCN DC+CD D VQC YERQYAEMS+SSGVLG+D+ISFGNQS LVPQRAVFG
Sbjct: 139 LSSTYQPVKCNLDCSCDTDRVQCTYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAVFG 198

Query: 201 CENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGI 260
           CENEETGDLYSQ ADGIMGLG GDLS+VDQLVEKGVI+DSFSLCYGGMDIGGGAMVLGGI
Sbjct: 199 CENEETGDLYSQHADGIMGLGRGDLSVVDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGI 258

Query: 261 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQE 320
           S P +M+FSYSDP RSPYYN+DLK IHVAGK+LPL P+VFD +YG+VLDSGTTY+YLP+ 
Sbjct: 259 SSPPDMVFSYSDPERSPYYNIDLKAIHVAGKQLPLNPNVFDVKYGTVLDSGTTYAYLPEA 318

Query: 321 AFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSL 380
           AF  FKNAI+  L SLK+I GPDPN+ D CFSGA SD +ELSK FPTV+++FDN QKL L
Sbjct: 319 AFAAFKNAIIKELTSLKQIRGPDPNYNDICFSGASSDVSELSKIFPTVEMVFDNQQKLLL 378

Query: 381 APENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSE 440
           APENYLFRHSKV G YCLGIF N   D TTLLGGIIVRNTLV YDREH KIGFWKTNCSE
Sbjct: 379 APENYLFRHSKVRGGYCLGIFPN-EKDPTTLLGGIIVRNTLVTYDREHLKIGFWKTNCSE 438

Query: 441 LWERLHISDENAHAPSVSNTSHDT--DTAPASAPSESPHDMIPEDIQIGRITFDILLNIS 500
           LWERL I+   + +PS S+   ++  ++ P SAP  S H  IP +IQIG IT D+ L+I 
Sbjct: 439 LWERLRINGAPSPSPSSSSGKDNSTVESPPTSAPDGSSHYAIPGEIQIGEITLDMSLSID 498

Query: 501 YKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATT 560
           Y +L+PHI  L++ IA+EL+V+ SQV LL+FT  GN SL+  AI+P+GS+ + S+  A +
Sbjct: 499 YSYLKPHINELAEFIAKELDVNASQVHLLDFTSEGNSSLVTWAIVPSGSATYISNVAAIS 558

Query: 561 IISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVW 620
           IIS + EH ++LP  +G+YQ+++W VEP + ++ W++ Y++V LAIM+T+I+GLSA G W
Sbjct: 559 IISQLAEHRVRLPDTFGNYQLVQWKVEPSVQQTWWQQHYLVVLLAIMITIIVGLSASGGW 618

Query: 621 FIWRRRQQAFHSYKPVNAAAPEQELQTL 641
            IWRRRQQA   YKPV+ A  EQELQ L
Sbjct: 619 IIWRRRQQALKLYKPVDGAVSEQELQPL 645

BLAST of CmaCh20G009810 vs. TrEMBL
Match: A0A0D2QM13_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G056000 PE=3 SV=1)

HSP 1 Score: 850.1 bits (2195), Expect = 1.7e-243
Identity = 421/641 (65.68%), Postives = 507/641 (79.10%), Query Frame = 1

Query: 6   NLLLAVLLHFLHLTHFTLSADPISSNP--LLTPSHR----AMVLPLYRSSPNSSKLISKP 65
           NL +  ++ FL    F LS    S++P  LL P H     AMVLPL+ SS NSS+     
Sbjct: 7   NLAVGTVVFFLL---FLLSQSNPSTSPPRLLPPPHHGARPAMVLPLFPSSKNSSRTFLHS 66

Query: 66  HRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCST 125
           HR L    + ++  NARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPC+T
Sbjct: 67  HRHLLRSDSHSSHPNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCAT 126

Query: 126 CELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVIS 185
           CE CG+HQDPKF P+LSSTYQPVKCN DC CD+D VQC+YERQYAEMS+SSGVLG+D+IS
Sbjct: 127 CEQCGRHQDPKFQPDLSSTYQPVKCNLDCNCDSDRVQCIYERQYAEMSSSSGVLGEDIIS 186

Query: 186 FGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCY 245
           FGNQS LVPQRAVFGCENEETGDLYSQ ADGIMGLG GDLS+VDQLVEKGVI+DSFSLCY
Sbjct: 187 FGNQSELVPQRAVFGCENEETGDLYSQHADGIMGLGRGDLSVVDQLVEKGVISDSFSLCY 246

Query: 246 GGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYG 305
           GGMDIGGGAMVLGGIS PS+M+FSY+DPVRSPYY++ LKEIHVAGK+L L PSVFDG+YG
Sbjct: 247 GGMDIGGGAMVLGGISAPSDMVFSYADPVRSPYYSIGLKEIHVAGKQLSLNPSVFDGKYG 306

Query: 306 SVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTF 365
           +VLDSGTTY+YLP+ AF  FK AI+  L+ LK+I GPDPN+ D CFS A SD +ELSKTF
Sbjct: 307 TVLDSGTTYAYLPEPAFLAFKEAILKELNGLKQIRGPDPNYNDICFSTASSDVSELSKTF 366

Query: 366 PTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYD 425
           PTV+++F + QKL L+PENYLFRHSKVHGAYCLGIF+N   D TTLLGGIIVRNTLV YD
Sbjct: 367 PTVEMVFGDQQKLLLSPENYLFRHSKVHGAYCLGIFQN-EKDPTTLLGGIIVRNTLVTYD 426

Query: 426 REHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQ 485
           REHSKIGFWKTNCSELWERLHI+   +  PS S   + T++   +A   SPH   P  IQ
Sbjct: 427 REHSKIGFWKTNCSELWERLHITGALSPTPSSSGKGNSTESPTTTASDGSPHYDFPGKIQ 486

Query: 486 IGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPN 545
           IG+I  D+ L+ ++ +L+P I  L++ IA+EL+V+ SQV LLNFT  GN SL++LAI+P+
Sbjct: 487 IGKIILDMSLSTNHSYLKPQINKLTEFIAKELDVNASQVHLLNFTSEGNSSLVRLAIVPS 546

Query: 546 GSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIM 605
            SS +    TA  IIS + EH +KLP  +G+YQ+++W VEP   ++ W R Y++V +A++
Sbjct: 547 DSSTYIYKETARNIISRLAEHRVKLPDTFGNYQLVQWKVEPSTKQTWWGRNYMVVVVALI 606

Query: 606 VTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           + +++GLS  GVW +WRR+QQ  +SYKPV AAAPEQELQ L
Sbjct: 607 IIVVIGLSVYGVWGMWRRKQQTVNSYKPVGAAAPEQELQPL 643

BLAST of CmaCh20G009810 vs. TrEMBL
Match: G7JCS6_MEDTR (Eukaryotic aspartyl protease family protein OS=Medicago truncatula GN=MTR_4g095270 PE=3 SV=2)

HSP 1 Score: 842.4 bits (2175), Expect = 3.6e-241
Identity = 418/641 (65.21%), Postives = 501/641 (78.16%), Query Frame = 1

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MAR    L+ +L+  LH+TH T++ D       L   H AM+LPLY ++PNSS     P 
Sbjct: 1   MARPLTHLILILI--LHITH-TIAGD----TAFLRNRHHAMILPLYLTTPNSSTSALDPR 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           R+L G   S    NARMRL+DDLLLNGYYTTRLWIGTPPQ FALIVDTGSTVTYVPCSTC
Sbjct: 61  RQLHG-SESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           E CG+HQDPKF P+LSSTYQPVKC  DC CDND +QCVYERQYAEMSTSSGVLG+DV+SF
Sbjct: 121 EQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQS L PQRAVFGCEN ETGDLYSQ ADGIMGLG GDLSI+DQLV+K V++DSFSLCYG
Sbjct: 181 GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMD+GGGAMVLGGISPPS+M+F+ SDPVRSPYYN+DLKEIHVAGK+LPL PSVFDG++GS
Sbjct: 241 GMDVGGGAMVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGS 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTY+YLP+EAF  FK AI+  L S  +I GPDPN+ D CFSGAG D ++LSKTFP
Sbjct: 301 VLDSGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
            VD+IF NG K SL+PENY+FRHSKV GAYCLGIF+NG  D TTLLGGI+VRNTLV+YDR
Sbjct: 361 VVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNG-KDPTTLLGGIVVRNTLVLYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTA-PASAPSESPHDMIPEDIQ 480
           E +KIGFWKTNC+ELWERL IS      P  +  ++ T +  P+ APS S H++   + Q
Sbjct: 421 EQTKIGFWKTNCAELWERLQISSAPPPMPPNTEATNSTKSVDPSVAPSVSQHNIPRGEFQ 480

Query: 481 IGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPN 540
           I +IT  +  NISY  ++P +T L+  IA ELNV+ SQ+ LLNFT  GN SL + AI P 
Sbjct: 481 IAQITIAVSFNISYDDMKPRLTELAGLIAHELNVNTSQIHLLNFTSSGNDSLSRWAITPR 540

Query: 541 GSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIM 600
             +++FS++TA  II  + EH M+LP  +GSY++I WNV P   R+ W+R Y++VGLA++
Sbjct: 541 PYADYFSNSTAMNIIGRLAEHRMQLPDAFGSYKLIDWNVMPPSKRNWWQRYYMIVGLAVL 600

Query: 601 VTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           +T +LGLS  G +FIW+RR+Q+ HSYKPV+ A PEQELQ L
Sbjct: 601 LTSLLGLSIFG-FFIWKRRRQSAHSYKPVDVAVPEQELQPL 631

BLAST of CmaCh20G009810 vs. TrEMBL
Match: V4SQ94_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025144mg PE=3 SV=1)

HSP 1 Score: 840.1 bits (2169), Expect = 1.8e-240
Identity = 417/643 (64.85%), Postives = 510/643 (79.32%), Query Frame = 1

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHR--AMVLPLYRSSPNSSKLISK 60
           MAR    LL  ++ F+++    + ++P +S   +       AMVLPLY S PN S+ IS 
Sbjct: 1   MARASIPLLTTIVAFVYV----IQSNPATSTATILHGRTRPAMVLPLYLSQPNISRSISI 60

Query: 61  PHRRL-RGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPC 120
             R L R  PNS+   NARMRLYDDLLLNGYYTTRLWIGTPPQ FALIVDTGSTVTYVPC
Sbjct: 61  SRRHLQRSHPNSH--PNARMRLYDDLLLNGYYTTRLWIGTPPQTFALIVDTGSTVTYVPC 120

Query: 121 STCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDV 180
           +TCE CG HQDPKF+P+LSSTYQPVKCN  C CD +  QCVYER+YAEMS+SSGVLG+D+
Sbjct: 121 ATCEHCGDHQDPKFEPDLSSTYQPVKCNLYCNCDRERAQCVYERKYAEMSSSSGVLGEDI 180

Query: 181 ISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSL 240
           ISFGN+S L PQRAVFGCEN ETGDLYSQ ADGI+GLG GDLS+VDQLVEKGVI+DSFSL
Sbjct: 181 ISFGNESDLKPQRAVFGCENVETGDLYSQHADGIIGLGRGDLSVVDQLVEKGVISDSFSL 240

Query: 241 CYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGR 300
           CYGGMD+GGGAMVLGGISPP +M+F++SDPVRSPYYN+DLK IHVAGK LPL P VFDG+
Sbjct: 241 CYGGMDVGGGAMVLGGISPPKDMVFTHSDPVRSPYYNIDLKVIHVAGKPLPLNPKVFDGK 300

Query: 301 YGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSK 360
           +G+VLDSGTTY+YLP+ AF  FK+AIM+ L SLK+I GPDPN+ D CFSGA SD ++LS 
Sbjct: 301 HGTVLDSGTTYAYLPEAAFLAFKDAIMSELQSLKQIRGPDPNYNDICFSGAPSDVSQLSD 360

Query: 361 TFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVM 420
           TFP V++ F NGQKL L+PENYLFRHSKV GAYCLGIF+NG  D TTLLGGIIVRNTLVM
Sbjct: 361 TFPAVEMAFGNGQKLLLSPENYLFRHSKVRGAYCLGIFQNG-RDPTTLLGGIIVRNTLVM 420

Query: 421 YDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPED 480
           YDREHSKIGFWKTNCSELWERLHI+   +  PS   +S   +++   +PSE P+ ++P D
Sbjct: 421 YDREHSKIGFWKTNCSELWERLHITGALSPIPS---SSEGKNSSTDLSPSEPPNYVLPGD 480

Query: 481 IQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAIL 540
           +QIGRITFD+ L+I+Y  L PHI  L+D IAQEL+V+ SQV LLNF  +GN+S I  A+ 
Sbjct: 481 LQIGRITFDMFLSINYSDLRPHIPELADSIAQELDVNTSQVHLLNFMSKGNNSFIAWAVF 540

Query: 541 PNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLA 600
           P+GS+ + S+ATA  IIS + EH + +P  +G+Y++++WN+EP + R+ W+  +++V LA
Sbjct: 541 PSGSANYISNATALRIISRLAEHRVHIPDTFGNYKLLQWNIEPQVKRTWWQEHFLMVVLA 600

Query: 601 IMVTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           I + +++GLS  G+ FI RRR Q+ +SYKPV+AA PEQELQ L
Sbjct: 601 ITIMMVVGLSVFGILFILRRRHQSVNSYKPVDAALPEQELQPL 633

BLAST of CmaCh20G009810 vs. TAIR10
Match: AT3G50050.1 (AT3G50050.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 714.9 bits (1844), Expect = 4.4e-206
Identity = 357/607 (58.81%), Postives = 451/607 (74.30%), Query Frame = 1

Query: 37  SHRAMVLPLYRSSPNSS-KLISKPHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWI 96
           S R MV PL+ S PNSS + IS PHR+L    +S +  ++RMRLYDDLL+NGYYTTRLWI
Sbjct: 41  SRRPMVFPLFLSQPNSSSRSISIPHRKLHK-SDSKSLPHSRMRLYDDLLINGYYTTRLWI 100

Query: 97  GTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGV 156
           GTPPQ FALIVD+GSTVTYVPCS CE CGKHQDPKF PE+SSTYQPVKCN DC CD+D  
Sbjct: 101 GTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMDCNCDDDRE 160

Query: 157 QCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLG 216
           QCVYER+YAE S+S GVLG+D+ISFGN+S L PQRAVFGCE  ETGDLYSQRADGI+GLG
Sbjct: 161 QCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRADGIIGLG 220

Query: 217 SGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNV 276
            GDLS+VDQLV+KG+I++SF LCYGGMD+GGG+M+LGG   PS+M+F+ SDP RSPYYN+
Sbjct: 221 QGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNI 280

Query: 277 DLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGG 336
           DL  I VAGK+L L   VFDG +G+VLDSGTTY+YLP  AF  F+ A+M  + +LK+I G
Sbjct: 281 DLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDG 340

Query: 337 PDPNFKDTCFSGAGSD-AAELSKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGI 396
           PDPNFKDTCF  A S+  +ELSK FP+V+++F +GQ   L+PENY+FRHSKVHGAYCLG+
Sbjct: 341 PDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGV 400

Query: 397 FENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNT 456
           F NG  D TTLLGGI+VRNTLV+YDRE+SK+GFW+TNCSEL +RLHI      A   SN 
Sbjct: 401 FPNG-KDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSDRLHIDGAPPPATLPSND 460

Query: 457 SHDTDTAPASAPSESPHDMIPEDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVS 516
           S+         PS +    +    Q+G+I  DI L ++  +L+P I  LS   ++EL+V 
Sbjct: 461 SN---------PSHNSSSNLSGVTQVGQINLDIQLTVNSSYLKPRIEDLSKIFSKELDVK 520

Query: 517 HSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVI 576
            SQV L N T +GN SL+++ +LP   S +FS+ TAT I+S    H +KLP  +G+YQ++
Sbjct: 521 SSQVSLSNLTSKGNESLVRMVVLPPEPSTWFSNVTATNIVSRFTNHQIKLPEIFGNYQLV 580

Query: 577 RWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVN-AAAP 636
            + +EP   R+    + + +G+   + +I+GLSA G W IW+R+Q +   YKPV+ A   
Sbjct: 581 NYKLEPPRKRTNNNIVVIAIGI---IAVIVGLSAYGAWLIWKRKQTSI-PYKPVDEAIVA 632

Query: 637 EQELQTL 641
           EQELQ +
Sbjct: 641 EQELQPI 632

BLAST of CmaCh20G009810 vs. TAIR10
Match: AT5G43100.1 (AT5G43100.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 705.3 bits (1819), Expect = 3.5e-203
Identity = 350/626 (55.91%), Postives = 455/626 (72.68%), Query Frame = 1

Query: 16  LHLTHFTLSADPISSNPLLTPSHRAMVLPL-YRSSPNSSKLISKPHRRLRGFPNSNNRSN 75
           L L  FT +   I    L T     M+ PL Y S P   ++     RRL    + +   N
Sbjct: 6   LLLLLFTTTTISIFFFDLTTADESPMIFPLSYSSLPPRPRVEDFRRRRL----HQSQLPN 65

Query: 76  ARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPE 135
           A M+LYDDLL NGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC+ CGKHQDPKF PE
Sbjct: 66  AHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPE 125

Query: 136 LSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFG 195
           LS++YQ +KCN DC CD++G  CVYER+YAEMS+SSGVL +D+ISFGN+S L PQRAVFG
Sbjct: 126 LSTSYQALKCNPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFG 185

Query: 196 CENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGI 255
           CENEETGDL+SQRADGIMGLG G LS+VDQLV+KGVI D FSLCYGGM++GGGAMVLG I
Sbjct: 186 CENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKI 245

Query: 256 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQE 315
           SPP  M+FS+SDP RSPYYN+DLK++HVAGK L L P VF+G++G+VLDSGTTY+Y P+E
Sbjct: 246 SPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKE 305

Query: 316 AFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSL 375
           AF   K+A++  + SLK+I GPDPN+ D CFSGAG D AE+   FP + + F NGQKL L
Sbjct: 306 AFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLIL 365

Query: 376 APENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSE 435
           +PENYLFRH+KV GAYCLGIF   + D TTLLGGI+VRNTLV YDRE+ K+GF KTNCS+
Sbjct: 366 SPENYLFRHTKVRGAYCLGIFP--DRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSD 425

Query: 436 LWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQIGRITFDILLNISYK 495
           +W RL   +  A    +S  +  ++ +P+ A SESP   +P   ++G ITF++ ++++  
Sbjct: 426 IWRRLAAPESPAPTSPISQ-NKSSNISPSPATSESPTSHLPGVFRVGVITFEVSISVNNS 485

Query: 496 HLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATTII 555
            L+P  + ++D IA EL++  +QVRLLNF+  GN   ++  + P  SSE+ S+ TA  I+
Sbjct: 486 SLKPKFSEIADFIAHELDIQSAQVRLLNFSSSGNEYRLKWGVFPPQSSEYISNTTALNIM 545

Query: 556 SLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVWFI 615
            L+ E+ ++LP ++GSY+++ W  E    +S W++  + V    M++L++    + +  +
Sbjct: 546 LLLKENRLRLPGQFGSYKLLEWKAEQKKKQSWWEKHLLGVVGGAMISLLVTSVMIKLALV 605

Query: 616 WRRRQQAFHSYKPVNAAAPEQELQTL 641
           WRRR+Q   +Y+PVNAA  EQELQ L
Sbjct: 606 WRRRKQEEATYEPVNAAIKEQELQPL 624

BLAST of CmaCh20G009810 vs. TAIR10
Match: AT5G22850.1 (AT5G22850.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 201.4 bits (511), Expect = 1.6e-51
Identity = 142/418 (33.97%), Postives = 210/418 (50.24%), Query Frame = 1

Query: 82  DLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPK-----FDPELS 141
           D  + G Y T+L +GTPP+ F + VDTGS V +V C++C  C +    +     FDP  S
Sbjct: 74  DPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSS 133

Query: 142 STYQPVKC----------NSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGN--QS 201
            T  P+ C          +SD  C      C Y  QY + S +SG    DV+ F     S
Sbjct: 134 VTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGS 193

Query: 202 ALVPQR---AVFGCENEETGDLY-SQRA-DGIMGLGSGDLSIVDQLVEKGVINDSFSLCY 261
           +LVP      VFGC   +TGDL  S RA DGI G G   +S++ QL  +G+    FS C 
Sbjct: 194 SLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253

Query: 262 GGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVF--DGR 321
            G + GGG +VLG I  P+ M+F+   P   P+YNV+L  I V G+ LP+ PSVF     
Sbjct: 254 KGENGGGGILVLGEIVEPN-MVFTPLVP-SQPHYNVNLLSISVNGQALPINPSVFSTSNG 313

Query: 322 YGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSK 381
            G+++D+GTT +YL + A+ PF  AI NA+    +   P  +  + C+    S    +  
Sbjct: 314 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR---PVVSKGNQCYVITTS----VGD 373

Query: 382 TFPTVDLIFDNGQKLSLAPENYLFRHSKVHG--AYCLGIFENGNNDQTTLLGGIIVRNTL 441
            FP V L F  G  + L P++YL + + V G   +C+G F+   N   T+LG +++++ +
Sbjct: 374 IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIG-FQRIQNQGITILGDLVLKDKI 433

Query: 442 VMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDM 474
            +YD    +IG+   +CS        +  N  A S S  S   +    S  + +P  +
Sbjct: 434 FVYDLVGQRIGWANYDCS--------TSVNVSATSSSGRSEYVNAGQFSENAAAPQKL 473

BLAST of CmaCh20G009810 vs. TAIR10
Match: AT1G08210.1 (AT1G08210.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 199.5 bits (506), Expect = 6.2e-51
Identity = 152/456 (33.33%), Postives = 226/456 (49.56%), Query Frame = 1

Query: 7   LLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRS--SPNSSKLISKPHRRLR 66
           ++ AVLL  L  T     +D +     L P +  + L   R+  S    +L+  P   + 
Sbjct: 11  IIAAVLL--LAATTLACGSDAVLKLERLIPPNHELGLTELRAFDSARHGRLLQSPVGGVV 70

Query: 67  GFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCG 126
            FP              D  L G Y T++ +GTPP++F + +DTGS V +V C++C  C 
Sbjct: 71  NFPVDGA---------SDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCP 130

Query: 127 KHQDPK-----FDPELSSTYQPVKCN-----------SDCTCDNDGVQCVYERQYAEMST 186
           K  + +     FDP +SS+   V C+           S C+ +N    C Y  +Y + S 
Sbjct: 131 KTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNN---LCSYSFKYGDGSG 190

Query: 187 SSGVLGDDVISFGN--QSALVPQRA---VFGCENEETGDLYSQR--ADGIMGLGSGDLSI 246
           +SG    D +SF     S L    +   VFGC N ++GDL   R   DGI GLG G LS+
Sbjct: 191 TSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSV 250

Query: 247 VDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRS-PYYNVDLKEI 306
           + QL  +G+    FS C  G   GGG MVLG I  P  +   Y+  V S P+YNV+L+ I
Sbjct: 251 ISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTV---YTPLVPSQPHYNVNLQSI 310

Query: 307 HVAGKKLPLEPSVFD--GRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDP 366
            V G+ LP++PSVF      G+++D+GTT +YLP EA+ PF  A+ NA   + + G P  
Sbjct: 311 AVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANA---VSQYGRPIT 370

Query: 367 NFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSLAPENYL-FRHSKVHGAYCLGIFEN 426
                CF     D       FP V L F  G  + L P  YL    S     +C+G F+ 
Sbjct: 371 YESYQCFEITAGDV----DVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIG-FQR 430

Query: 427 GNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCS 434
            ++ + T+LG +++++ +V+YD    +IG+ + +CS
Sbjct: 431 MSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441

BLAST of CmaCh20G009810 vs. TAIR10
Match: AT2G36670.1 (AT2G36670.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 182.2 bits (461), Expect = 1.0e-45
Identity = 127/377 (33.69%), Postives = 198/377 (52.52%), Query Frame = 1

Query: 89  YTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPK-----FDPELSSTYQPVK 148
           Y T++ +G+PP +F + +DTGS + +V CS+C  C            FD   S T   V 
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164

Query: 149 CNSDCTCDN----------DGVQCVYERQYAEMSTSSG-----------VLGDDVISFGN 208
           C SD  C +          +  QC Y  +Y + S +SG           +LG+ +++  N
Sbjct: 165 C-SDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA--N 224

Query: 209 QSALVPQRAVFGCENEETGDLYS--QRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 268
            SA +    VFGC   ++GDL    +  DGI G G G LS+V QL  +G+    FS C  
Sbjct: 225 SSAPI----VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLK 284

Query: 269 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD--GRY 328
           G   GGG  VLG I  P  M++S   P   P+YN++L  I V G+ LPL+ +VF+     
Sbjct: 285 GDGSGGGVFVLGEILVPG-MVYSPLVP-SQPHYNLNLLSIGVNGQMLPLDAAVFEASNTR 344

Query: 329 GSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKT 388
           G+++D+GTT +YL +EA+  F NAI N   S+ ++  P  +  + C+  + S    +S  
Sbjct: 345 GTIVDTGTTLTYLVKEAYDLFLNAISN---SVSQLVTPIISNGEQCYLVSTS----ISDM 404

Query: 389 FPTVDLIFDNGQKLSLAPENYLFRHSKVHGA--YCLGIFENGNNDQTTLLGGIIVRNTLV 434
           FP+V L F  G  + L P++YLF +    GA  +C+G F+    +Q T+LG +++++ + 
Sbjct: 405 FPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG-FQKAPEEQ-TILGDLVLKDKVF 463

BLAST of CmaCh20G009810 vs. NCBI nr
Match: gi|659115870|ref|XP_008457780.1| (PREDICTED: aspartic proteinase CDR1-like [Cucumis melo])

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 531/640 (82.97%), Postives = 573/640 (89.53%), Query Frame = 1

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MA++P LLL  +L      HF LSADPIS NPL+TPSHRAMVLPLY SS NSSK IS PH
Sbjct: 1   MAKSPFLLLPAIL-----LHFFLSADPISPNPLITPSHRAMVLPLYLSSSNSSKFISNPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           R LR FP S+NRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61  RHLRQFPTSDNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           E CG+HQDPKFDPE SSTY+P+KCN DCTCD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCTCDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL  S+FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSSIFDGRYGT 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTY+YLP EAFG FK+AIM+ LHSLKKI GPDPNFKD CFSGAGSDAAELS  FP
Sbjct: 301 VLDSGTTYAYLPAEAFGAFKDAIMDELHSLKKIDGPDPNFKDICFSGAGSDAAELSNIFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVD++F+NGQKLSLAPENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLAPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
            HSKIGFWKTNCSELWERL  SD+NAHAPS+S  SH +D APASAP ESPH  IP ++QI
Sbjct: 421 AHSKIGFWKTNCSELWERLRTSDDNAHAPSISTKSHGSDMAPASAPIESPHYTIPGELQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITF+ILLN SY  LEPHIT LSDHIAQELNVSHSQV LLNFTMRGN SLI+LAI+P G
Sbjct: 481 GRITFEILLNKSYTDLEPHITELSDHIAQELNVSHSQVLLLNFTMRGNDSLIKLAIIPYG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSE FSHAT  TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGLAI+V
Sbjct: 541 SSEIFSHATVNTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLAIIV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
             ILGLSA+G WFI R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFILRSRQQAINSYKPVNAAVPEQELQPL 634

BLAST of CmaCh20G009810 vs. NCBI nr
Match: gi|778669864|ref|XP_011649314.1| (PREDICTED: aspartic proteinase CDR1 [Cucumis sativus])

HSP 1 Score: 1073.5 bits (2775), Expect = 1.0e-310
Identity = 526/640 (82.19%), Postives = 569/640 (88.91%), Query Frame = 1

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
           MA++P L+ A+LLH        LSADPIS NPLL+PSHRAMVLPLY SSPNSSK IS PH
Sbjct: 1   MAKSPFLVAAILLHIF------LSADPISPNPLLSPSHRAMVLPLYLSSPNSSKFISNPH 60

Query: 61  RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
           RRLR FP S+N SNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61  RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120

Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
           E CG+HQDPKFDPE SSTY+P+KCN DC CD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180

Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
           GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240

Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
           GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL   +FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGA 300

Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
           VLDSGTTY+YLP EAF  FK+AIM+ +HSLKKI GPDPNFKD CFSGAGSDAAELS  FP
Sbjct: 301 VLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFP 360

Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
           TVD++F+NGQKLSL PENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420

Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
            +SKIGFWKTNCSELWERL ISD+NA  PSVS  SHD+D APASAPSE PH  IP ++QI
Sbjct: 421 ANSKIGFWKTNCSELWERLRISDDNADGPSVSTKSHDSDIAPASAPSERPHYTIPGELQI 480

Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
           GRITF ILLN SY  LEPHIT LSDHIAQELNVSHSQV +LNFTMRGN SLIQLAILP G
Sbjct: 481 GRITFAILLNKSYTDLEPHITELSDHIAQELNVSHSQVIILNFTMRGNDSLIQLAILPYG 540

Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
           SSE FSHATA TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGL I+V
Sbjct: 541 SSEIFSHATANTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLVIVV 600

Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
             ILGLSA+G WF+ R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFVLRSRQQAINSYKPVNAAVPEQELQPL 633

BLAST of CmaCh20G009810 vs. NCBI nr
Match: gi|590705429|ref|XP_007047435.1| (Aspartyl protease family protein [Theobroma cacao])

HSP 1 Score: 850.9 bits (2197), Expect = 1.4e-243
Identity = 420/628 (66.88%), Postives = 501/628 (79.78%), Query Frame = 1

Query: 21  FTLS-ADPISSNPLLTP-----SHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSN 80
           F LS ++P +S PLL P     +  AM+LPL+    NSS+  S   R L    + ++  N
Sbjct: 19  FLLSRSNPSTSTPLLLPPPHHGARPAMILPLFPFPKNSSRTFSHSGRHLLRSDSHSSHPN 78

Query: 81  ARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPE 140
           ARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPC+TCE CG+HQDPKF P+
Sbjct: 79  ARMRLYDDLLLNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCATCEQCGRHQDPKFQPD 138

Query: 141 LSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFG 200
           LSSTYQPVKCN DC+CD D VQC YERQYAEMS+SSGVLG+D+ISFGNQS LVPQRAVFG
Sbjct: 139 LSSTYQPVKCNLDCSCDTDRVQCTYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAVFG 198

Query: 201 CENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGI 260
           CENEETGDLYSQ ADGIMGLG GDLS+VDQLVEKGVI+DSFSLCYGGMDIGGGAMVLGGI
Sbjct: 199 CENEETGDLYSQHADGIMGLGRGDLSVVDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGI 258

Query: 261 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQE 320
           S P +M+FSYSDP RSPYYN+DLK IHVAGK+LPL P+VFD +YG+VLDSGTTY+YLP+ 
Sbjct: 259 SSPPDMVFSYSDPERSPYYNIDLKAIHVAGKQLPLNPNVFDVKYGTVLDSGTTYAYLPEA 318

Query: 321 AFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSL 380
           AF  FKNAI+  L SLK+I GPDPN+ D CFSGA SD +ELSK FPTV+++FDN QKL L
Sbjct: 319 AFAAFKNAIIKELTSLKQIRGPDPNYNDICFSGASSDVSELSKIFPTVEMVFDNQQKLLL 378

Query: 381 APENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSE 440
           APENYLFRHSKV G YCLGIF N   D TTLLGGIIVRNTLV YDREH KIGFWKTNCSE
Sbjct: 379 APENYLFRHSKVRGGYCLGIFPN-EKDPTTLLGGIIVRNTLVTYDREHLKIGFWKTNCSE 438

Query: 441 LWERLHISDENAHAPSVSNTSHDT--DTAPASAPSESPHDMIPEDIQIGRITFDILLNIS 500
           LWERL I+   + +PS S+   ++  ++ P SAP  S H  IP +IQIG IT D+ L+I 
Sbjct: 439 LWERLRINGAPSPSPSSSSGKDNSTVESPPTSAPDGSSHYAIPGEIQIGEITLDMSLSID 498

Query: 501 YKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATT 560
           Y +L+PHI  L++ IA+EL+V+ SQV LL+FT  GN SL+  AI+P+GS+ + S+  A +
Sbjct: 499 YSYLKPHINELAEFIAKELDVNASQVHLLDFTSEGNSSLVTWAIVPSGSATYISNVAAIS 558

Query: 561 IISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVW 620
           IIS + EH ++LP  +G+YQ+++W VEP + ++ W++ Y++V LAIM+T+I+GLSA G W
Sbjct: 559 IISQLAEHRVRLPDTFGNYQLVQWKVEPSVQQTWWQQHYLVVLLAIMITIIVGLSASGGW 618

Query: 621 FIWRRRQQAFHSYKPVNAAAPEQELQTL 641
            IWRRRQQA   YKPV+ A  EQELQ L
Sbjct: 619 IIWRRRQQALKLYKPVDGAVSEQELQPL 645

BLAST of CmaCh20G009810 vs. NCBI nr
Match: gi|823184460|ref|XP_012489205.1| (PREDICTED: aspartic proteinase-like protein 2 [Gossypium raimondii])

HSP 1 Score: 850.1 bits (2195), Expect = 2.5e-243
Identity = 421/641 (65.68%), Postives = 507/641 (79.10%), Query Frame = 1

Query: 6   NLLLAVLLHFLHLTHFTLSADPISSNP--LLTPSHR----AMVLPLYRSSPNSSKLISKP 65
           NL +  ++ FL    F LS    S++P  LL P H     AMVLPL+ SS NSS+     
Sbjct: 7   NLAVGTVVFFLL---FLLSQSNPSTSPPRLLPPPHHGARPAMVLPLFPSSKNSSRTFLHS 66

Query: 66  HRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCST 125
           HR L    + ++  NARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPC+T
Sbjct: 67  HRHLLRSDSHSSHPNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCAT 126

Query: 126 CELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVIS 185
           CE CG+HQDPKF P+LSSTYQPVKCN DC CD+D VQC+YERQYAEMS+SSGVLG+D+IS
Sbjct: 127 CEQCGRHQDPKFQPDLSSTYQPVKCNLDCNCDSDRVQCIYERQYAEMSSSSGVLGEDIIS 186

Query: 186 FGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCY 245
           FGNQS LVPQRAVFGCENEETGDLYSQ ADGIMGLG GDLS+VDQLVEKGVI+DSFSLCY
Sbjct: 187 FGNQSELVPQRAVFGCENEETGDLYSQHADGIMGLGRGDLSVVDQLVEKGVISDSFSLCY 246

Query: 246 GGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYG 305
           GGMDIGGGAMVLGGIS PS+M+FSY+DPVRSPYY++ LKEIHVAGK+L L PSVFDG+YG
Sbjct: 247 GGMDIGGGAMVLGGISAPSDMVFSYADPVRSPYYSIGLKEIHVAGKQLSLNPSVFDGKYG 306

Query: 306 SVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTF 365
           +VLDSGTTY+YLP+ AF  FK AI+  L+ LK+I GPDPN+ D CFS A SD +ELSKTF
Sbjct: 307 TVLDSGTTYAYLPEPAFLAFKEAILKELNGLKQIRGPDPNYNDICFSTASSDVSELSKTF 366

Query: 366 PTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYD 425
           PTV+++F + QKL L+PENYLFRHSKVHGAYCLGIF+N   D TTLLGGIIVRNTLV YD
Sbjct: 367 PTVEMVFGDQQKLLLSPENYLFRHSKVHGAYCLGIFQN-EKDPTTLLGGIIVRNTLVTYD 426

Query: 426 REHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQ 485
           REHSKIGFWKTNCSELWERLHI+   +  PS S   + T++   +A   SPH   P  IQ
Sbjct: 427 REHSKIGFWKTNCSELWERLHITGALSPTPSSSGKGNSTESPTTTASDGSPHYDFPGKIQ 486

Query: 486 IGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPN 545
           IG+I  D+ L+ ++ +L+P I  L++ IA+EL+V+ SQV LLNFT  GN SL++LAI+P+
Sbjct: 487 IGKIILDMSLSTNHSYLKPQINKLTEFIAKELDVNASQVHLLNFTSEGNSSLVRLAIVPS 546

Query: 546 GSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIM 605
            SS +    TA  IIS + EH +KLP  +G+YQ+++W VEP   ++ W R Y++V +A++
Sbjct: 547 DSSTYIYKETARNIISRLAEHRVKLPDTFGNYQLVQWKVEPSTKQTWWGRNYMVVVVALI 606

Query: 606 VTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           + +++GLS  GVW +WRR+QQ  +SYKPV AAAPEQELQ L
Sbjct: 607 IIVVIGLSVYGVWGMWRRKQQTVNSYKPVGAAAPEQELQPL 643

BLAST of CmaCh20G009810 vs. NCBI nr
Match: gi|802645015|ref|XP_012079339.1| (PREDICTED: aspartic proteinase-like protein 2 [Jatropha curcas])

HSP 1 Score: 845.9 bits (2184), Expect = 4.7e-242
Identity = 418/645 (64.81%), Postives = 507/645 (78.60%), Query Frame = 1

Query: 1   MARTPNLLLAVLLHFLHLTHFTLSADPISSNP----LLTPSHRAMVLPLYRSSPNSSKLI 60
           MA TP  L+     F       +  D +S+N     LL  +  A++LPL+ S  NSSK +
Sbjct: 1   MASTPIQLIIFFYFFFFQLDAAIVLD-VSANSTTTVLLGGATPALILPLFLSPSNSSKQL 60

Query: 61  SKPHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVP 120
           S P R L G  N++ R NARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVP
Sbjct: 61  SNPPRHLLG-SNASARPNARMRLYDDLLLNGYYTTRLWIGTPPQRFALIVDTGSTVTYVP 120

Query: 121 CSTCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDD 180
           CSTCE CG HQDPKF PELSSTYQP+KCN DC CD++  QC+Y+R+YAEMSTSSGVL +D
Sbjct: 121 CSTCEQCGNHQDPKFQPELSSTYQPLKCNPDCNCDDEREQCIYDRRYAEMSTSSGVLAED 180

Query: 181 VISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFS 240
            ISFGNQS L PQRAVFGCEN ETGDLYSQ ADGIMGLGSGDLSIVDQLVEKGVI+DSFS
Sbjct: 181 FISFGNQSELEPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKGVISDSFS 240

Query: 241 LCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDG 300
           LCYGGM+IGGGAMVLG +SPPS M+F+YSDPVRS YYN+DL+EIHVAGK+LPLEP VFD 
Sbjct: 241 LCYGGMNIGGGAMVLGSLSPPSGMVFTYSDPVRSQYYNIDLREIHVAGKRLPLEPGVFDR 300

Query: 301 RYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELS 360
           ++G++LDSGTTY+YLP+  F  FK+AIM  LHSLK+I GPDPN+ D CFSGAGS+ ++LS
Sbjct: 301 KHGTILDSGTTYAYLPEAVFKAFKDAIMKELHSLKQIRGPDPNYNDICFSGAGSEVSQLS 360

Query: 361 KTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLV 420
             FPTVD+IF++GQK SL+PENYLFRH+KV GAYCLGIF NG  D TTLLGGIIVRNTLV
Sbjct: 361 NAFPTVDMIFEHGQKWSLSPENYLFRHTKVPGAYCLGIFPNG-KDPTTLLGGIIVRNTLV 420

Query: 421 MYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSN-TSHDTDTAPASAPSESPHDMIP 480
           MYDRE+SK+GFWKTNCSELWERLHI+   A  PS SN T+   +  P  APS+  H ++P
Sbjct: 421 MYDRENSKVGFWKTNCSELWERLHITSAAAPLPSDSNGTNITVEIPPTLAPSDQLHYVLP 480

Query: 481 EDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLA 540
           +++QIG+ITF++ L  +Y HL+ H T L   IAQ+L V+ SQV LL    +GN SLI   
Sbjct: 481 DELQIGQITFEMSLKANYSHLKIHATELIGFIAQQLGVNSSQVHLLKLASKGNDSLIGWT 540

Query: 541 ILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVG 600
           I+P+GS++  S+ATA +IIS + EHH++LP  +GSY+++ W +EP  +R+ W++ Y+  G
Sbjct: 541 IVPSGSADHISNATALSIISRVAEHHIQLPDTFGSYRLVHWKIEPPANRTWWQQHYLFAG 600

Query: 601 LAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
           L +++ LILGLSA G+ FIWR R+Q F +Y+PVN A PEQELQ L
Sbjct: 601 LVVIIVLILGLSASGLLFIWRCREQTFSAYRPVNTAVPEQELQPL 642

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ASPL2_ARATH1.7e-3428.71Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=... [more]
ASPG2_ARATH3.8e-3429.64Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
ASPG1_ARATH3.8e-3428.93Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
NEP1_NEPGR1.6e-3230.08Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
NEP2_NEPGR2.1e-3231.62Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LJB9_CUCSA6.9e-31182.19Uncharacterized protein OS=Cucumis sativus GN=Csa_2G277070 PE=3 SV=1[more]
A0A061DHD4_THECC1.0e-24366.88Aspartyl protease family protein OS=Theobroma cacao GN=TCM_000732 PE=3 SV=1[more]
A0A0D2QM13_GOSRA1.7e-24365.68Uncharacterized protein OS=Gossypium raimondii GN=B456_007G056000 PE=3 SV=1[more]
G7JCS6_MEDTR3.6e-24165.21Eukaryotic aspartyl protease family protein OS=Medicago truncatula GN=MTR_4g0952... [more]
V4SQ94_9ROSI1.8e-24064.85Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025144mg PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G50050.14.4e-20658.81 Eukaryotic aspartyl protease family protein[more]
AT5G43100.13.5e-20355.91 Eukaryotic aspartyl protease family protein[more]
AT5G22850.11.6e-5133.97 Eukaryotic aspartyl protease family protein[more]
AT1G08210.16.2e-5133.33 Eukaryotic aspartyl protease family protein[more]
AT2G36670.11.0e-4533.69 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|659115870|ref|XP_008457780.1|0.0e+0082.97PREDICTED: aspartic proteinase CDR1-like [Cucumis melo][more]
gi|778669864|ref|XP_011649314.1|1.0e-31082.19PREDICTED: aspartic proteinase CDR1 [Cucumis sativus][more]
gi|590705429|ref|XP_007047435.1|1.4e-24366.88Aspartyl protease family protein [Theobroma cacao][more]
gi|823184460|ref|XP_012489205.1|2.5e-24365.68PREDICTED: aspartic proteinase-like protein 2 [Gossypium raimondii][more]
gi|802645015|ref|XP_012079339.1|4.7e-24264.81PREDICTED: aspartic proteinase-like protein 2 [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh20G009810.1CmaCh20G009810.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 404..419
score: 8.3E-9coord: 300..311
score: 8.3E-9coord: 247..260
score: 8.3E-9coord: 95..115
score: 8.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 40..440
score: 2.9E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 104..115
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 86..254
score: 2.8E-38coord: 267..436
score: 8.0
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 85..437
score: 3.2
NoneNo IPR availablePANTHERPTHR13683:SF342SUBFAMILY NOT NAMEDcoord: 40..440
score: 2.9E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
CmaCh20G009810CmaCh02G005060Cucurbita maxima (Rimu)cmacmaB469