CsaUNG008290 (gene) Cucumber (Chinese Long) v2

Name: CsaUNG008290
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Aspartic proteinase nepenthesin-1, putative; contains IPR001461 (Peptidase A1), IPR021109 (Aspartic peptidase)
Location: Scaffold000122 : 145927 .. 147992 (-)
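The location string can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page); `parse_location` and `LOCATION_RE` are hypothetical names introduced for illustration.

```python
import re

# Pattern for strings such as "Scaffold000122 : 145927 .. 147992 (-)".
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into scaffold, start, end, strand and length.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Scaffold000122 : 145927 .. 147992 (-)")
print(loc["strand"], loc["length"])  # - 2066
```

Under that assumption the feature spans 2066 bp on the minus strand of Scaffold000122.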
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TCCATCTCTGTATTCACCAATTAATGCACACCTCAACTAATGTTATAAAACAACCCGTTTGATCCTACAATATTTAAGTATCACCACCGTAGATTGAACTCAGCATTTCTACTCGTGGCTTCTACTTACTGCCTACATGGATAAATAGAGGTCTACCCATGATTGGTGTACACATCTATACATCCATATATATCTTTTACCAACTTCACCTTCAGTTTCTTATCTATTTCTTTCTACAGAAAACACTCACAAACCAGTAACTCAATCATAATCCAACCCCACAAATTTTCTTTTTCCCTCTCATATTTCTTCTCACACTTAAATCCCCCATTTCTCATTTCCATTCTCAAGCTCCCCTACAACACTCACTTCACAGTTCAACTACTAAACCATGCTACCTTTCCTTCTTCTTTCTCTTTTAGCCACCGCAGTCGCTTCGGTTGCCACTGGTCCTGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAGCAGAAACCGCACCCTCTAGACTACCACAAGATCTTGAACTCCATGAAAACTACCCTATTTTTGAGTTAGACAACAACAGCAGCCAAAGTCAATGGAAGCTCAAGCTTTTCCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCCCCGTCGTTTCAAGGAGCGTATCAGCAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCGCCTACTCTCCAGTGGCAGCGATGAGCAGGTGATCAGGAGTGAAGCTAAGAAGGAAGAAGGGAGCCAATTCCCCTCCTTTGAGTTCATATAAATATACTCAATATTGTTTCAAAGATTATAGATAAAAAAAATGTTGGAATATAGAATTTTTGTTTCTTTCTGATTTTGCTTCTATTTTCATTTTTTGGCTGTGGCCGTGCAGGTGACGGACTTCGGATCGGATGTGGTCTCCGGCACGGAGCAGGGTAGCGGAGAGTACTTCGTAAGGATAGGCGTTGGCAGCCCGCCGAGGAGCCAATACGTAGTGATTGATTCCGGCAGCGACATTGTATGGGTGCAATGCCAGCCCTGCAGCGAATGCTACCAACAATCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCACCTACGCCGGAATCTCCTGTGACTCATCAGTGTGTGACCGCCTCGACAACGCAGGCTGCAACGATGGCCGGTGCCGGTACGAGGTGTCATACGGCGACGGATCCTACACCCGCGGCACCCTCGCGCTCGAAACCTTAACTTTCGGCCGGGTCCTAATCCGAAACATCGCGATTGGGTGCGGCCATATGAACCGAGGAATGTTCATCGGAGCCGCAGGGTTGCTCGGTCTCGGCGGCGGCGCCATGTCATTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTAGTCAGTCGAGGCACAGAGTCCACAGGAACACTGGAATTCGGCCGCGGCGCTATGCCAGTCGGCGCCGCGTGGGTTCCCCTAATCCGAAATCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGTCTCGGAGTCGGAGGGATCCGAGTACCAATACCCGAACAAATCTTCGAACTCACCGATTTAGGGTACGGTGGTGTAGTAATGGACACCGGAACCGCCGTGACGAGGCTACCGGCGCCGGCGTACGAAGCATTCCGAGACACGTTCATCGGACAAACGGCAAACCTACCTCGATCGGACAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCTTTCTACTTCTCCGGTGGGCCAATACTGACGTTGCCGGCGAGGAATTTTCTAATCCCGGTGGACGGAGAAGGGACATTTTGCTTTGCATTTGCAGCATCGGCGTCAGGATTGTCGATCATAGGGAACATTCAACAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAACTATTTGTTAATTAT
TCCCTTCATCAAATATCAAACGAAGACCCAATCTCAAATTATGTCTTAAATACGTTTTTTCCCCTC

mRNA sequence

ATGCTACCTTTCCTTCTTCTTTCTCTTTTAGCCACCGCAGTCGCTTCGGTTGCCACTGGTCCTGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAGCAGAAACCGCACCCTCTAGACTACCACAAGATCTTGAACTCCATGAAAACTACCCTATTTTTGAGTTAGACAACAACAGCAGCCAAAGTCAATGGAAGCTCAAGCTTTTCCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCCCCGTCGTTTCAAGGAGCGTATCAGCAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCGCCTACTCTCCAGTGGCAGCGATGAGCAGGTGACGGACTTCGGATCGGATGTGGTCTCCGGCACGGAGCAGGGTAGCGGAGAGTACTTCGTAAGGATAGGCGTTGGCAGCCCGCCGAGGAGCCAATACGTAGTGATTGATTCCGGCAGCGACATTGTATGGGTGCAATGCCAGCCCTGCAGCGAATGCTACCAACAATCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCACCTACGCCGGAATCTCCTGTGACTCATCAGTGTGTGACCGCCTCGACAACGCAGGCTGCAACGATGGCCGGTGCCGGTACGAGGTGTCATACGGCGACGGATCCTACACCCGCGGCACCCTCGCGCTCGAAACCTTAACTTTCGGCCGGGTCCTAATCCGAAACATCGCGATTGGGTGCGGCCATATGAACCGAGGAATGTTCATCGGAGCCGCAGGGTTGCTCGGTCTCGGCGGCGGCGCCATGTCATTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTAGTCAGTCGAGGCACAGAGTCCACAGGAACACTGGAATTCGGCCGCGGCGCTATGCCAGTCGGCGCCGCGTGGGTTCCCCTAATCCGAAATCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGTCTCGGAGTCGGAGGGATCCGAGTACCAATACCCGAACAAATCTTCGAACTCACCGATTTAGGGTACGGTGGTGTAGTAATGGACACCGGAACCGCCGTGACGAGGCTACCGGCGCCGGCGTACGAAGCATTCCGAGACACGTTCATCGGACAAACGGCAAACCTACCTCGATCGGACAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCTTTCTACTTCTCCGGTGGGCCAATACTGACGTTGCCGGCGAGGAATTTTCTAATCCCGGTGGACGGAGAAGGGACATTTTGCTTTGCATTTGCAGCATCGGCGTCAGGATTGTCGATCATAGGGAACATTCAACAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAACTATTTGTTAA

Coding sequence (CDS)

ATGCTACCTTTCCTTCTTCTTTCTCTTTTAGCCACCGCAGTCGCTTCGGTTGCCACTGGTCCTGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAGCAGAAACCGCACCCTCTAGACTACCACAAGATCTTGAACTCCATGAAAACTACCCTATTTTTGAGTTAGACAACAACAGCAGCCAAAGTCAATGGAAGCTCAAGCTTTTCCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCCCCGTCGTTTCAAGGAGCGTATCAGCAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCGCCTACTCTCCAGTGGCAGCGATGAGCAGGTGACGGACTTCGGATCGGATGTGGTCTCCGGCACGGAGCAGGGTAGCGGAGAGTACTTCGTAAGGATAGGCGTTGGCAGCCCGCCGAGGAGCCAATACGTAGTGATTGATTCCGGCAGCGACATTGTATGGGTGCAATGCCAGCCCTGCAGCGAATGCTACCAACAATCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCACCTACGCCGGAATCTCCTGTGACTCATCAGTGTGTGACCGCCTCGACAACGCAGGCTGCAACGATGGCCGGTGCCGGTACGAGGTGTCATACGGCGACGGATCCTACACCCGCGGCACCCTCGCGCTCGAAACCTTAACTTTCGGCCGGGTCCTAATCCGAAACATCGCGATTGGGTGCGGCCATATGAACCGAGGAATGTTCATCGGAGCCGCAGGGTTGCTCGGTCTCGGCGGCGGCGCCATGTCATTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTAGTCAGTCGAGGCACAGAGTCCACAGGAACACTGGAATTCGGCCGCGGCGCTATGCCAGTCGGCGCCGCGTGGGTTCCCCTAATCCGAAATCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGTCTCGGAGTCGGAGGGATCCGAGTACCAATACCCGAACAAATCTTCGAACTCACCGATTTAGGGTACGGTGGTGTAGTAATGGACACCGGAACCGCCGTGACGAGGCTACCGGCGCCGGCGTACGAAGCATTCCGAGACACGTTCATCGGACAAACGGCAAACCTACCTCGATCGGACAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCTTTCTACTTCTCCGGTGGGCCAATACTGACGTTGCCGGCGAGGAATTTTCTAATCCCGGTGGACGGAGAAGGGACATTTTGCTTTGCATTTGCAGCATCGGCGTCAGGATTGTCGATCATAGGGAACATTCAACAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAACTATTTGTTAA

Protein sequence

MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC*
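The protein sequence follows from the CDS by reading it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and an unambiguous A/C/G/T sequence; `translate` and `CODON_TABLE` are illustrative names, not part of the database. The stop codon is written as `*`, matching the trailing `*` in the protein sequence above.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS (length divisible by 3) into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # A KeyError here means the sequence contains a non-ACGT symbol.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons and last four codons of the CDS listed above.
print(translate("ATGCTACCTTTCCTTCTT"))  # MLPFLL
print(translate("ACTATTTGTTAA"))        # TIC*
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above if the record is internally consistent.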
BLAST of CsaUNG008290 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 638.3 bits (1645), Expect = 6.9e-182
Identity = 307/466 (65.88%), Positives = 371/466 (79.61%), Query Frame = 1

Query: 17  VATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSSQSQWKLKLF 76
           +++  + ++P  Q++   D ++   T  + LP     H +        + S S++ L+L 
Sbjct: 16  LSSSSSISFPDFQII---DVLQPPLTVTATLPDFNNTHFS--------DESSSKYTLRLL 75

Query: 77  HRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLS------SGSDEQVTDFGSDVVSGT 136
           HRD+ P     +H  R   R+ RD+ RVS++LR +S      S S  +V DFGSD+VSG 
Sbjct: 76  HRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGM 135

Query: 137 EQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGI 196
           +QGSGEYFVRIGVGSPPR QY+VIDSGSD+VWVQCQPC  CY+QSDPVFDPA S +Y G+
Sbjct: 136 DQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGV 195

Query: 197 SCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMNR 256
           SC SSVCDR++N+GC+ G CRYEV YGDGSYT+GTLALETLTF + ++RN+A+GCGH NR
Sbjct: 196 SCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNR 255

Query: 257 GMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGAAWV 316
           GMFIGAAGLLG+GGG+MSFVGQL GQTGGAF YCLVSRGT+STG+L FGR A+PVGA+WV
Sbjct: 256 GMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVGASWV 315

Query: 317 PLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYE 376
           PL+RNPRAPSFYYVGL GLGVGG+R+P+P+ +F+LT+ G GGVVMDTGTAVTRLP  AY 
Sbjct: 316 PLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYV 375

Query: 377 AFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPV 436
           AFRD F  QTANLPR+  VSIFDTCY+L+GFVSVRVPTVSFYF+ GP+LTLPARNFL+PV
Sbjct: 376 AFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPV 435

Query: 437 DGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           D  GT+CFAFAAS +GLSIIGNIQQEGIQ+S DG+NGFVGFGP +C
Sbjct: 436 DDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
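
The percentages in each HSP summary line are the identity and positive counts divided by the alignment length. The sketch below recomputes them from the summary line of the ASPG2_ARATH hit; `hsp_percentages` and `HSP_RE` are hypothetical helpers, and the pattern tolerates the "Postives" spelling that some BLAST report exports use.

```python
import re

# Matches "Identity = 307/466 (65.88%), Positives = 371/466 (79.61%)".
# "Posi?tives" also accepts the "Postives" spelling found in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


summary = "Identity = 307/466 (65.88%), Positives = 371/466 (79.61%), Query Frame = 1"
print(hsp_percentages(summary))  # (65.88, 79.61)
```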

BLAST of CsaUNG008290 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 2.3e-113
Identity = 218/496 (43.95%), Positives = 301/496 (60.69%), Query Frame = 1

Query: 1   MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF 60
           +L  + LSL  T     ++   +T P T +L+V  ++++ +T  S  P    L    P  
Sbjct: 9   LLAVVTLSLFLTTT-DASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPES 68

Query: 61  ELDNN--SSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSL---LRLLSSGS 120
             D    +S S   L+L  RD    +   D+      R+ RDS RV+ +   +R    G 
Sbjct: 69  LSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGV 128

Query: 121 DE-------------QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVW 180
           D              Q  D  + VVSG  QGSGEYF RIGVG+P +  Y+V+D+GSD+ W
Sbjct: 129 DRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNW 188

Query: 181 VQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYT 240
           +QC+PC++CYQQSDPVF+P  S+TY  ++C +  C  L+ + C   +C Y+VSYGDGS+T
Sbjct: 189 IQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFT 248

Query: 241 RGTLALETLTFGRV-LIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAF 300
            G LA +T+TFG    I N+A+GCGH N G+F GAAGLLGLGGG +S   Q+      +F
Sbjct: 249 VGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQM---KATSF 308

Query: 301 SYCLVSRGTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQ 360
           SYCLV R +  + +L+F    +  G A  PL+RN +  +FYYVGLSG  VGG +V +P+ 
Sbjct: 309 SYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDA 368

Query: 361 IFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPR-SDRVSIFDTCYNLNG 420
           IF++   G GGV++D GTAVTRL   AY + RD F+  T NL + S  +S+FDTCY+ + 
Sbjct: 369 IFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSS 428

Query: 421 FVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQI 477
             +V+VPTV+F+F+GG  L LPA+N+LIPVD  GTFCFAFA ++S LSIIGN+QQ+G +I
Sbjct: 429 LSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRI 488

BLAST of CsaUNG008290 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 4.4e-112
Identity = 215/423 (50.83%), Positives = 270/423 (63.83%), Query Frame = 1

Query: 64  NNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTD-- 123
           ++ S S   L L H D L  N  PD    F  R+ RDS+RV S+  L +      VT   
Sbjct: 65  DSESSSSITLNLDHIDALSSNKTPDE--LFSSRLQRDSRRVKSIATLAAQIPGRNVTHAP 124

Query: 124 ----FGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP 183
               F S VVSG  QGSGEYF R+GVG+P R  Y+V+D+GSDIVW+QC PC  CY QSDP
Sbjct: 125 RPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP 184

Query: 184 VFDPAGSATYAGISCDSSVCDRLDNAGCNDGR--CRYEVSYGDGSYTRGTLALETLTFGR 243
           +FDP  S TYA I C S  C RLD+AGCN  R  C Y+VSYGDGS+T G  + ETLTF R
Sbjct: 185 IFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRR 244

Query: 244 VLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES-TG 303
             ++ +A+GCGH N G+F+GAAGLLGLG G +SF GQ G +    FSYCLV R   S   
Sbjct: 245 NRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPS 304

Query: 304 TLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELTDLGYGGV 363
           ++ FG  A+   A + PL+ NP+  +FYYVGL G+ VGG RVP +   +F+L  +G GGV
Sbjct: 305 SVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGV 364

Query: 364 VMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYF 423
           ++D+GT+VTRL  PAY A RD F      L R+   S+FDTC++L+    V+VPTV  +F
Sbjct: 365 IIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF 424

Query: 424 SGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGP 477
            G  + +LPA N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++  D ++  VGF P
Sbjct: 425 RGADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAP 484

BLAST of CsaUNG008290 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 267.7 bits (683), Expect = 2.4e-70
Identity = 152/389 (39.07%), Positives = 215/389 (55.27%), Query Frame = 1

Query: 94  KERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVV 153
           K  I R  +R+ S+  +L S S  +   +  D         GEY + + +G+P  S   +
Sbjct: 62  KRAIKRGERRMRSINAMLQSSSGIETPVYAGD---------GEYLMNVAIGTPDSSFSAI 121

Query: 154 IDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYE 213
           +D+GSD++W QC+PC++C+ Q  P+F+P  S++++ + C+S  C  L +  CN+  C+Y 
Sbjct: 122 MDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYT 181

Query: 214 VSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMNRGMFIG-AAGLLGLGGGAMSFVGQ 273
             YGDGS T+G +A ET TF    + NIA GCG  N+G   G  AGL+G+G G +S   Q
Sbjct: 182 YGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ 241

Query: 274 LGGQTGGAFSYCLVSRGTESTGTLEFGRGA--MPVGAAWVPLIRNPRAPSFYYVGLSGLG 333
           LG    G FSYC+ S G+ S  TL  G  A  +P G+    LI +   P++YY+ L G+ 
Sbjct: 242 LG---VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGIT 301

Query: 334 VGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVS 393
           VGG  + IP   F+L D G GG+++D+GT +T LP  AY A    F  Q  NLP  D  S
Sbjct: 302 VGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ-INLPTVDESS 361

Query: 394 I-FDTCYNL-NGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASAS-GL 453
               TC+   +   +V+VP +S  F GG +L L  +N LI    EG  C A  +S+  G+
Sbjct: 362 SGLSTCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILIS-PAEGVICLAMGSSSQLGI 421

Query: 454 SIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           SI GNIQQ+  Q+  D  N  V F PT C
Sbjct: 422 SIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CsaUNG008290 vs. Swiss-Prot
Match: ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 7.4e-67
Identity = 148/405 (36.54%), Positives = 215/405 (53.09%), Query Frame = 1

Query: 87  PDHPRRFKERISRDSKRVSSLLRLLSSG------SDEQVTDFGSDVVSGTEQGSGEYFVR 146
           PDH     E +  D  RV+S+   LS        S+ + TD  +    G+  GSG Y V 
Sbjct: 82  PDHV----EILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAK--DGSTLGSGNYIVT 141

Query: 147 IGVGSPPRSQYVVIDSGSDIVWVQCQPCSE-CYQQSDPVFDPAGSATYAGISCDSSVCDR 206
           +G+G+P     ++ D+GSD+ W QCQPC   CY Q +P+F+P+ S +Y  +SC S+ C  
Sbjct: 142 VGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGS 201

Query: 207 LDNA-----GCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV-LIRNIAIGCGHMNRGMF 266
           L +A      C+   C Y + YGD S++ G LA E  T     +   +  GCG  N+G+F
Sbjct: 202 LSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLF 261

Query: 267 IGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGAAWVPLI 326
            G AGLLGLG   +SF  Q        FSYCL S  +  TG L FG   +     + P+ 
Sbjct: 262 TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASY-TGHLTFGSAGISRSVKFTPIS 321

Query: 327 RNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFR 386
                 SFY + +  + VGG ++PIP  +F        G ++D+GT +TRLP  AY A R
Sbjct: 322 TITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALR 381

Query: 387 DTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGE 446
            +F  + +  P +  VSI DTC++L+GF +V +P V+F FSGG ++ L ++  +  V   
Sbjct: 382 SSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKG-IFYVFKI 441

Query: 447 GTFCFAFAASA--SGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
              C AFA ++  S  +I GN+QQ+ +++  DG+ G VGF P  C
Sbjct: 442 SQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473

BLAST of CsaUNG008290 vs. TrEMBL
Match: A0A0D2T690_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G147500 PE=3 SV=1)

HSP 1 Score: 699.5 bits (1804), Expect = 2.8e-198
Identity = 343/484 (70.87%), Positives = 404/484 (83.47%), Query Frame = 1

Query: 2   LPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 61
           LP +L+++L   ++S AT   A+YP  QLLNVK T+       +++P+ L+  E++ + +
Sbjct: 7   LPIVLVAMLHLTLSSAAT---ASYPDFQLLNVKQTL-----IGTKIPRPLQTSEHHQVSD 66

Query: 62  LDNNSSQSQWKLKLFHRDKLPLNFDP---DHPRRFKERISRDSKRVSSLLRLLSSGSDE- 121
           +  + +Q +WKLKL HRDKL  N      DH RRF  R+ RD KRV+SLLR LS G    
Sbjct: 67  V--SETQGKWKLKLVHRDKLSSNTSATFRDHSRRFHARMQRDVKRVASLLRRLSGGGGHD 126

Query: 122 -----QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECY 181
                +V DFGSDVVSG +QGSGEYFVRIGVGSPP+SQY+VIDSGSDIVWVQCQPC++CY
Sbjct: 127 GGAAYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPKSQYMVIDSGSDIVWVQCQPCNQCY 186

Query: 182 QQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLT 241
           +QSDPVFDPA SA+YAGISC S+VCDR++N+GCN GRCRYEV YGDGSYT+GTLALETLT
Sbjct: 187 RQSDPVFDPADSASYAGISCSSAVCDRIENSGCNAGRCRYEVLYGDGSYTKGTLALETLT 246

Query: 242 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES 301
           FGR +++N+AIGCGH+NRGMFIGAAGLLGLGGG++S VGQLGGQTGGAFSYCLVSRG+++
Sbjct: 247 FGRTVVKNVAIGCGHINRGMFIGAAGLLGLGGGSLSLVGQLGGQTGGAFSYCLVSRGSDA 306

Query: 302 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 361
           +G+LEFGRGAMPVGAAWVPL+RNP+APSFYYVGLSGLGVGGIRVP+ E IF+LT+LGYGG
Sbjct: 307 SGSLEFGRGAMPVGAAWVPLLRNPQAPSFYYVGLSGLGVGGIRVPVSEDIFQLTELGYGG 366

Query: 362 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 421
           VVMDTGTAV+R P  AY+A RD FI QTANLPR   VSIFDTCY L+ FV++RVPTVSFY
Sbjct: 367 VVMDTGTAVSRFPTLAYKALRDAFIAQTANLPRISTVSIFDTCYKLSDFVTIRVPTVSFY 426

Query: 422 FSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFG 477
           FSGGPILTLPA NFLIPVD  GTFC AFA+S SGLSIIGNIQQEGIQIS DG+NGFVGFG
Sbjct: 427 FSGGPILTLPASNFLIPVDDVGTFCLAFASSTSGLSIIGNIQQEGIQISFDGANGFVGFG 480

BLAST of CsaUNG008290 vs. TrEMBL
Match: A0A061E1W9_THECC (Aspartic proteinase nepenthesin-1, putative OS=Theobroma cacao GN=TCM_007665 PE=3 SV=1)

HSP 1 Score: 698.7 bits (1802), Expect = 4.8e-198
Identity = 349/484 (72.11%), Positives = 401/484 (82.85%), Query Frame = 1

Query: 2   LPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 61
           L  +L+++L   V+ +AT   A++P  QLLNVK T+   +  P+ L +  E HE     E
Sbjct: 7   LAMILVAVLQLTVSGIAT---ASHPDFQLLNVKQTLIGTKK-PTPL-KTFEYHEQSNASE 66

Query: 62  LDNNSSQSQWKLKLFHRDKLPLNFDP---DHPRRFKERISRDSKRVSSLLRLLSSGSDE- 121
            D    Q +WKLKL HRDKL  N      DH  RF  R+ RD KRV+SL+RLLS G    
Sbjct: 67  SD----QGKWKLKLVHRDKLFSNTTTAFHDHSHRFLARMQRDVKRVASLVRLLSGGGGHD 126

Query: 122 -----QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECY 181
                +V DFGSDVVSG +QGSGEYFVRIGVGSPPRSQY+VIDSGSDIVWVQCQPC++CY
Sbjct: 127 GDAAYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCY 186

Query: 182 QQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLT 241
           +QSDPVFDPA SA+Y+G+SC SSVCDR++N+GC+ GRCRYEV YGDGSYT+GTLALETLT
Sbjct: 187 RQSDPVFDPANSASYSGVSCTSSVCDRIENSGCHAGRCRYEVMYGDGSYTKGTLALETLT 246

Query: 242 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES 301
           FGR +++N+AIGCGH+NRGMFIGAAGLLG+GGG+MS VGQLGGQTGGAFSYCLVSRG+++
Sbjct: 247 FGRTVVKNVAIGCGHINRGMFIGAAGLLGVGGGSMSLVGQLGGQTGGAFSYCLVSRGSDA 306

Query: 302 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 361
           +G+L FGRGAMPVGAAWVPL+RNPRAPSFYYVGLSGLGVGGIRVP+ E  F L++LGYGG
Sbjct: 307 SGSLVFGRGAMPVGAAWVPLLRNPRAPSFYYVGLSGLGVGGIRVPVSEDTFRLSELGYGG 366

Query: 362 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 421
           VVMDTGTAVTR P  AY AFRD F+ QTANLPR+  VSIFDTCYNL+GFVSVRVPTVSFY
Sbjct: 367 VVMDTGTAVTRFPTLAYNAFRDAFVAQTANLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 426

Query: 422 FSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFG 477
           FSGGPILTLPARNFLIPVD  GTFCFAFA+SASGLSIIGNIQQEGIQIS DG+NGFVGFG
Sbjct: 427 FSGGPILTLPARNFLIPVDDVGTFCFAFASSASGLSIIGNIQQEGIQISFDGANGFVGFG 481

BLAST of CsaUNG008290 vs. TrEMBL
Match: A0A0L9U1G0_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan02g264700 PE=3 SV=1)

HSP 1 Score: 686.4 bits (1770), Expect = 2.5e-194
Identity = 335/470 (71.28%), Positives = 391/470 (83.19%), Query Frame = 1

Query: 9   LLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSSQ 68
           LL + + S ++    +YP  Q L+VK T+ + +  P+  P+    H   P   +D++S++
Sbjct: 6   LLLSFLLSTSSSHNLSYPHFQQLDVKQTLAQTKLIPTPTPKQPSNHHQTPNTVIDSSSTK 65

Query: 69  SQWKLKLFHRDKLP-LNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTD-FGSDV 128
              KLKL HRDK+P  N   DH  RF  RI RDS+RV++LLR L+SG     ++ FGSDV
Sbjct: 66  G--KLKLVHRDKVPTFNTSHDHQTRFNARIKRDSRRVAALLRRLASGKPTLDSEAFGSDV 125

Query: 129 VSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSAT 188
           VSG EQGSGEYFVRIGVGSPPR+QYVVIDSGSDI+WVQC+PC++CY QSDPVF+PA S++
Sbjct: 126 VSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSS 185

Query: 189 YAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCG 248
           YAG+SC+S+VC  +DNAGC++GRCRYEVSYGDGSYT+GTLALET+TFGR LIRN+AIGCG
Sbjct: 186 YAGVSCESTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAIGCG 245

Query: 249 HMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVG 308
           H N+GMF+GAAGLLGLGGG MSFVGQLGGQTG AFSYCLVSRGT+S G LEFGR AMPVG
Sbjct: 246 HRNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGEAFSYCLVSRGTQSPGLLEFGREAMPVG 305

Query: 309 AAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPA 368
           AAWVPLI NPRAPSFYYVGL GLGVGG+R+PIPE +F+L+++G GGVVMDTGTAVTRLP 
Sbjct: 306 AAWVPLIHNPRAPSFYYVGLLGLGVGGLRIPIPEDVFKLSEMGDGGVVMDTGTAVTRLPT 365

Query: 369 PAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNF 428
            AYEAFRD FI QT NLPR+   SIFDTCY+L GFVSVRVPTVSFYFSGGPILTLPARNF
Sbjct: 366 AAYEAFRDGFIAQTTNLPRASGASIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNF 425

Query: 429 LIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           LIPVD  GTFCFAFA S+S LSIIGNIQQEGIQIS+DG+NGFVGFGP +C
Sbjct: 426 LIPVDDVGTFCFAFAPSSSALSIIGNIQQEGIQISVDGANGFVGFGPNVC 473

BLAST of CsaUNG008290 vs. TrEMBL
Match: A0A0S3SL96_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.08G014700 PE=3 SV=1)

HSP 1 Score: 686.4 bits (1770), Expect = 2.5e-194
Identity = 335/470 (71.28%), Positives = 391/470 (83.19%), Query Frame = 1

Query: 9   LLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSSQ 68
           LL + + S ++    +YP  Q L+VK T+ + +  P+  P+    H   P   +D++S++
Sbjct: 6   LLLSFLLSTSSSHNLSYPHFQQLDVKQTLAQTKLIPTPTPKQPSNHHQTPNTVIDSSSTK 65

Query: 69  SQWKLKLFHRDKLP-LNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTD-FGSDV 128
              KLKL HRDK+P  N   DH  RF  RI RDS+RV++LLR L+SG     ++ FGSDV
Sbjct: 66  G--KLKLVHRDKVPTFNTSHDHQTRFNARIKRDSRRVAALLRRLASGKPTLDSEAFGSDV 125

Query: 129 VSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSAT 188
           VSG EQGSGEYFVRIGVGSPPR+QYVVIDSGSDI+WVQC+PC++CY QSDPVF+PA S++
Sbjct: 126 VSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSS 185

Query: 189 YAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCG 248
           YAG+SC+S+VC  +DNAGC++GRCRYEVSYGDGSYT+GTLALET+TFGR LIRN+AIGCG
Sbjct: 186 YAGVSCESTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAIGCG 245

Query: 249 HMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVG 308
           H N+GMF+GAAGLLGLGGG MSFVGQLGGQTG AFSYCLVSRGT+S G LEFGR AMPVG
Sbjct: 246 HRNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGEAFSYCLVSRGTQSPGLLEFGREAMPVG 305

Query: 309 AAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPA 368
           AAWVPLI NPRAPSFYYVGL GLGVGG+R+PIPE +F+L+++G GGVVMDTGTAVTRLP 
Sbjct: 306 AAWVPLIHNPRAPSFYYVGLLGLGVGGLRIPIPEDVFKLSEMGDGGVVMDTGTAVTRLPT 365

Query: 369 PAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNF 428
            AYEAFRD FI QT NLPR+   SIFDTCY+L GFVSVRVPTVSFYFSGGPILTLPARNF
Sbjct: 366 AAYEAFRDGFIAQTTNLPRASGASIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNF 425

Query: 429 LIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           LIPVD  GTFCFAFA S+S LSIIGNIQQEGIQIS+DG+NGFVGFGP +C
Sbjct: 426 LIPVDDVGTFCFAFAPSSSALSIIGNIQQEGIQISVDGANGFVGFGPNVC 473

BLAST of CsaUNG008290 vs. TrEMBL
Match: F6HVZ3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0071g00770 PE=3 SV=1)

HSP 1 Score: 683.3 bits (1762), Expect = 2.1e-193
Identity = 335/469 (71.43%), Positives = 385/469 (82.09%), Query Frame = 1

Query: 12  TAVASVATGPA--ATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSSQS 71
           T  AS AT     ++YP  Q LNVK+TI      P      LE+ E       D+     
Sbjct: 24  TTAASAATAAINNSSYPTFQHLNVKETIAGTRIIP------LEVSE-------DHEEGGE 83

Query: 72  QWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGS--DEQVTDFGSDVV 131
           +W +K+ HRD+L      DH  R   R+ RD+KRV+SL+R LSSG     +V DFG+DV+
Sbjct: 84  KWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVDDFGTDVI 143

Query: 132 SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATY 191
           SG EQGSGEYFVRIGVGSPPRSQY+VIDSGSDIVWVQCQPC++CY QSDPVFDPA SA++
Sbjct: 144 SGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASF 203

Query: 192 AGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGH 251
            G+SC SSVCDRL+NAGC+ GRCRYEVSYGDGSYT+GTLALETLTFGR ++R++AIGCGH
Sbjct: 204 TGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGH 263

Query: 252 MNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGA 311
            NRGMF+GAAGLLGLGGG+MSFVGQLGGQTGGAFSYCLVSRGT+S+G+L FGR A+P GA
Sbjct: 264 RNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREALPAGA 323

Query: 312 AWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAP 371
           AWVPL+RNPRAPSFYY+GL+GLGVGGIRVPI E++F LT+LG GGVVMDTGTAVTRLP  
Sbjct: 324 AWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTL 383

Query: 372 AYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFL 431
           AY+AFRD F+ QTANLPR+  V+IFDTCY+L GFVSVRVPTVSFYFSGGPILTLPARNFL
Sbjct: 384 AYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFL 443

Query: 432 IPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           IP+D  GTFCFAFA S SGLSI+GNIQQEGIQIS DG+NG+VGFGP IC
Sbjct: 444 IPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479

BLAST of CsaUNG008290 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 638.3 bits (1645), Expect = 3.9e-183
Identity = 307/466 (65.88%), Positives = 371/466 (79.61%), Query Frame = 1

Query: 17  VATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSSQSQWKLKLF 76
           +++  + ++P  Q++   D ++   T  + LP     H +        + S S++ L+L 
Sbjct: 16  LSSSSSISFPDFQII---DVLQPPLTVTATLPDFNNTHFS--------DESSSKYTLRLL 75

Query: 77  HRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLS------SGSDEQVTDFGSDVVSGT 136
           HRD+ P     +H  R   R+ RD+ RVS++LR +S      S S  +V DFGSD+VSG 
Sbjct: 76  HRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGM 135

Query: 137 EQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGI 196
           +QGSGEYFVRIGVGSPPR QY+VIDSGSD+VWVQCQPC  CY+QSDPVFDPA S +Y G+
Sbjct: 136 DQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGV 195

Query: 197 SCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMNR 256
           SC SSVCDR++N+GC+ G CRYEV YGDGSYT+GTLALETLTF + ++RN+A+GCGH NR
Sbjct: 196 SCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNR 255

Query: 257 GMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGAAWV 316
           GMFIGAAGLLG+GGG+MSFVGQL GQTGGAF YCLVSRGT+STG+L FGR A+PVGA+WV
Sbjct: 256 GMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVGASWV 315

Query: 317 PLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYE 376
           PL+RNPRAPSFYYVGL GLGVGG+R+P+P+ +F+LT+ G GGVVMDTGTAVTRLP  AY 
Sbjct: 316 PLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYV 375

Query: 377 AFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPV 436
           AFRD F  QTANLPR+  VSIFDTCY+L+GFVSVRVPTVSFYF+ GP+LTLPARNFL+PV
Sbjct: 376 AFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPV 435

Query: 437 DGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           D  GT+CFAFAAS +GLSIIGNIQQEGIQ+S DG+NGFVGFGP +C
Sbjct: 436 DDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CsaUNG008290 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 411.0 bits (1055), Expect = 1.0e-114
Identity = 205/472 (43.43%), Positives = 298/472 (63.14%), Query Frame = 1

Query: 21  PAATYPATQLLNVKDTIKEAE-TAPSRLPQDLELHENYPIFELDNNSSQSQWKLKLFHRD 80
           P  +   T +LNV D+I   + T+  RL Q  E            +S+ S + L+L  R 
Sbjct: 26  PETSTTTTSILNVADSIHRTKYTSSFRLNQQEE----------QTHSASSSFSLQLHSRV 85

Query: 81  KLPLNFDPDHPRRFKERISRDSKRVSSL---------------LRLLSSGSDEQVTDFGS 140
            +      D+      R++RD+ RV SL               L+ +S+    +  D  +
Sbjct: 86  SVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQDIEA 145

Query: 141 DVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGS 200
            ++SGT QGSGEYF R+G+G P R  Y+V+D+GSD+ W+QC PC++CY Q++P+F+P+ S
Sbjct: 146 PLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSS 205

Query: 201 ATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIG 260
           ++Y  +SCD+  C+ L+ + C +  C YEVSYGDGSYT G  A ETLT G  L++N+A+G
Sbjct: 206 SSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVG 265

Query: 261 CGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMP 320
           CGH N G+F+GAAGLLGLGGG ++   QL      +FSYCLV R ++S  T++FG    P
Sbjct: 266 CGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVDFGTSLSP 325

Query: 321 VGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRL 380
             A   PL+RN +  +FYY+GL+G+ VGG  + IP+  FE+ + G GG+++D+GTAVTRL
Sbjct: 326 -DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRL 385

Query: 381 PAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPAR 440
               Y + RD+F+  T +L ++  V++FDTCYNL+   +V VPTV+F+F GG +L LPA+
Sbjct: 386 QTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAK 445

Query: 441 NFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           N++IPVD  GTFC AFA +AS L+IIGN+QQ+G +++ D +N  +GF    C
Sbjct: 446 NYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of CsaUNG008290 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 410.6 bits (1054), Expect = 1.3e-114
Identity = 218/496 (43.95%), Positives = 301/496 (60.69%), Query Frame = 1

Query: 1   MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF 60
           +L  + LSL  T     ++   +T P T +L+V  ++++ +T  S  P    L    P  
Sbjct: 9   LLAVVTLSLFLTTT-DASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPES 68

Query: 61  ELDNN--SSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSL---LRLLSSGS 120
             D    +S S   L+L  RD    +   D+      R+ RDS RV+ +   +R    G 
Sbjct: 69  LSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGV 128

Query: 121 DE-------------QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVW 180
           D              Q  D  + VVSG  QGSGEYF RIGVG+P +  Y+V+D+GSD+ W
Sbjct: 129 DRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNW 188

Query: 181 VQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYT 240
           +QC+PC++CYQQSDPVF+P  S+TY  ++C +  C  L+ + C   +C Y+VSYGDGS+T
Sbjct: 189 IQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFT 248

Query: 241 RGTLALETLTFGRV-LIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAF 300
            G LA +T+TFG    I N+A+GCGH N G+F GAAGLLGLGGG +S   Q+      +F
Sbjct: 249 VGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQM---KATSF 308

Query: 301 SYCLVSRGTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQ 360
           SYCLV R +  + +L+F    +  G A  PL+RN +  +FYYVGLSG  VGG +V +P+ 
Sbjct: 309 SYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDA 368

Query: 361 IFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPR-SDRVSIFDTCYNLNG 420
           IF++   G GGV++D GTAVTRL   AY + RD F+  T NL + S  +S+FDTCY+ + 
Sbjct: 369 IFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSS 428

Query: 421 FVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQI 477
             +V+VPTV+F+F+GG  L LPA+N+LIPVD  GTFCFAFA ++S LSIIGN+QQ+G +I
Sbjct: 429 LSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRI 488

BLAST of CsaUNG008290 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 406.4 bits (1043), Expect = 2.5e-113
Identity = 215/423 (50.83%), Positives = 270/423 (63.83%), Query Frame = 1

Query: 64  NNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTD-- 123
           ++ S S   L L H D L  N  PD    F  R+ RDS+RV S+  L +      VT   
Sbjct: 65  DSESSSSITLNLDHIDALSSNKTPDE--LFSSRLQRDSRRVKSIATLAAQIPGRNVTHAP 124

Query: 124 ----FGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP 183
               F S VVSG  QGSGEYF R+GVG+P R  Y+V+D+GSDIVW+QC PC  CY QSDP
Sbjct: 125 RPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP 184

Query: 184 VFDPAGSATYAGISCDSSVCDRLDNAGCNDGR--CRYEVSYGDGSYTRGTLALETLTFGR 243
           +FDP  S TYA I C S  C RLD+AGCN  R  C Y+VSYGDGS+T G  + ETLTF R
Sbjct: 185 IFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRR 244

Query: 244 VLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES-TG 303
             ++ +A+GCGH N G+F+GAAGLLGLG G +SF GQ G +    FSYCLV R   S   
Sbjct: 245 NRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPS 304

Query: 304 TLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELTDLGYGGV 363
           ++ FG  A+   A + PL+ NP+  +FYYVGL G+ VGG RVP +   +F+L  +G GGV
Sbjct: 305 SVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGV 364

Query: 364 VMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYF 423
           ++D+GT+VTRL  PAY A RD F      L R+   S+FDTC++L+    V+VPTV  +F
Sbjct: 365 IIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF 424

Query: 424 SGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGP 477
            G  + +LPA N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++  D ++  VGF P
Sbjct: 425 RGADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAP 484

BLAST of CsaUNG008290 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 374.0 bits (959), Expect = 1.4e-103
Identity = 196/430 (45.58%), Positives = 263/430 (61.16%), Query Frame = 1

Query: 65  NSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTD--- 124
           + S +   + L H D L    D      F  R+ RDS RV S+  L +  +    T    
Sbjct: 55  SESTTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTP 114

Query: 125 -----FGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 184
                F   V+SG  QGSGEYF+R+GVG+P  + Y+V+D+GSD+VW+QC PC  CY Q+D
Sbjct: 115 RTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTD 174

Query: 185 PVFDPAGSATYAGISCDSSVCDRLDNAG-CNDGR---CRYEVSYGDGSYTRGTLALETLT 244
            +FDP  S T+A + C S +C RLD++  C   R   C Y+VSYGDGS+T G  + ETLT
Sbjct: 175 AIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT 234

Query: 245 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSR---- 304
           F    + ++ +GCGH N G+F+GAAGLLGLG G +SF  Q   +  G FSYCLV R    
Sbjct: 235 FHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSG 294

Query: 305 -GTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELT 364
             ++   T+ FG  A+P  + + PL+ NP+  +FYY+ L G+ VGG RVP + E  F+L 
Sbjct: 295 SSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLD 354

Query: 365 DLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRV 424
             G GGV++D+GT+VTRL  PAY A RD F      L R+   S+FDTC++L+G  +V+V
Sbjct: 355 ATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKV 414

Query: 425 PTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSN 477
           PTV F+F GG + +LPA N+LIPV+ EG FCFAFA +   LSIIGNIQQ+G +++ D   
Sbjct: 415 PTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVG 474

BLAST of CsaUNG008290 vs. NCBI nr
Match: gi|449464952|ref|XP_004150193.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 964.9 bits (2493), Expect = 5.1e-278
Identity = 476/476 (100.00%), Positives = 476/476 (100.00%), Query Frame = 1

Query: 1   MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF 60
           MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF
Sbjct: 1   MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF 60

Query: 61  ELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT 120
           ELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT
Sbjct: 61  ELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT 120

Query: 121 DFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFD 180
           DFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFD
Sbjct: 121 DFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFD 180

Query: 181 PAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRN 240
           PAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRN
Sbjct: 181 PAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRN 240

Query: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR 300
           IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR
Sbjct: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR 300

Query: 301 GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA 360
           GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA
Sbjct: 301 GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA 360

Query: 361 VTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILT 420
           VTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILT
Sbjct: 361 VTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILT 420

Query: 421 LPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           LPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC
Sbjct: 421 LPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476

BLAST of CsaUNG008290 vs. NCBI nr
Match: gi|659072346|ref|XP_008465249.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo])

HSP 1 Score: 940.3 bits (2429), Expect = 1.4e-270
Identity = 464/477 (97.27%), Positives = 469/477 (98.32%), Query Frame = 1

Query: 1   MLPFLLL-SLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPI 60
           MLP LLL  LLATAV+SVATGPAATYPATQLLNVKDTIKE ET PSRLPQDL LHENYP+
Sbjct: 1   MLPLLLLLPLLATAVSSVATGPAATYPATQLLNVKDTIKETETTPSRLPQDLNLHENYPL 60

Query: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQV 120
           FELDNNSSQSQWKLKLFHRDKLPLNFD +HPRRFKERISRDSKRVSSLLRLLS+ SDEQV
Sbjct: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDTNHPRRFKERISRDSKRVSSLLRLLSNASDEQV 120

Query: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180
           TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF
Sbjct: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180

Query: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIR 240
           DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR+LIR
Sbjct: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRILIR 240

Query: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300
           NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG
Sbjct: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300

Query: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360
           RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT
Sbjct: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360

Query: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420
           AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL
Sbjct: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420

Query: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC
Sbjct: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477

BLAST of CsaUNG008290 vs. NCBI nr
Match: gi|823209871|ref|XP_012438084.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Gossypium raimondii])

HSP 1 Score: 699.5 bits (1804), Expect = 4.0e-198
Identity = 343/484 (70.87%), Positives = 404/484 (83.47%), Query Frame = 1

Query: 2   LPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 61
           LP +L+++L   ++S AT   A+YP  QLLNVK T+       +++P+ L+  E++ + +
Sbjct: 7   LPIVLVAMLHLTLSSAAT---ASYPDFQLLNVKQTL-----IGTKIPRPLQTSEHHQVSD 66

Query: 62  LDNNSSQSQWKLKLFHRDKLPLNFDP---DHPRRFKERISRDSKRVSSLLRLLSSGSDE- 121
           +  + +Q +WKLKL HRDKL  N      DH RRF  R+ RD KRV+SLLR LS G    
Sbjct: 67  V--SETQGKWKLKLVHRDKLSSNTSATFRDHSRRFHARMQRDVKRVASLLRRLSGGGGHD 126

Query: 122 -----QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECY 181
                +V DFGSDVVSG +QGSGEYFVRIGVGSPP+SQY+VIDSGSDIVWVQCQPC++CY
Sbjct: 127 GGAAYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPKSQYMVIDSGSDIVWVQCQPCNQCY 186

Query: 182 QQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLT 241
           +QSDPVFDPA SA+YAGISC S+VCDR++N+GCN GRCRYEV YGDGSYT+GTLALETLT
Sbjct: 187 RQSDPVFDPADSASYAGISCSSAVCDRIENSGCNAGRCRYEVLYGDGSYTKGTLALETLT 246

Query: 242 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES 301
           FGR +++N+AIGCGH+NRGMFIGAAGLLGLGGG++S VGQLGGQTGGAFSYCLVSRG+++
Sbjct: 247 FGRTVVKNVAIGCGHINRGMFIGAAGLLGLGGGSLSLVGQLGGQTGGAFSYCLVSRGSDA 306

Query: 302 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 361
           +G+LEFGRGAMPVGAAWVPL+RNP+APSFYYVGLSGLGVGGIRVP+ E IF+LT+LGYGG
Sbjct: 307 SGSLEFGRGAMPVGAAWVPLLRNPQAPSFYYVGLSGLGVGGIRVPVSEDIFQLTELGYGG 366

Query: 362 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 421
           VVMDTGTAV+R P  AY+A RD FI QTANLPR   VSIFDTCY L+ FV++RVPTVSFY
Sbjct: 367 VVMDTGTAVSRFPTLAYKALRDAFIAQTANLPRISTVSIFDTCYKLSDFVTIRVPTVSFY 426

Query: 422 FSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFG 477
           FSGGPILTLPA NFLIPVD  GTFC AFA+S SGLSIIGNIQQEGIQIS DG+NGFVGFG
Sbjct: 427 FSGGPILTLPASNFLIPVDDVGTFCLAFASSTSGLSIIGNIQQEGIQISFDGANGFVGFG 480

BLAST of CsaUNG008290 vs. NCBI nr
Match: gi|590689368|ref|XP_007043208.1| (Aspartic proteinase nepenthesin-1, putative [Theobroma cacao])

HSP 1 Score: 698.7 bits (1802), Expect = 6.8e-198
Identity = 349/484 (72.11%), Positives = 401/484 (82.85%), Query Frame = 1

Query: 2   LPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 61
           L  +L+++L   V+ +AT   A++P  QLLNVK T+   +  P+ L +  E HE     E
Sbjct: 7   LAMILVAVLQLTVSGIAT---ASHPDFQLLNVKQTLIGTKK-PTPL-KTFEYHEQSNASE 66

Query: 62  LDNNSSQSQWKLKLFHRDKLPLNFDP---DHPRRFKERISRDSKRVSSLLRLLSSGSDE- 121
            D    Q +WKLKL HRDKL  N      DH  RF  R+ RD KRV+SL+RLLS G    
Sbjct: 67  SD----QGKWKLKLVHRDKLFSNTTTAFHDHSHRFLARMQRDVKRVASLVRLLSGGGGHD 126

Query: 122 -----QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECY 181
                +V DFGSDVVSG +QGSGEYFVRIGVGSPPRSQY+VIDSGSDIVWVQCQPC++CY
Sbjct: 127 GDAAYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCY 186

Query: 182 QQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLT 241
           +QSDPVFDPA SA+Y+G+SC SSVCDR++N+GC+ GRCRYEV YGDGSYT+GTLALETLT
Sbjct: 187 RQSDPVFDPANSASYSGVSCTSSVCDRIENSGCHAGRCRYEVMYGDGSYTKGTLALETLT 246

Query: 242 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES 301
           FGR +++N+AIGCGH+NRGMFIGAAGLLG+GGG+MS VGQLGGQTGGAFSYCLVSRG+++
Sbjct: 247 FGRTVVKNVAIGCGHINRGMFIGAAGLLGVGGGSMSLVGQLGGQTGGAFSYCLVSRGSDA 306

Query: 302 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 361
           +G+L FGRGAMPVGAAWVPL+RNPRAPSFYYVGLSGLGVGGIRVP+ E  F L++LGYGG
Sbjct: 307 SGSLVFGRGAMPVGAAWVPLLRNPRAPSFYYVGLSGLGVGGIRVPVSEDTFRLSELGYGG 366

Query: 362 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 421
           VVMDTGTAVTR P  AY AFRD F+ QTANLPR+  VSIFDTCYNL+GFVSVRVPTVSFY
Sbjct: 367 VVMDTGTAVTRFPTLAYNAFRDAFVAQTANLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 426

Query: 422 FSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFG 477
           FSGGPILTLPARNFLIPVD  GTFCFAFA+SASGLSIIGNIQQEGIQIS DG+NGFVGFG
Sbjct: 427 FSGGPILTLPARNFLIPVDDVGTFCFAFASSASGLSIIGNIQQEGIQISFDGANGFVGFG 481

BLAST of CsaUNG008290 vs. NCBI nr
Match: gi|1009138729|ref|XP_015886741.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ziziphus jujuba])

HSP 1 Score: 687.2 bits (1772), Expect = 2.1e-194
Identity = 339/486 (69.75%), Positives = 396/486 (81.48%), Query Frame = 1

Query: 5   LLLSLLATAVASVATGPAA---TYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 64
           L+++++   +AS ++   +   +YP  Q LNVK TI      P       E  ++  IF+
Sbjct: 54  LIIAMIIPLLASTSSSSPSNKISYPDFQSLNVKATITAMVKNPKPPTNTSESDDDDGIFQ 113

Query: 65  -LDNNSSQSQWKLKLFHRDKLPL----NFDPDHPRRFKERISRDSKRVSSLLRLLSSGSD 124
             DN+ S+ +WKLKL HRDK+      N   D    F+ER+ RD +RV++L+R L     
Sbjct: 114 GHDNDESERKWKLKLLHRDKMATMSNNNLLGDLHHHFQERMKRDVQRVAALVRRLERQQS 173

Query: 125 E------QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSE 184
           E      +V DFGS+VVSG  +GSGEYFVRIGVGSPPRSQY+VIDSGSDIVWVQCQPCS+
Sbjct: 174 EAGSQSFEVEDFGSEVVSGMNEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 233

Query: 185 CYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALET 244
           CY QSDPVFDPA SA+Y G+SC S+VC+RL+NAGC+ GRCRYEVSYGDGSYT+GTLALET
Sbjct: 234 CYHQSDPVFDPAASASYTGVSCSSAVCNRLENAGCHAGRCRYEVSYGDGSYTKGTLALET 293

Query: 245 LTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGT 304
           LTFGR +IRN+AIGCGHMN+GMF+GAAGLLGLG G+MSFVGQLGGQTGGA+SYCLVSRGT
Sbjct: 294 LTFGRAVIRNVAIGCGHMNQGMFVGAAGLLGLGSGSMSFVGQLGGQTGGAYSYCLVSRGT 353

Query: 305 ESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGY 364
            S+G+LEFGRGAMPVGAAWVPL+RNPRAPSFYY+GL GLGVGGI++PI E IF LT+LGY
Sbjct: 354 VSSGSLEFGRGAMPVGAAWVPLVRNPRAPSFYYIGLVGLGVGGIKLPISEDIFRLTELGY 413

Query: 365 GGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVS 424
           GGVVMDTGTAVTRLP  AYE FRDTFI QTANLPR+  +SIFDTCYNLNGF+SVRVPT+S
Sbjct: 414 GGVVMDTGTAVTRLPTLAYETFRDTFIQQTANLPRASGISIFDTCYNLNGFISVRVPTIS 473

Query: 425 FYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVG 477
           FYFSGGP LTLPA NFLIPVD +GTFCFAFA + +GLSIIGNIQQEGIQISIDG+NGFVG
Sbjct: 474 FYFSGGPTLTLPASNFLIPVDDKGTFCFAFAPNPTGLSIIGNIQQEGIQISIDGANGFVG 533
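
Each HSP summary line above reports identity and positives as a count over the alignment length, followed by a percentage. As a minimal sketch (the function name `parse_hsp_summary` and the returned field names are hypothetical, not part of this database or of BLAST itself), such a line could be parsed and its percentages recomputed like this:

```python
import re

# Hypothetical helper for illustration only.
# The pattern accepts both "Positives" and the "Postives" spelling
# that appears in some BLAST-derived reports.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, aligned, _, positives, _, _ = match.groups()
    identical, aligned, positives = int(identical), int(aligned), int(positives)
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Percentages are recomputed from the counts, rounded to 2 decimals.
        "identity_pct": round(100.0 * identical / aligned, 2),
        "positives_pct": round(100.0 * positives / aligned, 2),
    }


if __name__ == "__main__":
    summary = "Identity = 215/423 (50.83%), Positives = 270/423 (63.83%), Query Frame = 1"
    print(parse_hsp_summary(summary))
```

For the AT1G01300.1 hit, the recomputed values (215/423 and 270/423) agree with the reported 50.83% and 63.83%.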

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
ASPG2_ARATH  6.9e-182  65.88         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
ASPG1_ARATH  2.3e-113  43.95         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
APF2_ARATH   4.4e-112  50.83         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP2_NEPGR   2.4e-70   39.07         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
ASPA_ARATH   7.4e-67   36.54         Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 ...

Match Name        E-value   Identity (%)  Description
A0A0D2T690_GOSRA  2.8e-198  70.87         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G147500 PE=3 SV=1
A0A061E1W9_THECC  4.8e-198  72.11         Aspartic proteinase nepenthesin-1, putative OS=Theobroma cacao GN=TCM_007665 PE=...
A0A0L9U1G0_PHAAN  2.5e-194  71.28         Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan02g264700 PE=3 SV=1
A0A0S3SL96_PHAAN  2.5e-194  71.28         Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.08G014700 PE=...
F6HVZ3_VITVI      2.1e-193  71.43         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0071g00770 PE=3 SV=...

Match Name   E-value   Identity (%)  Description
AT3G20015.1  3.9e-183  65.88         Eukaryotic aspartyl protease family protein
AT1G25510.1  1.0e-114  43.43         Eukaryotic aspartyl protease family protein
AT3G18490.1  1.3e-114  43.95         Eukaryotic aspartyl protease family protein
AT1G01300.1  2.5e-113  50.83         Eukaryotic aspartyl protease family protein
AT3G61820.1  1.4e-103  45.58         Eukaryotic aspartyl protease family protein

Match Name                         E-value   Identity (%)  Description
gi|449464952|ref|XP_004150193.1|   5.1e-278  100.00        PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus]
gi|659072346|ref|XP_008465249.1|   1.4e-270  97.27         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo]
gi|823209871|ref|XP_012438084.1|   4.0e-198  70.87         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Gossypium raimondii]
gi|590689368|ref|XP_007043208.1|   6.8e-198  72.11         Aspartic proteinase nepenthesin-1, putative [Theobroma cacao]
gi|1009138729|ref|XP_015886741.1|  2.1e-194  69.75         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ziziphus jujuba]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR001969  Aspartic_peptidase_AS
IPR021109  Peptidase_aspartic_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity

Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity

This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU134830      cucumber EST collection version 3.0  transcribed_cluster
CU148644      cucumber EST collection version 3.0  transcribed_cluster
CU160515      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsaUNG008290.1  CsaUNG008290.1  mRNA

The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU148644      CU148644     transcribed_cluster
CU134830      CU134830     transcribed_cluster
CU160515      CU160515     transcribed_cluster

Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                  Source   Source Term       Source Description                         Alignment
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683         ASPARTYL PROTEASES                         coord: 61..476 score: 1.4E-275; coord: 1..19 score: 1.4E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141           ASP_PROTEASE                               coord: 152..163 scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10  -                                          coord: 128..299 score: 4.2E-34; coord: 308..476 score: 7.5
IPR021109  Aspartic peptidase domain        unknown  SSF50630          Acid proteases                             coord: 132..476 score: 7.37
None       No IPR available                 PANTHER  PTHR13683:SF265   PROTEIN ASPARTIC PROTEASE IN GUARD CELL 2  coord: 61..476 score: 1.4E-275; coord: 1..19 score: 1.4E