CSPI04G00110 (gene) Wild cucumber (PI 183967)

Name: CSPI04G00110
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Peptidase, M50 family
Location: Chr4 : 67846 .. 71844 (-)
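
The location string above can be turned into numeric coordinates with a few lines of Python. This is a minimal sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr4 : 67846 .. 71844 (-)' into its parts.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("Chr4 : 67846 .. 71844 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 3999 bases on the minus strand of Chr4.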
The following sequences are available for this feature:

Gene sequence (with intron)

AGAGAATCAACGTCAACTCGGATGAACAGGTAAAGAACACTCGAACAATTTCGTCAATGAATATGAAAAACAAAAATCTCCATATTTCATTTCAGCCCCACAAGTCTCTTACGAATCATCGTCTTCCCCATTCTTCTCCATGATTCTAGACTCCTCTTCCAATCCCTTTATTTCCCTCCATCCTGCCATGGATATCTCCATCTCCAAAGTCGAACTCGAAAATTCAAATTTCGAATCATTTCTCACATGGCCGCTCTCTCAATCGCTTCAAATTCATGGTTCATCAGTCACAGAGAGAAGCACTATACAAGCCGCACAATGGCCAAGCCTTTTGGTAAGATTCCACTTGGGCGTAGAACCGGCGGTTACTTCTTTACAATTTCCGCACCTATTACAGAGGATCGTTTAAGATTCTCCGCCAGAGATGATTCGGAAAGCGAGCCGTCGTCTTCCTCGATCGCGGTGGTTTCCGATGAGAGGGGAGGCGGAAATGACAACGAGATGGCGGAACTGTCTGCCGGAGAACATGGGGGTGAAGAGAGAGAGAAACAACAGGAAATGGATTGGAAGACGGACGAGGAGTTCAAGAAGTTCATGGGAAATCCTTCTATTGAAGCTGCCATAAAGCTGGAGAAGAAGAGGGCGGATAGGAAGCTGAAGGAGCTTGATCGTGAAGGGGCCAATAATCCGATCGTGGGGTTGTTCAATAGAATTGCTCGGGATAATTTGGAAAAAGAGAAGGAGAGATTAAAGAAGGCTGAAGAGACTTTCAAGGCTCTTGATCTCAGCAAGGTATTTATCTATTTTTCTCCGGATCTTTATGTGCATGTACACACACGTCGTATCCATGTTTTGCATAAAGGTTTTGACACTTGCCAGTTGCCGATGTCAGTCTAGCACATTTAAATAAAATATAATCTAAAAAAAATAAATTATTTTTTCTTTTTTCTCTTCGAGGGGCATTATTGAAACTTACCCTTATCAAGGTTATTTTATATAATCTAGCCTAAATTAATTGGAATGATTTTTCATGACACAAACTCAGGCTCTTGATTATGAACTATGGATACTGATGCTGTTACTTGTTAACTAACAAACTTCTTTAAATTGGCAGTTAAGAGGTTGCTTTGGATTCAATACATTTTTCGCAACTGATGTACGTAGATTTGGAGATGGAGGTATTTTTATTGGGAACTTGAGGAGACCCATTGAAGAAGTGATTCCCCAGTTGGAGAAAAAACTATCAGAAGCAGCAGGAAGGGAGGTAGTCCTATGGTTCATGGAAGAAAAAACAGATGACATCACGAAACAGGTAGAATCCCATTGTCCATTGATATTTCTACAAAAATAATGTTAACACATAGCTTGAGGGCCTGCCCAATATTTCAGTCCATATAGAAATATTAATGACATATAAAAAATGTGAGCAAGATATCAATTGCACACAAATGCTGATAACTGGGATTTCTTTAACATATTTAAAAGGTATGGAATAGTTTGTGTATTTATTTATTCTATAACTTATTTTTGGAATATGTTTTAGGTCTGTATGGTGCAACCCAAGGCAGAAATAGATCTCCAATTTGAGTCTACCAAGCTGAGTACTCCATTGGGATATTTTAGTGCAATAACGTTATGTGTTGCAACGTTTGGGACTATTGCTTTGATGAGTGGCTTCTTTCTAAAACCTGGTGCTACCTTTGATGACTATATAGCGAATGTTGTTCCTCTTTTTGGTGGCTTCATCTCTATTCTGGGAGTTTCAGAGGTTTGTGAACCAACTTTTATTCATCTTTATCCGCTGTTTTCTCTTAATGCATTTCTCGTCGCTTTCATAGTTCAACATGTGTGGAATCTTGCTGGTTTTGAGTGTAGTTTTTTTACCCAAGTCTTATCTCTTTTCAAAGCATAACCCAAGAGACTTTCTTGTGGAAAGGGAGTGTTGAGATCTTTATCTAATTTAGTGGGAGCTAGTTCGAATGTGGAAGAAGAACATTGAG
GATTGCTTATCGTTTTATAGTCCTTTTTGACAAAATGGTCGTGACATTTTCCTTCAAGAGTCTAATGCTCTGCTGCACAATATTGTGGCAACCATCATGCTTAGACACCCCCAAGTCTTATTATTCATTGCGTTGTTGGTTATACCATATTTTCATTTCTGTGGAATTCTGGACTGAGATTTTTTTTTTTTTTTTAACTGCATTGGTCTCGTAGTACAGGTTTGCCTGTACTTCTTGTTCTTGTTATGGATAGTTTTTTAAACTGAAAATAAGTGTCATCCTTTAGACAGCATTTTCTTGATCTATTGAAACCAATGGGCGTTTGATCTGCAGATAGCAACGAGGGTAACAGCAGCTCGTTATGGCGTGAAGCTAAGCCCTTCTTTTCTCGTGCCTTCCAACTGGACAGGATGCTTAGGAGTGATGAATAACTATGAATCTCTACTTCCAAACAAGAAAGCGCTTTTTGATATTCCAGTAGCACGTACTGCTTCTGCATATTTAACGTCCTTGGCACTTGCAGTCTCTGCTTTTGTAATTGACGGTGGCTTTAATGGTGGAGACAATGCAATGTAAGTTGAGCATTCTCTTGTCCTTTAATCTTTCCATGTGAGATTGAGTTTTTCTTTCCTCAAGAAGCAGAAATGCCCCCAAGGTTTTAACACTTGAGAAATCAAATTTTGTTTGCCCAATATCATCTGAATTAAAAAAACTGAATAAATGAAATGAACTGCTAAGATACTTCTTGATTCTTGGATTGAGATTAAGAGCTTGTTTGGATTGACTTTGAAATATTGATGAATTTTAGAGGTTGTTACAATTGCACCTTTGACTTTTCAATTGGGACAGTTTACCAATTAAATTTATGAAATTATTGCAATTAACTGTAGACAACAATTCGGTAGTTTTTTGTGTTGCTGCAACAACTTAGGGCTTTATTTTAAGTTTTAACAACCTCTATGCTTCGCATTTCCTTTAGTTTTAGGATGATTAGTGTACAAGTTAAAGGAGTTGTAATAATTTCTTAACTGCAAATATTTCATGAACTTGTTGGTGATTGTTCAAACTAAAGTACATCACATCGACATGTAACCAGAAAGCTTTTTTATATCAGACTGAAATCTCACTATATTCCATTCTTGATGATTCCAATTCTTCATTTGCAGGTACATTAGACCTCAATTCTTTTACAACAATCCCTTACTTTCTTTTATCCAGTTTGTTATTGGACCTTACTCGGATGACCTTGGCAATGTATTGCCCTATGCAGTGGAAGGTGTTGGGGTTCCTGTGGATCCCCTTGCTTTTGCTGGCCTTTTAGGTATTTCCCTTCTTCCCCACATAAACTCGATCACTTTACAAACCCCACTCTCACAGACTCGTAAATGCCTATTGGCTTTTGGGATTTTCCCGAATTGTTTCTGATCGTAGACTCCTCCATGTTCTTCCTATAGGAATGGTAGTGACATCTTTGAATTTGTTGCCGTGCGGGAGGCTCGAAGGAGGCCGCATTGCACAAGCCATGTTTGGGAGAAGCACCGCTGCCCTACTATCATTTGCCACATCTCTTGTACTTGGTATCGGTGGACTGAGTGGGAGTGTCCTTTGTTTGGCTTGGGGTTTGTTTGCAACTTTCTTTCGAGGTGGTGAAGAAGTTCCAGCAACAGATGAGATCACTCCCTTGGGAGATGATCGGTACGCGTGGGGTGTCGTTCTCGGCCTCATTTGCTTACTTACCTTGTTCCCTAATGGCGGAGGCACATTCTCAAGTCCATTTTTCAGTGCACCATTTTTCAGGGGTGACTTATAAAGTGTACATATAGTTTGTCACTTGACCGTAGATAAAAGAGTGAGAGCACCAAACTAACTAATAGAAGTATGGAATAGTGTTTTTGTAAATGGAATCTTTGCTTTTGTATTTGATCATCAAGTCTAAAATCTTATTAAGTGTATCATCTTCTAATACCAATTAGGACATCTACTTTCATGATCATAATGT

mRNA sequence

ATGGCCGCTCTCTCAATCGCTTCAAATTCATGGTTCATCAGTCACAGAGAGAAGCACTATACAAGCCGCACAATGGCCAAGCCTTTTGGTAAGATTCCACTTGGGCGTAGAACCGGCGGTTACTTCTTTACAATTTCCGCACCTATTACAGAGGATCGTTTAAGATTCTCCGCCAGAGATGATTCGGAAAGCGAGCCGTCGTCTTCCTCGATCGCGGTGGTTTCCGATGAGAGGGGAGGCGGAAATGACAACGAGATGGCGGAACTGTCTGCCGGAGAACATGGGGGTGAAGAGAGAGAGAAACAACAGGAAATGGATTGGAAGACGGACGAGGAGTTCAAGAAGTTCATGGGAAATCCTTCTATTGAAGCTGCCATAAAGCTGGAGAAGAAGAGGGCGGATAGGAAGCTGAAGGAGCTTGATCGTGAAGGGGCCAATAATCCGATCGTGGGGTTGTTCAATAGAATTGCTCGGGATAATTTGGAAAAAGAGAAGGAGAGATTAAAGAAGGCTGAAGAGACTTTCAAGGCTCTTGATCTCAGCAAGTTAAGAGGTTGCTTTGGATTCAATACATTTTTCGCAACTGATGTACGTAGATTTGGAGATGGAGGTATTTTTATTGGGAACTTGAGGAGACCCATTGAAGAAGTGATTCCCCAGTTGGAGAAAAAACTATCAGAAGCAGCAGGAAGGGAGGTAGTCCTATGGTTCATGGAAGAAAAAACAGATGACATCACGAAACAGGTCTGTATGGTGCAACCCAAGGCAGAAATAGATCTCCAATTTGAGTCTACCAAGCTGAGTACTCCATTGGGATATTTTAGTGCAATAACGTTATGTGTTGCAACGTTTGGGACTATTGCTTTGATGAGTGGCTTCTTTCTAAAACCTGGTGCTACCTTTGATGACTATATAGCGAATGTTGTTCCTCTTTTTGGTGGCTTCATCTCTATTCTGGGAGTTTCAGAGATAGCAACGAGGGTAACAGCAGCTCGTTATGGCGTGAAGCTAAGCCCTTCTTTTCTCGTGCCTTCCAACTGGACAGGATGCTTAGGAGTGATGAATAACTATGAATCTCTACTTCCAAACAAGAAAGCGCTTTTTGATATTCCAGTAGCACGTACTGCTTCTGCATATTTAACGTCCTTGGCACTTGCAGTCTCTGCTTTTGTAATTGACGGTGGCTTTAATGGTGGAGACAATGCAATGTACATTAGACCTCAATTCTTTTACAACAATCCCTTACTTTCTTTTATCCAGTTTGTTATTGGACCTTACTCGGATGACCTTGGCAATGTATTGCCCTATGCAGTGGAAGGTGTTGGGGTTCCTGTGGATCCCCTTGCTTTTGCTGGCCTTTTAGGAATGGTAGTGACATCTTTGAATTTGTTGCCGTGCGGGAGGCTCGAAGGAGGCCGCATTGCACAAGCCATGTTTGGGAGAAGCACCGCTGCCCTACTATCATTTGCCACATCTCTTGTACTTGGTATCGGTGGACTGAGTGGGAGTGTCCTTTGTTTGGCTTGGGGTTTGTTTGCAACTTTCTTTCGAGGTGGTGAAGAAGTTCCAGCAACAGATGAGATCACTCCCTTGGGAGATGATCGGTACGCGTGGGGTGTCGTTCTCGGCCTCATTTGCTTACTTACCTTGTTCCCTAATGGCGGAGGCACATTCTCAAGTCCATTTTTCAGTGCACCATTTTTCAGGGGTGACTTATAA

Coding sequence (CDS)

ATGGCCGCTCTCTCAATCGCTTCAAATTCATGGTTCATCAGTCACAGAGAGAAGCACTATACAAGCCGCACAATGGCCAAGCCTTTTGGTAAGATTCCACTTGGGCGTAGAACCGGCGGTTACTTCTTTACAATTTCCGCACCTATTACAGAGGATCGTTTAAGATTCTCCGCCAGAGATGATTCGGAAAGCGAGCCGTCGTCTTCCTCGATCGCGGTGGTTTCCGATGAGAGGGGAGGCGGAAATGACAACGAGATGGCGGAACTGTCTGCCGGAGAACATGGGGGTGAAGAGAGAGAGAAACAACAGGAAATGGATTGGAAGACGGACGAGGAGTTCAAGAAGTTCATGGGAAATCCTTCTATTGAAGCTGCCATAAAGCTGGAGAAGAAGAGGGCGGATAGGAAGCTGAAGGAGCTTGATCGTGAAGGGGCCAATAATCCGATCGTGGGGTTGTTCAATAGAATTGCTCGGGATAATTTGGAAAAAGAGAAGGAGAGATTAAAGAAGGCTGAAGAGACTTTCAAGGCTCTTGATCTCAGCAAGTTAAGAGGTTGCTTTGGATTCAATACATTTTTCGCAACTGATGTACGTAGATTTGGAGATGGAGGTATTTTTATTGGGAACTTGAGGAGACCCATTGAAGAAGTGATTCCCCAGTTGGAGAAAAAACTATCAGAAGCAGCAGGAAGGGAGGTAGTCCTATGGTTCATGGAAGAAAAAACAGATGACATCACGAAACAGGTCTGTATGGTGCAACCCAAGGCAGAAATAGATCTCCAATTTGAGTCTACCAAGCTGAGTACTCCATTGGGATATTTTAGTGCAATAACGTTATGTGTTGCAACGTTTGGGACTATTGCTTTGATGAGTGGCTTCTTTCTAAAACCTGGTGCTACCTTTGATGACTATATAGCGAATGTTGTTCCTCTTTTTGGTGGCTTCATCTCTATTCTGGGAGTTTCAGAGATAGCAACGAGGGTAACAGCAGCTCGTTATGGCGTGAAGCTAAGCCCTTCTTTTCTCGTGCCTTCCAACTGGACAGGATGCTTAGGAGTGATGAATAACTATGAATCTCTACTTCCAAACAAGAAAGCGCTTTTTGATATTCCAGTAGCACGTACTGCTTCTGCATATTTAACGTCCTTGGCACTTGCAGTCTCTGCTTTTGTAATTGACGGTGGCTTTAATGGTGGAGACAATGCAATGTACATTAGACCTCAATTCTTTTACAACAATCCCTTACTTTCTTTTATCCAGTTTGTTATTGGACCTTACTCGGATGACCTTGGCAATGTATTGCCCTATGCAGTGGAAGGTGTTGGGGTTCCTGTGGATCCCCTTGCTTTTGCTGGCCTTTTAGGAATGGTAGTGACATCTTTGAATTTGTTGCCGTGCGGGAGGCTCGAAGGAGGCCGCATTGCACAAGCCATGTTTGGGAGAAGCACCGCTGCCCTACTATCATTTGCCACATCTCTTGTACTTGGTATCGGTGGACTGAGTGGGAGTGTCCTTTGTTTGGCTTGGGGTTTGTTTGCAACTTTCTTTCGAGGTGGTGAAGAAGTTCCAGCAACAGATGAGATCACTCCCTTGGGAGATGATCGGTACGCGTGGGGTGTCGTTCTCGGCCTCATTTGCTTACTTACCTTGTTCCCTAATGGCGGAGGCACATTCTCAAGTCCATTTTTCAGTGCACCATTTTTCAGGGGTGACTTATAA
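
As a quick sanity check on the coding sequence above, its first and last codons can be translated with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the two fragments are copied from the start and end of the CDS, and the standard (nuclear) codon table is assumed to apply.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna):
    """Translate a DNA string in frame 1; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGCCGCTCTCTCAATCGCTTCAAATTCA"  # first 30 nt of the CDS
cds_end = "TTTTTCAGGGGTGACTTATAA"             # last 21 nt of the CDS

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues, '*' marks the stop codon
```

The translated ends can be compared with the first and last residues of the query rows in the BLAST alignments on this page.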
BLAST of CSPI04G00110 vs. Swiss-Prot
Match: EGY3_ARATH (Probable zinc metallopeptidase EGY3, chloroplastic OS=Arabidopsis thaliana GN=EGY3 PE=2 SV=1)

HSP 1 Score: 832.4 bits (2149), Expect = 3.0e-240
Identity = 423/523 (80.88%), Positives = 469/523 (89.67%), Query Frame = 1

Query: 54  LRFSARDDSESEP--SSSSIAVVSDERGGGNDNEMAELS--AGEHGGEEREKQQEMDWKT 113
           LR SA DD   EP   + S   +++E+   +DN  A  S  + E   E++ KQQEMDWKT
Sbjct: 49  LRCSAEDDRVREPVNEAPSPVALAEEQKEDHDNNNAPPSPESSEEEEEKKSKQQEMDWKT 108

Query: 114 DEEFKKFMGNPSIEAAIKLEKKRADRKLKELDREG-ANNPIVGLFNRIARDNLEKEKERL 173
           DEEFKKFMGNPSIEAAIKLEK R DRKLKEL++E  + NPI+G++N +ARD+L KEKERL
Sbjct: 109 DEEFKKFMGNPSIEAAIKLEKTRTDRKLKELNKESNSENPIIGIYNSLARDSLTKEKERL 168

Query: 174 KKAEETFKALDLSKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEA 233
           +KAEETFKALDL+KL+ CFGF+TFFATDVRRFGDGGIFIGNLR+PI+EV P+LE KLSEA
Sbjct: 169 EKAEETFKALDLNKLKSCFGFDTFFATDVRRFGDGGIFIGNLRKPIDEVTPKLEAKLSEA 228

Query: 234 AGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIA 293
           AGR+VV+WFMEE++++ITKQVCMVQPKAEIDLQFEST+LSTP GY SAI LCV TFGTIA
Sbjct: 229 AGRDVVVWFMEERSNEITKQVCMVQPKAEIDLQFESTRLSTPWGYVSAIALCVTTFGTIA 288

Query: 294 LMSGFFLKPGATFDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWT 353
           LMSGFFLKP ATFDDYIANVVPLFGGF+SILGVSEIATRVTAAR+GVKLSPSFLVPSNWT
Sbjct: 289 LMSGFFLKPDATFDDYIANVVPLFGGFLSILGVSEIATRVTAARHGVKLSPSFLVPSNWT 348

Query: 354 GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQ 413
           GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSL LA +AF+ DG FNGGDNA+YIRPQ
Sbjct: 349 GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLLLAAAAFISDGSFNGGDNALYIRPQ 408

Query: 414 FFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR 473
           FF NNPLLSF+QFV+GPY+DDLGNVLP AVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR
Sbjct: 409 FFDNNPLLSFVQFVVGPYADDLGNVLPNAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR 468

Query: 474 LEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEI 533
           LEGGRIAQAMFGRSTAA+LSF TSL+LGIGGLSGSVLCLAWGLFATFFRGGEE PA DEI
Sbjct: 469 LEGGRIAQAMFGRSTAAILSFTTSLLLGIGGLSGSVLCLAWGLFATFFRGGEETPAKDEI 528

Query: 534 TPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSPFFSAPFFRGD 572
           TP+GDDR+AWG+VLGLIC LTLFPN GGTFS+ FF+ PFFRGD
Sbjct: 529 TPVGDDRFAWGIVLGLICFLTLFPNSGGTFSTSFFNGPFFRGD 571
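
The percentages in an HSP summary line follow directly from the match counts. The sketch below parses one such line and recomputes them; `parse_hsp_summary` is a hypothetical helper, not part of any BLAST tool, and the pattern tolerates the "Postives" spelling that some report exports use.

```python
import re

HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Extract match counts from an HSP summary line and recompute percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length = int(m.group(1)), int(m.group(2))
    pos, pos_length = int(m.group(4)), int(m.group(5))
    return {
        "identities": ident,
        "positives": pos,
        "align_length": length,
        "identity_pct": round(100 * ident / length, 2),
        "positive_pct": round(100 * pos / pos_length, 2),
        "reported_identity_pct": float(m.group(3)),
        "reported_positive_pct": float(m.group(6)),
    }

hsp = parse_hsp_summary(
    "Identity = 423/523 (80.88%), Positives = 469/523 (89.67%), Query Frame = 1"
)
print(hsp["identity_pct"], hsp["positive_pct"])
```

Comparing the recomputed values with the reported ones is a cheap way to catch copy or parsing errors when hits are collected into a table.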

BLAST of CSPI04G00110 vs. Swiss-Prot
Match: EGY3_ORYSJ (Probable zinc metalloprotease EGY3, chloroplastic OS=Oryza sativa subsp. japonica GN=EGY3 PE=2 SV=1)

HSP 1 Score: 772.3 bits (1993), Expect = 3.7e-222
Identity = 394/533 (73.92%), Positives = 448/533 (84.05%), Query Frame = 1

Query: 46  SAPITEDRLRFSARDDSESEPSSSSIAV-----VSDERGGGNDNEMAELSAGEHGGEERE 105
           +AP  E       +DD+ +   +S  AV     V+D  GGG      EL        E E
Sbjct: 65  AAPAAESHHAGGGQDDAAT---ASHHAVEGENGVADADGGGVKKSKEEL--------EEE 124

Query: 106 KQQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDN 165
           +QQE+DW++DEEFK+FMGNPSIEAAIKLEKKRADRKL+ELDRE   NP+ GL   +AR  
Sbjct: 125 EQQEVDWRSDEEFKRFMGNPSIEAAIKLEKKRADRKLRELDREPDANPLAGLLRGLARGQ 184

Query: 166 LEKEKERLKKAEETFKALDLSKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQ 225
           L +EKERL+ AE TFKALDL+KL+ CFG++TFFA DVRRFGDGGIFIGNLR+P+EEV P+
Sbjct: 185 LAREKERLELAENTFKALDLNKLKSCFGYDTFFAVDVRRFGDGGIFIGNLRKPVEEVRPK 244

Query: 226 LEKKLSEAAGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLC 285
           LEKK++EAAG +V LWFMEEK DDITKQVCMVQPKAEIDLQ E TKLSTP GY SA+ L 
Sbjct: 245 LEKKIAEAAGTDVTLWFMEEKNDDITKQVCMVQPKAEIDLQLEITKLSTPWGYLSAVALA 304

Query: 286 VATFGTIALMSGFFLKPGATFDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPS 345
           V TFGTIA+MSGFFLKPGATFDDY+++V+PLF GF+SILGVSEIATR+TAARYGVKLSPS
Sbjct: 305 VTTFGTIAIMSGFFLKPGATFDDYVSDVLPLFAGFLSILGVSEIATRLTAARYGVKLSPS 364

Query: 346 FLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGD 405
           FLVPSNWTGCLGVMNNYESLLPNKKALFDIPVAR ASAYLTS+ALAVSAFV DG  NGG 
Sbjct: 365 FLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARAASAYLTSVALAVSAFVSDGSLNGGK 424

Query: 406 NAMYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTS 465
           NA+++RP+FFYNNPLLSF+Q VIGPY+D+LGNVLP AVEGVGVPVDPLAFAGLLG+VVTS
Sbjct: 425 NALFVRPEFFYNNPLLSFVQAVIGPYADELGNVLPNAVEGVGVPVDPLAFAGLLGIVVTS 484

Query: 466 LNLLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGG-LSGSVLCLAWGLFATFFRGG 525
           LNLLPCGRLEGGRIAQA+FGR  AA+LSFATS+ LG G  + GSVLCLAWGLFATF RGG
Sbjct: 485 LNLLPCGRLEGGRIAQALFGRGAAAVLSFATSVALGAGAIIGGSVLCLAWGLFATFVRGG 544

Query: 526 EEVPATDEITPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 573
           EE+PA DEITPLG +RYAWG+VL ++CLLTLFPNGGGT+SS F  APFFRG +
Sbjct: 545 EEIPAQDEITPLGSERYAWGLVLAVVCLLTLFPNGGGTYSSDFLGAPFFRGGI 586

BLAST of CSPI04G00110 vs. Swiss-Prot
Match: EGY3_ORYSI (Probable zinc metalloprotease EGY3, chloroplastic OS=Oryza sativa subsp. indica GN=EGY3 PE=3 SV=1)

HSP 1 Score: 770.8 bits (1989), Expect = 1.1e-221
Identity = 393/533 (73.73%), Positives = 447/533 (83.86%), Query Frame = 1

Query: 46  SAPITEDRLRFSARDDSESEPSSSSIAV-----VSDERGGGNDNEMAELSAGEHGGEERE 105
           +AP  E       +DD+ +   +S  AV     V+D  GGG      EL        E E
Sbjct: 65  AAPAAESHHAGGGQDDAAT---ASHHAVEGENGVADADGGGVKKSKEEL--------EEE 124

Query: 106 KQQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDN 165
           +QQE+DW++DEEFK+FMGNPSIE AIKLEKKRADRKL+ELDRE   NP+ GL   +AR  
Sbjct: 125 EQQEVDWRSDEEFKRFMGNPSIEGAIKLEKKRADRKLRELDREPDANPLAGLLRGLARGQ 184

Query: 166 LEKEKERLKKAEETFKALDLSKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQ 225
           L +EKERL+ AE TFKALDL+KL+ CFG++TFFA DVRRFGDGGIFIGNLR+P+EEV P+
Sbjct: 185 LAREKERLELAENTFKALDLNKLKSCFGYDTFFAVDVRRFGDGGIFIGNLRKPVEEVRPK 244

Query: 226 LEKKLSEAAGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLC 285
           LEKK++EAAG +V LWFMEEK DDITKQVCMVQPKAEIDLQ E TKLSTP GY SA+ L 
Sbjct: 245 LEKKIAEAAGTDVTLWFMEEKNDDITKQVCMVQPKAEIDLQLEITKLSTPWGYLSAVALA 304

Query: 286 VATFGTIALMSGFFLKPGATFDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPS 345
           V TFGTIA+MSGFFLKPGATFDDY+++V+PLF GF+SILGVSEIATR+TAARYGVKLSPS
Sbjct: 305 VTTFGTIAIMSGFFLKPGATFDDYVSDVLPLFAGFLSILGVSEIATRLTAARYGVKLSPS 364

Query: 346 FLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGD 405
           FLVPSNWTGCLGVMNNYESLLPNKKALFDIPVAR ASAYLTS+ALAVSAFV DG  NGG 
Sbjct: 365 FLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARAASAYLTSVALAVSAFVSDGSLNGGK 424

Query: 406 NAMYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTS 465
           NA+++RP+FFYNNPLLSF+Q VIGPY+D+LGNVLP AVEGVGVPVDPLAFAGLLG+VVTS
Sbjct: 425 NALFVRPEFFYNNPLLSFVQAVIGPYADELGNVLPNAVEGVGVPVDPLAFAGLLGIVVTS 484

Query: 466 LNLLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGG-LSGSVLCLAWGLFATFFRGG 525
           LNLLPCGRLEGGRIAQA+FGR  AA+LSFATS+ LG G  + GSVLCLAWGLFATF RGG
Sbjct: 485 LNLLPCGRLEGGRIAQALFGRGAAAVLSFATSVALGAGAIIGGSVLCLAWGLFATFVRGG 544

Query: 526 EEVPATDEITPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 573
           EE+PA DEITPLG +RYAWG+VL ++CLLTLFPNGGGT+SS F  APFFRG +
Sbjct: 545 EEIPAQDEITPLGSERYAWGLVLAVVCLLTLFPNGGGTYSSDFLGAPFFRGGI 586

BLAST of CSPI04G00110 vs. Swiss-Prot
Match: EGY1_ORYSJ (Probable zinc metalloprotease EGY1, chloroplastic OS=Oryza sativa subsp. japonica GN=EGY1 PE=2 SV=3)

HSP 1 Score: 121.3 bits (303), Expect = 3.4e-26
Identity = 127/510 (24.90%), Positives = 216/510 (42.35%), Query Frame = 1

Query: 76  DERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADR 135
           D  GGG          G  GGE+ EK+ E +     E K       +  A++  +    R
Sbjct: 77  DGGGGGGGG-----GGGGTGGEDGEKRGEEEAAAAAEAK-------VGGAVEEMRSERTR 136

Query: 136 KLKELDREGANNPIVGLFNR-----IARDNLEKEKERLKKAEETFKALDLSKLRG-CFGF 195
                    +++   G+ N       + DN++  K       E   + D+  ++   FG+
Sbjct: 137 SGSFSSSSSSSSGTPGISNEPPFLSFSVDNIDTVKLLELLGPEKVDSADVKAIKEKLFGY 196

Query: 196 NTFFATDVRRFGD---GGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEEKTDD-- 255
            TF+ T    FGD   G +FIGNLR   EE+  +L+++L E  G +  L+ +EE   +  
Sbjct: 197 TTFWLTREEPFGDLGEGVLFIGNLRGKREEIFAKLQQQLRELTGDKYNLFMVEEPNSEGE 256

Query: 256 ------------ITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVAT--FGTIALM 315
                       + ++V   +P      Q+  + L   L  FS + L +A+        +
Sbjct: 257 DPRGGPRVSFGLLRREVS--EPGPTTLWQYVISLLLFLLTVFSCVELGIASKISSLPPEI 316

Query: 316 SGFFLKPGAT--------FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFL 375
             +F  P AT           ++ + +P+  G ++I    E+   + A    VKLS  F 
Sbjct: 317 VTYFTDPNATGPPPDMQLLLPFVESALPVAYGVLAIQLFHEVGHFLAAFPKKVKLSIPFF 376

Query: 376 VPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNA 435
           +P+   G  G +  ++S+LP+KK +FDI +A   +    S ++     ++     G  + 
Sbjct: 377 IPNFTLGTFGAITQFKSILPDKKTMFDISMAGPLAGAALSFSMFSVGLLLSSNPAGASDL 436

Query: 436 MYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLN 495
           + +  + F  + LL  +      Y          A+    V + PL  AG  G+  T+ N
Sbjct: 437 VEVPSKLFQGSLLLGLVSRATLGYR---------AMHAATVAIHPLVIAGWCGLTTTAFN 496

Query: 496 LLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEV 553
           +LP G L+GGR  Q  FG+         T  +LG+G L G  L L WGL+    +   E 
Sbjct: 497 MLPVGCLDGGRALQGAFGKDALFGFGLTTYSLLGLGVLGGP-LSLPWGLYVLICQRTPEK 556

BLAST of CSPI04G00110 vs. Swiss-Prot
Match: EGY1_ARATH (Probable zinc metalloprotease EGY1, chloroplastic OS=Arabidopsis thaliana GN=EGY1 PE=2 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 1.3e-25
Identity = 100/394 (25.38%), Positives = 172/394 (43.65%), Query Frame = 1

Query: 187 FGFNTFFATDVRRFGDGG---IFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEEKTD 246
           FG++TF+ T    FGD G   +F+GNLR   E+V  +L++KL E A  +  L+ +EE   
Sbjct: 152 FGYSTFWVTKEEPFGDLGEGILFLGNLRGKKEDVFAKLQRKLVEVASDKYNLFMIEEPNS 211

Query: 247 DITKQVCMVQPKAEIDLQFESTKLSTP-----LGYFSAITLCVATFGTIALMS------- 306
           +        +  A +       ++S P       Y  A+ L + T G+   +        
Sbjct: 212 EGPDP----RGGARVSFGLLRKEVSEPGPTTLWQYVIALILFLLTIGSSVELGIASQINR 271

Query: 307 ------GFFLKPGAT-------FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLS 366
                  +F  P A           ++   +PL  G + IL   E+   + A    VKLS
Sbjct: 272 LPPEVVKYFTDPNAVEPPDMELLYPFVDAALPLAYGVLGILLFHELGHFLAAVPKKVKLS 331

Query: 367 PSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNG 426
             + +P+   G  G +  ++S+LP++    DI +A   +    S+++      +    + 
Sbjct: 332 IPYFIPNITLGSFGAITQFKSILPDRSTKVDISLAGPFAGAALSVSMFAVGLFLSTEPDA 391

Query: 427 GDNAMYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVV 486
            ++ + +    F  + LL  I      Y+         A+    V + PL  AG  G+  
Sbjct: 392 ANDLVQVPSMLFQGSLLLGLISRATLGYA---------ALHAATVSIHPLVIAGWCGLTT 451

Query: 487 TSLNLLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRG 546
           T+ N+LP G L+GGR  Q  FG++       +T ++LG+  L G  L L WGL+    + 
Sbjct: 452 TAFNMLPVGCLDGGRAVQGAFGKNALVTFGLSTYVMLGLRVLGGP-LALPWGLYVLICQR 511

Query: 547 GEEVPATDEITPLGDDRYAWGVVLGLICLLTLFP 553
             E P  +++T +G  R A   +  ++ +LTL P
Sbjct: 512 TPEKPCLNDVTEVGTWRKALVGIALILVVLTLLP 531

BLAST of CSPI04G00110 vs. TrEMBL
Match: A0A0A0KUZ7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000600 PE=4 SV=1)

HSP 1 Score: 1121.3 bits (2899), Expect = 0.0e+00
Identity = 570/572 (99.65%), Positives = 571/572 (99.83%), Query Frame = 1

Query: 1   MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTISAPITEDRLRFSARD 60
           MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTI APITEDRLRFSARD
Sbjct: 1   MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTIFAPITEDRLRFSARD 60

Query: 61  DSESEPSSSSIAVVSDERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNP 120
           DSESEPSSSSIAVVSDERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNP
Sbjct: 61  DSESEPSSSSIAVVSDERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNP 120

Query: 121 SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERLKKAEETFKALDL 180
           SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERL+KAEETFKALDL
Sbjct: 121 SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERLEKAEETFKALDL 180

Query: 181 SKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE 240
           SKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE
Sbjct: 181 SKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE 240

Query: 241 KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT 300
           KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT
Sbjct: 241 KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT 300

Query: 301 FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL 360
           FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL
Sbjct: 301 FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL 360

Query: 361 LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ 420
           LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ
Sbjct: 361 LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ 420

Query: 421 FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG 480
           FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG
Sbjct: 421 FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG 480

Query: 481 RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV 540
           RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV
Sbjct: 481 RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV 540

Query: 541 VLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 573
           VLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL
Sbjct: 541 VLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 572

BLAST of CSPI04G00110 vs. TrEMBL
Match: E5GBH3_CUCME (Sterol regulatory element-binding protein site 2 protease OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 1095.5 bits (2832), Expect = 0.0e+00
Identity = 556/572 (97.20%), Positives = 563/572 (98.43%), Query Frame = 1

Query: 1   MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTISAPITEDRLRFSARD 60
           MA LSIASNSWFISH+E+HYTSRTMAKPFGKIPLGR+T GYFFTIS PITE+RLRFSARD
Sbjct: 1   MATLSIASNSWFISHKERHYTSRTMAKPFGKIPLGRKTDGYFFTISTPITENRLRFSARD 60

Query: 61  DSESEPSSSSIAVVSDERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNP 120
           DSESE SSSSIAVVSDERGGGNDNE AELSAGEH  EEREKQQEMDWKTDEEFKKFMGNP
Sbjct: 61  DSESESSSSSIAVVSDERGGGNDNEKAELSAGEHESEEREKQQEMDWKTDEEFKKFMGNP 120

Query: 121 SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERLKKAEETFKALDL 180
           SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERL+KAEETFKALDL
Sbjct: 121 SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERLEKAEETFKALDL 180

Query: 181 SKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE 240
           +KL+ CFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE
Sbjct: 181 NKLKSCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE 240

Query: 241 KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT 300
           KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT
Sbjct: 241 KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT 300

Query: 301 FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL 360
           FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL
Sbjct: 301 FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL 360

Query: 361 LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ 420
           LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ
Sbjct: 361 LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ 420

Query: 421 FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG 480
           FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG
Sbjct: 421 FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG 480

Query: 481 RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV 540
           RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV
Sbjct: 481 RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV 540

Query: 541 VLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 573
           VLGLIC LTLFPNGGGTFSSPFFSAPFFRGDL
Sbjct: 541 VLGLICFLTLFPNGGGTFSSPFFSAPFFRGDL 572

BLAST of CSPI04G00110 vs. TrEMBL
Match: A5ALY0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_010704 PE=4 SV=1)

HSP 1 Score: 888.3 bits (2294), Expect = 5.1e-255
Identity = 454/551 (82.40%), Positives = 493/551 (89.47%), Query Frame = 1

Query: 28  PFGKIPLGRRTGGYFFTISAPITEDRLRFSARDDSESEP-SSSSIAVVSDERGGGNDNEM 87
           P  K P GRRT   F   S P +   LR S +DD E +P SSSS+A+VS+++  G+D + 
Sbjct: 29  PLPKHPFGRRTHLPFSLTSTPRSIKPLRSSNKDDPEGQPPSSSSVALVSEKQDDGDDAQK 88

Query: 88  AELSAGEHGGEE-----REKQQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELD 147
           + L+A    GEE     RE QQEMDWK DEEFKKFMGNPSIEAAIKLEKKRADRKLKELD
Sbjct: 89  SGLAAEVELGEENDSGERENQQEMDWKLDEEFKKFMGNPSIEAAIKLEKKRADRKLKELD 148

Query: 148 REGANNPIVGLFNRIARDNLEKEKERLKKAEETFKALDLSKLRGCFGFNTFFATDVRRFG 207
           RE ++NP+VGLFNR+ RD+L +EKERL+KAEE FKALDL+KL+ CFGF+TF+ATDVRRFG
Sbjct: 149 RESSDNPVVGLFNRVVRDSLAREKERLEKAEEAFKALDLNKLKNCFGFDTFYATDVRRFG 208

Query: 208 DGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQ 267
           DGGIFIGNLRRPIEEVIP+LEKKLSEAAGREVVLWFMEEK +DITKQVCMVQPKAE+DLQ
Sbjct: 209 DGGIFIGNLRRPIEEVIPKLEKKLSEAAGREVVLWFMEEKANDITKQVCMVQPKAEMDLQ 268

Query: 268 FESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGATFDDYIANVVPLFGGFISILGV 327
           FESTKLSTP GY S+I LCVATFGTIALMSGFFLKP ATFDDY+A+VVPLF GF++ILGV
Sbjct: 269 FESTKLSTPWGYISSIVLCVATFGTIALMSGFFLKPNATFDDYLADVVPLFSGFVTILGV 328

Query: 328 SEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLT 387
           SEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAY+T
Sbjct: 329 SEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYIT 388

Query: 388 SLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGV 447
           SL LAV+AF+ DG FNGGDNA+YIRPQFFYNNPLLSFIQFVIGPY+DDLGNVLPYAVEGV
Sbjct: 389 SLVLAVAAFIADGSFNGGDNALYIRPQFFYNNPLLSFIQFVIGPYTDDLGNVLPYAVEGV 448

Query: 448 GVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGGLS 507
           GVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQA+FGR+ A LLSF TSL+LGIGGLS
Sbjct: 449 GVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQALFGRNIATLLSFGTSLLLGIGGLS 508

Query: 508 GSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSP 567
           GSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWG VL LIC LTLFPNGGGTFSS 
Sbjct: 509 GSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGFVLALICFLTLFPNGGGTFSSS 568

Query: 568 FFSAPFFRGDL 573
           FFS PFFRGDL
Sbjct: 569 FFSDPFFRGDL 579

BLAST of CSPI04G00110 vs. TrEMBL
Match: A0A061E1K2_THECC (Ethylene-dependent gravitropism-deficient and yellow-green-like 3 OS=Theobroma cacao GN=TCM_007262 PE=4 SV=1)

HSP 1 Score: 870.2 bits (2247), Expect = 1.4e-249
Identity = 448/545 (82.20%), Positives = 484/545 (88.81%), Query Frame = 1

Query: 33  PLGRRTGGYFFTISAPITEDRLRFSARDDSESEP-SSSSIAVVSDERGGGNDN----EMA 92
           PL     GY+ + S  +    L+ S  D+ ESEP SSSS+AV  +E     ++    +  
Sbjct: 66  PLDLAVKGYYLSFSRRLKP--LKSSVTDEPESEPTSSSSVAVAPEEPSNEKESPKSVQEV 125

Query: 93  ELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELDREGANN 152
            LS      E +E QQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKE DRE + N
Sbjct: 126 GLSKENEETEGKENQQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKEFDRESSGN 185

Query: 153 PIVGLFNRIARDNLEKEKERLKKAEETFKALDLSKLRGCFGFNTFFATDVRRFGDGGIFI 212
           PIVGLFN++ RDNL +EKERL++AEETFKALDL+KL+ CFGF+TFFATDVRRFGDGGI+I
Sbjct: 186 PIVGLFNKLVRDNLTREKERLEQAEETFKALDLNKLKSCFGFDTFFATDVRRFGDGGIYI 245

Query: 213 GNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQFESTKL 272
           GNLRRPIEEVIP LEKKLS+AAG EVVLWFMEEK +DITKQ C+VQPKAEIDLQFESTKL
Sbjct: 246 GNLRRPIEEVIPILEKKLSDAAGWEVVLWFMEEKANDITKQACVVQPKAEIDLQFESTKL 305

Query: 273 STPLGYFSAITLCVATFGTIALMSGFFLKPGATFDDYIANVVPLFGGFISILGVSEIATR 332
           STP GY SAI LCVATFGTIALMSGFFLKPGATFDDY+A+VVPLFGGF+SILGVSEIATR
Sbjct: 306 STPWGYVSAIALCVATFGTIALMSGFFLKPGATFDDYLADVVPLFGGFVSILGVSEIATR 365

Query: 333 VTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAV 392
           VTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALF IPVARTASAYLTSL LAV
Sbjct: 366 VTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFGIPVARTASAYLTSLVLAV 425

Query: 393 SAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDP 452
           +AFV DG FNGGDNA+YIRPQFFYNNPLLSFIQFVIGPY+DDLGNVLPYAVEGVGVPVDP
Sbjct: 426 AAFVADGSFNGGDNALYIRPQFFYNNPLLSFIQFVIGPYTDDLGNVLPYAVEGVGVPVDP 485

Query: 453 LAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCL 512
           LAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFGR+TA LLSFATSL+LGIGGLSGSVLCL
Sbjct: 486 LAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFGRNTATLLSFATSLLLGIGGLSGSVLCL 545

Query: 513 AWGLFATFFRGGEEVPATDEITPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSPFFSAPF 572
           AWGLFATFFRGGEE+PA DEITPLGD+R+AWGVVLGLIC LTLFPNGGGTFSSPFFS PF
Sbjct: 546 AWGLFATFFRGGEEMPAKDEITPLGDNRFAWGVVLGLICFLTLFPNGGGTFSSPFFSDPF 605

BLAST of CSPI04G00110 vs. TrEMBL
Match: A0A059BCF2_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_G01188 PE=4 SV=1)

HSP 1 Score: 869.4 bits (2245), Expect = 2.5e-249
Identity = 447/543 (82.32%), Positives = 487/543 (89.69%), Query Frame = 1

Query: 33  PLGRRTGGYFFTISAPITEDR-LRFSARDDSESEP-SSSSIAVVSDERGGGNDNEMAELS 92
           P G+R        S P +  R L+ SA DD E+EP SSSS+A VS+  G  + +  A  +
Sbjct: 32  PFGQRARIPSSLSSRPSSSVRPLKSSAGDDRENEPASSSSVAAVSERPGREDADAEATET 91

Query: 93  AGEHGGEER--EKQQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELDREGANNP 152
           +GE   EER  EKQQEMDWK+DEEFK+FMG+PSIEAAIKLEKKRADRKLKELDRE ++NP
Sbjct: 92  SGESEEEERGREKQQEMDWKSDEEFKRFMGSPSIEAAIKLEKKRADRKLKELDRESSDNP 151

Query: 153 IVGLFNRIARDNLEKEKERLKKAEETFKALDLSKLRGCFGFNTFFATDVRRFGDGGIFIG 212
           IVGLFNRIARD+L KE+ERL+KAEE FKALDL+KLR CFGFNTFFATDVRRFGDGGIFIG
Sbjct: 152 IVGLFNRIARDSLAKERERLEKAEEAFKALDLNKLRSCFGFNTFFATDVRRFGDGGIFIG 211

Query: 213 NLRRPIEEVIPQLEKKLSEAAGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQFESTKLS 272
           NLRRPI+EVIP+LE KLSEAAGREVV+WFMEEK DDITKQ C+VQPK+E+DLQFEST+LS
Sbjct: 212 NLRRPIDEVIPKLESKLSEAAGREVVVWFMEEKADDITKQACVVQPKSEMDLQFESTRLS 271

Query: 273 TPLGYFSAITLCVATFGTIALMSGFFLKPGATFDDYIANVVPLFGGFISILGVSEIATRV 332
           TP GY SA+ LCV TFGTIALMSGFFLKP ATFDDY++NVVPLFGGF+SILGVSEIATRV
Sbjct: 272 TPWGYVSAVALCVTTFGTIALMSGFFLKPDATFDDYLSNVVPLFGGFVSILGVSEIATRV 331

Query: 333 TAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVS 392
           TAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLAL ++
Sbjct: 332 TAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALTIA 391

Query: 393 AFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPL 452
           AFV DG FNGGDNA+YIRPQFF+NNPLLSFIQFVIGPY+DDLGNVLPYAVEGVGVPVDPL
Sbjct: 392 AFVADGSFNGGDNALYIRPQFFFNNPLLSFIQFVIGPYTDDLGNVLPYAVEGVGVPVDPL 451

Query: 453 AFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCLA 512
           AFAGLLGMVVTSLNLLPCGRLEGGRIAQAMF RSTA LLSFATSL+LGIGGLSGSVLCLA
Sbjct: 452 AFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFSRSTATLLSFATSLLLGIGGLSGSVLCLA 511

Query: 513 WGLFATFFRGGEEVPATDEITPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSPFFSAPFF 572
           WGLFATFFRGGEE+PA DEITPLGDDR+AWG+VLGLIC LTLFPN GGTFSS FF +PFF
Sbjct: 512 WGLFATFFRGGEEIPAKDEITPLGDDRFAWGIVLGLICFLTLFPNVGGTFSSSFFESPFF 571

BLAST of CSPI04G00110 vs. TAIR10
Match: AT1G17870.1 (AT1G17870.1 ethylene-dependent gravitropism-deficient and yellow-green-like 3)

HSP 1 Score: 832.4 bits (2149), Expect = 1.7e-241
Identity = 423/523 (80.88%), Positives = 469/523 (89.67%), Query Frame = 1

Query: 54  LRFSARDDSESEP--SSSSIAVVSDERGGGNDNEMAELS--AGEHGGEEREKQQEMDWKT 113
           LR SA DD   EP   + S   +++E+   +DN  A  S  + E   E++ KQQEMDWKT
Sbjct: 49  LRCSAEDDRVREPVNEAPSPVALAEEQKEDHDNNNAPPSPESSEEEEEKKSKQQEMDWKT 108

Query: 114 DEEFKKFMGNPSIEAAIKLEKKRADRKLKELDREG-ANNPIVGLFNRIARDNLEKEKERL 173
           DEEFKKFMGNPSIEAAIKLEK R DRKLKEL++E  + NPI+G++N +ARD+L KEKERL
Sbjct: 109 DEEFKKFMGNPSIEAAIKLEKTRTDRKLKELNKESNSENPIIGIYNSLARDSLTKEKERL 168

Query: 174 KKAEETFKALDLSKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEA 233
           +KAEETFKALDL+KL+ CFGF+TFFATDVRRFGDGGIFIGNLR+PI+EV P+LE KLSEA
Sbjct: 169 EKAEETFKALDLNKLKSCFGFDTFFATDVRRFGDGGIFIGNLRKPIDEVTPKLEAKLSEA 228

Query: 234 AGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIA 293
           AGR+VV+WFMEE++++ITKQVCMVQPKAEIDLQFEST+LSTP GY SAI LCV TFGTIA
Sbjct: 229 AGRDVVVWFMEERSNEITKQVCMVQPKAEIDLQFESTRLSTPWGYVSAIALCVTTFGTIA 288

Query: 294 LMSGFFLKPGATFDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWT 353
           LMSGFFLKP ATFDDYIANVVPLFGGF+SILGVSEIATRVTAAR+GVKLSPSFLVPSNWT
Sbjct: 289 LMSGFFLKPDATFDDYIANVVPLFGGFLSILGVSEIATRVTAARHGVKLSPSFLVPSNWT 348

Query: 354 GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQ 413
           GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSL LA +AF+ DG FNGGDNA+YIRPQ
Sbjct: 349 GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLLLAAAAFISDGSFNGGDNALYIRPQ 408

Query: 414 FFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR 473
           FF NNPLLSF+QFV+GPY+DDLGNVLP AVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR
Sbjct: 409 FFDNNPLLSFVQFVVGPYADDLGNVLPNAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR 468

Query: 474 LEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEI 533
           LEGGRIAQAMFGRSTAA+LSF TSL+LGIGGLSGSVLCLAWGLFATFFRGGEE PA DEI
Sbjct: 469 LEGGRIAQAMFGRSTAAILSFTTSLLLGIGGLSGSVLCLAWGLFATFFRGGEETPAKDEI 528

Query: 534 TPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSPFFSAPFFRGD 572
           TP+GDDR+AWG+VLGLIC LTLFPN GGTFS+ FF+ PFFRGD
Sbjct: 529 TPVGDDRFAWGIVLGLICFLTLFPNSGGTFSTSFFNGPFFRGD 571

BLAST of CSPI04G00110 vs. TAIR10
Match: AT5G35220.1 (AT5G35220.1 Peptidase M50 family protein)

HSP 1 Score: 119.4 bits (298), Expect = 7.3e-27
Identity = 100/394 (25.38%), Positives = 172/394 (43.65%), Query Frame = 1

Query: 187 FGFNTFFATDVRRFGDGG---IFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEEKTD 246
           FG++TF+ T    FGD G   +F+GNLR   E+V  +L++KL E A  +  L+ +EE   
Sbjct: 152 FGYSTFWVTKEEPFGDLGEGILFLGNLRGKKEDVFAKLQRKLVEVASDKYNLFMIEEPNS 211

Query: 247 DITKQVCMVQPKAEIDLQFESTKLSTP-----LGYFSAITLCVATFGTIALMS------- 306
           +        +  A +       ++S P       Y  A+ L + T G+   +        
Sbjct: 212 EGPDP----RGGARVSFGLLRKEVSEPGPTTLWQYVIALILFLLTIGSSVELGIASQINR 271

Query: 307 ------GFFLKPGAT-------FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLS 366
                  +F  P A           ++   +PL  G + IL   E+   + A    VKLS
Sbjct: 272 LPPEVVKYFTDPNAVEPPDMELLYPFVDAALPLAYGVLGILLFHELGHFLAAVPKKVKLS 331

Query: 367 PSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNG 426
             + +P+   G  G +  ++S+LP++    DI +A   +    S+++      +    + 
Sbjct: 332 IPYFIPNITLGSFGAITQFKSILPDRSTKVDISLAGPFAGAALSVSMFAVGLFLSTEPDA 391

Query: 427 GDNAMYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVV 486
            ++ + +    F  + LL  I      Y+         A+    V + PL  AG  G+  
Sbjct: 392 ANDLVQVPSMLFQGSLLLGLISRATLGYA---------ALHAATVSIHPLVIAGWCGLTT 451

Query: 487 TSLNLLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRG 546
           T+ N+LP G L+GGR  Q  FG++       +T ++LG+  L G  L L WGL+    + 
Sbjct: 452 TAFNMLPVGCLDGGRAVQGAFGKNALVTFGLSTYVMLGLRVLGGP-LALPWGLYVLICQR 511

Query: 547 GEEVPATDEITPLGDDRYAWGVVLGLICLLTLFP 553
             E P  +++T +G  R A   +  ++ +LTL P
Sbjct: 512 TPEKPCLNDVTEVGTWRKALVGIALILVVLTLLP 531

BLAST of CSPI04G00110 vs. TAIR10
Match: AT5G05740.1 (AT5G05740.1 ethylene-dependent gravitropism-deficient and yellow-green-like 2)

HSP 1 Score: 99.0 bits (245), Expect = 1.0e-20
Identity = 95/374 (25.40%), Positives = 163/374 (43.58%), Query Frame = 1

Query: 183 LRG-CFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEEK 242
           LRG  FGF+TFF T    +  G +F GNLR        +++ ++    G +  L+ +   
Sbjct: 187 LRGQVFGFDTFFVTSQEPYEGGVLFKGNLRGKPATSYEKIKTRMENNFGDQYKLFLLTNP 246

Query: 243 TDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGATF 302
            DD  K V +V P+  ++ +  +       G F  + L       +  +    L   + F
Sbjct: 247 EDD--KPVAVVVPRRSLEPETTAVPEWFAAGSFGLVALFTLFLRNVPALQSDLL---SAF 306

Query: 303 DDYIANVVPLFGGFIS--ILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYES 362
           D+       L G  ++  +LGV E+   + A   G+KL   F VPS   G  G +     
Sbjct: 307 DNLELLKDGLPGALVTALVLGVHELGHILVANSLGIKLGVPFFVPSWQIGSFGAITR--- 366

Query: 363 LLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDN-AMYIRPQFFYNNPLLSF 422
            + N  A  +  +   A+  L   +L +  F+I       D   + +    F+ + L   
Sbjct: 367 -IKNIVAKREDLLKVAAAGPLAGFSLGLILFLIGLFVPPSDGIGVVVDASVFHESFLAGG 426

Query: 423 IQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAM 482
           I  ++      LG+ L    EG  + ++PL      G+++  +N +P G L+GG+IA ++
Sbjct: 427 IAKLL------LGDALK---EGTSISLNPLVIWAWAGLLINGINSIPAGELDGGKIAFSI 486

Query: 483 FGRSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAW 542
           +GR TA  L+ A+  +LG+  L   V    W +   F + G   P  +EIT   D   + 
Sbjct: 487 WGRKTATRLTGASIALLGLSALFSDV-AFYWVVLIFFLQRGPIAPLAEEITVPDDKYVSL 541

Query: 543 GVVLGLICLLTLFP 553
           G+++  + LL   P
Sbjct: 547 GILVLFLSLLVCLP 541

BLAST of CSPI04G00110 vs. NCBI nr
Match: gi|449465097|ref|XP_004150265.1| (PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Cucumis sativus])

HSP 1 Score: 1121.3 bits (2899), Expect = 0.0e+00
Identity = 570/572 (99.65%), Positives = 571/572 (99.83%), Query Frame = 1

Query: 1   MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTISAPITEDRLRFSARD 60
           MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTI APITEDRLRFSARD
Sbjct: 1   MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTIFAPITEDRLRFSARD 60

Query: 61  DSESEPSSSSIAVVSDERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNP 120
           DSESEPSSSSIAVVSDERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNP
Sbjct: 61  DSESEPSSSSIAVVSDERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNP 120

Query: 121 SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERLKKAEETFKALDL 180
           SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERL+KAEETFKALDL
Sbjct: 121 SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERLEKAEETFKALDL 180

Query: 181 SKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE 240
           SKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE
Sbjct: 181 SKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE 240

Query: 241 KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT 300
           KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT
Sbjct: 241 KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT 300

Query: 301 FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL 360
           FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL
Sbjct: 301 FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL 360

Query: 361 LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ 420
           LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ
Sbjct: 361 LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ 420

Query: 421 FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG 480
           FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG
Sbjct: 421 FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG 480

Query: 481 RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV 540
           RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV
Sbjct: 481 RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV 540

Query: 541 VLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 573
           VLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL
Sbjct: 541 VLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 572

BLAST of CSPI04G00110 vs. NCBI nr
Match: gi|659108849|ref|XP_008454418.1| (PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Cucumis melo])

HSP 1 Score: 1095.5 bits (2832), Expect = 0.0e+00
Identity = 556/572 (97.20%), Positives = 563/572 (98.43%), Query Frame = 1

Query: 1   MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTISAPITEDRLRFSARD 60
           MA LSIASNSWFISH+E+HYTSRTMAKPFGKIPLGR+T GYFFTIS PITE+RLRFSARD
Sbjct: 1   MATLSIASNSWFISHKERHYTSRTMAKPFGKIPLGRKTDGYFFTISTPITENRLRFSARD 60

Query: 61  DSESEPSSSSIAVVSDERGGGNDNEMAELSAGEHGGEEREKQQEMDWKTDEEFKKFMGNP 120
           DSESE SSSSIAVVSDERGGGNDNE AELSAGEH  EEREKQQEMDWKTDEEFKKFMGNP
Sbjct: 61  DSESESSSSSIAVVSDERGGGNDNEKAELSAGEHESEEREKQQEMDWKTDEEFKKFMGNP 120

Query: 121 SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERLKKAEETFKALDL 180
           SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERL+KAEETFKALDL
Sbjct: 121 SIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERLEKAEETFKALDL 180

Query: 181 SKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE 240
           +KL+ CFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE
Sbjct: 181 NKLKSCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEE 240

Query: 241 KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT 300
           KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT
Sbjct: 241 KTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGAT 300

Query: 301 FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL 360
           FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL
Sbjct: 301 FDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESL 360

Query: 361 LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ 420
           LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ
Sbjct: 361 LPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQ 420

Query: 421 FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG 480
           FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG
Sbjct: 421 FVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFG 480

Query: 481 RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV 540
           RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV
Sbjct: 481 RSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGV 540

Query: 541 VLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 573
           VLGLIC LTLFPNGGGTFSSPFFSAPFFRGDL
Sbjct: 541 VLGLICFLTLFPNGGGTFSSPFFSAPFFRGDL 572

BLAST of CSPI04G00110 vs. NCBI nr
Match: gi|225429195|ref|XP_002271890.1| (PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Vitis vinifera])

HSP 1 Score: 888.3 bits (2294), Expect = 7.3e-255
Identity = 454/551 (82.40%), Positives = 493/551 (89.47%), Query Frame = 1

Query: 28  PFGKIPLGRRTGGYFFTISAPITEDRLRFSARDDSESEP-SSSSIAVVSDERGGGNDNEM 87
           P  K P GRRT   F   S P +   LR S +DD E +P SSSS+A+VS+++  G+D + 
Sbjct: 29  PLPKHPFGRRTHLPFSLTSTPRSIKPLRSSNKDDPEGQPPSSSSVALVSEKQDDGDDAQK 88

Query: 88  AELSAGEHGGEE-----REKQQEMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELD 147
           + L+A    GEE     RE QQEMDWK DEEFKKFMGNPSIEAAIKLEKKRADRKLKELD
Sbjct: 89  SGLAAEVELGEENDSGERENQQEMDWKLDEEFKKFMGNPSIEAAIKLEKKRADRKLKELD 148

Query: 148 REGANNPIVGLFNRIARDNLEKEKERLKKAEETFKALDLSKLRGCFGFNTFFATDVRRFG 207
           RE ++NP+VGLFNR+ RD+L +EKERL+KAEE FKALDL+KL+ CFGF+TF+ATDVRRFG
Sbjct: 149 RESSDNPVVGLFNRVVRDSLAREKERLEKAEEAFKALDLNKLKNCFGFDTFYATDVRRFG 208

Query: 208 DGGIFIGNLRRPIEEVIPQLEKKLSEAAGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQ 267
           DGGIFIGNLRRPIEEVIP+LEKKLSEAAGREVVLWFMEEK +DITKQVCMVQPKAE+DLQ
Sbjct: 209 DGGIFIGNLRRPIEEVIPKLEKKLSEAAGREVVLWFMEEKANDITKQVCMVQPKAEMDLQ 268

Query: 268 FESTKLSTPLGYFSAITLCVATFGTIALMSGFFLKPGATFDDYIANVVPLFGGFISILGV 327
           FESTKLSTP GY S+I LCVATFGTIALMSGFFLKP ATFDDY+A+VVPLF GF++ILGV
Sbjct: 269 FESTKLSTPWGYISSIVLCVATFGTIALMSGFFLKPNATFDDYLADVVPLFSGFVTILGV 328

Query: 328 SEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLT 387
           SEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAY+T
Sbjct: 329 SEIATRVTAARYGVKLSPSFLVPSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYIT 388

Query: 388 SLALAVSAFVIDGGFNGGDNAMYIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGV 447
           SL LAV+AF+ DG FNGGDNA+YIRPQFFYNNPLLSFIQFVIGPY+DDLGNVLPYAVEGV
Sbjct: 389 SLVLAVAAFIADGSFNGGDNALYIRPQFFYNNPLLSFIQFVIGPYTDDLGNVLPYAVEGV 448

Query: 448 GVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGGLS 507
           GVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQA+FGR+ A LLSF TSL+LGIGGLS
Sbjct: 449 GVPVDPLAFAGLLGMVVTSLNLLPCGRLEGGRIAQALFGRNIATLLSFGTSLLLGIGGLS 508

Query: 508 GSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSP 567
           GSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWG VL LIC LTLFPNGGGTFSS 
Sbjct: 509 GSVLCLAWGLFATFFRGGEEVPATDEITPLGDDRYAWGFVLALICFLTLFPNGGGTFSSS 568

Query: 568 FFSAPFFRGDL 573
           FFS PFFRGDL
Sbjct: 569 FFSDPFFRGDL 579

BLAST of CSPI04G00110 vs. NCBI nr
Match: gi|1009165902|ref|XP_015901296.1| (PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 888.3 bits (2294), Expect = 7.3e-255
Identity = 463/589 (78.61%), Positives = 506/589 (85.91%), Query Frame = 1

Query: 1   MAALSIASNSWFISHREKHYT---SRTMAKPFGKIPLGRRTGGYFFTISA---PITEDRL 60
           MA LSI ++S   S   K+     ++      G   +  +     FT S    PIT   L
Sbjct: 1   MATLSITTHSSVYSWSPKNMNMNMNKNSPPTLGSSFIRNQFKSKHFTFSRNSKPITRKSL 60

Query: 61  RFSARDDSESEP---SSSSIAVVSDERGGGNDNEMAELSA--------GEHGGEEREKQQ 120
           R   +   E+E    SSSS  VVS++    ND   +ELSA        GE+G +E EKQQ
Sbjct: 61  RVFIKAQQENETTSSSSSSATVVSEKPNDDNDARKSELSAEEGETGKEGEYGPDEAEKQQ 120

Query: 121 EMDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEK 180
           E+DWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELDRE + NP++GLFNRI RDNL +
Sbjct: 121 EIDWKTDEEFKKFMGNPSIEAAIKLEKKRADRKLKELDRESSGNPVLGLFNRIVRDNLTR 180

Query: 181 EKERLKKAEETFKALDLSKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEK 240
           EKERL+KAEETF+ALDL+KLR CFGF+TFFATDVRRFGDGGIFIGNLR+PIEEVIP+LEK
Sbjct: 181 EKERLEKAEETFRALDLNKLRSCFGFDTFFATDVRRFGDGGIFIGNLRKPIEEVIPKLEK 240

Query: 241 KLSEAAGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVAT 300
           KLSEAA R+VV+WFMEEKT+DITKQVCMVQPK E+DLQFESTKLSTP GY SAI LCVAT
Sbjct: 241 KLSEAAERDVVIWFMEEKTNDITKQVCMVQPKTEMDLQFESTKLSTPWGYVSAIVLCVAT 300

Query: 301 FGTIALMSGFFLKPGATFDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLV 360
           FGTIALMSGFFLKP AT+DDY+ANVVPLFGGFISILGVSEIATR+TAA+YGVKLSPSFLV
Sbjct: 301 FGTIALMSGFFLKPDATWDDYLANVVPLFGGFISILGVSEIATRLTAAQYGVKLSPSFLV 360

Query: 361 PSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAM 420
           PSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAV+AF+ DG FNGGDNA+
Sbjct: 361 PSNWTGCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVAAFIADGSFNGGDNAL 420

Query: 421 YIRPQFFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNL 480
           YIRPQFFYNNPLLSFIQFVIGPY+DDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNL
Sbjct: 421 YIRPQFFYNNPLLSFIQFVIGPYTDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNL 480

Query: 481 LPCGRLEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVP 540
           LPCGRLEGGRIAQAMFGR+TA LLSFATSL+LGIGGLSGSVLCLAWGLFATFFRGGEE+P
Sbjct: 481 LPCGRLEGGRIAQAMFGRNTATLLSFATSLLLGIGGLSGSVLCLAWGLFATFFRGGEEIP 540

Query: 541 ATDEITPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 573
           A DEITPLGD+RYAWG+VLGLIC LTLFPNGGGTFSSPFFS PFFRGD+
Sbjct: 541 AKDEITPLGDNRYAWGIVLGLICFLTLFPNGGGTFSSPFFSDPFFRGDM 589

BLAST of CSPI04G00110 vs. NCBI nr
Match: gi|645261367|ref|XP_008236259.1| (PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Prunus mume])

HSP 1 Score: 885.9 bits (2288), Expect = 3.6e-254
Identity = 456/584 (78.08%), Positives = 510/584 (87.33%), Query Frame = 1

Query: 1   MAALSIASNSWFISHREKHYTSRTMAKPFGKIPLGRRTGGYFFTISAPITEDRLR--FSA 60
           MA++S+AS     S   K   SR +   F KIP G++T  +  + S+  T  R R  FS 
Sbjct: 1   MASVSVASRYTVSSWPPKTNNSRAVVSSFTKIPFGKKTQYFSLSPSSNPTSKRRRLSFSM 60

Query: 61  RDDSESEPSSSSIA--VVSDERGGGNDNEMAELSA-----GEHGG---EEREKQQEMDWK 120
           ++D E+EPSSSS +  V++++    +D + ++L A     G+  G   EE+E+QQEMDWK
Sbjct: 61  KNDQENEPSSSSSSAVVITEKPSDDSDTQQSKLPAEEVETGKESGSESEEKERQQEMDWK 120

Query: 121 TDEEFKKFMGNPSIEAAIKLEKKRADRKLKELDREGANNPIVGLFNRIARDNLEKEKERL 180
           TDEEFKKFMGNPSIEAAIKLEKKRADRKLK+LDRE + NP+VGLFN+I RD+L +EKERL
Sbjct: 121 TDEEFKKFMGNPSIEAAIKLEKKRADRKLKDLDRESSGNPLVGLFNKILRDSLTREKERL 180

Query: 181 KKAEETFKALDLSKLRGCFGFNTFFATDVRRFGDGGIFIGNLRRPIEEVIPQLEKKLSEA 240
           +KAEE FKA+DL+KL+ CFGF++FF TDVRRFGDGGIF+GNLRRPIEEV+P+LE+KLS+A
Sbjct: 181 EKAEEAFKAIDLNKLKSCFGFDSFFPTDVRRFGDGGIFVGNLRRPIEEVMPKLEQKLSDA 240

Query: 241 AGREVVLWFMEEKTDDITKQVCMVQPKAEIDLQFESTKLSTPLGYFSAITLCVATFGTIA 300
           AGREVVLWFMEE T+DI KQVCMVQPKAEIDLQFESTKLSTPLGY SA+ LCVATFGTIA
Sbjct: 241 AGREVVLWFMEENTNDIRKQVCMVQPKAEIDLQFESTKLSTPLGYVSAVALCVATFGTIA 300

Query: 301 LMSGFFLKPGATFDDYIANVVPLFGGFISILGVSEIATRVTAARYGVKLSPSFLVPSNWT 360
           LMSGFFLKP AT+DDY+A+VVPLFGGFISILGVSEIATRVTAAR+GVKLSPSFLVPSNWT
Sbjct: 301 LMSGFFLKPDATWDDYLADVVPLFGGFISILGVSEIATRVTAARHGVKLSPSFLVPSNWT 360

Query: 361 GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLALAVSAFVIDGGFNGGDNAMYIRPQ 420
           GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSL L VSAFV DG FNGGDNA+YIRPQ
Sbjct: 361 GCLGVMNNYESLLPNKKALFDIPVARTASAYLTSLVLTVSAFVADGSFNGGDNALYIRPQ 420

Query: 421 FFYNNPLLSFIQFVIGPYSDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR 480
           FFYNNPLLSF+QFVIGPY+DDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR
Sbjct: 421 FFYNNPLLSFVQFVIGPYTDDLGNVLPYAVEGVGVPVDPLAFAGLLGMVVTSLNLLPCGR 480

Query: 481 LEGGRIAQAMFGRSTAALLSFATSLVLGIGGLSGSVLCLAWGLFATFFRGGEEVPATDEI 540
           LEGGRIAQAMFGR TA LLSFATSL+LGIGGLSGSVLCLAWGLFATFFRGGEE+PA DEI
Sbjct: 481 LEGGRIAQAMFGRGTATLLSFATSLLLGIGGLSGSVLCLAWGLFATFFRGGEEMPAKDEI 540

Query: 541 TPLGDDRYAWGVVLGLICLLTLFPNGGGTFSSPFFSAPFFRGDL 573
           TPLGDDR+AWG VLGLIC LTLFPN GGTFSS FFSAP+FRGDL
Sbjct: 541 TPLGDDRFAWGCVLGLICFLTLFPNSGGTFSSSFFSAPYFRGDL 584
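
Each HSP header above reports identities and positives as a fraction followed by a percentage. The following is a minimal Python sketch for parsing such a header line and recomputing the percentages from the fractions. The function name `parse_hsp_stats` and the regular expression are assumptions made for illustration; they are not part of the database's own tooling.

```python
import re

# Hypothetical helper for illustration only (not part of the source database).
# The pattern also tolerates the misspelling "Postives" that appears in some
# exported BLAST report pages.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse an HSP summary line and recompute percentages from the fractions."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP identity/positives line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    header = "Identity = 423/523 (80.88%), Positives = 469/523 (89.67%), Query Frame = 1"
    stats = parse_hsp_stats(header)
    print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick consistency check when hit statistics are copied between tables.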

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
EGY3_ARATH	3.0e-240	80.88	Probable zinc metallopeptidase EGY3, chloroplastic OS=Arabidopsis thaliana GN=EG...
EGY3_ORYSJ	3.7e-222	73.92	Probable zinc metalloprotease EGY3, chloroplastic OS=Oryza sativa subsp. japonic...
EGY3_ORYSI	1.1e-221	73.73	Probable zinc metalloprotease EGY3, chloroplastic OS=Oryza sativa subsp. indica ...
EGY1_ORYSJ	3.4e-26	24.90	Probable zinc metalloprotease EGY1, chloroplastic OS=Oryza sativa subsp. japonic...
EGY1_ARATH	1.3e-25	25.38	Probable zinc metalloprotease EGY1, chloroplastic OS=Arabidopsis thaliana GN=EGY...
Match Name	E-value	Identity	Description
A0A0A0KUZ7_CUCSA	0.0e+00	99.65	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000600 PE=4 SV=1
E5GBH3_CUCME	0.0e+00	97.20	Sterol regulatory element-binding protein site 2 protease OS=Cucumis melo subsp....
A5ALY0_VITVI	5.1e-255	82.40	Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_010704 PE=4 SV=1
A0A061E1K2_THECC	1.4e-249	82.20	Ethylene-dependent gravitropism-deficient and yellow-green-like 3 OS=Theobroma c...
A0A059BCF2_EUCGR	2.5e-249	82.32	Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_G01188 PE=4 SV=1
Match Name	E-value	Identity	Description
AT1G17870.1	1.7e-241	80.88	ethylene-dependent gravitropism-deficient and yellow-green-like 3
AT5G35220.1	7.3e-27	25.38	Peptidase M50 family protein
AT5G05740.1	1.0e-20	25.40	ethylene-dependent gravitropism-deficient and yellow-green-like 2
Match Name	E-value	Identity	Description
gi|449465097|ref|XP_004150265.1|	0.0e+00	99.65	PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Cucumis sativus]
gi|659108849|ref|XP_008454418.1|	0.0e+00	97.20	PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Cucumis melo]
gi|225429195|ref|XP_002271890.1|	7.3e-255	82.40	PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Vitis vinifera]
gi|1009165902|ref|XP_015901296.1|	7.3e-255	78.61	PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Ziziphus jujuba]
gi|645261367|ref|XP_008236259.1|	3.6e-254	78.08	PREDICTED: probable zinc metallopeptidase EGY3, chloroplastic [Prunus mume]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006457	protein folding
biological_process	GO:0006508	proteolysis
biological_process	GO:0009408	response to heat
biological_process	GO:0009644	response to high light intensity
biological_process	GO:0042542	response to hydrogen peroxide
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0004177	aminopeptidase activity
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI04G00110.1	CSPI04G00110.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	unknown	Coil	Coil	coord: 123..143, score: -; coord: 154..181, score:
None	No IPR available	PANTHER	PTHR31412	FAMILY NOT NAMED	coord: 57..570, score:
None	No IPR available	PANTHER	PTHR31412:SF2	ZINC METALLOPEPTIDASE EGY3, CHLOROPLASTIC-RELATED	coord: 57..570, score: