Tan0005274 (gene) Snake gourd v1

Overview
Name: Tan0005274
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Thylakoidal processing peptidase 1
Location: LG02: 87426625 .. 87429795 (-)
RNA-Seq Expression: Tan0005274
Synteny: Tan0005274
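
The Location field above gives a start and end on linkage group LG02, with `(-)` marking the reverse strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, though this page does not state it), the span of the gene can be derived as follows. `parse_location` and `span_length` are hypothetical helpers written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'LG02: 87426625 .. 87429795 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("LG02: 87426625 .. 87429795 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 3,171 bp on the reverse strand of LG02.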
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTATTCGTGTTACTCTCTCCTACTCTGGCTATGTCGCCCAAAACCTAGCCTCCTCCACCGGCCTTCGGGCCGGTAACTGCCGCGTCTTTCAGGAGTGTTGGGTCCGATCTTGTATTTTTGGCTCGAGCCATCATCCGGACTTCAAATCTGCTGGAAGCGCGCGCAATTATCGGTCCGATAGCCGGCGGTTCAGGCCGAGTTGCTCTGTTGAGAAGCCAGCTTCTATGTACAGCACTTTCGCCGGAGAAATGGTTGGGGAAAACCCCAAGAGTCCCATGGTTCTGGGTTTGATGTCGATGCTGAAATCGATGGCCTCTTCTTCTGGTTCTTCTTCGATCTCCACGGGGATTTTTGGGGTTTCTTCGTTTAAAGCCACTTCGATTATACCCTTTTTACAAGGATCGAAGTGGCTTCCCGGTTACGATATACGATCTCATGTAAGCGACGATGTGGATAAAGGCGGGACAGTTTGTTGTGATTATGATGAAAGTGGGAGTAATCAGTTTTATGAGAATGATTTTGAGAAGAGCAGCTGGGTTTCACGCTTGTTGAGTACCTATTCCGAGGATGCAAAGGCTCTCTTTACGGCTTTGACTGTTAGTGTGCTCTTTAAATCGTTCTTGGCGGAGCCGAAATCCATTCCTTCTTCTTCCATGTATCCTACTCTGGAAGTGGGGGATCGTGTTTTAGCGGAAAAGGTGATCCTTAGTTCATTTAGTTGAATCTCTGAAATTGAATTGTTGATATCTTCTAATATGTTGACTGTATTATTTATTAGTGCGCTGTTTTAGAATGTGTTCAACTACATGAAACCATTTAACAATGATATGATGCTTCCAAATAAATTTCGATAATTAATTTATGATCTACTATTAGTAAGGTATAGATGACATACTTCTTTAATTACTATAATTACCAATTCACCATATTCATATGCCATTCTCCTAACAATCATAAGACAAAAGAATTGTTTATTTCTTGTTACAGTTTTCAAAATGATTCATTCGGAATAGTGTTGATTGCTCTCCTGTATTAATGTCATAAAGTTGCCTTTTCTGTTCCTCCACTGATCTTTATTCCTTTTCCTCCTGATACTAGGTTTCGTACATTTTCAGGAAACCTGAAGTTTCAGATATAGTGATCTTTAAAGCTCCTAAAATCTTGCAGGTATTACTGTAATTAATGAGGTTCTTTTGAGGAAATTTTCCTGTTTAATAATGTTGGATGCTGATATTTCTTGCACCTTTTTTGCACCCCATTTAGGAAGTTGGAGTTAGTTCAAGTGAGGTGTTTATTAAGAGAGTTGTAGCTACGTCTGGGGACGTGGTTGAAGTAAGTGTTCTTCAATCTTATAGCATCATTTCTTTAATATGCAATAATTCCATGCTATGTTCATTATCCAAAAAAAAATAAAAATAAATTCTATGCTTTGTTATCTGATTCATATTGGTAAGCAAATAAGTAAAATAACTTATTAGCCTAAATTTGAAGTAACATCAGAGACTATAAATCTACCATTATGAATATAAATGAGGTTATTAAAATAGCTACCATCTGGGATAGATATTAGGTGACATGAAGATTCTGGAGCCATTAGTCACTTAGCTCAGCATCCAGCACTGTAGGTTGTTTCTTACAATTAATTTGCCTGAGAGCCTGGGTATGAATCCTTATGGCTAGATTTTCATTTGATTCAGATTATAGATTCGATCAGTCTTGTGCCATATGATAAAATATTTACTAGATAAAACATCTCCTCCTTTGGATTTTCAAAGTTTCTTCTGATTTCTTGATGGACCCATTCATATTGTCTTAGAGGTTTGTTGGCTTCATGTGTAGGAACTAAATGACATTTCAATCCTAAAAACTTCGTCTTGAGCAATCTGATGGGAATATAATTGATGATAATTGGTCAAAGGAAGTTTGGAGTCAAAGTTACAGTACTGATGAATGTTTGCAACAGTTAAAGCCTATGTAGATAAAATTCAAGAGAGAG
AAAAAGCTGAAAATGAATGTTTTGTCCTTTCAGTTTCATCTCTCTAATAAGAGTGTATGGGCTAGTTTGGACACTCACTTATTTTAGGGATAACTGTACTACATTTATTTTCAATGGAGTTTGTCTCTTGTTAAAAAAAAAGAATTTATTTGCAATGGAAACTGGTAGGATATTAAAACTTAAGTAGGTGACTAGCTAGCTACTATAGTTTGAACCTATTCCCTCTATGCTCTTCACTTATTTTTATTGGCTGGGGTGACCACTTGGTCAACTCACGATATAGATAAGTTCCTCTTCCAACTTAACTTTGATGGTTATGTTCTTTTTTCTTCTAAAAGTACACCATTAAATAAGATATATTCTATGCTCTGTAACTAATTAATGGTTGTAAGACAGATATCTCCAATGCTGGAGATTAATAGGGAACCGACTATAGTTACAATTTGAGGAGTTGCTCAACACTAACCAATAACCATGGTCCATTAGTAAAAAATAGGATTCTCAAACCATTGGTAGTTGCTCAACACTAAACATTTTGATCGGTGTTCCATAGGCATTTGAGATCTATATCTTAAACTTTAGTCATCTTCTTTTGAACTTCTGACTGGAATTAGTGTGGTTCTTCTTTACGTTAATTACTTGATGCAATGTTGTATGGTAGGTTCGGAATGGGAAATTGGTGGTAAATGGTGTTGTTCAAGATGAGGACTTCGTCTTAGAGCCAATTGCTTATGAGATGGATCCAATGGTAATTGTCTTTCCTTGTTAACCCTCTCCCTCTCTTTGGAGGGGAAGATAATAAACCTAAATTCTGTTCAATGGGACAGTTTGGTTAGTCTAATTATCTTGTTTTCTTCTGCAGTTTGTGCCTGAAGGTTATGTGTATGTAATGGGAGACAACCGTAACAATAGTTGTGATTCTCATGACTGGTAATTTCACCATCTCAGTTCTCTTGTTGTTTAATTTGAAACACAAGGTCGCACATATGCACGTGTGCCATTTTTTATGATACTATGGTATATCTGATTCTCTTTCTTCTTCTCCTCCAGGGGTCCGCTCCCAATAGAAAATATTGTAGGTAGATCATTATTCAAGTATTGGCCTCCCCCCAAAGGATCCAGCATGGTAGATGAACCACATGCAAGGAAGATTAATCTGGGGATTTCCTGA

mRNA sequence

ATGGCTATTCGTGTTACTCTCTCCTACTCTGGCTATGTCGCCCAAAACCTAGCCTCCTCCACCGGCCTTCGGGCCGGTAACTGCCGCGTCTTTCAGGAGTGTTGGGTCCGATCTTGTATTTTTGGCTCGAGCCATCATCCGGACTTCAAATCTGCTGGAAGCGCGCGCAATTATCGGTCCGATAGCCGGCGGTTCAGGCCGAGTTGCTCTGTTGAGAAGCCAGCTTCTATGTACAGCACTTTCGCCGGAGAAATGGTTGGGGAAAACCCCAAGAGTCCCATGGTTCTGGGTTTGATGTCGATGCTGAAATCGATGGCCTCTTCTTCTGGTTCTTCTTCGATCTCCACGGGGATTTTTGGGGTTTCTTCGTTTAAAGCCACTTCGATTATACCCTTTTTACAAGGATCGAAGTGGCTTCCCGGTTACGATATACGATCTCATGTAAGCGACGATGTGGATAAAGGCGGGACAGTTTGTTGTGATTATGATGAAAGTGGGAGTAATCAGTTTTATGAGAATGATTTTGAGAAGAGCAGCTGGGTTTCACGCTTGTTGAGTACCTATTCCGAGGATGCAAAGGCTCTCTTTACGGCTTTGACTGTTAGTGTGCTCTTTAAATCGTTCTTGGCGGAGCCGAAATCCATTCCTTCTTCTTCCATGTATCCTACTCTGGAAGTGGGGGATCGTGTTTTAGCGGAAAAGGTTTCGTACATTTTCAGGAAACCTGAAGTTTCAGATATAGTGATCTTTAAAGCTCCTAAAATCTTGCAGGAAGTTGGAGTTAGTTCAAGTGAGGTGTTTATTAAGAGAGTTGTAGCTACGTCTGGGGACGTGGTTGAAGTTCGGAATGGGAAATTGGTGGTAAATGGTGTTGTTCAAGATGAGGACTTCGTCTTAGAGCCAATTGCTTATGAGATGGATCCAATGTTTGTGCCTGAAGGTTATGTGTATGTAATGGGAGACAACCGTAACAATAGTTGTGATTCTCATGACTGGGGTCCGCTCCCAATAGAAAATATTGTAGGTAGATCATTATTCAAGTATTGGCCTCCCCCCAAAGGATCCAGCATGGTAGATGAACCACATGCAAGGAAGATTAATCTGGGGATTTCCTGA

Coding sequence (CDS)

ATGGCTATTCGTGTTACTCTCTCCTACTCTGGCTATGTCGCCCAAAACCTAGCCTCCTCCACCGGCCTTCGGGCCGGTAACTGCCGCGTCTTTCAGGAGTGTTGGGTCCGATCTTGTATTTTTGGCTCGAGCCATCATCCGGACTTCAAATCTGCTGGAAGCGCGCGCAATTATCGGTCCGATAGCCGGCGGTTCAGGCCGAGTTGCTCTGTTGAGAAGCCAGCTTCTATGTACAGCACTTTCGCCGGAGAAATGGTTGGGGAAAACCCCAAGAGTCCCATGGTTCTGGGTTTGATGTCGATGCTGAAATCGATGGCCTCTTCTTCTGGTTCTTCTTCGATCTCCACGGGGATTTTTGGGGTTTCTTCGTTTAAAGCCACTTCGATTATACCCTTTTTACAAGGATCGAAGTGGCTTCCCGGTTACGATATACGATCTCATGTAAGCGACGATGTGGATAAAGGCGGGACAGTTTGTTGTGATTATGATGAAAGTGGGAGTAATCAGTTTTATGAGAATGATTTTGAGAAGAGCAGCTGGGTTTCACGCTTGTTGAGTACCTATTCCGAGGATGCAAAGGCTCTCTTTACGGCTTTGACTGTTAGTGTGCTCTTTAAATCGTTCTTGGCGGAGCCGAAATCCATTCCTTCTTCTTCCATGTATCCTACTCTGGAAGTGGGGGATCGTGTTTTAGCGGAAAAGGTTTCGTACATTTTCAGGAAACCTGAAGTTTCAGATATAGTGATCTTTAAAGCTCCTAAAATCTTGCAGGAAGTTGGAGTTAGTTCAAGTGAGGTGTTTATTAAGAGAGTTGTAGCTACGTCTGGGGACGTGGTTGAAGTTCGGAATGGGAAATTGGTGGTAAATGGTGTTGTTCAAGATGAGGACTTCGTCTTAGAGCCAATTGCTTATGAGATGGATCCAATGTTTGTGCCTGAAGGTTATGTGTATGTAATGGGAGACAACCGTAACAATAGTTGTGATTCTCATGACTGGGGTCCGCTCCCAATAGAAAATATTGTAGGTAGATCATTATTCAAGTATTGGCCTCCCCCCAAAGGATCCAGCATGGTAGATGAACCACATGCAAGGAAGATTAATCTGGGGATTTCCTGA

Protein sequence

MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRSDSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFGVSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSWVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFRKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLEPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVDEPHARKINLGIS
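
The protein sequence above corresponds to the CDS read codon by codon up to the stop codon. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and is applied only to the first ten and last eight codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTATTCGTGTTACTCTCTCCTACTCT"))  # start of the CDS
print(translate("AAGATTAATCTGGGGATTTCCTGA"))        # end of the CDS, with stop
```

The two fragments translate to `MAIRVTLSYS` and `KINLGIS`, matching the first and last residues of the protein sequence listed above.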
Homology
BLAST of Tan0005274 vs. ExPASy Swiss-Prot
Match: O04348 (Thylakoidal processing peptidase 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TPP1 PE=2 SV=2)

HSP 1 Score: 370.2 bits (949), Expect = 2.8e-101
Identity = 212/361 (58.73%), Positives = 257/361 (71.19%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIR+T +YS +VA+NL    G R G      E  VR   F  SH  DF    S RN   
Sbjct: 1   MAIRITFTYSTHVARNL---VGTRVGPGGYCFESLVRPRFF--SHKRDFDR--SPRN--- 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
                       +PASMY + A E++GE  +SP+V+GL+S+LK   S++G  S +  + G
Sbjct: 61  ------------RPASMYGSIARELIGEGSQSPLVMGLISILK---STTGHESSTMNVLG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSW 180
           VSSFKA+SIIPFLQGSKW+        V DDVDKGGTVC D D+  S          S W
Sbjct: 121 VSSFKASSIIPFLQGSKWIK----NPPVIDDVDKGGTVCDDDDDKESRN------GGSGW 180

Query: 181 VSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFR 240
           V++LLS  SEDAKA FTA+TVS+LF+S LAEPKSIPS+SMYPTL+ GDRV+AEKVSY FR
Sbjct: 181 VNKLLSVCSEDAKAAFTAVTVSILFRSALAEPKSIPSTSMYPTLDKGDRVMAEKVSYFFR 240

Query: 241 KPEVSDIVIFKAPKIL---QEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDF 300
           KPEVSDIVIFKAP IL    E G SS++VFIKR+VA+ GD VEVR+GKL VN +VQ+EDF
Sbjct: 241 KPEVSDIVIFKAPPILLEYPEYGYSSNDVFIKRIVASEGDWVEVRDGKLFVNDIVQEEDF 300

Query: 301 VLEPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSM 359
           VLEP++YEM+PMFVP+GYV+V+GDNRN S DSH+WGPLPIENIVGRS+F+YWPP K S  
Sbjct: 301 VLEPMSYEMEPMFVPKGYVFVLGDNRNKSFDSHNWGPLPIENIVGRSVFRYWPPSKVSDT 326
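
The percentages in an HSP header are the identical or positive-scoring column counts divided by the alignment length, gaps included. As a small sketch of that arithmetic, using the counts reported for the O04348 hit above (the `percent` helper is hypothetical):

```python
def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

identical, positives, aligned = 212, 257, 361  # counts from the HSP header
print(percent(identical, aligned), percent(positives, aligned))
```

This reproduces the reported 58.73% identity and 71.19% positives.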

BLAST of Tan0005274 vs. ExPASy Swiss-Prot
Match: Q9M9Z2 (Probable thylakoidal processing peptidase 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TPP2 PE=2 SV=1)

HSP 1 Score: 353.6 bits (906), Expect = 2.7e-96
Identity = 206/365 (56.44%), Positives = 259/365 (70.96%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLR--AGNCRVFQECWVRSCIFGSSHHPDF--KSAGSAR 60
           MAIRVT +YS YVA+++ASS G R   G+ R   E WVR    G +  PD   KS GS  
Sbjct: 1   MAIRVTFTYSSYVARSIASSAGTRVGTGDVRSCFETWVRPRFCGHNQIPDIVDKSPGSNT 60

Query: 61  NYRSDSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSIST 120
              S   R RP+      +SMYST A E++ E  KSP+VLG++S++    +   S    T
Sbjct: 61  WGPSSGPRARPA------SSMYSTIAREILEEGCKSPLVLGMISLMNLTGAPQFSG--MT 120

Query: 121 GIFGVSSFKATSIIPFLQGSKWLPGYDIRSHVSDD---VDKGGTVCCDYDESGSNQFYEN 180
           G+ G+S FK +S+IPFL+GSKW+P   I + +S D   VD+GG VC    +   +    N
Sbjct: 121 GL-GISPFKTSSVIPFLRGSKWMP-CSIPATLSTDIAEVDRGGKVCDPKVKLELSDKVSN 180

Query: 181 DFEKSSWVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAE 240
               + WV++LL+  SEDAKA FTA+TVS+LF+S LAEPKSIPS+SM PTL+VGDRV+AE
Sbjct: 181 G--GNGWVNKLLNICSEDAKAAFTAVTVSLLFRSALAEPKSIPSTSMLPTLDVGDRVIAE 240

Query: 241 KVSYIFRKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQ 300
           KVSY FRKPEVSDIVIFKAP IL E G S ++VFIKR+VA+ GD VEV +GKL+VN  VQ
Sbjct: 241 KVSYFFRKPEVSDIVIFKAPPILVEHGYSCADVFIKRIVASEGDWVEVCDGKLLVNDTVQ 300

Query: 301 DEDFVLEPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPK 359
            EDFVLEPI YEM+PMFVPEGYV+V+GDNRN S DSH+WGPLPI+NI+GRS+F+YWPP K
Sbjct: 301 AEDFVLEPIDYEMEPMFVPEGYVFVLGDNRNKSFDSHNWGPLPIKNIIGRSVFRYWPPSK 353

BLAST of Tan0005274 vs. ExPASy Swiss-Prot
Match: Q8H0W1 (Chloroplast processing peptidase OS=Arabidopsis thaliana OX=3702 GN=PLSP1 PE=2 SV=2)

HSP 1 Score: 231.9 bits (590), Expect = 1.2e-59
Identity = 112/185 (60.54%), Positives = 144/185 (77.84%), Query Frame = 0

Query: 176 EKSSWVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKV 235
           EK+      L   S+DA+ +F A+ VS+ F+ F+AEP+ IPS SMYPT +VGDR++AEKV
Sbjct: 99  EKNRLFPEWLDFTSDDAQTVFVAIAVSLAFRYFIAEPRYIPSLSMYPTFDVGDRLVAEKV 158

Query: 236 SYIFRKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDE 295
           SY FRKP  +DIVIFK+P +LQEVG + ++VFIKR+VA  GD+VEV NGKL+VNGV ++E
Sbjct: 159 SYYFRKPCANDIVIFKSPPVLQEVGYTDADVFIKRIVAKEGDLVEVHNGKLMVNGVARNE 218

Query: 296 DFVLEPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGS 355
            F+LEP  YEM P+ VPE  V+VMGDNRNNS DSH WGPLP++NI+GRS+F+YWPP + S
Sbjct: 219 KFILEPPGYEMTPIRVPENSVFVMGDNRNNSYDSHVWGPLPLKNIIGRSVFRYWPPNRVS 278

Query: 356 SMVDE 361
             V E
Sbjct: 279 GTVLE 283

BLAST of Tan0005274 vs. ExPASy Swiss-Prot
Match: P72660 (Probable signal peptidase I-1 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=lepB1 PE=3 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 1.8e-39
Identity = 80/161 (49.69%), Positives = 111/161 (68.94%), Query Frame = 0

Query: 190 EDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFRKPEVSDIVI 249
           E+   L  AL +++L + F+AEP+ IPS SM PTLE GDR++ EKVSY F  P+V DI++
Sbjct: 15  ENIPLLMVALVLALLLRFFVAEPRYIPSDSMLPTLEQGDRLVVEKVSYHFHPPQVGDIIV 74

Query: 250 FKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLEPIAYEMDPM 309
           F  P++LQ  G    + FIKRV+A  G  VEV NG +  +G    E+++LEP  Y +  +
Sbjct: 75  FHPPELLQVQGYDLGQAFIKRVIALPGQTVEVNNGIVYRDGQPLQEEYILEPPQYNLPAV 134

Query: 310 FVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWP 351
            VP+G V+VMGDNRNNS DSH WG LP +NI+G +LF+++P
Sbjct: 135 RVPDGQVFVMGDNRNNSNDSHVWGFLPQQNIIGHALFRFFP 175

BLAST of Tan0005274 vs. ExPASy Swiss-Prot
Match: P73157 (Probable signal peptidase I-2 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=lepB2 PE=3 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 6.0e-35
Identity = 72/183 (39.34%), Positives = 114/183 (62.30%), Query Frame = 0

Query: 186 STYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFRKPEVS 245
           +T+ E  K + TA+ +++  ++F+AE + IPSSSM PTL++ DR++ EK+SY  R PE  
Sbjct: 19  NTWLELGKTMVTAVILAIGIRTFVAEARYIPSSSMEPTLQINDRLIIEKISYRLRDPERG 78

Query: 246 DIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLEPIAYE 305
           +IV+F     L+    +  + FIKR++   GD V V  G + VNG + DE+++  P AYE
Sbjct: 79  EIVVFNPTDALK--AKNFHDAFIKRIIGLPGDEVRVSQGNVYVNGKMLDENYIAAPPAYE 138

Query: 306 MDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVDEPHARK 365
             P+ VP+    V+GDNRNNS DSH WG +P E ++GR+  ++WP P+   + D+     
Sbjct: 139 YGPVKVPDDQYLVLGDNRNNSYDSHYWGFVPREKLLGRAFVRFWPVPRVGLLTDDAEREA 198

Query: 366 INL 369
           + +
Sbjct: 199 VEI 199

BLAST of Tan0005274 vs. NCBI nr
Match: XP_022951822.1 (thylakoidal processing peptidase 1, chloroplastic-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 667.9 bits (1722), Expect = 5.0e-188
Identity = 334/371 (90.03%), Positives = 351/371 (94.61%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSH+PD KSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHNPDLKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+PSCS     SMYS   GEMVGENPKSPM+LGL+SMLKSMAS+S SS I TGI G
Sbjct: 61  DSRRFKPSCS-----SMYSALPGEMVGENPKSPMILGLISMLKSMASASESSPIPTGILG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSW 180
           VSSFKATSIIPFL+GS WLPGYDIRSHVSDDVDKGGTVCCDYDESGS++FYE+DFEKSSW
Sbjct: 121 VSSFKATSIIPFLRGSNWLPGYDIRSHVSDDVDKGGTVCCDYDESGSDEFYESDFEKSSW 180

Query: 181 VSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFR 240
           VSRLL+TYS+DAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLE GDRVLAEKVSY FR
Sbjct: 181 VSRLLNTYSDDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEAGDRVLAEKVSYFFR 240

Query: 241 KPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLE 300
           KPEVSDIVIFK PKILQE GVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVV+DEDF+LE
Sbjct: 241 KPEVSDIVIFKPPKILQEFGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVEDEDFILE 300

Query: 301 PIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVDE 360
           PIAYEMDPM VPEGYVYVMGDNRNNSCDSH+WGPLP+ENIVGRSLFKYWPPPKGSSMVDE
Sbjct: 301 PIAYEMDPMLVPEGYVYVMGDNRNNSCDSHNWGPLPVENIVGRSLFKYWPPPKGSSMVDE 360

Query: 361 PHARKINLGIS 372
           PHA+ INLG+S
Sbjct: 361 PHAKNINLGMS 366

BLAST of Tan0005274 vs. NCBI nr
Match: XP_023537459.1 (thylakoidal processing peptidase 1, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 667.2 bits (1720), Expect = 8.5e-188
Identity = 334/371 (90.03%), Positives = 350/371 (94.34%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSH PD KSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHKPDLKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+PSCS     SMYS   GEMVGENPKSPM+LGL+SMLKSMAS+S SS ISTGI G
Sbjct: 61  DSRRFKPSCS-----SMYSALPGEMVGENPKSPMILGLISMLKSMASASESSPISTGILG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSW 180
           VSSF ATSIIPFL+GS WLPGYDIRSHVSDDVDKGGTVCCDYDESGS++FYE+DFEKSSW
Sbjct: 121 VSSFNATSIIPFLRGSNWLPGYDIRSHVSDDVDKGGTVCCDYDESGSDEFYESDFEKSSW 180

Query: 181 VSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFR 240
           VSRLL+TYS+DAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLE GDRVLAEKVSY FR
Sbjct: 181 VSRLLNTYSDDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEAGDRVLAEKVSYFFR 240

Query: 241 KPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLE 300
           KPEVSDIVIFK PKILQE GVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVV+DEDF+LE
Sbjct: 241 KPEVSDIVIFKPPKILQEFGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVEDEDFILE 300

Query: 301 PIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVDE 360
           PIAYEMDPM VPEGYVYVMGDNRNNSCDSH+WGPLP+ENIVGRSLFKYWPPPKGSSMVDE
Sbjct: 301 PIAYEMDPMLVPEGYVYVMGDNRNNSCDSHNWGPLPVENIVGRSLFKYWPPPKGSSMVDE 360

Query: 361 PHARKINLGIS 372
           PHA+ INLG+S
Sbjct: 361 PHAKNINLGMS 366

BLAST of Tan0005274 vs. NCBI nr
Match: KAG7020547.1 (Thylakoidal processing peptidase 1, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 666.8 bits (1719), Expect = 1.1e-187
Identity = 333/371 (89.76%), Positives = 351/371 (94.61%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSH+PD KSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHNPDLKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+PSCS     SMYS   GEMVGENPKSPM+LGL+SMLKSMAS+S SS I TGI G
Sbjct: 61  DSRRFKPSCS-----SMYSALPGEMVGENPKSPMILGLISMLKSMASASESSPIPTGILG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSW 180
           VSSFKATSIIPFL+GS WLPGYDIRSHVSDDVDKGGTVCCDYDESGS++FYE+DFEKSSW
Sbjct: 121 VSSFKATSIIPFLRGSNWLPGYDIRSHVSDDVDKGGTVCCDYDESGSDEFYESDFEKSSW 180

Query: 181 VSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFR 240
           VSRLL+TY++DAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLE GDRVLAEKVSY FR
Sbjct: 181 VSRLLNTYADDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEAGDRVLAEKVSYFFR 240

Query: 241 KPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLE 300
           KPEVSDIVIFK PKILQE GVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVV+DEDF+LE
Sbjct: 241 KPEVSDIVIFKPPKILQEFGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVEDEDFILE 300

Query: 301 PIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVDE 360
           PIAYEMDPM VPEGYVYVMGDNRNNSCDSH+WGPLP+ENIVGRSLFKYWPPPKGSSMVDE
Sbjct: 301 PIAYEMDPMLVPEGYVYVMGDNRNNSCDSHNWGPLPVENIVGRSLFKYWPPPKGSSMVDE 360

Query: 361 PHARKINLGIS 372
           PHA+ INLG+S
Sbjct: 361 PHAKNINLGMS 366

BLAST of Tan0005274 vs. NCBI nr
Match: XP_022144217.1 (thylakoidal processing peptidase 1, chloroplastic-like [Momordica charantia])

HSP 1 Score: 656.0 bits (1691), Expect = 2.0e-184
Identity = 335/373 (89.81%), Positives = 352/373 (94.37%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQE WVRSCIFGS+H+P+ KSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQEFWVRSCIFGSTHNPELKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+PS S++KPASMYST AGEMVGENPKSP+VLGLMSMLKS+AS+SGSSSI+TGIFG
Sbjct: 61  DSRRFKPSGSMKKPASMYSTLAGEMVGENPKSPVVLGLMSMLKSVASASGSSSITTGIFG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDK-GGTVCCDYDESGSNQFYENDFEKSS 180
            SSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDK GGT+  DYD  GSNQ YENDFEKSS
Sbjct: 121 ASSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGGTIVSDYDGIGSNQIYENDFEKSS 180

Query: 181 WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIF 240
           WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSY F
Sbjct: 181 WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYFF 240

Query: 241 RKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVL 300
           RKPEVSDIV+FKAP+ILQE+GVSSSEVFIKRVVATSGDVVEV  GKLVVNGVVQDEDF+L
Sbjct: 241 RKPEVSDIVVFKAPQILQEIGVSSSEVFIKRVVATSGDVVEVLKGKLVVNGVVQDEDFIL 300

Query: 301 EPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYW-PPPKGSSMV 360
           EPIAYEMDP+ VPEGYVYVMGDNRNNSCDSH+WGPL IENIVGRSLFKYW PPPKGSSM 
Sbjct: 301 EPIAYEMDPLVVPEGYVYVMGDNRNNSCDSHNWGPLAIENIVGRSLFKYWPPPPKGSSMA 360

Query: 361 DEPHARKINLGIS 372
           D PHA KI LGIS
Sbjct: 361 DSPHASKIKLGIS 373

BLAST of Tan0005274 vs. NCBI nr
Match: XP_038884798.1 (thylakoidal processing peptidase 1, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 637.9 bits (1644), Expect = 5.5e-179
Identity = 327/372 (87.90%), Positives = 346/372 (93.01%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYV QNLASSTGLRAGNCRVFQE WVRSC+FGS+H+P+FKSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVVQNLASSTGLRAGNCRVFQEFWVRSCVFGSTHNPEFKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+P  SVEKP SMYST AGE VGE+PK+PMVLGLMSMLKSMA    SS+I+TG FG
Sbjct: 61  DSRRFKPGGSVEKPTSMYSTLAGERVGESPKNPMVLGLMSMLKSMAD---SSAINTGTFG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGG-TVCCDYDESGSNQFYENDFEKSS 180
           VSSFKATSII FLQGSKWLPGYDIRS+VS++VDKGG TVC DYDESGSN+FYENDFEKSS
Sbjct: 121 VSSFKATSIISFLQGSKWLPGYDIRSNVSNNVDKGGTTVCYDYDESGSNRFYENDFEKSS 180

Query: 181 WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIF 240
           WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSM PTLEVGDR+LAEKVSY F
Sbjct: 181 WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMCPTLEVGDRILAEKVSYFF 240

Query: 241 RKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVL 300
           RKPEVSDIVIFKAP+ILQE GVSS+E+FIKRVVATSGDVV V  GKLVVNGVVQDEDFVL
Sbjct: 241 RKPEVSDIVIFKAPQILQEFGVSSNEMFIKRVVATSGDVVAVSKGKLVVNGVVQDEDFVL 300

Query: 301 EPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVD 360
           EPIAYEMDP+ VPEGYVYVMGDNRNNSCDSH+WGPL IENIVGRSLFKYWPP KGSSMVD
Sbjct: 301 EPIAYEMDPLLVPEGYVYVMGDNRNNSCDSHNWGPLAIENIVGRSLFKYWPPSKGSSMVD 360

Query: 361 EPHARKINLGIS 372
           EP AR INLGIS
Sbjct: 361 EPRARNINLGIS 369

BLAST of Tan0005274 vs. ExPASy TrEMBL
Match: A0A6J1GIQ5 (thylakoidal processing peptidase 1, chloroplastic-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454555 PE=4 SV=1)

HSP 1 Score: 667.9 bits (1722), Expect = 2.4e-188
Identity = 334/371 (90.03%), Positives = 351/371 (94.61%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSH+PD KSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHNPDLKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+PSCS     SMYS   GEMVGENPKSPM+LGL+SMLKSMAS+S SS I TGI G
Sbjct: 61  DSRRFKPSCS-----SMYSALPGEMVGENPKSPMILGLISMLKSMASASESSPIPTGILG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSW 180
           VSSFKATSIIPFL+GS WLPGYDIRSHVSDDVDKGGTVCCDYDESGS++FYE+DFEKSSW
Sbjct: 121 VSSFKATSIIPFLRGSNWLPGYDIRSHVSDDVDKGGTVCCDYDESGSDEFYESDFEKSSW 180

Query: 181 VSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFR 240
           VSRLL+TYS+DAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLE GDRVLAEKVSY FR
Sbjct: 181 VSRLLNTYSDDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEAGDRVLAEKVSYFFR 240

Query: 241 KPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLE 300
           KPEVSDIVIFK PKILQE GVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVV+DEDF+LE
Sbjct: 241 KPEVSDIVIFKPPKILQEFGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVEDEDFILE 300

Query: 301 PIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVDE 360
           PIAYEMDPM VPEGYVYVMGDNRNNSCDSH+WGPLP+ENIVGRSLFKYWPPPKGSSMVDE
Sbjct: 301 PIAYEMDPMLVPEGYVYVMGDNRNNSCDSHNWGPLPVENIVGRSLFKYWPPPKGSSMVDE 360

Query: 361 PHARKINLGIS 372
           PHA+ INLG+S
Sbjct: 361 PHAKNINLGMS 366

BLAST of Tan0005274 vs. ExPASy TrEMBL
Match: A0A6J1CT20 (thylakoidal processing peptidase 1, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111013959 PE=4 SV=1)

HSP 1 Score: 656.0 bits (1691), Expect = 9.5e-185
Identity = 335/373 (89.81%), Positives = 352/373 (94.37%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQE WVRSCIFGS+H+P+ KSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQEFWVRSCIFGSTHNPELKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+PS S++KPASMYST AGEMVGENPKSP+VLGLMSMLKS+AS+SGSSSI+TGIFG
Sbjct: 61  DSRRFKPSGSMKKPASMYSTLAGEMVGENPKSPVVLGLMSMLKSVASASGSSSITTGIFG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDK-GGTVCCDYDESGSNQFYENDFEKSS 180
            SSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDK GGT+  DYD  GSNQ YENDFEKSS
Sbjct: 121 ASSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGGTIVSDYDGIGSNQIYENDFEKSS 180

Query: 181 WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIF 240
           WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSY F
Sbjct: 181 WVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYFF 240

Query: 241 RKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVL 300
           RKPEVSDIV+FKAP+ILQE+GVSSSEVFIKRVVATSGDVVEV  GKLVVNGVVQDEDF+L
Sbjct: 241 RKPEVSDIVVFKAPQILQEIGVSSSEVFIKRVVATSGDVVEVLKGKLVVNGVVQDEDFIL 300

Query: 301 EPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYW-PPPKGSSMV 360
           EPIAYEMDP+ VPEGYVYVMGDNRNNSCDSH+WGPL IENIVGRSLFKYW PPPKGSSM 
Sbjct: 301 EPIAYEMDPLVVPEGYVYVMGDNRNNSCDSHNWGPLAIENIVGRSLFKYWPPPPKGSSMA 360

Query: 361 DEPHARKINLGIS 372
           D PHA KI LGIS
Sbjct: 361 DSPHASKIKLGIS 373

BLAST of Tan0005274 vs. ExPASy TrEMBL
Match: A0A6J1GIT4 (thylakoidal processing peptidase 1, chloroplastic-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111454555 PE=4 SV=1)

HSP 1 Score: 622.1 bits (1603), Expect = 1.5e-174
Identity = 318/371 (85.71%), Positives = 335/371 (90.30%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSH+PD KSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHNPDLKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+PSCS     SMYS   GEMVGENPKSPM+LGL+SMLKSMAS+S SS I TGI G
Sbjct: 61  DSRRFKPSCS-----SMYSALPGEMVGENPKSPMILGLISMLKSMASASESSPIPTGILG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSW 180
           VSSFKATSIIPFL+GS WLPGYDIRSHV                SGS++FYE+DFEKSSW
Sbjct: 121 VSSFKATSIIPFLRGSNWLPGYDIRSHV----------------SGSDEFYESDFEKSSW 180

Query: 181 VSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFR 240
           VSRLL+TYS+DAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLE GDRVLAEKVSY FR
Sbjct: 181 VSRLLNTYSDDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEAGDRVLAEKVSYFFR 240

Query: 241 KPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLE 300
           KPEVSDIVIFK PKILQE GVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVV+DEDF+LE
Sbjct: 241 KPEVSDIVIFKPPKILQEFGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVEDEDFILE 300

Query: 301 PIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVDE 360
           PIAYEMDPM VPEGYVYVMGDNRNNSCDSH+WGPLP+ENIVGRSLFKYWPPPKGSSMVDE
Sbjct: 301 PIAYEMDPMLVPEGYVYVMGDNRNNSCDSHNWGPLPVENIVGRSLFKYWPPPKGSSMVDE 350

Query: 361 PHARKINLGIS 372
           PHA+ INLG+S
Sbjct: 361 PHAKNINLGMS 350

BLAST of Tan0005274 vs. ExPASy TrEMBL
Match: A0A6J1KS79 (thylakoidal processing peptidase 1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111496017 PE=4 SV=1)

HSP 1 Score: 621.3 bits (1601), Expect = 2.6e-174
Identity = 317/371 (85.44%), Positives = 335/371 (90.30%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSH+PD KSAGSARNYRS
Sbjct: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHNPDLKSAGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+PSCS     SMYS   GEMVGENPKSPM+LGL+SMLKSMAS+SGSS  STGI G
Sbjct: 61  DSRRFKPSCS-----SMYSALPGEMVGENPKSPMILGLISMLKSMASASGSSPTSTGILG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSW 180
           VSSFKATSIIPFL+GS WLPGYDIRSHV                SGS++FYE+DFEKSSW
Sbjct: 121 VSSFKATSIIPFLRGSNWLPGYDIRSHV----------------SGSDEFYESDFEKSSW 180

Query: 181 VSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFR 240
           VSRLL+TYS+DAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLE GDRVLAEKVSY FR
Sbjct: 181 VSRLLNTYSDDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEAGDRVLAEKVSYFFR 240

Query: 241 KPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLE 300
           KPEVSDIVIFK PKILQE GVSSSEVFIKRVVATSGDVVEVRNGKLVVN VV+DEDF+LE
Sbjct: 241 KPEVSDIVIFKPPKILQEFGVSSSEVFIKRVVATSGDVVEVRNGKLVVNSVVEDEDFILE 300

Query: 301 PIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMVDE 360
           PIAYEMDPM VPEGYVYVMGDNRNNSCDSH+WGPLP+ENIVGRSLFKYWPPPKGSS+VDE
Sbjct: 301 PIAYEMDPMLVPEGYVYVMGDNRNNSCDSHNWGPLPVENIVGRSLFKYWPPPKGSSVVDE 350

Query: 361 PHARKINLGIS 372
           PHA+ INLG+S
Sbjct: 361 PHAKNINLGMS 350

BLAST of Tan0005274 vs. ExPASy TrEMBL
Match: A0A0A0LNI7 (Peptidase_S26 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G360710 PE=4 SV=1)

HSP 1 Score: 607.4 bits (1565), Expect = 3.9e-170
Identity = 317/373 (84.99%), Positives = 340/373 (91.15%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIRVTLSYSG+V QNLASSTGLRAGNCRVFQE WVRSCIFGS+H+P+ KS+GSARNYRS
Sbjct: 1   MAIRVTLSYSGHVVQNLASSTGLRAGNCRVFQEFWVRSCIFGSTHNPELKSSGSARNYRS 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
           DSRRF+P  SVEK  +MYST  GE VGE+PK+PM+LGLMSMLKSM     SS ISTGI G
Sbjct: 61  DSRRFKPGGSVEKATAMYSTLTGERVGESPKNPMILGLMSMLKSMGD---SSVISTGISG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGG-TVCCD-YDESGSNQFYENDFEKS 180
           VSSFKATSIIPFLQGSKWLPGYD+RS VSDDVDKGG TVC D YD+SG++QFYENDFEK 
Sbjct: 121 VSSFKATSIIPFLQGSKWLPGYDVRS-VSDDVDKGGTTVCYDYYDKSGNDQFYENDFEK- 180

Query: 181 SWVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYI 240
           SWVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSM PTLEVGDR+LAEKVSYI
Sbjct: 181 SWVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMCPTLEVGDRILAEKVSYI 240

Query: 241 FRKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFV 300
           FRKPEVSDIVIFKAP+ILQ+ GVSS EVFIKRVVATSGDVVEV+ GKLVVNGV QDEDFV
Sbjct: 241 FRKPEVSDIVIFKAPQILQDFGVSSDEVFIKRVVATSGDVVEVQKGKLVVNGVAQDEDFV 300

Query: 301 LEPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSMV 360
           LEPIAY+M+P+ VPEGYVYVMGDNRNNSCDSH+WGPLPIENIVGRSLFKYWPP KGS+MV
Sbjct: 301 LEPIAYDMEPLLVPEGYVYVMGDNRNNSCDSHNWGPLPIENIVGRSLFKYWPPSKGSAMV 360

Query: 361 DEPHARKINLGIS 372
           DE    KINLGIS
Sbjct: 361 DELRVGKINLGIS 368

BLAST of Tan0005274 vs. TAIR 10
Match: AT2G30440.1 (thylakoid processing peptide)

HSP 1 Score: 370.2 bits (949), Expect = 2.0e-102
Identity = 212/361 (58.73%), Positives = 257/361 (71.19%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLRAGNCRVFQECWVRSCIFGSSHHPDFKSAGSARNYRS 60
           MAIR+T +YS +VA+NL    G R G      E  VR   F  SH  DF    S RN   
Sbjct: 1   MAIRITFTYSTHVARNL---VGTRVGPGGYCFESLVRPRFF--SHKRDFDR--SPRN--- 60

Query: 61  DSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSISTGIFG 120
                       +PASMY + A E++GE  +SP+V+GL+S+LK   S++G  S +  + G
Sbjct: 61  ------------RPASMYGSIARELIGEGSQSPLVMGLISILK---STTGHESSTMNVLG 120

Query: 121 VSSFKATSIIPFLQGSKWLPGYDIRSHVSDDVDKGGTVCCDYDESGSNQFYENDFEKSSW 180
           VSSFKA+SIIPFLQGSKW+        V DDVDKGGTVC D D+  S          S W
Sbjct: 121 VSSFKASSIIPFLQGSKWIK----NPPVIDDVDKGGTVCDDDDDKESRN------GGSGW 180

Query: 181 VSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKVSYIFR 240
           V++LLS  SEDAKA FTA+TVS+LF+S LAEPKSIPS+SMYPTL+ GDRV+AEKVSY FR
Sbjct: 181 VNKLLSVCSEDAKAAFTAVTVSILFRSALAEPKSIPSTSMYPTLDKGDRVMAEKVSYFFR 240

Query: 241 KPEVSDIVIFKAPKIL---QEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDF 300
           KPEVSDIVIFKAP IL    E G SS++VFIKR+VA+ GD VEVR+GKL VN +VQ+EDF
Sbjct: 241 KPEVSDIVIFKAPPILLEYPEYGYSSNDVFIKRIVASEGDWVEVRDGKLFVNDIVQEEDF 300

Query: 301 VLEPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGSSM 359
           VLEP++YEM+PMFVP+GYV+V+GDNRN S DSH+WGPLPIENIVGRS+F+YWPP K S  
Sbjct: 301 VLEPMSYEMEPMFVPKGYVFVLGDNRNKSFDSHNWGPLPIENIVGRSVFRYWPPSKVSDT 326

BLAST of Tan0005274 vs. TAIR 10
Match: AT1G06870.1 (Peptidase S24/S26A/S26B/S26C family protein)

HSP 1 Score: 353.6 bits (906), Expect = 1.9e-97
Identity = 206/365 (56.44%), Positives = 259/365 (70.96%), Query Frame = 0

Query: 1   MAIRVTLSYSGYVAQNLASSTGLR--AGNCRVFQECWVRSCIFGSSHHPDF--KSAGSAR 60
           MAIRVT +YS YVA+++ASS G R   G+ R   E WVR    G +  PD   KS GS  
Sbjct: 1   MAIRVTFTYSSYVARSIASSAGTRVGTGDVRSCFETWVRPRFCGHNQIPDIVDKSPGSNT 60

Query: 61  NYRSDSRRFRPSCSVEKPASMYSTFAGEMVGENPKSPMVLGLMSMLKSMASSSGSSSIST 120
              S   R RP+      +SMYST A E++ E  KSP+VLG++S++    +   S    T
Sbjct: 61  WGPSSGPRARPA------SSMYSTIAREILEEGCKSPLVLGMISLMNLTGAPQFSG--MT 120

Query: 121 GIFGVSSFKATSIIPFLQGSKWLPGYDIRSHVSDD---VDKGGTVCCDYDESGSNQFYEN 180
           G+ G+S FK +S+IPFL+GSKW+P   I + +S D   VD+GG VC    +   +    N
Sbjct: 121 GL-GISPFKTSSVIPFLRGSKWMP-CSIPATLSTDIAEVDRGGKVCDPKVKLELSDKVSN 180

Query: 181 DFEKSSWVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAE 240
               + WV++LL+  SEDAKA FTA+TVS+LF+S LAEPKSIPS+SM PTL+VGDRV+AE
Sbjct: 181 G--GNGWVNKLLNICSEDAKAAFTAVTVSLLFRSALAEPKSIPSTSMLPTLDVGDRVIAE 240

Query: 241 KVSYIFRKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQ 300
           KVSY FRKPEVSDIVIFKAP IL E G S ++VFIKR+VA+ GD VEV +GKL+VN  VQ
Sbjct: 241 KVSYFFRKPEVSDIVIFKAPPILVEHGYSCADVFIKRIVASEGDWVEVCDGKLLVNDTVQ 300

Query: 301 DEDFVLEPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPK 359
            EDFVLEPI YEM+PMFVPEGYV+V+GDNRN S DSH+WGPLPI+NI+GRS+F+YWPP K
Sbjct: 301 AEDFVLEPIDYEMEPMFVPEGYVFVLGDNRNKSFDSHNWGPLPIKNIIGRSVFRYWPPSK 353

BLAST of Tan0005274 vs. TAIR 10
Match: AT3G24590.1 (plastidic type i signal peptidase 1 )

HSP 1 Score: 231.9 bits (590), Expect = 8.5e-61
Identity = 112/185 (60.54%), Positives = 144/185 (77.84%), Query Frame = 0

Query: 176 EKSSWVSRLLSTYSEDAKALFTALTVSVLFKSFLAEPKSIPSSSMYPTLEVGDRVLAEKV 235
           EK+      L   S+DA+ +F A+ VS+ F+ F+AEP+ IPS SMYPT +VGDR++AEKV
Sbjct: 99  EKNRLFPEWLDFTSDDAQTVFVAIAVSLAFRYFIAEPRYIPSLSMYPTFDVGDRLVAEKV 158

Query: 236 SYIFRKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDE 295
           SY FRKP  +DIVIFK+P +LQEVG + ++VFIKR+VA  GD+VEV NGKL+VNGV ++E
Sbjct: 159 SYYFRKPCANDIVIFKSPPVLQEVGYTDADVFIKRIVAKEGDLVEVHNGKLMVNGVARNE 218

Query: 296 DFVLEPIAYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWPPPKGS 355
            F+LEP  YEM P+ VPE  V+VMGDNRNNS DSH WGPLP++NI+GRS+F+YWPP + S
Sbjct: 219 KFILEPPGYEMTPIRVPENSVFVMGDNRNNSYDSHVWGPLPLKNIIGRSVFRYWPPNRVS 278

Query: 356 SMVDE 361
             V E
Sbjct: 279 GTVLE 283

BLAST of Tan0005274 vs. TAIR 10
Match: AT1G23465.1 (Peptidase S24/S26A/S26B/S26C family protein )

HSP 1 Score: 70.1 bits (170), Expect = 4.3e-12
Identity = 45/131 (34.35%), Positives = 73/131 (55.73%), Query Frame = 0

Query: 219 SMYPTLE-VGDRVLAEKVSYIFRKPEVSDIVIFKAPKILQEVGVSSSEVFIKRVVATSGD 278
           SM PTL   G+ +LAE++S  ++KP   DIV+ ++P+       + ++  IKRVV   GD
Sbjct: 47  SMIPTLHPSGNMLLAERISKRYQKPSRGDIVVIRSPE-------NPNKTPIKRVVGVEGD 106

Query: 279 VVEVRNGKLVVNGVVQDEDFVLEPI-AYEMDPMFVPEGYVYVMGDNRNNSCDSHDWGPLP 338
            +                 FV++P+ + E   + VP+G+V+V GD  +NS DS ++GP+P
Sbjct: 107 CI----------------SFVIDPVKSDESQTIVVPKGHVFVQGDYTHNSRDSRNFGPVP 154

Query: 339 IENIVGRSLFK 348
              I GR L++
Sbjct: 167 YGLIQGRVLWR 154

BLAST of Tan0005274 vs. TAIR 10
Match: AT1G29960.1 (Peptidase S24/S26A/S26B/S26C family protein )

HSP 1 Score: 66.2 bits (160), Expect = 6.2e-11
Identity = 49/157 (31.21%), Positives = 80/157 (50.96%), Query Frame = 0

Query: 195 LFTALTVSVLFKSFLAEPKSIPSSSMYPTLE-VGDRVLAEKVSYIFRKPEVSDIVIFKAP 254
           L+  L V+  +  F+A        SM PTL   G+ +LAE++S  ++KP   DIV+ ++P
Sbjct: 26  LYCFLHVTTNYLGFMAYAY---GPSMTPTLHPSGNVLLAERISKRYQKPSRGDIVVIRSP 85

Query: 255 KILQEVGVSSSEVFIKRVVATSGDVVEVRNGKLVVNGVVQDEDFVLEPIAYEMDPMFVPE 314
           +       + ++  IKRV+   GD +       V++    DE             + VP+
Sbjct: 86  E-------NPNKTPIKRVIGIEGDCI-----SFVIDSRKSDES----------QTIVVPK 145

Query: 315 GYVYVMGDNRNNSCDSHDWGPLPIENIVGRSLFKYWP 351
           G+V+V GD  +NS DS ++G +P   I GR L++ WP
Sbjct: 146 GHVFVQGDYTHNSRDSRNFGTVPYGLIQGRVLWRVWP 157
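
The percentages in each HSP summary line follow directly from the counts (for example, 206 identical positions out of 365 aligned columns gives 56.44%). The following is a minimal Python sketch, not part of the original record, that parses such a summary line and recomputes both percentages. The function name `parse_hsp_summary` is hypothetical, and the pattern assumes the exact line layout shown on this page.

```python
import re

# Pattern for an HSP summary line as laid out on this page (an assumption).
# "Posi?tives" accepts both the "Postives" and "Positives" spellings.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return recomputed and reported identity/positives percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": round(int(ident) / int(ident_len) * 100, 2),
        "positives": round(int(pos) / int(pos_len) * 100, 2),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


# Summary line of the AT1G06870.1 hit from this record.
summary = parse_hsp_summary(
    "Identity = 206/365 (56.44%), Positives = 259/365 (70.96%), Query Frame = 0"
)
print(summary)
```

Comparing the recomputed values with the reported ones is a quick consistency check when these lines are copied between tools.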

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
O04348	2.8e-101	58.73	Thylakoidal processing peptidase 1, chloroplastic OS=Arabidopsis thaliana OX=370...
Q9M9Z2	2.7e-96	56.44	Probable thylakoidal processing peptidase 2, chloroplastic OS=Arabidopsis thalia...
Q8H0W1	1.2e-59	60.54	Chloroplast processing peptidase OS=Arabidopsis thaliana OX=3702 GN=PLSP1 PE=2 S...
P72660	1.8e-39	49.69	Probable signal peptidase I-1 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX...
P73157	6.0e-35	39.34	Probable signal peptidase I-2 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX...
Match Name	E-value	Identity	Description
XP_022951822.1	5.0e-188	90.03	thylakoidal processing peptidase 1, chloroplastic-like isoform X1 [Cucurbita mos...
XP_023537459.1	8.5e-188	90.03	thylakoidal processing peptidase 1, chloroplastic-like [Cucurbita pepo subsp. pe...
KAG7020547.1	1.1e-187	89.76	Thylakoidal processing peptidase 1, chloroplastic [Cucurbita argyrosperma subsp....
XP_022144217.1	2.0e-184	89.81	thylakoidal processing peptidase 1, chloroplastic-like [Momordica charantia]
XP_038884798.1	5.5e-179	87.90	thylakoidal processing peptidase 1, chloroplastic-like [Benincasa hispida]
Match Name	E-value	Identity	Description
A0A6J1GIQ5	2.4e-188	90.03	thylakoidal processing peptidase 1, chloroplastic-like isoform X1 OS=Cucurbita m...
A0A6J1CT20	9.5e-185	89.81	thylakoidal processing peptidase 1, chloroplastic-like OS=Momordica charantia OX...
A0A6J1GIT4	1.5e-174	85.71	thylakoidal processing peptidase 1, chloroplastic-like isoform X2 OS=Cucurbita m...
A0A6J1KS79	2.6e-174	85.44	thylakoidal processing peptidase 1, chloroplastic-like OS=Cucurbita maxima OX=36...
A0A0A0LNI7	3.9e-170	84.99	Peptidase_S26 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G3607...
Match Name	E-value	Identity	Description
AT2G30440.1	2.0e-102	58.73	thylakoid processing peptide
AT1G06870.1	1.9e-97	56.44	Peptidase S24/S26A/S26B/S26C family protein
AT3G24590.1	8.5e-61	60.54	plastidic type i signal peptidase 1
AT1G23465.1	4.3e-12	34.35	Peptidase S24/S26A/S26B/S26C family protein
AT1G29960.1	6.2e-11	31.21	Peptidase S24/S26A/S26B/S26C family protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000223	Peptidase S26A, signal peptidase I	PRINTS	PR00727	LEADERPTASE	coord: 311..330, score: 57.74; coord: 208..224, score: 52.1; coord: 267..279, score: 47.07
IPR000223	Peptidase S26A, signal peptidase I	TIGRFAM	TIGR02227	TIGR02227	coord: 193..350, e-value: 2.3E-39, score: 132.8
IPR000223	Peptidase S26A, signal peptidase I	PANTHER	PTHR43390	SIGNAL PEPTIDASE I	coord: 1..367
None	No IPR available	GENE3D	2.10.109.10	Umud Fragment, subunit A	coord: 205..365, e-value: 7.0E-36, score: 125.0
None	No IPR available	PANTHER	PTHR43390:SF2	THYLAKOIDAL PROCESSING PEPTIDASE 2, CHLOROPLASTIC-RELATED	coord: 1..367
IPR019533	Peptidase S26	PFAM	PF10502	Peptidase_S26	coord: 192..349, e-value: 9.3E-41, score: 139.6
IPR019533	Peptidase S26	CDD	cd06530	S26_SPase_I	coord: 211..343, e-value: 1.28079E-25, score: 96.8859
IPR019758	Peptidase S26A, signal peptidase I, conserved site	PROSITE	PS00761	SPASE_I_3	coord: 316..329
IPR019756	Peptidase S26A, signal peptidase I, serine active site	PROSITE	PS00501	SPASE_I_1	coord: 217..224
IPR036286	LexA/Signal peptidase-like superfamily	SUPERFAMILY	51306	LexA/Signal peptidase	coord: 207..355
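
The InterPro hits above are reported as coordinate ranges on the protein. As a rough sketch, and assuming the coordinates are 1-based and inclusive (the page does not state this), the snippet below parses three of the ranges and checks that both PROSITE sites fall inside the Pfam Peptidase_S26 domain. The helper names `parse_coord`, `contains`, and `span_length` are hypothetical.

```python
def parse_coord(text):
    """Parse a string such as 'coord: 192..349' into a (start, end) tuple."""
    span = text.split("coord:")[1].strip()
    start, end = span.split("..")
    return int(start), int(end)


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def span_length(span):
    """Number of residues covered, assuming inclusive coordinates."""
    return span[1] - span[0] + 1


# Ranges copied from the InterPro annotations of this record.
pfam_domain = parse_coord("coord: 192..349")      # PF10502, Peptidase_S26
serine_site = parse_coord("coord: 217..224")      # PS00501, SPASE_I_1
conserved_site = parse_coord("coord: 316..329")   # PS00761, SPASE_I_3

print(span_length(pfam_domain))
print(contains(pfam_domain, serine_site))
print(contains(pfam_domain, conserved_site))
```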

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0005274.1	Tan0005274.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006465	signal peptide processing
biological_process	GO:0010027	thylakoid membrane organization
biological_process	GO:0006508	proteolysis
cellular_component	GO:0009535	chloroplast thylakoid membrane
cellular_component	GO:0005887	integral component of plasma membrane
cellular_component	GO:0016021	integral component of membrane
cellular_component	GO:0016020	membrane
molecular_function	GO:0004252	serine-type endopeptidase activity
molecular_function	GO:0008236	serine-type peptidase activity
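
For downstream use it can help to group the GO assignments by category. The following is a small illustrative sketch, not part of the original record: the rows are copied from the list above, and `group_go_terms` is a hypothetical helper that assumes each row is whitespace-separated as category, accession, and term name.

```python
from collections import defaultdict

# GO assignments for Tan0005274, copied from this record.
GO_ROWS = """\
biological_process GO:0006465 signal peptide processing
biological_process GO:0010027 thylakoid membrane organization
biological_process GO:0006508 proteolysis
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0005887 integral component of plasma membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0004252 serine-type endopeptidase activity
molecular_function GO:0008236 serine-type peptidase activity
"""


def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    groups = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # The term name may contain spaces, so split only twice.
        category, accession, name = line.split(maxsplit=2)
        groups[category].append((accession, name))
    return dict(groups)


go_by_category = group_go_terms(GO_ROWS)
for category, terms in go_by_category.items():
    print(category, len(terms))
```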