Tan0016732 (gene) Snake gourd v1

Overview
NameTan0016732
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionpentatricopeptide repeat-containing protein At2g01390
LocationLG07: 27186817 .. 27188698 (-)
RNA-Seq ExpressionTan0016732
SyntenyTan0016732
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATTGTTTTAATCGTTTCTCCTTACTTTTGAGCAATTATGTGGTTATCTCTGCCATCCGTCATAGAATTTATCAGGATATTTCCATTAAATATTTGCATTCCTTTCACCAACATAAACAAGAGAAACCCATCAAACTATTTAGTAGAAAGTTGAGGAAAGGAACTAAGGTAGTTAAGAAGGAAGAAGTAGTTCCAAAGCTTTACACGAGAGATACAGTGAGGAACGCATACAATCTTCTGAGAAATTGCTCATGGAGTTCTGCACAAGAACACCTAGAGAGGCTCCCTATAAGATGGGATTCTTACCTAATCAACCAGATTCTGAAAACTCATCCACCATTGGAGAAGGCATGGTTGTTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCAGTATACATACACGACGATGTTGGATATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACTTATACTTCATTAATGCACTGGCGTTCAAACTCGGGGGATGTTGATGGGGCTATAAAGGTTTGGAAGGAAATGAAAGCCAATGGCTGCTATCCGACAGTGGTTTCGTATACTGCTTATATAAAGATTTTGTTGGATAATGATCAAGTTAAGGAGGCCACTGATACTTACAAGGAGATGCTTCAAACTGGGCTTTCTCCAAATTGCTGTACTTACACTGTCTTAATGGAATACCTTATTGGAGCAGGTGAGTTTCTATACTCTTGCTTATTATTGACTTTCAAAATTTGCTGTTTATCAGTTTTCTGTTGTCTTTTTTTATTTTGTAAGGTTTACAGTTGTTTCTTTCATTGTCCTTCCAGGTAAATGCAGAGAAGCCCTCGATATTTTTCACAAAATGCAAGATGCTGGAGTATATCCTGATAAAGTGGCTTGCAATATATTGATTCAGAAATGCTCTAAATCAGGGGAGATGGTGGTAATGACACAAATCCTCGAGTACATGAAAGATAAACGCCTTGTGCTTCGATACCCTGTGTTTGTTGAAGCACATGAAACTTTAAAATGTTGTTCCGTAAGTGATACCCTACTCAGGCAAGTTAATCCTCATATAGAAACTGAATCAATCGGTAAGGATGAGGTTATGGTTGTTAGTACAAGTTCTAATATTATTCCTCCCAATGTAGATTATGAGCTTTTGGCAATTCTGTTGAAGGAGGATAAACTTATTGCTGTTGACCACATACTCATTGGGACGATAGATAAGAACGTACAGTTGGATTCTTCGATTGTTTCATCCATAATTGAGGTAAATTGCAAACGTAATCGACCTAACGGTGCTCTGCTGGCTTTTGACTACTGTTTAAAAAACGGTGTTAACATTGAGAGAAATCTGTATCTTCGCTTGATAGGGATTCTGATCCGATCGAGTATATATTCGAAGTTGCTGGAAGTTGTGCAGGAAATGTATAGGCAAGGGCATTGTCTTGGACTCTATCATGCCACAGTTATCCTTTATAGGCTTGGGAAAGCTGGAAAGCCGCAATATGCTAAGAAAGTTTTTCATATGTTGCCTGAAGAATTGAAGTGCACTGCAACTTACACTGCTCTGGTTGGTGCTTATTTCTCTGCTGGAAGTTCTAGTAAAGGGCTTAAAATTTACGAAACAATGCGAAAGAGAGGATTTACACCGTCTTTAGGCACATATAATGTGCTGTTAACTGGTCTTATGAAGAGCGGTAGAGTTGTTGAATTAGATATTTATAGAAGGGAGAAGAAGAGTTTTGAGATCAGTCATCATTCTCATCATAATACAATACTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGGTATCTTGA

mRNA sequence

ATGAATTGTTTTAATCGTTTCTCCTTACTTTTGAGCAATTATGTGGTTATCTCTGCCATCCGTCATAGAATTTATCAGGATATTTCCATTAAATATTTGCATTCCTTTCACCAACATAAACAAGAGAAACCCATCAAACTATTTAGTAGAAAGTTGAGGAAAGGAACTAAGGTAGTTAAGAAGGAAGAAGTAGTTCCAAAGCTTTACACGAGAGATACAGTGAGGAACGCATACAATCTTCTGAGAAATTGCTCATGGAGTTCTGCACAAGAACACCTAGAGAGGCTCCCTATAAGATGGGATTCTTACCTAATCAACCAGATTCTGAAAACTCATCCACCATTGGAGAAGGCATGGTTGTTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCAGTATACATACACGACGATGTTGGATATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACTTATACTTCATTAATGCACTGGCGTTCAAACTCGGGGGATGTTGATGGGGCTATAAAGGTTTGGAAGGAAATGAAAGCCAATGGCTGCTATCCGACAGTGGTTTCGTATACTGCTTATATAAAGATTTTGTTGGATAATGATCAAGTTAAGGAGGCCACTGATACTTACAAGGAGATGCTTCAAACTGGGCTTTCTCCAAATTGCTGTACTTACACTGTCTTAATGGAATACCTTATTGGAGCAGGTAAATGCAGAGAAGCCCTCGATATTTTTCACAAAATGCAAGATGCTGGAGTATATCCTGATAAAGTGGCTTGCAATATATTGATTCAGAAATGCTCTAAATCAGGGGAGATGGTGGTAATGACACAAATCCTCGAGTACATGAAAGATAAACGCCTTGTGCTTCGATACCCTGTGTTTGTTGAAGCACATGAAACTTTAAAATGTTGTTCCGTAAGTGATACCCTACTCAGGCAAGTTAATCCTCATATAGAAACTGAATCAATCGGTAAGGATGAGGTTATGGTTGTTAGTACAAGTTCTAATATTATTCCTCCCAATGTAGATTATGAGCTTTTGGCAATTCTGTTGAAGGAGGATAAACTTATTGCTGTTGACCACATACTCATTGGGACGATAGATAAGAACGTACAGTTGGATTCTTCGATTGTTTCATCCATAATTGAGGTAAATTGCAAACGTAATCGACCTAACGGTGCTCTGCTGGCTTTTGACTACTGTTTAAAAAACGGTGTTAACATTGAGAGAAATCTGTATCTTCGCTTGATAGGGATTCTGATCCGATCGAGTATATATTCGAAGTTGCTGGAAGTTGTGCAGGAAATGTATAGGCAAGGGCATTGTCTTGGACTCTATCATGCCACAGTTATCCTTTATAGGCTTGGGAAAGCTGGAAAGCCGCAATATGCTAAGAAAGTTTTTCATATGTTGCCTGAAGAATTGAAGTGCACTGCAACTTACACTGCTCTGGTTGGTGCTTATTTCTCTGCTGGAAGTTCTAGTAAAGGGCTTAAAATTTACGAAACAATGCGAAAGAGAGGATTTACACCGTCTTTAGGCACATATAATGTGCTGTTAACTGGTCTTATGAAGAGCGGTAGAGTTGTTGAATTAGATATTTATAGAAGGGAGAAGAAGAGTTTTGAGATCAGTCATCATTCTCATCATAATACAATACTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGGTATCTTGA

Coding sequence (CDS)

ATGAATTGTTTTAATCGTTTCTCCTTACTTTTGAGCAATTATGTGGTTATCTCTGCCATCCGTCATAGAATTTATCAGGATATTTCCATTAAATATTTGCATTCCTTTCACCAACATAAACAAGAGAAACCCATCAAACTATTTAGTAGAAAGTTGAGGAAAGGAACTAAGGTAGTTAAGAAGGAAGAAGTAGTTCCAAAGCTTTACACGAGAGATACAGTGAGGAACGCATACAATCTTCTGAGAAATTGCTCATGGAGTTCTGCACAAGAACACCTAGAGAGGCTCCCTATAAGATGGGATTCTTACCTAATCAACCAGATTCTGAAAACTCATCCACCATTGGAGAAGGCATGGTTGTTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAGCATGACCAGTATACATACACGACGATGTTGGATATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACTTATACTTCATTAATGCACTGGCGTTCAAACTCGGGGGATGTTGATGGGGCTATAAAGGTTTGGAAGGAAATGAAAGCCAATGGCTGCTATCCGACAGTGGTTTCGTATACTGCTTATATAAAGATTTTGTTGGATAATGATCAAGTTAAGGAGGCCACTGATACTTACAAGGAGATGCTTCAAACTGGGCTTTCTCCAAATTGCTGTACTTACACTGTCTTAATGGAATACCTTATTGGAGCAGGTAAATGCAGAGAAGCCCTCGATATTTTTCACAAAATGCAAGATGCTGGAGTATATCCTGATAAAGTGGCTTGCAATATATTGATTCAGAAATGCTCTAAATCAGGGGAGATGGTGGTAATGACACAAATCCTCGAGTACATGAAAGATAAACGCCTTGTGCTTCGATACCCTGTGTTTGTTGAAGCACATGAAACTTTAAAATGTTGTTCCGTAAGTGATACCCTACTCAGGCAAGTTAATCCTCATATAGAAACTGAATCAATCGGTAAGGATGAGGTTATGGTTGTTAGTACAAGTTCTAATATTATTCCTCCCAATGTAGATTATGAGCTTTTGGCAATTCTGTTGAAGGAGGATAAACTTATTGCTGTTGACCACATACTCATTGGGACGATAGATAAGAACGTACAGTTGGATTCTTCGATTGTTTCATCCATAATTGAGGTAAATTGCAAACGTAATCGACCTAACGGTGCTCTGCTGGCTTTTGACTACTGTTTAAAAAACGGTGTTAACATTGAGAGAAATCTGTATCTTCGCTTGATAGGGATTCTGATCCGATCGAGTATATATTCGAAGTTGCTGGAAGTTGTGCAGGAAATGTATAGGCAAGGGCATTGTCTTGGACTCTATCATGCCACAGTTATCCTTTATAGGCTTGGGAAAGCTGGAAAGCCGCAATATGCTAAGAAAGTTTTTCATATGTTGCCTGAAGAATTGAAGTGCACTGCAACTTACACTGCTCTGGTTGGTGCTTATTTCTCTGCTGGAAGTTCTAGTAAAGGGCTTAAAATTTACGAAACAATGCGAAAGAGAGGATTTACACCGTCTTTAGGCACATATAATGTGCTGTTAACTGGTCTTATGAAGAGCGGTAGAGTTGTTGAATTAGATATTTATAGAAGGGAGAAGAAGAGTTTTGAGATCAGTCATCATTCTCATCATAATACAATACTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGGTATCTTGA

Protein sequence

MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVKKEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEYMKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPNVDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYCLKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKPQYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLTGLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS
Homology
BLAST of Tan0016732 vs. ExPASy Swiss-Prot
Match: Q9ZU29 (Pentatricopeptide repeat-containing protein At2g01390 OS=Arabidopsis thaliana OX=3702 GN=At2g01390/At2g01380 PE=2 SV=2)

HSP 1 Score: 559.7 bits (1441), Expect = 4.0e-158
Identity = 296/559 (52.95%), Postives = 391/559 (69.95%), Query Frame = 0

Query: 29  SIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVKKEEVV-PKLYTRDTVRNAYNLLRNCSWS 88
           S+K LHS  + K     K FS+K     K+VK + +  P +YTRD V N YN+L+  +W 
Sbjct: 20  SVKLLHSLPRLKPTNS-KRFSQK----PKLVKTQTLPDPSVYTRDIVSNIYNILKYSNWD 79

Query: 89  SAQEHLERLPIRWDSYLINQILKTHPPLEKAWLFFNWASRLQIFKHDQYTYTTMLDIFGE 148
           SAQE L  L +RWDS++IN++LK HPP++KAWLFFNWA++++ FKHD +TYTTMLDIFGE
Sbjct: 80  SAQEQLPHLGVRWDSHIINRVLKAHPPMQKAWLFFNWAAQIKGFKHDHFTYTTMLDIFGE 139

Query: 149 AGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKANGCYPTVVS 208
           AGRI SM  VF  MKEKG+ ID VTYTSL+HW S+SGDVDGA+++W+EM+ NGC PTVVS
Sbjct: 140 AGRIQSMYSVFHLMKEKGVLIDTVTYTSLIHWVSSSGDVDGAMRLWEEMRDNGCEPTVVS 199

Query: 209 YTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIGAGKCREALDIFHKMQ 268
           YTAY+K+L  + +V+EAT+ YKEML++ +SPNC TYTVLMEYL+  GKC EALDIF KMQ
Sbjct: 200 YTAYMKMLFADGRVEEATEVYKEMLRSRVSPNCHTYTVLMEYLVATGKCEEALDIFFKMQ 259

Query: 269 DAGVYPDKVACNILIQKCSKSGEMVVMTQILEYMKDKRLVLRYPVFVEAHETLKCCSVSD 328
           + GV PDK ACNILI K  K GE   MT++L YMK+  +VLRYP+FVEA ETLK    SD
Sbjct: 260 EIGVQPDKAACNILIAKALKFGETSFMTRVLVYMKENGVVLRYPIFVEALETLKAAGESD 319

Query: 329 TLLRQVNPHIETESIGKDEVMVVSTSSNIIPPNVDYE--LLAILLKEDKLIAVDHILIGT 388
            LLR+VN HI  ES+   ++    T+      N D    + ++LL +  L+AVD +L   
Sbjct: 320 DLLREVNSHISVESLCSSDIDETPTAEVNDTKNSDDSRVISSVLLMKQNLVAVDILLNQM 379

Query: 389 IDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYCLKNGVNIERNLYLRLIGILIRSSIY 448
            D+N++LDS +VS+IIE NC R R  GA LAFDY L+ G++++++ YL LIG  +RS+  
Sbjct: 380 RDRNIKLDSFVVSAIIETNCDRCRTEGASLAFDYSLEMGIHLKKSAYLALIGNFLRSNEL 439

Query: 449 SKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKPQYAKKVFHMLPEELKCTATYTALVG 508
            K++EVV+EM +  H LG Y   ++++RLG   +P+ A  VF +LP++ K  A YTAL+ 
Sbjct: 440 PKVIEVVKEMVKAQHSLGCYQGAMLIHRLGFGRRPRLAADVFDLLPDDQKGVAAYTALMD 499

Query: 509 AYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLTGLMK-SGRVVELDIYRREKKSFEIS 568
            Y SAGS  K +KI   MR+R   PSLGTY+VLL+GL K S    E+ + R+EKKS   S
Sbjct: 500 VYISAGSPEKAMKILREMREREIMPSLGTYDVLLSGLEKTSDFQKEVALLRKEKKSLVAS 559

Query: 569 HHSHHNTILEEERICDLLF 584
                N +  E++ICDLLF
Sbjct: 560 ARFREN-VHVEDKICDLLF 572

BLAST of Tan0016732 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 2.5e-27
Identity = 119/436 (27.29%), Postives = 196/436 (44.95%), Query Frame = 0

Query: 147 EAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKANGCYPTVV 206
           +AGR      +F  +K+ G+  D+VTY  +M   S  G++D AIK+  EM  NGC P V+
Sbjct: 480 KAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVI 539

Query: 207 SYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIGAGKCREALDIFHKM 266
              + I  L   D+V EA   +  M +  L P   TY  L+  L   GK +EA+++F  M
Sbjct: 540 VVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGM 599

Query: 267 QDAGVYPDKVACNILIQKCSKSGEMVVMTQILEYMKDKRL---VLRYPVFV--------- 326
              G  P+ +  N L     K+ E+ +  ++L  M D      V  Y   +         
Sbjct: 600 VQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKMMDMGCVPDVFTYNTIIFGLVKNGQV 659

Query: 327 -EA----HETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVST---SSNIIPPNVDYE-L 386
            EA    H+  K        L  + P +   S+ +D   +++    +    P N+ +E L
Sbjct: 660 KEAMCFFHQMKKLVYPDFVTLCTLLPGVVKASLIEDAYKIITNFLYNCADQPANLFWEDL 719

Query: 387 LAILLKE---DKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYCLK 446
           +  +L E   D  ++    L+   +   +   SI+  II  +CK N  +GA   F+   K
Sbjct: 720 IGSILAEAGIDNAVSFSERLVA--NGICRDGDSILVPIIRYSCKHNNVSGARTLFEKFTK 779

Query: 447 N-GVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLG----LYHATVILYRLGKA 506
           + GV  +   Y  LIG L+ +     ++E+ Q+++ Q    G    +     +L   GK+
Sbjct: 780 DLGVQPKLPTYNLLIGGLLEAD----MIEIAQDVFLQVKSTGCIPDVATYNFLLDAYGKS 839

Query: 507 GKPQYAKKVF-HMLPEELKC-TATYTALVGAYFSAGSSSKGLKI-YETMRKRGFTPSLGT 551
           GK     +++  M   E +  T T+  ++     AG+    L + Y+ M  R F+P+  T
Sbjct: 840 GKIDELFELYKEMSTHECEANTITHNIVISGLVKAGNVDDALDLYYDLMSDRDFSPTACT 899

BLAST of Tan0016732 vs. ExPASy Swiss-Prot
Match: Q8GYP6 (Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX=3702 GN=At1g18900 PE=2 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 2.5e-27
Identity = 74/216 (34.26%), Postives = 113/216 (52.31%), Query Frame = 0

Query: 74  VRNAYNLLRNCSWS-SAQEHLERLPIRWDSYLINQILKTHPPLEKAWLFFNWASRLQIFK 133
           V N  ++LR   W  +A+E L+ L +R D+Y  NQ+LK       A  FF W  R   FK
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 134 HDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKV 193
           HD +TYTTM+   G A +  ++N +  +M   G + + VTY  L+H    +  ++ A+ V
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNV 421

Query: 194 WKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIG 253
           + +M+  GC P  V+Y   I I      +  A D Y+ M   GLSP+  TY+V++  L  
Sbjct: 422 FNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGK 481

Query: 254 AGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKS 289
           AG    A  +F +M D G  P+ V  NI++   +K+
Sbjct: 482 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKA 517

BLAST of Tan0016732 vs. ExPASy Swiss-Prot
Match: Q9SSF9 (Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana OX=3702 GN=At1g74750 PE=2 SV=1)

HSP 1 Score: 123.6 bits (309), Expect = 7.3e-27
Identity = 81/259 (31.27%), Postives = 123/259 (47.49%), Query Frame = 0

Query: 33  LHSFHQHKQEKPIKLFSRKLRKGTKVVKKEEVVPKLYTRD--TVRNAYNLLRNCSWS-SA 92
           +HS         ++ F +  R+  KV  +    P+ +      V N  ++LR   W  +A
Sbjct: 254 VHSSDDRTIISSVEGFGKPSREMMKVTPRTAPTPRQHCNPGYVVENVSSILRRFKWGHAA 313

Query: 93  QEHLERLPIRWDSYLINQILKTHPPLEKAWLFFNWASRLQIFKHDQYTYTTMLDIFGEAG 152
           +E L     R D+Y  NQ+LK       A  FF W  R   FKHD +TYTTM+   G A 
Sbjct: 314 EEALHNFGFRMDAYQANQVLKQMDNYANALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAK 373

Query: 153 RISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKANGCYPTVVSYT 212
           +   +N +  +M   G K + VTY  L+H    +  +  A+ V+ +M+  GC P  V+Y 
Sbjct: 374 QFGEINKLLDEMVRDGCKPNTVTYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYC 433

Query: 213 AYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIGAGKCREALDIFHKMQDA 272
             I I      +  A D Y+ M + GLSP+  TY+V++  L  AG    A  +F +M   
Sbjct: 434 TLIDIHAKAGFLDIAMDMYQRMQEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQ 493

Query: 273 GVYPDKVACNILIQKCSKS 289
           G  P+ V  NI+I   +K+
Sbjct: 494 GCTPNLVTFNIMIALHAKA 512

BLAST of Tan0016732 vs. ExPASy Swiss-Prot
Match: Q9ZU31 (Pentatricopeptide repeat-containing protein At2g01360 OS=Arabidopsis thaliana OX=3702 GN=At2g01360 PE=2 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 2.1e-26
Identity = 66/168 (39.29%), Postives = 104/168 (61.90%), Query Frame = 0

Query: 413 ALLAFDYCLKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILY 472
           A LA D+ L+ G+++E++ YL L G  +RS+  SK+++VV+EM +  H LG+YH  ++++
Sbjct: 15  ASLALDWSLEMGIHLEKSAYLALAGNFLRSNELSKVIDVVKEMVKSQHSLGVYHGAMLIH 74

Query: 473 RLGKAGKPQYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSL 532
            LG   +P  A +   +LP++ K  + YTAL+  Y SAGS  K +KI   MR+R   PSL
Sbjct: 75  MLGFGRRPSLAAEALDLLPDDQKGLSAYTALMDVYISAGSPEKAMKILGEMREREIMPSL 134

Query: 533 GTYNVLLTGLMKSGRVV-ELDIYRREKKSFEISHHSHHNTILEEERIC 580
           GTY+VLL+GL K+     E    R+E+KS  ++       +  E++IC
Sbjct: 135 GTYDVLLSGLEKTSDFQRETSSLRKEQKSL-VASTRFREIVHVEDKIC 181

BLAST of Tan0016732 vs. NCBI nr
Match: XP_022155480.1 (pentatricopeptide repeat-containing protein At2g01390 isoform X1 [Momordica charantia] >XP_022155481.1 pentatricopeptide repeat-containing protein At2g01390 isoform X1 [Momordica charantia])

HSP 1 Score: 1014.2 bits (2621), Expect = 4.5e-292
Identity = 496/588 (84.35%), Postives = 548/588 (93.20%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M+  N FSLLLSNYVVISAIR +IY +ISIK LHS  Q+KQEKPIKLFSRKLRKG KVV+
Sbjct: 1   MHYSNSFSLLLSNYVVISAIRKKIYHNISIKALHSLRQYKQEKPIKLFSRKLRKGAKVVE 60

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV PKLYTRDTVRN YN+LRN SWSSAQEHLERLP+RWDSYLINQ++KTHPPLEKAWL
Sbjct: 61  KEEVDPKLYTRDTVRNIYNILRNFSWSSAQEHLERLPMRWDSYLINQVMKTHPPLEKAWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWA RL+ FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 121 FFNWACRLRTFKHDQYTYTTMLDIFGEAGRISSMNYIFQQMKEKGIKIDAVTYTSLMHWR 180

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           S SGDVDGAIKVWKEMK NGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQ+GLSPNC
Sbjct: 181 SKSGDVDGAIKVWKEMKTNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNC 240

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYTVLMEYLIGAGKC+EALDIFHKMQDAGVYPDK ACNILI KC +SGEM+VMT ILEY
Sbjct: 241 CTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILILKCCRSGEMLVMTPILEY 300

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+ R VLRYPVFVEAH+TLK CSVS+TLLRQVNPHIETES+ KDEV+ V TSS IIP N
Sbjct: 301 MKENRFVLRYPVFVEAHQTLKSCSVSETLLRQVNPHIETESVSKDEVIHVITSSTIIPSN 360

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VD+EL+ ILLK++KLIAVD++L G +DKN+QLDS+I+S+IIEVNCK NRP+GALL FD+C
Sbjct: 361 VDHELMEILLKKEKLIAVDYLLTGMVDKNIQLDSAIISTIIEVNCKHNRPDGALLVFDHC 420

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LK+GVN++RNLYL LIG+LIRSSIYSKLLE+V EMYRQGHCLGLYHAT+ILYRLGKAGKP
Sbjct: 421 LKSGVNMKRNLYLGLIGVLIRSSIYSKLLEIVLEMYRQGHCLGLYHATLILYRLGKAGKP 480

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA K+F++LPEELKCTATYTALVGAYFSAGSS KGLKIYETMRK+GF+PSLGTYNVLLT
Sbjct: 481 QYAVKIFNVLPEELKCTATYTALVGAYFSAGSSGKGLKIYETMRKKGFSPSLGTYNVLLT 540

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL KSGRVVEL+IYRREKKSFEI ++SHH+ ILEE+RICDLL+GE++S
Sbjct: 541 GLEKSGRVVELEIYRREKKSFEIGYNSHHHIILEEDRICDLLYGEMIS 588

BLAST of Tan0016732 vs. NCBI nr
Match: XP_022971714.1 (pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima] >XP_022971715.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima])

HSP 1 Score: 1002.7 bits (2591), Expect = 1.3e-288
Identity = 500/588 (85.03%), Postives = 534/588 (90.82%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M C N FS L+SNYVV SAI  RIYQ+IS K LHS HQ+KQEKP   FSRKLRKGTK VK
Sbjct: 5   MRCSNSFSFLMSNYVVTSAICKRIYQNISSKCLHSSHQYKQEKPFSRFSRKLRKGTKGVK 64

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV    YTRDTVRN YN+LRNCSW SAQ H+E LPIRWDSYLINQ+LKTHPPLEKAWL
Sbjct: 65  KEEVNLTPYTRDTVRNIYNILRNCSWGSAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           SNSGDVDGAI+VW+EMKANGCYPTVVSYTAYIKILLDN +V++ATDTYKEMLQ+GLSPNC
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNGRVRKATDTYKEMLQSGLSPNC 244

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYTVLMEYLIG  K +EALDIFHKMQDAGVYPDK ACNILIQKC KSGEM+VMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+KRLVLRYPVFVEAHE LK CSVS TLL QVNPHIE ES+ K EV+ VSTS N+I P+
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VDYEL+A LLKE+KLIAVDHILIG  DKN+QLDSSI+ SIIEVNCKRNRPNGALLAFDYC
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LKNGV +ERNLYL LIG+LIRSSIYS LLE+VQ+MY +GHCLGLYHAT+ILYRLGKAGKP
Sbjct: 425 LKNGVKVERNLYLTLIGVLIRSSIYSNLLEIVQDMYTKGHCLGLYHATLILYRLGKAGKP 484

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA+KVF+MLPEELKCTATYTALV AYFSAGS  KGLKIYETMRK+GFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL+KS RVVELDIYRREKK FEISHHSHH TILEEERICDLLFGELVS
Sbjct: 545 GLVKSDRVVELDIYRREKKIFEISHHSHHGTILEEERICDLLFGELVS 592

BLAST of Tan0016732 vs. NCBI nr
Match: XP_038901985.1 (pentatricopeptide repeat-containing protein At2g01390 [Benincasa hispida])

HSP 1 Score: 1001.1 bits (2587), Expect = 3.9e-288
Identity = 495/587 (84.33%), Postives = 535/587 (91.14%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M+C N FS LLSNYVV SAI  RIYQ+IS K LHSFHQ+KQEKPIK F+RK RKGTKVVK
Sbjct: 1   MHCSNSFSFLLSNYVVTSAIGKRIYQNISSKCLHSFHQYKQEKPIKQFNRKSRKGTKVVK 60

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV  + YTRDTVRN YN+LR CSW SAQEHLE LPIRWDSYLINQ+LKTHPPLEK WL
Sbjct: 61  KEEVDLRRYTRDTVRNIYNILRQCSWGSAQEHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ+FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEK IKIDAVTYTSLMHWR
Sbjct: 121 FFNWASRLQMFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKRIKIDAVTYTSLMHWR 180

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           SNSGDV+GAIKVWKEMKANGCYPTVVSYTAYIKILLD+DQ+KEATDTYKEMLQ+GL PNC
Sbjct: 181 SNSGDVEGAIKVWKEMKANGCYPTVVSYTAYIKILLDSDQIKEATDTYKEMLQSGLPPNC 240

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYT+LMEYLIG GKC+EALDIF KMQDAGVYPDK ACNILIQKC KSGE +VMTQILEY
Sbjct: 241 CTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGETLVMTQILEY 300

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MKDKRLVLRYPVFVEAHETLK CSVS TLLRQVNPHIE ES+ K EV+ VST SNI+PPN
Sbjct: 301 MKDKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVSKGEVVNVSTRSNIVPPN 360

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VD+ELLAILLKE+KL A+D++L G +D+N+QLDSSI+ SI EVNCK NRPNGALLAF+YC
Sbjct: 361 VDHELLAILLKENKLTAIDYMLTGIVDRNIQLDSSIILSIFEVNCKSNRPNGALLAFNYC 420

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LK+GVNIER LYL LIGILIRSSIY KLLE+VQ+MY QGHCLGLYHAT+ILYRLGKAGKP
Sbjct: 421 LKDGVNIERKLYLDLIGILIRSSIYPKLLEIVQKMYTQGHCLGLYHATLILYRLGKAGKP 480

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA+KVF++LPEELKCTATYTALV AYFSAGSS KGLKIYETMRK+GF PSLGTYNVLL 
Sbjct: 481 QYARKVFNVLPEELKCTATYTALVDAYFSAGSSGKGLKIYETMRKKGFAPSLGTYNVLLA 540

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELV 588
           GL K GR+ EL IYR+E+KSFEISHHSH  TILEEERICDLL+GELV
Sbjct: 541 GLAKCGRIDELHIYRKERKSFEISHHSHLYTILEEERICDLLYGELV 587

BLAST of Tan0016732 vs. NCBI nr
Match: XP_022928072.1 (pentatricopeptide repeat-containing protein At2g01390 [Cucurbita moschata] >XP_022928073.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita moschata])

HSP 1 Score: 999.2 bits (2582), Expect = 1.5e-287
Identity = 498/588 (84.69%), Postives = 533/588 (90.65%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M C N FS L+SNYVV SAI  RIYQ+IS K LHS HQ+KQEKP   FSRKLRKGTK VK
Sbjct: 5   MRCSNHFSFLMSNYVVTSAICKRIYQNISSKCLHSSHQYKQEKPFSRFSRKLRKGTKGVK 64

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV    YTRDTVRN YN+LRNCSW+SAQ H+E LPIRWDSYLINQ+LKTHPPLEKAWL
Sbjct: 65  KEEVNLTPYTRDTVRNIYNILRNCSWASAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFRHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           SNSGDVDGAI+VW+EMKANGCYPTVVSYTAYIKILLDN +V++ATD YKEMLQ+GLSPNC
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNVRVRKATDAYKEMLQSGLSPNC 244

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYTVLMEYLIG  K +EALDIFHKMQDAG YPDK ACNILIQKC KSGEM+VMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGAYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+KRLVLRYPVFVEAHE LK CSVS TLL QVNPHIE ES+ K EV+ VSTS N+I P+
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VDYEL+A LLKE+KLIAVDHILIG  DKN+QLDSSI+ SIIEVNCKRNRPNGALLAFDYC
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LKNGV +ERNLYL LIG+LIRSSIYS LLE+VQEMY +GHCLGLYHAT+ILYRLGKAGKP
Sbjct: 425 LKNGVKVERNLYLTLIGVLIRSSIYSNLLEIVQEMYTKGHCLGLYHATLILYRLGKAGKP 484

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA+KVF+MLPEELKCTATYTALV AYFSAGS  KGLKIYETMRK+GFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL+KS RVVELDIYRREKK FEISHHSHH TILEEERICDLLFGELVS
Sbjct: 545 GLVKSDRVVELDIYRREKKIFEISHHSHHGTILEEERICDLLFGELVS 592

BLAST of Tan0016732 vs. NCBI nr
Match: XP_023512200.1 (pentatricopeptide repeat-containing protein At2g01390 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 993.4 bits (2567), Expect = 8.2e-286
Identity = 493/588 (83.84%), Postives = 529/588 (89.97%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M C N FS  +SNYVV SAI  R+YQ+IS K LHS HQ+KQEKP   F+RKLRKGTK VK
Sbjct: 5   MRCSNHFSFFMSNYVVTSAICKRVYQNISSKCLHSSHQYKQEKPFSRFNRKLRKGTKGVK 64

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEE+ P  YTRDTVRN YN+LRNCSW  AQ H+E LPIRWDSYLINQ+LKTHPPLEKAWL
Sbjct: 65  KEELDPTPYTRDTVRNIYNILRNCSWGFAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFRHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           SNSGDVDGAI+VW+EMKANGCYPTVVSYTAYIKILLDN +V++ATD YKEMLQ+GLSPNC
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNGRVRKATDAYKEMLQSGLSPNC 244

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYTVLMEYLIG  K +EALDIFHKMQDAG YPDK ACNILIQKC KSGEM+VMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGAYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+KRLVLRYPVFVEAHE LK CSVS TLL QVNPHIE ES+ K EV+ VSTS N+I P+
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VDYEL+A LLKE+KLIAVDHILIG  DKN+QLDSSI+ SIIEVNCKRNRPNGALLAFDYC
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LKNGV +ERNLYL LIG+LIRSSIYSKLLEVVQEMY +GHCLGLYHAT+ LYRLGKAGKP
Sbjct: 425 LKNGVKVERNLYLGLIGLLIRSSIYSKLLEVVQEMYTKGHCLGLYHATLTLYRLGKAGKP 484

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA+KVF+MLPEELKCTATYTALV AYFSAGS  KGLKIYETMRK+GFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL+KS RV ELDIYRREKK FEISHHSHH TILEEERICDLLFGE VS
Sbjct: 545 GLVKSDRVAELDIYRREKKIFEISHHSHHGTILEEERICDLLFGEFVS 592

BLAST of Tan0016732 vs. ExPASy TrEMBL
Match: A0A6J1DMJ2 (pentatricopeptide repeat-containing protein At2g01390 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022612 PE=4 SV=1)

HSP 1 Score: 1014.2 bits (2621), Expect = 2.2e-292
Identity = 496/588 (84.35%), Postives = 548/588 (93.20%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M+  N FSLLLSNYVVISAIR +IY +ISIK LHS  Q+KQEKPIKLFSRKLRKG KVV+
Sbjct: 1   MHYSNSFSLLLSNYVVISAIRKKIYHNISIKALHSLRQYKQEKPIKLFSRKLRKGAKVVE 60

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV PKLYTRDTVRN YN+LRN SWSSAQEHLERLP+RWDSYLINQ++KTHPPLEKAWL
Sbjct: 61  KEEVDPKLYTRDTVRNIYNILRNFSWSSAQEHLERLPMRWDSYLINQVMKTHPPLEKAWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWA RL+ FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 121 FFNWACRLRTFKHDQYTYTTMLDIFGEAGRISSMNYIFQQMKEKGIKIDAVTYTSLMHWR 180

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           S SGDVDGAIKVWKEMK NGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQ+GLSPNC
Sbjct: 181 SKSGDVDGAIKVWKEMKTNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNC 240

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYTVLMEYLIGAGKC+EALDIFHKMQDAGVYPDK ACNILI KC +SGEM+VMT ILEY
Sbjct: 241 CTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILILKCCRSGEMLVMTPILEY 300

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+ R VLRYPVFVEAH+TLK CSVS+TLLRQVNPHIETES+ KDEV+ V TSS IIP N
Sbjct: 301 MKENRFVLRYPVFVEAHQTLKSCSVSETLLRQVNPHIETESVSKDEVIHVITSSTIIPSN 360

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VD+EL+ ILLK++KLIAVD++L G +DKN+QLDS+I+S+IIEVNCK NRP+GALL FD+C
Sbjct: 361 VDHELMEILLKKEKLIAVDYLLTGMVDKNIQLDSAIISTIIEVNCKHNRPDGALLVFDHC 420

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LK+GVN++RNLYL LIG+LIRSSIYSKLLE+V EMYRQGHCLGLYHAT+ILYRLGKAGKP
Sbjct: 421 LKSGVNMKRNLYLGLIGVLIRSSIYSKLLEIVLEMYRQGHCLGLYHATLILYRLGKAGKP 480

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA K+F++LPEELKCTATYTALVGAYFSAGSS KGLKIYETMRK+GF+PSLGTYNVLLT
Sbjct: 481 QYAVKIFNVLPEELKCTATYTALVGAYFSAGSSGKGLKIYETMRKKGFSPSLGTYNVLLT 540

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL KSGRVVEL+IYRREKKSFEI ++SHH+ ILEE+RICDLL+GE++S
Sbjct: 541 GLEKSGRVVELEIYRREKKSFEIGYNSHHHIILEEDRICDLLYGEMIS 588

BLAST of Tan0016732 vs. ExPASy TrEMBL
Match: A0A6J1I9C5 (pentatricopeptide repeat-containing protein At2g01390 OS=Cucurbita maxima OX=3661 GN=LOC111470381 PE=4 SV=1)

HSP 1 Score: 1002.7 bits (2591), Expect = 6.5e-289
Identity = 500/588 (85.03%), Postives = 534/588 (90.82%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M C N FS L+SNYVV SAI  RIYQ+IS K LHS HQ+KQEKP   FSRKLRKGTK VK
Sbjct: 5   MRCSNSFSFLMSNYVVTSAICKRIYQNISSKCLHSSHQYKQEKPFSRFSRKLRKGTKGVK 64

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV    YTRDTVRN YN+LRNCSW SAQ H+E LPIRWDSYLINQ+LKTHPPLEKAWL
Sbjct: 65  KEEVNLTPYTRDTVRNIYNILRNCSWGSAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFKHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           SNSGDVDGAI+VW+EMKANGCYPTVVSYTAYIKILLDN +V++ATDTYKEMLQ+GLSPNC
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNGRVRKATDTYKEMLQSGLSPNC 244

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYTVLMEYLIG  K +EALDIFHKMQDAGVYPDK ACNILIQKC KSGEM+VMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+KRLVLRYPVFVEAHE LK CSVS TLL QVNPHIE ES+ K EV+ VSTS N+I P+
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VDYEL+A LLKE+KLIAVDHILIG  DKN+QLDSSI+ SIIEVNCKRNRPNGALLAFDYC
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LKNGV +ERNLYL LIG+LIRSSIYS LLE+VQ+MY +GHCLGLYHAT+ILYRLGKAGKP
Sbjct: 425 LKNGVKVERNLYLTLIGVLIRSSIYSNLLEIVQDMYTKGHCLGLYHATLILYRLGKAGKP 484

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA+KVF+MLPEELKCTATYTALV AYFSAGS  KGLKIYETMRK+GFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL+KS RVVELDIYRREKK FEISHHSHH TILEEERICDLLFGELVS
Sbjct: 545 GLVKSDRVVELDIYRREKKIFEISHHSHHGTILEEERICDLLFGELVS 592

BLAST of Tan0016732 vs. ExPASy TrEMBL
Match: A0A6J1EIW0 (pentatricopeptide repeat-containing protein At2g01390 OS=Cucurbita moschata OX=3662 GN=LOC111434967 PE=4 SV=1)

HSP 1 Score: 999.2 bits (2582), Expect = 7.2e-288
Identity = 498/588 (84.69%), Postives = 533/588 (90.65%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M C N FS L+SNYVV SAI  RIYQ+IS K LHS HQ+KQEKP   FSRKLRKGTK VK
Sbjct: 5   MRCSNHFSFLMSNYVVTSAICKRIYQNISSKCLHSSHQYKQEKPFSRFSRKLRKGTKGVK 64

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV    YTRDTVRN YN+LRNCSW+SAQ H+E LPIRWDSYLINQ+LKTHPPLEKAWL
Sbjct: 65  KEEVNLTPYTRDTVRNIYNILRNCSWASAQGHIETLPIRWDSYLINQVLKTHPPLEKAWL 124

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRLQ F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 125 FFNWASRLQNFRHDHYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 184

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           SNSGDVDGAI+VW+EMKANGCYPTVVSYTAYIKILLDN +V++ATD YKEMLQ+GLSPNC
Sbjct: 185 SNSGDVDGAIRVWEEMKANGCYPTVVSYTAYIKILLDNVRVRKATDAYKEMLQSGLSPNC 244

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYTVLMEYLIG  K +EALDIFHKMQDAG YPDK ACNILIQKC KSGEM+VMTQILEY
Sbjct: 245 CTYTVLMEYLIGEDKGKEALDIFHKMQDAGAYPDKAACNILIQKCCKSGEMLVMTQILEY 304

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+KRLVLRYPVFVEAHE LK CSVS TLL QVNPHIE ES+ K EV+ VSTS N+I P+
Sbjct: 305 MKEKRLVLRYPVFVEAHEILKSCSVSITLLSQVNPHIEIESVSKGEVVDVSTSCNVILPS 364

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VDYEL+A LLKE+KLIAVDHILIG  DKN+QLDSSI+ SIIEVNCKRNRPNGALLAFDYC
Sbjct: 365 VDYELVANLLKEEKLIAVDHILIGMKDKNIQLDSSIILSIIEVNCKRNRPNGALLAFDYC 424

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LKNGV +ERNLYL LIG+LIRSSIYS LLE+VQEMY +GHCLGLYHAT+ILYRLGKAGKP
Sbjct: 425 LKNGVKVERNLYLTLIGVLIRSSIYSNLLEIVQEMYTKGHCLGLYHATLILYRLGKAGKP 484

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA+KVF+MLPEELKCTATYTALV AYFSAGS  KGLKIYETMRK+GFTPSLGTYNVLL+
Sbjct: 485 QYARKVFNMLPEELKCTATYTALVAAYFSAGSFGKGLKIYETMRKKGFTPSLGTYNVLLS 544

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL+KS RVVELDIYRREKK FEISHHSHH TILEEERICDLLFGELVS
Sbjct: 545 GLVKSDRVVELDIYRREKKIFEISHHSHHGTILEEERICDLLFGELVS 592

BLAST of Tan0016732 vs. ExPASy TrEMBL
Match: A0A1S4E1N2 (pentatricopeptide repeat-containing protein At2g01390-like OS=Cucumis melo OX=3656 GN=LOC103497457 PE=4 SV=1)

HSP 1 Score: 988.8 bits (2555), Expect = 9.8e-285
Identity = 487/588 (82.82%), Postives = 532/588 (90.48%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M+  NRFSLLLSNYVVISAIR RIYQ+IS K LHS HQ+K+EKPI  FSR  RKGTKVVK
Sbjct: 1   MHFCNRFSLLLSNYVVISAIRKRIYQNISCKCLHSLHQYKREKPISRFSRNSRKGTKVVK 60

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV+P++YTRDTV N  N+LRNCSW+SAQ+HLE LPIRWDSYLINQ+LKTHPPLEK WL
Sbjct: 61  KEEVIPRVYTRDTVCNICNILRNCSWASAQKHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWASRL++FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDA TYTSLMHWR
Sbjct: 121 FFNWASRLKVFKHDQYTYTTMLDIFGEAGRISSMNYLFQQMKEKGIKIDAATYTSLMHWR 180

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           SNSGDVDGAIKVWKEMKANGC+PTVVSYTAYIKILLDN Q KEAT TYKEML+TGLSPNC
Sbjct: 181 SNSGDVDGAIKVWKEMKANGCHPTVVSYTAYIKILLDNGQSKEATATYKEMLKTGLSPNC 240

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYT+LMEYLIG GKC+EALDIF KMQDAGVYPDK ACNILIQKC KSGE +VMTQILE+
Sbjct: 241 CTYTILMEYLIGEGKCKEALDIFSKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEF 300

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+ R VLRYPVFVEAHE LK CSV   LLRQVNPHIE ESI K EV+ VST SN +PPN
Sbjct: 301 MKENRFVLRYPVFVEAHENLKSCSVGHALLRQVNPHIEIESISKGEVLDVSTGSNTVPPN 360

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VD ELLA+LLK++KL A+DH+LIG +DKN+QLDSSI+ SIIEVNCK NRPN A+LAFDYC
Sbjct: 361 VDNELLAMLLKDNKLTAIDHMLIGIVDKNIQLDSSIIYSIIEVNCKSNRPNSAMLAFDYC 420

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LKNGVNI R LYL LIGILIRSSIY KLLE+VQEMY QGHC+GLYHAT+ILY LG+AGKP
Sbjct: 421 LKNGVNIGRKLYLDLIGILIRSSIYPKLLEIVQEMYTQGHCIGLYHATLILYSLGRAGKP 480

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA+KVF++LPEELKCTATYT+LV AYFSAGSS KGLKI+ETMRK+GFTPSLGTYNVLL 
Sbjct: 481 QYARKVFNILPEELKCTATYTSLVDAYFSAGSSGKGLKIFETMRKKGFTPSLGTYNVLLN 540

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL KSGR VEL+IYRREKKSFEISHHS  NTIL++ERICDLLFGELVS
Sbjct: 541 GLAKSGRGVELNIYRREKKSFEISHHSRLNTILDDERICDLLFGELVS 588

BLAST of Tan0016732 vs. ExPASy TrEMBL
Match: A0A0A0LJM3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G295410 PE=4 SV=1)

HSP 1 Score: 982.6 bits (2539), Expect = 7.0e-283
Identity = 486/588 (82.65%), Postives = 531/588 (90.31%), Query Frame = 0

Query: 1   MNCFNRFSLLLSNYVVISAIRHRIYQDISIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVK 60
           M+  N FSLLLSNYVV SAIR RIYQ+IS K LHS HQ+K++KPI  FSR+ RKGTKV K
Sbjct: 1   MHFCNIFSLLLSNYVVSSAIRKRIYQNISSKCLHSLHQYKRDKPISRFSRQSRKGTKVAK 60

Query: 61  KEEVVPKLYTRDTVRNAYNLLRNCSWSSAQEHLERLPIRWDSYLINQILKTHPPLEKAWL 120
           KEEV+P+LYTRDTVRN  N+LRNCSW+SAQ+HLE LPIRWDSYLINQ+LKTHPPLEK WL
Sbjct: 61  KEEVIPRLYTRDTVRNICNILRNCSWASAQKHLEMLPIRWDSYLINQVLKTHPPLEKTWL 120

Query: 121 FFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180
           FFNWAS LQ+FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR
Sbjct: 121 FFNWASTLQVFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWR 180

Query: 181 SNSGDVDGAIKVWKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNC 240
           SNSGDVDGAIK+WKEMKANGC+PTVVSYTAYIKILLDN Q+ EAT TYK+MLQ+GLSPNC
Sbjct: 181 SNSGDVDGAIKLWKEMKANGCHPTVVSYTAYIKILLDNGQINEATATYKKMLQSGLSPNC 240

Query: 241 CTYTVLMEYLIGAGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKSGEMVVMTQILEY 300
           CTYT+LMEYLIG GKC+EALDIF KMQDAGVYPDK ACNILIQKC KSGE +VMTQILE+
Sbjct: 241 CTYTILMEYLIGEGKCKEALDIFSKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEF 300

Query: 301 MKDKRLVLRYPVFVEAHETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVSTSSNIIPPN 360
           MK+ R VLRYPVFVEAHETLK CSVS  LL+QVNPH+E ESI K EV+ VST SN +PPN
Sbjct: 301 MKENRFVLRYPVFVEAHETLKSCSVSYALLKQVNPHMEIESISKGEVVDVSTGSNTVPPN 360

Query: 361 VDYELLAILLKEDKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYC 420
           VD ELLA+LLK++KL AVDH+LIG +DKN+QLDSSI+ SIIEVNCK NRPN ALLAFDYC
Sbjct: 361 VDNELLAMLLKDNKLTAVDHMLIGIVDKNIQLDSSIIYSIIEVNCKSNRPNSALLAFDYC 420

Query: 421 LKNGVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKP 480
           LKN VNI+R LYL LIGILIRSSIY KLLE+VQEMY QGHCLGLYHAT+IL  LGKAGKP
Sbjct: 421 LKNSVNIKRKLYLDLIGILIRSSIYPKLLEIVQEMYTQGHCLGLYHATLILCSLGKAGKP 480

Query: 481 QYAKKVFHMLPEELKCTATYTALVGAYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLT 540
           QYA+KVF+MLPEELKCTATYTALV  YFSAGSS KGLKI+ETMRK+GFTPSLGTYNVLL 
Sbjct: 481 QYARKVFNMLPEELKCTATYTALVDGYFSAGSSGKGLKIFETMRKKGFTPSLGTYNVLLN 540

Query: 541 GLMKSGRVVELDIYRREKKSFEISHHSHHNTILEEERICDLLFGELVS 589
           GL K+GR VEL+IYRREKKSFEISHHS  NTIL++ERICDLLFGELVS
Sbjct: 541 GLAKNGRGVELNIYRREKKSFEISHHSRLNTILDDERICDLLFGELVS 588

BLAST of Tan0016732 vs. TAIR 10
Match: AT2G01390.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 559.7 bits (1441), Expect = 2.8e-159
Identity = 296/559 (52.95%), Postives = 391/559 (69.95%), Query Frame = 0

Query: 29  SIKYLHSFHQHKQEKPIKLFSRKLRKGTKVVKKEEVV-PKLYTRDTVRNAYNLLRNCSWS 88
           S+K LHS  + K     K FS+K     K+VK + +  P +YTRD V N YN+L+  +W 
Sbjct: 20  SVKLLHSLPRLKPTNS-KRFSQK----PKLVKTQTLPDPSVYTRDIVSNIYNILKYSNWD 79

Query: 89  SAQEHLERLPIRWDSYLINQILKTHPPLEKAWLFFNWASRLQIFKHDQYTYTTMLDIFGE 148
           SAQE L  L +RWDS++IN++LK HPP++KAWLFFNWA++++ FKHD +TYTTMLDIFGE
Sbjct: 80  SAQEQLPHLGVRWDSHIINRVLKAHPPMQKAWLFFNWAAQIKGFKHDHFTYTTMLDIFGE 139

Query: 149 AGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKANGCYPTVVS 208
           AGRI SM  VF  MKEKG+ ID VTYTSL+HW S+SGDVDGA+++W+EM+ NGC PTVVS
Sbjct: 140 AGRIQSMYSVFHLMKEKGVLIDTVTYTSLIHWVSSSGDVDGAMRLWEEMRDNGCEPTVVS 199

Query: 209 YTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIGAGKCREALDIFHKMQ 268
           YTAY+K+L  + +V+EAT+ YKEML++ +SPNC TYTVLMEYL+  GKC EALDIF KMQ
Sbjct: 200 YTAYMKMLFADGRVEEATEVYKEMLRSRVSPNCHTYTVLMEYLVATGKCEEALDIFFKMQ 259

Query: 269 DAGVYPDKVACNILIQKCSKSGEMVVMTQILEYMKDKRLVLRYPVFVEAHETLKCCSVSD 328
           + GV PDK ACNILI K  K GE   MT++L YMK+  +VLRYP+FVEA ETLK    SD
Sbjct: 260 EIGVQPDKAACNILIAKALKFGETSFMTRVLVYMKENGVVLRYPIFVEALETLKAAGESD 319

Query: 329 TLLRQVNPHIETESIGKDEVMVVSTSSNIIPPNVDYE--LLAILLKEDKLIAVDHILIGT 388
            LLR+VN HI  ES+   ++    T+      N D    + ++LL +  L+AVD +L   
Sbjct: 320 DLLREVNSHISVESLCSSDIDETPTAEVNDTKNSDDSRVISSVLLMKQNLVAVDILLNQM 379

Query: 389 IDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYCLKNGVNIERNLYLRLIGILIRSSIY 448
            D+N++LDS +VS+IIE NC R R  GA LAFDY L+ G++++++ YL LIG  +RS+  
Sbjct: 380 RDRNIKLDSFVVSAIIETNCDRCRTEGASLAFDYSLEMGIHLKKSAYLALIGNFLRSNEL 439

Query: 449 SKLLEVVQEMYRQGHCLGLYHATVILYRLGKAGKPQYAKKVFHMLPEELKCTATYTALVG 508
            K++EVV+EM +  H LG Y   ++++RLG   +P+ A  VF +LP++ K  A YTAL+ 
Sbjct: 440 PKVIEVVKEMVKAQHSLGCYQGAMLIHRLGFGRRPRLAADVFDLLPDDQKGVAAYTALMD 499

Query: 509 AYFSAGSSSKGLKIYETMRKRGFTPSLGTYNVLLTGLMK-SGRVVELDIYRREKKSFEIS 568
            Y SAGS  K +KI   MR+R   PSLGTY+VLL+GL K S    E+ + R+EKKS   S
Sbjct: 500 VYISAGSPEKAMKILREMREREIMPSLGTYDVLLSGLEKTSDFQKEVALLRKEKKSLVAS 559

Query: 569 HHSHHNTILEEERICDLLF 584
                N +  E++ICDLLF
Sbjct: 560 ARFREN-VHVEDKICDLLF 572

BLAST of Tan0016732 vs. TAIR 10
Match: AT1G18900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 125.2 bits (313), Expect = 1.8e-28
Identity = 74/216 (34.26%), Postives = 113/216 (52.31%), Query Frame = 0

Query: 74  VRNAYNLLRNCSWS-SAQEHLERLPIRWDSYLINQILKTHPPLEKAWLFFNWASRLQIFK 133
           V N  ++LR   W  +A+E L+ L +R D+Y  NQ+LK       A  FF W  R   FK
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 134 HDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKV 193
           HD +TYTTM+   G A +  ++N +  +M   G + + VTY  L+H    +  ++ A+ V
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNV 421

Query: 194 WKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIG 253
           + +M+  GC P  V+Y   I I      +  A D Y+ M   GLSP+  TY+V++  L  
Sbjct: 422 FNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGK 481

Query: 254 AGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKS 289
           AG    A  +F +M D G  P+ V  NI++   +K+
Sbjct: 482 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKA 517

BLAST of Tan0016732 vs. TAIR 10
Match: AT1G18900.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 125.2 bits (313), Expect = 1.8e-28
Identity = 74/216 (34.26%), Postives = 113/216 (52.31%), Query Frame = 0

Query: 74  VRNAYNLLRNCSWS-SAQEHLERLPIRWDSYLINQILKTHPPLEKAWLFFNWASRLQIFK 133
           V N  ++LR   W  +A+E L+ L +R D+Y  NQ+LK       A  FF W  R   FK
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 134 HDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKV 193
           HD +TYTTM+   G A +  ++N +  +M   G + + VTY  L+H    +  ++ A+ V
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNV 421

Query: 194 WKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIG 253
           + +M+  GC P  V+Y   I I      +  A D Y+ M   GLSP+  TY+V++  L  
Sbjct: 422 FNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGK 481

Query: 254 AGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKS 289
           AG    A  +F +M D G  P+ V  NI++   +K+
Sbjct: 482 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKA 517

BLAST of Tan0016732 vs. TAIR 10
Match: AT1G18900.3 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 125.2 bits (313), Expect = 1.8e-28
Identity = 74/216 (34.26%), Postives = 113/216 (52.31%), Query Frame = 0

Query: 74  VRNAYNLLRNCSWS-SAQEHLERLPIRWDSYLINQILKTHPPLEKAWLFFNWASRLQIFK 133
           V N  ++LR   W  +A+E L+ L +R D+Y  NQ+LK       A  FF W  R   FK
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 134 HDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKV 193
           HD +TYTTM+   G A +  ++N +  +M   G + + VTY  L+H    +  ++ A+ V
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYLNEAMNV 421

Query: 194 WKEMKANGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIG 253
           + +M+  GC P  V+Y   I I      +  A D Y+ M   GLSP+  TY+V++  L  
Sbjct: 422 FNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINCLGK 481

Query: 254 AGKCREALDIFHKMQDAGVYPDKVACNILIQKCSKS 289
           AG    A  +F +M D G  P+ V  NI++   +K+
Sbjct: 482 AGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKA 517

BLAST of Tan0016732 vs. TAIR 10
Match: AT4G31850.1 (proton gradient regulation 3 )

HSP 1 Score: 125.2 bits (313), Expect = 1.8e-28
Identity = 119/436 (27.29%), Postives = 196/436 (44.95%), Query Frame = 0

Query: 147 EAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKANGCYPTVV 206
           +AGR      +F  +K+ G+  D+VTY  +M   S  G++D AIK+  EM  NGC P V+
Sbjct: 480 KAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVI 539

Query: 207 SYTAYIKILLDNDQVKEATDTYKEMLQTGLSPNCCTYTVLMEYLIGAGKCREALDIFHKM 266
              + I  L   D+V EA   +  M +  L P   TY  L+  L   GK +EA+++F  M
Sbjct: 540 VVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGM 599

Query: 267 QDAGVYPDKVACNILIQKCSKSGEMVVMTQILEYMKDKRL---VLRYPVFV--------- 326
              G  P+ +  N L     K+ E+ +  ++L  M D      V  Y   +         
Sbjct: 600 VQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKMMDMGCVPDVFTYNTIIFGLVKNGQV 659

Query: 327 -EA----HETLKCCSVSDTLLRQVNPHIETESIGKDEVMVVST---SSNIIPPNVDYE-L 386
            EA    H+  K        L  + P +   S+ +D   +++    +    P N+ +E L
Sbjct: 660 KEAMCFFHQMKKLVYPDFVTLCTLLPGVVKASLIEDAYKIITNFLYNCADQPANLFWEDL 719

Query: 387 LAILLKE---DKLIAVDHILIGTIDKNVQLDSSIVSSIIEVNCKRNRPNGALLAFDYCLK 446
           +  +L E   D  ++    L+   +   +   SI+  II  +CK N  +GA   F+   K
Sbjct: 720 IGSILAEAGIDNAVSFSERLVA--NGICRDGDSILVPIIRYSCKHNNVSGARTLFEKFTK 779

Query: 447 N-GVNIERNLYLRLIGILIRSSIYSKLLEVVQEMYRQGHCLG----LYHATVILYRLGKA 506
           + GV  +   Y  LIG L+ +     ++E+ Q+++ Q    G    +     +L   GK+
Sbjct: 780 DLGVQPKLPTYNLLIGGLLEAD----MIEIAQDVFLQVKSTGCIPDVATYNFLLDAYGKS 839

Query: 507 GKPQYAKKVF-HMLPEELKC-TATYTALVGAYFSAGSSSKGLKI-YETMRKRGFTPSLGT 551
           GK     +++  M   E +  T T+  ++     AG+    L + Y+ M  R F+P+  T
Sbjct: 840 GKIDELFELYKEMSTHECEANTITHNIVISGLVKAGNVDDALDLYYDLMSDRDFSPTACT 899

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZU294.0e-15852.95Pentatricopeptide repeat-containing protein At2g01390 OS=Arabidopsis thaliana OX... [more]
Q9SZ522.5e-2727.29Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop... [more]
Q8GYP62.5e-2734.26Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX... [more]
Q9SSF97.3e-2731.27Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana OX... [more]
Q9ZU312.1e-2639.29Pentatricopeptide repeat-containing protein At2g01360 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_022155480.14.5e-29284.35pentatricopeptide repeat-containing protein At2g01390 isoform X1 [Momordica char... [more]
XP_022971714.11.3e-28885.03pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima] >XP_022... [more]
XP_038901985.13.9e-28884.33pentatricopeptide repeat-containing protein At2g01390 [Benincasa hispida][more]
XP_022928072.11.5e-28784.69pentatricopeptide repeat-containing protein At2g01390 [Cucurbita moschata] >XP_0... [more]
XP_023512200.18.2e-28683.84pentatricopeptide repeat-containing protein At2g01390 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
A0A6J1DMJ22.2e-29284.35pentatricopeptide repeat-containing protein At2g01390 isoform X1 OS=Momordica ch... [more]
A0A6J1I9C56.5e-28985.03pentatricopeptide repeat-containing protein At2g01390 OS=Cucurbita maxima OX=366... [more]
A0A6J1EIW07.2e-28884.69pentatricopeptide repeat-containing protein At2g01390 OS=Cucurbita moschata OX=3... [more]
A0A1S4E1N29.8e-28582.82pentatricopeptide repeat-containing protein At2g01390-like OS=Cucumis melo OX=36... [more]
A0A0A0LJM37.0e-28382.65Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G295410 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G01390.12.8e-15952.95Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G18900.11.8e-2834.26Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.21.8e-2834.26Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.31.8e-2834.26Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G31850.11.8e-2827.29proton gradient regulation 3 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 229..306
e-value: 2.4E-15
score: 58.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 371..580
e-value: 1.9E-20
score: 75.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 89..222
e-value: 1.3E-26
score: 95.1
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 499..531
e-value: 2.5E-6
score: 25.3
coord: 206..240
e-value: 3.5E-4
score: 18.5
coord: 241..274
e-value: 6.7E-7
score: 27.1
coord: 171..205
e-value: 9.6E-9
score: 32.9
coord: 136..170
e-value: 4.5E-8
score: 30.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 496..544
e-value: 4.6E-10
score: 39.5
coord: 241..286
e-value: 5.1E-9
score: 36.2
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 157..215
e-value: 1.4E-13
score: 50.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 496..530
score: 10.742131
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 204..238
score: 9.766576
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 239..273
score: 10.511944
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 169..203
score: 11.838262
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 134..168
score: 10.89559
NoneNo IPR availablePANTHERPTHR47938:SF18OS10G0358700 PROTEINcoord: 36..587
NoneNo IPR availablePANTHERPTHR47938RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATEDcoord: 36..587

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0016732.1Tan0016732.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding