Tan0015212 (gene) Snake gourd v1

Overview
NameTan0015212
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG11: 4316816 .. 4319473 (+)
RNA-Seq ExpressionTan0015212
SyntenyTan0015212
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAATCAGTCAAAAAAAAAAGAGAGAGAAAAAAAACTGATTATTCGTGGGGGGAAAAATGCAAAACCTAAAACCCCGGGTTCGTTATTGCGAAGAAGTTGAACTACCAACCAAAAAACTGGCGCTGGCGGAGTACTCGATGCTGATGCTTGCTTTTGAATCCAGATCCCTTCTTCTTCCTCTCCGGCGTCTCCGCCATTGTTTTTTTCCGGAGGATGAAGATGAACCAACGCCTAATCTACTACACTTTCATCTTTCCAATTAACCACGTTTTCTCCTTCCAATCCTTCTAATTCATCCAATTTCAGGATATTCATCAGCTTCCTAATCTCTCGAGGTAACTTCCCCACCCCCCGCCATTCTTGATTTGACGCATCTCTTCTAATTTTTCAGCATGGAAGTCCCTCTCTCCCGCTATCAAAACTATCTTTATGATCGCCTTCAATGCAGCTCCACTTCTAGCTCTACTTCCTACTTCTCCGTTCGTTTCTCAGATTCCGACCTTTTTAGAAAGAGATCTTTGCTTTCTGGGTATTCTCTGTGGTCTAACAGAAGAAAATCGCTTAATTCGTTTTGTTGGGTCAAGTGCTCTTCGTTGGAACAAGGCCTACACCCACGACCCAAACCTAAACCTTCGAAAATCGATCCGGATATTCGTAAAGGGACCTCTTCGAAGAAGACCCATATCAGAAAATCCGGTGTAGGGATCTGTAGCCAGATAGAGAAGTTGGTTTTGTGTAAGAAGTACCGAGATGCACTTGAGATGTTTGAATTTTTTGAGCTGGAGGGTGGTTATGATGTTGGTAATAGCACGTTTGATGCGTTGATTAATGCATGTATTGGCTTGAAATCTGTAAGAGGGGTGAAGAGGTTGTGTAATTACATGATTGATCATGGAATTGAGCCTGATCAATATATGAGGAACAGGGTTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAAATGCCCGAGAGGAATGCGGTTTCGTGGAATACTATAATTTCCGGGTATGTAGACTCTGGAAATTATGAAGAAGCGTTTAGATTGTTCATAATGATGTGGGAAGAGTACTCTGATTGTGGTCCTCGCACCTTTGCCACAATGATACGGGCATCGGCTGGTTTAGAACTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGGTAAAGGCAGGCGTGGGACAGAACATTTTTGTTTCCTGTGCGTTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTGATGAGATGCCCGATAAGACGATAGTTGGATGGAATTCAATTATTGCTGGTTACGCACTCCATGGCTACAGTGAGGAAGCTCTGAATCTATGTTATGAGATGCGTGACTCTGGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGGATATGTTCGAGATTAGCTTCTGTAGCACGTGCCAAGCAAGCGCATGCGAGCTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTGTAAAAACGTAATATCATGGAATGCTCTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTCGGGAAGGCATGATGCCGAACCATGTGACATTTCTTTCTGTTCTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATATTTCAAACAATGACTAGGGATCACAAGATTAAACCACGCGCTATGCATTACGCTTGCTTGATTGAGTTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCCACAGCAAATATGTGGGCTGCCTTGCTTCGAGCTTGTAGGGTTCATGAAAATCTAGAACTTGGGAAATTTGCTGCTGAAAAGCTTTATGGGATGGAACCCGAGAAGCTCAGTAATTACATTGTGCTTTTAAACATATATAACAGTTCTGGCAAGTTAAAGGAAGCAGCTGAGGTTGTTCGGACATTGAAAAGAAAGGGCTTGAGAATGCTTCCGGCATGTAGTTGGATTGAAGTTAATAATCAACCCCATGCATTCCTCTCTGGGGATAAACACCATACCGAAATAGAAAAAGTCGTCGAGAAAGTGGACGAATTAATGCTAAAGATCTCAAAGCTTGGTCATGTTCCTGAACAGAACTTCTTGCTTTCAGATGTTGATGAACATGAAGAAAAGATACAAATGTACCACAGTGAAAAACTGGCAATAGCTTATGGACTTATCAATACGTTGAAGCGAACGCCATTGCAAATTGTGCAGAGCCATCGCATTTGCGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGATCAGAGATGCTAGCAGATTCCACCATTTCAGAGATGGCAATTGTTCTTGTGGAGACTATTGGTGAGGAAGGTAAATCTATATTGTTACTTTGTTTGTTTATTTTCATAGGAACGGATAACCTCTCTTCCATTCGATAAACATGATATATTGTACATTCTTTTTCAAGGAGCTTATAATGAAATGAATTCAT

mRNA sequence

AAAAAAATCAGTCAAAAAAAAAAGAGAGAGAAAAAAAACTGATTATTCGTGGGGGGAAAAATGCAAAACCTAAAACCCCGGGTTCGTTATTGCGAAGAAGTTGAACTACCAACCAAAAAACTGGCGCTGGCGGAGTACTCGATGCTGATGCTTGCTTTTGAATCCAGATCCCTTCTTCTTCCTCTCCGGCGTCTCCGCCATTGTTTTTTTCCGGAGGATGAAGATGAACCAACGCCTAATCTACTACACTTTCATCTTTCCAATTAACCACGTTTTCTCCTTCCAATCCTTCTAATTCATCCAATTTCAGGATATTCATCAGCTTCCTAATCTCTCGAGGTAACTTCCCCACCCCCCGCCATTCTTGATTTGACGCATCTCTTCTAATTTTTCAGCATGGAAGTCCCTCTCTCCCGCTATCAAAACTATCTTTATGATCGCCTTCAATGCAGCTCCACTTCTAGCTCTACTTCCTACTTCTCCGTTCGTTTCTCAGATTCCGACCTTTTTAGAAAGAGATCTTTGCTTTCTGGGTATTCTCTGTGGTCTAACAGAAGAAAATCGCTTAATTCGTTTTGTTGGGTCAAGTGCTCTTCGTTGGAACAAGGCCTACACCCACGACCCAAACCTAAACCTTCGAAAATCGATCCGGATATTCGTAAAGGGACCTCTTCGAAGAAGACCCATATCAGAAAATCCGGTGTAGGGATCTGTAGCCAGATAGAGAAGTTGGTTTTGTGTAAGAAGTACCGAGATGCACTTGAGATGTTTGAATTTTTTGAGCTGGAGGGTGGTTATGATGTTGGTAATAGCACGTTTGATGCGTTGATTAATGCATGTATTGGCTTGAAATCTGTAAGAGGGGTGAAGAGGTTGTGTAATTACATGATTGATCATGGAATTGAGCCTGATCAATATATGAGGAACAGGGTTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAAATGCCCGAGAGGAATGCGGTTTCGTGGAATACTATAATTTCCGGGTATGTAGACTCTGGAAATTATGAAGAAGCGTTTAGATTGTTCATAATGATGTGGGAAGAGTACTCTGATTGTGGTCCTCGCACCTTTGCCACAATGATACGGGCATCGGCTGGTTTAGAACTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGGTAAAGGCAGGCGTGGGACAGAACATTTTTGTTTCCTGTGCGTTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTGATGAGATGCCCGATAAGACGATAGTTGGATGGAATTCAATTATTGCTGGTTACGCACTCCATGGCTACAGTGAGGAAGCTCTGAATCTATGTTATGAGATGCGTGACTCTGGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGGATATGTTCGAGATTAGCTTCTGTAGCACGTGCCAAGCAAGCGCATGCGAGCTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTGTAAAAACGTAATATCATGGAATGCTCTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTCGGGAAGGCATGATGCCGAACCATGTGACATTTCTTTCTGTTCTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATATTTCAAACAATGACTAGGGATCACAAGATTAAACCACGCGCTATGCATTACGCTTGCTTGATTGAGTTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCCACAGCAAATATGTGGGCTGCCTTGCTTCGAGCTTGTAGGGTTCATGAAAATCTAGAACTTGGGAAATTTGCTGCTGAAAAGCTTTATGGGATGGAACCCGAGAAGCTCAGTAATTACATTGTGCTTTTAAACATATATAACAGTTCTGGCAAGTTAAAGGAAGCAGCTGAGGTTGTTCGGACATTGAAAAGAAAGGGCTTGAGAATGCTTCCGGCATGTAGTTGGATTGAAGTTAATAATCAACCCCATGCATTCCTCTCTGGGGATAAACACCATACCGAAATAGAAAAAGTCGTCGAGAAAGTGGACGAATTAATGCTAAAGATCTCAAAGCTTGGTCATGTTCCTGAACAGAACTTCTTGCTTTCAGATGTTGATGAACATGAAGAAAAGATACAAATGTACCACAGTGAAAAACTGGCAATAGCTTATGGACTTATCAATACGTTGAAGCGAACGCCATTGCAAATTGTGCAGAGCCATCGCATTTGCGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGATCAGAGATGCTAGCAGATTCCACCATTTCAGAGATGGCAATTGTTCTTGTGGAGACTATTGGTGAGGAAGGTAAATCTATATTGTTACTTTGTTTGTTTATTTTCATAGGAACGGATAACCTCTCTTCCATTCGATAAACATGATATATTGTACATTCTTTTTCAAGGAGCTTATAATGAAATGAATTCAT

Coding sequence (CDS)

ATGGAAGTCCCTCTCTCCCGCTATCAAAACTATCTTTATGATCGCCTTCAATGCAGCTCCACTTCTAGCTCTACTTCCTACTTCTCCGTTCGTTTCTCAGATTCCGACCTTTTTAGAAAGAGATCTTTGCTTTCTGGGTATTCTCTGTGGTCTAACAGAAGAAAATCGCTTAATTCGTTTTGTTGGGTCAAGTGCTCTTCGTTGGAACAAGGCCTACACCCACGACCCAAACCTAAACCTTCGAAAATCGATCCGGATATTCGTAAAGGGACCTCTTCGAAGAAGACCCATATCAGAAAATCCGGTGTAGGGATCTGTAGCCAGATAGAGAAGTTGGTTTTGTGTAAGAAGTACCGAGATGCACTTGAGATGTTTGAATTTTTTGAGCTGGAGGGTGGTTATGATGTTGGTAATAGCACGTTTGATGCGTTGATTAATGCATGTATTGGCTTGAAATCTGTAAGAGGGGTGAAGAGGTTGTGTAATTACATGATTGATCATGGAATTGAGCCTGATCAATATATGAGGAACAGGGTTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATGAAATGCCCGAGAGGAATGCGGTTTCGTGGAATACTATAATTTCCGGGTATGTAGACTCTGGAAATTATGAAGAAGCGTTTAGATTGTTCATAATGATGTGGGAAGAGTACTCTGATTGTGGTCCTCGCACCTTTGCCACAATGATACGGGCATCGGCTGGTTTAGAACTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGGTAAAGGCAGGCGTGGGACAGAACATTTTTGTTTCCTGTGCGTTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTGATGAGATGCCCGATAAGACGATAGTTGGATGGAATTCAATTATTGCTGGTTACGCACTCCATGGCTACAGTGAGGAAGCTCTGAATCTATGTTATGAGATGCGTGACTCTGGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGGATATGTTCGAGATTAGCTTCTGTAGCACGTGCCAAGCAAGCGCATGCGAGCTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCACTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTGTAAAAACGTAATATCATGGAATGCTCTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTCGGGAAGGCATGATGCCGAACCATGTGACATTTCTTTCTGTTCTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATATTTCAAACAATGACTAGGGATCACAAGATTAAACCACGCGCTATGCATTACGCTTGCTTGATTGAGTTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCCACAGCAAATATGTGGGCTGCCTTGCTTCGAGCTTGTAGGGTTCATGAAAATCTAGAACTTGGGAAATTTGCTGCTGAAAAGCTTTATGGGATGGAACCCGAGAAGCTCAGTAATTACATTGTGCTTTTAAACATATATAACAGTTCTGGCAAGTTAAAGGAAGCAGCTGAGGTTGTTCGGACATTGAAAAGAAAGGGCTTGAGAATGCTTCCGGCATGTAGTTGGATTGAAGTTAATAATCAACCCCATGCATTCCTCTCTGGGGATAAACACCATACCGAAATAGAAAAAGTCGTCGAGAAAGTGGACGAATTAATGCTAAAGATCTCAAAGCTTGGTCATGTTCCTGAACAGAACTTCTTGCTTTCAGATGTTGATGAACATGAAGAAAAGATACAAATGTACCACAGTGAAAAACTGGCAATAGCTTATGGACTTATCAATACGTTGAAGCGAACGCCATTGCAAATTGTGCAGAGCCATCGCATTTGCGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGATCAGAGATGCTAGCAGATTCCACCATTTCAGAGATGGCAATTGTTCTTGTGGAGACTATTGGTGA

Protein sequence

MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSFCWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIEKLVLCKKYRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
Homology
BLAST of Tan0015212 vs. ExPASy Swiss-Prot
Match: Q9FK33 (Pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H58 PE=2 SV=1)

HSP 1 Score: 882.1 bits (2278), Expect = 4.2e-255
Identity = 429/718 (59.75%), Postives = 540/718 (75.21%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           ME+PLSRYQ+   D ++ SS++     F  +FS          L G       R+  N F
Sbjct: 1   MEIPLSRYQSIRLDEIRDSSSNPKVLTFPRKFS----------LRG-------RRWKNPF 60

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSS--KKTHIRKSGVGICSQIEKLVLCKKY 120
             + CSS+ QGL P+PK KP  I  ++++        T I KSGV ICSQIEKLVLC ++
Sbjct: 61  GRLSCSSVVQGLKPKPKLKPEPIRIEVKESKDQILDDTQISKSGVTICSQIEKLVLCNRF 120

Query: 121 RDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNR 180
           R+A E+FE  E+   + VG ST+DAL+ ACI LKS+R VKR+  +M+ +G EP+QYM NR
Sbjct: 121 REAFELFEILEIRCSFKVGVSTYDALVEACIRLKSIRCVKRVYGFMMSNGFEPEQYMMNR 180

Query: 181 VLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCG 240
           +LLMHVKCGM+IDA RLFDE+PERN  S+ +IISG+V+ GNY EAF LF MMWEE SDC 
Sbjct: 181 ILLMHVKCGMIIDARRLFDEIPERNLYSYYSIISGFVNFGNYVEAFELFKMMWEELSDCE 240

Query: 241 PRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD 300
             TFA M+RASAGL  I+ G+QLH CA+K GV  N FVSC LIDMYSKCG +EDA C F+
Sbjct: 241 THTFAVMLRASAGLGSIYVGKQLHVCALKLGVVDNTFVSCGLIDMYSKCGDIEDARCAFE 300

Query: 301 EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVAR 360
            MP+KT V WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +  
Sbjct: 301 CMPEKTTVAWNNVIAGYALHGYSEEALCLLYDMRDSGVSIDQFTLSIMIRISTKLAKLEL 360

Query: 361 AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGN 420
            KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  KN+ISWNAL+ GY N
Sbjct: 361 TKQAHASLIRNGFESEIVANTALVDFYSKWGRVDTARYVFDKLPRKNIISWNALMGGYAN 420

Query: 421 HGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAM 480
           HGRG +A+++FEKM+   + PNHVTFL+VLSAC+ SGL E+GWEIF +M+  H IKPRAM
Sbjct: 421 HGRGTDAVKLFEKMIAANVAPNHVTFLAVLSACAYSGLSEQGWEIFLSMSEVHGIKPRAM 480

Query: 481 HYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGM 540
           HYAC+IELLGR+GLLDEA A IR+AP + T NMWAALL ACR+ ENLELG+  AEKLYGM
Sbjct: 481 HYACMIELLGRDGLLDEAIAFIRRAPLKTTVNMWAALLNACRMQENLELGRVVAEKLYGM 540

Query: 541 EPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDK- 600
            PEKL NY+V+ N+YNS GK  EAA V+ TL+ KGL M+PAC+W+EV +Q H+FLSGD+ 
Sbjct: 541 GPEKLGNYVVMYNMYNSMGKTAEAAGVLETLESKGLSMMPACTWVEVGDQTHSFLSGDRF 600

Query: 601 ---HHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDE-HEEKIQMYHSEKLAIAYGL 660
              + T   ++ +KVDELM +IS+ G+  E+  LL DVDE  EE++  YHSEKLAIAYGL
Sbjct: 601 DSYNETVKRQIYQKVDELMEEISEYGYSEEEQHLLPDVDEKEEERVGRYHSEKLAIAYGL 660

Query: 661 INTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           +NT +  PLQI Q+HRIC +CH V++ I+++T RE+V+RDASRFHHF++G CSCG YW
Sbjct: 661 VNTPEWNPLQITQNHRICKNCHKVVEFISLVTGREMVVRDASRFHHFKEGKCSCGGYW 701

BLAST of Tan0015212 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 485.3 bits (1248), Expect = 1.1e-135
Identity = 227/585 (38.80%), Postives = 363/585 (62.05%), Query Frame = 0

Query: 129 ELEGGYDVGNSTF-DALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCG 188
           +LEG Y   +  F + L+  C   K +   + +  +++      D  M N +L M+ KCG
Sbjct: 50  DLEGSYIPADRRFYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCG 109

Query: 189 MMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIR 248
            + +A ++F++MP+R+ V+W T+ISGY       +A   F  M          T +++I+
Sbjct: 110 SLEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIK 169

Query: 249 ASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVG 308
           A+A       G QLH   VK G   N+ V  AL+D+Y++ G ++DA  VFD +  +  V 
Sbjct: 170 AAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVS 229

Query: 309 WNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLV 368
           WN++IAG+A    +E+AL L   M   G +  HF+++ +   CS    + + K  HA ++
Sbjct: 230 WNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMI 289

Query: 369 RNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE 428
           ++G  L   A   L+D Y+K G + DAR +FDR++ ++V+SWN+L+  Y  HG G+EA+ 
Sbjct: 290 KSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVW 349

Query: 429 MFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELL 488
            FE+M R G+ PN ++FLSVL+ACS SGL + GW  ++ M +D  I P A HY  +++LL
Sbjct: 350 WFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKD-GIVPEAWHYVTVVDLL 409

Query: 489 GREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYI 548
           GR G L+ A   I + P +PTA +W ALL ACR+H+N ELG +AAE ++ ++P+    ++
Sbjct: 410 GRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHV 469

Query: 549 VLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVE 608
           +L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F++ D+ H + E++  
Sbjct: 470 ILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIAR 529

Query: 609 KVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQM-YHSEKLAIAYGLINTLKRTPLQIVQ 668
           K +E++ KI +LG+VP+ + ++  VD+ E ++ + YHSEK+A+A+ L+NT   + + I +
Sbjct: 530 KWEEVLAKIKELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKK 589

Query: 669 SHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           + R+CGDCH+ IKL + +  REI++RD +RFHHF+DGNCSC DYW
Sbjct: 590 NIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDGNCSCKDYW 633

BLAST of Tan0015212 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 1.9e-130
Identity = 226/611 (36.99%), Postives = 366/611 (59.90%), Query Frame = 0

Query: 138 NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVK---CGMMIDACR 197
           ++ F +++ +C  +  +R  + +  +++  G++ D Y  N ++ M+ K    G  I    
Sbjct: 105 HNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVGN 164

Query: 198 LFDEMPER---------------------------------NAVSWNTIISGYVDSGNYE 257
           +FDEMP+R                                 + VS+NTII+GY  SG YE
Sbjct: 165 VFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYE 224

Query: 258 EAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALI 317
           +A R+   M          T ++++   +    +  G+++H   ++ G+  ++++  +L+
Sbjct: 225 DALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLV 284

Query: 318 DMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHF 377
           DMY+K   +ED+  VF  +  +  + WNS++AGY  +G   EAL L  +M  + VK    
Sbjct: 285 DMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAV 344

Query: 378 TFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM 437
            FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Sbjct: 345 AFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM 404

Query: 438 SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGW 497
           +  + +SW A+I G+  HG G EA+ +FE+M R+G+ PN V F++VL+ACS  GL +  W
Sbjct: 405 NVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAW 464

Query: 498 EIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV 557
             F +MT+ + +     HYA + +LLGR G L+EAY  I K   +PT ++W+ LL +C V
Sbjct: 465 GYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSV 524

Query: 558 HENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACS 617
           H+NLEL +  AEK++ ++ E +  Y+++ N+Y S+G+ KE A++   +++KGLR  PACS
Sbjct: 525 HKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACS 584

Query: 618 WIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHEEKIQ 677
           WIE+ N+ H F+SGD+ H  ++K+ E +  +M ++ K G+V + + +L DVD EH+ ++ 
Sbjct: 585 WIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELL 644

Query: 678 MYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF 712
             HSE+LA+A+G+INT   T +++ ++ RIC DCH  IK I+ IT+REI++RD SRFHHF
Sbjct: 645 FGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHF 704

BLAST of Tan0015212 vs. ExPASy Swiss-Prot
Match: Q9SI53 (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 463.0 bits (1190), Expect = 6.1e-129
Identity = 224/575 (38.96%), Postives = 357/575 (62.09%), Query Frame = 0

Query: 138 NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFD 197
           ++T+  LI  CI  ++V     +C ++  +G  P  ++ N ++ M+VK  ++ DA +LFD
Sbjct: 61  SATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDAHQLFD 120

Query: 198 EMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFP 257
           +MP+RN +SW T+IS Y     +++A  L ++M  +       T+++++R+  G+  +  
Sbjct: 121 QMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCNGMSDV-- 180

Query: 258 GRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL 317
            R LH   +K G+  ++FV  ALID+++K G  EDA  VFDEM     + WNSII G+A 
Sbjct: 181 -RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSIIGGFAQ 240

Query: 318 HGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVA 377
           +  S+ AL L   M+ +G   +  T + ++R C+ LA +    QAH  +V+  +  D++ 
Sbjct: 241 NSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVK--YDQDLIL 300

Query: 378 NTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM 437
           N ALVD Y K G ++DA  VF++M  ++VI+W+ +I+G   +G  +EA+++FE+M   G 
Sbjct: 301 NNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERMKSSGT 360

Query: 438 MPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAY 497
            PN++T + VL ACS +GL E GW  F++M + + I P   HY C+I+LLG+ G LD+A 
Sbjct: 361 KPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDAV 420

Query: 498 ALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSG 557
            L+ +   +P A  W  LL ACRV  N+ L ++AA+K+  ++PE    Y +L NIY +S 
Sbjct: 421 KLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANSQ 480

Query: 558 KLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKIS 617
           K     E+   ++ +G++  P CSWIEVN Q HAF+ GD  H +I +V +K+++L+ +++
Sbjct: 481 KWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQLIHRLT 540

Query: 618 KLGHVPEQNFLLSDVD-EHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHS 677
            +G+VPE NF+L D++ E  E    +HSEKLA+A+GL+       ++I ++ RICGDCH 
Sbjct: 541 GIGYVPETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRIRKNLRICGDCHV 600

Query: 678 VIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
             KL + +  R IVIRD  R+HHF+DG CSCGDYW
Sbjct: 601 FCKLASKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of Tan0015212 vs. ExPASy Swiss-Prot
Match: Q9S7F4 (Putative pentatricopeptide repeat-containing protein At2g01510 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H36 PE=3 SV=1)

HSP 1 Score: 460.7 bits (1184), Expect = 3.0e-128
Identity = 228/597 (38.19%), Postives = 362/597 (60.64%), Query Frame = 0

Query: 118 YRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRN 177
           Y +++ +F     + G+   + TF  ++ A +GL      ++L    +  G   D  + N
Sbjct: 231 YTESIHLFLKMR-QSGHQPSDFTFSGVLKAVVGLHDFALGQQLHALSVTTGFSRDASVGN 290

Query: 178 RVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDC 237
           ++L  + K   +++   LFDEMPE + VS+N +IS Y  +  YE +   F  M     D 
Sbjct: 291 QILDFYSKHDRVLETRMLFDEMPELDFVSYNVVISSYSQADQYEASLHFFREMQCMGFDR 350

Query: 238 GPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVF 297
               FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F
Sbjct: 351 RNFPFATMLSIAANLSSLQMGRQLHCQALLATADSILHVGNSLVDMYAKCEMFEEAELIF 410

Query: 298 DEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVA 357
             +P +T V W ++I+GY   G     L L  +MR S ++ D  TF+ +++  +  AS+ 
Sbjct: 411 KSLPQRTTVSWTALISGYVQKGLHGAGLKLFTKMRGSNLRADQSTFATVLKASASFASLL 470

Query: 358 RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG 417
             KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  +N +SWNALI+ + 
Sbjct: 471 LGKQLHAFIIRSGNLENVFSGSGLVDMYAKCGSIKDAVQVFEEMPDRNAVSWNALISAHA 530

Query: 418 NHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRA 477
           ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ M+  + I P+ 
Sbjct: 531 DNGDGEAAIGAFAKMIESGLQPDSVSILGVLTACSHCGFVEQGTEYFQAMSPIYGITPKK 590

Query: 478 MHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYG 537
            HYAC+++LLGR G   EA  L+ + PF+P   MW+++L ACR+H+N  L + AAEKL+ 
Sbjct: 591 KHYACMLDLLGRNGRFAEAEKLMDEMPFEPDEIMWSSVLNACRIHKNQSLAERAAEKLFS 650

Query: 538 MEP-EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGD 597
           ME     + Y+ + NIY ++G+ ++  +V + ++ +G++ +PA SW+EVN++ H F S D
Sbjct: 651 MEKLRDAAAYVSMSNIYAAAGEWEKVRDVKKAMRERGIKKVPAYSWVEVNHKIHVFSSND 710

Query: 598 KHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQ--MYHSEKLAIAYGLI 657
           + H   +++V K++EL  +I + G+ P+ + ++ DVDE + KI+   YHSE+LA+A+ LI
Sbjct: 711 QTHPNGDEIVRKINELTAEIEREGYKPDTSSVVQDVDE-QMKIESLKYHSERLAVAFALI 770

Query: 658 NTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           +T +  P+ ++++ R C DCH+ IKLI+ I KREI +RD SRFHHF +G CSCGDYW
Sbjct: 771 STPEGCPIVVMKNLRACRDCHAAIKLISKIVKREITVRDTSRFHHFSEGVCSCGDYW 825

BLAST of Tan0015212 vs. NCBI nr
Match: XP_022133879.1 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia] >XP_022133880.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia])

HSP 1 Score: 1330.5 bits (3442), Expect = 0.0e+00
Identity = 634/712 (89.04%), Postives = 679/712 (95.37%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           MEVPL RYQNY+YDRLQCSSTSSS+SY  VRF+DS LFRKRSLLS Y+LWSNRRK  NSF
Sbjct: 3   MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSF 62

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKK-THIRKSGVGICSQIEKLVLCKKYR 120
           CW+KCSSLEQGL PRP+P+PSKID D+RKGTSS + T IRKSGVGICSQIEKLVLCKKYR
Sbjct: 63  CWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIRKSGVGICSQIEKLVLCKKYR 122

Query: 121 DALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRV 180
           DALEMFE FELEGGYD+GNST+DALINACIGLKS+RGVKRLCNYMID+G EPDQYM+NR+
Sbjct: 123 DALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRI 182

Query: 181 LLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGP 240
           LLMHVKCGMMIDACRLFDEMPERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE SD GP
Sbjct: 183 LLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGP 242

Query: 241 RTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE 300
           RTFA MIRASAGLELIFPGRQLHSCA+KAGVGQ+IFVSCALIDMYSKCGSLEDAHCVFDE
Sbjct: 243 RTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE 302

Query: 301 MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARA 360
           MPDKTIVGWNSIIAGYALHGYSEEAL+LCYEMRDSG+KMDHFTFSIIIRICSRLASVARA
Sbjct: 303 MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARA 362

Query: 361 KQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNH 420
           KQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+FDRMS KN+ISWNALIAGYGNH
Sbjct: 363 KQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYGNH 422

Query: 421 GRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMH 480
           GRGEEAI+MFE+MLREGM PNHVTFL+VLSACSISGLFERGWEIFQ++T DHKIKPRAMH
Sbjct: 423 GRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMH 482

Query: 481 YACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGME 540
           +AC+IELLGREGLLDEAYALIR APF+PTANMWAALLRACRVHENLELGK AAE LYGME
Sbjct: 483 FACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGME 542

Query: 541 PEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 600
           P+KLSNYIVLLNIYNSSGKLKEAA+VV+TLKRKGLRM+PACSWIEV NQPH+FLSGDKHH
Sbjct: 543 PDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH 602

Query: 601 TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKR 660
            EIEKVVEKVDE+MLKISKLG+V EQNFLL DVDE EEKI MYHSEKLAIAYGL++TLK+
Sbjct: 603 AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKK 662

Query: 661 TPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           TPLQIVQSHRICGDCHS IKLIA+IT+REIV+RDASRFHHFRDG+CSCGDYW
Sbjct: 663 TPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW 714

BLAST of Tan0015212 vs. NCBI nr
Match: XP_022989822.1 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1287.3 bits (3330), Expect = 0.0e+00
Identity = 623/711 (87.62%), Postives = 663/711 (93.25%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           MEVPL  YQNY++D L+ +S SSSTSYFS  FS S+LFR RSLLS YSLWSNRRK  NSF
Sbjct: 1   MEVPL--YQNYVHDHLRRTSLSSSTSYFSHHFSGSELFRDRSLLSAYSLWSNRRKLRNSF 60

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIEKLVLCKKYRD 120
           CWVKCSSLEQGL PR KPKPSK+D D+RKGT SK+T I KS V IC  IEKLVLC K+RD
Sbjct: 61  CWVKCSSLEQGLRPRLKPKPSKVDRDVRKGTPSKETRITKSSVRICCHIEKLVLCNKFRD 120

Query: 121 ALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVL 180
           ALEMFE  ELEGGYDVGNSTFDALI ACIGLKS+RG KRLC YMID+GIEPDQY+ NR+L
Sbjct: 121 ALEMFEILELEGGYDVGNSTFDALIIACIGLKSIRGAKRLCAYMIDNGIEPDQYIMNRIL 180

Query: 181 LMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPR 240
           LMHV+CGMMIDA +LFDEMPERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEEY  C PR
Sbjct: 181 LMHVRCGMMIDASKLFDEMPERNAVSWNTIISGYVDSGNYKEAFRLFIMMWEEYPGCSPR 240

Query: 241 TFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM 300
           TFAT+IRASAGLELIFPG+QLHSCAVKAGVGQ+IFVSCALIDMYSKCG LEDAHCVFDEM
Sbjct: 241 TFATVIRASAGLELIFPGKQLHSCAVKAGVGQDIFVSCALIDMYSKCGGLEDAHCVFDEM 300

Query: 301 PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAK 360
           PDKTIVGWNSIIAGYALHG+SEEALNL ++MRDSGVK+DHFTFSIIIRICSRLASV RAK
Sbjct: 301 PDKTIVGWNSIIAGYALHGHSEEALNLYFQMRDSGVKIDHFTFSIIIRICSRLASVTRAK 360

Query: 361 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHG 420
           QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDRMSCKN+ISWNALIAGYGNHG
Sbjct: 361 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHIFDRMSCKNLISWNALIAGYGNHG 420

Query: 421 RGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHY 480
           RGEEAIE+FE+MLREGM+PNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHKIK RAMHY
Sbjct: 421 RGEEAIEIFERMLREGMVPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKLRAMHY 480

Query: 481 ACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEP 540
            C+IELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGK+AAEKLYGMEP
Sbjct: 481 TCMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKYAAEKLYGMEP 540

Query: 541 EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT 600
           EKL NYIVLLNIY SSGKLKEAA+VVRTLKRKGL MLPACSWIEV +QPHAFLSGDKHH 
Sbjct: 541 EKLRNYIVLLNIYKSSGKLKEAADVVRTLKRKGLSMLPACSWIEVKHQPHAFLSGDKHHP 600

Query: 601 EIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRT 660
           EIEKVVEKVDELML+ISKLG+VPEQN LL DVD HEEKIQ+YHSEKLAIAYGLINTLK+T
Sbjct: 601 EIEKVVEKVDELMLEISKLGYVPEQNILLPDVD-HEEKIQIYHSEKLAIAYGLINTLKQT 660

Query: 661 PLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           PLQIVQ HR+CGDCHSVIKLIAMITKREIV+RDASRFHHFRDG CSCGDYW
Sbjct: 661 PLQIVQGHRVCGDCHSVIKLIAMITKREIVVRDASRFHHFRDGRCSCGDYW 708

BLAST of Tan0015212 vs. NCBI nr
Match: XP_008459324.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo] >XP_008459325.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo])

HSP 1 Score: 1284.2 bits (3322), Expect = 0.0e+00
Identity = 623/712 (87.50%), Postives = 664/712 (93.26%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           ME+PLSRYQNY+YDRLQC     ST YFS+R+SDS LF K S L      SNRRK  NSF
Sbjct: 3   MELPLSRYQNYVYDRLQC----YSTPYFSLRYSDSHLFMKTSFL------SNRRKCRNSF 62

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIEKLVLCKKYRD 120
           CWVKCSS EQGL PRP+PKPSK+D  +RK    K+T +RKS VGICSQIEKLVLCK+YRD
Sbjct: 63  CWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRD 122

Query: 121 ALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVL 180
           ALEMFE FELE G+ VGNST+DALINACIGLKS+RGVKRL NYM+D+G EPDQYMRNRVL
Sbjct: 123 ALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVL 182

Query: 181 LMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPR 240
           LMHVKCGMMIDACRLFDEMPERNAVSW+TIISGYVDSGNY EAFRLFI+MWEE   CGPR
Sbjct: 183 LMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPR 242

Query: 241 TFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM 300
           T ATMIRASAGLE+IF GRQLHSCA+KAG+GQ+IFVSCALIDMYSKCGSLEDAHCVFDEM
Sbjct: 243 TLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEM 302

Query: 301 PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAK 360
           PDKTIVGWNSIIAGYALHGYSEEAL+L +EM  SGVKMDHFTFSIIIRICSRLASVARAK
Sbjct: 303 PDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAK 362

Query: 361 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHG 420
           QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSC+NVISWNALIAGYGNHG
Sbjct: 363 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHG 422

Query: 421 RGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHY 480
           RGEEAI+MFEKMLREGMMPNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHK++PRAMH+
Sbjct: 423 RGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHF 482

Query: 481 ACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEP 540
           AC+IELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGMEP
Sbjct: 483 ACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGMEP 542

Query: 541 EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT 600
           EKLSNYIVLLNIYN+SGKLKEAA+VV+TLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 
Sbjct: 543 EKLSNYIVLLNIYNTSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHV 602

Query: 601 EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKR 660
           ++EKVV KVDELMLKISKLG+VP EQNF+L DVDEHEEKI+MYHSEKLAIAYGL+NTL+R
Sbjct: 603 QLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLER 662

Query: 661 TPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           TPLQIVQSHRIC DCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
Sbjct: 663 TPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 704

BLAST of Tan0015212 vs. NCBI nr
Match: XP_004148701.1 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus] >XP_031740897.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 1283.5 bits (3320), Expect = 0.0e+00
Identity = 622/714 (87.11%), Postives = 666/714 (93.28%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           ME+PLSRYQNY+YDRLQC    +STS+FS+R+SDSDLF K S L      SN RK  NSF
Sbjct: 3   MELPLSRYQNYVYDRLQC----NSTSFFSLRYSDSDLFTKTSFL------SNPRKYRNSF 62

Query: 61  CWVKCSSLEQGLHPRPK--PKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIEKLVLCKKY 120
           CW+KCSS EQGL PRP+  PKPSK+D   RK T  K+TH++KS VGICSQIEKLVLCKKY
Sbjct: 63  CWIKCSSFEQGLRPRPRPQPKPSKLDVGDRKETPLKETHVKKSSVGICSQIEKLVLCKKY 122

Query: 121 RDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNR 180
           RDALEMFE FELE G+ VG ST+DALINACIGLKS+RGVKRLCNYM+D+G EPDQYMRNR
Sbjct: 123 RDALEMFEIFELEDGFHVGYSTYDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNR 182

Query: 181 VLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCG 240
           VLLMHVKCGMMIDACRLFDEMP RNAVSW TIISGYVDSGNY EAFRLFI+M EE+ DCG
Sbjct: 183 VLLMHVKCGMMIDACRLFDEMPARNAVSWGTIISGYVDSGNYVEAFRLFILMREEFYDCG 242

Query: 241 PRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD 300
           PRTFATMIRASAGLE+IFPGRQLHSCA+KAG+GQ+IFVSCALIDMYSKCGSLEDAHCVFD
Sbjct: 243 PRTFATMIRASAGLEIIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFD 302

Query: 301 EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVAR 360
           EMPDKTIVGWNSIIAGYALHGYSEEAL+L +EMRDSGVKMDHFTFSIIIRICSRLASVAR
Sbjct: 303 EMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMRDSGVKMDHFTFSIIIRICSRLASVAR 362

Query: 361 AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGN 420
           AKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSC+N+ISWNALIAGYGN
Sbjct: 363 AKQVHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNIISWNALIAGYGN 422

Query: 421 HGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAM 480
           HG GEEAI+MFEKMLREGMMPNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHK+KPRAM
Sbjct: 423 HGHGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVKPRAM 482

Query: 481 HYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGM 540
           H+AC+IELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGM
Sbjct: 483 HFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGM 542

Query: 541 EPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKH 600
           EPEKLSNYIVLLNIYNSSGKLKEAA+V +TLKRKGLRMLPACSWIEVNNQPHAFLSGDKH
Sbjct: 543 EPEKLSNYIVLLNIYNSSGKLKEAADVFQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKH 602

Query: 601 HTEIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTL 660
           H +IEKVV KVDELML ISKLG+VP EQNF+L DVDE+EEKI+MYHSEKLAIAYGL+NTL
Sbjct: 603 HVQIEKVVGKVDELMLNISKLGYVPEEQNFMLPDVDENEEKIRMYHSEKLAIAYGLLNTL 662

Query: 661 KRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           ++TPLQIVQSHRIC DCHSVIKLIAMITKREIVIRDASRFHHFRDG+CSCGDYW
Sbjct: 663 EKTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGSCSCGDYW 706

BLAST of Tan0015212 vs. NCBI nr
Match: XP_038890388.1 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida] >XP_038890389.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida] >XP_038890390.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 1281.9 bits (3316), Expect = 0.0e+00
Identity = 619/712 (86.94%), Postives = 662/712 (92.98%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           ME+PLS YQNYLYDR+QC    +STSY S+RFS  DLFR+R  L       NRRK  NS 
Sbjct: 3   MEIPLSCYQNYLYDRVQC----NSTSYVSLRFSYFDLFRERFFL------CNRRKCRNSL 62

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIEKLVLCKKYRD 120
            W+KCSS EQGL PRP+PKPSK+DP + K T  K+TH+ +S VGICSQIEKLVLCKKYRD
Sbjct: 63  RWIKCSSFEQGLRPRPQPKPSKLDPGVHKITPLKETHVMQSSVGICSQIEKLVLCKKYRD 122

Query: 121 ALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVL 180
           ALEMFE FELEGG+  GN+T DALINAC+ LKS+RGVK+LCNYM+D+G EPDQYMRNRVL
Sbjct: 123 ALEMFEIFELEGGFHAGNTTLDALINACVELKSIRGVKKLCNYMVDNGFEPDQYMRNRVL 182

Query: 181 LMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPR 240
           LMHVKCGMMIDACRLFD+MPERNAVSWNTIISG+VDSGNY EAFRLFI+MWEEY DCGPR
Sbjct: 183 LMHVKCGMMIDACRLFDQMPERNAVSWNTIISGHVDSGNYVEAFRLFILMWEEYYDCGPR 242

Query: 241 TFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM 300
           TFATMIRASAGLELIFPGRQLHSCA+KA +GQ+IFVSCALIDMYSKCGSLEDAHCVFDEM
Sbjct: 243 TFATMIRASAGLELIFPGRQLHSCAIKADLGQDIFVSCALIDMYSKCGSLEDAHCVFDEM 302

Query: 301 PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAK 360
           PDKTIVGWNSIIAGYALHGYSEEAL+L YEMRDSG+KMDHFTFSIIIRICSRLASVA AK
Sbjct: 303 PDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGIKMDHFTFSIIIRICSRLASVACAK 362

Query: 361 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHG 420
           QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSC+N+ISWNALIAGYGNHG
Sbjct: 363 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNIISWNALIAGYGNHG 422

Query: 421 RGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHY 480
           RG EAI+MFEKMLREG +PNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHKIKPRAMHY
Sbjct: 423 RGVEAIDMFEKMLREGKIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHY 482

Query: 481 ACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEP 540
           AC+IELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGMEP
Sbjct: 483 ACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGMEP 542

Query: 541 EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT 600
           EKLSNYIVLLNIYNSSGKLKEAA+VV+TLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 
Sbjct: 543 EKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHV 602

Query: 601 EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKR 660
           +IEKVV KVDELMLKISKLG+VP EQNF+L DVDEHEEKIQMYHSEKLAIAYGL+NTL++
Sbjct: 603 QIEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIQMYHSEKLAIAYGLLNTLEQ 662

Query: 661 TPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           TPLQIVQSHRIC DCH VIKLIAMITKREIVIRDASRFHHFRDG+CSCGDYW
Sbjct: 663 TPLQIVQSHRICSDCHFVIKLIAMITKREIVIRDASRFHHFRDGSCSCGDYW 704

BLAST of Tan0015212 vs. ExPASy TrEMBL
Match: A0A6J1BWH3 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111006324 PE=3 SV=1)

HSP 1 Score: 1330.5 bits (3442), Expect = 0.0e+00
Identity = 634/712 (89.04%), Postives = 679/712 (95.37%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           MEVPL RYQNY+YDRLQCSSTSSS+SY  VRF+DS LFRKRSLLS Y+LWSNRRK  NSF
Sbjct: 3   MEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRNSF 62

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKK-THIRKSGVGICSQIEKLVLCKKYR 120
           CW+KCSSLEQGL PRP+P+PSKID D+RKGTSS + T IRKSGVGICSQIEKLVLCKKYR
Sbjct: 63  CWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIRKSGVGICSQIEKLVLCKKYR 122

Query: 121 DALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRV 180
           DALEMFE FELEGGYD+GNST+DALINACIGLKS+RGVKRLCNYMID+G EPDQYM+NR+
Sbjct: 123 DALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKNRI 182

Query: 181 LLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGP 240
           LLMHVKCGMMIDACRLFDEMPERNAVSW+TIISGYVDSGNY EAFRLFIMMWEE SD GP
Sbjct: 183 LLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDSGP 242

Query: 241 RTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE 300
           RTFA MIRASAGLELIFPGRQLHSCA+KAGVGQ+IFVSCALIDMYSKCGSLEDAHCVFDE
Sbjct: 243 RTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVFDE 302

Query: 301 MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARA 360
           MPDKTIVGWNSIIAGYALHGYSEEAL+LCYEMRDSG+KMDHFTFSIIIRICSRLASVARA
Sbjct: 303 MPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVARA 362

Query: 361 KQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNH 420
           KQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+FDRMS KN+ISWNALIAGYGNH
Sbjct: 363 KQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYGNH 422

Query: 421 GRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMH 480
           GRGEEAI+MFE+MLREGM PNHVTFL+VLSACSISGLFERGWEIFQ++T DHKIKPRAMH
Sbjct: 423 GRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRAMH 482

Query: 481 YACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGME 540
           +AC+IELLGREGLLDEAYALIR APF+PTANMWAALLRACRVHENLELGK AAE LYGME
Sbjct: 483 FACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYGME 542

Query: 541 PEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 600
           P+KLSNYIVLLNIYNSSGKLKEAA+VV+TLKRKGLRM+PACSWIEV NQPH+FLSGDKHH
Sbjct: 543 PDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDKHH 602

Query: 601 TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKR 660
            EIEKVVEKVDE+MLKISKLG+V EQNFLL DVDE EEKI MYHSEKLAIAYGL++TLK+
Sbjct: 603 AEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTLKK 662

Query: 661 TPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           TPLQIVQSHRICGDCHS IKLIA+IT+REIV+RDASRFHHFRDG+CSCGDYW
Sbjct: 663 TPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW 714

BLAST of Tan0015212 vs. ExPASy TrEMBL
Match: A0A6J1JGW0 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111486894 PE=3 SV=1)

HSP 1 Score: 1287.3 bits (3330), Expect = 0.0e+00
Identity = 623/711 (87.62%), Postives = 663/711 (93.25%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           MEVPL  YQNY++D L+ +S SSSTSYFS  FS S+LFR RSLLS YSLWSNRRK  NSF
Sbjct: 1   MEVPL--YQNYVHDHLRRTSLSSSTSYFSHHFSGSELFRDRSLLSAYSLWSNRRKLRNSF 60

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIEKLVLCKKYRD 120
           CWVKCSSLEQGL PR KPKPSK+D D+RKGT SK+T I KS V IC  IEKLVLC K+RD
Sbjct: 61  CWVKCSSLEQGLRPRLKPKPSKVDRDVRKGTPSKETRITKSSVRICCHIEKLVLCNKFRD 120

Query: 121 ALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVL 180
           ALEMFE  ELEGGYDVGNSTFDALI ACIGLKS+RG KRLC YMID+GIEPDQY+ NR+L
Sbjct: 121 ALEMFEILELEGGYDVGNSTFDALIIACIGLKSIRGAKRLCAYMIDNGIEPDQYIMNRIL 180

Query: 181 LMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPR 240
           LMHV+CGMMIDA +LFDEMPERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEEY  C PR
Sbjct: 181 LMHVRCGMMIDASKLFDEMPERNAVSWNTIISGYVDSGNYKEAFRLFIMMWEEYPGCSPR 240

Query: 241 TFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM 300
           TFAT+IRASAGLELIFPG+QLHSCAVKAGVGQ+IFVSCALIDMYSKCG LEDAHCVFDEM
Sbjct: 241 TFATVIRASAGLELIFPGKQLHSCAVKAGVGQDIFVSCALIDMYSKCGGLEDAHCVFDEM 300

Query: 301 PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAK 360
           PDKTIVGWNSIIAGYALHG+SEEALNL ++MRDSGVK+DHFTFSIIIRICSRLASV RAK
Sbjct: 301 PDKTIVGWNSIIAGYALHGHSEEALNLYFQMRDSGVKIDHFTFSIIIRICSRLASVTRAK 360

Query: 361 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHG 420
           QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDRMSCKN+ISWNALIAGYGNHG
Sbjct: 361 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHIFDRMSCKNLISWNALIAGYGNHG 420

Query: 421 RGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHY 480
           RGEEAIE+FE+MLREGM+PNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHKIK RAMHY
Sbjct: 421 RGEEAIEIFERMLREGMVPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKLRAMHY 480

Query: 481 ACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEP 540
            C+IELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGK+AAEKLYGMEP
Sbjct: 481 TCMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKYAAEKLYGMEP 540

Query: 541 EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT 600
           EKL NYIVLLNIY SSGKLKEAA+VVRTLKRKGL MLPACSWIEV +QPHAFLSGDKHH 
Sbjct: 541 EKLRNYIVLLNIYKSSGKLKEAADVVRTLKRKGLSMLPACSWIEVKHQPHAFLSGDKHHP 600

Query: 601 EIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKRT 660
           EIEKVVEKVDELML+ISKLG+VPEQN LL DVD HEEKIQ+YHSEKLAIAYGLINTLK+T
Sbjct: 601 EIEKVVEKVDELMLEISKLGYVPEQNILLPDVD-HEEKIQIYHSEKLAIAYGLINTLKQT 660

Query: 661 PLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           PLQIVQ HR+CGDCHSVIKLIAMITKREIV+RDASRFHHFRDG CSCGDYW
Sbjct: 661 PLQIVQGHRVCGDCHSVIKLIAMITKREIVVRDASRFHHFRDGRCSCGDYW 708

BLAST of Tan0015212 vs. ExPASy TrEMBL
Match: A0A1S3C9W7 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498490 PE=3 SV=1)

HSP 1 Score: 1284.2 bits (3322), Expect = 0.0e+00
Identity = 623/712 (87.50%), Postives = 664/712 (93.26%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           ME+PLSRYQNY+YDRLQC     ST YFS+R+SDS LF K S L      SNRRK  NSF
Sbjct: 3   MELPLSRYQNYVYDRLQC----YSTPYFSLRYSDSHLFMKTSFL------SNRRKCRNSF 62

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIEKLVLCKKYRD 120
           CWVKCSS EQGL PRP+PKPSK+D  +RK    K+T +RKS VGICSQIEKLVLCK+YRD
Sbjct: 63  CWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRD 122

Query: 121 ALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVL 180
           ALEMFE FELE G+ VGNST+DALINACIGLKS+RGVKRL NYM+D+G EPDQYMRNRVL
Sbjct: 123 ALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVL 182

Query: 181 LMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPR 240
           LMHVKCGMMIDACRLFDEMPERNAVSW+TIISGYVDSGNY EAFRLFI+MWEE   CGPR
Sbjct: 183 LMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPR 242

Query: 241 TFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEM 300
           T ATMIRASAGLE+IF GRQLHSCA+KAG+GQ+IFVSCALIDMYSKCGSLEDAHCVFDEM
Sbjct: 243 TLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEM 302

Query: 301 PDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAK 360
           PDKTIVGWNSIIAGYALHGYSEEAL+L +EM  SGVKMDHFTFSIIIRICSRLASVARAK
Sbjct: 303 PDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAK 362

Query: 361 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHG 420
           QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSC+NVISWNALIAGYGNHG
Sbjct: 363 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHG 422

Query: 421 RGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHY 480
           RGEEAI+MFEKMLREGMMPNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHK++PRAMH+
Sbjct: 423 RGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHF 482

Query: 481 ACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEP 540
           AC+IELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGMEP
Sbjct: 483 ACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGMEP 542

Query: 541 EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHT 600
           EKLSNYIVLLNIYN+SGKLKEAA+VV+TLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 
Sbjct: 543 EKLSNYIVLLNIYNTSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHV 602

Query: 601 EIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKR 660
           ++EKVV KVDELMLKISKLG+VP EQNF+L DVDEHEEKI+MYHSEKLAIAYGL+NTL+R
Sbjct: 603 QLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLER 662

Query: 661 TPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           TPLQIVQSHRIC DCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
Sbjct: 663 TPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 704

BLAST of Tan0015212 vs. ExPASy TrEMBL
Match: A0A0A0KXD9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G636580 PE=3 SV=1)

HSP 1 Score: 1283.5 bits (3320), Expect = 0.0e+00
Identity = 622/714 (87.11%), Postives = 666/714 (93.28%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           ME+PLSRYQNY+YDRLQC    +STS+FS+R+SDSDLF K S L      SN RK  NSF
Sbjct: 3   MELPLSRYQNYVYDRLQC----NSTSFFSLRYSDSDLFTKTSFL------SNPRKYRNSF 62

Query: 61  CWVKCSSLEQGLHPRPK--PKPSKIDPDIRKGTSSKKTHIRKSGVGICSQIEKLVLCKKY 120
           CW+KCSS EQGL PRP+  PKPSK+D   RK T  K+TH++KS VGICSQIEKLVLCKKY
Sbjct: 63  CWIKCSSFEQGLRPRPRPQPKPSKLDVGDRKETPLKETHVKKSSVGICSQIEKLVLCKKY 122

Query: 121 RDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNR 180
           RDALEMFE FELE G+ VG ST+DALINACIGLKS+RGVKRLCNYM+D+G EPDQYMRNR
Sbjct: 123 RDALEMFEIFELEDGFHVGYSTYDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNR 182

Query: 181 VLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCG 240
           VLLMHVKCGMMIDACRLFDEMP RNAVSW TIISGYVDSGNY EAFRLFI+M EE+ DCG
Sbjct: 183 VLLMHVKCGMMIDACRLFDEMPARNAVSWGTIISGYVDSGNYVEAFRLFILMREEFYDCG 242

Query: 241 PRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD 300
           PRTFATMIRASAGLE+IFPGRQLHSCA+KAG+GQ+IFVSCALIDMYSKCGSLEDAHCVFD
Sbjct: 243 PRTFATMIRASAGLEIIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFD 302

Query: 301 EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVAR 360
           EMPDKTIVGWNSIIAGYALHGYSEEAL+L +EMRDSGVKMDHFTFSIIIRICSRLASVAR
Sbjct: 303 EMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMRDSGVKMDHFTFSIIIRICSRLASVAR 362

Query: 361 AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGN 420
           AKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSC+N+ISWNALIAGYGN
Sbjct: 363 AKQVHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNIISWNALIAGYGN 422

Query: 421 HGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAM 480
           HG GEEAI+MFEKMLREGMMPNHVTFL+VLSACSISGLFERGWEIFQ+MTRDHK+KPRAM
Sbjct: 423 HGHGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVKPRAM 482

Query: 481 HYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGM 540
           H+AC+IELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVH NLELGKFAAEKLYGM
Sbjct: 483 HFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGM 542

Query: 541 EPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKH 600
           EPEKLSNYIVLLNIYNSSGKLKEAA+V +TLKRKGLRMLPACSWIEVNNQPHAFLSGDKH
Sbjct: 543 EPEKLSNYIVLLNIYNSSGKLKEAADVFQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKH 602

Query: 601 HTEIEKVVEKVDELMLKISKLGHVP-EQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTL 660
           H +IEKVV KVDELML ISKLG+VP EQNF+L DVDE+EEKI+MYHSEKLAIAYGL+NTL
Sbjct: 603 HVQIEKVVGKVDELMLNISKLGYVPEEQNFMLPDVDENEEKIRMYHSEKLAIAYGLLNTL 662

Query: 661 KRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           ++TPLQIVQSHRIC DCHSVIKLIAMITKREIVIRDASRFHHFRDG+CSCGDYW
Sbjct: 663 EKTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGSCSCGDYW 706

BLAST of Tan0015212 vs. ExPASy TrEMBL
Match: A0A6J1GSZ5 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111457149 PE=3 SV=1)

HSP 1 Score: 1273.1 bits (3293), Expect = 0.0e+00
Identity = 618/712 (86.80%), Postives = 658/712 (92.42%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           MEVPL  YQNY++D LQ +S SSSTSYFS  FS S+LFR RSLLS YSLWSN RK  NSF
Sbjct: 1   MEVPL--YQNYVHDHLQRTSLSSSTSYFSHHFSGSELFRDRSLLSAYSLWSNGRKLRNSF 60

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGT-SSKKTHIRKSGVGICSQIEKLVLCKKYR 120
           CWVKCSSLEQGL PR KPKPSK++ D+RKGT  SK+T I KS V IC  IEKLVLC K+R
Sbjct: 61  CWVKCSSLEQGLRPRLKPKPSKVERDVRKGTPPSKETRITKSSVRICCHIEKLVLCNKFR 120

Query: 121 DALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRV 180
           DALEMFE  ELEGGYDVGNSTFDALINACIGLKS+RG KRLC YMID+GIEPDQY+ NR+
Sbjct: 121 DALEMFEILELEGGYDVGNSTFDALINACIGLKSIRGAKRLCAYMIDNGIEPDQYIMNRI 180

Query: 181 LLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGP 240
           LLMHV+CGMMIDA +LFDEMPERNAVSWNTIISGYVDSGNY+EAFRLFIMMWEEY  C P
Sbjct: 181 LLMHVRCGMMIDASKLFDEMPERNAVSWNTIISGYVDSGNYKEAFRLFIMMWEEYPGCSP 240

Query: 241 RTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDE 300
           RTFAT+IRASAGLELIFPGRQLHSCAVKAGVGQ+IFVSCALIDMYSKCG LEDAHCVFDE
Sbjct: 241 RTFATVIRASAGLELIFPGRQLHSCAVKAGVGQDIFVSCALIDMYSKCGGLEDAHCVFDE 300

Query: 301 MPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARA 360
           MPDKTIVGWNSIIAGYALHGYSEEALNL Y+MRDSGVK+DHFTFSIIIRICSRLASV RA
Sbjct: 301 MPDKTIVGWNSIIAGYALHGYSEEALNLYYQMRDSGVKIDHFTFSIIIRICSRLASVTRA 360

Query: 361 KQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNH 420
           KQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDRMSCKN+ISWNALIAGYGNH
Sbjct: 361 KQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHIFDRMSCKNLISWNALIAGYGNH 420

Query: 421 GRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMH 480
           GRGEEAIE+FE+MLREGM+PNHVTFL+VLSACSISGLFERGWEIFQ++TRDHK+K RAMH
Sbjct: 421 GRGEEAIEIFERMLREGMVPNHVTFLAVLSACSISGLFERGWEIFQSVTRDHKMKLRAMH 480

Query: 481 YACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGME 540
           Y C+IELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGK+ AEKLYGME
Sbjct: 481 YTCMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKYVAEKLYGME 540

Query: 541 PEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH 600
           PEKL NYIVLLNIY SSGKLKEAA+VV+TLKRKGL MLPACSWIEV +QPHAF SGDK H
Sbjct: 541 PEKLRNYIVLLNIYKSSGKLKEAADVVQTLKRKGLSMLPACSWIEVKHQPHAFQSGDKRH 600

Query: 601 TEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQMYHSEKLAIAYGLINTLKR 660
            EIEKVVEKVDELML+ISKLG+VPE+N LL DVD HEEKIQ+YHSEKLAIAYGLINTL  
Sbjct: 601 PEIEKVVEKVDELMLEISKLGYVPERNILLPDVD-HEEKIQIYHSEKLAIAYGLINTLNH 660

Query: 661 TPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           TPLQIVQ HR+CGDCHSVIKLIAMITKREIV+RDASRFHHFRDG CSCGDYW
Sbjct: 661 TPLQIVQGHRVCGDCHSVIKLIAMITKREIVVRDASRFHHFRDGRCSCGDYW 709

BLAST of Tan0015212 vs. TAIR 10
Match: AT5G50390.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 882.1 bits (2278), Expect = 3.0e-256
Identity = 429/718 (59.75%), Postives = 540/718 (75.21%), Query Frame = 0

Query: 1   MEVPLSRYQNYLYDRLQCSSTSSSTSYFSVRFSDSDLFRKRSLLSGYSLWSNRRKSLNSF 60
           ME+PLSRYQ+   D ++ SS++     F  +FS          L G       R+  N F
Sbjct: 1   MEIPLSRYQSIRLDEIRDSSSNPKVLTFPRKFS----------LRG-------RRWKNPF 60

Query: 61  CWVKCSSLEQGLHPRPKPKPSKIDPDIRKGTSS--KKTHIRKSGVGICSQIEKLVLCKKY 120
             + CSS+ QGL P+PK KP  I  ++++        T I KSGV ICSQIEKLVLC ++
Sbjct: 61  GRLSCSSVVQGLKPKPKLKPEPIRIEVKESKDQILDDTQISKSGVTICSQIEKLVLCNRF 120

Query: 121 RDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNR 180
           R+A E+FE  E+   + VG ST+DAL+ ACI LKS+R VKR+  +M+ +G EP+QYM NR
Sbjct: 121 REAFELFEILEIRCSFKVGVSTYDALVEACIRLKSIRCVKRVYGFMMSNGFEPEQYMMNR 180

Query: 181 VLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCG 240
           +LLMHVKCGM+IDA RLFDE+PERN  S+ +IISG+V+ GNY EAF LF MMWEE SDC 
Sbjct: 181 ILLMHVKCGMIIDARRLFDEIPERNLYSYYSIISGFVNFGNYVEAFELFKMMWEELSDCE 240

Query: 241 PRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFD 300
             TFA M+RASAGL  I+ G+QLH CA+K GV  N FVSC LIDMYSKCG +EDA C F+
Sbjct: 241 THTFAVMLRASAGLGSIYVGKQLHVCALKLGVVDNTFVSCGLIDMYSKCGDIEDARCAFE 300

Query: 301 EMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVAR 360
            MP+KT V WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +  
Sbjct: 301 CMPEKTTVAWNNVIAGYALHGYSEEALCLLYDMRDSGVSIDQFTLSIMIRISTKLAKLEL 360

Query: 361 AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGN 420
            KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  KN+ISWNAL+ GY N
Sbjct: 361 TKQAHASLIRNGFESEIVANTALVDFYSKWGRVDTARYVFDKLPRKNIISWNALMGGYAN 420

Query: 421 HGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAM 480
           HGRG +A+++FEKM+   + PNHVTFL+VLSAC+ SGL E+GWEIF +M+  H IKPRAM
Sbjct: 421 HGRGTDAVKLFEKMIAANVAPNHVTFLAVLSACAYSGLSEQGWEIFLSMSEVHGIKPRAM 480

Query: 481 HYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGM 540
           HYAC+IELLGR+GLLDEA A IR+AP + T NMWAALL ACR+ ENLELG+  AEKLYGM
Sbjct: 481 HYACMIELLGRDGLLDEAIAFIRRAPLKTTVNMWAALLNACRMQENLELGRVVAEKLYGM 540

Query: 541 EPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDK- 600
            PEKL NY+V+ N+YNS GK  EAA V+ TL+ KGL M+PAC+W+EV +Q H+FLSGD+ 
Sbjct: 541 GPEKLGNYVVMYNMYNSMGKTAEAAGVLETLESKGLSMMPACTWVEVGDQTHSFLSGDRF 600

Query: 601 ---HHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDE-HEEKIQMYHSEKLAIAYGL 660
              + T   ++ +KVDELM +IS+ G+  E+  LL DVDE  EE++  YHSEKLAIAYGL
Sbjct: 601 DSYNETVKRQIYQKVDELMEEISEYGYSEEEQHLLPDVDEKEEERVGRYHSEKLAIAYGL 660

Query: 661 INTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           +NT +  PLQI Q+HRIC +CH V++ I+++T RE+V+RDASRFHHF++G CSCG YW
Sbjct: 661 VNTPEWNPLQITQNHRICKNCHKVVEFISLVTGREMVVRDASRFHHFKEGKCSCGGYW 701

BLAST of Tan0015212 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 468.0 bits (1203), Expect = 1.3e-131
Identity = 226/611 (36.99%), Postives = 366/611 (59.90%), Query Frame = 0

Query: 138 NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVK---CGMMIDACR 197
           ++ F +++ +C  +  +R  + +  +++  G++ D Y  N ++ M+ K    G  I    
Sbjct: 105 HNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVGN 164

Query: 198 LFDEMPER---------------------------------NAVSWNTIISGYVDSGNYE 257
           +FDEMP+R                                 + VS+NTII+GY  SG YE
Sbjct: 165 VFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYE 224

Query: 258 EAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALI 317
           +A R+   M          T ++++   +    +  G+++H   ++ G+  ++++  +L+
Sbjct: 225 DALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLV 284

Query: 318 DMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHF 377
           DMY+K   +ED+  VF  +  +  + WNS++AGY  +G   EAL L  +M  + VK    
Sbjct: 285 DMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAV 344

Query: 378 TFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM 437
            FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Sbjct: 345 AFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM 404

Query: 438 SCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGW 497
           +  + +SW A+I G+  HG G EA+ +FE+M R+G+ PN V F++VL+ACS  GL +  W
Sbjct: 405 NVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAW 464

Query: 498 EIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV 557
             F +MT+ + +     HYA + +LLGR G L+EAY  I K   +PT ++W+ LL +C V
Sbjct: 465 GYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSV 524

Query: 558 HENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACS 617
           H+NLEL +  AEK++ ++ E +  Y+++ N+Y S+G+ KE A++   +++KGLR  PACS
Sbjct: 525 HKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACS 584

Query: 618 WIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVD-EHEEKIQ 677
           WIE+ N+ H F+SGD+ H  ++K+ E +  +M ++ K G+V + + +L DVD EH+ ++ 
Sbjct: 585 WIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELL 644

Query: 678 MYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHF 712
             HSE+LA+A+G+INT   T +++ ++ RIC DCH  IK I+ IT+REI++RD SRFHHF
Sbjct: 645 FGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHF 704

BLAST of Tan0015212 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 464.2 bits (1193), Expect = 1.9e-130
Identity = 219/578 (37.89%), Postives = 356/578 (61.59%), Query Frame = 0

Query: 129 ELEGGYDVGNSTF-DALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCG 188
           +LEG Y   +  F + L+  C   K +   + +  +++      D  M N +L M+ KCG
Sbjct: 50  DLEGSYIPADRRFYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCG 109

Query: 189 MMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIR 248
            + +A ++F++MP+R+ V+W T+ISGY       +A   F  M          T +++I+
Sbjct: 110 SLEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIK 169

Query: 249 ASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVG 308
           A+A       G QLH   VK G   N+ V  AL+D+Y++ G ++DA  VFD +  +  V 
Sbjct: 170 AAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVS 229

Query: 309 WNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLV 368
           WN++IAG+A    +E+AL L   M   G +  HF+++ +   CS    + + K  HA ++
Sbjct: 230 WNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMI 289

Query: 369 RNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIE 428
           ++G  L   A   L+D Y+K G + DAR +FDR++ ++V+SWN+L+  Y  HG G+EA+ 
Sbjct: 290 KSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVW 349

Query: 429 MFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELL 488
            FE+M R G+ PN ++FLSVL+ACS SGL + GW  ++ M +D  I P A HY  +++LL
Sbjct: 350 WFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKD-GIVPEAWHYVTVVDLL 409

Query: 489 GREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYI 548
           GR G L+ A   I + P +PTA +W ALL ACR+H+N ELG +AAE ++ ++P+    ++
Sbjct: 410 GRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHV 469

Query: 549 VLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVE 608
           +L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F++ D+ H + E++  
Sbjct: 470 ILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIAR 529

Query: 609 KVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQM-YHSEKLAIAYGLINTLKRTPLQIVQ 668
           K +E++ KI +LG+VP+ + ++  VD+ E ++ + YHSEK+A+A+ L+NT   + + I +
Sbjct: 530 KWEEVLAKIKELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKK 589

Query: 669 SHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGN 705
           + R+CGDCH+ IKL + +  REI++RD +RFHHF+D +
Sbjct: 590 NIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDAS 626

BLAST of Tan0015212 vs. TAIR 10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 463.0 bits (1190), Expect = 4.3e-130
Identity = 224/575 (38.96%), Postives = 357/575 (62.09%), Query Frame = 0

Query: 138 NSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRNRVLLMHVKCGMMIDACRLFD 197
           ++T+  LI  CI  ++V     +C ++  +G  P  ++ N ++ M+VK  ++ DA +LFD
Sbjct: 61  SATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDAHQLFD 120

Query: 198 EMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDCGPRTFATMIRASAGLELIFP 257
           +MP+RN +SW T+IS Y     +++A  L ++M  +       T+++++R+  G+  +  
Sbjct: 121 QMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCNGMSDV-- 180

Query: 258 GRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL 317
            R LH   +K G+  ++FV  ALID+++K G  EDA  VFDEM     + WNSII G+A 
Sbjct: 181 -RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSIIGGFAQ 240

Query: 318 HGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVA 377
           +  S+ AL L   M+ +G   +  T + ++R C+ LA +    QAH  +V+  +  D++ 
Sbjct: 241 NSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVK--YDQDLIL 300

Query: 378 NTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYGNHGRGEEAIEMFEKMLREGM 437
           N ALVD Y K G ++DA  VF++M  ++VI+W+ +I+G   +G  +EA+++FE+M   G 
Sbjct: 301 NNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERMKSSGT 360

Query: 438 MPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRAMHYACLIELLGREGLLDEAY 497
            PN++T + VL ACS +GL E GW  F++M + + I P   HY C+I+LLG+ G LD+A 
Sbjct: 361 KPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDAV 420

Query: 498 ALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSG 557
            L+ +   +P A  W  LL ACRV  N+ L ++AA+K+  ++PE    Y +L NIY +S 
Sbjct: 421 KLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANSQ 480

Query: 558 KLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHTEIEKVVEKVDELMLKIS 617
           K     E+   ++ +G++  P CSWIEVN Q HAF+ GD  H +I +V +K+++L+ +++
Sbjct: 481 KWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQLIHRLT 540

Query: 618 KLGHVPEQNFLLSDVD-EHEEKIQMYHSEKLAIAYGLINTLKRTPLQIVQSHRICGDCHS 677
            +G+VPE NF+L D++ E  E    +HSEKLA+A+GL+       ++I ++ RICGDCH 
Sbjct: 541 GIGYVPETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRIRKNLRICGDCHV 600

Query: 678 VIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
             KL + +  R IVIRD  R+HHF+DG CSCGDYW
Sbjct: 601 FCKLASKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of Tan0015212 vs. TAIR 10
Match: AT3G02010.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 460.7 bits (1184), Expect = 2.2e-129
Identity = 228/597 (38.19%), Postives = 362/597 (60.64%), Query Frame = 0

Query: 118 YRDALEMFEFFELEGGYDVGNSTFDALINACIGLKSVRGVKRLCNYMIDHGIEPDQYMRN 177
           Y +++ +F     + G+   + TF  ++ A +GL      ++L    +  G   D  + N
Sbjct: 231 YTESIHLFLKMR-QSGHQPSDFTFSGVLKAVVGLHDFALGQQLHALSVTTGFSRDASVGN 290

Query: 178 RVLLMHVKCGMMIDACRLFDEMPERNAVSWNTIISGYVDSGNYEEAFRLFIMMWEEYSDC 237
           ++L  + K   +++   LFDEMPE + VS+N +IS Y  +  YE +   F  M     D 
Sbjct: 291 QILDFYSKHDRVLETRMLFDEMPELDFVSYNVVISSYSQADQYEASLHFFREMQCMGFDR 350

Query: 238 GPRTFATMIRASAGLELIFPGRQLHSCAVKAGVGQNIFVSCALIDMYSKCGSLEDAHCVF 297
               FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F
Sbjct: 351 RNFPFATMLSIAANLSSLQMGRQLHCQALLATADSILHVGNSLVDMYAKCEMFEEAELIF 410

Query: 298 DEMPDKTIVGWNSIIAGYALHGYSEEALNLCYEMRDSGVKMDHFTFSIIIRICSRLASVA 357
             +P +T V W ++I+GY   G     L L  +MR S ++ D  TF+ +++  +  AS+ 
Sbjct: 411 KSLPQRTTVSWTALISGYVQKGLHGAGLKLFTKMRGSNLRADQSTFATVLKASASFASLL 470

Query: 358 RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCKNVISWNALIAGYG 417
             KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  +N +SWNALI+ + 
Sbjct: 471 LGKQLHAFIIRSGNLENVFSGSGLVDMYAKCGSIKDAVQVFEEMPDRNAVSWNALISAHA 530

Query: 418 NHGRGEEAIEMFEKMLREGMMPNHVTFLSVLSACSISGLFERGWEIFQTMTRDHKIKPRA 477
           ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ M+  + I P+ 
Sbjct: 531 DNGDGEAAIGAFAKMIESGLQPDSVSILGVLTACSHCGFVEQGTEYFQAMSPIYGITPKK 590

Query: 478 MHYACLIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKFAAEKLYG 537
            HYAC+++LLGR G   EA  L+ + PF+P   MW+++L ACR+H+N  L + AAEKL+ 
Sbjct: 591 KHYACMLDLLGRNGRFAEAEKLMDEMPFEPDEIMWSSVLNACRIHKNQSLAERAAEKLFS 650

Query: 538 MEP-EKLSNYIVLLNIYNSSGKLKEAAEVVRTLKRKGLRMLPACSWIEVNNQPHAFLSGD 597
           ME     + Y+ + NIY ++G+ ++  +V + ++ +G++ +PA SW+EVN++ H F S D
Sbjct: 651 MEKLRDAAAYVSMSNIYAAAGEWEKVRDVKKAMRERGIKKVPAYSWVEVNHKIHVFSSND 710

Query: 598 KHHTEIEKVVEKVDELMLKISKLGHVPEQNFLLSDVDEHEEKIQ--MYHSEKLAIAYGLI 657
           + H   +++V K++EL  +I + G+ P+ + ++ DVDE + KI+   YHSE+LA+A+ LI
Sbjct: 711 QTHPNGDEIVRKINELTAEIEREGYKPDTSSVVQDVDE-QMKIESLKYHSERLAVAFALI 770

Query: 658 NTLKRTPLQIVQSHRICGDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 712
           +T +  P+ ++++ R C DCH+ IKLI+ I KREI +RD SRFHHF +G CSCGDYW
Sbjct: 771 STPEGCPIVVMKNLRACRDCHAAIKLISKIVKREITVRDTSRFHHFSEGVCSCGDYW 825

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FK334.2e-25559.75Pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Arabidop... [more]
Q9LIQ71.1e-13538.80Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop... [more]
Q9LW631.9e-13036.99Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Q9SI536.1e-12938.96Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop... [more]
Q9S7F43.0e-12838.19Putative pentatricopeptide repeat-containing protein At2g01510 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
XP_022133879.10.0e+0089.04pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica ... [more]
XP_022989822.10.0e+0087.62pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucurbita ... [more]
XP_008459324.10.0e+0087.50PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic ... [more]
XP_004148701.10.0e+0087.11pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 ... [more]
XP_038890388.10.0e+0086.94pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 ... [more]
Match NameE-valueIdentityDescription
A0A6J1BWH30.0e+0089.04pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Momordic... [more]
A0A6J1JGW00.0e+0087.62pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucurbit... [more]
A0A1S3C9W70.0e+0087.50pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucumis ... [more]
A0A0A0KXD90.0e+0087.11DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G6365... [more]
A0A6J1GSZ50.0e+0086.80pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT5G50390.13.0e-25659.75Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT3G23330.11.3e-13136.99Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G24000.11.9e-13037.89Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G03880.14.3e-13038.96Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G02010.12.2e-12938.19Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 304..350
e-value: 8.7E-11
score: 41.9
coord: 404..451
e-value: 2.0E-13
score: 50.3
coord: 203..248
e-value: 1.6E-8
score: 34.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 279..302
e-value: 6.8E-4
score: 17.7
coord: 308..339
e-value: 4.6E-5
score: 21.3
coord: 205..232
e-value: 4.4E-6
score: 24.5
coord: 379..405
e-value: 2.2E-4
score: 19.2
coord: 140..172
e-value: 0.0022
score: 16.1
coord: 407..440
e-value: 4.1E-9
score: 34.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 279..303
e-value: 0.0012
score: 19.0
coord: 546..574
e-value: 0.59
score: 10.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 203..233
score: 10.446177
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 405..439
score: 13.59207
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 304..338
score: 10.007725
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 273..303
score: 8.61564
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 258..364
e-value: 1.2E-22
score: 82.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 423..683
e-value: 2.8E-27
score: 97.9
coord: 104..255
e-value: 1.8E-26
score: 95.3
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 579..701
e-value: 2.0E-36
score: 124.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 75..92
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 71..92
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 1..705
NoneNo IPR availablePANTHERPTHR47924:SF36SUBFAMILY NOT NAMEDcoord: 1..705

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0015212.1Tan0015212.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding