Tan0011958 (gene) Snake gourd v1

Overview
Name: Tan0011958
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG05: 77966915 .. 77971946 (+)
RNA-Seq Expression: Tan0011958
Synteny: Tan0011958
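
The Location field can be split into linkage group, coordinates, and strand. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Parse a location such as 'LG05: 77966915 .. 77971946 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }

loc = parse_location("LG05: 77966915 .. 77971946 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 5032 bp on the plus strand of LG05.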
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGATGTTCCAATTTTATTTGAATTTGAAAGTGTGAGTGTGCTTGTGTTTTATTTTAGATTTGGAAGGGTTAAAAAGAAGAGTTGGGAGGGATAGGTTGGTCATTTTAACTATTTTAGTGTCAAAAATAAGAGAAACAAAGAAAATAGAAAGTTAAAAAAAAAAAAAACCCACCGACCTTTTCCTTTTGTTCTTCGCGGGCACACCGGAAGGTAAGGGGGGTTTAGGGTTTAAGGGATCACTCGACGGTGCACGGAGGGGAAGGGACCGACAAACGATCCGAGTCAGCGGCGGTTTCCGGCAGATGTTTCTCCCTCTCCTTCGCGAATTTCTTTTTCTCTCTCTTGCGCTCGAGCAAAACTCCGGACGCCATCCACTCCTTTGGGTCAGCGGCGGCATCTGCAGCAAGCGGTGGCCGATTGTAGTTGACGACTCCGATAGATCTGATTGACGTCGACACCCTGCGACGCCATTCCCTTGCATCAGATCTGTGCCATTGACACGAGCTTCGACTACAACGGCGATCTCACCGCTCGCGGACGATTCTTCTTCAGTGACAAACGCAACCTTCGGCAAAGACCCACGCGTAATTGACTCCAGATCTGCAGTACCCAGTTTTTTTTCGGGTTGGATCGGTACACAAACGACTTTATCGCCTTTGTTTGAGCTAAGAATCGAATACTCTTAGCCTTACAATGTCAGATCTGCGAGGTAATGTTTTTTTACTGCTTGGTTCTTGGTCGGTTTTGGTGTTTTGAAGATAACCCACAGTAGTTCTAACTGTTTCTGATTACCCATAACATTTGGGTATTGGGTTTGCGGAATCAACCCATTCGGAGTCAAATTCAGCCAGGATTCAAATTAAAAAACTAAGAAGTGGTTGGGCGAGCGAGAGCGAAGTTCGCCAGGTAGTCACTCTTGAGGTAAGTAGCCTAATTAGATAGCCGTTTTATGCTTCTTTTTAGTATATGCTTGAGCATGATCGACATTGCGTCTATGCATAGTAAATCTCTCGAGTAGGAAGCTCACCCTATTATATCTTTGCACAACGTGGCTATTGGACTAAATGGTTAAGTAGTGTCTGCTTGAGATGTTTGGAAGTTTTTCCGAGTTCAGTGTGCCTGAAATGGTCTAGACTCTTGGATTCCTTTGGTTGGTGTTTGTTTTTTCGTGACAATGTGAAGGACTTCCTCATTGATGTCCTACTGGACATCCTTTTAAGAAAACAAAGTTTCACTTTGGTTGAACATTACTAGAGCCTTTTTTGGGTTCTTTGGAAAGAAAGAAATCAAAGTCTCTTCAAAGAAAAAGAGAGTACTTTCGAATCTTTCTTGATCTTGTTATCTAATATGCTTTGTCTTGGTGTAAATTGTCTGCTCACTTTGCTTCGTACACATATGCTTTCTTTTTAGCTAGTTGGGATAGCCTTTTGTAATATCCGTGGATTATCTATCCTTTTTGTAAATTTCATACATCAATGAAATTGCTTCAATGCTTCTTATAAAAAAAATGTGGTCATGCTTCCTAGGGTTGAGATGTGTGATTGGCATGAGGGTTGCTTTCTCTTTTGCACGTGCTAGCCTAGCAGGCGATTCCCCCACTATGGTTAGAATGAAGGTTGCTTTCCTTCTTGTTGTATGCTAGTTTAGGGGTGATTCCCCACTGTATTGAGTGTGAAGGTTGTTTTCCTTCTTGTTGATAGCCAGCTTAGAGGCAATTCCTTACTGTATTTAGAATGGAAGTTGCTTTCCTTTCTGTTGTATGCTTGCTAGGGGCAATTTCCCCCCCGTATTTAGTATGTAGATTGCTTTTCTTCGTGTTGTGTGCTAGCTTAAGGGTGAATCCCACCGTGTTTAGTTCGGAGGCTGCTTTCCTTCGTGTTGTGTGTTGGCTTAGGGGCGAATCCCCACCATGTTTAGTTTTAAGATTGTTTTCTTTTATGTTGTGTGCTAACTTAGAGTCGAAATCCTATTGTTTTGGGCTTGGGGTTGCACTTCTTCTT
TGTGTTAGGCTAGCTTAGAGGAGGATGGTACCTTGCAGGTAGGTTGCATGACTCATAGTTATAATTTAAGTGAAGTGCACAGGTGCTTAGTCGAGTATGAGACCATTTGAGAATGAGGAAGAGTTAGTTAGCCGGGGCAAGATCGTTCCTTGTTTAAGTCAACACTTTGGTTTTTTTTTTCTAAAAGTGAGTTGCTTATTGAGTATGCTATTTTGCTTTCTGTCTTTAAGGTGATGTGCAACCATGAGGCGGATGATAGAAGAGATGATCGACCAAGGCTATGGAGACGTCATGAGAGTTTCTCCCTAGGGTCATAGTTTAGTTTTGTCCTTGTCAAGTAACGTGGGGTAGATTTGTTTAATTGCAGTTTCCTTTGTGTTCTCTTTTGTTCAGTGGTTTCTCTTAGAGCTAAAGTCTGTGGTTGGCCAAACTCCGTTTTTATTCGTAGTTTGGTGGATGTTGAGCGGTTCTTCTGAGATATAGAAGTATTTTTGTACCTTTCAGTCCCTTGTAAACTTTTGGTTTCCTTGTAATTATATTGTTATTTTGTCGAAATTTCAGGCATAAGTGGTGTTGATCAGTTTCATGTCGTCCATAGCCTATCCAAGTTTAGGGTCATTATATTAATGCTTTCTTGAATTCAAAAAGCGAACTCCGAGTCTCATATCCCTACCAAAAACGCGCTGCCCCTTTTGCCTAACATGCTTCACCTTCGACGATCAAATCCCATTATTCATAGTTTCGTTTTTTGTTTCAAATTCCAGAACTTTCCTGCCACCCAATCAAGATTGCTCAACACGCTTTCCTCCCTCTTCAATCGATGCAACTCACGTCAACACCTCGAACAAATTCATGCCAGATTCATTCTCCATGGTTTCCACCAAAACTCAACTCTCTCTTGCAAGCTTATTGACTGTTATGCGAATCTTGGACTCCTTAATCTCTCTCAGCAAGTTTTCTACTCTATAATCGATCCCAATTCAACTCTTTATAATGCTATACTGAGAAATTTGACTAAATATGGTGAATACGAGCGTACATTGTTGGTGTACCGAGAAATGTTTGCCAAGTCCATGCACTCGGATGAAGAGACTTACCCTTTTGTTTTGCGATCCTGTTGTTGCTTATCAAATGTTGAATTTGGGACGAAGATTCATGGGCGTTTGGTTAAACTTGGTGTTGATTCATATGATACGGTAGCCACTGCTCTAGCTGAGATGTATGATGAGTGCATTGATTTTGAGAATTATCATCAACCGTTTGATAAAATGTTTGTGAAGGATTTGGAATGCTGGAGTTCCTTGATTTCAAAGAATTCTCAAAATAGGAATGGAGATGAAAATTCCTTGTTCTCTGGGAGAATGAGAACAGAGCAATTAGTACCAGATTCACTCACATTCATCAATCTCTTGAGGTCCATTGCAGGTTTTAATTCAATTCAGCTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAACTTGTGTGGAGATTTGTTAGTAAATACTGCTGTATTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTGGATGCTAGAAAATTATTTAACAAAATGCCAGAGAATGACTGTGTTGTATGGAATATAATGATATCAGCTTACGCCCGAGAAGGGAAACCGACAGAATGTCTCGAGCTCTTCATGTCCATGGCACGATCTGGGATTAGAGCTGATCTATTTACTGCACTCCCTGTTATCTCTTCAGTTTCACAGTTGAGATGTGTTGATTGGGGCAAACAAACCCATGCCCATATATTGAGGAATGGTTTAGACAGTCAAGTTTCAGTTCATAACTCTCTCATTGACATGTACTGCGAGTGTAACATCTTAGATTCGGCTTGTAAGATCTTCAACGGGACGACAGACAAGACTGTAATTTCATGGAGTGCAATGATCAAGGGGTATGTCAAACATGGTCGGTCTCTCATTGCTTTGTCTCTCTTCCTTAGGATGAAATGTGAAGGGATTCAAGCTGATTTCATT
ACAATGATTAATATCTTACCTGCATTTGTTCACATAGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAGTGAAGCTAGGTCTGACTTCCCTTCCATCACTTAACACAACCCTCCTAATTACCTATGCAAAATGTGGCTGTATAGAGATGGCCCAAAGGCTATTTGAGGAAGAAAGAGTTGATGACAAGGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAACCATGGAGATTGGTCCCAATGTTTCAAGTTGTACAATCAACTGAAGTGCTCAAATTCAAAGCCAGATCAAGTAACATTTTTGGGACTACTAACAACTTGCGTCAATTGCGGTCTCGTAGAAAAAGGAAAGGAGTTTTTCAAGGAGATGGTTGAAAATTATGGTTGCCAGCCAAGTCAAGAGCATTATGCTTGTATGGTTAATCTCTTAGGGAGAGCTGGACTTATCAATGAAGCTGGAGAACTTGTAAGAACCATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCATTCTAGGTCCAAGCTTGCAGAGTTTGCAGCAGAGCAGCTCATTGATATGGAGCCTAAAAATGCAGGGAATTACATATTGCTTTCGAACATATATGCTGCTGCAGGAAGATGGGATAGAGTTGCAAAAATGAGAAGTTTCCTTAGGGATAAAGGGCTCAAGAAAACCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCAGGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTAGAAAACCTTGAACATGAAATCAAAGAAGCTAGAGCAAAGAGTCCAGAAAAATTGAGTAATCTTCTATAACACTTGTATCCATTTTTTGTTTTTAATGATATATCTTCTCATATAACAGGTTTTACTTGCTCATTTATTTACGTCTTCACATTACATTGTTTACATGACGATGTCCAATAAATATATTAATGTAAATGATCTCATTTCTTCAGCATTTATGTC

mRNA sequence

CGATGTTCCAATTTTATTTGAATTTGAAAGTGTGAGTGTGCTTGTGTTTTATTTTAGATTTGGAAGGGTTAAAAAGAAGAGTTGGGAGGGATAGGTTGGTCATTTTAACTATTTTAGTGTCAAAAATAAGAGAAACAAAGAAAATAGAAAGTTAAAAAAAAAAAAAACCCACCGACCTTTTCCTTTTGTTCTTCGCGGGCACACCGGAAGGTAAGGGGGGTTTAGGGTTTAAGGGATCACTCGACGGTGCACGGAGGGGAAGGGACCGACAAACGATCCGAGTCAGCGGCGGTTTCCGGCAGATGTTTCTCCCTCTCCTTCGCGAATTTCTTTTTCTCTCTCTTGCGCTCGAGCAAAACTCCGGACGCCATCCACTCCTTTGGGTCAGCGGCGGCATCTGCAGCAAGCGGTGGCCGATTGTAGTTGACGACTCCGATAGATCTGATTGACGTCGACACCCTGCGACGCCATTCCCTTGCATCAGATCTGTGCCATTGACACGAGCTTCGACTACAACGGCGATCTCACCGCTCGCGGACGATTCTTCTTCAGTGACAAACGCAACCTTCGGCAAAGACCCACGCGTAATTGACTCCAGATCTGCAGTACCCAGTTTTTTTTCGGGTTGGATCGGTACACAAACGACTTTATCGCCTTTGTTTGAGCTAAGAATCGAATACTCTTAGCCTTACAATGTCAGATCTGCGAGGTAATGTTTTTTTACTGCTTGGTTCTTGGTCGGTTTTGGTGTTTTGAAGATAACCCACAGTAGTTCTAACTGTTTCTGATTACCCATAACATTTGGGTATTGGGTTTGCGGAATCAACCCATTCGGAGTCAAATTCAGCCAGGATTCAAATTAAAAAACTAAGAAGTGGTTGGGCGAGCGAGAGCGAAGTTCGCCAGGTAGTCACTCTTGAGGTGATGTGCAACCATGAGGCGGATGATAGAAGAGATGATCGACCAAGGCTATGGAGACGTCATGAGAGTTTCTCCCTAGGGTCATAGTTTAGTTTTGTCCTTGTCAAGTAACGTGGGGTAGATTTGTTTAATTGCAGTTTCCTTTGTGTTCTCTTTTGTTCAGTGGTTTCTCTTAGAGCTAAAGTCTGTGGTTGGCCAAACTCCGTTTTTATTCGTAGTTTGGTGGATGTTGAGCGGTTCTTCTGAGATATAGAAGTATTTTTGTACCTTTCAGTCCCTTGTAAACTTTTGGTTTCCTTGTAATTATATTGTTATTTTGTCGAAATTTCAGGCATAAGTGGTGTTGATCAGTTTCATGTCGTCCATAGCCTATCCAAGTTTAGGGTCATTATATTAATGCTTTCTTGAATTCAAAAAGCGAACTCCGAGTCTCATATCCCTACCAAAAACGCGCTGCCCCTTTTGCCTAACATGCTTCACCTTCGACGATCAAATCCCATTATTCATAGTTTCGTTTTTTGTTTCAAATTCCAGAACTTTCCTGCCACCCAATCAAGATTGCTCAACACGCTTTCCTCCCTCTTCAATCGATGCAACTCACGTCAACACCTCGAACAAATTCATGCCAGATTCATTCTCCATGGTTTCCACCAAAACTCAACTCTCTCTTGCAAGCTTATTGACTGTTATGCGAATCTTGGACTCCTTAATCTCTCTCAGCAAGTTTTCTACTCTATAATCGATCCCAATTCAACTCTTTATAATGCTATACTGAGAAATTTGACTAAATATGGTGAATACGAGCGTACATTGTTGGTGTACCGAGAAATGTTTGCCAAGTCCATGCACTCGGATGAAGAGACTTACCCTTTTGTTTTGCGATCCTGTTGTTGCTTATCAAATGTTGAATTTGGGACGAAGATTCATGGGCGTTTGGTTAAACTTGGTGTTGATTCATATGATACGGTAGCCACTGCTCTAGCTGAGATGTATGATGAGTGCATTGATTTTGAGAATTATCATCAACCGTTTGATAAAATGTTTGTGAAGGATTTGGAATGCTGGAGTTCCTTGATTTC
AAAGAATTCTCAAAATAGGAATGGAGATGAAAATTCCTTGTTCTCTGGGAGAATGAGAACAGAGCAATTAGTACCAGATTCACTCACATTCATCAATCTCTTGAGGTCCATTGCAGGTTTTAATTCAATTCAGCTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAACTTGTGTGGAGATTTGTTAGTAAATACTGCTGTATTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTGGATGCTAGAAAATTATTTAACAAAATGCCAGAGAATGACTGTGTTGTATGGAATATAATGATATCAGCTTACGCCCGAGAAGGGAAACCGACAGAATGTCTCGAGCTCTTCATGTCCATGGCACGATCTGGGATTAGAGCTGATCTATTTACTGCACTCCCTGTTATCTCTTCAGTTTCACAGTTGAGATGTGTTGATTGGGGCAAACAAACCCATGCCCATATATTGAGGAATGGTTTAGACAGTCAAGTTTCAGTTCATAACTCTCTCATTGACATGTACTGCGAGTGTAACATCTTAGATTCGGCTTGTAAGATCTTCAACGGGACGACAGACAAGACTGTAATTTCATGGAGTGCAATGATCAAGGGGTATGTCAAACATGGTCGGTCTCTCATTGCTTTGTCTCTCTTCCTTAGGATGAAATGTGAAGGGATTCAAGCTGATTTCATTACAATGATTAATATCTTACCTGCATTTGTTCACATAGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAGTGAAGCTAGGTCTGACTTCCCTTCCATCACTTAACACAACCCTCCTAATTACCTATGCAAAATGTGGCTGTATAGAGATGGCCCAAAGGCTATTTGAGGAAGAAAGAGTTGATGACAAGGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAACCATGGAGATTGGTCCCAATGTTTCAAGTTGTACAATCAACTGAAGTGCTCAAATTCAAAGCCAGATCAAGTAACATTTTTGGGACTACTAACAACTTGCGTCAATTGCGGTCTCGTAGAAAAAGGAAAGGAGTTTTTCAAGGAGATGGTTGAAAATTATGGTTGCCAGCCAAGTCAAGAGCATTATGCTTGTATGGTTAATCTCTTAGGGAGAGCTGGACTTATCAATGAAGCTGGAGAACTTGTAAGAACCATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCATTCTAGGTCCAAGCTTGCAGAGTTTGCAGCAGAGCAGCTCATTGATATGGAGCCTAAAAATGCAGGGAATTACATATTGCTTTCGAACATATATGCTGCTGCAGGAAGATGGGATAGAGTTGCAAAAATGAGAAGTTTCCTTAGGGATAAAGGGCTCAAGAAAACCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCAGGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTAGAAAACCTTGAACATGAAATCAAAGAAGCTAGAGCAAAGAGTCCAGAAAAATTGAGTAATCTTCTATAACACTTGTATCCATTTTTTGTTTTTAATGATATATCTTCTCATATAACAGGTTTTACTTGCTCATTTATTTACGTCTTCACATTACATTGTTTACATGACGATGTCCAATAAATATATTAATGTAAATGATCTCATTTCTTCAGCATTTATGTC

Coding sequence (CDS)

ATGCTTCACCTTCGACGATCAAATCCCATTATTCATAGTTTCGTTTTTTGTTTCAAATTCCAGAACTTTCCTGCCACCCAATCAAGATTGCTCAACACGCTTTCCTCCCTCTTCAATCGATGCAACTCACGTCAACACCTCGAACAAATTCATGCCAGATTCATTCTCCATGGTTTCCACCAAAACTCAACTCTCTCTTGCAAGCTTATTGACTGTTATGCGAATCTTGGACTCCTTAATCTCTCTCAGCAAGTTTTCTACTCTATAATCGATCCCAATTCAACTCTTTATAATGCTATACTGAGAAATTTGACTAAATATGGTGAATACGAGCGTACATTGTTGGTGTACCGAGAAATGTTTGCCAAGTCCATGCACTCGGATGAAGAGACTTACCCTTTTGTTTTGCGATCCTGTTGTTGCTTATCAAATGTTGAATTTGGGACGAAGATTCATGGGCGTTTGGTTAAACTTGGTGTTGATTCATATGATACGGTAGCCACTGCTCTAGCTGAGATGTATGATGAGTGCATTGATTTTGAGAATTATCATCAACCGTTTGATAAAATGTTTGTGAAGGATTTGGAATGCTGGAGTTCCTTGATTTCAAAGAATTCTCAAAATAGGAATGGAGATGAAAATTCCTTGTTCTCTGGGAGAATGAGAACAGAGCAATTAGTACCAGATTCACTCACATTCATCAATCTCTTGAGGTCCATTGCAGGTTTTAATTCAATTCAGCTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAACTTGTGTGGAGATTTGTTAGTAAATACTGCTGTATTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTGGATGCTAGAAAATTATTTAACAAAATGCCAGAGAATGACTGTGTTGTATGGAATATAATGATATCAGCTTACGCCCGAGAAGGGAAACCGACAGAATGTCTCGAGCTCTTCATGTCCATGGCACGATCTGGGATTAGAGCTGATCTATTTACTGCACTCCCTGTTATCTCTTCAGTTTCACAGTTGAGATGTGTTGATTGGGGCAAACAAACCCATGCCCATATATTGAGGAATGGTTTAGACAGTCAAGTTTCAGTTCATAACTCTCTCATTGACATGTACTGCGAGTGTAACATCTTAGATTCGGCTTGTAAGATCTTCAACGGGACGACAGACAAGACTGTAATTTCATGGAGTGCAATGATCAAGGGGTATGTCAAACATGGTCGGTCTCTCATTGCTTTGTCTCTCTTCCTTAGGATGAAATGTGAAGGGATTCAAGCTGATTTCATTACAATGATTAATATCTTACCTGCATTTGTTCACATAGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAGTGAAGCTAGGTCTGACTTCCCTTCCATCACTTAACACAACCCTCCTAATTACCTATGCAAAATGTGGCTGTATAGAGATGGCCCAAAGGCTATTTGAGGAAGAAAGAGTTGATGACAAGGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAACCATGGAGATTGGTCCCAATGTTTCAAGTTGTACAATCAACTGAAGTGCTCAAATTCAAAGCCAGATCAAGTAACATTTTTGGGACTACTAACAACTTGCGTCAATTGCGGTCTCGTAGAAAAAGGAAAGGAGTTTTTCAAGGAGATGGTTGAAAATTATGGTTGCCAGCCAAGTCAAGAGCATTATGCTTGTATGGTTAATCTCTTAGGGAGAGCTGGACTTATCAATGAAGCTGGAGAACTTGTAAGAACCATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCATTCTAGGTCCAAGCTTGCAGAGTTTGCAGCAGAGCAGCTCATTGATATGGAGCCTAAAAATGCAGGGAATTACATATTGCTTTCGAACATATATGCTGCTGCAGGAAGATGGGATAGAGTTGCAAAAATGAGAAGTTTCCTTAGGGATAAAGG
GCTCAAGAAAACCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCAGGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTAGAAAACCTTGAACATGAAATCAAAGAAGCTAGAGCAAAGAGTCCAGAAAAATTGAGTAATCTTCTATAA

Protein sequence

MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEKLSNLL
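
The protein sequence above corresponds to a translation of the CDS. As a rough check, the sketch below translates two short excerpts copied from the start and end of the CDS using the standard genetic code. The function name `translate` and the choice of the standard code table are assumptions made for illustration; the record does not say which tool or table produced the protein.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCTTCACCTTCGACGATCAAATCCC"  # first 27 nt of the CDS
cds_end = "AGTAATCTTCTATAA"                # last 15 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to MLHLRRSNP and SNLL, which agree with the first nine and last four residues of the protein sequence listed above.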
Homology
BLAST of Tan0011958 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 8.7e-115
Identity = 220/677 (32.50%), Positives = 368/677 (54.36%), Query Frame = 0

Query: 37  LFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTL 96
           L  RC+S + L QI      +G +Q      KL+  +   G ++ + +VF  I    + L
Sbjct: 43  LLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVL 102

Query: 97  YNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLV 156
           Y+ +L+   K  + ++ L  +  M    +      + ++L+ C   + +  G +IHG LV
Sbjct: 103 YHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLV 162

Query: 157 KLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSL 216
           K G        T L  MY +C       + FD+M  +DL  W+++++  SQN        
Sbjct: 163 KSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALE 222

Query: 217 FSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYS 276
               M  E L P  +T +++L +++    I + K +H  A+ S     + ++TA++ +Y+
Sbjct: 223 MVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYA 282

Query: 277 KLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALP 336
           K GSL  AR+LF+ M E + V WN MI AY +   P E + +F  M   G++    + + 
Sbjct: 283 KCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMG 342

Query: 337 VISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKT 396
            + + + L  ++ G+  H   +  GLD  VSV NSLI MYC+C  +D+A  +F     +T
Sbjct: 343 ALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRT 402

Query: 397 VISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHG 456
           ++SW+AMI G+ ++GR + AL+ F +M+   ++ D  T ++++ A   +    + K++HG
Sbjct: 403 LVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHG 462

Query: 457 YSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDW 516
             ++  L     + T L+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG  
Sbjct: 463 VVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIF--DMMSERHVTTWNAMIDGYGTHGFG 522

Query: 517 SQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYAC 576
               +L+ +++    KP+ VTFL +++ C + GLVE G + F  M ENY  + S +HY  
Sbjct: 523 KAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGA 582

Query: 577 MVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKN 636
           MV+LLGRAG +NEA + +  MP+KP   V+G +L AC++H     AE AAE+L ++ P +
Sbjct: 583 MVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDD 642

Query: 637 AGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRA 696
            G ++LL+NIY AA  W++V ++R  +  +GL+KTPGCS +EI  +V  F      HP +
Sbjct: 643 GGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDS 702

Query: 697 EDIYTILENLEHEIKEA 714
           + IY  LE L   IKEA
Sbjct: 703 KKIYAFLEKLICHIKEA 717
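
The percentages in an HSP header are the ratio of matching (or positively scoring) columns to the alignment length. The sketch below recomputes them from the counts in the header of the hit above; the function name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tooling.

```python
import re

# Matches fields of the form 'Label = 220/677 (32.50%)'.
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_stats(line):
    """Return {label: (recomputed percent, reported percent)}."""
    results = {}
    for name, num, den, reported in STAT.findall(line):
        computed = round(100 * int(num) / int(den), 2)
        results[name] = (computed, float(reported))
    return results

header = "Identity = 220/677 (32.50%), Positives = 368/677 (54.36%), Query Frame = 0"
stats = check_hsp_stats(header)
for name, (computed, reported) in stats.items():
    print(name, computed, reported)
```

For this hit, 220/677 and 368/677 reproduce the reported 32.50% and 54.36%; the "Query Frame" field carries no ratio and is ignored by the pattern.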

BLAST of Tan0011958 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 2.2e-110
Identity = 225/731 (30.78%), Positives = 399/731 (54.58%), Query Frame = 0

Query: 2   LHLRRS-----NPIIHSFV----------FCFKFQNFPATQSRLLNTLSSLFNRCNSRQH 61
           L LRRS     N II SFV          F FK   F  +    ++T   L   C + ++
Sbjct: 96  LDLRRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPD--VSTFPCLVKACVALKN 155

Query: 62  LEQIHARFILH-----GFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAIL 121
            + I   F+       G   N  ++  LI  Y   G +++  ++F  ++  +  ++N +L
Sbjct: 156 FKGID--FLSDTVSSLGMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVML 215

Query: 122 RNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVD 181
               K G  +  +  +  M    +  +  T+  VL  C     ++ G ++HG +V  GVD
Sbjct: 216 NGYAKCGALDSVIKGFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVD 275

Query: 182 SYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRM 241
              ++  +L  MY +C  F++  + F  M   D   W+ +IS   Q+   +E+  F   M
Sbjct: 276 FEGSIKNSLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEM 335

Query: 242 RTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSL 301
            +  ++PD++TF +LL S++ F +++  K +HC  +  ++  D+ + +A++  Y K   +
Sbjct: 336 ISSGVLPDAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGV 395

Query: 302 VDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSV 361
             A+ +F++    D VV+  MIS Y   G   + LE+F  + +  I  +  T + ++  +
Sbjct: 396 SMAQNIFSQCNSVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVI 455

Query: 362 SQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWS 421
             L  +  G++ H  I++ G D++ ++  ++IDMY +C  ++ A +IF   + + ++SW+
Sbjct: 456 GILLALKLGRELHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWN 515

Query: 422 AMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKL 481
           +MI    +      A+ +F +M   GI  D +++   L A  ++ +    K +HG+ +K 
Sbjct: 516 SMITRCAQSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKH 575

Query: 482 GLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFK 541
            L S     +TL+  YAKCG ++ A  +F  + + +K+++ WNS+I+A  NHG       
Sbjct: 576 SLASDVYSESTLIDMYAKCGNLKAAMNVF--KTMKEKNIVSWNSIIAACGNHGKLKDSLC 635

Query: 542 LYNQL-KCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNL 601
           L++++ + S  +PDQ+TFL ++++C + G V++G  FF+ M E+YG QP QEHYAC+V+L
Sbjct: 636 LFHEMVEKSGIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDL 695

Query: 602 LGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNY 661
            GRAG + EA E V++MP  PDA VWG LL AC+LH   +LAE A+ +L+D++P N+G Y
Sbjct: 696 FGRAGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYY 755

Query: 662 ILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY 712
           +L+SN +A A  W+ V K+RS ++++ ++K PG SW+EIN +   F   D  HP +  IY
Sbjct: 756 VLISNAHANAREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIY 815

BLAST of Tan0011958 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 2.9e-110
Identity = 227/717 (31.66%), Positives = 395/717 (55.09%), Query Frame = 0

Query: 27  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQ 86
           QS+      S    C +   L+  H      G   + +   KL+     LG    L+ ++
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 87  QVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCL 146
           +VF +     +  +YN+++R     G     +L++  M    +  D+ T+PF L +C   
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKS 147

Query: 147 SNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLI 206
                G +IHG +VK+G      V  +L   Y EC + ++  + FD+M  +++  W+S+I
Sbjct: 148 RAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMI 207

Query: 207 SKNSQNRNG-DENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNL 266
              ++     D   LF   +R E++ P+S+T + ++ + A    ++  + V+     S +
Sbjct: 208 CGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGI 267

Query: 267 CGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS 326
             + L+ +A++ +Y K  ++  A++LF++   ++  + N M S Y R+G   E L +F  
Sbjct: 268 EVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNL 327

Query: 327 MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNI 386
           M  SG+R D  + L  ISS SQLR + WGK  H ++LRNG +S  ++ N+LIDMY +C+ 
Sbjct: 328 MMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHR 387

Query: 387 LDSACKIFNGTTDKTVISWSAMIKGYVKHGR------------------------SLIAL 446
            D+A +IF+  ++KTV++W++++ GYV++G                          L+  
Sbjct: 388 QDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQG 447

Query: 447 SLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSL 506
           SLF   + + C     EG+ AD +TM++I  A  H+GAL+  K+++ Y  K G+     L
Sbjct: 448 SLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRL 507

Query: 507 NTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCS 566
            TTL+  +++CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ +   
Sbjct: 508 GTTLVDMFSRCGDPESAMSIFNS--LTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQ 567

Query: 567 NSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE 626
             KPD V F+G LT C + GLV++GKE F  M++ +G  P   HY CMV+LLGRAGL+ E
Sbjct: 568 GLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEE 627

Query: 627 AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAA 686
           A +L+  MP++P+  +W  LL+AC++    ++A +AAE++  + P+  G+Y+LLSN+YA+
Sbjct: 628 AVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYAS 687

Query: 687 AGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENL 707
           AGRW+ +AK+R  +++KGL+K PG S ++I G+  EF   D++HP   +I  +L+ +
Sbjct: 688 AGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEV 742

BLAST of Tan0011958 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 391.3 bits (1004), Expect = 2.3e-107
Identity = 210/657 (31.96%), Positives = 359/657 (54.64%), Query Frame = 0

Query: 57  HGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLV 116
           +GF  +S L  KL   Y N G L  + +VF  +    +  +N ++  L K G++  ++ +
Sbjct: 123 NGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGL 182

Query: 117 YREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDE 176
           +++M +  +  D  T+  V +S   L +V  G ++HG ++K G    ++V  +L   Y +
Sbjct: 183 FKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLK 242

Query: 177 CIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINL 236
               ++  + FD+M  +D+  W+S+I+    N   ++      +M    +  D  T +++
Sbjct: 243 NQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSV 302

Query: 237 LRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDC 296
               A    I L + VH I + +    +      +L +YSK G L  A+ +F +M +   
Sbjct: 303 FAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSV 362

Query: 297 VVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAH 356
           V +  MI+ YAREG   E ++LF  M   GI  D++T   V++  ++ R +D GK+ H  
Sbjct: 363 VSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEW 422

Query: 357 ILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIA 416
           I  N L   + V N+L+DMY +C  +  A  +F+    K +ISW+ +I GY K+  +  A
Sbjct: 423 IKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEA 482

Query: 417 LSLF-LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLI 476
           LSLF L ++ +    D  T+  +LPA   + A +  + +HGY ++ G  S   +  +L+ 
Sbjct: 483 LSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVD 542

Query: 477 TYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQ 536
            YAKCG + +A  LF++  +  KDL+ W  MI+ +  HG   +   L+NQ++ +  + D+
Sbjct: 543 MYAKCGALLLAHMLFDD--IASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADE 602

Query: 537 VTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVR 596
           ++F+ LL  C + GLV++G  FF  M      +P+ EHYAC+V++L R G + +A   + 
Sbjct: 603 ISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIE 662

Query: 597 TMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDR 656
            MPI PDA +WG LL  C++H   KLAE  AE++ ++EP+N G Y+L++NIYA A +W++
Sbjct: 663 NMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQ 722

Query: 657 VAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKE 713
           V ++R  +  +GL+K PGCSW+EI G+V  F   D ++P  E+I   L  +   + E
Sbjct: 723 VKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIE 777

BLAST of Tan0011958 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 381.7 bits (979), Expect = 1.8e-104
Identity = 204/681 (29.96%), Positives = 362/681 (53.16%), Query Frame = 0

Query: 35  SSLFNRCNSRQHL---EQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIID 94
           SS+ + C   + L   EQ+H   +  GF  ++ +   L+  Y +LG L  ++ +F ++  
Sbjct: 292 SSVLSACKKIESLEIGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQ 351

Query: 95  PNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKI 154
            ++  YN ++  L++ G  E+ + +++ M    +  D  T   ++ +C     +  G ++
Sbjct: 352 RDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQL 411

Query: 155 HGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNG 214
           H    KLG  S + +  AL  +Y +C D E     F +  V+++  W+ ++       + 
Sbjct: 412 HAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDL 471

Query: 215 DENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAV 274
             +     +M+ E++VP+  T+ ++L++      ++L + +H   I +N   +  V + +
Sbjct: 472 RNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVL 531

Query: 275 LSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADL 334
           + +Y+KLG L  A  +  +    D V W  MI+ Y +     + L  F  M   GIR+D 
Sbjct: 532 IDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDE 591

Query: 335 FTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNG 394
                 +S+ + L+ +  G+Q HA    +G  S +   N+L+ +Y  C  ++ +   F  
Sbjct: 592 VGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQ 651

Query: 395 TTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENV 454
           T     I+W+A++ G+ + G +  AL +F+RM  EGI  +  T  + + A      ++  
Sbjct: 652 TEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQG 711

Query: 455 KYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHA 514
           K +H    K G  S   +   L+  YAKCG I  A++ F E  V  K+ + WN++I+A++
Sbjct: 712 KQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLE--VSTKNEVSWNAIINAYS 771

Query: 515 NHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQ 574
            HG  S+    ++Q+  SN +P+ VT +G+L+ C + GLV+KG  +F+ M   YG  P  
Sbjct: 772 KHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKP 831

Query: 575 EHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLID 634
           EHY C+V++L RAGL++ A E ++ MPIKPDA VW  LLSAC +H   ++ EFAA  L++
Sbjct: 832 EHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLE 891

Query: 635 MEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQ 694
           +EP+++  Y+LLSN+YA + +WD     R  +++KG+KK PG SW+E+   +  F V DQ
Sbjct: 892 LEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQ 951

Query: 695 THPRAEDIYTILENLEHEIKE 713
            HP A++I+   ++L     E
Sbjct: 952 NHPLADEIHEYFQDLTKRASE 970

BLAST of Tan0011958 vs. NCBI nr
Match: XP_022994744.1 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1248.0 bits (3228), Expect = 0.0e+00
Identity = 616/725 (84.97%), Positives = 655/725 (90.34%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           M HL+RS  I  S +F FKF NFPATQSRLLNTLSSLF+RC SRQ LEQIHARF+LHGFH
Sbjct: 1   MFHLQRSKSITQSPIFRFKFPNFPATQSRLLNTLSSLFSRCKSRQQLEQIHARFVLHGFH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLSCKLIDCYAN GLLN+S  VF SIIDPNSTLYNAILRNLT++GEYERTLLVYREM
Sbjct: 61  QNPTLSCKLIDCYANFGLLNVSHHVFNSIIDPNSTLYNAILRNLTRFGEYERTLLVYREM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
            AKSMH DE+TYPFVL+SCCCLSNVEFG  IHG L+KLGVDSYDTV T LAEMY +CIDF
Sbjct: 121 VAKSMHPDEQTYPFVLQSCCCLSNVEFGKNIHGCLIKLGVDSYDTVVTVLAEMYGKCIDF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN HQ FDKM VKDL+CWSSLIS+  QN NGDE SL  GRM++E LV DSLTFINLLRSI
Sbjct: 181 ENAHQLFDKMSVKDLDCWSSLISEAPQNGNGDEISLLLGRMKSEPLVTDSLTFINLLRSI 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           +G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF KMPE D VVWN
Sbjct: 241 SGLSSIQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKMPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+C DWGKQTHA+ILRN
Sbjct: 301 IMIAAYAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKCADWGKQTHANILRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSVHNSLIDMYCECN L+SACKIFN  T+KTVISWSAMIKG VKHG  LIALSLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECNSLESACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
             MK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKC
Sbjct: 421 FMMKSDGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           GCIEMAQRLFEEERV+DKDLIMWNSMISAHANHGDWSQCFKLYNQ+KCSNS PDQVTFLG
Sbjct: 481 GCIEMAQRLFEEERVNDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSNPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Sbjct: 541 LLTACVNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 601 PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY IL NLE +IKEA+  SPEK
Sbjct: 661 SFLRDKGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEAKEMSPEK 720

Query: 721 LSNLL 726
           L  LL
Sbjct: 721 LGTLL 725

BLAST of Tan0011958 vs. NCBI nr
Match: XP_023541395.1 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1239.9 bits (3207), Expect = 0.0e+00
Identity = 609/725 (84.00%), Positives = 654/725 (90.21%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           M HL+RS PI  S +F FKF NFPATQSRL NTLSSLF+RC SRQ L+QIHARF+LHGFH
Sbjct: 1   MFHLQRSKPITQSPIFRFKFPNFPATQSRLFNTLSSLFSRCKSRQQLQQIHARFVLHGFH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLSCKLIDCYAN GLLNLS  VF SIIDPNSTLYNAILRNLT++GEYERTLL+YREM
Sbjct: 61  QNPTLSCKLIDCYANFGLLNLSHHVFNSIIDPNSTLYNAILRNLTRFGEYERTLLMYREM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
             KSMH DE+TYPFVLRSCCCLS+VEFG  IHG L+KLGVDSYDTV T LAEMY++CIDF
Sbjct: 121 VGKSMHPDEQTYPFVLRSCCCLSHVEFGKNIHGCLIKLGVDSYDTVVTVLAEMYEKCIDF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN HQ FDKM VKDL+CWSSL+S+  QN NGD+ SL  GRM++E +V DSLTFIN LRS+
Sbjct: 181 ENAHQLFDKMSVKDLDCWSSLMSEAPQNGNGDDISLLFGRMKSEPIVTDSLTFINRLRSV 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           +G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF KMPE D VVWN
Sbjct: 241 SGLSSIQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKMPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+  DWGKQTHA+ILRN
Sbjct: 301 IMIAAYAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKRADWGKQTHANILRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSVHNSLIDMYCECN LDSACKIFN  T+KTVISWSAMIKG VKHG  LIALSLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECNSLDSACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
            RMK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKC
Sbjct: 421 FRMKSDGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           GCI+MAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLY+Q+KCSNS PDQVTFLG
Sbjct: 481 GCIDMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYSQMKCSNSNPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Sbjct: 541 LLTACVNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 601 PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY IL NLE +IKEA+  SPEK
Sbjct: 661 SFLRDKGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEAKEMSPEK 720

Query: 721 LSNLL 726
           L  LL
Sbjct: 721 LGTLL 725

BLAST of Tan0011958 vs. NCBI nr
Match: XP_038894029.1 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 1236.9 bits (3199), Expect = 0.0e+00
Identity = 615/725 (84.83%), Positives = 651/725 (89.79%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           MLHL+RS P+IHS +    F NFPATQSRLLNTLS LF+RC+SRQHL+QIHARF+LHGFH
Sbjct: 34  MLHLQRSKPVIHSLI----FPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFH 93

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLS KLIDCYANLGLLNLS QVFYSI +PNST+YNAILRNLT+YGE ERTLLVYR+M
Sbjct: 94  QNPTLSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQM 153

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
            AKSMH DEETYP VLRSCC  SNV  G KIHG LVKLG DS+D VATAL EMY+ECIDF
Sbjct: 154 VAKSMHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDF 213

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           E+ HQ FDK  VKDLECWSS  ++  QN NG+      GRMR EQLV DSLTFINLLR I
Sbjct: 214 ESAHQLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFI 273

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           AGFNSIQLAKIVHCIAIVS LCGDLLVNTAVLSLYSKLGSLVDARKLF+KMPEND VVWN
Sbjct: 274 AGFNSIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWN 333

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREGKPTECL LF SMARSGIR+D+FTALPVISS+SQL+  DWGKQTHA+ILRN
Sbjct: 334 IMIAAYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRN 393

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSV+NSLIDMYCECNILDSACKIFN   DKTVISWSAMIKGYVKHG SLIALSLF
Sbjct: 394 GSDSQVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLF 453

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
             MK +GIQ+DFIT+INILPAFVHIG LENVKYLHGYS+KLGLTSLPSLNT LLITYAKC
Sbjct: 454 SSMKSDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKC 513

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           GCIEMAQR+FEEER+DDKDLIMWNSMISAHANHGDWSQCFKLYNQ+KCSN+KPDQVTFLG
Sbjct: 514 GCIEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLG 573

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKEF KEM ENYGCQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Sbjct: 574 LLTACVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK 633

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACKLH  SKLAEFAAE+L+DMEPKNAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 634 PDARVWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMR 693

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLRDKGLKKTPGCSWLEING VTEFRVADQTHPRAEDIYTIL NLE EIKEAR KS EK
Sbjct: 694 SFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEK 753

Query: 721 LSNLL 726
           L N L
Sbjct: 754 LGNPL 754

BLAST of Tan0011958 vs. NCBI nr
Match: KAG7012542.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1234.9 bits (3194), Expect = 0.0e+00
Identity = 610/725 (84.14%), Positives = 650/725 (89.66%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           M HL+RS PI     F FKF NFPATQSRLLNTLSSLF+RC SRQ L+QIHARF+LHGFH
Sbjct: 1   MFHLQRSKPI-----FRFKFPNFPATQSRLLNTLSSLFSRCKSRQQLQQIHARFVLHGFH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLSCKLIDCYAN GLLNLS  VF SIIDPNS LYNAILRNLT++GEYERTLLVYREM
Sbjct: 61  QNPTLSCKLIDCYANFGLLNLSHHVFNSIIDPNSALYNAILRNLTRFGEYERTLLVYREM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
            AKSMH DE+TYPFVLRSCCCLSNV+FG  IHG L+KLGVDSYDTV T L EMY++CIDF
Sbjct: 121 VAKSMHPDEQTYPFVLRSCCCLSNVQFGKNIHGCLIKLGVDSYDTVVTVLVEMYEKCIDF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN HQ FDKM VKDL+CWSSL+S   QN NGD+ SL  GRM++E LV DSLTFINLLRSI
Sbjct: 181 ENAHQLFDKMSVKDLDCWSSLMSDAPQNGNGDDISLLFGRMKSEPLVTDSLTFINLLRSI 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           +G +SIQLAK+VHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF K+PE D VVWN
Sbjct: 241 SGLSSIQLAKMVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKIPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+  DWGKQTHA+ILRN
Sbjct: 301 IMIAAYAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKRADWGKQTHANILRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSVHNSLIDMYCECN LDSACKIFN  T+KTVISWSAMIKG VKHG  LIALSLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECNSLDSACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
            RMK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKC
Sbjct: 421 FRMKSDGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           GCI+MAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQ+KCSNS PDQVTFLG
Sbjct: 481 GCIDMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSNPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKEFFKEM+E Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Sbjct: 541 LLTACVNSGLVEKGKEFFKEMIERYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 601 PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY IL NLE +IKE +  SPEK
Sbjct: 661 SFLRDKGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEVKEMSPEK 720

Query: 721 LSNLL 726
           L  LL
Sbjct: 721 LGTLL 720

BLAST of Tan0011958 vs. NCBI nr
Match: KAG6573373.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1231.1 bits (3184), Expect = 0.0e+00
Identity = 607/722 (84.07%), Positives = 650/722 (90.03%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           M HL+RS PI     F FKF NFPATQSRLLNTLSSLF+RC SRQ L+QIHARF+LHGFH
Sbjct: 1   MFHLQRSKPI-----FRFKFPNFPATQSRLLNTLSSLFSRCKSRQQLQQIHARFVLHGFH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLSCKLIDCYAN GLLNLS  VF SIIDPNS LYNAILRNLT++GEYERTLLVYREM
Sbjct: 61  QNPTLSCKLIDCYANFGLLNLSHHVFNSIIDPNSALYNAILRNLTRFGEYERTLLVYREM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
            AKSMH DE+TYPFVLRSCCCLSNV+FG  IHG L+KLGVDSYDTV T L EMY++CIDF
Sbjct: 121 VAKSMHPDEQTYPFVLRSCCCLSNVQFGKNIHGCLIKLGVDSYDTVVTVLVEMYEKCIDF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN HQ FDKM VKDL+CWSSLI++  QN NGD+ S   GRM++E LV DSLTFINLLRS+
Sbjct: 181 ENAHQLFDKMSVKDLDCWSSLITEAPQNGNGDDISRLFGRMKSEPLVTDSLTFINLLRSV 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           +G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF K+PE D VVWN
Sbjct: 241 SGLSSIQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKIPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+  DWGKQTHA+ILRN
Sbjct: 301 IMIAAYAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKRADWGKQTHANILRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSVHNSLIDMYCECN LDSACKIFN  T+KTVISWSAMIKG VKHG  LIALSLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECNSLDSACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
            RMK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKC
Sbjct: 421 FRMKSDGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           GCI+MAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCF LYNQ+KCSNS PDQVTFLG
Sbjct: 481 GCIDMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFNLYNQMKCSNSNPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Sbjct: 541 LLTACVNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 601 PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY IL NLE +IKEA+  SPEK
Sbjct: 661 SFLRDKGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEAKEMSPEK 717

Query: 721 LS 723
           L+
Sbjct: 721 LA 717

BLAST of Tan0011958 vs. ExPASy TrEMBL
Match: A0A6J1K3Q8 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111490383 PE=4 SV=1)

HSP 1 Score: 1248.0 bits (3228), Expect = 0.0e+00
Identity = 616/725 (84.97%), Positives = 655/725 (90.34%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           M HL+RS  I  S +F FKF NFPATQSRLLNTLSSLF+RC SRQ LEQIHARF+LHGFH
Sbjct: 1   MFHLQRSKSITQSPIFRFKFPNFPATQSRLLNTLSSLFSRCKSRQQLEQIHARFVLHGFH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLSCKLIDCYAN GLLN+S  VF SIIDPNSTLYNAILRNLT++GEYERTLLVYREM
Sbjct: 61  QNPTLSCKLIDCYANFGLLNVSHHVFNSIIDPNSTLYNAILRNLTRFGEYERTLLVYREM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
            AKSMH DE+TYPFVL+SCCCLSNVEFG  IHG L+KLGVDSYDTV T LAEMY +CIDF
Sbjct: 121 VAKSMHPDEQTYPFVLQSCCCLSNVEFGKNIHGCLIKLGVDSYDTVVTVLAEMYGKCIDF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN HQ FDKM VKDL+CWSSLIS+  QN NGDE SL  GRM++E LV DSLTFINLLRSI
Sbjct: 181 ENAHQLFDKMSVKDLDCWSSLISEAPQNGNGDEISLLLGRMKSEPLVTDSLTFINLLRSI 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           +G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF KMPE D VVWN
Sbjct: 241 SGLSSIQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKMPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+C DWGKQTHA+ILRN
Sbjct: 301 IMIAAYAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKCADWGKQTHANILRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSVHNSLIDMYCECN L+SACKIFN  T+KTVISWSAMIKG VKHG  LIALSLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECNSLESACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
             MK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKC
Sbjct: 421 FMMKSDGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           GCIEMAQRLFEEERV+DKDLIMWNSMISAHANHGDWSQCFKLYNQ+KCSNS PDQVTFLG
Sbjct: 481 GCIEMAQRLFEEERVNDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSNPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Sbjct: 541 LLTACVNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 601 PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY IL NLE +IKEA+  SPEK
Sbjct: 661 SFLRDKGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEAKEMSPEK 720

Query: 721 LSNLL 726
           L  LL
Sbjct: 721 LGTLL 725

BLAST of Tan0011958 vs. ExPASy TrEMBL
Match: A0A6J1CE61 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111010677 PE=4 SV=1)

HSP 1 Score: 1230.7 bits (3183), Expect = 0.0e+00
Identity = 615/725 (84.83%), Positives = 650/725 (89.66%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           MLHL+RS PI     F F+F NFPATQSR LNTLS LF+RC+SRQ LEQIHARFILHG H
Sbjct: 1   MLHLQRSKPI-----FRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN  LSC+LID YANLGLL LSQQVF SIIDP STLY+AILRNL+ +GEYERTLLVYREM
Sbjct: 61  QNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
           FAKSMH DEETYP VLRSCCCLSNVE+G KIHG LVKLGVD YD+ ATALAEMY +CI F
Sbjct: 121 FAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN H  FDKM +KD ECW+SL S+ SQN NGDE     GRMRTEQLV DSLTFINLLRSI
Sbjct: 181 ENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSI 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
            G NSIQLAKIVHC+AI SNLCGDLLVNTAVLSLYSKLG LV+ARKLF+KMPE D VVWN
Sbjct: 241 VGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AY REG P ECLELF SMARSGIRADLFTALPVISS+SQL+CVDWGKQTHAH LRN
Sbjct: 301 IMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G D+QVSVHNSLIDMYCE NILDSACKIF+  T+KTVISWSAMIKG VKHG+SL ALSLF
Sbjct: 361 GSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
            RMK +GIQADFIT+INILPAFVHIGALENVKYLHGYS+KLGLTSLPSLNT LLITYAKC
Sbjct: 421 SRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFK+YNQ+KCSNS+PDQVTFLG
Sbjct: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKE FKEM+ENYGCQPSQEHYACMVNLLGRAGLIN+AG LVR MPIK
Sbjct: 541 LLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 601 PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLRDKGLKKTPGCSWLEING VTEFRVAD+THPRAEDIYTIL NLE EIKEAR KSPEK
Sbjct: 661 SFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEAREKSPEK 720

Query: 721 LSNLL 726
           L  LL
Sbjct: 721 LGILL 720

BLAST of Tan0011958 vs. ExPASy TrEMBL
Match: A0A6J1GR57 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111456773 PE=4 SV=1)

HSP 1 Score: 1225.3 bits (3169), Expect = 0.0e+00
Identity = 605/725 (83.45%), Positives = 648/725 (89.38%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           M HL+RS PI     F FKF NFPAT SRLLNTLSSLF+RC SRQ L+QIHARF+LHGFH
Sbjct: 1   MFHLQRSKPI-----FRFKFPNFPATHSRLLNTLSSLFSRCKSRQQLQQIHARFVLHGFH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLSCKLIDCYAN GLLNLS  VF SIIDPNS LYNAILRNLT++GEYERTLLVYREM
Sbjct: 61  QNPTLSCKLIDCYANFGLLNLSHHVFNSIIDPNSALYNAILRNLTRFGEYERTLLVYREM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
            AKSMH DE+TYPFVLRSCCCLSNV+FG  IHG L+KLGVDSYDTV T L EMY++CIDF
Sbjct: 121 VAKSMHPDEQTYPFVLRSCCCLSNVQFGKNIHGCLIKLGVDSYDTVVTVLVEMYEKCIDF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN HQ FDKM VKDL+CWSSLI++  QN NGD+ S   GRM++E LV DSLTFINLLRS+
Sbjct: 181 ENAHQLFDKMSVKDLDCWSSLITEAPQNGNGDDISRLFGRMKSEPLVTDSLTFINLLRSV 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           +G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF K+PE D VVWN
Sbjct: 241 SGLSSIQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKIPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREG+P ECLELF SMARSGIRADLFT LPVISS+SQL+  DWGKQTHA+ILRN
Sbjct: 301 IMIAAYAREGRPMECLELFESMARSGIRADLFTVLPVISSISQLKRADWGKQTHANILRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSVHNSLIDMYCECN LDSA KIFN  T+KTVISWSAMIKG VKHG  LIALSLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECNSLDSASKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
            RMK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKC
Sbjct: 421 FRMKSDGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           GCI+MAQRLFEEERV+DKDLIMWNSMISAHANHGDWSQCFKLYNQ+KCSNS PDQVTFLG
Sbjct: 481 GCIDMAQRLFEEERVNDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSNPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Sbjct: 541 LLTACVNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 601 PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY IL NLE +IKE +  SPEK
Sbjct: 661 SFLRDKGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEVKEMSPEK 720

Query: 721 LSNLL 726
           L  LL
Sbjct: 721 LGTLL 720

BLAST of Tan0011958 vs. ExPASy TrEMBL
Match: A0A0A0M0Z6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G534720 PE=4 SV=1)

HSP 1 Score: 1192.2 bits (3083), Expect = 0.0e+00
Identity = 599/725 (82.62%), Positives = 640/725 (88.28%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           MLHL RS PIIHS +F     NFPATQSRLLNTLS LF+RCNS QHL+QIHARFILHGFH
Sbjct: 1   MLHLHRSKPIIHSPIFL----NFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLS KLIDCYANLGLLN S QVF S+IDPN TL+NAILRNLT+YGE ERTLLVY++M
Sbjct: 61  QNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
            AKSMH DEETYPFVLRSC   SNV FG  IHG LVKLG D +D VATALAEMY+ECI+F
Sbjct: 121 VAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN HQ FDK  VKDL   SSL ++  QN NG+      GRM  EQLVPDS TF NLLR I
Sbjct: 181 ENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFI 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           AG NSIQLAKIVHCIAIVS L GDLLVNTAVLSLYSKL SLVDARKLF+KMPE D VVWN
Sbjct: 241 AGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREGKPTECLELF SMARSGIR+DLFTALPVISS++QL+CVDWGKQTHAHILRN
Sbjct: 301 IMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSVHNSLIDMYCEC ILDSACKIFN  TDK+VISWSAMIKGYVK+G+SL ALSLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
            +MK +GIQADF+ MINILPAFVHIGALENVKYLHGYS+KLGLTSLPSLNT LLITYAKC
Sbjct: 421 SKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           G IEMAQRLFEEE++DDKDLIMWNSMISAHANHGDWSQCFKLYN++KCSNSKPDQVTFLG
Sbjct: 481 GSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GLVEKGKEFFKEM E+YGCQPSQEHYACMVNLLGRAGLI+EAGELV+ MPIK
Sbjct: 541 LLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACK+H  SKLAEFAAE+LI+MEP+NAGNYILLSNIYAAAG+WD VAKMR
Sbjct: 601 PDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLR+KGLKK PGCSWLEING VTEFRVADQTHPRA DIYTIL NLE EIKE R KSP+ 
Sbjct: 661 SFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDT 720

Query: 721 LSNLL 726
           L N L
Sbjct: 721 LVNPL 721

BLAST of Tan0011958 vs. ExPASy TrEMBL
Match: A0A5D3DB69 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G00810 PE=4 SV=1)

HSP 1 Score: 1171.0 bits (3028), Expect = 0.0e+00
Identity = 591/725 (81.52%), Positives = 636/725 (87.72%), Query Frame = 0

Query: 1   MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFH 60
           MLHL+RS PIIH+ +      NFPATQSRLLNTLS LFNRCNS QHL+QIHARFILHGFH
Sbjct: 1   MLHLQRSKPIIHTPILL----NFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFH 60

Query: 61  QNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREM 120
           QN TLS KLIDCYANLGLL  S QVF SIIDPN TL+NAILRNLT+YGE ER LLVY++M
Sbjct: 61  QNPTLSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQM 120

Query: 121 FAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDF 180
            AKSMH DEETYPF+ RSC   SNV FG  IHG LVKLG DS+D VATALAEMY++ I F
Sbjct: 121 VAKSMHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAF 180

Query: 181 ENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSI 240
           EN HQ FDK  VKDL   SSL ++ SQN NG+       RMR EQLVPDSLTF+NLLR I
Sbjct: 181 ENAHQLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFI 240

Query: 241 AGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN 300
           AG NSIQLAKIVHCIAIVS L GDLLV TAVLSLYSKL SLVDAR+LF+KMPE D VVWN
Sbjct: 241 AGLNSIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWN 300

Query: 301 IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRN 360
           IMI+AYAREGKP ECLELF SMARSGIR+DLFTALPVISS++QL+CVDWGKQTHAHILRN
Sbjct: 301 IMIAAYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRN 360

Query: 361 GLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF 420
           G DSQVSVHNSLIDMYCEC +LDSAC IFN  TDK+VISWSAMIKGYVK+G+SL A SLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLF 420

Query: 421 LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKC 480
            +MK +GIQADF+TMINILPAFVHIGALENVKYLHGYS+KLGLTSLPSLNT LLITYAKC
Sbjct: 421 SKMKSDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKC 480

Query: 481 GCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLG 540
           G IEMAQRLFEEER+DDKDLIMWNSMISAHANHGDWSQCFKLYN++KCSNSKPDQVTFLG
Sbjct: 481 GYIEMAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLG 540

Query: 541 LLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK 600
           LLT CVN GL+EKGKEFFKEM E+YGC PSQEH+ACMVNLLGRAGLI+EAGELVR MPIK
Sbjct: 541 LLTACVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIK 600

Query: 601 PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR 660
           PDARVWGPLLSACK+H  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+W+ VAKMR
Sbjct: 601 PDARVWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMR 660

Query: 661 SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEK 720
           SFLR+KGLKKTPGCS LEING+VTEFRVADQTHPRAEDIYTIL NLE EIKE R KS + 
Sbjct: 661 SFLRNKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSLDT 720

Query: 721 LSNLL 726
           L N L
Sbjct: 721 LVNPL 721

BLAST of Tan0011958 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 416.0 bits (1068), Expect = 6.2e-116
Identity = 220/677 (32.50%), Positives = 368/677 (54.36%), Query Frame = 0

Query: 37  LFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTL 96
           L  RC+S + L QI      +G +Q      KL+  +   G ++ + +VF  I    + L
Sbjct: 43  LLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVL 102

Query: 97  YNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLV 156
           Y+ +L+   K  + ++ L  +  M    +      + ++L+ C   + +  G +IHG LV
Sbjct: 103 YHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLV 162

Query: 157 KLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSL 216
           K G        T L  MY +C       + FD+M  +DL  W+++++  SQN        
Sbjct: 163 KSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALE 222

Query: 217 FSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYS 276
               M  E L P  +T +++L +++    I + K +H  A+ S     + ++TA++ +Y+
Sbjct: 223 MVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYA 282

Query: 277 KLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALP 336
           K GSL  AR+LF+ M E + V WN MI AY +   P E + +F  M   G++    + + 
Sbjct: 283 KCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMG 342

Query: 337 VISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKT 396
            + + + L  ++ G+  H   +  GLD  VSV NSLI MYC+C  +D+A  +F     +T
Sbjct: 343 ALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRT 402

Query: 397 VISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHG 456
           ++SW+AMI G+ ++GR + AL+ F +M+   ++ D  T ++++ A   +    + K++HG
Sbjct: 403 LVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHG 462

Query: 457 YSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDW 516
             ++  L     + T L+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG  
Sbjct: 463 VVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIF--DMMSERHVTTWNAMIDGYGTHGFG 522

Query: 517 SQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYAC 576
               +L+ +++    KP+ VTFL +++ C + GLVE G + F  M ENY  + S +HY  
Sbjct: 523 KAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGA 582

Query: 577 MVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKN 636
           MV+LLGRAG +NEA + +  MP+KP   V+G +L AC++H     AE AAE+L ++ P +
Sbjct: 583 MVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDD 642

Query: 637 AGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRA 696
            G ++LL+NIY AA  W++V ++R  +  +GL+KTPGCS +EI  +V  F      HP +
Sbjct: 643 GGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDS 702

Query: 697 EDIYTILENLEHEIKEA 714
           + IY  LE L   IKEA
Sbjct: 703 KKIYAFLEKLICHIKEA 717

BLAST of Tan0011958 vs. TAIR 10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 401.4 bits (1030), Expect = 1.6e-111
Identity = 225/731 (30.78%), Positives = 399/731 (54.58%), Query Frame = 0

Query: 2   LHLRRS-----NPIIHSFV----------FCFKFQNFPATQSRLLNTLSSLFNRCNSRQH 61
           L LRRS     N II SFV          F FK   F  +    ++T   L   C + ++
Sbjct: 96  LDLRRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPD--VSTFPCLVKACVALKN 155

Query: 62  LEQIHARFILH-----GFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAIL 121
            + I   F+       G   N  ++  LI  Y   G +++  ++F  ++  +  ++N +L
Sbjct: 156 FKGID--FLSDTVSSLGMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVML 215

Query: 122 RNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVD 181
               K G  +  +  +  M    +  +  T+  VL  C     ++ G ++HG +V  GVD
Sbjct: 216 NGYAKCGALDSVIKGFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVD 275

Query: 182 SYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRM 241
              ++  +L  MY +C  F++  + F  M   D   W+ +IS   Q+   +E+  F   M
Sbjct: 276 FEGSIKNSLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEM 335

Query: 242 RTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSL 301
            +  ++PD++TF +LL S++ F +++  K +HC  +  ++  D+ + +A++  Y K   +
Sbjct: 336 ISSGVLPDAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGV 395

Query: 302 VDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSV 361
             A+ +F++    D VV+  MIS Y   G   + LE+F  + +  I  +  T + ++  +
Sbjct: 396 SMAQNIFSQCNSVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVI 455

Query: 362 SQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWS 421
             L  +  G++ H  I++ G D++ ++  ++IDMY +C  ++ A +IF   + + ++SW+
Sbjct: 456 GILLALKLGRELHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWN 515

Query: 422 AMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKL 481
           +MI    +      A+ +F +M   GI  D +++   L A  ++ +    K +HG+ +K 
Sbjct: 516 SMITRCAQSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKH 575

Query: 482 GLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFK 541
            L S     +TL+  YAKCG ++ A  +F  + + +K+++ WNS+I+A  NHG       
Sbjct: 576 SLASDVYSESTLIDMYAKCGNLKAAMNVF--KTMKEKNIVSWNSIIAACGNHGKLKDSLC 635

Query: 542 LYNQL-KCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNL 601
           L++++ + S  +PDQ+TFL ++++C + G V++G  FF+ M E+YG QP QEHYAC+V+L
Sbjct: 636 LFHEMVEKSGIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDL 695

Query: 602 LGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNY 661
            GRAG + EA E V++MP  PDA VWG LL AC+LH   +LAE A+ +L+D++P N+G Y
Sbjct: 696 FGRAGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYY 755

Query: 662 ILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY 712
           +L+SN +A A  W+ V K+RS ++++ ++K PG SW+EIN +   F   D  HP +  IY
Sbjct: 756 VLISNAHANAREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIY 815

BLAST of Tan0011958 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 401.0 bits (1029), Expect = 2.1e-111
Identity = 227/717 (31.66%), Positives = 395/717 (55.09%), Query Frame = 0

Query: 27  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQ 86
           QS+      S    C +   L+  H      G   + +   KL+     LG    L+ ++
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 87  QVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCL 146
           +VF +     +  +YN+++R     G     +L++  M    +  D+ T+PF L +C   
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKS 147

Query: 147 SNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLI 206
                G +IHG +VK+G      V  +L   Y EC + ++  + FD+M  +++  W+S+I
Sbjct: 148 RAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMI 207

Query: 207 SKNSQNRNG-DENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNL 266
              ++     D   LF   +R E++ P+S+T + ++ + A    ++  + V+     S +
Sbjct: 208 CGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGI 267

Query: 267 CGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS 326
             + L+ +A++ +Y K  ++  A++LF++   ++  + N M S Y R+G   E L +F  
Sbjct: 268 EVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNL 327

Query: 327 MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNI 386
           M  SG+R D  + L  ISS SQLR + WGK  H ++LRNG +S  ++ N+LIDMY +C+ 
Sbjct: 328 MMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHR 387

Query: 387 LDSACKIFNGTTDKTVISWSAMIKGYVKHGR------------------------SLIAL 446
            D+A +IF+  ++KTV++W++++ GYV++G                          L+  
Sbjct: 388 QDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQG 447

Query: 447 SLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSL 506
           SLF   + + C     EG+ AD +TM++I  A  H+GAL+  K+++ Y  K G+     L
Sbjct: 448 SLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRL 507

Query: 507 NTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCS 566
            TTL+  +++CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ +   
Sbjct: 508 GTTLVDMFSRCGDPESAMSIFNS--LTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQ 567

Query: 567 NSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE 626
             KPD V F+G LT C + GLV++GKE F  M++ +G  P   HY CMV+LLGRAGL+ E
Sbjct: 568 GLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEE 627

Query: 627 AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAA 686
           A +L+  MP++P+  +W  LL+AC++    ++A +AAE++  + P+  G+Y+LLSN+YA+
Sbjct: 628 AVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYAS 687

Query: 687 AGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENL 707
           AGRW+ +AK+R  +++KGL+K PG S ++I G+  EF   D++HP   +I  +L+ +
Sbjct: 688 AGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEV 742

BLAST of Tan0011958 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1).)

HSP 1 Score: 401.0 bits (1029), Expect = 2.1e-111
Identity = 227/717 (31.66%), Positives = 395/717 (55.09%), Query Frame = 0

Query: 27  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQ 86
           QS+      S    C +   L+  H      G   + +   KL+     LG    L+ ++
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 87  QVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCL 146
           +VF +     +  +YN+++R     G     +L++  M    +  D+ T+PF L +C   
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKS 147

Query: 147 SNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLI 206
                G +IHG +VK+G      V  +L   Y EC + ++  + FD+M  +++  W+S+I
Sbjct: 148 RAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMI 207

Query: 207 SKNSQNRNG-DENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNL 266
              ++     D   LF   +R E++ P+S+T + ++ + A    ++  + V+     S +
Sbjct: 208 CGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGI 267

Query: 267 CGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS 326
             + L+ +A++ +Y K  ++  A++LF++   ++  + N M S Y R+G   E L +F  
Sbjct: 268 EVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNL 327

Query: 327 MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNI 386
           M  SG+R D  + L  ISS SQLR + WGK  H ++LRNG +S  ++ N+LIDMY +C+ 
Sbjct: 328 MMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHR 387

Query: 387 LDSACKIFNGTTDKTVISWSAMIKGYVKHGR------------------------SLIAL 446
            D+A +IF+  ++KTV++W++++ GYV++G                          L+  
Sbjct: 388 QDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQG 447

Query: 447 SLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSL 506
           SLF   + + C     EG+ AD +TM++I  A  H+GAL+  K+++ Y  K G+     L
Sbjct: 448 SLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRL 507

Query: 507 NTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCS 566
            TTL+  +++CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ +   
Sbjct: 508 GTTLVDMFSRCGDPESAMSIFNS--LTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQ 567

Query: 567 NSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE 626
             KPD V F+G LT C + GLV++GKE F  M++ +G  P   HY CMV+LLGRAGL+ E
Sbjct: 568 GLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEE 627

Query: 627 AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAA 686
           A +L+  MP++P+  +W  LL+AC++    ++A +AAE++  + P+  G+Y+LLSN+YA+
Sbjct: 628 AVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYAS 687

Query: 687 AGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENL 707
           AGRW+ +AK+R  +++KGL+K PG S ++I G+  EF   D++HP   +I  +L+ +
Sbjct: 688 AGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEV 742

BLAST of Tan0011958 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 391.3 bits (1004), Expect = 1.6e-108
Identity = 210/657 (31.96%), Positives = 359/657 (54.64%), Query Frame = 0

Query: 57  HGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLV 116
           +GF  +S L  KL   Y N G L  + +VF  +    +  +N ++  L K G++  ++ +
Sbjct: 123 NGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGL 182

Query: 117 YREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDE 176
           +++M +  +  D  T+  V +S   L +V  G ++HG ++K G    ++V  +L   Y +
Sbjct: 183 FKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLK 242

Query: 177 CIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINL 236
               ++  + FD+M  +D+  W+S+I+    N   ++      +M    +  D  T +++
Sbjct: 243 NQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSV 302

Query: 237 LRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDC 296
               A    I L + VH I + +    +      +L +YSK G L  A+ +F +M +   
Sbjct: 303 FAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSV 362

Query: 297 VVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAH 356
           V +  MI+ YAREG   E ++LF  M   GI  D++T   V++  ++ R +D GK+ H  
Sbjct: 363 VSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEW 422

Query: 357 ILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIA 416
           I  N L   + V N+L+DMY +C  +  A  +F+    K +ISW+ +I GY K+  +  A
Sbjct: 423 IKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEA 482

Query: 417 LSLF-LRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLI 476
           LSLF L ++ +    D  T+  +LPA   + A +  + +HGY ++ G  S   +  +L+ 
Sbjct: 483 LSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVD 542

Query: 477 TYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQ 536
            YAKCG + +A  LF++  +  KDL+ W  MI+ +  HG   +   L+NQ++ +  + D+
Sbjct: 543 MYAKCGALLLAHMLFDD--IASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADE 602

Query: 537 VTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVR 596
           ++F+ LL  C + GLV++G  FF  M      +P+ EHYAC+V++L R G + +A   + 
Sbjct: 603 ISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIE 662

Query: 597 TMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDR 656
            MPI PDA +WG LL  C++H   KLAE  AE++ ++EP+N G Y+L++NIYA A +W++
Sbjct: 663 NMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQ 722

Query: 657 VAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKE 713
           V ++R  +  +GL+K PGCSW+EI G+V  F   D ++P  E+I   L  +   + E
Sbjct: 723 VKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIE 777
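
The Identity and Positives percentages in the HSP headers above are plain ratios of matched alignment columns to alignment length. The Python sketch below re-derives them from a header line as a sanity check. The function name `hsp_percentages` and the regular expression are illustrative assumptions and are not part of any BLAST tool.

```python
import re

# Matches e.g. "Identity = 227/717 (31.66%), Positives = 395/717 (55.09%)".
# Some exports spell the second field "Postives", so the "i" is optional.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)

def hsp_percentages(header: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(header)
    if m is None:
        raise ValueError("not an HSP identity line")
    ident, len_i, pos, len_p = map(int, m.groups())
    return round(100 * ident / len_i, 2), round(100 * pos / len_p, 2)

# Header line copied from the AT3G22690.1 hit.
line = "Identity = 227/717 (31.66%), Positives = 395/717 (55.09%), Query Frame = 0"
print(hsp_percentages(line))  # (31.66, 55.09)
```

Recomputing from the counts rather than reading the printed percentage makes it easy to spot a header whose numbers were damaged in export.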

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q3E6Q1	8.7e-115	32.50	Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9STE1	2.2e-110	30.78	Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...
Q9LUJ2	2.9e-110	31.66	Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
Q9SN39	2.3e-107	31.96	Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9SVP7	1.8e-104	29.96	Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity (%)	Description
XP_022994744.1	0.0e+00	84.97	pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucur...
XP_023541395.1	0.0e+00	84.00	pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucur...
XP_038894029.1	0.0e+00	84.83	pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benin...
KAG7012542.1	0.0e+00	84.14	Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
KAG6573373.1	0.0e+00	84.07	Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name	E-value	Identity (%)	Description
A0A6J1K3Q8	0.0e+00	84.97	pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cuc...
A0A6J1CE61	0.0e+00	84.83	pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Mom...
A0A6J1GR57	0.0e+00	83.45	pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cuc...
A0A0A0M0Z6	0.0e+00	82.62	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G534720 PE=4 SV=1
A0A5D3DB69	0.0e+00	81.52	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name	E-value	Identity (%)	Description
AT1G11290.1	6.2e-116	32.50	Pentatricopeptide repeat (PPR) superfamily protein
AT4G21300.1	1.6e-111	30.78	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.1	2.1e-111	31.66	CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...
AT3G22690.2	2.1e-111	31.66	INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
AT4G18750.1	1.6e-108	31.96	Pentatricopeptide repeat (PPR) superfamily protein
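
The E-values in the tables above are written in scientific notation, so they can be parsed as floats and compared directly. A smaller value means a more significant hit. The Python sketch below ranks four of the Arabidopsis (AT...) rows copied from the last table. The `Hit` tuple is a hypothetical container introduced only for this illustration.

```python
from typing import NamedTuple

class Hit(NamedTuple):  # hypothetical container, not a BLAST or database type
    name: str
    evalue: float
    identity: float  # percent identity

# Rows copied from the table of Arabidopsis (AT...) matches, in arbitrary order.
rows = [
    ("AT4G18750.1", "1.6e-108", "31.96"),
    ("AT1G11290.1", "6.2e-116", "32.50"),
    ("AT3G22690.1", "2.1e-111", "31.66"),
    ("AT4G21300.1", "1.6e-111", "30.78"),
]

hits = [Hit(name, float(ev), float(ident)) for name, ev, ident in rows]

# Lowest E-value first, i.e. most significant hit first.
best_first = sorted(hits, key=lambda h: h.evalue)
print([h.name for h in best_first])
```

An E-value printed as 0.0e+00 parses to 0.0 and sorts ahead of every other hit, which matches its meaning of being below the reporting precision.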
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 577..604
None	No IPR available	PANTHER	PTHR47928	REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED	coord: 1..602
None	No IPR available	PANTHER	PTHR47928:SF25	PPR CONTAINING PLANT-LIKE PROTEIN	coord: 1..602
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 132..229; e-value: 3.8E-17; score: 64.2
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 349..579; e-value: 8.1E-36; score: 125.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 1..131; e-value: 3.6E-7; score: 31.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 240..338; e-value: 2.2E-8; score: 35.8
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 156..536
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 352..373; e-value: 0.89; score: 9.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 250..271; e-value: 0.97; score: 9.8
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 150..177; e-value: 0.0028; score: 17.8
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 279..309; e-value: 4.5E-7; score: 29.7
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 178..208; e-value: 8.9E-9; score: 35.0
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 380..427; e-value: 8.6E-10; score: 38.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 382..415; e-value: 2.4E-6; score: 25.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 150..178; e-value: 2.7E-4; score: 18.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 417..451; e-value: 1.3E-4; score: 19.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 178..211; e-value: 8.5E-8; score: 29.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 279..312; e-value: 7.0E-6; score: 23.9
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 176..210; score: 12.123256
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 277..311; score: 10.358486
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 415..450; score: 8.692369
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 380..414; score: 10.55579
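
The GENE3D hits above are reported as separate coordinate ranges. One way to see how much of the protein they span together is to merge ranges that overlap or touch. The Python sketch below does this for the four GENE3D 1.25.40.10 ranges listed in the table. `merge_ranges` is a hypothetical helper, and treating the coordinates as inclusive (so that 1..131 and 132..229 are contiguous) is an assumption.

```python
def merge_ranges(ranges):
    """Merge inclusive (start, end) ranges that overlap or are adjacent."""
    merged = []
    for start, end in sorted(ranges):
        # Adjacent inclusive ranges (e.g. ..131 and 132..) count as contiguous.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# GENE3D 1.25.40.10 coordinates copied from the InterPro table.
gene3d = [(132, 229), (349, 579), (1, 131), (240, 338)]

merged = merge_ranges(gene3d)
covered = sum(end - start + 1 for start, end in merged)
print(merged)   # [(1, 229), (240, 338), (349, 579)]
print(covered)  # 559
```

The same helper can be applied to the PFAM, TIGRFAM, or PROSITE coordinates to compare how the different sources delimit the repeats.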

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0011958.1	Tan0011958.1	mRNA
Tan0011958.2	Tan0011958.2	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding