Tan0012074 (gene) Snake gourd v1

Overview
NameTan0012074
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
LocationLG09: 34356685 .. 34361462 (+)
RNA-Seq ExpressionTan0012074
SyntenyTan0012074
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCAAATTTCTCCAGGGTTTGTCTTTGGCCATTTCAATTCCGTACCGCCCCTGCCTCCTTTCATAGTTGCTCTTCCTTTGGCTTCCTTCCGTCGAACCCTCCGTCCAGCGTCCGGTATCAGTCAAGCTCTTCATTGACGGTTTTGAAAGTAGATTACTCCTTCATGGATACCAATGATCATTTTCTACAATTTTCTTCGTCCCCTGACATGTTGCATTTCATATGGATATTCCACAACTGCCAAGTTAGTGCAATCTGTTCTAGGCAATTGTGACACTCGTGCAAGGTTCTTTAGCCTTTATTGTAGTGATTTCTTTGTTAATTTGTGTGCTTGTTAATTATAGTCTGGGTCTGTTCTAATTCATGATCTATCTTTTCTTTTCCTTGTTAAATGATTAGCAATACCAAAAATGGAATGATTAATGGCTAGTTTGTTATTCTTCTCCAAATAAAAAAGTACGAAGTTGAAGTTTGCAACAATCATCATACTTTCTTGAACAAGTTTTGGTAGGAACCACCTGCACTGGTTTTCATGTCTTATTTATTTATTTTTGGTTGTGTTGGCCGGAAATCACCGTGCTATTACTTTTTGTGTGACATTTGACTTTAAAAAGAAAAACAAAAAACAGAAAACAAAAAACAAAATCTCAAGAGCTAATGAAATGAATAGCAAGATAAATATGTGGTTTTTGGTTTTTCAAAAAAAAAGCTTATAAGCTTTGCTTGGAGTTATAAATTTCTTTGTTTTGTTATCTACTTTTACAATGTTTCCAAAATCCAAGCCCAGTTTTAAAAACTAAAAAAGTAGTTTTGTAAATTTGTTTTTGTTTTTAGAATTTGGCTAAATCTTTGTTTAAGAAGGATGAGAATCATTACAAGAAAATGATGAGAAAACAAACACAATTTTCAAAAACCAAATGGTTATCAAACAAGGTCCTAGTTTTATGAATATGAGAATTAGAAAGAACAAGATGACAAATTTATTTTTTTTTAAAAAAGGACATGATTCATGGAATAGCATAACAAGTTTGGCTTTAGCCTGGGGACATAGTGCCTGTAGAGAGATGGTTTAGGAGTTTCATCTCCAAAGGAAGTTTTTTGTGTTGGGTCTTTAGGGGAGAGGAATAATAGAATTTTCAAAGGGCTTGAAAGACCTTTGAGTGATATTTGTGGGGTCTTGTTAGGTTCAATGGTCGCTTTGGGTTTCGGTGGTTAAGCTGTTTTGCAACTATTCGTTAGGTCATATTTTGCTTGATTGGGGGCCTTTTCTTTACCTTGCTTCATTCTTTTGTGAGCTCTTTTTTTGTACGCTTTTGTATTTTTTTCAATTTTTTTAATGGAAGTTTGGTTATTTATATAAAAAAATGGAATAGAAATTAATTGCTTGTGAAGGTCAAAATTTAAAAGATATGATTAATTTATGCTAAGGATATTATTAATATATATATATATATATTACATATATTCTTTTCTTTTCCTTCTCACTCCTTCGAGAGTTTGTTTCTGAACATTTCCCTTTTAGGTTATCAGTGAAAATTGTTTTGTTCAAAAAATAAGTTGATAGGAAAAATATTGATTTATATCACAGCTGTGTAGTCGCTTTCACAATTCTTTTTATGGAATATGTTGATCGTCTTGTATTTTAGCTTCATTTAAATGTGATGCTTGCAGATTGAAGGCTTTAAACTTGTTTGAACTGGATAGACAACATAAATTTGTTGGGATTGGTTAGTAGTGTTTAGAAAATCTGAGTGATTTCTTTTGTGAATTTTGTCAACTTTTAGCAATTTATTTACAGAACTCAAACTTATAATTCCTTGGTGTTTTAATGTTAGCATGAAGATAATGATTTCATTCGTGAAGTTATATTAGAAGGTTATTACTATTTTATATTATTATGATGCTATTTTCACGATCAAACTCAATTTTTATGCCAAAAACAACTTGGAGTTTTTGCTATATATGGTTGCCTAAGATTATAGGTAGGGGTGAGGTGTTGACTGGTGTACTGCTACCACGAGTATACCTTTTTGTGATTGCTAGTTAAATACTATGTCTGCCTCCTATACTTAAAAGTTTTATATAAATAATTTTGTTGTGCCTGTGCACAAATTATTTCAACGATGTGCACATAGCCCTCAAGATACCTGTTGATTCTTCAATTAATAAGTTAAGCTTTTTATTGGTTGGTTTTAAGTTTATTTACAGTTTAATTTATTATCTAATGCATTGTAGGTTCTCATGAAGCAAGTTCGTTTAAAGTGACGAGAAAATGTGAGCGTACTACATTGCTTGCATTTATCAGAAAAGTGCTAGATGAAAATATGGTTTGAAATTATTTGTTTTTTCTTAGTTATGTTTGCTCAGAGCGAGAAATCCTTCCTCTACGTTTGACAGTATTTCAAAGAATTCGAAGTCCTCAATACTTTGAGTCCTTTGGCAATGGCTTTCTTTCACAACTATCCTCCTTTGTTCTTTTGATGCCACCTTTCCTTCACTTTCCTTGTTTGCCTCTTAAAAGGTTCATTTCACAGACCTCAACATCCTCCATTAAAATTGTCCCATTCAACCCACAATCCAATGTTCAACCATTTTTCTCACTTCTCCAAGAATTCCCCCGTAACATTTTGTGGGTCAAATCTATTCATGCCCAGATTATTATCACAAACGCCATATCTGGGGACCAGTTTTTGGCTGCAAAGCTTGTTGCTGCATACTCAGGTCTGGGTTGTTTGGAAAATGCACGGAAAGTGTTTGATAAGTTTCCTCAAACAGAAACTGTTCTTTGCAATGCCATGGTTAATGGGTATTTGCAAAATGAGCATTATAGTGAGAGTATTGAGCTGTTTAAGATGATGGGTCGATGTGATTTAGAATTGGATAGTTATACTTGTAATTTTGCTCTTAAGGCGTGCACGTTCTTAATGGACTATGAAATGGGGATGGAGGTGATTAAATTAGCTGTGTGTAAGGGTTTGGATAGAGGTCGGTTTTTAGGAAGTTCGATTTTGAATTTTTTGGTGAAAACTGGTGATATTATGTGTGCACAAAGATTTTTTTATCGAATGATTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGCCTTCATGCAGGAAGGCTTGTTTGGTGAAGGTTTTAAGGTGTTTCTTGATATGCTTTACAATAAAATTGAGCCTAGTGTTGTGACCATGACAAGCTTGATTCAATCCTGTGGGGCAACTAAGAATTTAGAATTTGGAAAATGTATTCATGGCTATGTTCTTGGATTTGGAATGGGTAGTGATACAAGGGTACTGACCTCATTGATTGTTATGTATTGTAAAACGGGTGATGTCGAAAGTGCTGCATGGATTTTTGATACGATGCCTTCGAGGAATTTGGTCTCTTGGAATGCTATGATTTCTGGTTGCGTTCAAAATGGTTTGCTTGTTGAAACATTACATCTCTTTCGAATGTTGGTTACAAGTGATGGAGGTTTTGATTCAGGTACCATTGTTAGCCTCGTCCAGGTTTGTTCTCACAGGGTTGATTTGGATGGCGGGAAAATTCTTCATGGTTGCATTTATCGAAGGGGACTTGATTTAAATTTGGTTCTGTCTACTGCAATCGTTGATCTATATGCTAAATGTGGATACCTAGCTTATGCTTCTTCTGTTTTTGAAAGAATGAAAAATAAGAATGTGATTTCATGGACTGCCATGCTGGTAGGATTGGCACAGAATGGACATGCGAGAGATGCTTTAAGGTTATTTAATCAGATGCAAAATGAGAAGGTTACTTTTAATGCTCTCACCTTAGTTGGTTTAATCCACTGTTGTGCACTCCTAGGCTCATTACATGAAGGGAGAAGTGTGCATGCTATTTTAATTCGCTTCGGTTTTGCTTCCGAAGTTGTCGTTATGACAGCCCTCATTGATATGTATGCAAAATGCAGCAAAATAGACTCAGCTGAGAAGGTATTCAAGTACGGTGTTACACCCAAGGATGTGATTTTATATAACTCTATGATTTCGGGCTATGGAACACATGGTCTCGGGCGTAAGGCACTGTGCGTCTACCATCAAATGAATCAAGGAGGACTTCAGCCAAATGAAAGCACCCTTCTTTCTCTGCTATCTGCTTGTAGCCATTCAGGCCTGGTGGAAGAAGGAATCTCCTTGTTTCACAATATGCAGACTGTCCATAACATAACACCTACCGATAAACTTTACGCCTGTTTTGTAGATCTTCTAAGTCGAGCGGGTCGCCTGCAGCAAGCTGAGGCGTTTATCAATCAAATGCCTTTCATTCCAACTAGTGGTATACTTGAAACTCTGCTGAATGGATGTCTGATGCACAAGGACATTGAATTGGGTGTAAAAATTGCTGACAGATTGCTTTGTTTGGATTCTAGAAATCCCAGCATCTACATTACCTTGTCGAATATATATGCCGAAGCGAGACAATGGGATTCGGTAAAGTATATCCGAGGCCTTATGACTGAGCAGGAAATTAAGAAGATTTCGGGATATAGCTCAATTGAAGTAAATATTTAGTCTGTGTTTTGTGCGCGAAGGGGAGAATTTGCAGCCATGTTGGGCAAACATCAAGTAAATTTTTGGAGTTCAAAGTACTTTCTTTTTAATCTTTTGTTTATTTTTTTTGTATAATCAACACGTCAAACTTAACATATTGTTTATTGTAATCTCATGGAATGTTATGAGAGTGAGGCATACGTAGTAGACATGTATGAGAACTCTAAACCAATATTATGTAAAATTAGAATAACAGAC

mRNA sequence

CTCAAATTTCTCCAGGGTTTGTCTTTGGCCATTTCAATTCCGTACCGCCCCTGCCTCCTTTCATAGTTGCTCTTCCTTTGGCTTCCTTCCGTCGAACCCTCCGTCCAGCGTCCGGTATCAGTCAAGCTCTTCATTGACGGTTTTGAAAGTAGATTACTCCTTCATGGATACCAATGATCATTTTCTACAATTTTCTTCGTCCCCTGACATGTTGCATTTCATATGGATATTCCACAACTGCCAAGTTAGTGCAATCTGTTCTAGGCAATTGTGACACTCGTGCAAGGTTCTCATGAAGCAAGTTCGTTTAAAGTGACGAGAAAATGTGAGCGTACTACATTGCTTGCATTTATCAGAAAAGTGCTAGATGAAAATATGGTTTGAAATTATTTGTTTTTTCTTAGTTATGTTTGCTCAGAGCGAGAAATCCTTCCTCTACGTTTGACAGTATTTCAAAGAATTCGAAGTCCTCAATACTTTGAGTCCTTTGGCAATGGCTTTCTTTCACAACTATCCTCCTTTGTTCTTTTGATGCCACCTTTCCTTCACTTTCCTTGTTTGCCTCTTAAAAGGTTCATTTCACAGACCTCAACATCCTCCATTAAAATTGTCCCATTCAACCCACAATCCAATGTTCAACCATTTTTCTCACTTCTCCAAGAATTCCCCCGTAACATTTTGTGGGTCAAATCTATTCATGCCCAGATTATTATCACAAACGCCATATCTGGGGACCAGTTTTTGGCTGCAAAGCTTGTTGCTGCATACTCAGGTCTGGGTTGTTTGGAAAATGCACGGAAAGTGTTTGATAAGTTTCCTCAAACAGAAACTGTTCTTTGCAATGCCATGGTTAATGGGTATTTGCAAAATGAGCATTATAGTGAGAGTATTGAGCTGTTTAAGATGATGGGTCGATGTGATTTAGAATTGGATAGTTATACTTGTAATTTTGCTCTTAAGGCGTGCACGTTCTTAATGGACTATGAAATGGGGATGGAGGTGATTAAATTAGCTGTGTGTAAGGGTTTGGATAGAGGTCGGTTTTTAGGAAGTTCGATTTTGAATTTTTTGGTGAAAACTGGTGATATTATGTGTGCACAAAGATTTTTTTATCGAATGATTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGCCTTCATGCAGGAAGGCTTGTTTGGTGAAGGTTTTAAGGTGTTTCTTGATATGCTTTACAATAAAATTGAGCCTAGTGTTGTGACCATGACAAGCTTGATTCAATCCTGTGGGGCAACTAAGAATTTAGAATTTGGAAAATGTATTCATGGCTATGTTCTTGGATTTGGAATGGGTAGTGATACAAGGGTACTGACCTCATTGATTGTTATGTATTGTAAAACGGGTGATGTCGAAAGTGCTGCATGGATTTTTGATACGATGCCTTCGAGGAATTTGGTCTCTTGGAATGCTATGATTTCTGGTTGCGTTCAAAATGGTTTGCTTGTTGAAACATTACATCTCTTTCGAATGTTGGTTACAAGTGATGGAGGTTTTGATTCAGGTACCATTGTTAGCCTCGTCCAGGTTTGTTCTCACAGGGTTGATTTGGATGGCGGGAAAATTCTTCATGGTTGCATTTATCGAAGGGGACTTGATTTAAATTTGGTTCTGTCTACTGCAATCGTTGATCTATATGCTAAATGTGGATACCTAGCTTATGCTTCTTCTGTTTTTGAAAGAATGAAAAATAAGAATGTGATTTCATGGACTGCCATGCTGGTAGGATTGGCACAGAATGGACATGCGAGAGATGCTTTAAGGTTATTTAATCAGATGCAAAATGAGAAGGTTACTTTTAATGCTCTCACCTTAGTTGGTTTAATCCACTGTTGTGCACTCCTAGGCTCATTACATGAAGGGAGAAGTGTGCATGCTATTTTAATTCGCTTCGGTTTTGCTTCCGAAGTTGTCGTTATGACAGCCCTCATTGATATGTATGCAAAATGCAGCAAAATAGACTCAGCTGAGAAGGTATTCAAGTACGGTGTTACACCCAAGGATGTGATTTTATATAACTCTATGATTTCGGGCTATGGAACACATGGTCTCGGGCGTAAGGCACTGTGCGTCTACCATCAAATGAATCAAGGAGGACTTCAGCCAAATGAAAGCACCCTTCTTTCTCTGCTATCTGCTTGTAGCCATTCAGGCCTGGTGGAAGAAGGAATCTCCTTGTTTCACAATATGCAGACTGTCCATAACATAACACCTACCGATAAACTTTACGCCTGTTTTGTAGATCTTCTAAGTCGAGCGGGTCGCCTGCAGCAAGCTGAGGCGTTTATCAATCAAATGCCTTTCATTCCAACTAGTGGTATACTTGAAACTCTGCTGAATGGATGTCTGATGCACAAGGACATTGAATTGGGTGTAAAAATTGCTGACAGATTGCTTTGTTTGGATTCTAGAAATCCCAGCATCTACATTACCTTGTCGAATATATATGCCGAAGCGAGACAATGGGATTCGGTAAAGTATATCCGAGGCCTTATGACTGAGCAGGAAATTAAGAAGATTTCGGGATATAGCTCAATTGAAGTAAATATTTAGTCTGTGTTTTGTGCGCGAAGGGGAGAATTTGCAGCCATGTTGGGCAAACATCAAGTAAATTTTTGGAGTTCAAAGTACTTTCTTTTTAATCTTTTGTTTATTTTTTTTGTATAATCAACACGTCAAACTTAACATATTGTTTATTGTAATCTCATGGAATGTTATGAGAGTGAGGCATACGTAGTAGACATGTATGAGAACTCTAAACCAATATTATGTAAAATTAGAATAACAGAC

Coding sequence (CDS)

ATGCCACCTTTCCTTCACTTTCCTTGTTTGCCTCTTAAAAGGTTCATTTCACAGACCTCAACATCCTCCATTAAAATTGTCCCATTCAACCCACAATCCAATGTTCAACCATTTTTCTCACTTCTCCAAGAATTCCCCCGTAACATTTTGTGGGTCAAATCTATTCATGCCCAGATTATTATCACAAACGCCATATCTGGGGACCAGTTTTTGGCTGCAAAGCTTGTTGCTGCATACTCAGGTCTGGGTTGTTTGGAAAATGCACGGAAAGTGTTTGATAAGTTTCCTCAAACAGAAACTGTTCTTTGCAATGCCATGGTTAATGGGTATTTGCAAAATGAGCATTATAGTGAGAGTATTGAGCTGTTTAAGATGATGGGTCGATGTGATTTAGAATTGGATAGTTATACTTGTAATTTTGCTCTTAAGGCGTGCACGTTCTTAATGGACTATGAAATGGGGATGGAGGTGATTAAATTAGCTGTGTGTAAGGGTTTGGATAGAGGTCGGTTTTTAGGAAGTTCGATTTTGAATTTTTTGGTGAAAACTGGTGATATTATGTGTGCACAAAGATTTTTTTATCGAATGATTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGCCTTCATGCAGGAAGGCTTGTTTGGTGAAGGTTTTAAGGTGTTTCTTGATATGCTTTACAATAAAATTGAGCCTAGTGTTGTGACCATGACAAGCTTGATTCAATCCTGTGGGGCAACTAAGAATTTAGAATTTGGAAAATGTATTCATGGCTATGTTCTTGGATTTGGAATGGGTAGTGATACAAGGGTACTGACCTCATTGATTGTTATGTATTGTAAAACGGGTGATGTCGAAAGTGCTGCATGGATTTTTGATACGATGCCTTCGAGGAATTTGGTCTCTTGGAATGCTATGATTTCTGGTTGCGTTCAAAATGGTTTGCTTGTTGAAACATTACATCTCTTTCGAATGTTGGTTACAAGTGATGGAGGTTTTGATTCAGGTACCATTGTTAGCCTCGTCCAGGTTTGTTCTCACAGGGTTGATTTGGATGGCGGGAAAATTCTTCATGGTTGCATTTATCGAAGGGGACTTGATTTAAATTTGGTTCTGTCTACTGCAATCGTTGATCTATATGCTAAATGTGGATACCTAGCTTATGCTTCTTCTGTTTTTGAAAGAATGAAAAATAAGAATGTGATTTCATGGACTGCCATGCTGGTAGGATTGGCACAGAATGGACATGCGAGAGATGCTTTAAGGTTATTTAATCAGATGCAAAATGAGAAGGTTACTTTTAATGCTCTCACCTTAGTTGGTTTAATCCACTGTTGTGCACTCCTAGGCTCATTACATGAAGGGAGAAGTGTGCATGCTATTTTAATTCGCTTCGGTTTTGCTTCCGAAGTTGTCGTTATGACAGCCCTCATTGATATGTATGCAAAATGCAGCAAAATAGACTCAGCTGAGAAGGTATTCAAGTACGGTGTTACACCCAAGGATGTGATTTTATATAACTCTATGATTTCGGGCTATGGAACACATGGTCTCGGGCGTAAGGCACTGTGCGTCTACCATCAAATGAATCAAGGAGGACTTCAGCCAAATGAAAGCACCCTTCTTTCTCTGCTATCTGCTTGTAGCCATTCAGGCCTGGTGGAAGAAGGAATCTCCTTGTTTCACAATATGCAGACTGTCCATAACATAACACCTACCGATAAACTTTACGCCTGTTTTGTAGATCTTCTAAGTCGAGCGGGTCGCCTGCAGCAAGCTGAGGCGTTTATCAATCAAATGCCTTTCATTCCAACTAGTGGTATACTTGAAACTCTGCTGAATGGATGTCTGATGCACAAGGACATTGAATTGGGTGTAAAAATTGCTGACAGATTGCTTTGTTTGGATTCTAGAAATCCCAGCATCTACATTACCTTGTCGAATATATATGCCGAAGCGAGACAATGGGATTCGGTAAAGTATATCCGAGGCCTTATGACTGAGCAGGAAATTAAGAAGATTTCGGGATATAGCTCAATTGAAGTAAATATTTAG

Protein sequence

MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQIIITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESIELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFLVKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMTSLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSRNLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALIDMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNESTLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFINQMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDSVKYIRGLMTEQEIKKISGYSSIEVNI
Homology
BLAST of Tan0012074 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 409.1 bits (1050), Expect = 1.0e-112
Identity = 213/619 (34.41%), Postives = 355/619 (57.35%), Query Frame = 0

Query: 68  DQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESIELFKMMG 127
           ++F+A+ L+ AY   G ++   K+FD+  Q + V+ N M+NGY +       I+ F +M 
Sbjct: 172 NEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCGALDSVIKGFSVMR 231

Query: 128 RCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFLVKTGDIM 187
              +  ++ T +  L  C   +  ++G+++  L V  G+D    + +S+L+   K G   
Sbjct: 232 MDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFD 291

Query: 188 CAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMTSLIQSCG 247
            A + F  M   D V WN MI  ++Q GL  E    F +M+ + + P  +T +SL+ S  
Sbjct: 292 DASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVS 351

Query: 248 ATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSRNLVSWNA 307
             +NLE+ K IH Y++   +  D  + ++LI  Y K   V  A  IF    S ++V + A
Sbjct: 352 KFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTA 411

Query: 308 MISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILHGCIYRRG 367
           MISG + NGL +++L +FR LV      +  T+VS++ V    + L  G+ LHG I ++G
Sbjct: 412 MISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFIIKKG 471

Query: 368 LDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALRLFN 427
            D    +  A++D+YAKCG +  A  +FER+  ++++SW +M+   AQ+ +   A+ +F 
Sbjct: 472 FDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAIDIFR 531

Query: 428 QMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALIDMYAKCS 487
           QM    + ++ +++   +  CA L S   G+++H  +I+   AS+V   + LIDMYAKC 
Sbjct: 532 QMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMYAKCG 591

Query: 488 KIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQM-NQGGLQPNESTLLSL 547
            + +A  VFK  +  K+++ +NS+I+  G HG  + +LC++H+M  + G++P++ T L +
Sbjct: 592 NLKAAMNVFK-TMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQITFLEI 651

Query: 548 LSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFINQMPFIP 607
           +S+C H G V+EG+  F +M   + I P  + YAC VDL  RAGRL +A   +  MPF P
Sbjct: 652 ISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKSMPFPP 711

Query: 608 TSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDSVKYIRG 667
            +G+  TLL  C +HK++EL    + +L+ LD  N   Y+ +SN +A AR+W+SV  +R 
Sbjct: 712 DAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWESVTKVRS 771

Query: 668 LMTEQEIKKISGYSSIEVN 686
           LM E+E++KI GYS IE+N
Sbjct: 772 LMKEREVQKIPGYSWIEIN 789

BLAST of Tan0012074 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 6.8e-109
Identity = 214/630 (33.97%), Postives = 358/630 (56.83%), Query Frame = 0

Query: 61  ITNAISGDQF-----LAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEH 120
           + N I G+ F     L +KL   Y+  G L+ A +VFD+    + +  N ++N   ++  
Sbjct: 116 VDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGD 175

Query: 121 YSESIELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSS 180
           +S SI LFK M    +E+DSYT +   K+ + L     G ++    +  G      +G+S
Sbjct: 176 FSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNS 235

Query: 181 ILNFLVKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPS 240
           ++ F +K   +  A++ F  M E+DV+ WN +I  ++  GL  +G  VF+ ML + IE  
Sbjct: 236 LVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEID 295

Query: 241 VVTMTSLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFD 300
           + T+ S+   C  ++ +  G+ +H   +      + R   +L+ MY K GD++SA  +F 
Sbjct: 296 LATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFR 355

Query: 301 TMPSRNLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDG 360
            M  R++VS+ +MI+G  + GL  E + LF  +       D  T+ +++  C+    LD 
Sbjct: 356 EMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDE 415

Query: 361 GKILHGCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQ 420
           GK +H  I    L  ++ +S A++D+YAKCG +  A  VF  M+ K++ISW  ++ G ++
Sbjct: 416 GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSK 475

Query: 421 NGHARDALRLFNQMQNEK-VTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVV 480
           N +A +AL LFN +  EK  + +  T+  ++  CA L +  +GR +H  ++R G+ S+  
Sbjct: 476 NCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRH 535

Query: 481 VMTALIDMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQG 540
           V  +L+DMYAKC  +  A  +F   +  KD++ +  MI+GYG HG G++A+ +++QM Q 
Sbjct: 536 VANSLVDMYAKCGALLLAHMLFD-DIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQA 595

Query: 541 GLQPNESTLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQ 600
           G++ +E + +SLL ACSHSGLV+EG   F+ M+    I PT + YAC VD+L+R G L +
Sbjct: 596 GIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIK 655

Query: 601 AEAFINQMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAE 660
           A  FI  MP  P + I   LL GC +H D++L  K+A+++  L+  N   Y+ ++NIYAE
Sbjct: 656 AYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAE 715

Query: 661 ARQWDSVKYIRGLMTEQEIKKISGYSSIEV 685
           A +W+ VK +R  + ++ ++K  G S IE+
Sbjct: 716 AEKWEQVKRLRKRIGQRGLRKNPGCSWIEI 744

BLAST of Tan0012074 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 3.7e-107
Identity = 205/626 (32.75%), Postives = 340/626 (54.31%), Query Frame = 0

Query: 59  IIITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSE 118
           ++  N +  + F   KLV+ +   G ++ A +VF+       VL + M+ G+ +     +
Sbjct: 59  LVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDK 118

Query: 119 SIELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILN 178
           +++ F  M   D+E   Y   + LK C    +  +G E+  L V  G     F  + + N
Sbjct: 119 ALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLEN 178

Query: 179 FLVKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVT 238
              K   +  A++ F RM E+D+V WN ++  + Q G+     ++   M    ++PS +T
Sbjct: 179 MYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFIT 238

Query: 239 MTSLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMP 298
           + S++ +  A + +  GK IHGY +  G  S   + T+L+ MY K G +E+A  +FD M 
Sbjct: 239 IVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGML 298

Query: 299 SRNLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKI 358
            RN+VSWN+MI   VQN    E + +F+ ++         +++  +  C+   DL+ G+ 
Sbjct: 299 ERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRF 358

Query: 359 LHGCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGH 418
           +H      GLD N+ +  +++ +Y KC  +  A+S+F +++++ ++SW AM++G AQNG 
Sbjct: 359 IHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGR 418

Query: 419 ARDALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTA 478
             DAL  F+QM++  V  +  T V +I   A L   H  + +H +++R      V V TA
Sbjct: 419 PIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTA 478

Query: 479 LIDMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQP 538
           L+DMYAKC  I  A  +F   ++ + V  +N+MI GYGTHG G+ AL ++ +M +G ++P
Sbjct: 479 LVDMYAKCGAIMIARLIFDM-MSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKP 538

Query: 539 NESTLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAF 598
           N  T LS++SACSHSGLVE G+  F+ M+  ++I  +   Y   VDLL RAGRL +A  F
Sbjct: 539 NGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDF 598

Query: 599 INQMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQW 658
           I QMP  P   +   +L  C +HK++    K A+RL  L+  +   ++ L+NIY  A  W
Sbjct: 599 IMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMW 658

Query: 659 DSVKYIRGLMTEQEIKKISGYSSIEV 685
           + V  +R  M  Q ++K  G S +E+
Sbjct: 659 EKVGQVRVSMLRQGLRKTPGCSMVEI 683

BLAST of Tan0012074 vs. ExPASy Swiss-Prot
Match: Q9FLZ9 (Pentatricopeptide repeat-containing protein At5g39350 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E16 PE=2 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 1.2e-105
Identity = 216/663 (32.58%), Postives = 358/663 (54.00%), Query Frame = 0

Query: 30  NPQSNVQPFFSLLQEF--PRNILWVKSIHAQIIITNAISGDQFLAAKLVAAYSGLGCLEN 89
           N  S+V+ + SLL  F   ++I   K++H  +I    +SG   + + L   Y+  G +  
Sbjct: 10  NALSSVKQYQSLLNHFAATQSISKTKALHCHVITGGRVSG--HILSTLSVTYALCGHITY 69

Query: 90  ARKVFDKFPQTETVLCNAMVNGYLQNEHYSESIELFKMMGRCDLEL--DSYTCNFALKAC 149
           ARK+F++ PQ+  +  N ++  Y++   Y ++I +F  M    ++   D YT  F  KA 
Sbjct: 70  ARKLFEEMPQSSLLSYNIVIRMYVREGLYHDAISVFIRMVSEGVKCVPDGYTYPFVAKAA 129

Query: 150 TFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFLVKTGDIMCAQRFFYRMIEKDVVCWN 209
             L   ++G+ V    +     R +++ +++L   +  G +  A+  F  M  +DV+ WN
Sbjct: 130 GELKSMKLGLVVHGRILRSWFGRDKYVQNALLAMYMNFGKVEMARDVFDVMKNRDVISWN 189

Query: 210 VMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMTSLIQSCGATKNLEFGKCIHGYVLGF 269
            MI  + + G   +   +F  M+   ++    T+ S++  CG  K+LE G+ +H  V   
Sbjct: 190 TMISGYYRNGYMNDALMMFDWMVNESVDLDHATIVSMLPVCGHLKDLEMGRNVHKLVEEK 249

Query: 270 GMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSRNLVSWNAMISGCVQNGLLVETLHLF 329
            +G    V  +L+ MY K G ++ A ++FD M  R++++W  MI+G  ++G +   L L 
Sbjct: 250 RLGDKIEVKNALVNMYLKCGRMDEARFVFDRMERRDVITWTCMINGYTEDGDVENALELC 309

Query: 330 RMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKC 389
           R++       ++ TI SLV VC   + ++ GK LHG   R+ +  ++++ T+++ +YAKC
Sbjct: 310 RLMQFEGVRPNAVTIASLVSVCGDALKVNDGKCLHGWAVRQQVYSDIIIETSLISMYAKC 369

Query: 390 GYLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALRLFNQMQNEKVTFNALTLVGLI 449
             +     VF      +   W+A++ G  QN    DAL LF +M+ E V  N  TL  L+
Sbjct: 370 KRVDLCFRVFSGASKYHTGPWSAIIAGCVQNELVSDALGLFKRMRREDVEPNIATLNSLL 429

Query: 450 HCCALLGSLHEGRSVHAILIRFGFASEVVVMTALIDMYAKCSKIDSAEKVFKYGV----T 509
              A L  L +  ++H  L + GF S +   T L+ +Y+KC  ++SA K+F  G+     
Sbjct: 430 PAYAALADLRQAMNIHCYLTKTGFMSSLDAATGLVHVYSKCGTLESAHKIFN-GIQEKHK 489

Query: 510 PKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNESTLLSLLSACSHSGLVEEGIS 569
            KDV+L+ ++ISGYG HG G  AL V+ +M + G+ PNE T  S L+ACSHSGLVEEG++
Sbjct: 490 SKDVVLWGALISGYGMHGDGHNALQVFMEMVRSGVTPNEITFTSALNACSHSGLVEEGLT 549

Query: 570 LFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFINQMPFIPTSGILETLLNGCLMH 629
           LF  M   +        Y C VDLL RAGRL +A   I  +PF PTS +   LL  C+ H
Sbjct: 550 LFRFMLEHYKTLARSNHYTCIVDLLGRAGRLDEAYNLITTIPFEPTSTVWGALLAACVTH 609

Query: 630 KDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDSVKYIRGLMTEQEIKKISGYSS 685
           ++++LG   A++L  L+  N   Y+ L+NIYA   +W  ++ +R +M    ++K  G+S+
Sbjct: 610 ENVQLGEMAANKLFELEPENTGNYVLLANIYAALGRWKDMEKVRSMMENVGLRKKPGHST 669

BLAST of Tan0012074 vs. ExPASy Swiss-Prot
Match: Q9ZQ74 (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E47 PE=3 SV=1)

HSP 1 Score: 382.1 bits (980), Expect = 1.3e-104
Identity = 232/679 (34.17%), Postives = 368/679 (54.20%), Query Frame = 0

Query: 14  RFISQTSTSSIKIVPFNPQSNV-----QPFFSLLQEFPRNILWVKSIHAQIIITNAISGD 73
           R +S T+   + +   N  S++      P F LL +   NI  ++  H  ++  N + GD
Sbjct: 18  RCVSFTTIKELILTEENDGSSLHYAASSPCFLLLSKC-TNIDSLRQSHG-VLTGNGLMGD 77

Query: 74  QFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESIELFKMMGR 133
             +A KLV+ Y   G  ++AR VFD+ P+ +  L   M+  Y  N+   E ++L+ ++ +
Sbjct: 78  ISIATKLVSLYGFFGYTKDARLVFDQIPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMK 137

Query: 134 CDLELDSYTCNFALKACTFLMDYEMGMEV-IKLAVCKGLDRGRFLGSSILNFLVKTGDIM 193
                D    + ALKACT L D + G ++  +L      D     G  +L+   K G+I 
Sbjct: 138 HGFRYDDIVFSKALKACTELQDLDNGKKIHCQLVKVPSFDNVVLTG--LLDMYAKCGEIK 197

Query: 194 CAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMTSLIQSCG 253
            A + F  +  ++VVCW  MI  +++  L  EG  +F  M  N +  +  T  +LI +C 
Sbjct: 198 SAHKVFNDITLRNVVCWTSMIAGYVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACT 257

Query: 254 ATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSRNLVSWNA 313
               L  GK  HG ++  G+   + ++TSL+ MY K GD+ +A  +F+     +LV W A
Sbjct: 258 KLSALHQGKWFHGCLVKSGIELSSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTA 317

Query: 314 MISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILHGCIYRRG 373
           MI G   NG + E L LF+ +   +   +  TI S++  C    +L+ G+ +HG   + G
Sbjct: 318 MIVGYTHNGSVNEALSLFQKMKGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVG 377

Query: 374 L-DLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALRLF 433
           + D N  ++ A+V +YAKC     A  VFE    K++++W +++ G +QNG   +AL LF
Sbjct: 378 IWDTN--VANALVHMYAKCYQNRDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLF 437

Query: 434 NQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGF--ASEVVVMTALIDMYA 493
           ++M +E VT N +T+  L   CA LGSL  G S+HA  ++ GF  +S V V TAL+D YA
Sbjct: 438 HRMNSESVTPNGVTVASLFSACASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYA 497

Query: 494 KCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNESTLL 553
           KC    SA  +F   +  K+ I +++MI GYG  G    +L ++ +M +   +PNEST  
Sbjct: 498 KCGDPQSARLIFD-TIEEKNTITWSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFT 557

Query: 554 SLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFINQMPF 613
           S+LSAC H+G+V EG   F +M   +N TP+ K Y C VD+L+RAG L+QA   I +MP 
Sbjct: 558 SILSACGHTGMVNEGKKYFSSMYKDYNFTPSTKHYTCMVDMLARAGELEQALDIIEKMPI 617

Query: 614 IPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDSVKYI 673
            P        L+GC MH   +LG  +  ++L L   + S Y+ +SN+YA   +W+  K +
Sbjct: 618 QPDVRCFGAFLHGCGMHSRFDLGEIVIKKMLDLHPDDASYYVLVSNLYASDGRWNQAKEV 677

Query: 674 RGLMTEQEIKKISGYSSIE 684
           R LM ++ + KI+G+S++E
Sbjct: 678 RNLMKQRGLSKIAGHSTME 689

BLAST of Tan0012074 vs. NCBI nr
Match: XP_038877556.1 (pentatricopeptide repeat-containing protein At5g39350-like [Benincasa hispida])

HSP 1 Score: 1158.3 bits (2995), Expect = 0.0e+00
Identity = 569/686 (82.94%), Postives = 612/686 (89.21%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFLHFPC PLKRFIS TS SSIK   FNPQ N+QPF S L+EFP NIL VKSIHAQII
Sbjct: 1   MPPFLHFPCFPLKRFISHTSKSSIKNASFNPQPNLQPFLSFLKEFPHNILSVKSIHAQII 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITNAISGDQFLAAKLVAAYSGLGCLE ARK+FDK PQ +TVLCNAMVNGYLQNEHY+ES+
Sbjct: 61  ITNAISGDQFLAAKLVAAYSGLGCLEYARKLFDKIPQPKTVLCNAMVNGYLQNEHYNESM 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           EL KMM RC LE DSYTCNFALKAC FLMDYE GM VI++AVCKG   GRFLGSSILNFL
Sbjct: 121 ELLKMMSRCGLEFDSYTCNFALKACMFLMDYETGMAVIRIAVCKGFAGGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VKTGDIMCAQ FF++M+EKDVVCWNVMIG  +QEGLF EG+ +FLDMLYN+IEPS VTMT
Sbjct: 181 VKTGDIMCAQIFFHQMVEKDVVCWNVMIGGLVQEGLFNEGYILFLDMLYNEIEPSAVTMT 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SLIQSCG  +NL+FGKC+H YV  FGM SDTRVLTSLI MYCKTGDVESA WIFDTMPSR
Sbjct: 241 SLIQSCGEMRNLKFGKCMHSYVFEFGMSSDTRVLTSLIDMYCKTGDVESARWIFDTMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           N VSWN MISG VQNGL VETLHLF+MLV +DGGFDSGT+VSL+Q+CS   DLDGGKI+H
Sbjct: 301 NWVSWNVMISGYVQNGLFVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTADLDGGKIIH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GCIYRRGLDLNLVLSTAIVDLYAK G +AYASSVFERMKNKNVISWTAMLVGLAQNGHAR
Sbjct: 361 GCIYRRGLDLNLVLSTAIVDLYAKGGSVAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF  MQNEKVT NALTLV L+HCC LLGSL EGRS+HA+L RF FA EVVVMTALI
Sbjct: 421 DALKLFYHMQNEKVTPNALTLVSLVHCCMLLGSLREGRSIHALLTRFHFAFEVVVMTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCSKI+SAEKVFKYG TPKDVILYN+MISGYG HGLGRKALC+YHQMN+ GLQ NE
Sbjct: 481 DMYAKCSKINSAEKVFKYGFTPKDVILYNTMISGYGMHGLGRKALCIYHQMNREGLQSNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGL EEGISLF NM+  HNITPTDKLYAC VDLL RAGRL+QAE  IN
Sbjct: 541 STFVSLLSACSHSGLFEEGISLFRNMEKDHNITPTDKLYACLVDLLCRAGRLRQAEELIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
           QMPF+PTSGILETLL+GCL+HKDIELGVKIADRLL L+SRNPS YITLSNIYAEA +WDS
Sbjct: 601 QMPFLPTSGILETLLSGCLLHKDIELGVKIADRLLSLESRNPSTYITLSNIYAEASRWDS 660

Query: 661 VKYIRGLMTEQEIKKISGYSSIEVNI 687
           VKY+RGLMTEQE+KKI GYSSIEVNI
Sbjct: 661 VKYVRGLMTEQELKKIPGYSSIEVNI 686

BLAST of Tan0012074 vs. NCBI nr
Match: XP_008450740.1 (PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucumis melo] >TYK10182.1 pentatricopeptide repeat-containing protein DOT4 [Cucumis melo var. makuwa])

HSP 1 Score: 1146.7 bits (2965), Expect = 0.0e+00
Identity = 564/686 (82.22%), Postives = 609/686 (88.78%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFLHFPC PLKRFIS TS SS++   FNPQ N+QPF S LQE P NIL VKSIHAQII
Sbjct: 1   MPPFLHFPCFPLKRFISHTSKSSLQNALFNPQPNLQPFLSFLQECPHNILSVKSIHAQII 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITN I GDQFL AKLVAAYSGLGCLE ARKVFD+ PQ +TVLCNAMVNGYLQNEH+++ I
Sbjct: 61  ITNGIYGDQFLVAKLVAAYSGLGCLETARKVFDEIPQPKTVLCNAMVNGYLQNEHFNDCI 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           EL +MM RC LE DSYTCNFALKACTFL+DYEMGMEVI+LAVCKGL RGRFLGSSILNFL
Sbjct: 121 ELLEMMSRCHLEFDSYTCNFALKACTFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VKTGDIMCAQ FF++M EKDVVCWNVMIG FMQEGLF EG+ +F DMLYNKIEPS VTM 
Sbjct: 181 VKTGDIMCAQYFFHQMDEKDVVCWNVMIGGFMQEGLFREGYNLFFDMLYNKIEPSAVTMI 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SLIQSCG T+NL+FGKC+H +VLGFGM SDTRVLT+LI MYCK+GDVESA WIFD MPSR
Sbjct: 241 SLIQSCGETRNLKFGKCMHSFVLGFGMSSDTRVLTTLIDMYCKSGDVESARWIFDNMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           NLVSWN MISG VQNGLLVETL LF+ L+  D GFDSGT+VSL+Q+CS   DLDGGKILH
Sbjct: 301 NLVSWNVMISGYVQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKILH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GCIYRRGLDLNLVLSTAIVDLYAKCG LAYASSVFER+KNKNVISWTAMLVGLAQNGHAR
Sbjct: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGSLAYASSVFERIKNKNVISWTAMLVGLAQNGHAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF+QMQNE+VTFN LTLV L++CC LL  L EGRSVHA L RF FASEVVVMTALI
Sbjct: 421 DALKLFDQMQNERVTFNVLTLVSLVYCCTLLRLLREGRSVHATLTRFHFASEVVVMTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCSKI+SAE VFKYG+TPKDVILYNSMISGYG HGLG KALCVYH+MN+ GLQPNE
Sbjct: 481 DMYAKCSKINSAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGLVEEGI+LF NM   HN TPTDKLYAC VDLLSRAGRLQQAE  IN
Sbjct: 541 STFVSLLSACSHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLQQAEELIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
           QMPF PTSGILETLLNGCL+HKDIELGVK+ADRLL L+SRNPSIYITLSNIYA+A +WDS
Sbjct: 601 QMPFTPTSGILETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWDS 660

Query: 661 VKYIRGLMTEQEIKKISGYSSIEVNI 687
           VK++RGLM EQEIKKI G SSIEVNI
Sbjct: 661 VKHVRGLMMEQEIKKIPGCSSIEVNI 686

BLAST of Tan0012074 vs. NCBI nr
Match: KAG7023956.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1128.6 bits (2918), Expect = 0.0e+00
Identity = 556/686 (81.05%), Postives = 609/686 (88.78%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFL FPC PLK FIS TSTSS K    N   N QPFFS LQEFPR++L VKSIHA+II
Sbjct: 1   MPPFLRFPCSPLKTFISNTSTSSTK----NALFNTQPFFSFLQEFPRDLLSVKSIHARII 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITNAISGDQFL AKLVAAY+ LG LENARKVFDK PQ +TVLCNAMVNGYLQN+HY+E+I
Sbjct: 61  ITNAISGDQFLVAKLVAAYATLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNETI 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           ELFK+MGRC  E DSYTCNFALKAC FL+DYEMGMEVI+LA+CKGL  GRFLGSSILNFL
Sbjct: 121 ELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGMEVIRLALCKGLAGGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VK GDIM A+ FF+ M+EKDVVCWNVMIG FMQEGLF EG+K+FLDML+N+IEPS VTMT
Sbjct: 181 VKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLHNRIEPSAVTMT 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SL+QSCG  ++LEFGKCIH YVLGFGM SDTRVLTSLI MYCKTGDV SA WIFDTMPSR
Sbjct: 241 SLVQSCGDMRDLEFGKCIHSYVLGFGMSSDTRVLTSLIDMYCKTGDVVSARWIFDTMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           NLVSWN MISG VQNGL VETLHLFRMLVT++GGFDS T+VSLVQ+CS   DLDGGKI+H
Sbjct: 301 NLVSWNVMISGYVQNGLRVETLHLFRMLVTNEGGFDSNTVVSLVQLCSRTADLDGGKIVH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GC+YRR LDLNL+LSTAIVDLYAKCG LAYA SVFERMK KNV+SWTAMLVGLAQNG AR
Sbjct: 361 GCVYRRELDLNLILSTAIVDLYAKCGCLAYAYSVFERMKTKNVVSWTAMLVGLAQNGQAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF+QMQNE+VTFNALTLV L+HCC LLGSL EGRSVHA+LIRF FAS+VV  TALI
Sbjct: 421 DALKLFSQMQNERVTFNALTLVSLVHCCTLLGSLREGRSVHAVLIRFRFASDVVAKTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCS+IDS EKVF +G TPKDVILYNSMISGYG HGLG KAL VYHQMNQ  LQPNE
Sbjct: 481 DMYAKCSEIDSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQ-ELQPNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGLVEEGISLF +M+ VH++TPTDKLYACFVDLLSRAGRL+QAE  IN
Sbjct: 541 STFVSLLSACSHSGLVEEGISLFRDMEKVHSVTPTDKLYACFVDLLSRAGRLRQAEEVIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
           QMPF PTSGILETLLNGCL+HKDIELGVKIADRLL  +SRNPS+Y++LSNIYAEA +WD+
Sbjct: 601 QMPFRPTSGILETLLNGCLLHKDIELGVKIADRLLSFESRNPSVYVSLSNIYAEAGRWDT 660

Query: 661 VKYIRGLMTEQEIKKISGYSSIEVNI 687
           V Y+RGLMTEQE+KKI GYSSIEVNI
Sbjct: 661 VNYLRGLMTEQELKKIPGYSSIEVNI 681

BLAST of Tan0012074 vs. NCBI nr
Match: XP_022961380.1 (pentatricopeptide repeat-containing protein At5g39350-like [Cucurbita moschata])

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
Identity = 553/686 (80.61%), Postives = 608/686 (88.63%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFL FPC PLK FIS TSTSS K    N   + QPFFS LQEFPR++L VKSIHA+II
Sbjct: 1   MPPFLRFPCSPLKTFISNTSTSSTK----NALFSTQPFFSFLQEFPRDLLSVKSIHARII 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITNAISG QFL AKLVAAYS LG LENARKVFDK PQ +T+LCNAMVNGYLQN+HY+E+I
Sbjct: 61  ITNAISGGQFLVAKLVAAYSTLGSLENARKVFDKIPQPKTILCNAMVNGYLQNQHYNETI 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           ELFK+MGRC  E DSYTCNFALKAC FL+DYEMGMEVI+LA+CKGL  GRFLGSSILNFL
Sbjct: 121 ELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGMEVIRLALCKGLAGGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VK GDIM A+ FF+ M+EKDVVCWNVMIG FMQEGLF EG+K+FLDML+N+IEPS VTMT
Sbjct: 181 VKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLHNRIEPSAVTMT 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SL+QSCG  ++LEFGKCIH YVLGFGM SDTRVLTSLI MYCKTGDV SA WIFDTMPSR
Sbjct: 241 SLVQSCGDMRDLEFGKCIHSYVLGFGMSSDTRVLTSLIDMYCKTGDVVSARWIFDTMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           NLVSWN MISG VQNGL VETLHLFRMLVT++GGFDS T+VSLVQ+CS   DLDGGKI+H
Sbjct: 301 NLVSWNVMISGYVQNGLRVETLHLFRMLVTNEGGFDSNTVVSLVQLCSRTADLDGGKIVH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GC+YRR LDLNL+LSTAIVDLYAKCG LAYA SVFERMK KNV+SWTAMLVGLAQNG AR
Sbjct: 361 GCVYRRELDLNLILSTAIVDLYAKCGCLAYAYSVFERMKTKNVVSWTAMLVGLAQNGQAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF+QMQNE+VTFNALTLV L+HCC LLGSL EGRSVHA+LIRF FAS+VV  TALI
Sbjct: 421 DALKLFSQMQNERVTFNALTLVSLVHCCTLLGSLREGRSVHAVLIRFRFASDVVAKTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCS+IDS EKVF +G TPKDVILYNSMISGYG HGLG KAL VYHQMNQ  LQPNE
Sbjct: 481 DMYAKCSEIDSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQ-ELQPNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGLVEEGISLF +M+ VH++TPTDKLYACFVDLLSRAGRL+QAE  IN
Sbjct: 541 STFVSLLSACSHSGLVEEGISLFRDMEKVHSVTPTDKLYACFVDLLSRAGRLRQAEEVIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
           QMPF PTSG+LETLLNGCL+HKDIELGVKIADRLL  +SRNPS+Y++LSNIYAEA +WD+
Sbjct: 601 QMPFRPTSGVLETLLNGCLLHKDIELGVKIADRLLSFESRNPSVYVSLSNIYAEAGRWDT 660

Query: 661 VKYIRGLMTEQEIKKISGYSSIEVNI 687
           V Y+RGLMTEQE+KKI GYSSIEVNI
Sbjct: 661 VNYLRGLMTEQELKKIPGYSSIEVNI 681

BLAST of Tan0012074 vs. NCBI nr
Match: KAG6590404.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1113.2 bits (2878), Expect = 0.0e+00
Identity = 548/678 (80.83%), Postives = 600/678 (88.50%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFL FPC PLK FIS TSTSS K    N   N QPFFS LQEFPR++L VKSIHA+II
Sbjct: 1   MPPFLRFPCSPLKTFISNTSTSSTK----NALFNTQPFFSFLQEFPRDLLSVKSIHARII 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITNAISGDQFL AKLVAAYS LG LENARKVFDK PQ +TVLCNAMVNGYLQN+HY+E+I
Sbjct: 61  ITNAISGDQFLVAKLVAAYSTLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQHYNETI 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           ELFK+MGRC  E DSYTCNFALKAC FL+DYEMGMEVI+LA+CKGL  GRFLGSSILNFL
Sbjct: 121 ELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGMEVIRLALCKGLAGGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VK GDIM A+ FF+ M+EKDVVCWNVMIG FMQEGLF EG+K+FLDML+N+IEPS VTMT
Sbjct: 181 VKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLHNRIEPSAVTMT 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SL+QSCG  ++LE GKCIH YVLGFGM SDTRVLTSLI MYCKTGDV SA WIFDTMPSR
Sbjct: 241 SLVQSCGDMRDLELGKCIHSYVLGFGMSSDTRVLTSLIDMYCKTGDVVSARWIFDTMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           NLVSWN MISG VQNGL VETLHLFRMLVT++GGFDS T+VSLVQ+CS   DLDGGKI+H
Sbjct: 301 NLVSWNVMISGYVQNGLRVETLHLFRMLVTNEGGFDSNTVVSLVQLCSRTADLDGGKIVH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GC+YRR LDLNL+LSTAIVDLYAKCG LAYA SVFERMK KNV+SWTAMLVGLAQNG AR
Sbjct: 361 GCVYRRELDLNLILSTAIVDLYAKCGCLAYAYSVFERMKTKNVVSWTAMLVGLAQNGQAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF+QMQNE+VTFNALTLV L+HCC LLGSL EGRSVHA+LIRF FAS+VV  TALI
Sbjct: 421 DALKLFSQMQNERVTFNALTLVSLVHCCTLLGSLREGRSVHAVLIRFRFASDVVAKTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCS+IDS EKVF +G TPKDVILYNSMISGYG HGLG KAL VYHQMNQ  LQPNE
Sbjct: 481 DMYAKCSEIDSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQ-ELQPNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGLVEEGISLF +M+ VH++TPTDKLYACFVDLLSRAGRL+QAE  IN
Sbjct: 541 STFVSLLSACSHSGLVEEGISLFRDMEKVHSVTPTDKLYACFVDLLSRAGRLRQAEEVIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
           QMPF PTSGILETLLNGCL+HKDIELGVKIADRLL  +SRNPS+Y++LSNIYAEA +WD+
Sbjct: 601 QMPFRPTSGILETLLNGCLLHKDIELGVKIADRLLSFESRNPSVYVSLSNIYAEAGRWDT 660

Query: 661 VKYIRGLMTEQEIKKISG 679
           V Y+RGLMTEQE+KKI G
Sbjct: 661 VNYLRGLMTEQELKKIPG 673

BLAST of Tan0012074 vs. ExPASy TrEMBL
Match: A0A5D3CG12 (Pentatricopeptide repeat-containing protein DOT4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold16G003270 PE=4 SV=1)

HSP 1 Score: 1146.7 bits (2965), Expect = 0.0e+00
Identity = 564/686 (82.22%), Postives = 609/686 (88.78%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFLHFPC PLKRFIS TS SS++   FNPQ N+QPF S LQE P NIL VKSIHAQII
Sbjct: 1   MPPFLHFPCFPLKRFISHTSKSSLQNALFNPQPNLQPFLSFLQECPHNILSVKSIHAQII 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITN I GDQFL AKLVAAYSGLGCLE ARKVFD+ PQ +TVLCNAMVNGYLQNEH+++ I
Sbjct: 61  ITNGIYGDQFLVAKLVAAYSGLGCLETARKVFDEIPQPKTVLCNAMVNGYLQNEHFNDCI 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           EL +MM RC LE DSYTCNFALKACTFL+DYEMGMEVI+LAVCKGL RGRFLGSSILNFL
Sbjct: 121 ELLEMMSRCHLEFDSYTCNFALKACTFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VKTGDIMCAQ FF++M EKDVVCWNVMIG FMQEGLF EG+ +F DMLYNKIEPS VTM 
Sbjct: 181 VKTGDIMCAQYFFHQMDEKDVVCWNVMIGGFMQEGLFREGYNLFFDMLYNKIEPSAVTMI 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SLIQSCG T+NL+FGKC+H +VLGFGM SDTRVLT+LI MYCK+GDVESA WIFD MPSR
Sbjct: 241 SLIQSCGETRNLKFGKCMHSFVLGFGMSSDTRVLTTLIDMYCKSGDVESARWIFDNMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           NLVSWN MISG VQNGLLVETL LF+ L+  D GFDSGT+VSL+Q+CS   DLDGGKILH
Sbjct: 301 NLVSWNVMISGYVQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKILH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GCIYRRGLDLNLVLSTAIVDLYAKCG LAYASSVFER+KNKNVISWTAMLVGLAQNGHAR
Sbjct: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGSLAYASSVFERIKNKNVISWTAMLVGLAQNGHAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF+QMQNE+VTFN LTLV L++CC LL  L EGRSVHA L RF FASEVVVMTALI
Sbjct: 421 DALKLFDQMQNERVTFNVLTLVSLVYCCTLLRLLREGRSVHATLTRFHFASEVVVMTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCSKI+SAE VFKYG+TPKDVILYNSMISGYG HGLG KALCVYH+MN+ GLQPNE
Sbjct: 481 DMYAKCSKINSAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGLVEEGI+LF NM   HN TPTDKLYAC VDLLSRAGRLQQAE  IN
Sbjct: 541 STFVSLLSACSHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLQQAEELIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
           QMPF PTSGILETLLNGCL+HKDIELGVK+ADRLL L+SRNPSIYITLSNIYA+A +WDS
Sbjct: 601 QMPFTPTSGILETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWDS 660

Query: 661 VKYIRGLMTEQEIKKISGYSSIEVNI 687
           VK++RGLM EQEIKKI G SSIEVNI
Sbjct: 661 VKHVRGLMMEQEIKKIPGCSSIEVNI 686

BLAST of Tan0012074 vs. ExPASy TrEMBL
Match: A0A1S3BQY0 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103492229 PE=4 SV=1)

HSP 1 Score: 1146.7 bits (2965), Expect = 0.0e+00
Identity = 564/686 (82.22%), Postives = 609/686 (88.78%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFLHFPC PLKRFIS TS SS++   FNPQ N+QPF S LQE P NIL VKSIHAQII
Sbjct: 1   MPPFLHFPCFPLKRFISHTSKSSLQNALFNPQPNLQPFLSFLQECPHNILSVKSIHAQII 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITN I GDQFL AKLVAAYSGLGCLE ARKVFD+ PQ +TVLCNAMVNGYLQNEH+++ I
Sbjct: 61  ITNGIYGDQFLVAKLVAAYSGLGCLETARKVFDEIPQPKTVLCNAMVNGYLQNEHFNDCI 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           EL +MM RC LE DSYTCNFALKACTFL+DYEMGMEVI+LAVCKGL RGRFLGSSILNFL
Sbjct: 121 ELLEMMSRCHLEFDSYTCNFALKACTFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VKTGDIMCAQ FF++M EKDVVCWNVMIG FMQEGLF EG+ +F DMLYNKIEPS VTM 
Sbjct: 181 VKTGDIMCAQYFFHQMDEKDVVCWNVMIGGFMQEGLFREGYNLFFDMLYNKIEPSAVTMI 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SLIQSCG T+NL+FGKC+H +VLGFGM SDTRVLT+LI MYCK+GDVESA WIFD MPSR
Sbjct: 241 SLIQSCGETRNLKFGKCMHSFVLGFGMSSDTRVLTTLIDMYCKSGDVESARWIFDNMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           NLVSWN MISG VQNGLLVETL LF+ L+  D GFDSGT+VSL+Q+CS   DLDGGKILH
Sbjct: 301 NLVSWNVMISGYVQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKILH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GCIYRRGLDLNLVLSTAIVDLYAKCG LAYASSVFER+KNKNVISWTAMLVGLAQNGHAR
Sbjct: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGSLAYASSVFERIKNKNVISWTAMLVGLAQNGHAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF+QMQNE+VTFN LTLV L++CC LL  L EGRSVHA L RF FASEVVVMTALI
Sbjct: 421 DALKLFDQMQNERVTFNVLTLVSLVYCCTLLRLLREGRSVHATLTRFHFASEVVVMTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCSKI+SAE VFKYG+TPKDVILYNSMISGYG HGLG KALCVYH+MN+ GLQPNE
Sbjct: 481 DMYAKCSKINSAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGLVEEGI+LF NM   HN TPTDKLYAC VDLLSRAGRLQQAE  IN
Sbjct: 541 STFVSLLSACSHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLQQAEELIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
           QMPF PTSGILETLLNGCL+HKDIELGVK+ADRLL L+SRNPSIYITLSNIYA+A +WDS
Sbjct: 601 QMPFTPTSGILETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWDS 660

Query: 661 VKYIRGLMTEQEIKKISGYSSIEVNI 687
           VK++RGLM EQEIKKI G SSIEVNI
Sbjct: 661 VKHVRGLMMEQEIKKIPGCSSIEVNI 686

BLAST of Tan0012074 vs. ExPASy TrEMBL
Match: A0A6J1HC27 (pentatricopeptide repeat-containing protein At5g39350-like OS=Cucurbita moschata OX=3662 GN=LOC111461970 PE=4 SV=1)

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
Identity = 553/686 (80.61%), Postives = 608/686 (88.63%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFL FPC PLK FIS TSTSS K    N   + QPFFS LQEFPR++L VKSIHA+II
Sbjct: 1   MPPFLRFPCSPLKTFISNTSTSSTK----NALFSTQPFFSFLQEFPRDLLSVKSIHARII 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITNAISG QFL AKLVAAYS LG LENARKVFDK PQ +T+LCNAMVNGYLQN+HY+E+I
Sbjct: 61  ITNAISGGQFLVAKLVAAYSTLGSLENARKVFDKIPQPKTILCNAMVNGYLQNQHYNETI 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           ELFK+MGRC  E DSYTCNFALKAC FL+DYEMGMEVI+LA+CKGL  GRFLGSSILNFL
Sbjct: 121 ELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGMEVIRLALCKGLAGGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VK GDIM A+ FF+ M+EKDVVCWNVMIG FMQEGLF EG+K+FLDML+N+IEPS VTMT
Sbjct: 181 VKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLHNRIEPSAVTMT 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SL+QSCG  ++LEFGKCIH YVLGFGM SDTRVLTSLI MYCKTGDV SA WIFDTMPSR
Sbjct: 241 SLVQSCGDMRDLEFGKCIHSYVLGFGMSSDTRVLTSLIDMYCKTGDVVSARWIFDTMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           NLVSWN MISG VQNGL VETLHLFRMLVT++GGFDS T+VSLVQ+CS   DLDGGKI+H
Sbjct: 301 NLVSWNVMISGYVQNGLRVETLHLFRMLVTNEGGFDSNTVVSLVQLCSRTADLDGGKIVH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GC+YRR LDLNL+LSTAIVDLYAKCG LAYA SVFERMK KNV+SWTAMLVGLAQNG AR
Sbjct: 361 GCVYRRELDLNLILSTAIVDLYAKCGCLAYAYSVFERMKTKNVVSWTAMLVGLAQNGQAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF+QMQNE+VTFNALTLV L+HCC LLGSL EGRSVHA+LIRF FAS+VV  TALI
Sbjct: 421 DALKLFSQMQNERVTFNALTLVSLVHCCTLLGSLREGRSVHAVLIRFRFASDVVAKTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCS+IDS EKVF +G TPKDVILYNSMISGYG HGLG KAL VYHQMNQ  LQPNE
Sbjct: 481 DMYAKCSEIDSGEKVFNHGFTPKDVILYNSMISGYGMHGLGHKALSVYHQMNQ-ELQPNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGLVEEGISLF +M+ VH++TPTDKLYACFVDLLSRAGRL+QAE  IN
Sbjct: 541 STFVSLLSACSHSGLVEEGISLFRDMEKVHSVTPTDKLYACFVDLLSRAGRLRQAEEVIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
           QMPF PTSG+LETLLNGCL+HKDIELGVKIADRLL  +SRNPS+Y++LSNIYAEA +WD+
Sbjct: 601 QMPFRPTSGVLETLLNGCLLHKDIELGVKIADRLLSFESRNPSVYVSLSNIYAEAGRWDT 660

Query: 661 VKYIRGLMTEQEIKKISGYSSIEVNI 687
           V Y+RGLMTEQE+KKI GYSSIEVNI
Sbjct: 661 VNYLRGLMTEQELKKIPGYSSIEVNI 681

BLAST of Tan0012074 vs. ExPASy TrEMBL
Match: A0A6J1HU79 (pentatricopeptide repeat-containing protein At1g06140, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111467854 PE=4 SV=1)

HSP 1 Score: 1112.1 bits (2875), Expect = 0.0e+00
Identity = 551/686 (80.32%), Postives = 599/686 (87.32%), Query Frame = 0

Query: 1   MPPFLHFPCLPLKRFISQTSTSSIKIVPFNPQSNVQPFFSLLQEFPRNILWVKSIHAQII 60
           MPPFL  PCLPLK FIS TSTSS K    N   N QPFFS LQEFPR++L VKSIHAQ I
Sbjct: 1   MPPFLRLPCLPLKTFISNTSTSSTK----NALFNTQPFFSFLQEFPRDLLSVKSIHAQFI 60

Query: 61  ITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESI 120
           ITNAISGDQ L AKLVAAYS LG LENARKVFDK PQ +TVLCNAMVNGYLQN+ Y+E+I
Sbjct: 61  ITNAISGDQRLVAKLVAAYSTLGSLENARKVFDKIPQPKTVLCNAMVNGYLQNQRYNETI 120

Query: 121 ELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFL 180
           ELFK+MGRC  E DSYTCNFALKAC FL+DYEMGMEVI+LA+CKGL  GRFLGSSILNFL
Sbjct: 121 ELFKLMGRCHFEFDSYTCNFALKACMFLLDYEMGMEVIRLALCKGLAGGRFLGSSILNFL 180

Query: 181 VKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMT 240
           VK GDIM A+ FF+ M+EKDVVCWNVMIG FMQEGLF EG+K+FLDMLYN+IEPS VTMT
Sbjct: 181 VKAGDIMNARIFFHEMVEKDVVCWNVMIGGFMQEGLFSEGYKLFLDMLYNRIEPSAVTMT 240

Query: 241 SLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSR 300
           SL+QSCG  +NLEFGKCIH YVLGFGM SDTRVLTSLI MYCKTGDV SA WIFDTMPSR
Sbjct: 241 SLVQSCGEMRNLEFGKCIHSYVLGFGMSSDTRVLTSLIDMYCKTGDVVSARWIFDTMPSR 300

Query: 301 NLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILH 360
           NLVSWN MISG VQNG  VETLHLF MLVT++GGFDS T+VSLVQ+CS   DLDGGKI+H
Sbjct: 301 NLVSWNVMISGYVQNGFRVETLHLFHMLVTNEGGFDSNTVVSLVQLCSRTADLDGGKIVH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GC+YRR LDLNL+LSTAIVDLYAKCG LAYA SVFERMK KNV+SWTAMLVGLAQNG AR
Sbjct: 361 GCVYRRELDLNLILSTAIVDLYAKCGCLAYAYSVFERMKTKNVVSWTAMLVGLAQNGQAR 420

Query: 421 DALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALI 480
           DAL+LF+QMQNE+VTFNALTLV L+HCC LLGSL EGRSVHA+LIRF FA +VV  TALI
Sbjct: 421 DALKLFSQMQNERVTFNALTLVSLVHCCTLLGSLREGRSVHAVLIRFHFALDVVAKTALI 480

Query: 481 DMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNE 540
           DMYAKCS+IDS EKVF +G TPKDVILYNSMIS YG HG GRKAL VYHQ+NQ  LQPNE
Sbjct: 481 DMYAKCSEIDSGEKVFNHGFTPKDVILYNSMISAYGMHGHGRKALSVYHQINQ-ELQPNE 540

Query: 541 STLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFIN 600
           ST +SLLSACSHSGLVEEGISLF NM+ VHN+TPTDKLYACFVDLLSRAGRL QAE  IN
Sbjct: 541 STFVSLLSACSHSGLVEEGISLFRNMEKVHNVTPTDKLYACFVDLLSRAGRLWQAEEVIN 600

Query: 601 QMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDS 660
            MPF PTSGILETLLNGCL+HK+IELGVKIADRLL L+SRNPS+Y++LSNIYAEA +WD+
Sbjct: 601 HMPFRPTSGILETLLNGCLLHKEIELGVKIADRLLSLESRNPSVYVSLSNIYAEAGRWDT 660

Query: 661 VKYIRGLMTEQEIKKISGYSSIEVNI 687
           V  +R LMTEQE+KKI GYSSIEVNI
Sbjct: 661 VNNLRSLMTEQELKKIPGYSSIEVNI 681

BLAST of Tan0012074 vs. ExPASy TrEMBL
Match: A0A6J1DRK8 (pentatricopeptide repeat-containing protein At4g21300-like OS=Momordica charantia OX=3673 GN=LOC111022546 PE=4 SV=1)

HSP 1 Score: 991.1 bits (2561), Expect = 2.3e-285
Identity = 482/582 (82.82%), Postives = 533/582 (91.58%), Query Frame = 0

Query: 106 MVNGYLQNEHYSESIELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKG 165
           MVNGYLQNEHY ESIELFK+MGRCDLE DSYTCNFALKACTFL+DYEMGMEVI+LAV  G
Sbjct: 1   MVNGYLQNEHYVESIELFKIMGRCDLEFDSYTCNFALKACTFLLDYEMGMEVIRLAVYMG 60

Query: 166 LDRGRFLGSSILNFLVKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFL 225
            DRG+FLGSSILNFLVKTGDI  AQ+FF++M+ KDVVCWNVMIG FM+EGL+ EGF VFL
Sbjct: 61  WDRGKFLGSSILNFLVKTGDIKGAQKFFHQMLGKDVVCWNVMIGGFMKEGLYSEGFSVFL 120

Query: 226 DMLYNKIEPSVVTMTSLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTG 285
           DML++ IEP+ VTMTSLIQ+CG T N+EFGKCIHGY+LGFGM SDTRVLTSLI MYCKTG
Sbjct: 121 DMLFSGIEPTAVTMTSLIQACGETGNVEFGKCIHGYILGFGMSSDTRVLTSLIDMYCKTG 180

Query: 286 DVESAAWIFDTMPSRNLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQ 345
           D+++A WIF++MP RNLVSWNAMISG VQNGL +E L+LFRMLVTSDGGFDS TIVSL+Q
Sbjct: 181 DIKTARWIFNSMPLRNLVSWNAMISGYVQNGLPLEALNLFRMLVTSDGGFDSATIVSLLQ 240

Query: 346 VCSHRVDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVIS 405
           VCSH VDLDGGKILHGC+YR GLDLNL+LSTAIVDLYAKCG LAYA S F+RMK+KNVIS
Sbjct: 241 VCSHAVDLDGGKILHGCVYRSGLDLNLILSTAIVDLYAKCGSLAYAFSFFQRMKSKNVIS 300

Query: 406 WTAMLVGLAQNGHARDALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILI 465
           WTAMLVGL QNGHARDALRLFNQMQNEKV+FNALTL+ L+HCC LLGSL++GRSVHAILI
Sbjct: 301 WTAMLVGLTQNGHARDALRLFNQMQNEKVSFNALTLISLMHCCTLLGSLNKGRSVHAILI 360

Query: 466 RFGFASEVVVMTALIDMYAKCSKIDSAEKVFKYG-VTPKDVILYNSMISGYGTHGLGRKA 525
           RFGF+S+ +  TALIDMYAKCSKIDSAEKVFKYG  TPKDVILYNSMISGYGTHGLG KA
Sbjct: 361 RFGFSSDAIATTALIDMYAKCSKIDSAEKVFKYGFFTPKDVILYNSMISGYGTHGLGHKA 420

Query: 526 LCVYHQMNQGGLQPNESTLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVD 585
           LCVY +M + GLQPNEST +SLL ACSHSGLVEEG+SLF +M+  HNITPTDKLYACFVD
Sbjct: 421 LCVYREMEREGLQPNESTFVSLLFACSHSGLVEEGVSLFSSMKKDHNITPTDKLYACFVD 480

Query: 586 LLSRAGRLQQAEAFINQMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSI 645
           LLSRAGRL+QA+A INQMPFIPTSGILETLLNGCLMHKDIELGVKIADRLL LDS+N SI
Sbjct: 481 LLSRAGRLRQADALINQMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLSLDSKNSSI 540

Query: 646 YITLSNIYAEARQWDSVKYIRGLMTEQEIKKISGYSSIEVNI 687
           YITLSNIYAEARQWDSVKY+RGLM EQE+KKI+GY+SIEV++
Sbjct: 541 YITLSNIYAEARQWDSVKYVRGLMLEQELKKITGYTSIEVSL 582

BLAST of Tan0012074 vs. TAIR 10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 409.1 bits (1050), Expect = 7.2e-114
Identity = 213/619 (34.41%), Postives = 355/619 (57.35%), Query Frame = 0

Query: 68  DQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESIELFKMMG 127
           ++F+A+ L+ AY   G ++   K+FD+  Q + V+ N M+NGY +       I+ F +M 
Sbjct: 172 NEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCGALDSVIKGFSVMR 231

Query: 128 RCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFLVKTGDIM 187
              +  ++ T +  L  C   +  ++G+++  L V  G+D    + +S+L+   K G   
Sbjct: 232 MDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFD 291

Query: 188 CAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMTSLIQSCG 247
            A + F  M   D V WN MI  ++Q GL  E    F +M+ + + P  +T +SL+ S  
Sbjct: 292 DASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVS 351

Query: 248 ATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSRNLVSWNA 307
             +NLE+ K IH Y++   +  D  + ++LI  Y K   V  A  IF    S ++V + A
Sbjct: 352 KFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTA 411

Query: 308 MISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILHGCIYRRG 367
           MISG + NGL +++L +FR LV      +  T+VS++ V    + L  G+ LHG I ++G
Sbjct: 412 MISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFIIKKG 471

Query: 368 LDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALRLFN 427
            D    +  A++D+YAKCG +  A  +FER+  ++++SW +M+   AQ+ +   A+ +F 
Sbjct: 472 FDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAIDIFR 531

Query: 428 QMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTALIDMYAKCS 487
           QM    + ++ +++   +  CA L S   G+++H  +I+   AS+V   + LIDMYAKC 
Sbjct: 532 QMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMYAKCG 591

Query: 488 KIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQM-NQGGLQPNESTLLSL 547
            + +A  VFK  +  K+++ +NS+I+  G HG  + +LC++H+M  + G++P++ T L +
Sbjct: 592 NLKAAMNVFK-TMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQITFLEI 651

Query: 548 LSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFINQMPFIP 607
           +S+C H G V+EG+  F +M   + I P  + YAC VDL  RAGRL +A   +  MPF P
Sbjct: 652 ISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKSMPFPP 711

Query: 608 TSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDSVKYIRG 667
            +G+  TLL  C +HK++EL    + +L+ LD  N   Y+ +SN +A AR+W+SV  +R 
Sbjct: 712 DAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWESVTKVRS 771

Query: 668 LMTEQEIKKISGYSSIEVN 686
           LM E+E++KI GYS IE+N
Sbjct: 772 LMKEREVQKIPGYSWIEIN 789

BLAST of Tan0012074 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 396.4 bits (1017), Expect = 4.8e-110
Identity = 214/630 (33.97%), Postives = 358/630 (56.83%), Query Frame = 0

Query: 61  ITNAISGDQF-----LAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEH 120
           + N I G+ F     L +KL   Y+  G L+ A +VFD+    + +  N ++N   ++  
Sbjct: 116 VDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGD 175

Query: 121 YSESIELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSS 180
           +S SI LFK M    +E+DSYT +   K+ + L     G ++    +  G      +G+S
Sbjct: 176 FSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNS 235

Query: 181 ILNFLVKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPS 240
           ++ F +K   +  A++ F  M E+DV+ WN +I  ++  GL  +G  VF+ ML + IE  
Sbjct: 236 LVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEID 295

Query: 241 VVTMTSLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFD 300
           + T+ S+   C  ++ +  G+ +H   +      + R   +L+ MY K GD++SA  +F 
Sbjct: 296 LATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFR 355

Query: 301 TMPSRNLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDG 360
            M  R++VS+ +MI+G  + GL  E + LF  +       D  T+ +++  C+    LD 
Sbjct: 356 EMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDE 415

Query: 361 GKILHGCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQ 420
           GK +H  I    L  ++ +S A++D+YAKCG +  A  VF  M+ K++ISW  ++ G ++
Sbjct: 416 GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSK 475

Query: 421 NGHARDALRLFNQMQNEK-VTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVV 480
           N +A +AL LFN +  EK  + +  T+  ++  CA L +  +GR +H  ++R G+ S+  
Sbjct: 476 NCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRH 535

Query: 481 VMTALIDMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQG 540
           V  +L+DMYAKC  +  A  +F   +  KD++ +  MI+GYG HG G++A+ +++QM Q 
Sbjct: 536 VANSLVDMYAKCGALLLAHMLFD-DIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQA 595

Query: 541 GLQPNESTLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQ 600
           G++ +E + +SLL ACSHSGLV+EG   F+ M+    I PT + YAC VD+L+R G L +
Sbjct: 596 GIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIK 655

Query: 601 AEAFINQMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAE 660
           A  FI  MP  P + I   LL GC +H D++L  K+A+++  L+  N   Y+ ++NIYAE
Sbjct: 656 AYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAE 715

Query: 661 ARQWDSVKYIRGLMTEQEIKKISGYSSIEV 685
           A +W+ VK +R  + ++ ++K  G S IE+
Sbjct: 716 AEKWEQVKRLRKRIGQRGLRKNPGCSWIEI 744

BLAST of Tan0012074 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 390.6 bits (1002), Expect = 2.6e-108
Identity = 205/626 (32.75%), Postives = 340/626 (54.31%), Query Frame = 0

Query: 59  IIITNAISGDQFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSE 118
           ++  N +  + F   KLV+ +   G ++ A +VF+       VL + M+ G+ +     +
Sbjct: 59  LVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDK 118

Query: 119 SIELFKMMGRCDLELDSYTCNFALKACTFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILN 178
           +++ F  M   D+E   Y   + LK C    +  +G E+  L V  G     F  + + N
Sbjct: 119 ALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLEN 178

Query: 179 FLVKTGDIMCAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVT 238
              K   +  A++ F RM E+D+V WN ++  + Q G+     ++   M    ++PS +T
Sbjct: 179 MYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFIT 238

Query: 239 MTSLIQSCGATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMP 298
           + S++ +  A + +  GK IHGY +  G  S   + T+L+ MY K G +E+A  +FD M 
Sbjct: 239 IVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGML 298

Query: 299 SRNLVSWNAMISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKI 358
            RN+VSWN+MI   VQN    E + +F+ ++         +++  +  C+   DL+ G+ 
Sbjct: 299 ERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRF 358

Query: 359 LHGCIYRRGLDLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGH 418
           +H      GLD N+ +  +++ +Y KC  +  A+S+F +++++ ++SW AM++G AQNG 
Sbjct: 359 IHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGR 418

Query: 419 ARDALRLFNQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGFASEVVVMTA 478
             DAL  F+QM++  V  +  T V +I   A L   H  + +H +++R      V V TA
Sbjct: 419 PIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTA 478

Query: 479 LIDMYAKCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQP 538
           L+DMYAKC  I  A  +F   ++ + V  +N+MI GYGTHG G+ AL ++ +M +G ++P
Sbjct: 479 LVDMYAKCGAIMIARLIFDM-MSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKP 538

Query: 539 NESTLLSLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAF 598
           N  T LS++SACSHSGLVE G+  F+ M+  ++I  +   Y   VDLL RAGRL +A  F
Sbjct: 539 NGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDF 598

Query: 599 INQMPFIPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQW 658
           I QMP  P   +   +L  C +HK++    K A+RL  L+  +   ++ L+NIY  A  W
Sbjct: 599 IMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMW 658

Query: 659 DSVKYIRGLMTEQEIKKISGYSSIEV 685
           + V  +R  M  Q ++K  G S +E+
Sbjct: 659 EKVGQVRVSMLRQGLRKTPGCSMVEI 683

BLAST of Tan0012074 vs. TAIR 10
Match: AT5G39350.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 385.6 bits (989), Expect = 8.5e-107
Identity = 216/663 (32.58%), Postives = 358/663 (54.00%), Query Frame = 0

Query: 30  NPQSNVQPFFSLLQEF--PRNILWVKSIHAQIIITNAISGDQFLAAKLVAAYSGLGCLEN 89
           N  S+V+ + SLL  F   ++I   K++H  +I    +SG   + + L   Y+  G +  
Sbjct: 10  NALSSVKQYQSLLNHFAATQSISKTKALHCHVITGGRVSG--HILSTLSVTYALCGHITY 69

Query: 90  ARKVFDKFPQTETVLCNAMVNGYLQNEHYSESIELFKMMGRCDLEL--DSYTCNFALKAC 149
           ARK+F++ PQ+  +  N ++  Y++   Y ++I +F  M    ++   D YT  F  KA 
Sbjct: 70  ARKLFEEMPQSSLLSYNIVIRMYVREGLYHDAISVFIRMVSEGVKCVPDGYTYPFVAKAA 129

Query: 150 TFLMDYEMGMEVIKLAVCKGLDRGRFLGSSILNFLVKTGDIMCAQRFFYRMIEKDVVCWN 209
             L   ++G+ V    +     R +++ +++L   +  G +  A+  F  M  +DV+ WN
Sbjct: 130 GELKSMKLGLVVHGRILRSWFGRDKYVQNALLAMYMNFGKVEMARDVFDVMKNRDVISWN 189

Query: 210 VMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMTSLIQSCGATKNLEFGKCIHGYVLGF 269
            MI  + + G   +   +F  M+   ++    T+ S++  CG  K+LE G+ +H  V   
Sbjct: 190 TMISGYYRNGYMNDALMMFDWMVNESVDLDHATIVSMLPVCGHLKDLEMGRNVHKLVEEK 249

Query: 270 GMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSRNLVSWNAMISGCVQNGLLVETLHLF 329
            +G    V  +L+ MY K G ++ A ++FD M  R++++W  MI+G  ++G +   L L 
Sbjct: 250 RLGDKIEVKNALVNMYLKCGRMDEARFVFDRMERRDVITWTCMINGYTEDGDVENALELC 309

Query: 330 RMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKC 389
           R++       ++ TI SLV VC   + ++ GK LHG   R+ +  ++++ T+++ +YAKC
Sbjct: 310 RLMQFEGVRPNAVTIASLVSVCGDALKVNDGKCLHGWAVRQQVYSDIIIETSLISMYAKC 369

Query: 390 GYLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALRLFNQMQNEKVTFNALTLVGLI 449
             +     VF      +   W+A++ G  QN    DAL LF +M+ E V  N  TL  L+
Sbjct: 370 KRVDLCFRVFSGASKYHTGPWSAIIAGCVQNELVSDALGLFKRMRREDVEPNIATLNSLL 429

Query: 450 HCCALLGSLHEGRSVHAILIRFGFASEVVVMTALIDMYAKCSKIDSAEKVFKYGV----T 509
              A L  L +  ++H  L + GF S +   T L+ +Y+KC  ++SA K+F  G+     
Sbjct: 430 PAYAALADLRQAMNIHCYLTKTGFMSSLDAATGLVHVYSKCGTLESAHKIFN-GIQEKHK 489

Query: 510 PKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNESTLLSLLSACSHSGLVEEGIS 569
            KDV+L+ ++ISGYG HG G  AL V+ +M + G+ PNE T  S L+ACSHSGLVEEG++
Sbjct: 490 SKDVVLWGALISGYGMHGDGHNALQVFMEMVRSGVTPNEITFTSALNACSHSGLVEEGLT 549

Query: 570 LFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFINQMPFIPTSGILETLLNGCLMH 629
           LF  M   +        Y C VDLL RAGRL +A   I  +PF PTS +   LL  C+ H
Sbjct: 550 LFRFMLEHYKTLARSNHYTCIVDLLGRAGRLDEAYNLITTIPFEPTSTVWGALLAACVTH 609

Query: 630 KDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDSVKYIRGLMTEQEIKKISGYSS 685
           ++++LG   A++L  L+  N   Y+ L+NIYA   +W  ++ +R +M    ++K  G+S+
Sbjct: 610 ENVQLGEMAANKLFELEPENTGNYVLLANIYAALGRWKDMEKVRSMMENVGLRKKPGHST 669

BLAST of Tan0012074 vs. TAIR 10
Match: AT2G03380.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 382.1 bits (980), Expect = 9.4e-106
Identity = 232/679 (34.17%), Postives = 368/679 (54.20%), Query Frame = 0

Query: 14  RFISQTSTSSIKIVPFNPQSNV-----QPFFSLLQEFPRNILWVKSIHAQIIITNAISGD 73
           R +S T+   + +   N  S++      P F LL +   NI  ++  H  ++  N + GD
Sbjct: 18  RCVSFTTIKELILTEENDGSSLHYAASSPCFLLLSKC-TNIDSLRQSHG-VLTGNGLMGD 77

Query: 74  QFLAAKLVAAYSGLGCLENARKVFDKFPQTETVLCNAMVNGYLQNEHYSESIELFKMMGR 133
             +A KLV+ Y   G  ++AR VFD+ P+ +  L   M+  Y  N+   E ++L+ ++ +
Sbjct: 78  ISIATKLVSLYGFFGYTKDARLVFDQIPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMK 137

Query: 134 CDLELDSYTCNFALKACTFLMDYEMGMEV-IKLAVCKGLDRGRFLGSSILNFLVKTGDIM 193
                D    + ALKACT L D + G ++  +L      D     G  +L+   K G+I 
Sbjct: 138 HGFRYDDIVFSKALKACTELQDLDNGKKIHCQLVKVPSFDNVVLTG--LLDMYAKCGEIK 197

Query: 194 CAQRFFYRMIEKDVVCWNVMIGAFMQEGLFGEGFKVFLDMLYNKIEPSVVTMTSLIQSCG 253
            A + F  +  ++VVCW  MI  +++  L  EG  +F  M  N +  +  T  +LI +C 
Sbjct: 198 SAHKVFNDITLRNVVCWTSMIAGYVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACT 257

Query: 254 ATKNLEFGKCIHGYVLGFGMGSDTRVLTSLIVMYCKTGDVESAAWIFDTMPSRNLVSWNA 313
               L  GK  HG ++  G+   + ++TSL+ MY K GD+ +A  +F+     +LV W A
Sbjct: 258 KLSALHQGKWFHGCLVKSGIELSSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTA 317

Query: 314 MISGCVQNGLLVETLHLFRMLVTSDGGFDSGTIVSLVQVCSHRVDLDGGKILHGCIYRRG 373
           MI G   NG + E L LF+ +   +   +  TI S++  C    +L+ G+ +HG   + G
Sbjct: 318 MIVGYTHNGSVNEALSLFQKMKGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVG 377

Query: 374 L-DLNLVLSTAIVDLYAKCGYLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALRLF 433
           + D N  ++ A+V +YAKC     A  VFE    K++++W +++ G +QNG   +AL LF
Sbjct: 378 IWDTN--VANALVHMYAKCYQNRDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLF 437

Query: 434 NQMQNEKVTFNALTLVGLIHCCALLGSLHEGRSVHAILIRFGF--ASEVVVMTALIDMYA 493
           ++M +E VT N +T+  L   CA LGSL  G S+HA  ++ GF  +S V V TAL+D YA
Sbjct: 438 HRMNSESVTPNGVTVASLFSACASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYA 497

Query: 494 KCSKIDSAEKVFKYGVTPKDVILYNSMISGYGTHGLGRKALCVYHQMNQGGLQPNESTLL 553
           KC    SA  +F   +  K+ I +++MI GYG  G    +L ++ +M +   +PNEST  
Sbjct: 498 KCGDPQSARLIFD-TIEEKNTITWSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFT 557

Query: 554 SLLSACSHSGLVEEGISLFHNMQTVHNITPTDKLYACFVDLLSRAGRLQQAEAFINQMPF 613
           S+LSAC H+G+V EG   F +M   +N TP+ K Y C VD+L+RAG L+QA   I +MP 
Sbjct: 558 SILSACGHTGMVNEGKKYFSSMYKDYNFTPSTKHYTCMVDMLARAGELEQALDIIEKMPI 617

Query: 614 IPTSGILETLLNGCLMHKDIELGVKIADRLLCLDSRNPSIYITLSNIYAEARQWDSVKYI 673
            P        L+GC MH   +LG  +  ++L L   + S Y+ +SN+YA   +W+  K +
Sbjct: 618 QPDVRCFGAFLHGCGMHSRFDLGEIVIKKMLDLHPDDASYYVLVSNLYASDGRWNQAKEV 677

Query: 674 RGLMTEQEIKKISGYSSIE 684
           R LM ++ + KI+G+S++E
Sbjct: 678 RNLMKQRGLSKIAGHSTME 689

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9STE11.0e-11234.41Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX... [more]
Q9SN396.8e-10933.97Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q3E6Q13.7e-10732.75Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9FLZ91.2e-10532.58Pentatricopeptide repeat-containing protein At5g39350 OS=Arabidopsis thaliana OX... [more]
Q9ZQ741.3e-10434.17Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_038877556.10.0e+0082.94pentatricopeptide repeat-containing protein At5g39350-like [Benincasa hispida][more]
XP_008450740.10.0e+0082.22PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like ... [more]
KAG7023956.10.0e+0081.05Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022961380.10.0e+0080.61pentatricopeptide repeat-containing protein At5g39350-like [Cucurbita moschata][more]
KAG6590404.10.0e+0080.83Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb... [more]
Match NameE-valueIdentityDescription
A0A5D3CG120.0e+0082.22Pentatricopeptide repeat-containing protein DOT4 OS=Cucumis melo var. makuwa OX=... [more]
A0A1S3BQY00.0e+0082.22pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis ... [more]
A0A6J1HC270.0e+0080.61pentatricopeptide repeat-containing protein At5g39350-like OS=Cucurbita moschata... [more]
A0A6J1HU790.0e+0080.32pentatricopeptide repeat-containing protein At1g06140, mitochondrial-like OS=Cuc... [more]
A0A6J1DRK82.3e-28582.82pentatricopeptide repeat-containing protein At4g21300-like OS=Momordica charanti... [more]
Match NameE-valueIdentityDescription
AT4G21300.17.2e-11434.41Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G18750.14.8e-11033.97Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G11290.12.6e-10832.75Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G39350.18.5e-10732.58Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G03380.19.4e-10634.17Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 100..145
e-value: 5.9E-8
score: 32.8
coord: 503..551
e-value: 3.4E-11
score: 43.2
coord: 401..448
e-value: 1.4E-8
score: 34.8
coord: 199..246
e-value: 7.9E-12
score: 45.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 404..437
e-value: 9.8E-6
score: 23.4
coord: 202..235
e-value: 1.1E-4
score: 20.1
coord: 303..330
e-value: 4.2E-4
score: 18.3
coord: 507..540
e-value: 1.8E-6
score: 25.8
coord: 101..134
e-value: 2.5E-4
score: 19.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 474..497
e-value: 0.034
score: 14.4
coord: 275..301
e-value: 9.3E-4
score: 19.3
coord: 303..329
e-value: 6.8E-5
score: 22.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 402..436
score: 10.347525
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 200..234
score: 10.599635
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 270..304
score: 9.810421
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 99..133
score: 8.900633
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 504..538
score: 11.454616
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 499..683
e-value: 2.8E-30
score: 107.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 151..257
e-value: 5.1E-16
score: 60.5
coord: 51..150
e-value: 7.9E-13
score: 50.1
coord: 356..464
e-value: 1.0E-19
score: 72.6
coord: 262..355
e-value: 4.3E-16
score: 60.8
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 48..684
NoneNo IPR availablePANTHERPTHR47928:SF132PENTATRICOPEPTIDE (PPR) REPEAT PROTEINcoord: 48..684

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0012074.1Tan0012074.1mRNA
Tan0012074.2Tan0012074.2mRNA
Tan0012074.3Tan0012074.3mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding