Tan0010144 (gene) Snake gourd v1

Overview
Name: Tan0010144
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG04: 18467647 .. 18470189 (-)
RNA-Seq Expression: Tan0010144
Synteny: Tan0010144
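
The Location field packs the linkage group, the start and end coordinates, and the strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say), the field could be parsed and the genomic span computed as follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    # Expected shape: "<seqid>: <start> .. <end> (<strand>)"
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return loc.end - loc.start + 1


loc = parse_location("LG04: 18467647 .. 18470189 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the span is 2543 bp on the minus strand of LG04.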
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTGTGGGTCTTTATCAATTGAAATTGCAGTCACCCTAAGCATTCAGGGACTCTGTCCGACTCCAAAACAAGCCCATTTCAATGTCTCAAAAGACTGGAACTCAATTATTAAACATCAAACCAAGCTCAAGAATGACCATGCCATTCTTTCTACATATACCCAGATGGAATCTCTTGGCATTGCACTCGATTCTGCTACAATGCCTCTTGTTTTAAAGGCTTGCGGGAGGCTCAAAGCCATTGACAAAGGGGTACGAATTCATTCTTGTATTAGGCATTCAGATTTGATCAAAGACGTTCGGGTTGGGACTGCCTTGGTCGACTTTTACAGTAAATGTGGGTTTGTTGGGGAAGCCAGTAAAGTGTTCGATGAAATGCCTGAAAGAGATGTAGTGTCTTGGAGTGCTTTAATTTCTGGATATGTTGGGTGTTCTTGCTATAAAGAGGCAGTGTTGTTGTTTATGGAGATGCAAAGGACAGGATTCACACCCAAATCTTGTACTATCGTGGCTCTGCTTTTGGCATGTGCTGAGATGTTTGAAATGAGATTAGGACAAGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTCGATATGGATGCTCATGTTGGTACTGTTTTAGTTGGATTTTATATGAGATTTGATGCAGCAGTTTCACACCGTGTATTTAGCTTGATGATGGTGAGAAATGTAGTGAGTTGGAATGCAATAATAACTGGATATCTTGATATTGGAGATTACACAAAAGCTTTGGAGCTTTTTAGTAGTATGCTGACTGAGGGTGTTAAGTTTGATGCTGTTACAATGTTGGTGTTAATTCAAGCCTGTGCAGAATCTCAATCTCTCCAATTAGGCATGCATCTGCATCAGTTGGCTATTAAGTTCAATTTCATTGATGATTTGTTCATATTAAATGCATTACTGAATATGTACAGTGATAATGGAAGTCTGGAGTCATCATGTGTGTTGTTTAATGCCGTTCCCACCTCTGATGCTGCTTTATGGAATTCTATGATATCAGCATACATTGCCTTCGGATTTCATGCTGAAGCTGTAGCTTTGTTTACTAAAATGCGTTTGGAAGGCATAAAAGAAGATGAAAGAACTGTTGCGATTATGTTATCTTTATGTGAAGATCTAACCGATGGTTTGATACGGGGTAGAGGCTTACATGCTCAAGCCATGAAAAGGGGAATGGAACTAGGTGTACTTCTGGGTAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAAGTTTTTGATAAGATGAGAGGTTTTGACGTCATCTCATGGAACACAATGATATTGGCACTTGCTCAGAGTAAGTTTCGAGCCAAAGCATTCGAAATCTTCATGATGATGTATGAATCAGAATTCAAGTTTAATTCGTACACGATGATATCTCTCCTCGCATTGTGTAGAGATGGAAGTGATCTAGTATTTGGGCGATCAATCCATGGTTTTGCAAGAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATCAATTGCGGTGATGAAGGATCAGCTACAAACCTGTTTAGTAGATGTCATAGAGATTTAATTTCATGGAATTCCCTAATTTCGCTATATAAGGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCCAAGTCTGACAATCATAAATATTCTCACATCCTGTACACAGCTTGCCCATCTACTACTAGGACAGTGCTTGCATGATTACACTACAAGAAGGGAAGAATCTCTTGAATTGGATGCTTCTTTAGCAAATGCTTTGATAACTATGTATGCAAGATGTGGTAAAATGCAATATGCAGAAAAGATTTTTAACACCATGAAGGCAAGAAATATTGTCTCATGGAATGCCATGATAACAGAATATGGCATGCATGGTCGTGGACACGATGCTACTCTAGCATTTTCACAGATGTTGGATGATGGTT
TCAAGCCAAACAATGTATCTTTTGCATCTGTTTTATCTGCCTGCAGCCATTCTGGTTTGACCGAGACAGGTTTGCAGCTTTTCAATTCCATGGTGCGGGATTTTGGTATTACTCCTGAACTTGCTCACTATGGTTGTATGGTCGATCTGCTTGGTCGAGAGCTCTAGCTTTCATCAACTTGATGCCCATTGAACCTGATGCATCAGTTTGGAGGGCTTTGCTCAGTTCATGTCAGGTTAAAAGCAATAAAAAGCTAGTGGAAACCATCTTTGGAAAGCTTGTCGAATTAGAACCAAGCAATCCAGGCCAGGCCAGGTTAGGTCAGGTCGTCCTGACTCGAGTTGAACTGGTCGTCTGGTTTGGCTCGTCTGTTCCATGGACTCCTTCGGTATACCTTCATTATGGATCTCTCGATTTGGCTTGTATTCGACCCAATTGTCTTCTGTAGTTGAATTTTACCCGATTGTCCAAATTTACCCATAACAAAGTGCATTTGAAAAAAGTAAACTTTTTCAATTCCATGAAAAACCTTTTCCAATTCCA

mRNA sequence

ATGTTTGTGGGTCTTTATCAATTGAAATTGCAGTCACCCTAAGCATTCAGGGACTCTGTCCGACTCCAAAACAAGCCCATTTCAATGTCTCAAAAGACTGGAACTCAATTATTAAACATCAAACCAAGCTCAAGAATGACCATGCCATTCTTTCTACATATACCCAGATGGAATCTCTTGGCATTGCACTCGATTCTGCTACAATGCCTCTTGTTTTAAAGGCTTGCGGGAGGCTCAAAGCCATTGACAAAGGGGTACGAATTCATTCTTGTATTAGGCATTCAGATTTGATCAAAGACGTTCGGGTTGGGACTGCCTTGGTCGACTTTTACAGTAAATGTGGGTTTGTTGGGGAAGCCAGTAAAGTGTTCGATGAAATGCCTGAAAGAGATGTAGTGTCTTGGAGTGCTTTAATTTCTGGATATGTTGGGTGTTCTTGCTATAAAGAGGCAGTGTTGTTGTTTATGGAGATGCAAAGGACAGGATTCACACCCAAATCTTGTACTATCGTGGCTCTGCTTTTGGCATGTGCTGAGATGTTTGAAATGAGATTAGGACAAGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTCGATATGGATGCTCATGTTGGTACTGTTTTAGTTGGATTTTATATGAGATTTGATGCAGCAGTTTCACACCGTGTATTTAGCTTGATGATGGTGAGAAATGTAGTGAGTTGGAATGCAATAATAACTGGATATCTTGATATTGGAGATTACACAAAAGCTTTGGAGCTTTTTAGTAGTATGCTGACTGAGGGTGTTAAGTTTGATGCTGTTACAATGTTGGTGTTAATTCAAGCCTGTGCAGAATCTCAATCTCTCCAATTAGGCATGCATCTGCATCAGTTGGCTATTAAGTTCAATTTCATTGATGATTTGTTCATATTAAATGCATTACTGAATATGTACAGTGATAATGGAAGTCTGGAGTCATCATGTGTGTTGTTTAATGCCGTTCCCACCTCTGATGCTGCTTTATGGAATTCTATGATATCAGCATACATTGCCTTCGGATTTCATGCTGAAGCTGTAGCTTTGTTTACTAAAATGCGTTTGGAAGGCATAAAAGAAGATGAAAGAACTGTTGCGATTATGTTATCTTTATGTGAAGATCTAACCGATGGTTTGATACGGGGTAGAGGCTTACATGCTCAAGCCATGAAAAGGGGAATGGAACTAGGTGTACTTCTGGGTAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAAGTTTTTGATAAGATGAGAGGTTTTGACGTCATCTCATGGAACACAATGATATTGGCACTTGCTCAGAGTAAGTTTCGAGCCAAAGCATTCGAAATCTTCATGATGATGTATGAATCAGAATTCAAGTTTAATTCGTACACGATGATATCTCTCCTCGCATTGTGTAGAGATGGAAGTGATCTAGTATTTGGGCGATCAATCCATGGTTTTGCAAGAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATCAATTGCGGTGATGAAGGATCAGCTACAAACCTCTGGAGCCCAAGTCTGACAATCATAAATATTCTCACATCCTGTACACAGCTTGCCCATCTACTACTAGGACAGTGCTTGCATGATTACACTACAAGAAGGGAAGAATCTCTTGAATTGGATGCTTCTTTAGCAAATGCTTTGATAACTATGTATGCAAGATGTGGTAAAATGCAATATGCAGAAAAGATTTTTAACACCATGAAGGCAAGAAATATTGTCTCATGGAATGCCATGATAACAGAATATGGCATGCATGGTCGTGGACACGATGCTACTCTAGCATTTTCACAGATGTTGGATGATGGTTTCAAGCCAAACAATGTATCTTTTGCATCTGTTTTATCTGCCTGCAGCCATTCTGGTTTGACCGAGACAGGTTTGCAGCTTTTCAATTCCATGGTGCGGGATTTTGGTATT
ACTCCTGAACTTGCTCACTATGGTTGTATGGTCGATCTGCTTGGTCGAGAGCTCTAGCTTTCATCAACTTGATGCCCATTGAACCTGATGCATCAGTTTGGAGGGCTTTGCTCAGTTCATGTCAGGTTAAAAGCAATAAAAAGCTAGTGGAAACCATCTTTGGAAAGCTTGTCGAATTAGAACCAAGCAATCCAGGCCAGGCCAGGTTAGGTCAGGTCGTCCTGACTCGAGTTGAACTGGTCGTCTGGTTTGGCTCGTCTGTTCCATGGACTCCTTCGGTATACCTTCATTATGGATCTCTCGATTTGGCTTGTATTCGACCCAATTGTCTTCTGTAGTTGAATTTTACCCGATTGTCCAAATTTACCCATAACAAAGTGCATTTGAAAAAAGTAAACTTTTTCAATTCCATGAAAAACCTTTTCCAATTCCA

Coding sequence (CDS)

ATGGAATCTCTTGGCATTGCACTCGATTCTGCTACAATGCCTCTTGTTTTAAAGGCTTGCGGGAGGCTCAAAGCCATTGACAAAGGGGTACGAATTCATTCTTGTATTAGGCATTCAGATTTGATCAAAGACGTTCGGGTTGGGACTGCCTTGGTCGACTTTTACAGTAAATGTGGGTTTGTTGGGGAAGCCAGTAAAGTGTTCGATGAAATGCCTGAAAGAGATGTAGTGTCTTGGAGTGCTTTAATTTCTGGATATGTTGGGTGTTCTTGCTATAAAGAGGCAGTGTTGTTGTTTATGGAGATGCAAAGGACAGGATTCACACCCAAATCTTGTACTATCGTGGCTCTGCTTTTGGCATGTGCTGAGATGTTTGAAATGAGATTAGGACAAGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTCGATATGGATGCTCATGTTGGTACTGTTTTAGTTGGATTTTATATGAGATTTGATGCAGCAGTTTCACACCGTGTATTTAGCTTGATGATGGTGAGAAATGTAGTGAGTTGGAATGCAATAATAACTGGATATCTTGATATTGGAGATTACACAAAAGCTTTGGAGCTTTTTAGTAGTATGCTGACTGAGGGTGTTAAGTTTGATGCTGTTACAATGTTGGTGTTAATTCAAGCCTGTGCAGAATCTCAATCTCTCCAATTAGGCATGCATCTGCATCAGTTGGCTATTAAGTTCAATTTCATTGATGATTTGTTCATATTAAATGCATTACTGAATATGTACAGTGATAATGGAAGTCTGGAGTCATCATGTGTGTTGTTTAATGCCGTTCCCACCTCTGATGCTGCTTTATGGAATTCTATGATATCAGCATACATTGCCTTCGGATTTCATGCTGAAGCTGTAGCTTTGTTTACTAAAATGCGTTTGGAAGGCATAAAAGAAGATGAAAGAACTGTTGCGATTATGTTATCTTTATGTGAAGATCTAACCGATGGTTTGATACGGGGTAGAGGCTTACATGCTCAAGCCATGAAAAGGGGAATGGAACTAGGTGTACTTCTGGGTAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAAGTTTTTGATAAGATGAGAGGTTTTGACGTCATCTCATGGAACACAATGATATTGGCACTTGCTCAGAGTAAGTTTCGAGCCAAAGCATTCGAAATCTTCATGATGATGTATGAATCAGAATTCAAGTTTAATTCGTACACGATGATATCTCTCCTCGCATTGTGTAGAGATGGAAGTGATCTAGTATTTGGGCGATCAATCCATGGTTTTGCAAGAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATCAATTGCGGTGATGAAGGATCAGCTACAAACCTCTGGAGCCCAAGTCTGACAATCATAAATATTCTCACATCCTGTACACAGCTTGCCCATCTACTACTAGGACAGTGCTTGCATGATTACACTACAAGAAGGGAAGAATCTCTTGAATTGGATGCTTCTTTAGCAAATGCTTTGATAACTATGTATGCAAGATGTGGTAAAATGCAATATGCAGAAAAGATTTTTAACACCATGAAGGCAAGAAATATTGTCTCATGGAATGCCATGATAACAGAATATGGCATGCATGGTCGTGGACACGATGCTACTCTAGCATTTTCACAGATGTTGGATGATGGTTTCAAGCCAAACAATGTATCTTTTGCATCTGTTTTATCTGCCTGCAGCCATTCTGGTTTGACCGAGACAGGTTTGCAGCTTTTCAATTCCATGGTGCGGGATTTTGGTATTACTCCTGAACTTGCTCACTATGGTTGTATGGTCGATCTGCTTGGTCGAGAGCTCTAG

Protein sequence

MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGFVGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLACAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSWNAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMISLLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWSPSLTIINILTSCTQLAHLLLGQCLHDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCMVDLLGREL
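
The protein sequence corresponds to the coding sequence read codon by codon. A minimal sketch of that translation, assuming the standard genetic code and reading from the first base of the CDS, is given below. `translate` is a hypothetical helper written for this illustration, not a function of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS above, and its last four codons.
print(translate("ATGGAATCTCTTGGCATT"))
print(translate("CGAGAGCTCTAG"))
```

The two fragments translate to the start (MESLGI) and the end (REL) of the protein sequence listed above.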
Homology
BLAST of Tan0010144 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 308.9 bits (790), Expect = 1.3e-82
Identity = 202/669 (30.19%), Positives = 319/669 (47.68%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKAC-GRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCG 60
           M S  +  +  T   VL+AC G   A D   +IH+ I +  L     V   L+D YS+ G
Sbjct: 177 MVSENVTPNEGTFSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNG 236

Query: 61  FVGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLL 120
           FV  A +VFD +  +D  SW A+ISG     C  EA+ LF +M   G  P      ++L 
Sbjct: 237 FVDLARRVFDGLRLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLS 296

Query: 121 ACAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVS-HRVFSLMMVRNVV 180
           AC ++  + +G+++HG  L+ G F  D +V   LV  Y      +S   +FS M  R+ V
Sbjct: 297 ACKKIESLEIGEQLHGLVLKLG-FSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAV 356

Query: 181 SWNAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLA 240
           ++N +I G    G   KA+ELF  M  +G++ D+ T+  L+ AC+   +L  G  LH   
Sbjct: 357 TYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYT 416

Query: 241 IKFNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAV 300
            K  F  +  I  ALLN+Y+    +E++   F      +  LWN M+ AY        + 
Sbjct: 417 TKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSF 476

Query: 301 ALFTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSM 360
            +F +M++E I  ++ T   +L  C  L D L  G  +H+Q +K   +L   + + L+ M
Sbjct: 477 RIFRQMQIEEIVPNQYTYPSILKTCIRLGD-LELGEQIHSQIIKTNFQLNAYVCSVLIDM 536

Query: 361 YVEHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTM 420
           Y +  ++D A  +  +  G DV+SW TMI    Q  F  KA   F  M +   + +   +
Sbjct: 537 YAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGL 596

Query: 421 ISLLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCG-----------DEG 480
            + ++ C     L  G+ IH  A  +G   +     +L  +Y  CG            E 
Sbjct: 597 TNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEA 656

Query: 481 SATNLWSP-----------------------------SLTIINILTSCTQLAHLLLGQCL 540
                W+                              + T  + + + ++ A++  G+ +
Sbjct: 657 GDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQV 716

Query: 541 HDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHG 600
           H   T+     + +  + NALI+MYA+CG +  AEK F  +  +N VSWNA+I  Y  HG
Sbjct: 717 HAVITK--TGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHG 776

Query: 601 RGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHY 628
            G +A  +F QM+    +PN+V+   VLSACSH GL + G+  F SM  ++G++P+  HY
Sbjct: 777 FGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHY 836

BLAST of Tan0010144 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 1.9e-81
Identity = 189/624 (30.29%), Positives = 309/624 (49.52%), Query Frame = 0

Query: 8   LDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGFVGEASKV 67
           +D  T+  VL+ C   K++  G  + + IR +  + D  +G+ L   Y+ CG + EAS+V
Sbjct: 92  IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRV 151

Query: 68  FDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLACAEMFEM 127
           FDE+     + W+ L++       +  ++ LF +M  +G    S T   +  + + +  +
Sbjct: 152 FDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSV 211

Query: 128 RLGQEIHGYCLRNGLFDMDAHVGTVLVGFYM---RFDAAVSHRVFSLMMVRNVVSWNAII 187
             G+++HG+ L++G  + ++ VG  LV FY+   R D+A   +VF  M  R+V+SWN+II
Sbjct: 212 HGGEQLHGFILKSGFGERNS-VGNSLVAFYLKNQRVDSA--RKVFDEMTERDVISWNSII 271

Query: 188 TGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFI 247
            GY+  G   K L +F  ML  G++ D  T++ +   CA+S+ + LG  +H + +K  F 
Sbjct: 272 NGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFS 331

Query: 248 DDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKM 307
            +    N LL+MYS  G L+S+  +F  +       + SMI+ Y   G   EAV LF +M
Sbjct: 332 REDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEM 391

Query: 308 RLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHNQ 367
             EGI  D  TV  +L+ C      L  G+ +H    +  +   + + NAL+ MY +   
Sbjct: 392 EEEGISPDVYTVTAVLNCCARYR-LLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGS 451

Query: 368 IDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIF-MMMYESEFKFNSYTMISLLA 427
           +  A+ VF +MR  D+ISWNT+I   +++ +  +A  +F +++ E  F  +  T+  +L 
Sbjct: 452 MQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLP 511

Query: 428 LCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWSPSLTIINIL 487
            C   S    GR IHG+  +NG              Y +                     
Sbjct: 512 ACASLSAFDKGREIHGYIMRNG--------------YFS--------------------- 571

Query: 488 TSCTQLAHLLLGQCLHDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARN 547
                                       D  +AN+L+ MYA+CG +  A  +F+ + +++
Sbjct: 572 ----------------------------DRHVANSLVDMYAKCGALLLAHMLFDDIASKD 631

Query: 548 IVSWNAMITEYGMHGRGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFN 607
           +VSW  MI  YGMHG G +A   F+QM   G + + +SF S+L ACSHSGL + G + FN
Sbjct: 632 LVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFN 648

Query: 608 SMVRDFGITPELAHYGCMVDLLGR 628
            M  +  I P + HY C+VD+L R
Sbjct: 692 IMRHECKIEPTVEHYACIVDMLAR 648

BLAST of Tan0010144 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 302.8 bits (774), Expect = 9.3e-81
Identity = 193/623 (30.98%), Positives = 315/623 (50.56%), Query Frame = 0

Query: 6   IALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGFVGEAS 65
           ++ D  T P V+KAC  L   + G  ++  I       D+ VG ALVD YS+ G +  A 
Sbjct: 102 VSPDKYTFPSVIKACAGLFDAEMGDLVYEQILDMGFESDLFVGNALVDMYSRMGLLTRAR 161

Query: 66  KVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLACAEMF 125
           +VFDEMP RD+VSW++LISGY     Y+EA+ ++ E++ +   P S T+ ++L A   + 
Sbjct: 162 QVFDEMPVRDLVSWNSLISGYSSHGYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLL 221

Query: 126 EMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAV-SHRVFSLMMVRNVVSWNAII 185
            ++ GQ +HG+ L++G+  +   V   LV  Y++F     + RVF  M VR+ VS+N +I
Sbjct: 222 VVKQGQGLHGFALKSGVNSV-VVVNNGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMI 281

Query: 186 TGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFI 245
            GYL +    +++ +F   L +  K D +T+  +++AC   + L L  +++   +K  F+
Sbjct: 282 CGYLKLEMVEESVRMFLENLDQ-FKPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFV 341

Query: 246 DDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKM 305
            +  + N L+++Y+  G + ++  +FN++   D   WNS+IS YI  G   EA+ LF  M
Sbjct: 342 LESTVRNILIDVYAKCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMM 401

Query: 306 RLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHNQ 365
            +   + D  T  +++S+   L D L  G+GLH+  +K G+ + + + NAL+ MY +  +
Sbjct: 402 MIMEEQADHITYLMLISVSTRLAD-LKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGE 461

Query: 366 IDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMISLLAL 425
           +  + K+F  M   D ++WNT+I A  +                                
Sbjct: 462 VGDSLKIFSSMGTGDTVTWNTVISACVR-------------------------------- 521

Query: 426 CRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWSPSLTIINILT 485
                   FG     FA   GL++ T +  S                +     T +  L 
Sbjct: 522 --------FG----DFA--TGLQVTTQMRKS---------------EVVPDMATFLVTLP 581

Query: 486 SCTQLAHLLLGQCLHDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNI 545
            C  LA   LG+ +H    R     E +  + NALI MY++CG ++ + ++F  M  R++
Sbjct: 582 MCASLAAKRLGKEIHCCLLR--FGYESELQIGNALIEMYSKCGCLENSSRVFERMSRRDV 641

Query: 546 VSWNAMITEYGMHGRGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNS 605
           V+W  MI  YGM+G G  A   F+ M   G  P++V F +++ ACSHSGL + GL  F  
Sbjct: 642 VTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEGLACFEK 658

Query: 606 MVRDFGITPELAHYGCMVDLLGR 628
           M   + I P + HY C+VDLL R
Sbjct: 702 MKTHYKIDPMIEHYACVVDLLSR 658

BLAST of Tan0010144 vs. ExPASy Swiss-Prot
Match: Q0WN60 (Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H8 PE=2 SV=2)

HSP 1 Score: 300.1 bits (767), Expect = 6.1e-80
Identity = 197/664 (29.67%), Positives = 328/664 (49.40%), Query Frame = 0

Query: 15  LVLKACGRLKAIDKGVRIHSCIRHSDLIK-DVRVGTALVDFYSKCGFVGEASKVFDEMPE 74
           L+L+A G+ K I+ G +IH  +  S  ++ D  + T ++  Y+ CG   ++  VFD +  
Sbjct: 89  LLLQASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVFDALRS 148

Query: 75  RDVVSWSALISGYVGCSCYKEAVLLFMEM-QRTGFTPKSCTIVALLLACAEMFEMRLGQE 134
           +++  W+A+IS Y     Y E +  F+EM   T   P   T   ++ ACA M ++ +G  
Sbjct: 149 KNLFQWNAVISSYSRNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGIGLA 208

Query: 135 IHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAV-SHRVFSLMMVRNVVSWNAIITGYLDIG 194
           +HG  ++ GL + D  VG  LV FY        + ++F +M  RN+VSWN++I  + D G
Sbjct: 209 VHGLVVKTGLVE-DVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDNG 268

Query: 195 DYTKALELFSSMLTE----GVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFIDDL 254
              ++  L   M+ E        D  T++ ++  CA  + + LG  +H  A+K     +L
Sbjct: 269 FSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKEL 328

Query: 255 FILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKMRL- 314
            + NAL++MYS  G + ++ ++F      +   WN+M+  + A G       +  +M   
Sbjct: 329 VLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQMLAG 388

Query: 315 -EGIKEDERTVAIMLSLC--EDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHN 374
            E +K DE T+   + +C  E     L   + LH  ++K+      L+ NA ++ Y +  
Sbjct: 389 GEDVKADEVTILNAVPVCFHESFLPSL---KELHCYSLKQEFVYNELVANAFVASYAKCG 448

Query: 375 QIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMISLLA 434
            +  AQ+VF  +R   V SWN +I   AQS     + +  + M  S    +S+T+ SLL+
Sbjct: 449 SLSYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLS 508

Query: 435 LCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGD-----------EGSATNL 494
            C     L  G+ +HGF  +N LE +  +  S+  +YI+CG+           E  +   
Sbjct: 509 ACSKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVS 568

Query: 495 WSPSLT-----------------------------IINILTSCTQLAHLLLGQCLHDYTT 554
           W+  +T                             ++ +  +C+ L  L LG+  H Y  
Sbjct: 569 WNTVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYAL 628

Query: 555 RREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGHDA 614
           +    LE DA +A +LI MYA+ G +  + K+FN +K ++  SWNAMI  YG+HG   +A
Sbjct: 629 K--HLLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEA 688

Query: 615 TLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCMVD 628
              F +M   G  P++++F  VL+AC+HSGL   GL+  + M   FG+ P L HY C++D
Sbjct: 689 IKLFEEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVID 746

BLAST of Tan0010144 vs. ExPASy Swiss-Prot
Match: Q9CA56 (Pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E69 PE=3 SV=1)

HSP 1 Score: 299.7 bits (766), Expect = 7.9e-80
Identity = 189/623 (30.34%), Positives = 311/623 (49.92%), Query Frame = 0

Query: 9   DSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGFVGEASKVF 68
           DS T   VL AC  L+ +  G  + + +      +DV V TA+VD Y+KCG + EA +VF
Sbjct: 250 DSYTYSSVLAACASLEKLRFGKVVQARVIKCG-AEDVFVCTAIVDLYAKCGHMAEAMEVF 309

Query: 69  DEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLACAEMFEMR 128
             +P   VVSW+ ++SGY   +    A+ +F EM+ +G    +CT+ +++ AC     + 
Sbjct: 310 SRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACGRPSMVC 369

Query: 129 LGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRF-DAAVSHRVFSLM---MVRNVVSWNAII 188
              ++H +  ++G F +D+ V   L+  Y +  D  +S +VF  +     +N+V  N +I
Sbjct: 370 EASQVHAWVFKSG-FYLDSSVAAALISMYSKSGDIDLSEQVFEDLDDIQRQNIV--NVMI 429

Query: 189 TGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFI 248
           T +       KA+ LF+ ML EG++ D  ++  L+        L LG  +H   +K   +
Sbjct: 430 TSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVL---DCLNLGKQVHGYTLKSGLV 489

Query: 249 DDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKM 308
            DL + ++L  +YS  GSLE S  LF  +P  D A W SMIS +  +G+  EA+ LF++M
Sbjct: 490 LDLTVGSSLFTLYSKCGSLEESYKLFQGIPFKDNACWASMISGFNEYGYLREAIGLFSEM 549

Query: 309 RLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHNQ 368
             +G   DE T+A +L++C      L RG+ +H   ++ G++ G+ LG+AL++MY +   
Sbjct: 550 LDDGTSPDESTLAAVLTVCSS-HPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCGS 609

Query: 369 IDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMISLLAL 428
           +  A++V+D++   D +S +++I   +Q       F +F  M  S F  +S+ + S+L  
Sbjct: 610 LKLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTMDSFAISSILKA 669

Query: 429 CRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWSPSLTIINILT 488
                +   G  +H +  K GL                                      
Sbjct: 670 AALSDESSLGAQVHAYITKIGL-------------------------------------- 729

Query: 489 SCTQLAHLLLGQCLHDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNI 548
            CT                        + S+ ++L+TMY++ G +    K F+ +   ++
Sbjct: 730 -CT------------------------EPSVGSSLLTMYSKFGSIDDCCKAFSQINGPDL 789

Query: 549 VSWNAMITEYGMHGRGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNS 608
           ++W A+I  Y  HG+ ++A   ++ M + GFKP+ V+F  VLSACSH GL E      NS
Sbjct: 790 IAWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSHGGLVEESYFHLNS 801

Query: 609 MVRDFGITPELAHYGCMVDLLGR 628
           MV+D+GI PE  HY CMVD LGR
Sbjct: 850 MVKDYGIEPENRHYVCMVDALGR 801

BLAST of Tan0010144 vs. NCBI nr
Match: XP_023512048.1 (pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023512049.1 pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023512050.1 pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1088.2 bits (2813), Expect = 0.0e+00
Identity = 552/666 (82.88%), Positives = 588/666 (88.29%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGIA DSATMPLVLKACGRL AI+KGVRIHSCIR SDLI+DVRVGTALVDFYSKCG 
Sbjct: 49  MESLGIAPDSATMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGL 108

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           VGEASKVFDEMPERD+VSW+ALISGYVGCSCYKEAVLLFMEMQ+ G TP S T+V LLLA
Sbjct: 109 VGEASKVFDEMPERDLVSWNALISGYVGCSCYKEAVLLFMEMQKAGLTPNSRTVVPLLLA 168

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           CAEM E+RLG EIHGYCLRNGLFDMDAHVGT L+GFYMRFDAAVSHRVFSLM VRNVVSW
Sbjct: 169 CAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHRVFSLMEVRNVVSW 228

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NA+ITGYL+IGDYTKAL+LFSSMLTEG+KFDAVTML++IQACAES+SLQLGM LHQLAIK
Sbjct: 229 NAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIK 288

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FNF+DDLF+LNALLNMYSDNG LESSC LFNAVPTSDAALWNSMISAYIAFGFHAEA+AL
Sbjct: 289 FNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIAL 348

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           + KMRLEG+KED+RTVAIMLSLCEDL DG I GRGLHA AMK GMEL V LGNALLSMYV
Sbjct: 349 YIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYV 408

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           EHNQIDAAQK+FDKMRG DVISWNTMILALAQSKFRAKAF++FM M ESE KFNSYTMIS
Sbjct: 409 EHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMTMCESEIKFNSYTMIS 468

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNL-------- 480
           LLALC+DGSDLVFGRSIHGFA KNGLEINTSLNTSLTEMYINC DEGSATNL        
Sbjct: 469 LLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGSATNLFIRCPQRD 528

Query: 481 ---WSP----------------------------SLTIINILTSCTQLAHLLLGQCLHDY 540
              W+                             S+TII+ILTSCTQLAHL LGQCLH Y
Sbjct: 529 LISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAY 588

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
           TTRR ES ELDASLANA ITMYARCGKMQYAEKIFNT++ARNIVSWNAMIT YGMHGRGH
Sbjct: 589 TTRRGESFELDASLANAFITMYARCGKMQYAEKIFNTLQARNIVSWNAMITGYGMHGRGH 648

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNN+SF SVLSACSHSGLT+TGLQLF+SMVRDFGI P+LAHYGC+
Sbjct: 649 DATLAFAQMLDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCI 708

BLAST of Tan0010144 vs. NCBI nr
Match: XP_022986695.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita maxima] >XP_022986696.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 547/666 (82.13%), Positives = 583/666 (87.54%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGIA DSATMPLVLKACGRL AI+KG RIHSCIR SDLI+DVRVGTALVDFYSKCG 
Sbjct: 49  MESLGIAPDSATMPLVLKACGRLNAIEKGARIHSCIRDSDLIRDVRVGTALVDFYSKCGL 108

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           V EASKVFDEMPERD+VSW+ALISGYVGCSCYKEAVLLFMEMQ+ G TP S T+V LLLA
Sbjct: 109 VREASKVFDEMPERDLVSWNALISGYVGCSCYKEAVLLFMEMQKAGLTPNSRTVVPLLLA 168

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           CAEM E+RLG EIHGYCLRNGLFDMDAHVGT L+GFYMRFDAAVSHRVFSLM +RNVVSW
Sbjct: 169 CAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHRVFSLMEMRNVVSW 228

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NA+ITGYL+IGDY KAL+LFSSMLTEG+KFDAVTML++IQACAES+SLQLGM LHQLAIK
Sbjct: 229 NAMITGYLNIGDYPKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIK 288

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FNFIDDLF+LNALLNMYSDNG LESSC LFNAVPTSDAALWNSMISAYIAFGFHAEA+AL
Sbjct: 289 FNFIDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIAL 348

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           + KMRLEG+KED+RTVAIMLSLCEDL DG I GRGLHA AMK GMEL V LGNALLSMYV
Sbjct: 349 YIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYV 408

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           EHNQIDAAQK+FDK RG DVISWNTMILALAQSKFRAKAFE+FM M ESE KFNSYTMIS
Sbjct: 409 EHNQIDAAQKLFDKTRGLDVISWNTMILALAQSKFRAKAFELFMTMCESEIKFNSYTMIS 468

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNL-------- 480
           LLALC+DGSDLVFGRSIHGFA KNGLEINTSLNTSLTEMYIN  DEGSATNL        
Sbjct: 469 LLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINSSDEGSATNLFIRCPERD 528

Query: 481 ---WSP----------------------------SLTIINILTSCTQLAHLLLGQCLHDY 540
              W+                             S+TII+ILTSCTQLAHL LGQCLH Y
Sbjct: 529 LISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAY 588

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
           TTRR ES ELDASLANA ITMYARCGKMQYAEKIFNT++ARNIVSWNAMIT YGMHGRGH
Sbjct: 589 TTRRGESFELDASLANAFITMYARCGKMQYAEKIFNTLQARNIVSWNAMITGYGMHGRGH 648

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNN+SF SVLSACSHSGLT+TGLQLF+SMVRDFGI P+LAHYGC+
Sbjct: 649 DATLAFAQMLDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCI 708

BLAST of Tan0010144 vs. NCBI nr
Match: KAG6570395.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1073.5 bits (2775), Expect = 4.8e-310
Identity = 548/666 (82.28%), Positives = 584/666 (87.69%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGIA DSATMPLVLKACGRL AI+KGVRIHSCIR SDLI+DVRVGTALVDFYSKCG 
Sbjct: 49  MESLGIAPDSATMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGL 108

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           VGEASKVFDEMPERD+VSW+ALISGYVGCSCYKEAVLLF+EMQ+ G TP S T+V LLLA
Sbjct: 109 VGEASKVFDEMPERDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLA 168

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           CAEM E+RLG EIHGYCLRNGLFDMDAHVGT L+GFYMRFDAAVSHRVFS M VRNVVSW
Sbjct: 169 CAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHRVFSSMEVRNVVSW 228

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NA+ITGYL+IGDYTKAL+LFSSMLTEG+KFDAVTML++IQACAES+SLQLGM LHQLAIK
Sbjct: 229 NAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIK 288

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FNFI DLF+LNALLNMYSDNG LESSC LFNAVPTSDAALWNSMISAYIAFGFHAEA+AL
Sbjct: 289 FNFIGDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIAL 348

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           + KMRLEG+KED+RTV IMLSLCEDL DG I GRGLHA AMK GMEL V LGNALLSMYV
Sbjct: 349 YIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYV 408

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           EHNQIDAAQK+FDKMRG DVIS NTMILALA+SKFRAKAFE+FM M ESE KFNSYTMIS
Sbjct: 409 EHNQIDAAQKLFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMIS 468

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNL-------- 480
           LLALC+DGSDLVFGRSIHGFA KNGLEINTSLNTSLTEMYINC DEGSATNL        
Sbjct: 469 LLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRD 528

Query: 481 ---WSP----------------------------SLTIINILTSCTQLAHLLLGQCLHDY 540
              W+                             S+TII+ILTSCTQLAHL LGQCLH Y
Sbjct: 529 LISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAY 588

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
           TTRR ES ELDASLANA ITMYARCGKMQYAEKIF+T+KARNIVSWNAMIT YGMHGRGH
Sbjct: 589 TTRRGESFELDASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGH 648

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNN+SF SVLSACSHSGLT+TGLQLF+SMVRDFGI P+LAHYGC+
Sbjct: 649 DATLAFAQMLDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCI 708

BLAST of Tan0010144 vs. NCBI nr
Match: XP_022944564.1 (pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like [Cucurbita moschata] >XP_022944565.1 pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1072.0 bits (2771), Expect = 1.9e-309
Identity = 547/666 (82.13%), Positives = 583/666 (87.54%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGIA DSATMPLVLKACGRL AI+KGVRIHSCIR SDLI+DVRVGTALVDFYSKCG 
Sbjct: 49  MESLGIAPDSATMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGL 108

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           VGEASKVFDEMPERD+VSW+ALISGYVGCSCYKEAVLLF+EMQ+ G TP S T+V LLLA
Sbjct: 109 VGEASKVFDEMPERDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLA 168

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           CAEM E+RLG EIHGYCLRNGLFDMDAHVGT L+GFYMRFDA VSHRVFS M VRNVVSW
Sbjct: 169 CAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSW 228

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NA+ITGYL+IGDYTKAL+LFSSMLTEG+KFDAVTML++IQACAES+SLQLGM LHQLAIK
Sbjct: 229 NAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIK 288

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FNFI DLF+LNALLNMYSDNG LESSC LFNAVPTSDAALWNSMISAYIAFGFHAEA+AL
Sbjct: 289 FNFIGDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIAL 348

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           + KMRLEG+KED+RTV IMLSLCEDL DG I GRGLHA AMK GMEL V LGNALLSMYV
Sbjct: 349 YIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYV 408

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           EHNQIDAAQK+FDKMRG DVIS NTMILALA+SKFRAKAFE+FM M ESE KFNSYTMIS
Sbjct: 409 EHNQIDAAQKLFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMIS 468

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNL-------- 480
           LLALC+DGSDLVFGRSIHGFA KNGLEINTSLNTSLTEMYINC DEGSATNL        
Sbjct: 469 LLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRD 528

Query: 481 ---WSP----------------------------SLTIINILTSCTQLAHLLLGQCLHDY 540
              W+                             S+TII+ILTSCTQLAHL LGQCLH Y
Sbjct: 529 LVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAY 588

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
           TTRR ES ELDASLANA ITMYARCGKMQYAEKIF+T+KARNIVSWNAMIT YGMHGRGH
Sbjct: 589 TTRRGESFELDASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGH 648

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNN+SF SVLSACSHSGLT+TGLQLF+SMVRDFGI P+LAHYGC+
Sbjct: 649 DATLAFAQMLDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCI 708
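
The identity and positive counts in an HSP summary can be checked row by row from the alignment itself. The sketch below is illustrative only: it assumes the usual three-row BLAST layout (query, midline, subject), where the midline repeats the letter at identical positions and shows `+` at conservative substitutions. The function name `count_row` is hypothetical, and the three strings are the first row of the XP_022944564.1 alignment above.

```python
def count_row(query: str, midline: str, sbjct: str) -> tuple[int, int]:
    """Return (identities, positives) for one alignment row."""
    # Trailing spaces of the midline are often lost in copied text, so pad it.
    midline = midline.ljust(len(query))
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # Identity: same residue in both rows, ignoring gap characters.
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # Positives: identities plus the '+' marks on the midline.
    positives = identities + midline.count("+")
    return identities, positives


QUERY = "MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF"
MIDLINE = "MESLGIA DSATMPLVLKACGRL AI+KGVRIHSCIR SDLI+DVRVGTALVDFYSKCG "
SBJCT = "MESLGIAPDSATMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGL"

identities, positives = count_row(QUERY, MIDLINE, SBJCT)
print(identities, positives)
```

Summing these per-row counts over all rows of an HSP should reproduce the totals on its Identity/Positives line, provided the rows were copied without losing characters.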

BLAST of Tan0010144 vs. NCBI nr
Match: XP_038902698.1 (pentatricopeptide repeat-containing protein At2g13600-like [Benincasa hispida] >XP_038902699.1 pentatricopeptide repeat-containing protein At2g13600-like [Benincasa hispida])

HSP 1 Score: 1052.7 bits (2721), Expect = 1.2e-303
Identity = 532/666 (79.88%), Positives = 573/666 (86.04%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGIA DSATMPLVLKACGRL AIDKGVRIHSCIR SDLIKDVR+GTALVDFY KCG 
Sbjct: 48  MESLGIAPDSATMPLVLKACGRLNAIDKGVRIHSCIRDSDLIKDVRIGTALVDFYCKCGL 107

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           V EASKVFDEM ERD+VSW+ALISGYVGCSCYKEAVLLFMEM++ G TP S T+VALLLA
Sbjct: 108 VTEASKVFDEMLERDLVSWNALISGYVGCSCYKEAVLLFMEMKKAGLTPNSRTVVALLLA 167

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           C EM E+RLGQEIHG+CLRNGLFDMDA+VGT L+GFYMRFDAA+SHRVFSLM+V+N+VSW
Sbjct: 168 CGEMLELRLGQEIHGFCLRNGLFDMDAYVGTALIGFYMRFDAALSHRVFSLMVVKNIVSW 227

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NAIITGY DIGDY KAL+LFSSMLTEG+KFDAVTMLV+IQACAES+ L+LGM LHQLAIK
Sbjct: 228 NAIITGYFDIGDYAKALKLFSSMLTEGIKFDAVTMLVVIQACAESECLRLGMQLHQLAIK 287

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FNFIDDLFILNALLNMYSDNG+LESSC LFNAVPTSDAALWNSMISAY+ FGFHAEA+AL
Sbjct: 288 FNFIDDLFILNALLNMYSDNGNLESSCALFNAVPTSDAALWNSMISAYVVFGFHAEAIAL 347

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           F KMRLE IKEDERT+AI+LSLCEDL DG I GRGLHA A K GMEL   LGNALLSMYV
Sbjct: 348 FNKMRLEHIKEDERTIAILLSLCEDLNDGSIWGRGLHAHATKSGMELNAFLGNALLSMYV 407

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           EHNQID  Q VF+KM G DVISWNT+I ALAQSKFRAKAFE+F MM ESE KFNSYTMIS
Sbjct: 408 EHNQIDYTQNVFEKMSGLDVISWNTLISALAQSKFRAKAFELFTMMCESEIKFNSYTMIS 467

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWS------ 480
           LLALC+DGSDL+FGRSIHG A KNGLEINTSLNTSLTEMY+NCGDEGSATNL++      
Sbjct: 468 LLALCKDGSDLLFGRSIHGLAIKNGLEINTSLNTSLTEMYVNCGDEGSATNLFTRCPQRD 527

Query: 481 ---------------------------------PSLTIINILTSCTQLAHLLLGQCLHDY 540
                                             S+TIINILTSCTQLAHL LGQCLH Y
Sbjct: 528 LISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINILTSCTQLAHLPLGQCLHAY 587

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
           TTRRE+SLELDASLANA ITMY RCGKMQYAE IFNT++ARNIVSWNAMIT YGMHGRG 
Sbjct: 588 TTRREKSLELDASLANAFITMYMRCGKMQYAETIFNTLQARNIVSWNAMITGYGMHGRGR 647

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF++MLDDGFKPNN SF SVLSACSHSGLTETGLQLFNSMV+DFG+ P+L HYGCM
Sbjct: 648 DATLAFAKMLDDGFKPNNASFVSVLSACSHSGLTETGLQLFNSMVQDFGMAPQLTHYGCM 707
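
The two summary lines that open each HSP (score, E-value, identity, positives) can be turned into numbers for downstream filtering. This is a minimal sketch under the assumption that the lines follow the layout shown on this page. The names `HSP_RE` and `parse_hsp` are hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that some exports emit.

```python
import re

# Pattern for the two-line HSP summary as printed on this page (assumed layout).
HSP_RE = re.compile(
    r"Score: (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Posi?tives = (?P<pos>\d+)/\d+ \([\d.]+%\)"
)


def parse_hsp(text: str) -> dict:
    """Extract bit score, E-value and recomputed percentages from an HSP summary."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


# Summary lines of the XP_038902698.1 hit above.
SUMMARY = """HSP 1 Score: 1052.7 bits (2721), Expect = 1.2e-303
Identity = 532/666 (79.88%), Postives = 573/666 (86.04%), Query Frame = 0"""

hsp = parse_hsp(SUMMARY)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, gives a cheap consistency check on copied records.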

BLAST of Tan0010144 vs. ExPASy TrEMBL
Match: A0A6J1JER5 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111484372 PE=4 SV=1)

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 547/666 (82.13%), Positives = 583/666 (87.54%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGIA DSATMPLVLKACGRL AI+KG RIHSCIR SDLI+DVRVGTALVDFYSKCG 
Sbjct: 49  MESLGIAPDSATMPLVLKACGRLNAIEKGARIHSCIRDSDLIRDVRVGTALVDFYSKCGL 108

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           V EASKVFDEMPERD+VSW+ALISGYVGCSCYKEAVLLFMEMQ+ G TP S T+V LLLA
Sbjct: 109 VREASKVFDEMPERDLVSWNALISGYVGCSCYKEAVLLFMEMQKAGLTPNSRTVVPLLLA 168

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           CAEM E+RLG EIHGYCLRNGLFDMDAHVGT L+GFYMRFDAAVSHRVFSLM +RNVVSW
Sbjct: 169 CAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHRVFSLMEMRNVVSW 228

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NA+ITGYL+IGDY KAL+LFSSMLTEG+KFDAVTML++IQACAES+SLQLGM LHQLAIK
Sbjct: 229 NAMITGYLNIGDYPKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIK 288

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FNFIDDLF+LNALLNMYSDNG LESSC LFNAVPTSDAALWNSMISAYIAFGFHAEA+AL
Sbjct: 289 FNFIDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIAL 348

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           + KMRLEG+KED+RTVAIMLSLCEDL DG I GRGLHA AMK GMEL V LGNALLSMYV
Sbjct: 349 YIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYV 408

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           EHNQIDAAQK+FDK RG DVISWNTMILALAQSKFRAKAFE+FM M ESE KFNSYTMIS
Sbjct: 409 EHNQIDAAQKLFDKTRGLDVISWNTMILALAQSKFRAKAFELFMTMCESEIKFNSYTMIS 468

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNL-------- 480
           LLALC+DGSDLVFGRSIHGFA KNGLEINTSLNTSLTEMYIN  DEGSATNL        
Sbjct: 469 LLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINSSDEGSATNLFIRCPERD 528

Query: 481 ---WSP----------------------------SLTIINILTSCTQLAHLLLGQCLHDY 540
              W+                             S+TII+ILTSCTQLAHL LGQCLH Y
Sbjct: 529 LISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAY 588

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
           TTRR ES ELDASLANA ITMYARCGKMQYAEKIFNT++ARNIVSWNAMIT YGMHGRGH
Sbjct: 589 TTRRGESFELDASLANAFITMYARCGKMQYAEKIFNTLQARNIVSWNAMITGYGMHGRGH 648

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNN+SF SVLSACSHSGLT+TGLQLF+SMVRDFGI P+LAHYGC+
Sbjct: 649 DATLAFAQMLDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCI 708

BLAST of Tan0010144 vs. ExPASy TrEMBL
Match: A0A6J1FWX3 (pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111448981 PE=4 SV=1)

HSP 1 Score: 1072.0 bits (2771), Expect = 9.4e-310
Identity = 547/666 (82.13%), Positives = 583/666 (87.54%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGIA DSATMPLVLKACGRL AI+KGVRIHSCIR SDLI+DVRVGTALVDFYSKCG 
Sbjct: 49  MESLGIAPDSATMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGL 108

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           VGEASKVFDEMPERD+VSW+ALISGYVGCSCYKEAVLLF+EMQ+ G TP S T+V LLLA
Sbjct: 109 VGEASKVFDEMPERDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLA 168

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           CAEM E+RLG EIHGYCLRNGLFDMDAHVGT L+GFYMRFDA VSHRVFS M VRNVVSW
Sbjct: 169 CAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSW 228

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NA+ITGYL+IGDYTKAL+LFSSMLTEG+KFDAVTML++IQACAES+SLQLGM LHQLAIK
Sbjct: 229 NAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIK 288

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FNFI DLF+LNALLNMYSDNG LESSC LFNAVPTSDAALWNSMISAYIAFGFHAEA+AL
Sbjct: 289 FNFIGDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIAL 348

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           + KMRLEG+KED+RTV IMLSLCEDL DG I GRGLHA AMK GMEL V LGNALLSMYV
Sbjct: 349 YIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYV 408

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           EHNQIDAAQK+FDKMRG DVIS NTMILALA+SKFRAKAFE+FM M ESE KFNSYTMIS
Sbjct: 409 EHNQIDAAQKLFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMIS 468

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNL-------- 480
           LLALC+DGSDLVFGRSIHGFA KNGLEINTSLNTSLTEMYINC DEGSATNL        
Sbjct: 469 LLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRD 528

Query: 481 ---WSP----------------------------SLTIINILTSCTQLAHLLLGQCLHDY 540
              W+                             S+TII+ILTSCTQLAHL LGQCLH Y
Sbjct: 529 LVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAY 588

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
           TTRR ES ELDASLANA ITMYARCGKMQYAEKIF+T+KARNIVSWNAMIT YGMHGRGH
Sbjct: 589 TTRRGESFELDASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGH 648

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNN+SF SVLSACSHSGLT+TGLQLF+SMVRDFGI P+LAHYGC+
Sbjct: 649 DATLAFAQMLDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCI 708

BLAST of Tan0010144 vs. ExPASy TrEMBL
Match: A0A5A7TDA1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold206G00610 PE=4 SV=1)

HSP 1 Score: 1022.3 bits (2642), Expect = 8.5e-295
Identity = 519/666 (77.93%), Positives = 565/666 (84.83%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGI  DSATMPLVLKACGRL AIDKGVRIHSCIR SDLI DVRVGTALVDFY KCG 
Sbjct: 51  MESLGITPDSATMPLVLKACGRLNAIDKGVRIHSCIRGSDLINDVRVGTALVDFYCKCGL 110

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           V EASKVF EMPERD+VSW+ALISGYVGC CYKEAVLLF+EM++ G TP S T+VALLLA
Sbjct: 111 VAEASKVFVEMPERDLVSWNALISGYVGCLCYKEAVLLFVEMKKAGLTPNSRTVVALLLA 170

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           C EM E+RLGQEIHGYCLRNGLFDMDA+VGT LVGFY+RFDA +SHRVFSLM+VRN+VSW
Sbjct: 171 CGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYLRFDAVLSHRVFSLMVVRNIVSW 230

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NAIITG+L++GDYTKAL+LFSSML EG+KFDAVTMLV+IQACAE   L+LGM LHQLAIK
Sbjct: 231 NAIITGFLNVGDYTKALKLFSSMLIEGIKFDAVTMLVVIQACAEYGCLRLGMQLHQLAIK 290

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FN I+D+F+LNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMIS YI FGFHAEA+AL
Sbjct: 291 FNLINDVFVLNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISCYIGFGFHAEAIAL 350

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           F KMRLE IKED RT+ IMLSLC DL DG I GRGLHA AMK G+EL   LGNALLSMYV
Sbjct: 351 FIKMRLERIKEDVRTIVIMLSLCNDLNDGSIWGRGLHAHAMKSGIELDAFLGNALLSMYV 410

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           +HNQI+AAQ VF+K RG DVISWNTMI ALAQS FRAKAFE+F MM ESE KFNSYT+IS
Sbjct: 411 KHNQINAAQNVFEKTRGLDVISWNTMISALAQSMFRAKAFELFFMMCESEIKFNSYTIIS 470

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWS------ 480
           LLALC+DG+DLVFGRSIHGFA KNGLEINTSLNTSLTEMYINCGDE +A ++++      
Sbjct: 471 LLALCKDGNDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCGDERAAIDMFTRCPQRD 530

Query: 481 ---------------------------------PSLTIINILTSCTQLAHLLLGQCLHDY 540
                                             S+TIINILTSCTQLAHL LGQCLH Y
Sbjct: 531 LISWNSLILSYIKNDNAGKALLLFNHMISELEPNSVTIINILTSCTQLAHLPLGQCLHAY 590

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
            TRREESLE+DASLANA ITMYARCGKMQYAE+IF T++ RNIVSWNAMIT YGMHGRG 
Sbjct: 591 ATRREESLEMDASLANAFITMYARCGKMQYAEQIFRTLQTRNIVSWNAMITGYGMHGRGR 650

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNNVSFASVLSACSHSGLTETGL LF+SMVRDFG+ P+L HYGCM
Sbjct: 651 DATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTETGLLLFHSMVRDFGLAPQLTHYGCM 710

BLAST of Tan0010144 vs. ExPASy TrEMBL
Match: A0A5D3BD37 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold182G001060 PE=4 SV=1)

HSP 1 Score: 1021.5 bits (2640), Expect = 1.5e-294
Identity = 518/666 (77.78%), Positives = 565/666 (84.83%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGI  DSATMPLVLKACGRL AIDKGVRIHSCIR SDLI DVRVGTALVDFY KCG 
Sbjct: 51  MESLGITPDSATMPLVLKACGRLNAIDKGVRIHSCIRGSDLINDVRVGTALVDFYCKCGL 110

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           V EASKVF EMPERD+VSW+ALISGYVGC CYKEAVLLF+EM++ G TP S T+VALLLA
Sbjct: 111 VAEASKVFVEMPERDLVSWNALISGYVGCLCYKEAVLLFVEMKKAGLTPNSRTVVALLLA 170

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           C EM E+RLGQEIHGYCLRNGLFDMDA+VGT LVGFY+RFDA +SHRVFSLM+VRN+VSW
Sbjct: 171 CGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYLRFDAVLSHRVFSLMVVRNIVSW 230

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NAIITG+L++GDYTKAL+LFSSML EG+KFDAVTMLV+IQACAE   L+LGM LHQLAIK
Sbjct: 231 NAIITGFLNVGDYTKALKLFSSMLIEGIKFDAVTMLVVIQACAEYGCLRLGMQLHQLAIK 290

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FN I+D+F+LNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMIS YI FGFHAEA+AL
Sbjct: 291 FNLINDVFVLNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISCYIGFGFHAEAIAL 350

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           F KMRLE IKED RT+ IMLSLC DL DG + GRGLHA AMK G+EL   LGNALLSMYV
Sbjct: 351 FIKMRLERIKEDVRTIVIMLSLCNDLNDGSLWGRGLHAHAMKSGIELDAFLGNALLSMYV 410

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           +HNQI+AAQ VF+K RG DVISWNTMI ALAQS FRAKAFE+F MM ESE KFNSYT+IS
Sbjct: 411 KHNQINAAQNVFEKTRGLDVISWNTMISALAQSMFRAKAFELFFMMCESEIKFNSYTIIS 470

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWS------ 480
           LLALC+DG+DLVFGRSIHGFA KNGLEINTSLNTSLTEMYINCGDE +A ++++      
Sbjct: 471 LLALCKDGNDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCGDERAAIDMFTRCPQRD 530

Query: 481 ---------------------------------PSLTIINILTSCTQLAHLLLGQCLHDY 540
                                             S+TIINILTSCTQLAHL LGQCLH Y
Sbjct: 531 LISWNSLILSYIKNDNAGKALLLFNHMISELEPNSVTIINILTSCTQLAHLPLGQCLHAY 590

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
            TRREESLE+DASLANA ITMYARCGKMQYAE+IF T++ RNIVSWNAMIT YGMHGRG 
Sbjct: 591 ATRREESLEMDASLANAFITMYARCGKMQYAEQIFRTLQTRNIVSWNAMITGYGMHGRGR 650

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNNVSFASVLSACSHSGLTETGL LF+SMVRDFG+ P+L HYGCM
Sbjct: 651 DATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTETGLLLFHSMVRDFGLAPQLTHYGCM 710

BLAST of Tan0010144 vs. ExPASy TrEMBL
Match: A0A1S3BNK9 (pentatricopeptide repeat-containing protein At2g13600-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491511 PE=4 SV=1)

HSP 1 Score: 1021.5 bits (2640), Expect = 1.5e-294
Identity = 518/666 (77.78%), Positives = 565/666 (84.83%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGF 60
           MESLGI  DSATMPLVLKACGRL AIDKGVRIHSCIR SDLI DVRVGTALVDFY KCG 
Sbjct: 51  MESLGITPDSATMPLVLKACGRLNAIDKGVRIHSCIRGSDLINDVRVGTALVDFYCKCGL 110

Query: 61  VGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLA 120
           V EASKVF EMPERD+VSW+ALISGYVGC CYKEAVLLF+EM++ G TP S T+VALLLA
Sbjct: 111 VAEASKVFVEMPERDLVSWNALISGYVGCLCYKEAVLLFVEMKKAGLTPNSRTVVALLLA 170

Query: 121 CAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVSHRVFSLMMVRNVVSW 180
           C EM E+RLGQEIHGYCLRNGLFDMDA+VGT LVGFY+RFDA +SHRVFSLM+VRN+VSW
Sbjct: 171 CGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYLRFDAVLSHRVFSLMVVRNIVSW 230

Query: 181 NAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIK 240
           NAIITG+L++GDYTKAL+LFSSML EG+KFDAVTMLV+IQACAE   L+LGM LHQLAIK
Sbjct: 231 NAIITGFLNVGDYTKALKLFSSMLIEGIKFDAVTMLVVIQACAEYGCLRLGMQLHQLAIK 290

Query: 241 FNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVAL 300
           FN I+D+F+LNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMIS YI FGFHAEA+AL
Sbjct: 291 FNLINDVFVLNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISCYIGFGFHAEAIAL 350

Query: 301 FTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYV 360
           F KMRLE IKED RT+ IMLSLC DL DG + GRGLHA AMK G+EL   LGNALLSMYV
Sbjct: 351 FIKMRLERIKEDVRTIVIMLSLCNDLNDGSLWGRGLHAHAMKSGIELDAFLGNALLSMYV 410

Query: 361 EHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMIS 420
           +HNQI+AAQ VF+K RG DVISWNTMI ALAQS FRAKAFE+F MM ESE KFNSYT+IS
Sbjct: 411 KHNQINAAQNVFEKTRGLDVISWNTMISALAQSMFRAKAFELFFMMCESEIKFNSYTIIS 470

Query: 421 LLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWS------ 480
           LLALC+DG+DLVFGRSIHGFA KNGLEINTSLNTSLTEMYINCGDE +A ++++      
Sbjct: 471 LLALCKDGNDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCGDERAAIDMFTRCPQRD 530

Query: 481 ---------------------------------PSLTIINILTSCTQLAHLLLGQCLHDY 540
                                             S+TIINILTSCTQLAHL LGQCLH Y
Sbjct: 531 LISWNSLILSYIKNDNAGKALLLFNHMISELEPNSVTIINILTSCTQLAHLPLGQCLHAY 590

Query: 541 TTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGH 600
            TRREESLE+DASLANA ITMYARCGKMQYAE+IF T++ RNIVSWNAMIT YGMHGRG 
Sbjct: 591 ATRREESLEMDASLANAFITMYARCGKMQYAEQIFRTLQTRNIVSWNAMITGYGMHGRGR 650

Query: 601 DATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCM 628
           DATLAF+QMLDDGFKPNNVSFASVLSACSHSGLTETGL LF+SMVRDFG+ P+L HYGCM
Sbjct: 651 DATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTETGLLLFHSMVRDFGLAPQLTHYGCM 710

BLAST of Tan0010144 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 308.9 bits (790), Expect = 9.3e-84
Identity = 202/669 (30.19%), Positives = 319/669 (47.68%), Query Frame = 0

Query: 1   MESLGIALDSATMPLVLKAC-GRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCG 60
           M S  +  +  T   VL+AC G   A D   +IH+ I +  L     V   L+D YS+ G
Sbjct: 177 MVSENVTPNEGTFSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNG 236

Query: 61  FVGEASKVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLL 120
           FV  A +VFD +  +D  SW A+ISG     C  EA+ LF +M   G  P      ++L 
Sbjct: 237 FVDLARRVFDGLRLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLS 296

Query: 121 ACAEMFEMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAVS-HRVFSLMMVRNVV 180
           AC ++  + +G+++HG  L+ G F  D +V   LV  Y      +S   +FS M  R+ V
Sbjct: 297 ACKKIESLEIGEQLHGLVLKLG-FSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAV 356

Query: 181 SWNAIITGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLA 240
           ++N +I G    G   KA+ELF  M  +G++ D+ T+  L+ AC+   +L  G  LH   
Sbjct: 357 TYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYT 416

Query: 241 IKFNFIDDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAV 300
            K  F  +  I  ALLN+Y+    +E++   F      +  LWN M+ AY        + 
Sbjct: 417 TKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSF 476

Query: 301 ALFTKMRLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSM 360
            +F +M++E I  ++ T   +L  C  L D L  G  +H+Q +K   +L   + + L+ M
Sbjct: 477 RIFRQMQIEEIVPNQYTYPSILKTCIRLGD-LELGEQIHSQIIKTNFQLNAYVCSVLIDM 536

Query: 361 YVEHNQIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTM 420
           Y +  ++D A  +  +  G DV+SW TMI    Q  F  KA   F  M +   + +   +
Sbjct: 537 YAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGL 596

Query: 421 ISLLALCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCG-----------DEG 480
            + ++ C     L  G+ IH  A  +G   +     +L  +Y  CG            E 
Sbjct: 597 TNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEA 656

Query: 481 SATNLWSP-----------------------------SLTIINILTSCTQLAHLLLGQCL 540
                W+                              + T  + + + ++ A++  G+ +
Sbjct: 657 GDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQV 716

Query: 541 HDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHG 600
           H   T+     + +  + NALI+MYA+CG +  AEK F  +  +N VSWNA+I  Y  HG
Sbjct: 717 HAVITK--TGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHG 776

Query: 601 RGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHY 628
            G +A  +F QM+    +PN+V+   VLSACSH GL + G+  F SM  ++G++P+  HY
Sbjct: 777 FGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHY 836

BLAST of Tan0010144 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 305.1 bits (780), Expect = 1.3e-82
Identity = 189/624 (30.29%), Positives = 309/624 (49.52%), Query Frame = 0

Query: 8   LDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGFVGEASKV 67
           +D  T+  VL+ C   K++  G  + + IR +  + D  +G+ L   Y+ CG + EAS+V
Sbjct: 92  IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRV 151

Query: 68  FDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLACAEMFEM 127
           FDE+     + W+ L++       +  ++ LF +M  +G    S T   +  + + +  +
Sbjct: 152 FDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSV 211

Query: 128 RLGQEIHGYCLRNGLFDMDAHVGTVLVGFYM---RFDAAVSHRVFSLMMVRNVVSWNAII 187
             G+++HG+ L++G  + ++ VG  LV FY+   R D+A   +VF  M  R+V+SWN+II
Sbjct: 212 HGGEQLHGFILKSGFGERNS-VGNSLVAFYLKNQRVDSA--RKVFDEMTERDVISWNSII 271

Query: 188 TGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFI 247
            GY+  G   K L +F  ML  G++ D  T++ +   CA+S+ + LG  +H + +K  F 
Sbjct: 272 NGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFS 331

Query: 248 DDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKM 307
            +    N LL+MYS  G L+S+  +F  +       + SMI+ Y   G   EAV LF +M
Sbjct: 332 REDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEM 391

Query: 308 RLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHNQ 367
             EGI  D  TV  +L+ C      L  G+ +H    +  +   + + NAL+ MY +   
Sbjct: 392 EEEGISPDVYTVTAVLNCCARYR-LLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGS 451

Query: 368 IDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIF-MMMYESEFKFNSYTMISLLA 427
           +  A+ VF +MR  D+ISWNT+I   +++ +  +A  +F +++ E  F  +  T+  +L 
Sbjct: 452 MQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLP 511

Query: 428 LCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWSPSLTIINIL 487
            C   S    GR IHG+  +NG              Y +                     
Sbjct: 512 ACASLSAFDKGREIHGYIMRNG--------------YFS--------------------- 571

Query: 488 TSCTQLAHLLLGQCLHDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARN 547
                                       D  +AN+L+ MYA+CG +  A  +F+ + +++
Sbjct: 572 ----------------------------DRHVANSLVDMYAKCGALLLAHMLFDDIASKD 631

Query: 548 IVSWNAMITEYGMHGRGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFN 607
           +VSW  MI  YGMHG G +A   F+QM   G + + +SF S+L ACSHSGL + G + FN
Sbjct: 632 LVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFN 648

Query: 608 SMVRDFGITPELAHYGCMVDLLGR 628
            M  +  I P + HY C+VD+L R
Sbjct: 692 IMRHECKIEPTVEHYACIVDMLAR 648

BLAST of Tan0010144 vs. TAIR 10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 302.8 bits (774), Expect = 6.6e-82
Identity = 193/623 (30.98%), Positives = 315/623 (50.56%), Query Frame = 0

Query: 6   IALDSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGFVGEAS 65
           ++ D  T P V+KAC  L   + G  ++  I       D+ VG ALVD YS+ G +  A 
Sbjct: 102 VSPDKYTFPSVIKACAGLFDAEMGDLVYEQILDMGFESDLFVGNALVDMYSRMGLLTRAR 161

Query: 66  KVFDEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLACAEMF 125
           +VFDEMP RD+VSW++LISGY     Y+EA+ ++ E++ +   P S T+ ++L A   + 
Sbjct: 162 QVFDEMPVRDLVSWNSLISGYSSHGYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLL 221

Query: 126 EMRLGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAV-SHRVFSLMMVRNVVSWNAII 185
            ++ GQ +HG+ L++G+  +   V   LV  Y++F     + RVF  M VR+ VS+N +I
Sbjct: 222 VVKQGQGLHGFALKSGVNSV-VVVNNGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMI 281

Query: 186 TGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFI 245
            GYL +    +++ +F   L +  K D +T+  +++AC   + L L  +++   +K  F+
Sbjct: 282 CGYLKLEMVEESVRMFLENLDQ-FKPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFV 341

Query: 246 DDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKM 305
            +  + N L+++Y+  G + ++  +FN++   D   WNS+IS YI  G   EA+ LF  M
Sbjct: 342 LESTVRNILIDVYAKCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMM 401

Query: 306 RLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHNQ 365
            +   + D  T  +++S+   L D L  G+GLH+  +K G+ + + + NAL+ MY +  +
Sbjct: 402 MIMEEQADHITYLMLISVSTRLAD-LKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGE 461

Query: 366 IDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMISLLAL 425
           +  + K+F  M   D ++WNT+I A  +                                
Sbjct: 462 VGDSLKIFSSMGTGDTVTWNTVISACVR-------------------------------- 521

Query: 426 CRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWSPSLTIINILT 485
                   FG     FA   GL++ T +  S                +     T +  L 
Sbjct: 522 --------FG----DFA--TGLQVTTQMRKS---------------EVVPDMATFLVTLP 581

Query: 486 SCTQLAHLLLGQCLHDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNI 545
            C  LA   LG+ +H    R     E +  + NALI MY++CG ++ + ++F  M  R++
Sbjct: 582 MCASLAAKRLGKEIHCCLLR--FGYESELQIGNALIEMYSKCGCLENSSRVFERMSRRDV 641

Query: 546 VSWNAMITEYGMHGRGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNS 605
           V+W  MI  YGM+G G  A   F+ M   G  P++V F +++ ACSHSGL + GL  F  
Sbjct: 642 VTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEGLACFEK 658

Query: 606 MVRDFGITPELAHYGCMVDLLGR 628
           M   + I P + HY C+VDLL R
Sbjct: 702 MKTHYKIDPMIEHYACVVDLLSR 658

BLAST of Tan0010144 vs. TAIR 10
Match: AT1G18485.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 300.1 bits (767), Expect = 4.3e-81
Identity = 197/664 (29.67%), Positives = 328/664 (49.40%), Query Frame = 0

Query: 15  LVLKACGRLKAIDKGVRIHSCIRHSDLIK-DVRVGTALVDFYSKCGFVGEASKVFDEMPE 74
           L+L+A G+ K I+ G +IH  +  S  ++ D  + T ++  Y+ CG   ++  VFD +  
Sbjct: 89  LLLQASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVFDALRS 148

Query: 75  RDVVSWSALISGYVGCSCYKEAVLLFMEM-QRTGFTPKSCTIVALLLACAEMFEMRLGQE 134
           +++  W+A+IS Y     Y E +  F+EM   T   P   T   ++ ACA M ++ +G  
Sbjct: 149 KNLFQWNAVISSYSRNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGIGLA 208

Query: 135 IHGYCLRNGLFDMDAHVGTVLVGFYMRFDAAV-SHRVFSLMMVRNVVSWNAIITGYLDIG 194
           +HG  ++ GL + D  VG  LV FY        + ++F +M  RN+VSWN++I  + D G
Sbjct: 209 VHGLVVKTGLVE-DVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDNG 268

Query: 195 DYTKALELFSSMLTE----GVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFIDDL 254
              ++  L   M+ E        D  T++ ++  CA  + + LG  +H  A+K     +L
Sbjct: 269 FSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKEL 328

Query: 255 FILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKMRL- 314
            + NAL++MYS  G + ++ ++F      +   WN+M+  + A G       +  +M   
Sbjct: 329 VLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQMLAG 388

Query: 315 -EGIKEDERTVAIMLSLC--EDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHN 374
            E +K DE T+   + +C  E     L   + LH  ++K+      L+ NA ++ Y +  
Sbjct: 389 GEDVKADEVTILNAVPVCFHESFLPSL---KELHCYSLKQEFVYNELVANAFVASYAKCG 448

Query: 375 QIDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMISLLA 434
            +  AQ+VF  +R   V SWN +I   AQS     + +  + M  S    +S+T+ SLL+
Sbjct: 449 SLSYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLS 508

Query: 435 LCRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGD-----------EGSATNL 494
            C     L  G+ +HGF  +N LE +  +  S+  +YI+CG+           E  +   
Sbjct: 509 ACSKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVS 568

Query: 495 WSPSLT-----------------------------IINILTSCTQLAHLLLGQCLHDYTT 554
           W+  +T                             ++ +  +C+ L  L LG+  H Y  
Sbjct: 569 WNTVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYAL 628

Query: 555 RREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNIVSWNAMITEYGMHGRGHDA 614
           +    LE DA +A +LI MYA+ G +  + K+FN +K ++  SWNAMI  YG+HG   +A
Sbjct: 629 K--HLLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEA 688

Query: 615 TLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNSMVRDFGITPELAHYGCMVD 628
              F +M   G  P++++F  VL+AC+HSGL   GL+  + M   FG+ P L HY C++D
Sbjct: 689 IKLFEEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVID 746

BLAST of Tan0010144 vs. TAIR 10
Match: AT1G74600.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 299.7 bits (766), Expect = 5.6e-81
Identity = 189/623 (30.34%), Positives = 311/623 (49.92%), Query Frame = 0

Query: 9   DSATMPLVLKACGRLKAIDKGVRIHSCIRHSDLIKDVRVGTALVDFYSKCGFVGEASKVF 68
           DS T   VL AC  L+ +  G  + + +      +DV V TA+VD Y+KCG + EA +VF
Sbjct: 250 DSYTYSSVLAACASLEKLRFGKVVQARVIKCG-AEDVFVCTAIVDLYAKCGHMAEAMEVF 309

Query: 69  DEMPERDVVSWSALISGYVGCSCYKEAVLLFMEMQRTGFTPKSCTIVALLLACAEMFEMR 128
             +P   VVSW+ ++SGY   +    A+ +F EM+ +G    +CT+ +++ AC     + 
Sbjct: 310 SRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACGRPSMVC 369

Query: 129 LGQEIHGYCLRNGLFDMDAHVGTVLVGFYMRF-DAAVSHRVFSLM---MVRNVVSWNAII 188
              ++H +  ++G F +D+ V   L+  Y +  D  +S +VF  +     +N+V  N +I
Sbjct: 370 EASQVHAWVFKSG-FYLDSSVAAALISMYSKSGDIDLSEQVFEDLDDIQRQNIV--NVMI 429

Query: 189 TGYLDIGDYTKALELFSSMLTEGVKFDAVTMLVLIQACAESQSLQLGMHLHQLAIKFNFI 248
           T +       KA+ LF+ ML EG++ D  ++  L+        L LG  +H   +K   +
Sbjct: 430 TSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVL---DCLNLGKQVHGYTLKSGLV 489

Query: 249 DDLFILNALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISAYIAFGFHAEAVALFTKM 308
            DL + ++L  +YS  GSLE S  LF  +P  D A W SMIS +  +G+  EA+ LF++M
Sbjct: 490 LDLTVGSSLFTLYSKCGSLEESYKLFQGIPFKDNACWASMISGFNEYGYLREAIGLFSEM 549

Query: 309 RLEGIKEDERTVAIMLSLCEDLTDGLIRGRGLHAQAMKRGMELGVLLGNALLSMYVEHNQ 368
             +G   DE T+A +L++C      L RG+ +H   ++ G++ G+ LG+AL++MY +   
Sbjct: 550 LDDGTSPDESTLAAVLTVCSS-HPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCGS 609

Query: 369 IDAAQKVFDKMRGFDVISWNTMILALAQSKFRAKAFEIFMMMYESEFKFNSYTMISLLAL 428
           +  A++V+D++   D +S +++I   +Q       F +F  M  S F  +S+ + S+L  
Sbjct: 610 LKLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTMDSFAISSILKA 669

Query: 429 CRDGSDLVFGRSIHGFARKNGLEINTSLNTSLTEMYINCGDEGSATNLWSPSLTIINILT 488
                +   G  +H +  K GL                                      
Sbjct: 670 AALSDESSLGAQVHAYITKIGL-------------------------------------- 729

Query: 489 SCTQLAHLLLGQCLHDYTTRREESLELDASLANALITMYARCGKMQYAEKIFNTMKARNI 548
            CT                        + S+ ++L+TMY++ G +    K F+ +   ++
Sbjct: 730 -CT------------------------EPSVGSSLLTMYSKFGSIDDCCKAFSQINGPDL 789

Query: 549 VSWNAMITEYGMHGRGHDATLAFSQMLDDGFKPNNVSFASVLSACSHSGLTETGLQLFNS 608
           ++W A+I  Y  HG+ ++A   ++ M + GFKP+ V+F  VLSACSH GL E      NS
Sbjct: 790 IAWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSHGGLVEESYFHLNS 801

Query: 609 MVRDFGITPELAHYGCMVDLLGR 628
           MV+D+GI PE  HY CMVD LGR
Sbjct: 850 MVKDYGIEPENRHYVCMVDALGR 801

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)   Description
Q9SVP7       1.3e-82   30.19          Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
Q9SN39       1.9e-81   30.29          Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9SS60       9.3e-81   30.98          Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX...
Q0WN60       6.1e-80   29.67          Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX...
Q9CA56       7.9e-80   30.34          Pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Arabidop...

Match Name       E-value    Identity (%)   Description
XP_023512048.1   0.0e+00    82.88          pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isofor...
XP_022986695.1   0.0e+00    82.13          pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucur...
KAG6570395.1     4.8e-310   82.28          Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022944564.1   1.9e-309   82.13          pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like [Cucur...
XP_038902698.1   1.2e-303   79.88          pentatricopeptide repeat-containing protein At2g13600-like [Benincasa hispida] >...
Match Name	E-value	Identity	Description
A0A6J1JER5	0.0e+00	82.13	pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cuc...
A0A6J1FWX3	9.4e-310	82.13	pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like OS=Cuc...
A0A5A7TDA1	8.5e-295	77.93	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5D3BD37	1.5e-294	77.78	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BNK9	1.5e-294	77.78	pentatricopeptide repeat-containing protein At2g13600-like isoform X1 OS=Cucumis...
Match Name	E-value	Identity	Description
AT4G13650.1	9.3e-84	30.19	Pentatricopeptide repeat (PPR) superfamily protein
AT4G18750.1	1.3e-82	30.29	Pentatricopeptide repeat (PPR) superfamily protein
AT3G03580.1	6.6e-82	30.98	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G18485.1	4.3e-81	29.67	Pentatricopeptide repeat (PPR) superfamily protein
AT1G74600.1	5.6e-81	30.34	pentatricopeptide (PPR) repeat-containing protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
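IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 74..122; e-value: 4.7E-8; score: 33.1
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 542..590; e-value: 1.2E-8; score: 35.0
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 175..223; e-value: 6.4E-12; score: 45.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 545..578; e-value: 1.7E-5; score: 22.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 517..545; e-value: 0.0019; score: 16.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 178..211; e-value: 1.6E-6; score: 25.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 580..613; e-value: 4.8E-4; score: 18.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 280..310; e-value: 0.0026; score: 15.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 49..77; e-value: 0.0019; score: 16.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 77..109; e-value: 2.2E-5; score: 22.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 381..409; e-value: 0.036; score: 14.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 353..378; e-value: 0.011; score: 15.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 281..309; e-value: 4.2E-5; score: 23.5
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 75..109; score: 11.542307
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 277..311; score: 10.259834
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 379..413; score: 8.95544
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 176..210; score: 11.23539
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 578..613; score: 8.999285
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 512..542; score: 8.801982
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 543..577; score: 10.764054
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 229..328; e-value: 2.3E-11; score: 45.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 329..474; e-value: 5.0E-18; score: 67.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 135..228; e-value: 1.4E-13; score: 52.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 1..134; e-value: 9.6E-25; score: 89.0
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 475..597; e-value: 5.0E-24; score: 86.6
None	No IPR available	PANTHER	PTHR47929	FAMILY NOT NAMED	coord: 333..472
None	No IPR available	PANTHER	PTHR47929	FAMILY NOT NAMED	coord: 8..376
None	No IPR available	PANTHER	PTHR47929:SF21	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 476..627
None	No IPR available	PANTHER	PTHR47929:SF21	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 333..472
None	No IPR available	PANTHER	PTHR47929:SF21	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 8..376
None	No IPR available	PANTHER	PTHR47929	FAMILY NOT NAMED	coord: 476..627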
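
Hit coordinates like those above are often collapsed into non-overlapping regions to see how much of the protein a signature covers. The sketch below does this for the seven PROSITE PS51375 coordinates listed above. The function name `merge_intervals` is hypothetical, and treating directly adjacent hits (for example 512..542 and 543..577) as one region is an assumption made for illustration, not part of the InterPro analysis.

```python
# Minimal sketch: merge PROSITE PS51375 (PPR) hit coordinates into regions.
# Assumption: coordinates are 1-based and inclusive, and hits that overlap
# or directly abut each other are merged into a single region.

PROSITE_PPR_HITS = [
    (75, 109), (277, 311), (379, 413), (176, 210),
    (578, 613), (512, 542), (543, 577),
]


def merge_intervals(intervals):
    """Merge overlapping or adjacent inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current region when the hit overlaps or abuts it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [(s, e) for s, e in merged]


regions = merge_intervals(PROSITE_PPR_HITS)
covered = sum(end - start + 1 for start, end in regions)
print(regions)
print(f"{covered} residues covered by PS51375 hits")
```

The same function can be applied to the PFAM or TIGRFAM coordinates by substituting the corresponding list.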

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0010144.1	Tan0010144.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding