Tan0021030 (gene) Snake gourd v1

Overview
NameTan0021030
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG08: 74719411 .. 74721863 (+)
RNA-Seq ExpressionTan0021030
SyntenyTan0021030
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCAGATGGAGAAAGCTGCTCTTAGTTAATGAGCTATGGCAGCTTCTCCACAGATTTCTTTCCCAAATTTCTCAATAGAAAATAACAATATTCCTTTCAGAAACCATCAAATTCTCTCCAGAATCAATCAATGTTCAAGTGCAAAGCAATTGAAGCAAGTTCACGCTCAGATGCTCCGCACCGGCCTCTTTTTCGACCCCTTCTCCGCCAGCAAGCTCCTAACAGTCTCCGCTCTTTCGTCCTTCTCCACTCTCGAGTATGCCCGCCAAATGTTCGACCAAATTTCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCGTACGCTTCCAGCTCCGACCCATTTCAGAGTTTCGTGACATTCCTGGATTTGCTCGATAAAAGTGATGATTTGCCCAACAATTTCACTTTCCCTTTTGTGATTAAGGCTGCTTCGGAGCTAAAAGCTTCGCGGGTCGGCAGAGCTGTTCATGGAATGGCGATTAAGATGTCGTTTGGTATGGATTTGTATATTCTTAATTCGCTTGTGCGATTCTATGGGGTATGTGGAGATTTGAACATGGCTGAGCGATTGTTTGAGGGTATTTCTAGCAAAGATGTGGTGTCTTGGAATTCGATGATCTCTGCTTTTGCTCAGGGGAACTGTCCAGAAGGTGCATTGGACTTGTTTTTGAAAATGGAAAGGGAGAATGTGATGCCAAACTCTGTGACAATGGTGGGTGTTTTATCTGCTTGTGCAAAGAAGTTGGATTTGGAGTTTGGGAGGTGGGTTTGTTCATACATTGAAAGGAAAGAAATTAAAGTGGATTTAACTCTGTGTAATGCCATGCTTGACATGTATACAAAATGTGGAAGTATTGCTGATGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCACCATGCTTGATGGGTATGCGAAAACGGGCGATTTTGATACTGCTCGGCAAGTGTTTGATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAACGTTCTCATATCTGCTTATGAACAAAATGGTAAGCCTAAGGAGGCTTTGGCCACTTTCAATGAGTTGCAGCTTAGTAAGATTGCAAAGCCTGATGAAGTCACTTTGGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGGTGGATTCATGTGTACATAAAAAGGGAGGGGATAGATCTAAACTGCCATTTGATTACTTCTCTTGTAGACATGTATGCTAAGTGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCACGGGCGTGGGAAGGCGGCAATCGAACTGTTCTTCAAAATGCAGGAAGCTAAGGTGAAGCCGAATGATGTGACGTTTACGAATGTACTATGTGCCTGTAGTCATGCTCGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTGCCTGGAACGAAGCACTACACATGTATGGTAGATATTCTCGGCCGTGCAGGGTTTCTCGAAGAAGCTATGGAGTTAATCAATGAAATGCCTGTAACCCCGAGCGCCTCCATTTGGGGTGCTTTACTTGGTGCCTGCAGGCTTCATATGAATATTGAGCTTGCAGAACTGGCTAGTAACCAATTGCTCAAGTTAGAGCCAAGAAATCATGGTGCTATTGTACTTTTATCCAACATTTATGCCAAAACAGGTAGATGGGACAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACTCTGAACTGAAAAAGGAACCAGGTTGTAGTTCAGTTGAAGTTGATGGCAACGTCCAAGAGTTTCTAGTTGGCGATAATTCCCACCCGTTATCCCGGGAAATCTATTCGAAGTTGGATGAGATTGCAACAAAGCTAAAATCAGTTGGATACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAAGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGTGAGAAGTTAGCGATCGCCTTCGGGCTTATTAGTTTGACTCCATCTCAACCAATTCGAGTGGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTCATATCTAGAGTTTACTGCAGAGATATAATACTGCGAGATCGGTATCGATTCCATCATTTTCGAGACGGGCATTGCTCATGTATGGATTTTTGGTAAAGCAGCATAAGAATGGCTTCTATATTCACTTGCTCTGTTTGGTGCATGAGGTGAACTTTTCTGTAAATTGTGTATATTTCTTATTAAGTGTAAACTGAGTATTCAACTGACTAGGAAGGAATGATCTCTGTTAAATGAGTAGTAATTTGGAAACTTTTATTACTGTTCAATTATATCATATAAAACAATGTATGGTTAGAAATAAATTGAGTACCTTGAATA

mRNA sequence

CTCAGATGGAGAAAGCTGCTCTTAGTTAATGAGCTATGGCAGCTTCTCCACAGATTTCTTTCCCAAATTTCTCAATAGAAAATAACAATATTCCTTTCAGAAACCATCAAATTCTCTCCAGAATCAATCAATGTTCAAGTGCAAAGCAATTGAAGCAAGTTCACGCTCAGATGCTCCGCACCGGCCTCTTTTTCGACCCCTTCTCCGCCAGCAAGCTCCTAACAGTCTCCGCTCTTTCGTCCTTCTCCACTCTCGAGTATGCCCGCCAAATGTTCGACCAAATTTCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCGTACGCTTCCAGCTCCGACCCATTTCAGAGTTTCGTGACATTCCTGGATTTGCTCGATAAAAGTGATGATTTGCCCAACAATTTCACTTTCCCTTTTGTGATTAAGGCTGCTTCGGAGCTAAAAGCTTCGCGGGTCGGCAGAGCTGTTCATGGAATGGCGATTAAGATGTCGTTTGGTATGGATTTGTATATTCTTAATTCGCTTGTGCGATTCTATGGGGTATGTGGAGATTTGAACATGGCTGAGCGATTGTTTGAGGGTATTTCTAGCAAAGATGTGGTGTCTTGGAATTCGATGATCTCTGCTTTTGCTCAGGGGAACTGTCCAGAAGGTGCATTGGACTTGTTTTTGAAAATGGAAAGGGAGAATGTGATGCCAAACTCTGTGACAATGGTGGGTGTTTTATCTGCTTGTGCAAAGAAGTTGGATTTGGAGTTTGGGAGGTGGGTTTGTTCATACATTGAAAGGAAAGAAATTAAAGTGGATTTAACTCTGTGTAATGCCATGCTTGACATGTATACAAAATGTGGAAGTATTGCTGATGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCACCATGCTTGATGGGTATGCGAAAACGGGCGATTTTGATACTGCTCGGCAAGTGTTTGATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAACGTTCTCATATCTGCTTATGAACAAAATGGTAAGCCTAAGGAGGCTTTGGCCACTTTCAATGAGTTGCAGCTTAGTAAGATTGCAAAGCCTGATGAAGTCACTTTGGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGGTGGATTCATGTGTACATAAAAAGGGAGGGGATAGATCTAAACTGCCATTTGATTACTTCTCTTGTAGACATGTATGCTAAGTGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCACGGGCGTGGGAAGGCGGCAATCGAACTGTTCTTCAAAATGCAGGAAGCTAAGGTGAAGCCGAATGATGTGACGTTTACGAATGTACTATGTGCCTGTAGTCATGCTCGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTGCCTGGAACGAAGCACTACACATGTATGGTAGATATTCTCGGCCGTGCAGGGTTTCTCGAAGAAGCTATGGAGTTAATCAATGAAATGCCTGTAACCCCGAGCGCCTCCATTTGGGGTGCTTTACTTGGTGCCTGCAGGCTTCATATGAATATTGAGCTTGCAGAACTGGCTAGTAACCAATTGCTCAAGTTAGAGCCAAGAAATCATGGTGCTATTGTACTTTTATCCAACATTTATGCCAAAACAGGTAGATGGGACAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACTCTGAACTGAAAAAGGAACCAGGTTGTAGTTCAGTTGAAGTTGATGGCAACGTCCAAGAGTTTCTAGTTGGCGATAATTCCCACCCGTTATCCCGGGAAATCTATTCGAAGTTGGATGAGATTGCAACAAAGCTAAAATCAGTTGGATACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAAGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGTGAGAAGTTAGCGATCGCCTTCGGGCTTATTAGTTTGACTCCATCTCAACCAATTCGAGTGGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTCATATCTAGAGTTTACTGCAGAGATATAATACTGCGAGATCGGTATCGATTCCATCATTTTCGAGACGGGCATTGCTCATGTATGGATTTTTGGTAAAGCAGCATAAGAATGGCTTCTATATTCACTTGCTCTGTTTGGTGCATGAGGTGAACTTTTCTGTAAATTGTGTATATTTCTTATTAAGTGTAAACTGAGTATTCAACTGACTAGGAAGGAATGATCTCTGTTAAATGAGTAGTAATTTGGAAACTTTTATTACTGTTCAATTATATCATATAAAACAATGTATGGTTAGAAATAAATTGAGTACCTTGAATA

Coding sequence (CDS)

ATGGCAGCTTCTCCACAGATTTCTTTCCCAAATTTCTCAATAGAAAATAACAATATTCCTTTCAGAAACCATCAAATTCTCTCCAGAATCAATCAATGTTCAAGTGCAAAGCAATTGAAGCAAGTTCACGCTCAGATGCTCCGCACCGGCCTCTTTTTCGACCCCTTCTCCGCCAGCAAGCTCCTAACAGTCTCCGCTCTTTCGTCCTTCTCCACTCTCGAGTATGCCCGCCAAATGTTCGACCAAATTTCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCGTACGCTTCCAGCTCCGACCCATTTCAGAGTTTCGTGACATTCCTGGATTTGCTCGATAAAAGTGATGATTTGCCCAACAATTTCACTTTCCCTTTTGTGATTAAGGCTGCTTCGGAGCTAAAAGCTTCGCGGGTCGGCAGAGCTGTTCATGGAATGGCGATTAAGATGTCGTTTGGTATGGATTTGTATATTCTTAATTCGCTTGTGCGATTCTATGGGGTATGTGGAGATTTGAACATGGCTGAGCGATTGTTTGAGGGTATTTCTAGCAAAGATGTGGTGTCTTGGAATTCGATGATCTCTGCTTTTGCTCAGGGGAACTGTCCAGAAGGTGCATTGGACTTGTTTTTGAAAATGGAAAGGGAGAATGTGATGCCAAACTCTGTGACAATGGTGGGTGTTTTATCTGCTTGTGCAAAGAAGTTGGATTTGGAGTTTGGGAGGTGGGTTTGTTCATACATTGAAAGGAAAGAAATTAAAGTGGATTTAACTCTGTGTAATGCCATGCTTGACATGTATACAAAATGTGGAAGTATTGCTGATGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCACCATGCTTGATGGGTATGCGAAAACGGGCGATTTTGATACTGCTCGGCAAGTGTTTGATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAACGTTCTCATATCTGCTTATGAACAAAATGGTAAGCCTAAGGAGGCTTTGGCCACTTTCAATGAGTTGCAGCTTAGTAAGATTGCAAAGCCTGATGAAGTCACTTTGGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGGTGGATTCATGTGTACATAAAAAGGGAGGGGATAGATCTAAACTGCCATTTGATTACTTCTCTTGTAGACATGTATGCTAAGTGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCACGGGCGTGGGAAGGCGGCAATCGAACTGTTCTTCAAAATGCAGGAAGCTAAGGTGAAGCCGAATGATGTGACGTTTACGAATGTACTATGTGCCTGTAGTCATGCTCGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTGCCTGGAACGAAGCACTACACATGTATGGTAGATATTCTCGGCCGTGCAGGGTTTCTCGAAGAAGCTATGGAGTTAATCAATGAAATGCCTGTAACCCCGAGCGCCTCCATTTGGGGTGCTTTACTTGGTGCCTGCAGGCTTCATATGAATATTGAGCTTGCAGAACTGGCTAGTAACCAATTGCTCAAGTTAGAGCCAAGAAATCATGGTGCTATTGTACTTTTATCCAACATTTATGCCAAAACAGGTAGATGGGACAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACTCTGAACTGAAAAAGGAACCAGGTTGTAGTTCAGTTGAAGTTGATGGCAACGTCCAAGAGTTTCTAGTTGGCGATAATTCCCACCCGTTATCCCGGGAAATCTATTCGAAGTTGGATGAGATTGCAACAAAGCTAAAATCAGTTGGATACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAAGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGTGAGAAGTTAGCGATCGCCTTCGGGCTTATTAGTTTGACTCCATCTCAACCAATTCGAGTGGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTCATATCTAGAGTTTACTGCAGAGATATAATACTGCGAGATCGGTATCGATTCCATCATTTTCGAGACGGGCATTGCTCATGTATGGATTTTTGGTAA

Protein sequence

MAASPQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEGISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW
Homology
BLAST of Tan0021030 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 935.6 bits (2417), Expect = 3.3e-271
Identity = 446/725 (61.52%), Postives = 568/725 (78.34%), Query Frame = 0

Query: 10  PNFSIENNNIPFRNHQ---ILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSA 69
           PNFS  N N P  N++    +S I +C S +QLKQ H  M+RTG F DP+SASKL  ++A
Sbjct: 16  PNFS--NPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAA 75

Query: 70  LSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNNFT 129
           LSSF++LEYAR++FD+I +PN + WNTLIRAYAS  DP  S   FLD++ +S   PN +T
Sbjct: 76  LSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYT 135

Query: 130 FPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEGIS 189
           FPF+IKAA+E+ +  +G+++HGMA+K + G D+++ NSL+  Y  CGDL+ A ++F  I 
Sbjct: 136 FPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIK 195

Query: 190 SKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRW 249
            KDVVSWNSMI+ F Q   P+ AL+LF KME E+V  + VTMVGVLSACAK  +LEFGR 
Sbjct: 196 EKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQ 255

Query: 250 VCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGD 309
           VCSYIE   + V+LTL NAMLDMYTKCGSI DA++LFD M E+D  +WTTMLDGYA + D
Sbjct: 256 VCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISED 315

Query: 310 FDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLS 369
           ++ AR+V ++MP K+I AWN LISAYEQNGKP EAL  F+ELQL K  K +++TLVSTLS
Sbjct: 316 YEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLS 375

Query: 370 ACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYV 429
           ACAQ+GA++LG WIH YIK+ GI +N H+ ++L+ MY+KCG LEK+ EVF SVE+RDV+V
Sbjct: 376 ACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFV 435

Query: 430 WSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEME 489
           WSAMI GL MHG G  A+++F+KMQEA VKPN VTFTNV CACSH  LVDE  + FH+ME
Sbjct: 436 WSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQME 495

Query: 490 PVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELA 549
             YG+VP  KHY C+VD+LGR+G+LE+A++ I  MP+ PS S+WGALLGAC++H N+ LA
Sbjct: 496 SNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLA 555

Query: 550 ELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGN 609
           E+A  +LL+LEPRN GA VLLSNIYAK G+W+ VSELRK MR + LKKEPGCSS+E+DG 
Sbjct: 556 EMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGM 615

Query: 610 VQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEK 669
           + EFL GDN+HP+S ++Y KL E+  KLKS GYEP  S +LQ+IEE+++KEQ+L+LHSEK
Sbjct: 616 IHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEK 675

Query: 670 LAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCS 729
           LAI +GLIS    + IRV+KNLR+CGDCH VAKLIS++Y R+II+RDRYRFHHFR+G CS
Sbjct: 676 LAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCS 735

Query: 730 CMDFW 732
           C DFW
Sbjct: 736 CNDFW 738

BLAST of Tan0021030 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 602.1 bits (1551), Expect = 8.7e-171
Identity = 318/765 (41.57%), Postives = 459/765 (60.00%), Query Frame = 0

Query: 5   PQISFPNFSIENNNIP----FRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASK 64
           P  S+P   + +++ P     RNH  LS ++ C + + L+ +HAQM++ GL    ++ SK
Sbjct: 11  PSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSK 70

Query: 65  LLTVSALS-SFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSD 124
           L+    LS  F  L YA  +F  I +PNL  WNT+ R +A SSDP  +   ++ ++    
Sbjct: 71  LIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLG- 130

Query: 125 DLPNNFTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNS--------------- 184
            LPN++TFPFV+K+ ++ KA + G+ +HG  +K+   +DLY+  S               
Sbjct: 131 LLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAH 190

Query: 185 ----------------LVRFYGVCGDLNMAERLFEGISSKDVVSWNSMISAFAQGNCPEG 244
                           L++ Y   G +  A++LF+ I  KDVVSWN+MIS +A+    + 
Sbjct: 191 KVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE 250

Query: 245 ALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNAMLD 304
           AL+LF  M + NV P+  TMV V+SACA+   +E GR V  +I+      +L + NA++D
Sbjct: 251 ALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALID 310

Query: 305 MYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVKEIAAWNVL 364
           +Y+KCG +  A  LF+ +P +DV SW T++ GY     +                     
Sbjct: 311 LYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY--------------------- 370

Query: 365 ISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKR-- 424
                     KEAL  F E+ L     P++VT++S L ACA LGAID+G WIHVYI +  
Sbjct: 371 ----------KEALLLFQEM-LRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRL 430

Query: 425 EGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIEL 484
           +G+     L TSL+DMYAKCG +E A +VF S+  + +  W+AMI G  MHGR  A+ +L
Sbjct: 431 KGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDL 490

Query: 485 FFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTCMVDILG 544
           F +M++  ++P+D+TF  +L ACSH+ ++D GR  F  M   Y + P  +HY CM+D+LG
Sbjct: 491 FSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLG 550

Query: 545 RAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRNHGAIVL 604
            +G  +EA E+IN M + P   IW +LL AC++H N+EL E  +  L+K+EP N G+ VL
Sbjct: 551 HSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVL 610

Query: 605 LSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLSREIYSK 664
           LSNIYA  GRW++V++ R L+ D  +KK PGCSS+E+D  V EF++GD  HP +REIY  
Sbjct: 611 LSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGM 670

Query: 665 LDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQPIRVVK 724
           L+E+   L+  G+ P+ S +LQ +EE + KE AL  HSEKLAIAFGLIS  P   + +VK
Sbjct: 671 LEEMEVLLEKAGFVPDTSEVLQEMEE-EWKEGALRHHSEKLAIAFGLISTKPGTKLTIVK 730

Query: 725 NLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW 732
           NLR+C +CHE  KLIS++Y R+II RDR RFHHFRDG CSC D+W
Sbjct: 731 NLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Tan0021030 vs. ExPASy Swiss-Prot
Match: O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 546.6 bits (1407), Expect = 4.3e-154
Identity = 277/713 (38.85%), Postives = 431/713 (60.45%), Query Frame = 0

Query: 26  ILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSALSSFSTLEYARQMFDQI-S 85
           IL +++ C S   +KQ+HA +LRT    +    S L  +S  SS   L YA  +F  I S
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 86  QPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNNFTFPFVIKAASELKASRVGR 145
            P    +N  +R  + SS+P ++ + F   +       + F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 146 AVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEGISSKDVVSWNSMISAFAQGN 205
            +HG+A K++   D ++    +  Y  CG +N A  +F+ +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 206 CPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCN 265
             + A  LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++   ++++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 266 AMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVKEIAA 325
           A++ MY   G +  A++ F +M  R++F  T M+ GY+K G  D A+ +FD    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 326 WNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 385
           W  +ISAY ++  P+EAL  F E+  S I KPD V++ S +SACA LG +D   W+H  I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 386 KREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAI 445
              G++    +  +L++MYAKCG L+   +VF  +  R+V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 446 ELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTCMVDI 505
            LF +M++  V+PN+VTF  VL  CSH+ LV+EG+  F  M   Y + P  +HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 506 LGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRNHGAI 565
            GRA  L EA+E+I  MPV  +  IWG+L+ ACR+H  +EL + A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 566 VLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLSREIY 625
           VL+SNIYA+  RW+ V  +R++M +  + KE G S ++ +G   EFL+GD  H  S EIY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 626 SKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQP--- 685
           +KLDE+ +KLK  GY P+   +L  +EE++ K+  L  HSEKLA+ FGL++    +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 686 ---IRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW 732
              IR+VKNLR+C DCH   KL+S+VY R+II+RDR RFH +++G CSC D+W
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of Tan0021030 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 532.7 bits (1371), Expect = 6.4e-150
Identity = 275/706 (38.95%), Postives = 426/706 (60.34%), Query Frame = 0

Query: 28  SRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSALSSFSTLEYARQMFDQISQPN 87
           S I+  +   QLKQ+HA++L  GL F  F  +KL  + A SSF  + +ARQ+FD + +P 
Sbjct: 26  SLIDSATHKAQLKQIHARLLVLGLQFSGFLITKL--IHASSSFGDITFARQVFDDLPRPQ 85

Query: 88  LYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNNFTFPFVIKAASELKASRVGRAVH 147
           ++ WN +IR Y S ++ FQ  +     +  +   P++FTFP ++KA S L   ++GR VH
Sbjct: 86  IFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVH 145

Query: 148 GMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEG--ISSKDVVSWNSMISAFAQGNC 207
               ++ F  D+++ N L+  Y  C  L  A  +FEG  +  + +VSW +++SA+AQ   
Sbjct: 146 AQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGE 205

Query: 208 PEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 267
           P  AL++F +M + +V P+ V +V VL+A     DL+ GR + + + +  ++++  L  +
Sbjct: 206 PMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLIS 265

Query: 268 MLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVKEIAAW 327
           +  MY KCG +A A+ LFD+M   ++  W  M+ GYAK                      
Sbjct: 266 LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK---------------------- 325

Query: 328 NVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 387
                    NG  +EA+  F+E+ ++K  +PD +++ S +SACAQ+G+++    ++ Y+ 
Sbjct: 326 ---------NGYAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVG 385

Query: 388 REGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIE 447
           R     +  + ++L+DM+AKCG++E A  VF    +RDV VWSAMI G G+HGR + AI 
Sbjct: 386 RSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAIS 445

Query: 448 LFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTCMVDIL 507
           L+  M+   V PNDVTF  +L AC+H+ +V EG  FF+ M   + + P  +HY C++D+L
Sbjct: 446 LYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 505

Query: 508 GRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRNHGAIV 567
           GRAG L++A E+I  MPV P  ++WGALL AC+ H ++EL E A+ QL  ++P N G  V
Sbjct: 506 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 565

Query: 568 LLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLSREIYS 627
            LSN+YA    WD+V+E+R  M++  L K+ GCS VEV G ++ F VGD SHP   EI  
Sbjct: 566 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 625

Query: 628 KLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQPIRVV 687
           +++ I ++LK  G+  NK   L  + +++  E+ L  HSE++AIA+GLIS     P+R+ 
Sbjct: 626 QVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHSERIAIAYGLISTPQGTPLRIT 685

Query: 688 KNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW 732
           KNLR C +CH   KLIS++  R+I++RD  RFHHF+DG CSC D+W
Sbjct: 686 KNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of Tan0021030 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 528.9 bits (1361), Expect = 9.3e-149
Identity = 277/711 (38.96%), Postives = 412/711 (57.95%), Query Frame = 0

Query: 26  ILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSALSSF--STLEYARQMFDQI 85
           +  +IN C + + L Q+HA  +++G   D  +A+++L   A S      L+YA ++F+Q+
Sbjct: 26  LFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQM 85

Query: 86  SQPNLYTWNTLIRAYASSSD--PFQSFVTFLDLLDKSDDLPNNFTFPFVIKAASELKASR 145
            Q N ++WNT+IR ++ S +     +   F +++      PN FTFP V+KA ++    +
Sbjct: 86  PQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQ 145

Query: 146 VGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLF-EGISSKDVVSWNSMISAF 205
            G+ +HG+A+K  FG D +++++LVR Y +CG +  A  LF + I  KD+V         
Sbjct: 146 EGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMV--------- 205

Query: 206 AQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDL 265
                                                                       
Sbjct: 206 ------------------------------------------------------------ 265

Query: 266 TLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVK 325
                M D   + G               ++  W  M+DGY + GD   AR +FD M  +
Sbjct: 266 ----VMTDRRKRDG---------------EIVLWNVMIDGYMRLGDCKAARMLFDKMRQR 325

Query: 326 EIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWI 385
            + +WN +IS Y  NG  K+A+  F E++   I +P+ VTLVS L A ++LG+++LG W+
Sbjct: 326 SVVSWNTMISGYSLNGFFKDAVEVFREMKKGDI-RPNYVTLVSVLPAISRLGSLELGEWL 385

Query: 386 HVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRG 445
           H+Y +  GI ++  L ++L+DMY+KCG +EKA+ VF  +   +V  WSAMI G  +HG+ 
Sbjct: 386 HLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQA 445

Query: 446 KAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTC 505
             AI+ F KM++A V+P+DV + N+L ACSH  LV+EGR +F +M  V G+ P  +HY C
Sbjct: 446 GDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGC 505

Query: 506 MVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRN 565
           MVD+LGR+G L+EA E I  MP+ P   IW ALLGACR+  N+E+ +  +N L+ + P +
Sbjct: 506 MVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHD 565

Query: 566 HGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLS 625
            GA V LSN+YA  G W +VSE+R  M++ +++K+PGCS +++DG + EF+V D+SHP +
Sbjct: 566 SGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKA 625

Query: 626 REIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQ 685
           +EI S L EI+ KL+  GY P  + +L  +EE+D KE  L  HSEK+A AFGLIS +P +
Sbjct: 626 KEINSMLVEISDKLRLAGYRPITTQVLLNLEEED-KENVLHYHSEKIATAFGLISTSPGK 646

Query: 686 PIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW 732
           PIR+VKNLRIC DCH   KLIS+VY R I +RDR RFHHF+DG CSCMD+W
Sbjct: 686 PIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of Tan0021030 vs. NCBI nr
Match: XP_038893523.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa hispida])

HSP 1 Score: 1359.4 bits (3517), Expect = 0.0e+00
Identity = 668/727 (91.88%), Postives = 689/727 (94.77%), Query Frame = 0

Query: 5   PQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTV 64
           P IS  NF   N+N+PFRNHQILS I+QCSS KQLKQVHA MLRTGLFFDPFSASKL T 
Sbjct: 7   PLISLQNFPTPNDNLPFRNHQILSTIDQCSSPKQLKQVHAHMLRTGLFFDPFSASKLFTA 66

Query: 65  SALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNN 124
           SALSSFSTL+YA  +FDQIS PNLYTWNTLIRAYASSSDPFQSFV FLDLLDK DDLPNN
Sbjct: 67  SALSSFSTLDYALNVFDQISHPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCDDLPNN 126

Query: 125 FTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEG 184
           FTFPFVIKAASELKASRVGRAVHGMAIK+SFGMDLYILNSLVRFYG CGDLNMAERLFEG
Sbjct: 127 FTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGTCGDLNMAERLFEG 186

Query: 185 ISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 244
           IS KDVVSWNSMISAFAQGNCPE ALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFG
Sbjct: 187 ISCKDVVSWNSMISAFAQGNCPEDALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 246

Query: 245 RWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKT 304
           RWVCSYIERKEIKVDLTLCNAMLDMYTKCGSI DAQKLFDEMPERDVFSWTTMLDGYAK 
Sbjct: 247 RWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKM 306

Query: 305 GDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 364
           GDFD AR+VFDAMPVKEIAAWNVLISAYEQNG PKEALATFNELQLSKIAKPDEVTLVST
Sbjct: 307 GDFDAARRVFDAMPVKEIAAWNVLISAYEQNGNPKEALATFNELQLSKIAKPDEVTLVST 366

Query: 365 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDV 424
           LSAC+QLGAIDLGGWIHVYIKREGIDLNCHLI+SLVDMYAKCGALEKALEVFYSVE RDV
Sbjct: 367 LSACSQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEVRDV 426

Query: 425 YVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHE 484
           YVWSAMIAGLGMHGRGKAAI LFF+MQEAKVKPN VTF NVLCACSHA LVDEGRAF HE
Sbjct: 427 YVWSAMIAGLGMHGRGKAAINLFFEMQEAKVKPNSVTFMNVLCACSHAGLVDEGRAFLHE 486

Query: 485 MEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIE 544
           MEP+YGVVPGTKHY CMVDILGRAGFLEEAMELINEMP+TPSASIWGALLGAC LHMN+E
Sbjct: 487 MEPIYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASIWGALLGACSLHMNVE 546

Query: 545 LAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVD 604
           LAELAS+QLLKLEPRNHGAIVLLSNIYAKTGRW+KVSELRKLMRDS+LKKEPGCSS+EVD
Sbjct: 547 LAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSKLKKEPGCSSIEVD 606

Query: 605 GNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 664
           GNV EFLVGDNSHPLS +IY KLDEIATKLKSVGYEPNKSHLLQ IEEDDLKEQALSLHS
Sbjct: 607 GNVHEFLVGDNSHPLSSKIYLKLDEIATKLKSVGYEPNKSHLLQFIEEDDLKEQALSLHS 666

Query: 665 EKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGH 724
           EKLAIAFGLISL PSQPIRVVKNLRICGDCHEVAKL+SRVY RDI+LRDRYRFHHFRDGH
Sbjct: 667 EKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRDGH 726

Query: 725 CSCMDFW 732
           CSC D+W
Sbjct: 727 CSCRDYW 733

BLAST of Tan0021030 vs. NCBI nr
Match: XP_022964665.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1333.5 bits (3450), Expect = 0.0e+00
Identity = 653/729 (89.57%), Postives = 687/729 (94.24%), Query Frame = 0

Query: 3   ASPQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLL 62
           ++P +S PN SI +NN+ FRNHQILS I+QCSS KQLKQVHAQMLRTGLFFDPFSASKL+
Sbjct: 5   SAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSASKLI 64

Query: 63  TVSALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLP 122
             SAL S STLEYAR +FDQI  PNLYTWNTLIRAYASS+DPFQSFV FL LLD+ DDLP
Sbjct: 65  AASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDECDDLP 124

Query: 123 NNFTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLF 182
           NNFTFPFVIKAASELKASRVGRAVHGMAIK+S GMD YILNSLVRFYG CGDLNMAERLF
Sbjct: 125 NNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMAERLF 184

Query: 183 EGISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLE 242
           EGIS KDVVSWNSMISAFAQGNCPE AL+LFLKME  NVMPNSVTMVGVLSACAKKLDLE
Sbjct: 185 EGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKKLDLE 244

Query: 243 FGRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYA 302
           FGRWVCSYIERKEI VDLTLCNAMLDMYTKCGSI DA+KLFDEMPERDVFSWTTMLDGYA
Sbjct: 245 FGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTMLDGYA 304

Query: 303 KTGDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLV 362
           K GDF+ AR+VFD MPVKEIAAWN LISAYE+NGKPKEALATFNELQLSKIAKPDEVTLV
Sbjct: 305 KMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDEVTLV 364

Query: 363 STLSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEER 422
           S+LSACAQLGAIDLGGWIHVYIKREGI+LN HLITSL+DMYAKCGALEKALEVFY+VEE+
Sbjct: 365 SSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYAVEEK 424

Query: 423 DVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFF 482
           DVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTN+LCACSHA LVDEGRA F
Sbjct: 425 DVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEGRALF 484

Query: 483 HEMEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMN 542
           HEMEPVYGVVPGTKHY CMVDILGRAGFLEEAMELINEMP TPSAS+WGALLGAC LHMN
Sbjct: 485 HEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACSLHMN 544

Query: 543 IELAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVE 602
           +ELAELAS+QLLKLEPRNHGAI+LLSN+YAKTGRWDKVSELRKLMRDSELKKEPGCSSVE
Sbjct: 545 VELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGCSSVE 604

Query: 603 VDGNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSL 662
           V+G V EFLVGDNSHPLSR+IYSKLDEIA KLKSVGYEPNKSHLLQLIEEDD+KE ALSL
Sbjct: 605 VNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEHALSL 664

Query: 663 HSEKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRD 722
           HSEKLAIAFGLISL PSQPIRVVKNLRICGDCHEVAKLISRVY RDI+++DRYRFHHFRD
Sbjct: 665 HSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFHHFRD 724

Query: 723 GHCSCMDFW 732
           GHCSCMD+W
Sbjct: 725 GHCSCMDYW 733

BLAST of Tan0021030 vs. NCBI nr
Match: KAA0031814.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1330.5 bits (3442), Expect = 0.0e+00
Identity = 652/727 (89.68%), Postives = 685/727 (94.22%), Query Frame = 0

Query: 5   PQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTV 64
           P IS  NFS  NNN+PFRNHQILS I++CSS+KQLK+VHA+MLRTGLFFDPFSASKL T 
Sbjct: 7   PLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66

Query: 65  SALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNN 124
           SALSSFSTL+YAR +FDQI QPNLYTWN LIRAYASSSDPFQSFV FLDLLDK +DLPNN
Sbjct: 67  SALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 126

Query: 125 FTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEG 184
           FTFPFVIKAASELKASRVG AVHGMAIK+SFGMDLYILNSLVRFYG CGDL+MAERLF+G
Sbjct: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186

Query: 185 ISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 244
           IS KDVVSWNSMISAFAQGNCPE AL+LFLKMERENVMPNSVTMV VLSACAKKLDLEFG
Sbjct: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKKLDLEFG 246

Query: 245 RWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKT 304
           RWVCSYIERK IK+DLTL NAMLDMYTKCGS+ DAQKLFDEMPERDVFSWT MLDGYAK 
Sbjct: 247 RWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306

Query: 305 GDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 364
           GD+D AR VF+AMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST
Sbjct: 307 GDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 366

Query: 365 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDV 424
           LSACAQLGAIDLGGWIHVYIKREGIDLNCHLI+SLVDMYAKCGALEKALEVFYSVEERDV
Sbjct: 367 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEERDV 426

Query: 425 YVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHE 484
           YVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN VTFTNVLCACSH  LVDEGR FFHE
Sbjct: 427 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEGRVFFHE 486

Query: 485 MEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIE 544
           MEPVYGVVP TKHY CMVDILGRAGFLEEAMELINEM +TPSAS+WGALLGAC LHMN+E
Sbjct: 487 MEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACSLHMNVE 546

Query: 545 LAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVD 604
           L ELAS+QLLKLEPRNHGAIVLLSNIYAKTGRW+KVSELRKLMRD+ELKKEPGCSS+EV+
Sbjct: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEVN 606

Query: 605 GNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 664
           GNV EFLVGDN HPLS  IYSKLD+IATKLK VGYEPNKSHLLQLIEEDDLKEQALSLHS
Sbjct: 607 GNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQALSLHS 666

Query: 665 EKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGH 724
           EKLAIAFGL+SL PSQPIRVVKNLRICGDCHE AKL+SRVY RDI+LRDRYRFHHFRDGH
Sbjct: 667 EKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFHHFRDGH 726

Query: 725 CSCMDFW 732
           CSCMD+W
Sbjct: 727 CSCMDYW 733

BLAST of Tan0021030 vs. NCBI nr
Match: XP_022150874.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Momordica charantia])

HSP 1 Score: 1328.9 bits (3438), Expect = 0.0e+00
Identity = 649/728 (89.15%), Postives = 682/728 (93.68%), Query Frame = 0

Query: 4   SPQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLT 63
           +P IS PNF + +NN+PF+NHQILS I++CSSAK+LKQVHA MLRTGLFFDPFSASKL  
Sbjct: 9   APAISLPNFPVTSNNLPFQNHQILSVIDRCSSAKELKQVHAHMLRTGLFFDPFSASKLFA 68

Query: 64  VSALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPN 123
            SALSSFSTL+YA  +FDQI QPNLYTWNTLIRAYASSSDPFQSFV FL+LLD+ DDLPN
Sbjct: 69  ASALSSFSTLQYAHDLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVLFLELLDRCDDLPN 128

Query: 124 NFTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFE 183
           NFTFPFVIKAASELKASRVG+AVHGMAIKMS GMD+YILNSLVRFYGVCGDLNMAERLF 
Sbjct: 129 NFTFPFVIKAASELKASRVGKAVHGMAIKMSLGMDVYILNSLVRFYGVCGDLNMAERLFA 188

Query: 184 GISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEF 243
            I+SKDVVSWNSMISAF QGNCPE ALDLFLKME ENV PNSVTMVGVLSACAKKLDLEF
Sbjct: 189 SIASKDVVSWNSMISAFTQGNCPEDALDLFLKMEGENVKPNSVTMVGVLSACAKKLDLEF 248

Query: 244 GRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAK 303
           GRWVC YIERKEI+VDLTL NA LDMYTKCGSI DAQKLFDEMPERDVFSWTTMLDGYAK
Sbjct: 249 GRWVCEYIERKEIRVDLTLINATLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAK 308

Query: 304 TGDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVS 363
            GDFD ARQVFD MPVKEIAAWNVLISAYEQNGKPKEALATFNELQL KIAKPDEVTLVS
Sbjct: 309 MGDFDAARQVFDTMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLRKIAKPDEVTLVS 368

Query: 364 TLSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERD 423
           TLSACAQLGAIDLGGWIHVY+KREGIDLNCHLITSLVDMYAKCG LEKALEVF+SVEERD
Sbjct: 369 TLSACAQLGAIDLGGWIHVYMKREGIDLNCHLITSLVDMYAKCGDLEKALEVFHSVEERD 428

Query: 424 VYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFH 483
           VYVWSAMIAGLGMHGRGKAAI+LFFKMQEAKV PN VTFTN+LCACSHA LVD GR FFH
Sbjct: 429 VYVWSAMIAGLGMHGRGKAAIDLFFKMQEAKVSPNSVTFTNILCACSHAGLVDAGRVFFH 488

Query: 484 EMEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNI 543
           EMEPVYGVVPGTKHY CMVDILGRAG L EAMELINEMPVTPSAS+WGALLGACRLHMN+
Sbjct: 489 EMEPVYGVVPGTKHYACMVDILGRAGLLGEAMELINEMPVTPSASVWGALLGACRLHMNV 548

Query: 544 ELAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEV 603
           ELAELA ++LLKLEPRNHGAIVLLSNIYAKT RWDKVSELR LMRDS+LKKEPGCSSVEV
Sbjct: 549 ELAELACDRLLKLEPRNHGAIVLLSNIYAKTERWDKVSELRNLMRDSDLKKEPGCSSVEV 608

Query: 604 DGNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLH 663
           +G+V EFLVGDNSHPLS +IYSKLDEIA KLKSVGYEPNKSHLLQL+EEDDLKEQALSLH
Sbjct: 609 NGSVHEFLVGDNSHPLSSKIYSKLDEIAAKLKSVGYEPNKSHLLQLVEEDDLKEQALSLH 668

Query: 664 SEKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDG 723
           SEKLAIAFGLISL PSQPIRVVKNLR+CGDCHEVAKL+SRVY RDI+LRDRYRFHHFRDG
Sbjct: 669 SEKLAIAFGLISLAPSQPIRVVKNLRVCGDCHEVAKLVSRVYGRDILLRDRYRFHHFRDG 728

Query: 724 HCSCMDFW 732
           HCSCMD+W
Sbjct: 729 HCSCMDYW 736

BLAST of Tan0021030 vs. NCBI nr
Match: XP_008457379.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis melo] >TYJ97320.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1328.2 bits (3436), Expect = 0.0e+00
Identity = 651/727 (89.55%), Postives = 684/727 (94.09%), Query Frame = 0

Query: 5   PQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTV 64
           P IS  NFS  NNN+PFRNHQILS I++CSS+KQLK+VHA+MLRTGLFFDPFSASKL T 
Sbjct: 7   PLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66

Query: 65  SALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNN 124
           SALSSFSTL+YAR +FDQI QPNLYTWN LIRAYASSSDPFQSFV FLDLLDK +DLPNN
Sbjct: 67  SALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 126

Query: 125 FTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEG 184
           FTFPFVIKAASELKASRVG AVHGMAIK+SFGMDLYILNSLVRFYG CGDL+MAERLF+G
Sbjct: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186

Query: 185 ISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 244
           IS KDVVSWNSMISAFAQGNCPE AL+LFLKMERENVMPNSVTMV VLSACAKKLDLEFG
Sbjct: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKKLDLEFG 246

Query: 245 RWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKT 304
           RWVCSYIERK IK+DLTL NAMLDMYTKCGS+ DAQKLFDEMPERDVFSWT MLDGYAK 
Sbjct: 247 RWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306

Query: 305 GDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 364
           GD+D AR VF+AMPVKEIAAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDEVTLVST
Sbjct: 307 GDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 366

Query: 365 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDV 424
           LSACAQLGAIDLGGWIHVYIKREGIDLNCHLI+SLVDMYAKCGALEKALEVFYSVEERDV
Sbjct: 367 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEERDV 426

Query: 425 YVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHE 484
           YVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN VTFTNVLCACSH  LVDEGR FFHE
Sbjct: 427 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEGRVFFHE 486

Query: 485 MEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIE 544
           MEPVYGVVP TKHY CMVDILGRAGFLEEAMELINEM +TPSAS+WGALLGAC LHMN+E
Sbjct: 487 MEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACSLHMNVE 546

Query: 545 LAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVD 604
           L ELAS+QLLKLEPRNHGAIVLLSNIYAKTGRW+KVSELRKLMRD+ELKKEPGCSS+EV+
Sbjct: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEVN 606

Query: 605 GNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 664
           GNV EFLVGDN HPLS  IYSKLD+IATKLK VGYEPNKSHLLQLIEEDDLKEQALSLHS
Sbjct: 607 GNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQALSLHS 666

Query: 665 EKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGH 724
           EKLAIAFGL+SL PSQPIRVVKNLRICGDCHE AKL+SRVY RDI+LRDRYRFHHFRDGH
Sbjct: 667 EKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFHHFRDGH 726

Query: 725 CSCMDFW 732
           CSCMD+W
Sbjct: 727 CSCMDYW 733

BLAST of Tan0021030 vs. ExPASy TrEMBL
Match: A0A6J1HLG4 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111464676 PE=3 SV=1)

HSP 1 Score: 1333.5 bits (3450), Expect = 0.0e+00
Identity = 653/729 (89.57%), Postives = 687/729 (94.24%), Query Frame = 0

Query: 3   ASPQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLL 62
           ++P +S PN SI +NN+ FRNHQILS I+QCSS KQLKQVHAQMLRTGLFFDPFSASKL+
Sbjct: 5   SAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSASKLI 64

Query: 63  TVSALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLP 122
             SAL S STLEYAR +FDQI  PNLYTWNTLIRAYASS+DPFQSFV FL LLD+ DDLP
Sbjct: 65  AASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDECDDLP 124

Query: 123 NNFTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLF 182
           NNFTFPFVIKAASELKASRVGRAVHGMAIK+S GMD YILNSLVRFYG CGDLNMAERLF
Sbjct: 125 NNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMAERLF 184

Query: 183 EGISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLE 242
           EGIS KDVVSWNSMISAFAQGNCPE AL+LFLKME  NVMPNSVTMVGVLSACAKKLDLE
Sbjct: 185 EGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKKLDLE 244

Query: 243 FGRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYA 302
           FGRWVCSYIERKEI VDLTLCNAMLDMYTKCGSI DA+KLFDEMPERDVFSWTTMLDGYA
Sbjct: 245 FGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTMLDGYA 304

Query: 303 KTGDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLV 362
           K GDF+ AR+VFD MPVKEIAAWN LISAYE+NGKPKEALATFNELQLSKIAKPDEVTLV
Sbjct: 305 KMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDEVTLV 364

Query: 363 STLSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEER 422
           S+LSACAQLGAIDLGGWIHVYIKREGI+LN HLITSL+DMYAKCGALEKALEVFY+VEE+
Sbjct: 365 SSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYAVEEK 424

Query: 423 DVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFF 482
           DVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTN+LCACSHA LVDEGRA F
Sbjct: 425 DVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEGRALF 484

Query: 483 HEMEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMN 542
           HEMEPVYGVVPGTKHY CMVDILGRAGFLEEAMELINEMP TPSAS+WGALLGAC LHMN
Sbjct: 485 HEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACSLHMN 544

Query: 543 IELAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVE 602
           +ELAELAS+QLLKLEPRNHGAI+LLSN+YAKTGRWDKVSELRKLMRDSELKKEPGCSSVE
Sbjct: 545 VELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGCSSVE 604

Query: 603 VDGNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSL 662
           V+G V EFLVGDNSHPLSR+IYSKLDEIA KLKSVGYEPNKSHLLQLIEEDD+KE ALSL
Sbjct: 605 VNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEHALSL 664

Query: 663 HSEKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRD 722
           HSEKLAIAFGLISL PSQPIRVVKNLRICGDCHEVAKLISRVY RDI+++DRYRFHHFRD
Sbjct: 665 HSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFHHFRD 724

Query: 723 GHCSCMDFW 732
           GHCSCMD+W
Sbjct: 725 GHCSCMDYW 733

BLAST of Tan0021030 vs. ExPASy TrEMBL
Match: A0A5A7SKX2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00570 PE=3 SV=1)

HSP 1 Score: 1330.5 bits (3442), Expect = 0.0e+00
Identity = 652/727 (89.68%), Postives = 685/727 (94.22%), Query Frame = 0

Query: 5   PQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTV 64
           P IS  NFS  NNN+PFRNHQILS I++CSS+KQLK+VHA+MLRTGLFFDPFSASKL T 
Sbjct: 7   PLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66

Query: 65  SALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNN 124
           SALSSFSTL+YAR +FDQI QPNLYTWN LIRAYASSSDPFQSFV FLDLLDK +DLPNN
Sbjct: 67  SALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 126

Query: 125 FTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEG 184
           FTFPFVIKAASELKASRVG AVHGMAIK+SFGMDLYILNSLVRFYG CGDL+MAERLF+G
Sbjct: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186

Query: 185 ISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 244
           IS KDVVSWNSMISAFAQGNCPE AL+LFLKMERENVMPNSVTMV VLSACAKKLDLEFG
Sbjct: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKKLDLEFG 246

Query: 245 RWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKT 304
           RWVCSYIERK IK+DLTL NAMLDMYTKCGS+ DAQKLFDEMPERDVFSWT MLDGYAK 
Sbjct: 247 RWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306

Query: 305 GDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 364
           GD+D AR VF+AMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST
Sbjct: 307 GDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 366

Query: 365 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDV 424
           LSACAQLGAIDLGGWIHVYIKREGIDLNCHLI+SLVDMYAKCGALEKALEVFYSVEERDV
Sbjct: 367 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEERDV 426

Query: 425 YVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHE 484
           YVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN VTFTNVLCACSH  LVDEGR FFHE
Sbjct: 427 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEGRVFFHE 486

Query: 485 MEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIE 544
           MEPVYGVVP TKHY CMVDILGRAGFLEEAMELINEM +TPSAS+WGALLGAC LHMN+E
Sbjct: 487 MEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACSLHMNVE 546

Query: 545 LAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVD 604
           L ELAS+QLLKLEPRNHGAIVLLSNIYAKTGRW+KVSELRKLMRD+ELKKEPGCSS+EV+
Sbjct: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEVN 606

Query: 605 GNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 664
           GNV EFLVGDN HPLS  IYSKLD+IATKLK VGYEPNKSHLLQLIEEDDLKEQALSLHS
Sbjct: 607 GNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQALSLHS 666

Query: 665 EKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGH 724
           EKLAIAFGL+SL PSQPIRVVKNLRICGDCHE AKL+SRVY RDI+LRDRYRFHHFRDGH
Sbjct: 667 EKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFHHFRDGH 726

Query: 725 CSCMDFW 732
           CSCMD+W
Sbjct: 727 CSCMDYW 733

BLAST of Tan0021030 vs. ExPASy TrEMBL
Match: A0A6J1DBC5 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018924 PE=3 SV=1)

HSP 1 Score: 1328.9 bits (3438), Expect = 0.0e+00
Identity = 649/728 (89.15%), Postives = 682/728 (93.68%), Query Frame = 0

Query: 4   SPQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLT 63
           +P IS PNF + +NN+PF+NHQILS I++CSSAK+LKQVHA MLRTGLFFDPFSASKL  
Sbjct: 9   APAISLPNFPVTSNNLPFQNHQILSVIDRCSSAKELKQVHAHMLRTGLFFDPFSASKLFA 68

Query: 64  VSALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPN 123
            SALSSFSTL+YA  +FDQI QPNLYTWNTLIRAYASSSDPFQSFV FL+LLD+ DDLPN
Sbjct: 69  ASALSSFSTLQYAHDLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVLFLELLDRCDDLPN 128

Query: 124 NFTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFE 183
           NFTFPFVIKAASELKASRVG+AVHGMAIKMS GMD+YILNSLVRFYGVCGDLNMAERLF 
Sbjct: 129 NFTFPFVIKAASELKASRVGKAVHGMAIKMSLGMDVYILNSLVRFYGVCGDLNMAERLFA 188

Query: 184 GISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEF 243
            I+SKDVVSWNSMISAF QGNCPE ALDLFLKME ENV PNSVTMVGVLSACAKKLDLEF
Sbjct: 189 SIASKDVVSWNSMISAFTQGNCPEDALDLFLKMEGENVKPNSVTMVGVLSACAKKLDLEF 248

Query: 244 GRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAK 303
           GRWVC YIERKEI+VDLTL NA LDMYTKCGSI DAQKLFDEMPERDVFSWTTMLDGYAK
Sbjct: 249 GRWVCEYIERKEIRVDLTLINATLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAK 308

Query: 304 TGDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVS 363
            GDFD ARQVFD MPVKEIAAWNVLISAYEQNGKPKEALATFNELQL KIAKPDEVTLVS
Sbjct: 309 MGDFDAARQVFDTMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLRKIAKPDEVTLVS 368

Query: 364 TLSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERD 423
           TLSACAQLGAIDLGGWIHVY+KREGIDLNCHLITSLVDMYAKCG LEKALEVF+SVEERD
Sbjct: 369 TLSACAQLGAIDLGGWIHVYMKREGIDLNCHLITSLVDMYAKCGDLEKALEVFHSVEERD 428

Query: 424 VYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFH 483
           VYVWSAMIAGLGMHGRGKAAI+LFFKMQEAKV PN VTFTN+LCACSHA LVD GR FFH
Sbjct: 429 VYVWSAMIAGLGMHGRGKAAIDLFFKMQEAKVSPNSVTFTNILCACSHAGLVDAGRVFFH 488

Query: 484 EMEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNI 543
           EMEPVYGVVPGTKHY CMVDILGRAG L EAMELINEMPVTPSAS+WGALLGACRLHMN+
Sbjct: 489 EMEPVYGVVPGTKHYACMVDILGRAGLLGEAMELINEMPVTPSASVWGALLGACRLHMNV 548

Query: 544 ELAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEV 603
           ELAELA ++LLKLEPRNHGAIVLLSNIYAKT RWDKVSELR LMRDS+LKKEPGCSSVEV
Sbjct: 549 ELAELACDRLLKLEPRNHGAIVLLSNIYAKTERWDKVSELRNLMRDSDLKKEPGCSSVEV 608

Query: 604 DGNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLH 663
           +G+V EFLVGDNSHPLS +IYSKLDEIA KLKSVGYEPNKSHLLQL+EEDDLKEQALSLH
Sbjct: 609 NGSVHEFLVGDNSHPLSSKIYSKLDEIAAKLKSVGYEPNKSHLLQLVEEDDLKEQALSLH 668

Query: 664 SEKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDG 723
           SEKLAIAFGLISL PSQPIRVVKNLR+CGDCHEVAKL+SRVY RDI+LRDRYRFHHFRDG
Sbjct: 669 SEKLAIAFGLISLAPSQPIRVVKNLRVCGDCHEVAKLVSRVYGRDILLRDRYRFHHFRDG 728

Query: 724 HCSCMDFW 732
           HCSCMD+W
Sbjct: 729 HCSCMDYW 736

BLAST of Tan0021030 vs. ExPASy TrEMBL
Match: A0A5D3BBW6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001260 PE=3 SV=1)

HSP 1 Score: 1328.2 bits (3436), Expect = 0.0e+00
Identity = 651/727 (89.55%), Postives = 684/727 (94.09%), Query Frame = 0

Query: 5   PQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTV 64
           P IS  NFS  NNN+PFRNHQILS I++CSS+KQLK+VHA+MLRTGLFFDPFSASKL T 
Sbjct: 7   PLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66

Query: 65  SALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNN 124
           SALSSFSTL+YAR +FDQI QPNLYTWN LIRAYASSSDPFQSFV FLDLLDK +DLPNN
Sbjct: 67  SALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 126

Query: 125 FTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEG 184
           FTFPFVIKAASELKASRVG AVHGMAIK+SFGMDLYILNSLVRFYG CGDL+MAERLF+G
Sbjct: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186

Query: 185 ISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 244
           IS KDVVSWNSMISAFAQGNCPE AL+LFLKMERENVMPNSVTMV VLSACAKKLDLEFG
Sbjct: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKKLDLEFG 246

Query: 245 RWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKT 304
           RWVCSYIERK IK+DLTL NAMLDMYTKCGS+ DAQKLFDEMPERDVFSWT MLDGYAK 
Sbjct: 247 RWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306

Query: 305 GDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 364
           GD+D AR VF+AMPVKEIAAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDEVTLVST
Sbjct: 307 GDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 366

Query: 365 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDV 424
           LSACAQLGAIDLGGWIHVYIKREGIDLNCHLI+SLVDMYAKCGALEKALEVFYSVEERDV
Sbjct: 367 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEERDV 426

Query: 425 YVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHE 484
           YVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN VTFTNVLCACSH  LVDEGR FFHE
Sbjct: 427 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEGRVFFHE 486

Query: 485 MEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIE 544
           MEPVYGVVP TKHY CMVDILGRAGFLEEAMELINEM +TPSAS+WGALLGAC LHMN+E
Sbjct: 487 MEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACSLHMNVE 546

Query: 545 LAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVD 604
           L ELAS+QLLKLEPRNHGAIVLLSNIYAKTGRW+KVSELRKLMRD+ELKKEPGCSS+EV+
Sbjct: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEVN 606

Query: 605 GNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 664
           GNV EFLVGDN HPLS  IYSKLD+IATKLK VGYEPNKSHLLQLIEEDDLKEQALSLHS
Sbjct: 607 GNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQALSLHS 666

Query: 665 EKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGH 724
           EKLAIAFGL+SL PSQPIRVVKNLRICGDCHE AKL+SRVY RDI+LRDRYRFHHFRDGH
Sbjct: 667 EKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFHHFRDGH 726

Query: 725 CSCMDFW 732
           CSCMD+W
Sbjct: 727 CSCMDYW 733

BLAST of Tan0021030 vs. ExPASy TrEMBL
Match: A0A1S3C623 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103497087 PE=3 SV=1)

HSP 1 Score: 1328.2 bits (3436), Expect = 0.0e+00
Identity = 651/727 (89.55%), Postives = 684/727 (94.09%), Query Frame = 0

Query: 5   PQISFPNFSIENNNIPFRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTV 64
           P IS  NFS  NNN+PFRNHQILS I++CSS+KQLK+VHA+MLRTGLFFDPFSASKL T 
Sbjct: 7   PLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66

Query: 65  SALSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNN 124
           SALSSFSTL+YAR +FDQI QPNLYTWN LIRAYASSSDPFQSFV FLDLLDK +DLPNN
Sbjct: 67  SALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 126

Query: 125 FTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEG 184
           FTFPFVIKAASELKASRVG AVHGMAIK+SFGMDLYILNSLVRFYG CGDL+MAERLF+G
Sbjct: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186

Query: 185 ISSKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 244
           IS KDVVSWNSMISAFAQGNCPE AL+LFLKMERENVMPNSVTMV VLSACAKKLDLEFG
Sbjct: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKKLDLEFG 246

Query: 245 RWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKT 304
           RWVCSYIERK IK+DLTL NAMLDMYTKCGS+ DAQKLFDEMPERDVFSWT MLDGYAK 
Sbjct: 247 RWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306

Query: 305 GDFDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 364
           GD+D AR VF+AMPVKEIAAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDEVTLVST
Sbjct: 307 GDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 366

Query: 365 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDV 424
           LSACAQLGAIDLGGWIHVYIKREGIDLNCHLI+SLVDMYAKCGALEKALEVFYSVEERDV
Sbjct: 367 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEERDV 426

Query: 425 YVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHE 484
           YVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN VTFTNVLCACSH  LVDEGR FFHE
Sbjct: 427 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEGRVFFHE 486

Query: 485 MEPVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIE 544
           MEPVYGVVP TKHY CMVDILGRAGFLEEAMELINEM +TPSAS+WGALLGAC LHMN+E
Sbjct: 487 MEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACSLHMNVE 546

Query: 545 LAELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVD 604
           L ELAS+QLLKLEPRNHGAIVLLSNIYAKTGRW+KVSELRKLMRD+ELKKEPGCSS+EV+
Sbjct: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEVN 606

Query: 605 GNVQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 664
           GNV EFLVGDN HPLS  IYSKLD+IATKLK VGYEPNKSHLLQLIEEDDLKEQALSLHS
Sbjct: 607 GNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQALSLHS 666

Query: 665 EKLAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGH 724
           EKLAIAFGL+SL PSQPIRVVKNLRICGDCHE AKL+SRVY RDI+LRDRYRFHHFRDGH
Sbjct: 667 EKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFHHFRDGH 726

Query: 725 CSCMDFW 732
           CSCMD+W
Sbjct: 727 CSCMDYW 733

BLAST of Tan0021030 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 935.6 bits (2417), Expect = 2.3e-272
Identity = 446/725 (61.52%), Postives = 568/725 (78.34%), Query Frame = 0

Query: 10  PNFSIENNNIPFRNHQ---ILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSA 69
           PNFS  N N P  N++    +S I +C S +QLKQ H  M+RTG F DP+SASKL  ++A
Sbjct: 16  PNFS--NPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAA 75

Query: 70  LSSFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNNFT 129
           LSSF++LEYAR++FD+I +PN + WNTLIRAYAS  DP  S   FLD++ +S   PN +T
Sbjct: 76  LSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYT 135

Query: 130 FPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEGIS 189
           FPF+IKAA+E+ +  +G+++HGMA+K + G D+++ NSL+  Y  CGDL+ A ++F  I 
Sbjct: 136 FPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIK 195

Query: 190 SKDVVSWNSMISAFAQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRW 249
            KDVVSWNSMI+ F Q   P+ AL+LF KME E+V  + VTMVGVLSACAK  +LEFGR 
Sbjct: 196 EKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQ 255

Query: 250 VCSYIERKEIKVDLTLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGD 309
           VCSYIE   + V+LTL NAMLDMYTKCGSI DA++LFD M E+D  +WTTMLDGYA + D
Sbjct: 256 VCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISED 315

Query: 310 FDTARQVFDAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLS 369
           ++ AR+V ++MP K+I AWN LISAYEQNGKP EAL  F+ELQL K  K +++TLVSTLS
Sbjct: 316 YEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLS 375

Query: 370 ACAQLGAIDLGGWIHVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYV 429
           ACAQ+GA++LG WIH YIK+ GI +N H+ ++L+ MY+KCG LEK+ EVF SVE+RDV+V
Sbjct: 376 ACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFV 435

Query: 430 WSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEME 489
           WSAMI GL MHG G  A+++F+KMQEA VKPN VTFTNV CACSH  LVDE  + FH+ME
Sbjct: 436 WSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQME 495

Query: 490 PVYGVVPGTKHYTCMVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELA 549
             YG+VP  KHY C+VD+LGR+G+LE+A++ I  MP+ PS S+WGALLGAC++H N+ LA
Sbjct: 496 SNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLA 555

Query: 550 ELASNQLLKLEPRNHGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGN 609
           E+A  +LL+LEPRN GA VLLSNIYAK G+W+ VSELRK MR + LKKEPGCSS+E+DG 
Sbjct: 556 EMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGM 615

Query: 610 VQEFLVGDNSHPLSREIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEK 669
           + EFL GDN+HP+S ++Y KL E+  KLKS GYEP  S +LQ+IEE+++KEQ+L+LHSEK
Sbjct: 616 IHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEK 675

Query: 670 LAIAFGLISLTPSQPIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCS 729
           LAI +GLIS    + IRV+KNLR+CGDCH VAKLIS++Y R+II+RDRYRFHHFR+G CS
Sbjct: 676 LAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCS 735

Query: 730 CMDFW 732
           C DFW
Sbjct: 736 CNDFW 738

BLAST of Tan0021030 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 602.1 bits (1551), Expect = 6.1e-172
Identity = 318/765 (41.57%), Postives = 459/765 (60.00%), Query Frame = 0

Query: 5   PQISFPNFSIENNNIP----FRNHQILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASK 64
           P  S+P   + +++ P     RNH  LS ++ C + + L+ +HAQM++ GL    ++ SK
Sbjct: 11  PSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSK 70

Query: 65  LLTVSALS-SFSTLEYARQMFDQISQPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSD 124
           L+    LS  F  L YA  +F  I +PNL  WNT+ R +A SSDP  +   ++ ++    
Sbjct: 71  LIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLG- 130

Query: 125 DLPNNFTFPFVIKAASELKASRVGRAVHGMAIKMSFGMDLYILNS--------------- 184
            LPN++TFPFV+K+ ++ KA + G+ +HG  +K+   +DLY+  S               
Sbjct: 131 LLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAH 190

Query: 185 ----------------LVRFYGVCGDLNMAERLFEGISSKDVVSWNSMISAFAQGNCPEG 244
                           L++ Y   G +  A++LF+ I  KDVVSWN+MIS +A+    + 
Sbjct: 191 KVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE 250

Query: 245 ALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNAMLD 304
           AL+LF  M + NV P+  TMV V+SACA+   +E GR V  +I+      +L + NA++D
Sbjct: 251 ALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALID 310

Query: 305 MYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVKEIAAWNVL 364
           +Y+KCG +  A  LF+ +P +DV SW T++ GY     +                     
Sbjct: 311 LYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY--------------------- 370

Query: 365 ISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKR-- 424
                     KEAL  F E+ L     P++VT++S L ACA LGAID+G WIHVYI +  
Sbjct: 371 ----------KEALLLFQEM-LRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRL 430

Query: 425 EGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIEL 484
           +G+     L TSL+DMYAKCG +E A +VF S+  + +  W+AMI G  MHGR  A+ +L
Sbjct: 431 KGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDL 490

Query: 485 FFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTCMVDILG 544
           F +M++  ++P+D+TF  +L ACSH+ ++D GR  F  M   Y + P  +HY CM+D+LG
Sbjct: 491 FSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLG 550

Query: 545 RAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRNHGAIVL 604
            +G  +EA E+IN M + P   IW +LL AC++H N+EL E  +  L+K+EP N G+ VL
Sbjct: 551 HSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVL 610

Query: 605 LSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLSREIYSK 664
           LSNIYA  GRW++V++ R L+ D  +KK PGCSS+E+D  V EF++GD  HP +REIY  
Sbjct: 611 LSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGM 670

Query: 665 LDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQPIRVVK 724
           L+E+   L+  G+ P+ S +LQ +EE + KE AL  HSEKLAIAFGLIS  P   + +VK
Sbjct: 671 LEEMEVLLEKAGFVPDTSEVLQEMEE-EWKEGALRHHSEKLAIAFGLISTKPGTKLTIVK 730

Query: 725 NLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW 732
           NLR+C +CHE  KLIS++Y R+II RDR RFHHFRDG CSC D+W
Sbjct: 731 NLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Tan0021030 vs. TAIR 10
Match: AT4G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 546.6 bits (1407), Expect = 3.1e-155
Identity = 277/713 (38.85%), Postives = 431/713 (60.45%), Query Frame = 0

Query: 26  ILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSALSSFSTLEYARQMFDQI-S 85
           IL +++ C S   +KQ+HA +LRT    +    S L  +S  SS   L YA  +F  I S
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 86  QPNLYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNNFTFPFVIKAASELKASRVGR 145
            P    +N  +R  + SS+P ++ + F   +       + F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 146 AVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEGISSKDVVSWNSMISAFAQGN 205
            +HG+A K++   D ++    +  Y  CG +N A  +F+ +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 206 CPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCN 265
             + A  LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++   ++++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 266 AMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVKEIAA 325
           A++ MY   G +  A++ F +M  R++F  T M+ GY+K G  D A+ +FD    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 326 WNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 385
           W  +ISAY ++  P+EAL  F E+  S I KPD V++ S +SACA LG +D   W+H  I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 386 KREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAI 445
              G++    +  +L++MYAKCG L+   +VF  +  R+V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 446 ELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTCMVDI 505
            LF +M++  V+PN+VTF  VL  CSH+ LV+EG+  F  M   Y + P  +HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 506 LGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRNHGAI 565
            GRA  L EA+E+I  MPV  +  IWG+L+ ACR+H  +EL + A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 566 VLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLSREIY 625
           VL+SNIYA+  RW+ V  +R++M +  + KE G S ++ +G   EFL+GD  H  S EIY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 626 SKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQP--- 685
           +KLDE+ +KLK  GY P+   +L  +EE++ K+  L  HSEKLA+ FGL++    +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 686 ---IRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW 732
              IR+VKNLR+C DCH   KL+S+VY R+II+RDR RFH +++G CSC D+W
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of Tan0021030 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 532.7 bits (1371), Expect = 4.6e-151
Identity = 275/706 (38.95%), Postives = 426/706 (60.34%), Query Frame = 0

Query: 28  SRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSALSSFSTLEYARQMFDQISQPN 87
           S I+  +   QLKQ+HA++L  GL F  F  +KL  + A SSF  + +ARQ+FD + +P 
Sbjct: 26  SLIDSATHKAQLKQIHARLLVLGLQFSGFLITKL--IHASSSFGDITFARQVFDDLPRPQ 85

Query: 88  LYTWNTLIRAYASSSDPFQSFVTFLDLLDKSDDLPNNFTFPFVIKAASELKASRVGRAVH 147
           ++ WN +IR Y S ++ FQ  +     +  +   P++FTFP ++KA S L   ++GR VH
Sbjct: 86  IFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVH 145

Query: 148 GMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLFEG--ISSKDVVSWNSMISAFAQGNC 207
               ++ F  D+++ N L+  Y  C  L  A  +FEG  +  + +VSW +++SA+AQ   
Sbjct: 146 AQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGE 205

Query: 208 PEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 267
           P  AL++F +M + +V P+ V +V VL+A     DL+ GR + + + +  ++++  L  +
Sbjct: 206 PMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLIS 265

Query: 268 MLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVKEIAAW 327
           +  MY KCG +A A+ LFD+M   ++  W  M+ GYAK                      
Sbjct: 266 LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK---------------------- 325

Query: 328 NVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 387
                    NG  +EA+  F+E+ ++K  +PD +++ S +SACAQ+G+++    ++ Y+ 
Sbjct: 326 ---------NGYAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVG 385

Query: 388 REGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIE 447
           R     +  + ++L+DM+AKCG++E A  VF    +RDV VWSAMI G G+HGR + AI 
Sbjct: 386 RSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAIS 445

Query: 448 LFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTCMVDIL 507
           L+  M+   V PNDVTF  +L AC+H+ +V EG  FF+ M   + + P  +HY C++D+L
Sbjct: 446 LYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 505

Query: 508 GRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRNHGAIV 567
           GRAG L++A E+I  MPV P  ++WGALL AC+ H ++EL E A+ QL  ++P N G  V
Sbjct: 506 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 565

Query: 568 LLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLSREIYS 627
            LSN+YA    WD+V+E+R  M++  L K+ GCS VEV G ++ F VGD SHP   EI  
Sbjct: 566 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 625

Query: 628 KLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQPIRVV 687
           +++ I ++LK  G+  NK   L  + +++  E+ L  HSE++AIA+GLIS     P+R+ 
Sbjct: 626 QVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHSERIAIAYGLISTPQGTPLRIT 685

Query: 688 KNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW 732
           KNLR C +CH   KLIS++  R+I++RD  RFHHF+DG CSC D+W
Sbjct: 686 KNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of Tan0021030 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 528.9 bits (1361), Expect = 6.6e-150
Identity = 277/711 (38.96%), Postives = 412/711 (57.95%), Query Frame = 0

Query: 26  ILSRINQCSSAKQLKQVHAQMLRTGLFFDPFSASKLLTVSALSSF--STLEYARQMFDQI 85
           +  +IN C + + L Q+HA  +++G   D  +A+++L   A S      L+YA ++F+Q+
Sbjct: 26  LFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQM 85

Query: 86  SQPNLYTWNTLIRAYASSSD--PFQSFVTFLDLLDKSDDLPNNFTFPFVIKAASELKASR 145
            Q N ++WNT+IR ++ S +     +   F +++      PN FTFP V+KA ++    +
Sbjct: 86  PQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQ 145

Query: 146 VGRAVHGMAIKMSFGMDLYILNSLVRFYGVCGDLNMAERLF-EGISSKDVVSWNSMISAF 205
            G+ +HG+A+K  FG D +++++LVR Y +CG +  A  LF + I  KD+V         
Sbjct: 146 EGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMV--------- 205

Query: 206 AQGNCPEGALDLFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDL 265
                                                                       
Sbjct: 206 ------------------------------------------------------------ 265

Query: 266 TLCNAMLDMYTKCGSIADAQKLFDEMPERDVFSWTTMLDGYAKTGDFDTARQVFDAMPVK 325
                M D   + G               ++  W  M+DGY + GD   AR +FD M  +
Sbjct: 266 ----VMTDRRKRDG---------------EIVLWNVMIDGYMRLGDCKAARMLFDKMRQR 325

Query: 326 EIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWI 385
            + +WN +IS Y  NG  K+A+  F E++   I +P+ VTLVS L A ++LG+++LG W+
Sbjct: 326 SVVSWNTMISGYSLNGFFKDAVEVFREMKKGDI-RPNYVTLVSVLPAISRLGSLELGEWL 385

Query: 386 HVYIKREGIDLNCHLITSLVDMYAKCGALEKALEVFYSVEERDVYVWSAMIAGLGMHGRG 445
           H+Y +  GI ++  L ++L+DMY+KCG +EKA+ VF  +   +V  WSAMI G  +HG+ 
Sbjct: 386 HLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQA 445

Query: 446 KAAIELFFKMQEAKVKPNDVTFTNVLCACSHARLVDEGRAFFHEMEPVYGVVPGTKHYTC 505
             AI+ F KM++A V+P+DV + N+L ACSH  LV+EGR +F +M  V G+ P  +HY C
Sbjct: 446 GDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGC 505

Query: 506 MVDILGRAGFLEEAMELINEMPVTPSASIWGALLGACRLHMNIELAELASNQLLKLEPRN 565
           MVD+LGR+G L+EA E I  MP+ P   IW ALLGACR+  N+E+ +  +N L+ + P +
Sbjct: 506 MVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHD 565

Query: 566 HGAIVLLSNIYAKTGRWDKVSELRKLMRDSELKKEPGCSSVEVDGNVQEFLVGDNSHPLS 625
            GA V LSN+YA  G W +VSE+R  M++ +++K+PGCS +++DG + EF+V D+SHP +
Sbjct: 566 SGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKA 625

Query: 626 REIYSKLDEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLISLTPSQ 685
           +EI S L EI+ KL+  GY P  + +L  +EE+D KE  L  HSEK+A AFGLIS +P +
Sbjct: 626 KEINSMLVEISDKLRLAGYRPITTQVLLNLEEED-KENVLHYHSEKIATAFGLISTSPGK 646

Query: 686 PIRVVKNLRICGDCHEVAKLISRVYCRDIILRDRYRFHHFRDGHCSCMDFW 732
           PIR+VKNLRIC DCH   KLIS+VY R I +RDR RFHHF+DG CSCMD+W
Sbjct: 686 PIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
O823803.3e-27161.52Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9LN018.7e-17141.57Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
O233374.3e-15438.85Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX... [more]
Q9LTV86.4e-15038.95Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9FI809.3e-14938.96Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_038893523.10.0e+0091.88pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa ... [more]
XP_022964665.10.0e+0089.57pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita ... [more]
KAA0031814.10.0e+0089.68pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_022150874.10.0e+0089.15pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Momordica ... [more]
XP_008457379.10.0e+0089.55PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ... [more]
Match NameE-valueIdentityDescription
A0A6J1HLG40.0e+0089.57pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbit... [more]
A0A5A7SKX20.0e+0089.68Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1DBC50.0e+0089.15pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Momordic... [more]
A0A5D3BBW60.0e+0089.55Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3C6230.0e+0089.55pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis ... [more]
Match NameE-valueIdentityDescription
AT2G29760.12.3e-27261.52Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.16.1e-17241.57Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G14820.13.1e-15538.85Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.14.6e-15138.95mitochondrial editing factor 22 [more]
AT5G48910.16.6e-15038.96Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 569..590
e-value: 0.99
score: 9.8
coord: 324..350
e-value: 1.7E-5
score: 24.7
coord: 497..521
e-value: 6.5E-5
score: 22.9
coord: 89..111
e-value: 0.21
score: 11.9
coord: 292..320
e-value: 5.0E-6
score: 26.4
coord: 263..290
e-value: 1.4E-4
score: 21.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 292..319
e-value: 1.3E-6
score: 26.2
coord: 460..493
e-value: 0.0013
score: 16.8
coord: 263..291
e-value: 0.0012
score: 16.9
coord: 191..224
e-value: 7.5E-7
score: 26.9
coord: 324..357
e-value: 3.6E-4
score: 18.5
coord: 425..458
e-value: 1.1E-6
score: 26.4
coord: 498..521
e-value: 5.5E-4
score: 17.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 423..470
e-value: 8.3E-10
score: 38.7
coord: 188..237
e-value: 8.4E-11
score: 41.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 423..457
score: 10.851745
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 259..289
score: 9.799459
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 290..324
score: 11.366925
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 189..223
score: 11.695765
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 596..721
e-value: 1.2E-35
score: 122.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 13..117
e-value: 1.9E-7
score: 32.8
coord: 324..486
e-value: 5.5E-36
score: 126.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 487..609
e-value: 1.6E-12
score: 49.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 136..242
e-value: 6.9E-21
score: 76.4
coord: 243..323
e-value: 1.7E-19
score: 71.8
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 301..580
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 173..304
coord: 10..182
coord: 302..717
NoneNo IPR availablePANTHERPTHR47926:SF99BNAANNG32650D PROTEINcoord: 173..304
coord: 10..182
coord: 302..717

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0021030.1Tan0021030.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding