Tan0002208 (gene) Snake gourd v1

Overview
Name: Tan0002208
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG04: 10180341 .. 10182377 (+)
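
The Location field above can be read programmatically. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'LG04: 10180341 .. 10182377 (+)' style string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("LG04: 10180341 .. 10182377 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that coordinate assumption, the feature spans 2037 bp on the plus strand of LG04.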
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCATAAGTTATTAATTCCTTCCAACTCAAAGTGCTTGCTTAGCTTGATCGCTTGCGCTCCGACCAAATCCCATGAAGCGCGAAATAGCGCGATTGGTGGCGAATGGACTTTACATCGAAGCACTCTCCACGTACTCACAGCACCACTCGGCCTCCCTTCGTCCTCACAATTTCATTTTTCCTCCTCTCTTCAAAGCATGCGCAAAGCTTAACTCAGTTCCACAAGGCCAAATGCTGCACACCCACTTGATTAAAACGGGGTTTCAAGCAGACGTCTATGCAGCAACAGCCCTTACAGATATGTATTTGAAGCTTCTTCATTTTGACGATGCTCTAAAGATGTTCGACGAAATGCCCCACAGAAATTTGGCATCCTTAAACGCCATGATTTCTGTGTTTTTGTTGAATGGTTCACGTGGAGAAGCACTACGGATGGTTGAGCTTGTGAATCGTGGTTCATTAAGGCCTAGTTCAATTACCATTGTTAACTTTCTGTCAGCATGCGCAAGTGTTGCTCGTGGGATGCAAATACATTGTTGGGCGATAAAACTGGGTTTTCTAATGGACGTGTATGTCGCTACAGCGCTCTTGACGATGTATTCTACTTGTGATGAAGTGAGTTTCGCTGCTAAGGTATTTCAGGAGTTGTCGAACAAAAATGTGGTGAGTTTTAATGCATATATTTCAGGGCTTCTACTGAATGGCATGCCTCGTATGGTATTGGATGTGTTTAAGAGCATGATGGAATGGCTATGTGGGAAACCAAATTCAGTCACATTGGCTTCTGTCCTTTCTGCTTGTTCTAATATTTCATATCTTGAATTTGGTAGGCAGGTTCATGCACTCACTGTTAAAATTGATAGCGATGATGTAATGGTAGGAACTGCAATGGTGGACATGTATTCTAAATGTGGGGCTTGGCAGTGGGCACACAATGTTTTCAACGAGCGGAAAGGAACCAGTAACTTATTTACTTGGAATTCATTGATTGCTGGAATGATGTTAAATGATAAGAGTGAGATTGCAATGGAACTTTTTGAATCGTTGGAGTCCCACAAATTGCAACCAGATTCAGCTACCTGGAATTCAATGATTAGTGGTCTTGCCCAATTGGGTCGGCCGATTGAGGCCTTCAAGTACTTTCATAGAATGCAATCGGCTGGCACAGTCCCTAGTTTAAAATCTCTCACCAGTCTTTTATCCATTTGTTCAGATTTGTCTGCATTGCGGCATGGCAAAGAAATCCATGCCCAAGCAACTAAAAGCATCATTCATATGGATGTGTTTCTTGCCACGGCACTTATTGACATGTACATGAAGTGTGGTCATCTTTCTTCGGCACGAAGACTCTTCGATCAATTTGACTTCAAACCAAAAGATACAATATTCTGGAACGCAATGATCTCAGGTTACGGAAGAAATGGAGACAGCAAATCTGCGTTTGATATCTTTGATCGAATGCTGGAAGAAAGAGTACAACCAAATGCAGCCACGTTTACGAGTCTCTTATCTATCTGCAGTCATTGCGGTCAGGTTCACAAAGGCTGGCAATTTTTCAAAATGATGAACACGAAATACGACCTGCAGCCAGCTCGTGACCATTATAACTGCATGATAGACATTTTGGGTAGAGCTGGCCAGTTGGGAAAAGCTCGAAAATTGTTGGCAGAATTGCCAGAGCCCTCTATGTCTTCTCTTGCTTCTTTGCTTGGTGCCTGTAGTCTGCACAAAGATTCTAAACTAGGCGAAGAAATGGCTTTAAGAATTTCAGAATTAGAGCCGAAGAACCCACTTCCATTTGTTATGTTATCCAATATATATGCTAAGCTTGGAAGGTGGGAAGATGTTGAAAGGGTCAGAGAAATGATGAATGATAAAGGATTGAAAAAACTATCTGGCATTAGTTCAATAGAATGGCCTAAGAGTAAGTACTTAAGCAGAGTAATTAGTATACTTTTGATGTCACACCAGAAGTTGCAACTCTTTTACATCTTTCATTT
CTTTCATTCGTTAAGGAAAACTTATAAATATCACTGA

mRNA sequence

CTCATAAGTTATTAATTCCTTCCAACTCAAAGTGCTTGCTTAGCTTGATCGCTTGCGCTCCGACCAAATCCCATGAAGCGCGAAATAGCGCGATTGGTGGCGAATGGACTTTACATCGAAGCACTCTCCACGTACTCACAGCACCACTCGGCCTCCCTTCGTCCTCACAATTTCATTTTTCCTCCTCTCTTCAAAGCATGCGCAAAGCTTAACTCAGTTCCACAAGGCCAAATGCTGCACACCCACTTGATTAAAACGGGGTTTCAAGCAGACGTCTATGCAGCAACAGCCCTTACAGATATGTATTTGAAGCTTCTTCATTTTGACGATGCTCTAAAGATGTTCGACGAAATGCCCCACAGAAATTTGGCATCCTTAAACGCCATGATTTCTGTGTTTTTGTTGAATGGTTCACGTGGAGAAGCACTACGGATGGTTGAGCTTGTGAATCGTGGTTCATTAAGGCCTAGTTCAATTACCATTGTTAACTTTCTGTCAGCATGCGCAAGTGTTGCTCGTGGGATGCAAATACATTGTTGGGCGATAAAACTGGGTTTTCTAATGGACGTGTATGTCGCTACAGCGCTCTTGACGATGTATTCTACTTGTGATGAAGTGAGTTTCGCTGCTAAGGTATTTCAGGAGTTGTCGAACAAAAATGTGGTGAGTTTTAATGCATATATTTCAGGGCTTCTACTGAATGGCATGCCTCGTATGGTATTGGATGTGTTTAAGAGCATGATGGAATGGCTATGTGGGAAACCAAATTCAGTCACATTGGCTTCTGTCCTTTCTGCTTGTTCTAATATTTCATATCTTGAATTTGGTAGGCAGGTTCATGCACTCACTGTTAAAATTGATAGCGATGATGTAATGGTAGGAACTGCAATGGTGGACATGTATTCTAAATGTGGGGCTTGGCAGTGGGCACACAATGTTTTCAACGAGCGGAAAGGAACCAGTAACTTATTTACTTGGAATTCATTGATTGCTGGAATGATGTTAAATGATAAGAGTGAGATTGCAATGGAACTTTTTGAATCGTTGGAGTCCCACAAATTGCAACCAGATTCAGCTACCTGGAATTCAATGATTAGTGGTCTTGCCCAATTGGGTCGGCCGATTGAGGCCTTCAAGTACTTTCATAGAATGCAATCGGCTGGCACAGTCCCTAGTTTAAAATCTCTCACCAGTCTTTTATCCATTTGTTCAGATTTGTCTGCATTGCGGCATGGCAAAGAAATCCATGCCCAAGCAACTAAAAGCATCATTCATATGGATGTGTTTCTTGCCACGGCACTTATTGACATGTACATGAAGTGTGGTCATCTTTCTTCGGCACGAAGACTCTTCGATCAATTTGACTTCAAACCAAAAGATACAATATTCTGGAACGCAATGATCTCAGGTTACGGAAGAAATGGAGACAGCAAATCTGCGTTTGATATCTTTGATCGAATGCTGGAAGAAAGAGTACAACCAAATGCAGCCACGTTTACGAGTCTCTTATCTATCTGCAGTCATTGCGGTCAGGTTCACAAAGGCTGGCAATTTTTCAAAATGATGAACACGAAATACGACCTGCAGCCAGCTCGTGACCATTATAACTGCATGATAGACATTTTGGGTAGAGCTGGCCAGTTGGGAAAAGCTCGAAAATTGTTGGCAGAATTGCCAGAGCCCTCTATGTCTTCTCTTGCTTCTTTGCTTGGTGCCTGTAGTCTGCACAAAGATTCTAAACTAGGCGAAGAAATGGCTTTAAGAATTTCAGAATTAGAGCCGAAGAACCCACTTCCATTTGTTATGTTATCCAATATATATGCTAAGCTTGGAAGGTGGGAAGATGTTGAAAGGGTCAGAGAAATGATGAATGATAAAGGATTGAAAAAACTATCTGGCATTAGTTCAATAGAATGGCCTAAGAGTAAGTACTTAAGCAGAGTAATTAGTATACTTTTGATGTCACACCAGAAGTTGCAACTCTTTTACATCTTTCATTT
CTTTCATTCGTTAAGGAAAACTTATAAATATCACTGA

Coding sequence (CDS)

ATGAAGCGCGAAATAGCGCGATTGGTGGCGAATGGACTTTACATCGAAGCACTCTCCACGTACTCACAGCACCACTCGGCCTCCCTTCGTCCTCACAATTTCATTTTTCCTCCTCTCTTCAAAGCATGCGCAAAGCTTAACTCAGTTCCACAAGGCCAAATGCTGCACACCCACTTGATTAAAACGGGGTTTCAAGCAGACGTCTATGCAGCAACAGCCCTTACAGATATGTATTTGAAGCTTCTTCATTTTGACGATGCTCTAAAGATGTTCGACGAAATGCCCCACAGAAATTTGGCATCCTTAAACGCCATGATTTCTGTGTTTTTGTTGAATGGTTCACGTGGAGAAGCACTACGGATGGTTGAGCTTGTGAATCGTGGTTCATTAAGGCCTAGTTCAATTACCATTGTTAACTTTCTGTCAGCATGCGCAAGTGTTGCTCGTGGGATGCAAATACATTGTTGGGCGATAAAACTGGGTTTTCTAATGGACGTGTATGTCGCTACAGCGCTCTTGACGATGTATTCTACTTGTGATGAAGTGAGTTTCGCTGCTAAGGTATTTCAGGAGTTGTCGAACAAAAATGTGGTGAGTTTTAATGCATATATTTCAGGGCTTCTACTGAATGGCATGCCTCGTATGGTATTGGATGTGTTTAAGAGCATGATGGAATGGCTATGTGGGAAACCAAATTCAGTCACATTGGCTTCTGTCCTTTCTGCTTGTTCTAATATTTCATATCTTGAATTTGGTAGGCAGGTTCATGCACTCACTGTTAAAATTGATAGCGATGATGTAATGGTAGGAACTGCAATGGTGGACATGTATTCTAAATGTGGGGCTTGGCAGTGGGCACACAATGTTTTCAACGAGCGGAAAGGAACCAGTAACTTATTTACTTGGAATTCATTGATTGCTGGAATGATGTTAAATGATAAGAGTGAGATTGCAATGGAACTTTTTGAATCGTTGGAGTCCCACAAATTGCAACCAGATTCAGCTACCTGGAATTCAATGATTAGTGGTCTTGCCCAATTGGGTCGGCCGATTGAGGCCTTCAAGTACTTTCATAGAATGCAATCGGCTGGCACAGTCCCTAGTTTAAAATCTCTCACCAGTCTTTTATCCATTTGTTCAGATTTGTCTGCATTGCGGCATGGCAAAGAAATCCATGCCCAAGCAACTAAAAGCATCATTCATATGGATGTGTTTCTTGCCACGGCACTTATTGACATGTACATGAAGTGTGGTCATCTTTCTTCGGCACGAAGACTCTTCGATCAATTTGACTTCAAACCAAAAGATACAATATTCTGGAACGCAATGATCTCAGGTTACGGAAGAAATGGAGACAGCAAATCTGCGTTTGATATCTTTGATCGAATGCTGGAAGAAAGAGTACAACCAAATGCAGCCACGTTTACGAGTCTCTTATCTATCTGCAGTCATTGCGGTCAGGTTCACAAAGGCTGGCAATTTTTCAAAATGATGAACACGAAATACGACCTGCAGCCAGCTCGTGACCATTATAACTGCATGATAGACATTTTGGGTAGAGCTGGCCAGTTGGGAAAAGCTCGAAAATTGTTGGCAGAATTGCCAGAGCCCTCTATGTCTTCTCTTGCTTCTTTGCTTGGTGCCTGTAGTCTGCACAAAGATTCTAAACTAGGCGAAGAAATGGCTTTAAGAATTTCAGAATTAGAGCCGAAGAACCCACTTCCATTTGTTATGTTATCCAATATATATGCTAAGCTTGGAAGGTGGGAAGATGTTGAAAGGGTCAGAGAAATGATGAATGATAAAGGATTGAAAAAACTATCTGGCATTAGTTCAATAGAATGGCCTAAGAGTAAGTACTTAAGCAGAGTAATTAGTATACTTTTGATGTCACACCAGAAGTTGCAACTCTTTTACATCTTTCATTTCTTTCATTCGTTAAGGAAAACTTATAAATATCACTGA

Protein sequence

MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALRMVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVLSACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLFTWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRMQSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHLSSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLSICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMSSLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMNDKGLKKLSGISSIEWPKSKYLSRVISILLMSHQKLQLFYIFHFFHSLRKTYKYH
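
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code and a CDS that starts in frame 0; `translate` and `CODON_TABLE` are illustrative names, and unknown codons are reported as `X` by choice.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS listed above.
print(translate("ATGAAGCGCGAAATAGCGCGATTGGTGGCGAATGGA"))
```

The first twelve codons of the CDS translate to `MKREIARLVANG`, which matches the start of the protein sequence listed above.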
Homology
BLAST of Tan0002208 vs. ExPASy Swiss-Prot
Match: Q1PFA6 (Pentatricopeptide repeat-containing protein At2g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E22 PE=2 SV=2)

HSP 1 Score: 628.6 bits (1620), Expect = 7.7e-179
Identity = 304/608 (50.00%), Positives = 434/608 (71.38%), Query Frame = 0

Query: 5   IARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGF 64
           ++ LV  G  ++ + ++S        P+ F FPPL K+CAKL  V QG++LH  ++KTGF
Sbjct: 11  VSNLVTGGTSLDVILSHS--------PNKFTFPPLLKSCAKLGDVVQGRILHAQVVKTGF 70

Query: 65  QADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALRMVEL 124
             DV+ ATAL  MY+K+    DALK+ DEMP R +AS+NA +S  L NG   +A RM   
Sbjct: 71  FVDVFTATALVSMYMKVKQVTDALKVLDEMPERGIASVNAAVSGLLENGFCRDAFRMFGD 130

Query: 125 VNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEVSF 184
                   +S+T+ + L  C  +  GMQ+HC A+K GF M+VYV T+L++MYS C E   
Sbjct: 131 ARVSGSGMNSVTVASVLGGCGDIEGGMQLHCLAMKSGFEMEVYVGTSLVSMYSRCGEWVL 190

Query: 185 AAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVLSACS 244
           AA++F+++ +K+VV++NA+ISGL+ NG+  +V  VF  M ++   +PN VT  + ++AC+
Sbjct: 191 AARMFEKVPHKSVVTYNAFISGLMENGVMNLVPSVFNLMRKFSSEEPNDVTFVNAITACA 250

Query: 245 NISYLEFGRQVHALTVKIDSD-DVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLFTWN 304
           ++  L++GRQ+H L +K +   + MVGTA++DMYSKC  W+ A+ VF E K T NL +WN
Sbjct: 251 SLLNLQYGRQLHGLVMKKEFQFETMVGTALIDMYSKCRCWKSAYIVFTELKDTRNLISWN 310

Query: 305 SLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRMQSA 364
           S+I+GMM+N + E A+ELFE L+S  L+PDSATWNS+ISG +QLG+ IEAFK+F RM S 
Sbjct: 311 SVISGMMINGQHETAVELFEKLDSEGLKPDSATWNSLISGFSQLGKVIEAFKFFERMLSV 370

Query: 365 GTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHLSSA 424
             VPSLK LTSLLS CSD+  L++GKEIH    K+    D+F+ T+LIDMYMKCG  S A
Sbjct: 371 VMVPSLKCLTSLLSACSDIWTLKNGKEIHGHVIKAAAERDIFVLTSLIDMYMKCGLSSWA 430

Query: 425 RRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLSICS 484
           RR+FD+F+ KPKD +FWN MISGYG++G+ +SA +IF+ + EE+V+P+ ATFT++LS CS
Sbjct: 431 RRIFDRFEPKPKDPVFWNVMISGYGKHGECESAIEIFELLREEKVEPSLATFTAVLSACS 490

Query: 485 HCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMSSLA 544
           HCG V KG Q F++M  +Y  +P+ +H  CMID+LGR+G+L +A++++ ++ EPS S  +
Sbjct: 491 HCGNVEKGSQIFRLMQEEYGYKPSTEHIGCMIDLLGRSGRLREAKEVIDQMSEPSSSVYS 550

Query: 545 SLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMNDKG 604
           SLLG+C  H D  LGEE A++++ELEP+NP PFV+LS+IYA L RWEDVE +R++++ K 
Sbjct: 551 SLLGSCRQHLDPVLGEEAAMKLAELEPENPAPFVILSSIYAALERWEDVESIRQVIDQKQ 610

Query: 605 LKKLSGIS 612
           L KL G+S
Sbjct: 611 LVKLPGLS 610
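
The percentages on the statistics line of each HSP are the match counts divided by the alignment length. A minimal Python sketch of that arithmetic follows, assuming the `label = matches/length` layout shown in these reports; `hsp_percentages` is a hypothetical helper, not a BLAST API.

```python
import re

def hsp_percentages(stats_line):
    """Recompute percentages from 'label = matches/length' pairs in a stats line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 304/608 (50.00%), Positives = 434/608 (71.38%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Q1PFA6 hit, this reproduces the reported 50.00% identity and 71.38% positives over a 608-column alignment.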

BLAST of Tan0002208 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 361.7 bits (927), Expect = 1.8e-98
Identity = 209/604 (34.60%), Positives = 333/604 (55.13%), Query Frame = 0

Query: 16  EALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGFQADVYAATALT 75
           +AL  + +     + P  + F  L K C     +  G+ +H  L+K+GF  D++A T L 
Sbjct: 118 KALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLE 177

Query: 76  DMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALRMVELVNRGSLRPSSI 135
           +MY K    ++A K+FD MP R+L S N +++ +  NG    AL MV+ +   +L+PS I
Sbjct: 178 NMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFI 237

Query: 136 TIVNFL---SACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEVSFAAKVFQEL 195
           TIV+ L   SA   ++ G +IH +A++ GF   V ++TAL+ MY+ C  +  A ++F  +
Sbjct: 238 TIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGM 297

Query: 196 SNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVLSACSNISYLEFG 255
             +NVVS+N+ I   + N  P+  + +F+ M++    KP  V++   L AC+++  LE G
Sbjct: 298 LERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGV-KPTDVSVMGALHACADLGDLERG 357

Query: 256 RQVHALTVKIDSD-DVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLFTWNSLIAGMML 315
           R +H L+V++  D +V V  +++ MY KC                               
Sbjct: 358 RFIHKLSVELGLDRNVSVVNSLISMYCKC------------------------------- 417

Query: 316 NDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRMQSAGTVPSLKS 375
             + + A  +F  L+S  L     +WN+MI G AQ GRPI+A  YF +M+S    P   +
Sbjct: 418 -KEVDTAASMFGKLQSRTL----VSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFT 477

Query: 376 LTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHLSSARRLFDQFD 435
             S+++  ++LS   H K IH    +S +  +VF+ TAL+DMY KCG +  AR +FD   
Sbjct: 478 YVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMS 537

Query: 436 FKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLSICSHCGQVHKG 495
            +   T  WNAMI GYG +G  K+A ++F+ M +  ++PN  TF S++S CSH G V  G
Sbjct: 538 ERHVTT--WNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAG 597

Query: 496 WQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELP-EPSMSSLASLLGACS 555
            + F MM   Y ++ + DHY  M+D+LGRAG+L +A   + ++P +P+++   ++LGAC 
Sbjct: 598 LKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQ 657

Query: 556 LHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMNDKGLKKLSGI 615
           +HK+    E+ A R+ EL P +    V+L+NIY     WE V +VR  M  +GL+K  G 
Sbjct: 658 IHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGC 682

BLAST of Tan0002208 vs. ExPASy Swiss-Prot
Match: O04659 (Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E14 PE=2 SV=2)

HSP 1 Score: 358.6 bits (919), Expect = 1.5e-97
Identity = 206/613 (33.61%), Positives = 336/613 (54.81%), Query Frame = 0

Query: 11  NGLYIEALSTYSQHHSASL-RPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGFQADVY 70
           N ++ + L  + +  + S+  P +F FP + KA   L     G+M+HT ++K+G+  DV 
Sbjct: 84  NSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREFLGRMIHTLVVKSGYVCDVV 143

Query: 71  AATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALRMVELVNRGS 130
            A++L  MY K   F+++L++FDEMP R++AS N +IS F  +G   +AL +   +    
Sbjct: 144 VASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFYQSGEAEKALELFGRMESSG 203

Query: 131 LRPSSITIVNFLSACAS---VARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEVSFAA 190
             P+S+++   +SAC+    + RG +IH   +K GF +D YV +AL+ MY  CD +  A 
Sbjct: 204 FEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEYVNSALVDMYGKCDCLEVAR 263

Query: 191 KVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCG-KPNSVTLASVLSACSN 250
           +VFQ++  K++V++N+ I G +  G  +  +++   M+  + G +P+  TL S+L ACS 
Sbjct: 264 EVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMI--IEGTRPSQTTLTSILMACSR 323

Query: 251 ISYLEFGRQVHALTVK-IDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLFTWNS 310
              L  G+ +H   ++ + + D+ V  +++D+Y KCG    A  VF+             
Sbjct: 324 SRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAETVFS------------- 383

Query: 311 LIAGMMLNDKSEIAMELFESLESHKLQPDSA-TWNSMISGLAQLGRPIEAFKYFHRMQSA 370
                                   K Q D A +WN MIS    +G   +A + + +M S 
Sbjct: 384 ------------------------KTQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSV 443

Query: 371 GTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHLSSA 430
           G  P + + TS+L  CS L+AL  GK+IH   ++S +  D  L +AL+DMY KCG+   A
Sbjct: 444 GVKPDVVTFTSVLPACSQLAALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEA 503

Query: 431 RRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLSICS 490
            R+F+      KD + W  MIS YG +G  + A   FD M +  ++P+  T  ++LS C 
Sbjct: 504 FRIFN--SIPKKDVVSWTVMISAYGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACG 563

Query: 491 HCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMSS-- 550
           H G + +G +FF  M +KY ++P  +HY+CMIDILGRAG+L +A +++ + PE S ++  
Sbjct: 564 HAGLIDEGLKFFSQMRSKYGIEPIIEHYSCMIDILGRAGRLLEAYEIIQQTPETSDNAEL 623

Query: 551 LASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMND 610
           L++L  AC LH +  LG+ +A  + E  P +   +++L N+YA    W+   RVR  M +
Sbjct: 624 LSTLFSACCLHLEHSLGDRIARLLVENYPDDASTYMVLFNLYASGESWDAARRVRLKMKE 655

Query: 611 KGLKKLSGISSIE 615
            GL+K  G S IE
Sbjct: 684 MGLRKKPGCSWIE 655

BLAST of Tan0002208 vs. ExPASy Swiss-Prot
Match: Q0WN60 (Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H8 PE=2 SV=2)

HSP 1 Score: 351.3 bits (900), Expect = 2.4e-95
Identity = 227/686 (33.09%), Positives = 348/686 (50.73%), Query Frame = 0

Query: 11  NGLYIEALSTYSQHHSAS-LRPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGFQADVY 70
           N LY E L T+ +  S + L P +F +P + KACA ++ V  G  +H  ++KTG   DV+
Sbjct: 164 NELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGIGLAVHGLVVKTGLVEDVF 223

Query: 71  AATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEAL----RMVELV 130
              AL   Y       DAL++FD MP RNL S N+MI VF  NG   E+      M+E  
Sbjct: 224 VGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDNGFSEESFLLLGEMMEEN 283

Query: 131 NRGSLRPSSITIVNFLSACA---SVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEV 190
             G+  P   T+V  L  CA    +  G  +H WA+KL    ++ +  AL+ MYS C  +
Sbjct: 284 GDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKELVLNNALMDMYSKCGCI 343

Query: 191 SFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCG----KPNSVTLAS 250
           + A  +F+  +NKNVVS+N  + G    G      DV + M   L G    K + VT+ +
Sbjct: 344 TNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQM---LAGGEDVKADEVTILN 403

Query: 251 VLSACSNISYLEFGRQVHALTVKID-SDDVMVGTAMVDMYSKCGAWQWAHNVFNE-RKGT 310
            +  C + S+L   +++H  ++K +   + +V  A V  Y+KCG+  +A  VF+  R  T
Sbjct: 404 AVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGSLSYAQRVFHGIRSKT 463

Query: 311 SNLFTWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSAT-------------------- 370
            N  +WN+LI G   ++   ++++    ++   L PDS T                    
Sbjct: 464 VN--SWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSACSKLKSLRLGKEV 523

Query: 371 ----------------------------------------------WNSMISGLAQLGRP 430
                                                         WN++I+G  Q G P
Sbjct: 524 HGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSWNTVITGYLQNGFP 583

Query: 431 IEAFKYFHRMQSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATAL 490
             A   F +M   G      S+  +   CS L +LR G+E HA A K ++  D F+A +L
Sbjct: 584 DRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALKHLLEDDAFIACSL 643

Query: 491 IDMYMKCGHLSSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQP 550
           IDMY K G ++ + ++F+    K K T  WNAMI GYG +G +K A  +F+ M      P
Sbjct: 644 IDMYAKNGSITQSSKVFN--GLKEKSTASWNAMIMGYGIHGLAKEAIKLFEEMQRTGHNP 703

Query: 551 NAATFTSLLSICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKL 610
           +  TF  +L+ C+H G +H+G ++   M + + L+P   HY C+ID+LGRAGQL KA ++
Sbjct: 704 DDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGRAGQLDKALRV 763

Query: 611 LAE--LPEPSMSSLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGR 615
           +AE    E  +    SLL +C +H++ ++GE++A ++ ELEP+ P  +V+LSN+YA LG+
Sbjct: 764 VAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYVLLSNLYAGLGK 823

BLAST of Tan0002208 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 1.2e-94
Identity = 211/643 (32.81%), Positives = 339/643 (52.72%), Query Frame = 0

Query: 36  FPPLFKACAKLN-SVPQGQMLHTHLIKTGFQADVYAATALTDMYLKLLHFDDALKMFDEM 95
           F  L  +C K   S    + +H  +IK+GF  +++    L D Y K    +D  ++FD+M
Sbjct: 22  FAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKM 81

Query: 96  PHRNL-------------------------------ASLNAMISVFLLNGSRGEALRMVE 155
           P RN+                                + N+M+S F  +    EAL    
Sbjct: 82  PQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFA 141

Query: 156 LVNRGSLRPSSITIVNFLSACA---SVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 215
           ++++     +  +  + LSAC+    + +G+Q+H    K  FL DVY+ +AL+ MYS C 
Sbjct: 142 MMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCG 201

Query: 216 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 275
            V+ A +VF E+ ++NVVS+N+ I+    NG     LDVF+ M+E    +P+ VTLASV+
Sbjct: 202 NVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRV-EPDEVTLASVI 261

Query: 276 SACSNISYLEFGRQVHALTVKIDS--DDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSN 335
           SAC+++S ++ G++VH   VK D   +D+++  A VDMY+KC   + A  +F+      N
Sbjct: 262 SACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMP-IRN 321

Query: 336 LFTWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFH 395
           +    S+I+G  +   ++ A  +F  +    +     +WN++I+G  Q G   EA   F 
Sbjct: 322 VIAETSMISGYAMAASTKAARLMFTKMAERNV----VSWNALIAGYTQNGENEEALSLFC 381

Query: 396 RMQSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATK------SIIHMDVFLATALID 455
            ++     P+  S  ++L  C+DL+ L  G + H    K      S    D+F+  +LID
Sbjct: 382 LLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLID 441

Query: 456 MYMKCGHLSSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNA 515
           MY+KCG +     +F +     +D + WNAMI G+ +NG    A ++F  MLE   +P+ 
Sbjct: 442 MYVKCGCVEEGYLVFRK--MMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDH 501

Query: 516 ATFTSLLSICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLA 575
            T   +LS C H G V +G  +F  M   + + P RDHY CM+D+LGRAG L +A+ ++ 
Sbjct: 502 ITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIE 561

Query: 576 ELP-EPSMSSLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWED 635
           E+P +P      SLL AC +H++  LG+ +A ++ E+EP N  P+V+LSN+YA+LG+WED
Sbjct: 562 EMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWED 621

BLAST of Tan0002208 vs. NCBI nr
Match: KAG7032196.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 537/614 (87.46%), Positives = 570/614 (92.83%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           M REI RLVA G Y+EA S YSQHHSASL  +NFIFPPLFKACAKLNSVPQGQMLHTHL+
Sbjct: 1   MNREIVRLVAKGFYVEAFSMYSQHHSASLSSYNFIFPPLFKACAKLNSVPQGQMLHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           K GF ADVYAATALTDMYLKL H DDALK+FDEMPHRN ASLNAMIS FLLNG RGEALR
Sbjct: 61  KVGFSADVYAATALTDMYLKLFHVDDALKVFDEMPHRNTASLNAMISGFLLNGFRGEALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           MVELVNRGSL+P+SITIVN LSACASVA GMQ+HCWAI LGF MDVYVATALLTMY TC+
Sbjct: 121 MVELVNRGSLKPNSITIVNLLSACASVAHGMQVHCWAINLGFQMDVYVATALLTMYCTCE 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
           ++ FAAKVFQE+SNKNVVSFNAYISGLLLNGMPRMV+DVFKSMME   GK NSVTL SVL
Sbjct: 181 KMGFAAKVFQEMSNKNVVSFNAYISGLLLNGMPRMVIDVFKSMMECPPGKSNSVTLVSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLF 300
           SACS++S+L FGRQVHALTVKID+DDVMVGTA+VDMYSKCGAWQ AH VFNERKGTSNLF
Sbjct: 241 SACSSLSHLGFGRQVHALTVKIDNDDVMVGTALVDMYSKCGAWQCAHKVFNERKGTSNLF 300

Query: 301 TWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRM 360
           TWNSLIAGMMLN++SEIA+ELFESLESHKLQPDSATWNSMISG A+LG+P++AFKYFHRM
Sbjct: 301 TWNSLIAGMMLNEQSEIAVELFESLESHKLQPDSATWNSMISGHARLGQPVKAFKYFHRM 360

Query: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHL 420
           QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQ TKS IHMDV LATALIDMYMKCG L
Sbjct: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQVTKSFIHMDVLLATALIDMYMKCGCL 420

Query: 421 SSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLS 480
           SSA+RLFDQF FKPKDTIFWNAMISGYG NG++KSAFDIFDRMLEE V PNAATFTSLLS
Sbjct: 421 SSAQRLFDQFGFKPKDTIFWNAMISGYGNNGENKSAFDIFDRMLEEGVHPNAATFTSLLS 480

Query: 481 ICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMS 540
           ICSHCGQV KGWQFFKMMN KYDLQP RDHYNCMIDILGRAGQLGKARKLL ELPEPSMS
Sbjct: 481 ICSHCGQVDKGWQFFKMMNKKYDLQPDRDHYNCMIDILGRAGQLGKARKLLEELPEPSMS 540

Query: 541 SLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMN 600
           SLASLLGACSLH+DSKLGEEMA RISELEPKNPLPFV+LSNIYA+LGRW+DVERVREMMN
Sbjct: 541 SLASLLGACSLHRDSKLGEEMASRISELEPKNPLPFVILSNIYAELGRWKDVERVREMMN 600

Query: 601 DKGLKKLSGISSIE 615
           DKGL+K SG+SSIE
Sbjct: 601 DKGLRKPSGLSSIE 614

BLAST of Tan0002208 vs. NCBI nr
Match: XP_022998231.1 (pentatricopeptide repeat-containing protein At2g02750 [Cucurbita maxima])

HSP 1 Score: 1077.8 bits (2786), Expect = 0.0e+00
Identity = 537/614 (87.46%), Positives = 566/614 (92.18%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           M REI RLVA G Y+EA S YSQHHSASL  HNFIFPPLFKACAKLNSVPQGQMLHTHL+
Sbjct: 1   MNREIVRLVAKGFYVEAFSMYSQHHSASLSSHNFIFPPLFKACAKLNSVPQGQMLHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           K GF ADVYAATALTDMYLKL H DDALK+FDEMPHRN ASLNAMIS FLLNG RGEALR
Sbjct: 61  KVGFSADVYAATALTDMYLKLFHVDDALKVFDEMPHRNTASLNAMISGFLLNGFRGEALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           MVELVNRGSL+P+SITIVN LSACASVA GMQ+HCWAI LGF MDVYVATALLTMY TC+
Sbjct: 121 MVELVNRGSLKPNSITIVNLLSACASVAHGMQVHCWAINLGFQMDVYVATALLTMYCTCE 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
           ++ FAAKVFQE+SNKNVVSFNAYISGLLLNGMPRMV+DVFKSMME   GK NSVTL SVL
Sbjct: 181 KMGFAAKVFQEMSNKNVVSFNAYISGLLLNGMPRMVIDVFKSMMERPPGKSNSVTLVSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLF 300
           SACS++S+L FGRQVHALTVKID+DDVMVGTA+VDMYSKCGAWQ AHNV NERKGTSNLF
Sbjct: 241 SACSSLSHLGFGRQVHALTVKIDNDDVMVGTALVDMYSKCGAWQCAHNVLNERKGTSNLF 300

Query: 301 TWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRM 360
           TWNSLIAGMMLN++SEIA+ELFESLESHKLQPDSATWNSMISG A LG+P +AFKYFHRM
Sbjct: 301 TWNSLIAGMMLNEQSEIAVELFESLESHKLQPDSATWNSMISGHAHLGQPAKAFKYFHRM 360

Query: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHL 420
           QSAGTVPSLKSLTS LSICSDLSALRHGKEIHAQ TK  IHMDV LATALIDMYMKCG L
Sbjct: 361 QSAGTVPSLKSLTSFLSICSDLSALRHGKEIHAQVTKRFIHMDVLLATALIDMYMKCGCL 420

Query: 421 SSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLS 480
           SSA+RLFDQF FKPKDTIFWNAMISGYG NG++KSAFDIFDRMLEE V PNAATFTSLLS
Sbjct: 421 SSAQRLFDQFGFKPKDTIFWNAMISGYGNNGENKSAFDIFDRMLEEGVHPNAATFTSLLS 480

Query: 481 ICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMS 540
           ICSHCGQV KGWQFFKMMN KYDLQP RDHYNCMIDILGRAGQLGKARKLL ELPEPSMS
Sbjct: 481 ICSHCGQVDKGWQFFKMMNKKYDLQPDRDHYNCMIDILGRAGQLGKARKLLEELPEPSMS 540

Query: 541 SLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMN 600
           SLASLLGACSLHKDSKLGEEMA RISELEPKNPLPFV+LSNIYA+LGRW+DVERVREMMN
Sbjct: 541 SLASLLGACSLHKDSKLGEEMASRISELEPKNPLPFVILSNIYAELGRWKDVERVREMMN 600

Query: 601 DKGLKKLSGISSIE 615
           DKGL+K SG+SSIE
Sbjct: 601 DKGLRKPSGLSSIE 614

BLAST of Tan0002208 vs. NCBI nr
Match: KAG6601415.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1077.4 bits (2785), Expect = 0.0e+00
Identity = 537/614 (87.46%), Positives = 569/614 (92.67%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           M REI RLVA G Y+EA S YSQHHSASL  +NFIFPPLFKACAKLNSVPQGQMLHTHL+
Sbjct: 1   MNREIVRLVAKGFYVEAFSMYSQHHSASLSSYNFIFPPLFKACAKLNSVPQGQMLHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           K GF ADVYAATALTDMYLKL H DDALK+FDEMPHRN ASLNAMIS FLLNG RGEALR
Sbjct: 61  KVGFSADVYAATALTDMYLKLFHVDDALKVFDEMPHRNTASLNAMISGFLLNGFRGEALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           MVELVNRGSL+P+SITIVN LSACASVA GMQ+HCWAI LGF MDVYVATALLTMY TC+
Sbjct: 121 MVELVNRGSLKPNSITIVNLLSACASVAHGMQVHCWAINLGFQMDVYVATALLTMYCTCE 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
           ++ FAAKVFQE+SNKNVVSFNAYISGLLLNGMPRMV+DVFKSMME   GK NSVTL SVL
Sbjct: 181 KMGFAAKVFQEMSNKNVVSFNAYISGLLLNGMPRMVIDVFKSMMECPPGKSNSVTLFSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLF 300
           SACS++S+L FGRQVHALTVKID+DDVMVGTA+VDMYSKCGAWQ A  VFNERKGTSNLF
Sbjct: 241 SACSSLSHLGFGRQVHALTVKIDNDDVMVGTALVDMYSKCGAWQCARKVFNERKGTSNLF 300

Query: 301 TWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRM 360
           TWNSLIAGMMLN++SEIA+ELFESLESHKLQPDSATWNSMISG A+LG+P++AFKYFHRM
Sbjct: 301 TWNSLIAGMMLNEQSEIAVELFESLESHKLQPDSATWNSMISGHARLGQPVKAFKYFHRM 360

Query: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHL 420
           QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQ TKS IHMDV LATALIDMYMKCG L
Sbjct: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQVTKSFIHMDVLLATALIDMYMKCGCL 420

Query: 421 SSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLS 480
           SSA+RLFDQF FKPKDTIFWNAMISGYG NG++KSAFDIFDRMLEE V PNAATFTSLLS
Sbjct: 421 SSAQRLFDQFGFKPKDTIFWNAMISGYGNNGENKSAFDIFDRMLEEGVHPNAATFTSLLS 480

Query: 481 ICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMS 540
           ICSHCGQV KGWQFFKMMN KYDLQP RDHYNCMIDILGRAGQLGKARKLL ELPEPSMS
Sbjct: 481 ICSHCGQVDKGWQFFKMMNKKYDLQPDRDHYNCMIDILGRAGQLGKARKLLEELPEPSMS 540

Query: 541 SLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMN 600
           SLASLLGACSLHKDSKLGEEMA RISELEPKNPLPFV+LSNIYA+LGRW+DVERVREMMN
Sbjct: 541 SLASLLGACSLHKDSKLGEEMASRISELEPKNPLPFVILSNIYAELGRWKDVERVREMMN 600

Query: 601 DKGLKKLSGISSIE 615
           DKGL+K SG+SSIE
Sbjct: 601 DKGLRKPSGLSSIE 614

BLAST of Tan0002208 vs. NCBI nr
Match: XP_023537719.1 (pentatricopeptide repeat-containing protein At2g02750 [Cucurbita pepo subsp. pepo] >XP_023537727.1 pentatricopeptide repeat-containing protein At2g02750 [Cucurbita pepo subsp. pepo] >XP_023537736.1 pentatricopeptide repeat-containing protein At2g02750 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 537/614 (87.46%), Positives = 566/614 (92.18%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           M REI RLVA G Y+EA S YSQHHSASL  HNFIFPPLFKACAKLNSVPQGQMLHTHL+
Sbjct: 1   MNREIVRLVAKGFYVEAFSMYSQHHSASLSSHNFIFPPLFKACAKLNSVPQGQMLHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           K GF ADVYAATALTDMYLKL H DDALK+FDEMPHRN ASLNAMIS F LNG RGEALR
Sbjct: 61  KVGFSADVYAATALTDMYLKLFHVDDALKVFDEMPHRNTASLNAMISGFSLNGFRGEALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           MVELVNR SL+P+SITIVN LSACASVA G+QIHCWAI LGF MDVYVATALLTMY TC+
Sbjct: 121 MVELVNRDSLKPNSITIVNLLSACASVAHGIQIHCWAINLGFQMDVYVATALLTMYCTCE 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
           ++ FAAKVFQE+SNKNVVSFNAYISGLLLNGMPRMV+DVFKSMME   GK NSVTL SVL
Sbjct: 181 KMGFAAKVFQEMSNKNVVSFNAYISGLLLNGMPRMVIDVFKSMMECPPGKSNSVTLVSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLF 300
           SACS +S+L FGRQVHALTVKID+DDVMVGTA+VDMYSKCGAWQ AH VFNERKGTSNLF
Sbjct: 241 SACSTLSHLGFGRQVHALTVKIDNDDVMVGTALVDMYSKCGAWQCAHKVFNERKGTSNLF 300

Query: 301 TWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRM 360
           TWNSLIAGMMLN++SEIA+ELFESLESHKLQPDSATWNSMISG A LG+P++AFKYFHRM
Sbjct: 301 TWNSLIAGMMLNEQSEIAVELFESLESHKLQPDSATWNSMISGHAHLGQPVKAFKYFHRM 360

Query: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHL 420
           QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQ TKS IHMDV LATALIDMYMKCG L
Sbjct: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQVTKSFIHMDVLLATALIDMYMKCGCL 420

Query: 421 SSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLS 480
           SSA+RLFDQF FKPKDTIFWNAMISGYG NG++KSAFDIFDRMLEE V PNAATFTSLLS
Sbjct: 421 SSAQRLFDQFGFKPKDTIFWNAMISGYGNNGENKSAFDIFDRMLEEGVHPNAATFTSLLS 480

Query: 481 ICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMS 540
           ICSHCGQV KGWQFFKMMN KYDLQP RDHYNCMIDILGRAGQLGKARKLL ELPEPSMS
Sbjct: 481 ICSHCGQVDKGWQFFKMMNKKYDLQPDRDHYNCMIDILGRAGQLGKARKLLEELPEPSMS 540

Query: 541 SLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMN 600
           SLASLLGACSLHKDSKLGEEMA RISELEPKNPLPFV+LSNIYA+LGRW+DVERVREMMN
Sbjct: 541 SLASLLGACSLHKDSKLGEEMASRISELEPKNPLPFVILSNIYAELGRWKDVERVREMMN 600

Query: 601 DKGLKKLSGISSIE 615
           DKGL+K SG+SSIE
Sbjct: 601 DKGLRKPSGLSSIE 614

BLAST of Tan0002208 vs. NCBI nr
Match: XP_022956404.1 (pentatricopeptide repeat-containing protein At2g02750 [Cucurbita moschata])

HSP 1 Score: 1071.2 bits (2769), Expect = 3.5e-309
Identity = 532/614 (86.64%), Positives = 566/614 (92.18%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           M REI RLVA G Y+EA S YSQHHSASL  +NFIFPPLFKACAKLNSVPQGQMLHTHL+
Sbjct: 1   MNREIVRLVAKGFYVEAFSMYSQHHSASLSSYNFIFPPLFKACAKLNSVPQGQMLHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           K GF ADVYAATALTDMYLKL H DDALK+FDEMPHRN ASLNAMIS FLLNG RGEALR
Sbjct: 61  KVGFSADVYAATALTDMYLKLFHVDDALKVFDEMPHRNTASLNAMISGFLLNGFRGEALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           MVELVNRGSL+P+SITIVN LSACASVA GMQ+HCWAI LGF MDVYVATALLTMY TC+
Sbjct: 121 MVELVNRGSLKPNSITIVNLLSACASVAHGMQVHCWAINLGFQMDVYVATALLTMYCTCE 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
           ++ FAAKVFQE+SNKNVVSFNAYISGLLLNGMP MV+DVFKSMME   GK NSVTL SVL
Sbjct: 181 KMGFAAKVFQEMSNKNVVSFNAYISGLLLNGMPPMVIDVFKSMMECPPGKSNSVTLVSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLF 300
           SACS++S+L FGRQVHALTVKID+DDVMVGTA+VDMYSKCGAWQ AH VFNERKGTSNLF
Sbjct: 241 SACSSLSHLGFGRQVHALTVKIDNDDVMVGTALVDMYSKCGAWQCAHKVFNERKGTSNLF 300

Query: 301 TWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRM 360
           TWNSLIAGMMLN++SEIA+ELFESLESHKLQPDSATWNSMISG A+LG+P++AFKYFHRM
Sbjct: 301 TWNSLIAGMMLNEQSEIAVELFESLESHKLQPDSATWNSMISGHARLGQPVKAFKYFHRM 360

Query: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHL 420
           QSAGTVPSLKSLTSLL +CSDLSALRHGKEIHAQ TKS  HMDV LATALIDMYMKCG L
Sbjct: 361 QSAGTVPSLKSLTSLLFVCSDLSALRHGKEIHAQVTKSFTHMDVLLATALIDMYMKCGCL 420

Query: 421 SSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLS 480
           SSA+RLFDQF FKPKDTIFWNAMISGYG NG++KSAFDIFDRMLEE V PNAATFTSLLS
Sbjct: 421 SSAQRLFDQFGFKPKDTIFWNAMISGYGNNGENKSAFDIFDRMLEEGVHPNAATFTSLLS 480

Query: 481 ICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMS 540
           ICSHCGQV KGWQF KMMN KYDLQP RDHYNCMIDILGRAGQLGKARKLL ELPEPSMS
Sbjct: 481 ICSHCGQVDKGWQFLKMMNKKYDLQPDRDHYNCMIDILGRAGQLGKARKLLEELPEPSMS 540

Query: 541 SLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMN 600
           SLASLLGACSLHKDSKLGEEMA RISELEPKNPLPF++LSNIYA+LGRW+DVERVREMMN
Sbjct: 541 SLASLLGACSLHKDSKLGEEMASRISELEPKNPLPFIILSNIYAELGRWKDVERVREMMN 600

Query: 601 DKGLKKLSGISSIE 615
           DKGL+K SG+SSIE
Sbjct: 601 DKGLRKPSGLSSIE 614
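
The percentages in an HSP summary line follow directly from the counts beside them. The sketch below, which assumes the summary layout used on this page, parses one such line and recomputes the identity percentage; `parse_hsp_summary` is a hypothetical helper, and the pattern accepts both "Positives" and the "Postives" spelling that some report renderings emit.

```python
import re

# Matches the count/length/percentage triples of an HSP summary line.
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line: str) -> dict:
    """Return (count, alignment length, reported percent) for each field."""
    m = HSP_PATTERN.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


# Summary line of the Cucurbita moschata hit above.
line = "Identity = 532/614 (86.64%), Positives = 566/614 (92.18%), Query Frame = 0"
hsp = parse_hsp_summary(line)
count, length, reported = hsp["identity"]
recomputed = round(100 * count / length, 2)
print(count, length, reported, recomputed)
```

Recomputing the value this way is a quick consistency check when summary lines are copied between reports.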

BLAST of Tan0002208 vs. ExPASy TrEMBL
Match: A0A6J1K9P7 (pentatricopeptide repeat-containing protein At2g02750 OS=Cucurbita maxima OX=3661 GN=LOC111492916 PE=4 SV=1)

HSP 1 Score: 1077.8 bits (2786), Expect = 0.0e+00
Identity = 537/614 (87.46%), Positives = 566/614 (92.18%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           M REI RLVA G Y+EA S YSQHHSASL  HNFIFPPLFKACAKLNSVPQGQMLHTHL+
Sbjct: 1   MNREIVRLVAKGFYVEAFSMYSQHHSASLSSHNFIFPPLFKACAKLNSVPQGQMLHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           K GF ADVYAATALTDMYLKL H DDALK+FDEMPHRN ASLNAMIS FLLNG RGEALR
Sbjct: 61  KVGFSADVYAATALTDMYLKLFHVDDALKVFDEMPHRNTASLNAMISGFLLNGFRGEALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           MVELVNRGSL+P+SITIVN LSACASVA GMQ+HCWAI LGF MDVYVATALLTMY TC+
Sbjct: 121 MVELVNRGSLKPNSITIVNLLSACASVAHGMQVHCWAINLGFQMDVYVATALLTMYCTCE 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
           ++ FAAKVFQE+SNKNVVSFNAYISGLLLNGMPRMV+DVFKSMME   GK NSVTL SVL
Sbjct: 181 KMGFAAKVFQEMSNKNVVSFNAYISGLLLNGMPRMVIDVFKSMMERPPGKSNSVTLVSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLF 300
           SACS++S+L FGRQVHALTVKID+DDVMVGTA+VDMYSKCGAWQ AHNV NERKGTSNLF
Sbjct: 241 SACSSLSHLGFGRQVHALTVKIDNDDVMVGTALVDMYSKCGAWQCAHNVLNERKGTSNLF 300

Query: 301 TWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRM 360
           TWNSLIAGMMLN++SEIA+ELFESLESHKLQPDSATWNSMISG A LG+P +AFKYFHRM
Sbjct: 301 TWNSLIAGMMLNEQSEIAVELFESLESHKLQPDSATWNSMISGHAHLGQPAKAFKYFHRM 360

Query: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHL 420
           QSAGTVPSLKSLTS LSICSDLSALRHGKEIHAQ TK  IHMDV LATALIDMYMKCG L
Sbjct: 361 QSAGTVPSLKSLTSFLSICSDLSALRHGKEIHAQVTKRFIHMDVLLATALIDMYMKCGCL 420

Query: 421 SSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLS 480
           SSA+RLFDQF FKPKDTIFWNAMISGYG NG++KSAFDIFDRMLEE V PNAATFTSLLS
Sbjct: 421 SSAQRLFDQFGFKPKDTIFWNAMISGYGNNGENKSAFDIFDRMLEEGVHPNAATFTSLLS 480

Query: 481 ICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMS 540
           ICSHCGQV KGWQFFKMMN KYDLQP RDHYNCMIDILGRAGQLGKARKLL ELPEPSMS
Sbjct: 481 ICSHCGQVDKGWQFFKMMNKKYDLQPDRDHYNCMIDILGRAGQLGKARKLLEELPEPSMS 540

Query: 541 SLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMN 600
           SLASLLGACSLHKDSKLGEEMA RISELEPKNPLPFV+LSNIYA+LGRW+DVERVREMMN
Sbjct: 541 SLASLLGACSLHKDSKLGEEMASRISELEPKNPLPFVILSNIYAELGRWKDVERVREMMN 600

Query: 601 DKGLKKLSGISSIE 615
           DKGL+K SG+SSIE
Sbjct: 601 DKGLRKPSGLSSIE 614

BLAST of Tan0002208 vs. ExPASy TrEMBL
Match: A0A6J1GYX9 (pentatricopeptide repeat-containing protein At2g02750 OS=Cucurbita moschata OX=3662 GN=LOC111458155 PE=4 SV=1)

HSP 1 Score: 1071.2 bits (2769), Expect = 1.7e-309
Identity = 532/614 (86.64%), Positives = 566/614 (92.18%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           M REI RLVA G Y+EA S YSQHHSASL  +NFIFPPLFKACAKLNSVPQGQMLHTHL+
Sbjct: 1   MNREIVRLVAKGFYVEAFSMYSQHHSASLSSYNFIFPPLFKACAKLNSVPQGQMLHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           K GF ADVYAATALTDMYLKL H DDALK+FDEMPHRN ASLNAMIS FLLNG RGEALR
Sbjct: 61  KVGFSADVYAATALTDMYLKLFHVDDALKVFDEMPHRNTASLNAMISGFLLNGFRGEALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           MVELVNRGSL+P+SITIVN LSACASVA GMQ+HCWAI LGF MDVYVATALLTMY TC+
Sbjct: 121 MVELVNRGSLKPNSITIVNLLSACASVAHGMQVHCWAINLGFQMDVYVATALLTMYCTCE 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
           ++ FAAKVFQE+SNKNVVSFNAYISGLLLNGMP MV+DVFKSMME   GK NSVTL SVL
Sbjct: 181 KMGFAAKVFQEMSNKNVVSFNAYISGLLLNGMPPMVIDVFKSMMECPPGKSNSVTLVSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLF 300
           SACS++S+L FGRQVHALTVKID+DDVMVGTA+VDMYSKCGAWQ AH VFNERKGTSNLF
Sbjct: 241 SACSSLSHLGFGRQVHALTVKIDNDDVMVGTALVDMYSKCGAWQCAHKVFNERKGTSNLF 300

Query: 301 TWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRM 360
           TWNSLIAGMMLN++SEIA+ELFESLESHKLQPDSATWNSMISG A+LG+P++AFKYFHRM
Sbjct: 301 TWNSLIAGMMLNEQSEIAVELFESLESHKLQPDSATWNSMISGHARLGQPVKAFKYFHRM 360

Query: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHL 420
           QSAGTVPSLKSLTSLL +CSDLSALRHGKEIHAQ TKS  HMDV LATALIDMYMKCG L
Sbjct: 361 QSAGTVPSLKSLTSLLFVCSDLSALRHGKEIHAQVTKSFTHMDVLLATALIDMYMKCGCL 420

Query: 421 SSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLS 480
           SSA+RLFDQF FKPKDTIFWNAMISGYG NG++KSAFDIFDRMLEE V PNAATFTSLLS
Sbjct: 421 SSAQRLFDQFGFKPKDTIFWNAMISGYGNNGENKSAFDIFDRMLEEGVHPNAATFTSLLS 480

Query: 481 ICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMS 540
           ICSHCGQV KGWQF KMMN KYDLQP RDHYNCMIDILGRAGQLGKARKLL ELPEPSMS
Sbjct: 481 ICSHCGQVDKGWQFLKMMNKKYDLQPDRDHYNCMIDILGRAGQLGKARKLLEELPEPSMS 540

Query: 541 SLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMN 600
           SLASLLGACSLHKDSKLGEEMA RISELEPKNPLPF++LSNIYA+LGRW+DVERVREMMN
Sbjct: 541 SLASLLGACSLHKDSKLGEEMASRISELEPKNPLPFIILSNIYAELGRWKDVERVREMMN 600

Query: 601 DKGLKKLSGISSIE 615
           DKGL+K SG+SSIE
Sbjct: 601 DKGLRKPSGLSSIE 614

BLAST of Tan0002208 vs. ExPASy TrEMBL
Match: A0A6J1DCC8 (pentatricopeptide repeat-containing protein At2g02750 OS=Momordica charantia OX=3673 GN=LOC111019194 PE=4 SV=1)

HSP 1 Score: 1049.3 bits (2712), Expect = 6.8e-303
Identity = 520/614 (84.69%), Positives = 559/614 (91.04%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           MKREI RLVA+GLYIEALS YSQHHSASLRPH F+FPPLFKACAKLN VPQG+MLHTHL+
Sbjct: 1   MKREIVRLVASGLYIEALSIYSQHHSASLRPHKFLFPPLFKACAKLNFVPQGKMLHTHLV 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           KTGF ADVYAATALTDMY+KLLHFDDALK+FDEMPHRNLASLNAMIS FL NGSRGEALR
Sbjct: 61  KTGFIADVYAATALTDMYMKLLHFDDALKVFDEMPHRNLASLNAMISGFLFNGSRGEALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           MVE+VN GSLRP+SITIVN LS C  VA GMQIHCWA KLGF +DVYVATALLTMY TC+
Sbjct: 121 MVEIVNSGSLRPNSITIVNLLSGCERVAHGMQIHCWATKLGFQVDVYVATALLTMYFTCE 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
           EV FAAKVFQE+ NKNVVS+NAYISGLLLN MPR VLDVFKSMME  CGKPNSVTLASVL
Sbjct: 181 EVGFAAKVFQEMWNKNVVSYNAYISGLLLNDMPRKVLDVFKSMMECPCGKPNSVTLASVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLF 300
           SACS++S L FGRQVHALTV+I +DDVMVGTA+VDMYSKCGAWQWA+NVF ERK T NLF
Sbjct: 241 SACSDLSDLGFGRQVHALTVRIQNDDVMVGTALVDMYSKCGAWQWAYNVFKERKRTGNLF 300

Query: 301 TWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRM 360
           TWNS+IAGMMLN +SEIAMELFE LE  KLQPDSATWNSMISG A+LGRP+EAFKYFHRM
Sbjct: 301 TWNSVIAGMMLNGQSEIAMELFELLEFQKLQPDSATWNSMISGFARLGRPVEAFKYFHRM 360

Query: 361 QSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHL 420
           QSAG VPS+KSLTSLLSIC+DLSALRHGKEIHAQ TK +IH+D+FLATALID YMKCG+ 
Sbjct: 361 QSAGAVPSIKSLTSLLSICADLSALRHGKEIHAQTTKRVIHLDMFLATALIDTYMKCGNF 420

Query: 421 SSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLS 480
           S ARRLFDQFD KPKDTIFWN MISGYGRNG++  AFDIFDRMLEE V+PNAATFTSLLS
Sbjct: 421 SWARRLFDQFDPKPKDTIFWNTMISGYGRNGENTCAFDIFDRMLEENVRPNAATFTSLLS 480

Query: 481 ICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMS 540
           ICSH GQV KGWQ F+MMN +Y LQP RDH +CMID+LGRAGQLGKARKLL ELPEPS+S
Sbjct: 481 ICSHSGQVEKGWQLFRMMNKQYGLQPYRDHCSCMIDLLGRAGQLGKARKLLEELPEPSIS 540

Query: 541 SLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMN 600
           SLASLLGACSLH DSKLGEEMALRISELEPKNPLPF +LSNIYA+LGRW DVERVREMMN
Sbjct: 541 SLASLLGACSLHNDSKLGEEMALRISELEPKNPLPFSILSNIYAELGRWRDVERVREMMN 600

Query: 601 DKGLKKLSGISSIE 615
           DKGL+KLSGISSIE
Sbjct: 601 DKGLRKLSGISSIE 614

BLAST of Tan0002208 vs. ExPASy TrEMBL
Match: A0A314YQJ5 (Pentatricopeptide repeat-containing protein OS=Prunus yedoensis var. nudiflora OX=2094558 GN=Pyn_14577 PE=4 SV=1)

HSP 1 Score: 790.0 bits (2039), Expect = 7.4e-225
Identity = 392/612 (64.05%), Positives = 478/612 (78.10%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           MKREIARLVA+GLY +AL  Y+Q HSASLRPH F FPPL KAC KL S PQ Q+LHTHL+
Sbjct: 1   MKREIARLVADGLYRDALCLYAQLHSASLRPHKFTFPPLLKACGKLQSAPQAQILHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           KTGF ADVY+ATALTD+Y+KL   DDA+K+F+EMP RNLASLNA+I+ FL NG   EALR
Sbjct: 61  KTGFSADVYSATALTDVYMKLHLIDDAVKVFEEMPERNLASLNAVITGFLQNGYCREALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           +   V  G  RP+S+TI + LSAC +V  GM++HC A+KLG   DVYVAT++LTMYS C 
Sbjct: 121 LFANVGPGGFRPNSVTIASMLSACGNVEHGMEMHCLAVKLGVESDVYVATSVLTMYSNCG 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
            +  AAKVF+E+  KN+VS+NA+ISGLL NG+P +VLD+FK M       PNSVTL SVL
Sbjct: 181 GLFSAAKVFEEMPIKNIVSYNAFISGLLQNGVPHVVLDIFKKMRACTGEDPNSVTLLSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSD-DVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNL 300
           SAC+++ YL FG+QVH L +KI+ + D M+GTA+VDMYSKCG WQ A+  F E     NL
Sbjct: 241 SACASLLYLGFGKQVHGLMMKIEVELDTMLGTALVDMYSKCGCWQLAYGTFKELNENRNL 300

Query: 301 FTWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHR 360
           FTWN++I+GMMLN ++E A+ELFE LES   +PDS TWNSMISG +QLG+ IEAF YF R
Sbjct: 301 FTWNAMISGMMLNAQNENAVELFEQLESEGFKPDSVTWNSMISGFSQLGKAIEAFVYFRR 360

Query: 361 MQSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGH 420
           MQSAG VPSLKS+TSLL  C+DLSAL+ GKE+H  A ++ I  D+F++TALIDMYMKCG 
Sbjct: 361 MQSAGVVPSLKSITSLLPACADLSALQCGKEVHGLAIRTSISNDLFISTALIDMYMKCGQ 420

Query: 421 LSSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLL 480
            S ARR+FD F  KP D  FWNA+ISGYGRNGD++SAF IFD+MLE +VQPNAATFTSLL
Sbjct: 421 SSWARRIFDWFQIKPNDPAFWNAIISGYGRNGDNESAFGIFDQMLEAKVQPNAATFTSLL 480

Query: 481 SICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSM 540
           S+CSH G V KGWQ F+MMN  + L+P   H+ CMID+LGR G+L +AR L+ EL EPS 
Sbjct: 481 SMCSHTGLVDKGWQVFRMMNRDFGLKPNPAHFGCMIDLLGRTGRLDEARGLIQELSEPSG 540

Query: 541 SSLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMM 600
           +  ASLLGAC  H DS+LG+EMA+R+SELEP+NP PFV+LS IYA LGRWED E++R +M
Sbjct: 541 AVFASLLGACESHLDSQLGKEMAIRLSELEPENPTPFVILSKIYAALGRWEDAEKIRGLM 600

Query: 601 NDKGLKKLSGIS 612
           NDK L+KL G S
Sbjct: 601 NDKTLRKLPGFS 612

BLAST of Tan0002208 vs. ExPASy TrEMBL
Match: A0A5E4EY70 (PREDICTED: pentatricopeptide OS=Prunus dulcis OX=3755 GN=ALMOND_2B028369 PE=4 SV=1)

HSP 1 Score: 789.6 bits (2038), Expect = 9.7e-225
Identity = 390/612 (63.73%), Positives = 479/612 (78.27%), Query Frame = 0

Query: 1   MKREIARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLI 60
           M+REIARLVA+GLY +AL  Y+Q HSASLRPH F FPPL KAC KL S P  Q+LHTHL+
Sbjct: 1   MRREIARLVADGLYRDALCLYAQLHSASLRPHKFTFPPLLKACGKLQSAPHAQILHTHLM 60

Query: 61  KTGFQADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALR 120
           KTGF ADVY+ATALTD+Y+KL    DA+K+F+EMP RNLASLNA+I+ FL NG   EALR
Sbjct: 61  KTGFSADVYSATALTDVYMKLHLIGDAVKLFEEMPERNLASLNAVITGFLQNGYCREALR 120

Query: 121 MVELVNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 180
           + + V  G  RP+S+TI + LSAC +V  GM+IHC A+KLG   DVYVAT++LTMYS C 
Sbjct: 121 LFKNVGPGGFRPNSVTIASMLSACGNVEHGMEIHCLAVKLGVESDVYVATSVLTMYSNCG 180

Query: 181 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 240
            +  AAKVF+E+  KN+VS+NA+ISGLL NG+P +VLD+FK M       PNSVTL SVL
Sbjct: 181 GLFLAAKVFEEMPIKNIVSYNAFISGLLQNGVPHVVLDIFKKMRACTGENPNSVTLLSVL 240

Query: 241 SACSNISYLEFGRQVHALTVKIDSD-DVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNL 300
           SAC+++ YL FG+QVH L +KI+ + D M+GTA+VDMYSKCG WQ A+  F E     NL
Sbjct: 241 SACASLLYLRFGKQVHGLMMKIEVELDTMLGTALVDMYSKCGCWQLAYGTFKELNENRNL 300

Query: 301 FTWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHR 360
           FTWN++I+GMMLN ++E A+ELFE LES   +PDS TWNSMISG +QLG+ IEAF YF R
Sbjct: 301 FTWNAMISGMMLNAQNENAVELFEQLESEGFKPDSVTWNSMISGFSQLGKAIEAFVYFRR 360

Query: 361 MQSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGH 420
           MQSAG VPSLKS+TSLL  C+DLSAL+ GKE H  A ++ I  D+F++TALIDMYMKCG 
Sbjct: 361 MQSAGVVPSLKSITSLLPACADLSALQCGKEAHGLAVRTSISNDLFISTALIDMYMKCGQ 420

Query: 421 LSSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLL 480
            S ARR+FD F  KP D  FWNA+ISGYGRNGD++SAF IFD+MLE +VQPNAATFTSLL
Sbjct: 421 SSWARRIFDWFQIKPNDPAFWNAIISGYGRNGDNESAFGIFDQMLEAKVQPNAATFTSLL 480

Query: 481 SICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSM 540
           S+CSH G V KGWQ F+MM+  + L+P   H+ CMID+LGR G+L +AR+L+ EL EPS 
Sbjct: 481 SMCSHTGLVDKGWQVFRMMDRDFGLKPNPAHFGCMIDLLGRTGRLDEARELIQELSEPSG 540

Query: 541 SSLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMM 600
           + LASLLGAC  H DS+LG+EMA+++SELEP+NP PFV+LS IYA LGRWED E++RE+M
Sbjct: 541 AVLASLLGACESHLDSQLGKEMAIKLSELEPENPTPFVILSKIYAALGRWEDAEKIRELM 600

Query: 601 NDKGLKKLSGIS 612
           NDK L+KL G S
Sbjct: 601 NDKTLRKLPGFS 612

BLAST of Tan0002208 vs. TAIR 10
Match: AT2G02750.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 628.6 bits (1620), Expect = 5.5e-180
Identity = 304/608 (50.00%), Positives = 434/608 (71.38%), Query Frame = 0

Query: 5   IARLVANGLYIEALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGF 64
           ++ LV  G  ++ + ++S        P+ F FPPL K+CAKL  V QG++LH  ++KTGF
Sbjct: 11  VSNLVTGGTSLDVILSHS--------PNKFTFPPLLKSCAKLGDVVQGRILHAQVVKTGF 70

Query: 65  QADVYAATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALRMVEL 124
             DV+ ATAL  MY+K+    DALK+ DEMP R +AS+NA +S  L NG   +A RM   
Sbjct: 71  FVDVFTATALVSMYMKVKQVTDALKVLDEMPERGIASVNAAVSGLLENGFCRDAFRMFGD 130

Query: 125 VNRGSLRPSSITIVNFLSACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEVSF 184
                   +S+T+ + L  C  +  GMQ+HC A+K GF M+VYV T+L++MYS C E   
Sbjct: 131 ARVSGSGMNSVTVASVLGGCGDIEGGMQLHCLAMKSGFEMEVYVGTSLVSMYSRCGEWVL 190

Query: 185 AAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVLSACS 244
           AA++F+++ +K+VV++NA+ISGL+ NG+  +V  VF  M ++   +PN VT  + ++AC+
Sbjct: 191 AARMFEKVPHKSVVTYNAFISGLMENGVMNLVPSVFNLMRKFSSEEPNDVTFVNAITACA 250

Query: 245 NISYLEFGRQVHALTVKIDSD-DVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLFTWN 304
           ++  L++GRQ+H L +K +   + MVGTA++DMYSKC  W+ A+ VF E K T NL +WN
Sbjct: 251 SLLNLQYGRQLHGLVMKKEFQFETMVGTALIDMYSKCRCWKSAYIVFTELKDTRNLISWN 310

Query: 305 SLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRMQSA 364
           S+I+GMM+N + E A+ELFE L+S  L+PDSATWNS+ISG +QLG+ IEAFK+F RM S 
Sbjct: 311 SVISGMMINGQHETAVELFEKLDSEGLKPDSATWNSLISGFSQLGKVIEAFKFFERMLSV 370

Query: 365 GTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHLSSA 424
             VPSLK LTSLLS CSD+  L++GKEIH    K+    D+F+ T+LIDMYMKCG  S A
Sbjct: 371 VMVPSLKCLTSLLSACSDIWTLKNGKEIHGHVIKAAAERDIFVLTSLIDMYMKCGLSSWA 430

Query: 425 RRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLSICS 484
           RR+FD+F+ KPKD +FWN MISGYG++G+ +SA +IF+ + EE+V+P+ ATFT++LS CS
Sbjct: 431 RRIFDRFEPKPKDPVFWNVMISGYGKHGECESAIEIFELLREEKVEPSLATFTAVLSACS 490

Query: 485 HCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMSSLA 544
           HCG V KG Q F++M  +Y  +P+ +H  CMID+LGR+G+L +A++++ ++ EPS S  +
Sbjct: 491 HCGNVEKGSQIFRLMQEEYGYKPSTEHIGCMIDLLGRSGRLREAKEVIDQMSEPSSSVYS 550

Query: 545 SLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMNDKG 604
           SLLG+C  H D  LGEE A++++ELEP+NP PFV+LS+IYA L RWEDVE +R++++ K 
Sbjct: 551 SLLGSCRQHLDPVLGEEAAMKLAELEPENPAPFVILSSIYAALERWEDVESIRQVIDQKQ 610

Query: 605 LKKLSGIS 612
           L KL G+S
Sbjct: 611 LVKLPGLS 610

BLAST of Tan0002208 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 361.7 bits (927), Expect = 1.3e-99
Identity = 209/604 (34.60%), Positives = 333/604 (55.13%), Query Frame = 0

Query: 16  EALSTYSQHHSASLRPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGFQADVYAATALT 75
           +AL  + +     + P  + F  L K C     +  G+ +H  L+K+GF  D++A T L 
Sbjct: 118 KALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLE 177

Query: 76  DMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALRMVELVNRGSLRPSSI 135
           +MY K    ++A K+FD MP R+L S N +++ +  NG    AL MV+ +   +L+PS I
Sbjct: 178 NMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFI 237

Query: 136 TIVNFL---SACASVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEVSFAAKVFQEL 195
           TIV+ L   SA   ++ G +IH +A++ GF   V ++TAL+ MY+ C  +  A ++F  +
Sbjct: 238 TIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGM 297

Query: 196 SNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVLSACSNISYLEFG 255
             +NVVS+N+ I   + N  P+  + +F+ M++    KP  V++   L AC+++  LE G
Sbjct: 298 LERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGV-KPTDVSVMGALHACADLGDLERG 357

Query: 256 RQVHALTVKIDSD-DVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLFTWNSLIAGMML 315
           R +H L+V++  D +V V  +++ MY KC                               
Sbjct: 358 RFIHKLSVELGLDRNVSVVNSLISMYCKC------------------------------- 417

Query: 316 NDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFHRMQSAGTVPSLKS 375
             + + A  +F  L+S  L     +WN+MI G AQ GRPI+A  YF +M+S    P   +
Sbjct: 418 -KEVDTAASMFGKLQSRTL----VSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFT 477

Query: 376 LTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHLSSARRLFDQFD 435
             S+++  ++LS   H K IH    +S +  +VF+ TAL+DMY KCG +  AR +FD   
Sbjct: 478 YVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMS 537

Query: 436 FKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLSICSHCGQVHKG 495
            +   T  WNAMI GYG +G  K+A ++F+ M +  ++PN  TF S++S CSH G V  G
Sbjct: 538 ERHVTT--WNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAG 597

Query: 496 WQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELP-EPSMSSLASLLGACS 555
            + F MM   Y ++ + DHY  M+D+LGRAG+L +A   + ++P +P+++   ++LGAC 
Sbjct: 598 LKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQ 657

Query: 556 LHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMNDKGLKKLSGI 615
           +HK+    E+ A R+ EL P +    V+L+NIY     WE V +VR  M  +GL+K  G 
Sbjct: 658 IHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGC 682

BLAST of Tan0002208 vs. TAIR 10
Match: AT5G27110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 358.6 bits (919), Expect = 1.1e-98
Identity = 206/613 (33.61%), Positives = 336/613 (54.81%), Query Frame = 0

Query: 11  NGLYIEALSTYSQHHSASL-RPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGFQADVY 70
           N ++ + L  + +  + S+  P +F FP + KA   L     G+M+HT ++K+G+  DV 
Sbjct: 84  NSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREFLGRMIHTLVVKSGYVCDVV 143

Query: 71  AATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEALRMVELVNRGS 130
            A++L  MY K   F+++L++FDEMP R++AS N +IS F  +G   +AL +   +    
Sbjct: 144 VASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFYQSGEAEKALELFGRMESSG 203

Query: 131 LRPSSITIVNFLSACAS---VARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEVSFAA 190
             P+S+++   +SAC+    + RG +IH   +K GF +D YV +AL+ MY  CD +  A 
Sbjct: 204 FEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEYVNSALVDMYGKCDCLEVAR 263

Query: 191 KVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCG-KPNSVTLASVLSACSN 250
           +VFQ++  K++V++N+ I G +  G  +  +++   M+  + G +P+  TL S+L ACS 
Sbjct: 264 EVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMI--IEGTRPSQTTLTSILMACSR 323

Query: 251 ISYLEFGRQVHALTVK-IDSDDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSNLFTWNS 310
              L  G+ +H   ++ + + D+ V  +++D+Y KCG    A  VF+             
Sbjct: 324 SRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAETVFS------------- 383

Query: 311 LIAGMMLNDKSEIAMELFESLESHKLQPDSA-TWNSMISGLAQLGRPIEAFKYFHRMQSA 370
                                   K Q D A +WN MIS    +G   +A + + +M S 
Sbjct: 384 ------------------------KTQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSV 443

Query: 371 GTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATALIDMYMKCGHLSSA 430
           G  P + + TS+L  CS L+AL  GK+IH   ++S +  D  L +AL+DMY KCG+   A
Sbjct: 444 GVKPDVVTFTSVLPACSQLAALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEA 503

Query: 431 RRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNAATFTSLLSICS 490
            R+F+      KD + W  MIS YG +G  + A   FD M +  ++P+  T  ++LS C 
Sbjct: 504 FRIFN--SIPKKDVVSWTVMISAYGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACG 563

Query: 491 HCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLAELPEPSMSS-- 550
           H G + +G +FF  M +KY ++P  +HY+CMIDILGRAG+L +A +++ + PE S ++  
Sbjct: 564 HAGLIDEGLKFFSQMRSKYGIEPIIEHYSCMIDILGRAGRLLEAYEIIQQTPETSDNAEL 623

Query: 551 LASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWEDVERVREMMND 610
           L++L  AC LH +  LG+ +A  + E  P +   +++L N+YA    W+   RVR  M +
Sbjct: 624 LSTLFSACCLHLEHSLGDRIARLLVENYPDDASTYMVLFNLYASGESWDAARRVRLKMKE 655

Query: 611 KGLKKLSGISSIE 615
            GL+K  G S IE
Sbjct: 684 MGLRKKPGCSWIE 655

BLAST of Tan0002208 vs. TAIR 10
Match: AT1G18485.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 351.3 bits (900), Expect = 1.7e-96
Identity = 227/686 (33.09%), Positives = 348/686 (50.73%), Query Frame = 0

Query: 11  NGLYIEALSTYSQHHSAS-LRPHNFIFPPLFKACAKLNSVPQGQMLHTHLIKTGFQADVY 70
           N LY E L T+ +  S + L P +F +P + KACA ++ V  G  +H  ++KTG   DV+
Sbjct: 164 NELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGIGLAVHGLVVKTGLVEDVF 223

Query: 71  AATALTDMYLKLLHFDDALKMFDEMPHRNLASLNAMISVFLLNGSRGEAL----RMVELV 130
              AL   Y       DAL++FD MP RNL S N+MI VF  NG   E+      M+E  
Sbjct: 224 VGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDNGFSEESFLLLGEMMEEN 283

Query: 131 NRGSLRPSSITIVNFLSACA---SVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCDEV 190
             G+  P   T+V  L  CA    +  G  +H WA+KL    ++ +  AL+ MYS C  +
Sbjct: 284 GDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKELVLNNALMDMYSKCGCI 343

Query: 191 SFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCG----KPNSVTLAS 250
           + A  +F+  +NKNVVS+N  + G    G      DV + M   L G    K + VT+ +
Sbjct: 344 TNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQM---LAGGEDVKADEVTILN 403

Query: 251 VLSACSNISYLEFGRQVHALTVKID-SDDVMVGTAMVDMYSKCGAWQWAHNVFNE-RKGT 310
            +  C + S+L   +++H  ++K +   + +V  A V  Y+KCG+  +A  VF+  R  T
Sbjct: 404 AVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGSLSYAQRVFHGIRSKT 463

Query: 311 SNLFTWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSAT-------------------- 370
            N  +WN+LI G   ++   ++++    ++   L PDS T                    
Sbjct: 464 VN--SWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSACSKLKSLRLGKEV 523

Query: 371 ----------------------------------------------WNSMISGLAQLGRP 430
                                                         WN++I+G  Q G P
Sbjct: 524 HGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSWNTVITGYLQNGFP 583

Query: 431 IEAFKYFHRMQSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATKSIIHMDVFLATAL 490
             A   F +M   G      S+  +   CS L +LR G+E HA A K ++  D F+A +L
Sbjct: 584 DRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALKHLLEDDAFIACSL 643

Query: 491 IDMYMKCGHLSSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQP 550
           IDMY K G ++ + ++F+    K K T  WNAMI GYG +G +K A  +F+ M      P
Sbjct: 644 IDMYAKNGSITQSSKVFN--GLKEKSTASWNAMIMGYGIHGLAKEAIKLFEEMQRTGHNP 703

Query: 551 NAATFTSLLSICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKL 610
           +  TF  +L+ C+H G +H+G ++   M + + L+P   HY C+ID+LGRAGQL KA ++
Sbjct: 704 DDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGRAGQLDKALRV 763

Query: 611 LAE--LPEPSMSSLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGR 615
           +AE    E  +    SLL +C +H++ ++GE++A ++ ELEP+ P  +V+LSN+YA LG+
Sbjct: 764 VAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYVLLSNLYAGLGK 823

BLAST of Tan0002208 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 349.0 bits (894), Expect = 8.4e-96
Identity = 211/643 (32.81%), Positives = 339/643 (52.72%), Query Frame = 0

Query: 36  FPPLFKACAKLN-SVPQGQMLHTHLIKTGFQADVYAATALTDMYLKLLHFDDALKMFDEM 95
           F  L  +C K   S    + +H  +IK+GF  +++    L D Y K    +D  ++FD+M
Sbjct: 22  FAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKM 81

Query: 96  PHRNL-------------------------------ASLNAMISVFLLNGSRGEALRMVE 155
           P RN+                                + N+M+S F  +    EAL    
Sbjct: 82  PQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFA 141

Query: 156 LVNRGSLRPSSITIVNFLSACA---SVARGMQIHCWAIKLGFLMDVYVATALLTMYSTCD 215
           ++++     +  +  + LSAC+    + +G+Q+H    K  FL DVY+ +AL+ MYS C 
Sbjct: 142 MMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCG 201

Query: 216 EVSFAAKVFQELSNKNVVSFNAYISGLLLNGMPRMVLDVFKSMMEWLCGKPNSVTLASVL 275
            V+ A +VF E+ ++NVVS+N+ I+    NG     LDVF+ M+E    +P+ VTLASV+
Sbjct: 202 NVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRV-EPDEVTLASVI 261

Query: 276 SACSNISYLEFGRQVHALTVKIDS--DDVMVGTAMVDMYSKCGAWQWAHNVFNERKGTSN 335
           SAC+++S ++ G++VH   VK D   +D+++  A VDMY+KC   + A  +F+      N
Sbjct: 262 SACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMP-IRN 321

Query: 336 LFTWNSLIAGMMLNDKSEIAMELFESLESHKLQPDSATWNSMISGLAQLGRPIEAFKYFH 395
           +    S+I+G  +   ++ A  +F  +    +     +WN++I+G  Q G   EA   F 
Sbjct: 322 VIAETSMISGYAMAASTKAARLMFTKMAERNV----VSWNALIAGYTQNGENEEALSLFC 381

Query: 396 RMQSAGTVPSLKSLTSLLSICSDLSALRHGKEIHAQATK------SIIHMDVFLATALID 455
            ++     P+  S  ++L  C+DL+ L  G + H    K      S    D+F+  +LID
Sbjct: 382 LLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLID 441

Query: 456 MYMKCGHLSSARRLFDQFDFKPKDTIFWNAMISGYGRNGDSKSAFDIFDRMLEERVQPNA 515
           MY+KCG +     +F +     +D + WNAMI G+ +NG    A ++F  MLE   +P+ 
Sbjct: 442 MYVKCGCVEEGYLVFRK--MMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDH 501

Query: 516 ATFTSLLSICSHCGQVHKGWQFFKMMNTKYDLQPARDHYNCMIDILGRAGQLGKARKLLA 575
            T   +LS C H G V +G  +F  M   + + P RDHY CM+D+LGRAG L +A+ ++ 
Sbjct: 502 ITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIE 561

Query: 576 ELP-EPSMSSLASLLGACSLHKDSKLGEEMALRISELEPKNPLPFVMLSNIYAKLGRWED 635
           E+P +P      SLL AC +H++  LG+ +A ++ E+EP N  P+V+LSN+YA+LG+WED
Sbjct: 562 EMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWED 621

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q1PFA6          7.7e-179  50.00     Pentatricopeptide repeat-containing protein At2g02750 OS=Arabidopsis thaliana OX...
Q3E6Q1          1.8e-98   34.60     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
O04659          1.5e-97   33.61     Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX...
Q0WN60          2.4e-95   33.09     Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX...
Q9SIT7          1.2e-94   32.81     Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
KAG7032196.1    0.0e+00   87.46     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022998231.1  0.0e+00   87.46     pentatricopeptide repeat-containing protein At2g02750 [Cucurbita maxima]
KAG6601415.1    0.0e+00   87.46     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_023537719.1  0.0e+00   87.46     pentatricopeptide repeat-containing protein At2g02750 [Cucurbita pepo subsp. pep...
XP_022956404.1  3.5e-309  86.64     pentatricopeptide repeat-containing protein At2g02750 [Cucurbita moschata]

Match Name      E-value   Identity  Description
A0A6J1K9P7      0.0e+00   87.46     pentatricopeptide repeat-containing protein At2g02750 OS=Cucurbita maxima OX=366...
A0A6J1GYX9      1.7e-309  86.64     pentatricopeptide repeat-containing protein At2g02750 OS=Cucurbita moschata OX=3...
A0A6J1DCC8      6.8e-303  84.69     pentatricopeptide repeat-containing protein At2g02750 OS=Momordica charantia OX=...
A0A314YQJ5      7.4e-225  64.05     Pentatricopeptide repeat-containing protein OS=Prunus yedoensis var. nudiflora O...
A0A5E4EY70      9.7e-225  63.73     PREDICTED: pentatricopeptide OS=Prunus dulcis OX=3755 GN=ALMOND_2B028369 PE=4 SV...

Match Name      E-value   Identity  Description
AT2G02750.1     5.5e-180  50.00     Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1     1.3e-99   34.60     Pentatricopeptide repeat (PPR) superfamily protein
AT5G27110.1     1.1e-98   33.61     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G18485.1     1.7e-96   33.09     Pentatricopeptide repeat (PPR) superfamily protein
AT2G13600.1     8.4e-96   32.81     Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 195..244  e-value: 2.1E-8   score: 34.2
    coord: 298..345  e-value: 1.4E-10  score: 41.2
    coord: 435..482  e-value: 1.8E-13  score: 50.4
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 408..429  e-value: 0.18    score: 12.1
    coord: 581..604  e-value: 0.44    score: 10.9
    coord: 510..534  e-value: 0.0032  score: 17.6
    coord: 72..98    e-value: 0.077   score: 13.3
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 198..226  e-value: 9.1E-4  score: 17.2
    coord: 440..472  e-value: 3.6E-9  score: 34.2
    coord: 336..368  e-value: 6.3E-9  score: 33.5
    coord: 300..334  e-value: 0.0016  score: 16.5
    coord: 474..498  e-value: 0.0026  score: 15.8
    coord: 511..534  e-value: 0.0012  score: 16.9
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 436..470  score: 12.715165
    coord: 333..367  score: 12.791895
    coord: 298..332  score: 9.854266
    coord: 572..606  score: 8.736214
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 382..483  e-value: 1.1E-23  score: 85.5
    coord: 148..252  e-value: 5.2E-16  score: 60.5
    coord: 253..380  e-value: 5.8E-23  score: 83.8
    coord: 484..629  e-value: 1.8E-15  score: 58.7
    coord: 1..147    e-value: 3.7E-21  score: 77.8
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 162..465
None  No IPR available  PANTHER  PTHR24015  OS07G0578800 PROTEIN-RELATED
    coord: 6..611
None  No IPR available  PANTHER  PTHR24015:SF930  PPR CONTAINING PLANT-LIKE PROTEIN
    coord: 6..611
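
The alignment coordinates above are inclusive protein positions, so the length of each hit is `end - start + 1`. The sketch below applies this to the four PROSITE PS51375 hits listed in the table; the variable and function names are illustrative only. The result can be compared with the canonical 35-residue length of a pentatricopeptide repeat.

```python
# PROSITE PS51375 (PPR) hits, copied from the InterPro table above
# as inclusive (start, end) protein coordinates.
ppr_hits = [(436, 470), (333, 367), (298, 332), (572, 606)]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate range."""
    return end - start + 1


# Report the hits in order along the protein.
for start, end in sorted(ppr_hits):
    print(f"{start}..{end}: {span_length(start, end)} residues")

lengths = [span_length(start, end) for start, end in ppr_hits]
```

Each of the four profile hits spans 35 residues, matching the repeat length that gives the family its name.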

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0002208.1  Tan0002208.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding