IVF0008841 (gene) Melon (IVF77) v1

Overview
NameIVF0008841
Typegene
OrganismCucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationchr02: 16876017 .. 16878317 (-)
RNA-Seq ExpressionIVF0008841
SyntenyIVF0008841
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACCAACTAATTCTGTCCTCCCCTTCTTCCTCTGGCTGCTCCGGCTATTCCCATTTGAATCTCCAACAAACCCACCAACTCCATGCCCATTTCATCAAAACCCAATTCCATAATCCTCACCCTTTCTTCTCCCAATCCCACTTCACCCCTGAAGCCAATTACAATCTCCTCATTTCATCTTACACCAACAACCACCTCCCTCAAGCTTCTTTAAACTGCTACCTCCATATGCGTACCAATGATGCTGCTGCTGCACTTGACAACTTCATTCTCCCTTCGCTTCTCAAAGCTTGTGCCCAAGCTTCCTCTGCGGATTTAGGCAGGGAACTCCACGGTTTTGCCCAAAAGAACGGTTTTGCTTCAGACGTTTTTGTGTGCAACGCTCTGATGAACATGTATGAGAAATGTGGGTGCTTGGTTTCTGCTAGCTTGGTGTTTGATAAAATGCCCGAAAGAGATGTTGTCTCTTGGAGTACCATGCTTGGGTGCTATGTACGGAGCAAAGCTTTTGGTGAAGCGCTTCGACTCGTACGGGAGATGCAGTTCGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGGTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGAGAAGATGGAAGTTTCATTGACGACTGCATTGATCGATATGTATTGCAAATGTGAATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCTTGGACGGTGATGATAAAAGGTTGTATTCGCAGTTGCAGATTAGTTGAAGGGGCAAAGAACTTTAATAGAATGCTTGAAGAAAAATTATTCCCCAATGAGATTACACTGCTAAGCTTGATTACTGAATGTGGTTTCGTGAAAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTGTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAATGGTGTCGAGAAAAAGGATGTTAAGATTTGGAGTGCTTTGATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCCTCGAGATGTTAGACAATGAAGTGAAACCAAACAAGGTGACAATGGTTAGCCTTCTTTCTTTATGTGCTGAGGCTGGAACCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGGCTTGAAGTAGACGTCATTCTAGAAACAGCTCTCATCAACATGTATGTAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTTGATGAAGCTACACAACGGGATATTCACATGTGGAACGCAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCATTTCTATTTTCCACGCTTGTAGTCATTCTGGATTGGTAGTAGAAGGGAAAAAGCATTTCAACAGAATGGTTCACAGCTTTGGAATTGTTCCGAAGATGGAACACTATGGATGCTTGGTGGATCTTCTCGGTCGAGCTGGACATCTTGAGGAAGCTCACAACATCATTGAAAATATGCCGATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTCGAGAAACAATGAGCCATTTAGGAATGAAGAAAGAACCAGGACTCAGCTGGATTGAAGTGAATGGTTCAGTTCACCACTTCAAATCTGGAGACAAGACATGCACACAAACAACAAAAGTATATGAAATGGTGGCCGAAATGTGCATCAAATTGAGAGAGGCGGGATACACACCGAACACAGCGGAAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGTGAGAAACTGGCTATGGCGTTTGGTCTCATAAGCACAGCTCCTGGAACACCGATCCGAATCATTAAGAATTTGAGGATTTGTGATGATTGTCATGCTGCAATGAAGCTATTATCAAAAATCTATGCACGAACAATAATAGTAAGAGACAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTCTGGGTTATTGGTAA

mRNA sequence

ATGAACCAACTAATTCTGTCCTCCCCTTCTTCCTCTGGCTGCTCCGGCTATTCCCATTTGAATCTCCAACAAACCCACCAACTCCATGCCCATTTCATCAAAACCCAATTCCATAATCCTCACCCTTTCTTCTCCCAATCCCACTTCACCCCTGAAGCCAATTACAATCTCCTCATTTCATCTTACACCAACAACCACCTCCCTCAAGCTTCTTTAAACTGCTACCTCCATATGCGTACCAATGATGCTGCTGCTGCACTTGACAACTTCATTCTCCCTTCGCTTCTCAAAGCTTGTGCCCAAGCTTCCTCTGCGGATTTAGGCAGGGAACTCCACGGTTTTGCCCAAAAGAACGGTTTTGCTTCAGACGTTTTTGTGTGCAACGCTCTGATGAACATGTATGAGAAATGTGGGTGCTTGGTTTCTGCTAGCTTGGTGTTTGATAAAATGCCCGAAAGAGATGTTGTCTCTTGGAGTACCATGCTTGGGTGCTATGTACGGAGCAAAGCTTTTGGTGAAGCGCTTCGACTCGTACGGGAGATGCAGTTCGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGGTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGAGAAGATGGAAGTTTCATTGACGACTGCATTGATCGATATGTATTGCAAATGTGAATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCTTGGACGGTGATGATAAAAGGTTGTATTCGCAGTTGCAGATTAGTTGAAGGGGCAAAGAACTTTAATAGAATGCTTGAAGAAAAATTATTCCCCAATGAGATTACACTGCTAAGCTTGATTACTGAATGTGGTTTCGTGAAAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTGTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAATGGTGTCGAGAAAAAGGATGTTAAGATTTGGAGTGCTTTGATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCCTCGAGATGTTAGACAATGAAGTGAAACCAAACAAGGTGACAATGGTTAGCCTTCTTTCTTTATGTGCTGAGGCTGGAACCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGGCTTGAAGTAGACGTCATTCTAGAAACAGCTCTCATCAACATGTATGTAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTTGATGAAGCTACACAACGGGATATTCACATGTGGAACGCAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCATTTCTATTTTCCACGCTTGTAGTCATTCTGGATTGGTAGTAGAAGGGAAAAAGCATTTCAACAGAATGGTTCACAGCTTTGGAATTGTTCCGAAGATGGAACACTATGGATGCTTGGTGGATCTTCTCGGTCGAGCTGGACATCTTGAGGAAGCTCACAACATCATTGAAAATATGCCGATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTCGAGAAACAATGAGCCATTTAGGAATGAAGAAAGAACCAGGACTCAGCTGGATTGAAGTGAATGGTTCAGTTCACCACTTCAAATCTGGAGACAAGACATGCACACAAACAACAAAAGTATATGAAATGGTGGCCGAAATGTGCATCAAATTGAGAGAGGCGGGATACACACCGAACACAGCGGAAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGTGAGAAACTGGCTATGGCGTTTGGTCTCATAAGCACAGCTCCTGGAACACCGATCCGAATCATTAAGAATTTGAGGATTTGTGATGATTGTCATGCTGCAATGAAGCTATTATCAAAAATCTATGCACGAACAATAATAGTAAGAGACAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTCTGGGTTATTGGTAA

Coding sequence (CDS)

ATGAACCAACTAATTCTGTCCTCCCCTTCTTCCTCTGGCTGCTCCGGCTATTCCCATTTGAATCTCCAACAAACCCACCAACTCCATGCCCATTTCATCAAAACCCAATTCCATAATCCTCACCCTTTCTTCTCCCAATCCCACTTCACCCCTGAAGCCAATTACAATCTCCTCATTTCATCTTACACCAACAACCACCTCCCTCAAGCTTCTTTAAACTGCTACCTCCATATGCGTACCAATGATGCTGCTGCTGCACTTGACAACTTCATTCTCCCTTCGCTTCTCAAAGCTTGTGCCCAAGCTTCCTCTGCGGATTTAGGCAGGGAACTCCACGGTTTTGCCCAAAAGAACGGTTTTGCTTCAGACGTTTTTGTGTGCAACGCTCTGATGAACATGTATGAGAAATGTGGGTGCTTGGTTTCTGCTAGCTTGGTGTTTGATAAAATGCCCGAAAGAGATGTTGTCTCTTGGAGTACCATGCTTGGGTGCTATGTACGGAGCAAAGCTTTTGGTGAAGCGCTTCGACTCGTACGGGAGATGCAGTTCGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGGTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGAGAAGATGGAAGTTTCATTGACGACTGCATTGATCGATATGTATTGCAAATGTGAATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCTTGGACGGTGATGATAAAAGGTTGTATTCGCAGTTGCAGATTAGTTGAAGGGGCAAAGAACTTTAATAGAATGCTTGAAGAAAAATTATTCCCCAATGAGATTACACTGCTAAGCTTGATTACTGAATGTGGTTTCGTGAAAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTGTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAATGGTGTCGAGAAAAAGGATGTTAAGATTTGGAGTGCTTTGATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCCTCGAGATGTTAGACAATGAAGTGAAACCAAACAAGGTGACAATGGTTAGCCTTCTTTCTTTATGTGCTGAGGCTGGAACCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGGCTTGAAGTAGACGTCATTCTAGAAACAGCTCTCATCAACATGTATGTAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTTGATGAAGCTACACAACGGGATATTCACATGTGGAACGCAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCATTTCTATTTTCCACGCTTGTAGTCATTCTGGATTGGTAGTAGAAGGGAAAAAGCATTTCAACAGAATGGTTCACAGCTTTGGAATTGTTCCGAAGATGGAACACTATGGATGCTTGGTGGATCTTCTCGGTCGAGCTGGACATCTTGAGGAAGCTCACAACATCATTGAAAATATGCCGATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTCGAGAAACAATGAGCCATTTAGGAATGAAGAAAGAACCAGGACTCAGCTGGATTGAAGTGAATGGTTCAGTTCACCACTTCAAATCTGGAGACAAGACATGCACACAAACAACAAAAGTATATGAAATGGTGGCCGAAATGTGCATCAAATTGAGAGAGGCGGGATACACACCGAACACAGCGGAAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGTGAGAAACTGGCTATGGCGTTTGGTCTCATAAGCACAGCTCCTGGAACACCGATCCGAATCATTAAGAATTTGAGGATTTGTGATGATTGTCATGCTGCAATGAAGCTATTATCAAAAATCTATGCACGAACAATAATAGTAAGAGACAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTCTGGGTTATTGGTAA

Protein sequence

MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVREMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVKIWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYINRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEALELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDLLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVYEMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW
Homology
BLAST of IVF0008841 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 9.1e-155
Identity = 275/712 (38.62%), Postives = 424/712 (59.55%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGF 114
           Y+ ++  +        +L  ++ MR +D    + NF    LLK C   +   +G+E+HG 
Sbjct: 103 YHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTY--LLKVCGDEAELRVGKEIHGL 162

Query: 115 AQKNGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEA 174
             K+GF+ D+F    L NMY KC  +  A  VFD+MPERD+VSW+T++  Y ++     A
Sbjct: 163 LVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMA 222

Query: 175 LRLVREMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALI 234
           L +V+ M    +K S + ++S++     L  +  G+ +HGY +R+  D    V+++TAL+
Sbjct: 223 LEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSL--VNISTALV 282

Query: 235 DMYCKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEI 294
           DMY KC  L +A++LFD + +R+VVSW  MI   +++    E    F +ML+E + P ++
Sbjct: 283 DMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDV 342

Query: 295 TLLSLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV 354
           +++  +  C  +  L+ G++ H   +  G   ++++V +LI MY KC +V  A ++F  +
Sbjct: 343 SVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKL 402

Query: 355 EKKDVKIWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGK 414
           + + +  W+A+I  +A         N F +M    VKP+  T VS+++  AE       K
Sbjct: 403 QSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAK 462

Query: 415 WTHAYINRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHG 474
           W H  + R  L+ +V + TAL++MY KCG + IAR +FD  ++R +  WNAM+ G+  HG
Sbjct: 463 WIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHG 522

Query: 475 CGKEALELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHY 534
            GK ALELF EM+   ++PN +TF+S+  ACSHSGLV  G K F  M  ++ I   M+HY
Sbjct: 523 FGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHY 582

Query: 535 GCLVDLLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDP 594
           G +VDLLGRAG L EA + I  MP++P   ++GA+L AC++HKN+   E AA ++ EL+P
Sbjct: 583 GAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNP 642

Query: 595 QNCGYSVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCT 654
            + GY VL +NIY +A  W  V  VR +M   G++K PG S +E+   VH F SG     
Sbjct: 643 DDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHP 702

Query: 655 QTTKVYEMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPG 714
            + K+Y  + ++   ++EAGY P+T  ++L ++ + KE  LS HSEKLA++FGL++T  G
Sbjct: 703 DSKKIYAFLEKLICHIKEAGYVPDT-NLVLGVENDVKEQLLSTHSEKLAISFGLLNTTAG 762

Query: 715 TPIRIIKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           T I + KNLR+C DCH A K +S +  R I+VRD  RFHHF  G CSC  YW
Sbjct: 763 TTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of IVF0008841 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 541.2 bits (1393), Expect = 1.9e-152
Identity = 294/771 (38.13%), Postives = 445/771 (57.72%), Query Frame = 0

Query: 20  LNLQQTHQLHAHFIKT-QFHNPH---PFFSQSHFT---------------PEAN---YNL 79
           ++L+Q  Q H H I+T  F +P+     F+ +  +               P+ N   +N 
Sbjct: 41  VSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNT 100

Query: 80  LISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQK 139
           LI +Y +   P  S+  +L M  +++    + +  P L+KA A+ SS  LG+ LHG A K
Sbjct: 101 LIRAYASGPDPVLSIWAFLDM-VSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVK 160

Query: 140 NGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRL 199
           +   SDVFV N+L++ Y  CG L SA  VF  + E+DVVSW++M+  +V+  +  +AL L
Sbjct: 161 SAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALEL 220

Query: 200 VREMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMY 259
            ++M+   VK S V ++ ++     + +++ GR V  YI  N     + ++L  A++DMY
Sbjct: 221 FKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEEN--RVNVNLTLANAMLDMY 280

Query: 260 CKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLL 319
            KC  +  A+RLFD + ++  V+WT M+ G                              
Sbjct: 281 TKCGSIEDAKRLFDAMEEKDNVTWTTMLDG------------------------------ 340

Query: 320 SLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKK 379
                                                   Y        AR + N + +K
Sbjct: 341 ----------------------------------------YAISEDYEAAREVLNSMPQK 400

Query: 380 DVKIWSALISAYAHVSCMDQVFNLFLEM-LDNEVKPNKVTMVSLLSLCAEAGTLDLGKWT 439
           D+  W+ALISAY      ++   +F E+ L   +K N++T+VS LS CA+ G L+LG+W 
Sbjct: 401 DIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWI 460

Query: 440 HAYINRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCG 499
           H+YI +HG+ ++  + +ALI+MY KCGD+  +R +F+   +RD+ +W+AM+ G +MHGCG
Sbjct: 461 HSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCG 520

Query: 500 KEALELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGC 559
            EA+++F +M+   V+PN +TF ++F ACSH+GLV E +  F++M  ++GIVP+ +HY C
Sbjct: 521 NEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYAC 580

Query: 560 LVDLLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQN 619
           +VD+LGR+G+LE+A   IE MP+ P+T +WGALL ACK+H NL L E+A  ++LEL+P+N
Sbjct: 581 IVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRN 640

Query: 620 CGYSVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQT 679
            G  VL SNIYA   +W +V+ +R+ M   G+KKEPG S IE++G +H F SGD     +
Sbjct: 641 DGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMS 700

Query: 680 TKVYEMVAEMCIKLREAGYTPNTAEVLLNIDEEE-KESALSYHSEKLAMAFGLISTAPGT 739
            KVY  + E+  KL+  GY P  ++VL  I+EEE KE +L+ HSEKLA+ +GLIST    
Sbjct: 701 EKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPK 738

Query: 740 PIRIIKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
            IR+IKNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CSC  +W
Sbjct: 761 VIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of IVF0008841 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 537.0 bits (1382), Expect = 3.6e-151
Identity = 289/748 (38.64%), Postives = 432/748 (57.75%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGF 114
           YN LI  Y ++ L   ++  +L M   ++  + D +  P  L ACA++ +   G ++HG 
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRMM--NSGISPDKYTFPFGLSACAKSRAKGNGIQIHGL 161

Query: 115 AQKNGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEA 174
             K G+A D+FV N+L++ Y +CG L SA  VFD+M ER+VVSW++M+  Y R     +A
Sbjct: 162 IVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDA 221

Query: 175 L----RLVREMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLT 234
           +    R+VR+ +   V  + V ++ +I     L D+++G  V+ +I RN G E  ++ + 
Sbjct: 222 VDLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAFI-RNSGIEVNDL-MV 281

Query: 235 TALIDMYCKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLF 294
           +AL+DMY KC  +  A+RLFD     ++     M    +R     E    FN M++  + 
Sbjct: 282 SALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVR 341

Query: 295 PNEITLLSLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKC--------- 354
           P+ I++LS I+ C  ++ +  GK  H Y+LRNGF     +  ALIDMY KC         
Sbjct: 342 PDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRI 401

Query: 355 ----------------------GQVGYARALFNGVEKKDVKIWSALISAYAHVSCMDQVF 414
                                 G+V  A   F  + +K++  W+ +IS     S  ++  
Sbjct: 402 FDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAI 461

Query: 415 NLFLEMLDNE-VKPNKVTMVSLLSLCAEAGTLDLGKWTHAYINRHGLEVDVILETALINM 474
            +F  M   E V  + VTM+S+ S C   G LDL KW + YI ++G+++DV L T L++M
Sbjct: 462 EVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDM 521

Query: 475 YVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEALELFSEMESHGVEPNDITF 534
           + +CGD   A S+F+  T RD+  W A +   +M G  + A+ELF +M   G++P+ + F
Sbjct: 522 FSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAF 581

Query: 535 ISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDLLGRAGHLEEAHNIIENMP 594
           +    ACSH GLV +GK+ F  M+   G+ P+  HYGC+VDLLGRAG LEEA  +IE+MP
Sbjct: 582 VGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMP 641

Query: 595 MRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTS 654
           M PN +IW +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+  
Sbjct: 642 MEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAK 701

Query: 655 VRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVYEMVAEMCIKLREAGYTPN 714
           VR +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P+
Sbjct: 702 VRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVPD 761

Query: 715 TAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAAMKLLSK 767
            + VL+++DE+EK   LS HSEKLAMA+GLIS+  GT IRI+KNLR+C DCH+  K  SK
Sbjct: 762 LSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFASK 821

BLAST of IVF0008841 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 1.7e-148
Identity = 288/770 (37.40%), Postives = 424/770 (55.06%), Query Frame = 0

Query: 22  LQQTHQLHAHFIKTQFHNPHPFFSQ--------SHF------------TPEAN---YNLL 81
           LQ    +HA  IK   HN +   S+         HF              E N   +N +
Sbjct: 46  LQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTM 105

Query: 82  ISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKN 141
              +  +  P ++L  Y+ M +       +++  P +LK+CA++ +   G+++HG   K 
Sbjct: 106 FRGHALSSDPVSALKLYVCMIS--LGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKL 165

Query: 142 GFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLV 201
           G   D++V  +L++MY + G L  A  VFDK P RDVVS++ ++                
Sbjct: 166 GCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALI---------------- 225

Query: 202 REMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYC 261
                                         G A  GYI                      
Sbjct: 226 -----------------------------KGYASRGYI---------------------- 285

Query: 262 KCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLS 321
                 +AQ+LFD +  + VVSW  MI G   +    E  + F  M++  + P+E T+++
Sbjct: 286 -----ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVT 345

Query: 322 LITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKD 381
           +++ C    +++LG+  H ++  +GFG +L +V ALID+Y KCG++  A  LF  +  KD
Sbjct: 346 VVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKD 405

Query: 382 VKIWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHA 441
           V  W+ LI  Y H++   +   LF EML +   PN VTM+S+L  CA  G +D+G+W H 
Sbjct: 406 VISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHV 465

Query: 442 YINRH--GLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCG 501
           YI++   G+     L T+LI+MY KCGD+  A  +F+    + +  WNAM+ GF+MHG  
Sbjct: 466 YIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRA 525

Query: 502 KEALELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGC 561
             + +LFS M   G++P+DITF+ +  ACSHSG++  G+  F  M   + + PK+EHYGC
Sbjct: 526 DASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGC 585

Query: 562 LVDLLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQN 621
           ++DLLG +G  +EA  +I  M M P+ +IW +LL ACK+H N+ LGE  A  +++++P+N
Sbjct: 586 MIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPEN 645

Query: 622 CGYSVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQT 681
            G  VL SNIYASA RWN+V   R  ++  GMKK PG S IE++  VH F  GDK   + 
Sbjct: 646 PGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRN 705

Query: 682 TKVYEMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTP 741
            ++Y M+ EM + L +AG+ P+T+EVL  ++EE KE AL +HSEKLA+AFGLIST PGT 
Sbjct: 706 REIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTK 741

Query: 742 IRIIKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           + I+KNLR+C +CH A KL+SKIY R II RDR RFHHF +G CSC  YW
Sbjct: 766 LTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of IVF0008841 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 526.9 bits (1356), Expect = 3.7e-148
Identity = 273/716 (38.13%), Postives = 418/716 (58.38%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGF 114
           +N+L++    +     S+  +  M +  +   +D++    + K+ +   S   G +LHGF
Sbjct: 163 WNILMNELAKSGDFSGSIGLFKKMMS--SGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGF 222

Query: 115 AQKNGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEA 174
             K+GF     V N+L+  Y K   + SA  VFD+M ERDV+SW++++  YV +    + 
Sbjct: 223 ILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKG 282

Query: 175 LRLVREMQFVGVKLSGVALISLIGVFGNLLD---MKSGRAVHGYIVRNVGDEKMEVSLTT 234
           L +  +M   G+++    L +++ VF    D   +  GRAVH   V+       E     
Sbjct: 283 LSVFVQMLVSGIEID---LATIVSVFAGCADSRLISLGRAVHSIGVKACFSR--EDRFCN 342

Query: 235 ALIDMYCKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFP 294
            L+DMY KC  L SA+ +F  +S RSVVS+T MI G  R     E  K F  M EE + P
Sbjct: 343 TLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISP 402

Query: 295 NEITLLSLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALF 354
           +  T+ +++  C   + LD GK  H ++  N  G  + +  AL+DMY KCG +  A  +F
Sbjct: 403 DVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVF 462

Query: 355 NGVEKKDVKIWSALISAYAHVSCMDQVFNLFLEMLDNE-VKPNKVTMVSLLSLCAEAGTL 414
           + +  KD+  W+ +I  Y+     ++  +LF  +L+ +   P++ T+  +L  CA     
Sbjct: 463 SEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAF 522

Query: 415 DLGKWTHAYINRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGF 474
           D G+  H YI R+G   D  +  +L++MY KCG + +A  LFD+   +D+  W  M+AG+
Sbjct: 523 DKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGY 582

Query: 475 SMHGCGKEALELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPK 534
            MHG GKEA+ LF++M   G+E ++I+F+S+ +ACSHSGLV EG + FN M H   I P 
Sbjct: 583 GMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPT 642

Query: 535 MEHYGCLVDLLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKIL 594
           +EHY C+VD+L R G L +A+  IENMP+ P+  IWGALL  C++H ++ L E  A K+ 
Sbjct: 643 VEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVF 702

Query: 595 ELDPQNCGYSVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGD 654
           EL+P+N GY VL +NIYA A++W  V  +R+ +   G++K PG SWIE+ G V+ F +GD
Sbjct: 703 ELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGD 762

Query: 655 KTCTQTTKVYEMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLIS 714
            +  +T  +   + ++  ++ E GY+P T   L++ +E EKE AL  HSEKLAMA G+IS
Sbjct: 763 SSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGIIS 822

Query: 715 TAPGTPIRIIKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           +  G  IR+ KNLR+C DCH   K +SK+  R I++RD NRFH F +G+CSC G+W
Sbjct: 823 SGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871

BLAST of IVF0008841 vs. ExPASy TrEMBL
Match: A0A5A7V2V9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold403G001350 PE=3 SV=1)

HSP 1 Score: 1561.2 bits (4041), Expect = 0.0e+00
Identity = 766/766 (100.00%), Postives = 766/766 (100.00%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
           ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of IVF0008841 vs. ExPASy TrEMBL
Match: A0A1S3CJ58 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501009 PE=3 SV=1)

HSP 1 Score: 1558.9 bits (4035), Expect = 0.0e+00
Identity = 764/766 (99.74%), Postives = 766/766 (100.00%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYI+RNVGDEKMEVSLTTALIDMYCKC
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIMRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
           ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVH+FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHNFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of IVF0008841 vs. ExPASy TrEMBL
Match: A0A0A0LYC2 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G690260 PE=3 SV=1)

HSP 1 Score: 1473.4 bits (3813), Expect = 0.0e+00
Identity = 720/766 (93.99%), Postives = 738/766 (96.34%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLIL SPS SGCSG+SHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYTNNHLPQAS NCYLHMR+ND AAALDNFILPSLLKACAQASS DLGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASFNCYLHMRSND-AAALDNFILPSLLKACAQASSGDLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSA LVFD+MPERDVVSW+TMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           MQFVGVKLSGVALISLI VFGNLLDMKSGRAVHGYIVRNVGDEKMEVS+TTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
            CLASAQRLFDRLSKRSVVSWTVMI GCIRSCRL EGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 241 GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFV TLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV+KKDVK
Sbjct: 301 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWS LISAYAHVSCMDQVFNLF+EML+N+VKPN VTMVSLLSLCAEAG LDLGKWTHAYI
Sbjct: 361 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMY KCGDVTIARSLF+EA QRDI MWN MMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITF+SIFHACSHSGLVVEGKK+FN+MVH FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHL+EAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRE MSH GMKKEPGLSWIEV+GSVHHFKSGDK CTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMV EMCIKLRE+GYTPNTA VLLNIDEEEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 661 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 765

BLAST of IVF0008841 vs. ExPASy TrEMBL
Match: A0A6J1HA74 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111461539 PE=3 SV=1)

HSP 1 Score: 1338.6 bits (3463), Expect = 0.0e+00
Identity = 651/766 (84.99%), Postives = 703/766 (91.78%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           M+QLILS+ S SG SG+SHLNLQQTHQ+HAH IKTQF NPH FFS+SHFTPEAN+NLLIS
Sbjct: 1   MDQLILSAASPSG-SGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYT+NHLPQA+   Y HMRT D AAA+DNFI+PSLLKACAQASS + GRE+HGFA KNGF
Sbjct: 61  SYTDNHLPQAAFILYHHMRTTD-AAAVDNFIVPSLLKACAQASSTNFGREVHGFAVKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
            SDVFVCNALMNMYEKCG LVSA LVFDKMP+RDVVSWSTMLGCYVRSK+FGEA RLVRE
Sbjct: 121 VSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           M FVGVKLS VALIS+IGVFG L DMKSGRA+HGY+VRNVG+E++E+ LTTALIDMYCK 
Sbjct: 181 MHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKG 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
           + LASA RLFD LS+R+VVSWT +I GCIRSCR VEGAKNF+RMLEE + PNEITLLSLI
Sbjct: 241 DKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFV  LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGVE+KDVK
Sbjct: 301 TECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWSALISAYAH SC+DQ F+LFL+MLD+EVKPNKVTMVSLLSLCAE G LDLG+WTHAYI
Sbjct: 361 IWSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
           NRHG+EVDV+LETALINMY KCGD+  AR LFDEAT+RDIHMWNAMMAGFS+HGCGKEAL
Sbjct: 421 NRHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFS+M  HGVEPNDITFIS+FHACSHSGLV EG KHF+RMVH FGIVPK+EHYGCLVDL
Sbjct: 481 ELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRA  L+ AH+IIENMPMRPNTI+WGALLAACKLHKNLALGEVAARKILELDP+NCGY 
Sbjct: 541 LGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYR 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYAS KRW DVTSVRETMSHLGMKKEPGLSWIEVNGSVHHF+SGDKTCTQT KV+
Sbjct: 601 VLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVH 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMV EMCIKLREAGY PNT+ VLLN+++EEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of IVF0008841 vs. ExPASy TrEMBL
Match: A0A6J1JKG9 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima OX=3661 GN=LOC111485395 PE=3 SV=1)

HSP 1 Score: 1321.2 bits (3418), Expect = 0.0e+00
Identity = 641/766 (83.68%), Postives = 699/766 (91.25%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           M+QLILS+ S SG SG+SHLNLQQTHQ+HAHFIKTQF NPH FFS+S+FTPEAN+NLLIS
Sbjct: 1   MDQLILSAASPSG-SGHSHLNLQQTHQIHAHFIKTQFRNPHNFFSRSNFTPEANFNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYT+NH PQA+ N Y HMRT D AAA+DNFI+PSLLKACAQASS +LGRE+HGFA KNGF
Sbjct: 61  SYTDNHRPQAAFNLYHHMRTTD-AAAVDNFIVPSLLKACAQASSTNLGREVHGFAVKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
            SDVFVCNALMNMYEKCG LVSA LVFDKMP+RDVVSWSTMLGCYVRSK+FGEA RLVRE
Sbjct: 121 VSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           M FVGVKLS VALIS+IGVFG L DMKSGRA+HGY+VRNVG E++E+ LTTALIDMYCK 
Sbjct: 181 MHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGKERIELPLTTALIDMYCKG 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
           + LASA RLF+ LS+R+VVSWT +I GCIRSCR VEGAKNF+RMLEE + PNEITLLSLI
Sbjct: 241 DNLASAMRLFNGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIVPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFV  LDLGKW H+YLLRNGFGMSL L TALIDMYGKCGQV YARALFN V++KDVK
Sbjct: 301 TECGFVGALDLGKWLHSYLLRNGFGMSLTLTTALIDMYGKCGQVAYARALFNVVDEKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWSALISAYAH SC+DQ F+LFL+MLD+EVKPNKVTMVSLLSLCAE G LDLG+WTHAYI
Sbjct: 361 IWSALISAYAHTSCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
             HG+EVD++LETALINMY KCGD+  ARSLFDEATQRDIHMWNAMMAGFS+HGCGKEAL
Sbjct: 421 IHHGVEVDIVLETALINMYAKCGDLKTARSLFDEATQRDIHMWNAMMAGFSIHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFS+ME HGVEPNDITFIS+FHACSHSGLV EG KHF+RMVH FGIVPK+EHYGCLVDL
Sbjct: 481 ELFSDMECHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRA  L+ AH+IIENMPMRPNTIIWGALLAACKLHKNL LG+VAARKILELDP+NCGY 
Sbjct: 541 LGRAKRLDAAHSIIENMPMRPNTIIWGALLAACKLHKNLPLGKVAARKILELDPENCGYR 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYAS KRW +VTS+RE+MSHLGMKKEPGLSW EVNGSVHHF+SGDKTCTQ  KV+
Sbjct: 601 VLKSNIYASEKRWTNVTSIRESMSHLGMKKEPGLSWTEVNGSVHHFRSGDKTCTQARKVH 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMV EMCIKLREAGY PNT+ VLLN+++EEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of IVF0008841 vs. NCBI nr
Match: KAA0062552.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK28774.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1553 bits (4020), Expect = 0.0
Identity = 766/766 (100.00%), Postives = 766/766 (100.00%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
           ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766
           KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of IVF0008841 vs. NCBI nr
Match: XP_008462708.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucumis melo])

HSP 1 Score: 1550 bits (4014), Expect = 0.0
Identity = 764/766 (99.74%), Postives = 766/766 (100.00%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYI+RNVGDEKMEVSLTTALIDMYCKC
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIMRNVGDEKMEVSLTTALIDMYCKC 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
           ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVH+FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHNFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766
           KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of IVF0008841 vs. NCBI nr
Match: XP_011660280.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial [Cucumis sativus])

HSP 1 Score: 1464 bits (3791), Expect = 0.0
Identity = 720/766 (93.99%), Postives = 738/766 (96.34%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           MNQLIL SPS SGCSG+SHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS
Sbjct: 1   MNQLILCSPSFSGCSGHSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYTNNHLPQAS NCYLHMR+NDAAA LDNFILPSLLKACAQASS DLGRELHGFAQKNGF
Sbjct: 61  SYTNNHLPQASFNCYLHMRSNDAAA-LDNFILPSLLKACAQASSGDLGRELHGFAQKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
           ASDVFVCNALMNMYEKCGCLVSA LVFD+MPERDVVSW+TMLGCYVRSKAFGEALRLVRE
Sbjct: 121 ASDVFVCNALMNMYEKCGCLVSARLVFDQMPERDVVSWTTMLGCYVRSKAFGEALRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           MQFVGVKLSGVALISLI VFGNLLDMKSGRAVHGYIVRNVGDEKMEVS+TTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
            CLASAQRLFDRLSKRSVVSWTVMI GCIRSCRL EGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 241 GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFV TLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV+KKDVK
Sbjct: 301 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWS LISAYAHVSCMDQVFNLF+EML+N+VKPN VTMVSLLSLCAEAG LDLGKWTHAYI
Sbjct: 361 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
           NRHGLEVDVILETALINMY KCGDVTIARSLF+EA QRDI MWN MMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFSEMESHGVEPNDITF+SIFHACSHSGLVVEGKK+FN+MVH FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRAGHL+EAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYASAKRWNDVTSVRE MSH GMKKEPGLSWIEV+GSVHHFKSGDK CTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMV EMCIKLRE+GYTPNTA VLLNIDEEEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 661 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 765

BLAST of IVF0008841 vs. NCBI nr
Match: XP_038879151.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 1389 bits (3594), Expect = 0.0
Identity = 681/767 (88.79%), Postives = 717/767 (93.48%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCS-GYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLI 60
           MNQLILSSPS SG   G+SHL+LQQT Q+HAHFIKTQFH PHPFFSQ+HF+PEANYNLLI
Sbjct: 1   MNQLILSSPSFSGSGHGHSHLSLQQTQQIHAHFIKTQFHRPHPFFSQTHFSPEANYNLLI 60

Query: 61  SSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNG 120
           SSYTNNHLPQAS   YLHMRT DAAAALDNFILPSLLKACAQAS   LGRELHGFA KNG
Sbjct: 61  SSYTNNHLPQASFKLYLHMRTTDAAAALDNFILPSLLKACAQASCGVLGRELHGFAIKNG 120

Query: 121 FASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVR 180
           FA DVFVCNALMNMYEKCG LV A LVFDKMP+RDVVSWSTMLGCYVRSK++ EAL LVR
Sbjct: 121 FAPDVFVCNALMNMYEKCGSLVFARLVFDKMPDRDVVSWSTMLGCYVRSKSYDEALVLVR 180

Query: 181 EMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCK 240
           EM FVGVKLSGVALIS+IG FG LLDMKSGRAVHGYIVRNV DEKMEV LTTALI+MYCK
Sbjct: 181 EMHFVGVKLSGVALISMIGAFGELLDMKSGRAVHGYIVRNVVDEKMEVPLTTALINMYCK 240

Query: 241 CECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSL 300
            E L SAQRLFD L ++SVVSWTVMI GCIR+CRLVEGA NFNRMLEE++FPNEITLL+L
Sbjct: 241 GERLESAQRLFDVLPQKSVVSWTVMIAGCIRNCRLVEGANNFNRMLEEEVFPNEITLLNL 300

Query: 301 ITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDV 360
           ITECGFV TLDLGKWFHAYLLRN FGMSLALVTALIDMYGKCGQVGYARALFNG+E+KDV
Sbjct: 301 ITECGFVGTLDLGKWFHAYLLRNEFGMSLALVTALIDMYGKCGQVGYARALFNGIEEKDV 360

Query: 361 KIWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAY 420
           KIWSAL+ AYAH SC+DQ FNLFLEMLD+EVKPNKVTMV LLSLCAEAG L+LGKWTH Y
Sbjct: 361 KIWSALLLAYAHASCIDQAFNLFLEMLDSEVKPNKVTMVGLLSLCAEAGALNLGKWTHTY 420

Query: 421 INRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEA 480
           INRHGLEVDV+LETALINMY KCGD+TIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEA
Sbjct: 421 INRHGLEVDVVLETALINMYAKCGDLTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEA 480

Query: 481 LELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVD 540
           LELFSEME +GVEPNDITFIS+FHACSHSGLVV+GKKHFNRMVH FGIVPK+EHYGCLVD
Sbjct: 481 LELFSEMEGYGVEPNDITFISVFHACSHSGLVVDGKKHFNRMVHDFGIVPKIEHYGCLVD 540

Query: 541 LLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGY 600
           LLGRAGHL+EAHNIIENMPM+PNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGY
Sbjct: 541 LLGRAGHLDEAHNIIENMPMKPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGY 600

Query: 601 SVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKV 660
           SVLKSNIYASAKRW DVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTT+V
Sbjct: 601 SVLKSNIYASAKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTEV 660

Query: 661 YEMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRI 720
           YEMV EMCIKLRE GYTPNT+ VLLN++EEEKES LSYHSEKLAMAFGLISTAPGTPIRI
Sbjct: 661 YEMVTEMCIKLRETGYTPNTSAVLLNVEEEEKESTLSYHSEKLAMAFGLISTAPGTPIRI 720

Query: 721 IKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766
           +KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEG+CSCLGYW
Sbjct: 721 VKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGFCSCLGYW 767

BLAST of IVF0008841 vs. NCBI nr
Match: KAG6590282.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1334 bits (3452), Expect = 0.0
Identity = 652/766 (85.12%), Postives = 704/766 (91.91%), Query Frame = 0

Query: 1   MNQLILSSPSSSGCSGYSHLNLQQTHQLHAHFIKTQFHNPHPFFSQSHFTPEANYNLLIS 60
           M+QLILS+ S SG SG+SHLNLQQTHQ+HAH IKTQF NPH FFS+SHFTPEAN+NLLIS
Sbjct: 1   MDQLILSAASPSG-SGHSHLNLQQTHQIHAHCIKTQFRNPHSFFSRSHFTPEANFNLLIS 60

Query: 61  SYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKNGF 120
           SYT+NHLPQA+ N Y HMRT DAAA +DNFI+PSLLKACAQASS + GRE+HGFA KNGF
Sbjct: 61  SYTDNHLPQAAFNLYHHMRTTDAAA-VDNFIVPSLLKACAQASSTNFGREVHGFAVKNGF 120

Query: 121 ASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLVRE 180
            SDVFVCNALMNMYEKCG LVSA LVFDKMP+RDVVSWSTMLGCYVRSK+FGEA RLVRE
Sbjct: 121 VSDVFVCNALMNMYEKCGSLVSACLVFDKMPDRDVVSWSTMLGCYVRSKSFGEAYRLVRE 180

Query: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYCKC 240
           M FVGVKLS VALIS+IGVFG L DMKSGRA+HGY+VRNVG+E++E+ LTTALIDMYCK 
Sbjct: 181 MHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKG 240

Query: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300
           + LASA RLFD LS+R+VVSWTV+I GCIRSCR VEGAKNF+RMLEE + PNEITLLSLI
Sbjct: 241 DKLASAMRLFDGLSQRNVVSWTVLIAGCIRSCRFVEGAKNFSRMLEENIVPNEITLLSLI 300

Query: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360
           TECGFV  LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGVE+KDVK
Sbjct: 301 TECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVK 360

Query: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420
           IWSALISAYAH SC+DQ F+LFL+MLD+EVKPNKVTMVSLLSLCAE G LDLG+WTHAYI
Sbjct: 361 IWSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI 420

Query: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480
           NRHG+EVD +LETALINMY KCGD+  AR LFDEAT+RDIHMWNAMMAGFS+HGCGKEAL
Sbjct: 421 NRHGVEVDAVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEAL 480

Query: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDL 540
           ELFS+M  HGVEPNDITFIS+FHACSHSGLV EG KHF+RMVH FGIVPK+EHYGCLVDL
Sbjct: 481 ELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDL 540

Query: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600
           LGRA  L+ AH+IIENMPMRPNTI+WGALLAACKLHKNLALGEVAARKILELDP+NCGY 
Sbjct: 541 LGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYR 600

Query: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660
           VLKSNIYAS KRW DVTSVRETMSHLGMKKEPGLSWIEVNGSVHHF+SGDKTCTQT KV+
Sbjct: 601 VLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVH 660

Query: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720
           EMV EMCIKLREAGY PNT+ VLLN+++EEKESALSYHSEKLAMAFGLISTAPGTPIRII
Sbjct: 661 EMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSCLGYW
Sbjct: 721 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of IVF0008841 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 548.9 bits (1413), Expect = 6.5e-156
Identity = 275/712 (38.62%), Postives = 424/712 (59.55%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGF 114
           Y+ ++  +        +L  ++ MR +D    + NF    LLK C   +   +G+E+HG 
Sbjct: 103 YHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTY--LLKVCGDEAELRVGKEIHGL 162

Query: 115 AQKNGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEA 174
             K+GF+ D+F    L NMY KC  +  A  VFD+MPERD+VSW+T++  Y ++     A
Sbjct: 163 LVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMA 222

Query: 175 LRLVREMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALI 234
           L +V+ M    +K S + ++S++     L  +  G+ +HGY +R+  D    V+++TAL+
Sbjct: 223 LEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSL--VNISTALV 282

Query: 235 DMYCKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEI 294
           DMY KC  L +A++LFD + +R+VVSW  MI   +++    E    F +ML+E + P ++
Sbjct: 283 DMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDV 342

Query: 295 TLLSLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV 354
           +++  +  C  +  L+ G++ H   +  G   ++++V +LI MY KC +V  A ++F  +
Sbjct: 343 SVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKL 402

Query: 355 EKKDVKIWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGK 414
           + + +  W+A+I  +A         N F +M    VKP+  T VS+++  AE       K
Sbjct: 403 QSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAK 462

Query: 415 WTHAYINRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHG 474
           W H  + R  L+ +V + TAL++MY KCG + IAR +FD  ++R +  WNAM+ G+  HG
Sbjct: 463 WIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHG 522

Query: 475 CGKEALELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHY 534
            GK ALELF EM+   ++PN +TF+S+  ACSHSGLV  G K F  M  ++ I   M+HY
Sbjct: 523 FGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHY 582

Query: 535 GCLVDLLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDP 594
           G +VDLLGRAG L EA + I  MP++P   ++GA+L AC++HKN+   E AA ++ EL+P
Sbjct: 583 GAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNP 642

Query: 595 QNCGYSVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCT 654
            + GY VL +NIY +A  W  V  VR +M   G++K PG S +E+   VH F SG     
Sbjct: 643 DDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHP 702

Query: 655 QTTKVYEMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPG 714
            + K+Y  + ++   ++EAGY P+T  ++L ++ + KE  LS HSEKLA++FGL++T  G
Sbjct: 703 DSKKIYAFLEKLICHIKEAGYVPDT-NLVLGVENDVKEQLLSTHSEKLAISFGLLNTTAG 762

Query: 715 TPIRIIKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           T I + KNLR+C DCH A K +S +  R I+VRD  RFHHF  G CSC  YW
Sbjct: 763 TTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of IVF0008841 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 541.2 bits (1393), Expect = 1.3e-153
Identity = 294/771 (38.13%), Postives = 445/771 (57.72%), Query Frame = 0

Query: 20  LNLQQTHQLHAHFIKT-QFHNPH---PFFSQSHFT---------------PEAN---YNL 79
           ++L+Q  Q H H I+T  F +P+     F+ +  +               P+ N   +N 
Sbjct: 41  VSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNT 100

Query: 80  LISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQK 139
           LI +Y +   P  S+  +L M  +++    + +  P L+KA A+ SS  LG+ LHG A K
Sbjct: 101 LIRAYASGPDPVLSIWAFLDM-VSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVK 160

Query: 140 NGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRL 199
           +   SDVFV N+L++ Y  CG L SA  VF  + E+DVVSW++M+  +V+  +  +AL L
Sbjct: 161 SAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALEL 220

Query: 200 VREMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMY 259
            ++M+   VK S V ++ ++     + +++ GR V  YI  N     + ++L  A++DMY
Sbjct: 221 FKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEEN--RVNVNLTLANAMLDMY 280

Query: 260 CKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLL 319
            KC  +  A+RLFD + ++  V+WT M+ G                              
Sbjct: 281 TKCGSIEDAKRLFDAMEEKDNVTWTTMLDG------------------------------ 340

Query: 320 SLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKK 379
                                                   Y        AR + N + +K
Sbjct: 341 ----------------------------------------YAISEDYEAAREVLNSMPQK 400

Query: 380 DVKIWSALISAYAHVSCMDQVFNLFLEM-LDNEVKPNKVTMVSLLSLCAEAGTLDLGKWT 439
           D+  W+ALISAY      ++   +F E+ L   +K N++T+VS LS CA+ G L+LG+W 
Sbjct: 401 DIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWI 460

Query: 440 HAYINRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCG 499
           H+YI +HG+ ++  + +ALI+MY KCGD+  +R +F+   +RD+ +W+AM+ G +MHGCG
Sbjct: 461 HSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCG 520

Query: 500 KEALELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGC 559
            EA+++F +M+   V+PN +TF ++F ACSH+GLV E +  F++M  ++GIVP+ +HY C
Sbjct: 521 NEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYAC 580

Query: 560 LVDLLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQN 619
           +VD+LGR+G+LE+A   IE MP+ P+T +WGALL ACK+H NL L E+A  ++LEL+P+N
Sbjct: 581 IVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRN 640

Query: 620 CGYSVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQT 679
            G  VL SNIYA   +W +V+ +R+ M   G+KKEPG S IE++G +H F SGD     +
Sbjct: 641 DGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMS 700

Query: 680 TKVYEMVAEMCIKLREAGYTPNTAEVLLNIDEEE-KESALSYHSEKLAMAFGLISTAPGT 739
            KVY  + E+  KL+  GY P  ++VL  I+EEE KE +L+ HSEKLA+ +GLIST    
Sbjct: 701 EKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPK 738

Query: 740 PIRIIKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
            IR+IKNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CSC  +W
Sbjct: 761 VIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of IVF0008841 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 537.0 bits (1382), Expect = 2.5e-152
Identity = 289/748 (38.64%), Postives = 432/748 (57.75%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGF 114
           YN LI  Y ++ L   ++  +L M   ++  + D +  P  L ACA++ +   G ++HG 
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRMM--NSGISPDKYTFPFGLSACAKSRAKGNGIQIHGL 161

Query: 115 AQKNGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEA 174
             K G+A D+FV N+L++ Y +CG L SA  VFD+M ER+VVSW++M+  Y R     +A
Sbjct: 162 IVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDA 221

Query: 175 L----RLVREMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLT 234
           +    R+VR+ +   V  + V ++ +I     L D+++G  V+ +I RN G E  ++ + 
Sbjct: 222 VDLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAFI-RNSGIEVNDL-MV 281

Query: 235 TALIDMYCKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLF 294
           +AL+DMY KC  +  A+RLFD     ++     M    +R     E    FN M++  + 
Sbjct: 282 SALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVR 341

Query: 295 PNEITLLSLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKC--------- 354
           P+ I++LS I+ C  ++ +  GK  H Y+LRNGF     +  ALIDMY KC         
Sbjct: 342 PDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRI 401

Query: 355 ----------------------GQVGYARALFNGVEKKDVKIWSALISAYAHVSCMDQVF 414
                                 G+V  A   F  + +K++  W+ +IS     S  ++  
Sbjct: 402 FDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAI 461

Query: 415 NLFLEMLDNE-VKPNKVTMVSLLSLCAEAGTLDLGKWTHAYINRHGLEVDVILETALINM 474
            +F  M   E V  + VTM+S+ S C   G LDL KW + YI ++G+++DV L T L++M
Sbjct: 462 EVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDM 521

Query: 475 YVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEALELFSEMESHGVEPNDITF 534
           + +CGD   A S+F+  T RD+  W A +   +M G  + A+ELF +M   G++P+ + F
Sbjct: 522 FSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAF 581

Query: 535 ISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDLLGRAGHLEEAHNIIENMP 594
           +    ACSH GLV +GK+ F  M+   G+ P+  HYGC+VDLLGRAG LEEA  +IE+MP
Sbjct: 582 VGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMP 641

Query: 595 MRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTS 654
           M PN +IW +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+  
Sbjct: 642 MEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAK 701

Query: 655 VRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVYEMVAEMCIKLREAGYTPN 714
           VR +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P+
Sbjct: 702 VRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVPD 761

Query: 715 TAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAAMKLLSK 767
            + VL+++DE+EK   LS HSEKLAMA+GLIS+  GT IRI+KNLR+C DCH+  K  SK
Sbjct: 762 LSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFASK 821

BLAST of IVF0008841 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 533.9 bits (1374), Expect = 2.2e-151
Identity = 288/744 (38.71%), Postives = 430/744 (57.80%), Query Frame = 0

Query: 55  YNLLISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGF 114
           YN LI  Y ++ L   ++  +L M   ++  + D +  P  L ACA++ +   G ++HG 
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRMM--NSGISPDKYTFPFGLSACAKSRAKGNGIQIHGL 161

Query: 115 AQKNGFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEA 174
             K G+A D+FV N+L++ Y +CG L SA  VFD+M ER+VVSW++M+  Y R     +A
Sbjct: 162 IVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDA 221

Query: 175 L----RLVREMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLT 234
           +    R+VR+ +   V  + V ++ +I     L D+++G  V+ +I RN G E  ++ + 
Sbjct: 222 VDLFFRMVRDEE---VTPNSVTMVCVISACAKLEDLETGEKVYAFI-RNSGIEVNDL-MV 281

Query: 235 TALIDMYCKCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLF 294
           +AL+DMY KC  +  A+RLFD     ++     M    +R     E    FN M++  + 
Sbjct: 282 SALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVR 341

Query: 295 PNEITLLSLITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKC--------- 354
           P+ I++LS I+ C  ++ +  GK  H Y+LRNGF     +  ALIDMY KC         
Sbjct: 342 PDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRI 401

Query: 355 ----------------------GQVGYARALFNGVEKKDVKIWSALISAYAHVSCMDQVF 414
                                 G+V  A   F  + +K++  W+ +IS     S  ++  
Sbjct: 402 FDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAI 461

Query: 415 NLFLEMLDNE-VKPNKVTMVSLLSLCAEAGTLDLGKWTHAYINRHGLEVDVILETALINM 474
            +F  M   E V  + VTM+S+ S C   G LDL KW + YI ++G+++DV L T L++M
Sbjct: 462 EVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDM 521

Query: 475 YVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEALELFSEMESHGVEPNDITF 534
           + +CGD   A S+F+  T RD+  W A +   +M G  + A+ELF +M   G++P+ + F
Sbjct: 522 FSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAF 581

Query: 535 ISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGCLVDLLGRAGHLEEAHNIIENMP 594
           +    ACSH GLV +GK+ F  M+   G+ P+  HYGC+VDLLGRAG LEEA  +IE+MP
Sbjct: 582 VGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMP 641

Query: 595 MRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTS 654
           M PN +IW +LLAAC++  N+ +   AA KI  L P+  G  VL SN+YASA RWND+  
Sbjct: 642 MEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAK 701

Query: 655 VRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVYEMVAEMCIKLREAGYTPN 714
           VR +M   G++K PG S I++ G  H F SGD++  +   +  M+ E+  +    G+ P+
Sbjct: 702 VRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVPD 761

Query: 715 TAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRIIKNLRICDDCHAAMKLLSK 763
            + VL+++DE+EK   LS HSEKLAMA+GLIS+  GT IRI+KNLR+C DCH+  K  SK
Sbjct: 762 LSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFASK 821

BLAST of IVF0008841 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 528.1 bits (1359), Expect = 1.2e-149
Identity = 288/770 (37.40%), Postives = 424/770 (55.06%), Query Frame = 0

Query: 22  LQQTHQLHAHFIKTQFHNPHPFFSQ--------SHF------------TPEAN---YNLL 81
           LQ    +HA  IK   HN +   S+         HF              E N   +N +
Sbjct: 46  LQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTM 105

Query: 82  ISSYTNNHLPQASLNCYLHMRTNDAAAALDNFILPSLLKACAQASSADLGRELHGFAQKN 141
              +  +  P ++L  Y+ M +       +++  P +LK+CA++ +   G+++HG   K 
Sbjct: 106 FRGHALSSDPVSALKLYVCMIS--LGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKL 165

Query: 142 GFASDVFVCNALMNMYEKCGCLVSASLVFDKMPERDVVSWSTMLGCYVRSKAFGEALRLV 201
           G   D++V  +L++MY + G L  A  VFDK P RDVVS++ ++                
Sbjct: 166 GCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALI---------------- 225

Query: 202 REMQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSLTTALIDMYC 261
                                         G A  GYI                      
Sbjct: 226 -----------------------------KGYASRGYI---------------------- 285

Query: 262 KCECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLS 321
                 +AQ+LFD +  + VVSW  MI G   +    E  + F  M++  + P+E T+++
Sbjct: 286 -----ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVT 345

Query: 322 LITECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKD 381
           +++ C    +++LG+  H ++  +GFG +L +V ALID+Y KCG++  A  LF  +  KD
Sbjct: 346 VVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKD 405

Query: 382 VKIWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHA 441
           V  W+ LI  Y H++   +   LF EML +   PN VTM+S+L  CA  G +D+G+W H 
Sbjct: 406 VISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHV 465

Query: 442 YINRH--GLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCG 501
           YI++   G+     L T+LI+MY KCGD+  A  +F+    + +  WNAM+ GF+MHG  
Sbjct: 466 YIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRA 525

Query: 502 KEALELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHSFGIVPKMEHYGC 561
             + +LFS M   G++P+DITF+ +  ACSHSG++  G+  F  M   + + PK+EHYGC
Sbjct: 526 DASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGC 585

Query: 562 LVDLLGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQN 621
           ++DLLG +G  +EA  +I  M M P+ +IW +LL ACK+H N+ LGE  A  +++++P+N
Sbjct: 586 MIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPEN 645

Query: 622 CGYSVLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQT 681
            G  VL SNIYASA RWN+V   R  ++  GMKK PG S IE++  VH F  GDK   + 
Sbjct: 646 PGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRN 705

Query: 682 TKVYEMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTP 741
            ++Y M+ EM + L +AG+ P+T+EVL  ++EE KE AL +HSEKLA+AFGLIST PGT 
Sbjct: 706 REIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTK 741

Query: 742 IRIIKNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 767
           + I+KNLR+C +CH A KL+SKIY R II RDR RFHHF +G CSC  YW
Sbjct: 766 LTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q3E6Q19.1e-15538.62Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
O823801.9e-15238.13Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9LUJ23.6e-15138.64Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX... [more]
Q9LN011.7e-14837.40Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9SN393.7e-14838.13Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Match NameE-valueIdentityDescription
A0A5A7V2V90.0e+00100.00Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3CJ580.0e+0099.74pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc... [more]
A0A0A0LYC20.0e+0093.99DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G6902... [more]
A0A6J1HA740.0e+0084.99pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc... [more]
A0A6J1JKG90.0e+0083.68pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita maxima O... [more]
Match NameE-valueIdentityDescription
KAA0062552.10.0100.00pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK28774... [more]
XP_008462708.10.099.74PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-... [more]
XP_011660280.10.093.99pentatricopeptide repeat-containing protein At3g26782, mitochondrial [Cucumis sa... [more]
XP_038879151.10.088.79pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benin... [more]
KAG6590282.10.085.12Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
AT1G11290.16.5e-15638.62Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.11.3e-15338.13Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G22690.22.5e-15238.64INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro... [more]
AT3G22690.12.2e-15138.71CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012... [more]
AT1G08070.11.2e-14937.40Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 221..306
e-value: 9.5E-16
score: 59.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 50..220
e-value: 7.8E-26
score: 93.2
coord: 432..674
e-value: 5.2E-41
score: 143.0
coord: 307..431
e-value: 1.0E-20
score: 76.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 459..506
e-value: 1.7E-12
score: 47.3
coord: 357..404
e-value: 1.3E-7
score: 31.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 156..188
e-value: 1.7E-4
score: 19.5
coord: 462..494
e-value: 2.8E-8
score: 31.5
coord: 259..293
e-value: 2.6E-4
score: 18.9
coord: 361..393
e-value: 1.4E-4
score: 19.8
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 156..183
e-value: 6.0E-4
score: 19.9
coord: 259..287
e-value: 0.14
score: 12.5
coord: 533..558
e-value: 0.22
score: 11.8
coord: 231..256
e-value: 0.089
score: 13.1
coord: 433..458
e-value: 0.037
score: 14.3
coord: 126..154
e-value: 0.0058
score: 16.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 154..188
score: 10.676364
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 257..291
score: 9.415814
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 459..493
score: 12.474017
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 358..392
score: 10.117337
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 632..755
e-value: 2.6E-38
score: 130.8
NoneNo IPR availablePANTHERPTHR47924:SF46PENTATRICOPEPTIDE REPEAT PROTEIN-RELATEDcoord: 35..408
coord: 229..678
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 35..408
coord: 229..678

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
IVF0008841.1IVF0008841.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding