IVF0022277 (gene) Melon (IVF77) v1

Overview
NameIVF0022277
Typegene
OrganismCucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationchr04: 31028768 .. 31030554 (-)
RNA-Seq ExpressionIVF0022277
SyntenyIVF0022277
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACTACGGTGCTTACGGTCGCCTTATCCAGCACTGCACTGACCACCTCTTCTTCCGCGTCGGTAAGCAGCTTCACGCTCGTCTTGTTCTATCATCCGTAGCTCCCGATAACTTCCTCGGATCGAAACTCATCAGCTTCTACTCAAAATCCGGTAGCCTTCGAGATGCCTACAATGTGTTTGGTAAAATTCCTCGCAAAAACATTTTCTCTTGGAATGCTTTGTTTATCAGCTACACTCTTCACAATATGCACACTGATCTGCTGAAGCTGTTTTTGTCTTTGGTTAACTCAAATTCGACGGATGTGAAGCCTGATAGGTTTACGGTTACTTGTGTTTTGAAAGCGTTGGCGTCTTTGTTTTCTAATTCGGTTTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGGGAGCTCGAGTCTGATATTTTTGTTGTCAATGCTTTGATTACTTTTTACTCGAGGTGTGATGAGCTGGTTTTAGCGAGAATTATGTTTGATAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCGATGTTGGCTGGGTATTCTCAGGGTGGATCCTATGAGAAGTGCAAGGAGCTATTTAGAGTGATGTCGAGTTCACTGGAGGTGAAGCCTAATGCATTAACCGCAGTCAGTGTTTTGCAAGCTTGTGCTCAGTCCAACGATCTAATTTTGGAATGGAGGTTCATAGATTTGTTAATGAAAGCCAGATTAAAATGGATGTTTCACTATGGAATGCTGTTATTGGATTATATGCGAAGTGTGGTAGCTTGGATTATGCTCGGGAGCTGTTTGAAGAAATGCCGGAGAAGGATGGGATCACCTATTGCTCCATGATATCAGGCTACATGGTTCATGGTTTTGTTAACCAAGCAATGGATCTTTTTCGAGAACTGGAAAGGCCAAGGTTGCCAACATGGAATGCTGTTATTTCCGGTCTGGTTCAGAACAACCGGCAAGATGGAGCTCTAGATATATTTCGAGCAATGCAGTCACATGGTTGCAGACCAAATACTGTGACACTTGCGAGCATTCTTCCCATTTTCTCACATTTTTCAACCCTTAAAGGTGGGAAAGAAATTCATGGTTATGCCATTAGAAACACTTATGATGGGAATATTTTTGTTGCTACTGCCATCATTGATTCTTATGCTAAGTGTGGTTACCTCCAAGGGGCTCGACAAGTTTTTGATCAATTAAAAGGTAGGAGTCTTATAGCTTGGACATCAATAATCTCAGCATATGCTGTACATGGAGATGCCAATGTGGCTCTTAGTCTTTTCTATGAGATGCTGACATATGGGATTCAGCCTGACCAGGTAACCTTTACATCAGTATTGGCTGCCTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAATATCTTGTTACCAGACTATGGGATTCAACCACTAGTTGAGCATTATGCTTGCATGGTAGGAGTCCTTAGTCGAGCAGGAAAGCTCTCTGATGCTGTTGAATTTATTTCTAAAATGCCATTGGAACCCAATGCAAAAGTTTGGGGTGCTCTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTATGTTTTTGATCGTCTCTTTGAGATTGAGCCTGGAAATACTGGTAACTACGTCATAATGGCTAACTTATATTCACAATCTGGAAGGTGGAAAGAGGCCGACACAATTAGGGATTTGATGAAGGAAGTTAGATTGAAGAAGATCCCGGGAAATAGCTGGATAGAAACAAGGGGAGGGTTGCAGAGTTTTTGCTAG

mRNA sequence

ATGAACTACGGTGCTTACGGTCGCCTTATCCAGCACTGCACTGACCACCTCTTCTTCCGCGTCGGTAAGCAGCTTCACGCTCGTCTTGTTCTATCATCCGTAGCTCCCGATAACTTCCTCGGATCGAAACTCATCAGCTTCTACTCAAAATCCGGTAGCCTTCGAGATGCCTACAATGTGTTTGGTAAAATTCCTCGCAAAAACATTTTCTCTTGGAATGCTTTGTTTATCAGCTACACTCTTCACAATATGCACACTGATCTGCTGAAGCTGTTTTTGTCTTTGGTTAACTCAAATTCGACGGATGTGAAGCCTGATAGGTTTACGGTTACTTGTGTTTTGAAAGCGTTGGCGTCTTTGTTTTCTAATTCGGTTTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGGGAGCTCGAGTCTGATATTTTTGTTGTCAATGCTTTGATTACTTTTTACTCGAGGTGTGATGAGCTGGTTTTAGCGAGAATTATGTTTGATAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCGATGTTGGCTGGGTATTCTCAGGGTGGATCCTATGAGAAGTGCAAGGAGCTATTTAGAGTGATGTCGAGTTCACTGGAGGTGAAGCCTAATGCATTAACCGCAGTCAGTGTTTTGCAAGCTTGTGCTCAATTTGTTAATGAAAGCCAGATTAAAATGGATGTTTCACTATGGAATGCTGTTATTGGATTATATGCGAAGTGTGGTAGCTTGGATTATGCTCGGGAGCTGTTTGAAGAAATGCCGGAGAAGGATGGGATCACCTATTGCTCCATGATATCAGGCTACATGGTTCATGGTTTTGTTAACCAAGCAATGGATCTTTTTCGAGAACTGGAAAGGCCAAGGTTGCCAACATGGAATGCTGTTATTTCCGGTCTGGTTCAGAACAACCGGCAAGATGGAGCTCTAGATATATTTCGAGCAATGCAGTCACATGGTTGCAGACCAAATACTGTGACACTTGCGAGCATTCTTCCCATTTTCTCACATTTTTCAACCCTTAAAGGTGGGAAAGAAATTCATGGTTATGCCATTAGAAACACTTATGATGGGAATATTTTTGTTGCTACTGCCATCATTGATTCTTATGCTAAGTGTGGTTACCTCCAAGGGGCTCGACAAGTTTTTGATCAATTAAAAGGTAGGAGTCTTATAGCTTGGACATCAATAATCTCAGCATATGCTGTACATGGAGATGCCAATGTGGCTCTTAGTCTTTTCTATGAGATGCTGACATATGGGATTCAGCCTGACCAGGTAACCTTTACATCAGTATTGGCTGCCTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAATATCTTGTTACCAGACTATGGGATTCAACCACTAGTTGAGCATTATGCTTGCATGGTAGGAGTCCTTAGTCGAGCAGGAAAGCTCTCTGATGCTGTTGAATTTATTTCTAAAATGCCATTGGAACCCAATGCAAAAGTTTGGGGTGCTCTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTATGTTTTTGATCGTCTCTTTGAGATTGAGCCTGGAAATACTGGTAACTACGTCATAATGGCTAACTTATATTCACAATCTGGAAGGTGGAAAGAGGCCGACACAATTAGGGATTTGATGAAGGAAGTTAGATTGAAGAAGATCCCGGGAAATAGCTGGATAGAAACAAGGGGAGGGTTGCAGAGTTTTTGCTAG

Coding sequence (CDS)

ATGAACTACGGTGCTTACGGTCGCCTTATCCAGCACTGCACTGACCACCTCTTCTTCCGCGTCGGTAAGCAGCTTCACGCTCGTCTTGTTCTATCATCCGTAGCTCCCGATAACTTCCTCGGATCGAAACTCATCAGCTTCTACTCAAAATCCGGTAGCCTTCGAGATGCCTACAATGTGTTTGGTAAAATTCCTCGCAAAAACATTTTCTCTTGGAATGCTTTGTTTATCAGCTACACTCTTCACAATATGCACACTGATCTGCTGAAGCTGTTTTTGTCTTTGGTTAACTCAAATTCGACGGATGTGAAGCCTGATAGGTTTACGGTTACTTGTGTTTTGAAAGCGTTGGCGTCTTTGTTTTCTAATTCGGTTTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGGGAGCTCGAGTCTGATATTTTTGTTGTCAATGCTTTGATTACTTTTTACTCGAGGTGTGATGAGCTGGTTTTAGCGAGAATTATGTTTGATAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCGATGTTGGCTGGGTATTCTCAGGGTGGATCCTATGAGAAGTGCAAGGAGCTATTTAGAGTGATGTCGAGTTCACTGGAGGTGAAGCCTAATGCATTAACCGCAGTCAGTGTTTTGCAAGCTTGTGCTCAATTTGTTAATGAAAGCCAGATTAAAATGGATGTTTCACTATGGAATGCTGTTATTGGATTATATGCGAAGTGTGGTAGCTTGGATTATGCTCGGGAGCTGTTTGAAGAAATGCCGGAGAAGGATGGGATCACCTATTGCTCCATGATATCAGGCTACATGGTTCATGGTTTTGTTAACCAAGCAATGGATCTTTTTCGAGAACTGGAAAGGCCAAGGTTGCCAACATGGAATGCTGTTATTTCCGGTCTGGTTCAGAACAACCGGCAAGATGGAGCTCTAGATATATTTCGAGCAATGCAGTCACATGGTTGCAGACCAAATACTGTGACACTTGCGAGCATTCTTCCCATTTTCTCACATTTTTCAACCCTTAAAGGTGGGAAAGAAATTCATGGTTATGCCATTAGAAACACTTATGATGGGAATATTTTTGTTGCTACTGCCATCATTGATTCTTATGCTAAGTGTGGTTACCTCCAAGGGGCTCGACAAGTTTTTGATCAATTAAAAGGTAGGAGTCTTATAGCTTGGACATCAATAATCTCAGCATATGCTGTACATGGAGATGCCAATGTGGCTCTTAGTCTTTTCTATGAGATGCTGACATATGGGATTCAGCCTGACCAGGTAACCTTTACATCAGTATTGGCTGCCTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAATATCTTGTTACCAGACTATGGGATTCAACCACTAGTTGAGCATTATGCTTGCATGGTAGGAGTCCTTAGTCGAGCAGGAAAGCTCTCTGATGCTGTTGAATTTATTTCTAAAATGCCATTGGAACCCAATGCAAAAGTTTGGGGTGCTCTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTATGTTTTTGATCGTCTCTTTGAGATTGAGCCTGGAAATACTGGTAACTACGTCATAATGGCTAACTTATATTCACAATCTGGAAGGTGGAAAGAGGCCGACACAATTAGGGATTTGATGAAGGAAGTTAGATTGAAGAAGATCCCGGGAAATAGCTGGATAGAAACAAGGGGAGGGTTGCAGAGTTTTTGCTAG

Protein sequence

MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNVFGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAMLAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQFVNESQIKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFRELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKGGKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAVHGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEIEPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC
Homology
BLAST of IVF0022277 vs. ExPASy Swiss-Prot
Match: Q9ZUT5 (Pentatricopeptide repeat-containing protein At2g37310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E49 PE=2 SV=1)

HSP 1 Score: 682.2 bits (1759), Expect = 5.2e-195
Identity = 339/596 (56.88%), Postives = 436/596 (73.15%), Query Frame = 0

Query: 4   GAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNVFGK 63
           GAYG LIQH T H       QLHAR+V+ S+ PDNFL SKLISFY++    R A +VF +
Sbjct: 23  GAYGHLIQHFTRHRLPLHVLQLHARIVVFSIKPDNFLASKLISFYTRQDRFRQALHVFDE 82

Query: 64  IPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNS---NSTDVKPDRFTVTCVLKALASL 123
           I  +N FS+NAL I+YT   M+ D   LFLS + S   +S   +PD  +++CVLKAL+  
Sbjct: 83  ITVRNAFSYNALLIAYTSREMYFDAFSLFLSWIGSSCYSSDAARPDSISISCVLKALSGC 142

Query: 124 --FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWN 183
             F    LA++VH F++R   +SD+FV N +IT+Y++CD +  AR +FD M ERD+VSWN
Sbjct: 143 DDFWLGSLARQVHGFVIRGGFDSDVFVGNGMITYYTKCDNIESARKVFDEMSERDVVSWN 202

Query: 184 AMLAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQFVN------------E 243
           +M++GYSQ GS+E CK++++ M +  + KPN +T +SV QAC Q  +            E
Sbjct: 203 SMISGYSQSGSFEDCKKMYKAMLACSDFKPNGVTVISVFQACGQSSDLIFGLEVHKKMIE 262

Query: 244 SQIKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDL 303
           + I+MD+SL NAVIG YAKCGSLDYAR LF+EM EKD +TY ++ISGYM HG V +AM L
Sbjct: 263 NHIQMDLSLCNAVIGFYAKCGSLDYARALFDEMSEKDSVTYGAIISGYMAHGLVKEAMAL 322

Query: 304 FRELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTL 363
           F E+E   L TWNA+ISGL+QNN  +  ++ FR M   G RPNTVTL+S+LP  ++ S L
Sbjct: 323 FSEMESIGLSTWNAMISGLMQNNHHEEVINSFREMIRCGSRPNTVTLSSLLPSLTYSSNL 382

Query: 364 KGGKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAY 423
           KGGKEIH +AIRN  D NI+V T+IID+YAK G+L GA++VFD  K RSLIAWT+II+AY
Sbjct: 383 KGGKEIHAFAIRNGADNNIYVTTSIIDNYAKLGFLLGAQRVFDNCKDRSLIAWTAIITAY 442

Query: 424 AVHGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPL 483
           AVHGD++ A SLF +M   G +PD VT T+VL+A AHSG+ D A  IF+ +L  Y I+P 
Sbjct: 443 AVHGDSDSACSLFDQMQCLGTKPDDVTLTAVLSAFAHSGDSDMAQHIFDSMLTKYDIEPG 502

Query: 484 VEHYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLF 543
           VEHYACMV VLSRAGKLSDA+EFISKMP++P AKVWGALLNGASV GD+E+ ++  DRLF
Sbjct: 503 VEHYACMVSVLSRAGKLSDAMEFISKMPIDPIAKVWGALLNGASVLGDLEIARFACDRLF 562

Query: 544 EIEPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 583
           E+EP NTGNY IMANLY+Q+GRW+EA+ +R+ MK + LKKIPG SWIET  GL+SF
Sbjct: 563 EMEPENTGNYTIMANLYTQAGRWEEAEMVRNKMKRIGLKKIPGTSWIETEKGLRSF 618

BLAST of IVF0022277 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 385.2 bits (988), Expect = 1.3e-105
Identity = 206/589 (34.97%), Postives = 336/589 (57.05%), Query Frame = 0

Query: 9   LIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISF--YSKSGSLRDAYNVFGKIPR 68
           LI+ C      R  KQ H  ++ +    D +  SKL +    S   SL  A  VF +IP+
Sbjct: 36  LIERCVS---LRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPK 95

Query: 69  KNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVL 128
            N F+WN L  +Y         +  FL +V  + +   P+++T   ++KA A + S S L
Sbjct: 96  PNSFAWNTLIRAYASGPDPVLSIWAFLDMV--SESQCYPNKYTFPFLIKAAAEVSSLS-L 155

Query: 129 AKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAMLAGYSQ 188
            + +H   ++  + SD+FV N+LI  Y  C +L  A  +F  + E+D+VSWN+M+ G+ Q
Sbjct: 156 GQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 215

Query: 189 GGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQIKMDVS 248
            GS +K  ELF+ M S  +VK + +T V VL ACA+            ++ E+++ ++++
Sbjct: 216 KGSPDKALELFKKMESE-DVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLT 275

Query: 249 LWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFRELERPR 308
           L NA++ +Y KCGS++ A+ LF+ M EKD +T+ +M+ GY +      A ++   + +  
Sbjct: 276 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 335

Query: 309 LPTWNAVISGLVQNNRQDGALDIFRAMQ-SHGCRPNTVTLASILPIFSHFSTLKGGKEIH 368
           +  WNA+IS   QN + + AL +F  +Q     + N +TL S L   +    L+ G+ IH
Sbjct: 336 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 395

Query: 369 GYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAVHGDAN 428
            Y  ++    N  V +A+I  Y+KCG L+ +R+VF+ ++ R +  W+++I   A+HG  N
Sbjct: 396 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 455

Query: 429 VALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVEHYACM 488
            A+ +FY+M    ++P+ VTFT+V  AC+H+G +DEA  +F+ +  +YGI P  +HYAC+
Sbjct: 456 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 515

Query: 489 VGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEIEPGNT 548
           V VL R+G L  AV+FI  MP+ P+  VWGALL    +  ++ L +    RL E+EP N 
Sbjct: 516 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 575

Query: 549 GNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 583
           G +V+++N+Y++ G+W+    +R  M+   LKK PG S IE  G +  F
Sbjct: 576 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEF 617

BLAST of IVF0022277 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 365.5 bits (937), Expect = 1.1e-99
Identity = 210/658 (31.91%), Postives = 339/658 (51.52%), Query Frame = 0

Query: 9   LIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNVFGKIPRKN 68
           ++Q C D    + GK++   +  +    D+ LGSKL   Y+  G L++A  VF ++  + 
Sbjct: 100 VLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEK 159

Query: 69  IFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVLAK 128
              WN L          +  + LF  +++S    V+ D +T +CV K+ +SL S     +
Sbjct: 160 ALFWNILMNELAKSGDFSGSIGLFKKMMSSG---VEMDSYTFSCVSKSFSSLRSVHG-GE 219

Query: 129 EVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAMLAGYSQGG 188
           ++H FIL+        V N+L+ FY +   +  AR +FD M ERD++SWN+++ GY   G
Sbjct: 220 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 279

Query: 189 SYEKCKELF-RVMSSSLEVKPNALTAVSVLQACA--------QFVNESQIKMDVS----L 248
             EK   +F +++ S +E+  +  T VSV   CA        + V+   +K   S     
Sbjct: 280 LAEKGLSVFVQMLVSGIEI--DLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 339

Query: 249 WNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFRELERPRL 308
            N ++ +Y+KCG LD A+ +F EM ++  ++Y SMI+GY   G   +A+ LF E+E   +
Sbjct: 340 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 399

Query: 309 P----------------------------------------------------------- 368
                                                                       
Sbjct: 400 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 459

Query: 369 -----------TWNAVISGLVQNNRQDGALDIFR-AMQSHGCRPNTVTLASILPIFSHFS 428
                      +WN +I G  +N   + AL +F   ++     P+  T+A +LP  +  S
Sbjct: 460 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 519

Query: 429 TLKGGKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIIS 488
               G+EIHGY +RN Y  +  VA +++D YAKCG L  A  +FD +  + L++WT +I+
Sbjct: 520 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 579

Query: 489 AYAVHGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQ 548
            Y +HG    A++LF +M   GI+ D+++F S+L AC+HSG +DE W+ FNI+  +  I+
Sbjct: 580 GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIE 639

Query: 549 PLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDR 583
           P VEHYAC+V +L+R G L  A  FI  MP+ P+A +WGALL G  +  DV+L + V ++
Sbjct: 640 PTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEK 699

BLAST of IVF0022277 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 363.2 bits (931), Expect = 5.4e-99
Identity = 204/574 (35.54%), Postives = 315/574 (54.88%), Query Frame = 0

Query: 23  KQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNVFGKIPRKNIFSWNALFISYTLH 82
           KQ+HARL++  +    FL +KLI   S  G +  A  VF  +PR  IF WNA+   Y+ +
Sbjct: 38  KQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRN 97

Query: 83  NMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVLAKEVHCFILRRELESD 142
           N   D L ++    N     V PD FT   +LKA + L S+  + + VH  + R   ++D
Sbjct: 98  NHFQDALLMY---SNMQLARVSPDSFTFPHLLKACSGL-SHLQMGRFVHAQVFRLGFDAD 157

Query: 143 IFVVNALITFYSRCDELVLARIMFD--RMPERDIVSWNAMLAGYSQGGSYEKCKELFRVM 202
           +FV N LI  Y++C  L  AR +F+   +PER IVSW A+++ Y+Q G   +  E+F  M
Sbjct: 158 VFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQM 217

Query: 203 SSSLEVKPNALTAVSVLQA--CAQFVNESQ----------IKMDVSLWNAVIGLYAKCGS 262
              ++VKP+ +  VSVL A  C Q + + +          ++++  L  ++  +YAKCG 
Sbjct: 218 -RKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQ 277

Query: 263 LDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFRELERPRLPTWNAVISGLVQN 322
           +  A+ LF++M                               + P L  WNA+ISG  +N
Sbjct: 278 VATAKILFDKM-------------------------------KSPNLILWNAMISGYAKN 337

Query: 323 NRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKGGKEIHGYAIRNTYDGNIFVA 382
                A+D+F  M +   RP+T+++ S +   +   +L+  + ++ Y  R+ Y  ++F++
Sbjct: 338 GYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFIS 397

Query: 383 TAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAVHGDANVALSLFYEMLTYGIQ 442
           +A+ID +AKCG ++GAR VFD+   R ++ W+++I  Y +HG A  A+SL+  M   G+ 
Sbjct: 398 SALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVH 457

Query: 443 PDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVEHYACMVGVLSRAGKLSDAVE 502
           P+ VTF  +L AC HSG + E W  FN  + D+ I P  +HYAC++ +L RAG L  A E
Sbjct: 458 PNDVTFLGLLMACNHSGMVREGWWFFN-RMADHKINPQQQHYACVIDLLGRAGHLDQAYE 517

Query: 503 FISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEIEPGNTGNYVIMANLYSQSGR 562
            I  MP++P   VWGALL+       VELG+Y   +LF I+P NTG+YV ++NLY+ +  
Sbjct: 518 VIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARL 574

Query: 563 WKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 583
           W     +R  MKE  L K  G SW+E RG L++F
Sbjct: 578 WDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAF 574

BLAST of IVF0022277 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 362.1 bits (928), Expect = 1.2e-98
Identity = 206/616 (33.44%), Postives = 338/616 (54.87%), Query Frame = 0

Query: 16  HLFFRVGKQLHARLVLSSV-APDNFLGSKLISFYSKSGSLRDAYNVFGKIPRKNIFSWNA 75
           +++ + G  LHAR +   +     F  + ++S YSK G +      F ++P+++  SW  
Sbjct: 57  NVYSKTGYALHARKLFDEMPLRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTT 116

Query: 76  LFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVLAKEVHCFI 135
           + + Y     +   +++   +V      ++P +FT+T VL ++A+        K+VH FI
Sbjct: 117 MIVGYKNIGQYHKAIRVMGDMVKEG---IEPTQFTLTNVLASVAATRCMET-GKKVHSFI 176

Query: 136 LRRELESDIFVVNALITFYSRCDELVLARIMFDR-------------------------- 195
           ++  L  ++ V N+L+  Y++C + ++A+ +FDR                          
Sbjct: 177 VKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAM 236

Query: 196 -----MPERDIVSWNAMLAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQF 255
                M ERDIV+WN+M++G++Q G   +  ++F  M     + P+  T  SVL ACA  
Sbjct: 237 AQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANL 296

Query: 256 VN-------ESQI---KMDVS--LWNAVIGLYAKCGSLDYARELFEEMPEKD----GITY 315
                     S I     D+S  + NA+I +Y++CG ++ AR L E+   KD    G T 
Sbjct: 297 EKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFT- 356

Query: 316 CSMISGYMVHGFVNQAMDLFRELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCR 375
            +++ GY+  G +NQA ++F  L+   +  W A+I G  Q+     A+++FR+M   G R
Sbjct: 357 -ALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQR 416

Query: 376 PNTVTLASILPIFSHFSTLKGGKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQV 435
           PN+ TLA++L + S  ++L  GK+IHG A+++    ++ V+ A+I  YAK G +  A + 
Sbjct: 417 PNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRA 476

Query: 436 FDQLK-GRSLIAWTSIISAYAVHGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGE 495
           FD ++  R  ++WTS+I A A HG A  AL LF  ML  G++PD +T+  V +AC H+G 
Sbjct: 477 FDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGL 536

Query: 496 LDEAWKIFNILLPDYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALL 555
           +++  + F+++     I P + HYACMV +  RAG L +A EFI KMP+EP+   WG+LL
Sbjct: 537 VNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLL 596

Query: 556 NGASVAGDVELGKYVFDRLFEIEPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKK 583
           +   V  +++LGK   +RL  +EP N+G Y  +ANLYS  G+W+EA  IR  MK+ R+KK
Sbjct: 597 SACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKK 656

BLAST of IVF0022277 vs. ExPASy TrEMBL
Match: A0A5A7TRM4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G002030 PE=4 SV=1)

HSP 1 Score: 1176.8 bits (3043), Expect = 0.0e+00
Identity = 583/595 (97.98%), Postives = 583/595 (97.98%), Query Frame = 0

Query: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
           MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60

Query: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
           FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL
Sbjct: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120

Query: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
           FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM
Sbjct: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180

Query: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
           LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ            FVNESQ
Sbjct: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 240

Query: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
           IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR
Sbjct: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300

Query: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
           ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG
Sbjct: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360

Query: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
           GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV
Sbjct: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420

Query: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
           HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE
Sbjct: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480

Query: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
           HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540

Query: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC 584
           EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC
Sbjct: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC 595

BLAST of IVF0022277 vs. ExPASy TrEMBL
Match: A0A1S4DUQ6 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucumis melo OX=3656 GN=LOC107990300 PE=4 SV=1)

HSP 1 Score: 1176.8 bits (3043), Expect = 0.0e+00
Identity = 583/595 (97.98%), Postives = 583/595 (97.98%), Query Frame = 0

Query: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
           MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60

Query: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
           FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL
Sbjct: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120

Query: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
           FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM
Sbjct: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180

Query: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
           LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ            FVNESQ
Sbjct: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 240

Query: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
           IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR
Sbjct: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300

Query: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
           ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG
Sbjct: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360

Query: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
           GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV
Sbjct: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420

Query: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
           HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE
Sbjct: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480

Query: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
           HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540

Query: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC 584
           EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC
Sbjct: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC 595

BLAST of IVF0022277 vs. ExPASy TrEMBL
Match: A0A0A0LFN1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736590 PE=4 SV=1)

HSP 1 Score: 1112.4 bits (2876), Expect = 0.0e+00
Identity = 551/595 (92.61%), Postives = 562/595 (94.45%), Query Frame = 0

Query: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
           MNY AYGRLIQHCTDHLFFRVGKQLHARLVLSSV PDNFLGSKLISFYSKSGS+RDAYNV
Sbjct: 1   MNYSAYGRLIQHCTDHLFFRVGKQLHARLVLSSVVPDNFLGSKLISFYSKSGSIRDAYNV 60

Query: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
           FGKIPRKNIFSWNAL ISYTLHNMHTDLLKLF SLVNSNSTDVKPDRFTVTC LKALASL
Sbjct: 61  FGKIPRKNIFSWNALLISYTLHNMHTDLLKLFSSLVNSNSTDVKPDRFTVTCALKALASL 120

Query: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
           FSNS LAKEVH FILRR LE DIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM
Sbjct: 121 FSNSGLAKEVHSFILRRGLEYDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180

Query: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
           LAGYSQGGSYEKCKELFRVM SSLEVKPNALTAVSVLQACAQ            FVNESQ
Sbjct: 181 LAGYSQGGSYEKCKELFRVMLSSLEVKPNALTAVSVLQACAQSNDLIFGIEVHRFVNESQ 240

Query: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
           IKMDVSLWNAVIGLYAKCGSLDYARELFEEM EKD ITYCSMISGYMVHGFVNQAMDLFR
Sbjct: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMLEKDAITYCSMISGYMVHGFVNQAMDLFR 300

Query: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
           E ERPRLPTWNAVISGLVQNNRQ+GA+DIFRAMQSHGCRPNTVTLASILP+FSHFSTLKG
Sbjct: 301 EQERPRLPTWNAVISGLVQNNRQEGAVDIFRAMQSHGCRPNTVTLASILPVFSHFSTLKG 360

Query: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
           GKEIHGYAIRNTYD NI+VATAIIDSYAKCGYL GA+ VFDQ+KGRSLIAWTSIISAYAV
Sbjct: 361 GKEIHGYAIRNTYDRNIYVATAIIDSYAKCGYLHGAQLVFDQIKGRSLIAWTSIISAYAV 420

Query: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
           HGDANVALSLFYEMLT GIQPDQVTFTSVLAACAHSGELDEAWKIFN+LLP+YGIQPLVE
Sbjct: 421 HGDANVALSLFYEMLTNGIQPDQVTFTSVLAACAHSGELDEAWKIFNVLLPEYGIQPLVE 480

Query: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
           HYACMVGVLSRAGKLSDAVEFISKMPLEP AKVWGALLNGASVAGDVELGKYVFDRLFEI
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEI 540

Query: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC 584
           EP NTGNYVIMANLYSQSGRWK+ADTIRDLMKEVRLKKIPGNSWIET GG+Q FC
Sbjct: 541 EPENTGNYVIMANLYSQSGRWKDADTIRDLMKEVRLKKIPGNSWIETSGGMQRFC 595

BLAST of IVF0022277 vs. ExPASy TrEMBL
Match: A0A6J1F110 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita moschata OX=3662 GN=LOC111441405 PE=4 SV=1)

HSP 1 Score: 1033.1 bits (2670), Expect = 4.5e-298
Identity = 509/594 (85.69%), Postives = 536/594 (90.24%), Query Frame = 0

Query: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
           MNYGAYGRLIQHCTD  FFR+GKQLHARLVLSSVAPDNFLGSKLI+ YSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 60

Query: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
           F  I  KNIFSWNALFISYTLHNMH D+LKLF SLVN NSTDVKPD+FTVTCVLKALASL
Sbjct: 61  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNVNSTDVKPDKFTVTCVLKALASL 120

Query: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
           F+NS+LAKEVHCF+LRR LESDIFVVNALITFYSRCDEL LARIMFDR PERDIVSWNAM
Sbjct: 121 FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELALARIMFDRTPERDIVSWNAM 180

Query: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACA------------QFVNESQ 240
           +AGYSQGG YE CKELF+ M  S E KPNALTAVSVLQACA            +FVNES 
Sbjct: 181 VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAHSNDLIFGMEVHKFVNESG 240

Query: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
           I+MDVSL+NAVIGLYAKCGSLDYARELFE MPEKD +TY SMISGYMVHGFVNQAMDLFR
Sbjct: 241 IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 300

Query: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
           ELERP L TWNAVISGLVQNN+QDG +DIFRAMQ HGCRPNTVTLAS+LPIFSHFSTLKG
Sbjct: 301 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 360

Query: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
           GKEIH YA+RN YDGNI+VATAIIDSYAK GYLQGARQVFDQLK RSLI WT+IISAYA 
Sbjct: 361 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQLKRRSLIIWTAIISAYAA 420

Query: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
           HGDAN  LSLFYEMLT GI+PD VTFTSVL ACAHSGELDEAWKIFN+LLP++GIQPLVE
Sbjct: 421 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELDEAWKIFNVLLPEFGIQPLVE 480

Query: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
           HYACMVGVLSRAGKLSDAVEFISKMP+EP AKVWGALLNGASVAGDVELGKYVFDRL +I
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 540

Query: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 583
           EP NTGNY+IMANLYSQ GRWKEAD +RDLMKEV LKKIPGNSWIETR GLQSF
Sbjct: 541 EPENTGNYIIMANLYSQFGRWKEADNVRDLMKEVGLKKIPGNSWIETREGLQSF 594

BLAST of IVF0022277 vs. ExPASy TrEMBL
Match: A0A6J1J0S5 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita maxima OX=3661 GN=LOC111482423 PE=4 SV=1)

HSP 1 Score: 1028.5 bits (2658), Expect = 1.1e-296
Identity = 508/594 (85.52%), Postives = 535/594 (90.07%), Query Frame = 0

Query: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
           MNYGAYGRLIQHCTD  FFR+GKQLHARLVLSSVAPDNFLGSKLI+ YSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 60

Query: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
           F  I  KNIFSWNALFISYTLHNMH D+LKLF SLVN NSTDVKPD+FTVTCVLKALASL
Sbjct: 61  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDKFTVTCVLKALASL 120

Query: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
           F+NS+LAKEVHCF+LRR LESDIFVVNALITFYSRCDELVLARIMF R PERDIVSWNAM
Sbjct: 121 FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFHRTPERDIVSWNAM 180

Query: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
           +AGYSQGG YE CKELF+ M  S E KPNALTAVSVLQACAQ            FVNES 
Sbjct: 181 VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLIFGMEVHKFVNESG 240

Query: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
           I+MDVSL+NAVIGLYAKCGSLDYARELFE MPEKD +TY SMISGYMVHGFVNQAMDLFR
Sbjct: 241 IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 300

Query: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
           ELERP L TWNAVISGLVQNN+QDG +DIFRAMQ HGCRPNTVTLAS+LPIFSHFSTLKG
Sbjct: 301 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 360

Query: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
           GKEIH YA+RN YDGNI+VATAIIDSYAK GYLQGARQVFDQ K RSLI WT+IISAYA 
Sbjct: 361 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQSKRRSLIIWTAIISAYAA 420

Query: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
           HGDAN  LSLFYEMLT GI+PD VTFTSVL ACAHSGEL+EAWKIFN+LLP++GIQPLVE
Sbjct: 421 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEFGIQPLVE 480

Query: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
           HYACMVGVLSRAGKLSDAVEFISKMP+EP AKVWGALLNGASVAGDVELGKYVFDRL +I
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 540

Query: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 583
           EP NTGNY+IMANLYSQ G WKEAD +RDLMKEV LKKIPGNSWIETRGGLQSF
Sbjct: 541 EPENTGNYIIMANLYSQFGWWKEADHVRDLMKEVGLKKIPGNSWIETRGGLQSF 594

BLAST of IVF0022277 vs. NCBI nr
Match: XP_016899722.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g37310 [Cucumis melo] >KAA0044055.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK25080.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1170 bits (3026), Expect = 0.0
Identity = 583/595 (97.98%), Postives = 583/595 (97.98%), Query Frame = 0

Query: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
           MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60

Query: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
           FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL
Sbjct: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120

Query: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
           FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM
Sbjct: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180

Query: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
           LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ            FVNESQ
Sbjct: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 240

Query: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
           IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR
Sbjct: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300

Query: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
           ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG
Sbjct: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360

Query: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
           GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV
Sbjct: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420

Query: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
           HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE
Sbjct: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480

Query: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
           HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540

Query: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC 583
           EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC
Sbjct: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSFC 595

BLAST of IVF0022277 vs. NCBI nr
Match: XP_004137952.2 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g37310 [Cucumis sativus])

HSP 1 Score: 1102 bits (2850), Expect = 0.0
Identity = 550/594 (92.59%), Postives = 561/594 (94.44%), Query Frame = 0

Query: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
           MNY AYGRLIQHCTDHLFFRVGKQLHARLVLSSV PDNFLGSKLISFYSKSGS+RDAYNV
Sbjct: 1   MNYSAYGRLIQHCTDHLFFRVGKQLHARLVLSSVVPDNFLGSKLISFYSKSGSIRDAYNV 60

Query: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
           FGKIPRKNIFSWNAL ISYTLHNMHTDLLKLF SLVNSNSTDVKPDRFTVTC LKALASL
Sbjct: 61  FGKIPRKNIFSWNALLISYTLHNMHTDLLKLFSSLVNSNSTDVKPDRFTVTCALKALASL 120

Query: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
           FSNS LAKEVH FILRR LE DIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM
Sbjct: 121 FSNSGLAKEVHSFILRRGLEYDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180

Query: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
           LAGYSQGGSYEKCKELFRVM SSLEVKPNALTAVSVLQACAQ            FVNESQ
Sbjct: 181 LAGYSQGGSYEKCKELFRVMLSSLEVKPNALTAVSVLQACAQSNDLIFGIEVHRFVNESQ 240

Query: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
           IKMDVSLWNAVIGLYAKCGSLDYARELFEEM EKD ITYCSMISGYMVHGFVNQAMDLFR
Sbjct: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMLEKDAITYCSMISGYMVHGFVNQAMDLFR 300

Query: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
           E ERPRLPTWNAVISGLVQNNRQ+GA+DIFRAMQSHGCRPNTVTLASILP+FSHFSTLKG
Sbjct: 301 EQERPRLPTWNAVISGLVQNNRQEGAVDIFRAMQSHGCRPNTVTLASILPVFSHFSTLKG 360

Query: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
           GKEIHGYAIRNTYD NI+VATAIIDSYAKCGYL GA+ VFDQ+KGRSLIAWTSIISAYAV
Sbjct: 361 GKEIHGYAIRNTYDRNIYVATAIIDSYAKCGYLHGAQLVFDQIKGRSLIAWTSIISAYAV 420

Query: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
           HGDANVALSLFYEMLT GIQPDQVTFTSVLAACAHSGELDEAWKIFN+LLP+YGIQPLVE
Sbjct: 421 HGDANVALSLFYEMLTNGIQPDQVTFTSVLAACAHSGELDEAWKIFNVLLPEYGIQPLVE 480

Query: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
           HYACMVGVLSRAGKLSDAVEFISKMPLEP AKVWGALLNGASVAGDVELGKYVFDRLFEI
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEI 540

Query: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 582
           EP NTGNYVIMANLYSQSGRWK+ADTIRDLMKEVRLKKIPGNSWIET GG+Q F
Sbjct: 541 EPENTGNYVIMANLYSQSGRWKDADTIRDLMKEVRLKKIPGNSWIETSGGMQRF 594

BLAST of IVF0022277 vs. NCBI nr
Match: XP_038905794.1 (pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905795.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905796.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905797.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida])

HSP 1 Score: 1038 bits (2684), Expect = 0.0
Identity = 516/594 (86.87%), Postives = 543/594 (91.41%), Query Frame = 0

Query: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
           MNYGAYGRLIQHCTD LF R+GKQLHARLVLSSVAPDNFLGSKLI+FYSKSGSLRDAYNV
Sbjct: 30  MNYGAYGRLIQHCTDQLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNV 89

Query: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
           FG I  KNIF+WNALFISYTLHNMH D+L+LF SLVNSNSTDVKPD+FT+TCVLKALASL
Sbjct: 90  FGNISHKNIFTWNALFISYTLHNMHIDMLRLFSSLVNSNSTDVKPDKFTITCVLKALASL 149

Query: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
           FSNSVLAKEVHCFILRRELE DIFVVNALITFYSRCDELVLARI+FDRMPE+DIVSWNAM
Sbjct: 150 FSNSVLAKEVHCFILRRELEFDIFVVNALITFYSRCDELVLARIVFDRMPEKDIVSWNAM 209

Query: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
           +AGYSQGG YE+CKELF+ M SS+E+KPNALT VSVLQACAQ            FV+ESQ
Sbjct: 210 VAGYSQGGFYEECKELFKAMLSSVELKPNALTTVSVLQACAQSNDLIFGMEVHRFVSESQ 269

Query: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
           I+MDVSL NAVIGLYAKCGSLDYARELFEEMP+KD +TY SMISGYMV+GFVNQAMDLFR
Sbjct: 270 IEMDVSLCNAVIGLYAKCGSLDYARELFEEMPKKDEVTYGSMISGYMVYGFVNQAMDLFR 329

Query: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
           ELERP L TWNAVISGLVQNN+QD  LDIFRAMQSHGCRPNTVTLAS+LPIFSHFST+KG
Sbjct: 330 ELERPVLSTWNAVISGLVQNNQQDEVLDIFRAMQSHGCRPNTVTLASVLPIFSHFSTIKG 389

Query: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
           GKEIH YAIR  YDGNI+VAT II+SYAK GYL GARQVFDQLKGRSLI WT+IISAYA 
Sbjct: 390 GKEIHAYAIRKAYDGNIYVATGIINSYAKSGYLHGARQVFDQLKGRSLIIWTAIISAYAA 449

Query: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
           HGDANVALSLFYEML  GIQPD VTFTSVL ACAHSGELDEAWKIFN+LLP YGIQP VE
Sbjct: 450 HGDANVALSLFYEMLANGIQPDPVTFTSVLVACAHSGELDEAWKIFNVLLPKYGIQPPVE 509

Query: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
           HYACMVGVLSRAGKLSDAVEFISKMP EP AKVWGALLNGASVAGDVELGKYVFDRLFEI
Sbjct: 510 HYACMVGVLSRAGKLSDAVEFISKMPFEPTAKVWGALLNGASVAGDVELGKYVFDRLFEI 569

Query: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 582
           EP NTGNYVIMANLYSQ GRWKEAD +RDLMKEV LKKIPGNSWIETRGGLQSF
Sbjct: 570 EPENTGNYVIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLQSF 623

BLAST of IVF0022277 vs. NCBI nr
Match: KAG7017327.1 (ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1030 bits (2662), Expect = 0.0
Identity = 511/594 (86.03%), Postives = 537/594 (90.40%), Query Frame = 0

Query: 1    MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
            MNYGAYGRLIQHCTD  FFR+GKQLHARLVLSSVAPDNFLGSKLI+ YSKSGSLRDAYNV
Sbjct: 748  MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 807

Query: 61   FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
            F  I  KNIFSWNALFISYTLHNMH D+LKLF SLVN NSTDVKPD+FTVTCVLKALASL
Sbjct: 808  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDKFTVTCVLKALASL 867

Query: 121  FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
            F+NS+LAKEVHCF+LRR LESDIFVVNALITFYSRCDELVLARIMFDR PERDIVSWNAM
Sbjct: 868  FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFDRTPERDIVSWNAM 927

Query: 181  LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
            +AGYSQGG YE CKELF+ M  S E KPNALTAVSVLQACAQ            FVNES 
Sbjct: 928  VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLIFGMEVHKFVNESG 987

Query: 241  IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
            I+MDVSL+NAVIGLYAKCGSLDYARELFE MPEKD +TY SMISGYMVHGFVNQAMDLFR
Sbjct: 988  IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 1047

Query: 301  ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
            ELERP L TWNAVISGLVQNN+QDG +DIFRAMQ HGCRPNTVTLAS+LPIFSHFSTLKG
Sbjct: 1048 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 1107

Query: 361  GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
            GKEIH YA+RN YDGNI+VATAIIDSYAK GYLQGARQVFDQ K RSLI WT+IISAYA 
Sbjct: 1108 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQSKRRSLIIWTAIISAYAA 1167

Query: 421  HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
            HGDAN  LSLFYEMLT GI+PD VTFTSVL ACAHSGELDEAWKIFN+LLP++GIQPLVE
Sbjct: 1168 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELDEAWKIFNVLLPEFGIQPLVE 1227

Query: 481  HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
            HYACMVGVLSRAGKLSDAVEFISKMP+EP AKVWGALLNGASVAGDVELGKYVFDRL +I
Sbjct: 1228 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 1287

Query: 541  EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 582
            EP NTGNY+IMANLYSQ GRWKEAD +RDLMKEV LKKIPGNSWIETRGGLQSF
Sbjct: 1288 EPENTGNYIIMANLYSQFGRWKEADRVRDLMKEVGLKKIPGNSWIETRGGLQSF 1341

BLAST of IVF0022277 vs. NCBI nr
Match: KAG6580575.1 (ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1028 bits (2657), Expect = 0.0
Identity = 510/594 (85.86%), Postives = 536/594 (90.24%), Query Frame = 0

Query: 1    MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60
            MNYGAYGRLIQHCTD  FFR+GKQLHARLVLSSVAPDNFLGSKLI+ YSKSGSLRDAYNV
Sbjct: 729  MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 788

Query: 61   FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120
            F  I  KNIFSWNALFISYTLHNMH D+LKLF SLVN NSTDVKPD+FTVTCVLKALASL
Sbjct: 789  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDKFTVTCVLKALASL 848

Query: 121  FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180
            F+NS+LAKEVHCF+LRR LESDIFVVNALITFYSRCDELVLARIMFDR PERDIVSWNAM
Sbjct: 849  FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFDRTPERDIVSWNAM 908

Query: 181  LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQ 240
            +AGYSQGG YE CKELF+ M  S E KPNALTAVSVLQACAQ            FVNES 
Sbjct: 909  VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLIFGMEVHKFVNESG 968

Query: 241  IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300
            I+MDVSL+NAVIGLYAKCGSLDYARELFE MPEKD +TY SMISGYMVHGFVNQAMDLFR
Sbjct: 969  IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 1028

Query: 301  ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360
            ELERP L TWNAVISGLVQNN+QDG +DIFRAMQ HGCRPNTVTLAS+LPIFSHFSTLKG
Sbjct: 1029 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 1088

Query: 361  GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420
            GKEIH YA+RN YDGNI+VATAIIDSYAK GYL GARQVFDQ K RSLI WT+IISAYA 
Sbjct: 1089 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLHGARQVFDQSKRRSLIIWTAIISAYAA 1148

Query: 421  HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480
            HGDAN  LSLFYEMLT GI+PD VTFTSVL ACAHSGELDEAWKIFN+LLP++GIQPLVE
Sbjct: 1149 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELDEAWKIFNVLLPEFGIQPLVE 1208

Query: 481  HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540
            HYACMVGVLSRAGKLSDAVEFISKMP+EP AKVWGALLNGASVAGDVELGKYVFDRL +I
Sbjct: 1209 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 1268

Query: 541  EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 582
            EP NTGNY+IMANLYSQ GRWKEAD +RDLMKEV LKKIPGNSWIETRGGLQSF
Sbjct: 1269 EPENTGNYIIMANLYSQFGRWKEADRVRDLMKEVGLKKIPGNSWIETRGGLQSF 1322

BLAST of IVF0022277 vs. TAIR 10
Match: AT2G37310.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 682.2 bits (1759), Expect = 3.7e-196
Identity = 339/596 (56.88%), Postives = 436/596 (73.15%), Query Frame = 0

Query: 4   GAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNVFGK 63
           GAYG LIQH T H       QLHAR+V+ S+ PDNFL SKLISFY++    R A +VF +
Sbjct: 23  GAYGHLIQHFTRHRLPLHVLQLHARIVVFSIKPDNFLASKLISFYTRQDRFRQALHVFDE 82

Query: 64  IPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNS---NSTDVKPDRFTVTCVLKALASL 123
           I  +N FS+NAL I+YT   M+ D   LFLS + S   +S   +PD  +++CVLKAL+  
Sbjct: 83  ITVRNAFSYNALLIAYTSREMYFDAFSLFLSWIGSSCYSSDAARPDSISISCVLKALSGC 142

Query: 124 --FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWN 183
             F    LA++VH F++R   +SD+FV N +IT+Y++CD +  AR +FD M ERD+VSWN
Sbjct: 143 DDFWLGSLARQVHGFVIRGGFDSDVFVGNGMITYYTKCDNIESARKVFDEMSERDVVSWN 202

Query: 184 AMLAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQFVN------------E 243
           +M++GYSQ GS+E CK++++ M +  + KPN +T +SV QAC Q  +            E
Sbjct: 203 SMISGYSQSGSFEDCKKMYKAMLACSDFKPNGVTVISVFQACGQSSDLIFGLEVHKKMIE 262

Query: 244 SQIKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDL 303
           + I+MD+SL NAVIG YAKCGSLDYAR LF+EM EKD +TY ++ISGYM HG V +AM L
Sbjct: 263 NHIQMDLSLCNAVIGFYAKCGSLDYARALFDEMSEKDSVTYGAIISGYMAHGLVKEAMAL 322

Query: 304 FRELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTL 363
           F E+E   L TWNA+ISGL+QNN  +  ++ FR M   G RPNTVTL+S+LP  ++ S L
Sbjct: 323 FSEMESIGLSTWNAMISGLMQNNHHEEVINSFREMIRCGSRPNTVTLSSLLPSLTYSSNL 382

Query: 364 KGGKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAY 423
           KGGKEIH +AIRN  D NI+V T+IID+YAK G+L GA++VFD  K RSLIAWT+II+AY
Sbjct: 383 KGGKEIHAFAIRNGADNNIYVTTSIIDNYAKLGFLLGAQRVFDNCKDRSLIAWTAIITAY 442

Query: 424 AVHGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPL 483
           AVHGD++ A SLF +M   G +PD VT T+VL+A AHSG+ D A  IF+ +L  Y I+P 
Sbjct: 443 AVHGDSDSACSLFDQMQCLGTKPDDVTLTAVLSAFAHSGDSDMAQHIFDSMLTKYDIEPG 502

Query: 484 VEHYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLF 543
           VEHYACMV VLSRAGKLSDA+EFISKMP++P AKVWGALLNGASV GD+E+ ++  DRLF
Sbjct: 503 VEHYACMVSVLSRAGKLSDAMEFISKMPIDPIAKVWGALLNGASVLGDLEIARFACDRLF 562

Query: 544 EIEPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 583
           E+EP NTGNY IMANLY+Q+GRW+EA+ +R+ MK + LKKIPG SWIET  GL+SF
Sbjct: 563 EMEPENTGNYTIMANLYTQAGRWEEAEMVRNKMKRIGLKKIPGTSWIETEKGLRSF 618

BLAST of IVF0022277 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 385.2 bits (988), Expect = 9.4e-107
Identity = 206/589 (34.97%), Postives = 336/589 (57.05%), Query Frame = 0

Query: 9   LIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISF--YSKSGSLRDAYNVFGKIPR 68
           LI+ C      R  KQ H  ++ +    D +  SKL +    S   SL  A  VF +IP+
Sbjct: 36  LIERCVS---LRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPK 95

Query: 69  KNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVL 128
            N F+WN L  +Y         +  FL +V  + +   P+++T   ++KA A + S S L
Sbjct: 96  PNSFAWNTLIRAYASGPDPVLSIWAFLDMV--SESQCYPNKYTFPFLIKAAAEVSSLS-L 155

Query: 129 AKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAMLAGYSQ 188
            + +H   ++  + SD+FV N+LI  Y  C +L  A  +F  + E+D+VSWN+M+ G+ Q
Sbjct: 156 GQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 215

Query: 189 GGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQ------------FVNESQIKMDVS 248
            GS +K  ELF+ M S  +VK + +T V VL ACA+            ++ E+++ ++++
Sbjct: 216 KGSPDKALELFKKMESE-DVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLT 275

Query: 249 LWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFRELERPR 308
           L NA++ +Y KCGS++ A+ LF+ M EKD +T+ +M+ GY +      A ++   + +  
Sbjct: 276 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 335

Query: 309 LPTWNAVISGLVQNNRQDGALDIFRAMQ-SHGCRPNTVTLASILPIFSHFSTLKGGKEIH 368
           +  WNA+IS   QN + + AL +F  +Q     + N +TL S L   +    L+ G+ IH
Sbjct: 336 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 395

Query: 369 GYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAVHGDAN 428
            Y  ++    N  V +A+I  Y+KCG L+ +R+VF+ ++ R +  W+++I   A+HG  N
Sbjct: 396 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 455

Query: 429 VALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVEHYACM 488
            A+ +FY+M    ++P+ VTFT+V  AC+H+G +DEA  +F+ +  +YGI P  +HYAC+
Sbjct: 456 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 515

Query: 489 VGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEIEPGNT 548
           V VL R+G L  AV+FI  MP+ P+  VWGALL    +  ++ L +    RL E+EP N 
Sbjct: 516 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 575

Query: 549 GNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 583
           G +V+++N+Y++ G+W+    +R  M+   LKK PG S IE  G +  F
Sbjct: 576 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEF 617

BLAST of IVF0022277 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 365.5 bits (937), Expect = 7.7e-101
Identity = 210/658 (31.91%), Postives = 339/658 (51.52%), Query Frame = 0

Query: 9   LIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNVFGKIPRKN 68
           ++Q C D    + GK++   +  +    D+ LGSKL   Y+  G L++A  VF ++  + 
Sbjct: 100 VLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEK 159

Query: 69  IFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVLAK 128
              WN L          +  + LF  +++S    V+ D +T +CV K+ +SL S     +
Sbjct: 160 ALFWNILMNELAKSGDFSGSIGLFKKMMSSG---VEMDSYTFSCVSKSFSSLRSVHG-GE 219

Query: 129 EVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAMLAGYSQGG 188
           ++H FIL+        V N+L+ FY +   +  AR +FD M ERD++SWN+++ GY   G
Sbjct: 220 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 279

Query: 189 SYEKCKELF-RVMSSSLEVKPNALTAVSVLQACA--------QFVNESQIKMDVS----L 248
             EK   +F +++ S +E+  +  T VSV   CA        + V+   +K   S     
Sbjct: 280 LAEKGLSVFVQMLVSGIEI--DLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 339

Query: 249 WNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFRELERPRL 308
            N ++ +Y+KCG LD A+ +F EM ++  ++Y SMI+GY   G   +A+ LF E+E   +
Sbjct: 340 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 399

Query: 309 P----------------------------------------------------------- 368
                                                                       
Sbjct: 400 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 459

Query: 369 -----------TWNAVISGLVQNNRQDGALDIFR-AMQSHGCRPNTVTLASILPIFSHFS 428
                      +WN +I G  +N   + AL +F   ++     P+  T+A +LP  +  S
Sbjct: 460 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 519

Query: 429 TLKGGKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIIS 488
               G+EIHGY +RN Y  +  VA +++D YAKCG L  A  +FD +  + L++WT +I+
Sbjct: 520 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 579

Query: 489 AYAVHGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQ 548
            Y +HG    A++LF +M   GI+ D+++F S+L AC+HSG +DE W+ FNI+  +  I+
Sbjct: 580 GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIE 639

Query: 549 PLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDR 583
           P VEHYAC+V +L+R G L  A  FI  MP+ P+A +WGALL G  +  DV+L + V ++
Sbjct: 640 PTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEK 699

BLAST of IVF0022277 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 363.2 bits (931), Expect = 3.8e-100
Identity = 204/574 (35.54%), Postives = 315/574 (54.88%), Query Frame = 0

Query: 23  KQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNVFGKIPRKNIFSWNALFISYTLH 82
           KQ+HARL++  +    FL +KLI   S  G +  A  VF  +PR  IF WNA+   Y+ +
Sbjct: 38  KQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRN 97

Query: 83  NMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVLAKEVHCFILRRELESD 142
           N   D L ++    N     V PD FT   +LKA + L S+  + + VH  + R   ++D
Sbjct: 98  NHFQDALLMY---SNMQLARVSPDSFTFPHLLKACSGL-SHLQMGRFVHAQVFRLGFDAD 157

Query: 143 IFVVNALITFYSRCDELVLARIMFD--RMPERDIVSWNAMLAGYSQGGSYEKCKELFRVM 202
           +FV N LI  Y++C  L  AR +F+   +PER IVSW A+++ Y+Q G   +  E+F  M
Sbjct: 158 VFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQM 217

Query: 203 SSSLEVKPNALTAVSVLQA--CAQFVNESQ----------IKMDVSLWNAVIGLYAKCGS 262
              ++VKP+ +  VSVL A  C Q + + +          ++++  L  ++  +YAKCG 
Sbjct: 218 -RKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQ 277

Query: 263 LDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFRELERPRLPTWNAVISGLVQN 322
           +  A+ LF++M                               + P L  WNA+ISG  +N
Sbjct: 278 VATAKILFDKM-------------------------------KSPNLILWNAMISGYAKN 337

Query: 323 NRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKGGKEIHGYAIRNTYDGNIFVA 382
                A+D+F  M +   RP+T+++ S +   +   +L+  + ++ Y  R+ Y  ++F++
Sbjct: 338 GYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFIS 397

Query: 383 TAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAVHGDANVALSLFYEMLTYGIQ 442
           +A+ID +AKCG ++GAR VFD+   R ++ W+++I  Y +HG A  A+SL+  M   G+ 
Sbjct: 398 SALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVH 457

Query: 443 PDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVEHYACMVGVLSRAGKLSDAVE 502
           P+ VTF  +L AC HSG + E W  FN  + D+ I P  +HYAC++ +L RAG L  A E
Sbjct: 458 PNDVTFLGLLMACNHSGMVREGWWFFN-RMADHKINPQQQHYACVIDLLGRAGHLDQAYE 517

Query: 503 FISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEIEPGNTGNYVIMANLYSQSGR 562
            I  MP++P   VWGALL+       VELG+Y   +LF I+P NTG+YV ++NLY+ +  
Sbjct: 518 VIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARL 574

Query: 563 WKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 583
           W     +R  MKE  L K  G SW+E RG L++F
Sbjct: 578 WDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAF 574

BLAST of IVF0022277 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 362.1 bits (928), Expect = 8.5e-100
Identity = 206/616 (33.44%), Postives = 338/616 (54.87%), Query Frame = 0

Query: 16  HLFFRVGKQLHARLVLSSV-APDNFLGSKLISFYSKSGSLRDAYNVFGKIPRKNIFSWNA 75
           +++ + G  LHAR +   +     F  + ++S YSK G +      F ++P+++  SW  
Sbjct: 57  NVYSKTGYALHARKLFDEMPLRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTT 116

Query: 76  LFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVLAKEVHCFI 135
           + + Y     +   +++   +V      ++P +FT+T VL ++A+        K+VH FI
Sbjct: 117 MIVGYKNIGQYHKAIRVMGDMVKEG---IEPTQFTLTNVLASVAATRCMET-GKKVHSFI 176

Query: 136 LRRELESDIFVVNALITFYSRCDELVLARIMFDR-------------------------- 195
           ++  L  ++ V N+L+  Y++C + ++A+ +FDR                          
Sbjct: 177 VKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAM 236

Query: 196 -----MPERDIVSWNAMLAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQF 255
                M ERDIV+WN+M++G++Q G   +  ++F  M     + P+  T  SVL ACA  
Sbjct: 237 AQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANL 296

Query: 256 VN-------ESQI---KMDVS--LWNAVIGLYAKCGSLDYARELFEEMPEKD----GITY 315
                     S I     D+S  + NA+I +Y++CG ++ AR L E+   KD    G T 
Sbjct: 297 EKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFT- 356

Query: 316 CSMISGYMVHGFVNQAMDLFRELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCR 375
            +++ GY+  G +NQA ++F  L+   +  W A+I G  Q+     A+++FR+M   G R
Sbjct: 357 -ALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQR 416

Query: 376 PNTVTLASILPIFSHFSTLKGGKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQV 435
           PN+ TLA++L + S  ++L  GK+IHG A+++    ++ V+ A+I  YAK G +  A + 
Sbjct: 417 PNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRA 476

Query: 436 FDQLK-GRSLIAWTSIISAYAVHGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGE 495
           FD ++  R  ++WTS+I A A HG A  AL LF  ML  G++PD +T+  V +AC H+G 
Sbjct: 477 FDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGL 536

Query: 496 LDEAWKIFNILLPDYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALL 555
           +++  + F+++     I P + HYACMV +  RAG L +A EFI KMP+EP+   WG+LL
Sbjct: 537 VNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLL 596

Query: 556 NGASVAGDVELGKYVFDRLFEIEPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKK 583
           +   V  +++LGK   +RL  +EP N+G Y  +ANLYS  G+W+EA  IR  MK+ R+KK
Sbjct: 597 SACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKK 656

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZUT55.2e-19556.88Pentatricopeptide repeat-containing protein At2g37310 OS=Arabidopsis thaliana OX... [more]
O823801.3e-10534.97Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9SN391.1e-9931.91Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9LTV85.4e-9935.54Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9SHZ81.2e-9833.44Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A5A7TRM40.0e+0097.98Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S4DUQ60.0e+0097.98pentatricopeptide repeat-containing protein At2g37310 OS=Cucumis melo OX=3656 GN... [more]
A0A0A0LFN10.0e+0092.61Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736590 PE=4 SV=1[more]
A0A6J1F1104.5e-29885.69pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita moschata OX=3... [more]
A0A6J1J0S51.1e-29685.52pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
XP_016899722.10.097.98PREDICTED: pentatricopeptide repeat-containing protein At2g37310 [Cucumis melo] ... [more]
XP_004137952.20.092.59LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g37310 [Cucu... [more]
XP_038905794.10.086.87pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_03... [more]
KAG7017327.10.086.03ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyr... [more]
KAG6580575.10.085.86ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. soror... [more]
Match NameE-valueIdentityDescription
AT2G37310.13.7e-19656.88Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.19.4e-10734.97Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G18750.17.7e-10131.91Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.13.8e-10035.54mitochondrial editing factor 22 [more]
AT2G22070.18.5e-10033.44pentatricopeptide (PPR) repeat-containing protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 370..573
e-value: 7.4E-40
score: 139.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 8..123
e-value: 4.7E-12
score: 47.6
coord: 124..222
e-value: 7.3E-19
score: 69.8
coord: 223..292
e-value: 3.4E-18
score: 67.6
coord: 293..347
e-value: 8.1E-10
score: 40.3
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 262..559
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 432..455
e-value: 8.9E-4
score: 17.3
coord: 265..291
e-value: 6.2E-4
score: 17.8
coord: 297..330
e-value: 1.0E-7
score: 29.7
coord: 235..262
e-value: 1.0E-4
score: 20.2
coord: 398..430
e-value: 1.8E-6
score: 25.8
coord: 175..210
e-value: 3.1E-6
score: 25.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 470..493
e-value: 0.29
score: 11.5
coord: 44..69
e-value: 0.33
score: 11.3
coord: 265..292
e-value: 3.0E-5
score: 24.0
coord: 369..394
e-value: 0.0062
score: 16.7
coord: 536..561
e-value: 0.068
score: 13.4
coord: 235..262
e-value: 5.5E-6
score: 26.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 396..442
e-value: 2.0E-9
score: 37.5
coord: 173..221
e-value: 4.1E-7
score: 30.1
coord: 297..334
e-value: 3.6E-8
score: 33.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 173..203
score: 9.810421
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 395..429
score: 10.676364
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 232..262
score: 10.47906
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 294..328
score: 11.465577
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 430..465
score: 10.544828
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 263..293
score: 8.769097
NoneNo IPR availablePANTHERPTHR47925:SF76BNAA04G21330D PROTEINcoord: 5..582
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 5..582

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
IVF0022277.2IVF0022277.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding