IVF0023772 (gene) Melon (IVF77) v1

Overview
Name: IVF0023772
Type: gene
Organism: Cucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Pentatricopeptide repeat-containing protein
Location: chr05: 18618191 .. 18621058 (+)
RNA-Seq Expression: IVF0023772
Synteny: IVF0023772
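
The Location field can be turned into a sequence identifier, a coordinate pair, and a strand, from which the genomic span of the gene follows. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'chr05: 18618191 .. 18621058 (+)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("chr05: 18618191 .. 18621058 (+)")

# Assumption: both coordinates are 1-based and inclusive.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 2868 bp on the forward strand of chr05.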
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGGACCGCAGTGGCACGACGAAGCGGCGGAGAGACCAGCAGCAGTTTTGGTGCATGTAGAGGCGGTGGTACCCACGGACGAAAGAGCAGAGACCCAAGGGGGAGGAAGGGCGAGCAGATCCGTAGAGTCGGCGGTGGCTGGTAGACAATAGGCTTCGCGGCAGAAGAATTCTTTTTTTTTTTTCCTTCTTCTTTTACCATGTTCAATTCTGTATTATTGAAGATGATTGTCTGTTTACAGAGGGACTTACAACAGTGTATATATGAACCAAGGGGATTAGGAAAATATAATGTAGCTTAAGAAGATGAGATTCTAGTTGGCTAGTTTATAGCACAACATATTTGGTATTGCAGATGATATGCACAAACACTACCACCTCTGCCATTCGTGGACAAAAATTCTTCCTCACGCTTCTTAACAACGCTACTACACTTCCTCAACTCCTTCAAATTCAAGCCCAGTTGATCCTCCATGGTATCCAGTATGATCTCTCTTCGATTACCAAGCTTACCCACAAGTTCTTCGACCTTGGCGCCGTTGTCCATGTGCGCCAACTCTTCAATAAGGTTTCGAAACCTGATTTATTCTTGTTTAATGTCCTTATTAGAGGCTTCTCCGACAATGGTTTGCCTAAATCTTCGATCTTTCTCTATACCCATTTGAGAAAAAGGACTAATCTTAGGCCAGACAATTTCACTTATGCATTTGCGATTTCAGCTGCTTCGAGACTTGAGGATGAGAGGGTTGGTGTCTTGTTGCATGCGCACTCCATTGTTGATGGGGTGGCGTCAAATTTGTTTGTTGGCTCTGCAATTGTTGATTTGTACTTTAAATTTACCCGCGCTGAGTTGGCTCGTAAGGTGTTTGATGTAATGCCTGAGAGGGATACGGTTCTTTGGAACACGATGATATCTGGATTTTCCAGGAATTCTTATTTTGAGGACTCGATACGTGTTTTTGTGGATATGCTCGATGTTGGTTTGTCATTTGATTCTACAACTTTGGCTACGGTGCTTACAGCAGTAGCAGAGTTGCAGGAATATAGGCTAGGGATGGGTATCCAATGTTTGGCTTCAAAAAAAGGACTCCATTCTGATGTTTATGTGCTTACAGGATTGATATCATTGTATTCAAAATGCGGGAAGAGTCACAAAGGAAGGATATTGTTTGATCAGATTGATCAGCCAGATTTGATATCTTATAATGCAATGATTTCTGGTTATACTTTCAATCATGAAACTGAGTCGGCAGTTACACTATTCAGAGAATTGCTTGCCTCCGGACAAGGTGTTAATTCAAGCACTTTGGTGGGCTTAATTCCAGTTTATTCGCCCTTCAACCATCTGCAACTTACTCTCTTGATTCAAAATTTAAGCTTGAAGCTTGGTATTATTTTGCAACCTTCGGTTTCAACTGCTCTTACTACTGTTTATTGTCGACTAAATGAAGTGCAATTTGCAAGGCAGTTGTTTGATGAGTCTCCAGAGAAAAGTTTGGCTTCCTGGAATGCCATGATATCAGGGTATACTCAAAATGGGTTGACAGATAGAGCAATTTCTCTTTTCCAGGAAATGATGCCTCAGCTCAGTCCAAACCCTGTTACTGTCACCAGTATTCTTTCAGCTTGTGCACAACTTGGAGCTCTAAGTATTGGAAAATGGGTTCATGGCTTGATTAAGAGTGAAAGACTTGAATCTAACTTGTATGTTTCTACTGCATTAGTTGATATGTATGCAAAATGTGGAAGCATCGTGGAGGCTCGGCAATTATTTGACTTGATGGTAGACAAGAATGTCGTAACCTGGAATGCCATGATAACTGGTTATGGTCTCCATGGACATGGCAAGGAAGCACTAAAACTCTTTTATGAGATGTTGCAATCTGGGATTCCACCGACAGGTGTTACTTTCCTTTCTATCTTGTATGCTTGCAGTCACTCTGGCTTGGTGAGAGAGGGAAATGAAATTTTCCACTCTATGGCTAACGATTATGGTTTTCAG
CCCATGAGTGAGCACTACGCTTGCATGGTTGACATACTTGGGAGAGCTGGACAACTAACAAATGCCTTGGAGTTTATTGAAAGAATGCCACTGGAGCCTGGCCCAGCTGTTTGGGGTGCACTGCTTGGCGCTTGCATGATTCACAAGAATACAGAGATAGCCAATGTTGCTTCCAAAAGACTTTTTCAATTGGACCCAGAAAATGTGGGGTACTATGTTCTACTTTCTAACATATATTCTACTGACAGGAATTTCCCCAAAGCTGCTTCAGTACGACAAGTTGTTAAGAAGAGAAAACTAGCAAAAACACCTGGTTGCACTCTAATTGAGATTGGCGATCAACAATATGTGTTCACATCGGGGGATCGATCCCATCCTCAGGCCACAGCCATTTTTGAGATGCTAGAGAAGTTAACAGGGAAAATGAGAGAGGCTGGATATCAGGCAGAAACTGTCACTACTGCTTTGCATGATGTAGAGGATGAAGAGAAGGAGTTAATGGTGAATGTCCACAGTGAAAAATTAGCAATTGCTTTTGGTCTTATTTCAACCGAGCCTGGAACTGAAATTAGGATTATCAAGAACCTCCGAGTTTGTCTAGATTGTCATACTGCAACTAAATTTATATCAAAGATCACTGAGAGAGTGATTGTTGTTAGGGATGCTAATAGATTCCATCATTTCAAAAATGGTATTTGTTCATGTGGAGACTACTGGTGAAATTAACATTCCAGTGGCAGTTAAGATTTTTGGATTGGAGGAACCATAAAAGCCGGAAGCCATTGACATTTCATGCATGCAGTTGAAATAATTTTTCGATTGTAGATATTATGATTATTTTCAAAGAAGGTTAAACTTAACTCAGGAG

mRNA sequence

TGGACCGCAGTGGCACGACGAAGCGGCGGAGAGACCAGCAGCAGTTTTGGTGCATGTAGAGGCGGTGGTACCCACGGACGAAAGAGCAGAGACCCAAGGGGGAGGAAGGGCGAGCAGATCCGTAGAGTCGGCGGTGGCTGGTAGACAATAGGCTTCGCGGCAGAAGAATTCTTTTTTTTTTTTCCTTCTTCTTTTACCATGTTCAATTCTGTATTATTGAAGATGATTGTCTGTTTACAGAGGGACTTACAACAGTGTATATATGAACCAAGGGGATTAGGAAAATATAATGTAGCTTAAGAAGATGAGATTCTAGTTGGCTAGTTTATAGCACAACATATTTGGTATTGCAGATGATATGCACAAACACTACCACCTCTGCCATTCGTGGACAAAAATTCTTCCTCACGCTTCTTAACAACGCTACTACACTTCCTCAACTCCTTCAAATTCAAGCCCAGTTGATCCTCCATGGTATCCAGTATGATCTCTCTTCGATTACCAAGCTTACCCACAAGTTCTTCGACCTTGGCGCCGTTGTCCATGTGCGCCAACTCTTCAATAAGGTTTCGAAACCTGATTTATTCTTGTTTAATGTCCTTATTAGAGGCTTCTCCGACAATGGTTTGCCTAAATCTTCGATCTTTCTCTATACCCATTTGAGAAAAAGGACTAATCTTAGGCCAGACAATTTCACTTATGCATTTGCGATTTCAGCTGCTTCGAGACTTGAGGATGAGAGGGTTGGTGTCTTGTTGCATGCGCACTCCATTGTTGATGGGGTGGCGTCAAATTTGTTTGTTGGCTCTGCAATTGTTGATTTGTACTTTAAATTTACCCGCGCTGAGTTGGCTCGTAAGGTGTTTGATGTAATGCCTGAGAGGGATACGGTTCTTTGGAACACGATGATATCTGGATTTTCCAGGAATTCTTATTTTGAGGACTCGATACGTGTTTTTGTGGATATGCTCGATGTTGGTTTGTCATTTGATTCTACAACTTTGGCTACGGTGCTTACAGCAGTAGCAGAGTTGCAGGAATATAGGCTAGGGATGGGTATCCAATGTTTGGCTTCAAAAAAAGGACTCCATTCTGATGTTTATGTGCTTACAGGATTGATATCATTGTATTCAAAATGCGGGAAGAGTCACAAAGGAAGGATATTGTTTGATCAGATTGATCAGCCAGATTTGATATCTTATAATGCAATGATTTCTGGTTATACTTTCAATCATGAAACTGAGTCGGCAGTTACACTATTCAGAGAATTGCTTGCCTCCGGACAAGGTGTTAATTCAAGCACTTTGGTGGGCTTAATTCCAGTTTATTCGCCCTTCAACCATCTGCAACTTACTCTCTTGATTCAAAATTTAAGCTTGAAGCTTGGTATTATTTTGCAACCTTCGGTTTCAACTGCTCTTACTACTGTTTATTGTCGACTAAATGAAGTGCAATTTGCAAGGCAGTTGTTTGATGAGTCTCCAGAGAAAAGTTTGGCTTCCTGGAATGCCATGATATCAGGGTATACTCAAAATGGGTTGACAGATAGAGCAATTTCTCTTTTCCAGGAAATGATGCCTCAGCTCAGTCCAAACCCTGTTACTGTCACCAGTATTCTTTCAGCTTGTGCACAACTTGGAGCTCTAAGTATTGGAAAATGGGTTCATGGCTTGATTAAGAGTGAAAGACTTGAATCTAACTTGTATGTTTCTACTGCATTAGTTGATATGTATGCAAAATGTGGAAGCATCGTGGAGGCTCGGCAATTATTTGACTTGATGGTAGACAAGAATGTCGTAACCTGGAATGCCATGATAACTGGTTATGGTCTCCATGGACATGGCAAGGAAGCACTAAAACTCTTTTATGAGATGTTGCAATCTGGGATTCCACCGACAGGTGTTACTTTCCTTTCTATCTTGTATGCTTGCAGTCACTCTGGCTTGGTGAGAGAGGGAAATGAAATTTTCCACTCTATGGCTAACGATTATGGTTTTCAG
CCCATGAGTGAGCACTACGCTTGCATGGTTGACATACTTGGGAGAGCTGGACAACTAACAAATGCCTTGGAGTTTATTGAAAGAATGCCACTGGAGCCTGGCCCAGCTGTTTGGGGTGCACTGCTTGGCGCTTGCATGATTCACAAGAATACAGAGATAGCCAATGTTGCTTCCAAAAGACTTTTTCAATTGGACCCAGAAAATGTGGGGTACTATGTTCTACTTTCTAACATATATTCTACTGACAGGAATTTCCCCAAAGCTGCTTCAGTACGACAAGTTGTTAAGAAGAGAAAACTAGCAAAAACACCTGGTTGCACTCTAATTGAGATTGGCGATCAACAATATGTGTTCACATCGGGGGATCGATCCCATCCTCAGGCCACAGCCATTTTTGAGATGCTAGAGAAGTTAACAGGGAAAATGAGAGAGGCTGGATATCAGGCAGAAACTGTCACTACTGCTTTGCATGATGTAGAGGATGAAGAGAAGGAGTTAATGGTGAATGTCCACAGTGAAAAATTAGCAATTGCTTTTGGTCTTATTTCAACCGAGCCTGGAACTGAAATTAGGATTATCAAGAACCTCCGAGTTTGTCTAGATTGTCATACTGCAACTAAATTTATATCAAAGATCACTGAGAGAGTGATTGTTGTTAGGGATGCTAATAGATTCCATCATTTCAAAAATGGTATTTGTTCATGTGGAGACTACTGGTGAAATTAACATTCCAGTGGCAGTTAAGATTTTTGGATTGGAGGAACCATAAAAGCCGGAAGCCATTGACATTTCATGCATGCAGTTGAAATAATTTTTCGATTGTAGATATTATGATTATTTTCAAAGAAGGTTAAACTTAACTCAGGAG

Coding sequence (CDS)

ATGATATGCACAAACACTACCACCTCTGCCATTCGTGGACAAAAATTCTTCCTCACGCTTCTTAACAACGCTACTACACTTCCTCAACTCCTTCAAATTCAAGCCCAGTTGATCCTCCATGGTATCCAGTATGATCTCTCTTCGATTACCAAGCTTACCCACAAGTTCTTCGACCTTGGCGCCGTTGTCCATGTGCGCCAACTCTTCAATAAGGTTTCGAAACCTGATTTATTCTTGTTTAATGTCCTTATTAGAGGCTTCTCCGACAATGGTTTGCCTAAATCTTCGATCTTTCTCTATACCCATTTGAGAAAAAGGACTAATCTTAGGCCAGACAATTTCACTTATGCATTTGCGATTTCAGCTGCTTCGAGACTTGAGGATGAGAGGGTTGGTGTCTTGTTGCATGCGCACTCCATTGTTGATGGGGTGGCGTCAAATTTGTTTGTTGGCTCTGCAATTGTTGATTTGTACTTTAAATTTACCCGCGCTGAGTTGGCTCGTAAGGTGTTTGATGTAATGCCTGAGAGGGATACGGTTCTTTGGAACACGATGATATCTGGATTTTCCAGGAATTCTTATTTTGAGGACTCGATACGTGTTTTTGTGGATATGCTCGATGTTGGTTTGTCATTTGATTCTACAACTTTGGCTACGGTGCTTACAGCAGTAGCAGAGTTGCAGGAATATAGGCTAGGGATGGGTATCCAATGTTTGGCTTCAAAAAAAGGACTCCATTCTGATGTTTATGTGCTTACAGGATTGATATCATTGTATTCAAAATGCGGGAAGAGTCACAAAGGAAGGATATTGTTTGATCAGATTGATCAGCCAGATTTGATATCTTATAATGCAATGATTTCTGGTTATACTTTCAATCATGAAACTGAGTCGGCAGTTACACTATTCAGAGAATTGCTTGCCTCCGGACAAGGTGTTAATTCAAGCACTTTGGTGGGCTTAATTCCAGTTTATTCGCCCTTCAACCATCTGCAACTTACTCTCTTGATTCAAAATTTAAGCTTGAAGCTTGGTATTATTTTGCAACCTTCGGTTTCAACTGCTCTTACTACTGTTTATTGTCGACTAAATGAAGTGCAATTTGCAAGGCAGTTGTTTGATGAGTCTCCAGAGAAAAGTTTGGCTTCCTGGAATGCCATGATATCAGGGTATACTCAAAATGGGTTGACAGATAGAGCAATTTCTCTTTTCCAGGAAATGATGCCTCAGCTCAGTCCAAACCCTGTTACTGTCACCAGTATTCTTTCAGCTTGTGCACAACTTGGAGCTCTAAGTATTGGAAAATGGGTTCATGGCTTGATTAAGAGTGAAAGACTTGAATCTAACTTGTATGTTTCTACTGCATTAGTTGATATGTATGCAAAATGTGGAAGCATCGTGGAGGCTCGGCAATTATTTGACTTGATGGTAGACAAGAATGTCGTAACCTGGAATGCCATGATAACTGGTTATGGTCTCCATGGACATGGCAAGGAAGCACTAAAACTCTTTTATGAGATGTTGCAATCTGGGATTCCACCGACAGGTGTTACTTTCCTTTCTATCTTGTATGCTTGCAGTCACTCTGGCTTGGTGAGAGAGGGAAATGAAATTTTCCACTCTATGGCTAACGATTATGGTTTTCAGCCCATGAGTGAGCACTACGCTTGCATGGTTGACATACTTGGGAGAGCTGGACAACTAACAAATGCCTTGGAGTTTATTGAAAGAATGCCACTGGAGCCTGGCCCAGCTGTTTGGGGTGCACTGCTTGGCGCTTGCATGATTCACAAGAATACAGAGATAGCCAATGTTGCTTCCAAAAGACTTTTTCAATTGGACCCAGAAAATGTGGGGTACTATGTTCTACTTTCTAACATATATTCTACTGACAGGAATTTCCCCAAAGCTGCTTCAGTACGACAAGTTGTTAAGAAGAGAAAACTAGCAAAAACACCTGGTTGCACTCTAATTGAGATTGGCGATCAACAATATGTGTT
CACATCGGGGGATCGATCCCATCCTCAGGCCACAGCCATTTTTGAGATGCTAGAGAAGTTAACAGGGAAAATGAGAGAGGCTGGATATCAGGCAGAAACTGTCACTACTGCTTTGCATGATGTAGAGGATGAAGAGAAGGAGTTAATGGTGAATGTCCACAGTGAAAAATTAGCAATTGCTTTTGGTCTTATTTCAACCGAGCCTGGAACTGAAATTAGGATTATCAAGAACCTCCGAGTTTGTCTAGATTGTCATACTGCAACTAAATTTATATCAAAGATCACTGAGAGAGTGATTGTTGTTAGGGATGCTAATAGATTCCATCATTTCAAAAATGGTATTTGTTCATGTGGAGACTACTGGTGA

Protein sequence

MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVYCRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTSILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW
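
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. This is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the function name `translate` and the constants are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGATATGCACAAACACTACC"))  # first 21 bases of the CDS
print(translate("GGAGACTACTGGTGA"))        # last 15 bases, ending in the stop codon
```

The two fragments translate to MICTNTT and GDYW, which match the first and last residues of the protein sequence listed above.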
Homology
BLAST of IVF0023772 vs. ExPASy Swiss-Prot
Match: Q9SUH6 (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 1011.9 bits (2615), Expect = 3.9e-294
Identity = 505/787 (64.17%), Positives = 615/787 (78.14%), Query Frame = 0

Query: 4   TNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVV 63
           T  TT+A+  +  +L     +T++  L Q  AQ+ILHG + D+S +TKLT +  DLGA+ 
Sbjct: 10  TAETTAALISKNTYLDFFKRSTSISHLAQTHAQIILHGFRNDISLLTKLTQRLSDLGAIY 69

Query: 64  HVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAA 123
           + R +F  V +PD+FLFNVL+RGFS N  P SS+ ++ HLRK T+L+P++ TYAFAISAA
Sbjct: 70  YARDIFLSVQRPDVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAISAA 129

Query: 124 SRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWN 183
           S   D+R G ++H  ++VDG  S L +GS IV +YFKF R E ARKVFD MPE+DT+LWN
Sbjct: 130 SGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWN 189

Query: 184 TMISGFSRNSYFEDSIRVFVDMLDVGLS-FDSTTLATVLTAVAELQEYRLGMGIQCLASK 243
           TMISG+ +N  + +SI+VF D+++   +  D+TTL  +L AVAELQE RLGM I  LA+K
Sbjct: 190 TMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATK 249

Query: 244 KGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTL 303
            G +S  YVLTG ISLYSKCGK   G  LF +  +PD+++YNAMI GYT N ETE +++L
Sbjct: 250 TGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSL 309

Query: 304 FRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVYCR 363
           F+EL+ SG  + SSTLV L+PV     HL L   I    LK   +   SVSTALTTVY +
Sbjct: 310 FKELMLSGARLRSSTLVSLVPV---SGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYSK 369

Query: 364 LNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMM-PQLSPNPVTVTSI 423
           LNE++ AR+LFDESPEKSL SWNAMISGYTQNGLT+ AISLF+EM   + SPNPVT+T I
Sbjct: 370 LNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCI 429

Query: 424 LSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNV 483
           LSACAQLGALS+GKWVH L++S   ES++YVSTAL+ MYAKCGSI EAR+LFDLM  KN 
Sbjct: 430 LSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNE 489

Query: 484 VTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHS 543
           VTWN MI+GYGLHG G+EAL +FYEML SGI PT VTFL +LYACSH+GLV+EG+EIF+S
Sbjct: 490 VTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNS 549

Query: 544 MANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTE 603
           M + YGF+P  +HYACMVDILGRAG L  AL+FIE M +EPG +VW  LLGAC IHK+T 
Sbjct: 550 MIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTN 609

Query: 604 IANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIG 663
           +A   S++LF+LDP+NVGY+VLLSNI+S DRN+P+AA+VRQ  KKRKLAK PG TLIEIG
Sbjct: 610 LARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIG 669

Query: 664 DQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHS 723
           +  +VFTSGD+SHPQ   I+E LEKL GKMREAGYQ ET   ALHDVE+EE+ELMV VHS
Sbjct: 670 ETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPET-ELALHDVEEEERELMVKVHS 729

Query: 724 EKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGI 783
           E+LAIAFGLI+TEPGTEIRIIKNLRVCLDCHT TK ISKITERVIVVRDANRFHHFK+G+
Sbjct: 730 ERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGV 789

Query: 784 CSCGDYW 789
           CSCGDYW
Sbjct: 790 CSCGDYW 792
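
The percentages in an HSP summary line are the matched positions divided by the alignment length. The following sketch recomputes them from the counts reported for the Q9SUH6 hit; the helper name `hsp_fractions` is hypothetical, and the regular expression assumes the `count/length (percent%)` layout used on this page.

```python
import re

def hsp_fractions(line):
    """Return (count, length, reported_percent) for each 'a/b (p%)' field."""
    pattern = r"(\d+)/(\d+)\s*\(([\d.]+)%\)"
    return [(int(a), int(b), float(p)) for a, b, p in re.findall(pattern, line)]

summary = "Identity = 505/787 (64.17%), Positives = 615/787 (78.14%)"
for count, length, reported in hsp_fractions(summary):
    computed = round(100.0 * count / length, 2)
    print(count, length, reported, computed)
```

For this hit the recomputed values agree with the reported 64.17% identity and 78.14% positives over an alignment length of 787.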

BLAST of IVF0023772 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 589.7 bits (1519), Expect = 4.8e-167
Identity = 303/770 (39.35%), Positives = 467/770 (60.65%), Query Frame = 0

Query: 20  LLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKPDLFL 79
           LL   ++L +L QI   +  +G+  +    TKL   F   G+V    ++F  +      L
Sbjct: 43  LLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVL 102

Query: 80  FNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAHS 139
           ++ +++GF+       ++  +  +R   ++ P  + + + +       + RVG  +H   
Sbjct: 103 YHTMLKGFAKVSDLDKALQFFVRMR-YDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLL 162

Query: 140 IVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDSI 199
           +  G + +LF  + + ++Y K  +   ARKVFD MPERD V WNT+++G+S+N     ++
Sbjct: 163 VKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMAL 222

Query: 200 RVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLISLY 259
            +   M +  L     T+ +VL AV+ L+   +G  I   A + G  S V + T L+ +Y
Sbjct: 223 EMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMY 282

Query: 260 SKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQGVNSSTLV 319
           +KCG     R LFD + + +++S+N+MI  Y  N   + A+ +F+++L  G      +++
Sbjct: 283 AKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVM 342

Query: 320 GLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVYCRLNEVQFARQLFDESPEK 379
           G +   +    L+    I  LS++LG+    SV  +L ++YC+  EV  A  +F +   +
Sbjct: 343 GALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSR 402

Query: 380 SLASWNAMISGYTQNGLTDRAISLFQEMMPQ-LSPNPVTVTSILSACAQLGALSIGKWVH 439
           +L SWNAMI G+ QNG    A++ F +M  + + P+  T  S+++A A+L      KW+H
Sbjct: 403 TLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIH 462

Query: 440 GLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHGHGK 499
           G++    L+ N++V+TALVDMYAKCG+I+ AR +FD+M +++V TWNAMI GYG HG GK
Sbjct: 463 GVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGK 522

Query: 500 EALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHYACM 559
            AL+LF EM +  I P GVTFLS++ ACSHSGLV  G + F+ M  +Y  +   +HY  M
Sbjct: 523 AALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAM 582

Query: 560 VDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDPENV 619
           VD+LGRAG+L  A +FI +MP++P   V+GA+LGAC IHKN   A  A++RLF+L+P++ 
Sbjct: 583 VDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDG 642

Query: 620 GYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHPQAT 679
           GY+VLL+NIY     + K   VR  + ++ L KTPGC+++EI ++ + F SG  +HP + 
Sbjct: 643 GYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSK 702

Query: 680 AIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEPGTE 739
            I+  LEKL   ++EAGY  +  T  +  VE++ KE +++ HSEKLAI+FGL++T  GT 
Sbjct: 703 KIYAFLEKLICHIKEAGYVPD--TNLVLGVENDVKEQLLSTHSEKLAISFGLLNTTAGTT 762

Query: 740 IRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 789
           I + KNLRVC DCH ATK+IS +T R IVVRD  RFHHFKNG CSCGDYW
Sbjct: 763 IHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809
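
Each Match line in this section carries an accession followed by a UniProt-style description with `OS=` (organism), `OX=` (taxonomy ID), `GN=` (gene name), `PE=` (protein existence level) and `SV=` (sequence version) tags. A minimal sketch for splitting such a line into fields follows; the function name `parse_match` and the dictionary keys are illustrative choices, not part of the database.

```python
import re

def parse_match(line):
    """Split a 'Match: ACCESSION (description OS=... OX=... GN=... PE=... SV=...)' line."""
    m = re.fullmatch(r"Match:\s*(\S+)\s*\((.*)\)\s*", line)
    if m is None:
        raise ValueError(f"unrecognized match line: {line!r}")
    accession, description = m.group(1), m.group(2)
    # Splitting on the tags keeps them in the result because of the capture group.
    parts = re.split(r"\s(OS|OX|GN|PE|SV)=", description)
    record = {"accession": accession, "name": parts[0]}
    record.update(zip(parts[1::2], parts[2::2]))
    return record

hit = parse_match(
    "Match: Q9SUH6 (Pentatricopeptide repeat-containing protein At4g30700 "
    "OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)"
)
print(hit["accession"], hit["OS"], hit["GN"])
```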

BLAST of IVF0023772 vs. ExPASy Swiss-Prot
Match: O81767 (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 552.4 bits (1422), Expect = 8.5e-156
Identity = 295/774 (38.11%), Positives = 455/774 (58.79%), Query Frame = 0

Query: 19  TLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKPDLF 78
           TL    T L     + A+L++     ++    KL + +  LG V   R  F+ +   D++
Sbjct: 59  TLFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTFDHIQNRDVY 118

Query: 79  LFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAH 138
            +N++I G+   G     I  ++     + L PD  T+   + A   + D   G  +H  
Sbjct: 119 AWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKACRTVID---GNKIHCL 178

Query: 139 SIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDS 198
           ++  G   +++V ++++ LY ++     AR +FD MP RD   WN MISG+ ++   +++
Sbjct: 179 ALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEA 238

Query: 199 IRVFVDMLDVGL-SFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLIS 258
           +      L  GL + DS T+ ++L+A  E  ++  G+ I   + K GL S+++V   LI 
Sbjct: 239 L-----TLSNGLRAMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLID 298

Query: 259 LYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQGVNSST 318
           LY++ G+    + +FD++   DLIS+N++I  Y  N +   A++LF+E+  S    +  T
Sbjct: 299 LYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQPDCLT 358

Query: 319 LVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQP-SVSTALTTVYCRLNEVQFARQLFDES 378
           L+ L  + S    ++    +Q  +L+ G  L+  ++  A+  +Y +L  V  AR +F+  
Sbjct: 359 LISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNWL 418

Query: 379 PEKSLASWNAMISGYTQNGLTDRAISLF--QEMMPQLSPNPVTVTSILSACAQLGALSIG 438
           P   + SWN +ISGY QNG    AI ++   E   +++ N  T  S+L AC+Q GAL  G
Sbjct: 419 PNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQG 478

Query: 439 KWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLH 498
             +HG +    L  +++V T+L DMY KCG + +A  LF  +   N V WN +I  +G H
Sbjct: 479 MKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGFH 538

Query: 499 GHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEH 558
           GHG++A+ LF EML  G+ P  +TF+++L ACSHSGLV EG   F  M  DYG  P  +H
Sbjct: 539 GHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKH 598

Query: 559 YACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLD 618
           Y CMVD+ GRAGQL  AL+FI+ M L+P  ++WGALL AC +H N ++  +AS+ LF+++
Sbjct: 599 YGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVE 658

Query: 619 PENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSH 678
           PE+VGY+VLLSN+Y++   +     +R +   + L KTPG + +E+ ++  VF +G+++H
Sbjct: 659 PEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTH 718

Query: 679 PQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTE 738
           P    ++  L  L  K++  GY  +     L DVED+EKE ++  HSE+LAIAF LI+T 
Sbjct: 719 PMYEEMYRELTALQAKLKMIGYVPDH-RFVLQDVEDDEKEHILMSHSERLAIAFALIATP 778

Query: 739 PGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 789
             T IRI KNLRVC DCH+ TKFISKITER I+VRD+NRFHHFKNG+CSCGDYW
Sbjct: 779 AKTTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823

BLAST of IVF0023772 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 535.0 bits (1377), Expect = 1.4e-150
Identity = 271/697 (38.88%), Positives = 430/697 (61.69%), Query Frame = 0

Query: 95  SSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAHSIVDGVASNLFVGSAI 154
           +S  LYT+    + +  D+F  +   SA  + + +++    HA  +V G+  + F+ + +
Sbjct: 8   ASPLLYTN----SGIHSDSFYASLIDSATHKAQLKQI----HARLLVLGLQFSGFLITKL 67

Query: 155 VDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDS 214
           +     F     AR+VFD +P      WN +I G+SRN++F+D++ ++ +M    +S DS
Sbjct: 68  IHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDS 127

Query: 215 TTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQ 274
            T   +L A + L   ++G  +     + G  +DV+V  GLI+LY+KC +    R +F+ 
Sbjct: 128 FTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEG 187

Query: 275 IDQPD--LISYNAMISGYTFNHETESAVTLFRELLASGQGVNSSTLVGLIPVYSPFNHLQ 334
           +  P+  ++S+ A++S Y  N E   A+ +F ++       +   LV ++  ++    L+
Sbjct: 188 LPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLK 247

Query: 335 LTLLIQNLSLKLGIILQPSVSTALTTVYCRLNEVQFARQLFDESPEKSLASWNAMISGYT 394
               I    +K+G+ ++P +  +L T+Y +  +V  A+ LFD+    +L  WNAMISGY 
Sbjct: 248 QGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYA 307

Query: 395 QNGLTDRAISLFQEMM-PQLSPNPVTVTSILSACAQLGALSIGKWVHGLIKSERLESNLY 454
           +NG    AI +F EM+   + P+ +++TS +SACAQ+G+L   + ++  +       +++
Sbjct: 308 KNGYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVF 367

Query: 455 VSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHGHGKEALKLFYEMLQSG 514
           +S+AL+DM+AKCGS+  AR +FD  +D++VV W+AMI GYGLHG  +EA+ L+  M + G
Sbjct: 368 ISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGG 427

Query: 515 IPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHYACMVDILGRAGQLTNA 574
           + P  VTFL +L AC+HSG+VREG   F+ MA D+   P  +HYAC++D+LGRAG L  A
Sbjct: 428 VHPNDVTFLGLLMACNHSGMVREGWWFFNRMA-DHKINPQQQHYACVIDLLGRAGHLDQA 487

Query: 575 LEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDPENVGYYVLLSNIYSTD 634
            E I+ MP++PG  VWGALL AC  H++ E+   A+++LF +DP N G+YV LSN+Y+  
Sbjct: 488 YEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAA 547

Query: 635 RNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHPQATAIFEMLEKLTGKM 694
           R + + A VR  +K++ L K  GC+ +E+  +   F  GD+SHP+   I   +E +  ++
Sbjct: 548 RLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRL 607

Query: 695 REAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEPGTEIRIIKNLRVCLDC 754
           +E G+ A     +LHD+ DEE E  +  HSE++AIA+GLIST  GT +RI KNLR C++C
Sbjct: 608 KEGGFVANK-DASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNC 667

Query: 755 HTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 789
           H ATK ISK+ +R IVVRD NRFHHFK+G+CSCGDYW
Sbjct: 668 HAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of IVF0023772 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 528.5 bits (1360), Expect = 1.3e-148
Identity = 299/819 (36.51%), Positives = 455/819 (55.56%), Query Frame = 0

Query: 21  LNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKPD--LF 80
           ++   T+ Q+  I  +L+  GI   L+  + L   +  +G + H   L  +    D  ++
Sbjct: 35  IHKCKTISQVKLIHQKLLSFGI-LTLNLTSHLISTYISVGCLSHAVSLLRRFPPSDAGVY 94

Query: 81  LFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAH 140
            +N LIR + DNG     ++L+  L    +  PDN+T+ F   A   +   R G   HA 
Sbjct: 95  HWNSLIRSYGDNGCANKCLYLF-GLMHSLSWTPDNYTFPFVFKACGEISSVRCGESAHAL 154

Query: 141 SIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDS 200
           S+V G  SN+FVG+A+V +Y +      ARKVFD M   D V WN++I  +++    + +
Sbjct: 155 SLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKVA 214

Query: 201 IRVFVDML-DVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLIS 260
           + +F  M  + G   D+ TL  VL   A L  + LG  + C A    +  +++V   L+ 
Sbjct: 215 LEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAVTSEMIQNMFVGNCLVD 274

Query: 261 LYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLF-------------- 320
           +Y+KCG   +   +F  +   D++S+NAM++GY+     E AV LF              
Sbjct: 275 MYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKMDVVT 334

Query: 321 ---------------------RELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSL 380
                                R++L+SG   N  TL+ ++   +    L     I   ++
Sbjct: 335 WSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCASVGALMHGKEIHCYAI 394

Query: 381 KLGIILQPS-------VSTALTTVYCRLNEVQFARQLFDE-SP-EKSLASWNAMISGYTQ 440
           K  I L+ +       V   L  +Y +  +V  AR +FD  SP E+ + +W  MI GY+Q
Sbjct: 395 KYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGGYSQ 454

Query: 441 NGLTDRAISLFQEMMP---QLSPNPVTVTSILSACAQLGALSIGKWVHG-LIKSERLESN 500
           +G  ++A+ L  EM     Q  PN  T++  L ACA L AL IGK +H   +++++    
Sbjct: 455 HGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAVP 514

Query: 501 LYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHGHGKEALKLFYEMLQ 560
           L+VS  L+DMYAKCGSI +AR +FD M+ KN VTW +++TGYG+HG+G+EAL +F EM +
Sbjct: 515 LFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMRR 574

Query: 561 SGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHYACMVDILGRAGQLT 620
            G    GVT L +LYACSHSG++ +G E F+ M   +G  P  EHYAC+VD+LGRAG+L 
Sbjct: 575 IGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRLN 634

Query: 621 NALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDPENVGYYVLLSNIYS 680
            AL  IE MP+EP P VW A L  C IH   E+   A++++ +L   + G Y LLSN+Y+
Sbjct: 635 AALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLYA 694

Query: 681 TDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHPQATAIFEMLEKLTG 740
               +     +R +++ + + K PGC+ +E       F  GD++HP A  I+++L     
Sbjct: 695 NAGRWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKTHPHAKEIYQVLLDHMQ 754

Query: 741 KMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEPGTEIRIIKNLRVCL 789
           ++++ GY  ET   ALHDV+DEEK+ ++  HSEKLA+A+G+++T  G  IRI KNLRVC 
Sbjct: 755 RIKDIGYVPET-GFALHDVDDEEKDDLLFEHSEKLALAYGILTTPQGAAIRITKNLRVCG 814

BLAST of IVF0023772 vs. ExPASy TrEMBL
Match: A0A5A7U078 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G00560 PE=3 SV=1)

HSP 1 Score: 1566.6 bits (4055), Expect = 0.0e+00
Identity = 788/788 (100.00%), Positives = 788/788 (100.00%), Query Frame = 0

Query: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60
           MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG
Sbjct: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60

Query: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120
           AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI
Sbjct: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120

Query: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180
           SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV
Sbjct: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180

Query: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240
           LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA
Sbjct: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240

Query: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300
           SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV
Sbjct: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300

Query: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360
           TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY
Sbjct: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360

Query: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420
           CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS
Sbjct: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420

Query: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480
           ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN
Sbjct: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480

Query: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540
           VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH
Sbjct: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540

Query: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600
           SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT
Sbjct: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600

Query: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660
           EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI
Sbjct: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660

Query: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720
           GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH
Sbjct: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720

Query: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780
           SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG
Sbjct: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780

Query: 781 ICSCGDYW 789
           ICSCGDYW
Sbjct: 781 ICSCGDYW 788

BLAST of IVF0023772 vs. ExPASy TrEMBL
Match: A0A1S4E1Q1 (pentatricopeptide repeat-containing protein At4g30700 OS=Cucumis melo OX=3656 GN=LOC103492100 PE=3 SV=1)

HSP 1 Score: 1566.6 bits (4055), Expect = 0.0e+00
Identity = 788/788 (100.00%), Positives = 788/788 (100.00%), Query Frame = 0

Query: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60
           MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG
Sbjct: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60

Query: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120
           AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI
Sbjct: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120

Query: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180
           SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV
Sbjct: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180

Query: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240
           LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA
Sbjct: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240

Query: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300
           SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV
Sbjct: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300

Query: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360
           TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY
Sbjct: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360

Query: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420
           CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS
Sbjct: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420

Query: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480
           ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN
Sbjct: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480

Query: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540
           VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH
Sbjct: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540

Query: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600
           SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT
Sbjct: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600

Query: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660
           EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI
Sbjct: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660

Query: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720
           GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH
Sbjct: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720

Query: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780
           SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG
Sbjct: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780

Query: 781 ICSCGDYW 789
           ICSCGDYW
Sbjct: 781 ICSCGDYW 788

BLAST of IVF0023772 vs. ExPASy TrEMBL
Match: A0A0A0LMK7 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G070330 PE=3 SV=1)

HSP 1 Score: 1505.3 bits (3896), Expect = 0.0e+00
Identity = 757/773 (97.93%), Positives = 764/773 (98.84%), Query Frame = 0

Query: 16  FFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKP 75
           FFLTLLNNATTL QLLQIQAQLILHGI YDLSSITKLTHKFFDLGAV HVRQLFNKVSKP
Sbjct: 12  FFLTLLNNATTLSQLLQIQAQLILHGIHYDLSSITKLTHKFFDLGAVAHVRQLFNKVSKP 71

Query: 76  DLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLL 135
           DLFLFNVLIRGFSDNGLPKSSIFLYTHLRK+TNLRPDNFTYAFAISAASRLEDERVGVLL
Sbjct: 72  DLFLFNVLIRGFSDNGLPKSSIFLYTHLRKKTNLRPDNFTYAFAISAASRLEDERVGVLL 131

Query: 136 HAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYF 195
           HAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYF
Sbjct: 132 HAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYF 191

Query: 196 EDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGL 255
           EDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGL
Sbjct: 192 EDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGL 251

Query: 256 ISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQGVNS 315
           ISLYSKCGKS KGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQ VNS
Sbjct: 252 ISLYSKCGKSCKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQRVNS 311

Query: 316 STLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVYCRLNEVQFARQLFDE 375
           STLVGLIPVY PFNHLQL+ LIQNLSLK+GIILQPSVSTALTTVYCRLNEVQFARQLFDE
Sbjct: 312 STLVGLIPVYLPFNHLQLSRLIQNLSLKIGIILQPSVSTALTTVYCRLNEVQFARQLFDE 371

Query: 376 SPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTSILSACAQLGALSIGK 435
           SPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTSILSACAQLGALSIGK
Sbjct: 372 SPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTSILSACAQLGALSIGK 431

Query: 436 WVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHG 495
           WVHGLIKSERLESN+YVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHG
Sbjct: 432 WVHGLIKSERLESNVYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHG 491

Query: 496 HGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHY 555
           HGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLV EGNEIFHSMAN+YGFQPMSEHY
Sbjct: 492 HGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVSEGNEIFHSMANNYGFQPMSEHY 551

Query: 556 ACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDP 615
           ACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTE+ANVASKRLFQLDP
Sbjct: 552 ACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEMANVASKRLFQLDP 611

Query: 616 ENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHP 675
           ENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI DQQYVFTSGDRSHP
Sbjct: 612 ENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIDDQQYVFTSGDRSHP 671

Query: 676 QATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEP 735
           QATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLIST+P
Sbjct: 672 QATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTKP 731

Query: 736 GTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 789
           GTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW
Sbjct: 732 GTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 784
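
The summary line of each HSP reports identities and positives as a count over the aligned length, followed by a percentage. As a minimal sketch (the function names `parse_hsp_stats` and `percent` are hypothetical helpers, not part of any BLAST tool), the line can be parsed and the percentages rechecked as follows. The regular expression deliberately tolerates spelling variants of the "Positives" label.

```python
import re

# Matches the statistics line of an HSP as printed on this page, e.g.
# "Identity = 757/773 (97.93%), Positives = 764/773 (98.84%), Query Frame = 0".
# "Posi?tives" accepts the label with or without the second "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
        # True when the printed percentages agree with the printed counts.
        "consistent": (
            int(ident_len) == int(pos_len)
            and percent(int(ident), int(ident_len)) == float(ident_pct)
            and percent(int(pos), int(pos_len)) == float(pos_pct)
        ),
    }


if __name__ == "__main__":
    line = "Identity = 757/773 (97.93%), Positives = 764/773 (98.84%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

For the hit above, 757 identical residues over 773 aligned positions gives 97.93%, and 764 positive-scoring positions gives 98.84%, matching the reported values.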

BLAST of IVF0023772 vs. ExPASy TrEMBL
Match: A0A6J1EDA0 (pentatricopeptide repeat-containing protein At4g30700 OS=Cucurbita moschata OX=3662 GN=LOC111433113 PE=3 SV=1)

HSP 1 Score: 1431.0 bits (3703), Expect = 0.0e+00
Identity = 712/788 (90.36%), Positives = 749/788 (95.05%), Query Frame = 0

Query: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60
           MICTNT  S IR +KFFL LLN ATTLPQLLQ+QAQLILHGI YDLSSITKLTHKFFDLG
Sbjct: 1   MICTNTAISVIRDKKFFLALLNKATTLPQLLQVQAQLILHGIHYDLSSITKLTHKFFDLG 60

Query: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120
           AV HVRQLF  VS+PDLF+FNVLIRGFSDN LPKSSI +YTHLRK TNLRPDNFTYAFAI
Sbjct: 61  AVRHVRQLFANVSRPDLFMFNVLIRGFSDNNLPKSSISVYTHLRKWTNLRPDNFTYAFAI 120

Query: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180
           SAAS+ EDER+GVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRA+LARKVFD MPERDTV
Sbjct: 121 SAASKFEDERLGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRADLARKVFDAMPERDTV 180

Query: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240
           LWNTMISGFSRNSYFEDSIRVFVDMLDVGL FDSTTLA VLTAVAELQEYRLGM IQCLA
Sbjct: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLPFDSTTLAAVLTAVAELQEYRLGMSIQCLA 240

Query: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300
           SKKGLHSDVYVLTGLISL+SKCG+S K R+LFDQIDQPDLISYNAMISGYTFNHET SAV
Sbjct: 241 SKKGLHSDVYVLTGLISLFSKCGESDKARLLFDQIDQPDLISYNAMISGYTFNHETGSAV 300

Query: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360
           TLFRELLASGQGV+SSTLVGLIPV+SPF+HLQLT  IQ LS+KLGII +PSVSTALTTVY
Sbjct: 301 TLFRELLASGQGVSSSTLVGLIPVFSPFSHLQLTRSIQTLSIKLGIISKPSVSTALTTVY 360

Query: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420
           CRLNE+Q+ARQLFDESPEKSLASWNAMISGYTQNGLT+ AISLFQEMMPQLSPNPVTVTS
Sbjct: 361 CRLNEIQYARQLFDESPEKSLASWNAMISGYTQNGLTESAISLFQEMMPQLSPNPVTVTS 420

Query: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480
           ILSACAQLGALS+GKWVHGLIKSE+LESN+YV+TALVDMYAKCGS+VEARQLFDL  +KN
Sbjct: 421 ILSACAQLGALSLGKWVHGLIKSEKLESNIYVTTALVDMYAKCGSVVEARQLFDLTAEKN 480

Query: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540
            VTWNAMITGYGLHG+G EAL LF +MLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH
Sbjct: 481 AVTWNAMITGYGLHGYGNEALNLFNKMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540

Query: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600
           SM N++GFQPMSEHYACMVDI GRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT
Sbjct: 541 SMVNNFGFQPMSEHYACMVDIFGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600

Query: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660
           +IA+VAS+RLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKR LAKTPGCTLIEI
Sbjct: 601 DIAHVASERLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRNLAKTPGCTLIEI 660

Query: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720
            DQQ+VFTSGDRSHP+A AI+ MLEKL GKMREAGYQAETVTTALHDVEDEEKELMVNVH
Sbjct: 661 DDQQHVFTSGDRSHPRAMAIYAMLEKLIGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720

Query: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780
           SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFK+G
Sbjct: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKDG 780

Query: 781 ICSCGDYW 789
           +CSCGDYW
Sbjct: 781 LCSCGDYW 788
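
In these pairwise blocks the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a non-identical substitution with a positive score, and a blank marks any other mismatch or a gap. The sketch below (a hypothetical helper, `hsp_row_counts`, written only for illustration) tallies identities and positives for one Query/midline pair. It assumes trailing blanks on the middle row may have been lost, so the row is padded to the query length first.

```python
def hsp_row_counts(query, midline):
    """Count (identities, positives, columns) for one alignment row.

    query   -- the residues printed on the "Query:" line (no coordinates)
    midline -- the unlabeled row beneath it (no leading indentation)
    """
    # Restore trailing blanks that text extraction may have stripped.
    padded = midline.ljust(len(query))
    identities = sum(1 for ch in padded if ch.isalpha())
    conservative = padded.count("+")
    # Positives include identities plus positive-scoring substitutions.
    return identities, identities + conservative, len(padded)


if __name__ == "__main__":
    # Last row of the Cucurbita moschata hit: I vs. L is shown as "+".
    print(hsp_row_counts("ICSCGDYW", "+CSCGDYW"))
```

Summing these per-row counts over all rows of an HSP should reproduce the Identity and Positives totals printed in its header, provided the rows were copied without losing leading or internal blanks.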

BLAST of IVF0023772 vs. ExPASy TrEMBL
Match: A0A6J1IEL0 (pentatricopeptide repeat-containing protein At4g30700 OS=Cucurbita maxima OX=3661 GN=LOC111471980 PE=3 SV=1)

HSP 1 Score: 1429.5 bits (3699), Expect = 0.0e+00
Identity = 710/788 (90.10%), Positives = 750/788 (95.18%), Query Frame = 0

Query: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60
           MICTNTT S IR +KFFL LLN ATTLPQLLQIQAQLILHGI YDLSSITKLTHKFFDLG
Sbjct: 1   MICTNTTISVIRDKKFFLPLLNKATTLPQLLQIQAQLILHGIHYDLSSITKLTHKFFDLG 60

Query: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120
           AV HVRQLF  VS+PDLF+FNVLIRGFSDN LPKSSI +YTHLRK TNLRPDNFTYAFAI
Sbjct: 61  AVRHVRQLFANVSRPDLFMFNVLIRGFSDNNLPKSSISVYTHLRKWTNLRPDNFTYAFAI 120

Query: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180
           SAAS+ EDER+GVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRA++ARKVFD MPERDTV
Sbjct: 121 SAASKFEDERLGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRADMARKVFDAMPERDTV 180

Query: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240
           LWNTMISGFSRNSYFEDSIRVFVDML VGL FDSTTLA VLTAVAELQEYRLGM IQCLA
Sbjct: 181 LWNTMISGFSRNSYFEDSIRVFVDMLHVGLPFDSTTLAAVLTAVAELQEYRLGMSIQCLA 240

Query: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300
           SKKGLHSDVYVLTGLISL+SKCG+S K R+LFDQIDQPDLISYNAMISGYTFNHET SAV
Sbjct: 241 SKKGLHSDVYVLTGLISLFSKCGESDKARLLFDQIDQPDLISYNAMISGYTFNHETGSAV 300

Query: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360
           TLFRELLASGQGV+SSTLVGLIPV+SPF+HLQLT  IQ LS+K+GII +PSVSTALTTVY
Sbjct: 301 TLFRELLASGQGVSSSTLVGLIPVFSPFSHLQLTRSIQTLSIKIGIISKPSVSTALTTVY 360

Query: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420
           CRLNE+Q+ARQLFDESPEKSLASWNAMISGYTQNGLT+ AISLFQEM+PQLSPNPVTVTS
Sbjct: 361 CRLNEIQYARQLFDESPEKSLASWNAMISGYTQNGLTESAISLFQEMLPQLSPNPVTVTS 420

Query: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480
           ILSACAQLGALS+GKWVHGLIKSE+LESN+YV+TAL+DMYAKCGS+VEARQLFDLM +KN
Sbjct: 421 ILSACAQLGALSLGKWVHGLIKSEKLESNIYVTTALIDMYAKCGSVVEARQLFDLMAEKN 480

Query: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540
            VTWNAMITGYGLHG+G EAL LF +MLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH
Sbjct: 481 AVTWNAMITGYGLHGYGNEALNLFNKMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540

Query: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600
           SM N++GFQPMSEHYACMVDI GRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT
Sbjct: 541 SMVNNFGFQPMSEHYACMVDIFGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600

Query: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660
           +IA+VAS+RLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKR LAKTPGCTLIEI
Sbjct: 601 DIAHVASERLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRNLAKTPGCTLIEI 660

Query: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720
            DQQ+VFTSGD+SHP+ATAI+ MLEKL GKMREAGYQAETVTTALHDVEDEEKELMVNVH
Sbjct: 661 DDQQHVFTSGDQSHPRATAIYAMLEKLIGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720

Query: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780
           SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFK+G
Sbjct: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKDG 780

Query: 781 ICSCGDYW 789
            CSCGDYW
Sbjct: 781 FCSCGDYW 788

BLAST of IVF0023772 vs. NCBI nr
Match: XP_016902152.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g30700 [Cucumis melo] >KAA0047626.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK08282.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1560 bits (4040), Expect = 0.0
Identity = 788/788 (100.00%), Positives = 788/788 (100.00%), Query Frame = 0

Query: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60
           MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG
Sbjct: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60

Query: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120
           AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI
Sbjct: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120

Query: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180
           SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV
Sbjct: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180

Query: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240
           LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA
Sbjct: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240

Query: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300
           SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV
Sbjct: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300

Query: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360
           TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY
Sbjct: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360

Query: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420
           CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS
Sbjct: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420

Query: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480
           ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN
Sbjct: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480

Query: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540
           VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH
Sbjct: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540

Query: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600
           SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT
Sbjct: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600

Query: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660
           EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI
Sbjct: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660

Query: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720
           GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH
Sbjct: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720

Query: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780
           SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG
Sbjct: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780

Query: 781 ICSCGDYW 788
           ICSCGDYW
Sbjct: 781 ICSCGDYW 788

BLAST of IVF0023772 vs. NCBI nr
Match: XP_004152852.1 (pentatricopeptide repeat-containing protein At4g30700 [Cucumis sativus])

HSP 1 Score: 1524 bits (3947), Expect = 0.0
Identity = 770/788 (97.72%), Positives = 778/788 (98.73%), Query Frame = 0

Query: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60
           MICTNT TSAIRGQ+FFLTLLNNATTL QLLQIQAQLILHGI YDLSSITKLTHKFFDLG
Sbjct: 1   MICTNTATSAIRGQRFFLTLLNNATTLSQLLQIQAQLILHGIHYDLSSITKLTHKFFDLG 60

Query: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120
           AV HVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRK+TNLRPDNFTYAFAI
Sbjct: 61  AVAHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKKTNLRPDNFTYAFAI 120

Query: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180
           SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV
Sbjct: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180

Query: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240
           LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA
Sbjct: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240

Query: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300
           SKKGLHSDVYVLTGLISLYSKCGKS KGRILFDQIDQPDLISYNAMISGYTFNHETESAV
Sbjct: 241 SKKGLHSDVYVLTGLISLYSKCGKSCKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300

Query: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360
           TLFRELLASGQ VNSSTLVGLIPVY PFNHLQL+ LIQNLSLK+GIILQPSVSTALTTVY
Sbjct: 301 TLFRELLASGQRVNSSTLVGLIPVYLPFNHLQLSRLIQNLSLKIGIILQPSVSTALTTVY 360

Query: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420
           CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS
Sbjct: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420

Query: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480
           ILSACAQLGALSIGKWVHGLIKSERLESN+YVSTALVDMYAKCGSIVEARQLFDLMVDKN
Sbjct: 421 ILSACAQLGALSIGKWVHGLIKSERLESNVYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480

Query: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540
           VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLV EGNEIFH
Sbjct: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVSEGNEIFH 540

Query: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600
           SMAN+YGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT
Sbjct: 541 SMANNYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600

Query: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660
           E+ANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI
Sbjct: 601 EMANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660

Query: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720
            DQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH
Sbjct: 661 DDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720

Query: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780
           SEKLAIAFGLIST+PGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG
Sbjct: 721 SEKLAIAFGLISTKPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780

Query: 781 ICSCGDYW 788
           ICSCGDYW
Sbjct: 781 ICSCGDYW 788

BLAST of IVF0023772 vs. NCBI nr
Match: KGN61216.1 (hypothetical protein Csa_006224 [Cucumis sativus])

HSP 1 Score: 1499 bits (3881), Expect = 0.0
Identity = 757/773 (97.93%), Positives = 764/773 (98.84%), Query Frame = 0

Query: 16  FFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKP 75
           FFLTLLNNATTL QLLQIQAQLILHGI YDLSSITKLTHKFFDLGAV HVRQLFNKVSKP
Sbjct: 12  FFLTLLNNATTLSQLLQIQAQLILHGIHYDLSSITKLTHKFFDLGAVAHVRQLFNKVSKP 71

Query: 76  DLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLL 135
           DLFLFNVLIRGFSDNGLPKSSIFLYTHLRK+TNLRPDNFTYAFAISAASRLEDERVGVLL
Sbjct: 72  DLFLFNVLIRGFSDNGLPKSSIFLYTHLRKKTNLRPDNFTYAFAISAASRLEDERVGVLL 131

Query: 136 HAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYF 195
           HAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYF
Sbjct: 132 HAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYF 191

Query: 196 EDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGL 255
           EDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGL
Sbjct: 192 EDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGL 251

Query: 256 ISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQGVNS 315
           ISLYSKCGKS KGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQ VNS
Sbjct: 252 ISLYSKCGKSCKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQRVNS 311

Query: 316 STLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVYCRLNEVQFARQLFDE 375
           STLVGLIPVY PFNHLQL+ LIQNLSLK+GIILQPSVSTALTTVYCRLNEVQFARQLFDE
Sbjct: 312 STLVGLIPVYLPFNHLQLSRLIQNLSLKIGIILQPSVSTALTTVYCRLNEVQFARQLFDE 371

Query: 376 SPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTSILSACAQLGALSIGK 435
           SPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTSILSACAQLGALSIGK
Sbjct: 372 SPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTSILSACAQLGALSIGK 431

Query: 436 WVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHG 495
           WVHGLIKSERLESN+YVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHG
Sbjct: 432 WVHGLIKSERLESNVYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHG 491

Query: 496 HGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHY 555
           HGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLV EGNEIFHSMAN+YGFQPMSEHY
Sbjct: 492 HGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVSEGNEIFHSMANNYGFQPMSEHY 551

Query: 556 ACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDP 615
           ACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTE+ANVASKRLFQLDP
Sbjct: 552 ACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEMANVASKRLFQLDP 611

Query: 616 ENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHP 675
           ENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI DQQYVFTSGDRSHP
Sbjct: 612 ENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIDDQQYVFTSGDRSHP 671

Query: 676 QATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEP 735
           QATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLIST+P
Sbjct: 672 QATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTKP 731

Query: 736 GTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 788
           GTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW
Sbjct: 732 GTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 784

BLAST of IVF0023772 vs. NCBI nr
Match: XP_038889958.1 (pentatricopeptide repeat-containing protein At4g30700-like isoform X2 [Benincasa hispida] >XP_038889959.1 pentatricopeptide repeat-containing protein At4g30700-like isoform X2 [Benincasa hispida])

HSP 1 Score: 1483 bits (3839), Expect = 0.0
Identity = 748/788 (94.92%), Positives = 766/788 (97.21%), Query Frame = 0

Query: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60
           MICTNTTTSAI G+KFFLTLLN ATTLPQLLQI AQLILHGI  DLSSITKLTHKFFDLG
Sbjct: 1   MICTNTTTSAIHGRKFFLTLLNKATTLPQLLQIHAQLILHGIHNDLSSITKLTHKFFDLG 60

Query: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120
           AV HVRQLF KVSKPDLFLFNVLIRGFSDN LPKSSIFLYTHLRK TNLRPDNFT+AFAI
Sbjct: 61  AVYHVRQLFAKVSKPDLFLFNVLIRGFSDNSLPKSSIFLYTHLRKGTNLRPDNFTFAFAI 120

Query: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180
           SAASR EDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV
Sbjct: 121 SAASRFEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180

Query: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240
           LWNTMISGFSRNSYFEDSIRVFVDML+ GLSFDSTTLA VLTAVAELQEYRLGMGIQCLA
Sbjct: 181 LWNTMISGFSRNSYFEDSIRVFVDMLNAGLSFDSTTLAAVLTAVAELQEYRLGMGIQCLA 240

Query: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300
           SKKGLHSDVYVLTGLISLYSKCGKS KGR+LFDQIDQPDLISYNAMISGYTFNHETESAV
Sbjct: 241 SKKGLHSDVYVLTGLISLYSKCGKSDKGRLLFDQIDQPDLISYNAMISGYTFNHETESAV 300

Query: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360
           TLF+ELLASGQGVNSSTLVGL+PV+SPFNHLQLT LIQNLS+K+GII QPSVSTALTTVY
Sbjct: 301 TLFKELLASGQGVNSSTLVGLVPVFSPFNHLQLTCLIQNLSMKIGIISQPSVSTALTTVY 360

Query: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420
           CRLNEVQFAR+LFDESPEKSLASWNAMISGYTQNGLT+RAISLFQEM+PQLSPNPVTVTS
Sbjct: 361 CRLNEVQFARKLFDESPEKSLASWNAMISGYTQNGLTERAISLFQEMVPQLSPNPVTVTS 420

Query: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480
           ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLM +KN
Sbjct: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMAEKN 480

Query: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540
           VVTWNAMITGYGLHGHGKEAL LF EML+SGIP T VTFLSILYACSHSGLVREGNEIFH
Sbjct: 481 VVTWNAMITGYGLHGHGKEALNLFNEMLRSGIPLTRVTFLSILYACSHSGLVREGNEIFH 540

Query: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600
           SM N+YGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT
Sbjct: 541 SMVNNYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600

Query: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660
           EIA+VASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI
Sbjct: 601 EIAHVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660

Query: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720
           G+QQYVFTSGD+SHPQATAIF MLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH
Sbjct: 661 GNQQYVFTSGDQSHPQATAIFAMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720

Query: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780
           SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG
Sbjct: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780

Query: 781 ICSCGDYW 788
           ICSCGDYW
Sbjct: 781 ICSCGDYW 788

BLAST of IVF0023772 vs. NCBI nr
Match: XP_038889951.1 (pentatricopeptide repeat-containing protein At4g30700-like isoform X1 [Benincasa hispida] >XP_038889952.1 pentatricopeptide repeat-containing protein At4g30700-like isoform X1 [Benincasa hispida] >XP_038889953.1 pentatricopeptide repeat-containing protein At4g30700-like isoform X1 [Benincasa hispida] >XP_038889954.1 pentatricopeptide repeat-containing protein At4g30700-like isoform X1 [Benincasa hispida] >XP_038889955.1 pentatricopeptide repeat-containing protein At4g30700-like isoform X1 [Benincasa hispida] >XP_038889956.1 pentatricopeptide repeat-containing protein At4g30700-like isoform X1 [Benincasa hispida] >XP_038889957.1 pentatricopeptide repeat-containing protein At4g30700-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1483 bits (3839), Expect = 0.0
Identity = 748/788 (94.92%), Positives = 766/788 (97.21%), Query Frame = 0

Query: 1   MICTNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLG 60
           MICTNTTTSAI G+KFFLTLLN ATTLPQLLQI AQLILHGI  DLSSITKLTHKFFDLG
Sbjct: 5   MICTNTTTSAIHGRKFFLTLLNKATTLPQLLQIHAQLILHGIHNDLSSITKLTHKFFDLG 64

Query: 61  AVVHVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAI 120
           AV HVRQLF KVSKPDLFLFNVLIRGFSDN LPKSSIFLYTHLRK TNLRPDNFT+AFAI
Sbjct: 65  AVYHVRQLFAKVSKPDLFLFNVLIRGFSDNSLPKSSIFLYTHLRKGTNLRPDNFTFAFAI 124

Query: 121 SAASRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 180
           SAASR EDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV
Sbjct: 125 SAASRFEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTV 184

Query: 181 LWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLA 240
           LWNTMISGFSRNSYFEDSIRVFVDML+ GLSFDSTTLA VLTAVAELQEYRLGMGIQCLA
Sbjct: 185 LWNTMISGFSRNSYFEDSIRVFVDMLNAGLSFDSTTLAAVLTAVAELQEYRLGMGIQCLA 244

Query: 241 SKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAV 300
           SKKGLHSDVYVLTGLISLYSKCGKS KGR+LFDQIDQPDLISYNAMISGYTFNHETESAV
Sbjct: 245 SKKGLHSDVYVLTGLISLYSKCGKSDKGRLLFDQIDQPDLISYNAMISGYTFNHETESAV 304

Query: 301 TLFRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVY 360
           TLF+ELLASGQGVNSSTLVGL+PV+SPFNHLQLT LIQNLS+K+GII QPSVSTALTTVY
Sbjct: 305 TLFKELLASGQGVNSSTLVGLVPVFSPFNHLQLTCLIQNLSMKIGIISQPSVSTALTTVY 364

Query: 361 CRLNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMMPQLSPNPVTVTS 420
           CRLNEVQFAR+LFDESPEKSLASWNAMISGYTQNGLT+RAISLFQEM+PQLSPNPVTVTS
Sbjct: 365 CRLNEVQFARKLFDESPEKSLASWNAMISGYTQNGLTERAISLFQEMVPQLSPNPVTVTS 424

Query: 421 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKN 480
           ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLM +KN
Sbjct: 425 ILSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMAEKN 484

Query: 481 VVTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFH 540
           VVTWNAMITGYGLHGHGKEAL LF EML+SGIP T VTFLSILYACSHSGLVREGNEIFH
Sbjct: 485 VVTWNAMITGYGLHGHGKEALNLFNEMLRSGIPLTRVTFLSILYACSHSGLVREGNEIFH 544

Query: 541 SMANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 600
           SM N+YGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT
Sbjct: 545 SMVNNYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNT 604

Query: 601 EIANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 660
           EIA+VASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI
Sbjct: 605 EIAHVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEI 664

Query: 661 GDQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 720
           G+QQYVFTSGD+SHPQATAIF MLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH
Sbjct: 665 GNQQYVFTSGDQSHPQATAIFAMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVH 724

Query: 721 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 780
           SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG
Sbjct: 725 SEKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNG 784

Query: 781 ICSCGDYW 788
           ICSCGDYW
Sbjct: 785 ICSCGDYW 792

BLAST of IVF0023772 vs. TAIR 10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 1011.9 bits (2615), Expect = 2.8e-295
Identity = 505/787 (64.17%), Positives = 615/787 (78.14%), Query Frame = 0

Query: 4   TNTTTSAIRGQKFFLTLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVV 63
           T  TT+A+  +  +L     +T++  L Q  AQ+ILHG + D+S +TKLT +  DLGA+ 
Sbjct: 10  TAETTAALISKNTYLDFFKRSTSISHLAQTHAQIILHGFRNDISLLTKLTQRLSDLGAIY 69

Query: 64  HVRQLFNKVSKPDLFLFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAA 123
           + R +F  V +PD+FLFNVL+RGFS N  P SS+ ++ HLRK T+L+P++ TYAFAISAA
Sbjct: 70  YARDIFLSVQRPDVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAISAA 129

Query: 124 SRLEDERVGVLLHAHSIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWN 183
           S   D+R G ++H  ++VDG  S L +GS IV +YFKF R E ARKVFD MPE+DT+LWN
Sbjct: 130 SGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWN 189

Query: 184 TMISGFSRNSYFEDSIRVFVDMLDVGLS-FDSTTLATVLTAVAELQEYRLGMGIQCLASK 243
           TMISG+ +N  + +SI+VF D+++   +  D+TTL  +L AVAELQE RLGM I  LA+K
Sbjct: 190 TMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATK 249

Query: 244 KGLHSDVYVLTGLISLYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTL 303
            G +S  YVLTG ISLYSKCGK   G  LF +  +PD+++YNAMI GYT N ETE +++L
Sbjct: 250 TGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSL 309

Query: 304 FRELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVYCR 363
           F+EL+ SG  + SSTLV L+PV     HL L   I    LK   +   SVSTALTTVY +
Sbjct: 310 FKELMLSGARLRSSTLVSLVPV---SGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYSK 369

Query: 364 LNEVQFARQLFDESPEKSLASWNAMISGYTQNGLTDRAISLFQEMM-PQLSPNPVTVTSI 423
           LNE++ AR+LFDESPEKSL SWNAMISGYTQNGLT+ AISLF+EM   + SPNPVT+T I
Sbjct: 370 LNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCI 429

Query: 424 LSACAQLGALSIGKWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNV 483
           LSACAQLGALS+GKWVH L++S   ES++YVSTAL+ MYAKCGSI EAR+LFDLM  KN 
Sbjct: 430 LSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNE 489

Query: 484 VTWNAMITGYGLHGHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHS 543
           VTWN MI+GYGLHG G+EAL +FYEML SGI PT VTFL +LYACSH+GLV+EG+EIF+S
Sbjct: 490 VTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNS 549

Query: 544 MANDYGFQPMSEHYACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTE 603
           M + YGF+P  +HYACMVDILGRAG L  AL+FIE M +EPG +VW  LLGAC IHK+T 
Sbjct: 550 MIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTN 609

Query: 604 IANVASKRLFQLDPENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIG 663
           +A   S++LF+LDP+NVGY+VLLSNI+S DRN+P+AA+VRQ  KKRKLAK PG TLIEIG
Sbjct: 610 LARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIG 669

Query: 664 DQQYVFTSGDRSHPQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHS 723
           +  +VFTSGD+SHPQ   I+E LEKL GKMREAGYQ ET   ALHDVE+EE+ELMV VHS
Sbjct: 670 ETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPET-ELALHDVEEEERELMVKVHS 729

Query: 724 EKLAIAFGLISTEPGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGI 783
           E+LAIAFGLI+TEPGTEIRIIKNLRVCLDCHT TK ISKITERVIVVRDANRFHHFK+G+
Sbjct: 730 ERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGV 789

Query: 784 CSCGDYW 789
           CSCGDYW
Sbjct: 790 CSCGDYW 792

BLAST of IVF0023772 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 589.7 bits (1519), Expect = 3.4e-168
Identity = 303/770 (39.35%), Positives = 467/770 (60.65%), Query Frame = 0

Query: 20  LLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKPDLFL 79
           LL   ++L +L QI   +  +G+  +    TKL   F   G+V    ++F  +      L
Sbjct: 43  LLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVL 102

Query: 80  FNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAHS 139
           ++ +++GF+       ++  +  +R   ++ P  + + + +       + RVG  +H   
Sbjct: 103 YHTMLKGFAKVSDLDKALQFFVRMR-YDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLL 162

Query: 140 IVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDSI 199
           +  G + +LF  + + ++Y K  +   ARKVFD MPERD V WNT+++G+S+N     ++
Sbjct: 163 VKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMAL 222

Query: 200 RVFVDMLDVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLISLY 259
            +   M +  L     T+ +VL AV+ L+   +G  I   A + G  S V + T L+ +Y
Sbjct: 223 EMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMY 282

Query: 260 SKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQGVNSSTLV 319
           +KCG     R LFD + + +++S+N+MI  Y  N   + A+ +F+++L  G      +++
Sbjct: 283 AKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVM 342

Query: 320 GLIPVYSPFNHLQLTLLIQNLSLKLGIILQPSVSTALTTVYCRLNEVQFARQLFDESPEK 379
           G +   +    L+    I  LS++LG+    SV  +L ++YC+  EV  A  +F +   +
Sbjct: 343 GALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSR 402

Query: 380 SLASWNAMISGYTQNGLTDRAISLFQEMMPQ-LSPNPVTVTSILSACAQLGALSIGKWVH 439
           +L SWNAMI G+ QNG    A++ F +M  + + P+  T  S+++A A+L      KW+H
Sbjct: 403 TLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIH 462

Query: 440 GLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHGHGK 499
           G++    L+ N++V+TALVDMYAKCG+I+ AR +FD+M +++V TWNAMI GYG HG GK
Sbjct: 463 GVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGK 522

Query: 500 EALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHYACM 559
            AL+LF EM +  I P GVTFLS++ ACSHSGLV  G + F+ M  +Y  +   +HY  M
Sbjct: 523 AALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAM 582

Query: 560 VDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDPENV 619
           VD+LGRAG+L  A +FI +MP++P   V+GA+LGAC IHKN   A  A++RLF+L+P++ 
Sbjct: 583 VDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDG 642

Query: 620 GYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHPQAT 679
           GY+VLL+NIY     + K   VR  + ++ L KTPGC+++EI ++ + F SG  +HP + 
Sbjct: 643 GYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSK 702

Query: 680 AIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEPGTE 739
            I+  LEKL   ++EAGY  +  T  +  VE++ KE +++ HSEKLAI+FGL++T  GT 
Sbjct: 703 KIYAFLEKLICHIKEAGYVPD--TNLVLGVENDVKEQLLSTHSEKLAISFGLLNTTAGTT 762

Query: 740 IRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 789
           I + KNLRVC DCH ATK+IS +T R IVVRD  RFHHFKNG CSCGDYW
Sbjct: 763 IHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of IVF0023772 vs. TAIR 10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 552.4 bits (1422), Expect = 6.0e-157
Identity = 295/774 (38.11%), Positives = 455/774 (58.79%), Query Frame = 0

Query: 19  TLLNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKPDLF 78
           TL    T L     + A+L++     ++    KL + +  LG V   R  F+ +   D++
Sbjct: 59  TLFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTFDHIQNRDVY 118

Query: 79  LFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAH 138
            +N++I G+   G     I  ++     + L PD  T+   + A   + D   G  +H  
Sbjct: 119 AWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKACRTVID---GNKIHCL 178

Query: 139 SIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDS 198
           ++  G   +++V ++++ LY ++     AR +FD MP RD   WN MISG+ ++   +++
Sbjct: 179 ALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEA 238

Query: 199 IRVFVDMLDVGL-SFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLIS 258
           +      L  GL + DS T+ ++L+A  E  ++  G+ I   + K GL S+++V   LI 
Sbjct: 239 L-----TLSNGLRAMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLID 298

Query: 259 LYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLFRELLASGQGVNSST 318
           LY++ G+    + +FD++   DLIS+N++I  Y  N +   A++LF+E+  S    +  T
Sbjct: 299 LYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQPDCLT 358

Query: 319 LVGLIPVYSPFNHLQLTLLIQNLSLKLGIILQP-SVSTALTTVYCRLNEVQFARQLFDES 378
           L+ L  + S    ++    +Q  +L+ G  L+  ++  A+  +Y +L  V  AR +F+  
Sbjct: 359 LISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNWL 418

Query: 379 PEKSLASWNAMISGYTQNGLTDRAISLF--QEMMPQLSPNPVTVTSILSACAQLGALSIG 438
           P   + SWN +ISGY QNG    AI ++   E   +++ N  T  S+L AC+Q GAL  G
Sbjct: 419 PNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQG 478

Query: 439 KWVHGLIKSERLESNLYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLH 498
             +HG +    L  +++V T+L DMY KCG + +A  LF  +   N V WN +I  +G H
Sbjct: 479 MKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGFH 538

Query: 499 GHGKEALKLFYEMLQSGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEH 558
           GHG++A+ LF EML  G+ P  +TF+++L ACSHSGLV EG   F  M  DYG  P  +H
Sbjct: 539 GHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKH 598

Query: 559 YACMVDILGRAGQLTNALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLD 618
           Y CMVD+ GRAGQL  AL+FI+ M L+P  ++WGALL AC +H N ++  +AS+ LF+++
Sbjct: 599 YGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVE 658

Query: 619 PENVGYYVLLSNIYSTDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSH 678
           PE+VGY+VLLSN+Y++   +     +R +   + L KTPG + +E+ ++  VF +G+++H
Sbjct: 659 PEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTH 718

Query: 679 PQATAIFEMLEKLTGKMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTE 738
           P    ++  L  L  K++  GY  +     L DVED+EKE ++  HSE+LAIAF LI+T 
Sbjct: 719 PMYEEMYRELTALQAKLKMIGYVPDH-RFVLQDVEDDEKEHILMSHSERLAIAFALIATP 778

Query: 739 PGTEIRIIKNLRVCLDCHTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 789
             T IRI KNLRVC DCH+ TKFISKITER I+VRD+NRFHHFKNG+CSCGDYW
Sbjct: 779 AKTTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823
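
The percentages in each HSP summary can be rechecked from the raw counts. The following is a minimal Python sketch, assuming the summary line has the layout shown in this record; `parse_hsp_stats` is a hypothetical helper name, not part of the database or of any BLAST tool. The pattern also accepts the misspelling "Postives" that some exports of this page contain.

```python
import re

# Hypothetical helper: parses the "Identity = a/n (p%), Positives = b/n (q%)"
# summary line of one HSP and recomputes both percentages from the counts.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


# Summary line of the AT4G33990.1 hit above.
stats = parse_hsp_stats(
    "Identity = 295/774 (38.11%), Positives = 455/774 (58.79%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For this hit the recomputed values agree with the reported ones: 295/774 and 455/774 round to 38.11 and 58.79.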

BLAST of IVF0023772 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22)

HSP 1 Score: 535.0 bits (1377), Expect = 1.0e-151
Identity = 271/697 (38.88%), Positives = 430/697 (61.69%), Query Frame = 0

Query: 95  SSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAHSIVDGVASNLFVGSAI 154
           +S  LYT+    + +  D+F  +   SA  + + +++    HA  +V G+  + F+ + +
Sbjct: 8   ASPLLYTN----SGIHSDSFYASLIDSATHKAQLKQI----HARLLVLGLQFSGFLITKL 67

Query: 155 VDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDSIRVFVDMLDVGLSFDS 214
           +     F     AR+VFD +P      WN +I G+SRN++F+D++ ++ +M    +S DS
Sbjct: 68  IHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDS 127

Query: 215 TTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLISLYSKCGKSHKGRILFDQ 274
            T   +L A + L   ++G  +     + G  +DV+V  GLI+LY+KC +    R +F+ 
Sbjct: 128 FTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEG 187

Query: 275 IDQPD--LISYNAMISGYTFNHETESAVTLFRELLASGQGVNSSTLVGLIPVYSPFNHLQ 334
           +  P+  ++S+ A++S Y  N E   A+ +F ++       +   LV ++  ++    L+
Sbjct: 188 LPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLK 247

Query: 335 LTLLIQNLSLKLGIILQPSVSTALTTVYCRLNEVQFARQLFDESPEKSLASWNAMISGYT 394
               I    +K+G+ ++P +  +L T+Y +  +V  A+ LFD+    +L  WNAMISGY 
Sbjct: 248 QGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYA 307

Query: 395 QNGLTDRAISLFQEMM-PQLSPNPVTVTSILSACAQLGALSIGKWVHGLIKSERLESNLY 454
           +NG    AI +F EM+   + P+ +++TS +SACAQ+G+L   + ++  +       +++
Sbjct: 308 KNGYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVF 367

Query: 455 VSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHGHGKEALKLFYEMLQSG 514
           +S+AL+DM+AKCGS+  AR +FD  +D++VV W+AMI GYGLHG  +EA+ L+  M + G
Sbjct: 368 ISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGG 427

Query: 515 IPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHYACMVDILGRAGQLTNA 574
           + P  VTFL +L AC+HSG+VREG   F+ MA D+   P  +HYAC++D+LGRAG L  A
Sbjct: 428 VHPNDVTFLGLLMACNHSGMVREGWWFFNRMA-DHKINPQQQHYACVIDLLGRAGHLDQA 487

Query: 575 LEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDPENVGYYVLLSNIYSTD 634
            E I+ MP++PG  VWGALL AC  H++ E+   A+++LF +DP N G+YV LSN+Y+  
Sbjct: 488 YEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAA 547

Query: 635 RNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHPQATAIFEMLEKLTGKM 694
           R + + A VR  +K++ L K  GC+ +E+  +   F  GD+SHP+   I   +E +  ++
Sbjct: 548 RLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRL 607

Query: 695 REAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEPGTEIRIIKNLRVCLDC 754
           +E G+ A     +LHD+ DEE E  +  HSE++AIA+GLIST  GT +RI KNLR C++C
Sbjct: 608 KEGGFVANK-DASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNC 667

Query: 755 HTATKFISKITERVIVVRDANRFHHFKNGICSCGDYW 789
           H ATK ISK+ +R IVVRD NRFHHFK+G+CSCGDYW
Sbjct: 668 HAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of IVF0023772 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 528.5 bits (1360), Expect = 9.3e-150
Identity = 299/819 (36.51%), Positives = 455/819 (55.56%), Query Frame = 0

Query: 21  LNNATTLPQLLQIQAQLILHGIQYDLSSITKLTHKFFDLGAVVHVRQLFNKVSKPD--LF 80
           ++   T+ Q+  I  +L+  GI   L+  + L   +  +G + H   L  +    D  ++
Sbjct: 35  IHKCKTISQVKLIHQKLLSFGI-LTLNLTSHLISTYISVGCLSHAVSLLRRFPPSDAGVY 94

Query: 81  LFNVLIRGFSDNGLPKSSIFLYTHLRKRTNLRPDNFTYAFAISAASRLEDERVGVLLHAH 140
            +N LIR + DNG     ++L+  L    +  PDN+T+ F   A   +   R G   HA 
Sbjct: 95  HWNSLIRSYGDNGCANKCLYLF-GLMHSLSWTPDNYTFPFVFKACGEISSVRCGESAHAL 154

Query: 141 SIVDGVASNLFVGSAIVDLYFKFTRAELARKVFDVMPERDTVLWNTMISGFSRNSYFEDS 200
           S+V G  SN+FVG+A+V +Y +      ARKVFD M   D V WN++I  +++    + +
Sbjct: 155 SLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKVA 214

Query: 201 IRVFVDML-DVGLSFDSTTLATVLTAVAELQEYRLGMGIQCLASKKGLHSDVYVLTGLIS 260
           + +F  M  + G   D+ TL  VL   A L  + LG  + C A    +  +++V   L+ 
Sbjct: 215 LEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAVTSEMIQNMFVGNCLVD 274

Query: 261 LYSKCGKSHKGRILFDQIDQPDLISYNAMISGYTFNHETESAVTLF-------------- 320
           +Y+KCG   +   +F  +   D++S+NAM++GY+     E AV LF              
Sbjct: 275 MYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKMDVVT 334

Query: 321 ---------------------RELLASGQGVNSSTLVGLIPVYSPFNHLQLTLLIQNLSL 380
                                R++L+SG   N  TL+ ++   +    L     I   ++
Sbjct: 335 WSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCASVGALMHGKEIHCYAI 394

Query: 381 KLGIILQPS-------VSTALTTVYCRLNEVQFARQLFDE-SP-EKSLASWNAMISGYTQ 440
           K  I L+ +       V   L  +Y +  +V  AR +FD  SP E+ + +W  MI GY+Q
Sbjct: 395 KYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGGYSQ 454

Query: 441 NGLTDRAISLFQEMMP---QLSPNPVTVTSILSACAQLGALSIGKWVHG-LIKSERLESN 500
           +G  ++A+ L  EM     Q  PN  T++  L ACA L AL IGK +H   +++++    
Sbjct: 455 HGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAVP 514

Query: 501 LYVSTALVDMYAKCGSIVEARQLFDLMVDKNVVTWNAMITGYGLHGHGKEALKLFYEMLQ 560
           L+VS  L+DMYAKCGSI +AR +FD M+ KN VTW +++TGYG+HG+G+EAL +F EM +
Sbjct: 515 LFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMRR 574

Query: 561 SGIPPTGVTFLSILYACSHSGLVREGNEIFHSMANDYGFQPMSEHYACMVDILGRAGQLT 620
            G    GVT L +LYACSHSG++ +G E F+ M   +G  P  EHYAC+VD+LGRAG+L 
Sbjct: 575 IGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRLN 634

Query: 621 NALEFIERMPLEPGPAVWGALLGACMIHKNTEIANVASKRLFQLDPENVGYYVLLSNIYS 680
            AL  IE MP+EP P VW A L  C IH   E+   A++++ +L   + G Y LLSN+Y+
Sbjct: 635 AALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLYA 694

Query: 681 TDRNFPKAASVRQVVKKRKLAKTPGCTLIEIGDQQYVFTSGDRSHPQATAIFEMLEKLTG 740
               +     +R +++ + + K PGC+ +E       F  GD++HP A  I+++L     
Sbjct: 695 NAGRWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKTHPHAKEIYQVLLDHMQ 754

Query: 741 KMREAGYQAETVTTALHDVEDEEKELMVNVHSEKLAIAFGLISTEPGTEIRIIKNLRVCL 789
           ++++ GY  ET   ALHDV+DEEK+ ++  HSEKLA+A+G+++T  G  IRI KNLRVC 
Sbjct: 755 RIKDIGYVPET-GFALHDVDDEEKDDLLFEHSEKLALAYGILTTPQGAAIRITKNLRVCG 814

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9SUH6 | 3.9e-294 | 64.17 | Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX...
Q3E6Q1 | 4.8e-167 | 39.35 | Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
O81767 | 8.5e-156 | 38.11 | Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX...
Q9LTV8 | 1.4e-150 | 38.88 | Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q9LFL5 | 1.3e-148 | 36.51 | Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
A0A5A7U078 | 0.0e+00 | 100.00 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S4E1Q1 | 0.0e+00 | 100.00 | pentatricopeptide repeat-containing protein At4g30700 OS=Cucumis melo OX=3656 GN...
A0A0A0LMK7 | 0.0e+00 | 97.93 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0703...
A0A6J1EDA0 | 0.0e+00 | 90.36 | pentatricopeptide repeat-containing protein At4g30700 OS=Cucurbita moschata OX=3...
A0A6J1IEL0 | 0.0e+00 | 90.10 | pentatricopeptide repeat-containing protein At4g30700 OS=Cucurbita maxima OX=366...

Match Name | E-value | Identity (%) | Description
XP_016902152.1 | 0.0 | 100.00 | PREDICTED: pentatricopeptide repeat-containing protein At4g30700 [Cucumis melo] ...
XP_004152852.1 | 0.0 | 97.72 | pentatricopeptide repeat-containing protein At4g30700 [Cucumis sativus]
KGN61216.1 | 0.0 | 97.93 | hypothetical protein Csa_006224 [Cucumis sativus]
XP_038889958.1 | 0.0 | 94.92 | pentatricopeptide repeat-containing protein At4g30700-like isoform X2 [Benincasa...
XP_038889951.1 | 0.0 | 94.92 | pentatricopeptide repeat-containing protein At4g30700-like isoform X1 [Benincasa...

Match Name | E-value | Identity (%) | Description
AT4G30700.1 | 2.8e-295 | 64.17 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1 | 3.4e-168 | 39.35 | Pentatricopeptide repeat (PPR) superfamily protein
AT4G33990.1 | 6.0e-157 | 38.11 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1 | 1.0e-151 | 38.88 | mitochondrial editing factor 22
AT5G16860.1 | 9.3e-150 | 36.51 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
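IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 653..778; e-value: 3.8E-41; score: 139.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 503..664; e-value: 8.9E-26; score: 93.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 17..130; e-value: 2.2E-7; score: 32.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 236..331; e-value: 2.4E-13; score: 51.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 332..433; e-value: 4.0E-16; score: 60.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 131..232; e-value: 4.3E-16; score: 60.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 435..502; e-value: 1.9E-9; score: 39.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 354..639
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 479..526; e-value: 3.5E-12; score: 46.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 178..225; e-value: 1.3E-7; score: 31.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 382..426; e-value: 1.4E-8; score: 34.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 482..515; e-value: 2.1E-8; score: 31.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 454..481; e-value: 5.2E-4; score: 18.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 180..213; e-value: 7.9E-6; score: 23.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 383..408; e-value: 1.9E-6; score: 25.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 555..578; e-value: 0.59; score: 10.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 281..310; e-value: 4.8E-4; score: 20.2
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 178..212; score: 10.621557
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 279..313; score: 9.262356
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 380..415; score: 10.128299
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 480..514; score: 13.252269
None | No IPR available | PANTHER | PTHR47924 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 47..782
None | No IPR available | PANTHER | PTHR47924:SF42 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 47..782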
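
The `coord:` entries above can be read into numeric spans for sorting or length checks. The following is a minimal Python sketch, assuming the coordinates are 1-based, inclusive residue positions on the predicted protein (the page does not state this explicitly); `parse_coords` and `span_length` are hypothetical helper names. The sample text holds the four PROSITE PS51375 (PPR) hits listed in this record.

```python
import re

# Matches entries of the form "coord: 178..212".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in the text, in input order."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# PROSITE PS51375 (PPR) hits copied from the InterPro annotation above.
prosite_ppr = """coord: 178..212
coord: 279..313
coord: 380..415
coord: 480..514"""

spans = sorted(parse_coords(prosite_ppr))
lengths = [span_length(start, end) for start, end in spans]

for (start, end), length in zip(spans, lengths):
    print(f"{start}..{end}\t{length} residues")
```

Under the stated assumption each hit spans 35 or 36 residues, which is consistent with the roughly 35-residue length of a pentatricopeptide repeat.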

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
IVF0023772.2 | IVF0023772.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009451 | RNA modification
cellular_component | GO:0043231 | intracellular membrane-bounded organelle
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003723 | RNA binding
molecular_function | GO:0008270 | zinc ion binding