Sgr029636 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029636
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00153446: 2169694 .. 2171742 (-)
RNA-Seq Expression: Sgr029636
Synteny: Sgr029636
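
The Location field above packs the contig name, start, end, and strand into one string. A minimal Python sketch for unpacking it is given here. The names `parse_location` and `span_length` and the regular expression are illustrative assumptions inferred from this one record, not part of the database's own tooling, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

# Hypothetical pattern, inferred from the single Location string on this page:
# "<contig>: <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (contig, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153446: 2169694 .. 2171742 (-)")
print(contig, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2049 bp, and the `-` strand indicates that the feature is annotated on the reverse strand of the contig.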
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGCGGCGTTGGCAATGTTCAAAATCTTGAGGAGTTCTTCTTCAGGTTGCACGAGAACAGCAAAAACAGAGACAGATGCATTCTGTTTTATGGTGTTGAGATTATACAGCGCGAGACGAAACTGCAACAGAAGGAACCTCTTCGCGAGGATTAGTCCTCTCGGTGACCCTCAGCTTAGTGTAGTCCCGATTCTTGATCAGTGGATTGAGGAAGGCAGGAAGATGAAGGACTTTGAGCTCCGGAGGATCGTTCGCGACCTTCGTACTTGTCGGCGATATGGCCAAGCCCTTGAGGTGAGCGCGATTGGAAAACACACCCAAATTTCTTCTTGGTTTTACCATTTCCCGTGAAAGGAACACTGGTATTATATATTTTGTATGTTGCATGAAGTAAAATTCTCGTGGTTTATTGAATAAATGCTGGCGAGAGAGGAATCCCAAATTCTTGTTCGTATTTTATCCCCATAATGCCTTGATTGATTGCTGAAAATATTGTGGAGGAAGGCAACAACCTAGAAAAATGCATTTTGAATTTTGAAGCAGACAAATTTAGTTAAGCTAATTGCGATTTTACTTGTTCACCTTCATTACAACATGATTTTGGCTATGCAATTTGGTTGGCTTCTCATGTTGATCCGAATGAATCGTCGATGTAAATCTTTGCCTGCTGGGTAAGATGATAGCAAATTTTATGTTATGTGCAATTGTTGAATCTTGTGGTTAGCTCAGCTCTAGGTTGATGTCCTAAGCTTTTTTCAACCCACTTGATTTCATCGGCCTAGATTTTATTTTTCTTATATCTATAATATTCTGGATAGTTGATCATTTGTTTCTCTTGGATTATTCAAATTGAAGGTGTCTGAATGGATGAGTAGCAAGGGACTTTTTACCCTTACAACAAGAGACTTTGCTGTACAGCTCGATCTGATTGGCCGAGTTCGGGGGCTGGATTCTGCAGAGAAGTACTTCAGCCGTGTTTCTAACCAGGAGGAAATTGGTAAACTTTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTGGATAAGTCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCACTCCCCTCAGCTACAATGATATCATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCAAATGTACTGTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTTTAGCTATAGAATTTGCATCAATTCTTATGGAGCTAGGTCTGATCTAATCAGTATGGAGAAGGTTTTGAAAGAAATGGAGAGCCAAACACACATATCCATGGACTGGACTACTTATTCAATGGTTGCTAATTTTTTCATAAAAGCCGGTTTGCATGAGAAAGCAATGAATTACCTTCGAAAATGTGAGGACAAGGTAGATCAAGATGCGCTCGGCTTCAATCATCTCATTTCACTCTATGCCAGTCTGGGACGCAAGGACGAAATAACGAGATTGTGGGCTCTCCAGAAGGAGAAGTGTAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGAGTTCTTTGGTAAAGCTCGAGGAGCTTGAGGAAGCAGAGAATCTAATCAGGGAATGGGAGTCATCCTGCAAGTGTTATGATTTTCGAGTTCCAAACATTCTTCTTATCGGGTACTCACAAAAAGGGTTAATCGAAAGAGCTGAAAAGATGCTTCGAAACATCGTCAGAGAAGGGAGAATCCCACCCCCAAATAGTTGGGCCATTTTTGCAGCAGGGTACTTGGAAAAGCAAAACCTGGAGAAGGCTTTCAAGTGCATGAAGGAAGCTCTTGCTGTACAAGAGCAGAACAAAGGGTGGAGGCCCAAACCTAGTGTTTTGTCAAGCATAATGCAATGGCTATCTGAAAATGAAAGATATGAGGAACTGAAAGAGTTTCTGAGCTCATTGAAGACTGTACCTACCTTGGATGGAAAACTAAATAGTGCCTTAGATGAGCTTCTGGAAACCTTAGA
TGATGATGAAAATGAAATAGTGACGACCCGCGAATTAGAGGAGAGGTGA

mRNA sequence

ATGGCGGCGGCGTTGGCAATGTTCAAAATCTTGAGGAGTTCTTCTTCAGGTTGCACGAGAACAGCAAAAACAGAGACAGATGCATTCTGTTTTATGGTGTTGAGATTATACAGCGCGAGACGAAACTGCAACAGAAGGAACCTCTTCGCGAGGATTAGTCCTCTCGGTGACCCTCAGCTTAGTGTAGTCCCGATTCTTGATCAGTGGATTGAGGAAGGCAGGAAGATGAAGGACTTTGAGCTCCGGAGGATCGTTCGCGACCTTCGTACTTGTCGGCGATATGGCCAAGCCCTTGAGGTGTCTGAATGGATGAGTAGCAAGGGACTTTTTACCCTTACAACAAGAGACTTTGCTGTACAGCTCGATCTGATTGGCCGAGTTCGGGGGCTGGATTCTGCAGAGAAGTACTTCAGCCGTGTTTCTAACCAGGAGGAAATTGGTAAACTTTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTGGATAAGTCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCACTCCCCTCAGCTACAATGATATCATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCAAATGTACTGTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTTTAGCTATAGAATTTGCATCAATTCTTATGGAGCTAGGTCTGATCTAATCAGTATGGAGAAGGTTTTGAAAGAAATGGAGAGCCAAACACACATATCCATGGACTGGACTACTTATTCAATGGTTGCTAATTTTTTCATAAAAGCCGGTTTGCATGAGAAAGCAATGAATTACCTTCGAAAATGTGAGGACAAGGTAGATCAAGATGCGCTCGGCTTCAATCATCTCATTTCACTCTATGCCAGTCTGGGACGCAAGGACGAAATAACGAGATTGTGGGCTCTCCAGAAGGAGAAGTGTAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGAGTTCTTTGGTAAAGCTCGAGGAGCTTGAGGAAGCAGAGAATCTAATCAGGGAATGGGAGTCATCCTGCAAGTGTTATGATTTTCGAGTTCCAAACATTCTTCTTATCGGGTACTCACAAAAAGGGTTAATCGAAAGAGCTGAAAAGATGCTTCGAAACATCGTCAGAGAAGGGAGAATCCCACCCCCAAATAGTTGGGCCATTTTTGCAGCAGGGTACTTGGAAAAGCAAAACCTGGAGAAGGCTTTCAAGTGCATGAAGGAAGCTCTTGCTGTACAAGAGCAGAACAAAGGGTGGAGGCCCAAACCTAGTGTTTTGTCAAGCATAATGCAATGGCTATCTGAAAATGAAAGATATGAGGAACTGAAAGAGTTTCTGAGCTCATTGAAGACTGTACCTACCTTGGATGGAAAACTAAATAGTGCCTTAGATGAGCTTCTGGAAACCTTAGATGATGATGAAAATGAAATAGTGACGACCCGCGAATTAGAGGAGAGGTGA

Coding sequence (CDS)

ATGGCGGCGGCGTTGGCAATGTTCAAAATCTTGAGGAGTTCTTCTTCAGGTTGCACGAGAACAGCAAAAACAGAGACAGATGCATTCTGTTTTATGGTGTTGAGATTATACAGCGCGAGACGAAACTGCAACAGAAGGAACCTCTTCGCGAGGATTAGTCCTCTCGGTGACCCTCAGCTTAGTGTAGTCCCGATTCTTGATCAGTGGATTGAGGAAGGCAGGAAGATGAAGGACTTTGAGCTCCGGAGGATCGTTCGCGACCTTCGTACTTGTCGGCGATATGGCCAAGCCCTTGAGGTGTCTGAATGGATGAGTAGCAAGGGACTTTTTACCCTTACAACAAGAGACTTTGCTGTACAGCTCGATCTGATTGGCCGAGTTCGGGGGCTGGATTCTGCAGAGAAGTACTTCAGCCGTGTTTCTAACCAGGAGGAAATTGGTAAACTTTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTGGATAAGTCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCACTCCCCTCAGCTACAATGATATCATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCAAATGTACTGTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTTTAGCTATAGAATTTGCATCAATTCTTATGGAGCTAGGTCTGATCTAATCAGTATGGAGAAGGTTTTGAAAGAAATGGAGAGCCAAACACACATATCCATGGACTGGACTACTTATTCAATGGTTGCTAATTTTTTCATAAAAGCCGGTTTGCATGAGAAAGCAATGAATTACCTTCGAAAATGTGAGGACAAGGTAGATCAAGATGCGCTCGGCTTCAATCATCTCATTTCACTCTATGCCAGTCTGGGACGCAAGGACGAAATAACGAGATTGTGGGCTCTCCAGAAGGAGAAGTGTAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGAGTTCTTTGGTAAAGCTCGAGGAGCTTGAGGAAGCAGAGAATCTAATCAGGGAATGGGAGTCATCCTGCAAGTGTTATGATTTTCGAGTTCCAAACATTCTTCTTATCGGGTACTCACAAAAAGGGTTAATCGAAAGAGCTGAAAAGATGCTTCGAAACATCGTCAGAGAAGGGAGAATCCCACCCCCAAATAGTTGGGCCATTTTTGCAGCAGGGTACTTGGAAAAGCAAAACCTGGAGAAGGCTTTCAAGTGCATGAAGGAAGCTCTTGCTGTACAAGAGCAGAACAAAGGGTGGAGGCCCAAACCTAGTGTTTTGTCAAGCATAATGCAATGGCTATCTGAAAATGAAAGATATGAGGAACTGAAAGAGTTTCTGAGCTCATTGAAGACTGTACCTACCTTGGATGGAAAACTAAATAGTGCCTTAGATGAGCTTCTGGAAACCTTAGATGATGATGAAAATGAAATAGTGACGACCCGCGAATTAGAGGAGAGGTGA

Protein sequence

MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQLSVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLDDDENEIVTTRELEER
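
The protein sequence above corresponds to a translation of the coding sequence. The relationship can be sketched in Python as follows, assuming the standard nuclear genetic code; the function name `translate` is a hypothetical helper, and the two short inputs are the first 36 and last 12 nucleotides of the CDS listed above.

```python
# Standard genetic code, built from the conventional TCAG ordering
# (first, second, then third codon position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an ambiguous codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 12 codons and last 4 codons of the CDS on this page.
print(translate("ATGGCGGCGGCGTTGGCAATGTTCAAAATCTTGAGG"))
print(translate("GAGGAGAGGTGA"))
```

The two fragments translate to `MAAALAMFKILR` and `EER`, matching the start and end of the protein sequence shown above; the final `TGA` is the stop codon and contributes no residue.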
Homology
BLAST of Sgr029636 vs. NCBI nr
Match: XP_022142737.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Momordica charantia])

HSP 1 Score: 867.8 bits (2241), Expect = 4.4e-248
Identity = 430/495 (86.87%), Positives = 464/495 (93.74%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           MAAALAMFKIL S SS   RT +TETD+FC +VLRLYS RRNCNRRNLFARISPLGDP+L
Sbjct: 1   MAAALAMFKIL-SRSSNFERTVRTETDSFCSLVLRLYSTRRNCNRRNLFARISPLGDPEL 60

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVV ILDQWIEEGRK+KDFELRRIVRDLR+CRRYGQALEVSEWMSSKGLF LTTRDFAVQ
Sbjct: 61  SVVQILDQWIEEGRKIKDFELRRIVRDLRSCRRYGQALEVSEWMSSKGLFPLTTRDFAVQ 120

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIGRVRGLDSAEKYFS VSNQEEIGKLYGALLNCYVREGLVDKSL+HMQ+MK+MGFAS
Sbjct: 121 LDLIGRVRGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKSLTHMQEMKQMGFAS 180

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           TPL+YNDIMCLYLNTG VDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLI+MEKVL
Sbjct: 181 TPLNYNDIMCLYLNTGHVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLITMEKVL 240

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEMESQ+HISMDWTTYSMVANFFIKA +HE+A+NYLRKCEDKVD+DALGFNHLISLY SL
Sbjct: 241 KEMESQSHISMDWTTYSMVANFFIKASMHEEALNYLRKCEDKVDRDALGFNHLISLYTSL 300

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           G  DE+ RLWALQKEKCKKQVNRDYITML SLVKLE LEEAENL++EWESSC+CYDFRVP
Sbjct: 301 GHNDEVMRLWALQKEKCKKQVNRDYITMLGSLVKLERLEEAENLLKEWESSCQCYDFRVP 360

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N+LLIGYSQKGLIERAEKMLR+I REGRIPPPNSWAI AAGYLEKQNLEKAFKCM EALA
Sbjct: 361 NVLLIGYSQKGLIERAEKMLRSIAREGRIPPPNSWAIIAAGYLEKQNLEKAFKCMNEALA 420

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           V+EQN GWRPKPSV+SSI++WLSEN RY ELKEFLSSLKTVP++DGKL++ALDEL+ETL+
Sbjct: 421 VKEQNNGWRPKPSVVSSILRWLSENGRYGELKEFLSSLKTVPSMDGKLHNALDELVETLE 480

Query: 481 DDENEIVTTRELEER 496
           +D  EI  T EL+ R
Sbjct: 481 ND-GEIAMTGELQVR 493

BLAST of Sgr029636 vs. NCBI nr
Match: XP_023545982.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 855.5 bits (2209), Expect = 2.2e-244
Identity = 418/483 (86.54%), Positives = 456/483 (94.41%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           +AAALAMFKILRS SSG TRTA+TETDAFCF+ LRLYSARR CNRRNLFARISPLG P+L
Sbjct: 38  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 97

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVVPILDQWI+EGR +KDFE+RRIVRDLR CRRYGQALEVSEWM SKGLF+ TTRDFAVQ
Sbjct: 98  SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQ 157

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIGRV+GLDSAEKYFS VSNQEEIGKLYGALLNCYVREGLVDK+LSHMQKMKEMGFAS
Sbjct: 158 LDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 217

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           +PL YNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDN+SYRICI+SYGARSDLI M KVL
Sbjct: 218 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVL 277

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           +EMESQTHISMDWTTYSMVANFFIKAG+HE+AM+YLRKCEDKV+QDALGFNHLISLY SL
Sbjct: 278 REMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 337

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           GRKDE+ RLWALQK KCKKQVNRDYITML  LVKLE LEEAE L++EWESSC+CYDFRVP
Sbjct: 338 GRKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVP 397

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N+LLIGYSQ+GLIERAEKML+NI+ +GRIPPPNSW I AAGYLEKQN E+AFKCMKEA+A
Sbjct: 398 NVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVA 457

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           VQEQNKGWRPKPSVLSSI++WLSEN RYEELKEFLSSLKTVP++DGKL++A DELLETL 
Sbjct: 458 VQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLK 517

Query: 481 DDE 484
           +++
Sbjct: 518 NND 519

BLAST of Sgr029636 vs. NCBI nr
Match: KAG6600732.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 854.7 bits (2207), Expect = 3.8e-244
Identity = 418/483 (86.54%), Positives = 454/483 (94.00%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           +AAALAMFKILRS SSG TRTA+TETDAFCF+ LRLYSARR CNRRNLFARISPLG P+L
Sbjct: 49  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 108

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVVPILDQWI+EGR +KDFE+RRIVRDLR CRRYGQAL+VSEWM SKGLF+ TTRDFAVQ
Sbjct: 109 SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQ 168

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIGRVRG+DSAEKYF  VSNQEEIGKLYGALLNCYVREGLVDK+LSHMQKMKEMGFAS
Sbjct: 169 LDLIGRVRGIDSAEKYFGSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 228

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           +PL YNDIMCLYLNTGQVDKVPNVLSEMK NGVLPDN+SYRICI+SYGARSDLI M KVL
Sbjct: 229 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKVNGVLPDNYSYRICISSYGARSDLIGMLKVL 288

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEMESQTHISMDWTTYSMVANFFIKAG+HEKAM+YLRKCEDKV+QDALGFNHLISLY SL
Sbjct: 289 KEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSL 348

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           GRKDE+ RLWALQK KCKKQVNRDYITML  LVKLE LEEAE L++EWESSC+CYDFRVP
Sbjct: 349 GRKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVP 408

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N+LLIGYSQKGLIERAEKML+NI+ +GRIPPPNSW I AAGYLEKQN E+AFKCMKEA+A
Sbjct: 409 NVLLIGYSQKGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVA 468

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           VQEQNKGWRPKPSVLSSI++WLSEN RYEELKEFLSSLKTVP++DGKL++A DELLETL 
Sbjct: 469 VQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLK 528

Query: 481 DDE 484
           +++
Sbjct: 529 NND 530

BLAST of Sgr029636 vs. NCBI nr
Match: XP_022989754.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 853.6 bits (2204), Expect = 8.5e-244
Identity = 423/494 (85.63%), Positives = 459/494 (92.91%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           +AAALAMFKILRS SSG TRTA+TETDAFCF+ LRLYSARR CNRRNLFARISPLG P+L
Sbjct: 45  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 104

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVVPILDQWI+EGR +KDFELRRIVRDLR CRRYGQALEVSEWM SKGLF+ TTRDFAVQ
Sbjct: 105 SVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQ 164

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIGRV+GLDSAEKYFS VSNQEE+GKLYGALLNCYVREGLVDK+LSHMQKMKEMGFAS
Sbjct: 165 LDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 224

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           +PL YNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDN+SYRICI+SYGARSDLI M KVL
Sbjct: 225 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVL 284

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEMESQTHISMDWTTYSMVANFFIKAG+HEKAM+YLRKCEDKV+QDALGFNHLISLY SL
Sbjct: 285 KEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSL 344

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           G KDE+ RLWALQK KCKKQVNRDYITML  LVKLE LEEAE L++EW SSC+CYDFRVP
Sbjct: 345 GCKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVP 404

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N+LLIGYS++GLIERAEKML+NI+ +GRIPPPNSW I AAGYLEKQNLEKAFKCMKEA+A
Sbjct: 405 NVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVA 464

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           VQEQNKGWRPKPSVLSSI++WLSEN RYEELKEFLSSLKTVP++DGKL++A DELLETL 
Sbjct: 465 VQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLK 524

Query: 481 DDENEIVTTRELEE 495
           +  N+  T   L+E
Sbjct: 525 N--NDETTADALKE 535

BLAST of Sgr029636 vs. NCBI nr
Match: XP_022942045.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 852.4 bits (2201), Expect = 1.9e-243
Identity = 416/483 (86.13%), Positives = 454/483 (94.00%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           +AAALAMFKILRS SSG TRTA+TETDAFCF+ LRLYSARR CNRRNLFARISPLG P+L
Sbjct: 49  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 108

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVVPILDQWI+EGR +KDFE+RRIVRDLR CRRYGQAL+VSEWM SKGLF+ TTRDFAVQ
Sbjct: 109 SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQ 168

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIGRVRG+DSAEKYFS VSNQEEIGKLYGALLNCYVREGLVDK+LSHMQKMKEMGFAS
Sbjct: 169 LDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 228

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           +PL YNDIMCLYLNTGQVDKVPNVLSEMK+NGVLPDN+SYRICI+SYGARSDLI M KVL
Sbjct: 229 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVL 288

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEMESQTHISMDWTTYSMVANFFIKAG+HE+AM+YLRKCEDKV+QDALGFNHLISLY SL
Sbjct: 289 KEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 348

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           GRKDE+ RLWALQK KCKKQVNRDYITML  LVKLE LEEAE L+ EWESSC+CYDFRVP
Sbjct: 349 GRKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVP 408

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N+LLIGYSQ+GLIERAEKML+NI+ +GRIPPPNSW I AAGYLEKQN E+AFKCMKEA+A
Sbjct: 409 NVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVA 468

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           VQEQNKGWRPKPSVLSSI++WLSEN RYEELKEFLSSLK VP++DGKL++A DELLETL 
Sbjct: 469 VQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLK 528

Query: 481 DDE 484
           +++
Sbjct: 529 NND 530

BLAST of Sgr029636 vs. ExPASy Swiss-Prot
Match: Q84JR3 (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 434.1 bits (1115), Expect = 2.1e-120
Identity = 213/468 (45.51%), Positives = 312/468 (66.67%), Query Frame = 0

Query: 32  MVLRLYSARRNCNRRNLFARISPLGDPQLSVVPILDQWIEEGRKMKDFELRRIVRDLRTC 91
           ++   Y       +  L+++ISPLGDP+ SV P L  W++ G+K+   EL RIV DLR  
Sbjct: 11  LIASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRR 70

Query: 92  RRYGQALEVSEWMSSKGLFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQEEIGKLYG 151
           +R+  ALEVS+WM+  G+   +  + AV LDLIGRV G  +AE+YF  +  Q +  K YG
Sbjct: 71  KRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYG 130

Query: 152 ALLNCYVREGLVDKSLSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKEN 211
           ALLNCYVR+  V+KSL H +KMKEMGF ++ L+YN+IMCLY N GQ +KVP VL EMKE 
Sbjct: 131 ALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEE 190

Query: 212 GVLPDNFSYRICINSYGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEK 271
            V PDN+SYRICIN++GA  DL  +   L++ME +  I+MDW TY++ A F+I  G  ++
Sbjct: 191 NVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDR 250

Query: 272 AMNYLRKCEDKVD-QDALGFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLS 331
           A+  L+  E++++ +D  G+NHLI+LYA LG+K E+ RLW L+K+ CK+++N+DY+T+L 
Sbjct: 251 AVELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 310

Query: 332 SLVKLEELEEAENLIREWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIP 391
           SLVK++ L EAE ++ EW+SS  CYDFRVPN ++ GY  K + E+AE ML ++ R G+  
Sbjct: 311 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 370

Query: 392 PPNSWAIFAAGYLEKQNLEKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEE 451
            P SW + A  Y EK  LE AFKCMK AL V+  ++ WRP  ++++S++ W+ +    +E
Sbjct: 371 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 430

Query: 452 LKEFLSSLKTVPTLDGKLNSAL------------DELLETLDDDENEI 487
           ++ F++SL+    ++ ++  AL            D LL+ + DD+ EI
Sbjct: 431 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEI 478

BLAST of Sgr029636 vs. ExPASy Swiss-Prot
Match: Q9SKU6 (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 299.7 bits (766), Expect = 6.2e-80
Identity = 152/410 (37.07%), Positives = 246/410 (60.00%), Query Frame = 0

Query: 51  RISPLGDPQLSVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLF 110
           R++  GDP  S++ +LD W+++G  +K  EL  I++ LR   R+  AL++S+WMS   + 
Sbjct: 43  RVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEHRVH 102

Query: 111 TLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHM 170
            ++  D A++LDLI +V GL  AEK+F  +  +     LYGALLNCY  + ++ K+    
Sbjct: 103 EISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKAEQVF 162

Query: 171 QKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGAR 230
           Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM++  V PD F+    +++Y   
Sbjct: 163 QEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAYSVV 222

Query: 231 SDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDAL-- 290
           SD+  MEK L   E+   + +DW TY+  AN +IKAGL EKA+  LRK E  V+      
Sbjct: 223 SDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQKRKH 282

Query: 291 GFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREW 350
            +  L+S Y + G+K+E+ RLW+L KE      N  YI+++S+L+K++++EE E ++ EW
Sbjct: 283 AYEVLMSFYGAAGKKEEVYRLWSLYKE-LDGFYNTGYISVISALLKMDDIEEVEKIMEEW 342

Query: 351 ESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNL 410
           E+    +D R+P++L+ GY +KG++E+AE+++  +V++ R+   ++W   A GY     +
Sbjct: 343 EAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAGKM 402

Query: 411 EKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSL 459
           EKA +  K A+ V +   GWRP   VL S + +L      E L++ L  L
Sbjct: 403 EKAVEKWKRAIEVSK--PGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL 449

BLAST of Sgr029636 vs. ExPASy Swiss-Prot
Match: Q8LPS6 (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 286.2 bits (731), Expect = 7.1e-76
Identity = 157/448 (35.04%), Positives = 260/448 (58.04%), Query Frame = 0

Query: 48  LFARISPLGDPQLSVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSK 107
           ++ +IS +  P+L    +L+QW + GRK+  +EL R+V++LR  +R  QALEV +WM+++
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 108 G-LFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKS 167
           G  F L+  D A+QLDLIG+VRG+  AE++F ++    +  ++YG+LLN YVR    +K+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 168 LSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINS 227
            + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK+  +  D +SY I ++S
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 228 YGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKV-DQ 287
            G+   +  ME V ++M+S   I  +WTT+S +A  +IK G  EKA + LRK E ++  +
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 288 DALGFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLI 347
           + + +++L+SLY SLG K E+ R+W + K       N  Y  ++SSLV++ ++E AE + 
Sbjct: 309 NRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVY 368

Query: 348 REWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEK 407
            EW      YD R+PN+L+  Y +   +E AE +  ++V  G  P  ++W I A G+  K
Sbjct: 369 EEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRK 428

Query: 408 QNLEKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLD 467
           + + +A  C++ A +  E +  WRPK  +LS   +   E       +  L  L+    L+
Sbjct: 429 RCISEALTCLRNAFSA-EGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLE 488

Query: 468 GKLNSALDELLETLDDDENEIVTTRELE 494
            K   AL      +D DEN  V   E++
Sbjct: 489 DKSYLAL------IDVDENRTVNNSEID 509

BLAST of Sgr029636 vs. ExPASy Swiss-Prot
Match: O22714 (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=1 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 7.6e-70
Identity = 137/438 (31.28%), Positives = 245/438 (55.94%), Query Frame = 0

Query: 32  MVLRLYSARRNCNRRN--------LFARISPLGDPQLSVVPILDQWIEEGRKMKDFELRR 91
           M +R  S  R+  +R+        L+ R+   G  ++ V   L+Q+++  + +  +E+  
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 92  IVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQ 151
            ++ LR    Y  AL++SE M  +G+   T  D A+ LDL+ + R + + E YF  +   
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPET 120

Query: 152 EEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPN 211
            +    YG+LLNCY +E L +K+   + KMKE+    + +SYN +M LY  TG+ +KVP 
Sbjct: 121 SKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPA 180

Query: 212 VLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFF 271
           ++ E+K   V+PD+++Y + + +  A +D+  +E+V++EM     ++ DWTTYS +A+ +
Sbjct: 181 MIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIY 240

Query: 272 IKAGLHEKAMNYLRKCEDK-VDQDALGFNHLISLYASLGRKDEITRLWALQKEKCKKQVN 331
           + AGL +KA   L++ E K   +D   +  LI+LY  LG+  E+ R+W   +    K  N
Sbjct: 241 VDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 300

Query: 332 RDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRN 391
             Y+ M+  LVKL +L  AE L +EW+++C  YD R+ N+L+  Y+Q+GLI++A ++   
Sbjct: 301 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 360

Query: 392 IVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALAVQEQNKG-WRPKPSVLSSIMQW 451
             R G      +W IF   Y++  ++ +A +CM +A+++ + + G W P P  + ++M +
Sbjct: 361 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 420

Query: 452 LSENERYEELKEFLSSLK 460
             + +     +  L  LK
Sbjct: 421 FEQKKDVNGAENLLEILK 437

BLAST of Sgr029636 vs. ExPASy Swiss-Prot
Match: Q3E911 (Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX=3702 GN=At5g27460 PE=2 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 8.2e-64
Identity = 142/418 (33.97%), Positives = 229/418 (54.78%), Query Frame = 0

Query: 46  RNLFARISPLGDPQLSVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMS 105
           RN    I     P+ SV  +L + I+ G  +   ELR I + L    RY  AL++ EWM 
Sbjct: 38  RNSLKEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQMMEWME 97

Query: 106 SKGLFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRV---SNQEEIGK-LYGALLNCYVREG 165
           ++     +  D A++LDLI +  GL   E+YF ++   S    + K  Y  LL  YV+  
Sbjct: 98  NQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRAYVKNK 157

Query: 166 LVDKSLSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYR 225
           +V ++ + M+K+  +GF  TP  +N++M LY  +GQ +KV  V+S MK N +  +  SY 
Sbjct: 158 MVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRNVLSYN 217

Query: 226 ICINSYGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCED 285
           + +N+    S + ++E V KEM     + + W++   +AN +IK+G  EKA   L   E 
Sbjct: 218 LWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVLEDAEK 277

Query: 286 KVDQ-DALGFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEE 345
            +++ + LG+  LI+LYASLG K+ + RLW + K  C +    +YI +LSSLVK  +LEE
Sbjct: 278 MLNRSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLVKTGDLEE 337

Query: 346 AENLIREWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAA 405
           AE +  EWE+ C  YD RV N+LL  Y + G I +AE +   ++  G  P   +W I   
Sbjct: 338 AERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYKTWEILME 397

Query: 406 GYLEKQNLEKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSL 459
           G+++ +N+EKA   M +   +  +   WRP  +++ +I ++  + E+ EE   ++  L
Sbjct: 398 GWVKCENMEKAIDAMHQVFVLMRRCH-WRPSHNIVMAIAEYFEKEEKIEEATAYVRDL 454

BLAST of Sgr029636 vs. ExPASy TrEMBL
Match: A0A6J1CNQ2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like OS=Momordica charantia OX=3673 GN=LOC111012782 PE=4 SV=1)

HSP 1 Score: 867.8 bits (2241), Expect = 2.1e-248
Identity = 430/495 (86.87%), Positives = 464/495 (93.74%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           MAAALAMFKIL S SS   RT +TETD+FC +VLRLYS RRNCNRRNLFARISPLGDP+L
Sbjct: 1   MAAALAMFKIL-SRSSNFERTVRTETDSFCSLVLRLYSTRRNCNRRNLFARISPLGDPEL 60

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVV ILDQWIEEGRK+KDFELRRIVRDLR+CRRYGQALEVSEWMSSKGLF LTTRDFAVQ
Sbjct: 61  SVVQILDQWIEEGRKIKDFELRRIVRDLRSCRRYGQALEVSEWMSSKGLFPLTTRDFAVQ 120

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIGRVRGLDSAEKYFS VSNQEEIGKLYGALLNCYVREGLVDKSL+HMQ+MK+MGFAS
Sbjct: 121 LDLIGRVRGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKSLTHMQEMKQMGFAS 180

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           TPL+YNDIMCLYLNTG VDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLI+MEKVL
Sbjct: 181 TPLNYNDIMCLYLNTGHVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLITMEKVL 240

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEMESQ+HISMDWTTYSMVANFFIKA +HE+A+NYLRKCEDKVD+DALGFNHLISLY SL
Sbjct: 241 KEMESQSHISMDWTTYSMVANFFIKASMHEEALNYLRKCEDKVDRDALGFNHLISLYTSL 300

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           G  DE+ RLWALQKEKCKKQVNRDYITML SLVKLE LEEAENL++EWESSC+CYDFRVP
Sbjct: 301 GHNDEVMRLWALQKEKCKKQVNRDYITMLGSLVKLERLEEAENLLKEWESSCQCYDFRVP 360

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N+LLIGYSQKGLIERAEKMLR+I REGRIPPPNSWAI AAGYLEKQNLEKAFKCM EALA
Sbjct: 361 NVLLIGYSQKGLIERAEKMLRSIAREGRIPPPNSWAIIAAGYLEKQNLEKAFKCMNEALA 420

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           V+EQN GWRPKPSV+SSI++WLSEN RY ELKEFLSSLKTVP++DGKL++ALDEL+ETL+
Sbjct: 421 VKEQNNGWRPKPSVVSSILRWLSENGRYGELKEFLSSLKTVPSMDGKLHNALDELVETLE 480

Query: 481 DDENEIVTTRELEER 496
           +D  EI  T EL+ R
Sbjct: 481 ND-GEIAMTGELQVR 493

BLAST of Sgr029636 vs. ExPASy TrEMBL
Match: A0A6J1JQ72 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111486809 PE=4 SV=1)

HSP 1 Score: 853.6 bits (2204), Expect = 4.1e-244
Identity = 423/494 (85.63%), Positives = 459/494 (92.91%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           +AAALAMFKILRS SSG TRTA+TETDAFCF+ LRLYSARR CNRRNLFARISPLG P+L
Sbjct: 45  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 104

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVVPILDQWI+EGR +KDFELRRIVRDLR CRRYGQALEVSEWM SKGLF+ TTRDFAVQ
Sbjct: 105 SVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQ 164

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIGRV+GLDSAEKYFS VSNQEE+GKLYGALLNCYVREGLVDK+LSHMQKMKEMGFAS
Sbjct: 165 LDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 224

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           +PL YNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDN+SYRICI+SYGARSDLI M KVL
Sbjct: 225 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVL 284

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEMESQTHISMDWTTYSMVANFFIKAG+HEKAM+YLRKCEDKV+QDALGFNHLISLY SL
Sbjct: 285 KEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSL 344

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           G KDE+ RLWALQK KCKKQVNRDYITML  LVKLE LEEAE L++EW SSC+CYDFRVP
Sbjct: 345 GCKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVP 404

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N+LLIGYS++GLIERAEKML+NI+ +GRIPPPNSW I AAGYLEKQNLEKAFKCMKEA+A
Sbjct: 405 NVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVA 464

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           VQEQNKGWRPKPSVLSSI++WLSEN RYEELKEFLSSLKTVP++DGKL++A DELLETL 
Sbjct: 465 VQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLK 524

Query: 481 DDENEIVTTRELEE 495
           +  N+  T   L+E
Sbjct: 525 N--NDETTADALKE 535

BLAST of Sgr029636 vs. ExPASy TrEMBL
Match: A0A6J1FVG8 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111447235 PE=4 SV=1)

HSP 1 Score: 852.4 bits (2201), Expect = 9.2e-244
Identity = 416/483 (86.13%), Positives = 454/483 (94.00%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           +AAALAMFKILRS SSG TRTA+TETDAFCF+ LRLYSARR CNRRNLFARISPLG P+L
Sbjct: 49  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 108

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVVPILDQWI+EGR +KDFE+RRIVRDLR CRRYGQAL+VSEWM SKGLF+ TTRDFAVQ
Sbjct: 109 SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQ 168

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIGRVRG+DSAEKYFS VSNQEEIGKLYGALLNCYVREGLVDK+LSHMQKMKEMGFAS
Sbjct: 169 LDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 228

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           +PL YNDIMCLYLNTGQVDKVPNVLSEMK+NGVLPDN+SYRICI+SYGARSDLI M KVL
Sbjct: 229 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVL 288

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEMESQTHISMDWTTYSMVANFFIKAG+HE+AM+YLRKCEDKV+QDALGFNHLISLY SL
Sbjct: 289 KEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 348

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           GRKDE+ RLWALQK KCKKQVNRDYITML  LVKLE LEEAE L+ EWESSC+CYDFRVP
Sbjct: 349 GRKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVP 408

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N+LLIGYSQ+GLIERAEKML+NI+ +GRIPPPNSW I AAGYLEKQN E+AFKCMKEA+A
Sbjct: 409 NVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVA 468

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           VQEQNKGWRPKPSVLSSI++WLSEN RYEELKEFLSSLK VP++DGKL++A DELLETL 
Sbjct: 469 VQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLK 528

Query: 481 DDE 484
           +++
Sbjct: 529 NND 530

BLAST of Sgr029636 vs. ExPASy TrEMBL
Match: A0A0A0L7Y2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G104890 PE=4 SV=1)

HSP 1 Score: 814.7 bits (2103), Expect = 2.1e-232
Identity = 399/493 (80.93%), Positives = 447/493 (90.67%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           MAAA AMFKIL  SSSGCTRT + ETDAFCF+ LRLYS RR+C+RRNL+ARISPLGDP+ 
Sbjct: 1   MAAASAMFKILSRSSSGCTRTLRPETDAFCFVALRLYSTRRSCDRRNLYARISPLGDPEC 60

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           +VVP+L+QWIEEGR +KDFELRRIVRDLRTCRRY QALEVSEWM SKGLF+LTTRDFA+Q
Sbjct: 61  TVVPVLNQWIEEGRNIKDFELRRIVRDLRTCRRYRQALEVSEWMCSKGLFSLTTRDFAIQ 120

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIG+VRGLDSAEKYF  VSNQ+EIGKLYGALLNCYVREGL+DKSL+HMQKMKEMG AS
Sbjct: 121 LDLIGQVRGLDSAEKYFGSVSNQKEIGKLYGALLNCYVREGLIDKSLAHMQKMKEMGLAS 180

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           +PL YNDIMCLYLNTGQ DKVPNVLSEMKENGVLPDNFSYRICI+SYGARSD+ISME VL
Sbjct: 181 SPLCYNDIMCLYLNTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGARSDVISMENVL 240

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEME QTHISMDWTTYSMVA FFIKAG+H+KAMNYLRKCEDKVD+DALGFNHLIS Y +L
Sbjct: 241 KEMEGQTHISMDWTTYSMVAGFFIKAGMHDKAMNYLRKCEDKVDEDALGFNHLISHYTNL 300

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           G K+E+ RLWAL K K KKQ+NRDYITML SLVKLE LEEAENL+ EWESSC+CYDFRVP
Sbjct: 301 GHKNEVMRLWALLK-KGKKQLNRDYITMLGSLVKLELLEEAENLVMEWESSCQCYDFRVP 360

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N++LIGYSQKGLIE+AEKMLRNI+  G IP PNSW I A+GYLEKQNLEKAF+CMKEALA
Sbjct: 361 NVVLIGYSQKGLIEKAEKMLRNIIVNGMIPSPNSWGIIASGYLEKQNLEKAFECMKEALA 420

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           V+ QNK WRPKP+VLSSI++WLSEN RYEE+KEF+SSLKTVP++D KLN+ALDELLE + 
Sbjct: 421 VKGQNKVWRPKPNVLSSILRWLSENRRYEEMKEFMSSLKTVPSMDEKLNNALDELLEIMA 480

Query: 481 DDENEIVTTRELE 494
           +D+   ++  ELE
Sbjct: 481 NDDG--ISKDELE 490

BLAST of Sgr029636 vs. ExPASy TrEMBL
Match: A0A1S4DZ16 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103493335 PE=4 SV=1)

HSP 1 Score: 801.2 bits (2068), Expect = 2.4e-228
Identity = 390/483 (80.75%), Positives = 436/483 (90.27%), Query Frame = 0

Query: 1   MAAALAMFKILRSSSSGCTRTAKTETDAFCFMVLRLYSARRNCNRRNLFARISPLGDPQL 60
           MAAA AMFKIL  SSSGCTRT + ETDAFCF+ LRLYS RR+CNRR L+A ISPLGDP  
Sbjct: 1   MAAASAMFKILSRSSSGCTRTPRPETDAFCFVALRLYSTRRSCNRRKLYAMISPLGDPDS 60

Query: 61  SVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQ 120
           SVVP+L+QWI+EGRK+KDFELRRIVRDLRTCRRY QALEVSEWM SKG F+LTTRDFA+Q
Sbjct: 61  SVVPVLNQWIKEGRKIKDFELRRIVRDLRTCRRYRQALEVSEWMCSKGRFSLTTRDFAIQ 120

Query: 121 LDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFAS 180
           LDLIG+VRGLDSAEKYF  VS Q+EIGKLYG+LLNCYVREGL+DKSL+HMQKMKEMGFAS
Sbjct: 121 LDLIGQVRGLDSAEKYFGSVSKQKEIGKLYGSLLNCYVREGLIDKSLAHMQKMKEMGFAS 180

Query: 181 TPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVL 240
           +PL YNDIMCLYLNTGQ DKVPNVLSEMKENGVLPDNFSYRICI+SYGARSD+ISME VL
Sbjct: 181 SPLCYNDIMCLYLNTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGARSDVISMENVL 240

Query: 241 KEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDALGFNHLISLYASL 300
           KEMESQTHISMDW TYSMVA FFIK  +H+KA NYLRKCED+VDQDALGFNHLIS Y +L
Sbjct: 241 KEMESQTHISMDWITYSMVAGFFIKVVMHDKARNYLRKCEDRVDQDALGFNHLISHYTNL 300

Query: 301 GRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVP 360
           G K+E+ RLWALQK K KKQ+NRDYITML SLVKL+ LEEAENL+ EWESSC+C DFRVP
Sbjct: 301 GHKNEVMRLWALQK-KAKKQLNRDYITMLGSLVKLDLLEEAENLVMEWESSCQCNDFRVP 360

Query: 361 NILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALA 420
           N++LIGYSQ GLIE+AEKMLRNI+  G IP PNSW I A+GYLEKQNLEKAF+CMKEALA
Sbjct: 361 NVVLIGYSQNGLIEKAEKMLRNIIVNGMIPSPNSWGIIASGYLEKQNLEKAFECMKEALA 420

Query: 421 VQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLDGKLNSALDELLETLD 480
           V+ QNK WRPKP+VLSSI++WLSEN RYEE+KEF+SSLKTVP++D KLNSALDELLE ++
Sbjct: 421 VKGQNKVWRPKPNVLSSILRWLSENRRYEEMKEFMSSLKTVPSMDEKLNSALDELLEIME 480

Query: 481 DDE 484
           +D+
Sbjct: 481 NDD 482

BLAST of Sgr029636 vs. TAIR 10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 434.1 bits (1115), Expect = 1.5e-121
Identity = 213/468 (45.51%), Positives = 312/468 (66.67%), Query Frame = 0

Query: 32  MVLRLYSARRNCNRRNLFARISPLGDPQLSVVPILDQWIEEGRKMKDFELRRIVRDLRTC 91
           ++   Y       +  L+++ISPLGDP+ SV P L  W++ G+K+   EL RIV DLR  
Sbjct: 11  LIASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRR 70

Query: 92  RRYGQALEVSEWMSSKGLFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQEEIGKLYG 151
           +R+  ALEVS+WM+  G+   +  + AV LDLIGRV G  +AE+YF  +  Q +  K YG
Sbjct: 71  KRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYG 130

Query: 152 ALLNCYVREGLVDKSLSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKEN 211
           ALLNCYVR+  V+KSL H +KMKEMGF ++ L+YN+IMCLY N GQ +KVP VL EMKE 
Sbjct: 131 ALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEE 190

Query: 212 GVLPDNFSYRICINSYGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEK 271
            V PDN+SYRICIN++GA  DL  +   L++ME +  I+MDW TY++ A F+I  G  ++
Sbjct: 191 NVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDR 250

Query: 272 AMNYLRKCEDKVD-QDALGFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLS 331
           A+  L+  E++++ +D  G+NHLI+LYA LG+K E+ RLW L+K+ CK+++N+DY+T+L 
Sbjct: 251 AVELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 310

Query: 332 SLVKLEELEEAENLIREWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIP 391
           SLVK++ L EAE ++ EW+SS  CYDFRVPN ++ GY  K + E+AE ML ++ R G+  
Sbjct: 311 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 370

Query: 392 PPNSWAIFAAGYLEKQNLEKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEE 451
            P SW + A  Y EK  LE AFKCMK AL V+  ++ WRP  ++++S++ W+ +    +E
Sbjct: 371 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 430

Query: 452 LKEFLSSLKTVPTLDGKLNSAL------------DELLETLDDDENEI 487
           ++ F++SL+    ++ ++  AL            D LL+ + DD+ EI
Sbjct: 431 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEI 478

BLAST of Sgr029636 vs. TAIR 10
Match: AT2G20710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 299.7 bits (766), Expect = 4.4e-81
Identity = 152/410 (37.07%), Positives = 246/410 (60.00%), Query Frame = 0

Query: 51  RISPLGDPQLSVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSKGLF 110
           R++  GDP  S++ +LD W+++G  +K  EL  I++ LR   R+  AL++S+WMS   + 
Sbjct: 43  RVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEHRVH 102

Query: 111 TLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKSLSHM 170
            ++  D A++LDLI +V GL  AEK+F  +  +     LYGALLNCY  + ++ K+    
Sbjct: 103 EISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKAEQVF 162

Query: 171 QKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINSYGAR 230
           Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM++  V PD F+    +++Y   
Sbjct: 163 QEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAYSVV 222

Query: 231 SDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKVDQDAL-- 290
           SD+  MEK L   E+   + +DW TY+  AN +IKAGL EKA+  LRK E  V+      
Sbjct: 223 SDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQKRKH 282

Query: 291 GFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLIREW 350
            +  L+S Y + G+K+E+ RLW+L KE      N  YI+++S+L+K++++EE E ++ EW
Sbjct: 283 AYEVLMSFYGAAGKKEEVYRLWSLYKE-LDGFYNTGYISVISALLKMDDIEEVEKIMEEW 342

Query: 351 ESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEKQNL 410
           E+    +D R+P++L+ GY +KG++E+AE+++  +V++ R+   ++W   A GY     +
Sbjct: 343 EAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAGKM 402

Query: 411 EKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSL 459
           EKA +  K A+ V +   GWRP   VL S + +L      E L++ L  L
Sbjct: 403 EKAVEKWKRAIEVSK--PGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL 449

BLAST of Sgr029636 vs. TAIR 10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 286.2 bits (731), Expect = 5.1e-77
Identity = 157/448 (35.04%), Positives = 260/448 (58.04%), Query Frame = 0

Query: 48  LFARISPLGDPQLSVVPILDQWIEEGRKMKDFELRRIVRDLRTCRRYGQALEVSEWMSSK 107
           ++ +IS +  P+L    +L+QW + GRK+  +EL R+V++LR  +R  QALEV +WM+++
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 108 G-LFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLVDKS 167
           G  F L+  D A+QLDLIG+VRG+  AE++F ++    +  ++YG+LLN YVR    +K+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 168 LSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRICINS 227
            + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK+  +  D +SY I ++S
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 228 YGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKV-DQ 287
            G+   +  ME V ++M+S   I  +WTT+S +A  +IK G  EKA + LRK E ++  +
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 288 DALGFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEAENLI 347
           + + +++L+SLY SLG K E+ R+W + K       N  Y  ++SSLV++ ++E AE + 
Sbjct: 309 NRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVY 368

Query: 348 REWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAGYLEK 407
            EW      YD R+PN+L+  Y +   +E AE +  ++V  G  P  ++W I A G+  K
Sbjct: 369 EEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRK 428

Query: 408 QNLEKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSLKTVPTLD 467
           + + +A  C++ A +  E +  WRPK  +LS   +   E       +  L  L+    L+
Sbjct: 429 RCISEALTCLRNAFSA-EGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLE 488

Query: 468 GKLNSALDELLETLDDDENEIVTTRELE 494
            K   AL      +D DEN  V   E++
Sbjct: 489 DKSYLAL------IDVDENRTVNNSEID 509

BLAST of Sgr029636 vs. TAIR 10
Match: AT1G60770.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 266.2 bits (679), Expect = 5.4e-71
Identity = 137/438 (31.28%), Positives = 245/438 (55.94%), Query Frame = 0

Query: 32  MVLRLYSARRNCNRRN--------LFARISPLGDPQLSVVPILDQWIEEGRKMKDFELRR 91
           M +R  S  R+  +R+        L+ R+   G  ++ V   L+Q+++  + +  +E+  
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 92  IVRDLRTCRRYGQALEVSEWMSSKGLFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQ 151
            ++ LR    Y  AL++SE M  +G+   T  D A+ LDL+ + R + + E YF  +   
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPET 120

Query: 152 EEIGKLYGALLNCYVREGLVDKSLSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPN 211
            +    YG+LLNCY +E L +K+   + KMKE+    + +SYN +M LY  TG+ +KVP 
Sbjct: 121 SKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPA 180

Query: 212 VLSEMKENGVLPDNFSYRICINSYGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFF 271
           ++ E+K   V+PD+++Y + + +  A +D+  +E+V++EM     ++ DWTTYS +A+ +
Sbjct: 181 MIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIY 240

Query: 272 IKAGLHEKAMNYLRKCEDK-VDQDALGFNHLISLYASLGRKDEITRLWALQKEKCKKQVN 331
           + AGL +KA   L++ E K   +D   +  LI+LY  LG+  E+ R+W   +    K  N
Sbjct: 241 VDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 300

Query: 332 RDYITMLSSLVKLEELEEAENLIREWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRN 391
             Y+ M+  LVKL +L  AE L +EW+++C  YD R+ N+L+  Y+Q+GLI++A ++   
Sbjct: 301 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 360

Query: 392 IVREGRIPPPNSWAIFAAGYLEKQNLEKAFKCMKEALAVQEQNKG-WRPKPSVLSSIMQW 451
             R G      +W IF   Y++  ++ +A +CM +A+++ + + G W P P  + ++M +
Sbjct: 361 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 420

Query: 452 LSENERYEELKEFLSSLK 460
             + +     +  L  LK
Sbjct: 421 FEQKKDVNGAENLLEILK 437

BLAST of Sgr029636 vs. TAIR 10
Match: AT2G20710.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 250.0 bits (637), Expect = 4.0e-66
Identity = 132/357 (36.97%), Positives = 211/357 (59.10%), Query Frame = 0

Query: 104 MSSKGLFTLTTRDFAVQLDLIGRVRGLDSAEKYFSRVSNQEEIGKLYGALLNCYVREGLV 163
           MS   +  ++  D A++LDLI +V GL  AEK+F  +  +     LYGALLNCY  + ++
Sbjct: 1   MSEHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVL 60

Query: 164 DKSLSHMQKMKEMGFASTPLSYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNFSYRIC 223
            K+    Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM++  V PD F+    
Sbjct: 61  HKAEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTR 120

Query: 224 INSYGARSDLISMEKVLKEMESQTHISMDWTTYSMVANFFIKAGLHEKAMNYLRKCEDKV 283
           +++Y   SD+  MEK L   E+   + +DW TY+  AN +IKAGL EKA+  LRK E  V
Sbjct: 121 LHAYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMV 180

Query: 284 DQDAL--GFNHLISLYASLGRKDEITRLWALQKEKCKKQVNRDYITMLSSLVKLEELEEA 343
           +       +  L+S Y + G+K+E+ RLW+L KE      N  YI+++S+L+K++++EE 
Sbjct: 181 NAQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKE-LDGFYNTGYISVISALLKMDDIEEV 240

Query: 344 ENLIREWESSCKCYDFRVPNILLIGYSQKGLIERAEKMLRNIVREGRIPPPNSWAIFAAG 403
           E ++ EWE+    +D R+P++L+ GY +KG++E+AE+++  +V++ R+   ++W   A G
Sbjct: 241 EKIMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALG 300

Query: 404 YLEKQNLEKAFKCMKEALAVQEQNKGWRPKPSVLSSIMQWLSENERYEELKEFLSSL 459
           Y     +EKA +  K A+ V +   GWRP   VL S + +L      E L++ L  L
Sbjct: 301 YKMAGKMEKAVEKWKRAIEVSK--PGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL 354

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022142737.1 | 4.4e-248 | 86.87 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Momor...
XP_023545982.1 | 2.2e-244 | 86.54 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucur...
KAG6600732.1 | 3.8e-244 | 86.54 | Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_022989754.1 | 8.5e-244 | 85.63 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucur...
XP_022942045.1 | 1.9e-243 | 86.13 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucur...

Match Name | E-value | Identity (%) | Description
Q84JR3 | 2.1e-120 | 45.51 | Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
Q9SKU6 | 6.2e-80 | 37.07 | Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
Q8LPS6 | 7.1e-76 | 35.04 | Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX...
O22714 | 7.6e-70 | 31.28 | Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX...
Q3E911 | 8.2e-64 | 33.97 | Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
A0A6J1CNQ2 | 2.1e-248 | 86.87 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like OS=Mom...
A0A6J1JQ72 | 4.1e-244 | 85.63 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like OS=Cuc...
A0A6J1FVG8 | 9.2e-244 | 86.13 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like OS=Cuc...
A0A0A0L7Y2 | 2.1e-232 | 80.93 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G104890 PE=4 SV=1
A0A1S4DZ16 | 2.4e-228 | 80.75 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like OS=Cuc...

Match Name | E-value | Identity (%) | Description
AT4G21705.1 | 1.5e-121 | 45.51 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G20710.1 | 4.4e-81 | 37.07 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1 | 5.1e-77 | 35.04 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G60770.1 | 5.4e-71 | 31.28 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G20710.2 | 4.0e-66 | 36.97 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 405..425
None | No IPR available | COILS | Coil | Coil | coord: 472..492
None | No IPR available | PANTHER | PTHR45717:SF7 | PPR CONTAINING PLANT-LIKE PROTEIN | coord: 32..475
None | No IPR available | PANTHER | PTHR45717 | OS12G0527900 PROTEIN | coord: 32..475
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 361..387, e-value: 0.016, score: 15.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 325..350, e-value: 0.2, score: 12.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 150..178, e-value: 2.1E-4, score: 21.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 184..216, e-value: 1.9E-5, score: 22.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 149..178, e-value: 8.2E-4, score: 17.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 184..227, e-value: 6.3E-9, score: 35.9
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 181..215, score: 10.095415
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 146..180, score: 8.560833
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 356..390, score: 8.780059
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 65..178, e-value: 1.3E-7, score: 33.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 247..478, e-value: 1.7E-19, score: 72.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 179..246, e-value: 2.6E-10, score: 42.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 132..333
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 259..426
IPR019734 | Tetratricopeptide repeat | PROSITE | PS50005 | TPR | coord: 392..425, score: 8.1424
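
Several of the IPR002885 (pentatricopeptide repeat) hits listed above overlap, because PFAM, TIGRFAM and PROSITE each report the same repeats with slightly different boundaries. As a rough sketch of how the overlapping `coord:` ranges could be collapsed into non-redundant regions, the following merges them; the function name `merge_intervals` and the choice to also join directly adjacent ranges are assumptions made for illustration, not something the database does.

```python
import re

# Matches the "coord: start..end" fields used in the InterPro table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous region instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# coord fields of the nine IPR002885 hits, copied from the table
# (PFAM PF01535 and PF13041, TIGRFAM TIGR00756, PROSITE PS51375).
ppr_hits = """
coord: 361..387
coord: 325..350
coord: 150..178
coord: 184..216
coord: 149..178
coord: 184..227
coord: 181..215
coord: 146..180
coord: 356..390
"""

intervals = [(int(a), int(b)) for a, b in COORD_RE.findall(ppr_hits)]
merged = merge_intervals(intervals)
covered = sum(end - start + 1 for start, end in merged)

print(merged)   # [(146, 227), (325, 350), (356, 390)]
print(covered)  # 143
```

Under these assumptions the nine hits collapse to three regions spanning 143 residues in total. Treating adjacent ranges such as 146..180 and 181..215 as one region is a simplification; it does not recover individual repeat boundaries.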

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr029636.1 | Sgr029636.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0005739 | mitochondrion
molecular_function | GO:0005515 | protein binding