Sgr026412 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr026412
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationtig00153031: 5073595 .. 5076654 (-)
RNA-Seq ExpressionSgr026412
SyntenySgr026412
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTATTCCAAATGTAATTCTATGCAAAATGCACAGAAGGCATTTGATGATTTACCCATTAGAACTATTCACTCTTGGAATATCATTCTTGCCTTCTACTCACGCGTTGGATTGTTAAGTCAAGCTCGTAAGTTCTTTGATGAAATGCCTCATCCAAATATCGTTAGCTACAATACCTTGATTTCTAGCTTTACTCGCCATGGGTTGTATGTAGAATCAATTAATATCTTTCGACAAATGCAACAGGATTTTGATCTTTTAGTCTTGGATGAGTTTACTCTTGTGAGTATGGTCGGTACTTGTGCTTGTTTGGGTGTTCTGGCATCGTTGCGTCAGGTTCATGGAGCAGCTATTGTCATCCGATTGGAGTTTAATATGATCGTTTGCAATGCTATAATTGATGCTTATGGTAAATGTGGTGAACCAGATACGTCATATTCTATTTTTAGTCGAATGCAAGAGAGAGATGTTGTTACCTGGACCTCAATGGTTGTAGCCTGTGCTCAGACATCCAAGTTAGATGATGCATTTCAATTGTTCAGTTGTATGCCGATAAAAAATGCTCATACTTGGACTGCTTTGAGTAATGCTTTTGCAAGAAGCAAGTATAGCAATGAGGCCCTGGGTTTGTTTGAACAAATGCTGAAGGAGAAAATTTCTCCTAATGCTTTCACATTTGTAGGTGTTTTAAGTGCTTGCGCAGATCTTGCTTTGATAGCAAAAGGCAAACAGATTCATGGACTCATAATCAGAAGTAGCAGTAGCCTTAATTTTCTAAACGCATATATATGTAATGCCTTAATTGATATGTACAGTAAGAGTGGTGACATGAAATCAGATAGGACGTCGTTCAACTTGATTCTTGGAAAGGATGTAGTGTCGTGGAATTCATTAATCACTGGGTTTGCACAAAAATGGGCTTGGAAAGGAAGCACTTCTTGCCTTTAGGAGGATGATAGAAGTAGGGATGAGGCCTAATGAAGTGACATTTCTTGGTGTGCTGTCTGCCTGTTCCCATACCGGTTTGTCATCTGAAGGATTATATATTCTGGAGTTAATGGAAAAGTGTTATGGTTTAGATCATTATGCAGTCTTGATCGATATGTTTGGAAGAAAAAATAGACTTGCTGAAGCATTGGATTTAATAGCTAGAGCACCCAATGGATCAAAACACGTTGGAATATGGGGTGCAGTTCTGGGGGCTTGTCGAATGCGTGAAAATTTGAACCTGGCTATAAGAGCTGCAGAAACTTTGTTTAAGATGGAGCCAGATAATGCCGGAAGGTATGTATGTAATGTTATCTAATATATTTGCTGCAGCAAGTAGATGGATGGATGCCCATAATGTGAGAAAACTTATGGAGGAAAGAGGTTCCAAGAAGGAAGTAGCATATAGCTGCATAGAAATAAGAAATACAAGACATAAGTTTGTGGCAAGAGATAATTCCCACAGTCAGATGGGTGAGATATATGAGCTAATGTTTATGCTACTAGATCACATGAAAAATTTTGGTTACATGCCGATTGACAATGGTATTTACATTTATGATGGATATGGTATTTGAACTTTGAGCATGATTTATTTGGATGCCGTGGCTCAGTGTTGCCATTGTATAGACATTGAAAATGTTAGCAATGAATGAATTTGAGGATATGAAAGCTGCAAGATGATGAAGCTACAAAGCTGAAGAGGACAGATTGCTATCAAGTAGGCGGTCTTTTATTTTATGTTATTTTCCCCCACCATATTGATGTGATGTTAATAATTTGGCAAAGTACACTAAAATAAATCGTAAAATTTTGATCTTAAAGTTTTATCAAGTCCCACAGAAAATAAACAGATATTCAGGCATTAATCATTAATATAGTTTTAGCCACACAAGCATATAGGTTTTCCTTTTTGGGTTTTGATTTTTGAATTATTTTCCTTTAAAAAAATTTGGAATTGCAATAATACCAAACTATTTTGACACACATTCACCTCCCTTTCTACGTCACGATTTTAATCAAGGCATTTTTTTTTATGACAAACCAAAATTTAGACAAACGAAGAATACAAGTAGTCTATACAAAAAAAAGGAGGAAAGCCCACTACAAAAGAAGCCTGAACATGGGAGCCAAACAAAAATGAAAAAAAGTGGCCTAACAGTGAGAGACAAGGGATAATTACAAAACTCTTTGCAAATTAACATAGAACCAATACATTATGGTTTCAGGTTTGAACTGATGTTGGTCGGCACAATCTTTTGTCCTATAGATGAAACTTAGGACCAAATTAAGAAATTCTGTTGACCTTTTTGTTCCTTAGGCATAATCGTGGAGCATGACCTTCCTCTACTTCTTTAAGGAACTAAGTCTCAAATATTTAATGGATGCTCATTAGCATGCTTGTTATTCTAATTATAAAACTACTTTCAGTTAATGAGAGTCTATAGTAACCTTTTTTTTGCAAATTAATCTTCTAAGTAAGTTATGCTGAAAGTGTACGATCATGGAGTAACAATTTTTCTCTCCACTGGGTAGAAATGCAATTGAATTTTGAAATATTTTACTTAACCCTCCATGCAAAATATTAGATGCTATTTTGGTTGCATATGTTTATACTTGATAAATATTACTTGGCCAGACTGCATATTGTATTTGATTGAGAGATATCTCCTATGAAACGTGACTCTCTGGGGTTCTTCATTTTGCTGCTCTTCTTTCTGTATTGATGACTCTGATCCTTACTTCCTTTTTTGTAATCCTCTAGGAAGATTACTGTTACTGGAACTAAAAGAAATGATGAAGTGCTGCTGTTCTTAATCTAGCCAGAAAAATTTGAGGGTGCAAGGCCTGATCGAGTTCTGTCTTATGAGTTTGTATGCAGAGATGAAATTCAGAAAAAATTTCTCCCTGAGAGGGAAAAATCTTTCCTTTGTCGTGGTTGCTCTTGCATTCACTGTCTTGGTGTTGTGGACCTGGGAAGAAAATCCTTTTCTTACCACTTGTCAATCAGTTCAAGCGTGGTACAGAACTTCTTATGCAGGTATGCTTGCCAATCCCTGTTTAAAAATGTAG

mRNA sequence

ATGTATTCCAAATGTAATTCTATGCAAAATGCACAGAAGGCATTTGATGATTTACCCATTAGAACTATTCACTCTTGGAATATCATTCTTGCCTTCTACTCACGCGTTGGATTGTTAAGTCAAGCTCGTAAGTTCTTTGATGAAATGCCTCATCCAAATATCGTTAGCTACAATACCTTGATTTCTAGCTTTACTCGCCATGGGTTGTATGTAGAATCAATTAATATCTTTCGACAAATGCAACAGGATTTTGATCTTTTAGTCTTGGATGAGTTTACTCTTGTGAGTATGGTCGGTACTTGTGCTTGTTTGGGTGTTCTGGCATCGTTGCGTCAGGTTCATGGAGCAGCTATTGTCATCCGATTGGAGTTTAATATGATCGTTTGCAATGCTATAATTGATGCTTATGGTAAATGTGGTGAACCAGATACGTCATATTCTATTTTTAGTCGAATGCAAGAGAGAGATGTTGTTACCTGGACCTCAATGGTTGTAGCCTGTGCTCAGACATCCAAGTTAGATGATGCATTTCAATTGTTCAGTTGTATGCCGATAAAAAATGCTCATACTTGGACTGCTTTGAGTAATGCTTTTGCAAGAAGCAAGTATAGCAATGAGGCCCTGGGTTTGTTTGAACAAATGCTGAAGGAGAAAATTTCTCCTAATGCTTTCACATTTGTAGGTGTTTTAAGTGCTTGCGCAGATCTTGCTTTGATAGCAAAAGGCAAACAGATTCATGGACTCATAATCAGAAGTAGCAGTAGCCTTAATTTTCTAAACGCATATATATGTAATGCCTTAATTGATATGTACAGTAAGAGTGGTGACATGAAATCAGATAGGACGTCGTTCAACTTGATTCTTGGAAAGGATGTAGTGTCGTGGAATTCATTAATCACTGGGAGGATGATAGAAGTAGGGATGAGGCCTAATGAAGTGACATTTCTTGGTGTGCTGTCTGCCTGTTCCCATACCGGTTTGTCATCTGAAGGATTATATATTCTGGAGTTAATGGAAAAGTGTTATGGTTTAGATCATTATGCAGTCTTGATCGATATGTTTGGAAGAAAAAATAGACTTGCTGAAGCATTGGATTTAATAGCTAGAGCACCCAATGGATCAAAACACGTTGGAATATGGGGTGCAGTTCTGGGGGCTTGTCGAATGCGTGAAAATTTGAACCTGGCTATAAGAGCTGCAGAAACTTTGTTTAAGATGGAGCCAGATAATGCCGGAAGCCAGAAAAATTTGAGGGTGCAAGGCCTGATCGAGTTCTGTCTTATGAGTTTGTATGCAGAGATGAAATTCAGAAAAAATTTCTCCCTGAGAGGGAAAAATCTTTCCTTTGTCGTGGTTGCTCTTGCATTCACTGTCTTGGTGTTGTGGACCTGGGAAGAAAATCCTTTTCTTACCACTTGTCAATCAGTTCAAGCGTGGTACAGAACTTCTTATGCAGGTATGCTTGCCAATCCCTGTTTAAAAATGTAG

Coding sequence (CDS)

ATGTATTCCAAATGTAATTCTATGCAAAATGCACAGAAGGCATTTGATGATTTACCCATTAGAACTATTCACTCTTGGAATATCATTCTTGCCTTCTACTCACGCGTTGGATTGTTAAGTCAAGCTCGTAAGTTCTTTGATGAAATGCCTCATCCAAATATCGTTAGCTACAATACCTTGATTTCTAGCTTTACTCGCCATGGGTTGTATGTAGAATCAATTAATATCTTTCGACAAATGCAACAGGATTTTGATCTTTTAGTCTTGGATGAGTTTACTCTTGTGAGTATGGTCGGTACTTGTGCTTGTTTGGGTGTTCTGGCATCGTTGCGTCAGGTTCATGGAGCAGCTATTGTCATCCGATTGGAGTTTAATATGATCGTTTGCAATGCTATAATTGATGCTTATGGTAAATGTGGTGAACCAGATACGTCATATTCTATTTTTAGTCGAATGCAAGAGAGAGATGTTGTTACCTGGACCTCAATGGTTGTAGCCTGTGCTCAGACATCCAAGTTAGATGATGCATTTCAATTGTTCAGTTGTATGCCGATAAAAAATGCTCATACTTGGACTGCTTTGAGTAATGCTTTTGCAAGAAGCAAGTATAGCAATGAGGCCCTGGGTTTGTTTGAACAAATGCTGAAGGAGAAAATTTCTCCTAATGCTTTCACATTTGTAGGTGTTTTAAGTGCTTGCGCAGATCTTGCTTTGATAGCAAAAGGCAAACAGATTCATGGACTCATAATCAGAAGTAGCAGTAGCCTTAATTTTCTAAACGCATATATATGTAATGCCTTAATTGATATGTACAGTAAGAGTGGTGACATGAAATCAGATAGGACGTCGTTCAACTTGATTCTTGGAAAGGATGTAGTGTCGTGGAATTCATTAATCACTGGGAGGATGATAGAAGTAGGGATGAGGCCTAATGAAGTGACATTTCTTGGTGTGCTGTCTGCCTGTTCCCATACCGGTTTGTCATCTGAAGGATTATATATTCTGGAGTTAATGGAAAAGTGTTATGGTTTAGATCATTATGCAGTCTTGATCGATATGTTTGGAAGAAAAAATAGACTTGCTGAAGCATTGGATTTAATAGCTAGAGCACCCAATGGATCAAAACACGTTGGAATATGGGGTGCAGTTCTGGGGGCTTGTCGAATGCGTGAAAATTTGAACCTGGCTATAAGAGCTGCAGAAACTTTGTTTAAGATGGAGCCAGATAATGCCGGAAGCCAGAAAAATTTGAGGGTGCAAGGCCTGATCGAGTTCTGTCTTATGAGTTTGTATGCAGAGATGAAATTCAGAAAAAATTTCTCCCTGAGAGGGAAAAATCTTTCCTTTGTCGTGGTTGCTCTTGCATTCACTGTCTTGGTGTTGTGGACCTGGGAAGAAAATCCTTTTCTTACCACTTGTCAATCAGTTCAAGCGTGGTACAGAACTTCTTATGCAGGTATGCTTGCCAATCCCTGTTTAAAAATGTAG

Protein sequence

MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTLISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVIRLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLFSCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIAKGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLITGRMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCYGLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAETLFKMEPDNAGSQKNLRVQGLIEFCLMSLYAEMKFRKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAGMLANPCLKM
Homology
BLAST of Sgr026412 vs. NCBI nr
Match: XP_023543897.1 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023543898.1 pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 749.6 bits (1934), Expect = 1.7e-212
Identity = 385/532 (72.37%), Postives = 426/532 (80.08%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLP + IHSWN ILA YSR G LSQAR  FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVE++NIF QMQQDFD LVLDEFT VS+VGTCACLG L  LRQVHGAAI I
Sbjct: 115 ISSFTHHGLYVEAMNIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNA+I+AYGKCGEP TSYS+FSRMQ+RDVVTWTSMVVA  QTSKLDDAF++F
Sbjct: 175 GLEFNMIVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP+KN HTWTAL NAF ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 RSMPVKNVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IHG+I R SS LNF N Y+CNAL+D+YSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKEIHGIITRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKC---- 360
           G               RMIEVG++PNEVTFLGVLSACSHTGLSSEGLYI+ELMEK     
Sbjct: 355 GFAQNGLGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIK 414

Query: 361 YGLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 420
             LDHYAVLIDMFGRKNRLAEALDLI+RAPN SKHVGIWGAVLGACR+ +NL+LAIRAAE
Sbjct: 415 LSLDHYAVLIDMFGRKNRLAEALDLISRAPNASKHVGIWGAVLGACRIHDNLDLAIRAAE 474

Query: 421 TLFKMEPDNAGSQKNL--------------RVQGLIE-----------FCLMSLYAEMKF 480
           TLF+MEPDNAG    L               V+ L+E           F  +  +AEMK 
Sbjct: 475 TLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKL 534

Query: 481 RKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAGML 489
           R NF  RGK+ SFV+ ALAFT++VLW W EN F+TT QSVQAWYRTSY+G +
Sbjct: 535 RNNFPARGKHCSFVMAALAFTIVVLWAWGENSFITTSQSVQAWYRTSYSGFM 586

BLAST of Sgr026412 vs. NCBI nr
Match: XP_023543899.1 (pentatricopeptide repeat-containing protein At2g21090-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 749.2 bits (1933), Expect = 2.3e-212
Identity = 385/530 (72.64%), Postives = 425/530 (80.19%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLP + IHSWN ILA YSR G LSQAR  FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVE++NIF QMQQDFD LVLDEFT VS+VGTCACLG L  LRQVHGAAI I
Sbjct: 115 ISSFTHHGLYVEAMNIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNA+I+AYGKCGEP TSYS+FSRMQ+RDVVTWTSMVVA  QTSKLDDAF++F
Sbjct: 175 GLEFNMIVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP+KN HTWTAL NAF ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 RSMPVKNVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IHG+I R SS LNF N Y+CNAL+D+YSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKEIHGIITRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKC---- 360
           G               RMIEVG++PNEVTFLGVLSACSHTGLSSEGLYI+ELMEK     
Sbjct: 355 GFAQNGLGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIK 414

Query: 361 YGLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 420
             LDHYAVLIDMFGRKNRLAEALDLI+RAPN SKHVGIWGAVLGACR+ +NL+LAIRAAE
Sbjct: 415 LSLDHYAVLIDMFGRKNRLAEALDLISRAPNASKHVGIWGAVLGACRIHDNLDLAIRAAE 474

Query: 421 TLFKMEPDNAGSQKNL--------------RVQGLIE-----------FCLMSLYAEMKF 480
           TLF+MEPDNAG    L               V+ L+E           F  +  +AEMK 
Sbjct: 475 TLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKL 534

Query: 481 RKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAG 487
           R NF  RGK+ SFV+ ALAFT++VLW W EN F+TT QSVQAWYRTSY+G
Sbjct: 535 RNNFPARGKHCSFVMAALAFTIVVLWAWGENSFITTSQSVQAWYRTSYSG 584

BLAST of Sgr026412 vs. NCBI nr
Match: XP_022925937.1 (pentatricopeptide repeat-containing protein At2g21090-like isoform X3 [Cucurbita moschata])

HSP 1 Score: 746.5 bits (1926), Expect = 1.5e-211
Identity = 383/532 (71.99%), Postives = 426/532 (80.08%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLP + IHSWN ILA YSR G LSQAR  FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVE+++IF QMQQDFD LVLDEFT VS+VGTCACLG L  LRQVHGAAI I
Sbjct: 115 ISSFTHHGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNA+I+AYGKCGEP TSYS+FSRMQ+RDVVTWTSMVVA  QTSKLDDAF++F
Sbjct: 175 GLEFNMIVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP+KN HTWTAL NAF ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 RSMPVKNVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IH +IIR SS LNF N Y+CNAL+D+YSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKEIHAIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCY--- 360
           G               RMIEVG++PNEVTFLGVLSACSHTGLSSEGLYI+ELMEK     
Sbjct: 355 GFAQNGLGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIK 414

Query: 361 -GLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 420
             LDHYAVLIDMFGRKNRLAEALDLI+RAPN SKH+GIWGAVLGACR+ +NL+LAIRAAE
Sbjct: 415 PSLDHYAVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAE 474

Query: 421 TLFKMEPDNAGSQKNL--------------RVQGLIE-----------FCLMSLYAEMKF 480
           TLF+MEPDNAG    L               V+ L+E           F  +  +AEMK 
Sbjct: 475 TLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKL 534

Query: 481 RKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAGML 489
           R NF  RGK+ SFV+ ALAFT++VLW W EN F+TT QSVQAWYRTSY+G +
Sbjct: 535 RNNFPARGKHCSFVMAALAFTIVVLWAWGENSFITTSQSVQAWYRTSYSGFM 586

BLAST of Sgr026412 vs. NCBI nr
Match: XP_022925935.1 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 746.5 bits (1926), Expect = 1.5e-211
Identity = 383/532 (71.99%), Postives = 426/532 (80.08%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLP + IHSWN ILA YSR G LSQAR  FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVE+++IF QMQQDFD LVLDEFT VS+VGTCACLG L  LRQVHGAAI I
Sbjct: 115 ISSFTHHGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNA+I+AYGKCGEP TSYS+FSRMQ+RDVVTWTSMVVA  QTSKLDDAF++F
Sbjct: 175 GLEFNMIVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP+KN HTWTAL NAF ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 RSMPVKNVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IH +IIR SS LNF N Y+CNAL+D+YSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKEIHAIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCY--- 360
           G               RMIEVG++PNEVTFLGVLSACSHTGLSSEGLYI+ELMEK     
Sbjct: 355 GFAQNGLGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIK 414

Query: 361 -GLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 420
             LDHYAVLIDMFGRKNRLAEALDLI+RAPN SKH+GIWGAVLGACR+ +NL+LAIRAAE
Sbjct: 415 PSLDHYAVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAE 474

Query: 421 TLFKMEPDNAGSQKNL--------------RVQGLIE-----------FCLMSLYAEMKF 480
           TLF+MEPDNAG    L               V+ L+E           F  +  +AEMK 
Sbjct: 475 TLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKL 534

Query: 481 RKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAGML 489
           R NF  RGK+ SFV+ ALAFT++VLW W EN F+TT QSVQAWYRTSY+G +
Sbjct: 535 RNNFPARGKHCSFVMAALAFTIVVLWAWGENSFITTSQSVQAWYRTSYSGFM 586

BLAST of Sgr026412 vs. NCBI nr
Match: XP_022925936.1 (pentatricopeptide repeat-containing protein At2g21090-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 746.1 bits (1925), Expect = 1.9e-211
Identity = 383/530 (72.26%), Postives = 425/530 (80.19%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLP + IHSWN ILA YSR G LSQAR  FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVE+++IF QMQQDFD LVLDEFT VS+VGTCACLG L  LRQVHGAAI I
Sbjct: 115 ISSFTHHGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNA+I+AYGKCGEP TSYS+FSRMQ+RDVVTWTSMVVA  QTSKLDDAF++F
Sbjct: 175 GLEFNMIVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP+KN HTWTAL NAF ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 RSMPVKNVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IH +IIR SS LNF N Y+CNAL+D+YSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKEIHAIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCY--- 360
           G               RMIEVG++PNEVTFLGVLSACSHTGLSSEGLYI+ELMEK     
Sbjct: 355 GFAQNGLGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIK 414

Query: 361 -GLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 420
             LDHYAVLIDMFGRKNRLAEALDLI+RAPN SKH+GIWGAVLGACR+ +NL+LAIRAAE
Sbjct: 415 PSLDHYAVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAE 474

Query: 421 TLFKMEPDNAGSQKNL--------------RVQGLIE-----------FCLMSLYAEMKF 480
           TLF+MEPDNAG    L               V+ L+E           F  +  +AEMK 
Sbjct: 475 TLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKL 534

Query: 481 RKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAG 487
           R NF  RGK+ SFV+ ALAFT++VLW W EN F+TT QSVQAWYRTSY+G
Sbjct: 535 RNNFPARGKHCSFVMAALAFTIVVLWAWGENSFITTSQSVQAWYRTSYSG 584

BLAST of Sgr026412 vs. ExPASy Swiss-Prot
Match: Q9SKQ4 (Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E48 PE=2 SV=1)

HSP 1 Score: 256.5 bits (654), Expect = 6.0e-67
Identity = 147/432 (34.03%), Postives = 237/432 (54.86%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MY KC    +A K FD + +R ++SWN +++ Y + G+L +AR  FD MP  ++VS+NT+
Sbjct: 91  MYMKCGKPIDACKVFDQMHLRNLYSWNNMVSGYVKSGMLVRARVVFDSMPERDVVSWNTM 150

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           +  + + G   E++  +++ ++    +  +EF+   ++  C     L   RQ HG  +V 
Sbjct: 151 VIGYAQDGNLHEALWFYKEFRRSG--IKFNEFSFAGLLTACVKSRQLQLNRQAHGQVLVA 210

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
               N+++  +IIDAY KCG+ +++   F  M  +D+  WT+++   A+   ++ A +LF
Sbjct: 211 GFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVKDIHIWTTLISGYAKLGDMEAAEKLF 270

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP KN  +WTAL   + R    N AL LF +M+   + P  FTF   L A A +A + 
Sbjct: 271 CEMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLR 330

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGK-DVVSWNSLI 300
            GK+IHG +IR++      NA + ++LIDMYSKSG +++    F +   K D V WN++I
Sbjct: 331 HGKEIHGYMIRTNVR---PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMI 390

Query: 301 TG---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCYGL 360
           +                 MI+  ++PN  T + +L+ACSH+GL  EGL   E M   +G+
Sbjct: 391 SALAQHGLGHKALRMLDDMIKFRVQPNRTTLVVILNACSHSGLVEEGLRWFESMTVQHGI 450

Query: 361 ----DHYAVLIDMFGRKNRLAEALDLIARAP-NGSKHVGIWGAVLGACRMRENLNLAIRA 412
               +HYA LID+ GR     E +  I   P    KH  IW A+LG CR+  N  L  +A
Sbjct: 451 VPDQEHYACLIDLLGRAGCFKELMRKIEEMPFEPDKH--IWNAILGVCRIHGNEELGKKA 510

BLAST of Sgr026412 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 245.4 bits (625), Expect = 1.4e-63
Identity = 141/439 (32.12%), Postives = 240/439 (54.67%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MY+KC     A+  FD + +R I SWN ++A + +VG +  A   F++M   +IV++N++
Sbjct: 190 MYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSM 249

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           IS F + G  + +++IF +M +D  LL  D FTL S++  CA L  L   +Q+H   +  
Sbjct: 250 ISGFNQRGYDLRALDIFSKMLRD-SLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTT 309

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERD--VVTWTSMVVACAQTSKLDDAFQ 180
             + + IV NA+I  Y +CG  +T+  +  +   +D  +  +T+++    +   ++ A  
Sbjct: 310 GFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKN 369

Query: 181 LFSCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLAL 240
           +F  +  ++   WTA+   + +     EA+ LF  M+     PN++T   +LS  + LA 
Sbjct: 370 IFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLAS 429

Query: 241 IAKGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLI-LGKDVVSWNS 300
           ++ GKQIHG  ++S       +  + NALI MY+K+G++ S   +F+LI   +D VSW S
Sbjct: 430 LSHGKQIHGSAVKSG---EIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTS 489

Query: 301 LITG---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEK-- 360
           +I                  M+  G+RP+ +T++GV SAC+H GL ++G    ++M+   
Sbjct: 490 MIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVD 549

Query: 361 --CYGLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIR 418
                L HYA ++D+FGR   L EA + I + P     V  WG++L ACR+ +N++L   
Sbjct: 550 KIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPI-EPDVVTWGSLLSACRVHKNIDLGKV 609

BLAST of Sgr026412 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 1.6e-59
Identity = 134/413 (32.45%), Postives = 223/413 (54.00%), Query Frame = 0

Query: 23  IHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTLISSFTRHGLYVESINIFRQMQQ 82
           ++  + ++  YS+ G ++ A++ FDEM   N+VS+N+LI+ F ++G  VE++++F+ M +
Sbjct: 187 VYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLE 246

Query: 83  DFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIV-IRLEFNMIVCNAIIDAYGKCGE 142
               +  DE TL S++  CA L  +   ++VHG  +   +L  ++I+ NA +D Y KC  
Sbjct: 247 --SRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSR 306

Query: 143 PDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLFSCMPIKNAHTWTALSNAFARS 202
              +  IF  M  R+V+  TSM+   A  +    A  +F+ M  +N  +W AL   + ++
Sbjct: 307 IKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQN 366

Query: 203 KYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIAKGKQIHGLIIRSS---SSLNF 262
             + EAL LF  + +E + P  ++F  +L ACADLA +  G Q H  +++      S   
Sbjct: 367 GENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEE 426

Query: 263 LNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLITG---------------RM 322
            + ++ N+LIDMY K G ++     F  ++ +D VSWN++I G                M
Sbjct: 427 DDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREM 486

Query: 323 IEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCYGL----DHYAVLIDMFGRKNR 382
           +E G +P+ +T +GVLSAC H G   EG +    M + +G+    DHY  ++D+ GR   
Sbjct: 487 LESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGF 546

Query: 383 LAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAETLFKMEPDNAG 413
           L EA  +I   P     V IWG++L AC++  N+ L    AE L ++EP N+G
Sbjct: 547 LEEAKSMIEEMPMQPDSV-IWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSG 596

BLAST of Sgr026412 vs. ExPASy Swiss-Prot
Match: O23169 (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 227.3 bits (578), Expect = 3.9e-58
Identity = 137/429 (31.93%), Postives = 230/429 (53.61%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MY+KC S+ +A+K FD++P R + SWN+++  Y+ VGLL +ARK FDEM   +  S+  +
Sbjct: 129 MYAKCGSLVDARKVFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMTEKDSYSWTAM 188

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ++ + +     E++ ++  MQ+  +    + FT+   V   A +  +   +++HG  +  
Sbjct: 189 VTGYVKKDQPEEALVLYSLMQRVPNSRP-NIFTVSIAVAAAAAVKCIRRGKEIHGHIVRA 248

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            L+ + ++ ++++D YGKCG  D + +IF ++ E+DVV+WTSM+    ++S+  + F LF
Sbjct: 249 GLDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLF 308

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
           S                        E +G  E+       PN +TF GVL+ACADL    
Sbjct: 309 S------------------------ELVGSCER-------PNEYTFAGVLNACADLTTEE 368

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
            GKQ+HG + R         ++  ++L+DMY+K G+++S +   +     D+VSW SLI 
Sbjct: 369 LGKQVHGYMTRVGFD---PYSFASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIG 428

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGL-YILELMEK---C 360
           G                +++ G +P+ VTF+ VLSAC+H GL  +GL +   + EK    
Sbjct: 429 GCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSITEKHRLS 488

Query: 361 YGLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 411
           +  DHY  L+D+  R  R  +   +I+  P       +W +VLG C    N++LA  AA+
Sbjct: 489 HTSDHYTCLVDLLARSGRFEQLKSVISEMPMKPSKF-LWASVLGGCSTYGNIDLAEEAAQ 521

BLAST of Sgr026412 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 6.7e-58
Identity = 135/407 (33.17%), Postives = 218/407 (53.56%), Query Frame = 0

Query: 27  NIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTLISSFTRHGLYVESINIFRQMQQDFDL 86
           N ++  Y   G L  A K F  +   ++VS+N++I+ F + G   +++ +F++M+ +   
Sbjct: 170 NSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESED-- 229

Query: 87  LVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVIRLEFNMIVCNAIIDAYGKCGEPDTSY 146
           +     T+V ++  CA +  L   RQV       R+  N+ + NA++D Y KCG  + + 
Sbjct: 230 VKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAK 289

Query: 147 SIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLFSCMPIKNAHTWTALSNAFARSKYSNE 206
            +F  M+E+D VTWT+M+   A +   + A ++ + MP K+   W AL +A+ ++   NE
Sbjct: 290 RLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNE 349

Query: 207 ALGLFEQM-LKEKISPNAFTFVGVLSACADLALIAKGKQIHGLIIRSSSSLNFLNAYICN 266
           AL +F ++ L++ +  N  T V  LSACA +  +  G+ IH  I +    +NF   ++ +
Sbjct: 350 ALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNF---HVTS 409

Query: 267 ALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLITG---------------RMIEVGMRP 326
           ALI MYSK GD++  R  FN +  +DV  W+++I G               +M E  ++P
Sbjct: 410 ALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKP 469

Query: 327 NEVTFLGVLSACSHTGLSSEGLYILELMEKCYGL----DHYAVLIDMFGRKNRLAEALDL 386
           N VTF  V  ACSHTGL  E   +   ME  YG+     HYA ++D+ GR   L +A+  
Sbjct: 470 NGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKF 529

Query: 387 IARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAETLFKMEPDNAGS 414
           I   P       +WGA+LGAC++  NLNLA  A   L ++EP N G+
Sbjct: 530 IEAMPI-PPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGA 570

BLAST of Sgr026412 vs. ExPASy TrEMBL
Match: A0A6J1EJM2 (pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111433199 PE=3 SV=1)

HSP 1 Score: 746.5 bits (1926), Expect = 7.1e-212
Identity = 383/532 (71.99%), Postives = 426/532 (80.08%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLP + IHSWN ILA YSR G LSQAR  FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVE+++IF QMQQDFD LVLDEFT VS+VGTCACLG L  LRQVHGAAI I
Sbjct: 115 ISSFTHHGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNA+I+AYGKCGEP TSYS+FSRMQ+RDVVTWTSMVVA  QTSKLDDAF++F
Sbjct: 175 GLEFNMIVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP+KN HTWTAL NAF ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 RSMPVKNVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IH +IIR SS LNF N Y+CNAL+D+YSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKEIHAIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCY--- 360
           G               RMIEVG++PNEVTFLGVLSACSHTGLSSEGLYI+ELMEK     
Sbjct: 355 GFAQNGLGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIK 414

Query: 361 -GLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 420
             LDHYAVLIDMFGRKNRLAEALDLI+RAPN SKH+GIWGAVLGACR+ +NL+LAIRAAE
Sbjct: 415 PSLDHYAVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAE 474

Query: 421 TLFKMEPDNAGSQKNL--------------RVQGLIE-----------FCLMSLYAEMKF 480
           TLF+MEPDNAG    L               V+ L+E           F  +  +AEMK 
Sbjct: 475 TLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKL 534

Query: 481 RKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAGML 489
           R NF  RGK+ SFV+ ALAFT++VLW W EN F+TT QSVQAWYRTSY+G +
Sbjct: 535 RNNFPARGKHCSFVMAALAFTIVVLWAWGENSFITTSQSVQAWYRTSYSGFM 586

BLAST of Sgr026412 vs. ExPASy TrEMBL
Match: A0A6J1ECZ4 (pentatricopeptide repeat-containing protein At2g21090-like isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111433199 PE=3 SV=1)

HSP 1 Score: 746.5 bits (1926), Expect = 7.1e-212
Identity = 383/532 (71.99%), Postives = 426/532 (80.08%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLP + IHSWN ILA YSR G LSQAR  FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVE+++IF QMQQDFD LVLDEFT VS+VGTCACLG L  LRQVHGAAI I
Sbjct: 115 ISSFTHHGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNA+I+AYGKCGEP TSYS+FSRMQ+RDVVTWTSMVVA  QTSKLDDAF++F
Sbjct: 175 GLEFNMIVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP+KN HTWTAL NAF ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 RSMPVKNVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IH +IIR SS LNF N Y+CNAL+D+YSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKEIHAIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCY--- 360
           G               RMIEVG++PNEVTFLGVLSACSHTGLSSEGLYI+ELMEK     
Sbjct: 355 GFAQNGLGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIK 414

Query: 361 -GLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 420
             LDHYAVLIDMFGRKNRLAEALDLI+RAPN SKH+GIWGAVLGACR+ +NL+LAIRAAE
Sbjct: 415 PSLDHYAVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAE 474

Query: 421 TLFKMEPDNAGSQKNL--------------RVQGLIE-----------FCLMSLYAEMKF 480
           TLF+MEPDNAG    L               V+ L+E           F  +  +AEMK 
Sbjct: 475 TLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKL 534

Query: 481 RKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAGML 489
           R NF  RGK+ SFV+ ALAFT++VLW W EN F+TT QSVQAWYRTSY+G +
Sbjct: 535 RNNFPARGKHCSFVMAALAFTIVVLWAWGENSFITTSQSVQAWYRTSYSGFM 586

BLAST of Sgr026412 vs. ExPASy TrEMBL
Match: A0A6J1EDI3 (pentatricopeptide repeat-containing protein At2g21090-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111433199 PE=3 SV=1)

HSP 1 Score: 746.1 bits (1925), Expect = 9.3e-212
Identity = 383/530 (72.26%), Postives = 425/530 (80.19%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLP + IHSWN ILA YSR G LSQAR  FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPFKNIHSWNTILASYSRAGFLSQARMIFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVE+++IF QMQQDFD LVLDEFT VS+VGTCACLG L  LRQVHGAAI I
Sbjct: 115 ISSFTHHGLYVEAMDIFWQMQQDFDRLVLDEFTFVSIVGTCACLGALEMLRQVHGAAIFI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNA+I+AYGKCGEP TSYS+FSRMQ+RDVVTWTSMVVA  QTSKLDDAF++F
Sbjct: 175 GLEFNMIVCNAVINAYGKCGEPGTSYSVFSRMQKRDVVTWTSMVVAYTQTSKLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP+KN HTWTAL NAF ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 RSMPVKNVHTWTALINAFVKNKYSNEALDLFQQMLEEKYSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IH +IIR SS LNF N Y+CNAL+D+YSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKEIHAIIIRRSSDLNFPNVYMCNALVDLYSKSGDMKSARTLFNLVPKKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCY--- 360
           G               RMIEVG++PNEVTFLGVLSACSHTGLSSEGLYI+ELMEK     
Sbjct: 355 GFAQNGLGREALIAFRRMIEVGIKPNEVTFLGVLSACSHTGLSSEGLYIMELMEKSNDIK 414

Query: 361 -GLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 420
             LDHYAVLIDMFGRKNRLAEALDLI+RAPN SKH+GIWGAVLGACR+ +NL+LAIRAAE
Sbjct: 415 PSLDHYAVLIDMFGRKNRLAEALDLISRAPNASKHIGIWGAVLGACRIHDNLDLAIRAAE 474

Query: 421 TLFKMEPDNAGSQKNL--------------RVQGLIE-----------FCLMSLYAEMKF 480
           TLF+MEPDNAG    L               V+ L+E           F  +  +AEMK 
Sbjct: 475 TLFEMEPDNAGRYVMLSNVFAAASRWMDAHNVRKLMEERGFKKEVAQSFIEIRNFAEMKL 534

Query: 481 RKNFSLRGKNLSFVVVALAFTVLVLWTWEENPFLTTCQSVQAWYRTSYAG 487
           R NF  RGK+ SFV+ ALAFT++VLW W EN F+TT QSVQAWYRTSY+G
Sbjct: 535 RNNFPARGKHCSFVMAALAFTIVVLWAWGENSFITTSQSVQAWYRTSYSG 584

BLAST of Sgr026412 vs. ExPASy TrEMBL
Match: A0A6J1BT15 (pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charantia OX=3673 GN=LOC111005503 PE=4 SV=1)

HSP 1 Score: 704.1 bits (1816), Expect = 4.0e-199
Identity = 357/431 (82.83%), Postives = 384/431 (89.10%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLPIR +HSWN ILA Y+R+G LSQARKFFDEMPHPNI+SYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPIRNVHSWNTILALYTRIGCLSQARKFFDEMPHPNIISYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           I SFTRHGLYVES+NIFR+MQQDFDLLVLDEFTLVS+ GTCACLG LA LRQ+HGAAIVI
Sbjct: 115 IYSFTRHGLYVESMNIFRKMQQDFDLLVLDEFTLVSIAGTCACLGALALLRQIHGAAIVI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFN+IV NAIIDAYGKCGEPDTSYSIFS+MQERDVVTWTSMVVA AQTS+LDDAF++F
Sbjct: 175 GLEFNVIVSNAIIDAYGKCGEPDTSYSIFSQMQERDVVTWTSMVVAYAQTSRLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
           SCMP+KN HTWTAL NAFA++KYSNEAL LFEQML+EKIS N+FTFVGVLSACADLALIA
Sbjct: 235 SCMPMKNVHTWTALINAFAKNKYSNEALDLFEQMLEEKISLNSFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGKQIHGLIIRSS SLNFLN YI NALIDMYSKSGDMKS RT FNL+  KDVVSWNSLIT
Sbjct: 295 KGKQIHGLIIRSSCSLNFLNVYIYNALIDMYSKSGDMKSARTLFNLMPEKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCYG-- 360
           G               RMIEVG+RPN+VTFLGVLSACSHTGL SEGLY+LELMEK +G  
Sbjct: 355 GFAQNGLGKEALIAFRRMIEVGIRPNKVTFLGVLSACSHTGLLSEGLYLLELMEKFFGIK 414

Query: 361 --LDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 413
             LDHYAVLIDMFGRKNRLAEALDLIARAPN S HVGIWGAVLGACRM ENL+LA+ AAE
Sbjct: 415 PSLDHYAVLIDMFGRKNRLAEALDLIARAPNRSNHVGIWGAVLGACRMHENLDLAMSAAE 474

BLAST of Sgr026412 vs. ExPASy TrEMBL
Match: A0A0A0KFI0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G476040 PE=4 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 3.0e-194
Identity = 346/431 (80.28%), Postives = 374/431 (86.77%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MYSKCNSM+NAQKAFDDLPIR IHSWN ILA YSR G  SQARK FDEMPHPNIVSYNTL
Sbjct: 55  MYSKCNSMENAQKAFDDLPIRNIHSWNTILASYSRAGFFSQARKVFDEMPHPNIVSYNTL 114

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ISSFT HGLYVES+NIFRQMQQDFDLL LDE TLVS+ GTCACLG L  LRQVHGAAIVI
Sbjct: 115 ISSFTHHGLYVESMNIFRQMQQDFDLLALDEITLVSIAGTCACLGALEFLRQVHGAAIVI 174

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            LEFNMIVCNAI+DAYGKCG+PD SYSIFSRM+ERDVVTWTSMVVA  QTS+LDDAF++F
Sbjct: 175 GLEFNMIVCNAIVDAYGKCGDPDASYSIFSRMKERDVVTWTSMVVAYNQTSRLDDAFRVF 234

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
           SCMP+KN HTWTAL NA  ++KYSNEAL LF+QML+EK SPNAFTFVGVLSACADLALIA
Sbjct: 235 SCMPVKNVHTWTALINALVKNKYSNEALDLFQQMLEEKTSPNAFTFVGVLSACADLALIA 294

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
           KGK+IHGLIIR SS LNF N Y+CNALID+YSKSGD+KS R  FNLIL KDVVSWNSLIT
Sbjct: 295 KGKEIHGLIIRRSSELNFPNVYVCNALIDLYSKSGDVKSARMLFNLILEKDVVSWNSLIT 354

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCY--- 360
           G               +M EVG+RPN+VTFL VLSACSHTGLSSEGL ILELMEK Y   
Sbjct: 355 GFAQNGLGREALLAFRKMTEVGIRPNKVTFLAVLSACSHTGLSSEGLCILELMEKFYDIE 414

Query: 361 -GLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 413
             L+HYAV+IDMFGR+NRLAEALDLI+RAPNGSKHVGIWGAVLGACR+ ENL+LAIRAAE
Sbjct: 415 PSLEHYAVMIDMFGRENRLAEALDLISRAPNGSKHVGIWGAVLGACRIHENLDLAIRAAE 474

BLAST of Sgr026412 vs. TAIR 10
Match: AT2G21090.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 256.5 bits (654), Expect = 4.3e-68
Identity = 147/432 (34.03%), Postives = 237/432 (54.86%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MY KC    +A K FD + +R ++SWN +++ Y + G+L +AR  FD MP  ++VS+NT+
Sbjct: 91  MYMKCGKPIDACKVFDQMHLRNLYSWNNMVSGYVKSGMLVRARVVFDSMPERDVVSWNTM 150

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           +  + + G   E++  +++ ++    +  +EF+   ++  C     L   RQ HG  +V 
Sbjct: 151 VIGYAQDGNLHEALWFYKEFRRSG--IKFNEFSFAGLLTACVKSRQLQLNRQAHGQVLVA 210

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
               N+++  +IIDAY KCG+ +++   F  M  +D+  WT+++   A+   ++ A +LF
Sbjct: 211 GFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVKDIHIWTTLISGYAKLGDMEAAEKLF 270

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
             MP KN  +WTAL   + R    N AL LF +M+   + P  FTF   L A A +A + 
Sbjct: 271 CEMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLR 330

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGK-DVVSWNSLI 300
            GK+IHG +IR++      NA + ++LIDMYSKSG +++    F +   K D V WN++I
Sbjct: 331 HGKEIHGYMIRTNVR---PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMI 390

Query: 301 TG---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCYGL 360
           +                 MI+  ++PN  T + +L+ACSH+GL  EGL   E M   +G+
Sbjct: 391 SALAQHGLGHKALRMLDDMIKFRVQPNRTTLVVILNACSHSGLVEEGLRWFESMTVQHGI 450

Query: 361 ----DHYAVLIDMFGRKNRLAEALDLIARAP-NGSKHVGIWGAVLGACRMRENLNLAIRA 412
               +HYA LID+ GR     E +  I   P    KH  IW A+LG CR+  N  L  +A
Sbjct: 451 VPDQEHYACLIDLLGRAGCFKELMRKIEEMPFEPDKH--IWNAILGVCRIHGNEELGKKA 510

BLAST of Sgr026412 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 245.4 bits (625), Expect = 9.9e-65
Identity = 141/439 (32.12%), Postives = 240/439 (54.67%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MY+KC     A+  FD + +R I SWN ++A + +VG +  A   F++M   +IV++N++
Sbjct: 190 MYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSM 249

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           IS F + G  + +++IF +M +D  LL  D FTL S++  CA L  L   +Q+H   +  
Sbjct: 250 ISGFNQRGYDLRALDIFSKMLRD-SLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTT 309

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERD--VVTWTSMVVACAQTSKLDDAFQ 180
             + + IV NA+I  Y +CG  +T+  +  +   +D  +  +T+++    +   ++ A  
Sbjct: 310 GFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKN 369

Query: 181 LFSCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLAL 240
           +F  +  ++   WTA+   + +     EA+ LF  M+     PN++T   +LS  + LA 
Sbjct: 370 IFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLAS 429

Query: 241 IAKGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLI-LGKDVVSWNS 300
           ++ GKQIHG  ++S       +  + NALI MY+K+G++ S   +F+LI   +D VSW S
Sbjct: 430 LSHGKQIHGSAVKSG---EIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTS 489

Query: 301 LITG---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEK-- 360
           +I                  M+  G+RP+ +T++GV SAC+H GL ++G    ++M+   
Sbjct: 490 MIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVD 549

Query: 361 --CYGLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIR 418
                L HYA ++D+FGR   L EA + I + P     V  WG++L ACR+ +N++L   
Sbjct: 550 KIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPI-EPDVVTWGSLLSACRVHKNIDLGKV 609

BLAST of Sgr026412 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 231.9 bits (590), Expect = 1.1e-60
Identity = 134/413 (32.45%), Postives = 223/413 (54.00%), Query Frame = 0

Query: 23  IHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTLISSFTRHGLYVESINIFRQMQQ 82
           ++  + ++  YS+ G ++ A++ FDEM   N+VS+N+LI+ F ++G  VE++++F+ M +
Sbjct: 187 VYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLE 246

Query: 83  DFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIV-IRLEFNMIVCNAIIDAYGKCGE 142
               +  DE TL S++  CA L  +   ++VHG  +   +L  ++I+ NA +D Y KC  
Sbjct: 247 --SRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSR 306

Query: 143 PDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLFSCMPIKNAHTWTALSNAFARS 202
              +  IF  M  R+V+  TSM+   A  +    A  +F+ M  +N  +W AL   + ++
Sbjct: 307 IKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQN 366

Query: 203 KYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIAKGKQIHGLIIRSS---SSLNF 262
             + EAL LF  + +E + P  ++F  +L ACADLA +  G Q H  +++      S   
Sbjct: 367 GENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEE 426

Query: 263 LNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLITG---------------RM 322
            + ++ N+LIDMY K G ++     F  ++ +D VSWN++I G                M
Sbjct: 427 DDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREM 486

Query: 323 IEVGMRPNEVTFLGVLSACSHTGLSSEGLYILELMEKCYGL----DHYAVLIDMFGRKNR 382
           +E G +P+ +T +GVLSAC H G   EG +    M + +G+    DHY  ++D+ GR   
Sbjct: 487 LESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGF 546

Query: 383 LAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAETLFKMEPDNAG 413
           L EA  +I   P     V IWG++L AC++  N+ L    AE L ++EP N+G
Sbjct: 547 LEEAKSMIEEMPMQPDSV-IWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSG 596

BLAST of Sgr026412 vs. TAIR 10
Match: AT4G37170.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 227.3 bits (578), Expect = 2.8e-59
Identity = 137/429 (31.93%), Postives = 230/429 (53.61%), Query Frame = 0

Query: 1   MYSKCNSMQNAQKAFDDLPIRTIHSWNIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTL 60
           MY+KC S+ +A+K FD++P R + SWN+++  Y+ VGLL +ARK FDEM   +  S+  +
Sbjct: 129 MYAKCGSLVDARKVFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMTEKDSYSWTAM 188

Query: 61  ISSFTRHGLYVESINIFRQMQQDFDLLVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVI 120
           ++ + +     E++ ++  MQ+  +    + FT+   V   A +  +   +++HG  +  
Sbjct: 189 VTGYVKKDQPEEALVLYSLMQRVPNSRP-NIFTVSIAVAAAAAVKCIRRGKEIHGHIVRA 248

Query: 121 RLEFNMIVCNAIIDAYGKCGEPDTSYSIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLF 180
            L+ + ++ ++++D YGKCG  D + +IF ++ E+DVV+WTSM+    ++S+  + F LF
Sbjct: 249 GLDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLF 308

Query: 181 SCMPIKNAHTWTALSNAFARSKYSNEALGLFEQMLKEKISPNAFTFVGVLSACADLALIA 240
           S                        E +G  E+       PN +TF GVL+ACADL    
Sbjct: 309 S------------------------ELVGSCER-------PNEYTFAGVLNACADLTTEE 368

Query: 241 KGKQIHGLIIRSSSSLNFLNAYICNALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLIT 300
            GKQ+HG + R         ++  ++L+DMY+K G+++S +   +     D+VSW SLI 
Sbjct: 369 LGKQVHGYMTRVGFD---PYSFASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIG 428

Query: 301 G---------------RMIEVGMRPNEVTFLGVLSACSHTGLSSEGL-YILELMEK---C 360
           G                +++ G +P+ VTF+ VLSAC+H GL  +GL +   + EK    
Sbjct: 429 GCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSITEKHRLS 488

Query: 361 YGLDHYAVLIDMFGRKNRLAEALDLIARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAE 411
           +  DHY  L+D+  R  R  +   +I+  P       +W +VLG C    N++LA  AA+
Sbjct: 489 HTSDHYTCLVDLLARSGRFEQLKSVISEMPMKPSKF-LWASVLGGCSTYGNIDLAEEAAQ 521

BLAST of Sgr026412 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 226.5 bits (576), Expect = 4.8e-59
Identity = 135/407 (33.17%), Postives = 218/407 (53.56%), Query Frame = 0

Query: 27  NIILAFYSRVGLLSQARKFFDEMPHPNIVSYNTLISSFTRHGLYVESINIFRQMQQDFDL 86
           N ++  Y   G L  A K F  +   ++VS+N++I+ F + G   +++ +F++M+ +   
Sbjct: 170 NSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESED-- 229

Query: 87  LVLDEFTLVSMVGTCACLGVLASLRQVHGAAIVIRLEFNMIVCNAIIDAYGKCGEPDTSY 146
           +     T+V ++  CA +  L   RQV       R+  N+ + NA++D Y KCG  + + 
Sbjct: 230 VKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAK 289

Query: 147 SIFSRMQERDVVTWTSMVVACAQTSKLDDAFQLFSCMPIKNAHTWTALSNAFARSKYSNE 206
            +F  M+E+D VTWT+M+   A +   + A ++ + MP K+   W AL +A+ ++   NE
Sbjct: 290 RLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNE 349

Query: 207 ALGLFEQM-LKEKISPNAFTFVGVLSACADLALIAKGKQIHGLIIRSSSSLNFLNAYICN 266
           AL +F ++ L++ +  N  T V  LSACA +  +  G+ IH  I +    +NF   ++ +
Sbjct: 350 ALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNF---HVTS 409

Query: 267 ALIDMYSKSGDMKSDRTSFNLILGKDVVSWNSLITG---------------RMIEVGMRP 326
           ALI MYSK GD++  R  FN +  +DV  W+++I G               +M E  ++P
Sbjct: 410 ALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKP 469

Query: 327 NEVTFLGVLSACSHTGLSSEGLYILELMEKCYGL----DHYAVLIDMFGRKNRLAEALDL 386
           N VTF  V  ACSHTGL  E   +   ME  YG+     HYA ++D+ GR   L +A+  
Sbjct: 470 NGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKF 529

Query: 387 IARAPNGSKHVGIWGAVLGACRMRENLNLAIRAAETLFKMEPDNAGS 414
           I   P       +WGA+LGAC++  NLNLA  A   L ++EP N G+
Sbjct: 530 IEAMPI-PPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGA 570

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023543897.11.7e-21272.37pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita... [more]
XP_023543899.12.3e-21272.64pentatricopeptide repeat-containing protein At2g21090-like isoform X2 [Cucurbita... [more]
XP_022925937.11.5e-21171.99pentatricopeptide repeat-containing protein At2g21090-like isoform X3 [Cucurbita... [more]
XP_022925935.11.5e-21171.99pentatricopeptide repeat-containing protein At2g21090-like isoform X1 [Cucurbita... [more]
XP_022925936.11.9e-21172.26pentatricopeptide repeat-containing protein At2g21090-like isoform X2 [Cucurbita... [more]
Match NameE-valueIdentityDescription
Q9SKQ46.0e-6734.03Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX... [more]
Q9SHZ81.4e-6332.12Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
Q9SIT71.6e-5932.45Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
O231693.9e-5831.93Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX... [more]
O823806.7e-5833.17Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1EJM27.1e-21271.99pentatricopeptide repeat-containing protein At2g21090-like isoform X1 OS=Cucurbi... [more]
A0A6J1ECZ47.1e-21271.99pentatricopeptide repeat-containing protein At2g21090-like isoform X3 OS=Cucurbi... [more]
A0A6J1EDI39.3e-21272.26pentatricopeptide repeat-containing protein At2g21090-like isoform X2 OS=Cucurbi... [more]
A0A6J1BT154.0e-19982.83pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charanti... [more]
A0A0A0KFI03.0e-19480.28Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G476040 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G21090.14.3e-6834.03Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT2G22070.19.9e-6532.12pentatricopeptide (PPR) repeat-containing protein [more]
AT2G13600.11.1e-6032.45Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G37170.12.8e-5931.93Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.14.8e-5933.17Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 186..234
e-value: 5.0E-9
score: 36.2
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 51..81
e-value: 5.5E-6
score: 26.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 190..223
e-value: 7.7E-5
score: 20.6
coord: 24..49
e-value: 0.0025
score: 15.9
coord: 128..158
e-value: 5.9E-6
score: 24.1
coord: 55..82
e-value: 3.2E-5
score: 21.8
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 313..341
e-value: 1.3
score: 9.4
coord: 158..184
e-value: 2.3E-4
score: 21.2
coord: 128..156
e-value: 2.4E-5
score: 24.3
coord: 25..49
e-value: 0.0057
score: 16.8
coord: 263..278
e-value: 1.0
score: 9.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 187..221
score: 11.027125
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 125..159
score: 10.347525
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 53..83
score: 9.832344
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 109..186
e-value: 5.6E-13
score: 51.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 196..301
e-value: 3.9E-14
score: 54.7
coord: 302..435
e-value: 3.1E-14
score: 55.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 1..108
e-value: 4.4E-19
score: 70.5
NoneNo IPR availablePANTHERPTHR47926:SF232SUBFAMILY NOT NAMEDcoord: 87..412
NoneNo IPR availablePANTHERPTHR47926:SF232SUBFAMILY NOT NAMEDcoord: 1..86
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 1..86
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 87..412

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr026412.1Sgr026412.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0046872 metal ion binding
molecular_function GO:0005515 protein binding