Sgr016624 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr016624
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00152977: 910437 .. 912807 (+)
RNA-Seq Expression: Sgr016624
Synteny: Sgr016624
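The Location field above packs contig, coordinates, and strand into one string. The following is a minimal sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (the record does not state this). The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig: start .. end (strand)' string."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    # Expected shape: "<contig>: <start> .. <end> (<+ or ->)"
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    # Assumption: both endpoints are included in the feature (1-based, inclusive).
    return loc.end - loc.start + 1


loc = parse_location("tig00152977: 910437 .. 912807 (+)")
print(loc.contig, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the feature spans 2371 bases on the plus strand of contig tig00152977.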
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATGCTCTTAGCAATTGGTGTCCAACTGGTAGCTCCGGCGTTCAATTGGGTTCTTCTTGCAGGCTTCATGGGTCTAGGAAAAGGGTAAAGTGCGTTGGGTTTTCTGATTGTTGTTGTGGAAATGGCGGTTTCGGATTGATTTCCTTCAATCTAAGAGTTTTGAGAAGTTTGTTTTGCTATGAGAATTCGAGATTCGACTGTGGATGTGAGTTCGGCCATGGCTGTTCTAAGCTTAGAGTTGCTCGCTTAATGAAGCCGAAGAGAAATTCTCTGGGCGCATGGTTTTTATCTGCTTGGGCTGTTGGACAACAGACGATTGATAATGAAATTGTTAGGGTTGAATCGAATTCTGAAGATGATTTGCCTGAGAGAAGTGAGAGGGAGGGCTATGGCGGTTTACATTGGGATGATCATGACAATGATAATGGTGAAAATAGCCATGGAGGAGGAGATTTTAAAGAGGAGGAAGGAATGGAGGGAGAGGAAGATGTTAGGGTTGATGTTCTTGCCCTAGCGTGTCAGTTGCAGCTTGCCCGAACAGCAGATGATGTTGAAGAAGTTCTCAAGGATGTGGGTGAATTGCCTCTTCAAGTGTTCTCATCCATGATTAGAGGTTTTGGTAGAGACAGAAGGTTGGAGTGTGCAGTGGTTCTTGTTGATTGGCTGAAGAGAAAGAAGCTCGAAACTAATGGTCGTATCGCTCCGAACTTGTTCATATACAACAGTCTTCTCGGTGCAATTAAGCAATCTGCAGAGTTTTCAAAAATGCAAGATGTCTTGACTGATATGGCACGGGAAGGAATCGATTCGAATGTCGTCACATACAACACGATCATGTCGATTTACTTGGAACAAGGACTAGCAATGAAAGCTCTTGGCATTCTTGAAGAGATGCCGAAGAAAGGTCTAACTCCGTCTCCCGTATCCTACTCTACAGCCTTACGAGCATACCGAAGGCTGAAAGATGGGAATGGAGCTTTAAAGTTCATGATTGAGTTGAGAGAAAGATATCGTAATGGTGAGATAGCAAAAGATGATAATGTAGATTGGGCTGACGAATTCTTGAAGCTTGAAAACTTTACAAAACGTGTTTGCTACCAAGTAATGAGGATTTGGCTTGTGAAGGGCGATAACGCAAGCACGAAGGTGTTGCAACTTCTCATGGAAATGGATAAGGCAGGACTGTCACTTGGTCGTGTCGAGGAGGAACGACTTATTTGGGCTTGTACGTGTGCAGAACACCATAATGTAGCAAAAGAATTGTACTACAGGATAAGAGAAAAGCAGTCTGGTATAAGCTTATCTGTTTGCAATCATGTGATTTGGTTGATGGGGAAAGCTAAGAAGTGGTGGGCAGCATTGGAGATTTATGAAGATTTGTTGGACAAAGGACCTAAACCAAATAACATGTCATATGAACTAATTGTCTCTCACTTCAATGTTCTTCTCACTGCTGCAAAGAAAAGGGGGATTTGGAGATGGGGTGTGAGGTTACTCAACAAAATGGAAGAGAAAGGTCTTAAACCCGGAAGTCGGGAGTGGAATGCTGTTCTTGTTGCCTGTTCCAAAGCTGCAGAAACTTCTGCAGCTATAGAAATCTTTAGGAGGATGGTCGAACAAGGTGAAAAACCCACTGTCCTTTCGTACGGGGCATTACTTAGCGCCCTGGAAAAGGGAAAACTCTTTGATGAAGCTCGTAGTGTGTGGGATCATATGATTAAAGTCGGGGTGGAGCCGAACATCTATGCCTATACAACTTTGGCTTCAGTTTTCACTGGCCAAGGAAAGTTTAATATGGTTGAAGTCACCATTAATGAGATGGTTTCATCTGGCATTGAGCCAACAGTCGTCACGTACAATGCAATAATCACTGGTTGTGTACGTAATGGTATGAGCAGTGTAGCTTACGAGTGGTTTCACCGCATGAAAGTTCGAAATATCTCTCCGAACGAGGTGAGTTACGAGTTGCTCATCGAGGCCCTTGCAAAGGAAGGTAA
ACCAAGGCTTGCTTATGAGTTATACATGAGGGCTAAGAATGAGGGTCTAAATCTTTCTTCTAAGATATATTATGCAGTGATTCAATCCTCTCAAGTTTATGGAGCGTCCATTGATGTAAGAGCATTAGGGCCTCCACCTGATAAGAACAAGAGTTCATGGATTAAGAAAAAAACGACTTTTGACGAATGTTCGTAACTCTGCTGATGTTCCCAGGAAAAATGAACGGTTTCAAAGAAAGGAAGCCAATGATAGATGATTTCTCATATCCTGATCTCTTAATTTGTGCAGGAAGCATTATGAGCAACTCAGGAAGCCTATGGCTTCTGATGACCCGACCCGAGCACATACCGAGCTGGAGATAGCCAATTGA

mRNA sequence

ATGCATGCTCTTAGCAATTGGTGTCCAACTGGTAGCTCCGGCGTTCAATTGGGTTCTTCTTGCAGGCTTCATGGGTCTAGGAAAAGGGTAAAGTGCGTTGGGTTTTCTGATTGTTGTTGTGGAAATGGCGGTTTCGGATTGATTTCCTTCAATCTAAGAGTTTTGAGAAGTTTGTTTTGCTATGAGAATTCGAGATTCGACTGTGGATGTGAGTTCGGCCATGGCTGTTCTAAGCTTAGAGTTGCTCGCTTAATGAAGCCGAAGAGAAATTCTCTGGGCGCATGGTTTTTATCTGCTTGGGCTGTTGGACAACAGACGATTGATAATGAAATTGTTAGGGTTGAATCGAATTCTGAAGATGATTTGCCTGAGAGAAGTGAGAGGGAGGGCTATGGCGGTTTACATTGGGATGATCATGACAATGATAATGGTGAAAATAGCCATGGAGGAGGAGATTTTAAAGAGGAGGAAGGAATGGAGGGAGAGGAAGATGTTAGGGTTGATGTTCTTGCCCTAGCGTGTCAGTTGCAGCTTGCCCGAACAGCAGATGATGTTGAAGAAGTTCTCAAGGATGTGGGTGAATTGCCTCTTCAAGTGTTCTCATCCATGATTAGAGGTTTTGGTAGAGACAGAAGGTTGGAGTGTGCAGTGGTTCTTGTTGATTGGCTGAAGAGAAAGAAGCTCGAAACTAATGGTCGTATCGCTCCGAACTTGTTCATATACAACAGTCTTCTCGGTGCAATTAAGCAATCTGCAGAGTTTTCAAAAATGCAAGATGTCTTGACTGATATGGCACGGGAAGGAATCGATTCGAATGTCGTCACATACAACACGATCATGTCGATTTACTTGGAACAAGGACTAGCAATGAAAGCTCTTGGCATTCTTGAAGAGATGCCGAAGAAAGGTCTAACTCCGTCTCCCGTATCCTACTCTACAGCCTTACGAGCATACCGAAGGCTGAAAGATGGGAATGGAGCTTTAAAGTTCATGATTGAGTTGAGAGAAAGATATCGTAATGGTGAGATAGCAAAAGATGATAATGTAGATTGGGCTGACGAATTCTTGAAGCTTGAAAACTTTACAAAACGTGTTTGCTACCAAGTAATGAGGATTTGGCTTGTGAAGGGCGATAACGCAAGCACGAAGGTGTTGCAACTTCTCATGGAAATGGATAAGGCAGGACTGTCACTTGGTCGTGTCGAGGAGGAACGACTTATTTGGGCTTGTACGTGTGCAGAACACCATAATGTAGCAAAAGAATTGTACTACAGGATAAGAGAAAAGCAGTCTGGTATAAGCTTATCTGTTTGCAATCATGTGATTTGGTTGATGGGGAAAGCTAAGAAGTGGTGGGCAGCATTGGAGATTTATGAAGATTTGTTGGACAAAGGACCTAAACCAAATAACATGTCATATGAACTAATTGTCTCTCACTTCAATGTTCTTCTCACTGCTGCAAAGAAAAGGGGGATTTGGAGATGGGGTGTGAGGTTACTCAACAAAATGGAAGAGAAAGGTCTTAAACCCGGAAGTCGGGAGTGGAATGCTGTTCTTGTTGCCTGTTCCAAAGCTGCAGAAACTTCTGCAGCTATAGAAATCTTTAGGAGGATGGTCGAACAAGGTGAAAAACCCACTGTCCTTTCGTACGGGGCATTACTTAGCGCCCTGGAAAAGGGAAAACTCTTTGATGAAGCTCGTAGTGTGTGGGATCATATGATTAAAGTCGGGGTGGAGCCGAACATCTATGCCTATACAACTTTGGCTTCAGTTTTCACTGGCCAAGGAAAGTTTAATATGGTTGAAGTCACCATTAATGAGATGGTTTCATCTGGCATTGAGCCAACAGTCGTCACGTACAATGCAATAATCACTGGTTGTGTACGTAATGGTATGAGCAGTGTAGCTTACGAGTGGTTTCACCGCATGAAAGTTCGAAATATCTCTCCGAACGAGGTGAGTTACGAGTTGCTCATCGAGGCCCTTGCAAAGGAAGGTAA
ACCAAGGCTTGCTTATGAGTTATACATGAGGGCTAAGAATGAGGGTCTAAATCTTTCTTCTAAGATATATTATGCAGTGATTCAATCCTCTCAAGTTTATGGAGCGTCCATTGATGTAAGAGCATTAGGGCCTCCACCTGATAAGAACAAGAGTTCATGGATTAAGAAAAAAACGACTTTTGACGAATGTTCGAAGCATTATGAGCAACTCAGGAAGCCTATGGCTTCTGATGACCCGACCCGAGCACATACCGAGCTGGAGATAGCCAATTGA

Coding sequence (CDS)

ATGCATGCTCTTAGCAATTGGTGTCCAACTGGTAGCTCCGGCGTTCAATTGGGTTCTTCTTGCAGGCTTCATGGGTCTAGGAAAAGGGTAAAGTGCGTTGGGTTTTCTGATTGTTGTTGTGGAAATGGCGGTTTCGGATTGATTTCCTTCAATCTAAGAGTTTTGAGAAGTTTGTTTTGCTATGAGAATTCGAGATTCGACTGTGGATGTGAGTTCGGCCATGGCTGTTCTAAGCTTAGAGTTGCTCGCTTAATGAAGCCGAAGAGAAATTCTCTGGGCGCATGGTTTTTATCTGCTTGGGCTGTTGGACAACAGACGATTGATAATGAAATTGTTAGGGTTGAATCGAATTCTGAAGATGATTTGCCTGAGAGAAGTGAGAGGGAGGGCTATGGCGGTTTACATTGGGATGATCATGACAATGATAATGGTGAAAATAGCCATGGAGGAGGAGATTTTAAAGAGGAGGAAGGAATGGAGGGAGAGGAAGATGTTAGGGTTGATGTTCTTGCCCTAGCGTGTCAGTTGCAGCTTGCCCGAACAGCAGATGATGTTGAAGAAGTTCTCAAGGATGTGGGTGAATTGCCTCTTCAAGTGTTCTCATCCATGATTAGAGGTTTTGGTAGAGACAGAAGGTTGGAGTGTGCAGTGGTTCTTGTTGATTGGCTGAAGAGAAAGAAGCTCGAAACTAATGGTCGTATCGCTCCGAACTTGTTCATATACAACAGTCTTCTCGGTGCAATTAAGCAATCTGCAGAGTTTTCAAAAATGCAAGATGTCTTGACTGATATGGCACGGGAAGGAATCGATTCGAATGTCGTCACATACAACACGATCATGTCGATTTACTTGGAACAAGGACTAGCAATGAAAGCTCTTGGCATTCTTGAAGAGATGCCGAAGAAAGGTCTAACTCCGTCTCCCGTATCCTACTCTACAGCCTTACGAGCATACCGAAGGCTGAAAGATGGGAATGGAGCTTTAAAGTTCATGATTGAGTTGAGAGAAAGATATCGTAATGGTGAGATAGCAAAAGATGATAATGTAGATTGGGCTGACGAATTCTTGAAGCTTGAAAACTTTACAAAACGTGTTTGCTACCAAGTAATGAGGATTTGGCTTGTGAAGGGCGATAACGCAAGCACGAAGGTGTTGCAACTTCTCATGGAAATGGATAAGGCAGGACTGTCACTTGGTCGTGTCGAGGAGGAACGACTTATTTGGGCTTGTACGTGTGCAGAACACCATAATGTAGCAAAAGAATTGTACTACAGGATAAGAGAAAAGCAGTCTGGTATAAGCTTATCTGTTTGCAATCATGTGATTTGGTTGATGGGGAAAGCTAAGAAGTGGTGGGCAGCATTGGAGATTTATGAAGATTTGTTGGACAAAGGACCTAAACCAAATAACATGTCATATGAACTAATTGTCTCTCACTTCAATGTTCTTCTCACTGCTGCAAAGAAAAGGGGGATTTGGAGATGGGGTGTGAGGTTACTCAACAAAATGGAAGAGAAAGGTCTTAAACCCGGAAGTCGGGAGTGGAATGCTGTTCTTGTTGCCTGTTCCAAAGCTGCAGAAACTTCTGCAGCTATAGAAATCTTTAGGAGGATGGTCGAACAAGGTGAAAAACCCACTGTCCTTTCGTACGGGGCATTACTTAGCGCCCTGGAAAAGGGAAAACTCTTTGATGAAGCTCGTAGTGTGTGGGATCATATGATTAAAGTCGGGGTGGAGCCGAACATCTATGCCTATACAACTTTGGCTTCAGTTTTCACTGGCCAAGGAAAGTTTAATATGGTTGAAGTCACCATTAATGAGATGGTTTCATCTGGCATTGAGCCAACAGTCGTCACGTACAATGCAATAATCACTGGTTGTGTACGTAATGGTATGAGCAGTGTAGCTTACGAGTGGTTTCACCGCATGAAAGTTCGAAATATCTCTCCGAACGAGGTGAGTTACGAGTTGCTCATCGAGGCCCTTGCAAAGGAAGGTAA
ACCAAGGCTTGCTTATGAGTTATACATGAGGGCTAAGAATGAGGGTCTAAATCTTTCTTCTAAGATATATTATGCAGTGATTCAATCCTCTCAAGTTTATGGAGCGTCCATTGATGTAAGAGCATTAGGGCCTCCACCTGATAAGAACAAGAGTTCATGGATTAAGAAAAAAACGACTTTTGACGAATGTTCGAAGCATTATGAGCAACTCAGGAAGCCTATGGCTTCTGATGACCCGACCCGAGCACATACCGAGCTGGAGATAGCCAATTGA

Protein sequence

MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFCYENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSEDDLPERSEREGYGGLHWDDHDNDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQLARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNLFIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKLENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNVAKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKFNMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALGPPPDKNKSSWIKKKTTFDECSKHYEQLRKPMASDDPTRAHTELEIAN
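The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below illustrates this on the first and last few codons of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1); the record itself does not name the table. `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 15 nucleotides of the CDS listed in this record.
print(translate("ATGCATGCTCTTAGCAATTGGTGTCCAACT"))  # start of the protein
print(translate("GAGATAGCCAATTGA"))                 # end of the protein plus stop
```

The first call yields `MHALSNWCPT` and the second `EIAN*`, matching the start and end of the listed protein sequence; the terminal `TGA` stop codon is not shown in the protein.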
Homology
BLAST of Sgr016624 vs. NCBI nr
Match: XP_022154192.1 (pentatricopeptide repeat-containing protein At3g46610 [Momordica charantia])

HSP 1 Score: 1306.2 bits (3379), Expect = 0.0e+00
Identity = 646/720 (89.72%), Positives = 678/720 (94.17%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           MHALSNWCPT SS V+LGSSC +  S KR+KCVGFSDCCCGNGGF LISFNLRV RS FC
Sbjct: 1   MHALSNWCPTSSSKVELGSSCVVRRSGKRLKCVGFSDCCCGNGGFSLISFNLRVFRSGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENS+FDC CEF HGCSKL VARLMKPKRNSLGAWFLSAWAV Q T+ NEIVRVESNSED
Sbjct: 61  YENSKFDCSCEFRHGCSKLIVARLMKPKRNSLGAWFLSAWAVEQPTVGNEIVRVESNSED 120

Query: 121 DLPERSEREGYGGLHWDDHDNDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQLAR 180
           DL ERSE EGYGGL WDDH N NGEN HGGGDFK+E+GMEGE DV VDV ALA +LQL R
Sbjct: 121 DLAERSEGEGYGGLDWDDHHNVNGENGHGGGDFKDEDGMEGEGDVWVDVRALAGRLQLTR 180

Query: 181 TADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNLFI 240
           TADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAV LVDWLKRKK+ET+GRIAPNLFI
Sbjct: 181 TADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETDGRIAPNLFI 240

Query: 241 YNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEEMP 300
           YNSLLGA+KQS  FSKM+DVL DMA+EGI SNV+TYNTIMSIYLEQGLAMKALGILEEMP
Sbjct: 241 YNSLLGAVKQSTVFSKMEDVLADMAQEGITSNVITYNTIMSIYLEQGLAMKALGILEEMP 300

Query: 301 KKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKLEN 360
           KKGLTPSPVSYST L+AYRR+KDGNGALKFM ELRE+YR+GE+AKDDNVDWADEF+KLEN
Sbjct: 301 KKGLTPSPVSYSTGLQAYRRMKDGNGALKFMTELREKYRSGEMAKDDNVDWADEFMKLEN 360

Query: 361 FTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNVAK 420
           FTKRVCYQVMRIWLVKG +ASTKVLQLL+EMDKAGLSL R EEERLIWACTCAEHHNVAK
Sbjct: 361 FTKRVCYQVMRIWLVKGYSASTKVLQLLVEMDKAGLSLDRAEEERLIWACTCAEHHNVAK 420

Query: 421 ELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHF 480
           ELYYRIREKQ GISLSVCNHVIWLMGKAKKWWAALEIYEDLL+KGPKPNNMSYELIVSHF
Sbjct: 421 ELYYRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHF 480

Query: 481 NVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVE 540
           NVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVE
Sbjct: 481 NVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVE 540

Query: 541 QGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKFNM 600
           QGEKPT+LSYGALLSALEKGKL+DEARSVWDHMIKVGV+PNIYAYTT+ASVFTGQGKFNM
Sbjct: 541 QGEKPTILSYGALLSALEKGKLYDEARSVWDHMIKVGVKPNIYAYTTMASVFTGQGKFNM 600

Query: 601 VEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIE 660
           VEVTIN+MVSSGIEPTVVTYNAIITGCVRNG+SSVAYEWFHRMKVRNISPNEVSYELLIE
Sbjct: 601 VEVTINDMVSSGIEPTVVTYNAIITGCVRNGLSSVAYEWFHRMKVRNISPNEVSYELLIE 660

Query: 661 ALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALG-PPPDKNKSS 720
           ALAKEGKPRLAYELY+RAKN+ LNLSSK Y AVIQSSQVYGASID+RALG PPPD NKSS
Sbjct: 661 ALAKEGKPRLAYELYLRAKNDSLNLSSKTYDAVIQSSQVYGASIDIRALGSPPPDTNKSS 720
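Each HSP summary line in the reports reproduced here gives match counts together with a percentage. The sketch below shows one way to recompute those percentages from the counts and compare them with the reported values. The regular expression is an assumption based only on the lines shown in this record (it accepts both "Positives" and the misspelling "Postives"), and `check_hsp_line` is a hypothetical helper.

```python
import re

# Pattern inferred from the summary lines in this record, not from a BLAST spec.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_line(line: str) -> dict:
    """Recompute identity/positives percentages and compare with the report."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = match.groups()
    # Percentage = 100 * matching columns / alignment length, two decimals.
    identity = f"{100 * int(id_n) / int(id_len):.2f}"
    positives = f"{100 * int(pos_n) / int(pos_len):.2f}"
    return {
        "identity": identity,
        "positives": positives,
        "consistent": identity == id_pct and positives == pos_pct,
    }


line = "Identity = 646/720 (89.72%), Positives = 678/720 (94.17%), Query Frame = 0"
print(check_hsp_line(line))
```

For the Momordica charantia hit, 646 identical columns out of an alignment length of 720 gives 89.72%, and 678 of 720 conservative-or-identical columns gives 94.17%, in agreement with the reported figures.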

BLAST of Sgr016624 vs. NCBI nr
Match: XP_038898205.1 (protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic [Benincasa hispida])

HSP 1 Score: 1258.0 bits (3254), Expect = 0.0e+00
Identity = 630/723 (87.14%), Positives = 663/723 (91.70%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           MH LSNWCPT SSGV+LGS   +H S  R+KC GFSDC CGNGGF LISFN  VLRS FC
Sbjct: 1   MHVLSNWCPTSSSGVELGSYSVVHRSWNRIKCFGFSDCSCGNGGFSLISFNSSVLRSGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENS F C CEF HGCSKL VA LMKPKRNSLGAW LSAWAV + TID+E+ RVES+S D
Sbjct: 61  YENSTFVCNCEFRHGCSKLGVASLMKPKRNSLGAWCLSAWAVEEPTIDDELARVESSSRD 120

Query: 121 DLPERSEREGYGGLHWD---DHDNDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQ 180
            LPERS       L WD   DHDN NGENSHGGG FK+EEGMEGE DVRVDV ALA QLQ
Sbjct: 121 GLPERS-------LEWDDDHDHDNVNGENSHGGGSFKDEEGMEGEGDVRVDVCALAAQLQ 180

Query: 181 LARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPN 240
           LARTADDV+EVLKDVGELPLQVFSSMIRGFGRDRRLECAV LV+WLKRKK+ETNGRI PN
Sbjct: 181 LARTADDVDEVLKDVGELPLQVFSSMIRGFGRDRRLECAVALVEWLKRKKIETNGRIGPN 240

Query: 241 LFIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILE 300
           LF YNSLLGA+KQS E SKM++VLTDMA+EGI SNVVTYNTIMSIYLEQGLAMKALGILE
Sbjct: 241 LFTYNSLLGAVKQSGELSKMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILE 300

Query: 301 EMPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLK 360
           EMPKKGLT SPVSYSTALRAYRR+KDGNGALKFMIELRERY NGEIAKDDNVDW +EFLK
Sbjct: 301 EMPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMIELRERYHNGEIAKDDNVDWTNEFLK 360

Query: 361 LENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHN 420
           LENFT+RVCYQVMRIWLVKGD ASTKVLQLLMEMDKAGLSL R E+ERLIWACTCAEHHN
Sbjct: 361 LENFTRRVCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEQERLIWACTCAEHHN 420

Query: 421 VAKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIV 480
           VAKELYYRIREKQ GISLSVCNHVIWLMGKAKKWWAALE+YEDLL+KGPKPNNMSYELIV
Sbjct: 421 VAKELYYRIREKQCGISLSVCNHVIWLMGKAKKWWAALEVYEDLLEKGPKPNNMSYELIV 480

Query: 481 SHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRR 540
           SHFNVLLTAAKKRGIWRWGVRLLNKMEEKGL+PG REWNAVLVACS+AAETSAAI+IFRR
Sbjct: 481 SHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGRREWNAVLVACSRAAETSAAIDIFRR 540

Query: 541 MVEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGK 600
           MVEQGEKPTVLSYGALLSALEKGKL+DEARSVWDHMI+VGVEPNIYAYTT+ASVFTGQGK
Sbjct: 541 MVEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGK 600

Query: 601 FNMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYEL 660
           FNMVEVTI++MV+SGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYEL
Sbjct: 601 FNMVEVTISDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYEL 660

Query: 661 LIEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALG-PPPDKN 720
           LIEALAKEGKPRLAYELYMRAK+EGLNLSSKIY AVIQSSQ+YGASID+R LG  PPDKN
Sbjct: 661 LIEALAKEGKPRLAYELYMRAKDEGLNLSSKIYDAVIQSSQLYGASIDIRLLGLRPPDKN 716

BLAST of Sgr016624 vs. NCBI nr
Match: XP_011651578.1 (protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic [Cucumis sativus])

HSP 1 Score: 1231.5 bits (3185), Expect = 0.0e+00
Identity = 620/722 (85.87%), Positives = 661/722 (91.55%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           MHALSNWCPT  SGV+LGS   +H S KRVK  GFSDC CGN GF LISFNL VLRS FC
Sbjct: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENSRF C CEF HGCSKLRV  LMK  RNSLGA+ LSAWAV Q TID+EI RVESNS D
Sbjct: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120

Query: 121 DLPERSEREGYGGLHWDDHDND--NGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQL 180
            LPER       GL WDD D+   NGENSHGGG FK+E  +EG  DVRVDV ALA QLQL
Sbjct: 121 GLPER-------GLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQL 180

Query: 181 ARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNL 240
           ARTADDV++VLKD+ ELPLQVFSSMIRGFGRDRRLECAV LVDWLKRKK+ETNGRIAPNL
Sbjct: 181 ARTADDVDQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNL 240

Query: 241 FIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGA+KQS E S+M++VLTDMA+EGI SNVVTYNTIMSIYLEQGLAMKALGILEE
Sbjct: 241 FIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEE 300

Query: 301 MPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKL 360
           MPKKGLT SPVSYSTALRAYRR+KDGNGALKFM+ELRERYRNGEIAKDDNVDWA+EFLKL
Sbjct: 301 MPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKL 360

Query: 361 ENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNV 420
           ENFT+RVCYQVMRIWLVKGD ASTKVLQLLMEMDKAGLSL R E ERLIWACTCAEH+NV
Sbjct: 361 ENFTRRVCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNV 420

Query: 421 AKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVS 480
           AKELY+RIREKQ GISLSVCNHVIWLMGKAKKWWAALEIYEDLL+KGPKPNNMSYELIVS
Sbjct: 421 AKELYFRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540
           HFNVLLTAAKKRGIWRWGVRLLNKMEEKGL+PGSREWNAVLVACS+AAETSAAI+IFR+M
Sbjct: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKF 600
           VEQGEKPTVLSYGALLSALEKGKL+DEARSVWDHMI+VGVEPNIYAYTT+ASVFTGQGKF
Sbjct: 541 VEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKF 600

Query: 601 NMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVEVTIN+MV+SGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL
Sbjct: 601 NMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALG-PPPDKNK 720
           IEALAKEGKPRLAYELYMRAK+EGLNLSSK+Y AVI+SSQ+YGAS++++ LG  PPD+NK
Sbjct: 661 IEALAKEGKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNK 715

BLAST of Sgr016624 vs. NCBI nr
Match: KAA0066960.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK30239.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1221.8 bits (3160), Expect = 0.0e+00
Identity = 617/722 (85.46%), Positives = 652/722 (90.30%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           MH LSNWCPT  SGV LGS   +H S KR+KC GFSDCCCGN GF LISFNL VL S FC
Sbjct: 1   MHVLSNWCPTSCSGVDLGSYSVVHRSWKRIKCFGFSDCCCGNWGFSLISFNLSVLGSGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENSRF C CEF HG SKLRV  LMKP RNSL AW LSAW V Q TI +E+ RVESNS D
Sbjct: 61  YENSRFVCNCEFRHGYSKLRVVPLMKPNRNSLEAWCLSAWTVEQPTIGDELPRVESNSRD 120

Query: 121 DLPERSEREGYGGLHW--DDHDNDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQL 180
            LPER        L W  DD DN NGENSHGGG FK+E  MEG  DVRVDV ALA QLQL
Sbjct: 121 GLPERR-------LDWDGDDDDNVNGENSHGGGSFKDEGEMEGVGDVRVDVRALAAQLQL 180

Query: 181 ARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNL 240
           ARTADDV++VLKD+ ELPLQVFSSMIRGFGRDRRLECAV LVDWLKRKK+ETNGRIAPNL
Sbjct: 181 ARTADDVDQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNL 240

Query: 241 FIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGA+KQS E SKM++VLT+MA+EGI SNVVTYNTIMSIYLEQGLA KALGILEE
Sbjct: 241 FIYNSLLGAVKQSGELSKMENVLTEMAQEGIVSNVVTYNTIMSIYLEQGLATKALGILEE 300

Query: 301 MPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKL 360
           MPKKGLT SPVSYSTALRAYR++KDGNGAL+FM+ELRERY NGEIAKDDNVDWA+EFLKL
Sbjct: 301 MPKKGLTLSPVSYSTALRAYRKMKDGNGALEFMVELRERYHNGEIAKDDNVDWANEFLKL 360

Query: 361 ENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNV 420
           ENFT+RVCYQVMRIWLVKGD ASTKVLQLLMEMDKAGLSL R EEERLIWACTCAEH+NV
Sbjct: 361 ENFTRRVCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEEERLIWACTCAEHYNV 420

Query: 421 AKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVS 480
           AKELY RIREKQ GISLSVCNHVIWLMGKAKKWWAALEIYE+LL+KGPKPNNMSYELIVS
Sbjct: 421 AKELYIRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEELLEKGPKPNNMSYELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540
           HFNVLLTAAKKRGIWRWGVRLLNKMEEKGL+PGSREWNAVLVACS+AAETSAAI+IFRRM
Sbjct: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRRM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKF 600
           VEQGEKPTVLSYGALLSALEKGKL+DEARSVWDHMI+VGVEPNIYAYTT+ASVFT QGKF
Sbjct: 541 VEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTSQGKF 600

Query: 601 NMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVEVTIN+MV+SGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL
Sbjct: 601 NMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALG-PPPDKNK 720
           IEALAKEGKPRLAYELY RAK+EGLNLSSKIY AVI+SSQ+YGASID+R LG  PPDKNK
Sbjct: 661 IEALAKEGKPRLAYELYRRAKDEGLNLSSKIYDAVIESSQLYGASIDIRLLGLRPPDKNK 715

BLAST of Sgr016624 vs. NCBI nr
Match: XP_022989603.1 (pentatricopeptide repeat-containing protein At3g46610-like [Cucurbita maxima])

HSP 1 Score: 1209.9 bits (3129), Expect = 0.0e+00
Identity = 606/722 (83.93%), Positives = 657/722 (91.00%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           M  LS+WCP+ SSG++LG S  ++GSRKR+ CVGFS  CCGNGGF LI F+  VLR  FC
Sbjct: 1   MSILSDWCPS-SSGLELGCSSVVNGSRKRINCVGFSG-CCGNGGFSLIPFSSSVLRCGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENS+FDC  EF HGCSKLRVARLMKPKRNSLG WFLSAWAV Q TID EIVRV+SN  D
Sbjct: 61  YENSKFDCNFEFRHGCSKLRVARLMKPKRNSLGVWFLSAWAVEQPTIDGEIVRVQSNCGD 120

Query: 121 DLPERSEREGYGGLHWDDHDND--NGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQL 180
           D PE+S       L WDDHD+D  N ENS+ G  FK+EEG+EGE DV+VDV ALA +L+L
Sbjct: 121 DFPEKS-------LDWDDHDHDTVNSENSN-GRSFKDEEGIEGEGDVKVDVRALAGRLEL 180

Query: 181 ARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNL 240
           ART DDVEEVLKDVGELPLQVFSS+I+GFGRD+RL CA+ LV+WLK +K++TNGRIAPNL
Sbjct: 181 ARTVDDVEEVLKDVGELPLQVFSSLIKGFGRDKRLGCALALVEWLKTRKIKTNGRIAPNL 240

Query: 241 FIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGA+KQS EFSKM+D+L DM++EGI SNVVTYNTIMSIYL+QGL MKAL ILEE
Sbjct: 241 FIYNSLLGAVKQSGEFSKMEDILNDMSQEGIVSNVVTYNTIMSIYLDQGLPMKALDILEE 300

Query: 301 MPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKL 360
           MPKKGLT SPVSYSTALRAYRR+KDGNGALKFM+ELRERYRNGEIAKDDNVDW DEFLKL
Sbjct: 301 MPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWDDEFLKL 360

Query: 361 ENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNV 420
           ENFT+RVCYQVMRIWLVKGD+ASTKVLQLL EMDKAGLSL R EEERL+WACTCAEHHNV
Sbjct: 361 ENFTRRVCYQVMRIWLVKGDSASTKVLQLLTEMDKAGLSLDRAEEERLVWACTCAEHHNV 420

Query: 421 AKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVS 480
           AKELYYRIREK+SGISLSVCNHVIWLMGKAKKWWAALEIYEDLL+KGPKPNN+SYELIVS
Sbjct: 421 AKELYYRIREKKSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNLSYELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540
           HFNVLLTAAK RGIWRWGVRLLNKMEEKGLKPG REWNAVLVACS+AAETS AIEIFRRM
Sbjct: 481 HFNVLLTAAKNRGIWRWGVRLLNKMEEKGLKPGIREWNAVLVACSRAAETSMAIEIFRRM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKF 600
           V+QGEKPTVLSYGALLSALEKGKL+DEARSVWDHMIKVGV PNIYAYTT+ASVFTGQGKF
Sbjct: 541 VDQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIKVGVAPNIYAYTTMASVFTGQGKF 600

Query: 601 NMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVE+TIN+MV+SGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL
Sbjct: 601 NMVELTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALGP-PPDKNK 720
           IEALAK+GKPRLAYELYM+A NEGLNLSSKIY AVI SSQVYGASID+R LGP PPD+NK
Sbjct: 661 IEALAKDGKPRLAYELYMKANNEGLNLSSKIYDAVIHSSQVYGASIDIRLLGPRPPDENK 712

BLAST of Sgr016624 vs. ExPASy Swiss-Prot
Match: Q9SNB7 (Protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LPE1 PE=1 SV=1)

HSP 1 Score: 772.7 bits (1994), Expect = 3.8e-222
Identity = 390/637 (61.22%), Positives = 485/637 (76.14%), Query Frame = 0

Query: 84  LMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSEDDL---PERSEREGYGGLHW--DD 143
           ++ PK       FL     G  +  +  + V SN +      P+RS      G+ W  + 
Sbjct: 29  VVSPKTTRKRLCFLEQACFGSSSSISSFIFVSSNRKVLFLCEPKRSLLGSSFGVGWATEQ 88

Query: 144 HDNDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQLARTADDVEEVLKDVGELPLQ 203
            + + GE      D     G E + ++RVDV  LA  L+ A+TADDV+ VLKD GELPLQ
Sbjct: 89  RELELGEEEVSTEDLSSANGGE-KNNLRVDVRELAFSLRAAKTADDVDAVLKDKGELPLQ 148

Query: 204 VFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNLFIYNSLLGAIKQSAEFSKMQ 263
           VF +MI+GFG+D+RL+ AV +VDWLKRKK E+ G I PNLFIYNSLLGA++    F + +
Sbjct: 149 VFCAMIKGFGKDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAMR---GFGEAE 208

Query: 264 DVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTPSPVSYSTALRAY 323
            +L DM  EGI  N+VTYNT+M IY+E+G  +KALGIL+   +KG  P+P++YSTAL  Y
Sbjct: 209 KILKDMEEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVY 268

Query: 324 RRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKLENFTKRVCYQVMRIWLVKGD 383
           RR++DG GAL+F +ELRE+Y   EI  D   DW  EF+KLENF  R+CYQVMR WLVK D
Sbjct: 269 RRMEDGMGALEFFVELREKYAKREIGNDVGYDWEFEFVKLENFIGRICYQVMRRWLVKDD 328

Query: 384 NASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNVAKELYYRIREKQSGISLSVC 443
           N +T+VL+LL  MD AG+   R E ERLIWACT  EH+ V KELY RIRE+ S ISLSVC
Sbjct: 329 NWTTRVLKLLNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVC 388

Query: 444 NHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVR 503
           NH+IWLMGKAKKWWAALEIYEDLLD+GP+PNN+SYEL+VSHFN+LL+AA KRGIWRWGVR
Sbjct: 389 NHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVR 448

Query: 504 LLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVEQGEKPTVLSYGALLSALE 563
           LLNKME+KGLKP  R WNAVLVACSKA+ET+AAI+IF+ MV+ GEKPTV+SYGALLSALE
Sbjct: 449 LLNKMEDKGLKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALE 508

Query: 564 KGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKFNMVEVTINEMVSSGIEPTVV 623
           KGKL+DEA  VW+HMIKVG+EPN+YAYTT+ASV TGQ KFN+++  + EM S GIEP+VV
Sbjct: 509 KGKLYDEAFRVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVV 568

Query: 624 TYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPRLAYELYMRA 683
           T+NA+I+GC RNG+S VAYEWFHRMK  N+ PNE++YE+LIEALA + KPRLAYEL+++A
Sbjct: 569 TFNAVISGCARNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHVKA 628

Query: 684 KNEGLNLSSKIYYAVIQSSQVYGASIDVRALGPPPDK 716
           +NEGL LSSK Y AV++S++ YGA+ID+  LGP PDK
Sbjct: 629 QNEGLKLSSKPYDAVVKSAETYGATIDLNLLGPRPDK 661

BLAST of Sgr016624 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 3.7e-31
Identity = 129/545 (23.67%), Positives = 242/545 (44.40%), Query Frame = 0

Query: 146 NSHGGGDFKEEEGMEGEEDVRVDVLALACQLQLARTADDVEEVLKDV------GELP-LQ 205
           NS+G G +         EDV  +        Q+ RT  ++EE  K +      G +P + 
Sbjct: 84  NSNGNGHYSSVNSSFALEDVESN----NHLRQMVRTG-ELEEGFKFLENMVYHGNVPDII 143

Query: 206 VFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNLFIYNSLLGAIKQSAEFSKMQ 265
             +++IRGF R  +   A  +++      LE +G + P++  YN ++    ++ E +   
Sbjct: 144 PCTTLIRGFCRLGKTRKAAKILE-----ILEGSGAV-PDVITYNVMISGYCKAGEINNAL 203

Query: 266 DVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTPSPVSYSTALRAY 325
            VL  M+   +  +VVTYNTI+    + G   +A+ +L+ M ++   P  ++Y+  + A 
Sbjct: 204 SVLDRMS---VSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRDCYPDVITYTILIEAT 263

Query: 326 RRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKLENFTKRVCYQVMRIWLVKGD 385
            R      A+K + E+R+R    ++                     V Y V+   + K +
Sbjct: 264 CRDSGVGHAMKLLDEMRDRGCTPDV---------------------VTYNVLVNGICK-E 323

Query: 386 NASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNVAKELYYRIREKQSGISLSVC 445
               + ++ L +M  +G     +    ++ +         A++L   +  K    S+   
Sbjct: 324 GRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTF 383

Query: 446 NHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVR 505
           N +I  + +      A++I E +   G +PN++SY  ++  F       K++ + R  + 
Sbjct: 384 NILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGF------CKEKKMDR-AIE 443

Query: 506 LLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVEQGEKPTVLSYGALLSALE 565
            L +M  +G  P    +N +L A  K  +   A+EI  ++  +G  P +++Y  ++  L 
Sbjct: 444 YLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLA 503

Query: 566 KGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKFNMVEVTINEMVSSGIEPTVV 625
           K     +A  + D M    ++P+   Y++L    + +GK +      +E    GI P  V
Sbjct: 504 KAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAV 563

Query: 626 TYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPRLAYELYMRA 684
           T+N+I+ G  ++  +  A ++   M  R   PNE SY +LIE LA EG  + A EL    
Sbjct: 564 TFNSIMLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNEL 585

BLAST of Sgr016624 vs. ExPASy Swiss-Prot
Match: O64624 (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 132.5 bits (332), Expect = 2.0e-29
Identity = 95/448 (21.21%), Positives = 188/448 (41.96%), Query Frame = 0

Query: 236 PNLFIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGI 295
           P    YN+LL    ++  +++   VL +M      ++ VTYN +++ Y+  G + +A G+
Sbjct: 314 PGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFSKEAAGV 373

Query: 296 LEEMPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEF 355
           +E M KKG+ P+ ++Y+T + AY +    + ALK    ++E                   
Sbjct: 374 IEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKE------------------- 433

Query: 356 LKLENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEH 415
                     C     + L+   + S +++++L +M   G S  R     ++  C     
Sbjct: 434 ---AGCVPNTCTYNAVLSLLGKKSRSNEMIKMLCDMKSNGCSPNRATWNTMLALCGNKGM 493

Query: 416 HNVAKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYEL 475
                 ++  ++           N +I   G+      A ++Y ++   G       +  
Sbjct: 494 DKFVNRVFREMKSCGFEPDRDTFNTLISAYGRCGSEVDASKMYGEMTRAG-------FNA 553

Query: 476 IVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIF 535
            V+ +N LL A  ++G WR G  +++ M+ KG KP    ++ +L   +K         I 
Sbjct: 554 CVTTYNALLNALARKGDWRSGENVISDMKSKGFKPTETSYSLMLQCYAKGGNYLGIERIE 613

Query: 536 RRMVEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQ 595
            R+ E    P+ +    LL A  K +    +   +    K G +P++  + ++ S+FT  
Sbjct: 614 NRIKEGQIFPSWMLLRTLLLANFKCRALAGSERAFTLFKKHGYKPDMVIFNSMLSIFTRN 673

Query: 596 GKFNMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSY 655
             ++  E  +  +   G+ P +VTYN+++   VR G    A E    ++   + P+ VSY
Sbjct: 674 NMYDQAEGILESIREDGLSPDLVTYNSLMDMYVRRGECWKAEEILKTLEKSQLKPDLVSY 732

Query: 656 ELLIEALAKEGKPRLAYELYMRAKNEGL 684
             +I+   + G  + A  +       G+
Sbjct: 734 NTVIKGFCRRGLMQEAVRMLSEMTERGI 732

BLAST of Sgr016624 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 132.1 bits (331), Expect = 2.6e-29
Identity = 112/495 (22.63%), Positives = 213/495 (43.03%), Query Frame = 0

Query: 172 LACQLQLARTADDVEEVLKDVGELPLQ----VFSSMIRGFGRDRRLECAVVLVDWLKRKK 231
           L   ++  R     E V K++ E  +      ++ +IRGF     ++ A+ L D     K
Sbjct: 176 LDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFD-----K 235

Query: 232 LETNGRIAPNLFIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQG 291
           +ET G + PN+  YN+L+    +  +      +L  MA +G++ N+++YN +++    +G
Sbjct: 236 METKGCL-PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREG 295

Query: 292 LAMKALGILEEMPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDD 351
              +   +L EM ++G +   V+Y+T ++ Y   K+GN                      
Sbjct: 296 RMKEVSFVLTEMNRRGYSLDEVTYNTLIKGY--CKEGN---------------------- 355

Query: 352 NVDWADEFLKLENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLI 411
                                              + L +  EM + GL+   +    LI
Sbjct: 356 ---------------------------------FHQALVMHAEMLRHGLTPSVITYTSLI 415

Query: 412 WACTCAEHHNVAKELYYRIREKQSGISLSVCNHVIWLMGKAKKWW--AALEIYEDLLDKG 471
            +   A + N A E   ++R +  G+  +   +   + G ++K +   A  +  ++ D G
Sbjct: 416 HSMCKAGNMNRAMEFLDQMRVR--GLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNG 475

Query: 472 PKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKA 531
             P+ ++Y       N L+      G     + +L  M+EKGL P    ++ VL    ++
Sbjct: 476 FSPSVVTY-------NALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRS 535

Query: 532 AETSAAIEIFRRMVEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAY 591
            +   A+ + R MVE+G KP  ++Y +L+    + +   EA  +++ M++VG+ P+ + Y
Sbjct: 536 YDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTY 595

Query: 592 TTLASVFTGQGKFNMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKV 651
           T L + +  +G         NEMV  G+ P VVTY+ +I G  +   +  A     ++  
Sbjct: 596 TALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFY 598

Query: 652 RNISPNEVSYELLIE 661
               P++V+Y  LIE
Sbjct: 656 EESVPSDVTYHTLIE 598

BLAST of Sgr016624 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 130.6 bits (327), Expect = 7.6e-29
Identity = 123/528 (23.30%), Positives = 230/528 (43.56%), Query Frame = 0

Query: 197 LQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNLFIYNSLLGAIKQSAEFSK 256
           LQ +SS++ G G+ R ++  + L+     K++ET G + PN++ +   +  + ++ + ++
Sbjct: 223 LQTYSSLMVGLGKRRDIDSVMGLL-----KEMETLG-LKPNVYTFTICIRVLGRAGKINE 282

Query: 257 MQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTPSPVSYSTALR 316
             ++L  M  EG   +VVTY  ++           A  + E+M      P  V+Y T L 
Sbjct: 283 AYEILKRMDDEGCGPDVVTYTVLIDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITLLD 342

Query: 317 AYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDW---ADEFLKLENFTKRV-------- 376
            +   +D +   +F  E+    ++G +   D V +    D   K  NF +          
Sbjct: 343 RFSDNRDLDSVKQFWSEME---KDGHV--PDVVTFTILVDALCKAGNFGEAFDTLDVMRD 402

Query: 377 --------CYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHN 436
                    Y  +   L++        L+L   M+  G+          I     +    
Sbjct: 403 QGILPNLHTYNTLICGLLRVHRLD-DALELFGNMESLGVKPTAYTYIVFIDYYGKSGDSV 462

Query: 437 VAKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIV 496
            A E + +++ K    ++  CN  ++ + KA +   A +I+  L D G  P++++Y    
Sbjct: 463 SALETFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFYGLKDIGLVPDSVTY---- 522

Query: 497 SHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRR 556
              N+++    K G     ++LL++M E G +P     N+++    KA     A ++F R
Sbjct: 523 ---NMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDEAWKMFMR 582

Query: 557 MVEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGK 616
           M E   KPTV++Y  LL+ L K     EA  +++ M++ G  PN   + TL        +
Sbjct: 583 MKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFDCLCKNDE 642

Query: 617 FNMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYEL 676
             +    + +M+  G  P V TYN II G V+NG    A  +FH+MK + + P+ V+   
Sbjct: 643 VTLALKMLFKMMDMGCVPDVFTYNTIIFGLVKNGQVKEAMCFFHQMK-KLVYPDFVTLCT 702

Query: 677 LIEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASID 706
           L+  + K      AY++         +  + +++  +  S +  A ID
Sbjct: 703 LLPGVVKASLIEDAYKIITNFLYNCADQPANLFWEDLIGSILAEAGID 730

BLAST of Sgr016624 vs. ExPASy TrEMBL
Match: A0A6J1DL11 (pentatricopeptide repeat-containing protein At3g46610 OS=Momordica charantia OX=3673 GN=LOC111021510 PE=3 SV=1)

HSP 1 Score: 1306.2 bits (3379), Expect = 0.0e+00
Identity = 646/720 (89.72%), Positives = 678/720 (94.17%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           MHALSNWCPT SS V+LGSSC +  S KR+KCVGFSDCCCGNGGF LISFNLRV RS FC
Sbjct: 1   MHALSNWCPTSSSKVELGSSCVVRRSGKRLKCVGFSDCCCGNGGFSLISFNLRVFRSGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENS+FDC CEF HGCSKL VARLMKPKRNSLGAWFLSAWAV Q T+ NEIVRVESNSED
Sbjct: 61  YENSKFDCSCEFRHGCSKLIVARLMKPKRNSLGAWFLSAWAVEQPTVGNEIVRVESNSED 120

Query: 121 DLPERSEREGYGGLHWDDHDNDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQLAR 180
           DL ERSE EGYGGL WDDH N NGEN HGGGDFK+E+GMEGE DV VDV ALA +LQL R
Sbjct: 121 DLAERSEGEGYGGLDWDDHHNVNGENGHGGGDFKDEDGMEGEGDVWVDVRALAGRLQLTR 180

Query: 181 TADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNLFI 240
           TADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAV LVDWLKRKK+ET+GRIAPNLFI
Sbjct: 181 TADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETDGRIAPNLFI 240

Query: 241 YNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEEMP 300
           YNSLLGA+KQS  FSKM+DVL DMA+EGI SNV+TYNTIMSIYLEQGLAMKALGILEEMP
Sbjct: 241 YNSLLGAVKQSTVFSKMEDVLADMAQEGITSNVITYNTIMSIYLEQGLAMKALGILEEMP 300

Query: 301 KKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKLEN 360
           KKGLTPSPVSYST L+AYRR+KDGNGALKFM ELRE+YR+GE+AKDDNVDWADEF+KLEN
Sbjct: 301 KKGLTPSPVSYSTGLQAYRRMKDGNGALKFMTELREKYRSGEMAKDDNVDWADEFMKLEN 360

Query: 361 FTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNVAK 420
           FTKRVCYQVMRIWLVKG +ASTKVLQLL+EMDKAGLSL R EEERLIWACTCAEHHNVAK
Sbjct: 361 FTKRVCYQVMRIWLVKGYSASTKVLQLLVEMDKAGLSLDRAEEERLIWACTCAEHHNVAK 420

Query: 421 ELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHF 480
           ELYYRIREKQ GISLSVCNHVIWLMGKAKKWWAALEIYEDLL+KGPKPNNMSYELIVSHF
Sbjct: 421 ELYYRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHF 480

Query: 481 NVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVE 540
           NVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVE
Sbjct: 481 NVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVE 540

Query: 541 QGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKFNM 600
           QGEKPT+LSYGALLSALEKGKL+DEARSVWDHMIKVGV+PNIYAYTT+ASVFTGQGKFNM
Sbjct: 541 QGEKPTILSYGALLSALEKGKLYDEARSVWDHMIKVGVKPNIYAYTTMASVFTGQGKFNM 600

Query: 601 VEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIE 660
           VEVTIN+MVSSGIEPTVVTYNAIITGCVRNG+SSVAYEWFHRMKVRNISPNEVSYELLIE
Sbjct: 601 VEVTINDMVSSGIEPTVVTYNAIITGCVRNGLSSVAYEWFHRMKVRNISPNEVSYELLIE 660

Query: 661 ALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALG-PPPDKNKSS 720
           ALAKEGKPRLAYELY+RAKN+ LNLSSK Y AVIQSSQVYGASID+RALG PPPD NKSS
Sbjct: 661 ALAKEGKPRLAYELYLRAKNDSLNLSSKTYDAVIQSSQVYGASIDIRALGSPPPDTNKSS 720

BLAST of Sgr016624 vs. ExPASy TrEMBL
Match: A0A0A0LB88 (PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G595200 PE=3 SV=1)

HSP 1 Score: 1231.5 bits (3185), Expect = 0.0e+00
Identity = 620/722 (85.87%), Positives = 661/722 (91.55%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           MHALSNWCPT  SGV+LGS   +H S KRVK  GFSDC CGN GF LISFNL VLRS FC
Sbjct: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENSRF C CEF HGCSKLRV  LMK  RNSLGA+ LSAWAV Q TID+EI RVESNS D
Sbjct: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120

Query: 121 DLPERSEREGYGGLHWDDHDND--NGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQL 180
            LPER       GL WDD D+   NGENSHGGG FK+E  +EG  DVRVDV ALA QLQL
Sbjct: 121 GLPER-------GLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQL 180

Query: 181 ARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNL 240
           ARTADDV++VLKD+ ELPLQVFSSMIRGFGRDRRLECAV LVDWLKRKK+ETNGRIAPNL
Sbjct: 181 ARTADDVDQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNL 240

Query: 241 FIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGA+KQS E S+M++VLTDMA+EGI SNVVTYNTIMSIYLEQGLAMKALGILEE
Sbjct: 241 FIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEE 300

Query: 301 MPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKL 360
           MPKKGLT SPVSYSTALRAYRR+KDGNGALKFM+ELRERYRNGEIAKDDNVDWA+EFLKL
Sbjct: 301 MPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKL 360

Query: 361 ENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNV 420
           ENFT+RVCYQVMRIWLVKGD ASTKVLQLLMEMDKAGLSL R E ERLIWACTCAEH+NV
Sbjct: 361 ENFTRRVCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNV 420

Query: 421 AKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVS 480
           AKELY+RIREKQ GISLSVCNHVIWLMGKAKKWWAALEIYEDLL+KGPKPNNMSYELIVS
Sbjct: 421 AKELYFRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540
           HFNVLLTAAKKRGIWRWGVRLLNKMEEKGL+PGSREWNAVLVACS+AAETSAAI+IFR+M
Sbjct: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKF 600
           VEQGEKPTVLSYGALLSALEKGKL+DEARSVWDHMI+VGVEPNIYAYTT+ASVFTGQGKF
Sbjct: 541 VEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKF 600

Query: 601 NMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVEVTIN+MV+SGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL
Sbjct: 601 NMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALG-PPPDKNK 720
           IEALAKEGKPRLAYELYMRAK+EGLNLSSK+Y AVI+SSQ+YGAS++++ LG  PPD+NK
Sbjct: 661 IEALAKEGKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNK 715

BLAST of Sgr016624 vs. ExPASy TrEMBL
Match: A0A5D3E368 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold595G00640 PE=3 SV=1)

HSP 1 Score: 1221.8 bits (3160), Expect = 0.0e+00
Identity = 617/722 (85.46%), Positives = 652/722 (90.30%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           MH LSNWCPT  SGV LGS   +H S KR+KC GFSDCCCGN GF LISFNL VL S FC
Sbjct: 1   MHVLSNWCPTSCSGVDLGSYSVVHRSWKRIKCFGFSDCCCGNWGFSLISFNLSVLGSGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENSRF C CEF HG SKLRV  LMKP RNSL AW LSAW V Q TI +E+ RVESNS D
Sbjct: 61  YENSRFVCNCEFRHGYSKLRVVPLMKPNRNSLEAWCLSAWTVEQPTIGDELPRVESNSRD 120

Query: 121 DLPERSEREGYGGLHW--DDHDNDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQL 180
            LPER        L W  DD DN NGENSHGGG FK+E  MEG  DVRVDV ALA QLQL
Sbjct: 121 GLPERR-------LDWDGDDDDNVNGENSHGGGSFKDEGEMEGVGDVRVDVRALAAQLQL 180

Query: 181 ARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNL 240
           ARTADDV++VLKD+ ELPLQVFSSMIRGFGRDRRLECAV LVDWLKRKK+ETNGRIAPNL
Sbjct: 181 ARTADDVDQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNL 240

Query: 241 FIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGA+KQS E SKM++VLT+MA+EGI SNVVTYNTIMSIYLEQGLA KALGILEE
Sbjct: 241 FIYNSLLGAVKQSGELSKMENVLTEMAQEGIVSNVVTYNTIMSIYLEQGLATKALGILEE 300

Query: 301 MPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKL 360
           MPKKGLT SPVSYSTALRAYR++KDGNGAL+FM+ELRERY NGEIAKDDNVDWA+EFLKL
Sbjct: 301 MPKKGLTLSPVSYSTALRAYRKMKDGNGALEFMVELRERYHNGEIAKDDNVDWANEFLKL 360

Query: 361 ENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNV 420
           ENFT+RVCYQVMRIWLVKGD ASTKVLQLLMEMDKAGLSL R EEERLIWACTCAEH+NV
Sbjct: 361 ENFTRRVCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEEERLIWACTCAEHYNV 420

Query: 421 AKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVS 480
           AKELY RIREKQ GISLSVCNHVIWLMGKAKKWWAALEIYE+LL+KGPKPNNMSYELIVS
Sbjct: 421 AKELYIRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEELLEKGPKPNNMSYELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540
           HFNVLLTAAKKRGIWRWGVRLLNKMEEKGL+PGSREWNAVLVACS+AAETSAAI+IFRRM
Sbjct: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRRM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKF 600
           VEQGEKPTVLSYGALLSALEKGKL+DEARSVWDHMI+VGVEPNIYAYTT+ASVFT QGKF
Sbjct: 541 VEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTSQGKF 600

Query: 601 NMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVEVTIN+MV+SGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL
Sbjct: 601 NMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALG-PPPDKNK 720
           IEALAKEGKPRLAYELY RAK+EGLNLSSKIY AVI+SSQ+YGASID+R LG  PPDKNK
Sbjct: 661 IEALAKEGKPRLAYELYRRAKDEGLNLSSKIYDAVIESSQLYGASIDIRLLGLRPPDKNK 715

BLAST of Sgr016624 vs. ExPASy TrEMBL
Match: A0A6J1JQT1 (pentatricopeptide repeat-containing protein At3g46610-like OS=Cucurbita maxima OX=3661 GN=LOC111486640 PE=4 SV=1)

HSP 1 Score: 1209.9 bits (3129), Expect = 0.0e+00
Identity = 606/722 (83.93%), Positives = 657/722 (91.00%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           M  LS+WCP+ SSG++LG S  ++GSRKR+ CVGFS  CCGNGGF LI F+  VLR  FC
Sbjct: 1   MSILSDWCPS-SSGLELGCSSVVNGSRKRINCVGFSG-CCGNGGFSLIPFSSSVLRCGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
           YENS+FDC  EF HGCSKLRVARLMKPKRNSLG WFLSAWAV Q TID EIVRV+SN  D
Sbjct: 61  YENSKFDCNFEFRHGCSKLRVARLMKPKRNSLGVWFLSAWAVEQPTIDGEIVRVQSNCGD 120

Query: 121 DLPERSEREGYGGLHWDDHDND--NGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQL 180
           D PE+S       L WDDHD+D  N ENS+ G  FK+EEG+EGE DV+VDV ALA +L+L
Sbjct: 121 DFPEKS-------LDWDDHDHDTVNSENSN-GRSFKDEEGIEGEGDVKVDVRALAGRLEL 180

Query: 181 ARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNL 240
           ART DDVEEVLKDVGELPLQVFSS+I+GFGRD+RL CA+ LV+WLK +K++TNGRIAPNL
Sbjct: 181 ARTVDDVEEVLKDVGELPLQVFSSLIKGFGRDKRLGCALALVEWLKTRKIKTNGRIAPNL 240

Query: 241 FIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGA+KQS EFSKM+D+L DM++EGI SNVVTYNTIMSIYL+QGL MKAL ILEE
Sbjct: 241 FIYNSLLGAVKQSGEFSKMEDILNDMSQEGIVSNVVTYNTIMSIYLDQGLPMKALDILEE 300

Query: 301 MPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKL 360
           MPKKGLT SPVSYSTALRAYRR+KDGNGALKFM+ELRERYRNGEIAKDDNVDW DEFLKL
Sbjct: 301 MPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWDDEFLKL 360

Query: 361 ENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNV 420
           ENFT+RVCYQVMRIWLVKGD+ASTKVLQLL EMDKAGLSL R EEERL+WACTCAEHHNV
Sbjct: 361 ENFTRRVCYQVMRIWLVKGDSASTKVLQLLTEMDKAGLSLDRAEEERLVWACTCAEHHNV 420

Query: 421 AKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVS 480
           AKELYYRIREK+SGISLSVCNHVIWLMGKAKKWWAALEIYEDLL+KGPKPNN+SYELIVS
Sbjct: 421 AKELYYRIREKKSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNLSYELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540
           HFNVLLTAAK RGIWRWGVRLLNKMEEKGLKPG REWNAVLVACS+AAETS AIEIFRRM
Sbjct: 481 HFNVLLTAAKNRGIWRWGVRLLNKMEEKGLKPGIREWNAVLVACSRAAETSMAIEIFRRM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKF 600
           V+QGEKPTVLSYGALLSALEKGKL+DEARSVWDHMIKVGV PNIYAYTT+ASVFTGQGKF
Sbjct: 541 VDQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIKVGVAPNIYAYTTMASVFTGQGKF 600

Query: 601 NMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVE+TIN+MV+SGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL
Sbjct: 601 NMVELTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALGP-PPDKNK 720
           IEALAK+GKPRLAYELYM+A NEGLNLSSKIY AVI SSQVYGASID+R LGP PPD+NK
Sbjct: 661 IEALAKDGKPRLAYELYMKANNEGLNLSSKIYDAVIHSSQVYGASIDIRLLGPRPPDENK 712

BLAST of Sgr016624 vs. ExPASy TrEMBL
Match: A0A6J1F973 (pentatricopeptide repeat-containing protein At3g46610-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441977 PE=4 SV=1)

HSP 1 Score: 1200.3 bits (3104), Expect = 0.0e+00
Identity = 605/722 (83.80%), Positives = 651/722 (90.17%), Query Frame = 0

Query: 1   MHALSNWCPTGSSGVQLGSSCRLHGSRKRVKCVGFSDCCCGNGGFGLISFNLRVLRSLFC 60
           MH  SNWCP  SSGV+LGS   +H S KR+  VGFSD C GNG   LISFN  VLRS FC
Sbjct: 1   MHVHSNWCPISSSGVELGSYSVVHSSWKRINRVGFSDSCYGNGNLYLISFNFSVLRSGFC 60

Query: 61  YENSRFDCGCEFGHGCSKLRVARLMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSED 120
            E SRF+C  EF HGCSKLRVA LMKPKRNSLGAWFL AWAV Q TID+EI RVESNS D
Sbjct: 61  CETSRFECIREFRHGCSKLRVAPLMKPKRNSLGAWFLFAWAVEQPTIDDEIARVESNSRD 120

Query: 121 DLPERSEREGYGGLHWDDHD--NDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQL 180
           DLPE S       L WD +D  N N ENSHG G+FK+EEGMEGE DVRVDV ALA +LQL
Sbjct: 121 DLPESS-------LDWDVYDPGNVNSENSHGRGNFKDEEGMEGEGDVRVDVRALARRLQL 180

Query: 181 ARTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNL 240
           ARTADDVEE+LKDVG LPLQVFSS+IRGFGR+RRLECAV LV+WLK+KK+ETNGRIAPNL
Sbjct: 181 ARTADDVEELLKDVGVLPLQVFSSIIRGFGRNRRLECAVALVEWLKKKKIETNGRIAPNL 240

Query: 241 FIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGA+KQS EFSKM+DVLTDMA+EGI SNVVTYNTIMSIYLEQGLA+KALGILEE
Sbjct: 241 FIYNSLLGAVKQSGEFSKMEDVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAIKALGILEE 300

Query: 301 MPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKL 360
           MP+KGLTP PVSYSTAL+AYRR+ DGNGALKFMIELRERYRNGE+ KDDNVDWAD+FLKL
Sbjct: 301 MPRKGLTPCPVSYSTALQAYRRMNDGNGALKFMIELRERYRNGELVKDDNVDWADKFLKL 360

Query: 361 ENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNV 420
           E FT+RVCYQVMRIWLVK D A+TKVLQLLMEMDKAGLSL RVEEERLIWACTCAEHHNV
Sbjct: 361 EKFTRRVCYQVMRIWLVKDDPANTKVLQLLMEMDKAGLSLDRVEEERLIWACTCAEHHNV 420

Query: 421 AKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVS 480
           AKELYYRIREKQ  ISLSVCNHVIWL GKAKKWWAALEIYEDLL+KGPKPNN+S ELIVS
Sbjct: 421 AKELYYRIREKQCSISLSVCNHVIWLTGKAKKWWAALEIYEDLLEKGPKPNNLSNELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540
           HFNVLLTAAKKRGIWRWGVRLL+KMEEKGLKPG REWNAVLVACS+AAETSAAI+IFRRM
Sbjct: 481 HFNVLLTAAKKRGIWRWGVRLLDKMEEKGLKPGIREWNAVLVACSRAAETSAAIDIFRRM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKF 600
           VE+GEKPTVLSYGALLSALEKGKL+DEARSVWDHMIKVGVEPNIYAYTT+ S+FTGQGKF
Sbjct: 541 VEKGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIKVGVEPNIYAYTTMTSIFTGQGKF 600

Query: 601 NMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVEVT+N+MV+SGIEPTVVTYNAIITGCVRNGMS+VAYEWFHRMK RNISP+EVSYELL
Sbjct: 601 NMVEVTLNDMVTSGIEPTVVTYNAIITGCVRNGMSTVAYEWFHRMKARNISPDEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKNEGLNLSSKIYYAVIQSSQVYGASIDVRALGPPP-DKNK 720
           +EALAKEGKPRLAYELY+ AK+EGLNLSSKIY AVIQSSQV+GASID+R LGP P +KNK
Sbjct: 661 VEALAKEGKPRLAYELYLSAKDEGLNLSSKIYDAVIQSSQVHGASIDIRLLGPRPLEKNK 715

BLAST of Sgr016624 vs. TAIR 10
Match: AT3G46610.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 772.7 bits (1994), Expect = 2.7e-223
Identity = 390/637 (61.22%), Positives = 485/637 (76.14%), Query Frame = 0

Query: 84  LMKPKRNSLGAWFLSAWAVGQQTIDNEIVRVESNSEDDL---PERSEREGYGGLHW--DD 143
           ++ PK       FL     G  +  +  + V SN +      P+RS      G+ W  + 
Sbjct: 29  VVSPKTTRKRLCFLEQACFGSSSSISSFIFVSSNRKVLFLCEPKRSLLGSSFGVGWATEQ 88

Query: 144 HDNDNGENSHGGGDFKEEEGMEGEEDVRVDVLALACQLQLARTADDVEEVLKDVGELPLQ 203
            + + GE      D     G E + ++RVDV  LA  L+ A+TADDV+ VLKD GELPLQ
Sbjct: 89  RELELGEEEVSTEDLSSANGGE-KNNLRVDVRELAFSLRAAKTADDVDAVLKDKGELPLQ 148

Query: 204 VFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNLFIYNSLLGAIKQSAEFSKMQ 263
           VF +MI+GFG+D+RL+ AV +VDWLKRKK E+ G I PNLFIYNSLLGA++    F + +
Sbjct: 149 VFCAMIKGFGKDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAMR---GFGEAE 208

Query: 264 DVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTPSPVSYSTALRAY 323
            +L DM  EGI  N+VTYNT+M IY+E+G  +KALGIL+   +KG  P+P++YSTAL  Y
Sbjct: 209 KILKDMEEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVY 268

Query: 324 RRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKLENFTKRVCYQVMRIWLVKGD 383
           RR++DG GAL+F +ELRE+Y   EI  D   DW  EF+KLENF  R+CYQVMR WLVK D
Sbjct: 269 RRMEDGMGALEFFVELREKYAKREIGNDVGYDWEFEFVKLENFIGRICYQVMRRWLVKDD 328

Query: 384 NASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNVAKELYYRIREKQSGISLSVC 443
           N +T+VL+LL  MD AG+   R E ERLIWACT  EH+ V KELY RIRE+ S ISLSVC
Sbjct: 329 NWTTRVLKLLNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVC 388

Query: 444 NHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVR 503
           NH+IWLMGKAKKWWAALEIYEDLLD+GP+PNN+SYEL+VSHFN+LL+AA KRGIWRWGVR
Sbjct: 389 NHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVR 448

Query: 504 LLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVEQGEKPTVLSYGALLSALE 563
           LLNKME+KGLKP  R WNAVLVACSKA+ET+AAI+IF+ MV+ GEKPTV+SYGALLSALE
Sbjct: 449 LLNKMEDKGLKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALE 508

Query: 564 KGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKFNMVEVTINEMVSSGIEPTVV 623
           KGKL+DEA  VW+HMIKVG+EPN+YAYTT+ASV TGQ KFN+++  + EM S GIEP+VV
Sbjct: 509 KGKLYDEAFRVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVV 568

Query: 624 TYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPRLAYELYMRA 683
           T+NA+I+GC RNG+S VAYEWFHRMK  N+ PNE++YE+LIEALA + KPRLAYEL+++A
Sbjct: 569 TFNAVISGCARNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHVKA 628

Query: 684 KNEGLNLSSKIYYAVIQSSQVYGASIDVRALGPPPDK 716
           +NEGL LSSK Y AV++S++ YGA+ID+  LGP PDK
Sbjct: 629 QNEGLKLSSKPYDAVVKSAETYGATIDLNLLGPRPDK 661

BLAST of Sgr016624 vs. TAIR 10
Match: AT5G14350.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 164.5 bits (415), Expect = 3.4e-40
Identity = 91/200 (45.50%), Positives = 120/200 (60.00%), Query Frame = 0

Query: 447 KAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEK 506
           K +   +ALE+YEDLLD+GP+PNN+SYE                      +RL  ++  K
Sbjct: 95  KLRNGGSALEMYEDLLDEGPEPNNLSYE---------------------PMRL--QLRPK 154

Query: 507 GLKPGSREWNAVLVACSKAAETSAAIEIFRRMVEQGEKPTVLSYGALLSALEKGKLFDEA 566
            +K    +W                              TV S+GALLSALEKGKL+DE 
Sbjct: 155 SIK----QW----------------------------LTTVKSHGALLSALEKGKLYDEV 214

Query: 567 RSVWDHMIKVGVEPNIYAYTTLASVFTGQGKFNMVEVTINEMVSSG-IEPTVVTYNAIIT 626
             VW+HM+KVG+EPN+YAYTT+ASV TGQ K N+++  + EM S G I+P+VVTYNA+I+
Sbjct: 215 LRVWNHMVKVGIEPNLYAYTTMASVLTGQQKLNLLDTLLKEMPSKGIIKPSVVTYNAVIS 239

Query: 627 GCVRNGMSSVAYEWFHRMKV 646
           GC RNG+S VAYEWFHRM++
Sbjct: 275 GCTRNGLSGVAYEWFHRMRI 239

BLAST of Sgr016624 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 138.3 bits (347), Expect = 2.6e-32
Identity = 129/545 (23.67%), Positives = 242/545 (44.40%), Query Frame = 0

Query: 146 NSHGGGDFKEEEGMEGEEDVRVDVLALACQLQLARTADDVEEVLKDV------GELP-LQ 205
           NS+G G +         EDV  +        Q+ RT  ++EE  K +      G +P + 
Sbjct: 84  NSNGNGHYSSVNSSFALEDVESN----NHLRQMVRTG-ELEEGFKFLENMVYHGNVPDII 143

Query: 206 VFSSMIRGFGRDRRLECAVVLVDWLKRKKLETNGRIAPNLFIYNSLLGAIKQSAEFSKMQ 265
             +++IRGF R  +   A  +++      LE +G + P++  YN ++    ++ E +   
Sbjct: 144 PCTTLIRGFCRLGKTRKAAKILE-----ILEGSGAV-PDVITYNVMISGYCKAGEINNAL 203

Query: 266 DVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTPSPVSYSTALRAY 325
            VL  M+   +  +VVTYNTI+    + G   +A+ +L+ M ++   P  ++Y+  + A 
Sbjct: 204 SVLDRMS---VSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRDCYPDVITYTILIEAT 263

Query: 326 RRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEFLKLENFTKRVCYQVMRIWLVKGD 385
            R      A+K + E+R+R    ++                     V Y V+   + K +
Sbjct: 264 CRDSGVGHAMKLLDEMRDRGCTPDV---------------------VTYNVLVNGICK-E 323

Query: 386 NASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEHHNVAKELYYRIREKQSGISLSVC 445
               + ++ L +M  +G     +    ++ +         A++L   +  K    S+   
Sbjct: 324 GRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTF 383

Query: 446 NHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVR 505
           N +I  + +      A++I E +   G +PN++SY  ++  F       K++ + R  + 
Sbjct: 384 NILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGF------CKEKKMDR-AIE 443

Query: 506 LLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRMVEQGEKPTVLSYGALLSALE 565
            L +M  +G  P    +N +L A  K  +   A+EI  ++  +G  P +++Y  ++  L 
Sbjct: 444 YLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLA 503

Query: 566 KGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQGKFNMVEVTINEMVSSGIEPTVV 625
           K     +A  + D M    ++P+   Y++L    + +GK +      +E    GI P  V
Sbjct: 504 KAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAV 563

Query: 626 TYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPRLAYELYMRA 684
           T+N+I+ G  ++  +  A ++   M  R   PNE SY +LIE LA EG  + A EL    
Sbjct: 564 TFNSIMLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNEL 585

BLAST of Sgr016624 vs. TAIR 10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 132.5 bits (332), Expect = 1.4e-30
Identity = 95/448 (21.21%), Positives = 188/448 (41.96%), Query Frame = 0

Query: 236 PNLFIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQGLAMKALGI 295
           P    YN+LL    ++  +++   VL +M      ++ VTYN +++ Y+  G + +A G+
Sbjct: 314 PGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFSKEAAGV 373

Query: 296 LEEMPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDDNVDWADEF 355
           +E M KKG+ P+ ++Y+T + AY +    + ALK    ++E                   
Sbjct: 374 IEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKE------------------- 433

Query: 356 LKLENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLIWACTCAEH 415
                     C     + L+   + S +++++L +M   G S  R     ++  C     
Sbjct: 434 ---AGCVPNTCTYNAVLSLLGKKSRSNEMIKMLCDMKSNGCSPNRATWNTMLALCGNKGM 493

Query: 416 HNVAKELYYRIREKQSGISLSVCNHVIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYEL 475
                 ++  ++           N +I   G+      A ++Y ++   G       +  
Sbjct: 494 DKFVNRVFREMKSCGFEPDRDTFNTLISAYGRCGSEVDASKMYGEMTRAG-------FNA 553

Query: 476 IVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIF 535
            V+ +N LL A  ++G WR G  +++ M+ KG KP    ++ +L   +K         I 
Sbjct: 554 CVTTYNALLNALARKGDWRSGENVISDMKSKGFKPTETSYSLMLQCYAKGGNYLGIERIE 613

Query: 536 RRMVEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAYTTLASVFTGQ 595
            R+ E    P+ +    LL A  K +    +   +    K G +P++  + ++ S+FT  
Sbjct: 614 NRIKEGQIFPSWMLLRTLLLANFKCRALAGSERAFTLFKKHGYKPDMVIFNSMLSIFTRN 673

Query: 596 GKFNMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSY 655
             ++  E  +  +   G+ P +VTYN+++   VR G    A E    ++   + P+ VSY
Sbjct: 674 NMYDQAEGILESIREDGLSPDLVTYNSLMDMYVRRGECWKAEEILKTLEKSQLKPDLVSY 732

Query: 656 ELLIEALAKEGKPRLAYELYMRAKNEGL 684
             +I+   + G  + A  +       G+
Sbjct: 734 NTVIKGFCRRGLMQEAVRMLSEMTERGI 732

BLAST of Sgr016624 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 132.1 bits (331), Expect = 1.9e-30
Identity = 112/495 (22.63%), Positives = 213/495 (43.03%), Query Frame = 0

Query: 172 LACQLQLARTADDVEEVLKDVGELPLQ----VFSSMIRGFGRDRRLECAVVLVDWLKRKK 231
           L   ++  R     E V K++ E  +      ++ +IRGF     ++ A+ L D     K
Sbjct: 176 LDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFD-----K 235

Query: 232 LETNGRIAPNLFIYNSLLGAIKQSAEFSKMQDVLTDMAREGIDSNVVTYNTIMSIYLEQG 291
           +ET G + PN+  YN+L+    +  +      +L  MA +G++ N+++YN +++    +G
Sbjct: 236 METKGCL-PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREG 295

Query: 292 LAMKALGILEEMPKKGLTPSPVSYSTALRAYRRLKDGNGALKFMIELRERYRNGEIAKDD 351
              +   +L EM ++G +   V+Y+T ++ Y   K+GN                      
Sbjct: 296 RMKEVSFVLTEMNRRGYSLDEVTYNTLIKGY--CKEGN---------------------- 355

Query: 352 NVDWADEFLKLENFTKRVCYQVMRIWLVKGDNASTKVLQLLMEMDKAGLSLGRVEEERLI 411
                                              + L +  EM + GL+   +    LI
Sbjct: 356 ---------------------------------FHQALVMHAEMLRHGLTPSVITYTSLI 415

Query: 412 WACTCAEHHNVAKELYYRIREKQSGISLSVCNHVIWLMGKAKKWW--AALEIYEDLLDKG 471
            +   A + N A E   ++R +  G+  +   +   + G ++K +   A  +  ++ D G
Sbjct: 416 HSMCKAGNMNRAMEFLDQMRVR--GLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNG 475

Query: 472 PKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKA 531
             P+ ++Y       N L+      G     + +L  M+EKGL P    ++ VL    ++
Sbjct: 476 FSPSVVTY-------NALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRS 535

Query: 532 AETSAAIEIFRRMVEQGEKPTVLSYGALLSALEKGKLFDEARSVWDHMIKVGVEPNIYAY 591
            +   A+ + R MVE+G KP  ++Y +L+    + +   EA  +++ M++VG+ P+ + Y
Sbjct: 536 YDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTY 595

Query: 592 TTLASVFTGQGKFNMVEVTINEMVSSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKV 651
           T L + +  +G         NEMV  G+ P VVTY+ +I G  +   +  A     ++  
Sbjct: 596 TALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFY 598

Query: 652 RNISPNEVSYELLIE 661
               P++V+Y  LIE
Sbjct: 656 EESVPSDVTYHTLIE 598
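
The percentage in each HSP statistics line is the match count divided by the alignment length. The following is a minimal sketch of how one might re-derive it when checking a record like this one. The helper name `identity_percent` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Hypothetical pattern for the "Identity = matches/length (percent%)" field
# as it is laid out in the HSP statistics lines of this record.
IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")


def identity_percent(line):
    """Return (recomputed, reported) percent identity for one HSP statistics line."""
    m = IDENTITY_RE.search(line)
    if m is None:
        raise ValueError("no Identity field found")
    matches = int(m.group(1))
    length = int(m.group(2))
    reported = float(m.group(3))
    # Recompute the percentage from the raw counts, rounded to two decimals.
    recomputed = round(100.0 * matches / length, 2)
    return recomputed, reported


# Example using the counts reported for the AT5G39710.1 hit.
recomputed, reported = identity_percent("Identity = 112/495 (22.63%)")
print(recomputed, reported)
```

A mismatch between the two returned values would point to a transcription problem in the statistics line rather than in the alignment itself.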

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022154192.1  0.0e+00   89.72         pentatricopeptide repeat-containing protein At3g46610 [Momordica charantia]
XP_038898205.1  0.0e+00   87.14         protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic [Benincasa hispida]
XP_011651578.1  0.0e+00   85.87         protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic [Cucumis sativus]
KAA0066960.1    0.0e+00   85.46         pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK30239...
XP_022989603.1  0.0e+00   83.93         pentatricopeptide repeat-containing protein At3g46610-like [Cucurbita maxima]

Match Name      E-value   Identity (%)  Description
Q9SNB7          3.8e-222  61.22         Protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic OS=Arabidopsis thaliana O...
Q3EDF8          3.7e-31   23.67         Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX...
O64624          2.0e-29   21.21         Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop...
Q9FIX3          2.6e-29   22.63         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9SZ52          7.6e-29   23.30         Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop...

Match Name      E-value   Identity (%)  Description
A0A6J1DL11      0.0e+00   89.72         pentatricopeptide repeat-containing protein At3g46610 OS=Momordica charantia OX=...
A0A0A0LB88      0.0e+00   85.87         PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G595200 PE...
A0A5D3E368      0.0e+00   85.46         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1JQT1      0.0e+00   83.93         pentatricopeptide repeat-containing protein At3g46610-like OS=Cucurbita maxima O...
A0A6J1F973      0.0e+00   83.80         pentatricopeptide repeat-containing protein At3g46610-like isoform X1 OS=Cucurbi...

Match Name      E-value   Identity (%)  Description
AT3G46610.1     2.7e-223  61.22         Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G14350.1     3.4e-40   45.50         Pentatricopeptide repeat (PPR) superfamily protein
AT1G09900.1     2.6e-32   23.67         Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G18940.1     1.4e-30   21.21         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G39710.1     1.9e-30   22.63         Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source       Source Term  Source Description               Alignment
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041      PPR_2                            coord: 615..664; e-value: 1.5E-14; score: 53.9
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041      PPR_2                            coord: 272..318; e-value: 5.7E-12; score: 45.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756                        coord: 583..617; e-value: 0.0018; score: 16.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756                        coord: 515..546; e-value: 3.7E-5; score: 21.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756                        coord: 549..582; e-value: 8.5E-6; score: 23.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756                        coord: 618..652; e-value: 7.0E-9; score: 33.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756                        coord: 274..307; e-value: 1.1E-5; score: 23.2
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR                              coord: 651..685; score: 9.766576
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR                              coord: 272..306; score: 11.805378
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR                              coord: 616..650; score: 11.564229
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR                              coord: 511..545; score: 9.930995
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR                              coord: 581..615; score: 9.317163
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR                              coord: 476..510; score: 8.53891
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR                              coord: 546..580; score: 10.676364
IPR033443  Pentacotripeptide-repeat region of PRORP           PFAM         PF17177      PPR_long                         coord: 447..596; e-value: 2.0E-10; score: 40.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain  coord: 525..744; e-value: 8.4E-44; score: 152.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain  coord: 181..376; e-value: 4.6E-25; score: 90.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain  coord: 377..524; e-value: 5.3E-17; score: 64.2
None       No IPR available                                   MOBIDB_LITE  mobidb-lite  disorder_prediction              coord: 116..155
None       No IPR available                                   MOBIDB_LITE  mobidb-lite  disorder_prediction              coord: 116..158
None       No IPR available                                   PANTHER      PTHR47940    OS12G0283900 PROTEIN             coord: 4..729

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr016624.1   Sgr016624.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding