Spg031939 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg031939
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Location: scaffold11: 43290512 .. 43292305 (+)
RNA-Seq Expression: Spg031939
Synteny: Spg031939
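The Location field packs a scaffold name, a start, an end, and a strand into one string. The sketch below shows one way to split it apart and compute the feature length. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this), and the function names `parse_location` and `span_length` are illustrative, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'scaffold11: 43290512 .. 43292305 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return seqid, start, end, strand

def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("scaffold11: 43290512 .. 43292305 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 1794 positions, which is a multiple of three.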
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTGTGCTTTTCGAGATGTGCACGCAACCTGTTTGTCAAGAGTCCCCAAAGAAAATTTTATACCATAGCTCAAAAGATCGATGCCATTTCAACCAAAACACAAACGTATTACGAAGACCCAGTTGGATTTTACGCTCAACGAGAGAACGTCATAGCTTGGACGTCTAAAATCACCAATTTGGTAAGAACAGGTCAGCCAGATTCTGCTTTTGGCCTCTTCAAGACGATGTTCGCAAATGGGCACAGGCCAAATCATGTGACAATACTGAGCGTAATGAGAGCCATCGACGCATCGAGCTGGGAATCAACGATCGAGGCGATGCATGGAGGAGTGATCAAAATGGGCTTTGAATCAGAAGTGGCAGTTTCAACAGCTCTTCTTGGAGTGTATTCAATGCTTGATATTGGAATCGTTTGGAAGTTGTTTTATCAGATACCTTGTAAGGATGTTGTTTTGTGGAGTGCCATGATCTCAGCCTGCGTGAAAAATGGTCAGTATATTGAAGCATTTGATCTTATCAGAGAGATGCAATATGAAGGTGTTCAACCAAACCATGTAAGTATTGTAAGTATTCTACCTGCTTGTGCTGATTCTGGTGCTCTGTGTTTGGGTAAAGAGATACATGGGTTCTCAATGAGAAGGGATTTTTATTCTTTGGTTAACATTCAGAATTCACTCATGGATATGTATTCAAAATGTAGAAATCTCGAGGCATCCATTAGAGTTTTGAAGATGATGAGGAAAAAGGACATGGTATCATGGAGAACTATAACTCATGCGTGTATCCAAAACAATAATCCTAGTAAAGCGTTTAAAATTTTCTCTAGGATGAAGTTTTCCGGATTCGAATTAGGCGAAACGATGATGCTCGACATCATAGCTGCAGTATTACTAGTTGAAGAGCTTTTACTTGGCTTGGCTGTTCATTGTTATGCACTGAAAAGTGGTTTTCTCTGTTTCATTTCAGTTGGAACTGAACTTCTCCAAATGTATGCTAAATTTGGTGATTTGGGATTGGCCAAACTTATATTTGACGACCTTGTTGACAAAGATATCATTGCGTGGAGTGCAATGATCTCAGCTTACTCCCATGGTGAAGATCCACTTAATGCCATCCAGACATTTAAAATGATGCAGTCAACTAATGAAAAGGCTAATGAGATAACTCTTGTAAGTGTAATGAATGCTTGTTCTTCTTTGGGGGCTCAGGAACTTGGAGGAAGCATACAAGCTCATATAACAAAATCCGGATACTCATCTAATACTCATTTGATGTCAGCATTGGTTGATTTTTACTGCACACTAGGGAGGATAAAGCTAGGGAAACATGTTTTCGATGAGATTTCGACGAAGGATTTAATTTGTTGGGGTGCTATGATTAAAGGGTATGGAATGAATGGCTGGGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCCAATGGAGTGCTCTTCCTCTCTCTTTTATCTGCTTGTGCTCAGTGTGGGCTGGAAAAGGAAGGTTGGATGTGGTTTCATTCAATGGTTGACAAGTATAATGTTACTCCAACAGTGGCTCATTATGCTTGTATGGTGGATTTGCTTGCTAGGAAAGGAAAAACTAGAGAAGCTGTTGAATTTGTGAAGAAAATACTAGTAGAATCTGATACAAGAATCTGGGGTGCTCTTTTTTCTGGTTGTAAATCAACTGATATTGCAGATTCAATTGTTGAACAGCTCACTGCTTTAAAACAAAACAATTCTGACTTTTATGCAATGTTGCTCAACTTTTAG

mRNA sequence

ATGCTGTGCTTTTCGAGATGTGCACGCAACCTGTTTGTCAAGAGTCCCCAAAGAAAATTTTATACCATAGCTCAAAAGATCGATGCCATTTCAACCAAAACACAAACGTATTACGAAGACCCAGTTGGATTTTACGCTCAACGAGAGAACGTCATAGCTTGGACGTCTAAAATCACCAATTTGGTAAGAACAGGTCAGCCAGATTCTGCTTTTGGCCTCTTCAAGACGATGTTCGCAAATGGGCACAGGCCAAATCATGTGACAATACTGAGCGTAATGAGAGCCATCGACGCATCGAGCTGGGAATCAACGATCGAGGCGATGCATGGAGGAGTGATCAAAATGGGCTTTGAATCAGAAGTGGCAGTTTCAACAGCTCTTCTTGGAGTGTATTCAATGCTTGATATTGGAATCGTTTGGAAGTTGTTTTATCAGATACCTTGTAAGGATGTTGTTTTGTGGAGTGCCATGATCTCAGCCTGCGTGAAAAATGGTCAGTATATTGAAGCATTTGATCTTATCAGAGAGATGCAATATGAAGGTGTTCAACCAAACCATGTAAGTATTGTAAGTATTCTACCTGCTTGTGCTGATTCTGGTGCTCTGTGTTTGGGTAAAGAGATACATGGGTTCTCAATGAGAAGGGATTTTTATTCTTTGGTTAACATTCAGAATTCACTCATGGATATGTATTCAAAATGTAGAAATCTCGAGGCATCCATTAGAGTTTTGAAGATGATGAGGAAAAAGGACATGGTATCATGGAGAACTATAACTCATGCGTGTATCCAAAACAATAATCCTAGTAAAGCGTTTAAAATTTTCTCTAGGATGAAGTTTTCCGGATTCGAATTAGGCGAAACGATGATGCTCGACATCATAGCTGCAGTATTACTAGTTGAAGAGCTTTTACTTGGCTTGGCTGTTCATTGTTATGCACTGAAAAGTGGTTTTCTCTGTTTCATTTCAGTTGGAACTGAACTTCTCCAAATGTATGCTAAATTTGGTGATTTGGGATTGGCCAAACTTATATTTGACGACCTTGTTGACAAAGATATCATTGCGTGGAGTGCAATGATCTCAGCTTACTCCCATGGTGAAGATCCACTTAATGCCATCCAGACATTTAAAATGATGCAGTCAACTAATGAAAAGGCTAATGAGATAACTCTTGTAAGTGTAATGAATGCTTGTTCTTCTTTGGGGGCTCAGGAACTTGGAGGAAGCATACAAGCTCATATAACAAAATCCGGATACTCATCTAATACTCATTTGATGTCAGCATTGGTTGATTTTTACTGCACACTAGGGAGGATAAAGCTAGGGAAACATGTTTTCGATGAGATTTCGACGAAGGATTTAATTTGTTGGGGTGCTATGATTAAAGGGTATGGAATGAATGGCTGGGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCCAATGGAGTGCTCTTCCTCTCTCTTTTATCTGCTTGTGCTCAGTGTGGGCTGGAAAAGGAAGGTTGGATGTGGTTTCATTCAATGGTTGACAAGTATAATGTTACTCCAACAGTGGCTCATTATGCTTGTATGGTGGATTTGCTTGCTAGGAAAGGAAAAACTAGAGAAGCTGTTGAATTTGTGAAGAAAATACTAGTAGAATCTGATACAAGAATCTGGGGTGCTCTTTTTTCTGGTTGTAAATCAACTGATATTGCAGATTCAATTGTTGAACAGCTCACTGCTTTAAAACAAAACAATTCTGACTTTTATGCAATGTTGCTCAACTTTTAG

Coding sequence (CDS)

ATGCTGTGCTTTTCGAGATGTGCACGCAACCTGTTTGTCAAGAGTCCCCAAAGAAAATTTTATACCATAGCTCAAAAGATCGATGCCATTTCAACCAAAACACAAACGTATTACGAAGACCCAGTTGGATTTTACGCTCAACGAGAGAACGTCATAGCTTGGACGTCTAAAATCACCAATTTGGTAAGAACAGGTCAGCCAGATTCTGCTTTTGGCCTCTTCAAGACGATGTTCGCAAATGGGCACAGGCCAAATCATGTGACAATACTGAGCGTAATGAGAGCCATCGACGCATCGAGCTGGGAATCAACGATCGAGGCGATGCATGGAGGAGTGATCAAAATGGGCTTTGAATCAGAAGTGGCAGTTTCAACAGCTCTTCTTGGAGTGTATTCAATGCTTGATATTGGAATCGTTTGGAAGTTGTTTTATCAGATACCTTGTAAGGATGTTGTTTTGTGGAGTGCCATGATCTCAGCCTGCGTGAAAAATGGTCAGTATATTGAAGCATTTGATCTTATCAGAGAGATGCAATATGAAGGTGTTCAACCAAACCATGTAAGTATTGTAAGTATTCTACCTGCTTGTGCTGATTCTGGTGCTCTGTGTTTGGGTAAAGAGATACATGGGTTCTCAATGAGAAGGGATTTTTATTCTTTGGTTAACATTCAGAATTCACTCATGGATATGTATTCAAAATGTAGAAATCTCGAGGCATCCATTAGAGTTTTGAAGATGATGAGGAAAAAGGACATGGTATCATGGAGAACTATAACTCATGCGTGTATCCAAAACAATAATCCTAGTAAAGCGTTTAAAATTTTCTCTAGGATGAAGTTTTCCGGATTCGAATTAGGCGAAACGATGATGCTCGACATCATAGCTGCAGTATTACTAGTTGAAGAGCTTTTACTTGGCTTGGCTGTTCATTGTTATGCACTGAAAAGTGGTTTTCTCTGTTTCATTTCAGTTGGAACTGAACTTCTCCAAATGTATGCTAAATTTGGTGATTTGGGATTGGCCAAACTTATATTTGACGACCTTGTTGACAAAGATATCATTGCGTGGAGTGCAATGATCTCAGCTTACTCCCATGGTGAAGATCCACTTAATGCCATCCAGACATTTAAAATGATGCAGTCAACTAATGAAAAGGCTAATGAGATAACTCTTGTAAGTGTAATGAATGCTTGTTCTTCTTTGGGGGCTCAGGAACTTGGAGGAAGCATACAAGCTCATATAACAAAATCCGGATACTCATCTAATACTCATTTGATGTCAGCATTGGTTGATTTTTACTGCACACTAGGGAGGATAAAGCTAGGGAAACATGTTTTCGATGAGATTTCGACGAAGGATTTAATTTGTTGGGGTGCTATGATTAAAGGGTATGGAATGAATGGCTGGGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCCAATGGAGTGCTCTTCCTCTCTCTTTTATCTGCTTGTGCTCAGTGTGGGCTGGAAAAGGAAGGTTGGATGTGGTTTCATTCAATGGTTGACAAGTATAATGTTACTCCAACAGTGGCTCATTATGCTTGTATGGTGGATTTGCTTGCTAGGAAAGGAAAAACTAGAGAAGCTGTTGAATTTGTGAAGAAAATACTAGTAGAATCTGATACAAGAATCTGGGGTGCTCTTTTTTCTGGTTGTAAATCAACTGATATTGCAGATTCAATTGTTGAACAGCTCACTGCTTTAAAACAAAACAATTCTGACTTTTATGCAATGTTGCTCAACTTTTAG

Protein sequence

MLCFSRCARNLFVKSPQRKFYTIAQKIDAISTKTQTYYEDPVGFYAQRENVIAWTSKITNLVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFESEVAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQYEGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGCKSTDIADSIVEQLTALKQNNSDFYAMLLNF
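
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the relationship can be reproduced as follows. The `translate` helper is hypothetical and written here only for illustration; unknown codons are reported as `X`.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 nucleotides of the CDS shown above.
print(translate("ATGCTGTGCTTTTCGAGATGTGCACGCAACCTG"))  # MLCFSRCARNL
```

The same function applied to the last twelve nucleotides of the CDS, `CTCAACTTTTAG`, yields `LNF`, matching the C-terminus of the protein sequence.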
Homology
BLAST of Spg031939 vs. NCBI nr
Match: XP_022147491.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic isoform X1 [Momordica charantia] >XP_022147492.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 1021.5 bits (2640), Expect = 2.9e-294
Identity = 510/602 (84.72%), Positives = 547/602 (90.86%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAIST-KTQTYYEDPVGFYAQRENVIAWTSKIT 60
           MLCFSRCARNLFVKSPQRK Y I   ++A S+ K Q YYEDPVGFYAQRE+VI+WTSKIT
Sbjct: 1   MLCFSRCARNLFVKSPQRKNYMIDPMVEATSSNKKQAYYEDPVGFYAQREDVISWTSKIT 60

Query: 61  NLVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFES 120
           NLVRTGQ ++AFG FKTMFANGHRPNHVT+LSV+RAIDA SWES  E +HGGVIKMGFES
Sbjct: 61  NLVRTGQSEAAFGFFKTMFANGHRPNHVTMLSVIRAIDALSWESMNEVVHGGVIKMGFES 120

Query: 121 EVAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQY 180
           EVAVSTALLG YS  DIG VWKLFYQIP KD+VLWSAMISACVKNGQ+IEA DL REMQY
Sbjct: 121 EVAVSTALLGFYSNRDIGTVWKLFYQIPYKDIVLWSAMISACVKNGQFIEALDLFREMQY 180

Query: 181 EGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA 240
           +GVQPNHVSIVS+LPACADSG + LGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA
Sbjct: 181 QGVQPNHVSIVSVLPACADSGVISLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA 240

Query: 241 SIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLL 300
           SIRVLKMMRKKDMVSWRT+T+ACIQNN PSKAFKIFSRM+  GFELG+TMML IIAAVLL
Sbjct: 241 SIRVLKMMRKKDMVSWRTVTNACIQNNCPSKAFKIFSRMRSFGFELGKTMMLAIIAAVLL 300

Query: 301 VEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAM 360
           VEELLLGLAVHCYALK GFLCFI+VGTE+LQMYAKFG LGLAKLIFD+LVDKDIIAWSAM
Sbjct: 301 VEELLLGLAVHCYALKGGFLCFIAVGTEILQMYAKFGHLGLAKLIFDELVDKDIIAWSAM 360

Query: 361 ISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGY 420
           ISAYSHGEDPLNAIQTFKMMQSTNEK NEIT VS++NACSSLGAQELG SI AHI KSG 
Sbjct: 361 ISAYSHGEDPLNAIQTFKMMQSTNEKPNEITFVSLVNACSSLGAQELGESIHAHIMKSGC 420

Query: 421 SSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSD 480
           SSNTHLMSA VD YCTLGRIK GKHVFDEISTKDLICW  MIKGYGMNG GNEAL+TFSD
Sbjct: 421 SSNTHLMSAFVDLYCTLGRIKQGKHVFDEISTKDLICWSTMIKGYGMNGCGNEALDTFSD 480

Query: 481 MLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKG 540
           MLS GLKPNGVLF+SLLSACAQCG+EKEGW+WF SM+DKYN+TPTVAHYACMVDLL R+G
Sbjct: 481 MLSCGLKPNGVLFVSLLSACAQCGIEKEGWVWFRSMIDKYNITPTVAHYACMVDLLVRQG 540

Query: 541 KTREAVEFVKKILVESDTRIWGALFSGCKST----DIADSIVEQLTALKQNNSDFYAMLL 598
           K REAVEFVKK+ VE DTRIWGALF+GCK T    DI DSIVEQLTA++ NNS+FYAMLL
Sbjct: 541 KIREAVEFVKKMPVEPDTRIWGALFTGCKLTHGFPDIVDSIVEQLTAMEPNNSNFYAMLL 600
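
Each HSP header in these reports carries a bit score, a raw score, an E-value, and identity and positive counts over the alignment length. The sketch below shows one way to pull those numbers out of the two header lines and recompute the percentages as a consistency check. The regular expressions are assumptions based only on the header layout shown in this record; the pattern tolerates a misspelled "Postives" label as well as "Positives". The function name `parse_hsp_header` is hypothetical.

```python
import re

SCORE_RE = re.compile(r"Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_header(score_line, stats_line):
    """Extract HSP statistics and recompute the percentages from the counts."""
    s = SCORE_RE.search(score_line)
    t = STATS_RE.search(stats_line)
    if s is None or t is None:
        raise ValueError("unrecognised HSP header")
    identical, length = int(t.group(1)), int(t.group(2))
    positive = int(t.group(4))
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": identical,
        "positive": positive,
        "align_length": length,
        "pct_identity": round(100 * identical / length, 2),
        "pct_positive": round(100 * positive / length, 2),
    }

hsp = parse_hsp_header(
    "HSP 1 Score: 1021.5 bits (2640), Expect = 2.9e-294",
    "Identity = 510/602 (84.72%), Positives = 547/602 (90.86%), Query Frame = 0",
)
print(hsp["pct_identity"], hsp["pct_positive"])  # 84.72 90.86
```

Recomputing the percentages from the raw counts is a cheap way to catch lines that were damaged during extraction.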

BLAST of Spg031939 vs. NCBI nr
Match: XP_023538701.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023538702.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023538703.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023538704.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1017.7 bits (2630), Expect = 4.1e-293
Identity = 506/601 (84.19%), Positives = 546/601 (90.85%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAISTKTQTYYEDPVGFYAQRENVIAWTSKITN 60
           MLCFSRCAR LFVKSPQRK YTI   IDA S K Q YYEDPVGF+AQ+E+VI+WTSKITN
Sbjct: 1   MLCFSRCARYLFVKSPQRKNYTIGLVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFESE 120
           LVRTGQPDSAFG FK MFANGHRPN+VT+LSV+RAIDA SWESTIE MHGGVIKMGFESE
Sbjct: 61  LVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120

Query: 121 VAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQYE 180
           VAVSTALLG YSM DIGIVWKLFYQIP KDVVLWSA+ISACVK+GQ+IEAF L REMQY+
Sbjct: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEAFHLFREMQYQ 180

Query: 181 GVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEAS 240
           GV PNHVSIVSILPACAD GAL LGKEIH FSMRRDFYS+VNIQNSLMDMYSKCRNLEAS
Sbjct: 181 GVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240

Query: 241 IRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLLV 300
           IRVLK MRKKDMVSWRT+THACIQNN PSKAFKIF+RM+  GFELGETMMLD IAAVLLV
Sbjct: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300

Query: 301 EELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAMI 360
           +ELLLGLAVHC+ALK GFLCFISVGTELLQMYAKFG+LGLAKL FD+LVDKDIIAWSAMI
Sbjct: 301 DELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360

Query: 361 SAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGYS 420
           SAYSHGE+PL+AIQTFKMMQSTNE+ N IT VS++NACSSL AQELG SI AHITKSGYS
Sbjct: 361 SAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESIHAHITKSGYS 420

Query: 421 SNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSDM 480
           SNT LMSALVDFYC L R+KLG+HVFDEI TKDL+CW  MIKGYG NG GNEALNTFSDM
Sbjct: 421 SNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCGNEALNTFSDM 480

Query: 481 LSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKGK 540
           LSYGLKPNG LF+SLLSACAQCGLEKEGWMWF++M+D+YN+TPTVAHYACMV+LLAR+GK
Sbjct: 481 LSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYACMVELLARQGK 540

Query: 541 TREAVEFVKKILVESDTRIWGALFSGCKST----DIADSIVEQLTALKQNNSDFYAMLLN 598
            REAVEFVKK+ VE DTRIWGALF+GCK T    DIADSIV+QL AL+ NNSDF+AML N
Sbjct: 541 IREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNALEPNNSDFHAMLHN 600

BLAST of Spg031939 vs. NCBI nr
Match: XP_022974593.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita maxima] >XP_022974594.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1017.3 bits (2629), Expect = 5.4e-293
Identity = 505/601 (84.03%), Positives = 545/601 (90.68%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAISTKTQTYYEDPVGFYAQRENVIAWTSKITN 60
           MLCFSRCARNLFVK+PQRK YTI   IDA S K Q YYEDPVGF+AQ+E+VI+WTSKITN
Sbjct: 1   MLCFSRCARNLFVKNPQRKNYTIGPVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFESE 120
           LVRTGQPDSAFG FK MFANGHRPNHVT+LSV+RAIDA SWESTIE MHGGVIKMGFESE
Sbjct: 61  LVRTGQPDSAFGFFKMMFANGHRPNHVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120

Query: 121 VAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQYE 180
           VAVSTALLG YSM DIGIVWKLFYQIP KDVVLWSA+ISACVKNGQ+IEAF L REMQY+
Sbjct: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKNGQFIEAFHLFREMQYQ 180

Query: 181 GVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEAS 240
           GV PNHVSIVSILPACAD GAL LGKEIH FSMRRDFYS+VNIQNSLMDMYSKCRNLEAS
Sbjct: 181 GVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240

Query: 241 IRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLLV 300
           IRVLK MRKKDMVSWRT+THACIQNN PSKAFKIF+RM+  GFELGETMMLD IAAVLLV
Sbjct: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300

Query: 301 EELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAMI 360
           +EL+LGLAVHCYALK GFLCFISVGTELLQMYAKFG+LGLAKL FD+LVDKDIIAWSAMI
Sbjct: 301 DELVLGLAVHCYALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360

Query: 361 SAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGYS 420
           SAYSHGE+PL+ IQTFKMMQSTNE+ N IT VS++NACSSL AQELG SI AHITKSGYS
Sbjct: 361 SAYSHGEEPLSVIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESIHAHITKSGYS 420

Query: 421 SNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSDM 480
           SNT LMSALVDFYC L R+KLG+HVFDEI TKDL+CW  MIKGYGMNG+GNEALNTFSDM
Sbjct: 421 SNTRLMSALVDFYCILRRVKLGEHVFDEILTKDLVCWSTMIKGYGMNGYGNEALNTFSDM 480

Query: 481 LSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKGK 540
           LSYGLK NG LF+SLLSACAQCGLEKEGWMWF+SM+D+YN+TPTVAHYACMV+LLAR+GK
Sbjct: 481 LSYGLKLNGTLFVSLLSACAQCGLEKEGWMWFNSMIDEYNITPTVAHYACMVELLARQGK 540

Query: 541 TREAVEFVKKILVESDTRIWGALFSGCK----STDIADSIVEQLTALKQNNSDFYAMLLN 598
            REAVEFV K+ VE DTRI GALF+GCK     +DIADSIV+QL AL+ NNSDF+AML N
Sbjct: 541 IREAVEFVNKMAVEPDTRICGALFAGCKLNHGFSDIADSIVQQLNALEPNNSDFHAMLHN 600

BLAST of Spg031939 vs. NCBI nr
Match: KAG7028589.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1013.4 bits (2619), Expect = 7.8e-292
Identity = 502/601 (83.53%), Positives = 544/601 (90.52%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAISTKTQTYYEDPVGFYAQRENVIAWTSKITN 60
           MLCFSRCARNLFVKSPQRK YTI   ID  S K Q YYEDPVGF+A++E+VI+WTSKITN
Sbjct: 1   MLCFSRCARNLFVKSPQRKNYTIGSVIDDTSAKKQAYYEDPVGFFAKQEDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFESE 120
           LVRTGQPDSAFG FK MFANGHRPNHVT+LSV+RAIDA SWESTIE MHGGVIKMGFESE
Sbjct: 61  LVRTGQPDSAFGFFKMMFANGHRPNHVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120

Query: 121 VAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQYE 180
           VAVSTALLG YSM DIGIVWKLFYQIP KDVVLWSA+ISACVK+GQ+ EAF L REMQY+
Sbjct: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFSEAFHLFREMQYQ 180

Query: 181 GVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEAS 240
           GV PNHVSIVSILPACAD GAL LGKEIH FSMRRDFYS+VNIQNSLMDMYSKCRNLEAS
Sbjct: 181 GVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240

Query: 241 IRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLLV 300
           IRVLK MRKKDMVSWRT+THACIQNN PSKAFKIF+RM+  GFELGETMMLD IAAVLLV
Sbjct: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300

Query: 301 EELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAMI 360
           +EL+LGLAVHCYALK GFLCFISVGTELLQMYAKFG+LGLAKL FD+LVDKDIIAWSAMI
Sbjct: 301 DELVLGLAVHCYALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360

Query: 361 SAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGYS 420
           SAYSHGE+PL+AIQ+FK+MQSTNE+ N IT VS++NACSSL AQELG SI AHITKSGYS
Sbjct: 361 SAYSHGEEPLSAIQSFKVMQSTNERPNAITFVSLVNACSSLDAQELGESIHAHITKSGYS 420

Query: 421 SNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSDM 480
           SNT LMSAL+DFYC L R+KLG+HVF EI TKDL+CW  MIKGYG NG GNEALNTFSDM
Sbjct: 421 SNTRLMSALLDFYCILRRVKLGEHVFYEIVTKDLVCWSTMIKGYGTNGCGNEALNTFSDM 480

Query: 481 LSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKGK 540
           LSYGLKPNG LF+SLLSACAQCGLEKEGWMWF+SM+D+YN+TPTVAHYACMV+LLAR+GK
Sbjct: 481 LSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFYSMIDEYNITPTVAHYACMVELLARQGK 540

Query: 541 TREAVEFVKKILVESDTRIWGALFSGCKST----DIADSIVEQLTALKQNNSDFYAMLLN 598
            REAVEFVKK+ VE DTRIWGALF+GCK T    DIADSIV+QL AL+ NNSDF+AML N
Sbjct: 541 IREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNALEPNNSDFHAMLHN 600

BLAST of Spg031939 vs. NCBI nr
Match: XP_022934182.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 978.8 bits (2529), Expect = 2.1e-281
Identity = 488/588 (82.99%), Positives = 528/588 (89.80%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAISTKTQTYYEDPVGFYAQRENVIAWTSKITN 60
           MLCFSRCARNLFVKSPQRK YTI   ID  S K Q YYEDPVGF+A++E+VI+WTSKITN
Sbjct: 1   MLCFSRCARNLFVKSPQRKNYTIGSVIDDTSAKKQAYYEDPVGFFAKQEDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFESE 120
           LVRTGQPDSAFG FK MFANGHRPN+VT+LSV+RAIDA SWESTIE MHGGVIKMGFESE
Sbjct: 61  LVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120

Query: 121 VAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQYE 180
           VAVSTALLG YSM DIGIVWKLFYQIP KDVVLWSA+ISACVK+GQ  EAF L REMQY+
Sbjct: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQCSEAFHLFREMQYQ 180

Query: 181 GVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEAS 240
           GV+PNHVSIVSILPACAD GAL LGKEIH FSMRRDFYS+VNIQNSLMDMYSKCRNLEAS
Sbjct: 181 GVRPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240

Query: 241 IRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLLV 300
           IRVLK MRKKDMVSWRT+THACIQNN PSKAFKIF+RM+  GFELGETMMLD IAAVLLV
Sbjct: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300

Query: 301 EELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAMI 360
           +EL+LGLAVHCYALK GFLCFISVGTELLQMYAKFG+LGLAKL FD+LVDKDIIAWSAMI
Sbjct: 301 DELVLGLAVHCYALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360

Query: 361 SAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGYS 420
           SAYSHGE+PL+AIQTFKMMQSTNE+ N IT  S++NACSSL AQELG SI AHITKSGYS
Sbjct: 361 SAYSHGEEPLSAIQTFKMMQSTNERPNAITFGSLVNACSSLDAQELGESIHAHITKSGYS 420

Query: 421 SNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSDM 480
           SNT LMSALVDFYC L R+KLG+HVF EI TKDL+CW  MIKGYGMNG+G EALNTFSDM
Sbjct: 421 SNTRLMSALVDFYCILRRVKLGEHVFYEILTKDLVCWSTMIKGYGMNGYGKEALNTFSDM 480

Query: 481 LSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKGK 540
           LSYGLKPNG LF+SLLSACAQCGLEK GWMWF+SM+D+YN+TPTVAHYACMV+LLAR+GK
Sbjct: 481 LSYGLKPNGTLFVSLLSACAQCGLEKAGWMWFNSMIDEYNITPTVAHYACMVELLARQGK 540

Query: 541 TREAVEFVKKILVESDTRIWGALFSGCKST----DIADSIVEQLTALK 585
               VEFVKK+ VE DTRIWGALF+GCK T    DIADSIV+QL A +
Sbjct: 541 ---IVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNAFR 585

BLAST of Spg031939 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 307.8 bits (787), Expect = 2.8e-82
Identity = 173/556 (31.12%), Positives = 299/556 (53.78%), Query Frame = 0

Query: 49  ENVIAWTSKITNLVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAM 108
           E  + W   +  L ++G    + GLFK M ++G   +  T   V ++  +       E +
Sbjct: 158 EKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQL 217

Query: 109 HGGVIKMGFESEVAVSTALLGVY-SMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQY 168
           HG ++K GF    +V  +L+  Y     +    K+F ++  +DV+ W+++I+  V NG  
Sbjct: 218 HGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLA 277

Query: 169 IEAFDLIREMQYEGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSL 228
            +   +  +M   G++ +  +IVS+   CADS  + LG+ +H   ++  F       N+L
Sbjct: 278 EKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTL 337

Query: 229 MDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFE--- 288
           +DMYSKC +L+++  V + M  + +VS+ ++     +     +A K+F  M+  G     
Sbjct: 338 LDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDV 397

Query: 289 LGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLI 348
              T +L+  A   L++E   G  VH +  ++     I V   L+ MYAK G +  A+L+
Sbjct: 398 YTVTAVLNCCARYRLLDE---GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 457

Query: 349 FDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFK-MMQSTNEKANEITLVSVMNACSSLGA 408
           F ++  KDII+W+ +I  YS       A+  F  +++      +E T+  V+ AC+SL A
Sbjct: 458 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 517

Query: 409 QELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKG 468
            + G  I  +I ++GY S+ H+ ++LVD Y   G + L   +FD+I++KDL+ W  MI G
Sbjct: 518 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 577

Query: 469 YGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTP 528
           YGM+G+G EA+  F+ M   G++ + + F+SLL AC+  GL  EGW +F+ M  +  + P
Sbjct: 578 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 637

Query: 529 TVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGCK---STDIADSIVEQL 588
           TV HYAC+VD+LAR G   +A  F++ + +  D  IWGAL  GC+      +A+ + E++
Sbjct: 638 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKV 697

Query: 589 TALKQNNSDFYAMLLN 597
             L+  N+ +Y ++ N
Sbjct: 698 FELEPENTGYYVLMAN 710

BLAST of Spg031939 vs. ExPASy Swiss-Prot
Match: Q9SUH6 (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 300.4 bits (768), Expect = 4.4e-80
Identity = 168/566 (29.68%), Positives = 296/566 (52.30%), Query Frame = 0

Query: 37  YYEDPVGFYAQRENVIAWTSKITNLVRTGQPDSAFGLFKTM-FANGHRPNHVTILSVMRA 96
           YY   +    QR +V  +   +        P S+  +F  +  +   +PN  T    + A
Sbjct: 69  YYARDIFLSVQRPDVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAISA 128

Query: 97  IDASSWESTIEAMHGGVIKMGFESEVAVSTALLGVY-SMLDIGIVWKLFYQIPCKDVVLW 156
                 +     +HG  +  G +SE+ + + ++ +Y     +    K+F ++P KD +LW
Sbjct: 129 ASGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILW 188

Query: 157 SAMISACVKNGQYIEAFDLIREMQYEG-VQPNHVSIVSILPACADSGALCLGKEIHGFSM 216
           + MIS   KN  Y+E+  + R++  E   + +  +++ ILPA A+   L LG +IH  + 
Sbjct: 189 NTMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLAT 248

Query: 217 RRDFYSLVNIQNSLMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFK 276
           +   YS   +    + +YSKC  ++    + +  RK D+V++  + H    N     +  
Sbjct: 249 KTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLS 308

Query: 277 IFSRMKFSGFELGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYA 336
           +F  +  SG  L  + ++ ++    +   L+L  A+H Y LKS FL   SV T L  +Y+
Sbjct: 309 LFKELMLSGARLRSSTLVSLVP---VSGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYS 368

Query: 337 KFGDLGLAKLIFDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVS 396
           K  ++  A+ +FD+  +K + +W+AMIS Y+      +AI  F+ MQ +    N +T+  
Sbjct: 369 KLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITC 428

Query: 397 VMNACSSLGAQELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKD 456
           +++AC+ LGA  LG  +   +  + + S+ ++ +AL+  Y   G I   + +FD ++ K+
Sbjct: 429 ILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKN 488

Query: 457 LICWGAMIKGYGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFH 516
            + W  MI GYG++G G EALN F +ML+ G+ P  V FL +L AC+  GL KEG   F+
Sbjct: 489 EVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFN 548

Query: 517 SMVDKYNVTPTVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGC---KST 576
           SM+ +Y   P+V HYACMVD+L R G  + A++F++ + +E  + +W  L   C   K T
Sbjct: 549 SMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDT 608

Query: 577 DIADSIVEQLTALKQNNSDFYAMLLN 597
           ++A ++ E+L  L  +N  ++ +L N
Sbjct: 609 NLARTVSEKLFELDPDNVGYHVLLSN 631

BLAST of Spg031939 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 297.0 bits (759), Expect = 4.9e-79
Identity = 158/493 (32.05%), Positives = 279/493 (56.59%), Query Frame = 0

Query: 108 MHGGVIKMGFESEVAVSTALLGVYSML-DIGIVWKLFYQIPCKDVVLWSAMISACVKNGQ 167
           +HG ++K GF  ++   T L  +Y+    +    K+F ++P +D+V W+ +++   +NG 
Sbjct: 157 IHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGM 216

Query: 168 YIEAFDLIREMQYEGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNS 227
              A ++++ M  E ++P+ ++IVS+LPA +    + +GKEIHG++MR  F SLVNI  +
Sbjct: 217 ARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTA 276

Query: 228 LMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELG 287
           L+DMY+KC +LE + ++   M ++++VSW ++  A +QN NP +A  IF +M   G +  
Sbjct: 277 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPT 336

Query: 288 ETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFD 347
           +  ++  + A   + +L  G  +H  +++ G    +SV   L+ MY K  ++  A  +F 
Sbjct: 337 DVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFG 396

Query: 348 DLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQEL 407
            L  + +++W+AMI  ++    P++A+  F  M+S   K +  T VSV+ A + L     
Sbjct: 397 KLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 456

Query: 408 GGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGM 467
              I   + +S    N  + +ALVD Y   G I + + +FD +S + +  W AMI GYG 
Sbjct: 457 AKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGT 516

Query: 468 NGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVA 527
           +G+G  AL  F +M    +KPNGV FLS++SAC+  GL + G   F+ M + Y++  ++ 
Sbjct: 517 HGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMD 576

Query: 528 HYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGC---KSTDIADSIVEQLTAL 587
           HY  MVDLL R G+  EA +F+ ++ V+    ++GA+   C   K+ + A+   E+L  L
Sbjct: 577 HYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFEL 636

Query: 588 KQNNSDFYAMLLN 597
             ++  ++ +L N
Sbjct: 637 NPDDGGYHVLLAN 649

BLAST of Spg031939 vs. ExPASy Swiss-Prot
Match: P93005 (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 295.4 bits (755), Expect = 1.4e-78
Identity = 178/559 (31.84%), Positives = 297/559 (53.13%), Query Frame = 0

Query: 49  ENVIAWTSKITNLVRTGQPDSAF---GLFKTMFANGHRPNHVTILSVMRAIDASSWESTI 108
           ++V++W S IT   + G   S++    LF+ M A    PN  T+  + +A ++S   ST+
Sbjct: 78  KDVVSWNSLITGYSQNGGISSSYTVMQLFREMRAQDILPNAYTLAGIFKA-ESSLQSSTV 137

Query: 109 -EAMHGGVIKMGFESEVAVSTALLGVY---SMLDIGIVWKLFYQIPCKDVVLWSAMISAC 168
               H  V+KM    ++ V T+L+G+Y    +++ G+  K+F  +P ++   WS M+S  
Sbjct: 138 GRQAHALVVKMSSFGDIYVDTSLVGMYCKAGLVEDGL--KVFAYMPERNTYTWSTMVSGY 197

Query: 169 VKNGQYIEA---FDLIREMQYEGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFY 228
              G+  EA   F+L    + EG   ++V   ++L + A +  + LG++IH  +++    
Sbjct: 198 ATRGRVEEAIKVFNLFLREKEEGSDSDYV-FTAVLSSLAATIYVGLGRQIHCITIKNGLL 257

Query: 229 SLVNIQNSLMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRM 288
             V + N+L+ MYSKC +L  + ++      ++ ++W  +     QN    +A K+FSRM
Sbjct: 258 GFVALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRM 317

Query: 289 KFSGFELGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDL 348
             +G +  E  ++ ++ A   +  L  G  +H + LK GF   +   T L+ MYAK G L
Sbjct: 318 FSAGIKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCL 377

Query: 349 GLAKLIFDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNAC 408
             A+  FD L ++D+  W+++IS Y    D   A+  ++ M++     N+ T+ SV+ AC
Sbjct: 378 ADARKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKAC 437

Query: 409 SSLGAQELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWG 468
           SSL   ELG  +  H  K G+     + SAL   Y   G ++ G  VF     KD++ W 
Sbjct: 438 SSLATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLEDGNLVFRRTPNKDVVSWN 497

Query: 469 AMIKGYGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDK 528
           AMI G   NG G+EAL  F +ML+ G++P+ V F++++SAC+  G  + GW +F+ M D+
Sbjct: 498 AMISGLSHNGQGDEALELFEEMLAEGMEPDDVTFVNIISACSHKGFVERGWFYFNMMSDQ 557

Query: 529 YNVTPTVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGCKS---TDIADS 588
             + P V HYACMVDLL+R G+ +EA EF++   ++    +W  L S CK+    ++   
Sbjct: 558 IGLDPKVDHYACMVDLLSRAGQLKEAKEFIESANIDHGLCLWRILLSACKNHGKCELGVY 617

Query: 589 IVEQLTALKQNNSDFYAML 595
             E+L AL    S  Y  L
Sbjct: 618 AGEKLMALGSRESSTYVQL 632

BLAST of Spg031939 vs. ExPASy Swiss-Prot
Match: Q9XE98 (Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E99 PE=3 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.1e-75
Identity = 170/556 (30.58%), Positives = 294/556 (52.88%), Query Frame = 0

Query: 47  QRENVIAWTSKITNLVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIE 106
           +  +V+ WT+ I    R G    A  L   M   G +P  VT+L ++  +      + ++
Sbjct: 108 RERDVVHWTAMIGCYSRAGIVGEACSLVNEMRFQGIKPGPVTLLEMLSGVLEI---TQLQ 167

Query: 107 AMHGGVIKMGFESEVAVSTALLGVYSMLD-IGIVWKLFYQIPCKDVVLWSAMISACVKNG 166
            +H   +  GF+ ++AV  ++L +Y   D +G    LF Q+  +D+V W+ MIS     G
Sbjct: 168 CLHDFAVIYGFDCDIAVMNSMLNLYCKCDHVGDAKDLFDQMEQRDMVSWNTMISGYASVG 227

Query: 167 QYIEAFDLIREMQYEGVQPNHVSIVSILPACADSGALC---LGKEIHGFSMRRDFYSLVN 226
              E   L+  M+ +G++P+  +  + L     SG +C   +G+ +H   ++  F   ++
Sbjct: 228 NMSEILKLLYRMRGDGLRPDQQTFGASLSV---SGTMCDLEMGRMLHCQIVKTGFDVDMH 287

Query: 227 IQNSLMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSG 286
           ++ +L+ MY KC   EAS RVL+ +  KD+V W  +    ++     KA  +FS M  SG
Sbjct: 288 LKTALITMYLKCGKEEASYRVLETIPNKDVVCWTVMISGLMRLGRAEKALIVFSEMLQSG 347

Query: 287 FELGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAK 346
            +L    +  ++A+   +    LG +VH Y L+ G+         L+ MYAK G L  + 
Sbjct: 348 SDLSSEAIASVVASCAQLGSFDLGASVHGYVLRHGYTLDTPALNSLITMYAKCGHLDKSL 407

Query: 347 LIFDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQ-STNEKANEITLVSVMNACSSL 406
           +IF+ + ++D+++W+A+IS Y+   D   A+  F+ M+  T ++ +  T+VS++ ACSS 
Sbjct: 408 VIFERMNERDLVSWNAIISGYAQNVDLCKALLLFEEMKFKTVQQVDSFTVVSLLQACSSA 467

Query: 407 GAQELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMI 466
           GA  +G  I   + +S     + + +ALVD Y   G ++  +  FD IS KD++ WG +I
Sbjct: 468 GALPVGKLIHCIVIRSFIRPCSLVDTALVDMYSKCGYLEAAQRCFDSISWKDVVSWGILI 527

Query: 467 KGYGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNV 526
            GYG +G G+ AL  +S+ L  G++PN V+FL++LS+C+  G+ ++G   F SMV  + V
Sbjct: 528 AGYGFHGKGDIALEIYSEFLHSGMEPNHVIFLAVLSSCSHNGMVQQGLKIFSSMVRDFGV 587

Query: 527 TPTVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGCKS---TDIADSIVE 586
            P   H AC+VDLL R  +  +A +F K+        + G +   C++   T++ D I E
Sbjct: 588 EPNHEHLACVVDLLCRAKRIEDAFKFYKENFTRPSIDVLGIILDACRANGKTEVEDIICE 647

Query: 587 QLTALKQNNSDFYAML 595
            +  LK  ++  Y  L
Sbjct: 648 DMIELKPGDAGHYVKL 657

BLAST of Spg031939 vs. ExPASy TrEMBL
Match: A0A6J1D2H4 (pentatricopeptide repeat-containing protein DOT4, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016401 PE=4 SV=1)

HSP 1 Score: 1021.5 bits (2640), Expect = 1.4e-294
Identity = 510/602 (84.72%), Positives = 547/602 (90.86%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAIST-KTQTYYEDPVGFYAQRENVIAWTSKIT 60
           MLCFSRCARNLFVKSPQRK Y I   ++A S+ K Q YYEDPVGFYAQRE+VI+WTSKIT
Sbjct: 1   MLCFSRCARNLFVKSPQRKNYMIDPMVEATSSNKKQAYYEDPVGFYAQREDVISWTSKIT 60

Query: 61  NLVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFES 120
           NLVRTGQ ++AFG FKTMFANGHRPNHVT+LSV+RAIDA SWES  E +HGGVIKMGFES
Sbjct: 61  NLVRTGQSEAAFGFFKTMFANGHRPNHVTMLSVIRAIDALSWESMNEVVHGGVIKMGFES 120

Query: 121 EVAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQY 180
           EVAVSTALLG YS  DIG VWKLFYQIP KD+VLWSAMISACVKNGQ+IEA DL REMQY
Sbjct: 121 EVAVSTALLGFYSNRDIGTVWKLFYQIPYKDIVLWSAMISACVKNGQFIEALDLFREMQY 180

Query: 181 EGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA 240
           +GVQPNHVSIVS+LPACADSG + LGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA
Sbjct: 181 QGVQPNHVSIVSVLPACADSGVISLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA 240

Query: 241 SIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLL 300
           SIRVLKMMRKKDMVSWRT+T+ACIQNN PSKAFKIFSRM+  GFELG+TMML IIAAVLL
Sbjct: 241 SIRVLKMMRKKDMVSWRTVTNACIQNNCPSKAFKIFSRMRSFGFELGKTMMLAIIAAVLL 300

Query: 301 VEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAM 360
           VEELLLGLAVHCYALK GFLCFI+VGTE+LQMYAKFG LGLAKLIFD+LVDKDIIAWSAM
Sbjct: 301 VEELLLGLAVHCYALKGGFLCFIAVGTEILQMYAKFGHLGLAKLIFDELVDKDIIAWSAM 360

Query: 361 ISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGY 420
           ISAYSHGEDPLNAIQTFKMMQSTNEK NEIT VS++NACSSLGAQELG SI AHI KSG 
Sbjct: 361 ISAYSHGEDPLNAIQTFKMMQSTNEKPNEITFVSLVNACSSLGAQELGESIHAHIMKSGC 420

Query: 421 SSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSD 480
           SSNTHLMSA VD YCTLGRIK GKHVFDEISTKDLICW  MIKGYGMNG GNEAL+TFSD
Sbjct: 421 SSNTHLMSAFVDLYCTLGRIKQGKHVFDEISTKDLICWSTMIKGYGMNGCGNEALDTFSD 480

Query: 481 MLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKG 540
           MLS GLKPNGVLF+SLLSACAQCG+EKEGW+WF SM+DKYN+TPTVAHYACMVDLL R+G
Sbjct: 481 MLSCGLKPNGVLFVSLLSACAQCGIEKEGWVWFRSMIDKYNITPTVAHYACMVDLLVRQG 540

Query: 541 KTREAVEFVKKILVESDTRIWGALFSGCKST----DIADSIVEQLTALKQNNSDFYAMLL 598
           K REAVEFVKK+ VE DTRIWGALF+GCK T    DI DSIVEQLTA++ NNS+FYAMLL
Sbjct: 541 KIREAVEFVKKMPVEPDTRIWGALFTGCKLTHGFPDIVDSIVEQLTAMEPNNSNFYAMLL 600

BLAST of Spg031939 vs. ExPASy TrEMBL
Match: A0A6J1IGR3 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111473280 PE=4 SV=1)

HSP 1 Score: 1017.3 bits (2629), Expect = 2.6e-293
Identity = 505/601 (84.03%), Positives = 545/601 (90.68%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAISTKTQTYYEDPVGFYAQRENVIAWTSKITN 60
           MLCFSRCARNLFVK+PQRK YTI   IDA S K Q YYEDPVGF+AQ+E+VI+WTSKITN
Sbjct: 1   MLCFSRCARNLFVKNPQRKNYTIGPVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFESE 120
           LVRTGQPDSAFG FK MFANGHRPNHVT+LSV+RAIDA SWESTIE MHGGVIKMGFESE
Sbjct: 61  LVRTGQPDSAFGFFKMMFANGHRPNHVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120

Query: 121 VAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQYE 180
           VAVSTALLG YSM DIGIVWKLFYQIP KDVVLWSA+ISACVKNGQ+IEAF L REMQY+
Sbjct: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKNGQFIEAFHLFREMQYQ 180

Query: 181 GVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEAS 240
           GV PNHVSIVSILPACAD GAL LGKEIH FSMRRDFYS+VNIQNSLMDMYSKCRNLEAS
Sbjct: 181 GVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240

Query: 241 IRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLLV 300
           IRVLK MRKKDMVSWRT+THACIQNN PSKAFKIF+RM+  GFELGETMMLD IAAVLLV
Sbjct: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300

Query: 301 EELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAMI 360
           +EL+LGLAVHCYALK GFLCFISVGTELLQMYAKFG+LGLAKL FD+LVDKDIIAWSAMI
Sbjct: 301 DELVLGLAVHCYALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360

Query: 361 SAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGYS 420
           SAYSHGE+PL+ IQTFKMMQSTNE+ N IT VS++NACSSL AQELG SI AHITKSGYS
Sbjct: 361 SAYSHGEEPLSVIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESIHAHITKSGYS 420

Query: 421 SNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSDM 480
           SNT LMSALVDFYC L R+KLG+HVFDEI TKDL+CW  MIKGYGMNG+GNEALNTFSDM
Sbjct: 421 SNTRLMSALVDFYCILRRVKLGEHVFDEILTKDLVCWSTMIKGYGMNGYGNEALNTFSDM 480

Query: 481 LSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKGK 540
           LSYGLK NG LF+SLLSACAQCGLEKEGWMWF+SM+D+YN+TPTVAHYACMV+LLAR+GK
Sbjct: 481 LSYGLKLNGTLFVSLLSACAQCGLEKEGWMWFNSMIDEYNITPTVAHYACMVELLARQGK 540

Query: 541 TREAVEFVKKILVESDTRIWGALFSGCK----STDIADSIVEQLTALKQNNSDFYAMLLN 598
            REAVEFV K+ VE DTRI GALF+GCK     +DIADSIV+QL AL+ NNSDF+AML N
Sbjct: 541 IREAVEFVNKMAVEPDTRICGALFAGCKLNHGFSDIADSIVQQLNALEPNNSDFHAMLHN 600

BLAST of Spg031939 vs. ExPASy TrEMBL
Match: A0A6J1F1Z2 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111441423 PE=4 SV=1)

HSP 1 Score: 978.8 bits (2529), Expect = 1.0e-281
Identity = 488/588 (82.99%), Positives = 528/588 (89.80%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAISTKTQTYYEDPVGFYAQRENVIAWTSKITN 60
           MLCFSRCARNLFVKSPQRK YTI   ID  S K Q YYEDPVGF+A++E+VI+WTSKITN
Sbjct: 1   MLCFSRCARNLFVKSPQRKNYTIGSVIDDTSAKKQAYYEDPVGFFAKQEDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFESE 120
           LVRTGQPDSAFG FK MFANGHRPN+VT+LSV+RAIDA SWESTIE MHGGVIKMGFESE
Sbjct: 61  LVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120

Query: 121 VAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQYE 180
           VAVSTALLG YSM DIGIVWKLFYQIP KDVVLWSA+ISACVK+GQ  EAF L REMQY+
Sbjct: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQCSEAFHLFREMQYQ 180

Query: 181 GVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEAS 240
           GV+PNHVSIVSILPACAD GAL LGKEIH FSMRRDFYS+VNIQNSLMDMYSKCRNLEAS
Sbjct: 181 GVRPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240

Query: 241 IRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLLV 300
           IRVLK MRKKDMVSWRT+THACIQNN PSKAFKIF+RM+  GFELGETMMLD IAAVLLV
Sbjct: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300

Query: 301 EELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAMI 360
           +EL+LGLAVHCYALK GFLCFISVGTELLQMYAKFG+LGLAKL FD+LVDKDIIAWSAMI
Sbjct: 301 DELVLGLAVHCYALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360

Query: 361 SAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGYS 420
           SAYSHGE+PL+AIQTFKMMQSTNE+ N IT  S++NACSSL AQELG SI AHITKSGYS
Sbjct: 361 SAYSHGEEPLSAIQTFKMMQSTNERPNAITFGSLVNACSSLDAQELGESIHAHITKSGYS 420

Query: 421 SNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSDM 480
           SNT LMSALVDFYC L R+KLG+HVF EI TKDL+CW  MIKGYGMNG+G EALNTFSDM
Sbjct: 421 SNTRLMSALVDFYCILRRVKLGEHVFYEILTKDLVCWSTMIKGYGMNGYGKEALNTFSDM 480

Query: 481 LSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKGK 540
           LSYGLKPNG LF+SLLSACAQCGLEK GWMWF+SM+D+YN+TPTVAHYACMV+LLAR+GK
Sbjct: 481 LSYGLKPNGTLFVSLLSACAQCGLEKAGWMWFNSMIDEYNITPTVAHYACMVELLARQGK 540

Query: 541 TREAVEFVKKILVESDTRIWGALFSGCKST----DIADSIVEQLTALK 585
               VEFVKK+ VE DTRIWGALF+GCK T    DIADSIV+QL A +
Sbjct: 541 ---IVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNAFR 585

BLAST of Spg031939 vs. ExPASy TrEMBL
Match: A0A6J1D1G5 (pentatricopeptide repeat-containing protein DOT4, chloroplastic isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016401 PE=4 SV=1)

HSP 1 Score: 956.4 bits (2471), Expect = 5.5e-275
Identity = 484/602 (80.40%), Positives = 520/602 (86.38%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAIST-KTQTYYEDPVGFYAQRENVIAWTSKIT 60
           MLCFSRCARNLFVKSPQRK Y I   ++A S+ K Q YYEDPVGFYAQRE+VI+WTSKIT
Sbjct: 1   MLCFSRCARNLFVKSPQRKNYMIDPMVEATSSNKKQAYYEDPVGFYAQREDVISWTSKIT 60

Query: 61  NLVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFES 120
           NLVRTGQ ++AFG FKTMFANGHRPNHVT+LSV+RAIDA SWES  E +HGGVIKMGFES
Sbjct: 61  NLVRTGQSEAAFGFFKTMFANGHRPNHVTMLSVIRAIDALSWESMNEVVHGGVIKMGFES 120

Query: 121 EVAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQY 180
           EVAVSTALLG YS  DIG VWKLFYQIP KD+VLWSAMISACVKNGQ+IEA DL REMQY
Sbjct: 121 EVAVSTALLGFYSNRDIGTVWKLFYQIPYKDIVLWSAMISACVKNGQFIEALDLFREMQY 180

Query: 181 EGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA 240
           +GVQPNHVSIVS+LPACADSG + LGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA
Sbjct: 181 QGVQPNHVSIVSVLPACADSGVISLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEA 240

Query: 241 SIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLL 300
           SIRVLKMMRKKDMVSWRT+T+ACIQNN PSKAFKIFSRM+  GFELG+TMML IIAAVLL
Sbjct: 241 SIRVLKMMRKKDMVSWRTVTNACIQNNCPSKAFKIFSRMRSFGFELGKTMMLAIIAAVLL 300

Query: 301 VEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAM 360
           VEELLLGLAVHCYALK GFLCFI+VGTE+LQMYAKFG LGLAKLIFD+LVDKDIIAWSAM
Sbjct: 301 VEELLLGLAVHCYALKGGFLCFIAVGTEILQMYAKFGHLGLAKLIFDELVDKDIIAWSAM 360

Query: 361 ISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGY 420
           ISAYSH                             +NACSSLGAQELG SI AHI KSG 
Sbjct: 361 ISAYSH-----------------------------VNACSSLGAQELGESIHAHIMKSGC 420

Query: 421 SSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSD 480
           SSNTHLMSA VD YCTLGRIK GKHVFDEISTKDLICW  MIKGYGMNG GNEAL+TFSD
Sbjct: 421 SSNTHLMSAFVDLYCTLGRIKQGKHVFDEISTKDLICWSTMIKGYGMNGCGNEALDTFSD 480

Query: 481 MLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKG 540
           MLS GLKPNGVLF+SLLSACAQCG+EKEGW+WF SM+DKYN+TPTVAHYACMVDLL R+G
Sbjct: 481 MLSCGLKPNGVLFVSLLSACAQCGIEKEGWVWFRSMIDKYNITPTVAHYACMVDLLVRQG 540

Query: 541 KTREAVEFVKKILVESDTRIWGALFSGCKST----DIADSIVEQLTALKQNNSDFYAMLL 598
           K REAVEFVKK+ VE DTRIWGALF+GCK T    DI DSIVEQLTA++ NNS+FYAMLL
Sbjct: 541 KIREAVEFVKKMPVEPDTRIWGALFTGCKLTHGFPDIVDSIVEQLTAMEPNNSNFYAMLL 573

BLAST of Spg031939 vs. ExPASy TrEMBL
Match: A0A1S4DSI8 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g56570 OS=Cucumis melo OX=3656 GN=LOC103484166 PE=4 SV=1)

HSP 1 Score: 925.2 bits (2390), Expect = 1.3e-265
Identity = 459/570 (80.53%), Positives = 501/570 (87.89%), Query Frame = 0

Query: 1   MLCFSRCARNLFVKSPQRKFYTIAQKIDAISTKTQTYYEDPVGFYAQRENVIAWTSKITN 60
           MLCFSRCARNLFV SP RK YTI   +DA STK + Y+EDPV FYAQRE+VI+WTSKITN
Sbjct: 1   MLCFSRCARNLFVISPNRKNYTIRSMMDATSTKKRGYFEDPVEFYAQREDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAMHGGVIKMGFESE 120
           LVR GQP+SAFG FK MF+NGHRPN+VT+LSV+RAIDA SW+S IE MHG  IKMGFESE
Sbjct: 61  LVRAGQPESAFGFFKMMFSNGHRPNYVTMLSVIRAIDALSWDSMIEVMHGVTIKMGFESE 120

Query: 121 VAVSTALLGVYSMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQYIEAFDLIREMQYE 180
           VAVSTALLG YS+ DI  VWKLF QIPCKDVV WSA+ISACVKNGQY EAFDL+REMQ +
Sbjct: 121 VAVSTALLGFYSIRDIETVWKLFNQIPCKDVVFWSAIISACVKNGQYSEAFDLLREMQDQ 180

Query: 181 GVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSLMDMYSKCRNLEAS 240
           GVQPN VSIVSILPACAD G L LGKE+H FSMR+DFYS+V+IQNSLMDMYSKCR  EAS
Sbjct: 181 GVQPNQVSIVSILPACADFGVLSLGKELHAFSMRKDFYSMVDIQNSLMDMYSKCRMFEAS 240

Query: 241 IRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELGETMMLDIIAAVLLV 300
           I+VLK+MRKKD VSW+ ITHACIQNN PS+ FKIFSRM+  GFEL ETM+LD+I+AVLLV
Sbjct: 241 IKVLKLMRKKDAVSWKIITHACIQNNYPSEVFKIFSRMRSLGFELSETMVLDMISAVLLV 300

Query: 301 EELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFDDLVDKDIIAWSAMI 360
           +ELLLGLAVHCYALK GFLCFI VGTELLQMYAKFGDL LAKL+FD+LVDKDIIAWSAMI
Sbjct: 301 DELLLGLAVHCYALKGGFLCFILVGTELLQMYAKFGDLRLAKLVFDELVDKDIIAWSAMI 360

Query: 361 SAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQELGGSIQAHITKSGYS 420
           S YSHGEDPLNAIQTFKMMQSTNEK NE T VS+M+ACSSLGA+ELG SIQAH  K GY+
Sbjct: 361 SVYSHGEDPLNAIQTFKMMQSTNEKPNERTFVSLMDACSSLGAKELGESIQAHTIKCGYT 420

Query: 421 SNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGMNGWGNEALNTFSDM 480
           SNTHLMSALV FYCTLGRIKLG+HVFDEISTKDLICW AMIKGYG+NG GN+ALNTFSDM
Sbjct: 421 SNTHLMSALVGFYCTLGRIKLGEHVFDEISTKDLICWNAMIKGYGLNGCGNKALNTFSDM 480

Query: 481 LSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVAHYACMVDLLARKGK 540
           LSYGLKPNGV+F SLLSACAQCGLEKE  MWF SM+DKY +TPT AHYAC+VDLL RKGK
Sbjct: 481 LSYGLKPNGVVFASLLSACAQCGLEKEVRMWFRSMIDKYGITPTEAHYACIVDLLVRKGK 540

Query: 541 TREAVEFVKKILVESDTRIWGALFSGCKST 571
             EAVEFVK + VE DTRIWGAL  GCK T
Sbjct: 541 IGEAVEFVKXMPVEPDTRIWGALLLGCKLT 570

BLAST of Spg031939 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 307.8 bits (787), Expect = 2.0e-83
Identity = 173/556 (31.12%), Positives = 299/556 (53.78%), Query Frame = 0

Query: 49  ENVIAWTSKITNLVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIEAM 108
           E  + W   +  L ++G    + GLFK M ++G   +  T   V ++  +       E +
Sbjct: 158 EKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQL 217

Query: 109 HGGVIKMGFESEVAVSTALLGVY-SMLDIGIVWKLFYQIPCKDVVLWSAMISACVKNGQY 168
           HG ++K GF    +V  +L+  Y     +    K+F ++  +DV+ W+++I+  V NG  
Sbjct: 218 HGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLA 277

Query: 169 IEAFDLIREMQYEGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNSL 228
            +   +  +M   G++ +  +IVS+   CADS  + LG+ +H   ++  F       N+L
Sbjct: 278 EKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTL 337

Query: 229 MDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFE--- 288
           +DMYSKC +L+++  V + M  + +VS+ ++     +     +A K+F  M+  G     
Sbjct: 338 LDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDV 397

Query: 289 LGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLI 348
              T +L+  A   L++E   G  VH +  ++     I V   L+ MYAK G +  A+L+
Sbjct: 398 YTVTAVLNCCARYRLLDE---GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 457

Query: 349 FDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFK-MMQSTNEKANEITLVSVMNACSSLGA 408
           F ++  KDII+W+ +I  YS       A+  F  +++      +E T+  V+ AC+SL A
Sbjct: 458 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 517

Query: 409 QELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKG 468
            + G  I  +I ++GY S+ H+ ++LVD Y   G + L   +FD+I++KDL+ W  MI G
Sbjct: 518 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 577

Query: 469 YGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTP 528
           YGM+G+G EA+  F+ M   G++ + + F+SLL AC+  GL  EGW +F+ M  +  + P
Sbjct: 578 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 637

Query: 529 TVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGCK---STDIADSIVEQL 588
           TV HYAC+VD+LAR G   +A  F++ + +  D  IWGAL  GC+      +A+ + E++
Sbjct: 638 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKV 697

Query: 589 TALKQNNSDFYAMLLN 597
             L+  N+ +Y ++ N
Sbjct: 698 FELEPENTGYYVLMAN 710

BLAST of Spg031939 vs. TAIR 10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 300.4 bits (768), Expect = 3.1e-81
Identity = 168/566 (29.68%), Positives = 296/566 (52.30%), Query Frame = 0

Query: 37  YYEDPVGFYAQRENVIAWTSKITNLVRTGQPDSAFGLFKTM-FANGHRPNHVTILSVMRA 96
           YY   +    QR +V  +   +        P S+  +F  +  +   +PN  T    + A
Sbjct: 69  YYARDIFLSVQRPDVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAISA 128

Query: 97  IDASSWESTIEAMHGGVIKMGFESEVAVSTALLGVY-SMLDIGIVWKLFYQIPCKDVVLW 156
                 +     +HG  +  G +SE+ + + ++ +Y     +    K+F ++P KD +LW
Sbjct: 129 ASGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILW 188

Query: 157 SAMISACVKNGQYIEAFDLIREMQYEG-VQPNHVSIVSILPACADSGALCLGKEIHGFSM 216
           + MIS   KN  Y+E+  + R++  E   + +  +++ ILPA A+   L LG +IH  + 
Sbjct: 189 NTMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLAT 248

Query: 217 RRDFYSLVNIQNSLMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFK 276
           +   YS   +    + +YSKC  ++    + +  RK D+V++  + H    N     +  
Sbjct: 249 KTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLS 308

Query: 277 IFSRMKFSGFELGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYA 336
           +F  +  SG  L  + ++ ++    +   L+L  A+H Y LKS FL   SV T L  +Y+
Sbjct: 309 LFKELMLSGARLRSSTLVSLVP---VSGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYS 368

Query: 337 KFGDLGLAKLIFDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVS 396
           K  ++  A+ +FD+  +K + +W+AMIS Y+      +AI  F+ MQ +    N +T+  
Sbjct: 369 KLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITC 428

Query: 397 VMNACSSLGAQELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKD 456
           +++AC+ LGA  LG  +   +  + + S+ ++ +AL+  Y   G I   + +FD ++ K+
Sbjct: 429 ILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKN 488

Query: 457 LICWGAMIKGYGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFH 516
            + W  MI GYG++G G EALN F +ML+ G+ P  V FL +L AC+  GL KEG   F+
Sbjct: 489 EVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFN 548

Query: 517 SMVDKYNVTPTVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGC---KST 576
           SM+ +Y   P+V HYACMVD+L R G  + A++F++ + +E  + +W  L   C   K T
Sbjct: 549 SMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDT 608

Query: 577 DIADSIVEQLTALKQNNSDFYAMLLN 597
           ++A ++ E+L  L  +N  ++ +L N
Sbjct: 609 NLARTVSEKLFELDPDNVGYHVLLSN 631

BLAST of Spg031939 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 297.0 bits (759), Expect = 3.5e-80
Identity = 158/493 (32.05%), Positives = 279/493 (56.59%), Query Frame = 0

Query: 108 MHGGVIKMGFESEVAVSTALLGVYSML-DIGIVWKLFYQIPCKDVVLWSAMISACVKNGQ 167
           +HG ++K GF  ++   T L  +Y+    +    K+F ++P +D+V W+ +++   +NG 
Sbjct: 157 IHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGM 216

Query: 168 YIEAFDLIREMQYEGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFYSLVNIQNS 227
              A ++++ M  E ++P+ ++IVS+LPA +    + +GKEIHG++MR  F SLVNI  +
Sbjct: 217 ARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTA 276

Query: 228 LMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSGFELG 287
           L+DMY+KC +LE + ++   M ++++VSW ++  A +QN NP +A  IF +M   G +  
Sbjct: 277 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPT 336

Query: 288 ETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAKLIFD 347
           +  ++  + A   + +L  G  +H  +++ G    +SV   L+ MY K  ++  A  +F 
Sbjct: 337 DVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFG 396

Query: 348 DLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNACSSLGAQEL 407
            L  + +++W+AMI  ++    P++A+  F  M+S   K +  T VSV+ A + L     
Sbjct: 397 KLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 456

Query: 408 GGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMIKGYGM 467
              I   + +S    N  + +ALVD Y   G I + + +FD +S + +  W AMI GYG 
Sbjct: 457 AKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGT 516

Query: 468 NGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNVTPTVA 527
           +G+G  AL  F +M    +KPNGV FLS++SAC+  GL + G   F+ M + Y++  ++ 
Sbjct: 517 HGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMD 576

Query: 528 HYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGC---KSTDIADSIVEQLTAL 587
           HY  MVDLL R G+  EA +F+ ++ V+    ++GA+   C   K+ + A+   E+L  L
Sbjct: 577 HYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFEL 636

Query: 588 KQNNSDFYAMLLN 597
             ++  ++ +L N
Sbjct: 637 NPDDGGYHVLLAN 649

BLAST of Spg031939 vs. TAIR 10
Match: AT2G33680.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 295.4 bits (755), Expect = 1.0e-79
Identity = 178/559 (31.84%), Positives = 297/559 (53.13%), Query Frame = 0

Query: 49  ENVIAWTSKITNLVRTGQPDSAF---GLFKTMFANGHRPNHVTILSVMRAIDASSWESTI 108
           ++V++W S IT   + G   S++    LF+ M A    PN  T+  + +A ++S   ST+
Sbjct: 78  KDVVSWNSLITGYSQNGGISSSYTVMQLFREMRAQDILPNAYTLAGIFKA-ESSLQSSTV 137

Query: 109 -EAMHGGVIKMGFESEVAVSTALLGVY---SMLDIGIVWKLFYQIPCKDVVLWSAMISAC 168
               H  V+KM    ++ V T+L+G+Y    +++ G+  K+F  +P ++   WS M+S  
Sbjct: 138 GRQAHALVVKMSSFGDIYVDTSLVGMYCKAGLVEDGL--KVFAYMPERNTYTWSTMVSGY 197

Query: 169 VKNGQYIEA---FDLIREMQYEGVQPNHVSIVSILPACADSGALCLGKEIHGFSMRRDFY 228
              G+  EA   F+L    + EG   ++V   ++L + A +  + LG++IH  +++    
Sbjct: 198 ATRGRVEEAIKVFNLFLREKEEGSDSDYV-FTAVLSSLAATIYVGLGRQIHCITIKNGLL 257

Query: 229 SLVNIQNSLMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRM 288
             V + N+L+ MYSKC +L  + ++      ++ ++W  +     QN    +A K+FSRM
Sbjct: 258 GFVALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRM 317

Query: 289 KFSGFELGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDL 348
             +G +  E  ++ ++ A   +  L  G  +H + LK GF   +   T L+ MYAK G L
Sbjct: 318 FSAGIKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCL 377

Query: 349 GLAKLIFDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKANEITLVSVMNAC 408
             A+  FD L ++D+  W+++IS Y    D   A+  ++ M++     N+ T+ SV+ AC
Sbjct: 378 ADARKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKAC 437

Query: 409 SSLGAQELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWG 468
           SSL   ELG  +  H  K G+     + SAL   Y   G ++ G  VF     KD++ W 
Sbjct: 438 SSLATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLEDGNLVFRRTPNKDVVSWN 497

Query: 469 AMIKGYGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDK 528
           AMI G   NG G+EAL  F +ML+ G++P+ V F++++SAC+  G  + GW +F+ M D+
Sbjct: 498 AMISGLSHNGQGDEALELFEEMLAEGMEPDDVTFVNIISACSHKGFVERGWFYFNMMSDQ 557

Query: 529 YNVTPTVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGCKS---TDIADS 588
             + P V HYACMVDLL+R G+ +EA EF++   ++    +W  L S CK+    ++   
Sbjct: 558 IGLDPKVDHYACMVDLLSRAGQLKEAKEFIESANIDHGLCLWRILLSACKNHGKCELGVY 617

Query: 589 IVEQLTALKQNNSDFYAML 595
             E+L AL    S  Y  L
Sbjct: 618 AGEKLMALGSRESSTYVQL 632

BLAST of Spg031939 vs. TAIR 10
Match: AT4G04370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 285.8 bits (730), Expect = 8.0e-77
Identity = 170/556 (30.58%), Positives = 294/556 (52.88%), Query Frame = 0

Query: 47  QRENVIAWTSKITNLVRTGQPDSAFGLFKTMFANGHRPNHVTILSVMRAIDASSWESTIE 106
           +  +V+ WT+ I    R G    A  L   M   G +P  VT+L ++  +      + ++
Sbjct: 108 RERDVVHWTAMIGCYSRAGIVGEACSLVNEMRFQGIKPGPVTLLEMLSGVLEI---TQLQ 167

Query: 107 AMHGGVIKMGFESEVAVSTALLGVYSMLD-IGIVWKLFYQIPCKDVVLWSAMISACVKNG 166
            +H   +  GF+ ++AV  ++L +Y   D +G    LF Q+  +D+V W+ MIS     G
Sbjct: 168 CLHDFAVIYGFDCDIAVMNSMLNLYCKCDHVGDAKDLFDQMEQRDMVSWNTMISGYASVG 227

Query: 167 QYIEAFDLIREMQYEGVQPNHVSIVSILPACADSGALC---LGKEIHGFSMRRDFYSLVN 226
              E   L+  M+ +G++P+  +  + L     SG +C   +G+ +H   ++  F   ++
Sbjct: 228 NMSEILKLLYRMRGDGLRPDQQTFGASLSV---SGTMCDLEMGRMLHCQIVKTGFDVDMH 287

Query: 227 IQNSLMDMYSKCRNLEASIRVLKMMRKKDMVSWRTITHACIQNNNPSKAFKIFSRMKFSG 286
           ++ +L+ MY KC   EAS RVL+ +  KD+V W  +    ++     KA  +FS M  SG
Sbjct: 288 LKTALITMYLKCGKEEASYRVLETIPNKDVVCWTVMISGLMRLGRAEKALIVFSEMLQSG 347

Query: 287 FELGETMMLDIIAAVLLVEELLLGLAVHCYALKSGFLCFISVGTELLQMYAKFGDLGLAK 346
            +L    +  ++A+   +    LG +VH Y L+ G+         L+ MYAK G L  + 
Sbjct: 348 SDLSSEAIASVVASCAQLGSFDLGASVHGYVLRHGYTLDTPALNSLITMYAKCGHLDKSL 407

Query: 347 LIFDDLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQ-STNEKANEITLVSVMNACSSL 406
           +IF+ + ++D+++W+A+IS Y+   D   A+  F+ M+  T ++ +  T+VS++ ACSS 
Sbjct: 408 VIFERMNERDLVSWNAIISGYAQNVDLCKALLLFEEMKFKTVQQVDSFTVVSLLQACSSA 467

Query: 407 GAQELGGSIQAHITKSGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDLICWGAMI 466
           GA  +G  I   + +S     + + +ALVD Y   G ++  +  FD IS KD++ WG +I
Sbjct: 468 GALPVGKLIHCIVIRSFIRPCSLVDTALVDMYSKCGYLEAAQRCFDSISWKDVVSWGILI 527

Query: 467 KGYGMNGWGNEALNTFSDMLSYGLKPNGVLFLSLLSACAQCGLEKEGWMWFHSMVDKYNV 526
            GYG +G G+ AL  +S+ L  G++PN V+FL++LS+C+  G+ ++G   F SMV  + V
Sbjct: 528 AGYGFHGKGDIALEIYSEFLHSGMEPNHVIFLAVLSSCSHNGMVQQGLKIFSSMVRDFGV 587

Query: 527 TPTVAHYACMVDLLARKGKTREAVEFVKKILVESDTRIWGALFSGCKS---TDIADSIVE 586
            P   H AC+VDLL R  +  +A +F K+        + G +   C++   T++ D I E
Sbjct: 588 EPNHEHLACVVDLLCRAKRIEDAFKFYKENFTRPSIDVLGIILDACRANGKTEVEDIICE 647

Query: 587 QLTALKQNNSDFYAML 595
            +  LK  ++  Y  L
Sbjct: 648 DMIELKPGDAGHYVKL 657

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022147491.1  2.9e-294  84.72         pentatricopeptide repeat-containing protein DOT4, chloroplastic isoform X1 [Momo...
XP_023538701.1  4.1e-293  84.19         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ...
XP_022974593.1  5.4e-293  84.03         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ...
KAG7028589.1    7.8e-292  83.53         Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb...
XP_022934182.1  2.1e-281  82.99         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ...

Match Name      E-value   Identity (%)  Description
Q9SN39          2.8e-82   31.12         Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9SUH6          4.4e-80   29.68         Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX...
Q3E6Q1          4.9e-79   32.05         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
P93005          1.4e-78   31.84         Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana OX...
Q9XE98          1.1e-75   30.58         Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity (%)  Description
A0A6J1D2H4      1.4e-294  84.72         pentatricopeptide repeat-containing protein DOT4, chloroplastic isoform X1 OS=Mo...
A0A6J1IGR3      2.6e-293  84.03         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit...
A0A6J1F1Z2      1.0e-281  82.99         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit...
A0A6J1D1G5      5.5e-275  80.40         pentatricopeptide repeat-containing protein DOT4, chloroplastic isoform X2 OS=Mo...
A0A1S4DSI8      1.3e-265  80.53         LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g56...

Match Name      E-value   Identity (%)  Description
AT4G18750.1     2.0e-83   31.12         Pentatricopeptide repeat (PPR) superfamily protein
AT4G30700.1     3.1e-81   29.68         Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1     3.5e-80   32.05         Pentatricopeptide repeat (PPR) superfamily protein
AT2G33680.1     1.0e-79   31.84         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G04370.1     8.0e-77   30.58         Tetratricopeptide repeat (TPR)-like superfamily protein

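
The identity and positives percentages reported for each HSP are the matched-residue counts divided by the alignment length. The following is a minimal Python sketch, not part of the database or of any BLAST tool, that parses one HSP statistics line as it appears on this page and recomputes the percentages. The function names `parse_hsp_stats` and `recompute_pct` are hypothetical, and the pattern accepts either spelling of the positives label.

```python
import re

# Pattern for the HSP statistics line as printed on this page.
# "Posi?tives" matches both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/\d+ \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, aligned, identity_pct, positives, positives_pct = m.groups()
    return {
        "identical": int(identical),
        "aligned": int(aligned),
        "positives": int(positives),
        "identity_pct": float(identity_pct),
        "positives_pct": float(positives_pct),
    }


def recompute_pct(count, aligned):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / aligned, 2)


# Statistics line of the AT4G18750.1 hit from the TAIR 10 section.
line = "Identity = 173/556 (31.12%), Positives = 299/556 (53.78%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(recompute_pct(stats["identical"], stats["aligned"]))   # identity percentage
print(recompute_pct(stats["positives"], stats["aligned"]))   # positives percentage
```

The same check can be run over every HSP header to confirm that the Identity column in the summary tables agrees with the detailed alignments.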
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR002885    Pentatricopeptide repeat    PFAM    PF13041    PPR_2
    coord: 149..197; e-value: 3.2E-10; score: 40.0
IPR002885    Pentatricopeptide repeat    PFAM    PF01535    PPR
    coord: 54..81; e-value: 0.56; score: 10.6
    coord: 355..381; e-value: 0.0051; score: 17.0
    coord: 253..283; e-value: 0.001; score: 19.2
    coord: 225..251; e-value: 0.022; score: 15.0
    coord: 456..485; e-value: 4.3E-5; score: 23.5
    coord: 528..552; e-value: 0.39; score: 11.1
    coord: 491..518; e-value: 0.068; score: 13.4
IPR002885    Pentatricopeptide repeat    TIGRFAM    TIGR00756    TIGR00756
    coord: 455..488; e-value: 2.8E-6; score: 25.1
    coord: 253..285; e-value: 2.5E-4; score: 19.0
    coord: 224..253; e-value: 0.0024; score: 15.9
    coord: 152..185; e-value: 6.1E-7; score: 27.2
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 453..487; score: 10.753093
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 50..84; score: 9.88715
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 251..285; score: 10.029647
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 150..184; score: 12.517862
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    1.25.40.10    Tetratricopeptide repeat domain
    coord: 212..300; e-value: 2.1E-12; score: 48.8
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    1.25.40.10    Tetratricopeptide repeat domain
    coord: 468..597; e-value: 1.3E-22; score: 82.5
    coord: 301..413; e-value: 1.9E-12; score: 49.2
    coord: 49..211; e-value: 1.2E-21; score: 79.4
None    No IPR available    PANTHER    PTHR47926    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 70..240
None    No IPR available    PANTHER    PTHR47926:SF86    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN DOT4, CHLOROPLASTIC-LIKE
    coord: 344..596
    coord: 244..354
None    No IPR available    PANTHER    PTHR47926:SF86    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN DOT4, CHLOROPLASTIC-LIKE
    coord: 70..240
None    No IPR available    PANTHER    PTHR47926    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 344..596
    coord: 244..354
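
The `coord:` ranges in the InterPro annotations can be turned into domain lengths with a few lines of code. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the page does not state this), and the helper names `parse_coords` and `span_length` are hypothetical. It uses the four PROSITE PS51375 hits listed above.

```python
import re

# Matches "coord: START..END" as printed in the InterPro annotations.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return all (start, end) pairs found in the text, sorted by start."""
    return sorted((int(start), int(end)) for start, end in COORD.findall(text))


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# The four PROSITE PS51375 (PPR) hits reported for this protein.
prosite_hits = """
coord: 453..487
coord: 50..84
coord: 251..285
coord: 150..184
"""

hits = parse_coords(prosite_hits)
lengths = [span_length(start, end) for start, end in hits]
for (start, end), length in zip(hits, lengths):
    print(start, end, length)
```

Under the inclusive-coordinate assumption, each of the four PROSITE hits spans 35 residues, which is consistent with the roughly 35-residue motif that gives the pentatricopeptide repeat its name.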

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Spg031939.1     Spg031939.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0005515      protein binding