Spg030768 (gene) Sponge gourd (cylindrica) v1

Overview
NameSpg030768
Typegene
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
Locationscaffold11: 27402654 .. 27404651 (+)
RNA-Seq ExpressionSpg030768
SyntenySpg030768
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATCTCCTCCTCTCCACCCACATTCATCGTCTCCTCATTACTCAGAAACCCAATCACACATACCATCGCCACCGACTATTTAATAATCCCCCTCATGTTCGCACCACAACTGCCGAGAATTATGCCAATTTATGTGTAGCCCACCAACCGTTCGACGAAATTCCTATATGGGATACTTTTGCTTGGAACAATCTTATCCAAACCCATCTCACCAATGGAGATGTGGGGCATGTTATTTCAACATATCAACAGATGCTGTTTCGAGGAGTTCGCCCTGACAAACACACCCTTCCTCGAATTATATGCGCTACCCGCCAGTGTGGTGATCTGCAGGTTGGCAAACAGCTCCATGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTCTATGTAATTACCTCCTTGATTGAATTGTATGGGATTCTTGACTGTGCTGACACTGCAAAGTGGCTCCATGACAAGTCGGCTTGCAGAAACTCTGTTTCTTGGACAATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCGTGGACTTGTTTTACCAAATGGTGGAGTTGGCCGCTGATATTGATGCAGTGGCATTGGCCACGGCCATTGGTGCCTGTGGTGCACTCAAATTGCTGCAACACGGAAGAAATATCCACCATATCGCTAGAATTCAAGCCTTGGAATTTGACGTCTTGGTCAGTAATTCCCTATTAAAAATGTACCTTGATTGTGGTAGTATCAAAGATGCTCGGGGATTCTTTGACCGAATGCCGTACAAAGATGTCATTTCGTGGACAGAACTCATCCATGCGTATGTTAAGAAAGGTGGAATTAATGAGGGCTTTAAGCTGTTTCGGCAGATGAATATGGATGGAGGATTGAAGGCTGATCCTCTTACAATTAGCAGCATTCTCCCAGCCTGTGGAAGAATGGCTGCGCATAAGCATGGAAGAGAGATTCATGGATATGTGCTTAAAAATGCTATTAATGAGAATCTCATTGCCCAAAACGCATTGATTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAATTTTCTCGAGGATGAAGGAGAAAGATATGGTTTCGTGGACCGTCATGATCTTGGGCTACAGCTTACATGGCCAAGGAAAACTCGGAGTCAGTTTGTTCCAGGAAATGGAGAGGAACTTGAGGGTGCATAGAGATGAGATCACTTATACTGCAGTTTTGCATGCTTGTAGTACTGCAAGCATGGTAGATGAAGGGGATTTTTACTTCAATTGCATTACTGAACCAACTGTGGCACACTTTGCTTTAAAGGTGGCTCTTTTAGCCCGAGCAGGACGACTGGATGAAGCAAGGACCTTTGTCGAAAAACATAAACTTGACAAACATGCAGAGGTTTTGAGAGCATTACTCAATGGATGCAGGAACCACCATCAAGAAAAATTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTCGAACCTCTAAATGCGGAGAATTACATCCTACTTTCAAATTGGTATGCCTGCAACGAAAAATGGGATATGGTCGAAAGGTTGAGAGAAACCATTAGAGACATGGGATTAAGACCAAAAAAGGCTTACAGTTGGATGGAGTTCCGCAACAAAATTCATGTGTTTGGGACAGGGGATGTATCCCACCCGAGATCACAGAACATATATTGGAATTTACAGTGCCTGATGAAGAAAATGGAAGAAGATGGTTTCAAGCCGAAACCAGATTTTAGATTCCACGACGTCGATGAGGAGCGAGAGTGTGTTCTAATAGGACACAGTGAGCTCTTGGCAATTTCGTTTGGGCTGATTAGTACAGAAGCAGGAAGGAGAATTCGTATTACAAAGAACCTTCGTGTATGCCATAGTTGTCATGAGTCTGCAAAGTTCATATCCAAGATTGTTGGCCGAGAAATCATAGTAAGAGATCCTTATGTTTTCCATCATTTCAAGGATGGCTATTGTTCTTGTGAAGCTTTTTGTTAA

mRNA sequence

ATGGATCTCCTCCTCTCCACCCACATTCATCGTCTCCTCATTACTCAGAAACCCAATCACACATACCATCGCCACCGACTATTTAATAATCCCCCTCATGTTCGCACCACAACTGCCGAGAATTATGCCAATTTATGTGTAGCCCACCAACCGTTCGACGAAATTCCTATATGGGATACTTTTGCTTGGAACAATCTTATCCAAACCCATCTCACCAATGGAGATGTGGGGCATGTTATTTCAACATATCAACAGATGCTGTTTCGAGGAGTTCGCCCTGACAAACACACCCTTCCTCGAATTATATGCGCTACCCGCCAGTGTGGTGATCTGCAGGTTGGCAAACAGCTCCATGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTCTATGTAATTACCTCCTTGATTGAATTGTATGGGATTCTTGACTGTGCTGACACTGCAAAGTGGCTCCATGACAAGTCGGCTTGCAGAAACTCTGTTTCTTGGACAATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCGTGGACTTGTTTTACCAAATGGTGGAGTTGGCCGCTGATATTGATGCAGTGGCATTGGCCACGGCCATTGGTGCCTGTGGTGCACTCAAATTGCTGCAACACGGAAGAAATATCCACCATATCGCTAGAATTCAAGCCTTGGAATTTGACGTCTTGGTCAGTAATTCCCTATTAAAAATGTACCTTGATTGTGGTAGTATCAAAGATGCTCGGGGATTCTTTGACCGAATGCCGTACAAAGATGTCATTTCGTGGACAGAACTCATCCATGCGTATGTTAAGAAAGGTGGAATTAATGAGGGCTTTAAGCTGTTTCGGCAGATGAATATGGATGGAGGATTGAAGGCTGATCCTCTTACAATTAGCAGCATTCTCCCAGCCTGTGGAAGAATGGCTGCGCATAAGCATGGAAGAGAGATTCATGGATATGTGCTTAAAAATGCTATTAATGAGAATCTCATTGCCCAAAACGCATTGATTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAATTTTCTCGAGGATGAAGGAGAAAGATATGGTTTCGTGGACCGTCATGATCTTGGGCTACAGCTTACATGGCCAAGGAAAACTCGGAGTCAGTTTGTTCCAGGAAATGGAGAGGAACTTGAGGGTGCATAGAGATGAGATCACTTATACTGCAGTTTTGCATGCTTGTAGTACTGCAAGCATGGTAGATGAAGGGGATTTTTACTTCAATTGCATTACTGAACCAACTGTGGCACACTTTGCTTTAAAGGTGGCTCTTTTAGCCCGAGCAGGACGACTGGATGAAGCAAGGACCTTTGTCGAAAAACATAAACTTGACAAACATGCAGAGGTTTTGAGAGCATTACTCAATGGATGCAGGAACCACCATCAAGAAAAATTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTCGAACCTCTAAATGCGGAGAATTACATCCTACTTTCAAATTGGTATGCCTGCAACGAAAAATGGGATATGGTCGAAAGGTTGAGAGAAACCATTAGAGACATGGGATTAAGACCAAAAAAGGCTTACAGTTGGATGGAGTTCCGCAACAAAATTCATGTGTTTGGGACAGGGGATGTATCCCACCCGAGATCACAGAACATATATTGGAATTTACAGTGCCTGATGAAGAAAATGGAAGAAGATGGTTTCAAGCCGAAACCAGATTTTAGATTCCACGACGTCGATGAGGAGCGAGAGTGTGTTCTAATAGGACACAGTGAGCTCTTGGCAATTTCGTTTGGGCTGATTAGTACAGAAGCAGGAAGGAGAATTCGTATTACAAAGAACCTTCGTGTATGCCATAGTTGTCATGAGTCTGCAAAGTTCATATCCAAGATTGTTGGCCGAGAAATCATAGTAAGAGATCCTTATGTTTTCCATCATTTCAAGGATGGCTATTGTTCTTGTGAAGCTTTTTGTTAA

Coding sequence (CDS)

ATGGATCTCCTCCTCTCCACCCACATTCATCGTCTCCTCATTACTCAGAAACCCAATCACACATACCATCGCCACCGACTATTTAATAATCCCCCTCATGTTCGCACCACAACTGCCGAGAATTATGCCAATTTATGTGTAGCCCACCAACCGTTCGACGAAATTCCTATATGGGATACTTTTGCTTGGAACAATCTTATCCAAACCCATCTCACCAATGGAGATGTGGGGCATGTTATTTCAACATATCAACAGATGCTGTTTCGAGGAGTTCGCCCTGACAAACACACCCTTCCTCGAATTATATGCGCTACCCGCCAGTGTGGTGATCTGCAGGTTGGCAAACAGCTCCATGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTCTATGTAATTACCTCCTTGATTGAATTGTATGGGATTCTTGACTGTGCTGACACTGCAAAGTGGCTCCATGACAAGTCGGCTTGCAGAAACTCTGTTTCTTGGACAATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCGTGGACTTGTTTTACCAAATGGTGGAGTTGGCCGCTGATATTGATGCAGTGGCATTGGCCACGGCCATTGGTGCCTGTGGTGCACTCAAATTGCTGCAACACGGAAGAAATATCCACCATATCGCTAGAATTCAAGCCTTGGAATTTGACGTCTTGGTCAGTAATTCCCTATTAAAAATGTACCTTGATTGTGGTAGTATCAAAGATGCTCGGGGATTCTTTGACCGAATGCCGTACAAAGATGTCATTTCGTGGACAGAACTCATCCATGCGTATGTTAAGAAAGGTGGAATTAATGAGGGCTTTAAGCTGTTTCGGCAGATGAATATGGATGGAGGATTGAAGGCTGATCCTCTTACAATTAGCAGCATTCTCCCAGCCTGTGGAAGAATGGCTGCGCATAAGCATGGAAGAGAGATTCATGGATATGTGCTTAAAAATGCTATTAATGAGAATCTCATTGCCCAAAACGCATTGATTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAATTTTCTCGAGGATGAAGGAGAAAGATATGGTTTCGTGGACCGTCATGATCTTGGGCTACAGCTTACATGGCCAAGGAAAACTCGGAGTCAGTTTGTTCCAGGAAATGGAGAGGAACTTGAGGGTGCATAGAGATGAGATCACTTATACTGCAGTTTTGCATGCTTGTAGTACTGCAAGCATGGTAGATGAAGGGGATTTTTACTTCAATTGCATTACTGAACCAACTGTGGCACACTTTGCTTTAAAGGTGGCTCTTTTAGCCCGAGCAGGACGACTGGATGAAGCAAGGACCTTTGTCGAAAAACATAAACTTGACAAACATGCAGAGGTTTTGAGAGCATTACTCAATGGATGCAGGAACCACCATCAAGAAAAATTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTCGAACCTCTAAATGCGGAGAATTACATCCTACTTTCAAATTGGTATGCCTGCAACGAAAAATGGGATATGGTCGAAAGGTTGAGAGAAACCATTAGAGACATGGGATTAAGACCAAAAAAGGCTTACAGTTGGATGGAGTTCCGCAACAAAATTCATGTGTTTGGGACAGGGGATGTATCCCACCCGAGATCACAGAACATATATTGGAATTTACAGTGCCTGATGAAGAAAATGGAAGAAGATGGTTTCAAGCCGAAACCAGATTTTAGATTCCACGACGTCGATGAGGAGCGAGAGTGTGTTCTAATAGGACACAGTGAGCTCTTGGCAATTTCGTTTGGGCTGATTAGTACAGAAGCAGGAAGGAGAATTCGTATTACAAAGAACCTTCGTGTATGCCATAGTTGTCATGAGTCTGCAAAGTTCATATCCAAGATTGTTGGCCGAGAAATCATAGTAAGAGATCCTTATGTTTTCCATCATTTCAAGGATGGCTATTGTTCTTGTGAAGCTTTTTGTTAA

Protein sequence

MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDTFAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCSCEAFC
Homology
BLAST of Spg030768 vs. NCBI nr
Match: XP_004137884.2 (pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus])

HSP 1 Score: 1220.7 bits (3157), Expect = 0.0e+00
Identity = 580/665 (87.22%), Postives = 617/665 (92.78%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           M+LLLSTH H L ITQKPNH YHRH  FNN PHVRT T ENYANLCVAHQ FD+IPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHVRTMTVENYANLCVAHQVFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD+GHVISTY+QMLFRGVRPDKHTLPRIICATRQ GDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLGFSSNLYV+TSLIELYGILD ADTAKWLHDKS CRNSVSWT+LAKLYL EDKPS A
Sbjct: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTVLAKLYLREDKPSLA 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELA DIDAVALATAIGACGALK+L HGRNIHH+AR+  LEF++LVSNSLLKM
Sbjct: 181 LDLFYQMVELADDIDAVALATAIGACGALKMLHHGRNIHHLARVHGLEFNILVSNSLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           Y+DC SIKDARGFFD+MP KD+ISWTELIH YVKKGGINE FKLFRQMNMDG LK DP T
Sbjct: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGRMAAHKHG+EIHGYV+KNA +ENLI QNAL+DMYVKSGCIQSASK FS MK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKDMVSW++M LGYSLHGQGKLGVSLF+EME+N ++ RDEITYTAVLHAC+TA+MVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMRRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
            YF+CIT+PTVAH ALKVALLARAGRLDEARTFVEK KLDKH E+LRALL+GCRNH Q+K
Sbjct: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFVEKKKLDKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENYILLSNWYACNEKWDMVE+LRETIRDMGLRPKKAYSW+EF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMK+MEEDG KP PDF  HDVDEERECV IGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKEMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR IRITKNLRVCHSCHESAKFISK+VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CENFC 665

BLAST of Spg030768 vs. NCBI nr
Match: KAG6597728.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1216.4 bits (3146), Expect = 0.0e+00
Identity = 577/665 (86.77%), Postives = 615/665 (92.48%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           MDLLLST IHRL +TQKPNHTYHRHRLFNNPPHVRTTTAE  A+LCVAHQ FD+IPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR+ICA+R  GDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELAADIDAVALATAIGACGA KLLQHGRNIHH+ARI  LEFDVLVSN LLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           YLDC SIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGR+AAHKHGREIHGYVLKN  ++NLI QNAL+DMYVKSGCIQSA KIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKD+VSWTVMI GYSLHGQGKLGV LF+EM+RN RVHRDEITYTAVL ACSTASMV+EGD
Sbjct: 361 EKDVVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
           FYFNCITEPT+AHF LKVALL RAGR DEARTFV+KHKLDK++E+LRALL+GCR HHQ+K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENY+LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC LIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR IRI KNLRVCHSCHESAKFISK VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISKKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CEDFC 665

BLAST of Spg030768 vs. NCBI nr
Match: KAG7029175.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1216.1 bits (3145), Expect = 0.0e+00
Identity = 577/665 (86.77%), Postives = 614/665 (92.33%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           MDLLLST IHRL +TQKPNHTYHRHRLFNNPPHVRTTTAE  A+LCVAHQ FD+IPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR+ICA+R  GDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELAADIDAVALATAIGACGA KLLQHGRNIHH+ARI  LEFDVLVSN LLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           YLDC SIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGR+AAHKHGREIHGYVLKN  ++NLI QNAL+DMYVKSGCIQSA KIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKDMVSWTVMI GYSLHGQGKLGV LF+EM+RN RVHRDEITYTAVL ACSTASMV+EGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
           FYFNCITEPT+AHF LKVALL RAGR DEARTFV+KHKLDK++E+LRALL+GCR HHQ+K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENY+LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC LIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR IRI KNLRVCHSCHESAKFIS  VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CEDFC 665

BLAST of Spg030768 vs. NCBI nr
Match: XP_022158739.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Momordica charantia])

HSP 1 Score: 1216.1 bits (3145), Expect = 0.0e+00
Identity = 585/665 (87.97%), Postives = 614/665 (92.33%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           MDLLLSTH  RL IT K + TY R R FNNPPHVRT   ENYANLC AH PFDEIP WDT
Sbjct: 1   MDLLLSTHFRRLPITPKTDLTYRRRRPFNNPPHVRTAITENYANLCEAHHPFDEIPTWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDVG VISTY+QML RGVRPD HTLPRII A+RQCGDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDVGLVISTYEQMLLRGVRPDNHTLPRIIGASRQCGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
            FKLGFSSNLYVITSLIELYGILD ADTAKWLHDKSACRNSVSWTMLAKLY+MEDKPSFA
Sbjct: 121 VFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTMLAKLYVMEDKPSFA 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELAADIDAVALATAIGACG+LKLLQHGRNIH +AR   LEFDVLVSNSLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGSLKLLQHGRNIHLLARTHGLEFDVLVSNSLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           YLDCGSI+DARGFF+RMP KDVISWTELI AYVKKGGINEGFKLFRQMNMDGGLK DP+T
Sbjct: 241 YLDCGSIRDARGFFNRMPSKDVISWTELIQAYVKKGGINEGFKLFRQMNMDGGLKPDPIT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGRMAAHKHGREIHGYVLK+AI+ NLI QNAL+DMYVKSGCIQSA KIFSRMK
Sbjct: 301 ISSILPACGRMAAHKHGREIHGYVLKSAIDVNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKD +SWTVMILGYSLHGQGKLGVSLF+ MERNLR+HRDEITYT+VLHACSTAS+V+EGD
Sbjct: 361 EKDAISWTVMILGYSLHGQGKLGVSLFRLMERNLRMHRDEITYTSVLHACSTASLVEEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
           FYFNCI EPT +HFALKVALLARAGRLDEAR FVE+HKLDKH E+LRALL+GCR H  +K
Sbjct: 421 FYFNCIMEPTFSHFALKVALLARAGRLDEARAFVEQHKLDKHPEILRALLDGCRTHRDKK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENYILLSNWYACN K DMVE+ RE +RDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNGKLDMVEKSREIVRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNL+CLMKKME+DG KPKPDF FHDVDEERECVLIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLECLMKKMEDDGLKPKPDFSFHDVDEERECVLIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR I ITKNLRVCHSCHESAKFISKIVGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTICITKNLRVCHSCHESAKFISKIVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CEDFC 665

BLAST of Spg030768 vs. NCBI nr
Match: XP_023539701.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1213.7 bits (3139), Expect = 0.0e+00
Identity = 575/665 (86.47%), Postives = 615/665 (92.48%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           M+LLLST IHRL +TQKPNHTYHRHRLFNNPPHVRTTTAE  A+LCVAHQ FD+IPIWDT
Sbjct: 1   MNLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR+ICA+R  GDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELAADIDAVALATA+GACGA KLLQHGRNIHH+ARI  LEFDVLVSN LLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATALGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           YLDC SIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGR+AAHKHGREIHGYVLKN  ++NLI QNAL+DMYVKSGCIQSA KIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKDMVSWTVMI GYSLHGQGKLGV LF+EM+RN RVHRDEITYTAVL ACSTASMV+EGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
           FYFNCITEPT+AHF LKVALL RAGR DEARTFV+KHKLDK++E+LRALL+GCR HHQ+K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENY+LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC LIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR IRI+KNLRVCHSCHESAKFIS  VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CEDFC 665

BLAST of Spg030768 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 7.5e-129
Identity = 237/623 (38.04%), Postives = 357/623 (57.30%), Query Frame = 0

Query: 48  AHQPFDEIPIWDTFAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQ 107
           A + FDE+   D  +WN++I  +++NG     +S + QML  G+  D  T+  +      
Sbjct: 249 ARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCAD 308

Query: 108 CGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTML 167
              + +G+ +H+   K  FS       +L+++Y      D+AK +  + + R+ VS+T +
Sbjct: 309 SRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSM 368

Query: 168 AKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQAL 227
              Y  E     AV LF +M E     D   +   +  C   +LL  G+ +H   +   L
Sbjct: 369 IAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDL 428

Query: 228 EFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQ 287
            FD+ VSN+L+ MY  CGS+++A   F  M  KD+ISW  +I  Y K    NE   LF  
Sbjct: 429 GFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNL 488

Query: 288 MNMDGGLKADPLTISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSG 347
           +  +     D  T++ +LPAC  ++A   GREIHGY+++N    +    N+L+DMY K G
Sbjct: 489 LLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCG 548

Query: 348 CIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVL 407
            +  A  +F  +  KD+VSWTVMI GY +HG GK  ++LF +M R   +  DEI++ ++L
Sbjct: 549 ALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM-RQAGIEADEISFVSLL 608

Query: 408 HACSTASMVDEGDFYFN-----CITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKH 467
           +ACS + +VDEG  +FN     C  EPTV H+A  V +LAR G L +A  F+E   +   
Sbjct: 609 YACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPD 668

Query: 468 AEVLRALLNGCRNHHQEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRET 527
           A +  ALL GCR HH  KL +++ E++ + EP N   Y+L++N YA  EKW+ V+RLR+ 
Sbjct: 669 ATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKR 728

Query: 528 IRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFR 587
           I   GLR     SW+E + ++++F  GD S+P ++NI   L+ +  +M E+G+ P   + 
Sbjct: 729 IGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYA 788

Query: 588 FHDVDE-ERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR 647
             D +E E+E  L GHSE LA++ G+IS+  G+ IR+TKNLRVC  CHE AKF+SK+  R
Sbjct: 789 LIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRR 848

Query: 648 EIIVRDPYVFHHFKDGYCSCEAF 665
           EI++RD   FH FKDG+CSC  F
Sbjct: 849 EIVLRDSNRFHQFKDGHCSCRGF 870

BLAST of Spg030768 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 4.5e-118
Identity = 222/630 (35.24%), Postives = 351/630 (55.71%), Query Frame = 0

Query: 39  AENYANLCVAHQPFDEIPIWDTFAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTL 98
           + ++ ++  A Q FD++P    F WN +I+ +  N      +  Y  M    V PD  T 
Sbjct: 63  SSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTF 122

Query: 99  PRIICATRQCGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSAC 158
           P ++ A      LQ+G+ +HAQ F+LGF ++++V   LI LY       +A+ + +    
Sbjct: 123 PHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPL 182

Query: 159 --RNSVSWTMLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGR 218
             R  VSWT +   Y    +P  A+++F QM ++    D VAL + + A   L+ L+ GR
Sbjct: 183 PERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGR 242

Query: 219 NIHHIARIQALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKG 278
           +IH       LE +  +  SL  MY  CG +  A+  FD+M   ++I W  +I  Y K G
Sbjct: 243 SIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNG 302

Query: 279 GINEGFKLFRQMNMDGGLKADPLTISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQ 338
              E   +F +M ++  ++ D ++I+S + AC ++ + +  R ++ YV ++   +++   
Sbjct: 303 YAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFIS 362

Query: 339 NALIDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRV 398
           +ALIDM+ K G ++ A  +F R  ++D+V W+ MI+GY LHG+ +  +SL++ MER   V
Sbjct: 363 SALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERG-GV 422

Query: 399 HRDEITYTAVLHACSTASMVDEGDFYFNCITE----PTVAHFALKVALLARAGRLDEART 458
           H +++T+  +L AC+ + MV EG ++FN + +    P   H+A  + LL RAG LD+A  
Sbjct: 423 HPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYE 482

Query: 459 FVEKHKLDKHAEVLRALLNGCRNHHQEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEK 518
            ++   +     V  ALL+ C+ H   +LG+   +QL   +P N  +Y+ LSN YA    
Sbjct: 483 VIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARL 542

Query: 519 WDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEE 578
           WD V  +R  +++ GL      SW+E R ++  F  GD SHPR + I   ++ +  +++E
Sbjct: 543 WDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKE 602

Query: 579 DGFKPKPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHES 638
            GF    D   HD+ DEE E  L  HSE +AI++GLIST  G  +RITKNLR C +CH +
Sbjct: 603 GGFVANKDASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAA 662

Query: 639 AKFISKIVGREIIVRDPYVFHHFKDGYCSC 662
            K ISK+V REI+VRD   FHHFKDG CSC
Sbjct: 663 TKLISKLVDREIVVRDTNRFHHFKDGVCSC 690

BLAST of Spg030768 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 1.2e-107
Identity = 223/665 (33.53%), Postives = 345/665 (51.88%), Query Frame = 0

Query: 42  YANLCVAHQP---FDEIPIWDTFAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTL 101
           Y NL + H+    F  +      AW ++I+           ++++ +M   G  PD +  
Sbjct: 49  YTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLFSKALASFVEMRASGRCPDHNVF 108

Query: 102 PRIICATRQCGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGIL--------------- 161
           P ++ +     DL+ G+ +H    +LG   +LY   +L+ +Y  L               
Sbjct: 109 PSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVGNVFDE 168

Query: 162 ----------------DC-----ADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAVD 221
                            C      D+ + + +    ++ VS+  +   Y        A+ 
Sbjct: 169 MPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALR 228

Query: 222 LFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKMYL 281
           +  +M       D+  L++ +        +  G+ IH     + ++ DV + +SL+ MY 
Sbjct: 229 MVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYA 288

Query: 282 DCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTIS 341
               I+D+   F R+  +D ISW  L+  YV+ G  NE  +LFRQM +   +K   +  S
Sbjct: 289 KSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQM-VTAKVKPGAVAFS 348

Query: 342 SILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMKEK 401
           S++PAC  +A    G+++HGYVL+     N+   +AL+DMY K G I++A KIF RM   
Sbjct: 349 SVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVL 408

Query: 402 DMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGDFY 461
           D VSWT +I+G++LHG G   VSLF+EM+R   V  +++ + AVL ACS   +VDE   Y
Sbjct: 409 DEVSWTAIIMGHALHGHGHEAVSLFEEMKRQ-GVKPNQVAFVAVLTACSHVGLVDEAWGY 468

Query: 462 FNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHH 521
           FN +T+       + H+A    LL RAG+L+EA  F+ K  ++    V   LL+ C  H 
Sbjct: 469 FNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHK 528

Query: 522 QEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWM 581
             +L +++ E++   +  N   Y+L+ N YA N +W  + +LR  +R  GLR K A SW+
Sbjct: 529 NLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWI 588

Query: 582 EFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIG 641
           E +NK H F +GD SHP    I   L+ +M++ME++G+        HDVDEE +  +L G
Sbjct: 589 EMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFG 648

Query: 642 HSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKD 662
           HSE LA++FG+I+TE G  IR+TKN+R+C  CH + KFISKI  REIIVRD   FHHF  
Sbjct: 649 HSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNR 708

BLAST of Spg030768 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 8.0e-107
Identity = 233/731 (31.87%), Postives = 369/731 (50.48%), Query Frame = 0

Query: 14  ITQKPNHTYHRHRLFNNPPHVRTTTAEN--------YANLCVAHQPFDEIPIWDTFAWNN 73
           +  K  +  H  +LF+  P +RT  + N          ++    + FD++P  D+ +W  
Sbjct: 58  VYSKTGYALHARKLFDEMP-LRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTT 117

Query: 74  LIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQAFKLG 133
           +I  +   G     I     M+  G+ P + TL  ++ +      ++ GK++H+   KLG
Sbjct: 118 MIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLG 177

Query: 134 FSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAVDLFY 193
              N+ V  SL+ +Y        AK++ D+   R+  SW  +  L++   +   A+  F 
Sbjct: 178 LRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFE 237

Query: 194 QMVE--------------------LAADI------------DAVALATAIGACGALKLLQ 253
           QM E                     A DI            D   LA+ + AC  L+ L 
Sbjct: 238 QMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLC 297

Query: 254 HGRNIHHIARIQALEFDVLVSNSLLKMYLDCGSIKDAR--------------GF------ 313
            G+ IH        +   +V N+L+ MY  CG ++ AR              GF      
Sbjct: 298 IGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDG 357

Query: 314 -------------FDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 373
                        F  +  +DV++WT +I  Y + G   E   LFR M + GG + +  T
Sbjct: 358 YIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSM-VGGGQRPNSYT 417

Query: 374 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 433
           ++++L     +A+  HG++IHG  +K+    ++   NALI MY K+G I SAS+ F  ++
Sbjct: 418 LAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIR 477

Query: 434 -EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEG 493
            E+D VSWT MI+  + HG  +  + LF+ M     +  D ITY  V  AC+ A +V++G
Sbjct: 478 CERDTVSWTSMIIALAQHGHAEEALELFETMLME-GLRPDHITYVGVFSACTHAGLVNQG 537

Query: 494 DFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCR 553
             YF+ + +     PT++H+A  V L  RAG L EA+ F+EK  ++       +LL+ CR
Sbjct: 538 RQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACR 597

Query: 554 NHHQEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAY 613
            H    LGK   E+L   EP N+  Y  L+N Y+   KW+   ++R++++D  ++ ++ +
Sbjct: 598 VHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGF 657

Query: 614 SWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECV 665
           SW+E ++K+HVFG  D +HP    IY  ++ +  ++++ G+ P      HD++EE +E +
Sbjct: 658 SWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQI 717

BLAST of Spg030768 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 8.9e-106
Identity = 229/700 (32.71%), Postives = 362/700 (51.71%), Query Frame = 0

Query: 4   LLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDTFAW 63
           L  TH H ++ T   +  Y   +LF            ++A+L  A + FDEIP  ++FAW
Sbjct: 46  LKQTHGH-MIRTGTFSDPYSASKLF------AMAALSSFASLEYARKVFDEIPKPNSFAW 105

Query: 64  NNLIQTHLTNGDVGHVISTYQQMLFRG-VRPDKHTLPRIICATRQCGDLQVGKQLHAQAF 123
           N LI+ + +  D    I  +  M+      P+K+T P +I A  +   L +G+ LH  A 
Sbjct: 106 NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 165

Query: 124 KLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAVD 183
           K    S+++V  SLI  Y      D+A  +      ++ VSW  +   ++ +  P  A++
Sbjct: 166 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 225

Query: 184 LFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKMYL 243
           LF +M         V +   + AC  ++ L+ GR +        +  ++ ++N++L MY 
Sbjct: 226 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYT 285

Query: 244 DCGSIKDARGFFD-------------------------------RMPYKDVISWTELIHA 303
            CGSI+DA+  FD                                MP KD+++W  LI A
Sbjct: 286 KCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISA 345

Query: 304 YVKKGGINEGFKLFRQMNMDGGLKADPLTISSILPACGRMAAHKHGREIHGYVLKNAINE 363
           Y + G  NE   +F ++ +   +K + +T+ S L AC ++ A + GR IH Y+ K+ I  
Sbjct: 346 YEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRM 405

Query: 364 NLIAQNALIDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEME 423
           N    +ALI MY K G ++ + ++F+ ++++D+  W+ MI G ++HG G   V +F +M+
Sbjct: 406 NFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQ 465

Query: 424 RNLRVHRDEITYTAVLHACSTASMVDEGDFYFNCITE-----PTVAHFALKVALLARAGR 483
               V  + +T+T V  ACS   +VDE +  F+ +       P   H+A  V +L R+G 
Sbjct: 466 -EANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGY 525

Query: 484 LDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEKLGKRIIEQLCDFEPLNAENYILLSNW 543
           L++A  F+E   +     V  ALL  C+ H    L +    +L + EP N   ++LLSN 
Sbjct: 526 LEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNI 585

Query: 544 YACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCL 603
           YA   KW+ V  LR+ +R  GL+ +   S +E    IH F +GD +HP S+ +Y  L  +
Sbjct: 586 YAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEV 645

Query: 604 MKKMEEDGFKPKPDFRFHDVDEE--RECVLIGHSELLAISFGLISTEAGRRIRITKNLRV 663
           M+K++ +G++P+       ++EE  +E  L  HSE LAI +GLISTEA + IR+ KNLRV
Sbjct: 646 MEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRV 705

Query: 664 CHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCSCEAF 665
           C  CH  AK IS++  REIIVRD Y FHHF++G CSC  F
Sbjct: 706 CGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDF 737

BLAST of Spg030768 vs. ExPASy TrEMBL
Match: A0A6J1E0A4 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111025202 PE=3 SV=1)

HSP 1 Score: 1216.1 bits (3145), Expect = 0.0e+00
Identity = 585/665 (87.97%), Postives = 614/665 (92.33%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           MDLLLSTH  RL IT K + TY R R FNNPPHVRT   ENYANLC AH PFDEIP WDT
Sbjct: 1   MDLLLSTHFRRLPITPKTDLTYRRRRPFNNPPHVRTAITENYANLCEAHHPFDEIPTWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDVG VISTY+QML RGVRPD HTLPRII A+RQCGDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDVGLVISTYEQMLLRGVRPDNHTLPRIIGASRQCGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
            FKLGFSSNLYVITSLIELYGILD ADTAKWLHDKSACRNSVSWTMLAKLY+MEDKPSFA
Sbjct: 121 VFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTMLAKLYVMEDKPSFA 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELAADIDAVALATAIGACG+LKLLQHGRNIH +AR   LEFDVLVSNSLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGSLKLLQHGRNIHLLARTHGLEFDVLVSNSLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           YLDCGSI+DARGFF+RMP KDVISWTELI AYVKKGGINEGFKLFRQMNMDGGLK DP+T
Sbjct: 241 YLDCGSIRDARGFFNRMPSKDVISWTELIQAYVKKGGINEGFKLFRQMNMDGGLKPDPIT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGRMAAHKHGREIHGYVLK+AI+ NLI QNAL+DMYVKSGCIQSA KIFSRMK
Sbjct: 301 ISSILPACGRMAAHKHGREIHGYVLKSAIDVNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKD +SWTVMILGYSLHGQGKLGVSLF+ MERNLR+HRDEITYT+VLHACSTAS+V+EGD
Sbjct: 361 EKDAISWTVMILGYSLHGQGKLGVSLFRLMERNLRMHRDEITYTSVLHACSTASLVEEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
           FYFNCI EPT +HFALKVALLARAGRLDEAR FVE+HKLDKH E+LRALL+GCR H  +K
Sbjct: 421 FYFNCIMEPTFSHFALKVALLARAGRLDEARAFVEQHKLDKHPEILRALLDGCRTHRDKK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENYILLSNWYACN K DMVE+ RE +RDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNGKLDMVEKSREIVRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNL+CLMKKME+DG KPKPDF FHDVDEERECVLIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLECLMKKMEDDGLKPKPDFSFHDVDEERECVLIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR I ITKNLRVCHSCHESAKFISKIVGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTICITKNLRVCHSCHESAKFISKIVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CEDFC 665

BLAST of Spg030768 vs. ExPASy TrEMBL
Match: A0A6J1I9E1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111470851 PE=3 SV=1)

HSP 1 Score: 1211.1 bits (3132), Expect = 0.0e+00
Identity = 575/665 (86.47%), Postives = 613/665 (92.18%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           MDLLLST IHRL +TQKPNHTYHRHRLFNNPPHVRTTTAE  A+LCVAHQ FD+IPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR+ICA+R  GDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELAADIDAVALATAIGACGA KLLQHGRNIHH+ARI  LEFDVLVSN LLKM
Sbjct: 181 LDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           YLDC SIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGR+AAHKHGREIHGYVLKN  ++NLI QNAL+DMYVKSGCIQSA KIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNDFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKDMVSWTVMI GYSLHGQGKLGV LF+EM+RN RVHRDEITYTAVL +CSTASMV+EGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQSCSTASMVEEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
           FYFNCITEPT+AHF LKVALL RAGR DEARTFV+KHKLDK++E+LRALL+GCR HHQ K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQHK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENY+LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC  IGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECAPIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR IRI+KNLRVCHSCHESAKFIS  VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CEDFC 665

BLAST of Spg030768 vs. ExPASy TrEMBL
Match: A0A1S3CPR5 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103502829 PE=3 SV=1)

HSP 1 Score: 1209.1 bits (3127), Expect = 0.0e+00
Identity = 578/665 (86.92%), Postives = 613/665 (92.18%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           M+LLLSTH H L ITQKP H YHRH  FNN PHVRTTT ENYA+LCVAHQ FDEIPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPYHAYHRHPPFNNLPHVRTTTVENYADLCVAHQVFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD GHVIS Y+QMLFRGVRPDKHTLPRIICATRQ GDL VGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDWGHVISIYRQMLFRGVRPDKHTLPRIICATRQYGDLPVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLGFSS+LYV+TSLIELYGILD ADTAKWLHDKS CRNSVSWT+LAKLYL EDKPSFA
Sbjct: 121 AFKLGFSSDLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELA DID+VALATAIGACGALK+L HGRNIHH+ARI  LEF++LVSNSLLKM
Sbjct: 181 IDLFYQMVELADDIDSVALATAIGACGALKMLHHGRNIHHLARIHGLEFNILVSNSLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           YLDC SIKDARGFFD+MP KDVISWTELIH YVKKGGINE FKLFRQMNMDG LK DPLT
Sbjct: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGRMAAHKHG+EIHGYVLKN  +ENLI QNAL+DMYVKSGCIQSASK FS MK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNGFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKDMVSW++M LGYSLHGQGKLGV LF+EME+NL++HRDEITYTAVLHAC+TA+MVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVGLFREMEKNLKMHRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
           FYF+ IT+PTVAH ALKVALLARAGRLDEARTFVEK KL+KH E+LRALL+GCRNH Q+K
Sbjct: 421 FYFSRITKPTVAHIALKVALLARAGRLDEARTFVEKKKLNKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLN ENYILLSNWYACN+KWDMVE LRETIRDMGLRPKKAYSW+EF 
Sbjct: 481 LGKRIIEQLCDLEPLNTENYILLSNWYACNKKWDMVEELRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDG K  P+F  HDVDEERECV IGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKTNPEFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR IRITKNLRVCHSCHESAKFISK+VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CENFC 665

BLAST of Spg030768 vs. ExPASy TrEMBL
Match: A0A6J1EXC6 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111439085 PE=3 SV=1)

HSP 1 Score: 1206.4 bits (3120), Expect = 0.0e+00
Identity = 572/665 (86.02%), Postives = 611/665 (91.88%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           MDLLLST IHRL +TQKPNHTY RHRLFNNPPHVRTTTAE  A+LCVAHQ FD+IPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR+ICA+R  GDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELAADIDAVALATAIGACGA KLLQHGRNIHH+ARI  LEFD+LVSN LLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           YLDCGSIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Sbjct: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGR+ AHKHGREIHGYVLKN  ++NLI QNAL+DMYVKSGCIQSA KIFSRMK
Sbjct: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKDMVSWTV+I GYSLHGQGKLGV LF+EM+RN  VHRDEITYTAVL ACSTASMV+EGD
Sbjct: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
           FYFNCITEPT+AHF LKVALL RAGR +EARTFV+KHKLDK+ E+LRALL+GCR HHQ+K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENY+LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC LIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKDGYCS 660
           LAISFGLISTEAGR IRI KNLRVCHSCHESAKFIS  VGREIIV+DPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEAFC 666
           CE FC
Sbjct: 661 CEDFC 665

BLAST of Spg030768 vs. ExPASy TrEMBL
Match: A0A0A0L9N4 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G722890 PE=3 SV=1)

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 546/641 (85.18%), Postives = 587/641 (91.58%), Query Frame = 0

Query: 1   MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCVAHQPFDEIPIWDT 60
           M+LLLSTH H L ITQKPNH YHRH  FNN PHVRT T ENYANLCVAHQ FD+IPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHVRTMTVENYANLCVAHQVFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD+GHVISTY+QMLFRGVRPDKHTLPRIICATRQ GDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFA 180
           AFKLGFSSNLYV+TSLIELYGILD ADTAKWLHDKS CRNSVSWT+LAKLYL EDKPS A
Sbjct: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTVLAKLYLREDKPSLA 180

Query: 181 VDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKM 240
           +DLFYQMVELA DIDAVALATAIGACGALK+L HGRNIHH+AR+  LEF++LVSNSLLKM
Sbjct: 181 LDLFYQMVELADDIDAVALATAIGACGALKMLHHGRNIHHLARVHGLEFNILVSNSLLKM 240

Query: 241 YLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 300
           Y+DC SIKDARGFFD+MP KD+ISWTELIH YVKKGGINE FKLFRQMNMDG LK DP T
Sbjct: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300

Query: 301 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 360
           ISSILPACGRMAAHKHG+EIHGYV+KNA +ENLI QNAL+DMYVKSGCIQSASK FS MK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGD 420
           EKDMVSW++M LGYSLHGQGKLGVSLF+EME+N ++ RDEITYTAVLHAC+TA+MVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMRRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHHQEK 480
            YF+CIT+PTVAH ALKVALLARAGRLDEARTFVEK KLDKH E+LRALL+GCRNH Q+K
Sbjct: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFVEKKKLDKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCD EPLNAENYILLSNWYACNEKWDMVE+LRETIRDMGLRPKKAYSW+EF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMK+MEEDG KP PDF  HDVDEERECV IGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKEMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR 642
           LAISFGLISTEAGR IRITKNLR+  +  ++  F++ I GR
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRMVAALVKT--FVNFIDGR 639

BLAST of Spg030768 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 462.6 bits (1189), Expect = 5.3e-130
Identity = 237/623 (38.04%), Postives = 357/623 (57.30%), Query Frame = 0

Query: 48  AHQPFDEIPIWDTFAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQ 107
           A + FDE+   D  +WN++I  +++NG     +S + QML  G+  D  T+  +      
Sbjct: 249 ARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCAD 308

Query: 108 CGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTML 167
              + +G+ +H+   K  FS       +L+++Y      D+AK +  + + R+ VS+T +
Sbjct: 309 SRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSM 368

Query: 168 AKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQAL 227
              Y  E     AV LF +M E     D   +   +  C   +LL  G+ +H   +   L
Sbjct: 369 IAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDL 428

Query: 228 EFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQ 287
            FD+ VSN+L+ MY  CGS+++A   F  M  KD+ISW  +I  Y K    NE   LF  
Sbjct: 429 GFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNL 488

Query: 288 MNMDGGLKADPLTISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSG 347
           +  +     D  T++ +LPAC  ++A   GREIHGY+++N    +    N+L+DMY K G
Sbjct: 489 LLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCG 548

Query: 348 CIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVL 407
            +  A  +F  +  KD+VSWTVMI GY +HG GK  ++LF +M R   +  DEI++ ++L
Sbjct: 549 ALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM-RQAGIEADEISFVSLL 608

Query: 408 HACSTASMVDEGDFYFN-----CITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKH 467
           +ACS + +VDEG  +FN     C  EPTV H+A  V +LAR G L +A  F+E   +   
Sbjct: 609 YACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPD 668

Query: 468 AEVLRALLNGCRNHHQEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRET 527
           A +  ALL GCR HH  KL +++ E++ + EP N   Y+L++N YA  EKW+ V+RLR+ 
Sbjct: 669 ATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKR 728

Query: 528 IRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFR 587
           I   GLR     SW+E + ++++F  GD S+P ++NI   L+ +  +M E+G+ P   + 
Sbjct: 729 IGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYA 788

Query: 588 FHDVDE-ERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR 647
             D +E E+E  L GHSE LA++ G+IS+  G+ IR+TKNLRVC  CHE AKF+SK+  R
Sbjct: 789 LIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRR 848

Query: 648 EIIVRDPYVFHHFKDGYCSCEAF 665
           EI++RD   FH FKDG+CSC  F
Sbjct: 849 EIVLRDSNRFHQFKDGHCSCRGF 870

BLAST of Spg030768 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 426.8 bits (1096), Expect = 3.2e-119
Identity = 222/630 (35.24%), Postives = 351/630 (55.71%), Query Frame = 0

Query: 39  AENYANLCVAHQPFDEIPIWDTFAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTL 98
           + ++ ++  A Q FD++P    F WN +I+ +  N      +  Y  M    V PD  T 
Sbjct: 63  SSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTF 122

Query: 99  PRIICATRQCGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSAC 158
           P ++ A      LQ+G+ +HAQ F+LGF ++++V   LI LY       +A+ + +    
Sbjct: 123 PHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPL 182

Query: 159 --RNSVSWTMLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGR 218
             R  VSWT +   Y    +P  A+++F QM ++    D VAL + + A   L+ L+ GR
Sbjct: 183 PERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGR 242

Query: 219 NIHHIARIQALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKG 278
           +IH       LE +  +  SL  MY  CG +  A+  FD+M   ++I W  +I  Y K G
Sbjct: 243 SIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNG 302

Query: 279 GINEGFKLFRQMNMDGGLKADPLTISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQ 338
              E   +F +M ++  ++ D ++I+S + AC ++ + +  R ++ YV ++   +++   
Sbjct: 303 YAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFIS 362

Query: 339 NALIDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRV 398
           +ALIDM+ K G ++ A  +F R  ++D+V W+ MI+GY LHG+ +  +SL++ MER   V
Sbjct: 363 SALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERG-GV 422

Query: 399 HRDEITYTAVLHACSTASMVDEGDFYFNCITE----PTVAHFALKVALLARAGRLDEART 458
           H +++T+  +L AC+ + MV EG ++FN + +    P   H+A  + LL RAG LD+A  
Sbjct: 423 HPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYE 482

Query: 459 FVEKHKLDKHAEVLRALLNGCRNHHQEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEK 518
            ++   +     V  ALL+ C+ H   +LG+   +QL   +P N  +Y+ LSN YA    
Sbjct: 483 VIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARL 542

Query: 519 WDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEE 578
           WD V  +R  +++ GL      SW+E R ++  F  GD SHPR + I   ++ +  +++E
Sbjct: 543 WDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKE 602

Query: 579 DGFKPKPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHES 638
            GF    D   HD+ DEE E  L  HSE +AI++GLIST  G  +RITKNLR C +CH +
Sbjct: 603 GGFVANKDASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAA 662

Query: 639 AKFISKIVGREIIVRDPYVFHHFKDGYCSC 662
            K ISK+V REI+VRD   FHHFKDG CSC
Sbjct: 663 TKLISKLVDREIVVRDTNRFHHFKDGVCSC 690

BLAST of Spg030768 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 392.1 bits (1006), Expect = 8.8e-109
Identity = 223/665 (33.53%), Postives = 345/665 (51.88%), Query Frame = 0

Query: 42  YANLCVAHQP---FDEIPIWDTFAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTL 101
           Y NL + H+    F  +      AW ++I+           ++++ +M   G  PD +  
Sbjct: 49  YTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLFSKALASFVEMRASGRCPDHNVF 108

Query: 102 PRIICATRQCGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGIL--------------- 161
           P ++ +     DL+ G+ +H    +LG   +LY   +L+ +Y  L               
Sbjct: 109 PSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVGNVFDE 168

Query: 162 ----------------DC-----ADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAVD 221
                            C      D+ + + +    ++ VS+  +   Y        A+ 
Sbjct: 169 MPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALR 228

Query: 222 LFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDVLVSNSLLKMYL 281
           +  +M       D+  L++ +        +  G+ IH     + ++ DV + +SL+ MY 
Sbjct: 229 MVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYA 288

Query: 282 DCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTIS 341
               I+D+   F R+  +D ISW  L+  YV+ G  NE  +LFRQM +   +K   +  S
Sbjct: 289 KSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQM-VTAKVKPGAVAFS 348

Query: 342 SILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMKEK 401
           S++PAC  +A    G+++HGYVL+     N+   +AL+DMY K G I++A KIF RM   
Sbjct: 349 SVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVL 408

Query: 402 DMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEGDFY 461
           D VSWT +I+G++LHG G   VSLF+EM+R   V  +++ + AVL ACS   +VDE   Y
Sbjct: 409 DEVSWTAIIMGHALHGHGHEAVSLFEEMKRQ-GVKPNQVAFVAVLTACSHVGLVDEAWGY 468

Query: 462 FNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCRNHH 521
           FN +T+       + H+A    LL RAG+L+EA  F+ K  ++    V   LL+ C  H 
Sbjct: 469 FNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHK 528

Query: 522 QEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWM 581
             +L +++ E++   +  N   Y+L+ N YA N +W  + +LR  +R  GLR K A SW+
Sbjct: 529 NLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWI 588

Query: 582 EFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIG 641
           E +NK H F +GD SHP    I   L+ +M++ME++G+        HDVDEE +  +L G
Sbjct: 589 EMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFG 648

Query: 642 HSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYVFHHFKD 662
           HSE LA++FG+I+TE G  IR+TKN+R+C  CH + KFISKI  REIIVRD   FHHF  
Sbjct: 649 HSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNR 708

BLAST of Spg030768 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 389.4 bits (999), Expect = 5.7e-108
Identity = 233/731 (31.87%), Postives = 369/731 (50.48%), Query Frame = 0

Query: 14  ITQKPNHTYHRHRLFNNPPHVRTTTAEN--------YANLCVAHQPFDEIPIWDTFAWNN 73
           +  K  +  H  +LF+  P +RT  + N          ++    + FD++P  D+ +W  
Sbjct: 58  VYSKTGYALHARKLFDEMP-LRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTT 117

Query: 74  LIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDLQVGKQLHAQAFKLG 133
           +I  +   G     I     M+  G+ P + TL  ++ +      ++ GK++H+   KLG
Sbjct: 118 MIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLG 177

Query: 134 FSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAVDLFY 193
              N+ V  SL+ +Y        AK++ D+   R+  SW  +  L++   +   A+  F 
Sbjct: 178 LRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFE 237

Query: 194 QMVE--------------------LAADI------------DAVALATAIGACGALKLLQ 253
           QM E                     A DI            D   LA+ + AC  L+ L 
Sbjct: 238 QMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLC 297

Query: 254 HGRNIHHIARIQALEFDVLVSNSLLKMYLDCGSIKDAR--------------GF------ 313
            G+ IH        +   +V N+L+ MY  CG ++ AR              GF      
Sbjct: 298 IGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDG 357

Query: 314 -------------FDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT 373
                        F  +  +DV++WT +I  Y + G   E   LFR M + GG + +  T
Sbjct: 358 YIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSM-VGGGQRPNSYT 417

Query: 374 ISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQSASKIFSRMK 433
           ++++L     +A+  HG++IHG  +K+    ++   NALI MY K+G I SAS+ F  ++
Sbjct: 418 LAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIR 477

Query: 434 -EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTASMVDEG 493
            E+D VSWT MI+  + HG  +  + LF+ M     +  D ITY  V  AC+ A +V++G
Sbjct: 478 CERDTVSWTSMIIALAQHGHAEEALELFETMLME-GLRPDHITYVGVFSACTHAGLVNQG 537

Query: 494 DFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLNGCR 553
             YF+ + +     PT++H+A  V L  RAG L EA+ F+EK  ++       +LL+ CR
Sbjct: 538 RQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACR 597

Query: 554 NHHQEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAY 613
            H    LGK   E+L   EP N+  Y  L+N Y+   KW+   ++R++++D  ++ ++ +
Sbjct: 598 VHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGF 657

Query: 614 SWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECV 665
           SW+E ++K+HVFG  D +HP    IY  ++ +  ++++ G+ P      HD++EE +E +
Sbjct: 658 SWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQI 717

BLAST of Spg030768 vs. TAIR 10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 386.0 bits (990), Expect = 6.3e-107
Identity = 221/615 (35.93%), Postives = 327/615 (53.17%), Query Frame = 0

Query: 52  FDEIPIWDTFAWNNLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQCGDL 111
           FD +P  D  +WN +I  +  NG     +  +  M    V PD  TL  +I A    GD 
Sbjct: 254 FDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELLGDR 313

Query: 112 QVGKQLHAQAFKLGFSSNLYVITSLIELYGILDCADTAKWLHDKSACRNSVSWTMLAKLY 171
           ++G+ +HA     GF+ ++ V  SL ++Y        A+ L  +   ++ VSWT +   Y
Sbjct: 314 RLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGY 373

Query: 172 LMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIQALEFDV 231
                P  A+D +  M + +   D + +A  + AC  L  L  G  +H +A    L   V
Sbjct: 374 EYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYV 433

Query: 232 LVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMD 291
           +V+N+L+ MY  C  I  A   F  +P K+VISWT +I          E     RQM M 
Sbjct: 434 IVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQMKMT 493

Query: 292 GGLKADPLTISSILPACGRMAAHKHGREIHGYVLKNAINENLIAQNALIDMYVKSGCIQS 351
             L+ + +T+++ L AC R+ A   G+EIH +VL+  +  +    NAL+DMYV+ G + +
Sbjct: 494 --LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRMNT 553

Query: 352 ASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACS 411
           A   F+  K KD+ SW +++ GYS  GQG + V LF  M ++ RV  DEIT+ ++L  CS
Sbjct: 554 AWSQFNSQK-KDVTSWNILLTGYSERGQGSMVVELFDRMVKS-RVRPDEITFISLLCGCS 613

Query: 412 TASMVDEGDFYFNCITE----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLR 471
            + MV +G  YF+ + +    P + H+A  V LL RAG L EA  F++K  +     V  
Sbjct: 614 KSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAVWG 673

Query: 472 ALLNGCRNHHQEKLGKRIIEQLCDFEPLNAENYILLSNWYACNEKWDMVERLRETIRDMG 531
           ALLN CR HH+  LG+   + + + +  +   YILL N YA   KW  V ++R  +++ G
Sbjct: 674 ALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKENG 733

Query: 532 LRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGF-KPKPDFRFHDV 591
           L      SW+E + K+H F + D  HP+++ I   L+   +KM E G  K        + 
Sbjct: 734 LTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLEGFYEKMSEVGLTKISESSSMDET 793

Query: 592 DEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVR 651
           +  R+ +  GHSE  AI+FGLI+T  G  I +TKNL +C +CH++ KFISK V REI VR
Sbjct: 794 EISRDEIFCGHSERKAIAFGLINTVPGMPIWVTKNLSMCENCHDTVKFISKTVRREISVR 853

Query: 652 DPYVFHHFKDGYCSC 662
           D   FHHFKDG CSC
Sbjct: 854 DAEHFHHFKDGECSC 864

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004137884.20.0e+0087.22pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus... [more]
KAG6597728.10.0e+0086.77Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb... [more]
KAG7029175.10.0e+0086.77Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb... [more]
XP_022158739.10.0e+0087.97pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Momordica ... [more]
XP_023539701.10.0e+0086.47pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ... [more]
Match NameE-valueIdentityDescription
Q9SN397.5e-12938.04Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9LTV84.5e-11835.24Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9LW631.2e-10733.53Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Q9SHZ88.0e-10731.87Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
O823808.9e-10632.71Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1E0A40.0e+0087.97pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Momordic... [more]
A0A6J1I9E10.0e+0086.47pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit... [more]
A0A1S3CPR50.0e+0086.92pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis ... [more]
A0A6J1EXC60.0e+0086.02pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit... [more]
A0A0A0L9N40.0e+0085.18DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G7228... [more]
Match NameE-valueIdentityDescription
AT4G18750.15.3e-13038.04Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.13.2e-11935.24mitochondrial editing factor 22 [more]
AT3G23330.18.8e-10933.53Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G22070.15.7e-10831.87pentatricopeptide (PPR) repeat-containing protein [more]
AT1G15510.16.3e-10735.93Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 37..141
e-value: 3.9E-9
score: 38.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 317..386
e-value: 2.9E-6
score: 28.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 169..315
e-value: 4.5E-22
score: 80.8
coord: 387..567
e-value: 1.7E-16
score: 62.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 162..189
e-value: 0.24
score: 11.7
coord: 235..260
e-value: 0.38
score: 11.1
coord: 337..363
e-value: 1.4E-4
score: 21.9
coord: 263..292
e-value: 0.001
score: 19.2
coord: 365..393
e-value: 1.0E-4
score: 22.3
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 62..94
e-value: 0.0012
score: 16.9
coord: 336..365
e-value: 1.4E-4
score: 19.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 261..295
score: 9.689847
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 59..93
score: 9.832344
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 332..366
score: 9.514466
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 534..655
e-value: 2.1E-33
score: 114.9
NoneNo IPR availablePANTHERPTHR24015:SF1853REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 1..657
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 1..657

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spg030768.1Spg030768.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding