CsGy1G007550 (gene) Cucumber (Gy14) v2

Name: CsGy1G007550
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Pentatricopeptide repeat superfamily protein, putative
Location: Chr1 : 4840735 .. 4842540 (+)
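
The location field can be read programmatically. The sketch below is a minimal illustration, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

# Pattern for strings such as "Chr1 : 4840735 .. 4842540 (+)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    m = LOCATION_RE.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Chr1 : 4840735 .. 4842540 (+)")
print(seqid, strand, span_length(start, end))  # Chr1 + 1806
```

Under that assumption the feature spans 1806 nt, i.e. 602 codons, which is consistent with the 601-residue protein reported in the BLAST results plus one stop codon.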
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGATTCATAGGTTCTCTTCATCTTCCACCTTGATTACCAAGAAGCCTCTTTATCTATGGAACTTGACGATTAGAAGCTCTGTCAATGGCGGGTTTTTCGCCCAATCTCTTGAAACCTACTCGTTTATGCGCCACTCTGGAATCCATGGCAACAATTTCACCTTTCCTCTCCTCCTCAAGGCTTGCGCCAATCTTGCTTCGATCGGTGATGGCACAATGCTCCACGCTCACCTCATCCATGTAGGCTTTGAATCAGACGTCTTTGTTCAAACCTCGCTCGTTGACATGTACTCCAAATTTTCTAACTTGCGTGCTTCACGCCAAGTGTTTGACGAAACGTCTACAAGAAGTGTCATTTCTTGGAATTCTATGATTGCTGCTTATTCTCGTAGTTTTCGGGTTAATGAAGCTTTAAAGCTATTCAGAGAGATGTTGGGGGGTGGATTTGAGCCAAATTCCTCAACTTTTGTAAGCTTATTGTCAGGTTTTGCTGACCCAACTCATGGATCTCTCTTTCAGGGACGTTTGCTACACGGTTGCTTAACCAAGTTTCAACTTCATGATGATACGCCTGTCGAAAATTCTCTTGTGCAAATGTACGTAAACTTTGGTCAAATCGATTCTGCTTGCTCTGTTTTTTATGCCATCAGCGAGAAGACAGTAATTTCTTGGACAATAATGCTTGGTGGTTACTTGAAAGCTGGGGCTGTTGCCAAAGTATTCGAAACCTTTAGCCAAATGAGGCAAAATAATGTCGTATTGGATAAATTTGTTTTTGTAGACATAATCTCCTCTTGTATACAACTAGGAAATTTGTTTTTAGGTTCTTCACTTCATTCCCTCCTCTTGAAAACTGGGCTCAAGTACGAGGATCCTATTGGTTGTTTGCTCATTAGCATGTATTCAAAATGTGGAGACCTCTTGTCTGCTCGAGCAGTATTTGATTTGTTATCTGAAAAAAGCATCTATTCATGGACATCAATGATAAGTGGATATGCCAATGCTGGGTATCCCAGAGAAGCATTAAGTCTATTTTCAATGGCAACACAAAATAATGTTAGACCAAATGGAGCAATGCTAGCTACTGCTATCTCTGCTTGTGCTGATTTAGGATCATTGAGCATGCGTAGGGAAATTGAGGCATTCATACAGCAGGACGGTTTAGCATCGGATAGTCAAGTTTCAACATCGTTGATACATTTGTATTGCAAATTTGGAAGTATTGAGAAGGCAGAAAAAGTTTTTAATAGTATGATACATAGAGACTTGGCAGCTTGGAGTTCCATGATGAACGGTTATGCCGTGCATGGGATGGGAGAAAAGACGATGAATCTGTTTCATGAGATGCAAAGATCAGGAATAAAACCAGATGGTTCTGTTTATGCAAGCATTTTATTGGCTTGCAGTCATTCAGGTCTAGTGGAAGATGGACTAGAGCATTTCAAGAACATGCAGTTGGATTATGGAATAGTACCTACCATGGTACACTACACTTGTTTGGTAGACATTCTAAGCCGAGCTGGTCATCTAGAATTAGCTTTGAATACAATTCAAGAGATGCCTACCCAATTTCAATCTCAAGCTTGGGCTCCTTTCCTCAGTGCTTGCAGAACTTATTGTGATGTTGAACTTGGAGAAGTTGCAAATAGATGTCTATTAAGTTCAAATCCTAGAAACCCAGTAAATCATGTTTTGATGGCTAATTTATACACATCTATGGGTAAGTGGAAAGAAGCAGCCAAAGTGAGAAGTTTGATTGATGATAAAGGTTTGGTCAAAGAACCAGGATGCAGCCAGCTTTAA

mRNA sequence

ATGCAGATTCATAGGTTCTCTTCATCTTCCACCTTGATTACCAAGAAGCCTCTTTATCTATGGAACTTGACGATTAGAAGCTCTGTCAATGGCGGGTTTTTCGCCCAATCTCTTGAAACCTACTCGTTTATGCGCCACTCTGGAATCCATGGCAACAATTTCACCTTTCCTCTCCTCCTCAAGGCTTGCGCCAATCTTGCTTCGATCGGTGATGGCACAATGCTCCACGCTCACCTCATCCATGTAGGCTTTGAATCAGACGTCTTTGTTCAAACCTCGCTCGTTGACATGTACTCCAAATTTTCTAACTTGCGTGCTTCACGCCAAGTGTTTGACGAAACGTCTACAAGAAGTGTCATTTCTTGGAATTCTATGATTGCTGCTTATTCTCGTAGTTTTCGGGTTAATGAAGCTTTAAAGCTATTCAGAGAGATGTTGGGGGGTGGATTTGAGCCAAATTCCTCAACTTTTGTAAGCTTATTGTCAGGTTTTGCTGACCCAACTCATGGATCTCTCTTTCAGGGACGTTTGCTACACGGTTGCTTAACCAAGTTTCAACTTCATGATGATACGCCTGTCGAAAATTCTCTTGTGCAAATGTACGTAAACTTTGGTCAAATCGATTCTGCTTGCTCTGTTTTTTATGCCATCAGCGAGAAGACAGTAATTTCTTGGACAATAATGCTTGGTGGTTACTTGAAAGCTGGGGCTGTTGCCAAAGTATTCGAAACCTTTAGCCAAATGAGGCAAAATAATGTCGTATTGGATAAATTTGTTTTTGTAGACATAATCTCCTCTTGTATACAACTAGGAAATTTGTTTTTAGGTTCTTCACTTCATTCCCTCCTCTTGAAAACTGGGCTCAAGTACGAGGATCCTATTGGTTGTTTGCTCATTAGCATGTATTCAAAATGTGGAGACCTCTTGTCTGCTCGAGCAGTATTTGATTTGTTATCTGAAAAAAGCATCTATTCATGGACATCAATGATAAGTGGATATGCCAATGCTGGGTATCCCAGAGAAGCATTAAGTCTATTTTCAATGGCAACACAAAATAATGTTAGACCAAATGGAGCAATGCTAGCTACTGCTATCTCTGCTTGTGCTGATTTAGGATCATTGAGCATGCGTAGGGAAATTGAGGCATTCATACAGCAGGACGGTTTAGCATCGGATAGTCAAGTTTCAACATCGTTGATACATTTGTATTGCAAATTTGGAAGTATTGAGAAGGCAGAAAAAGTTTTTAATAGTATGATACATAGAGACTTGGCAGCTTGGAGTTCCATGATGAACGGTTATGCCGTGCATGGGATGGGAGAAAAGACGATGAATCTGTTTCATGAGATGCAAAGATCAGGAATAAAACCAGATGGTTCTGTTTATGCAAGCATTTTATTGGCTTGCAGTCATTCAGGTCTAGTGGAAGATGGACTAGAGCATTTCAAGAACATGCAGTTGGATTATGGAATAGTACCTACCATGGTACACTACACTTGTTTGGTAGACATTCTAAGCCGAGCTGGTCATCTAGAATTAGCTTTGAATACAATTCAAGAGATGCCTACCCAATTTCAATCTCAAGCTTGGGCTCCTTTCCTCAGTGCTTGCAGAACTTATTGTGATGTTGAACTTGGAGAAGTTGCAAATAGATGTCTATTAAGTTCAAATCCTAGAAACCCAGTAAATCATGTTTTGATGGCTAATTTATACACATCTATGGGTAAGTGGAAAGAAGCAGCCAAAGTGAGAAGTTTGATTGATGATAAAGGTTTGGTCAAAGAACCAGGATGCAGCCAGCTTTAA

Coding sequence (CDS)

ATGCAGATTCATAGGTTCTCTTCATCTTCCACCTTGATTACCAAGAAGCCTCTTTATCTATGGAACTTGACGATTAGAAGCTCTGTCAATGGCGGGTTTTTCGCCCAATCTCTTGAAACCTACTCGTTTATGCGCCACTCTGGAATCCATGGCAACAATTTCACCTTTCCTCTCCTCCTCAAGGCTTGCGCCAATCTTGCTTCGATCGGTGATGGCACAATGCTCCACGCTCACCTCATCCATGTAGGCTTTGAATCAGACGTCTTTGTTCAAACCTCGCTCGTTGACATGTACTCCAAATTTTCTAACTTGCGTGCTTCACGCCAAGTGTTTGACGAAACGTCTACAAGAAGTGTCATTTCTTGGAATTCTATGATTGCTGCTTATTCTCGTAGTTTTCGGGTTAATGAAGCTTTAAAGCTATTCAGAGAGATGTTGGGGGGTGGATTTGAGCCAAATTCCTCAACTTTTGTAAGCTTATTGTCAGGTTTTGCTGACCCAACTCATGGATCTCTCTTTCAGGGACGTTTGCTACACGGTTGCTTAACCAAGTTTCAACTTCATGATGATACGCCTGTCGAAAATTCTCTTGTGCAAATGTACGTAAACTTTGGTCAAATCGATTCTGCTTGCTCTGTTTTTTATGCCATCAGCGAGAAGACAGTAATTTCTTGGACAATAATGCTTGGTGGTTACTTGAAAGCTGGGGCTGTTGCCAAAGTATTCGAAACCTTTAGCCAAATGAGGCAAAATAATGTCGTATTGGATAAATTTGTTTTTGTAGACATAATCTCCTCTTGTATACAACTAGGAAATTTGTTTTTAGGTTCTTCACTTCATTCCCTCCTCTTGAAAACTGGGCTCAAGTACGAGGATCCTATTGGTTGTTTGCTCATTAGCATGTATTCAAAATGTGGAGACCTCTTGTCTGCTCGAGCAGTATTTGATTTGTTATCTGAAAAAAGCATCTATTCATGGACATCAATGATAAGTGGATATGCCAATGCTGGGTATCCCAGAGAAGCATTAAGTCTATTTTCAATGGCAACACAAAATAATGTTAGACCAAATGGAGCAATGCTAGCTACTGCTATCTCTGCTTGTGCTGATTTAGGATCATTGAGCATGCGTAGGGAAATTGAGGCATTCATACAGCAGGACGGTTTAGCATCGGATAGTCAAGTTTCAACATCGTTGATACATTTGTATTGCAAATTTGGAAGTATTGAGAAGGCAGAAAAAGTTTTTAATAGTATGATACATAGAGACTTGGCAGCTTGGAGTTCCATGATGAACGGTTATGCCGTGCATGGGATGGGAGAAAAGACGATGAATCTGTTTCATGAGATGCAAAGATCAGGAATAAAACCAGATGGTTCTGTTTATGCAAGCATTTTATTGGCTTGCAGTCATTCAGGTCTAGTGGAAGATGGACTAGAGCATTTCAAGAACATGCAGTTGGATTATGGAATAGTACCTACCATGGTACACTACACTTGTTTGGTAGACATTCTAAGCCGAGCTGGTCATCTAGAATTAGCTTTGAATACAATTCAAGAGATGCCTACCCAATTTCAATCTCAAGCTTGGGCTCCTTTCCTCAGTGCTTGCAGAACTTATTGTGATGTTGAACTTGGAGAAGTTGCAAATAGATGTCTATTAAGTTCAAATCCTAGAAACCCAGTAAATCATGTTTTGATGGCTAATTTATACACATCTATGGGTAAGTGGAAAGAAGCAGCCAAAGTGAGAAGTTTGATTGATGATAAAGGTTTGGTCAAAGAACCAGGATGCAGCCAGCTTTAA

Protein sequence

MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL
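
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only: it assumes the standard nuclear genetic code (NCBI translation table 1), and the function name `translate` is hypothetical rather than anything provided by the database.

```python
# Standard genetic code (NCBI translation table 1), codons ordered with
# each base cycling through T, C, A, G (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence read in frame 0.

    Codons containing characters other than A/C/G/T become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last three codons of the CDS listed above.
print(translate("ATGCAGATTCATAGGTTC"))  # MQIHRF
print(translate("CAGCTTTAA"))           # QL
```

Passing the full CDS string to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.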
BLAST of CsGy1G007550 vs. NCBI nr
Match: XP_004137641.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativus] >KGN64229.1 hypothetical protein Csa_1G043310 [Cucumis sativus])

HSP 1 Score: 1201.0 bits (3106), Expect = 0.0e+00
Identity = 601/601 (100.00%), Positives = 601/601 (100.00%), Query Frame = 0

Query: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60
           MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120
           KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180
           SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG
Sbjct: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240
           CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK
Sbjct: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300
           VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300

Query: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360
           MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360

Query: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420
           LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI
Sbjct: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420

Query: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480
           HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE
Sbjct: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480

Query: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540
           HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600
           CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600

Query: 601 L 602
           L
Sbjct: 601 L 601
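
The percentages on each HSP statistics line are the identical (or positive-scoring) position count divided by the alignment length. The sketch below recomputes them from such a line; the function name `hsp_percentages` is hypothetical, and the regular expression is an assumption based only on the line layout seen in this record (it accepts both "Positives" and the misspelling "Postives").

```python
import re

# Matches e.g. "Identity = 578/601 (96.17%), Positives = 588/601 (97.84%), ..."
STATS_RE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(stats_line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    m = STATS_RE.search(stats_line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {stats_line!r}")
    identical, ident_len, positive, pos_len = map(int, m.groups())
    return (round(100 * identical / ident_len, 2),
            round(100 * positive / pos_len, 2))

line = "Identity = 578/601 (96.17%), Positives = 588/601 (97.84%), Query Frame = 0"
print(hsp_percentages(line))  # (96.17, 97.84)
```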

BLAST of CsGy1G007550 vs. NCBI nr
Match: XP_008446053.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Cucumis melo])

HSP 1 Score: 1156.4 bits (2990), Expect = 0.0e+00
Identity = 578/601 (96.17%), Positives = 588/601 (97.84%), Query Frame = 0

Query: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60
           MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQ+LETYSFMR SGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQTLETYSFMRQSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120
           KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSK S+LRASRQVFDETSTRSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKISDLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180
            WNSMIAAYSR FRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG
Sbjct: 121 FWNSMIAAYSRGFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240
            +TKFQ HDDTPV+NSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK
Sbjct: 181 FMTKFQFHDDTPVQNSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300
           VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNL LGSSLHSLLLKT LKY+DPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLSLGSSLHSLLLKTALKYQDPIGCLLIS 300

Query: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360
           MYSKCGDLLSARAVFDLLSEKSIYSWTSMIS YANAGYPREALSLF+MATQNNVRPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISEYANAGYPREALSLFTMATQNNVRPNGAM 360

Query: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420
           LATAISACADLGSLSM REIEAFIQQDGLASD QVSTSLIHLYCKFGS EKAEKVF+SMI
Sbjct: 361 LATAISACADLGSLSMLREIEAFIQQDGLASDYQVSTSLIHLYCKFGSFEKAEKVFSSMI 420

Query: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480
           HRDLAAWSSMMNGYA+HGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGL+
Sbjct: 421 HRDLAAWSSMMNGYAMHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLQ 480

Query: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540
           HFKNMQLDYGIVP MVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPNMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600
           CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVK+PGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKQPGCSQ 600

Query: 601 L 602
           L
Sbjct: 601 L 601

BLAST of CsGy1G007550 vs. NCBI nr
Match: XP_022137264.1 (pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia])

HSP 1 Score: 984.6 bits (2544), Expect = 1.5e-283
Identity = 494/601 (82.20%), Positives = 537/601 (89.35%), Query Frame = 0

Query: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60
           MQ HRFS SS  I K+PLYLWNL IRSSVNGGFFA++LETYSFMRHSGIHGNNFTFPLLL
Sbjct: 1   MQAHRFSPSSGFI-KRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120
           KACANLASIGDGTMLHAHLI VGFE+D+FVQTSLVDMYSK  +L +SRQVFDE S RSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVI 120

Query: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180
           SWNSMIAAYSR+FRVNE  KLFREM G GFEPNSSTFVSLLSGFA+P HGSLFQ  L+ G
Sbjct: 121 SWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQG 180

Query: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240
           CLTKF+L +DTPV NSL++MYVNFGQID+A SVFYAI  KTVISWTIMLGGYLK+GAVA+
Sbjct: 181 CLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAE 240

Query: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300
           VF  FSQMR NNVVLDK VFVDIISSCIQLGNL L SSLHSLLLK GL  EDPIGCLLIS
Sbjct: 241 VFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLIS 300

Query: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360
           MYSKCGD LSARAVFD+L EK I+ WTS+ISGYANAGYP EAL LF+MATQNN+RPNGAM
Sbjct: 301 MYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAM 360

Query: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420
           LATA+SACAD GSLSM +E+EA+IQ +G+A D QVSTSLIH+YCK  SI+KAE VF SMI
Sbjct: 361 LATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMI 420

Query: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480
            RDLAAWS+MMNGYAV+GMGE+ +NLFHEM+R+GIKPD SVYASILLACSHSGLVEDGL 
Sbjct: 421 SRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLN 480

Query: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540
           HFKNMQLD+GI PT+ HYTCLVDILSRAGHLELALN IQEMP QFQ+QAW PFLSACRTY
Sbjct: 481 HFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTY 540

Query: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600
           CDVELGEVAN+ +  SNP NPVNHVL+ANLYTS+GKWKEAA VRSLI DKGLVKEPGCSQ
Sbjct: 541 CDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQ 600

Query: 601 L 602
           L
Sbjct: 601 L 600

BLAST of CsGy1G007550 vs. NCBI nr
Match: XP_020221554.1 (pentatricopeptide repeat-containing protein At4g19191, mitochondrial-like [Cajanus cajan])

HSP 1 Score: 657.1 bits (1694), Expect = 5.6e-185
Identity = 329/596 (55.20%), Positives = 435/596 (72.99%), Query Frame = 0

Query: 9   SSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLLKACANLAS 68
           SS    ++ LY WNL IR S N GFFAQ+L  YS M HSG+HGNN T+PLLLKACANLAS
Sbjct: 6   SSLANFRRSLYQWNLMIRDSNNNGFFAQTLNIYSSMAHSGVHGNNLTYPLLLKACANLAS 65

Query: 69  IGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAA 128
           I  GTMLH H++ +GF++D FVQT+L+DMYSK S++ ++RQVFDE   RSV+SWN+M++A
Sbjct: 66  IQHGTMLHGHVLKLGFQADTFVQTALLDMYSKCSHVSSARQVFDEMPQRSVVSWNAMVSA 125

Query: 129 YSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFAD-PTHGSLFQGRLLHGCLTKFQL 188
           YSR   V++AL+  +EM   GFEPN STFVS+LSG+++  +    +QGR +H CL K  +
Sbjct: 126 YSRGSSVDQALRFLKEMWVLGFEPNCSTFVSILSGYSNLDSFKFHWQGRSIHCCLIKLGI 185

Query: 189 -HDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFS 248
            + +  + NSL+ MY  F  +D A  VF  + EK++ISWT M+GGY+K G   + F+ F+
Sbjct: 186 VYLEVSLANSLMGMYAQFCLMDEARKVFDLMDEKSIISWTTMIGGYVKIGHAVEAFKLFN 245

Query: 249 QMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCG 308
           QM+  ++ +D  VF+++IS CIQ+G   L SS+HSL+LK G   ED I  LLI+MY+KCG
Sbjct: 246 QMQHQSIGIDFVVFLNLISGCIQVGEFLLASSVHSLVLKCGCDVEDSIENLLITMYAKCG 305

Query: 309 DLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAIS 368
            L SAR +FDL+ EKS+ +WTSMI+GY ++G+P EAL LF    +  +RPNGA LAT +S
Sbjct: 306 RLTSARRIFDLIIEKSMLTWTSMIAGYVHSGHPVEALDLFRRMVKTGIRPNGATLATVLS 365

Query: 369 ACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAA 428
           ACADLGSLS   EIE +I  +GL SD Q+ TSLIH+Y K GSI KA +VF  +  +DL  
Sbjct: 366 ACADLGSLSTGEEIEEYIFLNGLESDQQIQTSLIHMYSKCGSIMKAREVFERVRDKDLTV 425

Query: 429 WSSMMNGYAVHGMGEKTMNLFHEMQRS-GIKPDGSVYASILLACSHSGLVEDGLEHFKNM 488
           W+SM+N YA+HGMG++ ++LFH+M  +  I PD  VY  +LLACSHSGLVEDGL++FK+M
Sbjct: 426 WTSMINSYAIHGMGDEAISLFHKMTTAERIMPDAIVYTGVLLACSHSGLVEDGLKYFKSM 485

Query: 489 QLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVEL 548
           Q D+GI PT+ H TCL+D+L R G L+LA + IQ MP + Q+QAW P LSACR + +VEL
Sbjct: 486 QKDFGIAPTVEHCTCLIDLLGRVGQLDLAFDAIQGMPLEVQAQAWGPLLSACRIHGNVEL 545

Query: 549 GEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL 602
           GE+    LL  NP +  ++VLMANLYTS+GKWKEA  +R+LID KGLVKE G SQ+
Sbjct: 546 GELVTDRLLDINPESSGSYVLMANLYTSLGKWKEAHMMRNLIDGKGLVKERGWSQV 601

BLAST of CsGy1G007550 vs. NCBI nr
Match: RDY03240.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Mucuna pruriens])

HSP 1 Score: 655.6 bits (1690), Expect = 1.6e-184
Identity = 329/590 (55.76%), Positives = 434/590 (73.56%), Query Frame = 0

Query: 15  KKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTM 74
           ++PLYLWNL IR S N GFF+Q+L  YS M HSG+HGNN T+PLLLKACANL+SI  G M
Sbjct: 12  RRPLYLWNLMIRDSTNNGFFSQTLNIYSSMAHSGVHGNNLTYPLLLKACANLSSIQHGIM 71

Query: 75  LHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRSFR 134
           LH H++ +GF++D FVQTSLVDMYSK S++ ++RQVFDE   RSV+SWN+M++AYSR   
Sbjct: 72  LHGHVLKLGFQADTFVQTSLVDMYSKCSHVASARQVFDEMPQRSVVSWNAMVSAYSRRSS 131

Query: 135 VNEALKLFREMLGGGFEPNSSTFVSLLSGFAD-PTHGSLFQGRLLHGCLTKFQL-HDDTP 194
           +++AL L +EM   GFEP SSTFVS+LSG ++  +     QGR +H CL K  + + +  
Sbjct: 132 MDQALSLLKEMWVLGFEPTSSTFVSILSGCSNLDSFEFCLQGRSIHCCLIKLGIVYLEVS 191

Query: 195 VENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQNN 254
           + NSL+ MYV F  +  A  VF  + EK++ISWT M+GGY+K G   + F+ F+QM+  +
Sbjct: 192 LANSLMGMYVQFCLMGEARKVFDLMDEKSIISWTTMIGGYVKIGHALEAFDLFNQMQHQS 251

Query: 255 VVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSAR 314
             +D  VF+++IS CIQ+  L L SS+HS +LK G   ED I  LLI+MY+KCG+L SAR
Sbjct: 252 TGIDFVVFLNLISGCIQVRELLLASSVHSFVLKCGCDEEDSIENLLITMYAKCGNLTSAR 311

Query: 315 AVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADLG 374
            +FDL+ EKS+ SWTSMI+GYA++ +P EAL LF    + ++RPNGA LAT +SACADLG
Sbjct: 312 RIFDLIIEKSMLSWTSMIAGYAHSCHPEEALDLFRRMVRTDIRPNGATLATVLSACADLG 371

Query: 375 SLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSMMN 434
           SL   +EIE ++  +GL +D QV TSLIH+Y K GSI KA +VF  +  +DL  W+SM+N
Sbjct: 372 SLGTGQEIEEYVFLNGLEADQQVQTSLIHMYSKCGSIMKAREVFERVTDKDLTVWTSMIN 431

Query: 435 GYAVHGMGEKTMNLFHEMQRS-GIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYGI 494
            YA+HGMG + ++LFH+M  + GI PD  VY S+LLACSHSGLVEDGL++FK+MQ D+GI
Sbjct: 432 SYAIHGMGNEAISLFHKMTTAEGIMPDAIVYTSVLLACSHSGLVEDGLKYFKSMQKDFGI 491

Query: 495 VPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVANR 554
            PT+ H TCL+D+L R G L+LAL+ IQ MP + Q+QAW P LSAC  + +VELGE+A  
Sbjct: 492 APTVEHCTCLIDLLGRVGQLDLALDAIQGMPLEVQAQAWGPLLSACIIHGNVELGELATV 551

Query: 555 CLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL 602
            LL  +P +  N+VLMANLYTS+GKWKEA  +R+LID KGLVKE G SQ+
Sbjct: 552 KLLEISPGSSGNYVLMANLYTSLGKWKEAHMMRNLIDGKGLVKECGWSQV 601

BLAST of CsGy1G007550 vs. TAIR10
Match: AT2G03380.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 360.5 bits (924), Expect = 2.0e-99
Identity = 203/591 (34.35%), Positives = 323/591 (54.65%), Query Frame = 0

Query: 13  ITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDG 72
           I +   YLW + +R         + ++ Y  +   G   ++  F   LKAC  L  + +G
Sbjct: 102 IPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQDLDNG 161

Query: 73  TMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRS 132
             +H  L+ V    D  V T L+DMY+K   ++++ +VF++ + R+V+ W SMIA Y ++
Sbjct: 162 KKIHCQLVKVP-SFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGYVKN 221

Query: 133 FRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKFQLHDDTP 192
               E L LF  M       N  T+ +L+   A     +L QG+  HGCL K  +   + 
Sbjct: 222 DLCEEGLVLFNRMRENNVLGNEYTYGTLI--MACTKLSALHQGKWFHGCLVKSGIELSSC 281

Query: 193 VENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQNN 252
           +  SL+ MYV  G I +A  VF   S   ++ WT M+ GY   G+V +    F +M+   
Sbjct: 282 LVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVE 341

Query: 253 VVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSAR 312
           +  +      ++S C  + NL LG S+H L +K G+ ++  +   L+ MY+KC     A+
Sbjct: 342 IKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGI-WDTNVANALVHMYAKCYQNRDAK 401

Query: 313 AVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADLG 372
            VF++ SEK I +W S+ISG++  G   EAL LF      +V PNG  +A+  SACA LG
Sbjct: 402 YVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLG 461

Query: 373 SLSMRREIEAFIQQDG-LASDS-QVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSM 432
           SL++   + A+  + G LAS S  V T+L+  Y K G  + A  +F+++  ++   WS+M
Sbjct: 462 SLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKNTITWSAM 521

Query: 433 MNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYG 492
           + GY   G    ++ LF EM +   KP+ S + SIL AC H+G+V +G ++F +M  DY 
Sbjct: 522 IGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMVNEGKKYFSSMYKDYN 581

Query: 493 IVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVAN 552
             P+  HYTC+VD+L+RAG LE AL+ I++MP Q   + +  FL  C  +   +LGE+  
Sbjct: 582 FTPSTKHYTCMVDMLARAGELEQALDIIEKMPIQPDVRCFGAFLHGCGMHSRFDLGEIVI 641

Query: 553 RCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL 602
           + +L  +P +   +VL++NLY S G+W +A +VR+L+  +GL K  G S +
Sbjct: 642 KKMLDLHPDDASYYVLVSNLYASDGRWNQAKEVRNLMKQRGLSKIAGHSTM 688

BLAST of CsGy1G007550 vs. TAIR10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 355.1 bits (910), Expect = 8.3e-98
Identity = 193/588 (32.82%), Positives = 320/588 (54.42%), Query Frame = 0

Query: 13  ITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRH-SGIHGNNFTFPLLLKACANLASIGD 72
           ++++ L+ WN+ +      G+F +++  Y  M    G+  + +TFP +L+ C  +  +  
Sbjct: 155 MSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVLRTCGGIPDLAR 214

Query: 73  GTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSR 132
           G  +H H++  G+E D+ V  +L+ MY K  +++++R +FD    R +ISWN+MI+ Y  
Sbjct: 215 GKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIISWNAMISGYFE 274

Query: 133 SFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKFQLHDDT 192
           +   +E L+LF  M G   +P+  T  S++S  A    G    GR +H  +       D 
Sbjct: 275 NGMCHEGLELFFAMRGLSVDPDLMTLTSVIS--ACELLGDRRLGRDIHAYVITTGFAVDI 334

Query: 193 PVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQN 252
            V NSL QMY+N G    A  +F  +  K ++SWT M+ GY       K  +T+  M Q+
Sbjct: 335 SVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAIDTYRMMDQD 394

Query: 253 NVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSA 312
           +V  D+     ++S+C  LG+L  G  LH L +K  L     +   LI+MYSKC  +  A
Sbjct: 395 SVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINMYSKCKCIDKA 454

Query: 313 RAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADL 372
             +F  +  K++ SWTS+I+G        EAL +F    +  ++PN   L  A++ACA +
Sbjct: 455 LDIFHNIPRKNVISWTSIIAGLRLNNRCFEAL-IFLRQMKMTLQPNAITLTAALAACARI 514

Query: 373 GSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSMM 432
           G+L   +EI A + + G+  D  +  +L+ +Y + G +  A   FNS   +D+ +W+ ++
Sbjct: 515 GALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRMNTAWSQFNSQ-KKDVTSWNILL 574

Query: 433 NGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYGI 492
            GY+  G G   + LF  M +S ++PD   + S+L  CS S +V  GL +F  M+ DYG+
Sbjct: 575 TGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGCSKSQMVRQGLMYFSKME-DYGV 634

Query: 493 VPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVANR 552
            P + HY C+VD+L RAG L+ A   IQ+MP       W   L+ACR +  ++LGE++ +
Sbjct: 635 TPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAVWGALLNACRIHHKIDLGELSAQ 694

Query: 553 CLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCS 600
            +   + ++   ++L+ NLY   GKW+E AKVR ++ + GL  + GCS
Sbjct: 695 HIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKENGLTVDAGCS 737

BLAST of CsGy1G007550 vs. TAIR10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 350.1 bits (897), Expect = 2.7e-96
Identity = 215/630 (34.13%), Positives = 321/630 (50.95%), Query Frame = 0

Query: 18  LYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHA 77
           +Y WN  IRS  + G   + L  +  M       +N+TFP + KAC  ++S+  G   HA
Sbjct: 92  VYHWNSLIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGEISSVRCGESAHA 151

Query: 78  HLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRSFRVNE 137
             +  GF S+VFV  +LV MYS+  +L  +R+VFDE S   V+SWNS+I +Y++  +   
Sbjct: 152 LSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKV 211

Query: 138 ALKLFREMLGG-GFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKFQLHDDTPVENS 197
           AL++F  M    G  P++ T V++L   A     SL  G+ LH      ++  +  V N 
Sbjct: 212 ALEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSL--GKQLHCFAVTSEMIQNMFVGNC 271

Query: 198 LVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQNNV--- 257
           LV MY   G +D A +VF  +S K V+SW  M+ GY + G        F +M++  +   
Sbjct: 272 LVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKXX 331

Query: 258 --------------------------------VLDKFVFVDIISSCIQLGNLFLGSSLHS 317
                                                              L  G  +H 
Sbjct: 332 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALMHGKEIHC 391

Query: 318 L-------LLKTGLKYEDPIGCLLISMYSKCGDLLSARAVFDLLS--EKSIYSWTSMISG 377
                   L K G   E+ +   LI MY+KC  + +ARA+FD LS  E+ + +WT MI G
Sbjct: 392 YAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGG 451

Query: 378 YANAGYPREALSLFSMATQNN--VRPNGAMLATAISACADLGSLSMRREIEAF-IQQDGL 437
           Y+  G   +AL L S   + +   RPN   ++ A+ ACA L +L + ++I A+ ++    
Sbjct: 452 YSQHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIGKQIHAYALRNQQN 511

Query: 438 ASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSMMNGYAVHGMGEKTMNLFHE 497
           A    VS  LI +Y K GSI  A  VF++M+ ++   W+S+M GY +HG GE+ + +F E
Sbjct: 512 AVPLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDE 571

Query: 498 MQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYGIVPTMVHYTCLVDILSRAG 557
           M+R G K DG     +L ACSHSG+++ G+E+F  M+  +G+ P   HY CLVD+L RAG
Sbjct: 572 MRRIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAG 631

Query: 558 HLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVANRCLLSSNPRNPVNHVLMAN 600
            L  AL  I+EMP +     W  FLS CR +  VELGE A   +      +  ++ L++N
Sbjct: 632 RLNAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSN 691

BLAST of CsGy1G007550 vs. TAIR10
Match: AT5G39350.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 349.4 bits (895), Expect = 4.5e-96
Identity = 202/593 (34.06%), Positives = 317/593 (53.46%), Query Frame = 0

Query: 18  LYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIH--GNNFTFPLLLKACANLASIGDGTML 77
           L  +N+ IR  V  G +  ++  +  M   G+    + +T+P + KA   L S+  G ++
Sbjct: 80  LLSYNIVIRMYVREGLYHDAISVFIRMVSEGVKCVPDGYTYPFVAKAAGELKSMKLGLVV 139

Query: 78  HAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRSFRV 137
           H  ++   F  D +VQ +L+ MY  F  +  +R VFD    R VISWN+MI+ Y R+  +
Sbjct: 140 HGRILRSWFGRDKYVQNALLAMYMNFGKVEMARDVFDVMKNRDVISWNTMISGYYRNGYM 199

Query: 138 NEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHG---SLFQGRLLHGCLTKFQLHDDTP 197
           N+AL +F  M+    + + +T VS+L     P  G    L  GR +H  + + +L D   
Sbjct: 200 NDALMMFDWMVNESVDLDHATIVSML-----PVCGHLKDLEMGRNVHKLVEEKRLGDKIE 259

Query: 198 VENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQNN 257
           V+N+LV MY+  G++D A  VF  +  + VI+WT M+ GY + G V    E    M+   
Sbjct: 260 VKNALVNMYLKCGRMDEARFVFDRMERRDVITWTCMINGYTEDGDVENALELCRLMQFEG 319

Query: 258 VVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSAR 317
           V  +      ++S C     +  G  LH   ++  +  +  I   LISMY+KC  +    
Sbjct: 320 VRPNAVTIASLVSVCGDALKVNDGKCLHGWAVRQQVYSDIIIETSLISMYAKCKRVDLCF 379

Query: 318 AVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADLG 377
            VF   S+     W+++I+G        +AL LF    + +V PN A L + + A A L 
Sbjct: 380 RVFSGASKYHTGPWSAIIAGCVQNELVSDALGLFKRMRREDVEPNIATLNSLLPAYAALA 439

Query: 378 SLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIH----RDLAAWS 437
            L     I  ++ + G  S    +T L+H+Y K G++E A K+FN +      +D+  W 
Sbjct: 440 DLRQAMNIHCYLTKTGFMSSLDAATGLVHVYSKCGTLESAHKIFNGIQEKHKSKDVVLWG 499

Query: 438 SMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLD 497
           ++++GY +HG G   + +F EM RSG+ P+   + S L ACSHSGLVE+GL  F+ M   
Sbjct: 500 ALISGYGMHGDGHNALQVFMEMVRSGVTPNEITFTSALNACSHSGLVEEGLTLFRFMLEH 559

Query: 498 YGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEV 557
           Y  +    HYTC+VD+L RAG L+ A N I  +P +  S  W   L+AC T+ +V+LGE+
Sbjct: 560 YKTLARSNHYTCIVDLLGRAGRLDEAYNLITTIPFEPTSTVWGALLAACVTHENVQLGEM 619

Query: 558 ANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL 602
           A   L    P N  N+VL+AN+Y ++G+WK+  KVRS++++ GL K+PG S +
Sbjct: 620 AANKLFELEPENTGNYVLLANIYAALGRWKDMEKVRSMMENVGLRKKPGHSTI 667

BLAST of CsGy1G007550 vs. TAIR10
Match: AT1G18485.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 347.1 bits (889), Expect = 2.2e-95
Identity = 184/594 (30.98%), Positives = 319/594 (53.70%), Query Frame = 0

Query: 12  LITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHG----NNFTFPLLLKACANLA 71
           ++ ++ L  WN  IR   + GF  +S      M      G    +  T   +L  CA   
Sbjct: 247 IMPERNLVSWNSMIRVFSDNGFSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCARER 306

Query: 72  SIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIA 131
            IG G  +H   + +  + ++ +  +L+DMYSK   +  ++ +F   + ++V+SWN+M+ 
Sbjct: 307 EIGLGKGVHGWAVKLRLDKELVLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVG 366

Query: 132 AYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQG-RLLHGCLTKFQ 191
            +S     +    + R+ML GG E   +  V++L+      H S     + LH    K +
Sbjct: 367 GFSAEGDTHGTFDVLRQMLAGG-EDVKADEVTILNAVPVCFHESFLPSLKELHCYSLKQE 426

Query: 192 LHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFS 251
              +  V N+ V  Y   G +  A  VF+ I  KTV SW  ++GG+ ++       +   
Sbjct: 427 FVYNELVANAFVASYAKCGSLSYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHL 486

Query: 252 QMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCG 311
           QM+ + ++ D F    ++S+C +L +L LG  +H  +++  L+ +  +   ++S+Y  CG
Sbjct: 487 QMKISGLLPDSFTVCSLLSACSKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCG 546

Query: 312 DLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAIS 371
           +L + +A+FD + +KS+ SW ++I+GY   G+P  AL +F       ++  G  +     
Sbjct: 547 ELCTVQALFDAMEDKSLVSWNTVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFG 606

Query: 372 ACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAA 431
           AC+ L SL + RE  A+  +  L  D+ ++ SLI +Y K GSI ++ KVFN +  +  A+
Sbjct: 607 ACSLLPSLRLGREAHAYALKHLLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTAS 666

Query: 432 WSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQ 491
           W++M+ GY +HG+ ++ + LF EMQR+G  PD   +  +L AC+HSGL+ +GL +   M+
Sbjct: 667 WNAMIMGYGIHGLAKEAIKLFEEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMK 726

Query: 492 LDYGIVPTMVHYTCLVDILSRAGHLELALNTI-QEMPTQFQSQAWAPFLSACRTYCDVEL 551
             +G+ P + HY C++D+L RAG L+ AL  + +EM  +     W   LS+CR + ++E+
Sbjct: 727 SSFGLKPNLKHYACVIDMLGRAGQLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEM 786

Query: 552 GEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCS 600
           GE     L    P  P N+VL++NLY  +GKW++  KVR  +++  L K+ GCS
Sbjct: 787 GEKVAAKLFELEPEKPENYVLLSNLYAGLGKWEDVRKVRQRMNEMSLRKDAGCS 839

BLAST of CsGy1G007550 vs. Swiss-Prot
Match: sp|Q9ZQ74|PP146_ARATH (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E47 PE=3 SV=1)

HSP 1 Score: 360.5 bits (924), Expect = 3.5e-98
Identity = 203/591 (34.35%), Positives = 323/591 (54.65%), Query Frame = 0

Query: 13  ITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDG 72
           I +   YLW + +R         + ++ Y  +   G   ++  F   LKAC  L  + +G
Sbjct: 102 IPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQDLDNG 161

Query: 73  TMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRS 132
             +H  L+ V    D  V T L+DMY+K   ++++ +VF++ + R+V+ W SMIA Y ++
Sbjct: 162 KKIHCQLVKVP-SFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGYVKN 221

Query: 133 FRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKFQLHDDTP 192
               E L LF  M       N  T+ +L+   A     +L QG+  HGCL K  +   + 
Sbjct: 222 DLCEEGLVLFNRMRENNVLGNEYTYGTLI--MACTKLSALHQGKWFHGCLVKSGIELSSC 281

Query: 193 VENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQNN 252
           +  SL+ MYV  G I +A  VF   S   ++ WT M+ GY   G+V +    F +M+   
Sbjct: 282 LVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVE 341

Query: 253 VVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSAR 312
           +  +      ++S C  + NL LG S+H L +K G+ ++  +   L+ MY+KC     A+
Sbjct: 342 IKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGI-WDTNVANALVHMYAKCYQNRDAK 401

Query: 313 AVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADLG 372
            VF++ SEK I +W S+ISG++  G   EAL LF      +V PNG  +A+  SACA LG
Sbjct: 402 YVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLG 461

Query: 373 SLSMRREIEAFIQQDG-LASDS-QVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSM 432
           SL++   + A+  + G LAS S  V T+L+  Y K G  + A  +F+++  ++   WS+M
Sbjct: 462 SLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKNTITWSAM 521

Query: 433 MNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYG 492
           + GY   G    ++ LF EM +   KP+ S + SIL AC H+G+V +G ++F +M  DY 
Sbjct: 522 IGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMVNEGKKYFSSMYKDYN 581

Query: 493 IVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVAN 552
             P+  HYTC+VD+L+RAG LE AL+ I++MP Q   + +  FL  C  +   +LGE+  
Sbjct: 582 FTPSTKHYTCMVDMLARAGELEQALDIIEKMPIQPDVRCFGAFLHGCGMHSRFDLGEIVI 641

Query: 553 RCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL 602
           + +L  +P +   +VL++NLY S G+W +A +VR+L+  +GL K  G S +
Sbjct: 642 KKMLDLHPDDASYYVLVSNLYASDGRWNQAKEVRNLMKQRGLSKIAGHSTM 688
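
The percentages in each HSP header follow directly from the match counts and the alignment length. The following is a minimal Python sketch that re-derives them for the At2g03380 hit. The helper name `parse_hsp_stats` is hypothetical and not part of any BLAST tool; the regular expression assumes the header layout used in this report and matches the second label loosely, since its spelling can vary between exports.

```python
import re

# Assumed layout: "Identity = a/b (x%), <label> = c/d (y%), ..."
# The second label is matched loosely (Pos\w*) because its spelling varies.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported percentages from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "alignment_length": int(ident_len),
        "identity": int(ident) / int(ident_len) * 100,
        "positives": int(pos) / int(pos_len) * 100,
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


line = "Identity = 203/591 (34.35%), Positives = 323/591 (54.65%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(round(stats["identity"], 2), round(stats["positives"], 2))
```

Comparing the recomputed values with the reported ones is a cheap sanity check when HSP headers are scraped from a page rather than read from structured BLAST output.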

BLAST of CsGy1G007550 vs. Swiss-Prot
Match: sp|Q9M9E2|PPR45_ARATH (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 355.1 bits (910), Expect = 1.5e-96
Identity = 193/588 (32.82%), Positives = 320/588 (54.42%), Query Frame = 0

Query: 13  ITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRH-SGIHGNNFTFPLLLKACANLASIGD 72
           ++++ L+ WN+ +      G+F +++  Y  M    G+  + +TFP +L+ C  +  +  
Sbjct: 155 MSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVLRTCGGIPDLAR 214

Query: 73  GTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSR 132
           G  +H H++  G+E D+ V  +L+ MY K  +++++R +FD    R +ISWN+MI+ Y  
Sbjct: 215 GKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIISWNAMISGYFE 274

Query: 133 SFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKFQLHDDT 192
           +   +E L+LF  M G   +P+  T  S++S  A    G    GR +H  +       D 
Sbjct: 275 NGMCHEGLELFFAMRGLSVDPDLMTLTSVIS--ACELLGDRRLGRDIHAYVITTGFAVDI 334

Query: 193 PVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQN 252
            V NSL QMY+N G    A  +F  +  K ++SWT M+ GY       K  +T+  M Q+
Sbjct: 335 SVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAIDTYRMMDQD 394

Query: 253 NVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSA 312
           +V  D+     ++S+C  LG+L  G  LH L +K  L     +   LI+MYSKC  +  A
Sbjct: 395 SVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINMYSKCKCIDKA 454

Query: 313 RAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADL 372
             +F  +  K++ SWTS+I+G        EAL +F    +  ++PN   L  A++ACA +
Sbjct: 455 LDIFHNIPRKNVISWTSIIAGLRLNNRCFEAL-IFLRQMKMTLQPNAITLTAALAACARI 514

Query: 373 GSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSMM 432
           G+L   +EI A + + G+  D  +  +L+ +Y + G +  A   FNS   +D+ +W+ ++
Sbjct: 515 GALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRMNTAWSQFNSQ-KKDVTSWNILL 574

Query: 433 NGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYGI 492
            GY+  G G   + LF  M +S ++PD   + S+L  CS S +V  GL +F  M+ DYG+
Sbjct: 575 TGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGCSKSQMVRQGLMYFSKME-DYGV 634

Query: 493 VPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVANR 552
            P + HY C+VD+L RAG L+ A   IQ+MP       W   L+ACR +  ++LGE++ +
Sbjct: 635 TPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAVWGALLNACRIHHKIDLGELSAQ 694

Query: 553 CLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCS 600
            +   + ++   ++L+ NLY   GKW+E AKVR ++ + GL  + GCS
Sbjct: 695 HIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKENGLTVDAGCS 737
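
The bit scores and E-values reported for the two Swiss-Prot hits above can be cross-checked against each other. The sketch below assumes the standard BLAST relationship E = m·n·2^(-S'), where S' is the bit score and m·n is the effective search space. The page does not state how its E-values were computed, so this is an assumption, and the reported values are rounded, so only approximate agreement is expected. The function name `log10_search_space` is hypothetical.

```python
import math


def log10_search_space(bit_score, e_value):
    """Return log10(m*n) implied by E = m*n * 2**(-bit_score)."""
    return math.log10(e_value) + bit_score * math.log10(2)


# (bit score, E-value) pairs copied from the two HSP headers above.
hits = {
    "PP146_ARATH": (360.5, 3.5e-98),
    "PPR45_ARATH": (355.1, 1.5e-96),
}

for name, (bit_score, e_value) in hits.items():
    print(name, round(log10_search_space(bit_score, e_value), 2))
```

If both hits come from the same search, the implied search spaces should be close to each other. A large discrepancy would suggest that the score and E-value were taken from different runs or databases.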

BLAST of CsGy1G007550 vs. Swiss-Prot
Match: sp|Q9LFL5|PP390_ARATH (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 4.8e-95
Identity = 215/630 (34.13%), Positives = 321/630 (50.95%), Query Frame = 0

Query: 18  LYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHA 77
           +Y WN  IRS  + G   + L  +  M       +N+TFP + KAC  ++S+  G   HA
Sbjct: 92  VYHWNSLIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGEISSVRCGESAHA 151

Query: 78  HLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRSFRVNE 137
             +  GF S+VFV  +LV MYS+  +L  +R+VFDE S   V+SWNS+I +Y++  +   
Sbjct: 152 LSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKV 211

Query: 138 ALKLFREMLGG-GFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKFQLHDDTPVENS 197
           AL++F  M    G  P++ T V++L   A     SL  G+ LH      ++  +  V N 
Sbjct: 212 ALEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSL--GKQLHCFAVTSEMIQNMFVGNC 271

Query: 198 LVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQNNV--- 257
           LV MY   G +D A +VF  +S K V+SW  M+ GY + G        F +M++  +   
Sbjct: 272 LVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKXX 331

Query: 258 --------------------------------VLDKFVFVDIISSCIQLGNLFLGSSLHS 317
                                                              L  G  +H 
Sbjct: 332 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALMHGKEIHC 391

Query: 318 L-------LLKTGLKYEDPIGCLLISMYSKCGDLLSARAVFDLLS--EKSIYSWTSMISG 377
                   L K G   E+ +   LI MY+KC  + +ARA+FD LS  E+ + +WT MI G
Sbjct: 392 YAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGG 451

Query: 378 YANAGYPREALSLFSMATQNN--VRPNGAMLATAISACADLGSLSMRREIEAF-IQQDGL 437
           Y+  G   +AL L S   + +   RPN   ++ A+ ACA L +L + ++I A+ ++    
Sbjct: 452 YSQHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIGKQIHAYALRNQQN 511

Query: 438 ASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSMMNGYAVHGMGEKTMNLFHE 497
           A    VS  LI +Y K GSI  A  VF++M+ ++   W+S+M GY +HG GE+ + +F E
Sbjct: 512 AVPLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDE 571

Query: 498 MQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYGIVPTMVHYTCLVDILSRAG 557
           M+R G K DG     +L ACSHSG+++ G+E+F  M+  +G+ P   HY CLVD+L RAG
Sbjct: 572 MRRIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAG 631

Query: 558 HLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVANRCLLSSNPRNPVNHVLMAN 600
            L  AL  I+EMP +     W  FLS CR +  VELGE A   +      +  ++ L++N
Sbjct: 632 RLNAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSN 691

BLAST of CsGy1G007550 vs. Swiss-Prot
Match: sp|Q9FLZ9|PP405_ARATH (Pentatricopeptide repeat-containing protein At5g39350 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E16 PE=2 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 8.2e-95
Identity = 202/593 (34.06%), Positives = 317/593 (53.46%), Query Frame = 0

Query: 18  LYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIH--GNNFTFPLLLKACANLASIGDGTML 77
           L  +N+ IR  V  G +  ++  +  M   G+    + +T+P + KA   L S+  G ++
Sbjct: 80  LLSYNIVIRMYVREGLYHDAISVFIRMVSEGVKCVPDGYTYPFVAKAAGELKSMKLGLVV 139

Query: 78  HAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRSFRV 137
           H  ++   F  D +VQ +L+ MY  F  +  +R VFD    R VISWN+MI+ Y R+  +
Sbjct: 140 HGRILRSWFGRDKYVQNALLAMYMNFGKVEMARDVFDVMKNRDVISWNTMISGYYRNGYM 199

Query: 138 NEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHG---SLFQGRLLHGCLTKFQLHDDTP 197
           N+AL +F  M+    + + +T VS+L     P  G    L  GR +H  + + +L D   
Sbjct: 200 NDALMMFDWMVNESVDLDHATIVSML-----PVCGHLKDLEMGRNVHKLVEEKRLGDKIE 259

Query: 198 VENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQNN 257
           V+N+LV MY+  G++D A  VF  +  + VI+WT M+ GY + G V    E    M+   
Sbjct: 260 VKNALVNMYLKCGRMDEARFVFDRMERRDVITWTCMINGYTEDGDVENALELCRLMQFEG 319

Query: 258 VVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSAR 317
           V  +      ++S C     +  G  LH   ++  +  +  I   LISMY+KC  +    
Sbjct: 320 VRPNAVTIASLVSVCGDALKVNDGKCLHGWAVRQQVYSDIIIETSLISMYAKCKRVDLCF 379

Query: 318 AVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADLG 377
            VF   S+     W+++I+G        +AL LF    + +V PN A L + + A A L 
Sbjct: 380 RVFSGASKYHTGPWSAIIAGCVQNELVSDALGLFKRMRREDVEPNIATLNSLLPAYAALA 439

Query: 378 SLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIH----RDLAAWS 437
            L     I  ++ + G  S    +T L+H+Y K G++E A K+FN +      +D+  W 
Sbjct: 440 DLRQAMNIHCYLTKTGFMSSLDAATGLVHVYSKCGTLESAHKIFNGIQEKHKSKDVVLWG 499

Query: 438 SMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLD 497
           ++++GY +HG G   + +F EM RSG+ P+   + S L ACSHSGLVE+GL  F+ M   
Sbjct: 500 ALISGYGMHGDGHNALQVFMEMVRSGVTPNEITFTSALNACSHSGLVEEGLTLFRFMLEH 559

Query: 498 YGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEV 557
           Y  +    HYTC+VD+L RAG L+ A N I  +P +  S  W   L+AC T+ +V+LGE+
Sbjct: 560 YKTLARSNHYTCIVDLLGRAGRLDEAYNLITTIPFEPTSTVWGALLAACVTHENVQLGEM 619

Query: 558 ANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL 602
           A   L    P N  N+VL+AN+Y ++G+WK+  KVRS++++ GL K+PG S +
Sbjct: 620 AANKLFELEPENTGNYVLLANIYAALGRWKDMEKVRSMMENVGLRKKPGHSTI 667

BLAST of CsGy1G007550 vs. Swiss-Prot
Match: sp|O81767|PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 347.1 bits (889), Expect = 4.1e-94
Identity = 204/592 (34.46%), Positives = 316/592 (53.38%), Query Frame = 0

Query: 13  ITKKPLYLWNLTIRSSVNGGFFAQSLETYS-FMRHSGIHGNNFTFPLLLKACANLASIGD 72
           I  + +Y WNL I      G  ++ +  +S FM  SG+  +  TFP +LKAC    ++ D
Sbjct: 112 IQNRDVYAWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKAC---RTVID 171

Query: 73  GTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSR 132
           G  +H   +  GF  DV+V  SL+ +YS++  +  +R +FDE   R + SWN+MI+ Y +
Sbjct: 172 GNKIHCLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQ 231

Query: 133 SFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKFQLHDDT 192
           S    EAL L      G    +S T VSLLS   +   G   +G  +H    K  L  + 
Sbjct: 232 SGNAKEALTL----SNGLRAMDSVTVVSLLSACTEA--GDFNRGVTIHSYSIKHGLESEL 291

Query: 193 PVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQN 252
            V N L+ +Y  FG++     VF  +  + +ISW  ++  Y       +    F +MR +
Sbjct: 292 FVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLS 351

Query: 253 NVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYED-PIGCLLISMYSKCGDLLS 312
            +  D    + + S   QLG++    S+    L+ G   ED  IG  ++ MY+K G + S
Sbjct: 352 RIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDS 411

Query: 313 ARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFS-MATQNNVRPNGAMLATAISACA 372
           ARAVF+ L    + SW ++ISGYA  G+  EA+ +++ M  +  +  N     + + AC+
Sbjct: 412 ARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACS 471

Query: 373 DLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSS 432
             G+L    ++   + ++GL  D  V TSL  +Y K G +E A  +F  +   +   W++
Sbjct: 472 QAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNT 531

Query: 433 MMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDY 492
           ++  +  HG GEK + LF EM   G+KPD   + ++L ACSHSGLV++G   F+ MQ DY
Sbjct: 532 LIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDY 591

Query: 493 GIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVA 552
           GI P++ HY C+VD+  RAG LE AL  I+ M  Q  +  W   LSACR + +V+LG++A
Sbjct: 592 GITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIA 651

Query: 553 NRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL 602
           +  L    P +   HVL++N+Y S GKW+   ++RS+   KGL K PG S +
Sbjct: 652 SEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSM 694

BLAST of CsGy1G007550 vs. TrEMBL
Match: tr|A0A0A0LT91|A0A0A0LT91_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043310 PE=4 SV=1)

HSP 1 Score: 1201.0 bits (3106), Expect = 0.0e+00
Identity = 601/601 (100.00%), Positives = 601/601 (100.00%), Query Frame = 0

Query: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60
           MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120
           KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180
           SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG
Sbjct: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240
           CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK
Sbjct: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300
           VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300

Query: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360
           MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360

Query: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420
           LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI
Sbjct: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420

Query: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480
           HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE
Sbjct: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480

Query: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540
           HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600
           CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600

Query: 601 L 602
           L
Sbjct: 601 L 601

BLAST of CsGy1G007550 vs. TrEMBL
Match: tr|A0A1S3BDP0|A0A1S3BDP0_CUCME (pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103488899 PE=4 SV=1)

HSP 1 Score: 1156.4 bits (2990), Expect = 0.0e+00
Identity = 578/601 (96.17%), Positives = 588/601 (97.84%), Query Frame = 0

Query: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60
           MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQ+LETYSFMR SGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQTLETYSFMRQSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120
           KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSK S+LRASRQVFDETSTRSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKISDLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180
            WNSMIAAYSR FRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG
Sbjct: 121 FWNSMIAAYSRGFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240
            +TKFQ HDDTPV+NSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK
Sbjct: 181 FMTKFQFHDDTPVQNSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300
           VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNL LGSSLHSLLLKT LKY+DPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLSLGSSLHSLLLKTALKYQDPIGCLLIS 300

Query: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360
           MYSKCGDLLSARAVFDLLSEKSIYSWTSMIS YANAGYPREALSLF+MATQNNVRPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISEYANAGYPREALSLFTMATQNNVRPNGAM 360

Query: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420
           LATAISACADLGSLSM REIEAFIQQDGLASD QVSTSLIHLYCKFGS EKAEKVF+SMI
Sbjct: 361 LATAISACADLGSLSMLREIEAFIQQDGLASDYQVSTSLIHLYCKFGSFEKAEKVFSSMI 420

Query: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480
           HRDLAAWSSMMNGYA+HGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGL+
Sbjct: 421 HRDLAAWSSMMNGYAMHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLQ 480

Query: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540
           HFKNMQLDYGIVP MVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPNMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600
           CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVK+PGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKQPGCSQ 600

Query: 601 L 602
           L
Sbjct: 601 L 601

BLAST of CsGy1G007550 vs. TrEMBL
Match: tr|A0A0D2RDY6|A0A0D2RDY6_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_005G014400 PE=4 SV=1)

HSP 1 Score: 651.4 bits (1679), Expect = 2.0e-183
Identity = 330/605 (54.55%), Positives = 440/605 (72.73%), Query Frame = 0

Query: 3   IHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFM-RHSGIHGNNFTFPLLLK 62
           +  F  +S    K+PLYL+NL IR+S N G FA +L+ YS M R + +HGN+FTFPLL K
Sbjct: 1   MRHFPLNSITSKKRPLYLFNLKIRNSTNNGDFADTLKIYSSMLRDTPVHGNSFTFPLLFK 60

Query: 63  ACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVIS 122
           ACA+L S+ DGT LHAH++ +GF+ D+FVQTSL+DMYSK S+L ++R VFDE   R+V+ 
Sbjct: 61  ACASLNSLHDGTKLHAHVLQLGFQQDIFVQTSLLDMYSKCSDLASARNVFDEMVMRNVVC 120

Query: 123 WNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGC 182
           WN+MI+AY R FRV EA+ L +EM   GFE N+STFVS+++        +L  G  +H C
Sbjct: 121 WNTMISAYCRCFRVMEAMNLLKEMWVIGFELNASTFVSVIAACT-----NLRLGLSMHCC 180

Query: 183 LTKF-QLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 242
           + K   LH + P+ NS+V MYV FG ID A S+F  + E++++SWT ++GGY+  G V +
Sbjct: 181 VFKLGLLHCEIPLANSVVNMYVKFGLIDDARSIFDTVDERSILSWTTIIGGYVSVGNVGE 240

Query: 243 VFETFSQMRQNNVV-LDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLI 302
            F  F++MRQ   V  D  +FV IIS C++ GNL L SS+HSL+LK+G   E  I   ++
Sbjct: 241 AFNLFNRMRQMGCVSQDMVLFVKIISGCVKSGNLLLASSVHSLVLKSGFHGEASIDNSVL 300

Query: 303 SMYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGA 362
           +MYSKCGD++SAR VF+++ EK I+ WTSMI+     GYP EAL LF    + +++PN A
Sbjct: 301 NMYSKCGDIVSARRVFEMVDEKCIFLWTSMIAANTQHGYPAEALDLFKSLLRTDLKPNEA 360

Query: 363 MLATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSM 422
            +A+ +SACADLGSLS+  EIE +++ +GLAS+ QV TSLIH+YCK G I+KAE+VF  +
Sbjct: 361 TIASILSACADLGSLSIGNEIEHYVKLNGLASNQQVQTSLIHMYCKCGRIDKAEEVFAGV 420

Query: 423 IHRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKP---DGSVYASILLACSHSGLVE 482
           +H+DLA WSSM+NGYA+HGMG + + LFH MQ +  KP   D  V+ SILLACSHSGLVE
Sbjct: 421 LHKDLAVWSSMINGYAIHGMGNEALKLFHRMQIT--KPCSLDHVVFTSILLACSHSGLVE 480

Query: 483 DGLEHFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSA 542
           DGL+++K+M+ DYGI P + HYTCLVD+L RAGH +LAL TIQEMP Q Q+Q WAP LS+
Sbjct: 481 DGLKYYKSMKDDYGIEPGIEHYTCLVDLLGRAGHFDLALKTIQEMPLQVQAQVWAPLLSS 540

Query: 543 CRTYCDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEP 602
           CR +C +ELGE   + LL  NP N  ++VLMAN+YTS GKWKEAAK RS++ +KGLVKEP
Sbjct: 541 CRKHCKIELGEYVAKKLLDLNPGNTSSYVLMANIYTSAGKWKEAAKTRSMMRNKGLVKEP 598

BLAST of CsGy1G007550 vs. TrEMBL
Match: tr|A0A1U8MA90|A0A1U8MA90_GOSHI (pentatricopeptide repeat-containing protein At4g21065-like OS=Gossypium hirsutum OX=3635 GN=LOC107935563 PE=4 SV=1)

HSP 1 Score: 648.3 bits (1671), Expect = 1.7e-182
Identity = 328/605 (54.21%), Positives = 439/605 (72.56%), Query Frame = 0

Query: 3   IHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFM-RHSGIHGNNFTFPLLLK 62
           +  F  +S    K+PLYL+NL IR+S N G FA +L+ YS M R + +HGN+FTFPLL K
Sbjct: 1   MRHFPLNSITSKKRPLYLFNLKIRNSTNNGDFADTLKIYSSMLRDTHVHGNSFTFPLLFK 60

Query: 63  ACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVIS 122
           ACA+L S+ DGT LHAH++ +GF+ D+FVQTSL+DMYSK S+L ++R VFDE   R+V+ 
Sbjct: 61  ACASLNSLHDGTKLHAHVLQLGFQQDIFVQTSLLDMYSKCSDLASARNVFDEMVMRNVVC 120

Query: 123 WNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGC 182
           WN+MI+AY R FRV EA+ L +EM   GFE N+STFVS+++        +L  G  +H C
Sbjct: 121 WNTMISAYCRCFRVMEAMNLLKEMWVIGFELNASTFVSVIAACT-----NLRLGLSMHCC 180

Query: 183 LTKF-QLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 242
           + K   LH + P+ NS+V MYV FG ID A S+F  + E++++SWT ++GGY+  G V +
Sbjct: 181 VFKLGLLHCEIPLANSVVNMYVKFGLIDDARSIFDTVDERSILSWTTIIGGYVSVGNVGE 240

Query: 243 VFETFSQMRQNNVV-LDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLI 302
            F  F++MRQ   V  D  +FV IIS C++ GNL L SS+HSL+LK+G   E  I   ++
Sbjct: 241 AFNLFNRMRQMGCVSQDMVLFVKIISGCVKSGNLLLASSVHSLVLKSGFHGEASIDNSVL 300

Query: 303 SMYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGA 362
           +MYSKCGD++SAR VF+++ EK I+ WTSMI+     GYP EAL LF    + +++PN A
Sbjct: 301 NMYSKCGDIVSARRVFEMVDEKCIFLWTSMIAANTQHGYPAEALDLFKSLLRTDLKPNEA 360

Query: 363 MLATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSM 422
            +A+ +SACADLGSLS+  EIE +++ +GLAS+ QV TSLIH+YCK G I+KAE+VF  +
Sbjct: 361 TIASILSACADLGSLSIGNEIEHYVKLNGLASNQQVQTSLIHMYCKCGRIDKAEEVFAGV 420

Query: 423 IHRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKP---DGSVYASILLACSHSGLVE 482
           +H+DLA WSSM+NGYA+HGMG + + LFH MQ +  KP   D  V+ SILLACSHSGLVE
Sbjct: 421 LHKDLAVWSSMINGYAIHGMGNEALKLFHRMQIT--KPCSLDHVVFTSILLACSHSGLVE 480

Query: 483 DGLEHFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSA 542
           DGL+++K+M+ DYGI P + HYTCLVD+L RAGH +LAL TIQEMP Q ++Q WAP LS+
Sbjct: 481 DGLKYYKSMKDDYGIEPGIEHYTCLVDLLGRAGHFDLALKTIQEMPLQVEAQVWAPLLSS 540

Query: 543 CRTYCDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEP 602
           CR +C +ELGE   + LL  NP N  ++VLMAN+YTS GKWKEAAK RS++ +KGL KEP
Sbjct: 541 CRKHCKIELGEYVAKKLLDLNPGNTSSYVLMANIYTSAGKWKEAAKTRSMMRNKGLFKEP 598

BLAST of CsGy1G007550 vs. TrEMBL
Match: tr|A0A061F704|A0A061F704_THECC (Pentatricopeptide repeat superfamily protein, putative OS=Theobroma cacao OX=3641 GN=TCM_031344 PE=4 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 1.4e-179
Identity = 326/588 (55.44%), Positives = 431/588 (73.30%), Query Frame = 0

Query: 18  LYLWNLTIRSSVNGGFFAQSLETYSFMRH-SGIHGNNFTFPLLLKACANLASIGDGTMLH 77
           +YL NL IR+S+N G FA +L+ YS M H S +HGN+FTFPLL KACA L S+ DGTMLH
Sbjct: 13  IYLVNLRIRNSINNGHFADTLKIYSSMLHNSNVHGNSFTFPLLFKACAALTSLRDGTMLH 72

Query: 78  AHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVISWNSMIAAYSRSFRVN 137
           AH++ +GF  D+FVQTSL+DMYSK S L ++R VFDE  +R+VISWN+MI+AY R FRV 
Sbjct: 73  AHVLELGFVHDIFVQTSLLDMYSKCSCLVSARNVFDEMLSRNVISWNTMISAYCRGFRVM 132

Query: 138 EALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHGCLTKF-QLHDDTPVEN 197
           EA+KL +EM   GFE ++STF+S+++  A     +L  G  +H C+ K   L  + P+ N
Sbjct: 133 EAIKLLKEMWVLGFELSASTFISVVAACA-----NLQLGLSMHCCIFKLGLLQCEIPLAN 192

Query: 198 SLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAKVFETFSQMRQ-NNVV 257
           SL+ MYV FG I+ A SVF  + E++++SWT ++GGY+  G V + F  F++MR+   V 
Sbjct: 193 SLMNMYVKFGFINGARSVFDTMDERSILSWTTIIGGYVNVGNVGEAFSLFNRMRKVEGVS 252

Query: 258 LDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLISMYSKCGDLLSARAV 317
            D  +F+ IIS C+Q GNL L SS+HSL+LK G   ED +  L+++MY+KCGD+ SA+ V
Sbjct: 253 QDMVLFIKIISGCVQAGNLPLASSIHSLVLKCGYDGEDLMHNLVLNMYAKCGDIGSAQRV 312

Query: 318 FDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAMLATAISACADLGSL 377
           F+++ EK I+ WTSMI+ Y   GYP EAL LF    +  ++P+ A  AT +SACADLGS 
Sbjct: 313 FEMVDEKCIFLWTSMIAAYTQFGYPAEALDLFKRLLRTGLKPHEATFATILSACADLGSP 372

Query: 378 SMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMIHRDLAAWSSMMNGY 437
           S+ +EIE +++ +GLAS+ QV TSLIH+YCK G +EKAE+VF  ++H+DLA WSSM+NGY
Sbjct: 373 SLGKEIEHYVKINGLASNRQVQTSLIHMYCKCGIVEKAEEVFVEVLHKDLAVWSSMINGY 432

Query: 438 AVHGMGEKTMNLFHEMQ-RSGIKPDGSVYASILLACSHSGLVEDGLEHFKNMQLDYGIVP 497
           A+HGMG + +NLFH+MQ       D  V+ SILLACSHSGLVEDGL++FK+M+  YGI P
Sbjct: 433 AIHGMGNEALNLFHQMQITESFSLDHVVFTSILLACSHSGLVEDGLKYFKDMKRVYGIEP 492

Query: 498 TMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTYCDVELGEVANRCL 557
           ++ HYTCLVD+L RAGH +LAL TIQE+P Q Q+Q WAP LSACR Y +V+LGE   R L
Sbjct: 493 SIEHYTCLVDLLGRAGHFDLALKTIQEIPVQVQAQVWAPLLSACRKYRNVDLGEYIARKL 552

Query: 558 LSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQL 602
           L  NP N  N+VLMANLYTS GKWKEAA  RS++ ++GLVKEPG SQ+
Sbjct: 553 LELNPGNTSNYVLMANLYTSGGKWKEAAITRSMLRNRGLVKEPGWSQI 595

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_004137641.1	0.0e+00	100.00	PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativu...
XP_008446053.1	0.0e+00	96.17	PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-...
XP_022137264.1	1.5e-283	82.20	pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia]
XP_020221554.1	5.6e-185	55.20	pentatricopeptide repeat-containing protein At4g19191, mitochondrial-like [Cajan...
RDY03240.1	1.6e-184	55.76	Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Mucuna...

Match Name	E-value	Identity	Description
AT2G03380.1	2.0e-99	34.35	Pentatricopeptide repeat (PPR) superfamily protein
AT1G15510.1	8.3e-98	32.82	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G16860.1	2.7e-96	34.13	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G39350.1	4.5e-96	34.06	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G18485.1	2.2e-95	30.98	Pentatricopeptide repeat (PPR) superfamily protein

Match Name	E-value	Identity	Description
sp|Q9ZQ74|PP146_ARATH	3.5e-98	34.35	Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop...
sp|Q9M9E2|PPR45_ARATH	1.5e-96	32.82	Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
sp|Q9LFL5|PP390_ARATH	4.8e-95	34.13	Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX...
sp|Q9FLZ9|PP405_ARATH	8.2e-95	34.06	Pentatricopeptide repeat-containing protein At5g39350 OS=Arabidopsis thaliana OX...
sp|O81767|PP348_ARATH	4.1e-94	34.46	Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
tr|A0A0A0LT91|A0A0A0LT91_CUCSA	0.0e+00	100.00	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043310 PE=4 SV=1
tr|A0A1S3BDP0|A0A1S3BDP0_CUCME	0.0e+00	96.17	pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like OS=Cuc...
tr|A0A0D2RDY6|A0A0D2RDY6_GOSRA	2.0e-183	54.55	Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_005G014400 PE=4 ...
tr|A0A1U8MA90|A0A1U8MA90_GOSHI	1.7e-182	54.21	pentatricopeptide repeat-containing protein At4g21065-like OS=Gossypium hirsutum...
tr|A0A061F704|A0A061F704_THECC	1.4e-179	55.44	Pentatricopeptide repeat superfamily protein, putative OS=Theobroma cacao OX=364...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding

Vocabulary: INTERPRO
Term	Definition
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat

GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0090305	nucleic acid phosphodiester bond hydrolysis
biological_process	GO:0009451	RNA modification
cellular_component	GO:0043231	intracellular membrane-bounded organelle
molecular_function	GO:0004519	endonuclease activity
molecular_function	GO:0003723	RNA binding
molecular_function	GO:0008270	zinc ion binding
molecular_function	GO:0005515	protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsGy1G007550.1	CsGy1G007550.1	mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 496..522; e-value: 0.043; score: 14.0
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 298..321; e-value: 0.26; score: 11.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 223..253; e-value: 3.8E-4; score: 20.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 324..348; e-value: 1.9E-6; score: 27.7
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 397..422; e-value: 1.3E-5; score: 25.0
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 118..165; e-value: 7.5E-13; score: 48.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 423..469; e-value: 3.7E-8; score: 33.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 397..424; e-value: 1.7E-5; score: 22.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 223..256; e-value: 1.4E-4; score: 19.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 426..458; e-value: 2.1E-8; score: 31.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 324..357; e-value: 1.5E-5; score: 22.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 120..153; e-value: 1.9E-8; score: 32.0
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 256..290; score: 5.974
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 221..255; score: 9.646
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 17..51; score: 6.325
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 423..457; score: 12.321
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 118..152; score: 12.726
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 322..356; score: 10.983
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 190..220; score: 6.445
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 458..493; score: 7.914
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 494..524; score: 7.41
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 52..86; score: 6.007
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 392..422; score: 9.087
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 291..321; score: 5.382
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 560..594; score: 6.884
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 87..117; score: 6.106
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 380..487; e-value: 1.4E-24; score: 89.2
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 170..292; e-value: 4.3E-13; score: 51.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 488..600; e-value: 1.4E-6; score: 30.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 293..379; e-value: 2.7E-16; score: 61.4
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 22..169; e-value: 5.5E-28; score: 100.2
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 18..317
None	No IPR available	PANTHER	PTHR24015:SF498	SUBFAMILY NOT NAMED	coord: 18..317
None	No IPR available	PANTHER	PTHR24015:SF498	SUBFAMILY NOT NAMED	coord: 310..600
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 310..600
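
The PROSITE PS51375 coordinates in the InterPro table above are listed out of order, which makes it hard to see how the predicted PPR motifs tile the protein. A minimal Python sketch that sorts them and merges motifs that overlap or directly abut; the helper name `merge_adjacent` is hypothetical, and treating abutting motifs as one contiguous block is an illustrative choice, not something the annotation pipeline is known to do.

```python
# PS51375 (PPR) profile coordinates copied from the InterPro table above,
# as 1-based inclusive (start, end) pairs on the protein sequence.
ppr_hits = [
    (256, 290), (221, 255), (17, 51), (423, 457), (118, 152),
    (322, 356), (190, 220), (458, 493), (494, 524), (52, 86),
    (392, 422), (291, 321), (560, 594), (87, 117),
]


def merge_adjacent(intervals):
    """Merge inclusive intervals that overlap or directly abut."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current block instead of starting a new one.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]


blocks = merge_adjacent(ppr_hits)
covered = sum(end - start + 1 for start, end in blocks)

print(len(ppr_hits), "motifs in", len(blocks), "contiguous blocks")
print(blocks)
print(covered, "residues covered")
```

The gaps between the merged blocks are a quick way to spot stretches where the PROSITE profile found no motif, which can then be compared with the PFAM and TIGRFAM coordinates in the same table.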