CsGy1G030320 (gene) Cucumber (Gy14) v2

Name: CsGy1G030320
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: pentatricopeptide repeat-containing protein At1g62350
Location: Chr1: 28575477..28584000 (+)
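As a quick check on the location above, the genomic span of the gene can be computed from its coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF3 convention, which this page does not state explicitly); `parse_location` and `feature_length` are hypothetical helper names.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr1 : 28575477 .. 28584000 (+)'.

    Returns (sequence id, start, end, strand). Whitespace around ':' and '..'
    is tolerated.
    """
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1

seqid, start, end, strand = parse_location("Chr1 : 28575477 .. 28584000 (+)")
print(seqid, strand, feature_length(start, end))
```

Under that assumption the gene, introns included, spans 8524 bp on the forward strand of Chr1.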
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, three_prime_UTR
TTTTTTTCTTTAAATTTAAGAAAATGGTTTTAAACCTCACCAAATAGTATCCACTTTCACAAGCTGCACAAACCCCGCCCAATTCTCTTGCGTTTCGAAAAGAGAGAAACTTCGGTCCTGTATTGCTTCAAGGTTCCTCCATAATTTTGATGCTTCGTCTTGCTCCAAATCTTCTTCGCAAAATCTCAAGCAGCCCCATTTCCTCAACCCCTTTCCATCGTTTTTCCCTTTCCACATTTCTCAACCTCGATCTGTTACAGCAACAATTGCTGTTACGCTTCATCACTGGCTCTGCTTCCAGCCCTAGCCTCTCAATATGGAGGAGGAAGAAGGAGATGGGCAAGGAGGGTCTGATTGTAGTCAAAGAACTCAAGAGACTTCAGTCCAATTTCATTCGCCTCGACCGCTTCATTTCCTCCCATGTCTCTCGCTTGCTCAAGTCCGACCTTGTTGCGGTTCTTGTTGAGCTTCAGAGACAGAATCATGTCTTTCTCTGCATGAAGGTATGCCCTCATTTACTCGCTTTATTATTATCTCTCTTCCCCTTTATGACATATGTTGAATTTTCAGCTCTGTTAAGCGACATGAGAAGTTAATTAGTTTCTTGCTAGTGTATTGGTTAATTCGGAATTGCCGTAATTGGGATGAATTACGTATATGGGAATCATTTTTACCTTAGCATTCACGGATTGGAAATGACATAGGTTTGGTTTTTGTCGATTTTTCATCAACCTTTGTGTGTTTTTTAGTAAAATTCCCGTTCAAGTATTTGATGACTGCGTTTTGAAGTAGTCGAAAGTTATGTGAACTCTAGAAGCGTGGTTGGTGAAAATAGATACGACTATGGCATGCGAACCTTCTTTTAGCTCAATTCAAATTAGGAGTACTTGCTTGCATTAGCCTTTACACTATGTCTGTTTGAAGTGAATAGAAGTCCCGTTGATTGAGATCAGGGCTGAAAGATTTTCTTCTACAGGCTGGGTGTACTGGGAATTTAATATAAAAGAACGGCTTAAATAAGTAGGCTAAATTATCAAATACCAACCGTTATTGAGTTGAGACTGTGTATGTGAATGGATATATTTCATGTTATCTGTACTAGTAGTAATATAGGAATGGAAATATTTCATGTTATCTGTACTAGTAGTAATATGGGAAGAGGGAAGAGTAGTGATGCAAAGTTCGTCAAGTTTCAATGTATTGAAGTTATCTTGGGACAGGTCTTTAAGGAAATGCCTTCCTCTCTTCTTAATGGTGAGATTCCCTACTTTGTTCTTTTTCCTACCAAGAATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTTCTCATCATACTAAGTTAGATCCCGAATCCTTGAAGTGTATCTTCTTGGGCTATTCATGTGTTCAAAAGGTATCGGTGTTATTGTCCTATCCTTAAAAGGTATCTTGTTTCGCCTGATGTTGCATTTTTTGAGGATATACTCTTTACCTCATCACCATCGAGTTCGTGTCAGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTCTTCCACTGATGCGCCTCCTTCCCGCCTGTTGATTTCTCGAGTCTACTTTCGACGACCTCCACCTCAACCTTCAGGCTCATGTCCTCCATCATGCAATCTGAGGCTAAGTGATGATCTTCCTATTGCTCTTCACAAAGGTAAACGCAAGTGTACTTACCATATTTTTTCGTTTGTTTCTATCACTAGTTGTCTCTCCCCACATATGCTTTTATTACGTCTCTTGACTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTATTTCATCCTGGATGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTTCTGCAGGAAAGAAGGCCATTGGTTATAAATGGGTGTTTGCTGTCAAGATGAATCTTGATGGAATCGTGGCTTGATTGAAAGCTC
ACCTTGTTGCCAAAGGTTATGCTCAAATCTATAGAAATGATAATTCAGATACATTTTCTTCGGTTACCAAATAAACTTTCATCCGCTATTTCTTTCCATGACTGCTATCCATAAATGACCTTTGCATCAACTTGGCATTAGGAATGCTTTTCTGCATGACGATCTTCAAGAGGAAGTTTATATAGAGCAACCACCTAGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTTGCCTTCGAAAATCTTTGTATGGTTTAAAGCAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCTCTTGTATGATTTGGTATGTAGAAGAGTACATCTGATCATTCGGTTTTCTATCGTCGATCTGATAATGGTATAGTTCTACTAGTTGTATATGTTGATGATATTTTTTTTACTGGAAATGATGCATCGGGTATTTCGTCTCTCAAAACTTTCCTTCAGGGTCAGTTTTATACGAAAGATTTGAGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAAAAAGGTATTTATTTGTCTCAATAAAAATATGTACTTGATTTGTTGTCTGAGACAGAAAAATTAGGAGCCAAACCAAGTGGCACTCTGATGATTCCTAATCAGCAACTTGTTAAAGAAGAGAATTATGTAAAGATCCTGAGATATAGGAGATTAGTTGGGAAGTTGAACTATTTAACAGTGACTCGACCAAACATTGCTTGTTCTGTAAGTGTTGTAAGTCCGCTCATGTCTTCTCTTACAGTAGATCATTGGGCTACAGTAGAGCAAATTTTGTGTTATCTGAAAGTTGCTCCTGGACGTGGGATCTTATCCAAAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCTAATTGGGGGAGATCTTGTGAGGATAGGAGATCAGCTTTTGGATATTGTGTCTTTGTAGGTGGAAACTTAGTATCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTCGAGTGTTGAGTCAGAATATAGAGCTATGGTACAATCTGTGTGCGAAATAGTGTGGATTCACCAACTATTATATGAGATAAGTTTCAGCATTACAGTGCCAACTAAATTATGGTGTGATAATCAAGTTGCACTTCACATTGCATCTAATCCATTATTTTATGAACGAACTAAACATGTAACTTCATTCATGAGAAAATCTAAGATGGATTGGTGTCCACAGGATATGTGAAGACTGGAGAACAATTGGGAGATATTTTGGCTAAAGCTTTAAAGATGGAGCAATGATAAGCTATCTGTGCAACAAGTTGGACATGATCGACATATTTGCTTTTGCTTGAGGGGAGTGTTATGATATATAGTTATAACACATGTCCTTTATTGTAATTGTACACAGTGTCTAGGGGTTATTTTTCTTGTTGAAGAGAGCACTACTGGGTTGGGGAATATACACACCAGTTGGAGAGCTTTCAATGCGAAGGACGAAAGCGTCGTCTAATGGTGGAATCTTAGAGTCAGAGAGAATCTGTGCCTTTGGCATTTCAAATTCAGCTAAGAGTCCATTCAAAAAGATCATAACAACCATCTTCTCTCGTTGAGCTTGTTGAACTTTAACAACATGACTAAAAGGTAACAACAAGCCAAGCTCGACAGTTATCTTCTTAAGGCACTTAAAGTAGTTGGTAACAGACTTAGCTTTTTGTTCAGCACGAAAAAATTTCATACAAACCTCAAACATTTTATGTACTTGCTCTTTACTTGAATATAAAAAATCTAAAAATTCCAGAAGTTCTTGAACAGACTCACAATGATCAACCAATCCAATTATCTCACTCTCAATGGAGTTTTTGATCTGAAGATATAAACGGGCATCATCACGAAGCCAATAATTTTTCTGCTTTGCATCTTCTGGGAGATCTTTGCTCATATGATCATCCATATTAGTACTTCTCAAAATAGAACAAAATTGTCCGACGCCAGTCGTAATAATTAGATTAATTTAACTTGATGTT
TTGTAATCTTAGAGGCTAAGGGAATAATGTTGGAGATTATCAAATTTTTTTATGTCAGCAAAATTGAAGTCACACATTGATTGTAGTCTACACAATCAAAGAAAAAATCAATCCAAATAGATAGAAAATTGGAATAAAACCCTAAAACAATGGCGTAGCCTTCAGTTTTTGTCAAACCGAATACTATTTGATAACAACACAGATTGACCGAAAACAAAGTTTGAGACAAGCCCAGAAGACGGAGGTCAAATCGGAGTCCTCACGCACTAGCGCGTGGATGACGTGCAAAGATATTTCGACAGCGTGTGAAGCTCACGCGCGGTGTTTTCCGACGATCGACGAAGGCAGTCTTGGGCTGATGGAGGTGGCGCTTCTTATGGTCTGGACGATGTTGACAAACGATCAGTGGAAGGTAACCTGAACTTAAGCCTAATGGAGAAGGAAGCTAACGACCATTCGAAAGCCTAACAACTCCAATACCATGTAATCCAGAATTCAAAGAAGTGTACTTATTCATTTGAATCAAAGAAAATTTACATATTTATATAGAGAACAAAATAAACCCTAGACACTATGTACAATTACAATAAGAGACATATGTTATAACTATATATCATAACACAATCCATGGTGGCCACCTACCAAGGAAATTTTCTAGAGTTTTCTTGACACCCAAATGTTGTGTGGTCAGCTGGTTAGGCGAGTTGTATAAAAAAACATGTCTTAGAGGTCGGATGAATTAGGGGTGGGTTGTCTTAAACAATGAAATGCTGCTCTTTTGTTAAAGTGGTTTTGGCCGTTCTCCCTGCAAGAAAATAGTATGGGGAGGAGAGTTGTTGTTGCTATGTATGCCATATAGTCAAATGGTTGGCTAATTAAACGTGCTAAAATTGTTGCCAATGGAAGGCCTTGGATAGATCTTGAGAAAATAAAAATAGGTAATTGTTCCTTGATCTAGTAAATTAGAAGTGTGCAGATGGCACAAAAATTCGATTTTGAAAAGATACATGGACCTCACATGCTCCTCTCCATGTGAAATACTCTGTTCTATATGCCTTTATCTTCCAAAATAGAGGCTTCAATCTCAGACTGTTGGTAAGTTACTGTAAATACTTGGTATTTGGGGTCGAGGAGGCACTTTTTTTACTTTGAGATTCTTATAGGTGCTTAGTTCATAGAGCATATTAAGGTTGGTAGGCTTGGAAGTAGTTTGGACTGACTTGTGGAAGTTGACCTCTTTGGCACTTTGAATGCTAAATCTGTTTTCCTTCATTTGATGCAGTCTCATGCCAATTCAATGTACTCGTTGACTATATATGGGAGATAAAGGTTCCCATGAGTGTTTTACTTGGTCATTGATGAAGACAAATTTAATATTATATCACTCAACACTCCTCCTCACTTGTGGGCTTGATATATGAAGAATGCCCAACAAATTGAAATCAATTTTAAATGGGGATGAAATAACGTTGCAAGGGTGTGCACAAGACCTTTGGGCCCTACTGCTCTGATACCATCTTAAATCACTTATTAACTCTAAAGCTTAAGCTAGTGAGCGTGAAGGAAAATTTTATTTTATATCATTCAACACCTTGCATCCATGTCCTCTAAGTTCTTTATTATTTTCACGACCCTTATTGACCAATGGACATTGTCATTGTTGGGAGCATGGATTTTAAGGGCAAAGTATGGATTTTGGAAGCCTATTGTGTAATTAGAACTGTCCTTTTTCAGACTTGGAAGGAAATAAATCAAGACTTTTGGATGTCTAAGTTGGTCGTTTACTTTAGGAACAATGTATTGTATACTACATCTTGATGGTGTACAAACAACAAAAAGTTTTTTCTACTTACAACCTCATTATTTTGAATGGTTCTGGATTTGTTGGCTTAGTTGTTGTATGTCAGGGGCTTTTTCATCCCCTGATCCTTTAGGCCGTCCTATTTCTTTTGTATATAATACAACCTTGTGCTTCTTACAAAAAGAAAAAGAACAAAGAA
AATCGATACATACAAGAACTTAAACTTTGTAATGTTTTATTGGCAAATTAGTTATAGCGATTCCTATTTGGAGCCTCATGCTTTTGGCGGGGAAATGTTGCAATGCCAAAACCAAAAATACATGGATCTTGGGTCGGGGATCAACTCCCTTACCAAAAATCACTAGTAGATCTTGCTATTCTTTCTGAGTTGCCTGATAAATCTGTCCCCTGCTAATGCCTCTATCTCCTCCTCGACATGGTGGTAAAATTTTCCAAGAAGGTGAAGTTCTTTGTGCAGCAAATTTTGCACAGATGAGTTAACACTTTGGCTTGGATCTTGGCCATAGAGTCCTCCTTGGTTGGGCTCACTGTTGTGTTGTTTACACGCAGGTGGCTTGAGGACCTTGATCATATACTTTGGAGTTGTGTTTTCACTGCTTATTTTGGTTTTTATAGCCTTAGCTTCCCGGGTTATAGGGAGTCAATTGAAGTTCTGCATGGGGAACTTCACCCTTTTTCAGTTTATCAGATTGCAAGTTTGGTAGATAAAATGTGTCTATGAGTGGATGTCATTGGGTCAGGAGTGCTGCTTTTGGAGATAAAAACCTTGAAGTATGCTAGCCTTTTTTAATGTACACCTTACATTAGTTTAAGGCATGTTTGTTTCCCAGGAGGCGAGATGCAATGTGTCCATGAGTGATACATCTATTAAAAACAACAATTGTAGCATTATATTATTAATAGGGGGTGAAGTCCACTATGACATAAGATGTATTGCATAACTAGGCGTGTATGTCTCTGGTGACAATAAAGTAATTTAACAAATAACAATCCTCTTATGTAAGTTTTAAGTAAAATATATAAAAAAATATTACTTATGCTTAATTTTGTGTTTCGCAGTAGGCCTTTGATGAATTTACATTACTATATTGTACTTTTATGTTCATGAGATGTTGAAGAAAAAAATTGAAAATATTATGTCAGAAACAATTTGTAGGCAGATTTGGCTAAGGTGAGCTTAGCTGATGCATGACCTTCCATCACAGAGGTGGGAGATTTTTCTTGTATTAAAAGAAAAAAATTGAGGCAGATTTGTTTGTTTTTCTGGTTACTTTTTGGCTTTGAAGGGCCTACTCTTTTCCCAATTATATTTCTCTTAAACCATCATGAGTTGGCCTAGTGATAAAAAAATGAGATAGTCTCAAGTACTAACTGGGAGGTTATGAGTTCAATCTATGGTGGACACCTAGCTAGGAATTATTTTCGAGTCAGACGGGTTGTTTCGAGAGATTAGTCGAGCTGCACGTAAGTTGGTCTGGACACTCACAGATATAAAAATAGAATTACATTTATCTTGCAACAATGCATGCTTATGACTATATGTCCTAAAGAATCTACTCTACTGAAAATTTATCCTATTGTCGTGGGGCCTATTCTCTTTTCAATTATATTGTCTATGATGCGTGCGTGTTTGGATGTCTTATGATCTGTTTTGATAGTTTAATCTTTTGTCTTTAACACCTGTTTGACTAGGCAAAAAAAATGTTTTTTACTTAAACTTTTTTTTTTTTGCTAAAAACTTTAATATAAACTTGAAAATGTTTGAAGCCCAATTTTTTTTTTCAAATGACTTGTTTTCAAAATTAAATATGTGATAAGTTAAACCAAACATTTTATTATTTAACAAACCCACCATTTGTATTTTATTAAAAAAATGTCTTCTTAAGTTGATCTCACCTTTGTTTTTATCCCATTAATTGCATTATTAACTCCTGCCTGACCCTTTCAGTTGTATAACGTGGTTCGTAAAGAAGTATGGTACCGGCCCGACATGTTCTTTTATAGAGATATGCTTATGATGCTTGCAAAGAACAAAAGGGTAGAGGAAACAAAACAAGTATGGGAGGATCTGAAAAAAGAGGGAGTGTTATTTGATCAGCATACTTTTGGAGACATTATTCGAGCATACTTGGATAATACAATGCTCTCTGAGGCCATGGATATATATCGTGAAATGAG
AGAATCTCCTGATAGGCCCTTATCTTTGCCTTTTCGTGTCATTCTTAAGGGCCTTATTCCATATCCAGAATTGAGAGAACAAGTTAAAGATGACTTCTTGGAGCTCTTCCCTGATATGATCGTCTACGATCCACCAGAAGACTTGTTTGAAGAAGATGAAGATAGAAACAAGAGTGAAGATGATTAACTAATTAAGAATGCTGCTCTTGCAGTTGTAGCTTTAAGTTTTGAATTTTGATAAGTCTCTAAATTTTCAAAAATATCAATGAAATCTTTCATTTTTCAACTACGTATCTGGTTAATTCGTGAACTTTAAAAGTGTTGTGTCTTATTAGTTCTTTAATTTTTTATTTTGTGTGACCCATTAGACAAGTTTTAAAATTCACGCAAGTATTTGACGCAAAGCTGAAAGGTAAATCTAGTTTTGTGTTCAATAAACATGTGTGTTTTGAAAACTATCCATTTTGTAAGACATTTTTGAACTTCAGAAATTTATTTGACATAATCTGGGAAGTTTAGCATAA

mRNA sequence

TTTTTTTCTTTAAATTTAAGAAAATGGTTTTAAACCTCACCAAATAGTATCCACTTTCACAAGCTGCACAAACCCCGCCCAATTCTCTTGCGTTTCGAAAAGAGAGAAACTTCGGTCCTGTATTGCTTCAAGGTTCCTCCATAATTTTGATGCTTCGTCTTGCTCCAAATCTTCTTCGCAAAATCTCAAGCAGCCCCATTTCCTCAACCCCTTTCCATCGTTTTTCCCTTTCCACATTTCTCAACCTCGATCTGTTACAGCAACAATTGCTGTTACGCTTCATCACTGGCTCTGCTTCCAGCCCTAGCCTCTCAATATGGAGGAGGAAGAAGGAGATGGGCAAGGAGGGTCTGATTGTAGTCAAAGAACTCAAGAGACTTCAGTCCAATTTCATTCGCCTCGACCGCTTCATTTCCTCCCATGTCTCTCGCTTGCTCAAGTCCGACCTTGTTGCGGTTCTTGTTGAGCTTCAGAGACAGAATCATGTCTTTCTCTGCATGAAGTTGTATAACGTGGTTCGTAAAGAAGTATGGTACCGGCCCGACATGTTCTTTTATAGAGATATGCTTATGATGCTTGCAAAGAACAAAAGGGTAGAGGAAACAAAACAAGTATGGGAGGATCTGAAAAAAGAGGGAGTGTTATTTGATCAGCATACTTTTGGAGACATTATTCGAGCATACTTGGATAATACAATGCTCTCTGAGGCCATGGATATATATCGTGAAATGAGAGAATCTCCTGATAGGCCCTTATCTTTGCCTTTTCGTGTCATTCTTAAGGGCCTTATTCCATATCCAGAATTGAGAGAACAAGTTAAAGATGACTTCTTGGAGCTCTTCCCTGATATGATCGTCTACGATCCACCAGAAGACTTGTTTGAAGAAGATGAAGATAGAAACAAGAGTGAAGATGATTAACTAATTAAGAATGCTGCTCTTGCAGTTGTAGCTTTAAGTTTTGAATTTTGATAAGTCTCTAAATTTTCAAAAATATCAATGAAATCTTTCATTTTTCAACTACGTATCTGGTTAATTCGTGAACTTTAAAAGTGTTGTGTCTTATTAGTTCTTTAATTTTTTATTTTGTGTGACCCATTAGACAAGTTTTAAAATTCACGCAAGTATTTGACGCAAAGCTGAAAGGTAAATCTAGTTTTGTGTTCAATAAACATGTGTGTTTTGAAAACTATCCATTTTGTAAGACATTTTTGAACTTCAGAAATTTATTTGACATAATCTGGGAAGTTTAGCATAA

Coding sequence (CDS)

ATGCTTCGTCTTGCTCCAAATCTTCTTCGCAAAATCTCAAGCAGCCCCATTTCCTCAACCCCTTTCCATCGTTTTTCCCTTTCCACATTTCTCAACCTCGATCTGTTACAGCAACAATTGCTGTTACGCTTCATCACTGGCTCTGCTTCCAGCCCTAGCCTCTCAATATGGAGGAGGAAGAAGGAGATGGGCAAGGAGGGTCTGATTGTAGTCAAAGAACTCAAGAGACTTCAGTCCAATTTCATTCGCCTCGACCGCTTCATTTCCTCCCATGTCTCTCGCTTGCTCAAGTCCGACCTTGTTGCGGTTCTTGTTGAGCTTCAGAGACAGAATCATGTCTTTCTCTGCATGAAGTTGTATAACGTGGTTCGTAAAGAAGTATGGTACCGGCCCGACATGTTCTTTTATAGAGATATGCTTATGATGCTTGCAAAGAACAAAAGGGTAGAGGAAACAAAACAAGTATGGGAGGATCTGAAAAAAGAGGGAGTGTTATTTGATCAGCATACTTTTGGAGACATTATTCGAGCATACTTGGATAATACAATGCTCTCTGAGGCCATGGATATATATCGTGAAATGAGAGAATCTCCTGATAGGCCCTTATCTTTGCCTTTTCGTGTCATTCTTAAGGGCCTTATTCCATATCCAGAATTGAGAGAACAAGTTAAAGATGACTTCTTGGAGCTCTTCCCTGATATGATCGTCTACGATCCACCAGAAGACTTGTTTGAAGAAGATGAAGATAGAAACAAGAGTGAAGATGATTAA

Protein sequence

MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLDNTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPPEDLFEEDEDRNKSEDD
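The protein sequence follows from the coding sequence by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear plant gene but is not stated on this page. `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last four codons of the CDS listed above.
print(translate("ATGCTTCGTCTTGCTCCAAATCTTCTTCGC"))
print(translate("GAAGATGATTAA"))
```

The two fragments translate to the start (`MLRLAPNLLR`) and end (`EDD`) of the protein sequence shown above. Passing the full CDS string to `translate` would reproduce the whole protein under the same assumption.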
BLAST of CsGy1G030320 vs. NCBI nr
Match: XP_004139064.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g62350 [Cucumis sativus])

HSP 1 Score: 498.4 bits (1282), Expect = 1.4e-137
Identity = 256/256 (100.00%), Positives = 256/256 (100.00%), Query Frame = 0

Query: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK 60
           MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK
Sbjct: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120
           KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180

Query: 181 NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240
           NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP
Sbjct: 181 NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDEDRNKSEDD 257
           EDLFEEDEDRNKSEDD
Sbjct: 241 EDLFEEDEDRNKSEDD 256

BLAST of CsGy1G030320 vs. NCBI nr
Match: XP_008450338.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g62350 [Cucumis melo])

HSP 1 Score: 494.2 bits (1271), Expect = 2.7e-136
Identity = 253/256 (98.83%), Positives = 254/256 (99.22%), Query Frame = 0

Query: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK 60
           MLRLAPNLLRKISSSP+SSTPFHRFSLSTFLNLDLLQQQLLLRFI GSASSPSLSIWRRK
Sbjct: 1   MLRLAPNLLRKISSSPVSSTPFHRFSLSTFLNLDLLQQQLLLRFIAGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120
           KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180

Query: 181 NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240
           N MLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP
Sbjct: 181 NAMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDEDRNKSEDD 257
           EDLFEEDEDRNKSEDD
Sbjct: 241 EDLFEEDEDRNKSEDD 256
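The percentages on each HSP summary line are the listed fractions (matching or similar positions over alignment length) expressed as a percentage. A minimal sketch for recomputing them from such a line; `hsp_percentages` is a hypothetical helper and not part of any BLAST tool.

```python
import re

def hsp_percentages(summary_line):
    """Recompute percentages from the 'label = matches/length' fractions."""
    result = {}
    # Matches e.g. 'Identity = 253/256'; 'Query Frame = 0' has no fraction
    # and is therefore skipped.
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 253/256 (98.83%), Positives = 254/256 (99.22%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Cucumis melo hit above, 253 identical positions out of 256 aligned gives 98.83%, and 254 similar positions gives 99.22%.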

BLAST of CsGy1G030320 vs. NCBI nr
Match: XP_022966084.1 (pentatricopeptide repeat-containing protein At1g62350 [Cucurbita maxima] >XP_022966085.1 pentatricopeptide repeat-containing protein At1g62350 [Cucurbita maxima])

HSP 1 Score: 431.4 bits (1108), Expect = 2.1e-117
Identity = 223/253 (88.14%), Positives = 232/253 (91.70%), Query Frame = 0

Query: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK 60
           MLR APNLLR+IS+S  S+   +RF+  TF      QQQLLLRFITGSASSPSLSIWRRK
Sbjct: 1   MLRFAPNLLRRISNSATSTIHLYRFTHCTFFEHHRPQQQLLLRFITGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120
           KEMGKEGLIVVKELKRLQSN IRLDRFISSHVSRLLKSDLVAVLVELQRQN VFLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNLIRLDRFISSHVSRLLKSDLVAVLVELQRQNQVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180
           NVVRKEVWYRPDMFFYRDML MLAKNKRVEETKQVWEDLK EGVLFDQHTFGDI+RAYLD
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLTMLAKNKRVEETKQVWEDLKGEGVLFDQHTFGDIMRAYLD 180

Query: 181 NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240
           N M SEAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQV+DDFLELFPDMIVYDPP
Sbjct: 181 NAMPSEAMDIYREMRQSPDRPLSLPFRVILKGLVPYPELREQVRDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDEDRNKS 254
           EDLFEEDEDR KS
Sbjct: 241 EDLFEEDEDRFKS 253

BLAST of CsGy1G030320 vs. NCBI nr
Match: XP_023531705.1 (pentatricopeptide repeat-containing protein At1g62350 [Cucurbita pepo subsp. pepo] >XP_023531706.1 pentatricopeptide repeat-containing protein At1g62350 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 427.9 bits (1099), Expect = 2.4e-116
Identity = 220/253 (86.96%), Positives = 230/253 (90.91%), Query Frame = 0

Query: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK 60
           MLR APNLLR+ S+   S+   +RF+  TF      QQQLLLRFITGSASSPSLS+WRRK
Sbjct: 1   MLRFAPNLLRRFSNRATSTIHLYRFTHCTFFEHHRPQQQLLLRFITGSASSPSLSVWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120
           KEMGKEGLIVVKELKR+QSN IRLDRFISSHVSRLLKSDLVAVLVELQRQN VFLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRIQSNLIRLDRFISSHVSRLLKSDLVAVLVELQRQNQVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLK EGVLFDQHTFGDI+RAYLD
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKGEGVLFDQHTFGDIMRAYLD 180

Query: 181 NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240
           N M SEAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQVKDDFLELFPDMIVYDPP
Sbjct: 181 NAMPSEAMDIYREMRQSPDRPLSLPFRVILKGLVPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDEDRNKS 254
           EDLFEEDED  KS
Sbjct: 241 EDLFEEDEDMCKS 253

BLAST of CsGy1G030320 vs. NCBI nr
Match: XP_022930047.1 (pentatricopeptide repeat-containing protein At1g62350 [Cucurbita moschata] >XP_022930049.1 pentatricopeptide repeat-containing protein At1g62350 [Cucurbita moschata])

HSP 1 Score: 424.9 bits (1091), Expect = 2.0e-115
Identity = 220/253 (86.96%), Positives = 229/253 (90.51%), Query Frame = 0

Query: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK 60
           MLR APNLLR+ S+S  S+   +R +  TF      QQQLLLRFITGSASSPSLSIWRRK
Sbjct: 1   MLRFAPNLLRRFSNSATSTIHVYRLTHCTFFEHHRPQQQLLLRFITGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120
           KEMGKEGLIVVKELKR+QSN IRLDRFISSHVSRLLKSDLVAVLVELQRQN VFLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRIQSNLIRLDRFISSHVSRLLKSDLVAVLVELQRQNQVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLK EGVLFDQHTFGDI+RAYLD
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKGEGVLFDQHTFGDIMRAYLD 180

Query: 181 NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240
           N M SEAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQVKDDFLELFPDMIVYDPP
Sbjct: 181 NAMPSEAMDIYREMRQSPDRPLSLPFRVILKGLVPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDEDRNKS 254
           EDLFEEDE   KS
Sbjct: 241 EDLFEEDEGMCKS 253

BLAST of CsGy1G030320 vs. TAIR10
Match: AT1G62350.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 305.8 bits (782), Expect = 2.4e-83
Identity = 148/194 (76.29%), Positives = 171/194 (88.14%), Query Frame = 0

Query: 63  MGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLYNV 122
           M KEGLI  KELKRLQ+  +RLDRFI SHVSRLLKSDLV+VL E QRQN VFLCMKLY V
Sbjct: 1   MSKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEV 60

Query: 123 VRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLDNT 182
           VR+E+WYRPDMFFYRDMLMMLA+NK+V+ETK+VWEDLKKE VLFDQHTFGD++R +LDN 
Sbjct: 61  VRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEVLFDQHTFGDLVRGFLDNE 120

Query: 183 MLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPPED 242
           +  EAM +Y EMRESPDRPLSLPFRVILKGL+PYPELRE+VKDDFLELFP MIVYDPPED
Sbjct: 121 LPLEAMRLYGEMRESPDRPLSLPFRVILKGLVPYPELREKVKDDFLELFPGMIVYDPPED 180

Query: 243 LFEEDEDRNKSEDD 257
           + E+ ++  +++ D
Sbjct: 181 ICEDSDEEARTDSD 194

BLAST of CsGy1G030320 vs. TAIR10
Match: AT3G46870.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 201.4 bits (511), Expect = 6.5e-52
Identity = 96/202 (47.52%), Positives = 144/202 (71.29%), Query Frame = 0

Query: 43  RFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVA 102
           RF  G    P   +WR KK +GKE L V+  LKRL+ +  +LD+FI +HV RLLK D++A
Sbjct: 56  RFHDGRPRGP---LWRGKKLIGKEALFVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLA 115

Query: 103 VLVELQRQNHVFLCMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKE 162
           V+ EL+RQ    L +K++ V++K+ WY+PD+F Y+D+++ LAK+KR++E   +WE +KKE
Sbjct: 116 VIGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKE 175

Query: 163 GVLFDQHTFGDIIRAYLDNTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQ 222
            +  D  T+ ++IR +L +   ++AM++Y +M +SPD P  LPFRV+LKGL+P+P LR +
Sbjct: 176 NLFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELPFRVLLKGLLPHPLLRNK 235

Query: 223 VKDDFLELFPDMIVYDPPEDLF 245
           VK DF ELFP+   YDPPE++F
Sbjct: 236 VKKDFEELFPEKHAYDPPEEIF 254

BLAST of CsGy1G030320 vs. TAIR10
Match: AT5G09320.1 (Vacuolar sorting protein 9 (VPS9) domain)

HSP 1 Score: 87.0 bits (214), Expect = 1.8e-17
Identity = 58/169 (34.32%), Positives = 89/169 (52.66%), Query Frame = 0

Query: 84  LDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLYNVVRKEVWYRPDMFFYRDMLMML 143
           LDR I S   RLLK D+VAVL EL RQN   L +K++  +RKE WY+P +  Y DM+ ++
Sbjct: 541 LDRVIISKFRRLLKFDMVAVLRELLRQNECSLALKVFEEIRKEYWYKPQVRMYTDMITVM 600

Query: 144 AKNKRVEETKQVWEDLKKE-GVLFDQHTFGDIIRAYLDNTMLSEAMDIYREMRESPDRPL 203
           A N  +EE   ++  +K E G++ +   F  ++   L++ +    MD Y  M+     P 
Sbjct: 601 ADNSLMEEVNYLYSAMKSEKGLMAEIEWFNTLLTILLNHKLFDLVMDCYAFMQSIGYEPD 660

Query: 204 SLPFRVILKGLIPYPE--LREQVKDDFLELFPDMIVYDPPEDLFEEDED 250
              FRV++ GL    E  L   V+ D  E + + +      +  EEDE+
Sbjct: 661 RASFRVLVLGLESNGEMGLSAIVRQDAHEYYGESL------EFIEEDEE 703

BLAST of CsGy1G030320 vs. TAIR10
Match: AT3G42570.1 (peroxidase family protein)

HSP 1 Score: 52.0 bits (123), Expect = 6.4e-07
Identity = 25/34 (73.53%), Positives = 27/34 (79.41%), Query Frame = 0

Query: 60 KKEMGKEGLIVVKELKRLQSNFIRLDRFISSHVS 94
          KKE  KEGLI  KELKRLQ+N +RLDRFI SH S
Sbjct: 4  KKEKSKEGLIAAKELKRLQTNLVRLDRFIDSHPS 37

BLAST of CsGy1G030320 vs. TAIR10
Match: AT3G27750.1 (FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 50.8 bits (120), Expect = 1.4e-06
Identity = 38/136 (27.94%), Positives = 67/136 (49.26%), Query Frame = 0

Query: 63  MGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLYNV 122
           +  E +  ++ LKR     + L   +   + RL+KSDL++VL EL RQ++  L + + + 
Sbjct: 48  LSTEAIQSIQSLKRAHRTGVSLSLTLRP-LRRLIKSDLISVLRELLRQDYCTLAVHVLST 107

Query: 123 VRKEVWYRP-DMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLDN 182
           +R E  Y P D+  Y D++  L +NK  +E  ++  ++       D      +IRA +  
Sbjct: 108 LRTE--YPPLDLVLYADIVNALTRNKEFDEIDRLIGEIDGIDQRSDDKALAKLIRAVVGA 167

Query: 183 TMLSEAMDIYREMRES 198
                 + +Y  MRES
Sbjct: 168 ERRESVVRVYTLMRES 180

BLAST of CsGy1G030320 vs. Swiss-Prot
Match: sp|Q1PFH7|PPR89_ARATH (Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX=3702 GN=At1g62350 PE=2 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 4.4e-82
Identity = 148/194 (76.29%), Positives = 171/194 (88.14%), Query Frame = 0

Query: 63  MGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLYNV 122
           M KEGLI  KELKRLQ+  +RLDRFI SHVSRLLKSDLV+VL E QRQN VFLCMKLY V
Sbjct: 1   MSKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEV 60

Query: 123 VRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLDNT 182
           VR+E+WYRPDMFFYRDMLMMLA+NK+V+ETK+VWEDLKKE VLFDQHTFGD++R +LDN 
Sbjct: 61  VRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEVLFDQHTFGDLVRGFLDNE 120

Query: 183 MLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPPED 242
           +  EAM +Y EMRESPDRPLSLPFRVILKGL+PYPELRE+VKDDFLELFP MIVYDPPED
Sbjct: 121 LPLEAMRLYGEMRESPDRPLSLPFRVILKGLVPYPELREKVKDDFLELFPGMIVYDPPED 180

Query: 243 LFEEDEDRNKSEDD 257
           + E+ ++  +++ D
Sbjct: 181 ICEDSDEEARTDSD 194

BLAST of CsGy1G030320 vs. Swiss-Prot
Match: sp|Q9STF9|THA8L_ARATH (Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8L PE=1 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 1.2e-50
Identity = 96/202 (47.52%), Positives = 144/202 (71.29%), Query Frame = 0

Query: 43  RFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVA 102
           RF  G    P   +WR KK +GKE L V+  LKRL+ +  +LD+FI +HV RLLK D++A
Sbjct: 56  RFHDGRPRGP---LWRGKKLIGKEALFVILGLKRLKEDDEKLDKFIKTHVFRLLKLDMLA 115

Query: 103 VLVELQRQNHVFLCMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKE 162
           V+ EL+RQ    L +K++ V++K+ WY+PD+F Y+D+++ LAK+KR++E   +WE +KKE
Sbjct: 116 VIGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKE 175

Query: 163 GVLFDQHTFGDIIRAYLDNTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQ 222
            +  D  T+ ++IR +L +   ++AM++Y +M +SPD P  LPFRV+LKGL+P+P LR +
Sbjct: 176 NLFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELPFRVLLKGLLPHPLLRNK 235

Query: 223 VKDDFLELFPDMIVYDPPEDLF 245
           VK DF ELFP+   YDPPE++F
Sbjct: 236 VKKDFEELFPEKHAYDPPEEIF 254

BLAST of CsGy1G030320 vs. Swiss-Prot
Match: sp|Q9LVW6|THA8_ARATH (Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8 PE=2 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 2.6e-05
Identity = 38/136 (27.94%), Positives = 67/136 (49.26%), Query Frame = 0

Query: 63  MGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLYNV 122
           +  E +  ++ LKR     + L   +   + RL+KSDL++VL EL RQ++  L + + + 
Sbjct: 48  LSTEAIQSIQSLKRAHRTGVSLSLTLRP-LRRLIKSDLISVLRELLRQDYCTLAVHVLST 107

Query: 123 VRKEVWYRP-DMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLDN 182
           +R E  Y P D+  Y D++  L +NK  +E  ++  ++       D      +IRA +  
Sbjct: 108 LRTE--YPPLDLVLYADIVNALTRNKEFDEIDRLIGEIDGIDQRSDDKALAKLIRAVVGA 167

Query: 183 TMLSEAMDIYREMRES 198
                 + +Y  MRES
Sbjct: 168 ERRESVVRVYTLMRES 180

BLAST of CsGy1G030320 vs. Swiss-Prot
Match: sp|Q9FKC3|PP424_ARATH (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 47.0 bits (110), Expect = 3.7e-04
Identity = 19/99 (19.19%), Positives = 57/99 (57.58%), Query Frame = 0

Query: 117 MKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIR 176
           ++++ ++R+++WY+P++  Y  +++ML K K+ E+  ++++++  EG + +   +  ++ 
Sbjct: 134 IQVFELLREQLWYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVYTALVS 193

Query: 177 AYLDNTMLSEAMDIYREMRESPD-RPLSLPFRVILKGLI 215
           AY  +     A  +   M+ S + +P    + +++K  +
Sbjct: 194 AYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFL 232

BLAST of CsGy1G030320 vs. TrEMBL
Match: tr|A0A1S3BQ11|A0A1S3BQ11_CUCME (pentatricopeptide repeat-containing protein At1g62350 OS=Cucumis melo OX=3656 GN=LOC103491974 PE=4 SV=1)

HSP 1 Score: 494.2 bits (1271), Expect = 1.8e-136
Identity = 253/256 (98.83%), Positives = 254/256 (99.22%), Query Frame = 0

Query: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK 60
           MLRLAPNLLRKISSSP+SSTPFHRFSLSTFLNLDLLQQQLLLRFI GSASSPSLSIWRRK
Sbjct: 1   MLRLAPNLLRKISSSPVSSTPFHRFSLSTFLNLDLLQQQLLLRFIAGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120
           KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180

Query: 181 NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240
           N MLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP
Sbjct: 181 NAMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDEDRNKSEDD 257
           EDLFEEDEDRNKSEDD
Sbjct: 241 EDLFEEDEDRNKSEDD 256

BLAST of CsGy1G030320 vs. TrEMBL
Match: tr|A0A2I4F1J6|A0A2I4F1J6_9ROSI (pentatricopeptide repeat-containing protein At1g62350 OS=Juglans regia OX=51240 GN=LOC108994666 PE=4 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 8.9e-96
Identity = 188/261 (72.03%), Positives = 214/261 (81.99%), Query Frame = 0

Query: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLL-----QQQLLLRFITGSASSPSLS 60
           MLR A NL RK S S  +  P   FS +  L L        QQ+  LR++ GSASSPSLS
Sbjct: 1   MLRQAQNLFRKTSFSTTTRLPSSPFSPNFPLLLQDTFSKNHQQKWFLRYVAGSASSPSLS 60

Query: 61  IWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFL 120
           IWRR+KEMGKEGLIV KELKRL+ N +RLDRFI SHVSRLLKSDL+AVL E QRQ+ + L
Sbjct: 61  IWRRRKEMGKEGLIVAKELKRLRFNQLRLDRFIRSHVSRLLKSDLLAVLAEFQRQDQILL 120

Query: 121 CMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDII 180
           CMKLY+VVRKE+WYRPDM+FYRDMLMMLA+NK+V+E KQVWEDLKKE VLFDQHTFGDII
Sbjct: 121 CMKLYDVVRKEIWYRPDMYFYRDMLMMLARNKKVDEAKQVWEDLKKEEVLFDQHTFGDII 180

Query: 181 RAYLDNTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMI 240
           RA+LDN + SEAMDIY EMR SPD P+SLPFRVILKGL+PYPELRE++KDDFLELFPDM+
Sbjct: 181 RAFLDNELPSEAMDIYDEMRLSPDPPISLPFRVILKGLLPYPELREKIKDDFLELFPDMV 240

Query: 241 VYDPPEDLFEEDEDRNKSEDD 257
           VYDPPEDLFE ++ R K  DD
Sbjct: 241 VYDPPEDLFEHEDWRKKIGDD 261

BLAST of CsGy1G030320 vs. TrEMBL
Match: tr|A0A2N9FMM7|A0A2N9FMM7_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16222 PE=4 SV=1)

HSP 1 Score: 353.2 bits (905), Expect = 4.9e-94
Identity = 185/261 (70.88%), Positives = 218/261 (83.52%), Query Frame = 0

Query: 1   MLRLAPNLLRKISSS-----PISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLS 60
           MLR A  L+ K SSS      +SS+ F  F     L+ +   +Q L R+I+G ASSPSLS
Sbjct: 1   MLRHARTLITKPSSSSTTALSLSSSFFPDFPKHPLLSENSFSKQWLFRYISGLASSPSLS 60

Query: 61  IWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFL 120
           IWRRKKEMGKEGLIV KELKRL+S+ +RLDRFI S+VSRLLKSDL AVL E QRQ+ VFL
Sbjct: 61  IWRRKKEMGKEGLIVAKELKRLRSDSVRLDRFIRSNVSRLLKSDLFAVLAEFQRQDQVFL 120

Query: 121 CMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDII 180
           CMKLY+VVRKE+WYRPDMFFYRDMLMMLA+NK+V+E K+VWEDLK+E VLFDQHTFGDI+
Sbjct: 121 CMKLYDVVRKEIWYRPDMFFYRDMLMMLARNKKVDEAKRVWEDLKREEVLFDQHTFGDIM 180

Query: 181 RAYLDNTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMI 240
           RA+LD+ + SEAM+IY EMR+SPD P+SLPFRVILKGL+PYPELRE+VKDDFLELFP MI
Sbjct: 181 RAFLDSGLPSEAMEIYDEMRQSPDPPISLPFRVILKGLLPYPELREKVKDDFLELFPGMI 240

Query: 241 VYDPPEDLFEEDEDRNKSEDD 257
           VYDPPEDLFE+ + R ++EDD
Sbjct: 241 VYDPPEDLFEDQDWRRENEDD 261

BLAST of CsGy1G030320 vs. TrEMBL
Match: tr|A0A1J7GN98|A0A1J7GN98_LUPAN (Uncharacterized protein OS=Lupinus angustifolius OX=3871 GN=TanjilG_14022 PE=4 SV=1)

HSP 1 Score: 344.4 bits (882), Expect = 2.3e-91
Identity = 172/215 (80.00%), Positives = 192/215 (89.30%), Query Frame = 0

Query: 44  FITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAV 103
           F++GSASSPSLSIWRRKKE+GKEGLIV KELKRLQSN +RLDRFI S VSRLLKSDLVAV
Sbjct: 25  FVSGSASSPSLSIWRRKKEIGKEGLIVAKELKRLQSNPVRLDRFIQSQVSRLLKSDLVAV 84

Query: 104 LVELQRQNHVFLCMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEG 163
           L E QRQ+ VFLCMKLYN+VRKE+WYRPDMFFYRDMLMMLA+N+RVEETK+VW DLK E 
Sbjct: 85  LAEFQRQDQVFLCMKLYNIVRKEIWYRPDMFFYRDMLMMLARNQRVEETKRVWRDLKGEE 144

Query: 164 VLFDQHTFGDIIRAYLDNTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQV 223
           VLFDQHTFGDIIRA+LD  + SEAM+IY EMR+SPD PLSLPFRV+LKGLIPYPELRE+V
Sbjct: 145 VLFDQHTFGDIIRAFLDGGLPSEAMEIYEEMRQSPDPPLSLPFRVMLKGLIPYPELREKV 204

Query: 224 KDDFLELFPDMIVYDPPEDLFEE---DEDRNKSED 256
           KDDFLE+FP+MI+YDPPEDLFE    D D +  ED
Sbjct: 205 KDDFLEIFPNMIIYDPPEDLFENSEGDSDEDNEED 239

BLAST of CsGy1G030320 vs. TrEMBL
Match: tr|G7I945|G7I945_MEDTR (PPR containing plant-like protein OS=Medicago truncatula OX=3880 GN=11422506 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 3.9e-91
Identity = 166/206 (80.58%), Positives = 192/206 (93.20%), Query Frame = 0

Query: 43  RFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVA 102
           RFITGSAS PSLSIWRRKKE+GKEGLI+ KELKRLQS+ +RLDRF+ S+VSRLLKSDLV+
Sbjct: 16  RFITGSASKPSLSIWRRKKELGKEGLIITKELKRLQSDPVRLDRFVRSNVSRLLKSDLVS 75

Query: 103 VLVELQRQNHVFLCMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKE 162
           VL E  RQ++VFL MKLY++VRKE+WYRPDMFFYRDMLMMLA+NKRV+ETK+VW+DLK E
Sbjct: 76  VLFEFHRQDNVFLSMKLYDIVRKEIWYRPDMFFYRDMLMMLARNKRVDETKRVWDDLKGE 135

Query: 163 GVLFDQHTFGDIIRAYLDNTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQ 222
           GVLFDQHTFGDI+RAYLD+ M SEAMDIY EMR+SP+ PLSLPFRVILKGLIPYPELRE+
Sbjct: 136 GVLFDQHTFGDIVRAYLDSGMPSEAMDIYEEMRQSPEPPLSLPFRVILKGLIPYPELREK 195

Query: 223 VKDDFLELFPDMIVYDPPEDLFEEDE 249
           +KDDFLE+FPDMI+YDPPEDLF++ E
Sbjct: 196 IKDDFLEVFPDMIIYDPPEDLFDDHE 221

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_004139064.1  1.4e-137  100.00        PREDICTED: pentatricopeptide repeat-containing protein At1g62350 [Cucumis sativu...
XP_008450338.1  2.7e-136  98.83         PREDICTED: pentatricopeptide repeat-containing protein At1g62350 [Cucumis melo]
XP_022966084.1  2.1e-117  88.14         pentatricopeptide repeat-containing protein At1g62350 [Cucurbita maxima] >XP_022...
XP_023531705.1  2.4e-116  86.96         pentatricopeptide repeat-containing protein At1g62350 [Cucurbita pepo subsp. pep...
XP_022930047.1  2.0e-115  86.96         pentatricopeptide repeat-containing protein At1g62350 [Cucurbita moschata] >XP_0...

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT1G62350.1  2.4e-83  76.29         Pentatricopeptide repeat (PPR) superfamily protein
AT3G46870.1  6.5e-52  47.52         Pentatricopeptide repeat (PPR) superfamily protein
AT5G09320.1  1.8e-17  34.32         Vacuolar sorting protein 9 (VPS9) domain
AT3G42570.1  6.4e-07  73.53         peroxidase family protein
AT3G27750.1  1.4e-06  27.94         FUNCTIONS IN: molecular_function unknown

vs. Swiss-Prot
Match Name             E-value  Identity (%)  Description
sp|Q1PFH7|PPR89_ARATH  4.4e-82  76.29         Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX...
sp|Q9STF9|THA8L_ARATH  1.2e-50  47.52         Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702...
sp|Q9LVW6|THA8_ARATH   2.6e-05  27.94         Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=T...
sp|Q9FKC3|PP424_ARATH  3.7e-04  19.19         Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop...

vs. TrEMBL
Match Name                      E-value   Identity (%)  Description
tr|A0A1S3BQ11|A0A1S3BQ11_CUCME  1.8e-136  98.83         pentatricopeptide repeat-containing protein At1g62350 OS=Cucumis melo OX=3656 GN...
tr|A0A2I4F1J6|A0A2I4F1J6_9ROSI  8.9e-96   72.03         pentatricopeptide repeat-containing protein At1g62350 OS=Juglans regia OX=51240 ...
tr|A0A2N9FMM7|A0A2N9FMM7_FAGSY  4.9e-94   70.88         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16222 PE=4 SV=1
tr|A0A1J7GN98|A0A1J7GN98_LUPAN  2.3e-91   80.00         Uncharacterized protein OS=Lupinus angustifolius OX=3871 GN=TanjilG_14022 PE=4 S...
tr|G7I945|G7I945_MEDTR          3.9e-91   80.58         PPR containing plant-like protein OS=Medicago truncatula OX=3880 GN=11422506 PE=...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy1G030320.1  CsGy1G030320.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 170..197; e-value: 0.0023; score: 16.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 136..164; e-value: 0.18; score: 12.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 170..197; e-value: 0.0069; score: 16.5
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 132..166; score: 8.769
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 167..201; score: 9.767
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 61..247; e-value: 1.7E-38; score: 133.8
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 45..242
None       No IPR available                                   PANTHER  PTHR24015:SF590   SUBFAMILY NOT NAMED  coord: 45..242
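
The `coord` ranges in the InterPro table index into the protein sequence. The sketch below pulls out the two PROSITE PPR segments (132..166 and 167..201), assuming the coordinates are 1-based and inclusive, which is an assumption since the page does not say. `subsequence` is a hypothetical helper; the protein is assembled from the 60-residue query rows of the first BLAST alignment.

```python
# Protein sequence of CsGy1G030320, split into the alignment rows shown above.
PROTEIN = (
    "MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK"
    "KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY"
    "NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD"
    "NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP"
    "EDLFEEDEDRNKSEDD"
)

def subsequence(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}..{end} is outside the sequence")
    return seq[start - 1:end]

# PROSITE PS51375 (PPR) hits from the table above.
for start, end in [(132, 166), (167, 201)]:
    segment = subsequence(PROTEIN, start, end)
    print(f"{start}..{end} ({len(segment)} aa): {segment}")
```

Both segments are 35 residues long, the canonical length of a pentatricopeptide repeat.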