CsGy1G032120.1 (mRNA) Cucumber (Gy14) v2

NameCsGy1G032120.1
TypemRNA
OrganismCucumis sativus (Cucumber (Gy14) v2)
DescriptionPentatricopeptide repeat-containing protein
LocationChr1 : 31326165 .. 31334135 (-)
Sequence length1902
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AGTGAGGATACTTTAGGCATGTTTTGGTGGTCGATCACTTATGTAGTAAACTTTTCAAGACAATTGTTTTTCTGAAGTAAGTAGCTACTAGGGAAGGCTCGCAGTATTCCTTCCCAGATGGAAGTGGTTTATAGGTTAACCTTGACTAGGTGGTAGAGATTTGTAGTCACCTAAGCTTATAGGCCACGGTAGGAGGTTTGTTCTTAATACTCGTAGTTGAACCTTTGGCTCAGCTGATTCACAATGACACCATTTTTCCCAGCAAAAGATACGAGAAGACCAGAGAGAACATATGTTTGCGTTGCCTCAGTGTTAACTTTTTCTAATGGCACCAGAAACTTAAAGTCCTTCAGTAATGGAATGGACTTGATGGAGGTTAGTGCTCATGCGAGATATTTACTAAATAAAATAGGCTTATTTACCCCACTTGTCATTTGCTCAAAATTGTTTCTCAATTTGAAGTCCTGTAATTGGTTAAGTGCTTGACAAGCTGGAGGGCTTATTGGCTTTATTATTTGTTTATTACTACATAAACATAGGTTGAATTGTTATGAACCAATTGCCCTAACATGATCATTGAAATGCTTTACGGTGCAGAATTGTTATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAAATGTGTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAGTCAGGATTTGCTTCTGATATGTTTGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGTTCGAGATATACCCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTACTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGGTGAGTTTTCTATATCCTTCTTCATTTGTTTTTGTTCTGAATTTTAGTTTCTAAATCACTAATGAACTTCAGAAGTATAAGTGAGTACATAGTTAATCTTTTATTTATGTTCACATGTTTTGTATGTTTGGTAGCTGGTATAGGTTAGTGATAATAGAACACCTACTTTGGATGATTCTTATTCAATGGAAACAACAAAAGGAATTACAAGGAAATGGTAGAGCACCCAGTCTTGAACAAGAACAAAGGCAACTAACAAAGAACGGAAAGTAAACCCTACCCTTCCCCAGGCCTTTTCTATCTCTCAAGGAATAAACAAATCCCTCACACAATTTTCATACCCTCACTCTCAGACTCCCTCACCATTTAAACCTTCTCCTTTCCCTAACTAACTGTGGGACCCGGCCTTAGCAGCTATAGTACCCATTACACATGCACTTCCTTTGTTCCCTCTCCTACTGTATTACCATATAATAGGTGCCCTAACATTACGCCTTCTCTAAAATCACATTCTCCTTAAGGTGAAATCTGGAAATTGCTGTTGAAAATCATAACAAATCTCCTAGGCAGCTTTATAAGGAGGTAGCACTTGCCAACAAATCAAGACCTCCCACACTCCCGTTGTAGGGTGCCTTTGATATCCATAAACTTCCTCTAGCTTGGCTACTCATTCATAATTTTTCGACAAGTGACTAGTTGCTGCACCTCAGAATGCTCACTAACACTCCCTCAACTGTGAGACATGAGACACCGGATGAATAGAGGCCGATGGAGGTAATTCCAGTTTTTATGCCACTGGTCCAATTCTTTCAAGCAAAAATTTTGGAGATAATTTTTCACTACTCCGTTTTCGCAATGACGAGTGCCCGTTTGAAGTTGGTCGCCTAGCTGTTTTAAGAGGAGCAAGTGCCTCTCACCTAAGGCGAGCCCTCGGTTGCACCTCGAAAACACTGGCTGCAGTATTATTTTCTCTTCAATTAAAACTGATTTCCATTTGTTGAGTCTTCTACATATTTCAAGCCTACAAAGGAGAGGGAAATGTTAAAACGATACAACATTAAATTTTCCTTCACCTTAAACTATAGCTAAATCTACAAGGTTGAAGTTGTTGTGTTATTGTATGTTTAAGCATTTATATTAATAGTCTTATTACATCTTTTGCATTTATACTGACGGTGGTGTTATATCTTTCTCACAACAGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATGTACATCATGATCAAAACGAAGATGAAGAACTACTCTATTCTTCGTAATTATTTTACGGTTTGATTGAATCGTTATATTGCTTCATGATTATTATATTAGATATATGAGATTTAAAGAACTAGGAATTAATGTCTAGAATAGTTCATTAGGGAATAGCAAAATTGTATGTAGTTGAATTTTGCACTTATTCGGTTTTTGTCCTAAAGTTTAAATACTTTTATACCTATGAATTCATTCAAAATGATAGTTTTTTTTATCATGTCCATTCTCTCCTCTTTTAGTACATTTCTTTATTCATTTATTTTATATATCAAATTAATTATATGAATATTGTATTTGTGATTTATAAAATTATAACAATTTTATTAAATTAATATATATATACATATATATATATAAACAATTTAATGTACTTAATCTTTTATTTTAATTAGTTTTTATATAAAGAAATTGTCAAAAATGACAAAATGGACATTATTAACTTCATAACATCAGGTTTAATGAGTTTGTTATTTAGTGGCTTTTTTTGTCATGAATTATAAATAGGTTTTTTTTTATTATTATTCTAAAGAAAAACCTACATATTTACTGTTTTCGTTAGTTAATGACCAGAGAGTGCACCAAATAGGAAGGATAAGAACTCACCTCAAGTCCTCTCAAAGATAAGTAGAGAGAAAAGAATATTGATGGAGACAATAGGCAATTCCATCAAAGGGTTGGAAATACATGTAGAGGGCCAAACCAGCAGAACAATTAGAATTGTCCCTTATTCCTTTACATATTCAAGAGTCACTATAGAGCAAGTGAATGCCTCAATCGAGCGAAACTTAATGTCATTCAAGCATAATTAGTTTTTGATTTGAACGATAAAGATGTTATTACCGAGGACAACACCATGCAAGTGGAAGAAAGTGAAACCCACGAGTTGAGGTCTTGAGATTTTTAGTTGTCTTCCAAAAGAACGTGGGAGAATCAAAAGAATCTAAAAAAGAAGGGTTAATGAATGTAGACACATGGGACAACCATAGATTGACCAAGAGTACCATGGTCGAGTCTAAAGCCACATGCAACTTCATCATGTTCGAAGTAGAAGTTAGACTGTTAAAACTTTGCTTGGATGGAGGCACATAAAAAATGAAAGCAGTGAACTCTACGACCTTGTCTATTACAGGAGCATTAAAAAGAGAAATCAAACAATTGGGAGATTTTAGTGGACCAATTGACTTTGTGATAGTTAAAATAGGCAACTTTGATGTGGTGTTGTGGATGAAATTCATGCTTTAATGCATATTTCTCTCGTCAAGTGTTTGGTGATCACTGGGTCTATGTCCACAATTGTGCACACAAAAATTAAATAGCTAAATGGAATAAAGATGATCTCGGTTCTTCAATTGAAGAAGAGCCTTGCCCACGAGTCAATTGGAGCCATTGAAGAAGCATCTTCTCAAGATATTTTTTGTCTGACAAAAGGGTCAAAATGTAAACAACTTACAAAAGTCTCTACCGCCATAAAGGAAAAAGAGAAGGCACATATGCTATGAGTTAAAACTGGGAACAAAGTTTGAGTCCAGTTACATCCAGGTTAGTTTCAAATCCCTGGCTAAAGAAAACAATGCCTCACTAGGTGCTATGAAGGACCTATTGAAGTTTTCGGCAAAGTCGGGAAAGGCTCTTATCAGGTACATTTGACTTCACGGGTGAAGGTTCATCCAATCATTGACGTGACTAATTTGAAATCGTATAACCCCAACCATGTCAACGATTCAATGTTTTGAAGCGTCCACCTACAATGTTGAAGAGAGACGTCGAGAAAGAAGTCGACAAAAGACTTCCAAAAAATGACAACCAAGGTAGAATACCAAGTCAACAAATGAAAGTTCCTTAAGAGGAGTGGAAGAGCCTCAAAGGCAAAGAAGTTAAAGTAGAGAGTTTGCCGGGATTCTCAAGGCAGTTGTTCAACATATATTGTTGAGTTCGAGTAAAATCAGTTGATGAGGTCATTAATCATTAGAATATCATAGTCATGCTTGTCCACGACCTAATGCTCGTGGCCGCATGCTTGCTCCTACCTCATTTTATTATGCTTTCATTTTATGTATTTCATTTTGTTAGTTAGTTTGCTTTATATGCAATTGTCATGTTGTATGATCACTCACCACTTTGCATATGTGATAAACATCCACTATGTTTGTTAGATGCAAAGTTAGAGGCCCTTCTAGGATACTTGCAAAAACTCACAAAGAAAGATAACTTAAATCAAATCTAGAATCTATGATGATCACTGTTAGATCCTAAGACAACTCACTTCAGGGTTAGCTTGATCAAGACAAATCCAATGAATTGATTTTTAACATCAAATATAAAACAAAATTTGAAAAGATAACAACAAAATGGTTCAATAAGCACAAGCCTTTTAATATTGATATTCTCCTCCTAAAATGTGAAAACTACAACCTTATTTATACTAATTAAAAATGTTAATGAAAGGATACAACATTTAATTCTATGCGCTTTCAACTAAATAATAAATGAAAACACATAACATTAAATAAGCATATAATTCATTAAAGATGTGAATGAATTTGAACTTTTAGTTTCATTATGACTTTGGTTGTTCAAATAACTTTTGAAGTTTCAAGCGTTACAACTCTTCAAGTTTATTGGAAAGCTTAGCTTGAAAGTTATTCGTTCTATGCTACAACTAATCAATGCCAACTATTTGCTTATCAATATGCACAGATGGTAGATTGTCTTTTGTTCAAAATATACTATATGGGGGTAAGACTTATCATCATACATTTATACCCACGAGTTATAAGATATAAAGTTGATGTTTTCCCTCATTACTTATAGCCTTATAACGACAGTTCTTTTCCCAAACATGCTTTCAAACGTTTGCAAAAACCTTTTCTTCTAAACTTGTTTTCTAAAACATTTTCTCACAAATGGGGGTGGAGGTTGTGAAATACATCCATTATGGAACTGACTATTTAGATTGGGTCAGGCTGACTTGTTCTCATACGATTACTCTGATACCACTTTGATGTAATTCTAAGTAGGATATATCTTCATAAAATGAATTATCTACCTTCAAAGGTTAAGAGGTGGGGTATTTATTAAGAATCTTGAGAGTTTGTTACAAGATAGTTATATGAATTGTTACCGAAACCGTTAGAATGGTTAACTATTTTAACTATCTTATTCTTATTCTACTTATTACATCAATTTCCTTAGGAAACGTTTCTAAAATTTAGTATGCGAGCAAACCTTCTATCAAACCTACTTTTGTAAATTTTTAAATAATCTCATTCTTCGTTTTCAACAATGAAAACCTCATTGGAGTGTTAAATTTGGGGGTGTCAAATGTTTGAGCTTATAATGCATTAAAAATGTTAAATAGGTTTCCAAACAAGCTTATTTTTAAAAAATTAAACTCATTTAAGAAAGAAAAAAAAAAAAACTCCAATGTCTTCGAAAAGCCTTAGATAAGGTTAAAGACATGAACAACACTTACCCAATGTTGGGTGCCTTCTACACCTCATAGTTGGTATGCCAAAATGAAAGTGAGTAATTAGAAATGATAAACATTAATTGCTCCAATGTCTATGCGATGAAAGTAAATAAGCAATGATAAAAGTTGAACTTGTAAAAGGAACAAACTCAAAGTAAGTCAAAAGTAAAAACAAGAGAGAGTTTCTGAATGAAACAAAAGGCCAGTTTTAAAAAGGAAAATGTGTACATGAGAACAACACAAAGGAATTTTGAACCTTAAGCTTGCCGTACCTCAATTTCAAAAGTATTTCTGTAGGCAGTAGAAGCAAACTAAGGGATGGAGATCCCAGAGGTTTAATCTAAGAACAAGGAACATGATGAGCGATTTAGTGTTTGAATCAATATAAGATAAATTTTTACTTAGTCCTTACAAATATGATTGATAGGATTGAGTACAAGCACTGTTTTAAATTTTGGAGTATGATAACTTGTAGGAAATATAGTTATTGTAACGTTTTATTTAGAATAATGGAGTCAAATTGAAATATAAGGCATTGAATTAAAATAAACTGGTATGAAAATTCTTACTAAGCAAACACAAGTTCACCTAAGGATTACCATGTCTGTTTTTTATGAGTTTTAATGTTTTAAATGTAGGATTATGTACAAAGAAGAGAAATGGTGAAGATAATATCTTCGTCCAAAGACATATGAAGAAAAGTGGTGACCCAATATTTACACCTGATCAACACGTCAAGCAACAAACCATGATACCACTTAATGAATTGACAAAATGATATGAAACCATGCCAAGATGACAAGCGTGGTTGAGAATCATCCAAATCAACAAGCTAAGTTAGTAAGGTCAAAATCCACATTGTGCTATCTACGATGGATTTGTGATTCGATGAAACAATGAGCAGAGCATGAAAGTTAATTCTTGGTATTCGACATAATTAGATGATGAAGATACTTCAAAATATACCATTTTGACACTTGTGGATTACCCACCATAAAAGCTTGGTATATTCAAAAATAAAAGTACCATACGGACGTTAAAAAACAGGTTAATTTAGGAAATTATTTTTGTTTTAAGGGTAGTATACTTGTACACGATTTTCTTTAATCTAGTCAAAGCCCACATGGCTGAAGACTGGATCAAACTGGTCAAATGTTATCTAG

mRNA sequence

AGTGAGGATACTTTAGGCATGTTTTGGTGGTCGATCACTTATGTAGTAAACTTTTCAAGACAATTGTTTTTCTGAAGTAAGTAGCTACTAGGGAAGGCTCGCAGTATTCCTTCCCAGATGGAAGTGGTTTATAGGTTAACCTTGACTAGGTGGTAGAGATTTGTAGTCACCTAAGCTTATAGGCCACGGTAGGAGGTTTGTTCTTAATACTCGTAGTTGAACCTTTGGCTCAGCTGATTCACAATGACACCATTTTTCCCAGCAAAAGATACGAGAAGACCAGAGAGAACATATGTTTGCGTTGCCTCAGTGTTAACTTTTTCTAATGGCACCAGAAACTTAAAGTCCTTCAGTAATGGAATGGACTTGATGGAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAAATGTGTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAGTCAGGATTTGCTTCTGATATGTTTGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGTTCGAGATATACCCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTACTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATTCAAAGCCCACATGGCTGAAGACTGGATCAAACTGGTCAAATGTTATCTAG

Coding sequence (CDS)

ATGACACCATTTTTCCCAGCAAAAGATACGAGAAGACCAGAGAGAACATATGTTTGCGTTGCCTCAGTGTTAACTTTTTCTAATGGCACCAGAAACTTAAAGTCCTTCAGTAATGGAATGGACTTGATGGAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAAATGTGTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAGTCAGGATTTGCTTCTGATATGTTTGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGTTCGAGATATACCCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTACTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATTCAAAGCCCACATGGCTGAAGACTGGATCAAACTGGTCAAATGTTATCTAG

Protein sequence

MTPFFPAKDTRRPERTYVCVASVLTFSNGTRNLKSFSNGMDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHSKPTWLKTGSNWSNVI
BLAST of CsGy1G032120.1 vs. NCBI nr
Match: XP_011660274.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativus] >KGN66775.1 hypothetical protein Csa_1G690140 [Cucumis sativus])

HSP 1 Score: 896.3 bits (2315), Expect = 5.0e-257
Identity = 493/497 (99.20%), Postives = 496/497 (99.80%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM
Sbjct: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
Sbjct: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV
Sbjct: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG
Sbjct: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE
Sbjct: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480

Query: 520 IYALLHKIYDIIKLHKH 537
           IYALLHKIYDIIKLHKH
Sbjct: 481 IYALLHKIYDIIKLHKH 497

BLAST of CsGy1G032120.1 vs. NCBI nr
Match: XP_022979295.1 (pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxima])

HSP 1 Score: 798.9 bits (2062), Expect = 1.1e-227
Identity = 440/499 (88.18%), Postives = 471/499 (94.39%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQTFSSI 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           GHPHRCWLLY QMC QGCSPN +SFTFLFPACAS  N YPGQMLHSHFCKSGFASD+FA+
Sbjct: 61  GHPHRCWLLYYQMCLQGCSPNHHSFTFLFPACASFLNAYPGQMLHSHFCKSGFASDVFAL 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TALLDMY KLG+L+SARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TALLDMYGKLGILKSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX  ENEKGTKPNEV+IASVLPAC+QLGALDIGKRIE Y
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXENEKGTKPNEVTIASVLPACAQLGALDIGKRIEVY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           AR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Sbjct: 241 ARKNGFFKNLYVSNAILEVHARCGNIEEARRVFDEIGSKRNLCSWNTMIMGLAVHGRCPD 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQ+FESME KFQ+APKLEHYGCLV
Sbjct: 301 ALQLYDQMLIQRTRPDDVTFVGLLLACTHGGMVAKGRQIFESMEIKFQIAPKLEHYGCLV 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGNVELGEVAAESLFKLEPWNPG
Sbjct: 361 DLLGRAGEIEEAYSLIQSMPMFPDSVIWGALLGACSFHGNVELGEVAAESLFKLEPWNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS E
Sbjct: 421 NYVILSNIYASAGDWSGVARVRKMMKGGHIRKRAGCSYIEVGDGIHEFIVEDRSHPKSDE 480

Query: 520 IYALLHKIYDIIKLHKHSK 539
           IYALLH IY IIKLH  ++
Sbjct: 481 IYALLHAIYAIIKLHSQNE 499

BLAST of CsGy1G032120.1 vs. NCBI nr
Match: XP_023527365.1 (pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 793.5 bits (2048), Expect = 4.6e-226
Identity = 437/499 (87.58%), Postives = 469/499 (93.99%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQTFSSI 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           GH HRCWLLY QMC QGCSPN +SFTFLFPACAS  N YPGQMLHSHFCKSGFASD+FA+
Sbjct: 61  GHHHRCWLLYYQMCLQGCSPNHHSFTFLFPACASFLNAYPGQMLHSHFCKSGFASDVFAL 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TALLDMY KLG+L+SARQLFDE PVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TALLDMYGKLGILKSARQLFDEKPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX  ENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAY
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXENEKGTKPNEVTIASVLPACAHLGALDIGKRIEAY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           AR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Sbjct: 241 ARKNGFFKNLYVSNAILEVHARCGNIEEARRVFDEIGSKRNLCSWNTMIMGLAVHGRCCD 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           ALQLYDQML+++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+APKLEHYGCLV
Sbjct: 301 ALQLYDQMLMQRTRPDDVTFVGLLLACTHGGMVAKGRQLFESMERKFQIAPKLEHYGCLV 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGNVELGEVAAESLFKLEPWNPG
Sbjct: 361 DLLGRAGEIEEAYSLIQSMPMLPDSVIWGALLGACSFHGNVELGEVAAESLFKLEPWNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS E
Sbjct: 421 NYVILSNIYASAGDWSGVARVRKMMKGGHIRKRAGCSYIEVGDGIHEFIVEDRSHPKSDE 480

Query: 520 IYALLHKIYDIIKLHKHSK 539
           IYALLH +Y IIKLH  ++
Sbjct: 481 IYALLHAVYAIIKLHNQNE 499

BLAST of CsGy1G032120.1 vs. NCBI nr
Match: XP_022147487.1 (pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Momordica charantia])

HSP 1 Score: 780.4 bits (2014), Expect = 4.0e-222
Identity = 430/496 (86.69%), Postives = 464/496 (93.55%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LFD IPKPSV+LYNKFIQ++SS 
Sbjct: 1   MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSS 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           G  HRCW LY QMC QGCSPNQ+SFTFLF ACASL NV+PGQMLH+HFCKSGFASD+FA+
Sbjct: 61  GQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFPGQMLHAHFCKSGFASDVFAL 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TALLDMYAKLGMLRSARQLFDEMPVRDIPT XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX   NE+G KPNEV++ASVLPAC+QLGALDIG+RIEAY
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXNERGIKPNEVTVASVLPACAQLGALDIGRRIEAY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           ARNNGFFKN YVSNA+LE+HARCGNIEEA+QVFDEIGSKRNLCSWNTMIMGLAVHGRC  
Sbjct: 241 ARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           A++LYDQML +++RPDDVTF+GLLLACTHGGMVA+GRQLFESMESKFQ+APKLEHYGCLV
Sbjct: 301 AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLV 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGRAGEL+EAYNLIQ MPM PDSVIWG LLGACSFHG+VEL EVAAESLFKLEPWNPG
Sbjct: 361 DLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGSVELAEVAAESLFKLEPWNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYA AGDW GVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS E
Sbjct: 421 NYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDE 480

Query: 520 IYALLHKIYDIIKLHK 536
           IYALLH IY IIKL K
Sbjct: 481 IYALLHGIYSIIKLQK 496

BLAST of CsGy1G032120.1 vs. NCBI nr
Match: XP_022147489.1 (pentatricopeptide repeat-containing protein At5g08510 isoform X2 [Momordica charantia])

HSP 1 Score: 780.4 bits (2014), Expect = 4.0e-222
Identity = 430/496 (86.69%), Postives = 464/496 (93.55%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LFD IPKPSV+LYNKFIQ++SS 
Sbjct: 1   MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSS 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           G  HRCW LY QMC QGCSPNQ+SFTFLF ACASL NV+PGQMLH+HFCKSGFASD+FA+
Sbjct: 61  GQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFPGQMLHAHFCKSGFASDVFAL 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TALLDMYAKLGMLRSARQLFDEMPVRDIPT XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX   NE+G KPNEV++ASVLPAC+QLGALDIG+RIEAY
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXNERGIKPNEVTVASVLPACAQLGALDIGRRIEAY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           ARNNGFFKN YVSNA+LE+HARCGNIEEA+QVFDEIGSKRNLCSWNTMIMGLAVHGRC  
Sbjct: 241 ARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           A++LYDQML +++RPDDVTF+GLLLACTHGGMVA+GRQLFESMESKFQ+APKLEHYGCLV
Sbjct: 301 AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLV 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGRAGEL+EAYNLIQ MPM PDSVIWG LLGACSFHG+VEL EVAAESLFKLEPWNPG
Sbjct: 361 DLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGSVELAEVAAESLFKLEPWNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYA AGDW GVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS E
Sbjct: 421 NYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDE 480

Query: 520 IYALLHKIYDIIKLHK 536
           IYALLH IY IIKL K
Sbjct: 481 IYALLHGIYSIIKLQK 496

BLAST of CsGy1G032120.1 vs. TAIR10
Match: AT5G08510.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 489.2 bits (1258), Expect = 3.4e-138
Identity = 293/497 (58.95%), Postives = 377/497 (75.86%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++Q+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +   
Sbjct: 1   MNGIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
             PH   +LY  +   G  P+ ++F F+F A AS  +  P ++LHS F +SGF SD F  
Sbjct: 61  HQPHESIVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCC 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           T L+  YAKLG L  AR++FDEM  RD+P  XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TTLITAYAKLGALCCARRVFDEMSKRDVPVWXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX    +K  KPN +++ SVLPAC+ LG L+IG+R+E Y
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXDKSVKPNHITVVSVLPACANLGELEIGRRLEGY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           AR NGFF N YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI  LA HG+  +
Sbjct: 241 ARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDE 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   +++PKLEHYGC++
Sbjct: 301 ALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMI 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGNVE+ E+A+E+LFKLEP NPG
Sbjct: 361 DLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASEALFKLEPTNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSG 519
           N VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S 
Sbjct: 421 NCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYFVEVGVDVHKFTVEDKSHPRSY 480

Query: 520 EIYALLHKIYDIIKLHK 536
           EIY +L +I+  +KL K
Sbjct: 481 EIYQVLEEIFRRMKLEK 497

BLAST of CsGy1G032120.1 vs. TAIR10
Match: AT4G37380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 329.7 bits (844), Expect = 3.4e-90
Identity = 236/506 (46.64%), Postives = 334/506 (66.01%), Query Frame = 0

Query: 37  SNGMDLMEQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQIPKPSVYLY 96
           S  +D + QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+
Sbjct: 39  SQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLF 98

Query: 97  NKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCK 156
              I T S  G   + +LLY Q+ S   +PN+++F+ L  +C++      G+++H+H  K
Sbjct: 99  TAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCST----KSGKLIHTHVLK 158

Query: 157 SGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXX 216
            G   D +  T L+D+YAK G + SA+++FD MP R + +XXXXXXXXXXXXXXXXXXXX
Sbjct: 159 FGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSXXXXXXXXXXXXXXXXXXXX 218

Query: 217 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGA 276
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    E   KP+E+++ + L ACSQ+GA
Sbjct: 219 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAEGKPKPDEITVVAALSACSQIGA 278

Query: 277 LDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIM 336
           L+ G+ I  + +++    N  V   +++++++CG++EEA  VF++   ++++ +WN MI 
Sbjct: 279 LETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDT-PRKDIVAWNAMIA 338

Query: 337 GLAVHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQV 396
           G A+HG   DAL+L+++M  I  ++P D+TF+G L AC H G+V EG ++FESM  ++ +
Sbjct: 339 GYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGI 398

Query: 397 APKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAE 456
            PK+EHYGCLV LLGRAG+L+ AY  I+NM M  DSV+W ++LG+C  HG+  LG+  AE
Sbjct: 399 KPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAE 458

Query: 457 SLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFI 516
            L  L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF 
Sbjct: 459 YLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEFR 518

Query: 517 VEDRSHLKSGEIYALLHKIYDIIKLH 535
             DR H KS EIY +L KI + IK H
Sbjct: 519 AGDREHSKSKEIYTMLRKISERIKSH 539

BLAST of CsGy1G032120.1 vs. TAIR10
Match: AT2G20540.1 (mitochondrial editing factor 21)

HSP 1 Score: 326.6 bits (836), Expect = 2.9e-89
Identity = 230/498 (46.18%), Postives = 333/498 (66.87%), Query Frame = 0

Query: 44  EQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 103
           ++I+A  + +GL  + F++ K++    ++ D+ YA  LF+Q+  P+V+LYN  I+ ++  
Sbjct: 27  KKINASIIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQVSNPNVFLYNSIIRAYTHN 86

Query: 104 GHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFA 163
                   +Y Q+  +    P++++F F+F +CASL + Y G+ +H H CK G    +  
Sbjct: 87  SLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYLGKQVHGHLCKFGPRFHVVT 146

Query: 164 MTALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 223
             AL+DMY K   L  A ++FDEM  RD   XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 147 ENALIDMYMKFDDLVDAHKVFDEMYERDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 206

Query: 224 XXXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEA 283
           XXXXXXXXXXXXXXXXXXXXXXXX ++   G +P+E+S+ SVLP+C+QLG+L++GK I  
Sbjct: 207 XXXXXXXXXXXXXXXXXXXXXXXXEMQ-LAGIEPDEISLISVLPSCAQLGSLELGKWIHL 266

Query: 284 YARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCI 343
           YA   GF K   V NA++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG   
Sbjct: 267 YAERRGFLKQTGVCNALIEMYSKCGVISQAIQLFGQMEGK-DVISWSTMISGYAYHGNAH 326

Query: 344 DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCL 403
            A++ +++M   K++P+ +TF+GLL AC+H GM  EG + F+ M   +Q+ PK+EHYGCL
Sbjct: 327 GAIETFNEMQRAKVKPNGITFLGLLSACSHVGMWQEGLRYFDMMRQDYQIEPKIEHYGCL 386

Query: 404 VDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNP 463
           +D+L RAG+L+ A  + + MPM PDS IWG+LL +C   GN+++  VA + L +LEP + 
Sbjct: 387 IDVLARAGKLERAVEITKTMPMKPDSKIWGSLLSSCRTPGNLDVALVAMDHLVELEPEDM 446

Query: 464 GNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSG 523
           GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S     
Sbjct: 447 GNYVLLANIYADLGKWEDVSRLRKMIRNENMKKTPGGSLIEVNNIVQEFVSGDNSKPFWT 506

Query: 524 EIYALL-----HKIYDII 532
           EI  +L     H+  D+I
Sbjct: 507 EISIVLQLFTSHQDQDVI 522

BLAST of CsGy1G032120.1 vs. TAIR10
Match: AT3G29230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 312.4 bits (799), Expect = 5.6e-85
Identity = 222/586 (37.88%), Postives = 327/586 (55.80%), Query Frame = 0

Query: 13  PERTYVCVASVLTFSNGTRNLKSFSNGMDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDL- 72
           P R    V+S   F    ++L   +N ++ ++Q+HA  +R  L     +  KL+    L 
Sbjct: 6   PVRAPSWVSSRRIFEERLQDLPKCAN-LNQVKQLHAQIIRRNLHEDLHIAPKLISALSLC 65

Query: 73  ---PYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLF 132
                A  +F+Q+ +P+V+L N  I+  +    P++ + ++ +M   G   + +++ FL 
Sbjct: 66  RQTNLAVRVFNQVQEPNVHLCNSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLL 125

Query: 133 PACASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGML--RSARQLFDEMPVRD 192
            AC+    +   +M+H+H  K G +SD++   AL+D Y++ G L  R A +LF++M  RD
Sbjct: 126 KACSGQSWLPVVKMMHNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERD 185

Query: 193 IPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLEN 252
             +             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    
Sbjct: 186 TVSWNSMLGGLVKAGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 245

Query: 253 ------------------------------------------------------------ 312
                                                                       
Sbjct: 246 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 305

Query: 313 ---EKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCG 372
                G K +  ++ S+L AC++ G L +G RI +  + +    NAYV NA+L+++A+CG
Sbjct: 306 XXVASGLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAKCG 365

Query: 373 NIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLL 432
           N+++A  VF++I  K++L SWNTM+ GL VHG   +A++L+ +M    +RPD VTF+ +L
Sbjct: 366 NLKKAFDVFNDI-PKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVL 425

Query: 433 LACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPD 492
            +C H G++ EG   F SME  + + P++EHYGCLVDLLGR G L+EA  ++Q MPM P+
Sbjct: 426 CSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEPN 485

Query: 493 SVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKM 530
            VIWG LLGAC  H  V++ +   ++L KL+P +PGNY +LSNIYA A DW GVA +R  
Sbjct: 486 VVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRSK 545

BLAST of CsGy1G032120.1 vs. TAIR10
Match: AT5G56310.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 304.3 bits (778), Expect = 1.5e-82
Identity = 192/521 (36.85%), Postives = 304/521 (58.35%), Query Frame = 0

Query: 22  SVLTFSNG----TRNLKSFSNGMDLMEQIHAYSLRNGLDHTKFLIEKLLQ----LPDLPY 81
           + L+ S+G      +LK   N +  ++Q H Y +  GL+     + K ++       L Y
Sbjct: 6   NALSLSSGLNWFVTSLKIHGNNLKTLKQSHCYMIITGLNRDNLNVAKFIEACSNAGHLRY 65

Query: 82  ACTLFDQIPKPSVYLYNKFIQTFSSIGHPHR---CWLLYCQMCSQGCSPNQYSFTFLFPA 141
           A ++F   P P+ YL+N  I+  S +  P+       +Y ++ +    P+ ++F F+   
Sbjct: 66  AYSVFTHQPCPNTYLHNTMIRALSLLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKI 125

Query: 142 CASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTX 201
              + +V+ G+ +H      GF S +  +T L+ MY   G L  AR++FDEM V+D+   
Sbjct: 126 AVRVSDVWFGRQIHGQVVVFGFDSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVW 185

Query: 202 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG----LEN 261
                                          XXXXXXXXXXXXXXXXXXXXX     +EN
Sbjct: 186 NALLAGYGKVGEMDEARSLLEMMPCWVRNEVXXXXXXXXXXXXXXXXXXXXXFQRMLMEN 245

Query: 262 EKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIE 321
               +P+EV++ +VL AC+ LG+L++G+RI +Y  + G  +   ++NAV++++A+ GNI 
Sbjct: 246 ---VEPDEVTLLAVLSACADLGSLELGERICSYVDHRGMNRAVSLNNAVIDMYAKSGNIT 305

Query: 322 EAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLAC 381
           +A  VF E  ++RN+ +W T+I GLA HG   +AL ++++M+   +RP+DVTF+ +L AC
Sbjct: 306 KALDVF-ECVNERNVVTWTTIIAGLATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSAC 365

Query: 382 THGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVI 441
           +H G V  G++LF SM SK+ + P +EHYGC++DLLGRAG+L+EA  +I++MP   ++ I
Sbjct: 366 SHVGWVDLGKRLFNSMRSKYGIHPNIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAI 425

Query: 442 WGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKG 501
           WG+LL A + H ++ELGE A   L KLEP N GNY++L+N+Y+  G W     +R MMKG
Sbjct: 426 WGSLLAASNVHHDLELGERALSELIKLEPNNSGNYMLLANLYSNLGRWDESRMMRNMMKG 485

Query: 502 GHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKI 528
             + K AG S IEV + +++FI  D +H +   I+ +L ++
Sbjct: 486 IGVKKMAGESSIEVENRVYKFISGDLTHPQVERIHEILQEM 522

BLAST of CsGy1G032120.1 vs. Swiss-Prot
Match: sp|Q9FNN7|PP371_ARATH (Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E20 PE=2 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 6.1e-137
Identity = 293/497 (58.95%), Postives = 377/497 (75.86%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++Q+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +   
Sbjct: 1   MNGIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
             PH   +LY  +   G  P+ ++F F+F A AS  +  P ++LHS F +SGF SD F  
Sbjct: 61  HQPHESIVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCC 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           T L+  YAKLG L  AR++FDEM  RD+P  XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TTLITAYAKLGALCCARRVFDEMSKRDVPVWXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX    +K  KPN +++ SVLPAC+ LG L+IG+R+E Y
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXDKSVKPNHITVVSVLPACANLGELEIGRRLEGY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           AR NGFF N YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI  LA HG+  +
Sbjct: 241 ARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDE 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   +++PKLEHYGC++
Sbjct: 301 ALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMI 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGNVE+ E+A+E+LFKLEP NPG
Sbjct: 361 DLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASEALFKLEPTNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSG 519
           N VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S 
Sbjct: 421 NCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYFVEVGVDVHKFTVEDKSHPRSY 480

Query: 520 EIYALLHKIYDIIKLHK 536
           EIY +L +I+  +KL K
Sbjct: 481 EIYQVLEEIFRRMKLEK 497

BLAST of CsGy1G032120.1 vs. Swiss-Prot
Match: sp|Q9SZT8|PP354_ARATH (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ELI1 PE=3 SV=1)

HSP 1 Score: 329.7 bits (844), Expect = 6.2e-89
Identity = 236/506 (46.64%), Postives = 334/506 (66.01%), Query Frame = 0

Query: 37  SNGMDLMEQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQIPKPSVYLY 96
           S  +D + QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+
Sbjct: 39  SQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLF 98

Query: 97  NKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCK 156
              I T S  G   + +LLY Q+ S   +PN+++F+ L  +C++      G+++H+H  K
Sbjct: 99  TAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCST----KSGKLIHTHVLK 158

Query: 157 SGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXX 216
            G   D +  T L+D+YAK G + SA+++FD MP R + +XXXXXXXXXXXXXXXXXXXX
Sbjct: 159 FGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSXXXXXXXXXXXXXXXXXXXX 218

Query: 217 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGA 276
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    E   KP+E+++ + L ACSQ+GA
Sbjct: 219 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAEGKPKPDEITVVAALSACSQIGA 278

Query: 277 LDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIM 336
           L+ G+ I  + +++    N  V   +++++++CG++EEA  VF++   ++++ +WN MI 
Sbjct: 279 LETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDT-PRKDIVAWNAMIA 338

Query: 337 GLAVHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQV 396
           G A+HG   DAL+L+++M  I  ++P D+TF+G L AC H G+V EG ++FESM  ++ +
Sbjct: 339 GYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGI 398

Query: 397 APKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAE 456
            PK+EHYGCLV LLGRAG+L+ AY  I+NM M  DSV+W ++LG+C  HG+  LG+  AE
Sbjct: 399 KPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAE 458

Query: 457 SLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFI 516
            L  L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF 
Sbjct: 459 YLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEFR 518

Query: 517 VEDRSHLKSGEIYALLHKIYDIIKLH 535
             DR H KS EIY +L KI + IK H
Sbjct: 519 AGDREHSKSKEIYTMLRKISERIKSH 539

BLAST of CsGy1G032120.1 vs. Swiss-Prot
Match: sp|Q9SIL5|PP165_ARATH (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 326.6 bits (836), Expect = 5.2e-88
Identity = 230/498 (46.18%), Postives = 333/498 (66.87%), Query Frame = 0

Query: 44  EQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 103
           ++I+A  + +GL  + F++ K++    ++ D+ YA  LF+Q+  P+V+LYN  I+ ++  
Sbjct: 27  KKINASIIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQVSNPNVFLYNSIIRAYTHN 86

Query: 104 GHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFA 163
                   +Y Q+  +    P++++F F+F +CASL + Y G+ +H H CK G    +  
Sbjct: 87  SLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYLGKQVHGHLCKFGPRFHVVT 146

Query: 164 MTALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 223
             AL+DMY K   L  A ++FDEM  RD   XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 147 ENALIDMYMKFDDLVDAHKVFDEMYERDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 206

Query: 224 XXXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEA 283
           XXXXXXXXXXXXXXXXXXXXXXXX ++   G +P+E+S+ SVLP+C+QLG+L++GK I  
Sbjct: 207 XXXXXXXXXXXXXXXXXXXXXXXXEMQ-LAGIEPDEISLISVLPSCAQLGSLELGKWIHL 266

Query: 284 YARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCI 343
           YA   GF K   V NA++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG   
Sbjct: 267 YAERRGFLKQTGVCNALIEMYSKCGVISQAIQLFGQMEGK-DVISWSTMISGYAYHGNAH 326

Query: 344 DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCL 403
            A++ +++M   K++P+ +TF+GLL AC+H GM  EG + F+ M   +Q+ PK+EHYGCL
Sbjct: 327 GAIETFNEMQRAKVKPNGITFLGLLSACSHVGMWQEGLRYFDMMRQDYQIEPKIEHYGCL 386

Query: 404 VDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNP 463
           +D+L RAG+L+ A  + + MPM PDS IWG+LL +C   GN+++  VA + L +LEP + 
Sbjct: 387 IDVLARAGKLERAVEITKTMPMKPDSKIWGSLLSSCRTPGNLDVALVAMDHLVELEPEDM 446

Query: 464 GNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSG 523
           GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S     
Sbjct: 447 GNYVLLANIYADLGKWEDVSRLRKMIRNENMKKTPGGSLIEVNNIVQEFVSGDNSKPFWT 506

Query: 524 EIYALL-----HKIYDII 532
           EI  +L     H+  D+I
Sbjct: 507 EISIVLQLFTSHQDQDVI 522

BLAST of CsGy1G032120.1 vs. Swiss-Prot
Match: sp|Q9LS72|PP261_ARATH (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 1.0e-83
Identity = 222/586 (37.88%), Postives = 327/586 (55.80%), Query Frame = 0

Query: 13  PERTYVCVASVLTFSNGTRNLKSFSNGMDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDL- 72
           P R    V+S   F    ++L   +N ++ ++Q+HA  +R  L     +  KL+    L 
Sbjct: 6   PVRAPSWVSSRRIFEERLQDLPKCAN-LNQVKQLHAQIIRRNLHEDLHIAPKLISALSLC 65

Query: 73  ---PYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLF 132
                A  +F+Q+ +P+V+L N  I+  +    P++ + ++ +M   G   + +++ FL 
Sbjct: 66  RQTNLAVRVFNQVQEPNVHLCNSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLL 125

Query: 133 PACASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGML--RSARQLFDEMPVRD 192
            AC+    +   +M+H+H  K G +SD++   AL+D Y++ G L  R A +LF++M  RD
Sbjct: 126 KACSGQSWLPVVKMMHNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERD 185

Query: 193 IPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLEN 252
             +             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    
Sbjct: 186 TVSWNSMLGGLVKAGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 245

Query: 253 ------------------------------------------------------------ 312
                                                                       
Sbjct: 246 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 305

Query: 313 ---EKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCG 372
                G K +  ++ S+L AC++ G L +G RI +  + +    NAYV NA+L+++A+CG
Sbjct: 306 XXVASGLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAKCG 365

Query: 373 NIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLL 432
           N+++A  VF++I  K++L SWNTM+ GL VHG   +A++L+ +M    +RPD VTF+ +L
Sbjct: 366 NLKKAFDVFNDI-PKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVL 425

Query: 433 LACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPD 492
            +C H G++ EG   F SME  + + P++EHYGCLVDLLGR G L+EA  ++Q MPM P+
Sbjct: 426 CSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEPN 485

Query: 493 SVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKM 530
            VIWG LLGAC  H  V++ +   ++L KL+P +PGNY +LSNIYA A DW GVA +R  
Sbjct: 486 VVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRSK 545

BLAST of CsGy1G032120.1 vs. Swiss-Prot
Match: sp|Q9FMA1|PP433_ARATH (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 304.3 bits (778), Expect = 2.8e-81
Identity = 192/521 (36.85%), Postives = 304/521 (58.35%), Query Frame = 0

Query: 22  SVLTFSNG----TRNLKSFSNGMDLMEQIHAYSLRNGLDHTKFLIEKLLQ----LPDLPY 81
           + L+ S+G      +LK   N +  ++Q H Y +  GL+     + K ++       L Y
Sbjct: 6   NALSLSSGLNWFVTSLKIHGNNLKTLKQSHCYMIITGLNRDNLNVAKFIEACSNAGHLRY 65

Query: 82  ACTLFDQIPKPSVYLYNKFIQTFSSIGHPHR---CWLLYCQMCSQGCSPNQYSFTFLFPA 141
           A ++F   P P+ YL+N  I+  S +  P+       +Y ++ +    P+ ++F F+   
Sbjct: 66  AYSVFTHQPCPNTYLHNTMIRALSLLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKI 125

Query: 142 CASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTX 201
              + +V+ G+ +H      GF S +  +T L+ MY   G L  AR++FDEM V+D+   
Sbjct: 126 AVRVSDVWFGRQIHGQVVVFGFDSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVW 185

Query: 202 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG----LEN 261
                                          XXXXXXXXXXXXXXXXXXXXX     +EN
Sbjct: 186 NALLAGYGKVGEMDEARSLLEMMPCWVRNEVXXXXXXXXXXXXXXXXXXXXXFQRMLMEN 245

Query: 262 EKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIE 321
               +P+EV++ +VL AC+ LG+L++G+RI +Y  + G  +   ++NAV++++A+ GNI 
Sbjct: 246 ---VEPDEVTLLAVLSACADLGSLELGERICSYVDHRGMNRAVSLNNAVIDMYAKSGNIT 305

Query: 322 EAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLAC 381
           +A  VF E  ++RN+ +W T+I GLA HG   +AL ++++M+   +RP+DVTF+ +L AC
Sbjct: 306 KALDVF-ECVNERNVVTWTTIIAGLATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSAC 365

Query: 382 THGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVI 441
           +H G V  G++LF SM SK+ + P +EHYGC++DLLGRAG+L+EA  +I++MP   ++ I
Sbjct: 366 SHVGWVDLGKRLFNSMRSKYGIHPNIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAI 425

Query: 442 WGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKG 501
           WG+LL A + H ++ELGE A   L KLEP N GNY++L+N+Y+  G W     +R MMKG
Sbjct: 426 WGSLLAASNVHHDLELGERALSELIKLEPNNSGNYMLLANLYSNLGRWDESRMMRNMMKG 485

Query: 502 GHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKI 528
             + K AG S IEV + +++FI  D +H +   I+ +L ++
Sbjct: 486 IGVKKMAGESSIEVENRVYKFISGDLTHPQVERIHEILQEM 522

BLAST of CsGy1G032120.1 vs. TrEMBL
Match: tr|A0A0A0LY28|A0A0A0LY28_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G690140 PE=4 SV=1)

HSP 1 Score: 896.3 bits (2315), Expect = 3.3e-257
Identity = 493/497 (99.20%), Postives = 496/497 (99.80%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM
Sbjct: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
Sbjct: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV
Sbjct: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG
Sbjct: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE
Sbjct: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480

Query: 520 IYALLHKIYDIIKLHKH 537
           IYALLHKIYDIIKLHKH
Sbjct: 481 IYALLHKIYDIIKLHKH 497

BLAST of CsGy1G032120.1 vs. TrEMBL
Match: tr|A0A2N9IZX4|A0A2N9IZX4_FAGSY (RING-type E3 ubiquitin transferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57682 PE=4 SV=1)

HSP 1 Score: 625.2 bits (1611), Expect = 1.4e-175
Identity = 357/498 (71.69%), Postives = 418/498 (83.94%), Query Frame = 0

Query: 40   MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
            M+ ++QIHAY+LRNG+D+TK LI   LQ+P+L YA  +FD IPKP+V+LYNK IQ +S  
Sbjct: 1053 MNQLKQIHAYTLRNGIDYTKTLIVNSLQIPNLSYARKVFDLIPKPTVFLYNKLIQAYSCH 1112

Query: 100  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
            G  ++C  LY QMC QGC PNQ++FTFLF  CAS  ++  GQMLH+HF KSGF  D+FA+
Sbjct: 1113 GQYYQCMSLYSQMCIQGCPPNQHTFTFLFVTCASHSSLRHGQMLHAHFVKSGFEFDVFAL 1172

Query: 160  TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
            TAL+DMYAKLGML SARQ FDE+ V+DIPT XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1173 TALVDMYAKLGMLASARQKFDEIKVKDIPTWXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1232

Query: 220  XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
            XXXXXXXXXXXXXXXXXXXXXXX  E EK  +PNEV+IAS+LPAC+ LGAL++G+RIEAY
Sbjct: 1233 XXXXXXXXXXXXXXXXXXXXXXXXXEKEKDVRPNEVTIASILPACANLGALEVGERIEAY 1292

Query: 280  ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
            AR NGFFKN+YV+NAVLE++ARCG I+ A  VFDEIG +RNLCSWN+MIMGLAVHGRC +
Sbjct: 1293 ARRNGFFKNSYVANAVLEMYARCGKIDVAWHVFDEIGRRRNLCSWNSMIMGLAVHGRCNE 1352

Query: 340  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
            AL+LYD+ML     PDDVT VGLLLACTHGGMV +GRQLFESME+   + PKLEHYGC+V
Sbjct: 1353 ALELYDKMLGEGNAPDDVTLVGLLLACTHGGMVVKGRQLFESMETNLHITPKLEHYGCMV 1412

Query: 400  DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
            DLLGR GELQEA++LIQNMPM PDSV+WG LLGACSFHGNVEL E+AAESLFKLEPWNPG
Sbjct: 1413 DLLGRCGELQEAFDLIQNMPMKPDSVVWGALLGACSFHGNVELAEIAAESLFKLEPWNPG 1472

Query: 460  NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
            N+VILSNIYA AG W GVA+LRK+MKGG ITK AGYS+IE G  IH+FIV DRSH +  E
Sbjct: 1473 NFVILSNIYASAGQWDGVAKLRKLMKGGQITKAAGYSFIEEGGQIHKFIVGDRSHTRIEE 1532

Query: 520  IYALLHKIYDIIKLHKHS 538
            IYA L ++   + L +++
Sbjct: 1533 IYAFLDEVSKKMMLQRNA 1550

BLAST of CsGy1G032120.1 vs. TrEMBL
Match: tr|A0A2I4GS89|A0A2I4GS89_9ROSI (pentatricopeptide repeat-containing protein At5g08510 OS=Juglans regia OX=51240 GN=LOC109010380 PE=4 SV=1)

HSP 1 Score: 622.9 bits (1605), Expect = 7.1e-175
Identity = 356/496 (71.77%), Postives = 420/496 (84.68%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QIHAY+LRNG+DH K LI  LL +P+L YA  LFD IP P+V+LYNK IQT+SS 
Sbjct: 1   MNQLKQIHAYTLRNGIDHAKTLIVGLLNIPNLSYARKLFDLIPNPTVFLYNKLIQTYSSH 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           G  +RC  LY QMC QGC PNQ++FTF+F  CA+L +   GQMLH+HF KSGF SD+FA+
Sbjct: 61  GQYYRCMSLYSQMCLQGCPPNQHTFTFVFATCAALSSPCHGQMLHTHFVKSGFESDVFAL 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TAL+DMYAKLGML SARQ FDE+ VRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TALVDMYAKLGMLASARQKFDEIKVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX    EK  +PNEV+IASVLPAC+ LGAL++G+RIEAY
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXKEKDMRPNEVTIASVLPACANLGALEVGERIEAY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           AR NGFFKN YVSNAVLE++ RCG I+ A +VFDEIG  R+LCSWN+MI+GLAVHG+C +
Sbjct: 241 ARKNGFFKNLYVSNAVLEMYVRCGKIDIAWRVFDEIGGCRSLCSWNSMIVGLAVHGQCNE 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           AL+LYDQML   + PDDVTFVGLLLACTHGG+V +GRQLF+ M + F +APKLEHYGC+V
Sbjct: 301 ALELYDQMLREGIAPDDVTFVGLLLACTHGGLVIKGRQLFQLMLTNFHIAPKLEHYGCMV 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGR+G+LQEAY+LI+ MPM PDSV+WG LLGACSFHGN+EL E+AAESLF+LEPWNPG
Sbjct: 361 DLLGRSGDLQEAYDLIKGMPMKPDSVVWGALLGACSFHGNIELAEIAAESLFQLEPWNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYA AG W+GVA+LRK+MKGG +TK AGYS+IE G  IH+FIVEDRSH +  E
Sbjct: 421 NYVILSNIYASAGQWAGVAKLRKLMKGGLVTKAAGYSFIEEGGQIHKFIVEDRSHPRCDE 480

Query: 520 IYALLHKIYDIIKLHK 536
           IY LL  ++  ++L +
Sbjct: 481 IYVLLDGVFAEMRLQR 496

BLAST of CsGy1G032120.1 vs. TrEMBL
Match: tr|F6I6G7|F6I6G7_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_15s0046g00030 PE=4 SV=1)

HSP 1 Score: 610.5 bits (1573), Expect = 3.7e-171
Identity = 353/495 (71.31%), Postives = 416/495 (84.04%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QI AY+LRNG++HTK LI  LLQ+P +PYA  LFD IPKP+V+LYNK IQ +SS 
Sbjct: 1   MNRLKQIQAYTLRNGIEHTKQLIVSLLQIPSIPYAHKLFDFIPKPTVFLYNKLIQAYSSH 60

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           G  H+C+ LY QMC QGCSPN++SFTFLF ACASL +   G+MLH+HF KSGF  D+FA+
Sbjct: 61  GPHHQCFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFAL 120

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TAL+DMYAKLG+L  AR+ FDEM VRD+PTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 TALVDMYAKLGLLSLARKQFDEMTVRDVPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX    E   +PNEV++ASVLPAC+ LGAL++G+RIE Y
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXEETEMRPNEVTLASVLPACANLGALEVGERIEVY 240

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           AR NG+FKN YVSNA+LE++ARCG I++A  VF+EI  +RNLCSWN+MIMGLAVHGRC +
Sbjct: 241 ARGNGYFKNLYVSNALLEMYARCGRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDE 300

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           A++L+ +ML     PDDVTFVG+LLACTHGGMV EG+  FESME  F +APKLEHYGC+V
Sbjct: 301 AIELFYKMLREGAAPDDVTFVGVLLACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMV 360

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGRAGEL+EA++LI  MPM PDSV+WGTLLGACSFHG+VEL E AA +LF+LEP NPG
Sbjct: 361 DLLGRAGELREAHDLILRMPMEPDSVVWGTLLGACSFHGHVELAEKAAGALFELEPSNPG 420

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYA AG W GVARLRK+MKGG ITK AGYS+IE G  IH+FIVEDRSH +S E
Sbjct: 421 NYVILSNIYATAGRWDGVARLRKLMKGGKITKAAGYSFIEEGGHIHKFIVEDRSHSRSDE 480

Query: 520 IYALLHKIYDIIKLH 535
           IYALL ++   +KLH
Sbjct: 481 IYALLDEVSMKMKLH 495

BLAST of CsGy1G032120.1 vs. TrEMBL
Match: tr|A0A2P5AR79|A0A2P5AR79_9ROSA (Tetratricopeptide-like helical domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_343820 PE=4 SV=1)

HSP 1 Score: 600.1 bits (1546), Expect = 4.9e-168
Identity = 346/494 (70.04%), Postives = 412/494 (83.40%), Query Frame = 0

Query: 40  MDLMEQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 99
           M+ ++QIHAY++RNG+D+T  L+ KLL++P++PYA  LFD IPKP+V+LYNK I+ +SS 
Sbjct: 18  MNQLKQIHAYTVRNGIDYTDTLVLKLLEIPNIPYAHNLFDLIPKPTVFLYNKLIKAYSSH 77

Query: 100 GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 159
           G  H+C  LY +M  Q C PN+ SFT LF ACASL +   GQM+HS F KSG A D FA 
Sbjct: 78  GQHHQCLSLYTRMSLQRCVPNERSFTLLFSACASLSSPRLGQMIHSRFVKSGLALDGFAE 137

Query: 160 TALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           TAL+DMYAKLGML  ARQ FDEM VRDIPT  XXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 138 TALVDMYAKLGMLACARQQFDEMRVRDIPTWNXXXXXXXXXXXXXXXXXXXXXXXXXXXX 197

Query: 220 XXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 279
           XXXXXXXXXXXXXXXXXXXXXXX    E+  KPNEV+IASVLPAC+ LGAL+IG+RIE Y
Sbjct: 198 XXXXXXXXXXXXXXXXXXXXXXXXXXXERDVKPNEVTIASVLPACANLGALEIGERIEEY 257

Query: 280 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 339
           +R + FF+N++VSNA+LE++ARCG I+ A++VFDEIGS+RNLCSWN+MIMGLAVHGRC +
Sbjct: 258 SRRSRFFENSHVSNAILEMYARCGKIDIARRVFDEIGSRRNLCSWNSMIMGLAVHGRCRE 317

Query: 340 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 399
           AL LY+QML   +RPDDVTFVGL+LACTHGGMV++GRQLF+SME  F +APKLEHYGC+V
Sbjct: 318 ALNLYEQMLTVGIRPDDVTFVGLILACTHGGMVSKGRQLFQSMEPNFSIAPKLEHYGCMV 377

Query: 400 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 459
           DLLGRAGEL+EAY+LIQ+MPM PD+VIWG LLGACSFHGN++  E AAESLF+LEPWNP 
Sbjct: 378 DLLGRAGELEEAYDLIQDMPMKPDNVIWGALLGACSFHGNIKFAEKAAESLFELEPWNPA 437

Query: 460 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 519
           NYVILSNIYA  G W GVA+LRK+MKGG ITK AGYS+IE    +H FIV+D+SH +S E
Sbjct: 438 NYVILSNIYASGGRWDGVAKLRKVMKGGKITKAAGYSFIEERGQVHMFIVDDKSHPRSHE 497

Query: 520 IYALLHKIYDIIKL 534
           +YALL   Y  ++L
Sbjct: 498 MYALLDGFYTKVRL 511

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011660274.15.0e-25799.20PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativu... [more]
XP_022979295.11.1e-22788.18pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxi... [more]
XP_023527365.14.6e-22687.58pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pep... [more]
XP_022147487.14.0e-22286.69pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Momordica char... [more]
XP_022147489.14.0e-22286.69pentatricopeptide repeat-containing protein At5g08510 isoform X2 [Momordica char... [more]
Match NameE-valueIdentityDescription
AT5G08510.13.4e-13858.95Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G37380.13.4e-9046.64Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G20540.12.9e-8946.18mitochondrial editing factor 21[more]
AT3G29230.15.6e-8537.88Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G56310.11.5e-8236.85Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9FNN7|PP371_ARATH6.1e-13758.95Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana OX... [more]
sp|Q9SZT8|PP354_ARATH6.2e-8946.64Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t... [more]
sp|Q9SIL5|PP165_ARATH5.2e-8846.18Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX... [more]
sp|Q9LS72|PP261_ARATH1.0e-8337.88Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX... [more]
sp|Q9FMA1|PP433_ARATH2.8e-8136.85Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LY28|A0A0A0LY28_CUCSA3.3e-25799.20Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G690140 PE=4 SV=1[more]
tr|A0A2N9IZX4|A0A2N9IZX4_FAGSY1.4e-17571.69RING-type E3 ubiquitin transferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57682... [more]
tr|A0A2I4GS89|A0A2I4GS89_9ROSI7.1e-17571.77pentatricopeptide repeat-containing protein At5g08510 OS=Juglans regia OX=51240 ... [more]
tr|F6I6G7|F6I6G7_VITVI3.7e-17171.31Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_15s0046g00030 PE=4 SV=... [more]
tr|A0A2P5AR79|A0A2P5AR79_9ROSA4.9e-16870.04Tetratricopeptide-like helical domain containing protein OS=Trema orientalis OX=... [more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsGy1G032120CsGy1G032120gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsGy1G032120.1CsGy1G032120.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy1G032120.1.five_prime_UTR.1CsGy1G032120.1.five_prime_UTR.1five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy1G032120.1.exon.4CsGy1G032120.1.exon.4exon
CsGy1G032120.1.exon.3CsGy1G032120.1.exon.3exon
CsGy1G032120.1.exon.2CsGy1G032120.1.exon.2exon
CsGy1G032120.1.exon.1CsGy1G032120.1.exon.1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy1G032120.1.CDS.4CsGy1G032120.1.CDS.4CDS
CsGy1G032120.1.CDS.3CsGy1G032120.1.CDS.3CDS
CsGy1G032120.1.CDS.2CsGy1G032120.1.CDS.2CDS
CsGy1G032120.1.CDS.1CsGy1G032120.1.CDS.1CDS


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 320..366
e-value: 7.1E-8
score: 32.4
coord: 84..132
e-value: 2.1E-8
score: 34.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 189..217
e-value: 1.1E-8
score: 34.6
coord: 293..317
e-value: 0.057
score: 13.6
coord: 394..418
e-value: 0.28
score: 11.4
coord: 219..242
e-value: 7.0E-6
score: 25.8
coord: 159..186
e-value: 1.4E-4
score: 21.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 323..355
e-value: 4.2E-6
score: 24.6
coord: 160..187
e-value: 3.6E-4
score: 18.5
coord: 87..120
e-value: 2.5E-5
score: 22.2
coord: 219..253
e-value: 4.9E-5
score: 21.2
coord: 189..216
e-value: 3.1E-7
score: 28.2
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 355..385
score: 7.815
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 221..252
score: 7.191
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 457..491
score: 5.821
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 253..287
score: 5.382
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 155..185
score: 8.934
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 186..220
score: 11.992
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 391..421
score: 6.73
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 85..119
score: 9.175
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 288..318
score: 8.122
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 320..354
score: 10.424
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 423..453
score: 5.119
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 339..499
e-value: 4.5E-25
score: 90.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 139..281
e-value: 3.8E-31
score: 109.8
NoneNo IPR availablePANTHERPTHR24015:SF148SUBFAMILY NOT NAMEDcoord: 53..522
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 53..522