CsGy2G007450 (gene) Cucumber (Gy14) v2

Name: CsGy2G007450
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330
Location: Chr2 : 5996569 .. 6000412 (+)
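
The location field packs the chromosome, start, end, and strand into one string. As a minimal sketch, it could be parsed as below. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr2 : 5996569 .. 6000412 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr2 : 5996569 .. 6000412 (+)")
print(chrom, strand, span_length(start, end))  # Chr2 + 3844
```

Under that assumption the feature would span 3,844 bp on the plus strand of Chr2.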
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGGTGGATGAATCTAAGCAGTTCTTGCTTTCCTTCTCCTGCTTTTCTGAAACTTTCTCATTCTATTTCTCAAGGTACAATGACCCATAAAATCATATCATTCAACTTGTCTGAGCATCACTTGTTCAAGTCATTTTCCTACCACACTTCAAATCATTTTTCATCCAATACCCTTCATGCCAAAATGGTCAAGATTGGTTCTATTTTTGTATCAGGCAAGTTTGTTTTGACCTCTTATGTAAAATCTGAGAAATTAAACGATGCTCAGAAACTGTTCGACGAAATGCCCAATAGAGATGTACTTACATGGACGGCCCTTATATCGGGTTTTTCTAGAGTCAATTCTTCTGGAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGGTGTTTCTCCAAATCACTTTACTTTGTCTACTGTTCTTAAACTTTGCTCTAAAGTAGGTGATGTGCGAATGGGTAAGGGAATTCATGGATGGATACTGAGAAATGGGGTTAAATTAGATGTTGTCTTGGAGAATTCTATGCTTGATTTGTATGCTAAGTTTGATGAATTTGTTTATGCCAGAAAGTTGTATGATTCAATGAGAGAAAAGAGTACTGATACTGACAACATAATACTTGGTGTGTACGTCCGTAGTTGTGATGTTAACAAATCTCTTCATTTATTCAGAAACTTGCCCTGCAGAAATGCTGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATTTGAATGCAGCATTGGAGCTACTCTATGAAATGGTGGAGAACGAATCTGAGTTTAACAATTTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTCTTGAGCTAGGTAGACAGGTACATGGCCGAATTGTCAGGTGTGGTCTTCATAATGATGGATTTGTAAAGAGTGCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCATCAGTGATATATAGTCGACTGCCTTCAGGTTTTGCAACAAAACAAGGTTCCAATATTGTATGCAGTGACACGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTATGGATATGTCCGAAATGGCAAATATGAAGATGCCTTCAAAACTTTTGTGTCTATGGTCCGTGAACGGGTTCTAATGGACAAATTTACCATTGCAAATGTTGTGTCTGCTTGTTCTAATGCTGGTGTTTTGGAGCTTGGACGTCAAGTCCATGGATTCATTCATAAAACTGTGGAACAACTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGACCAAATGACCAATTACTTAAATGTTGTAATATGGACTTCCATGATCGTTGGATGTGCTTTACACGGGCATGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATTATACCAAATGAGGTCACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCGGGGCTGCTTGAAGATGGTCATCTATATTTTAATATGATGAAAGATGTTTATGCAATCAAGCCTAAGGTTGAGCATTACACTTGTATGGTAGATCTTTACGGCCGAGCTGGACTCTTGAATGAAGTCAAAGAATTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTCTCATCCTGTCGGCTTTACAGGGACCTTGAAATGGGGAAGTGGGTTTCTGAAAAATTGTTTAGACTCAAACCACAAGATGAAGGGTCTTACGTTTTACTATCAAACATGTGCTCCGGCAGTCAAAAGTGGGAAGAAGCTTCAAGAGCAAGAAGATCTATGCAACACAGTGGGATTAACAAAACACCTGGTCAATCTTGGATTCATTTGAAAAATCAAGTCCACTCTTTCGTTGCAGGAGACCAATCACACCCTCAACATGCTCAAATATATGAATATCTGGACAAGCTAATTGGAAGGTTGAAGGAAATCGGGTA
CTTGCATGATGTGAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGTATAATCAGCTTGGGTTCTGCCATTCCAATCCGAATCATGAAGAACCTTCGGATATGTACTGATTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGAGAGATCATTGTTCGAGATATTTATCGTTTCCATCATTTCAATTCCGGTCATTGCTCTTGTGGTGATTATTGGTGAGCTGGGAAAGAGAATCCAATGCATAACAGAGAAAGCAGAACTTGGGAGAAAATCTTACTAACTGCAGTAAGTTCCTTTATTTCTGTACAAAATATTTACATCAACCAAGCTTGGTATTTATAGATGCTTGGTAGTAAATAATAACTGTCTAACAGATTAACTATTAACTGTGCAACAGAAAAATAACAATCTAACAACTCTATTGTTAATAATACACATCAATTTTCATGGATAATACTAAGTTAATCTGCCTGGAGTTCACAACGCAGCTGGGGGAAGCAAGACAATAAATTCATTAGAAAAAGCCAAAATCACATTATTCTTCTTCTTAGATGTATCAACGCTGTGCAGTTCCGAAACCTGAATTGCATATAACAGTGAACTTCACAAGTTACATCCCTGAACTGAAGGGTTGAAGTTAATTCTAGGTAGTTCAACCCATATATATACAAACGGGATACCATGAACGTGATTGGAGATTTATGTAAATGAATGAGCCAGCAATTACCTGGTGAACTAAGTGACAACCTTTACCACTTGATTTCCATCAGGAAAGAATAGTGAATTGAAGATACTATGGTCTGAACCCCAATGGATCAGTCTTACCACCACCTCAACAACAGTTTCAGAAACAAAATGTTAATCATATTGATGTTTAACCGGAAGAGTGGAGAAGTATTGCTTACAGAAATCCTCTTACACATTGTAAGAATCATGTTATGACTTTTGATAACTTTATTAGTGGTTTTGAAGGAAGTGTTTAAAAATAGTTTAAAGTGCTCCAGAAATTTTCAAGACATTTTTAATGGAAAGCACTTTCTATTCTTCAAATTCCCTCTAGGCACTTCAACAGTGCTTATGAGGCACATTGAAGAAATGCAAGGGCAGAAGATTCTTAAGTGTTAACATGTTGAACATTTTTCTCAATCACTTGTGTACTCATGACAGGGCAAATGAATGTTTTTCTATGATTAGAAACATTAATCGTCATTGATTTTGATATCTTTTTTTTCTGGATAACTTCTATATGTTATTTCTCTTATGACAGAAAAGAAAAAAGTACCATCGTATTGTTGATATTCTCCTCCTACCAGTTTTTAATGCAGTGGCACAGAAGCTATTGTTTGTGAAGAAGCAAATGGAAGTAGAGTACATACATAACAGTTCCAAAAAGGAAATCGATATGCAATCACAAAGGATCTAACAAAGAGGTTTGGCAGAAAATATACAATTTTATGTTTCCATAATACAGACATGGAAGTTGCATTGTAGAAACAGAAAATGATACAAAATCTACTTGACGTAGTAAGGGAGGTCCAAGACATAGAGGAGGAAGTATGCCCAGATCACACCGGAGATACCACCGAAGAAAAACCCGCCTGAGAATTTTGCCCACCCATCGGCTGTTTGGAGAGGATCAGGGGTCTTCTTCCGGCCAGTCAAAGTCAATGATGGTGCGGTAGATGGCTCTCCTTCATTGAATGATGCAACACCATACATTGTCAAGCAGACGCTCAAAATGACAATGAGACCCCCAGCAGCCAGCGAGCCTGCTCCTCCGGCGTAG

mRNA sequence

ATGAGGTGGATGAATCTAAGCAGTTCTTGCTTTCCTTCTCCTGCTTTTCTGAAACTTTCTCATTCTATTTCTCAAGGTACAATGACCCATAAAATCATATCATTCAACTTGTCTGAGCATCACTTGTTCAAGTCATTTTCCTACCACACTTCAAATCATTTTTCATCCAATACCCTTCATGCCAAAATGGTCAAGATTGGTTCTATTTTTGTATCAGGCAAGTTTGTTTTGACCTCTTATGTAAAATCTGAGAAATTAAACGATGCTCAGAAACTGTTCGACGAAATGCCCAATAGAGATGTACTTACATGGACGGCCCTTATATCGGGTTTTTCTAGAGTCAATTCTTCTGGAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGGTGTTTCTCCAAATCACTTTACTTTGTCTACTGTTCTTAAACTTTGCTCTAAAGTAGGTGATGTGCGAATGGGTAAGGGAATTCATGGATGGATACTGAGAAATGGGGTTAAATTAGATGTTGTCTTGGAGAATTCTATGCTTGATTTGTATGCTAAGTTTGATGAATTTGTTTATGCCAGAAAGTTGTATGATTCAATGAGAGAAAAGAGTACTGATACTGACAACATAATACTTGGTGTGTACGTCCGTAGTTGTGATGTTAACAAATCTCTTCATTTATTCAGAAACTTGCCCTGCAGAAATGCTGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATTTGAATGCAGCATTGGAGCTACTCTATGAAATGGTGGAGAACGAATCTGAGTTTAACAATTTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTCTTGAGCTAGGTAGACAGGTACATGGCCGAATTGTCAGGTGTGGTCTTCATAATGATGGATTTGTAAAGAGTGCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCATCAGTGATATATAGTCGACTGCCTTCAGGTTTTGCAACAAAACAAGGTTCCAATATTGTATGCAGTGACACGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTATGGATATGTCCGAAATGGCAAATATGAAGATGCCTTCAAAACTTTTGTGTCTATGGTCCGTGAACGGGTTCTAATGGACAAATTTACCATTGCAAATGTTGTGTCTGCTTGTTCTAATGCTGGTGTTTTGGAGCTTGGACGTCAAGTCCATGGATTCATTCATAAAACTGTGGAACAACTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGACCAAATGACCAATTACTTAAATGTTGTAATATGGACTTCCATGATCGTTGGATGTGCTTTACACGGGCATGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATTATACCAAATGAGGTCACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCGGGGCTGCTTGAAGATGGTCATCTATATTTTAATATGATGAAAGATGTTTATGCAATCAAGCCTAAGGTTGAGCATTACACTTGTATGGTAGATCTTTACGGCCGAGCTGGACTCTTGAATGAAGTCAAAGAATTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTCTCATCCTGTCGGCTTTACAGGGACCTTGAAATGGGGAAGTGGGTTTCTGAAAAATTGTTTAGACTCAAACCACAAGATGAAGGGTCTTACGTTTTACTATCAAACATGTGCTCCGGCAGTCAAAAGTGGGAAGAAGCTTCAAGAGCAAGAAGATCTATGCAACACAGTGGGATTAACAAAACACCTGGTCAATCTTGGATTCATTTGAAAAATCAAGTCCACTCTTTCGTTGCAGGAGACCAATCACACCCTCAACATGCTCAAATATATGAATATCTGGACAAGCTAATTGGAAGGTTGAAGGAAATCGGGTA
CTTGCATGATGTGAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGTATAATCAGCTTGGGTTCTGCCATTCCAATCCGAATCATGAAGAACCTTCGGATATGTACTGATTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGAGAGATCATTGTTCGAGATATTTATCGTTTCCATCATTTCAATTCCGGCACTTCAACAGTGCTTATGAGGCACATTGAAGAAATGCAAGGGCAGAAGATTCTTAATGGCACAGAAGCTATTGTTTGTGAAGAAGCAAATGGAAGTAGAATCACACCGGAGATACCACCGAAGAAAAACCCGCCTGAGAATTTTGCCCACCCATCGGCTGTTTGGAGAGGATCAGGGGTCTTCTTCCGGCCAGTCAAAGTCAATGATGGTGCGGTAGATGGCTCTCCTTCATTGAATGATGCAACACCATACATTGTCAAGCAGACGCTCAAAATGACAATGAGACCCCCAGCAGCCAGCGAGCCTGCTCCTCCGGCGTAG

Coding sequence (CDS)

ATGAGGTGGATGAATCTAAGCAGTTCTTGCTTTCCTTCTCCTGCTTTTCTGAAACTTTCTCATTCTATTTCTCAAGGTACAATGACCCATAAAATCATATCATTCAACTTGTCTGAGCATCACTTGTTCAAGTCATTTTCCTACCACACTTCAAATCATTTTTCATCCAATACCCTTCATGCCAAAATGGTCAAGATTGGTTCTATTTTTGTATCAGGCAAGTTTGTTTTGACCTCTTATGTAAAATCTGAGAAATTAAACGATGCTCAGAAACTGTTCGACGAAATGCCCAATAGAGATGTACTTACATGGACGGCCCTTATATCGGGTTTTTCTAGAGTCAATTCTTCTGGAATGGCATTGCAACTGTTTAGAGAAATGCTGGTTGAAGGTGTTTCTCCAAATCACTTTACTTTGTCTACTGTTCTTAAACTTTGCTCTAAAGTAGGTGATGTGCGAATGGGTAAGGGAATTCATGGATGGATACTGAGAAATGGGGTTAAATTAGATGTTGTCTTGGAGAATTCTATGCTTGATTTGTATGCTAAGTTTGATGAATTTGTTTATGCCAGAAAGTTGTATGATTCAATGAGAGAAAAGAGTACTGATACTGACAACATAATACTTGGTGTGTACGTCCGTAGTTGTGATGTTAACAAATCTCTTCATTTATTCAGAAACTTGCCCTGCAGAAATGCTGCGAGTTGGAATACAATTATATGTGGGCTAATGCAAGGTGGGTATTTGAATGCAGCATTGGAGCTACTCTATGAAATGGTGGAGAACGAATCTGAGTTTAACAATTTTACTTCTTCCATAGCTTTGAGTGTGGTTTCTTCTTTACTGATTCTTGAGCTAGGTAGACAGGTACATGGCCGAATTGTCAGGTGTGGTCTTCATAATGATGGATTTGTAAAGAGTGCACTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAAGCATCAGTGATATATAGTCGACTGCCTTCAGGTTTTGCAACAAAACAAGGTTCCAATATTGTATGCAGTGACACGATGACAGAAATTGTTTCGCGGAGCTCAATGGTGTATGGATATGTCCGAAATGGCAAATATGAAGATGCCTTCAAAACTTTTGTGTCTATGGTCCGTGAACGGGTTCTAATGGACAAATTTACCATTGCAAATGTTGTGTCTGCTTGTTCTAATGCTGGTGTTTTGGAGCTTGGACGTCAAGTCCATGGATTCATTCATAAAACTGTGGAACAACTTGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCTAAAGGTGGGAGTTTGGATTGTGCCCATCGAATTTTTGACCAAATGACCAATTACTTAAATGTTGTAATATGGACTTCCATGATCGTTGGATGTGCTTTACACGGGCATGGTAAGGAAGCCATTAGACTGTTTGAACAGATGAGATATGAGGGAATTATACCAAATGAGGTCACTTTTATAGGAGTTTTAACAGCTTGCAGTCATGCGGGGCTGCTTGAAGATGGTCATCTATATTTTAATATGATGAAAGATGTTTATGCAATCAAGCCTAAGGTTGAGCATTACACTTGTATGGTAGATCTTTACGGCCGAGCTGGACTCTTGAATGAAGTCAAAGAATTCATCTATGAGAATGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTCTCATCCTGTCGGCTTTACAGGGACCTTGAAATGGGGAAGTGGGTTTCTGAAAAATTGTTTAGACTCAAACCACAAGATGAAGGGTCTTACGTTTTACTATCAAACATGTGCTCCGGCAGTCAAAAGTGGGAAGAAGCTTCAAGAGCAAGAAGATCTATGCAACACAGTGGGATTAACAAAACACCTGGTCAATCTTGGATTCATTTGAAAAATCAAGTCCACTCTTTCGTTGCAGGAGACCAATCACACCCTCAACATGCTCAAATATATGAATATCTGGACAAGCTAATTGGAAGGTTGAAGGAAATCGGGTA
CTTGCATGATGTGAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTGGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGTATAATCAGCTTGGGTTCTGCCATTCCAATCCGAATCATGAAGAACCTTCGGATATGTACTGATTGTCATAACTTTATGAAGTTAACATCTCAACTTTTAGGCAGAGAGATCATTGTTCGAGATATTTATCGTTTCCATCATTTCAATTCCGGCACTTCAACAGTGCTTATGAGGCACATTGAAGAAATGCAAGGGCAGAAGATTCTTAATGGCACAGAAGCTATTGTTTGTGAAGAAGCAAATGGAAGTAGAATCACACCGGAGATACCACCGAAGAAAAACCCGCCTGAGAATTTTGCCCACCCATCGGCTGTTTGGAGAGGATCAGGGGTCTTCTTCCGGCCAGTCAAAGTCAATGATGGTGCGGTAGATGGCTCTCCTTCATTGAATGATGCAACACCATACATTGTCAAGCAGACGCTCAAAATGACAATGAGACCCCCAGCAGCCAGCGAGCCTGCTCCTCCGGCGTAG

Protein sequence

MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLHAKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGTSTVLMRHIEEMQGQKILNGTEAIVCEEANGSRITPEIPPKKNPPENFAHPSAVWRGSGVFFRPVKVNDGAVDGSPSLNDATPYIVKQTLKMTMRPPAASEPAPPA
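
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first seven codons only; it assumes the standard nuclear genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical rather than part of any tool used to build this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 nucleotides of the CDS listed on this page.
print(translate("ATGAGGTGGATGAATCTAAGC"))  # MRWMNLS
```

The result matches the first seven residues of the protein sequence. Passing the full CDS, with its line break removed, is the natural way to check the rest.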

BLAST of CsGy2G007450 vs. NCBI nr
Match: KGN61262.1 (hypothetical protein Csa_2G074230 [Cucumis sativus])

HSP 1 Score: 1513.4 bits (3917), Expect = 0.0e+00
Identity = 749/750 (99.87%), Positives = 749/750 (99.87%), Query Frame = 0

Query: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60
           MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH
Sbjct: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60

Query: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120
           AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA
Sbjct: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120

Query: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180
           LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL
Sbjct: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180

Query: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240
           YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII
Sbjct: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240

Query: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300
           CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH
Sbjct: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300

Query: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGY 360
           NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQ SNIVCSDTMTEIVSRSSMVYGY
Sbjct: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQSSNIVCSDTMTEIVSRSSMVYGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420
           VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA
Sbjct: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480
           HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480

Query: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540
           EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN
Sbjct: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600

Query: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660
           GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG
Sbjct: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660

Query: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720
           RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD
Sbjct: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           CHNFMKLTSQLLGREIIVRDIYRFHHFNSG
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSG 750

BLAST of CsGy2G007450 vs. NCBI nr
Match: XP_011648996.1 (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 [Cucumis sativus])

HSP 1 Score: 1439.1 bits (3724), Expect = 0.0e+00
Identity = 719/743 (96.77%), Positives = 723/743 (97.31%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSE-------HHLFKSFSYHTSNHFSSNTLHAKMVKIG 78
            L+  ISQGT+        + F+LS        HHLFKSFSYHTSNHFSSNTLHAKMVKIG
Sbjct: 267  LASKISQGTVATVGGLLFLGFSLSSYFFPPLXHHLFKSFSYHTSNHFSSNTLHAKMVKIG 326

Query: 79   SIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREM 138
            SIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREM
Sbjct: 327  SIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREM 386

Query: 139  LVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEF 198
            LVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEF
Sbjct: 387  LVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEF 446

Query: 199  VYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGG 258
            VYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGG
Sbjct: 447  VYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGG 506

Query: 259  YLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKS 318
            YLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKS
Sbjct: 507  YLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKS 566

Query: 319  ALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYE 378
            ALINMYIKCGNLEKASVIYSRLPSGFATKQ SNIVCSDTMTEIVSRSSMVYGYVRNGKYE
Sbjct: 567  ALINMYIKCGNLEKASVIYSRLPSGFATKQSSNIVCSDTMTEIVSRSSMVYGYVRNGKYE 626

Query: 379  DAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLI 438
            DAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLI
Sbjct: 627  DAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLI 686

Query: 439  DMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNE 498
            DMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNE
Sbjct: 687  DMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNE 746

Query: 499  VTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIY 558
            VTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIY
Sbjct: 747  VTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIY 806

Query: 559  ENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEE 618
            ENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEE
Sbjct: 807  ENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEE 866

Query: 619  ASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGY 678
            ASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGY
Sbjct: 867  ASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGY 926

Query: 679  LHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKL 738
            LHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKL
Sbjct: 927  LHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKL 986

Query: 739  TSQLLGREIIVRDIYRFHHFNSG 751
            TSQLLGREIIVRDIYRFHHFNSG
Sbjct: 987  TSQLLGREIIVRDIYRFHHFNSG 1009

BLAST of CsGy2G007450 vs. NCBI nr
Match: XP_008441858.1 (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 [Cucumis melo])

HSP 1 Score: 1376.3 bits (3561), Expect = 0.0e+00
Identity = 688/738 (93.22%), Positives = 704/738 (95.39%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSEHHL--FKSFSYHTSNHFSSNTLHAKMVKIGSIFVS 78
            L+  ISQGT+        + F+LS +       F YHTSN FSSNTLHAKMVKIGSI  S
Sbjct: 267  LASKISQGTVATVGGLLFLGFSLSSYFFPPLXKFCYHTSNSFSSNTLHAKMVKIGSIIES 326

Query: 79   GKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGV 138
            GKFVLTSYVKS+KLNDAQKLFDEMPNRDVLTWTA+ISGFSRVN SGMALQLFREMLVEGV
Sbjct: 327  GKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWTAIISGFSRVNCSGMALQLFREMLVEGV 386

Query: 139  SPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARK 198
             PNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENS+LDLYAKFDEFVYARK
Sbjct: 387  CPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSLLDLYAKFDEFVYARK 446

Query: 199  LYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 258
            LYDSM EKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA
Sbjct: 447  LYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 506

Query: 259  LELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 318
            LELLYEMVENESEFNNFTSSIALSV SSLLILELGRQVHGRIVRCGLHNDGFVKSALINM
Sbjct: 507  LELLYEMVENESEFNNFTSSIALSVASSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 566

Query: 319  YIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 378
            YIKCGNLEKASVIYS+LPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT
Sbjct: 567  YIKCGNLEKASVIYSQLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 626

Query: 379  FVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDMYAK 438
            FVSMVRERVLMDKFTIA+VVSAC+NAGVLELGRQVHGFI K+VEQLDAHLASSLIDMYAK
Sbjct: 627  FVSMVRERVLMDKFTIASVVSACANAGVLELGRQVHGFIQKSVEQLDAHLASSLIDMYAK 686

Query: 439  GGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVTFIG 498
            GGSLDCAHRIFDQMT YLNVVIWTSMIVGC+LHGHGKEAIRLFEQMRYEGIIPNEVTFIG
Sbjct: 687  GGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLHGHGKEAIRLFEQMRYEGIIPNEVTFIG 746

Query: 499  VLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS 558
            VLTACSHAGLLEDG LYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS
Sbjct: 747  VLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS 806

Query: 559  HLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEASRAR 618
            HLS VWKAFLSSC LYRDLEMGKWVSEKLFRL+PQDEGSYVLLSNMCSGSQKW+EASRAR
Sbjct: 807  HLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLEPQDEGSYVLLSNMCSGSQKWQEASRAR 866

Query: 619  RSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLHDVK 678
             SMQHSGINKTPGQSWIHLKNQVHSFVAGD+SHPQHAQIYEYLDKLIGRLKEIGYLHDVK
Sbjct: 867  SSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLHDVK 926

Query: 679  LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL 738
            LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL
Sbjct: 927  LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL 986

Query: 739  GREIIVRDIYRFHHFNSG 751
            GREIIVRDI RFHHFNSG
Sbjct: 987  GREIIVRDICRFHHFNSG 1004

BLAST of CsGy2G007450 vs. NCBI nr
Match: XP_022929759.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita moschata])

HSP 1 Score: 1234.6 bits (3193), Expect = 0.0e+00
Identity = 619/741 (83.54%), Positives = 669/741 (90.28%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSEHHLFKSF-----SYHTSNHFSSNTLHAKMVKIGSI 78
            L+  ISQGT+        + F+LS +     F     ++H+SN    NTLHAKMVK GSI
Sbjct: 268  LASKISQGTVATVGGLLFLGFSLSSYFFPPLFLVALENFHSSNDSLPNTLHAKMVKNGSI 327

Query: 79   FVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLV 138
            F S KF+L+SYVKSEKLNDA+K+FDEMP+RDVLTWT LISGF+RVN S MALQLFREMLV
Sbjct: 328  FESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLV 387

Query: 139  EGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVY 198
            EGV PN FTLSTVLKLCS+VGDV+MGKGIHGWILR+GV LDVVLENSMLDLYAKFDEF Y
Sbjct: 388  EGVCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGVSLDVVLENSMLDLYAKFDEFDY 447

Query: 199  ARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYL 258
              KL+DSMREKST T NI+LGV+VRS DVNKSL LFRNLPCR+ ASWNT+ICGLMQGGYL
Sbjct: 448  VTKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRNLPCRDTASWNTVICGLMQGGYL 507

Query: 259  NAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSAL 318
            N ALELLYEMVENE EFN  TSSIALSVVSSLLI+ELGRQVHGRIVRCGLHNDGFVKS+L
Sbjct: 508  NEALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELGRQVHGRIVRCGLHNDGFVKSSL 567

Query: 319  INMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDA 378
            INMYIKCGNLEKASVIYS++PSGFATKQ  NIVCSDTMTEIVSRSSMV GYVRNGKYEDA
Sbjct: 568  INMYIKCGNLEKASVIYSQMPSGFATKQDFNIVCSDTMTEIVSRSSMVSGYVRNGKYEDA 627

Query: 379  FKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDM 438
            FKTFVSMVRERVLMDKFTIA+VVSACSNAGV ELGRQ+H +I KT EQLDAHL SSLIDM
Sbjct: 628  FKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQIHAYIQKTGEQLDAHLTSSLIDM 687

Query: 439  YAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVT 498
            YAKGGSLDCA +IF+Q T YLNVVIWTSMI GCALHG GKEAIRLFE+MRYEG+IPNEVT
Sbjct: 688  YAKGGSLDCARQIFEQ-TTYLNVVIWTSMITGCALHGQGKEAIRLFEKMRYEGMIPNEVT 747

Query: 499  FIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYEN 558
            FIGVL ACSHAGLLEDG LYFNMMKDVYAIKPKVEH+TCMVDLYGRAG LNEVK+FIYEN
Sbjct: 748  FIGVLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGHLNEVKKFIYEN 807

Query: 559  DLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEAS 618
            DLSHL+AVWKAFLSSC+LY+D+EMG WVSE+LFRL+P DEG YVLLSNMCS +QKWEEA 
Sbjct: 808  DLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLDEGPYVLLSNMCSSNQKWEEAF 867

Query: 619  RARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLH 678
            R RRSMQH GI+KTPGQSWIH+KN+VHSFVAGD+SHPQHAQIYEYLDKLIGRLKEIGYL 
Sbjct: 868  RTRRSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLF 927

Query: 679  DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTS 738
            DVKLVMQDVEEEQGEVLLGWHSEKLA+AYG+ISLGS+IPIRIMKNLRICTDCHNFMKLTS
Sbjct: 928  DVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIPIRIMKNLRICTDCHNFMKLTS 987

Query: 739  QLLGREIIVRDIYRFHHFNSG 751
            QLL REIIVRDI+RFHHFNSG
Sbjct: 988  QLLCREIIVRDIHRFHHFNSG 1006

BLAST of CsGy2G007450 vs. NCBI nr
Match: XP_023545881.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1221.8 bits (3160), Expect = 0.0e+00
Identity = 612/741 (82.59%), Positives = 664/741 (89.61%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSEHHLFKSF-----SYHTSNHFSSNTLHAKMVKIGSI 78
            L+  ISQGT+        + F+LS +     F     +YH+SN    NTLHA MVK GSI
Sbjct: 268  LASKISQGTVATVGGLLFLGFSLSSYFFPPLFLVALENYHSSNDSLPNTLHAMMVKNGSI 327

Query: 79   FVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLV 138
            F S KF+L+SYVKSEKLNDA+K+FDEMP+RDVLTWT LISGF+RVN S MALQLFREMLV
Sbjct: 328  FESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLV 387

Query: 139  EGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVY 198
            EGV PN FTLSTVLKLCS+VGDV+MGKGIHGWILR+GV LDVVLENSMLDLYAKFDEF Y
Sbjct: 388  EGVCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGVNLDVVLENSMLDLYAKFDEFDY 447

Query: 199  ARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYL 258
             +KL+DSMREKST T NI+LGV+VRS DVNKSL LFRNLPCR+ ASWNT+ICGLMQGGYL
Sbjct: 448  VKKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRNLPCRDTASWNTVICGLMQGGYL 507

Query: 259  NAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSAL 318
            N ALELLYEMVENE EFN  TSSIALSVVSSLLI ELGRQVHGRIVRCG HNDGFVKS+L
Sbjct: 508  NEALELLYEMVENEPEFNKVTSSIALSVVSSLLIFELGRQVHGRIVRCGFHNDGFVKSSL 567

Query: 319  INMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDA 378
            INMYIKCGNLEKASVIYS++PSGFA KQ  +IVCSDTMTEIVSRSSMV GYVRNGKYEDA
Sbjct: 568  INMYIKCGNLEKASVIYSQMPSGFAKKQDFDIVCSDTMTEIVSRSSMVSGYVRNGKYEDA 627

Query: 379  FKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDM 438
            FKTFVSMVRE+VLMDKFTIA+VVSACSNAGV ELGRQ+H +I KT EQLDAHL SSLIDM
Sbjct: 628  FKTFVSMVREQVLMDKFTIASVVSACSNAGVFELGRQIHAYIQKTGEQLDAHLTSSLIDM 687

Query: 439  YAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVT 498
            YAKGGSLDCA +IF+Q T YLNVVIWTSMI GCALHG GKEAIRLFE+MRYEG+IPNEVT
Sbjct: 688  YAKGGSLDCARQIFEQ-TTYLNVVIWTSMITGCALHGQGKEAIRLFEKMRYEGMIPNEVT 747

Query: 499  FIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYEN 558
            F+GVL ACSHAGLLEDG LYFNMMKDVYAIKPKVEH+TCMVDLYGRAG LNEVK+FIYEN
Sbjct: 748  FLGVLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGRLNEVKKFIYEN 807

Query: 559  DLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEAS 618
            DLSHL+AVWKAFLSSC+LY+D+EMG WVSE+LFRL+P DEG YVLLSNMCS +QKWEEA 
Sbjct: 808  DLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLDEGPYVLLSNMCSSNQKWEEAF 867

Query: 619  RARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLH 678
            R RRSMQH GI+KTPGQSWIH+KN+VHSFVAGD+SHPQHAQIYEYLDKLIGRLKEIGYL 
Sbjct: 868  RTRRSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLF 927

Query: 679  DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTS 738
            DVKLVMQDVEEEQGEVLLGWHSEKLA+AYG+ISLGS+IPIRIMKNLRICTDCHNFMKLTS
Sbjct: 928  DVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIPIRIMKNLRICTDCHNFMKLTS 987

Query: 739  QLLGREIIVRDIYRFHHFNSG 751
            QLL REII RDI+RFH FNSG
Sbjct: 988  QLLCREIIARDIHRFHRFNSG 1006

BLAST of CsGy2G007450 vs. TAIR10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 498.4 bits (1282), Expect = 8.6e-141
Identity = 255/698 (36.53%), Positives = 423/698 (60.60%), Query Frame = 0

Query: 59  LHAKMVKIGSI-FVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSS 118
           LHA+ ++  S+   S   V++ Y   + L++A  LF  + +  VL W ++I  F+  +  
Sbjct: 27  LHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLF 86

Query: 119 GMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSM 178
             AL  F EM   G  P+H    +VLK C+ + D+R G+ +HG+I+R G+  D+   N++
Sbjct: 87  SKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNAL 146

Query: 179 LDLYAK---FDEFVYARKLYDSMREKSTDT--DNIILGVYVRSCDVNKSLHLFRNLPCRN 238
           +++YAK       +    ++D M ++++++  +++     +    ++    +F  +P ++
Sbjct: 147 MNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKD 206

Query: 239 AASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHG 298
             S+NTII G  Q G    AL ++ EM   + + ++FT S  L + S  + +  G+++HG
Sbjct: 207 VVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHG 266

Query: 299 RIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVS 358
            ++R G+ +D ++ S+L++MY K   +E +  ++SRL             C D     +S
Sbjct: 267 YVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRL------------YCRDG----IS 326

Query: 359 RSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIH 418
            +S+V GYV+NG+Y +A + F  MV  +V       ++V+ AC++   L LG+Q+HG++ 
Sbjct: 327 WNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVL 386

Query: 419 KTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAI 478
           +     +  +AS+L+DMY+K G++  A +IFD+M N L+ V WT++I+G ALHGHG EA+
Sbjct: 387 RGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM-NVLDEVSWTAIIMGHALHGHGHEAV 446

Query: 479 RLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDL 538
            LFE+M+ +G+ PN+V F+ VLTACSH GL+++   YFN M  VY +  ++EHY  + DL
Sbjct: 447 SLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADL 506

Query: 539 YGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSY 598
            GRAG L E   FI +  +    +VW   LSSC ++++LE+ + V+EK+F +  ++ G+Y
Sbjct: 507 LGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAY 566

Query: 599 VLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIY 658
           VL+ NM + + +W+E ++ R  M+  G+ K P  SWI +KN+ H FV+GD+SHP   +I 
Sbjct: 567 VLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKIN 626

Query: 659 EYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIM 718
           E+L  ++ ++++ GY+ D   V+ DV+EE    LL  HSE+LAVA+GII+      IR+ 
Sbjct: 627 EFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVT 686

Query: 719 KNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           KN+RICTDCH  +K  S++  REIIVRD  RFHHFN G
Sbjct: 687 KNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRG 707

BLAST of CsGy2G007450 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 449.9 bits (1156), Expect = 3.5e-126
Identity = 236/701 (33.67%), Positives = 389/701 (55.49%), Query Frame = 0

Query: 54   FSSNTLHAKMVKIGSIFVSGKFV----LTSYVKSEKLNDAQKLFDEMPNRDVLTWTALIS 113
            F    LHA   K+G  F S   +    L  Y K   +  A   F E    +V+ W  ++ 
Sbjct: 406  FRGQQLHAYTTKLG--FASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLV 465

Query: 114  GFSRVNSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKL 173
             +  ++    + ++FR+M +E + PN +T  ++LK C ++GD+ +G+ IH  I++   +L
Sbjct: 466  AYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQL 525

Query: 174  DVVLENSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLP 233
            +  + + ++D+YAK  +               T  D +I                     
Sbjct: 526  NAYVCSVLIDMYAKLGKL-------------DTAWDILI------------------RFA 585

Query: 234  CRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQ 293
             ++  SW T+I G  Q  + + AL    +M++     +    + A+S  + L  L+ G+Q
Sbjct: 586  GKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQ 645

Query: 294  VHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTE 353
            +H +    G  +D   ++AL+ +Y +CG +E++ + + +      T+ G NI        
Sbjct: 646  IHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQ------TEAGDNI-------- 705

Query: 354  IVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHG 413
              + +++V G+ ++G  E+A + FV M RE +  + FT  + V A S    ++ G+QVH 
Sbjct: 706  --AWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHA 765

Query: 414  FIHKTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGK 473
             I KT    +  + ++LI MYAK GS+  A + F +++   N V W ++I   + HG G 
Sbjct: 766  VITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVST-KNEVSWNAIINAYSKHGFGS 825

Query: 474  EAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCM 533
            EA+  F+QM +  + PN VT +GVL+ACSH GL++ G  YF  M   Y + PK EHY C+
Sbjct: 826  EALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCV 885

Query: 534  VDLYGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDE 593
            VD+  RAGLL+  KEFI E  +   + VW+  LS+C +++++E+G++ +  L  L+P+D 
Sbjct: 886  VDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDS 945

Query: 594  GSYVLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHA 653
             +YVLLSN+ + S+KW+     R+ M+  G+ K PGQSWI +KN +HSF  GDQ+HP   
Sbjct: 946  ATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLAD 1005

Query: 654  QIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPI 713
            +I+EY   L  R  EIGY+ D   ++ +++ EQ + ++  HSEKLA+++G++SL + +PI
Sbjct: 1006 EIHEYFQDLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPI 1056

Query: 714  RIMKNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
             +MKNLR+C DCH ++K  S++  REIIVRD YRFHHF  G
Sbjct: 1066 NVMKNLRVCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGG 1056

BLAST of CsGy2G007450 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 447.2 bits (1149), Expect = 2.3e-125
Identity = 244/696 (35.06%), Positives = 390/696 (56.03%), Query Frame = 0

Query: 59  LHAKMVKIGSIFVSGKFVLTS----YVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRV 118
           +H  +VK G  F    F +T     Y K  ++N+A+K+FD MP RD+++W  +++G+S+ 
Sbjct: 157 IHGLLVKSG--FSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQN 216

Query: 119 NSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLE 178
             + MAL++ + M  E + P+  T+ +VL   S +  + +GK IHG+ +R+G    V + 
Sbjct: 217 GMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS 276

Query: 179 NSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAA 238
            +++D+YAK      AR+L+D M E                               RN  
Sbjct: 277 TALVDMYAKCGSLETARQLFDGMLE-------------------------------RNVV 336

Query: 239 SWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRI 298
           SWN++I   +Q      A+ +  +M++   +  + +   AL   + L  LE GR +H   
Sbjct: 337 SWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLS 396

Query: 299 VRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRS 358
           V  GL  +  V ++LI+MY KC  ++ A+ ++ +L S                  +VS +
Sbjct: 397 VELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQS----------------RTLVSWN 456

Query: 359 SMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKT 418
           +M+ G+ +NG+  DA   F  M    V  D FT  +V++A +   +    + +HG + ++
Sbjct: 457 AMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRS 516

Query: 419 VEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRL 478
               +  + ++L+DMYAK G++  A  IFD M+   +V  W +MI G   HG GK A+ L
Sbjct: 517 CLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSE-RHVTTWNAMIDGYGTHGFGKAALEL 576

Query: 479 FEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYG 538
           FE+M+   I PN VTF+ V++ACSH+GL+E G   F MMK+ Y+I+  ++HY  MVDL G
Sbjct: 577 FEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLG 636

Query: 539 RAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVL 598
           RAG LNE  +FI +  +     V+ A L +C++++++   +  +E+LF L P D G +VL
Sbjct: 637 RAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVL 696

Query: 599 LSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEY 658
           L+N+   +  WE+  + R SM   G+ KTPG S + +KN+VHSF +G  +HP   +IY +
Sbjct: 697 LANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAF 756

Query: 659 LDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKN 718
           L+KLI  +KE GY+ D  LV+  VE +  E LL  HSEKLA+++G+++  +   I + KN
Sbjct: 757 LEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKN 801

Query: 719 LRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           LR+C DCHN  K  S + GREI+VRD+ RFHHF +G
Sbjct: 817 LRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNG 801

BLAST of CsGy2G007450 vs. TAIR10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 442.2 bits (1136), Expect = 7.3e-124
Identity = 246/688 (35.76%), Positives = 380/688 (55.23%), Query Frame = 0

Query: 73  GKFVLTSYVKSE-KLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEG 132
           G  ++  +VK E    +A K+FD+M   +V+TWT +I+   ++     A++ F +M++ G
Sbjct: 205 GCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG 264

Query: 133 VSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFD---EFV 192
              + FTLS+V   C+++ ++ +GK +H W +R+G+  DV  E S++D+YAK        
Sbjct: 265 FESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDV--ECSLVDMYAKCSADGSVD 324

Query: 193 YARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGY 252
             RK++D M + S                                 SW  +I G M+   
Sbjct: 325 DCRKVFDRMEDHS-------------------------------VMSWTALITGYMKNCN 384

Query: 253 L-NAALELLYEMV-ENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVK 312
           L   A+ L  EM+ +   E N+FT S A     +L    +G+QV G+  + GL ++  V 
Sbjct: 385 LATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVA 444

Query: 313 SALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKY 372
           +++I+M++K   +E A   +  L                +   +VS ++ + G  RN  +
Sbjct: 445 NSVISMFVKSDRMEDAQRAFESL----------------SEKNLVSYNTFLDGTCRNLNF 504

Query: 373 EDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSL 432
           E AFK    +    + +  FT A+++S  +N G +  G Q+H  + K     +  + ++L
Sbjct: 505 EQAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNAL 564

Query: 433 IDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPN 492
           I MY+K GS+D A R+F+ M N  NV+ WTSMI G A HG     +  F QM  EG+ PN
Sbjct: 565 ISMYSKCGSIDTASRVFNFMEN-RNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPN 624

Query: 493 EVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFI 552
           EVT++ +L+ACSH GL+ +G  +FN M + + IKPK+EHY CMVDL  RAGLL +  EFI
Sbjct: 625 EVTYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFI 684

Query: 553 YENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWE 612
                     VW+ FL +CR++ + E+GK  + K+  L P +  +Y+ LSN+ + + KWE
Sbjct: 685 NTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWE 744

Query: 613 EASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIG 672
           E++  RR M+   + K  G SWI + +++H F  GD +HP   QIY+ LD+LI  +K  G
Sbjct: 745 ESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCG 804

Query: 673 YLHDVKLVMQDVEEEQGEV----LLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCH 732
           Y+ D  LV+  +EEE  E     LL  HSEK+AVA+G+IS   + P+R+ KNLR+C DCH
Sbjct: 805 YVPDTDLVLHKLEEENDEAEKERLLYQHSEKIAVAFGLISTSKSRPVRVFKNLRVCGDCH 842

Query: 733 NFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           N MK  S + GREI++RD+ RFHHF  G
Sbjct: 865 NAMKYISTVSGREIVLRDLNRFHHFKDG 842

BLAST of CsGy2G007450 vs. TAIR10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 429.9 bits (1104), Expect = 3.8e-120
Identity = 242/682 (35.48%), Positives = 383/682 (56.16%), Query Frame = 0

Query: 73  GKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGV 132
           G  ++  Y K   + DA+++F  M ++D ++W ++I+G  +      A++ ++ M    +
Sbjct: 352 GNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSMRRHDI 411

Query: 133 SPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARK 192
            P  FTL + L  C+ +   ++G+ IHG  L+ G+ L+V + N+++ LYA+       RK
Sbjct: 412 LPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLNECRK 471

Query: 193 LYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 252
           ++ SM E    + N I+G   RS          R+LP         ++C      +LNA 
Sbjct: 472 IFSSMPEHDQVSWNSIIGALARS---------ERSLP-------EAVVC------FLNAQ 531

Query: 253 LELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 312
                       + N  T S  LS VSSL   ELG+Q+HG  ++  + ++   ++ALI  
Sbjct: 532 --------RAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIAC 591

Query: 313 YIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 372
           Y KCG ++    I+SR+                   + V+ +SM+ GY+ N     A   
Sbjct: 592 YGKCGEMDGCEKIFSRMAE---------------RRDNVTWNSMISGYIHNELLAKALDL 651

Query: 373 FVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDMYAK 432
              M++    +D F  A V+SA ++   LE G +VH    +   + D  + S+L+DMY+K
Sbjct: 652 VWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSK 711

Query: 433 GGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEG-IIPNEVTFI 492
            G LD A R F+ M    N   W SMI G A HG G+EA++LFE M+ +G   P+ VTF+
Sbjct: 712 CGRLDYALRFFNTMP-VRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFV 771

Query: 493 GVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDL 552
           GVL+ACSHAGLLE+G  +F  M D Y + P++EH++CM D+ GRAG L+++++FI +  +
Sbjct: 772 GVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPM 831

Query: 553 SHLSAVWKAFLSSC--RLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEAS 612
                +W+  L +C     R  E+GK  +E LF+L+P++  +YVLL NM +   +WE+  
Sbjct: 832 KPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLV 891

Query: 613 RARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLH 672
           +AR+ M+ + + K  G SW+ +K+ VH FVAGD+SHP    IY+ L +L  ++++ GY+ 
Sbjct: 892 KARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDAGYVP 951

Query: 673 DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIIS-LGSAIPIRIMKNLRICTDCHNFMKLT 732
                + D+E+E  E +L +HSEKLAVA+ + +   S +PIRIMKNLR+C DCH+  K  
Sbjct: 952 QTGFALYDLEQENKEEILSYHSEKLAVAFVLAAQRSSTLPIRIMKNLRVCGDCHSAFKYI 987

Query: 733 SQLLGREIIVRDIYRFHHFNSG 751
           S++ GR+II+RD  RFHHF  G
Sbjct: 1012 SKIEGRQIILRDSNRFHHFQDG 987

BLAST of CsGy2G007450 vs. Swiss-Prot
Match: sp|Q9LW63|PP251_ARATH (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 498.4 bits (1282), Expect = 1.6e-139
Identity = 255/698 (36.53%), Positives = 423/698 (60.60%), Query Frame = 0

Query: 59  LHAKMVKIGSI-FVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSS 118
           LHA+ ++  S+   S   V++ Y   + L++A  LF  + +  VL W ++I  F+  +  
Sbjct: 27  LHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLF 86

Query: 119 GMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSM 178
             AL  F EM   G  P+H    +VLK C+ + D+R G+ +HG+I+R G+  D+   N++
Sbjct: 87  SKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNAL 146

Query: 179 LDLYAK---FDEFVYARKLYDSMREKSTDT--DNIILGVYVRSCDVNKSLHLFRNLPCRN 238
           +++YAK       +    ++D M ++++++  +++     +    ++    +F  +P ++
Sbjct: 147 MNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKD 206

Query: 239 AASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHG 298
             S+NTII G  Q G    AL ++ EM   + + ++FT S  L + S  + +  G+++HG
Sbjct: 207 VVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHG 266

Query: 299 RIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVS 358
            ++R G+ +D ++ S+L++MY K   +E +  ++SRL             C D     +S
Sbjct: 267 YVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRL------------YCRDG----IS 326

Query: 359 RSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIH 418
            +S+V GYV+NG+Y +A + F  MV  +V       ++V+ AC++   L LG+Q+HG++ 
Sbjct: 327 WNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVL 386

Query: 419 KTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAI 478
           +     +  +AS+L+DMY+K G++  A +IFD+M N L+ V WT++I+G ALHGHG EA+
Sbjct: 387 RGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM-NVLDEVSWTAIIMGHALHGHGHEAV 446

Query: 479 RLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDL 538
            LFE+M+ +G+ PN+V F+ VLTACSH GL+++   YFN M  VY +  ++EHY  + DL
Sbjct: 447 SLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADL 506

Query: 539 YGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSY 598
            GRAG L E   FI +  +    +VW   LSSC ++++LE+ + V+EK+F +  ++ G+Y
Sbjct: 507 LGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAY 566

Query: 599 VLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIY 658
           VL+ NM + + +W+E ++ R  M+  G+ K P  SWI +KN+ H FV+GD+SHP   +I 
Sbjct: 567 VLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKIN 626

Query: 659 EYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIM 718
           E+L  ++ ++++ GY+ D   V+ DV+EE    LL  HSE+LAVA+GII+      IR+ 
Sbjct: 627 EFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVT 686

Query: 719 KNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           KN+RICTDCH  +K  S++  REIIVRD  RFHHFN G
Sbjct: 687 KNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRG 707

BLAST of CsGy2G007450 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 449.9 bits (1156), Expect = 6.3e-125
Identity = 236/701 (33.67%), Positives = 389/701 (55.49%), Query Frame = 0

Query: 54   FSSNTLHAKMVKIGSIFVSGKFV----LTSYVKSEKLNDAQKLFDEMPNRDVLTWTALIS 113
            F    LHA   K+G  F S   +    L  Y K   +  A   F E    +V+ W  ++ 
Sbjct: 406  FRGQQLHAYTTKLG--FASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLV 465

Query: 114  GFSRVNSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKL 173
             +  ++    + ++FR+M +E + PN +T  ++LK C ++GD+ +G+ IH  I++   +L
Sbjct: 466  AYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQL 525

Query: 174  DVVLENSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLP 233
            +  + + ++D+YAK  +               T  D +I                     
Sbjct: 526  NAYVCSVLIDMYAKLGKL-------------DTAWDILI------------------RFA 585

Query: 234  CRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQ 293
             ++  SW T+I G  Q  + + AL    +M++     +    + A+S  + L  L+ G+Q
Sbjct: 586  GKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQ 645

Query: 294  VHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTE 353
            +H +    G  +D   ++AL+ +Y +CG +E++ + + +      T+ G NI        
Sbjct: 646  IHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQ------TEAGDNI-------- 705

Query: 354  IVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHG 413
              + +++V G+ ++G  E+A + FV M RE +  + FT  + V A S    ++ G+QVH 
Sbjct: 706  --AWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHA 765

Query: 414  FIHKTVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGK 473
             I KT    +  + ++LI MYAK GS+  A + F +++   N V W ++I   + HG G 
Sbjct: 766  VITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVST-KNEVSWNAIINAYSKHGFGS 825

Query: 474  EAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCM 533
            EA+  F+QM +  + PN VT +GVL+ACSH GL++ G  YF  M   Y + PK EHY C+
Sbjct: 826  EALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCV 885

Query: 534  VDLYGRAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDE 593
            VD+  RAGLL+  KEFI E  +   + VW+  LS+C +++++E+G++ +  L  L+P+D 
Sbjct: 886  VDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDS 945

Query: 594  GSYVLLSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHA 653
             +YVLLSN+ + S+KW+     R+ M+  G+ K PGQSWI +KN +HSF  GDQ+HP   
Sbjct: 946  ATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLAD 1005

Query: 654  QIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPI 713
            +I+EY   L  R  EIGY+ D   ++ +++ EQ + ++  HSEKLA+++G++SL + +PI
Sbjct: 1006 EIHEYFQDLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPI 1056

Query: 714  RIMKNLRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
             +MKNLR+C DCH ++K  S++  REIIVRD YRFHHF  G
Sbjct: 1066 NVMKNLRVCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGG 1056

BLAST of CsGy2G007450 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 447.2 bits (1149), Expect = 4.1e-124
Identity = 244/696 (35.06%), Positives = 390/696 (56.03%), Query Frame = 0

Query: 59  LHAKMVKIGSIFVSGKFVLTS----YVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRV 118
           +H  +VK G  F    F +T     Y K  ++N+A+K+FD MP RD+++W  +++G+S+ 
Sbjct: 157 IHGLLVKSG--FSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQN 216

Query: 119 NSSGMALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLE 178
             + MAL++ + M  E + P+  T+ +VL   S +  + +GK IHG+ +R+G    V + 
Sbjct: 217 GMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS 276

Query: 179 NSMLDLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAA 238
            +++D+YAK      AR+L+D M E                               RN  
Sbjct: 277 TALVDMYAKCGSLETARQLFDGMLE-------------------------------RNVV 336

Query: 239 SWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRI 298
           SWN++I   +Q      A+ +  +M++   +  + +   AL   + L  LE GR +H   
Sbjct: 337 SWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLS 396

Query: 299 VRCGLHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRS 358
           V  GL  +  V ++LI+MY KC  ++ A+ ++ +L S                  +VS +
Sbjct: 397 VELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQS----------------RTLVSWN 456

Query: 359 SMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKT 418
           +M+ G+ +NG+  DA   F  M    V  D FT  +V++A +   +    + +HG + ++
Sbjct: 457 AMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRS 516

Query: 419 VEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRL 478
               +  + ++L+DMYAK G++  A  IFD M+   +V  W +MI G   HG GK A+ L
Sbjct: 517 CLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSE-RHVTTWNAMIDGYGTHGFGKAALEL 576

Query: 479 FEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYG 538
           FE+M+   I PN VTF+ V++ACSH+GL+E G   F MMK+ Y+I+  ++HY  MVDL G
Sbjct: 577 FEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLG 636

Query: 539 RAGLLNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVL 598
           RAG LNE  +FI +  +     V+ A L +C++++++   +  +E+LF L P D G +VL
Sbjct: 637 RAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVL 696

Query: 599 LSNMCSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEY 658
           L+N+   +  WE+  + R SM   G+ KTPG S + +KN+VHSF +G  +HP   +IY +
Sbjct: 697 LANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAF 756

Query: 659 LDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKN 718
           L+KLI  +KE GY+ D  LV+  VE +  E LL  HSEKLA+++G+++  +   I + KN
Sbjct: 757 LEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKN 801

Query: 719 LRICTDCHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           LR+C DCHN  K  S + GREI+VRD+ RFHHF +G
Sbjct: 817 LRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNG 801

BLAST of CsGy2G007450 vs. Swiss-Prot
Match: sp|Q5G1T1|PP272_ARATH (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 442.2 bits (1136), Expect = 1.3e-122
Identity = 246/688 (35.76%), Positives = 380/688 (55.23%), Query Frame = 0

Query: 73  GKFVLTSYVKSE-KLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEG 132
           G  ++  +VK E    +A K+FD+M   +V+TWT +I+   ++     A++ F +M++ G
Sbjct: 205 GCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG 264

Query: 133 VSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFD---EFV 192
              + FTLS+V   C+++ ++ +GK +H W +R+G+  DV  E S++D+YAK        
Sbjct: 265 FESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDV--ECSLVDMYAKCSADGSVD 324

Query: 193 YARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGY 252
             RK++D M + S                                 SW  +I G M+   
Sbjct: 325 DCRKVFDRMEDHS-------------------------------VMSWTALITGYMKNCN 384

Query: 253 L-NAALELLYEMV-ENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVK 312
           L   A+ L  EM+ +   E N+FT S A     +L    +G+QV G+  + GL ++  V 
Sbjct: 385 LATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVA 444

Query: 313 SALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKY 372
           +++I+M++K   +E A   +  L                +   +VS ++ + G  RN  +
Sbjct: 445 NSVISMFVKSDRMEDAQRAFESL----------------SEKNLVSYNTFLDGTCRNLNF 504

Query: 373 EDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSL 432
           E AFK    +    + +  FT A+++S  +N G +  G Q+H  + K     +  + ++L
Sbjct: 505 EQAFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNAL 564

Query: 433 IDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPN 492
           I MY+K GS+D A R+F+ M N  NV+ WTSMI G A HG     +  F QM  EG+ PN
Sbjct: 565 ISMYSKCGSIDTASRVFNFMEN-RNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPN 624

Query: 493 EVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFI 552
           EVT++ +L+ACSH GL+ +G  +FN M + + IKPK+EHY CMVDL  RAGLL +  EFI
Sbjct: 625 EVTYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFI 684

Query: 553 YENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWE 612
                     VW+ FL +CR++ + E+GK  + K+  L P +  +Y+ LSN+ + + KWE
Sbjct: 685 NTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWE 744

Query: 613 EASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIG 672
           E++  RR M+   + K  G SWI + +++H F  GD +HP   QIY+ LD+LI  +K  G
Sbjct: 745 ESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCG 804

Query: 673 YLHDVKLVMQDVEEEQGEV----LLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCH 732
           Y+ D  LV+  +EEE  E     LL  HSEK+AVA+G+IS   + P+R+ KNLR+C DCH
Sbjct: 805 YVPDTDLVLHKLEEENDEAEKERLLYQHSEKIAVAFGLISTSKSRPVRVFKNLRVCGDCH 842

Query: 733 NFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           N MK  S + GREI++RD+ RFHHF  G
Sbjct: 865 NAMKYISTVSGREIVLRDLNRFHHFKDG 842

BLAST of CsGy2G007450 vs. Swiss-Prot
Match: sp|Q9FIB2|PP373_ARATH (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 429.9 bits (1104), Expect = 6.8e-119
Identity = 242/682 (35.48%), Positives = 383/682 (56.16%), Query Frame = 0

Query: 73  GKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGV 132
           G  ++  Y K   + DA+++F  M ++D ++W ++I+G  +      A++ ++ M    +
Sbjct: 352 GNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSMRRHDI 411

Query: 133 SPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARK 192
            P  FTL + L  C+ +   ++G+ IHG  L+ G+ L+V + N+++ LYA+       RK
Sbjct: 412 LPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLNECRK 471

Query: 193 LYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 252
           ++ SM E    + N I+G   RS          R+LP         ++C      +LNA 
Sbjct: 472 IFSSMPEHDQVSWNSIIGALARS---------ERSLP-------EAVVC------FLNAQ 531

Query: 253 LELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 312
                       + N  T S  LS VSSL   ELG+Q+HG  ++  + ++   ++ALI  
Sbjct: 532 --------RAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIAC 591

Query: 313 YIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 372
           Y KCG ++    I+SR+                   + V+ +SM+ GY+ N     A   
Sbjct: 592 YGKCGEMDGCEKIFSRMAE---------------RRDNVTWNSMISGYIHNELLAKALDL 651

Query: 373 FVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDMYAK 432
              M++    +D F  A V+SA ++   LE G +VH    +   + D  + S+L+DMY+K
Sbjct: 652 VWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSK 711

Query: 433 GGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEG-IIPNEVTFI 492
            G LD A R F+ M    N   W SMI G A HG G+EA++LFE M+ +G   P+ VTF+
Sbjct: 712 CGRLDYALRFFNTMP-VRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFV 771

Query: 493 GVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDL 552
           GVL+ACSHAGLLE+G  +F  M D Y + P++EH++CM D+ GRAG L+++++FI +  +
Sbjct: 772 GVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPM 831

Query: 553 SHLSAVWKAFLSSC--RLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEAS 612
                +W+  L +C     R  E+GK  +E LF+L+P++  +YVLL NM +   +WE+  
Sbjct: 832 KPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLV 891

Query: 613 RARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLH 672
           +AR+ M+ + + K  G SW+ +K+ VH FVAGD+SHP    IY+ L +L  ++++ GY+ 
Sbjct: 892 KARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDAGYVP 951

Query: 673 DVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIIS-LGSAIPIRIMKNLRICTDCHNFMKLT 732
                + D+E+E  E +L +HSEKLAVA+ + +   S +PIRIMKNLR+C DCH+  K  
Sbjct: 952 QTGFALYDLEQENKEEILSYHSEKLAVAFVLAAQRSSTLPIRIMKNLRVCGDCHSAFKYI 987

Query: 733 SQLLGREIIVRDIYRFHHFNSG 751
           S++ GR+II+RD  RFHHF  G
Sbjct: 1012 SKIEGRQIILRDSNRFHHFQDG 987

BLAST of CsGy2G007450 vs. TrEMBL
Match: tr|A0A0A0LKI4|A0A0A0LKI4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074230 PE=4 SV=1)

HSP 1 Score: 1513.4 bits (3917), Expect = 0.0e+00
Identity = 749/750 (99.87%), Positives = 749/750 (99.87%), Query Frame = 0

Query: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60
           MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH
Sbjct: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60

Query: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120
           AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA
Sbjct: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120

Query: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180
           LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL
Sbjct: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180

Query: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240
           YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII
Sbjct: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240

Query: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300
           CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH
Sbjct: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300

Query: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGY 360
           NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQ SNIVCSDTMTEIVSRSSMVYGY
Sbjct: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQSSNIVCSDTMTEIVSRSSMVYGY 360

Query: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420
           VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA
Sbjct: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480
           HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480

Query: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540
           EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN
Sbjct: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600

Query: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660
           GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG
Sbjct: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660

Query: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720
           RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD
Sbjct: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           CHNFMKLTSQLLGREIIVRDIYRFHHFNSG
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSG 750

BLAST of CsGy2G007450 vs. TrEMBL
Match: tr|A0A1S3B4E3|A0A1S3B4E3_CUCME (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucumis melo OX=3656 GN=LOC103485889 PE=4 SV=1)

HSP 1 Score: 1376.3 bits (3561), Expect = 0.0e+00
Identity = 688/738 (93.22%), Positives = 704/738 (95.39%), Query Frame = 0

Query: 19   LSHSISQGTMTH----KIISFNLSEHHL--FKSFSYHTSNHFSSNTLHAKMVKIGSIFVS 78
            L+  ISQGT+        + F+LS +       F YHTSN FSSNTLHAKMVKIGSI  S
Sbjct: 267  LASKISQGTVATVGGLLFLGFSLSSYFFPPLXKFCYHTSNSFSSNTLHAKMVKIGSIIES 326

Query: 79   GKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQLFREMLVEGV 138
            GKFVLTSYVKS+KLNDAQKLFDEMPNRDVLTWTA+ISGFSRVN SGMALQLFREMLVEGV
Sbjct: 327  GKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWTAIISGFSRVNCSGMALQLFREMLVEGV 386

Query: 139  SPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYAKFDEFVYARK 198
             PNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENS+LDLYAKFDEFVYARK
Sbjct: 387  CPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSLLDLYAKFDEFVYARK 446

Query: 199  LYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 258
            LYDSM EKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA
Sbjct: 447  LYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICGLMQGGYLNAA 506

Query: 259  LELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 318
            LELLYEMVENESEFNNFTSSIALSV SSLLILELGRQVHGRIVRCGLHNDGFVKSALINM
Sbjct: 507  LELLYEMVENESEFNNFTSSIALSVASSLLILELGRQVHGRIVRCGLHNDGFVKSALINM 566

Query: 319  YIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 378
            YIKCGNLEKASVIYS+LPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT
Sbjct: 567  YIKCGNLEKASVIYSQLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVRNGKYEDAFKT 626

Query: 379  FVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHLASSLIDMYAK 438
            FVSMVRERVLMDKFTIA+VVSAC+NAGVLELGRQVHGFI K+VEQLDAHLASSLIDMYAK
Sbjct: 627  FVSMVRERVLMDKFTIASVVSACANAGVLELGRQVHGFIQKSVEQLDAHLASSLIDMYAK 686

Query: 439  GGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEGIIPNEVTFIG 498
            GGSLDCAHRIFDQMT YLNVVIWTSMIVGC+LHGHGKEAIRLFEQMRYEGIIPNEVTFIG
Sbjct: 687  GGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLHGHGKEAIRLFEQMRYEGIIPNEVTFIG 746

Query: 499  VLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS 558
            VLTACSHAGLLEDG LYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS
Sbjct: 747  VLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEVKEFIYENDLS 806

Query: 559  HLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGSQKWEEASRAR 618
            HLS VWKAFLSSC LYRDLEMGKWVSEKLFRL+PQDEGSYVLLSNMCSGSQKW+EASRAR
Sbjct: 807  HLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLEPQDEGSYVLLSNMCSGSQKWQEASRAR 866

Query: 619  RSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRLKEIGYLHDVK 678
             SMQHSGINKTPGQSWIHLKNQVHSFVAGD+SHPQHAQIYEYLDKLIGRLKEIGYLHDVK
Sbjct: 867  SSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLHDVK 926

Query: 679  LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL 738
            LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL
Sbjct: 927  LVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTDCHNFMKLTSQLL 986

Query: 739  GREIIVRDIYRFHHFNSG 751
            GREIIVRDI RFHHFNSG
Sbjct: 987  GREIIVRDICRFHHFNSG 1004

BLAST of CsGy2G007450 vs. TrEMBL
Match: tr|A0A1U8JQG3|A0A1U8JQG3_GOSHI (pentatricopeptide repeat-containing protein At4g21065-like OS=Gossypium hirsutum OX=3635 GN=LOC107907754 PE=4 SV=1)

HSP 1 Score: 801.6 bits (2069), Expect = 1.7e-228
Identity = 411/750 (54.80%), Positives = 547/750 (72.93%), Query Frame = 0

Query: 6   LSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLHAKMVK 65
           +S+ C    A LK+   +    +  K+  F+   HH  K +S+H+    ++++LH K +K
Sbjct: 2   VSTYCLTPKANLKIPTIVV--FLLKKLSPFHF--HHC-KFYSHHSLPFPTTSSLHVKAIK 61

Query: 66  IGS---IFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMALQ 125
            GS   + V+ + +L  YVKS+ LN A KLFDEMP+RDV TWT L+S F+RV S+G AL+
Sbjct: 62  DGSFQNLDVANR-LLNVYVKSKNLNHACKLFDEMPHRDVRTWTILVSTFARVGSNGAALE 121

Query: 126 LFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDLYA 185
           LF+ M  EG+ PN FTLS+VLK CS + ++++GKG+HGWILRNGV  DV+LEN++   Y 
Sbjct: 122 LFKNMQNEGIKPNQFTLSSVLKCCSSLSELKIGKGVHGWILRNGVVFDVILENALFYFYV 181

Query: 186 KFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTIICG 245
           K ++F  A+ L++SM EK++ T NI++G Y+ + +V+K++ LFR    +  + WNTII G
Sbjct: 182 KCEDFGSAKWLFESMEEKNSVTWNIMIGAYLDTGNVDKAVDLFRRQGLKGVSIWNTIIKG 241

Query: 246 LMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLHND 305
           LM+ G+   AL+LLYEMV++ + FN  T SIAL +VS L  LELG+Q+HGR++  G+H D
Sbjct: 242 LMRNGFERIALKLLYEMVKDGTLFNEVTFSIALVLVSLLKDLELGKQIHGRVLLSGIHVD 301

Query: 306 GFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVYGYVR 365
           GF++++LI+MY KCG ++ A  ++ ++ + F   + S       + E++S SS++ G V 
Sbjct: 302 GFLRNSLIDMYCKCGEMKMALKVFEKMDTYFGRNENS-------IEEVISWSSIISGLVL 361

Query: 366 NGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDAHL 425
           NG++EDAFKTF SMVR+ + +D F+I ++VSAC++ GVLELGRQVHG + K   +LDAHL
Sbjct: 362 NGEFEDAFKTFTSMVRKDIDIDAFSITSIVSACASFGVLELGRQVHGLVQKMGHKLDAHL 421

Query: 426 ASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRYEG 485
            SSLIDMYAK G+LD A RIF Q TN +NVV+WTSM+   ALHG G+EA+ LFE     G
Sbjct: 422 GSSLIDMYAKCGNLDDAKRIFKQ-TNDMNVVLWTSMVYSYALHGRGREAVHLFEFSMSHG 481

Query: 486 IIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLNEV 545
           ++PNEVTFIGVLTACSHAGL+E+G  YF +MK+VY IKP VEH+T MVDLYGRAG   E+
Sbjct: 482 LLPNEVTFIGVLTACSHAGLVEEGCRYFRLMKEVYGIKPGVEHFTRMVDLYGRAGQFKEI 541

Query: 546 KEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCSGS 605
           K+FI EN + HL AVW++FLSSCRL+RD+EM +WVSE L R K  D G YVLLSN+ +  
Sbjct: 542 KKFIDENGIHHLRAVWRSFLSSCRLHRDIEMAEWVSENLLRCKTLDAGPYVLLSNIYAIK 601

Query: 606 QKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIGRL 665
           Q+WEE +  RR MQ  G+ K P QSWI ++NQVH+F+  D+SHPQ  +I  YL KLIGRL
Sbjct: 602 QRWEEVATVRRLMQSRGVKKQPCQSWIQIRNQVHAFIMDDRSHPQKNEICAYLYKLIGRL 661

Query: 666 KEIGYLHDVKLVMQDVEEEQ--GEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 725
           +EIGY  D KLVMQDVE++Q  GE+LLG+HSEKLA AYGIIS  S  PIRIMKNLRIC D
Sbjct: 662 REIGYSSDAKLVMQDVEKKQGEGEMLLGFHSEKLATAYGIISTASQTPIRIMKNLRICDD 721

Query: 726 CHNFMKLTSQLLGREIIVRDIYRFHHFNSG 751
           CHNFM+ TSQLL +EIIVRDI+RFHHF  G
Sbjct: 722 CHNFMRYTSQLLDKEIIVRDIHRFHHFKHG 737

BLAST of CsGy2G007450 vs. TrEMBL
Match: tr|A0A2I4EX39|A0A2I4EX39_9ROSI (pentatricopeptide repeat-containing protein At4g21065-like isoform X1 OS=Juglans regia OX=51240 GN=LOC108993491 PE=4 SV=1)

HSP 1 Score: 793.5 bits (2048), Expect = 4.7e-226
Identity = 419/755 (55.50%), Positives = 520/755 (68.87%), Query Frame = 0
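
The percentages in the HSP summary line are the raw counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for this hit (the helper name `percent` is illustrative and not part of any BLAST tool):

```python
def percent(part: int, total: int) -> float:
    """Return part/total as a percentage rounded to two decimals."""
    return round(100.0 * part / total, 2)

# Counts taken from the HSP summary line for this hit.
aligned_length = 755
identical = 419   # positions where query and subject residues match
positive = 520    # identical positions plus conservative substitutions

identity_pct = percent(identical, aligned_length)
positives_pct = percent(positive, aligned_length)

print(f"Identity  = {identical}/{aligned_length} ({identity_pct:.2f}%)")
print(f"Positives = {positive}/{aligned_length} ({positives_pct:.2f}%)")
```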

Query: 8   SSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSN-HFSSNTL-----HA 67
           S CF    F K +H ++   +T     F   E   F+ F YH+S  H+ +  L     HA
Sbjct: 7   SLCFIPTRFSKRAHFLTLDCLTTNTTKF---EFRRFQFFWYHSSPVHYLTADLAVGIHHA 66

Query: 68  KMVKIGS---IFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSG 127
           K VK GS   + V+  F+  S VKS+ LN AQKLF+E+ +RDV TWT L+SG++R+  S 
Sbjct: 67  KAVKSGSFQNLNVANHFLNLS-VKSQDLNYAQKLFNEISDRDVRTWTILMSGYARIGVST 126

Query: 128 MALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSML 187
           M L L REM +EGV PN F+LS V K CS   D+RMG GIHGWILR+G+ LDVVL+NS+L
Sbjct: 127 MVLDLLREMRIEGVHPNQFSLSIVFKCCSSTSDLRMGMGIHGWILRSGINLDVVLDNSIL 186

Query: 188 DLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNT 247
           D Y K   F YA+ L+  M E+ T                                    
Sbjct: 187 DFYVKCGSFDYAKTLFSVMLERDTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 246

Query: 248 IICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCG 307
              GLM+ GY   ALELLYEM  N    N  T SIAL +VSSL I+ELGRQ+HGR ++ G
Sbjct: 247 XXXGLMRNGYERTALELLYEMARNGHVLNKVTFSIALVLVSSLSIMELGRQIHGRALKFG 306

Query: 308 LHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVY 367
              DGFV+++LI+MY KCG +EKASVI+ + P      Q S I   +   EIVS SSMV 
Sbjct: 307 FEKDGFVRTSLIDMYCKCGKMEKASVIFRKRPLDCVGTQNSKITFDEPTAEIVSWSSMVS 366

Query: 368 GYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQL 427
           GYV+NG+++DA K F+SMV E+V +D+FTI ++VSAC++AG+LELGRQ+H +I K    L
Sbjct: 367 GYVQNGEFKDALKMFISMVHEQVEVDRFTITSIVSACADAGILELGRQIHAYIQKVGYTL 426

Query: 428 DAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQM 487
           D +L+SS + MYAK GSLD A  IF Q T + NVV+WT+MI G ALHG G++AI LFEQM
Sbjct: 427 DVYLSSSSVYMYAKCGSLDDAWTIFKQ-TIHPNVVLWTAMITGSALHGQGRKAIWLFEQM 486

Query: 488 RYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGL 547
            +EGI PNEVTF+GVLTACSHAGLLE+G  YF +MK+VY IKP VEH+TCMVDLYGRAG 
Sbjct: 487 VHEGIKPNEVTFVGVLTACSHAGLLEEGSKYFRLMKEVYGIKPGVEHFTCMVDLYGRAGH 546

Query: 548 LNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNM 607
           L++V+ FI+EN +S LS VW++FLSSC LY+++EMGKWVSEKL +L+P + G Y+LLSNM
Sbjct: 547 LDKVEAFIHENAISDLSGVWRSFLSSCTLYKNIEMGKWVSEKLLQLEPLEAGPYILLSNM 606

Query: 608 CSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKL 667
           CS + +W EA+  R  M+   + K PGQSWI LKNQVH+FV GD+SHPQ   IY  L +L
Sbjct: 607 CSTNDRWVEAANVRSLMRRRMVKKVPGQSWIQLKNQVHTFVMGDRSHPQDTDIYSCLAEL 666

Query: 668 IGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRIC 727
            GRLKEIGY  DV LVMQDVEEEQ EVLLG+HSEKLA+ YGIIS  S +PIRIMKNLRIC
Sbjct: 667 TGRLKEIGYSSDVNLVMQDVEEEQREVLLGYHSEKLALVYGIISTTSEMPIRIMKNLRIC 726

Query: 728 TDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGTST 754
            DCH+F+K TSQLL REIIVRD++RFHHF SG  T
Sbjct: 727 IDCHSFVKYTSQLLHREIIVRDMHRFHHFKSGHCT 756
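
In each alignment block, the middle line marks identical residues with the residue letter and conservative substitutions with `+`. A short sketch, assuming that convention, that tallies both for the final block of the alignment above (the function name `tally_block` is hypothetical):

```python
def tally_block(query: str, midline: str, sbjct: str) -> tuple[int, int]:
    """Count identities and positives in one BLAST alignment block."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment lines must have equal length")
    # Identity: same residue in both sequences (gaps never count).
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # Positive: any non-blank midline character (a letter or '+').
    positives = sum(1 for m in midline if m != " ")
    return identities, positives

# Final block of the alignment, copied from the record.
query   = "TDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGTST"
midline = " DCH+F+K TSQLL REIIVRD++RFHHF SG  T"
sbjct   = "IDCHSFVKYTSQLLHREIIVRDMHRFHHFKSGHCT"

identities, positives = tally_block(query, midline, sbjct)
print(f"{identities} identities, {positives} positives over {len(query)} columns")
```

Summing these per-block tallies over every block of an HSP should reproduce the counts in its summary line, apart from any masked (X) regions.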

BLAST of CsGy2G007450 vs. TrEMBL
Match: tr|A0A2I4EX42|A0A2I4EX42_9ROSI (pentatricopeptide repeat-containing protein At4g21065-like isoform X2 OS=Juglans regia OX=51240 GN=LOC108993491 PE=4 SV=1)

HSP 1 Score: 793.5 bits (2048), Expect = 4.7e-226
Identity = 419/755 (55.50%), Positives = 520/755 (68.87%), Query Frame = 0

Query: 8   SSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSN-HFSSNTL-----HA 67
           S CF    F K +H ++   +T     F   E   F+ F YH+S  H+ +  L     HA
Sbjct: 7   SLCFIPTRFSKRAHFLTLDCLTTNTTKF---EFRRFQFFWYHSSPVHYLTADLAVGIHHA 66

Query: 68  KMVKIGS---IFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSG 127
           K VK GS   + V+  F+  S VKS+ LN AQKLF+E+ +RDV TWT L+SG++R+  S 
Sbjct: 67  KAVKSGSFQNLNVANHFLNLS-VKSQDLNYAQKLFNEISDRDVRTWTILMSGYARIGVST 126

Query: 128 MALQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSML 187
           M L L REM +EGV PN F+LS V K CS   D+RMG GIHGWILR+G+ LDVVL+NS+L
Sbjct: 127 MVLDLLREMRIEGVHPNQFSLSIVFKCCSSTSDLRMGMGIHGWILRSGINLDVVLDNSIL 186

Query: 188 DLYAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNT 247
           D Y K   F YA+ L+  M E+ T                                    
Sbjct: 187 DFYVKCGSFDYAKTLFSVMLERDTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 246

Query: 248 IICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCG 307
              GLM+ GY   ALELLYEM  N    N  T SIAL +VSSL I+ELGRQ+HGR ++ G
Sbjct: 247 XXXGLMRNGYERTALELLYEMARNGHVLNKVTFSIALVLVSSLSIMELGRQIHGRALKFG 306

Query: 308 LHNDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQGSNIVCSDTMTEIVSRSSMVY 367
              DGFV+++LI+MY KCG +EKASVI+ + P      Q S I   +   EIVS SSMV 
Sbjct: 307 FEKDGFVRTSLIDMYCKCGKMEKASVIFRKRPLDCVGTQNSKITFDEPTAEIVSWSSMVS 366

Query: 368 GYVRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQL 427
           GYV+NG+++DA K F+SMV E+V +D+FTI ++VSAC++AG+LELGRQ+H +I K    L
Sbjct: 367 GYVQNGEFKDALKMFISMVHEQVEVDRFTITSIVSACADAGILELGRQIHAYIQKVGYTL 426

Query: 428 DAHLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQM 487
           D +L+SS + MYAK GSLD A  IF Q T + NVV+WT+MI G ALHG G++AI LFEQM
Sbjct: 427 DVYLSSSSVYMYAKCGSLDDAWTIFKQ-TIHPNVVLWTAMITGSALHGQGRKAIWLFEQM 486

Query: 488 RYEGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGL 547
            +EGI PNEVTF+GVLTACSHAGLLE+G  YF +MK+VY IKP VEH+TCMVDLYGRAG 
Sbjct: 487 VHEGIKPNEVTFVGVLTACSHAGLLEEGSKYFRLMKEVYGIKPGVEHFTCMVDLYGRAGH 546

Query: 548 LNEVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNM 607
           L++V+ FI+EN +S LS VW++FLSSC LY+++EMGKWVSEKL +L+P + G Y+LLSNM
Sbjct: 547 LDKVEAFIHENAISDLSGVWRSFLSSCTLYKNIEMGKWVSEKLLQLEPLEAGPYILLSNM 606

Query: 608 CSGSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKL 667
           CS + +W EA+  R  M+   + K PGQSWI LKNQVH+FV GD+SHPQ   IY  L +L
Sbjct: 607 CSTNDRWVEAANVRSLMRRRMVKKVPGQSWIQLKNQVHTFVMGDRSHPQDTDIYSCLAEL 666

Query: 668 IGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRIC 727
            GRLKEIGY  DV LVMQDVEEEQ EVLLG+HSEKLA+ YGIIS  S +PIRIMKNLRIC
Sbjct: 667 TGRLKEIGYSSDVNLVMQDVEEEQREVLLGYHSEKLALVYGIISTTSEMPIRIMKNLRIC 726

Query: 728 TDCHNFMKLTSQLLGREIIVRDIYRFHHFNSGTST 754
            DCH+F+K TSQLL REIIVRD++RFHHF SG  T
Sbjct: 727 IDCHSFVKYTSQLLHREIIVRDMHRFHHFKSGHCT 756

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
KGN61262.1	0.0e+00	99.87	hypothetical protein Csa_2G074230 [Cucumis sativus]
XP_011648996.1	0.0e+00	96.77	PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro...
XP_008441858.1	0.0e+00	93.22	PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro...
XP_022929759.1	0.0e+00	83.54	putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita moscha...
XP_023545881.1	0.0e+00	82.59	putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita pepo s...
Match Name	E-value	Identity	Description
AT3G23330.1	8.6e-141	36.53	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G13650.1	3.5e-126	33.67	Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1	2.3e-125	35.06	Pentatricopeptide repeat (PPR) superfamily protein
AT3G49170.1	7.3e-124	35.76	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G09950.1	3.8e-120	35.48	Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name	E-value	Identity	Description
sp|Q9LW63|PP251_ARATH	1.6e-139	36.53	Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
sp|Q9SVP7|PP307_ARATH	6.3e-125	33.67	Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
sp|Q3E6Q1|PPR32_ARATH	4.1e-124	35.06	Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
sp|Q5G1T1|PP272_ARATH	1.3e-122	35.76	Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop...
sp|Q9FIB2|PP373_ARATH	6.8e-119	35.48	Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
Match Name	E-value	Identity	Description
tr|A0A0A0LKI4|A0A0A0LKI4_CUCSA	0.0e+00	99.87	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074230 PE=4 SV=1
tr|A0A1S3B4E3|A0A1S3B4E3_CUCME	0.0e+00	93.22	LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23...
tr|A0A1U8JQG3|A0A1U8JQG3_GOSHI	1.7e-228	54.80	pentatricopeptide repeat-containing protein At4g21065-like OS=Gossypium hirsutum...
tr|A0A2I4EX39|A0A2I4EX39_9ROSI	4.7e-226	55.50	pentatricopeptide repeat-containing protein At4g21065-like isoform X1 OS=Juglans...
tr|A0A2I4EX42|A0A2I4EX42_9ROSI	4.7e-226	55.50	pentatricopeptide repeat-containing protein At4g21065-like isoform X2 OS=Juglans...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0008270	zinc ion binding
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR032867	DYW_dom
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0016021	integral component of membrane
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0008270	zinc ion binding
molecular_function	GO:0005515	protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsGy2G007450.1	CsGy2G007450.1	mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 278..423; e-value: 8.2E-17; score: 63.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 70..206; e-value: 2.5E-26; score: 94.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 424..649; e-value: 3.1E-34; score: 120.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 100..147; e-value: 6.6E-11; score: 42.1
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 451..498; e-value: 4.4E-10; score: 39.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 175..200; e-value: 0.063; score: 13.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 235..262; e-value: 1.8E-4; score: 21.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 525..547; e-value: 0.16; score: 12.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 424..447; e-value: 0.047; score: 13.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 352..380; e-value: 1.9E-4; score: 21.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 307..328; e-value: 0.42; score: 10.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 235..264; e-value: 7.1E-5; score: 20.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 354..384; e-value: 6.3E-5; score: 20.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 103..135; e-value: 8.7E-7; score: 26.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 453..487; e-value: 2.2E-5; score: 22.3
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 451..485; score: 11.586
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 419..449; score: 7.947
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 135..169; score: 7.607
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 486..516; score: 6.654
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 170..204; score: 7.815
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 384..414; score: 5.097
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 522..556; score: 6.073
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 232..266; score: 9.131
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 588..622; score: 7.618
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 349..383; score: 8.89
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 69..99; score: 5.897
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 302..332; score: 6.697
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 100..134; score: 11.652
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 624..747; e-value: 2.1E-39; score: 134.2
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 36..682
None	No IPR available	PANTHER	PTHR24015:SF727	SUBFAMILY NOT NAMED	coord: 36..682
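
The PROSITE PS51375 hits are listed out of positional order. As a rough sketch (variable names are illustrative), the coordinates from the InterPro table above can be sorted along the protein to get the motif lengths and to see which motifs sit directly back to back:

```python
# PROSITE PS51375 (PPR) coordinates copied from the InterPro table above.
ppr_coords = [
    (451, 485), (419, 449), (135, 169), (486, 516), (170, 204),
    (384, 414), (522, 556), (232, 266), (588, 622), (349, 383),
    (69, 99), (302, 332), (100, 134),
]

# Order the motifs from N-terminus to C-terminus.
motifs = sorted(ppr_coords)

# Coordinates are inclusive, so length is end - start + 1.
lengths = [end - start + 1 for start, end in motifs]

# Count neighbouring motifs where one starts right after the previous ends.
tandem_pairs = sum(
    1 for (_, prev_end), (next_start, _) in zip(motifs, motifs[1:])
    if next_start == prev_end + 1
)

for (start, end), length in zip(motifs, lengths):
    print(f"{start:>3}..{end:<3}  length {length}")
print(f"{len(motifs)} motifs, {tandem_pairs} directly adjacent pairs")
```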