CsGy2G000170 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy2G000170
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat-containing protein
LocationGy14Chr2: 121456 .. 124886 (-)
RNA-Seq ExpressionCsGy2G000170
SyntenyCsGy2G000170
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAATACAATGGCCTTGATCTTCAGCTCCTCGGCCCGAAGAAAAACCACACAAGACTTTTGAGAAAACATTAAACAAAGTAAAACGGAAAACCCTAAACGCAGGAACCCAGGGAGTCGTTGATTCATCCGATTCGCAGCCGTCTTCACTCTTTCAAGGTCCAGATATTTTGATGGAGTTCTCGTAGTATCATCTTGCGGGCCGCACATTGATTGGAACTCTGAGTTCCTTACTCGGTTGCGAGTTTTCCTTCTTTTGCGAATAGGCAACTTTCATCAACCAGCATTTTGTTGTCAACAGTGTTATGGCATCAGAGCTTCGTTCGATCTCAGGTTTATTGTCTTGAATAAAATGATGGATTTCTTATTATTTATGCGAGTTAGACTGGACATTTAATTCATTTCATGAATATTAGAGAGCCTGACACACAATCTGAGAGACCTACTGGGTGTAAACAGGAAAGAATTGTGATTGAGAATATATGGGAAGATAGAGCAGAGCCCAACGATGGCGTGGGGCTGCAGATGACCCTCTTATTCTTCTCTAGCCGTGCCTACAGGCTTCGAACCTCCAACTTCTCCATTGCTCAGTTTTATAGCCTTGCAGATTCAGTATCACGCGCTCGGCCTTTTTGTGATAGAGAGATAATACGACACTCGGAGGCTTGGCTAGTGAAGGTTGTTTGCACTCTCTTTTTTCGATCGCATTCCCTGAATGCTTGTTTTGGTTATCTAAGTAGAAACTTGAACCCTTCAATCGCTTTTGAGGTTATTAAGAGGTTTAGTGATCCCTTATTGGGTTTGAAGTTTTTTGAGTTTAGTAGAACACACCTGAGCATTAACCATACCTTTAATACCTATGATTTGCTCATGAGGAATCTCTGTAAAGTGGGTCTCAACGATTCTGCAAAAATTGTTTTTGATTGCATGAGGAGTGATGGGATTTTGCCGGATAGTTCCATTTTAGAACTCTTGGTGTCTTCATACGCTCGAATGGGGAAGTTAGATTCTGCCAAAAATTTTCTTAATGAAGTTCACTGTTATGGTATTAAAGTTAGTCCTTTTGTGTATAATAACTTGTTGAATATGTTGGTCAAGCAGAACCTAGTAGATGAAGCTGTCTTACTATTCAGGGAGCACTTGGAACCATATTTCGTTCCAGATGTTTACAGCTTCAATATTTTAATTAGAGGATTGTGCAGAATAGGAGAAATTGATAAGGCTTTTGAGTTTTTCCAGAATATGGGAAATTTTGGTTGCTTTCCTGATATTGTTTCGTATAATACGCTTATAAATGGGTTTTGTAGGGTCAATGAGATTAGTAAAGGGCATGATTTGCTAAAAGAAGATATGTTAATAAAAGGGGTTTCCCCAGATGTTATAACCTATACATCGATTATATCAGGCTATTGCAAATTGGGTGATATGAAGGCAGCTTCTGAGCTTTTTGACGAGATGGTTAGTTCTGGAATCAAACCCAACGACTTCACTTTCAATGTCCTCATTGATGGTTTTGGCAAGGTTGGAAACATGAGATCTGCTATGGTCATGTATGAGAAAATGCTTCTTCTTGGCTGTCTACCAGACGTGGTTACATTCACTTCCCTGATTGATGGCTATTGCCGGGAAGGTGAAGTGAATCAAGGTTTGAAGCTCTGGGAGGAGATGAAAGTAAGAAATCTGTCTCCAAATGTGTATACCTATGCTGTGCTCATCAATGCTCTCTGTAAGGAAAATAGAATACGGGAGGCAAGAAATTTTTTGAGGCACTTGAAATCGAGTGAGGTTGTTCCCAAACCATTTATATATAATCCCGTTATTGATGGGTTTTGCAAGGCTGGAAAGGTGGATGAGGCAAACTTCATAGTGGCTGAGATGCAGGAGAAGAAATGCAGGCCAGATAAAATAACATTTACCATTCTTATTATTGGCAACTGTATGAAAGGGAGGATGGTGGAGGCAATTAGCACTTTCTATAAGATGATTGAGATCAACTGTGTTCCAGATGAAATTACTATTAATTCTTTGATATCTTGCCTTCTGAAGGCTGGAATGCCTAATGAAGCCTCTCAAATCAAACAAGCTGCTTTACAGAAGCTCAACTTGGGTTTGTCATCGTTAGGAAGCCCGCTTACAAGAAAATCTTCACGTGTGCCAGCTGCTGTTTGGTGAGGTCTATTGTGTAGTGGACTAAGAATTTTCGGCCTGCAACTATTTGGAAGTGTTCGCCTGTTCTGTGCCAGATTACAAAATAAGCAAGTGAAGGATGAGACAAGTTGACTACTAATAGACGGTTAGGACATTTACACTGATTCTAGTTTCTCAAGGCTGAATAGTATCTATGGTAGTAAAGTACTACCTACTGGCCATATCAACTTCTATGCTGTTGTGAGGTGCCTGTTATTCGGAATAGAAATGTTTTACCGAAAATGTTTCTTTTTTTTTTTTTGGGGGGGGGGGGGGGATCATAGGTTAAAATTATCAAGATTCTTCTTTCATTTCATTACTGTTGCCATCCGAATTTTTAATGTTTCTGTTGCACTGCAGGCCAGTACTAGTTCGAGTTCTCACGTAAAGGGTTTAAAGCTCGCTGTCTGAATCCCAGCCACAATGATTACCATACTATTTCATGCTGGTCATAGGCCAAACCTTTGGAGTTCTGATTGAAAACCAATCGAAGAGGGAGTTGTGCTGAAGCCGGTTTGGATACGTTAGTGAGGGGAAGACCATTAGGCCATATTCTTCGTGGGATTGATGGCATTTGTTCTCGGACGTTATTATTGTTACCATTGGCGGTGATTCAGTGAAGAAACGGAGACGAGCTTAGATCAAGCGATTAGTTTTGGTCATGTACAGGCCTTTTCAAATTAACATAAAATAAATGCAAGGATTATCATATTTCTTGAGTTGACTTGATTCAGCTTGAGTGGAATACCCTGCTGTTATGTAAACTTTAATTATAGCATAACAATTCTGTGGTCGTGTCTGGTGCAGTAATTATGAATTCGTGTCTGAAACTGATGGATTGATTCAACATGTTATGTGGATTTGTGGAATTTCTAGACCTGAACGATCTAGAATTAACATGGATGAAAGTAAAAGTTGAAGCCAATGTGGTAGCGAATACTTATGCTTAATATAATAAACAGTCAAGAGGCTGGGTTTCTTTTGTTGACCATGGTGGCTTCTGTCGCCGTTTGAGACCGTCTTCAGTAAGTGGCCAACTCAAACGCGTTAACCTTGTCTATCAAATTACAAATTAATAATCTATTAATCGTTAATTTAAAAGCTACAAAAATGGGATAAGTCAAGTAAGTTTGGCGAGCAGCACTGTTTAACTTAATACATTGTATACTAAATTATTTTTTTCTGAAGTCTTTGTTGTATTAATTTTACACCTTTCTTTACATTAAAATAAAAATTTGTGTCAGAC

mRNA sequence

CAAATACAATGGCCTTGATCTTCAGCTCCTCGGCCCGAAGAAAAACCACACAAGACTTTTGAGAAAACATTAAACAAAGTAAAACGGAAAACCCTAAACGCAGGAACCCAGGGAGTCGTTGATTCATCCGATTCGCAGCCGTCTTCACTCTTTCAAGGTCCAGATATTTTGATGGAGTTCTCGTAGTATCATCTTGCGGGCCGCACATTGATTGGAACTCTGAGTTCCTTACTCGGTTGCGAGTTTTCCTTCTTTTGCGAATAGGCAACTTTCATCAACCAGCATTTTGTTGTCAACAGTGTTATGGCATCAGAGCTTCGTTCGATCTCAGGTTTATTGTCTTGAATAAAATGATGGATTTCTTATTATTTATGCGAGTTAGACTGGACATTTAATTCATTTCATGAATATTAGAGAGCCTGACACACAATCTGAGAGACCTACTGGGTGTAAACAGGAAAGAATTGTGATTGAGAATATATGGGAAGATAGAGCAGAGCCCAACGATGGCGTGGGGCTGCAGATGACCCTCTTATTCTTCTCTAGCCGTGCCTACAGGCTTCGAACCTCCAACTTCTCCATTGCTCAGTTTTATAGCCTTGCAGATTCAGTATCACGCGCTCGGCCTTTTTGTGATAGAGAGATAATACGACACTCGGAGGCTTGGCTAGTGAAGGTTGTTTGCACTCTCTTTTTTCGATCGCATTCCCTGAATGCTTGTTTTGGTTATCTAAGTAGAAACTTGAACCCTTCAATCGCTTTTGAGGTTATTAAGAGGTTTAGTGATCCCTTATTGGGTTTGAAGTTTTTTGAGTTTAGTAGAACACACCTGAGCATTAACCATACCTTTAATACCTATGATTTGCTCATGAGGAATCTCTGTAAAGTGGGTCTCAACGATTCTGCAAAAATTGTTTTTGATTGCATGAGGAGTGATGGGATTTTGCCGGATAGTTCCATTTTAGAACTCTTGGTGTCTTCATACGCTCGAATGGGGAAGTTAGATTCTGCCAAAAATTTTCTTAATGAAGTTCACTGTTATGGTATTAAAGTTAGTCCTTTTGTGTATAATAACTTGTTGAATATGTTGGTCAAGCAGAACCTAGTAGATGAAGCTGTCTTACTATTCAGGGAGCACTTGGAACCATATTTCGTTCCAGATGTTTACAGCTTCAATATTTTAATTAGAGGATTGTGCAGAATAGGAGAAATTGATAAGGCTTTTGAGTTTTTCCAGAATATGGGAAATTTTGGTTGCTTTCCTGATATTGTTTCGTATAATACGCTTATAAATGGGTTTTGTAGGGTCAATGAGATTAGTAAAGGGCATGATTTGCTAAAAGAAGATATGTTAATAAAAGGGGTTTCCCCAGATGTTATAACCTATACATCGATTATATCAGGCTATTGCAAATTGGGTGATATGAAGGCAGCTTCTGAGCTTTTTGACGAGATGGTTAGTTCTGGAATCAAACCCAACGACTTCACTTTCAATGTCCTCATTGATGGTTTTGGCAAGGTTGGAAACATGAGATCTGCTATGGTCATGTATGAGAAAATGCTTCTTCTTGGCTGTCTACCAGACGTGGTTACATTCACTTCCCTGATTGATGGCTATTGCCGGGAAGGTGAAGTGAATCAAGGTTTGAAGCTCTGGGAGGAGATGAAAGTAAGAAATCTGTCTCCAAATGTGTATACCTATGCTGTGCTCATCAATGCTCTCTGTAAGGAAAATAGAATACGGGAGGCAAGAAATTTTTTGAGGCACTTGAAATCGAGTGAGGTTGTTCCCAAACCATTTATATATAATCCCGTTATTGATGGGTTTTGCAAGGCTGGAAAGGTGGATGAGGCAAACTTCATAGTGGCTGAGATGCAGGAGAAGAAATGCAGGCCAGATAAAATAACATTTACCATTCTTATTATTGGCAACTGTATGAAAGGGAGGATGGTGGAGGCAATTAGCACTTTCTATAAGATGATTGAGATCAACTGTGTTCCAGATGAAATTACTATTAATTCTTTGATATCTTGCCTTCTGAAGGCTGGAATGCCTAATGAAGCCTCTCAAATCAAACAAGCTGCTTTACAGAAGCTCAACTTGGGTTTGTCATCGTTAGGAAGCCCGCTTACAAGAAAATCTTCACGTGTGCCAGCTGCTGTTTGGTGAGGTCTATTGTGTAGTGGACTAAGAATTTTCGGCCTGCAACTATTTGGAAGTGTTCGCCTGTTCTGTGCCAGATTACAAAATAAGCAAGTGAAGGATGAGACAAGTTGACTACTAATAGACGGCCAGTACTAGTTCGAGTTCTCACGTAAAGGGTTTAAAGCTCGCTGTCTGAATCCCAGCCACAATGATTACCATACTATTTCATGCTGGTCATAGGCCAAACCTTTGGAGTTCTGATTGAAAACCAATCGAAGAGGGAGTTGTGCTGAAGCCGGTTTGGATACGTTAGTGAGGGGAAGACCATTAGGCCATATTCTTCGTGGGATTGATGGCATTTGTTCTCGGACGTTATTATTGTTACCATTGGCGGTGATTCAGTGAAGAAACGGAGACGAGCTTAGATCAAGCGATTAGTTTTGGTCATGTACAGGCCTTTTCAAATTAACATAAAATAAATGCAAGGATTATCATATTTCTTGAGTTGACTTGATTCAGCTTGAGTGGAATACCCTGCTGTTATGTAAACTTTAATTATAGCATAACAATTCTGTGGTCGTGTCTGGTGCAGTAATTATGAATTCGTGTCTGAAACTGATGGATTGATTCAACATGTTATGTGGATTTGTGGAATTTCTAGACCTGAACGATCTAGAATTAACATGGATGAAAGTAAAAGTTGAAGCCAATGTGGTAGCGAATACTTATGCTTAATATAATAAACAGTCAAGAGGCTGGGTTTCTTTTGTTGACCATGGTGGCTTCTGTCGCCGTTTGAGACCGTCTTCAGTAAGTGGCCAACTCAAACGCGTTAACCTTGTCTATCAAATTACAAATTAATAATCTATTAATCGTTAATTTAAAAGCTACAAAAATGGGATAAGTCAAGTAAGTTTGGCGAGCAGCACTGTTTAACTTAATACATTGTATACTAAATTATTTTTTTCTGAAGTCTTTGTTGTATTAATTTTACACCTTTCTTTACATTAAAATAAAAATTTGTGTCAGAC

Coding sequence (CDS)

ATGAATATTAGAGAGCCTGACACACAATCTGAGAGACCTACTGGGTGTAAACAGGAAAGAATTGTGATTGAGAATATATGGGAAGATAGAGCAGAGCCCAACGATGGCGTGGGGCTGCAGATGACCCTCTTATTCTTCTCTAGCCGTGCCTACAGGCTTCGAACCTCCAACTTCTCCATTGCTCAGTTTTATAGCCTTGCAGATTCAGTATCACGCGCTCGGCCTTTTTGTGATAGAGAGATAATACGACACTCGGAGGCTTGGCTAGTGAAGGTTGTTTGCACTCTCTTTTTTCGATCGCATTCCCTGAATGCTTGTTTTGGTTATCTAAGTAGAAACTTGAACCCTTCAATCGCTTTTGAGGTTATTAAGAGGTTTAGTGATCCCTTATTGGGTTTGAAGTTTTTTGAGTTTAGTAGAACACACCTGAGCATTAACCATACCTTTAATACCTATGATTTGCTCATGAGGAATCTCTGTAAAGTGGGTCTCAACGATTCTGCAAAAATTGTTTTTGATTGCATGAGGAGTGATGGGATTTTGCCGGATAGTTCCATTTTAGAACTCTTGGTGTCTTCATACGCTCGAATGGGGAAGTTAGATTCTGCCAAAAATTTTCTTAATGAAGTTCACTGTTATGGTATTAAAGTTAGTCCTTTTGTGTATAATAACTTGTTGAATATGTTGGTCAAGCAGAACCTAGTAGATGAAGCTGTCTTACTATTCAGGGAGCACTTGGAACCATATTTCGTTCCAGATGTTTACAGCTTCAATATTTTAATTAGAGGATTGTGCAGAATAGGAGAAATTGATAAGGCTTTTGAGTTTTTCCAGAATATGGGAAATTTTGGTTGCTTTCCTGATATTGTTTCGTATAATACGCTTATAAATGGGTTTTGTAGGGTCAATGAGATTAGTAAAGGGCATGATTTGCTAAAAGAAGATATGTTAATAAAAGGGGTTTCCCCAGATGTTATAACCTATACATCGATTATATCAGGCTATTGCAAATTGGGTGATATGAAGGCAGCTTCTGAGCTTTTTGACGAGATGGTTAGTTCTGGAATCAAACCCAACGACTTCACTTTCAATGTCCTCATTGATGGTTTTGGCAAGGTTGGAAACATGAGATCTGCTATGGTCATGTATGAGAAAATGCTTCTTCTTGGCTGTCTACCAGACGTGGTTACATTCACTTCCCTGATTGATGGCTATTGCCGGGAAGGTGAAGTGAATCAAGGTTTGAAGCTCTGGGAGGAGATGAAAGTAAGAAATCTGTCTCCAAATGTGTATACCTATGCTGTGCTCATCAATGCTCTCTGTAAGGAAAATAGAATACGGGAGGCAAGAAATTTTTTGAGGCACTTGAAATCGAGTGAGGTTGTTCCCAAACCATTTATATATAATCCCGTTATTGATGGGTTTTGCAAGGCTGGAAAGGTGGATGAGGCAAACTTCATAGTGGCTGAGATGCAGGAGAAGAAATGCAGGCCAGATAAAATAACATTTACCATTCTTATTATTGGCAACTGTATGAAAGGGAGGATGGTGGAGGCAATTAGCACTTTCTATAAGATGATTGAGATCAACTGTGTTCCAGATGAAATTACTATTAATTCTTTGATATCTTGCCTTCTGAAGGCTGGAATGCCTAATGAAGCCTCTCAAATCAAACAAGCTGCTTTACAGAAGCTCAACTTGGGTTTGTCATCGTTAGGAAGCCCGCTTACAAGAAAATCTTCACGTGTGCCAGCTGCTGTTTGGTGA

Protein sequence

MNIREPDTQSERPTGCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSIAQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAFEVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAVW*
Homology
BLAST of CsGy2G000170 vs. ExPASy Swiss-Prot
Match: Q9ZUE9 (Pentatricopeptide repeat-containing protein At2g06000 OS=Arabidopsis thaliana OX=3702 GN=At2g06000 PE=2 SV=1)

HSP 1 Score: 560.1 bits (1442), Expect = 3.0e-158
Identity = 283/532 (53.20%), Postives = 378/532 (71.05%), Query Frame = 0

Query: 59  SIAQFYSLADSVSRARPFCD--REIIRHSEAWLVKVVCTLF-FRSHSLNACFGYLSRNLN 118
           +IA F++ +   ++ARP  +  RE+I   EAWLVK+V TLF +R    + CF YLS+NLN
Sbjct: 9   AIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFCYLSKNLN 68

Query: 119 PSIAFEVIKRF-SDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDC 178
           P I+FEV+K+  ++P +G +F+EFSR  L+I H+F TY+LL R+LCK GL+D A  +F+C
Sbjct: 69  PFISFEVVKKLDNNPHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHDLAGQMFEC 128

Query: 179 MRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNL 238
           M+SDG+ P++ +L  LVSS+A  GKL  A   L  +  + ++    V N+LLN LVK + 
Sbjct: 129 MKSDGVSPNNRLLGFLVSSFAEKGKLHFATALL--LQSFEVEGCCMVVNSLLNTLVKLDR 188

Query: 239 VDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNT 298
           V++A+ LF EHL      D  +FNILIRGLC +G+ +KA E    M  FGC PDIV+YNT
Sbjct: 189 VEDAMKLFDEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSGFGCEPDIVTYNT 248

Query: 299 LINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSS 358
           LI GFC+ NE++K  ++ K+       SPDV+TYTS+ISGYCK G M+ AS L D+M+  
Sbjct: 249 LIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMREASSLLDDMLRL 308

Query: 359 GIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQG 418
           GI P + TFNVL+DG+ K G M +A  +  KM+  GC PDVVTFTSLIDGYCR G+V+QG
Sbjct: 309 GIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLIDGYCRVGQVSQG 368

Query: 419 LKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDG 478
            +LWEEM  R + PN +TY++LINALC ENR+ +AR  L  L S +++P+PF+YNPVIDG
Sbjct: 369 FRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDIIPQPFMYNPVIDG 428

Query: 479 FCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPD 538
           FCKAGKV+EAN IV EM++KKC+PDKITFTILIIG+CMKGRM EA+S F+KM+ I C PD
Sbjct: 429 FCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSIFHKMVAIGCSPD 488

Query: 539 EITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAA 587
           +IT++SL+SCLLKAGM  EA  + Q A +    G S+   PL  K++    A
Sbjct: 489 KITVSSLLSCLLKAGMAKEAYHLNQIARK----GQSNNVVPLETKTANATLA 534

BLAST of CsGy2G000170 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 269.2 bits (687), Expect = 1.1e-70
Identity = 152/474 (32.07%), Postives = 255/474 (53.80%), Query Frame = 0

Query: 109 YLSRNLNPSIAFE-VIKRFSDPLLGLKFFEFSRTH--------------LSINHTFNTYD 168
           +LS N  P  A   ++K  +D  L LKF  ++  H              L+    + T  
Sbjct: 41  HLSANFTPEAASNLLLKSQNDQALILKFLNWANPHQFFTLRCKCITLHILTKFKLYKTAQ 100

Query: 169 LLMRNLCKVGLNDS-AKIVFDCMRS--DGILPDSSILELLVSSYARMGKLDSAKNFLNEV 228
           +L  ++    L+D  A +VF  ++   D     SS+ +L+V SY+R+  +D A + ++  
Sbjct: 101 ILAEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLA 160

Query: 229 HCYGIKVSPFVYNNLLNMLVK-QNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGE 288
             +G       YN +L+  ++ +  +  A  +F+E LE    P+V+++NILIRG C  G 
Sbjct: 161 QAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGN 220

Query: 289 IDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYT 348
           ID A   F  M   GC P++V+YNTLI+G+C++ +I  G  LL+  M +KG+ P++I+Y 
Sbjct: 221 IDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLR-SMALKGLEPNLISYN 280

Query: 349 SIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLL 408
            +I+G C+ G MK  S +  EM   G   ++ T+N LI G+ K GN   A+VM+ +ML  
Sbjct: 281 VVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRH 340

Query: 409 GCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREA 468
           G  P V+T+TSLI   C+ G +N+ ++  ++M+VR L PN  TY  L++   ++  + EA
Sbjct: 341 GLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEA 400

Query: 469 RNFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIG 528
              LR +  +   P    YN +I+G C  GK+++A  ++ +M+EK   PD ++++ ++ G
Sbjct: 401 YRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSG 460

Query: 529 NCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEASQIKQAALQ 564
            C    + EA+    +M+E    PD IT +SLI    +     EA  + +  L+
Sbjct: 461 FCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLR 513

BLAST of CsGy2G000170 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 1.5e-64
Identity = 144/399 (36.09%), Postives = 214/399 (53.63%), Query Frame = 0

Query: 151 TYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEV 210
           TY+ ++   CK+G  + A      +   G+ PD      L+  Y +   LDSA    NE+
Sbjct: 220 TYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEM 279

Query: 211 HCYGIKVSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEI 270
              G + +   Y +L++ L     +DEA+ LF +  +    P V ++ +LI+ LC     
Sbjct: 280 PLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERK 339

Query: 271 DKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTS 330
            +A    + M   G  P+I +Y  LI+  C   +  K  +LL + ML KG+ P+VITY +
Sbjct: 340 SEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQ-MLEKGLMPNVITYNA 399

Query: 331 IISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLG 390
           +I+GYCK G ++ A ++ + M S  + PN  T+N LI G+ K  N+  AM +  KML   
Sbjct: 400 LINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCK-SNVHKAMGVLNKMLERK 459

Query: 391 CLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREAR 450
            LPDVVT+ SLIDG CR G  +   +L   M  R L P+ +TY  +I++LCK  R+ EA 
Sbjct: 460 VLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEAC 519

Query: 451 NFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGN 510
           +    L+   V P   +Y  +IDG+CKAGKVDEA+ ++ +M  K C P+ +TF  LI G 
Sbjct: 520 DLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGL 579

Query: 511 CMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAG 550
           C  G++ EA     KM++I   P   T   LI  LLK G
Sbjct: 580 CADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDG 616

BLAST of CsGy2G000170 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 242.3 bits (617), Expect = 1.4e-62
Identity = 137/436 (31.42%), Postives = 231/436 (52.98%), Query Frame = 0

Query: 133 LKFFEFSRTHLS---INHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILEL 192
           LK  E S   +S   I    +T+++L++ LC+      A ++ + M S G++PD      
Sbjct: 170 LKLVEISHAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTT 229

Query: 193 LVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVLLFRE-HLEP 252
           ++  Y   G LD A     ++  +G   S    N +++   K+  V++A+   +E   + 
Sbjct: 230 VMQGYIEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQD 289

Query: 253 YFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKG 312
            F PD Y+FN L+ GLC+ G +  A E    M   G  PD+ +YN++I+G C++ E+ + 
Sbjct: 290 GFFPDQYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEA 349

Query: 313 HDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLID 372
            ++L + M+ +  SP+ +TY ++IS  CK   ++ A+EL   + S GI P+  TFN LI 
Sbjct: 350 VEVL-DQMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQ 409

Query: 373 GFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSP 432
           G     N R AM ++E+M   GC PD  T+  LID  C +G++++ L + ++M++   + 
Sbjct: 410 GLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCAR 469

Query: 433 NVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIV 492
           +V TY  LI+  CK N+ REA      ++   V      YN +IDG CK+ +V++A  ++
Sbjct: 470 SVITYNTLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLM 529

Query: 493 AEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKA 552
            +M  +  +PDK T+  L+   C  G + +A      M    C PD +T  +LIS L KA
Sbjct: 530 DQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKA 589

Query: 553 GMPNEASQIKQAALQK 565
           G    AS++ ++   K
Sbjct: 590 GRVEVASKLLRSIQMK 604

BLAST of CsGy2G000170 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 233.0 bits (593), Expect = 8.5e-60
Identity = 121/392 (30.87%), Postives = 210/392 (53.57%), Query Frame = 0

Query: 151 TYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEV 210
           T   L+  LC  G    A ++ D M   G  PD      +++   + G    A +   ++
Sbjct: 177 TVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKM 236

Query: 211 HCYGIKVSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEI 270
               IK S   Y+ +++ L K    D+A+ LF E        DV +++ LI GLC  G+ 
Sbjct: 237 EERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKW 296

Query: 271 DKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTS 330
           D   +  + M      PD+V+++ LI+ F +  ++ +  +L  E M+ +G++PD ITY S
Sbjct: 297 DDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNE-MITRGIAPDTITYNS 356

Query: 331 IISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLG 390
           +I G+CK   +  A+++FD MVS G +P+  T+++LI+ + K   +   M ++ ++   G
Sbjct: 357 LIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKG 416

Query: 391 CLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREAR 450
            +P+ +T+ +L+ G+C+ G++N   +L++EM  R + P+V TY +L++ LC    + +A 
Sbjct: 417 LIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKAL 476

Query: 451 NFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGN 510
                ++ S +     IYN +I G C A KVD+A  +   + +K  +PD +T+ ++I G 
Sbjct: 477 EIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGL 536

Query: 511 CMKGRMVEAISTFYKMIEINCVPDEITINSLI 543
           C KG + EA   F KM E  C PD+ T N LI
Sbjct: 537 CKKGSLSEADMLFRKMKEDGCTPDDFTYNILI 567

BLAST of CsGy2G000170 vs. NCBI nr
Match: XP_008459803.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g06000 [Cucumis melo] >XP_008459808.1 PREDICTED: pentatricopeptide repeat-containing protein At2g06000 [Cucumis melo] >XP_008459817.1 PREDICTED: pentatricopeptide repeat-containing protein At2g06000 [Cucumis melo])

HSP 1 Score: 1126 bits (2913), Expect = 0.0
Identity = 561/588 (95.41%), Postives = 569/588 (96.77%), Query Frame = 0

Query: 1   MNIREPDTQSERPTGCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 60
           M IRE + QSERPT CKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLR SNFSI
Sbjct: 1   MKIRESERQSERPTVCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRLSNFSI 60

Query: 61  AQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 120
           +QFYSLADSVSRARPFCDREIIR+SEAWLVKVVCTLF RSHSLNACFGYLSRNLNPSIAF
Sbjct: 61  SQFYSLADSVSRARPFCDREIIRNSEAWLVKVVCTLFSRSHSLNACFGYLSRNLNPSIAF 120

Query: 121 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180
           EVIKRF DPLLGLKFFEFSR HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI
Sbjct: 121 EVIKRFRDPLLGLKFFEFSRIHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180

Query: 181 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 240
           LPDSSILELLVSSYARMGK DSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAV 
Sbjct: 181 LPDSSILELLVSSYARMGKFDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVS 240

Query: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300
           LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC
Sbjct: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300

Query: 301 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 360
           RVNEISKGHDLLKE MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPN+
Sbjct: 301 RVNEISKGHDLLKEVMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNN 360

Query: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 420
           FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGC PDVVT TSLIDGYCREGEVNQGLKLWEE
Sbjct: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCPPDVVTVTSLIDGYCREGEVNQGLKLWEE 420

Query: 421 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 480
           MK+RNLSPNVYTYAVLINALCKENRI+EARNFLRHLKSSEVVPKPFIYNPVID FCKAG+
Sbjct: 421 MKLRNLSPNVYTYAVLINALCKENRIQEARNFLRHLKSSEVVPKPFIYNPVIDRFCKAGQ 480

Query: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 540
           VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINC+PDEITINS
Sbjct: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCIPDEITINS 540

Query: 541 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAVW 588
           LISCLLKAGMPNEASQIKQAALQKLNLG  SLGS LTRKSS VP AVW
Sbjct: 541 LISCLLKAGMPNEASQIKQAALQKLNLGSLSLGSLLTRKSSSVPVAVW 588

BLAST of CsGy2G000170 vs. NCBI nr
Match: KAA0061465.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK10809.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1125 bits (2911), Expect = 0.0
Identity = 561/588 (95.41%), Postives = 569/588 (96.77%), Query Frame = 0

Query: 1   MNIREPDTQSERPTGCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 60
           M IRE + QSERPT CKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLR SNFSI
Sbjct: 1   MKIRESERQSERPTVCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRLSNFSI 60

Query: 61  AQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 120
           +QFYSLADSVSRARPFCDREIIR+SEAWLVKVVCTLF RSHSLNACFGYLSRNLNPSIAF
Sbjct: 61  SQFYSLADSVSRARPFCDREIIRNSEAWLVKVVCTLFSRSHSLNACFGYLSRNLNPSIAF 120

Query: 121 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180
           EVIKRF DPLLGLKFFEFSR HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI
Sbjct: 121 EVIKRFRDPLLGLKFFEFSRIHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180

Query: 181 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 240
            PDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAV 
Sbjct: 181 SPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVS 240

Query: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300
           LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC
Sbjct: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300

Query: 301 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 360
           RVNEISKGHDLLKE MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPN+
Sbjct: 301 RVNEISKGHDLLKEVMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNN 360

Query: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 420
           FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGC PDVVT TSLIDGYCREGEVNQGLKLWEE
Sbjct: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCPPDVVTVTSLIDGYCREGEVNQGLKLWEE 420

Query: 421 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 480
           MK+RNLSPNVYTYAVLINALCKENRI+EARNFLRHLKSSEVVPKPFIYNPVID FCKAG+
Sbjct: 421 MKLRNLSPNVYTYAVLINALCKENRIQEARNFLRHLKSSEVVPKPFIYNPVIDRFCKAGQ 480

Query: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 540
           VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINC+PDEITINS
Sbjct: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCIPDEITINS 540

Query: 541 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAVW 588
           LISCLLKAGMPNEASQIKQAALQKLNLG  SLGS LTRKSS VP AVW
Sbjct: 541 LISCLLKAGMPNEASQIKQAALQKLNLGSLSLGSLLTRKSSSVPVAVW 588

BLAST of CsGy2G000170 vs. NCBI nr
Match: KAE8651488.1 (hypothetical protein Csa_019253 [Cucumis sativus])

HSP 1 Score: 1103 bits (2852), Expect = 0.0
Identity = 547/548 (99.82%), Postives = 547/548 (99.82%), Query Frame = 0

Query: 41  MTLLFFSSRAYRLRTSNFSIAQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRS 100
           MTLLFFSSRAYRLRTSNFSIAQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRS
Sbjct: 1   MTLLFFSSRAYRLRTSNFSIAQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRS 60

Query: 101 HSLNACFGYLSRNLNPSIAFEVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLC 160
           HSLNACFGYLSRNLNPSIAFEVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLC
Sbjct: 61  HSLNACFGYLSRNLNPSIAFEVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLC 120

Query: 161 KVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPF 220
           KVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPF
Sbjct: 121 KVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPF 180

Query: 221 VYNNLLNMLVKQNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNM 280
           VYNNLLNMLVKQNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNM
Sbjct: 181 VYNNLLNMLVKQNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNM 240

Query: 281 GNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGD 340
           GNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGD
Sbjct: 241 GNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGD 300

Query: 341 MKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTS 400
           MKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTS
Sbjct: 301 MKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTS 360

Query: 401 LIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSE 460
           LIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSE
Sbjct: 361 LIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSE 420

Query: 461 VVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAI 520
           VVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAI
Sbjct: 421 VVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAI 480

Query: 521 STFYKMIEINCVPDEITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKS 580
           STFYKMIEINCVPDEITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKS
Sbjct: 481 STFYKMIEINCVPDEITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKS 540

Query: 581 SRVPAAVW 588
           SRVP AVW
Sbjct: 541 SRVPVAVW 548

BLAST of CsGy2G000170 vs. NCBI nr
Match: XP_031736975.1 (pentatricopeptide repeat-containing protein At2g06000 [Cucumis sativus])

HSP 1 Score: 1100 bits (2845), Expect = 0.0
Identity = 554/588 (94.22%), Postives = 554/588 (94.22%), Query Frame = 0

Query: 1   MNIREPDTQSERPTGCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 60
           MNIREPDTQSERPTGCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI
Sbjct: 1   MNIREPDTQSERPTGCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 60

Query: 61  AQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 120
           AQFYSLADSVSRARPFCDRE                                 LNPSIAF
Sbjct: 61  AQFYSLADSVSRARPFCDRE---------------------------------LNPSIAF 120

Query: 121 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180
           EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI
Sbjct: 121 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180

Query: 181 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 240
           LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL
Sbjct: 181 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 240

Query: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300
           LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC
Sbjct: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300

Query: 301 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 360
           RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND
Sbjct: 301 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 360

Query: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 420
           FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE
Sbjct: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 420

Query: 421 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 480
           MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK
Sbjct: 421 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 480

Query: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 540
           VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS
Sbjct: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 540

Query: 541 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAVW 588
           LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVP AVW
Sbjct: 541 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPVAVW 555

BLAST of CsGy2G000170 vs. NCBI nr
Match: XP_038890050.1 (pentatricopeptide repeat-containing protein At2g06000 [Benincasa hispida])

HSP 1 Score: 1078 bits (2789), Expect = 0.0
Identity = 532/584 (91.10%), Postives = 558/584 (95.55%), Query Frame = 0

Query: 7   DTQSERPT---GCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSIAQF 66
           DTQSERP+   GCK+ RIVIENIWEDRAEP+DGVGLQMTLLFFSSR YR R SNFSIAQF
Sbjct: 11  DTQSERPSRIPGCKRARIVIENIWEDRAEPSDGVGLQMTLLFFSSRTYRFRASNFSIAQF 70

Query: 67  YSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAFEVI 126
           YSLAD VSRARPFCDRE++R+ E WLVKVVCTLFFRS SLNACFGYLSRNLNPSIAFEVI
Sbjct: 71  YSLADGVSRARPFCDREVLRNPETWLVKVVCTLFFRSISLNACFGYLSRNLNPSIAFEVI 130

Query: 127 KRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPD 186
           KRF DPLLGLKFFEFSRTHLSI HTFNTYDLL+RNLC++GLNDSAKIVFDCMRSDGILPD
Sbjct: 131 KRFRDPLLGLKFFEFSRTHLSIKHTFNTYDLLVRNLCQMGLNDSAKIVFDCMRSDGILPD 190

Query: 187 SSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVLLFR 246
           SSI+EL+VSSYA+MGKLDSAKNFLNEVH YGIKVSPFVYNNLLNMLVKQNLVDEAVLLFR
Sbjct: 191 SSIVELMVSSYAQMGKLDSAKNFLNEVHRYGIKVSPFVYNNLLNMLVKQNLVDEAVLLFR 250

Query: 247 EHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVN 306
           EHLEPY VPDVYSFNILIRGLCRIG+IDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVN
Sbjct: 251 EHLEPYLVPDVYSFNILIRGLCRIGKIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVN 310

Query: 307 EISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTF 366
           EISKGHDLLKE + IKGVSPDV+TYTS+ISGYCKLGDMKAAS+LFDEMVSSGIKPNDFTF
Sbjct: 311 EISKGHDLLKEVLSIKGVSPDVVTYTSLISGYCKLGDMKAASDLFDEMVSSGIKPNDFTF 370

Query: 367 NVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKV 426
           NVLIDGFGKVGNMRSA+VMYEKMLLLGC PDVVT TSLIDGYCR GEVNQGLKLWEEMKV
Sbjct: 371 NVLIDGFGKVGNMRSALVMYEKMLLLGCPPDVVTVTSLIDGYCRAGEVNQGLKLWEEMKV 430

Query: 427 RNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDE 486
           RNLSPNVYTYAVLINALCKENRI+EAR+FLRHLK SEVVPKPFIYNPVIDGFCKAGKVDE
Sbjct: 431 RNLSPNVYTYAVLINALCKENRIQEARSFLRHLKVSEVVPKPFIYNPVIDGFCKAGKVDE 490

Query: 487 ANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINSLIS 546
           AN IVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKM+E+NC+PDEITINSLIS
Sbjct: 491 ANVIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMLEVNCIPDEITINSLIS 550

Query: 547 CLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAV 587
           CLLKAGMPNEASQIKQAA++ LNLGL SLGSPLT+KSS VP AV
Sbjct: 551 CLLKAGMPNEASQIKQAAVENLNLGLLSLGSPLTKKSSCVPVAV 594

BLAST of CsGy2G000170 vs. ExPASy TrEMBL
Match: A0A1S3CB42 (pentatricopeptide repeat-containing protein At2g06000 OS=Cucumis melo OX=3656 GN=LOC103498823 PE=4 SV=1)

HSP 1 Score: 1126 bits (2913), Expect = 0.0
Identity = 561/588 (95.41%), Postives = 569/588 (96.77%), Query Frame = 0

Query: 1   MNIREPDTQSERPTGCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 60
           M IRE + QSERPT CKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLR SNFSI
Sbjct: 1   MKIRESERQSERPTVCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRLSNFSI 60

Query: 61  AQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 120
           +QFYSLADSVSRARPFCDREIIR+SEAWLVKVVCTLF RSHSLNACFGYLSRNLNPSIAF
Sbjct: 61  SQFYSLADSVSRARPFCDREIIRNSEAWLVKVVCTLFSRSHSLNACFGYLSRNLNPSIAF 120

Query: 121 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180
           EVIKRF DPLLGLKFFEFSR HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI
Sbjct: 121 EVIKRFRDPLLGLKFFEFSRIHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180

Query: 181 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 240
           LPDSSILELLVSSYARMGK DSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAV 
Sbjct: 181 LPDSSILELLVSSYARMGKFDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVS 240

Query: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300
           LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC
Sbjct: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300

Query: 301 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 360
           RVNEISKGHDLLKE MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPN+
Sbjct: 301 RVNEISKGHDLLKEVMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNN 360

Query: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 420
           FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGC PDVVT TSLIDGYCREGEVNQGLKLWEE
Sbjct: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCPPDVVTVTSLIDGYCREGEVNQGLKLWEE 420

Query: 421 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 480
           MK+RNLSPNVYTYAVLINALCKENRI+EARNFLRHLKSSEVVPKPFIYNPVID FCKAG+
Sbjct: 421 MKLRNLSPNVYTYAVLINALCKENRIQEARNFLRHLKSSEVVPKPFIYNPVIDRFCKAGQ 480

Query: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 540
           VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINC+PDEITINS
Sbjct: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCIPDEITINS 540

Query: 541 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAVW 588
           LISCLLKAGMPNEASQIKQAALQKLNLG  SLGS LTRKSS VP AVW
Sbjct: 541 LISCLLKAGMPNEASQIKQAALQKLNLGSLSLGSLLTRKSSSVPVAVW 588

BLAST of CsGy2G000170 vs. ExPASy TrEMBL
Match: A0A5A7V6K1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold332G001250 PE=4 SV=1)

HSP 1 Score: 1125 bits (2911), Expect = 0.0
Identity = 561/588 (95.41%), Postives = 569/588 (96.77%), Query Frame = 0

Query: 1   MNIREPDTQSERPTGCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 60
           M IRE + QSERPT CKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLR SNFSI
Sbjct: 1   MKIRESERQSERPTVCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRLSNFSI 60

Query: 61  AQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 120
           +QFYSLADSVSRARPFCDREIIR+SEAWLVKVVCTLF RSHSLNACFGYLSRNLNPSIAF
Sbjct: 61  SQFYSLADSVSRARPFCDREIIRNSEAWLVKVVCTLFSRSHSLNACFGYLSRNLNPSIAF 120

Query: 121 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180
           EVIKRF DPLLGLKFFEFSR HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI
Sbjct: 121 EVIKRFRDPLLGLKFFEFSRIHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 180

Query: 181 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 240
            PDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAV 
Sbjct: 181 SPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVS 240

Query: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300
           LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC
Sbjct: 241 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 300

Query: 301 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 360
           RVNEISKGHDLLKE MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPN+
Sbjct: 301 RVNEISKGHDLLKEVMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNN 360

Query: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 420
           FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGC PDVVT TSLIDGYCREGEVNQGLKLWEE
Sbjct: 361 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCPPDVVTVTSLIDGYCREGEVNQGLKLWEE 420

Query: 421 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 480
           MK+RNLSPNVYTYAVLINALCKENRI+EARNFLRHLKSSEVVPKPFIYNPVID FCKAG+
Sbjct: 421 MKLRNLSPNVYTYAVLINALCKENRIQEARNFLRHLKSSEVVPKPFIYNPVIDRFCKAGQ 480

Query: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 540
           VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINC+PDEITINS
Sbjct: 481 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCIPDEITINS 540

Query: 541 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAVW 588
           LISCLLKAGMPNEASQIKQAALQKLNLG  SLGS LTRKSS VP AVW
Sbjct: 541 LISCLLKAGMPNEASQIKQAALQKLNLGSLSLGSLLTRKSSSVPVAVW 588

BLAST of CsGy2G000170 vs. ExPASy TrEMBL
Match: A0A6J1G470 (pentatricopeptide repeat-containing protein At2g06000 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111450639 PE=4 SV=1)

HSP 1 Score: 1042 bits (2695), Expect = 0.0
Identity = 512/587 (87.22%), Postives = 543/587 (92.50%), Query Frame = 0

Query: 4   REPDTQSERPT---GCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 63
           +E  TQSE P    GCK+ RIVIENIWEDRAEP+DGVGLQMTLLFFSSRAYR R SNF I
Sbjct: 8   QESVTQSEGPNRIPGCKRARIVIENIWEDRAEPSDGVGLQMTLLFFSSRAYRFRASNFFI 67

Query: 64  AQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 123
           AQFYS+AD VSRARPFCDRE+IR+ EAWLVKVVCTLFFRSHS+NACFGYLSRNLNPSIAF
Sbjct: 68  AQFYSIADGVSRARPFCDREVIRNPEAWLVKVVCTLFFRSHSINACFGYLSRNLNPSIAF 127

Query: 124 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 183
           EVIKRF DPLLGLKFFEFSRTHLSINHTFNTYDLL+RNLC++GLNDSAK VFDCMR+DGI
Sbjct: 128 EVIKRFRDPLLGLKFFEFSRTHLSINHTFNTYDLLVRNLCQMGLNDSAKTVFDCMRTDGI 187

Query: 184 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 243
           LPDSSI+E+LVSSYAR+GKLD+AKNFL+E+HCYGIKVSPFVYNN LNMLVKQN VDEAVL
Sbjct: 188 LPDSSIVEILVSSYARLGKLDAAKNFLDEIHCYGIKVSPFVYNNFLNMLVKQNQVDEAVL 247

Query: 244 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 303
           LFREHLEPYF PDVY+FNILIRGLCRIGE DKAFE+ QNM NFGCFPDIVSYNTLINGFC
Sbjct: 248 LFREHLEPYFTPDVYTFNILIRGLCRIGETDKAFEYLQNMENFGCFPDIVSYNTLINGFC 307

Query: 304 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 363
           RVNE+SKGH LLKE   IKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND
Sbjct: 308 RVNEVSKGHVLLKEVQSIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 367

Query: 364 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 423
           FTFNVLIDGFGK G MR A+ MYEKMLLLGCLPDVVT TSLIDGYCR GEVNQGLKLWEE
Sbjct: 368 FTFNVLIDGFGKAGEMRLALAMYEKMLLLGCLPDVVTVTSLIDGYCRAGEVNQGLKLWEE 427

Query: 424 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 483
           MKVR+LSPNVYTYAVLINALCKENRI+EARNFLRHLKSSE+VPKPFIYNPVIDGFCKAGK
Sbjct: 428 MKVRDLSPNVYTYAVLINALCKENRIQEARNFLRHLKSSEIVPKPFIYNPVIDGFCKAGK 487

Query: 484 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 543
           VDEAN IVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAI+ FYKM+E  C+PDEITINS
Sbjct: 488 VDEANVIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAINIFYKMLETKCIPDEITINS 547

Query: 544 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAV 587
           LISCLLKAG P+EASQIKQAA + LN  L SL +PLTRKSS VP A+
Sbjct: 548 LISCLLKAGRPDEASQIKQAAFENLNFDLLSLKNPLTRKSSCVPVAI 594

BLAST of CsGy2G000170 vs. ExPASy TrEMBL
Match: A0A6J1KES2 (pentatricopeptide repeat-containing protein At2g06000 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111493657 PE=4 SV=1)

HSP 1 Score: 1035 bits (2675), Expect = 0.0
Identity = 509/587 (86.71%), Postives = 542/587 (92.33%), Query Frame = 0

Query: 4   REPDTQSERPT---GCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 63
           +E  TQSE P    GCK+ RIVIENIWEDRAEP+DGVGLQMTLLFFSSRA R R SNF I
Sbjct: 8   QESVTQSEGPNRIPGCKRARIVIENIWEDRAEPSDGVGLQMTLLFFSSRACRFRASNFFI 67

Query: 64  AQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 123
           AQFYS+AD VSRARPFCDRE+IR+ E+WLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF
Sbjct: 68  AQFYSIADGVSRARPFCDREVIRNPESWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 127

Query: 124 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 183
           EVIKRF DPLLGLKFFEFSRTHLSINHTFNTYDLL+RNLC++GLNDSAK VFDCMR+DGI
Sbjct: 128 EVIKRFRDPLLGLKFFEFSRTHLSINHTFNTYDLLVRNLCQMGLNDSAKTVFDCMRTDGI 187

Query: 184 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 243
           LPDSSI+E+LVSSYA++GKLD+AKNFL+E+HCYGIKV PFVYNN LNMLVKQN VDEAVL
Sbjct: 188 LPDSSIVEILVSSYAQLGKLDAAKNFLDEIHCYGIKVIPFVYNNFLNMLVKQNQVDEAVL 247

Query: 244 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 303
           LFREHLEPYF PDVY+FNILIRGLCRIGE DKAFE+ QNM NFGCFPDIVSYNTLINGFC
Sbjct: 248 LFREHLEPYFTPDVYTFNILIRGLCRIGETDKAFEYLQNMENFGCFPDIVSYNTLINGFC 307

Query: 304 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 363
           RVNE+SKGHDLLKE   IKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND
Sbjct: 308 RVNEVSKGHDLLKEVQSIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 367

Query: 364 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 423
           FTFNVLIDGFGK G MR A+ MY+KMLLLGCLPDVVT TSLIDGYCR GEVNQGLKLWEE
Sbjct: 368 FTFNVLIDGFGKAGEMRLALAMYDKMLLLGCLPDVVTVTSLIDGYCRAGEVNQGLKLWEE 427

Query: 424 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 483
           MKVR+LSPNVYTYAVLINALCKENRI+EARNFLRHLK SE+VPKPFIYNPVIDGFCKAGK
Sbjct: 428 MKVRDLSPNVYTYAVLINALCKENRIQEARNFLRHLKLSEIVPKPFIYNPVIDGFCKAGK 487

Query: 484 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 543
           VDEAN IVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAI+ FYKM+E  C+PDEITINS
Sbjct: 488 VDEANVIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAINIFYKMLETKCIPDEITINS 547

Query: 544 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAV 587
           LISCLLKAGMP+EASQIKQAAL+ LN  L SL +PLT KSS VP A+
Sbjct: 548 LISCLLKAGMPDEASQIKQAALEYLNFDLLSLKNPLTIKSSCVPVAI 594

BLAST of CsGy2G000170 vs. ExPASy TrEMBL
Match: A0A6J1DMW0 (pentatricopeptide repeat-containing protein At2g06000 OS=Momordica charantia OX=3673 GN=LOC111021880 PE=4 SV=1)

HSP 1 Score: 1010 bits (2611), Expect = 0.0
Identity = 500/587 (85.18%), Postives = 533/587 (90.80%), Query Frame = 0

Query: 4   REPDTQSERPT---GCKQERIVIENIWEDRAEPNDGVGLQMTLLFFSSRAYRLRTSNFSI 63
           +E DTQSERP    GCK+ RIVIEN+WEDRAE +DGVGLQMTLLFFSSRAYR R S+FSI
Sbjct: 8   QESDTQSERPDRIPGCKRTRIVIENLWEDRAESSDGVGLQMTLLFFSSRAYRFRASSFSI 67

Query: 64  AQFYSLADSVSRARPFCDREIIRHSEAWLVKVVCTLFFRSHSLNACFGYLSRNLNPSIAF 123
           AQF+S  D  SRAR F DRE+IR+ EAWL KVVCTLFFRSHSLN CFGYLSRNL PSIAF
Sbjct: 68  AQFHSFTDGGSRARSFYDREVIRNPEAWLAKVVCTLFFRSHSLNGCFGYLSRNLTPSIAF 127

Query: 124 EVIKRFSDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGI 183
           EVIKRF DP+LGLKFFEFSR HLSINH+FNTYDLLMRNLC++GL+ SAK+VFDCMR DGI
Sbjct: 128 EVIKRFKDPILGLKFFEFSRAHLSINHSFNTYDLLMRNLCQMGLHGSAKMVFDCMRCDGI 187

Query: 184 LPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVL 243
           LPDSS++ELLVSSYA+MGKLDSAK FLNEVHCYGIKVSPFVYNNLLN+LVKQN VDEAVL
Sbjct: 188 LPDSSVVELLVSSYAQMGKLDSAKIFLNEVHCYGIKVSPFVYNNLLNLLVKQNQVDEAVL 247

Query: 244 LFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFC 303
           LFREHLEPYF+PD Y+FNI+IRGLCRIGEI KAFEFFQNMGNFGCFPDIVSYNTLINGFC
Sbjct: 248 LFREHLEPYFIPDAYTFNIIIRGLCRIGEIHKAFEFFQNMGNFGCFPDIVSYNTLINGFC 307

Query: 304 RVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPND 363
           RVNEISKGHDLLKE   +KGV+PDVITYTSIISGYCKLGDMKAAS L DEMV SGIKPND
Sbjct: 308 RVNEISKGHDLLKEVQSVKGVAPDVITYTSIISGYCKLGDMKAASGLLDEMVGSGIKPND 367

Query: 364 FTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEE 423
           FTFNVLI GFGKVG+M SA+ MYEKMLLLGC PDVVT TSLIDGYCR GEVNQGLKLWEE
Sbjct: 368 FTFNVLIHGFGKVGDMISALAMYEKMLLLGCPPDVVTLTSLIDGYCRAGEVNQGLKLWEE 427

Query: 424 MKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGK 483
           MKVR LSPNVYTYAVLINALCKENRI+EAR FLRHLK SEVVPK FIYNPVIDGFCKAGK
Sbjct: 428 MKVRGLSPNVYTYAVLINALCKENRIQEARFFLRHLKLSEVVPKTFIYNPVIDGFCKAGK 487

Query: 484 VDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINS 543
           VDEAN IVAEM+EKKC PDKITFTILIIGNCM+GRMVEAIS FYKM+E NC+PDEITINS
Sbjct: 488 VDEANVIVAEMREKKCSPDKITFTILIIGNCMQGRMVEAISIFYKMLETNCIPDEITINS 547

Query: 544 LISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAAV 587
           LISCLLKAGMPNEASQIKQAA++ LNLGL SL +PL RK S VP AV
Sbjct: 548 LISCLLKAGMPNEASQIKQAAVENLNLGLLSLRNPLVRKPSCVPIAV 594

BLAST of CsGy2G000170 vs. TAIR 10
Match: AT2G06000.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 560.1 bits (1442), Expect = 2.2e-159
Identity = 283/532 (53.20%), Postives = 378/532 (71.05%), Query Frame = 0

Query: 59  SIAQFYSLADSVSRARPFCD--REIIRHSEAWLVKVVCTLF-FRSHSLNACFGYLSRNLN 118
           +IA F++ +   ++ARP  +  RE+I   EAWLVK+V TLF +R    + CF YLS+NLN
Sbjct: 9   AIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFCYLSKNLN 68

Query: 119 PSIAFEVIKRF-SDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDC 178
           P I+FEV+K+  ++P +G +F+EFSR  L+I H+F TY+LL R+LCK GL+D A  +F+C
Sbjct: 69  PFISFEVVKKLDNNPHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHDLAGQMFEC 128

Query: 179 MRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNL 238
           M+SDG+ P++ +L  LVSS+A  GKL  A   L  +  + ++    V N+LLN LVK + 
Sbjct: 129 MKSDGVSPNNRLLGFLVSSFAEKGKLHFATALL--LQSFEVEGCCMVVNSLLNTLVKLDR 188

Query: 239 VDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNT 298
           V++A+ LF EHL      D  +FNILIRGLC +G+ +KA E    M  FGC PDIV+YNT
Sbjct: 189 VEDAMKLFDEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSGFGCEPDIVTYNT 248

Query: 299 LINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSS 358
           LI GFC+ NE++K  ++ K+       SPDV+TYTS+ISGYCK G M+ AS L D+M+  
Sbjct: 249 LIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMREASSLLDDMLRL 308

Query: 359 GIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQG 418
           GI P + TFNVL+DG+ K G M +A  +  KM+  GC PDVVTFTSLIDGYCR G+V+QG
Sbjct: 309 GIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLIDGYCRVGQVSQG 368

Query: 419 LKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDG 478
            +LWEEM  R + PN +TY++LINALC ENR+ +AR  L  L S +++P+PF+YNPVIDG
Sbjct: 369 FRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDIIPQPFMYNPVIDG 428

Query: 479 FCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPD 538
           FCKAGKV+EAN IV EM++KKC+PDKITFTILIIG+CMKGRM EA+S F+KM+ I C PD
Sbjct: 429 FCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSIFHKMVAIGCSPD 488

Query: 539 EITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAA 587
           +IT++SL+SCLLKAGM  EA  + Q A +    G S+   PL  K++    A
Sbjct: 489 KITVSSLLSCLLKAGMAKEAYHLNQIARK----GQSNNVVPLETKTANATLA 534

BLAST of CsGy2G000170 vs. TAIR 10
Match: AT2G06000.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 560.1 bits (1442), Expect = 2.2e-159
Identity = 283/532 (53.20%), Postives = 378/532 (71.05%), Query Frame = 0

Query: 59  SIAQFYSLADSVSRARPFCD--REIIRHSEAWLVKVVCTLF-FRSHSLNACFGYLSRNLN 118
           +IA F++ +   ++ARP  +  RE+I   EAWLVK+V TLF +R    + CF YLS+NLN
Sbjct: 9   AIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFCYLSKNLN 68

Query: 119 PSIAFEVIKRF-SDPLLGLKFFEFSRTHLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDC 178
           P I+FEV+K+  ++P +G +F+EFSR  L+I H+F TY+LL R+LCK GL+D A  +F+C
Sbjct: 69  PFISFEVVKKLDNNPHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHDLAGQMFEC 128

Query: 179 MRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNL 238
           M+SDG+ P++ +L  LVSS+A  GKL  A   L  +  + ++    V N+LLN LVK + 
Sbjct: 129 MKSDGVSPNNRLLGFLVSSFAEKGKLHFATALL--LQSFEVEGCCMVVNSLLNTLVKLDR 188

Query: 239 VDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNT 298
           V++A+ LF EHL      D  +FNILIRGLC +G+ +KA E    M  FGC PDIV+YNT
Sbjct: 189 VEDAMKLFDEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSGFGCEPDIVTYNT 248

Query: 299 LINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSS 358
           LI GFC+ NE++K  ++ K+       SPDV+TYTS+ISGYCK G M+ AS L D+M+  
Sbjct: 249 LIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMREASSLLDDMLRL 308

Query: 359 GIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQG 418
           GI P + TFNVL+DG+ K G M +A  +  KM+  GC PDVVTFTSLIDGYCR G+V+QG
Sbjct: 309 GIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLIDGYCRVGQVSQG 368

Query: 419 LKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDG 478
            +LWEEM  R + PN +TY++LINALC ENR+ +AR  L  L S +++P+PF+YNPVIDG
Sbjct: 369 FRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDIIPQPFMYNPVIDG 428

Query: 479 FCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPD 538
           FCKAGKV+EAN IV EM++KKC+PDKITFTILIIG+CMKGRM EA+S F+KM+ I C PD
Sbjct: 429 FCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSIFHKMVAIGCSPD 488

Query: 539 EITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSPLTRKSSRVPAA 587
           +IT++SL+SCLLKAGM  EA  + Q A +    G S+   PL  K++    A
Sbjct: 489 KITVSSLLSCLLKAGMAKEAYHLNQIARK----GQSNNVVPLETKTANATLA 534

BLAST of CsGy2G000170 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 269.2 bits (687), Expect = 7.6e-72
Identity = 152/474 (32.07%), Postives = 255/474 (53.80%), Query Frame = 0

Query: 109 YLSRNLNPSIAFE-VIKRFSDPLLGLKFFEFSRTH--------------LSINHTFNTYD 168
           +LS N  P  A   ++K  +D  L LKF  ++  H              L+    + T  
Sbjct: 41  HLSANFTPEAASNLLLKSQNDQALILKFLNWANPHQFFTLRCKCITLHILTKFKLYKTAQ 100

Query: 169 LLMRNLCKVGLNDS-AKIVFDCMRS--DGILPDSSILELLVSSYARMGKLDSAKNFLNEV 228
           +L  ++    L+D  A +VF  ++   D     SS+ +L+V SY+R+  +D A + ++  
Sbjct: 101 ILAEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLA 160

Query: 229 HCYGIKVSPFVYNNLLNMLVK-QNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGE 288
             +G       YN +L+  ++ +  +  A  +F+E LE    P+V+++NILIRG C  G 
Sbjct: 161 QAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGN 220

Query: 289 IDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYT 348
           ID A   F  M   GC P++V+YNTLI+G+C++ +I  G  LL+  M +KG+ P++I+Y 
Sbjct: 221 IDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLR-SMALKGLEPNLISYN 280

Query: 349 SIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLL 408
            +I+G C+ G MK  S +  EM   G   ++ T+N LI G+ K GN   A+VM+ +ML  
Sbjct: 281 VVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRH 340

Query: 409 GCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREA 468
           G  P V+T+TSLI   C+ G +N+ ++  ++M+VR L PN  TY  L++   ++  + EA
Sbjct: 341 GLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEA 400

Query: 469 RNFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIG 528
              LR +  +   P    YN +I+G C  GK+++A  ++ +M+EK   PD ++++ ++ G
Sbjct: 401 YRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSG 460

Query: 529 NCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEASQIKQAALQ 564
            C    + EA+    +M+E    PD IT +SLI    +     EA  + +  L+
Sbjct: 461 FCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLR 513

BLAST of CsGy2G000170 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 248.8 bits (634), Expect = 1.1e-65
Identity = 144/399 (36.09%), Postives = 214/399 (53.63%), Query Frame = 0

Query: 151 TYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEV 210
           TY+ ++   CK+G  + A      +   G+ PD      L+  Y +   LDSA    NE+
Sbjct: 220 TYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEM 279

Query: 211 HCYGIKVSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPYFVPDVYSFNILIRGLCRIGEI 270
              G + +   Y +L++ L     +DEA+ LF +  +    P V ++ +LI+ LC     
Sbjct: 280 PLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERK 339

Query: 271 DKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTS 330
            +A    + M   G  P+I +Y  LI+  C   +  K  +LL + ML KG+ P+VITY +
Sbjct: 340 SEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQ-MLEKGLMPNVITYNA 399

Query: 331 IISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLG 390
           +I+GYCK G ++ A ++ + M S  + PN  T+N LI G+ K  N+  AM +  KML   
Sbjct: 400 LINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCK-SNVHKAMGVLNKMLERK 459

Query: 391 CLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREAR 450
            LPDVVT+ SLIDG CR G  +   +L   M  R L P+ +TY  +I++LCK  R+ EA 
Sbjct: 460 VLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEAC 519

Query: 451 NFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGN 510
           +    L+   V P   +Y  +IDG+CKAGKVDEA+ ++ +M  K C P+ +TF  LI G 
Sbjct: 520 DLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGL 579

Query: 511 CMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAG 550
           C  G++ EA     KM++I   P   T   LI  LLK G
Sbjct: 580 CADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDG 616

BLAST of CsGy2G000170 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 242.3 bits (617), Expect = 1.0e-63
Identity = 137/436 (31.42%), Postives = 231/436 (52.98%), Query Frame = 0

Query: 133 LKFFEFSRTHLS---INHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILEL 192
           LK  E S   +S   I    +T+++L++ LC+      A ++ + M S G++PD      
Sbjct: 170 LKLVEISHAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTT 229

Query: 193 LVSSYARMGKLDSAKNFLNEVHCYGIKVSPFVYNNLLNMLVKQNLVDEAVLLFRE-HLEP 252
           ++  Y   G LD A     ++  +G   S    N +++   K+  V++A+   +E   + 
Sbjct: 230 VMQGYIEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQD 289

Query: 253 YFVPDVYSFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKG 312
            F PD Y+FN L+ GLC+ G +  A E    M   G  PD+ +YN++I+G C++ E+ + 
Sbjct: 290 GFFPDQYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEA 349

Query: 313 HDLLKEDMLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLID 372
            ++L + M+ +  SP+ +TY ++IS  CK   ++ A+EL   + S GI P+  TFN LI 
Sbjct: 350 VEVL-DQMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQ 409

Query: 373 GFGKVGNMRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSP 432
           G     N R AM ++E+M   GC PD  T+  LID  C +G++++ L + ++M++   + 
Sbjct: 410 GLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCAR 469

Query: 433 NVYTYAVLINALCKENRIREARNFLRHLKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIV 492
           +V TY  LI+  CK N+ REA      ++   V      YN +IDG CK+ +V++A  ++
Sbjct: 470 SVITYNTLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLM 529

Query: 493 AEMQEKKCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKA 552
            +M  +  +PDK T+  L+   C  G + +A      M    C PD +T  +LIS L KA
Sbjct: 530 DQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKA 589

Query: 553 GMPNEASQIKQAALQK 565
           G    AS++ ++   K
Sbjct: 590 GRVEVASKLLRSIQMK 604

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZUE93.0e-15853.20Pentatricopeptide repeat-containing protein At2g06000 OS=Arabidopsis thaliana OX... [more]
Q9FIX31.1e-7032.07Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Q9LSL91.5e-6436.09Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX... [more]
Q9LFF11.4e-6231.42Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
Q6NQ838.5e-6030.87Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_008459803.10.095.41PREDICTED: pentatricopeptide repeat-containing protein At2g06000 [Cucumis melo] ... [more]
KAA0061465.10.095.41pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK10809... [more]
KAE8651488.10.099.82hypothetical protein Csa_019253 [Cucumis sativus][more]
XP_031736975.10.094.22pentatricopeptide repeat-containing protein At2g06000 [Cucumis sativus][more]
XP_038890050.10.091.10pentatricopeptide repeat-containing protein At2g06000 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A1S3CB420.095.41pentatricopeptide repeat-containing protein At2g06000 OS=Cucumis melo OX=3656 GN... [more]
A0A5A7V6K10.095.41Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1G4700.087.22pentatricopeptide repeat-containing protein At2g06000 isoform X1 OS=Cucurbita mo... [more]
A0A6J1KES20.086.71pentatricopeptide repeat-containing protein At2g06000 isoform X1 OS=Cucurbita ma... [more]
A0A6J1DMW00.085.18pentatricopeptide repeat-containing protein At2g06000 OS=Momordica charantia OX=... [more]
Match NameE-valueIdentityDescription
AT2G06000.12.2e-15953.20Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G06000.22.2e-15953.20Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G39710.17.6e-7232.07Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G65560.11.1e-6536.09Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G53700.11.0e-6331.42Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 537..556
e-value: 0.17
score: 12.2
coord: 190..215
e-value: 0.45
score: 10.9
coord: 221..244
e-value: 0.017
score: 15.4
coord: 151..180
e-value: 0.014
score: 15.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 431..463
e-value: 1.2E-5
score: 23.2
coord: 467..499
e-value: 7.3E-9
score: 33.3
coord: 255..289
e-value: 1.7E-9
score: 35.3
coord: 361..395
e-value: 3.8E-8
score: 31.0
coord: 326..359
e-value: 6.0E-12
score: 43.0
coord: 501..535
e-value: 0.0011
score: 16.9
coord: 396..430
e-value: 5.2E-10
score: 36.9
coord: 290..325
e-value: 1.7E-7
score: 28.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 323..371
e-value: 5.0E-19
score: 68.2
coord: 393..442
e-value: 3.2E-20
score: 72.1
coord: 466..511
e-value: 8.6E-13
score: 48.3
coord: 252..301
e-value: 5.9E-16
score: 58.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 499..533
score: 9.744654
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 429..463
score: 11.509422
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 359..393
score: 11.465577
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 148..182
score: 9.909073
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 218..252
score: 8.889672
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 253..287
score: 13.054966
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 324..358
score: 14.699161
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 288..323
score: 11.783455
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 394..428
score: 13.493418
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 464..498
score: 12.024604
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 424..490
e-value: 1.9E-19
score: 72.0
coord: 212..281
e-value: 2.0E-16
score: 62.0
coord: 354..423
e-value: 2.3E-21
score: 78.2
coord: 491..563
e-value: 1.1E-15
score: 59.7
coord: 103..211
e-value: 5.4E-9
score: 37.8
coord: 282..353
e-value: 9.9E-23
score: 82.6
NoneNo IPR availablePANTHERPTHR47941PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 3, MITOCHONDRIALcoord: 55..576
NoneNo IPR availablePANTHERPTHR47941:SF1OSJNBA0027P08.18 PROTEINcoord: 55..576
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 194..388

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy2G000170.2CsGy2G000170.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding