CsGy4G013690.1 (mRNA) Cucumber (Gy14) v2

Name: CsGy4G013690.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: pentatricopeptide repeat-containing protein At2g37320
Location: Chr4: 18688909..18690462 (+)
Sequence length: 1242
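
The location and sequence length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the 1242-nt sequence length refers to the spliced mRNA, which here is entirely coding and ends in a stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start, end = 18688909, 18690462   # Chr4, + strand
mrna_length = 1242                # "Sequence length" field

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

# Assumption: the mRNA is all CDS and its last codon is a stop codon.
codons = mrna_length // 3
protein_length = codons - 1

# Bases inside the genomic span that are absent from the mRNA (intronic).
intronic_bases = genomic_span - mrna_length

print(genomic_span, protein_length, intronic_bases)
```

Under these assumptions the genomic span should equal the length of the gene sequence (with intron), and the protein length should equal the length of the protein sequence listed further down.
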
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCTTCCCGATATCTCAATCTTCTGCACCATATAGACATGGCTTCAACGTCTTACCCTCCATTAGGTTTTTCTCTAACTTCAAATCCAATACCCATCCGACCACTAACTTACCAAAGCCTCCGAGACTTTTGGACCTCATTTCTCCGAAGGGAGATGTTTCCTATGAAAGTCGCCAAACCCATCTTCGCCTCATTCAGGACTTTTTACAAACAGATTCGGGTCAGTGCCGATCTCAAACCCTTCCCGTTGGATGTGATTCTCGTTCAATTGGTTTATCCAAGGATTCATCCTTTGTTCTTGATCAAGAATGTGAATCTGGTCATTGGGATGTTCAGTCGTTCGCAGGTAGATTTAAGTTTAATGCGAACGATATATCCAGTGTTTTGAGTTTGTGTAATTCTCAACGCAATCTTCGGGGTGGAATTCAGTATCATTCTGTGGCGATACGAACTGGGTTTATTGCTAATGTGTATGTAGGAAGTTCGCTGGTGAGTTTGTATGGGAAATGTGGGGAGTTGAGTAATGCATATCGGGTGTTTGATGAAATGCCTGTGAGAAATGTTGTGTCATGGACGGCCATTATTGCTGGGTTTGCTGTAGAATGGCAAGTTAATATGTGCTTGGAGCTTTTCCAAGAGATGAAGAGAATGGCATTGCAACCCAATGAGTTTACTTTTGTCACAATATTGACTGCTTGCACTGGTAGTGGAGCCCTTGGAGTAGGAAGAAGTCTTCATTGTCAAACAGTCAAAATGGGCTTTCATTCTTATCTCCATGTTGCAAATGCTTTGATCTCAATGTACTGTAAATGTGGAGCTCTTAACTTTGCATTATACATATTTGAAGCCATGGAAGTCAAAGACACTGTTTCGTGGAATTCCATGATTGCAGGTTATGCCCAACATGGACTTTCTCTGAGAGCCATTGATCTTTTCAAAGCAATGAGAAAGCAGAAGCAAGTGGAAGCCGATGCCATCACTTTCCTTGGTGTTCTGTCCTCGTGTAGACATGCAGGGTTTGTGGAAGAGGGGAGACACTACTTCAATCTTATGGTCGAGCTCGGTTTGAAACCGGAATTGGATCATTATTCATGTGTTATTGATCTGCTTGGTCGAGCTGGGTTACTAAAAGAGGCTCAAAACTTCATTGAGAAGATGCCCATAACTCCCAATTCAATTGTTTGGGGATCACTTCTTTCTGCTTGCAGGCTTCATGGGAATGTCTGGATAGGACTGAAGGCTGCGGAGAGTAGATTGTTACTGCAACCCGACTGTGCTTCAACGCATTTGCAATTGACGAATCTGTATGCAAAAGCAGGGTACTTAGATGATGCTGCAAGATTGAGGAAGATTATGAAAGACAAAGGGCTGAAAACTGCTCCTGGATATAGTTGGATTGAGATTCAAAATAAGGTTTATAGATTCAAAGCAGAAGATAAGTCAAACCCTTTAATGGTTGAGATTTTTGGTCTTATAGATGGCATGGTGAATCACATGAGATTCGTAGGTTGTGCTCACGAATTGGAGGATAAAGTTAATGAGTTCTGCTAG

mRNA sequence

ATGCTCTTCCCGATATCTCAATCTTCTGCACCATATAGACATGGCTTCAACGTCTTACCCTCCATTAGGTTTTTCTCTAACTTCAAATCCAATACCCATCCGACCACTAACTTACCAAAGCCTCCGAGACTTTTGGACCTCATTTCTCCGAAGGGAGATGTTTCCTATGAAAGTCGCCAAACCCATCTTCGCCTCATTCAGGACTTTTTACAAACAGATTCGGGTCAGTGCCGATCTCAAACCCTTCCCGTTGGATGTGATTCTCGTTCAATTGGTTTATCCAAGGATTCATCCTTTGTTCTTGATCAAGAATGTGAATCTGGTCATTGGGATGTTCAGTCGTTCGCAGGTAGATTTAAGTTTAATGCGAACGATATATCCAGTGTTTTGAGTTTGTGTAATTCTCAACGCAATCTTCGGGGTGGAATTCAGTATCATTCTGTGGCGATACGAACTGGGTTTATTGCTAATGTGTATGTAGGAAGTTCGCTGGTGAGTTTGTATGGGAAATGTGGGGAGTTGAGTAATGCATATCGGGTGTTTGATGAAATGCCTGTGAGAAATGTTGTGTCATGGACGGCCATTATTGCTGGGTTTGCTGTAGAATGGCAAGTTAATATGTGCTTGGAGCTTTTCCAAGAGATGAAGAGAATGGCATTGCAACCCAATGAGTTTACTTTTGTCACAATATTGACTGCTTGCACTGGGTTTGTGGAAGAGGGGAGACACTACTTCAATCTTATGGTCGAGCTCGGTTTGAAACCGGAATTGGATCATTATTCATGTGTTATTGATCTGCTTGGTCGAGCTGGGTTACTAAAAGAGGCTCAAAACTTCATTGAGAAGATGCCCATAACTCCCAATTCAATTGTTTGGGGATCACTTCTTTCTGCTTGCAGGCTTCATGGGAATGTCTGGATAGGACTGAAGGCTGCGGAGAGTAGATTGTTACTGCAACCCGACTGTGCTTCAACGCATTTGCAATTGACGAATCTGTATGCAAAAGCAGGGTACTTAGATGATGCTGCAAGATTGAGGAAGATTATGAAAGACAAAGGGCTGAAAACTGCTCCTGGATATAGTTGGATTGAGATTCAAAATAAGGTTTATAGATTCAAAGCAGAAGATAAGTCAAACCCTTTAATGGTTGAGATTTTTGGTCTTATAGATGGCATGGTGAATCACATGAGATTCGTAGGTTGTGCTCACGAATTGGAGGATAAAGTTAATGAGTTCTGCTAG

Coding sequence (CDS)

ATGCTCTTCCCGATATCTCAATCTTCTGCACCATATAGACATGGCTTCAACGTCTTACCCTCCATTAGGTTTTTCTCTAACTTCAAATCCAATACCCATCCGACCACTAACTTACCAAAGCCTCCGAGACTTTTGGACCTCATTTCTCCGAAGGGAGATGTTTCCTATGAAAGTCGCCAAACCCATCTTCGCCTCATTCAGGACTTTTTACAAACAGATTCGGGTCAGTGCCGATCTCAAACCCTTCCCGTTGGATGTGATTCTCGTTCAATTGGTTTATCCAAGGATTCATCCTTTGTTCTTGATCAAGAATGTGAATCTGGTCATTGGGATGTTCAGTCGTTCGCAGGTAGATTTAAGTTTAATGCGAACGATATATCCAGTGTTTTGAGTTTGTGTAATTCTCAACGCAATCTTCGGGGTGGAATTCAGTATCATTCTGTGGCGATACGAACTGGGTTTATTGCTAATGTGTATGTAGGAAGTTCGCTGGTGAGTTTGTATGGGAAATGTGGGGAGTTGAGTAATGCATATCGGGTGTTTGATGAAATGCCTGTGAGAAATGTTGTGTCATGGACGGCCATTATTGCTGGGTTTGCTGTAGAATGGCAAGTTAATATGTGCTTGGAGCTTTTCCAAGAGATGAAGAGAATGGCATTGCAACCCAATGAGTTTACTTTTGTCACAATATTGACTGCTTGCACTGGGTTTGTGGAAGAGGGGAGACACTACTTCAATCTTATGGTCGAGCTCGGTTTGAAACCGGAATTGGATCATTATTCATGTGTTATTGATCTGCTTGGTCGAGCTGGGTTACTAAAAGAGGCTCAAAACTTCATTGAGAAGATGCCCATAACTCCCAATTCAATTGTTTGGGGATCACTTCTTTCTGCTTGCAGGCTTCATGGGAATGTCTGGATAGGACTGAAGGCTGCGGAGAGTAGATTGTTACTGCAACCCGACTGTGCTTCAACGCATTTGCAATTGACGAATCTGTATGCAAAAGCAGGGTACTTAGATGATGCTGCAAGATTGAGGAAGATTATGAAAGACAAAGGGCTGAAAACTGCTCCTGGATATAGTTGGATTGAGATTCAAAATAAGGTTTATAGATTCAAAGCAGAAGATAAGTCAAACCCTTTAATGGTTGAGATTTTTGGTCTTATAGATGGCATGGTGAATCACATGAGATTCGTAGGTTGTGCTCACGAATTGGAGGATAAAGTTAATGAGTTCTGCTAG

Protein sequence

MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQTHLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFKFNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRVFDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACTGFVEEGRHYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMRFVGCAHELEDKVNEFC
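
The protein sequence can be reproduced from the CDS by translating it in frame 0. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` and `CODON_TABLE` are hypothetical helper names, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
cds_start = "ATGCTCTTCCCGATATCTCAATCTTCTGCA"
print(translate(cds_start))  # MLFPISQSSA
```

Passing the full CDS string instead of `cds_start` should give the complete protein sequence shown above.
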

BLAST of CsGy4G013690.1 vs. NCBI nr
Match: XP_004142220.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g37320 [Cucumis sativus] >KGN54200.1 hypothetical protein Csa_4G293100 [Cucumis sativus])

HSP 1 Score: 807.7 bits (2085), Expect = 1.8e-230
Identity = 413/517 (79.88%), Positives = 413/517 (79.88%), Query Frame = 0

Query: 1   MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ 60
           MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ
Sbjct: 8   MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ 67

Query: 61  THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK 120
           THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK
Sbjct: 68  THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK 127

Query: 121 FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV 180
           FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV
Sbjct: 128 FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV 187

Query: 181 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT----- 240
           FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT     
Sbjct: 188 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACTGSGAL 247

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 248 GVGRSLHCQTVKMGFHSYLHVANALISMYCKCGALNFALYIFEAMEVKDTVSWNSMIAGY 307

Query: 301 ---------------------------------------GFVEEGRHYFNLMVELGLKPE 360
                                                  GFVEEGRHYFNLMVELGLKPE
Sbjct: 308 AQHGLSLRAIDLFKAMRKQKQVEADAITFLGVLSSCRHAGFVEEGRHYFNLMVELGLKPE 367

Query: 361 LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL 414
           LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL
Sbjct: 368 LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL 427
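
The percentages in each HSP summary are the match counts divided by the alignment length. A small sketch of that arithmetic, using the numbers reported for this hit (the function name `percent` is illustrative):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / aligned_length, 2)


# Counts from the HSP summary above.
identical, aligned = 413, 517
print(percent(identical, aligned))  # 79.88
```

The alignment length counts gap columns as well as residues, which appears to be why a query that matches the subject at every one of its residues still reports less than 100% identity here: the remaining columns are the gapped region in the middle of the alignment.
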

BLAST of CsGy4G013690.1 vs. NCBI nr
Match: XP_008447309.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g37320 [Cucumis melo])

HSP 1 Score: 750.0 bits (1935), Expect = 4.4e-213
Identity = 385/515 (74.76%), Positives = 400/515 (77.67%), Query Frame = 0

Query: 1   MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ 60
           MLF ISQSSA YRHGFNV+PS+RFFSNFKSNTHPTTNLPKP RLLDLISPKGDVSYESRQ
Sbjct: 8   MLFAISQSSARYRHGFNVVPSVRFFSNFKSNTHPTTNLPKPLRLLDLISPKGDVSYESRQ 67

Query: 61  THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK 120
           THLRLIQDFLQTD  QCRSQTL VG DSRS+GLSKDSSFVLDQECESGHWDVQSFAGRFK
Sbjct: 68  THLRLIQDFLQTDPDQCRSQTLSVGFDSRSVGLSKDSSFVLDQECESGHWDVQSFAGRFK 127

Query: 121 FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV 180
           F+ANDISSVLSLCNSQRNLRGG+QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAY++
Sbjct: 128 FSANDISSVLSLCNSQRNLRGGLQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQM 187

Query: 181 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT----- 240
           FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELF+EMKRMALQPNEFTFVTILTACT     
Sbjct: 188 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFEEMKRMALQPNEFTFVTILTACTGSGAL 247

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 248 GVGRSLHCQTFKMGFHSYLHIANALISMYCKCGALNFALYIFEAMEVKDTVSWNSMIAGY 307

Query: 301 ---------------------------------------GFVEEGRHYFNLMVELGLKPE 360
                                                  GFVEEGRHYFNLMVELGLKPE
Sbjct: 308 AQHGLSHRAIDLFKAMRKQKQVEADAITFLGVLSSCRHAGFVEEGRHYFNLMVELGLKPE 367

Query: 361 LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL 412
           LDHYSCVIDLLGRAGLLKEAQNFIEKMP++PNSI+WGSLLSACRLHGNVWIGLKAAESRL
Sbjct: 368 LDHYSCVIDLLGRAGLLKEAQNFIEKMPMSPNSIIWGSLLSACRLHGNVWIGLKAAESRL 427

BLAST of CsGy4G013690.1 vs. NCBI nr
Match: XP_022977771.1 (pentatricopeptide repeat-containing protein At2g37320 [Cucurbita maxima])

HSP 1 Score: 660.2 bits (1702), Expect = 4.5e-186
Identity = 346/516 (67.05%), Positives = 376/516 (72.87%), Query Frame = 0

Query: 1   MLFPISQSSAPY-RHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESR 60
           MLF ISQSSA   RHGF+++ SIR FS FK NT PTTNLPKPPRLLDLISPKG+ + ESR
Sbjct: 8   MLFVISQSSALLCRHGFHIVTSIRLFSYFKRNTRPTTNLPKPPRLLDLISPKGNAASESR 67

Query: 61  QTHLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRF 120
           QTHLRLI+DFLQTDS QCRSQTL  G DS S+ LSKDSS VLDQE ESGHWD Q FAGRF
Sbjct: 68  QTHLRLIKDFLQTDSDQCRSQTLSDGFDSDSVFLSKDSSSVLDQERESGHWDFQLFAGRF 127

Query: 121 KFNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYR 180
           +F+ANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A +
Sbjct: 128 EFDANDISSALSLCCSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCCEMTDACQ 187

Query: 181 VFDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT---- 240
           VFDEMPVRNVVSWTAIIAGFA EWQV+MCLELFQ M+RMALQPNEFTF TIL+ACT    
Sbjct: 188 VFDEMPVRNVVSWTAIIAGFAQEWQVDMCLELFQRMRRMALQPNEFTFATILSACTGSGA 247

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 248 LGVGRSLHCQTFKMGFDSHVHIANALISMYCKCGALNFAVYLFEAMEVKDTVSWNSMIAG 307

Query: 301 ----------------------------------------GFVEEGRHYFNLMVELGLKP 360
                                                   G VEEGR+YFNLMVEL LKP
Sbjct: 308 YAQHGLSLKAIDLFEAMRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELALKP 367

Query: 361 ELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESR 412
           ELDHYSCVIDLLGRAGLLKEAQNFIEKMPI+PNSIVWGSLLSACRLHGNVWIGLKAAESR
Sbjct: 368 ELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESR 427

BLAST of CsGy4G013690.1 vs. NCBI nr
Match: XP_023544680.1 (pentatricopeptide repeat-containing protein At2g37320 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 653.3 bits (1684), Expect = 5.6e-184
Identity = 342/513 (66.67%), Positives = 375/513 (73.10%), Query Frame = 0

Query: 2   LFPISQSSAPY-RHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ 61
           LF IS+SSA   RHGF+++ SIR FS FK NT PTT+LPKPPRLLDLISPKG+ + ESRQ
Sbjct: 9   LFVISESSALLCRHGFHIVTSIRLFSYFKRNTLPTTSLPKPPRLLDLISPKGNAASESRQ 68

Query: 62  THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK 121
           THLRLI+DFLQTDS QCRSQTL  G DS S+ LSKDSS VLDQE ESGHW  Q FAGRF+
Sbjct: 69  THLRLIKDFLQTDSDQCRSQTLSDGFDSDSVFLSKDSSSVLDQERESGHWGFQLFAGRFE 128

Query: 122 FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV 181
           F+ANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A +V
Sbjct: 129 FDANDISSALSLCCSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCWEMTDACQV 188

Query: 182 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT----- 241
           FDEMPVRNVVSWTA+IAGFA EWQV+MCLELFQ+MKRMAL+PNEFTF TIL+ACT     
Sbjct: 189 FDEMPVRNVVSWTAMIAGFAQEWQVDMCLELFQQMKRMALRPNEFTFATILSACTGSGAL 248

Query: 242 ------------------------------------------------------------ 301
                                                                       
Sbjct: 249 GVGRSLHCQTFKMGFDSHVHIANALISMYCKCGALNFAVYLFEAMEVKDTVSWNSMIAGY 308

Query: 302 ---------------------------------------GFVEEGRHYFNLMVELGLKPE 361
                                                  G VEEGR+YFNLMVEL LKPE
Sbjct: 309 AQHGLSLQAIDLFEAMRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELSLKPE 368

Query: 362 LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL 410
           LDHYSCVIDLLGRAGLLKEAQNFIEKMPI+PNSIVWGSLLSACRLHGNVWIGLKAAESRL
Sbjct: 369 LDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRL 428

BLAST of CsGy4G013690.1 vs. NCBI nr
Match: XP_022949850.1 (pentatricopeptide repeat-containing protein At2g37320 [Cucurbita moschata])

HSP 1 Score: 649.8 bits (1675), Expect = 6.1e-183
Identity = 341/516 (66.09%), Positives = 372/516 (72.09%), Query Frame = 0

Query: 1   MLFPISQSSAPY-RHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESR 60
           MLF IS SSA   RHGF+++ SIR FS FK NT PTTNLPKPPRLLDLISPKG+ + ESR
Sbjct: 8   MLFVISASSALLCRHGFHIVTSIRLFSYFKRNTRPTTNLPKPPRLLDLISPKGNAASESR 67

Query: 61  QTHLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRF 120
           QTHLRLI+DFL+TDS QCRSQTL  G DS S+ LSKDSS V DQE ESGHW  Q FAGRF
Sbjct: 68  QTHLRLIKDFLRTDSDQCRSQTLSDGFDSDSVFLSKDSSSVRDQERESGHWGFQLFAGRF 127

Query: 121 KFNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYR 180
           +F+ANDISS LSLC SQRN RGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A +
Sbjct: 128 EFDANDISSALSLCCSQRNRRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCWEMNDACQ 187

Query: 181 VFDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT---- 240
           VFDEMPVRNVVSWTAIIAGFA EWQV+MCLELFQ M+RMALQPNEFTF TIL+ACT    
Sbjct: 188 VFDEMPVRNVVSWTAIIAGFAQEWQVDMCLELFQRMRRMALQPNEFTFATILSACTGSGA 247

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 248 LGVGRSLHCQTFKMGFDSHVHIANALISMYCKCGALNFAVYLFEAMEVKDTVSWNSIIAG 307

Query: 301 ----------------------------------------GFVEEGRHYFNLMVELGLKP 360
                                                   G VEEGR+YFNLMVELGLKP
Sbjct: 308 YAQHGLSLQAIDLFKAMRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELGLKP 367

Query: 361 ELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESR 412
           ELDHYSCVIDLLGRAGLLKEAQN IE MPI+PNSIVWGSLLSACRLHGNVWIGLKAAESR
Sbjct: 368 ELDHYSCVIDLLGRAGLLKEAQNLIENMPISPNSIVWGSLLSACRLHGNVWIGLKAAESR 427

BLAST of CsGy4G013690.1 vs. TAIR10
Match: AT2G37320.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 339.3 bits (869), Expect = 3.2e-93
Identity = 190/464 (40.95%), Positives = 257/464 (55.39%), Query Frame = 0

Query: 43  RLLDLISPK-GDVSYESRQTHLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVL 102
           R+LD+IS K G VS  +RQ H   +Q+F QTDS + R Q +     S    LS+  + V 
Sbjct: 44  RVLDIISSKSGGVS--NRQDHFGFVQEFRQTDSWRFRGQAI-----SEDFDLSRTKNGVS 103

Query: 103 DQECESGHWDVQSFAGR--FKFNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVY 162
               E    D  S   R  + F+A  +SS +  C   R+ R G  +H +A++ GFI++VY
Sbjct: 104 SVLEEVMLEDSSSSVKRDGWSFDAYGLSSAVRSCGLNRDFRTGSGFHCLALKGGFISDVY 163

Query: 163 VGSSLVSLYGKCGELSNAYRVFDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMA 222
           +GSSLV LY   GE+ NAY+VF+EMP RNVVSWTA+I+GFA EW+V++CL+L+ +M++  
Sbjct: 164 LGSSLVVLYRDSGEVENAYKVFEEMPERNVVSWTAMISGFAQEWRVDICLKLYSKMRKST 223

Query: 223 LQPNEFTFVTILTACT-------------------------------------------- 282
             PN++TF  +L+ACT                                            
Sbjct: 224 SDPNDYTFTALLSACTGSGALGQGRSVHCQTLHMGLKSYLHISNSLISMYCKCGDLKDAF 283

Query: 283 ------------------------------------------------------------ 342
                                                                       
Sbjct: 284 RIFDQFSNKDVVSWNSMIAGYAQHGLAMQAIELFELMMPKSGTKPDAITYLGVLSSCRHA 343

Query: 343 GFVEEGRHYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSL 400
           G V+EGR +FNLM E GLKPEL+HYSC++DLLGR GLL+EA   IE MP+ PNS++WGSL
Sbjct: 344 GLVKEGRKFFNLMAEHGLKPELNHYSCLVDLLGRFGLLQEALELIENMPMKPNSVIWGSL 403

BLAST of CsGy4G013690.1 vs. TAIR10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 239.6 bits (610), Expect = 3.5e-63
Identity = 122/301 (40.53%), Positives = 194/301 (64.45%), Query Frame = 0

Query: 122 NANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRVF 181
           N+  ++++LS+ +S  +L  G Q H  A+++G I +V V ++L+++Y K G +++A R F
Sbjct: 412 NSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAF 471

Query: 182 DEMPV-RNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT--GFV 241
           D +   R+ VSWT++I   A        LELF+ M    L+P+  T+V + +ACT  G V
Sbjct: 472 DLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLV 531

Query: 242 EEGRHYFNLMVELG-LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLS 301
            +GR YF++M ++  + P L HY+C++DL GRAGLL+EAQ FIEKMPI P+ + WGSLLS
Sbjct: 532 NQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLS 591

Query: 302 ACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKTA 361
           ACR+H N+ +G  AAE  LLL+P+ +  +  L NLY+  G  ++AA++RK MKD  +K  
Sbjct: 592 ACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKE 651

Query: 362 PGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMRFVG-------CAHELEDKVN 412
            G+SWIE+++KV+ F  ED ++P   EI+  +  + + ++ +G         H+LE++V 
Sbjct: 652 QGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVK 711

BLAST of CsGy4G013690.1 vs. TAIR10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 235.7 bits (600), Expect = 5.0e-62
Identity = 115/302 (38.08%), Positives = 180/302 (59.60%), Query Frame = 0

Query: 120 KFNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYR 179
           + N +  SS LS C     L  G Q H   ++ G+    +VG++L+ +Y KCG +  A  
Sbjct: 406 RLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEAND 465

Query: 180 VFDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTAC--TGF 239
           +F EM  +++VSW  +IAG++      + L  F+ MKR  L+P++ T V +L+AC  TG 
Sbjct: 466 LFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGL 525

Query: 240 VEEGRHYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLL 299
           V++GR YF  M  + G+ P   HY+C++DLLGRAGLL++A N ++ MP  P++ +WG+LL
Sbjct: 526 VDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLL 585

Query: 300 SACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKT 359
            A R+HGN  +   AA+    ++P+ +  ++ L+NLYA +G   D  +LR  M+DKG+K 
Sbjct: 586 GASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKGVKK 645

Query: 360 APGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMRFVG-------CAHELEDKV 412
            PGYSWIEIQNK + F   D+ +P   EIF  ++ +   M+  G         H++E++ 
Sbjct: 646 VPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKTSVVLHDVEEEE 705

BLAST of CsGy4G013690.1 vs. TAIR10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 234.6 bits (597), Expect = 1.1e-61
Identity = 115/280 (41.07%), Positives = 177/280 (63.21%), Query Frame = 0

Query: 127 SSVLSLCNSQRNLRGGIQYHSVAIRTGF------IANVYVGSSLVSLYGKCGELSNAYRV 186
           +++L  C     L  G+Q H   ++ GF        +++VG+SL+ +Y KCG +   Y V
Sbjct: 390 ANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLV 449

Query: 187 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTAC--TGFV 246
           F +M  R+ VSW A+I GFA     N  LELF+EM     +P+  T + +L+AC   GFV
Sbjct: 450 FRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFV 509

Query: 247 EEGRHYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLS 306
           EEGRHYF+ M  + G+ P  DHY+C++DLLGRAG L+EA++ IE+MP+ P+S++WGSLL+
Sbjct: 510 EEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLA 569

Query: 307 ACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKTA 366
           AC++H N+ +G   AE  L ++P  +  ++ L+N+YA+ G  +D   +RK M+ +G+   
Sbjct: 570 ACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQ 629

Query: 367 PGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMR 398
           PG SWI+IQ   + F  +DKS+P   +I  L+D ++  MR
Sbjct: 630 PGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR 669

BLAST of CsGy4G013690.1 vs. TAIR10
Match: AT4G14850.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 229.9 bits (585), Expect = 2.8e-60
Identity = 118/286 (41.26%), Positives = 180/286 (62.94%), Query Frame = 0

Query: 126 ISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRVFDEMP 185
           ISSVLS C     L  G   H+ A++      ++VGS+LV +YGKCG + ++ + FDEMP
Sbjct: 313 ISSVLSACAGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGCIEDSEQAFDEMP 372

Query: 186 VRNVVSWTAIIAGFAVEWQVNMCLELFQEM--KRMALQPNEFTFVTILTACT--GFVEEG 245
            +N+V+  ++I G+A + QV+M L LF+EM  +     PN  TFV++L+AC+  G VE G
Sbjct: 373 EKNLVTRNSLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLLSACSRAGAVENG 432

Query: 246 RHYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACR 305
              F+ M    G++P  +HYSC++D+LGRAG+++ A  FI+KMPI P   VWG+L +ACR
Sbjct: 433 MKIFDSMRSTYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPTISVWGALQNACR 492

Query: 306 LHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKTAPGY 365
           +HG   +GL AAE+   L P  +  H+ L+N +A AG   +A  +R+ +K  G+K   GY
Sbjct: 493 MHGKPQLGLLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREELKGVGIKKGAGY 552

Query: 366 SWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMRFVGCAHELE 407
           SWI ++N+V+ F+A+D+S+ L  EI   +  + N M   G   +L+
Sbjct: 553 SWITVKNQVHAFQAKDRSHILNKEIQTTLAKLRNEMEAAGYKPDLK 598

BLAST of CsGy4G013690.1 vs. Swiss-Prot
Match: sp|Q9ZUT4|PP192_ARATH (Pentatricopeptide repeat-containing protein At2g37320 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E50 PE=2 SV=1)

HSP 1 Score: 339.3 bits (869), Expect = 5.8e-92
Identity = 190/464 (40.95%), Positives = 257/464 (55.39%), Query Frame = 0

Query: 43  RLLDLISPK-GDVSYESRQTHLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVL 102
           R+LD+IS K G VS  +RQ H   +Q+F QTDS + R Q +     S    LS+  + V 
Sbjct: 44  RVLDIISSKSGGVS--NRQDHFGFVQEFRQTDSWRFRGQAI-----SEDFDLSRTKNGVS 103

Query: 103 DQECESGHWDVQSFAGR--FKFNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVY 162
               E    D  S   R  + F+A  +SS +  C   R+ R G  +H +A++ GFI++VY
Sbjct: 104 SVLEEVMLEDSSSSVKRDGWSFDAYGLSSAVRSCGLNRDFRTGSGFHCLALKGGFISDVY 163

Query: 163 VGSSLVSLYGKCGELSNAYRVFDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMA 222
           +GSSLV LY   GE+ NAY+VF+EMP RNVVSWTA+I+GFA EW+V++CL+L+ +M++  
Sbjct: 164 LGSSLVVLYRDSGEVENAYKVFEEMPERNVVSWTAMISGFAQEWRVDICLKLYSKMRKST 223

Query: 223 LQPNEFTFVTILTACT-------------------------------------------- 282
             PN++TF  +L+ACT                                            
Sbjct: 224 SDPNDYTFTALLSACTGSGALGQGRSVHCQTLHMGLKSYLHISNSLISMYCKCGDLKDAF 283

Query: 283 ------------------------------------------------------------ 342
                                                                       
Sbjct: 284 RIFDQFSNKDVVSWNSMIAGYAQHGLAMQAIELFELMMPKSGTKPDAITYLGVLSSCRHA 343

Query: 343 GFVEEGRHYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSL 400
           G V+EGR +FNLM E GLKPEL+HYSC++DLLGR GLL+EA   IE MP+ PNS++WGSL
Sbjct: 344 GLVKEGRKFFNLMAEHGLKPELNHYSCLVDLLGRFGLLQEALELIENMPMKPNSVIWGSL 403

BLAST of CsGy4G013690.1 vs. Swiss-Prot
Match: sp|Q9SHZ8|PP168_ARATH (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 6.3e-62
Identity = 122/301 (40.53%), Positives = 194/301 (64.45%), Query Frame = 0

Query: 122 NANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRVF 181
           N+  ++++LS+ +S  +L  G Q H  A+++G I +V V ++L+++Y K G +++A R F
Sbjct: 412 NSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAF 471

Query: 182 DEMPV-RNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT--GFV 241
           D +   R+ VSWT++I   A        LELF+ M    L+P+  T+V + +ACT  G V
Sbjct: 472 DLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLV 531

Query: 242 EEGRHYFNLMVELG-LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLS 301
            +GR YF++M ++  + P L HY+C++DL GRAGLL+EAQ FIEKMPI P+ + WGSLLS
Sbjct: 532 NQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLS 591

Query: 302 ACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKTA 361
           ACR+H N+ +G  AAE  LLL+P+ +  +  L NLY+  G  ++AA++RK MKD  +K  
Sbjct: 592 ACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKE 651

Query: 362 PGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMRFVG-------CAHELEDKVN 412
            G+SWIE+++KV+ F  ED ++P   EI+  +  + + ++ +G         H+LE++V 
Sbjct: 652 QGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVK 711

BLAST of CsGy4G013690.1 vs. Swiss-Prot
Match: sp|Q9SY02|PP301_ARATH (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 9.0e-61
Identity = 115/302 (38.08%), Positives = 180/302 (59.60%), Query Frame = 0

Query: 120 KFNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYR 179
           + N +  SS LS C     L  G Q H   ++ G+    +VG++L+ +Y KCG +  A  
Sbjct: 406 RLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEAND 465

Query: 180 VFDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTAC--TGF 239
           +F EM  +++VSW  +IAG++      + L  F+ MKR  L+P++ T V +L+AC  TG 
Sbjct: 466 LFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGL 525

Query: 240 VEEGRHYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLL 299
           V++GR YF  M  + G+ P   HY+C++DLLGRAGLL++A N ++ MP  P++ +WG+LL
Sbjct: 526 VDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLL 585

Query: 300 SACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKT 359
            A R+HGN  +   AA+    ++P+ +  ++ L+NLYA +G   D  +LR  M+DKG+K 
Sbjct: 586 GASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKGVKK 645

Query: 360 APGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMRFVG-------CAHELEDKV 412
            PGYSWIEIQNK + F   D+ +P   EIF  ++ +   M+  G         H++E++ 
Sbjct: 646 VPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKTSVVLHDVEEEE 705

BLAST of CsGy4G013690.1 vs. Swiss-Prot
Match: sp|Q9SIT7|PP151_ARATH (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 2.0e-60
Identity = 115/280 (41.07%), Positives = 177/280 (63.21%), Query Frame = 0

Query: 127 SSVLSLCNSQRNLRGGIQYHSVAIRTGF------IANVYVGSSLVSLYGKCGELSNAYRV 186
           +++L  C     L  G+Q H   ++ GF        +++VG+SL+ +Y KCG +   Y V
Sbjct: 390 ANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLV 449

Query: 187 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTAC--TGFV 246
           F +M  R+ VSW A+I GFA     N  LELF+EM     +P+  T + +L+AC   GFV
Sbjct: 450 FRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFV 509

Query: 247 EEGRHYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLS 306
           EEGRHYF+ M  + G+ P  DHY+C++DLLGRAG L+EA++ IE+MP+ P+S++WGSLL+
Sbjct: 510 EEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLA 569

Query: 307 ACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKTA 366
           AC++H N+ +G   AE  L ++P  +  ++ L+N+YA+ G  +D   +RK M+ +G+   
Sbjct: 570 ACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQ 629

Query: 367 PGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMR 398
           PG SWI+IQ   + F  +DKS+P   +I  L+D ++  MR
Sbjct: 630 PGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR 669

BLAST of CsGy4G013690.1 vs. Swiss-Prot
Match: sp|Q0WSH6|PP312_ARATH (Pentatricopeptide repeat-containing protein At4g14850 OS=Arabidopsis thaliana OX=3702 GN=LOI1 PE=1 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 5.0e-59
Identity = 118/286 (41.26%), Positives = 180/286 (62.94%), Query Frame = 0

Query: 126 ISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRVFDEMP 185
           ISSVLS C     L  G   H+ A++      ++VGS+LV +YGKCG + ++ + FDEMP
Sbjct: 313 ISSVLSACAGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGCIEDSEQAFDEMP 372

Query: 186 VRNVVSWTAIIAGFAVEWQVNMCLELFQEM--KRMALQPNEFTFVTILTACT--GFVEEG 245
            +N+V+  ++I G+A + QV+M L LF+EM  +     PN  TFV++L+AC+  G VE G
Sbjct: 373 EKNLVTRNSLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLLSACSRAGAVENG 432

Query: 246 RHYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACR 305
              F+ M    G++P  +HYSC++D+LGRAG+++ A  FI+KMPI P   VWG+L +ACR
Sbjct: 433 MKIFDSMRSTYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPTISVWGALQNACR 492

Query: 306 LHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKTAPGY 365
           +HG   +GL AAE+   L P  +  H+ L+N +A AG   +A  +R+ +K  G+K   GY
Sbjct: 493 MHGKPQLGLLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREELKGVGIKKGAGY 552

Query: 366 SWIEIQNKVYRFKAEDKSNPLMVEIFGLIDGMVNHMRFVGCAHELE 407
           SWI ++N+V+ F+A+D+S+ L  EI   +  + N M   G   +L+
Sbjct: 553 SWITVKNQVHAFQAKDRSHILNKEIQTTLAKLRNEMEAAGYKPDLK 598

BLAST of CsGy4G013690.1 vs. TrEMBL
Match: tr|A0A0A0KX36|A0A0A0KX36_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G293100 PE=4 SV=1)

HSP 1 Score: 807.7 bits (2085), Expect = 1.2e-230
Identity = 413/517 (79.88%), Positives = 413/517 (79.88%), Query Frame = 0

Query: 1   MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ 60
           MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ
Sbjct: 8   MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ 67

Query: 61  THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK 120
           THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK
Sbjct: 68  THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK 127

Query: 121 FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV 180
           FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV
Sbjct: 128 FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV 187

Query: 181 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT----- 240
           FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT     
Sbjct: 188 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACTGSGAL 247

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 248 GVGRSLHCQTVKMGFHSYLHVANALISMYCKCGALNFALYIFEAMEVKDTVSWNSMIAGY 307

Query: 301 ---------------------------------------GFVEEGRHYFNLMVELGLKPE 360
                                                  GFVEEGRHYFNLMVELGLKPE
Sbjct: 308 AQHGLSLRAIDLFKAMRKQKQVEADAITFLGVLSSCRHAGFVEEGRHYFNLMVELGLKPE 367

Query: 361 LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL 414
           LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL
Sbjct: 368 LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL 427

BLAST of CsGy4G013690.1 vs. TrEMBL
Match: tr|A0A1S3BGK7|A0A1S3BGK7_CUCME (pentatricopeptide repeat-containing protein At2g37320 OS=Cucumis melo OX=3656 GN=LOC103489779 PE=4 SV=1)

HSP 1 Score: 750.0 bits (1935), Expect = 2.9e-213
Identity = 385/515 (74.76%), Positives = 400/515 (77.67%), Query Frame = 0

Query: 1   MLFPISQSSAPYRHGFNVLPSIRFFSNFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQ 60
           MLF ISQSSA YRHGFNV+PS+RFFSNFKSNTHPTTNLPKP RLLDLISPKGDVSYESRQ
Sbjct: 8   MLFAISQSSARYRHGFNVVPSVRFFSNFKSNTHPTTNLPKPLRLLDLISPKGDVSYESRQ 67

Query: 61  THLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFK 120
           THLRLIQDFLQTD  QCRSQTL VG DSRS+GLSKDSSFVLDQECESGHWDVQSFAGRFK
Sbjct: 68  THLRLIQDFLQTDPDQCRSQTLSVGFDSRSVGLSKDSSFVLDQECESGHWDVQSFAGRFK 127

Query: 121 FNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRV 180
           F+ANDISSVLSLCNSQRNLRGG+QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAY++
Sbjct: 128 FSANDISSVLSLCNSQRNLRGGLQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQM 187

Query: 181 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT----- 240
           FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELF+EMKRMALQPNEFTFVTILTACT     
Sbjct: 188 FDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFEEMKRMALQPNEFTFVTILTACTGSGAL 247

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 248 GVGRSLHCQTFKMGFHSYLHIANALISMYCKCGALNFALYIFEAMEVKDTVSWNSMIAGY 307

Query: 301 ---------------------------------------GFVEEGRHYFNLMVELGLKPE 360
                                                  GFVEEGRHYFNLMVELGLKPE
Sbjct: 308 AQHGLSHRAIDLFKAMRKQKQVEADAITFLGVLSSCRHAGFVEEGRHYFNLMVELGLKPE 367

Query: 361 LDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRL 412
           LDHYSCVIDLLGRAGLLKEAQNFIEKMP++PNSI+WGSLLSACRLHGNVWIGLKAAESRL
Sbjct: 368 LDHYSCVIDLLGRAGLLKEAQNFIEKMPMSPNSIIWGSLLSACRLHGNVWIGLKAAESRL 427

BLAST of CsGy4G013690.1 vs. TrEMBL
Match: tr|A0A067K1V0|A0A067K1V0_JATCU (Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_16432 PE=4 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 1.7e-112
Identity = 234/496 (47.18%), Positives = 294/496 (59.27%), Query Frame = 0

Query: 17  NVLPSIRFFSNFK-SNTHPTTNLPKPPRLLDLISPKGDVSYESRQTHLRLIQDFLQTDSG 76
           + L  IR FS+ K  +T PT  L K  R+LD+I+PK       RQ+HLRLIQDFLQT+S 
Sbjct: 16  HTLSHIRSFSDHKLRHTTPTKRLDKALRVLDIITPK--TGARIRQSHLRLIQDFLQTNSN 75

Query: 77  QCRSQTLPVGCDSRSIGLSKDSSFVLDQECESGHWDVQ-SFAGRFKFNANDISSVLSLCN 136
           Q         CD  S    +  S VLD+  ES   + Q S A  F+++A  +S+ +S C 
Sbjct: 76  QDSEPYF--RCDFISCRSVEGISNVLDKIIESSPPNDQVSDASCFRYDAGSLSNAVSWCA 135

Query: 137 SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRVFDEMPVRNVVSWTA 196
           S R+LR GIQYH +AI  GF AN YVGSSL++LY KCGEL NAY+VF EMPVRNVVSWTA
Sbjct: 136 SSRDLRAGIQYHCLAINIGFDANAYVGSSLITLYCKCGELDNAYKVFYEMPVRNVVSWTA 195

Query: 197 IIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACT------------------- 256
           II+GFA EWQ+++CLELF  M+   L PN+FTF ++L+ACT                   
Sbjct: 196 IISGFAQEWQIDVCLELFSAMRNSTLVPNDFTFTSLLSACTGSGALGQGRSAHCQIIQMG 255

Query: 257 ------------------------------------------------------------ 316
                                                                       
Sbjct: 256 LDSHLHIANALISMYCKCGSVQDALFIFDNMYTKDIVSWNSMISGYAQHGLTVQAIELFE 315

Query: 317 ------------------------GFVEEGRHYFNLMVELGLKPELDHYSCVIDLLGRAG 376
                                   GFVE GR YFN MVE G++PELDHYSC++DLLGRAG
Sbjct: 316 KMKKLGTKPDSITFLGVLSSCRHAGFVEAGRGYFNSMVEYGVRPELDHYSCLVDLLGRAG 375

Query: 377 LLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTN 408
           L +EA++ I +MP+ PN+I+WGSLLS+CRLHGNVWIG++AAESRLLL+PDCA+THLQL N
Sbjct: 376 LTEEARDIILRMPLPPNAIIWGSLLSSCRLHGNVWIGIQAAESRLLLEPDCAATHLQLAN 435

BLAST of CsGy4G013690.1 vs. TrEMBL
Match: tr|A0A2P6RAF1|A0A2P6RAF1_ROSCH (Putative tetratricopeptide-like helical domain-containing protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr3g0468071 PE=4 SV=1)

HSP 1 Score: 408.7 bits (1049), Expect = 1.6e-110
Identity = 215/473 (45.45%), Positives = 284/473 (60.04%), Query Frame = 0

Query: 43  RLLDLISPKGDVSYESRQTHLRLIQDFLQTDSGQCRSQTLPVGCDSRSIGLSKDSSFVLD 102
           R+LDLI+PK   +   RQ HLRLIQD L +DS Q  + +   G  S     S   S + D
Sbjct: 452 RVLDLITPKPTSTASRRQGHLRLIQDVLHSDSDQLSNPSSFSGSHS-----SIQFSNLFD 511

Query: 103 QECESGHWDVQSFAGRFKFNANDISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGS 162
           Q  +S   D  S+   F  +A+ IS  +S C S+R+LRGG+Q+H  A+R+GF ANVY+GS
Sbjct: 512 QLFDSSPVDSHSYPESFTIDASVISHAISSCGSKRDLRGGVQHHCAAVRSGFGANVYIGS 571

Query: 163 SLVSLYGKCGELSNAYRVFDEMPVRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQP 222
           SL+ LYGKC  L NAY+VFDEMPVRNVVSWTAII+GFA EWQV++CLELF EM+    +P
Sbjct: 572 SLIHLYGKCSALENAYKVFDEMPVRNVVSWTAIISGFAQEWQVDVCLELFSEMRSSGSKP 631

Query: 223 NEFTFVTILTACT----------------------------------------------- 282
           N+FT+ ++L+ACT                                               
Sbjct: 632 NDFTYASVLSACTGSGALGQGRCAHGQTIRMGFDSYVHIANALMSMYCKCGAVKDALYIF 691

Query: 283 --------------------------------------------------------GFVE 342
                                                                   G V+
Sbjct: 692 ESLDGKDNVSWNSMIAGYAQHGLVLQAIDLFEEMKNRGVKPDAITLLGVLSSCRHAGLVK 751

Query: 343 EGRHYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSA 402
           EG HYFN MV E G++PELDHYSC++DLLGRAGLL EA++FIEKMPI PN+++WGSLLS+
Sbjct: 752 EGWHYFNSMVEEHGIQPELDHYSCIVDLLGRAGLLDEARDFIEKMPIRPNAVIWGSLLSS 811

Query: 403 CRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAGYLDDAARLRKIMKDKGLKTAP 409
           CR+HG VWIG++AAESRLL++P CASTH+QL NLYA  G  D+AAR+RK+MKDKG+KT+P
Sbjct: 812 CRVHGGVWIGIEAAESRLLMEPGCASTHVQLANLYASLGCWDEAARVRKLMKDKGIKTSP 871

BLAST of CsGy4G013690.1 vs. TrEMBL
Match: tr|A0A251NVA1|A0A251NVA1_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G245400 PE=4 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 7.8e-110
Identity = 227/491 (46.23%), Positives = 291/491 (59.27%), Query Frame = 0

Query: 24  FFS--NFKSNTHPTTNLPKPPRLLDLISPKGDVSYESRQTHLRLIQDFLQTDSGQCRSQT 83
           FFS  N   ++  T  L    R+LDLI+PK  ++   RQ HLRLIQDFLQ+DS    + +
Sbjct: 31  FFSSPNLTRSSQETKKLHNALRVLDLITPKPTLTARRRQGHLRLIQDFLQSDSEHFSNAS 90

Query: 84  LPVGCDSRSIGLSKDSSFVLDQECESGHWDVQSFAGRFKFNANDISSVLSLCNSQRNLRG 143
                DS S       S +LD+  +S   D  S A RF  +A+ +S  +S   S RNL G
Sbjct: 91  --AFSDSNS---PIKISTLLDELFDSSSVDSPSCAERFPIDASVLSHAISSYGSSRNLHG 150

Query: 144 GIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRVFDEMPVRNVVSWTAIIAGFAV 203
           GI YH  AIR+G +ANVY+GSSLVS YG+C EL NAYRVF+EMPVRNVVSWTAII+GFA 
Sbjct: 151 GIPYHCAAIRSGLVANVYIGSSLVSFYGRCNELQNAYRVFEEMPVRNVVSWTAIISGFAQ 210

Query: 204 EWQVNMCLELFQEMKRMALQPNEFTFVTILTACT-------------------------- 263
           EWQV+ CL+LF EM R + +PN+FT+ +IL+ACT                          
Sbjct: 211 EWQVDACLQLFSEM-RHSSKPNDFTYASILSACTGSGALGHGRSAHCHTIRMGFDLYIHI 270

Query: 264 ------------------------------------------------------------ 323
                                                                       
Sbjct: 271 ANALISMYCKCGDVKDALCIFKNLDGKDNVSWNSMIAGYAQHGLASQAIDLFEEMKQQCV 330

Query: 324 -----------------GFVEEGRHYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAQ 383
                            G V+EGR YFN M+ E G++PELDHYSCVIDLLGRAG L+EAQ
Sbjct: 331 EPDAITLLGVLSSCRHAGLVQEGRSYFNSMIKEHGIQPELDHYSCVIDLLGRAGCLEEAQ 390

Query: 384 NFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTNLYAKAG 409
            FIEKMPI PN+I+WGSLLS+CR+HG+VWIG++AAESRLLL+P+CASTH+QL NLYA  G
Sbjct: 391 CFIEKMPIRPNAIIWGSLLSSCRVHGSVWIGIEAAESRLLLEPECASTHVQLANLYASVG 450
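
The percentages on each HSP summary line above are the identical or positive match counts divided by the alignment length. As a minimal sketch (the helper name `recompute_hsp_percentages` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool), they can be rechecked from the raw counts like this:

```python
import re

# Hypothetical helper for illustration only; not part of the source page.
# The pattern tolerates the "Postives" spelling that appears in some
# renderings of this report as well as the correct "Positives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


# Summary line of the Cucumis melo hit (A0A1S3BGK7) shown above.
line = "Identity = 385/515 (74.76%), Positives = 400/515 (77.67%), Query Frame = 0"
print(recompute_hsp_percentages(line))
```

For the Cucumis melo hit, the recomputed values agree with the 74.76% and 77.67% printed in the report.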

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004142220.1  1.8e-230  79.88         PREDICTED: pentatricopeptide repeat-containing protein At2g37320 [Cucumis sativu...
XP_008447309.1  4.4e-213  74.76         PREDICTED: pentatricopeptide repeat-containing protein At2g37320 [Cucumis melo]
XP_022977771.1  4.5e-186  67.05         pentatricopeptide repeat-containing protein At2g37320 [Cucurbita maxima]
XP_023544680.1  5.6e-184  66.67         pentatricopeptide repeat-containing protein At2g37320 [Cucurbita pepo subsp. pep...
XP_022949850.1  6.1e-183  66.09         pentatricopeptide repeat-containing protein At2g37320 [Cucurbita moschata]
Match Name   E-value   Identity (%)  Description
AT2G37320.1  3.2e-93   40.95         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22070.1  3.5e-63   40.53         pentatricopeptide (PPR) repeat-containing protein
AT4G02750.1  5.0e-62   38.08         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G13600.1  1.1e-61   41.07         Pentatricopeptide repeat (PPR) superfamily protein
AT4G14850.1  2.8e-60   41.26         Pentatricopeptide repeat (PPR) superfamily protein
Match Name             E-value   Identity (%)  Description
sp|Q9ZUT4|PP192_ARATH  5.8e-92   40.95         Pentatricopeptide repeat-containing protein At2g37320 OS=Arabidopsis thaliana OX...
sp|Q9SHZ8|PP168_ARATH  6.3e-62   40.53         Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
sp|Q9SY02|PP301_ARATH  9.0e-61   38.08         Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
sp|Q9SIT7|PP151_ARATH  2.0e-60   41.07         Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
sp|Q0WSH6|PP312_ARATH  5.0e-59   41.26         Pentatricopeptide repeat-containing protein At4g14850 OS=Arabidopsis thaliana OX...
Match Name                      E-value   Identity (%)  Description
tr|A0A0A0KX36|A0A0A0KX36_CUCSA  1.2e-230  79.88         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G293100 PE=4 SV=1
tr|A0A1S3BGK7|A0A1S3BGK7_CUCME  2.9e-213  74.76         pentatricopeptide repeat-containing protein At2g37320 OS=Cucumis melo OX=3656 GN...
tr|A0A067K1V0|A0A067K1V0_JATCU  1.7e-112  47.18         Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_16432 PE=4 SV=1
tr|A0A2P6RAF1|A0A2P6RAF1_ROSCH  1.6e-110  45.45         Putative tetratricopeptide-like helical domain-containing protein OS=Rosa chinen...
tr|A0A251NVA1|A0A251NVA1_PRUPE  7.8e-110  46.23         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G245400 PE=4 SV=1
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
CsGy4G013690  CsGy4G013690  gene


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
CsGy4G013690.1.CDS.1  CsGy4G013690.1.CDS.1  CDS
CsGy4G013690.1.CDS.2  CsGy4G013690.1.CDS.2  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name    Unique Name             Type
CsGy4G013690.1  CsGy4G013690.1-protein  polypeptide


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 187..234; e-value: 1.0E-12; score: 47.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 259..283; e-value: 0.018; score: 15.1
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 331..354; e-value: 0.087; score: 13.0
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 256..286; score: 6.665
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 157..187; score: 8.079
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 188..222; score: 9.887
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 322..356; score: 7.98
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 111..235; e-value: 1.8E-23; score: 84.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 236..384; e-value: 6.1E-18; score: 67.4
None       No IPR available                                   PANTHER  PTHR24015:SF334   SUBFAMILY NOT NAMED  coord: 99..213
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 128..380
None       No IPR available                                   PANTHER  PTHR24015:SF334   SUBFAMILY NOT NAMED  coord: 128..380
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 99..213