Cucsa.164890 (gene) Cucumber (Gy14) v1

NameCucsa.164890
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionPentatricopeptide (PPR) repeat-containing protein-like protein
Locationscaffold01153 : 240115 .. 242495 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTCACGATTAGATTTGCCTGCGCCGCTGGATACCTTCCCCTGACCAAAGCTCTAGCTTCTTACTCCACGACCACCAACCCATGTTTGTTTCATAAACAGAATGCCAAGTTTCATTAAACTCGATACCCATTACAAAGAGCGCTGAAATTTCAATTTTTCTGATGTTGGTTTTCCATGGAACCTCAACTGGCTTCGATGCCCTCATGCCAAAGGTAACCAATAGCCCTCAAACTTCCTCAATTTCGTTTTAACAATTTCTGAACTCTTTAATCAGACTATTTCCCCCTTTTCCTTATAGATAGATTGCATTTACTATCACAACAAGTTCACATTTACACCTTCCAGCGTCATTTGTGTTCACAACCAAGCTGCACAGCCCCTTACTAGTTTCACCACACCTGAGAGGTGCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTTCTTTTCTCTTTCTGATTTTCCTTTCAACCCCATGATTTTGATTTTGATTTTGATTTTGAATATAGACGTGTTGTTAAGAAGGTTGGGAAGGAGACACACCATTTATGGAAGAAAAGAGATTCTGCTGGCTCTGGGCAAAAGGCTCTTAATCTTGTTAGAATTGTGAGCTGCGTTTTCTTGAATGTCCCTCCAATATTGAGAGGAAAGAGTGATAATGGCAATTAGTTTGAAATTCAAATGGTTTTCTAATATATTGTAGGTTTCCCAATGTCCTAATGAGAAAGAAGCTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACAGAGTTTCCATTGATTGCAGCTGCTAAAGCTTTAAGAATACTGAGGAAGAGAAGTCAATGGAAGCGTGTCATTCAAGTATGAATAACTTTGGTTTTACTCGTTGTCTTTTTGGCCTTTGGCAAATCGGGGTGTTGATTCTAAAGGATGCAGTTGCTGTGTCTTTAAAAATCATCTTTGTCCGGCTTGTGCTTCTATTTCTTTGCTCGTTTTTTGTTCAGCTAATCTACTAGTGCGTTGGATTTTTGTTGTAATGTTGCATTTATTTGATAATTTTTGTGGCTTGTTAAGTTTTTATATTGAAGTTTGTTCTTAAACCTTGGTCCTTTAGACTTCTAAGTATGAATGAACAAGAGATGACACTTGGGATTTGGTTGATACATTAAATTTTATAATTTCTGTTTAAGATTTAGCTTTATGGGCAGCGAACCTATGGTTGTTGGTTCCAGGTTTGTGAATGATGAAACAACCGTTGTCTCTGTATCTTTGAGATAAATGTTTCTCGAACTAGAACACCAAATTACTTAAGGCGATGTAAACCTCAGTGTTGGACTTGCAAGTAACTTGGACTTTGACGTGTGAAAATATTCTCTCTGATTAACAAGAGAAGTTAGAAGAAGAAAAGTTTTCGTCTTTTGTCAAGGTCAATTGCTATAGTCCTTTCTGTGGTGAAGTTTGGGCTTATGAATATTTATTATGGTTTCTGTAAGCAATATTCAATAGTTTCCTTTTGTCCTTCCACAAGAAAATTGGTAACTTGAGACCTTTTCACCTATTGATATCACTTTCACATGTTGAAGTGCCTTGGCTTGATGAGAGTCGGTTTAATAGCTGCTTATTAATTCTCCCACCTAGTCTTTTCATTAATCAATGCTTCTTATTACTCAGGTGGCTAAGTGGATGTTAAGCAAGGGCCAAGGAGCCACAATGGGAACATACGACACCCTTCTACTGGCATTTGATATGGACAAGAGGGTGGACGAAGCCGAATCCTTATGGAACATGATATTGCATACACATACACGTTCCATCTCTAAGCGAGTATTTTCTAGGATGATCTCTTTGTACGAACATCATGACTTGCAAGATAAGATTATTGAGGTATTGGAAATGGGAAGTCCACATGTATTGCTTAACCTCGCTATTTTTCAATTTTCTCTTCATACGTTATCAAATGATCTCATGCTTCAATTTCTTCTCTCTTGGCACAGATATTTGCAGACATGGAAGAATTGGGTGTAAAACCAGATGAAGATACCGTAAGAAGAGTCTGTCGTGCCTTTCAAAAACTAGGTCAAGAAGATAACCGGAAAATGGTCTATAAAAGATACAGCTGCCAATGGAAATACATACACTTCAAGGGTGAGAGGGTTAGAGTGAGAAGAGATGGATGGGATGAAGATTATCAATGATAGATACATTAACATGAACAAGTTCATAAGGTATCTGTCTACTGATCCCCATTAATTATATGCGCTTAAGCACACTGAAACAGAGATTTGCAGTCTATTTTGTCCCTTTGCTCTGAATATTTAGAACAACTACTAACAGAGAAAATGTAAAGATTGTAAGTTCTGAAGAAAAATAGTGTATATGTTAACCAAAAGACGTCA

mRNA sequence

GTTCACGATTAGATTTGCCTGCGCCGCTGGATACCTTCCCCTGACCAAAGCTCTAGCTTCTTACTCCACGACCACCAACCCATGTTTGTTTCATAAACAGAATGCCAAGTTTCATTAAACTCGATACCCATTACAAAGAGCGCTGAAATTTCAATTTTTCTGATGTTGGTTTTCCATGGAACCTCAACTGGCTTCGATGCCCTCATGCCAAAGCGTCATTTGTGTTCACAACCAAGCTGCACAGCCCCTTACTAGTTTCACCACACCTGAGAGACGTGTTGTTAAGAAGGTTGGGAAGGAGACACACCATTTATGGAAGAAAAGAGATTCTGCTGGCTCTGGGCAAAAGGCTCTTAATCTTGTTAGAATTGTTTCCCAATGTCCTAATGAGAAAGAAGCTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACAGAGTTTCCATTGATTGCAGCTGCTAAAGCTTTAAGAATACTGAGGAAGAGAAGTCAATGGAAGCGTGTCATTCAAGTGGCTAAGTGGATGTTAAGCAAGGGCCAAGGAGCCACAATGGGAACATACGACACCCTTCTACTGGCATTTGATATGGACAAGAGGGTGGACGAAGCCGAATCCTTATGGAACATGATATTGCATACACATACACGTTCCATCTCTAAGCGAGTATTTTCTAGGATGATCTCTTTGTACGAACATCATGACTTGCAAGATAAGATTATTGAGATATTTGCAGACATGGAAGAATTGGGTGTAAAACCAGATGAAGATACCGTAAGAAGAGTCTGTCGTGCCTTTCAAAAACTAGGTCAAGAAGATAACCGGAAAATGGTCTATAAAAGATACAGCTGCCAATGGAAATACATACACTTCAAGGGTGAGAGGGTTAGAGTGAGAAGAGATGGATGGGATGAAGATTATCAATGATAGATACATTAACATGAACAAGTTCATAAGGTATCTGTCTACTGATCCCCATTAATTATATGCGCTTAAGCACACTGAAACAGAGATTTGCAGTCTATTTTGTCCCTTTGCTCTGAATATTTAGAACAACTACTAACAGAGAAAATGTAAAGATTGTAAGTTCTGAAGAAAAATAGTGTATATGTTAACCAAAAGACGTCA

Coding sequence (CDS)

ATGGAACCTCAACTGGCTTCGATGCCCTCATGCCAAAGCGTCATTTGTGTTCACAACCAAGCTGCACAGCCCCTTACTAGTTTCACCACACCTGAGAGACGTGTTGTTAAGAAGGTTGGGAAGGAGACACACCATTTATGGAAGAAAAGAGATTCTGCTGGCTCTGGGCAAAAGGCTCTTAATCTTGTTAGAATTGTTTCCCAATGTCCTAATGAGAAAGAAGCTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACAGAGTTTCCATTGATTGCAGCTGCTAAAGCTTTAAGAATACTGAGGAAGAGAAGTCAATGGAAGCGTGTCATTCAAGTGGCTAAGTGGATGTTAAGCAAGGGCCAAGGAGCCACAATGGGAACATACGACACCCTTCTACTGGCATTTGATATGGACAAGAGGGTGGACGAAGCCGAATCCTTATGGAACATGATATTGCATACACATACACGTTCCATCTCTAAGCGAGTATTTTCTAGGATGATCTCTTTGTACGAACATCATGACTTGCAAGATAAGATTATTGAGATATTTGCAGACATGGAAGAATTGGGTGTAAAACCAGATGAAGATACCGTAAGAAGAGTCTGTCGTGCCTTTCAAAAACTAGGTCAAGAAGATAACCGGAAAATGGTCTATAAAAGATACAGCTGCCAATGGAAATACATACACTTCAAGGGTGAGAGGGTTAGAGTGAGAAGAGATGGATGGGATGAAGATTATCAATGA

Protein sequence

MEPQLASMPSCQSVICVHNQAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGERVRVRRDGWDEDYQ*
BLAST of Cucsa.164890 vs. Swiss-Prot
Match: PP322_ARATH (Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidopsis thaliana GN=At4g18975 PE=2 SV=2)

HSP 1 Score: 342.8 bits (878), Expect = 3.4e-93
Identity = 161/213 (75.59%), Postives = 188/213 (88.26%), Query Frame = 1

Query: 50  TPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETE 109
           T   + +KKVGK+ HHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE E
Sbjct: 69  TVNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVE 128

Query: 110 FPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESL 169
           FP+IAAAKAL+ILRKRSQW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESL
Sbjct: 129 FPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESL 188

Query: 170 WNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQ 229
           WNMILHTHTRSI +R+F+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF+
Sbjct: 189 WNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFR 248

Query: 230 KLGQEDNRKMVYKRYSCQWKYIHFKGERVRVRR 263
           +L QE+NRK++ +RY  ++KYI+F GERVRV+R
Sbjct: 249 ELNQEENRKLILRRYLSEYKYIYFNGERVRVKR 281

BLAST of Cucsa.164890 vs. Swiss-Prot
Match: PP332_ARATH (Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana GN=EMB1417 PE=2 SV=1)

HSP 1 Score: 187.6 bits (475), Expect = 1.8e-46
Identity = 92/197 (46.70%), Postives = 125/197 (63.45%), Query Frame = 1

Query: 66  LWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKR 125
           +WK R   G+  KA  ++  +    N KE VYG L+ +IAWE EFPL+   KAL IL   
Sbjct: 45  VWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDE 104

Query: 126 SQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRV 185
            +WK++IQV KWMLSKGQG TMGTY +LL A   D R+DEAE LWN +   H     ++ 
Sbjct: 105 KEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKF 164

Query: 186 FSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMVYKRY- 245
           F++MIS+Y   D+  K+ E+FADMEELGVKP+   V  V + F KL  +D  + + K+Y 
Sbjct: 165 FNKMISIYYKRDMHQKLFEVFADMEELGVKPNVAIVSMVGKVFVKLEMKDKYEKLMKKYP 224

Query: 246 SCQWKYIHFKGERVRVR 262
             QW++ + KG RV+V+
Sbjct: 225 PPQWEFRYIKGRRVKVK 241

BLAST of Cucsa.164890 vs. TrEMBL
Match: A0A0A0KSA5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189910 PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 1.7e-155
Identity = 269/270 (99.63%), Postives = 269/270 (99.63%), Query Frame = 1

Query: 1   MLVFHGTSTGFDALMPKIDCIYYHNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVG 60
           MLVFHGTSTGFDALMPKIDCIYYHNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVG
Sbjct: 1   MLVFHGTSTGFDALMPKIDCIYYHNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVG 60

Query: 61  KETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR 120
           KETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR
Sbjct: 61  KETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR 120

Query: 121 ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS 180
           ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS
Sbjct: 121 ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS 180

Query: 181 ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMV 240
           ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVC AFQKLGQEDNRKMV
Sbjct: 181 ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCCAFQKLGQEDNRKMV 240

Query: 241 YKRYSCQWKYIHFKGERVRVRRDGWDEDYQ 271
           YKRYSCQWKYIHFKGERVRVRRDGWDEDYQ
Sbjct: 241 YKRYSCQWKYIHFKGERVRVRRDGWDEDYQ 270

BLAST of Cucsa.164890 vs. TrEMBL
Match: D7TJV2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g02210 PE=4 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 4.6e-105
Identity = 190/250 (76.00%), Postives = 212/250 (84.80%), Query Frame = 1

Query: 25  NKFTFTP-------SSVICVHNQAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQ 84
           + F+F+P         V C HN       S+   E+ + KKVGK+ HHLW+KRDS GSGQ
Sbjct: 31  SSFSFSPFHKVTSMRHVKCCHNPP-----SYRAVEKEISKKVGKKEHHLWRKRDSIGSGQ 90

Query: 85  KALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKW 144
           KALNLVRIVS+ PNEKEAVYG L+KW AWETEFPLIAAAKALRILRKR+QWKRVIQVAKW
Sbjct: 91  KALNLVRIVSELPNEKEAVYGALDKWTAWETEFPLIAAAKALRILRKRNQWKRVIQVAKW 150

Query: 145 MLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHD 204
           MLSKGQGATMGTYDTLLLAFDMD RVDEAESLWNMILHTHTRSISK++FSRMISLY+HHD
Sbjct: 151 MLSKGQGATMGTYDTLLLAFDMDWRVDEAESLWNMILHTHTRSISKQLFSRMISLYDHHD 210

Query: 205 LQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGER 264
           ++DK+IE+FADMEELGVKPDEDTVRRV  AFQ LGQED +K+V K+Y C+WKYIHF GER
Sbjct: 211 MRDKVIEVFADMEELGVKPDEDTVRRVACAFQTLGQEDKQKLVLKKYQCKWKYIHFNGER 270

Query: 265 VRVRRDGWDE 268
           VRVRRD WDE
Sbjct: 271 VRVRRDAWDE 275

BLAST of Cucsa.164890 vs. TrEMBL
Match: M5WV26_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011078mg PE=4 SV=1)

HSP 1 Score: 381.3 bits (978), Expect = 9.6e-103
Identity = 180/216 (83.33%), Postives = 200/216 (92.59%), Query Frame = 1

Query: 53  RRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPL 112
           R+ +KKVG++ HHLW+KRDSAGSGQKALNLVRIVS  PNEKE VYG L+KW AWETEFPL
Sbjct: 4   RKTIKKVGRKEHHLWQKRDSAGSGQKALNLVRIVSGLPNEKETVYGALDKWTAWETEFPL 63

Query: 113 IAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNM 172
           IAA KALRILRKRSQW RVIQVAKWMLSKGQGATMGTYDTLLLAFDMD+RVDEAESLWNM
Sbjct: 64  IAAVKALRILRKRSQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNM 123

Query: 173 ILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLG 232
           ILHTHTRSISKR+FSRMISLY+HHD Q+KIIE+FADMEELGVKPDEDTVRRV RAF++LG
Sbjct: 124 ILHTHTRSISKRLFSRMISLYDHHDKQNKIIEVFADMEELGVKPDEDTVRRVARAFKELG 183

Query: 233 QEDNRKMVYKRYSCQWKYIHFKGERVRVRRDGWDED 269
           QE+N+ +V +RY C+WKYIHFKGERV+VR + WDED
Sbjct: 184 QEENKTLVLRRYQCKWKYIHFKGERVKVRTNAWDED 219

BLAST of Cucsa.164890 vs. TrEMBL
Match: A0A061GEM0_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_029590 PE=4 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 4.0e-101
Identity = 187/237 (78.90%), Postives = 203/237 (85.65%), Query Frame = 1

Query: 32  SSVICVHNQAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPN 91
           S V C      Q L      E++ VKKVGK  HHLWKKRDSAGSGQKALNLVRI+SQ PN
Sbjct: 40  SYVKCSQKLGEQSLGISEAVEKKPVKKVGKNEHHLWKKRDSAGSGQKALNLVRIISQLPN 99

Query: 92  EKEAVYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYD 151
           EKEAVYG L+KW AWETEFPLIAAAKALRILRKRSQW RVIQVAKWMLSKGQGATMGTYD
Sbjct: 100 EKEAVYGALDKWTAWETEFPLIAAAKALRILRKRSQWLRVIQVAKWMLSKGQGATMGTYD 159

Query: 152 TLLLAFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEE 211
           TLLLAFDMDKRVDEAESLWNMILH HTRSISKR+FSRMISLY+HH++QDKIIE+FADMEE
Sbjct: 160 TLLLAFDMDKRVDEAESLWNMILHIHTRSISKRLFSRMISLYDHHNMQDKIIEVFADMEE 219

Query: 212 LGVKPDEDTVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGERVRVRRDGWDED 269
           L V+PDE+TVR+V RAFQKLGQED +K+V +RY  +WKYIHF GERVRV R   DED
Sbjct: 220 LCVRPDENTVRKVARAFQKLGQEDKQKLVLRRYLSKWKYIHFNGERVRVTRYESDED 276

BLAST of Cucsa.164890 vs. TrEMBL
Match: W9R677_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015617 PE=4 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 1.9e-98
Identity = 181/257 (70.43%), Postives = 207/257 (80.54%), Query Frame = 1

Query: 21  IYYHNKFTFTPSSVICVHNQAAQPLTSFTTPE--------------RRVVKKVGKETHHL 80
           I + +K T   S   C      +PLTS    E              R +VKK GK+ +HL
Sbjct: 60  ISFDDKLTMNYSHHNCSIKGNGEPLTSSKAIEKLQRLCIEFLYMEFRNLVKKTGKKEYHL 119

Query: 81  WKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRS 140
           WKK+DSAGSGQKALNL+RI+S  PNEKE VYG LNKWIAWETEFPLIAAAKALRILRKRS
Sbjct: 120 WKKKDSAGSGQKALNLIRILSVLPNEKEVVYGALNKWIAWETEFPLIAAAKALRILRKRS 179

Query: 141 QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRVF 200
           QWKRVIQVAKWMLSKGQG TMGTYDTLLLAFDMD+RVDEAES WNMILHTH RSISKR+F
Sbjct: 180 QWKRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDQRVDEAESFWNMILHTHKRSISKRLF 239

Query: 201 SRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMVYKRYSC 260
           SRMI+LY+HHD++DKIIE+FADMEEL V+ DEDTVRRV  AFQKLGQE+ +K++ ++Y C
Sbjct: 240 SRMIALYDHHDVKDKIIEVFADMEELSVRLDEDTVRRVAYAFQKLGQEEKKKLLLRKYQC 299

Query: 261 QWKYIHFKGERVRVRRD 264
           +WKY+HFKGER+RVRRD
Sbjct: 300 KWKYVHFKGERIRVRRD 316

BLAST of Cucsa.164890 vs. TAIR10
Match: AT4G18975.1 (AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 342.8 bits (878), Expect = 1.9e-94
Identity = 161/213 (75.59%), Postives = 188/213 (88.26%), Query Frame = 1

Query: 50  TPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETE 109
           T   + +KKVGK+ HHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE E
Sbjct: 69  TVNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVE 128

Query: 110 FPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESL 169
           FP+IAAAKAL+ILRKRSQW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESL
Sbjct: 129 FPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESL 188

Query: 170 WNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQ 229
           WNMILHTHTRSI +R+F+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF+
Sbjct: 189 WNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFR 248

Query: 230 KLGQEDNRKMVYKRYSCQWKYIHFKGERVRVRR 263
           +L QE+NRK++ +RY  ++KYI+F GERVRV+R
Sbjct: 249 ELNQEENRKLILRRYLSEYKYIYFNGERVRVKR 281

BLAST of Cucsa.164890 vs. TAIR10
Match: AT4G21190.1 (AT4G21190.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 187.6 bits (475), Expect = 1.0e-47
Identity = 92/197 (46.70%), Postives = 125/197 (63.45%), Query Frame = 1

Query: 66  LWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKR 125
           +WK R   G+  KA  ++  +    N KE VYG L+ +IAWE EFPL+   KAL IL   
Sbjct: 45  VWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDE 104

Query: 126 SQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRV 185
            +WK++IQV KWMLSKGQG TMGTY +LL A   D R+DEAE LWN +   H     ++ 
Sbjct: 105 KEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKF 164

Query: 186 FSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMVYKRY- 245
           F++MIS+Y   D+  K+ E+FADMEELGVKP+   V  V + F KL  +D  + + K+Y 
Sbjct: 165 FNKMISIYYKRDMHQKLFEVFADMEELGVKPNVAIVSMVGKVFVKLEMKDKYEKLMKKYP 224

Query: 246 SCQWKYIHFKGERVRVR 262
             QW++ + KG RV+V+
Sbjct: 225 PPQWEFRYIKGRRVKVK 241

BLAST of Cucsa.164890 vs. TAIR10
Match: AT1G04590.2 (AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4))

HSP 1 Score: 125.2 bits (313), Expect = 6.3e-29
Identity = 65/168 (38.69%), Postives = 101/168 (60.12%), Query Frame = 1

Query: 82  LVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSK 141
           LV  +    + KEAVYG L+ W+AWE  FP+ +    +  L K  QW R++QV KW+LSK
Sbjct: 149 LVNTLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLEKEHQWHRMVQVIKWILSK 208

Query: 142 GQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLY-EHHDLQD 201
           GQG TMGTY  L+ A DMD+R +EA  +W   +     S+  ++  +M+ +Y  ++ LQ+
Sbjct: 209 GQGNTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPWQLCLQMMRIYFRNNMLQE 268

Query: 202 --KIIEIFADMEELGVK-PDEDTVRRVCRAFQKLGQEDNRKMVYKRYS 246
             K++++F D+E    K PD+  V+ V  A++ LG  D ++ V  +YS
Sbjct: 269 LVKVMKLFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKERVVTKYS 316

BLAST of Cucsa.164890 vs. NCBI nr
Match: gi|778701148|ref|XP_011654973.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis sativus])

HSP 1 Score: 556.6 bits (1433), Expect = 2.4e-155
Identity = 269/270 (99.63%), Postives = 269/270 (99.63%), Query Frame = 1

Query: 1   MLVFHGTSTGFDALMPKIDCIYYHNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVG 60
           MLVFHGTSTGFDALMPKIDCIYYHNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVG
Sbjct: 1   MLVFHGTSTGFDALMPKIDCIYYHNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVG 60

Query: 61  KETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR 120
           KETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR
Sbjct: 61  KETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR 120

Query: 121 ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS 180
           ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS
Sbjct: 121 ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS 180

Query: 181 ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMV 240
           ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVC AFQKLGQEDNRKMV
Sbjct: 181 ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCCAFQKLGQEDNRKMV 240

Query: 241 YKRYSCQWKYIHFKGERVRVRRDGWDEDYQ 271
           YKRYSCQWKYIHFKGERVRVRRDGWDEDYQ
Sbjct: 241 YKRYSCQWKYIHFKGERVRVRRDGWDEDYQ 270

BLAST of Cucsa.164890 vs. NCBI nr
Match: gi|659129885|ref|XP_008464896.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 529.6 bits (1363), Expect = 3.1e-147
Identity = 256/270 (94.81%), Postives = 262/270 (97.04%), Query Frame = 1

Query: 1   MLVFHGTSTGFDALMPKIDCIYYHNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVG 60
           MLV HG+STGFDAL+PKIDCIYYHNK  F P+SVICVHNQAAQP TSFTTPERRVVKKVG
Sbjct: 1   MLVLHGSSTGFDALVPKIDCIYYHNKCAFRPASVICVHNQAAQPHTSFTTPERRVVKKVG 60

Query: 61  KETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR 120
           KE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR
Sbjct: 61  KEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALR 120

Query: 121 ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS 180
           ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS
Sbjct: 121 ILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRS 180

Query: 181 ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMV 240
           ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRR+ RAFQKLGQE+NRKMV
Sbjct: 181 ISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRIGRAFQKLGQEENRKMV 240

Query: 241 YKRYSCQWKYIHFKGERVRVRRDGWDEDYQ 271
           YKRYSCQWKYIHFKGERVRVR+DGWDED Q
Sbjct: 241 YKRYSCQWKYIHFKGERVRVRKDGWDEDDQ 270

BLAST of Cucsa.164890 vs. NCBI nr
Match: gi|659129887|ref|XP_008464897.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 438.7 bits (1127), Expect = 7.3e-120
Identity = 214/226 (94.69%), Postives = 219/226 (96.90%), Query Frame = 1

Query: 45  LTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWI 104
           L S  + +RRVVKKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWI
Sbjct: 5   LGSMPSCQRRVVKKVGKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWI 64

Query: 105 AWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVD 164
           AWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVD
Sbjct: 65  AWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVD 124

Query: 165 EAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRV 224
           EAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRR+
Sbjct: 125 EAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRI 184

Query: 225 CRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGERVRVRRDGWDEDYQ 271
            RAFQKLGQE+NRKMVYKRYSCQWKYIHFKGERVRVR+DGWDED Q
Sbjct: 185 GRAFQKLGQEENRKMVYKRYSCQWKYIHFKGERVRVRKDGWDEDDQ 230

BLAST of Cucsa.164890 vs. NCBI nr
Match: gi|694362787|ref|XP_009360801.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Pyrus x bretschneideri])

HSP 1 Score: 392.9 bits (1008), Expect = 4.6e-106
Identity = 188/248 (75.81%), Postives = 214/248 (86.29%), Query Frame = 1

Query: 22  YYHNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALN 81
           Y H       S+V   H Q+ QPL S  T E++++KK GK+ HHLW+K+DSAGSGQKALN
Sbjct: 47  YVHQTVAIHTSNVKFSHKQSRQPLPSSKTTEKKIIKKAGKKEHHLWQKKDSAGSGQKALN 106

Query: 82  LVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSK 141
           LVRIVS  PNEKE ++G L+KW AWETEFPLIAAAKAL ILRKRSQW RVIQVAKWMLSK
Sbjct: 107 LVRIVSSLPNEKETMFGALDKWTAWETEFPLIAAAKALIILRKRSQWVRVIQVAKWMLSK 166

Query: 142 GQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDK 201
           GQGATMGTYDTLLLAFDMD+RVDEAESLWNMILHTHTRSIS+R+FSRMISLY HHD+Q K
Sbjct: 167 GQGATMGTYDTLLLAFDMDQRVDEAESLWNMILHTHTRSISRRLFSRMISLYHHHDMQSK 226

Query: 202 IIEIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGERVRVR 261
           IIE+FADMEELGV+PDEDTVRRV RAFQ+LGQE+N+K+  +RY C+WKYIHFKGERV+VR
Sbjct: 227 IIEVFADMEELGVRPDEDTVRRVARAFQELGQEENKKLFLRRYQCKWKYIHFKGERVKVR 286

Query: 262 R-DGWDED 269
           R + WDED
Sbjct: 287 RTNAWDED 294

BLAST of Cucsa.164890 vs. NCBI nr
Match: gi|645237846|ref|XP_008225398.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Prunus mume])

HSP 1 Score: 392.5 bits (1007), Expect = 6.0e-106
Identity = 187/245 (76.33%), Postives = 212/245 (86.53%), Query Frame = 1

Query: 24  HNKFTFTPSSVICVHNQAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALNLV 83
           H   +    +V C   Q+ QPLTS    E++ +KKVG++ HHLW+KRDSAGSGQKA+NLV
Sbjct: 49  HQTVSINAFNVKCSFKQSRQPLTSPKAIEKKTIKKVGRKEHHLWQKRDSAGSGQKAINLV 108

Query: 84  RIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQ 143
           RIVS  PNEKE VYG L+KW AWE EFPLIAA KALRILRKRSQW RVIQVAKWMLSKGQ
Sbjct: 109 RIVSGLPNEKETVYGALDKWTAWEAEFPLIAAVKALRILRKRSQWVRVIQVAKWMLSKGQ 168

Query: 144 GATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKII 203
           GATMGTYDTLLLAFDMD+RVDEAESLWNMILHTHTRSISKR+FSRMISLY+HHD Q+KII
Sbjct: 169 GATMGTYDTLLLAFDMDQRVDEAESLWNMILHTHTRSISKRLFSRMISLYDHHDKQNKII 228

Query: 204 EIFADMEELGVKPDEDTVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGERVRVRRD 263
           E+FADMEELGVKPDEDTVRRV RAF++LGQE+N+ +V +RY C+WKYIHFKGERV+VR +
Sbjct: 229 EVFADMEELGVKPDEDTVRRVARAFKELGQEENKTLVLRRYQCKWKYIHFKGERVKVRTN 288

Query: 264 GWDED 269
            WDED
Sbjct: 289 AWDED 293

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP322_ARATH3.4e-9375.59Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidop... [more]
PP332_ARATH1.8e-4646.70Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0KSA5_CUCSA1.7e-15599.63Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189910 PE=4 SV=1[more]
D7TJV2_VITVI4.6e-10576.00Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g02210 PE=4 SV=... [more]
M5WV26_PRUPE9.6e-10383.33Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011078mg PE=4 SV=1[more]
A0A061GEM0_THECC4.0e-10178.90Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_029590 PE... [more]
W9R677_9ROSA1.9e-9870.43Uncharacterized protein OS=Morus notabilis GN=L484_015617 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G18975.11.9e-9475.59 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G21190.11.0e-4746.70 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G04590.26.3e-2938.69 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat... [more]
Match NameE-valueIdentityDescription
gi|778701148|ref|XP_011654973.1|2.4e-15599.63PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic ... [more]
gi|659129885|ref|XP_008464896.1|3.1e-14794.81PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic ... [more]
gi|659129887|ref|XP_008464897.1|7.3e-12094.69PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic ... [more]
gi|694362787|ref|XP_009360801.1|4.6e-10675.81PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-... [more]
gi|645237846|ref|XP_008225398.1|6.0e-10676.33PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.164890.2Cucsa.164890.2mRNA
Cucsa.164890.1Cucsa.164890.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 162..196
score: 8.627coord: 126..160
score:
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 9..217
score: 3.7E
NoneNo IPR availablePANTHERPTHR24015:SF314SUBFAMILY NOT NAMEDcoord: 9..217
score: 3.7E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cucsa.164890Silver-seed gourdcarcgyB0746
Cucsa.164890Cucumber (Chinese Long) v3cgycucB262
Cucsa.164890Watermelon (97103) v2cgywmbB294
Cucsa.164890Wax gourdcgywgoB339
Cucsa.164890Wax gourdcgywgoB341
Cucsa.164890Cucumber (Gy14) v1cgycgyB015
Cucsa.164890Cucumber (Gy14) v1cgycgyB084
Cucsa.164890Cucumber (Gy14) v1cgycgyB105
Cucsa.164890Cucurbita maxima (Rimu)cgycmaB0467
Cucsa.164890Cucurbita maxima (Rimu)cgycmaB0472
Cucsa.164890Cucurbita moschata (Rifu)cgycmoB0468
Cucsa.164890Cucurbita moschata (Rifu)cgycmoB0473
Cucsa.164890Cucurbita moschata (Rifu)cgycmoB0474
Cucsa.164890Cucurbita moschata (Rifu)cgycmoB0475
Cucsa.164890Wild cucumber (PI 183967)cgycpiB254
Cucsa.164890Wild cucumber (PI 183967)cgycpiB255
Cucsa.164890Wild cucumber (PI 183967)cgycpiB261
Cucsa.164890Cucumber (Chinese Long) v2cgycuB247
Cucsa.164890Cucumber (Chinese Long) v2cgycuB248
Cucsa.164890Cucumber (Chinese Long) v2cgycuB254
Cucsa.164890Melon (DHL92) v3.5.1cgymeB290
Cucsa.164890Melon (DHL92) v3.5.1cgymeB292
Cucsa.164890Melon (DHL92) v3.5.1cgymeB293
Cucsa.164890Melon (DHL92) v3.5.1cgymeB294
Cucsa.164890Melon (DHL92) v3.5.1cgymeB295
Cucsa.164890Watermelon (Charleston Gray)cgywcgB288
Cucsa.164890Watermelon (Charleston Gray)cgywcgB293
Cucsa.164890Watermelon (97103) v1cgywmB303
Cucsa.164890Watermelon (97103) v1cgywmB309
Cucsa.164890Watermelon (97103) v1cgywmB312
Cucsa.164890Cucurbita pepo (Zucchini)cgycpeB0458
Cucsa.164890Melon (DHL92) v3.6.1cgymedB285
Cucsa.164890Melon (DHL92) v3.6.1cgymedB288