CsGy3G016120.1 (mRNA) Cucumber (Gy14) v2

Name: CsGy3G016120.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Pentatricopeptide repeat-containing protein
Location: Chr3 : 11971832 .. 11972989 (-)
Sequence length: 453
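
The location and length fields can be cross-checked with a little arithmetic. The sketch below is illustrative only: `parse_location` is a hypothetical helper, the coordinates are assumed to be 1-based and inclusive, and the intron figure assumes the genomic span covers exactly the two exons listed for this mRNA plus the sequence between them.

```python
import re

# Location string copied from the record above.
location = "Chr3 : 11971832 .. 11972989 (-)"

def parse_location(text):
    """Hypothetical helper: split 'Chr : start .. end (strand)' into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location(location)
span = end - start + 1          # assumes 1-based, inclusive coordinates
mrna_length = 453               # "Sequence length" field of the record
intronic = span - mrna_length   # bases in the span that are not in the mRNA

print(chrom, strand, span, intronic)
```

Under these assumptions the genomic span is 1158 bp, of which 705 bp are not retained in the 453 nt mRNA.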
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGATGCACACCAGGTGTTCGACGAAATGGGTAATCGTAATTTCTCTGCATTTGCTTGGAATTCTCTTATTTCTGGATACGCGGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACAATTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCAAATCGGAGAGGCGGTGCATCGGCATGTAGTTCGTTCTGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTTTAGTTGACATGTATTCCAAATGTGGTTGCATTGTGAGGGCTAGGAAAGTGTTTGATCAGATTGAGTATAAGGATATAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCACTTTGAGGCATTAGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCCGATTCGGTTGCTTTGTCCACCCTACTTTCTAACATTTCGTCAATGAAATTCAAGTTACATATTCATGGATGGGTAATTCGACATGGAGTCGAATGGAATTTGTCCATTGCTAACTCCTTGATAGTCATGTATGCCAAATGTGGTAAGCTTAACAGAGCAAAATGGCTGTTCCAGCAAATGCCTCAAAAGGACATGGTCTCATGGAACTCCATAATCTCTGCTCATTTCAATAGCGCAGAAGCTTTGACATATTTCGAAGTGATGGAAAGCCTTGGTGTTTCGCCAGACGGTGTAACATTTGTGTCATTGTTATCAACTTGTGCTCATCTGGGGTTGGTGAAGGAAGGGGGGAAATTGTATTTTTTGATGAAGGGGAAGTACGGAATAAGACCAACAATTGAACATTATGCTTGTATGGTGAATCTTTACGGGAGAGCAGGGATGATTGAAGAAGCTTATAAAATCATAACGAAAGGGATGGAGATCGAGGCAGGTCCGACCATATGGGGGGCGTTGTTGTATGCGTGTTATCTCCATAGCGATGTAGATATCGCTGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCAGATAATGAGCTCAATTTTGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCGGAGGACGAGAAGCGAGTGAAATTAATGATGGCAGAACGAGGACTGAATTCATAG

mRNA sequence

ATGGAGGATGCACACCAGGTGTTCGACGAAATGGGTAATCGTAATTTCTCTGCATTTGCTTGGAATTCTCTTATTTCTGGATACGCGGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACAATTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCAAATCGGAGAGGCGGTGCATCGGCATGTAGTTCGTTCTGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTTTAGTTGACATCGATGTAGATATCGCTGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCAGATAATGAGCTCAATTTTGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCGGAGGACGAGAAGCGAGTGAAATTAATGATGGCAGAACGAGGACTGAATTCATAG

Coding sequence (CDS)

ATGGAGGATGCACACCAGGTGTTCGACGAAATGGGTAATCGTAATTTCTCTGCATTTGCTTGGAATTCTCTTATTTCTGGATACGCGGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACAATTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCAAATCGGAGAGGCGGTGCATCGGCATGTAGTTCGTTCTGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTTTAGTTGACATCGATGTAGATATCGCTGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCAGATAATGAGCTCAATTTTGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCGGAGGACGAGAAGCGAGTGAAATTAATGATGGCAGAACGAGGACTGAATTCATAG

Protein sequence

MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS
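
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The following is a minimal sketch, not the pipeline used to produce this record: `translate` is a hypothetical helper, and only the first 42 and last 21 bases of the CDS above are used to keep the example short. The 453 nt CDS corresponds to 150 codons plus one stop codon.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 42 and last 21 bases of the CDS shown above (both in frame).
cds_start = "ATGGAGGATGCACACCAGGTGTTCGACGAAATGGGTAATCGT"
cds_end = "GAACGAGGACTGAATTCATAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first 14 residues (MEDAHQVFDEMGNR) and the last 6 residues (ERGLNS) of the protein sequence above.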
BLAST of CsGy3G016120.1 vs. NCBI nr
Match: KGN57280.1 (hypothetical protein Csa_3G176270 [Cucumis sativus])

HSP 1 Score: 204.5 bits (519), Expect = 2.5e-49
Identity = 149/385 (38.70%), Positives = 150/385 (38.96%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL
Sbjct: 160 MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 219

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDI----------------------- 120
           KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVD+                       
Sbjct: 220 KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIV 279

Query: 121 ------------------------------------------------------------ 151
                                                                       
Sbjct: 280 SWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRH 339
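
The percentages in each HSP header are counts divided by the alignment length, gap columns included. The sketch below parses the two header lines of the hit above and recomputes them; `parse_hsp` and the dictionary keys are hypothetical names chosen for illustration, and the regular expressions assume the line layout used on this page.

```python
import re

# Header lines of the first HSP above.
hsp_header = "HSP 1 Score: 204.5 bits (519), Expect = 2.5e-49"
hsp_stats = "Identity = 149/385 (38.70%), Positives = 150/385 (38.96%), Query Frame = 0"

FRACTION = re.compile(r"=\s*(\d+)/(\d+)")

def parse_hsp(header, stats):
    """Hypothetical helper: extract scores and recompute the percentages."""
    m = re.search(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)", header)
    if m is None:
        raise ValueError("unrecognised HSP header")
    fractions = FRACTION.findall(stats)
    (identical, align_len), (positive, _) = [(int(a), int(b)) for a, b in fractions[:2]]
    return {
        "bit_score": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
        "align_len": align_len,
        "identity_pct": round(100 * identical / align_len, 2),
        "positives_pct": round(100 * positive / align_len, 2),
    }

stats = parse_hsp(hsp_header, hsp_stats)
print(stats)
```

One reading of the low identity for this hit is that the 385-column alignment is much longer than the 150-residue query, so most columns are gaps; the ungapped rows shown above match the subject almost exactly.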

BLAST of CsGy3G016120.1 vs. NCBI nr
Match: XP_011652769.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucumis sativus])

HSP 1 Score: 204.5 bits (519), Expect = 2.5e-49
Identity = 149/385 (38.70%), Positives = 150/385 (38.96%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL
Sbjct: 409 MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 468

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDI----------------------- 120
           KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVD+                       
Sbjct: 469 KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIV 528

Query: 121 ------------------------------------------------------------ 151
                                                                       
Sbjct: 529 SWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRH 588

BLAST of CsGy3G016120.1 vs. NCBI nr
Match: XP_016898932.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucumis melo])

HSP 1 Score: 197.2 bits (500), Expect = 4.0e-47
Identity = 145/385 (37.66%), Positives = 149/385 (38.70%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           MEDAHQVFDEMG RNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTFPRVL
Sbjct: 145 MEDAHQVFDEMGKRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVL 204

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDI----------------------- 120
           KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVD+                       
Sbjct: 205 KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIVYKDIV 264

Query: 121 ------------------------------------------------------------ 151
                                                                       
Sbjct: 265 SWNSMLTGYTRHGLHFEALDIFDQMIQEGYKPDSVALSTLLSNILSLKFKLHIHGWVIRH 324

BLAST of CsGy3G016120.1 vs. NCBI nr
Match: XP_022140840.1 (pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Momordica charantia])

HSP 1 Score: 186.8 bits (473), Expect = 5.3e-44
Identity = 92/108 (85.19%), Positives = 96/108 (88.89%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           ME AHQVFDEM  RN SAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTFPRVL
Sbjct: 140 MESAHQVFDEMCKRNVSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVL 199

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER 109
           KACGGIGSIQIGEAVHRH+VRSGFAGDVFVLNALVD+     +I   R
Sbjct: 200 KACGGIGSIQIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRAR 247


HSP 2 Score: 101.7 bits (252), Expect = 2.3e-18
Identity = 69/157 (43.95%), Positives = 93/157 (59.24%), Query Frame = 0

Query: 3   DAHQVFDEMGNRNF--SAFAWNSLISGYAELGLYEDALALYFQME-EEGVEPDNFTFPRV 62
           +A + F++M +         + SL+S  A LGL ++   L+  M+ + G+ P    +  +
Sbjct: 371 EALEYFEQMESHGVLPDTVTFVSLLSTCAHLGLVKEGERLHSVMKGKYGIRPTMEHYACM 430

Query: 63  LKACGGIGSIQIGEAVHRHVVRS-------GFAGDVFVLNALVDIDVDIAEIAAERLFEL 122
           +   G  G I   E  +R ++R           G +     L +  VDIAEIAAE+LFEL
Sbjct: 431 VNLYGRAGLI---EEAYRIIIRGMELEAGPTVWGALLYACFLRNGSVDIAEIAAEKLFEL 490

Query: 123 EPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLN 150
           EPDNELNFELLMKIYGNAGRSEDEKRV+LMM ERGL+
Sbjct: 491 EPDNELNFELLMKIYGNAGRSEDEKRVRLMMKERGLD 524

BLAST of CsGy3G016120.1 vs. NCBI nr
Match: XP_022942651.1 (pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita moschata])

HSP 1 Score: 186.4 bits (472), Expect = 7.0e-44
Identity = 89/108 (82.41%), Positives = 97/108 (89.81%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           MEDAHQVFDEM  RN SAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTFPRVL
Sbjct: 145 MEDAHQVFDEMCQRNLSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVL 204

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER 109
           KACGGIGSI++GEAVHRH+VRSGFAGD+FVLNALVD+     +I   R
Sbjct: 205 KACGGIGSIRVGEAVHRHIVRSGFAGDIFVLNALVDMYAKCGDIMRAR 252


HSP 2 Score: 95.1 bits (235), Expect = 2.1e-16
Identity = 64/135 (47.41%), Positives = 86/135 (63.70%), Query Frame = 0

Query: 23  SLISGYAELGLYEDALALYFQME-EEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVR 82
           SL+S  A L L ++   LY +M+ + G+ P    +  ++   G  G I   E  +R ++ 
Sbjct: 398 SLLSTCAHLSLVKEGGKLYTEMKGKYGIRPTIEHYACMVNLYGRAGLI---EEAYR-IIT 457

Query: 83  SGFAGDV--FVLNALVDI-----DVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSE 142
           +G   +    V  AL+       +VDIAE+AAE+LFE EPDNELNF+LLMKIYGNAGR E
Sbjct: 458 NGMEVEAGPTVWGALLYACYLHGNVDIAEVAAEKLFESEPDNELNFKLLMKIYGNAGRIE 517

Query: 143 DEKRVKLMMAERGLN 150
           DEKRV+LMMAERGL+
Sbjct: 518 DEKRVRLMMAERGLD 528

BLAST of CsGy3G016120.1 vs. TAIR10
Match: AT4G25270.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 155.2 bits (391), Expect = 3.1e-38
Identity = 79/149 (53.02%), Positives = 106/149 (71.14%), Query Frame = 0

Query: 2   EDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLK 61
           E AH+VFD M  R+ S FAWNSLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLK
Sbjct: 144 EVAHEVFDRMSKRDSSPFAWNSLISGYAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLK 203

Query: 62  ACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIA-AERLFELEPDNE-LN 121
           ACGGIGS+QIGEA+HR +V+ GF  DV+VLNALV +     +I  A  +F++ P  + ++
Sbjct: 204 ACGGIGSVQIGEAIHRDLVKEGFGYDVYVLNALVVMYAKCGDIVKARNVFDMIPHKDYVS 263

Query: 122 FELLMKIYGNAGRSEDEKRVKLMMAERGL 149
           +  ++  Y + G   +   +  +M + G+
Sbjct: 264 WNSMLTGYLHHGLLHEALDIFRLMVQNGI 292


HSP 2 Score: 80.9 bits (198), Expect = 7.5e-16
Identity = 56/151 (37.09%), Positives = 82/151 (54.30%), Query Frame = 0

Query: 8   FDEM--GNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEE-GVEPDNFTFPRVLKACG 67
           F++M   N       + S++S  A  G+ ED   L+  M +E G++P    +  ++   G
Sbjct: 379 FEQMHRANAKPDGITFVSVLSLCANTGMVEDGERLFSLMSKEYGIDPKMEHYACMVNLYG 438

Query: 68  GIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDI-----DVDIAEIAAERLFELEPDNELN 127
             G ++  EA    V   G      V  AL+       + DI E+AA+RLFELEPDNE N
Sbjct: 439 RAGMME--EAYSMIVQEMGLEAGPTVWGALLYACYLHGNTDIGEVAAQRLFELEPDNEHN 498

Query: 128 FELLMKIYGNAGRSEDEKRVKLMMAERGLNS 151
           FELL++IY  A R+ED +RV+ MM +RGL +
Sbjct: 499 FELLIRIYSKAKRAEDVERVRQMMVDRGLET 527

BLAST of CsGy3G016120.1 vs. TAIR10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 94.7 bits (234), Expect = 5.0e-20
Identity = 57/150 (38.00%), Positives = 90/150 (60.00%), Query Frame = 0

Query: 3   DAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQM-EEEGVEPDNFTFPRVLK 62
           DA  VF +M  RN   F+WN L+ GYA+ G +++A+ LY +M    GV+PD +TFP VL+
Sbjct: 147 DAWYVFGKMSERNL--FSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVLR 206

Query: 63  ACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER-LFELEPDNE-LN 122
            CGGI  +  G+ VH HVVR G+  D+ V+NAL+ + V   ++ + R LF+  P  + ++
Sbjct: 207 TCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIIS 266

Query: 123 FELLMKIYGNAGRSEDEKRVKLMMAERGLN 150
           +  ++  Y   G   +   ++L  A RGL+
Sbjct: 267 WNAMISGYFENGMCHE--GLELFFAMRGLS 292

BLAST of CsGy3G016120.1 vs. TAIR10
Match: AT4G35130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 92.8 bits (229), Expect = 1.9e-19
Identity = 47/116 (40.52%), Positives = 73/116 (62.93%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           MEDA Q+FDEM   +  AF WN +I G+   GLY +A+  Y +M   GV+ D FT+P V+
Sbjct: 80  MEDALQLFDEMNKAD--AFLWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVI 139

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDI-AEIAAERLFELEPD 116
           K+  GI S++ G+ +H  V++ GF  DV+V N+L+ + + +     AE++FE  P+
Sbjct: 140 KSVAGISSLEEGKKIHAMVIKLGFVSDVYVCNSLISLYMKLGCAWDAEKVFEEMPE 193

BLAST of CsGy3G016120.1 vs. TAIR10
Match: AT3G12770.1 (mitochondrial editing factor 22)

HSP 1 Score: 89.0 bits (219), Expect = 2.8e-18
Identity = 43/105 (40.95%), Positives = 61/105 (58.10%), Query Frame = 0

Query: 4   AHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKAC 63
           A QVFD++       F WN++I GY+    ++DAL +Y  M+   V PD+FTFP +LKAC
Sbjct: 72  ARQVFDDLPRPQI--FPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKAC 131

Query: 64  GGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER 109
            G+  +Q+G  VH  V R GF  DVFV N L+ +      + + R
Sbjct: 132 SGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSAR 174

BLAST of CsGy3G016120.1 vs. TAIR10
Match: AT3G16610.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 85.9 bits (211), Expect = 2.3e-17
Identity = 46/119 (38.66%), Positives = 68/119 (57.14%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           +E A  VFDE+ +   +  AW+ +I  YA     E AL LY++M   GV P  +T+P VL
Sbjct: 51  VELARHVFDEIPHPRINPIAWDLMIRAYASNDFAEKALDLYYKMLNSGVRPTKYTYPFVL 110

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEI-AAERLFELEPDNEL 119
           KAC G+ +I  G+ +H HV  S FA D++V  ALVD      E+  A ++F+  P  ++
Sbjct: 111 KACAGLRAIDDGKLIHSHVNCSDFATDMYVCTALVDFYAKCGELEMAIKVFDEMPKRDM 169

BLAST of CsGy3G016120.1 vs. Swiss-Prot
Match: sp|Q9SB36|PP337_ARATH (Pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E53 PE=3 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 5.6e-37
Identity = 79/149 (53.02%), Positives = 106/149 (71.14%), Query Frame = 0

Query: 2   EDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLK 61
           E AH+VFD M  R+ S FAWNSLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLK
Sbjct: 144 EVAHEVFDRMSKRDSSPFAWNSLISGYAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLK 203

Query: 62  ACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIA-AERLFELEPDNE-LN 121
           ACGGIGS+QIGEA+HR +V+ GF  DV+VLNALV +     +I  A  +F++ P  + ++
Sbjct: 204 ACGGIGSVQIGEAIHRDLVKEGFGYDVYVLNALVVMYAKCGDIVKARNVFDMIPHKDYVS 263

Query: 122 FELLMKIYGNAGRSEDEKRVKLMMAERGL 149
           +  ++  Y + G   +   +  +M + G+
Sbjct: 264 WNSMLTGYLHHGLLHEALDIFRLMVQNGI 292


HSP 2 Score: 80.9 bits (198), Expect = 1.4e-14
Identity = 56/151 (37.09%), Positives = 82/151 (54.30%), Query Frame = 0

Query: 8   FDEM--GNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEE-GVEPDNFTFPRVLKACG 67
           F++M   N       + S++S  A  G+ ED   L+  M +E G++P    +  ++   G
Sbjct: 379 FEQMHRANAKPDGITFVSVLSLCANTGMVEDGERLFSLMSKEYGIDPKMEHYACMVNLYG 438

Query: 68  GIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDI-----DVDIAEIAAERLFELEPDNELN 127
             G ++  EA    V   G      V  AL+       + DI E+AA+RLFELEPDNE N
Sbjct: 439 RAGMME--EAYSMIVQEMGLEAGPTVWGALLYACYLHGNTDIGEVAAQRLFELEPDNEHN 498

Query: 128 FELLMKIYGNAGRSEDEKRVKLMMAERGLNS 151
           FELL++IY  A R+ED +RV+ MM +RGL +
Sbjct: 499 FELLIRIYSKAKRAEDVERVRQMMVDRGLET 527

BLAST of CsGy3G016120.1 vs. Swiss-Prot
Match: sp|Q9M9E2|PPR45_ARATH (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 9.0e-19
Identity = 57/150 (38.00%), Positives = 90/150 (60.00%), Query Frame = 0

Query: 3   DAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQM-EEEGVEPDNFTFPRVLK 62
           DA  VF +M  RN   F+WN L+ GYA+ G +++A+ LY +M    GV+PD +TFP VL+
Sbjct: 147 DAWYVFGKMSERNL--FSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVLR 206

Query: 63  ACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER-LFELEPDNE-LN 122
            CGGI  +  G+ VH HVVR G+  D+ V+NAL+ + V   ++ + R LF+  P  + ++
Sbjct: 207 TCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIIS 266

Query: 123 FELLMKIYGNAGRSEDEKRVKLMMAERGLN 150
           +  ++  Y   G   +   ++L  A RGL+
Sbjct: 267 WNAMISGYFENGMCHE--GLELFFAMRGLS 292

BLAST of CsGy3G016120.1 vs. Swiss-Prot
Match: sp|O49619|PP350_ARATH (Pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H27 PE=3 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 3.4e-18
Identity = 47/116 (40.52%), Positives = 73/116 (62.93%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           MEDA Q+FDEM   +  AF WN +I G+   GLY +A+  Y +M   GV+ D FT+P V+
Sbjct: 80  MEDALQLFDEMNKAD--AFLWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVI 139

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDI-AEIAAERLFELEPD 116
           K+  GI S++ G+ +H  V++ GF  DV+V N+L+ + + +     AE++FE  P+
Sbjct: 140 KSVAGISSLEEGKKIHAMVIKLGFVSDVYVCNSLISLYMKLGCAWDAEKVFEEMPE 193

BLAST of CsGy3G016120.1 vs. Swiss-Prot
Match: sp|Q9LTV8|PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 5.0e-17
Identity = 43/105 (40.95%), Positives = 61/105 (58.10%), Query Frame = 0

Query: 4   AHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKAC 63
           A QVFD++       F WN++I GY+    ++DAL +Y  M+   V PD+FTFP +LKAC
Sbjct: 72  ARQVFDDLPRPQI--FPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKAC 131

Query: 64  GGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER 109
            G+  +Q+G  VH  V R GF  DVFV N L+ +      + + R
Sbjct: 132 SGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSAR 174

BLAST of CsGy3G016120.1 vs. Swiss-Prot
Match: sp|Q9LUS3|PP237_ARATH (Pentatricopeptide repeat-containing protein At3g16610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E91 PE=2 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 4.2e-16
Identity = 46/119 (38.66%), Positives = 68/119 (57.14%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           +E A  VFDE+ +   +  AW+ +I  YA     E AL LY++M   GV P  +T+P VL
Sbjct: 51  VELARHVFDEIPHPRINPIAWDLMIRAYASNDFAEKALDLYYKMLNSGVRPTKYTYPFVL 110

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEI-AAERLFELEPDNEL 119
           KAC G+ +I  G+ +H HV  S FA D++V  ALVD      E+  A ++F+  P  ++
Sbjct: 111 KACAGLRAIDDGKLIHSHVNCSDFATDMYVCTALVDFYAKCGELEMAIKVFDEMPKRDM 169

BLAST of CsGy3G016120.1 vs. TrEMBL
Match: tr|A0A0A0L5M0|A0A0A0L5M0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G176270 PE=4 SV=1)

HSP 1 Score: 204.5 bits (519), Expect = 1.6e-49
Identity = 149/385 (38.70%), Positives = 150/385 (38.96%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL
Sbjct: 160 MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 219

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDI----------------------- 120
           KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVD+                       
Sbjct: 220 KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIV 279

Query: 121 ------------------------------------------------------------ 151
                                                                       
Sbjct: 280 SWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRH 339

BLAST of CsGy3G016120.1 vs. TrEMBL
Match: tr|A0A1S4DSF7|A0A1S4DSF7_CUCME (pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Cucumis melo OX=3656 GN=LOC107990382 PE=4 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 2.6e-47
Identity = 145/385 (37.66%), Positives = 149/385 (38.70%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           MEDAHQVFDEMG RNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTFPRVL
Sbjct: 145 MEDAHQVFDEMGKRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVL 204

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDI----------------------- 120
           KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVD+                       
Sbjct: 205 KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIVYKDIV 264

Query: 121 ------------------------------------------------------------ 151
                                                                       
Sbjct: 265 SWNSMLTGYTRHGLHFEALDIFDQMIQEGYKPDSVALSTLLSNILSLKFKLHIHGWVIRH 324

BLAST of CsGy3G016120.1 vs. TrEMBL
Match: tr|A0A2P6SMD5|A0A2P6SMD5_ROSCH (Putative tetratricopeptide-like helical domain-containing protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr1g0374811 PE=4 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 6.9e-40
Identity = 83/108 (76.85%), Positives = 91/108 (84.26%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           +E+AHQVFD+M  R+ SAFAWNSLISGYAELGLYEDA+ALYFQMEEEGVEPD FTFPRVL
Sbjct: 136 VEEAHQVFDQMPKRDVSAFAWNSLISGYAELGLYEDAMALYFQMEEEGVEPDRFTFPRVL 195

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER 109
           KACGGIG +QIGEAVHRH VR GF GD FVLNALVD+     +I   R
Sbjct: 196 KACGGIGFVQIGEAVHRHAVRLGFVGDRFVLNALVDMYAKCGDIVKAR 243


HSP 2 Score: 80.9 bits (198), Expect = 2.7e-12
Identity = 55/139 (39.57%), Positives = 79/139 (56.83%), Query Frame = 0

Query: 18  AFAWNSLISGYAELGLYEDALALYFQMEEE-GVEPDNFTFPRVLKACGGIGSIQIGEAVH 77
           A  + S++S  A LGL +D   L+  M+    + P    +  ++   G  G I+  EA  
Sbjct: 384 AITFVSILSVCAHLGLVKDGERLFSTMKNRYRISPIMEHYACMVNLYGRAGLIK--EAYG 443

Query: 78  RHVVRSGFAGDVFVLNALVDI-----DVDIAEIAAERLFELEPDNELNFELLMKIYGNAG 137
             +    F     V  AL+       + +I E+AAERLF+LEPDNE NFELLMKIYGN G
Sbjct: 444 IIMEGMEFEAGPTVWGALLYACYLYGNAEIGEVAAERLFDLEPDNEYNFELLMKIYGNVG 503

Query: 138 RSEDEKRVKLMMAERGLNS 151
           R ++ +RV++MM +RGL+S
Sbjct: 504 RLDNVERVRMMMVDRGLDS 520

BLAST of CsGy3G016120.1 vs. TrEMBL
Match: tr|M5XGA4|M5XGA4_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G280000 PE=4 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 3.4e-39
Identity = 83/108 (76.85%), Positives = 90/108 (83.33%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           +E+AHQVFDEM  R+ SAFAWNSLISGYAELGLYEDA+ALYFQMEEEGVEPD FTFPRVL
Sbjct: 135 IEEAHQVFDEMPKRDVSAFAWNSLISGYAELGLYEDAMALYFQMEEEGVEPDRFTFPRVL 194

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER 109
           KACGGIG IQIGEAVHRH+VR G   D FVLNALVD+     +I   R
Sbjct: 195 KACGGIGFIQIGEAVHRHIVRLGLLNDRFVLNALVDMYAKCGDIVKAR 242


HSP 2 Score: 89.0 bits (219), Expect = 1.0e-14
Identity = 61/133 (45.86%), Positives = 77/133 (57.89%), Query Frame = 0

Query: 23  SLISGYAELGLYEDALALYFQMEEE-GVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVR 82
           S++S  A LGL +D   LY  M+    + P    +  ++   G  G I+  EA    V  
Sbjct: 389 SILSTCAHLGLVKDGERLYSVMKNRYRISPIMEHYACMVNLYGRAGRIR--EAYGIIVDG 448

Query: 83  SGFAGDVFVLNALVDI-----DVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDE 142
             F     V  AL+       +VDI E+AAERLFELEPDNE NFELL+KIYGN GR ED 
Sbjct: 449 MEFEAGPTVWGALLYACYLHGNVDIGEVAAERLFELEPDNEYNFELLIKIYGNVGRLEDV 508

Query: 143 KRVKLMMAERGLN 150
           +RV+LMM ERGL+
Sbjct: 509 ERVRLMMVERGLD 519

BLAST of CsGy3G016120.1 vs. TrEMBL
Match: tr|D7T277|D7T277_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_16s0022g00970 PE=4 SV=1)

HSP 1 Score: 168.3 bits (425), Expect = 1.3e-38
Identity = 81/108 (75.00%), Positives = 90/108 (83.33%), Query Frame = 0

Query: 1   MEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVL 60
           +E+AH++FD+M  RN SAFAWNSLISGYAELGLYEDA+ALYFQMEEEGV PD FTFPRVL
Sbjct: 130 IEEAHRLFDQMSRRNRSAFAWNSLISGYAELGLYEDAMALYFQMEEEGVVPDRFTFPRVL 189

Query: 61  KACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDIDVDIAEIAAER 109
           KACGGIGSI +GE VHRHVVR GFA D FVLNALVD+     +I   R
Sbjct: 190 KACGGIGSISVGEEVHRHVVRCGFADDGFVLNALVDMYAKCGDIVKAR 237


HSP 2 Score: 84.7 bits (208), Expect = 1.9e-13
Identity = 59/134 (44.03%), Positives = 76/134 (56.72%), Query Frame = 0

Query: 23  SLISGYAELGLYEDALALYFQMEEE-GVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVR 82
           SL+S  A LGL +D   L+  M E+ G+ P    +  ++   G  G I+  EA      R
Sbjct: 383 SLLSACAHLGLVKDGEGLFSMMREDYGMIPSMEHYACMVNLYGRAGLIE--EAYEIIEKR 442

Query: 83  SGFAGDVFVLNALV-----DIDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDE 142
             F     V  AL+       +VDI +IAAE LFELEPDNE NFELLM IY N GR ED 
Sbjct: 443 MEFEAGPTVWGALLYACYFHHNVDIGKIAAECLFELEPDNEHNFELLMNIYRNVGRLEDV 502

Query: 143 KRVKLMMAERGLNS 151
           ++V+ MMA+RG +S
Sbjct: 503 EKVRKMMADRGFDS 514

The following BLAST results are available for this feature:
NCBI nr
Match Name        E-value    Identity (%)    Description
KGN57280.1        2.5e-49    38.70           hypothetical protein Csa_3G176270 [Cucumis sativus]
XP_011652769.1    2.5e-49    38.70           PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic ...
XP_016898932.1    4.0e-47    37.66           PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic ...
XP_022140840.1    5.3e-44    85.19           pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Momordica ...
XP_022942651.1    7.0e-44    82.41           pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita ...

TAIR10
Match Name     E-value    Identity (%)    Description
AT4G25270.1    3.1e-38    53.02           Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G15510.1    5.0e-20    38.00           Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G35130.1    1.9e-19    40.52           Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1    2.8e-18    40.95           mitochondrial editing factor 22
AT3G16610.1    2.3e-17    38.66           pentatricopeptide (PPR) repeat-containing protein

Swiss-Prot
Match Name               E-value    Identity (%)    Description
sp|Q9SB36|PP337_ARATH    5.6e-37    53.02           Pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Arabidop...
sp|Q9M9E2|PPR45_ARATH    9.0e-19    38.00           Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
sp|O49619|PP350_ARATH    3.4e-18    40.52           Pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Arabidop...
sp|Q9LTV8|PP224_ARATH    5.0e-17    40.95           Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
sp|Q9LUS3|PP237_ARATH    4.2e-16    38.66           Pentatricopeptide repeat-containing protein At3g16610 OS=Arabidopsis thaliana OX...

TrEMBL
Match Name                        E-value    Identity (%)    Description
tr|A0A0A0L5M0|A0A0A0L5M0_CUCSA    1.6e-49    38.70           Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G176270 PE=4 SV=1
tr|A0A1S4DSF7|A0A1S4DSF7_CUCME    2.6e-47    37.66           pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Cucumis ...
tr|A0A2P6SMD5|A0A2P6SMD5_ROSCH    6.9e-40    76.85           Putative tetratricopeptide-like helical domain-containing protein OS=Rosa chinen...
tr|M5XGA4|M5XGA4_PRUPE            3.4e-39    76.85           Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G280000 PE=4 SV=1
tr|D7T277|D7T277_VITVI            1.3e-38    75.00           Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_16s0022g00970 PE=4 SV=...

The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
Vocabulary: INTERPRO
Term          Definition
IPR011990     TPR-like_helical_dom_sf
IPR002885     Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CsGy3G016120    CsGy3G016120    gene


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CsGy3G016120.1.CDS.2    CsGy3G016120.1.CDS.2    CDS
CsGy3G016120.1.CDS.1    CsGy3G016120.1.CDS.1    CDS


The following exon feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CsGy3G016120.1.exon.2    CsGy3G016120.1.exon.2    exon
CsGy3G016120.1.exon.1    CsGy3G016120.1.exon.1    exon


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CsGy3G016120.1    CsGy3G016120.1-protein    polypeptide


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term     IPR Description                                      Source     Source Term         Source Description     Alignment
IPR002885    Pentatricopeptide repeat                             PFAM       PF13041             PPR_2                  coord: 19..63; e-value: 3.5E-11; score: 43.0
IPR002885    Pentatricopeptide repeat                             TIGRFAM    TIGR00756           TIGR00756              coord: 20..52; e-value: 4.1E-10; score: 37.2
IPR002885    Pentatricopeptide repeat                             PROSITE    PS51375             PPR                    coord: 52..86; score: 6.533
IPR002885    Pentatricopeptide repeat                             PROSITE    PS51375             PPR                    coord: 17..51; score: 12.912
IPR002885    Pentatricopeptide repeat                             PROSITE    PS51375             PPR                    coord: 116..150; score: 7.147
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D     G3DSA:1.25.40.10    -                      coord: 1..150; e-value: 1.3E-21; score: 79.3
None         No IPR available                                     PANTHER    PTHR24015           FAMILY NOT NAMED       coord: 2..97
None         No IPR available                                     PANTHER    PTHR24015           FAMILY NOT NAMED       coord: 97..149
None         No IPR available                                     PANTHER    PTHR24015:SF497     SUBFAMILY NOT NAMED    coord: 97..149; coord: 2..97
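
The PROSITE PS51375 coordinates above can be sanity-checked against the usual 35-residue length of a pentatricopeptide repeat. This is a small sketch under stated assumptions: the coordinates are taken to be 1-based and inclusive, the protein length of 150 residues is taken from the protein sequence in this record, and `span_length` is a hypothetical helper.

```python
# PROSITE PS51375 (PPR) hits from the InterPro table, as (start, end) residues.
ppr_hits = [(17, 51), (52, 86), (116, 150)]
protein_length = 150  # length of the protein sequence in this record

def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1

hits = sorted(ppr_hits)
lengths = [span_length(s, e) for s, e in hits]

# Consecutive hits overlap if one starts at or before the previous end.
overlaps = any(nxt[0] <= prev[1] for prev, nxt in zip(hits, hits[1:]))
covered = sum(lengths) / protein_length

print(lengths, overlaps, covered)
```

Each of the three hits spans 35 residues, none overlap, and together they cover 105 of the 150 residues.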