CsaV3_1G010760 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_1G010760
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein
Location: chr1 : 6680731 .. 6681883 (-)
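The location field packs chromosome, start, end, and strand into one string. The following is a minimal sketch of how it could be parsed, assuming the `chromosome : start .. end (strand)` layout used in this record and assuming the coordinates are 1-based and inclusive. `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr1 : 6680731 .. 6681883 (-)' (hypothetical helper)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("chr1 : 6680731 .. 6681883 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # chr1 - 1153
```

Under that assumption the feature spans 1153 bp on the minus strand of chr1.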
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTGAGATGGATCTTGCTCTAAAGCTGACAAGGAACATTGAGGTGATTTCAGGAGGCAGTGTTTCGCCTAATATAGTTGCCTATAATTGTATCATTAATGGATTCTGCAAGATAAGGAGGTTAGAGTCTGCAAAAAATGTTCTTGGTGAAATGATTAAGCTGGGAATAGATTTCAATGAGAGAACATATGCCACTTTGATTGACGGATATGCTAGAAAAGGGAGTTTGGATGTGGCATTTAGGTTATGTGATGAAATGGTTGAAATGAGGTTTGATTCCAGACACTTTTGTATATAACTCCCTCATCTACTGGCTATACATGGAAGGAGAATTAGAAGAAGCTTCTTTTTTATTATCTGACATGATTAATAGGCGTATCCTCCCTGATGAGTTTACCTACTCCATCCTTACAAAAGGTCTTTGCGTAAGTGGACATCTCAATAAAGCTTTAAGAGTTCACTACTACATTGTCGAAAGAAGCCTTGTAAGAGATGCATTTACTTATAATATTCTTATCAACTATATGTTTCAGAGCCGGAATATAGCAGGCGCCAAGCAACTACTGAGCAGTATGATCGTTGGTGGTATCAAACCTGACATGGTTACTTATGGCACTCCGGTTGATGGGCATTGTAAGGAAGGAAAAATTGAAGCTGCGGTTCAGATTTATGACAAAGCTGTGGTGTATAATTCTATTTTAGATGGGCTGTGCAAGCAAGGTTCAATTTATGCTGCTAAACTCTTGGTAGACAAATTACAGCAAAATGGTTTTCTCGATCCAGTTACCTATAACACATTGCTACATGGATTCTGCGTCAATGGGGAGATCGAGAAGGCTTTTGCACTGTTTTTAGAGATGATTAATGTGGGGAGTTTGGTGAACATAGTTTCTTACAATATAATGATTAACTTTCTATGCAAGATGGGATTGATCCAACAAGCCATGGAACTGATGAGAGCAATGTCTAGCCAGGGGATCATTCCCGACCTTATAACATACACGACTCTCATCACCAATTTTGTTGAGACTTGTGGCTCCGAGGATGTAATTGAGTTACATGGTTATATGATGCTTAAAGGAGCAGTTCCTGATAGGAAAACATACCGGTCTTTTGTAAGCCCCTGCCTTCAAGAACACACTGAGAGGTAG

mRNA sequence

ATGGGTGAGATGGATCTTGCTCTAAAGCTGACAAGGAACATTGAGGTGATTTCAGGAGGCAGTGTTTCGCCTAATATAGTTGCCTATAATTGTATCATTAATGGATTCTGCAAGATAAGGAGGTTAGAGTCTGCAAAAAATGTTCTTGGTGAAATGATTAAGCTGGGAATAGATTTCAATGAGAGAACATATGCCACTTTGATTGACGGATATGCTAGAAAAGGGAGTTTGGATGTGGCATTTAGGTTATGTGATGAAATGGTTGAAATGAGGCGTATCCTCCCTGATGAGTTTACCTACTCCATCCTTACAAAAGGTCTTTGCGTAAGTGGACATCTCAATAAAGCTTTAAGAGTTCACTACTACATTGTCGAAAGAAGCCTTGTAAGAGATGCATTTACTTATAATATTCTTATCAACTATATGTTTCAGAGCCGGAATATAGCAGGCGCCAAGCAACTACTGAGCAGTATGATCGTTGGTGGTATCAAACCTGACATGGTTACTTATGGCACTCCGGTTGATGGGCATTGTAAGGAAGGAAAAATTGAAGCTGCGGTTCAGATTTATGACAAAGCTGTGGTGTATAATTCTATTTTAGATGGGCTGTGCAAGCAAGGTTCAATTTATGCTGCTAAACTCTTGGTAGACAAATTACAGCAAAATGGTTTTCTCGATCCAGTTACCTATAACACATTGCTACATGGATTCTGCGTCAATGGGGAGATCGAGAAGGCTTTTGCACTGTTTTTAGAGATGATTAATGTGGGGAGTTTGGTGAACATAGTTTCTTACAATATAATGATTAACTTTCTATGCAAGATGGGATTGATCCAACAAGCCATGGAACTGATGAGAGCAATGTCTAGCCAGGGGATCATTCCCGACCTTATAACATACACGACTCTCATCACCAATTTTGTTGAGACTTGTGGCTCCGAGGATGTAATTGAGTTACATGGTTATATGATGCTTAAAGGAGCAGTTCCTGATAGGAAAACATACCGGTCTTTTGTAAGCCCCTGCCTTCAAGAACACACTGAGAGGTAG

Coding sequence (CDS)

ATGGGTGAGATGGATCTTGCTCTAAAGCTGACAAGGAACATTGAGGTGATTTCAGGAGGCAGTGTTTCGCCTAATATAGTTGCCTATAATTGTATCATTAATGGATTCTGCAAGATAAGGAGGTTAGAGTCTGCAAAAAATGTTCTTGGTGAAATGATTAAGCTGGGAATAGATTTCAATGAGAGAACATATGCCACTTTGATTGACGGATATGCTAGAAAAGGGAGTTTGGATGTGGCATTTAGGTTATGTGATGAAATGGTTGAAATGAGGCGTATCCTCCCTGATGAGTTTACCTACTCCATCCTTACAAAAGGTCTTTGCGTAAGTGGACATCTCAATAAAGCTTTAAGAGTTCACTACTACATTGTCGAAAGAAGCCTTGTAAGAGATGCATTTACTTATAATATTCTTATCAACTATATGTTTCAGAGCCGGAATATAGCAGGCGCCAAGCAACTACTGAGCAGTATGATCGTTGGTGGTATCAAACCTGACATGGTTACTTATGGCACTCCGGTTGATGGGCATTGTAAGGAAGGAAAAATTGAAGCTGCGGTTCAGATTTATGACAAAGCTGTGGTGTATAATTCTATTTTAGATGGGCTGTGCAAGCAAGGTTCAATTTATGCTGCTAAACTCTTGGTAGACAAATTACAGCAAAATGGTTTTCTCGATCCAGTTACCTATAACACATTGCTACATGGATTCTGCGTCAATGGGGAGATCGAGAAGGCTTTTGCACTGTTTTTAGAGATGATTAATGTGGGGAGTTTGGTGAACATAGTTTCTTACAATATAATGATTAACTTTCTATGCAAGATGGGATTGATCCAACAAGCCATGGAACTGATGAGAGCAATGTCTAGCCAGGGGATCATTCCCGACCTTATAACATACACGACTCTCATCACCAATTTTGTTGAGACTTGTGGCTCCGAGGATGTAATTGAGTTACATGGTTATATGATGCTTAAAGGAGCAGTTCCTGATAGGAAAACATACCGGTCTTTTGTAAGCCCCTGCCTTCAAGAACACACTGAGAGGTAG

Protein sequence

MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHYYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKEGKIEAAVQIYDKAVVYNSILDGLCKQGSIYAAKLLVDKLQQNGFLDPVTYNTLLHGFCVNGEIEKAFALFLEMINVGSLVNIVSYNIMINFLCKMGLIQQAMELMRAMSSQGIIPDLITYTTLITNFVETCGSEDVIELHGYMMLKGAVPDRKTYRSFVSPCLQEHTER
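The protein sequence above corresponds to the CDS read codon by codon. As an illustration, here is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the short test string are illustrative: the test string is the first 30 nucleotides of the CDS above with a `TAG` stop codon appended by hand.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS above, plus an appended stop codon for illustration.
print(translate("ATGGGTGAGATGGATCTTGCTCTAAAGCTGTAG"))  # MGEMDLALKL
```

The output matches the first ten residues of the protein sequence listed above. Passing the full CDS string to `translate` would be the natural way to check the remainder.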
BLAST of CsaV3_1G010760 vs. NCBI nr
Match: KGN64519.1 (hypothetical protein Csa_1G062940 [Cucumis sativus])

HSP 1 Score: 400.6 bits (1028), Expect = 5.5e-108
Identity = 248/248 (100.00%), Positives = 248/248 (100.00%), Query Frame = 0

Query: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN 60
           MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN
Sbjct: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN 60

Query: 61  ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVH 120
           ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVH
Sbjct: 61  ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVH 120

Query: 121 YYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKE 180
           YYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKE
Sbjct: 121 YYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKE 180

Query: 181 GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX 240
           GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX
Sbjct: 181 GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX 240

Query: 241 XXXEKAFA 249
           XXXEKAFA
Sbjct: 241 XXXEKAFA 248
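Each hit in this section starts with the same two statistics lines (score, E-value, identity, positives). The sketch below shows one way to pull those numbers out and recompute the percentages, assuming the line layout used on this page. `parse_hsp` is a hypothetical helper, and the regular expression accepts either spelling of the "Positives" label.

```python
import re

SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)")
IDENT_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+)")
POS_RE = re.compile(r"Posi?tives\s*=\s*(\d+)/(\d+)")

def parse_hsp(score_line, identity_line):
    """Extract HSP statistics from the two header lines (hypothetical helper)."""
    bits, raw, expect = SCORE_RE.search(score_line).groups()
    ident, ident_len = map(int, IDENT_RE.search(identity_line).groups())
    pos, pos_len = map(int, POS_RE.search(identity_line).groups())
    return {
        "bit_score": float(bits),
        "raw_score": int(raw),
        "expect": float(expect),
        # Percentages are recomputed from the counts rather than read from the text.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 400.6 bits (1028), Expect = 5.5e-108",
    "Identity = 248/248 (100.00%), Positives = 248/248 (100.00%), Query Frame = 0",
)
print(hsp["bit_score"], hsp["identity_pct"])  # 400.6 100.0
```

Recomputing the percentages from the residue counts gives a quick consistency check against the values printed in parentheses.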

BLAST of CsaV3_1G010760 vs. NCBI nr
Match: XP_011652662.1 (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11710, mitochondrial [Cucumis sativus])

HSP 1 Score: 134.8 bits (338), Expect = 5.6e-28
Identity = 139/248 (56.05%), Positives = 139/248 (56.05%), Query Frame = 0

Query: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN 60
           MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN
Sbjct: 62  MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN 121

Query: 61  ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVH 120
           E                                                           
Sbjct: 122 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 181

Query: 121 YYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKE 180
                                   SRNIAGAK                          KE
Sbjct: 182 XXXXXXXXXXXXXXXXXXKGLC--SRNIAGAKXXXXXXXXXXXXXXXXXXXXXXXXXXKE 241

Query: 181 GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX 240
           GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX
Sbjct: 242 GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX 301

Query: 241 XXXEKAFA 249
           XXXEKAFA
Sbjct: 302 XXXEKAFA 307

BLAST of CsaV3_1G010760 vs. NCBI nr
Match: ESQ29591.1 (hypothetical protein EUTSA_v10024126mg [Eutrema salsugineum])

HSP 1 Score: 122.5 bits (306), Expect = 2.9e-24
Identity = 65/201 (32.34%), Positives = 106/201 (52.74%), Query Frame = 0

Query: 25  NIVAYNCIINGFCKIRRLESAKNVLG----EMIKLGIDFNERTYATLIDGYARKGSLDVA 84
           N+  +N +I   CK  +L  A ++ G    +M++ G+  NERTY +L+D Y R    D A
Sbjct: 9   NVNTFNLVIYSCCKEYKLLEALDLAGKIRIDMVESGVGCNERTYGSLVDTYGRARRSDEA 68

Query: 85  FRLCDEMVE------------------------------MRRILPDEFTYSILTKGLCVS 144
            RLCDEM                                 + +  D FT+ I+ +GLC +
Sbjct: 69  LRLCDEMTSNGLLAIPLSIPLLFIGCSWKVTQKQLCRCWCKEMRIDRFTHFIVVRGLCRN 128

Query: 145 GHLNKALRVHYYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTY 192
           G++ +A++    I+E+ LV      N L++++ + + +A A Q+L SM+V G+  D +++
Sbjct: 129 GYVEEAVKFQRQILEKKLVEYGVCQNTLMHHLVRDKKLASADQILGSMLVCGLNLDAISF 188

BLAST of CsaV3_1G010760 vs. NCBI nr
Match: PWA33153.1 (Pentatricopeptide repeat-containing protein [Artemisia annua])

HSP 1 Score: 110.5 bits (275), Expect = 1.1e-20
Identity = 72/232 (31.03%), Positives = 111/232 (47.84%), Query Frame = 0

Query: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGE------MIK 60
           +GE ++AL    N      G    N+V+Y C+++ +C++ R E  K++ G+      + K
Sbjct: 60  IGEYEVALAFYDN--AAKCGGFEMNVVSYTCVLSAYCRLNRFEEVKDLKGKVDEALRVFK 119

Query: 61  L----GIDFNERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVS 120
           L    G+  +E  YATLIDG+ +    D  FRL DEM E + + P   TY+I+  GLC  
Sbjct: 120 LVENSGMGLDEFVYATLIDGFCKVCDFDSVFRLLDEMKE-KGVHPSVVTYNIIINGLCKC 179

Query: 121 GHLNKALRV---------------HYYIVERSLV---------------RDAFTYNILIN 180
           G  N+A +V               H YI E+ L                 D    N+LI 
Sbjct: 180 GRTNEAYKVSKGIDGDVITYSTLLHGYIKEKDLTGVIITKKRLEECGVRMDVVMCNVLIK 239

Query: 181 YMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKEGKIEAAVQIYDK 193
            +F   +      +   M   G+ P+ VT+ T VDG+CK G+IE A++I+D+
Sbjct: 240 ALFLVGSFEDVNIIYKGMPEMGLTPNDVTFCTLVDGYCKLGRIEEALEIFDE 288

BLAST of CsaV3_1G010760 vs. NCBI nr
Match: PWA94556.1 (pentatricopeptide repeat (PPR) superfamily protein [Artemisia annua])

HSP 1 Score: 110.2 bits (274), Expect = 1.5e-20
Identity = 73/218 (33.49%), Positives = 105/218 (48.17%), Query Frame = 0

Query: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN 60
           MG++ L LKL RN+ ++S G                          ++  EM  +G++ N
Sbjct: 194 MGDVSLGLKLFRNMGIMSMGXXXXXXXXXXXXXXXXXXXXXXXXXXSLRDEM-TVGVEVN 253

Query: 61  ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILP------------------------- 120
            RTYATL+DGY RKG    AF+LC +MV+ R ++P                         
Sbjct: 254 VRTYATLVDGYLRKGCTKEAFKLCSDMVD-RGLVPNIVVYNSLIHWLYFEGDTTTASVLL 313

Query: 121 ----------DEFTYSILTKGLCVSGHLNKALRVHYYIVERSLV-RDAFTYNILINYMFQ 180
                     D+FT SIL KGL  +G+L +AL  H ++   +LV +D F  NILI Y+F+
Sbjct: 314 SYMIKTNICFDKFTNSILVKGLSRNGYLKEALDYHKWLAGENLVDKDGFLENILIYYLFR 373

BLAST of CsaV3_1G010760 vs. TAIR10
Match: AT5G25630.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 66.2 bits (160), Expect = 4.5e-11
Identity = 46/195 (23.59%), Positives = 85/195 (43.59%), Query Frame = 0

Query: 22  VSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNERTYATLIDGYARKG-SLDVA 81
           V PNI  +N ++  +CK +++E A  V+ +M + G+  +  TY T+   Y +KG ++   
Sbjct: 184 VGPNIRTFNVLVQAWCKKKKVEEAWEVVKKMEECGVRPDTVTYNTIATCYVQKGETVRAE 243

Query: 82  FRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHYYIVERSLVRDAFTYNILIN 141
             + ++MV   +  P+  T  I+  G C  G +   LR    + E  +  +   +N LIN
Sbjct: 244 SEVVEKMVMKEKAKPNGRTCGIVVGGYCREGRVRDGLRFVRRMKEMRVEANLVVFNSLIN 303

Query: 142 YMFQSRNIAGAK-------------------------QLLSSMIVGGIKPDMVTYGTPVD 191
              +  +  G                           Q+L+ M    +K D++TY T ++
Sbjct: 304 GFVEVMDRDGIDEVTLTLLLMSFNEEVELVGNQKMKVQVLTLMKECNVKADVITYSTVMN 363

BLAST of CsaV3_1G010760 vs. TAIR10
Match: AT3G09650.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 65.9 bits (159), Expect = 5.8e-11
Identity = 44/144 (30.56%), Positives = 69/144 (47.92%), Query Frame = 0

Query: 3   EMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNER 62
           ++D A  L R  E+     + P++V+YN II+G   I     A     EM   GI   + 
Sbjct: 502 QIDRAEDLLR--EMTEDAGIEPDVVSYNIIIDGCILIDDSAGALAFFNEMRTRGIAPTKI 561

Query: 63  TYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHYY 122
           +Y TL+  +A  G   +A R+ DEM+   R+  D   +++L +G C  G +  A RV   
Sbjct: 562 SYTTLMKAFAMSGQPKLANRVFDEMMNDPRVKVDLIAWNMLVEGYCRLGLIEDAQRVVSR 621

Query: 123 IVERSLVRDAFTYNILINYMFQSR 147
           + E     +  TY  L N + Q+R
Sbjct: 622 MKENGFYPNVATYGSLANGVSQAR 643

BLAST of CsaV3_1G010760 vs. TAIR10
Match: AT5G48730.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 64.7 bits (156), Expect = 1.3e-10
Identity = 45/162 (27.78%), Positives = 68/162 (41.98%), Query Frame = 0

Query: 24  PNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNERTYATLIDGYARKGSLDVAFRL 83
           PN+  Y  +I    K ++ E A  +  EMI  G   N   Y  L+  Y+R G  D AF L
Sbjct: 148 PNVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVYTALVSAYSRSGRFDAAFTL 207

Query: 84  CDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHYYIVERSLVRDAFTYNILINYMF 143
            + M       PD  TYSIL K        +K   +   +  + +  +  TYN LI+   
Sbjct: 208 LERMKSSHNCQPDVHTYSILIKSFLQVFAFDKVQDLLSDMRRQGIRPNTITYNTLIDAYG 267

Query: 144 QSRNIAGAKQLLSSMI-VGGIKPDMVTYGTPVDGHCKEGKIE 185
           +++     +  L  M+     KPD  T  + +      G+IE
Sbjct: 268 KAKMFVEMESTLIQMLGEDDCKPDSWTMNSTLRAFGGNGQIE 309

BLAST of CsaV3_1G010760 vs. TAIR10
Match: AT2G16880.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 62.8 bits (151), Expect = 4.9e-10
Identity = 27/96 (28.12%), Positives = 53/96 (55.21%), Query Frame = 0

Query: 22  VSPNIVAYNCIINGFCKIR---RLESAKNVLGEMIKLGIDFNERTYATLIDGYARKGSLD 81
           + PN++  N ++ G  +      + SA+ V  +M+K+G+  N +T+  L++GY  +G L+
Sbjct: 162 LKPNLLTCNTLLIGLVRYPSSFSISSAREVFDDMVKIGVSLNVQTFNVLVNGYCLEGKLE 221

Query: 82  VAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLN 115
            A  + + MV   ++ PD  TY+ + K +   G L+
Sbjct: 222 DALGMLERMVSEFKVNPDNVTYNTILKAMSKKGRLS 257

BLAST of CsaV3_1G010760 vs. TAIR10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 59.7 bits (143), Expect = 4.2e-09
Identity = 40/191 (20.94%), Positives = 85/191 (44.50%), Query Frame = 0

Query: 2   GEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNE 61
           G +D  +K      V+    +SPN V ++C+++       ++    + G ++  G+DF  
Sbjct: 218 GALDSVIK---GFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEG 277

Query: 62  RTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHY 121
               +L+  Y++ G  D A +L   M        D  T++ +  G   SG + ++L   Y
Sbjct: 278 SIKNSLLSMYSKCGRFDDASKLFRMMSR-----ADTVTWNCMISGYVQSGLMEESLTFFY 337

Query: 122 YIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKEG 181
            ++   ++ DA T++ L+  + +  N+   KQ+   ++   I  D+      +D + K  
Sbjct: 338 EMISSGVLPDAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCR 397

Query: 182 KIEAAVQIYDK 193
            +  A  I+ +
Sbjct: 398 GVSMAQNIFSQ 400

BLAST of CsaV3_1G010760 vs. Swiss-Prot
Match: sp|Q9SF38|PP222_ARATH (Pentatricopeptide repeat-containing protein At3g09650, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=HCF152 PE=2 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 1.1e-09
Identity = 44/144 (30.56%), Positives = 69/144 (47.92%), Query Frame = 0

Query: 3   EMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNER 62
           ++D A  L R  E+     + P++V+YN II+G   I     A     EM   GI   + 
Sbjct: 502 QIDRAEDLLR--EMTEDAGIEPDVVSYNIIIDGCILIDDSAGALAFFNEMRTRGIAPTKI 561

Query: 63  TYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHYY 122
           +Y TL+  +A  G   +A R+ DEM+   R+  D   +++L +G C  G +  A RV   
Sbjct: 562 SYTTLMKAFAMSGQPKLANRVFDEMMNDPRVKVDLIAWNMLVEGYCRLGLIEDAQRVVSR 621

Query: 123 IVERSLVRDAFTYNILINYMFQSR 147
           + E     +  TY  L N + Q+R
Sbjct: 622 MKENGFYPNVATYGSLANGVSQAR 643

BLAST of CsaV3_1G010760 vs. Swiss-Prot
Match: sp|Q9FKC3|PP424_ARATH (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 64.7 bits (156), Expect = 2.3e-09
Identity = 45/162 (27.78%), Positives = 68/162 (41.98%), Query Frame = 0

Query: 24  PNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNERTYATLIDGYARKGSLDVAFRL 83
           PN+  Y  +I    K ++ E A  +  EMI  G   N   Y  L+  Y+R G  D AF L
Sbjct: 148 PNVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVYTALVSAYSRSGRFDAAFTL 207

Query: 84  CDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHYYIVERSLVRDAFTYNILINYMF 143
            + M       PD  TYSIL K        +K   +   +  + +  +  TYN LI+   
Sbjct: 208 LERMKSSHNCQPDVHTYSILIKSFLQVFAFDKVQDLLSDMRRQGIRPNTITYNTLIDAYG 267

Query: 144 QSRNIAGAKQLLSSMI-VGGIKPDMVTYGTPVDGHCKEGKIE 185
           +++     +  L  M+     KPD  T  + +      G+IE
Sbjct: 268 KAKMFVEMESTLIQMLGEDDCKPDSWTMNSTLRAFGGNGQIE 309

BLAST of CsaV3_1G010760 vs. Swiss-Prot
Match: sp|Q9ZVX5|PP156_ARATH (Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana OX=3702 GN=At2g16880 PE=2 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 8.9e-09
Identity = 27/96 (28.12%), Positives = 53/96 (55.21%), Query Frame = 0

Query: 22  VSPNIVAYNCIINGFCKIR---RLESAKNVLGEMIKLGIDFNERTYATLIDGYARKGSLD 81
           + PN++  N ++ G  +      + SA+ V  +M+K+G+  N +T+  L++GY  +G L+
Sbjct: 162 LKPNLLTCNTLLIGLVRYPSSFSISSAREVFDDMVKIGVSLNVQTFNVLVNGYCLEGKLE 221

Query: 82  VAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLN 115
            A  + + MV   ++ PD  TY+ + K +   G L+
Sbjct: 222 DALGMLERMVSEFKVNPDNVTYNTILKAMSKKGRLS 257

BLAST of CsaV3_1G010760 vs. Swiss-Prot
Match: sp|Q9STE1|PP333_ARATH (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 7.5e-08
Identity = 40/191 (20.94%), Positives = 85/191 (44.50%), Query Frame = 0

Query: 2   GEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNE 61
           G +D  +K      V+    +SPN V ++C+++       ++    + G ++  G+DF  
Sbjct: 218 GALDSVIK---GFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEG 277

Query: 62  RTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHY 121
               +L+  Y++ G  D A +L   M        D  T++ +  G   SG + ++L   Y
Sbjct: 278 SIKNSLLSMYSKCGRFDDASKLFRMMSR-----ADTVTWNCMISGYVQSGLMEESLTFFY 337

Query: 122 YIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKEG 181
            ++   ++ DA T++ L+  + +  N+   KQ+   ++   I  D+      +D + K  
Sbjct: 338 EMISSGVLPDAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCR 397

Query: 182 KIEAAVQIYDK 193
            +  A  I+ +
Sbjct: 398 GVSMAQNIFSQ 400

BLAST of CsaV3_1G010760 vs. Swiss-Prot
Match: sp|Q8GZ63|PP397_ARATH (Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX=3702 GN=At5g25630 PE=2 SV=2)

HSP 1 Score: 58.9 bits (141), Expect = 1.3e-07
Identity = 35/136 (25.74%), Positives = 67/136 (49.26%), Query Frame = 0

Query: 22  VSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFNERTYATLIDGYARKG-SLDVA 81
           V PNI  +N ++  +CK +++E A  V+ +M + G+  +  TY T+   Y +KG ++   
Sbjct: 184 VGPNIRTFNVLVQAWCKKKKVEEAWEVVKKMEECGVRPDTVTYNTIATCYVQKGETVRAE 243

Query: 82  FRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVHYYIVERSLVRDAFTYNILIN 141
             + ++MV   +  P+  T  I+  G C  G +   LR    + E  +  +   +N LIN
Sbjct: 244 SEVVEKMVMKEKAKPNGRTCGIVVGGYCREGRVRDGLRFVRRMKEMRVEANLVVFNSLIN 303

Query: 142 YMFQSRNIAGAKQLLS 157
              +  +  G  ++L+
Sbjct: 304 GFVEVMDRDGIDEVLT 319

BLAST of CsaV3_1G010760 vs. TrEMBL
Match: tr|A0A0A0LU83|A0A0A0LU83_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G062940 PE=4 SV=1)

HSP 1 Score: 400.6 bits (1028), Expect = 3.6e-108
Identity = 248/248 (100.00%), Positives = 248/248 (100.00%), Query Frame = 0

Query: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN 60
           MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN
Sbjct: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN 60

Query: 61  ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVH 120
           ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVH
Sbjct: 61  ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVSGHLNKALRVH 120

Query: 121 YYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKE 180
           YYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKE
Sbjct: 121 YYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKE 180

Query: 181 GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX 240
           GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX
Sbjct: 181 GKIEAAVQIYDKXXXXXXXXXXXXXXXXXXXXXXXXXXLQQNGFLDPXXXXXXXXXXXXX 240

Query: 241 XXXEKAFA 249
           XXXEKAFA
Sbjct: 241 XXXEKAFA 248

BLAST of CsaV3_1G010760 vs. TrEMBL
Match: tr|V4MEA9|V4MEA9_EUTSA (Uncharacterized protein OS=Eutrema salsugineum OX=72664 GN=EUTSA_v10024126mg PE=4 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 1.9e-24
Identity = 65/201 (32.34%), Positives = 106/201 (52.74%), Query Frame = 0

Query: 25  NIVAYNCIINGFCKIRRLESAKNVLG----EMIKLGIDFNERTYATLIDGYARKGSLDVA 84
           N+  +N +I   CK  +L  A ++ G    +M++ G+  NERTY +L+D Y R    D A
Sbjct: 9   NVNTFNLVIYSCCKEYKLLEALDLAGKIRIDMVESGVGCNERTYGSLVDTYGRARRSDEA 68

Query: 85  FRLCDEMVE------------------------------MRRILPDEFTYSILTKGLCVS 144
            RLCDEM                                 + +  D FT+ I+ +GLC +
Sbjct: 69  LRLCDEMTSNGLLAIPLSIPLLFIGCSWKVTQKQLCRCWCKEMRIDRFTHFIVVRGLCRN 128

Query: 145 GHLNKALRVHYYIVERSLVRDAFTYNILINYMFQSRNIAGAKQLLSSMIVGGIKPDMVTY 192
           G++ +A++    I+E+ LV      N L++++ + + +A A Q+L SM+V G+  D +++
Sbjct: 129 GYVEEAVKFQRQILEKKLVEYGVCQNTLMHHLVRDKKLASADQILGSMLVCGLNLDAISF 188

BLAST of CsaV3_1G010760 vs. TrEMBL
Match: tr|A0A2U1KAM5|A0A2U1KAM5_ARTAN (Pentatricopeptide repeat-containing protein OS=Artemisia annua OX=35608 GN=CTI12_AA618910 PE=4 SV=1)

HSP 1 Score: 110.5 bits (275), Expect = 7.5e-21
Identity = 72/232 (31.03%), Positives = 111/232 (47.84%), Query Frame = 0

Query: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGE------MIK 60
           +GE ++AL    N      G    N+V+Y C+++ +C++ R E  K++ G+      + K
Sbjct: 60  IGEYEVALAFYDN--AAKCGGFEMNVVSYTCVLSAYCRLNRFEEVKDLKGKVDEALRVFK 119

Query: 61  L----GIDFNERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILPDEFTYSILTKGLCVS 120
           L    G+  +E  YATLIDG+ +    D  FRL DEM E + + P   TY+I+  GLC  
Sbjct: 120 LVENSGMGLDEFVYATLIDGFCKVCDFDSVFRLLDEMKE-KGVHPSVVTYNIIINGLCKC 179

Query: 121 GHLNKALRV---------------HYYIVERSLV---------------RDAFTYNILIN 180
           G  N+A +V               H YI E+ L                 D    N+LI 
Sbjct: 180 GRTNEAYKVSKGIDGDVITYSTLLHGYIKEKDLTGVIITKKRLEECGVRMDVVMCNVLIK 239

Query: 181 YMFQSRNIAGAKQLLSSMIVGGIKPDMVTYGTPVDGHCKEGKIEAAVQIYDK 193
            +F   +      +   M   G+ P+ VT+ T VDG+CK G+IE A++I+D+
Sbjct: 240 ALFLVGSFEDVNIIYKGMPEMGLTPNDVTFCTLVDGYCKLGRIEEALEIFDE 288

BLAST of CsaV3_1G010760 vs. TrEMBL
Match: tr|A0A2U1Q973|A0A2U1Q973_ARTAN (Pentatricopeptide repeat (PPR) superfamily protein OS=Artemisia annua OX=35608 GN=CTI12_AA059420 PE=4 SV=1)

HSP 1 Score: 110.2 bits (274), Expect = 9.8e-21
Identity = 73/218 (33.49%), Positives = 105/218 (48.17%), Query Frame = 0

Query: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGIDFN 60
           MG++ L LKL RN+ ++S G                          ++  EM  +G++ N
Sbjct: 194 MGDVSLGLKLFRNMGIMSMGXXXXXXXXXXXXXXXXXXXXXXXXXXSLRDEM-TVGVEVN 253

Query: 61  ERTYATLIDGYARKGSLDVAFRLCDEMVEMRRILP------------------------- 120
            RTYATL+DGY RKG    AF+LC +MV+ R ++P                         
Sbjct: 254 VRTYATLVDGYLRKGCTKEAFKLCSDMVD-RGLVPNIVVYNSLIHWLYFEGDTTTASVLL 313

Query: 121 ----------DEFTYSILTKGLCVSGHLNKALRVHYYIVERSLV-RDAFTYNILINYMFQ 180
                     D+FT SIL KGL  +G+L +AL  H ++   +LV +D F  NILI Y+F+
Sbjct: 314 SYMIKTNICFDKFTNSILVKGLSRNGYLKEALDYHKWLAGENLVDKDGFLENILIYYLFR 373

BLAST of CsaV3_1G010760 vs. TrEMBL
Match: tr|A0A1S3B385|A0A1S3B385_CUCME (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11710, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103485289 PE=4 SV=1)

HSP 1 Score: 109.8 bits (273), Expect = 1.3e-20
Identity = 54/57 (94.74%), Positives = 54/57 (94.74%), Query Frame = 0

Query: 1   MGEMDLALKLTRNIEVISGGSVSPNIVAYNCIINGFCKIRRLESAKNVLGEMIKLGI 58
           MGEMDLALKLTRN EVISGGSVSPNIV YNCIINGFCKIRRLESAKNVL EMIKLGI
Sbjct: 262 MGEMDLALKLTRNTEVISGGSVSPNIVTYNCIINGFCKIRRLESAKNVLAEMIKLGI 318

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name      E-value   Identity (%)  Description
KGN64519.1      5.5e-108  100.00        hypothetical protein Csa_1G062940 [Cucumis sativus]
XP_011652662.1  5.6e-28   56.05         PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g...
ESQ29591.1      2.9e-24   32.34         hypothetical protein EUTSA_v10024126mg [Eutrema salsugineum]
PWA33153.1      1.1e-20   31.03         Pentatricopeptide repeat-containing protein [Artemisia annua]
PWA94556.1      1.5e-20   33.49         pentatricopeptide repeat (PPR) superfamily protein [Artemisia annua]

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT5G25630.2  4.5e-11  23.59         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G09650.1  5.8e-11  30.56         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48730.1  1.3e-10  27.78         Pentatricopeptide repeat (PPR) superfamily protein
AT2G16880.1  4.9e-10  28.13         Pentatricopeptide repeat (PPR) superfamily protein
AT4G21300.1  4.2e-09  20.94         Tetratricopeptide repeat (TPR)-like superfamily protein

vs. Swiss-Prot
Match Name             E-value  Identity (%)  Description
sp|Q9SF38|PP222_ARATH  1.1e-09  30.56         Pentatricopeptide repeat-containing protein At3g09650, chloroplastic OS=Arabidop...
sp|Q9FKC3|PP424_ARATH  2.3e-09  27.78         Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop...
sp|Q9ZVX5|PP156_ARATH  8.9e-09  28.13         Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana OX...
sp|Q9STE1|PP333_ARATH  7.5e-08  20.94         Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...
sp|Q8GZ63|PP397_ARATH  1.3e-07  25.74         Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX...

vs. TrEMBL
Match Name                      E-value   Identity (%)  Description
tr|A0A0A0LU83|A0A0A0LU83_CUCSA  3.6e-108  100.00        Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G062940 PE=4 SV=1
tr|V4MEA9|V4MEA9_EUTSA          1.9e-24   32.34         Uncharacterized protein OS=Eutrema salsugineum OX=72664 GN=EUTSA_v10024126mg PE=...
tr|A0A2U1KAM5|A0A2U1KAM5_ARTAN  7.5e-21   31.03         Pentatricopeptide repeat-containing protein OS=Artemisia annua OX=35608 GN=CTI12...
tr|A0A2U1Q973|A0A2U1Q973_ARTAN  9.8e-21   33.49         Pentatricopeptide repeat (PPR) superfamily protein OS=Artemisia annua OX=35608 G...
tr|A0A1S3B385|A0A1S3B385_CUCME  1.3e-20   94.74         LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11710, mito...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_1G010760.1  CsaV3_1G010760.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756
    coord: 168..192; e-value: 8.2E-4; score: 17.4
    coord: 263..296; e-value: 1.1E-8; score: 32.7
    coord: 63..96; e-value: 2.5E-7; score: 28.4
    coord: 27..59; e-value: 1.6E-7; score: 29.0
    coord: 133..166; e-value: 3.1E-6; score: 25.0
    coord: 194..224; e-value: 7.7E-5; score: 20.6
    coord: 228..260; e-value: 1.0E-8; score: 32.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2
    coord: 24..73; e-value: 2.2E-14; score: 53.2
    coord: 261..307; e-value: 2.3E-13; score: 50.0
    coord: 131..179; e-value: 1.9E-10; score: 40.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1
    coord: 225..253; e-value: 2.3E-10; score: 39.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR
    coord: 98..126; e-value: 0.078; score: 13.2
    coord: 194..224; e-value: 0.0011; score: 19.0
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 131..165; score: 10.26
    coord: 60..94; score: 10.282
    coord: 296..330; score: 8.385
    coord: 261..295; score: 12.386
    coord: 96..130; score: 9.109
    coord: 226..260; score: 11.816
    coord: 166..200; score: 8.418
    coord: 25..59; score: 11.301
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | -
    coord: 1..110; e-value: 2.0E-28; score: 101.0
    coord: 111..193; e-value: 3.6E-17; score: 64.2
    coord: 260..348; e-value: 3.2E-19; score: 71.0
    coord: 194..259; e-value: 3.3E-14; score: 55.1
None | No IPR available | PANTHER | PTHR44149 | FAMILY NOT NAMED
    coord: 83..222
None | No IPR available | PANTHER | PTHR44149:SF1 | SUBFAMILY NOT NAMED
    coord: 4..192
None | No IPR available | PANTHER | PTHR44149 | FAMILY NOT NAMED
    coord: 195..345
    coord: 133..292
None | No IPR available | PANTHER | PTHR44149:SF1 | SUBFAMILY NOT NAMED
    coord: 83..222
    coord: 195..345
    coord: 133..292
None | No IPR available | PANTHER | PTHR44149 | FAMILY NOT NAMED
    coord: 4..192
None | No IPR available | SUPERFAMILY | SSF81901 | HCP-like
    coord: 177..284
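
The PROSITE PS51375 coordinates above are listed out of order, which makes it hard to see how the predicted PPR repeats tile the protein. The following is a minimal sketch that sorts them and reports the stretches not covered by any repeat. The coordinate pairs are copied from the table; `find_gaps` is a hypothetical helper, and the coordinates are assumed to be 1-based inclusive residue positions.

```python
# PROSITE PS51375 (PPR) repeat coordinates, copied from the annotation table.
ppr_repeats = [
    (131, 165), (60, 94), (296, 330), (261, 295),
    (96, 130), (226, 260), (166, 200), (25, 59),
]

def find_gaps(intervals):
    """Return uncovered (start, end) stretches between consecutive sorted intervals."""
    ordered = sorted(intervals)
    gaps = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps

# Assumption: inclusive coordinates, so each repeat length is end - start + 1.
lengths = {end - start + 1 for start, end in ppr_repeats}
print(sorted(lengths))         # [35]
print(find_gaps(ppr_repeats))  # [(95, 95), (201, 225)]
```

On these coordinates, every PROSITE repeat is 35 residues long, and the only uncovered positions between the first and last repeat are residue 95 and residues 201 to 225.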

The following gene(s) are paralogous to this gene:

None