CsaV3_1G029810 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_1G029810
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat
Location: chr1 : 16586009 .. 16586576 (-)
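As a quick sanity check on the location line, the span length can be derived from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption; the record does not state its convention), and `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a 'chrom : start .. end (strand)' string into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr1 : 16586009 .. 16586576 (-)")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(chrom, strand, span_length)
```

Under that assumption the feature spans 568 bp on the minus strand, which is 189 complete codons plus one extra base.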
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGCTCTCTTCCCCTTCACTCCTACTTTCACCCTCTTTCCATGTCCTTCCCTCTTCCGATCCTCCTTATAGGGTCCTTCAAGAACACCCATCTCTCAAGCTTCTCTCCAAATGCCAAAGCATTCGAACTTTCAAACAAATCCACGCTCACATCATCAAAACTGGCCTCCACAACACCCTCTTCGCCCTCAGCAAGCTCATCGAGTTCTCTGCTGTTTCACGCTCTGGTGATATCTCTTATGCCATCTCCTTGTTTAATTCCATCGAAGAACCTAATTTATTTATTTGGAATTCCATGATTCGAGGGCTTTCCATGAGTCTGTCACCAGCTCTGGCCTTGGTTTTCTTCGTCAGAATGATTTATTCCGGGGTTGAGCCGAACTCTTATACCTTCCCTTTTCTTTTGAAGTCTTGTGCTAAGCTCGCCTCTGCCCATGAAGGGAAACAGATTCATGCCCATGTTTTGAAGCTTGGGTTTGTGTCTGATGTGTTCATTCATACCTCGCTTATCAATATGTATGCCCAAAGTGGTGAAATGAATAATGCCCAATTGGTTTTTGATCAAA

mRNA sequence

ATGGCGCTCTCTTCCCCTTCACTCCTACTTTCACCCTCTTTCCATGTCCTTCCCTCTTCCGATCCTCCTTATAGGGTCCTTCAAGAACACCCATCTCTCAAGCTTCTCTCCAAATGCCAAAGCATTCGAACTTTCAAACAAATCCACGCTCACATCATCAAAACTGGCCTCCACAACACCCTCTTCGCCCTCAGCAAGCTCATCGAGTTCTCTGCTGTTTCACGCTCTGGTGATATCTCTTATGCCATCTCCTTGTTTAATTCCATCGAAGAACCTAATTTATTTATTTGGAATTCCATGATTCGAGGGCTTTCCATGAGTCTGTCACCAGCTCTGGCCTTGGTTTTCTTCGTCAGAATGATTTATTCCGGGGTTGAGCCGAACTCTTATACCTTCCCTTTTCTTTTGAAGTCTTGTGCTAAGCTCGCCTCTGCCCATGAAGGGAAACAGATTCATGCCCATGTTTTGAAGCTTGGGTTTGTGTCTGATGTGTTCATTCATACCTCGCTTATCAATATGTATGCCCAAAGTGGTGAAATGAATAATGCCCAATTGGTTTTTGATCAAA

Coding sequence (CDS)

ATGGCGCTCTCTTCCCCTTCACTCCTACTTTCACCCTCTTTCCATGTCCTTCCCTCTTCCGATCCTCCTTATAGGGTCCTTCAAGAACACCCATCTCTCAAGCTTCTCTCCAAATGCCAAAGCATTCGAACTTTCAAACAAATCCACGCTCACATCATCAAAACTGGCCTCCACAACACCCTCTTCGCCCTCAGCAAGCTCATCGAGTTCTCTGCTGTTTCACGCTCTGGTGATATCTCTTATGCCATCTCCTTGTTTAATTCCATCGAAGAACCTAATTTATTTATTTGGAATTCCATGATTCGAGGGCTTTCCATGAGTCTGTCACCAGCTCTGGCCTTGGTTTTCTTCGTCAGAATGATTTATTCCGGGGTTGAGCCGAACTCTTATACCTTCCCTTTTCTTTTGAAGTCTTGTGCTAAGCTCGCCTCTGCCCATGAAGGGAAACAGATTCATGCCCATGTTTTGAAGCTTGGGTTTGTGTCTGATGTGTTCATTCATACCTCGCTTATCAATATGTATGCCCAAAGTGGTGAAATGAATAATGCCCAATTGGTTTTTGATCAAA

Protein sequence

MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNTLFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQX
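The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code and renders any incomplete or unrecognised codon as `X`; that rule is an assumption that happens to reproduce the trailing `X` in this record, not a documented behaviour of the database. The function name `translate` is illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in TCAG order with the third base varying fastest.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence; partial or unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)

# First 15 nt of the CDS above.
print(translate("ATGGCGCTCTCTTCC"))
# Last 13 nt of the CDS above, assumed to start on a codon boundary.
print(translate("GTTTTTGATCAAA"))
```

The two fragments translate to `MALSS` and `VFDQX`, matching the start and end of the protein sequence; the final lone `A` is what would produce the terminal `X` under this assumption.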
BLAST of CsaV3_1G029810 vs. NCBI nr
Match: XP_004150015.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g08070 [Cucumis sativus] >KGN65308.1 hypothetical protein Csa_1G306800 [Cucumis sativus])

HSP 1 Score: 369.4 bits (947), Expect = 7.4e-99
Identity = 189/189 (100.00%), Positives = 189/189 (100.00%), Query Frame = 0

Query: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60
           MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60

Query: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120
           LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM
Sbjct: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQ 190
           NNAQLVFDQ
Sbjct: 181 NNAQLVFDQ 189

BLAST of CsaV3_1G029810 vs. NCBI nr
Match: XP_008465017.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis melo])

HSP 1 Score: 356.3 bits (913), Expect = 6.5e-95
Identity = 181/189 (95.77%), Positives = 186/189 (98.41%), Query Frame = 0

Query: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60
           MALSSPSLLLSPSFHVLPSSDPPYRVLQEHP+LKLLSKCQ+IRTFKQIHAHIIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPALKLLSKCQNIRTFKQIHAHIIKTGLHNT 60

Query: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120
            FALSKLIEFSAVSRSGDISYAISLF+SIE+PNLFIWNSMIRGLSMSLSP LALVFFVRM
Sbjct: 61  HFALSKLIEFSAVSRSGDISYAISLFSSIEDPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASA EGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQ 190
           NNAQL+FDQ
Sbjct: 181 NNAQLIFDQ 189
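The Identity and Positives figures in each HSP header can be recomputed from the three rows of an alignment block. The sketch below assumes the usual BLAST midline convention (a letter for an identical pair, `+` for a positive-scoring substitution, a space otherwise); `score_alignment` and `percent` are hypothetical helper names introduced for this example.

```python
def score_alignment(query, midline, subject):
    """Count identities and positives from one BLAST pairwise block.

    Assumes the usual midline convention: a letter marks an identical pair,
    '+' marks a positive-scoring substitution, and a space marks a mismatch
    or gap.
    """
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("alignment rows must have equal length")
    identities = sum(1 for m in midline if m.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)

def percent(n, total):
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * n / total, 2)

# Last block of the Cucumis melo alignment above (query 181 onward).
ident, pos, length = score_alignment("NNAQLVFDQ", "NNAQL+FDQ", "NNAQLIFDQ")
print(ident, pos, length)

# Totals reported for the whole HSP: 181 identities and 186 positives over 189.
print(percent(181, 189), percent(186, 189))
```

Summing the per-block counts over all blocks of an HSP should give the totals in its header; for this hit the percentages come out as 95.77 and 98.41.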

BLAST of CsaV3_1G029810 vs. NCBI nr
Match: XP_022961045.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita moschata])

HSP 1 Score: 316.6 bits (810), Expect = 5.7e-83
Identity = 163/189 (86.24%), Positives = 176/189 (93.12%), Query Frame = 0

Query: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60
           MA S+PSL+LSP+    PSSDPPYR+LQ+HPSLKL+SKC+SIRT +QIHA IIKTGLHNT
Sbjct: 1   MATSAPSLVLSPT---SPSSDPPYRLLQDHPSLKLISKCRSIRTLRQIHAQIIKTGLHNT 60

Query: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120
            FALSKLIEFSAVSR  DISYA+SLFNSIEEPNLFIWNSMIRGLS+SLSP LALVFFVRM
Sbjct: 61  QFALSKLIEFSAVSRYADISYAVSLFNSIEEPNLFIWNSMIRGLSISLSPVLALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180
           I++GVEPNSYTFPFLLKSCAKLASA EGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGE+
Sbjct: 121 IHAGVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEI 180

Query: 181 NNAQLVFDQ 190
           N AQLVFDQ
Sbjct: 181 NYAQLVFDQ 186

BLAST of CsaV3_1G029810 vs. NCBI nr
Match: XP_023515625.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 316.6 bits (810), Expect = 5.7e-83
Identity = 163/189 (86.24%), Positives = 176/189 (93.12%), Query Frame = 0

Query: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60
           MA S+PSL+LSP+    PSSDPPYR+LQ+HPSLKL+SKC+SIRT +QIHA IIKTGLHNT
Sbjct: 1   MATSAPSLVLSPT---SPSSDPPYRLLQDHPSLKLISKCRSIRTLRQIHAQIIKTGLHNT 60

Query: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120
            FALSKLIEFSAVSR  DISYA+SLFNSIEEPNLFIWNSMIRGLS+SLSP LALVFFVRM
Sbjct: 61  QFALSKLIEFSAVSRYADISYAVSLFNSIEEPNLFIWNSMIRGLSISLSPVLALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180
           I++GVEPNSYTFPFLLKSCAKLASA EGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGE+
Sbjct: 121 IHAGVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEI 180

Query: 181 NNAQLVFDQ 190
           N AQLVFDQ
Sbjct: 181 NYAQLVFDQ 186

BLAST of CsaV3_1G029810 vs. NCBI nr
Match: XP_022987625.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita maxima])

HSP 1 Score: 313.9 bits (803), Expect = 3.7e-82
Identity = 161/189 (85.19%), Positives = 175/189 (92.59%), Query Frame = 0

Query: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60
           MA S+PSL+LSP+    PSSDPPYR+LQ+HPSLKL+SKC+SIRT +QIHA IIKTGLHNT
Sbjct: 1   MATSAPSLVLSPT---SPSSDPPYRLLQDHPSLKLISKCRSIRTLRQIHAQIIKTGLHNT 60

Query: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120
            FALSKLIEFSAVSR  DISYA+SLFNSIEEPNLFIWNSMIRGLS+SLSP LALVFF RM
Sbjct: 61  QFALSKLIEFSAVSRYADISYAVSLFNSIEEPNLFIWNSMIRGLSISLSPVLALVFFARM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180
           I++GVEPNSYTFPFLLKSCA+LASA EGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGE+
Sbjct: 121 IHAGVEPNSYTFPFLLKSCARLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEI 180

Query: 181 NNAQLVFDQ 190
           N AQLVFDQ
Sbjct: 181 NYAQLVFDQ 186

BLAST of CsaV3_1G029810 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 193.7 bits (491), Expect = 1.0e-49
Identity = 104/193 (53.89%), Positives = 137/193 (70.98%), Query Frame = 0

Query: 1   MALSSPSLLLSPS--FHVLP-SSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGL 60
           M   SP  + S S  FH LP SSDPPY  ++ HPSL LL  C+++++ + IHA +IK GL
Sbjct: 2   MLSCSPLTVPSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGL 61

Query: 61  HNTLFALSKLIEFSAVSRSGD-ISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVF 120
           HNT +ALSKLIEF  +S   + + YAIS+F +I+EPNL IWN+M RG ++S  P  AL  
Sbjct: 62  HNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKL 121

Query: 121 FVRMIYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQ 180
           +V MI  G+ PNSYTFPF+LKSCAK  +  EG+QIH HVLKLG   D+++HTSLI+MY Q
Sbjct: 122 YVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQ 181

Query: 181 SGEMNNAQLVFDQ 190
           +G + +A  VFD+
Sbjct: 182 NGRLEDAHKVFDK 194

BLAST of CsaV3_1G029810 vs. TAIR10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 130.2 bits (326), Expect = 1.4e-30
Identity = 66/161 (40.99%), Positives = 98/161 (60.87%), Query Frame = 0

Query: 30  HPSLKLLSKCQSIRTFKQIHAHIIKTGLHNTLFALSKLIEFSAVSRSGD-ISYAISLFNS 89
           + ++  L +C      KQIHA ++KTGL    +A++K + F   S S D + YA  +F+ 
Sbjct: 15  YETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDG 74

Query: 90  IEEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEG 149
            + P+ F+WN MIRG S S  P  +L+ + RM+ S    N+YTFP LLK+C+ L++  E 
Sbjct: 75  FDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEET 134

Query: 150 KQIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQ 190
            QIHA + KLG+ +DV+   SLIN YA +G    A L+FD+
Sbjct: 135 TQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDR 175

BLAST of CsaV3_1G029810 vs. TAIR10
Match: AT1G31920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 127.5 bits (319), Expect = 8.9e-30
Identity = 70/164 (42.68%), Positives = 102/164 (62.20%), Query Frame = 0

Query: 28  QEHPSLKLLSKCQSIRTFKQIHAHIIKTGL-HNTLFALSKLIEFSAVSR-SGDISYAISL 87
           +E   L LL +C +I  FKQ+HA  IK  L +++ F+ S ++   A S     ++YA S+
Sbjct: 29  KEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASI 88

Query: 88  FNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASA 147
           F  I++P  F +N+MIRG    +S   AL F+  M+  G EP+++T+P LLK+C +L S 
Sbjct: 89  FRGIDDPCTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLKACTRLKSI 148

Query: 148 HEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQ 190
            EGKQIH  V KLG  +DVF+  SLINMY + GEM  +  VF++
Sbjct: 149 REGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEK 192

BLAST of CsaV3_1G029810 vs. TAIR10
Match: AT3G05240.1 (mitochondrial editing factor 19)

HSP 1 Score: 120.6 bits (301), Expect = 1.1e-27
Identity = 61/159 (38.36%), Positives = 99/159 (62.26%), Query Frame = 0

Query: 31  PSLKLLSKCQSIRTFKQIHAHIIKTGLHNTLFALSKLIEF-SAVSRSGDISYAISLFNSI 90
           P L  L  C+S+    Q+H  +IK+ +   +  LS+LI+F +    + ++SYA S+F SI
Sbjct: 8   PILSQLENCRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFESI 67

Query: 91  EEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEGK 150
           + P+++IWNSMIRG S S +P  AL+F+  M+  G  P+ +TFP++LK+C+ L     G 
Sbjct: 68  DCPSVYIWNSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGS 127

Query: 151 QIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFD 189
            +H  V+K GF  ++++ T L++MY   GE+N    VF+
Sbjct: 128 CVHGFVVKTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFE 166

BLAST of CsaV3_1G029810 vs. TAIR10
Match: AT4G18840.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 119.8 bits (299), Expect = 1.8e-27
Identity = 63/160 (39.38%), Positives = 95/160 (59.38%), Query Frame = 0

Query: 31  PSLKLLSKCQSIRTFKQIHAHIIKTGLHNTLFALSKLIEFSAVS-RSGDISYAISLFNSI 90
           P L    + +S+   +Q HA ++KTGL +  F+ SKL+ F+A +     +SYA S+ N I
Sbjct: 41  PILSFTERAKSLTEIQQAHAFMLKTGLFHDTFSASKLVAFAATNPEPKTVSYAHSILNRI 100

Query: 91  EEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEGK 150
             PN F  NS+IR  + S +P +AL  F  M+   V P+ Y+F F+LK+CA      EG+
Sbjct: 101 GSPNGFTHNSVIRAYANSSTPEVALTVFREMLLGPVFPDKYSFTFVLKACAAFCGFEEGR 160

Query: 151 QIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQ 190
           QIH   +K G V+DVF+  +L+N+Y +SG    A+ V D+
Sbjct: 161 QIHGLFIKSGLVTDVFVENTLVNVYGRSGYFEIARKVLDR 200

BLAST of CsaV3_1G029810 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 193.7 bits (491), Expect = 1.8e-48
Identity = 104/193 (53.89%), Positives = 137/193 (70.98%), Query Frame = 0

Query: 1   MALSSPSLLLSPS--FHVLP-SSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGL 60
           M   SP  + S S  FH LP SSDPPY  ++ HPSL LL  C+++++ + IHA +IK GL
Sbjct: 2   MLSCSPLTVPSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGL 61

Query: 61  HNTLFALSKLIEFSAVSRSGD-ISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVF 120
           HNT +ALSKLIEF  +S   + + YAIS+F +I+EPNL IWN+M RG ++S  P  AL  
Sbjct: 62  HNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKL 121

Query: 121 FVRMIYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQ 180
           +V MI  G+ PNSYTFPF+LKSCAK  +  EG+QIH HVLKLG   D+++HTSLI+MY Q
Sbjct: 122 YVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQ 181

Query: 181 SGEMNNAQLVFDQ 190
           +G + +A  VFD+
Sbjct: 182 NGRLEDAHKVFDK 194

BLAST of CsaV3_1G029810 vs. Swiss-Prot
Match: sp|Q9FJY7|PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 2.5e-29
Identity = 66/161 (40.99%), Positives = 98/161 (60.87%), Query Frame = 0

Query: 30  HPSLKLLSKCQSIRTFKQIHAHIIKTGLHNTLFALSKLIEFSAVSRSGD-ISYAISLFNS 89
           + ++  L +C      KQIHA ++KTGL    +A++K + F   S S D + YA  +F+ 
Sbjct: 15  YETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDG 74

Query: 90  IEEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEG 149
            + P+ F+WN MIRG S S  P  +L+ + RM+ S    N+YTFP LLK+C+ L++  E 
Sbjct: 75  FDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEET 134

Query: 150 KQIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQ 190
            QIHA + KLG+ +DV+   SLIN YA +G    A L+FD+
Sbjct: 135 TQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDR 175

BLAST of CsaV3_1G029810 vs. Swiss-Prot
Match: sp|Q9C6T2|PPR68_ARATH (Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H11 PE=2 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 1.6e-28
Identity = 70/164 (42.68%), Positives = 102/164 (62.20%), Query Frame = 0

Query: 28  QEHPSLKLLSKCQSIRTFKQIHAHIIKTGL-HNTLFALSKLIEFSAVSR-SGDISYAISL 87
           +E   L LL +C +I  FKQ+HA  IK  L +++ F+ S ++   A S     ++YA S+
Sbjct: 29  KEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASI 88

Query: 88  FNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASA 147
           F  I++P  F +N+MIRG    +S   AL F+  M+  G EP+++T+P LLK+C +L S 
Sbjct: 89  FRGIDDPCTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLKACTRLKSI 148

Query: 148 HEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQ 190
            EGKQIH  V KLG  +DVF+  SLINMY + GEM  +  VF++
Sbjct: 149 REGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEK 192

BLAST of CsaV3_1G029810 vs. Swiss-Prot
Match: sp|Q9MA95|PP214_ARATH (Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E82 PE=3 SV=2)

HSP 1 Score: 120.6 bits (301), Expect = 2.0e-26
Identity = 61/159 (38.36%), Positives = 99/159 (62.26%), Query Frame = 0

Query: 31  PSLKLLSKCQSIRTFKQIHAHIIKTGLHNTLFALSKLIEF-SAVSRSGDISYAISLFNSI 90
           P L  L  C+S+    Q+H  +IK+ +   +  LS+LI+F +    + ++SYA S+F SI
Sbjct: 8   PILSQLENCRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFESI 67

Query: 91  EEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEGK 150
           + P+++IWNSMIRG S S +P  AL+F+  M+  G  P+ +TFP++LK+C+ L     G 
Sbjct: 68  DCPSVYIWNSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGS 127

Query: 151 QIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFD 189
            +H  V+K GF  ++++ T L++MY   GE+N    VF+
Sbjct: 128 CVHGFVVKTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFE 166

BLAST of CsaV3_1G029810 vs. Swiss-Prot
Match: sp|O49399|PP321_ARATH (Pentatricopeptide repeat-containing protein At4g18840 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E101 PE=3 SV=2)

HSP 1 Score: 119.8 bits (299), Expect = 3.3e-26
Identity = 63/160 (39.38%), Positives = 95/160 (59.38%), Query Frame = 0

Query: 31  PSLKLLSKCQSIRTFKQIHAHIIKTGLHNTLFALSKLIEFSAVS-RSGDISYAISLFNSI 90
           P L    + +S+   +Q HA ++KTGL +  F+ SKL+ F+A +     +SYA S+ N I
Sbjct: 41  PILSFTERAKSLTEIQQAHAFMLKTGLFHDTFSASKLVAFAATNPEPKTVSYAHSILNRI 100

Query: 91  EEPNLFIWNSMIRGLSMSLSPALALVFFVRMIYSGVEPNSYTFPFLLKSCAKLASAHEGK 150
             PN F  NS+IR  + S +P +AL  F  M+   V P+ Y+F F+LK+CA      EG+
Sbjct: 101 GSPNGFTHNSVIRAYANSSTPEVALTVFREMLLGPVFPDKYSFTFVLKACAAFCGFEEGR 160

Query: 151 QIHAHVLKLGFVSDVFIHTSLINMYAQSGEMNNAQLVFDQ 190
           QIH   +K G V+DVF+  +L+N+Y +SG    A+ V D+
Sbjct: 161 QIHGLFIKSGLVTDVFVENTLVNVYGRSGYFEIARKVLDR 200

BLAST of CsaV3_1G029810 vs. TrEMBL
Match: tr|A0A0A0LU28|A0A0A0LU28_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G306800 PE=4 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 4.9e-99
Identity = 189/189 (100.00%), Positives = 189/189 (100.00%), Query Frame = 0

Query: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60
           MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60

Query: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120
           LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM
Sbjct: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQ 190
           NNAQLVFDQ
Sbjct: 181 NNAQLVFDQ 189

BLAST of CsaV3_1G029810 vs. TrEMBL
Match: tr|A0A1S3CMX0|A0A1S3CMX0_CUCME (pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502738 PE=4 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 4.3e-95
Identity = 181/189 (95.77%), Positives = 186/189 (98.41%), Query Frame = 0

Query: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNT 60
           MALSSPSLLLSPSFHVLPSSDPPYRVLQEHP+LKLLSKCQ+IRTFKQIHAHIIKTGLHNT
Sbjct: 1   MALSSPSLLLSPSFHVLPSSDPPYRVLQEHPALKLLSKCQNIRTFKQIHAHIIKTGLHNT 60

Query: 61  LFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRM 120
            FALSKLIEFSAVSRSGDISYAISLF+SIE+PNLFIWNSMIRGLSMSLSP LALVFFVRM
Sbjct: 61  HFALSKLIEFSAVSRSGDISYAISLFSSIEDPNLFIWNSMIRGLSMSLSPVLALVFFVRM 120

Query: 121 IYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180
           IYSGVEPNSYTFPFLLKSCAKLASA EGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM
Sbjct: 121 IYSGVEPNSYTFPFLLKSCAKLASAREGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEM 180

Query: 181 NNAQLVFDQ 190
           NNAQL+FDQ
Sbjct: 181 NNAQLIFDQ 189

BLAST of CsaV3_1G029810 vs. TrEMBL
Match: tr|A0A061GN82|A0A061GN82_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma cacao OX=3641 GN=TCM_038026 PE=4 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 2.9e-67
Identity = 131/190 (68.95%), Positives = 155/190 (81.58%), Query Frame = 0

Query: 1   MALSSPSLLLSP-SFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHN 60
           MAL S S+ +SP   H+LPSSDPPY++LQ HPSL LLSKC++I+T KQ+H HIIKTGLH+
Sbjct: 59  MALPSTSVSISPFPLHLLPSSDPPYKLLQNHPSLSLLSKCRTIQTLKQVHCHIIKTGLHH 118

Query: 61  TLFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVR 120
           T FALSKLIEF AVS  GD+ YA+ LF SI+EPN  IWN+MIRG S+S SP L L F+V+
Sbjct: 119 TQFALSKLIEFCAVSPFGDLPYALLLFESIDEPNQVIWNTMIRGFSLSSSPGLTLEFYVK 178

Query: 121 MIYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGE 180
           MI+SG+ PNSYTFPF+LKSCAK AS  EGKQIH  VLKLG  SD F+HTSLINMYAQ+GE
Sbjct: 179 MIWSGIVPNSYTFPFVLKSCAKTASTQEGKQIHGQVLKLGLESDAFVHTSLINMYAQNGE 238

Query: 181 MNNAQLVFDQ 190
             NA+LVFD+
Sbjct: 239 FGNARLVFDK 248

BLAST of CsaV3_1G029810 vs. TrEMBL
Match: tr|A0A2P5CMB1|A0A2P5CMB1_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_280290 PE=4 SV=1)

HSP 1 Score: 263.5 bits (672), Expect = 3.8e-67
Identity = 131/188 (69.68%), Positives = 161/188 (85.64%), Query Frame = 0

Query: 3   LSSPSLLLSPS-FHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHNTL 62
           +SS SL +SPS F VLPSSDPPY++L+ HPSLKLLS+C +I++ KQ+H HIIK GLHNT 
Sbjct: 4   ISSSSLPISPSNFCVLPSSDPPYKLLESHPSLKLLSQCNNIQSLKQVHTHIIKNGLHNTQ 63

Query: 63  FALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVRMI 122
           FALSKL+EF AVS SGDISYAIS+  +IEEPN FIWN++IRGLS+S  PALA++F+VRMI
Sbjct: 64  FALSKLVEFCAVSPSGDISYAISVLEAIEEPNQFIWNTIIRGLSLSSDPALAILFYVRMI 123

Query: 123 YSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGEMN 182
            SGVEPNSYTFP LLKSCAK+A  HEGKQ+H  VLKLG  SDVF+++SLINMYAQ+ E++
Sbjct: 124 SSGVEPNSYTFPVLLKSCAKMADTHEGKQLHGQVLKLGLDSDVFVNSSLINMYAQNCELD 183

Query: 183 NAQLVFDQ 190
            A+L+FD+
Sbjct: 184 IARLIFDK 191

BLAST of CsaV3_1G029810 vs. TrEMBL
Match: tr|A0A1R3KQ11|A0A1R3KQ11_9ROSI (Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_05756 PE=4 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 1.1e-66
Identity = 128/190 (67.37%), Positives = 157/190 (82.63%), Query Frame = 0

Query: 1   MALSSPSLLLSP-SFHVLPSSDPPYRVLQEHPSLKLLSKCQSIRTFKQIHAHIIKTGLHN 60
           MAL S S+ +SP   H+LPSSDPPY++LQ HPSL LLSKC++++T  Q+H+HIIKTGLH+
Sbjct: 1   MALLSTSVPISPFPLHILPSSDPPYKLLQNHPSLSLLSKCRTMQTLTQVHSHIIKTGLHH 60

Query: 61  TLFALSKLIEFSAVSRSGDISYAISLFNSIEEPNLFIWNSMIRGLSMSLSPALALVFFVR 120
           T FALSKLIEF AVS SGD+ YA+ +F SI+EPN  IWN+MIRG S+S SP +AL ++V+
Sbjct: 61  TQFALSKLIEFCAVSPSGDLPYALLIFESIDEPNQVIWNTMIRGFSLSSSPRMALEYYVK 120

Query: 121 MIYSGVEPNSYTFPFLLKSCAKLASAHEGKQIHAHVLKLGFVSDVFIHTSLINMYAQSGE 180
           MI+SG+ PNSYTFPF+LKSCAK  S  EGKQIH  VLKLG  SD F+HTSLINMYAQ+GE
Sbjct: 121 MIWSGIVPNSYTFPFVLKSCAKTGSTQEGKQIHGQVLKLGLDSDAFVHTSLINMYAQNGE 180

Query: 181 MNNAQLVFDQ 190
           + NAQLVFD+
Sbjct: 181 LGNAQLVFDK 190

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_004150015.1  7.4e-99  100.00    PREDICTED: pentatricopeptide repeat-containing protein At1g08070 [Cucumis sativu...
XP_008465017.1  6.5e-95  95.77     PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic ...
XP_022961045.1  5.7e-83  86.24     pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita ...
XP_023515625.1  5.7e-83  86.24     pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita ...
XP_022987625.1  3.7e-82  85.19     pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucurbita ...

Match Name   E-value  Identity  Description
AT1G08070.1  1.0e-49  53.89     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G66520.1  1.4e-30  40.99     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G31920.1  8.9e-30  42.68     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G05240.1  1.1e-27  38.36     mitochondrial editing factor 19
AT4G18840.1  1.8e-27  39.38     Pentatricopeptide repeat (PPR-like) superfamily protein

Match Name             E-value  Identity  Description
sp|Q9LN01|PPR21_ARATH  1.8e-48  53.89     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
sp|Q9FJY7|PP449_ARATH  2.5e-29  40.99     Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...
sp|Q9C6T2|PPR68_ARATH  1.6e-28  42.68     Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX...
sp|Q9MA95|PP214_ARATH  2.0e-26  38.36     Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis th...
sp|O49399|PP321_ARATH  3.3e-26  39.38     Pentatricopeptide repeat-containing protein At4g18840 OS=Arabidopsis thaliana OX...

Match Name                      E-value  Identity  Description
tr|A0A0A0LU28|A0A0A0LU28_CUCSA  4.9e-99  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G306800 PE=4 SV=1
tr|A0A1S3CMX0|A0A1S3CMX0_CUCME  4.3e-95  95.77     pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Cucumis ...
tr|A0A061GN82|A0A061GN82_THECC  2.9e-67  68.95     Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma c...
tr|A0A2P5CMB1|A0A2P5CMB1_9ROSA  3.8e-67  69.68     DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_280290 ...
tr|A0A1R3KQ11|A0A1R3KQ11_9ROSI  1.1e-66  67.37     Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_05756 PE=4 SV=1
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0031425 chloroplast RNA processing
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_1G029810.1  CsaV3_1G029810.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 92..141; e-value: 1.2E-8; score: 34.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 167..188; e-value: 0.063; score: 13.5
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 163..190; score: 7.859
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 128..162; score: 7.114
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 93..127; score: 9.339
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 30..190; e-value: 1.3E-24; score: 89.1
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 33..184
None       No IPR available                                   PANTHER  PTHR24015:SF527   SUBFAMILY NOT NAMED  coord: 33..184
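The PROSITE PS51375 coordinates in the InterPro results above can be turned into repeat lengths with simple arithmetic. The sketch below assumes 1-based inclusive residue coordinates (an assumption; the record does not state the convention) and uses illustrative names (`ppr_hits`, `hit_length`).

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro results above.
ppr_hits = [(93, 127), (128, 162), (163, 190)]

def hit_length(start, end):
    # Assumption: 1-based inclusive residue coordinates.
    return end - start + 1

lengths = [hit_length(s, e) for s, e in ppr_hits]
# Hits are back-to-back when each one starts right after the previous one ends.
contiguous = all(b[0] == a[1] + 1 for a, b in zip(ppr_hits, ppr_hits[1:]))
print(lengths, contiguous)
```

The first two hits are 35 residues each, the length of a canonical pentatricopeptide motif, and the three hits tile the sequence without gaps. The third hit is only 28 residues long, which may simply reflect the protein model ending at that point.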

The following gene(s) are orthologous to this gene:
Gene            Orthologue       Organism                   Block
CsaV3_1G029810  CmoCh11G005630   Cucurbita moschata (Rifu)  cmocucB0093
CsaV3_1G029810  Cp4.1LG18g04810  Cucurbita pepo (Zucchini)  cpecucB0433
CsaV3_1G029810  Cla97C06G120110  Watermelon (97103) v2      cucwmbB065
CsaV3_1G029810  Bhi02G000867     Wax gourd                  cucwgoB107
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CsaV3_1G029810  Wax gourd                     cucwgoB002
CsaV3_1G029810  Cucumber (Chinese Long) v3    cuccucB006
CsaV3_1G029810  Cucumber (Chinese Long) v3    cuccucB062
CsaV3_1G029810  Silver-seed gourd             carcucB0036
CsaV3_1G029810  Silver-seed gourd             carcucB0041
CsaV3_1G029810  Silver-seed gourd             carcucB0143
CsaV3_1G029810  Silver-seed gourd             carcucB0602
CsaV3_1G029810  Cucumber (Gy14) v2            cgybcucB001
CsaV3_1G029810  Cucumber (Gy14) v2            cgybcucB057
CsaV3_1G029810  Cucumber (Gy14) v1            cgycucB052
CsaV3_1G029810  Cucumber (Gy14) v1            cgycucB188
CsaV3_1G029810  Cucurbita maxima (Rimu)       cmacucB0070
CsaV3_1G029810  Cucurbita maxima (Rimu)       cmacucB0103
CsaV3_1G029810  Cucurbita maxima (Rimu)       cmacucB0925
CsaV3_1G029810  Cucurbita moschata (Rifu)     cmocucB0056
CsaV3_1G029810  Cucurbita moschata (Rifu)     cmocucB0106
CsaV3_1G029810  Cucurbita moschata (Rifu)     cmocucB0917
CsaV3_1G029810  Cucurbita pepo (Zucchini)     cpecucB0102
CsaV3_1G029810  Cucurbita pepo (Zucchini)     cpecucB0802
CsaV3_1G029810  Cucurbita pepo (Zucchini)     cpecucB0818
CsaV3_1G029810  Wild cucumber (PI 183967)     cpicucB000
CsaV3_1G029810  Wild cucumber (PI 183967)     cpicucB070
CsaV3_1G029810  Bottle gourd (USVL1VR-Ls)     cuclsiB006
CsaV3_1G029810  Bottle gourd (USVL1VR-Ls)     cuclsiB070
CsaV3_1G029810  Melon (DHL92) v3.5.1          cucmeB041
CsaV3_1G029810  Melon (DHL92) v3.5.1          cucmeB057
CsaV3_1G029810  Melon (DHL92) v3.5.1          cucmeB079
CsaV3_1G029810  Melon (DHL92) v3.6.1          cucmedB038
CsaV3_1G029810  Melon (DHL92) v3.6.1          cucmedB054
CsaV3_1G029810  Melon (DHL92) v3.6.1          cucmedB073
CsaV3_1G029810  Watermelon (Charleston Gray)  cucwcgB076
CsaV3_1G029810  Watermelon (Charleston Gray)  cucwcgB008
CsaV3_1G029810  Watermelon (97103) v1         cucwmB086
CsaV3_1G029810  Watermelon (97103) v1         cucwmB093
CsaV3_1G029810  Watermelon (97103) v2         cucwmbB018