CsaV3_1G045110 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_1G045110
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: pentatricopeptide repeat-containing protein At3g04760, chloroplastic
Location: chr1 : 30687470 .. 30689897 (+)
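
The location string can be split into chromosome, start, end, and strand to derive the feature length. The sketch below is illustrative only: `parse_location` is a hypothetical helper, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr1 : 30687470 .. 30689897 (+)'.

    Hypothetical helper; assumes 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("chr1 : 30687470 .. 30689897 (+)")

# With inclusive coordinates the span is end - start + 1 bases.
length = end - start + 1
print(chrom, strand, length)
```

Under that assumption the gene feature spans 2428 bp on the plus strand of chr1.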
The following sequences are available for this feature:

Gene sequence (with intron)

TTAAAAAACAATTTAGTTTAGATTTTCTATTAAAGCTTTCTATTAACGATAGGTATTATTTTGATAATTTAACGCGATATCTCTCGGTTCTTCTTAAGAGAGAAAAGGTATTGGATTATCTATGAGGTGTCCAAGTGCGGACATACCTCGTGGATTTCTTATCCAAAGGGCTAATGCTTTTACAATTGAATCCTTTCAATCTTTTGTTGCTCGTTTCTCTCAGCACTAGATAGACAAAAGGAGCGAAACTCGTTTGCAAATTCTGTTCAAAGTTTAAACAAAGGACGATGCTCCATCTCCAACACCACCTAATGGCATAGCAAGCGTCATCAACGAGCATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAG
CGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAAGCTGATGTTGACTATAGAATTTTCAACTTTTGGCTATTTTGTTTTGTTTTCTTTGTTTTTTTTTTTTTTTTTTTTGTGCAATTTCATTAACCCACCAATTGACTTTTAATTTTTCTTTTATTTTTCCTGAAAAAAACAAAGTCTGCTAATAAATAAGACTAGGGAAAGAAGAAAAAAACATTATTAAATTAATTTATTGAATGCCAAAGCAGCTTTTAGTTGGGGATCTATAGAATGCAATGGAAAACAATAAGAGTCAGATCATCAAATATCTTGAAAACATGATAATTTGCAAAAGCATGTTCTTGCCCTTATTTAGAAAATGTTTAAAGCAAAAGATGAATATTAA

mRNA sequence

ATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAGCGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAAGCTGA

Coding sequence (CDS)

ATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAGCGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAAGCTGA

Protein sequence

MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
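
The protein sequence is the translation of the CDS under the standard genetic code. As a rough check, the sketch below translates only the first 30 and last 21 nucleotides of the CDS shown above; `translate` and the codon table construction are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 21 nt of the CDS, copied from the record.
cds_start = "ATGTTTTCATCGGAATTTCTCCCTCAGAGC"
cds_end = "AACCAACTCTTGCAAAGCTGA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues plus stop
```

The two fragments translate to `MFSSEFLPQS` and `NQLLQS*`, matching the start and end of the protein sequence, with the final `TGA` as the stop codon.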
BLAST of CsaV3_1G045110 vs. NCBI nr
Match: XP_004142590.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis sativus] >KGN66736.1 hypothetical protein Csa_1G666460 [Cucumis sativus])

HSP 1 Score: 330.5 bits (846), Expect = 1.2e-86
Identity = 162/162 (100.00%), Positives = 162/162 (100.00%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 162

BLAST of CsaV3_1G045110 vs. NCBI nr
Match: XP_008443759.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis melo])

HSP 1 Score: 314.3 bits (804), Expect = 8.6e-82
Identity = 155/162 (95.68%), Positives = 158/162 (97.53%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 162

BLAST of CsaV3_1G045110 vs. NCBI nr
Match: XP_023515375.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 275.4 bits (703), Expect = 4.4e-70
Identity = 139/162 (85.80%), Positives = 144/162 (88.89%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSE L QSLHF NPL+ PTIPQS S S    RF NKTHLRN  SSAE R+PH P LDN
Sbjct: 1   MFSSELLSQSLHFINPLSNPTIPQSHSSSFTR-RFPNKTHLRNGASSAETREPHDPILDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           R+ HLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RETHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMR RGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANEVFDRMRRRGF 161

BLAST of CsaV3_1G045110 vs. NCBI nr
Match: XP_022960811.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita moschata])

HSP 1 Score: 275.0 bits (702), Expect = 5.8e-70
Identity = 139/162 (85.80%), Positives = 144/162 (88.89%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSE L QSLHF NPL+ PTIPQS S S    RF NKTHLRN  SSAE R+PH P LDN
Sbjct: 1   MFSSELLSQSLHFINPLSNPTIPQSHSSSFTR-RFPNKTHLRNGASSAETREPHDPILDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           R+ HLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RETHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMR RGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRRRGF 161

BLAST of CsaV3_1G045110 vs. NCBI nr
Match: XP_022988060.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita maxima])

HSP 1 Score: 272.7 bits (696), Expect = 2.9e-69
Identity = 138/162 (85.19%), Positives = 143/162 (88.27%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSE L QSLHF NPL+ PTIPQS S S    RF NKTHLRN  SSAE R+PH P LDN
Sbjct: 1   MFSSELLSQSLHFINPLSNPTIPQSHSSSFTR-RFPNKTHLRNGASSAETREPHDPILDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           R+ HLMKLLNRSCRAGKHNESLYFLESVVSKG KPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RETHLMKLLNRSCRAGKHNESLYFLESVVSKGLKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMR RGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRRRGF 161

BLAST of CsaV3_1G045110 vs. TAIR10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 98.6 bits (244), Expect = 1.3e-20
Identity = 53/114 (46.49%), Positives = 70/114 (61.40%), Query Frame = 0

Query: 11  LHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLN 70
           L F+N  + P     RS      R        + T   E RQ H  +L  RD  ++K+ +
Sbjct: 40  LTFSN--SNPNXXXXRSFXXXXARNLQXXXXXDATLPTERRQQHSQSLGFRDTQMLKIFH 99

Query: 71  RSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILE 125
           RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF  RN+ KA+RVMEILE
Sbjct: 100 RSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILE 151

BLAST of CsaV3_1G045110 vs. TAIR10
Match: AT2G39620.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 52.4 bits (124), Expect = 1.1e-06
Identity = 31/86 (36.05%), Positives = 46/86 (53.49%), Query Frame = 0

Query: 74  RAGKHNESL-YFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG-DPDV 133
           RAG H E+L +F      KG  PD    T  +K    S + KK +R+ +++   G + DV
Sbjct: 76  RAGLHREALGFFGYMSEEKGIDPDKYSFTFALKACAGSMDFKKGLRIHDLIAEMGLESDV 135

Query: 134 YSYNAMISGFSKANQIDSANQVFDRM 158
           Y   A++  + KA  + SA QVFD+M
Sbjct: 136 YIGTALVEMYCKARDLVSARQVFDKM 161

BLAST of CsaV3_1G045110 vs. TAIR10
Match: AT3G22670.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 50.8 bits (120), Expect = 3.2e-06
Identity = 28/95 (29.47%), Positives = 48/95 (50.53%), Query Frame = 0

Query: 65  LMKLLNRSCRAGKHNESL-YFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEIL 124
           + K++ R  ++GK+N+++  FLE   S G K D +    L+       +++ A  V   L
Sbjct: 206 MSKVMRRLAKSGKYNKAVDAFLEMEKSYGVKTDTIAMNSLMDALVKENSIEHAHEVFLKL 265

Query: 125 ETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMR 159
                PD  ++N +I GF KA + D A  + D M+
Sbjct: 266 FDTIKPDARTFNILIHGFCKARKFDDARAMMDLMK 300

BLAST of CsaV3_1G045110 vs. TAIR10
Match: AT1G20300.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 47.0 bits (110), Expect = 4.7e-05
Identity = 24/91 (26.37%), Positives = 44/91 (48.35%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG 127
           L+ R  RAG  +E+++    +   G  PD +  + +I      R   +A    + L+   
Sbjct: 192 LIRRYVRAGLASEAVHCFNRMEDYGCVPDKIAFSIVISNLSRKRRASEAQSFFDSLKDRF 251

Query: 128 DPDVYSYNAMISGFSKANQIDSANQVFDRMR 159
           +PDV  Y  ++ G+ +A +I  A +VF  M+
Sbjct: 252 EPDVIVYTNLVRGWCRAGEISEAEKVFKEMK 282

BLAST of CsaV3_1G045110 vs. TAIR10
Match: AT5G27110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 45.4 bits (106), Expect = 1.4e-04
Identity = 23/72 (31.94%), Positives = 40/72 (55.56%), Query Frame = 0

Query: 86  ESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKAN 145
           + +++ G + DVVLC  LI  +F  ++   A  V E  +     DVY +N+++SG+SK +
Sbjct: 28  QRILTLGLRRDVVLCKSLINVYFTCKDHCSARHVFENFDIRS--DVYIWNSLMSGYSKNS 87

Query: 146 QIDSANQVFDRM 158
                 +VF R+
Sbjct: 88  MFHDTLEVFKRL 97

BLAST of CsaV3_1G045110 vs. Swiss-Prot
Match: sp|Q9SR00|PP213_ARATH (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 2.4e-19
Identity = 53/114 (46.49%), Positives = 70/114 (61.40%), Query Frame = 0

Query: 11  LHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLN 70
           L F+N  + P     RS      R        + T   E RQ H  +L  RD  ++K+ +
Sbjct: 40  LTFSN--SNPNXXXXRSFXXXXARNLQXXXXXDATLPTERRQQHSQSLGFRDTQMLKIFH 99

Query: 71  RSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILE 125
           RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF  RN+ KA+RVMEILE
Sbjct: 100 RSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILE 151

BLAST of CsaV3_1G045110 vs. Swiss-Prot
Match: sp|O80647|PP195_ARATH (Pentatricopeptide repeat-containing protein At2g39620 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E33 PE=3 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 2.0e-05
Identity = 31/86 (36.05%), Positives = 46/86 (53.49%), Query Frame = 0

Query: 74  RAGKHNESL-YFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG-DPDV 133
           RAG H E+L +F      KG  PD    T  +K    S + KK +R+ +++   G + DV
Sbjct: 76  RAGLHREALGFFGYMSEEKGIDPDKYSFTFALKACAGSMDFKKGLRIHDLIAEMGLESDV 135

Query: 134 YSYNAMISGFSKANQIDSANQVFDRM 158
           Y   A++  + KA  + SA QVFD+M
Sbjct: 136 YIGTALVEMYCKARDLVSARQVFDKM 161

BLAST of CsaV3_1G045110 vs. Swiss-Prot
Match: sp|Q9LUJ4|PP248_ARATH (Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22670 PE=2 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 5.8e-05
Identity = 28/95 (29.47%), Positives = 48/95 (50.53%), Query Frame = 0

Query: 65  LMKLLNRSCRAGKHNESL-YFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEIL 124
           + K++ R  ++GK+N+++  FLE   S G K D +    L+       +++ A  V   L
Sbjct: 206 MSKVMRRLAKSGKYNKAVDAFLEMEKSYGVKTDTIAMNSLMDALVKENSIEHAHEVFLKL 265

Query: 125 ETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMR 159
                PD  ++N +I GF KA + D A  + D M+
Sbjct: 266 FDTIKPDARTFNILIHGFCKARKFDDARAMMDLMK 300

BLAST of CsaV3_1G045110 vs. Swiss-Prot
Match: sp|Q9LN22|PPR54_ARATH (Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g20300 PE=2 SV=1)

HSP 1 Score: 47.0 bits (110), Expect = 8.4e-04
Identity = 24/91 (26.37%), Positives = 44/91 (48.35%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG 127
           L+ R  RAG  +E+++    +   G  PD +  + +I      R   +A    + L+   
Sbjct: 192 LIRRYVRAGLASEAVHCFNRMEDYGCVPDKIAFSIVISNLSRKRRASEAQSFFDSLKDRF 251

Query: 128 DPDVYSYNAMISGFSKANQIDSANQVFDRMR 159
           +PDV  Y  ++ G+ +A +I  A +VF  M+
Sbjct: 252 EPDVIVYTNLVRGWCRAGEISEAEKVFKEMK 282

BLAST of CsaV3_1G045110 vs. TrEMBL
Match: tr|A0A0A0M3C6|A0A0A0M3C6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G666460 PE=4 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 7.7e-87
Identity = 162/162 (100.00%), Positives = 162/162 (100.00%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 162

BLAST of CsaV3_1G045110 vs. TrEMBL
Match: tr|A0A1S3B9K3|A0A1S3B9K3_CUCME (pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487275 PE=4 SV=1)

HSP 1 Score: 314.3 bits (804), Expect = 5.7e-82
Identity = 155/162 (95.68%), Positives = 158/162 (97.53%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 162

BLAST of CsaV3_1G045110 vs. TrEMBL
Match: tr|A0A061F8R7|A0A061F8R7_THECC (Pentatricopeptide repeat (PPR-like) superfamily protein OS=Theobroma cacao OX=3641 GN=TCM_026185 PE=4 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 2.6e-42
Identity = 94/170 (55.29%), Positives = 120/170 (70.59%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTI-PQSRSDSIPAC-------RFSNKTHLRNVTSSAEFRQ 60
           +FS+E +  SL FT    KPT    S   S+ +C         S   + + V  SAE R 
Sbjct: 3   LFSTELVTHSLPFTTQQLKPTSNSHSHHTSLVSCLNHESQDSSSKSRNNQKVRVSAETRP 62

Query: 61  PHFPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRN 120
            H  + D ++ HLMKLLNRSC+AGK+NE+ YFLE +V KG+KPDVVLCTK+IKGFFN RN
Sbjct: 63  THLLSFDFKETHLMKLLNRSCKAGKYNEAFYFLECMVGKGYKPDVVLCTKMIKGFFNGRN 122

Query: 121 LKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           ++KA RV+EILE YG+PDV++YNA+ISGF K N++D AN+V DRMRSRGF
Sbjct: 123 VEKATRVIEILEKYGEPDVFAYNAIISGFCKMNRLDFANKVLDRMRSRGF 172

BLAST of CsaV3_1G045110 vs. TrEMBL
Match: tr|A0A2P5C367|A0A2P5C367_PARAD (Tetratricopeptide-like helical domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_188020 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 5.8e-42
Identity = 91/170 (53.53%), Positives = 127/170 (74.71%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFT-NPLAKPTIPQSRSDSIPACRFSNKTHLRNVTS-------SAEFRQ 60
           + S+EFLPQSL  T +P  KPT  +    ++ +CR  + +  +N +        S + R 
Sbjct: 3   IISTEFLPQSLAITVHP--KPTSQKLHHSTVVSCRVPSFSESKNFSRKPFDSRVSVDTRA 62

Query: 61  PHFPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRN 120
           PH    D ++ HL+K LNRSC+AGK+NE+LYFL+ ++SKGFKPDV+LCTK+++GFF SRN
Sbjct: 63  PHLHRNDYKENHLLKALNRSCKAGKYNEALYFLQLMISKGFKPDVILCTKVMRGFFYSRN 122

Query: 121 LKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           ++KA+RVMEILE +G+PD++SYNAMISGF KAN+I+ ANQV DRMR++GF
Sbjct: 123 VQKAIRVMEILEKHGEPDLFSYNAMISGFCKANRIELANQVLDRMRAQGF 170

BLAST of CsaV3_1G045110 vs. TrEMBL
Match: tr|A0A2P4K6V3|A0A2P4K6V3_QUESU (Pentatricopeptide repeat-containing protein, chloroplastic OS=Quercus suber OX=58331 GN=CFP56_39233 PE=4 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 7.5e-42
Identity = 96/167 (57.49%), Positives = 122/167 (73.05%), Query Frame = 0

Query: 3   SSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHL----RN---VTSSAEFRQPHF 62
           S++F P +L F  PL KPT    +S    +C   N        RN   V  SA+ R  H 
Sbjct: 5   STDFWPLTLPFIIPL-KPTSHSHQSSVFVSCGVPNXXXXXXXSRNPPKVWVSAKTRPTHL 64

Query: 63  PNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKK 122
            N+D +++HLMK+LNRSC+AGK+ ESLYFLE +V+KG+KPDV+LCTKLIKGFFN RN+ K
Sbjct: 65  QNVDFKESHLMKVLNRSCKAGKYKESLYFLECLVNKGYKPDVILCTKLIKGFFNYRNIPK 124

Query: 123 AMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 163
           A+RVM+ILE +G+PDV+SYNA+ISGF KANQI SAN+V DRM+ RGF
Sbjct: 125 AIRVMQILEEHGEPDVFSYNALISGFCKANQIVSANKVLDRMKRRGF 170
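
Each HSP header above reports identity and positives as a count over the alignment length, followed by the rounded percentage. The sketch below parses one such header and recomputes the percentages from the counts; `parse_hsp_stats` and `percent` are hypothetical helpers, and the pattern tolerates either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 155/162 (95.68%), Positives = 158/162 (97.53%)".
# "Posi?tives" also accepts the misspelt label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return (matches, alignment length, reported %) for both statistics."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    i, il, ip, p, pl, pp = m.groups()
    return {
        "identity": (int(i), int(il), float(ip)),
        "positives": (int(p), int(pl), float(pp)),
    }

def percent(matches, length):
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * matches / length, 2)

# Header of the Cucumis melo hit (XP_008443759.1).
line = "Identity = 155/162 (95.68%), Positives = 158/162 (97.53%), Query Frame = 0"
stats = parse_hsp_stats(line)
recomputed = {k: percent(v[0], v[1]) for k, v in stats.items()}
print(recomputed)
```

For this hit the recomputed values agree with the reported ones: 155/162 gives 95.68% identity and 158/162 gives 97.53% positives.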

The following BLAST results are available for this feature:
NCBI nr
Match Name        E-value    Identity (%)   Description
XP_004142590.1    1.2e-86    100.00         PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
XP_008443759.1    8.6e-82    95.68          PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
XP_023515375.1    4.4e-70    85.80          pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita ...
XP_022960811.1    5.8e-70    85.80          pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita ...
XP_022988060.1    2.9e-69    85.19          pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita ...

TAIR10
Match Name     E-value    Identity (%)   Description
AT3G04760.1    1.3e-20    46.49          Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G39620.1    1.1e-06    36.05          Pentatricopeptide repeat (PPR) superfamily protein
AT3G22670.1    3.2e-06    29.47          Pentatricopeptide repeat (PPR) superfamily protein
AT1G20300.1    4.7e-05    26.37          Pentatricopeptide repeat (PPR) superfamily protein
AT5G27110.1    1.4e-04    31.94          Tetratricopeptide repeat (TPR)-like superfamily protein

Swiss-Prot
Match Name               E-value    Identity (%)   Description
sp|Q9SR00|PP213_ARATH    2.4e-19    46.49          Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...
sp|O80647|PP195_ARATH    2.0e-05    36.05          Pentatricopeptide repeat-containing protein At2g39620 OS=Arabidopsis thaliana OX...
sp|Q9LUJ4|PP248_ARATH    5.8e-05    29.47          Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidop...
sp|Q9LN22|PPR54_ARATH    8.4e-04    26.37          Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidop...

TrEMBL
Match Name                        E-value    Identity (%)   Description
tr|A0A0A0M3C6|A0A0A0M3C6_CUCSA    7.7e-87    100.00         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G666460 PE=4 SV=1
tr|A0A1S3B9K3|A0A1S3B9K3_CUCME    5.7e-82    95.68          pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucumis ...
tr|A0A061F8R7|A0A061F8R7_THECC    2.6e-42    55.29          Pentatricopeptide repeat (PPR-like) superfamily protein OS=Theobroma cacao OX=36...
tr|A0A2P5C367|A0A2P5C367_PARAD    5.8e-42    53.53          Tetratricopeptide-like helical domain containing protein OS=Parasponia andersoni...
tr|A0A2P4K6V3|A0A2P4K6V3_QUESU    7.5e-42    57.49          Pentatricopeptide repeat-containing protein, chloroplastic OS=Quercus suber OX=5...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
Vocabulary: INTERPRO
Term          Definition
IPR011990     TPR-like_helical_dom_sf
IPR002885     Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CsaV3_1G045110.1    CsaV3_1G045110.1    mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment

IPR002885   Pentatricopeptide repeat   PFAM   PF13041   PPR_2
    coord: 374..421   e-value: 1.3E-9    score: 37.9
    coord: 214..248   e-value: 3.1E-9    score: 36.7
    coord: 269..318   e-value: 4.9E-15   score: 55.3
    coord: 129..178   e-value: 7.0E-19   score: 67.6
    coord: 479..526   e-value: 5.9E-8    score: 32.7

IPR002885   Pentatricopeptide repeat   PFAM   PF13812   PPR_3
    coord: 179..210   e-value: 2.3E-5    score: 24.3
    coord: 85..122    e-value: 5.0E-6    score: 26.4

IPR002885   Pentatricopeptide repeat   TIGRFAM   TIGR00756   TIGR00756
    coord: 272..306   e-value: 4.1E-8    score: 30.9
    coord: 413..446   e-value: 4.7E-4    score: 18.1
    coord: 482..516   e-value: 7.7E-5    score: 20.6
    coord: 237..271   e-value: 1.6E-7    score: 29.0
    coord: 307..341   e-value: 7.0E-9    score: 33.3
    coord: 447..480   e-value: 2.2E-8    score: 31.7
    coord: 167..201   e-value: 1.0E-9    score: 36.0
    coord: 342..376   e-value: 3.0E-9    score: 34.5
    coord: 379..411   e-value: 3.1E-5    score: 21.9
    coord: 132..166   e-value: 5.3E-11   score: 40.0
    coord: 202..235   e-value: 1.1E-6    score: 26.4

IPR002885   Pentatricopeptide repeat   PFAM   PF12854   PPR_1
    coord: 335..368   e-value: 1.6E-9    score: 37.2
    coord: 440..473   e-value: 1.8E-13   score: 49.8

IPR002885   Pentatricopeptide repeat   PROSITE   PS51375   PPR
    coord: 270..304   score: 11.937
    coord: 480..514   score: 9.942
    coord: 61..95     score: 7.278
    coord: 165..199   score: 13.362
    coord: 235..269   score: 12.057
    coord: 410..444   score: 10.523
    coord: 200..234   score: 11.641
    coord: 340..374   score: 12.967
    coord: 96..126    score: 6.697
    coord: 375..409   score: 11.181
    coord: 305..339   score: 13.208
    coord: 130..164   score: 13.943
    coord: 445..479   score: 12.54
    coord: 515..549   score: 6.982

IPR011990   Tetratricopeptide-like helical domain superfamily   GENE3D   G3DSA:1.25.40.10
    coord: 64..125    e-value: 5.7E-7    score: 31.2
    coord: 367..474   e-value: 2.0E-30   score: 108.4
    coord: 475..554   e-value: 6.6E-11   score: 43.8
    coord: 126..211   e-value: 1.1E-30   score: 108.3
    coord: 212..366   e-value: 8.7E-49   score: 168.5

None   No IPR available   PANTHER   PTHR24015   FAMILY NOT NAMED
    coord: 10..542

None   No IPR available   PANTHER   PTHR24015:SF572   SUBFAMILY NOT NAMED
    coord: 10..542
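
The PROSITE PS51375 coordinates above can be merged to see how much of the protein the predicted PPR repeats span. This is a minimal sketch, assuming the coordinates are 1-based inclusive residue positions; `merge_intervals` is a hypothetical helper, not part of InterProScan.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or abuts the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# PROSITE PS51375 (PPR) hits, in the order listed in the annotation.
ppr_hits = [
    (270, 304), (480, 514), (61, 95), (165, 199), (235, 269),
    (410, 444), (200, 234), (340, 374), (96, 126), (375, 409),
    (305, 339), (130, 164), (445, 479), (515, 549),
]

merged = merge_intervals(ppr_hits)
covered = sum(end - start + 1 for start, end in merged)
print(len(ppr_hits), merged, covered)
```

Under that assumption the 14 PROSITE hits collapse into two contiguous stretches, 61..126 and 130..549, covering 486 residues.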

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene              Organism                     Block
CsaV3_1G045110    Silver-seed gourd            carcucB0949
CsaV3_1G045110    Cucurbita maxima (Rimu)      cmacucB0099
CsaV3_1G045110    Cucurbita moschata (Rifu)    cmocucB0090
CsaV3_1G045110    Cucurbita pepo (Zucchini)    cpecucB0799
CsaV3_1G045110    Bottle gourd (USVL1VR-Ls)    cuclsiB068
CsaV3_1G045110    Wax gourd                    cucwgoB118