CU129867 (transcribed_cluster) Cucumber (Chinese Long) v2

Name: CU129867
Type: transcribed_cluster
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Pentatricopeptide repeat-containing protein
Location: Chr1: 20156642..20156871 (+)
Sequence length: 229
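
The location endpoints and the reported sequence length can be compared with simple arithmetic. The sketch below makes no claim about which coordinate convention the database uses; it only shows that an inclusive interval over these endpoints would span 230 positions, while a half-open interval (or a 0-based start) spans 229, which matches the reported length. The function names are hypothetical.

```python
# Hypothetical helpers; the record does not state its coordinate convention.
def inclusive_length(start: int, end: int) -> int:
    """Length when both endpoints are counted (1-based, closed interval)."""
    return end - start + 1


def half_open_length(start: int, end: int) -> int:
    """Length when the start is 0-based or the end is exclusive."""
    return end - start


START, END = 20156642, 20156871  # endpoints from the Location field

print(inclusive_length(START, END))  # 230
print(half_open_length(START, END))  # 229
```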
The following sequences are available for this feature:

transcribed_cluster sequence

CCTGCGTGGCCGATATGTTTGGGCGTGCGGGTCGTCTGGATGAAGCATTTGAAACCATAAATAGTATGCCATTCCCTCCAGATGCTGGTGTTTGGGGAACACTACTCGGGGCCTGCCACATTCATGGAAATGTTGAGCTTGCGGAAGTGGCATCAAAACATCTATTTGATTTAGACCCTTTGAACTCTGGGTACTATGTATTGCTTGCTAATGTGCAGGCTGGGGCTGG
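
The BLAST hits in this record are reported with Query Frame = 3, meaning the nucleotide sequence is translated starting at its third base. As a minimal sketch (assuming the standard genetic code; `translate` and the table-building code are illustrative, not part of the database), the translation can be reproduced as follows:

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# transcribed_cluster sequence of CU129867, split only for line length.
SEQ = (
    "CCTGCGTGGCCGATATGTTTGGGCGTGCGGGTCGTCTGGATGAAGCATTTGAAACCATAAAT"
    "AGTATGCCATTCCCTCCAGATGCTGGTGTTTGGGGAACACTACTCGGGGCCTGCCACATT"
    "CATGGAAATGTTGAGCTTGCGGAAGTGGCATCAAAACATCTATTTGATTTAGACCCTTTG"
    "AACTCTGGGTACTATGTATTGCTTGCTAATGTGCAGGCTGGGGCTGG"
)


def translate(seq: str, frame: int) -> str:
    """Translate a forward reading frame (1, 2 or 3); unknown codons become 'X'."""
    start = frame - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


print(translate(SEQ, 3))
```

The output of frame 3 is the 75-residue peptide that appears as the Query lines in the alignments (CVADMFGRAGRL...NVQAGA); the trailing two bases do not form a complete codon and are ignored.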
Library category    ESTs
fruit (8DPP) library    1
BLAST of CU129867 vs. Swiss-Prot
Match: PP333_ARATH (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 3.1e-28
Identity = 57/75 (76.00%), Positives = 61/75 (81.33%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           CV D+FGRAGRL EA+ET+ SMPFPPDAGVWGTLLGAC +H NVELAEVAS  L DLDP 
Sbjct: 685 CVVDLFGRAGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPS 744

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVL++N  A A
Sbjct: 745 NSGYYVLISNAHANA 759
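
The percentages in the HSP header are the match counts divided by the aligned length (75 residues for this hit). A small sketch of that arithmetic, with a hypothetical helper name:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned positions, as a percentage rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)


print(percent(57, 75))  # identity: 76.0
print(percent(61, 75))  # positives: 81.33
```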

BLAST of CU129867 vs. Swiss-Prot
Match: PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 95.9 bits (237), Expect = 2.0e-19
Identity = 40/75 (53.33%), Positives = 56/75 (74.67%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ DM+GRAG+L+ A + I SM   PDA +WG LL AC +HGNV+L ++AS+HLF+++P 
Sbjct: 593 CMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVEPE 652

Query: 63  NSGYYVLLANVQAGA 122
           + GY+VLL+N+ A A
Sbjct: 653 HVGYHVLLSNMYASA 667

HSP 2 Score: 36.6 bits (83), Expect = 1.4e-01
Identity = 21/56 (37.50%), Positives = 31/56 (55.36%), Query Frame = 3

Query: 6   VADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACH-IHGNVELAEVASKHLFD 65
           +ADM+G+ GRL++A      +P   ++  W TL+ ACH  HG+ E A +  K + D
Sbjct: 492 LADMYGKCGRLEDALSLFYQIP-RVNSVPWNTLI-ACHGFHGHGEKAVMLFKEMLD 545

BLAST of CU129867 vs. Swiss-Prot
Match: PP371_ARATH (Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana GN=PCMP-E20 PE=2 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 4.5e-19
Identity = 41/73 (56.16%), Positives = 53/73 (72.60%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ D+ GR G+L EA++ I +MP  PDA VWGTLLGAC  HGNVE+AE+AS+ LF L+P 
Sbjct: 358 CMIDLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASEALFKLEPT 417

Query: 63  NSGYYVLLANVQA 122
           N G  V+++N+ A
Sbjct: 418 NPGNCVIMSNIYA 430

BLAST of CU129867 vs. Swiss-Prot
Match: PP223_ARATH (Putative pentatricopeptide repeat-containing protein At3g11460 OS=Arabidopsis thaliana GN=PCMP-H52 PE=3 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 9.9e-19
Identity = 41/71 (57.75%), Positives = 50/71 (70.42%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ D+ GRAGRLDEA E I SMP  PD  VWG LLGAC IH NV++AE+A   + + +P 
Sbjct: 399 CLVDLLGRAGRLDEAMEFIESMPVEPDGAVWGALLGACKIHKNVDMAELAFAKVIEFEPN 458

Query: 63  NSGYYVLLANV 122
           N GYYVL++N+
Sbjct: 459 NIGYYVLMSNI 469

BLAST of CU129867 vs. Swiss-Prot
Match: PP417_ARATH (Pentatricopeptide repeat-containing protein At5g44230 OS=Arabidopsis thaliana GN=PCMP-H17 PE=2 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 1.7e-18
Identity = 42/75 (56.00%), Positives = 51/75 (68.00%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ D+ GR GRL EA E I +M   P  GVWG LLGAC IH N E+AE+A++HLF+L+P 
Sbjct: 426 CMVDLLGRTGRLQEALELIKTMSVEPHGGVWGALLGACRIHNNPEIAEIAAEHLFELEPD 485

Query: 63  NSGYYVLLANVQAGA 122
             G Y+LL+NV A A
Sbjct: 486 IIGNYILLSNVYASA 500

BLAST of CU129867 vs. TrEMBL
Match: A0A0A0LW16_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G553510 PE=4 SV=1)

HSP 1 Score: 159.1 bits (401), Expect = 2.1e-36
Identity = 75/75 (100.00%), Positives = 75/75 (100.00%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL
Sbjct: 690 CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 749

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVLLANVQAGA
Sbjct: 750 NSGYYVLLANVQAGA 764

BLAST of CU129867 vs. TrEMBL
Match: A0A067GD08_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003150mg PE=4 SV=1)

HSP 1 Score: 132.5 bits (332), Expect = 2.1e-28
Identity = 60/75 (80.00%), Positives = 67/75 (89.33%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ D+FGRAGRL++A ETINSMPF PDAGVWGTLLGAC +HGNVELAEVAS HLFDLDP 
Sbjct: 693 CMVDLFGRAGRLNKALETINSMPFAPDAGVWGTLLGACRVHGNVELAEVASSHLFDLDPQ 752

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVLL+N+ A A
Sbjct: 753 NSGYYVLLSNIHADA 767

BLAST of CU129867 vs. TrEMBL
Match: V4UIB9_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014257mg PE=4 SV=1)

HSP 1 Score: 132.5 bits (332), Expect = 2.1e-28
Identity = 60/75 (80.00%), Positives = 67/75 (89.33%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ D+FGRAGRL++A ETINSMPF PDAGVWGTLLGAC +HGNVELAEVAS HLFDLDP 
Sbjct: 693 CMVDLFGRAGRLNKALETINSMPFAPDAGVWGTLLGACRVHGNVELAEVASSHLFDLDPQ 752

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVLL+N+ A A
Sbjct: 753 NSGYYVLLSNIHADA 767

BLAST of CU129867 vs. TrEMBL
Match: A0A061G6X6_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_026989 PE=4 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 1.1e-27
Identity = 61/75 (81.33%), Positives = 66/75 (88.00%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           CV D+FGRAGRL+EAFETI SMPF PDAGVWGTLLGAC  HGNVELAE AS+HLFDLDP 
Sbjct: 651 CVVDLFGRAGRLNEAFETIKSMPFSPDAGVWGTLLGACRNHGNVELAEFASRHLFDLDPQ 710

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVLL+N+ A A
Sbjct: 711 NSGYYVLLSNLLADA 725

BLAST of CU129867 vs. TrEMBL
Match: A0A061G867_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_026989 PE=4 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 1.1e-27
Identity = 61/75 (81.33%), Positives = 66/75 (88.00%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           CV D+FGRAGRL+EAFETI SMPF PDAGVWGTLLGAC  HGNVELAE AS+HLFDLDP 
Sbjct: 679 CVVDLFGRAGRLNEAFETIKSMPFSPDAGVWGTLLGACRNHGNVELAEFASRHLFDLDPQ 738

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVLL+N+ A A
Sbjct: 739 NSGYYVLLSNLLADA 753

BLAST of CU129867 vs. NCBI nr
Match: gi|778662050|ref|XP_004135750.2| (PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Cucumis sativus])

HSP 1 Score: 164.1 bits (414), Expect = 9.5e-38
Identity = 75/75 (100.00%), Positives = 75/75 (100.00%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL
Sbjct: 690 CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 749

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVLLANVQAGA
Sbjct: 750 NSGYYVLLANVQAGA 764

BLAST of CU129867 vs. NCBI nr
Match: gi|659118448|ref|XP_008459124.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Cucumis melo])

HSP 1 Score: 157.5 bits (397), Expect = 8.9e-36
Identity = 70/75 (93.33%), Positives = 75/75 (100.00%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+AD+FGRAGRLDEAFETINSMPFPPDAGVWGTLLGACH+HGNVELAEVASK+LF+LDPL
Sbjct: 688 CMADLFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVASKYLFELDPL 747

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVLLANVQAGA
Sbjct: 748 NSGYYVLLANVQAGA 762

BLAST of CU129867 vs. NCBI nr
Match: gi|1009163141|ref|XP_015899806.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21300 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 137.9 bits (346), Expect = 7.3e-30
Identity = 60/75 (80.00%), Positives = 68/75 (90.67%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ D+FGRAGRL+EAFETI SMPF PDAGVWGTLLGAC +HGNVELAEVASK+LFDLDP 
Sbjct: 690 CMVDLFGRAGRLNEAFETIQSMPFSPDAGVWGTLLGACRVHGNVELAEVASKNLFDLDPQ 749

Query: 63  NSGYYVLLANVQAGA 122
           NSGYY+LL+N+ A A
Sbjct: 750 NSGYYILLSNINADA 764

BLAST of CU129867 vs. NCBI nr
Match: gi|1009163145|ref|XP_015899808.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21300 isoform X2 [Ziziphus jujuba])

HSP 1 Score: 137.9 bits (346), Expect = 7.3e-30
Identity = 60/75 (80.00%), Positives = 68/75 (90.67%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ D+FGRAGRL+EAFETI SMPF PDAGVWGTLLGAC +HGNVELAEVASK+LFDLDP 
Sbjct: 656 CMVDLFGRAGRLNEAFETIQSMPFSPDAGVWGTLLGACRVHGNVELAEVASKNLFDLDPQ 715

Query: 63  NSGYYVLLANVQAGA 122
           NSGYY+LL+N+ A A
Sbjct: 716 NSGYYILLSNINADA 730

BLAST of CU129867 vs. NCBI nr
Match: gi|641858851|gb|KDO77573.1| (hypothetical protein CISIN_1g003150mg [Citrus sinensis])

HSP 1 Score: 137.5 bits (345), Expect = 9.6e-30
Identity = 60/75 (80.00%), Positives = 67/75 (89.33%), Query Frame = 3

Query: 3   CVADMFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHIHGNVELAEVASKHLFDLDPL 62
           C+ D+FGRAGRL++A ETINSMPF PDAGVWGTLLGAC +HGNVELAEVAS HLFDLDP 
Sbjct: 693 CMVDLFGRAGRLNKALETINSMPFAPDAGVWGTLLGACRVHGNVELAEVASSHLFDLDPQ 752

Query: 63  NSGYYVLLANVQAGA 122
           NSGYYVLL+N+ A A
Sbjct: 753 NSGYYVLLSNIHADA 767

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
PP333_ARATH  3.1e-28  76.00     Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana GN...
PP348_ARATH  2.0e-19  53.33     Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN...
PP371_ARATH  4.5e-19  56.16     Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana GN...
PP223_ARATH  9.9e-19  57.75     Putative pentatricopeptide repeat-containing protein At3g11460 OS=Arabidopsis th...
PP417_ARATH  1.7e-18  56.00     Pentatricopeptide repeat-containing protein At5g44230 OS=Arabidopsis thaliana GN...

Match Name        E-value  Identity  Description
A0A0A0LW16_CUCSA  2.1e-36  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_1G553510 PE=4 SV=1
A0A067GD08_CITSI  2.1e-28  80.00     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003150mg PE=4 SV=1
V4UIB9_9ROSI      2.1e-28  80.00     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014257mg PE=4 SV=1
A0A061G6X6_THECC  1.1e-27  81.33     Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 OS=T...
A0A061G867_THECC  1.1e-27  81.33     Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 1 OS=T...

Match Name                         E-value  Identity  Description
gi|778662050|ref|XP_004135750.2|   9.5e-38  100.00    PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Cucumis s...
gi|659118448|ref|XP_008459124.1|   8.9e-36  93.33     PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Cucumis melo]
gi|1009163141|ref|XP_015899806.1|  7.3e-30  80.00     PREDICTED: pentatricopeptide repeat-containing protein At4g21300 isoform X1 [Ziz...
gi|1009163145|ref|XP_015899808.1|  7.3e-30  80.00     PREDICTED: pentatricopeptide repeat-containing protein At4g21300 isoform X2 [Ziz...
gi|641858851|gb|KDO77573.1|        9.6e-30  80.00     hypothetical protein CISIN_1g003150mg [Citrus sinensis]
The following terms have been associated with this transcribed_cluster:
Vocabulary: INTERPRO
Term  Definition

This transcribed_cluster is associated with the following gene feature(s):

Feature Name  Unique Name  Type
Csa1G553510   Csa1G553510  gene


The following EST feature(s) are a part of this transcribed_cluster:

Feature Name    Unique Name     Type
FKNP3UI02N3ECG  FKNP3UI02N3ECG  EST


Analysis Name: InterPro Annotations of cucumber unigene v3
Date Performed: 2016-11-16
IPR Term  IPR Description   Source   Source Term      Source Description   Alignment
None      No IPR available  PANTHER  PTHR24015        FAMILY NOT NAMED     coord: 2..77, score: 4.6
None      No IPR available  PANTHER  PTHR24015:SF388  SUBFAMILY NOT NAMED  coord: 2..77, score: 4.6