Csa5G221930 (gene) Cucumber (Chinese Long) v2

Name: Csa5G221930
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Appr-1-p processing domain protein; contains IPR002589 (Appr-1-p processing)
Location: Chr5 : 10046165 .. 10050466 (-)
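The location field can be turned into a structured record and a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the function names `parse_location` and `span_length` are hypothetical helpers, not part of the database.

```python
import re

# Pattern for strings such as "Chr5 : 10046165 .. 10050466 (-)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(text):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    _, start, end, _ = parse_location(text)
    return end - start + 1

print(parse_location("Chr5 : 10046165 .. 10050466 (-)"))
print(span_length("Chr5 : 10046165 .. 10050466 (-)"))
```

Under that assumption the gene spans 4302 bases on the minus strand of Chr5.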
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCAAGCCATCGGGCAGTGGAGTGGTTCGCTTCAAAGTCTCTCCCTCAACCGCTTGCGTTATTCAGAAGGGTGACATCACAAAGTGGTTCATCGACGGTTCTTCTGACGCCATTGTCCGTAACTCTATCTTTCTCCTCTACTCGGACTAGGCTATTTTATTAAAGGCCTTATTCGCTTCCTTATTTGTTTACTGCTAAAGCAAAGGCTGGTGAGATGTAATCAGTTGAAATCTCCTGTGTCTTTGAGTAATGATCAGAACCATCTTTTGTTTGAATGATGGGATGGTTTGTTTTTTATGAATCGCTTTAAGTTCGTAGTTGAATTGAAATGGGAACATTACTTAGGGGTTGGATTGCTTCCCCATTGCTTGACTTTGTCCTGTAGGCTCTTAGAATCATCAATACTCACTGAGAATTGTTTTGTAGAACTTTGATTTTTTTCCCCTTTTTGTAAGAAACAATTTTATTGATGAATGAAATATACTAGTAAAAAGTAAAAACTCCAAACACTGAAATTCAAAAAGACTCTCCCATTGAAGACTAAATGGAAAAAGCTTTATAAAATAATAATTGCTTGTTCTTGCACCACTAGAGAGTACAAGTTCCAGAAAGTGATCAAAAGAGCAGGAAACCTCCATGAAGAGTTGGCCTTTTTGAGCAATCCAAAGAATTCAGAATAAAGCACTATAGAAAGCAAGCCATATATTCTTTTTAATATCATGAAAAGGATGAGCGACCAAAATAATTGTCATGAGGCATGTAGTATTGTCGGGGATAGGCAAAGACCAACAAAAAGACTTCAAAATGATGGTTCAAAAACAAGAATCAAAAGGACAGAGTCAAAATTGTGAAGTATTCTTTTATAGAATTGTGATTGGGGGCTTTTATTTGTATAGTTTTAGGTTACTGTATGCAGTAGAGTTCATTAGAAATGATTTCAATTTCCATGATTTCCAAAGGATGACAACTTTGGTGAGATCACATCAAGTGAAAACAACTAAAACGTAGGCACTACTTTTTATTATTTTCTTATCTTCTTTAAACTACGTAAGTGAAAAAAATAATTTTTAAGCTCAAATTCTAAATGGTTCCCAAGTCCCAATTTATATATATATATAATTTATTTGATTAAAAGTGTAATTGCATTCTAAACTAAACCATAACTCCAAGAATGAAGATAAAGAAGGAGGGCACCAAACTAATGAACTACCGTGACGGTCCAAGGCAAGGTGACCCTTTGTCACCTTCTCTCTTTCTCTTGGTGGTCAATGACCTAAGCAGGTTAGTGCTCTTAGGTGTAGAGGAAGAAGTGGATAGTGGTTGAGGGGCTTAAGGTGGGGAAGGATTTGGTTCACCTTTTGCCTTTTCTTTTCGAATCGGATTCAAAGTAGGGCTGAATCTTAGTGGATCGTGACAACAAGACCACTTTTCCACTTATGATAACCTATCACATATTTAAGTCATCTGCAAAGGATTCGATTCCCACTCAATGGGAATTACGTATTTTCAATTTATAGTTGGCATACTCTTTTCTTGATCGGGTCAAGTGAGTTCCTTTCTTAACCTTAATAGTTTTTATCACTTTTTGAAGAGATTTTTGATCTTATGAATAACGACAGGAAGAGCTCCATCTTGAGGATGAATTATTGCTGGTTCAAGTTAGGAGCCTAGGTAGCTATGTTTCTCTTGGGTTGTAATCCGATTAGCCTGTTTTTCATGGAACCCTATGTTTGAAAGTTCAAAGTGGTTGTCCACATGGAAAAAAGAACTTGTTTTTCTAAAGGGCAGCTTATCCTCATCTAGGTGATGCTGAGTGAAAATTTCATTTATTTATGGTCTCCTTTTAGAATTGCTAATTTTGTGAGTGTCTAGGGATCATGAGGAATTTCTAGTCTGAAGGGGTTGAGAAGGGGGAAGGTCTGCGGTTGGTTTAGCGAGAGGTCTAAGAATTATTATCAGGTTGATGGAAGTCTAAAGCTCCAAATAGAATTGGTATT
CTATCTTGGGTTTTGCTTAAAAATAGTGTAAATACTATTGAGGTGCCTCAGAAGAAGATTTCTTGTCTATTTCTTCTCCTTCTGTTGACCTCTCTAAGATTTCTTGTCCAAAAGTGTTTATTATTGATTCAGAGTTCTTGAATATTTCCAGGAACAATTGAATTCATAGAGAGGTTAGGAGGTTATGGGAGGGGGATAAGTTTAACATCTCCTTTGAGGGTCGATCTTCAAGCCTTTTGTAGCATGATAGTGGTATTGTTTTTTTAGGCTAGCGTCATTTTCTGTAGCTAAGGTTAGAGTCAAACTCCTATTTTTGTTGGGGCTTGTTTTTTGTATTTTCTTGTATTCTTTCATTTTTCTCAATGAAATTGTAGTTTCTTACATAAAAAAAGGTTATCGACTACCTTGCCTTGCATGCCAAAATTGCAAACTGTCTTCCAAAATGATTTCCTGTTTGGCCATTACATGGTGACAAATATTTTCCATGGTTCATCTCTAATATTATGCATCTTAATTCCAATGTCGACGTCAGTCTTGACTGAATTCTTTGTGATCCTACTAGCTCTGTTTGGATCAATGCACGCACTATGAAATTTAATGTACTCATCCAACTTTTTTGGGATGCTGTGTTGTATCTCCTGATTAAGCACCTTTATTTAGCTCATTGTGCGGTGCAAGTGATGTAATCATGCATCAAGCAGATGAATATTCAAGAGAAGCACCTTTGTAATTATAGAAGATTTAACCCCACTTGCTCAGTAAATTAACTAACCAACAAACTCTAATTTAAAATTAAAAAACTATTATTATTTACTAATTACACTTTCTATCAAAAAACTATTACTTATTATACTACATCTGCAAGAAGGACCCACATGGACCATATGTTTTATCCTGTATTTTCTTATTGTTAGGAGTTTCCAATACCATGAAATCTATTAAACTTAGTTTTCTTTGGCTTAAATTTTATTTAGGTCAATCCAGCAAATCAGGTAATGCTTGGAGGTGGTGGTGCTGATGGAGGTAAGGAGTTTCAACATTATTTTGGTTTTATAACTTGGATGAACTTACCATCTTGAATATTTTTTCCTCACTGTTTTGTAGCCATACATAATGCTGCTGGGCCAGATCTCATACAAGCATGTTATTCTGTCCAAGAAGTCCAACCTGGAATCCGTTGTCCAACTGGAGAAGCAAGGATTACGCCGTAGGATCTTTTGATTTCGTAGCTGATCATGATCAATAGTTTCATTTGTACATGCTAATGTGATGTTAAAATGCAGAGGTTTTCAGTTGCCAGCATCTCATGTAATCCATACTGTTGGACCCATCTACAATGCTAGTAGAAACCCCCAGGCCCTATTGAGAAGCGCATATAGGTTCTTGTTATGTAATTGCATTTTTCTTTTCAGTTTGATTTATCATTATGGACTCAACGTTTTCTATTGTCATGTTGATTTAGAAATTCCTTGGCCGTGGCAAAGGAGAATAACATTCAATATATTGCTTTTCCTGCCATATCCTGTGGTGTATTTCGGTAAGTTATCATACTGATTAGAAATTTCAGTATGTGTTTGCTTTATACCCTTCTTTAATCATTAAAAGAAAAATTAGCAGTTTCATAATTGATTGATTAGGTTCTTCTTAATTTCGTTTTCTAGTGGTTTGTGTGTGTGCACGTATGCTGGTTTGTTCCATTTTTCTATCATAAAATTTAGTTGGTTAATGGTTGATTGATCCAGATTCCTTTGAACATATACTTTTATTGATTTTGATTGTAACTTGCAGATATCCTTACGATGAAGCTGCCACAATAGCCTTATCTACCATTAAAGAGTTTTCCCAGGGCCTGAAAGAAGTAAGAATTTTCCTATGGTACACTTGGAGAATCTATCATCTTTTTGTTGCAGTATGCTAATATGTTCTTCACATATTTTGGGCGTGTTTGGATTATCCCATCAAGGTCCTCCTATCTTTGACCTCTTATCTTTGATTGGAAA
GTTTTCACTTAGCCCAAGGAGAGGGATTAGTGTCTTTTGGGGCTTTGTCGGTTTTTCTTGTAGTTCCTTCTATGGATGGTGAATTCTCACTCGCTAGTTCCTGTCTTTTGCTCACTATGGAAGTTCAAATTTCCCAAGCAGCTTAAATTGTTTGTGGTATGTCTTAAATGGAAGGGTTAATACCTTAGATCATGTTTCAAGGGGTGACTAAGGATCTGACCACCTTCCCGGGGGTTCTTAATTTTTATTTTCAGTTTGAAGCTGCTCCTTTCAGGCTTTCGATTTTCTTTAGGTTGACCTAG

mRNA sequence

ATGGCCAAGCCATCGGGCAGTGGAGTGGTTCGCTTCAAAGTCTCTCCCTCAACCGCTTGCGTTATTCAGAAGGGTGACATCACAAAGTGGTTCATCGACGGTTCTTCTGACGCCATTGTCAATCCAGCAAATCAGGTAATGCTTGGAGGTGGTGGTGCTGATGGAGCCATACATAATGCTGCTGGGCCAGATCTCATACAAGCATGTTATTCTGTCCAAGAAGTCCAACCTGGAATCCGTTGTCCAACTGGAGAAGCAAGGATTACGCCAGGTTTTCAGTTGCCAGCATCTCATGTAATCCATACTGTTGGACCCATCTACAATGCTAGTAGAAACCCCCAGGCCCTATTGAGAAGCGCATATAGAAATTCCTTGGCCGTGGCAAAGGAGAATAACATTCAATATATTGCTTTTCCTGCCATATCCTGTGGTGTATTTCGATATCCTTACGATGAAGCTGCCACAATAGCCTTATCTACCATTAAAGAGTTTTCCCAGGGCCTGAAAGAATTCCTTCTATGGATGGTGAATTCTCACTCGCTAGTTCCTGTCTTTTGCTCACTATGGAAGTTCAAATTTCCCAAGCAGCTTAAATTGTTTGTGTTTGAAGCTGCTCCTTTCAGGCTTTCGATTTTCTTTAGGTTGACCTAG

Coding sequence (CDS)

ATGGCCAAGCCATCGGGCAGTGGAGTGGTTCGCTTCAAAGTCTCTCCCTCAACCGCTTGCGTTATTCAGAAGGGTGACATCACAAAGTGGTTCATCGACGGTTCTTCTGACGCCATTGTCAATCCAGCAAATCAGGTAATGCTTGGAGGTGGTGGTGCTGATGGAGCCATACATAATGCTGCTGGGCCAGATCTCATACAAGCATGTTATTCTGTCCAAGAAGTCCAACCTGGAATCCGTTGTCCAACTGGAGAAGCAAGGATTACGCCAGGTTTTCAGTTGCCAGCATCTCATGTAATCCATACTGTTGGACCCATCTACAATGCTAGTAGAAACCCCCAGGCCCTATTGAGAAGCGCATATAGAAATTCCTTGGCCGTGGCAAAGGAGAATAACATTCAATATATTGCTTTTCCTGCCATATCCTGTGGTGTATTTCGATATCCTTACGATGAAGCTGCCACAATAGCCTTATCTACCATTAAAGAGTTTTCCCAGGGCCTGAAAGAATTCCTTCTATGGATGGTGAATTCTCACTCGCTAGTTCCTGTCTTTTGCTCACTATGGAAGTTCAAATTTCCCAAGCAGCTTAAATTGTTTGTGTTTGAAGCTGCTCCTTTCAGGCTTTCGATTTTCTTTAGGTTGACCTAG

Protein sequence

MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNAAGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHSLVPVFCSLWKFKFPKQLKLFVFEAAPFRLSIFFRLT*
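The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code, which the page does not name explicitly; `translate` is a hypothetical helper, and only the first 18 and last 12 bases of the CDS are used here for brevity.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six codons and last four codons of the CDS listed above.
print(translate("ATGGCCAAGCCATCGGGC"))  # start of the protein
print(translate("AGGTTGACCTAG"))        # end of the protein, including the stop
```

The two fragments give `MAKPSG` and `RLT*`, which agree with the start and end of the protein sequence shown above.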
BLAST of Csa5G221930 vs. Swiss-Prot
Match: Y3343_XANAC (Macro domain-containing protein XAC3343 OS=Xanthomonas axonopodis pv. citri (strain 306) GN=XAC3343 PE=3 SV=2)

HSP 1 Score: 146.7 bits (369), Expect = 2.9e-34
Identity = 72/149 (48.32%), Positives = 100/149 (67.11%), Query Frame = 1

Query: 22  IQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNAAGPDLIQACYSVQEVQPGIRC 81
           + +GDIT+  +D     IVN AN+ +LGGGG DGAIH AAGP L++AC ++ +V+PG+RC
Sbjct: 5   VWQGDITELDVD----VIVNAANESLLGGGGVDGAIHRAAGPRLLEACEALPQVRPGVRC 64

Query: 82  PTGEARITPGFQLPASHVIHTVGPIYNASR-NPQALLRSAYRNSLAVAKENNIQYIAFPA 141
           PTGE RIT GF L A H+ HTVGP++   R N    L + Y  SL +A++  +  IAFPA
Sbjct: 65  PTGEIRITDGFDLKARHIFHTVGPVWRDGRHNEPEQLANCYWQSLKLAEQMMLHSIAFPA 124

Query: 142 ISCGVFRYPYDEAATIALSTIKEFSQGLK 170
           ISCG++ YP  +AA IA++  +++ +  K
Sbjct: 125 ISCGIYGYPLHQAARIAVTETRDWQRSHK 149
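The percentages in the HSP header follow directly from the match counts and the alignment length. A minimal sketch of that arithmetic, using the counts reported for this hit (`percent` is a hypothetical helper name):

```python
def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage with 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts from the HSP header of the Y3343_XANAC hit: 149 aligned columns.
print(percent(72, 149))   # identical residues
print(percent(100, 149))  # identical or conservatively substituted residues
```

This gives 48.32 and 67.11, the two percentages reported in the header. The denominator is the alignment length including gap columns, not the length of either protein.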

BLAST of Csa5G221930 vs. Swiss-Prot
Match: Y3184_XANCP (Macro domain-containing protein XCC3184 OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) GN=XCC3184 PE=3 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 3.7e-34
Identity = 72/149 (48.32%), Positives = 100/149 (67.11%), Query Frame = 1

Query: 22  IQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNAAGPDLIQACYSVQEVQPGIRC 81
           + +GDIT+  +D     IVN AN+ +LGGGG DGAIH AAGP L++AC ++ EV+PG+RC
Sbjct: 5   VWQGDITQLDVD----VIVNAANESLLGGGGVDGAIHRAAGPRLLEACEALPEVRPGVRC 64

Query: 82  PTGEARITPGFQLPASHVIHTVGPIY-NASRNPQALLRSAYRNSLAVAKENNIQYIAFPA 141
           PTGE RIT GF L A H+ HTVGP++ +   N    L + Y  SL +A++  +  IAFPA
Sbjct: 65  PTGEIRITDGFDLKARHIFHTVGPVWRDGKHNEPEQLANCYWQSLKLAEQMMLHSIAFPA 124

Query: 142 ISCGVFRYPYDEAATIALSTIKEFSQGLK 170
           ISCG++ YP  +AA IA++  +++ +  K
Sbjct: 125 ISCGIYGYPLYQAARIAVTETRDWQRSHK 149

BLAST of Csa5G221930 vs. Swiss-Prot
Match: Y4103_VIBPA (Macro domain-containing protein VPA0103 OS=Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) GN=VPA0103 PE=3 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 3.7e-34
Identity = 81/172 (47.09%), Positives = 105/172 (61.05%), Query Frame = 1

Query: 19  ACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNAAGPDLIQACYSVQEVQPG 78
           A  + +GDIT   +D    AIVN AN  MLGGGG DGAIH AAGP LI ACY+V +V  G
Sbjct: 3   AISLVQGDITTAHVD----AIVNAANPRMLGGGGVDGAIHRAAGPALINACYAVDDVD-G 62

Query: 79  IRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSAYRNSLAVAKENNIQYIAF 138
           IRCP G+ARIT    L A +VIH VGPIY+   +P+ +L SAY+ SL +A  N+ Q +A 
Sbjct: 63  IRCPFGDARITEAGNLNARYVIHAVGPIYDKFADPKTVLESAYQRSLDLALANHCQSVAL 122

Query: 139 PAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHSLVPVFCSLWK 191
           PAISCGV+ YP  EAA +A++  +       +   ++ +   L     S+W+
Sbjct: 123 PAISCGVYGYPPQEAAEVAMAVCQRPEYAALDMRFYLFSEEML-----SIWQ 164

BLAST of Csa5G221930 vs. Swiss-Prot
Match: Y334_RALSO (Macro domain-containing protein RSc0334 OS=Ralstonia solanacearum (strain GMI1000) GN=RSc0334 PE=3 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 1.2e-32
Identity = 71/137 (51.82%), Positives = 95/137 (69.34%), Query Frame = 1

Query: 37  DAIVNPANQVMLGGGGADGAIHNAAGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPA 96
           DAIVN AN  +LGGGG DGAIH AAGP+L++AC ++        C TG+A+ITPGF LPA
Sbjct: 21  DAIVNAANSALLGGGGVDGAIHRAAGPELLEACRALH------GCRTGQAKITPGFLLPA 80

Query: 97  SHVIHTVGPIYNASRNPQ-ALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAAT 156
            ++IHTVGPI+   R  + ALL + YRNSLA+AK+++++ IAFP IS GV+ +P   AA 
Sbjct: 81  RYIIHTVGPIWRGGRQDEAALLAACYRNSLALAKQHDVRTIAFPCISTGVYGFPPQLAAP 140

Query: 157 IALSTIKEFSQGLKEFL 173
           IA+ T++E    L + +
Sbjct: 141 IAVRTVREHGADLDDIV 151

BLAST of Csa5G221930 vs. Swiss-Prot
Match: Y3408_LACPL (Macro domain-containing protein lp_3408 OS=Lactobacillus plantarum (strain ATCC BAA-793 / NCIMB 8826 / WCFS1) GN=lp_3408 PE=3 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 7.8e-32
Identity = 75/143 (52.45%), Positives = 95/143 (66.43%), Query Frame = 1

Query: 25  GDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNAAGPDLIQACYSVQEVQPGIRCPTG 84
           GDITK  +D    AIVN AN  +LGGGG DGAIH AAGP L+ AC      +P   C TG
Sbjct: 9   GDITKMTVD----AIVNAANTSLLGGGGVDGAIHRAAGPALLAAC------RPLHGCATG 68

Query: 85  EARITPGFQLPASHVIHTVGPIYNASR-NPQALLRSAYRNSLAVAKENNIQYIAFPAISC 144
           EA+ITPGF+LPA +VIHT GP++   + N   LL ++YRNSL +A EN+ Q +AFP+IS 
Sbjct: 69  EAKITPGFRLPAKYVIHTPGPVWQGGQHNELQLLANSYRNSLNLAAENHCQTVAFPSIST 128

Query: 145 GVFRYPYDEAATIALSTIKEFSQ 167
           GV+ +P   AA +AL T++  +Q
Sbjct: 129 GVYHFPLSIAAPLALKTLQATAQ 141

BLAST of Csa5G221930 vs. TrEMBL
Match: A0A0A0KSJ5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G221930 PE=4 SV=1)

HSP 1 Score: 439.5 bits (1129), Expect = 2.4e-120
Identity = 216/216 (100.00%), Positives = 216/216 (100.00%), Query Frame = 1

Query: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 60
           MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA
Sbjct: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 60

Query: 61  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 120
           AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA
Sbjct: 61  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 120

Query: 121 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHS 180
           YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHS
Sbjct: 121 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHS 180

Query: 181 LVPVFCSLWKFKFPKQLKLFVFEAAPFRLSIFFRLT 217
           LVPVFCSLWKFKFPKQLKLFVFEAAPFRLSIFFRLT
Sbjct: 181 LVPVFCSLWKFKFPKQLKLFVFEAAPFRLSIFFRLT 216

BLAST of Csa5G221930 vs. TrEMBL
Match: E5GB75_CUCME (Appr-1-p processing enzyme family protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 291.2 bits (744), Expect = 1.0e-75
Identity = 150/181 (82.87%), Positives = 158/181 (87.29%), Query Frame = 1

Query: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITK--------WFIDGSSDA---IVNPANQVMLG 60
           MA  S SGVV FKVSPST CVIQK  + K        + +DGS      +VNPAN+VMLG
Sbjct: 54  MANESRSGVVGFKVSPSTDCVIQKEGVEKEEGLSKNYYQVDGSPKLKIELVNPANEVMLG 113

Query: 61  GGGADGAIHNAAGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNA 120
           GGGADGAIHNAAGPDL++ACYSVQEVQPGIRCPTGEARITPGF+LPASHVIHTVGPIYNA
Sbjct: 114 GGGADGAIHNAAGPDLVRACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNA 173

Query: 121 SRNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLK 171
           SRNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLK
Sbjct: 174 SRNPQALLRSAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLK 233

BLAST of Csa5G221930 vs. TrEMBL
Match: G7I3E3_MEDTR (Appr-1-P processing enzyme family protein OS=Medicago truncatula GN=MTR_1g007640 PE=2 SV=1)

HSP 1 Score: 268.5 bits (685), Expect = 7.2e-69
Identity = 134/178 (75.28%), Positives = 151/178 (84.83%), Query Frame = 1

Query: 1   MAKPSGSG-VVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHN 60
           MA  +G+G VVRF +S S A +IQKGDITKW IDGS+DAIVNPAN+ MLGGGGADGAIH 
Sbjct: 39  MASSNGNGGVVRFPLSSSNALIIQKGDITKWSIDGSTDAIVNPANERMLGGGGADGAIHR 98

Query: 61  AAGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRS 120
           AAGPDL++AC +V EV+PG+RCPTGEARITPGF LPASHVIHTVGPIY+   NP A L S
Sbjct: 99  AAGPDLLRACRNVPEVRPGVRCPTGEARITPGFLLPASHVIHTVGPIYDVDSNPAASLAS 158

Query: 121 AYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE--FLLWM 176
           AYRNSL VAKENNIQYIAFPAISCGV+ YPYDEAAT+A+STIKEF    KE  F+L+M
Sbjct: 159 AYRNSLRVAKENNIQYIAFPAISCGVYGYPYDEAATVAISTIKEFQNDFKEVHFVLFM 216

BLAST of Csa5G221930 vs. TrEMBL
Match: K4C1Q7_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 4.7e-68
Identity = 126/165 (76.36%), Positives = 144/165 (87.27%), Query Frame = 1

Query: 6   GSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNAAGPDL 65
           G   V F+++PS+   IQKGDIT+W +DGSSDAIVNPAN+ MLGGGGADGAIH AAGP+L
Sbjct: 5   GENPVTFQLTPSSLLKIQKGDITRWSVDGSSDAIVNPANERMLGGGGADGAIHRAAGPEL 64

Query: 66  IQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSAYRNSL 125
             ACY V+EVQPGIRCPTGEARITPGF+LPASHVIHTVGP+Y+A+ NP+A L +AYRNSL
Sbjct: 65  RDACYKVREVQPGIRCPTGEARITPGFRLPASHVIHTVGPVYDANPNPKASLTNAYRNSL 124

Query: 126 AVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE 171
            VAKENNIQYIAFPAISCGVF YPYDEAAT+A+ST+KEF   LKE
Sbjct: 125 RVAKENNIQYIAFPAISCGVFGYPYDEAATVAISTVKEFGSDLKE 169

BLAST of Csa5G221930 vs. TrEMBL
Match: W9S9J7_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003426 PE=4 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 4.7e-68
Identity = 123/182 (67.58%), Positives = 154/182 (84.62%), Query Frame = 1

Query: 2   AKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNAA 61
           ++ S  GVVRF +S +++ VIQ+GDITKWF+DGS+DAI+NPAN+ MLGGGGADGAIH AA
Sbjct: 5   SRASSGGVVRFPLSSTSSLVIQRGDITKWFVDGSTDAIINPANERMLGGGGADGAIHRAA 64

Query: 62  GPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSAY 121
           GPDL+QACY V EV PG+RCPTGEARITPGF+LP +HV+ TVGP+YN     +A LR+ Y
Sbjct: 65  GPDLLQACYGVPEVSPGVRCPTGEARITPGFRLPVAHVVFTVGPMYNGRSTAEAALRNTY 124

Query: 122 RNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE--FLLWMVNSH 181
           RNSL +AKENNIQYIAFPAISCGV+ YPYDEA+T+A+STIKEF+ G+KE  F+L+  + +
Sbjct: 125 RNSLKIAKENNIQYIAFPAISCGVYGYPYDEASTVAISTIKEFANGIKEVHFVLFQEDIY 184

BLAST of Csa5G221930 vs. TAIR10
Match: AT2G40600.1 (AT2G40600.1 appr-1-p processing enzyme family protein)

HSP 1 Score: 237.3 bits (604), Expect = 9.1e-63
Identity = 115/166 (69.28%), Positives = 130/166 (78.31%), Query Frame = 1

Query: 5   SGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNAAGPD 64
           SG     F +S S+   I KGDITKW +D SSDAIVNPAN+ MLGGGGADGAIH AAGP 
Sbjct: 67  SGDEGAVFNLSDSSLLKILKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRAAGPQ 126

Query: 65  LIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSAYRNS 124
           L  ACY V EV+PG+RCPTGEARITPGF LPAS VIHTVGPIY++  NPQ  L ++Y+NS
Sbjct: 127 LRAACYEVPEVRPGVRCPTGEARITPGFNLPASRVIHTVGPIYDSDVNPQESLTNSYKNS 186

Query: 125 LAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE 171
           L VAKENNI+YIAFPAISCG++ YP+DEAA I +STIK+FS   KE
Sbjct: 187 LRVAKENNIKYIAFPAISCGIYGYPFDEAAAIGISTIKQFSTDFKE 232

BLAST of Csa5G221930 vs. TAIR10
Match: AT1G69340.1 (AT1G69340.1 appr-1-p processing enzyme family protein)

HSP 1 Score: 73.2 bits (178), Expect = 2.3e-13
Identity = 59/200 (29.50%), Positives = 93/200 (46.50%), Query Frame = 1

Query: 2   AKPSGSGVV-RFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 61
           A  SG+G+V +F V       I       W ++   DA+VN  N+ +     + G +H A
Sbjct: 66  AGSSGNGMVSKFPVDHEINSRIYLWRGEPWNLE--VDAVVNSTNENLDEAHSSPG-LHVA 125

Query: 62  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQA--LLR 121
           AGP L + C ++        C TG A++T  + LPA  VIHTVGP Y    +  A   L 
Sbjct: 126 AGPGLAEQCATLGG------CRTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALS 185

Query: 122 SAYRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE---FLLWM 181
             YR+ L +  ++ +Q IA   I      YP + AA +A+ T++ F +  K+    +++ 
Sbjct: 186 HCYRSCLELLIDSGLQSIALGCIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKISAVVFC 245

Query: 182 VNSHSLVPVFCSLWKFKFPK 196
             + S   ++  L    FP+
Sbjct: 246 TTTSSDTEIYKRLLPLYFPR 256

BLAST of Csa5G221930 vs. NCBI nr
Match: gi|700195554|gb|KGN50731.1| (hypothetical protein Csa_5G221930 [Cucumis sativus])

HSP 1 Score: 439.5 bits (1129), Expect = 3.4e-120
Identity = 216/216 (100.00%), Positives = 216/216 (100.00%), Query Frame = 1

Query: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 60
           MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA
Sbjct: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 60

Query: 61  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 120
           AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA
Sbjct: 61  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 120

Query: 121 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHS 180
           YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHS
Sbjct: 121 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHS 180

Query: 181 LVPVFCSLWKFKFPKQLKLFVFEAAPFRLSIFFRLT 217
           LVPVFCSLWKFKFPKQLKLFVFEAAPFRLSIFFRLT
Sbjct: 181 LVPVFCSLWKFKFPKQLKLFVFEAAPFRLSIFFRLT 216

BLAST of Csa5G221930 vs. NCBI nr
Match: gi|778701529|ref|XP_011655041.1| (PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD1 isoform X2 [Cucumis sativus])

HSP 1 Score: 345.1 bits (884), Expect = 8.7e-92
Identity = 170/172 (98.84%), Positives = 171/172 (99.42%), Query Frame = 1

Query: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 60
           MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA
Sbjct: 54  MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 113

Query: 61  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 120
           AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA
Sbjct: 114 AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 173

Query: 121 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFL 173
           YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE +
Sbjct: 174 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKELV 225

BLAST of Csa5G221930 vs. NCBI nr
Match: gi|449457407|ref|XP_004146440.1| (PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD2 isoform X1 [Cucumis sativus])

HSP 1 Score: 344.7 bits (883), Expect = 1.1e-91
Identity = 170/170 (100.00%), Positives = 170/170 (100.00%), Query Frame = 1

Query: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 60
           MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA
Sbjct: 54  MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 113

Query: 61  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 120
           AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA
Sbjct: 114 AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 173

Query: 121 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE 171
           YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE
Sbjct: 174 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE 223

BLAST of Csa5G221930 vs. NCBI nr
Match: gi|659114192|ref|XP_008456944.1| (PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD2 isoform X1 [Cucumis melo])

HSP 1 Score: 334.0 bits (855), Expect = 2.0e-88
Identity = 167/198 (84.34%), Positives = 178/198 (89.90%), Query Frame = 1

Query: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 60
           MA  S SGVV FKVSPST CVIQKGDITKWFIDGSSDAIVNPAN+VMLGGGGADGAIHNA
Sbjct: 54  MANESRSGVVGFKVSPSTDCVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNA 113

Query: 61  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 120
           AGPDL++ACYSVQEVQPGIRCPTGEARITPGF+LPASHVIHTVGPIYNASRNPQALLRSA
Sbjct: 114 AGPDLVRACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNASRNPQALLRSA 173

Query: 121 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEFLLWMVNSHS 180
           YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE  +++V+   
Sbjct: 174 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEVRIFLVHFVL 233

Query: 181 LVPVFCSLWKFKFPKQLK 199
             P   ++W  K  + LK
Sbjct: 234 YAPDIYNVWLDKANELLK 251

BLAST of Csa5G221930 vs. NCBI nr
Match: gi|659114194|ref|XP_008456945.1| (PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD1 isoform X2 [Cucumis melo])

HSP 1 Score: 327.4 bits (838), Expect = 1.9e-86
Identity = 168/200 (84.00%), Positives = 177/200 (88.50%), Query Frame = 1

Query: 1   MAKPSGSGVVRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANQVMLGGGGADGAIHNA 60
           MA  S SGVV FKVSPST CVIQKGDITKWFIDGSSDAIVNPAN+VMLGGGGADGAIHNA
Sbjct: 54  MANESRSGVVGFKVSPSTDCVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNA 113

Query: 61  AGPDLIQACYSVQEVQPGIRCPTGEARITPGFQLPASHVIHTVGPIYNASRNPQALLRSA 120
           AGPDL++ACYSVQEVQPGIRCPTGEARITPGF+LPASHVIHTVGPIYNASRNPQALLRSA
Sbjct: 114 AGPDLVRACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNASRNPQALLRSA 173

Query: 121 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE--FLLWMVNS 180
           YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKE  F+L+    
Sbjct: 174 YRNSLAVAKENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEVHFVLY---- 233

Query: 181 HSLVPVFCSLWKFKFPKQLK 199
               P   ++W  K  + LK
Sbjct: 234 ---APDIYNVWLDKANELLK 246

The following BLAST results are available for this feature:
Csa5G221930 vs. Swiss-Prot
Match Name    E-value   Identity (%)  Description
Y3343_XANAC   2.9e-34   48.32         Macro domain-containing protein XAC3343 OS=Xanthomonas axonopodis pv. citri (str...
Y3184_XANCP   3.7e-34   48.32         Macro domain-containing protein XCC3184 OS=Xanthomonas campestris pv. campestris...
Y4103_VIBPA   3.7e-34   47.09         Macro domain-containing protein VPA0103 OS=Vibrio parahaemolyticus serotype O3:K...
Y334_RALSO    1.2e-32   51.82         Macro domain-containing protein RSc0334 OS=Ralstonia solanacearum (strain GMI100...
Y3408_LACPL   7.8e-32   52.45         Macro domain-containing protein lp_3408 OS=Lactobacillus plantarum (strain ATCC ...

Csa5G221930 vs. TrEMBL
Match Name         E-value    Identity (%)  Description
A0A0A0KSJ5_CUCSA   2.4e-120   100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_5G221930 PE=4 SV=1
E5GB75_CUCME       1.0e-75    82.87         Appr-1-p processing enzyme family protein OS=Cucumis melo subsp. melo PE=4 SV=1
G7I3E3_MEDTR       7.2e-69    75.28         Appr-1-P processing enzyme family protein OS=Medicago truncatula GN=MTR_1g007640...
K4C1Q7_SOLLC       4.7e-68    76.36         Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1
W9S9J7_9ROSA       4.7e-68    67.58         Uncharacterized protein OS=Morus notabilis GN=L484_003426 PE=4 SV=1

Csa5G221930 vs. TAIR10
Match Name    E-value   Identity (%)  Description
AT2G40600.1   9.1e-63   69.28         appr-1-p processing enzyme family protein
AT1G69340.1   2.3e-13   29.50         appr-1-p processing enzyme family protein

Csa5G221930 vs. NCBI nr
Match Name                         E-value    Identity (%)  Description
gi|700195554|gb|KGN50731.1|        3.4e-120   100.00        hypothetical protein Csa_5G221930 [Cucumis sativus]
gi|778701529|ref|XP_011655041.1|   8.7e-92    98.84         PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD1 isoform X2 [Cucumis sativus]
gi|449457407|ref|XP_004146440.1|   1.1e-91    100.00        PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD2 isoform X1 [Cucumis sativus]
gi|659114192|ref|XP_008456944.1|   2.0e-88    84.34         PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD2 isoform X1 [Cucumis melo]
gi|659114194|ref|XP_008456945.1|   1.9e-86    84.00         PREDICTED: O-acetyl-ADP-ribose deacetylase MACROD1 isoform X2 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002589   Macro_dom
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0009664       plant-type cell wall organization
cellular_component   GO:0005829       cytosol
cellular_component   GO:0005576       extracellular region
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function
This gene is associated with the following unigenes:
Unigene Name   Analysis Name                         Sequence type in Unigene
CU089384       cucumber EST collection version 3.0   transcribed_cluster
CU097562       cucumber EST collection version 3.0   transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Csa5G221930.1   Csa5G221930.1   mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name   Unique Name   Type
CU097562       CU097562      transcribed_cluster
CU089384       CU089384      transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term    IPR Description    Source    Source Term         Source Description                                                 Alignment
IPR002589   Macro domain       PFAM      PF01661             Macro                                                              coord: 40..157; score: 2.8
IPR002589   Macro domain       SMART     SM00506             YBR022w_8                                                          coord: 19..157; score: 1.1
IPR002589   Macro domain       PROFILE   PS51154             MACRO                                                              coord: 7..216; score: 2
None        No IPR available   GENE3D    G3DSA:3.40.220.10   -                                                                  coord: 11..167; score: 7.8
None        No IPR available   PANTHER   PTHR11106           GANGLIOSIDE INDUCED DIFFERENTIATION ASSOCIATED PROTEIN 2-RELATED   coord: 21..164; score: 2.2
None        No IPR available   unknown   SSF52949            Macro domain-like                                                  coord: 20..201; score: 3.03