Cla021767 (gene) Watermelon (97103) v1

Name: Cla021767
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Glycoside hydrolase family 28 protein/polygalacturonase family protein (AHRD V1 ***- Q1PF10_ARATH); contains InterPro domain(s) IPR012334 Pectin lyase fold
Location: Chr5: 6020636..6022961 (+)
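The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated in this record); `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr5 : 6020636 .. 6022961 (+)'
    into (chromosome, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr5 : 6020636 .. 6022961 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 2326 bp on the plus strand of Chr5.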
The following sequences are available for this feature:

Gene sequence (with intron)

GTAGCATTGCTTTTGTTGCTACAATTGAGCAATGCTGCGAAAATAATTGGTGAAGAGCTCGCCGGACAGTGCGATTCTAAACCAGCATTAGATCCAAGACCACACAGTGTGTCTATCTTGGAGTTTGGTGCTGTTGGAGATGGCAAAACTCTTAACACCATTGCCTTCCAAAATGCCATTTTCTATCTTAAATCCTTTGCTGATAAGGGTGGAGCTCAGCTTTATGTTCCACCTGGAAAATGGCTCACTGGAAGCATTAATCTTACCAGCCATCTGACCCTCTTCTTGGAAAAAGGTGCTGTGATTCTTGGCTCTCAGGTATGTTCTTTGTCATGGATGGTGCTCAAACAAGTAGTTGTTTCTAGGCAGTTGTCGTGAAAACACATTGATCTTCTCTTTAACAGGACCCATCCCATTGGGAACTTGTCAATCCCTTACCTTCATATGGTAGAGGCATTGAAGTTCCAGGAAAACGATACCGGAGCTTGATAAATGGTTACAATTTACAAGATGTTGTAATAACAGGTTTGTTGCTGCTCTTCTAACCCTTGATTCAGCATCTAGTTTGCGGAAATGCATTCCTCTGAACTAGTGTTATTATGTTGATTTTGGATGTGCTAGGTGATGATGGGGTCATTGATGGACAGGGTTTAGTTTGGTGGAATTGGTTTAGCTCTCATTCCTTAAACTACAGCCGCCCTCATCTCGTGGAATTTGAGGATTCTCAATATGTAGTTGTTTCAAACCTCACTTTCTTGAATACTCCTGCATATAACATTCATCCAGTCTATTGCAGGTATGCTATAACAACCAGGGATCAGTTTCTCCTGCCTCAACTGGTTCTGTTCACCTTGTTGATATGTCTATTTGCTGATTGTCAGAGAATCTGTATTGAATATAAGTTTTTTCCATGTGCAGTAACGTCTATGTCTACAACATCTCTGTCTCTGCACCTTCTGAATCTCCCTACACAGTTGGAATAGTCCCAGGTTCGTAACGCTTCCTCAATTTTCAGCATCCTCATTGCTATTTTGTGTGACTATTGATGGGCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATATGATGTGATACAGTCTTTGGCTGAATGGCGTGAGCACCATTGAAGAATCTGTCCCTTTTATCTCACTCCCACATGCACACACACACCACATAGTGCTCAGAAGAAGATGCCATCAAGAAAGCTATCATGAGTAGTTGACTGTTAACAGATATGTTTATAAGTTTTAGTTAATAACAGCACTTGTTTAACATCATGTAATCTAAGATAAGCAATGTCTTCTAATGAAGTCCCAATATTCCAAGTTCTTATATTTGGGGCCTGAGGTCTGTGGTGATAATTTTTGCCGTAAAATTTAAGAAACTTACCACTCTTAAAATGGTGAAGTTGGATTGAATTTAGAAAATAAAATTCCGTTAGGAGTTTATGACACGTAGTGGGATGCCATCATGATCAAAAGGATTTATTGTGGGGTGGACCAAATTTGGACATACAGTACATCACGGTTTTCATTAATGATCATCAATCCAAGAATTAAATTTAGCAGTATGCAATTCATTCTTTCCTGTTTCTAATTAATAAAGTGGAGAGGGGCATTCTTTTCTAACTAAAGTTTGATTGGTGCTTTCAGATTCCTCCGATCATGTATGCATAGAGGGGTGCAACATTGCCACAGGATATGATGCCATTGCCTTGAAGAGTGGTTGGGATCAGTATGGTATTGCCTATGGCAGACCATCTAAAAACATACACATTAGAAGGGTCCACCTTCAGTCATCATCAGGTTCTTCCATTGCCTTTGGTAGTGAGATGTCTGGTGGGATATCTAATGTTCTTGTGGAGCATGTGCAACTGAACAACTCATTTATTGGCATTCAATTTAGAACTACAAAGGGAAGAGGGGGTTACATTAAAGGAATTGTCGTTTCAGATGTTGAAATGGAAAACATATCCACGGCATTCAGTGCCTCTGGTCACT
TTGGGTCACATCCCGATGATGAGTTTGATCCTAATGCTCTTCCAATTGTGCAGGACATAACCTTGCAGAATGTGAGAGGCACAAACATTAAAATTGCTGGAAACTTTTCTGGGATACAAGAATCTCCCTTCTCTTCAATCTATCTATCCAACATTACCTTTTCAATCAATTCATCTTCTTCCACTTCTTGGATCTGTTCAGATGTTTCTGGCTTTTCAGAATCTGTGATCCCACCACCCTGTTCTGATCTCAGTGCTCCATATTCAATTTCTTCCTCAGCAGCCTCTCCCCTTGTGAATTCAACTGGAAAAACTGCTGTTTTATGA

mRNA sequence

GTAGCATTGCTTTTGTTGCTACAATTGAGCAATGCTGCGAAAATAATTGGTGAAGAGCTCGCCGGACAGTGCGATTCTAAACCAGCATTAGATCCAAGACCACACAGTGTGTCTATCTTGGAGTTTGGTGCTGTTGGAGATGGCAAAACTCTTAACACCATTGCCTTCCAAAATGCCATTTTCTATCTTAAATCCTTTGCTGATAAGGGTGGAGCTCAGCTTTATGTTCCACCTGGAAAATGGCTCACTGGAAGCATTAATCTTACCAGCCATCTGACCCTCTTCTTGGAAAAAGGTGCTGTGATTCTTGGCTCTCAGGACCCATCCCATTGGGAACTTGTCAATCCCTTACCTTCATATGGTAGAGGCATTGAAGTTCCAGGAAAACGATACCGGAGCTTGATAAATGGTTACAATTTACAAGATGTTGTAATAACAGGTGATGATGGGGTCATTGATGGACAGGGTTTAGTTTGGTGGAATTGGTTTAGCTCTCATTCCTTAAACTACAGCCGCCCTCATCTCGTGGAATTTGAGGATTCTCAATATGTAGTTGTTTCAAACCTCACTTTCTTGAATACTCCTGCATATAACATTCATCCAGTCTATTGCAGTAACGTCTATGTCTACAACATCTCTGTCTCTGCACCTTCTGAATCTCCCTACACAGTTGGAATAGTCCCAGATTCCTCCGATCATGTATGCATAGAGGGGTGCAACATTGCCACAGGATATGATGCCATTGCCTTGAAGAGTGGTTGGGATCAGTATGGTATTGCCTATGGCAGACCATCTAAAAACATACACATTAGAAGGGTCCACCTTCAGTCATCATCAGGTTCTTCCATTGCCTTTGGTAGTGAGATGTCTGGTGGGATATCTAATGTTCTTGTGGAGCATGTGCAACTGAACAACTCATTTATTGGCATTCAATTTAGAACTACAAAGGGAAGAGGGGGTTACATTAAAGGAATTGTCGTTTCAGATGTTGAAATGGAAAACATATCCACGGCATTCAGTGCCTCTGGTCACTTTGGGTCACATCCCGATGATGAGTTTGATCCTAATGCTCTTCCAATTGTGCAGGACATAACCTTGCAGAATGTGAGAGGCACAAACATTAAAATTGCTGGAAACTTTTCTGGGATACAAGAATCTCCCTTCTCTTCAATCTATCTATCCAACATTACCTTTTCAATCAATTCATCTTCTTCCACTTCTTGGATCTGTTCAGATGTTTCTGGCTTTTCAGAATCTGTGATCCCACCACCCTGTTCTGATCTCAGTGCTCCATATTCAATTTCTTCCTCAGCAGCCTCTCCCCTTGTGAATTCAACTGGAAAAACTGCTGTTTTATGA

Coding sequence (CDS)

GTAGCATTGCTTTTGTTGCTACAATTGAGCAATGCTGCGAAAATAATTGGTGAAGAGCTCGCCGGACAGTGCGATTCTAAACCAGCATTAGATCCAAGACCACACAGTGTGTCTATCTTGGAGTTTGGTGCTGTTGGAGATGGCAAAACTCTTAACACCATTGCCTTCCAAAATGCCATTTTCTATCTTAAATCCTTTGCTGATAAGGGTGGAGCTCAGCTTTATGTTCCACCTGGAAAATGGCTCACTGGAAGCATTAATCTTACCAGCCATCTGACCCTCTTCTTGGAAAAAGGTGCTGTGATTCTTGGCTCTCAGGACCCATCCCATTGGGAACTTGTCAATCCCTTACCTTCATATGGTAGAGGCATTGAAGTTCCAGGAAAACGATACCGGAGCTTGATAAATGGTTACAATTTACAAGATGTTGTAATAACAGGTGATGATGGGGTCATTGATGGACAGGGTTTAGTTTGGTGGAATTGGTTTAGCTCTCATTCCTTAAACTACAGCCGCCCTCATCTCGTGGAATTTGAGGATTCTCAATATGTAGTTGTTTCAAACCTCACTTTCTTGAATACTCCTGCATATAACATTCATCCAGTCTATTGCAGTAACGTCTATGTCTACAACATCTCTGTCTCTGCACCTTCTGAATCTCCCTACACAGTTGGAATAGTCCCAGATTCCTCCGATCATGTATGCATAGAGGGGTGCAACATTGCCACAGGATATGATGCCATTGCCTTGAAGAGTGGTTGGGATCAGTATGGTATTGCCTATGGCAGACCATCTAAAAACATACACATTAGAAGGGTCCACCTTCAGTCATCATCAGGTTCTTCCATTGCCTTTGGTAGTGAGATGTCTGGTGGGATATCTAATGTTCTTGTGGAGCATGTGCAACTGAACAACTCATTTATTGGCATTCAATTTAGAACTACAAAGGGAAGAGGGGGTTACATTAAAGGAATTGTCGTTTCAGATGTTGAAATGGAAAACATATCCACGGCATTCAGTGCCTCTGGTCACTTTGGGTCACATCCCGATGATGAGTTTGATCCTAATGCTCTTCCAATTGTGCAGGACATAACCTTGCAGAATGTGAGAGGCACAAACATTAAAATTGCTGGAAACTTTTCTGGGATACAAGAATCTCCCTTCTCTTCAATCTATCTATCCAACATTACCTTTTCAATCAATTCATCTTCTTCCACTTCTTGGATCTGTTCAGATGTTTCTGGCTTTTCAGAATCTGTGATCCCACCACCCTGTTCTGATCTCAGTGCTCCATATTCAATTTCTTCCTCAGCAGCCTCTCCCCTTGTGAATTCAACTGGAAAAACTGCTGTTTTATGA

Protein sequence

VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAIFYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSYGRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFEDSQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCNIATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEHVQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPIVQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESVIPPPCSDLSAPYSISSSAASPLVNSTGKTAVL
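The protein sequence above corresponds to a frame-1 translation of the CDS under the standard genetic code. The sketch below shows that relationship on the first 39 and last 9 nucleotides of the CDS; `translate` is a hypothetical helper written for this illustration, not part of any tool used to build the record. Note that the CDS as listed begins with GTA (valine) rather than an ATG start codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDDDEEEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_at_stop=True):
    """Translate a DNA coding sequence in frame 1.
    Incomplete trailing codons are ignored; '*' marks a stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First 13 codons and last 3 codons of the CDS listed above.
print(translate("GTAGCATTGCTTTTGTTGCTACAATTGAGCAATGCTGCG"))
print(translate("GTTTTATGA"))
```

The first call reproduces the opening residues of the protein (VALLLLLQLSNAA) and the second reproduces its final two residues (VL) before the TGA stop codon.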
BLAST of Cla021767 vs. Swiss-Prot
Match: PGLR_VITVI (Probable polygalacturonase OS=Vitis vinifera GN=GSVIVT00026920001 PE=1 SV=1)

HSP 1 Score: 409.5 bits (1051), Expect = 4.9e-113
Identity = 206/429 (48.02%), Positives = 275/429 (64.10%), Query Frame = 1

Query: 1   VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 60
           VA++LLL +S      G  L  +     A+  R HS S+++FG VGDG+TLNT AFQ+A+
Sbjct: 29  VAVVLLLSVSRGECRKGRIL--EALEYSAISCRAHSASLVDFGGVGDGQTLNTKAFQDAV 88

Query: 61  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 120
             L  +  +GGAQLYVP GKWLTGS +LTSH TLFL + AV+L SQD S W ++ PLPSY
Sbjct: 89  SELSKYGSEGGAQLYVPAGKWLTGSFSLTSHFTLFLHRDAVLLASQDISQWPVIKPLPSY 148

Query: 121 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 180
           GRG +    RY SLI G NL DV+ITGD+G IDGQG +WW  F    L Y+RP+L+E   
Sbjct: 149 GRGRDAAAGRYTSLIFGTNLTDVIITGDNGTIDGQGGLWWQRFHGGKLKYTRPYLIELMY 208

Query: 181 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 240
           S  + +SNLT LN+P++N+HPVY  N+ +  I++ AP  SP T GI PDS  +  IE C 
Sbjct: 209 SADIQISNLTLLNSPSWNVHPVYSRNILIQGITILAPVRSPNTDGINPDSCTNTRIEDCY 268

Query: 241 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 300
           I +G D +A+KSGWD+YGIAYG P+K + IRR+   S   + IA GSEMSGGI +V  E 
Sbjct: 269 IVSGDDCVAVKSGWDEYGIAYGMPTKQLVIRRLTCISPYSAVIALGSEMSGGIQDVRAED 328

Query: 301 VQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPI 360
           +   NS  GI+ +T  GRGGY+K I V  + M+ +  AF  +G++GSH D+ +DP A P+
Sbjct: 329 IVAINSESGIRIKTGIGRGGYVKDIYVRGMTMKTMKWAFWMTGNYGSHADNHYDPKAFPV 388

Query: 361 VQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINS-SSSTSWICSDVSGFSES 420
           +Q I  +++   N+ +A    GI   PF+ I +SN+T  + + +    W C+DV G S  
Sbjct: 389 IQGINYRDMVAENVSMAARLEGIPSDPFTGICISNVTIHLAAKAKKVPWTCTDVEGISSG 448

Query: 421 VIPPPCSDL 429
           V P PCS L
Sbjct: 449 VTPTPCSTL 455
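The identity and positives percentages in each HSP summary are the match counts divided by the alignment length. The sketch below parses one summary line and recomputes both values; `parse_hsp_summary` is a hypothetical helper, and the pattern is an assumption based only on the lines shown in this record (it accepts either spelling of the positives label).

```python
import re

# Matches the HSP summary line format seen in this record.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Return reported and recomputed identity/positives percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_recomputed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_recomputed": round(100 * int(pos) / int(pos_len), 2),
    }

summary = "Identity = 206/429 (48.02%), Positives = 275/429 (64.10%), Query Frame = 1"
print(parse_hsp_summary(summary))
```

For the PGLR_VITVI hit, 206/429 and 275/429 give 48.02% and 64.10%, matching the reported figures.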

BLAST of Cla021767 vs. Swiss-Prot
Match: ADPG2_ARATH (Polygalacturonase ADPG2 OS=Arabidopsis thaliana GN=ADPG2 PE=2 SV=2)

HSP 1 Score: 117.1 bits (292), Expect = 5.1e-25
Identity = 107/404 (26.49%), Positives = 189/404 (46.78%), Query Frame = 1

Query: 34  PHSVSILEFGAVGDGKTLNTIAFQNAIFYLKSFADKGGAQLYVPPGK-WLTGSINLTS-- 93
           P +VS+ +FGA GDGKT +T AF NA  + K+ +  G   L VP G  +L  SI LT   
Sbjct: 65  PTTVSVSDFGAKGDGKTDDTQAFVNA--WKKACSSNGAVNLLVPKGNTYLLKSIQLTGPC 124

Query: 94  HLTLFLEKGAVILGSQDPSHWELVNPLPSYGRGIEVPGKRYRSLINGYNLQDVVITGDDG 153
           +  L ++    +  SQ  S ++ ++      + I   G      +N  ++      GD G
Sbjct: 125 NSILTVQIFGTLSASQKRSDYKDIS------KWIMFDG------VNNLSVDG----GDTG 184

Query: 154 VIDGQGLVWWNWFSSHSLNYSRP-----HLVEFEDSQYVVVSNLTFLNTPAYNIHPVYCS 213
           V+DG G  WW   +S   N ++P       + F +S+ ++V NL   N     I    CS
Sbjct: 185 VVDGNGETWWQ--NSCKRNKAKPCTKAPTALTFYNSKSLIVKNLKVRNAQQIQISIEKCS 244

Query: 214 NVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCNIATGYDAIALKSGWDQYGIAYGRPS 273
           NV V N+ V+AP++SP T GI   ++ ++ +    I TG D I+++SG           S
Sbjct: 245 NVQVSNVVVTAPADSPNTDGIHITNTQNIRVSESIIGTGDDCISIESG-----------S 304

Query: 274 KNIHIRRVHLQSSSGSSI-AFGSEMSGG-ISNVLVEHVQLNNSFIGIQFRTTKGRGGYIK 333
           +N+ I  +      G SI + G + S   +S V V+  +L+ +  G++ +T +G  G   
Sbjct: 305 QNVQINDITCGPGHGISIGSLGDDNSKAFVSGVTVDGAKLSGTDNGVRIKTYQGGSGTAS 364

Query: 334 GIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPIVQDITLQNVRGTNI-KIAGNFSG 393
            I+  +++M+N+         +        + +A+  V+++  +++ GT+  + A  F+ 
Sbjct: 365 NIIFQNIQMDNVKNPIIIDQDYCDKSKCTTEKSAVQ-VKNVVYRDISGTSASENAITFNC 424

Query: 394 IQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESVIPPPCS 427
            +  P   I L  +  +I    +T   C++ +   +  + P C+
Sbjct: 425 SKNYPCQGIVLDRV--NIKGGKAT---CTNANVVDKGAVLPQCN 431

BLAST of Cla021767 vs. Swiss-Prot
Match: PGLR3_ARATH (Probable polygalacturonase At3g15720 OS=Arabidopsis thaliana GN=At3g15720 PE=1 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 7.3e-24
Identity = 110/406 (27.09%), Positives = 178/406 (43.84%), Query Frame = 1

Query: 35  HSVSILEFGAVGDGKTLNTIAFQNAIFYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTL 94
           +++ + +FGAVGDG T ++ AF  A  +    +  G  Q  VP G        +T  L  
Sbjct: 22  NALDVTQFGAVGDGVTDDSQAFLKA--WEAVCSGTGDGQFVVPAG--------MTFMLQP 81

Query: 95  FLEKGAVILGSQDPSHWELVNPLPSYGRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDG 154
              +G+       P   +++  L +  +G     K    L    +++ +VI GD G I+G
Sbjct: 82  LKFQGSC---KSTPVFVQMLGKLVAPSKGNWKGDKDQWILFT--DIEGLVIEGD-GEING 141

Query: 155 QGLVWWNWFSSHSLNYSRPHLVEFEDSQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISV 214
           QG  WW          SRP  ++F     + +S LT L++P  +IH   C+ V + ++ +
Sbjct: 142 QGSSWWEHKG------SRPTALKFRSCNNLRLSGLTHLDSPMAHIHISECNYVTISSLRI 201

Query: 215 SAPSESPYTVGIVPDSSDHVCIEGCNIATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVH 274
           +AP  SP T GI   +S +V I+ C IATG D IA+ SG           + NIHI  + 
Sbjct: 202 NAPESSPNTDGIDVGASSNVVIQDCIIATGDDCIAINSG-----------TSNIHISGI- 261

Query: 275 LQSSSGSSIAFGSEMSGG----ISNVLVEHVQLNNSFIGIQFRTTKGRGGYIKGIVVSDV 334
                G  I+ GS    G    + NV V++     +  G + +T +G  GY + I  + +
Sbjct: 262 -DCGPGHGISIGSLGKDGETATVENVCVQNCNFRGTMNGARIKTWQGGSGYARMITFNGI 321

Query: 335 EMENISTAFSASGHF-GSHPDDEFDPNALPI-VQDITLQNVRGTNIKIAG-NFSGIQESP 394
            ++N+         + G   D+  D  +  + V  +   N  GT+    G +F   +  P
Sbjct: 322 TLDNVENPIIIDQFYNGGDSDNAKDRKSSAVEVSKVVFSNFIGTSKSEYGVDFRCSERVP 381

Query: 395 FSSIYLSNITFSINSSSS---TSWICSDVSGFSESVIPP-PCSDLS 430
            + I+L ++     SS S       C +V G S   +P   C +LS
Sbjct: 382 CTEIFLRDMKIETASSGSGQVAQGQCLNVRGASTIAVPGLECLELS 392

BLAST of Cla021767 vs. Swiss-Prot
Match: PGLR_SOLLC (Polygalacturonase-2 OS=Solanum lycopersicum GN=PG2 PE=1 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 3.1e-22
Identity = 98/403 (24.32%), Positives = 174/403 (43.18%), Query Frame = 1

Query: 37  VSILEFGAVGDGKTLNTIAFQNAIFYLKSFADKGGAQLYVPPGK-WLTGSINLTSHLTLF 96
           +++L FGA GDGKT + IAF+ A  + ++ + +   Q  VP  K +L   I  +      
Sbjct: 76  INVLSFGAKGDGKTYDNIAFEQA--WNEACSSRTPVQFVVPKNKNYLLKQITFSGPCRSS 135

Query: 97  LEKGAVILGSQDPSHWELVNPLPSYGRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQ 156
           +     I GS + S               ++   + R L   ++    ++ G  G I+G 
Sbjct: 136 I--SVKIFGSLEASS--------------KISDYKDRRLWIAFDSVQNLVVGGGGTINGN 195

Query: 157 GLVWWNWFSSHSLNYS-----RPHLVEFEDSQYVVVSNLTFLNTPAYNIHPVYCSNVYVY 216
           G VWW   SS  +N S      P  + F + + + V+NL   N    +I    C+NV   
Sbjct: 196 GQVWWP--SSCKINKSLPCRDAPTALTFWNCKNLKVNNLKSKNAQQIHIKFESCTNVVAS 255

Query: 217 NISVSAPSESPYTVGIVPDSSDHVCIEGCNIATGYDAIALKSGWDQYGIAYGRPSKNIHI 276
           N+ ++A ++SP T G+   ++ ++ I    I TG D I++ SG           S+N  +
Sbjct: 256 NLMINASAKSPNTDGVHVSNTQYIQISDTIIGTGDDCISIVSG-----------SQN--V 315

Query: 277 RRVHLQSSSGSSIAFGSEMSGG----ISNVLVEHVQLNNSFIGIQFRTTKGRGGYIKGIV 336
           +  ++    G  I+ GS  SG     +SNV V   ++  +  G++ +T +G  G    I 
Sbjct: 316 QATNITCGPGHGISIGSLGSGNSEAYVSNVTVNEAKIIGAENGVRIKTWQGGSGQASNIK 375

Query: 337 VSDVEMENISTAFSASGHFGSHPDDEFDPNALPIVQDITLQNVRGTN-IKIAGNFSGIQE 396
             +VEM+++        ++    +      +   V+++  +N++GT+  K+A  F     
Sbjct: 376 FLNVEMQDVKYPIIIDQNYCDRVEPCIQQFSAVQVKNVVYENIKGTSATKVAIKFDCSTN 435

Query: 397 SPFSSIYLSNITFSINSSSSTSWICSDVSGFSESVIPPPCSDL 429
            P   I + NI     S   +   C +V   +   + P C+ L
Sbjct: 436 FPCEGIIMENINLVGESGKPSEATCKNVHFNNAEHVTPHCTSL 445

BLAST of Cla021767 vs. Swiss-Prot
Match: ADPG1_ARATH (Polygalacturonase ADPG1 OS=Arabidopsis thaliana GN=ADPG1 PE=2 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 1.4e-19
Identity = 92/401 (22.94%), Positives = 173/401 (43.14%), Query Frame = 1

Query: 36  SVSILEFGAVGDGKTLNTIAFQNAIFYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLF 95
           +VS+  FGA GDGKT +T AF+ A  + K+ +  G     VP GK            T  
Sbjct: 67  TVSVSNFGAKGDGKTDDTQAFKKA--WKKACSTNGVTTFLVPKGK------------TYL 126

Query: 96  LEKGAVILGSQDPSHWELVNPLPSYGRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQ 155
           L+        +   +++++  L +  +  +   K +  ++   N    +  G  G+I+G 
Sbjct: 127 LKSTRFRGPCKSLRNFQILGTLSASTKRSDYKDKNHWLILEDVNNLS-IDGGSTGIINGN 186

Query: 156 GLVWWNWFSSHSLNYSR-----PHLVEFEDSQYVVVSNLTFLNTPAYNIHPVYCSNVYVY 215
           G  WW   +S  ++ S+     P  +   + + + V NL   N     I    C+ V V 
Sbjct: 187 GKTWWQ--NSCKIDKSKPCTKAPTALTLYNLKNLNVKNLRVKNAQQIQISIEKCNKVEVS 246

Query: 216 NISVSAPSESPYTVGIVPDSSDHVCIEGCNIATGYDAIALKSGWDQYGIAYGRPSKNIHI 275
           N+ ++AP +SP T GI   ++ ++ +   +I TG D I+++ G           ++N+ I
Sbjct: 247 NVEITAPGDSPNTDGIHITNTQNIRVSNSDIGTGDDCISIEDG-----------TQNLQI 306

Query: 276 RRVHLQSSSGSSIAFGS----EMSGGISNVLVEHVQLNNSFIGIQFRTTKGRGGYIKGIV 335
               L    G  I+ GS         +S + V+  + + S  G++ +T +G  G  K I 
Sbjct: 307 --FDLTCGPGHGISIGSLGDDNSKAYVSGINVDGAKFSESDNGVRIKTYQGGSGTAKNIK 366

Query: 336 VSDVEMENISTAFSASGHFGSHPDDEFDPNALPIVQDITLQNVRGTN-IKIAGNFSGIQE 395
             ++ MEN+         +      E   +A+  V+++  +N+ GT+   +A   +  ++
Sbjct: 367 FQNIRMENVKNPIIIDQDYCDKDKCEDQESAVQ-VKNVVYKNISGTSATDVAITLNCSEK 426

Query: 396 SPFSSIYLSNITFSINSSSSTSWICSDVSGFSESVIPPPCS 427
            P   I L N+     ++S     C + +  ++  + P CS
Sbjct: 427 YPCQGIVLENVKIKGGTAS-----CKNANVKNQGTVSPKCS 431

BLAST of Cla021767 vs. TrEMBL
Match: A0A061ENJ0_THECC (Pectin lyase-like superfamily protein OS=Theobroma cacao GN=TCM_021274 PE=3 SV=1)

HSP 1 Score: 697.2 bits (1798), Expect = 1.3e-197
Identity = 336/452 (74.34%), Positives = 388/452 (85.84%), Query Frame = 1

Query: 1   VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 60
           VALLLLL L NA K+ GEE  GQC+ K  LDPRPHSVSILEFGAVGDGKTLNTIAFQNAI
Sbjct: 159 VALLLLLALCNAIKVNGEESNGQCNHKLTLDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 218

Query: 61  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 120
           FYLKSFADKGGAQLYVPPG+WLTGS NLTSHLTLFLEKGAVILGSQDPSHW++V PLPSY
Sbjct: 219 FYLKSFADKGGAQLYVPPGRWLTGSFNLTSHLTLFLEKGAVILGSQDPSHWDIVEPLPSY 278

Query: 121 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 180
           GRGIE+PG RYRSL+NGY L+DVVITGD+G IDGQG VWW WF SHSLNYSRP LVEF  
Sbjct: 279 GRGIELPGGRYRSLVNGYMLRDVVITGDNGTIDGQGSVWWEWFMSHSLNYSRPQLVEFVS 338

Query: 181 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 240
           S Y++VSN+TFLN PAYNIHPVYCSNV+++NISV AP  SP+TVGIVPDSSD+VCIE C+
Sbjct: 339 SDYILVSNITFLNAPAYNIHPVYCSNVHIHNISVYAPPASPFTVGIVPDSSDNVCIEDCS 398

Query: 241 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 300
           I+ G+DAIALKSGWD+YGI YGRP+ N+HIRRV+LQSSSGSS+AFGSEMSGGIS+V VE 
Sbjct: 399 ISMGHDAIALKSGWDEYGITYGRPTTNVHIRRVNLQSSSGSSLAFGSEMSGGISDVQVEQ 458

Query: 301 VQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPI 360
             L NS  GI+FRTT+GRGGYI+ I++SDV++ N+  AF A GH+GSHPDD++DP+A P+
Sbjct: 459 AHLYNSLSGIEFRTTRGRGGYIEDIIISDVDLLNVHMAFGAIGHYGSHPDDKYDPDAFPV 518

Query: 361 VQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESV 420
           +Q ITLQN+ GTNI +AGNF+GI+ESPF+SI LSNI+ SINS+SSTSW+CS VSGFSESV
Sbjct: 519 LQKITLQNIIGTNITVAGNFTGIRESPFTSICLSNISLSINSASSTSWVCSYVSGFSESV 578

Query: 421 IPPPCSDLSAPYSISSSAASPLVNSTGKTAVL 453
            P PC DL    S +SS+   L+   G+ AVL
Sbjct: 579 FPEPCPDLE--NSNTSSSCVSLLKPNGRAAVL 608

BLAST of Cla021767 vs. TrEMBL
Match: M5XF65_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005599mg PE=3 SV=1)

HSP 1 Score: 694.5 bits (1791), Expect = 8.5e-197
Identity = 344/452 (76.11%), Positives = 387/452 (85.62%), Query Frame = 1

Query: 1   VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 60
           V LLLLL LSNA  + GE+    CD    L+PRPHSVSILEFGAVGDG+TLNT+AFQNAI
Sbjct: 5   VVLLLLLTLSNAIDLGGEKDEKPCDYNLTLEPRPHSVSILEFGAVGDGRTLNTLAFQNAI 64

Query: 61  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 120
           FYLKSFADKGGAQLYVPPG+WLTGS NLTSHLTLFLEKGAVILGSQDPSHWE+V PLPSY
Sbjct: 65  FYLKSFADKGGAQLYVPPGRWLTGSFNLTSHLTLFLEKGAVILGSQDPSHWEVVEPLPSY 124

Query: 121 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 180
           GRGIE+PG RYRSLINGY L DVVITGD+G I+GQG VWW+WFSS +LNYSRPHLVEF  
Sbjct: 125 GRGIELPGGRYRSLINGYMLHDVVITGDNGTINGQGSVWWDWFSSQTLNYSRPHLVEFVS 184

Query: 181 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 240
           S+YVVVSNLTF+N PAYNIHPVYCSNV+V+NISVSAP ESPYTVGIVPDSSD+VCIE C+
Sbjct: 185 SKYVVVSNLTFVNAPAYNIHPVYCSNVHVHNISVSAPPESPYTVGIVPDSSDNVCIEDCS 244

Query: 241 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 300
           I  GYDAIALKSGWD+YGIAYGRP+ N+HIRRV+LQSSSGSS+AFGSEMSGGIS+V VE 
Sbjct: 245 IGMGYDAIALKSGWDEYGIAYGRPTTNVHIRRVYLQSSSGSSLAFGSEMSGGISDVFVEQ 304

Query: 301 VQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPI 360
           V + NSF GIQFRTTKGRGGYI+ I++SDVEMENI  AF ASG FGSHPDD+FDPNALP 
Sbjct: 305 VHIYNSFSGIQFRTTKGRGGYIREIIISDVEMENIHMAFGASGQFGSHPDDKFDPNALPD 364

Query: 361 VQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESV 420
           +  ITLQ+V GTNI IAG+F+GIQESPF+S  LSNI+ S NS S T W CS+VSG S+SV
Sbjct: 365 LDHITLQDVIGTNITIAGSFTGIQESPFTSFCLSNISLSANSGSPT-WECSNVSGSSDSV 424

Query: 421 IPPPCSDLSAPYSISSSAASPLVNSTGKTAVL 453
            P PCS+ ++ Y   SS+   L+ + GKTAVL
Sbjct: 425 FPQPCSEFNSSY---SSSCFSLLTANGKTAVL 452

BLAST of Cla021767 vs. TrEMBL
Match: F6H6V7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0077g01760 PE=3 SV=1)

HSP 1 Score: 690.3 bits (1780), Expect = 1.6e-195
Identity = 343/451 (76.05%), Positives = 384/451 (85.14%), Query Frame = 1

Query: 3   LLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAIFY 62
           LL++L LSNA +  G+E  GQC +   LDPRPHSVSILEFGAVGDGKTLNTIAFQNAIFY
Sbjct: 5   LLVILVLSNAVESNGQERGGQCTNSLTLDPRPHSVSILEFGAVGDGKTLNTIAFQNAIFY 64

Query: 63  LKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSYGR 122
           LKSFADKGGAQLYVPPGKWLTGS NLTSHLTLFLE+GAVILGSQDPSHWE++ PLPSYGR
Sbjct: 65  LKSFADKGGAQLYVPPGKWLTGSFNLTSHLTLFLERGAVILGSQDPSHWEVIEPLPSYGR 124

Query: 123 GIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFEDSQ 182
           GIE+PG RYRSLINGY L+DVVITGD+G I+GQG VWW+WF+SHSLNYSRPHLVEF  S 
Sbjct: 125 GIELPGGRYRSLINGYMLRDVVITGDNGTINGQGSVWWDWFTSHSLNYSRPHLVEFLAST 184

Query: 183 YVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCNIA 242
            VVVSNLTFLN PAYNIHPVYCSNV V NISV AP ESPYTVGIVPDSSD  CIE C+IA
Sbjct: 185 NVVVSNLTFLNAPAYNIHPVYCSNVRVQNISVYAPPESPYTVGIVPDSSDSTCIEDCSIA 244

Query: 243 TGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEHVQ 302
            G+DAIALKSGWD+YGIAYGRP+ N+HIRRV+LQSSSGSS+AFGSEMSGGISNV VE V 
Sbjct: 245 MGHDAIALKSGWDEYGIAYGRPTTNVHIRRVNLQSSSGSSLAFGSEMSGGISNVCVEQVH 304

Query: 303 LNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPIVQ 362
           L NSF GI+FRTTKGRGGYI+ I++SDV MENI TAFSA+G  GSHPDD FDPNALP++ 
Sbjct: 305 LYNSFSGIEFRTTKGRGGYIQEIIISDVAMENIHTAFSATGQIGSHPDDHFDPNALPVLD 364

Query: 363 DITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESVIP 422
            ITLQNV GTNI IAG+F+GIQESPF+SI LSNI+ S    +S SW+CS+VSGFS+ V P
Sbjct: 365 HITLQNVIGTNITIAGSFTGIQESPFTSICLSNISLSTTPPASISWVCSNVSGFSQWVFP 424

Query: 423 PPCSDL-SAPYSISSSAASPLVNSTGKTAVL 453
            PC  L S+  + SS+  S L +S  +TAVL
Sbjct: 425 EPCPSLESSLLNSSSTCLSRLNSSPTQTAVL 455

BLAST of Cla021767 vs. TrEMBL
Match: A0A067K7R1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18934 PE=3 SV=1)

HSP 1 Score: 687.2 bits (1772), Expect = 1.4e-194
Identity = 336/452 (74.34%), Positives = 384/452 (84.96%), Query Frame = 1

Query: 1   VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 60
           V LLLLL L NA  I GEE  G CD KP++DPRPHSVSILEFGAVGDGKTLNTIAFQNAI
Sbjct: 3   VTLLLLLALINAIGIDGEESNGLCDYKPSIDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 62

Query: 61  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 120
           FYLKSFADKGGAQLYVPPGKWLTGS NLTSHLTLFLEKGAVILG QDPSHW+++ PLPSY
Sbjct: 63  FYLKSFADKGGAQLYVPPGKWLTGSFNLTSHLTLFLEKGAVILGFQDPSHWDVLEPLPSY 122

Query: 121 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 180
           GRGIE+PG RYRSLINGY L+DVVITGD+G IDGQG VWW+WF+SHSLNYSRPH+VEF +
Sbjct: 123 GRGIELPGGRYRSLINGYKLRDVVITGDNGTIDGQGSVWWDWFNSHSLNYSRPHIVEFIE 182

Query: 181 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 240
           S+ VVVSNLTFLN PAYNIHPVYCSNV V N+S+SAP ESPYT+GIVPDSS++VCIE   
Sbjct: 183 SERVVVSNLTFLNAPAYNIHPVYCSNVLVQNMSISAPPESPYTIGIVPDSSNNVCIEDSI 242

Query: 241 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 300
           I  GYDAI+LKSGWD+YGIAY R + ++HIRRVHLQSSSGSSIAFGSEMSGGISNV VE 
Sbjct: 243 IEMGYDAISLKSGWDEYGIAYDRATSDVHIRRVHLQSSSGSSIAFGSEMSGGISNVHVEK 302

Query: 301 VQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPI 360
           V L NSF GI+FRTTKGRGGYIK I +SD+E+ENI+ AF A G  G HPDD+FDPNALP+
Sbjct: 303 VHLYNSFSGIEFRTTKGRGGYIKRIFISDIELENINLAFGAFGDHGLHPDDKFDPNALPV 362

Query: 361 VQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESV 420
           +  IT ++V GTNI  AGNF+GIQESPF+SI L N++ ++ S+SS SW+CS++ GFSESV
Sbjct: 363 IDQITFRDVTGTNITTAGNFTGIQESPFTSICLFNVSLTV-SASSNSWVCSNIVGFSESV 422

Query: 421 IPPPCSDLSAPYSISSSAASPLVNSTGKTAVL 453
            P PC  L+ P+S  SSA   L+NS G++A L
Sbjct: 423 FPEPCPQLTNPFSNFSSACYSLLNSYGESASL 453

BLAST of Cla021767 vs. TrEMBL
Match: B9RTB3_RICCO (Polygalacturonase, putative OS=Ricinus communis GN=RCOM_0682780 PE=3 SV=1)

HSP 1 Score: 686.0 bits (1769), Expect = 3.0e-194
Identity = 340/454 (74.89%), Positives = 387/454 (85.24%), Query Frame = 1

Query: 1   VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 60
           VALLLLL LSNA  I GEE +GQCD+KP+LDPRPHSVSILEFGAVGDGKTLNTI+FQNAI
Sbjct: 3   VALLLLLALSNAIVIYGEESSGQCDNKPSLDPRPHSVSILEFGAVGDGKTLNTISFQNAI 62

Query: 61  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 120
           FYLKSFADKGGA+LYVPPG+WLTGS NLTSHLTLFLEKGAVILGSQDPSH++L+ PLPSY
Sbjct: 63  FYLKSFADKGGAKLYVPPGRWLTGSFNLTSHLTLFLEKGAVILGSQDPSHYDLIEPLPSY 122

Query: 121 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 180
           GRGIE+PG RYRSLINGY L+DVVITGD+G IDGQG VWW+WF+SHSLNYSRPHLVEF +
Sbjct: 123 GRGIELPGGRYRSLINGYKLRDVVITGDNGTIDGQGSVWWDWFNSHSLNYSRPHLVEFIE 182

Query: 181 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 240
           S+ +VVSNLTFLN PAYNIHPVYCSNV V N+S+SAP ESP T+GIVPDSS++VCIE   
Sbjct: 183 SERIVVSNLTFLNAPAYNIHPVYCSNVLVQNMSLSAPPESPQTIGIVPDSSNNVCIEESI 242

Query: 241 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 300
           I  GYDAI+LKSGWD+YGIAY R ++++HIRRVHLQSSSGSSIAFGSEMSGGISNV VE 
Sbjct: 243 IKMGYDAISLKSGWDEYGIAYDRATRDVHIRRVHLQSSSGSSIAFGSEMSGGISNVHVEQ 302

Query: 301 VQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPI 360
           V L NSF GI FRTTKGRGGYIK I +SDVE+ENI+ A  A G  G HPDD+FDP A+P+
Sbjct: 303 VHLYNSFSGIGFRTTKGRGGYIKRIFISDVELENINLALGAIGDHGLHPDDKFDPKAVPV 362

Query: 361 VQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESV 420
           V  ITLQN+ GTNI  AGNF+GIQ+SPF+S+ L NIT  +  SSS SW CS+V G+S+SV
Sbjct: 363 VDQITLQNLTGTNISTAGNFTGIQDSPFTSLCLFNITLMV--SSSNSWTCSNVIGYSDSV 422

Query: 421 IPPPCSDLSAPYSISSSAASP--LVNSTGKTAVL 453
            P PC +L +PYS SSSA     L+NS GK+A L
Sbjct: 423 FPVPCPELKSPYSNSSSACYSLLLLNSYGKSASL 454

BLAST of Cla021767 vs. NCBI nr
Match: gi|449432886|ref|XP_004134229.1| (PREDICTED: probable polygalacturonase isoform X2 [Cucumis sativus])

HSP 1 Score: 901.4 bits (2328), Expect = 6.6e-259
Identity = 445/452 (98.45%), Positives = 447/452 (98.89%), Query Frame = 1

Query: 1   VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 60
           VALLL LQLSNAAKIIGEEL GQCDSKP LDPRPHSVSILEFGAVGDGKTLNTIAFQNAI
Sbjct: 6   VALLLFLQLSNAAKIIGEELVGQCDSKPTLDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 65

Query: 61  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 120
           FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY
Sbjct: 66  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 125

Query: 121 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 180
           GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED
Sbjct: 126 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 185

Query: 181 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 240
           SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN
Sbjct: 186 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 245

Query: 241 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 300
           IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH
Sbjct: 246 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 305

Query: 301 VQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPI 360
           VQLNNSFIGIQ RTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDE+DPNALPI
Sbjct: 306 VQLNNSFIGIQIRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEYDPNALPI 365

Query: 361 VQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESV 420
           VQDITLQNVRGTNIKIAGNFSGIQESPF+SIYLSNITFSINSSSSTSWICSDVSGFSESV
Sbjct: 366 VQDITLQNVRGTNIKIAGNFSGIQESPFTSIYLSNITFSINSSSSTSWICSDVSGFSESV 425

Query: 421 IPPPCSDLSAPYSISSSAASPLVNSTGKTAVL 453
           IPPPCSDLS PYSISSSAASPLVNSTGKTAVL
Sbjct: 426 IPPPCSDLSTPYSISSSAASPLVNSTGKTAVL 457

BLAST of Cla021767 vs. NCBI nr
Match: gi|778678970|ref|XP_011651065.1| (PREDICTED: probable polygalacturonase isoform X1 [Cucumis sativus])

HSP 1 Score: 901.4 bits (2328), Expect = 6.6e-259
Identity = 445/452 (98.45%), Positives = 447/452 (98.89%), Query Frame = 1

Query: 1   VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 60
           VALLL LQLSNAAKIIGEEL GQCDSKP LDPRPHSVSILEFGAVGDGKTLNTIAFQNAI
Sbjct: 7   VALLLFLQLSNAAKIIGEELVGQCDSKPTLDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 66

Query: 61  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 120
           FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY
Sbjct: 67  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 126

Query: 121 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 180
           GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED
Sbjct: 127 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 186

Query: 181 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 240
           SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN
Sbjct: 187 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 246

Query: 241 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 300
           IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH
Sbjct: 247 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 306

Query: 301 VQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPI 360
           VQLNNSFIGIQ RTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDE+DPNALPI
Sbjct: 307 VQLNNSFIGIQIRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEYDPNALPI 366

Query: 361 VQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESV 420
           VQDITLQNVRGTNIKIAGNFSGIQESPF+SIYLSNITFSINSSSSTSWICSDVSGFSESV
Sbjct: 367 VQDITLQNVRGTNIKIAGNFSGIQESPFTSIYLSNITFSINSSSSTSWICSDVSGFSESV 426

Query: 421 IPPPCSDLSAPYSISSSAASPLVNSTGKTAVL 453
           IPPPCSDLS PYSISSSAASPLVNSTGKTAVL
Sbjct: 427 IPPPCSDLSTPYSISSSAASPLVNSTGKTAVL 458

BLAST of Cla021767 vs. NCBI nr
Match: gi|694438461|ref|XP_009346202.1| (PREDICTED: probable polygalacturonase [Pyrus x bretschneideri])

HSP 1 Score: 704.9 bits (1818), Expect = 9.1e-200
Identity = 346/449 (77.06%), Positives = 387/449 (86.19%), Query Frame = 1

Query: 4   LLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAIFYL 63
           LLLL +SNA K+ GE+    CDSK  L  RPHSVSI EFGAVGDGKTLNT AFQNAIFYL
Sbjct: 8   LLLLAVSNAVKLNGEDDETPCDSKLTLKQRPHSVSISEFGAVGDGKTLNTHAFQNAIFYL 67

Query: 64  KSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSYGRG 123
           KSFADKGGAQLYVPPG+WLTGS NLTSHLTLFLEKGAVILGSQDPSHWE+V PLPSYGRG
Sbjct: 68  KSFADKGGAQLYVPPGRWLTGSFNLTSHLTLFLEKGAVILGSQDPSHWEIVEPLPSYGRG 127

Query: 124 IEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFEDSQY 183
           IE+PG+RY SLINGY LQDV+ITGD+G IDGQG VWW+WFSS SLNYSRPHLVEF  S+Y
Sbjct: 128 IELPGRRYGSLINGYMLQDVIITGDNGTIDGQGTVWWDWFSSQSLNYSRPHLVEFVMSKY 187

Query: 184 VVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCNIAT 243
           VVVSNLTF+N PAYNIHPVYCSNV+V+NISVSAP +SP+TVGIVPDSSD VCIE C+I  
Sbjct: 188 VVVSNLTFVNAPAYNIHPVYCSNVHVHNISVSAPPDSPHTVGIVPDSSDIVCIEDCSIGM 247

Query: 244 GYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEHVQL 303
           GYDAIALKSGWD+YGIAYGRP+ N+HIRRV LQS++GSS+AFGSEMSGGISNVLVE V+L
Sbjct: 248 GYDAIALKSGWDEYGIAYGRPTTNVHIRRVTLQSATGSSLAFGSEMSGGISNVLVEKVRL 307

Query: 304 NNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPIVQD 363
           NNSF GIQFRTTKGRGGYIK +++SDVEMENI  AF ASG FGSHPDD++DPNALP++  
Sbjct: 308 NNSFSGIQFRTTKGRGGYIKDVIISDVEMENIYMAFGASGQFGSHPDDKYDPNALPVLDH 367

Query: 364 ITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESVIPP 423
           ITLQ+V GTNI IAG F+GI+ESPF+S +LSNI+ S+NS S T+W CS VSG SESV P 
Sbjct: 368 ITLQDVVGTNITIAGCFTGIKESPFTSFFLSNISLSLNSGSPTTWDCSYVSGSSESVFPE 427

Query: 424 PCSDLSAPYSISSSAASPLVNSTGKTAVL 453
           PCSDL+  YS  SSA   L+   GK AVL
Sbjct: 428 PCSDLNTSYSNPSSARFSLLTLNGKAAVL 456

BLAST of Cla021767 vs. NCBI nr
Match: gi|658011053|ref|XP_008340766.1| (PREDICTED: probable polygalacturonase [Malus domestica])

HSP 1 Score: 702.2 bits (1811), Expect = 5.9e-199
Identity = 345/449 (76.84%), Positives = 388/449 (86.41%), Query Frame = 1

Query: 4   LLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAIFYL 63
           LLLL +SNA K+ GE+    CDSK  L+ RPHSVSI EFGAVGDGKTLNT AFQNAIFYL
Sbjct: 8   LLLLAVSNAVKLNGEDDEKPCDSKLTLEQRPHSVSISEFGAVGDGKTLNTHAFQNAIFYL 67

Query: 64  KSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSYGRG 123
           KSFADKGGAQLYVPPG+WLTGS NLTSHLTLFLEKGAVILGSQDPSHWE+V PLPSYGRG
Sbjct: 68  KSFADKGGAQLYVPPGRWLTGSFNLTSHLTLFLEKGAVILGSQDPSHWEIVEPLPSYGRG 127

Query: 124 IEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFEDSQY 183
           IE+PG+RY SLINGY L+DVVITGD+G I+GQG VWW+WFSS SLNYSRPHLVEF  S+Y
Sbjct: 128 IELPGRRYGSLINGYMLRDVVITGDNGTINGQGSVWWDWFSSKSLNYSRPHLVEFVMSKY 187

Query: 184 VVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCNIAT 243
           VVVSNLTF+N PAYNIHPVYCSNV+V+NISVSAP +SP+TVGIVPDSSD VCIE C+I  
Sbjct: 188 VVVSNLTFVNAPAYNIHPVYCSNVHVHNISVSAPPDSPHTVGIVPDSSDIVCIEDCSIGM 247

Query: 244 GYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEHVQL 303
           GYDAIALKSGWD+YGIAYGRP+ N+HIRRV LQS++GSS+AFGSEMSGGISNVLVE V+L
Sbjct: 248 GYDAIALKSGWDEYGIAYGRPTANVHIRRVTLQSATGSSLAFGSEMSGGISNVLVEKVRL 307

Query: 304 NNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPIVQD 363
           NNSF GIQFRTTKGRGGYIK +++SDVEMENI  AF ASG FGSHPDD++DPNALP++  
Sbjct: 308 NNSFSGIQFRTTKGRGGYIKDVIISDVEMENIYMAFGASGQFGSHPDDKYDPNALPVLDH 367

Query: 364 ITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESVIPP 423
           ITLQ V GTNI IAG F+GI+ESPF+S +LSNI+ S+NS+S T+W CS VSG SESV P 
Sbjct: 368 ITLQGVVGTNITIAGCFTGIKESPFTSFFLSNISLSLNSASPTTWNCSYVSGSSESVFPE 427

Query: 424 PCSDLSAPYSISSSAASPLVNSTGKTAVL 453
           PCSDL+  YS  SSA   L+   GK AVL
Sbjct: 428 PCSDLNTSYSNPSSARFSLLTLNGKAAVL 456

BLAST of Cla021767 vs. NCBI nr
Match: gi|694388614|ref|XP_009369995.1| (PREDICTED: probable polygalacturonase isoform X2 [Pyrus x bretschneideri])

HSP 1 Score: 701.0 bits (1808), Expect = 1.3e-198
Identity = 344/452 (76.11%), Positives = 387/452 (85.62%), Query Frame = 1

Query: 1   VALLLLLQLSNAAKIIGEELAGQCDSKPALDPRPHSVSILEFGAVGDGKTLNTIAFQNAI 60
           V LLLLL LSNA K+ GE+    CD+K  L+PRPHSVSILEFGAVGDGKTLNT+AFQNAI
Sbjct: 5   VVLLLLLALSNAVKVDGEDDEKPCDNKLTLEPRPHSVSILEFGAVGDGKTLNTLAFQNAI 64

Query: 61  FYLKSFADKGGAQLYVPPGKWLTGSINLTSHLTLFLEKGAVILGSQDPSHWELVNPLPSY 120
           FYLKSFADKGGAQLYVPPG+WLTGS NLTSHLTLFLEK AVILGSQDPSHWE+V PLPSY
Sbjct: 65  FYLKSFADKGGAQLYVPPGRWLTGSFNLTSHLTLFLEKDAVILGSQDPSHWEIVEPLPSY 124

Query: 121 GRGIEVPGKRYRSLINGYNLQDVVITGDDGVIDGQGLVWWNWFSSHSLNYSRPHLVEFED 180
           GRGIE+PG+RYRSLINGY L DVVITGD+G I+GQG VWW+WFSS SLNYSRPHLVEF  
Sbjct: 125 GRGIELPGRRYRSLINGYMLHDVVITGDNGTINGQGSVWWDWFSSQSLNYSRPHLVEFVM 184

Query: 181 SQYVVVSNLTFLNTPAYNIHPVYCSNVYVYNISVSAPSESPYTVGIVPDSSDHVCIEGCN 240
           S+YVVVSNLTF+N PAYNIHPVYCSNV+V NISVSAP +SP+TVGIVPDSSD VCIE C 
Sbjct: 185 SKYVVVSNLTFINAPAYNIHPVYCSNVHVLNISVSAPPDSPHTVGIVPDSSDTVCIEDCI 244

Query: 241 IATGYDAIALKSGWDQYGIAYGRPSKNIHIRRVHLQSSSGSSIAFGSEMSGGISNVLVEH 300
           I  GYDAIALKSGWD+YGIAYGRP+ N+HIR V LQS++GSS+AFGSEMSGGISNVLVE 
Sbjct: 245 IGVGYDAIALKSGWDEYGIAYGRPTTNVHIRHVTLQSATGSSLAFGSEMSGGISNVLVEQ 304

Query: 301 VQLNNSFIGIQFRTTKGRGGYIKGIVVSDVEMENISTAFSASGHFGSHPDDEFDPNALPI 360
           V+L+NSF GIQFRTTKGRGGYI   ++ DVE+ENI  AF ASG FGSHPDD++DPNALP+
Sbjct: 305 VRLHNSFSGIQFRTTKGRGGYITAPLIPDVEVENIYMAFGASGQFGSHPDDKYDPNALPV 364

Query: 361 VQDITLQNVRGTNIKIAGNFSGIQESPFSSIYLSNITFSINSSSSTSWICSDVSGFSESV 420
           +  ITLQ+V GTNI IAG F+GI+ESPF+S +LSNI+ S+NS+S T+W CS VSG SESV
Sbjct: 365 LDHITLQDVVGTNITIAGRFTGIEESPFTSFFLSNISLSLNSASPTTWDCSYVSGSSESV 424

Query: 421 IPPPCSDLSAPYSISSSAASPLVNSTGKTAVL 453
            P PCSDL+  YS  SSA   L+   GK AVL
Sbjct: 425 FPEPCSDLNTSYSNPSSAHISLLTVNGKAAVL 456
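
The percentages in the HSP summary lines above follow directly from the residue counts (matches divided by alignment length). As a minimal sketch, assuming Python and a hypothetical helper named `recompute` (neither is part of this record), the reported values can be re-derived and compared as follows. The example line uses the counts from the Malus domestica hit; the pattern does not depend on how the field label is spelled.

```python
import re

# Matches "Label = matches/length (percent%)". \w+ accepts any spelling
# of the field label, so variant labels in BLAST reports still parse.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")

def recompute(line):
    """Return {label: (reported_percent, recomputed_percent)} for one HSP line."""
    out = {}
    for label, hits, length, reported in FIELD.findall(line):
        out[label] = (float(reported), round(100 * int(hits) / int(length), 2))
    return out

# Counts taken from the Malus domestica HSP summary line.
line = "Identity = 345/449 (76.84%), Positives = 388/449 (86.41%), Query Frame = 1"
for label, (reported, recomputed) in recompute(line).items():
    print(label, reported, recomputed)
```

The "Query Frame = 1" field is skipped because it has no matches/length fraction.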

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PGLR_VITVI	4.9e-113	48.02	Probable polygalacturonase OS=Vitis vinifera GN=GSVIVT00026920001 PE=1 SV=1
ADPG2_ARATH	5.1e-25	26.49	Polygalacturonase ADPG2 OS=Arabidopsis thaliana GN=ADPG2 PE=2 SV=2
PGLR3_ARATH	7.3e-24	27.09	Probable polygalacturonase At3g15720 OS=Arabidopsis thaliana GN=At3g15720 PE=1 S...
PGLR_SOLLC	3.1e-22	24.32	Polygalacturonase-2 OS=Solanum lycopersicum GN=PG2 PE=1 SV=1
ADPG1_ARATH	1.4e-19	22.94	Polygalacturonase ADPG1 OS=Arabidopsis thaliana GN=ADPG1 PE=2 SV=1

Match Name	E-value	Identity	Description
A0A061ENJ0_THECC	1.3e-197	74.34	Pectin lyase-like superfamily protein OS=Theobroma cacao GN=TCM_021274 PE=3 SV=1
M5XF65_PRUPE	8.5e-197	76.11	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005599mg PE=3 SV=1
F6H6V7_VITVI	1.6e-195	76.05	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0077g01760 PE=3 SV=...
A0A067K7R1_JATCU	1.4e-194	74.34	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18934 PE=3 SV=1
B9RTB3_RICCO	3.0e-194	74.89	Polygalacturonase, putative OS=Ricinus communis GN=RCOM_0682780 PE=3 SV=1

Match Name	E-value	Identity	Description
gi|449432886|ref|XP_004134229.1|	6.6e-259	98.45	PREDICTED: probable polygalacturonase isoform X2 [Cucumis sativus]
gi|778678970|ref|XP_011651065.1|	6.6e-259	98.45	PREDICTED: probable polygalacturonase isoform X1 [Cucumis sativus]
gi|694438461|ref|XP_009346202.1|	9.1e-200	77.06	PREDICTED: probable polygalacturonase [Pyrus x bretschneideri]
gi|658011053|ref|XP_008340766.1|	5.9e-199	76.84	PREDICTED: probable polygalacturonase [Malus domestica]
gi|694388614|ref|XP_009369995.1|	1.3e-198	76.11	PREDICTED: probable polygalacturonase isoform X2 [Pyrus x bretschneideri]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR000743	Glyco_hydro_28
IPR006626	PbH1
IPR011050	Pectin_lyase_fold/virulence
IPR012334	Pectin_lyas_fold
Vocabulary: Molecular Function
Term	Definition
GO:0004650	polygalacturonase activity
Vocabulary: Biological Process
Term	Definition
GO:0005975	carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0071555	cell wall organization
biological_process	GO:0005982	starch metabolic process
biological_process	GO:0005985	sucrose metabolic process
biological_process	GO:0005975	carbohydrate metabolic process
biological_process	GO:0008152	metabolic process
cellular_component	GO:0005576	extracellular region
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0016829	lyase activity
molecular_function	GO:0004650	polygalacturonase activity
molecular_function	GO:0016787	hydrolase activity
molecular_function	GO:0016798	hydrolase activity, acting on glycosyl bonds
This gene is associated with the following unigenes:
Unigene Name	Analysis Name	Sequence type in Unigene
WMU36418	watermelon EST collection version 2.0	transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla021767	Cla021767.1	mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name	Unique Name	Type
WMU36418	WMU36418	transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000743	Glycoside hydrolase, family 28	PFAM	PF00295	Glyco_hydro_28	coord: 129..414, score: 5.4
IPR006626	Parallel beta-helix repeat	SMART	SM00710	pbh1	coord: 293..314, score: 2600.0; coord: 356..378, score: 4700.0; coord: 204..230, score: 5100.0; coord: 265..287, score: 780.0; coord: 322..343, score: 6800.0; coord: 231..252, score: 930.0; coord: 181..203, score: 17
IPR011050	Pectin lyase fold/virulence factor	unknown	SSF51126	Pectin lyase-like	coord: 32..429, score: 1.08
IPR012334	Pectin lyase fold	GENE3D	G3DSA:2.160.20.10		coord: 32..416, score: 1.6E
None	No IPR available	PANTHER	PTHR31339	FAMILY NOT NAMED	coord: 15..437, score: 2.9E
None	No IPR available	PANTHER	PTHR31339:SF15	PECTIN LYASE-LIKE SUPERFAMILY PROTEIN-RELATED	coord: 15..437, score: 2.9E
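
The seven SMART SM00710 (pbh1) hits in the InterPro table above are not listed in sequence order. The following is a minimal sketch, assuming Python and assuming the coordinates are inclusive protein residue positions (the record does not state this), that sorts the hits along the protein and checks that they do not overlap. The function names `order_repeats` and `overlaps` are hypothetical helpers introduced only for this illustration.

```python
# (start, end) coordinates of the SMART SM00710 (pbh1) hits,
# in the order they appear in the InterPro table.
pbh1 = [(293, 314), (356, 378), (204, 230), (265, 287),
        (322, 343), (231, 252), (181, 203)]

def order_repeats(hits):
    """Sort hits by start and return (start, end, length) tuples.

    Length assumes inclusive coordinates (an assumption, see lead-in).
    """
    return [(s, e, e - s + 1) for s, e in sorted(hits)]

def overlaps(hits):
    """Return True if any two consecutive sorted hits share a residue."""
    ordered = sorted(hits)
    return any(nxt[0] <= cur[1] for cur, nxt in zip(ordered, ordered[1:]))

for start, end, length in order_repeats(pbh1):
    print(f"{start}..{end}\t{length} aa")
print("overlapping:", overlaps(pbh1))
```

Sorted this way, the repeats all fall inside the 129..414 range reported for the PF00295 hit.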

The following gene(s) are paralogous to this gene:

None