Cp4.1LG20g01490 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g01490
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pathogenesis-related thaumatin family protein
Location: Cp4.1LG20 : 838668 .. 840643 (-)
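The location string above packs the landmark, coordinates, and strand into one field. The sketch below shows one way to split it and compute the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'landmark : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    landmark, start, end, strand = m.groups()
    return landmark, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

landmark, start, end, strand = parse_location("Cp4.1LG20 : 838668 .. 840643 (-)")
# Under the inclusive assumption the span covers 1976 positions.
print(landmark, span_length(start, end), strand)
```

The "(-)" strand flag means the gene is annotated on the reverse strand of the landmark.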
The following sequences are available for this feature:

Gene sequence (with intron)

AACTAATAGCAAAAGTTATTGATAAATTTTATCCACAAAATCAAATTTATAATCAGGAAAATAAAGTTTTGGTTCTCGAGGAAGAATCTAAAATGGGTAGCAGTCATTAGTAGGTTCATTGGAGAAAGTGAAGTGGTTTACAGTTTTCTGAAGAATGGAGGAGAGAGGTCATATACCAGTAGGCCACTTCATACTTTTATTCAAAACAGATTCCCTTGACATGTGAAGGAGTTTCCTTTTTGTAAAATGAACACAGCCAAAGCCCCTCCTGTGCTTCTGACATCTTTGGGAACTGGGCAACTTTGTCATCATTTATCCCTCTCTCTAACCCCGCTCTAAAATAAATATGGTCTTTTTTGAACTTTTACTCTTGAATATTATCTTTGGAGAGAGTTTTTTACCATCTTACAATGAATAATTCGTTCTCTAAATCAATCCGAATCCACTTTTTGAGTGCGCCAGGTCTCTCTCCTCCTAATTTTACTCCATTTCCCCTCTGACCTGCATTGTCTATTTCTCTATATAAGCACTCATATCCATCAACTTTTTGAGGGAGCTTGAGCAGAAGAAAGACATACTCTCTCTAACTCTCTATAACACTATCAGTCATGATGGTTTATCTGAGATCTCTTGTGGTTCTCACTCTCTACACCCTCTTGACTTCCTACAATTCAGGTAACTGACTCACTAATGAAATCTTCGGGTTATGTTTTGTGTTCTTAGATTCTAAACAATGTGACTCAACTCCATGGTTTTTCTTCTTGCATTTGCTACAGTTTTCGTCTCTGCAACTACAATAACATTCTACAACAAATGCCCTCATCCAGTTTGGCCTGGCATTCAGCCCAGTGCTGGCAAGCCCTTGTTGGCTCGCGGAGGCTTCAAGCTCCCACCCAACAAGGGCTACTCTCTACGCCTTCCAGCCCTGTGGTCTGGCCGTTTCTGGGGCCGCCATGGCTGTGCTTTCGACGCATCTGGCAGAGGCAAGTGCGCTACAGGGGATTGTGGTGGTGCCCTCTACTGCAATGGCATTGGAGGTGCCCCACCAGCCACTCTTGCTGAGATTACACTTGGAAATGACCAAGATTTCTATGATGTGAGCCTTGTTGATGGCTACAATCTGGCTATGTCCATTACCCCGTCTAAAGGCTCCGGGAAATGTAGCTATGCTGGGTGTGTGAGTGACCTTAACATGATCTGTCCTGTGGGCTTACAAGTTAGATCTCACGATGACCACAGAGTGGTGGCCTGCAAAAGTGCTTGCTCTGCATTCAATTCTCCAAGGTATTGCTGCACAGGCAGGTTTGGTAATCCGCAGTCCTGCAAGCCAACCGCATATTCCAAGATCTTCAAGACCGCTTGTCCTAAGGCTTACTCTTATGCTTATGATGATCCCACTAGCATTGCAACTTGCACTGGTGGCAACTATTTGGTTACTTTTTGCCCTCATCACAGTTAGATGCACTTTAAGCAAAAGAAAAAAAAAAAACGTTTATTATGATACTTTTGATAGGAGTTTGTTTATGGGGAATAGTGGCTTTGACTAATGGACTTGTATTGATGCCATCTCTCTCTCTCTCTCTCTATTGTTACTAGGTATTAATAGGCTTTTGGCCGTTGGGGCCTTGTATTAATGGCTGGCTTTTGGTTCTTGTGGACTATGACTTATGAGTAAAAGGGAACTAAGAAGCTGATTAGCTTGCAAGAATCGCAGTCTCTTGTCCCAATTGCTAATATGGAAATGACAGTATTGTATTACATGAGGTGAAAAGCTGTGCTGGGATAGTGTGAGTGGATTTCTAGTGTGTTCCGTTGGCTATATCTTTATAAGGTAAAGCATGCCAGCTGGGTCAGCCATCCAATGGCAGTGCTTTTCACATGAAAATATATTTAAAAAGGGAGCCATTTTCCACTCTACTTGACACACTTGCTTTTCTTTTAAAAAAAAGTTCACCTAAACACTACTTTTTCATTTA

mRNA sequence

AACTAATAGCAAAAGTTATTGATAAATTTTATCCACAAAATCAAATTTATAATCAGGAAAATAAAGTTTTGGTTCTCGAGGAAGAATCTAAAATGGGTAGCAGTCATTAGTAGGTTCATTGGAGAAAGTGAAGTGGTTTACAGTTTTCTGAAGAATGGAGGAGAGAGGTCATATACCAGTAGGCCACTTCATACTTTTATTCAAAACAGATTCCCTTGACATGTGAAGGAGTTTCCTTTTTGTAAAATGAACACAGCCAAAGCCCCTCCTGTGCTTCTGACATCTTTGGGAACTGGGCAACTTTGTCATCATTTATCCCTCTCTCTAACCCCGCTCTAAAATAAATATGGTCTTTTTTGAACTTTTACTCTTGAATATTATCTTTGGAGAGAGTTTTTTACCATCTTACAATGAATAATTCGTTCTCTAAATCAATCCGAATCCACTTTTTGAGTGCGCCAGGTCTCTCTCCTCCTAATTTTACTCCATTTCCCCTCTGACCTGCATTGTCTATTTCTCTATATAAGCACTCATATCCATCAACTTTTTGAGGGAGCTTGAGCAGAAGAAAGACATACTCTCTCTAACTCTCTATAACACTATCAGTCATGATGGTTTATCTGAGATCTCTTGTGGTTCTCACTCTCTACACCCTCTTGACTTCCTACAATTCAGTTTTCGTCTCTGCAACTACAATAACATTCTACAACAAATGCCCTCATCCAGTTTGGCCTGGCATTCAGCCCAGTGCTGGCAAGCCCTTGTTGGCTCGCGGAGGCTTCAAGCTCCCACCCAACAAGGGCTACTCTCTACGCCTTCCAGCCCTGTGGTCTGGCCGTTTCTGGGGCCGCCATGGCTGTGCTTTCGACGCATCTGGCAGAGGCAAGTGCGCTACAGGGGATTGTGGTGGTGCCCTCTACTGCAATGGCATTGGAGGTGCCCCACCAGCCACTCTTGCTGAGATTACACTTGGAAATGACCAAGATTTCTATGATGTGAGCCTTGTTGATGGCTACAATCTGGCTATGTCCATTACCCCGTCTAAAGGCTCCGGGAAATGTAGCTATGCTGGGTGTGTGAGTGACCTTAACATGATCTGTCCTGTGGGCTTACAAGTTAGATCTCACGATGACCACAGAGTGGTGGCCTGCAAAAGTGCTTGCTCTGCATTCAATTCTCCAAGGTATTGCTGCACAGGCAGGTTTGGTAATCCGCAGTCCTGCAAGCCAACCGCATATTCCAAGATCTTCAAGACCGCTTGTCCTAAGGCTTACTCTTATGCTTATGATGATCCCACTAGCATTGCAACTTGCACTGGTGGCAACTATTTGGTTACTTTTTGCCCTCATCACAGTTAGATGCACTTTAAGCAAAAGAAAAAAAAAAAACGTTTATTATGATACTTTTGATAGGAGTTTGTTTATGGGGAATAGTGGCTTTGACTAATGGACTTGTATTGATGCCATCTCTCTCTCTCTCTCTCTATTGTTACTAGGTATTAATAGGCTTTTGGCCGTTGGGGCCTTGTATTAATGGCTGGCTTTTGGTTCTTGTGGACTATGACTTATGAGTAAAAGGGAACTAAGAAGCTGATTAGCTTGCAAGAATCGCAGTCTCTTGTCCCAATTGCTAATATGGAAATGACAGTATTGTATTACATGAGGTGAAAAGCTGTGCTGGGATAGTGTGAGTGGATTTCTAGTGTGTTCCGTTGGCTATATCTTTATAAGGTAAAGCATGCCAGCTGGGTCAGCCATCCAATGGCAGTGCTTTTCACATGAAAATATATTTAAAAAGGGAGCCATTTTCCACTCTACTTGACACACTTGCTTTTCTTTTAAAAAAAAGTTCACCTAAACACTACTTTTTCATTTA

Coding sequence (CDS)

ATGATGGTTTATCTGAGATCTCTTGTGGTTCTCACTCTCTACACCCTCTTGACTTCCTACAATTCAGTTTTCGTCTCTGCAACTACAATAACATTCTACAACAAATGCCCTCATCCAGTTTGGCCTGGCATTCAGCCCAGTGCTGGCAAGCCCTTGTTGGCTCGCGGAGGCTTCAAGCTCCCACCCAACAAGGGCTACTCTCTACGCCTTCCAGCCCTGTGGTCTGGCCGTTTCTGGGGCCGCCATGGCTGTGCTTTCGACGCATCTGGCAGAGGCAAGTGCGCTACAGGGGATTGTGGTGGTGCCCTCTACTGCAATGGCATTGGAGGTGCCCCACCAGCCACTCTTGCTGAGATTACACTTGGAAATGACCAAGATTTCTATGATGTGAGCCTTGTTGATGGCTACAATCTGGCTATGTCCATTACCCCGTCTAAAGGCTCCGGGAAATGTAGCTATGCTGGGTGTGTGAGTGACCTTAACATGATCTGTCCTGTGGGCTTACAAGTTAGATCTCACGATGACCACAGAGTGGTGGCCTGCAAAAGTGCTTGCTCTGCATTCAATTCTCCAAGGTATTGCTGCACAGGCAGGTTTGGTAATCCGCAGTCCTGCAAGCCAACCGCATATTCCAAGATCTTCAAGACCGCTTGTCCTAAGGCTTACTCTTATGCTTATGATGATCCCACTAGCATTGCAACTTGCACTGGTGGCAACTATTTGGTTACTTTTTGCCCTCATCACAGTTAG

Protein sequence

MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITLGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLVTFCPHHS
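The protein sequence can be cross-checked against the coding sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, not the tool the database used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGATGGTTTATCTGAGATCT"))  # MMVYLRS
# Last four codons of the CDS, ending in the TAG stop codon.
print(translate("CATCACAGTTAG"))  # HHS
```

Passing the full CDS string to `translate` should reproduce the listed protein if the annotation is consistent.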
BLAST of Cp4.1LG20g01490 vs. Swiss-Prot
Match: TLPH_ARATH (Thaumatin-like protein OS=Arabidopsis thaliana GN=At1g18250 PE=2 SV=2)

HSP 1 Score: 402.5 bits (1033), Expect = 3.3e-111
Identity = 189/238 (79.41%), Positives = 204/238 (85.71%), Query Frame = 1

Query: 11  LTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGYSLRL 70
           L  + LL S+     SA+T+ FYNKC HPVWPGIQPSAG+ LLA GGFKLP NK +SL+L
Sbjct: 8   LFAFLLLLSH----ASASTVIFYNKCKHPVWPGIQPSAGQNLLAGGGFKLPANKAHSLQL 67

Query: 71  PALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITLGNDQDFYDV 130
           P LWSGRFWGRHGC FD SGRG CATGDCGG+L CNG GG PPATLAEITLG + DFYDV
Sbjct: 68  PPLWSGRFWGRHGCTFDRSGRGHCATGDCGGSLSCNGAGGEPPATLAEITLGPELDFYDV 127

Query: 131 SLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKSACSAFNS 190
           SLVDGYNLAMSI P KGSG+CSYAGCVSDLN +CPVGLQVRS +  RVVACKSACSAFNS
Sbjct: 128 SLVDGYNLAMSIMPVKGSGQCSYAGCVSDLNQMCPVGLQVRSRNGKRVVACKSACSAFNS 187

Query: 191 PRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLVTFCPHH 249
           P+YCCTG FGNPQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATC+  NY+VTFCPHH
Sbjct: 188 PQYCCTGLFGNPQSCKPTAYSKIFKVACPKAYSYAYDDPTSIATCSKANYIVTFCPHH 241
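The percentages in the HSP header are ratios of matching columns to the alignment length. A minimal sketch of that arithmetic, using the counts reported for the TLPH_ARATH hit above (`percent` is a hypothetical helper name):

```python
def percent(matches, aligned_length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header of the TLPH_ARATH alignment.
print(percent(189, 238))  # 79.41 (identical residues)
print(percent(204, 238))  # 85.71 (identical or conservatively substituted)
```

The denominator is the number of alignment columns, gaps included, not the full length of either protein.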

BLAST of Cp4.1LG20g01490 vs. Swiss-Prot
Match: PR5_ARATH (Pathogenesis-related protein 5 OS=Arabidopsis thaliana GN=At1g75040 PE=1 SV=1)

HSP 1 Score: 267.7 bits (683), Expect = 1.3e-70
Identity = 143/247 (57.89%), Positives = 161/247 (65.18%), Query Frame = 1

Query: 2   MVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLP 61
           M  + S+ +L L   +TS   + V AT  T  N CP  VW G     G P L  GGF+L 
Sbjct: 1   MANISSIHILFL-VFITS--GIAVMATDFTLRNNCPTTVWAGTLAGQG-PKLGDGGFELT 60

Query: 62  PNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITL 121
           P     L  PA WSGRFW R GC FDASG G+C TGDCGG L CNG GG PP TLAE TL
Sbjct: 61  PGASRQLTAPAGWSGRFWARTGCNFDASGNGRCVTGDCGG-LRCNG-GGVPPVTLAEFTL 120

Query: 122 GND--QDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVV 181
             D  +DFYDVSLVDGYN+ + I PS GSG C YAGCVSDLN  CP  L+V   D + VV
Sbjct: 121 VGDGGKDFYDVSLVDGYNVKLGIRPSGGSGDCKYAGCVSDLNAACPDMLKVM--DQNNVV 180

Query: 182 ACKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGN 241
           ACKSAC  FN+ +YCC G    P++C PT YS+IFK ACP AYSYAYDD TS  TCTG N
Sbjct: 181 ACKSACERFNTDQYCCRGANDKPETCPPTDYSRIFKNACPDAYSYAYDDETSTFTCTGAN 239

Query: 242 YLVTFCP 247
           Y +TFCP
Sbjct: 241 YEITFCP 239

BLAST of Cp4.1LG20g01490 vs. Swiss-Prot
Match: TLP1_PYRPY (Thaumatin-like protein 1 OS=Pyrus pyrifolia GN=TL1 PE=1 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 1.5e-66
Identity = 119/226 (52.65%), Positives = 150/226 (66.37%), Query Frame = 1

Query: 25  VSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGYSLRLPALWSGRFWGRHGC 84
           V +   TF NKCP+ VWPG     G P L   GF+L      SL + A WSGRFWGR  C
Sbjct: 20  VYSAKFTFTNKCPNTVWPGTLTGGGGPQLLSTGFELASGASTSLTVQAPWSGRFWGRSHC 79

Query: 85  AFDASGRGKCATGDCG-GALYCNGIGGAPPATLAEITLGND--QDFYDVSLVDGYNLAMS 144
           + D+SG+ KC+TGDCG G + CNG G +PPA+L E+TL  +  QDFYDVSLVDG+NL + 
Sbjct: 80  SIDSSGKFKCSTGDCGSGQISCNGAGASPPASLVELTLATNGGQDFYDVSLVDGFNLPIK 139

Query: 145 ITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKSACSAFNSPRYCCTGRFGN 204
           + P  GSG C+   C +++N +CP  L  +  D   V+ CKSAC A N P+YCCTG +G 
Sbjct: 140 LAPRGGSGDCNSTSCAANINTVCPAELSDKGSDGS-VIGCKSACLALNQPQYCCTGAYGT 199

Query: 205 PQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGG-NYLVTFCP 247
           P +C PT +SK+FK  CP+AYSYAYDD +S  TC GG NY +TFCP
Sbjct: 200 PDTCPPTDFSKVFKNQCPQAYSYAYDDKSSTFTCFGGPNYEITFCP 244

BLAST of Cp4.1LG20g01490 vs. Swiss-Prot
Match: TLP1_PRUPE (Thaumatin-like protein 1 OS=Prunus persica PE=2 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 1.6e-65
Identity = 128/237 (54.01%), Positives = 159/237 (67.09%), Query Frame = 1

Query: 17  LTSYNSVFVS---ATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGYSLRLPAL 76
           LT+   +F S   A  ITF NKC + VWPG      KP L+  GF+L      S+  P+ 
Sbjct: 11  LTTLAILFFSGAHAAKITFTNKCSYTVWPGTLTGDQKPQLSLTGFELATGISRSVDAPSP 70

Query: 77  WSGRFWGRHGCAFDASGRGKCATGDCG-GALYCNGIGGAPPATLAEITLGND--QDFYDV 136
           WSGRF+GR  C+ DASG+  CAT DCG G + CNG G APPATL EIT+ ++  QDFYDV
Sbjct: 71  WSGRFFGRTRCSTDASGKFTCATADCGSGQVSCNGNGAAPPATLVEITIASNGGQDFYDV 130

Query: 137 SLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKSACSAFNS 196
           SLVDG+NL MS+ P  G+GKC  + C +D+N +CP  LQV+  D   V+ACKSAC AFN 
Sbjct: 131 SLVDGFNLPMSVAPQGGTGKCKASTCPADINKVCPAPLQVKGSDGS-VIACKSACLAFNQ 190

Query: 197 PRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTG-GNYLVTFCP 247
           P+YCCT     P++C P  YSK+FKT CP+AYSYAYDD +S  TC+G   YL+TFCP
Sbjct: 191 PKYCCTPPNDKPETCPPPDYSKLFKTQCPQAYSYAYDDKSSTFTCSGRPAYLITFCP 246

BLAST of Cp4.1LG20g01490 vs. Swiss-Prot
Match: TLP_PRUAV (Glucan endo-1,3-beta-glucosidase OS=Prunus avium PE=1 SV=1)

HSP 1 Score: 243.4 bits (620), Expect = 2.6e-63
Identity = 126/243 (51.85%), Positives = 153/243 (62.96%), Query Frame = 1

Query: 8   LVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGYS 67
           +VVL+L   + S+      A TI+F N CP+ VWPG   S  KP L+  GF+L     + 
Sbjct: 6   VVVLSLSLTILSFGGAH--AATISFKNNCPYMVWPGTLTSDQKPQLSTTGFELASQASFQ 65

Query: 68  LRLPALWSGRFWGRHGCAFDASGRGKCATGDC-GGALYCNGIGGAPPATLAE--ITLGND 127
           L  P  W+GRFW R GC+ DASG+  CAT DC  G + CNG G  PPATLAE  I  G  
Sbjct: 66  LDTPVPWNGRFWARTGCSTDASGKFVCATADCASGQVMCNGNGAIPPATLAEFNIPAGGG 125

Query: 128 QDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKSA 187
           QDFYDVSLVDG+NL MS+TP  G+G C  A C +++N +CP  LQ +   D  VVAC SA
Sbjct: 126 QDFYDVSLVDGFNLPMSVTPQGGTGDCKTASCPANVNAVCPSELQ-KKGSDGSVVACLSA 185

Query: 188 CSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGG-NYLVT 247
           C  F +P+YCCT     P++C PT YS+IF  ACP AYSYAYDD     TC GG NY +T
Sbjct: 186 CVKFGTPQYCCTPPQNTPETCPPTNYSEIFHNACPDAYSYAYDDKRGTFTCNGGPNYAIT 245

BLAST of Cp4.1LG20g01490 vs. TrEMBL
Match: A0A0A0LT35_CUCSA (Protein P21 OS=Cucumis sativus GN=Csa_1G042760 PE=4 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 3.3e-134
Identity = 227/246 (92.28%), Positives = 236/246 (95.93%), Query Frame = 1

Query: 3   VYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPP 62
           V LRS+V LTLYTL TS+ SV VSATTITFYNKC HPVWPGIQPSAGKPLLARGGFKLPP
Sbjct: 4   VSLRSVVALTLYTLFTSHISVLVSATTITFYNKCSHPVWPGIQPSAGKPLLARGGFKLPP 63

Query: 63  NKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITLG 122
           NK Y+L+LPALWSGRFWGRHGCAFDASGRGKCATGDCGG+L+CNGIGG PPATLAEITLG
Sbjct: 64  NKSYNLQLPALWSGRFWGRHGCAFDASGRGKCATGDCGGSLFCNGIGGTPPATLAEITLG 123

Query: 123 NDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACK 182
           NDQDFYDVSLVDGYNLA+SITPSKGSGKCSYAGCVSDLNM+CPVGLQVRSHD+ RVVACK
Sbjct: 124 NDQDFYDVSLVDGYNLAISITPSKGSGKCSYAGCVSDLNMMCPVGLQVRSHDNRRVVACK 183

Query: 183 SACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLV 242
           SAC AFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLV
Sbjct: 184 SACFAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLV 243

Query: 243 TFCPHH 249
           TFCPHH
Sbjct: 244 TFCPHH 249

BLAST of Cp4.1LG20g01490 vs. TrEMBL
Match: B9T3K3_RICCO (Protein P21, putative OS=Ricinus communis GN=RCOM_0337100 PE=4 SV=1)

HSP 1 Score: 456.8 bits (1174), Expect = 1.6e-125
Identity = 215/248 (86.69%), Positives = 229/248 (92.34%), Query Frame = 1

Query: 1   MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 60
           M++ LRSL+ L L TL+ S+ S  VS+TTIT YNKC HPVWPGIQPSAGKPLLARGGFKL
Sbjct: 1   MVIVLRSLLTLMLSTLIFSHISE-VSSTTITLYNKCTHPVWPGIQPSAGKPLLARGGFKL 60

Query: 61  PPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEIT 120
           PPNK YSL+LP LWSGRFWGRHGC+FD SGRG+CATGDCGGAL+CNGIGG PPATLAEIT
Sbjct: 61  PPNKAYSLKLPPLWSGRFWGRHGCSFDGSGRGRCATGDCGGALFCNGIGGTPPATLAEIT 120

Query: 121 LGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVA 180
           LGNDQDFYDVSLVDGYNLAMSITP KGSGKCSYAGCVSDLN++CPVGLQVRS D+ RVVA
Sbjct: 121 LGNDQDFYDVSLVDGYNLAMSITPFKGSGKCSYAGCVSDLNLMCPVGLQVRSKDNRRVVA 180

Query: 181 CKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 240
           CKSACSAFNSPRYCCTG+FGNPQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATCT GNY
Sbjct: 181 CKSACSAFNSPRYCCTGKFGNPQSCKPTAYSKIFKAACPKAYSYAYDDPTSIATCTRGNY 240

Query: 241 LVTFCPHH 249
           LVTFCPHH
Sbjct: 241 LVTFCPHH 247

BLAST of Cp4.1LG20g01490 vs. TrEMBL
Match: A0A061FUV3_THECC (Pathogenesis-related thaumatin superfamily protein OS=Theobroma cacao GN=TCM_012750 PE=4 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 3.7e-125
Identity = 212/247 (85.83%), Positives = 229/247 (92.71%), Query Frame = 1

Query: 1   MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 60
           M   LRSL+  TL+TLL S+ SV VSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL
Sbjct: 27  MEAMLRSLLTFTLFTLLFSHISVEVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 86

Query: 61  PPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEIT 120
           PPNK YS+RLP LWSGRFWGRHGC+FDASGRG+CATGDCGG+L+CNG+GGAPPATLAEIT
Sbjct: 87  PPNKAYSMRLPPLWSGRFWGRHGCSFDASGRGRCATGDCGGSLFCNGLGGAPPATLAEIT 146

Query: 121 LGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVA 180
           LG +QDFYDVSLVDGYN+AMSITP KGSGKCSYAGCVSDLN++CPVGLQVRS D+ RV+A
Sbjct: 147 LGQEQDFYDVSLVDGYNIAMSITPFKGSGKCSYAGCVSDLNLMCPVGLQVRSRDNKRVLA 206

Query: 181 CKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 240
           CKSAC AFNSPRYCCTG FG+PQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATCT G+Y
Sbjct: 207 CKSACFAFNSPRYCCTGSFGSPQSCKPTAYSKIFKAACPKAYSYAYDDPTSIATCTRGSY 266

Query: 241 LVTFCPH 248
           LVTFCPH
Sbjct: 267 LVTFCPH 273

BLAST of Cp4.1LG20g01490 vs. TrEMBL
Match: A0A0B0MT31_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_27474 PE=4 SV=1)

HSP 1 Score: 451.8 bits (1161), Expect = 5.3e-124
Identity = 213/247 (86.23%), Positives = 225/247 (91.09%), Query Frame = 1

Query: 1   MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 60
           M   LRSL+  TL+TLL SY    VSATTIT YNKCPHPVWPGIQPSAGKPLLARGGFKL
Sbjct: 1   MATMLRSLLTFTLFTLLFSY----VSATTITLYNKCPHPVWPGIQPSAGKPLLARGGFKL 60

Query: 61  PPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEIT 120
            PNK YS+RLP LWSGRFWGRHGC+FDASGRG+CATGDCGG+L+CNG+GGAPPATLAEIT
Sbjct: 61  RPNKAYSMRLPPLWSGRFWGRHGCSFDASGRGRCATGDCGGSLFCNGLGGAPPATLAEIT 120

Query: 121 LGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVA 180
           LG DQDFYDVSLVDGYN+AMSITP KGSGKCSYAGCVSDLN++CPVGLQVRS D+ RVVA
Sbjct: 121 LGQDQDFYDVSLVDGYNIAMSITPFKGSGKCSYAGCVSDLNLMCPVGLQVRSKDNKRVVA 180

Query: 181 CKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 240
           CKSAC AFNSPRYCCTG FGNPQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATCT GNY
Sbjct: 181 CKSACFAFNSPRYCCTGTFGNPQSCKPTAYSKIFKAACPKAYSYAYDDPTSIATCTRGNY 240

Query: 241 LVTFCPH 248
           LVTFCPH
Sbjct: 241 LVTFCPH 243

BLAST of Cp4.1LG20g01490 vs. TrEMBL
Match: A0A068TSM5_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00022420001 PE=4 SV=1)

HSP 1 Score: 451.4 bits (1160), Expect = 6.9e-124
Identity = 210/248 (84.68%), Positives = 227/248 (91.53%), Query Frame = 1

Query: 1   MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 60
           M   LR+L+ L L +L    +SV VSATT+TFYNKC HPVWPGIQPSAGKP+LARGGFKL
Sbjct: 1   MDAMLRNLLALLLLSLHLLDHSVEVSATTMTFYNKCSHPVWPGIQPSAGKPILARGGFKL 60

Query: 61  PPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEIT 120
           PPN+ YSLRLPA WSGRFWGRHGC+FDASGRG+CATGDCGG+L+CNG+GG PPATLAEIT
Sbjct: 61  PPNRSYSLRLPAGWSGRFWGRHGCSFDASGRGRCATGDCGGSLFCNGLGGTPPATLAEIT 120

Query: 121 LGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVA 180
           LGN+QDFYDVSLVDGYNLA+SITP +GSGKCSYAGCVSDLNM+CPVGLQVRSHD  RVVA
Sbjct: 121 LGNEQDFYDVSLVDGYNLAISITPFRGSGKCSYAGCVSDLNMMCPVGLQVRSHDKRRVVA 180

Query: 181 CKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 240
           CKSAC AFNSPRYCCTG FGNPQSCKPTAYSKIFK ACP+AYSYAYDDPTSIATCTGGNY
Sbjct: 181 CKSACFAFNSPRYCCTGSFGNPQSCKPTAYSKIFKAACPRAYSYAYDDPTSIATCTGGNY 240

Query: 241 LVTFCPHH 249
           LVTFCPHH
Sbjct: 241 LVTFCPHH 248

BLAST of Cp4.1LG20g01490 vs. TAIR10
Match: AT1G18250.2 (AT1G18250.2 Pathogenesis-related thaumatin superfamily protein)

HSP 1 Score: 403.3 bits (1035), Expect = 1.1e-112
Identity = 189/238 (79.41%), Positives = 205/238 (86.13%), Query Frame = 1

Query: 11  LTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGYSLRL 70
           L  + LL S+ S   +A+T+ FYNKC HPVWPGIQPSAG+ LLA GGFKLP NK +SL+L
Sbjct: 8   LFAFLLLLSHAS---AASTVIFYNKCKHPVWPGIQPSAGQNLLAGGGFKLPANKAHSLQL 67

Query: 71  PALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITLGNDQDFYDV 130
           P LWSGRFWGRHGC FD SGRG CATGDCGG+L CNG GG PPATLAEITLG + DFYDV
Sbjct: 68  PPLWSGRFWGRHGCTFDRSGRGHCATGDCGGSLSCNGAGGEPPATLAEITLGPELDFYDV 127

Query: 131 SLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKSACSAFNS 190
           SLVDGYNLAMSI P KGSG+CSYAGCVSDLN +CPVGLQVRS +  RVVACKSACSAFNS
Sbjct: 128 SLVDGYNLAMSIMPVKGSGQCSYAGCVSDLNQMCPVGLQVRSRNGKRVVACKSACSAFNS 187

Query: 191 PRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLVTFCPHH 249
           P+YCCTG FGNPQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATC+  NY+VTFCPHH
Sbjct: 188 PQYCCTGLFGNPQSCKPTAYSKIFKVACPKAYSYAYDDPTSIATCSKANYIVTFCPHH 242

BLAST of Cp4.1LG20g01490 vs. TAIR10
Match: AT1G73620.1 (AT1G73620.1 Pathogenesis-related thaumatin superfamily protein)

HSP 1 Score: 396.7 bits (1018), Expect = 1.0e-110
Identity = 185/241 (76.76%), Positives = 206/241 (85.48%), Query Frame = 1

Query: 7   SLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGY 66
           SL +L L  L++       SA+T+ FYNKC + VWPGIQPS+G+ LLA GGFKL PN+ Y
Sbjct: 26  SLFLLPLLLLVSQ-----ASASTVIFYNKCTYTVWPGIQPSSGQSLLAGGGFKLSPNRAY 85

Query: 67  SLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITLGNDQD 126
           +L+LP LWSGRFWGRHGC+FD SGRG+CATGDCGG+  CNG GG PPATLAEITLG+D D
Sbjct: 86  TLQLPPLWSGRFWGRHGCSFDRSGRGRCATGDCGGSFLCNGAGGVPPATLAEITLGHDMD 145

Query: 127 FYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKSACS 186
           FYDVSLVDGYNLAMSI P KG+GKC+YAGCVSDLN +CPVGLQVRS D  +VVACKSACS
Sbjct: 146 FYDVSLVDGYNLAMSIMPVKGTGKCTYAGCVSDLNRMCPVGLQVRSRDGTQVVACKSACS 205

Query: 187 AFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLVTFCP 246
           AFNSPRYCCTG FGNPQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATC+  NY+VTFCP
Sbjct: 206 AFNSPRYCCTGLFGNPQSCKPTAYSKIFKVACPKAYSYAYDDPTSIATCSKANYVVTFCP 261

Query: 247 H 248
           H
Sbjct: 266 H 261

BLAST of Cp4.1LG20g01490 vs. TAIR10
Match: AT1G75050.1 (AT1G75050.1 Pathogenesis-related thaumatin superfamily protein)

HSP 1 Score: 278.9 bits (712), Expect = 3.1e-75
Identity = 133/223 (59.64%), Positives = 150/223 (67.26%), Query Frame = 1

Query: 26  SATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGYSLRLPALWSGRFWGRHGCA 85
           +AT  T  N CP+ VWPGI  S     L  GGF L P     L  PA WSGRFW R GC 
Sbjct: 23  AATVFTLQNSCPYTVWPGIL-SGNDNTLGDGGFPLTPGASVQLTAPAGWSGRFWARTGCN 82

Query: 86  FDASGRGKCATGDCGGALYCNGIGGAPPATLAEITLGND--QDFYDVSLVDGYNLAMSIT 145
           FDASG G C TGDCGG L CNG GG PP TLAE TL  D  +DFYDVSLVDGYN+ M I 
Sbjct: 83  FDASGHGNCGTGDCGGVLKCNG-GGVPPVTLAEFTLVGDGGKDFYDVSLVDGYNVEMGIK 142

Query: 146 PSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKSACSAFNSPRYCCTGRFGNPQ 205
           P  GSG C YAGCV+D+N +CP  L++       + ACKSAC+AFNS  +CCTG    PQ
Sbjct: 143 PQGGSGDCHYAGCVADVNAVCPNELRLMDPHTGIIAACKSACAAFNSEEFCCTGAHATPQ 202

Query: 206 SCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLVTFCP 247
           +C PT YS +FK+ACP AYSYAYDD TS  TCTG NYL++FCP
Sbjct: 203 TCSPTHYSAMFKSACPGAYSYAYDDATSTFTCTGSNYLISFCP 243

BLAST of Cp4.1LG20g01490 vs. TAIR10
Match: AT1G75030.1 (AT1G75030.1 thaumatin-like protein 3)

HSP 1 Score: 274.2 bits (700), Expect = 7.7e-74
Identity = 136/246 (55.28%), Positives = 156/246 (63.41%), Query Frame = 1

Query: 7   SLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPPNKGY 66
           S + +  +  +TS   +  SAT  T  N C + VWPG   S     L  GGF L P    
Sbjct: 5   SSIHILFFVFITS--GIADSATVFTLQNSCAYTVWPGTL-SGNSITLGDGGFPLTPGASV 64

Query: 67  SLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITLGNDQ- 126
            L  P  WSGRFW R GC FDASG G C TGDCGG L C G GG PPATLAE T+G+   
Sbjct: 65  QLTAPTGWSGRFWARTGCNFDASGHGTCVTGDCGGVLKCTG-GGVPPATLAEFTVGSSNA 124

Query: 127 --DFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACKS 186
             DFYDVSLVDGYN+ M I P  G G C YAGCVSD+N ICP  L++   +   V ACKS
Sbjct: 125 GMDFYDVSLVDGYNVKMGIKPQGGFGNCKYAGCVSDINEICPSELRIMDPNSGSVAACKS 184

Query: 187 ACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLVT 246
           AC+AF+SP +CCTG    PQ+C PT YS +FK ACP AYSYAYDD +S  TCTG NYL+T
Sbjct: 185 ACAAFSSPEFCCTGAHATPQTCSPTYYSSMFKNACPSAYSYAYDDASSTFTCTGSNYLIT 244

Query: 247 FCPHHS 250
           FCP  S
Sbjct: 245 FCPTQS 246

BLAST of Cp4.1LG20g01490 vs. TAIR10
Match: AT1G75040.1 (AT1G75040.1 pathogenesis-related gene 5)

HSP 1 Score: 267.7 bits (683), Expect = 7.2e-72
Identity = 143/247 (57.89%), Positives = 161/247 (65.18%), Query Frame = 1

Query: 2   MVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLP 61
           M  + S+ +L L   +TS   + V AT  T  N CP  VW G     G P L  GGF+L 
Sbjct: 1   MANISSIHILFL-VFITS--GIAVMATDFTLRNNCPTTVWAGTLAGQG-PKLGDGGFELT 60

Query: 62  PNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITL 121
           P     L  PA WSGRFW R GC FDASG G+C TGDCGG L CNG GG PP TLAE TL
Sbjct: 61  PGASRQLTAPAGWSGRFWARTGCNFDASGNGRCVTGDCGG-LRCNG-GGVPPVTLAEFTL 120

Query: 122 GND--QDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVV 181
             D  +DFYDVSLVDGYN+ + I PS GSG C YAGCVSDLN  CP  L+V   D + VV
Sbjct: 121 VGDGGKDFYDVSLVDGYNVKLGIRPSGGSGDCKYAGCVSDLNAACPDMLKVM--DQNNVV 180

Query: 182 ACKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGN 241
           ACKSAC  FN+ +YCC G    P++C PT YS+IFK ACP AYSYAYDD TS  TCTG N
Sbjct: 181 ACKSACERFNTDQYCCRGANDKPETCPPTDYSRIFKNACPDAYSYAYDDETSTFTCTGAN 239

Query: 242 YLVTFCP 247
           Y +TFCP
Sbjct: 241 YEITFCP 239

BLAST of Cp4.1LG20g01490 vs. NCBI nr
Match: gi|449439451|ref|XP_004137499.1| (PREDICTED: thaumatin-like protein [Cucumis sativus])

HSP 1 Score: 485.7 bits (1249), Expect = 4.8e-134
Identity = 227/246 (92.28%), Positives = 236/246 (95.93%), Query Frame = 1

Query: 3   VYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKLPP 62
           V LRS+V LTLYTL TS+ SV VSATTITFYNKC HPVWPGIQPSAGKPLLARGGFKLPP
Sbjct: 4   VSLRSVVALTLYTLFTSHISVLVSATTITFYNKCSHPVWPGIQPSAGKPLLARGGFKLPP 63

Query: 63  NKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEITLG 122
           NK Y+L+LPALWSGRFWGRHGCAFDASGRGKCATGDCGG+L+CNGIGG PPATLAEITLG
Sbjct: 64  NKSYNLQLPALWSGRFWGRHGCAFDASGRGKCATGDCGGSLFCNGIGGTPPATLAEITLG 123

Query: 123 NDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVACK 182
           NDQDFYDVSLVDGYNLA+SITPSKGSGKCSYAGCVSDLNM+CPVGLQVRSHD+ RVVACK
Sbjct: 124 NDQDFYDVSLVDGYNLAISITPSKGSGKCSYAGCVSDLNMMCPVGLQVRSHDNRRVVACK 183

Query: 183 SACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLV 242
           SAC AFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLV
Sbjct: 184 SACFAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNYLV 243

Query: 243 TFCPHH 249
           TFCPHH
Sbjct: 244 TFCPHH 249

BLAST of Cp4.1LG20g01490 vs. NCBI nr
Match: gi|659066840|ref|XP_008463807.1| (PREDICTED: thaumatin-like protein [Cucumis melo])

HSP 1 Score: 484.6 bits (1246), Expect = 1.1e-133
Identity = 228/248 (91.94%), Positives = 235/248 (94.76%), Query Frame = 1

Query: 1   MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 60
           M V LRSLV LTLYTL  S+ SV VSATTIT YNKC HPVWPGIQPSAGKPLLARGGFKL
Sbjct: 2   MRVSLRSLVALTLYTLFASHISVLVSATTITLYNKCSHPVWPGIQPSAGKPLLARGGFKL 61

Query: 61  PPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEIT 120
           PPNK Y+L+LPALWSGRFWGRHGCAFDASGRGKCATGDCGGAL+CNGIGG PPATLAEIT
Sbjct: 62  PPNKAYTLQLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALFCNGIGGTPPATLAEIT 121

Query: 121 LGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVA 180
           LGNDQDFYDVSLVDGYNLA+SITPSKGSGKCSYAGCVSDLNM+CPVGLQVRSHD+ RVVA
Sbjct: 122 LGNDQDFYDVSLVDGYNLAISITPSKGSGKCSYAGCVSDLNMMCPVGLQVRSHDNRRVVA 181

Query: 181 CKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 240
           CKSAC AFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY
Sbjct: 182 CKSACFAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 241

Query: 241 LVTFCPHH 249
           LVTFCPHH
Sbjct: 242 LVTFCPHH 249

BLAST of Cp4.1LG20g01490 vs. NCBI nr
Match: gi|255584165|ref|XP_002532822.1| (PREDICTED: thaumatin-like protein [Ricinus communis])

HSP 1 Score: 456.8 bits (1174), Expect = 2.4e-125
Identity = 215/248 (86.69%), Positives = 229/248 (92.34%), Query Frame = 1

Query: 1   MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 60
           M++ LRSL+ L L TL+ S+ S  VS+TTIT YNKC HPVWPGIQPSAGKPLLARGGFKL
Sbjct: 1   MVIVLRSLLTLMLSTLIFSHISE-VSSTTITLYNKCTHPVWPGIQPSAGKPLLARGGFKL 60

Query: 61  PPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEIT 120
           PPNK YSL+LP LWSGRFWGRHGC+FD SGRG+CATGDCGGAL+CNGIGG PPATLAEIT
Sbjct: 61  PPNKAYSLKLPPLWSGRFWGRHGCSFDGSGRGRCATGDCGGALFCNGIGGTPPATLAEIT 120

Query: 121 LGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVA 180
           LGNDQDFYDVSLVDGYNLAMSITP KGSGKCSYAGCVSDLN++CPVGLQVRS D+ RVVA
Sbjct: 121 LGNDQDFYDVSLVDGYNLAMSITPFKGSGKCSYAGCVSDLNLMCPVGLQVRSKDNRRVVA 180

Query: 181 CKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 240
           CKSACSAFNSPRYCCTG+FGNPQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATCT GNY
Sbjct: 181 CKSACSAFNSPRYCCTGKFGNPQSCKPTAYSKIFKAACPKAYSYAYDDPTSIATCTRGNY 240

Query: 241 LVTFCPHH 249
           LVTFCPHH
Sbjct: 241 LVTFCPHH 247

BLAST of Cp4.1LG20g01490 vs. NCBI nr
Match: gi|590665636|ref|XP_007036794.1| (Pathogenesis-related thaumatin superfamily protein [Theobroma cacao])

HSP 1 Score: 455.7 bits (1171), Expect = 5.3e-125
Identity = 212/247 (85.83%), Positives = 229/247 (92.71%), Query Frame = 1

Query: 1   MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 60
           M   LRSL+  TL+TLL S+ SV VSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL
Sbjct: 27  MEAMLRSLLTFTLFTLLFSHISVEVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 86

Query: 61  PPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEIT 120
           PPNK YS+RLP LWSGRFWGRHGC+FDASGRG+CATGDCGG+L+CNG+GGAPPATLAEIT
Sbjct: 87  PPNKAYSMRLPPLWSGRFWGRHGCSFDASGRGRCATGDCGGSLFCNGLGGAPPATLAEIT 146

Query: 121 LGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVA 180
           LG +QDFYDVSLVDGYN+AMSITP KGSGKCSYAGCVSDLN++CPVGLQVRS D+ RV+A
Sbjct: 147 LGQEQDFYDVSLVDGYNIAMSITPFKGSGKCSYAGCVSDLNLMCPVGLQVRSRDNKRVLA 206

Query: 181 CKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 240
           CKSAC AFNSPRYCCTG FG+PQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATCT G+Y
Sbjct: 207 CKSACFAFNSPRYCCTGSFGSPQSCKPTAYSKIFKAACPKAYSYAYDDPTSIATCTRGSY 266

Query: 241 LVTFCPH 248
           LVTFCPH
Sbjct: 267 LVTFCPH 273

BLAST of Cp4.1LG20g01490 vs. NCBI nr
Match: gi|728820711|gb|KHG03900.1| (hypothetical protein F383_27474 [Gossypium arboreum])

HSP 1 Score: 451.8 bits (1161), Expect = 7.6e-124
Identity = 213/247 (86.23%), Positives = 225/247 (91.09%), Query Frame = 1

Query: 1   MMVYLRSLVVLTLYTLLTSYNSVFVSATTITFYNKCPHPVWPGIQPSAGKPLLARGGFKL 60
           M   LRSL+  TL+TLL SY    VSATTIT YNKCPHPVWPGIQPSAGKPLLARGGFKL
Sbjct: 1   MATMLRSLLTFTLFTLLFSY----VSATTITLYNKCPHPVWPGIQPSAGKPLLARGGFKL 60

Query: 61  PPNKGYSLRLPALWSGRFWGRHGCAFDASGRGKCATGDCGGALYCNGIGGAPPATLAEIT 120
            PNK YS+RLP LWSGRFWGRHGC+FDASGRG+CATGDCGG+L+CNG+GGAPPATLAEIT
Sbjct: 61  RPNKAYSMRLPPLWSGRFWGRHGCSFDASGRGRCATGDCGGSLFCNGLGGAPPATLAEIT 120

Query: 121 LGNDQDFYDVSLVDGYNLAMSITPSKGSGKCSYAGCVSDLNMICPVGLQVRSHDDHRVVA 180
           LG DQDFYDVSLVDGYN+AMSITP KGSGKCSYAGCVSDLN++CPVGLQVRS D+ RVVA
Sbjct: 121 LGQDQDFYDVSLVDGYNIAMSITPFKGSGKCSYAGCVSDLNLMCPVGLQVRSKDNKRVVA 180

Query: 181 CKSACSAFNSPRYCCTGRFGNPQSCKPTAYSKIFKTACPKAYSYAYDDPTSIATCTGGNY 240
           CKSAC AFNSPRYCCTG FGNPQSCKPTAYSKIFK ACPKAYSYAYDDPTSIATCT GNY
Sbjct: 181 CKSACFAFNSPRYCCTGTFGNPQSCKPTAYSKIFKAACPKAYSYAYDDPTSIATCTRGNY 240

Query: 241 LVTFCPH 248
           LVTFCPH
Sbjct: 241 LVTFCPH 243

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name  E-value   Identity  Description
TLPH_ARATH  3.3e-111  79.41     Thaumatin-like protein OS=Arabidopsis thaliana GN=At1g18250 PE=2 SV=2
PR5_ARATH   1.3e-70   57.89     Pathogenesis-related protein 5 OS=Arabidopsis thaliana GN=At1g75040 PE=1 SV=1
TLP1_PYRPY  1.5e-66   52.65     Thaumatin-like protein 1 OS=Pyrus pyrifolia GN=TL1 PE=1 SV=1
TLP1_PRUPE  1.6e-65   54.01     Thaumatin-like protein 1 OS=Prunus persica PE=2 SV=1
TLP_PRUAV   2.6e-63   51.85     Glucan endo-1,3-beta-glucosidase OS=Prunus avium PE=1 SV=1

vs. TrEMBL
Match Name        E-value   Identity  Description
A0A0A0LT35_CUCSA  3.3e-134  92.28     Protein P21 OS=Cucumis sativus GN=Csa_1G042760 PE=4 SV=1
B9T3K3_RICCO      1.6e-125  86.69     Protein P21, putative OS=Ricinus communis GN=RCOM_0337100 PE=4 SV=1
A0A061FUV3_THECC  3.7e-125  85.83     Pathogenesis-related thaumatin superfamily protein OS=Theobroma cacao GN=TCM_012...
A0A0B0MT31_GOSAR  5.3e-124  86.23     Uncharacterized protein OS=Gossypium arboreum GN=F383_27474 PE=4 SV=1
A0A068TSM5_COFCA  6.9e-124  84.68     Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00022420001 PE=4 SV=1

vs. TAIR10
Match Name   E-value   Identity  Description
AT1G18250.2  1.1e-112  79.41     Pathogenesis-related thaumatin superfamily protein
AT1G73620.1  1.0e-110  76.76     Pathogenesis-related thaumatin superfamily protein
AT1G75050.1  3.1e-75   59.64     Pathogenesis-related thaumatin superfamily protein
AT1G75030.1  7.7e-74   55.28     thaumatin-like protein 3
AT1G75040.1  7.2e-72   57.89     pathogenesis-related gene 5

vs. NCBI nr
Match Name                        E-value   Identity  Description
gi|449439451|ref|XP_004137499.1|  4.8e-134  92.28     PREDICTED: thaumatin-like protein [Cucumis sativus]
gi|659066840|ref|XP_008463807.1|  1.1e-133  91.94     PREDICTED: thaumatin-like protein [Cucumis melo]
gi|255584165|ref|XP_002532822.1|  2.4e-125  86.69     PREDICTED: thaumatin-like protein [Ricinus communis]
gi|590665636|ref|XP_007036794.1|  5.3e-125  85.83     Pathogenesis-related thaumatin superfamily protein [Theobroma cacao]
gi|728820711|gb|KHG03900.1|       7.6e-124  86.23     hypothetical protein F383_27474 [Gossypium arboreum]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR017949  Thaumatin_CS
IPR001938  Thaumatin
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g01490.1  Cp4.1LG20g01490.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description            Source   Source Term         Source Description                                  Alignment
IPR001938  Thaumatin                  PRINTS   PR00347             THAUMATIN                                           coord: 236..245, score: 2.5E-26
                                                                                                                       coord: 29..41, score: 2.5E-26
                                                                                                                       coord: 77..88, score: 2.5E-26
                                                                                                                       coord: 90..119, score: 2.5E-26
                                                                                                                       coord: 126..142, score: 2.5
IPR001938  Thaumatin                  GENE3D   G3DSA:2.60.110.10   -                                                   coord: 29..247, score: 3.2
IPR001938  Thaumatin                  PIR      PIRSF002703         PR5                                                 coord: 1..249, score: 5.8E
IPR001938  Thaumatin                  PANTHER  PTHR31048           FAMILY NOT NAMED                                    coord: 1..249, score: 4.4E
IPR001938  Thaumatin                  PFAM     PF00314             Thaumatin                                           coord: 34..246, score: 2.2
IPR001938  Thaumatin                  SMART    SM00205             tha2                                                coord: 30..246, score: 6.7E
IPR001938  Thaumatin                  PROFILE  PS51367             THAUMATIN_2                                         coord: 27..248, score: 47
IPR001938  Thaumatin                  unknown  SSF49870            Osmotin, thaumatin-like protein                     coord: 28..247, score: 1.96
IPR017949  Thaumatin, conserved site  PROSITE  PS00316             THAUMATIN_1                                         coord: 90..105
None       No IPR available           PANTHER  PTHR31048:SF1       PATHOGENESIS-RELATED THAUMATIN-LIKE PROTEIN-RELATED  coord: 1..249, score: 4.4E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG20g01490  Cp4.1LG09g11090  Cucurbita pepo (Zucchini)  cpecpeB049