ClCG07G004420 (gene) Watermelon (Charleston Gray)

Name: ClCG07G004420
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Pollen Ole e 1 allergen and extensin family protein LENGTH=183
Location: CG_Chr07 : 5522279 .. 5523347 (-)
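The location string above packs the chromosome, the coordinate range, and the strand into one field. A minimal sketch of splitting it apart follows; the `Location` type and `parse_location` function are hypothetical names introduced here for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed feature location."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


# Matches strings shaped like "CG_Chr07 : 5522279 .. 5523347 (-)".
_LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CG_Chr07 : 5522279 .. 5523347 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the span works out to 1069 bp on the minus strand.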
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCAACCAAAAACCTCCTCTCCCTCTCCCTTCTCTTCCTCCTCCTCCACATCGCCGCCTCCACCACCACCACCACTCGACCCGCCCTCAAACCCGTCTCTGTGGCGGTTGAGGGCGTGGTCTACTGCCAAAACTGCCAGAAAATCGGAACGTGGTCGCTGACAGGCGCGAAGCCAATCAGCGGAGCCAAAGTTAGTGTGATTTGCAAGAACCATAACAACCAAGTGAACTTTTACAAGGTGTATGAAACCAACAAAGATGGTTACTTTTATGCGGAGTTGGTCGGGTACAAAATGAACCACCCGGTTCTTGACCACCCGCTTCAAGCTTGTAAAGTCAAGCCTGTTTATTCCCCACTTTCAGACTGTAACCTCTTGACCAATCTCAACGATGGCCTCGTCGGAGCCCCACTCCGCTATGAGAAAAAGCTCATCGTTGGCCCCAATTATAGGGCTGCCGTCTTCGCCGCCGGGCCGCTGGCTTTCCACCCTGAGAAATGCCTCTAGANCCCCTTTGCCCCCATCTTCAAAAAAAAAAAAAAAAAAAAAAAAAACACTCAAAATGTCAACCAAAAACCTCCTCTCCCTCTCCCTTCTCTTCCTCCTCCTCCACATCGCCGCCTCCACCACCACCACCACTCGACCCGCCCTCAAACCCGTCTCTGTGGCGGTTGAGGGCGTGGTCTACTGCCAAAACTGCCAGAAAATCGGAACGTGGTCGCTGACAGGCGCGAAGCCAATCAGCGGAGCCAAAGTTAGTGTGATTTGCAAGAACCATAACAACCAAGTGAACTTTTACAAGGTGTATGAAACCAACAAAGATGGTTACTTTTATGCGGAGTTGGTCGGGTACAAAATGAACCACCCGGTTCTTGACCACCCGCTTCAAGCTTGTAAAGTCAAGCCTGTTTATTCCCCACTTTCAGACTGTAACCTCTTGACCAATCTCAACGATGGCCTCGTCGGAGCCCCACTCCGCTATGAGAAAAAGCTCATCGTTGGCCCCAATTATAGGGCTGCCGTCTTCGCCGCCGGGCCGCTGGCTTTCCACCCTGAGAAATGCCTCTAG

mRNA sequence

ATGTCAACCAAAAACCTCCTCTCCCTCTCCCTTCTCTTCCTCCTCCTCCACATCGCCGCCTCCACCACCACCACCACTCGACCCGCCCTCAAACCCGTCTCTGTGGCGGTTGAGGGCGTGGTCTACTGCCAAAACTGCCAGAAAATCGGAACGTGGTCGCTGACAGGCGCGAAGCCAATCAGCGGAGCCAAAGTTAGTGTGATTTGCAAGAACCATAACAACCAAGTGAACTTTTACAAGGTGTATGAAACCAACAAAGATGGTTACTTTTATGCGGAGTTGGTCGGGTACAAAATGAACCACCCGGTTCTTGACCACCCGCTTCAAGCTTGTAAAGTCAAGCCTGTTTATTCCCCACTTTCAGACTGTAACCTCTTGACCAATCTCAACGATGGCCTCGTCGGAGCCCCACTCCGCTATGAGAAAAAGCTCATCGTTGGCCCCAATTATAGGGCTGCCGGCGTGGTCTACTGCCAAAACTGCCAGAAAATCGGAACGTGGTCGCTGACAGGCGCGAAGCCAATCAGCGGAGCCAAAGTTAGTGTGATTTGCAAGAACCATAACAACCAAGTGAACTTTTACAAGGTGTATGAAACCAACAAAGATGGTTACTTTTATGCGGAGTTGGTCGGGTACAAAATGAACCACCCGGTTCTTGACCACCCGCTTCAAGCTTGTAAAGTCAAGCCTGTTTATTCCCCACTTTCAGACTGTAACCTCTTGACCAATCTCAACGATGGCCTCGTCGGAGCCCCACTCCGCTATGAGAAAAAGCTCATCGTTGGCCCCAATTATAGGGCTGCCGTCTTCGCCGCCGGGCCGCTGGCTTTCCACCCTGAGAAATGCCTCTAG

Coding sequence (CDS)

ATGTCAACCAAAAACCTCCTCTCCCTCTCCCTTCTCTTCCTCCTCCTCCACATCGCCGCCTCCACCACCACCACCACTCGACCCGCCCTCAAACCCGTCTCTGTGGCGGTTGAGGGCGTGGTCTACTGCCAAAACTGCCAGAAAATCGGAACGTGGTCGCTGACAGGCGCGAAGCCAATCAGCGGAGCCAAAGTTAGTGTGATTTGCAAGAACCATAACAACCAAGTGAACTTTTACAAGGTGTATGAAACCAACAAAGATGGTTACTTTTATGCGGAGTTGGTCGGGTACAAAATGAACCACCCGGTTCTTGACCACCCGCTTCAAGCTTGTAAAGTCAAGCCTGTTTATTCCCCACTTTCAGACTGTAACCTCTTGACCAATCTCAACGATGGCCTCGTCGGAGCCCCACTCCGCTATGAGAAAAAGCTCATCGTTGGCCCCAATTATAGGGCTGCCGGCGTGGTCTACTGCCAAAACTGCCAGAAAATCGGAACGTGGTCGCTGACAGGCGCGAAGCCAATCAGCGGAGCCAAAGTTAGTGTGATTTGCAAGAACCATAACAACCAAGTGAACTTTTACAAGGTGTATGAAACCAACAAAGATGGTTACTTTTATGCGGAGTTGGTCGGGTACAAAATGAACCACCCGGTTCTTGACCACCCGCTTCAAGCTTGTAAAGTCAAGCCTGTTTATTCCCCACTTTCAGACTGTAACCTCTTGACCAATCTCAACGATGGCCTCGTCGGAGCCCCACTCCGCTATGAGAAAAAGCTCATCGTTGGCCCCAATTATAGGGCTGCCGTCTTCGCCGCCGGGCCGCTGGCTTTCCACCCTGAGAAATGCCTCTAG

Protein sequence

MSTKNLLSLSLLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAGPLAFHPEKCL
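The protein sequence is the conceptual translation of the coding sequence. As a rough illustration of that step, the sketch below translates a DNA string with the standard genetic code; the `translate` helper and its `to_stop` parameter are hypothetical names chosen here, and the page does not say which tool the database actually used. Codons containing ambiguous bases such as `N` are reported as `X`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence codon by codon.

    Unknown codons (for example those containing N) become 'X'.
    If to_stop is True, translation ends at the first stop codon.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore a trailing partial codon, if any.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGTCAACCAAAAACCTCCTC"))  # prints MSTKNLL
```

The output matches the first seven residues of the protein sequence shown above.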
BLAST of ClCG07G004420 vs. Swiss-Prot
Match: AGP30_ARATH (Non-classical arabinogalactan protein 30 OS=Arabidopsis thaliana GN=AGP30 PE=2 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 2.6e-11
Identity = 48/151 (31.79%), Positives = 78/151 (51.66%), Query Frame = 1

Query: 137 PLRYEKKLIVGPNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKV 196
           P +Y K L+        GVVYC+ C+  G  ++ GAKP+  A V ++CKN  N ++  K 
Sbjct: 102 PPKYNKTLVA-----VRGVVYCKACKYAGVNNVQGAKPVKDAVVRLVCKNKKNSISETK- 161

Query: 197 YETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYE 256
             T+K+GYF   L+  K    V ++ ++ C+   V SP + C+ +++L+DG  G+ L   
Sbjct: 162 --TDKNGYFM--LLAPK---TVTNYDIKGCRAFLVKSPDTKCSKVSSLHDGGKGSVL--- 221

Query: 257 KKLIVGPN--------YRAAVFAAGPLAFHP 280
            K ++ P         ++ +V+  GP AF P
Sbjct: 222 -KPVLKPGFSSTIMRWFKYSVYNVGPFAFEP 235


HSP 2 Score: 68.2 bits (165), Expect = 1.7e-10
Identity = 40/112 (35.71%), Positives = 63/112 (56.25%), Query Frame = 1

Query: 28  PALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKD 87
           P      VAV GVVYC+ C+  G  ++ GAKP+  A V ++CKN  N ++  K   T+K+
Sbjct: 103 PKYNKTLVAVRGVVYCKACKYAGVNNVQGAKPVKDAVVRLVCKNKKNSISETK---TDKN 162

Query: 88  GYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLR 140
           GYF   L+  K    V ++ ++ C+   V SP + C+ +++L+DG  G+ L+
Sbjct: 163 GYFM--LLAPK---TVTNYDIKGCRAFLVKSPDTKCSKVSSLHDGGKGSVLK 206
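Each HSP in these reports carries a statistics line of the form `Identity = a/n (p%), ... = b/n (q%)`, where the percentages are the counts divided by the alignment length. The sketch below pulls those numbers out and recomputes the percentages as a sanity check; `parse_hsp_stats` is a hypothetical helper name, and the pattern accepts the label spelled either `Positives` or `Postives`.

```python
import re

# Accepts "Positives" as well as the "Postives" spelling.
_HSP_STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute their percentages."""
    match = _HSP_STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Recomputed values, rounded to the two decimals used in the report.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positive_pct": round(100 * pos / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 48/151 (31.79%), Postives = 78/151 (51.66%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the first AGP30_ARATH HSP, 48/151 and 78/151 recompute to the same 31.79% and 51.66% that the report lists.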

BLAST of ClCG07G004420 vs. Swiss-Prot
Match: AGP31_ARATH (Non-classical arabinogalactan protein 31 OS=Arabidopsis thaliana GN=AGP31 PE=1 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 1.1e-09
Identity = 49/147 (33.33%), Positives = 72/147 (48.98%), Query Frame = 1

Query: 137 PLRYEKKLIVGPNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKV 196
           P ++ + L+        G VYC++C+     +L GAKPI GA V ++CK   ++ N    
Sbjct: 222 PPKFNRSLVA-----VRGTVYCKSCKYAAFNTLLGAKPIEGATVKLVCK---SKKNITAE 281

Query: 197 YETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYE 256
             T+K+GYF   L+  K    V +   + C+V  V S    C+ ++ L  G VGA L+ E
Sbjct: 282 TTTDKNGYFL--LLAPK---TVTNFGFRGCRVYLVKSKDYKCSKVSKLFGGDVGAELKPE 341

Query: 257 KKL----IVGPNYRAAVFAAGPLAFHP 280
           KKL    +V       +F  GP AF+P
Sbjct: 342 KKLGKSTVVVNKLVYGLFNVGPFAFNP 355


HSP 2 Score: 63.2 bits (152), Expect = 5.4e-09
Identity = 43/110 (39.09%), Positives = 59/110 (53.64%), Query Frame = 1

Query: 35  VAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAEL 94
           VAV G VYC++C+     +L GAKPI GA V ++CK   ++ N      T+K+GYF   L
Sbjct: 230 VAVRGTVYCKSCKYAAFNTLLGAKPIEGATVKLVCK---SKKNITAETTTDKNGYFL--L 289

Query: 95  VGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKL 145
           +  K    V +   + C+V  V S    C+ ++ L  G VGA L+ EKKL
Sbjct: 290 LAPK---TVTNFGFRGCRVYLVKSKDYKCSKVSKLFGGDVGAELKPEKKL 331

BLAST of ClCG07G004420 vs. Swiss-Prot
Match: PRP3_ARATH (Proline-rich protein 3 OS=Arabidopsis thaliana GN=PRP3 PE=2 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 3.5e-08
Identity = 39/122 (31.97%), Positives = 55/122 (45.08%), Query Frame = 1

Query: 23  TTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKN------HNNQV 82
           T TT+P +  +  AV+G++ C+N  +          PI GAK+ ++C +       N +V
Sbjct: 174 TPTTKPYVPEILKAVDGIILCKNGYE--------TYPILGAKIQIVCSDPASYGKSNTEV 233

Query: 83  NFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGA 139
             Y    T+  GYF+  L   K         L  C+VK   SP+  C   TN+N GL G 
Sbjct: 234 VIYS-NPTDSKGYFHLSLTSIK--------DLAYCRVKLYLSPVETCKNPTNVNKGLTGV 278


HSP 2 Score: 50.8 bits (120), Expect = 2.8e-05
Identity = 38/131 (29.01%), Positives = 55/131 (41.98%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKN------HNNQVNFYKVYETNKDGYFYA 213
           G++ C+N  +          PI GAK+ ++C +       N +V  Y    T+  GYF+ 
Sbjct: 190 GIILCKNGYE--------TYPILGAKIQIVCSDPASYGKSNTEVVIYS-NPTDSKGYFHL 249

Query: 214 ELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLR-YEKKLIVGPNYR 273
            L   K         L  C+VK   SP+  C   TN+N GL G PL  Y  +    P+  
Sbjct: 250 SLTSIK--------DLAYCRVKLYLSPVETCKNPTNVNKGLTGVPLALYGYRFY--PDKN 301

Query: 274 AAVFAAGPLAF 278
             +F+ GP  +
Sbjct: 310 LELFSVGPFYY 301

BLAST of ClCG07G004420 vs. Swiss-Prot
Match: PRP1_ARATH (Proline-rich protein 1 OS=Arabidopsis thaliana GN=PRP1 PE=2 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 8.7e-07
Identity = 45/141 (31.91%), Positives = 62/141 (43.97%), Query Frame = 1

Query: 23  TTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVY 82
           T  T+P +  +  AV G++ C+N  +          PI GAK  ++C    +       Y
Sbjct: 200 TPPTKPYVPEIIKAVGGIILCKNGYE--------TYPIQGAKAKIVCSERGS-------Y 259

Query: 83  ETNKDGY-FYAELVGYK-MNHPVLDH--PLQACKVKPVYSPLSDCNLLTNLNDGLVGAPL 142
           E +K+    Y++   +K   H VL H   L  C+VK   SP+  C   TN+N GL G P 
Sbjct: 260 EKSKNEVVIYSDPTDFKGYFHVVLTHIKNLSNCRVKLYTSPVETCKNPTNVNKGLTGVPF 319

Query: 143 -RYEKKLI----VGPNYRAAG 155
             Y  K +    VGP Y  AG
Sbjct: 320 SMYSDKNLKLFNVGPFYFTAG 325


HSP 2 Score: 48.1 bits (113), Expect = 1.8e-04
Identity = 41/130 (31.54%), Positives = 55/130 (42.31%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGY-FYAELVGY 213
           G++ C+N  +          PI GAK  ++C    +       YE +K+    Y++   +
Sbjct: 216 GIILCKNGYE--------TYPIQGAKAKIVCSERGS-------YEKSKNEVVIYSDPTDF 275

Query: 214 K-MNHPVLDH--PLQACKVKPVYSPLSDCNLLTNLNDGLVGAPL-RYEKKLI----VGPN 273
           K   H VL H   L  C+VK   SP+  C   TN+N GL G P   Y  K +    VGP 
Sbjct: 276 KGYFHVVLTHIKNLSNCRVKLYTSPVETCKNPTNVNKGLTGVPFSMYSDKNLKLFNVGPF 330

Query: 274 YRAAVFAAGP 275
           Y  A   A P
Sbjct: 336 YFTAGSKAAP 330

BLAST of ClCG07G004420 vs. TrEMBL
Match: A0A0A0KP60_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G211030 PE=4 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 1.8e-64
Identity = 123/153 (80.39%), Positives = 136/153 (88.89%), Query Frame = 1

Query: 1   MSTKNLLSLSLLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPI 60
           MSTKNLLSLS L LLLHIA++     R   KP+S+A+EG+VYCQNC+KIGTWSLT AKPI
Sbjct: 1   MSTKNLLSLSFLLLLLHIASADPV--RLPHKPLSMAIEGLVYCQNCKKIGTWSLTEAKPI 60

Query: 61  SGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPL 120
           SGAK+SVICKNHN+QV FYKVY+TNKDGYFYAELVGY+MNHPVLDHPLQACKVKPV SPL
Sbjct: 61  SGAKISVICKNHNDQVKFYKVYQTNKDGYFYAELVGYQMNHPVLDHPLQACKVKPVSSPL 120

Query: 121 SDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAA 154
           SDCNLLTNLN GL GAPLR+EKK +VG NYRAA
Sbjct: 121 SDCNLLTNLNYGLTGAPLRFEKKFVVGTNYRAA 151

BLAST of ClCG07G004420 vs. TrEMBL
Match: A0A0A0KP60_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G211030 PE=4 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 1.0e-62
Identity = 113/130 (86.92%), Positives = 123/130 (94.62%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYK 213
           G+VYCQNC+KIGTWSLT AKPISGAK+SVICKNHN+QV FYKVY+TNKDGYFYAELVGY+
Sbjct: 37  GLVYCQNCKKIGTWSLTEAKPISGAKISVICKNHNDQVKFYKVYQTNKDGYFYAELVGYQ 96

Query: 214 MNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAG 273
           MNHPVLDHPLQACKVKPV SPLSDCNLLTNLN GL GAPLR+EKK +VG NYRAAV+AAG
Sbjct: 97  MNHPVLDHPLQACKVKPVSSPLSDCNLLTNLNYGLTGAPLRFEKKFVVGTNYRAAVYAAG 156

Query: 274 PLAFHPEKCL 284
           PLAFHP+KCL
Sbjct: 157 PLAFHPQKCL 166


HSP 2 Score: 185.7 bits (470), Expect = 8.1e-44
Identity = 83/129 (64.34%), Positives = 103/129 (79.84%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYK 213
           GVVYCQ+C + GTWSLT A+PI  AKVSVICKNH  QV+FYK +E++  GYFYAEL GYK
Sbjct: 44  GVVYCQSCDRFGTWSLTDAEPIPSAKVSVICKNHRGQVSFYKAFESDSKGYFYAELEGYK 103

Query: 214 MNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAG 273
           ++H +LDHPLQACKVK V SP ++C+LL+N+N G+ G+PLRYEKK +   NY A ++AAG
Sbjct: 104 ISHVLLDHPLQACKVKLVSSPNANCSLLSNVNYGMYGSPLRYEKKSLRSKNYEAVIYAAG 163

Query: 274 PLAFHPEKC 283
           PLAF P  C
Sbjct: 164 PLAFRPNHC 172

BLAST of ClCG07G004420 vs. TrEMBL
Match: W9QNK7_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_001723 PE=4 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 2.6e-42
Identity = 91/152 (59.87%), Positives = 112/152 (73.68%), Query Frame = 1

Query: 6   LLSLSLLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAKV 65
           LL L L F    IAA+    T+PA   V V VEGVVYCQ+C + GTWSLT A+PI  AKV
Sbjct: 13  LLLLPLAFP--SIAANEDVPTKPAENKVDVVVEGVVYCQSCDRFGTWSLTDAEPIPSAKV 72

Query: 66  SVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNL 125
           SVICKNH  QV+FYK +E++  GYFYAEL GYK++H +LDHPLQACKVK V SP ++C+L
Sbjct: 73  SVICKNHRGQVSFYKAFESDSKGYFYAELEGYKISHVLLDHPLQACKVKLVSSPNANCSL 132

Query: 126 LTNLNDGLVGAPLRYEKKLIVGPNYRAAGVVY 158
           L+N+N G+ G+PLRYEKK +   NY A  V+Y
Sbjct: 133 LSNVNYGMYGSPLRYEKKSLRSKNYEA--VIY 160


HSP 2 Score: 184.1 bits (466), Expect = 2.3e-43
Identity = 81/129 (62.79%), Positives = 104/129 (80.62%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYK 213
           G+VYCQ+C + G+WSLTGA+PI  AKVSVICK+H +QV+FYK +ET+ +GYFYAEL G+K
Sbjct: 44  GMVYCQSCDQFGSWSLTGAEPIPAAKVSVICKDHKDQVSFYKAFETDGNGYFYAELDGFK 103

Query: 214 MNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAG 273
           M+H +LDHPL +C VK V SPL +CNLL+N+N GL G+PLRYE K + G +Y A ++AAG
Sbjct: 104 MSHNILDHPLHSCHVKLVSSPLENCNLLSNVNYGLYGSPLRYENKRLFGKHYEAVIYAAG 163

Query: 274 PLAFHPEKC 283
           PLAF P  C
Sbjct: 164 PLAFRPAHC 172

BLAST of ClCG07G004420 vs. TrEMBL
Match: D7T4L1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0067g01540 PE=4 SV=1)

HSP 1 Score: 171.4 bits (433), Expect = 1.6e-39
Identity = 85/151 (56.29%), Positives = 110/151 (72.85%), Query Frame = 1

Query: 7   LSLSLLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAKVS 66
           L LSL F  + +A      T    K  +V VEG+VYCQ+C + G+WSLTGA+PI  AKVS
Sbjct: 14  LLLSLSFPSVTMAEEIPKKTEE--KTAAVVVEGMVYCQSCDQFGSWSLTGAEPIPAAKVS 73

Query: 67  VICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLL 126
           VICK+H +QV+FYK +ET+ +GYFYAEL G+KM+H +LDHPL +C VK V SPL +CNLL
Sbjct: 74  VICKDHKDQVSFYKAFETDGNGYFYAELDGFKMSHNILDHPLHSCHVKLVSSPLENCNLL 133

Query: 127 TNLNDGLVGAPLRYEKKLIVGPNYRAAGVVY 158
           +N+N GL G+PLRYE K + G +Y A  V+Y
Sbjct: 134 SNVNYGLYGSPLRYENKRLFGKHYEA--VIY 160


HSP 2 Score: 183.0 bits (463), Expect = 5.2e-43
Identity = 86/146 (58.90%), Positives = 104/146 (71.23%), Query Frame = 1

Query: 137 PLRYEKKLIVGPNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKV 196
           P   EKK+ V         VYCQ+C   GTWSL GAKPI  AKVSV CK+HN  V++YKV
Sbjct: 28  PKTTEKKIEVV----VEATVYCQSCDHFGTWSLIGAKPIPSAKVSVTCKSHNGHVSYYKV 87

Query: 197 YETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYE 256
           +ET+KDGY YA L G+KM H +LDHPL +C VKPV+SPL  C+LL+N+N GL GAPLRYE
Sbjct: 88  FETDKDGYLYAPLEGFKMQHYILDHPLHSCYVKPVWSPLESCSLLSNVNYGLNGAPLRYE 147

Query: 257 KKLIVGPNYRAAVFAAGPLAFHPEKC 283
            K + G  Y A ++AAGPLAF P +C
Sbjct: 148 NKKLHGSKYEAVIYAAGPLAFRPSEC 169

BLAST of ClCG07G004420 vs. TrEMBL
Match: I1LMT9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_11G232100 PE=4 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 2.4e-40
Identity = 83/153 (54.25%), Positives = 104/153 (67.97%), Query Frame = 1

Query: 5   NLLSLSLLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAK 64
           ++LSL LL +      +     +   K + V VE  VYCQ+C   GTWSL GAKPI  AK
Sbjct: 7   SILSLLLLLVAFSCTEANADVPKTTEKKIEVVVEATVYCQSCDHFGTWSLIGAKPIPSAK 66

Query: 65  VSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCN 124
           VSV CK+HN  V++YKV+ET+KDGY YA L G+KM H +LDHPL +C VKPV+SPL  C+
Sbjct: 67  VSVTCKSHNGHVSYYKVFETDKDGYLYAPLEGFKMQHYILDHPLHSCYVKPVWSPLESCS 126

Query: 125 LLTNLNDGLVGAPLRYEKKLIVGPNYRAAGVVY 158
           LL+N+N GL GAPLRYE K + G  Y A  V+Y
Sbjct: 127 LLSNVNYGLNGAPLRYENKKLHGSKYEA--VIY 157


HSP 2 Score: 182.6 bits (462), Expect = 6.8e-43
Identity = 81/127 (63.78%), Positives = 98/127 (77.17%), Query Frame = 1

Query: 156 VYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMN 215
           VYCQ+C   GTWSL GAKPI  AKVSV CK+HN  V++YKV+ET+KDGY YA L G+KM 
Sbjct: 35  VYCQSCDHFGTWSLIGAKPIPSAKVSVTCKSHNGHVSYYKVFETDKDGYLYAPLEGFKMQ 94

Query: 216 HPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAGPL 275
           H +LDHPL +C VKPV+SPL  C+LL+N+N GL GAPLRYE K + G  Y A ++AAGPL
Sbjct: 95  HYILDHPLHSCYVKPVWSPLESCSLLSNVNYGLNGAPLRYENKKLHGSKYEAVIYAAGPL 154

Query: 276 AFHPEKC 283
           AF P +C
Sbjct: 155 AFRPSEC 161

BLAST of ClCG07G004420 vs. TAIR10
Match: AT5G05500.1 (AT5G05500.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 172.2 bits (435), Expect = 4.7e-43
Identity = 75/129 (58.14%), Positives = 96/129 (74.42%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYK 213
           G+VYCQ+C K G+WSL GA+ I+GAK+S+ICKNH  QV+FYKV+ T+  G+FY EL G+K
Sbjct: 47  GMVYCQSCDKFGSWSLAGAEAIAGAKISIICKNHRQQVSFYKVFRTDSYGHFYGELKGFK 106

Query: 214 MNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAG 273
           M    LDHPL +C+ K V SP  DCNL +N+N+ L GAPLRYE+K +   NY A ++AAG
Sbjct: 107 MTPHFLDHPLHSCRAKLVSSPREDCNLFSNINNALDGAPLRYEEKRLKWTNYEAVIYAAG 166

Query: 274 PLAFHPEKC 283
           PLAF P+ C
Sbjct: 167 PLAFRPDHC 175


HSP 2 Score: 166.0 bits (419), Expect = 3.3e-41
Identity = 77/134 (57.46%), Positives = 97/134 (72.39%), Query Frame = 1

Query: 24  TTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYE 83
           TT  PA+K V  AVEG+VYCQ+C K G+WSL GA+ I+GAK+S+ICKNH  QV+FYKV+ 
Sbjct: 32  TTAYPAVKTVEAAVEGMVYCQSCDKFGSWSLAGAEAIAGAKISIICKNHRQQVSFYKVFR 91

Query: 84  TNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKK 143
           T+  G+FY EL G+KM    LDHPL +C+ K V SP  DCNL +N+N+ L GAPLRYE+K
Sbjct: 92  TDSYGHFYGELKGFKMTPHFLDHPLHSCRAKLVSSPREDCNLFSNINNALDGAPLRYEEK 151

Query: 144 LIVGPNYRAAGVVY 158
            +   NY A  V+Y
Sbjct: 152 RLKWTNYEA--VIY 163

BLAST of ClCG07G004420 vs. TAIR10
Match: AT2G33790.1 (AT2G33790.1 arabinogalactan protein 30)

HSP 1 Score: 70.9 bits (172), Expect = 1.5e-12
Identity = 48/151 (31.79%), Positives = 78/151 (51.66%), Query Frame = 1

Query: 137 PLRYEKKLIVGPNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKV 196
           P +Y K L+        GVVYC+ C+  G  ++ GAKP+  A V ++CKN  N ++  K 
Sbjct: 102 PPKYNKTLVA-----VRGVVYCKACKYAGVNNVQGAKPVKDAVVRLVCKNKKNSISETK- 161

Query: 197 YETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYE 256
             T+K+GYF   L+  K    V ++ ++ C+   V SP + C+ +++L+DG  G+ L   
Sbjct: 162 --TDKNGYFM--LLAPK---TVTNYDIKGCRAFLVKSPDTKCSKVSSLHDGGKGSVL--- 221

Query: 257 KKLIVGPN--------YRAAVFAAGPLAFHP 280
            K ++ P         ++ +V+  GP AF P
Sbjct: 222 -KPVLKPGFSSTIMRWFKYSVYNVGPFAFEP 235


HSP 2 Score: 68.2 bits (165), Expect = 9.5e-12
Identity = 40/112 (35.71%), Positives = 63/112 (56.25%), Query Frame = 1

Query: 28  PALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKD 87
           P      VAV GVVYC+ C+  G  ++ GAKP+  A V ++CKN  N ++  K   T+K+
Sbjct: 103 PKYNKTLVAVRGVVYCKACKYAGVNNVQGAKPVKDAVVRLVCKNKKNSISETK---TDKN 162

Query: 88  GYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLR 140
           GYF   L+  K    V ++ ++ C+   V SP + C+ +++L+DG  G+ L+
Sbjct: 163 GYFM--LLAPK---TVTNYDIKGCRAFLVKSPDTKCSKVSSLHDGGKGSVLK 206

BLAST of ClCG07G004420 vs. TAIR10
Match: AT1G28290.1 (AT1G28290.1 arabinogalactan protein 31)

HSP 1 Score: 65.5 bits (158), Expect = 6.2e-11
Identity = 49/147 (33.33%), Positives = 72/147 (48.98%), Query Frame = 1

Query: 137 PLRYEKKLIVGPNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKV 196
           P ++ + L+        G VYC++C+     +L GAKPI GA V ++CK   ++ N    
Sbjct: 222 PPKFNRSLVA-----VRGTVYCKSCKYAAFNTLLGAKPIEGATVKLVCK---SKKNITAE 281

Query: 197 YETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYE 256
             T+K+GYF   L+  K    V +   + C+V  V S    C+ ++ L  G VGA L+ E
Sbjct: 282 TTTDKNGYFL--LLAPK---TVTNFGFRGCRVYLVKSKDYKCSKVSKLFGGDVGAELKPE 341

Query: 257 KKL----IVGPNYRAAVFAAGPLAFHP 280
           KKL    +V       +F  GP AF+P
Sbjct: 342 KKLGKSTVVVNKLVYGLFNVGPFAFNP 355


HSP 2 Score: 63.2 bits (152), Expect = 3.1e-10
Identity = 43/110 (39.09%), Positives = 59/110 (53.64%), Query Frame = 1

Query: 35  VAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAEL 94
           VAV G VYC++C+     +L GAKPI GA V ++CK   ++ N      T+K+GYF   L
Sbjct: 230 VAVRGTVYCKSCKYAAFNTLLGAKPIEGATVKLVCK---SKKNITAETTTDKNGYFL--L 289

Query: 95  VGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKL 145
           +  K    V +   + C+V  V S    C+ ++ L  G VGA L+ EKKL
Sbjct: 290 LAPK---TVTNFGFRGCRVYLVKSKDYKCSKVSKLFGGDVGAELKPEKKL 331

BLAST of ClCG07G004420 vs. TAIR10
Match: AT3G09925.1 (AT3G09925.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 64.7 bits (156), Expect = 1.0e-10
Identity = 50/155 (32.26%), Positives = 73/155 (47.10%), Query Frame = 1

Query: 3   TKNLLSLSLLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPISG 62
           T  +L + +LFL+  +   TTT        + VA  G V CQ+C       + G++PI G
Sbjct: 6   TTLVLLIEILFLVSCVTHITTTAAYNNGDKIHVA--GKVMCQDCSLNYDEWINGSEPIKG 65

Query: 63  AKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSD 122
           A VS+ C +   +V++Y    T++ G F   +V   ++H  +  P Q C V+ V SP   
Sbjct: 66  AVVSITCMDERRRVSYYGSDLTDERGQFDL-MVNKVLSHGKVLKP-QLCNVRLVSSPDLS 125

Query: 123 CNLLTNLNDGLVGAPL-------RYEKKLIVGPNY 151
           CN+ TN  +G  G  L       R   K +VGP Y
Sbjct: 126 CNIPTNFGNGQTGVKLVRPFTVFRDLVKYVVGPFY 156


HSP 2 Score: 61.2 bits (147), Expect = 1.2e-09
Identity = 42/126 (33.33%), Positives = 59/126 (46.83%), Query Frame = 1

Query: 147 GPNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFY 206
           G     AG V CQ+C       + G++PI GA VS+ C +   +V++Y    T++ G F 
Sbjct: 33  GDKIHVAGKVMCQDCSLNYDEWINGSEPIKGAVVSITCMDERRRVSYYGSDLTDERGQFD 92

Query: 207 AELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPL-------RYEKKL 266
             +V   ++H  +  P Q C V+ V SP   CN+ TN  +G  G  L       R   K 
Sbjct: 93  L-MVNKVLSHGKVLKP-QLCNVRLVSSPDLSCNIPTNFGNGQTGVKLVRPFTVFRDLVKY 152

BLAST of ClCG07G004420 vs. TAIR10
Match: AT2G47530.1 (AT2G47530.1 Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 60.8 bits (146), Expect = 1.5e-09
Identity = 43/133 (32.33%), Positives = 62/133 (46.62%), Query Frame = 1

Query: 17  HIAASTTTTTRPALKPV------SVAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICK 76
           ++   TTT T P   P        +A+EG + C++  K          PI G KV V+C 
Sbjct: 31  YVPKPTTTYTSPVKTPYLPKSNPDIAIEGFILCKSGYK--------TYPIQGGKVKVVCP 90

Query: 77  ---NHNNQVNFYKV--YETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNL 136
              ++   V    +  Y T+  GYFY   + Y ++H V  + + +CKVK   SP+  C  
Sbjct: 91  VVDSYGKLVAKVTISSYPTDLKGYFY--FITYGLSHKV--NNISSCKVKLESSPVFTCKT 150

Query: 137 LTNLNDGLVGAPL 139
            TN+N G+ GAPL
Sbjct: 151 PTNVNKGVTGAPL 151


HSP 2 Score: 52.0 bits (123), Expect = 7.0e-07
Identity = 36/111 (32.43%), Positives = 52/111 (46.85%), Query Frame = 1

Query: 148 PNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICK---NHNNQVNFYKV--YETNKD 207
           P+    G + C++  K          PI G KV V+C    ++   V    +  Y T+  
Sbjct: 53  PDIAIEGFILCKSGYK--------TYPIQGGKVKVVCPVVDSYGKLVAKVTISSYPTDLK 112

Query: 208 GYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPL 254
           GYFY   + Y ++H V  + + +CKVK   SP+  C   TN+N G+ GAPL
Sbjct: 113 GYFY--FITYGLSHKV--NNISSCKVKLESSPVFTCKTPTNVNKGVTGAPL 151

BLAST of ClCG07G004420 vs. NCBI nr
Match: gi|659109661|ref|XP_008454812.1| (PREDICTED: uncharacterized protein LOC103495126 [Cucumis melo])

HSP 1 Score: 256.5 bits (654), Expect = 5.3e-65
Identity = 126/154 (81.82%), Positives = 138/154 (89.61%), Query Frame = 1

Query: 1   MSTKNLLSLS-LLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKP 60
           MSTKNLLSLS LL LLLHIA++     RP  KP+SVAVEG+VYCQNC+K+GTWSLT AKP
Sbjct: 1   MSTKNLLSLSFLLLLLLHIASANPV--RPPHKPLSVAVEGLVYCQNCKKVGTWSLTEAKP 60

Query: 61  ISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSP 120
           ISGAK+SVICKNHN+QV FYKVY+TNKDGYFYAELVGY+MNHPVLDHPLQACKVKPV SP
Sbjct: 61  ISGAKISVICKNHNDQVKFYKVYQTNKDGYFYAELVGYQMNHPVLDHPLQACKVKPVSSP 120

Query: 121 LSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAA 154
           LSDCNLLTNLN GL GAPLRYEKK ++G NYRAA
Sbjct: 121 LSDCNLLTNLNYGLTGAPLRYEKKFVLGHNYRAA 152

BLAST of ClCG07G004420 vs. NCBI nr
Match: gi|659109661|ref|XP_008454812.1| (PREDICTED: uncharacterized protein LOC103495126 [Cucumis melo])

HSP 1 Score: 248.1 bits (632), Expect = 1.9e-62
Identity = 112/130 (86.15%), Positives = 123/130 (94.62%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYK 213
           G+VYCQNC+K+GTWSLT AKPISGAK+SVICKNHN+QV FYKVY+TNKDGYFYAELVGY+
Sbjct: 38  GLVYCQNCKKVGTWSLTEAKPISGAKISVICKNHNDQVKFYKVYQTNKDGYFYAELVGYQ 97

Query: 214 MNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAG 273
           MNHPVLDHPLQACKVKPV SPLSDCNLLTNLN GL GAPLRYEKK ++G NYRAAV+AAG
Sbjct: 98  MNHPVLDHPLQACKVKPVSSPLSDCNLLTNLNYGLTGAPLRYEKKFVLGHNYRAAVYAAG 157

Query: 274 PLAFHPEKCL 284
           PLAFHP+KCL
Sbjct: 158 PLAFHPQKCL 167


HSP 2 Score: 254.2 bits (648), Expect = 2.6e-64
Identity = 123/153 (80.39%), Positives = 136/153 (88.89%), Query Frame = 1

Query: 1   MSTKNLLSLSLLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPI 60
           MSTKNLLSLS L LLLHIA++     R   KP+S+A+EG+VYCQNC+KIGTWSLT AKPI
Sbjct: 1   MSTKNLLSLSFLLLLLHIASADPV--RLPHKPLSMAIEGLVYCQNCKKIGTWSLTEAKPI 60

Query: 61  SGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPL 120
           SGAK+SVICKNHN+QV FYKVY+TNKDGYFYAELVGY+MNHPVLDHPLQACKVKPV SPL
Sbjct: 61  SGAKISVICKNHNDQVKFYKVYQTNKDGYFYAELVGYQMNHPVLDHPLQACKVKPVSSPL 120

Query: 121 SDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAA 154
           SDCNLLTNLN GL GAPLR+EKK +VG NYRAA
Sbjct: 121 SDCNLLTNLNYGLTGAPLRFEKKFVVGTNYRAA 151

BLAST of ClCG07G004420 vs. NCBI nr
Match: gi|700195502|gb|KGN50679.1| (hypothetical protein Csa_5G211030 [Cucumis sativus])

HSP 1 Score: 248.4 bits (633), Expect = 1.5e-62
Identity = 113/130 (86.92%), Positives = 123/130 (94.62%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYK 213
           G+VYCQNC+KIGTWSLT AKPISGAK+SVICKNHN+QV FYKVY+TNKDGYFYAELVGY+
Sbjct: 37  GLVYCQNCKKIGTWSLTEAKPISGAKISVICKNHNDQVKFYKVYQTNKDGYFYAELVGYQ 96

Query: 214 MNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAG 273
           MNHPVLDHPLQACKVKPV SPLSDCNLLTNLN GL GAPLR+EKK +VG NYRAAV+AAG
Sbjct: 97  MNHPVLDHPLQACKVKPVSSPLSDCNLLTNLNYGLTGAPLRFEKKFVVGTNYRAAVYAAG 156

Query: 274 PLAFHPEKCL 284
           PLAFHP+KCL
Sbjct: 157 PLAFHPQKCL 166


HSP 2 Score: 248.4 bits (633), Expect = 1.5e-62
Identity = 113/130 (86.92%), Positives = 123/130 (94.62%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYK 213
           G+VYCQNC+KIGTWSLT AKPISGAK+SVICKNHN+QV FYKVY+TNKDGYFYAELVGY+
Sbjct: 5   GLVYCQNCKKIGTWSLTEAKPISGAKISVICKNHNDQVKFYKVYQTNKDGYFYAELVGYQ 64

Query: 214 MNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAG 273
           MNHPVLDHPLQACKVKPV SPLSDCNLLTNLN GL GAPLR+EKK +VG NYRAAV+AAG
Sbjct: 65  MNHPVLDHPLQACKVKPVSSPLSDCNLLTNLNYGLTGAPLRFEKKFVVGTNYRAAVYAAG 124

Query: 274 PLAFHPEKCL 284
           PLAFHP+KCL
Sbjct: 125 PLAFHPQKCL 134

BLAST of ClCG07G004420 vs. NCBI nr
Match: gi|778701337|ref|XP_004148198.2| (PREDICTED: proline-rich protein 3 [Cucumis sativus])

HSP 1 Score: 223.8 bits (569), Expect = 3.8e-55
Identity = 102/119 (85.71%), Positives = 112/119 (94.12%), Query Frame = 1

Query: 35  VAVEGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAEL 94
           +A+EG+VYCQNC+KIGTWSLT AKPISGAK+SVICKNHN+QV FYKVY+TNKDGYFYAEL
Sbjct: 1   MAIEGLVYCQNCKKIGTWSLTEAKPISGAKISVICKNHNDQVKFYKVYQTNKDGYFYAEL 60

Query: 95  VGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAA 154
           VGY+MNHPVLDHPLQACKVKPV SPLSDCNLLTNLN GL GAPLR+EKK +VG NYRAA
Sbjct: 61  VGYQMNHPVLDHPLQACKVKPVSSPLSDCNLLTNLNYGLTGAPLRFEKKFVVGTNYRAA 119


HSP 2 Score: 194.9 bits (494), Expect = 1.9e-46
Identity = 92/143 (64.34%), Positives = 110/143 (76.92%), Query Frame = 1

Query: 140 YEKKLIVGPNYRAAGVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYET 199
           Y+KK+ V       G+VYCQ+C   GTWS+TGAKPI  AKVSVICKNH +QV+FYK ++T
Sbjct: 29  YQKKIDVV----VEGMVYCQSCDHYGTWSMTGAKPIPSAKVSVICKNHKDQVSFYKAFQT 88

Query: 200 NKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKL 259
           N DGYFYA L G+KMNH +LDHPLQAC VKPV SPL DC  L+N+N GL GAPLRYE K 
Sbjct: 89  NADGYFYAPLDGFKMNH-MLDHPLQACHVKPVSSPLEDCRFLSNVNYGLNGAPLRYEDKR 148

Query: 260 IVGPNYRAAVFAAGPLAFHPEKC 283
           ++G NY A V++AGPLAF P+ C
Sbjct: 149 VMGSNYEAVVYSAGPLAFRPQHC 166

BLAST of ClCG07G004420 vs. NCBI nr
Match: gi|764633987|ref|XP_011469935.1| (PREDICTED: proline-rich protein 3-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 188.0 bits (476), Expect = 2.3e-44
Identity = 94/157 (59.87%), Positives = 113/157 (71.97%), Query Frame = 1

Query: 1   MSTKNLLSLSLLFLLLHIAASTTTTTRPALKPVSVAVEGVVYCQNCQKIGTWSLTGAKPI 60
           M+ + L+ L    LL  +A   +TT     K + V VEG+VYCQ+C   GTWS+TGAKPI
Sbjct: 1   MAGRQLIVLVSSLLLTALAFFPSTTATEYQKKIDVVVEGMVYCQSCDHYGTWSMTGAKPI 60

Query: 61  SGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYKMNHPVLDHPLQACKVKPVYSPL 120
             AKVSVICKNH +QV+FYK ++TN DGYFYA L G+KMNH +LDHPLQAC VKPV SPL
Sbjct: 61  PSAKVSVICKNHKDQVSFYKAFQTNADGYFYAPLDGFKMNH-MLDHPLQACHVKPVSSPL 120

Query: 121 SDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAGVVY 158
            DC  L+N+N GL GAPLRYE K ++G NY A  VVY
Sbjct: 121 EDCRFLSNVNYGLNGAPLRYEDKRVMGSNYEA--VVY 154


HSP 2 Score: 187.6 bits (475), Expect = 3.0e-44
Identity = 85/129 (65.89%), Positives = 103/129 (79.84%), Query Frame = 1

Query: 154 GVVYCQNCQKIGTWSLTGAKPISGAKVSVICKNHNNQVNFYKVYETNKDGYFYAELVGYK 213
           GVVYCQ+C   G+WSLTGAKPI  A VSVICK+H N+V+FYK + T+ +GYFYAEL G++
Sbjct: 39  GVVYCQSCNSYGSWSLTGAKPIESATVSVICKDHRNRVSFYKAFATDGNGYFYAELKGFR 98

Query: 214 MNHPVLDHPLQACKVKPVYSPLSDCNLLTNLNDGLVGAPLRYEKKLIVGPNYRAAVFAAG 273
           M+H  LDHPLQACKVK V SP+  CN+L+N+N GL GAPLRYE+K +VG NY A V+AAG
Sbjct: 99  MSHYFLDHPLQACKVKLVASPMEACNVLSNVNYGLYGAPLRYEEKRLVGSNYEAVVYAAG 158

Query: 274 PLAFHPEKC 283
           PLAF P  C
Sbjct: 159 PLAFRPAHC 167

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
AGP30_ARATH       2.6e-11   31.79         Non-classical arabinogalactan protein 30 OS=Arabidopsis thaliana GN=AGP30 PE=2 S...
AGP31_ARATH       1.1e-09   33.33         Non-classical arabinogalactan protein 31 OS=Arabidopsis thaliana GN=AGP31 PE=1 S...
PRP3_ARATH        3.5e-08   31.97         Proline-rich protein 3 OS=Arabidopsis thaliana GN=PRP3 PE=2 SV=1
PRP1_ARATH        8.7e-07   31.91         Proline-rich protein 1 OS=Arabidopsis thaliana GN=PRP1 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0KP60_CUCSA  1.8e-64   80.39         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G211030 PE=4 SV=1
A0A0A0KP60_CUCSA  1.0e-62   86.92         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G211030 PE=4 SV=1
W9QNK7_9ROSA      2.6e-42   59.87         Uncharacterized protein OS=Morus notabilis GN=L484_001723 PE=4 SV=1
D7T4L1_VITVI      1.6e-39   56.29         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0067g01540 PE=4 SV=...
I1LMT9_SOYBN      2.4e-40   54.25         Uncharacterized protein OS=Glycine max GN=GLYMA_11G232100 PE=4 SV=1

Match Name        E-value   Identity (%)  Description
AT5G05500.1       4.7e-43   58.14         Pollen Ole e 1 allergen and extensin family protein
AT2G33790.1       1.5e-12   31.79         arabinogalactan protein 30
AT1G28290.1       6.2e-11   33.33         arabinogalactan protein 31
AT3G09925.1       1.0e-10   32.26         Pollen Ole e 1 allergen and extensin family protein
AT2G47530.1       1.5e-09   32.33         Pollen Ole e 1 allergen and extensin family protein

Match Name                        E-value   Identity (%)  Description
gi|659109661|ref|XP_008454812.1|  5.3e-65   81.82         PREDICTED: uncharacterized protein LOC103495126 [Cucumis melo]
gi|659109661|ref|XP_008454812.1|  1.9e-62   86.15         PREDICTED: uncharacterized protein LOC103495126 [Cucumis melo]
gi|700195502|gb|KGN50679.1|       1.5e-62   86.92         hypothetical protein Csa_5G211030 [Cucumis sativus]
gi|778701337|ref|XP_004148198.2|  3.8e-55   85.71         PREDICTED: proline-rich protein 3 [Cucumis sativus]
gi|764633987|ref|XP_011469935.1|  2.3e-44   59.87         PREDICTED: proline-rich protein 3-like [Fragaria vesca subsp. vesca]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0048767 root hair elongation
biological_process GO:0016049 cell growth
biological_process GO:0060560 developmental growth involved in morphogenesis
biological_process GO:0010054 trichoblast differentiation
biological_process GO:0048869 cellular developmental process
biological_process GO:0009826 unidimensional cell growth
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0031982 vesicle
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG07G004420.1  ClCG07G004420.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR33470      FAMILY NOT NAMED     coord: 156..282, score: 3.9
None      No IPR available  PANTHER  PTHR33470:SF5  SUBFAMILY NOT NAMED  coord: 156..282, score: 3.9
None      No IPR available  PFAM     PF01190        Pollen_Ole_e_I       coord: 153..248, score: 1.6E-24
                                                                         coord: 37..133, score: 1.3

The following gene(s) are paralogous to this gene:

None