ClCG01G009210 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G009210
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Phloem lectin
Location: CG_Chr01: 12000923 .. 12003121 (-)
RNA-Seq Expression: ClCG01G009210
Synteny: ClCG01G009210
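
The location entry packs a chromosome name, start and end coordinates, and a strand into one string. The sketch below shows one way to parse it. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name rather than part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'CG_Chr01: 12000923 .. 12003121 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr01: 12000923 .. 12003121 (-)")
print(loc)
```

Under that assumption the feature spans 2199 bases, and the `-` strand means the displayed gene sequence would be the reverse complement of the reference chromosome over that interval.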
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAATGGATAATTTTCATGTCACACCTTTCAAATGTTGTGGCACACCAAGATGTTATATATATCTCTTCTCCATTTCCATCTTTATACACATCTTAATTGGCTTCAATCAAAATAGCCTCAAAATGAGCTCCGAGGAGAATGAAGCAAGAAAGATTTTACTTGGCCATTGCTTAGGTAATATTTTGCCACAGTCAGACGTGCCAATAAAATTACCTTCCTTGGTTCCGCTTTTTGATCAACTTCTTGATGGGATCCCCTTGAACAACGGAGCTCAGGTAAGTATAACAATATATTTTATTTTTAAAAATGTTTTAAAAAATAGAAAAATAGTTTAAAATAATTACAAATATAGTAAAATTTTAGTTTCTATTGGTGATAGACCGCGCTAGACCCAAATAGACACCTATCAATAGATACAAATAGATGTCTATTTAGATCTATCGGTTTTTATCACAAATAGAAAGTAAAATTTTGTTATATGTATAAATAGTTTGATATTTTTTTATGTTATAATAATTTTCTTTATTTTTACTTTATTTACTTATTAATTACGTTGAATGCAGAAATTCTATTTGAATAAGGAGACAAACAGTAATCGAGTGTTTATACCAGTGAAATCACTCACAATATATGGGATTGATGATCCCCGATACTGGATACTGTCATGCCTTGAAGAGTGGGGGTATAATCTTAACCCCATATTTTCTTTGTTGCAAACCGAAATGATTACTAATTTCAAACCATATAAAAACCTTTGGCAGTATTAAATTATTAATAATATATGTTTAGGTTTGACTATGAATGAGCTTAATATTTTCTAAAAGACGTTGGGTATGGTTTTGAAGAAAAGTTGTATCCCTGTTCTTAATTAGAATTTTTTATTTTGCGTCGAGTTTTGATTTGATCCCTATGTTTTAAAATATTAATATTTAATTCTTAAGTTTTGAGTTTGGTGCCAATTTAGCCCTTTAATTTCAAAATATTACAATTTTAACTTTCAGATTTGAGTTTTCTTTCAATTTAGTGGTCTTTAAGTTCCAATATCTACCATAATTTTAACCTCATATTTCATATCCAGTAATTGGCATCAATGTTGATTAATTAACTTTAAATAATTATAAAGTGGAATTTTAAATTCAATTTTAACAATAATGGAAATATAGTGAAACTTAATTAATTATAGTTATTATAATTATTTTAAATTAATTAATGGAAAGTTATTTTAAATTACAAATAATTGTAAAATATCAGTCTATTTGCGATATACCATGATAGATTGCGAGAGATTAGCATATGTGAATATCATGATAGACTGCAATAGACTACTATCTGTGCCTATCATGATAGACACAAATAGTACTCTATTGCGGTCTATTACAGACAGACAGCAAAAAAATTTTATATTTACAAATATTTTAATTCATTTTATTATATTAGAAACAAGCCTTAATCAATGGACATTAAAACAAATATAGAAAACTGAATATTTAGTGAAAAATCAAGGTTAAAAGTGTAAATCTTGAAACTTAGGAATCAAATTGAAAGAAAACTCAAATCTAAAGGGTAAATTTGAAACATTTTGAAACTTGGGGACTTCATTGAAATTAAACTAAAAACTTAAGTACTAAATATGTAAGATTAGAAAAGAGGGACCAAATAAAAACTAGAACCTAAACCTAAGGTCGAAAAAGATAGTTTTCCCACTCTGGTGTTGTAACTAATTTGAAGTTAACTCAAGTAGTTAAAACGTTATATTGGCAACGATCTCACAATGATGTCTAAAAACGTGCAGTAAGAAAGTGACCATCGCTGAAGTTAGAGGAATAAGTCAGTTTGATATTAGTGGATCAGTGAAGACAGGTTTGCTGACACCAAAGGTAATATATATAGTAGTATTTGTGGTATTGCTCACTATTGATGCCAAGGGATGGACGTCTCCTGTGAACCTGATACTGAAGAAGCCAAATGGGAGCAAGATAGAGAGCAAAATAAGCCT
AGAGGGGAAGCCAAGAGGAGAGTATTTCGAGGTAATTGTTGGAGAACTCACATTGGATGACCGCGGATGTGCCGATACCAGCGTGATCGAGTTCGGCATGTATGAACATGGAAGTCAACTCAAAAGTGGGCTGGTCTTGAAAGGTGGCCTTTTACGATCAAAGGCCTCACCTGGCTGCCCCCATGCTGATACCAAGTAG

mRNA sequence

ATGAGAATGGATAATTTTCATGTCACACCTTTCAAATGTTGTGGCACACCAAGATGTTATATATATCTCTTCTCCATTTCCATCTTTATACACATCTTAATTGGCTTCAATCAAAATAGCCTCAAAATGAGCTCCGAGGAGAATGAAGCAAGAAAGATTTTACTTGGCCATTGCTTAGGTAATATTTTGCCACAGTCAGACGTGCCAATAAAATTACCTTCCTTGGTTCCGCTTTTTGATCAACTTCTTGATGGGATCCCCTTGAACAACGGAGCTCAGAAATTCTATTTGAATAAGGAGACAAACAGTAATCGAGTGTTTATACCAGTGAAATCACTCACAATATATGGGATTGATGATCCCCGATACTGGATACTGTCATGCCTTGAAGAGTGGGGTAAGAAAGTGACCATCGCTGAAGTTAGAGGAATAAGTCAGTTTGATATTAGTGGATCAGTGAAGACAGGTTTGCTGACACCAAAGGTAATATATATAGTAGTATTTGTGGTATTGCTCACTATTGATGCCAAGGGATGGACGTCTCCTGTGAACCTGATACTGAAGAAGCCAAATGGGAGCAAGATAGAGAGCAAAATAAGCCTAGAGGGGAAGCCAAGAGGAGAGTATTTCGAGGTAATTGTTGGAGAACTCACATTGGATGACCGCGGATGTGCCGATACCAGCGTGATCGAGTTCGGCATGTATGAACATGGAAGTCAACTCAAAAGTGGGCTGGTCTTGAAAGGTGGCCTTTTACGATCAAAGGCCTCACCTGGCTGCCCCCATGCTGATACCAAGTAG

Coding sequence (CDS)

ATGAGAATGGATAATTTTCATGTCACACCTTTCAAATGTTGTGGCACACCAAGATGTTATATATATCTCTTCTCCATTTCCATCTTTATACACATCTTAATTGGCTTCAATCAAAATAGCCTCAAAATGAGCTCCGAGGAGAATGAAGCAAGAAAGATTTTACTTGGCCATTGCTTAGGTAATATTTTGCCACAGTCAGACGTGCCAATAAAATTACCTTCCTTGGTTCCGCTTTTTGATCAACTTCTTGATGGGATCCCCTTGAACAACGGAGCTCAGAAATTCTATTTGAATAAGGAGACAAACAGTAATCGAGTGTTTATACCAGTGAAATCACTCACAATATATGGGATTGATGATCCCCGATACTGGATACTGTCATGCCTTGAAGAGTGGGGTAAGAAAGTGACCATCGCTGAAGTTAGAGGAATAAGTCAGTTTGATATTAGTGGATCAGTGAAGACAGGTTTGCTGACACCAAAGGTAATATATATAGTAGTATTTGTGGTATTGCTCACTATTGATGCCAAGGGATGGACGTCTCCTGTGAACCTGATACTGAAGAAGCCAAATGGGAGCAAGATAGAGAGCAAAATAAGCCTAGAGGGGAAGCCAAGAGGAGAGTATTTCGAGGTAATTGTTGGAGAACTCACATTGGATGACCGCGGATGTGCCGATACCAGCGTGATCGAGTTCGGCATGTATGAACATGGAAGTCAACTCAAAAGTGGGCTGGTCTTGAAAGGTGGCCTTTTACGATCAAAGGCCTCACCTGGCTGCCCCCATGCTGATACCAAGTAG

Protein sequence

MRMDNFHVTPFKCCGTPRCYIYLFSISIFIHILIGFNQNSLKMSSEENEARKILLGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYLNKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGELTLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGCPHADTK
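
The protein sequence is the translation of the CDS above. As a minimal sketch, assuming the standard genetic code applies (the page does not name a translation table), the first and last codons of the CDS can be translated as follows. `translate` is an illustrative helper, not a function from any particular library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS shown above.
cds_start = "ATGAGAATGGATAATTTTCATGTCACACCT"
cds_end = "GATACCAAGTAG"
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first ten residues and the last three residues of the protein sequence listed here.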
Homology
BLAST of ClCG01G009210 vs. NCBI nr
Match: WP_219729244.1 (PP2 domain-containing protein, partial [Pectobacterium odoriferum] >POE05059.1 hypothetical protein BV921_23310, partial [Pectobacterium odoriferum])

HSP 1 Score: 292.7 bits (748), Expect = 3.1e-75
Identity = 143/234 (61.11%), Positives = 172/234 (73.50%), Query Frame = 0

Query: 38  QNSLKMSSEENEARKIL-----LGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGA 97
           QN LKM  EE EARK L     LGHCL  ILPQ+D  +  PS +PLFDQL+ GI LNNGA
Sbjct: 1   QNRLKMDPEEVEARKYLGLQVQLGHCLSYILPQADEKLLWPSYIPLFDQLVYGISLNNGA 60

Query: 98  QKFYLNKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGS 157
           QK+YLNK+TNSNRV+IP KSL I    DPRYW    LEEW K+  +A +  ++ FD+ GS
Sbjct: 61  QKYYLNKQTNSNRVYIPPKSLNIAWGHDPRYWKWELLEEWDKRALLAVLIKVNWFDVRGS 120

Query: 158 VKTGLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEV 217
           VK  LL+ +++Y+V FVV    DA GW  PVNL+LKKPNG KIESK+++EGKP+GEYFEV
Sbjct: 121 VKASLLSSRILYLVAFVVYFNEDAHGWHVPVNLVLKKPNGCKIESKVNIEGKPKGEYFEV 180

Query: 218 IVGELTLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGCPHADTK 267
             GEL LD+ GC D+ +IEF MYEHGS  K GLV+KG +LRSKAS GCPHA+ K
Sbjct: 181 DAGELILDNCGCGDSGIIEFAMYEHGSHEKRGLVMKGVILRSKASDGCPHANVK 234
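
Each HSP header reports its statistics as a count over the alignment length, followed by the percentage. The sketch below recomputes the percentage from the counts as a consistency check; `hsp_fractions` is a hypothetical helper, and the regular expression assumes only the `label = a/b` layout seen in these headers.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for each 'label = a/b' field."""
    out = {}
    for label, a, b in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(a), int(b)
        out[label] = (matches, length, round(100 * matches / length, 2))
    return out

stats = hsp_fractions("Identity = 143/234 (61.11%), Query Frame = 0")
print(stats)
```

For the first hit, 143 identical positions over 234 aligned columns reproduces the reported 61.11%.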

BLAST of ClCG01G009210 vs. NCBI nr
Match: XP_038893562.1 (lectin-like [Benincasa hispida])

HSP 1 Score: 235.7 bits (600), Expect = 4.5e-58
Identity = 117/225 (52.00%), Positives = 153/225 (68.00%), Query Frame = 0

Query: 43  MSSEENEARKIL-----LGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYL 102
           M  EE EARK L      GH L  I   SD  ++ P+ VPLFDQLLDG+  N G +KF L
Sbjct: 1   MDPEELEARKYLGVKLSYGHNLDCIFSHSDDKLQNPTFVPLFDQLLDGVSFNQGTRKFRL 60

Query: 103 NKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGL 162
           N+ TNSNRVFI  K+L+I  ++D RYW    + +   +V   ++  +S FDI GSVK   
Sbjct: 61  NQPTNSNRVFILPKALSIAWVNDTRYWKWIIVTDNRNEVEAPQLIQVSWFDIHGSVKASW 120

Query: 163 LTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGEL 222
           L+ +V+Y+V F+V+LT DA GW   VNL +KKPNG  +E+K+SLEGKP+GE+FEV  G+ 
Sbjct: 121 LSSRVVYLVAFLVMLTEDASGWNHHVNLTIKKPNGCTVETKVSLEGKPKGEFFEVWAGDF 180

Query: 223 TLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGCPH 263
            LD+ GC D+ V+EFGMYEHG   K GL++KG ++RSKAS GCPH
Sbjct: 181 ILDNCGCGDSGVVEFGMYEHGGHWKRGLIVKGVVIRSKASAGCPH 225

BLAST of ClCG01G009210 vs. NCBI nr
Match: WP_181002512.1 (PP2 domain-containing protein, partial [Pectobacterium odoriferum] >POE16080.1 hypothetical protein BV923_23560, partial [Pectobacterium odoriferum])

HSP 1 Score: 226.5 bits (576), Expect = 2.7e-55
Identity = 105/174 (60.34%), Positives = 131/174 (75.29%), Query Frame = 0

Query: 93  QKFYLNKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGS 152
           +K+YLNK+TNSNRV+IP KSL I    DPRYW    LEEW K+  +A +  ++ FD+ GS
Sbjct: 5   EKYYLNKQTNSNRVYIPPKSLNIAWGHDPRYWKWELLEEWDKRALLAVLIKVNWFDVRGS 64

Query: 153 VKTGLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEV 212
           VK  LL+ +++Y+V FVV    DA GW  PVNL+LKKPNG KIESK+++EGKP+GEYFEV
Sbjct: 65  VKASLLSSRILYLVAFVVYFNEDAHGWHVPVNLVLKKPNGCKIESKVNIEGKPKGEYFEV 124

Query: 213 IVGELTLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGCPHADTK 267
             GEL LD+ GC D+ +IEF MYEHGS  K GLV+KG +LRSKAS GCPHA+ K
Sbjct: 125 DAGELILDNCGCGDSGIIEFAMYEHGSHEKRGLVMKGVILRSKASDGCPHANVK 178

BLAST of ClCG01G009210 vs. NCBI nr
Match: AAA92465.1 (phloem protein 2 [Cucurbita argyrosperma])

HSP 1 Score: 185.7 bits (470), Expect = 5.4e-43
Identity = 98/223 (43.95%), Positives = 138/223 (61.88%), Query Frame = 0

Query: 43  MSSEENEA-----RKILLGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYL 102
           M S+E EA     R++ LGHCL  IL  +DV +  PS + L+DQL+ GI LN GA K+  
Sbjct: 1   MDSKEKEAREKLGREVKLGHCLDVILKNADVALHYPSFLKLYDQLVAGILLNKGAIKYIF 60

Query: 103 NKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGL 162
           +K+ NS+  FI  ++L+I  I+D RYW      +WG    IAE+  +S  DI G +K  +
Sbjct: 61  DKKLNSHWYFIFARALSIAWIEDKRYW------KWGSCNEIAELIQVSWLDIRGKIKESM 120

Query: 163 LTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGEL 222
           L+P ++Y V   V L   A GW  P+N+ LKKPNGSKIE +  L GKP+ ++FE+++ E 
Sbjct: 121 LSPNIVYEVALQVQLNSGASGWNHPMNIELKKPNGSKIERQECLLGKPKNQWFEIVI-EF 180

Query: 223 TLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGC 261
            +D+ GC  +  IEFG YEHG   KSGL++KG  + +K   GC
Sbjct: 181 KVDNHGCGSSGEIEFGFYEHGGHWKSGLLVKGVRIGAKGC-GC 215

BLAST of ClCG01G009210 vs. NCBI nr
Match: KAG6583769.1 (hypothetical protein SDJN03_19701, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 185.3 bits (469), Expect = 7.0e-43
Identity = 97/223 (43.50%), Positives = 138/223 (61.88%), Query Frame = 0

Query: 43  MSSEENEAR-----KILLGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYL 102
           M ++E EAR     ++ LGHCL  IL  +DV +  PS + L+DQL+ GI LN GA K+  
Sbjct: 1   MDNKEKEAREKLGGEVKLGHCLDVILKNADVALHYPSFLKLYDQLVAGILLNKGAIKYIF 60

Query: 103 NKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGL 162
           +K++NSN  FI  ++L+I  I+D RYW      +WG    IAE+  +S  DI G +   +
Sbjct: 61  DKKSNSNWYFIFARALSIAWIEDKRYW------KWGSCNKIAELIQVSWLDIRGKINESM 120

Query: 163 LTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGEL 222
           L+P ++Y V   V L   A GW  P+N+ LKKPNGSKIE +  L GKP+ ++FE+++ E 
Sbjct: 121 LSPNIVYEVALQVQLNSGASGWNHPMNIELKKPNGSKIERQECLLGKPKNQWFEIVI-EF 180

Query: 223 TLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGC 261
            +D+ GC  +  IEFG YEHG   KSGL++KG  + +K   GC
Sbjct: 181 KVDNHGCGSSGEIEFGFYEHGGHWKSGLLVKGVRIGAKGC-GC 215

BLAST of ClCG01G009210 vs. ExPASy Swiss-Prot
Match: C0HJV2 (Lectin OS=Luffa acutangula OX=56866 PE=1 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 1.7e-28
Identity = 71/206 (34.47%), Positives = 120/206 (58.25%), Query Frame = 0

Query: 52  KILLGHCLGNILPQSDVPI-KLPSLVPLFDQLLDGIPLNNGAQKFYLNKETNSNRVFIPV 111
           ++ +GH L  IL   DV +  +PS + L+DQ+  GI LNN  ++++ +K   SN   +  
Sbjct: 7   EVKVGHNLEAILKGLDVDVYSVPSFIKLYDQVTAGIFLNNRTKRYWFDKNAESNCFMLYA 66

Query: 112 KSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGLLTPKVIYIVVFVV 171
           + L I    D RYW  +  +E G  + +AE+  +   +I G+++T +L+P + Y   F V
Sbjct: 67  RDLLITWSQDKRYWRWNPFQEHGNTLEVAELIDVCWLNIVGNIETSVLSPGISYEAAFEV 126

Query: 172 LLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGELTLDDRGCADT-SV 231
           +LT  A GW  PV++ LK P+GS+ ES+++L+ KPRG +F + VG   +      +T   
Sbjct: 127 MLTNSASGWRIPVDVKLKMPDGSEQESQVNLQDKPRGVWFFISVGHFKI---SVGETIGN 186

Query: 232 IEFGMYEHGSQLKSGLVLKGGLLRSK 256
           IEF + +H  + K GL++KG +++ K
Sbjct: 187 IEFSIVQH-QEAKRGLLVKGLVIQPK 208

BLAST of ClCG01G009210 vs. ExPASy Swiss-Prot
Match: O81865 (Protein PHLOEM PROTEIN 2-LIKE A1 OS=Arabidopsis thaliana OX=3702 GN=PP2A1 PE=2 SV=1)

HSP 1 Score: 104.8 bits (260), Expect = 1.6e-21
Identity = 76/198 (38.38%), Positives = 101/198 (51.01%), Query Frame = 0

Query: 62  ILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYLNKETNSNRVFIPVKSLTIYGIDDP 121
           IL  +D PI L S V L +QL  G+ L    Q  Y   E NSN   +  K+L+I   DD 
Sbjct: 52  ILRDADPPISLSS-VNLSEQLRSGVFLKPKKQIKYWVDERNSNCFMLFAKNLSITWSDDV 111

Query: 122 RYWI-LSCLEEWGKKVTIAEVRGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKGWT 181
            YW   +  E   + V    ++ +   DI+G   T  LTP ++Y VVF V L   A GW 
Sbjct: 112 NYWTWFTEKESPNENVEAVGLKNVCWLDITGKFDTRNLTPGIVYEVVFKVKLEDPAYGWD 171

Query: 182 SPVNLILKKPNGSK--IESKISLEGKPRGEYFEVIVGELTLDDRGCADTSVIEFGMYEHG 241
           +PVNL L  PNG +   E K+SL   PR ++ +V VGE   +     +   I F MYEH 
Sbjct: 172 TPVNLKLVLPNGKEKPQEKKVSLRELPRYKWVDVRVGEFVPEKSAAGE---ITFSMYEHA 231

Query: 242 SQL-KSGLVLKGGLLRSK 256
           + + K GL LKG  +R K
Sbjct: 232 AGVWKKGLSLKGVAIRPK 245

BLAST of ClCG01G009210 vs. ExPASy Swiss-Prot
Match: P0DSP5 (Lectin OS=Coccinia grandis OX=387127 PE=1 SV=1)

HSP 1 Score: 78.6 bits (192), Expect = 1.2e-13
Identity = 48/152 (31.58%), Positives = 78/152 (51.32%), Query Frame = 0

Query: 97  LNKETNSNRVFIPV-KSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKT 156
           LN+E  S+  F+   ++ T+   DD RYW  + ++  G ++  A++  +S FD   +V T
Sbjct: 1   LNQEKLSSTHFLLFPRAATLTWSDDTRYWSWNPVDFCGYQLEEAQLSRVSWFDCRWTVNT 60

Query: 157 GLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVG 216
             L   V Y V   V +   A GW +P+NL L+ PNGSK  S++ L  +PR  +F++ +G
Sbjct: 61  TDLKTNVWYNVFLKVQMGSGASGWNTPLNLELEMPNGSKQASQVVLNDRPRDVWFKLQMG 120

Query: 217 ELTLDDRGCADTSVIEFGMYEHGSQLKSGLVL 248
            L + D        +   +Y H +  K G  L
Sbjct: 121 NLMVSD--SETCGALRMSLYNHQTNWKMGATL 150

BLAST of ClCG01G009210 vs. ExPASy Swiss-Prot
Match: Q9C5Q9 (Protein PHLOEM PROTEIN 2-LIKE A5 OS=Arabidopsis thaliana OX=3702 GN=PP2A5 PE=2 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 9.6e-11
Identity = 55/184 (29.89%), Positives = 85/184 (46.20%), Query Frame = 0

Query: 79  FDQLLDGIPLNNGAQKFYLNKETNSNRVF-IPVKSLTIYGIDDPRYWILSCLEEWGKKVT 138
           F Q+ +  P+ +   KF+++       VF I  + L+I   +D  +W    L       +
Sbjct: 231 FYQMKNQSPVPSYEFKFWVDLTRPKGNVFMIDARDLSIAWSEDSNHWTWLPLPNQNSNES 290

Query: 139 IAEV---RGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPN--G 198
           + E+   +  S  D++G   T  LTP+  Y VVFVV L    + W + V L L  PN   
Sbjct: 291 VMEIAFLKSASWLDVAGKFDTRYLTPRTRYEVVFVVKLEYTFE-WETLVKLKLDLPNTWE 350

Query: 199 SKIESKISLEGKPRGEYFEVIVGELTLDDRGCADTSVIEFGMYEHGSQL-KSGLVLKGGL 256
              E  + +      ++ ++ VGE T   +   +   I F MYEH  QL KSGL +KG  
Sbjct: 351 KPQEQSVDMFDYISDQWLDIPVGEFTTSKKNVGE---ISFAMYEHECQLWKSGLFVKGVT 410

BLAST of ClCG01G009210 vs. ExPASy Swiss-Prot
Match: O81866 (Protein PHLOEM PROTEIN 2-LIKE A2 OS=Arabidopsis thaliana OX=3702 GN=PP2A2 PE=2 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 1.4e-09
Identity = 41/122 (33.61%), Positives = 62/122 (50.82%), Query Frame = 0

Query: 135 KVTIAEVRGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSK 194
           +  +A++  ++  ++ G  +T  LTP  +Y VVFVV L   AKGW   VN  L  P G  
Sbjct: 74  RTEVAKMERVAWLEVVGKFETEKLTPNSLYEVVFVVKLIDSAKGWDFRVNFKLVLPTGET 133

Query: 195 IESKISLEGKPRGEYFEVIVGELTLDDRGCADTSVIEFGMYE-HGSQLKSGLVLKGGLLR 254
            E + ++    R ++ E+  GE  +       +  IEF M E    Q KSGL++KG  +R
Sbjct: 134 KERRENVNLLERNKWVEIPAGEFMISPEHL--SGKIEFSMLEVKSDQWKSGLIVKGVAIR 193

Query: 255 SK 256
            K
Sbjct: 194 PK 193

BLAST of ClCG01G009210 vs. ExPASy TrEMBL
Match: Q39461 (Phloem protein 2 OS=Cucurbita argyrosperma OX=34294 GN=PP2 PE=2 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 2.6e-43
Identity = 98/223 (43.95%), Positives = 138/223 (61.88%), Query Frame = 0

Query: 43  MSSEENEA-----RKILLGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYL 102
           M S+E EA     R++ LGHCL  IL  +DV +  PS + L+DQL+ GI LN GA K+  
Sbjct: 1   MDSKEKEAREKLGREVKLGHCLDVILKNADVALHYPSFLKLYDQLVAGILLNKGAIKYIF 60

Query: 103 NKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGL 162
           +K+ NS+  FI  ++L+I  I+D RYW      +WG    IAE+  +S  DI G +K  +
Sbjct: 61  DKKLNSHWYFIFARALSIAWIEDKRYW------KWGSCNEIAELIQVSWLDIRGKIKESM 120

Query: 163 LTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGEL 222
           L+P ++Y V   V L   A GW  P+N+ LKKPNGSKIE +  L GKP+ ++FE+++ E 
Sbjct: 121 LSPNIVYEVALQVQLNSGASGWNHPMNIELKKPNGSKIERQECLLGKPKNQWFEIVI-EF 180

Query: 223 TLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGC 261
            +D+ GC  +  IEFG YEHG   KSGL++KG  + +K   GC
Sbjct: 181 KVDNHGCGSSGEIEFGFYEHGGHWKSGLLVKGVRIGAKGC-GC 215

BLAST of ClCG01G009210 vs. ExPASy TrEMBL
Match: A0A6J1EHQ4 (lectin-like OS=Cucurbita moschata OX=3662 GN=LOC111434188 PE=4 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 1.1e-41
Identity = 96/225 (42.67%), Positives = 139/225 (61.78%), Query Frame = 0

Query: 43  MSSEENEAR-----KILLGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYL 102
           M ++E EAR     ++ LGHCL  IL  +DV +  PS V L+DQL+ GI LN GA K+  
Sbjct: 1   MDNKEREAREKLGGEVKLGHCLDVILKNADVALHYPSFVKLYDQLVAGILLNKGAIKYIF 60

Query: 103 NKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGK--KVTIAEVRGISQFDISGSVKT 162
           +K+ NS+  FI  ++L+I  I+D RYW      +WG      +AE+  +S  DI G +K 
Sbjct: 61  DKKLNSHWYFIFARALSIAWIEDKRYW------KWGSCGNSNVAELIQVSWLDIRGKIKE 120

Query: 163 GLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVG 222
            +L+P ++Y+V   V L   A GW  P+N+ LKKP+GSKIE +  L GKP+ ++FE+++ 
Sbjct: 121 FMLSPNIVYVVALEVQLNSGASGWNLPMNIELKKPDGSKIERQECLLGKPKNQWFEIVI- 180

Query: 223 ELTLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGC 261
           E  +D+ GC  +  IEFG YEHG   KSGL++KG  + +K   GC
Sbjct: 181 EFKVDNPGCGSSGEIEFGFYEHGGHWKSGLLVKGVRIGAKGC-GC 217

BLAST of ClCG01G009210 vs. ExPASy TrEMBL
Match: Q8LK67 (Phloem lectin OS=Cucurbita argyrosperma subsp. sororia OX=37648 GN=PP2 PE=2 SV=1)

HSP 1 Score: 179.1 bits (453), Expect = 2.4e-41
Identity = 94/223 (42.15%), Positives = 136/223 (60.99%), Query Frame = 0

Query: 43  MSSEENEAR-----KILLGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYL 102
           M+ +E EAR     ++ LGHCL  IL  +DV +  PS + L+DQL+ GI LN GA K+  
Sbjct: 1   MNHKEKEAREKLGGEVKLGHCLDVILKNADVALHYPSFLKLYDQLVAGILLNKGAIKYIF 60

Query: 103 NKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGL 162
           +K++NSN  FI  ++L+I  I+D RYW      +WG    IAE+  +S  DI G +   +
Sbjct: 61  DKKSNSNWYFIFARALSIAWIEDKRYW------KWGSCNKIAELIQVSWLDIRGKINESM 120

Query: 163 LTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGEL 222
           L+P ++Y V   V L   A GW  P+N+ LKKPNGSKIE +  L GKP+ ++FE+++ E 
Sbjct: 121 LSPNIVYEVALQVQLNSGASGWNHPMNIELKKPNGSKIERQECLLGKPKNQWFEIVI-EF 180

Query: 223 TLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGC 261
            +D+ GC  +  IEF  +EHG   K GL++KG  + +K   GC
Sbjct: 181 KVDNHGCGSSGEIEFSFFEHGGHWKRGLLVKGVRIGAKGC-GC 215

BLAST of ClCG01G009210 vs. ExPASy TrEMBL
Match: Q9LLT3 (Phloem protein 2 OS=Cucurbita moschata OX=3662 GN=pp2 PE=2 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 9.2e-41
Identity = 94/223 (42.15%), Positives = 136/223 (60.99%), Query Frame = 0

Query: 43  MSSEENEAR-----KILLGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYL 102
           M ++E EAR     ++ LGHCL  IL  +DV +  PS V L+DQL+ GI LN GA ++  
Sbjct: 1   MDNKEREAREKLGGEVKLGHCLDVILKNADVALHYPSFVKLYDQLVAGILLNKGAIRYIF 60

Query: 103 NKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGL 162
           +K++NSN  FI  ++L+I  I+D RYW      +WG  + IAE+  +S  DI G +   +
Sbjct: 61  DKKSNSNWYFIFARALSIAWIEDKRYW------KWGSCIKIAELIEVSWLDIRGKINESM 120

Query: 163 LTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGEL 222
           L+P ++Y V   V L   A GW +P+N+ LKKP+GSKI  +  L GKPR ++FE++V E 
Sbjct: 121 LSPNIVYEVALQVQLNDRASGWDAPLNIELKKPDGSKIVRQECLLGKPRNQWFEIVV-EF 180

Query: 223 TLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGC 261
            + + GC  +  IEF  YEHG   K GL++KG  + +K   GC
Sbjct: 181 KVGNHGCGSSGEIEFAFYEHGGHWKRGLLVKGVRIGAKGC-GC 215

BLAST of ClCG01G009210 vs. ExPASy TrEMBL
Match: Q39462 (Phloem protein 2 OS=Cucurbita argyrosperma OX=34294 GN=PP2 PE=2 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 1.2e-40
Identity = 95/223 (42.60%), Positives = 135/223 (60.54%), Query Frame = 0

Query: 43  MSSEENEAR-----KILLGHCLGNILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYL 102
           M + E EAR     ++ LGHCL  IL  +DV +  PS + L+DQL+ GI LN GA K+  
Sbjct: 1   MVNNEKEAREKLGGEVKLGHCLDVILKNADVALHYPSFLKLYDQLVAGILLNKGAIKYIF 60

Query: 103 NKETNSNRVFIPVKSLTIYGIDDPRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGL 162
           +K++NSN  FI  ++L+I  I+D RYW      +WG    IAE+  +S  DI G +K  +
Sbjct: 61  DKKSNSNWYFIFARALSIAWIEDKRYW------KWGSCNKIAELIQVSWLDIRGKIKESM 120

Query: 163 LTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGEL 222
           L+P ++Y V   V L   A GW  P+N+ LKKPNGSKIE +  L GKP+ ++FE++V E 
Sbjct: 121 LSPNIVYEVALQVQLNSGASGWNHPMNIELKKPNGSKIERQECLLGKPKNQWFEIVV-EY 180

Query: 223 TLDDRGCADTSVIEFGMYEHGSQLKSGLVLKGGLLRSKASPGC 261
            + + GC  +  IEF  +EHG   K GL++KG  + +K   GC
Sbjct: 181 KVGNPGCGSSGEIEFSFFEHGGHWKRGLLVKGVRIGAKGC-GC 215

BLAST of ClCG01G009210 vs. TAIR 10
Match: AT4G19840.1 (phloem protein 2-A1 )

HSP 1 Score: 104.8 bits (260), Expect = 1.1e-22
Identity = 76/198 (38.38%), Positives = 101/198 (51.01%), Query Frame = 0

Query: 62  ILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYLNKETNSNRVFIPVKSLTIYGIDDP 121
           IL  +D PI L S V L +QL  G+ L    Q  Y   E NSN   +  K+L+I   DD 
Sbjct: 52  ILRDADPPISLSS-VNLSEQLRSGVFLKPKKQIKYWVDERNSNCFMLFAKNLSITWSDDV 111

Query: 122 RYWI-LSCLEEWGKKVTIAEVRGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKGWT 181
            YW   +  E   + V    ++ +   DI+G   T  LTP ++Y VVF V L   A GW 
Sbjct: 112 NYWTWFTEKESPNENVEAVGLKNVCWLDITGKFDTRNLTPGIVYEVVFKVKLEDPAYGWD 171

Query: 182 SPVNLILKKPNGSK--IESKISLEGKPRGEYFEVIVGELTLDDRGCADTSVIEFGMYEHG 241
           +PVNL L  PNG +   E K+SL   PR ++ +V VGE   +     +   I F MYEH 
Sbjct: 172 TPVNLKLVLPNGKEKPQEKKVSLRELPRYKWVDVRVGEFVPEKSAAGE---ITFSMYEHA 231

Query: 242 SQL-KSGLVLKGGLLRSK 256
           + + K GL LKG  +R K
Sbjct: 232 AGVWKKGLSLKGVAIRPK 245

BLAST of ClCG01G009210 vs. TAIR 10
Match: AT1G65390.1 (phloem protein 2 A5 )

HSP 1 Score: 68.9 bits (167), Expect = 6.8e-12
Identity = 55/184 (29.89%), Positives = 85/184 (46.20%), Query Frame = 0

Query: 79  FDQLLDGIPLNNGAQKFYLNKETNSNRVF-IPVKSLTIYGIDDPRYWILSCLEEWGKKVT 138
           F Q+ +  P+ +   KF+++       VF I  + L+I   +D  +W    L       +
Sbjct: 231 FYQMKNQSPVPSYEFKFWVDLTRPKGNVFMIDARDLSIAWSEDSNHWTWLPLPNQNSNES 290

Query: 139 IAEV---RGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPN--G 198
           + E+   +  S  D++G   T  LTP+  Y VVFVV L    + W + V L L  PN   
Sbjct: 291 VMEIAFLKSASWLDVAGKFDTRYLTPRTRYEVVFVVKLEYTFE-WETLVKLKLDLPNTWE 350

Query: 199 SKIESKISLEGKPRGEYFEVIVGELTLDDRGCADTSVIEFGMYEHGSQL-KSGLVLKGGL 256
              E  + +      ++ ++ VGE T   +   +   I F MYEH  QL KSGL +KG  
Sbjct: 351 KPQEQSVDMFDYISDQWLDIPVGEFTTSKKNVGE---ISFAMYEHECQLWKSGLFVKGVT 410

BLAST of ClCG01G009210 vs. TAIR 10
Match: AT4G19850.1 (lectin-related )

HSP 1 Score: 65.1 bits (157), Expect = 9.8e-11
Identity = 41/122 (33.61%), Positives = 62/122 (50.82%), Query Frame = 0

Query: 135 KVTIAEVRGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKGWTSPVNLILKKPNGSK 194
           +  +A++  ++  ++ G  +T  LTP  +Y VVFVV L   AKGW   VN  L  P G  
Sbjct: 74  RTEVAKMERVAWLEVVGKFETEKLTPNSLYEVVFVVKLIDSAKGWDFRVNFKLVLPTGET 133

Query: 195 IESKISLEGKPRGEYFEVIVGELTLDDRGCADTSVIEFGMYE-HGSQLKSGLVLKGGLLR 254
            E + ++    R ++ E+  GE  +       +  IEF M E    Q KSGL++KG  +R
Sbjct: 134 KERRENVNLLERNKWVEIPAGEFMISPEHL--SGKIEFSMLEVKSDQWKSGLIVKGVAIR 193

Query: 255 SK 256
            K
Sbjct: 194 PK 193

BLAST of ClCG01G009210 vs. TAIR 10
Match: AT1G33920.1 (phloem protein 2-A4 )

HSP 1 Score: 54.3 bits (129), Expect = 1.7e-07
Identity = 48/154 (31.17%), Positives = 65/154 (42.21%), Query Frame = 0

Query: 108 IPVKSLTIYGIDDPRYWILSCLE---EWGKKVTIAEVRGISQFDISGSVKTGLLTPKVIY 167
           I  + L+I   D   YW    L       K V  A +  +   D++G   T  LT +  Y
Sbjct: 14  IYARDLSIAWSDKDEYWSWLPLRYDISSEKLVDAAVLEAVCWLDVNGKFDTRELTLETTY 73

Query: 168 IVVFVVLLTIDAKGWTSPVNLILKKPNGSK--IESKISLEGKPRGEYFEVIVGELTLDDR 227
            VV+VV L   A GW  PVNL L  P+G K   E  + L+      + ++  GE      
Sbjct: 74  EVVYVVKLEDTASGWNIPVNLKLTLPDGKKRPQERSMCLKEHIGKRWIDISAGEFVTSPD 133

Query: 228 GCADTSVIEFGMYEHGSQL-KSGLVLKGGLLRSK 256
              +   I F MYE  S   K GL +K   +R K
Sbjct: 134 NAGE---ISFSMYETKSCCWKRGLFVKCVEIRPK 164

BLAST of ClCG01G009210 vs. TAIR 10
Match: AT4G19850.2 (lectin-related )

HSP 1 Score: 50.8 bits (120), Expect = 1.9e-06
Identity = 32/101 (31.68%), Positives = 51/101 (50.50%), Query Frame = 0

Query: 122 RYWI-LSCLEEWGKKV--TIAEVRGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKG 181
           +YW   S L++    V   +A++  ++  ++ G  +T  LTP  +Y VVFVV L   AKG
Sbjct: 83  KYWSWFSDLDQTSSDVRTEVAKMERVAWLEVVGKFETEKLTPNSLYEVVFVVKLIDSAKG 142

Query: 182 WTSPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGELTL 220
           W   VN  L  P G   E + ++    R ++ E+  GE  +
Sbjct: 143 WDFRVNFKLVLPTGETKERRENVNLLERNKWVEIPAGEFMI 183

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
WP_219729244.1  3.1e-75  61.11         PP2 domain-containing protein, partial [Pectobacterium odoriferum] >POE05059.1 h...
XP_038893562.1  4.5e-58  52.00         lectin-like [Benincasa hispida]
WP_181002512.1  2.7e-55  60.34         PP2 domain-containing protein, partial [Pectobacterium odoriferum] >POE16080.1 h...
AAA92465.1      5.4e-43  43.95         phloem protein 2 [Cucurbita argyrosperma]
KAG6583769.1    7.0e-43  43.50         hypothetical protein SDJN03_19701, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
C0HJV2          1.7e-28  34.47         Lectin OS=Luffa acutangula OX=56866 PE=1 SV=1
O81865          1.6e-21  38.38         Protein PHLOEM PROTEIN 2-LIKE A1 OS=Arabidopsis thaliana OX=3702 GN=PP2A1 PE=2 S...
P0DSP5          1.2e-13  31.58         Lectin OS=Coccinia grandis OX=387127 PE=1 SV=1
Q9C5Q9          9.6e-11  29.89         Protein PHLOEM PROTEIN 2-LIKE A5 OS=Arabidopsis thaliana OX=3702 GN=PP2A5 PE=2 S...
O81866          1.4e-09  33.61         Protein PHLOEM PROTEIN 2-LIKE A2 OS=Arabidopsis thaliana OX=3702 GN=PP2A2 PE=2 S...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
Q39461          2.6e-43  43.95         Phloem protein 2 OS=Cucurbita argyrosperma OX=34294 GN=PP2 PE=2 SV=1
A0A6J1EHQ4      1.1e-41  42.67         lectin-like OS=Cucurbita moschata OX=3662 GN=LOC111434188 PE=4 SV=1
Q8LK67          2.4e-41  42.15         Phloem lectin OS=Cucurbita argyrosperma subsp. sororia OX=37648 GN=PP2 PE=2 SV=1
Q9LLT3          9.2e-41  42.15         Phloem protein 2 OS=Cucurbita moschata OX=3662 GN=pp2 PE=2 SV=1
Q39462          1.2e-40  42.60         Phloem protein 2 OS=Cucurbita argyrosperma OX=34294 GN=PP2 PE=2 SV=1

TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G19840.1     1.1e-22  38.38         phloem protein 2-A1
AT1G65390.1     6.8e-12  29.89         phloem protein 2 A5
AT4G19850.1     9.8e-11  33.61         lectin-related
AT1G33920.1     1.7e-07  31.17         phloem protein 2-A4
AT4G19850.2     1.9e-06  31.68         lectin-related

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description        Source   Source Term     Source Description                Alignment
IPR025886  Phloem protein 2-like  PFAM     PF14299         PP2                               coord: 106..249; e-value: 4.2E-17; score: 62.6
None       No IPR available       PANTHER  PTHR32278       FAMILY NOT NAMED                  coord: 75..252
None       No IPR available       PANTHER  PTHR32278:SF42  PROTEIN PHLOEM PROTEIN 2-LIKE A2  coord: 75..252
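
The `coord` values locate each annotated region on the protein sequence. As a rough sketch, assuming these are 1-based inclusive residue positions (the usual InterProScan convention, though not stated on this page), the PFAM PP2 region at 106..249 can be sliced out of the protein listed under Sequences. `extract_region` is a hypothetical helper name.

```python
# Protein sequence copied from the "Protein sequence" entry of this record.
PROTEIN = (
    "MRMDNFHVTPFKCCGTPRCYIYLFSISIFIHILIGFNQNSLKMSSEENEARKILLGHCLG"
    "NILPQSDVPIKLPSLVPLFDQLLDGIPLNNGAQKFYLNKETNSNRVFIPVKSLTIYGIDD"
    "PRYWILSCLEEWGKKVTIAEVRGISQFDISGSVKTGLLTPKVIYIVVFVVLLTIDAKGWT"
    "SPVNLILKKPNGSKIESKISLEGKPRGEYFEVIVGELTLDDRGCADTSVIEFGMYEHGSQ"
    "LKSGLVLKGGLLRSKASPGCPHADTK"
)

def extract_region(seq, start, end):
    """Slice a 1-based, inclusive coordinate range out of a sequence."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

# Assumption: 'coord: 106..249' refers to residues 106 through 249 inclusive.
pp2 = extract_region(PROTEIN, 106, 249)
print(len(pp2), pp2)
```

The same helper applies to the PANTHER rows by substituting 75 and 252.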

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G009210.2  ClCG01G009210.2  mRNA