ClCG02G005150 (gene) Watermelon (Charleston Gray)

Name: ClCG02G005150
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: BURP domain-containing protein
Location: CG_Chr02 : 5545718 .. 5547991 (+)
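
The location string can be turned into a feature length with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the helper names `parse_location` and `span_length` are hypothetical and exist only for illustration.

```python
import re

def parse_location(text):
    """Split a location such as 'CG_Chr02 : 5545718 .. 5547991 (+)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr02 : 5545718 .. 5547991 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 2274 bases on the plus strand of CG_Chr02. If the coordinates were 0-based and half-open instead, the `+ 1` would have to be dropped.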
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AAAAAAAAAAACATTACTTCCAATTTTAAAAAGTAGTTTATTATTGCACATCACTTTTTTTTATATCTCATCCCAATTTGCACTTGAGCTACAGAAGTTGAAGAAAAAACACTTGAGGGACAAATGATGGAGGGCAACAAACTGGCTTCTTGCTTCATCTTCCTAATTCTTCTCTCTCTTCTTATGGTGAAGTTTCAAAAAAGAAAAAAAAAAAAAAAAAAAAAAAAATCTTCTTATTGAAAATTGGAAGTAATGTTTTTTTTTTTTTTTTTCTTTTTTGAAACTTCACCATAAGAAGAGAGAGAAGAATTAGGAAGATGAAGTTTCAAAAAAGAAAAAAAAAAAAAAAAAAATCTTCTTATTGAAAATTGGAAGTAATGTTTTTTTTTTTTTTTTTCTTTTTTGAAACTTCACCATAAGAAGAGAGAGAAGAATTAGGAAGATGAAGTTTCAAAAAAGAAAAAAAAAAAAAAAAAAATCTTCTTATTGAAAATTGGAAGTAATGTTTTTTTTTTTTTTTTTCTTTTTTGAAACTTCACCATAAGAAGAGAGAGAAGAATTAGGAAGATGAAGTTTCAAAAAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCTTCTTATTGAAAATTTAGTGTTGTTACTAAGTGTCATGTGATAAGGATTTCGTTTTTATTTTTATATTTTTGAAGTTTATGTTTCTTTTTTCTTCCCAATTTTCTGTATGGACTTTGGTTCTTACCTATTTCTTAAATAAACATTTCAAAAGTTTCTAAAAATATTTTAAAAAAAAAAAAATTGTTTGATTTTTGAAAATATGCATGGTACAACATGGATGGATTGCAAAACAAAGAAACTTATAGTATCTTCTGACGTTTACCATCAGGAGATCCTTTAATGATTTTCCCTCCTTTTAACCTCTTGATGTCGTATTTACAGCCATCGGGAGATCCTCAAATTATCCCCCCTTGCACTTGCTCTCTCTCGACGCTTAGAAATATGGCGTCAAGAGATATGATCGATCTCTCAACATTTTTTTGTACAACATGGAAAACTGTTACTGATTCTTAAAGCCTTTGTTACACCGTCGGGAGATAGTATATCTCTCGACTATATGACAACTCTCGACGTCCATGTGTAGTGTTGGGAGATTTTTTTTTATATATAATTGGAGTGAGAGTAGTATTTATAAGCAACTTATTTTTCAAAAACCAAATGTCCGTTATTAAACCGCCATAGGTATTATCAGAAAAAACTCACAAATTATTTCCTATTTTGCAGTTTGTTGAAGGAAGAATTAAACCTTCCATGGGTGAAAGTGATGAGAAGAGCAAGCAAATTGAGATGAAGGATTTTATTGAAGCAACATCAAATACAAAAAGTCCCATCATAAAAATGGTGACTCAAATGGGAGATAATATCCAAAATACCTATACCCAAGATCATGATCAACTTCCCAAGAACAATAATAATAATGGTCATTCTTCTTCATCATCATCTCATATGAATCACATAAACCCTTTACTCATAGTTTTCTTCACAATCAATGACCTAAAAGTTGGCAAAAAACTCCCAATCTATTTCCCCAAAAGAGACCCTTCAAAATCCCCTCCATTTTTACCCAAACAAAAAGCTGATTCAATCCCTTTCTCCCTCAAACAACTCCCTCAAATCCTCTCTTATTTCTCCTTCCCTTCAGATTCCCATCAAGCTCAAGCAGTCAAAGAAACCCTCCAACAATGTGAGCTCAAACCCATCAAAGGAGAGACCAAATTTTGTGCAACTTCCATGGAATCAATGTTGGATTTTGTAAGAACATCCTTAATAATTCCCACAAAATCTTCTTCATTATCATCATTTAAACTTGTCAAAACTTCACATCTCACAAAATCCAATGTCCATTTTCAAAATTACACAATCTTTGATACCCCTATGGAAATTGCTGCTCCTAAGCTTGTGGCTTGTCACACTATGCCATATCCTTATGCAATTTATT
ATTGTCATTACCAAGAAGGAGATAATAATGTGTTGAAAATTAATTTGGAAGGTGAAAATGGAGATAGAGTTGAAGCTTTGGCAATTTGTCATATGGATACTTCTCAATGGAGCCCTACCCATCCTTCCTTTCAAGTCCTCAAACTTCAACCAGGGGACATGCCCATTTGCCATTTCTTCCCTGCTGATGATTTTGTTTGGATTCCAATAGCTGCTTGAAATATGATTCCTTAGGATTATTATGTAATATTCTTAAAAAAAAAAAAAAAAAAAAA

mRNA sequence

AAAAAAAAAAACATTACTTCCAATTTTAAAAAGTAGTTTATTATTGCACATCACTTTTTTTTATATCTCATCCCAATTTGCACTTGAGCTACAGAAGTTGAAGAAAAAACACTTGAGGGACAAATGATGGAGGGCAACAAACTGGCTTCTTGCTTCATCTTCCTAATTCTTCTCTCTCTTCTTATGTTTGTTGAAGGAAGAATTAAACCTTCCATGGGTGAAAGTGATGAGAAGAGCAAGCAAATTGAGATGAAGGATTTTATTGAAGCAACATCAAATACAAAAAGTCCCATCATAAAAATGGTGACTCAAATGGGAGATAATATCCAAAATACCTATACCCAAGATCATGATCAACTTCCCAAGAACAATAATAATAATGGTCATTCTTCTTCATCATCATCTCATATGAATCACATAAACCCTTTACTCATAGTTTTCTTCACAATCAATGACCTAAAAGTTGGCAAAAAACTCCCAATCTATTTCCCCAAAAGAGACCCTTCAAAATCCCCTCCATTTTTACCCAAACAAAAAGCTGATTCAATCCCTTTCTCCCTCAAACAACTCCCTCAAATCCTCTCTTATTTCTCCTTCCCTTCAGATTCCCATCAAGCTCAAGCAGTCAAAGAAACCCTCCAACAATGTGAGCTCAAACCCATCAAAGGAGAGACCAAATTTTGTGCAACTTCCATGGAATCAATGTTGGATTTTGTAAGAACATCCTTAATAATTCCCACAAAATCTTCTTCATTATCATCATTTAAACTTGTCAAAACTTCACATCTCACAAAATCCAATGTCCATTTTCAAAATTACACAATCTTTGATACCCCTATGGAAATTGCTGCTCCTAAGCTTGTGGCTTGTCACACTATGCCATATCCTTATGCAATTTATTATTGTCATTACCAAGAAGGAGATAATAATGTGTTGAAAATTAATTTGGAAGGTGAAAATGGAGATAGAGTTGAAGCTTTGGCAATTTGTCATATGGATACTTCTCAATGGAGCCCTACCCATCCTTCCTTTCAAGTCCTCAAACTTCAACCAGGGGACATGCCCATTTGCCATTTCTTCCCTGCTGATGATTTTGTTTGGATTCCAATAGCTGCTTGAAATATGATTCCTTAGGATTATTATGTAATATTCTTAAAAAAAAAAAAAAAAAAAAA

Coding sequence (CDS)

ATGATGGAGGGCAACAAACTGGCTTCTTGCTTCATCTTCCTAATTCTTCTCTCTCTTCTTATGTTTGTTGAAGGAAGAATTAAACCTTCCATGGGTGAAAGTGATGAGAAGAGCAAGCAAATTGAGATGAAGGATTTTATTGAAGCAACATCAAATACAAAAAGTCCCATCATAAAAATGGTGACTCAAATGGGAGATAATATCCAAAATACCTATACCCAAGATCATGATCAACTTCCCAAGAACAATAATAATAATGGTCATTCTTCTTCATCATCATCTCATATGAATCACATAAACCCTTTACTCATAGTTTTCTTCACAATCAATGACCTAAAAGTTGGCAAAAAACTCCCAATCTATTTCCCCAAAAGAGACCCTTCAAAATCCCCTCCATTTTTACCCAAACAAAAAGCTGATTCAATCCCTTTCTCCCTCAAACAACTCCCTCAAATCCTCTCTTATTTCTCCTTCCCTTCAGATTCCCATCAAGCTCAAGCAGTCAAAGAAACCCTCCAACAATGTGAGCTCAAACCCATCAAAGGAGAGACCAAATTTTGTGCAACTTCCATGGAATCAATGTTGGATTTTGTAAGAACATCCTTAATAATTCCCACAAAATCTTCTTCATTATCATCATTTAAACTTGTCAAAACTTCACATCTCACAAAATCCAATGTCCATTTTCAAAATTACACAATCTTTGATACCCCTATGGAAATTGCTGCTCCTAAGCTTGTGGCTTGTCACACTATGCCATATCCTTATGCAATTTATTATTGTCATTACCAAGAAGGAGATAATAATGTGTTGAAAATTAATTTGGAAGGTGAAAATGGAGATAGAGTTGAAGCTTTGGCAATTTGTCATATGGATACTTCTCAATGGAGCCCTACCCATCCTTCCTTTCAAGTCCTCAAACTTCAACCAGGGGACATGCCCATTTGCCATTTCTTCCCTGCTGATGATTTTGTTTGGATTCCAATAGCTGCTTGA

Protein sequence

MMEGNKLASCFIFLILLSLLMFVEGRIKPSMGESDEKSKQIEMKDFIEATSNTKSPIIKMVTQMGDNIQNTYTQDHDQLPKNNNNNGHSSSSSSHMNHINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVWIPIAA
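
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (the page does not say which table was used) and only translates the first and last few codons of the CDS; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 13 codons and last 6 codons of the CDS listed above.
print(translate("ATGATGGAGGGCAACAAACTGGCTTCTTGCTTCATCTTC"))
print(translate("ATTCCAATAGCTGCTTGA"))
```

The two fragments translate to `MMEGNKLASCFIF` and `IPIAA`, which match the start and end of the protein sequence shown on this page. Passing the full CDS string to `translate` would be the natural next check.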
BLAST of ClCG02G005150 vs. Swiss-Prot
Match: BNM2C_BRANA (BURP domain-containing protein BNM2C OS=Brassica napus GN=BNM2C PE=2 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 4.1e-56
Identity = 106/249 (42.57%), Positives = 151/249 (60.64%), Query Frame = 1

Query: 84  NNNGHSSSSSSHM----NHINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKA 143
           +NN     + SH+       +P + +FF I+DLK+G KLPIYF K D  K PP L +Q+A
Sbjct: 33  SNNEQEGQNISHLFKDGEFEDPTMYMFFKISDLKLGTKLPIYFNKNDLRKVPPLLTRQEA 92

Query: 144 DSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVR 203
           D IPFS   L  +L++FS   DS Q +A+KETLQ+C+ K I+GE KFC TS+ESMLD  +
Sbjct: 93  DLIPFSESNLDFLLNHFSISKDSPQGEAMKETLQRCDFKAIEGEYKFCGTSLESMLDLAK 152

Query: 204 TSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIY 263
            ++        +++  +V   +     +H  NYT  + P E+   K++ CH MPYPY +Y
Sbjct: 153 KTIASNADLKVMTTKVMVPDQNRISYALH--NYTFAEVPKELDGIKVLGCHRMPYPYVVY 212

Query: 264 YCHYQEGDNNVLKINLEGENG-DRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHF 323
           YCH  +    V ++NL  ++G   V   A+CHMDTS W+  H +F+VLK++P   P+CHF
Sbjct: 213 YCHGHKSGTKVFEVNLMSDDGIQLVVGPAVCHMDTSMWNADHVAFKVLKIEPRSAPVCHF 272

Query: 324 FPADDFVWI 328
           FP D+ VW+
Sbjct: 273 FPLDNIVWV 279

BLAST of ClCG02G005150 vs. Swiss-Prot
Match: USPL1_ARATH (BURP domain protein USPL1 OS=Arabidopsis thaliana GN=USPL1 PE=2 SV=1)

HSP 1 Score: 219.2 bits (557), Expect = 6.9e-56
Identity = 102/229 (44.54%), Positives = 146/229 (63.76%), Query Frame = 1

Query: 100 NPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFP 159
           +P L ++FT+NDLK+G KL IYF K D  K PP L +Q+AD IPF+  +L  +L +FS  
Sbjct: 52  DPSLYMYFTLNDLKLGTKLLIYFYKNDLQKLPPLLTRQQADLIPFTKSKLDFLLDHFSIT 111

Query: 160 SDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKT 219
            DS Q +A+KETL  C+ K I+GE KFC TS+ES++D V+ ++        +++  +V  
Sbjct: 112 KDSPQGKAIKETLGHCDAKAIEGEHKFCGTSLESLIDLVKKTMGYNVDLKVMTTKVMVPA 171

Query: 220 SHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGEN 279
            +     +H  NYT  + P E+   K++ CH MPYPYA+YYCH  +G + V ++NL  ++
Sbjct: 172 QNSISYALH--NYTFVEAPKELVGIKMLGCHRMPYPYAVYYCHGHKGGSRVFEVNLVTDD 231

Query: 280 G-DRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVWI 328
           G  RV   A+CHMDTS W   H +F+VLK++P   P+CHFFP D+ VW+
Sbjct: 232 GRQRVVGPAVCHMDTSTWDADHVAFKVLKMEPRSAPVCHFFPLDNIVWV 278

BLAST of ClCG02G005150 vs. Swiss-Prot
Match: BNM2A_BRANA (BURP domain-containing protein BNM2A OS=Brassica napus GN=BNM2A PE=2 SV=1)

HSP 1 Score: 218.0 bits (554), Expect = 1.5e-55
Identity = 105/249 (42.17%), Positives = 151/249 (60.64%), Query Frame = 1

Query: 84  NNNGHSSSSSSHM----NHINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKA 143
           +NN     + SH+       +P + +FF I+DLK+G KLPIYF K D  K PP L +Q+A
Sbjct: 34  SNNEQEGQNISHLFKDGEFEDPTMYMFFKISDLKLGTKLPIYFNKNDLRKVPPLLTRQEA 93

Query: 144 DSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVR 203
           D IPFS   L  +L++FS   DS Q +A+KETL++C+ K I+GE KFC TS+ESMLD  +
Sbjct: 94  DLIPFSESNLDFLLNHFSISKDSPQGKAMKETLKRCDFKAIEGEYKFCGTSLESMLDLAK 153

Query: 204 TSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIY 263
            ++        +++  +V   +     +H  NYT  + P E+   K++ CH MPYPY +Y
Sbjct: 154 KTIASNADLKVMTTKVMVPDQNRISYALH--NYTFAEVPKELDGIKVLGCHRMPYPYVVY 213

Query: 264 YCHYQEGDNNVLKINLEGENG-DRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHF 323
           YCH  +    V ++NL  ++G   V   A+CHMDTS W+  H +F+VLK++P   P+CHF
Sbjct: 214 YCHGHKSGTKVFEVNLMSDDGIQLVVGPAVCHMDTSMWNADHVAFKVLKIEPRSAPVCHF 273

Query: 324 FPADDFVWI 328
           FP D+ VW+
Sbjct: 274 FPLDNIVWV 280

BLAST of ClCG02G005150 vs. Swiss-Prot
Match: BURPH_ORYSJ (BURP domain-containing protein 17 OS=Oryza sativa subsp. japonica GN=BURP17 PE=2 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 4.6e-44
Identity = 93/229 (40.61%), Positives = 131/229 (57.21%), Query Frame = 1

Query: 100 NPLLIVFFTINDLKVGKKL--PIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFS 159
           +P + +FF   +L+ GKK+   ++F     + +  FLP+ KADSIPFS K+LP+IL  F 
Sbjct: 357 DPDMALFFLEKNLQQGKKINNALHFANLLATTNSKFLPRGKADSIPFSSKELPEILDRFG 416

Query: 160 FPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLV 219
               S  A  +  TLQ CEL   KGE K CATS+ES++DFV +S      +S + +   V
Sbjct: 417 VRPGSDDAAEMSATLQDCELPANKGEKKACATSLESIVDFVTSSF----GASDVDAASTV 476

Query: 220 KTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEG 279
             S   +S+   Q+YT+          +L+ACH   YPYA++ CH  E      K +L G
Sbjct: 477 VLSKAVESSSLAQDYTVSGVRRMAGTGQLIACHPESYPYAVFMCHLTEATTRAYKASLVG 536

Query: 280 ENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVW 327
           ++G  VEA+A+CH DTS W+P H +F VL ++PG +P+CHF   D  VW
Sbjct: 537 KDGTAVEAVAVCHTDTSDWNPEHAAFHVLGVKPGTVPVCHFMQPDAVVW 581

BLAST of ClCG02G005150 vs. Swiss-Prot
Match: BURP3_ORYSJ (BURP domain-containing protein 3 OS=Oryza sativa subsp. japonica GN=BURP3 PE=2 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 6.3e-41
Identity = 89/232 (38.36%), Positives = 131/232 (56.47%), Query Frame = 1

Query: 98  HINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFS 157
           H +P + +FF   DL  GK + ++F      +   FLP+ +AD++PFS +++P+ILS FS
Sbjct: 205 HDDPNVALFFLEKDLHPGKTMAVHFTATTAGEK--FLPRSEADAMPFSSEKVPEILSRFS 264

Query: 158 FPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLV 217
               S +A  + +TL+ CE  P +GE K CATS+ESM+DF  +SL         +S    
Sbjct: 265 VKPGSVEAAEMAQTLRDCEAPPAQGERKACATSLESMVDFATSSLG--------TSHVRA 324

Query: 218 KTSHLTKSNVHFQNYTIFDTPMEIAA---PKLVACHTMPYPYAIYYCHYQEGDNNVLKIN 277
            ++ + K     Q YT+       A     +LVACH  PY YA++ CH          ++
Sbjct: 325 ASTVVGKEGSPEQEYTVTAVKRAAAGGDQDQLVACHAEPYAYAVFACHLTRA-TRAYAVS 384

Query: 278 LEGENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVW 327
           + G +G  VEA+A+CH DT+ W+P H +FQVLK++PG +P+CHF P D  VW
Sbjct: 385 MAGRDGTGVEAVAVCHADTAGWNPKHVAFQVLKVKPGTVPVCHFLPQDHVVW 425

BLAST of ClCG02G005150 vs. TrEMBL
Match: A0A097BU29_CITLA (BURP domain-containing protein 17 OS=Citrullus lanatus PE=2 SV=1)

HSP 1 Score: 675.2 bits (1741), Expect = 3.9e-191
Identity = 330/331 (99.70%), Positives = 330/331 (99.70%), Query Frame = 1

Query: 1   MMEGNKLASCFIFLILLSLLMFVEGRIKPSMGESDEKSKQIEMKDFIEATSNTKSPIIKM 60
           MMEGNKLASCFIFLILLSLLMFVEGRIKPSMGESDEKSKQIEMKDFIEATSNTKSPIIKM
Sbjct: 1   MMEGNKLASCFIFLILLSLLMFVEGRIKPSMGESDEKSKQIEMKDFIEATSNTKSPIIKM 60

Query: 61  VTQMGDNIQNTYTQDHDQLPKNNNNNGHSSSSSSHMNHINPLLIVFFTINDLKVGKKLPI 120
           VTQMGDNIQNTYTQDHDQLPKNNNNNGHSSSSSSHMNHINPLLIVFFTINDLKVGKKLPI
Sbjct: 61  VTQMGDNIQNTYTQDHDQLPKNNNNNGHSSSSSSHMNHINPLLIVFFTINDLKVGKKLPI 120

Query: 121 YFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPI 180
           YFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPI
Sbjct: 121 YFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPI 180

Query: 181 KGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPME 240
           KGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDT ME
Sbjct: 181 KGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTHME 240

Query: 241 IAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTH 300
           IAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTH
Sbjct: 241 IAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTH 300

Query: 301 PSFQVLKLQPGDMPICHFFPADDFVWIPIAA 332
           PSFQVLKLQPGDMPICHFFPADDFVWIPIAA
Sbjct: 301 PSFQVLKLQPGDMPICHFFPADDFVWIPIAA 331

BLAST of ClCG02G005150 vs. TrEMBL
Match: A0A0A0K955_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G259360 PE=4 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 1.7e-125
Identity = 225/252 (89.29%), Positives = 233/252 (92.46%), Query Frame = 1

Query: 79  LPKNNNNNGHSSSSSSHMNH-INPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQ 138
           +  NN +N HSSSSSSHMNH INPLL+VFFTINDLKVGKKL IYFPKRDPSKSPPFLPK+
Sbjct: 1   MKNNNKHNDHSSSSSSHMNHNINPLLMVFFTINDLKVGKKLSIYFPKRDPSKSPPFLPKE 60

Query: 139 KADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDF 198
           KAD I FS KQLPQILS F FPS+S QAQAVKETLQQCELKPIK ETKFCATSMESMLDF
Sbjct: 61  KADQISFSFKQLPQILSSFHFPSNSPQAQAVKETLQQCELKPIKVETKFCATSMESMLDF 120

Query: 199 VRTSLIIPTKSSSLSS-FKLVKTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPY 258
           VRTSLIIPTKSS LSS FKL+KTSHLTKSNVH QNYTIFDTP  I+APKLVACHTMPYPY
Sbjct: 121 VRTSLIIPTKSSPLSSSFKLLKTSHLTKSNVHLQNYTIFDTPELISAPKLVACHTMPYPY 180

Query: 259 AIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPIC 318
           AIYYCHYQEGDNNVLKI LEGENGDRV+ALAICHMDTSQWSPTHPSFQVLKLQPGDMPIC
Sbjct: 181 AIYYCHYQEGDNNVLKIALEGENGDRVDALAICHMDTSQWSPTHPSFQVLKLQPGDMPIC 240

Query: 319 HFFPADDFVWIP 329
           HFFPADDFVWIP
Sbjct: 241 HFFPADDFVWIP 252

BLAST of ClCG02G005150 vs. TrEMBL
Match: A0A061DZB0_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_006908 PE=4 SV=1)

HSP 1 Score: 325.9 bits (834), Expect = 5.8e-86
Identity = 171/352 (48.58%), Positives = 232/352 (65.91%), Query Frame = 1

Query: 6   KLASCFIFLILLSLLMFVEGRIKPSMGESDEK--------SKQIEMKDFIEATS-----N 65
           K  SC  FLILL L+M         M +   K        SKQ++++   +  +     N
Sbjct: 2   KFVSCSSFLILL-LVMHAHASGAREMADDHSKDLINHDHGSKQVKLQGLSDVVNGNEIRN 61

Query: 66  TKSPIIKMVTQ----MGDNIQNTYTQDHDQLPKNNNNN----GHSSSS-------SSHMN 125
               + ++V       G+N+    + D+D+  +++++     GH+          SSHMN
Sbjct: 62  EVQGVERLVDDEGIPKGNNVLRLPSMDYDRHGEDDSSTVQTRGHAHQHAMFDNHVSSHMN 121

Query: 126 HINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFS 185
           H++  L+VFF +NDLKVGK +PIYFPK DPS SP  LP+++ADSIPFSLK+LP +L +FS
Sbjct: 122 HMDRSLMVFFILNDLKVGKSMPIYFPKNDPSTSPHLLPREEADSIPFSLKELPYLLRFFS 181

Query: 186 FPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLV 245
           F  DS QA+A+++TL++CE K IKGETKFCATS+ESMLDF R+   +       S FK++
Sbjct: 182 FLQDSPQAKAMEDTLRECETKAIKGETKFCATSLESMLDFARSIFGLN------SHFKIL 241

Query: 246 KTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEG 305
            T+HLTKS+  FQNYTI  TP E +APK+VACHTMPYPYA+ YCH QE  N V K++L G
Sbjct: 242 TTAHLTKSSTLFQNYTILATPQETSAPKMVACHTMPYPYAVLYCHSQETQNKVFKVSLGG 301

Query: 306 ENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVWIPI 330
           +NGDR EA+A+CHMDTSQW+  H SF+VL ++PG   +CHFFPAD+FV IP+
Sbjct: 302 DNGDRAEAVAVCHMDTSQWTRNHVSFRVLGIEPGTPGVCHFFPADNFVLIPV 346

BLAST of ClCG02G005150 vs. TrEMBL
Match: A0A061E082_THECC (Uncharacterized protein isoform 3 OS=Theobroma cacao GN=TCM_006908 PE=4 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 1.9e-84
Identity = 160/315 (50.79%), Positives = 219/315 (69.52%), Query Frame = 1

Query: 35  DEKSKQIEMKDFIEATS-----NTKSPIIKMVTQ----MGDNIQNTYTQDHDQLPKNNNN 94
           D  SKQ++++   +  +     N    + ++V       G+N+    + D+D+  +++++
Sbjct: 30  DHGSKQVKLQGLSDVVNGNEIRNEVQGVERLVDDEGIPKGNNVLRLPSMDYDRHGEDDSS 89

Query: 95  N----GHSSSS-------SSHMNHINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFL 154
                GH+          SSHMNH++  L+VFF +NDLKVGK +PIYFPK DPS SP  L
Sbjct: 90  TVQTRGHAHQHAMFDNHVSSHMNHMDRSLMVFFILNDLKVGKSMPIYFPKNDPSTSPHLL 149

Query: 155 PKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESM 214
           P+++ADSIPFSLK+LP +L +FSF  DS QA+A+++TL++CE K IKGETKFCATS+ESM
Sbjct: 150 PREEADSIPFSLKELPYLLRFFSFLQDSPQAKAMEDTLRECETKAIKGETKFCATSLESM 209

Query: 215 LDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPY 274
           LDF R+   +       S FK++ T+HLTKS+  FQNYTI  TP E +APK+VACHTMPY
Sbjct: 210 LDFARSIFGLN------SHFKILTTAHLTKSSTLFQNYTILATPQETSAPKMVACHTMPY 269

Query: 275 PYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMP 330
           PYA+ YCH QE  N V K++L G+NGDR EA+A+CHMDTSQW+  H SF+VL ++PG   
Sbjct: 270 PYAVLYCHSQETQNKVFKVSLGGDNGDRAEAVAVCHMDTSQWTRNHVSFRVLGIEPGTPG 329

BLAST of ClCG02G005150 vs. TrEMBL
Match: A0A061E141_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_006908 PE=4 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 1.9e-84
Identity = 160/315 (50.79%), Positives = 219/315 (69.52%), Query Frame = 1

Query: 35  DEKSKQIEMKDFIEATS-----NTKSPIIKMVTQ----MGDNIQNTYTQDHDQLPKNNNN 94
           D  SKQ++++   +  +     N    + ++V       G+N+    + D+D+  +++++
Sbjct: 32  DHGSKQVKLQGLSDVVNGNEIRNEVQGVERLVDDEGIPKGNNVLRLPSMDYDRHGEDDSS 91

Query: 95  N----GHSSSS-------SSHMNHINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFL 154
                GH+          SSHMNH++  L+VFF +NDLKVGK +PIYFPK DPS SP  L
Sbjct: 92  TVQTRGHAHQHAMFDNHVSSHMNHMDRSLMVFFILNDLKVGKSMPIYFPKNDPSTSPHLL 151

Query: 155 PKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESM 214
           P+++ADSIPFSLK+LP +L +FSF  DS QA+A+++TL++CE K IKGETKFCATS+ESM
Sbjct: 152 PREEADSIPFSLKELPYLLRFFSFLQDSPQAKAMEDTLRECETKAIKGETKFCATSLESM 211

Query: 215 LDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPY 274
           LDF R+   +       S FK++ T+HLTKS+  FQNYTI  TP E +APK+VACHTMPY
Sbjct: 212 LDFARSIFGLN------SHFKILTTAHLTKSSTLFQNYTILATPQETSAPKMVACHTMPY 271

Query: 275 PYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMP 330
           PYA+ YCH QE  N V K++L G+NGDR EA+A+CHMDTSQW+  H SF+VL ++PG   
Sbjct: 272 PYAVLYCHSQETQNKVFKVSLGGDNGDRAEAVAVCHMDTSQWTRNHVSFRVLGIEPGTPG 331

BLAST of ClCG02G005150 vs. TAIR10
Match: AT1G49320.1 (AT1G49320.1 unknown seed protein like 1)

HSP 1 Score: 219.2 bits (557), Expect = 3.9e-57
Identity = 102/229 (44.54%), Positives = 146/229 (63.76%), Query Frame = 1

Query: 100 NPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFP 159
           +P L ++FT+NDLK+G KL IYF K D  K PP L +Q+AD IPF+  +L  +L +FS  
Sbjct: 52  DPSLYMYFTLNDLKLGTKLLIYFYKNDLQKLPPLLTRQQADLIPFTKSKLDFLLDHFSIT 111

Query: 160 SDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKT 219
            DS Q +A+KETL  C+ K I+GE KFC TS+ES++D V+ ++        +++  +V  
Sbjct: 112 KDSPQGKAIKETLGHCDAKAIEGEHKFCGTSLESLIDLVKKTMGYNVDLKVMTTKVMVPA 171

Query: 220 SHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGEN 279
            +     +H  NYT  + P E+   K++ CH MPYPYA+YYCH  +G + V ++NL  ++
Sbjct: 172 QNSISYALH--NYTFVEAPKELVGIKMLGCHRMPYPYAVYYCHGHKGGSRVFEVNLVTDD 231

Query: 280 G-DRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVWI 328
           G  RV   A+CHMDTS W   H +F+VLK++P   P+CHFFP D+ VW+
Sbjct: 232 GRQRVVGPAVCHMDTSTWDADHVAFKVLKMEPRSAPVCHFFPLDNIVWV 278

BLAST of ClCG02G005150 vs. TAIR10
Match: AT5G25610.1 (AT5G25610.1 BURP domain-containing protein)

HSP 1 Score: 166.0 bits (419), Expect = 3.9e-41
Identity = 85/232 (36.64%), Positives = 132/232 (56.90%), Query Frame = 1

Query: 98  HINPLLIVFFTINDLKVGKKLPIYFPKRDP-SKSPPFLPKQKADSIPFSLKQLPQILSYF 157
           H +P   +FF   DL  GK++ + F   D       FLP+ +A+++PF  ++  + L  F
Sbjct: 168 HDDPNAALFFLEKDLVRGKEMNVRFNAEDGYGGKTAFLPRGEAETVPFGSEKFSETLKRF 227

Query: 158 SFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKL 217
           S  + S +A+ +K+T+++CE + + GE K+CATS+ESM+DF           S L  + +
Sbjct: 228 SVEAGSEEAEMMKKTIEECEARKVSGEEKYCATSLESMVDF---------SVSKLGKYHV 287

Query: 218 VKTS-HLTKSNVHFQNYTIFDTPME-IAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKIN 277
              S  + K N   Q Y I    ++ ++  K V CH   YP+A++YCH +     V  + 
Sbjct: 288 RAVSTEVAKKNAPMQKYKIAAAGVKKLSDDKSVVCHKQKYPFAVFYCH-KAMMTTVYAVP 347

Query: 278 LEGENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVW 327
           LEGENG R +A+A+CH +TS W+P H +F+VLK++PG +P+CHF P    VW
Sbjct: 348 LEGENGMRAKAVAVCHKNTSAWNPNHLAFKVLKVKPGTVPVCHFLPETHVVW 389

BLAST of ClCG02G005150 vs. TAIR10
Match: AT1G70370.1 (AT1G70370.1 polygalacturonase 2)

HSP 1 Score: 104.4 bits (259), Expect = 1.4e-22
Identity = 68/229 (29.69%), Positives = 104/229 (45.41%), Query Frame = 1

Query: 106 FFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQA 165
           FF  + LK G  +P+   K D      FLP+     +PFS  +L +I   F    +S   
Sbjct: 411 FFRESSLKEGTVIPMPDIK-DKMPKRSFLPRSIITKLPFSTSKLGEIKRIFHAVENSTMG 470

Query: 166 QAVKETLQQCELKPIKGETKFCATSMESMLDFVRT----SLIIPTKSSSLSSFKLVKTSH 225
             + + + +CE  P  GETK C  S E M+DF  +    S+++ T  +   S + V    
Sbjct: 471 GIITDAVTECERPPSVGETKRCVGSAEDMIDFATSVLGRSVVLRTTENVAGSKEKVVIGK 530

Query: 226 LTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIYYCH----YQEGDNNVLKINLEG 285
           +   N                  K V+CH   YPY +YYCH     +  + ++L++N + 
Sbjct: 531 VNGINGG-------------KLTKAVSCHQSLYPYLLYYCHSVPKVRVYEADLLELNSKK 590

Query: 286 ENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVW 327
           +       +AICHMDTS W P+H +F  L  +PG + +CH+   +D  W
Sbjct: 591 KIN---HGIAICHMDTSSWGPSHGAFLALGSKPGRIEVCHWIFENDMNW 622

BLAST of ClCG02G005150 vs. TAIR10
Match: AT1G23760.1 (AT1G23760.1 BURP domain-containing protein)

HSP 1 Score: 98.2 bits (243), Expect = 1.0e-20
Identity = 65/227 (28.63%), Positives = 99/227 (43.61%), Query Frame = 1

Query: 106 FFTINDLKVGKKLPIYFPK-RDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQ 165
           FF  + LK G    I+ P  +D      FLP+     +PFS  ++ +I   F    +S  
Sbjct: 407 FFRESMLKEGTL--IWMPDIKDKMPKRSFLPRSIVSKLPFSTSKIAEIKRVFHANDNSTM 466

Query: 166 AQAVKETLQQCELKPIKGETKFCATSMESMLDFVRT----SLIIPTKSSSLSSFKLVKTS 225
              + + +++CE  P   ETK C  S E M+DF  +    S+++ T  S   S + V   
Sbjct: 467 EGIITDAVRECERPPTVSETKRCVGSAEDMIDFATSVLGRSVVLRTTESVAGSKEKVMIG 526

Query: 226 HLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENG 285
            +   N                  K V+CH   YPY +YYCH            L+ ++ 
Sbjct: 527 KVNGINGG-------------RVTKSVSCHQSLYPYLLYYCHSVPKVRVYESDLLDPKSK 586

Query: 286 DRVE-ALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVW 327
            ++   +AICHMDTS W   H +F +L  +PG + +CH+   +D  W
Sbjct: 587 AKINHGIAICHMDTSAWGANHGAFMLLGSRPGQIEVCHWIFENDMNW 618

BLAST of ClCG02G005150 vs. TAIR10
Match: AT1G60390.1 (AT1G60390.1 polygalacturonase 1)

HSP 1 Score: 97.8 bits (242), Expect = 1.3e-20
Identity = 59/207 (28.50%), Positives = 91/207 (43.96%), Query Frame = 1

Query: 125 RDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGET 184
           +D      FLP+    ++PFS   + +I   F    +S  A  +   + +CE     GET
Sbjct: 427 KDKMPKRTFLPRNIVKNLPFSSSTIGEIWRVFGAGENSSMAGIISSAVSECERPASHGET 486

Query: 185 KFCATSMESMLDFVRTSL----IIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPME 244
           K C  S E M+DF  + L    ++ T  + + S K V    +   N              
Sbjct: 487 KRCVGSAEDMIDFATSVLGRGVVVRTTENVVGSKKKVVIGKVNGINGG------------ 546

Query: 245 IAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVE-ALAICHMDTSQWSPT 304
               + V+CH   YPY +YYCH            L+ ++ +++   +AICH+DTS WSP+
Sbjct: 547 -DVTRAVSCHQSLYPYLLYYCHSVPRVRVYETDLLDPKSLEKINHGVAICHIDTSAWSPS 606

Query: 305 HPSFQVLKLQPGDMPICHFFPADDFVW 327
           H +F  L   PG + +CH+   +D  W
Sbjct: 607 HGAFLALGSGPGQIEVCHWIFENDMTW 620

BLAST of ClCG02G005150 vs. NCBI nr
Match: gi|694188375|gb|AIS71928.1| (BURP domain-containing protein 17 [Citrullus lanatus])

HSP 1 Score: 675.2 bits (1741), Expect = 5.6e-191
Identity = 330/331 (99.70%), Positives = 330/331 (99.70%), Query Frame = 1

Query: 1   MMEGNKLASCFIFLILLSLLMFVEGRIKPSMGESDEKSKQIEMKDFIEATSNTKSPIIKM 60
           MMEGNKLASCFIFLILLSLLMFVEGRIKPSMGESDEKSKQIEMKDFIEATSNTKSPIIKM
Sbjct: 1   MMEGNKLASCFIFLILLSLLMFVEGRIKPSMGESDEKSKQIEMKDFIEATSNTKSPIIKM 60

Query: 61  VTQMGDNIQNTYTQDHDQLPKNNNNNGHSSSSSSHMNHINPLLIVFFTINDLKVGKKLPI 120
           VTQMGDNIQNTYTQDHDQLPKNNNNNGHSSSSSSHMNHINPLLIVFFTINDLKVGKKLPI
Sbjct: 61  VTQMGDNIQNTYTQDHDQLPKNNNNNGHSSSSSSHMNHINPLLIVFFTINDLKVGKKLPI 120

Query: 121 YFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPI 180
           YFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPI
Sbjct: 121 YFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPI 180

Query: 181 KGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPME 240
           KGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDT ME
Sbjct: 181 KGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTHME 240

Query: 241 IAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTH 300
           IAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTH
Sbjct: 241 IAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTH 300

Query: 301 PSFQVLKLQPGDMPICHFFPADDFVWIPIAA 332
           PSFQVLKLQPGDMPICHFFPADDFVWIPIAA
Sbjct: 301 PSFQVLKLQPGDMPICHFFPADDFVWIPIAA 331

BLAST of ClCG02G005150 vs. NCBI nr
Match: gi|659092391|ref|XP_008447041.1| (PREDICTED: BURP domain-containing protein 5-like [Cucumis melo])

HSP 1 Score: 545.4 bits (1404), Expect = 6.7e-152
Identity = 279/332 (84.04%), Positives = 294/332 (88.55%), Query Frame = 1

Query: 2   MEGNKLASCFIFLILLSLLMFVEGRIKPSMGESDEKSKQIEMKDFIEATSNT-KSPIIKM 61
           MEGN + SCFI L+LL L MF EGRI+ SM E  EK KQIEM DFIEA S+T ++PIIKM
Sbjct: 1   MEGNTV-SCFILLVLLFLHMFAEGRIESSMSEIVEKRKQIEMNDFIEAASSTVQNPIIKM 60

Query: 62  VTQMGDNIQNTYTQDHDQLP--KNNNNNGHSSSSSSHMNH-INPLLIVFFTINDLKVGKK 121
           VT +GDNIQ   T   DQ P   NNN+N HSSSS+SHMNH INPLLIVFFTINDLKVGKK
Sbjct: 61  VTSIGDNIQ---TSQDDQFPMKNNNNHNDHSSSSTSHMNHNINPLLIVFFTINDLKVGKK 120

Query: 122 LPIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCEL 181
           LPIYFPKRDPSKSPPFLPK+KAD IPFS KQLPQILS FSFPS+S QAQAVKETLQQCEL
Sbjct: 121 LPIYFPKRDPSKSPPFLPKEKADQIPFSFKQLPQILSSFSFPSNSPQAQAVKETLQQCEL 180

Query: 182 KPIKGETKFCATSMESMLDFVRTSLIIPTKSSSL-SSFKLVKTSHLTKSNVHFQNYTIFD 241
           KPIKGETKFCATSMESMLDFV+TSLIIP KSS L SSFKL+KTSHLTKSNVH QNYTIFD
Sbjct: 181 KPIKGETKFCATSMESMLDFVKTSLIIPKKSSPLSSSFKLLKTSHLTKSNVHLQNYTIFD 240

Query: 242 TPMEIAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQW 301
           TP  I+APKLVACHTMPYPYAIYYCHYQEGDNNVLKI LEGENGDRV+ALAICHMDTSQW
Sbjct: 241 TPKLISAPKLVACHTMPYPYAIYYCHYQEGDNNVLKIALEGENGDRVDALAICHMDTSQW 300

Query: 302 SPTHPSFQVLKLQPGDMPICHFFPADDFVWIP 329
           SPTHPSFQVLKLQPGDMPICHFFPADDFVWIP
Sbjct: 301 SPTHPSFQVLKLQPGDMPICHFFPADDFVWIP 328

BLAST of ClCG02G005150 vs. NCBI nr
Match: gi|700189103|gb|KGN44336.1| (hypothetical protein Csa_7G259360 [Cucumis sativus])

HSP 1 Score: 457.2 bits (1175), Expect = 2.4e-125
Identity = 225/252 (89.29%), Positives = 233/252 (92.46%), Query Frame = 1

Query: 79  LPKNNNNNGHSSSSSSHMNH-INPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQ 138
           +  NN +N HSSSSSSHMNH INPLL+VFFTINDLKVGKKL IYFPKRDPSKSPPFLPK+
Sbjct: 1   MKNNNKHNDHSSSSSSHMNHNINPLLMVFFTINDLKVGKKLSIYFPKRDPSKSPPFLPKE 60

Query: 139 KADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDF 198
           KAD I FS KQLPQILS F FPS+S QAQAVKETLQQCELKPIK ETKFCATSMESMLDF
Sbjct: 61  KADQISFSFKQLPQILSSFHFPSNSPQAQAVKETLQQCELKPIKVETKFCATSMESMLDF 120

Query: 199 VRTSLIIPTKSSSLSS-FKLVKTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPY 258
           VRTSLIIPTKSS LSS FKL+KTSHLTKSNVH QNYTIFDTP  I+APKLVACHTMPYPY
Sbjct: 121 VRTSLIIPTKSSPLSSSFKLLKTSHLTKSNVHLQNYTIFDTPELISAPKLVACHTMPYPY 180

Query: 259 AIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPIC 318
           AIYYCHYQEGDNNVLKI LEGENGDRV+ALAICHMDTSQWSPTHPSFQVLKLQPGDMPIC
Sbjct: 181 AIYYCHYQEGDNNVLKIALEGENGDRVDALAICHMDTSQWSPTHPSFQVLKLQPGDMPIC 240

Query: 319 HFFPADDFVWIP 329
           HFFPADDFVWIP
Sbjct: 241 HFFPADDFVWIP 252

BLAST of ClCG02G005150 vs. NCBI nr
Match: gi|590685896|ref|XP_007042224.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 325.9 bits (834), Expect = 8.4e-86
Identity = 171/352 (48.58%), Positives = 232/352 (65.91%), Query Frame = 1

Query: 6   KLASCFIFLILLSLLMFVEGRIKPSMGESDEK--------SKQIEMKDFIEATS-----N 65
           K  SC  FLILL L+M         M +   K        SKQ++++   +  +     N
Sbjct: 2   KFVSCSSFLILL-LVMHAHASGAREMADDHSKDLINHDHGSKQVKLQGLSDVVNGNEIRN 61

Query: 66  TKSPIIKMVTQ----MGDNIQNTYTQDHDQLPKNNNNN----GHSSSS-------SSHMN 125
               + ++V       G+N+    + D+D+  +++++     GH+          SSHMN
Sbjct: 62  EVQGVERLVDDEGIPKGNNVLRLPSMDYDRHGEDDSSTVQTRGHAHQHAMFDNHVSSHMN 121

Query: 126 HINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFLPKQKADSIPFSLKQLPQILSYFS 185
           H++  L+VFF +NDLKVGK +PIYFPK DPS SP  LP+++ADSIPFSLK+LP +L +FS
Sbjct: 122 HMDRSLMVFFILNDLKVGKSMPIYFPKNDPSTSPHLLPREEADSIPFSLKELPYLLRFFS 181

Query: 186 FPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESMLDFVRTSLIIPTKSSSLSSFKLV 245
           F  DS QA+A+++TL++CE K IKGETKFCATS+ESMLDF R+   +       S FK++
Sbjct: 182 FLQDSPQAKAMEDTLRECETKAIKGETKFCATSLESMLDFARSIFGLN------SHFKIL 241

Query: 246 KTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPYPYAIYYCHYQEGDNNVLKINLEG 305
            T+HLTKS+  FQNYTI  TP E +APK+VACHTMPYPYA+ YCH QE  N V K++L G
Sbjct: 242 TTAHLTKSSTLFQNYTILATPQETSAPKMVACHTMPYPYAVLYCHSQETQNKVFKVSLGG 301

Query: 306 ENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMPICHFFPADDFVWIPI 330
           +NGDR EA+A+CHMDTSQW+  H SF+VL ++PG   +CHFFPAD+FV IP+
Sbjct: 302 DNGDRAEAVAVCHMDTSQWTRNHVSFRVLGIEPGTPGVCHFFPADNFVLIPV 346

BLAST of ClCG02G005150 vs. NCBI nr
Match: gi|590685904|ref|XP_007042226.1| (Uncharacterized protein isoform 3 [Theobroma cacao])

HSP 1 Score: 320.9 bits (821), Expect = 2.7e-84
Identity = 160/315 (50.79%), Positives = 219/315 (69.52%), Query Frame = 1

Query: 35  DEKSKQIEMKDFIEATS-----NTKSPIIKMVTQ----MGDNIQNTYTQDHDQLPKNNNN 94
           D  SKQ++++   +  +     N    + ++V       G+N+    + D+D+  +++++
Sbjct: 30  DHGSKQVKLQGLSDVVNGNEIRNEVQGVERLVDDEGIPKGNNVLRLPSMDYDRHGEDDSS 89

Query: 95  N----GHSSSS-------SSHMNHINPLLIVFFTINDLKVGKKLPIYFPKRDPSKSPPFL 154
                GH+          SSHMNH++  L+VFF +NDLKVGK +PIYFPK DPS SP  L
Sbjct: 90  TVQTRGHAHQHAMFDNHVSSHMNHMDRSLMVFFILNDLKVGKSMPIYFPKNDPSTSPHLL 149

Query: 155 PKQKADSIPFSLKQLPQILSYFSFPSDSHQAQAVKETLQQCELKPIKGETKFCATSMESM 214
           P+++ADSIPFSLK+LP +L +FSF  DS QA+A+++TL++CE K IKGETKFCATS+ESM
Sbjct: 150 PREEADSIPFSLKELPYLLRFFSFLQDSPQAKAMEDTLRECETKAIKGETKFCATSLESM 209

Query: 215 LDFVRTSLIIPTKSSSLSSFKLVKTSHLTKSNVHFQNYTIFDTPMEIAAPKLVACHTMPY 274
           LDF R+   +       S FK++ T+HLTKS+  FQNYTI  TP E +APK+VACHTMPY
Sbjct: 210 LDFARSIFGLN------SHFKILTTAHLTKSSTLFQNYTILATPQETSAPKMVACHTMPY 269

Query: 275 PYAIYYCHYQEGDNNVLKINLEGENGDRVEALAICHMDTSQWSPTHPSFQVLKLQPGDMP 330
           PYA+ YCH QE  N V K++L G+NGDR EA+A+CHMDTSQW+  H SF+VL ++PG   
Sbjct: 270 PYAVLYCHSQETQNKVFKVSLGGDNGDRAEAVAVCHMDTSQWTRNHVSFRVLGIEPGTPG 329
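
The percentages in each HSP header above are the identical (or positive-scoring) positions divided by the alignment length. A minimal sketch for extracting and recomputing them from one header line follows; the regular expression and the helper name `parse_hsp_stats` are assumptions made for illustration, and the pattern deliberately accepts a minor spelling variation in the second label.

```python
import re

# Matches e.g. "Identity = 106/249 (42.57%), Positives = 151/249 (60.64%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return alignment length and recomputed identity/positives percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, length = int(match.group(1)), int(match.group(2))
    positives = int(match.group(4))
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 106/249 (42.57%), Positives = 151/249 (60.64%), Query Frame = 1"
)
print(stats)
```

For the first Swiss-Prot hit this recomputes 42.57% identity and 60.64% positives over 249 aligned columns, in agreement with the reported values.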

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
BNM2C_BRANA4.1e-5642.57BURP domain-containing protein BNM2C OS=Brassica napus GN=BNM2C PE=2 SV=1[more]
USPL1_ARATH6.9e-5644.54BURP domain protein USPL1 OS=Arabidopsis thaliana GN=USPL1 PE=2 SV=1[more]
BNM2A_BRANA1.5e-5542.17BURP domain-containing protein BNM2A OS=Brassica napus GN=BNM2A PE=2 SV=1[more]
BURPH_ORYSJ4.6e-4440.61BURP domain-containing protein 17 OS=Oryza sativa subsp. japonica GN=BURP17 PE=2... [more]
BURP3_ORYSJ6.3e-4138.36BURP domain-containing protein 3 OS=Oryza sativa subsp. japonica GN=BURP3 PE=2 S... [more]
Match NameE-valueIdentityDescription
A0A097BU29_CITLA3.9e-19199.70BURP domain-containing protein 17 OS=Citrullus lanatus PE=2 SV=1[more]
A0A0A0K955_CUCSA1.7e-12589.29Uncharacterized protein OS=Cucumis sativus GN=Csa_7G259360 PE=4 SV=1[more]
A0A061DZB0_THECC5.8e-8648.58Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_006908 PE=4 SV=1[more]
A0A061E082_THECC1.9e-8450.79Uncharacterized protein isoform 3 OS=Theobroma cacao GN=TCM_006908 PE=4 SV=1[more]
A0A061E141_THECC1.9e-8450.79Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_006908 PE=4 SV=1[more]
Match Name	E-value	Identity	Description
AT1G49320.1	3.9e-57	44.54	unknown seed protein like 1
AT5G25610.1	3.9e-41	36.64	BURP domain-containing protein
AT1G70370.1	1.4e-22	29.69	polygalacturonase 2
AT1G23760.1	1.0e-20	28.63	BURP domain-containing protein
AT1G60390.1	1.3e-20	28.50	polygalacturonase 1
Match Name	E-value	Identity	Description
gi|694188375|gb|AIS71928.1|	5.6e-191	99.70	BURP domain-containing protein 17 [Citrullus lanatus]
gi|659092391|ref|XP_008447041.1|	6.7e-152	84.04	PREDICTED: BURP domain-containing protein 5-like [Cucumis melo]
gi|700189103|gb|KGN44336.1|	2.4e-125	89.29	hypothetical protein Csa_7G259360 [Cucumis sativus]
gi|590685896|ref|XP_007042224.1|	8.4e-86	48.58	Uncharacterized protein isoform 1 [Theobroma cacao]
gi|590685904|ref|XP_007042226.1|	2.7e-84	50.79	Uncharacterized protein isoform 3 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR004873	BURP_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0048316 seed development
biological_process GO:0009651 response to salt stress
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
cellular_component GO:0000326 protein storage vacuole
cellular_component GO:0005768 endosome
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005802 trans-Golgi network
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
ClCG02G005150.1	ClCG02G005150.1	mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR004873	BURP domain	PFAM	PF03181	BURP	coord: 105..327; score: 3.9
IPR004873	BURP domain	SMART	SM01045	BURP_2	coord: 104..329; score: 3.8
IPR004873	BURP domain	PROFILE	PS51277	BURP	coord: 106..329; score: 72
None	No IPR available	PANTHER	PTHR31236	FAMILY NOT NAMED	coord: 36..328; score: 4.5E
None	No IPR available	PANTHER	PTHR31236:SF4	SUBFAMILY NOT NAMED	coord: 36..328; score: 4.5E
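The `coord` values above give the start and end of each signature match on the protein. As a rough sketch, the Python below converts each range into a length in residues, which makes it easy to see that the three BURP-domain signatures cover nearly the same region. It assumes the coordinates are 1-based and inclusive (the usual InterProScan convention, not stated on this page); `span_length` and the `matches` dictionary are hypothetical names for illustration.

```python
def span_length(coord: str) -> int:
    """Length in residues of a 'start..end' range.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    start_text, end_text = coord.split("..")
    start, end = int(start_text), int(end_text)
    if end < start:
        raise ValueError(f"end precedes start in {coord!r}")
    return end - start + 1


# Signature matches copied from the InterPro table above.
matches = {
    "PFAM PF03181": "105..327",
    "SMART SM01045": "104..329",
    "PROFILE PS51277": "106..329",
    "PANTHER PTHR31236": "36..328",
}

for source, coord in matches.items():
    print(f"{source}: {coord} spans {span_length(coord)} residues")
```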

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
ClCG02G005150	Csa7G259360	Cucumber (Chinese Long) v2	cuwcgB548
ClCG02G005150	Csa2G245470	Cucumber (Chinese Long) v2	cuwcgB127
ClCG02G005150	MELO3C012542	Melon (DHL92) v3.5.1	mewcgB196
ClCG02G005150	Cla016551	Watermelon (97103) v1	wcgwmB217
ClCG02G005150	Cla97C02G032090	Watermelon (97103) v2	wcgwmbB138
ClCG02G005150	Cla97C11G211250	Watermelon (97103) v2	wcgwmbB137
ClCG02G005150	Bhi05G001077	Wax gourd	wcgwgoB275
ClCG02G005150	Bhi10G001630	Wax gourd	wcgwgoB286
ClCG02G005150	CSPI02G12100	Wild cucumber (PI 183967)	cpiwcgB131
ClCG02G005150	CSPI07G11490	Wild cucumber (PI 183967)	cpiwcgB577
ClCG02G005150	Cucsa.109700	Cucumber (Gy14) v1	cgywcgB156
ClCG02G005150	Cucsa.240440	Cucumber (Gy14) v1	cgywcgB429
ClCG02G005150	CmaCh19G006560	Cucurbita maxima (Rimu)	cmawcgB460
ClCG02G005150	CmaCh20G008570	Cucurbita maxima (Rimu)	cmawcgB485
ClCG02G005150	CmaCh02G004080	Cucurbita maxima (Rimu)	cmawcgB531
ClCG02G005150	CmaCh11G011070	Cucurbita maxima (Rimu)	cmawcgB098
ClCG02G005150	CmoCh02G004120	Cucurbita moschata (Rifu)	cmowcgB530
ClCG02G005150	CmoCh20G008680	Cucurbita moschata (Rifu)	cmowcgB485
ClCG02G005150	CmoCh11G011350	Cucurbita moschata (Rifu)	cmowcgB089
ClCG02G005150	Lsi10G003540	Bottle gourd (USVL1VR-Ls)	lsiwcgB057
ClCG02G005150	Lsi11G010880	Bottle gourd (USVL1VR-Ls)	lsiwcgB097
ClCG02G005150	Cp4.1LG04g00840	Cucurbita pepo (Zucchini)	cpewcgB591
ClCG02G005150	Cp4.1LG05g12490	Cucurbita pepo (Zucchini)	cpewcgB662
ClCG02G005150	Cp4.1LG16g02420	Cucurbita pepo (Zucchini)	cpewcgB267
ClCG02G005150	CsGy2G012080	Cucumber (Gy14) v2	cgybwcgB122
ClCG02G005150	CsGy7G010750	Cucumber (Gy14) v2	cgybwcgB508
ClCG02G005150	MELO3C012542.2	Melon (DHL92) v3.6.1	medwcgB191
ClCG02G005150	Carg22952	Silver-seed gourd	carwcgB0868
ClCG02G005150	Carg22023	Silver-seed gourd	carwcgB0203
ClCG02G005150	CsaV3_7G023440	Cucumber (Chinese Long) v3	cucwcgB573
ClCG02G005150	CsaV3_2G014600	Cucumber (Chinese Long) v3	cucwcgB140
The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
ClCG02G005150	ClCG11G004500	Watermelon (Charleston Gray)	wcgwcgB063
The following block(s) are covering this gene:
Gene	Organism	Block
ClCG02G005150	Cucurbita moschata (Rifu)	cmowcgB459
ClCG02G005150	Watermelon (97103) v1	wcgwmB198
ClCG02G005150	Cucurbita pepo (Zucchini)	cpewcgB233