Cp4.1LG00g00170 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG00g00170
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: UHRF1-binding protein 1-like
Location: Cp4.1LG00 : 262831 .. 268088 (+)
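
The location string above can be read as a sequence ID, a start, an end and a strand. The following is a minimal sketch of parsing it and deriving the span length. It assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG00 : 262831 .. 268088 (+)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG00 + 5258
```

Under that assumption the gene spans 5258 bases on the forward strand of Cp4.1LG00.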
The following sequences are available for this feature:

Gene sequence (with intron)

ATCCGCCCCATGGAGTCGATTCTGGCCCGAGCCCTCGAGTACACTCTCAAGTACTGGTTAAAATCTTTCTCACGGGACCAGTTCAAATTGCAGGGCCGGTCTGTGCAGCTCTCCAATTTGGGTCTGTGCTCGCTGATTCGTATGCTAATTTATTCACTTTTCACTGGTTACTTCTACCAATCGCGGTAATCGGGTTTCGTTCTTTTTTTCCTTTTCTTTTTGAATTTTTGTAGATATCAATGGCGATGCTTTGCATTCCAGTATGGGGTTGCCTCCGGCGCTAAATGTTACGACGGCGAGGGTTGGCAAGTTGGAGATTATGGTTGGTAGCGAATTGCTTAGAATTTCCCTTTCCTTCCCCTAGATGATTTGTGATTGCTTGGATCAATTGGAAATGTGCTTCTTCGCCCTTTACTTTTCTTGTTGAGCTTTGTAAAAGTATAAAGTTATGGTAGAATGATGCATATCAGTTATTTTTGGTTCCTGGAATTATAGCAGTGGAGAGGAAGATGAAATTTCGGCTGGTTTTTGCAGCATTTTCTATGCTTTTTGTACTCTGATTCTTTCCCTAATTGAGTTTTCTTCTTGATCATGTAGCTACCATCGCTGAGTAATGTACAAGTGGAACCAGTCGTTGTGCAAATAGATAAATTGGATTTAGTTTTAGAGGAGAATCCAGATGCAGATGTGGGTAGAAGCAATAGTAGGTAATTCACATGCCTCAATTCGTTTCTCTGCCCCTCCCCCTGCTCCTTGCTTCGTACCATTTTGCGAGCTTTATGTATCAACCTTGGAAGGATGCTTATCTGATCACAGTTAGTTAGAACTGAGCCTTTGCTTTTATAACTTGCAGTAGTCAGACTTCTTCCAGCACCGTGAAGGGTGGTGGTTATGGATTTGCTGATAAGGTAAAACACTTTTGCGAAATCGACCGGTTCACTAAACTCAAAAGTGCTAAGTTATCTTAAATCTATTTAACAGTTCTCTAACAATGAATGGTTTGACCGATCCATTCGCATTTCATGAAATTTGTGGGATGTGCATAATTTCATTCTAGATTTGGAAAATTTCACCCCATGATATGCAATCTCTTGAAAAGTGATTAGTATTGGGCCTTCGGTTTTTTGTTTATAAATCAGTGATGGACTGTTTCTTTTTCCTGTCAGCCGATTTCTTAAGTTTTTCTTCTTCCTCTGATTTTAGTTTTTCCTTTTTGAAGCAAGAAATGAAACGTCTCATTGAAGAAATAAAAAGAGACTAATACTCAAAAGATACAAACTCCATCAGGTAGTGAAACAAAAAGCAATAAAAGTATAACCTAAACTAGAAAGAGCAACCCAGGAAAAACAAAAACAAAGTATTCCAATTTAGAGAAATATCTTGTATAGAAAATCTGTCCAAAGACTTTGATAGAGAACTTCAAGGCGAGGCTTTCAAACGAGCTGATTGAAAACGATCAAGCCAAAAGAGATGCTTGTCTTGGAAGATCCTTTGATTCCTTTCCAACCACAATTCTGATAAAAGGCTTTTAATTGCTCCATTAAACATTTGCCTTTGGAATCAAAGAGGGACCAAAAAGAAAATGACGAACATTTGAAAATGTTAGAAAGAACCCACTGAAGATTAAATAAAGAAAATAACTCGCCAACAAGCTCGAGAATAGATGGCATTAAAAAAGACGTGATTAAGAGTGTTCCCATCCAATCAGCAAAGAGAACAAACCGATGGCATAAGAAAATTGAAAGGGAGTTTCGTTCGTAAAACTTCAGAAGTGTTAAGACTAGCATTGACCATGATCCACAAAAGAACATGAATTTTCTTTGGGCATCTGTATTACCATTTAGCTTCCTCCGATTTTAGTTTCTACAATTATATGGTATCTCCATATCTATTGGGGATGCAGCAACATGAAAGGATGTTAGATGTTGTAAGTTTTTAGTTTTGATGCCAATAATTCACGCCCTCACCCCCCTCTTTGGCACTTATATTTCATTTACATT
CCACATTCATATATATTTTTAGATTGTGGAGCTTTTCGTATGTAATAGATTGCAGATGGAATGACAGTAGAGGTTCGCACTGTCAATTTGCTACTTGAAACTGGTGGTGGATCGCGACATCAAGGAGGAGCAACTTGGTGAATCTTTTGCTTATTTCCTAATTTTTTTTATTCTTCATTCCCTGTTTCTCTTTCCTCCTCCCCCCACCCCCTATTTTGTTACTTGTACGATAGCTGTATGCATGTGAACTTTTGCATGTATGGTTTCCATTCAGCATATGCGAATACAGTGTGAAAGATGAGCACCCAGTCAATTAATTTGTTAGCTATAGTTCAATCCGTGACTTATGATTTCAGTGAGGAGAATATTGTATTTAAGCTTCTTAATTCTTATTTTATTTTCTCTGATACAAGAATGTATATATCATTATGGAAAAATATGTACAAGTAAAAAGAGGAGTTTTTTGCTTTCTTTTTTTATTTTTATTTATAAAGGATCATTGAAGTAGCAATAGTATTTAAACCATAATTTTTGTGTAACTCTTGCTACCATCTGTATGAATGAATCAAGACGGAAATATTCCTTAGAGATAGAGAGAGAAGAGAGAAAGAGAACTGGATACAGAAAACTGAACCCCATCCCACACTCAAGTTTAGTCCTAGCATTCTCAAAGAGAAATAAAAGGAAGTTAGTATACTTTCTTTCAAAATCTCCCAACATACAACTAAAATAACCTAGAAGGACCGGACAAATTTTTTGACCAAAGGCACAAAGAAAGACGTAACTTTTTAGGTTCGAGGGTTAGAATATGTAATACGATTTTTAAGGATCTGGCCTTTGGTAAACAACATCCGAGTTCCAGATTGCAGTTTAAAGAGCTGAGGTCAGTGACCTTACTAGAACAGTATCTTTTCTACTAGGTTGTACAACTATATCCTTGAGTTGTAAAGATTTCAGTTTAAAGAGATGAGGTTATGGTCTGCATGTGACTATTGGCAGCGGATGGAAGATTGGGAGACTGTCAGCTTATCTATCCACACGTCTTTCTGTATCTAATCCTCTTACATGGCCTCACCTATAAGAAAGATAATCGGCAAAATGATGACATACACTAGCCACAAAATATTGTGCCTACCTACTGCTTGTGCATTGAACTAAGTAAAAGCCCACATCTAGTTATATAGTCCTACGAAGCTTTCAGTTATGATTTTTAGGTTTAAATCTGTTAAATATTGTGTTAGTACTTAGCTCTCATGTTATTTCCTTTTTTGCTTCCCCCTTCTTTGTAGGGCTTCACCTTTGGCATCTATCACTATACGCAACCTTTTGCTGTATACCACAAATGAAAATTGGCAGGTGGATGTTTTACATCGGTGTGGCTGATATTTTTAGTCACAACATATATTTTCTAACAAATTTTCCATCATAGGTGGTCAATCTTAAGGAAGCCCGTGATTTCTCTGCAAATAAGAAATTTATATATGTTTTCAAGGTAATTCTAGCTTCTATGCATGGGTTTATTTGGATTTTAGGTTATATTATTCTGAGCAAGAAACTATATTTTCATTGAATGTATGAAAAGGTACAAAAGTAGGAGAAAACTTTGGAAACCACTGAAGCTCATAAAGAATGTCTTTAGCTTACCATAGAAAAATTAGAATAATGACAATTTTAACTTATTTTGGTGTACACTTTTCCTCTTTTCTTTTTCTGTAATTACATTTATTAGATTATAACAATGTTATGAAAGTGTAAAAGTGGCCTTAACTTTTTTGGAGGAGGCAAAGAGTAAGGCTTGAGGAACTATCAGATGTGACCTAAGCTTAGATAAAAAATGTATTGGTGAGGCTTTTGTTTGTATATATTCTTTCATTTGTCCTAATGAACGTGTGGTTTCTTACCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANGAAATGCATAGAAGAGAATCATGTATAGGCTATGTATCATTAAAACTA
CGTTCACCATTCCCAACATCATCGTTATTAACACTAGAATCTACTTTCTTTTTAGTTTGTCTTCAAATTTTTGTAAACCTTCGTAGTAAATTGTTGATGGGCTAAATTTCTCACTTTGTATGTATTGGCATTAAACAGCTGAGTGAATTCTAATATTTTTTAAGAATATGATAAATTGGTACCCTCGTTTAGTCATTTCAAGTCATAAACTCAAAGCGTAAATGTGTTCTTATGGAAATGCCTCTTCCATAAGAGATGTTAGCTTCTTATGATCCAAAGAAGCCGTAAAACTAAGGCTACAAATTAAAAATTCACTCTTAAGGTGATCCTGAAGGAGTGAGCCTCAAACACCTTCAAAAGCATTGGTTCATCATTGGTAGAGAACGTGTTTTTAAGTACTGGCCTGGCTCCTCTGTGCATGCGTCAAATTTCAACTTAACATGAAATGATGTTATGTTCAGTGACTTATAATGCTAAATTCCTCTTGATACTAGAAACTTGAATGGGAATCTTTGTCAATTGATCTTCTGCCTCATCCGGATATGTTTGCTGATGCTACTTTCGCTCGTGCTAAAGAGGGAGCAGTTAGTAGGGATGATGATGGTGCTAAGCGTGTTTTCTTTGGTGGAGAGCGATTTATTGAAGGGATATCTGGTGAAGCTAATGTAATACAAATCTGATGTTTCAGTTTGATTCTTGATGAACTTGTATTTATGGTGCAATCTTAGAGAGCAGCTTCTATAAGTAGGATGGTTTTGCATTCCTGATACACTTCGTTTTGTAAACATGAGTAGATAACATTGCAAAGGACCGAGCTAAACAGTCCACTTGGTCTCGAGGTGAATTTACATATTACAGAAGCTGTATGCCCAGCCTTAAGTGAACCAGGTTTTTAAATCATTTGGCTGTTAGCTCAGTTGATCACAATGTGAACAAGGTGTTGTATGATGGTGATCTATCATGCTGATAGGATTTTTCTATGGCATCAACAGGACTTCGTGCCTTCCTTCGTTTTTTGACGGGATTGTATGTTTGTCTAAATAGAGGAGATGTGGATATGAAGGCTCAAAAGGTTATCATTTACTCTACTCTTTGAATTGTACTTTTGAATATGATATGGAAAAGTCTCACACATCTCTCTCTTTCTTGTTTTCATTGGCAGCGTTCAACAGAAGCAGCAGGACGTTCTTTAGTTTCTATTACTGCAGACCATATATTTCTCTGTGTAAAAGACCCTGGTTCGTGCCTTTTTCCTTAA

mRNA sequence

ATCCGCCCCATGGAGTCGATTCTGGCCCGAGCCCTCGAGTACACTCTCAAGTACTGGTTAAAATCTTTCTCACGGGACCAGTTCAAATTGCAGGGCCGGTCTGTGCAGCTCTCCAATTTGGATATCAATGGCGATGCTTTGCATTCCAGTATGGGGTTGCCTCCGGCGCTAAATGTTACGACGGCGAGGGTTGGCAAGTTGGAGATTATGCTACCATCGCTGAGTAATGTACAAGTGGAACCAGTCGTTGTGCAAATAGATAAATTGGATTTAGTTTTAGAGGAGAATCCAGATGCAGATGTGGGTAGAAGCAATAGTAGTAGTCAGACTTCTTCCAGCACCGTGAAGGGTGGTGGTTATGGATTTGCTGATAAGATTGCAGATGGAATGACAGTAGAGGTTCGCACTGTCAATTTGCTACTTGAAACTGGTGGTGGATCGCGACATCAAGGAGGAGCAACTTGGGCTTCACCTTTGGCATCTATCACTATACGCAACCTTTTGCTGTATACCACAAATGAAAATTGGCAGGTGGTCAATCTTAAGGAAGCCCGTGATTTCTCTGCAAATAAGAAATTTATATATGTTTTCAAGAAACTTGAATGGGAATCTTTGTCAATTGATCTTCTGCCTCATCCGGATATGTTTGCTGATGCTACTTTCGCTCGTGCTAAAGAGGGAGCAGTTAGTAGGGATGATGATGGTGCTAAGCGTGTTTTCTTTGGTGGAGAGCGATTTATTGAAGGGATATCTGGTGAAGCTAATATAACATTGCAAAGGACCGAGCTAAACAGTCCACTTGGTCTCGAGGTGAATTTACATATTACAGAAGCTGTATGCCCAGCCTTAAGTGAACCAGGACTTCGTGCCTTCCTTCGTTTTTTGACGGGATTGTATGTTTGTCTAAATAGAGGAGATGTGGATATGAAGGCTCAAAAGCGTTCAACAGAAGCAGCAGGACGTTCTTTAGTTTCTATTACTGCAGACCATATATTTCTCTGTGTAAAAGACCCTGGTTCGTGCCTTTTTCCTTAA

Coding sequence (CDS)

ATGGAGTCGATTCTGGCCCGAGCCCTCGAGTACACTCTCAAGTACTGGTTAAAATCTTTCTCACGGGACCAGTTCAAATTGCAGGGCCGGTCTGTGCAGCTCTCCAATTTGGATATCAATGGCGATGCTTTGCATTCCAGTATGGGGTTGCCTCCGGCGCTAAATGTTACGACGGCGAGGGTTGGCAAGTTGGAGATTATGCTACCATCGCTGAGTAATGTACAAGTGGAACCAGTCGTTGTGCAAATAGATAAATTGGATTTAGTTTTAGAGGAGAATCCAGATGCAGATGTGGGTAGAAGCAATAGTAGTAGTCAGACTTCTTCCAGCACCGTGAAGGGTGGTGGTTATGGATTTGCTGATAAGATTGCAGATGGAATGACAGTAGAGGTTCGCACTGTCAATTTGCTACTTGAAACTGGTGGTGGATCGCGACATCAAGGAGGAGCAACTTGGGCTTCACCTTTGGCATCTATCACTATACGCAACCTTTTGCTGTATACCACAAATGAAAATTGGCAGGTGGTCAATCTTAAGGAAGCCCGTGATTTCTCTGCAAATAAGAAATTTATATATGTTTTCAAGAAACTTGAATGGGAATCTTTGTCAATTGATCTTCTGCCTCATCCGGATATGTTTGCTGATGCTACTTTCGCTCGTGCTAAAGAGGGAGCAGTTAGTAGGGATGATGATGGTGCTAAGCGTGTTTTCTTTGGTGGAGAGCGATTTATTGAAGGGATATCTGGTGAAGCTAATATAACATTGCAAAGGACCGAGCTAAACAGTCCACTTGGTCTCGAGGTGAATTTACATATTACAGAAGCTGTATGCCCAGCCTTAAGTGAACCAGGACTTCGTGCCTTCCTTCGTTTTTTGACGGGATTGTATGTTTGTCTAAATAGAGGAGATGTGGATATGAAGGCTCAAAAGCGTTCAACAGAAGCAGCAGGACGTTCTTTAGTTTCTATTACTGCAGACCATATATTTCTCTGTGTAAAAGACCCTGGTTCGTGCCTTTTTCCTTAA

Protein sequence

MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTARVGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFADKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKEARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGGERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLNRGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDPGSCLFP
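
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code; the record does not say which translation table was used. `translate` and `CODON_TABLE` are illustrative names, and the two inputs are short fragments copied from the start and end of the CDS above.

```python
# Standard genetic code in the usual compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing ambiguous bases (e.g. N) are reported as X.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGGAGTCGATTCTGGCCCGAGCC"))  # first 8 codons of the CDS: MESILARA
print(translate("TGCCTTTTTCCTTAA"))           # last 5 codons of the CDS: CLFP
```

Both fragments agree with the first and last residues of the protein sequence listed above.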
BLAST of Cp4.1LG00g00170 vs. Swiss-Prot
Match: UH1BL_XENLA (UHRF1-binding protein 1-like OS=Xenopus laevis GN=uhrf1bp1l PE=2 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 1.4e-06
Identity = 45/208 (21.63%), Positives = 96/208 (46.15%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKL---QGRSVQLSNLDINGDALHSSMGLPPALNVT 60
           M  ++ + +   L  + K+ S D+  L   +G   QL+NL+++ + L + + LP  L + 
Sbjct: 1   MAGLIKKQILKHLSRFTKNLSPDKINLSTLKGEG-QLTNLELDEEVLQNMLDLPTWLAIN 60

Query: 61  TARVGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGY 120
                K  I +P  + ++  P+ + +DK   V+ E    +  RS +      +      Y
Sbjct: 61  KVFCNKAAIRIP-WTKLKTHPISLSLDK---VIMEMSTCEEPRSCNGPSPLVTASGQSEY 120

Query: 121 GFADKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVN 180
           GFA+K+ +G+++ V ++ + +     +            AS  +  L +Y+ N +WQ  +
Sbjct: 121 GFAEKVVEGISLSVNSIIIRIRAKAFN------------ASFELSQLRIYSVNPSWQHGD 180

Query: 181 LKEARDFSANKKFIYVFKKLEWESLSID 206
           L+  R     +  +  FK++ W+ + I+
Sbjct: 181 LRFTRIQDPQRGEVLTFKEINWQMIRIE 191
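
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length, gaps included. The following is a small sketch that reproduces the figures reported for this hit; `percent_of_alignment` is a hypothetical helper, and rounding to two decimals is an assumption based on how the values are printed here.

```python
def percent_of_alignment(count, alignment_length):
    """Return count as a percentage of the alignment length, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the UH1BL_XENLA HSP header above.
print(percent_of_alignment(45, 208))  # identity: 21.63
print(percent_of_alignment(96, 208))  # positives: 46.15
```

The same calculation applies to the other hits on this page, for example 318/335 for the Cucumis sativus match.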

BLAST of Cp4.1LG00g00170 vs. TrEMBL
Match: A0A0A0L7Q7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G081370 PE=4 SV=1)

HSP 1 Score: 632.5 bits (1630), Expect = 3.0e-178
Identity = 318/335 (94.93%), Positives = 327/335 (97.61%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+ QLSNLDINGDALHSS+GLPPALNVTTAR
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTAQLSNLDINGDALHSSLGLPPALNVTTAR 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDAD+GRS SSSQTSSSTVKGGGYGFA
Sbjct: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADMGRSTSSSQTSSSTVKGGGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADA  ARA+EG + RDDDGAKRVFFGG
Sbjct: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADANLARAQEGPIGRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERFIEGISGEANITLQRTELNSPLGLEVNL+ITEAVCPALSEPGLRAFLRFLTGLYVCLN
Sbjct: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLYITEAVCPALSEPGLRAFLRFLTGLYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+K+Q+RSTEAAGRSLVSI  DHIFLCVKDP
Sbjct: 301 RGDVDLKSQQRSTEAAGRSLVSIIVDHIFLCVKDP 335

BLAST of Cp4.1LG00g00170 vs. TrEMBL
Match: A0A061F5G6_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_031110 PE=4 SV=1)

HSP 1 Score: 577.4 bits (1487), Expect = 1.1e-161
Identity = 282/335 (84.18%), Positives = 315/335 (94.03%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+VQLSNLDINGDALH+SMGLPPALNVTTA+
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEI+LP +SNVQ+EP++VQID+LDLVLEENPDAD  RS+SS+Q+S+S+ KG GYGFA
Sbjct: 61  VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMT++V+TVNLLLET GG+R +GGA WASP+ASIT+RN+LLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFS+NKKFIYVFKKLEWESLSIDLLPHPDMF+DA  AR++EGA  RDDDGAKRVFFGG
Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERF+EGISGEA IT+QRTELNSPLGLEV LH+TEAVCPALSEPGLRA LRFLTG YVCLN
Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+KAQ+ S EAAGRSLVS+  DHIFLC+KDP
Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDP 335

BLAST of Cp4.1LG00g00170 vs. TrEMBL
Match: A0A061F6D5_THECC (Uncharacterized protein isoform 3 (Fragment) OS=Theobroma cacao GN=TCM_031110 PE=4 SV=1)

HSP 1 Score: 577.4 bits (1487), Expect = 1.1e-161
Identity = 282/335 (84.18%), Positives = 315/335 (94.03%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+VQLSNLDINGDALH+SMGLPPALNVTTA+
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEI+LP +SNVQ+EP++VQID+LDLVLEENPDAD  RS+SS+Q+S+S+ KG GYGFA
Sbjct: 61  VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMT++V+TVNLLLET GG+R +GGA WASP+ASIT+RN+LLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFS+NKKFIYVFKKLEWESLSIDLLPHPDMF+DA  AR++EGA  RDDDGAKRVFFGG
Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERF+EGISGEA IT+QRTELNSPLGLEV LH+TEAVCPALSEPGLRA LRFLTG YVCLN
Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+KAQ+ S EAAGRSLVS+  DHIFLC+KDP
Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDP 335

BLAST of Cp4.1LG00g00170 vs. TrEMBL
Match: A0A061F7C8_THECC (Uncharacterized protein isoform 4 OS=Theobroma cacao GN=TCM_031110 PE=4 SV=1)

HSP 1 Score: 577.4 bits (1487), Expect = 1.1e-161
Identity = 282/335 (84.18%), Positives = 315/335 (94.03%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+VQLSNLDINGDALH+SMGLPPALNVTTA+
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEI+LP +SNVQ+EP++VQID+LDLVLEENPDAD  RS+SS+Q+S+S+ KG GYGFA
Sbjct: 61  VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMT++V+TVNLLLET GG+R +GGA WASP+ASIT+RN+LLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFS+NKKFIYVFKKLEWESLSIDLLPHPDMF+DA  AR++EGA  RDDDGAKRVFFGG
Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERF+EGISGEA IT+QRTELNSPLGLEV LH+TEAVCPALSEPGLRA LRFLTG YVCLN
Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+KAQ+ S EAAGRSLVS+  DHIFLC+KDP
Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDP 335

BLAST of Cp4.1LG00g00170 vs. TrEMBL
Match: A0A061FDN6_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_031110 PE=4 SV=1)

HSP 1 Score: 577.4 bits (1487), Expect = 1.1e-161
Identity = 282/335 (84.18%), Positives = 315/335 (94.03%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+VQLSNLDINGDALH+SMGLPPALNVTTA+
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEI+LP +SNVQ+EP++VQID+LDLVLEENPDAD  RS+SS+Q+S+S+ KG GYGFA
Sbjct: 61  VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMT++V+TVNLLLET GG+R +GGA WASP+ASIT+RN+LLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFS+NKKFIYVFKKLEWESLSIDLLPHPDMF+DA  AR++EGA  RDDDGAKRVFFGG
Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERF+EGISGEA IT+QRTELNSPLGLEV LH+TEAVCPALSEPGLRA LRFLTG YVCLN
Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+KAQ+ S EAAGRSLVS+  DHIFLC+KDP
Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDP 335

BLAST of Cp4.1LG00g00170 vs. TAIR10
Match: AT3G20720.2 (AT3G20720.2 unknown protein)

HSP 1 Score: 529.3 bits (1362), Expect = 1.8e-150
Identity = 262/334 (78.44%), Positives = 301/334 (90.12%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSF+RDQFKLQGR+ QLSNLDING+A+H+SMGLPPAL+VTTA+
Sbjct: 1   MESILARALEYTLKYWLKSFTRDQFKLQGRTAQLSNLDINGEAIHASMGLPPALSVTTAK 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEIMLP +SNVQ EP+VVQIDKLDLVLEENPDADV +  SSSQ+ +++ K  GYGFA
Sbjct: 61  VGKLEIMLPYVSNVQTEPIVVQIDKLDLVLEENPDADVTKGPSSSQSPTASAKSNGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMT++V+ VNLLLETGGG+  +GGA WA+PLASITIRNL+LYTTNE+W+VVNLKE
Sbjct: 121 DKIADGMTLQVKVVNLLLETGGGANREGGAAWAAPLASITIRNLVLYTTNESWKVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFS N  FIY+FKKLEWE+LSIDLLPHPDMF +A  AR++E A  RD+DGAKRVFFGG
Sbjct: 181 ARDFSTNTGFIYLFKKLEWEALSIDLLPHPDMFTEANLARSEE-ANLRDEDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERF+EGISG+A IT+QRT LNSPLGLEV LHI EAVCPALSEPGLRA LRFLTG+Y+CLN
Sbjct: 241 ERFLEGISGQAYITVQRTALNSPLGLEVQLHIPEAVCPALSEPGLRALLRFLTGMYLCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKD 335
           RGDVD K+Q +S EAAGRSLVS+  DH+FLC+KD
Sbjct: 301 RGDVDPKSQ-QSAEAAGRSLVSVLVDHVFLCIKD 332

BLAST of Cp4.1LG00g00170 vs. NCBI nr
Match: gi|659126966|ref|XP_008463451.1| (PREDICTED: uncharacterized protein LOC103501618 [Cucumis melo])

HSP 1 Score: 636.0 bits (1639), Expect = 3.9e-179
Identity = 320/335 (95.52%), Positives = 327/335 (97.61%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+ QLSNLDINGDALHSS+GLPPALNVTTAR
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTAQLSNLDINGDALHSSLGLPPALNVTTAR 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRS SSSQTSSSTVKGGGYGFA
Sbjct: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSTSSSQTSSSTVKGGGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADA  ARA+EG + RDDDGAKRVFFGG
Sbjct: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADANLARAQEGPIGRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN
Sbjct: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+K+Q+RSTEAAGRSLVSI  DHIFLCVKDP
Sbjct: 301 RGDVDLKSQQRSTEAAGRSLVSIIVDHIFLCVKDP 335

BLAST of Cp4.1LG00g00170 vs. NCBI nr
Match: gi|449470413|ref|XP_004152911.1| (PREDICTED: uncharacterized protein LOC101210396 [Cucumis sativus])

HSP 1 Score: 632.5 bits (1630), Expect = 4.3e-178
Identity = 318/335 (94.93%), Positives = 327/335 (97.61%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+ QLSNLDINGDALHSS+GLPPALNVTTAR
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTAQLSNLDINGDALHSSLGLPPALNVTTAR 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDAD+GRS SSSQTSSSTVKGGGYGFA
Sbjct: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADMGRSTSSSQTSSSTVKGGGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADA  ARA+EG + RDDDGAKRVFFGG
Sbjct: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADANLARAQEGPIGRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERFIEGISGEANITLQRTELNSPLGLEVNL+ITEAVCPALSEPGLRAFLRFLTGLYVCLN
Sbjct: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLYITEAVCPALSEPGLRAFLRFLTGLYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+K+Q+RSTEAAGRSLVSI  DHIFLCVKDP
Sbjct: 301 RGDVDLKSQQRSTEAAGRSLVSIIVDHIFLCVKDP 335

BLAST of Cp4.1LG00g00170 vs. NCBI nr
Match: gi|590607736|ref|XP_007021070.1| (Uncharacterized protein isoform 2 [Theobroma cacao])

HSP 1 Score: 577.4 bits (1487), Expect = 1.6e-161
Identity = 282/335 (84.18%), Positives = 315/335 (94.03%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+VQLSNLDINGDALH+SMGLPPALNVTTA+
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEI+LP +SNVQ+EP++VQID+LDLVLEENPDAD  RS+SS+Q+S+S+ KG GYGFA
Sbjct: 61  VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMT++V+TVNLLLET GG+R +GGA WASP+ASIT+RN+LLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFS+NKKFIYVFKKLEWESLSIDLLPHPDMF+DA  AR++EGA  RDDDGAKRVFFGG
Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERF+EGISGEA IT+QRTELNSPLGLEV LH+TEAVCPALSEPGLRA LRFLTG YVCLN
Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+KAQ+ S EAAGRSLVS+  DHIFLC+KDP
Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDP 335

BLAST of Cp4.1LG00g00170 vs. NCBI nr
Match: gi|590607739|ref|XP_007021071.1| (Uncharacterized protein isoform 3, partial [Theobroma cacao])

HSP 1 Score: 577.4 bits (1487), Expect = 1.6e-161
Identity = 282/335 (84.18%), Positives = 315/335 (94.03%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+VQLSNLDINGDALH+SMGLPPALNVTTA+
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEI+LP +SNVQ+EP++VQID+LDLVLEENPDAD  RS+SS+Q+S+S+ KG GYGFA
Sbjct: 61  VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMT++V+TVNLLLET GG+R +GGA WASP+ASIT+RN+LLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFS+NKKFIYVFKKLEWESLSIDLLPHPDMF+DA  AR++EGA  RDDDGAKRVFFGG
Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERF+EGISGEA IT+QRTELNSPLGLEV LH+TEAVCPALSEPGLRA LRFLTG YVCLN
Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+KAQ+ S EAAGRSLVS+  DHIFLC+KDP
Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDP 335

BLAST of Cp4.1LG00g00170 vs. NCBI nr
Match: gi|590607743|ref|XP_007021072.1| (Uncharacterized protein isoform 4 [Theobroma cacao])

HSP 1 Score: 577.4 bits (1487), Expect = 1.6e-161
Identity = 282/335 (84.18%), Positives = 315/335 (94.03%), Query Frame = 1

Query: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRSVQLSNLDINGDALHSSMGLPPALNVTTAR 60
           MESILARALEYTLKYWLKSFSRDQFKLQGR+VQLSNLDINGDALH+SMGLPPALNVTTA+
Sbjct: 1   MESILARALEYTLKYWLKSFSRDQFKLQGRTVQLSNLDINGDALHASMGLPPALNVTTAK 60

Query: 61  VGKLEIMLPSLSNVQVEPVVVQIDKLDLVLEENPDADVGRSNSSSQTSSSTVKGGGYGFA 120
           VGKLEI+LP +SNVQ+EP++VQID+LDLVLEENPDAD  RS+SS+Q+S+S+ KG GYGFA
Sbjct: 61  VGKLEIILPYVSNVQIEPIIVQIDRLDLVLEENPDADSSRSSSSTQSSTSSGKGSGYGFA 120

Query: 121 DKIADGMTVEVRTVNLLLETGGGSRHQGGATWASPLASITIRNLLLYTTNENWQVVNLKE 180
           DKIADGMT++V+TVNLLLET GG+R +GGA WASP+ASIT+RN+LLYTTNENWQVVNLKE
Sbjct: 121 DKIADGMTLQVQTVNLLLETRGGARGKGGAAWASPMASITMRNILLYTTNENWQVVNLKE 180

Query: 181 ARDFSANKKFIYVFKKLEWESLSIDLLPHPDMFADATFARAKEGAVSRDDDGAKRVFFGG 240
           ARDFS+NKKFIYVFKKLEWESLSIDLLPHPDMF+DA  AR++EGA  RDDDGAKRVFFGG
Sbjct: 181 ARDFSSNKKFIYVFKKLEWESLSIDLLPHPDMFSDANLARSQEGATHRDDDGAKRVFFGG 240

Query: 241 ERFIEGISGEANITLQRTELNSPLGLEVNLHITEAVCPALSEPGLRAFLRFLTGLYVCLN 300
           ERF+EGISGEA IT+QRTELNSPLGLEV LH+TEAVCPALSEPGLRA LRFLTG YVCLN
Sbjct: 241 ERFLEGISGEAYITVQRTELNSPLGLEVQLHVTEAVCPALSEPGLRALLRFLTGFYVCLN 300

Query: 301 RGDVDMKAQKRSTEAAGRSLVSITADHIFLCVKDP 336
           RGDVD+KAQ+ S EAAGRSLVS+  DHIFLC+KDP
Sbjct: 301 RGDVDLKAQQGSIEAAGRSLVSVVVDHIFLCIKDP 335

The following BLAST results are available for this feature:

Swiss-Prot
Match Name     E-value     Identity (%)   Description
UH1BL_XENLA    1.4e-06     21.63          UHRF1-binding protein 1-like OS=Xenopus laevis GN=uhrf1bp1l PE=2 SV=1

TrEMBL
Match Name          E-value     Identity (%)   Description
A0A0A0L7Q7_CUCSA    3.0e-178    94.93          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G081370 PE=4 SV=1
A0A061F5G6_THECC    1.1e-161    84.18          Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_031110 PE=4 SV=1
A0A061F6D5_THECC    1.1e-161    84.18          Uncharacterized protein isoform 3 (Fragment) OS=Theobroma cacao GN=TCM_031110 PE...
A0A061F7C8_THECC    1.1e-161    84.18          Uncharacterized protein isoform 4 OS=Theobroma cacao GN=TCM_031110 PE=4 SV=1
A0A061FDN6_THECC    1.1e-161    84.18          Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_031110 PE=4 SV=1

TAIR10
Match Name     E-value     Identity (%)   Description
AT3G20720.2    1.8e-150    78.44          unknown protein

NCBI nr
Match Name                          E-value     Identity (%)   Description
gi|659126966|ref|XP_008463451.1|    3.9e-179    95.52          PREDICTED: uncharacterized protein LOC103501618 [Cucumis melo]
gi|449470413|ref|XP_004152911.1|    4.3e-178    94.93          PREDICTED: uncharacterized protein LOC101210396 [Cucumis sativus]
gi|590607736|ref|XP_007021070.1|    1.6e-161    84.18          Uncharacterized protein isoform 2 [Theobroma cacao]
gi|590607739|ref|XP_007021071.1|    1.6e-161    84.18          Uncharacterized protein isoform 3, partial [Theobroma cacao]
gi|590607743|ref|XP_007021072.1|    1.6e-161    84.18          Uncharacterized protein isoform 4 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR026854    VPS13_N
IPR026728    UHRF1BP1-like
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008150       biological_process
cellular_component    GO:0005575       cellular_component
molecular_function    GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG00g00170.1    Cp4.1LG00g00170.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                                                      Source     Source Term       Source Description   Alignment
IPR026728    UHRF1-binding protein 1-like                                         PANTHER    PTHR22774         UNCHARACTERIZED      coord: 2..334; score: 2.8E
IPR026854    Vacuolar protein sorting-associated protein 13, N-terminal domain    PFAM       PF12624           Chorein_N            coord: 2..100; score: 1.2
None         No IPR available                                                     PANTHER    PTHR22774:SF11    PROTEIN C44H4.4      coord: 2..334; score: 2.8E