Cp4.1LG18g02150 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG18g02150
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protein of unknown function, DUF538
Location: Cp4.1LG18 : 3774605 .. 3777497 (-)
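
The Location field can be read programmatically. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the record does not state this), and the helper names `parse_location`, `span_length` and `reverse_complement` are hypothetical. The record also does not say whether the sequences shown below are already reverse-complemented for the minus strand, so `reverse_complement` is offered only as a utility.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

# Complement table for unambiguous DNA bases only.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Reverse-complement a DNA string (for features on the '-' strand)."""
    return seq.translate(_COMPLEMENT)[::-1]

seqid, start, end, strand = parse_location("Cp4.1LG18 : 3774605 .. 3777497 (-)")
length = span_length(start, end)
print(seqid, start, end, strand, length)
```

Under the inclusive-coordinate assumption, the gene spans 3777497 - 3774605 + 1 = 2893 bp on linkage group Cp4.1LG18.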
The following sequences are available for this feature:

Gene sequence (with intron)

CGATCCCTTCCCCACACCTCAAACCCTCCATCATCAATGGCTTCCTTCTCAAGAAACCTCTCCCCATTTTCACTATTCCTTCTAATTCTCGCCATTTCCACCCAAACCCATCTCTCCTTTTCATCAATAGACAAACCCCTCAACCCAACAGACATCCACGAACTTCTACTCCGTTACGGTTTTCCCGCTGGTCTCTTACCCAACAATGTCAAGTCCTACACTCTCTCAGACGACGGCGCCTTCGAAATCGAACTCGAAACCGATTGCTATGTGATGCTCAGTGAGCTGGTTTACTACGAAAAGAAAATCACGGGGAAATTGAGCTATGGGTCTGTGACCGATGTTTCTGGGATTCAAGTTAAGAAGCTGTTCTTGTGGGTCTCTGTTTCTGGGTTCAAGTCTAATCAAGGATCTGGAACCATCGTGTTCTTTGTCGGACCCTTGTCTGAAACTTTTCCGGCAGAAATGTTCCGGACGATTCCTGGTTGTAGAAGGAAGGCTTGCCTAGGAGGAAGAACAGAAGCCATATGAGGTAATTTTGATATGGGTTGGATTGGATTTGATTTTCTTGAGTGATTGATTGTTTGTTGAGTTTGATTTCTGCTTATGAATGTTCTTGTGAATCATGGGTCTGGAATTTGTGTTCTGAAATTTGGGTTGTTTGCTGTGATTTTGGATTTTGTTGACTGAGATTCTTGTTCGTATGATCTATCTATGTGGTTAAAATGAAGATCGGAAAGAAAATGTGAACTTTTATAACGGCCGAAGCACACCGCTACCGGAGATTGTTTTATTTGGGCTTTTTCTTTTCGGGATTCTCTTCAACGTTTTTAAGACGTGTCTGGTAAGGAGAGATTTACACACCCTTATGAAGAATGTTTTGTTCCCCTCTCCAACCGATGTGCACCCCTCCTGGGGTCAAGCGTCTTCGTTCTCCTCTCCCATTGATGTGGAACCCCCAACTCCACTCCTTTTTGGGGCCTAGTGTTCTCGTTGGCACACCGTTTAGTGACCATCCTCCTTTGAAGTTCAGCCTCCTCGCGCTCGGTGTCTGACTCTAATACCATTGTAAGATTCCATATTGATTGGAGAGGGGAACAAGTCATTCTTTATAAGGATGTAGAAACCTCTCCCTAACCGATGCGTTTTAAAAATCTCAAGGACATTAGGTCCCTTTACGCTACAACTACAATGATTGTGATTGGAAAGTTGTTGGATGATAGCAACATTAGCATGAATTTCTTGTCACTAAATGATTATAATCTTCGATGTCTTCTAGATATTGTTGTGTCTTGTCTGATAATCATGCTTTTGTTTTTTCTTTCCAGAGCTGACATTAATCTCTTTGTCTGTCTCTTGCAGATGGAAATGATAGCTTTCCAAGCATGTTCATCTTTGTTTCCACAATGTTAACAAAATGTTTCTTTCCATAACATTATCCTTCTACTATATCAAATTTCATTATCAAATGAAAATTTATGTATTGTCTACGGCTGCATGGCCGTTAAATGTCAAACCTCTTGCATTGTATCACTTTATTGCCTTCTTTTAACGTTTTCATGATGTATAGTTTCCTTTTTGTATTTTTTTAGTGTACTACGTTTAAACAAAATCATGTCTACGCCTCGTAGACCTAAAATGCCTCTGAATGTACCATTAAGTCTTTCCTAACAACTCCTTTAGAGCTCTCGAACAAACTACATTATTCAGACTTTCCTTATCTAAAGCCTTGAACCATCAAGTAAAGTACACTAAGTTTTTCTTTAAATAGTCATATCTATCCGCTTAATACTATCTAGATGATTCCTTTGAAACACGTTGCCAGTCTAATAAAATTTTACCATGACTACACTTTCCGGACTTCTTTATTCGACACCATAATATTTGTCAGTACAAGTTAATCAAACCAACCTTCATCCTTAAGTATTCCAATCCAGGAGAAGATGCACAAATCAAAATGGAGGGCAGAAAGGGCAAAAGCAGATAAAACCCAAATAAGG
GACACTATGGAGCTTTGGATTGATGCAAAAGCATGATGATATATACATATACTTCCTACATTCTTTACAAGTATTCTTCACCTACATGATATGTTCGTCACTTTCTTTCTTTTTTCTTCTTTTATTTCCTTCTTTCCCCTATTTACAAGCCAATTTGTATGGTTCGTATACTATACTATCTTGCTATTCCAATCAAGAATATTCATGTTCATTCATTTACCTTACGACCCAGCGCCATTGGCTTCACTGCAGCCAAAACGACAGCAACTTTACACGCAAGAAATCCCAGCATTCCTATCACCACTTCTTTAGGTGACAGCATCATGGTTGAATCTCCCAAGTTGAACTTAACTGTAAGAATGGAAAGTCCAACTGCCAGTGCCAAGACCGATAAAGGGCCTCTATATCTTCGATCTTCATTCACAGTTGTTTCAGAATTTGAAGTCGGGGCAGGTAACTCGTCCACAGATCTTTGCAAGAGCAACAGATACAAGAAACCAAGAATGCCACCAGCTAAGAATGCAAGGCCGGCATTTTCACCAGCTGAGAAGGATGAAACAGATGTGCCAGCAAGAATCAACAAGGCATCATAAGTAAGCAAAGAGAGCTTCAAGTCTGCATATTCTTTCATGCTTTCCTCATTGGAAATACTGTCAATAGTGGCTAAATTTGAAAGGTTGGAGCTTCCATTCAGGAAGAACAAGGGCTCAATTCCACATACTTCTGATACCAGGCAAGGCCTAAGTTCTACCATTGATTTGTCGCCGCCCTCCCCGAGCAATATATCCTCGGCTGGGAATTCATACTTGAGACACATGTATTGAAGCTCATGCCCTTCAGATTTCGACTGAGATATGACATACAAGCTTAGACTTCCAAGTCTCCACTGGCCT

mRNA sequence

CGATCCCTTCCCCACACCTCAAACCCTCCATCATCAATGGCTTCCTTCTCAAGAAACCTCTCCCCATTTTCACTATTCCTTCTAATTCTCGCCATTTCCACCCAAACCCATCTCTCCTTTTCATCAATAGACAAACCCCTCAACCCAACAGACATCCACGAACTTCTACTCCGTTACGGTTTTCCCGCTGGTCTCTTACCCAACAATGTCAAGTCCTACACTCTCTCAGACGACGGCGCCTTCGAAATCGAACTCGAAACCGATTGCTATGTGATGCTCAGTGAGCTGGTTTACTACGAAAAGAAAATCACGGGGAAATTGAGCTATGGGTCTGTGACCGATGTTTCTGGGATTCAAGTTAAGAAGCTGTTCTTGTGGGTCTCTGTTTCTGGGTTCAAGTCTAATCAAGGATCTGGAACCATCGTGTTCTTTGTCGGACCCTTGTCTGAAACTTTTCCGGCAGAAATGTTCCGGACGATTCCTGGTTGTAGAAGGAAGGCTTGCCTAGGAGGAAGAACAGAAGCCATATGAATGGAAATGATAGCTTTCCAAGCATGTTCATCTTTGTTTCCACAATGTTAACAAAATGTTTCTTTCCATAACATTATCCTTCTACTATATCAAATTTCATTATCAAATGAAAATTTATGTATTGTCTACGGCTGCATGGCCGTTAAATGTCAAACCTCTTGCATTGTATCACTTTATTGCCTTCTTTTAACGTTTTCATGATGTATAGTTTCCTTTTTGTATTTTTTTAGTGTACTACGTTTAAACAAAATCATGTCTACGCCTCGTAGACCTAAAATGCCTCTGAATGTACCATTAAGTCTTTCCTAACAACTCCTTTAGAGCTCTCGAACAAACTACATTATTCAGACTTTCCTTATCTAAAGCCTTGAACCATCAAGTAAAGTACACTAAGTTTTTCTTTAAATAGTCATATCTATCCGCTTAATACTATCTAGATGATTCCTTTGAAACACGTTGCCAGTCTAATAAAATTTTACCATGACTACACTTTCCGGACTTCTTTATTCGACACCATAATATTTGTCAGTACAAGTTAATCAAACCAACCTTCATCCTTAAGTATTCCAATCCAGGAGAAGATGCACAAATCAAAATGGAGGGCAGAAAGGGCAAAAGCAGATAAAACCCAAATAAGGGACACTATGGAGCTTTGGATTGATGCAAAAGCATGATGATATATACATATACTTCCTACATTCTTTACAAGTATTCTTCACCTACATGATATCCAAAACGACAGCAACTTTACACGCAAGAAATCCCAGCATTCCTATCACCACTTCTTTAGGTGACAGCATCATGGTTGAATCTCCCAAGTTGAACTTAACTGTAAGAATGGAAAGTCCAACTGCCAGTGCCAAGACCGATAAAGGGCCTCTATATCTTCGATCTTCATTCACAGTTGTTTCAGAATTTGAAGTCGGGGCAGGTAACTCGTCCACAGATCTTTGCAAGAGCAACAGATACAAGAAACCAAGAATGCCACCAGCTAAGAATGCAAGGCCGGCATTTTCACCAGCTGAGAAGGATGAAACAGATGTGCCAGCAAGAATCAACAAGGCATCATAAGTAAGCAAAGAGAGCTTCAAGTCTGCATATTCTTTCATGCTTTCCTCATTGGAAATACTGTCAATAGTGGCTAAATTTGAAAGGTTGGAGCTTCCATTCAGGAAGAACAAGGGCTCAATTCCACATACTTCTGATACCAGGCAAGGCCTAAGTTCTACCATTGATTTGTCGCCGCCCTCCCCGAGCAATATATCCTCGGCTGGGAATTCATACTTGAGACACATGTATTGAAGCTCATGCCCTTCAGATTTCGACTGAGATATGACATACAAGCTTAGACTTCCAAGTCTCCACTGGCCT

Coding sequence (CDS)

ATGGCTTCCTTCTCAAGAAACCTCTCCCCATTTTCACTATTCCTTCTAATTCTCGCCATTTCCACCCAAACCCATCTCTCCTTTTCATCAATAGACAAACCCCTCAACCCAACAGACATCCACGAACTTCTACTCCGTTACGGTTTTCCCGCTGGTCTCTTACCCAACAATGTCAAGTCCTACACTCTCTCAGACGACGGCGCCTTCGAAATCGAACTCGAAACCGATTGCTATGTGATGCTCAGTGAGCTGGTTTACTACGAAAAGAAAATCACGGGGAAATTGAGCTATGGGTCTGTGACCGATGTTTCTGGGATTCAAGTTAAGAAGCTGTTCTTGTGGGTCTCTGTTTCTGGGTTCAAGTCTAATCAAGGATCTGGAACCATCGTGTTCTTTGTCGGACCCTTGTCTGAAACTTTTCCGGCAGAAATGTTCCGGACGATTCCTGGTTGTAGAAGGAAGGCTTGCCTAGGAGGAAGAACAGAAGCCATATGA

Protein sequence

MASFSRNLSPFSLFLLILAISTQTHLSFSSIDKPLNPTDIHELLLRYGFPAGLLPNNVKSYTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGFKSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRTEAI
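
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch, assuming the standard genetic code applies (reasonable for a nuclear plant gene, but not stated in the record), the first and last codons of the CDS can be translated as follows. The function name `translate` and the short test fragments are illustrative; the fragments are copied from the start and end of the CDS shown above.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored; codons that
    contain anything other than A, C, G or T raise KeyError.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGCTTCCTTCTCAAGAAACCTCTCCCCA"  # first 10 codons of the CDS
cds_end = "ACAGAAGCCATATGA"                   # last 5 codons, including the stop
print(translate(cds_start))  # MASFSRNLSP
print(translate(cds_end))    # TEAI
```

The two outputs match the first ten and last four residues of the protein sequence listed above.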
BLAST of Cp4.1LG18g02150 vs. TrEMBL
Match: A0A0A0M0C4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G630320 PE=4 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 1.8e-51
Identity = 107/164 (65.24%), Positives = 125/164 (76.22%), Query Frame = 1

Query: 1   MASFSRNLSPFSLFLLILAISTQTHLSFSSIDKPLNPTDIHELLLRYGFPAGLLPNNVKS 60
           MASFS  LSPFSLFLLIL  STQTHLSFS+ D PL  +DIH+LL  YGFP GLLP+NV S
Sbjct: 1   MASFSTILSPFSLFLLILLFSTQTHLSFSARDFPLRSSDIHDLLPLYGFPVGLLPDNVNS 60

Query: 61  YTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGF 120
           YTLSDDG FEI+L++ CYV  S+LVYY K I GKLS  S++DVSGI+VKKLF W+ ++G 
Sbjct: 61  YTLSDDGTFEIQLQSSCYVHFSDLVYYGKNIKGKLSNRSLSDVSGIEVKKLFAWLPITGI 120

Query: 121 KSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRTEAI 165
           K    S +I F VG LSE  P  MF +IP CRRKACL G+TEA+
Sbjct: 121 KVTPDSKSIEFAVGFLSEILPVSMFESIPTCRRKACLEGKTEAM 164

BLAST of Cp4.1LG18g02150 vs. TrEMBL
Match: M5W1G2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa012400mg PE=4 SV=1)

HSP 1 Score: 177.6 bits (449), Expect = 1.3e-41
Identity = 88/168 (52.38%), Positives = 115/168 (68.45%), Query Frame = 1

Query: 7   NLSPFSLFLLILAISTQTHLSFSSIDKPLNP----------TDIHELLLRYGFPAGLLPN 66
           +++  S +LLIL   ++TH   S  D   +P          TD+H+LL +YG P GLLP+
Sbjct: 4   SIAQISFYLLILTFFSETHFGLSLRDLKSDPNRPSTTAQSITDVHDLLPKYGLPKGLLPD 63

Query: 67  NVKSYTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVS 126
           NV SYTLS+DG+FEI LE+ CYV   +LVYY K I GKLSYGSV+DVSGIQ KKLF+WVS
Sbjct: 64  NVNSYTLSEDGSFEIYLESPCYVHFDQLVYYNKNIKGKLSYGSVSDVSGIQAKKLFIWVS 123

Query: 127 VSGFKSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRTEAI 165
           V+G + +QGS ++ F+VG LSE  PA+ F  IP C+ KAC G   ++I
Sbjct: 124 VTGIQVDQGSDSVEFYVGALSEKLPAKQFEDIPVCKSKACQGTYVDSI 171

BLAST of Cp4.1LG18g02150 vs. TrEMBL
Match: A0A061GS48_THECC (Uncharacterized protein isoform 2 (Fragment) OS=Theobroma cacao GN=TCM_040161 PE=4 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 2.2e-41
Identity = 91/161 (56.52%), Positives = 114/161 (70.81%), Query Frame = 1

Query: 5   SRNLSPFSLFLLILAISTQTHLSFSSI------DKPLN--PTDIHELLLRYGFPAGLLPN 64
           S+  +P SL+LL+LA+  QTHLSFS+        +PL+   +D+H+LL  YG P G+LPN
Sbjct: 61  SKMAAPISLYLLLLALFCQTHLSFSTTGPAIDHSRPLSISTSDVHDLLPTYGLPKGILPN 120

Query: 65  NVKSYTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVS 124
           NVKSYTLS  G F IELE+ CYV   +LVYYEKKI GKLSYG+V DVSGIQ KKLFLW+ 
Sbjct: 121 NVKSYTLSTTGDFTIELESTCYVQFDQLVYYEKKIKGKLSYGAVHDVSGIQAKKLFLWLP 180

Query: 125 VSGFKSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACL 158
           V+G + ++ SG + FFVG LSE  PA+ F  IP C+  A L
Sbjct: 181 VTGIEVDENSGMVQFFVGALSEKLPAKQFEDIPVCKGNAFL 221

BLAST of Cp4.1LG18g02150 vs. TrEMBL
Match: A0A061GR70_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_040161 PE=4 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 4.8e-41
Identity = 90/157 (57.32%), Positives = 112/157 (71.34%), Query Frame = 1

Query: 9   SPFSLFLLILAISTQTHLSFSSI------DKPLN--PTDIHELLLRYGFPAGLLPNNVKS 68
           +P SL+LL+LA+  QTHLSFS+        +PL+   +D+H+LL  YG P G+LPNNVKS
Sbjct: 3   APISLYLLLLALFCQTHLSFSTTGPAIDHSRPLSISTSDVHDLLPTYGLPKGILPNNVKS 62

Query: 69  YTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGF 128
           YTLS  G F IELE+ CYV   +LVYYEKKI GKLSYG+V DVSGIQ KKLFLW+ V+G 
Sbjct: 63  YTLSTTGDFTIELESTCYVQFDQLVYYEKKIKGKLSYGAVHDVSGIQAKKLFLWLPVTGI 122

Query: 129 KSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACL 158
           + ++ SG + FFVG LSE  PA+ F  IP C+  A L
Sbjct: 123 EVDENSGMVQFFVGALSEKLPAKQFEDIPVCKGNAFL 159

BLAST of Cp4.1LG18g02150 vs. TrEMBL
Match: A0A0D2SD80_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G341700 PE=4 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 3.5e-39
Identity = 85/157 (54.14%), Positives = 107/157 (68.15%), Query Frame = 1

Query: 9   SPFSLFLLILAISTQTHLSFSSIDKPLNP--------TDIHELLLRYGFPAGLLPNNVKS 68
           SP   +LL L++ +QT LSFS+   P++          D+H+LL +YG P G++PNNVKS
Sbjct: 46  SPICFYLLFLSLLSQTQLSFSTTHTPIDHGRPLVTSIADVHDLLPKYGLPKGIVPNNVKS 105

Query: 69  YTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGF 128
           YTLS  G F IELE+ CYV   +LVYY+K I+GKLSYG+V DVSGIQ KKLFLW+ V+  
Sbjct: 106 YTLSSTGDFTIELESTCYVQFDDLVYYDKTISGKLSYGAVHDVSGIQAKKLFLWLPVTAI 165

Query: 129 KSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACL 158
           + N  SG + FFVG  S+ FPA  F  IP CRRKA L
Sbjct: 166 EVNGKSGMVQFFVGAFSKEFPAVQFEDIPVCRRKAVL 202

BLAST of Cp4.1LG18g02150 vs. TAIR10
Match: AT1G55265.1 (AT1G55265.1 Protein of unknown function, DUF538)

HSP 1 Score: 149.8 bits (377), Expect = 1.4e-36
Identity = 81/157 (51.59%), Positives = 105/157 (66.88%), Query Frame = 1

Query: 13  LFLLILAISTQTHLS------------FSSI--DKP-LNPTDIHELLLRYGFPAGLLPNN 72
           LFL +L++S+  +L             FSS+  D+P L   DIH+LL RYGFP GLLPNN
Sbjct: 12  LFLSLLSLSSSLNLRRPIFSQSNDLDLFSSLNLDRPSLAADDIHDLLPRYGFPKGLLPNN 71

Query: 73  VKSYTLSDDGAFEIELETDCYVMLSE-LVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVS 132
           VKSYT+SDDG F ++L + CYV  S+ LV+Y K I GKLSYGSV DV GIQ K+ FLW+ 
Sbjct: 72  VKSYTISDDGDFTVDLISSCYVKFSDQLVFYGKNIAGKLSYGSVKDVRGIQAKEAFLWLP 131

Query: 133 VSGFKSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRR 154
           ++  +S+  S T+VF VG +S+T PA MF  +P C R
Sbjct: 132 ITAMESDPSSATVVFSVGFVSKTLPASMFENVPSCSR 168

BLAST of Cp4.1LG18g02150 vs. TAIR10
Match: AT5G19860.1 (AT5G19860.1 Protein of unknown function, DUF538)

HSP 1 Score: 112.5 bits (280), Expect = 2.5e-25
Identity = 61/154 (39.61%), Positives = 92/154 (59.74%), Query Frame = 1

Query: 8   LSPF--SLFLLILAISTQTHLSFSSIDKPLNPTDIHELLLRYGFPAGLLPNNVKSYTLSD 67
           L PF  S+F+  L + T T  S +  D P + + ++ELL +YG P+GLLP+ V  +TLSD
Sbjct: 3   LYPFFISIFIFSLTLFTTTTHSLNEPD-PDSISTVYELLPKYGLPSGLLPDTVTDFTLSD 62

Query: 68  DGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGFKSN-Q 127
           DG F + L   C +    LV+Y+K I+G++ YGS+T++ GIQVKK F+W+ V   K +  
Sbjct: 63  DGRFVVHLPNSCEIEFDYLVHYDKTISGRIGYGSITELKGIQVKKFFIWLDVDEIKVDLP 122

Query: 128 GSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLG 159
            S +I F VG +++    + F+TI  C      G
Sbjct: 123 PSDSIYFKVGFINKKLDIDQFKTIHSCHDNGVSG 155

BLAST of Cp4.1LG18g02150 vs. TAIR10
Match: AT5G54530.1 (AT5G54530.1 Protein of unknown function, DUF538)

HSP 1 Score: 94.4 bits (233), Expect = 7.2e-20
Identity = 53/139 (38.13%), Positives = 74/139 (53.24%), Query Frame = 1

Query: 15  LLILAISTQTHLSFSSIDKPLNPTDIHELLLRYGFPAGLLPNNVKSYTLSDDGAFEIELE 74
           +++L  + +  LS SS   P  PT +H++L   G PAGLLP  V SY L +DG  E+ L 
Sbjct: 8   MILLLTTLRLSLSLSS---PSYPT-VHDVLRSEGLPAGLLPQEVDSYILHNDGRLEVFLA 67

Query: 75  TDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGF-KSNQGSGTIVFFV 134
             CY      V++E  + G LSYGS+  V G+  K+LFLW+ V      N  SG IVF +
Sbjct: 68  APCYAKFETNVHFEAVVRGNLSYGSLVGVEGLSQKELFLWLQVKDIVVENPNSGVIVFDI 127

Query: 135 GPLSETFPAEMFRTIPGCR 153
           G   +     +F   P C+
Sbjct: 128 GVAFKQLSLSLFEDPPKCK 142

BLAST of Cp4.1LG18g02150 vs. TAIR10
Match: AT1G61667.1 (AT1G61667.1 Protein of unknown function, DUF538)

HSP 1 Score: 84.0 bits (206), Expect = 9.7e-17
Identity = 41/120 (34.17%), Positives = 64/120 (53.33%), Query Frame = 1

Query: 34  PLNPTDIHELLLRYGFPAGLLPNNVKSYTLSDD-GAFEIELETDCYVMLSELVYYEKKIT 93
           P + + I  LL   G P GL P+NV+SY+L D  G  E++L+  C+      VY+++ I 
Sbjct: 16  PSSQSSIRNLLEARGLPGGLFPDNVESYSLDDKTGELEVQLQNPCFARFENRVYFDRVIK 75

Query: 94  GKLSYGSVTDVSGIQVKKLFLWVSVSGFKSNQ-GSGTIVFFVGPLSETFPAEMFRTIPGC 152
             LSYG +  + G+  ++LFLW+ V G   N   SG ++F +G   +     +F   P C
Sbjct: 76  ANLSYGGLVGLEGLTQEELFLWLPVKGIAVNDPSSGLVLFDIGVAHKQISRSLFEDPPVC 135

BLAST of Cp4.1LG18g02150 vs. TAIR10
Match: AT3G07460.2 (AT3G07460.2 Protein of unknown function, DUF538)

HSP 1 Score: 79.3 bits (194), Expect = 2.4e-15
Identity = 38/114 (33.33%), Positives = 63/114 (55.26%), Query Frame = 1

Query: 40  IHELLLRYGFPAGLLPNNVKSYTLSDD-GAFEIELETDCYVMLSELVYYEKKITGKLSYG 99
           I E+LL  G P GL P  VK +T++ + G F + L   C       ++Y++ ++G + Y 
Sbjct: 30  IDEILLANGLPLGLFPKGVKGFTVNGETGRFSVYLNQSCQAKYETELHYDEIVSGTIGYA 89

Query: 100 SVTDVSGIQVKKLFLWVSVSGFKSNQGSGTIVFF-VGPLSETFPAEMFRTIPGC 152
            + D+SGI  ++LFLW+ V G + +  S  ++FF VG L + +   +F T   C
Sbjct: 90  QIRDLSGISAQELFLWLQVKGIRVDVPSSGLIFFDVGVLRKQYSLSLFETPRDC 143

BLAST of Cp4.1LG18g02150 vs. NCBI nr
Match: gi|659098800|ref|XP_008450299.1| (PREDICTED: uncharacterized protein LOC103491951 [Cucumis melo])

HSP 1 Score: 216.1 bits (549), Expect = 4.6e-53
Identity = 109/164 (66.46%), Positives = 129/164 (78.66%), Query Frame = 1

Query: 1   MASFSRNLSPFSLFLLILAISTQTHLSFSSIDKPLNPTDIHELLLRYGFPAGLLPNNVKS 60
           MASFSR LSPFSLFLLIL ISTQTHLSFS+ D  L  +DIH+LL  YGFP GLLP+NVKS
Sbjct: 1   MASFSRILSPFSLFLLILVISTQTHLSFSARDLLLKSSDIHDLLPLYGFPVGLLPSNVKS 60

Query: 61  YTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGF 120
           YTLSDDG+F IEL++ CYV  ++LVYY K I GKLSYGS++DVSGIQVKKLF W+ ++G 
Sbjct: 61  YTLSDDGSFVIELDSACYVQFADLVYYGKTIKGKLSYGSLSDVSGIQVKKLFAWLPITGM 120

Query: 121 KSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRTEAI 165
           +    S +I F VG LSE  P  MF +IP CR+KACL G+TEA+
Sbjct: 121 RVTSDSKSIEFQVGFLSEALPFSMFESIPTCRKKACLEGKTEAV 164

BLAST of Cp4.1LG18g02150 vs. NCBI nr
Match: gi|449442610|ref|XP_004139074.1| (PREDICTED: uncharacterized protein LOC101202755 [Cucumis sativus])

HSP 1 Score: 210.3 bits (534), Expect = 2.5e-51
Identity = 107/164 (65.24%), Positives = 125/164 (76.22%), Query Frame = 1

Query: 1   MASFSRNLSPFSLFLLILAISTQTHLSFSSIDKPLNPTDIHELLLRYGFPAGLLPNNVKS 60
           MASFS  LSPFSLFLLIL  STQTHLSFS+ D PL  +DIH+LL  YGFP GLLP+NV S
Sbjct: 1   MASFSTILSPFSLFLLILLFSTQTHLSFSARDFPLRSSDIHDLLPLYGFPVGLLPDNVNS 60

Query: 61  YTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGF 120
           YTLSDDG FEI+L++ CYV  S+LVYY K I GKLS  S++DVSGI+VKKLF W+ ++G 
Sbjct: 61  YTLSDDGTFEIQLQSSCYVHFSDLVYYGKNIKGKLSNRSLSDVSGIEVKKLFAWLPITGI 120

Query: 121 KSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRTEAI 165
           K    S +I F VG LSE  P  MF +IP CRRKACL G+TEA+
Sbjct: 121 KVTPDSKSIEFAVGFLSEILPVSMFESIPTCRRKACLEGKTEAM 164

BLAST of Cp4.1LG18g02150 vs. NCBI nr
Match: gi|1009123863|ref|XP_015878766.1| (PREDICTED: uncharacterized protein LOC107415025 [Ziziphus jujuba])

HSP 1 Score: 183.3 bits (464), Expect = 3.3e-43
Identity = 89/154 (57.79%), Positives = 116/154 (75.32%), Query Frame = 1

Query: 12  SLFLLILAISTQTHLSFSSIDKPLN----PTDIHELLLRYGFPAGLLPNNVKSYTLSDDG 71
           S+ LL++ + +QT+LS S+   PL+     TDIHELL +YGFP G+LPNN+KSYTLS+DG
Sbjct: 11  SVCLLLVTLFSQTNLSLSTTTSPLHIEESVTDIHELLPKYGFPKGILPNNIKSYTLSEDG 70

Query: 72  AFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGFKSNQGSG 131
            FEI L + CYV   +LVYY+KKITGKL+YGSV++VSGIQ KKLFLW+SV+G K+++ SG
Sbjct: 71  YFEIYLLSPCYVQFDQLVYYDKKITGKLTYGSVSNVSGIQTKKLFLWLSVTGIKADKDSG 130

Query: 132 TIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRT 162
            I F+VG LSE  PA  F  +P C+ KAC G  +
Sbjct: 131 MIEFYVGSLSERLPANQFEQVPTCKAKACHGSES 164

BLAST of Cp4.1LG18g02150 vs. NCBI nr
Match: gi|657951224|ref|XP_008351693.1| (PREDICTED: uncharacterized protein LOC103415116 [Malus domestica])

HSP 1 Score: 178.7 bits (452), Expect = 8.2e-42
Identity = 92/174 (52.87%), Positives = 120/174 (68.97%), Query Frame = 1

Query: 1   MASFSRNLSPFSLFLLILAISTQTHLSFSSIDKPLNPT----------DIHELLLRYGFP 60
           MAS S  +S   L+LL+L + ++THL+ S  D   NP           D+H+LL +YG P
Sbjct: 1   MASASAQIS---LYLLLLTLFSKTHLTLSLRDLKPNPNGSSNTSQSIADVHDLLPQYGLP 60

Query: 61  AGLLPNNVKSYTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKK 120
            GLLPNNVKSYTL++DG+FEI LE+ CYV   +LVYY K I GKLSYG V+DVSGIQ KK
Sbjct: 61  KGLLPNNVKSYTLTEDGSFEIFLESPCYVHFDQLVYYNKHIKGKLSYGEVSDVSGIQAKK 120

Query: 121 LFLWVSVSGFKSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRTEAI 165
           LF+WVSV+G   ++GS ++ F+VG LSE  PA+ F  IP C+ KAC G R+ ++
Sbjct: 121 LFIWVSVTGIHRDKGSDSVEFYVGALSEKLPAKQFEDIPDCKNKACHGIRSGSV 171

BLAST of Cp4.1LG18g02150 vs. NCBI nr
Match: gi|595830172|ref|XP_007206017.1| (hypothetical protein PRUPE_ppa012400mg [Prunus persica])

HSP 1 Score: 177.6 bits (449), Expect = 1.8e-41
Identity = 88/168 (52.38%), Positives = 115/168 (68.45%), Query Frame = 1

Query: 7   NLSPFSLFLLILAISTQTHLSFSSIDKPLNP----------TDIHELLLRYGFPAGLLPN 66
           +++  S +LLIL   ++TH   S  D   +P          TD+H+LL +YG P GLLP+
Sbjct: 4   SIAQISFYLLILTFFSETHFGLSLRDLKSDPNRPSTTAQSITDVHDLLPKYGLPKGLLPD 63

Query: 67  NVKSYTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVS 126
           NV SYTLS+DG+FEI LE+ CYV   +LVYY K I GKLSYGSV+DVSGIQ KKLF+WVS
Sbjct: 64  NVNSYTLSEDGSFEIYLESPCYVHFDQLVYYNKNIKGKLSYGSVSDVSGIQAKKLFIWVS 123

Query: 127 VSGFKSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRTEAI 165
           V+G + +QGS ++ F+VG LSE  PA+ F  IP C+ KAC G   ++I
Sbjct: 124 VTGIQVDQGSDSVEFYVGALSEKLPAKQFEDIPVCKSKACQGTYVDSI 171

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name          E-value   Identity (%)   Description
A0A0A0M0C4_CUCSA    1.8e-51   65.24          Uncharacterized protein OS=Cucumis sativus GN=Csa_1G630320 PE=4 SV=1
M5W1G2_PRUPE        1.3e-41   52.38          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa012400mg PE=4 SV=1
A0A061GS48_THECC    2.2e-41   56.52          Uncharacterized protein isoform 2 (Fragment) OS=Theobroma cacao GN=TCM_040161 PE...
A0A061GR70_THECC    4.8e-41   57.32          Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_040161 PE=4 SV=1
A0A0D2SD80_GOSRA    3.5e-39   54.14          Uncharacterized protein OS=Gossypium raimondii GN=B456_009G341700 PE=4 SV=1

vs. TAIR10
Match Name          E-value   Identity (%)   Description
AT1G55265.1         1.4e-36   51.59          Protein of unknown function, DUF538
AT5G19860.1         2.5e-25   39.61          Protein of unknown function, DUF538
AT5G54530.1         7.2e-20   38.13          Protein of unknown function, DUF538
AT1G61667.1         9.7e-17   34.17          Protein of unknown function, DUF538
AT3G07460.2         2.4e-15   33.33          Protein of unknown function, DUF538

vs. NCBI nr
Match Name                           E-value   Identity (%)   Description
gi|659098800|ref|XP_008450299.1|     4.6e-53   66.46          PREDICTED: uncharacterized protein LOC103491951 [Cucumis melo]
gi|449442610|ref|XP_004139074.1|     2.5e-51   65.24          PREDICTED: uncharacterized protein LOC101202755 [Cucumis sativus]
gi|1009123863|ref|XP_015878766.1|    3.3e-43   57.79          PREDICTED: uncharacterized protein LOC107415025 [Ziziphus jujuba]
gi|657951224|ref|XP_008351693.1|     8.2e-42   52.87          PREDICTED: uncharacterized protein LOC103415116 [Malus domestica]
gi|595830172|ref|XP_007206017.1|     1.8e-41   52.38          hypothetical protein PRUPE_ppa012400mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR007493   DUF538
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0019344       cysteine biosynthetic process
biological_process   GO:0009765       photosynthesis, light harvesting
biological_process   GO:0018298       protein-chromophore linkage
biological_process   GO:0009735       response to cytokinin
cellular_component   GO:0005575       cellular_component
cellular_component   GO:0009535       chloroplast thylakoid membrane
cellular_component   GO:0016021       integral component of membrane
cellular_component   GO:0009522       photosystem I
cellular_component   GO:0009523       photosystem II
cellular_component   GO:0010287       plastoglobule
molecular_function   GO:0003674       molecular_function
molecular_function   GO:0016168       chlorophyll binding
molecular_function   GO:0046872       metal ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG18g02150.1   Cp4.1LG18g02150.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                      Source    Source Term         Source Description   Alignment
IPR007493   Protein of unknown function DUF538   GENE3D    G3DSA:2.30.240.10                        coord: 34..154; score: 1.9
IPR007493   Protein of unknown function DUF538   PFAM      PF04398             DUF538               coord: 41..145; score: 2.0
IPR007493   Protein of unknown function DUF538   unknown   SSF141562           At5g01610-like       coord: 13..154; score: 6.8
None        No IPR available                     PANTHER   PTHR31676           FAMILY NOT NAMED     coord: 7..164; score: 5.4
None        No IPR available                     PANTHER   PTHR31676:SF6       SUBFAMILY NOT NAMED  coord: 7..164; score: 5.4
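
The `coord` values can be used to pull the matched region out of the protein sequence. The sketch below assumes they are 1-based, inclusive residue positions on the protein listed earlier in this record (the page does not state the convention); `slice_1based` is a hypothetical helper name.

```python
def slice_1based(seq, start, end):
    """Return seq[start..end] using 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return seq[start - 1:end]

# Protein sequence copied from the 'Protein sequence' section of this record.
protein = (
    "MASFSRNLSPFSLFLLILAISTQTHLSFSSIDKPLNPTDIHELLLRYGFPAGLLPNNVKS"
    "YTLSDDGAFEIELETDCYVMLSELVYYEKKITGKLSYGSVTDVSGIQVKKLFLWVSVSGF"
    "KSNQGSGTIVFFVGPLSETFPAEMFRTIPGCRRKACLGGRTEAI"
)

# PFAM PF04398 (DUF538) match, coord: 41..145
pfam_region = slice_1based(protein, 41, 145)
print(len(pfam_region))
print(pfam_region)
```

Under that assumption the PFAM DUF538 match covers 105 residues, from the `HELLL...` stretch through `...PAEMF`.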

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                    Block
Cp4.1LG18g02150   Cp4.1LG04g12570   Cucurbita pepo (Zucchini)   cpecpeB363

The following block(s) are covering this gene:
Gene              Organism                       Block
Cp4.1LG18g02150   Cucurbita pepo (Zucchini)      cpecpeB369
Cp4.1LG18g02150   Cucurbita maxima (Rimu)        cmacpeB126
Cp4.1LG18g02150   Cucurbita maxima (Rimu)        cmacpeB632
Cp4.1LG18g02150   Cucurbita moschata (Rifu)      cmocpeB104
Cp4.1LG18g02150   Cucurbita moschata (Rifu)      cmocpeB581
Cp4.1LG18g02150   Bottle gourd (USVL1VR-Ls)      cpelsiB299
Cp4.1LG18g02150   Watermelon (Charleston Gray)   cpewcgB327
Cp4.1LG18g02150   Watermelon (97103) v1          cpewmB356