Cp4.1LG18g04910 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG18g04910
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: DUF4050 family protein
Location: Cp4.1LG18: 5748551 .. 5751595 (+)
RNA-Seq Expression: Cp4.1LG18g04910
Synteny: Cp4.1LG18g04910
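
The Location field packs the linkage group, start, end and strand into one string. A minimal sketch of how it could be parsed follows. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG18: 5748551 .. 5751595 (+)")
# Inclusive span length, valid only under the 1-based inclusive assumption.
span = end - start + 1
print(seqid, start, end, strand, span)
```

Under that assumption the gene spans 3045 bp on the forward strand.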
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTAACCCAGTGGGGAGTGGGGGCCAGAGCCACACATAAAAGGGAGTATAAACCACCAACAAAAATCCATCACTTGAACTATCCATATTACCCCCGGAGGAATTCATTACTCCACATCGATACCCATCTCTTTTTCTTTTCTGATTTCTGAGTTTCTAGGAAACCCACCCATTTTCTTTTATGGTATTTGGTTGATTCTCACTTCCACTCGAGTTTTCCCTTTTTCTCTCATCTAACTTTCAGTTCTCTTGGCCCCAAAGTCCACTCTATCCCTTTCCAGTTTGCTGGGTTTTGGGTTCCCATTGGTTTTCCCTGAATCGATGCCCTTCTCTGCGCTTTCGGCTCATCATCAACATGGTCATGCTGAATAGTTCCTTCGCCGCGTGGATCAGCCGCTTGTTCGCTTGCATGGGGTAAGTTCTTTATCTTCCTCACTTATCATTCAATGCAGTTTTGGAAAGGACTTATATGTAATCCTTGATCCTAATGGAGTTCTGTGATGCCCCTTTTGTGTATCCCTTGTGAATGATTCTGTTTTTTGACTTACTGAGCAGTACACAGTTTTCTTTGGGCATCTTTGATTAATATGAAAGAATTTTGATTACTTATTGGTTTTACAGAGCTTGTTGAGTATTCAATTTTGATTGTGTATATTCCAAGTTGTGCGGAAATGGGGCCAATAATATGAACTAGTTCATTTCTTGTTTGTTTTCTGGGCGAAGAAAGTGAAAAGCCAAGTCTTATTCCTTTATGAGGTTGCCCTAACAACTTTATCCCCGATCAAAAAGGTCAAATACTAAAGAATCCATTATCATCCATCATAGATTATTATAACTTTTGTTCATTTACGAGAAGGAAGTAGAGCTTAGAGCTCTACTTTTTGAACATAATTCAAAAACTGCTTGTGAACATAATTCTTTTTGAACATAATTCTTTTGCTTTCATATTGCATTACCCTTGTGCTGTGTGAATCCATGGAGTTGTCAAGAGGGTATTAGCCGCCTAATACTTGTCTGTTGTAGACTGTAGCCTTAAGATTATAGTTTTCTTGTGCTGATTTAGTAGAAAGAATTGATATCAAGGTGGATCCTTGAAAAAATGAAACTAAATTCATTAATATTCGAAGTTTATGCTTGGAAGTATGAAACCCCCACACTTCTCTTAGTTCTGTAGAAGCTGAATTTTGTTTCTGAATTGTAGTTGAGATGTTTAATTGCTGCACAATCATTCCCAGTTTCCCTTAGGCTCTTCTTTCGTTTTTCGCCCTGTTTAAATGGTGTTATTGCTTACAACTTCATGAGTAAATGCGTAATCATTGTTGTTGTAATAGCCAAAACCCACCGTTGGCAGATATTGTCCTCTTTAGACTTTCCCTTCCGGACTTCCCCTCAAAGTTTTAAAACGCGTCCACTAGAGAGAGGTTTCCACACCCTTATAACTGATGTGGGAATGTCGTAGTTGTCATCCAACCTATGTTGTACTATAATATTCGGTTGCCATGGTTTTTAACGTGCACGTTCTTTTTTCCGAAGTGTTCTGCTTGTGTTCATGGTGACATCCATTCAATTCATCATGAGAACTCGAGCAATTTAGGATTAATCAATGATCATTCTACATTCAATGTAGAACGTGCGTTCTCATAATTTCTATTGTTAATAGTTGTGTGTGTGCATCTAACCATAAATGAAATATTTGACAACTGCCTTGCATTATTGATGAATGGTGCACATGCTATGTTTTTGCTGCAATTTTATAGCATATACTGCTTTATTAGGTTTATAACAGTGCAGGGGTTGTTTTGGATGCTGCACTAAGCCCACACCTATTATTGCTGTGGATGAGCCATCTAAGGGATTAAGAATTCAAGGACGAGTAGTCAAGAAACCGAGCATATCCGATGGTTTTTGGAGCACAAGCACGTGTGATTTGGATAACAGCACCATTCAGTCTCAACGAAGCATCTCGTCTATCAGTACATCAAACCTCACACACAGTCAA
AGCAATGTTGGTGGCAGTGTGAGCAACCCTTCTGAATATGTAAACCATGGCAAGTTTCCCTCTTTATCATCTGATCTTTGTTGAAGTCGTGGGTAACATTCGAGATAATTAACTGCATTTAACGAGCTATCGTTTCTTCATTATAACCTGGTCAATATGGAGACGAGCAATCTACTTATCTTCATTGCTTTCAGGTTTGCTTCTCTGGAACCAGACTAGGTTGCAGTGGATTGGAAGTAGTAATACAAACACAACGGATGAAACTCAAGAACGACAGAAGGCAAAAATCAGGTCAGTTTACGAGCTTCGTGTGCCTTTGTAAATAGTAGGTATCATAGAGGTATGAAAGGTGTATTAATCATCATACCATGCGTATACGTATGCATTTGTATGCAGTTGGCGTGCAACATATGACAGTTTACTGGGTACGAGACAGCCTTTTCCCCATCGAATTCCTTTGTCGGTAAGTTCGCTGTGAAGTTTTGTCATTTTTTCATTCTCCAATCACTGTTCGTTCTACGAATAAGAGATTCAAAATCTTGGTTTGGACAGGAAATGGTGAACTTTCTTGTTGAAGTATGGGAACAAGAGGGCCTATATGATTGAAACTGGTTTTTATTTTGGATACATTCCTTGAATCTCTAGGAAGCTTCTCAGTGTACGGATTGCAAAAGGAGCAAAAAGGGTGTTTTTCTTTATCTTCATCTTTCTCTTTATCTTTATCAATCCTCCTGCATTTTTCAGGTGTACAAATGTATTAACACCATCATGCGCTCGAGCGCTTTTGGGTTCTTAAAACCGAGACAGCAGAAGAAGAACTCGATACTCAATATAGAATGATAGATAGATAGATAGATATCTACCATTTTGTATGCATTCTGCTTCGGTTTTCTCCGCTGTTCATTCTATGAATAAGAACACTTTGAAACTCTCCGCCATTCACTACCTAGTTTCAAAACTTTTCAAGTTAAGTGAAGTTGGATGGAAATTTTAGGGAAAAAAATGATTTGAATTGGAATTTGGTATTGTTTTTATTGAGCCAACC

mRNA sequence

ATGTTCTCTTGGCCCCAAAGTCCACTCTATCCCTTTCCAGTTTGCTGGGTTTTGGGTTCCCATTGGTTTTCCCTGAATCGATGCCCTTCTCTGCGCTTTCGGCTCATCATCAACATGGTCATGCTGAATAGTTCCTTCGCCGCGTGGATCAGCCGCTTGTTCGCTTGCATGGGGGGTTGTTTTGGATGCTGCACTAAGCCCACACCTATTATTGCTGTGGATGAGCCATCTAAGGGATTAAGAATTCAAGGACGAGTAGTCAAGAAACCGAGCATATCCGATGGTTTTTGGAGCACAAGCACGTGTGATTTGGATAACAGCACCATTCAGTCTCAACGAAGCATCTCGTCTATCAGTACATCAAACCTCACACACAGTCAAAGCAATGTTGGTGGCAGTGTGAGCAACCCTTCTGAATATGTAAACCATGGCAAACGAGCAATCTACTTATCTTCATTGCTTTCAGGTTTGCTTCTCTGGAACCAGACTAGGTTGCAGTGGATTGGAAGTAGTAATACAAACACAACGGATGAAACTCAAGAACGACAGAAGGCAAAAATCAGTTGGCGTGCAACATATGACAGTTTACTGGGTACGAGACAGCCTTTTCCCCATCGAATTCCTTTGTCGGAAATGGTGAACTTTCTTGTTGAAGTATGGGAACAAGAGGGCCTATATGATTGAAACTGGTTTTTATTTTGGATACATTCCTTGAATCTCTAGGAAGCTTCTCAGTGTACGGATTGCAAAAGGAGCAAAAAGGGTGTTTTTCTTTATCTTCATCTTTCTCTTTATCTTTATCAATCCTCCTGCATTTTTCAGGTGTACAAATGTATTAACACCATCATGCGCTCGAGCGCTTTTGGGTTCTTAAAACCGAGACAGCAGAAGAAGAACTCGATACTCAATATAGAATGATAGATAGATAGATAGATATCTACCATTTTGTATGCATTCTGCTTCGGTTTTCTCCGCTGTTCATTCTATGAATAAGAACACTTTGAAACTCTCCGCCATTCACTACCTAGTTTCAAAACTTTTCAAGTTAAGTGAAGTTGGATGGAAATTTTAGGGAAAAAAATGATTTGAATTGGAATTTGGTATTGTTTTTATTGAGCCAACC

Coding sequence (CDS)

ATGTTCTCTTGGCCCCAAAGTCCACTCTATCCCTTTCCAGTTTGCTGGGTTTTGGGTTCCCATTGGTTTTCCCTGAATCGATGCCCTTCTCTGCGCTTTCGGCTCATCATCAACATGGTCATGCTGAATAGTTCCTTCGCCGCGTGGATCAGCCGCTTGTTCGCTTGCATGGGGGGTTGTTTTGGATGCTGCACTAAGCCCACACCTATTATTGCTGTGGATGAGCCATCTAAGGGATTAAGAATTCAAGGACGAGTAGTCAAGAAACCGAGCATATCCGATGGTTTTTGGAGCACAAGCACGTGTGATTTGGATAACAGCACCATTCAGTCTCAACGAAGCATCTCGTCTATCAGTACATCAAACCTCACACACAGTCAAAGCAATGTTGGTGGCAGTGTGAGCAACCCTTCTGAATATGTAAACCATGGCAAACGAGCAATCTACTTATCTTCATTGCTTTCAGGTTTGCTTCTCTGGAACCAGACTAGGTTGCAGTGGATTGGAAGTAGTAATACAAACACAACGGATGAAACTCAAGAACGACAGAAGGCAAAAATCAGTTGGCGTGCAACATATGACAGTTTACTGGGTACGAGACAGCCTTTTCCCCATCGAATTCCTTTGTCGGAAATGGTGAACTTTCTTGTTGAAGTATGGGAACAAGAGGGCCTATATGATTGA

Protein sequence

MFSWPQSPLYPFPVCWVLGSHWFSLNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLLLWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD
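
The protein sequence should be the conceptual translation of the CDS. As a rough check, the sketch below translates short excerpts copied from the start and end of the CDS above, assuming the standard genetic code (NCBI translation table 1). The `translate` helper is illustrative only and is not how the annotation pipeline produced the sequence.

```python
# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTTCTCTTGGCCCCAAAGT"   # first 21 nt of the CDS above
cds_end = "CAAGAGGGCCTATATGATTGA"     # last 21 nt of the CDS above
print(translate(cds_start))  # MFSWPQS
print(translate(cds_end))    # QEGLYD
```

Both excerpts agree with the corresponding ends of the protein sequence (MFSWPQS... and ...QEGLYD).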
Homology
BLAST of Cp4.1LG18g04910 vs. NCBI nr
Match: KAG6589849.1 (hypothetical protein SDJN03_15272, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 381 bits (979), Expect = 1.81e-131
Identity = 192/205 (93.66%), Positives = 192/205 (93.66%), Query Frame = 0

Query: 23  FSLNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRI 82
           F LNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRI
Sbjct: 86  FYLNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRI 145

Query: 83  QGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVN 142
           QGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVN
Sbjct: 146 QGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVN 205

Query: 143 HGKRAIYLSSLLSGLLLWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQP 202
           HG            LLLWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQP
Sbjct: 206 HG------------LLLWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQP 265

Query: 203 FPHRIPLSEMVNFLVEVWEQEGLYD 227
           FPHRIPLSEMVNFLVEVWEQEGLYD
Sbjct: 266 FPHRIPLSEMVNFLVEVWEQEGLYD 278
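
The percentages in each HSP header are the match count divided by the alignment length, gaps included. A small sketch for recomputing them from a header line is shown below. `hsp_fractions` is a hypothetical helper, and the regular expression assumes the `label = n/m` layout used on this page.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for each 'label = n/m' field."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        n, d = int(num), int(den)
        stats[label] = (n, d, round(100 * n / d, 2))
    return stats

# Identity field from the first hit (KAG6589849.1).
stats = hsp_fractions("Identity = 192/205 (93.66%)")
print(stats)
```

For the hit above, 192 identical positions over an alignment length of 205 (which includes the 12-residue gap) gives 93.66%.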

BLAST of Cp4.1LG18g04910 vs. NCBI nr
Match: KAG7023519.1 (hypothetical protein SDJN02_14545, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 355 bits (912), Expect = 2.45e-122
Identity = 184/223 (82.51%), Positives = 185/223 (82.96%), Query Frame = 0

Query: 5   PQSPLYPFPVCWVLGSHWFSLNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCC 64
           P+    PF          F LNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCC
Sbjct: 9   PKVHSIPFQFAGFWVPIGFYLNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCC 68

Query: 65  TKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT 124
           TKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT
Sbjct: 69  TKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT 128

Query: 125 HSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLLLWNQTRLQWIGSSNTNTTDETQERQK 184
           HSQSNVGGS                       LLLWNQTRLQWIGSSNTNTTDETQERQK
Sbjct: 129 HSQSNVGGS-----------------------LLLWNQTRLQWIGSSNTNTTDETQERQK 188

Query: 185 AKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD 227
           AKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD
Sbjct: 189 AKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD 208

BLAST of Cp4.1LG18g04910 vs. NCBI nr
Match: XP_023516635.1 (uncharacterized protein LOC111780448 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 351 bits (901), Expect = 3.88e-121
Identity = 177/189 (93.65%), Positives = 177/189 (93.65%), Query Frame = 0

Query: 39  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 98
           MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS
Sbjct: 1   MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 60

Query: 99  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLL 158
           TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHG            LL
Sbjct: 61  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHG------------LL 120

Query: 159 LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE 218
           LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE
Sbjct: 121 LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE 177

Query: 219 VWEQEGLYD 227
           VWEQEGLYD
Sbjct: 181 VWEQEGLYD 177

BLAST of Cp4.1LG18g04910 vs. NCBI nr
Match: XP_022961227.1 (uncharacterized protein LOC111461800 [Cucurbita moschata])

HSP 1 Score: 349 bits (895), Expect = 3.19e-120
Identity = 176/189 (93.12%), Positives = 176/189 (93.12%), Query Frame = 0

Query: 39  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 98
           MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS
Sbjct: 1   MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 60

Query: 99  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLL 158
           TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHG            LL
Sbjct: 61  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHG------------LL 120

Query: 159 LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE 218
           LWNQTRLQWIGSSNTNTTDET ERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE
Sbjct: 121 LWNQTRLQWIGSSNTNTTDETPERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE 177

Query: 219 VWEQEGLYD 227
           VWEQEGLYD
Sbjct: 181 VWEQEGLYD 177

BLAST of Cp4.1LG18g04910 vs. NCBI nr
Match: XP_022144929.1 (uncharacterized protein LOC111014486 [Momordica charantia])

HSP 1 Score: 348 bits (892), Expect = 1.32e-118
Identity = 177/211 (83.89%), Positives = 185/211 (87.68%), Query Frame = 0

Query: 17  VLGSHWFSLNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEP 76
           VLG H FS NRCP+ RF L INMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEP
Sbjct: 56  VLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEP 115

Query: 77  SKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSN 136
           SKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT + SNVGGS SN
Sbjct: 116 SKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGSTSN 175

Query: 137 PSEYVNHGKRAIYLSSLLSGLLLWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSL 196
           PSE+VNHG            LLLWNQ RLQW GSS+  TTD+TQ+R+KAKISWRATYDSL
Sbjct: 176 PSEFVNHG------------LLLWNQNRLQWTGSSS-KTTDQTQQRRKAKISWRATYDSL 235

Query: 197 LGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD 227
           LGTRQPFPH IPLSEMVNFLVEVWEQEGLYD
Sbjct: 236 LGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD 253

BLAST of Cp4.1LG18g04910 vs. ExPASy TrEMBL
Match: A0A6J1H9M2 (uncharacterized protein LOC111461800 OS=Cucurbita moschata OX=3662 GN=LOC111461800 PE=4 SV=1)

HSP 1 Score: 349 bits (895), Expect = 1.54e-120
Identity = 176/189 (93.12%), Positives = 176/189 (93.12%), Query Frame = 0

Query: 39  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 98
           MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS
Sbjct: 1   MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 60

Query: 99  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLL 158
           TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHG            LL
Sbjct: 61  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHG------------LL 120

Query: 159 LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE 218
           LWNQTRLQWIGSSNTNTTDET ERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE
Sbjct: 121 LWNQTRLQWIGSSNTNTTDETPERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE 177

Query: 219 VWEQEGLYD 227
           VWEQEGLYD
Sbjct: 181 VWEQEGLYD 177

BLAST of Cp4.1LG18g04910 vs. ExPASy TrEMBL
Match: A0A6J1CTQ5 (uncharacterized protein LOC111014486 OS=Momordica charantia OX=3673 GN=LOC111014486 PE=4 SV=1)

HSP 1 Score: 348 bits (892), Expect = 6.40e-119
Identity = 177/211 (83.89%), Positives = 185/211 (87.68%), Query Frame = 0

Query: 17  VLGSHWFSLNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEP 76
           VLG H FS NRCP+ RF L INMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEP
Sbjct: 56  VLGFHLFSQNRCPAFRFWLNINMVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEP 115

Query: 77  SKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSN 136
           SKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT + SNVGGS SN
Sbjct: 116 SKGLRIQGRIVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLNPSNVGGSTSN 175

Query: 137 PSEYVNHGKRAIYLSSLLSGLLLWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSL 196
           PSE+VNHG            LLLWNQ RLQW GSS+  TTD+TQ+R+KAKISWRATYDSL
Sbjct: 176 PSEFVNHG------------LLLWNQNRLQWTGSSS-KTTDQTQQRRKAKISWRATYDSL 235

Query: 197 LGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD 227
           LGTRQPFPH IPLSEMVNFLVEVWEQEGLYD
Sbjct: 236 LGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD 253

BLAST of Cp4.1LG18g04910 vs. ExPASy TrEMBL
Match: A0A6J1JKP7 (uncharacterized protein LOC111485294 OS=Cucurbita maxima OX=3661 GN=LOC111485294 PE=4 SV=1)

HSP 1 Score: 344 bits (883), Expect = 1.04e-118
Identity = 174/189 (92.06%), Positives = 175/189 (92.59%), Query Frame = 0

Query: 39  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 98
           MVMLNSSFAAWISRLFACMGGCFGCCTKPT IIAVDEPSKGLRIQGRVVKKPSISDGFWS
Sbjct: 1   MVMLNSSFAAWISRLFACMGGCFGCCTKPTHIIAVDEPSKGLRIQGRVVKKPSISDGFWS 60

Query: 99  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLL 158
           TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVG SVSNPSEYVNHG            LL
Sbjct: 61  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGASVSNPSEYVNHG------------LL 120

Query: 159 LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE 218
           LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYD+LLGTRQPFPHRIPLSEMVNFLVE
Sbjct: 121 LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDNLLGTRQPFPHRIPLSEMVNFLVE 177

Query: 219 VWEQEGLYD 227
           VWEQEGLYD
Sbjct: 181 VWEQEGLYD 177

BLAST of Cp4.1LG18g04910 vs. ExPASy TrEMBL
Match: A0A6J1I744 (uncharacterized protein LOC111470604 OS=Cucurbita maxima OX=3661 GN=LOC111470604 PE=4 SV=1)

HSP 1 Score: 330 bits (847), Expect = 4.21e-112
Identity = 177/226 (78.32%), Positives = 185/226 (81.86%), Query Frame = 0

Query: 3   SWPQSPLYPFPVCWVLGSHWFSLNRCPSLRFRLIINMVMLNSSFAAWISRLFACMGGCFG 62
           S P    Y F     L S  FSLNRCPSLRF L INMVMLNSSFAAWISRLFACMGGCFG
Sbjct: 43  STPSLASYRF-----LASLSFSLNRCPSLRFWLNINMVMLNSSFAAWISRLFACMGGCFG 102

Query: 63  CCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSN 122
           CCTKPTPIIAVDEPSKGLRIQGRVVKK SISDGFWSTSTCDLDNSTIQSQ SISSISTSN
Sbjct: 103 CCTKPTPIIAVDEPSKGLRIQGRVVKKRSISDGFWSTSTCDLDNSTIQSQPSISSISTSN 162

Query: 123 LTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLLLWNQTRLQWIG-SSNTNTTDETQE 182
           LT + SNVG SVSNPSE+VNHG            LLLWNQ RLQWIG SS++ TTD+TQ 
Sbjct: 163 LTLTHSNVGASVSNPSEFVNHG------------LLLWNQNRLQWIGNSSSSKTTDQTQL 222

Query: 183 RQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD 227
           ++KAKISWRATYDSLL TRQ FPH IPL+EMV FLVEVWEQEGLYD
Sbjct: 223 KRKAKISWRATYDSLLSTRQCFPHPIPLAEMVKFLVEVWEQEGLYD 251

BLAST of Cp4.1LG18g04910 vs. ExPASy TrEMBL
Match: A0A5A7U2N6 (DUF4050 family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold675G002060 PE=4 SV=1)

HSP 1 Score: 326 bits (836), Expect = 2.06e-111
Identity = 165/189 (87.30%), Positives = 171/189 (90.48%), Query Frame = 0

Query: 39  MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 98
           MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS
Sbjct: 1   MVMLNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWS 60

Query: 99  TSTCDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLL 158
           TSTCDLDNSTIQSQRSISSISTSNLT S SNV GSVS+ SE+VNHGK  +  +  L   L
Sbjct: 61  TSTCDLDNSTIQSQRSISSISTSNLTLSNSNVAGSVSSSSEFVNHGKFPLLSNPCL---L 120

Query: 159 LWNQTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVE 218
           LWNQTR+QWIGS  T  TDETQ+RQKAKISWRATYDSLLGTRQPFPH IPLSEMVNFLVE
Sbjct: 121 LWNQTRMQWIGSGTTKLTDETQQRQKAKISWRATYDSLLGTRQPFPHPIPLSEMVNFLVE 180

Query: 219 VWEQEGLYD 227
           VWEQEGLYD
Sbjct: 181 VWEQEGLYD 186

BLAST of Cp4.1LG18g04910 vs. TAIR 10
Match: AT5G25360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 209.1 bits (531), Expect = 3.6e-54
Identity = 107/186 (57.53%), Positives = 132/186 (70.97%), Query Frame = 0

Query: 42  LNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTST 101
           L     +WI +LF CMGGCFGCC KP  I+AVDEPSKGLRIQGR+VKKPS+S+ FWSTST
Sbjct: 3   LREIIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTST 62

Query: 102 CDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLLLWN 161
           C++DNST+QSQRS+SSIS +N T    +   S SNP+E+VNH            GL LWN
Sbjct: 63  CEMDNSTLQSQRSMSSISFTNNT----STSASTSNPTEFVNH------------GLNLWN 122

Query: 162 QTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVEVWE 221
           QTR QW+ +    T+ +  + ++  ISW ATY+SLLG  + F   IPL EMV+FLV+VWE
Sbjct: 123 QTRQQWLAN---GTSQKKAKVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWE 169

Query: 222 QEGLYD 228
           QEGLYD
Sbjct: 183 QEGLYD 169

BLAST of Cp4.1LG18g04910 vs. TAIR 10
Match: AT5G25360.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1). )

HSP 1 Score: 209.1 bits (531), Expect = 3.6e-54
Identity = 107/186 (57.53%), Positives = 132/186 (70.97%), Query Frame = 0

Query: 42  LNSSFAAWISRLFACMGGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTST 101
           L     +WI +LF CMGGCFGCC KP  I+AVDEPSKGLRIQGR+VKKPS+S+ FWSTST
Sbjct: 3   LREIIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTST 62

Query: 102 CDLDNSTIQSQRSISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLLLWN 161
           C++DNST+QSQRS+SSIS +N T    +   S SNP+E+VNH            GL LWN
Sbjct: 63  CEMDNSTLQSQRSMSSISFTNNT----STSASTSNPTEFVNH------------GLNLWN 122

Query: 162 QTRLQWIGSSNTNTTDETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVEVWE 221
           QTR QW+ +    T+ +  + ++  ISW ATY+SLLG  + F   IPL EMV+FLV+VWE
Sbjct: 123 QTRQQWLAN---GTSQKKAKVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWE 169

Query: 222 QEGLYD 228
           QEGLYD
Sbjct: 183 QEGLYD 169

BLAST of Cp4.1LG18g04910 vs. TAIR 10
Match: AT1G15350.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 137.5 bits (345), Expect = 1.3e-32
Identity = 82/174 (47.13%), Positives = 102/174 (58.62%), Query Frame = 0

Query: 57  MGGCFGCCT--KPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRS 116
           MGGC GC    + T     D PS  +    R  KKPS+S+ FWSTST D+DN T  SQ S
Sbjct: 1   MGGCVGCYREHRSTAASLKDPPSNSI---ARPCKKPSVSEDFWSTSTVDMDNITFPSQGS 60

Query: 117 ISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLLLWNQTRLQWIGSSNTN 176
           +SS   SN T    +   + + P EYVN             GLLLWNQTR +W+G    N
Sbjct: 61  LSS---SNQTFDSQSAARNSNAPPEYVN------------QGLLLWNQTRERWVGKDKPN 120

Query: 177 TTDETQERQKAKISWR-ATYDSLLGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD 228
             +     Q AK++W  ATYDSLLG+ + FP  IPL+EMV+FLV++WEQEGLYD
Sbjct: 121 --NPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 154

BLAST of Cp4.1LG18g04910 vs. TAIR 10
Match: AT1G15350.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 137.5 bits (345), Expect = 1.3e-32
Identity = 82/174 (47.13%), Positives = 102/174 (58.62%), Query Frame = 0

Query: 57  MGGCFGCCT--KPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRS 116
           MGGC GC    + T     D PS  +    R  KKPS+S+ FWSTST D+DN T  SQ S
Sbjct: 1   MGGCVGCYREHRSTAASLKDPPSNSI---ARPCKKPSVSEDFWSTSTVDMDNITFPSQGS 60

Query: 117 ISSISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLLLWNQTRLQWIGSSNTN 176
           +SS   SN T    +   + + P EYVN             GLLLWNQTR +W+G    N
Sbjct: 61  LSS---SNQTFDSQSAARNSNAPPEYVN------------QGLLLWNQTRERWVGKDKPN 120

Query: 177 TTDETQERQKAKISWR-ATYDSLLGTRQPFPHRIPLSEMVNFLVEVWEQEGLYD 228
             +     Q AK++W  ATYDSLLG+ + FP  IPL+EMV+FLV++WEQEGLYD
Sbjct: 121 --NPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 154

BLAST of Cp4.1LG18g04910 vs. TAIR 10
Match: AT4G32342.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25360.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 137.5 bits (345), Expect = 1.3e-32
Identity = 83/170 (48.82%), Positives = 102/170 (60.00%), Query Frame = 0

Query: 60  CFGCCTKPTP-IIAVDEPSKGLRIQGRVVKKPSI-SDGFWSTSTCDLD-NSTIQSQRSIS 119
           CFGCC +    ++ VDEPSKGL+IQG++VKK S  SD FWSTSTCD+D N TIQSQ S  
Sbjct: 17  CFGCCNRERRLVVEVDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNITIQSQSS-- 76

Query: 120 SISTSNLTHSQSNVGGSVSNPSEYVNHGKRAIYLSSLLSGLLLWNQTRLQWIGSSNTNTT 179
                   +   +   S SN +E+VNH            GL+LWN TR QW        T
Sbjct: 77  --------NPPFDPQCSTSNSTEFVNH------------GLILWNHTRQQW----RECLT 136

Query: 180 DETQERQKAKISWRATYDSLLGTRQPFPHRIPLSEMVNFLVEVWEQEGLY 227
            +     +  ISW +TYDSLL T + FP  IPL EMV+FLV+VWE+EGLY
Sbjct: 137 RQQCLVPEPAISWNSTYDSLLSTNKLFPQPIPLKEMVHFLVDVWEEEGLY 160

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
KAG6589849.1    1.81e-131  93.66     hypothetical protein SDJN03_15272, partial [Cucurbita argyrosperma subsp. sorori...
KAG7023519.1    2.45e-122  82.51     hypothetical protein SDJN02_14545, partial [Cucurbita argyrosperma subsp. argyro...
XP_023516635.1  3.88e-121  93.65     uncharacterized protein LOC111780448 [Cucurbita pepo subsp. pepo]
XP_022961227.1  3.19e-120  93.12     uncharacterized protein LOC111461800 [Cucurbita moschata]
XP_022144929.1  1.32e-118  83.89     uncharacterized protein LOC111014486 [Momordica charantia]

Match Name      E-value    Identity  Description
A0A6J1H9M2      1.54e-120  93.12     uncharacterized protein LOC111461800 OS=Cucurbita moschata OX=3662 GN=LOC1114618...
A0A6J1CTQ5      6.40e-119  83.89     uncharacterized protein LOC111014486 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A6J1JKP7      1.04e-118  92.06     uncharacterized protein LOC111485294 OS=Cucurbita maxima OX=3661 GN=LOC111485294...
A0A6J1I744      4.21e-112  78.32     uncharacterized protein LOC111470604 OS=Cucurbita maxima OX=3661 GN=LOC111470604...
A0A5A7U2N6      2.06e-111  87.30     DUF4050 family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold6...

Match Name      E-value    Identity  Description
AT5G25360.1     3.6e-54    57.53     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G25360.2     3.6e-54    57.53     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G15350.2     1.3e-32    47.13     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G15350.1     1.3e-32    47.13     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G32342.1     1.3e-32    48.82     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source   Source Term     Source Description     Alignment
IPR025124  Domain of unknown function DUF4050  PFAM     PF13259         DUF4050                coord: 186..227, e-value: 3.2E-12, score: 47.1
IPR025124  Domain of unknown function DUF4050  PFAM     PF13259         DUF4050                coord: 100..182, e-value: 2.7E-6, score: 27.8
None       No IPR available                    PANTHER  PTHR33373       OS07G0479600 PROTEIN   coord: 39..227
None       No IPR available                    PANTHER  PTHR33373:SF13  DUF4050 FAMILY PROTEIN coord: 39..227
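
The alignment coordinates can be turned into domain lengths and an overall coverage figure. The sketch below does this for the two PFAM (PF13259) hits. It assumes the coordinates are 1-based inclusive residue positions and takes the protein length of 227 from the query end coordinate in the BLAST alignments above. The variable names are illustrative.

```python
# PFAM PF13259 hit coordinates as listed in the InterPro table (assumed 1-based, inclusive).
pfam_hits = [(186, 227), (100, 182)]
protein_length = 227  # taken from the BLAST query end coordinate

# Length of each hit under the inclusive-coordinate assumption.
hit_lengths = [end - start + 1 for start, end in pfam_hits]

# Union of covered residues, so overlapping hits would not be counted twice.
covered = set()
for start, end in pfam_hits:
    covered.update(range(start, end + 1))

coverage_percent = round(100 * len(covered) / protein_length, 1)
print(hit_lengths, len(covered), coverage_percent)
```

Under these assumptions the two hits are 42 and 83 residues long and together cover 125 of 227 residues, about 55.1% of the protein.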

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG18g04910.1  Cp4.1LG18g04910.1  mRNA