Cp4.1LG07g02020 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG07g02020
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionselenoprotein H-like
LocationCp4.1LG07: 1603138 .. 1608614 (+)
RNA-Seq ExpressionCp4.1LG07g02020
SyntenyCp4.1LG07g02020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGATAACTATGTGTATTAAAATTGCTTTTTCCAAATAATGAGCTACTAATATCCAACTTTTTAAATTACGACGGACACATTTAAAATGATCCGGCAGAAAAAACACGTGGACGGAAATTTAGTGATTAAAAAAAAATGAATAAATTAACTCTTGGGACTTCGTGCCGCCATTTCCTTCCACAGAAGAAACCAAACTGGAACTGCGTGGACGTCGTACGACCACCCAATCAACGGCCACGATCCATTCTCATCCTCCAAATTCATTTCCACCGTCGCTTCAAAATCCCACCGCCGATACCAAAAGCGCGGGAAAACCAAACCTAATCCATCCCATATACCCTGTATCACTGCCAACCAACGGTCACATACACAACTCTCAGTGAAAGTGGAACAGATAGATTGAACTTTGGAGCCGATAAATCTCGTTTCCTCCATTACACTGAGTAGCTCTGTGTTTTTAGGGTTTCAGGTCTTCTTCCATGGCGCCGAAAAAGCGAAGCAAGATCCAGGAAGACGAGACGGCGGCGGCGAAGCCGGTCCCGGCGTCGTCGAGGGCGACGAGAAGGTCGCCCCGGCTGGCGGCGAACTCGAAGGCCGATTTGACGGTGGAGGAGCCTGTGGTGAAGTTGCCGAAGAGGAAGAAGGCGAAACGTGCGCCGAAGGAGAATGGGAAGGCGGAGGAGGTTGAAAATGAGGGAGAGGATCTTGACGCTGCCTCGGAGAAGCTTGGCGTGGAAGCTAAGAACAGAGCGGTTGTGATTGAACATTGGTAAGGTTTTTTTGTTCGATTTCTTGCTTGGAAATGAATCGAAAATTGGTTCGGTTAATTTGAAGTTCAAGTTTCTAGGGTTTGAAGTTTCCTAGAAATCTGCATTGGCACTAAAGACTATATTCTTAGTGTAATACTCCAAGCTCTTCAGACTCTCTTTTTAGGACTTCCTCTCTCAAAGTGCTCCACAATGGTACGATAATATTGTCCACATTAGGTGTAAGCTCTCATGGTTTTGCTTCCCAAAATGCCTCGAGTATTCTTTGTTTATAAATCCATGATCATTCCCTTTGGAGTCTTAGTCATTTTTTACTATCTTCGAAGAAGGGCTCGACTCCTTTTCTTTTGGAGTCCTTTGTTCGACATTTGAGGATTTACCAATCTATTGGCACGACTAAGTTTAGGGCATGACTTTGATACCATGTTAGACGAACACGATTCTCCACAATAGTATGATATTGTCCACTTTGAGTGTAAGCTCTCATGGTTTCGTTTTGGGCTTTCCAAAATGCCTCATACCAATGGAGAGAGTATTCTTTGTTTATAAACCCAGGATCATTCCCTAAATGAGCCGATGTGGGACTTCCATCATCCAATACACATAAAGCGTTTTGTTCCCCCTCCAACTGATGTGGGACTTCACAATCCACCCCCATTTGGGGCCAGCGTTATCGTTGGCACTCATTCCGCTCTCCAATCGATGGAATCTCACAATCTATCTCCTTCGAGTTCCAGTCCTCACTGGCTCTCTCCAATGGATGGAATCTCACAATCCATCTCCTTCAAGGTCCAGCGTCCTCACTGTCACTCATTCTGCTCTCCAATCAACGTGAGATCTCACAATCCACTCCTTTGGGGCTCAGCATCCTTGACACCGGGTAGTGTTGGACTCTGATACCTTTTAGGGCGAAAACATGTTTCTTTTTAACGTTGTTACTCTTTGTTCTTGCATTTGCCTCTGCTTTTTGCAATGGAACTTCTAATGAATGAAAAGGAAAAGTACATTAGTGATTCTTCTGCCTTTCACATGTCATGTGCAGTGTTTGTTTTGTGACACTTTCCCCGTTTTTTTGTTTGTTCCCTTTGTAGCAAGCAGTGCCAATCGTTCAAGAAAAGGGCGATCGAGGTGCAGAATGGTCTAGAGAAGGGTGTTCCTGGAATCACTGTTCTGCTCAACCCTGATAAGGTAACCTTTTGACTTCTTCAACTCTTACATGCTATGCTTAATAAAAATAATTAATTTGGGTTCTTTATTTTGGTACCCAAGCAGATTGTTCTTCATTTTCCTTCCATCTTGTAGAAGGTGTAGGTTTTAGTTTCCAAGCATAATGCTTTACTTTTTTAAGCTCTCATGTTCTCTGCTGGGTAGAGTCATAGTGAGGTGGAGGTTCTAGAGGGAAATCTTGAGTTTGCGACTTTAGGACTCGGGTTTTCAGCTGATTCACGAGCAAAGGTTAGAATTTGCACTCGGGAATCATTGATTAACTTTGAACTTTTAGGCTAGGGAAACATCACAGGCATCTATGTATTCGATTATTGGTCACCGACCCATATCTCTACAATGATATGTATTCCATGATTGAGACTTTAGGACTCGGGTTTTCAGCTGATTCACAAACAAAAGTTAGAATTCGCACTGGGAATCAATGATGAATTTTGAACTTTTAGGTTAGGAAACATAGCAGATAGGTAATCTATGTATTCCATGATCAAATTGCTGGCTACTACTGATCCATATCTCTACAATGATATGATATTATCTCTAAATCCCCACGACCTCTCCTTTCCTTACAAGAATGCTATCGATCCCCGAGAATATTGTGAAGGAAACTTAAAAATATATGAAAGAAATTGTTGTTGTTTTTCCCTCTTCTCATTTTGTTTTAATAATTATGCAGCCAAGAAGAGGGTGCTTTGAAATTCGGAGTGAAGATGGCGAGAAGTTTATCAGTCTTCTGGTAATCCTTTTATTTATTATTACTTTCTTGGAACAACCTTCAACTGTCTAGAATTTTCCCATTTATTCTGATAAGTTCACACATAACCTCATTGCAACAATACTCGTTAGACGAGATCAATAAAGTCCCATATAACATCATCATACATTGCGTCGTGTTAGGACATGAAGCGACCGTTTACACGAATGAAGGAACTGGACATGGAGGAGGTAATTTCAGATATCATTGAGAAGATAAAAGGATAATGAGAAAACCCTTGATGATAGATGATATATATGATGACTAATTGGCTGCTGGCTCATAAGGATTATGCACTGTTCTTTTGTGGGAGTAGCTTTGATGGTGGAAACTTGAATGCCAATGTCAGTGTGTGCTGGTATTTGATTGCAAAGGTAAAGCCATCAGGTTTGAACAACAATCACAGGGTGGGTTTATTTTGAATGGGTTGGTCCTTTGTATTAATGATTTAGACAGAGTGGCTCCATGCTGCATCTTTATATTGTTTGTGTTTTCTTTTTTTTTAATGACCGATGGGTTTTGAAAGGCAATTTTGCACTTATAACAATTGTGGTATTCTAGGTTTTATTTTCTAAATTTATTTCTAAATTAATGTTTTTTTTTTTTTTTTTTAAATACCTAGATATCTTTATAAGAGGTTTTTAGTTATAATGTTTGGTTTCTCTCATGTATTCTACCTTTTTTAAGCTTTTTAATTTGATTAATTACTATTTTAATCTTATAAAATTTTAATACTAACCTCGTTAAGATTGTATTTTTGCTGAAATTTTAATACTAAGCTCATACTAAGCTCATTAGTCTTAGACCAACAGGTCGACCTATGCTATTTTGGCTGAAGCCTTGGTTAACTAAACAATGTAGTCAATCCTTCAAAGAGTCCTAACTTATTTGACCTAAAGAAACTTAGTGTCGATCAAGGCTTTAAGTGTTGGCTAATGACCAACCATTTGACACTCAAAGGTTTTGAATTGAGTTATAGCGTTTCATAATTATTATGACTCAACAACCTGATGAGATTTTTGTAGAGTGTCTAAATACTCAAATGTACAATTTTTGTACTGTGTCAAAATACTTTGTACAAATGTATGAGTATAAACTTCGAATTTTAACTACATTTTTATTCAAATTTTAACTACGTTCTAAGACAAGCTGAAAAACGCATCTTTAGTTTCTGCTCTTTAAGTTATCTACAAATCCGATTCCTAATCCTAATAGAGCAATGAAGTATTGGTAGCTTAACAACACGCTAGTGTTAGTTACTCTAGTTTTGAAGACGTATTGACTCAATAACTTAGCACTAAATACTTTGTGATATTTTTTTGGGTTAGTATAGTCTTTATAATTGATAATTTGTCACAAGAAAAATATATAGATTCGTTAAGAGAACCGAATCCTCAACCTATAACGATCTCTTATGAACTAATCTGTGAAACGAGATATTCCCAAACAGACTAACAAATTAATTTACTAAATATAGTGAAGATGAACACATACCACTGTGTTCTTGTTGTTGCTTTGAGAGAAATTCACGCCCAACCAAGCAATTGCTTTGCCAATCTGTGATCTGCTTGATCTTTTGCTCCTATAAATTCCTCCCTTTGGTTCTGCAACAGAGACAAGCAGCAACAAAAAAGTTAATTCATTGTTCATTTGACAAAAAACGAACAAAAAAAAGGAAAAAAAAGAATAAGCAGTGAAGCAACGATCATGAATAAGTCCTCGGTTTCGATCTCTGTCACTTTTGTTCTGTTCTTCCTCTTTGCCTGCTCTGTTCCTGTTTCAGGCTTCAACATCACAAGGCTCCTCAATCGGTTCCCTGAATTTGGCGTTTTCAATGGATTTCTGACGAAAACTCGTCTCTTCGAACAGATCAACATTCGCCAAACTATCACCATTCTCGCCCTCGATAATGGCGTCGTTCCGAGCATCACTGGAAACTCTCTCGACGTAATCAAGCAGATTTTGAGTGCTCATGTTATTCTCGATTACTACGATTCTGCCAAGTTCAGGAAGCTCTCCACCAACAAGCCTACAGTACTTACTACCATGTTCCAGGCCACTGGCGACGCCGTACGCGAGCAAGGGTTTGTGAAAGTTGTGCTGAACAGGAGAGGTCAAATCGAATTCGGATCTGCTTGGAAAGGCGCGCCTTTCACCTCTATGTTTGTCAGAGCTGTTGCCTCACAGCCCTACAATATCTCTGTTATTCAAATCAGTTCTCCGATTGTGGTCCCTGGCATTGGCCGTTACAATTTGCCTCCTCCAGCGCCTGTTGCACCGGAACCAGATGTTGCTCCTGTTCCCGCTCCGACTCCATTGGCTGATACTCCTTCACCGGCGGACGAGTCTCCAGCCGATGCCCCCTCACCAGACGCCGACTCTCCAGTTCCGACGGCGGATGCACCTGATGCACCGTCGAGTGCTCCTCATCCATCGGCTGATGACGAAGAGGATGCAGATGCACCGAGTCCGGACGACGAGGAAGACCATTCGGCAGCTTCACGTGGCCGCGTCGCCGGCGCCGGAGTGATGGTGGCCGGATTGATGTCACTCTACATGGCTTTCTAAATTCAGAAAATGTGAAGGAACGAAAGAGAAAATCAGAGAGAGAGAGAGAGATCAACAGCGGAAGAATCGGTAGGATTGTTTATGCCTACAATCATCAAAACATCATATGAAAATGAAAAAAAAAGGAGAAAAAATGTCCAATAATGTATGAGTAGACCTTGAAC

mRNA sequence

TGATAACTATGTGTATTAAAATTGCTTTTTCCAAATAATGAGCTACTAATATCCAACTTTTTAAATTACGACGGACACATTTAAAATGATCCGGCAGAAAAAACACGTGGACGGAAATTTAGTGATTAAAAAAAAATGAATAAATTAACTCTTGGGACTTCGTGCCGCCATTTCCTTCCACAGAAGAAACCAAACTGGAACTGCGTGGACGTCGTACGACCACCCAATCAACGGCCACGATCCATTCTCATCCTCCAAATTCATTTCCACCGTCGCTTCAAAATCCCACCGCCGATACCAAAAGCGCGGGAAAACCAAACCTAATCCATCCCATATACCCTGTATCACTGCCAACCAACGGTCACATACACAACTCTCAGTGAAAGTGGAACAGATAGATTGAACTTTGGAGCCGATAAATCTCGTTTCCTCCATTACACTGAGTAGCTCTGTGTTTTTAGGGTTTCAGGTCTTCTTCCATGGCGCCGAAAAAGCGAAGCAAGATCCAGGAAGACGAGACGGCGGCGGCGAAGCCGGTCCCGGCGTCGTCGAGGGCGACGAGAAGGTCGCCCCGGCTGGCGGCGAACTCGAAGGCCGATTTGACGGTGGAGGAGCCTGTGGTGAAGTTGCCGAAGAGGAAGAAGGCGAAACGTGCGCCGAAGGAGAATGGGAAGGCGGAGGAGGTTGAAAATGAGGGAGAGGATCTTGACGCTGCCTCGGAGAAGCTTGGCGTGGAAGCTAAGAACAGAGCGGTTGTGATTGAACATTGCAAGCAGTGCCAATCGTTCAAGAAAAGGGCGATCGAGGTGCAGAATGGTCTAGAGAAGGGTGTTCCTGGAATCACTGTTCTGCTCAACCCTGATAAGAGTCATAGTGAGGTGGAGGTTCTAGAGGGAAATCTTGAGTTTGCGACTTTAGGACTCGGGTTTTCAGCTGATTCACGAGCAAAGCCAAGAAGAGGGTGCTTTGAAATTCGGAGTGAAGATGGCGAGAAGTTTATCAGTCTTCTGGACATGAAGCGACCGTTTACACGAATGAAGGAACTGGACATGGAGGAGGTAATTTCAGATATCATTGAGAAGATAAAAGGATAATGAGAAAACCCTTGATGATAGATGATATATATGATGACTAATTGGCTGCTGGCTCATAAGGATTATGCACTGTTCTTTTGTGGGAGTAGCTTTGATGGTGGAAACTTGAATGCCAATGTCAGTGTGTGCTGGTATTTGATTGCAAAGGTAAAGCCATCAGGTTTGAACAACAATCACAGGGTGGGTTTATTTTGAATGGGTTGGTCCTTTGTATTAATGATTTAGACAGAGTGGCTCCATGCTGCATCTTTATATTGTTTGTGTTTTCTTTTTTTTTAATGACCGATGGGTTTTGAAAGGCAATTTTGCACTTATAACAATTGTGGTATTCTAGGTTTTATTTTCTAAATTTATTTCTAAATTAATGTTTTTTTTTTTTTTTTTTAAATACCTAGATATCTTTATAAGAGGTTTTTAGTTATAATGTTTGGTTTCTCTCATGTATTCTACCTTTTTTAAGCTTTTTAATTTGATTAATTACTATTTTAATCTTATAAAATTTTAATACTAACCTCGTTAAGATTGTATTTTTGCTGAAATTTTAATACTAAGCTCATACTAAGCTCATTAGTCTTAGACCAACAGGTCGACCTATGCTATTTTGGCTGAAGCCTTGGTTAACTAAACAATGTAGTCAATCCTTCAAAGAGTCCTAACTTATTTGACCTAAAGAAACTTAGTGTCGATCAAGGCTTTAAGTGTTGGCTAATGACCAACCATTTGACACTCAAAGGTTTTGAATTGAGTTATAGCGTTTCATAATTATTATGACTCAACAACCTGATGAGATTTTTGTAGAGTGTCTAAATACTCAAATGTACAATTTTTGTACTGTGTCAAAATACTTTGTACAAATGTATGAGTATAAACTTCGAATTTTAACTACATTTTTATTCAAATTTTAACTACGTTCTAAGACAAGCTGAAAAACGCATCTTTAGTTTCTGCTCTTTAAGTTATCTACAAATCCGATTCCTAATCCTAATAGAGCAATGAAGTATTGGTAGCTTAACAACACGCTAGTGTTAGTTACTCTAGTTTTGAAGACGTATTGACTCAATAACTTAGCACTAAATACTTTGTGATATTTTTTTGGGTTAGTATAGTCTTTATAATTGATAATTTGTCACAAGAAAAATATATAGATTCGTTAAGAGAACCGAATCCTCAACCTATAACGATCTCTTATGAACTAATCTGTGAAACGAGATATTCCCAAACAGACTAACAAATTAATTTACTAAATATAGTGAAGATGAACACATACCACTGTGTTCTTGTTGTTGCTTTGAGAGAAATTCACGCCCAACCAAGCAATTGCTTTGCCAATCTGTGATCTGCTTGATCTTTTGCTCCTATAAATTCCTCCCTTTGGTTCTGCAACAGAGACAAGCAGCAACAAAAAAGTTAATTCATTGTTCATTTGACAAAAAACGAACAAAAAAAAGGAAAAAAAAGAATAAGCAGTGAAGCAACGATCATGAATAAGTCCTCGGTTTCGATCTCTGTCACTTTTGTTCTGTTCTTCCTCTTTGCCTGCTCTGTTCCTGTTTCAGGCTTCAACATCACAAGGCTCCTCAATCGGTTCCCTGAATTTGGCGTTTTCAATGGATTTCTGACGAAAACTCGTCTCTTCGAACAGATCAACATTCGCCAAACTATCACCATTCTCGCCCTCGATAATGGCGTCGTTCCGAGCATCACTGGAAACTCTCTCGACGTAATCAAGCAGATTTTGAGTGCTCATGTTATTCTCGATTACTACGATTCTGCCAAGTTCAGGAAGCTCTCCACCAACAAGCCTACAGTACTTACTACCATGTTCCAGGCCACTGGCGACGCCGTACGCGAGCAAGGGTTTGTGAAAGTTGTGCTGAACAGGAGAGGTCAAATCGAATTCGGATCTGCTTGGAAAGGCGCGCCTTTCACCTCTATGTTTGTCAGAGCTGTTGCCTCACAGCCCTACAATATCTCTGTTATTCAAATCAGTTCTCCGATTGTGGTCCCTGGCATTGGCCGTTACAATTTGCCTCCTCCAGCGCCTGTTGCACCGGAACCAGATGTTGCTCCTGTTCCCGCTCCGACTCCATTGGCTGATACTCCTTCACCGGCGGACGAGTCTCCAGCCGATGCCCCCTCACCAGACGCCGACTCTCCAGTTCCGACGGCGGATGCACCTGATGCACCGTCGAGTGCTCCTCATCCATCGGCTGATGACGAAGAGGATGCAGATGCACCGAGTCCGGACGACGAGGAAGACCATTCGGCAGCTTCACGTGGCCGCGTCGCCGGCGCCGGAGTGATGGTGGCCGGATTGATGTCACTCTACATGGCTTTCTAAATTCAGAAAATGTGAAGGAACGAAAGAGAAAATCAGAGAGAGAGAGAGAGATCAACAGCGGAAGAATCGGTAGGATTGTTTATGCCTACAATCATCAAAACATCATATGAAAATGAAAAAAAAAGGAGAAAAAATGTCCAATAATGTATGAGTAGACCTTGAAC

Coding sequence (CDS)

ATGGCGCCGAAAAAGCGAAGCAAGATCCAGGAAGACGAGACGGCGGCGGCGAAGCCGGTCCCGGCGTCGTCGAGGGCGACGAGAAGGTCGCCCCGGCTGGCGGCGAACTCGAAGGCCGATTTGACGGTGGAGGAGCCTGTGGTGAAGTTGCCGAAGAGGAAGAAGGCGAAACGTGCGCCGAAGGAGAATGGGAAGGCGGAGGAGGTTGAAAATGAGGGAGAGGATCTTGACGCTGCCTCGGAGAAGCTTGGCGTGGAAGCTAAGAACAGAGCGGTTGTGATTGAACATTGCAAGCAGTGCCAATCGTTCAAGAAAAGGGCGATCGAGGTGCAGAATGGTCTAGAGAAGGGTGTTCCTGGAATCACTGTTCTGCTCAACCCTGATAAGAGTCATAGTGAGGTGGAGGTTCTAGAGGGAAATCTTGAGTTTGCGACTTTAGGACTCGGGTTTTCAGCTGATTCACGAGCAAAGCCAAGAAGAGGGTGCTTTGAAATTCGGAGTGAAGATGGCGAGAAGTTTATCAGTCTTCTGGACATGAAGCGACCGTTTACACGAATGAAGGAACTGGACATGGAGGAGGTAATTTCAGATATCATTGAGAAGATAAAAGGATAA

Protein sequence

MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAPKENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPGITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMKRPFTRMKELDMEEVISDIIEKIKG
Homology
BLAST of Cp4.1LG07g02020 vs. NCBI nr
Match: KAG7020338.1 (hypothetical protein SDJN02_17022 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 378 bits (970), Expect = 1.22e-131
Identity = 200/204 (98.04%), Postives = 201/204 (98.53%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETAAAKPVPASSR TRRSPRLAANSKADL VEEPVVKLPKRKKAKR P
Sbjct: 1   MAPKKRSKIQEDETAAAKPVPASSRVTRRSPRLAANSKADLMVEEPVVKLPKRKKAKRGP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRA+EVQNGLEKGVPG
Sbjct: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAMEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK 180
           ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK
Sbjct: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK 180

Query: 181 RPFTRMKELDMEEVISDIIEKIKG 204
           RPFTRMKELDMEEVISDIIEKIKG
Sbjct: 181 RPFTRMKELDMEEVISDIIEKIKG 204

BLAST of Cp4.1LG07g02020 vs. NCBI nr
Match: XP_023536837.1 (selenoprotein H-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 317 bits (811), Expect = 7.82e-108
Identity = 176/204 (86.27%), Postives = 176/204 (86.27%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP
Sbjct: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG
Sbjct: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK 180
           ITVLLNPDK                            PRRGCFEIRSEDGEKFISLLDMK
Sbjct: 121 ITVLLNPDK----------------------------PRRGCFEIRSEDGEKFISLLDMK 176

Query: 181 RPFTRMKELDMEEVISDIIEKIKG 204
           RPFTRMKELDMEEVISDIIEKIKG
Sbjct: 181 RPFTRMKELDMEEVISDIIEKIKG 176

BLAST of Cp4.1LG07g02020 vs. NCBI nr
Match: XP_022950994.1 (selenoprotein H-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 306 bits (784), Expect = 1.02e-103
Identity = 170/204 (83.33%), Postives = 171/204 (83.82%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETA AKPVPASSR TRRSPRLA NSKADLTVEE VVKLPKRKKAKR P
Sbjct: 1   MAPKKRSKIQEDETAVAKPVPASSRVTRRSPRLAGNSKADLTVEETVVKLPKRKKAKRGP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRA+EVQNGLEKGVPG
Sbjct: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAMEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK 180
           ITVLLNPDK                            PRRGCFEIRSEDGEKFISLLDMK
Sbjct: 121 ITVLLNPDK----------------------------PRRGCFEIRSEDGEKFISLLDMK 176

Query: 181 RPFTRMKELDMEEVISDIIEKIKG 204
           RPFTRMKELDMEEVISDIIEKIKG
Sbjct: 181 RPFTRMKELDMEEVISDIIEKIKG 176

BLAST of Cp4.1LG07g02020 vs. NCBI nr
Match: XP_023002520.1 (selenoprotein H-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 305 bits (781), Expect = 2.91e-103
Identity = 170/204 (83.33%), Postives = 173/204 (84.80%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETAAAKPVP SSR TRRSPRLAANSKADLTVEEPVVKLPK+KKAKRAP
Sbjct: 1   MAPKKRSKIQEDETAAAKPVPVSSRVTRRSPRLAANSKADLTVEEPVVKLPKKKKAKRAP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KEN KAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRA+EVQNGLEKGVPG
Sbjct: 61  KENEKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAMEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK 180
           ITVLLNPDKS                            RRGCFEIRSEDGEKFISLLDMK
Sbjct: 121 ITVLLNPDKS----------------------------RRGCFEIRSEDGEKFISLLDMK 176

Query: 181 RPFTRMKELDMEEVISDIIEKIKG 204
           RPFT+MKELDMEEVISDIIEKIKG
Sbjct: 181 RPFTQMKELDMEEVISDIIEKIKG 176

BLAST of Cp4.1LG07g02020 vs. NCBI nr
Match: KAG6585418.1 (Fasciclin-like arabinogalactan protein 5, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 290 bits (741), Expect = 2.53e-93
Identity = 161/193 (83.42%), Postives = 162/193 (83.94%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETAAAKPVPASSR TRRSPRLAANSKADL VEEPVVKLPKRKKAKR P
Sbjct: 1   MAPKKRSKIQEDETAAAKPVPASSRVTRRSPRLAANSKADLMVEEPVVKLPKRKKAKRGP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRA+EVQNGLEKGVPG
Sbjct: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAMEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK 180
           ITVLLNPDK                            PRRGCFEIRSEDGEKFISLLDMK
Sbjct: 121 ITVLLNPDK----------------------------PRRGCFEIRSEDGEKFISLLDMK 165

Query: 181 RPFTRMKELDMEE 193
           RPFTRMKELDMEE
Sbjct: 181 RPFTRMKELDMEE 165

BLAST of Cp4.1LG07g02020 vs. ExPASy TrEMBL
Match: A0A6J1GHF7 (selenoprotein H-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453972 PE=4 SV=1)

HSP 1 Score: 306 bits (784), Expect = 4.92e-104
Identity = 170/204 (83.33%), Postives = 171/204 (83.82%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETA AKPVPASSR TRRSPRLA NSKADLTVEE VVKLPKRKKAKR P
Sbjct: 1   MAPKKRSKIQEDETAVAKPVPASSRVTRRSPRLAGNSKADLTVEETVVKLPKRKKAKRGP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRA+EVQNGLEKGVPG
Sbjct: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAMEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK 180
           ITVLLNPDK                            PRRGCFEIRSEDGEKFISLLDMK
Sbjct: 121 ITVLLNPDK----------------------------PRRGCFEIRSEDGEKFISLLDMK 176

Query: 181 RPFTRMKELDMEEVISDIIEKIKG 204
           RPFTRMKELDMEEVISDIIEKIKG
Sbjct: 181 RPFTRMKELDMEEVISDIIEKIKG 176

BLAST of Cp4.1LG07g02020 vs. ExPASy TrEMBL
Match: A0A6J1KP64 (selenoprotein H-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111496337 PE=4 SV=1)

HSP 1 Score: 305 bits (781), Expect = 1.41e-103
Identity = 170/204 (83.33%), Postives = 173/204 (84.80%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETAAAKPVP SSR TRRSPRLAANSKADLTVEEPVVKLPK+KKAKRAP
Sbjct: 1   MAPKKRSKIQEDETAAAKPVPVSSRVTRRSPRLAANSKADLTVEEPVVKLPKKKKAKRAP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KEN KAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRA+EVQNGLEKGVPG
Sbjct: 61  KENEKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAMEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMK 180
           ITVLLNPDKS                            RRGCFEIRSEDGEKFISLLDMK
Sbjct: 121 ITVLLNPDKS----------------------------RRGCFEIRSEDGEKFISLLDMK 176

Query: 181 RPFTRMKELDMEEVISDIIEKIKG 204
           RPFT+MKELDMEEVISDIIEKIKG
Sbjct: 181 RPFTQMKELDMEEVISDIIEKIKG 176

BLAST of Cp4.1LG07g02020 vs. ExPASy TrEMBL
Match: A0A6J1KLJ2 (uncharacterized protein LOC111496337 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111496337 PE=4 SV=1)

HSP 1 Score: 254 bits (648), Expect = 1.16e-83
Identity = 144/177 (81.36%), Postives = 146/177 (82.49%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETAAAKPVP SSR TRRSPRLAANSKADLTVEEPVVKLPK+KKAKRAP
Sbjct: 1   MAPKKRSKIQEDETAAAKPVPVSSRVTRRSPRLAANSKADLTVEEPVVKLPKKKKAKRAP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KEN KAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRA+EVQNGLEKGVPG
Sbjct: 61  KENEKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAMEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLL 177
           ITVLLNPDKS                            RRGCFEIRSEDGEKFISLL
Sbjct: 121 ITVLLNPDKS----------------------------RRGCFEIRSEDGEKFISLL 149

BLAST of Cp4.1LG07g02020 vs. ExPASy TrEMBL
Match: A0A6J1GHC6 (selenoprotein H-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111453972 PE=4 SV=1)

HSP 1 Score: 253 bits (647), Expect = 1.64e-83
Identity = 143/177 (80.79%), Postives = 144/177 (81.36%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKRAP 60
           MAPKKRSKIQEDETA AKPVPASSR TRRSPRLA NSKADLTVEE VVKLPKRKKAKR P
Sbjct: 1   MAPKKRSKIQEDETAVAKPVPASSRVTRRSPRLAGNSKADLTVEETVVKLPKRKKAKRGP 60

Query: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPG 120
           KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRA+EVQNGLEKGVPG
Sbjct: 61  KENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAMEVQNGLEKGVPG 120

Query: 121 ITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLL 177
           ITVLLNPDK                            PRRGCFEIRSEDGEKFISLL
Sbjct: 121 ITVLLNPDK----------------------------PRRGCFEIRSEDGEKFISLL 149

BLAST of Cp4.1LG07g02020 vs. ExPASy TrEMBL
Match: A0A6J1K9R2 (selenoprotein H-like OS=Cucurbita maxima OX=3661 GN=LOC111492377 PE=4 SV=1)

HSP 1 Score: 248 bits (632), Expect = 7.56e-81
Identity = 142/207 (68.60%), Postives = 156/207 (75.36%), Query Frame = 0

Query: 1   MAPKKRSKIQEDETAAA---KPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAK 60
           MAP+KRSK QED+ AAA   KP P SS  TRRSPRLA NS ADL VEE V +LPK KK K
Sbjct: 1   MAPRKRSKNQEDKPAAAAEEKPAPVSSMVTRRSPRLAVNSMADLVVEEAVTELPKSKKVK 60

Query: 61  RAPKENGKAEEVENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKG 120
           RAPKENGKA+EV NEGE++DAAS+KL  +AKNR VVIE CKQCQSFKKRAI+VQ+GLE G
Sbjct: 61  RAPKENGKAKEVGNEGEEIDAASKKLSKDAKNRTVVIEFCKQCQSFKKRAIQVQSGLENG 120

Query: 121 VPGITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLL 180
           V GITVLLNP+K                            PR+GCFEIRS+DGEKFISLL
Sbjct: 121 VSGITVLLNPNK----------------------------PRKGCFEIRSDDGEKFISLL 179

Query: 181 DMKRPFTRMKELDMEEVISDIIEKIKG 204
           DMKRPFTRMKEL+MEEVISDIIEKIKG
Sbjct: 181 DMKRPFTRMKELNMEEVISDIIEKIKG 179

BLAST of Cp4.1LG07g02020 vs. TAIR 10
Match: AT2G24440.1 (selenium binding )

HSP 1 Score: 132.5 bits (332), Expect = 3.9e-31
Identity = 89/196 (45.41%), Postives = 112/196 (57.14%), Query Frame = 0

Query: 15  AAAKPVPASSRATRRSPRLAANSKADLTVEEPVVKLPKRKKAKR--APKENGKAEEV--- 74
           A  + + +  R TR   +   +S   + +E P  K  K  KAK   A K+  K EEV   
Sbjct: 16  ANTRMLRSMDRKTRSDTKRDGSSSKLMKIESPEKKKRKTTKAKNVGAAKKKVKKEEVAVK 75

Query: 75  --ENEGEDLDAASEKLGVEAKNRAVVIEHCKQCQSFKKRAIEVQNGLEKGVPGITVLLNP 134
             + E ED DAA ++   ++  + +VIEHCKQC+SFK+RA EV+ GLE+ VPGI V +NP
Sbjct: 76  IEKEEEEDDDAAEKEEDDDSDKKKIVIEHCKQCKSFKERANEVKEGLEEAVPGIIVTVNP 135

Query: 135 DKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDGEKFISLLDMKRPFTRMK 194
           D                            KPRRGCFEIR E GE FISLL MKRPFT MK
Sbjct: 136 D----------------------------KPRRGCFEIREEGGETFISLLAMKRPFTPMK 183

Query: 195 ELDMEEVISDIIEKIK 204
           EL+MEEVI+DI+EKIK
Sbjct: 196 ELNMEEVIADIVEKIK 183

BLAST of Cp4.1LG07g02020 vs. TAIR 10
Match: AT4G31360.1 (selenium binding )

HSP 1 Score: 125.2 bits (313), Expect = 6.1e-29
Identity = 91/213 (42.72%), Postives = 112/213 (52.58%), Query Frame = 0

Query: 3   PKKRSKIQEDETAAAKPVPASSRATR------RSPRLAANSKADLTVEEPVVKLPKRKKA 62
           P K+SK   +E A      ASSR TR      RS      +KA  +  +P  K  KRK +
Sbjct: 2   PPKKSKADGEEKAKPLTTLASSRVTRSMDSRTRSQTQQNGAKAAGSATKPATKKAKRKNS 61

Query: 63  ---KRAPKENGKAEEVENEGEDLDAASEKLGVEAKN---RAVVIEHCKQCQSFKKRAIEV 122
                  K+  K EEVE   E ++   EK   E ++     +VIEHCKQC +FK RAI+V
Sbjct: 62  AIETGRAKKGKKEEEVEEPEEAVEEEVEKEEPEVEDPTRTKIVIEHCKQCNAFKTRAIQV 121

Query: 123 QNGLEKGVPGITVLLNPDKSHSEVEVLEGNLEFATLGLGFSADSRAKPRRGCFEIRSEDG 182
           +  LE  VPG+TV LNP+                            KPRRGCFEIR E G
Sbjct: 122 KEALEGAVPGVTVSLNPE----------------------------KPRRGCFEIREEGG 181

Query: 183 EKFISLLDMKRPFTRMKELDMEEVISDIIEKIK 204
           + FISLL+MKRPF  MK LDMEEVI DII+K+K
Sbjct: 182 QTFISLLEMKRPFAPMKALDMEEVIEDIIKKVK 186

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG7020338.11.22e-13198.04hypothetical protein SDJN02_17022 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_023536837.17.82e-10886.27selenoprotein H-like [Cucurbita pepo subsp. pepo][more]
XP_022950994.11.02e-10383.33selenoprotein H-like isoform X1 [Cucurbita moschata][more]
XP_023002520.12.91e-10383.33selenoprotein H-like isoform X1 [Cucurbita maxima][more]
KAG6585418.12.53e-9383.42Fasciclin-like arabinogalactan protein 5, partial [Cucurbita argyrosperma subsp.... [more]
Match NameE-valueIdentityDescription
A0A6J1GHF74.92e-10483.33selenoprotein H-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453972 PE... [more]
A0A6J1KP641.41e-10383.33selenoprotein H-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111496337 PE=4... [more]
A0A6J1KLJ21.16e-8381.36uncharacterized protein LOC111496337 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1GHC61.64e-8380.79selenoprotein H-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111453972 PE... [more]
A0A6J1K9R27.56e-8168.60selenoprotein H-like OS=Cucurbita maxima OX=3661 GN=LOC111492377 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G24440.13.9e-3145.41selenium binding [more]
AT4G31360.16.1e-2942.72selenium binding [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..75
NoneNo IPR availablePANTHERPTHR33638SELENOPROTEIN Hcoord: 1..203

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG07g02020.1Cp4.1LG07g02020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005794 Golgi apparatus