Cp4.1LG10g10970 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG10g10970
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionHeat shock protein DnaJ, cysteine-rich domain containing protein
LocationCp4.1LG10: 6452747 .. 6458702 (+)
RNA-Seq ExpressionCp4.1LG10g10970
SyntenyCp4.1LG10g10970
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TATTGCATATTCGTTGTCTGGTTGCCAGAGGAGTGCGATTTTGACACCTAGTTGGGCGCCGGATTGTTGCACTAGGAACCAAAATTCATTCACCAATTTGTCATCCGCAATTCGTTAAAAAACGCGAGAAAAAAAGCCATTATATTGGCGATTTCCTCGGGAATTTCATCGATTCGAGCTTCAATTTCCCTTACCGCTTGGAGGCCAATTTCATCACAGATTTCTGCTATTCATACGCAATGGATTCAACTTCTTTAACCATTTCATCATCTTTCATTTTCCGCAATTCATCGCAGATATTACTTGCCATTAGAAAGATTCAAGACGGTTTGTGTTTCGAGAGGAACAAATTCTCCAAGATCTTCGCGGTCTATCCCAATGGCTCTGCTTCTGGGGTTCGTACTTTTCAAATTGGTTATGTGATTCTATTTGCTGTTTTATATTCAATGGACGAATATGGTCCTGCGTTCATTTAGGTTGAATCTATTGTACAGTCATTTTTGTGCATTTGAAGGTATTTGGTTGTTTGGTGCATACATTGGATTATAAGCGACTTTTTTGAGCTATGCTTTTCTGCTAGAAGCTGAACATTTTGGTTTAAAGTTAGTGCTATGTTGTGTTATTTTTTTTCGTTCTTATGTTTGCATTCAGTTTAAACTTGACAAGGTTTAATGAGAAATGGTAAAGCATAGAACTCCCACATGTGAAAATGGCGGTCTCTTTATATAACTATTGTTTGTCTGATGGTCAAATCGCAGGCTCGTTTCATATCTGTAAAAAGAACCATTTTATACCGTTAATTCCTTAGTGTGAGATTCCACATCGGTTAGGGAGGAGGACGAAACATTCTTTTTATAAGGGTGTAGAAACTTCTCCCTAGCAAATGCGTTTTAAAAACTTTGAGGGAAGCCTATAAGGGAGAGCCCAAAGAGGACAAATGGTATCATAACCAAACACCGGGTGATGTGCCAGTAGGAGGTTGAGCCCCAAAGAGGGGTGGACACGAGGCGGTGTGCCAGCAAGGACGCTAGGCCCCGAAGGGGTGGATTGGGGGGAGGTCCCACATCGATTGGAGAAGGAAACAAGTGCCAGCGAGGAGTGCCAGCAAGGATGCTGGGTTCCGAAGGGGGTGGATGGGGGGAGGTCCCACATCAATTGCAAAAGGGAACAAGTGCCAGTAANGGGGGGGGGGGAGGTCCCACATCGATTGGAGAAGGGAACAGTGCCAGTGAGGACGCTGGACCTCTAAGGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGGAGAACCAAACATTCTTTATAAGGGTGTGAAAACCTCTCCCTAGCATACATGTTTTAAAATCATTGAATGAAAGCCCGAAAGGGAAAGCCCAAGGAGGACAAAAGCTCCTAACGGTGGGTTTGAGCCGTGTACCACGGCCTTTTGCCTGCTGAATTGGCATAATTGATTGAAACTGGCAAGAGCATTTGTAATAAGTAAACAAGCATTTTTGATAAACCTCAGCAATCAATATCTAATGTCACTAGCTGTTGGCAATAGTAGCTAGACATCAAACGTAACCTCTGCTTTTCGCCTTTTCTGTCATATCCCGATCTACGTGTTCGTGATTGTGTATTCAATCATATGCTTCTAAGTATATGATCTGCATTCATAGTTTGTGATTGCCATATAATTACGGTTTTGAGAATCTTGTCAAATTTTCTGGATGTAGCCGGGGGATTTTTCAGCTGCAGATGTTCACAGACAACGAAGTGCTTTTGAATCTTTATTTTGCTATGATAAGGCTATTCCGGAAGAAATAATTGAGATGCCTGTTGGAATATCTCTGGCAGAGAAAATGATTGGAGATAATCCCCGTTGCACTGATTGTCATGCTAAAGGTGCTGTTCTTTGCACAACTTGCTCCGGGTCGGGATTGTATGTCGACTCGATATTGGAGAGCCAGGGAATCATTGTGAAAGTTCGTTGCCTCGGTAAAATCTGTATGTTTAGTTCTCAGATTACTTCTTGAAAAGTTTCTGTCCGAACTCTAATCATCTTCTTGTAGGTTGTGGTGGAACTGGTAATACCATGTGTTCAGAATGCGGCGGTCTTGGTCACCTGGGATCTAAATGAGTTCCAAAGACTCGTTTCGAACGTTTTGTCTTGAACCGCCCCCAGCTTTAATATTGGGCATGGATTATGCAATGATGATGAGATATATTAATCTTGAAATTTAGTGAAGATTGATAGAGTTCATCTTTCATTTAAGCAGCTTGTCTTGCCATAGTTTGGAGCAATTATATCTGATTCAGTTCAACTTACTCCATAGTGTTCATTTCCCTCAAGATGTGTACTTGTTAGTCAAACCGACACGAATTCTTTCTGAACTGACTACGAAATCAGAAATATATGAACTTCAAGGAAAGTTTTCATAAATTACTGCATGAATCTCTTTGTTTTTATTCTTGTTAGTAGTTCTGAAGCATTATTATGTGTTTGAAAAAAGAATCTACCATGAATTTCTCTCTAAGAACAAGATTTGCAGGGAAAAAGGAAGAAGCTTAAAGCTGGCATTGCTGTTTCAGTGGGTGAAGGCATTTTTTTCTTATGCCAGTCTCTTACTGGAGCTTTAGGAATGGTCTAAATCTGCTTCTTCACTTGTTTTCATGTTTCAGGGTTGGTTAAACTTGCAACCTCCGCCGAAGTTCTCGAATCCGGTCTGGCACGGTGGAGTCGAACCGAGGATGCAGGGGATTGTCTGCATAAGCTACGTAGTAGGCAGACACACATGCTTGTGGCCACGCCATTCCGATTCGACACTGCAACAACAACAACAACAACAACAACAACAACAACCATGAACATCAATCCAGAAGTTAAATTCCTCTCAGGTATTACCATTAAATTCAGTTACAGTTTAGAACAGTTCAAATATTTCAGCTCATTACCACAAAATAGCCAATCAGGAAGGCGTAGATTGCTAATTCTGTTGCGTAGCTCTTATGTATTACAAGTGCCCAGATCCCACTTATAATGCTACAGATTGCACCACCAGAAACCCCGCAAAGAACGCAGAACGACACGGTGAGATCGGAGTCTATAACGATTTCCAAATCCACCTTCTCGAACGCCTCCCAGGTATCATATGAAGCTTGCACAATCCCTTTATTGTAAACTCCAACATGGACAAAACCCCACCTGTTTCCATGGTTTCTCAGCATTGAAGCTAATTCTGAGTAGCAGGGTGCACAAGAGAAAAGGAACTCATCCGAGTCTCCTGCCACCAACCTCATGGCACGGGCCGATCCGTGAATGAAGCTGATAAAAGGGATGATGGCTGAGCCAAGAGAAATAGTTCCAGCTGAATGCTTTATTATGTCGTAGAAAGCTGCTCGTATGTCCTTTTCCCGACCCCAAGCTAAATTAAAGTACTTTATGCGCGAGATTGTAACGTGCACTACGTTCTTGATAACCTGCAGAGTCCAGGTTAGGCTCAACAGTATTGCTGCTATGAACAATGCCTTAAAACTCGACGTAATAGAAGTTGCTCCACCGATGCCGATGACCACGAAAGCTGCGTATAGAACGCCAATGAGGATTGATCCAAAGACAAAGATTGGAGTGTTCGTAGGAGGGTACTTAGTAGAAAGTGATAGAAGTTGAATTGCATAATTCAACCGGTGATTGATCCAACAGACATATACAGACACAATCAAGGAAGAAACTATGAGGATGACTCCTGCTGCTAAACCGCCACGAGATCCGACAATGATAAAGAAAACACCAGAAGCAAGTGATAGTACAGGGCTAAACCAGAAGGCTGTCTTGAAGGCCATAGAAGGTGAACAGCCAGTGAAGGCCTGCCATGAAAAGGCAATGATTCCAGAGCTCCCAATAGAAGCCAACAATGGAGGGTACCATTCTTTGGGGTGAAAATGATGCGTTTTAGAAGCCGACCGAAGTCCGAGAACTGTCAAAACGATCACTAACACGGCTACCAGACATATATGAATCTTGAAAACTATGCTAGAAAGCCACCTGAAAATTCTCCATCCCATGGCAGGCTCGATCGGATGGCTCGGTTCTTGAACCTACAACAACAGCCAATAAGATTCCTTCATATACATGTAAGTCAACGTCCATACACCAAAAATGGATCTGAATTTTCATGGATATGATAATGGAAATGCTTTCTTGATTGAGAAAGAGAGAGAAACCTGGTTTGGTGTTGAAGATGAAACTCTTCTTTCTTGAGCTTGGTGGGAGTGGTTCTGTAACACACTTGGTGGCGTTCTGGCTTCCATAATTTCATTTCAGGGGGAAATCAGCTCCTAAAATCCACCAAGAACAGCACCAGTTTTTGTAGTAGTTGCTGTGTTCTTGAAGTTCTATGGGTTGGTTGAAAAGCAAGTAATCTCCTTTGAAATCCTGGGCCCTCTGAGATTTGGACTTCTCAAGTCTTCTTAAACTAATCAGGTTTTGAATTCTCAAGATCTCTCTCTCTCTCTCTAGATATATACACATGTATGTGGAATTTCATAATTTTGGTCAAAGTAGGTGCGGGGTTCTATTGTGCTGATGTCGTGGCAGCACGAGGGAGCAGCAACGTGGATCAAAGGCTATCTTCCTCCTGATCCACATCTGTGAGATCCCCATCGATTGGAAAGGGGAACGAAACATTCTGTATTAGGGTGTGGAAACCTCTAAAAACCGTGAGGCTGACGGTGATACGTATCGGGTTTACAACCTCTCTGGAACGAAAAAGGTCTCCGCTTTCTGTTATCCTTTTTCTTTTTCTTTTTGTGTTTCATTTTCTGAATGAACTCAAAGTTGAACAAGTTAGCTAAAACCTTTTTCTTTTCTTCTCTTTAGTTATTTATTTAAGAATCCTATGTGTTTTTCAACTCTTTTGGGTTTTTTTATACAAAGAACCAAATTACAGTTTTGCCCTTCCAAGTTCCAATAATCTAAAAGTTTTTCTTCATTGTTAATGATTATCAATGAATGATTGCTTTGCAAGTTAAATTTTGACTTTGTCAACGTTTTTGTTCACAGATAAGATTGGAGGGACTGTGAGGAAGAAAGCATGACAGTCAAATCCACAAACGCCATGTGGCAGCCAGGTTAGTCGAGTTAACGTAATCTCGCTTCTCATATTCTAAAGCTGGAAAGTTGCATGCTTTTCATGTCTTACTGATAGACCTTCCCAAAAGGACAAACATAAAGTAGGCAAATTGGTTAGGCTTGGTTTCATTTTCTTAACCCGAAACGCGAAACCGAACAATTTGTTTGATAATTTCTTTTTAAAAAATTTAATAGTTATATTGTTTTGGAATTGATAAAGTTGTTTGAAATAAATGGAAATAACTTTATAAAATGAATTAGGTTCTTATGGGAATTTCACAAATGTTCTTTTTTTTTTTTTTTTTTTTTATCCTTTATGAAAATATGCTTCTCCAAACTTTAGAATATTAGAAACTCTTACGGAACAAATTTGAGTGCTAAATTAAAATGTCCTTTGTTTTCATTATATTCCATGCATTATACCTTTTATGAAAAAGAAATATCAATTTCTACCATGTTCTAACATTTACTAAGTTACACCAACCGCTTCTGATGCGATTAATCTAGCTTGGGTTTGATACGCTCGGACTCTTCCCTCTTAGATGTCAAATGGGTTCATAGATTATGAAGCCAATTAGACTATGATTTAAATACTTTTTTCTGATTCTTTTGTTTATTTTTTATTCTTTTTTGGGGGAAAAAAATTCGTTCTTATCCTCGTAGCTCAAGTTGTATAACTCTCACTCAAAGGAAAACAACCCTTAAGTTTAAGCATTTGATTTATTGAGTCGTCTCTGATCGTTTAGTTTTTGTTTGTCTCCTTTAGTCATGTAAGGTTAGTATGACTACTTTTTAGCAAGTTTAAATTATTTGTTGCTCCTGACTCTTCCCTCTTCGTTTGAACT

mRNA sequence

TATTGCATATTCGTTGTCTGGTTGCCAGAGGAGTGCGATTTTGACACCTAGTTGGGCGCCGGATTGTTGCACTAGGAACCAAAATTCATTCACCAATTTGTCATCCGCAATTCGTTAAAAAACGCGAGAAAAAAAGCCATTATATTGGCGATTTCCTCGGGAATTTCATCGATTCGAGCTTCAATTTCCCTTACCGCTTGGAGGCCAATTTCATCACAGATTTCTGCTATTCATACGCAATGGATTCAACTTCTTTAACCATTTCATCATCTTTCATTTTCCGCAATTCATCGCAGATATTACTTGCCATTAGAAAGATTCAAGACGGTTTGTGTTTCGAGAGGAACAAATTCTCCAAGATCTTCGCGGTCTATCCCAATGGCTCTGCTTCTGGGCCGGGGGATTTTTCAGCTGCAGATGTTCACAGACAACGAAGTGCTTTTGAATCTTTATTTTGCTATGATAAGGCTATTCCGGAAGAAATAATTGAGATGCCTGTTGGAATATCTCTGGCAGAGAAAATGATTGGAGATAATCCCCGTTGCACTGATTGTCATGCTAAAGGTGCTGTTCTTTGCACAACTTGCTCCGGGTCGGGATTGTATGTCGACTCGATATTGGAGAGCCAGGGAATCATTGTGAAAGTTCGTTGCCTCGGTTGTGGTGGAACTGGTAATACCATGTGTTCAGAATGCGGCGGTCTTGGTCACCTGGGATCTAAATGAGTTCCAAAGACTCGTTTCGAACGTTTTGTCTTGAACCGCCCCCAGCTTTAATATTGGGCATGGATTATGCAATGATGATGAGATATATTAATCTTGAAATTTAGTGAAGATTGATAGAGTTCATCTTTCATTTAAGCAGCTTGTCTTGCCATAGTTTGGAGCAATTATATCTGATTCAGTTCAACTTACTCCATAGTGTTCATTTCCCTCAAGATGTGTACTTGTTAGTCAAACCGACACGAATTCTTTCTGAACTGACTACGAAATCAGAAATATATGAACTTCAAGGAAAGTTTTCATAAATTACTGCATGAATCTCTTTGTTTTTATTCTTGTTAGTAGTTCTGAAGCATTATTATGTGTTTGAAAAAAGAATCTACCATGAATTTCTCTCTAAGAACAAGATTTGCAGGGAAAAAGGAAGAAGCTTAAAGCTGGCATTGCTGTTTCAGTGGGTGAAGGCATTTTTTTCTTATGCCAGTCTCTTACTGGAGCTTTAGGAATGGTCTAAATCTGCTTCTTCACTTGTTTTCATGTTTCAGGGTTGGTTAAACTTGCAACCTCCGCCGAAGTTCTCGAATCCGGTCTGGCACGGTGGAGTCGAACCGAGGATGCAGGGGATTGTCTGCATAAGCTACGTAGTAGGCAGACACACATGCTTGTGGCCACGCCATTCCGATTCGACACTGCAACAACAACAACAACAACAACAACAACAACAACCATGAACATCAATCCAGAAGTTAAATTCCTCTCAGGTATTACCATTAAATTCAGTTACAGTTTAGAACAGTTCAAATATTTCAGCTCATTACCACAAAATAGCCAATCAGGAAGGCGTAGATTGCTAATTCTGTTGCGTAGCTCTTATGTATTACAAGTGCCCAGATCCCACTTATAATGCTACAGATTGCACCACCAGAAACCCCGCAAAGAACGCAGAACGACACGGTGAGATCGGAGTCTATAACGATTTCCAAATCCACCTTCTCGAACGCCTCCCAGCATTGAAGCTAATTCTGAGTAGCAGGGTGCACAAGAGAAAAGGAACTCATCCGAGTCTCCTGCCACCAACCTCATGGCACGGGCCGATCCGTGAATGAAGCTGATAAAAGGGATGATGGCTGAGCCAAGAGAAATAGTTCCAGCTGAATGCTTTATTATGTCGTAGAAAGCTGCTCAGTCCAGGTTAGGCTCAACAGTATTGCTGCTATGAACAATGCCTTAAAACTCGACGTAATAGAAGTTGCTCCACCGATGCCGATGACCACGAAAGCTGCGTATAGAACGCCAATGAGGATTGATCCAAAGACAAAGATTGGAGTGTTCGTAGGAGGGTACTTAGTAGAAAGTGATAGAAGTTGAATTGCATAATTCAACCGGTGATTGATCCAACAGACATATACAGACACAATCAAGGAAGAAACTATGAGGATGACTCCTGCTGCTAAACCGCCACGAGATCCGACAATGATAAAGAAAACACCAGAAGCAAGTGATAGTACAGGGCTAAACCAGAAGGCTGTCTTGAAGGCCATAGAAGGTGAACAGCCAGTGAAGGCCTGCCATGAAAAGGCAATGATTCCAGAGCTCCCAATAGAAGCCAACAATGGAGGGTACCATTCTTTGGGGTGAAAATGATGCGTTTTAGAAGCCGACCGAAGTCCGAGAACTGTCAAAACGATCACTAACACGGCTACCAGACATATATGAATCTTGAAAACTATGCTAGAAAGCCACCTGAAAATTCTCCATCCCATGGCAGGCTCGATCGGATGGCTCGGTTCTTGAACCTACAACAACAGCCAATAAGATTCCTTCATATACATGTAAGTCAACGTCCATACACCAAAAATGGATCTGAATTTTCATGGATATGATAATGGAAATGCTTTCTTGATTGAGAAAGAGAGAGAAACCTGGTTTGGTGTTGAAGATGAAACTCTTCTTTCTTGAGCTTGGTGGGAGTGGTTCTGTAACACACTTGGTGGCGTTCTGGCTTCCATAATTTCATTTCAGGGGGAAATCAGCTCCTAAAATCCACCAAGAACAGCACCAGTTTTTGTAGTAGTTGCTGTGTTCTTGAAGTTCTATGGGTTGGTTGAAAAGCAAGTAATCTCCTTTGAAATCCTGGGCCCTCTGAGATTTGGACTTCTCAAGTCTTCTTAAACTAATCAGGTGCGGGGTTCTATTGTGCTGATGTCGTGGCAGCACGAGGGAGCAGCAACGTGGATCAAAGGCTATCTTCCTCCTGATCCACATCTGTGAGATCCCCATCGATTGGAAAGGGGAACGAAACATTCTGTATTAGGGTGTGGAAACCTCTAAAAACCGTGAGGCTGACGGTGATACGTATCGGGTTTACAACCTCTCTGGAACGAAAAAGATAAGATTGGAGGGACTGTGAGGAAGAAAGCATGACAGTCAAATCCACAAACGCCATGTGGCAGCCAGGTTAGTCGAGTTAACGTAATCTCGCTTCTCATATTCTAAAGCTGGAAAGTTGCATGCTTTTCATGTCTTACTGATAGACCTTCCCAAAAGGACAAACATAAAGTAGGCAAATTGGTTAGGCTTGGTTTCATTTTCTTAACCCGAAACGCGAAACCGAACAATTTGTTTGATAATTTCTTTTTAAAAAATTTAATAGTTATATTGTTTTGGAATTGATAAAGTTGTTTGAAATAAATGGAAATAACTTTATAAAATGAATTAGGTTCTTATGGGAATTTCACAAATGTTCTTTTTTTTTTTTTTTTTTTTTATCCTTTATGAAAATATGCTTCTCCAAACTTTAGAATATTAGAAACTCTTACGGAACAAATTTGAGTGCTAAATTAAAATGTCCTTTGTTTTCATTATATTCCATGCATTATACCTTTTATGAAAAAGAAATATCAATTTCTACCATGTTCTAACATTTACTAAGTTACACCAACCGCTTCTGATGCGATTAATCTAGCTTGGGTTTGATACGCTCGGACTCTTCCCTCTTAGATGTCAAATGGGTTCATAGATTATGAAGCCAATTAGACTATGATTTAAATACTTTTTTCTGATTCTTTTGTTTATTTTTTATTCTTTTTTGGGGGAAAAAAATTCGTTCTTATCCTCGTAGCTCAAGTTGTATAACTCTCACTCAAAGGAAAACAACCCTTAAGTTTAAGCATTTGATTTATTGAGTCGTCTCTGATCGTTTAGTTTTTGTTTGTCTCCTTTAGTCATGTAAGGTTAGTATGACTACTTTTTAGCAAGTTTAAATTATTTGTTGCTCCTGACTCTTCCCTCTTCGTTTGAACT

Coding sequence (CDS)

ATGGATTCAACTTCTTTAACCATTTCATCATCTTTCATTTTCCGCAATTCATCGCAGATATTACTTGCCATTAGAAAGATTCAAGACGGTTTGTGTTTCGAGAGGAACAAATTCTCCAAGATCTTCGCGGTCTATCCCAATGGCTCTGCTTCTGGGCCGGGGGATTTTTCAGCTGCAGATGTTCACAGACAACGAAGTGCTTTTGAATCTTTATTTTGCTATGATAAGGCTATTCCGGAAGAAATAATTGAGATGCCTGTTGGAATATCTCTGGCAGAGAAAATGATTGGAGATAATCCCCGTTGCACTGATTGTCATGCTAAAGGTGCTGTTCTTTGCACAACTTGCTCCGGGTCGGGATTGTATGTCGACTCGATATTGGAGAGCCAGGGAATCATTGTGAAAGTTCGTTGCCTCGGTTGTGGTGGAACTGGTAATACCATGTGTTCAGAATGCGGCGGTCTTGGTCACCTGGGATCTAAATGA

Protein sequence

MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAADVHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSGLYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK
Homology
BLAST of Cp4.1LG10g10970 vs. NCBI nr
Match: XP_022950676.1 (uncharacterized protein LOC111453701 [Cucurbita moschata])

HSP 1 Score: 317 bits (811), Expect = 8.20e-109
Identity = 156/161 (96.89%), Postives = 158/161 (98.14%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAAD 60
           MDS+SLTISSSFIFRNSSQ LLAIRKIQDGLCF+RNKFSKIFAVYPNGSASGP DFSAAD
Sbjct: 1   MDSSSLTISSSFIFRNSSQKLLAIRKIQDGLCFQRNKFSKIFAVYPNGSASGPRDFSAAD 60

Query: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120
           VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG
Sbjct: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120

Query: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGG GHLGSK
Sbjct: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGRGHLGSK 161

BLAST of Cp4.1LG10g10970 vs. NCBI nr
Match: KAG6603460.1 (hypothetical protein SDJN03_04069, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 314 bits (805), Expect = 6.74e-108
Identity = 155/161 (96.27%), Postives = 157/161 (97.52%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAAD 60
           MDS+SLTISSSFIFRNSSQ LLAIRKIQDGLCF+RNKFSKIFAVYPNGSASGP DFSAAD
Sbjct: 1   MDSSSLTISSSFIFRNSSQKLLAIRKIQDGLCFQRNKFSKIFAVYPNGSASGPRDFSAAD 60

Query: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120
           VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTC GSG
Sbjct: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCFGSG 120

Query: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGG GHLGSK
Sbjct: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGRGHLGSK 161

BLAST of Cp4.1LG10g10970 vs. NCBI nr
Match: KAG7033643.1 (hypothetical protein SDJN02_03367, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 306 bits (784), Expect = 5.50e-102
Identity = 151/158 (95.57%), Postives = 154/158 (97.47%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAAD 60
           MDS+SLTISSSFIFRNSSQ LLAIRKIQDGLCF+RNKFSKIFAVYPNGSASGP DFSAAD
Sbjct: 1   MDSSSLTISSSFIFRNSSQKLLAIRKIQDGLCFQRNKFSKIFAVYPNGSASGPRDFSAAD 60

Query: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120
           VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG
Sbjct: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120

Query: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHL 158
           LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGG G +
Sbjct: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGRGLI 158

BLAST of Cp4.1LG10g10970 vs. NCBI nr
Match: XP_038881865.1 (uncharacterized protein LOC120073222 isoform X1 [Benincasa hispida])

HSP 1 Score: 280 bits (715), Expect = 3.56e-94
Identity = 139/161 (86.34%), Postives = 144/161 (89.44%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAAD 60
           MDS +LT+SSS I RNSSQ LLA RKIQ GLCF RNKFSKI AVYPNGSASG GD S AD
Sbjct: 1   MDSATLTMSSSVILRNSSQKLLANRKIQPGLCFARNKFSKISAVYPNGSASGQGDSSPAD 60

Query: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120
           VHR+RS+FESLFCYDKAIPEE IE PVGISLAE+MIGDNPRCTDCHAKG VLC TCSGSG
Sbjct: 61  VHRRRSSFESLFCYDKAIPEERIETPVGISLAERMIGDNPRCTDCHAKGVVLCATCSGSG 120

Query: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           LYVDSILESQGIIVKVRCLGCGGTGN MCSECGG GHLGSK
Sbjct: 121 LYVDSILESQGIIVKVRCLGCGGTGNIMCSECGGRGHLGSK 161

BLAST of Cp4.1LG10g10970 vs. NCBI nr
Match: XP_038881866.1 (uncharacterized protein LOC120073222 isoform X2 [Benincasa hispida])

HSP 1 Score: 275 bits (704), Expect = 1.63e-92
Identity = 139/161 (86.34%), Postives = 144/161 (89.44%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAAD 60
           MDS +LT+SSS I RNSSQ LLA RKIQ GLCF RNKFSKI AVYPNGSASG GD S AD
Sbjct: 1   MDSATLTMSSSVILRNSSQKLLANRKIQPGLCFARNKFSKISAVYPNGSASG-GDSSPAD 60

Query: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120
           VHR+RS+FESLFCYDKAIPEE IE PVGISLAE+MIGDNPRCTDCHAKG VLC TCSGSG
Sbjct: 61  VHRRRSSFESLFCYDKAIPEERIETPVGISLAERMIGDNPRCTDCHAKGVVLCATCSGSG 120

Query: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           LYVDSILESQGIIVKVRCLGCGGTGN MCSECGG GHLGSK
Sbjct: 121 LYVDSILESQGIIVKVRCLGCGGTGNIMCSECGGRGHLGSK 160

BLAST of Cp4.1LG10g10970 vs. ExPASy TrEMBL
Match: A0A6J1GFH6 (uncharacterized protein LOC111453701 OS=Cucurbita moschata OX=3662 GN=LOC111453701 PE=4 SV=1)

HSP 1 Score: 317 bits (811), Expect = 3.97e-109
Identity = 156/161 (96.89%), Postives = 158/161 (98.14%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAAD 60
           MDS+SLTISSSFIFRNSSQ LLAIRKIQDGLCF+RNKFSKIFAVYPNGSASGP DFSAAD
Sbjct: 1   MDSSSLTISSSFIFRNSSQKLLAIRKIQDGLCFQRNKFSKIFAVYPNGSASGPRDFSAAD 60

Query: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120
           VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG
Sbjct: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120

Query: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGG GHLGSK
Sbjct: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGRGHLGSK 161

BLAST of Cp4.1LG10g10970 vs. ExPASy TrEMBL
Match: A0A6J1DJS1 (uncharacterized protein LOC111021761 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021761 PE=4 SV=1)

HSP 1 Score: 271 bits (694), Expect = 3.14e-91
Identity = 137/165 (83.03%), Postives = 143/165 (86.67%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSAS----GPGDF 60
           MDS +LT+SSS I RNSSQ LL  RKIQ GLCF R +FSKIFAVYPNGSA     G GD 
Sbjct: 1   MDSATLTVSSSIILRNSSQKLLPGRKIQAGLCFRRRRFSKIFAVYPNGSARSSAPGQGDS 60

Query: 61  SAADVHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTC 120
           SAADVHR+RS FESLFCYDKAIPEE IE PVGISLAEK+IGDNPRCTDCHAKGAVLCTTC
Sbjct: 61  SAADVHRRRSTFESLFCYDKAIPEERIEKPVGISLAEKIIGDNPRCTDCHAKGAVLCTTC 120

Query: 121 SGSGLYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           SGSGLYVD+ILESQGIIVKVRCLGCGGTGN MCSECGG GHL SK
Sbjct: 121 SGSGLYVDAILESQGIIVKVRCLGCGGTGNIMCSECGGRGHLASK 165

BLAST of Cp4.1LG10g10970 vs. ExPASy TrEMBL
Match: A0A6J1DNW0 (uncharacterized protein LOC111021761 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111021761 PE=4 SV=1)

HSP 1 Score: 270 bits (691), Expect = 8.69e-91
Identity = 138/164 (84.15%), Postives = 144/164 (87.80%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSA--SGPG-DFS 60
           MDS +LT+SSS I RNSSQ LL  RKIQ GLCF R +FSKIFAVYPNGSA  S PG D S
Sbjct: 1   MDSATLTVSSSIILRNSSQKLLPGRKIQAGLCFRRRRFSKIFAVYPNGSARSSAPGGDSS 60

Query: 61  AADVHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCS 120
           AADVHR+RS FESLFCYDKAIPEE IE PVGISLAEK+IGDNPRCTDCHAKGAVLCTTCS
Sbjct: 61  AADVHRRRSTFESLFCYDKAIPEERIEKPVGISLAEKIIGDNPRCTDCHAKGAVLCTTCS 120

Query: 121 GSGLYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           GSGLYVD+ILESQGIIVKVRCLGCGGTGN MCSECGG GHL SK
Sbjct: 121 GSGLYVDAILESQGIIVKVRCLGCGGTGNIMCSECGGRGHLASK 164

BLAST of Cp4.1LG10g10970 vs. ExPASy TrEMBL
Match: A0A0A0L441 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G580370 PE=4 SV=1)

HSP 1 Score: 270 bits (689), Expect = 1.53e-90
Identity = 136/161 (84.47%), Postives = 144/161 (89.44%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAAD 60
           MDS +LT+SSS I RNSS  LL IRKIQ  LCF+RNKFSKI A+YPNGSASG GD SAAD
Sbjct: 1   MDSATLTMSSSVILRNSSLKLLVIRKIQPPLCFKRNKFSKISALYPNGSASG-GDSSAAD 60

Query: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120
           VHR+RS+FESLFCYDKAIPEE IE P+GISLAEKMIG+NPRCTDC AKGAVLC TCSGSG
Sbjct: 61  VHRRRSSFESLFCYDKAIPEERIETPIGISLAEKMIGNNPRCTDCQAKGAVLCATCSGSG 120

Query: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           LYVDSILESQGIIVKVRCLGCGGTGN MCSECGG GHLGSK
Sbjct: 121 LYVDSILESQGIIVKVRCLGCGGTGNIMCSECGGRGHLGSK 160

BLAST of Cp4.1LG10g10970 vs. ExPASy TrEMBL
Match: A0A1S3B4T2 (uncharacterized protein LOC103486023 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486023 PE=4 SV=1)

HSP 1 Score: 269 bits (688), Expect = 2.25e-90
Identity = 133/161 (82.61%), Postives = 143/161 (88.82%), Query Frame = 0

Query: 1   MDSTSLTISSSFIFRNSSQILLAIRKIQDGLCFERNKFSKIFAVYPNGSASGPGDFSAAD 60
           MDS +LT+SSS I RNSS  LL IRKIQ  LCF+RN+FS+I A+Y NGSASG GD SAAD
Sbjct: 1   MDSATLTLSSSVILRNSSLKLLVIRKIQPSLCFKRNEFSRISALYANGSASGQGDSSAAD 60

Query: 61  VHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGSG 120
           VHR+RS+FESLFCYDKAIPEE IE P+GISLAEKMIG+NPRCTDC AKGAVLC TCSGSG
Sbjct: 61  VHRRRSSFESLFCYDKAIPEERIETPIGISLAEKMIGNNPRCTDCQAKGAVLCATCSGSG 120

Query: 121 LYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLGSK 161
           LYVDSILESQGIIVKVRCLGCGGTGN MCSECGG GHLGSK
Sbjct: 121 LYVDSILESQGIIVKVRCLGCGGTGNIMCSECGGRGHLGSK 161

BLAST of Cp4.1LG10g10970 vs. TAIR 10
Match: AT5G17840.1 (DnaJ/Hsp40 cysteine-rich domain superfamily protein )

HSP 1 Score: 172.6 bits (436), Expect = 2.7e-43
Identity = 77/100 (77.00%), Postives = 89/100 (89.00%), Query Frame = 0

Query: 60  DVHRQRSAFESLFCYDKAIPEEIIEMPVGISLAEKMIGDNPRCTDCHAKGAVLCTTCSGS 119
           DVHRQRS+ ES+FCYDK IPEEIIE PVG+S++E+ IGDN RCT C AKGA+LC+TCSG+
Sbjct: 54  DVHRQRSSLESMFCYDKPIPEEIIEEPVGLSMSEREIGDNQRCTCCEAKGALLCSTCSGT 113

Query: 120 GLYVDSILESQGIIVKVRCLGCGGTGNTMCSECGGLGHLG 160
           GLYVDSI+ESQGIIVKVRCLGCGG+GN MC  CGG GH+G
Sbjct: 114 GLYVDSIMESQGIIVKVRCLGCGGSGNIMCKLCGGRGHVG 153

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022950676.18.20e-10996.89uncharacterized protein LOC111453701 [Cucurbita moschata][more]
KAG6603460.16.74e-10896.27hypothetical protein SDJN03_04069, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7033643.15.50e-10295.57hypothetical protein SDJN02_03367, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_038881865.13.56e-9486.34uncharacterized protein LOC120073222 isoform X1 [Benincasa hispida][more]
XP_038881866.11.63e-9286.34uncharacterized protein LOC120073222 isoform X2 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1GFH63.97e-10996.89uncharacterized protein LOC111453701 OS=Cucurbita moschata OX=3662 GN=LOC1114537... [more]
A0A6J1DJS13.14e-9183.03uncharacterized protein LOC111021761 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1DNW08.69e-9184.15uncharacterized protein LOC111021761 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A0A0L4411.53e-9084.47Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G580370 PE=4 SV=1[more]
A0A1S3B4T22.25e-9082.61uncharacterized protein LOC103486023 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
AT5G17840.12.7e-4377.00DnaJ/Hsp40 cysteine-rich domain superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR15852:SF13DNAJ/HSP40 CYSTEINE-RICH DOMAIN SUPERFAMILY PROTEINcoord: 42..154
NoneNo IPR availablePANTHERPTHR15852PLASTID TRANSCRIPTIONALLY ACTIVE PROTEINcoord: 42..154
IPR036410Heat shock protein DnaJ, cysteine-rich domain superfamilySUPERFAMILY57938DnaJ/Hsp40 cysteine-rich domaincoord: 99..158

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG10g10970.1Cp4.1LG10g10970.1mRNA