Cp4.1LG01g01610 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG01g01610
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: IENR2 domain-containing protein
Location: Cp4.1LG01: 2959564 .. 2964153 (+)
RNA-Seq Expression: Cp4.1LG01g01610
Synteny: Cp4.1LG01g01610
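
The Location field packs the sequence name, start, end, and strand into one string. A minimal Python sketch for unpacking it is given here. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(text):
    """Split a 'SEQID: START .. END (STRAND)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01: 2959564 .. 2964153 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 4590 bp on the plus strand.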
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGAAGCAAGCAATGGTTTATATAAATTCTACCGCCAACCACAGTGGTTTCAGTGCCATTAGCGTAATTGAATTTTGCATTTTATGGGCATTGGGCTGTTGGGTCCTTCGGGTTCGTCTTCTGCGAATCCCTCTCGGACACTGACTCTCCGTCGCCGGTTTCTTCATCTGTCTCCACCTTATGGATTACCATTTCACTCGAATTCCATACATTCACATGTAAGTTCTTCTCCATTCTCTTACATTTTCAAGCTCGCTGGAAATTTCTTTGTGTAGGTTTAGCGAAATTTTGTATCGTTTCTCTGTTTCCCCTTATGGATCTGTAAGTCTCGGCCAGATTAACATCTCTGAATTAGATTGCAAGGAAGTGGTCATATTCCTTCTGTTTTTTAAAATTATTTTGCGTGCATTTGTCAGTATTGTGAAGGCTGGGTGAAGTTGAAGGGCAATGTGAATCAATAGACTCTCCTGCATTTTAGAACCTTATTTATCTTCTTTGGTTGATACTAATGTACTTTAAGTACTAGGATCGAATTGAAACTACAATTGAGAAGAGTGTTGTTTAATCTTTTATTCATTATTTTTCATAGACTACTACATACGTTAATATTAATTAGTCTGAAAGAAGTGAAGGTTTATGAATCTAAAATTTGAATGCTGTTCTTCTTTTGAGAGTTGAAAGGAAAAACATAGATGTTACAACTTTCTTTTATAAGACATACTTATCTTATTGCATTTCTGAATGTATAACAGGAAATTATTAGGTGCAACATTATCTGTTAATCTTGCTCCAAATTCTGCTATCTGGAAGACTTTCTATTACCCAGTTGCCAACGTCAATCATCCATCAAATGTGATGCCAATGAATCAGCAGATATCTATATGCAGGAATGATTCATTATCTTCTCCTTTCAATGTCTTCAACAGAACAAATTCTTCCCAGTCCTTGCTGTTCATTGTTGCTGAGGGTAGAATTTCCAACTCTGGTGAGTGCTACAAGCCTAAATGTTCCTCAGTTTCCTTTGAGAAGCAGGTTTCAAGCAGAAATATCGGCGACGACGATTGCCCAGAAAATCATGAAACTGAAAATGACAAGGAGTGGCAAAGACGAAGAAAAATAGGAGTGGCAAATAAGGGCAAAGTACCATGGAACAAGGGCAAGAAACACAGCTTGGGTAAGAACATTTAGTAGGTTCTCTAGCAGTGGTGAATTTTAAGTGGCTAAGGTTATACTTGGTGGAGTTTGGGATTTTAACATATAATTTATCTGCAGAAACTCGTAAGCGAATCAAGCAGAGAACAATTGAAGCCTTGAAAAACCCCAAGGTGAGGTTCACATTTCAGGCTGATTCACCAGTCATATTCTGTAATTTCAACTTTTTCATTTTTCTTTAATATTTCAGGTGAGGAGGAAGATGTCCGAATATCCCCGCCCCACTCATAGGTTTCACTCTTAACCCTTCACTCTTTTATTTCTGAATGCCGAAATGAACTTGAATAACAGAAGATTTAACATCAGATGCTTTCAGATTTTGTTGGATATTTTAGTGAAAAGCAAAGATCTCTTGTAGCATTGTCACTTTATAGAGTTAATATGAAGTTTTATGTAAGAATGAATTGTAAACCTTACAATCTACTTTCAAGATCTCTTATAAAAGAGGGAAAATAGAAAAATCCTGATTTTGGATCTGGCACTGAAATCATGAACAGGCTGTGGGCGAAAATGCTCGACTCTTATAGTCCTAGAATTGGTGCCATTGGCTCCCAAATTTCAACATTTAGCACTTGACCATGTGGAATTAAGAATCTCGCTTATGTTCAACATTCCTCAGAATTTTACTGTCTATTTCAGTTCTTTGTCAGGAAAACTTATTATTTTGAGGGAATTTCCTCGTTATCAAGTGGAATTGTTCAGCTTGACGAGGCAAATCTCGTCTGTTTATTAGGCATAAACATGCTATAAGATACAGTCGTGCTTTATTCTATTTGCAGTGATCAGGTC
AAAACTAAAATTAGCTCCTCACTCAGACGTGTATGGGGAAAGAGATTATTGAAGAAGAGATTAAACGAGGCGTTCTTCCAGTCTTGGAAGGAAAGCATAGCTGTTGCTGCGAAGAAAGGAGGGAAGGGAGAACAAGAACATGACTGGGACAGCCATGACAAGATAATACAAGAAATGCTTCATCAAAAGCTTAAAATGGTTGAAGAGAAGGAAAAATTAAAGCTGATGAGAGCAGAGAATGCGAAAAAGAGAAAAATCCAAGGAAGGGGTGCCAAAATAAAGAAAAGGAAAATGTGTTCTAGAAGAAGAAAAGGAGGAAAAAGAAAGATGAAAGAGGGAGAAGACATCCAGAGAACGATGAAGGAGCTAACTGCAATTGAAAGATCGGGACTTAAGCAAAGATTGAAGAAGGTAAAGATCAGGGAGGTCCCGTAGATTTACTTAATAGTTTGACTGATCCACTAAATGGATCAAAATGATAGCATGCAATGGAACTGATCTATAACAAAGGCCAACCTACCATAGCTATTTGGAGATGTAAATTTCTAGGCCAGCCTTCATTTTAATCTGTCAGCCCTATGATAATGATGTTCTTATTATGTTTTCATGTCTTCTATTAAGGTTATCCTCTTTCACTTACCTTTTGATTAGATTCGCAAAAAGATTGCAATAAACAGTGTAGTTGCTGCTCAAGGAAGCGTTGCATCAGTGGTGCCCCGAGGCACAACCTGGGAAAAAATGGATCTAGATCTTATAAAGAAGGGTAAACTGAGGGAAGAAGTGTCGCTGGCAGATCAAATTCAATTTGCCAAGAACAGAAAAGCAGAATCTATAGCTTGCAAAATTCTTGTAGCTTCTACTTTGTCGTACGAATGCACCAGGGGAGCGTAAAGATAACTTTTGACTGACCTTCACTGCTTGCAAGAAGACAATTTTATGGTGTCTTAAGGTGACTTTTCCTCCCATCCAAGGTGATTTTTCATTTTTATCAATAGCTTATGTTCTTATTTGTTAGTGATTCTTCTTGAAAGGAAGAAAATTTTCTTCACTTTCAATGTCACTTTTCCACCGGGAGATCAAAAGGTTTCTTTTGTTTTGTTCAACGTGATAAGATCACTAACAGTAATTCTGCAAAACTCACACCCCACACGCGCGCACACACACAAAACTTCTGTAATCCTTGTCTTGACTCCTGCACATTTATATCCATTGGAGTCGCAGAATCTTGTTCAATTCGACACGAGTAAAAAGGCTTGGGAAACTGCTACAGCTCACGAGCTTTTGACGACTTGAAGAAAGGTAGTGAGAAGAAGTTGAAAAGTTTTACTTTGCAGGATGGATTCTGATAAGCGATCGAGCAAGAGCTCCACGTCCATGGCAGCAGATAGATGGCTTGCCTTTTGCTCTGCCTGCTTCATAATTGTGTGCGGGTATGTAAAGATACTAAGTTCCTTGATACAATTCTGAATTCAAATTTTTCCTCTAATTTTCTTCTTTAGGAATTGATAAACTATCTCAAAGTTAAGGTGGAAAAATAATTCGAATTGTGATTCAACTAGAAAAGTTTAGAGAACTATTGCCATTGCCATACCAGTAAGTGAGCCACAATACAGGAGAGTTGAGGGGGGCACAAATGAGATTGCCCAAAAACATAGATGACATTGTGAATGTTATTACATCATCTCATGCATCTTTGAAACTATACTACCCACAAAGCAAAACTAAACTCATGGTTTGCCGCAGTAGTACCTGAACAAATACTCAGTGGCCAGAAGGATTTCAGGTTTGAACCAAGCAACAAGTAAATCTTAAAAAACCAAAGAATTCCGCTACTTTGAGACTTTCCTGAACAGTTTATGACTATTCTCGTTGATTGTTATGCACATTCCTAGACCACCCACCAGCATTAGGCTGCATCTCTCTATAGCTGTTTTGAGCAGGTGGCCCACCCGTGTTTGAAGCTCCACCATAGTAGTTGGACGGTGGTGCTCCCCAATGGT
TGTTTGGCTGTGGCTCTCCAGAGTTGTATGGTGGTGGTGGTGGTGCCCCGGAGTTGTATGGTGGTGGTGGTGCCCCGGAGTTGTATGGTGGTGGTGGTGCATAGTTGTGCGGTGGTGGTACGCTAAAGTTGTTCTGCGGTGGTCGTGGTGGTGGTGGTCCGCTAAAGTTGTTCTGTGGTGGTGGAGGTGGTCTGCTAAAGTTATTCGGTGGTGGTGGTGGTCCACTAAAGTTGTTGGGTCGTGGCGGTGGTGGTCCGCTAAAGTTGCTTGATGGTGGTGGTGGTGGTGGCGGCGGCCGGCTAAAGGTGTTGGGTGGTGGCGGCGGTGGTCGGCTATAGTTGCCTTGGGGCAATCCATCATGATTGCGAGGTGGCATATTACTTGGAGCTGTTCCTGAAGTAGGTGAAGCATTTGGGAATTCTCTGTTCCGGGTATGTTCCCTCCTATTGAAATTTCTTGAGCCGTCTGAATTACGCCTATTCCTTTCATTTGCTCTAGCATTGTTTCTTACCCATTCCTCATGGTACTTCGGGTCATATGGAACAGCCTGACCATTTATAAAAGGCTCCCCTGCCAAAAGAAGAAGAATG

mRNA sequence

TGAAGCAAGCAATGGTTTATATAAATTCTACCGCCAACCACAGTGGTTTCAGTGCCATTAGCGTAATTGAATTTTGCATTTTATGGGCATTGGGCTGTTGGGTCCTTCGGGTTCGTCTTCTGCGAATCCCTCTCGGACACTGACTCTCCGTCGCCGGTTTCTTCATCTGTCTCCACCTTATGGATTACCATTTCACTCGAATTCCATACATTCACATGAAATTATTAGGTGCAACATTATCTGTTAATCTTGCTCCAAATTCTGCTATCTGGAAGACTTTCTATTACCCAGTTGCCAACGTCAATCATCCATCAAATGTGATGCCAATGAATCAGCAGATATCTATATGCAGGAATGATTCATTATCTTCTCCTTTCAATGTCTTCAACAGAACAAATTCTTCCCAGTCCTTGCTGTTCATTGTTGCTGAGGGTAGAATTTCCAACTCTGGTGAGTGCTACAAGCCTAAATGTTCCTCAGTTTCCTTTGAGAAGCAGGTTTCAAGCAGAAATATCGGCGACGACGATTGCCCAGAAAATCATGAAACTGAAAATGACAAGGAGTGGCAAAGACGAAGAAAAATAGGAGTGGCAAATAAGGGCAAAGTACCATGGAACAAGGGCAAGAAACACAGCTTGGAAACTCGTAAGCGAATCAAGCAGAGAACAATTGAAGCCTTGAAAAACCCCAAGGTGAGGAGGAAGATGTCCGAATATCCCCGCCCCACTCATAGTGATCAGGTCAAAACTAAAATTAGCTCCTCACTCAGACGTGTATGGGGAAAGAGATTATTGAAGAAGAGATTAAACGAGGCGTTCTTCCAGTCTTGGAAGGAAAGCATAGCTGTTGCTGCGAAGAAAGGAGGGAAGGGAGAACAAGAACATGACTGGGACAGCCATGACAAGATAATACAAGAAATGCTTCATCAAAAGCTTAAAATGGTTGAAGAGAAGGAAAAATTAAAGCTGATGAGAGCAGAGAATGCGAAAAAGAGAAAAATCCAAGGAAGGGGTGCCAAAATAAAGAAAAGGAAAATGTGTTCTAGAAGAAGAAAAGGAGGAAAAAGAAAGATGAAAGAGGGAGAAGACATCCAGAGAACGATGAAGGAGCTAACTGCAATTGAAAGATCGGGACTTAAGCAAAGATTGAAGAAGATTCGCAAAAAGATTGCAATAAACAGTGTAGTTGCTGCTCAAGGAAGCGTTGCATCAGTGGTGCCCCGAGGCACAACCTGGGAAAAAATGGATCTAGATCTTATAAAGAAGGGTAAACTGAGGGAAGAAGTGTCGCTGGCAGATCAAATTCAATTTGCCAAGAACAGAAAAGCAGAATCTATAGCTTGCAAAATTCTTGTAGCTTCTACTTTGTCGTACGAATGCACCAGGGGAGCGTAAAGATAACTTTTGACTGACCTTCACTGCTTGCAAGAAGACAATTTTATGGTGTCTTAAGGTGACTTTTCCTCCCATCCAAGAATCTTGTTCAATTCGACACGAGTAAAAAGGCTTGGGAAACTGCTACAGCTCACGAGCTTTTGACGACTTGAAGAAAGGTAGTGAGAAGAAGTTGAAAAGTTTTACTTTGCAGGATGGATTCTGATAAGCGATCGAGCAAGAGCTCCACGTCCATGGCAGCAGATAGATGGCTTGCCTTTTGCTCTGCCTGCTTCATAATTGTGTGCGGTACCTGAACAAATACTCAGTGGCCAGAAGGATTTCAGGTTTGAACCAAGCAACAAGTAAATCTTAAAAAACCAAAGAATTCCGCTACTTTGAGACTTTCCTGAACAGTTTATGACTATTCTCGTTGATTGTTATGCACATTCCTAGACCACCCACCAGCATTAGGCTGCATCTCTCTATAGCTGTTTTGAGCAGGTGGCCCACCCGTGTTTGAAGCTCCACCATAGTAGTTGGACGGTGGTGCTCCCCAATGGTTGTTTGGCTGTGGCTCTCCAGAGTTGTATGGTGGTGGTGGTGGTGCCCCGGAGTTGTATGGTG
GTGGTGGTGCCCCGGAGTTGTATGGTGGTGGTGGTGCATAGTTGTGCGGTGGTGGTACGCTAAAGTTGTTCTGCGGTGGTCGTGGTGGTGGTGGTCCGCTAAAGTTGTTCTGTGGTGGTGGAGGTGGTCTGCTAAAGTTATTCGGTGGTGGTGGTGGTCCACTAAAGTTGTTGGGTCGTGGCGGTGGTGGTCCGCTAAAGTTGCTTGATGGTGGTGGTGGTGGTGGCGGCGGCCGGCTAAAGGTGTTGGGTGGTGGCGGCGGTGGTCGGCTATAGTTGCCTTGGGGCAATCCATCATGATTGCGAGGTGGCATATTACTTGGAGCTGTTCCTGAAGTAGGTGAAGCATTTGGGAATTCTCTGTTCCGGGTATGTTCCCTCCTATTGAAATTTCTTGAGCCGTCTGAATTACGCCTATTCCTTTCATTTGCTCTAGCATTGTTTCTTACCCATTCCTCATGGTACTTCGGGTCATATGGAACAGCCTGACCATTTATAAAAGGCTCCCCTGCCAAAAGAAGAAGAATG

Coding sequence (CDS)

ATGGATTACCATTTCACTCGAATTCCATACATTCACATGAAATTATTAGGTGCAACATTATCTGTTAATCTTGCTCCAAATTCTGCTATCTGGAAGACTTTCTATTACCCAGTTGCCAACGTCAATCATCCATCAAATGTGATGCCAATGAATCAGCAGATATCTATATGCAGGAATGATTCATTATCTTCTCCTTTCAATGTCTTCAACAGAACAAATTCTTCCCAGTCCTTGCTGTTCATTGTTGCTGAGGGTAGAATTTCCAACTCTGGTGAGTGCTACAAGCCTAAATGTTCCTCAGTTTCCTTTGAGAAGCAGGTTTCAAGCAGAAATATCGGCGACGACGATTGCCCAGAAAATCATGAAACTGAAAATGACAAGGAGTGGCAAAGACGAAGAAAAATAGGAGTGGCAAATAAGGGCAAAGTACCATGGAACAAGGGCAAGAAACACAGCTTGGAAACTCGTAAGCGAATCAAGCAGAGAACAATTGAAGCCTTGAAAAACCCCAAGGTGAGGAGGAAGATGTCCGAATATCCCCGCCCCACTCATAGTGATCAGGTCAAAACTAAAATTAGCTCCTCACTCAGACGTGTATGGGGAAAGAGATTATTGAAGAAGAGATTAAACGAGGCGTTCTTCCAGTCTTGGAAGGAAAGCATAGCTGTTGCTGCGAAGAAAGGAGGGAAGGGAGAACAAGAACATGACTGGGACAGCCATGACAAGATAATACAAGAAATGCTTCATCAAAAGCTTAAAATGGTTGAAGAGAAGGAAAAATTAAAGCTGATGAGAGCAGAGAATGCGAAAAAGAGAAAAATCCAAGGAAGGGGTGCCAAAATAAAGAAAAGGAAAATGTGTTCTAGAAGAAGAAAAGGAGGAAAAAGAAAGATGAAAGAGGGAGAAGACATCCAGAGAACGATGAAGGAGCTAACTGCAATTGAAAGATCGGGACTTAAGCAAAGATTGAAGAAGATTCGCAAAAAGATTGCAATAAACAGTGTAGTTGCTGCTCAAGGAAGCGTTGCATCAGTGGTGCCCCGAGGCACAACCTGGGAAAAAATGGATCTAGATCTTATAAAGAAGGGTAAACTGAGGGAAGAAGTGTCGCTGGCAGATCAAATTCAATTTGCCAAGAACAGAAAAGCAGAATCTATAGCTTGCAAAATTCTTGTAGCTTCTACTTTGTCGTACGAATGCACCAGGGGAGCGTAA

Protein sequence

MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRNDSLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPENHETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSHDKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKEGEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLIKKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYECTRGA
Homology
BLAST of Cp4.1LG01g01610 vs. NCBI nr
Match: XP_023514976.1 (uncharacterized protein LOC111779131 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 774 bits (1998), Expect = 1.29e-281
Identity = 404/404 (100.00%), Positives = 404/404 (100.00%), Query Frame = 0

Query: 1   MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND 60
           MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND
Sbjct: 1   MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND 60

Query: 61  SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN 120
           SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN
Sbjct: 61  SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN 120

Query: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180
           HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP
Sbjct: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180

Query: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH 240
           RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH
Sbjct: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH 240

Query: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKE 300
           DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKE
Sbjct: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKE 300

Query: 301 GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLI 360
           GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLI
Sbjct: 301 GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLI 360

Query: 361 KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYECTRGA 404
           KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYECTRGA
Sbjct: 361 KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYECTRGA 404

BLAST of Cp4.1LG01g01610 vs. NCBI nr
Match: KAG7031061.1 (hypothetical protein SDJN02_05100 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 738 bits (1905), Expect = 9.57e-267
Identity = 390/404 (96.53%), Positives = 393/404 (97.28%), Query Frame = 0

Query: 1   MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND 60
           MDYHFTR+PYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVN PSNVMPMNQQISICRND
Sbjct: 1   MDYHFTRMPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNLPSNVMPMNQQISICRND 60

Query: 61  SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN 120
           SLSSP NVFNRTNSSQSLLFIVAEGRISNSGECYK KCSS SFEKQVSSRNIGDDDCPEN
Sbjct: 61  SLSSPSNVFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPEN 120

Query: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180
           HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP
Sbjct: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180

Query: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH 240
           RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFF+SWKESIAVAAKKGGK EQE DWDSH
Sbjct: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSH 240

Query: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKE 300
           DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRR GGKR+MKE
Sbjct: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRNGGKRRMKE 300

Query: 301 GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLI 360
           GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLD I
Sbjct: 301 GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDRI 360

Query: 361 KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYECTRGA 404
           KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSY C  GA
Sbjct: 361 KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYGCAGGA 404
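
The Identity and Positives percentages in an HSP header are the match counts divided by the alignment length. A small Python sketch that recomputes them from the counts reported for this hit follows. `percent` is a hypothetical helper, and rounding to two decimals is an assumption based on the values displayed.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

identity = percent(390, 404)   # identical residues / aligned positions
positives = percent(393, 404)  # identical or similar residues / aligned positions
print(identity, positives)
```

Both values agree with the header of this HSP (96.53 and 97.28).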

BLAST of Cp4.1LG01g01610 vs. NCBI nr
Match: KAG6600400.1 (hypothetical protein SDJN03_05633, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 738 bits (1905), Expect = 4.55e-266
Identity = 390/404 (96.53%), Positives = 393/404 (97.28%), Query Frame = 0

Query: 1   MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND 60
           MDYHFTR+PYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVN PSNVMPMNQQISICRND
Sbjct: 1   MDYHFTRMPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNLPSNVMPMNQQISICRND 60

Query: 61  SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN 120
           SLSSP NVFNRTNSSQSLLFIVAEGRISNSGECYK KCSS SFEKQVSSRNIGDDDCPEN
Sbjct: 61  SLSSPSNVFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPEN 120

Query: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180
           HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP
Sbjct: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180

Query: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH 240
           RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFF+SWKESIAVAAKKGGK EQE DWDSH
Sbjct: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSH 240

Query: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKE 300
           DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRR GGKR+MKE
Sbjct: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRNGGKRRMKE 300

Query: 301 GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLI 360
           GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLD I
Sbjct: 301 GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDRI 360

Query: 361 KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYECTRGA 404
           KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSY C  GA
Sbjct: 361 KKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYGCAGGA 404

BLAST of Cp4.1LG01g01610 vs. NCBI nr
Match: XP_022981197.1 (uncharacterized protein LOC111480410 [Cucurbita maxima] >XP_022981210.1 uncharacterized protein LOC111480410 [Cucurbita maxima] >XP_022981223.1 uncharacterized protein LOC111480410 [Cucurbita maxima])

HSP 1 Score: 705 bits (1819), Expect = 2.17e-254
Identity = 376/393 (95.67%), Positives = 381/393 (96.95%), Query Frame = 0

Query: 8   IPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRNDSLSSPFN 67
           +PYIHMKLLGAT+SVNLAPNSAIWKTFYYPVANVN PSNVMPMNQQISICRNDSLSSPFN
Sbjct: 1   MPYIHMKLLGATVSVNLAPNSAIWKTFYYPVANVNLPSNVMPMNQQISICRNDSLSSPFN 60

Query: 68  VFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPENHETENDK 127
           VFNRTNSSQSLLFIVAEGRISNSGECYK KCSS SFEKQVSSRNIGDDDCPEN ETENDK
Sbjct: 61  VFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPENCETENDK 120

Query: 128 EWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQ 187
           EWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQ
Sbjct: 121 EWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQ 180

Query: 188 VKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSHDKIIQEM 247
           VKTKISSSLRRVWGKRLLKKRLNEAFF+SWKESIAVAAKKGGK EQE DWDSHDKIIQEM
Sbjct: 181 VKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSHDKIIQEM 240

Query: 248 LHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKEGEDIQRT 307
           LHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRR GGKRKMKE EDIQRT
Sbjct: 241 LHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRNGGKRKMKEVEDIQRT 300

Query: 308 MKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLIKKGKLRE 367
           +KELTAIERS LKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEK+DLDLIKKGKLRE
Sbjct: 301 LKELTAIERSRLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKLDLDLIKKGKLRE 360

Query: 368 EVSLADQIQFAKNRKAESIACKILVASTLSYEC 400
            VSLADQIQFAK RKAESIACKILVASTLSY C
Sbjct: 361 GVSLADQIQFAKIRKAESIACKILVASTLSYGC 393

BLAST of Cp4.1LG01g01610 vs. NCBI nr
Match: XP_022941889.1 (uncharacterized protein LOC111447116 [Cucurbita moschata])

HSP 1 Score: 691 bits (1784), Expect = 2.56e-249
Identity = 365/382 (95.55%), Positives = 371/382 (97.12%), Query Frame = 0

Query: 1   MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND 60
           MDYHFTR+PYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVN PSN+MPMNQQISICRND
Sbjct: 1   MDYHFTRMPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNLPSNLMPMNQQISICRND 60

Query: 61  SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN 120
           SLSSP NVFNRTNSSQSLLFIVAEGRISNSGECYK KCSS SFEKQVSSRNIGDDDCPEN
Sbjct: 61  SLSSPSNVFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPEN 120

Query: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180
           HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEAL+NPKVRRKMSEYP
Sbjct: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALRNPKVRRKMSEYP 180

Query: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH 240
           RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFF+SWKESIAVAAKKGGK EQE DWDSH
Sbjct: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSH 240

Query: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKE 300
           DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKM SRRR GGKR+MKE
Sbjct: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMRSRRRNGGKRRMKE 300

Query: 301 GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLI 360
           GED+QRT KELTAIERS LKQRLKKIRKKIAIN VVAAQGSVASVVPRGTTWEKMDLDLI
Sbjct: 301 GEDVQRTKKELTAIERSRLKQRLKKIRKKIAINGVVAAQGSVASVVPRGTTWEKMDLDLI 360

Query: 361 KKGKLREEVSLADQIQFAKNRK 382
           KKGKLREEVSLADQIQFAKNRK
Sbjct: 361 KKGKLREEVSLADQIQFAKNRK 382

BLAST of Cp4.1LG01g01610 vs. ExPASy TrEMBL
Match: A0A6J1ITB7 (uncharacterized protein LOC111480410 OS=Cucurbita maxima OX=3661 GN=LOC111480410 PE=4 SV=1)

HSP 1 Score: 705 bits (1819), Expect = 1.05e-254
Identity = 376/393 (95.67%), Positives = 381/393 (96.95%), Query Frame = 0

Query: 8   IPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRNDSLSSPFN 67
           +PYIHMKLLGAT+SVNLAPNSAIWKTFYYPVANVN PSNVMPMNQQISICRNDSLSSPFN
Sbjct: 1   MPYIHMKLLGATVSVNLAPNSAIWKTFYYPVANVNLPSNVMPMNQQISICRNDSLSSPFN 60

Query: 68  VFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPENHETENDK 127
           VFNRTNSSQSLLFIVAEGRISNSGECYK KCSS SFEKQVSSRNIGDDDCPEN ETENDK
Sbjct: 61  VFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPENCETENDK 120

Query: 128 EWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQ 187
           EWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQ
Sbjct: 121 EWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQ 180

Query: 188 VKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSHDKIIQEM 247
           VKTKISSSLRRVWGKRLLKKRLNEAFF+SWKESIAVAAKKGGK EQE DWDSHDKIIQEM
Sbjct: 181 VKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSHDKIIQEM 240

Query: 248 LHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKEGEDIQRT 307
           LHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRR GGKRKMKE EDIQRT
Sbjct: 241 LHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRNGGKRKMKEVEDIQRT 300

Query: 308 MKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLIKKGKLRE 367
           +KELTAIERS LKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEK+DLDLIKKGKLRE
Sbjct: 301 LKELTAIERSRLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKLDLDLIKKGKLRE 360

Query: 368 EVSLADQIQFAKNRKAESIACKILVASTLSYEC 400
            VSLADQIQFAK RKAESIACKILVASTLSY C
Sbjct: 361 GVSLADQIQFAKIRKAESIACKILVASTLSYGC 393

BLAST of Cp4.1LG01g01610 vs. ExPASy TrEMBL
Match: A0A6J1FPR6 (uncharacterized protein LOC111447116 OS=Cucurbita moschata OX=3662 GN=LOC111447116 PE=4 SV=1)

HSP 1 Score: 691 bits (1784), Expect = 1.24e-249
Identity = 365/382 (95.55%), Positives = 371/382 (97.12%), Query Frame = 0

Query: 1   MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND 60
           MDYHFTR+PYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVN PSN+MPMNQQISICRND
Sbjct: 1   MDYHFTRMPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNLPSNLMPMNQQISICRND 60

Query: 61  SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN 120
           SLSSP NVFNRTNSSQSLLFIVAEGRISNSGECYK KCSS SFEKQVSSRNIGDDDCPEN
Sbjct: 61  SLSSPSNVFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPEN 120

Query: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180
           HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEAL+NPKVRRKMSEYP
Sbjct: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALRNPKVRRKMSEYP 180

Query: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH 240
           RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFF+SWKESIAVAAKKGGK EQE DWDSH
Sbjct: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSH 240

Query: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGKRKMKE 300
           DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKM SRRR GGKR+MKE
Sbjct: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRGAKIKKRKMRSRRRNGGKRRMKE 300

Query: 301 GEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLI 360
           GED+QRT KELTAIERS LKQRLKKIRKKIAIN VVAAQGSVASVVPRGTTWEKMDLDLI
Sbjct: 301 GEDVQRTKKELTAIERSRLKQRLKKIRKKIAINGVVAAQGSVASVVPRGTTWEKMDLDLI 360

Query: 361 KKGKLREEVSLADQIQFAKNRK 382
           KKGKLREEVSLADQIQFAKNRK
Sbjct: 361 KKGKLREEVSLADQIQFAKNRK 382

BLAST of Cp4.1LG01g01610 vs. ExPASy TrEMBL
Match: A0A1S3BS78 (uncharacterized protein LOC103492929 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492929 PE=4 SV=1)

HSP 1 Score: 539 bits (1388), Expect = 6.23e-189
Identity = 296/411 (72.02%), Positives = 335/411 (81.51%), Query Frame = 0

Query: 1   MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND 60
           MD HFTR+PYIHM+LLG T +V LAPN A+WK  YYPVAN+N PSN  P+N Q+SI R+D
Sbjct: 1   MDCHFTRMPYIHMRLLGTTFTVKLAPNPALWKISYYPVANINFPSNATPINHQMSIIRSD 60

Query: 61  SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN 120
           SL SPFNVFNRT+SSQ+ LF+V EGR S+ GECYK KCSS S EKQV S     DD PEN
Sbjct: 61  SLFSPFNVFNRTSSSQAFLFMVDEGRNSHFGECYKSKCSSCSIEKQVLSNK---DDSPEN 120

Query: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180
            ETEND EWQRR+KIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEAL++P+VRRKMSEYP
Sbjct: 121 LETENDNEWQRRKKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYP 180

Query: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH 240
           R THSDQVK KISSSLRRVWGKRLLKKRLNE FF SW ESIAVAAKKGGK EQE DWDS+
Sbjct: 181 R-THSDQVKVKISSSLRRVWGKRLLKKRLNETFFLSWMESIAVAAKKGGKEEQELDWDSY 240

Query: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGRG---------AKIKKRKMCSRRR 300
           DKI QE LHQ+L+ V EKEKLK MRAENAK R++Q R          AK KK KMCSRRR
Sbjct: 241 DKIKQETLHQELQRVAEKEKLKAMRAENAKMREVQRRVRKKEKGDDYAKTKKLKMCSRRR 300

Query: 301 KGGKRKMKEGEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTT 360
             GKRK KE +D  R MK+ T IERS LKQRLKKIRKKI+IN  V  QGS+ASV P+ T+
Sbjct: 301 DAGKRKGKEEDDNLRKMKKSTTIERSKLKQRLKKIRKKISINGAVTTQGSIASVAPQNTS 360

Query: 361 WEKMDLDLIKKGKLREEVSLADQIQFAKNRKAESIACKILVA-STLSYECT 401
           WE +DLDLIKKG++R+E SLADQIQ AKNRKAES ACK+L+A STL+++CT
Sbjct: 361 WETLDLDLIKKGQMRKEASLADQIQVAKNRKAESTACKVLIAASTLAFQCT 407

BLAST of Cp4.1LG01g01610 vs. ExPASy TrEMBL
Match: A0A0A0L0E1 (IENR2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G083650 PE=4 SV=1)

HSP 1 Score: 526 bits (1355), Expect = 5.80e-184
Identity = 290/410 (70.73%), Positives = 330/410 (80.49%), Query Frame = 0

Query: 1   MDYHFTRIPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRND 60
           MD HFTR+PYIHM+LLG T +V LAPN A+WK  YYPVAN+N PSN  P+N Q+SI RND
Sbjct: 1   MDCHFTRMPYIHMRLLGTTFTVKLAPNPALWKISYYPVANINFPSNAAPINHQMSIVRND 60

Query: 61  SLSSPFNVFNRTNSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPEN 120
           S+ SPFN+FNRT+ SQ+ LF+V EGR SN GECYK KCSS S EKQV S     DD PEN
Sbjct: 61  SVFSPFNIFNRTSFSQAFLFMVDEGRNSNFGECYKSKCSSCSIEKQVLSNK---DDSPEN 120

Query: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180
            ETENDKEWQRR+KIG+ANKG+VPWNKGKKH+LETR RIKQRTIEAL++P+VRRKMSEYP
Sbjct: 121 LETENDKEWQRRKKIGLANKGRVPWNKGKKHNLETRTRIKQRTIEALRDPEVRRKMSEYP 180

Query: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSH 240
           R  HSDQVK KISSSLRRVWGKRL+KKRLNE FF SW ESIAVAAKKGGK EQE DWDS+
Sbjct: 181 R-IHSDQVKVKISSSLRRVWGKRLMKKRLNETFFLSWMESIAVAAKKGGKEEQELDWDSY 240

Query: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGR---------GAKIKKRKMCSRRR 300
           DKI QE LHQ+L+ V EKEKLK MR ENAK +K+Q R          AK KK KMCSRRR
Sbjct: 241 DKIKQETLHQELRRVAEKEKLKAMR-ENAKMKKVQRRVGKKEKGDDNAKTKKLKMCSRRR 300

Query: 301 KGGKRKMKEGEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTT 360
             GKRK KE +++ R  K+ T IERS LKQRLKKIRKKI+IN  V AQGS+ASV P+   
Sbjct: 301 DEGKRKGKEDDNL-RKKKKSTTIERSKLKQRLKKIRKKISINGAVTAQGSIASVAPQNPC 360

Query: 361 WEKMDLDLIKKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYECT 401
           WEK+DLDLIKKG+  +E SLADQIQ AKNRKAES ACK+L+ASTL+++CT
Sbjct: 361 WEKLDLDLIKKGQTWKEASLADQIQVAKNRKAESTACKVLIASTLAFQCT 404

BLAST of Cp4.1LG01g01610 vs. ExPASy TrEMBL
Match: A0A6J1CMC2 (uncharacterized protein LOC111012783 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111012783 PE=4 SV=1)

HSP 1 Score: 491 bits (1265), Expect = 2.07e-170
Identity = 271/398 (68.09%), Positives = 320/398 (80.40%), Query Frame = 0

Query: 13  MKLLGATLSVNLAPNSAIWKTFYYPVANVNHPSNVMPMNQQISICRNDSLSSPFNVFNRT 72
           M L GAT S+NLA NS +WK F YPVA +N PSNV+P+N QIS+ ++DS  SP ++ NRT
Sbjct: 1   MSLSGATPSINLARNSVLWKIFCYPVA-INLPSNVVPVNHQISVIKHDSSVSPISILNRT 60

Query: 73  NSSQSLLFIVAEGRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPENHETENDKEWQRR 132
           + S  LLF+  EGR SN G CYK KCS  S EK+V  R I DDDCP+N   ENDKE QRR
Sbjct: 61  SHSLPLLFMADEGRNSNFGWCYKSKCSLDSLEKRVYYREISDDDCPQNLGKENDKESQRR 120

Query: 133 RKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQVKTKI 192
           R+IG+ANKG VPWNKGKKH++ETR+RIKQRTIEAL++PKVRRKMSEYPR THSDQVK KI
Sbjct: 121 RRIGLANKGNVPWNKGKKHNMETRERIKQRTIEALRDPKVRRKMSEYPR-THSDQVKVKI 180

Query: 193 SSSLRRVWGKRLLKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSHDKIIQEMLHQKL 252
           SSSLRRVWGKRL+KKRLNE FF SW+ESIAVAAKKGGK  +E DWDS+ KI QEML QKL
Sbjct: 181 SSSLRRVWGKRLMKKRLNETFFLSWRESIAVAAKKGGKEAEELDWDSYQKIKQEMLRQKL 240

Query: 253 KMVEEKEKLKLMRAENAKKRKIQGR---------GAKIKKRKMCSRRRKGGKRKMKEGED 312
           +   EK  LK  RAENAKKRK++ R           K+K+ KMCS+ R G KRK KEGED
Sbjct: 241 QRAAEKANLKETRAENAKKRKVERRIRKEEKGDGNGKLKRMKMCSKGRNGRKRKAKEGED 300

Query: 313 IQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLIKKG 372
           IQR MK+LTAIERS LKQRLK+IRKKI+IN  VAA+GS+ASV+P+ T+WEK+DLDLIKKG
Sbjct: 301 IQREMKKLTAIERSRLKQRLKRIRKKISINGAVAARGSIASVIPQNTSWEKLDLDLIKKG 360

Query: 373 KLREEVSLADQIQFAKNRKAESIACKILVASTLSYECT 401
           ++R+ VSLA+QIQ AK+RKAESIACK+L+AST +Y+CT
Sbjct: 361 QMRKGVSLAEQIQVAKSRKAESIACKVLLASTSTYQCT 396

BLAST of Cp4.1LG01g01610 vs. TAIR 10
Match: AT1G53250.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53800.1); Has 11909 Blast hits to 7704 proteins in 757 species: Archae - 51; Bacteria - 1338; Metazoa - 4550; Fungi - 987; Plants - 464; Viruses - 24; Other Eukaryotes - 4495 (source: NCBI BLink). )

HSP 1 Score: 199.9 bits (507), Expect = 3.9e-51
Identity = 136/308 (44.16%), Positives = 195/308 (63.31%), Query Frame = 0

Query: 85  GRISNSGECYKPKCSSVSFEKQVSSRNIGDDDCPENHETENDKEWQRRRKIGVANKGKVP 144
           G + N  E ++ + +S   E +  ++    D   ++      KE +RRRKIG+ANKGKVP
Sbjct: 63  GSLYNVFEIHRKEVNSSLLEVKAMNK----DTEADSDSDRKIKEEERRRKIGLANKGKVP 122

Query: 145 WNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQVKTKISSSLRRVWGKRL 204
           WNKG+KHS +TR+RIKQRTIEAL NPKVR+KMS++ +P HS++ K KI +S+++VW +R 
Sbjct: 123 WNKGRKHSEDTRRRIKQRTIEALTNPKVRKKMSDHQQP-HSNETKEKIRASVKQVWAERS 182

Query: 205 LKKRLNEAFFQSWKESIAVAAKKGGKGEQEHDWDSHDKIIQEMLHQKLKMVEE----KEK 264
             KRL E F  SW E+IA AA+KGG GE E DWDS++KI Q+   ++L++ EE    KE+
Sbjct: 183 RSKRLKEKFMSSWSENIAEAARKGGSGEAELDWDSYEKIKQDFSSEQLQLAEEKARAKEQ 242

Query: 265 LKLMRAENAKKRKIQGRGAKIKKRKMCSRRRKGGK-RKMKEGEDIQRTMKELTAIERSGL 324
            K++  E AK R  + R A  KK++   + R+ GK RK K+  +        T   RS L
Sbjct: 243 TKMIAKEAAKARTEKMRRAAEKKKEREEKDRREGKIRKPKQERE------NPTIASRSKL 302

Query: 325 KQRLKKI-RKKIAINSVVAAQGSVASVVPRGTTWEKMDLDLIKKGKLREEVSLADQIQFA 384
           K+RL KI +KK ++  +      V SV  +    EK+DLDLI+K + R ++SLADQIQ A
Sbjct: 303 KKRLTKIHKKKTSLGKIAIGTDRVVSVAAK---LEKLDLDLIRKERTRGDISLADQIQAA 356

Query: 385 KNRKAESI 387
           KN++   +
Sbjct: 363 KNQRGSDV 356

BLAST of Cp4.1LG01g01610 vs. TAIR 10
Match: AT1G53800.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53250.1); Has 1136 Blast hits to 882 proteins in 242 species: Archae - 2; Bacteria - 216; Metazoa - 257; Fungi - 77; Plants - 87; Viruses - 4; Other Eukaryotes - 493 (source: NCBI BLink). )

HSP 1 Score: 89.4 bits (220), Expect = 7.4e-18
Identity = 55/166 (33.13%), Positives = 96/166 (57.83%), Query Frame = 0

Query: 97  KCSSVSFEKQVSSRNIGDDDCPENHETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETR 156
           + SS+S     SS    DD      E  +D+E  RR +I  AN+G  PWNKG+KHS ET 
Sbjct: 78  RSSSLSSASSKSSNGSADD----GEEQVDDREKLRRMRISKANRGNTPWNKGRKHSPETL 137

Query: 157 KRIKQRTIEALKNPKVRRKMSEYPRPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQS 216
           ++I++RT  A+++PK++ K++       + + + KI   +R  W +R  ++++ E     
Sbjct: 138 QKIRERTKIAMQDPKIKMKLANLGH-AQNKETRMKIGEGVRMRWARRKERRKVQETCHFE 197

Query: 217 WKESIAVAAKKGGKGEQEHDWDSHDKIIQEMLHQKLKMVEEKEKLK 263
           W+  +A AAK+G   E+E  WDS++ + Q+   + L+ VE+++ +K
Sbjct: 198 WQNLLAEAAKQGYTDEEELQWDSYNILDQQNQLEWLESVEQRKAIK 238

BLAST of Cp4.1LG01g01610 vs. TAIR 10
Match: AT1G53800.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53250.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 89.4 bits (220), Expect = 7.4e-18
Identity = 55/166 (33.13%), Positives = 96/166 (57.83%), Query Frame = 0

Query: 97  KCSSVSFEKQVSSRNIGDDDCPENHETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETR 156
           + SS+S     SS    DD      E  +D+E  RR +I  AN+G  PWNKG+KHS ET 
Sbjct: 82  RSSSLSSASSKSSNGSADD----GEEQVDDREKLRRMRISKANRGNTPWNKGRKHSPETL 141

Query: 157 KRIKQRTIEALKNPKVRRKMSEYPRPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFQS 216
           ++I++RT  A+++PK++ K++       + + + KI   +R  W +R  ++++ E     
Sbjct: 142 QKIRERTKIAMQDPKIKMKLANLGH-AQNKETRMKIGEGVRMRWARRKERRKVQETCHFE 201

Query: 217 WKESIAVAAKKGGKGEQEHDWDSHDKIIQEMLHQKLKMVEEKEKLK 263
           W+  +A AAK+G   E+E  WDS++ + Q+   + L+ VE+++ +K
Sbjct: 202 WQNLLAEAAKQGYTDEEELQWDSYNILDQQNQLEWLESVEQRKAIK 242

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_023514976.1 | 1.29e-281 | 100.00 | uncharacterized protein LOC111779131 [Cucurbita pepo subsp. pepo]
KAG7031061.1 | 9.57e-267 | 96.53 | hypothetical protein SDJN02_05100 [Cucurbita argyrosperma subsp. argyrosperma]
KAG6600400.1 | 4.55e-266 | 96.53 | hypothetical protein SDJN03_05633, partial [Cucurbita argyrosperma subsp. sorori...
XP_022981197.1 | 2.17e-254 | 95.67 | uncharacterized protein LOC111480410 [Cucurbita maxima] >XP_022981210.1 uncharac...
XP_022941889.1 | 2.56e-249 | 95.55 | uncharacterized protein LOC111447116 [Cucurbita moschata]

Match Name | E-value | Identity | Description
A0A6J1ITB7 | 1.05e-254 | 95.67 | uncharacterized protein LOC111480410 OS=Cucurbita maxima OX=3661 GN=LOC111480410...
A0A6J1FPR6 | 1.24e-249 | 95.55 | uncharacterized protein LOC111447116 OS=Cucurbita moschata OX=3662 GN=LOC1114471...
A0A1S3BS78 | 6.23e-189 | 72.02 | uncharacterized protein LOC103492929 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0L0E1 | 5.80e-184 | 70.73 | IENR2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G083650 PE=4 ...
A0A6J1CMC2 | 2.07e-170 | 68.09 | uncharacterized protein LOC111012783 isoform X1 OS=Momordica charantia OX=3673 G...

Match Name | E-value | Identity | Description
AT1G53250.1 | 3.9e-51 | 44.16 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G53800.1 | 7.4e-18 | 33.13 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G53800.2 | 7.4e-18 | 33.13 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003611 | Nuclease associated modular domain 3 | PFAM | PF07460 | NUMOD3 | coord: 132..159; e-value: 5.9E-9; score: 35.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 109..128
None | No IPR available | PANTHER | PTHR34199:SF1 | HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-79 SPECIFIC-LIKE PROTEIN | coord: 13..394
None | No IPR available | PANTHER | PTHR34199 | NUMOD3 MOTIF FAMILY PROTEIN, EXPRESSED | coord: 13..394
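
The `coord` values in the InterPro table are residue ranges on the protein. A minimal Python sketch for pulling out the residues of the PF07460 (NUMOD3) hit at 132..159 follows. It assumes the coordinates are 1-based and inclusive, `slice_residues` is a hypothetical helper, and the input is residues 121 to 180 of the query as printed in the BLAST alignments rather than the full protein.

```python
def slice_residues(segment, segment_start, start, end):
    """Return residues start..end (1-based, inclusive) from a segment whose
    first residue sits at position segment_start of the full protein."""
    if start < segment_start or end >= segment_start + len(segment) or start > end:
        raise ValueError("requested range lies outside the segment")
    return segment[start - segment_start:end - segment_start + 1]

# Query residues 121..180, copied from the alignment blocks on this page.
SEGMENT_121_180 = "HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP"

numod3 = slice_residues(SEGMENT_121_180, 121, 132, 159)
print(numod3, len(numod3))
```

With those assumptions the hit covers the 28 residues RRKIGVANKGKVPWNKGKKHSLETRKRI.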

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG01g01610.1 | Cp4.1LG01g01610.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003677 | DNA binding