Cp4.1LG11g02700 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG11g02700
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: DNA ligase 1 isoform X1
Location: Cp4.1LG11: 1441177 .. 1443374 (-)
RNA-Seq Expression: Cp4.1LG11g02700
Synteny: Cp4.1LG11g02700
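
The Location field packs a sequence ID, a start, an end, and a strand into one string. A minimal sketch of splitting it apart follows. It assumes the coordinates are 1-based and inclusive (the common GFF convention; the page does not state this), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG11: 1441177 .. 1443374 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
gene_length = end - start + 1
print(seqid, strand, gene_length)
```

Under that assumption the gene spans 2198 bases on the minus strand of Cp4.1LG11.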
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGCCGCGTTAGAGTCATTTAGGTTTTCCAATTTGCCGCTCCAAAGAAGTCTTCTTCATTCTCCGCTAGGGTTTTCATCGCTATACCGTCTCTTCTGTTTTTCTCAAATCTGAACTGCGATTCTCCTTTTTCTTTCAAATCTTCGCCGAACCCTCCCCATAATAGACACAGATGTCTCGTTGTTTTCCTTACCCACCTCCTGGTTACGTGAGGAAGGTGGCTAGGACCGAGGCGGCCTTGATCGAATCGATTAAGGTTTGTTTGAGCTATTTGGTTGGTTTTGATAATGTATGCTGAATTCAATTGTGTTCTGATTCCAATTGATTGCGAAATCTTCTTGGAATTGTAACTATTTGGATAGGGGTATATCTATAATCAGGAACGGCTTTTTTTTTCCTCTAATTTTTGCGACGATGGGGGAAGAATGGATTTGGGTTCTCTTGTTACGATGTGCGAGATTTTTTGTACCCTAATAATTGGATTCTGATTTTGGATTATTTGGAGTTCAGAGTGTGTAGATGGTGGTTTGGGGTTCTGATAGTTGCATGTCGTACAACTCGTTCTTAGATCTTGTATGGATGTTCGGCCTTTGGATCTCGTCATGGTTTTCTAAGTGCCCTTAAACTCCGTTCCTTTTGCTTTCAAATGGATTGTTCTCTACAATTTTAAAAGGAAGATATTTAGGGTTTCCCGTAAGCTAATTAACATGAACCCTTATTTGGGAATTTTTCGCACAGACCTTCTTTGACCCGATATCCTTAGCCATTTTAATACACGATTTGAATGTACGTATTTGCTACTTCGAATTTATGATTATCTGTGTCCAGATGAAATACTCCTTTACTACAGGAATATCTACCCTTTTGAAGGGTTGCATTTTATTAGTAAGGCATGGTTGACATGTTTGGTGCTTGCGTCGTCTACTCTGTTTTAAATGGTTATTCTGGTGCGTTATAGCTCCAATCTGAAAGACGACAGCACAAGACTGATAGCAAGAAAGAGAAAAGTAAGCACAAGAAAGAAAAGAGCAAGGACAGGAAACACAAAAGCAAAGAACGTAAGGAAAGTAAGGAAAAATCTTCTCGTAGCCGCGACTTGAATGATCAGAAACAGAAGGCATGCGTAAAAGAGGTCAAGGATCGTCTAGAGGGAACCAAGGTTGAAGCAGAACAATTAGAAAAGAGTGGTCTCACTGAAGAGCATGGACAACCAGTATGGCCTCACAGTCCTGGCTACTTGTCTGATGGAACTCAGATCAACCAAAAGAGGAAAAGGGATGATTCATTACAGCCTGATGAAGGTTGTAAACCTGGTGTGTTCTCTTTATAGATAATAGTCTCACTTATGAAATAAATTCTCTTGGCCACATGAACAAACGCTAATATGTTATTGATTTCAAACAGGAAAAGTAATTCGGATCAAACTGGCTTCTTCACTAAGCCAGCAAGAGAATTCATCAGCTGGCAGTGAACAGATGTGTTCTGTATCTGGTCGTGATTGTTCTCGTGATCAAAAGAGTGATGAAAACAGCTCAGTTCGGCGATCAACTTGCTTTGCTAATTCTGAAACGGCCCTTGCTGTCAAGGATTGCACATCTTCTAAACCTAAGATCAAAGACCCTCCTCCGCATGCTGTCAAGGATCGCACTTCTTCTAAACCTAAGATCAAAGACCCTTCTCCGCATGCTGTCAAGGAAATTAGCTCACTAGGTAATGTTATGTCATTACCACGCACCAGAAGCCCTGTCGAATCTGCTTATGAGGCCTTGTTTGAGAAGTGGGTACCACCTCCACTTCAGTTGGAGCAACAAATGGATGATGAAGAATGGCTCTTCCGAACCGAAAAGCAAGATGGACGAAGTACAAAGACCAATGAAGCCTTCAGTTCTGTCCCCAGCTGTAGAAGTTCCAGTCTGTGGCCGAGAGGACAATATCTTGCCGATGCCGATGTTTATTCATTGCCTTACACGATCCCATATTGATTTTGAATTCTTTACTCGCA
GAGATGTGTACAGCCAGTAGGAACAATAGTCTTGGCGTTATAATTGTTTTAGAATCAAATATAATTGACATTCAATTTTGTTCTCTCCTGTGAGACAATGAACAGGCATGCTTTGGGGATGGTGATGCAAGTGTAAATTCTTAGTGTTGATTCTATATATAGCAGCTTTTGCCTTAAAGATTTTGTTCATATCCATTT

mRNA sequence

AGCCGCGTTAGAGTCATTTAGGTTTTCCAATTTGCCGCTCCAAAGAAGTCTTCTTCATTCTCCGCTAGGGTTTTCATCGCTATACCGTCTCTTCTGTTTTTCTCAAATCTGAACTGCGATTCTCCTTTTTCTTTCAAATCTTCGCCGAACCCTCCCCATAATAGACACAGATGTCTCGTTGTTTTCCTTACCCACCTCCTGGTTACGTGAGGAAGGTGGCTAGGACCGAGGCGGCCTTGATCGAATCGATTAAGCTCCAATCTGAAAGACGACAGCACAAGACTGATAGCAAGAAAGAGAAAAGTAAGCACAAGAAAGAAAAGAGCAAGGACAGGAAACACAAAAGCAAAGAACGTAAGGAAAGTAAGGAAAAATCTTCTCGTAGCCGCGACTTGAATGATCAGAAACAGAAGGCATGCGTAAAAGAGGTCAAGGATCGTCTAGAGGGAACCAAGGTTGAAGCAGAACAATTAGAAAAGAGTGGTCTCACTGAAGAGCATGGACAACCAGTATGGCCTCACAGTCCTGGCTACTTGTCTGATGGAACTCAGATCAACCAAAAGAGGAAAAGGGATGATTCATTACAGCCTGATGAAGGTTGTAAACCTGGAAAAGTAATTCGGATCAAACTGGCTTCTTCACTAAGCCAGCAAGAGAATTCATCAGCTGGCAGTGAACAGATGTGTTCTGTATCTGGTCGTGATTGTTCTCGTGATCAAAAGAGTGATGAAAACAGCTCAGTTCGGCGATCAACTTGCTTTGCTAATTCTGAAACGGCCCTTGCTGTCAAGGATTGCACATCTTCTAAACCTAAGATCAAAGACCCTCCTCCGCATGCTGTCAAGGATCGCACTTCTTCTAAACCTAAGATCAAAGACCCTTCTCCGCATGCTGTCAAGGAAATTAGCTCACTAGGTAATGTTATGTCATTACCACGCACCAGAAGCCCTGTCGAATCTGCTTATGAGGCCTTGTTTGAGAAGTGGGTACCACCTCCACTTCAGTTGGAGCAACAAATGGATGATGAAGAATGGCTCTTCCGAACCGAAAAGCAAGATGGACGAAGTACAAAGACCAATGAAGCCTTCAGTTCTGTCCCCAGCTGTAGAAGTTCCAGTCTGTGGCCGAGAGGACAATATCTTGCCGATGCCGATGTTTATTCATTGCCTTACACGATCCCATATTGATTTTGAATTCTTTACTCGCAGAGATGTGTACAGCCAGTAGGAACAATAGTCTTGGCGTTATAATTGTTTTAGAATCAAATATAATTGACATTCAATTTTGTTCTCTCCTGTGAGACAATGAACAGGCATGCTTTGGGGATGGTGATGCAAGTGTAAATTCTTAGTGTTGATTCTATATATAGCAGCTTTTGCCTTAAAGATTTTGTTCATATCCATTT

Coding sequence (CDS)

ATGTCTCGTTGTTTTCCTTACCCACCTCCTGGTTACGTGAGGAAGGTGGCTAGGACCGAGGCGGCCTTGATCGAATCGATTAAGCTCCAATCTGAAAGACGACAGCACAAGACTGATAGCAAGAAAGAGAAAAGTAAGCACAAGAAAGAAAAGAGCAAGGACAGGAAACACAAAAGCAAAGAACGTAAGGAAAGTAAGGAAAAATCTTCTCGTAGCCGCGACTTGAATGATCAGAAACAGAAGGCATGCGTAAAAGAGGTCAAGGATCGTCTAGAGGGAACCAAGGTTGAAGCAGAACAATTAGAAAAGAGTGGTCTCACTGAAGAGCATGGACAACCAGTATGGCCTCACAGTCCTGGCTACTTGTCTGATGGAACTCAGATCAACCAAAAGAGGAAAAGGGATGATTCATTACAGCCTGATGAAGGTTGTAAACCTGGAAAAGTAATTCGGATCAAACTGGCTTCTTCACTAAGCCAGCAAGAGAATTCATCAGCTGGCAGTGAACAGATGTGTTCTGTATCTGGTCGTGATTGTTCTCGTGATCAAAAGAGTGATGAAAACAGCTCAGTTCGGCGATCAACTTGCTTTGCTAATTCTGAAACGGCCCTTGCTGTCAAGGATTGCACATCTTCTAAACCTAAGATCAAAGACCCTCCTCCGCATGCTGTCAAGGATCGCACTTCTTCTAAACCTAAGATCAAAGACCCTTCTCCGCATGCTGTCAAGGAAATTAGCTCACTAGGTAATGTTATGTCATTACCACGCACCAGAAGCCCTGTCGAATCTGCTTATGAGGCCTTGTTTGAGAAGTGGGTACCACCTCCACTTCAGTTGGAGCAACAAATGGATGATGAAGAATGGCTCTTCCGAACCGAAAAGCAAGATGGACGAAGTACAAAGACCAATGAAGCCTTCAGTTCTGTCCCCAGCTGTAGAAGTTCCAGTCTGTGGCCGAGAGGACAATATCTTGCCGATGCCGATGTTTATTCATTGCCTTACACGATCCCATATTGA

Protein sequence

MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSKERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPGYLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCSRDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSPHAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRSTKTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY
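
The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates this on the first 36 and last 15 bases of the CDS, assuming the standard genetic code; `translate` is a hypothetical helper written for this example and is not the pipeline that produced the annotation.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: standard code applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTCTCGTTGTTTTCCTTACCCACCTCCTGGTTAC"  # first 36 bases of the CDS
cds_end = "ACGATCCCATATTGA"                        # last 15 bases, including the stop codon
print(translate(cds_start))  # matches the protein's first 12 residues
print(translate(cds_end))    # matches the protein's last 4 residues
```

A 338-residue protein plus one stop codon implies a CDS of 3 × 339 = 1017 bases.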
Homology
BLAST of Cp4.1LG11g02700 vs. NCBI nr
Match: XP_023545923.1 (uncharacterized protein LOC111805212 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 643 bits (1658), Expect = 4.99e-232
Identity = 338/338 (100.00%), Positives = 338/338 (100.00%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS
Sbjct: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180

Query: 181 RDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSPH 240
           RDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSPH
Sbjct: 181 RDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSPH 240

Query: 241 AVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRST 300
           AVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRST
Sbjct: 241 AVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRST 300

Query: 301 KTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           KTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY
Sbjct: 301 KTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
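
Each HSP summary line reports identity as a ratio of matching positions to alignment length, followed by the percentage. A minimal sketch of recomputing those percentages from the ratios is shown below; `parse_hsp_stats` is a hypothetical helper, and the input strings reuse ratios reported on this page.

```python
import re

def parse_hsp_stats(line):
    """Return [(label, matches, aligned_length, percent), ...] from an HSP summary line."""
    stats = []
    # Only 'label = N/M' pairs are captured; 'Query Frame = 0' has no ratio and is skipped.
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        stats.append((label, matches, length, round(100.0 * matches / length, 2)))
    return stats

print(parse_hsp_stats("Identity = 338/338 (100.00%), Query Frame = 0"))
print(parse_hsp_stats("Identity = 334/338 (98.82%), Query Frame = 0"))
```

The denominator is the alignment length, which can exceed the 338-residue query when the alignment contains gaps (for example 339 or 343 in some hits on this page).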

BLAST of Cp4.1LG11g02700 vs. NCBI nr
Match: XP_023545924.1 (serine/threonine-protein kinase PRP4 homolog isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 627 bits (1616), Expect = 1.08e-225
Identity = 334/338 (98.82%), Positives = 334/338 (98.82%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQINQKRKRDDSLQPDEG    KVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS
Sbjct: 121 YLSDGTQINQKRKRDDSLQPDEG----KVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180

Query: 181 RDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSPH 240
           RDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSPH
Sbjct: 181 RDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSPH 240

Query: 241 AVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRST 300
           AVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRST
Sbjct: 241 AVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRST 300

Query: 301 KTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           KTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY
Sbjct: 301 KTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 334

BLAST of Cp4.1LG11g02700 vs. NCBI nr
Match: XP_022997629.1 (uncharacterized protein LOC111492505 isoform X1 [Cucurbita maxima])

HSP 1 Score: 608 bits (1569), Expect = 1.89e-218
Identity = 324/339 (95.58%), Positives = 326/339 (96.17%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKS 
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSN 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKESKEKSSRSRDLNDQK K CVKE KDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKESKEKSSRSRDLNDQKHKVCVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQIN KRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAG E  CSVSGRD S
Sbjct: 121 YLSDGTQINHKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGCELTCSVSGRDIS 180

Query: 181 RDQKSDENSSV-RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240
           RDQKSDENSSV RRSTCFANSETA AVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP
Sbjct: 181 RDQKSDENSSVVRRSTCFANSETARAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240

Query: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRS 300
           HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLF TEKQDGRS
Sbjct: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQDGRS 300

Query: 301 TKTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           TKTNEAFSS+PSCR+SSLWPRGQYLA ADVYSLPYTIPY
Sbjct: 301 TKTNEAFSSIPSCRNSSLWPRGQYLAVADVYSLPYTIPY 339

BLAST of Cp4.1LG11g02700 vs. NCBI nr
Match: KAG7029449.1 (hypothetical protein SDJN02_07788 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 603 bits (1556), Expect = 1.62e-216
Identity = 325/339 (95.87%), Positives = 327/339 (96.46%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKE KEKSSRS  LNDQKQKACVKE KDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKERKEKSSRS--LNDQKQKACVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQIN KRKRD SLQPDEGCKPGKVIRIKLASSLSQQENSSA SEQ CSVSG DCS
Sbjct: 121 YLSDGTQINHKRKRD-SLQPDEGCKPGKVIRIKLASSLSQQENSSADSEQTCSVSGCDCS 180

Query: 181 RDQKSDENSSV-RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240
           RDQK DENSSV RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDP P
Sbjct: 181 RDQKRDENSSVVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPPP 240

Query: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRS 300
           HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLF+TEKQDGRS
Sbjct: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFQTEKQDGRS 300

Query: 301 TKTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           TKTNEAFSS+PSCRSSSLWPRGQYLADADVYSLPYTIPY
Sbjct: 301 TKTNEAFSSIPSCRSSSLWPRGQYLADADVYSLPYTIPY 336

BLAST of Cp4.1LG11g02700 vs. NCBI nr
Match: XP_022997630.1 (uncharacterized protein LOC111492505 isoform X2 [Cucurbita maxima])

HSP 1 Score: 592 bits (1527), Expect = 4.10e-212
Identity = 320/339 (94.40%), Positives = 322/339 (94.99%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKS 
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSN 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKESKEKSSRSRDLNDQK K CVKE KDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKESKEKSSRSRDLNDQKHKVCVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQIN KRKRDDSLQPDEG    KVIRIKLASSLSQQENSSAG E  CSVSGRD S
Sbjct: 121 YLSDGTQINHKRKRDDSLQPDEG----KVIRIKLASSLSQQENSSAGCELTCSVSGRDIS 180

Query: 181 RDQKSDENSSV-RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240
           RDQKSDENSSV RRSTCFANSETA AVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP
Sbjct: 181 RDQKSDENSSVVRRSTCFANSETARAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240

Query: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRS 300
           HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLF TEKQDGRS
Sbjct: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQDGRS 300

Query: 301 TKTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           TKTNEAFSS+PSCR+SSLWPRGQYLA ADVYSLPYTIPY
Sbjct: 301 TKTNEAFSSIPSCRNSSLWPRGQYLAVADVYSLPYTIPY 335

BLAST of Cp4.1LG11g02700 vs. ExPASy TrEMBL
Match: A0A6J1KC15 (uncharacterized protein LOC111492505 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492505 PE=4 SV=1)

HSP 1 Score: 608 bits (1569), Expect = 9.17e-219
Identity = 324/339 (95.58%), Positives = 326/339 (96.17%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKS 
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSN 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKESKEKSSRSRDLNDQK K CVKE KDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKESKEKSSRSRDLNDQKHKVCVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQIN KRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAG E  CSVSGRD S
Sbjct: 121 YLSDGTQINHKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGCELTCSVSGRDIS 180

Query: 181 RDQKSDENSSV-RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240
           RDQKSDENSSV RRSTCFANSETA AVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP
Sbjct: 181 RDQKSDENSSVVRRSTCFANSETARAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240

Query: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRS 300
           HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLF TEKQDGRS
Sbjct: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQDGRS 300

Query: 301 TKTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           TKTNEAFSS+PSCR+SSLWPRGQYLA ADVYSLPYTIPY
Sbjct: 301 TKTNEAFSSIPSCRNSSLWPRGQYLAVADVYSLPYTIPY 339

BLAST of Cp4.1LG11g02700 vs. ExPASy TrEMBL
Match: A0A6J1KAB9 (uncharacterized protein LOC111492505 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111492505 PE=4 SV=1)

HSP 1 Score: 592 bits (1527), Expect = 1.98e-212
Identity = 320/339 (94.40%), Positives = 322/339 (94.99%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKS 
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSN 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKESKEKSSRSRDLNDQK K CVKE KDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKESKEKSSRSRDLNDQKHKVCVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQIN KRKRDDSLQPDEG    KVIRIKLASSLSQQENSSAG E  CSVSGRD S
Sbjct: 121 YLSDGTQINHKRKRDDSLQPDEG----KVIRIKLASSLSQQENSSAGCELTCSVSGRDIS 180

Query: 181 RDQKSDENSSV-RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240
           RDQKSDENSSV RRSTCFANSETA AVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP
Sbjct: 181 RDQKSDENSSVVRRSTCFANSETARAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240

Query: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRS 300
           HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLF TEKQDGRS
Sbjct: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQDGRS 300

Query: 301 TKTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           TKTNEAFSS+PSCR+SSLWPRGQYLA ADVYSLPYTIPY
Sbjct: 301 TKTNEAFSSIPSCRNSSLWPRGQYLAVADVYSLPYTIPY 335

BLAST of Cp4.1LG11g02700 vs. ExPASy TrEMBL
Match: A0A6J1HBK0 (DNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111462522 PE=4 SV=1)

HSP 1 Score: 553 bits (1424), Expect = 4.49e-197
Identity = 306/339 (90.27%), Positives = 308/339 (90.86%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKE KEKSSRS  LNDQKQKACVKE KDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKERKEKSSRS--LNDQKQKACVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQIN KRKRD SLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQ CSVSGRDCS
Sbjct: 121 YLSDGTQINHKRKRD-SLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQTCSVSGRDCS 180

Query: 181 RDQKSDENSSV-RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240
           R    DENSSV RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVK              
Sbjct: 181 R----DENSSVVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVK-------------- 240

Query: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRS 300
               EISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLF TEKQDGRS
Sbjct: 241 ----EISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQDGRS 300

Query: 301 TKTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           +KTNEAFSS+PSCRSSSLWPRGQYLADADVYSLPYTIPY
Sbjct: 301 SKTNEAFSSIPSCRSSSLWPRGQYLADADVYSLPYTIPY 314

BLAST of Cp4.1LG11g02700 vs. ExPASy TrEMBL
Match: A0A6J1HD43 (uncharacterized protein LOC111462522 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111462522 PE=4 SV=1)

HSP 1 Score: 536 bits (1382), Expect = 9.68e-191
Identity = 302/339 (89.09%), Positives = 304/339 (89.68%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           ERKE KEKSSRS  LNDQKQKACVKE KDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG
Sbjct: 61  ERKERKEKSSRS--LNDQKQKACVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQIN KRKRD SLQPDEG    KVIRIKLASSLSQQENSSAGSEQ CSVSGRDCS
Sbjct: 121 YLSDGTQINHKRKRD-SLQPDEG----KVIRIKLASSLSQQENSSAGSEQTCSVSGRDCS 180

Query: 181 RDQKSDENSSV-RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPSP 240
           R    DENSSV RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVK              
Sbjct: 181 R----DENSSVVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVK-------------- 240

Query: 241 HAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDGRS 300
               EISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLF TEKQDGRS
Sbjct: 241 ----EISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQDGRS 300

Query: 301 TKTNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           +KTNEAFSS+PSCRSSSLWPRGQYLADADVYSLPYTIPY
Sbjct: 301 SKTNEAFSSIPSCRSSSLWPRGQYLADADVYSLPYTIPY 310

BLAST of Cp4.1LG11g02700 vs. ExPASy TrEMBL
Match: A0A6J1CT76 (chromatin assembly factor 1 subunit A-like OS=Momordica charantia OX=3673 GN=LOC111013996 PE=4 SV=1)

HSP 1 Score: 415 bits (1067), Expect = 1.46e-142
Identity = 238/343 (69.39%), Positives = 265/343 (77.26%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSRCFPYPPPGY  KVARTEAALIESIKLQSER+Q K D KKEKSKH+KE+S+  K K +
Sbjct: 1   MSRCFPYPPPGYAGKVARTEAALIESIKLQSERQQSKHDRKKEKSKHRKERSEKSKEKKQ 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
            RKE KEKSS S DLNDQKQK C K+ +DRL+GTKVEAEQLEKSGLTEEHGQPVWP SPG
Sbjct: 61  RRKERKEKSSCSCDLNDQKQKECAKQAEDRLKGTKVEAEQLEKSGLTEEHGQPVWPQSPG 120

Query: 121 YLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDCS 180
           YLSDGTQIN KRKRD  LQP+E  KPGK+IRIKLASSLS QE+SSA ++Q CS SGR   
Sbjct: 121 YLSDGTQINHKRKRDAKLQPNEDSKPGKIIRIKLASSLSNQEDSSADTQQTCSTSGRYDC 180

Query: 181 RDQKSDENSSV--RRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDPS 240
            DQK DENS    ++  CF NS T +AV++          PP   +KD + S        
Sbjct: 181 VDQKRDENSCGPNQQKPCFTNSNTVVAVEEA---------PPKPRIKDHSRSV------- 240

Query: 241 PHAVKEISSLGNVMSLP-RTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQDG 300
            HAVK+I   GNV+  P RTRSP ES YEALFEKW+PPPLQLEQQMDDEEWLF T KQDG
Sbjct: 241 -HAVKDIRPQGNVVPFPTRTRSPAESEYEALFEKWIPPPLQLEQQMDDEEWLFGTRKQDG 300

Query: 301 RSTK--TNEAFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 338
           ++TK  TN+AFS VPSCRSSSLWPRGQYL DADVYSLPYTIP+
Sbjct: 301 QTTKATTNKAFSPVPSCRSSSLWPRGQYLPDADVYSLPYTIPF 326

BLAST of Cp4.1LG11g02700 vs. TAIR 10
Match: AT1G20100.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75860.1); Has 471 Blast hits to 438 proteins in 92 species: Archae - 0; Bacteria - 14; Metazoa - 217; Fungi - 43; Plants - 91; Viruses - 1; Other Eukaryotes - 105 (source: NCBI BLink). )

HSP 1 Score: 75.9 bits (185), Expect = 7.1e-14
Identity = 105/350 (30.00%), Positives = 159/350 (45.43%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSR F  PPP Y R  A  +  L+E  K++    +   DSKK   K KKEK K++K K K
Sbjct: 1   MSRYFTSPPPVYARNWANGQ-NLVEWTKIE----RPIVDSKKLHRKEKKEKKKEKKLK-K 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           E+K  ++K S ++ ++                    E+EQLEKS LTEE  QP      G
Sbjct: 61  EKKSLEQKYSTTKTVS-------------------YESEQLEKSCLTEEFEQP----QVG 120

Query: 121 YLSDGTQINQKRKRDDS---LQPDEGCKP--GKVIRIKLASSLSQQENSSAGSEQMCSVS 180
           YLSDG+Q ++KR+R+ S   ++      P  GK +RI++     ++  +    + +CS S
Sbjct: 121 YLSDGSQNSKKRRRETSPAVVESQIKATPVAGKPLRIRIVFKKPKEAEAVPQEDPVCSTS 180

Query: 181 GRDCSRDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIK 240
           G      Q+  E  S        + + A+      S K  I            S K K  
Sbjct: 181 G-----TQRPSELPSSVSLPSICDHDVAVPSTSLESGKVAIIS---------ESKKRKKH 240

Query: 241 DPSPHAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQ-QMDDEEWLFRTEK 300
            PS                       ES Y +LF++ VPP + LE+     ++WLF T +
Sbjct: 241 KPSK----------------------ESRYNSLFDELVPPCISLEEDDSSSDDWLFGTSR 285

Query: 301 QDGRST-----KTNE-AFSSVPSCRSSSLWPRGQYLADADVYSLPYTIPY 339
           ++  S+     KT+E    S+ + R  S  PR   L++  ++SLPYT+P+
Sbjct: 301 KENVSSAKSSYKTDEDTIMSLQTSRDCSSLPRAMLLSEVGIFSLPYTVPF 285

BLAST of Cp4.1LG11g02700 vs. TAIR 10
Match: AT1G75860.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G20100.1); Has 258 Blast hits to 235 proteins in 58 species: Archae - 0; Bacteria - 4; Metazoa - 59; Fungi - 16; Plants - 90; Viruses - 0; Other Eukaryotes - 89 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 5.6e-11
Identity = 96/348 (27.59%), Positives = 150/348 (43.10%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSR    PP  + R     +  L+ES KL    ++   DSKK     KKEK + RK K +
Sbjct: 1   MSRVLTCPPLVFARNHVGVQ-NLVESTKL----KRITLDSKKAHRIEKKEKKEKRKEKKE 60

Query: 61  ERKESKEKSS-RSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSP 120
            ++E   K S ++ D + +      K+V D       E++ LEKSGLT+E  +P      
Sbjct: 61  TKREKSHKHSIKATDNHHKLIFLPSKKVSD-------ESDSLEKSGLTDELEEP--QKHL 120

Query: 121 GYLSDGTQINQKRKRDDSLQPDEGC-----KPGKVIRIKLASSLSQQENSSAGSEQMCSV 180
           GYLSDG+Q ++KR RDDS    E         GK +RI++     ++E  +   E +   
Sbjct: 121 GYLSDGSQNSKKRIRDDSPPAVESLIKAAPVAGKPLRIRMVFKKPKEEVPTLPREAV--- 180

Query: 181 SGRDCSRDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKI 240
                                C      +L+ +D  +S          ++    +S+ + 
Sbjct: 181 --------------------VCSTTVAKSLSHQDVITS----------SISSSKTSELEK 240

Query: 241 KDPSPHAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEE---WLFR 300
             PS      I+++       + RS  E  Y ALF+ W PP + +     ++    WLF 
Sbjct: 241 NLPS----TSIAAIDETKKRKKHRSSKEDQYNALFDGWTPPSMCIADASSNDNGDYWLFG 297

Query: 301 TEKQDGRSTKTNEAFSSVPSCR-SSSLWPRGQYLADADVYSLPYTIPY 339
            + Q+    K           R   S WPR Q+L++  +YSLPYT+P+
Sbjct: 301 NKTQEVLKPKAAVKVDDDTMMRPGDSSWPRAQFLSEVGIYSLPYTVPF 297

BLAST of Cp4.1LG11g02700 vs. TAIR 10
Match: AT1G20100.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75860.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 51.2 bits (121), Expect = 1.9e-06
Identity = 63/181 (34.81%), Positives = 93/181 (51.38%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKEKSKDRKHKSK 60
           MSR F  PPP Y R  A  +  L+E  K++    +   DSKK   K KKEK K++K K K
Sbjct: 1   MSRYFTSPPPVYARNWANGQ-NLVEWTKIE----RPIVDSKKLHRKEKKEKKKEKKLK-K 60

Query: 61  ERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPG 120
           E+K  ++K S ++ ++                    E+EQLEKS LTEE  QP      G
Sbjct: 61  EKKSLEQKYSTTKTVS-------------------YESEQLEKSCLTEEFEQP----QVG 120

Query: 121 YLSDGTQINQKRKRDDS---LQPDEGCKP--GKVIRIKLASSLSQQENSSAGSEQMCSVS 177
           YLSDG+Q ++KR+R+ S   ++      P  GK +RI++     ++  +    + +CS S
Sbjct: 121 YLSDGSQNSKKRRRETSPAVVESQIKATPVAGKPLRIRIVFKKPKEAEAVPQEDPVCSTS 152

BLAST of Cp4.1LG11g02700 vs. TAIR 10
Match: AT4G35940.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G17787.1). )

HSP 1 Score: 49.7 bits (117), Expect = 5.4e-06
Identity = 100/405 (24.69%), Positives = 166/405 (40.99%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESI-------KLQSERRQHKTDSKKEKSKHKKEKSK 60
           MSRCFP+PPPGYV    R EA ++ SI       K +  R+  ++D K +K K ++++ K
Sbjct: 1   MSRCFPFPPPGYVLNGIRDEAVIVSSIKGVEEKAKKEQRRKDRRSDKKDKKDKKERKEKK 60

Query: 61  DRKHKSKERKESKEKSSRSRDLNDQKQKACVK-EVKDRLEGTKVEAEQLEKSGLTEE--- 120
           ++K K ++ +E KE  S  R    ++++   K ++  +L+ ++V    LEKS LT E   
Sbjct: 61  EKKEKKRKEREGKEVGSEKRSHKRRRKEDGAKVDLFHKLKESEVNC--LEKSSLTVEREL 120

Query: 121 -------------HGQPVWPHSPGYLS---------------------DGTQINQKRKRD 180
                        +   + P                            DG   N   KR 
Sbjct: 121 LQSTSQNSCDSTLNSNEMLPKQKEVQQPLDGRHNNNNNEKRVEKQQPLDGRHNNNNEKRV 180

Query: 181 DSLQPDEG-CKPGKVIRIKLASSLSQQENSSAGS--EQMCSVSGRDCSRDQKSDENSSVR 240
           +  QP +G        RI+    L+ + N++     E+   ++GR  + ++K  E     
Sbjct: 181 EKQQPLDGRHNNNNEKRIEKQQPLNGRHNNNNEKLMEKQQPLNGRHNNNNEKRIEKQQPL 240

Query: 241 RSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTSSKPKIKDP---SPHAVKEISSLG 300
                 N E     +     +    D   HA K R   + K KDP     H  ++ISS  
Sbjct: 241 NGR-HNNKEKQKEKQQPLDVRHNNNDSAEHASKPR---EEKRKDPIFRGKHGKEKISS-- 300

Query: 301 NVMSLPRTRSPVES----------AYEALFEKWVPPPLQLEQQM---DDEEWLFRTEKQD 339
              S   T  P +S           +  + E WVP  ++    +   +DEE  +  +K  
Sbjct: 301 --SSTRETYQPPKSLCNCPPSMVLQFLDVVENWVPNTIERRVDLINSEDEECWWSMKKPP 360

BLAST of Cp4.1LG11g02700 vs. TAIR 10
Match: AT4G35940.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G17787.1); Has 45288 Blast hits to 24095 proteins in 1140 species: Archae - 93; Bacteria - 2895; Metazoa - 13424; Fungi - 2873; Plants - 1183; Viruses - 123; Other Eukaryotes - 24697 (source: NCBI BLink). )

HSP 1 Score: 45.8 bits (107), Expect = 7.9e-05
Identity = 41/117 (35.04%), Positives = 68/117 (58.12%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVARTEAALIESI-------KLQSERRQHKTDSKKEKSKHKKEKSK 60
           MSRCFP+PPPGYV    R EA ++ SI       K +  R+  ++D K +K K ++++ K
Sbjct: 1   MSRCFPFPPPGYVLNGIRDEAVIVSSIKGVEEKAKKEQRRKDRRSDKKDKKDKKERKEKK 60

Query: 61  DRKHKSKERKESKEKSSRSRDLNDQKQKACVK-EVKDRLEGTKVEAEQLEKSGLTEE 110
           ++K K ++ +E KE  S  R    ++++   K ++  +L+ ++V    LEKS LT E
Sbjct: 61  EKKEKKRKEREGKEVGSEKRSHKRRRKEDGAKVDLFHKLKESEVNC--LEKSSLTVE 115

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value    Identity (%)  Description
XP_023545923.1  4.99e-232  100.00        uncharacterized protein LOC111805212 isoform X1 [Cucurbita pepo subsp. pepo]
XP_023545924.1  1.08e-225  98.82         serine/threonine-protein kinase PRP4 homolog isoform X2 [Cucurbita pepo subsp. p...
XP_022997629.1  1.89e-218  95.58         uncharacterized protein LOC111492505 isoform X1 [Cucurbita maxima]
KAG7029449.1    1.62e-216  95.87         hypothetical protein SDJN02_07788 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022997630.1  4.10e-212  94.40         uncharacterized protein LOC111492505 isoform X2 [Cucurbita maxima]

vs. ExPASy TrEMBL
Match Name  E-value    Identity (%)  Description
A0A6J1KC15  9.17e-219  95.58         uncharacterized protein LOC111492505 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1KAB9  1.98e-212  94.40         uncharacterized protein LOC111492505 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HBK0  4.49e-197  90.27         DNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111462522 PE=4 SV=1
A0A6J1HD43  9.68e-191  89.09         uncharacterized protein LOC111462522 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1CT76  1.46e-142  69.39         chromatin assembly factor 1 subunit A-like OS=Momordica charantia OX=3673 GN=LOC...

vs. TAIR 10
Match Name   E-value  Identity (%)  Description
AT1G20100.1  7.1e-14  30.00         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G75860.1  5.6e-11  27.59         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G20100.2  1.9e-06  34.81         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G35940.2  5.4e-06  24.69         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G35940.1  7.9e-05  35.04         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description   Alignment
None      No IPR available  COILS        Coil           Coil                 coord: 84..104
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 154..178
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 60..106
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 190..207
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 40..59
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 301..316
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 128..148
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 211..240
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 27..240
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 292..316
None      No IPR available  PANTHER      PTHR34660      MYB-LIKE PROTEIN X   coord: 1..338
None      No IPR available  PANTHER      PTHR34660:SF7  DNA LIGASE           coord: 1..338
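
Several of the mobidb-lite disorder coordinates above overlap or nest inside one another. A minimal sketch of merging them into non-overlapping regions and counting the covered residues follows. It assumes the coordinates are 1-based inclusive residue positions on the 338-residue protein, and `merge_intervals` is a hypothetical helper, not part of InterProScan.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# mobidb-lite coordinates as listed in the InterPro table.
disorder = [(154, 178), (60, 106), (190, 207), (40, 59), (301, 316),
            (128, 148), (211, 240), (27, 240), (292, 316)]

merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered, round(100 * covered / 338, 1))
```

Under these assumptions the nine entries collapse to two regions, 27..240 and 292..316, covering 239 of 338 residues (about 70.7%).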

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG11g02700.1  Cp4.1LG11g02700.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0016874      ligase activity