CSPI04G26530 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI04G26530
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDNA ligase
LocationChr4: 23623572 .. 23624277 (-)
RNA-Seq ExpressionCSPI04G26530
SyntenyCSPI04G26530
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTCTTCATTCCTCTTCTTATTTTGCTCCCAATAATTCTCTATAAATATCTCTTCTATTTCTCTCCCCCATTTCCATCATCACTGCTTAACTCTCACCTCTCATCCATTTTTCATTACAAAACTTCCCATCTTCGTCACCATGTTCAACCACTACAATCCATTTTCCTTCACCCACAAGCTCAAATCCCCAATCTCTTGTTTCCGCCCCGATCACGATCCCGAATTACACGATAACCCTTCTTCTCCAACATCTCCTGCTCTCAAATCAATGACCTCTGACCTCCCGGAGCTCCGTGGAATGTGCCGGAGTCTTGTCGCTCGGATCAGCTGGCCGAGTCGCCGACGCTACCATCACTCCACTGATTTTCGATACGACCCTTCCAGCTATGCCCTTAATTTCGAGGACGAACAACTTCGATATGACGACGAGTTCCCAATTAGGGATTTCACTTCGAGATTGCCGGCGTCACCACCGCCCGTAAATCTCATGCGATTTTGAAACCATGGGGTTGGGGATCTTCAGGGAGTCTCTACCCATTGATTCTAACTGTAATTTGCGATTTGCTAAAGGTGTTATTCGTAGAGCTAAATGTATAGATATCTAAAAAGCTCTCTAATTAATTATTAATTGTATATTCTTTCTATTGTTATTACTAATTTACTGATTATGTTTGCCTCTAATTTCGTTTAATTTCAATCAACCTT

mRNA sequence

CTTCTTCATTCCTCTTCTTATTTTGCTCCCAATAATTCTCTATAAATATCTCTTCTATTTCTCTCCCCCATTTCCATCATCACTGCTTAACTCTCACCTCTCATCCATTTTTCATTACAAAACTTCCCATCTTCGTCACCATGTTCAACCACTACAATCCATTTTCCTTCACCCACAAGCTCAAATCCCCAATCTCTTGTTTCCGCCCCGATCACGATCCCGAATTACACGATAACCCTTCTTCTCCAACATCTCCTGCTCTCAAATCAATGACCTCTGACCTCCCGGAGCTCCGTGGAATGTGCCGGAGTCTTGTCGCTCGGATCAGCTGGCCGAGTCGCCGACGCTACCATCACTCCACTGATTTTCGATACGACCCTTCCAGCTATGCCCTTAATTTCGAGGACGAACAACTTCGATATGACGACGAGTTCCCAATTAGGGATTTCACTTCGAGATTGCCGGCGTCACCACCGCCCGTAAATCTCATGCGATTTTGAAACCATGGGGTTGGGGATCTTCAGGGAGTCTCTACCCATTGATTCTAACTGTAATTTGCGATTTGCTAAAGGTGTTATTCGTAGAGCTAAATGTATAGATATCTAAAAAGCTCTCTAATTAATTATTAATTGTATATTCTTTCTATTGTTATTACTAATTTACTGATTATGTTTGCCTCTAATTTCGTTTAATTTCAATCAACCTT

Coding sequence (CDS)

ATGTTCAACCACTACAATCCATTTTCCTTCACCCACAAGCTCAAATCCCCAATCTCTTGTTTCCGCCCCGATCACGATCCCGAATTACACGATAACCCTTCTTCTCCAACATCTCCTGCTCTCAAATCAATGACCTCTGACCTCCCGGAGCTCCGTGGAATGTGCCGGAGTCTTGTCGCTCGGATCAGCTGGCCGAGTCGCCGACGCTACCATCACTCCACTGATTTTCGATACGACCCTTCCAGCTATGCCCTTAATTTCGAGGACGAACAACTTCGATATGACGACGAGTTCCCAATTAGGGATTTCACTTCGAGATTGCCGGCGTCACCACCGCCCGTAAATCTCATGCGATTTTGA

Protein sequence

MFNHYNPFSFTHKLKSPISCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLVARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPPPVNLMRF*
Homology
BLAST of CSPI04G26530 vs. ExPASy TrEMBL
Match: A0A0A0L4E2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G652740 PE=4 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 1.7e-63
Identity = 118/119 (99.16%), Postives = 118/119 (99.16%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPISCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLVA 60
           MFNHYNPFSFTHKLKSPISCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLVA
Sbjct: 1   MFNHYNPFSFTHKLKSPISCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLVA 60

Query: 61  RISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPPPVNLMRF 120
           RISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPP VNLMRF
Sbjct: 61  RISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPPHVNLMRF 119

BLAST of CSPI04G26530 vs. ExPASy TrEMBL
Match: A0A5A7VBS3 (DNA ligase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G00970 PE=4 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 9.0e-52
Identity = 106/121 (87.60%), Postives = 110/121 (90.91%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPI-SCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLV 60
           MFNHYNPFSFTHKLKS I SCFRPDHD ELH+N     +PALKSMTSDLPELRGMCRSLV
Sbjct: 1   MFNHYNPFSFTHKLKSSIPSCFRPDHDAELHENEDD--NPALKSMTSDLPELRGMCRSLV 60

Query: 61  ARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPAS-PPPVNLMR 120
           ARISWPSRRRYHHS DFRYDPSSYALNFEDEQ+R D+EFPIRDFTSRLPAS PPPVNLMR
Sbjct: 61  ARISWPSRRRYHHSADFRYDPSSYALNFEDEQVRDDNEFPIRDFTSRLPASPPPPVNLMR 119

BLAST of CSPI04G26530 vs. ExPASy TrEMBL
Match: A0A1S3BVQ9 (uncharacterized protein LOC103493778 OS=Cucumis melo OX=3656 GN=LOC103493778 PE=4 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 9.0e-52
Identity = 106/121 (87.60%), Postives = 110/121 (90.91%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPI-SCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLV 60
           MFNHYNPFSFTHKLKS I SCFRPDHD ELH+N     +PALKSMTSDLPELRGMCRSLV
Sbjct: 1   MFNHYNPFSFTHKLKSSIPSCFRPDHDAELHENEDD--NPALKSMTSDLPELRGMCRSLV 60

Query: 61  ARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPAS-PPPVNLMR 120
           ARISWPSRRRYHHS DFRYDPSSYALNFEDEQ+R D+EFPIRDFTSRLPAS PPPVNLMR
Sbjct: 61  ARISWPSRRRYHHSADFRYDPSSYALNFEDEQVRDDNEFPIRDFTSRLPASPPPPVNLMR 119

BLAST of CSPI04G26530 vs. ExPASy TrEMBL
Match: A0A6J1FP70 (uncharacterized protein LOC111446022 OS=Cucurbita moschata OX=3662 GN=LOC111446022 PE=4 SV=1)

HSP 1 Score: 157.1 bits (396), Expect = 4.5e-35
Identity = 83/116 (71.55%), Postives = 91/116 (78.45%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPISCFRPDH----DPELHDNPSSPTSPALKSMTSDLPELRGMCR 60
           M N Y+PFSFT KLKS I CFR DH    + E   +P +P SP LKSM ++ PELRGMCR
Sbjct: 2   MLNDYHPFSFTRKLKSSILCFRRDHGDFLENEGDSSPRTPRSP-LKSMAAEFPELRGMCR 61

Query: 61  SLVARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPP 113
           +LVARI   SRRRYHHS DFRYDPSSYALNFEDEQ+R DDEFPIRDF SRLPASPP
Sbjct: 62  NLVARIG-RSRRRYHHSADFRYDPSSYALNFEDEQVRDDDEFPIRDFASRLPASPP 115

BLAST of CSPI04G26530 vs. ExPASy TrEMBL
Match: A0A6J1J1H3 (uncharacterized protein LOC111480417 OS=Cucurbita maxima OX=3661 GN=LOC111480417 PE=4 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 1.3e-34
Identity = 81/116 (69.83%), Postives = 90/116 (77.59%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPISCFRPDH----DPELHDNPSSPTSPALKSMTSDLPELRGMCR 60
           M N Y+PFSFT KLKS I CFR DH    + E   +P +P SP LKSM ++ PE+RGMCR
Sbjct: 2   MLNEYHPFSFTRKLKSSIRCFRRDHSDFFESEDDSSPRTPRSP-LKSMAAEFPEIRGMCR 61

Query: 61  SLVARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPP 113
           +LVARI    RRRYHHS DFRYDPSSYALNFEDEQ+R DDEFPIRDF SRLPASPP
Sbjct: 62  NLVARIG-RCRRRYHHSADFRYDPSSYALNFEDEQVRDDDEFPIRDFASRLPASPP 115

BLAST of CSPI04G26530 vs. NCBI nr
Match: KGN55462.1 (hypothetical protein Csa_012581 [Cucumis sativus])

HSP 1 Score: 251.5 bits (641), Expect = 3.6e-63
Identity = 118/119 (99.16%), Postives = 118/119 (99.16%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPISCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLVA 60
           MFNHYNPFSFTHKLKSPISCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLVA
Sbjct: 1   MFNHYNPFSFTHKLKSPISCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLVA 60

Query: 61  RISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPPPVNLMRF 120
           RISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPP VNLMRF
Sbjct: 61  RISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPPHVNLMRF 119

BLAST of CSPI04G26530 vs. NCBI nr
Match: XP_008452891.1 (PREDICTED: uncharacterized protein LOC103493778 [Cucumis melo] >KAA0064610.1 DNA ligase [Cucumis melo var. makuwa] >TYK19982.1 DNA ligase [Cucumis melo var. makuwa])

HSP 1 Score: 212.6 bits (540), Expect = 1.8e-51
Identity = 106/121 (87.60%), Postives = 110/121 (90.91%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPI-SCFRPDHDPELHDNPSSPTSPALKSMTSDLPELRGMCRSLV 60
           MFNHYNPFSFTHKLKS I SCFRPDHD ELH+N     +PALKSMTSDLPELRGMCRSLV
Sbjct: 1   MFNHYNPFSFTHKLKSSIPSCFRPDHDAELHENEDD--NPALKSMTSDLPELRGMCRSLV 60

Query: 61  ARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPAS-PPPVNLMR 120
           ARISWPSRRRYHHS DFRYDPSSYALNFEDEQ+R D+EFPIRDFTSRLPAS PPPVNLMR
Sbjct: 61  ARISWPSRRRYHHSADFRYDPSSYALNFEDEQVRDDNEFPIRDFTSRLPASPPPPVNLMR 119

BLAST of CSPI04G26530 vs. NCBI nr
Match: XP_023523424.1 (uncharacterized protein LOC111787637 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 157.1 bits (396), Expect = 9.2e-35
Identity = 83/115 (72.17%), Postives = 92/115 (80.00%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPISCFRPDHDPEL-HDNPSSPTSP--ALKSMTSDLPELRGMCRS 60
           M N Y+PFSFT KLKS I CFR DH   L +++ SSP +P   LKSM ++ PELRGMCR+
Sbjct: 1   MLNDYHPFSFTRKLKSSILCFRRDHGDFLENEDDSSPRTPRSPLKSMAAEFPELRGMCRN 60

Query: 61  LVARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPP 113
           LVARI   SRRRYHHS DFRYDPSSYALNFEDEQ+R DDEFPIRDF SRLPASPP
Sbjct: 61  LVARIG-RSRRRYHHSADFRYDPSSYALNFEDEQVRDDDEFPIRDFASRLPASPP 114

BLAST of CSPI04G26530 vs. NCBI nr
Match: KAG7031776.1 (hypothetical protein SDJN02_05817, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 157.1 bits (396), Expect = 9.2e-35
Identity = 83/116 (71.55%), Postives = 91/116 (78.45%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPISCFRPDH----DPELHDNPSSPTSPALKSMTSDLPELRGMCR 60
           M N Y+PFSFT KLKS I CFR DH    + E   +P +P SP LKSM ++ PELRGMCR
Sbjct: 1   MLNDYHPFSFTRKLKSSILCFRRDHGDFLENEGDSSPRTPRSP-LKSMAAEFPELRGMCR 60

Query: 61  SLVARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPP 113
           +LVARI   SRRRYHHS DFRYDPSSYALNFEDEQ+R DDEFPIRDF SRLPASPP
Sbjct: 61  NLVARIG-RSRRRYHHSADFRYDPSSYALNFEDEQVRDDDEFPIRDFASRLPASPP 114

BLAST of CSPI04G26530 vs. NCBI nr
Match: XP_022940408.1 (uncharacterized protein LOC111446022 [Cucurbita moschata] >KAG6608139.1 hypothetical protein SDJN03_01481, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 157.1 bits (396), Expect = 9.2e-35
Identity = 83/116 (71.55%), Postives = 91/116 (78.45%), Query Frame = 0

Query: 1   MFNHYNPFSFTHKLKSPISCFRPDH----DPELHDNPSSPTSPALKSMTSDLPELRGMCR 60
           M N Y+PFSFT KLKS I CFR DH    + E   +P +P SP LKSM ++ PELRGMCR
Sbjct: 2   MLNDYHPFSFTRKLKSSILCFRRDHGDFLENEGDSSPRTPRSP-LKSMAAEFPELRGMCR 61

Query: 61  SLVARISWPSRRRYHHSTDFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPASPP 113
           +LVARI   SRRRYHHS DFRYDPSSYALNFEDEQ+R DDEFPIRDF SRLPASPP
Sbjct: 62  NLVARIG-RSRRRYHHSADFRYDPSSYALNFEDEQVRDDDEFPIRDFASRLPASPP 115

BLAST of CSPI04G26530 vs. TAIR 10
Match: AT5G11070.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 62.0 bits (149), Expect = 3.8e-10
Identity = 48/121 (39.67%), Postives = 63/121 (52.07%), Query Frame = 0

Query: 12  HKLKSPISCFRPDH--DPE---LHDNPSSPTSPA--LKSMTSDLPELRGMCRSLVARISW 71
           H+++SPI C   +   +PE   +     +P SP   LKS   +L ELR  CR + +RI  
Sbjct: 20  HRMRSPICCIGANAVVEPEAMMIGGEQRTPRSPYEWLKSTAQEL-ELRDRCRRVKSRIKV 79

Query: 72  PSR----------RRYHHST----DFRYDPSSYALNFEDEQLRYDDEFPIRDFTSRLPAS 112
             R            +HHS     DF YDP SYALNFED  +R DD+    +FT+RLP S
Sbjct: 80  TCRNNNCAYNNCVHHHHHSQSYPGDFSYDPLSYALNFED-NVRADDDGSFPNFTARLPQS 138

BLAST of CSPI04G26530 vs. TAIR 10
Match: AT5G35090.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 58.5 bits (140), Expect = 4.2e-09
Identity = 49/136 (36.03%), Postives = 64/136 (47.06%), Query Frame = 0

Query: 9   SFTHKLKSPI---SCFRP----DHDPELHDNPSSP--TSPALKS------------MTSD 68
           S   KLKS      CFR      HD    D PSSP  T  + +S            +T  
Sbjct: 10  SLQKKLKSRFCIAGCFRTTNHHHHDNIPDDLPSSPVTTEKSTQSPHGGGMKTKSPRLTRT 69

Query: 69  LPELRGMCRSLVAR-------ISWPSRRRYHHSTDFRYDPSSYALNF----EDEQLRYDD 113
           L +    C+SL+ R       +    +    H+ DF YDPSSYALNF    ED+ +   D
Sbjct: 70  LSKSHEKCKSLIHRMGGGVGGVGGHGKHIRRHTADFHYDPSSYALNFDKGDEDDNI---D 129

BLAST of CSPI04G26530 vs. TAIR 10
Match: AT3G01430.1 (BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1); Has 98 Blast hits to 98 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 43.9 bits (102), Expect = 1.1e-04
Identity = 21/42 (50.00%), Postives = 28/42 (66.67%), Query Frame = 0

Query: 76  FRYDPSSYALNFED--EQLRYDDEFPIRDFTSRLPASPPPVN 116
           FRYD  SY+LNF+D  +   +DDEFP RD++ R  A   PV+
Sbjct: 125 FRYDQLSYSLNFDDGNQTGHFDDEFPYRDYSMRFAAPSLPVS 166

BLAST of CSPI04G26530 vs. TAIR 10
Match: AT5G14890.1 (NHL domain-containing protein )

HSP 1 Score: 42.0 bits (97), Expect = 4.0e-04
Identity = 20/42 (47.62%), Postives = 28/42 (66.67%), Query Frame = 0

Query: 76  FRYDPSSYALNFED--EQLRYDDEFPIRDFTSRLPASPPPVN 116
           FRYD  SY+LNF+D  +   ++DEFP RD++ R  A   PV+
Sbjct: 695 FRYDSWSYSLNFDDGKQTGHFEDEFPYRDYSMRFAAPSLPVS 736

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L4E21.7e-6399.16Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G652740 PE=4 SV=1[more]
A0A5A7VBS39.0e-5287.60DNA ligase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G00970 PE=... [more]
A0A1S3BVQ99.0e-5287.60uncharacterized protein LOC103493778 OS=Cucumis melo OX=3656 GN=LOC103493778 PE=... [more]
A0A6J1FP704.5e-3571.55uncharacterized protein LOC111446022 OS=Cucurbita moschata OX=3662 GN=LOC1114460... [more]
A0A6J1J1H31.3e-3469.83uncharacterized protein LOC111480417 OS=Cucurbita maxima OX=3661 GN=LOC111480417... [more]
Match NameE-valueIdentityDescription
KGN55462.13.6e-6399.16hypothetical protein Csa_012581 [Cucumis sativus][more]
XP_008452891.11.8e-5187.60PREDICTED: uncharacterized protein LOC103493778 [Cucumis melo] >KAA0064610.1 DNA... [more]
XP_023523424.19.2e-3572.17uncharacterized protein LOC111787637 [Cucurbita pepo subsp. pepo][more]
KAG7031776.19.2e-3571.55hypothetical protein SDJN02_05817, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022940408.19.2e-3571.55uncharacterized protein LOC111446022 [Cucurbita moschata] >KAG6608139.1 hypothet... [more]
Match NameE-valueIdentityDescription
AT5G11070.13.8e-1039.67unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0... [more]
AT5G35090.14.2e-0936.03unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0... [more]
AT3G01430.11.1e-0450.00BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:... [more]
AT5G14890.14.0e-0447.62NHL domain-containing protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 21..46
NoneNo IPR availablePANTHERPTHR33168:SF61SUBFAMILY NOT NAMEDcoord: 11..113
NoneNo IPR availablePANTHERPTHR33168STRESS INDUCED PROTEIN-RELATEDcoord: 11..113

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G26530.1CSPI04G26530.1mRNA