CSPI02G16720 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI02G16720
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DNA ligase 1-like isoform X2
Location: Chr2: 16036523 .. 16038965 (-)
RNA-Seq Expression: CSPI02G16720
Synteny: CSPI02G16720
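As a rough illustration of how the Location field can be read, the sketch below parses a location string of the form used in this record and computes the span it covers. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a string like 'Chr2: 16036523 .. 16038965 (-)' into its parts.

    Assumption: coordinates are 1-based and inclusive (not stated in the record).
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("Chr2: 16036523 .. 16038965 (-)")
print(loc["strand"], loc["length"])  # - 2443
```

The "(-)" strand marker indicates that the gene is annotated on the reverse strand of Chr2.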
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTCAAGTCGGTAGAACGCCGTTATAAATAGAGGGCCTCCCCCCATTGAGCGACGCGTAAGAGGCGAATTAGGTTTTCCAATTATCCGCCTCAACGAAGAATCCTTCGTTTGTTCTGCGCTAGGGTTTTCATCGCTAACTCGTTTCCTCTTCTGTTTTCGCAAATCTCGATCTCTCCTCTCTTTTTTCTTTTAAATCTTCGCCGCGAATCCCTTTCCATAGTAGACACACGATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTACGTGAGGAAGGTGGCTAGCACCGAGGCGGCCTTGATCGAATCGATTAAGGTTTGTTTCAGCTATTTGGTTGGTTTTCATAAATGTATGCTGGATTCAATTGTGTTATGATTCCAATTGATTGCGAGATCTTCTTGGGATTGTAACTTTTTGGATGGGGGTATATGTAATCCGTAGTCGGGAAACGGCTTTTCTTGAAAAAAAAATTTCGACGATTGGGGAAATTTTACATGTGGGTTTTTCTTGTATCGATGGAGACTGAGCGCTGTAAGTTGTGTACCATGATGATTATTTGGACTGTAGTTTCCAGTTTAGATCTTTGAGAGTTCAAGTGTATAGGTTAGTGGATTAGGGTTCTGCTAGTTGCATGTCGTACTCTTCGTTCATGGATCTTGTATGGATGTATTCCTTGTTTCAAAATCAAAATAATTTAATAAAAGTTTGTTTCTATATCAAAATAATTTAATGAAAGTTTAAACTTGAAACTTTTTATGGATCGTTTATTCATCAATCTTTATGGATTTCTAATGCTTTTAAAAACCCTGGTTTAAATGCTTTCTAAATAGTTTTCTGCAAGTTTTAAATAGAAGCGACTTAGGGTTTCTTTAAGTTTGTTATTATGAACGCTCTTGTAGCAAGTTTTCATCGAGGCCTTTTTTTCTTTAATATCTTCAGGCATATGAATACATGATATGAATGTATGAGTTTGCTTGTTGGAATATATGATTATCTGAATCCAGATACAATTTCTCCTTTAGAATATAGATAGCTACCCGTTTTGAAGGGTTGAATTATATTAGTTAGGGAATGGTTGATCTGTTTGGTGCTTCTGTCGTCTGCTCTTTTTGAACTGGTTATTCTGGTTCGTTATAGCTCCAATCTGAAAGACAGAGCAAGAATGATAGAAAGAATGAGAAACGTAGGCACAAGAAAGAGAAGAAAGAGAAGTCCAAGGACAAGAAAGAGAGAAGTAAGGACAAAAAACACAAAAGCAAAGAACGTAAAGAACATAAGGGGAAATCCTCCCGTAGCCAGGGCTTGAATGATCAAAAACACGACAAATGCTTTAAAGAAGTCAAGGACCTAGATGGATCCAAAGTTGAAGCAGAACAATTAGAAAGGAGTGGTCTCACTGAAGAGCATGGACAACCTGTATGGCCTCAAAGCCCTGCCTACTTGTCTGATGGAACTCAGATCGACCACAAGAGAAAAAGGGAAGCTGCAACACAGCCTGATGAAGGTTGTAAGCCTGGTGGGTTCTCTTTATGAGTAATAGGCTGTTTTATGAAATAAGTTATAATGGCCACTTGTACAAACGCTAATGTATTATCAATTAACAGGTAAAATCATCCGAATCAAACTTGCCTCGGCCTCTTCACTTAGCCAGCAAGAGGATTCATCAGCTGGCAGTGAACAGATGTGTTCTACATCTGGTCGCTATAATTCTGTTGATCAAAAGACAGATGGAGACAGTCATGGATCCATAGCCAATGCTGAAACAGCTGTCACTGTTTATCCCACTTTGTCCAACCCAAAGACTCCTTTACATCCCATCAGGGACAGTAATTCTAACGATAAGGTTGCGTCGGTACCTTCTCGCAAAAGAAGTTCAGCTGAATCTGCTTATGAGGCATTGTTTGAGAAGTGGGTAGCACCACCACTTCTGTTGGAGCAACAAACTGATGACGAGGAATGGCTCTTCGGAACAACAAGAAAACAAGATGGACGAAGTAGTA
CCATGGCCAACAACAATGCTCTCAGTACTGTTTCCAGCTGTGGTAGAAGTTCAAATCTGTGGCCGAGAGGACAATATCTTGTCGATGCTGATGTTTATTCATTGCCTTATACGATCCCATTTTGATTTTGGATTCTTACTTTGCAGAGTTATATGTACAGTACGCCAGTGGGAATAATAGTCTTCCTAGGTGTTATAATCGGTTTTAGAATCAATATATTTGACATTCAATTTTGTTCTCTGGTGGGTGCCTATTATCCCCGTCGGATGGGTTGATTATTGAACAAACGACGAACATGTTGGAGATATGATGTAAATGTTCAGCGCTAAGTGATATCTTTTATGACAGAGTATAATAGTAGCTTTTTCCTCGAAGATCTTGTCTCTCAAACATCAAGCTTAGTTGAAGTTTTCTCAATTTCATTTACACTTGAATATTTTCAA

mRNA sequence

TTTCAAGTCGGTAGAACGCCGTTATAAATAGAGGGCCTCCCCCCATTGAGCGACGCGTAAGAGGCGAATTAGGTTTTCCAATTATCCGCCTCAACGAAGAATCCTTCGTTTGTTCTGCGCTAGGGTTTTCATCGCTAACTCGTTTCCTCTTCTGTTTTCGCAAATCTCGATCTCTCCTCTCTTTTTTCTTTTAAATCTTCGCCGCGAATCCCTTTCCATAGTAGACACACGATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTACGTGAGGAAGGTGGCTAGCACCGAGGCGGCCTTGATCGAATCGATTAAGCTCCAATCTGAAAGACAGAGCAAGAATGATAGAAAGAATGAGAAACGTAGGCACAAGAAAGAGAAGAAAGAGAAGTCCAAGGACAAGAAAGAGAGAAGTAAGGACAAAAAACACAAAAGCAAAGAACGTAAAGAACATAAGGGGAAATCCTCCCGTAGCCAGGGCTTGAATGATCAAAAACACGACAAATGCTTTAAAGAAGTCAAGGACCTAGATGGATCCAAAGTTGAAGCAGAACAATTAGAAAGGAGTGGTCTCACTGAAGAGCATGGACAACCTGTATGGCCTCAAAGCCCTGCCTACTTGTCTGATGGAACTCAGATCGACCACAAGAGAAAAAGGGAAGCTGCAACACAGCCTGATGAAGGTTGTAAGCCTGGTAAAATCATCCGAATCAAACTTGCCTCGGCCTCTTCACTTAGCCAGCAAGAGGATTCATCAGCTGGCAGTGAACAGATGTGTTCTACATCTGGTCGCTATAATTCTGTTGATCAAAAGACAGATGGAGACAGTCATGGATCCATAGCCAATGCTGAAACAGCTGTCACTGTTTATCCCACTTTGTCCAACCCAAAGACTCCTTTACATCCCATCAGGGACAGTAATTCTAACGATAAGGTTGCGTCGGTACCTTCTCGCAAAAGAAGTTCAGCTGAATCTGCTTATGAGGCATTGTTTGAGAAGTGGGTAGCACCACCACTTCTGTTGGAGCAACAAACTGATGACGAGGAATGGCTCTTCGGAACAACAAGAAAACAAGATGGACGAAGTAGTACCATGGCCAACAACAATGCTCTCAGTACTGTTTCCAGCTGTGGTAGAAGTTCAAATCTGTGGCCGAGAGGACAATATCTTGTCGATGCTGATGTTTATTCATTGCCTTATACGATCCCATTTTGATTTTGGATTCTTACTTTGCAGAGTTATATGTACAGTACGCCAGTGGGAATAATAGTCTTCCTAGGTGTTATAATCGGTTTTAGAATCAATATATTTGACATTCAATTTTGTTCTCTGGTGGGTGCCTATTATCCCCGTCGGATGGGTTGATTATTGAACAAACGACGAACATGTTGGAGATATGATGTAAATGTTCAGCGCTAAGTGATATCTTTTATGACAGAGTATAATAGTAGCTTTTTCCTCGAAGATCTTGTCTCTCAAACATCAAGCTTAGTTGAAGTTTTCTCAATTTCATTTACACTTGAATATTTTCAA

Coding sequence (CDS)

ATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTACGTGAGGAAGGTGGCTAGCACCGAGGCGGCCTTGATCGAATCGATTAAGCTCCAATCTGAAAGACAGAGCAAGAATGATAGAAAGAATGAGAAACGTAGGCACAAGAAAGAGAAGAAAGAGAAGTCCAAGGACAAGAAAGAGAGAAGTAAGGACAAAAAACACAAAAGCAAAGAACGTAAAGAACATAAGGGGAAATCCTCCCGTAGCCAGGGCTTGAATGATCAAAAACACGACAAATGCTTTAAAGAAGTCAAGGACCTAGATGGATCCAAAGTTGAAGCAGAACAATTAGAAAGGAGTGGTCTCACTGAAGAGCATGGACAACCTGTATGGCCTCAAAGCCCTGCCTACTTGTCTGATGGAACTCAGATCGACCACAAGAGAAAAAGGGAAGCTGCAACACAGCCTGATGAAGGTTGTAAGCCTGGTAAAATCATCCGAATCAAACTTGCCTCGGCCTCTTCACTTAGCCAGCAAGAGGATTCATCAGCTGGCAGTGAACAGATGTGTTCTACATCTGGTCGCTATAATTCTGTTGATCAAAAGACAGATGGAGACAGTCATGGATCCATAGCCAATGCTGAAACAGCTGTCACTGTTTATCCCACTTTGTCCAACCCAAAGACTCCTTTACATCCCATCAGGGACAGTAATTCTAACGATAAGGTTGCGTCGGTACCTTCTCGCAAAAGAAGTTCAGCTGAATCTGCTTATGAGGCATTGTTTGAGAAGTGGGTAGCACCACCACTTCTGTTGGAGCAACAAACTGATGACGAGGAATGGCTCTTCGGAACAACAAGAAAACAAGATGGACGAAGTAGTACCATGGCCAACAACAATGCTCTCAGTACTGTTTCCAGCTGTGGTAGAAGTTCAAATCTGTGGCCGAGAGGACAATATCTTGTCGATGCTGATGTTTATTCATTGCCTTATACGATCCCATTTTGA

Protein sequence

MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKERSKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSEQMCSTSGRYNSVDQKTDGDSHGSIANAETAVTVYPTLSNPKTPLHPIRDSNSNDKVASVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALSTVSSCGRSSNLWPRGQYLVDADVYSLPYTIPF*
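
The protein sequence above corresponds to a translation of the CDS with the standard genetic code. The following is a minimal sketch of that translation, assuming the standard codon table (NCBI table 1); the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 12 codons and last 4 codons of the CDS listed above.
print(translate("ATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTAC"))  # MSRCFPYPPPGY
print(translate("ATCCCATTTTGA"))                          # IPF*
```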
Homology
BLAST of CSPI02G16720 vs. ExPASy TrEMBL
Match: A0A0A0LK14 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G345960 PE=4 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 1.4e-171
Identity = 324/327 (99.08%), Positives = 326/327 (99.69%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSK+KKER
Sbjct: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKNKKER 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ 120
           SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ
Sbjct: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ 120

Query: 121 PVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSEQ 180
           PVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSEQ
Sbjct: 121 PVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSEQ 180

Query: 181 MCSTSGRYNSVDQKTDGDSHGSIANAETAVTVYPTLSNPKTPLHPIRDSNSNDKVASVPS 240
           MCSTSGRYNSVDQKTDGDSHGSIANAETAVTV+PTLSNPKTPLHPIRDSNS DKVASVPS
Sbjct: 181 MCSTSGRYNSVDQKTDGDSHGSIANAETAVTVFPTLSNPKTPLHPIRDSNSTDKVASVPS 240

Query: 241 RKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALSTVSSC 300
           RKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALSTVSSC
Sbjct: 241 RKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALSTVSSC 300

Query: 301 GRSSNLWPRGQYLVDADVYSLPYTIPF 328
           GRSSNLWPRGQYLVDADVYSLPYTIPF
Sbjct: 301 GRSSNLWPRGQYLVDADVYSLPYTIPF 327
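
The Identity and Positives percentages in each HSP header appear to be the match count divided by the alignment length (gap columns included). A short sketch of that arithmetic, using the figures from the hit above; the function name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

print(percent(324, 327))  # 99.08 (Identity)
print(percent(326, 327))  # 99.69 (Positives)
```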

BLAST of CSPI02G16720 vs. ExPASy TrEMBL
Match: A0A5A7VB55 (DNA ligase 1-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G001500 PE=4 SV=1)

HSP 1 Score: 546.6 bits (1407), Expect = 7.2e-152
Identity = 296/331 (89.43%), Positives = 305/331 (92.15%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRH   KKEKSKDKKER
Sbjct: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRH---KKEKSKDKKER 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKD-LDGSKVEAEQLERSGLTEEHG 120
           SKDKKHKSKERKEHK KSS S+ LNDQKH+KC KEVKD LDG+KVEAEQLERSGLTEEHG
Sbjct: 61  SKDKKHKSKERKEHKEKSSHSRSLNDQKHNKCLKEVKDLLDGTKVEAEQLERSGLTEEHG 120

Query: 121 QPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSE 180
           QPVWPQSPAYLSDGTQIDHKRKR+A TQPDEGCKPGKIIRIKLASA SLSQQEDS+AGSE
Sbjct: 121 QPVWPQSPAYLSDGTQIDHKRKRQAETQPDEGCKPGKIIRIKLASAPSLSQQEDSAAGSE 180

Query: 181 QMCSTSGRYNSVDQKTDGDSHGSIANAETAVTVYPTLSNPK---TPLHPIRDSNSNDKVA 240
           QMCSTSGRYNS DQKTDGDSHGS+ANAETAV V+PTLSNPK    PLHPI D NS   V 
Sbjct: 181 QMCSTSGRYNSFDQKTDGDSHGSVANAETAVAVHPTLSNPKIEHPPLHPIGDRNSKSTVV 240

Query: 241 SVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALST 300
           SVPSRKRSSAESAYEALFE+WVAPPLLLEQQTDDEEWLFGTTRKQDGRS+   NNNA ST
Sbjct: 241 SVPSRKRSSAESAYEALFEEWVAPPLLLEQQTDDEEWLFGTTRKQDGRSTMANNNNAFST 300

Query: 301 VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
           VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF
Sbjct: 301 VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328

BLAST of CSPI02G16720 vs. ExPASy TrEMBL
Match: A0A1S3BBZ0 (uncharacterized protein LOC103488397 OS=Cucumis melo OX=3656 GN=LOC103488397 PE=4 SV=1)

HSP 1 Score: 546.6 bits (1407), Expect = 7.2e-152
Identity = 296/331 (89.43%), Positives = 305/331 (92.15%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRH   KKEKSKDKKER
Sbjct: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRH---KKEKSKDKKER 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKD-LDGSKVEAEQLERSGLTEEHG 120
           SKDKKHKSKERKEHK KSS S+ LNDQKH+KC KEVKD LDG+KVEAEQLERSGLTEEHG
Sbjct: 61  SKDKKHKSKERKEHKEKSSHSRSLNDQKHNKCLKEVKDLLDGTKVEAEQLERSGLTEEHG 120

Query: 121 QPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSE 180
           QPVWPQSPAYLSDGTQIDHKRKR+A TQPDEGCKPGKIIRIKLASA SLSQQEDS+AGSE
Sbjct: 121 QPVWPQSPAYLSDGTQIDHKRKRQAETQPDEGCKPGKIIRIKLASAPSLSQQEDSAAGSE 180

Query: 181 QMCSTSGRYNSVDQKTDGDSHGSIANAETAVTVYPTLSNPK---TPLHPIRDSNSNDKVA 240
           QMCSTSGRYNS DQKTDGDSHGS+ANAETAV V+PTLSNPK    PLHPI D NS   V 
Sbjct: 181 QMCSTSGRYNSFDQKTDGDSHGSVANAETAVAVHPTLSNPKIEHPPLHPIGDRNSKSTVV 240

Query: 241 SVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALST 300
           SVPSRKRSSAESAYEALFE+WVAPPLLLEQQTDDEEWLFGTTRKQDGRS+   NNNA ST
Sbjct: 241 SVPSRKRSSAESAYEALFEEWVAPPLLLEQQTDDEEWLFGTTRKQDGRSTMANNNNAFST 300

Query: 301 VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
           VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF
Sbjct: 301 VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328

BLAST of CSPI02G16720 vs. ExPASy TrEMBL
Match: A0A6J1CT76 (chromatin assembly factor 1 subunit A-like OS=Momordica charantia OX=3673 GN=LOC111013996 PE=4 SV=1)

HSP 1 Score: 360.5 bits (924), Expect = 7.3e-96
Identity = 217/340 (63.82%), Positives = 247/340 (72.65%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSER-QSKNDRKNEKRRHKKEKKEKSKDKKE 60
           MSRCFPYPPPGY  KVA TEAALIESIKLQSER QSK+DRK EK +H+KE+ EKSK+KK+
Sbjct: 1   MSRCFPYPPPGYAGKVARTEAALIESIKLQSERQQSKHDRKKEKSKHRKERSEKSKEKKQ 60

Query: 61  RSKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKD-LDGSKVEAEQLERSGLTEEH 120
           R          RKE K KSS S  LNDQK  +C K+ +D L G+KVEAEQLE+SGLTEEH
Sbjct: 61  R----------RKERKEKSSCSCDLNDQKQKECAKQAEDRLKGTKVEAEQLEKSGLTEEH 120

Query: 121 GQPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGS 180
           GQPVWPQSP YLSDGTQI+HKRKR+A  QP+E  KPGKIIRIKL  ASSLS QEDSSA +
Sbjct: 121 GQPVWPQSPGYLSDGTQINHKRKRDAKLQPNEDSKPGKIIRIKL--ASSLSNQEDSSADT 180

Query: 181 EQMCSTSGRYNSVDQKTDGDSHG------SIANAETAVTV-----YPTLSNPKTPLHPIR 240
           +Q CSTSGRY+ VDQK D +S G         N+ T V V      P + +    +H ++
Sbjct: 181 QQTCSTSGRYDCVDQKRDENSCGPNQQKPCFTNSNTVVAVEEAPPKPRIKDHSRSVHAVK 240

Query: 241 DSNSNDKVASVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSST 300
           D      V   P+R RS AES YEALFEKW+ PPL LEQQ DDEEWLFG TRKQDG+++ 
Sbjct: 241 DIRPQGNVVPFPTRTRSPAESEYEALFEKWIPPPLQLEQQMDDEEWLFG-TRKQDGQTTK 300

Query: 301 MANNNALSTVSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
              N A S V SC RSS+LWPRGQYL DADVYSLPYTIPF
Sbjct: 301 ATTNKAFSPVPSC-RSSSLWPRGQYLPDADVYSLPYTIPF 326

BLAST of CSPI02G16720 vs. ExPASy TrEMBL
Match: A0A6J1HBK0 (DNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111462522 PE=4 SV=1)

HSP 1 Score: 359.8 bits (922), Expect = 1.2e-95
Identity = 228/334 (68.26%), Positives = 254/334 (76.05%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKE-KKEKSKDKKE 60
           MSRCFPYPPPGYVRKVA TEAALIESIKLQSER          R+HK + KKEKSK KKE
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSER----------RQHKTDSKKEKSKHKKE 60

Query: 61  RSKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKD-LDGSKVEAEQLERSGLTEEH 120
           +SKD+KHKSKERKE K KSSRS  LNDQK   C KE KD L+G+KVEAEQLE+SGLTEEH
Sbjct: 61  KSKDRKHKSKERKERKEKSSRS--LNDQKQKACVKEAKDRLEGTKVEAEQLEKSGLTEEH 120

Query: 121 GQPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGS 180
           GQPVWP SP YLSDGTQI+HKRKR+ + QPDEGCKPGK+IRIKL  ASSLSQQE+SSAGS
Sbjct: 121 GQPVWPHSPGYLSDGTQINHKRKRD-SLQPDEGCKPGKVIRIKL--ASSLSQQENSSAGS 180

Query: 181 EQMCSTSGRYNSVDQKTDGDSHGS-IANAETAVTVYP-TLSNPK---TPLHPIRDSNSND 240
           EQ CS SGR  S D+ +      +  AN+ETA+ V   T S PK    P H +++ +S  
Sbjct: 181 EQTCSVSGRDCSRDENSSVVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKEISSLG 240

Query: 241 KVASVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNA 300
            V S+P R RS  ESAYEALFEKWV PPL LEQQ DDEEWLF  T KQDGRSS    N A
Sbjct: 241 NVMSLP-RTRSPVESAYEALFEKWVPPPLQLEQQMDDEEWLF-PTEKQDGRSS--KTNEA 300

Query: 301 LSTVSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
            S++ SC RSS+LWPRGQYL DADVYSLPYTIP+
Sbjct: 301 FSSIPSC-RSSSLWPRGQYLADADVYSLPYTIPY 314

BLAST of CSPI02G16720 vs. NCBI nr
Match: XP_004143000.1 (uncharacterized protein LOC101215840 [Cucumis sativus] >KGN62255.1 hypothetical protein Csa_018621 [Cucumis sativus])

HSP 1 Score: 612.1 bits (1577), Expect = 2.9e-171
Identity = 324/327 (99.08%), Positives = 326/327 (99.69%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSK+KKER
Sbjct: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKNKKER 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ 120
           SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ
Sbjct: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ 120

Query: 121 PVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSEQ 180
           PVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSEQ
Sbjct: 121 PVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSEQ 180

Query: 181 MCSTSGRYNSVDQKTDGDSHGSIANAETAVTVYPTLSNPKTPLHPIRDSNSNDKVASVPS 240
           MCSTSGRYNSVDQKTDGDSHGSIANAETAVTV+PTLSNPKTPLHPIRDSNS DKVASVPS
Sbjct: 181 MCSTSGRYNSVDQKTDGDSHGSIANAETAVTVFPTLSNPKTPLHPIRDSNSTDKVASVPS 240

Query: 241 RKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALSTVSSC 300
           RKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALSTVSSC
Sbjct: 241 RKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALSTVSSC 300

Query: 301 GRSSNLWPRGQYLVDADVYSLPYTIPF 328
           GRSSNLWPRGQYLVDADVYSLPYTIPF
Sbjct: 301 GRSSNLWPRGQYLVDADVYSLPYTIPF 327

BLAST of CSPI02G16720 vs. NCBI nr
Match: XP_008445332.1 (PREDICTED: uncharacterized protein LOC103488397 [Cucumis melo] >KAA0064838.1 DNA ligase 1-like isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 546.6 bits (1407), Expect = 1.5e-151
Identity = 296/331 (89.43%), Positives = 305/331 (92.15%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRH   KKEKSKDKKER
Sbjct: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRH---KKEKSKDKKER 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKD-LDGSKVEAEQLERSGLTEEHG 120
           SKDKKHKSKERKEHK KSS S+ LNDQKH+KC KEVKD LDG+KVEAEQLERSGLTEEHG
Sbjct: 61  SKDKKHKSKERKEHKEKSSHSRSLNDQKHNKCLKEVKDLLDGTKVEAEQLERSGLTEEHG 120

Query: 121 QPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGSE 180
           QPVWPQSPAYLSDGTQIDHKRKR+A TQPDEGCKPGKIIRIKLASA SLSQQEDS+AGSE
Sbjct: 121 QPVWPQSPAYLSDGTQIDHKRKRQAETQPDEGCKPGKIIRIKLASAPSLSQQEDSAAGSE 180

Query: 181 QMCSTSGRYNSVDQKTDGDSHGSIANAETAVTVYPTLSNPK---TPLHPIRDSNSNDKVA 240
           QMCSTSGRYNS DQKTDGDSHGS+ANAETAV V+PTLSNPK    PLHPI D NS   V 
Sbjct: 181 QMCSTSGRYNSFDQKTDGDSHGSVANAETAVAVHPTLSNPKIEHPPLHPIGDRNSKSTVV 240

Query: 241 SVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANNNALST 300
           SVPSRKRSSAESAYEALFE+WVAPPLLLEQQTDDEEWLFGTTRKQDGRS+   NNNA ST
Sbjct: 241 SVPSRKRSSAESAYEALFEEWVAPPLLLEQQTDDEEWLFGTTRKQDGRSTMANNNNAFST 300

Query: 301 VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
           VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF
Sbjct: 301 VSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328

BLAST of CSPI02G16720 vs. NCBI nr
Match: XP_038885448.1 (DNA ligase 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 416.4 bits (1069), Expect = 2.3e-112
Identity = 249/336 (74.11%), Positives = 270/336 (80.36%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSE-RQSKNDRKNEKRRHKKEKKEKSKDKKE 60
           MSRCFPYPPPGYVRKVASTEAALIESIKLQSE RQSKND K EK +H   KKEKSKDKKE
Sbjct: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERRQSKNDSKKEKSKH---KKEKSKDKKE 60

Query: 61  RSKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKD-LDGSKVEAEQLERSGLTEEH 120
           RSKDKKHKSKERKE   KSS S+ LNDQK  +C KE K+ L G+KVEAEQLERSGLTEEH
Sbjct: 61  RSKDKKHKSKERKE---KSSHSRDLNDQKQKECLKEAKELLKGTKVEAEQLERSGLTEEH 120

Query: 121 GQPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGS 180
           GQPVWPQSP YLSDGTQI+HKRKR+A+ Q +EGCKPGKIIRIKL    SLSQQEDSSAGS
Sbjct: 121 GQPVWPQSPGYLSDGTQINHKRKRDASLQSNEGCKPGKIIRIKL----SLSQQEDSSAGS 180

Query: 181 EQMCSTSGRYNSVDQKTDGDSHGSIAN------AETAVTVY-PTLSNPKTPLHPIRDSNS 240
           EQ CSTSGR  SVDQK D +S GSI        A TAV V  P+ S PK  +  ++D +S
Sbjct: 181 EQTCSTSGRDISVDQKRDENSRGSIQQNTGFTYAGTAVAVNDPSSSKPK--IQSVKDISS 240

Query: 241 NDKVASVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANN 300
              V S+P R RS AESAYEALFEKWVAPPL LEQQTDDE+WLFG TRKQDG+S+   NN
Sbjct: 241 KGNVVSLPPRTRSPAESAYEALFEKWVAPPLQLEQQTDDEDWLFGRTRKQDGQST---NN 300

Query: 301 NALSTVSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
            A S+V SCGRSS+LWPRGQYL DADVYSLPYTIPF
Sbjct: 301 KAFSSVPSCGRSSSLWPRGQYLADADVYSLPYTIPF 321

BLAST of CSPI02G16720 vs. NCBI nr
Match: XP_038885449.1 (nucleoporin GLE1 isoform X2 [Benincasa hispida])

HSP 1 Score: 400.2 bits (1027), Expect = 1.7e-107
Identity = 245/336 (72.92%), Positives = 266/336 (79.17%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSE-RQSKNDRKNEKRRHKKEKKEKSKDKKE 60
           MSRCFPYPPPGYVRKVASTEAALIESIKLQSE RQSKND K EK +H   KKEKSKDKKE
Sbjct: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERRQSKNDSKKEKSKH---KKEKSKDKKE 60

Query: 61  RSKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKD-LDGSKVEAEQLERSGLTEEH 120
           RSKDKKHKSKERKE   KSS S+ LNDQK  +C KE K+ L G+KVEAEQLERSGLTEEH
Sbjct: 61  RSKDKKHKSKERKE---KSSHSRDLNDQKQKECLKEAKELLKGTKVEAEQLERSGLTEEH 120

Query: 121 GQPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGS 180
           GQPVWPQSP YLSDGTQI+HKRKR+A+ Q +E    GKIIRIKL    SLSQQEDSSAGS
Sbjct: 121 GQPVWPQSPGYLSDGTQINHKRKRDASLQSNE----GKIIRIKL----SLSQQEDSSAGS 180

Query: 181 EQMCSTSGRYNSVDQKTDGDSHGSIAN------AETAVTVY-PTLSNPKTPLHPIRDSNS 240
           EQ CSTSGR  SVDQK D +S GSI        A TAV V  P+ S PK  +  ++D +S
Sbjct: 181 EQTCSTSGRDISVDQKRDENSRGSIQQNTGFTYAGTAVAVNDPSSSKPK--IQSVKDISS 240

Query: 241 NDKVASVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEEWLFGTTRKQDGRSSTMANN 300
              V S+P R RS AESAYEALFEKWVAPPL LEQQTDDE+WLFG TRKQDG+S+   NN
Sbjct: 241 KGNVVSLPPRTRSPAESAYEALFEKWVAPPLQLEQQTDDEDWLFGRTRKQDGQST---NN 300

Query: 301 NALSTVSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
            A S+V SCGRSS+LWPRGQYL DADVYSLPYTIPF
Sbjct: 301 KAFSSVPSCGRSSSLWPRGQYLADADVYSLPYTIPF 317

BLAST of CSPI02G16720 vs. NCBI nr
Match: XP_023545923.1 (uncharacterized protein LOC111805212 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 366.3 bits (939), Expect = 2.7e-97
Identity = 233/354 (65.82%), Positives = 257/354 (72.60%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKE-KKEKSKDKKE 60
           MSRCFPYPPPGYVRKVA TEAALIESIKLQSER          R+HK + KKEKSK KKE
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSER----------RQHKTDSKKEKSKHKKE 60

Query: 61  RSKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKD-LDGSKVEAEQLERSGLTEEH 120
           +SKD+KHKSKERKE K KSSRS+ LNDQK   C KEVKD L+G+KVEAEQLE+SGLTEEH
Sbjct: 61  KSKDRKHKSKERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEH 120

Query: 121 GQPVWPQSPAYLSDGTQIDHKRKREAATQPDEGCKPGKIIRIKLASASSLSQQEDSSAGS 180
           GQPVWP SP YLSDGTQI+ KRKR+ + QPDEGCKPGK+IRIKL  ASSLSQQE+SSAGS
Sbjct: 121 GQPVWPHSPGYLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKL--ASSLSQQENSSAGS 180

Query: 181 EQMCSTSGRYNSVDQKTDGDS----HGSIANAETAVTVYP-TLSNPK---TPLHPIRDSN 240
           EQMCS SGR  S DQK+D +S        AN+ETA+ V   T S PK    P H ++D  
Sbjct: 181 EQMCSVSGRDCSRDQKSDENSSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRT 240

Query: 241 SNDKVASVPS-----------------RKRSSAESAYEALFEKWVAPPLLLEQQTDDEEW 300
           S+      PS                 R RS  ESAYEALFEKWV PPL LEQQ DDEEW
Sbjct: 241 SSKPKIKDPSPHAVKEISSLGNVMSLPRTRSPVESAYEALFEKWVPPPLQLEQQMDDEEW 300

Query: 301 LFGTTRKQDGRSSTMANNNALSTVSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
           LF  T KQDGRS+    N A S+V SC RSS+LWPRGQYL DADVYSLPYTIP+
Sbjct: 301 LF-RTEKQDGRST--KTNEAFSSVPSC-RSSSLWPRGQYLADADVYSLPYTIPY 338

BLAST of CSPI02G16720 vs. TAIR 10
Match: AT1G20100.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75860.1); Has 471 Blast hits to 438 proteins in 92 species: Archaea - 0; Bacteria - 14; Metazoa - 217; Fungi - 43; Plants - 91; Viruses - 1; Other Eukaryotes - 105 (source: NCBI BLink). )

HSP 1 Score: 77.0 bits (188), Expect = 3.1e-14
Identity = 104/335 (31.04%), Positives = 157/335 (46.87%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSR F  PPP Y R  A+ +  L+E  K++          + K+ H+KEKKEK K+KK +
Sbjct: 1   MSRYFTSPPPVYARNWANGQ-NLVEWTKIE------RPIVDSKKLHRKEKKEKKKEKKLK 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ 120
              K+ KS E+K    K+                          E+EQLE+S LTEE  Q
Sbjct: 61  ---KEKKSLEQKYSTTKT-----------------------VSYESEQLEKSCLTEEFEQ 120

Query: 121 PVWPQSPAYLSDGTQIDHKRKRE---AATQPDEGCKP--GKIIRIKLASASSLSQQEDSS 180
           P       YLSDG+Q   KR+RE   A  +      P  GK +RI++       ++ ++ 
Sbjct: 121 P----QVGYLSDGSQNSKKRRRETSPAVVESQIKATPVAGKPLRIRIVFKK--PKEAEAV 180

Query: 181 AGSEQMCSTSGRYNSVDQKTDGDSHGSIANAETAVTVYPTLSNPKTPLHPIRDSNSNDKV 240
              + +CSTSG      +     S  SI + + AV   P+ S     +  I +S    K 
Sbjct: 181 PQEDPVCSTSGTQRP-SELPSSVSLPSICDHDVAV---PSTSLESGKVAIISESKKRKK- 240

Query: 241 ASVPSRKRSSAESAYEALFEKWVAPPLLLEQ-QTDDEEWLFGTTRKQDGRSSTMANNNAL 300
                  + S ES Y +LF++ V P + LE+  +  ++WLFGT+RK++  S+  +     
Sbjct: 241 ------HKPSKESRYNSLFDELVPPCISLEEDDSSSDDWLFGTSRKENVSSAKSSYKTDE 285

Query: 301 STVSS--CGRSSNLWPRGQYLVDADVYSLPYTIPF 328
            T+ S    R  +  PR   L +  ++SLPYT+PF
Sbjct: 301 DTIMSLQTSRDCSSLPRAMLLSEVGIFSLPYTVPF 285

BLAST of CSPI02G16720 vs. TAIR 10
Match: AT1G75860.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G20100.1); Has 258 Blast hits to 235 proteins in 58 species: Archaea - 0; Bacteria - 4; Metazoa - 59; Fungi - 16; Plants - 90; Viruses - 0; Other Eukaryotes - 89 (source: NCBI BLink). )

HSP 1 Score: 70.1 bits (170), Expect = 3.8e-12
Identity = 101/335 (30.15%), Positives = 148/335 (44.18%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSR    PP  + R     +  L+ES KL   ++   D K   R  KKEKKEK K+KKE 
Sbjct: 1   MSRVLTCPPLVFARNHVGVQ-NLVESTKL---KRITLDSKKAHRIEKKEKKEKRKEKKET 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ 120
            ++K HK      H  K++      D  H   F   K +     E++ LE+SGLT+E  +
Sbjct: 61  KREKSHK------HSIKAT------DNHHKLIFLPSKKVSD---ESDSLEKSGLTDELEE 120

Query: 121 PVWPQSPAYLSDGTQIDHKRKREAATQPDEGC-----KPGKIIRIKLASASSLSQQEDSS 180
           P   +   YLSDG+Q   KR R+ +    E         GK +RI++       ++  + 
Sbjct: 121 P--QKHLGYLSDGSQNSKKRIRDDSPPAVESLIKAAPVAGKPLRIRMVFKKP-KEEVPTL 180

Query: 181 AGSEQMCSTSGRYNSVDQKTDGDSHGSIANAETAVTVYPTLSNPKTPLHPIRDSNSNDKV 240
                +CST+   +   Q     S  S   +E    +      P T +  I ++    K 
Sbjct: 181 PREAVVCSTTVAKSLSHQDVITSSISSSKTSELEKNL------PSTSIAAIDETKKRKK- 240

Query: 241 ASVPSRKRSSAESAYEALFEKWVAPPLLLEQQTDDEE---WLFGTTRKQDGRSSTMANNN 300
                  RSS E  Y ALF+ W  P + +   + ++    WLFG  + Q+      A   
Sbjct: 241 ------HRSSKEDQYNALFDGWTPPSMCIADASSNDNGDYWLFG-NKTQEVLKPKAAVKV 297

Query: 301 ALSTVSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
              T+   G SS  WPR Q+L +  +YSLPYT+PF
Sbjct: 301 DDDTMMRPGDSS--WPRAQFLSEVGIYSLPYTVPF 297

BLAST of CSPI02G16720 vs. TAIR 10
Match: AT4G35940.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G17787.1); Has 45288 Blast hits to 24095 proteins in 1140 species: Archaea - 93; Bacteria - 2895; Metazoa - 13424; Fungi - 2873; Plants - 1183; Viruses - 123; Other Eukaryotes - 24697 (source: NCBI BLink). )

HSP 1 Score: 53.9 bits (128), Expect = 2.8e-07
Identity = 53/126 (42.06%), Positives = 69/126 (54.76%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSRCFP+PPPGYV      EA ++ SIK   E+  K  R+ ++R  KK+K    KDKKER
Sbjct: 1   MSRCFPFPPPGYVLNGIRDEAVIVSSIKGVEEKAKKEQRRKDRRSDKKDK----KDKKER 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKV---------EAEQLER 118
            + K+ K K+RKE +GK     G   + H +  KE    DG+KV         E   LE+
Sbjct: 61  KEKKEKKEKKRKEREGK---EVGSEKRSHKRRRKE----DGAKVDLFHKLKESEVNCLEK 115

BLAST of CSPI02G16720 vs. TAIR 10
Match: AT4G35940.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G17787.1). )

HSP 1 Score: 52.8 bits (125), Expect = 6.2e-07
Identity = 108/416 (25.96%), Positives = 165/416 (39.66%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSRCFP+PPPGYV      EA ++ SIK   E+  K  R+ ++R  KK+K    KDKKER
Sbjct: 1   MSRCFPFPPPGYVLNGIRDEAVIVSSIKGVEEKAKKEQRRKDRRSDKKDK----KDKKER 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKV---------EAEQLER 120
            + K+ K K+RKE +GK     G   + H +  KE    DG+KV         E   LE+
Sbjct: 61  KEKKEKKEKKRKEREGK---EVGSEKRSHKRRRKE----DGAKVDLFHKLKESEVNCLEK 120

Query: 121 SGLT---------------------------EEHGQP-------------VWPQSPAYLS 180
           S LT                           +E  QP             V  Q P    
Sbjct: 121 SSLTVERELLQSTSQNSCDSTLNSNEMLPKQKEVQQPLDGRHNNNNNEKRVEKQQPL--- 180

Query: 181 DGTQIDHKRKREAATQPDEGC----------KPGKIIRIKLASASSLSQQEDSSAGS--- 240
           DG   ++  KR    QP +G           K   +      +   L +++    G    
Sbjct: 181 DGRHNNNNEKRVEKQQPLDGRHNNNNEKRIEKQQPLNGRHNNNNEKLMEKQQPLNGRHNN 240

Query: 241 ------EQMCSTSGRYNSVDQKTDGDS-----HGSIANAETAVTVYPTLSNPKTPLHPIR 300
                 E+    +GR+N+ +++ +        H +  +AE A    P     K P+   R
Sbjct: 241 NNEKRIEKQQPLNGRHNNKEKQKEKQQPLDVRHNNNDSAEHASK--PREEKRKDPI--FR 300

Query: 301 DSNSNDKVASVPSRKR-----------SSAESAYEALFEKWVAPPLLLEQQTD-----DE 328
             +  +K++S  +R+             S    +  + E WV  P  +E++ D     DE
Sbjct: 301 GKHGKEKISSSSTRETYQPPKSLCNCPPSMVLQFLDVVENWV--PNTIERRVDLINSEDE 360

BLAST of CSPI02G16720 vs. TAIR 10
Match: AT2G17787.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G35940.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archaea - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 43.5 bits (101), Expect = 3.8e-04
Identity = 87/347 (25.07%), Positives = 147/347 (42.36%), Query Frame = 0

Query: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERQSKNDRKNEKRRHKKEKKEKSKDKKER 60
           MSRC+P+PPPGYV K      +LIESIK   E   K+      R+HK+ +K++     E 
Sbjct: 1   MSRCYPFPPPGYVWK-----ESLIESIKGAKEEVKKD------RKHKRNEKDRKDRDNEA 60

Query: 61  SKDKKHKSKERKEHKGKSSRSQGLNDQKHDKCFKEVKDLDGSKVEAEQLERSGLTEEHGQ 120
            + +KH+ K R++ +G +  S  L   + +   K  + ++     + Q         + +
Sbjct: 61  GRSRKHRHKRRRKDEG-AIASGKLVSSEVELLEKSCQTVELELQTSSQNSCDSTLHSNER 120

Query: 121 PVWPQSPAYLSDGTQIDHKRKREAATQPDEGC-----KPGKIIRIKLASASSLSQQEDSS 180
           P   QS     D T I  +   +    P++G         +    ++  AS  +   + S
Sbjct: 121 PKQIQSQPL--DETSIRTRLPDKGQEDPEDGVMMTSKDQKQRFSREMLDASQAATAPNES 180

Query: 181 AGSEQMCS------TSGRYNSVDQKTDGDSHGSIANAETAVT---VYPTLS--NPKTPLH 240
            G  ++C       T G    +  K + +     +     V+     P+LS  NP     
Sbjct: 181 VGHSRVCQEKRIDPTFGSSREITTKLNKEKKSVPSKDNRKVSKEKKMPSLSSCNPLEQEK 240

Query: 241 PIRDSNSNDKVASVPSRK-RSSAESAYEALFEKWVAPPLLLEQQTDDEE---WLFGTTRK 300
           P          + +  RK   S       L E W AP  +  + TD E+   WLF    K
Sbjct: 241 PTSSHQETPGPSKLLCRKCPPSMAGQLLNLIENW-APDRVESKLTDSEDQEWWLF---IK 300

Query: 301 QDGRSSTMANNNALSTVSSCGRSSNLWPRGQYLVDADVYSLPYTIPF 328
              +S  ++N       ++ G SS +WP  ++L +A+V++LP+T+PF
Sbjct: 301 FGAKSPQVSNQK-----TNQGSSSMVWPTARFLPEAEVHALPFTVPF 324

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A0A0LK14      1.4e-171  99.08     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G345960 PE=4 SV=1
A0A5A7VB55      7.2e-152  89.43     DNA ligase 1-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...
A0A1S3BBZ0      7.2e-152  89.43     uncharacterized protein LOC103488397 OS=Cucumis melo OX=3656 GN=LOC103488397 PE=...
A0A6J1CT76      7.3e-96   63.82     chromatin assembly factor 1 subunit A-like OS=Momordica charantia OX=3673 GN=LOC...
A0A6J1HBK0      1.2e-95   68.26     DNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111462522 PE=4 SV=1

NCBI nr
Match Name      E-value   Identity  Description
XP_004143000.1  2.9e-171  99.08     uncharacterized protein LOC101215840 [Cucumis sativus] >KGN62255.1 hypothetical ...
XP_008445332.1  1.5e-151  89.43     PREDICTED: uncharacterized protein LOC103488397 [Cucumis melo] >KAA0064838.1 DNA...
XP_038885448.1  2.3e-112  74.11     DNA ligase 1 isoform X1 [Benincasa hispida]
XP_038885449.1  1.7e-107  72.92     nucleoporin GLE1 isoform X2 [Benincasa hispida]
XP_023545923.1  2.7e-97   65.82     uncharacterized protein LOC111805212 isoform X1 [Cucurbita pepo subsp. pepo]

TAIR 10
Match Name      E-value   Identity  Description
AT1G20100.1     3.1e-14   31.04     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G75860.1     3.8e-12   30.15     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G35940.1     2.8e-07   42.06     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G35940.2     6.2e-07   25.96     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G17787.1     3.8e-04   25.07     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description   Alignment
None      No IPR available  COILS        Coil           Coil                 coord: 37..71
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 26..201
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 75..114
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 60..74
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 163..201
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 215..242
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 134..149
None      No IPR available  PANTHER      PTHR34660:SF7  DNA LIGASE           coord: 1..327
None      No IPR available  PANTHER      PTHR34660      MYB-LIKE PROTEIN X   coord: 1..327
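
Several of the mobidb-lite disorder coordinates above overlap one another, so the total predicted-disordered span is easier to read after merging them. The sketch below merges the listed intervals and reports how many residues they cover; it assumes inclusive residue coordinates and a protein length of 327 (taken from the PANTHER rows), and the function name `merge_intervals` is hypothetical.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # Adjacent intervals (e.g. 60..74 and 75..114) are joined as well.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(iv) for iv in merged]

# mobidb-lite coordinates as listed in the InterPro table.
disorder = [(26, 201), (75, 114), (60, 74), (163, 201), (215, 242), (134, 149)]
merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)

print(merged)                          # [(26, 201), (215, 242)]
print(covered)                         # 204
print(round(100 * covered / 327, 1))   # 62.4 (percent of the 327-residue protein)
```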

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI02G16720.1  CSPI02G16720.1  mRNA