CsGy4G023810 (gene) Cucumber (Gy14) v2

Name: CsGy4G023810
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: putative nuclease HARBI1
Location: Chr4 : 29754132 .. 29755622 (+)
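As a quick consistency check on the location above, the sketch below parses the coordinate string and derives the feature length. It assumes the coordinates are 1-based and inclusive (the usual GFF3 convention, not stated on this page); `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr4 : 29754132 .. 29755622 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr4 : 29754132 .. 29755622 (+)")
length = span_length(start, end)
# Number of whole codons and leftover bases if the span were one open reading frame.
codons, remainder = divmod(length, 3)
print(chrom, strand, length, codons, remainder)
```

Under that assumption the span is 1491 bp, i.e. 497 codons with no remainder, which is consistent with a 496-residue protein plus one stop codon.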
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATGATTCCACCAACGGAAACGTGAGGAAGAGGACTAGAGCTGATGAAGTCGATGAAGACGACGATTTGATGGGAAAAAATGGCGGAGGAAAGGGTTTGAAAGGATTGGTTACGTCTCTGTTGTTGTTGGATGAACAGGACAAGTGTGAACAGGATGAACAAGACAGAATTTCCGTGGAGGCGAAGATTTCGATGGAGGTGAATCACAGGAAGAAGACCAAAGCTATGGTTGATTTCTATTCCGAAGTTCAAGATTACTATTCTGAAGTTGAAGAATCCGACCGAATGAAACGGAAGAAATCGCGATTGGCAGCTAACTCTGTTGCGGTTGCGGCCGTTTCCGATGGATTACAGAAAATCGAAAGCGAAAAATCAAACAAACGCGGCGGCGATGGCGGTGGAGGAAGCGGTGGTGGTGGTGGCCATCACCGGAGACTCTGGGTAAAAGATAGGTCAAAAGCTTGGTGGGATGAATGTAACAGTCCCGATTATCCCGATGAAGAATTCAAGAAGCAATTCAGAATGGGTAGAGCAACTTTCGATATGATTTGTGAAGAACTTAATTCCGCCATAGCTAAAGAAGACACAACCCTCCGAACCGCCATTCCCGTCCAGCAAAGAGTCGCGGTTTGCCTATGGAGATTAGCCACCGGCGATCCACTTCGAGTTGTATCGAAGAAATTCGGATTAGGTATTTCAACTTGCCACAAACTTGTTCTCGAGGTTTGCACAGCCATTAGAACAGTACTAATGCCGAAGCATCTTCAATGGCCGGAAGAAGAAACACTCAGAAGAATCAAAGAAGAATACGAATCAATTTCCGGAATCCCTAACGTCGTTGGTTCAATGTACACCACACACATTCCGATCATCGCTCCCAAAATCAGCGTCGCAGCTTATTTCAACAAACGCCATACAGAAAGAAATCAAAAAACATCATACTCAATTACAGTTCAAGGAGTGGTGGATCCAAGAGGAGTTTTCACGGACGTTTGCATCGGTTGGCCGGGATCAATGCCAGACGATCAAGTTCTTGAGAAATCTGCTCTGTTTCAAAGAGCAAATGGGGGATTACTGAAAGGAGTTTGGATTGTTGGAGGATCAAGTTATCCATTAATGGATTGGGTTTTAGTTCCTTATACACAGCAACATTTAACATGGACACAACATGCATTTAACGAGAAGATTGGAGAGATTCAGAAGGTGGCTAAAGATGCATTTGCACGGCTGAAGGGACGGTGGCGCTGCCTACAGAAACGGACAGAGGTGAAGCTTCAAGATTTGCCGGTGGTGCTCGGAGCTTGTTGTGTTCTTCATAATATTTGTGAATTAGGGAATCAAGAAATGGATACAGAGCTTTTAACAGAGCTTCAAGATGATGAAATGGCACCTGAAATGGCTTTAAGGTCAGTACCTTCCATGAAAGCAAGAGATGCCATTGCTCATAATCTGCTCCACCATGGCCTTGCTGGGACTTCTTTTCTTTAA

mRNA sequence

ATGAATGATTCCACCAACGGAAACGTGAGGAAGAGGACTAGAGCTGATGAAGTCGATGAAGACGACGATTTGATGGGAAAAAATGGCGGAGGAAAGGGTTTGAAAGGATTGGTTACGTCTCTGTTGTTGTTGGATGAACAGGACAAGTGTGAACAGGATGAACAAGACAGAATTTCCGTGGAGGCGAAGATTTCGATGGAGGTGAATCACAGGAAGAAGACCAAAGCTATGGTTGATTTCTATTCCGAAGTTCAAGATTACTATTCTGAAGTTGAAGAATCCGACCGAATGAAACGGAAGAAATCGCGATTGGCAGCTAACTCTGTTGCGGTTGCGGCCGTTTCCGATGGATTACAGAAAATCGAAAGCGAAAAATCAAACAAACGCGGCGGCGATGGCGGTGGAGGAAGCGGTGGTGGTGGTGGCCATCACCGGAGACTCTGGGTAAAAGATAGGTCAAAAGCTTGGTGGGATGAATGTAACAGTCCCGATTATCCCGATGAAGAATTCAAGAAGCAATTCAGAATGGGTAGAGCAACTTTCGATATGATTTGTGAAGAACTTAATTCCGCCATAGCTAAAGAAGACACAACCCTCCGAACCGCCATTCCCGTCCAGCAAAGAGTCGCGGTTTGCCTATGGAGATTAGCCACCGGCGATCCACTTCGAGTTGTATCGAAGAAATTCGGATTAGGTATTTCAACTTGCCACAAACTTGTTCTCGAGGTTTGCACAGCCATTAGAACAGTACTAATGCCGAAGCATCTTCAATGGCCGGAAGAAGAAACACTCAGAAGAATCAAAGAAGAATACGAATCAATTTCCGGAATCCCTAACGTCGTTGGTTCAATGTACACCACACACATTCCGATCATCGCTCCCAAAATCAGCGTCGCAGCTTATTTCAACAAACGCCATACAGAAAGAAATCAAAAAACATCATACTCAATTACAGTTCAAGGAGTGGTGGATCCAAGAGGAGTTTTCACGGACGTTTGCATCGGTTGGCCGGGATCAATGCCAGACGATCAAGTTCTTGAGAAATCTGCTCTGTTTCAAAGAGCAAATGGGGGATTACTGAAAGGAGTTTGGATTGTTGGAGGATCAAGTTATCCATTAATGGATTGGGTTTTAGTTCCTTATACACAGCAACATTTAACATGGACACAACATGCATTTAACGAGAAGATTGGAGAGATTCAGAAGGTGGCTAAAGATGCATTTGCACGGCTGAAGGGACGGTGGCGCTGCCTACAGAAACGGACAGAGGTGAAGCTTCAAGATTTGCCGGTGGTGCTCGGAGCTTGTTGTGTTCTTCATAATATTTGTGAATTAGGGAATCAAGAAATGGATACAGAGCTTTTAACAGAGCTTCAAGATGATGAAATGGCACCTGAAATGGCTTTAAGGTCAGTACCTTCCATGAAAGCAAGAGATGCCATTGCTCATAATCTGCTCCACCATGGCCTTGCTGGGACTTCTTTTCTTTAA

Coding sequence (CDS)

ATGAATGATTCCACCAACGGAAACGTGAGGAAGAGGACTAGAGCTGATGAAGTCGATGAAGACGACGATTTGATGGGAAAAAATGGCGGAGGAAAGGGTTTGAAAGGATTGGTTACGTCTCTGTTGTTGTTGGATGAACAGGACAAGTGTGAACAGGATGAACAAGACAGAATTTCCGTGGAGGCGAAGATTTCGATGGAGGTGAATCACAGGAAGAAGACCAAAGCTATGGTTGATTTCTATTCCGAAGTTCAAGATTACTATTCTGAAGTTGAAGAATCCGACCGAATGAAACGGAAGAAATCGCGATTGGCAGCTAACTCTGTTGCGGTTGCGGCCGTTTCCGATGGATTACAGAAAATCGAAAGCGAAAAATCAAACAAACGCGGCGGCGATGGCGGTGGAGGAAGCGGTGGTGGTGGTGGCCATCACCGGAGACTCTGGGTAAAAGATAGGTCAAAAGCTTGGTGGGATGAATGTAACAGTCCCGATTATCCCGATGAAGAATTCAAGAAGCAATTCAGAATGGGTAGAGCAACTTTCGATATGATTTGTGAAGAACTTAATTCCGCCATAGCTAAAGAAGACACAACCCTCCGAACCGCCATTCCCGTCCAGCAAAGAGTCGCGGTTTGCCTATGGAGATTAGCCACCGGCGATCCACTTCGAGTTGTATCGAAGAAATTCGGATTAGGTATTTCAACTTGCCACAAACTTGTTCTCGAGGTTTGCACAGCCATTAGAACAGTACTAATGCCGAAGCATCTTCAATGGCCGGAAGAAGAAACACTCAGAAGAATCAAAGAAGAATACGAATCAATTTCCGGAATCCCTAACGTCGTTGGTTCAATGTACACCACACACATTCCGATCATCGCTCCCAAAATCAGCGTCGCAGCTTATTTCAACAAACGCCATACAGAAAGAAATCAAAAAACATCATACTCAATTACAGTTCAAGGAGTGGTGGATCCAAGAGGAGTTTTCACGGACGTTTGCATCGGTTGGCCGGGATCAATGCCAGACGATCAAGTTCTTGAGAAATCTGCTCTGTTTCAAAGAGCAAATGGGGGATTACTGAAAGGAGTTTGGATTGTTGGAGGATCAAGTTATCCATTAATGGATTGGGTTTTAGTTCCTTATACACAGCAACATTTAACATGGACACAACATGCATTTAACGAGAAGATTGGAGAGATTCAGAAGGTGGCTAAAGATGCATTTGCACGGCTGAAGGGACGGTGGCGCTGCCTACAGAAACGGACAGAGGTGAAGCTTCAAGATTTGCCGGTGGTGCTCGGAGCTTGTTGTGTTCTTCATAATATTTGTGAATTAGGGAATCAAGAAATGGATACAGAGCTTTTAACAGAGCTTCAAGATGATGAAATGGCACCTGAAATGGCTTTAAGGTCAGTACCTTCCATGAAAGCAAGAGATGCCATTGCTCATAATCTGCTCCACCATGGCCTTGCTGGGACTTCTTTTCTTTAA

Protein sequence

MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRKKSRLAANSVAVAAVSDGLQKIESEKSNKRGGDGGGGSGGGGGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDAIAHNLLHHGLAGTSFL
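The relationship between the coding sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1) and reading frame 0; `translate` is a hypothetical helper, not a function offered by this database. Only the first and last few codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0; 'X' marks unknown codons."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)

# First 24 and last 15 nucleotides of the CDS listed above.
print(translate("ATGAATGATTCCACCAACGGAAAC"))  # start of the protein
print(translate("ACTTCTTTTCTTTAA"))           # end of the protein, stop dropped
```

The two fragments translate to `MNDSTNGN` and `TSFL`, matching the start and end of the protein sequence shown above.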
BLAST of CsGy4G023810 vs. NCBI nr
Match: XP_004141329.1 (PREDICTED: putative nuclease HARBI1 [Cucumis sativus] >KGN55352.1 hypothetical protein Csa_4G646260 [Cucumis sativus])

HSP 1 Score: 922.5 bits (2383), Expect = 5.9e-265
Identity = 496/496 (100.00%), Positives = 496/496 (100.00%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV
Sbjct: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK
Sbjct: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 496

BLAST of CsGy4G023810 vs. NCBI nr
Match: XP_008452747.1 (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 910.2 bits (2351), Expect = 3.0e-261
Identity = 490/496 (98.79%), Positives = 493/496 (99.40%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV
Sbjct: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMK XX RLAANSVAVAAVSDGLQ+
Sbjct: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKXXXXRLAANSVAVAAVSDGLQR 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE+EK XXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IENEK-XXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEE+ESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEFESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 495
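The identity and positives figures in each HSP header are simple ratios over the aligned length. The sketch below parses one such header line and recomputes the percentages; the regular expression and the names `parse_hsp_stats` and `percent` are assumptions made for illustration, and the label for the second ratio is matched loosely.

```python
import re

# Matches e.g. "Identity = 490/496 (98.79%), Positives = 493/496 (99.40%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w+\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract counts and reported percentages from an HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, aligned, ident_pct, pos, _pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned": int(aligned),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }

def percent(count, total):
    # Ratio expressed as a percentage, rounded to two decimals as in the report.
    return round(100.0 * count / total, 2)

stats = parse_hsp_stats(
    "Identity = 490/496 (98.79%), Positives = 493/496 (99.40%), Query Frame = 0"
)
print(percent(stats["identical"], stats["aligned"]),
      percent(stats["positives"], stats["aligned"]))
```

For the Cucumis melo hit this recomputes 98.79 and 99.4, in agreement with the reported values.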

BLAST of CsGy4G023810 vs. NCBI nr
Match: XP_022977009.1 (protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita maxima])

HSP 1 Score: 860.1 bits (2221), Expect = 3.6e-246
Identity = 437/496 (88.10%), Positives = 447/496 (90.12%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNG  RKR R DE DEDD  +GKNG GK LKGLVTSLLLLDEQ+K EQ+E DR S+
Sbjct: 1   MNDSTNGGARKRNRGDEADEDDGSIGKNGRGKELKGLVTSLLLLDEQEKYEQEEHDRASM 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAK+SMEVNHRKKTKAM DFYSE Q XXXXX     +KR  SRLAANSVAVAA SDGLQK
Sbjct: 61  EAKVSMEVNHRKKTKAMDDFYSEAQDXXXXXEESDRLKRKKSRLAANSVAVAAASDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE  K                     LWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IEIVK-------SNKRGGDGGGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL+
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLI 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPL+DWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLLDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGN+EMD EL TELQDDEMAPE+ALRSV SMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNEEMDRELSTELQDDEMAPEVALRSVSSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 489

BLAST of CsGy4G023810 vs. NCBI nr
Match: XP_022936710.1 (protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita moschata])

HSP 1 Score: 858.6 bits (2217), Expect = 1.0e-245
Identity = 451/496 (90.93%), Positives = 461/496 (92.94%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNG  RKR R DE DEDD  +GKNG GK LKGLVTSLLLLDEQ+K EQ+E DR S+
Sbjct: 1   MNDSTNGGARKRNRGDEADEDDGSIGKNGRGKELKGLVTSLLLLDEQEKYEQEEHDRASM 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAK+SMEVNHRKKTKAM DFYSE Q  XXXX     +KR  SRLAANSVAVAA SDGLQK
Sbjct: 61  EAKVSMEVNHRKKTKAMDDFYSEAQDYXXXXEESDRLKRKKSRLAANSVAVAAASDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE        XXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IE------IVXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL+
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLI 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPL+DWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLLDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGN+EMD EL TELQDDEMAPE+ALRSV SMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNEEMDRELSTELQDDEMAPEVALRSVSSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 490

BLAST of CsGy4G023810 vs. NCBI nr
Match: XP_023535595.1 (protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 858.6 bits (2217), Expect = 1.0e-245
Identity = 450/496 (90.73%), Positives = 460/496 (92.74%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNG  RKR R DE DEDD  +GKNG GK LKGLVTSLLLLDEQ+K EQ+E DR S+
Sbjct: 1   MNDSTNGGARKRNRGDEADEDDGSIGKNGRGKELKGLVTSLLLLDEQEKYEQEEHDRASM 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAK+SMEVNHRKKTK M DFYSE Q  XXX      +KR  SRLAANSVAVAA SDGLQK
Sbjct: 61  EAKVSMEVNHRKKTKVMDDFYSEAQDYXXXVEESDRLKRKKSRLAANSVAVAAASDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE  K     XXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IEIVK-----XXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL+
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLI 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPL+DWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLLDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGN+EMD EL TELQDDEMAPE+ALRSV SMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNEEMDRELSTELQDDEMAPEVALRSVSSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 491

BLAST of CsGy4G023810 vs. TAIR10
Match: AT5G12010.1 (unknown protein)

HSP 1 Score: 599.0 bits (1543), Expect = 2.7e-171
Identity = 313/466 (67.17%), Positives = 374/466 (80.26%), Query Frame = 0

Query: 32  KGLKGLVTSLLLLDEQDKCEQDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQXXXXXX 91
           K LKG  T           +Q+ ++  S       + N+RK+ + M D+YS++ XXXXXX
Sbjct: 41  KNLKGFFTXXXXXXXXXXXDQEARNAASRREMSDFQSNYRKRARTMSDYYSDLXXXXXXX 100

Query: 92  XXXXXMKRXXSRLAANSVAVAAVSDGLQKIESEKXXXXXXXXXXXXXXXXXXXXXLWVKD 151
                +    SR++                    XXXXXXXXXX   XXXXXXXXLWVKD
Sbjct: 101 EESGDINLKKSRVS----RAVXXXXXXXXXXXXXXXXXXXXXXXVRGXXXXXXXXLWVKD 160

Query: 152 RSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAV 211
           RS+AWW+EC+  DYP+E+FKK FRM ++TF++IC+ELNSA+AKEDT LR AIPV+QRVAV
Sbjct: 161 RSRAWWEECSRLDYPEEDFKKAFRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAV 220

Query: 212 CLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEY 271
           C+WRLATG+PLR+VSKKFGLGISTCHKLVLEVC AI+ VLMPK+LQWP++E+LR I+E +
Sbjct: 221 CIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERF 280

Query: 272 ESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTD 331
           ES+SGIPNVVGSMYTTHIPIIAPKISVA+YFNKRHTERNQKTSYSIT+Q VV+P+GVFTD
Sbjct: 281 ESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTD 340

Query: 332 VCIGWPGSMPDDQVLEKSALFQRA-NGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQ 391
           +CIGWPGSMPDD+VLEKS L+QRA NGGLLKG+W+ GG  +PL+DWVLVPYTQQ+LTWTQ
Sbjct: 341 LCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGMWVAGGPGHPLLDWVLVPYTQQNLTWTQ 400

Query: 392 HAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEM 451
           HAFNEK+ E+Q VAK+AF RLKGRW CLQKRTEVKLQDLP VLGACCVLHNICE+  ++M
Sbjct: 401 HAFNEKMSEVQGVAKEAFGRLKGRWACLQKRTEVKLQDLPTVLGACCVLHNICEMREEKM 460

Query: 452 DTELLTELQDDEMAPEMALRSVPSMKARDAIAHNLLHHGLAGTSFL 497
           + EL+ E+ DDE+ PE  LRSV +MKARD I+HNLLHHGLAGTSFL
Sbjct: 461 EPELMVEVIDDEVLPENVLRSVNAMKARDTISHNLLHHGLAGTSFL 502

BLAST of CsGy4G023810 vs. TAIR10
Match: AT4G29780.1 (unknown protein)

HSP 1 Score: 502.3 bits (1292), Expect = 3.5e-142
Identity = 247/438 (56.39%), Positives = 312/438 (71.23%), Query Frame = 0

Query: 60  VEAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQ 119
           ++ K  +E NH+KK K M  +Y+++Q            +   +R  A +  V+AV+ G  
Sbjct: 105 IKEKSLLEANHKKKVKTMDGYYNQMQDHYSAAGETDGSRSKRARKTAVAAVVSAVASGAD 164

Query: 120 KIESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRA 179
              +                       LWVK+R+  WWD  + PD+P++EF+++FRM ++
Sbjct: 165 --TTGLAAPVPTADIASGSGSGPSHRRLWVKERTTDWWDRVSRPDFPEDEFRREFRMSKS 224

Query: 180 TFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL 239
           TF++ICEEL++ + K++T LR AIP  +RV VC+WRLATG PLR VS++FGLGISTCHKL
Sbjct: 225 TFNLICEELDTTVTKKNTMLRDAIPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKL 284

Query: 240 VLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVA 299
           V+EVC AI  VLMPK+L WP +  +   K ++ES+  IPNVVGS+YTTHIPIIAPK+ VA
Sbjct: 285 VIEVCRAIYDVLMPKYLLWPSDSEINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVA 344

Query: 300 AYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALF-QRANGG 359
           AYFNKRHTERNQKTSYSITVQGVV+  G+FTDVCIG PGS+ DDQ+LEKS+L  QRA  G
Sbjct: 345 AYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPGSLTDDQILEKSSLSRQRAARG 404

Query: 360 LLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCL 419
           +L+  WIVG S +PL D++LVPYT+Q+LTWTQHAFNE IGEIQ +A  AF RLKGRW CL
Sbjct: 405 MLRDSWIVGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACL 464

Query: 420 QKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKAR 479
           QKRTEVKLQDLP VLGACCVLHNICE+  +EM  EL  E+ DD   PE  +RS  ++  R
Sbjct: 465 QKRTEVKLQDLPYVLGACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSASAVNTR 524

Query: 480 DAIAHNLLHHGLAGTSFL 497
           D I+HNLLH GLAGT  L
Sbjct: 525 DHISHNLLHRGLAGTRTL 540

BLAST of CsGy4G023810 vs. TAIR10
Match: AT3G63270.1 (Putative harbinger transposase-derived nuclease (InterPro:IPR006912))

HSP 1 Score: 162.5 bits (410), Expect = 6.5e-40
Identity = 102/318 (32.08%), Positives = 161/318 (50.63%), Query Frame = 0

Query: 156 WWD----ECNSPDYPDEE---FKKQFRMGRATFDMICEELNSAIAKEDTTLR-------- 215
           WWD      +SP  P +E   FK  FR  + TF  IC     ++ +ED   R        
Sbjct: 44  WWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTFSYIC-----SLVREDLISRPPSGLINI 103

Query: 216 --TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQW 275
               + V+++VA+ L RLA+GD    V   FG+G ST  ++      A+       HL+W
Sbjct: 104 EGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE-RAKHHLRW 163

Query: 276 PEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSIT 335
           P+ + +  IK ++E + G+PN  G++ TTHI +  P +  +  +       +Q+ +YS+ 
Sbjct: 164 PDSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDW------CDQEKNYSMF 223

Query: 336 VQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRA-NGGLLKG------------VWI 395
           +QGV D    F ++  GWPG M   ++L+ S  F+   N  +L G             ++
Sbjct: 224 LQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYV 283

Query: 396 VGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK-RTEV 443
           VGG SYPL+ W++ P+   H + +  AFNE+  +++ VA  AF +LKG WR L K     
Sbjct: 284 VGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRP 343

BLAST of CsGy4G023810 vs. TAIR10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 161.0 bits (406), Expect = 1.9e-39
Identity = 108/333 (32.43%), Positives = 172/333 (51.65%), Query Frame = 0

Query: 156 WWDECNSPDY----PDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA--------- 215
           WWD  +   Y      + F+  F++ R TFD IC     ++ K D T + A         
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYIC-----SLVKADFTAKPANFSDSNGNP 113

Query: 216 IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEE 275
           + +  RVAV L RL +G+ L V+ + FG+  ST  ++      ++    +  HL WP + 
Sbjct: 114 LSLNDRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAI-HHLSWPSK- 173

Query: 276 TLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGV 335
            L  IK ++E ISG+PN  G++  THI +  P +  +   NK   +  +  ++S+T+Q V
Sbjct: 174 -LDEIKSKFEKISGLPNCCGAIDITHIVMNLPAVEPS---NKVWLDGEK--NFSMTLQAV 233

Query: 336 VDPRGVFTDVCIGWPGSMPDDQVLEKSALF------QRANGGLLK-------GVWIVGGS 395
           VDP   F DV  GWPGS+ DD VL+ S  +      +R NG  L          +IVG S
Sbjct: 234 VDPDMRFLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDS 293

Query: 396 SYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQD- 455
            +PL+ W+L PY  +  +  Q  FN++  E  K A+ A ++LK RWR +     +  ++ 
Sbjct: 294 GFPLLPWLLTPYQGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNR 353

Query: 456 LPVVLGACCVLHN-ICELGNQEMDTELLTELQD 461
           LP ++  CC+LHN I ++ +Q +D + L++  D
Sbjct: 354 LPRIIFVCCLLHNIIIDMEDQTLDDQPLSQQHD 373

BLAST of CsGy4G023810 vs. TAIR10
Match: AT3G19120.1 (PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 100.5 bits (249), Expect = 3.0e-21
Identity = 79/328 (24.09%), Positives = 147/328 (44.82%), Query Frame = 0

Query: 167 DEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVS 226
           D  ++  + +    F  + ++L   I    T    ++P    VA+ L RLA G   + ++
Sbjct: 114 DARWRSLYGLSYPVFITVVDKLKPFI----TASNLSLPADYAVAMVLSRLAHGCSAKTLA 173

Query: 227 KKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWP-EEETLRRIKEEYESISGIPNVVGSMY 286
            ++ L      K+   V   + T L P+ ++ P  +  L    + +E ++ +PN+ G++ 
Sbjct: 174 SRYSLDPYLISKITNMVTRLLATKLYPEFIKIPVGKRRLIETTQGFEELTSLPNICGAID 233

Query: 287 TTHIPIIAPKISVAAYFNKRHTERNQKTSY-------SITVQGVVDPRGVFTDVCIGWPG 346
           +T + +            +R T+ N +  Y       ++ +Q V D + +F DVC+  PG
Sbjct: 234 STPVKL------------RRRTKLNPRNIYGCKYGYDAVLLQVVADHKKIFWDVCVKAPG 293

Query: 347 SMPDDQVLEKSALFQRANGGLLKGVW--------------IVGGSSYPLMDWVLVPYTQQ 406
              D      S L++R   G +  VW              IVG   YPL+ +++ P++  
Sbjct: 294 GEDDSSHFRDSLLYKRLTSGDI--VWEKVINIRGHHVRPYIVGDWCYPLLSFLMTPFSPN 353

Query: 407 HL-TWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNIC 466
              T  ++ F+  + + + V  +A   LK RW+ LQ    V +   P  + ACCVLHN+C
Sbjct: 354 GSGTPPENLFDGMLMKGRSVVVEAIGLLKARWKILQS-LNVGVNHAPQTIVACCVLHNLC 413

Query: 467 ELGNQEMDTELLTELQDDEMAPEMALRS 472
           ++  +E + E+  +  D+   P   L S
Sbjct: 414 QIA-REPEPEIWKD-PDEAGTPARVLES 420

BLAST of CsGy4G023810 vs. Swiss-Prot
Match: sp|Q94K49|ALP1_ARATH (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.2e-38
Identity = 102/318 (32.08%), Positives = 161/318 (50.63%), Query Frame = 0

Query: 156 WWD----ECNSPDYPDEE---FKKQFRMGRATFDMICEELNSAIAKEDTTLR-------- 215
           WWD      +SP  P +E   FK  FR  + TF  IC     ++ +ED   R        
Sbjct: 44  WWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTFSYIC-----SLVREDLISRPPSGLINI 103

Query: 216 --TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQW 275
               + V+++VA+ L RLA+GD    V   FG+G ST  ++      A+       HL+W
Sbjct: 104 EGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE-RAKHHLRW 163

Query: 276 PEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSIT 335
           P+ + +  IK ++E + G+PN  G++ TTHI +  P +  +  +       +Q+ +YS+ 
Sbjct: 164 PDSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDW------CDQEKNYSMF 223

Query: 336 VQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRA-NGGLLKG------------VWI 395
           +QGV D    F ++  GWPG M   ++L+ S  F+   N  +L G             ++
Sbjct: 224 LQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYV 283

Query: 396 VGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK-RTEV 443
           VGG SYPL+ W++ P+   H + +  AFNE+  +++ VA  AF +LKG WR L K     
Sbjct: 284 VGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRP 343

BLAST of CsGy4G023810 vs. Swiss-Prot
Match: sp|Q9M2U3|ALPL_ARATH (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 3.4e-38
Identity = 108/333 (32.43%), Positives = 172/333 (51.65%), Query Frame = 0

Query: 156 WWDECNSPDY----PDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA--------- 215
           WWD  +   Y      + F+  F++ R TFD IC     ++ K D T + A         
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYIC-----SLVKADFTAKPANFSDSNGNP 113

Query: 216 IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEE 275
           + +  RVAV L RL +G+ L V+ + FG+  ST  ++      ++    +  HL WP + 
Sbjct: 114 LSLNDRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAI-HHLSWPSK- 173

Query: 276 TLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGV 335
            L  IK ++E ISG+PN  G++  THI +  P +  +   NK   +  +  ++S+T+Q V
Sbjct: 174 -LDEIKSKFEKISGLPNCCGAIDITHIVMNLPAVEPS---NKVWLDGEK--NFSMTLQAV 233

Query: 336 VDPRGVFTDVCIGWPGSMPDDQVLEKSALF------QRANGGLLK-------GVWIVGGS 395
           VDP   F DV  GWPGS+ DD VL+ S  +      +R NG  L          +IVG S
Sbjct: 234 VDPDMRFLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDS 293

Query: 396 SYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQD- 455
            +PL+ W+L PY  +  +  Q  FN++  E  K A+ A ++LK RWR +     +  ++ 
Sbjct: 294 GFPLLPWLLTPYQGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNR 353

Query: 456 LPVVLGACCVLHN-ICELGNQEMDTELLTELQD 461
           LP ++  CC+LHN I ++ +Q +D + L++  D
Sbjct: 354 LPRIIFVCCLLHNIIIDMEDQTLDDQPLSQQHD 373

BLAST of CsGy4G023810 vs. Swiss-Prot
Match: sp|Q17QR8|HARB1_BOVIN (Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 3.2e-20
Identity = 77/292 (26.37%), Positives = 134/292 (45.89%), Query Frame = 0

Query: 158 DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLA 217
           D     D  DE     +   R     + E L +++++  T    AI  + ++   L    
Sbjct: 23  DRFKLDDVTDEYLMSMYGFPRQFIYYLVELLGASLSR-PTQRSRAISPETQILAALGFYT 82

Query: 218 TGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWP-EEETLRRIKEEYESISG 277
           +G     +    G+  ++  + V  V  A+      + + +P +E +++ +K+E+  ++G
Sbjct: 83  SGSFQTRMGDAIGISQASMSRCVANVTEAL-VERASQFIHFPADEASVQALKDEFYGLAG 142

Query: 278 IPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGW 337
           IP V+G +   H+ I AP     +Y N+       K  +S+    V D RG    V   W
Sbjct: 143 IPGVIGVVDCMHVAIKAPNAEDLSYVNR-------KGLHSLNCLMVCDIRGALMTVETSW 202

Query: 338 PGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHL--TWTQHAFN 397
           PGS+ D  VL++S+L  +   G+ K  W++G SS+ L  W++ P    H+  T  ++ +N
Sbjct: 203 PGSLQDCVVLQQSSLSSQFEAGMHKESWLLGDSSFFLRTWLMTPL---HIPETPAEYRYN 262

Query: 398 EKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPV----VLGACCVLHNI 443
                   V +  F  L  R+RCL   ++  LQ  P     ++ ACCVLHNI
Sbjct: 263 MAHSATHSVIEKTFRTLCSRFRCLD-GSKGALQYSPEKSSHIILACCVLHNI 301

BLAST of CsGy4G023810 vs. Swiss-Prot
Match: sp|Q96MB7|HARB1_HUMAN (Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 7.1e-20
Identity = 77/292 (26.37%), Positives = 134/292 (45.89%), Query Frame = 0

Query: 158 DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLA 217
           D     D  DE     +   R     + E L + +++  T    AI  + +V   L    
Sbjct: 23  DRFKLDDVTDEYLMSMYGFPRQFIYYLVELLGANLSR-PTQRSRAISPETQVLAALGFYT 82

Query: 218 TGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWP-EEETLRRIKEEYESISG 277
           +G     +    G+  ++  + V  V  A+      + +++P +E +++ +K+E+  ++G
Sbjct: 83  SGSFQTRMGDAIGISQASMSRCVANVTEAL-VERASQFIRFPADEASIQALKDEFYGLAG 142

Query: 278 IPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGW 337
           +P V+G +   H+ I AP     +Y N+       K  +S+    V D RG    V   W
Sbjct: 143 MPGVMGVVDCIHVAIKAPNAEDLSYVNR-------KGLHSLNCLMVCDIRGTLMTVETNW 202

Query: 338 PGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHL--TWTQHAFN 397
           PGS+ D  VL++S+L  +   G+ K  W++G SS+ L  W++ P    H+  T  ++ +N
Sbjct: 203 PGSLQDCAVLQQSSLSSQFEAGMHKDSWLLGDSSFFLRTWLMTPL---HIPETPAEYRYN 262

Query: 398 EKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPV----VLGACCVLHNI 443
                   V +  F  L  R+RCL   ++  LQ  P     ++ ACCVLHNI
Sbjct: 263 MAHSATHSVIEKTFRTLCSRFRCLD-GSKGALQYSPEKSSHIILACCVLHNI 301

BLAST of CsGy4G023810 vs. Swiss-Prot
Match: sp|B0BN95|HARB1_RAT (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 1.6e-19
Identity = 76/292 (26.03%), Positives = 133/292 (45.55%), Query Frame = 0

Query: 158 DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLA 217
           D     D  DE     +   R     + E L +++++  T    AI  + ++   L    
Sbjct: 23  DRFKLDDVTDEYLMSMYGFPRQFIYYLVELLGASLSR-PTQRSRAISPETQILAALGFYT 82

Query: 218 TGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWP-EEETLRRIKEEYESISG 277
           +G     +    G+  ++  + V  V  A+      + + +P +E  ++ +K+E+  ++G
Sbjct: 83  SGSFQTRMGDAIGISQASMSRCVANVTEAL-VERASQFIHFPADEAAIQSLKDEFYGLAG 142

Query: 278 IPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGW 337
           +P V+G++   H+ I AP     +Y N+       K  +S+    V D RG    V   W
Sbjct: 143 MPGVIGAVDCIHVAIKAPNAEDLSYVNR-------KGLHSLNCLVVCDIRGALMTVETSW 202

Query: 338 PGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHL--TWTQHAFN 397
           PGS+ D  VL++S+L  +   G+ K  W++G SS+ L  W+L P    H+  T  ++ +N
Sbjct: 203 PGSLQDCAVLQQSSLSSQFETGMPKDSWLLGDSSFFLHTWLLTPL---HIPETPAEYRYN 262

Query: 398 EKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPV----VLGACCVLHNI 443
                   V +     L  R+RCL   ++  LQ  P     ++ ACCVLHNI
Sbjct: 263 RAHSATHSVIEKTLRTLCCRFRCLD-GSKGALQYSPEKSSHIILACCVLHNI 301

BLAST of CsGy4G023810 vs. TrEMBL
Match: tr|A0A0A0L420|A0A0A0L420_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G646260 PE=4 SV=1)

HSP 1 Score: 922.5 bits (2383), Expect = 3.9e-265
Identity = 496/496 (100.00%), Positives = 496/496 (100.00%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV
Sbjct: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK
Sbjct: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 496

BLAST of CsGy4G023810 vs. TrEMBL
Match: tr|A0A1S3BVR8|A0A1S3BVR8_CUCME (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103493676 PE=4 SV=1)

HSP 1 Score: 910.2 bits (2351), Expect = 2.0e-261
Identity = 490/496 (98.79%), Positives = 493/496 (99.40%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV
Sbjct: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMK XX RLAANSVAVAAVSDGLQ+
Sbjct: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKXXXXRLAANSVAVAAVSDGLQR 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE+EK XXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IENEK-XXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEE+ESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEFESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 495

BLAST of CsGy4G023810 vs. TrEMBL
Match: tr|A0A1S2Y3N5|A0A1S2Y3N5_CICAR (uncharacterized protein LOC101491352 OS=Cicer arietinum OX=3827 GN=LOC101491352 PE=4 SV=1)

HSP 1 Score: 693.7 bits (1789), Expect = 2.9e-196
Identity = 338/465 (72.69%), Positives = 385/465 (82.80%), Query Frame = 0

Query: 34  LKGLVTSLLLLDEQDKCEQDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXX 93
           LKG++TS+LLLDEQ+K E +  +++  + K  +E NH+KKTKAMVD+Y+ +         
Sbjct: 80  LKGILTSILLLDEQEKQEFENNNKVLEDEKFCLETNHKKKTKAMVDYYTNLDDSYSQVEE 139

Query: 94  XXXMKRXXSRLAANSVAVAAV--SDGLQKIESEKXXXXXXXXXXXXXXXXXXXXXLWVKD 153
              ++R  +R  ++SVA+AA   SDG+++  SE                      LWVKD
Sbjct: 140 SERVRRKKTRNMSSSVAIAATTFSDGIEETNSES-VVNNMKNNDNSSGKSGSQRRLWVKD 199

Query: 154 RSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAV 213
           RS AWWDECN  D+P+ EF+K FRMG++TFD+ICEELNSAI KEDTTLR AIPV+QRVAV
Sbjct: 200 RSGAWWDECNKDDFPENEFRKAFRMGKSTFDLICEELNSAIVKEDTTLRNAIPVRQRVAV 259

Query: 214 CLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEY 273
           CLWRLATGDPLR+VSK+FGLGISTCHKLVLEVCTAI+TVLMPK+LQWP E  LR+IK E+
Sbjct: 260 CLWRLATGDPLRIVSKRFGLGISTCHKLVLEVCTAIKTVLMPKYLQWPNEVNLRKIKGEF 319

Query: 274 ESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTD 333
           ESISGIPNVVGSMYT+H+PIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDP GVFTD
Sbjct: 320 ESISGIPNVVGSMYTSHVPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPNGVFTD 379

Query: 334 VCIGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQH 393
           VCIGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVG SSYPLMDWVLVPY QQ+LTWTQH
Sbjct: 380 VCIGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVGSSSYPLMDWVLVPYNQQNLTWTQH 439

Query: 394 AFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMD 453
            FNEKIGEIQKVAKDAF RLKGRW CLQKRTEVKLQDLPVVLGACCVLHNICE+  ++M+
Sbjct: 440 GFNEKIGEIQKVAKDAFGRLKGRWCCLQKRTEVKLQDLPVVLGACCVLHNICEMKGEKME 499

Query: 454 TELLTELQDDEMAPEMALRSVPSMKARDAIAHNLLHHGLAGTSFL 497
            EL  ++ DDEM PE+ LRSV S+KARDAIAHNLLHHGLAGTSFL
Sbjct: 500 DELKVDVLDDEMVPEVGLRSVNSLKARDAIAHNLLHHGLAGTSFL 543

BLAST of CsGy4G023810 vs. TrEMBL
Match: tr|B9RQS8|B9RQS8_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0706300 PE=4 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 1.6e-194
Identity = 352/505 (69.70%), Positives = 408/505 (80.79%), Query Frame = 0

Query: 1   MNDSTNGNVRKRT--RADEVDEDDDLMGKNGGG-------KGLKGLVTSLLLLDEQDKCE 60
           MN++ N   R+R   R + VD+DD    +           K L G++TSL+L+++Q+KC+
Sbjct: 1   MNETNNTKKRQRKGYRQESVDKDDSNSFEEDSNNTTSLKTKDLSGIITSLILIEDQEKCD 60

Query: 61  QDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAV 120
           Q+E++R   E K  +E NH+KKT+  V++YS +Q           +KR  SR  A + A+
Sbjct: 61  QEEENRAFSEEKHLLEANHKKKTRTAVEYYSNLQDYYSEIEETDRVKRKKSRAIAGAAAI 120

Query: 121 AAVSDGLQKIESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFK 180
           +A S+G+               XXXXXXXXXXXXXLWVKDR K WWDECN PDYP+EEFK
Sbjct: 121 SASSNGVAN-------KATGDAXXXXXXXXXXXXXLWVKDRDKEWWDECNRPDYPEEEFK 180

Query: 181 KQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGL 240
           K FRM +ATFD+ICEEL+S I KEDTTLR AIPV+QRVAVC+WRLATG+PLR+VSK+FGL
Sbjct: 181 KAFRMSKATFDLICEELHSCIQKEDTTLRNAIPVRQRVAVCIWRLATGEPLRLVSKRFGL 240

Query: 241 GISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPI 300
           GISTCHKLVLEVC+AI+ VLMPK+LQWP+E++L+++K E+ESISGIPNVVGSMYTTHIPI
Sbjct: 241 GISTCHKLVLEVCSAIKNVLMPKYLQWPDEDSLKKVKNEFESISGIPNVVGSMYTTHIPI 300

Query: 301 IAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSAL 360
           IAPKISVAAYFNKRHTERNQKTSYSITVQGVVDP+GVFTDVCIGWPGSMPDDQVLEKSAL
Sbjct: 301 IAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPKGVFTDVCIGWPGSMPDDQVLEKSAL 360

Query: 361 FQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARL 420
           +QRANGGLLK VWIVG S YPLMDWVLVPYTQQHLTWTQHAFNEKIGE+Q VAK+AF RL
Sbjct: 361 YQRANGGLLKDVWIVGSSGYPLMDWVLVPYTQQHLTWTQHAFNEKIGEVQTVAKEAFTRL 420

Query: 421 KGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRS 480
           KGRW CLQKRTEVKLQDLPVVLGACCVLHNICEL  +E+D +L  EL DDEM PE+ALRS
Sbjct: 421 KGRWSCLQKRTEVKLQDLPVVLGACCVLHNICELRKEEIDPKLRVELVDDEMVPEVALRS 480

Query: 481 VPSMKARDAIAHNLLHHGLAGTSFL 497
             SMKARDAIAHNLLHH  AGT FL
Sbjct: 481 ASSMKARDAIAHNLLHHCHAGTGFL 498

BLAST of CsGy4G023810 vs. TrEMBL
Match: tr|A0A218VSM8|A0A218VSM8_PUNGR (Uncharacterized protein OS=Punica granatum OX=22663 GN=CDL15_Pgr022111 PE=4 SV=1)

HSP 1 Score: 673.3 bits (1736), Expect = 4.1e-190
Identity = 332/463 (71.71%), Positives = 377/463 (81.43%), Query Frame = 0

Query: 34  LKGLVTSLLLLDEQDKCEQDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXX 93
           LKG++TSL LL++Q+K +  E++  +VE +  +E N+RKK++A  DFYS V+        
Sbjct: 60  LKGIITSLSLLEDQEKEDLREREVAAVEERQLLENNYRKKSRATADFYSNVEDYYAETDE 119

Query: 94  XXXMKRXXSRLAANSVAVAAVSDGLQKIESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRS 153
               +R  SR  A +VA     +G+ K +SEK                     LWVKDRS
Sbjct: 120 LDRTRRKKSRALAGAVAAGIAEEGVLK-KSEK-------GGKKSGGEGGQSRRLWVKDRS 179

Query: 154 KAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCL 213
            +WWDECN PDYP+ EF+K FRMGR TFD+ICEELNSAIAKEDT LR AIPV+QRVAVC+
Sbjct: 180 NSWWDECNRPDYPEHEFRKAFRMGRKTFDVICEELNSAIAKEDTALRNAIPVRQRVAVCI 239

Query: 214 WRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYES 273
           WRLATG+PLR+VSKKFGLGISTCHKLVLEVC AI++VLMPK LQWPE+  LR+IKEE+ES
Sbjct: 240 WRLATGEPLRLVSKKFGLGISTCHKLVLEVCAAIKSVLMPKFLQWPED--LRKIKEEFES 299

Query: 274 ISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVC 333
           +S IPNVVGSMYTTH+PIIAPKISVAAYFNKRHTERNQKTSYSIT+QGVVDPRGVFTDVC
Sbjct: 300 VSAIPNVVGSMYTTHVPIIAPKISVAAYFNKRHTERNQKTSYSITLQGVVDPRGVFTDVC 359

Query: 334 IGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAF 393
           IGWPGSMPDDQVLEKSAL+QRA GGLLKGVWIVGGS YPL+DWVLVPYTQ +LTWTQHAF
Sbjct: 360 IGWPGSMPDDQVLEKSALYQRAQGGLLKGVWIVGGSGYPLLDWVLVPYTQPNLTWTQHAF 419

Query: 394 NEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTE 453
           NEKIGE+Q VAKDAFARLKGRW CLQKRTEVKLQDLP+VLGACCVLHNICE+  +EMD E
Sbjct: 420 NEKIGEVQNVAKDAFARLKGRWSCLQKRTEVKLQDLPIVLGACCVLHNICEMRGEEMDPE 479

Query: 454 LLTELQDDEMAPEMALRSVPSMKARDAIAHNLLHHGLAGTSFL 497
           L  E+ DDEM PE ALRSV  MKARDAIAHN+LH GLAGTSFL
Sbjct: 480 LRIEIMDDEMVPEAALRSVSLMKARDAIAHNILHKGLAGTSFL 512

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004141329.1  5.9e-265  100.00        PREDICTED: putative nuclease HARBI1 [Cucumis sativus] >KGN55352.1 hypothetical p...
XP_008452747.1  3.0e-261  98.79         PREDICTED: putative nuclease HARBI1 [Cucumis melo]
XP_022977009.1  3.6e-246  88.10         protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita maxima]
XP_022936710.1  1.0e-245  90.93         protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita moschata]
XP_023535595.1  1.0e-245  90.73         protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita pepo subsp....

Match Name   E-value   Identity (%)  Description
AT5G12010.1  2.7e-171  67.17         unknown protein
AT4G29780.1  3.5e-142  56.39         unknown protein
AT3G63270.1  6.5e-40   32.08         Putative harbinger transposase-derived nuclease (InterPro:IPR006912)
AT3G55350.1  1.9e-39   32.43         PIF / Ping-Pong family of plant transposases
AT3G19120.1  3.0e-21   24.09         PIF / Ping-Pong family of plant transposases

Match Name             E-value  Identity (%)  Description
sp|Q94K49|ALP1_ARATH   1.2e-38  32.08         Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
sp|Q9M2U3|ALPL_ARATH   3.4e-38  32.43         Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
sp|Q17QR8|HARB1_BOVIN  3.2e-20  26.37         Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1
sp|Q96MB7|HARB1_HUMAN  7.1e-20  26.37         Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1
sp|B0BN95|HARB1_RAT    1.6e-19  26.03         Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1

Match Name                      E-value   Identity (%)  Description
tr|A0A0A0L420|A0A0A0L420_CUCSA  3.9e-265  100.00        Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G646260 PE=4 SV=1
tr|A0A1S3BVR8|A0A1S3BVR8_CUCME  2.0e-261  98.79         putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103493676 PE=4 SV=1
tr|A0A1S2Y3N5|A0A1S2Y3N5_CICAR  2.9e-196  72.69         uncharacterized protein LOC101491352 OS=Cicer arietinum OX=3827 GN=LOC101491352 ...
tr|B9RQS8|B9RQS8_RICCO          1.6e-194  69.70         Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0706300 PE=4 SV=1
tr|A0A218VSM8|A0A218VSM8_PUNGR  4.1e-190  71.71         Uncharacterized protein OS=Punica granatum OX=22663 GN=CDL15_Pgr022111 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR027806  HARBI1_dom

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:1902600      hydrogen ion transmembrane transport
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0015299      solute:proton antiporter activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy4G023810.1  CsGy4G023810.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                                Source       Source Term      Source Description   Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM         PF13359          DDE_Tnp_4            coord: 285..441; e-value: 3.3E-36; score: 124.2
None       No IPR available                               MOBIDB_LITE  mobidb-lite      disorder_prediction  coord: 120..145
None       No IPR available                               PANTHER      PTHR22930:SF113  SUBFAMILY NOT NAMED  coord: 84..492
None       No IPR available                               PANTHER      PTHR22930        UNCHARACTERIZED      coord: 84..492
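
The `coord` ranges above can be compared directly to see how the annotated regions sit relative to one another. The following is a minimal Python sketch, assuming the coordinates are 1-based, inclusive residue positions on the protein (an assumption; the page does not state the convention). The helper names `parse_coord`, `span_length` and `contains` are hypothetical.

```python
import re

def parse_coord(text):
    """Return (start, end) from a 'coord: start..end' string."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    # Assumes inclusive coordinates, so both endpoints count.
    return end - start + 1

def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

pfam = parse_coord("coord: 285..441")      # PF13359 (DDE_Tnp_4)
panther = parse_coord("coord: 84..492")    # PTHR22930
disorder = parse_coord("coord: 120..145")  # mobidb-lite

print(span_length(*pfam))          # length of the Pfam domain hit
print(contains(panther, pfam))     # Pfam hit inside the PANTHER match?
print(contains(pfam, disorder))    # disorder region inside the Pfam hit?
```

Under that assumption, the Pfam hit falls inside the PANTHER match, while the predicted disordered region lies upstream of the nuclease domain.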

The following gene(s) are orthologous to this gene:
Gene          Orthologue       Organism                      Block
CsGy4G023810  Cla020861        Watermelon (97103) v1         cgybwmB297
CsGy4G023810  Csa1G589710      Cucumber (Chinese Long) v2    cgybcuB155
CsGy4G023810  Csa4G646260      Cucumber (Chinese Long) v2    cgybcuB166
CsGy4G023810  ClCG05G018050    Watermelon (Charleston Gray)  cgybwcgB291
CsGy4G023810  ClCG03G015390    Watermelon (Charleston Gray)  cgybwcgB274
CsGy4G023810  CSPI01G28370     Wild cucumber (PI 183967)     cgybcpiB158
CsGy4G023810  CSPI04G25490     Wild cucumber (PI 183967)     cgybcpiB169
CsGy4G023810  CmaCh11G007070   Cucurbita maxima (Rimu)       cgybcmaB476
CsGy4G023810  CmaCh10G007370   Cucurbita maxima (Rimu)       cgybcmaB473
CsGy4G023810  CmaCh09G003510   Cucurbita maxima (Rimu)       cgybcmaB465
CsGy4G023810  CmaCh01G017220   Cucurbita maxima (Rimu)       cgybcmaB514
CsGy4G023810  CmoCh09G003510   Cucurbita moschata (Rifu)     cgybcmoB433
CsGy4G023810  CmoCh10G007630   Cucurbita moschata (Rifu)     cgybcmoB441
CsGy4G023810  CmoCh01G017910   Cucurbita moschata (Rifu)     cgybcmoB475
CsGy4G023810  CmoCh11G007230   Cucurbita moschata (Rifu)     cgybcmoB445
CsGy4G023810  Lsi06G002980     Bottle gourd (USVL1VR-Ls)     cgyblsiB276
CsGy4G023810  Lsi04G014110     Bottle gourd (USVL1VR-Ls)     cgyblsiB269
CsGy4G023810  Cp4.1LG18g03410  Cucurbita pepo (Zucchini)     cgybcpeB498
CsGy4G023810  MELO3C032348.2   Melon (DHL92) v3.6.1          cgybmedB267
CsGy4G023810  MELO3C029341.2   Melon (DHL92) v3.6.1          cgybmedB241
CsGy4G023810  CsaV3_1G040660   Cucumber (Chinese Long) v3    cgybcucB162
CsGy4G023810  Cla97C05G099360  Watermelon (97103) v2         cgybwmbB291
CsGy4G023810  Cla97C03G065660  Watermelon (97103) v2         cgybwmbB275
CsGy4G023810  Bhi02G001540     Wax gourd                     cgybwgoB406
CsGy4G023810  Bhi09G002562     Wax gourd                     cgybwgoB398
CsGy4G023810  Carg10701        Silver-seed gourd             carcgybB0767
CsGy4G023810  Carg02778        Silver-seed gourd             carcgybB0633
CsGy4G023810  Carg12982        Silver-seed gourd             carcgybB0103
CsGy4G023810  Carg21947        Silver-seed gourd             carcgybB0876

The following gene(s) are paralogous to this gene:
Gene          Paralogue     Organism            Block
CsGy4G023810  CsGy1G027540  Cucumber (Gy14) v2  cgybcgybB022