CsGy4G023810.1 (mRNA) Cucumber (Gy14) v2

Name: CsGy4G023810.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: putative nuclease HARBI1
Location: Chr4 : 29754132 .. 29755622 (+)
Sequence length: 1491
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATGATTCCACCAACGGAAACGTGAGGAAGAGGACTAGAGCTGATGAAGTCGATGAAGACGACGATTTGATGGGAAAAAATGGCGGAGGAAAGGGTTTGAAAGGATTGGTTACGTCTCTGTTGTTGTTGGATGAACAGGACAAGTGTGAACAGGATGAACAAGACAGAATTTCCGTGGAGGCGAAGATTTCGATGGAGGTGAATCACAGGAAGAAGACCAAAGCTATGGTTGATTTCTATTCCGAAGTTCAAGATTACTATTCTGAAGTTGAAGAATCCGACCGAATGAAACGGAAGAAATCGCGATTGGCAGCTAACTCTGTTGCGGTTGCGGCCGTTTCCGATGGATTACAGAAAATCGAAAGCGAAAAATCAAACAAACGCGGCGGCGATGGCGGTGGAGGAAGCGGTGGTGGTGGTGGCCATCACCGGAGACTCTGGGTAAAAGATAGGTCAAAAGCTTGGTGGGATGAATGTAACAGTCCCGATTATCCCGATGAAGAATTCAAGAAGCAATTCAGAATGGGTAGAGCAACTTTCGATATGATTTGTGAAGAACTTAATTCCGCCATAGCTAAAGAAGACACAACCCTCCGAACCGCCATTCCCGTCCAGCAAAGAGTCGCGGTTTGCCTATGGAGATTAGCCACCGGCGATCCACTTCGAGTTGTATCGAAGAAATTCGGATTAGGTATTTCAACTTGCCACAAACTTGTTCTCGAGGTTTGCACAGCCATTAGAACAGTACTAATGCCGAAGCATCTTCAATGGCCGGAAGAAGAAACACTCAGAAGAATCAAAGAAGAATACGAATCAATTTCCGGAATCCCTAACGTCGTTGGTTCAATGTACACCACACACATTCCGATCATCGCTCCCAAAATCAGCGTCGCAGCTTATTTCAACAAACGCCATACAGAAAGAAATCAAAAAACATCATACTCAATTACAGTTCAAGGAGTGGTGGATCCAAGAGGAGTTTTCACGGACGTTTGCATCGGTTGGCCGGGATCAATGCCAGACGATCAAGTTCTTGAGAAATCTGCTCTGTTTCAAAGAGCAAATGGGGGATTACTGAAAGGAGTTTGGATTGTTGGAGGATCAAGTTATCCATTAATGGATTGGGTTTTAGTTCCTTATACACAGCAACATTTAACATGGACACAACATGCATTTAACGAGAAGATTGGAGAGATTCAGAAGGTGGCTAAAGATGCATTTGCACGGCTGAAGGGACGGTGGCGCTGCCTACAGAAACGGACAGAGGTGAAGCTTCAAGATTTGCCGGTGGTGCTCGGAGCTTGTTGTGTTCTTCATAATATTTGTGAATTAGGGAATCAAGAAATGGATACAGAGCTTTTAACAGAGCTTCAAGATGATGAAATGGCACCTGAAATGGCTTTAAGGTCAGTACCTTCCATGAAAGCAAGAGATGCCATTGCTCATAATCTGCTCCACCATGGCCTTGCTGGGACTTCTTTTCTTTAA

mRNA sequence

ATGAATGATTCCACCAACGGAAACGTGAGGAAGAGGACTAGAGCTGATGAAGTCGATGAAGACGACGATTTGATGGGAAAAAATGGCGGAGGAAAGGGTTTGAAAGGATTGGTTACGTCTCTGTTGTTGTTGGATGAACAGGACAAGTGTGAACAGGATGAACAAGACAGAATTTCCGTGGAGGCGAAGATTTCGATGGAGGTGAATCACAGGAAGAAGACCAAAGCTATGGTTGATTTCTATTCCGAAGTTCAAGATTACTATTCTGAAGTTGAAGAATCCGACCGAATGAAACGGAAGAAATCGCGATTGGCAGCTAACTCTGTTGCGGTTGCGGCCGTTTCCGATGGATTACAGAAAATCGAAAGCGAAAAATCAAACAAACGCGGCGGCGATGGCGGTGGAGGAAGCGGTGGTGGTGGTGGCCATCACCGGAGACTCTGGGTAAAAGATAGGTCAAAAGCTTGGTGGGATGAATGTAACAGTCCCGATTATCCCGATGAAGAATTCAAGAAGCAATTCAGAATGGGTAGAGCAACTTTCGATATGATTTGTGAAGAACTTAATTCCGCCATAGCTAAAGAAGACACAACCCTCCGAACCGCCATTCCCGTCCAGCAAAGAGTCGCGGTTTGCCTATGGAGATTAGCCACCGGCGATCCACTTCGAGTTGTATCGAAGAAATTCGGATTAGGTATTTCAACTTGCCACAAACTTGTTCTCGAGGTTTGCACAGCCATTAGAACAGTACTAATGCCGAAGCATCTTCAATGGCCGGAAGAAGAAACACTCAGAAGAATCAAAGAAGAATACGAATCAATTTCCGGAATCCCTAACGTCGTTGGTTCAATGTACACCACACACATTCCGATCATCGCTCCCAAAATCAGCGTCGCAGCTTATTTCAACAAACGCCATACAGAAAGAAATCAAAAAACATCATACTCAATTACAGTTCAAGGAGTGGTGGATCCAAGAGGAGTTTTCACGGACGTTTGCATCGGTTGGCCGGGATCAATGCCAGACGATCAAGTTCTTGAGAAATCTGCTCTGTTTCAAAGAGCAAATGGGGGATTACTGAAAGGAGTTTGGATTGTTGGAGGATCAAGTTATCCATTAATGGATTGGGTTTTAGTTCCTTATACACAGCAACATTTAACATGGACACAACATGCATTTAACGAGAAGATTGGAGAGATTCAGAAGGTGGCTAAAGATGCATTTGCACGGCTGAAGGGACGGTGGCGCTGCCTACAGAAACGGACAGAGGTGAAGCTTCAAGATTTGCCGGTGGTGCTCGGAGCTTGTTGTGTTCTTCATAATATTTGTGAATTAGGGAATCAAGAAATGGATACAGAGCTTTTAACAGAGCTTCAAGATGATGAAATGGCACCTGAAATGGCTTTAAGGTCAGTACCTTCCATGAAAGCAAGAGATGCCATTGCTCATAATCTGCTCCACCATGGCCTTGCTGGGACTTCTTTTCTTTAA

Coding sequence (CDS)

ATGAATGATTCCACCAACGGAAACGTGAGGAAGAGGACTAGAGCTGATGAAGTCGATGAAGACGACGATTTGATGGGAAAAAATGGCGGAGGAAAGGGTTTGAAAGGATTGGTTACGTCTCTGTTGTTGTTGGATGAACAGGACAAGTGTGAACAGGATGAACAAGACAGAATTTCCGTGGAGGCGAAGATTTCGATGGAGGTGAATCACAGGAAGAAGACCAAAGCTATGGTTGATTTCTATTCCGAAGTTCAAGATTACTATTCTGAAGTTGAAGAATCCGACCGAATGAAACGGAAGAAATCGCGATTGGCAGCTAACTCTGTTGCGGTTGCGGCCGTTTCCGATGGATTACAGAAAATCGAAAGCGAAAAATCAAACAAACGCGGCGGCGATGGCGGTGGAGGAAGCGGTGGTGGTGGTGGCCATCACCGGAGACTCTGGGTAAAAGATAGGTCAAAAGCTTGGTGGGATGAATGTAACAGTCCCGATTATCCCGATGAAGAATTCAAGAAGCAATTCAGAATGGGTAGAGCAACTTTCGATATGATTTGTGAAGAACTTAATTCCGCCATAGCTAAAGAAGACACAACCCTCCGAACCGCCATTCCCGTCCAGCAAAGAGTCGCGGTTTGCCTATGGAGATTAGCCACCGGCGATCCACTTCGAGTTGTATCGAAGAAATTCGGATTAGGTATTTCAACTTGCCACAAACTTGTTCTCGAGGTTTGCACAGCCATTAGAACAGTACTAATGCCGAAGCATCTTCAATGGCCGGAAGAAGAAACACTCAGAAGAATCAAAGAAGAATACGAATCAATTTCCGGAATCCCTAACGTCGTTGGTTCAATGTACACCACACACATTCCGATCATCGCTCCCAAAATCAGCGTCGCAGCTTATTTCAACAAACGCCATACAGAAAGAAATCAAAAAACATCATACTCAATTACAGTTCAAGGAGTGGTGGATCCAAGAGGAGTTTTCACGGACGTTTGCATCGGTTGGCCGGGATCAATGCCAGACGATCAAGTTCTTGAGAAATCTGCTCTGTTTCAAAGAGCAAATGGGGGATTACTGAAAGGAGTTTGGATTGTTGGAGGATCAAGTTATCCATTAATGGATTGGGTTTTAGTTCCTTATACACAGCAACATTTAACATGGACACAACATGCATTTAACGAGAAGATTGGAGAGATTCAGAAGGTGGCTAAAGATGCATTTGCACGGCTGAAGGGACGGTGGCGCTGCCTACAGAAACGGACAGAGGTGAAGCTTCAAGATTTGCCGGTGGTGCTCGGAGCTTGTTGTGTTCTTCATAATATTTGTGAATTAGGGAATCAAGAAATGGATACAGAGCTTTTAACAGAGCTTCAAGATGATGAAATGGCACCTGAAATGGCTTTAAGGTCAGTACCTTCCATGAAAGCAAGAGATGCCATTGCTCATAATCTGCTCCACCATGGCCTTGCTGGGACTTCTTTTCTTTAA

Protein sequence

MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQDYYSEVEESDRMKRKKSRLAANSVAVAAVSDGLQKIESEKSNKRGGDGGGGSGGGGGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDAIAHNLLHHGLAGTSFL
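
The coordinates, sequence length and protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that spans the whole feature and ends in a single stop codon (the listed sequences end in TAA).

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Values copied from the Location field of the record.
length = span_length(29754132, 29755622)
print(length)                  # 1491
print(protein_length(length))  # 496
```

The first value matches the stated sequence length, and the second matches the 496-residue alignment length reported for the full-length BLAST hits.
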
BLAST of CsGy4G023810.1 vs. NCBI nr
Match: XP_004141329.1 (PREDICTED: putative nuclease HARBI1 [Cucumis sativus] >KGN55352.1 hypothetical protein Csa_4G646260 [Cucumis sativus])

HSP 1 Score: 922.5 bits (2383), Expect = 5.9e-265
Identity = 496/496 (100.00%), Positives = 496/496 (100.00%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV
Sbjct: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK
Sbjct: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 496

BLAST of CsGy4G023810.1 vs. NCBI nr
Match: XP_008452747.1 (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 910.2 bits (2351), Expect = 3.0e-261
Identity = 490/496 (98.79%), Positives = 493/496 (99.40%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV
Sbjct: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMK XX RLAANSVAVAAVSDGLQ+
Sbjct: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKXXXXRLAANSVAVAAVSDGLQR 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE+EK XXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IENEK-XXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEE+ESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEFESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 495
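
The percentages on each summary line follow directly from the match counts and the alignment length. As a minimal sketch (the helper name `percent` is hypothetical and not part of any BLAST tool), the figures for the two hits above can be reproduced like this:

```python
def percent(matches: int, aligned: int) -> str:
    """Format a match count as a percentage of the alignment length."""
    return f"{100 * matches / aligned:.2f}%"


# Counts copied from the summary lines of the two hits above.
print(percent(496, 496))  # 100.00%
print(percent(490, 496))  # 98.79%
print(percent(493, 496))  # 99.40%
```
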

BLAST of CsGy4G023810.1 vs. NCBI nr
Match: XP_022977009.1 (protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita maxima])

HSP 1 Score: 860.1 bits (2221), Expect = 3.6e-246
Identity = 437/496 (88.10%), Positives = 447/496 (90.12%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNG  RKR R DE DEDD  +GKNG GK LKGLVTSLLLLDEQ+K EQ+E DR S+
Sbjct: 1   MNDSTNGGARKRNRGDEADEDDGSIGKNGRGKELKGLVTSLLLLDEQEKYEQEEHDRASM 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAK+SMEVNHRKKTKAM DFYSE Q XXXXX     +KR  SRLAANSVAVAA SDGLQK
Sbjct: 61  EAKVSMEVNHRKKTKAMDDFYSEAQDXXXXXEESDRLKRKKSRLAANSVAVAAASDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE  K                     LWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IEIVK-------SNKRGGDGGGHHRRLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL+
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLI 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPL+DWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLLDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGN+EMD EL TELQDDEMAPE+ALRSV SMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNEEMDRELSTELQDDEMAPEVALRSVSSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 489

BLAST of CsGy4G023810.1 vs. NCBI nr
Match: XP_022936710.1 (protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita moschata])

HSP 1 Score: 858.6 bits (2217), Expect = 1.0e-245
Identity = 451/496 (90.93%), Positives = 461/496 (92.94%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNG  RKR R DE DEDD  +GKNG GK LKGLVTSLLLLDEQ+K EQ+E DR S+
Sbjct: 1   MNDSTNGGARKRNRGDEADEDDGSIGKNGRGKELKGLVTSLLLLDEQEKYEQEEHDRASM 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAK+SMEVNHRKKTKAM DFYSE Q  XXXX     +KR  SRLAANSVAVAA SDGLQK
Sbjct: 61  EAKVSMEVNHRKKTKAMDDFYSEAQDYXXXXEESDRLKRKKSRLAANSVAVAAASDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE        XXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IE------IVXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL+
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLI 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPL+DWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLLDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGN+EMD EL TELQDDEMAPE+ALRSV SMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNEEMDRELSTELQDDEMAPEVALRSVSSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 490

BLAST of CsGy4G023810.1 vs. NCBI nr
Match: XP_023535595.1 (protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 858.6 bits (2217), Expect = 1.0e-245
Identity = 450/496 (90.73%), Positives = 460/496 (92.74%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNG  RKR R DE DEDD  +GKNG GK LKGLVTSLLLLDEQ+K EQ+E DR S+
Sbjct: 1   MNDSTNGGARKRNRGDEADEDDGSIGKNGRGKELKGLVTSLLLLDEQEKYEQEEHDRASM 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAK+SMEVNHRKKTK M DFYSE Q  XXX      +KR  SRLAANSVAVAA SDGLQK
Sbjct: 61  EAKVSMEVNHRKKTKVMDDFYSEAQDYXXXVEESDRLKRKKSRLAANSVAVAAASDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE  K     XXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IEIVK-----XXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL+
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLI 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPL+DWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLLDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGN+EMD EL TELQDDEMAPE+ALRSV SMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNEEMDRELSTELQDDEMAPEVALRSVSSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 491

BLAST of CsGy4G023810.1 vs. TAIR10
Match: AT5G12010.1 (unknown protein)

HSP 1 Score: 599.0 bits (1543), Expect = 2.7e-171
Identity = 313/466 (67.17%), Positives = 374/466 (80.26%), Query Frame = 0

Query: 32  KGLKGLVTSLLLLDEQDKCEQDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQXXXXXX 91
           K LKG  T           +Q+ ++  S       + N+RK+ + M D+YS++ XXXXXX
Sbjct: 41  KNLKGFFTXXXXXXXXXXXDQEARNAASRREMSDFQSNYRKRARTMSDYYSDLXXXXXXX 100

Query: 92  XXXXXMKRXXSRLAANSVAVAAVSDGLQKIESEKXXXXXXXXXXXXXXXXXXXXXLWVKD 151
                +    SR++                    XXXXXXXXXX   XXXXXXXXLWVKD
Sbjct: 101 EESGDINLKKSRVS----RAVXXXXXXXXXXXXXXXXXXXXXXXVRGXXXXXXXXLWVKD 160

Query: 152 RSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAV 211
           RS+AWW+EC+  DYP+E+FKK FRM ++TF++IC+ELNSA+AKEDT LR AIPV+QRVAV
Sbjct: 161 RSRAWWEECSRLDYPEEDFKKAFRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAV 220

Query: 212 CLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEY 271
           C+WRLATG+PLR+VSKKFGLGISTCHKLVLEVC AI+ VLMPK+LQWP++E+LR I+E +
Sbjct: 221 CIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERF 280

Query: 272 ESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTD 331
           ES+SGIPNVVGSMYTTHIPIIAPKISVA+YFNKRHTERNQKTSYSIT+Q VV+P+GVFTD
Sbjct: 281 ESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTD 340

Query: 332 VCIGWPGSMPDDQVLEKSALFQRA-NGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQ 391
           +CIGWPGSMPDD+VLEKS L+QRA NGGLLKG+W+ GG  +PL+DWVLVPYTQQ+LTWTQ
Sbjct: 341 LCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGMWVAGGPGHPLLDWVLVPYTQQNLTWTQ 400

Query: 392 HAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEM 451
           HAFNEK+ E+Q VAK+AF RLKGRW CLQKRTEVKLQDLP VLGACCVLHNICE+  ++M
Sbjct: 401 HAFNEKMSEVQGVAKEAFGRLKGRWACLQKRTEVKLQDLPTVLGACCVLHNICEMREEKM 460

Query: 452 DTELLTELQDDEMAPEMALRSVPSMKARDAIAHNLLHHGLAGTSFL 497
           + EL+ E+ DDE+ PE  LRSV +MKARD I+HNLLHHGLAGTSFL
Sbjct: 461 EPELMVEVIDDEVLPENVLRSVNAMKARDTISHNLLHHGLAGTSFL 502

BLAST of CsGy4G023810.1 vs. TAIR10
Match: AT4G29780.1 (unknown protein)

HSP 1 Score: 502.3 bits (1292), Expect = 3.5e-142
Identity = 247/438 (56.39%), Positives = 312/438 (71.23%), Query Frame = 0

Query: 60  VEAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQ 119
           ++ K  +E NH+KK K M  +Y+++Q            +   +R  A +  V+AV+ G  
Sbjct: 105 IKEKSLLEANHKKKVKTMDGYYNQMQDHYSAAGETDGSRSKRARKTAVAAVVSAVASGAD 164

Query: 120 KIESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRA 179
              +                       LWVK+R+  WWD  + PD+P++EF+++FRM ++
Sbjct: 165 --TTGLAAPVPTADIASGSGSGPSHRRLWVKERTTDWWDRVSRPDFPEDEFRREFRMSKS 224

Query: 180 TFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKL 239
           TF++ICEEL++ + K++T LR AIP  +RV VC+WRLATG PLR VS++FGLGISTCHKL
Sbjct: 225 TFNLICEELDTTVTKKNTMLRDAIPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKL 284

Query: 240 VLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVA 299
           V+EVC AI  VLMPK+L WP +  +   K ++ES+  IPNVVGS+YTTHIPIIAPK+ VA
Sbjct: 285 VIEVCRAIYDVLMPKYLLWPSDSEINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVA 344

Query: 300 AYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALF-QRANGG 359
           AYFNKRHTERNQKTSYSITVQGVV+  G+FTDVCIG PGS+ DDQ+LEKS+L  QRA  G
Sbjct: 345 AYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPGSLTDDQILEKSSLSRQRAARG 404

Query: 360 LLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCL 419
           +L+  WIVG S +PL D++LVPYT+Q+LTWTQHAFNE IGEIQ +A  AF RLKGRW CL
Sbjct: 405 MLRDSWIVGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACL 464

Query: 420 QKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKAR 479
           QKRTEVKLQDLP VLGACCVLHNICE+  +EM  EL  E+ DD   PE  +RS  ++  R
Sbjct: 465 QKRTEVKLQDLPYVLGACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSASAVNTR 524

Query: 480 DAIAHNLLHHGLAGTSFL 497
           D I+HNLLH GLAGT  L
Sbjct: 525 DHISHNLLHRGLAGTRTL 540

BLAST of CsGy4G023810.1 vs. TAIR10
Match: AT3G63270.1 (Putative harbinger transposase-derived nuclease (InterPro:IPR006912))

HSP 1 Score: 162.5 bits (410), Expect = 6.5e-40
Identity = 102/318 (32.08%), Positives = 161/318 (50.63%), Query Frame = 0

Query: 156 WWD----ECNSPDYPDEE---FKKQFRMGRATFDMICEELNSAIAKEDTTLR-------- 215
           WWD      +SP  P +E   FK  FR  + TF  IC     ++ +ED   R        
Sbjct: 44  WWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTFSYIC-----SLVREDLISRPPSGLINI 103

Query: 216 --TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQW 275
               + V+++VA+ L RLA+GD    V   FG+G ST  ++      A+       HL+W
Sbjct: 104 EGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE-RAKHHLRW 163

Query: 276 PEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSIT 335
           P+ + +  IK ++E + G+PN  G++ TTHI +  P +  +  +       +Q+ +YS+ 
Sbjct: 164 PDSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDW------CDQEKNYSMF 223

Query: 336 VQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRA-NGGLLKG------------VWI 395
           +QGV D    F ++  GWPG M   ++L+ S  F+   N  +L G             ++
Sbjct: 224 LQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYV 283

Query: 396 VGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK-RTEV 443
           VGG SYPL+ W++ P+   H + +  AFNE+  +++ VA  AF +LKG WR L K     
Sbjct: 284 VGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRP 343

BLAST of CsGy4G023810.1 vs. TAIR10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 161.0 bits (406), Expect = 1.9e-39
Identity = 108/333 (32.43%), Positives = 172/333 (51.65%), Query Frame = 0

Query: 156 WWDECNSPDY----PDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA--------- 215
           WWD  +   Y      + F+  F++ R TFD IC     ++ K D T + A         
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYIC-----SLVKADFTAKPANFSDSNGNP 113

Query: 216 IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEE 275
           + +  RVAV L RL +G+ L V+ + FG+  ST  ++      ++    +  HL WP + 
Sbjct: 114 LSLNDRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAI-HHLSWPSK- 173

Query: 276 TLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGV 335
            L  IK ++E ISG+PN  G++  THI +  P +  +   NK   +  +  ++S+T+Q V
Sbjct: 174 -LDEIKSKFEKISGLPNCCGAIDITHIVMNLPAVEPS---NKVWLDGEK--NFSMTLQAV 233

Query: 336 VDPRGVFTDVCIGWPGSMPDDQVLEKSALF------QRANGGLLK-------GVWIVGGS 395
           VDP   F DV  GWPGS+ DD VL+ S  +      +R NG  L          +IVG S
Sbjct: 234 VDPDMRFLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDS 293

Query: 396 SYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQD- 455
            +PL+ W+L PY  +  +  Q  FN++  E  K A+ A ++LK RWR +     +  ++ 
Sbjct: 294 GFPLLPWLLTPYQGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNR 353

Query: 456 LPVVLGACCVLHN-ICELGNQEMDTELLTELQD 461
           LP ++  CC+LHN I ++ +Q +D + L++  D
Sbjct: 354 LPRIIFVCCLLHNIIIDMEDQTLDDQPLSQQHD 373

BLAST of CsGy4G023810.1 vs. TAIR10
Match: AT3G19120.1 (PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 100.5 bits (249), Expect = 3.0e-21
Identity = 79/328 (24.09%), Positives = 147/328 (44.82%), Query Frame = 0

Query: 167 DEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVS 226
           D  ++  + +    F  + ++L   I    T    ++P    VA+ L RLA G   + ++
Sbjct: 114 DARWRSLYGLSYPVFITVVDKLKPFI----TASNLSLPADYAVAMVLSRLAHGCSAKTLA 173

Query: 227 KKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWP-EEETLRRIKEEYESISGIPNVVGSMY 286
            ++ L      K+   V   + T L P+ ++ P  +  L    + +E ++ +PN+ G++ 
Sbjct: 174 SRYSLDPYLISKITNMVTRLLATKLYPEFIKIPVGKRRLIETTQGFEELTSLPNICGAID 233

Query: 287 TTHIPIIAPKISVAAYFNKRHTERNQKTSY-------SITVQGVVDPRGVFTDVCIGWPG 346
           +T + +            +R T+ N +  Y       ++ +Q V D + +F DVC+  PG
Sbjct: 234 STPVKL------------RRRTKLNPRNIYGCKYGYDAVLLQVVADHKKIFWDVCVKAPG 293

Query: 347 SMPDDQVLEKSALFQRANGGLLKGVW--------------IVGGSSYPLMDWVLVPYTQQ 406
              D      S L++R   G +  VW              IVG   YPL+ +++ P++  
Sbjct: 294 GEDDSSHFRDSLLYKRLTSGDI--VWEKVINIRGHHVRPYIVGDWCYPLLSFLMTPFSPN 353

Query: 407 HL-TWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNIC 466
              T  ++ F+  + + + V  +A   LK RW+ LQ    V +   P  + ACCVLHN+C
Sbjct: 354 GSGTPPENLFDGMLMKGRSVVVEAIGLLKARWKILQS-LNVGVNHAPQTIVACCVLHNLC 413

Query: 467 ELGNQEMDTELLTELQDDEMAPEMALRS 472
           ++  +E + E+  +  D+   P   L S
Sbjct: 414 QIA-REPEPEIWKD-PDEAGTPARVLES 420

BLAST of CsGy4G023810.1 vs. Swiss-Prot
Match: sp|Q94K49|ALP1_ARATH (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.2e-38
Identity = 102/318 (32.08%), Positives = 161/318 (50.63%), Query Frame = 0

Query: 156 WWD----ECNSPDYPDEE---FKKQFRMGRATFDMICEELNSAIAKEDTTLR-------- 215
           WWD      +SP  P +E   FK  FR  + TF  IC     ++ +ED   R        
Sbjct: 44  WWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTFSYIC-----SLVREDLISRPPSGLINI 103

Query: 216 --TAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQW 275
               + V+++VA+ L RLA+GD    V   FG+G ST  ++      A+       HL+W
Sbjct: 104 EGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE-RAKHHLRW 163

Query: 276 PEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSIT 335
           P+ + +  IK ++E + G+PN  G++ TTHI +  P +  +  +       +Q+ +YS+ 
Sbjct: 164 PDSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDW------CDQEKNYSMF 223

Query: 336 VQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRA-NGGLLKG------------VWI 395
           +QGV D    F ++  GWPG M   ++L+ S  F+   N  +L G             ++
Sbjct: 224 LQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYV 283

Query: 396 VGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK-RTEV 443
           VGG SYPL+ W++ P+   H + +  AFNE+  +++ VA  AF +LKG WR L K     
Sbjct: 284 VGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRP 343

BLAST of CsGy4G023810.1 vs. Swiss-Prot
Match: sp|Q9M2U3|ALPL_ARATH (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 3.4e-38
Identity = 108/333 (32.43%), Positives = 172/333 (51.65%), Query Frame = 0

Query: 156 WWDECNSPDY----PDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA--------- 215
           WWD  +   Y      + F+  F++ R TFD IC     ++ K D T + A         
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYIC-----SLVKADFTAKPANFSDSNGNP 113

Query: 216 IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEE 275
           + +  RVAV L RL +G+ L V+ + FG+  ST  ++      ++    +  HL WP + 
Sbjct: 114 LSLNDRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAI-HHLSWPSK- 173

Query: 276 TLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGV 335
            L  IK ++E ISG+PN  G++  THI +  P +  +   NK   +  +  ++S+T+Q V
Sbjct: 174 -LDEIKSKFEKISGLPNCCGAIDITHIVMNLPAVEPS---NKVWLDGEK--NFSMTLQAV 233

Query: 336 VDPRGVFTDVCIGWPGSMPDDQVLEKSALF------QRANGGLLK-------GVWIVGGS 395
           VDP   F DV  GWPGS+ DD VL+ S  +      +R NG  L          +IVG S
Sbjct: 234 VDPDMRFLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDS 293

Query: 396 SYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQD- 455
            +PL+ W+L PY  +  +  Q  FN++  E  K A+ A ++LK RWR +     +  ++ 
Sbjct: 294 GFPLLPWLLTPYQGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNR 353

Query: 456 LPVVLGACCVLHN-ICELGNQEMDTELLTELQD 461
           LP ++  CC+LHN I ++ +Q +D + L++  D
Sbjct: 354 LPRIIFVCCLLHNIIIDMEDQTLDDQPLSQQHD 373

BLAST of CsGy4G023810.1 vs. Swiss-Prot
Match: sp|Q17QR8|HARB1_BOVIN (Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 3.2e-20
Identity = 77/292 (26.37%), Positives = 134/292 (45.89%), Query Frame = 0

Query: 158 DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLA 217
           D     D  DE     +   R     + E L +++++  T    AI  + ++   L    
Sbjct: 23  DRFKLDDVTDEYLMSMYGFPRQFIYYLVELLGASLSR-PTQRSRAISPETQILAALGFYT 82

Query: 218 TGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWP-EEETLRRIKEEYESISG 277
           +G     +    G+  ++  + V  V  A+      + + +P +E +++ +K+E+  ++G
Sbjct: 83  SGSFQTRMGDAIGISQASMSRCVANVTEAL-VERASQFIHFPADEASVQALKDEFYGLAG 142

Query: 278 IPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGW 337
           IP V+G +   H+ I AP     +Y N+       K  +S+    V D RG    V   W
Sbjct: 143 IPGVIGVVDCMHVAIKAPNAEDLSYVNR-------KGLHSLNCLMVCDIRGALMTVETSW 202

Query: 338 PGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHL--TWTQHAFN 397
           PGS+ D  VL++S+L  +   G+ K  W++G SS+ L  W++ P    H+  T  ++ +N
Sbjct: 203 PGSLQDCVVLQQSSLSSQFEAGMHKESWLLGDSSFFLRTWLMTPL---HIPETPAEYRYN 262

Query: 398 EKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPV----VLGACCVLHNI 443
                   V +  F  L  R+RCL   ++  LQ  P     ++ ACCVLHNI
Sbjct: 263 MAHSATHSVIEKTFRTLCSRFRCLD-GSKGALQYSPEKSSHIILACCVLHNI 301

BLAST of CsGy4G023810.1 vs. Swiss-Prot
Match: sp|Q96MB7|HARB1_HUMAN (Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 7.1e-20
Identity = 77/292 (26.37%), Positives = 134/292 (45.89%), Query Frame = 0

Query: 158 DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLA 217
           D     D  DE     +   R     + E L + +++  T    AI  + +V   L    
Sbjct: 23  DRFKLDDVTDEYLMSMYGFPRQFIYYLVELLGANLSR-PTQRSRAISPETQVLAALGFYT 82

Query: 218 TGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWP-EEETLRRIKEEYESISG 277
           +G     +    G+  ++  + V  V  A+      + +++P +E +++ +K+E+  ++G
Sbjct: 83  SGSFQTRMGDAIGISQASMSRCVANVTEAL-VERASQFIRFPADEASIQALKDEFYGLAG 142

Query: 278 IPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGW 337
           +P V+G +   H+ I AP     +Y N+       K  +S+    V D RG    V   W
Sbjct: 143 MPGVMGVVDCIHVAIKAPNAEDLSYVNR-------KGLHSLNCLMVCDIRGTLMTVETNW 202

Query: 338 PGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHL--TWTQHAFN 397
           PGS+ D  VL++S+L  +   G+ K  W++G SS+ L  W++ P    H+  T  ++ +N
Sbjct: 203 PGSLQDCAVLQQSSLSSQFEAGMHKDSWLLGDSSFFLRTWLMTPL---HIPETPAEYRYN 262

Query: 398 EKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPV----VLGACCVLHNI 443
                   V +  F  L  R+RCL   ++  LQ  P     ++ ACCVLHNI
Sbjct: 263 MAHSATHSVIEKTFRTLCSRFRCLD-GSKGALQYSPEKSSHIILACCVLHNI 301

BLAST of CsGy4G023810.1 vs. Swiss-Prot
Match: sp|B0BN95|HARB1_RAT (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 1.6e-19
Identity = 76/292 (26.03%), Positives = 133/292 (45.55%), Query Frame = 0

Query: 158 DECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLA 217
           D     D  DE     +   R     + E L +++++  T    AI  + ++   L    
Sbjct: 23  DRFKLDDVTDEYLMSMYGFPRQFIYYLVELLGASLSR-PTQRSRAISPETQILAALGFYT 82

Query: 218 TGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWP-EEETLRRIKEEYESISG 277
           +G     +    G+  ++  + V  V  A+      + + +P +E  ++ +K+E+  ++G
Sbjct: 83  SGSFQTRMGDAIGISQASMSRCVANVTEAL-VERASQFIHFPADEAAIQSLKDEFYGLAG 142

Query: 278 IPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGW 337
           +P V+G++   H+ I AP     +Y N+       K  +S+    V D RG    V   W
Sbjct: 143 MPGVIGAVDCIHVAIKAPNAEDLSYVNR-------KGLHSLNCLVVCDIRGALMTVETSW 202

Query: 338 PGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHL--TWTQHAFN 397
           PGS+ D  VL++S+L  +   G+ K  W++G SS+ L  W+L P    H+  T  ++ +N
Sbjct: 203 PGSLQDCAVLQQSSLSSQFETGMPKDSWLLGDSSFFLHTWLLTPL---HIPETPAEYRYN 262

Query: 398 EKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPV----VLGACCVLHNI 443
                   V +     L  R+RCL   ++  LQ  P     ++ ACCVLHNI
Sbjct: 263 RAHSATHSVIEKTLRTLCCRFRCLD-GSKGALQYSPEKSSHIILACCVLHNI 301

BLAST of CsGy4G023810.1 vs. TrEMBL
Match: tr|A0A0A0L420|A0A0A0L420_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G646260 PE=4 SV=1)

HSP 1 Score: 922.5 bits (2383), Expect = 3.9e-265
Identity = 496/496 (100.00%), Positives = 496/496 (100.00%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV
Sbjct: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK
Sbjct: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 496

BLAST of CsGy4G023810.1 vs. TrEMBL
Match: tr|A0A1S3BVR8|A0A1S3BVR8_CUCME (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103493676 PE=4 SV=1)

HSP 1 Score: 910.2 bits (2351), Expect = 2.0e-261
Identity = 490/496 (98.79%), Positives = 493/496 (99.40%), Query Frame = 0

Query: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60
           MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV
Sbjct: 1   MNDSTNGNVRKRTRADEVDEDDDLMGKNGGGKGLKGLVTSLLLLDEQDKCEQDEQDRISV 60

Query: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAVAAVSDGLQK 120
           EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMK XX RLAANSVAVAAVSDGLQ+
Sbjct: 61  EAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKXXXXRLAANSVAVAAVSDGLQR 120

Query: 121 IESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180
           IE+EK XXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT
Sbjct: 121 IENEK-XXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRAT 180

Query: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240
           FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV
Sbjct: 181 FDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLV 240

Query: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAA 300
           LEVCTAIRTVLMPKHLQWPEEETLRRIKEE+ESISGIPNVVGSMYTTHIPIIAPKISVAA
Sbjct: 241 LEVCTAIRTVLMPKHLQWPEEETLRRIKEEFESISGIPNVVGSMYTTHIPIIAPKISVAA 300

Query: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360
           YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL
Sbjct: 301 YFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALFQRANGGLL 360

Query: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420
           KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK
Sbjct: 361 KGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQK 420

Query: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480
           RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA
Sbjct: 421 RTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSMKARDA 480

Query: 481 IAHNLLHHGLAGTSFL 497
           IAHNLLHHGLAGTSFL
Sbjct: 481 IAHNLLHHGLAGTSFL 495

BLAST of CsGy4G023810.1 vs. TrEMBL
Match: tr|A0A1S2Y3N5|A0A1S2Y3N5_CICAR (uncharacterized protein LOC101491352 OS=Cicer arietinum OX=3827 GN=LOC101491352 PE=4 SV=1)

HSP 1 Score: 693.7 bits (1789), Expect = 2.9e-196
Identity = 338/465 (72.69%), Positives = 385/465 (82.80%), Query Frame = 0

Query: 34  LKGLVTSLLLLDEQDKCEQDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXX 93
           LKG++TS+LLLDEQ+K E +  +++  + K  +E NH+KKTKAMVD+Y+ +         
Sbjct: 80  LKGILTSILLLDEQEKQEFENNNKVLEDEKFCLETNHKKKTKAMVDYYTNLDDSYSQVEE 139

Query: 94  XXXMKRXXSRLAANSVAVAAV--SDGLQKIESEKXXXXXXXXXXXXXXXXXXXXXLWVKD 153
              ++R  +R  ++SVA+AA   SDG+++  SE                      LWVKD
Sbjct: 140 SERVRRKKTRNMSSSVAIAATTFSDGIEETNSES-VVNNMKNNDNSSGKSGSQRRLWVKD 199

Query: 154 RSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAV 213
           RS AWWDECN  D+P+ EF+K FRMG++TFD+ICEELNSAI KEDTTLR AIPV+QRVAV
Sbjct: 200 RSGAWWDECNKDDFPENEFRKAFRMGKSTFDLICEELNSAIVKEDTTLRNAIPVRQRVAV 259

Query: 214 CLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEY 273
           CLWRLATGDPLR+VSK+FGLGISTCHKLVLEVCTAI+TVLMPK+LQWP E  LR+IK E+
Sbjct: 260 CLWRLATGDPLRIVSKRFGLGISTCHKLVLEVCTAIKTVLMPKYLQWPNEVNLRKIKGEF 319

Query: 274 ESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTD 333
           ESISGIPNVVGSMYT+H+PIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDP GVFTD
Sbjct: 320 ESISGIPNVVGSMYTSHVPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPNGVFTD 379

Query: 334 VCIGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQH 393
           VCIGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVG SSYPLMDWVLVPY QQ+LTWTQH
Sbjct: 380 VCIGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVGSSSYPLMDWVLVPYNQQNLTWTQH 439

Query: 394 AFNEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMD 453
            FNEKIGEIQKVAKDAF RLKGRW CLQKRTEVKLQDLPVVLGACCVLHNICE+  ++M+
Sbjct: 440 GFNEKIGEIQKVAKDAFGRLKGRWCCLQKRTEVKLQDLPVVLGACCVLHNICEMKGEKME 499

Query: 454 TELLTELQDDEMAPEMALRSVPSMKARDAIAHNLLHHGLAGTSFL 497
            EL  ++ DDEM PE+ LRSV S+KARDAIAHNLLHHGLAGTSFL
Sbjct: 500 DELKVDVLDDEMVPEVGLRSVNSLKARDAIAHNLLHHGLAGTSFL 543

BLAST of CsGy4G023810.1 vs. TrEMBL
Match: tr|B9RQS8|B9RQS8_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0706300 PE=4 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 1.6e-194
Identity = 352/505 (69.70%), Positives = 408/505 (80.79%), Query Frame = 0

Query: 1   MNDSTNGNVRKRT--RADEVDEDDDLMGKNGGG-------KGLKGLVTSLLLLDEQDKCE 60
           MN++ N   R+R   R + VD+DD    +           K L G++TSL+L+++Q+KC+
Sbjct: 1   MNETNNTKKRQRKGYRQESVDKDDSNSFEEDSNNTTSLKTKDLSGIITSLILIEDQEKCD 60

Query: 61  QDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXXXXXMKRXXSRLAANSVAV 120
           Q+E++R   E K  +E NH+KKT+  V++YS +Q           +KR  SR  A + A+
Sbjct: 61  QEEENRAFSEEKHLLEANHKKKTRTAVEYYSNLQDYYSEIEETDRVKRKKSRAIAGAAAI 120

Query: 121 AAVSDGLQKIESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRSKAWWDECNSPDYPDEEFK 180
           +A S+G+               XXXXXXXXXXXXXLWVKDR K WWDECN PDYP+EEFK
Sbjct: 121 SASSNGVAN-------KATGDAXXXXXXXXXXXXXLWVKDRDKEWWDECNRPDYPEEEFK 180

Query: 181 KQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCLWRLATGDPLRVVSKKFGL 240
           K FRM +ATFD+ICEEL+S I KEDTTLR AIPV+QRVAVC+WRLATG+PLR+VSK+FGL
Sbjct: 181 KAFRMSKATFDLICEELHSCIQKEDTTLRNAIPVRQRVAVCIWRLATGEPLRLVSKRFGL 240

Query: 241 GISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPI 300
           GISTCHKLVLEVC+AI+ VLMPK+LQWP+E++L+++K E+ESISGIPNVVGSMYTTHIPI
Sbjct: 241 GISTCHKLVLEVCSAIKNVLMPKYLQWPDEDSLKKVKNEFESISGIPNVVGSMYTTHIPI 300

Query: 301 IAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSAL 360
           IAPKISVAAYFNKRHTERNQKTSYSITVQGVVDP+GVFTDVCIGWPGSMPDDQVLEKSAL
Sbjct: 301 IAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPKGVFTDVCIGWPGSMPDDQVLEKSAL 360

Query: 361 FQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARL 420
           +QRANGGLLK VWIVG S YPLMDWVLVPYTQQHLTWTQHAFNEKIGE+Q VAK+AF RL
Sbjct: 361 YQRANGGLLKDVWIVGSSGYPLMDWVLVPYTQQHLTWTQHAFNEKIGEVQTVAKEAFTRL 420

Query: 421 KGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRS 480
           KGRW CLQKRTEVKLQDLPVVLGACCVLHNICEL  +E+D +L  EL DDEM PE+ALRS
Sbjct: 421 KGRWSCLQKRTEVKLQDLPVVLGACCVLHNICELRKEEIDPKLRVELVDDEMVPEVALRS 480

Query: 481 VPSMKARDAIAHNLLHHGLAGTSFL 497
             SMKARDAIAHNLLHH  AGT FL
Sbjct: 481 ASSMKARDAIAHNLLHHCHAGTGFL 498

BLAST of CsGy4G023810.1 vs. TrEMBL
Match: tr|A0A218VSM8|A0A218VSM8_PUNGR (Uncharacterized protein OS=Punica granatum OX=22663 GN=CDL15_Pgr022111 PE=4 SV=1)

HSP 1 Score: 673.3 bits (1736), Expect = 4.1e-190
Identity = 332/463 (71.71%), Positives = 377/463 (81.43%), Query Frame = 0

Query: 34  LKGLVTSLLLLDEQDKCEQDEQDRISVEAKISMEVNHRKKTKAMVDFYSEVQXXXXXXXX 93
           LKG++TSL LL++Q+K +  E++  +VE +  +E N+RKK++A  DFYS V+        
Sbjct: 60  LKGIITSLSLLEDQEKEDLREREVAAVEERQLLENNYRKKSRATADFYSNVEDYYAETDE 119

Query: 94  XXXMKRXXSRLAANSVAVAAVSDGLQKIESEKXXXXXXXXXXXXXXXXXXXXXLWVKDRS 153
               +R  SR  A +VA     +G+ K +SEK                     LWVKDRS
Sbjct: 120 LDRTRRKKSRALAGAVAAGIAEEGVLK-KSEK-------GGKKSGGEGGQSRRLWVKDRS 179

Query: 154 KAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTAIPVQQRVAVCL 213
            +WWDECN PDYP+ EF+K FRMGR TFD+ICEELNSAIAKEDT LR AIPV+QRVAVC+
Sbjct: 180 NSWWDECNRPDYPEHEFRKAFRMGRKTFDVICEELNSAIAKEDTALRNAIPVRQRVAVCI 239

Query: 214 WRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHLQWPEEETLRRIKEEYES 273
           WRLATG+PLR+VSKKFGLGISTCHKLVLEVC AI++VLMPK LQWPE+  LR+IKEE+ES
Sbjct: 240 WRLATGEPLRLVSKKFGLGISTCHKLVLEVCAAIKSVLMPKFLQWPED--LRKIKEEFES 299

Query: 274 ISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDPRGVFTDVC 333
           +S IPNVVGSMYTTH+PIIAPKISVAAYFNKRHTERNQKTSYSIT+QGVVDPRGVFTDVC
Sbjct: 300 VSAIPNVVGSMYTTHVPIIAPKISVAAYFNKRHTERNQKTSYSITLQGVVDPRGVFTDVC 359

Query: 334 IGWPGSMPDDQVLEKSALFQRANGGLLKGVWIVGGSSYPLMDWVLVPYTQQHLTWTQHAF 393
           IGWPGSMPDDQVLEKSAL+QRA GGLLKGVWIVGGS YPL+DWVLVPYTQ +LTWTQHAF
Sbjct: 360 IGWPGSMPDDQVLEKSALYQRAQGGLLKGVWIVGGSGYPLLDWVLVPYTQPNLTWTQHAF 419

Query: 394 NEKIGEIQKVAKDAFARLKGRWRCLQKRTEVKLQDLPVVLGACCVLHNICELGNQEMDTE 453
           NEKIGE+Q VAKDAFARLKGRW CLQKRTEVKLQDLP+VLGACCVLHNICE+  +EMD E
Sbjct: 420 NEKIGEVQNVAKDAFARLKGRWSCLQKRTEVKLQDLPIVLGACCVLHNICEMRGEEMDPE 479

Query: 454 LLTELQDDEMAPEMALRSVPSMKARDAIAHNLLHHGLAGTSFL 497
           L  E+ DDEM PE ALRSV  MKARDAIAHN+LH GLAGTSFL
Sbjct: 480 LRIEIMDDEMVPEAALRSVSLMKARDAIAHNILHKGLAGTSFL 512

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_004141329.1 | 5.9e-265 | 100.00 | PREDICTED: putative nuclease HARBI1 [Cucumis sativus] >KGN55352.1 hypothetical p...
XP_008452747.1 | 3.0e-261 | 98.79 | PREDICTED: putative nuclease HARBI1 [Cucumis melo]
XP_022977009.1 | 3.6e-246 | 88.10 | protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita maxima]
XP_022936710.1 | 1.0e-245 | 90.93 | protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita moschata]
XP_023535595.1 | 1.0e-245 | 90.73 | protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Cucurbita pepo subsp....
Match Name | E-value | Identity | Description
AT5G12010.1 | 2.7e-171 | 67.17 | unknown protein
AT4G29780.1 | 3.5e-142 | 56.39 | unknown protein
AT3G63270.1 | 6.5e-40 | 32.08 | Putative harbinger transposase-derived nuclease (InterPro:IPR006912)
AT3G55350.1 | 1.9e-39 | 32.43 | PIF / Ping-Pong family of plant transposases
AT3G19120.1 | 3.0e-21 | 24.09 | PIF / Ping-Pong family of plant transposases
Match Name | E-value | Identity | Description
sp|Q94K49|ALP1_ARATH | 1.2e-38 | 32.08 | Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
sp|Q9M2U3|ALPL_ARATH | 3.4e-38 | 32.43 | Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
sp|Q17QR8|HARB1_BOVIN | 3.2e-20 | 26.37 | Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1
sp|Q96MB7|HARB1_HUMAN | 7.1e-20 | 26.37 | Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1
sp|B0BN95|HARB1_RAT | 1.6e-19 | 26.03 | Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1
Match Name | E-value | Identity | Description
tr|A0A0A0L420|A0A0A0L420_CUCSA | 3.9e-265 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G646260 PE=4 SV=1
tr|A0A1S3BVR8|A0A1S3BVR8_CUCME | 2.0e-261 | 98.79 | putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103493676 PE=4 SV=1
tr|A0A1S2Y3N5|A0A1S2Y3N5_CICAR | 2.9e-196 | 72.69 | uncharacterized protein LOC101491352 OS=Cicer arietinum OX=3827 GN=LOC101491352 ...
tr|B9RQS8|B9RQS8_RICCO | 1.6e-194 | 69.70 | Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0706300 PE=4 SV=1
tr|A0A218VSM8|A0A218VSM8_PUNGR | 4.1e-190 | 71.71 | Uncharacterized protein OS=Punica granatum OX=22663 GN=CDL15_Pgr022111 PE=4 SV=1
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term | Definition
IPR027806 | HARBI1_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0003674 | molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CsGy4G023810 | CsGy4G023810 | gene


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CsGy4G023810.1.CDS.1 | CsGy4G023810.1.CDS.1 | CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name | Unique Name | Type
CsGy4G023810.1 | CsGy4G023810.1-protein | polypeptide


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR027806 | Harbinger transposase-derived nuclease domain | PFAM | PF13359 | DDE_Tnp_4 | coord: 285..441; e-value: 3.3E-36; score: 124.2
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 120..145
None | No IPR available | PANTHER | PTHR22930:SF113 | SUBFAMILY NOT NAMED | coord: 84..492
None | No IPR available | PANTHER | PTHR22930 | UNCHARACTERIZED | coord: 84..492
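
The `coord:` values above give start and end positions on the polypeptide. As a minimal sketch, assuming those positions are 1-based and inclusive (an assumption, not stated on this page), the length of each annotated region is `end - start + 1`. The helper name `span_length` is hypothetical.

```python
import re


def span_length(coord):
    """Length in residues of a 'coord: START..END' span.

    Assumes 1-based, inclusive coordinates (an assumption about this page).
    """
    match = re.fullmatch(r"(?:coord:\s*)?(\d+)\.\.(\d+)", coord.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {coord!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end position precedes start position")
    return end - start + 1


# Spans copied from the annotation table above.
for label, coord in [
    ("PF13359 (DDE_Tnp_4)", "coord: 285..441"),
    ("mobidb-lite", "coord: 120..145"),
    ("PTHR22930", "coord: 84..492"),
]:
    print(label, span_length(coord))
```

Under that assumption the Pfam domain covers 157 residues, the predicted disordered region 26, and the PANTHER family match 409.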