CsGy2G002290 (gene) Cucumber (Gy14) v2

Name: CsGy2G002290
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: putative nuclease HARBI1
Location: Chr2 : 1506275 .. 1507618 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGTTCCGCCATGAACGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAAGAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTTCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAATCCCTACTTTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCACTACCTCCACCGCGCCAATGCTGGTTCCAGCGCTTCCTATCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTCCTTTCTCCGATTCAAAGCTCCCCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGGAGACGGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCTAGTGAATTACTGAAGGGTCCTGTTTATAATCTCGACAATGAAAAACCCATTCCCCAATACTTGATCGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAAGATAGTTCTGGCTTTTGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAAAAGTTTCCTCTTTTCGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAGTAGCCTGAACTACAGAAGATGA

mRNA sequence

ATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGTTCCGCCATGAACGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAAGAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTTCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAATCCCTACTTTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCACTACCTCCACCGCGCCAATGCTGGTTCCAGCGCTTCCTATCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTCCTTTCTCCGATTCAAAGCTCCCCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGGAGACGGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCTAGTGAATTACTGAAGGGTCCTGTTTATAATCTCGACAATGAAAAACCCATTCCCCAATACTTGATCGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAAGATAGTTCTGGCTTTTGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAAAAGTTTCCTCTTTTCGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAGTAGCCTGAACTACAGAAGATGA

Coding sequence (CDS)

ATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGTTCCGCCATGAACGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAAGAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTTCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAATCCCTACTTTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCACTACCTCCACCGCGCCAATGCTGGTTCCAGCGCTTCCTATCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTCCTTTCTCCGATTCAAAGCTCCCCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGGAGACGGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCTAGTGAATTACTGAAGGGTCCTGTTTATAATCTCGACAATGAAAAACCCATTCCCCAATACTTGATCGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAAGATAGTTCTGGCTTTTGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAAAAGTTTCCTCTTTTCGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAGTAGCCTGAACTACAGAAGATGA

Protein sequence

MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR
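
The coordinates and sequence lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, the interval is assumed to be 1-based and inclusive, and the protein length of 447 residues is taken from the protein sequence above.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval."""
    return end - start + 1


def cds_length(protein_length: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_length + (1 if has_stop else 0))


# Location given in the record: Chr2 : 1506275 .. 1507618 (+)
gene_bp = span_length(1506275, 1507618)

# 447 residues plus one stop codon (the CDS ends in TGA)
cds_bp = cds_length(447)

print(gene_bp, cds_bp, gene_bp == cds_bp)
```

Both values come to 1344 bp, which is what would be expected if the gene span contains no intron and the gene, mRNA and CDS sequences coincide.
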
BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_004139403.1 (PREDICTED: putative nuclease HARBI1 [Cucumis sativus] >KGN60730.1 hypothetical protein Csa_2G008700 [Cucumis sativus])

HSP 1 Score: 896.0 bits (2314), Expect = 5.3e-257
Identity = 447/447 (100.00%), Positives = 447/447 (100.00%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 448
           DGEIGDGRGKDIRDALALHLSSLNYRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447
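
The per-hit summary line can be parsed and its percentages recomputed from the raw counts. This is a minimal sketch and not the database's own tooling: `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the exact line layout used in this record, accepting either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 434/447 (97.09%), Postives = 437/447 (97.76%)"
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and percentages from an HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as printed in the record."""
    return round(100.0 * count / total, 2)


stats = parse_hsp_stats(
    "Identity = 434/447 (97.09%), Postives = 437/447 (97.76%), Query Frame = 0"
)
print(stats)
print(percent(stats["identical"], stats["aligned_length"]))
```

Recomputing the percentage from the counts is a cheap way to catch transcription errors when these lines are copied by hand.
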

BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_008457314.1 (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 859.8 bits (2220), Expect = 4.2e-246
Identity = 434/447 (97.09%), Positives = 437/447 (97.76%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMN AAAAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMN-AAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP F
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPPF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 448
           DGEIGDGRGKDIRDALALHLSSL+YRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLSYRR 446
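
The identity count in a summary line is the number of alignment columns where the query and subject residues match. The sketch below counts these for one aligned block; `count_identities` is a hypothetical helper, and it assumes both rows have been copied without their coordinates and are the same length. Masked `X` residues are not treated specially.

```python
from typing import Tuple


def count_identities(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical columns, total columns) for two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = sum(
        1 for q, s in zip(query, subject)
        if q == s and q != "-"  # a gap never counts as an identity
    )
    return identical, len(query)


# Last block of the Cucumis melo alignment (query positions 421 onward)
query_row = "DGEIGDGRGKDIRDALALHLSSLNYRR"
sbjct_row = "DGEIGDGRGKDIRDALALHLSSLSYRR"

print(count_identities(query_row, sbjct_row))
```

For this block the result is 26 identical columns out of 27, the single difference being N versus S near the C-terminus. Summing the per-block counts over a whole alignment should reproduce the numerator of the Identity field.
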

BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_023536803.1 (protein ALP1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 767.3 bits (1980), Expect = 2.9e-218
Identity = 389/449 (86.64%), Positives = 412/449 (91.76%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSL
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVT-TRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALES + STSSSL ALSP LPKLSL    PPPRQCWFQRFLSAT++VDC
Sbjct: 61  NDLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH---PPPRQCWFQRFLSATAEVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNL FRMSKSSFSLLLRLLSPIQS  S+SVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLFFRMSKSSFSLLLRLLSPIQSCSSTSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+
Sbjct: 181 IDSADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGD 240

Query: 241 L--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 300
           L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Sbjct: 241 LLGKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNL 300

Query: 301 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 360
           D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++
Sbjct: 301 DDGKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKV 360

Query: 361 RARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP 420
           RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL EEQD+++GASCSSEEQKFP
Sbjct: 361 RARWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLVEEQDEDDGASCSSEEQKFP 420

Query: 421 LFDGEIGDGRGKDIRDALALHLSSLNYRR 448
           L+DGE GD RGKDIRDALALHLS L++RR
Sbjct: 421 LYDGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_022938170.1 (protein ALP1-like [Cucurbita moschata] >XP_022938171.1 protein ALP1-like [Cucurbita moschata])

HSP 1 Score: 766.9 bits (1979), Expect = 3.7e-218
Identity = 389/449 (86.64%), Positives = 412/449 (91.76%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSL
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVT-TRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALES + STSSSL ALSP LPKLSL    PPPRQCWFQRFLSAT++VDC
Sbjct: 61  NDLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH---PPPRQCWFQRFLSATAEVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+
Sbjct: 181 IDSADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGD 240

Query: 241 L--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 300
           L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Sbjct: 241 LLGKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNL 300

Query: 301 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 360
           D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++
Sbjct: 301 DDGKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKV 360

Query: 361 RARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP 420
           RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL+EEQD+ +GASCSSEEQKF 
Sbjct: 361 RARWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLEEEQDEXDGASCSSEEQKFA 420

Query: 421 LFDGEIGDGRGKDIRDALALHLSSLNYRR 448
           L+DGE GD RGKDIRDALALHLS L++RR
Sbjct: 421 LYDGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_022965738.1 (protein ALP1-like [Cucurbita maxima] >XP_022965739.1 protein ALP1-like [Cucurbita maxima])

HSP 1 Score: 736.5 bits (1900), Expect = 5.4e-209
Identity = 376/449 (83.74%), Positives = 398/449 (88.64%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSL
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVT-TRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALES + STSSSL ALSP LPKLSL    PPPRQCWFQRFLSAT++VDC
Sbjct: 61  NDLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH---PPPRQCWFQRFLSATAEVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+
Sbjct: 181 IDSADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGD 240

Query: 241 L--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 300
           L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Sbjct: 241 LLGKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNL 300

Query: 301 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 360
           D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++
Sbjct: 301 DDGKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKV 360

Query: 361 RARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP 420
           RARWKLLSKPWKE CRDFFPF++LTGCLL NFLIKCSEKL+                KFP
Sbjct: 361 RARWKLLSKPWKEECRDFFPFVVLTGCLLHNFLIKCSEKLEXXXXXXXXXXXXXXXXKFP 420

Query: 421 LFDGEIGDGRGKDIRDALALHLSSLNYRR 448
           L+DGE GD RGKDIRDALALHLS L++RR
Sbjct: 421 LYDGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of CsGy2G002290 vs. TAIR10
Match: AT1G72270.1 (Ribosome 60S biogenesis N-terminal (InterPro:IPR021714))

HSP 1 Score: 271.6 bits (693), Expect = 8.9e-73
Identity = 175/415 (42.17%), Positives = 233/415 (56.14%), Query Frame = 0

Query: 39  LNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPP 98
           L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S       +SP         
Sbjct: 21  LKDPLLRRLSSAAAVTNSFLQANDLFLSPSQTLRLESLISSL-----PISPS----XXXX 80

Query: 99  PLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCA 158
                    F RFL++ ++ + DPRW L FRMSKS+F  L  +LS       SS+P   +
Sbjct: 81  XXXXXXXXXFNRFLTSATEDEDDPRWCLYFRMSKSTFFSLYSILS------HSSLP---S 140

Query: 159 LAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 218
            AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     
Sbjct: 141 FAATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEKLS------QQLDDPKPD 200

Query: 219 FGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATIL 278
           F    LPNC GV+G  RF  +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I 
Sbjct: 201 FSPNLLPNCYGVVGFGRFEVKGKLLGAKGSILVQALVDSNGRFVDISAGWPSTMKPEAIF 260

Query: 279 RQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF 338
           RQ+KL++  E   E+L G    L N   +P+Y++GDSC PLLPWL+TPY   ++E+S   
Sbjct: 261 RQTKLFSIAE---EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLVTPYDLTSDEES--- 320

Query: 339 CGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCS 398
               FN+     +  V  AF ++RARW++L K WK    +F PF+I TGCLL NFL+   
Sbjct: 321 FREEFNNVVHTGLHSVEIAFAKVRARWRILDKKWKPETIEFMPFVITTGCLLHNFLVNSG 380

Query: 399 EKLD---------EEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLS 442
           +  D         E  D  E      +E++   F+GE      K IRDA+A +LS
Sbjct: 381 DDDDSVEECVNGCEAGDNGEMRKDDDKEEETRSFEGE-AYRESKRIRDAIAENLS 404

BLAST of CsGy2G002290 vs. TAIR10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 145.6 bits (366), Expect = 7.4e-35
Identity = 104/361 (28.81%), Positives = 161/361 (44.60%), Query Frame = 0

Query: 107 WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSPSSSVPPDC-- 166
           W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P     
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLND 113

Query: 167 ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 226
            +A AL RL  G S   +G  FG++ +   +  +   +++ E+  H L   S +D I   
Sbjct: 114 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPSKLDEIKSK 173

Query: 227 FGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSA 286
           F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV A
Sbjct: 174 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGE-KNFSMTLQAVVDPDMRFLDVIA 233

Query: 287 GWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTP 346
           GWP S+    +L+ S  Y  +EK    L G    L     + +Y++GDS FPLLPWLLTP
Sbjct: 234 GWPGSLNDDVVLKNSGFYKLVEKGKR-LNGEKLPLSERTELREYIVGDSGFPLLPWLLTP 293

Query: 347 YMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT 406
           Y    +   +      FN  H  A      A  +L+ RW++++       R+  P II  
Sbjct: 294 Y----QGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFV 353

Query: 407 GCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSL 445
            CLL N +I       E+Q  ++       +  +     ++ D     +RD L+  L   
Sbjct: 354 CCLLHNIIIDM-----EDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 403

BLAST of CsGy2G002290 vs. TAIR10
Match: AT3G63270.1 (Putative harbinger transposase-derived nuclease (InterPro:IPR006912))

HSP 1 Score: 131.3 bits (329), Expect = 1.4e-30
Identity = 92/297 (30.98%), Positives = 143/297 (48.15%), Query Frame = 0

Query: 128 FRMSKSSFSLLLRLLSP--IQSSPSSSVPPDCAL-------AAALFRLAHGASYKAVGRR 187
           FR SK++FS +  L+    I   PS  +  +  L       A AL RLA G S  +VG  
Sbjct: 69  FRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAA 128

Query: 188 FGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRF 247
           FG+  +   +  +   +A+ E+  H L       I+ I   F     LPNCCG +     
Sbjct: 129 FGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEMYGLPNCCGAIDTTHI 188

Query: 248 -----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEI 307
                       +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  + ++
Sbjct: 189 IMTLPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFF-KL 248

Query: 308 EKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTH 367
            +++++L G    L     I +Y++G   +PLLPWL+TP+   +  DS      AFN  H
Sbjct: 249 CENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSM----VAFNERH 308

Query: 368 GRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE 402
            +  ++  TAF +L+  W++LSK      R   P IIL  CLL N +I C + L E+
Sbjct: 309 EKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQED 360

BLAST of CsGy2G002290 vs. TAIR10
Match: AT5G12010.1 (unknown protein)

HSP 1 Score: 94.7 bits (234), Expect = 1.5e-19
Identity = 77/300 (25.67%), Positives = 126/300 (42.00%), Query Frame = 0

Query: 127 SFRMSKSSFSLLLRLLSPI----QSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGID 186
           +FRMSKS+F L+   L+       ++  +++P    +A  ++RLA G   + V ++FG+ 
Sbjct: 178 AFRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAVCIWRLATGEPLRLVSKKFGLG 237

Query: 187 SADACRSFYAVCKAINE----------------KLGHLLELRSDIDRIVVGFGWISLPNC 246
            +   +    VCKAI +                 +    E  S I  +V       +P  
Sbjct: 238 ISTCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERFESVSGIPNVVGSMYTTHIPII 297

Query: 247 CGVLGL-----RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY 306
              + +     +R     +  + S+ +QA+V+ +G F D+  GWP SM    +L +S LY
Sbjct: 298 APKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLY 357

Query: 307 AEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFN 366
                   LLKG             ++ G    PLL W+L PY + N      +   AFN
Sbjct: 358 QRANNGG-LLKG------------MWVAGGPGHPLLDWVLVPYTQQN----LTWTQHAFN 417

Query: 367 STHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE 402
                   +   AF RL+ RW  L K  +   +D  P ++   C+L N      EK++ E
Sbjct: 418 EKMSEVQGVAKEAFGRLKGRWACLQKRTEVKLQD-LPTVLGACCVLHNICEMREEKMEPE 459

BLAST of CsGy2G002290 vs. TAIR10
Match: AT4G29780.1 (unknown protein)

HSP 1 Score: 85.9 bits (211), Expect = 6.9e-17
Identity = 83/319 (26.02%), Positives = 136/319 (42.63%), Query Frame = 0

Query: 128 FRMSKSSFSLLLRLLSPIQSSPSS----SVPPDCALAAALFRLAHGASYKAVGRRFGIDS 187
           FRMSKS+F+L+   L    +  ++    ++P    +   ++RLA GA  + V  RFG+  
Sbjct: 217 FRMSKSTFNLICEELDTTVTKKNTMLRDAIPAPKRVGVCVWRLATGAPLRHVSERFGLGI 276

Query: 188 ADACRSFYAVCKAINEKLGH---LLELRSDIDRIVVGFGWI-SLPNCCGVLGL------- 247
           +   +    VC+AI + L     L    S+I+     F  +  +PN  G +         
Sbjct: 277 STCHKLVIEVCRAIYDVLMPKYLLWPSDSEINSTKAKFESVHKIPNVVGSIYTTHIPIIA 336

Query: 248 ----------RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYA 307
                     +R     +  + S+ VQ +V+A+G F DV  G P S+    IL +S L  
Sbjct: 337 PKVHVAAYFNKRHTERNQKTSYSITVQGVVNADGIFTDVCIGNPGSLTDDQILEKSSLSR 396

Query: 308 EIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNS 367
           +   +  +L+              +++G+S FPL  +LL PY   N      +   AFN 
Sbjct: 397 Q-RAARGMLR------------DSWIVGNSGFPLTDYLLVPYTRQN----LTWTQHAFNE 456

Query: 368 THGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQ 422
           + G    +   AF RL+ RW  L K  +   +D  P+++   C+L N    C  + +E  
Sbjct: 457 SIGEIQGIATAAFERLKGRWACLQKRTEVKLQD-LPYVLGACCVLHNI---CEMRKEE-- 504

BLAST of CsGy2G002290 vs. Swiss-Prot
Match: sp|Q9M2U3|ALPL_ARATH (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 1.3e-33
Identity = 104/361 (28.81%), Positives = 161/361 (44.60%), Query Frame = 0

Query: 107 WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSPSSSVPPDC-- 166
           W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P     
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLND 113

Query: 167 ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 226
            +A AL RL  G S   +G  FG++ +   +  +   +++ E+  H L   S +D I   
Sbjct: 114 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPSKLDEIKSK 173

Query: 227 FGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSA 286
           F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV A
Sbjct: 174 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGE-KNFSMTLQAVVDPDMRFLDVIA 233

Query: 287 GWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTP 346
           GWP S+    +L+ S  Y  +EK    L G    L     + +Y++GDS FPLLPWLLTP
Sbjct: 234 GWPGSLNDDVVLKNSGFYKLVEKGKR-LNGEKLPLSERTELREYIVGDSGFPLLPWLLTP 293

Query: 347 YMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT 406
           Y    +   +      FN  H  A      A  +L+ RW++++       R+  P II  
Sbjct: 294 Y----QGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFV 353

Query: 407 GCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSL 445
            CLL N +I       E+Q  ++       +  +     ++ D     +RD L+  L   
Sbjct: 354 CCLLHNIIIDM-----EDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 403

BLAST of CsGy2G002290 vs. Swiss-Prot
Match: sp|Q94K49|ALP1_ARATH (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 2.6e-29
Identity = 92/297 (30.98%), Positives = 143/297 (48.15%), Query Frame = 0

Query: 128 FRMSKSSFSLLLRLLSP--IQSSPSSSVPPDCAL-------AAALFRLAHGASYKAVGRR 187
           FR SK++FS +  L+    I   PS  +  +  L       A AL RLA G S  +VG  
Sbjct: 69  FRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAA 128

Query: 188 FGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRF 247
           FG+  +   +  +   +A+ E+  H L       I+ I   F     LPNCCG +     
Sbjct: 129 FGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEMYGLPNCCGAIDTTHI 188

Query: 248 -----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEI 307
                       +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  + ++
Sbjct: 189 IMTLPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFF-KL 248

Query: 308 EKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTH 367
            +++++L G    L     I +Y++G   +PLLPWL+TP+   +  DS      AFN  H
Sbjct: 249 CENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSM----VAFNERH 308

Query: 368 GRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE 402
            +  ++  TAF +L+  W++LSK      R   P IIL  CLL N +I C + L E+
Sbjct: 309 EKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQED 360

BLAST of CsGy2G002290 vs. Swiss-Prot
Match: sp|B0BN95|HARB1_RAT (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 1.2e-10
Identity = 67/277 (24.19%), Positives = 104/277 (37.55%), Query Frame = 0

Query: 138 LLRLLSPIQSSP---SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC 197
           L+ LL    S P   S ++ P+  + AAL     G+    +G   GI  A   R    V 
Sbjct: 49  LVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVT 108

Query: 198 KAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVL----------GLRRFGFEGE 257
           +A+ E+    +   +D   I       +G   +P   G +                +   
Sbjct: 109 EALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCIHVAIKAPNAEDLSYVNR 168

Query: 258 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 317
               SL    + D  G  + V   WP S++   +L+QS L ++ E               
Sbjct: 169 KGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSLSSQFETG------------- 228

Query: 318 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLR- 377
             P   +L+GDS F L  WLLTP + + E  +     RA ++TH      + T  CR R 
Sbjct: 229 -MPKDSWLLGDSSFFLHTWLLTP-LHIPETPAEYRYNRAHSATHSVIEKTLRTLCCRFRC 288

Query: 378 ---ARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIK 394
              ++  L   P K         IIL  C+L N  ++
Sbjct: 289 LDGSKGALQYSPEKSS------HIILACCVLHNISLE 304

BLAST of CsGy2G002290 vs. Swiss-Prot
Match: sp|Q8BR93|HARB1_MOUSE (Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 1.0e-09
Identity = 66/275 (24.00%), Positives = 101/275 (36.73%), Query Frame = 0

Query: 138 LLRLLSPIQSSP---SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC 197
           L+ LL    S P   S ++ P+  + AAL     G+    +G   GI  A   R    V 
Sbjct: 49  LVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVT 108

Query: 198 KAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVL----------GLRRFGFE 257
           +A+ E+    +     +D   V       +G   +P   GV                 + 
Sbjct: 109 EALVERASQFIHF--PVDEAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDLSYV 168

Query: 258 GELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 317
                 SL    + D  G  + V   WP S++   +L++S L ++ E             
Sbjct: 169 NRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSLTSQFETG----------- 228

Query: 318 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 377
               P   +L+GDS F L  WLLTP + + E  +     RA ++TH      + T  CR 
Sbjct: 229 ---MPKDSWLLGDSSFFLRSWLLTP-LPIPETAAEYRYNRAHSATHSVIERTLQTLCCRF 288

Query: 378 RARWKLLSKPWKEGCRDFFP----FIILTGCLLQN 390
           R           +G   + P     IIL  C+L N
Sbjct: 289 RC------LDGSKGALQYSPEKCSHIILACCVLHN 300

BLAST of CsGy2G002290 vs. TrEMBL
Match: tr|A0A0A0LFB5|A0A0A0LFB5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G008700 PE=4 SV=1)

HSP 1 Score: 896.0 bits (2314), Expect = 3.5e-257
Identity = 447/447 (100.00%), Positives = 447/447 (100.00%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 448
           DGEIGDGRGKDIRDALALHLSSLNYRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447

BLAST of CsGy2G002290 vs. TrEMBL
Match: tr|A0A1S3C5W6|A0A1S3C5W6_CUCME (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103497032 PE=4 SV=1)

HSP 1 Score: 859.8 bits (2220), Expect = 2.8e-246
Identity = 434/447 (97.09%), Positives = 437/447 (97.76%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMN AAAAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMN-AAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP F
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPPF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 448
           DGEIGDGRGKDIRDALALHLSSL+YRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLSYRR 446

BLAST of CsGy2G002290 vs. TrEMBL
Match: tr|M5VQK6|M5VQK6_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_7G257400 PE=4 SV=1)

HSP 1 Score: 466.1 bits (1198), Expect = 9.0e-128
Identity = 242/410 (59.02%), Positives = 298/410 (72.68%), Query Frame = 0

Query: 43  LITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPP 102
           L++L+ T  S AHSFLS NDL LLPSQTL LE+LL                         
Sbjct: 35  LVSLVATATSLAHSFLSQNDLLLLPSQTLTLETLLSXXXXXXXXXXXXXXXXXXXXXXXX 94

Query: 103 PR--QCWFQRFLSATS-DVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCAL 162
               +CWF RFLSATS   + D RW+ +FRMS+ SFS+LL LLSP  +S   S+PP+  L
Sbjct: 95  XXXLECWFSRFLSATSASRNFDSRWSYTFRMSEHSFSILLSLLSPFLNSTIPSIPPNFVL 154

Query: 163 AAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFG 222
           AAA++RLAHGASYKAVGRRFG+DS +ACR+F+AVCKA+++KLG+L E RSDI RIV GFG
Sbjct: 155 AAAIYRLAHGASYKAVGRRFGLDSVEACRAFFAVCKAVSDKLGNLFEFRSDIARIVGGFG 214

Query: 223 WISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQ 282
           WISLPNCCGVLG  RFG  GE+   NGSLLVQALVD+EGRFLDVSAGWPS+MK  +I RQ
Sbjct: 215 WISLPNCCGVLGFGRFGVGGEVLGPNGSLLVQALVDSEGRFLDVSAGWPSAMKLESIFRQ 274

Query: 283 SKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCG 342
           +KLY  +E+S +LL GPVY L N K IPQY++GDSCFPLLPWLLTPY+  +E DS G   
Sbjct: 275 TKLYLGVEESRDLLNGPVYELGNGKAIPQYILGDSCFPLLPWLLTPYIRSDEADSFGSLE 334

Query: 343 RAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEK 402
           +AFNS H RAM LV+TAF R+RARW+LLS+ WKE C +F PF+I+TGCLL NFLIKCSE 
Sbjct: 335 KAFNSVHSRAMGLVDTAFGRVRARWQLLSRQWKEECVEFLPFVIVTGCLLHNFLIKCSEP 394

Query: 403 LDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR 448
           + ++  +      SS E++ P+F G++ D  G+ +RD LA HLS ++ RR
Sbjct: 395 MPDDNVK------SSREEELPVFHGQV-DESGERMRDVLAAHLSRVSLRR 437

BLAST of CsGy2G002290 vs. TrEMBL
Match: tr|A0A068UQF2|A0A068UQF2_COFCA (Uncharacterized protein OS=Coffea canephora OX=49390 GN=GSCOC_T00031239001 PE=4 SV=1)

HSP 1 Score: 453.4 bits (1165), Expect = 6.0e-124
Identity = 245/413 (59.32%), Positives = 296/413 (71.67%), Query Frame = 0

Query: 43  LITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSS---LHALSPRLPKLSLP-- 102
           LI  + T   SA SFL   DLHLLPSQ+L+LESLLCSTS+S   L +L+   P+ SLP  
Sbjct: 53  LIPHLVTATYSAISFLRHQDLHLLPSQSLSLESLLCSTSTSFSKLLSLTSFFPE-SLPLX 112

Query: 103 -PPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPD 162
                 P QCWF RFL++ +  D DPRW   F +SK SF+LLLRLL+P  SS  S +PP+
Sbjct: 113 XXXXXXPAQCWFDRFLTSAA-ADYDPRWTHFFNLSKPSFTLLLRLLTPSLSS-LSPLPPN 172

Query: 163 CALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVV 222
            ALAA LFRLAH AS+ AV RRF IDS  ACR+FY VCKAINE LGHL E +SDI+RI+V
Sbjct: 173 FALAATLFRLAHSASFSAVSRRFNIDSPAACRAFYTVCKAINENLGHLFEFKSDINRIIV 232

Query: 223 GFGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATI 282
           GFGWISLPNCCGVLGL +F  +G+L  +NGSL+VQALVD+EGRFLDVSAGWPS++ P  +
Sbjct: 233 GFGWISLPNCCGVLGLEKFKLDGDLLGENGSLVVQALVDSEGRFLDVSAGWPSTLTPEKV 292

Query: 283 LRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSG 342
           LRQSKL + +E++ E L GP + L +   IPQY++GDSCFPLLPWLLTPY +L+E     
Sbjct: 293 LRQSKLLSGVEETKEYLNGPSFELSDGNSIPQYILGDSCFPLLPWLLTPYKKLDENAGLN 352

Query: 343 FCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKC 402
               AFNS H   M LV  AF R+R RWKL++K W E C + FPF+I+T CLL NFLIKC
Sbjct: 353 SSEMAFNSVHSSGMELVRMAFGRVRKRWKLVAKKWSEQCVEAFPFVIVTCCLLHNFLIKC 412

Query: 403 SEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR 448
           SE +     Q+E A C S +Q+FP+FDGE+ D  GK IRDALA HLS  N RR
Sbjct: 413 SEAV-----QDEDAEC-SRDQEFPVFDGEV-DESGKRIRDALASHLSRANERR 455

BLAST of CsGy2G002290 vs. TrEMBL
Match: tr|A0A2R6S1I5|A0A2R6S1I5_ACTCH (Nuclease OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc00673 PE=4 SV=1)

HSP 1 Score: 452.2 bits (1162), Expect = 1.3e-123
Identity = 246/412 (59.71%), Positives = 300/412 (72.82%), Query Frame = 0

Query: 42  QLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRL---PKLSLPP 101
           ++++ +    S+AHSFLS +DL LLPSQTL+LES L STS SL  L   L   P LS   
Sbjct: 33  RILSAVAAATSAAHSFLSHHDLRLLPSQTLSLESHLSSTSLSLSNLFSLLSLPPSLSSQL 92

Query: 102 PLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCA 161
           P PP    WF RFLSA +  D DPRW  +FRMSK SFSLLLRLL+P  SS  S +PP+ A
Sbjct: 93  PPPPSSPSWFHRFLSAAA-ADYDPRWTDAFRMSKPSFSLLLRLLTPSLSS-LSPIPPNLA 152

Query: 162 LAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGF 221
           LAA LFRLAH ASYKAV RRF +DSA +CR+FY VCKAI +KLGH+ E +SDI+RIVVGF
Sbjct: 153 LAATLFRLAHAASYKAVSRRFMLDSATSCRAFYTVCKAIVDKLGHMFEFKSDINRIVVGF 212

Query: 222 GWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILR 281
           GWISLPNCCGVLG  RF  +G++  +NGSL+VQ LVD+EGRFLDVSAGWPS+M+P TILR
Sbjct: 213 GWISLPNCCGVLGFDRFQMDGKILGENGSLMVQGLVDSEGRFLDVSAGWPSTMRPETILR 272

Query: 282 QSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYM-ELNEEDSSGF 341
           QS+L+A +++S ELL GP + L +   IPQY++G+SCFPLLPWLLTPY+   N  +SS +
Sbjct: 273 QSRLFAGVDESRELLNGPCFELGDGNSIPQYILGESCFPLLPWLLTPYVGHRNGLNSSEW 332

Query: 342 CGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCS 401
              AFNS H + M LV TAF R+RARWKLLSK WKE C + FPF+I+T CLL NFLIKCS
Sbjct: 333 ---AFNSVHRQGMELVGTAFGRVRARWKLLSKNWKEECIEAFPFVIVTCCLLHNFLIKCS 392

Query: 402 EKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR 448
           E L      +      S EQ+   F+GE+ D  G+  RDALALHL+ ++ RR
Sbjct: 393 EAL-----PDVNVEYYSREQELLPFEGEV-DENGRRTRDALALHLNRVSQRR 433
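
The percentages in the HSP summary lines above follow directly from the raw counts (for example, 242 identical positions out of 410 aligned columns gives 59.02%). A minimal Python sketch of that arithmetic is shown here; the function name `check_hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern accepts both "Positives" and the variant spelling "Postives".

```python
import re

# Hypothetical helper (not part of BLAST): matches "Identity = 242/410 (59.02%)"
# and the equivalent "Positives"/"Postives" field on an HSP summary line.
HSP_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def check_hsp_percentages(line):
    """Return {field: (recomputed_percent, reported_percent)} for one summary line."""
    results = {}
    for label, hits, length, reported in HSP_FIELD.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        recomputed = round(100 * int(hits) / int(length), 2)
        results[key] = (recomputed, float(reported))
    return results


if __name__ == "__main__":
    summary = "Identity = 242/410 (59.02%), Positives = 298/410 (72.68%), Query Frame = 0"
    for field, (recomputed, reported) in check_hsp_percentages(summary).items():
        print(f"{field}: recomputed {recomputed:.2f}%, reported {reported:.2f}%")
```

The denominator is the alignment length including gap columns, which is why it can differ from the length of either sequence segment.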

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_004139403.1	5.3e-257	100.00	PREDICTED: putative nuclease HARBI1 [Cucumis sativus] >KGN60730.1 hypothetical p...
XP_008457314.1	4.2e-246	97.09	PREDICTED: putative nuclease HARBI1 [Cucumis melo]
XP_023536803.1	2.9e-218	86.64	protein ALP1-like [Cucurbita pepo subsp. pepo]
XP_022938170.1	3.7e-218	86.64	protein ALP1-like [Cucurbita moschata] >XP_022938171.1 protein ALP1-like [Cucurb...
XP_022965738.1	5.4e-209	83.74	protein ALP1-like [Cucurbita maxima] >XP_022965739.1 protein ALP1-like [Cucurbit...
Match Name	E-value	Identity	Description
AT1G72270.1	8.9e-73	42.17	Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)
AT3G55350.1	7.4e-35	28.81	PIF / Ping-Pong family of plant transposases
AT3G63270.1	1.4e-30	30.98	Putative harbinger transposase-derived nuclease (InterPro:IPR006912)
AT5G12010.1	1.5e-19	25.67	unknown protein
AT4G29780.1	6.9e-17	26.02	unknown protein
Match Name	E-value	Identity	Description
sp|Q9M2U3|ALPL_ARATH	1.3e-33	28.81	Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
sp|Q94K49|ALP1_ARATH	2.6e-29	30.98	Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
sp|B0BN95|HARB1_RAT	1.2e-10	24.19	Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1
sp|Q8BR93|HARB1_MOUSE	1.0e-09	24.00	Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1
Match Name	E-value	Identity	Description
tr|A0A0A0LFB5|A0A0A0LFB5_CUCSA	3.5e-257	100.00	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G008700 PE=4 SV=1
tr|A0A1S3C5W6|A0A1S3C5W6_CUCME	2.8e-246	97.09	putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103497032 PE=4 SV=1
tr|M5VQK6|M5VQK6_PRUPE	9.0e-128	59.02	Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_7G257400 PE=4 SV=1
tr|A0A068UQF2|A0A068UQF2_COFCA	6.0e-124	59.32	Uncharacterized protein OS=Coffea canephora OX=49390 GN=GSCOC_T00031239001 PE=4 ...
tr|A0A2R6S1I5|A0A2R6S1I5_ACTCH	1.3e-123	59.71	Nuclease OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc00673 PE=4...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR027806	HARBI1_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsGy2G002290.1	CsGy2G002290.1	mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR027806	Harbinger transposase-derived nuclease domain	PFAM	PF13359	DDE_Tnp_4	coord: 239..389; e-value: 1.1E-15; score: 57.6
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 404..427
None	No IPR available	PANTHER	PTHR22930	UNCHARACTERIZED	coord: 32..444
None	No IPR available	PANTHER	PTHR22930:SF87	SUBFAMILY NOT NAMED	coord: 32..444
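
The `coord: start..end` values in the InterPro annotations above give the protein positions covered by each match. As a rough sketch, and assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the covered length can be computed as follows. The helper name `span_length` is hypothetical.

```python
import re

# Hypothetical helper: accepts "coord: 239..389" or a bare "239..389".
COORD = re.compile(r"(?:coord:\s*)?(\d+)\.\.(\d+)")


def span_length(coord):
    """Number of residues covered, ASSUMING 1-based inclusive coordinates."""
    match = COORD.fullmatch(coord.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {coord!r}")
    start, end = (int(value) for value in match.groups())
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


if __name__ == "__main__":
    annotations = {
        "PF13359": "coord: 239..389",
        "mobidb-lite": "coord: 404..427",
        "PTHR22930": "coord: 32..444",
    }
    for source_term, coord in annotations.items():
        print(source_term, span_length(coord))
```

Under that assumption the Pfam DDE_Tnp_4 match covers 151 residues and the PANTHER family match covers 413.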