Cla97C07G131470 (gene) Watermelon (97103) v2

NameCla97C07G131470
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionHarbinger transposase-derived nuclease
LocationCla97Chr07 : 2994523 .. 2999277 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGACCCATTAGAGGGTTCAAGAGAAAGAAGAAGGCAGAGAAAAAGGTTGACCAAAATGTCTTCGCTGCTGCTTCACTATCGTCTCAGCCCCAGCCCTTGGATTGGTGGGATGAGTTCTCCCAGAGGATTACTGGTACAAACTATTTTCTTTCTTTTTTTTGCTTCTCTTCGAACTCTGCCTCTTCTTTACTACTTGTTAATTTTATTCTCTCCGTTGCTCTTTTGTTGAATTTGTGCATTGTCATTGCTCTTTGTTTCTTTCTGTTTACTGGGTCTCTCGACTGAATTCTCTAGTGTTTGGTGAGGTTTTTGGACGGCGGTTGAATGTAGTGTTCGGTAATCATATATCTTCGGTTGTGCGTTTAAATATTGAGTTTTGCAATTATAGAACCGTGATATTGTTCCTCATAATCTGTAAAATCTTGAAGTTTTTCTTTGACATCTTTATAACTTTGAGAGGGTCAGAAGGGGCAGCCTGTTGAAGTGATCTTATGAGTATTACTGCTTTTGGAATTCTGTGGAGATGGATCAAATCTGTTCAGCAGTAGTTATTGACTTTAATTGATGTGTACAAGTAGTGGCTACTCACTTAGCTGGTTTATTGAGTGTTAACTTGTGTTGTTCAAATTTGATGATGAAGTTGGTGTGGTACGTTTTTGTTTCCAAGGAGTTGTGTCTTCAAAATTTTGTTTAGGACATTGTTCGATTTCACTCTTGATATCTTTAATTAATTGGATTATTATGCTATATCTTCATTGAGATGAGTTTTTATATAGAATCATGGACAATGCTGCTTGAAGAATTGGTTGTCACTGACACTATGACTGATTGGTTATTAACTATTTCTTAGCTGCTCTTACTTCTTCAAGCTTTGTTGTTCTTAGCCACGTTTGCTAATTTCATGACTTAAAATTCCTCTATGTTTTAATTGATTGGCTAGATTATCTGGTCTGGAGCATTAGCTTAATCTATGGGTCATATTCAACAAAGTTTTAGGAAAGAGTAAAATTGATTTCTTATATATTGACTTTCTCCTCTGACATTCACATAGTATAGAAATTCTCCATTTGACACCATTCATTTTTATGTATTGATTTCTAAAAGAATTGTTTCACAATTCCTAGTTCCATCTCAAAAGGTCTTTCTTCTTCTGTGGGGAAAGCATGGGCAGGCTTTCCATCTTACTATATGAATTTTATTCCCAAAATATCCTCTTGGTTTTCTTTGGTAGCAGCAGCAATTATTTTTGTTTAATGTAAGTAAATAATACTAATATAACATAAAGATGTACTATGTCATCAATTAATTGATCTTTAGCCAACATCAACTGAAAGTTCCTAAACAATTTTTTGTCGCAGCATTTGCTGGCAATAACCATCTTTAAAGAACTGAGTTCACTGTGATCCACAACTGTTGTTTTCTTCTTTACCAATTCGTATTAAACATATCACTCTTGAGTATAGTATATTGAAGTTAGAATCTGTCATCTGTTGATATTTGTAAGGCCACTGAGGAATGGATGGATGCCTATGTAAGTAATAGTGTAGAATTGTAGATAACACGGGGGAATGGGCAGTATTTGACCTCAAGTATGTGTTTTTTATGGTCTGTTGTTGTATGTGTATGGAAAACAAGAGAAATCGATTAAAATATGTTATTCATGTCAATATAGTATGTCAGAATTTAGAAGGCATGCTAAACAATGAGATCTAGAGAGCTAATTCTTTTTGGTTAATATCATTTGTTCTATTGTTACATAATCATGGTTAATGGCCGTCAAATTCAAATCATTACACATAGTTGTATACTTTAATGCAACCTGATTTAGGTAGAATTTTAGAATATTATTCTAGAGATATCGTTGTGACTTTGTATCTTAGACAACCCCTTTATAATATCATTTATTAATTTTTAGTTATAAACATCTTTATCTGGTTGTTTTAGAGGTTTCTTGGAAATAGGCATGTTTAAGTTTCAATTCGATTCAATTAATAATGATGCAATGCATAAAAGATCACAGAAAACCATACTCTTGGATCTCTTCCACGTGGTCAATATTTGTGTCAACCTTCTGATATGGATAAGTGCTCAAGATTTGTTTCTGGTATTCCAGAAATGGAAATTTTCTTTATAATCTGTATTTTTCGCCAACACTTTTCAGGAGTTTAGAATGACAAGAAATCTGGAATTTTAGGACAGCCCATATTGCTAGTATGTATTCTTGATTGAATTTCACATATGAGACAAATTTTTAGCACATGAAAAAACAAAAAAAATTAAGACCCCACGATTGGAAAACAATTATATCTGAGAATTTTATCAACTTCTAAATGTCCCTAACGTAAAATATTGTTTCTTGACGTGTAAAAGCTTTATTAGGACAGCAGTTCTTGATGATTTCTTTTAGTGAAATAATCAAGAATGGATGTAAAATCAGATTTCTTAATTTTGTTCCCGCATGATCCCTTGAAACTAGAGTTTATACAAAATTTATTAGAAATTAAGGACACAGTTGTAGAACTTATAGAATACATCCTTTTCAAGTAAGGTCGTAGTTTGTACAAAATCTATTAATCAATAGATTTTCCCCCCTTAGCAGTAAATTTACTAGTTTTATCTCGATCTACCCTCTATTTTTCATGTCTAATGTCCTTCATTCTTTATTCATATCTTGTTCATTCCTCCCTTCTTGCCCATTGTCTTTTATAGTGTATCAGCATTCTTTTCAAATTGGACCTATGCTTCTAGCATGTCACAATTGGTATTTATTATACTAAAAGGATGTTTTGGGCACCTATTAGCAAAGATTGTTATTCGGCCTATTCGAACACTATCATTTTCGAATATGTATCAACTATTGTATTTTTTATTTTTTATTTTTTCAAAAAAAGAACCTTAGATGAGCTGACAACCAAACAACAATGGCTCTAACTTCAATCCAAATTCCCAGAAATACTTAGAAGTTGGCAACGAAACAGCTGGTTTAATTTATCTTGATTTGAAGTTATTAGAGAACCTAACAACATGATCAAAGTGGGTAGATTTTGTTAGATGTTAGTAAAACCAGAAGGCTAAATGAGCTTAAGGCACCAAAAATTAACACACGCTCAGTTCAGCTTTAGATATCAGTTCTTCTATTTTTATATTTCATGTTCTGAAATGTTGGTTACAGTTCAAGATAATTTCGGCTTCGTAATCTGAATGTCCACTACACATTAATTCACGTGAAGGCTTTAACTTTTTATTGTCGAGATTTTTCTGTAGCATTAGTCTATGTAGTAAAATAAAAGTTCTCACTCTTGTCAATGTTGAGGAAGCCATTGACCCGTTGTCACATTTAGCAGTTACTGTGCTCTGGCCAAGATTTATGATGCAATAAGTAAATCCTTACTTTTAGTTTGAAATCAAAGGGTTTGACTAAATGTGTGGTCAAATGACTTCGTCAGATTTTCGTCTGAGTATAACTACAGCATTTCTTCTGGCTCATTGCTCACAAAATTTTACTTTAGTATTTGTTTAGTCATAATTGAGTTGGACATCTTGATCTGATATTAATTAGCATCACTAATAATGTCAATTGCTTACCTTTACTATGAATTATCTCTCTCTATCTGTCATACCTGAAAAAGTAAAATGCCATGCAGTAAAGAGTTTCTCTTGGGACATTTGCAGCAGGGCTGATAATATCCTTGTTTTTGCCTCCAATCAGGGCCATTATCTCAGTCAAAGAATACAAAATTTGAGTCGGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTCAAGGAAGTTATGATGGCTAAAACCTCAAATTTTACCGACTTAAATGGCAAGCCTTTGTCTCTAAATGATCAAGTCGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCTAATATTGGTGATTCGTTTGGAATGAATCAATCATCAGTTTCTCAAATAACTTGGCGTTTCGTGGAGGCAATGGAAGAGAAAGGCCTCCACCACCTCTCGTGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCTAAGTTTAAGAAAATCAGAGGTCTTCCTAATTGTTGTGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGGCAGAATCTGCAAACGGCATCTGGCTTGATGGTGAGAAAAACTGCAGCATGGTCTTGCAAGTGATTGTAGATCCGGAAATGAGATTCTGTGACATCATCACAGGTTGGCCAGGAAGTTTAAGCGATGCTCTTGTGCTCCAAAGCTCGGGTTTTTTCAAACTTTCACAAGATGGCGAGCGGTTGAACGGCAAGAAGATGAAGCTCTCAGAAAGTTCAGAACTAGGAGAGTACATTATAGGAGACTCTGGTTTTCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTTCGGATTATCAGACCGAGTTCAATAAGCGACATTTCGCCACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGCTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGACAAACATAGACTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTCATCGATATGGAGGACGAGGTGCAAGACGAAATGCCCTTGTCTCATCATCACGATCCTAGTTACCGACAACAAAGTTGCAAATTCGCTGACAATACTGCTTCCATCGCTAGGGAGAAGCTTTCGATGTACTTATCAGGAAAGTTACCACCCTAA

mRNA sequence

ATGGGACCCATTAGAGGGTTCAAGAGAAAGAAGAAGGCAGAGAAAAAGGTTGACCAAAATGTCTTCGCTGCTGCTTCACTATCGTCTCAGCCCCAGCCCTTGGATTGGTGGGATGAGTTCTCCCAGAGGATTACTGGGCCATTATCTCAGTCAAAGAATACAAAATTTGAGTCGGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTCAAGGAAGTTATGATGGCTAAAACCTCAAATTTTACCGACTTAAATGGCAAGCCTTTGTCTCTAAATGATCAAGTCGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCTAATATTGGTGATTCGTTTGGAATGAATCAATCATCAGTTTCTCAAATAACTTGGCGTTTCGTGGAGGCAATGGAAGAGAAAGGCCTCCACCACCTCTCGTGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCTAAGTTTAAGAAAATCAGAGGTCTTCCTAATTGTTGTGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGGCAGAATCTGCAAACGGCATCTGGCTTGATGGTGAGAAAAACTGCAGCATGGTCTTGCAAGTGATTGTAGATCCGGAAATGAGATTCTGTGACATCATCACAGGTTGGCCAGGAAGTTTAAGCGATGCTCTTGTGCTCCAAAGCTCGGGTTTTTTCAAACTTTCACAAGATGGCGAGCGGTTGAACGGCAAGAAGATGAAGCTCTCAGAAAGTTCAGAACTAGGAGAGTACATTATAGGAGACTCTGGTTTTCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTTCGGATTATCAGACCGAGTTCAATAAGCGACATTTCGCCACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGCTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGACAAACATAGACTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTCATCGATATGGAGGACGAGGTGCAAGACGAAATGCCCTTGTCTCATCATCACGATCCTAGTTACCGACAACAAAGTTGCAAATTCGCTGACAATACTGCTTCCATCGCTAGGGAGAAGCTTTCGATGTACTTATCAGGAAAGTTACCACCCTAA

Coding sequence (CDS)

ATGGGACCCATTAGAGGGTTCAAGAGAAAGAAGAAGGCAGAGAAAAAGGTTGACCAAAATGTCTTCGCTGCTGCTTCACTATCGTCTCAGCCCCAGCCCTTGGATTGGTGGGATGAGTTCTCCCAGAGGATTACTGGGCCATTATCTCAGTCAAAGAATACAAAATTTGAGTCGGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTCAAGGAAGTTATGATGGCTAAAACCTCAAATTTTACCGACTTAAATGGCAAGCCTTTGTCTCTAAATGATCAAGTCGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCTAATATTGGTGATTCGTTTGGAATGAATCAATCATCAGTTTCTCAAATAACTTGGCGTTTCGTGGAGGCAATGGAAGAGAAAGGCCTCCACCACCTCTCGTGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCTAAGTTTAAGAAAATCAGAGGTCTTCCTAATTGTTGTGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGGCAGAATCTGCAAACGGCATCTGGCTTGATGGTGAGAAAAACTGCAGCATGGTCTTGCAAGTGATTGTAGATCCGGAAATGAGATTCTGTGACATCATCACAGGTTGGCCAGGAAGTTTAAGCGATGCTCTTGTGCTCCAAAGCTCGGGTTTTTTCAAACTTTCACAAGATGGCGAGCGGTTGAACGGCAAGAAGATGAAGCTCTCAGAAAGTTCAGAACTAGGAGAGTACATTATAGGAGACTCTGGTTTTCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTTCGGATTATCAGACCGAGTTCAATAAGCGACATTTCGCCACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGCTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGACAAACATAGACTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTCATCGATATGGAGGACGAGGTGCAAGACGAAATGCCCTTGTCTCATCATCACGATCCTAGTTACCGACAACAAAGTTGCAAATTCGCTGACAATACTGCTTCCATCGCTAGGGAGAAGCTTTCGATGTACTTATCAGGAAAGTTACCACCCTAA

Protein sequence

MGPIRGFKRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQQSCKFADNTASIAREKLSMYLSGKLPP
BLAST of Cla97C07G131470 vs. NCBI nr
Match: XP_004147700.1 (PREDICTED: putative nuclease HARBI1 [Cucumis sativus] >KGN50531.1 hypothetical protein Csa_5G180900 [Cucumis sativus])

HSP 1 Score: 778.5 bits (2009), Expect = 1.1e-221
Identity = 377/392 (96.17%), Postives = 386/392 (98.47%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKK EKKVDQNVFA+ASLSSQ QPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG
Sbjct: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120

Query: 121 MNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIM 180
           +NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IKSKFKKIRGLPNCCGV+ETTHIM
Sbjct: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTHIM 180

Query: 181 MTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPT+ESANGIWLD EKNCSM+LQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS
Sbjct: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRL 300
           QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHFATRL
Sbjct: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCKFADNTASIAREKLSMYLSGKLPP 393
           PSYRQQSC+F DNTASI+REKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of Cla97C07G131470 vs. NCBI nr
Match: XP_008461643.1 (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 777.7 bits (2007), Expect = 1.9e-221
Identity = 377/392 (96.17%), Postives = 386/392 (98.47%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKK EKKVDQNVFA+ASLSSQ QPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG
Sbjct: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120

Query: 121 MNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIM 180
           +NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IKSKFKKIRGLPNCCGVIETTHIM
Sbjct: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTHIM 180

Query: 181 MTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPT+ESANGIWLD EKNCSM+LQVIVDPEMRFCDIITGWPGSLSD+LVLQSSGFFKLS
Sbjct: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRL 300
           QDGERLNGKKM+LSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHFATRL
Sbjct: 241 QDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCKFADNTASIAREKLSMYLSGKLPP 393
           PSYRQQSC+F DNTASIAREKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of Cla97C07G131470 vs. NCBI nr
Match: XP_022138922.1 (protein ALP1-like [Momordica charantia])

HSP 1 Score: 763.5 bits (1970), Expect = 3.6e-217
Identity = 372/393 (94.66%), Postives = 380/393 (96.69%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKN-TKFESV 60
           MGPIRGFKRKKKAEKKVDQNV AAASLSSQPQPLDWWD+FSQRITGPLSQSKN TKFESV
Sbjct: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60

Query: 61  FKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSF 120
           FKISRKTFSYICSLVKE MMAKTSNFTDLNGKPLS+NDQVAVALRRL SGESLS IGDSF
Sbjct: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHI 180
           GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKI+GLPNCCGVIETTHI
Sbjct: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180

Query: 181 MMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 240
           MMTLPTAES NG+WLD EKNCSM+LQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTAESXNGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATR 300
           SQDGERLNGK MKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRH+ATR
Sbjct: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DPSYRQQSCKFADNTASIAREKLSMYLSGKLPP 393
           D  YRQQSCKF DNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393

BLAST of Cla97C07G131470 vs. NCBI nr
Match: XP_023545365.1 (protein ALP1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 753.8 bits (1945), Expect = 2.9e-214
Identity = 366/394 (92.89%), Postives = 379/394 (96.19%), Query Frame = 0

Query: 1   MGPIRGFKRK--KKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRK  KKA+KKV Q VFAAASLS QPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMDQIKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK 240
           IMMTLPT ESANG+WLD EKNCSM+LQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 LSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFAT 300
            SQDGERLNGKKMKLSE+SELGEYIIGDSGFPLLPWLLTPYQGKGL+DYQTEFNKRHF+T
Sbjct: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFADNTASIAREKLSMYLSGKLPP 393
           HDPSYRQQSCKF DNT SI REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394

BLAST of Cla97C07G131470 vs. NCBI nr
Match: XP_022956519.1 (protein ALP1-like [Cucurbita moschata])

HSP 1 Score: 753.4 bits (1944), Expect = 3.7e-214
Identity = 366/394 (92.89%), Postives = 379/394 (96.19%), Query Frame = 0

Query: 1   MGPIRGFKRK--KKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRK  KKA+KKV Q VFAAASLS QPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK 240
           IMMTLPT ESANG+WLD EKNCSM+LQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 LSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFAT 300
            SQDGERLNGKKMKLSES+ELGEYIIGDSGFPLLPWLLTPYQGKGL+DYQTEFNKRHF T
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+ID+EDE+QDEMPLSHH
Sbjct: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFADNTASIAREKLSMYLSGKLPP 393
           HDPSYRQQSCKF DNTASI REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of Cla97C07G131470 vs. TrEMBL
Match: tr|A0A0A0KS64|A0A0A0KS64_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE=4 SV=1)

HSP 1 Score: 778.5 bits (2009), Expect = 7.2e-222
Identity = 377/392 (96.17%), Postives = 386/392 (98.47%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKK EKKVDQNVFA+ASLSSQ QPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG
Sbjct: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120

Query: 121 MNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIM 180
           +NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IKSKFKKIRGLPNCCGV+ETTHIM
Sbjct: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTHIM 180

Query: 181 MTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPT+ESANGIWLD EKNCSM+LQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS
Sbjct: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRL 300
           QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHFATRL
Sbjct: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCKFADNTASIAREKLSMYLSGKLPP 393
           PSYRQQSC+F DNTASI+REKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of Cla97C07G131470 vs. TrEMBL
Match: tr|A0A1S3CEZ1|A0A1S3CEZ1_CUCME (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=4 SV=1)

HSP 1 Score: 777.7 bits (2007), Expect = 1.2e-221
Identity = 377/392 (96.17%), Postives = 386/392 (98.47%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKK EKKVDQNVFA+ASLSSQ QPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG
Sbjct: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120

Query: 121 MNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIM 180
           +NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IKSKFKKIRGLPNCCGVIETTHIM
Sbjct: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTHIM 180

Query: 181 MTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPT+ESANGIWLD EKNCSM+LQVIVDPEMRFCDIITGWPGSLSD+LVLQSSGFFKLS
Sbjct: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRL 300
           QDGERLNGKKM+LSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHFATRL
Sbjct: 241 QDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCKFADNTASIAREKLSMYLSGKLPP 393
           PSYRQQSC+F DNTASIAREKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of Cla97C07G131470 vs. TrEMBL
Match: tr|A0A2I4FY10|A0A2I4FY10_9ROSI (putative nuclease HARBI1 OS=Juglans regia OX=51240 GN=LOC109003036 PE=4 SV=1)

HSP 1 Score: 660.2 bits (1702), Expect = 2.9e-186
Identity = 314/391 (80.31%), Postives = 357/391 (91.30%), Query Frame = 0

Query: 1   MGPIRGFKRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKN-TKFESV 60
           MGPIRGFKRKK+AEKKVDQNV AA  L SQPQPLDWWD+FSQRITGPLSQSKN  KFESV
Sbjct: 1   MGPIRGFKRKKRAEKKVDQNVLAAL-LRSQPQPLDWWDDFSQRITGPLSQSKNPNKFESV 60

Query: 61  FKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSF 120
           FKISRKTF+YICSLV++ MMA+ SNF  +NGKPLSLNDQVAVALRRL SGESLS++G+SF
Sbjct: 61  FKISRKTFNYICSLVRDNMMARPSNFNGINGKPLSLNDQVAVALRRLSSGESLSSVGESF 120

Query: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHI 180
           GMNQS+VSQITWRFVEA+EE+G+HHL WPSTE +M+++KSKF+KIRGLPNCCG I+ THI
Sbjct: 121 GMNQSTVSQITWRFVEAIEERGIHHLCWPSTEAEMEELKSKFQKIRGLPNCCGAIDITHI 180

Query: 181 MMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 240
           MMTLPT +S + IWLD EKNCSM+LQ +VDPEMRF DIITGWPGSLSD LVL+SSGFFKL
Sbjct: 181 MMTLPTMDSTDDIWLDHEKNCSMILQAVVDPEMRFRDIITGWPGSLSDDLVLRSSGFFKL 240

Query: 241 SQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATR 300
           S++G+RLNGKK++LS+  EL EYIIGD+GFPLLPWLLTPYQG+GL D+Q+EFNKRHFATR
Sbjct: 241 SEEGKRLNGKKVELSQGKELREYIIGDAGFPLLPWLLTPYQGRGLLDFQSEFNKRHFATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           +VAQRAL RLKEMWKII+GVMWKPDKHRLPRIIL CC+LHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 MVAQRALARLKEMWKIIQGVMWKPDKHRLPRIILACCILHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DPSYRQQSCKFADNTASIAREKLSMYLSGKL 391
           D +YRQQ+C+ ADNTA + RE LS+ LSGKL
Sbjct: 361 DSNYRQQTCQSADNTAVVIRENLSLCLSGKL 390

BLAST of Cla97C07G131470 vs. TrEMBL
Match: tr|A0A061FMZ6|A0A061FMZ6_THECC (RNA binding protein, putative OS=Theobroma cacao OX=3641 GN=TCM_042838 PE=4 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 1.7e-175
Identity = 299/400 (74.75%), Postives = 350/400 (87.50%), Query Frame = 0

Query: 1   MGPIRGFKRKKKA--EKKVDQNVF-----AAASLSSQPQPLDWWDEFSQRITGPLSQSKN 60
           MGPIRGFKR+KKA  +K VDQNV       A+SL SQPQPLDWWDEFS+RI+G LSQSK+
Sbjct: 1   MGPIRGFKRRKKAADKKVVDQNVLPSSAAVASSLGSQPQPLDWWDEFSKRISGTLSQSKD 60

Query: 61  TK-FESVFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESL 120
           +K FESVF+ISRKTF YICSLVKE MMA+ S+FTDLNGKPLSLNDQVAVALRRL SGESL
Sbjct: 61  SKSFESVFRISRKTFDYICSLVKEDMMARQSSFTDLNGKPLSLNDQVAVALRRLSSGESL 120

Query: 121 SNIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCG 180
           S IGD+FGMNQS+VSQITWRFVEAMEE+GLHHLSWPSTE +M+QIKSKF+KIRGLPNCCG
Sbjct: 121 SIIGDTFGMNQSTVSQITWRFVEAMEERGLHHLSWPSTEAEMEQIKSKFEKIRGLPNCCG 180

Query: 181 VIETTHIMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQ 240
            I+ TH++MTLPT + +N +W D EKN SM+LQ +VDPEMRF D+I GWPGSLSDA+VL+
Sbjct: 181 AIDITHVVMTLPTMDPSNNVWFDREKNYSMILQAVVDPEMRFRDVIAGWPGSLSDAIVLR 240

Query: 241 SSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFN 300
           SSGFF+LS++G+RLNGKK+ +SE +++ EYIIGD+GFPLLPWL TPYQGKGLSD Q EFN
Sbjct: 241 SSGFFRLSEEGKRLNGKKLNISEGTDIREYIIGDAGFPLLPWLFTPYQGKGLSDLQVEFN 300

Query: 301 KRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDE 360
           KRH ATR+VAQ AL RLKEMW+II GVMW PDK+RLPRI+LVCCLLHNI+ID+EDEV D+
Sbjct: 301 KRHAATRMVAQMALARLKEMWRIIHGVMWMPDKNRLPRIVLVCCLLHNILIDLEDEVLDD 360

Query: 361 MPLSHHHDPSYRQQSCKFADNTASIAREKLSMYLSGKLPP 393
           M LSHHHD  YR+Q+C+  D +A I R+KLS+YL+GKLPP
Sbjct: 361 MSLSHHHDTGYRRQNCESLDKSALIMRDKLSLYLTGKLPP 400

BLAST of Cla97C07G131470 vs. TrEMBL
Match: tr|A0A1U8PK17|A0A1U8PK17_GOSHI (putative nuclease HARBI1 OS=Gossypium hirsutum OX=3635 GN=LOC107959884 PE=4 SV=1)

HSP 1 Score: 617.5 bits (1591), Expect = 2.1e-173
Identity = 294/393 (74.81%), Postives = 350/393 (89.06%), Query Frame = 0

Query: 1   MGPIRGFKRKKK-AEKK-VDQNVFAAASLSSQPQPLDWWDEFSQRITGPLSQSKNTK-FE 60
           MGPIRGFKR+KK A+KK VD NVF ++SL SQ QPLDWWD+FS+RI+GPLSQSK ++ FE
Sbjct: 1   MGPIRGFKRRKKTADKKVVDHNVF-SSSLESQLQPLDWWDDFSKRISGPLSQSKGSRSFE 60

Query: 61  SVFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 120
           S+F+IS+KTF+YICSLVKE MMA+ S++TD+NGKPLSLNDQVAVALRRL SGESLS IGD
Sbjct: 61  SIFRISKKTFNYICSLVKEDMMARQSSYTDINGKPLSLNDQVAVALRRLSSGESLSVIGD 120

Query: 121 SFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETT 180
           +FGMNQS+VSQITWRFVEAMEEKGLHHL+WPSTE +M+QIKSKF+KIRGLPNCCG I+ T
Sbjct: 121 TFGMNQSTVSQITWRFVEAMEEKGLHHLTWPSTEAEMEQIKSKFEKIRGLPNCCGAIDIT 180

Query: 181 HIMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFF 240
           H++MTLPT + +N +W D EKN SMVLQ +VDPEMR  D+I GWPGSLSDA+VL+SSGFF
Sbjct: 181 HVVMTLPTMDPSNNVWFDREKNYSMVLQAVVDPEMRLRDVIAGWPGSLSDAVVLRSSGFF 240

Query: 241 KLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFA 300
           +LS++G+RLNGKK+ +SE +E+GEYIIGD+GFPLLPWLLTPYQGKGLSD Q EFNKRH A
Sbjct: 241 RLSEEGKRLNGKKLNISEGTEIGEYIIGDAGFPLLPWLLTPYQGKGLSDLQIEFNKRHAA 300

Query: 301 TRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSH 360
           TR+VAQ  L RLKEMW+II GVMW PDK+RLPRI+LVCCLLHNI+IDMEDEV D+M LSH
Sbjct: 301 TRMVAQMTLARLKEMWRIIHGVMWMPDKNRLPRIVLVCCLLHNILIDMEDEVFDDMSLSH 360

Query: 361 HHDPSYRQQSCKFADNTASIAREKLSMYLSGKL 391
           HHD  YRQQ+C++ D +A I R+KLS+Y++GKL
Sbjct: 361 HHDTGYRQQNCEYFDQSAMIMRDKLSLYINGKL 392

BLAST of Cla97C07G131470 vs. Swiss-Prot
Match: sp|Q9M2U3|ALPL_ARATH (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 497.3 bits (1279), Expect = 1.6e-139
Identity = 238/356 (66.85%), Postives = 285/356 (80.06%), Query Frame = 0

Query: 34  LDWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPL 93
           LDWWD FS+RI G  +  K   FESVFKISRKTF YICSLVK    AK +NF+D NG PL
Sbjct: 52  LDWWDGFSRRIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPL 111

Query: 94  SLNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEED 153
           SLND+VAVALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ +HHLSWPS    
Sbjct: 112 SLNDRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---K 171

Query: 154 MDQIKSKFKKIRGLPNCCGVIETTHIMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMR 213
           +D+IKSKF+KI GLPNCCG I+ THI+M LP  E +N +WLDGEKN SM LQ +VDP+MR
Sbjct: 172 LDEIKSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMR 231

Query: 214 FCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLP 273
           F D+I GWPGSL+D +VL++SGF+KL + G+RLNG+K+ LSE +EL EYI+GDSGFPLLP
Sbjct: 232 FLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLP 291

Query: 274 WLLTPYQGKGLSDYQTEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIIL 333
           WLLTPYQGK  S  QTEFNKRH      AQ AL++LK+ W+II GVMW PD++RLPRII 
Sbjct: 292 WLLTPYQGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIF 351

Query: 334 VCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQQSCKFADNTASIAREKLSMYLSGK 390
           VCCLLHNI+IDMED+  D+ PLS  HD +YRQ+SCK AD  +S+ R++LS  L GK
Sbjct: 352 VCCLLHNIIIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402

BLAST of Cla97C07G131470 vs. Swiss-Prot
Match: sp|Q94K49|ALP1_ARATH (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 357.8 bits (917), Expect = 1.5e-97
Identity = 175/381 (45.93%), Postives = 257/381 (67.45%), Query Frame = 0

Query: 8   KRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGP-LSQSKNTKFESVFKISRKT 67
           K KK A+ K  + V  A  L  +    DWWD F  R + P +   ++  F+  F+ S+ T
Sbjct: 17  KAKKLAKNKEKKRV-NAVPLDPEAIDCDWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTT 76

Query: 68  FSYICSLVKEVMMAK-TSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGMNQSS 127
           FSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+S  ++G +FG+ QS+
Sbjct: 77  FSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQST 136

Query: 128 VSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIMMTLPT 187
           VSQ+TWRF+EA+EE+  HHL WP ++  +++IKSKF+++ GLPNCCG I+TTHI+MTLP 
Sbjct: 137 VSQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPA 196

Query: 188 AESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGER 247
            ++++  W D EKN SM LQ + D EMRF +++TGWPG ++ + +L+ SGFFKL ++ + 
Sbjct: 197 VQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQI 256

Query: 248 LNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRLVAQRA 307
           L+G    LS+ +++ EY++G   +PLLPWL+TP+     SD    FN+RH   R VA  A
Sbjct: 257 LDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATA 316

Query: 308 LTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQ 367
             +LK  W+I+  VMW+PD+ +LP IILVCCLLHNI+ID  D +Q+++PLS HHD  Y  
Sbjct: 317 FQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYAD 376

Query: 368 QSCKFADNTASIAREKLSMYL 387
           + CK  +   S  R  L+ +L
Sbjct: 377 RYCKQTEPLGSELRGCLTEHL 394

BLAST of Cla97C07G131470 vs. Swiss-Prot
Match: sp|Q6AZB8|HARB1_DANRE (Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 1.1e-26
Identity = 78/289 (26.99%), Postives = 143/289 (49.48%), Query Frame = 0

Query: 58  SVFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 117
           + F   R+   Y+  L+K+ ++ +T        + +S + Q+  AL    SG   S +GD
Sbjct: 37  NTFGFPREFIYYLVELLKDSLLRRTQ-----RSRAISPDVQILAALGFYTSGSFQSKMGD 96

Query: 118 SFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETT 177
           + G++Q+S+S+      +A+ EK    + +   E    Q K +F +I G+PN  GV++  
Sbjct: 97  AIGISQASMSRCVSNVTKALIEKAPEFIGFTRDEATKQQFKDEFYRIAGIPNVTGVVDCA 156

Query: 178 HIMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFF 237
           HI +  P A+ ++  +++ +   S+  Q++ D         T WPGSL+D  V + S   
Sbjct: 157 HIAIKAPNADDSS--YVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVA 216

Query: 238 KLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQG-KGLSDYQTEFNKRHF 297
           KL ++            E+ + G +++GD+ +PL  WL+TP Q  +  +DY+  +N  H 
Sbjct: 217 KLFEE-----------QENDDEG-WLLGDNRYPLKKWLMTPVQSPESPADYR--YNLAHT 276

Query: 298 ATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLLHNI 342
            T  +  R    ++  ++ + G    + + P+K     II  CC+LHNI
Sbjct: 277 TTHEIVDRTFRAIQTRFRCLDGAKGYLQYSPEK--CSHIIQACCVLHNI 302

BLAST of Cla97C07G131470 vs. Swiss-Prot
Match: sp|Q17QR8|HARB1_BOVIN (Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1)

HSP 1 Score: 115.2 bits (287), Expect = 1.7e-24
Identity = 85/306 (27.78%), Postives = 141/306 (46.08%), Query Frame = 0

Query: 60  FKISRKTFSYICSL----------VKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSG 119
           FK+   T  Y+ S+          + E++ A  S  T    + +S   Q+  AL    SG
Sbjct: 25  FKLDDVTDEYLMSMYGFPRQFIYYLVELLGASLSRPTQ-RSRAISPETQILAALGFYTSG 84

Query: 120 ESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPN 179
              + +GD+ G++Q+S+S+      EA+ E+    + +P+ E  +  +K +F  + G+P 
Sbjct: 85  SFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPADEASVQALKDEFYGLAGIPG 144

Query: 180 CCGVIETTHIMMTLPTAESANGIWLDG--EKNCSMVLQVIVDPEMRFCDIITGWPGSLSD 239
             GV++  H+ +  P AE  + +   G    NC MV     D       + T WPGSL D
Sbjct: 145 VIGVVDCMHVAIKAPNAEDLSYVNRKGLHSLNCLMV----CDIRGALMTVETSWPGSLQD 204

Query: 240 ALVLQSSGFFKLSQDGERLNGKKMKLSESSELG----EYIIGDSGFPLLPWLLTP-YQGK 299
            +VLQ S                  LS   E G     +++GDS F L  WL+TP +  +
Sbjct: 205 CVVLQQS-----------------SLSSQFEAGMHKESWLLGDSSFFLRTWLMTPLHIPE 264

Query: 300 GLSDYQTEFNKRHFATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLL 345
             ++Y+  +N  H AT  V ++    L   ++ + G    + + P+K     IIL CC+L
Sbjct: 265 TPAEYR--YNMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKS--SHIILACCVL 304

BLAST of Cla97C07G131470 vs. Swiss-Prot
Match: sp|Q96MB7|HARB1_HUMAN (Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 2.2e-24
Identity = 86/306 (28.10%), Postives = 140/306 (45.75%), Query Frame = 0

Query: 60  FKISRKTFSYICSL----------VKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSG 119
           FK+   T  Y+ S+          + E++ A  S  T    + +S   QV  AL    SG
Sbjct: 25  FKLDDVTDEYLMSMYGFPRQFIYYLVELLGANLSRPTQ-RSRAISPETQVLAALGFYTSG 84

Query: 120 ESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPN 179
              + +GD+ G++Q+S+S+      EA+ E+    + +P+ E  +  +K +F  + G+P 
Sbjct: 85  SFQTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPG 144

Query: 180 CCGVIETTHIMMTLPTAESANGIWLDG--EKNCSMVLQVIVDPEMRFCDIITGWPGSLSD 239
             GV++  H+ +  P AE  + +   G    NC MV     D       + T WPGSL D
Sbjct: 145 VMGVVDCIHVAIKAPNAEDLSYVNRKGLHSLNCLMV----CDIRGTLMTVETNWPGSLQD 204

Query: 240 ALVLQSSGFFKLSQDGERLNGKKMKLSESSELG----EYIIGDSGFPLLPWLLTP-YQGK 299
             VLQ S                  LS   E G     +++GDS F L  WL+TP +  +
Sbjct: 205 CAVLQQS-----------------SLSSQFEAGMHKDSWLLGDSSFFLRTWLMTPLHIPE 264

Query: 300 GLSDYQTEFNKRHFATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLL 345
             ++Y+  +N  H AT  V ++    L   ++ + G    + + P+K     IIL CC+L
Sbjct: 265 TPAEYR--YNMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKS--SHIILACCVL 304

BLAST of Cla97C07G131470 vs. TAIR10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 497.3 bits (1279), Expect = 8.8e-141
Identity = 238/356 (66.85%), Postives = 285/356 (80.06%), Query Frame = 0

Query: 34  LDWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPL 93
           LDWWD FS+RI G  +  K   FESVFKISRKTF YICSLVK    AK +NF+D NG PL
Sbjct: 52  LDWWDGFSRRIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPL 111

Query: 94  SLNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEED 153
           SLND+VAVALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ +HHLSWPS    
Sbjct: 112 SLNDRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---K 171

Query: 154 MDQIKSKFKKIRGLPNCCGVIETTHIMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMR 213
           +D+IKSKF+KI GLPNCCG I+ THI+M LP  E +N +WLDGEKN SM LQ +VDP+MR
Sbjct: 172 LDEIKSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMR 231

Query: 214 FCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLP 273
           F D+I GWPGSL+D +VL++SGF+KL + G+RLNG+K+ LSE +EL EYI+GDSGFPLLP
Sbjct: 232 FLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLP 291

Query: 274 WLLTPYQGKGLSDYQTEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIIL 333
           WLLTPYQGK  S  QTEFNKRH      AQ AL++LK+ W+II GVMW PD++RLPRII 
Sbjct: 292 WLLTPYQGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIF 351

Query: 334 VCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQQSCKFADNTASIAREKLSMYLSGK 390
           VCCLLHNI+IDMED+  D+ PLS  HD +YRQ+SCK AD  +S+ R++LS  L GK
Sbjct: 352 VCCLLHNIIIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402

BLAST of Cla97C07G131470 vs. TAIR10
Match: AT3G63270.1 (Putative harbinger transposase-derived nuclease (InterPro:IPR006912))

HSP 1 Score: 357.8 bits (917), Expect = 8.3e-99
Identity = 175/381 (45.93%), Postives = 257/381 (67.45%), Query Frame = 0

Query: 8   KRKKKAEKKVDQNVFAAASLSSQPQPLDWWDEFSQRITGP-LSQSKNTKFESVFKISRKT 67
           K KK A+ K  + V  A  L  +    DWWD F  R + P +   ++  F+  F+ S+ T
Sbjct: 17  KAKKLAKNKEKKRV-NAVPLDPEAIDCDWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTT 76

Query: 68  FSYICSLVKEVMMAK-TSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGMNQSS 127
           FSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+S  ++G +FG+ QS+
Sbjct: 77  FSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQST 136

Query: 128 VSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIMMTLPT 187
           VSQ+TWRF+EA+EE+  HHL WP ++  +++IKSKF+++ GLPNCCG I+TTHI+MTLP 
Sbjct: 137 VSQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPA 196

Query: 188 AESANGIWLDGEKNCSMVLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGER 247
            ++++  W D EKN SM LQ + D EMRF +++TGWPG ++ + +L+ SGFFKL ++ + 
Sbjct: 197 VQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQI 256

Query: 248 LNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRLVAQRA 307
           L+G    LS+ +++ EY++G   +PLLPWL+TP+     SD    FN+RH   R VA  A
Sbjct: 257 LDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATA 316

Query: 308 LTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQ 367
             +LK  W+I+  VMW+PD+ +LP IILVCCLLHNI+ID  D +Q+++PLS HHD  Y  
Sbjct: 317 FQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYAD 376

Query: 368 QSCKFADNTASIAREKLSMYL 387
           + CK  +   S  R  L+ +L
Sbjct: 377 RYCKQTEPLGSELRGCLTEHL 394

BLAST of Cla97C07G131470 vs. TAIR10
Match: AT5G12010.1 (unknown protein)

HSP 1 Score: 141.7 bits (356), Expect = 9.4e-34
Identity = 93/324 (28.70%), Postives = 161/324 (49.69%), Query Frame = 0

Query: 36  WWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSL 95
           WW+E S R+  P        F+  F++S+ TF  IC    E+  A     T L    + +
Sbjct: 161 WWEECS-RLDYP-----EEDFKKAFRMSKSTFELICD---ELNSAVAKEDTALR-NAIPV 220

Query: 96  NDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGL-HHLSWPSTEEDM 155
             +VAV + RL +GE L  +   FG+  S+  ++     +A+++  +  +L WP  +E +
Sbjct: 221 RQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPD-DESL 280

Query: 156 DQIKSKFKKIRGLPNCCGVIETTHIMMTLPTAESAN-----GIWLDGEKNCSMVLQVIVD 215
             I+ +F+ + G+PN  G + TTHI +  P    A+         + + + S+ +Q +V+
Sbjct: 281 RNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVN 340

Query: 216 PEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGF 275
           P+  F D+  GWPGS+ D  VL+ S  ++ + +G  L G             ++ G  G 
Sbjct: 341 PKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------WVAGGPGH 400

Query: 276 PLLPWLLTPYQGKGLSDYQTEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLP 335
           PLL W+L PY  + L+  Q  FN++    + VA+ A  RLK  W  ++    +     LP
Sbjct: 401 PLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQ-KRTEVKLQDLP 460

Query: 336 RIILVCCLLHNIVIDMEDEVQDEM 354
            ++  CC+LHNI    E++++ E+
Sbjct: 461 TVLGACCVLHNICEMREEKMEPEL 460

BLAST of Cla97C07G131470 vs. TAIR10
Match: AT4G29780.1 (unknown protein)

HSP 1 Score: 124.0 bits (310), Expect = 2.0e-28
Identity = 91/343 (26.53%), Postives = 160/343 (46.65%), Query Frame = 0

Query: 35  DWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLS 94
           DWWD  S+            +F   F++S+ TF+ IC  +   +  K +   D    P  
Sbjct: 198 DWWDRVSR------PDFPEDEFRREFRMSKSTFNLICEELDTTVTKKNTMLRDAIPAP-- 257

Query: 95  LNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGL-HHLSWPSTEED 154
              +V V + RL +G  L ++ + FG+  S+  ++      A+ +  +  +L WPS + +
Sbjct: 258 --KRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPS-DSE 317

Query: 155 MDQIKSKFKKIRGLPNCCGVIETTHIMMTLPTAESA-----NGIWLDGEKNCSMVLQVIV 214
           ++  K+KF+ +  +PN  G I TTHI +  P    A          + + + S+ +Q +V
Sbjct: 318 INSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVV 377

Query: 215 DPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSG 274
           + +  F D+  G PGSL+D  +L+ S          R    +  L +S     +I+G+SG
Sbjct: 378 NADGIFTDVCIGNPGSLTDDQILEKSSL-------SRQRAARGMLRDS-----WIVGNSG 437

Query: 275 FPLLPWLLTPYQGKGLSDYQTEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRL 334
           FPL  +LL PY  + L+  Q  FN+     + +A  A  RLK  W  ++    +     L
Sbjct: 438 FPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQ-KRTEVKLQDL 497

Query: 335 PRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQQSCKFA 372
           P ++  CC+LHNI    ++E+  E+      D +  + + + A
Sbjct: 498 PYVLGACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSA 516

BLAST of Cla97C07G131470 vs. TAIR10
Match: AT1G72270.1 (Ribosome 60S biogenesis N-terminal (InterPro:IPR021714))

HSP 1 Score: 95.9 bits (237), Expect = 5.9e-20
Identity = 72/252 (28.57%), Postives = 114/252 (45.24%), Query Frame = 0

Query: 100 AVALRRLCSGESLSNIGDSFGMNQSS-VSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIK 159
           A  + RL  G S   +   FG + +S  S+  +   + + EK           + +D  K
Sbjct: 124 AATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEK---------LSQQLDDPK 183

Query: 160 SKFKKIRGLPNCCGVIETTHIMMTLPTAESANGIWLDGEKNCSMVLQVIVDPEMRFCDII 219
             F     LPNC GV+                G  L G K  S+++Q +VD   RF DI 
Sbjct: 184 PDFSP-NLLPNCYGVVGFGRF--------EVKGKLL-GAKG-SILVQALVDSNGRFVDIS 243

Query: 220 TGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTP 279
            GWP ++    + + +  F +++  E L+G   KL     +  YI+GDS  PLLPWL+TP
Sbjct: 244 AGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLVTP 303

Query: 280 YQ-GKGLSDYQTEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDK-HRLPRIILVCC 339
           Y        ++ EFN          + A  +++  W+I+    WKP+    +P +I   C
Sbjct: 304 YDLTSDEESFREEFNNVVHTGLHSVEIAFAKVRARWRIL-DKKWKPETIEFMPFVITTGC 352

Query: 340 LLHNIVIDMEDE 349
           LLHN +++  D+
Sbjct: 364 LLHNFLVNSGDD 352

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004147700.11.1e-22196.17PREDICTED: putative nuclease HARBI1 [Cucumis sativus] >KGN50531.1 hypothetical p... [more]
XP_008461643.11.9e-22196.17PREDICTED: putative nuclease HARBI1 [Cucumis melo][more]
XP_022138922.13.6e-21794.66protein ALP1-like [Momordica charantia][more]
XP_023545365.12.9e-21492.89protein ALP1-like [Cucurbita pepo subsp. pepo][more]
XP_022956519.13.7e-21492.89protein ALP1-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
tr|A0A0A0KS64|A0A0A0KS64_CUCSA7.2e-22296.17Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE=4 SV=1[more]
tr|A0A1S3CEZ1|A0A1S3CEZ1_CUCME1.2e-22196.17putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=4 SV=1[more]
tr|A0A2I4FY10|A0A2I4FY10_9ROSI2.9e-18680.31putative nuclease HARBI1 OS=Juglans regia OX=51240 GN=LOC109003036 PE=4 SV=1[more]
tr|A0A061FMZ6|A0A061FMZ6_THECC1.7e-17574.75RNA binding protein, putative OS=Theobroma cacao OX=3641 GN=TCM_042838 PE=4 SV=1[more]
tr|A0A1U8PK17|A0A1U8PK17_GOSHI2.1e-17374.81putative nuclease HARBI1 OS=Gossypium hirsutum OX=3635 GN=LOC107959884 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
sp|Q9M2U3|ALPL_ARATH1.6e-13966.85Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1[more]
sp|Q94K49|ALP1_ARATH1.5e-9745.93Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=... [more]
sp|Q6AZB8|HARB1_DANRE1.1e-2626.99Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1[more]
sp|Q17QR8|HARB1_BOVIN1.7e-2427.78Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1[more]
sp|Q96MB7|HARB1_HUMAN2.2e-2428.10Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
AT3G55350.18.8e-14166.85PIF / Ping-Pong family of plant transposases[more]
AT3G63270.18.3e-9945.93Putative harbinger transposase-derived nuclease (InterPro:IPR006912)[more]
AT5G12010.19.4e-3428.70unknown protein[more]
AT4G29780.12.0e-2826.53unknown protein[more]
AT1G72270.15.9e-2028.57Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)[more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR027806HARBI1_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006270 DNA replication initiation
biological_process GO:0006275 regulation of DNA replication
cellular_component GO:0005575 cellular_component
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0005524 ATP binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C07G131470.1Cla97C07G131470.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR027806Harbinger transposase-derived nuclease domainPFAMPF13359DDE_Tnp_4coord: 175..340
e-value: 3.3E-30
score: 104.8
NoneNo IPR availablePANTHERPTHR22930:SF110SUBFAMILY NOT NAMEDcoord: 21..389
NoneNo IPR availablePANTHERPTHR22930UNCHARACTERIZEDcoord: 21..389

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C07G131470Watermelon (97103) v2wmbwmbB170
Cla97C07G131470Cucumber (Gy14) v1cgywmbB295
Cla97C07G131470Cucurbita maxima (Rimu)cmawmbB495
Cla97C07G131470Wild cucumber (PI 183967)cpiwmbB416
Cla97C07G131470Cucumber (Chinese Long) v3cucwmbB412
Cla97C07G131470Bottle gourd (USVL1VR-Ls)lsiwmbB205
Cla97C07G131470Melon (DHL92) v3.5.1mewmbB065
Cla97C07G131470Watermelon (97103) v1wmwmbB039