CsaV3_1G033670 (gene) Cucumber (Chinese Long) v3

Overview
Name: CsaV3_1G033670
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description: protein ALP1-like
Location: chr1: 20753345 .. 20756841 (-)
RNA-Seq Expression: CsaV3_1G033670
Synteny: CsaV3_1G033670
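
The Location field in the overview can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for gene records, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'chr1: 20753345 .. 20756841 (-)'.

    Assumption: start and end are 1-based inclusive coordinates,
    so the feature length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("chr1: 20753345 .. 20756841 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3497 bp on the minus strand of chr1.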
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGAAGAGGCTTTTATGAAGACCTCCGCCGTCAGATTTGTCTCACAAAGCAAAATCGCCTTTAGCTCATTCGCAAACCCAAAACTTCTTGCTCTATCTCATGGACTACCACCATTGATTGGACCTTTTGTTCTTAGACTAATTTAAAGTTGGTTTCATTCAAAGGGCGGCCCTCGATCTTTCTCCATTGCATGTGCTACAAAACCCAATTCCTGTTTTTACCAAATTTGAAGAATCTGTGTGCGATTTTGTGGAATATTATGTGTCCTCTTCATCAATAACAATTTTTTATATTTGGGTTTTAATCACTCACTCGTTGTGGCTAATGGGTCCAATCAGAGGGTTGAGAAAGAAGAAGAAATTAGAGAGAAAGCTCGATTGCAATGGTACTGCTTCTGATTCTTCTGAAAAGGATGATGCTATAGATTGGTGGGATGATTTCTCCAAACGAACCAATGGTATTTCTATCTTGATTTTTTCTTTACTTTTAATCGAAGTTCTTCAATTGTAAGATCGGTTCTGTTTGACTATCGACTGTTTTGGAACAAATGATGTGAACTTGGTTGACTTCTTATCTGGTATGTAGAAAGAATTGACGGTACTGCATGTGCATTGGGAGGCTTGGTAATTATGTTAGCTTCGAGTCATTGTATAGAAGGGCAAGAACTTTTGTTTAGTTTCTATGAAGAAATATGTATGTTATGTGATTCTGATTATAAGAGTTTGAACCAGTCTGTGTTGTGGCCATGTTTTCTGAGATTTTGATCTGTTTTCTGAGCTTGTAATGTTTAGAGCTTGACAGGGGATCTCGTTTTGGGTTTGATGTGTTAGTGTTTAGTTTGTGAAGGGATAGATTATGCTGTTTGACTGGTGTAACTTATGACTTTACACCTTCATAAGCTGTATTTTGTATTCTTTGGTTGTCGAGAGCTTCTCGAATTTGTAACTAGTTGACCCTGATTCGACATTAGTAGTTTATTGATGATAGTGACGATATAGATCCAAAGCTATGGAACTTGAGTGTGAGTCTCTTTGTTGAAATGGAATGGCCTCATCTTTTTTTCATCTCTCCACACATTTGAGTGCTAGTTCTTTGAATTACAAGGATGCTTCTATATACCAGGGTGAGGGGTTTGTGTGTTCCTTCTTCAAACCTTTTGCTATGTATTGGCTGTCCTTCCTTCGTCAGGGTGATGACAATGTCCTCTGCCAACGACTACCCCTGAAAGTGAAGTCCAGACTCAACAAGAATGACACATATCTTGTTTAGGAATTCCTTCCTATTATCTTAGAGAATGTAGAGAATTTATGCTATTCCTCTATCGTATCTGATTCTAGCAATTAGATAATATAACTTCCACATATTTACTAAATTGTGTTTTGAAGTGTTGAGACAACTATAGTAGACTAAGATAAACTTATTGGACTGATGTAGAGTCGAGTGTGGATTGGAGAGTTAGAAACATAGATGTGGATTGTTAACGGAATGTAGTTATTCGATTAGTAGGCTATGTAAATCACATCAAATAATTTACATCCTTATTGCTTTGGCATTGACATAATGTCATTAGATTTTTTCTTCTGGTGTTTATCAGTTTCTCAAGTCTTCTACTTTACATAATTGAAAGCCAGCGAGTATCTATTTCCATTACACTACAGAGTGAAAAACTAATCATACTGGTCATGAATGTATCCAATGATTTGGGGATTAGCTCAGCCTTTTCTCCTCAAAATATTCTGGACCAAATTTCAGTGTTTTAGTTTGAAAATTTTGGTTACACTTTCTTGACCCCATTTCTATGCAGGTCTTCATTCTGCATCAAAAGGTTTGGATAGATTCAAATCCATTTTCAAGGTCTCCCGAAAGACTTTCGATTACATATGTTTGCTTGTCAAGGACGACATGACAGCTAAATCTGGTCATTTTACATTTTTGAACGGTAGGCCATTGTCTTTATGTGATCAAGTAGCTGTAGCTTTAAGAAGGTTGGGGTCCGGTGAA
TCATTAGTGACAATAGGTGATTCGCTTGGGTTGAACCATTCGACTGTGTCTCAAGTCACATGGCGATTTGTGGAGTCAATGGAAGAAAGGGGGCTTCACCATCTTCATTGGCCTTCAAACGAAGTGGAAATGGCTCAAGTGAAATCAAAATTTGAGAAAATACAAGGACTCCCTAACTGTTGTGGTTCGATCGATACCACTCACATCACAATGTGTCTGCCTGCTTCGGATCCCACAAGTTATGTGTGGCTTGATGACAAAAAAAACCACAGCATGGTTTTGCAAGTGATTGTAGACGCAGAAATGAGGTTCCGAGACATATTAACTGGATTGCCTGGAAAATTGTCGGATTGGTTAGTTTTCCAGAGTTCAAACTTCCACAAGCTTTGTGACAAGGGGGAGAGGCTGAATGGGAAGAGGTTTGAGCTTCCCGACAGATCGGAAATCCGAGAATATATAATTGGAGATTCCGGCTATCCTTTACTTCCTTATCTTGTCACTCCCTATGATGGAAAAGAACTCTCAACATCAAAAACTGAGTTCAACAAGCGACACAAAGAGACTCGGTTGGTGGTTCAACGAGCACTAGCAATGTTGAAAGAACGGTGGAGAATCATTCAAGGAGTGATGTGGAGACCCGATAAACATCGGTTGCCGAGGATCATTCTAGTCTGCTGCTTACTTCATAACATTATCATTGATATCGGAGATGAAACGGAGGAGGGTGAAGTTCCTTTGTCTATCGAACATGACGTTGATTACAAACAACAGGTATGTGATGTTTTTGACTCAAAGGGTGCATATGTGAGAGATAGATTGTCTTTGCTGTTCATATGAAACTGAGAGGCTTGGTTACCATATTTACTATTCTTTTATATTCTACAATGTTCAAATGATGAGATTAAGTGCCAACATGAGGTCAGCTCAACTGGCACCTATGTACCAAAGGATAAGAGGTTTCAAGTTCAAATCTTCCATCGCAAATTGAATTATGACTCCTTTTCAAAAAAAAAAAAATTATGGGATTAAGCTTCTGCCGCATGTACATATCATCAATTGATGCTTTAGTCATTTATTGGGGAGTGATTGATGAATTGATTCAGGTCCAATGATCTGAAATTTGTCTTATGAACATTGAAAGAGGGAAATTACAAAACACACAAGTTATTAGTATATGAGCTGTGGTTTTTGTTCTTTTGCTGTGAACAGTATGGAATTACAGCGTTACATGCAAGTTTCAACTCCTATCATTTTCATTTTGCCTGCCTGTTTGCATTTTTCTTTTCTAATTGGAATAATTTTCTCTCTTTTTTATTTTTTTTGGGAGCTTTGTTTCAAATAGACTCTAAAAAGACCATTAGTCAATTGCTTAATAAGAGCTGACATGACTTGATATGTGGGGTTGTAGAATCGAACAGTTGGCCTCAAGATTGTTAATACACTCTTCATGGCAATGGAGATATATATATATAGATATAGTTATAGATATCATAGAAG

mRNA sequence

ATGGGTCCAATCAGAGGGTTGAGAAAGAAGAAGAAATTAGAGAGAAAGCTCGATTGCAATGGTACTGCTTCTGATTCTTCTGAAAAGGATGATGCTATAGATTGGTGGGATGATTTCTCCAAACGAACCAATGGTCTTCATTCTGCATCAAAAGGTTTGGATAGATTCAAATCCATTTTCAAGGTCTCCCGAAAGACTTTCGATTACATATGTTTGCTTGTCAAGGACGACATGACAGCTAAATCTGGTCATTTTACATTTTTGAACGGTAGGCCATTGTCTTTATGTGATCAAGTAGCTGTAGCTTTAAGAAGGTTGGGGTCCGGTGAATCATTAGTGACAATAGGTGATTCGCTTGGGTTGAACCATTCGACTGTGTCTCAAGTCACATGGCGATTTGTGGAGTCAATGGAAGAAAGGGGGCTTCACCATCTTCATTGGCCTTCAAACGAAGTGGAAATGGCTCAAGTGAAATCAAAATTTGAGAAAATACAAGGACTCCCTAACTGTTGTGGTTCGATCGATACCACTCACATCACAATGTGTCTGCCTGCTTCGGATCCCACAAGTTATGTGTGGCTTGATGACAAAAAAAACCACAGCATGGTTTTGCAAGTGATTGTAGACGCAGAAATGAGGTTCCGAGACATATTAACTGGATTGCCTGGAAAATTGTCGGATTGGTTAGTTTTCCAGAGTTCAAACTTCCACAAGCTTTGTGACAAGGGGGAGAGGCTGAATGGGAAGAGGTTTGAGCTTCCCGACAGATCGGAAATCCGAGAATATATAATTGGAGATTCCGGCTATCCTTTACTTCCTTATCTTGTCACTCCCTATGATGGAAAAGAACTCTCAACATCAAAAACTGAGTTCAACAAGCGACACAAAGAGACTCGGTTGGTGGTTCAACGAGCACTAGCAATGTTGAAAGAACGGTGGAGAATCATTCAAGGAGTGATGTGGAGACCCGATAAACATCGGTTGCCGAGGATCATTCTAGTCTGCTGCTTACTTCATAACATTATCATTGATATCGGAGATGAAACGGAGGAGGGTGAAGTTCCTTTGTCTATCGAACATGACGTTGATTACAAACAACAGGTATGTGATGTTTTTGACTCAAAGGGTGCATATGTGAGAGATAGATTGTCTTTGCTGTTCATATGA

Coding sequence (CDS)

ATGGGTCCAATCAGAGGGTTGAGAAAGAAGAAGAAATTAGAGAGAAAGCTCGATTGCAATGGTACTGCTTCTGATTCTTCTGAAAAGGATGATGCTATAGATTGGTGGGATGATTTCTCCAAACGAACCAATGGTCTTCATTCTGCATCAAAAGGTTTGGATAGATTCAAATCCATTTTCAAGGTCTCCCGAAAGACTTTCGATTACATATGTTTGCTTGTCAAGGACGACATGACAGCTAAATCTGGTCATTTTACATTTTTGAACGGTAGGCCATTGTCTTTATGTGATCAAGTAGCTGTAGCTTTAAGAAGGTTGGGGTCCGGTGAATCATTAGTGACAATAGGTGATTCGCTTGGGTTGAACCATTCGACTGTGTCTCAAGTCACATGGCGATTTGTGGAGTCAATGGAAGAAAGGGGGCTTCACCATCTTCATTGGCCTTCAAACGAAGTGGAAATGGCTCAAGTGAAATCAAAATTTGAGAAAATACAAGGACTCCCTAACTGTTGTGGTTCGATCGATACCACTCACATCACAATGTGTCTGCCTGCTTCGGATCCCACAAGTTATGTGTGGCTTGATGACAAAAAAAACCACAGCATGGTTTTGCAAGTGATTGTAGACGCAGAAATGAGGTTCCGAGACATATTAACTGGATTGCCTGGAAAATTGTCGGATTGGTTAGTTTTCCAGAGTTCAAACTTCCACAAGCTTTGTGACAAGGGGGAGAGGCTGAATGGGAAGAGGTTTGAGCTTCCCGACAGATCGGAAATCCGAGAATATATAATTGGAGATTCCGGCTATCCTTTACTTCCTTATCTTGTCACTCCCTATGATGGAAAAGAACTCTCAACATCAAAAACTGAGTTCAACAAGCGACACAAAGAGACTCGGTTGGTGGTTCAACGAGCACTAGCAATGTTGAAAGAACGGTGGAGAATCATTCAAGGAGTGATGTGGAGACCCGATAAACATCGGTTGCCGAGGATCATTCTAGTCTGCTGCTTACTTCATAACATTATCATTGATATCGGAGATGAAACGGAGGAGGGTGAAGTTCCTTTGTCTATCGAACATGACGTTGATTACAAACAACAGGTATGTGATGTTTTTGACTCAAAGGGTGCATATGTGAGAGATAGATTGTCTTTGCTGTTCATATGA

Protein sequence

MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIFKVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRLSLLFI*
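
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), and `translate` is a hypothetical helper. Only the first and last few codons of the CDS listed above are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 36 nt and last 18 nt of the CDS shown above.
cds_head = "ATGGGTCCAATCAGAGGGTTGAGAAAGAAGAAGAAA"
cds_tail = "TCTTTGCTGTTCATATGA"

print(translate(cds_head))  # MGPIRGLRKKKK
print(translate(cds_tail))  # SLLFI*
```

Both fragments agree with the start (MGPIRGLRKKKK) and end (SLLFI*) of the listed protein sequence.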
Homology
BLAST of CsaV3_1G033670 vs. NCBI nr
Match: XP_004149039.1 (protein ALP1-like [Cucumis sativus] >KGN65671.1 hypothetical protein Csa_019843 [Cucumis sativus])

HSP 1 Score: 802.7 bits (2072), Expect = 1.4e-228
Identity = 388/388 (100.00%), Positives = 388/388 (100.00%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC
Sbjct: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL
Sbjct: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
           VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH
Sbjct: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360

Query: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           DVDYKQQVCDVFDSKGAYVRDRLSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 388

BLAST of CsaV3_1G033670 vs. NCBI nr
Match: TYK01291.1 (putative nuclease HARBI1 [Cucumis melo var. makuwa])

HSP 1 Score: 766.9 bits (1979), Expect = 8.2e-218
Identity = 371/388 (95.62%), Positives = 380/388 (97.94%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLD NGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGL HLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLRHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASDPTS+VWLDD+KNHSMVLQVIVDAEMRFRDILTGLPGKLSD LVFQSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDDEKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKR ELPDRSEI+EYI+GDSGYPLL YLVTPYDGKELSTSK EFNKRH  TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
           VVQ+ALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETE+G+VPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360

Query: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           DVDYKQQVCDVFDSKGAY+R+RLSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388
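
The percentages in each HSP summary line follow directly from the match counts and the alignment length. As a rough sketch (the helper name `parse_hsp_summary` is hypothetical, and the regular expression is written only for the line layout used on this page), the values for the hit above can be recomputed as follows.

```python
import re

def parse_hsp_summary(line):
    """Extract identity/positive counts from an HSP summary line.

    Returns (identities, positives, alignment_length, pct_identity, pct_positive).
    The pattern accepts both 'Positives' and the 'Postives' spelling.
    """
    pattern = r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)"
    match = re.search(pattern, line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, pos, length2 = (int(g) for g in match.groups())
    if length != length2:
        raise ValueError("identity and positive counts use different lengths")
    return ident, pos, length, round(100 * ident / length, 2), round(100 * pos / length, 2)

summary = "Identity = 371/388 (95.62%), Positives = 380/388 (97.94%), Query Frame = 0"
result = parse_hsp_summary(summary)
print(result)
```

Here 371 of 388 aligned positions are identical (95.62%) and 380 of 388 are scored as positive, that is, identical or conservative substitutions (97.94%).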

BLAST of CsaV3_1G033670 vs. NCBI nr
Match: KAA0033909.1 (putative nuclease HARBI1 [Cucumis melo var. makuwa])

HSP 1 Score: 765.8 bits (1976), Expect = 1.8e-217
Identity = 371/388 (95.62%), Positives = 380/388 (97.94%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLD NGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGL HLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLCHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASDPTS+VWLDD+KNHSMVLQVIVDAEMRFRDILTGLPGKLSD LVFQSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDDEKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKR ELPDRSEI+EYI+GDSGYPLL YLVTPYDGKELSTSK EFNKRH  TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
           VVQ+ALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETE+G+VPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360

Query: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           DVDYKQQVCDVFDSKGAY+R+RLSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388

BLAST of CsaV3_1G033670 vs. NCBI nr
Match: XP_038893506.1 (protein ALP1-like [Benincasa hispida])

HSP 1 Score: 765.4 bits (1975), Expect = 2.4e-217
Identity = 373/389 (95.89%), Positives = 379/389 (97.43%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLD NGTASDSSEKD+AIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDSNGTASDSSEKDEAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGL HLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLRHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASD TSYVWLD++KNHSMVLQVIVDAEMRFRDILTGLPGK+SDWLVFQSSNFHKLC
Sbjct: 181 MCLPASDSTSYVWLDEEKNHSMVLQVIVDAEMRFRDILTGLPGKMSDWLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKR EL DRSEIREYIIGDSGYPLLPYLVTPYDGKE STSK EFNKRHKETRL
Sbjct: 241 DKGERLNGKRLELNDRSEIREYIIGDSGYPLLPYLVTPYDGKEHSTSKAEFNKRHKETRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEE-GEVPLSIE 360
           VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGD+TEE G VPLSIE
Sbjct: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDDTEEDGGVPLSIE 360

Query: 361 HDVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           HDVDYKQQVCDVFD KGAY+RDRLSLLFI
Sbjct: 361 HDVDYKQQVCDVFDPKGAYLRDRLSLLFI 389

BLAST of CsaV3_1G033670 vs. NCBI nr
Match: XP_008457540.1 (PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 763.5 bits (1970), Expect = 9.1e-217
Identity = 370/388 (95.36%), Positives = 379/388 (97.68%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLD NGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGL HLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLCHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASDPTS+VWLD +KNHSMVLQVIVDAEMRFRDILTGLPGKLSD LVFQSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDXRKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKR ELPDRSEI+EYI+GDSGYPLL YLVTPYDGKELSTSK EFNKRH  TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
           VVQ+ALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETE+G+VPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360

Query: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           DVDYKQQVCDVFDSKGAY+R+RLSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388

BLAST of CsaV3_1G033670 vs. ExPASy Swiss-Prot
Match: Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 5.7e-129
Identity = 224/403 (55.58%), Positives = 296/403 (73.45%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCN--------------GTASDSSEKDD-----AIDWWDDFSK 60
           MGPI+ ++KKK+ E+K+D N                A ++++ DD     ++DWWD FS+
Sbjct: 1   MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60

Query: 61  RTNGLHSASKGLDRFKSIFKVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAV 120
           R   ++  S     F+S+FK+SRKTFDYIC LVK D TAK  +F+  NG PLSL D+VAV
Sbjct: 61  R---IYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAV 120

Query: 121 ALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKF 180
           ALRRLGSGESL  IG++ G+N STVSQ+TWRFVESMEER +HHL WPS   ++ ++KSKF
Sbjct: 121 ALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSKF 180

Query: 181 EKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGL 240
           EKI GLPNCCG+ID THI M LPA +P++ VWLD +KN SM LQ +VD +MRF D++ G 
Sbjct: 181 EKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGW 240

Query: 241 PGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDG 300
           PG L+D +V ++S F+KL +KG+RLNG++  L +R+E+REYI+GDSG+PLLP+L+TPY G
Sbjct: 241 PGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQG 300

Query: 301 KELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNI 360
           K  S  +TEFNKRH E     Q AL+ LK+RWRII GVMW PD++RLPRII VCCLLHNI
Sbjct: 301 KPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNI 360

Query: 361 IIDIGDETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRLS 385
           IID+ D+T + + PLS +HD++Y+Q+ C + D   + +RD LS
Sbjct: 361 IIDMEDQTLDDQ-PLSQQHDMNYRQRSCKLADEASSVLRDELS 396

BLAST of CsaV3_1G033670 vs. ExPASy Swiss-Prot
Match: Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 335.9 bits (860), Expect = 6.2e-91
Identity = 181/397 (45.59%), Positives = 249/397 (62.72%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEK---------DDAI--DWWDDFSKRTNGLHSA 60
           M P++  +KKK  ++ LD     + + EK          +AI  DWWD F  R +     
Sbjct: 1   MAPVK--QKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVP 60

Query: 61  SKGLDRFKSIFKVSRKTFDYICLLVKDDMTAK--SGHFTFLNGRPLSLCDQVAVALRRLG 120
           S     FK  F+ S+ TF YIC LV++D+ ++  SG    + GR LS+  QVA+ALRRL 
Sbjct: 61  SDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLIN-IEGRLLSVEKQVAIALRRLA 120

Query: 121 SGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGL 180
           SG+S V++G + G+  STVSQVTWRF+E++EER  HHL WP ++  + ++KSKFE++ GL
Sbjct: 121 SGDSQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSD-RIEEIKSKFEEMYGL 180

Query: 181 PNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSD 240
           PNCCG+IDTTHI M LPA    S  W D +KN+SM LQ + D EMRF +++TG PG ++ 
Sbjct: 181 PNCCGAIDTTHIIMTLPAVQ-ASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTV 240

Query: 241 WLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTS 300
             + + S F KLC+  + L+G    L   ++IREY++G   YPLLP+L+TP+D    S S
Sbjct: 241 SKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDS 300

Query: 301 KTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGD 360
              FN+RH++ R V   A   LK  WRI+  VMWRPD+ +LP IILVCCLLHNIIID GD
Sbjct: 301 MVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGD 360

Query: 361 ETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRLS 385
             +E +VPLS  HD  Y  + C   +  G+ +R  L+
Sbjct: 361 YLQE-DVPLSGHHDSGYADRYCKQTEPLGSELRGCLT 391

BLAST of CsaV3_1G033670 vs. ExPASy Swiss-Prot
Match: Q8BR93 (Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 6.3e-27
Identity = 82/299 (27.42%), Positives = 142/299 (47.49%), Query Frame = 0

Query: 91  RPLSLCDQVAVALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPSN 150
           R +S   Q+  AL    SG     +GD++G++ +++S+      E++ ER    +H+P +
Sbjct: 65  RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPVD 124

Query: 151 EVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDA 210
           E  +  +K +F  + G+P   G  D  H+ +  P ++  SYV  + K  HS+   V+ D 
Sbjct: 125 EAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDLSYV--NRKGLHSLNCLVVCDI 184

Query: 211 EMRFRDILTGLPGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYP 270
                 + T  PG L D  V Q S+     + G         +P  S    +++GDS + 
Sbjct: 185 RGALMTVETSWPGSLQDCAVLQRSSLTSQFETG---------MPKDS----WLLGDSSFF 244

Query: 271 LLPYLVTPYDGKELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQG----VMWRPDKH 330
           L  +L+TP    E + ++  +N+ H  T  V++R L  L  R+R + G    + + P+K 
Sbjct: 245 LRSWLLTPLPIPE-TAAEYRYNRAHSATHSVIERTLQTLCCRFRCLDGSKGALQYSPEK- 304

Query: 331 RLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRLSL 386
               IIL CC+LHNI +D G +     VP  I+   + + +  +  D +   +R  L L
Sbjct: 305 -CSHIILACCVLHNISLDHGMDVWSSPVPGPIDQPPEGEDEHMESLDLEADRIRQELIL 345

BLAST of CsaV3_1G033670 vs. ExPASy Swiss-Prot
Match: B0BN95 (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 6.3e-27
Identity = 80/299 (26.76%), Positives = 144/299 (48.16%), Query Frame = 0

Query: 91  RPLSLCDQVAVALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPSN 150
           R +S   Q+  AL    SG     +GD++G++ +++S+      E++ ER    +H+P++
Sbjct: 65  RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPAD 124

Query: 151 EVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDA 210
           E  +  +K +F  + G+P   G++D  H+ +  P ++  SYV  + K  HS+   V+ D 
Sbjct: 125 EAAIQSLKDEFYGLAGMPGVIGAVDCIHVAIKAPNAEDLSYV--NRKGLHSLNCLVVCDI 184

Query: 211 EMRFRDILTGLPGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYP 270
                 + T  PG L D  V Q S+     + G         +P  S    +++GDS + 
Sbjct: 185 RGALMTVETSWPGSLQDCAVLQQSSLSSQFETG---------MPKDS----WLLGDSSFF 244

Query: 271 LLPYLVTPYDGKELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQG----VMWRPDKH 330
           L  +L+TP    E + ++  +N+ H  T  V+++ L  L  R+R + G    + + P+K 
Sbjct: 245 LHTWLLTPLHIPE-TPAEYRYNRAHSATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKS 304

Query: 331 RLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRLSL 386
               IIL CC+LHNI ++ G +     V   IE   + + +  +  D +   +R  L L
Sbjct: 305 --SHIILACCVLHNISLEHGMDVWSSPVTGPIEQPPEGEDEQMESLDLEADRIRQELIL 345

BLAST of CsaV3_1G033670 vs. ExPASy Swiss-Prot
Match: Q17QR8 (Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 6.9e-26
Identity = 77/299 (25.75%), Positives = 141/299 (47.16%), Query Frame = 0

Query: 91  RPLSLCDQVAVALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPSN 150
           R +S   Q+  AL    SG     +GD++G++ +++S+      E++ ER    +H+P++
Sbjct: 65  RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPAD 124

Query: 151 EVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDA 210
           E  +  +K +F  + G+P   G +D  H+ +  P ++  SYV  + K  HS+   ++ D 
Sbjct: 125 EASVQALKDEFYGLAGIPGVIGVVDCMHVAIKAPNAEDLSYV--NRKGLHSLNCLMVCDI 184

Query: 211 EMRFRDILTGLPGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYP 270
                 + T  PG L D +V Q S+              +FE     E   +++GDS + 
Sbjct: 185 RGALMTVETSWPGSLQDCVVLQQSSL-----------SSQFEAGMHKE--SWLLGDSSFF 244

Query: 271 LLPYLVTPYDGKELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQG----VMWRPDKH 330
           L  +L+TP    E + ++  +N  H  T  V+++    L  R+R + G    + + P+K 
Sbjct: 245 LRTWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKS 304

Query: 331 RLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRLSL 386
               IIL CC+LHNI ++ G +     V   +E   + + +  +  D +   +R  L L
Sbjct: 305 --SHIILACCVLHNISLEHGMDVWSSPVTGPVEQPPEEEYEHMESLDLEADRIRQELML 345

BLAST of CsaV3_1G033670 vs. ExPASy TrEMBL
Match: A0A0A0M0C2 (DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G481730 PE=3 SV=1)

HSP 1 Score: 802.7 bits (2072), Expect = 6.6e-229
Identity = 388/388 (100.00%), Positives = 388/388 (100.00%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC
Sbjct: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL
Sbjct: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
           VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH
Sbjct: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360

Query: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           DVDYKQQVCDVFDSKGAYVRDRLSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 388

BLAST of CsaV3_1G033670 vs. ExPASy TrEMBL
Match: A0A5D3BNB9 (Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold49G00830 PE=3 SV=1)

HSP 1 Score: 766.9 bits (1979), Expect = 4.0e-218
Identity = 371/388 (95.62%), Positives = 380/388 (97.94%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLD NGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGL HLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLRHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASDPTS+VWLDD+KNHSMVLQVIVDAEMRFRDILTGLPGKLSD LVFQSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDDEKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKR ELPDRSEI+EYI+GDSGYPLL YLVTPYDGKELSTSK EFNKRH  TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
           VVQ+ALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETE+G+VPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360

Query: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           DVDYKQQVCDVFDSKGAY+R+RLSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388

BLAST of CsaV3_1G033670 vs. ExPASy TrEMBL
Match: A0A5A7SXL0 (Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43059G001000 PE=3 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 8.9e-218
Identity = 371/388 (95.62%), Positives = 380/388 (97.94%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLD NGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGL HLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLCHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASDPTS+VWLDD+KNHSMVLQVIVDAEMRFRDILTGLPGKLSD LVFQSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDDEKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKR ELPDRSEI+EYI+GDSGYPLL YLVTPYDGKELSTSK EFNKRH  TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
           VVQ+ALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETE+G+VPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360

Query: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           DVDYKQQVCDVFDSKGAY+R+RLSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388

BLAST of CsaV3_1G033670 vs. ExPASy TrEMBL
Match: A0A1S3C6F2 (LOW QUALITY PROTEIN: putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103497206 PE=3 SV=1)

HSP 1 Score: 763.5 bits (1970), Expect = 4.4e-217
Identity = 370/388 (95.36%), Positives = 379/388 (97.68%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60
           MGPIRGLRKKKKLERKLD NGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF
Sbjct: 1   MGPIRGLRKKKKLERKLDSNGTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSIF 60

Query: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120
           KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG
Sbjct: 61  KVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSLG 120

Query: 121 LNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180
           LNHSTVSQVTWRFVESMEERGL HLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT
Sbjct: 121 LNHSTVSQVTWRFVESMEERGLCHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHIT 180

Query: 181 MCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLC 240
           MCLPASDPTS+VWLD +KNHSMVLQVIVDAEMRFRDILTGLPGKLSD LVFQSSNFHKLC
Sbjct: 181 MCLPASDPTSFVWLDXRKNHSMVLQVIVDAEMRFRDILTGLPGKLSDRLVFQSSNFHKLC 240

Query: 241 DKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRL 300
           DKGERLNGKR ELPDRSEI+EYI+GDSGYPLL YLVTPYDGKELSTSK EFNKRH  TRL
Sbjct: 241 DKGERLNGKRLELPDRSEIQEYIVGDSGYPLLSYLVTPYDGKELSTSKAEFNKRHTATRL 300

Query: 301 VVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEH 360
           VVQ+ALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETE+G+VPLSIEH
Sbjct: 301 VVQQALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEDGDVPLSIEH 360

Query: 361 DVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           DVDYKQQVCDVFDSKGAY+R+RLSLLFI
Sbjct: 361 DVDYKQQVCDVFDSKGAYLRERLSLLFI 388

BLAST of CsaV3_1G033670 vs. ExPASy TrEMBL
Match: A0A6J1GBJ6 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111452471 PE=3 SV=1)

HSP 1 Score: 698.0 bits (1800), Expect = 2.3e-197
Identity = 338/389 (86.89%), Positives = 357/389 (91.77%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCN-GTASDSSEKDDAIDWWDDFSKRTNGLHSASKGLDRFKSI 60
           MGPIRG RKKKKLERKLD N  TASDSSEKDDA+DWWDDFS+RT GLHS  +GLD FKSI
Sbjct: 1   MGPIRGSRKKKKLERKLDANASTASDSSEKDDALDWWDDFSRRTIGLHSELEGLDGFKSI 60

Query: 61  FKVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVALRRLGSGESLVTIGDSL 120
           FKVSRKTFDYICLLVKDDMTA+S +FTFLNGRPLSL DQVAVALRRLGSG+SLVTIG S 
Sbjct: 61  FKVSRKTFDYICLLVKDDMTAESSNFTFLNGRPLSLYDQVAVALRRLGSGDSLVTIGYSF 120

Query: 121 GLNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHI 180
           GLNHSTVSQVTWRFVESME RGL HLHWPS E EMAQVK KFEKIQGLPNCCGSIDTTHI
Sbjct: 121 GLNHSTVSQVTWRFVESMEVRGLRHLHWPSTEEEMAQVKLKFEKIQGLPNCCGSIDTTHI 180

Query: 181 TMCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKL 240
           TMCLP  DPTS VWLD +KNHSMVLQVIVDAEMRFRDI+TGLPGK+SDWLVFQSSNFHKL
Sbjct: 181 TMCLPVLDPTSNVWLDAEKNHSMVLQVIVDAEMRFRDIVTGLPGKMSDWLVFQSSNFHKL 240

Query: 241 CDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETR 300
           C+KGERLNGKR E  +RSEIREYIIGDSGYPLLPYLVTPYDGKEL  SK EFNKRH ETR
Sbjct: 241 CEKGERLNGKRLEFINRSEIREYIIGDSGYPLLPYLVTPYDGKELQPSKAEFNKRHTETR 300

Query: 301 LVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIE 360
           LVVQRALA LKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIID+GDE E+G VP+S+E
Sbjct: 301 LVVQRALASLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDVGDEMEDGNVPMSME 360

Query: 361 HDVDYKQQVCDVFDSKGAYVRDRLSLLFI 389
           HD DYKQQ+CDV+DSKGAY+RD+LSLLFI
Sbjct: 361 HDADYKQQICDVYDSKGAYLRDKLSLLFI 389

BLAST of CsaV3_1G033670 vs. TAIR 10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 462.2 bits (1188), Expect = 4.0e-130
Identity = 224/403 (55.58%), Positives = 296/403 (73.45%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCN--------------GTASDSSEKDD-----AIDWWDDFSK 60
           MGPI+ ++KKK+ E+K+D N                A ++++ DD     ++DWWD FS+
Sbjct: 1   MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60

Query: 61  RTNGLHSASKGLDRFKSIFKVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAV 120
           R   ++  S     F+S+FK+SRKTFDYIC LVK D TAK  +F+  NG PLSL D+VAV
Sbjct: 61  R---IYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAV 120

Query: 121 ALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKF 180
           ALRRLGSGESL  IG++ G+N STVSQ+TWRFVESMEER +HHL WPS   ++ ++KSKF
Sbjct: 121 ALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSKF 180

Query: 181 EKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGL 240
           EKI GLPNCCG+ID THI M LPA +P++ VWLD +KN SM LQ +VD +MRF D++ G 
Sbjct: 181 EKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGW 240

Query: 241 PGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDG 300
           PG L+D +V ++S F+KL +KG+RLNG++  L +R+E+REYI+GDSG+PLLP+L+TPY G
Sbjct: 241 PGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQG 300

Query: 301 KELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNI 360
           K  S  +TEFNKRH E     Q AL+ LK+RWRII GVMW PD++RLPRII VCCLLHNI
Sbjct: 301 KPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNI 360

Query: 361 IIDIGDETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRLS 385
           IID+ D+T + + PLS +HD++Y+Q+ C + D   + +RD LS
Sbjct: 361 IIDMEDQTLDDQ-PLSQQHDMNYRQRSCKLADEASSVLRDELS 396

BLAST of CsaV3_1G033670 vs. TAIR 10
Match: AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 335.9 bits (860), Expect = 4.4e-92
Identity = 181/397 (45.59%), Positives = 249/397 (62.72%), Query Frame = 0

Query: 1   MGPIRGLRKKKKLERKLDCNGTASDSSEK---------DDAI--DWWDDFSKRTNGLHSA 60
           M P++  +KKK  ++ LD     + + EK          +AI  DWWD F  R +     
Sbjct: 1   MAPVK--QKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVP 60

Query: 61  SKGLDRFKSIFKVSRKTFDYICLLVKDDMTAK--SGHFTFLNGRPLSLCDQVAVALRRLG 120
           S     FK  F+ S+ TF YIC LV++D+ ++  SG    + GR LS+  QVA+ALRRL 
Sbjct: 61  SDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLIN-IEGRLLSVEKQVAIALRRLA 120

Query: 121 SGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPSNEVEMAQVKSKFEKIQGL 180
           SG+S V++G + G+  STVSQVTWRF+E++EER  HHL WP ++  + ++KSKFE++ GL
Sbjct: 121 SGDSQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSD-RIEEIKSKFEEMYGL 180

Query: 181 PNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSD 240
           PNCCG+IDTTHI M LPA    S  W D +KN+SM LQ + D EMRF +++TG PG ++ 
Sbjct: 181 PNCCGAIDTTHIIMTLPAVQ-ASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTV 240

Query: 241 WLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELSTS 300
             + + S F KLC+  + L+G    L   ++IREY++G   YPLLP+L+TP+D    S S
Sbjct: 241 SKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDS 300

Query: 301 KTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNIIIDIGD 360
              FN+RH++ R V   A   LK  WRI+  VMWRPD+ +LP IILVCCLLHNIIID GD
Sbjct: 301 MVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGD 360

Query: 361 ETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRLS 385
             +E +VPLS  HD  Y  + C   +  G+ +R  L+
Sbjct: 361 YLQE-DVPLSGHHDSGYADRYCKQTEPLGSELRGCLT 391

BLAST of CsaV3_1G033670 vs. TAIR 10
Match: AT5G12010.1 (unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, plasma membrane, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G29780.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 136.0 bits (341), Expect = 6.6e-32
Identity = 89/339 (26.25%), Positives = 163/339 (48.08%), Query Frame = 0

Query: 29  KDDAIDWWDDFSKRTNGLHSASKGLDRFKSIFKVSRKTFDYICLLVKDDMTAKSGHFTFL 88
           KD +  WW++ S+            + FK  F++S+ TF+ IC    D++ +        
Sbjct: 155 KDRSRAWWEECSR-------LDYPEEDFKKAFRMSKSTFELIC----DELNSAVAKEDTA 214

Query: 89  NGRPLSLCDQVAVALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGL-HHLHW 148
               + +  +VAV + RL +GE L  +    GL  ST  ++     +++++  +  +L W
Sbjct: 215 LRNAIPVRQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQW 274

Query: 149 PSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSY-----VWLDDKKNHSM 208
           P +E  +  ++ +FE + G+PN  GS+ TTHI +  P     SY        + K ++S+
Sbjct: 275 PDDE-SLRNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSI 334

Query: 209 VLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREY 268
            +Q +V+ +  F D+  G PG + D  V + S  ++  + G  L G             +
Sbjct: 335 TIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------W 394

Query: 269 IIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWR 328
           + G  G+PLL +++ PY  + L+ ++  FN++  E + V + A   LK RW  +Q     
Sbjct: 395 VAGGPGHPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQ-KRTE 454

Query: 329 PDKHRLPRIILVCCLLHNIIIDIGDETEEGEVPLSIEHD 362
                LP ++  CC+LHN I ++ +E  E E+ + +  D
Sbjct: 455 VKLQDLPTVLGACCVLHN-ICEMREEKMEPELMVEVIDD 467

BLAST of CsaV3_1G033670 vs. TAIR 10
Match: AT4G29780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519; Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 126.7 bits (317), Expect = 4.0e-29
Identity = 93/319 (29.15%), Positives = 152/319 (47.65%), Query Frame = 0

Query: 29  KDDAIDWWDDFSKRTNGLHSASKGLDRFKSIFKVSRKTFDYICLLVKDDMTAKSGHFTFL 88
           K+   DWWD  S+            D F+  F++S+ TF+ IC  +   +T K+   T L
Sbjct: 193 KERTTDWWDRVSR-------PDFPEDEFRREFRMSKSTFNLICEELDTTVTKKN---TML 252

Query: 89  NGRPLSLCDQVAVALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGL-HHLHW 148
               +    +V V + RL +G  L  + +  GL  ST  ++      ++ +  +  +L W
Sbjct: 253 RD-AIPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLW 312

Query: 149 PSNEVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSY-----VWLDDKKNHSM 208
           PS+  E+   K+KFE +  +PN  GSI TTHI +  P     +Y        + K ++S+
Sbjct: 313 PSDS-EINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSI 372

Query: 209 VLQVIVDAEMRFRDILTGLPGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREY 268
            +Q +V+A+  F D+  G PG L+D  + + S+         R    R  L D      +
Sbjct: 373 TVQGVVNADGIFTDVCIGNPGSLTDDQILEKSSL-------SRQRAARGMLRD-----SW 432

Query: 269 IIGDSGYPLLPYLVTPYDGKELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWR 328
           I+G+SG+PL  YL+ PY  + L+ ++  FN+   E + +   A   LK RW  +Q     
Sbjct: 433 IVGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQ-KRTE 486

Query: 329 PDKHRLPRIILVCCLLHNI 342
                LP ++  CC+LHNI
Sbjct: 493 VKLQDLPYVLGACCVLHNI 486

BLAST of CsaV3_1G033670 vs. TAIR 10
Match: AT3G19120.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 101.7 bits (252), Expect = 1.4e-21
Identity = 80/292 (27.40%), Positives = 137/292 (46.92%), Query Frame = 0

Query: 55  RFKSIFKVSRKTFDYICLLVKDDMTAKSGHFTFLNGRPLSLCDQVAVA--LRRLGSGESL 114
           R++S++ +S   F  +   +K  +TA +          LSL    AVA  L RL  G S 
Sbjct: 116 RWRSLYGLSYPVFITVVDKLKPFITASN----------LSLPADYAVAMVLSRLAHGCSA 175

Query: 115 VTIGDSLGLNHSTVSQVTWRFVESMEERGLH--HLHWPSNEVEMAQVKSKFEKIQGLPNC 174
            T+     L+   +S++T   V  +    L+   +  P  +  + +    FE++  LPN 
Sbjct: 176 KTLASRYSLDPYLISKIT-NMVTRLLATKLYPEFIKIPVGKRRLIETTQGFEELTSLPNI 235

Query: 175 CGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVDAEMRFRDILTGLPGKLSDWLV 234
           CG+ID+T + +          ++       +++LQV+ D +  F D+    PG   D   
Sbjct: 236 CGAIDSTPVKLRRRTKLNPRNIYGCKYGYDAVLLQVVADHKKIFWDVCVKAPGGEDDSSH 295

Query: 235 FQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGYPLLPYLVTPYDGKELST-SKT 294
           F+ S  +K    G+ +  K   +     +R YI+GD  YPLL +L+TP+      T  + 
Sbjct: 296 FRDSLLYKRLTSGDIVWEKVINIRGH-HVRPYIVGDWCYPLLSFLMTPFSPNGSGTPPEN 355

Query: 295 EFNKRHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLPRIILVCCLLHNI 342
            F+    + R VV  A+ +LK RW+I+Q +      +  P+ I+ CC+LHN+
Sbjct: 356 LFDGMLMKGRSVVVEAIGLLKARWKILQSL--NVGVNHAPQTIVACCVLHNL 393

The following BLAST results are available for this feature:

Match Name      E-value   Identity (%)  Description
XP_004149039.1  1.4e-228  100.00        protein ALP1-like [Cucumis sativus] >KGN65671.1 hypothetical protein Csa_019843 ...
TYK01291.1      8.2e-218  95.62         putative nuclease HARBI1 [Cucumis melo var. makuwa]
KAA0033909.1    1.8e-217  95.62         putative nuclease HARBI1 [Cucumis melo var. makuwa]
XP_038893506.1  2.4e-217  95.89         protein ALP1-like [Benincasa hispida]
XP_008457540.1  9.1e-217  95.36         PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1 [Cucumis melo]

Match Name      E-value   Identity (%)  Description
Q9M2U3          5.7e-129  55.58         Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
Q94K49          6.2e-91   45.59         Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
Q8BR93          6.3e-27   27.42         Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1
B0BN95          6.3e-27   26.76         Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1
Q17QR8          6.9e-26   25.75         Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1

Match Name      E-value   Identity (%)  Description
A0A0A0M0C2      6.6e-229  100.00        DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G481730 PE...
A0A5D3BNB9      4.0e-218  95.62         Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A5A7SXL0      8.9e-218  95.62         Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol...
A0A1S3C6F2      4.4e-217  95.36         LOW QUALITY PROTEIN: putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC1034...
A0A6J1GBJ6      2.3e-197  86.89         protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111452471 PE=3 SV=1

Match Name      E-value   Identity (%)  Description
AT3G55350.1     4.0e-130  55.58         PIF / Ping-Pong family of plant transposases
AT3G63270.1     4.4e-92   45.59         CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G12010.1     6.6e-32   26.25         unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, ...
AT4G29780.1     4.0e-29   29.15         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G19120.1     1.4e-21   27.40         PIF / Ping-Pong family of plant transposases

InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR Term   IPR Description                                Source   Source Term      Source Description                  Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM     PF13359          DDE_Tnp_4                           coord: 174..340; e-value: 1.2E-29; score: 103.1
None       No IPR available                               PANTHER  PTHR22930:SF194  NUCLEASE HARBI1 ISOFORM X1-RELATED  coord: 1..386
None       No IPR available                               PANTHER  PTHR22930        UNCHARACTERIZED                     coord: 1..386

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_1G033670.1  CsaV3_1G033670.1  mRNA