CSPI05G09500 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI05G09500
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: protein ALP1-like
Location: Chr5: 7960267 .. 7964494 (+)
RNA-Seq Expression: CSPI05G09500
Synteny: CSPI05G09500
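
The Location field packs chromosome, start, end and strand into one string. A minimal sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (the record does not state this); the function name parse_location is hypothetical:

```python
import re

def parse_location(text):
    """Split a location such as 'Chr5: 7960267 .. 7964494 (+)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr5: 7960267 .. 7964494 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr5 + 4228
```

Under that assumption the gene spans 4228 bp on the forward strand of Chr5.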
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTGCCCATATCCGTCTCTCGTTTTTTCAATTTTCTTTTTTTAACCTTTTCTTCGGCTACATTGTCCTTTCCCCAACATGATGTAATTGGCCAATGCATAACTTTTCTTCTCTTTTGTTTTTACAACTTCCTTTGAGTCCTTTGCCTTTGCCTTTGCCTTTCTCCAAGCTCTCTCTCTATTTCATATTCTTCCATTTATTATTGCATGCTTTGAAATCAATACCAGAAGAAACAACACATACCCATCAATACCAAACATATCTTTTTAACCCAAATAGCCTCAAAACACCCCACTACTACTTCATCTTCCCCTTCCCTCTTGGTTTTTCCAAAATGCTTCTCTCCTTCTTTTGCTACCCTTTTCTGTTTTTCTTCTATAAACTTCAAATCTGAGACCTTTCTTCATGTGAATTCTATTCTATTCTTCGATTCGCTTGCTGGGTTTTTGTGGGTGTTTCATCTTTTGCCTTATTTCTGCTTCTTTTTCGACTTTCGAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGAAGGTTGAGAAAAAGGTTGACCAAAATGTCTTCGCTTCTGCTTCACTATCGTCTCAACTCCAGCCTTTGGATTGGTGGGATGAGTTCTCCCAAAGGATTACTGGTAAACTAGTTTTTTTACCTGTTAATTTTACTCTCTCTTGCTCTTTTGTTCAATTTGTCCATTGCCATTGCTCTTATTGGGCCTCTCGACTGAATTCTCGTGTTTCGTGAGGTTTTTGGACGGGGGTTGAATGTAGTGTTCAGTAATCATATATCTTCGATTATGCGTTTAAATTTTGAGTTTTATGATTATAGAACCGTGATATCGTTGTAAAATCTTCAACTCTTTCTGACATCTTATAACTTTGAGAGGGTCAGAAGGGGCAGCCTGTTGAAGTGATCTTATAAGTAGTACTGCTTTTGGAATTCTGTGGAGATGGATCAAATCTGTTGAGGAATAGTTATTGACTTAGATATGTTAGTTCACTTGATAATTGATGCGTACAAGTAGTGGCTACTCACTTAGCTAGTTTATAGAGTGTTAACTTGTGTTGTATTGTGATGATCTTCTCTTTTTCTCCTCATTGAACTGTTGAAATTTGATAATGAAGATGGTGCAGTAAGAAGTTAGAAGTTCACTGTCATCAGTTAAAAGTTCCTACAAATCTTTTGTCTCAGCATTGCTGGCAATAAACATCTTTAGGGAACTGAGTTCACTGTGATCCTCAACTCTGGCTTTCTTCTTTATGATTCATATTAAAGATGTCAATCTTGAGTATGGTATATTGAAGTTAGAATCTGTCATCTGCTGATAGTTGTAAGAAGGCCAGTGGAAATCGAAGGACGACTCTGAAAGTAATAATGTAGAATTTTAGATAATATAGGGGGGATGAACAGGATTTGACTAAAAGCATGTGTTCTCCGTGGAAAACAAGAGAAACAAACTAATATGTTGTTCATGTCAATATAGTATGTTAGAATTTGGAAGGCATGCTAAACAATGAGAACTAGAGAGCCAATTTTTTGTAGATAGTACTTAGTATTATTTGTTCTATTGTTAATATAATTGTGGTGGATGGCCGTCAAATTCAATTCATTATATGTAGTTTTTATACTATAATGCAACCTGATTTAGGTGGCATTTGAGTATATTATTCTAGAGATATTGTTGTGAGTTTTGTATCTTAGGCAACCCCTTTATAAACTTAATTATCTGGTTGTTTTAGAGGCATCTTGGAAAGGGTATGTTTAAGTCCAAATTTGATTCAATTTAGTAATGCTGCAAGGCAAAAAATAGCGTAGAAACCGTACTCTTGAGATCTCTTCCACATGGTCAAGATTTGTTTCAACCTACTGATATGGATAAGTGATCAAGATTTGTTTTTGGTATTTCTGAAACGCATATTTTATTTATAATCTGTTCAAGTAAGTACAGCCCTGAATGCATCATGTTCAAGTAAGGTCATAGTTTATACACAATCTATCAGTCA
ATAGATTCTCCCCCATTAGCAGTAATCATTAGCTTTATCATGATCAATCCTCCATTTTTCATGTCTCATGTCCTTCATTCTTCCTTTATATCTAGTTTATTCCTCCTTTCTTCCCCATTGTCTTTTTAGTGTGTCAGCTTTCTTTTCAAATTGGACCTATACTTCTTCCATGTCAGTTTGCTCTCCATTATACTAAAAAGATGTTTCATGCACCTATTTAGCAAAGATTTTTTTTTTTTTTCATGCTATTGGGACCCAACCATTTTTTAATATGCGTTGACTATTGTATTTTTAGATAACAAACCTTAGATGAGCTAACAACCAAACAGCCATGGAGCAAATTTTAATCCAAATTCCCAGAAATACTTAAAGTTGGCAACGGAAACAGTAGTTTAATTTTTCTTGATTTTAAGTTACTAGATAACCCAACAGCATGATCAAAGTGGGTAGATTTTGTTTCACGTTAGTAAAATCAGAAGATCAAATTAGCTTCAGGTACCAAAAACTAACACATGCGCAAGTTCAGTGTTATTAGATACTAGTAATTCTATCTTCTTATTCCATGGTTCTTAAATGTTGGTTACAGTTTAAAATGATTTGGACTTCGTAAACTGAATGTCCACTACATATTAATTCATGTTAAGGTTTTTAATTTTTATTATCAGAGTTTTTCTGTAGCATTAGTCTACATAGGCTCTTTATCACATTTAGCAGTTACTTTGTTCTGTCCACGATTTACAATGCAATAAGTCAACCTTTACTATTAGTTTAAAATAAAAGCATTTGACTAAACGTGTGATTGAATGGCTTCATCAGATTCTCATCTGAATATAACTATAGCATTTCTTCTGGTTTATTGGTCACAAAATTTTACTTTAATCTTTATTTAGTTATCAATAAGTTGGATGACATGATCTGATATTAGTTAGCTTCAGTAATAATGTCAAATGTTACATCTTTACTATGAATTATCTCTCATTTTTTCTTACCTGAGAAAGTGCAATGGCATGGAGTAAAGAGTTTTTCTTGGGAACAGAACAATTGCAGCAGGGACTGATAATATCTTTTATTGTTGTTTTTGCCTCTAATCAGGGCCATTATCTCAGTCAAAGAATACAAAATTTGAATCAGTTTTCAAAATTTCCAGAAAGACATTCAGCTATATCTGTTCTCTAGTTAAGGAAGTTATGATGGCTAAAACTTCAAGTTTTACCGACTTAAATGGGAAGCCTTTGTCTCTAAATGATCAAGTTGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCGTTATCTAACATTGGTGATTCGTTTGGACTGAATCAATCATCGGTTTCTCAAATAACTTGGCGTTTCGTGGAGGCAATGGAAGAGAAAGGCCTCCACCATCTCTCATGGCCTTCAACAGAAGAAGATATGGATAAGATAAAGTGCAAGTTCAAGAAAATCAGAGGTCTCCCTAATTGTTGCGGTGTAGTTGAAACGACACACATTATGATGACTTTGCCAACATCAGAATCTGCAAACGGCATCTGGCTTGATCGTGAGAAAAACTGCAGTATGATCCTGCAAGTGATTGTAGATCCAGAGATGAGATTCTGTGACATCATCACAGGTTGGCCAGGAAGTTTGAGCGACGCTCTTGTGCTCCAAAGTTCAGGATTCTTCAAACTTTCACAGGATGGTGAACGGTTAAACGGCAAAAAAATGAAGCTCTCGGAAAGTTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGTTTTCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTCCGGATTATCAAGCCGAGTTCAATAAGCGGCATTTTGCCACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGACAAACATAGGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTGATCGATATGGAGGATGAG
GTGCAAGACGAAATGCCTTTGTCTCATCATCACGACCCAAGTTACCGACAACAAAGTTGCGAATTCGTTGACAACACCGCTTCTATTTCAAGGGAGAAGCTTTCCATGTACTTATCTGGAAAGCTACCACCCTAAGAGAACATTCTTTTCCTCCCTTTCTTCCCTTTTGATAGATTGATTTGTGTTGTTGCTTATTCAAACTGTGATGCTCTTTCTGTCCAAATTGTT

mRNA sequence

TTGCCCATATCCGTCTCTCGTTTTTTCAATTTTCTTTTTTTAACCTTTTCTTCGGCTACATTGTCCTTTCCCCAACATGATGTAATTGGCCAATGCATAACTTTTCTTCTCTTTTGTTTTTACAACTTCCTTTGAGTCCTTTGCCTTTGCCTTTGCCTTTCTCCAAGCTCTCTCTCTATTTCATATTCTTCCATTTATTATTGCATGCTTTGAAATCAATACCAGAAGAAACAACACATACCCATCAATACCAAACATATCTTTTTAACCCAAATAGCCTCAAAACACCCCACTACTACTTCATCTTCCCCTTCCCTCTTGGTTTTTCCAAAATGCTTCTCTCCTTCTTTTGCTACCCTTTTCTGTTTTTCTTCTATAAACTTCAAATCTGAGACCTTTCTTCATGTGAATTCTATTCTATTCTTCGATTCGCTTGCTGGGTTTTTGTGGGTGTTTCATCTTTTGCCTTATTTCTGCTTCTTTTTCGACTTTCGAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGAAGGTTGAGAAAAAGGTTGACCAAAATGTCTTCGCTTCTGCTTCACTATCGTCTCAACTCCAGCCTTTGGATTGGTGGGATGAGTTCTCCCAAAGGATTACTGGGCCATTATCTCAGTCAAAGAATACAAAATTTGAATCAGTTTTCAAAATTTCCAGAAAGACATTCAGCTATATCTGTTCTCTAGTTAAGGAAGTTATGATGGCTAAAACTTCAAGTTTTACCGACTTAAATGGGAAGCCTTTGTCTCTAAATGATCAAGTTGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCGTTATCTAACATTGGTGATTCGTTTGGACTGAATCAATCATCGGTTTCTCAAATAACTTGGCGTTTCGTGGAGGCAATGGAAGAGAAAGGCCTCCACCATCTCTCATGGCCTTCAACAGAAGAAGATATGGATAAGATAAAGTGCAAGTTCAAGAAAATCAGAGGTCTCCCTAATTGTTGCGGTGTAGTTGAAACGACACACATTATGATGACTTTGCCAACATCAGAATCTGCAAACGGCATCTGGCTTGATCGTGAGAAAAACTGCAGTATGATCCTGCAAGTGATTGTAGATCCAGAGATGAGATTCTGTGACATCATCACAGGTTGGCCAGGAAGTTTGAGCGACGCTCTTGTGCTCCAAAGTTCAGGATTCTTCAAACTTTCACAGGATGGTGAACGGTTAAACGGCAAAAAAATGAAGCTCTCGGAAAGTTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGTTTTCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTCCGGATTATCAAGCCGAGTTCAATAAGCGGCATTTTGCCACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGACAAACATAGGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTGATCGATATGGAGGATGAGGTGCAAGACGAAATGCCTTTGTCTCATCATCACGACCCAAGTTACCGACAACAAAGTTGCGAATTCGTTGACAACACCGCTTCTATTTCAAGGGAGAAGCTTTCCATGTACTTATCTGGAAAGCTACCACCCTAAGAGAACATTCTTTTCCTCCCTTTCTTCCCTTTTGATAGATTGATTTGTGTTGTTGCTTATTCAAACTGTGATGCTCTTTCTGTCCAAATTGTT

Coding sequence (CDS)

ATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGAAGGTTGAGAAAAAGGTTGACCAAAATGTCTTCGCTTCTGCTTCACTATCGTCTCAACTCCAGCCTTTGGATTGGTGGGATGAGTTCTCCCAAAGGATTACTGGGCCATTATCTCAGTCAAAGAATACAAAATTTGAATCAGTTTTCAAAATTTCCAGAAAGACATTCAGCTATATCTGTTCTCTAGTTAAGGAAGTTATGATGGCTAAAACTTCAAGTTTTACCGACTTAAATGGGAAGCCTTTGTCTCTAAATGATCAAGTTGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCGTTATCTAACATTGGTGATTCGTTTGGACTGAATCAATCATCGGTTTCTCAAATAACTTGGCGTTTCGTGGAGGCAATGGAAGAGAAAGGCCTCCACCATCTCTCATGGCCTTCAACAGAAGAAGATATGGATAAGATAAAGTGCAAGTTCAAGAAAATCAGAGGTCTCCCTAATTGTTGCGGTGTAGTTGAAACGACACACATTATGATGACTTTGCCAACATCAGAATCTGCAAACGGCATCTGGCTTGATCGTGAGAAAAACTGCAGTATGATCCTGCAAGTGATTGTAGATCCAGAGATGAGATTCTGTGACATCATCACAGGTTGGCCAGGAAGTTTGAGCGACGCTCTTGTGCTCCAAAGTTCAGGATTCTTCAAACTTTCACAGGATGGTGAACGGTTAAACGGCAAAAAAATGAAGCTCTCGGAAAGTTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGTTTTCCCCTCTTGCCATGGCTACTAACTCCTTATCAAGGGAAAGGCCTTCCGGATTATCAAGCCGAGTTCAATAAGCGGCATTTTGCCACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGACAAACATAGGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAGTGATCGATATGGAGGATGAGGTGCAAGACGAAATGCCTTTGTCTCATCATCACGACCCAAGTTACCGACAACAAAGTTGCGAATTCGTTGACAACACCGCTTCTATTTCAAGGGAGAAGCTTTCCATGTACTTATCTGGAAAGCTACCACCCTAA

Protein sequence

MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQQSCEFVDNTASISREKLSMYLSGKLPP*
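
The protein sequence is the conceptual translation of the coding sequence. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the names CODON_TABLE and translate are illustrative and not part of the database:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon.

    Codons containing symbols other than A, C, G, T are reported as 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

# First 18 codons of the CDS listed above.
print(translate("ATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGAAGGTTGAGAAAAAGGTTGAC"))
# MGPIRGFKRKKKVEKKVD
```

The same call on the last four codons of the CDS, CTACCACCCTAA, gives LPP*, which matches the end of the protein sequence including the terminal stop symbol.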
Homology
BLAST of CSPI05G09500 vs. ExPASy Swiss-Prot
Match: Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 502.3 bits (1292), Expect = 5.0e-141
Identity = 246/407 (60.44%), Positives = 304/407 (74.69%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLS------------------SQLQPLDWWDEFSQ 60
           MGPI+  K+KK+ EKKVD+NV  +A+ +                  S  Q LDWWD FS+
Sbjct: 1   MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60

Query: 61  RITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVA 120
           RI G  +  K   FESVFKISRKTF YICSLVK    AK ++F+D NG PLSLND+VAVA
Sbjct: 61  RIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAVA 120

Query: 121 LRRLCSGESLSNIGDSFGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFK 180
           LRRL SGESLS IG++FG+NQS+VSQITWRFVE+MEE+ +HHLSWPS    +D+IK KF+
Sbjct: 121 LRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSKFE 180

Query: 181 KIRGLPNCCGVVETTHIMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWP 240
           KI GLPNCCG ++ THI+M LP  E +N +WLD EKN SM LQ +VDP+MRF D+I GWP
Sbjct: 181 KISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGWP 240

Query: 241 GSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGK 300
           GSL+D +VL++SGF+KL + G+RLNG+K+ LSE +EL EYI+GDSGFPLLPWLLTPYQGK
Sbjct: 241 GSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGK 300

Query: 301 GLPDYQAEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIV 360
                Q EFNKRH      AQ AL++LK+ W+II GVMW PD++RLPRII VCCLLHNI+
Sbjct: 301 PTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNII 360

Query: 361 IDMEDEVQDEMPLSHHHDPSYRQQSCEFVDNTASISREKLSMYLSGK 390
           IDMED+  D+ PLS  HD +YRQ+SC+  D  +S+ R++LS  L GK
Sbjct: 361 IDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402
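
Each HSP reports identity and positives as a count over the alignment length, followed by the rounded percentage. A minimal sketch of how such a statistics line could be parsed and the percentages recomputed; the function name parse_hsp_stats is hypothetical, and the pattern deliberately tolerates the "Postives" spelling that appears in some exports of these reports:

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from an HSP statistics line.

    Returns the raw counts, the alignment length and the percentages
    recomputed from the counts (rounded to two decimals).
    """
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    # 'Posi?tives' accepts both 'Positives' and the misspelt 'Postives'.
    pos = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", line)
    if ident is None or pos is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length = int(ident.group(1)), int(ident.group(2))
    positives = int(pos.group(1))
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 246/407 (60.44%), Positives = 304/407 (74.69%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 60.44 74.69
```

The denominator is the alignment length including gap columns, which is why it can exceed the length of either protein.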

BLAST of CSPI05G09500 vs. ExPASy Swiss-Prot
Match: Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 352.4 bits (903), Expect = 6.4e-96
Identity = 167/380 (43.95%), Positives = 255/380 (67.11%), Query Frame = 0

Query: 9   RKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGP-LSQSKNTKFESVFKISRKTF 68
           + KK+ K  ++    +  L  +    DWWD F  R + P +   ++  F+  F+ S+ TF
Sbjct: 17  KAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTF 76

Query: 69  SYICSLVKEVMMAK-TSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGLNQSSV 128
           SYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+S  ++G +FG+ QS+V
Sbjct: 77  SYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTV 136

Query: 129 SQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIMMTLPTS 188
           SQ+TWRF+EA+EE+  HHL WP ++  +++IK KF+++ GLPNCCG ++TTHI+MTLP  
Sbjct: 137 SQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAV 196

Query: 189 ESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERL 248
           ++++  W D+EKN SM LQ + D EMRF +++TGWPG ++ + +L+ SGFFKL ++ + L
Sbjct: 197 QASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQIL 256

Query: 249 NGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRLVAQRAL 308
           +G    LS+ +++ EY++G   +PLLPWL+TP+      D    FN+RH   R VA  A 
Sbjct: 257 DGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAF 316

Query: 309 TRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQQ 368
            +LK  W+I+  VMW+PD+ +LP IILVCCLLHNI+ID  D +Q+++PLS HHD  Y  +
Sbjct: 317 QQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYADR 376

Query: 369 SCEFVDNTASISREKLSMYL 387
            C+  +   S  R  L+ +L
Sbjct: 377 YCKQTEPLGSELRGCLTEHL 394

BLAST of CSPI05G09500 vs. ExPASy Swiss-Prot
Match: Q6AZB8 (Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 1.4e-26
Identity = 78/289 (26.99%), Positives = 143/289 (49.48%), Query Frame = 0

Query: 58  SVFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 117
           + F   R+   Y+  L+K+ ++ +T        + +S + Q+  AL    SG   S +GD
Sbjct: 37  NTFGFPREFIYYLVELLKDSLLRRTQ-----RSRAISPDVQILAALGFYTSGSFQSKMGD 96

Query: 118 SFGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETT 177
           + G++Q+S+S+      +A+ EK    + +   E    + K +F +I G+PN  GVV+  
Sbjct: 97  AIGISQASMSRCVSNVTKALIEKAPEFIGFTRDEATKQQFKDEFYRIAGIPNVTGVVDCA 156

Query: 178 HIMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFF 237
           HI +  P ++ ++  +++++   S+  Q++ D         T WPGSL+D  V + S   
Sbjct: 157 HIAIKAPNADDSS--YVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVA 216

Query: 238 KLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLP-DYQAEFNKRHF 297
           KL ++            E+ + G +++GD+ +PL  WL+TP Q    P DY+  +N  H 
Sbjct: 217 KLFEE-----------QENDDEG-WLLGDNRYPLKKWLMTPVQSPESPADYR--YNLAHT 276

Query: 298 ATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLLHNI 342
            T  +  R    ++  ++ + G    + + P+K     II  CC+LHNI
Sbjct: 277 TTHEIVDRTFRAIQTRFRCLDGAKGYLQYSPEK--CSHIIQACCVLHNI 302

BLAST of CSPI05G09500 vs. ExPASy Swiss-Prot
Match: Q8BR93 (Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 1.2e-25
Identity = 93/340 (27.35%), Positives = 154/340 (45.29%), Query Frame = 0

Query: 60  FKISRKTFSYICSL----------VKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSG 119
           FK+   T  Y+ S+          + E++ A  S  T    + +S   Q+  AL    SG
Sbjct: 25  FKLDDVTDEYLMSMYGFPRQFIYFLVELLGASLSRPTQ-RSRAISPETQILAALGFYTSG 84

Query: 120 ESLSNIGDSFGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPN 179
              + +GD+ G++Q+S+S+      EA+ E+    + +P  E  +  +K +F  + G+P 
Sbjct: 85  SFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPVDEAAVQSLKDEFYGLAGMPG 144

Query: 180 CCGVVETTHIMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDAL 239
             GV +  H+ +  P +E  +  +++R+   S+   V+ D       + T WPGSL D  
Sbjct: 145 VIGVADCIHVAIKAPNAEDLS--YVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCA 204

Query: 240 VLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQA 299
           VLQ S      + G                  +++GDS F L  WLLTP     +P+  A
Sbjct: 205 VLQRSSLTSQFETG-------------MPKDSWLLGDSSFFLRSWLLTPLP---IPETAA 264

Query: 300 E--FNKRHFATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLLHNIVI 359
           E  +N+ H AT  V +R L  L   ++ + G    + + P+K     IIL CC+LHNI +
Sbjct: 265 EYRYNRAHSATHSVIERTLQTLCCRFRCLDGSKGALQYSPEK--CSHIILACCVLHNISL 324

Query: 360 DMEDEV-QDEMPLSHHHDPSYRQQSCEFVDNTASISREKL 383
           D   +V    +P      P    +  E +D  A   R++L
Sbjct: 325 DHGMDVWSSPVPGPIDQPPEGEDEHMESLDLEADRIRQEL 343

BLAST of CSPI05G09500 vs. ExPASy Swiss-Prot
Match: Q96MB7 (Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 2.0e-25
Identity = 93/346 (26.88%), Positives = 156/346 (45.09%), Query Frame = 0

Query: 60  FKISRKTFSYICSL----------VKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSG 119
           FK+   T  Y+ S+          + E++ A  S  T    + +S   QV  AL    SG
Sbjct: 25  FKLDDVTDEYLMSMYGFPRQFIYYLVELLGANLSRPTQ-RSRAISPETQVLAALGFYTSG 84

Query: 120 ESLSNIGDSFGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPN 179
              + +GD+ G++Q+S+S+      EA+ E+    + +P+ E  +  +K +F  + G+P 
Sbjct: 85  SFQTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPG 144

Query: 180 CCGVVETTHIMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDAL 239
             GVV+  H+ +  P +E  +  +++R+   S+   ++ D       + T WPGSL D  
Sbjct: 145 VMGVVDCIHVAIKAPNAEDLS--YVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCA 204

Query: 240 VLQSSGFFKLSQDGERLNGKKMKLSESSELG----EYIIGDSGFPLLPWLLTPYQGKGLP 299
           VLQ S                  LS   E G     +++GDS F L  WL+TP     +P
Sbjct: 205 VLQQS-----------------SLSSQFEAGMHKDSWLLGDSSFFLRTWLMTPLH---IP 264

Query: 300 DYQAE--FNKRHFATRLVAQRALTRLKEMWKIIKG----VMWKPDKHRLPRIILVCCLLH 359
           +  AE  +N  H AT  V ++    L   ++ + G    + + P+K     IIL CC+LH
Sbjct: 265 ETPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKS--SHIILACCVLH 324

Query: 360 NIVIDMEDEV-QDEMPLSHHHDPSYRQQSCEFVDNTASISREKLSM 385
           NI ++   +V    M       P    +  E +D  A   R++L +
Sbjct: 325 NISLEHGMDVWSSPMTGPMEQPPEEEYEHMESLDLEADRIRQELML 345

BLAST of CSPI05G09500 vs. ExPASy TrEMBL
Match: A0A0A0KS64 (DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE=3 SV=1)

HSP 1 Score: 800.4 bits (2066), Expect = 3.3e-228
Identity = 391/392 (99.74%), Positives = 391/392 (99.74%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG
Sbjct: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120

Query: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIM 180
           LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIK KFKKIRGLPNCCGVVETTHIM
Sbjct: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTHIM 180

Query: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS
Sbjct: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300
           QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL
Sbjct: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           PSYRQQSCEFVDNTASISREKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of CSPI05G09500 vs. ExPASy TrEMBL
Match: A0A1S3CEZ1 (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=3 SV=1)

HSP 1 Score: 796.6 bits (2056), Expect = 4.8e-227
Identity = 387/392 (98.72%), Positives = 391/392 (99.74%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG
Sbjct: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120

Query: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIM 180
           LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIK KFKKIRGLPNCCGV+ETTHIM
Sbjct: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTHIM 180

Query: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSD+LVLQSSGFFKLS
Sbjct: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300
           QDGERLNGKKM+LSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL
Sbjct: 241 QDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           PSYRQQSCEFVDNTASI+REKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of CSPI05G09500 vs. ExPASy TrEMBL
Match: A0A6J1CCK2 (protein ALP1-like OS=Momordica charantia OX=3673 GN=LOC111009982 PE=3 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 1.1e-212
Identity = 363/393 (92.37%), Positives = 377/393 (95.93%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKN-TKFESV 60
           MGPIRGFKRKKK EKKVDQNV A+ASLSSQ QPLDWWD+FSQRITGPLSQSKN TKFESV
Sbjct: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60

Query: 61  FKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSF 120
           FKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLS+NDQVAVALRRL SGESLS IGDSF
Sbjct: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120

Query: 121 GLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHI 180
           G+NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IK KFKKI+GLPNCCGV+ETTHI
Sbjct: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180

Query: 181 MMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 240
           MMTLPT+ES NG+WLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTAESXNGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATR 300
           SQDGERLNGK MKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRH+ATR
Sbjct: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393

BLAST of CSPI05G09500 vs. ExPASy TrEMBL
Match: A0A6J1K3E1 (protein ALP1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490769 PE=3 SV=1)

HSP 1 Score: 748.4 bits (1931), Expect = 1.5e-212
Identity = 362/394 (91.88%), Positives = 379/394 (96.19%), Query Frame = 0

Query: 1   MGPIRGFKRK--KKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRK  KK +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVQQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IK KFKKIRGLPNCCGV+ETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 LSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300
            SQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF+T
Sbjct: 241 RSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           HDPSYRQQSC+FVDNTASI+REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of CSPI05G09500 vs. ExPASy TrEMBL
Match: A0A6J1FP85 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111446995 PE=3 SV=1)

HSP 1 Score: 745.3 bits (1923), Expect = 1.3e-211
Identity = 360/392 (91.84%), Positives = 377/392 (96.17%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKK   KKVDQNV   +SL+SQ QPLDWWDEFSQRITGPLS+SKNT FESVF
Sbjct: 1   MGPIRGFKRKK---KKVDQNVLVPSSLTSQPQPLDWWDEFSQRITGPLSESKNTNFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYI SLVKE MMAKTS+FTDLNGKPLS+NDQVAVALRRL SGESLSNIGDSFG
Sbjct: 61  KISRKTFSYISSLVKEAMMAKTSNFTDLNGKPLSINDQVAVALRRLSSGESLSNIGDSFG 120

Query: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIM 180
           +NQSSVSQITWRFVEAMEEKGLHHLSWPSTEE MD+IK KFKKI+GLPNCCGV+ETTHIM
Sbjct: 121 MNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEGMDEIKSKFKKIKGLPNCCGVIETTHIM 180

Query: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPT+ESA+G+WLDREKNCSM+LQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS
Sbjct: 181 MTLPTTESAHGVWLDREKNCSMLLQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300
           QDGERLNGKKMKLSESSE+GEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHFATRL
Sbjct: 241 QDGERLNGKKMKLSESSEVGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRI+LVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIVLVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           PSYRQQSCEFVDNTAS++REKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASMAREKLSMYLSGKLPP 389

BLAST of CSPI05G09500 vs. NCBI nr
Match: XP_004147700.1 (protein ALP1-like [Cucumis sativus] >KGN50531.1 hypothetical protein Csa_000507 [Cucumis sativus])

HSP 1 Score: 800.4 bits (2066), Expect = 6.8e-228
Identity = 391/392 (99.74%), Positives = 391/392 (99.74%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG
Sbjct: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120

Query: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIM 180
           LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIK KFKKIRGLPNCCGVVETTHIM
Sbjct: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTHIM 180

Query: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS
Sbjct: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300
           QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL
Sbjct: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           PSYRQQSCEFVDNTASISREKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of CSPI05G09500 vs. NCBI nr
Match: XP_008461643.1 (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 796.6 bits (2056), Expect = 9.8e-227
Identity = 387/392 (98.72%), Positives = 391/392 (99.74%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60

Query: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG
Sbjct: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120

Query: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIM 180
           LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIK KFKKIRGLPNCCGV+ETTHIM
Sbjct: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTHIM 180

Query: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSD+LVLQSSGFFKLS
Sbjct: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFKLS 240

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300
           QDGERLNGKKM+LSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL
Sbjct: 241 QDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360

Query: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           PSYRQQSCEFVDNTASI+REKLSMYLSGKLPP
Sbjct: 361 PSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of CSPI05G09500 vs. NCBI nr
Match: XP_038891834.1 (protein ALP1-like [Benincasa hispida])

HSP 1 Score: 784.6 bits (2025), Expect = 3.9e-223
Identity = 380/392 (96.94%), Positives = 388/392 (98.98%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 60
           MGPIRGFKRKKKVEKKVDQNVFA+ASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF
Sbjct: 134 MGPIRGFKRKKKVEKKVDQNVFAAASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVF 193

Query: 61  KISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFG 120
           KISRKTFSYICSLVKEVMMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIG+SFG
Sbjct: 194 KISRKTFSYICSLVKEVMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGESFG 253

Query: 121 LNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIM 180
           +NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IK KFKKIRGLPNCCGV+ETTHIM
Sbjct: 254 MNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIM 313

Query: 181 MTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 240
           MTLPT+ESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS
Sbjct: 314 MTLPTTESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLS 373

Query: 241 QDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRL 300
           QD ERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHFATRL
Sbjct: 374 QDSERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHFATRL 433

Query: 301 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 360
           VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD
Sbjct: 434 VAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD 493

Query: 361 PSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           PSYRQQSCEFVDNTASI+REKLSMYLSGKLPP
Sbjct: 494 PSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 525

BLAST of CSPI05G09500 vs. NCBI nr
Match: XP_022138922.1 (protein ALP1-like [Momordica charantia])

HSP 1 Score: 748.8 bits (1932), Expect = 2.3e-212
Identity = 363/393 (92.37%), Positives = 377/393 (95.93%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKN-TKFESV 60
           MGPIRGFKRKKK EKKVDQNV A+ASLSSQ QPLDWWD+FSQRITGPLSQSKN TKFESV
Sbjct: 1   MGPIRGFKRKKKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFESV 60

Query: 61  FKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSF 120
           FKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLS+NDQVAVALRRL SGESLS IGDSF
Sbjct: 61  FKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGDSF 120

Query: 121 GLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHI 180
           G+NQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMD+IK KFKKI+GLPNCCGV+ETTHI
Sbjct: 121 GMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETTHI 180

Query: 181 MMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKL 240
           MMTLPT+ES NG+WLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFKL
Sbjct: 181 MMTLPTAESXNGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFFKL 240

Query: 241 SQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATR 300
           SQDGERLNGK MKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRH+ATR
Sbjct: 241 SQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYATR 300

Query: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360
           LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH
Sbjct: 301 LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHH 360

Query: 361 DPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           D  YRQQSC+FVDNTAS+ REKLSMYLSGKLPP
Sbjct: 361 DSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393

BLAST of CSPI05G09500 vs. NCBI nr
Match: XP_022995175.1 (protein ALP1-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 748.4 bits (1931), Expect = 3.1e-212
Identity = 362/394 (91.88%), Positives = 379/394 (96.19%), Query Frame = 0

Query: 1   MGPIRGFKRK--KKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRK  KK +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVQQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IK KFKKIRGLPNCCGV+ETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 LSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300
            SQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF+T
Sbjct: 241 RSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 393
           HDPSYRQQSC+FVDNTASI+REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of CSPI05G09500 vs. TAIR 10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 502.3 bits (1292), Expect = 3.6e-142
Identity = 246/407 (60.44%), Positives = 304/407 (74.69%), Query Frame = 0

Query: 1   MGPIRGFKRKKKVEKKVDQNVFASASLS------------------SQLQPLDWWDEFSQ 60
           MGPI+  K+KK+ EKKVD+NV  +A+ +                  S  Q LDWWD FS+
Sbjct: 1   MGPIKTIKKKKRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSR 60

Query: 61  RITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVA 120
           RI G  +  K   FESVFKISRKTF YICSLVK    AK ++F+D NG PLSLND+VAVA
Sbjct: 61  RIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAVA 120

Query: 121 LRRLCSGESLSNIGDSFGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFK 180
           LRRL SGESLS IG++FG+NQS+VSQITWRFVE+MEE+ +HHLSWPS    +D+IK KF+
Sbjct: 121 LRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSKFE 180

Query: 181 KIRGLPNCCGVVETTHIMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWP 240
           KI GLPNCCG ++ THI+M LP  E +N +WLD EKN SM LQ +VDP+MRF D+I GWP
Sbjct: 181 KISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAGWP 240

Query: 241 GSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGK 300
           GSL+D +VL++SGF+KL + G+RLNG+K+ LSE +EL EYI+GDSGFPLLPWLLTPYQGK
Sbjct: 241 GSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGK 300

Query: 301 GLPDYQAEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIV 360
                Q EFNKRH      AQ AL++LK+ W+II GVMW PD++RLPRII VCCLLHNI+
Sbjct: 301 PTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNII 360

Query: 361 IDMEDEVQDEMPLSHHHDPSYRQQSCEFVDNTASISREKLSMYLSGK 390
           IDMED+  D+ PLS  HD +YRQ+SC+  D  +S+ R++LS  L GK
Sbjct: 361 IDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402

BLAST of CSPI05G09500 vs. TAIR 10
Match: AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 352.4 bits (903), Expect = 4.6e-97
Identity = 167/380 (43.95%), Positives = 255/380 (67.11%), Query Frame = 0

Query: 9   RKKKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGP-LSQSKNTKFESVFKISRKTF 68
           + KK+ K  ++    +  L  +    DWWD F  R + P +   ++  F+  F+ S+ TF
Sbjct: 17  KAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTF 76

Query: 69  SYICSLVKEVMMAK-TSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGLNQSSV 128
           SYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+S  ++G +FG+ QS+V
Sbjct: 77  SYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTV 136

Query: 129 SQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIMMTLPTS 188
           SQ+TWRF+EA+EE+  HHL WP ++  +++IK KF+++ GLPNCCG ++TTHI+MTLP  
Sbjct: 137 SQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAV 196

Query: 189 ESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERL 248
           ++++  W D+EKN SM LQ + D EMRF +++TGWPG ++ + +L+ SGFFKL ++ + L
Sbjct: 197 QASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQIL 256

Query: 249 NGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFATRLVAQRAL 308
           +G    LS+ +++ EY++G   +PLLPWL+TP+      D    FN+RH   R VA  A 
Sbjct: 257 DGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAF 316

Query: 309 TRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQQ 368
            +LK  W+I+  VMW+PD+ +LP IILVCCLLHNI+ID  D +Q+++PLS HHD  Y  +
Sbjct: 317 QQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYADR 376

Query: 369 SCEFVDNTASISREKLSMYL 387
            C+  +   S  R  L+ +L
Sbjct: 377 YCKQTEPLGSELRGCLTEHL 394

BLAST of CSPI05G09500 vs. TAIR 10
Match: AT5G12010.1 (unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, plasma membrane, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G29780.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 140.6 bits (353), Expect = 2.7e-33
Identity = 90/324 (27.78%), Positives = 162/324 (50.00%), Query Frame = 0

Query: 36  WWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSL 95
           WW+E S R+  P        F+  F++S+ TF  IC  +   +  + ++  +     + +
Sbjct: 161 WWEECS-RLDYP-----EEDFKKAFRMSKSTFELICDELNSAVAKEDTALRN----AIPV 220

Query: 96  NDQVAVALRRLCSGESLSNIGDSFGLNQSSVSQITWRFVEAMEEKGL-HHLSWPSTEEDM 155
             +VAV + RL +GE L  +   FGL  S+  ++     +A+++  +  +L WP  +E +
Sbjct: 221 RQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPD-DESL 280

Query: 156 DKIKCKFKKIRGLPNCCGVVETTHIMMTLPTSESAN-----GIWLDREKNCSMILQVIVD 215
             I+ +F+ + G+PN  G + TTHI +  P    A+         +++ + S+ +Q +V+
Sbjct: 281 RNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVN 340

Query: 216 PEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGF 275
           P+  F D+  GWPGS+ D  VL+ S  ++ + +G  L G             ++ G  G 
Sbjct: 341 PKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------WVAGGPGH 400

Query: 276 PLLPWLLTPYQGKGLPDYQAEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLP 335
           PLL W+L PY  + L   Q  FN++    + VA+ A  RLK  W  ++    +     LP
Sbjct: 401 PLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQ-KRTEVKLQDLP 460

Query: 336 RIILVCCLLHNIVIDMEDEVQDEM 354
            ++  CC+LHNI    E++++ E+
Sbjct: 461 TVLGACCVLHNICEMREEKMEPEL 460

BLAST of CSPI05G09500 vs. TAIR 10
Match: AT4G29780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519; Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 127.1 bits (318), Expect = 3.1e-29
Identity = 100/363 (27.55%), Positives = 166/363 (45.73%), Query Frame = 0

Query: 35  DWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLS 94
           DWWD  S+            +F   F++S+ TF+ IC  +   +  K +   D    P  
Sbjct: 198 DWWDRVSR------PDFPEDEFRREFRMSKSTFNLICEELDTTVTKKNTMLRDAIPAP-- 257

Query: 95  LNDQVAVALRRLCSGESLSNIGDSFGLNQSSVSQITWRFVEAMEEKGL-HHLSWPSTEED 154
              +V V + RL +G  L ++ + FGL  S+  ++      A+ +  +  +L WPS + +
Sbjct: 258 --KRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPS-DSE 317

Query: 155 MDKIKCKFKKIRGLPNCCGVVETTHIMMTLPTSESA---NGIWLDREK--NCSMILQVIV 214
           ++  K KF+ +  +PN  G + TTHI +  P    A   N    +R +  + S+ +Q +V
Sbjct: 318 INSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVV 377

Query: 215 DPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSG 274
           + +  F D+  G PGSL+D  +L+ S          R    +  L +S     +I+G+SG
Sbjct: 378 NADGIFTDVCIGNPGSLTDDQILEKSSL-------SRQRAARGMLRDS-----WIVGNSG 437

Query: 275 FPLLPWLLTPYQGKGLPDYQAEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRL 334
           FPL  +LL PY  + L   Q  FN+     + +A  A  RLK  W  ++    +     L
Sbjct: 438 FPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQ-KRTEVKLQDL 497

Query: 335 PRIILVCCLLHNIVIDMEDEVQDEMPLSHHHD---PSYRQQSCEFVDNTASISREKLSMY 389
           P ++  CC+LHNI    ++E+  E+      D   P    +S   V+    IS   L   
Sbjct: 498 PYVLGACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSASAVNTRDHISHNLLHRG 536

BLAST of CSPI05G09500 vs. TAIR 10
Match: AT1G72270.1 (CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G27010.1); Has 772 Blast hits to 657 proteins in 120 species: Archae - 0; Bacteria - 0; Metazoa - 344; Fungi - 94; Plants - 322; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )

HSP 1 Score: 97.4 bits (241), Expect = 2.6e-20
Identity = 84/327 (25.69%), Positives = 141/327 (43.12%), Query Frame = 0

Query: 25  ASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKTSS 84
           +S SS +    W++ F   +T       + ++   F++S+ TF  + S++     +   S
Sbjct: 69  SSSSSAITTTTWFNRF---LTSATEDEDDPRWCLYFRMSKSTFFSLYSILSH---SSLPS 128

Query: 85  FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGLNQSS-VSQITWRFVEAMEEKGLH 144
           F              A  + RL  G S   +   FG + +S  S+  +   + + EK   
Sbjct: 129 F--------------AATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEK--- 188

Query: 145 HLSWPSTEEDMDKIKCKFKKIRGLPNCCGVVETTHIMMTLPTSESANGIWLDREKNCSMI 204
                   + +D  K  F     LPNC GVV                G  L  +   S++
Sbjct: 189 ------LSQQLDDPKPDFSP-NLLPNCYGVVGFGRF--------EVKGKLLGAKG--SIL 248

Query: 205 LQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYI 264
           +Q +VD   RF DI  GWP ++    + + +  F +++  E L+G   KL     +  YI
Sbjct: 249 VQALVDSNGRFVDISAGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVPRYI 308

Query: 265 IGDSGFPLLPWLLTPYQ-GKGLPDYQAEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWK 324
           +GDS  PLLPWL+TPY        ++ EFN          + A  +++  W+I+    WK
Sbjct: 309 LGDSCLPLLPWLVTPYDLTSDEESFREEFNNVVHTGLHSVEIAFAKVRARWRIL-DKKWK 352

Query: 325 PDK-HRLPRIILVCCLLHNIVIDMEDE 349
           P+    +P +I   CLLHN +++  D+
Sbjct: 369 PETIEFMPFVITTGCLLHNFLVNSGDD 352
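
The percentages in each HSP summary line above follow directly from the match counts (for example, 363 identical positions over an alignment length of 393 gives 92.37%). The following is a minimal sketch, not part of the original analysis, of how such a summary line could be parsed and its percentages recomputed. The names `parse_hsp_summary` and `percent` are hypothetical helpers introduced here for illustration only.

```python
import re

# Matches the "Identity = a/b (p%), Positives = c/d (q%)" part of an HSP
# summary line. Some report renderers spell the second label "Postives",
# so both spellings are accepted (assumption made for robustness).
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(numerator, denominator):
    """Return numerator/denominator as a percentage rounded to 2 decimals."""
    return round(100.0 * numerator / denominator, 2)


def parse_hsp_summary(line):
    """Extract identity/positive counts and reported percentages from a line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Recomputed from the counts, for comparison with the reported values.
        "computed_identity_pct": percent(int(ident), int(ident_len)),
        "computed_positive_pct": percent(int(pos), int(pos_len)),
    }


if __name__ == "__main__":
    line = "Identity = 363/393 (92.37%), Positives = 377/393 (95.93%), Query Frame = 0"
    for key, value in parse_hsp_summary(line).items():
        print(key, value)
```

Comparing the reported and recomputed fields is a quick consistency check when such alignments are copied out of a report by hand.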

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q9M2U3      5.0e-141  60.44     Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
Q94K49      6.4e-96   43.95     Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
Q6AZB8      1.4e-26   26.99     Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1
Q8BR93      1.2e-25   27.35     Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1
Q96MB7      2.0e-25   26.88     Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1

Match Name  E-value   Identity  Description
A0A0A0KS64  3.3e-228  99.74     DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE...
A0A1S3CEZ1  4.8e-227  98.72     putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=3 SV=1
A0A6J1CCK2  1.1e-212  92.37     protein ALP1-like OS=Momordica charantia OX=3673 GN=LOC111009982 PE=3 SV=1
A0A6J1K3E1  1.5e-212  91.88     protein ALP1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490769 PE=3 SV...
A0A6J1FP85  1.3e-211  91.84     protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111446995 PE=3 SV=1

Match Name      E-value   Identity  Description
XP_004147700.1  6.8e-228  99.74     protein ALP1-like [Cucumis sativus] >KGN50531.1 hypothetical protein Csa_000507 ...
XP_008461643.1  9.8e-227  98.72     PREDICTED: putative nuclease HARBI1 [Cucumis melo]
XP_038891834.1  3.9e-223  96.94     protein ALP1-like [Benincasa hispida]
XP_022138922.1  2.3e-212  92.37     protein ALP1-like [Momordica charantia]
XP_022995175.1  3.1e-212  91.88     protein ALP1-like isoform X1 [Cucurbita maxima]

Match Name   E-value   Identity  Description
AT3G55350.1  3.6e-142  60.44     PIF / Ping-Pong family of plant transposases
AT3G63270.1  4.6e-97   43.95     CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G12010.1  2.7e-33   27.78     unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, ...
AT4G29780.1  3.1e-29   27.55     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G72270.1  2.6e-20   25.69     CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR0217...

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                Source   Source Term      Source Description  Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM     PF13359          DDE_Tnp_4           coord: 175..340; e-value: 5.8E-29; score: 100.8
None       No IPR available                               PANTHER  PTHR22930        UNCHARACTERIZED     coord: 1..388
None       No IPR available                               PANTHER  PTHR22930:SF205  PROTEIN ALP1-LIKE   coord: 1..388

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI05G09500.1  CSPI05G09500.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0046872      metal ion binding