CmoCh04G012760 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh04G012760
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: protein ALP1-like
Location: Cmo_Chr04: 6470229 .. 6476633 (-)
RNA-Seq Expression: CmoCh04G012760
Synteny: CmoCh04G012760
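
The Location field can be turned into a chromosome, strand and feature span with a few lines of code. The sketch below assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state; `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cmo_Chr04: 6470229 .. 6476633 (-)")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, strand, span)
```

Under the inclusive-coordinate assumption the feature spans 6405 positions on the minus strand; with half-open coordinates the span would be one less.
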
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGCGTCACTTCTCTTTCTTTTTTACAACTTCCATCGCCTTCCTCTAAACTTCTATTCCATCTTCTTCCATTCATCATTGCATGCTTTGGAATCAAAACCAGAGGAAATACACACATACCCATCAATACCCATCAATACCCATCATGTCTTTTTAACCCAAATACTTGAACCCCTCCTTTCCTTTTGAGTTATTGTTTCATAAATTTCGAATTTGAGCGTTTTCTTTATGTGAATTCTATCGTATTCTTCGATTCTCTTGGTGGGTTCTTGTGGGTGTTTCTTCTTTTGCCTCATCTTTGCTTCTTTTTCCAGTTTCGAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGTAAACCATTTTTTGCTTCTCTTCTACTTGAATTTGTGCATTGTTATTGCTCTTAGTTGCTTTCTGTTTATTGGGTCTCTCCGTTGAATTCTTTTGCTGTTTGGTGAGCTTTCTGGACGGTGGTGGAATGTAGTGTTTAATAATCATATATCTTTGATGCCTGCAATTATAGAACCGTGATATGTTCCTAATAACATTGAGAGGGTCAGAAGGGGCAGCCTGTTGAAGTGATCCTATGGGCATGGAGATGGATCAAATCTGTTTAGCAGCAGATATTAGCTTAGATATGTTAGTTCATTTGATAATTGATGTGTACAAGTTGTGGCTACTCACTTAGCTGGTTTATAGAGTGTTAACTGTGTTGTGTTCTGATGTTCTTTCTTTTTCTCATTGAATTGTATCAAAGTACTAATTATTTCTTAGCTGCTCCTACTTTTTTGAGCTTTGGTCATTGTAACAGCCCCAATCCCACCGCTAGTAGATATTGTCCTATTTCGGCTTTCCCTTTCGAGCTTCCCCTCAAAGTTTTTAAAATGCATCTGCTAGGAAGAGGTTTCCACACCCTTATAAACAATGCTTCGTTCTCCTCTCCAACCGATGTGGGGTCTCACAATCCACCCCCTCCTGGGCCCAGTATCCTTGTTAGCACTCGTTCCCTTCTCCCACCCCTCTTTGGGGCCCAGCGTTCTTGCTAGTACTCGGTCCCCTCTCCAATCGATGTGGGATCTCACATTTTGTGTCAGCATTGCTGACAATAAACATCTTTAGAGAACCGAGTCCAATGTGATCCAAAACTCTCGTATTCTTCTTTGTTGATTCGTATTAATGATATCACTCTGACTCTAAATATAGTAAATTGAAGTTAAAATCTGTCATCCGTTGACATTTGTAAGAAGGATGTAACGACCCAGATCCACCGCCAGTAGATATTGTCCTCTTTCGGTTTTCCCTTTCTGGCTTCTCTTCAAGGCTTTAAAATGCGTCTGCTGGGGGAAGGGTGATGTCCAACATTGGTTGGGGAGGAGAACAAAACACCATTTATAAGGGTGTGGAAACTTTCCCCTAGCAAACGCGTTTTAAAACCTTGAGGGGAAGTCCGAAAGGGAAAACCCAAAAAAGACAACATCTGCTAGCAGTGGATCTGGATCCTTAGAAAGACAGTGTCTGAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTC
TGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGAAATGGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCGAAGGATGAGTCTGAAAGTAATGAAGTACATAATACAGAGAAGAAGGGCGGGGATTGAACAGCATTCAACATCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCCTAGAAATGGAAGGATGACTACAAATATCTTTATCTGGTTGTTTTAGAGGCATCTTGGAAACGGGTATGTTTAAGTTTCGATTCGAGTCAATGTAGTAATGCTACAATTGCATAAAA
GTTCATAGAAAATGGATCTCTTCTAGGTGTTAAGATTTGTTTCAACCTACTGATATGGCTAAGTGCTCAATATTTGATTTTGGTATTCCAGAGGACCTAACAACGACTGCCGAAGATTAACGCAGGCTGGGGTCTAGTTTTATTAGATATCAGTTCTTATATTTTGATATTCCATGGTTCTCAAATGTTGGGTACACATTTAGCAGTTACTTTGTTCTGGCCATGATTTATGACATGAAGTAAATCTTTACTTGTAGCTTAAAACCAAAGGATATGACTAAATGTGCGGATGAATGTGTAACTGCTCAAGCTCACCGTAGCAGATGTTGTCTTCTTTGGGCTTTCCCCTCAAGATTTTAAAACGCGTTTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAAGTTTCGTTTCCCTCTCCAATCAATGTGGGGTCTCACAATCTACTTCCCTTGGGGGCCCAACATCCTCGCTGGCACACCATTCGGTGCCTGGCCCTGATACCATTTGTAACCGCACAAGCCCACCACGAGCAGATATTATCCTCTTTGGGTTTTCCCTTCCGAGCTTCCCCTCAAGGTTTTAAAATGCGTCTATTAGAGAGAGGTTTCCACACCCTTATAAACAATATTTCGTTCCCCTCTCTAACTAGTGTGGGATATCACAATCCACCCCATTGGGAGCCCAACATCCTCGTTGGCGCACCTCCCCTGGTGCCTGGCTCTAATACCATTTGTAACAGCCCAACCCCAGTCCTCTTTGGGCTTTTCCTTCCGACCTCCCCCTCAAAGTTTTAAAACGTTTCTGTTAGGGAGAGGTTTCCACACCCTTTTAAAGAATGTTTTGTTCCCCTCTCCAACCAATGTAGGATCTCACACAATGAAATCGTCGAATTCTATCGCCTCACCAATGTAGTTTATCTTTGTTTGGTGATGAATGAGTTGGATATATTGATCTGATATTAGTTAGTCTCACTAATAATGTCACTTGTTATATCTTTACTGTGAGTTCTCTCTCTTTCTATCTGTCATATCTGAAAAGGTACAATGCCTTTTCTTAGGAACAGAACATTTGCAGCAGGGACTGATTATGAGTTTTATCCTTGTTTTTTTGCCTCCAATCAGGACCATTATCTCAGTCCAAGAACACCAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTCAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGACTTAAACGGCAAGCCTTTGTCCCTAAATGACCAAGTCGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATCGGTGACTCGTTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCGATGGAAGAGAAAGGCATTCGCCATCTCTCGTGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCAAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCCAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAGCTCTCAGAAAGTGCAGAACTAGGAGAGTATATTATAGGAGACTCTGGTTTTCCACTCTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCGGTACCCGGCTGGTGGCTCGAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTT
CACAATATAATGATCGATGTGGAGGACGAGATGCAAGACGAAATGCCATTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGCTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCCTAAGAGAACCTCGTCGTGCACGAAAACATTCTTTTTTTTCTTTTCTTTTTGATAGATTGATCTGTGCTGCTGTTTATTCAAATTGCCCAAAGTGTTGTTCACTCATTTAGATGCTATTGATGGTTCATTATAGATATCTCTGGTGATTATGGTCCTGCTATATCTAAACAGATAGATACCCAGTTTCCTTTTCTACAAACATTATGCTGAACTGTTTAGAACTTTGAGCTGAAGAAATTTCAC

mRNA sequence

TGCGTCACTTCTCTTTCTTTTTTACAACTTCCATCGCCTTCCTCTAAACTTCTATTCCATCTTCTTCCATTCATCATTGCATGCTTTGGAATCAAAACCAGAGGAAATACACACATACCCATCAATACCCATCAATACCCATCATGTCTTTTTAACCCAAATACTTGAACCCCTCCTTTCCTTTTGAGTTATTGTTTCATAAATTTCGAATTTGAGCGTTTTCTTTATGTGAATTCTATCGTATTCTTCGATTCTCTTGGTGGGTTCTTGTGGGTGTTTCTTCTTTTGCCTCATCTTTGCTTCTTTTTCCAGTTTCGAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGACCATTATCTCAGTCCAAGAACACCAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTCAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGACTTAAACGGCAAGCCTTTGTCCCTAAATGACCAAGTCGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATCGGTGACTCGTTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCGATGGAAGAGAAAGGCATTCGCCATCTCTCGTGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCAAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCCAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAGCTCTCAGAAAGTGCAGAACTAGGAGAGTATATTATAGGAGACTCTGGTTTTCCACTCTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCGGTACCCGGCTGGTGGCTCGAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAATGATCGATGTGGAGGACGAGATGCAAGACGAAATGCCATTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGCTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCCTAAGAGAACCTCGTCGTGCACGAAAACATTCTTTTTTTTCTTTTCTTTTTGATAGATTGATCTGTGCTGCTGTTTATTCAAATTGCCCAAAGTGTTGTTCACTCATTTAGATGCTATTGATGGTTCATTATAGATATCTCTGGTGATTATGGTCCTGCTATATCTAAACAGATAGATACCCAGTTTCCTTTTCTACAAACATTATGCTGAACTGTTTAGAACTTTGAGCTGAAGAAATTTCAC

Coding sequence (CDS)

ATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGACCATTATCTCAGTCCAAGAACACCAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCTCTTGTCAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGACTTAAACGGCAAGCCTTTGTCCCTAAATGACCAAGTCGCTGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATCGGTGACTCGTTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCGATGGAAGAGAAAGGCATTCGCCATCTCTCGTGGCCTTCAACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCAAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCCAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAGCTCTCAGAAAGTGCAGAACTAGGAGAGTATATTATAGGAGACTCTGGTTTTCCACTCTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCGGTACCCGGCTGGTGGCTCGAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAATGATCGATGTGGAGGACGAGATGCAAGACGAAATGCCATTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGCTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCCTAA

Protein sequence

MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGTRLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHHHDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP
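
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch of that step, assuming the standard nuclear genetic code, the snippet below translates only the first 14 codons and the last four codons of the CDS listed on this page. `translate` is an illustrative helper, not the pipeline that produced the annotation.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

head = translate("ATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAG")
tail = translate("TTAACACCCTAA")
print(head, tail)
```

Here `head` matches the N-terminal residues MGPIRGFKRKKQKK and `tail` the C-terminal residues LTP of the listed protein, with the final TAA read as the stop codon.
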
Homology
BLAST of CmoCh04G012760 vs. ExPASy Swiss-Prot
Match: Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 4.9e-136
Identity = 245/409 (59.90%), Positives = 301/409 (73.59%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLS------------------FQPQPLDWWDEF 60
           MGPI+  K+K  K+A+KKV + V  AA+ +                     Q LDWWD F
Sbjct: 1   MGPIKTIKKK--KRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGF 60

Query: 61  SQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVA 120
           S+RI G  +  K   FESVFKISRKTF YICSLVK    AK +NF+D NG PLSLND+VA
Sbjct: 61  SRRIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVA 120

Query: 121 VALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSK 180
           VALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ I HLSWPS    +D+IKSK
Sbjct: 121 VALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSK 180

Query: 181 FKKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITG 240
           F+KI GLPNCCG I+ THI+M LP  E +N VWLD EKN SM LQ +VDP+MRF D+I G
Sbjct: 181 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAG 240

Query: 241 WPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQ 300
           WPGSL+D +VL++SGF+K  + G+RLNG+K+ LSE  EL EYI+GDSGFPLLPWLLTPYQ
Sbjct: 241 WPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQ 300

Query: 301 GKGLADYQTEFNKRHFGTRLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHN 360
           GK  +  QTEFNKRH      A+ AL++LK+ W+II G+MW PD+++LPRII VCCLLHN
Sbjct: 301 GKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHN 360

Query: 361 IMIDVEDEMQDEMPLSHHHDPSYRQQSCKFVDNTASITREKLSMYLSEK 392
           I+ID+ED+  D+ PLS  HD +YRQ+SCK  D  +S+ R++LS  L  K
Sbjct: 361 IIIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402
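
The percentages in an HSP summary line are the ratio of matching positions to alignment length. The following sketch recomputes them from the counts of the first hit; `hsp_ratios` is a hypothetical helper, and the regular expression only assumes fields of the form `Label = n/m`.

```python
import re

def hsp_ratios(stat_line):
    """Return {label: (matches, length, percent)} for each 'Label = n/m' field."""
    ratios = {}
    for label, n, m in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stat_line):
        n, m = int(n), int(m)
        ratios[label] = (n, m, round(100 * n / m, 2))
    return ratios

ratios = hsp_ratios("Identity = 245/409 (59.90%), Positives = 301/409 (73.59%)")
for label, (n, m, pct) in ratios.items():
    print(f"{label}: {n}/{m} = {pct:.2f}%")
```

Note that the denominator is the alignment length including gaps (409 here), not the query length of 394 residues.
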

BLAST of CmoCh04G012760 vs. ExPASy Swiss-Prot
Match: Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 8.4e-96
Identity = 174/396 (43.94%), Positives = 260/396 (65.66%), Query Frame = 0

Query: 1   MGPIRGFKRKKQ------KKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGP-LSQS 60
           M P++  K+ K+      KK  K   +    A  L  +    DWWD F  R + P +   
Sbjct: 1   MAPVKQKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSD 60

Query: 61  KNTKFESVFKISRKTFSYICSLVKEAMMAK-TSNFTDLNGKPLSLNDQVAVALRRLCSGE 120
           ++  F+  F+ S+ TFSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+
Sbjct: 61  EDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGD 120

Query: 121 SLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNC 180
           S  ++G +FG+ QS+VSQ+TWRF+EA+EE+   HL WP ++  +++IKSKF+++ GLPNC
Sbjct: 121 SQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNC 180

Query: 181 CGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALV 240
           CG I+TTHI+MTLP  ++++  W D+EKN SM LQ + D EMRF +++TGWPG ++ + +
Sbjct: 181 CGAIDTTHIIMTLPAVQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKL 240

Query: 241 LESSGFFKRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTE 300
           L+ SGFFK  ++ + L+G    LS+ A++ EY++G   +PLLPWL+TP+     +D    
Sbjct: 241 LKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVA 300

Query: 301 FNKRHFGTRLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQ 360
           FN+RH   R VA  A  +LK  W+I+  +MW+PD+ KLP IILVCCLLHNI+ID  D +Q
Sbjct: 301 FNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQ 360

Query: 361 DEMPLSHHHDPSYRQQSCKFVDNTASITREKLSMYL 389
           +++PLS HHD  Y  + CK  +   S  R  L+ +L
Sbjct: 361 EDVPLSGHHDSGYADRYCKQTEPLGSELRGCLTEHL 394

BLAST of CmoCh04G012760 vs. ExPASy Swiss-Prot
Match: Q6AZB8 (Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 4.1e-26
Identity = 77/291 (26.46%), Positives = 144/291 (49.48%), Query Frame = 0

Query: 60  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 119
           + F   R+   Y+  L+K++++ +T        + +S + Q+  AL    SG   S +GD
Sbjct: 37  NTFGFPREFIYYLVELLKDSLLRRTQ-----RSRAISPDVQILAALGFYTSGSFQSKMGD 96

Query: 120 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETT 179
           + G++Q+S+S+      +A+ EK    + +   E    Q K +F +I G+PN  GV++  
Sbjct: 97  AIGISQASMSRCVSNVTKALIEKAPEFIGFTRDEATKQQFKDEFYRIAGIPNVTGVVDCA 156

Query: 180 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 239
           HI +  P  + ++  +++++   S+  Q++ D         T WPGSL+D  V + S   
Sbjct: 157 HIAIKAPNADDSS--YVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVA 216

Query: 240 KRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQG-KGLADYQTEFNKRHF 299
           K  ++            E+ + G +++GD+ +PL  WL+TP Q  +  ADY+  +N  H 
Sbjct: 217 KLFEE-----------QENDDEG-WLLGDNRYPLKKWLMTPVQSPESPADYR--YNLAHT 276

Query: 300 GTRLVARRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMI 346
            T  +  R    ++  ++ + G    + + P+  K   II  CC+LHNI +
Sbjct: 277 TTHEIVDRTFRAIQTRFRCLDGAKGYLQYSPE--KCSHIIQACCVLHNISL 304

BLAST of CmoCh04G012760 vs. ExPASy Swiss-Prot
Match: B0BN95 (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 3.9e-24
Identity = 75/292 (25.68%), Positives = 138/292 (47.26%), Query Frame = 0

Query: 60  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 119
           S++   R+   Y+  L+  ++   T        + +S   Q+  AL    SG   + +GD
Sbjct: 37  SMYGFPRQFIYYLVELLGASLSRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGD 96

Query: 120 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETT 179
           + G++Q+S+S+      EA+ E+  + + +P+ E  +  +K +F  + G+P   G ++  
Sbjct: 97  AIGISQASMSRCVANVTEALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCI 156

Query: 180 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 239
           H+ +  P  E  +  +++R+   S+   V+ D       + T WPGSL D  VL+ S   
Sbjct: 157 HVAIKAPNAEDLS--YVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSLS 216

Query: 240 KRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTP-YQGKGLADYQTEFNKRHF 299
            + + G                  +++GDS F L  WLLTP +  +  A+Y+  +N+ H 
Sbjct: 217 SQFETG-------------MPKDSWLLGDSSFFLHTWLLTPLHIPETPAEYR--YNRAHS 276

Query: 300 GTRLVARRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMID 347
            T  V  + L  L   ++ + G    + + P+K     IIL CC+LHNI ++
Sbjct: 277 ATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKSS--HIILACCVLHNISLE 304

BLAST of CmoCh04G012760 vs. ExPASy Swiss-Prot
Match: Q17QR8 (Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1)

HSP 1 Score: 113.6 bits (283), Expect = 5.0e-24
Identity = 73/292 (25.00%), Positives = 138/292 (47.26%), Query Frame = 0

Query: 60  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 119
           S++   R+   Y+  L+  ++   T        + +S   Q+  AL    SG   + +GD
Sbjct: 37  SMYGFPRQFIYYLVELLGASLSRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGD 96

Query: 120 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETT 179
           + G++Q+S+S+      EA+ E+  + + +P+ E  +  +K +F  + G+P   GV++  
Sbjct: 97  AIGISQASMSRCVANVTEALVERASQFIHFPADEASVQALKDEFYGLAGIPGVIGVVDCM 156

Query: 180 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 239
           H+ +  P  E  +  +++R+   S+   ++ D       + T WPGSL D +VL+ S   
Sbjct: 157 HVAIKAPNAEDLS--YVNRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVLQQSSLS 216

Query: 240 KRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTP-YQGKGLADYQTEFNKRHF 299
            + + G                  +++GDS F L  WL+TP +  +  A+Y+  +N  H 
Sbjct: 217 SQFEAGMHKE-------------SWLLGDSSFFLRTWLMTPLHIPETPAEYR--YNMAHS 276

Query: 300 GTRLVARRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMID 347
            T  V  +    L   ++ + G    + + P+K     IIL CC+LHNI ++
Sbjct: 277 ATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSS--HIILACCVLHNISLE 304

BLAST of CmoCh04G012760 vs. ExPASy TrEMBL
Match: A0A6J1GZA0 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111458235 PE=3 SV=1)

HSP 1 Score: 804.7 bits (2077), Expect = 1.7e-229
Identity = 394/394 (100.00%), Positives = 394/394 (100.00%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
           RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH
Sbjct: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 395
           HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of CmoCh04G012760 vs. ExPASy TrEMBL
Match: A0A6J1K3E1 (protein ALP1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490769 PE=3 SV=1)

HSP 1 Score: 796.2 bits (2055), Expect = 6.2e-227
Identity = 389/394 (98.73%), Positives = 392/394 (99.49%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKV QYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVQQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
           RSQDGERLNGKKMKLSES+ELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHF T
Sbjct: 241 RSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMID+EDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 395
           HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of CmoCh04G012760 vs. ExPASy TrEMBL
Match: A0A0A0KS64 (DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE=3 SV=1)

HSP 1 Score: 741.9 bits (1914), Expect = 1.4e-210
Identity = 360/394 (91.37%), Positives = 379/394 (96.19%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRK  KK +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRK--KKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IKSKFKKIRGLPNCCGV+ETTH
Sbjct: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFK
Sbjct: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
            SQDGERLNGKKMKLSES+ELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF T
Sbjct: 241 LSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+ID+EDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 395
           HDPSYRQQSC+FVDNTASI+REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of CmoCh04G012760 vs. ExPASy TrEMBL
Match: A0A1S3CEZ1 (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=3 SV=1)

HSP 1 Score: 739.6 bits (1908), Expect = 6.9e-210
Identity = 359/394 (91.12%), Positives = 378/394 (95.94%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRK  KK +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRK--KKVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IKSKFKKIRGLPNCCGVIETTH
Sbjct: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSD+LVL+SSGFFK
Sbjct: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
            SQDGERLNGKKM+LSES+ELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF T
Sbjct: 241 LSQDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+ID+EDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 395
           HDPSYRQQSC+FVDNTASI REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of CmoCh04G012760 vs. ExPASy TrEMBL
Match: A0A6J1CCK2 (protein ALP1-like OS=Momordica charantia OX=3673 GN=LOC111009982 PE=3 SV=1)

HSP 1 Score: 728.0 bits (1878), Expect = 2.1e-206
Identity = 357/395 (90.38%), Positives = 374/395 (94.68%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKN-TKFE 60
           MGPIRGFKRK  KKA+KKV Q V AAASLS QPQPLDWWD+FSQRITGPLSQSKN TKFE
Sbjct: 1   MGPIRGFKRK--KKAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFE 60

Query: 61  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 120
           SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLS+NDQVAVALRRL SGESLS IGD
Sbjct: 61  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGD 120

Query: 121 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETT 180
           SFGMNQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMDQIKSKFKKI+GLPNCCGVIETT
Sbjct: 121 SFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETT 180

Query: 181 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 240
           HIMMTLPT ES NGVWLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSDALVL+SSGFF
Sbjct: 181 HIMMTLPTAESXNGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFF 240

Query: 241 KRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFG 300
           K SQDGERLNGK MKLSES+ELGEYIIGDSGFPLLPWLLTPYQGKGL+DYQTEFNKRH+ 
Sbjct: 241 KLSQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYA 300

Query: 301 TRLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSH 360
           TRLVA+RALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+ID+EDE+QDEMPLSH
Sbjct: 301 TRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSH 360

Query: 361 HHDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 395
           HHD  YRQQSCKFVDNTAS+ REKLSMYLS KL P
Sbjct: 361 HHDSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393

BLAST of CmoCh04G012760 vs. NCBI nr
Match: XP_022956519.1 (protein ALP1-like [Cucurbita moschata])

HSP 1 Score: 804.7 bits (2077), Expect = 3.6e-229
Identity = 394/394 (100.00%), Positives = 394/394 (100.00%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
           RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH
Sbjct: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 395
           HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of CmoCh04G012760 vs. NCBI nr
Match: KAG6601011.1 (Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 797.0 bits (2057), Expect = 7.5e-227
Identity = 390/393 (99.24%), Positives = 392/393 (99.75%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQ KNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQFKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
           RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMID+EDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLT 394
           HDPSYRQQSCKFVDNTASITREKLSMYLSEKLT
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLT 393

BLAST of CmoCh04G012760 vs. NCBI nr
Match: XP_022995175.1 (protein ALP1-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 796.2 bits (2055), Expect = 1.3e-226
Identity = 389/394 (98.73%), Positives = 392/394 (99.49%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKV QYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVQQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
           RSQDGERLNGKKMKLSES+ELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHF T
Sbjct: 241 RSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMID+EDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 395
           HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of CmoCh04G012760 vs. NCBI nr
Match: XP_023545365.1 (protein ALP1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 794.3 bits (2050), Expect = 4.9e-226
Identity = 387/394 (98.22%), Positives = 392/394 (99.49%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
           RSQDGERLNGKKMKLSE++ELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHF T
Sbjct: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMID+EDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394
           HDPSYRQQSCKFVDNT SITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394

BLAST of CmoCh04G012760 vs. NCBI nr
Match: KAG7031625.1 (Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 789.6 bits (2038), Expect = 1.2e-224
Identity = 388/393 (98.73%), Positives = 391/393 (99.49%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQ KNTKF+ 
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQFKNTKFD- 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300
           RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMID+EDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLT 394
           HDPSYRQQSCKFVDNTASITREKLSMYLSEKLT
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLT 392

BLAST of CmoCh04G012760 vs. TAIR 10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 485.7 bits (1249), Expect = 3.5e-137
Identity = 245/409 (59.90%), Positives = 301/409 (73.59%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLS------------------FQPQPLDWWDEF 60
           MGPI+  K+K  K+A+KKV + V  AA+ +                     Q LDWWD F
Sbjct: 1   MGPIKTIKKK--KRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGF 60

Query: 61  SQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVA 120
           S+RI G  +  K   FESVFKISRKTF YICSLVK    AK +NF+D NG PLSLND+VA
Sbjct: 61  SRRIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVA 120

Query: 121 VALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSK 180
           VALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ I HLSWPS    +D+IKSK
Sbjct: 121 VALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSK 180

Query: 181 FKKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITG 240
           F+KI GLPNCCG I+ THI+M LP  E +N VWLD EKN SM LQ +VDP+MRF D+I G
Sbjct: 181 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAG 240

Query: 241 WPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQ 300
           WPGSL+D +VL++SGF+K  + G+RLNG+K+ LSE  EL EYI+GDSGFPLLPWLLTPYQ
Sbjct: 241 WPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQ 300

Query: 301 GKGLADYQTEFNKRHFGTRLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHN 360
           GK  +  QTEFNKRH      A+ AL++LK+ W+II G+MW PD+++LPRII VCCLLHN
Sbjct: 301 GKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHN 360

Query: 361 IMIDVEDEMQDEMPLSHHHDPSYRQQSCKFVDNTASITREKLSMYLSEK 392
           I+ID+ED+  D+ PLS  HD +YRQ+SCK  D  +S+ R++LS  L  K
Sbjct: 361 IIIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402

BLAST of CmoCh04G012760 vs. TAIR 10
Match: AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 352.1 bits (902), Expect = 6.0e-97
Identity = 174/396 (43.94%), Positives = 260/396 (65.66%), Query Frame = 0

Query: 1   MGPIRGFKRKKQ------KKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGP-LSQS 60
           M P++  K+ K+      KK  K   +    A  L  +    DWWD F  R + P +   
Sbjct: 1   MAPVKQKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSD 60

Query: 61  KNTKFESVFKISRKTFSYICSLVKEAMMAK-TSNFTDLNGKPLSLNDQVAVALRRLCSGE 120
           ++  F+  F+ S+ TFSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+
Sbjct: 61  EDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGD 120

Query: 121 SLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNC 180
           S  ++G +FG+ QS+VSQ+TWRF+EA+EE+   HL WP ++  +++IKSKF+++ GLPNC
Sbjct: 121 SQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNC 180

Query: 181 CGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALV 240
           CG I+TTHI+MTLP  ++++  W D+EKN SM LQ + D EMRF +++TGWPG ++ + +
Sbjct: 181 CGAIDTTHIIMTLPAVQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKL 240

Query: 241 LESSGFFKRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTE 300
           L+ SGFFK  ++ + L+G    LS+ A++ EY++G   +PLLPWL+TP+     +D    
Sbjct: 241 LKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVA 300

Query: 301 FNKRHFGTRLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQ 360
           FN+RH   R VA  A  +LK  W+I+  +MW+PD+ KLP IILVCCLLHNI+ID  D +Q
Sbjct: 301 FNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQ 360

Query: 361 DEMPLSHHHDPSYRQQSCKFVDNTASITREKLSMYL 389
           +++PLS HHD  Y  + CK  +   S  R  L+ +L
Sbjct: 361 EDVPLSGHHDSGYADRYCKQTEPLGSELRGCLTEHL 394

BLAST of CmoCh04G012760 vs. TAIR 10
Match: AT5G12010.1 (unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, plasma membrane, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G29780.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 148.7 bits (374), Expect = 1.0e-35
Identity = 93/324 (28.70%), Positives = 164/324 (50.62%), Query Frame = 0

Query: 38  WWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSL 97
           WW+E S R+  P        F+  F++S+ TF  IC  +  A+  + +   +     + +
Sbjct: 161 WWEECS-RLDYP-----EEDFKKAFRMSKSTFELICDELNSAVAKEDTALRN----AIPV 220

Query: 98  NDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGI-RHLSWPSTEEDM 157
             +VAV + RL +GE L  +   FG+  S+  ++     +A+++  + ++L WP  +E +
Sbjct: 221 RQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPD-DESL 280

Query: 158 DQIKSKFKKIRGLPNCCGVIETTHIMMTLPTTESAN-----GVWLDREKNCSMILQVIVD 217
             I+ +F+ + G+PN  G + TTHI +  P    A+         +++ + S+ +Q +V+
Sbjct: 281 RNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVN 340

Query: 218 PEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSESAELGEYIIGDSGF 277
           P+  F D+  GWPGS+ D  VLE S  ++R+ +G  L G             ++ G  G 
Sbjct: 341 PKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------WVAGGPGH 400

Query: 278 PLLPWLLTPYQGKGLADYQTEFNKRHFGTRLVARRALTRLKEMWKIIKGIMWKPDKHKLP 337
           PLL W+L PY  + L   Q  FN++    + VA+ A  RLK  W  ++    +     LP
Sbjct: 401 PLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQK-RTEVKLQDLP 460

Query: 338 RIILVCCLLHNIMIDVEDEMQDEM 356
            ++  CC+LHNI    E++M+ E+
Sbjct: 461 TVLGACCVLHNICEMREEKMEPEL 460

BLAST of CmoCh04G012760 vs. TAIR 10
Match: AT4G29780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519; Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 127.1 bits (318), Expect = 3.1e-29
Identity = 96/355 (27.04%), Positives = 167/355 (47.04%), Query Frame = 0

Query: 37  DWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLS 96
           DWWD  S+            +F   F++S+ TF+ IC  +   +  K +   D    P  
Sbjct: 198 DWWDRVSR------PDFPEDEFRREFRMSKSTFNLICEELDTTVTKKNTMLRDAIPAP-- 257

Query: 97  LNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGI-RHLSWPSTEED 156
              +V V + RL +G  L ++ + FG+  S+  ++      A+ +  + ++L WPS + +
Sbjct: 258 --KRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPS-DSE 317

Query: 157 MDQIKSKFKKIRGLPNCCGVIETTHIMMTLPTTESA-----NGVWLDREKNCSMILQVIV 216
           ++  K+KF+ +  +PN  G I TTHI +  P    A          +++ + S+ +Q +V
Sbjct: 318 INSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVV 377

Query: 217 DPEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSESAELGEYIIGDSG 276
           + +  F D+  G PGSL+D  +LE S          R    +  L +S     +I+G+SG
Sbjct: 378 NADGIFTDVCIGNPGSLTDDQILEKSSL-------SRQRAARGMLRDS-----WIVGNSG 437

Query: 277 FPLLPWLLTPYQGKGLADYQTEFNKRHFGTRLVARRALTRLKEMWKIIKGIMWKPDKHKL 336
           FPL  +LL PY  + L   Q  FN+     + +A  A  RLK  W  ++    +     L
Sbjct: 438 FPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQK-RTEVKLQDL 497

Query: 337 PRIILVCCLLHNIMIDVEDEMQDEMPLSHHHDPSYRQQSCKFVDNTASITREKLS 386
           P ++  CC+LHNI    ++EM  E+      D +  + + +    +A  TR+ +S
Sbjct: 498 PYVLGACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIR--SASAVNTRDHIS 526

BLAST of CmoCh04G012760 vs. TAIR 10
Match: AT1G72270.1 (CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G27010.1); Has 772 Blast hits to 657 proteins in 120 species: Archae - 0; Bacteria - 0; Metazoa - 344; Fungi - 94; Plants - 322; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )

HSP 1 Score: 95.5 bits (236), Expect = 1.0e-19
Identity = 80/313 (25.56%), Positives = 135/313 (43.13%), Query Frame = 0

Query: 42  FSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQV 101
           F++ +T       + ++   F++S+ TF  + S++  + +                    
Sbjct: 81  FNRFLTSATEDEDDPRWCLYFRMSKSTFFSLYSILSHSSL-----------------PSF 140

Query: 102 AVALRRLCSGESLSNIGDSFGMNQSS-VSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIK 161
           A  + RL  G S   +   FG + +S  S+  +   + + EK           + +D  K
Sbjct: 141 AATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEK---------LSQQLDDPK 200

Query: 162 SKFKKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDII 221
             F     LPNC GV+                G  L  +   S+++Q +VD   RF DI 
Sbjct: 201 PDFSP-NLLPNCYGVVGFGRF--------EVKGKLLGAKG--SILVQALVDSNGRFVDIS 260

Query: 222 TGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTP 281
            GWP ++    +   +  F  ++  E L+G   KL     +  YI+GDS  PLLPWL+TP
Sbjct: 261 AGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLVTP 320

Query: 282 YQ-GKGLADYQTEFNK-RHFGTRLVARRALTRLKEMWKIIKGIMWKPDKHK-LPRIILVC 341
           Y        ++ EFN   H G   V   A  +++  W+I+    WKP+  + +P +I   
Sbjct: 321 YDLTSDEESFREEFNNVVHTGLHSV-EIAFAKVRARWRILDK-KWKPETIEFMPFVITTG 352

Query: 342 CLLHNIMIDVEDE 351
           CLLHN +++  D+
Sbjct: 381 CLLHNFLVNSGDD 352
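
Each HSP summary line in the reports above gives identity and positives as a matched/aligned residue count followed by a percentage. The following is a minimal Python sketch, not part of the database itself, for parsing such a line and re-deriving the percentages from the counts. The function name parse_hsp_summary and the returned dictionary keys are hypothetical, chosen only for illustration.

```python
import re

# Matches e.g. "Identity = 389/394 (98.73%), Postives = 392/394 (99.49%), Query Frame = 0".
# "Posi?tives" accepts both spellings of the label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Parse one HSP summary line into counts and percentages (hypothetical helper)."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    id_n, id_len, pos_n, pos_len = int(id_n), int(id_len), int(pos_n), int(pos_len)
    return {
        "identity": (id_n, id_len),
        "positives": (pos_n, pos_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * id_n / id_len, 2),
        "positives_pct": round(100 * pos_n / pos_len, 2),
        # Percentages exactly as printed in the report.
        "reported_identity_pct": float(id_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 389/394 (98.73%), Postives = 392/394 (99.49%), Query Frame = 0"
    print(parse_hsp_summary(line))
```

Comparing the recomputed and reported percentages is a quick consistency check on a copied report; it says nothing about whether the alignment itself is biologically meaningful.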

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q9M2U3          4.9e-136  59.90     Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
Q94K49          8.4e-96   43.94     Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
Q6AZB8          4.1e-26   26.46     Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1
B0BN95          3.9e-24   25.68     Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1
Q17QR8          5.0e-24   25.00     Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1

Match Name      E-value   Identity  Description
A0A6J1GZA0      1.7e-229  100.00    protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111458235 PE=3 SV=1
A0A6J1K3E1      6.2e-227  98.73     protein ALP1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490769 PE=3 SV...
A0A0A0KS64      1.4e-210  91.37     DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE...
A0A1S3CEZ1      6.9e-210  91.12     putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=3 SV=1
A0A6J1CCK2      2.1e-206  90.38     protein ALP1-like OS=Momordica charantia OX=3673 GN=LOC111009982 PE=3 SV=1

Match Name      E-value   Identity  Description
XP_022956519.1  3.6e-229  100.00    protein ALP1-like [Cucurbita moschata]
KAG6601011.1    7.5e-227  99.24     Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]
XP_022995175.1  1.3e-226  98.73     protein ALP1-like isoform X1 [Cucurbita maxima]
XP_023545365.1  4.9e-226  98.22     protein ALP1-like [Cucurbita pepo subsp. pepo]
KAG7031625.1    1.2e-224  98.73     Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]

Match Name      E-value   Identity  Description
AT3G55350.1     3.5e-137  59.90     PIF / Ping-Pong family of plant transposases
AT3G63270.1     6.0e-97   43.94     CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G12010.1     1.0e-35   28.70     unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, ...
AT4G29780.1     3.1e-29   27.04     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G72270.1     1.0e-19   25.56     CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR0217...

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                Source   Source Term      Source Description  Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM     PF13359          DDE_Tnp_4           coord: 177..342; e-value: 5.6E-29; score: 100.8
None       No IPR available                               PANTHER  PTHR22930        UNCHARACTERIZED     coord: 1..390
None       No IPR available                               PANTHER  PTHR22930:SF205  PROTEIN ALP1-LIKE   coord: 1..390

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G012760.1  CmoCh04G012760.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0046872      metal ion binding