Cp4.1LG01g10530 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG01g10530
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: protein ALP1-like
Location: Cp4.1LG01: 6538968 .. 6542897 (-)
RNA-Seq Expression: Cp4.1LG01g10530
Synteny: Cp4.1LG01g10530
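
The Location field packs the sequence ID, start, end and strand into one string. The sketch below splits it and derives the span length. It assumes the coordinates are 1-based and inclusive, and `parse_location` is an illustrative helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a location such as 'Cp4.1LG01: 6538968 .. 6542897 (-)' into parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01: 6538968 .. 6542897 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3930 bp on the minus strand of Cp4.1LG01.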
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
TGTTGCTTCCCTCAACATGATGAATTGGCCAATGCATCACTTCTCTTTCTTTTTTACAACTTCCATTGCCTTCCTCTAAACTTCTATTCCATCTTCTTCCATTCATCATTGCATGCTTTGGAATCAAAACCAGAGGAAATACACACATACCCATCAATACCCATCATGTCTTTTTAACCCAAATACTTGAACCCCTCCTTTCCTTTTGAGTTATTGTTTCATAAATTTCGAATTTGAGCGTTTTTCTTTATGTGAATTCTATCGTATTCTTCGATTCTCTTCGTGGGTTCTTGTGGGTGTTTCTTCTTTTGCCTCATCTTTGCTTCTTTTTCGAGTTTCTAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGTAAACCACTTTTTGCTTCTCTTCTACTTGAATTTGTGCATTGTTATTGCTCTTAGTTGCTTTCTGTTTATTGGGTCTCTCCGTTGAATTCTTTTGCTGTTTGGTGAGCTTTTTGGACGGTGGTGGAATGTAGTGTTTAATAATCATATATCTTTGATGCCTGCAATTATAGAACCATGATATGTTCCTAATAACATTGAGAGGGTTAGAAGGGGCAGCCTGTTGAAGTGATCCTATGGGCATGGAGATGGATCAACCCTGTTTAGCAGCAGATATTAGCATAGATATGTTAGTTCATTTGATAATTGATGTGTACAAGTTGTGGCTACTCACTTAGCTGGTTTATAGAGTGTTAACTGTGTTGTGTTCTGATGTTCTTCTTCTTTTTCTCATTGAATTGTTGAAATATTAATTATTTCTTAGCTGCTCCTACTTTTTTGAGCTTTGGTCACTGTAACGGCCCCAATCCCACCGCTAGTAGATATTGTCCTCTTTCGGCTTTCCCTCAAAGTTTTTAAAACGTGTCTACTAGGGAGAGGTTTCCACACCCTTATAAAGAATGCTTCGTTCCCCTCTCTAACCGATGTGGGATCTCACAATCCACCCCCTCTGGGGCCCAGTGTCCTTGCGGGCACTCGTTCCCCTCTCCAATCGATGTGCGATCTCACGGCAATAAACATCTTTAGAGAATTGAGTCTAATGTGATCCAAAATGCTCGTTTTATTCTTTATTGATTCGTATTAAAGATATCACTCTGAATATATAGTACCTCGAAGTTAGAATCAGTCATCTGTTGACATTTGTAAGAAGGCAGTGTCTGAGAAATCGAAGATGAAGAAGAAGGGCGGGGGATGAACAATATTCAACTTCAAGTATGTGTTTTTTATGGTCTGCTGTTGTTTATGCATAGAAATGGAAGGATGACTATAAATATCTTTATCTGGTTGTTTTAGAGGCATCTTGGAAACGGGTATGTTTAAGTTTCGATTCGAGTCAATGTAGTAATGCTACAATGCATAGAAGTTCATAGAAAATGGATCTCTTCTAGGTATTATAAGATTTGTTTCAACCTACTGATATGGCTAAGTGCTCAATATTTGATTTTGGTACTCCAGAGGACCTAACAACATGATCAAAGTGGGTAGGTTTTGTTAGATGTTAGTAAAGCAAGAAGACTAGATGAGCTTTTAACGCATGCTGGAGTCTAGTTTCACTAGATATCAGTTCTTATATTTTGATATTCCATGGTTCTGAAATGTTGGGTACACATTTAGCAGTTACTTTGTTCTGGCCATGATTTATGATGCAACAAGTAAATCTTTACTTTTAGCTTAAAACCAACGGATTTGACTAAATATGTGGACCTATGTGTAATAGCCCAAACTCACTACTAGTAGATACTGTCTTCTTTGGGATACCCTCTAGGTTTTAAAACGCGTATGTTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTCGTTTCCCTCTCCAATCAATGTGGGGTCTCACAATCCA
CTTCCCTTGGGGCCCAACATCTCGTTGGCACACCGTCCGGTGCATGACTCTAATACCATTTGTACCAGCTCAAGCCCACCACTAGTAGATATTATCTTCTTTGGGCTTTCCCTTTCGAGCTTCTCCTCAAGGTTTTAAAACGTATCTATTAGGGAGAGGTTTTTACACTCTTATAATGCTTCGTTCCCCTCTCTAACCAATGTGGAATCTCACAATTCACTCCTCTTGGGCGCCCAGTGTCCTCGTTCGCACACCGCTCGGTGCCTGACTCTGATACCATTTGTAATAGTCTAAGCTCACCGCTAGCAAATATTGTCTTCTTTGGACTTTTTCTTCCAGACTTCCTCTCAAAGTTTTAAAACGCGTATGTTCAGGAGAGATTTCACACCCTTGTAAAGAATGTTTTGTTCCCATCTCCAACCAATGTAGGATCTCATACAATGTGTTTGTCGAATTTTATCGTTGCATCGAGTATAAGTACAGCATTTCTTCTGGCGTTATGCTCACAAAATTTTACTTTATCTTTGTTCGGTGATCAATGAGTTGGATATATTGATCTGATATTGTCAGTTGTTCTTTCTGTCTGTCATATCTGAAAAAGTACAATGCCTTTTCTTAGTAGCAGAACATTTGCAGCAGGGACTGATTATGAGTTTTATCCTTGTTTTTGCCTCCAATCAGGACCATTATCTCAGTCCAAGAACACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAAACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGATTTAAACGGCAAGCCTTTGTCTCTAAACGACCAAGTCGCCGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATTGGTGACTCATTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCAATGGAAGAGAAAGGCATCCGCCATCTCTCGTGGCCTTCGACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCCAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCAAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAACTCTCAGAAAATTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGCTTTCCACTGTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCAGTACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAATGATCGATATGGAGGACGAGATGCAAGACGAAATGCCGTTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGGTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCTTAAGAGAATCTCGTCGTGCACGAAAACATTCTTTTTTTTCTTTTCTTTTTGATAGATTGATCTGTGCTGCTGTTTATTCAAATTGCCCAAATTGTTGTTCACTCATTTAGATGCTATTGATAGTTCATTATAGATATCTCTGGTGATTATATGGTCTGCTAAATCTAAACAGATAGATACCCAGTTTCCTTCTCTACAAACATTATGCT

mRNA sequence

TGTTGCTTCCCTCAACATGATGAATTGGCCAATGCATCACTTCTCTTTCTTTTTTACAACTTCCATTGCCTTCCTCTAAACTTCTATTCCATCTTCTTCCATTCATCATTGCATGCTTTGGAATCAAAACCAGAGGAAATACACACATACCCATCAATACCCATCATGTCTTTTTAACCCAAATACTTGAACCCCTCCTTTCCTTTTGAGTTATTGTTTCATAAATTTCGAATTTGAGCGTTTTTCTTTATGTGAATTCTATCGTATTCTTCGATTCTCTTCGTGGGTTCTTGTGGGTGTTTCTTCTTTTGCCTCATCTTTGCTTCTTTTTCGAGTTTCTAATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGACCATTATCTCAGTCCAAGAACACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAAACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGATTTAAACGGCAAGCCTTTGTCTCTAAACGACCAAGTCGCCGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATTGGTGACTCATTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCAATGGAAGAGAAAGGCATCCGCCATCTCTCGTGGCCTTCGACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCCAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCAAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAACTCTCAGAAAATTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGCTTTCCACTGTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCAGTACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAATGATCGATATGGAGGACGAGATGCAAGACGAAATGCCGTTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGGTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCTTAAGAGAATCTCGTCGTGCACGAAAACATTCTTTTTTTTCTTTTCTTTTTGATAGATTGATCTGTGCTGCTGTTTATTCAAATTGCCCAAATTGTTGTTCACTCATTTAGATGCTATTGATAGTTCATTATAGATATCTCTGGTGATTATATGGTCTGCTAAATCTAAACAGATAGATACCCAGTTTCCTTCTCTACAAACATTATGCT

Coding sequence (CDS)

ATGGGACCCATTAGAGGGTTCAAGAGGAAGAAGCAGAAGAAGGCACAGAAAAAGGTTGTCCAATATGTCTTCGCTGCTGCTTCACTGTCGTTTCAGCCACAGCCCTTGGATTGGTGGGATGAGTTCTCACAGAGGATTACTGGACCATTATCTCAGTCCAAGAACACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAAACATTCAGCTATATCTGTTCTCTTGTTAAGGAAGCTATGATGGCTAAAACCTCAAATTTTACCGATTTAAACGGCAAGCCTTTGTCTCTAAACGACCAAGTCGCCGTTGCTCTTAGGCGACTTTGCTCCGGTGAATCATTATCAAATATTGGTGACTCATTTGGAATGAATCAATCATCAGTTTCTCAAATAACATGGCGTTTTGTGGAGGCAATGGAAGAGAAAGGCATCCGCCATCTCTCGTGGCCTTCGACAGAAGAAGATATGGATCAGATAAAGTCCAAGTTCCAGAAAATCAGAGGTCTACCTAATTGTTGCGGTGTAATCGAAACGACGCACATTATGATGACTTTGCCAACGACAGAATCTGCAAATGGCGTCTGGCTTGATCGCGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGATCCAGAAATGAGATTCTGTGACATCATAACAGGTTGGCCAGGAAGCTTAAGCGATGCACTCGTGCTTGAAAGCTCAGGTTTTTTCAAACGTTCACAAGATGGCGAGCGGTTGAACGGAAAGAAGATGAAACTCTCAGAAAATTCAGAACTAGGAGAGTATATTATAGGAGACTCTGGCTTTCCACTGTTGCCATGGCTACTAACTCCTTACCAAGGGAAAGGCCTTGCAGATTACCAGACCGAGTTCAATAAGCGCCATTTCAGTACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGATAATGTGGAAGCCTGATAAACATAAGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCACAATATAATGATCGATATGGAGGACGAGATGCAAGACGAAATGCCGTTGTCTCATCATCACGATCCGAGTTACCGACAACAAAGTTGCAAATTCGTCGACAATACTGGTTCGATCACGAGGGAGAAACTTTCCATGTACTTATCTGAAAAGTTAACACCTTAA

Protein sequence

MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP
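
The protein sequence is the translation of the CDS. The sketch below shows how the two can be cross-checked, assuming the standard genetic code; `translate` is an illustrative helper, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGGGACCCATTAGAGGGTTCAAGAGG"))  # MGPIRGFKR
# Last six codons of the CDS, including the TAA stop.
print(translate("GAAAAGTTAACACCTTAA"))           # EKLTP
```

Passing the full CDS string to `translate` would allow a comparison against the complete protein sequence.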
Homology
BLAST of Cp4.1LG01g10530 vs. ExPASy Swiss-Prot
Match: Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 2.0e-137
Identity = 247/409 (60.39%), Positives = 301/409 (73.59%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLS------------------FQPQPLDWWDEF 60
           MGPI+  K+K  K+A+KKV + V  AA+ +                     Q LDWWD F
Sbjct: 1   MGPIKTIKKK--KRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGF 60

Query: 61  SQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVA 120
           S+RI G  +  K   FESVFKISRKTF YICSLVK    AK +NF+D NG PLSLND+VA
Sbjct: 61  SRRIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVA 120

Query: 121 VALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSK 180
           VALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ I HLSWPS    +D+IKSK
Sbjct: 121 VALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSK 180

Query: 181 FQKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITG 240
           F+KI GLPNCCG I+ THI+M LP  E +N VWLD EKN SM LQ +VDP+MRF D+I G
Sbjct: 181 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAG 240

Query: 241 WPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQ 300
           WPGSL+D +VL++SGF+K  + G+RLNG+K+ LSE +EL EYI+GDSGFPLLPWLLTPYQ
Sbjct: 241 WPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQ 300

Query: 301 GKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHN 360
           GK  +  QTEFNKRH      AQ AL++LK+ W+II G+MW PD+++LPRII VCCLLHN
Sbjct: 301 GKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHN 360

Query: 361 IMIDMEDEMQDEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEK 392
           I+IDMED+  D+ PLS  HD +YRQ+SCK  D   S+ R++LS  L  K
Sbjct: 361 IIIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402
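
The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the raw counts; `hsp_percentages` is an illustrative helper, and the pattern also tolerates the misspelling "Postives" that some report exports contain.

```python
import re

# Matches e.g. "Identity = 247/409 (60.39%), Positives = 301/409 (73.59%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, m.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

line = "Identity = 247/409 (60.39%), Positives = 301/409 (73.59%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Q9M2U3 hit this reproduces the reported 60.39% identity and 73.59% positives over 409 aligned columns.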

BLAST of Cp4.1LG01g10530 vs. ExPASy Swiss-Prot
Match: Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 354.0 bits (907), Expect = 2.2e-96
Identity = 174/396 (43.94%), Positives = 261/396 (65.91%), Query Frame = 0

Query: 1   MGPIRGFKRKKQ------KKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGP-LSQS 60
           M P++  K+ K+      KK  K   +    A  L  +    DWWD F  R + P +   
Sbjct: 1   MAPVKQKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSD 60

Query: 61  KNTKFESVFKISRKTFSYICSLVKEAMMAK-TSNFTDLNGKPLSLNDQVAVALRRLCSGE 120
           ++  F+  F+ S+ TFSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+
Sbjct: 61  EDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGD 120

Query: 121 SLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNC 180
           S  ++G +FG+ QS+VSQ+TWRF+EA+EE+   HL WP ++  +++IKSKF+++ GLPNC
Sbjct: 121 SQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNC 180

Query: 181 CGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALV 240
           CG I+TTHI+MTLP  ++++  W D+EKN SM LQ + D EMRF +++TGWPG ++ + +
Sbjct: 181 CGAIDTTHIIMTLPAVQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKL 240

Query: 241 LESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTE 300
           L+ SGFFK  ++ + L+G    LS+ +++ EY++G   +PLLPWL+TP+     +D    
Sbjct: 241 LKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVA 300

Query: 301 FNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQ 360
           FN+RH   R VA  A  +LK  W+I+  +MW+PD+ KLP IILVCCLLHNI+ID  D +Q
Sbjct: 301 FNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQ 360

Query: 361 DEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYL 389
           +++PLS HHD  Y  + CK  +  GS  R  L+ +L
Sbjct: 361 EDVPLSGHHDSGYADRYCKQTEPLGSELRGCLTEHL 394

BLAST of Cp4.1LG01g10530 vs. ExPASy Swiss-Prot
Match: Q6AZB8 (Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.7e-27
Identity = 78/291 (26.80%), Positives = 145/291 (49.83%), Query Frame = 0

Query: 60  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 119
           + F   R+   Y+  L+K++++ +T        + +S + Q+  AL    SG   S +GD
Sbjct: 37  NTFGFPREFIYYLVELLKDSLLRRTQ-----RSRAISPDVQILAALGFYTSGSFQSKMGD 96

Query: 120 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETT 179
           + G++Q+S+S+      +A+ EK    + +   E    Q K +F +I G+PN  GV++  
Sbjct: 97  AIGISQASMSRCVSNVTKALIEKAPEFIGFTRDEATKQQFKDEFYRIAGIPNVTGVVDCA 156

Query: 180 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 239
           HI +  P  + ++  +++++   S+  Q++ D         T WPGSL+D  V + S   
Sbjct: 157 HIAIKAPNADDSS--YVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVA 216

Query: 240 KRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQG-KGLADYQTEFNKRHF 299
           K  ++            EN + G +++GD+ +PL  WL+TP Q  +  ADY+  +N  H 
Sbjct: 217 KLFEE-----------QENDDEG-WLLGDNRYPLKKWLMTPVQSPESPADYR--YNLAHT 276

Query: 300 STRLVAQRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMI 346
           +T  +  R    ++  ++ + G    + + P+  K   II  CC+LHNI +
Sbjct: 277 TTHEIVDRTFRAIQTRFRCLDGAKGYLQYSPE--KCSHIIQACCVLHNISL 304

BLAST of Cp4.1LG01g10530 vs. ExPASy Swiss-Prot
Match: Q8BR93 (Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 1.0e-24
Identity = 83/300 (27.67%), Positives = 144/300 (48.00%), Query Frame = 0

Query: 62  FKISRKTFSYICSL----------VKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSG 121
           FK+   T  Y+ S+          + E + A  S  T    + +S   Q+  AL    SG
Sbjct: 25  FKLDDVTDEYLMSMYGFPRQFIYFLVELLGASLSRPTQ-RSRAISPETQILAALGFYTSG 84

Query: 122 ESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPN 181
              + +GD+ G++Q+S+S+      EA+ E+  + + +P  E  +  +K +F  + G+P 
Sbjct: 85  SFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPVDEAAVQSLKDEFYGLAGMPG 144

Query: 182 CCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDAL 241
             GV +  H+ +  P  E  +  +++R+   S+   V+ D       + T WPGSL D  
Sbjct: 145 VIGVADCIHVAIKAPNAEDLS--YVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCA 204

Query: 242 VLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQ-GKGLADYQ 301
           VL+ S    + + G         + ++S    +++GDS F L  WLLTP    +  A+Y+
Sbjct: 205 VLQRSSLTSQFETG---------MPKDS----WLLGDSSFFLRSWLLTPLPIPETAAEYR 264

Query: 302 TEFNKRHFSTRLVAQRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMID 347
             +N+ H +T  V +R L  L   ++ + G    + + P+  K   IIL CC+LHNI +D
Sbjct: 265 --YNRAHSATHSVIERTLQTLCCRFRCLDGSKGALQYSPE--KCSHIILACCVLHNISLD 304

BLAST of Cp4.1LG01g10530 vs. ExPASy Swiss-Prot
Match: B0BN95 (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.3e-24
Identity = 76/292 (26.03%), Positives = 144/292 (49.32%), Query Frame = 0

Query: 60  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 119
           S++   R+   Y+  L+  ++   T        + +S   Q+  AL    SG   + +GD
Sbjct: 37  SMYGFPRQFIYYLVELLGASLSRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGD 96

Query: 120 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETT 179
           + G++Q+S+S+      EA+ E+  + + +P+ E  +  +K +F  + G+P   G ++  
Sbjct: 97  AIGISQASMSRCVANVTEALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCI 156

Query: 180 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 239
           H+ +  P  E  +  +++R+   S+   V+ D       + T WPGSL D  VL+ S   
Sbjct: 157 HVAIKAPNAEDLS--YVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSLS 216

Query: 240 KRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTP-YQGKGLADYQTEFNKRHF 299
            + + G         + ++S    +++GDS F L  WLLTP +  +  A+Y+  +N+ H 
Sbjct: 217 SQFETG---------MPKDS----WLLGDSSFFLHTWLLTPLHIPETPAEYR--YNRAHS 276

Query: 300 STRLVAQRALTRLKEMWKIIKG----IMWKPDKHKLPRIILVCCLLHNIMID 347
           +T  V ++ L  L   ++ + G    + + P+K     IIL CC+LHNI ++
Sbjct: 277 ATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKSS--HIILACCVLHNISLE 304

BLAST of Cp4.1LG01g10530 vs. NCBI nr
Match: XP_023545365.1 (protein ALP1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 803 bits (2074), Expect = 1.53e-293
Identity = 394/394 (100.00%), Positives = 394/394 (100.00%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
           RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST
Sbjct: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394
           HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394

BLAST of Cp4.1LG01g10530 vs. NCBI nr
Match: XP_022995175.1 (protein ALP1-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 795 bits (2053), Expect = 2.43e-290
Identity = 390/394 (98.98%), Positives = 392/394 (99.49%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKV QYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVQQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
           RSQDGERLNGKKMKLSE+SELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST
Sbjct: 241 RSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394
           HDPSYRQQSCKFVDNT SITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of Cp4.1LG01g10530 vs. NCBI nr
Match: XP_022956519.1 (protein ALP1-like [Cucurbita moschata])

HSP 1 Score: 791 bits (2044), Expect = 5.73e-289
Identity = 387/394 (98.22%), Positives = 392/394 (99.49%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
           RSQDGERLNGKKMKLSE++ELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHF T
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMID+EDEMQDEMPLSHH
Sbjct: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394
           HDPSYRQQSCKFVDNT SITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of Cp4.1LG01g10530 vs. NCBI nr
Match: KAG6601011.1 (Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 790 bits (2039), Expect = 3.31e-288
Identity = 387/393 (98.47%), Positives = 390/393 (99.24%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQ KNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQFKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
           RSQDGERLNGKKMKLSE++ELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHF T
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLT 393
           HDPSYRQQSCKFVDNT SITREKLSMYLSEKLT
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLT 393

BLAST of Cp4.1LG01g10530 vs. NCBI nr
Match: KAG7031625.1 (Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 782 bits (2020), Expect = 2.51e-285
Identity = 385/393 (97.96%), Positives = 389/393 (98.98%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQ KNTKF+ 
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQFKNTKFD- 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
           RSQDGERLNGKKMKLSE++ELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHF T
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLT 393
           HDPSYRQQSCKFVDNT SITREKLSMYLSEKLT
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLT 392

BLAST of Cp4.1LG01g10530 vs. ExPASy TrEMBL
Match: A0A6J1K3E1 (protein ALP1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490769 PE=3 SV=1)

HSP 1 Score: 795 bits (2053), Expect = 1.18e-290
Identity = 390/394 (98.98%), Positives = 392/394 (99.49%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKV QYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVQQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
           RSQDGERLNGKKMKLSE+SELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST
Sbjct: 241 RSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394
           HDPSYRQQSCKFVDNT SITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of Cp4.1LG01g10530 vs. ExPASy TrEMBL
Match: A0A6J1GZA0 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111458235 PE=3 SV=1)

HSP 1 Score: 791 bits (2044), Expect = 2.77e-289
Identity = 387/394 (98.22%), Positives = 392/394 (99.49%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK
Sbjct: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
           RSQDGERLNGKKMKLSE++ELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHF T
Sbjct: 241 RSQDGERLNGKKMKLSESAELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFGT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVA+RALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMID+EDEMQDEMPLSHH
Sbjct: 301 RLVARRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDVEDEMQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394
           HDPSYRQQSCKFVDNT SITREKLSMYLSEKLTP
Sbjct: 361 HDPSYRQQSCKFVDNTASITREKLSMYLSEKLTP 394

BLAST of Cp4.1LG01g10530 vs. ExPASy TrEMBL
Match: A0A0A0KS64 (DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE=3 SV=1)

HSP 1 Score: 739 bits (1908), Expect = 1.38e-268
Identity = 360/394 (91.37%), Positives = 379/394 (96.19%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKK  K +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKK--KVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IKSKF+KIRGLPNCCGV+ETTH
Sbjct: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVVETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFK
Sbjct: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
            SQDGERLNGKKMKLSE+SELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF+T
Sbjct: 241 LSQDGERLNGKKMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394
           HDPSYRQQSC+FVDNT SI+REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCEFVDNTASISREKLSMYLSGKLPP 392

BLAST of Cp4.1LG01g10530 vs. ExPASy TrEMBL
Match: A0A1S3CEZ1 (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=3 SV=1)

HSP 1 Score: 737 bits (1902), Expect = 1.13e-267
Identity = 359/394 (91.12%), Positives = 378/394 (95.94%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKNTKFES 60
           MGPIRGFKRKK  K +KKV Q VFA+ASLS Q QPLDWWDEFSQRITGPLSQSKNTKFES
Sbjct: 1   MGPIRGFKRKK--KVEKKVDQNVFASASLSSQLQPLDWWDEFSQRITGPLSQSKNTKFES 60

Query: 61  VFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120
           VFKISRKTFSYICSLVKE MMAKTS+FTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS
Sbjct: 61  VFKISRKTFSYICSLVKEVMMAKTSSFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGDS 120

Query: 121 FGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETTH 180
           FG+NQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMD+IKSKF+KIRGLPNCCGVIETTH
Sbjct: 121 FGLNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDKIKSKFKKIRGLPNCCGVIETTH 180

Query: 181 IMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFFK 240
           IMMTLPT+ESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSD+LVL+SSGFFK
Sbjct: 181 IMMTLPTSESANGIWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDSLVLQSSGFFK 240

Query: 241 RSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFST 300
            SQDGERLNGKKM+LSE+SELGEYIIGDSGFPLLPWLLTPYQGKGL DYQ EFNKRHF+T
Sbjct: 241 LSQDGERLNGKKMRLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLPDYQAEFNKRHFAT 300

Query: 301 RLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSHH 360
           RLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSHH
Sbjct: 301 RLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSHH 360

Query: 361 HDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394
           HDPSYRQQSC+FVDNT SI REKLSMYLS KL P
Sbjct: 361 HDPSYRQQSCEFVDNTASIAREKLSMYLSGKLPP 392

BLAST of Cp4.1LG01g10530 vs. ExPASy TrEMBL
Match: A0A6J1CCK2 (protein ALP1-like OS=Momordica charantia OX=3673 GN=LOC111009982 PE=3 SV=1)

HSP 1 Score: 725 bits (1872), Expect = 4.38e-263
Identity = 357/395 (90.38%), Positives = 374/395 (94.68%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGPLSQSKN-TKFE 60
           MGPIRGFKRKK  KA+KKV Q V AAASLS QPQPLDWWD+FSQRITGPLSQSKN TKFE
Sbjct: 1   MGPIRGFKRKK--KAEKKVDQNVLAAASLSSQPQPLDWWDDFSQRITGPLSQSKNPTKFE 60

Query: 61  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVAVALRRLCSGESLSNIGD 120
           SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLS+NDQVAVALRRL SGESLS IGD
Sbjct: 61  SVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSVNDQVAVALRRLSSGESLSIIGD 120

Query: 121 SFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNCCGVIETT 180
           SFGMNQSSVSQITWRFVEAMEEKG+ HLSWPSTEEDMDQIKSKF+KI+GLPNCCGVIETT
Sbjct: 121 SFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSTEEDMDQIKSKFKKIKGLPNCCGVIETT 180

Query: 181 HIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLESSGFF 240
           HIMMTLPT ES NGVWLDREKNCSMILQVIVDPEMRFCDI+ GWPGSLSDALVL+SSGFF
Sbjct: 181 HIMMTLPTAESXNGVWLDREKNCSMILQVIVDPEMRFCDIMAGWPGSLSDALVLQSSGFF 240

Query: 241 KRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTEFNKRHFS 300
           K SQDGERLNGK MKLSE+SELGEYIIGDSGFPLLPWLLTPYQGKGL+DYQTEFNKRH++
Sbjct: 241 KLSQDGERLNGKNMKLSESSELGEYIIGDSGFPLLPWLLTPYQGKGLSDYQTEFNKRHYA 300

Query: 301 TRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQDEMPLSH 360
           TRLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDMEDE+QDEMPLSH
Sbjct: 301 TRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMEDEVQDEMPLSH 360

Query: 361 HHDPSYRQQSCKFVDNTGSITREKLSMYLSEKLTP 394
           HHD  YRQQSCKFVDNT S+ REKLSMYLS KL P
Sbjct: 361 HHDSGYRQQSCKFVDNTASVVREKLSMYLSGKLPP 393
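
The percentages in an HSP header follow from the two fractions printed next to them. The sketch below, which is an illustration and not part of the original record, parses one such header line and recomputes both percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern is written to accept either spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 357/395 (90.38%), Positives = 374/395 (94.68%)".
# "Posi?tives" tolerates the misspelling "Postives" seen in some page dumps.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse identity/positive counts from one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positive_pct_reported": float(pos_pct),
        # Recomputed from the fractions, rounded like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
    }


# Header line of the Momordica charantia (A0A6J1CCK2) HSP above.
stats = parse_hsp_stats(
    "Identity = 357/395 (90.38%), Positives = 374/395 (94.68%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the A0A6J1CCK2 hit, the recomputed values (357/395 and 374/395) round to the reported 90.38% and 94.68%.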

BLAST of Cp4.1LG01g10530 vs. TAIR 10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases)

HSP 1 Score: 490.3 bits (1261), Expect = 1.4e-138
Identity = 247/409 (60.39%), Positives = 301/409 (73.59%), Query Frame = 0

Query: 1   MGPIRGFKRKKQKKAQKKVVQYVFAAASLS------------------FQPQPLDWWDEF 60
           MGPI+  K+K  K+A+KKV + V  AA+ +                     Q LDWWD F
Sbjct: 1   MGPIKTIKKK--KRAEKKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGF 60

Query: 61  SQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQVA 120
           S+RI G  +  K   FESVFKISRKTF YICSLVK    AK +NF+D NG PLSLND+VA
Sbjct: 61  SRRIYGGSTDPKT--FESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVA 120

Query: 121 VALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSK 180
           VALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ I HLSWPS    +D+IKSK
Sbjct: 121 VALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPS---KLDEIKSK 180

Query: 181 FQKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITG 240
           F+KI GLPNCCG I+ THI+M LP  E +N VWLD EKN SM LQ +VDP+MRF D+I G
Sbjct: 181 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDPDMRFLDVIAG 240

Query: 241 WPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQ 300
           WPGSL+D +VL++SGF+K  + G+RLNG+K+ LSE +EL EYI+GDSGFPLLPWLLTPYQ
Sbjct: 241 WPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQ 300

Query: 301 GKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHN 360
           GK  +  QTEFNKRH      AQ AL++LK+ W+II G+MW PD+++LPRII VCCLLHN
Sbjct: 301 GKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHN 360

Query: 361 IMIDMEDEMQDEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYLSEK 392
           I+IDMED+  D+ PLS  HD +YRQ+SCK  D   S+ R++LS  L  K
Sbjct: 361 IIIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 402

BLAST of Cp4.1LG01g10530 vs. TAIR 10
Match: AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 354.0 bits (907), Expect = 1.6e-97
Identity = 174/396 (43.94%), Positives = 261/396 (65.91%), Query Frame = 0

Query: 1   MGPIRGFKRKKQ------KKAQKKVVQYVFAAASLSFQPQPLDWWDEFSQRITGP-LSQS 60
           M P++  K+ K+      KK  K   +    A  L  +    DWWD F  R + P +   
Sbjct: 1   MAPVKQKKKNKKKPLDKAKKLAKNKEKKRVNAVPLDPEAIDCDWWDTFWLRNSSPSVPSD 60

Query: 61  KNTKFESVFKISRKTFSYICSLVKEAMMAK-TSNFTDLNGKPLSLNDQVAVALRRLCSGE 120
           ++  F+  F+ S+ TFSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL SG+
Sbjct: 61  EDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGD 120

Query: 121 SLSNIGDSFGMNQSSVSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIKSKFQKIRGLPNC 180
           S  ++G +FG+ QS+VSQ+TWRF+EA+EE+   HL WP ++  +++IKSKF+++ GLPNC
Sbjct: 121 SQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDR-IEEIKSKFEEMYGLPNC 180

Query: 181 CGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALV 240
           CG I+TTHI+MTLP  ++++  W D+EKN SM LQ + D EMRF +++TGWPG ++ + +
Sbjct: 181 CGAIDTTHIIMTLPAVQASDD-WCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKL 240

Query: 241 LESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTPYQGKGLADYQTE 300
           L+ SGFFK  ++ + L+G    LS+ +++ EY++G   +PLLPWL+TP+     +D    
Sbjct: 241 LKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMVA 300

Query: 301 FNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLPRIILVCCLLHNIMIDMEDEMQ 360
           FN+RH   R VA  A  +LK  W+I+  +MW+PD+ KLP IILVCCLLHNI+ID  D +Q
Sbjct: 301 FNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQ 360

Query: 361 DEMPLSHHHDPSYRQQSCKFVDNTGSITREKLSMYL 389
           +++PLS HHD  Y  + CK  +  GS  R  L+ +L
Sbjct: 361 EDVPLSGHHDSGYADRYCKQTEPLGSELRGCLTEHL 394

BLAST of Cp4.1LG01g10530 vs. TAIR 10
Match: AT5G12010.1 (unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, plasma membrane, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G29780.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 149.4 bits (376), Expect = 5.9e-36
Identity = 93/324 (28.70%), Positives = 164/324 (50.62%), Query Frame = 0

Query: 38  WWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSL 97
           WW+E S R+  P        F+  F++S+ TF  IC  +  A+  + +   +     + +
Sbjct: 161 WWEECS-RLDYP-----EEDFKKAFRMSKSTFELICDELNSAVAKEDTALRN----AIPV 220

Query: 98  NDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGI-RHLSWPSTEEDM 157
             +VAV + RL +GE L  +   FG+  S+  ++     +A+++  + ++L WP  +E +
Sbjct: 221 RQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPD-DESL 280

Query: 158 DQIKSKFQKIRGLPNCCGVIETTHIMMTLPTTESAN-----GVWLDREKNCSMILQVIVD 217
             I+ +F+ + G+PN  G + TTHI +  P    A+         +++ + S+ +Q +V+
Sbjct: 281 RNIRERFESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVN 340

Query: 218 PEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGF 277
           P+  F D+  GWPGS+ D  VLE S  ++R+ +G  L G             ++ G  G 
Sbjct: 341 PKGVFTDLCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGM------------WVAGGPGH 400

Query: 278 PLLPWLLTPYQGKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKLP 337
           PLL W+L PY  + L   Q  FN++    + VA+ A  RLK  W  ++    +     LP
Sbjct: 401 PLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLKGRWACLQK-RTEVKLQDLP 460

Query: 338 RIILVCCLLHNIMIDMEDEMQDEM 356
            ++  CC+LHNI    E++M+ E+
Sbjct: 461 TVLGACCVLHNICEMREEKMEPEL 460

BLAST of Cp4.1LG01g10530 vs. TAIR 10
Match: AT4G29780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519; Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 129.0 bits (323), Expect = 8.2e-30
Identity = 88/325 (27.08%), Positives = 153/325 (47.08%), Query Frame = 0

Query: 37  DWWDEFSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLS 96
           DWWD  S+            +F   F++S+ TF+ IC  +   +  K +   D    P  
Sbjct: 198 DWWDRVSR------PDFPEDEFRREFRMSKSTFNLICEELDTTVTKKNTMLRDAIPAP-- 257

Query: 97  LNDQVAVALRRLCSGESLSNIGDSFGMNQSSVSQITWRFVEAMEEKGI-RHLSWPSTEED 156
              +V V + RL +G  L ++ + FG+  S+  ++      A+ +  + ++L WPS + +
Sbjct: 258 --KRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLWPS-DSE 317

Query: 157 MDQIKSKFQKIRGLPNCCGVIETTHIMMTLPTTESA-----NGVWLDREKNCSMILQVIV 216
           ++  K+KF+ +  +PN  G I TTHI +  P    A          +++ + S+ +Q +V
Sbjct: 318 INSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSITVQGVV 377

Query: 217 DPEMRFCDIITGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSG 276
           + +  F D+  G PGSL+D  +LE S               + + +       +I+G+SG
Sbjct: 378 NADGIFTDVCIGNPGSLTDDQILEKSSL------------SRQRAARGMLRDSWIVGNSG 437

Query: 277 FPLLPWLLTPYQGKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHKL 336
           FPL  +LL PY  + L   Q  FN+     + +A  A  RLK  W  ++    +     L
Sbjct: 438 FPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQK-RTEVKLQDL 497

Query: 337 PRIILVCCLLHNIMIDMEDEMQDEM 356
           P ++  CC+LHNI    ++EM  E+
Sbjct: 498 PYVLGACCVLHNICEMRKEEMLPEL 498

BLAST of Cp4.1LG01g10530 vs. TAIR 10
Match: AT1G72270.1 (CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G27010.1); Has 772 Blast hits to 657 proteins in 120 species: Archae - 0; Bacteria - 0; Metazoa - 344; Fungi - 94; Plants - 322; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )

HSP 1 Score: 95.1 bits (235), Expect = 1.3e-19
Identity = 77/312 (24.68%), Positives = 134/312 (42.95%), Query Frame = 0

Query: 42  FSQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEAMMAKTSNFTDLNGKPLSLNDQV 101
           F++ +T       + ++   F++S+ TF  + S++  + +                    
Sbjct: 81  FNRFLTSATEDEDDPRWCLYFRMSKSTFFSLYSILSHSSL-----------------PSF 140

Query: 102 AVALRRLCSGESLSNIGDSFGMNQSS-VSQITWRFVEAMEEKGIRHLSWPSTEEDMDQIK 161
           A  + RL  G S   +   FG + +S  S+  +   + + EK           + +D  K
Sbjct: 141 AATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEK---------LSQQLDDPK 200

Query: 162 SKFQKIRGLPNCCGVIETTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDII 221
             F     LPNC GV+                G  L  +   S+++Q +VD   RF DI 
Sbjct: 201 PDFSP-NLLPNCYGVVGFGRF--------EVKGKLLGAKG--SILVQALVDSNGRFVDIS 260

Query: 222 TGWPGSLSDALVLESSGFFKRSQDGERLNGKKMKLSENSELGEYIIGDSGFPLLPWLLTP 281
            GWP ++    +   +  F  ++  E L+G   KL     +  YI+GDS  PLLPWL+TP
Sbjct: 261 AGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLVTP 320

Query: 282 YQ-GKGLADYQTEFNKRHFSTRLVAQRALTRLKEMWKIIKGIMWKPDKHK-LPRIILVCC 341
           Y        ++ EFN    +     + A  +++  W+I+    WKP+  + +P +I   C
Sbjct: 321 YDLTSDEESFREEFNNVVHTGLHSVEIAFAKVRARWRILDK-KWKPETIEFMPFVITTGC 352

Query: 342 LLHNIMIDMEDE 351
           LLHN +++  D+
Sbjct: 381 LLHNFLVNSGDD 352

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
Q9M2U3      2.0e-137  60.39         Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
Q94K49      2.2e-96   43.94         Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
Q6AZB8      1.7e-27   26.80         Putative nuclease HARBI1 OS=Danio rerio OX=7955 GN=harbi1 PE=2 SV=1
Q8BR93      1.0e-24   27.67         Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1
B0BN95      1.3e-24   26.03         Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1

Match Name      E-value    Identity (%)  Description
XP_023545365.1  1.53e-293  100.00        protein ALP1-like [Cucurbita pepo subsp. pepo]
XP_022995175.1  2.43e-290  98.98         protein ALP1-like isoform X1 [Cucurbita maxima]
XP_022956519.1  5.73e-289  98.22         protein ALP1-like [Cucurbita moschata]
KAG6601011.1    3.31e-288  98.47         Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]
KAG7031625.1    2.51e-285  97.96         Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]

Match Name  E-value    Identity (%)  Description
A0A6J1K3E1  1.18e-290  98.98         protein ALP1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490769 PE=3 SV...
A0A6J1GZA0  2.77e-289  98.22         protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111458235 PE=3 SV=1
A0A0A0KS64  1.38e-268  91.37         DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G180900 PE...
A0A1S3CEZ1  1.13e-267  91.12         putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103500196 PE=3 SV=1
A0A6J1CCK2  4.38e-263  90.38         protein ALP1-like OS=Momordica charantia OX=3673 GN=LOC111009982 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT3G55350.1  1.4e-138  60.39         PIF / Ping-Pong family of plant transposases
AT3G63270.1  1.6e-97   43.94         CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G12010.1  5.9e-36   28.70         unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, ...
AT4G29780.1  8.2e-30   27.08         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G72270.1  1.3e-19   24.68         CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR0217...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                Source   Source Term      Source Description  Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM     PF13359          DDE_Tnp_4           coord: 177..342; e-value: 8.8E-30; score: 103.5
None       No IPR available                               PANTHER  PTHR22930        UNCHARACTERIZED     coord: 1..390
None       No IPR available                               PANTHER  PTHR22930:SF205  PROTEIN ALP1-LIKE   coord: 1..390
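
The `coord:` values above give the span of each match on the protein. As a hedged illustration that is not part of the original record, the sketch below parses such a value, computes the span length, and converts it into a Python slice for cutting the domain out of a protein string. It assumes the coordinates are 1-based and inclusive; the helper names `parse_coord`, `span_length`, and `to_slice` are hypothetical.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 177..342'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no coord field in %r" % text)
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Length of a 1-based, inclusive span (assumed convention)."""
    return end - start + 1


def to_slice(start, end):
    """Convert a 1-based inclusive span to a 0-based Python slice."""
    return slice(start - 1, end)


# PF13359 (DDE_Tnp_4) match from the InterPro table above.
start, end = parse_coord("coord: 177..342")
print(start, end, span_length(start, end))
```

Under the stated assumption, the PF13359 match at 177..342 spans 166 residues, and `protein[to_slice(177, 342)]` would return that region from a protein sequence string.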

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g10530.1  Cp4.1LG01g10530.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0046872      metal ion binding