CmoCh07G004230 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh07G004230
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionApurinic-apyrimidinic endonuclease 2
LocationCmo_Chr07: 1915641 .. 1921729 (-)
RNA-Seq ExpressionCmoCh07G004230
SyntenyCmoCh07G004230
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGAAAATGGACCCCAAGAGGGCAAAATCCTCGCCGGGAAAACGTTGCAACTACGACGCCGACCGACGTTTCCGGTGAGCAACGTCGTTTTTCCATCTCCCGGTGGCCGTTTTTTGGCCGGCGCTGTTGTAACAGAGTGTTCACCTGGTTCTTCAGGACAAATTCTCCATTTCTGTTCAGGCTGAGGGAGACGGTATGCATTGAAATGAACAAACAACGAATTACAAGATGAAGATAGTTACTTACAACGTCAATGGTCTTAGGCCACGCATCGCACAGTATGGTTCACTCCTTAAGTTGCTCGATTCCTTCGATGCCGACATTATCTGCTTTCAGGTCTGCTGTGCTTTCATGTTTGATTAATTAGTTTTAGTCTGGTTCGGTATTCGTGATTTCCGTCTTCCATTTAGGAAACGAAATTAAGGAGGCAGGAATTGCGAGCGGATTTAATCATTGCTGATGGTTATGAATCATTCGTTTCATGCACTCGTACCTCTGAGAAAGGTCGAACCGGCTACTCAGGTTAGCGAGTTCTTTGTTGAATCGCCTAAACTAGGTCTACTATAAGTTTGTTGTAACTCGATGTCATTTTGTTTATCGAATTCTAATCTTGAGAGTTGGTTCACTGTTCAAATTTGCTGTCGTTTGACTATGCCTTTGTATCTCTTAGCAACGACATGGAATTTAAATGTCAATTATCTTCAGTCGTCGGCCATCCTAGTGTCAATGAAAGGTAGCTATGTCAGCATTAGTTTATTTAGCCGAAGTTTTGTAGGGGTTGCTACGTTTTGCCGAGTTAAGTCAGCATTTTCAAGTAATGAAGTAGCCTTGCCAGTTCGGGCGGAGGAAGGCTTCACTGGACTTTTAGAAAGTTCGCACAAAGGAGAAGGCACAATGCCTGCAGTTGCAGAAGGGCTTGAGGAATTTTCGAAAGAGGAGCTCCTCAAAGTTGACAGAGAGGGGCGTTGTATTGTCACAGATCATGGTCATTTTGGTAAGTTAAAAGACTATAGTTCATAAGTTTGTCATCCAAGTGTGACATTATGGTTTTGTTGTTGGTCTAGTAGTCTATACTTGCTGTGTCAAGTGAACCTTATTCTTGTAGAAAATGCATGCCAGCCATTTTCTTATGCCACACTTCTGAAATGCAAACATTCGGAGTTGTTCATTTCTCTCGCTCTACATTTCTTTTTTAAATTCATAACTTGATGGAATTTGTTTGGTTTCTATTGATTAGTTCTCTTCAATATTTATGGACCTCGAGCTCAAAGTGATGATACAGAAAGGGTTCTATTTAAGTCAAACTTTTACAATATACTACAGGTGATTATATTTTTGAACCCTACCGTGAATATATGATCATTATATAATTTTCAGGCAGTATGGATTTACAATAAGTTATCATTTGACTAATTTTGGATAATTCCTTTGTAGAAAAGATGGGAGCACCTTCTGCGTCAAGGAAAAAGGATATTTGTTGTTGGTGATCTCAATATTGCACCCACCTCTATGGATCATTGTGATGCAGGACCAGATTTTGAGAATAATGAGTAAGCATGTTATCTGGCCTTCTAAATGTTAGTGGCCTTATGCTCTTGCCTAATTCTATTGCTGTCTTGCTCTCAACGACTATAATTTCGTTTGAATTTTATTGGGGGACATATATGATTTTTCTTATAGGTTTCGGAGATGGCTGAGATCTTTACTGGTGGGATGTGGTGGTCATTTCACTGACATTTTCAGAGCGAAACATCCCGATCGGTAGGGTTTTGTGGATGAGAAGTTTCTGATTTGCTCTACTCTGTTGGAATATGTCTATTGCCATTGATAAAACAAAAATCATCGGTACTTTCTGGCTGTCTATTTGTGGTATTAGCTAATAAGATGGAATTGTTGTTTGATGATATTAACCCACCCACCAGTACATGCATTTTGAATTAGTGACATAAGATACAATTATGCCATTATCATCGTTGCTCAAGTACCACTGGCAGTCATTTTAATTTTGGATATAACACTTCTACCTCCTTTTTATCACTATGGTCTGTCTTAGCCCTGTTTCTCAGCTTATAGACAGACTAGAATAATATATTTCTCTGTGTGTAGATGATTTGACCAATTCACTTGCAATCAAATTTAGTAGCTCTATTGTTTTGTTTTTGCAGAAACGATGCATATACATGCTGGCCTCAAAGTACAGGTGCCGAAGTATTCAATTATGGGACAAGGATTGACCATATTTTGTGTGCTGGACCATGCTTACATCCGGACAGCAACTTGCCAAGCCATGATATTGTGAATTGCCATGTTCTGGAATGTGACATACTGTCACAGTATAAACGCTGGAAAGATGGAAATTCATTTAGGTACATAAATCAAATTGGATGTCAAGGTCTATTTTCCATAAATTATTAAATTGATTATGAAGCTATGGTATTATACGTACTCACATATTTCAGCTATTGGATTCTTCATTTTGATGATGCTTGACCCTGAATTTCTGCTGAGCATCAATTATTTGGTTGTATTTATAGTGGATTATTATGTGTGTTCCTAATCCTGCTATAACATCAAATACTAGAAGGATTTTTATAATATTGTGAAAAGTTGTGCTGTTGTCATCAACATTACCATTGCGAATATCAACTTAAACTAATTCAAGATATAATGGAGTTCAACGCACCAAATTCAACCAAGAAGCTTGGATGTGTTAGGAGGTGGACGATACAGAATAAGAACTAGGGGTCGAATAAAGTCTAATGGAATTCCATTCCACATTGAGTTAGGTTTATATTTCTTCAAGTACAGTTGCTACAAGAATTCAGATTAGAACAAAGTTTTGTTAGCTTTTTATGACTTGGCATATGACTCAAGTCCAAGCTCCGAACTAGATTCCCAAGGAATTGTAGACAATTCTACATTTCTCAATTTTTCTCGGTGTTTTTTTCTTTTCTTTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTCTGAATGATACCCAAATCTGATGTTAGTCCCTTATGATTGCCTTTTGATCATTTTTCCATTTCAGTTGAACTGTGGAAAGTTGAGTATCGCCTTTTGTCACGGTGAAAAACAATGACAATGTCATAAAACCAAGTGCTCACTGTTGCTTGACGCAAACAGCTCTTAGTGCTTTGTAGATGGATAACATAAATTATTTAGAGTTTCTTCAAAATGAATATTAAAGAATATCCAATTACTGGTTATGAAGTAATACACACTGTCTATCATATTTTCTCATCCCTGTGGAGGGTCTCTAATATCACTTTGCTATATAGGTGGAAAGGAGAGCAGACCGTTAAACTAGAAGGTTCTGATCATGCACCCGTTTATGCAAGTCTACTGGAAATCCCTGATACCCCCCAACATAGTGCTCCATCTTTATCTGCAAGATACAATCCCAAGATTCACGGGCTTCAGCAAACGCTTGGTAAGGAGATATTTCCTCATTTTTGCTTGGTCGCCCCATTCAGAATTAATAATATTCTTATGCTTATGATGGGTGTAGGAATTGTGTTTACTCTCGACTGCACTTTTTGCTCTGATGCATCATATTTACATCTTGAATGTGTTCTGATCATATTTTTATCATCTGATAAATGTGGCTCATATATGTANTTTTGTTGATTTTTTTTTTTTTTTTTTGTCCGAAAGGCAATGAGGAAAATTTCAAAATGGACAAGTGGTTTACAGTCCTACGATATTAAAAAAAATGTCCATCCCTGTAGAAGTGTGAACGTATAAGAATGCATGTTATTGAATGTATCTATAAGACCTCTTCTATTTCTTTTAAATTGTTGAGGAAATGCCATTCTCAACTCCAAAATTCAACGTAGGGCTCGGATTTGATAGTGTTAATTCTTACTTTGAGGCTGTATCAATTGTTTTTGGAAGGATTAGTTTCATTTCATGGCACTGCCAACGTCTCAGCATAACTGAATCACATGTTTTAATTGGAAGTCAGATTTGTCAGCTGTTTTGTTGCTTCTTTTTATATTCTCCTCTTGAGTCTAATGTGACCATGAACACATGCAGTATCTATGCTACTGAAAAAACAAGCTGCTGAAGGTTCAGCAACATGCAAGATATCAAATTCATTTTCGAGTGGGAACGACGTCTTAGGGAATTGTTCCCAGGGGCTCAATGGATCATTTGATAATGGTGATTTATCAGGCCTTCTTCCTGGTGAATCATGTTCCTCGACAAACATAGAAACTGAAGATTCTCTATTAAAAACAGAAGAGAGTTCCGGTGGAGGCTATTCTGAGGAAGCTCCATGCAACACCTTAATTACGCACGAGTCTCTACATACAAAAACATTACCTGAGAATGAAACTAGGAAAAGAGTAAGAAGGAGTTCCCAGATGTCATTAAAGTCGTTCTTCCAGACGAACTCGGTTATTAGCAACGTTGCTGACAGCTCTAATGCTAATTCTACAATTAACAAAGCAGATACCTCCGAATCTAATCCTATTGAAATTCCTAGATCAGATACTCATATTACCGATTCAGGGAAATATTTAGAAGAAAACCCGGATCAGTCTCATATCAATGCATCTTCTGTAGAGAGAGAGAAGAGTGGCGTTGCCTTATTGGAGTGGCGGAGGATACAGCAGGTTATGCAGAATAGCATACCTCTTTGCAAGCGCCATAAGGAACCTTGTGTTGCTCGAGTAGTTAAGAAACAAGGTCCTAATAATGGCCGAAGATTTTATGTTTGTGCTCGTGCTGAGGTAACTGAACACTCTTATTTTCTCAGTTACGCATTTTATTAAAGTATGGATAGAACTTAAGATTTGATGGATTTTTACAAATTACCAAGTCCACAGGTGATGAACAGTACTCAAGTATTGATGAATGAATAAATTTATCATAAATATGCGTTCAATAGCAATTTCTGCCACTAACTTATGTAATTTGCATTGATACTTGTCTTGTAGCCAAGGTTTTTATTGATTTACACCTTAGAGAACTGAAATTTTCTTTTACTTCACCAGGGTCCCGCATCGAATCCTGAAGCAAATTGTGGTTACTTTAAATGGGCGGTTTCCAAATCTCGGCATAACTGATGATGGTAGTTGCTGATTTCGTAATACCAACTTGGCCATATTCATTCCTTGTTATGTATCTCTTCATGGATGGCTGAGTCACTGACCGTCAGTTTTGTAAGTGTTCTCCTCTTCTTACCTGTTTATATGAAATTCAATTATGTAGCTAGCTGGAATCAGCTCCATTCAGTAATGCCTTGTGCACATAAAATTGTCATTTCTTTTCCTTAATGATAATATGAATTAGAAACCACTTTCTTTACCAAAACACACCCACAAAACCAACATTTGCTTCAGGAAATTATGATCCAACTGTCCTGGCTTTGCAATGGCTTGTTAAAAAGCAGAAATTATGATCCAACTTATCTGCAATTGGCATAGTGACGATTGTTTGCAATATGGATCCTCCTATGCTCTTTGAACAAAGTGGCTGGGTGGTTTGCTGGGAAAGCTAAAGAGGTATGCTGGTTCAGTGTTCACCATTACCTAAAGCTCTGACATTTTTTTTTTATTATTATTTGCTTGTGAACTTGCTACACTAAATTAGGAGCCTTTGCATGCATATTCTGTTGAGAACCTCAATGAAGAGATGAGCCATATAACATTTCTCCTCCCCTTCTTCACACACTGATTAAGAACCATGAATTCAGTAGAAGATCTCCATTTCGATCAGACGTCAATTGAACCTTTCTCCTTCCTCTAAACCTCCTACATGTAGCAGCTAACTCATTACTATCATATAGTGATAGTCACTTGTTGGATGATGAAAGTCCCACATCGGCTAATTTAGGGAATGATCAT

mRNA sequence

TGAAAATGGACCCCAAGAGGGCAAAATCCTCGCCGGGAAAACGTTGCAACTACGACGCCGACCGACGTTTCCGGACAAATTCTCCATTTCTGTTCAGGCTGAGGGAGACGTATGGTTCACTCCTTAAGTTGCTCGATTCCTTCGATGCCGACATTATCTGCTTTCAGGAAACGAAATTAAGGAGGCAGGAATTGCGAGCGGATTTAATCATTGCTGATGGTTATGAATCATTCGTTTCATGCACTCGTACCTCTGAGAAAGGTCGAACCGGCTACTCAGGGGTTGCTACGTTTTGCCGAGTTAAGTCAGCATTTTCAAGTAATGAAGTAGCCTTGCCAGTTCGGGCGGAGGAAGGCTTCACTGGACTTTTAGAAAGTTCGCACAAAGGAGAAGGCACAATGCCTGCAGTTGCAGAAGGGCTTGAGGAATTTTCGAAAGAGGAGCTCCTCAAAGTTGACAGAGAGGGGCGTTGTATTGTCACAGATCATGGTCATTTTGTTCTCTTCAATATTTATGGACCTCGAGCTCAAAGTGATGATACAGAAAGGGTTCTATTTAAGTCAAACTTTTACAATATACTACAGAAAAGATGGGAGCACCTTCTGCGTCAAGGAAAAAGGATATTTGTTGTTGGTGATCTCAATATTGCACCCACCTCTATGGATCATTGTGATGCAGGACCAGATTTTGAGAATAATGAGTTTCGGAGATGGCTGAGATCTTTACTGGTGGGATGTGGTGGTCATTTCACTGACATTTTCAGAGCGAAACATCCCGATCGAAACGATGCATATACATGCTGGCCTCAAAGTACAGGTGCCGAAGTATTCAATTATGGGACAAGGATTGACCATATTTTGTGTGCTGGACCATGCTTACATCCGGACAGCAACTTGCCAAGCCATGATATTGTGAATTGCCATGTTCTGGAATGTGACATACTGTCACAGTATAAACGCTGGAAAGATGGAAATTCATTTAGGTGGAAAGGAGAGCAGACCGTTAAACTAGAAGGTTCTGATCATGCACCCGTTTATGCAAGTCTACTGGAAATCCCTGATACCCCCCAACATAGTGCTCCATCTTTATCTGCAAGATACAATCCCAAGATTCACGGGCTTCAGCAAACGCTTGTATCTATGCTACTGAAAAAACAAGCTGCTGAAGGTTCAGCAACATGCAAGATATCAAATTCATTTTCGAGTGGGAACGACGTCTTAGGGAATTGTTCCCAGGGGCTCAATGGATCATTTGATAATGGTGATTTATCAGGCCTTCTTCCTGGTGAATCATGTTCCTCGACAAACATAGAAACTGAAGATTCTCTATTAAAAACAGAAGAGAGTTCCGGTGGAGGCTATTCTGAGGAAGCTCCATGCAACACCTTAATTACGCACGAGTCTCTACATACAAAAACATTACCTGAGAATGAAACTAGGAAAAGAGTAAGAAGGAGTTCCCAGATGTCATTAAAGTCGTTCTTCCAGACGAACTCGGTTATTAGCAACGTTGCTGACAGCTCTAATGCTAATTCTACAATTAACAAAGCAGATACCTCCGAATCTAATCCTATTGAAATTCCTAGATCAGATACTCATATTACCGATTCAGGGAAATATTTAGAAGAAAACCCGGATCAGTCTCATATCAATGCATCTTCTGTAGAGAGAGAGAAGAGTGGCGTTGCCTTATTGGAGTGGCGGAGGATACAGCAGGTTATGCAGAATAGCATACCTCTTTGCAAGCGCCATAAGGAACCTTGTGTTGCTCGAGTAGTTAAGAAACAAGGTCCTAATAATGGCCGAAGATTTTATGTTTGTGCTCGTGCTGAGGGTCCCGCATCGAATCCTGAAGCAAATTGTGGTTACTTTAAATGGGCGGTTTCCAAATCTCGGCATAACTGATGATGGTAGTTGCTGATTTCGTAATACCAACTTGGCCATATTCATTCCTTGTTATGTATCTCTTCATGGATGGCTGAGTCACTGACCGTCAGTTTTGAAATTATGATCCAACTGTCCTGGCTTTGCAATGGCTTGTTAAAAAGCAGAAATTATGATCCAACTTATCTGCAATTGGCATAGTGACGATTGTTTGCAATATGGATCCTCCTATGCTCTTTGAACAAAGTGGCTGGGTGGTTTGCTGGGAAAGCTAAAGAGGTATGCTGGTTCAGTGTTCACCATTACCTAAAGCTCTGACATTTTTTTTTTATTATTATTTGCTTGTGAACTTGCTACACTAAATTAGGAGCCTTTGCATGCATATTCTGTTGAGAACCTCAATGAAGAGATGAGCCATATAACATTTCTCCTCCCCTTCTTCACACACTGATTAAGAACCATGAATTCAGTAGAAGATCTCCATTTCGATCAGACGTCAATTGAACCTTTCTCCTTCCTCTAAACCTCCTACATGTAGCAGCTAACTCATTACTATCATATAGTGATAGTCACTTGTTGGATGATGAAAGTCCCACATCGGCTAATTTAGGGAATGATCAT

Coding sequence (CDS)

ATGGACCCCAAGAGGGCAAAATCCTCGCCGGGAAAACGTTGCAACTACGACGCCGACCGACGTTTCCGGACAAATTCTCCATTTCTGTTCAGGCTGAGGGAGACGTATGGTTCACTCCTTAAGTTGCTCGATTCCTTCGATGCCGACATTATCTGCTTTCAGGAAACGAAATTAAGGAGGCAGGAATTGCGAGCGGATTTAATCATTGCTGATGGTTATGAATCATTCGTTTCATGCACTCGTACCTCTGAGAAAGGTCGAACCGGCTACTCAGGGGTTGCTACGTTTTGCCGAGTTAAGTCAGCATTTTCAAGTAATGAAGTAGCCTTGCCAGTTCGGGCGGAGGAAGGCTTCACTGGACTTTTAGAAAGTTCGCACAAAGGAGAAGGCACAATGCCTGCAGTTGCAGAAGGGCTTGAGGAATTTTCGAAAGAGGAGCTCCTCAAAGTTGACAGAGAGGGGCGTTGTATTGTCACAGATCATGGTCATTTTGTTCTCTTCAATATTTATGGACCTCGAGCTCAAAGTGATGATACAGAAAGGGTTCTATTTAAGTCAAACTTTTACAATATACTACAGAAAAGATGGGAGCACCTTCTGCGTCAAGGAAAAAGGATATTTGTTGTTGGTGATCTCAATATTGCACCCACCTCTATGGATCATTGTGATGCAGGACCAGATTTTGAGAATAATGAGTTTCGGAGATGGCTGAGATCTTTACTGGTGGGATGTGGTGGTCATTTCACTGACATTTTCAGAGCGAAACATCCCGATCGAAACGATGCATATACATGCTGGCCTCAAAGTACAGGTGCCGAAGTATTCAATTATGGGACAAGGATTGACCATATTTTGTGTGCTGGACCATGCTTACATCCGGACAGCAACTTGCCAAGCCATGATATTGTGAATTGCCATGTTCTGGAATGTGACATACTGTCACAGTATAAACGCTGGAAAGATGGAAATTCATTTAGGTGGAAAGGAGAGCAGACCGTTAAACTAGAAGGTTCTGATCATGCACCCGTTTATGCAAGTCTACTGGAAATCCCTGATACCCCCCAACATAGTGCTCCATCTTTATCTGCAAGATACAATCCCAAGATTCACGGGCTTCAGCAAACGCTTGTATCTATGCTACTGAAAAAACAAGCTGCTGAAGGTTCAGCAACATGCAAGATATCAAATTCATTTTCGAGTGGGAACGACGTCTTAGGGAATTGTTCCCAGGGGCTCAATGGATCATTTGATAATGGTGATTTATCAGGCCTTCTTCCTGGTGAATCATGTTCCTCGACAAACATAGAAACTGAAGATTCTCTATTAAAAACAGAAGAGAGTTCCGGTGGAGGCTATTCTGAGGAAGCTCCATGCAACACCTTAATTACGCACGAGTCTCTACATACAAAAACATTACCTGAGAATGAAACTAGGAAAAGAGTAAGAAGGAGTTCCCAGATGTCATTAAAGTCGTTCTTCCAGACGAACTCGGTTATTAGCAACGTTGCTGACAGCTCTAATGCTAATTCTACAATTAACAAAGCAGATACCTCCGAATCTAATCCTATTGAAATTCCTAGATCAGATACTCATATTACCGATTCAGGGAAATATTTAGAAGAAAACCCGGATCAGTCTCATATCAATGCATCTTCTGTAGAGAGAGAGAAGAGTGGCGTTGCCTTATTGGAGTGGCGGAGGATACAGCAGGTTATGCAGAATAGCATACCTCTTTGCAAGCGCCATAAGGAACCTTGTGTTGCTCGAGTAGTTAAGAAACAAGGTCCTAATAATGGCCGAAGATTTTATGTTTGTGCTCGTGCTGAGGGTCCCGCATCGAATCCTGAAGCAAATTGTGGTTACTTTAAATGGGCGGTTTCCAAATCTCGGCATAACTGA

Protein sequence

MDPKRAKSSPGKRCNYDADRRFRTNSPFLFRLRETYGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGRCIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIAPTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVFNYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKLEGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKISNSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEEAPCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKADTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNSIPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN
Homology
BLAST of CmoCh07G004230 vs. ExPASy Swiss-Prot
Match: F4JNY0 (DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Arabidopsis thaliana OX=3702 GN=APE2 PE=1 SV=1)

HSP 1 Score: 613.2 bits (1580), Expect = 3.2e-174
Identity = 325/580 (56.03%), Postives = 400/580 (68.97%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           + SLLKLLDSFDADIICFQETKLRRQEL ADL IADGYESF SCTRTSEKGRTGYSGVAT
Sbjct: 18  FDSLLKLLDSFDADIICFQETKLRRQELTADLAIADGYESFFSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLES-SHKGEGTMPAVAEGLEEFSKEELLKVDREG 155
           FCRVKSA SS E ALPV AEEG TGL+ S S  G+     VAEGLEE+ KEELL +D+EG
Sbjct: 78  FCRVKSASSSCETALPVTAEEGITGLVNSNSRGGKSETSTVAEGLEEYEKEELLMIDQEG 137

Query: 156 RCIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNI 215
           RC++TDHGHFV+FN+YGPRA +DD +R+ FK  FY +L++RWE LLRQG+R+FVVGDLNI
Sbjct: 138 RCVITDHGHFVVFNVYGPRAVADDADRIEFKHRFYGVLERRWECLLRQGRRVFVVGDLNI 197

Query: 216 APTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEV 275
           AP +MD C+AGPDFE NEFR+W RSLLV  GG F+D+FR+KHP+R DA+TCW  S+GAE 
Sbjct: 198 APFAMDRCEAGPDFEKNEFRKWFRSLLVERGGSFSDVFRSKHPERKDAFTCWSSSSGAEQ 257

Query: 276 FNYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGN-SFRWKGEQTV 335
           FNYG+RIDHIL AG CLH D +   H  + CHV ECDIL++YKR+K+ N   RWKG    
Sbjct: 258 FNYGSRIDHILVAGSCLHQDEDKQGHSFLACHVKECDILTEYKRFKNENMPTRWKGGLVT 317

Query: 336 KLEGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCK 395
           K +GSDH PV+ S  ++PD P+HS P L++RY P I+G QQTLVS+  K++A E +   +
Sbjct: 318 KFKGSDHVPVFISFDDLPDIPEHSTPPLASRYLPMIYGFQQTLVSVFKKRRANEEAKAIE 377

Query: 396 ISNSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYS 455
           +S S S+ ++    C     G   N    G+   +SCS  N  T      TE  +     
Sbjct: 378 VSCSSSTQSNTSSICGDISTGPLRNCGSMGISLEKSCSFENKSTSG---VTEAETVAATG 437

Query: 456 EEAPCNTLITHESLHTKTLPENETRKRVRR--SSQMSLKSFFQTNSVISNVADSSNANST 515
                +  I   S+    +  +  RK+ R+  SSQ+SLKSFF TNS ++NV DSS+    
Sbjct: 438 SIDNLSDGIRASSVRALNISRDGDRKKARKIQSSQLSLKSFFTTNSKVNNVEDSSS---- 497

Query: 516 INKADTSESNPIEIPRSDTHITDSGKYLEE--NPDQSHINASSVEREKSGVALLEWRRIQ 575
            +   +S S+ +E   S T    SGK   E     Q      S  ++K+  AL+EW+RIQ
Sbjct: 498 -SYVSSSPSSQVE---SITEPNVSGKEDSEPTTSTQEQDQTGSSAKQKNDAALMEWQRIQ 557

Query: 576 QVMQNSIPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAE 610
            +MQNSIPLCK HKE CVARVVKK GP  GRRFYVC+RAE
Sbjct: 558 NLMQNSIPLCKGHKEACVARVVKKPGPTFGRRFYVCSRAE 586

BLAST of CmoCh07G004230 vs. ExPASy Swiss-Prot
Match: Q68G58 (DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Mus musculus OX=10090 GN=Apex2 PE=1 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 1.1e-57
Identity = 174/608 (28.62%), Postives = 262/608 (43.09%), Query Frame = 0

Query: 38  SLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVATFC 97
           +L ++LD  DADI+C QETK+ R  L   L I +GY S+ S +R+    R+GYSGVATFC
Sbjct: 30  ALRRVLDELDADIVCLQETKVTRDVLTEPLAIVEGYNSYFSFSRS----RSGYSGVATFC 89

Query: 98  RVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGRCI 157
           +        + A PV AEEG +G+  + +   G        ++EF++EEL  +D EGR +
Sbjct: 90  K--------DSATPVAAEEGLSGVFATLNGDIGCY----GNMDEFTQEELRVLDSEGRAL 149

Query: 158 VTDH---------GHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFV 217
           +T H             L N+Y P A     ER+ FK  FY +LQ R E LL  G  + +
Sbjct: 150 LTQHKIRTLEGKEKTLTLINVYCPHADPGKPERLTFKMRFYRLLQMRAEALLAAGSHVII 209

Query: 218 VGDLNIAPTSMDHCDAG--PDFENNEFRRWLRSLLVGCG-------GHFTDIFRAKHPDR 277
           +GDLN A   +DHCDA     FE +  R+W+  LL   G       G F D +R  HP +
Sbjct: 210 LGDLNTAHRPIDHCDASSLECFEEDPGRKWMDGLLSNPGDEAGPHIGLFMDSYRYLHPKQ 269

Query: 278 NDAYTCWPQSTGAEVFNYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRW 337
             A+TCW   +GA   NYG+R+D++L        D  L         +L           
Sbjct: 270 QRAFTCWSVVSGARHLNYGSRLDYVL-------GDRALVIDTFQASFLLP---------- 329

Query: 338 KDGNSFRWKGEQTVKLEGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSM 397
                         ++ GSDH PV  ++L +   P    P+L  R+ P+  G Q  ++  
Sbjct: 330 --------------EVMGSDHCPV-GAVLNVSCVPAKQCPALCTRFLPEFAGTQLKILRF 389

Query: 398 L--LKKQAAEGSATCKISNSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIET 457
           L  L+++        + S+   +                         P ++C  +    
Sbjct: 390 LVPLEQEPVREQQVLQPSHQIQAQRQ----------------------PRKACMHST--- 449

Query: 458 EDSLLKTEESSGGGYSEEAPCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNS 517
                +  +S GG                                +  Q +L S+FQ +S
Sbjct: 450 -----RLRKSQGG-------------------------------PKRKQKNLMSYFQPSS 509

Query: 518 VISNVADSSNANSTINKADTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREK 577
            +S         S +         P+  P++   +  +   LEE       N     +++
Sbjct: 510 SLSQ-------TSGVELPTLPLVGPLTTPKTAEEVA-TATVLEEK------NKVPESKDE 513

Query: 578 SGVALLEWRRIQQVMQNSIPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEA 626
            G     W+ +     + +PLC  H+EPCV R VKK GPN GR+FY+CAR  GP S+P +
Sbjct: 570 KGERTAFWKSMLS-GPSPMPLCGGHREPCVMRTVKKTGPNFGRQFYMCARPRGPPSDPSS 513

BLAST of CmoCh07G004230 vs. ExPASy Swiss-Prot
Match: Q5E9N9 (DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Bos taurus OX=9913 GN=APEX2 PE=2 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 6.1e-56
Identity = 177/605 (29.26%), Postives = 259/605 (42.81%), Query Frame = 0

Query: 41  KLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVATFCRVK 100
           ++LD  DADI+C QETK+ R  L   L I +GY S+ S +R     R+GYSGVATFC+  
Sbjct: 34  RILDKLDADIVCLQETKVTRDVLTEPLAIIEGYNSYFSFSR----NRSGYSGVATFCK-- 93

Query: 101 SAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGRCIVTD 160
                 + A PV AEEG +GLL + +   G        +++F++EEL  +D EGR ++T 
Sbjct: 94  ------DSATPVAAEEGLSGLLSTQNGDVGCY----GNMDDFTQEELRALDSEGRALLTQ 153

Query: 161 H---------GHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGD 220
           H             L N+Y P A     ER+ FK  FY +LQ R E LL  G  + ++GD
Sbjct: 154 HKICTWEGKEKTLTLINVYCPHADPGKPERLTFKMRFYRLLQIRAEALLAAGSHVIILGD 213

Query: 221 LNIAPTSMDHCDA--GPDFENNEFRRWLRSLL--VGC--GGH---FTDIFRAKHPDRNDA 280
           LN A   +DH DA     FE +  R+W+  LL  +GC  G H   F D +R   P +  A
Sbjct: 214 LNTAHRPIDHWDAVNMECFEEDPGRKWMDGLLSNLGCESGSHMGPFIDSYRCFQPKQKGA 273

Query: 281 YTCWPQSTGAEVFNYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDG 340
           +TCW   +GA   NYG+R+D++L        D  L      +  +L              
Sbjct: 274 FTCWSTVSGARHLNYGSRLDYVL-------GDRTLVIDTFQSSFLLP------------- 333

Query: 341 NSFRWKGEQTVKLEGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLK 400
                      ++ GSDH PV  ++L +   P    P L   + P+  G Q  ++  L+ 
Sbjct: 334 -----------EVMGSDHCPV-GAVLSVSSVPAKQCPPLCTCFLPEFAGTQLKILRFLVH 393

Query: 401 KQAAEGSATCKISNSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLL 460
            +                                                     +D + 
Sbjct: 394 FK-----------------------------------------------------QDPVF 453

Query: 461 KTEESSGGGYSEEAPCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNV 520
           K         S   P N    H       + +N+ R R  RS      S     +++S  
Sbjct: 454 K--------QSALQPSNQTQVH-------MRKNKARVRSTRSRPSKTGSSRGQKNLMSYF 511

Query: 521 ADSSNANSTINKADTSESNPIEIPRSDTHIT--DSGKYLEENPDQSHINASSVEREKSGV 580
             SS+   T N         +++P   T IT   S + +  N  +    AS  + EK  +
Sbjct: 514 QPSSSGPQTSN---------LDLPSLGTLITPKTSEEDVMANVVEGQTKASEAKDEKE-I 511

Query: 581 ALLEWRRIQQVMQNSIPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCG 626
               W+ +     + +PLC  H+EPCV R VKK GPN GR FY+CAR +GP ++P + C 
Sbjct: 574 RTSFWKSLLG-GPSPMPLCGGHREPCVMRTVKKPGPNLGRHFYMCARPQGPPTDPSSRCN 511

BLAST of CmoCh07G004230 vs. ExPASy Swiss-Prot
Match: Q9UBZ4 (DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Homo sapiens OX=9606 GN=APEX2 PE=1 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 2.0e-54
Identity = 173/604 (28.64%), Postives = 253/604 (41.89%), Query Frame = 0

Query: 41  KLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVATFCRVK 100
           ++LD  DADI+C QETK+ R  L   L I +GY S+ S +R     R+GYSGVATFC+  
Sbjct: 34  RILDELDADIVCLQETKVTRDALTEPLAIVEGYNSYFSFSR----NRSGYSGVATFCK-- 93

Query: 101 SAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGRCIVTD 160
                 + A PV AEEG +GL  + +   G        ++EF++EEL  +D EGR ++T 
Sbjct: 94  ------DNATPVAAEEGLSGLFATQNGDVGCY----GNMDEFTQEELRALDSEGRALLTQ 153

Query: 161 H---------GHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGD 220
           H             L N+Y P A     ER++FK  FY +LQ R E LL  G  + ++GD
Sbjct: 154 HKIRTWEGKEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGD 213

Query: 221 LNIAPTSMDHCDAG--PDFENNEFRRWLRSLL--VGCG-----GHFTDIFRAKHPDRNDA 280
           LN A   +DH DA     FE +  R+W+ SLL  +GC      G F D +R   P +  A
Sbjct: 214 LNTAHRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHVGPFIDSYRCFQPKQEGA 273

Query: 281 YTCWPQSTGAEVFNYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDG 340
           +TCW   TGA   NYG+R+D++L        D  L         +L              
Sbjct: 274 FTCWSAVTGARHLNYGSRLDYVL-------GDRTLVIDTFQASFLLP------------- 333

Query: 341 NSFRWKGEQTVKLEGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLK 400
                      ++ GSDH PV  ++L +   P    P L  R+ P+  G Q  ++  L+ 
Sbjct: 334 -----------EVMGSDHCPV-GAVLSVSSVPAKQCPPLCTRFLPEFAGTQLKILRFLVP 393

Query: 401 KQAAEGSATCKISNSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLL 460
            + +       + ++                                 + T ++T     
Sbjct: 394 LEQSPVLEQSTLQHN---------------------------------NQTRVQT----- 453

Query: 461 KTEESSGGGYSEEAPCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNV 520
                          C       S  T+  P      R     Q +LKS+FQ +      
Sbjct: 454 ---------------CQNKAQVRS--TRPQPSQVGSSR----GQKNLKSYFQPSPSCPQA 513

Query: 521 A-DSSNANSTINKADTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVA 580
           + D    +  +  A  +   P E  ++   +        E  D+  +  S  +   +G  
Sbjct: 514 SPDIELPSLPLMSALMTPKTPEE--KAVAKVVKGQAKTSEAKDEKELRTSFWKSVLAGPL 515

Query: 581 LLEWRRIQQVMQNSIPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGY 626
                          PLC  H+EPCV R VKK GPN GRRFY+CAR  GP ++P + C +
Sbjct: 574 -------------RTPLCGGHREPCVMRTVKKPGPNLGRRFYMCARPRGPPTDPSSRCNF 515

BLAST of CmoCh07G004230 vs. ExPASy Swiss-Prot
Match: P38207 (DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=APN2 PE=1 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 3.5e-19
Identity = 114/441 (25.85%), Postives = 172/441 (39.00%), Query Frame = 0

Query: 38  SLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVATFC 97
           SL  + D F ADII FQE K  +  + +     DG+ SF+S  +T    R GYSGV  + 
Sbjct: 42  SLRSVFDFFRADIITFQELKTEKLSI-SKWGRVDGFYSFISIPQT----RKGYSGVGCWI 101

Query: 98  RVKSAFSSNEVALP-VRAEEGFTGLL---ESSHKGEGTMPAVAEGL-------EEFSKEE 157
           R+         AL  V+AEEG TG L      H        V +G+        +  ++ 
Sbjct: 102 RIPEKNHPLYHALQVVKAEEGITGYLTIKNGKHSAISYRNDVNQGIGGYDSLDPDLDEKS 161

Query: 158 LLKVDREGRCIVTDHG-HFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKR 217
            L++D EGRC++ +     V+ ++Y P   +   E  +F+  F  +L +R  +L + GK+
Sbjct: 162 ALELDSEGRCVMVELACGIVIISVYCPANSNSSEEGEMFRLRFLKVLLRRVRNLDKIGKK 221

Query: 218 IFVVGDLNIAPTSMDHCDAGPDFE---------------------------NNEFRRWLR 277
           I ++GD+N+    +D  D    F                            +   RR   
Sbjct: 222 IVLMGDVNVCRDLIDSADTLEQFSIPITDPMGGTKLEAQYRDKAIQFIINPDTPHRRIFN 281

Query: 278 SLLVGC-------GGHFTDIFR-AKHPDRNDAYTCWPQSTGAEVFNYGTRIDHILCAGPC 337
            +L           G   D  R  +  +R   YT W         NYG+RID IL +   
Sbjct: 282 QILADSLLPDASKRGILIDTTRLIQTRNRLKMYTVWNMLKNLRPSNYGSRIDFILVS--- 341

Query: 338 LHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKLEGSDHAPVYASLLEI 397
           L  +  + + DI+       DIL                       GSDH PVY+ L  +
Sbjct: 342 LKLERCIKAADILP------DIL-----------------------GSDHCPVYSDLDIL 401

Query: 398 -----PDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGS-----ATCKISNSFSS 419
                P T Q   P   ARY  K +     ++ M  KK   + S        K+ N+  +
Sbjct: 402 DDRIEPGTTQVPIPKFEARY--KYNLRNHNVLEMFAKKDTNKESNKQKYCVSKVMNTKKN 443

BLAST of CmoCh07G004230 vs. ExPASy TrEMBL
Match: A0A6J1EKQ4 (Apurinic-apyrimidinic endonuclease 2 OS=Cucurbita moschata OX=3662 GN=LOC111433494 PE=3 SV=1)

HSP 1 Score: 1218.8 bits (3152), Expect = 0.0e+00
Identity = 597/597 (100.00%), Postives = 597/597 (100.00%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT
Sbjct: 18  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR
Sbjct: 78  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 137

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA
Sbjct: 138 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 197

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF
Sbjct: 198 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 257

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL
Sbjct: 258 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 317

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS
Sbjct: 318 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 377

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE
Sbjct: 378 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 437

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA
Sbjct: 438 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 497

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS
Sbjct: 498 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 557

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 633
           IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN
Sbjct: 558 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 614

BLAST of CmoCh07G004230 vs. ExPASy TrEMBL
Match: A0A6J1KNT8 (DNA-(apurinic or apyrimidinic site) endonuclease OS=Cucurbita maxima OX=3661 GN=LOC111497328 PE=3 SV=1)

HSP 1 Score: 1167.5 bits (3019), Expect = 0.0e+00
Identity = 573/597 (95.98%), Postives = 581/597 (97.32%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT
Sbjct: 19  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 78

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSSNEVALPVRAEEGF+GLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR
Sbjct: 79  FCRVKSAFSSNEVALPVRAEEGFSGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 138

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRAQSDDTERVLFK+NFYNILQKRWEHLLRQGKRIFVVGDLNIA
Sbjct: 139 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKTNFYNILQKRWEHLLRQGKRIFVVGDLNIA 198

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMD CDAGPDFENNEFRRW+RSLLV CGGHFTDIFRAKHPDR DAYTCW QSTGAEVF
Sbjct: 199 PTSMDRCDAGPDFENNEFRRWMRSLLVRCGGHFTDIFRAKHPDRKDAYTCWSQSTGAEVF 258

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYGTRIDHILCAGPCLHPDSN P HDIVNCHV+ECDILSQYKRWKDGNSFR KGEQTVKL
Sbjct: 259 NYGTRIDHILCAGPCLHPDSNFPGHDIVNCHVIECDILSQYKRWKDGNSFRRKGEQTVKL 318

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPVYASLLEIPDTPQHS PSLSARYNPKIHGLQQTLVSMLLK+QAAEGSATCKIS
Sbjct: 319 EGSDHAPVYASLLEIPDTPQHSTPSLSARYNPKIHGLQQTLVSMLLKRQAAEGSATCKIS 378

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NSFS GN VLGNCSQGLNGSFDNGDLSGLLP ESCSSTNI+TEDSLLKTEESSGG YSEE
Sbjct: 379 NSFSRGNIVLGNCSQGLNGSFDNGDLSGLLPDESCSSTNIDTEDSLLKTEESSGGDYSEE 438

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           APCNTLITHESLHTKTL ENETRKRVRRSSQMSLKSFFQ NSVISNVADSSNANS+INKA
Sbjct: 439 APCNTLITHESLHTKTLHENETRKRVRRSSQMSLKSFFQKNSVISNVADSSNANSSINKA 498

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           DTSESNPIEIPRSDTHITDSGKY EENPDQSHINA SVEREKSGVALLEWRRIQ+VMQNS
Sbjct: 499 DTSESNPIEIPRSDTHITDSGKYFEENPDQSHINAFSVEREKSGVALLEWRRIQEVMQNS 558

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 633
           IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN
Sbjct: 559 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 615

BLAST of CmoCh07G004230 vs. ExPASy TrEMBL
Match: A0A6J1BYG7 (Apurinic-apyrimidinic endonuclease 2 OS=Momordica charantia OX=3673 GN=LOC111005866 PE=3 SV=1)

HSP 1 Score: 1042.3 bits (2694), Expect = 8.0e-301
Identity = 510/596 (85.57%), Postives = 545/596 (91.44%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           +GSL KLLDSFDADIICFQETKLR+QE RADL+IADGYESFVSCTRTSEKGRTGYSGVAT
Sbjct: 18  FGSLRKLLDSFDADIICFQETKLRKQEWRADLVIADGYESFVSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSS+EVALPV AEEGFTGLLESS  G+ TMPAVAEGLEEFSK ELLKVD EGR
Sbjct: 78  FCRVKSAFSSSEVALPVGAEEGFTGLLESSQNGKVTMPAVAEGLEEFSKAELLKVDSEGR 137

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRA SDD+ERVLFK  FY+ILQKRWEHLL QGKRIFVVGDLNIA
Sbjct: 138 CIVTDHGHFVLFNIYGPRADSDDSERVLFKLKFYDILQKRWEHLLHQGKRIFVVGDLNIA 197

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMD CDAGPDFENNEFRRWLRSLLVGCGGHF DIFR KHPDR DAYTCWPQSTGAEVF
Sbjct: 198 PTSMDRCDAGPDFENNEFRRWLRSLLVGCGGHFIDIFRTKHPDRRDAYTCWPQSTGAEVF 257

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYG+RIDHILCAGPCLH DSNLP H+IV CHV+EC+IL QYKRWKDGNS RWKGE+T KL
Sbjct: 258 NYGSRIDHILCAGPCLHHDSNLPGHNIVACHVVECEILIQYKRWKDGNSNRWKGERTFKL 317

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPVYASLLEIPDTPQHS PSLSARYNP IHGLQQTLVSMLLK+QA E SA+CKIS
Sbjct: 318 EGSDHAPVYASLLEIPDTPQHSTPSLSARYNPMIHGLQQTLVSMLLKRQATEDSASCKIS 377

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NS S  N +LG+CSQGL+GSFDNGDLSG LP ESCSSTN+ETEDSLLKTEESSGGG+ E+
Sbjct: 378 NSLSRRNIILGSCSQGLDGSFDNGDLSGFLPSESCSSTNLETEDSLLKTEESSGGGFPEK 437

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           A CNTLIT +SL TKTLPENETRKRVRRSSQMSLKSFFQ NSVISN +DSSNA+S  NKA
Sbjct: 438 AACNTLITDKSLQTKTLPENETRKRVRRSSQMSLKSFFQKNSVISNDSDSSNADSLFNKA 497

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           DTS+SN IE+P+SDT  ++S +YL+ + DQ  ++ SSVE+EKSGVALLEWRRIQQVMQNS
Sbjct: 498 DTSQSNSIEVPKSDTQSSNSEQYLDASQDQYQLDVSSVEKEKSGVALLEWRRIQQVMQNS 557

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRH 632
           IPLCK HKEPCVARVVKKQGPNNGRRFYVCARAEGPASNP+ANCGYFKWA SKSRH
Sbjct: 558 IPLCKGHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPQANCGYFKWAASKSRH 613

BLAST of CmoCh07G004230 vs. ExPASy TrEMBL
Match: A0A5A7SJ45 (DNA-(apurinic or apyrimidinic site) endonuclease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G001280 PE=3 SV=1)

HSP 1 Score: 1035.0 bits (2675), Expect = 1.3e-298
Identity = 507/596 (85.07%), Postives = 544/596 (91.28%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           +GSLLKLLDSFDADIIC QETKLRRQELRADL+IADGYE+FVSCTRTSEKGRTGYSGVAT
Sbjct: 18  FGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYETFVSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSSNEVALPVRAEEGFTGLLESS  G+ TM AVAEGLEEFSKEELL++D EGR
Sbjct: 78  FCRVKSAFSSNEVALPVRAEEGFTGLLESSQDGKRTMGAVAEGLEEFSKEELLQLDSEGR 137

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRA+SDD++RVLFK  FYN+LQKRWEHLL  GKR+FVVGDLNIA
Sbjct: 138 CIVTDHGHFVLFNIYGPRAESDDSDRVLFKLKFYNVLQKRWEHLLHMGKRVFVVGDLNIA 197

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMD CDAGPDFENNEFRRWLRSLLV CGG F D+FRAKHPDR DAYTCWPQSTGAEVF
Sbjct: 198 PTSMDRCDAGPDFENNEFRRWLRSLLVACGGRFIDVFRAKHPDRRDAYTCWPQSTGAEVF 257

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYGTRIDHILCAGPCLH D++LP H+IV CHV+ECDILS+YKRWKDGNSFRWKGEQ+VKL
Sbjct: 258 NYGTRIDHILCAGPCLHHDNSLPGHNIVACHVMECDILSRYKRWKDGNSFRWKGEQSVKL 317

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPV ASLLEIPDTPQHS PSLSARYNPKIHGLQQTLVSMLLK+QAAE SA CK S
Sbjct: 318 EGSDHAPVSASLLEIPDTPQHSTPSLSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCKKS 377

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NS S GN +LGNCSQG NGSF+NGD  G+LP ESCS TN+ETEDSLLKT E +GG Y+EE
Sbjct: 378 NSSSRGNIILGNCSQGFNGSFNNGDQPGVLPSESCSLTNLETEDSLLKTGECAGGSYAEE 437

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           A CNTLI+HESLH K LPENETRKRV+R SQMSLKSFFQ NSV+SN A+SSNA+S I+KA
Sbjct: 438 AACNTLISHESLHAKALPENETRKRVKRCSQMSLKSFFQKNSVVSNDANSSNADSPISKA 497

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           +TSESNPIEIPRS+T  ++SG+ LE   DQS INAS VE+EKSGVALLEWRRIQQVMQNS
Sbjct: 498 ETSESNPIEIPRSNTQ-SNSGRQLEAYQDQSQINASPVEKEKSGVALLEWRRIQQVMQNS 557

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRH 632
           IPLCK HKE CVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWA SKSRH
Sbjct: 558 IPLCKGHKETCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAASKSRH 612

BLAST of CmoCh07G004230 vs. ExPASy TrEMBL
Match: A0A1S4DUC2 (DNA-(apurinic or apyrimidinic site) endonuclease OS=Cucumis melo OX=3656 GN=LOC103485130 PE=3 SV=1)

HSP 1 Score: 1035.0 bits (2675), Expect = 1.3e-298
Identity = 507/596 (85.07%), Postives = 544/596 (91.28%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           +GSLLKLLDSFDADIIC QETKLRRQELRADL+IADGYE+FVSCTRTSEKGRTGYSGVAT
Sbjct: 18  FGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYETFVSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSSNEVALPVRAEEGFTGLLESS  G+ TM AVAEGLEEFSKEELL++D EGR
Sbjct: 78  FCRVKSAFSSNEVALPVRAEEGFTGLLESSQDGKRTMGAVAEGLEEFSKEELLQLDSEGR 137

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRA+SDD++RVLFK  FYN+LQKRWEHLL  GKR+FVVGDLNIA
Sbjct: 138 CIVTDHGHFVLFNIYGPRAESDDSDRVLFKLKFYNVLQKRWEHLLHMGKRVFVVGDLNIA 197

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMD CDAGPDFENNEFRRWLRSLLV CGG F D+FRAKHPDR DAYTCWPQSTGAEVF
Sbjct: 198 PTSMDRCDAGPDFENNEFRRWLRSLLVACGGRFIDVFRAKHPDRRDAYTCWPQSTGAEVF 257

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYGTRIDHILCAGPCLH D++LP H+IV CHV+ECDILS+YKRWKDGNSFRWKGEQ+VKL
Sbjct: 258 NYGTRIDHILCAGPCLHHDNSLPGHNIVACHVMECDILSRYKRWKDGNSFRWKGEQSVKL 317

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPV ASLLEIPDTPQHS PSLSARYNPKIHGLQQTLVSMLLK+QAAE SA CK S
Sbjct: 318 EGSDHAPVSASLLEIPDTPQHSTPSLSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCKKS 377

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NS S GN +LGNCSQG NGSF+NGD  G+LP ESCS TN+ETEDSLLKT E +GG Y+EE
Sbjct: 378 NSSSRGNIILGNCSQGFNGSFNNGDQPGVLPSESCSLTNLETEDSLLKTGECAGGSYAEE 437

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           A CNTLI+HESLH K LPENETRKRV+R SQMSLKSFFQ NSV+SN A+SSNA+S I+KA
Sbjct: 438 AACNTLISHESLHAKALPENETRKRVKRCSQMSLKSFFQKNSVVSNDANSSNADSPISKA 497

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           +TSESNPIEIPRS+T  ++SG+ LE   DQS INAS VE+EKSGVALLEWRRIQQVMQNS
Sbjct: 498 ETSESNPIEIPRSNTQ-SNSGRQLEAYQDQSQINASPVEKEKSGVALLEWRRIQQVMQNS 557

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRH 632
           IPLCK HKE CVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWA SKSRH
Sbjct: 558 IPLCKGHKETCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAASKSRH 612

BLAST of CmoCh07G004230 vs. NCBI nr
Match: KAG6594773.1 (DNA-(apurinic or apyrimidinic site) lyase 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 604/632 (95.57%), Postives = 607/632 (96.04%), Query Frame = 0

Query: 1   MDPKRAKSSPGKRCNYDADRRFRTNSPFLFRLRETYGSLLKLLDSFDADIICFQETKLRR 60
           MDPKRAKSSPGKRCNYDADRRFRTNSPFLFRLRET                   ETKLRR
Sbjct: 1   MDPKRAKSSPGKRCNYDADRRFRTNSPFLFRLRETATHR--------------TETKLRR 60

Query: 61  QELRADLIIADGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVRAEEGFTG 120
           QELRADLIIADGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVRAEEGFTG
Sbjct: 61  QELRADLIIADGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVRAEEGFTG 120

Query: 121 LLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGRCIVTDHGHFVLFNIYGPRAQSDDTE 180
           LLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGRCIVTDHGHFVLFNIYGPRAQSDDTE
Sbjct: 121 LLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGRCIVTDHGHFVLFNIYGPRAQSDDTE 180

Query: 181 RVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIAPTSMDHCDAGPDFENNEFRRWLRSL 240
           RVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIAPTSMD CDAGPDFENNEFRRWLRSL
Sbjct: 181 RVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIAPTSMDRCDAGPDFENNEFRRWLRSL 240

Query: 241 LVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVFNYGTRIDHILCAGPCLHPDSNLPSH 300
           LVGCGGHFTDIFRAKHPDR DAYTCWPQSTGAEVFNYGTRIDHILCAGPCLHPDSNLPSH
Sbjct: 241 LVGCGGHFTDIFRAKHPDRKDAYTCWPQSTGAEVFNYGTRIDHILCAGPCLHPDSNLPSH 300

Query: 301 DIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKLEGSDHAPVYASLLEIPDTPQHSAPS 360
           DIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKLEGSDHAPVYASLLEIPDTPQHS PS
Sbjct: 301 DIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKLEGSDHAPVYASLLEIPDTPQHSTPS 360

Query: 361 LSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKISNSFSSGNDVLGNCSQGLNGSFDNGD 420
           LSARY+PKIHGLQQTLVSMLLK+QAAEGSATCKISNSFS GN VLGNCSQGLNGSFDNGD
Sbjct: 361 LSARYDPKIHGLQQTLVSMLLKRQAAEGSATCKISNSFSRGNVVLGNCSQGLNGSFDNGD 420

Query: 421 LSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEEAPCNTLITHESLHTKTLPENETRKR 480
           LSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEEAPCNTLITHESLHTKTLPENETRKR
Sbjct: 421 LSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEEAPCNTLITHESLHTKTLPENETRKR 480

Query: 481 VRRSSQMSLKSFFQTNSVISNVADSSNANSTINKADTSESNPIEIPRSDTHITDSGKYLE 540
           VRRSSQMSLKSFFQ NSVISNVADSSNANS+INKADTSESNPIEIPRSDTHITDSGKYLE
Sbjct: 481 VRRSSQMSLKSFFQKNSVISNVADSSNANSSINKADTSESNPIEIPRSDTHITDSGKYLE 540

Query: 541 ENPDQSHINASSVEREKSGVALLEWRRIQQVMQNSIPLCKRHKEPCVARVVKKQGPNNGR 600
           ENPDQSHINASSVEREKSGVALLEWRRIQQVMQNSIPLCKRHKEPCVARVVKKQGPNNGR
Sbjct: 541 ENPDQSHINASSVEREKSGVALLEWRRIQQVMQNSIPLCKRHKEPCVARVVKKQGPNNGR 600

Query: 601 RFYVCARAEGPASNPEANCGYFKWAVSKSRHN 633
           RFYVCARAEGPASNPEANCGYFKWAVSKSRHN
Sbjct: 601 RFYVCARAEGPASNPEANCGYFKWAVSKSRHN 618

BLAST of CmoCh07G004230 vs. NCBI nr
Match: XP_022926300.1 (DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita moschata])

HSP 1 Score: 1218.8 bits (3152), Expect = 0.0e+00
Identity = 597/597 (100.00%), Postives = 597/597 (100.00%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT
Sbjct: 18  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR
Sbjct: 78  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 137

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA
Sbjct: 138 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 197

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF
Sbjct: 198 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 257

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL
Sbjct: 258 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 317

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS
Sbjct: 318 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 377

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE
Sbjct: 378 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 437

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA
Sbjct: 438 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 497

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS
Sbjct: 498 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 557

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 633
           IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN
Sbjct: 558 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 614

BLAST of CmoCh07G004230 vs. NCBI nr
Match: KAG7026737.1 (DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1197.6 bits (3097), Expect = 0.0e+00
Identity = 587/597 (98.32%), Postives = 590/597 (98.83%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT
Sbjct: 18  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR
Sbjct: 78  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 137

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA
Sbjct: 138 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 197

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMD CDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDR DAYTCWPQSTGAEVF
Sbjct: 198 PTSMDRCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRKDAYTCWPQSTGAEVF 257

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL
Sbjct: 258 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 317

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPVYASLLEIPDTPQHS PSLSARY+PKIHGLQQTLVSMLLK+QAAEGSATCKIS
Sbjct: 318 EGSDHAPVYASLLEIPDTPQHSTPSLSARYDPKIHGLQQTLVSMLLKRQAAEGSATCKIS 377

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NSFS GN VLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE
Sbjct: 378 NSFSRGNVVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 437

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQ N VISNVADSSNANS+INKA
Sbjct: 438 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQKNPVISNVADSSNANSSINKA 497

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS
Sbjct: 498 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 557

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 633
           IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN
Sbjct: 558 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 614

BLAST of CmoCh07G004230 vs. NCBI nr
Match: XP_023517127.1 (DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1189.9 bits (3077), Expect = 0.0e+00
Identity = 583/597 (97.65%), Postives = 588/597 (98.49%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT
Sbjct: 18  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPA+AEGLEEFSKEELLKVDREGR
Sbjct: 78  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAIAEGLEEFSKEELLKVDREGR 137

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA
Sbjct: 138 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 197

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMD CDAGPDFENNEFRRWLRSLLVG GGHFTDIFR+KHPDR DAYTCWPQSTGAEVF
Sbjct: 198 PTSMDRCDAGPDFENNEFRRWLRSLLVGYGGHFTDIFRSKHPDRKDAYTCWPQSTGAEVF 257

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL
Sbjct: 258 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 317

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPVYASLLEIPDTPQHS PSLSARYNPKIHGLQQTLVSMLLK+QAAEGSATCKIS
Sbjct: 318 EGSDHAPVYASLLEIPDTPQHSTPSLSARYNPKIHGLQQTLVSMLLKRQAAEGSATCKIS 377

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NSFS GN VLGNCSQGLNGSFDNGDLSGLLPGESCSST+IETEDSLLKTEESSGGGYSEE
Sbjct: 378 NSFSRGNVVLGNCSQGLNGSFDNGDLSGLLPGESCSSTDIETEDSLLKTEESSGGGYSEE 437

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQ NSVISNVADSSNANS+ NK 
Sbjct: 438 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQKNSVISNVADSSNANSSTNKP 497

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS
Sbjct: 498 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 557

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 633
           IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN
Sbjct: 558 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 614

BLAST of CmoCh07G004230 vs. NCBI nr
Match: XP_023003867.1 (DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita maxima])

HSP 1 Score: 1167.5 bits (3019), Expect = 0.0e+00
Identity = 573/597 (95.98%), Postives = 581/597 (97.32%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT
Sbjct: 19  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 78

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 155
           FCRVKSAFSSNEVALPVRAEEGF+GLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR
Sbjct: 79  FCRVKSAFSSNEVALPVRAEEGFSGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDREGR 138

Query: 156 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNIA 215
           CIVTDHGHFVLFNIYGPRAQSDDTERVLFK+NFYNILQKRWEHLLRQGKRIFVVGDLNIA
Sbjct: 139 CIVTDHGHFVLFNIYGPRAQSDDTERVLFKTNFYNILQKRWEHLLRQGKRIFVVGDLNIA 198

Query: 216 PTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVF 275
           PTSMD CDAGPDFENNEFRRW+RSLLV CGGHFTDIFRAKHPDR DAYTCW QSTGAEVF
Sbjct: 199 PTSMDRCDAGPDFENNEFRRWMRSLLVRCGGHFTDIFRAKHPDRKDAYTCWSQSTGAEVF 258

Query: 276 NYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGNSFRWKGEQTVKL 335
           NYGTRIDHILCAGPCLHPDSN P HDIVNCHV+ECDILSQYKRWKDGNSFR KGEQTVKL
Sbjct: 259 NYGTRIDHILCAGPCLHPDSNFPGHDIVNCHVIECDILSQYKRWKDGNSFRRKGEQTVKL 318

Query: 336 EGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKIS 395
           EGSDHAPVYASLLEIPDTPQHS PSLSARYNPKIHGLQQTLVSMLLK+QAAEGSATCKIS
Sbjct: 319 EGSDHAPVYASLLEIPDTPQHSTPSLSARYNPKIHGLQQTLVSMLLKRQAAEGSATCKIS 378

Query: 396 NSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEE 455
           NSFS GN VLGNCSQGLNGSFDNGDLSGLLP ESCSSTNI+TEDSLLKTEESSGG YSEE
Sbjct: 379 NSFSRGNIVLGNCSQGLNGSFDNGDLSGLLPDESCSSTNIDTEDSLLKTEESSGGDYSEE 438

Query: 456 APCNTLITHESLHTKTLPENETRKRVRRSSQMSLKSFFQTNSVISNVADSSNANSTINKA 515
           APCNTLITHESLHTKTL ENETRKRVRRSSQMSLKSFFQ NSVISNVADSSNANS+INKA
Sbjct: 439 APCNTLITHESLHTKTLHENETRKRVRRSSQMSLKSFFQKNSVISNVADSSNANSSINKA 498

Query: 516 DTSESNPIEIPRSDTHITDSGKYLEENPDQSHINASSVEREKSGVALLEWRRIQQVMQNS 575
           DTSESNPIEIPRSDTHITDSGKY EENPDQSHINA SVEREKSGVALLEWRRIQ+VMQNS
Sbjct: 499 DTSESNPIEIPRSDTHITDSGKYFEENPDQSHINAFSVEREKSGVALLEWRRIQEVMQNS 558

Query: 576 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 633
           IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN
Sbjct: 559 IPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSRHN 615

BLAST of CmoCh07G004230 vs. TAIR 10
Match: AT4G36050.2 (endonuclease/exonuclease/phosphatase family protein )

HSP 1 Score: 613.2 bits (1580), Expect = 2.3e-175
Identity = 325/580 (56.03%), Postives = 400/580 (68.97%), Query Frame = 0

Query: 36  YGSLLKLLDSFDADIICFQETKLRRQELRADLIIADGYESFVSCTRTSEKGRTGYSGVAT 95
           + SLLKLLDSFDADIICFQETKLRRQEL ADL IADGYESF SCTRTSEKGRTGYSGVAT
Sbjct: 18  FDSLLKLLDSFDADIICFQETKLRRQELTADLAIADGYESFFSCTRTSEKGRTGYSGVAT 77

Query: 96  FCRVKSAFSSNEVALPVRAEEGFTGLLES-SHKGEGTMPAVAEGLEEFSKEELLKVDREG 155
           FCRVKSA SS E ALPV AEEG TGL+ S S  G+     VAEGLEE+ KEELL +D+EG
Sbjct: 78  FCRVKSASSSCETALPVTAEEGITGLVNSNSRGGKSETSTVAEGLEEYEKEELLMIDQEG 137

Query: 156 RCIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLNI 215
           RC++TDHGHFV+FN+YGPRA +DD +R+ FK  FY +L++RWE LLRQG+R+FVVGDLNI
Sbjct: 138 RCVITDHGHFVVFNVYGPRAVADDADRIEFKHRFYGVLERRWECLLRQGRRVFVVGDLNI 197

Query: 216 APTSMDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEV 275
           AP +MD C+AGPDFE NEFR+W RSLLV  GG F+D+FR+KHP+R DA+TCW  S+GAE 
Sbjct: 198 APFAMDRCEAGPDFEKNEFRKWFRSLLVERGGSFSDVFRSKHPERKDAFTCWSSSSGAEQ 257

Query: 276 FNYGTRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGN-SFRWKGEQTV 335
           FNYG+RIDHIL AG CLH D +   H  + CHV ECDIL++YKR+K+ N   RWKG    
Sbjct: 258 FNYGSRIDHILVAGSCLHQDEDKQGHSFLACHVKECDILTEYKRFKNENMPTRWKGGLVT 317

Query: 336 KLEGSDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCK 395
           K +GSDH PV+ S  ++PD P+HS P L++RY P I+G QQTLVS+  K++A E +   +
Sbjct: 318 KFKGSDHVPVFISFDDLPDIPEHSTPPLASRYLPMIYGFQQTLVSVFKKRRANEEAKAIE 377

Query: 396 ISNSFSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYS 455
           +S S S+ ++    C     G   N    G+   +SCS  N  T      TE  +     
Sbjct: 378 VSCSSSTQSNTSSICGDISTGPLRNCGSMGISLEKSCSFENKSTSG---VTEAETVAATG 437

Query: 456 EEAPCNTLITHESLHTKTLPENETRKRVRR--SSQMSLKSFFQTNSVISNVADSSNANST 515
                +  I   S+    +  +  RK+ R+  SSQ+SLKSFF TNS ++NV DSS+    
Sbjct: 438 SIDNLSDGIRASSVRALNISRDGDRKKARKIQSSQLSLKSFFTTNSKVNNVEDSSS---- 497

Query: 516 INKADTSESNPIEIPRSDTHITDSGKYLEE--NPDQSHINASSVEREKSGVALLEWRRIQ 575
            +   +S S+ +E   S T    SGK   E     Q      S  ++K+  AL+EW+RIQ
Sbjct: 498 -SYVSSSPSSQVE---SITEPNVSGKEDSEPTTSTQEQDQTGSSAKQKNDAALMEWQRIQ 557

Query: 576 QVMQNSIPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAE 610
            +MQNSIPLCK HKE CVARVVKK GP  GRRFYVC+RAE
Sbjct: 558 NLMQNSIPLCKGHKEACVARVVKKPGPTFGRRFYVCSRAE 586

BLAST of CmoCh07G004230 vs. TAIR 10
Match: AT4G36050.1 (endonuclease/exonuclease/phosphatase family protein )

HSP 1 Score: 373.2 bits (957), Expect = 4.0e-103
Identity = 206/417 (49.40%), Postives = 263/417 (63.07%), Query Frame = 0

Query: 219 MDHCDAGPDFENNEFRRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQSTGAEVFNYG 278
           MD C+AGPDFE NEFR+W RSLLV  GG F+D+FR+KHP+R DA+TCW  S+GAE FNYG
Sbjct: 1   MDRCEAGPDFEKNEFRKWFRSLLVERGGSFSDVFRSKHPERKDAFTCWSSSSGAEQFNYG 60

Query: 279 TRIDHILCAGPCLHPDSNLPSHDIVNCHVLECDILSQYKRWKDGN-SFRWKGEQTVKLEG 338
           +RIDHIL AG CLH D +   H  + CHV ECDIL++YKR+K+ N   RWKG    K +G
Sbjct: 61  SRIDHILVAGSCLHQDEDKQGHSFLACHVKECDILTEYKRFKNENMPTRWKGGLVTKFKG 120

Query: 339 SDHAPVYASLLEIPDTPQHSAPSLSARYNPKIHGLQQTLVSMLLKKQAAEGSATCKISNS 398
           SDH PV+ S  ++PD P+HS P L++RY P I+G QQTLVS+  K++A E +   ++S S
Sbjct: 121 SDHVPVFISFDDLPDIPEHSTPPLASRYLPMIYGFQQTLVSVFKKRRANEEAKAIEVSCS 180

Query: 399 FSSGNDVLGNCSQGLNGSFDNGDLSGLLPGESCSSTNIETEDSLLKTEESSGGGYSEEAP 458
            S+ ++    C     G   N    G+   +SCS  N  T      TE  +         
Sbjct: 181 SSTQSNTSSICGDISTGPLRNCGSMGISLEKSCSFENKSTSG---VTEAETVAATGSIDN 240

Query: 459 CNTLITHESLHTKTLPENETRKRVRR--SSQMSLKSFFQTNSVISNVADSSNANSTINKA 518
            +  I   S+    +  +  RK+ R+  SSQ+SLKSFF TNS ++NV DSS+     +  
Sbjct: 241 LSDGIRASSVRALNISRDGDRKKARKIQSSQLSLKSFFTTNSKVNNVEDSSS-----SYV 300

Query: 519 DTSESNPIEIPRSDTHITDSGKYLEE--NPDQSHINASSVEREKSGVALLEWRRIQQVMQ 578
            +S S+ +E   S T    SGK   E     Q      S  ++K+  AL+EW+RIQ +MQ
Sbjct: 301 SSSPSSQVE---SITEPNVSGKEDSEPTTSTQEQDQTGSSAKQKNDAALMEWQRIQNLMQ 360

Query: 579 NSIPLCKRHKEPCVARVVKKQGPNNGRRFYVCARAEGPASNPEANCGYFKWAVSKSR 631
           NSIPLCK HKE CVARVVKK GP  GRRFYVC+RAEGP+SNPEANCGYFKWA SK R
Sbjct: 361 NSIPLCKGHKEACVARVVKKPGPTFGRRFYVCSRAEGPSSNPEANCGYFKWASSKFR 406

BLAST of CmoCh07G004230 vs. TAIR 10
Match: AT2G41460.1 (apurinic endonuclease-redox protein )

HSP 1 Score: 73.9 bits (180), Expect = 5.0e-13
Identity = 71/256 (27.73%), Postives = 105/256 (41.02%), Query Frame = 0

Query: 38  SLLKLLDSFDADIICFQETKLR---RQELRADLIIADGYE-SFVSCTRTSEKGRTGYSGV 97
           S L+L    + DI+C QETKL+    +E++  LI  DGY+ SF SC+      + GYSG 
Sbjct: 296 SALQLAQRENFDILCLQETKLQVKDVEEIKKTLI--DGYDHSFWSCS----VSKLGYSGT 355

Query: 98  ATFCRVKSAFSSNEVALPVRAEEGFTGLLESSHKGEGTMPAVAEGLEEFSKEELLKVDRE 157
           A   R+K         L VR   G +G                              D E
Sbjct: 356 AIISRIK--------PLSVRYGTGLSG-----------------------------HDTE 415

Query: 158 GRCIVTDHGHFVLFNIYGPRAQSDDTERVLFKSNFYNILQKRWEHLLRQGKRIFVVGDLN 217
           GR +  +   F L N Y P +  D  +R+ ++   ++         L + K + + GDLN
Sbjct: 416 GRIVTAEFDSFYLINTYVPNS-GDGLKRLSYRIEEWDRTLSNHIKELEKSKPVVLTGDLN 475

Query: 218 IAPTSMDHCDAGPDFENNEF----RRWLRSLLVGCGGHFTDIFRAKHPDRNDAYTCWPQS 277
            A   +D  +   +  +  F    R+   + L+  G  F D FR +HP     YT W   
Sbjct: 476 CAHEEIDIFNPAGNKRSAGFTIEERQSFGANLLDKG--FVDTFRKQHPG-VVGYTYWGYR 504

Query: 278 TGAEVFNYGTRIDHIL 286
            G    N G R+D+ L
Sbjct: 536 HGGRKTNKGWRLDYFL 504

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
F4JNY03.2e-17456.03DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Arabidopsis thaliana OX=37... [more]
Q68G581.1e-5728.62DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Mus musculus OX=10090 GN=A... [more]
Q5E9N96.1e-5629.26DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Bos taurus OX=9913 GN=APEX... [more]
Q9UBZ42.0e-5428.64DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Homo sapiens OX=9606 GN=AP... [more]
P382073.5e-1925.85DNA-(apurinic or apyrimidinic site) endonuclease 2 OS=Saccharomyces cerevisiae (... [more]
Match NameE-valueIdentityDescription
A0A6J1EKQ40.0e+00100.00Apurinic-apyrimidinic endonuclease 2 OS=Cucurbita moschata OX=3662 GN=LOC1114334... [more]
A0A6J1KNT80.0e+0095.98DNA-(apurinic or apyrimidinic site) endonuclease OS=Cucurbita maxima OX=3661 GN=... [more]
A0A6J1BYG78.0e-30185.57Apurinic-apyrimidinic endonuclease 2 OS=Momordica charantia OX=3673 GN=LOC111005... [more]
A0A5A7SJ451.3e-29885.07DNA-(apurinic or apyrimidinic site) endonuclease OS=Cucumis melo var. makuwa OX=... [more]
A0A1S4DUC21.3e-29885.07DNA-(apurinic or apyrimidinic site) endonuclease OS=Cucumis melo OX=3656 GN=LOC1... [more]
Match NameE-valueIdentityDescription
KAG6594773.10.0e+0095.57DNA-(apurinic or apyrimidinic site) lyase 2, partial [Cucurbita argyrosperma sub... [more]
XP_022926300.10.0e+00100.00DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita moschata][more]
KAG7026737.10.0e+0098.32DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita argyrosperma subsp. argyr... [more]
XP_023517127.10.0e+0097.65DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita pepo subsp. pepo][more]
XP_023003867.10.0e+0095.98DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT4G36050.22.3e-17556.03endonuclease/exonuclease/phosphatase family protein [more]
AT4G36050.14.0e-10349.40endonuclease/exonuclease/phosphatase family protein [more]
AT2G41460.15.0e-1327.73apurinic endonuclease-redox protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 24..348
e-value: 2.4E-73
score: 249.6
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 35..347
IPR005135Endonuclease/exonuclease/phosphatasePFAMPF03372Exo_endo_phoscoord: 39..288
e-value: 5.7E-17
score: 62.1
IPR010666Zinc finger, GRF-typePFAMPF06839zf-GRFcoord: 577..625
e-value: 4.8E-14
score: 52.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 510..530
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 510..528
NoneNo IPR availablePANTHERPTHR22748:SF4DNA-(APURINIC OR APYRIMIDINIC SITE) LYASE 2coord: 31..622
IPR004808AP endonuclease 1PANTHERPTHR22748AP ENDONUCLEASEcoord: 31..622
IPR004808AP endonuclease 1PROSITEPS51435AP_NUCLEASE_F1_4coord: 20..349
score: 35.81757

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh07G004230.1CmoCh07G004230.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0080111 DNA demethylation
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0098506 polynucleotide 3' dephosphorylation
biological_process GO:0006281 DNA repair
cellular_component GO:0005634 nucleus
molecular_function GO:0003906 DNA-(apurinic or apyrimidinic site) endonuclease activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0008311 double-stranded DNA 3'-5' exodeoxyribonuclease activity
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0016829 lyase activity
molecular_function GO:0008081 phosphoric diester hydrolase activity
molecular_function GO:0046403 polynucleotide 3'-phosphatase activity
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004518 nuclease activity