Cp4.1LG06g01240 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG06g01240
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DNA-3-methyladenine glycosylase, putative
Location: Cp4.1LG06 : 616800 .. 618962 (+)
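
The location string can be split into its parts to derive the length of the gene span. The following is a minimal sketch, not part of the database record; the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG06 : 616800 .. 618962 (+)")

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(seqid, strand, span_length)
```

Under that assumption the gene spans 2163 bases on the plus strand of Cp4.1LG06.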
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
GTGTTGTCTCTGGAAAATTCGACGTTCGTTGCCGAGCTTTCTTTGTTTTGTTCCACACAGAATTCGATTCTGGAAGGGATTTTCGTTGACGGATATCTTTCACGAGCAATGTCTGTGGCTACGAAGCTCCATTCGCACGCTAAACCTGTTTTGGAGTCCCGAGAAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCGAAATGTAAACAGGAGACTTTGAAGAACTCAGAGAAGCAGAACAAGGCGCTTCCGGCGATTCCTGAATCGGTTATTCGAGACAATAGCTCCATCGGGAGCTCCTGCTCATCCGATTCTTTATTAAGCAGCTATTCGACCAAATTGTTGAATCCGAAAGTGAAGCCCTGCGATGTGAAGCCTGTGAAGGCTGTTGCTGCCGGAGGTGATCCAAACGTAACCTCAACGACGCCTAGTCTCTCGGTTCCGGGGAAACGCTGTGGTTGGATAACGCCTTATTCTGGTAAGCAAGCTCAAACTCTGCTTATTATTGACTATTGCTTATTTGAATCGAGGAAACAATGGATATGAGATGGACAGAAGACAATCGCACAATTTCAATTCGACATAATAATAGAGTTTATCTTAGTTCAATACTCATTTTCTACTTGCATTGAAATATTTTGATTATTGTATTGTAATGCAATCAAATATTTAACCTCTTCATATTTTATTGTTGCATGAAGGATTTTACTTACTATGTTATTTGAACATTACAATTCATTATTATATGTGAATTTAACTATTTTTAAAAATCAGTTACATTCACATGGATTATGAATCTCTGGTTATCCTGTAATGTATTGTGATGGTTTCTTGTTGTAGACCCACTTTACATCGCTTTCCATGACGAAGAATGGGGAGTCCCAGTTCATGACGACAGGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATTCTTTGCAAGAGAGATATATTTAGGTCTCTTCATTTTTTCCCAAAACTTGAATACAAATCAGTCCCTAAAAGAACATTGTTCTAAATTGTCTTCCATTTCTCTCTTCTTTGCAGGAAAGTTTTTAATGATTTTGACCCATCTACCATCGCTAAGTTCACACAGAATGAGTTTACGACACTAAAAGAAAATGGCATCCAGCTCCTATCTGAACCAAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTACTCAAGGTATTGAGTTTTGGTTAAGCTTTAGCACTTCCTATTTTTTGTTAAGCCAAAGCCAACCCACAAGAACAGGCGTTTGGTTGTCTCAACTTCCATCCCTTTCAAGCGGTTGAAAGCTGAATCTCTACGTTTCCATTGCCTTTTGACTTCTCCACAGATTCAACAGGAATTTGGTACCTTCAGCAACTATTGTTGGAGCTTTGTCAACAAGAAGCCTATAACAAACAGATTTCGATATGCCCGTCAAGTACCGGTAAAGACGCCGAAAGCAGAGTTCATGAGCAAGGATTTGCTAAGAAGAGGCTTTCGTTGTGTTGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTACCGGAATTGTTAACGATCACTTAGTCAATTGCTTCAGATATCAAGAGTGCGATGGTATGAAATTAAGAGTAGAAGGTCAGCGATCGGAGTTGCTCACCGGAGCTTTGGAGACTAGATCTTGACATGTTCATAGAAAAAAAAAAAAGGTTATCTCTAACTAATCTTAGAAATCAGTGTGATAAAATGGTTCTGATGATCTGATGCTGAGTAGTTTAAATTCTCCTTTGTTTAGAATTCTTAATTGTAATTTGTATTTACAGACGACAAAAACATGTTGGCTTCCTGTAACTATTTTTCAACCTTCAGAACACGCTTCGGTGGCAGAGCATAAGAACAATGGTATGTTTCTCTTACTCCAACATTTATAACGTTAGTAGTTTCCGTAAGATTTAAGGTTACGAGTCGCAGAATTCCAGGGTCGAATTCG
AATATTCGGACTCATTAGGCCTTAAGTCTCCTTGCAAAATTTAGCCCAATAGAAATGTGGGCCCAAGTTCCCGGGAAGGCCCAATGATTTCGTAGCCGTCGCAATTCCTCCTGCTTGAAACCCAACAAATAATTCATAAAAATTAACCAATAAAGCAATAAAA

mRNA sequence

GTGTTGTCTCTGGAAAATTCGACGTTCGTTGCCGAGCTTTCTTTGTTTTGTTCCACACAGAATTCGATTCTGGAAGGGATTTTCGTTGACGGATATCTTTCACGAGCAATGTCTGTGGCTACGAAGCTCCATTCGCACGCTAAACCTGTTTTGGAGTCCCGAGAAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCGAAATGTAAACAGGAGACTTTGAAGAACTCAGAGAAGCAGAACAAGGCGCTTCCGGCGATTCCTGAATCGGTTATTCGAGACAATAGCTCCATCGGGAGCTCCTGCTCATCCGATTCTTTATTAAGCAGCTATTCGACCAAATTGTTGAATCCGAAAGTGAAGCCCTGCGATGTGAAGCCTGTGAAGGCTGTTGCTGCCGGAGGTGATCCAAACGTAACCTCAACGACGCCTAGTCTCTCGGTTCCGGGGAAACGCTGTGGTTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTCCATGACGAAGAATGGGGAGTCCCAGTTCATGACGACAGGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATTCTTTGCAAGAGAGATATATTTAGGAAAGTTTTTAATGATTTTGACCCATCTACCATCGCTAAGTTCACACAGAATGAGTTTACGACACTAAAAGAAAATGGCATCCAGCTCCTATCTGAACCAAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTACTCAAGATTCAACAGGAATTTGGTACCTTCAGCAACTATTGTTGGAGCTTTGTCAACAAGAAGCCTATAACAAACAGATTTCGATATGCCCGTCAAGTACCGGTAAAGACGCCGAAAGCAGAGTTCATGAGCAAGGATTTGCTAAGAAGAGGCTTTCGTTGTGTTGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTACCGGAATTGTTAACGATCACTTAGTCAATTGCTTCAGATATCAAGAGTGCGATGGTATGAAATTAAGAGTAGAAGGTCAGCGATCGGAGTTGCTCACCGGAGCTTTGGAGACTAGATCTTGACATGTTCATAGAAAAAAAAAAAAGACGACAAAAACATGTTGGCTTCCTGTAACTATTTTTCAACCTTCAGAACACGCTTCGGTGGCAGAGCATAAGAACAATGGTATGTTTCTCTTACTCCAACATTTATAACGTTAGTAGTTTCCGTAAGATTTAAGGTTACGAGTCGCAGAATTCCAGGGTCGAATTCGAATATTCGGACTCATTAGGCCTTAAGTCTCCTTGCAAAATTTAGCCCAATAGAAATGTGGGCCCAAGTTCCCGGGAAGGCCCAATGATTTCGTAGCCGTCGCAATTCCTCCTGCTTGAAACCCAACAAATAATTCATAAAAATTAACCAATAAAGCAATAAAA

Coding sequence (CDS)

GTGTTGTCTCTGGAAAATTCGACGTTCGTTGCCGAGCTTTCTTTGTTTTGTTCCACACAGAATTCGATTCTGGAAGGGATTTTCGTTGACGGATATCTTTCACGAGCAATGTCTGTGGCTACGAAGCTCCATTCGCACGCTAAACCTGTTTTGGAGTCCCGAGAAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCGAAATGTAAACAGGAGACTTTGAAGAACTCAGAGAAGCAGAACAAGGCGCTTCCGGCGATTCCTGAATCGGTTATTCGAGACAATAGCTCCATCGGGAGCTCCTGCTCATCCGATTCTTTATTAAGCAGCTATTCGACCAAATTGTTGAATCCGAAAGTGAAGCCCTGCGATGTGAAGCCTGTGAAGGCTGTTGCTGCCGGAGGTGATCCAAACGTAACCTCAACGACGCCTAGTCTCTCGGTTCCGGGGAAACGCTGTGGTTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTCCATGACGAAGAATGGGGAGTCCCAGTTCATGACGACAGGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATTCTTTGCAAGAGAGATATATTTAGGAAAGTTTTTAATGATTTTGACCCATCTACCATCGCTAAGTTCACACAGAATGAGTTTACGACACTAAAAGAAAATGGCATCCAGCTCCTATCTGAACCAAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTACTCAAGATTCAACAGGAATTTGGTACCTTCAGCAACTATTGTTGGAGCTTTGTCAACAAGAAGCCTATAACAAACAGATTTCGATATGCCCGTCAAGTACCGGTAAAGACGCCGAAAGCAGAGTTCATGAGCAAGGATTTGCTAAGAAGAGGCTTTCGTTGTGTTGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTACCGGAATTGTTAACGATCACTTAGTCAATTGCTTCAGATATCAAGAGTGCGATGGTATGAAATTAAGAGTAGAAGGTCAGCGATCGGAGTTGCTCACCGGAGCTTTGGAGACTAGATCTTGA

Protein sequence

VLSLENSTFVAELSLFCSTQNSILEGIFVDGYLSRAMSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVIRDNSSIGSSCSSDSLLSSYSTKLLNPKVKPCDVKPVKAVAAGGDPNVTSTTPSLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKVFNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEGQRSELLTGALETRS
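
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: the function name `translate` is hypothetical, and it translates just the first 30 and last 24 bases of the CDS rather than the full sequence. Note that the listed protein starts with V because the first codon, GTG, is translated literally; the first methionine is residue 37, which is where the BLAST alignments below begin.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 24 bases of the CDS shown above.
cds_start = "GTGTTGTCTCTGGAAAATTCGACGTTCGTT"
cds_end = "GGAGCTTTGGAGACTAGATCTTGA"

print(translate(cds_start))  # VLSLENSTFV
print(translate(cds_end))    # GALETRS*
```

Both fragments agree with the start and end of the protein sequence, and the final TGA codon is the stop.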
BLAST of Cp4.1LG06g01240 vs. Swiss-Prot
Match: GUAA_HELHP (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) GN=guaA PE=3 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 1.1e-41
Identity = 81/192 (42.19%), Positives = 115/192 (59.90%), Query Frame = 1

Query: 155 RCGWITPYSDP---LYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFR 214
           RC W T   +    LY  +HD EWG P+H+D+KLFE LVL    A L+W  IL KR+ FR
Sbjct: 787 RCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVLEGFQAGLSWITILKKREAFR 846

Query: 215 KVFNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCW 274
             F+DFDP  +A + +++   L  N   + +  K+ A + NA   + +Q+EFG+F  Y W
Sbjct: 847 VAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAIINAKAFMAVQREFGSFDKYIW 906

Query: 275 SFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLV 334
            FV  KPI N F     +P  TP ++ ++KDL +RGF+ VG T +Y+ MQ  G+VNDHL 
Sbjct: 907 GFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFVGTTTMYAMMQSIGMVNDHLT 966

Query: 335 NCFRYQECDGMK 344
           +CF+     GM+
Sbjct: 967 SCFKCNSSLGMQ 978
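
The percentages in each HSP header are the match counts divided by the alignment length. As a hedged illustration (the helper `percent` is hypothetical and not part of the BLAST output), the figures for the hit above can be recomputed as follows:

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100.0 * matches / alignment_length:.2f}"

# Counts taken from the HSP header for GUAA_HELHP.
identical, similar, aligned_columns = 81, 115, 192

print(percent(identical, aligned_columns))  # 42.19
print(percent(similar, aligned_columns))    # 59.90
```

The alignment length (192 here) counts alignment columns, including gap positions, so it can differ from the number of query residues covered.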

BLAST of Cp4.1LG06g01240 vs. Swiss-Prot
Match: 3MG1_ECOLI (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 5.4e-38
Identity = 74/183 (40.44%), Positives = 112/183 (61.20%), Query Frame = 1

Query: 154 KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKV 213
           +RCGW++   DPLYIA+HD EWGVP  D +KLFE++ L    A L+W  +L KR+ +R  
Sbjct: 2   ERCGWVS--QDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRAC 61

Query: 214 FNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSF 273
           F+ FDP  +A   + +   L ++   +    K++AI+ NA   L+++Q    F ++ WSF
Sbjct: 62  FHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWSF 121

Query: 274 VNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNC 333
           VN +P   +     ++P  T  ++ +SK L +RGF+ VG T+ YSFMQ  G+VNDH+V C
Sbjct: 122 VNHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVGC 181

Query: 334 FRY 337
             Y
Sbjct: 182 CCY 182

BLAST of Cp4.1LG06g01240 vs. Swiss-Prot
Match: 3MGA_HAEIN (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=tag PE=3 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 2.5e-35
Identity = 76/179 (42.46%), Positives = 106/179 (59.22%), Query Frame = 1

Query: 155 RCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKVF 214
           RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W  +L KR+ +R+ F
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 215 NDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSFV 274
           + FDP  IAK T  +     +N   +    KL AIV+NA   L +++    FS++ WSFV
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 275 NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNC 334
           N KPI N     R VP KT  ++ +SK L +RGF  +G T  Y+FMQ  G+V+DHL +C
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of Cp4.1LG06g01240 vs. TrEMBL
Match: A0A0A0LG22_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G855280 PE=4 SV=1)

HSP 1 Score: 508.4 bits (1308), Expect = 6.9e-141
Identity = 252/319 (79.00%), Positives = 277/319 (86.83%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVIR 96
           MSVATKL SH KP LE R ILGPGGNRDRAP+ PKCK ETLK +EKQ+KALPAI ESVIR
Sbjct: 1   MSVATKLQSHVKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIR 60

Query: 97  DNSSIGSSCSSDSLLSSYSTKLLNPKVKPCDVKPVKAVAAGGDPNVTSTTPSLSVPGKRC 156
           DN S+GSSCSSDSL S+YS KLL P         VK V+AGGD N T+T+P+LS+PGKRC
Sbjct: 61  DNVSVGSSCSSDSLSSNYSAKLLKPYA-------VKPVSAGGDSNATTTSPALSLPGKRC 120

Query: 157 GWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKVFND 216
            WIT +SDPLYIAFHDEEWGVP+HDD+KLFELLVLSQALAELTWPLIL KRD+FRKV ND
Sbjct: 121 DWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDVFRKVLND 180

Query: 217 FDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSFVNK 276
           FDPS+IA+FT+NEFTTLK NGIQLLSEPKLRAIV+NANQVLKIQ+EFG+FSNYCWSFVNK
Sbjct: 181 FDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNK 240

Query: 277 KPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY 336
           KPI NR RY RQVPVKTPKAEFMSKD++RRGFRCVGPTVVYSFMQV GIVNDHLV+CFRY
Sbjct: 241 KPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRY 300

Query: 337 QEC-----DGMKLRVEGQR 351
           +EC     D  KLRVE +R
Sbjct: 301 EECDPKVKDDKKLRVEDKR 312

BLAST of Cp4.1LG06g01240 vs. TrEMBL
Match: W9RT44_9ROSA (Putative Glutamine amidotransferase OS=Morus notabilis GN=L484_021208 PE=4 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 1.5e-106
Identity = 207/338 (61.24%), Positives = 244/338 (72.19%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPA----IPE 96
           MSVATKLHS   PV E R ILGPGGNR R  E PK K E LK  + Q    P     +P+
Sbjct: 1   MSVATKLHS---PVCEPRAILGPGGNRVRVSEYPKRKGEALKKPQAQRTRKPTAVSEVPQ 60

Query: 97  SVIRDNSSIGSSCSSDS-----LLSSYSTKLLNPKVKPCDVKPVKAVAAGGDPNVTSTTP 156
           SV+R N S+ SSCSSDS     L  + S+K   P VK   +KPVK V  G +    +  P
Sbjct: 61  SVVRSNGSVDSSCSSDSSSSGSLAKTVSSKKTPPTVKRKGLKPVKVVPVGVE--AVAALP 120

Query: 157 SLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKR 216
            +  P KRC WITP SD +Y +FHDEEWGVP+HDDRKLFELLV SQALAELTWP IL KR
Sbjct: 121 KILGPPKRCDWITPNSDSIYTSFHDEEWGVPIHDDRKLFELLVFSQALAELTWPAILNKR 180

Query: 217 DIFRKVFNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFS 276
           +IFRK+F +FDPS+IA+F + +  +LK NG  LLSEPKLRAIVENA Q+LKIQQEFG+FS
Sbjct: 181 EIFRKLFENFDPSSIAQFNEKKLLSLKVNGNLLLSEPKLRAIVENAKQILKIQQEFGSFS 240

Query: 277 NYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVN 336
           NYCWSFVN KPI N FRY RQVPVK+PKA+ +SKD+++RGFRCVGPTV+YSFMQV GIVN
Sbjct: 241 NYCWSFVNDKPIKNGFRYGRQVPVKSPKADLISKDMMQRGFRCVGPTVIYSFMQVAGIVN 300

Query: 337 DHLVNCFRYQECD---GMKLRVEGQRSELLTGALETRS 363
           DHL++CFRY+EC       L+   + S +LT ALE  S
Sbjct: 301 DHLLSCFRYEECKINVEKDLKPRTEESAILTEALEKTS 333

BLAST of Cp4.1LG06g01240 vs. TrEMBL
Match: A5BIS5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_033233 PE=4 SV=1)

HSP 1 Score: 391.0 bits (1003), Expect = 1.6e-105
Identity = 203/339 (59.88%), Positives = 248/339 (73.16%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVIR 96
           MSVATKL S AKP+ E R +LGPGGNR R  E+ KCK+E LK  ++  K    +PE+VIR
Sbjct: 1   MSVATKLQSPAKPLSEGRVVLGPGGNRFRVSEEAKCKKEGLKKPQQHRKQSSEVPEAVIR 60

Query: 97  DNSSIGSSCSSDSLLSSYSTKLLNPK--VKPCDVKPVKAVAAGGDPNVTSTTPSLSVPGK 156
           +N S  SSCSSDS  S  S K++N +  VK   +KPVK V  G           + VP K
Sbjct: 61  NNLSFESSCSSDSSSSGSSVKMVNSRGRVKRNGLKPVKVVPHG-----------VEVPAK 120

Query: 157 RCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKVF 216
           RC WITP SDPLY +FHDEEWGVPVHDD+KLFELLVLSQALAEL+WP IL KRDIFRK+F
Sbjct: 121 RCDWITPNSDPLYTSFHDEEWGVPVHDDKKLFELLVLSQALAELSWPTILNKRDIFRKLF 180

Query: 217 NDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKI----------QQEFG 276
           ++FDPS+IAKFT  +  +LK +G  LLSEPKLRA++ENANQ+LK+           QEFG
Sbjct: 181 DNFDPSSIAKFTDKKLLSLKASGGTLLSEPKLRAVIENANQMLKVIKFITRCLWFSQEFG 240

Query: 277 TFSNYCWSFVNKKPITNRFRYARQVPVKTP-KAEFMSKDLLRRGFRCVGPTVVYSFMQVT 336
           +FSNYCWSF+N KP+ N FRYARQVPVKT  +   +SKDL++RGFRCVGPTV+YSFMQV 
Sbjct: 241 SFSNYCWSFINHKPMKNGFRYARQVPVKTQNQNNIISKDLMQRGFRCVGPTVIYSFMQVA 300

Query: 337 GIVNDHLVNCFRYQECDG---MKLRVEGQRSELLTGALE 360
           G+VNDHL+ CFR+QEC+      L+ + + +E+LT ALE
Sbjct: 301 GLVNDHLLTCFRFQECNSNIKKDLQAKTEETEVLTNALE 328

BLAST of Cp4.1LG06g01240 vs. TrEMBL
Match: A0A061FDE4_THECC (DNA-3-methyladenine glycosylase, putative OS=Theobroma cacao GN=TCM_033995 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 2.8e-105
Identity = 199/307 (64.82%), Positives = 234/307 (76.22%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVIR 96
           MSVATKL S   PV E R ILGP GNR R  ++ K K E  K  ++    +   P+SV++
Sbjct: 1   MSVATKLKSSPTPVTEPRAILGPTGNRVRVSDESKRKPEAQKKPQRPKFRVSKSPQSVVQ 60

Query: 97  DNSSIGSSCSSDSLLSSYSTKLLNPK--VKPCDVKPVKA-VAAGGDPNVTSTTPSLSVPG 156
            N S+ SSCSSDS  S+ S K ++ K  VK   VKPVKA VA   D  V   +P L  P 
Sbjct: 61  SNVSVDSSCSSDSSSSNSSVKTVSSKKTVKRIGVKPVKAKVAPTADEVVAEPSPVLPEPL 120

Query: 157 KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKV 216
           KRC WITP+SDPLY + HD+EWGVPVHDDRKLFELLV SQALAEL+WP IL KRDIFRK+
Sbjct: 121 KRCDWITPFSDPLYTSLHDKEWGVPVHDDRKLFELLVFSQALAELSWPTILNKRDIFRKL 180

Query: 217 FNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSF 276
           F++FDPS+IA+FT+ +  +LK NG  LLSEPKLRA+VENA Q+LK+QQEFG+FS+YCW F
Sbjct: 181 FDNFDPSSIAQFTEKKLLSLKVNGSLLLSEPKLRAVVENAKQMLKVQQEFGSFSSYCWGF 240

Query: 277 VNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNC 336
           VN KPI N FRY RQVPVKTPKAE +SKD+++RGFRCVGPTVVYSFMQV GIVNDHLV C
Sbjct: 241 VNHKPIRNGFRYVRQVPVKTPKAELISKDMMQRGFRCVGPTVVYSFMQVAGIVNDHLVTC 300

Query: 337 FRYQECD 341
           FRYQEC+
Sbjct: 301 FRYQECN 307

BLAST of Cp4.1LG06g01240 vs. TrEMBL
Match: F6H1G2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g12030 PE=4 SV=1)

HSP 1 Score: 385.2 bits (988), Expect = 8.9e-104
Identity = 199/329 (60.49%), Positives = 242/329 (73.56%), Query Frame = 1

Query: 36  AMSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVI 95
           AMSVATKL S AKP+ E R +LGPGGNR R  E+ K K+E LK  ++  K    +PE+V+
Sbjct: 2   AMSVATKLQSPAKPLSEGRVVLGPGGNRFRVSEEAKRKKEGLKKPQQHRKQSSEVPEAVV 61

Query: 96  RDNSSIGSSCSSDSLLSSYSTKLLNPK--VKPCDVKPVKAVAAGGDPNVTSTTPSLSVPG 155
           R+N S  SSCSSDS  S  S K++N +  VK   +KPVK V  G           + VP 
Sbjct: 62  RNNLSFESSCSSDSSSSGSSVKMVNSRGRVKRNGLKPVKVVPHG-----------VEVPA 121

Query: 156 KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKV 215
           KRC WITP SDPLY +FHDEEWGVPVHDD+KLFELLVLSQALAEL+WP IL KRDIFRK+
Sbjct: 122 KRCDWITPNSDPLYTSFHDEEWGVPVHDDKKLFELLVLSQALAELSWPTILNKRDIFRKL 181

Query: 216 FNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSF 275
           F++FDPS+IAKFT  +  +LK +G  LLSEPKLRA++ENANQ+LK+QQEFG+FSNYCWSF
Sbjct: 182 FDNFDPSSIAKFTDKKLLSLKASGGTLLSEPKLRAVIENANQMLKVQQEFGSFSNYCWSF 241

Query: 276 VNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNC 335
           +N KP+ N FRYARQVPVKT           +RGFRCVGPTV+YSFMQV G+VNDHL+ C
Sbjct: 242 INHKPMKNGFRYARQVPVKT-----------QRGFRCVGPTVIYSFMQVAGLVNDHLLTC 301

Query: 336 FRYQECDG---MKLRVEGQRSELLTGALE 360
           FR+QEC+      L+ + + +E+LT ALE
Sbjct: 302 FRFQECNSNIKKDLQAKTEETEVLTNALE 308

BLAST of Cp4.1LG06g01240 vs. TAIR10
Match: AT1G75090.1 (AT1G75090.1 DNA glycosylase superfamily protein)

HSP 1 Score: 320.1 bits (819), Expect = 1.8e-87
Identity = 166/306 (54.25%), Positives = 212/306 (69.28%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKAL--PAIPESV 96
           MS+ +KL S  KP+ ESR IL   GNR +  +    K+  L     ++ A   P    SV
Sbjct: 1   MSIVSKLRSPVKPIDESRAILCSTGNRFKVTKTEMTKKPQLNPRVTKSPATKKPDSNFSV 60

Query: 97  IRDNSSIGSSCSSDSLLSSYSTKLLNPKVKPCDVKPVKAVAAGGDPNVTSTTPSLSVPGK 156
             D+SS  SS S  S +++ ++  +    K   V+ +  V A     V   +P +  P K
Sbjct: 61  STDDSSSSSSSSERSSVNTTNSGKVTTPSKRNGVEKLNNVVASVAV-VEDISPKIPGPVK 120

Query: 157 RCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKVF 216
           RC WITP SDP+Y+ FHDEEWGVPV DD+KLFELLV SQALAE +WP IL +RD FRK+F
Sbjct: 121 RCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKLF 180

Query: 217 NDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSFV 276
            +FDPS IA+FT+    +L+ NG  +LSE KLRAIVENA  VLK++QEFG+FSNYCW FV
Sbjct: 181 EEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRFV 240

Query: 277 NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCF 336
           N KP+ N +RY RQVPVK+PKAE++SKD+++RGFRCVGPTV+YSF+Q +GIVNDHL  CF
Sbjct: 241 NHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTACF 300

Query: 337 RYQECD 341
           RYQEC+
Sbjct: 301 RYQECN 305

BLAST of Cp4.1LG06g01240 vs. TAIR10
Match: AT1G15970.1 (AT1G15970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 267.3 bits (682), Expect = 1.4e-71
Identity = 152/334 (45.51%), Positives = 200/334 (59.88%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNR-DRAP-----EKPKCKQETLKNSEKQNK--ALP 96
           MSV  +  S      E R +LGP GN+  R P     EKP  ++  + + +++ K    P
Sbjct: 1   MSVPPRFRSVNSDEREFRSVLGPTGNKLQRKPPGMKLEKPMMEKTIIDSKDEKAKKPTTP 60

Query: 97  AIPESVIRDNSSIGSSC---SSDSLLSSYSTKLLNPKVKPCDVKPVKAVAAGGDPNVT-- 156
           A P + ++  SS+ SS    +S S+ +SYS+   +     C+  P+   ++     V   
Sbjct: 61  ASPRTTLKQCSSLCSSILRKNSASMTASYSSDASSS----CESSPLSVASSSSCKKVVRR 120

Query: 157 ----STTPSLSVPG--------------KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKL 216
               S+T  LSV                KRC WITP +DP Y+AFHDEEWGVPVHDD+KL
Sbjct: 121 SGSVSSTRKLSVGKEEEKVSGDCFADGRKRCAWITPKADPCYVAFHDEEWGVPVHDDKKL 180

Query: 217 FELLVLSQALAELTWPLILCKRDIFRKVFNDFDPSTIAKFTQNEFTTLKENGIQLLSEPK 276
           FELL LS ALAEL+W  IL +R I R+VF DFDP  +A+    + T      I LLSE K
Sbjct: 181 FELLCLSGALAELSWTDILSRRHILREVFMDFDPVAVAELNDKKLTAPGTAAISLLSEVK 240

Query: 277 LRAIVENANQVLKIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLR 336
           +R+I++N+  V KI  E G+   Y W+FVN KP  ++FRY RQVPVKT KAEF+SKDL+R
Sbjct: 241 IRSILDNSRHVRKIIAECGSLKKYMWNFVNNKPTQSQFRYQRQVPVKTSKAEFISKDLVR 300

Query: 337 RGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQEC 340
           RGFR V PTV+YSFMQ  G+ NDHL+ CFRYQ+C
Sbjct: 301 RGFRSVSPTVIYSFMQAAGLTNDHLIGCFRYQDC 330

BLAST of Cp4.1LG06g01240 vs. TAIR10
Match: AT5G57970.1 (AT5G57970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 266.2 bits (679), Expect = 3.0e-71
Identity = 134/258 (51.94%), Positives = 168/258 (65.12%), Query Frame = 1

Query: 92  ESVIRDNSSIGSSCSSDSLLSSYSTKL----------LNPKVKPCDVKPVKAVAAGGDPN 151
           E  +  N S+ +S SSD+ + S+ ++           +  + K    KP   V+ G    
Sbjct: 86  EQNLNSNLSLNASFSSDASMDSFHSRASTGRLIRSYSVGSRSKSYPSKPRSVVSEGA--- 145

Query: 152 VTSTTPSLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWP 211
              + P+ S   KRC W+TP SDP YI FHDEEWGVPVHDD++LFELLVLS ALAE TWP
Sbjct: 146 -LDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWP 205

Query: 212 LILCKRDIFRKVFNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQ 271
            IL KR  FR+VF DFDP+ I K  + +          LLS+ KLRA++ENA Q+LK+ +
Sbjct: 206 TILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIE 265

Query: 272 EFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQ 331
           E+G+F  Y WSFV  K I ++FRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSFMQ
Sbjct: 266 EYGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQ 325

Query: 332 VTGIVNDHLVNCFRYQEC 340
             GI NDHL +CFR+  C
Sbjct: 326 AAGITNDHLTSCFRFHHC 339

BLAST of Cp4.1LG06g01240 vs. TAIR10
Match: AT1G80850.1 (AT1G80850.1 DNA glycosylase superfamily protein)

HSP 1 Score: 255.8 bits (652), Expect = 4.1e-68
Identity = 150/319 (47.02%), Positives = 190/319 (59.56%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNR------DRAPEKPKC-KQETLKNSEKQNKALPA 96
           MS   ++ S      E R +LGP GN+       +  +KP   K + L  +EK  +  P 
Sbjct: 1   MSAPPRVRSVDSSDREFRSVLGPAGNKLQQKPLSKPVKKPVAEKTKNLTFTEKMPQCSPL 60

Query: 97  IPESVIRDNSSIGSSCSSD-------SLLSSYSTKLLNPKVKPC-DVKPVKAVAAGGDPN 156
            P  + R+  S+ +S SSD       S LS  ST      ++    V    ++       
Sbjct: 61  SPPILRRNGISMTASYSSDASSSCESSPLSMTSTSSGKRVLRRSGSVSSSSSLRRNLTEE 120

Query: 157 VTSTTPSLSVPG-KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTW 216
                      G KRC WITP SD  YIAFHDEEWGVPVHDD++LFELL LS ALAEL+W
Sbjct: 121 RDEKASDCFCDGRKRCAWITPKSDQCYIAFHDEEWGVPVHDDKRLFELLSLSGALAELSW 180

Query: 217 PLILCKRDIFRKVFNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQ 276
             IL KR +FR+VF DFDP  I++ T  + T+ +     LLSE KLR+I+ENANQV KI 
Sbjct: 181 KDILSKRQLFREVFMDFDPIAISELTNKKITSPEIAATTLLSEQKLRSILENANQVCKII 240

Query: 277 QEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFM 336
             FG+F  Y W+FVN+KP  ++FRY RQVPVKT KAE +SKDL+RRGFR V PTV+YSFM
Sbjct: 241 GAFGSFDKYIWNFVNQKPTQSQFRYPRQVPVKTSKAELISKDLVRRGFRSVSPTVIYSFM 300

Query: 337 QVTGIVNDHLVNCFRYQEC 340
           Q  G+ NDHL  CFR+ +C
Sbjct: 301 QTAGLTNDHLTCCFRHHDC 319

BLAST of Cp4.1LG06g01240 vs. TAIR10
Match: AT1G13635.1 (AT1G13635.1 DNA glycosylase superfamily protein)

HSP 1 Score: 214.9 bits (546), Expect = 8.0e-56
Identity = 90/186 (48.39%), Positives = 133/186 (71.51%), Query Frame = 1

Query: 154 KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKV 213
           KRC WIT  SD +Y+ FHD++WGVPV+DD  LFE L +S  L +  W  IL +++ FR+ 
Sbjct: 115 KRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREA 174

Query: 214 FNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSF 273
           F +FDP+ +AK  + E   +  N   +L E ++R IV+NA  + K+  EFG+FS++ W F
Sbjct: 175 FCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGF 234

Query: 274 VNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNC 333
           ++ KPI N+F+Y+R VP+++PKAE +SKD+++RGFR VGP +V+SFMQ  G+  DHLV+C
Sbjct: 235 MDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDC 294

Query: 334 FRYQEC 340
           FR+ +C
Sbjct: 295 FRHGDC 300

BLAST of Cp4.1LG06g01240 vs. NCBI nr
Match: gi|449460123|ref|XP_004147795.1| (PREDICTED: uncharacterized protein LOC101206397 [Cucumis sativus])

HSP 1 Score: 508.4 bits (1308), Expect = 1.0e-140
Identity = 252/319 (79.00%), Positives = 277/319 (86.83%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVIR 96
           MSVATKL SH KP LE R ILGPGGNRDRAP+ PKCK ETLK +EKQ+KALPAI ESVIR
Sbjct: 1   MSVATKLQSHVKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIR 60

Query: 97  DNSSIGSSCSSDSLLSSYSTKLLNPKVKPCDVKPVKAVAAGGDPNVTSTTPSLSVPGKRC 156
           DN S+GSSCSSDSL S+YS KLL P         VK V+AGGD N T+T+P+LS+PGKRC
Sbjct: 61  DNVSVGSSCSSDSLSSNYSAKLLKPYA-------VKPVSAGGDSNATTTSPALSLPGKRC 120

Query: 157 GWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKVFND 216
            WIT +SDPLYIAFHDEEWGVP+HDD+KLFELLVLSQALAELTWPLIL KRD+FRKV ND
Sbjct: 121 DWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDVFRKVLND 180

Query: 217 FDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSFVNK 276
           FDPS+IA+FT+NEFTTLK NGIQLLSEPKLRAIV+NANQVLKIQ+EFG+FSNYCWSFVNK
Sbjct: 181 FDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNK 240

Query: 277 KPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY 336
           KPI NR RY RQVPVKTPKAEFMSKD++RRGFRCVGPTVVYSFMQV GIVNDHLV+CFRY
Sbjct: 241 KPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRY 300

Query: 337 QEC-----DGMKLRVEGQR 351
           +EC     D  KLRVE +R
Sbjct: 301 EECDPKVKDDKKLRVEDKR 312

BLAST of Cp4.1LG06g01240 vs. NCBI nr
Match: gi|659133198|ref|XP_008466607.1| (PREDICTED: uncharacterized protein LOC103503975 [Cucumis melo])

HSP 1 Score: 505.8 bits (1301), Expect = 6.5e-140
Identity = 253/320 (79.06%), Positives = 274/320 (85.62%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVIR 96
           MSVATKL SHAKP LE R ILGPGGNRDRAP  PKCK ETLK +EKQ+KALP I E VIR
Sbjct: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPHNPKCKPETLKKTEKQSKALPVISELVIR 60

Query: 97  DNSSIGSSCSSDSLLSSYSTKLLNPKVKPCDVKPVKAVAAGGDPNVTSTTPSLSVPGKRC 156
           DN S+GSSCSSDSL S+YS KLL P         VK V+AGGD + T+T+P+LS+PGKRC
Sbjct: 61  DNVSVGSSCSSDSLSSNYSVKLLKPYA-------VKPVSAGGDSSATTTSPALSLPGKRC 120

Query: 157 GWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKVFND 216
            WIT +SDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL KRDIFRKV ND
Sbjct: 121 DWITLHSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDIFRKVLND 180

Query: 217 FDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSFVNK 276
           FDPS+IA+F +NEFTTLK NGIQLLSEPKLRAIVENANQVLKIQ+EFG+FSNYCWSFVNK
Sbjct: 181 FDPSSIAQFKENEFTTLKVNGIQLLSEPKLRAIVENANQVLKIQKEFGSFSNYCWSFVNK 240

Query: 277 KPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY 336
           KPI NRFRY RQVPVKTPKAEFMSKD++RRGFRCVGPTVVYSFMQV GIVNDHL NCFRY
Sbjct: 241 KPIRNRFRYNRQVPVKTPKAEFMSKDMMRRGFRCVGPTVVYSFMQVAGIVNDHLANCFRY 300

Query: 337 QEC------DGMKLRVEGQR 351
           +EC      D  KLRVE +R
Sbjct: 301 EECDTTKIKDDKKLRVEDKR 313

BLAST of Cp4.1LG06g01240 vs. NCBI nr
Match: gi|703135970|ref|XP_010106031.1| (Putative Glutamine amidotransferase [Morus notabilis])

HSP 1 Score: 394.4 bits (1012), Expect = 2.1e-106
Identity = 207/338 (61.24%), Positives = 244/338 (72.19%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPA----IPE 96
           MSVATKLHS   PV E R ILGPGGNR R  E PK K E LK  + Q    P     +P+
Sbjct: 1   MSVATKLHS---PVCEPRAILGPGGNRVRVSEYPKRKGEALKKPQAQRTRKPTAVSEVPQ 60

Query: 97  SVIRDNSSIGSSCSSDS-----LLSSYSTKLLNPKVKPCDVKPVKAVAAGGDPNVTSTTP 156
           SV+R N S+ SSCSSDS     L  + S+K   P VK   +KPVK V  G +    +  P
Sbjct: 61  SVVRSNGSVDSSCSSDSSSSGSLAKTVSSKKTPPTVKRKGLKPVKVVPVGVE--AVAALP 120

Query: 157 SLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKR 216
            +  P KRC WITP SD +Y +FHDEEWGVP+HDDRKLFELLV SQALAELTWP IL KR
Sbjct: 121 KILGPPKRCDWITPNSDSIYTSFHDEEWGVPIHDDRKLFELLVFSQALAELTWPAILNKR 180

Query: 217 DIFRKVFNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFS 276
           +IFRK+F +FDPS+IA+F + +  +LK NG  LLSEPKLRAIVENA Q+LKIQQEFG+FS
Sbjct: 181 EIFRKLFENFDPSSIAQFNEKKLLSLKVNGNLLLSEPKLRAIVENAKQILKIQQEFGSFS 240

Query: 277 NYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVN 336
           NYCWSFVN KPI N FRY RQVPVK+PKA+ +SKD+++RGFRCVGPTV+YSFMQV GIVN
Sbjct: 241 NYCWSFVNDKPIKNGFRYGRQVPVKSPKADLISKDMMQRGFRCVGPTVIYSFMQVAGIVN 300

Query: 337 DHLVNCFRYQECD---GMKLRVEGQRSELLTGALETRS 363
           DHL++CFRY+EC       L+   + S +LT ALE  S
Sbjct: 301 DHLLSCFRYEECKINVEKDLKPRTEESAILTEALEKTS 333

BLAST of Cp4.1LG06g01240 vs. NCBI nr
Match: gi|147867293|emb|CAN83288.1| (hypothetical protein VITISV_033233 [Vitis vinifera])

HSP 1 Score: 391.0 bits (1003), Expect = 2.3e-105
Identity = 203/339 (59.88%), Positives = 248/339 (73.16%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVIR 96
           MSVATKL S AKP+ E R +LGPGGNR R  E+ KCK+E LK  ++  K    +PE+VIR
Sbjct: 1   MSVATKLQSPAKPLSEGRVVLGPGGNRFRVSEEAKCKKEGLKKPQQHRKQSSEVPEAVIR 60

Query: 97  DNSSIGSSCSSDSLLSSYSTKLLNPK--VKPCDVKPVKAVAAGGDPNVTSTTPSLSVPGK 156
           +N S  SSCSSDS  S  S K++N +  VK   +KPVK V  G           + VP K
Sbjct: 61  NNLSFESSCSSDSSSSGSSVKMVNSRGRVKRNGLKPVKVVPHG-----------VEVPAK 120

Query: 157 RCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKVF 216
           RC WITP SDPLY +FHDEEWGVPVHDD+KLFELLVLSQALAEL+WP IL KRDIFRK+F
Sbjct: 121 RCDWITPNSDPLYTSFHDEEWGVPVHDDKKLFELLVLSQALAELSWPTILNKRDIFRKLF 180

Query: 217 NDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKI----------QQEFG 276
           ++FDPS+IAKFT  +  +LK +G  LLSEPKLRA++ENANQ+LK+           QEFG
Sbjct: 181 DNFDPSSIAKFTDKKLLSLKASGGTLLSEPKLRAVIENANQMLKVIKFITRCLWFSQEFG 240

Query: 277 TFSNYCWSFVNKKPITNRFRYARQVPVKTP-KAEFMSKDLLRRGFRCVGPTVVYSFMQVT 336
           +FSNYCWSF+N KP+ N FRYARQVPVKT  +   +SKDL++RGFRCVGPTV+YSFMQV 
Sbjct: 241 SFSNYCWSFINHKPMKNGFRYARQVPVKTQNQNNIISKDLMQRGFRCVGPTVIYSFMQVA 300

Query: 337 GIVNDHLVNCFRYQECDG---MKLRVEGQRSELLTGALE 360
           G+VNDHL+ CFR+QEC+      L+ + + +E+LT ALE
Sbjct: 301 GLVNDHLLTCFRFQECNSNIKKDLQAKTEETEVLTNALE 328

BLAST of Cp4.1LG06g01240 vs. NCBI nr
Match: gi|590593103|ref|XP_007017467.1| (DNA-3-methyladenine glycosylase, putative [Theobroma cacao])

HSP 1 Score: 390.2 bits (1001), Expect = 4.0e-105
Identity = 199/307 (64.82%), Positives = 234/307 (76.22%), Query Frame = 1

Query: 37  MSVATKLHSHAKPVLESREILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVIR 96
           MSVATKL S   PV E R ILGP GNR R  ++ K K E  K  ++    +   P+SV++
Sbjct: 1   MSVATKLKSSPTPVTEPRAILGPTGNRVRVSDESKRKPEAQKKPQRPKFRVSKSPQSVVQ 60

Query: 97  DNSSIGSSCSSDSLLSSYSTKLLNPK--VKPCDVKPVKA-VAAGGDPNVTSTTPSLSVPG 156
            N S+ SSCSSDS  S+ S K ++ K  VK   VKPVKA VA   D  V   +P L  P 
Sbjct: 61  SNVSVDSSCSSDSSSSNSSVKTVSSKKTVKRIGVKPVKAKVAPTADEVVAEPSPVLPEPL 120

Query: 157 KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRKV 216
           KRC WITP+SDPLY + HD+EWGVPVHDDRKLFELLV SQALAEL+WP IL KRDIFRK+
Sbjct: 121 KRCDWITPFSDPLYTSLHDKEWGVPVHDDRKLFELLVFSQALAELSWPTILNKRDIFRKL 180

Query: 217 FNDFDPSTIAKFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKIQQEFGTFSNYCWSF 276
           F++FDPS+IA+FT+ +  +LK NG  LLSEPKLRA+VENA Q+LK+QQEFG+FS+YCW F
Sbjct: 181 FDNFDPSSIAQFTEKKLLSLKVNGSLLLSEPKLRAVVENAKQMLKVQQEFGSFSSYCWGF 240

Query: 277 VNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNC 336
           VN KPI N FRY RQVPVKTPKAE +SKD+++RGFRCVGPTVVYSFMQV GIVNDHLV C
Sbjct: 241 VNHKPIRNGFRYVRQVPVKTPKAELISKDMMQRGFRCVGPTVVYSFMQVAGIVNDHLVTC 300

Query: 337 FRYQECD 341
           FRYQEC+
Sbjct: 301 FRYQECN 307

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
GUAA_HELHP  1.1e-41  42.19         Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ...
3MG1_ECOLI  5.4e-38  40.44         DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 S...
3MGA_HAEIN  2.5e-35  42.46         DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D...

Match Name        E-value   Identity (%)  Description
A0A0A0LG22_CUCSA  6.9e-141  79.00         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G855280 PE=4 SV=1
W9RT44_9ROSA      1.5e-106  61.24         Putative Glutamine amidotransferase OS=Morus notabilis GN=L484_021208 PE=4 SV=1
A5BIS5_VITVI      1.6e-105  59.88         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_033233 PE=4 SV=1
A0A061FDE4_THECC  2.8e-105  64.82         DNA-3-methyladenine glycosylase, putative OS=Theobroma cacao GN=TCM_033995 PE=4 ...
F6H1G2_VITVI      8.9e-104  60.49         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g12030 PE=4 SV=...

Match Name   E-value  Identity (%)  Description
AT1G75090.1  1.8e-87  54.25         DNA glycosylase superfamily protein
AT1G15970.1  1.4e-71  45.51         DNA glycosylase superfamily protein
AT5G57970.1  3.0e-71  51.94         DNA glycosylase superfamily protein
AT1G80850.1  4.1e-68  47.02         DNA glycosylase superfamily protein
AT1G13635.1  8.0e-56  48.39         DNA glycosylase superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|449460123|ref|XP_004147795.1|  1.0e-140  79.00         PREDICTED: uncharacterized protein LOC101206397 [Cucumis sativus]
gi|659133198|ref|XP_008466607.1|  6.5e-140  79.06         PREDICTED: uncharacterized protein LOC103503975 [Cucumis melo]
gi|703135970|ref|XP_010106031.1|  2.1e-106  61.24         Putative Glutamine amidotransferase [Morus notabilis]
gi|147867293|emb|CAN83288.1|      2.3e-105  59.88         hypothetical protein VITISV_033233 [Vitis vinifera]
gi|590593103|ref|XP_007017467.1|  4.0e-105  64.82         DNA-3-methyladenine glycosylase, putative [Theobroma cacao]

The following terms have been associated with this gene:

Vocabulary: Biological Process
Term        Definition
GO:0006281  DNA repair
GO:0006284  base-excision repair

Vocabulary: Molecular Function
Term        Definition
GO:0003824  catalytic activity
GO:0008725  DNA-3-methyladenine glycosylase activity

Vocabulary: INTERPRO
Term       Definition
IPR011257  DNA_glycosylase
IPR005019  Adenine_glyco

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003674 molecular_function
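
The GO assignment rows can be grouped by category for downstream use. The sketch below is one possible way to do so and is not part of the record; the function name `group_go_terms` is hypothetical. It optionally skips the three ontology root terms (GO:0008150, GO:0005575, GO:0003674), which carry no gene-specific information.

```python
from collections import defaultdict

# The three Gene Ontology root terms.
ROOT_TERMS = {"GO:0008150", "GO:0005575", "GO:0003674"}

GO_ROWS = """\
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003674 molecular_function
"""

def group_go_terms(text, skip_roots=True):
    """Group 'category accession name' rows into {category: [(accession, name)]}."""
    groups = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # The term name may contain spaces, so split on the first two gaps only.
        category, accession, name = line.split(maxsplit=2)
        if skip_roots and accession in ROOT_TERMS:
            continue
        groups[category].append((accession, name))
    return dict(groups)

informative = group_go_terms(GO_ROWS)
for category, terms in informative.items():
    print(category, terms)
```

With the root terms skipped, no cellular_component entry remains, because the only term in that category is the root itself.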

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG06g01240.1  Cp4.1LG06g01240.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description            Source   Source Term        Source Description                     Alignment
IPR005019  Methyladenine glycosylase  PFAM     PF03352            Adenine_glyco                          coord: 163..335, score: 1.4
IPR011257  DNA glycosylase            GENE3D   G3DSA:1.10.340.30                                         coord: 154..337, score: 1.4
IPR011257  DNA glycosylase            unknown  SSF48150           DNA-glycosylase                        coord: 154..339, score: 2.51
None       No IPR available           PANTHER  PTHR31116          FAMILY NOT NAMED                       coord: 38..340, score: 6.8E
None       No IPR available           PANTHER  PTHR31116:SF5      3-METHYLADENINE GLYCOSYLASE I-RELATED  coord: 38..340, score: 6.8E
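
The domain coordinates above can be compared with the BLAST alignment ranges to see how much of a hit falls inside an annotated domain. The following is a small sketch under stated assumptions: the helper `overlap` is hypothetical, coordinates are treated as 1-based and inclusive, and the query range 154..337 is taken from the displayed alignment against 3MG1_ECOLI.

```python
def overlap(a, b):
    """Return the shared (start, end) of two inclusive intervals, or None."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return (start, end) if start <= end else None

pfam_domain = (163, 335)   # PF03352 (Adenine_glyco) coordinates from the table
ecoli_tag_hit = (154, 337)  # query range of the 3MG1_ECOLI alignment as displayed

shared = overlap(pfam_domain, ecoli_tag_hit)
shared_length = shared[1] - shared[0] + 1  # inclusive coordinates assumed
print(shared, shared_length)
```

Here the Pfam domain lies entirely within the range aligned to the E. coli protein, so the shared region is the domain itself (173 residues).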

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG06g01240  Cp4.1LG02g07420  Cucurbita pepo (Zucchini)  cpecpeB459