Cp4.1LG16g06960 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG16g06960
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DNA-3-methyladenine glycosylase, putative
Location: Cp4.1LG16 : 7207047 .. 7209325 (-)
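The location string above packs the linkage group, start, end, and strand into one field. A minimal sketch of pulling it apart is shown below; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings such as "Cp4.1LG16 : 7207047 .. 7209325 (-)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into sequence id, start, end, strand, length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

location = parse_location("Cp4.1LG16 : 7207047 .. 7209325 (-)")
print(location)
```

Under the inclusive-coordinate assumption, the span works out to 2279 bp on the minus strand of Cp4.1LG16.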
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGTGTCGTTCGGAGCAGGCCTTGGAAGCCACCTCTGTCGTCGTTGATTCCAAATTCACCGCCCGGCCCGTCCTTCAACCTACCGGCAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACCCCCTTCCGCCGCCGTCTCCCCTACCTCCCCCAAGTCCAAATCGCCGCGTCCTCCGGCCACCAAGCGAGCTAATGACACTAACCCCATGAACTCCAGCTCTGACAAGATTCTAATTCCGGCCGCTGCTCTGTCTCGCCCCAAGGCTGCCTTGGATAGGAAGAAATCAAAAAGCTTCAAATTGGCTGGAAATGGGAATGTTGTGATTTGTGATAATGTTGCAGGTGGTGGTGGATTTGAGGTTGCGTCCTTGAGCTACGCTTCTTCGTTGATTACTGACTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGAAGATCTAAATCTGCCCGGTTTGATAAAGTTGTTCCTATTGATTCTAAAATTAAACCCGCCGTTGAAGATCGGAGATGCAGCTTCATCACTCCCAATTCAGGTACCAATTATGTTTTCTTTTTAAATTTCCCCTGTTTATGATATTCATAAATTCAGTTTAATTCAATTCAATTATCTCTTTCCCTTTATAGATCCCATTTATGTTGCTTACCATGATGAAGAATGGGGCGTCCCTGTTCATGATGACAAGTGAGTTTCTCTCTCTCTCTCTCTCTCTCTCTCCGACTCTGTTTCTCTCTCTCTCTCTCTCTCCCTCTCCGACTCTGTTTCTCTCTCATTCTCTCTCTCTCCGACTCAGTTTCTCTCCCTCTCTCTCCGACTCTGTTTCTCTCTGTCTCTCTCTCCGACTCTGTTTCTCTCTCTCTCTCCGACTCTGTTTTTTTTCTCTCTCTCTCTCTCTCTCTTCGACTCTGTTTCTATCTCTTCTCCATCGGATTAAAGTCAAATTTAATATAATTAAATGAAGTTTACTACATAATCCATGAATTTGAGTACTGAATTCTGGAATTGAACACAGAATGCTGTTTGAATTGCTGGTTCTAAGCGTGGCCCAGGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGGTACTATAAACAACAATTTCCTTCAATTTATTCATTTGTGTTCTTATAAATTCACTAATTTAATTAATGTTCTTCGTTGCCCAGAAATGCATTTTCAAGTTTCGTTGCAGAAACGGTGGCCATTTTTTCCGACAAACAGATGCTATCAATCAGCTCGGAATACGGCATAGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCTATCCGAATCCTCGAGGTAAATGAGAATTAAATATTTCAAATCAAAACCCATTGACTTAGTGGTCAACCCCAATTAATTAATATTTTTAATTACAGATTAAGAAGGAGTTTCGATCATTCGACAAATACATTTGGGGGTTTGTGAACAACAAGCCGTTTTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAGACCATAAGCAAAGACATGATCAGACGAGGTTTCCGGTCGGTCGGACCAACCGTCCTCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTGACCAGCTGCCACAGGCATCTCCACTGCTCCATAACCGCCGCCGACCGTCGCGCTCCGGCGGTGGTAGTGGAGGAGACAACGACGGCGTCTGAAACTGTGTAGAATTGATTCGAGAATTTAATTAACAAGACAAAAAGAGAAAGTGGTAACCTTTACGAGGAGTCAGTCGATCAACGATGATTTGCTTGCTAATTAACTAGATAACCTTTTTTTTGTTTTTGTTTTTTTGTGGGGTTTGTGTATATTAATGTCTATATATAAATAGACTTGTAAGTGAGAGAGAACAAAAAATTAAAGAAAAAAAAAAAAAAGAGATTGTGNTGGTGGGGATTTTAGT
GAAAATGCTTTGTATGATTAGAAGAAGAAAAAAAAAGGAGACAGACAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGGGTCTGTCAGTTTGCTTTTTTGTAAATTCCCATGTGATCCATCCAAATTTCAAAACATTATTAATCATTATTATTATTATTTTGTATTTTATGTTTTTTTTTCATAACAGCCTGTTGCTTTGCTTTATTCACAAGGAGCAAATACCCAACTTTGCTTCCATCAAACCATCCCTCAATGTCTCTCATAGATCCACAT

mRNA sequence

ATGTGTCGTTCGGAGCAGGCCTTGGAAGCCACCTCTGTCGTCGTTGATTCCAAATTCACCGCCCGGCCCGTCCTTCAACCTACCGGCAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACCCCCTTCCGCCGCCGTCTCCCCTACCTCCCCCAAGTCCAAATCGCCGCGTCCTCCGGCCACCAAGCGAGCTAATGACACTAACCCCATGAACTCCAGCTCTGACAAGATTCTAATTCCGGCCGCTGCTCTGTCTCGCCCCAAGGCTGCCTTGGATAGGAAGAAATCAAAAAGCTTCAAATTGGCTGGAAATGGGAATGTTGTGATTTGTGATAATGTTGCAGGTGGTGGTGGATTTGAGGTTGCGTCCTTGAGCTACGCTTCTTCGTTGATTACTGACTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGAAGATCTAAATCTGCCCGGTTTGATAAAGTTGTTCCTATTGATTCTAAAATTAAACCCGCCGTTGAAGATCGGAGATGCAGCTTCATCACTCCCAATTCAGGTACCAATTATGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCATTTTCAAGTTTCGTTGCAGAAACGGTGGCCATTTTTTCCGACAAACAGATGCTATCAATCAGCTCGGAATACGGCATAGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCTATCCGAATCCTCGAGATTAAGAAGGAGTTTCGATCATTCGACAAATACATTTGGGGGTTTGTGAACAACAAGCCGTTTTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAGACCATAAGCAAAGACATGATCAGACGAGGTTTCCGGTCGGTCGGACCAACCGTCCTCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTGACCAGCTGCCACAGGCATCTCCACTGCTCCATAACCGCCGCCGACCGTCGCGCTCCGGCGGTGGTAGTGGAGGAGACAACGACGGCGTCTGAAACTGTGTAGAATTGATTCGAGAATTTAATTAACAAGACAAAAAGAGAAAGTGGTAACCTTTACGAGGAGTCAGTCGATCAACGATGATTTGCTTGCTAATTAACTAGATAACCTTTTTTTTGTTTTTGTTTTTTTGTGGGGTTTGTGTATATTAATGTCTATATATAAATAGACTTGTAAGTGAGAGAGAACAAAAAATTAAAGAAAAAAAAAAAAAAGAGATTGTGNTGGTGGGGATTTTAGTGAAAATGCTTTGTATGATTAGAAGAAGAAAAAAAAAGGAGACAGACAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGGGTCTGTCAGTTTGCTTTTTTGTAAATTCCCATGTGATCCATCCAAATTTCAAAACATTATTAATCATTATTATTATTATTTTGTATTTTATGTTTTTTTTTCATAACAGCCTGTTGCTTTGCTTTATTCACAAGGAGCAAATACCCAACTTTGCTTCCATCAAACCATCCCTCAATGTCTCTCATAGATCCACAT

Coding sequence (CDS)

ATGTGTCGTTCGGAGCAGGCCTTGGAAGCCACCTCTGTCGTCGTTGATTCCAAATTCACCGCCCGGCCCGTCCTTCAACCTACCGGCAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACCCCCTTCCGCCGCCGTCTCCCCTACCTCCCCCAAGTCCAAATCGCCGCGTCCTCCGGCCACCAAGCGAGCTAATGACACTAACCCCATGAACTCCAGCTCTGACAAGATTCTAATTCCGGCCGCTGCTCTGTCTCGCCCCAAGGCTGCCTTGGATAGGAAGAAATCAAAAAGCTTCAAATTGGCTGGAAATGGGAATGTTGTGATTTGTGATAATGTTGCAGGTGGTGGTGGATTTGAGGTTGCGTCCTTGAGCTACGCTTCTTCGTTGATTACTGACTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGAAGATCTAAATCTGCCCGGTTTGATAAAGTTGTTCCTATTGATTCTAAAATTAAACCCGCCGTTGAAGATCGGAGATGCAGCTTCATCACTCCCAATTCAGGTACCAATTATGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCATTTTCAAGTTTCGTTGCAGAAACGGTGGCCATTTTTTCCGACAAACAGATGCTATCAATCAGCTCGGAATACGGCATAGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCTATCCGAATCCTCGAGATTAAGAAGGAGTTTCGATCATTCGACAAATACATTTGGGGGTTTGTGAACAACAAGCCGTTTTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAGACCATAAGCAAAGACATGATCAGACGAGGTTTCCGGTCGGTCGGACCAACCGTCCTCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTGACCAGCTGCCACAGGCATCTCCACTGCTCCATAACCGCCGCCGACCGTCGCGCTCCGGCGGTGGTAGTGGAGGAGACAACGACGGCGTCTGAAACTGTGTAG

Protein sequence

MCRSEQALEATSVVVDSKFTARPVLQPTGNRVLDRRNSLKKPPSAAVSPTSPKSKSPRPPATKRANDTNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICDNVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKVVPIDSKIKPAVEDRRCSFITPNSGTNYVGSDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKEFRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQAAGLTNDHLTSCHRHLHCSITAADRRAPAVVVEETTTASETV
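The protein sequence above should follow from the coding sequence by codon-wise translation. The sketch below illustrates this on the first and last few codons of the CDS only; it assumes the standard genetic code (the record does not name a translation table), and the helper `translate` is a hypothetical name, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) become 'X'.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last four codons of the CDS listed above.
print(translate("ATGTGTCGTTCGGAGCAG"))
print(translate("GAAACTGTGTAG"))
```

The two fragments translate to the start (MCRSEQ) and end (ETV) of the listed protein; passing the full CDS string to the same function would be the natural consistency check.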
BLAST of Cp4.1LG16g06960 vs. Swiss-Prot
Match: GUAA_HELHP (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) GN=guaA PE=3 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.2e-24
Identity = 57/139 (41.01%), Positives = 84/139 (60.43%), Query Frame = 1

Query: 201 GSDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINR--VRGVVDNAIRI 260
           G  W +ILKKR+ FR AF  F    VA + + ++  +    GI  NR  +   + NA   
Sbjct: 832 GLSWITILKKREAFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAIINAKAF 891

Query: 261 LEIKKEFRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVL 320
           + +++EF SFDKYIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +
Sbjct: 892 MAVQREFGSFDKYIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFVGTTTM 951

Query: 321 HSFMQAAGLTNDHLTSCHR 338
           ++ MQ+ G+ NDHLTSC +
Sbjct: 952 YAMMQSIGMVNDHLTSCFK 970
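The percentages in each HSP summary line are the matched-column counts divided by the alignment length. A small sketch of recomputing them from the reported fractions follows; the function name `hsp_percentages` is hypothetical, and it relies only on the `count/length` pattern so that it does not depend on how the fields are labelled.

```python
import re

# Matches "57/139"-style fractions: matched columns / alignment length.
FRACTION_RE = re.compile(r"(\d+)/(\d+)")

def hsp_percentages(summary_line):
    """Return each count/length fraction on the line as a percentage,
    rounded to two decimals, in the order the fractions appear."""
    return [
        round(100 * int(count) / int(length), 2)
        for count, length in FRACTION_RE.findall(summary_line)
    ]

# Fractions reported for the GUAA_HELHP hit above: identity, then positives.
percentages = hsp_percentages("57/139 identical, 84/139 positive")
print(percentages)
```

For this hit the recomputed values agree with the 41.01% identity and 60.43% positives printed in the summary line.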

BLAST of Cp4.1LG16g06960 vs. Swiss-Prot
Match: 3MG1_ECOLI (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 1.5e-19
Identity = 48/137 (35.04%), Positives = 81/137 (59.12%), Query Frame = 1

Query: 201 GSDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINR--VRGVVDNAIRI 260
           G  W ++LKKR+++R  F  F    VA   ++ +  +  + GI  +R  ++ ++ NA   
Sbjct: 43  GLSWITVLKKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAY 102

Query: 261 LEIKKEFRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVL 320
           L++++    F  ++W FVN++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ 
Sbjct: 103 LQMEQNGEPFVDFVWSFVNHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTIC 162

Query: 321 HSFMQAAGLTNDHLTSC 336
           +SFMQA GL NDH+  C
Sbjct: 163 YSFMQACGLVNDHVVGC 179

BLAST of Cp4.1LG16g06960 vs. Swiss-Prot
Match: 3MGA_HAEIN (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=tag PE=3 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 7.0e-17
Identity = 46/137 (33.58%), Positives = 74/137 (54.01%), Query Frame = 1

Query: 201 GSDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVR--GVVDNAIRI 260
           G  W ++LKKR+ +R AF  F  + +A  +   + +     G+  +R +   +V NA   
Sbjct: 44  GLSWITVLKKRESYREAFHQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAY 103

Query: 261 LEIKKEFRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVL 320
           L ++K   +F  +IW FVN+KP          +P KT  S+ +SK + +RGF  +G T  
Sbjct: 104 LAMEKCGENFSDFIWSFVNHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTC 163

Query: 321 HSFMQAAGLTNDHLTSC 336
           ++FMQ+ GL +DHL  C
Sbjct: 164 YAFMQSMGLVDDHLNDC 180

BLAST of Cp4.1LG16g06960 vs. TrEMBL
Match: A0A0A0KED6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134890 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 5.3e-157
Identity = 315/406 (77.59%), Positives = 335/406 (82.51%), Query Frame = 1

Query: 1   MCRSEQALEATSVVVDSKFTARPVLQPTGNRVLDRRNSLKK------PPSAA-VSPTSPK 60
           MCRSE+ LEATSVVVDSKF +RPVLQPTGNRVLDRRNSLKK      PPSAA VSPTSPK
Sbjct: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60

Query: 61  SKSPRPPATKRAND-TNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICD 120
           SKSPRPPATKRAND  NPMNSSS+KILIPAA +SRP+A LDRKKSKSFKL GNGNV ICD
Sbjct: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAA-VSRPRATLDRKKSKSFKLGGNGNV-ICD 120

Query: 121 NVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKV 180
           N    GGFEVA   YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+
Sbjct: 121 N----GGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKI 180

Query: 181 VPIDSKIKPAVEDRRCSFITPNSGTNY----------------------------VGSDW 240
           VP+DSKIKPAVEDRRCSFITPNS   Y                            VGSDW
Sbjct: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240

Query: 241 TSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKE 300
           TSILKKRQDFRNAFSSF +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL+IKKE
Sbjct: 241 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKE 300

Query: 301 FRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQA 360
           F SFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTV+HSFMQA
Sbjct: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 360

Query: 361 AGLTNDHLTSCHRHLHCSITAADRRAPAVV-----VEETTTASETV 366
           AGLTNDHLT+CHRHLHC++ AA RR PA       VE+T    ET+
Sbjct: 361 AGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL 397

BLAST of Cp4.1LG16g06960 vs. TrEMBL
Match: M5X1J5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006139mg PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 4.6e-108
Identity = 245/423 (57.92%), Positives = 283/423 (66.90%), Query Frame = 1

Query: 1   MCRSEQ----ALEATSVVVDSKFTARPVLQPTGNRV--LDRRNSLKK------------P 60
           MC S+      +E T +V  ++   RPVLQPT NRV  LDRRNS+KK            P
Sbjct: 1   MCSSKAKVTIGVEVTPMV--ARINGRPVLQPTCNRVPSLDRRNSIKKISTPRAPPPPPLP 60

Query: 61  PSAAVS---------------PTSPKSKSPRPPATKRANDTNPMNSSSDKILIPAAALSR 120
            S+A S               P SPKSKSPRPPA KR ND N +NSSS+K++ P    +R
Sbjct: 61  TSSASSTSPRISNKASSLLTPPISPKSKSPRPPAIKRGNDPNGLNSSSEKVVTPGGT-TR 120

Query: 121 PKAALDRKKSKSFKLAGNGNVVICDNVAGGGGFEV----------ASLSYASSLITDSPG 180
            K  L+RKKSKSFK A  G      ++   G F            ASLSY+SSLIT++PG
Sbjct: 121 AKI-LERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLITEAPG 180

Query: 181 SIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKVVPIDS----KIKPAVEDRRCSFITP 240
           SIAAVRREQ+ALQ AQRKMRIAHYGRSKSA F++VVP+D+    + K A E++RCSFIT 
Sbjct: 181 SIAAVRREQMALQHAQRKMRIAHYGRSKSANFERVVPVDASGNIEAKGAEEEKRCSFITA 240

Query: 241 NSGTNY----------------------------VGSDWTSILKKRQDFRNAFSSFVAET 300
           NS   Y                            VGSDWTSILKKRQDFRNAFS F AE 
Sbjct: 241 NSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRNAFSDFDAEI 300

Query: 301 VAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKEFRSFDKYIWGFVNNKPFSPQY 349
           VA F+DKQM+SI SEYGIDI+RVRGVVDN+ RILEIKKEF SFDKYIWGFVN KP SPQY
Sbjct: 301 VANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQKPISPQY 360

BLAST of Cp4.1LG16g06960 vs. TrEMBL
Match: I1MJF6_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_15G265200 PE=4 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.0e-107
Identity = 235/385 (61.04%), Positives = 270/385 (70.13%), Query Frame = 1

Query: 17  SKFTARPVLQPTGNRV--LDRRNSLKK--------PPS-------AAVSPTSPKSKSPRP 76
           ++   RPVLQPT NRV  L+RRNS+KK        PPS       +   P SPKSKSPR 
Sbjct: 29  ARINGRPVLQPTCNRVPNLERRNSIKKVAPAKSLSPPSPPLPSKTSLTPPVSPKSKSPRL 88

Query: 77  PATKRANDTNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICDNVAGGGG 136
           PATKR ND N +NSS +KI+IP +++  P   L+RKKSKSFK    G+ V          
Sbjct: 89  PATKRGNDNNGLNSSYEKIVIPRSSIKTP--TLERKKSKSFK---EGSCV-------SAS 148

Query: 137 FEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKVVPIDSK- 196
            E ASLSY+SSLITDSPGSIAAVRREQ+ALQQAQRKM+IAHYGRSKSA+F++VVP+D   
Sbjct: 149 IE-ASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFERVVPLDPSN 208

Query: 197 ----IKPAVEDRRCSFITPNSGTNY----------------------------VGSDWTS 256
                KP  E++RCSFIT NS   Y                            VGSDWTS
Sbjct: 209 TSLASKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTS 268

Query: 257 ILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKEFR 316
            LKKR DFR AFS F AETVA  +DKQM+SISSEYGIDI+RVRGVVDNA +ILEIKK+F 
Sbjct: 269 TLKKRLDFRAAFSEFDAETVANLTDKQMMSISSEYGIDISRVRGVVDNANQILEIKKDFG 328

Query: 317 SFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQAAG 352
           SFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDM+RRGFR VGPTV+HSFMQA+G
Sbjct: 329 SFDKYIWGFVNHKPLSTQYKFGHKIPVKTSKSESISKDMVRRGFRYVGPTVVHSFMQASG 388

BLAST of Cp4.1LG16g06960 vs. TrEMBL
Match: V7BVU6_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_005G045900g PE=4 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.0e-107
Identity = 237/392 (60.46%), Positives = 270/392 (68.88%), Query Frame = 1

Query: 7   ALEATSVVVDS--KFTARPVLQPTGNRV--LDRRNSLKK--------PPSAAVS------ 66
           A   TS V+ S  +   RPVLQPT NRV  L+RRNS+KK        PPS  +S      
Sbjct: 18  AATTTSTVMPSVARINGRPVLQPTCNRVPNLERRNSIKKVQPPKSLSPPSPPLSSKTSLT 77

Query: 67  -PTSPKSKSPRPPATKRANDTNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGN 126
            P SPKSKSPR PA KR ND N +N+S +KI IP ++   P   L+RKKSKSFK    G+
Sbjct: 78  PPVSPKSKSPRLPAVKRGNDNNGLNTSYEKIAIPKSSSKAP--TLERKKSKSFK---EGS 137

Query: 127 VVICDNVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA 186
                  A        S SYASSLITDSPGSIAAVRREQ+ALQQAQRKM+IAHYGRSKSA
Sbjct: 138 CAPASTEA--------SFSYASSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSA 197

Query: 187 RFDKVVPIDSKI-----KPAVEDRRCSFITPNSGTNY----------------------- 246
           +F++VVP+D        KP  E++RCSFIT NS   Y                       
Sbjct: 198 KFERVVPLDPSTTTLTSKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLV 257

Query: 247 -----VGSDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDN 306
                VGSDWTS LKKRQDFR AFS F AETVA  +DKQM+SISSEYGIDI+RVRGVVDN
Sbjct: 258 LSGAQVGSDWTSTLKKRQDFRAAFSDFDAETVANLTDKQMMSISSEYGIDISRVRGVVDN 317

Query: 307 AIRILEIKKEFRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVG 347
           A +ILEIKK+F SFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDM+RRG+R VG
Sbjct: 318 ANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGYRFVG 377

BLAST of Cp4.1LG16g06960 vs. TrEMBL
Match: A0A0B2QMF8_GLYSO (Putative GMP synthase [glutamine-hydrolyzing] OS=Glycine soja GN=glysoja_026565 PE=4 SV=1)

HSP 1 Score: 397.1 bits (1019), Expect = 2.3e-107
Identity = 240/401 (59.85%), Positives = 276/401 (68.83%), Query Frame = 1

Query: 1   MCRSEQALEATSVVVDSK-----FTARPVLQPTGNRV--LDRRNSLKK--------PPS- 60
           MC S+   + T+VV  +K        RPVLQPT NRV  L+RRNS+KK        PPS 
Sbjct: 1   MCSSKT--KVTAVVAAAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPPKSLSPPSP 60

Query: 61  ------AAVSPTSPKSKSPRPPATKRANDTNPMNSSSDKILIPAAALSRPKAALDRKKSK 120
                 +   P SPK KSPR PATKR ND N +NSS +KI+IP ++   P   L+RKKSK
Sbjct: 61  PLPSKTSLTPPVSPKLKSPRLPATKRGNDNNGLNSSYEKIVIPRSSTKTP--TLERKKSK 120

Query: 121 SFKLAGNGNVVICDNVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRI 180
           SFK    G+ V           E ASLSY+SSLITDSPGSIAAVRREQ+ALQQAQRKM+I
Sbjct: 121 SFK---EGSCV-------SASIE-ASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKI 180

Query: 181 AHYGRSKSARFDKVVPIDSK-----IKPAVEDRRCSFITPNSGTNY-------------- 240
           AHYGRSKSA+F++VVP+D        KP  E++RCSFITPNS   Y              
Sbjct: 181 AHYGRSKSAKFERVVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIAYHDEEWGVPVHD 240

Query: 241 --------------VGSDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDI 300
                         VGSDWTS LKKR DFR AFS F AETVA  +DKQM+SISSEYGIDI
Sbjct: 241 DKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMSISSEYGIDI 300

Query: 301 NRVRGVVDNAIRILEIKKEFRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM 347
           +RVRGVVDNA +ILEIKK+F SFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDM
Sbjct: 301 SRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDM 360

BLAST of Cp4.1LG16g06960 vs. TAIR10
Match: AT3G12710.1 (AT3G12710.1 DNA glycosylase superfamily protein)

HSP 1 Score: 265.8 bits (678), Expect = 4.0e-71
Identity = 158/320 (49.38%), Positives = 200/320 (62.50%), Query Frame = 1

Query: 59  PPATKRANDTNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICDNVAGGG 118
           PP++  +      +   D ++   AA  + + +L+RKKSKSFK                 
Sbjct: 16  PPSSCNSLMDRSESLKRDSVMGNGAA--KVRGSLERKKSKSFKEGD-------------- 75

Query: 119 GFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---RFDKVVPI 178
                  SY+S LIT++PGSIAAVRREQVA QQA RK++IAHYGRSKS       KVVP+
Sbjct: 76  -------SYSSWLITEAPGSIAAVRREQVAAQQALRKLKIAHYGRSKSTINFTSSKVVPL 135

Query: 179 DSKIKPAVEDRRCSFITPNSGTNY----------------------------VGSDWTSI 238
            +   P    +RCSF+TP S   Y                            VGSDWTS 
Sbjct: 136 LNP-NPNPHPQRCSFLTPTSDPIYVAYHDEEWGVPVHDDKTLFELLTLSGAQVGSDWTST 195

Query: 239 LKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKEFRS 298
           L+KR D+R AF  F AE VA  ++K+M +IS EY I++++VRGVV+NA +I+EIKK F S
Sbjct: 196 LRKRHDYRKAFMEFEAEVVAKLTEKEMNAISIEYKIEMSKVRGVVENAKKIVEIKKAFVS 255

Query: 299 FDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQAAGL 348
            +KY+WGFVN+KP S  YK GHKIPVKTSKSE+ISKDM+RRGFR VGPTV+HSFMQAAGL
Sbjct: 256 LEKYLWGFVNHKPISTNYKLGHKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQAAGL 311

BLAST of Cp4.1LG16g06960 vs. TAIR10
Match: AT5G44680.1 (AT5G44680.1 DNA glycosylase superfamily protein)

HSP 1 Score: 258.5 bits (659), Expect = 6.4e-69
Identity = 163/361 (45.15%), Positives = 221/361 (61.22%), Query Frame = 1

Query: 17  SKFTARPVLQPTGNRV--LDRRNSLKKPPSAAVSPTSPKSKSPRPPATKRANDTNPMNSS 76
           S+   RPVLQP  N+V  LDRRNSLKK P   ++P + K  SPRP +      + P++ +
Sbjct: 15  SQINGRPVLQPKSNQVPTLDRRNSLKKSPPKPLNPIASKIPSPRPISLI----SPPLSPN 74

Query: 77  SDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICDNVAGGGGFEVASLSYASSLITD 136
           +  +  PA +    K  L    +KS         VI    + GG  EV  +     ++  
Sbjct: 75  TKSLRKPAGSC---KELLRSSSTKS-------KPVISPENSDGGYKEVMPMV----IVQK 134

Query: 137 SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-DKVVPIDSKIKPAVEDRRCSFITP 196
            PGSIAA RRE+VA++Q +RK +I+HYGR KS +  +K + ++ + K     +RCSFIT 
Sbjct: 135 QPGSIAAARREEVAMKQEERKKKISHYGRIKSVKSNEKNLNVEHEKK-----KRCSFITT 194

Query: 197 NSGTNY----------------------------VGSDWTSILKKRQDFRNAFSSFVAET 256
           +S   Y                            VGSDWTS+LK+R  FR AFS F AE 
Sbjct: 195 SSDPIYVAYHDKEWGVPVHDDNLLFELLVLTGAQVGSDWTSVLKRRNTFREAFSGFEAEL 254

Query: 257 VAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKEFRSFDKYIWGFVNNKPFSPQY 316
           VA F++K++ SI ++YGI++++V  VVDNA +IL++K++  SF+KYIWGF+ +KP + +Y
Sbjct: 255 VADFNEKKIQSIVNDYGINLSQVLAVVDNAKQILKVKRDLGSFNKYIWGFMKHKPVTTKY 314

Query: 317 KSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQAAGLTNDHLTSCHRHLHCSITA 347
            S  KIPVKTSKSETISKDM+RRGFR VGPTV+HS MQAAGLTNDHL +C RHL C+  A
Sbjct: 315 TSCQKIPVKTSKSETISKDMVRRGFRFVGPTVIHSLMQAAGLTNDHLITCPRHLECTAMA 352

BLAST of Cp4.1LG16g06960 vs. TAIR10
Match: AT5G57970.1 (AT5G57970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 148.7 bits (374), Expect = 7.1e-36
Identity = 77/197 (39.09%), Positives = 114/197 (57.87%), Query Frame = 1

Query: 175 IDSKIKPAVEDRRCSFITPNSGTNYV----------------------------GSDWTS 234
           +DS    +   +RC+++TPNS   Y+                               W +
Sbjct: 143 LDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPT 202

Query: 235 ILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDIN--RVRGVVDNAIRILEIKKE 294
           IL KRQ FR  F+ F    +   ++K+++   S     ++  ++R V++NA +IL++ +E
Sbjct: 203 ILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEE 262

Query: 295 FRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQA 342
           + SFDKYIW FV NK    +++   ++P KT K+E ISKD++RRGFRSVGPTV++SFMQA
Sbjct: 263 YGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQA 322

BLAST of Cp4.1LG16g06960 vs. TAIR10
Match: AT1G75090.1 (AT1G75090.1 DNA glycosylase superfamily protein)

HSP 1 Score: 147.9 bits (372), Expect = 1.2e-35
Identity = 74/188 (39.36%), Positives = 112/188 (59.57%), Query Frame = 1

Query: 186 RRCSFITPNSGTNYV----------------------------GSDWTSILKKRQDFRNA 245
           +RC +ITPNS   YV                               W SIL++R DFR  
Sbjct: 119 KRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKL 178

Query: 246 FSSFVAETVAIFSDKQMLSISSEYGIDI--NRVRGVVDNAIRILEIKKEFRSFDKYIWGF 305
           F  F    +A F++K+++S+     + +   ++R +V+NA  +L++K+EF SF  Y W F
Sbjct: 179 FEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRF 238

Query: 306 VNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQAAGLTNDHLTSC 344
           VN+KP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT+C
Sbjct: 239 VNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTAC 298

BLAST of Cp4.1LG16g06960 vs. TAIR10
Match: AT1G80850.1 (AT1G80850.1 DNA glycosylase superfamily protein)

HSP 1 Score: 142.5 bits (358), Expect = 5.1e-34
Identity = 74/163 (45.40%), Positives = 103/163 (63.19%), Query Frame = 1

Query: 181 PAVEDRRCSFITPNSGTNYVGSDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLS--IS 240
           P  +D+R   +   SG       W  IL KRQ FR  F  F    ++  ++K++ S  I+
Sbjct: 158 PVHDDKRLFELLSLSGA-LAELSWKDILSKRQLFREVFMDFDPIAISELTNKKITSPEIA 217

Query: 241 SEYGIDINRVRGVVDNAIRILEIKKEFRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKS 300
           +   +   ++R +++NA ++ +I   F SFDKYIW FVN KP   Q++   ++PVKTSK+
Sbjct: 218 ATTLLSEQKLRSILENANQVCKIIGAFGSFDKYIWNFVNQKPTQSQFRYPRQVPVKTSKA 277

Query: 301 ETISKDMIRRGFRSVGPTVLHSFMQAAGLTNDHLTSCHRHLHC 342
           E ISKD++RRGFRSV PTV++SFMQ AGLTNDHLT C RH  C
Sbjct: 278 ELISKDLVRRGFRSVSPTVIYSFMQTAGLTNDHLTCCFRHHDC 319

BLAST of Cp4.1LG16g06960 vs. NCBI nr
Match: gi|778713005|ref|XP_004139917.2| (PREDICTED: uncharacterized protein LOC101218536 [Cucumis sativus])

HSP 1 Score: 562.0 bits (1447), Expect = 7.7e-157
Identity = 315/406 (77.59%), Positives = 335/406 (82.51%), Query Frame = 1

Query: 1   MCRSEQALEATSVVVDSKFTARPVLQPTGNRVLDRRNSLKK------PPSAA-VSPTSPK 60
           MCRSE+ LEATSVVVDSKF +RPVLQPTGNRVLDRRNSLKK      PPSAA VSPTSPK
Sbjct: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60

Query: 61  SKSPRPPATKRAND-TNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICD 120
           SKSPRPPATKRAND  NPMNSSS+KILIPAA +SRP+A LDRKKSKSFKL GNGNV ICD
Sbjct: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAA-VSRPRATLDRKKSKSFKLGGNGNV-ICD 120

Query: 121 NVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKV 180
           N    GGFEVA   YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+
Sbjct: 121 N----GGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKI 180

Query: 181 VPIDSKIKPAVEDRRCSFITPNSGTNY----------------------------VGSDW 240
           VP+DSKIKPAVEDRRCSFITPNS   Y                            VGSDW
Sbjct: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240

Query: 241 TSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKE 300
           TSILKKRQDFRNAFSSF +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL+IKKE
Sbjct: 241 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKE 300

Query: 301 FRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQA 360
           F SFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTV+HSFMQA
Sbjct: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 360

Query: 361 AGLTNDHLTSCHRHLHCSITAADRRAPAVV-----VEETTTASETV 366
           AGLTNDHLT+CHRHLHC++ AA RR PA       VE+T    ET+
Sbjct: 361 AGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL 397

BLAST of Cp4.1LG16g06960 vs. NCBI nr
Match: gi|595864201|ref|XP_007211731.1| (hypothetical protein PRUPE_ppa006139mg [Prunus persica])

HSP 1 Score: 399.4 bits (1025), Expect = 6.6e-108
Identity = 245/423 (57.92%), Positives = 283/423 (66.90%), Query Frame = 1

Query: 1   MCRSEQ----ALEATSVVVDSKFTARPVLQPTGNRV--LDRRNSLKK------------P 60
           MC S+      +E T +V  ++   RPVLQPT NRV  LDRRNS+KK            P
Sbjct: 1   MCSSKAKVTIGVEVTPMV--ARINGRPVLQPTCNRVPSLDRRNSIKKISTPRAPPPPPLP 60

Query: 61  PSAAVS---------------PTSPKSKSPRPPATKRANDTNPMNSSSDKILIPAAALSR 120
            S+A S               P SPKSKSPRPPA KR ND N +NSSS+K++ P    +R
Sbjct: 61  TSSASSTSPRISNKASSLLTPPISPKSKSPRPPAIKRGNDPNGLNSSSEKVVTPGGT-TR 120

Query: 121 PKAALDRKKSKSFKLAGNGNVVICDNVAGGGGFEV----------ASLSYASSLITDSPG 180
            K  L+RKKSKSFK A  G      ++   G F            ASLSY+SSLIT++PG
Sbjct: 121 AKI-LERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLITEAPG 180

Query: 181 SIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKVVPIDS----KIKPAVEDRRCSFITP 240
           SIAAVRREQ+ALQ AQRKMRIAHYGRSKSA F++VVP+D+    + K A E++RCSFIT 
Sbjct: 181 SIAAVRREQMALQHAQRKMRIAHYGRSKSANFERVVPVDASGNIEAKGAEEEKRCSFITA 240

Query: 241 NSGTNY----------------------------VGSDWTSILKKRQDFRNAFSSFVAET 300
           NS   Y                            VGSDWTSILKKRQDFRNAFS F AE 
Sbjct: 241 NSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRNAFSDFDAEI 300

Query: 301 VAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKEFRSFDKYIWGFVNNKPFSPQY 349
           VA F+DKQM+SI SEYGIDI+RVRGVVDN+ RILEIKKEF SFDKYIWGFVN KP SPQY
Sbjct: 301 VANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQKPISPQY 360

BLAST of Cp4.1LG16g06960 vs. NCBI nr
Match: gi|645239095|ref|XP_008225980.1| (PREDICTED: uncharacterized protein LOC103325571 [Prunus mume])

HSP 1 Score: 398.3 bits (1022), Expect = 1.5e-107
Identity = 244/421 (57.96%), Positives = 283/421 (67.22%), Query Frame = 1

Query: 1   MCRSEQ----ALEATSVVVDSKFTARPVLQPTGNRV--LDRRNSLKK----------PPS 60
           MC S+      +E T +V  ++   RPVLQPT NRV  LDRRNS+KK          P S
Sbjct: 1   MCSSKAKVTIGVEVTPMV--ARINGRPVLQPTCNRVPSLDRRNSIKKISTPPPPPPLPTS 60

Query: 61  AAVS---------------PTSPKSKSPRPPATKRANDTNPMNSSSDKILIPAAALSRPK 120
           +A S               P SPKSKSPRPPA KR ND N +NSSS+K++ P    +R K
Sbjct: 61  SASSTSPRISSKASSLLTPPISPKSKSPRPPAIKRGNDPNGLNSSSEKVVTPGGT-TRAK 120

Query: 121 AALDRKKSKSFKLAGNGNVVICDNVAGGGGFEV----------ASLSYASSLITDSPGSI 180
             L+RKKSKSFK A  G      ++   G F            ASLSY+SSLIT++PGSI
Sbjct: 121 I-LERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSSNIEASLSYSSSLITEAPGSI 180

Query: 181 AAVRREQVALQQAQRKMRIAHYGRSKSARFDKVVPIDS----KIKPAVEDRRCSFITPNS 240
           AAVRREQ+ALQ AQRKMRIAHYGRSKSA F++VVP+D+    + K A E++RCSFIT NS
Sbjct: 181 AAVRREQMALQHAQRKMRIAHYGRSKSANFERVVPVDASGNIEAKGAEEEKRCSFITANS 240

Query: 241 GTNY----------------------------VGSDWTSILKKRQDFRNAFSSFVAETVA 300
              Y                            VGSDWTSILKKRQDFR+AFS F AE VA
Sbjct: 241 DPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRSAFSDFDAEIVA 300

Query: 301 IFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKEFRSFDKYIWGFVNNKPFSPQYKS 349
            F+DKQM+SI SEYGIDI+RVRGVVDN+ RILEIKKEF SFDKYIWGFVN KP SPQYK 
Sbjct: 301 NFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQKPISPQYKL 360

BLAST of Cp4.1LG16g06960 vs. NCBI nr
Match: gi|356557064|ref|XP_003546838.1| (PREDICTED: probable GMP synthase [glutamine-hydrolyzing])

HSP 1 Score: 398.3 bits (1022), Expect = 1.5e-107
Identity = 235/385 (61.04%), Positives = 270/385 (70.13%), Query Frame = 1

Query: 17  SKFTARPVLQPTGNRV--LDRRNSLKK--------PPS-------AAVSPTSPKSKSPRP 76
           ++   RPVLQPT NRV  L+RRNS+KK        PPS       +   P SPKSKSPR 
Sbjct: 29  ARINGRPVLQPTCNRVPNLERRNSIKKVAPAKSLSPPSPPLPSKTSLTPPVSPKSKSPRL 88

Query: 77  PATKRANDTNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICDNVAGGGG 136
           PATKR ND N +NSS +KI+IP +++  P   L+RKKSKSFK    G+ V          
Sbjct: 89  PATKRGNDNNGLNSSYEKIVIPRSSIKTP--TLERKKSKSFK---EGSCV-------SAS 148

Query: 137 FEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKVVPIDSK- 196
            E ASLSY+SSLITDSPGSIAAVRREQ+ALQQAQRKM+IAHYGRSKSA+F++VVP+D   
Sbjct: 149 IE-ASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFERVVPLDPSN 208

Query: 197 ----IKPAVEDRRCSFITPNSGTNY----------------------------VGSDWTS 256
                KP  E++RCSFIT NS   Y                            VGSDWTS
Sbjct: 209 TSLASKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTS 268

Query: 257 ILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKEFR 316
            LKKR DFR AFS F AETVA  +DKQM+SISSEYGIDI+RVRGVVDNA +ILEIKK+F 
Sbjct: 269 TLKKRLDFRAAFSEFDAETVANLTDKQMMSISSEYGIDISRVRGVVDNANQILEIKKDFG 328

Query: 317 SFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVLHSFMQAAG 352
           SFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDM+RRGFR VGPTV+HSFMQA+G
Sbjct: 329 SFDKYIWGFVNHKPLSTQYKFGHKIPVKTSKSESISKDMVRRGFRYVGPTVVHSFMQASG 388

BLAST of Cp4.1LG16g06960 vs. NCBI nr
Match: gi|593697344|ref|XP_007149154.1| (hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris])

HSP 1 Score: 398.3 bits (1022), Expect = 1.5e-107
Identity = 237/392 (60.46%), Positives = 270/392 (68.88%), Query Frame = 1

Query: 7   ALEATSVVVDS--KFTARPVLQPTGNRV--LDRRNSLKK--------PPSAAVS------ 66
           A   TS V+ S  +   RPVLQPT NRV  L+RRNS+KK        PPS  +S      
Sbjct: 18  AATTTSTVMPSVARINGRPVLQPTCNRVPNLERRNSIKKVQPPKSLSPPSPPLSSKTSLT 77

Query: 67  -PTSPKSKSPRPPATKRANDTNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGN 126
            P SPKSKSPR PA KR ND N +N+S +KI IP ++   P   L+RKKSKSFK    G+
Sbjct: 78  PPVSPKSKSPRLPAVKRGNDNNGLNTSYEKIAIPKSSSKAP--TLERKKSKSFK---EGS 137

Query: 127 VVICDNVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA 186
                  A        S SYASSLITDSPGSIAAVRREQ+ALQQAQRKM+IAHYGRSKSA
Sbjct: 138 CAPASTEA--------SFSYASSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSA 197

Query: 187 RFDKVVPIDSKI-----KPAVEDRRCSFITPNSGTNY----------------------- 246
           +F++VVP+D        KP  E++RCSFIT NS   Y                       
Sbjct: 198 KFERVVPLDPSTTTLTSKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLV 257

Query: 247 -----VGSDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDN 306
                VGSDWTS LKKRQDFR AFS F AETVA  +DKQM+SISSEYGIDI+RVRGVVDN
Sbjct: 258 LSGAQVGSDWTSTLKKRQDFRAAFSDFDAETVANLTDKQMMSISSEYGIDISRVRGVVDN 317

Query: 307 AIRILEIKKEFRSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVG 347
           A +ILEIKK+F SFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDM+RRG+R VG
Sbjct: 318 ANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGYRFVG 377

The following BLAST results are available for this feature:
Swiss-Prot
Match Name    E-value    Identity (%)    Description
GUAA_HELHP    1.2e-24    41.01           Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ...
3MG1_ECOLI    1.5e-19    35.04           DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 S...
3MGA_HAEIN    7.0e-17    33.58           DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D...

TrEMBL
Match Name          E-value     Identity (%)    Description
A0A0A0KED6_CUCSA    5.3e-157    77.59           Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134890 PE=4 SV=1
M5X1J5_PRUPE        4.6e-108    57.92           Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006139mg PE=4 SV=1
I1MJF6_SOYBN        1.0e-107    61.04           Uncharacterized protein OS=Glycine max GN=GLYMA_15G265200 PE=4 SV=1
V7BVU6_PHAVU        1.0e-107    60.46           Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_005G045900g PE=4 SV=1
A0A0B2QMF8_GLYSO    2.3e-107    59.85           Putative GMP synthase [glutamine-hydrolyzing] OS=Glycine soja GN=glysoja_026565 ...

TAIR10
Match Name     E-value    Identity (%)    Description
AT3G12710.1    4.0e-71    49.38           DNA glycosylase superfamily protein
AT5G44680.1    6.4e-69    45.15           DNA glycosylase superfamily protein
AT5G57970.1    7.1e-36    39.09           DNA glycosylase superfamily protein
AT1G75090.1    1.2e-35    39.36           DNA glycosylase superfamily protein
AT1G80850.1    5.1e-34    45.40           DNA glycosylase superfamily protein

NCBI nr
Match Name                          E-value     Identity (%)    Description
gi|778713005|ref|XP_004139917.2|    7.7e-157    77.59           PREDICTED: uncharacterized protein LOC101218536 [Cucumis sativus]
gi|595864201|ref|XP_007211731.1|    6.6e-108    57.92           hypothetical protein PRUPE_ppa006139mg [Prunus persica]
gi|645239095|ref|XP_008225980.1|    1.5e-107    57.96           PREDICTED: uncharacterized protein LOC103325571 [Prunus mume]
gi|356557064|ref|XP_003546838.1|    1.5e-107    61.04           PREDICTED: probable GMP synthase [glutamine-hydrolyzing]
gi|593697344|ref|XP_007149154.1|    1.5e-107    60.46           hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term          Definition
GO:0006281    DNA repair
GO:0006284    base-excision repair
Vocabulary: Molecular Function
Term          Definition
GO:0003824    catalytic activity
GO:0008725    DNA-3-methyladenine glycosylase activity
Vocabulary: INTERPRO
Term          Definition
IPR011257     DNA_glycosylase
IPR005019     Adenine_glyco
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0006284        base-excision repair
biological_process    GO:0006281        DNA repair
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0008725        DNA-3-methyladenine glycosylase activity
molecular_function    GO:0003824        catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG16g06960.1    Cp4.1LG16g06960.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description              Source     Source Term          Source Description     Alignment
IPR005019    Methyladenine glycosylase    PFAM       PF03352              Adenine_glyco          coord: 200..337; score: 9.6
IPR011257    DNA glycosylase              GENE3D     G3DSA:1.10.340.30    -                      coord: 199..338; score: 8.0
IPR011257    DNA glycosylase              unknown    SSF48150             DNA-glycosylase        coord: 198..341; score: 1.43
None         No IPR available             PANTHER    PTHR31116            FAMILY NOT NAMED       coord: 117..357; score: 7.1E-161 / coord: 6..95; score: 7.1E
None         No IPR available             PANTHER    PTHR31116:SF6        SUBFAMILY NOT NAMED    coord: 117..357; score: 7.1E-161 / coord: 6..95; score: 7.1E
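The `coord:` entries in the InterPro table give domain positions on the protein, so they can be compared directly with the query ranges in the BLAST alignments. The sketch below extracts the spans and checks one such overlap; the helper names `spans`, `span_length`, and `contains` are hypothetical, and the lengths assume 1-based inclusive protein coordinates.

```python
import re

# Matches "coord: 200..337" and captures both endpoints.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def spans(alignment_text):
    """Return every (start, end) pair found in an Alignment cell."""
    return [(int(a), int(b)) for a, b in COORD_RE.findall(alignment_text)]

def span_length(span):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return span[1] - span[0] + 1

def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

pfam = spans("coord: 200..337")[0]                 # PF03352 (Adenine_glyco)
panther = spans("coord: 117..357 coord: 6..95")    # PTHR31116, two segments
ecoli_hsp = (201, 336)  # query range of the 3MG1_ECOLI alignment

print(pfam, span_length(pfam))
print(panther)
print(contains(pfam, ecoli_hsp))
```

Here the query range aligned to the E. coli DNA-3-methyladenine glycosylase (201..336) falls inside the Pfam Adenine_glyco domain (200..337), which is consistent with the gene description.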