CSPI03G42480 (gene) Wild cucumber (PI 183967)

NameCSPI03G42480
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionDNA-3-methyladenine glycosylase, putative
LocationChr3 : 36512099 .. 36514615 (-)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CACCCGTTGTTTCTTCGTCGTTCTCCGCCCACGCTCTCACTGCTCTCTCTCTCTCTCTCTCTCTCGCCACGCCCCTTCTCCTCTTCTCTCCCAAACCCTAACCCCTTTCTTTTCATCTCTTTCCCCTTCTCTTCATTTTCCATCTTCTCTTATCTTCCCTCTCCATAAATTCTTTTTCAACTTCCTTACTCAGTGGAGCCAATTTTCACTGACGACGGATATCTCTCTTCACCAATGTCTGTTGCTACCAAGCTCCAATCCCATGCTAAACCGGCTTTGGAGCCCCGAGCTATTCTTGGACCTGGCGGGAACAGAGATAGGGCGCCTCAGAACCCCAAATGTAAACCCGAAACTTTAAAGAAGACGGAGAAGCAGAGCAAGGCGCTTCCGGCCATTTCTGAATCGGTTATCCAAGATAATGTCTCCGTCGGGAGTTCCTGCTCTTCTGATTCTTTATCCAGTAATTATTCTGCCAAATTGTTGAAGCCCTACGCTGTGAAGCCTGTTTCGGCCGGCGGTGACTCAAACGCCACCACAACGTCGCCTGCGCTCTCGCTTCCGGGGAAACGCTGTGATTGGATAACGCTTCATTCGGGTAATCAATATTAAACCTTGCTTATCATTGACTATTTCTTTGATTAATTTGCTAAGTGATGGAATATATGAGAGAAAACAATCGGATAATTATTTCTGTTATGGGTATAATGACAGTTTATCTTAATTCAATAGTCAATTCGTACTTGCATTGCAAATATTTTGTATTTTAATGCAATTACAAATTTTATTTAGTTCAAAACATTCGAGCACAAATTAGAATAATATAATCAAATTATATAGAATTAAAATATCGGTTAAGTATGCAATTGTCCTGTGCAAAAATTAATTATTTAATTCTTCCATTTAGTAATTAATAATTTAATTTTTAAGGTAATGTTGTGATCACAAAGTAATTATTGCGTCATCTAAGTTGTTATCGCCATAATTGGACCATCATTTAATAACAGATTGACCTCAGTCTTTTTCATGCCATGGACGGATTTATTTTGAGTTGTGTTCTACGTCTGCCCATTTATATATTGAAAAAACTGGGTTAGGTTTTGTATATTCTAGCCTTCTAAATTGGTTCATTTGGACGAAAATCAAAATTATTATTGTATGCGGTAATTTTATTTATTAAATTATTTGAATATTACATTTTATTATCATATGTGAATGCAAGTGTTATATAAAAAGAATTGTGCTCTAATCATTTGTTTTTTTGTTGGTCGTTGTTCTTCTGTTTAGTCTATAAAGCAATGTAATTCAAGAATTTCATCTTTCATTAAACCATTCCTTTCATTTTCAACTTTAATTTGAATCTCTTCTAATCTTGTAATGTAATATGATGATTTCTTGTAGATCCACTTTACATCGCTTTTCATGATGAAGAATGGGGAGTCCCAATTCATGATGACAAGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCAAAAGAGATATATTTAGGTCTAACTTCTCATTTTTCGATTCTCTTTTGCCCCAAACTAGCAAAAGAAAAAAAGAAAATCATTCCCCTGAAAGAATATTGTTCTAAAATGTCAACTAACTTCTTCTCTGTCTTTGCAGGAAAGTTTTGAATGATTTTGACCCATCTTCAATCGCACAGTTCACAGAGAACGAGTTTACGACACTAAAAGTAAATGGCATCCAACTCCTGTCTGAACCGAAGCTTCGTGCAATTGTTGACAACGCTAACCAAGTACTCAAGGTATTGAGTTTTGGTTAAGCTTTACCACTTTTTAATTTTTTTTTTTTTTTTTTTTCTGTTCACAGGAAGTAACGTTTGGTTGTCTCAACTTCCATCCCTTTCAAGTGGTTGAAACTTGAAAGCTGAATCTTCACGTTTTCATTTGCCTTTTTACCTTTACAGATTCAAAAGGAATTTGGTTCCTTCAGCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATACGAAACAGACATCGATACAATCGTCAAGTACCTGTGAAGACGCCGAAAGCAGAGTTCATGAGCAAGGATATGATCAGGAGGGGATTCCGTTGCGTCGGGCCAACTGTGGTTTATTCCTTCATGCAGGTGGCTGGAATTGTTAACGATCACTTGGTGAGTTGCTTCAGATATGAGGAGTGTGACCCAAAGGTAAAAGACGATAAGAAATTAAGAGTAGAAGATAAACGGTAGGAGTCACTTAGGGGAGCTTTGGAGAAGCCTTGCTTGACTTAGATGCTGACAGCTGATTCATACAAAAAGGTTTATCTCTAACTAATCTTACAAAATCATTGTAATAAAATGGTTATGTTGTTGATCTGATGCTGAATAGTTTGAGTTCGTTTTTGTTTAGAGTTTTATAATTTCTTATTTGTAGATGATGTTATAACGTAATTGTAATTTGTATTTACAAACAACAAACACACCGTTGGTTTCTTGTAATTATTATTTC

mRNA sequence

ATGTCTGTTGCTACCAAGCTCCAATCCCATGCTAAACCGGCTTTGGAGCCCCGAGCTATTCTTGGACCTGGCGGGAACAGAGATAGGGCGCCTCAGAACCCCAAATGTAAACCCGAAACTTTAAAGAAGACGGAGAAGCAGAGCAAGGCGCTTCCGGCCATTTCTGAATCGGTTATCCAAGATAATGTCTCCGTCGGGAGTTCCTGCTCTTCTGATTCTTTATCCAGTAATTATTCTGCCAAATTGTTGAAGCCCTACGCTGTGAAGCCTGTTTCGGCCGGCGGTGACTCAAACGCCACCACAACGTCGCCTGCGCTCTCGCTTCCGGGGAAACGCTGTGATTGGATAACGCTTCATTCGGATCCACTTTACATCGCTTTTCATGATGAAGAATGGGGAGTCCCAATTCATGATGACAAGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCAAAAGAGATATATTTAGGAAAGTTTTGAATGATTTTGACCCATCTTCAATCGCACAGTTCACAGAGAACGAGTTTACGACACTAAAAGTAAATGGCATCCAACTCCTGTCTGAACCGAAGCTTCGTGCAATTGTTGACAACGCTAACCAAGTACTCAAGATTCAAAAGGAATTTGGTTCCTTCAGCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATACGAAACAGACATCGATACAATCGTCAAGTACCTGTGAAGACGCCGAAAGCAGAGTTCATGAGCAAGGATATGATCAGGAGGGGATTCCGTTGCGTCGGGCCAACTGTGGTTTATTCCTTCATGCAGGTGGCTGGAATTGTTAACGATCACTTGGTGAGTTGCTTCAGATATGAGGAGTGTGACCCAAAGGTAAAAGACGATAAGAAATTAAGAGTAGAAGATAAACGGTAG

Coding sequence (CDS)

ATGTCTGTTGCTACCAAGCTCCAATCCCATGCTAAACCGGCTTTGGAGCCCCGAGCTATTCTTGGACCTGGCGGGAACAGAGATAGGGCGCCTCAGAACCCCAAATGTAAACCCGAAACTTTAAAGAAGACGGAGAAGCAGAGCAAGGCGCTTCCGGCCATTTCTGAATCGGTTATCCAAGATAATGTCTCCGTCGGGAGTTCCTGCTCTTCTGATTCTTTATCCAGTAATTATTCTGCCAAATTGTTGAAGCCCTACGCTGTGAAGCCTGTTTCGGCCGGCGGTGACTCAAACGCCACCACAACGTCGCCTGCGCTCTCGCTTCCGGGGAAACGCTGTGATTGGATAACGCTTCATTCGGATCCACTTTACATCGCTTTTCATGATGAAGAATGGGGAGTCCCAATTCATGATGACAAGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCAAAAGAGATATATTTAGGAAAGTTTTGAATGATTTTGACCCATCTTCAATCGCACAGTTCACAGAGAACGAGTTTACGACACTAAAAGTAAATGGCATCCAACTCCTGTCTGAACCGAAGCTTCGTGCAATTGTTGACAACGCTAACCAAGTACTCAAGATTCAAAAGGAATTTGGTTCCTTCAGCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATACGAAACAGACATCGATACAATCGTCAAGTACCTGTGAAGACGCCGAAAGCAGAGTTCATGAGCAAGGATATGATCAGGAGGGGATTCCGTTGCGTCGGGCCAACTGTGGTTTATTCCTTCATGCAGGTGGCTGGAATTGTTAACGATCACTTGGTGAGTTGCTTCAGATATGAGGAGTGTGACCCAAAGGTAAAAGACGATAAGAAATTAAGAGTAGAAGATAAACGGTAG
BLAST of CSPI03G42480 vs. Swiss-Prot
Match: GUAA_HELHP (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) GN=guaA PE=3 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 1.5e-39
Identity = 80/184 (43.48%), Postives = 110/184 (59.78%), Query Frame = 1

Query: 112 RCDWITLHSDP---LYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFR 171
           RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W  IL KR+ FR
Sbjct: 787 RCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVLEGFQAGLSWITILKKREAFR 846

Query: 172 KVLNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCW 231
              +DFDP  +A + E++   L  N   + +  K+ A + NA   + +Q+EFGSF  Y W
Sbjct: 847 VAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAIINAKAFMAVQREFGSFDKYIW 906

Query: 232 SFVNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLV 291
            FV  KPI N       +P  TP ++ ++KD+ +RGF+ VG T +Y+ MQ  G+VNDHL 
Sbjct: 907 GFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFVGTTTMYAMMQSIGMVNDHLT 966

Query: 292 SCFR 293
           SCF+
Sbjct: 967 SCFK 970

BLAST of CSPI03G42480 vs. Swiss-Prot
Match: 3MG1_ECOLI (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 7.5e-36
Identity = 72/183 (39.34%), Postives = 109/183 (59.56%), Query Frame = 1

Query: 111 KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKV 170
           +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W  +L KR+ +R  
Sbjct: 2   ERCGWVS--QDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRAC 61

Query: 171 LNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSF 230
            + FDP  +A   E +   L  +   +    K++AI+ NA   L++++    F ++ WSF
Sbjct: 62  FHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWSF 121

Query: 231 VNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSC 290
           VN +P   +     ++P  T  ++ +SK + +RGF+ VG T+ YSFMQ  G+VNDH+V C
Sbjct: 122 VNHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVGC 181

Query: 291 FRY 294
             Y
Sbjct: 182 CCY 182

BLAST of CSPI03G42480 vs. Swiss-Prot
Match: 3MGA_HAEIN (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=tag PE=3 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 6.0e-33
Identity = 74/179 (41.34%), Postives = 103/179 (57.54%), Query Frame = 1

Query: 112 RCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKVL 171
           RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W  +L KR+ +R+  
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 172 NDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFV 231
           + FDP  IA+ T  +      N   +    KL AIV NA   L ++K   +FS++ WSFV
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 232 NKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSC 291
           N KPI N     R VP KT  ++ +SK + +RGF  +G T  Y+FMQ  G+V+DHL  C
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of CSPI03G42480 vs. TrEMBL
Match: A0A0A0LG22_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G855280 PE=4 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 1.3e-175
Identity = 309/312 (99.04%), Postives = 311/312 (99.68%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVATKLQSH KPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVI+
Sbjct: 1   MSVATKLQSHVKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIR 60

Query: 61  DNVSVGSSCSSDSLSSNYSAKLLKPYAVKPVSAGGDSNATTTSPALSLPGKRCDWITLHS 120
           DNVSVGSSCSSDSLSSNYSAKLLKPYAVKPVSAGGDSNATTTSPALSLPGKRCDWITLHS
Sbjct: 61  DNVSVGSSCSSDSLSSNYSAKLLKPYAVKPVSAGGDSNATTTSPALSLPGKRCDWITLHS 120

Query: 121 DPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKVLNDFDPSSIA 180
           DPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRD+FRKVLNDFDPSSIA
Sbjct: 121 DPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDVFRKVLNDFDPSSIA 180

Query: 181 QFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRH 240
           QFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRH
Sbjct: 181 QFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRH 240

Query: 241 RYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRYEECDPKV 300
           RYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRYEECDPKV
Sbjct: 241 RYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRYEECDPKV 300

Query: 301 KDDKKLRVEDKR 313
           KDDKKLRVEDKR
Sbjct: 301 KDDKKLRVEDKR 312

BLAST of CSPI03G42480 vs. TrEMBL
Match: A0A061FDE4_THECC (DNA-3-methyladenine glycosylase, putative OS=Theobroma cacao GN=TCM_033995 PE=4 SV=1)

HSP 1 Score: 398.7 bits (1023), Expect = 6.7e-108
Identity = 207/319 (64.89%), Postives = 238/319 (74.61%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVATKL+S   P  EPRAILGP GNR R     K KPE  KK ++    +    +SV+Q
Sbjct: 1   MSVATKLKSSPTPVTEPRAILGPTGNRVRVSDESKRKPEAQKKPQRPKFRVSKSPQSVVQ 60

Query: 61  DNVSVGSSCSSDSLSSNYSAKLL------KPYAVKPVSAG----GDSNATTTSPALSLPG 120
            NVSV SSCSSDS SSN S K +      K   VKPV A      D      SP L  P 
Sbjct: 61  SNVSVDSSCSSDSSSSNSSVKTVSSKKTVKRIGVKPVKAKVAPTADEVVAEPSPVLPEPL 120

Query: 121 KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKV 180
           KRCDWIT  SDPLY + HD+EWGVP+HDD+KLFELLV SQALAEL+WP IL+KRDIFRK+
Sbjct: 121 KRCDWITPFSDPLYTSLHDKEWGVPVHDDRKLFELLVFSQALAELSWPTILNKRDIFRKL 180

Query: 181 LNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSF 240
            ++FDPSSIAQFTE +  +LKVNG  LLSEPKLRA+V+NA Q+LK+Q+EFGSFS+YCW F
Sbjct: 181 FDNFDPSSIAQFTEKKLLSLKVNGSLLLSEPKLRAVVENAKQMLKVQQEFGSFSSYCWGF 240

Query: 241 VNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSC 300
           VN KPIRN  RY RQVPVKTPKAE +SKDM++RGFRCVGPTVVYSFMQVAGIVNDHLV+C
Sbjct: 241 VNHKPIRNGFRYVRQVPVKTPKAELISKDMMQRGFRCVGPTVVYSFMQVAGIVNDHLVTC 300

Query: 301 FRYEECDPKVKDDKKLRVE 310
           FRY+EC+  VK D K  +E
Sbjct: 301 FRYQECNANVKKDIKPEIE 319

BLAST of CSPI03G42480 vs. TrEMBL
Match: A0A0D2TRX9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G204500 PE=4 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 3.1e-105
Identity = 202/320 (63.12%), Postives = 238/320 (74.38%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVAT+ +S  KP  EPRAILGP GNR R     K + E LKK ++    +    +SV+Q
Sbjct: 1   MSVATRPRSSTKPLTEPRAILGPAGNRVRVSDESKRRTEALKKPQRPKVPVSQSPKSVVQ 60

Query: 61  DNVSVGSSCSSDSLSSNYS------AKLLKPYAVKP----VSAGGDSNATTTSPALSLPG 120
            NVSV S CSSDS SSN S       K +K   VK     V++  D   T  SPA+S P 
Sbjct: 61  SNVSVDSCCSSDSSSSNSSFKTASSRKTVKQNGVKQAKPKVASTADEVVTEISPAMSGPL 120

Query: 121 KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKV 180
           KRCDWIT  SDPLY +FHDEEWGVP+HDD+KLFELLV SQALAEL+WP +L KR+IFRK 
Sbjct: 121 KRCDWITPFSDPLYTSFHDEEWGVPVHDDRKLFELLVFSQALAELSWPTVLKKREIFRKF 180

Query: 181 LNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSF 240
            +DFDPSS+AQFTE +  +LKV+G  LLSE KLRAIV+NA  +LK+Q+EFGSFS+YCW F
Sbjct: 181 FDDFDPSSMAQFTEKKMLSLKVDGCLLLSEAKLRAIVENAKLILKVQQEFGSFSSYCWGF 240

Query: 241 VNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSC 300
           VN KP+RN  RY RQVPVKTPKAE MSKDM+RRGF CVGPTVVYSFMQVAGIVNDHLV+C
Sbjct: 241 VNHKPLRNAFRYARQVPVKTPKAEVMSKDMMRRGFCCVGPTVVYSFMQVAGIVNDHLVTC 300

Query: 301 FRYEECDPKVKDDKKLRVED 311
           FRY+EC+  VK D K ++E+
Sbjct: 301 FRYQECNATVKKDIKPKIEE 320

BLAST of CSPI03G42480 vs. TrEMBL
Match: A0A0B0MFZ1_GOSAR (Putative GMP synthase [glutamine-hydrolyzing] OS=Gossypium arboreum GN=F383_22284 PE=4 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 6.9e-105
Identity = 202/320 (63.12%), Postives = 240/320 (75.00%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVAT+ +S AKP  EPRAILGP GNR R     K + E LKK ++    +    +SV+Q
Sbjct: 1   MSVATRPRSSAKPLTEPRAILGPAGNRVRVSDESKRRTEALKKPQRPKVPVSQSPKSVVQ 60

Query: 61  DNVSVGSSCSSDSLSSNYS------AKLLKPYAVKP----VSAGGDSNATTTSPALSLPG 120
            NVSV S CSSDS SSN S       K +K   VK     V++  D   T  SPA+S P 
Sbjct: 61  SNVSVDSCCSSDSSSSNSSFKTASSRKTVKQNGVKQAKPKVASTADEVVTEISPAMSGPL 120

Query: 121 KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKV 180
           KRCDWIT  SDPLY +FHDEEWGVP+H+D+KLFELLV SQALAEL+WP IL KR+IFRK+
Sbjct: 121 KRCDWITPFSDPLYTSFHDEEWGVPVHNDRKLFELLVFSQALAELSWPTILKKREIFRKL 180

Query: 181 LNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSF 240
            ++FDPSS+AQFTE +  +LKV+G  LLSE KLRAIV+NA  +LK+Q+EFGSFS+YCW F
Sbjct: 181 FDNFDPSSMAQFTEKKMLSLKVDGCLLLSEAKLRAIVENAKLILKVQQEFGSFSSYCWGF 240

Query: 241 VNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSC 300
           VN KP+RN  RY RQVPVKTPKAE MSKDM+RRGF CVGPTVVYSFMQVAGIVNDHLV+C
Sbjct: 241 VNHKPLRNAFRYARQVPVKTPKAEVMSKDMMRRGFCCVGPTVVYSFMQVAGIVNDHLVTC 300

Query: 301 FRYEECDPKVKDDKKLRVED 311
           FRY+EC+  VK D K ++E+
Sbjct: 301 FRYQECNATVKKDIKPKIEE 320

BLAST of CSPI03G42480 vs. TrEMBL
Match: W9RT44_9ROSA (Putative Glutamine amidotransferase OS=Morus notabilis GN=L484_021208 PE=4 SV=1)

HSP 1 Score: 387.9 bits (995), Expect = 1.2e-104
Identity = 210/328 (64.02%), Postives = 239/328 (72.87%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALP-AISE--- 60
           MSVATKL S   P  EPRAILGPGGNR R  + PK K E LKK + Q    P A+SE   
Sbjct: 1   MSVATKLHS---PVCEPRAILGPGGNRVRVSEYPKRKGEALKKPQAQRTRKPTAVSEVPQ 60

Query: 61  SVIQDNVSVGSSCSSDSLSSNYSAKL--------------LKPYAVKPVSAGGDSNATTT 120
           SV++ N SV SSCSSDS SS   AK               LKP  V PV       A   
Sbjct: 61  SVVRSNGSVDSSCSSDSSSSGSLAKTVSSKKTPPTVKRKGLKPVKVVPVGV----EAVAA 120

Query: 121 SPALSLPGKRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILS 180
            P +  P KRCDWIT +SD +Y +FHDEEWGVPIHDD+KLFELLV SQALAELTWP IL+
Sbjct: 121 LPKILGPPKRCDWITPNSDSIYTSFHDEEWGVPIHDDRKLFELLVFSQALAELTWPAILN 180

Query: 181 KRDIFRKVLNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGS 240
           KR+IFRK+  +FDPSSIAQF E +  +LKVNG  LLSEPKLRAIV+NA Q+LKIQ+EFGS
Sbjct: 181 KREIFRKLFENFDPSSIAQFNEKKLLSLKVNGNLLLSEPKLRAIVENAKQILKIQQEFGS 240

Query: 241 FSNYCWSFVNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGI 300
           FSNYCWSFVN KPI+N  RY RQVPVK+PKA+ +SKDM++RGFRCVGPTV+YSFMQVAGI
Sbjct: 241 FSNYCWSFVNDKPIKNGFRYGRQVPVKSPKADLISKDMMQRGFRCVGPTVIYSFMQVAGI 300

Query: 301 VNDHLVSCFRYEECDPKVKDDKKLRVED 311
           VNDHL+SCFRYEEC   V+ D K R E+
Sbjct: 301 VNDHLLSCFRYEECKINVEKDLKPRTEE 321

BLAST of CSPI03G42480 vs. TAIR10
Match: AT1G75090.1 (AT1G75090.1 DNA glycosylase superfamily protein)

HSP 1 Score: 320.1 bits (819), Expect = 1.5e-87
Identity = 169/313 (53.99%), Postives = 216/313 (69.01%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKAL--PAISESV 60
           MS+ +KL+S  KP  E RAIL   GNR +  +    K   L     +S A   P  + SV
Sbjct: 1   MSIVSKLRSPVKPIDESRAILCSTGNRFKVTKTEMTKKPQLNPRVTKSPATKKPDSNFSV 60

Query: 61  IQDNVSVGSSCSS-DSLSSNYSAKLLKPYAVKPVSAGGDSNATTT-----SPALSLPGKR 120
             D+ S  SS S   S+++  S K+  P     V    +  A+       SP +  P KR
Sbjct: 61  STDDSSSSSSSSERSSVNTTNSGKVTTPSKRNGVEKLNNVVASVAVVEDISPKIPGPVKR 120

Query: 121 CDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKVLN 180
           C WIT +SDP+Y+ FHDEEWGVP+ DDKKLFELLV SQALAE +WP IL +RD FRK+  
Sbjct: 121 CHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKLFE 180

Query: 181 DFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVN 240
           +FDPS+IAQFTE    +L+VNG  +LSE KLRAIV+NA  VLK+++EFGSFSNYCW FVN
Sbjct: 181 EFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRFVN 240

Query: 241 KKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFR 300
            KP+RN +RY RQVPVK+PKAE++SKDM++RGFRCVGPTV+YSF+Q +GIVNDHL +CFR
Sbjct: 241 HKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTACFR 300

Query: 301 YEECDPKVKDDKK 306
           Y+EC+ + + + K
Sbjct: 301 YQECNVETERETK 313

BLAST of CSPI03G42480 vs. TAIR10
Match: AT5G57970.1 (AT5G57970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 262.3 bits (669), Expect = 3.8e-70
Identity = 132/254 (51.97%), Postives = 168/254 (66.14%), Query Frame = 1

Query: 56  ESVIQDNVSVGSSCSSDSLSSNYSAKLLKPYAVKPVSAGGDSNATTTSP-------ALSL 115
           E  +  N+S+ +S SSD+   ++ ++      ++  S G  S +  + P       AL  
Sbjct: 86  EQNLNSNLSLNASFSSDASMDSFHSRASTGRLIRSYSVGSRSKSYPSKPRSVVSEGALDS 145

Query: 116 PG------KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILS 175
           P       KRC W+T +SDP YI FHDEEWGVP+HDDK+LFELLVLS ALAE TWP ILS
Sbjct: 146 PPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILS 205

Query: 176 KRDIFRKVLNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGS 235
           KR  FR+V  DFDP++I +  E +          LLS+ KLRA+++NA Q+LK+ +E+GS
Sbjct: 206 KRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGS 265

Query: 236 FSNYCWSFVNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGI 295
           F  Y WSFV  K I ++ RY RQVP KTPKAE +SKD++RRGFR VGPTVVYSFMQ AGI
Sbjct: 266 FDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGI 325

Query: 296 VNDHLVSCFRYEEC 297
            NDHL SCFR+  C
Sbjct: 326 TNDHLTSCFRFHHC 339

BLAST of CSPI03G42480 vs. TAIR10
Match: AT1G80850.1 (AT1G80850.1 DNA glycosylase superfamily protein)

HSP 1 Score: 260.0 bits (663), Expect = 1.9e-69
Identity = 151/324 (46.60%), Postives = 196/324 (60.49%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKP---------ETLKKTEKQSKAL 60
           MS   +++S      E R++LGP GN+    Q P  KP         + L  TEK  +  
Sbjct: 1   MSAPPRVRSVDSSDREFRSVLGPAGNK--LQQKPLSKPVKKPVAEKTKNLTFTEKMPQCS 60

Query: 61  PAISESVIQDNVSVGSSCSSDSLSSNYSAKL-----------LKPYAVKPVSAGGDSNAT 120
           P     + ++ +S+ +S SSD+ SS  S+ L           L+       S+    N T
Sbjct: 61  PLSPPILRRNGISMTASYSSDASSSCESSPLSMTSTSSGKRVLRRSGSVSSSSSLRRNLT 120

Query: 121 T-----TSPALSLPGKRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAEL 180
                  S       KRC WIT  SD  YIAFHDEEWGVP+HDDK+LFELL LS ALAEL
Sbjct: 121 EERDEKASDCFCDGRKRCAWITPKSDQCYIAFHDEEWGVPVHDDKRLFELLSLSGALAEL 180

Query: 181 TWPLILSKRDIFRKVLNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLK 240
           +W  ILSKR +FR+V  DFDP +I++ T  + T+ ++    LLSE KLR+I++NANQV K
Sbjct: 181 SWKDILSKRQLFREVFMDFDPIAISELTNKKITSPEIAATTLLSEQKLRSILENANQVCK 240

Query: 241 IQKEFGSFSNYCWSFVNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYS 300
           I   FGSF  Y W+FVN+KP +++ RY RQVPVKT KAE +SKD++RRGFR V PTV+YS
Sbjct: 241 IIGAFGSFDKYIWNFVNQKPTQSQFRYPRQVPVKTSKAELISKDLVRRGFRSVSPTVIYS 300

BLAST of CSPI03G42480 vs. TAIR10
Match: AT1G15970.1 (AT1G15970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 256.5 bits (654), Expect = 2.1e-68
Identity = 149/348 (42.82%), Postives = 206/348 (59.20%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNR-DRAPQNPKCKPETLKKTEKQSK-------ALP 60
           MSV  + +S      E R++LGP GN+  R P   K +   ++KT   SK         P
Sbjct: 1   MSVPPRFRSVNSDEREFRSVLGPTGNKLQRKPPGMKLEKPMMEKTIIDSKDEKAKKPTTP 60

Query: 61  AISESVIQDNVSVGSSC---SSDSLSSNYSAKLLKPYAVKPVSAGGDSN---------AT 120
           A   + ++   S+ SS    +S S++++YS+         P+S    S+         + 
Sbjct: 61  ASPRTTLKQCSSLCSSILRKNSASMTASYSSDASSSCESSPLSVASSSSCKKVVRRSGSV 120

Query: 121 TTSPALSLPG--------------KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELL 180
           +++  LS+                KRC WIT  +DP Y+AFHDEEWGVP+HDDKKLFELL
Sbjct: 121 SSTRKLSVGKEEEKVSGDCFADGRKRCAWITPKADPCYVAFHDEEWGVPVHDDKKLFELL 180

Query: 181 VLSQALAELTWPLILSKRDIFRKVLNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAI 240
            LS ALAEL+W  ILS+R I R+V  DFDP ++A+  + + T      I LLSE K+R+I
Sbjct: 181 CLSGALAELSWTDILSRRHILREVFMDFDPVAVAELNDKKLTAPGTAAISLLSEVKIRSI 240

Query: 241 VDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFR 300
           +DN+  V KI  E GS   Y W+FVN KP +++ RY RQVPVKT KAEF+SKD++RRGFR
Sbjct: 241 LDNSRHVRKIIAECGSLKKYMWNFVNNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFR 300

Query: 301 CVGPTVVYSFMQVAGIVNDHLVSCFRYEEC--DPKVKDDKKLRVEDKR 313
            V PTV+YSFMQ AG+ NDHL+ CFRY++C  D +     K + +++R
Sbjct: 301 SVSPTVIYSFMQAAGLTNDHLIGCFRYQDCCVDAETTTTTKAKKKNER 348

BLAST of CSPI03G42480 vs. TAIR10
Match: AT1G13635.1 (AT1G13635.1 DNA glycosylase superfamily protein)

HSP 1 Score: 215.3 bits (547), Expect = 5.3e-56
Identity = 111/270 (41.11%), Postives = 169/270 (62.59%), Query Frame = 1

Query: 38  PETLKKTEKQSKALPAISESVIQ---DNVSVGSSCSSDS--------LSSNYSAKLLKPY 97
           P TL+++   S +L +IS S+ Q   D+VS  S+ + +         +SS +  ++  P 
Sbjct: 39  PITLQRSTSSSFSLSSISLSLSQNSTDSVSTDSNSTLEQKISLALGLISSPHRREIFVPK 98

Query: 98  AVKPVSAGGDSNATTTSPALSLPGKRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELL 157
           ++ P     D N++          KRC+WIT  SD +Y+ FHD++WGVP++DD  LFE L
Sbjct: 99  SI-PQQLCQDFNSSDEP-------KRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFL 158

Query: 158 VLSQALAELTWPLILSKRDIFRKVLNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAI 217
            +S  L +  W  IL +++ FR+   +FDP+ +A+  E E   +  N   +L E ++R I
Sbjct: 159 AMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCI 218

Query: 218 VDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFR 277
           VDNA  + K+  EFGSFS++ W F++ KPI N+ +Y+R VP+++PKAE +SKDMI+RGFR
Sbjct: 219 VDNAKCITKVVNEFGSFSSFVWGFMDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFR 278

Query: 278 CVGPTVVYSFMQVAGIVNDHLVSCFRYEEC 297
            VGP +V+SFMQ AG+  DHLV CFR+ +C
Sbjct: 279 FVGPVIVHSFMQAAGLTIDHLVDCFRHGDC 300

BLAST of CSPI03G42480 vs. NCBI nr
Match: gi|449460123|ref|XP_004147795.1| (PREDICTED: uncharacterized protein LOC101206397 [Cucumis sativus])

HSP 1 Score: 623.6 bits (1607), Expect = 1.8e-175
Identity = 309/312 (99.04%), Postives = 311/312 (99.68%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVATKLQSH KPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVI+
Sbjct: 1   MSVATKLQSHVKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIR 60

Query: 61  DNVSVGSSCSSDSLSSNYSAKLLKPYAVKPVSAGGDSNATTTSPALSLPGKRCDWITLHS 120
           DNVSVGSSCSSDSLSSNYSAKLLKPYAVKPVSAGGDSNATTTSPALSLPGKRCDWITLHS
Sbjct: 61  DNVSVGSSCSSDSLSSNYSAKLLKPYAVKPVSAGGDSNATTTSPALSLPGKRCDWITLHS 120

Query: 121 DPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKVLNDFDPSSIA 180
           DPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRD+FRKVLNDFDPSSIA
Sbjct: 121 DPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDVFRKVLNDFDPSSIA 180

Query: 181 QFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRH 240
           QFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRH
Sbjct: 181 QFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRH 240

Query: 241 RYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRYEECDPKV 300
           RYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRYEECDPKV
Sbjct: 241 RYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRYEECDPKV 300

Query: 301 KDDKKLRVEDKR 313
           KDDKKLRVEDKR
Sbjct: 301 KDDKKLRVEDKR 312

BLAST of CSPI03G42480 vs. NCBI nr
Match: gi|659133198|ref|XP_008466607.1| (PREDICTED: uncharacterized protein LOC103503975 [Cucumis melo])

HSP 1 Score: 596.7 bits (1537), Expect = 2.4e-167
Identity = 297/313 (94.89%), Postives = 304/313 (97.12%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVATKLQSHAKPALEPRAILGPGGNRDRAP NPKCKPETLKKTEKQSKALP ISE VI+
Sbjct: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPHNPKCKPETLKKTEKQSKALPVISELVIR 60

Query: 61  DNVSVGSSCSSDSLSSNYSAKLLKPYAVKPVSAGGDSNATTTSPALSLPGKRCDWITLHS 120
           DNVSVGSSCSSDSLSSNYS KLLKPYAVKPVSAGGDS+ATTTSPALSLPGKRCDWITLHS
Sbjct: 61  DNVSVGSSCSSDSLSSNYSVKLLKPYAVKPVSAGGDSSATTTSPALSLPGKRCDWITLHS 120

Query: 121 DPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKVLNDFDPSSIA 180
           DPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPLILSKRDIFRKVLNDFDPSSIA
Sbjct: 121 DPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDIFRKVLNDFDPSSIA 180

Query: 181 QFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSFVNKKPIRNRH 240
           QF ENEFTTLKVNGIQLLSEPKLRAIV+NANQVLKIQKEFGSFSNYCWSFVNKKPIRNR 
Sbjct: 181 QFKENEFTTLKVNGIQLLSEPKLRAIVENANQVLKIQKEFGSFSNYCWSFVNKKPIRNRF 240

Query: 241 RYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSCFRYEECD-PK 300
           RYNRQVPVKTPKAEFMSKDM+RRGFRCVGPTVVYSFMQVAGIVNDHL +CFRYEECD  K
Sbjct: 241 RYNRQVPVKTPKAEFMSKDMMRRGFRCVGPTVVYSFMQVAGIVNDHLANCFRYEECDTTK 300

Query: 301 VKDDKKLRVEDKR 313
           +KDDKKLRVEDKR
Sbjct: 301 IKDDKKLRVEDKR 313

BLAST of CSPI03G42480 vs. NCBI nr
Match: gi|590593103|ref|XP_007017467.1| (DNA-3-methyladenine glycosylase, putative [Theobroma cacao])

HSP 1 Score: 398.7 bits (1023), Expect = 9.6e-108
Identity = 207/319 (64.89%), Postives = 238/319 (74.61%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVATKL+S   P  EPRAILGP GNR R     K KPE  KK ++    +    +SV+Q
Sbjct: 1   MSVATKLKSSPTPVTEPRAILGPTGNRVRVSDESKRKPEAQKKPQRPKFRVSKSPQSVVQ 60

Query: 61  DNVSVGSSCSSDSLSSNYSAKLL------KPYAVKPVSAG----GDSNATTTSPALSLPG 120
            NVSV SSCSSDS SSN S K +      K   VKPV A      D      SP L  P 
Sbjct: 61  SNVSVDSSCSSDSSSSNSSVKTVSSKKTVKRIGVKPVKAKVAPTADEVVAEPSPVLPEPL 120

Query: 121 KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKV 180
           KRCDWIT  SDPLY + HD+EWGVP+HDD+KLFELLV SQALAEL+WP IL+KRDIFRK+
Sbjct: 121 KRCDWITPFSDPLYTSLHDKEWGVPVHDDRKLFELLVFSQALAELSWPTILNKRDIFRKL 180

Query: 181 LNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSF 240
            ++FDPSSIAQFTE +  +LKVNG  LLSEPKLRA+V+NA Q+LK+Q+EFGSFS+YCW F
Sbjct: 181 FDNFDPSSIAQFTEKKLLSLKVNGSLLLSEPKLRAVVENAKQMLKVQQEFGSFSSYCWGF 240

Query: 241 VNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSC 300
           VN KPIRN  RY RQVPVKTPKAE +SKDM++RGFRCVGPTVVYSFMQVAGIVNDHLV+C
Sbjct: 241 VNHKPIRNGFRYVRQVPVKTPKAELISKDMMQRGFRCVGPTVVYSFMQVAGIVNDHLVTC 300

Query: 301 FRYEECDPKVKDDKKLRVE 310
           FRY+EC+  VK D K  +E
Sbjct: 301 FRYQECNANVKKDIKPEIE 319

BLAST of CSPI03G42480 vs. NCBI nr
Match: gi|823224426|ref|XP_012444979.1| (PREDICTED: uncharacterized protein LOC105769104 [Gossypium raimondii])

HSP 1 Score: 389.8 bits (1000), Expect = 4.5e-105
Identity = 202/320 (63.12%), Postives = 238/320 (74.38%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVAT+ +S  KP  EPRAILGP GNR R     K + E LKK ++    +    +SV+Q
Sbjct: 1   MSVATRPRSSTKPLTEPRAILGPAGNRVRVSDESKRRTEALKKPQRPKVPVSQSPKSVVQ 60

Query: 61  DNVSVGSSCSSDSLSSNYS------AKLLKPYAVKP----VSAGGDSNATTTSPALSLPG 120
            NVSV S CSSDS SSN S       K +K   VK     V++  D   T  SPA+S P 
Sbjct: 61  SNVSVDSCCSSDSSSSNSSFKTASSRKTVKQNGVKQAKPKVASTADEVVTEISPAMSGPL 120

Query: 121 KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKV 180
           KRCDWIT  SDPLY +FHDEEWGVP+HDD+KLFELLV SQALAEL+WP +L KR+IFRK 
Sbjct: 121 KRCDWITPFSDPLYTSFHDEEWGVPVHDDRKLFELLVFSQALAELSWPTVLKKREIFRKF 180

Query: 181 LNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSF 240
            +DFDPSS+AQFTE +  +LKV+G  LLSE KLRAIV+NA  +LK+Q+EFGSFS+YCW F
Sbjct: 181 FDDFDPSSMAQFTEKKMLSLKVDGCLLLSEAKLRAIVENAKLILKVQQEFGSFSSYCWGF 240

Query: 241 VNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSC 300
           VN KP+RN  RY RQVPVKTPKAE MSKDM+RRGF CVGPTVVYSFMQVAGIVNDHLV+C
Sbjct: 241 VNHKPLRNAFRYARQVPVKTPKAEVMSKDMMRRGFCCVGPTVVYSFMQVAGIVNDHLVTC 300

Query: 301 FRYEECDPKVKDDKKLRVED 311
           FRY+EC+  VK D K ++E+
Sbjct: 301 FRYQECNATVKKDIKPKIEE 320

BLAST of CSPI03G42480 vs. NCBI nr
Match: gi|728813161|gb|KHG01098.1| (putative GMP synthase [glutamine-hydrolyzing])

HSP 1 Score: 388.7 bits (997), Expect = 9.9e-105
Identity = 202/320 (63.12%), Postives = 240/320 (75.00%), Query Frame = 1

Query: 1   MSVATKLQSHAKPALEPRAILGPGGNRDRAPQNPKCKPETLKKTEKQSKALPAISESVIQ 60
           MSVAT+ +S AKP  EPRAILGP GNR R     K + E LKK ++    +    +SV+Q
Sbjct: 1   MSVATRPRSSAKPLTEPRAILGPAGNRVRVSDESKRRTEALKKPQRPKVPVSQSPKSVVQ 60

Query: 61  DNVSVGSSCSSDSLSSNYS------AKLLKPYAVKP----VSAGGDSNATTTSPALSLPG 120
            NVSV S CSSDS SSN S       K +K   VK     V++  D   T  SPA+S P 
Sbjct: 61  SNVSVDSCCSSDSSSSNSSFKTASSRKTVKQNGVKQAKPKVASTADEVVTEISPAMSGPL 120

Query: 121 KRCDWITLHSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPLILSKRDIFRKV 180
           KRCDWIT  SDPLY +FHDEEWGVP+H+D+KLFELLV SQALAEL+WP IL KR+IFRK+
Sbjct: 121 KRCDWITPFSDPLYTSFHDEEWGVPVHNDRKLFELLVFSQALAELSWPTILKKREIFRKL 180

Query: 181 LNDFDPSSIAQFTENEFTTLKVNGIQLLSEPKLRAIVDNANQVLKIQKEFGSFSNYCWSF 240
            ++FDPSS+AQFTE +  +LKV+G  LLSE KLRAIV+NA  +LK+Q+EFGSFS+YCW F
Sbjct: 181 FDNFDPSSMAQFTEKKMLSLKVDGCLLLSEAKLRAIVENAKLILKVQQEFGSFSSYCWGF 240

Query: 241 VNKKPIRNRHRYNRQVPVKTPKAEFMSKDMIRRGFRCVGPTVVYSFMQVAGIVNDHLVSC 300
           VN KP+RN  RY RQVPVKTPKAE MSKDM+RRGF CVGPTVVYSFMQVAGIVNDHLV+C
Sbjct: 241 VNHKPLRNAFRYARQVPVKTPKAEVMSKDMMRRGFCCVGPTVVYSFMQVAGIVNDHLVTC 300

Query: 301 FRYEECDPKVKDDKKLRVED 311
           FRY+EC+  VK D K ++E+
Sbjct: 301 FRYQECNATVKKDIKPKIEE 320

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GUAA_HELHP1.5e-3943.48Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ... [more]
3MG1_ECOLI7.5e-3639.34DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 S... [more]
3MGA_HAEIN6.0e-3341.34DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D... [more]
Match NameE-valueIdentityDescription
A0A0A0LG22_CUCSA1.3e-17599.04Uncharacterized protein OS=Cucumis sativus GN=Csa_3G855280 PE=4 SV=1[more]
A0A061FDE4_THECC6.7e-10864.89DNA-3-methyladenine glycosylase, putative OS=Theobroma cacao GN=TCM_033995 PE=4 ... [more]
A0A0D2TRX9_GOSRA3.1e-10563.13Uncharacterized protein OS=Gossypium raimondii GN=B456_009G204500 PE=4 SV=1[more]
A0A0B0MFZ1_GOSAR6.9e-10563.13Putative GMP synthase [glutamine-hydrolyzing] OS=Gossypium arboreum GN=F383_2228... [more]
W9RT44_9ROSA1.2e-10464.02Putative Glutamine amidotransferase OS=Morus notabilis GN=L484_021208 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G75090.11.5e-8753.99 DNA glycosylase superfamily protein[more]
AT5G57970.13.8e-7051.97 DNA glycosylase superfamily protein[more]
AT1G80850.11.9e-6946.60 DNA glycosylase superfamily protein[more]
AT1G15970.12.1e-6842.82 DNA glycosylase superfamily protein[more]
AT1G13635.15.3e-5641.11 DNA glycosylase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449460123|ref|XP_004147795.1|1.8e-17599.04PREDICTED: uncharacterized protein LOC101206397 [Cucumis sativus][more]
gi|659133198|ref|XP_008466607.1|2.4e-16794.89PREDICTED: uncharacterized protein LOC103503975 [Cucumis melo][more]
gi|590593103|ref|XP_007017467.1|9.6e-10864.89DNA-3-methyladenine glycosylase, putative [Theobroma cacao][more]
gi|823224426|ref|XP_012444979.1|4.5e-10563.13PREDICTED: uncharacterized protein LOC105769104 [Gossypium raimondii][more]
gi|728813161|gb|KHG01098.1|9.9e-10563.13putative GMP synthase [glutamine-hydrolyzing][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005019Adenine_glyco
IPR011257DNA_glycosylase
Vocabulary: Biological Process
TermDefinition
GO:0006284base-excision repair
GO:0006281DNA repair
Vocabulary: Molecular Function
TermDefinition
GO:0008725DNA-3-methyladenine glycosylase activity
GO:0003824catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G42480.1CSPI03G42480.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005019Methyladenine glycosylasePFAMPF03352Adenine_glycocoord: 119..292
score: 1.6
IPR011257DNA glycosylaseGENE3DG3DSA:1.10.340.30coord: 111..293
score: 1.1
IPR011257DNA glycosylaseunknownSSF48150DNA-glycosylasecoord: 111..296
score: 1.1
NoneNo IPR availablePANTHERPTHR31116FAMILY NOT NAMEDcoord: 2..310
score: 2.4E
NoneNo IPR availablePANTHERPTHR31116:SF53-METHYLADENINE GLYCOSYLASE I-RELATEDcoord: 2..310
score: 2.4E

The following gene(s) are paralogous to this gene:

None