Cp4.1LG02g08970.1 (mRNA) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG02g08970.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionUracil-DNA glycosylase
LocationCp4.1LG02: 6309188 .. 6316355 (+)
Sequence length1515
RNA-Seq ExpressionCp4.1LG02g08970.1
SyntenyCp4.1LG02g08970.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGTGACGCATATCCCATAGGCTACGATCAAATTTCAAACAAGCGGGGAGAGTGGAAAGCTTTTAGCGGATTCTCTGTTGGTTCCCGCCATTAAAGCCTGCTCCACCTCCACTCTACCGGAGCAACCAGACCCAAAATGGCTTCCTTCTCGCCTTCACTTAAATCCAAAACCAGAACCCTAATCGACATCTTCCAGCCAGCGCTTTCCAAACGCTTAAAAACCTCACAGACGCTGAAAACCCTTGCAACCACGGACGACAAATGCGATTCAGAGCTCACATTGGCTTCCTCTTCGATGGACATCTCTTCCTCTCAGAAATCCCGCATGGAAACCAACAAATGGTTGGCCAGATCGAAGCGCAATCTCAAAATTTCCTCAGATAGGGTTTCCAAATGGGAGAATGGATGTGTGAAGTTGGAGGAGCTTTTGGTGGACGAGACATGGTTCGAAGCCCTTCCCGGAGAGTTTGAGAAGCCCTATGCTCTTAACCTCTGCAAGTTCGTAGAGACGGAGATTTGCAGCAGCGGTGTCCCTGTTTATCCTCCTCCCTCTTTGATCTTTAATGCTCTGAATTCTACCCCTTTCGATAGGGTGAAAGTTGTTATTCTCGGTCAAGATCCTTATCATGGGCCTGGTCAAGCTATGGGTCTTTCTTTTTCTGTTCCTGAAGGAGTTAAAATCCCATCAAGTCTTCTCAACATATTCAAGGAACTGAGGGAAGATCTTGGTTGTTCCATTCCATCCCACGGAAATCTCGAGAAATGGGCTGTTCAGGTGCTGTTGTCTCTCATCTCTTTATTCTAAATACTTAGAAATGCATACTTAGATGTAGCTACTTCTAATGTGTTGGGGCTACAGAAATGCTACCCATAAAGATCTTGCTTATTTTTCATTAGATGAAAGATATGTAAATATAGGTTTGGGGTAGATGGATTATATAATAGATATGGTTCATCACAAGGTTCGGATTTTGGTGACTGCTAAGGGAATGAAAAAATATTTTCTGAAATCGGGTGCCTTTTCTTGAATTAAGAACACAAAGATAACTAATAAGCTATGCTGGTTTCAAAATATATATTTTTTAAAGTCATTAATTCAAGTTGGAAATTTGGGTTGTAATTAGGTGACTTCTCTAAGTTGAGAGATTGGATCATACAATTAGTAATTCATATTGGCAAAGGATTCTAAATGAAGTTTTTCATGACATTACTCGTATGAATCTGGGCTTGTTTGGCTTCAAAGGATAAGTTGATCAATTCGGGAGAACTTTTCTGCGTGGAAAAATGCATGAGTAGTTATTGTGATCTGAAAAAATTATTGTGTAAGAATGTAGTCAGAGTTCTATCTAGTGTCTTTTTCACTATTATGTTGTCTAAAATGATTTTTATATACATTGTAGGGAGTCTTGTTGCTTAATGCTGTTCTCACAGGTAATAATCTTCTTTTCACCTTAAGTTCCATACCTTCTTTATCAATGTACTTCCATTTCTAAAATCAGTAAAGTGAAGTTGAGGTAATATGTATAACGCCTTGTTATGTTCTTTGTGATTGTTTTTAAACAAATATTTCATTGATGAGTTTTGCTTTTGACCATCTAAAAAATTTTTCAATAGCTACCAGCTTAAAAGATACTAGCTAACTTTAATGCCATCGGCAACATCTATGACATCAAACCAAAAACTCATCAAACATCACCAAACCGTCACAAAGAGCAACTGCGCTAATATATAGCCATTTCCAGCAGCTAACTGTTCAAGCCGAGATTAGCTACAACTAAAGAAAAATTATAATCTTTCCAAGTTTGCTCATAAACTGCCACCTTATCCTGGGAGAAAAAACAGCAAAAGGCCTCCAAATGGGTGATCTTCGACAACAGGCTTTATAAAGTAACAACCTAACATTGCTGCAATGGAAGCCCTTGGGGAACAAGTTAACAAACACTCCAGCCCCTCAAAACATCCATTATAAAAAGCCAATAACATCAGACTCAAAGACGAGGCCAAAGCGCCCTTTTGAAGATTGATAATCAAACCAAAAAGAACCGAAAACACAAGACGTCGATAGACACTCAGAAGGAATCCACTGCTAGAATCAACACTTTAAAGGATGTAGTTTCTTGTTTGTTCCCATACATTAATAAGAGTTTTACATTATTTTGCTTTTAGTAACTATACATGCTTTCCTGCATTTACTTATTGATAAATACTTTGGGTTTTCCTGTTTAGTTCGAAAGCATCAAGCCAACTCTCATGCTAAAAAAGGATGGGAACAATTCACCGATGCTGTCATCCAAACAATATCACAAAAGAAGGAAGGAGTTGTCTTTCTCCTTTGGGGGAACTCTGCTCAAGCGAAATTGAGGTACTTCTATTGTATGATATTAGTTCTAGCAGGGAGTGCATGACTCTGGAAAGGACCAAAGTAATAATCGAATCATGGTTAAAATATAAACAACGACCTTTTAATTTTTCTTGTTGAAAAAAGAAAATCAGTGACCTTGAGGTTAAAGTTTTTTCTTCTATGTAAATTTCATCTATTGATCAGAATCTGGCATTACAAATTTAATTGGGAACAGGAAATAAAAACCAAGAACTTTCAGTTGAAGACTTGATGTATTTTAATAGATGAAGTGATGTAGGATGAAATATTAGATAATTACTATAATTTAGTTAATATTAGTCTAGTTGACATGGTACTCTATGAACTTGCCTTGGAGTTCGCCGGACACAAGTTTTAACACCTATGCATCAAGTTGGATCAGCGGCGAAGGTTTGAATGGATTTACCTTGGTTAAAGGCCAAGAGTTAAGCGCACTAGTTCCTATCTCAAGGCTAAAACACTCCAGAAGTTTGGTGACCAACCAAGCTTGAATGCCTCAAAACATGCGATCCTAAACACAAGTCTTCAACTTCAATCTGTTGAAACAAGAGGACCACTTGGGTTTATATAGCCAAGGAGTCCAAAAACCAACCTAGATTCTCTAAGTAATATAATTTTATTTAGAATAACCAACCTATTCTAGAAACCTTAAAAGGAATACAAGTTCAAAAATCTTCTAAAAATTCTCCTAAAATATCCTTCTAGAAACATTGTAGGAGTGAGACTAAGTAATGAAGGTTGAACATGTGTTTTTGGCCTGCTTGGTCGATAGCGTGTGTATAGTGAACTCCAAGAGAAGTTCATAAAGCACTATAGCACTAGCTGTTAACTTTTAGATTAACTAGTATTTGATTAATATTTTAGATTTGATTCGTTGCTTGTTAATTTTATTGTAAGGATATAAATATCCACTGTTTAGGATTATAAAAAAACTTTTGAATATTATTTGAAACATGACTTTTTTAAGAAGAAAATAGAACTTTGTTTTTAGAGAGTGTTCTCTCAACTTTTAGGATGAATTTTCCTTGCTTGGCCATAATTTAGCTGTTAGGTACCTATAGAACTTTGTTCTTAAAGAGTGTTCTCTCAACTTTTGAGTGAATTTTCTTTGTTTGGCCATGTTTAGCTGTTAGGTACCCAAACACTGAAAATACACAGAAACTCAAAGATAACACAAGATAGTAATATATTGCAATATTCAAATAAGTCGCAATGTAACAATAGCTTTTAGGGCTATCTCTCTTCCAATGTCCTACAATGGATAATCCCACTCAAATTGATCCCCTTTTAAAACCTCAACTCCCTTTGCTCTTAACCAAGATCCCCTGGATAATTACCACTATACCCCTTACTAATACGCTACTAATATTCTCAATATATACCCTTCCTAGTACTCTCACATCAGCCTAATCTCAAATCCATTACATCAATTGATACGAGAGCCCTTGGAGGTTCTTGGAAAGGTACAAAACTATATACAGTGGTCGAAGATGGCAACATCAACAGTTGACTCCTAGAATTTGTTTCGTCGTCTGGGTATTTACCACCACCATATTTATCTTCATCACTTCCATATTCATCAATCCACTATCATCATACCACATCCAAACATTCCGAACCATCCAAACCAATGTTTCAGCAAACAAAAAACAAGAAACGCCACACTTTCTCAACTAATTTCATAGTCAACTTCGACAACTTTGCAAAGATTAGATGCTCTTGTGTACCATTAAAGAATTCCCTCCAAATCAGCCTAACTAACTACAAGGGCAGAGACAAAGCAAATGATAGTGATGTAGAAGTCGATTACATTATTAAAGAAGTTGAAGAAGACGATGCTGTTATAATAATCAAAGCTAAAGATAAAGACATCGAAAAACACAAGGAAGATGATGAGGAAGTCAGAAAGGACAAGGATTAAAAGAGGAAGTCAAAAAAGAAGAGGAAAATGAAGGTGACTTGATGTCGGACAAGAACATACGATGCTACATGTTGCAACATTGTTAGAGGAAATAACCTTTGTTGTTGGGCTTTGTAAGAGAGATTTGTTTTGATCGCCATGGAATTGTTGAGTGCTACAATTTAGTTAGTGGCTGTTGGTTCATCAGAGGTGACCAATTTGTCAGAGCTTGATGTTGTCAGACAAGGACATAAGATGCTAGATGTTGTGGCATTGTTGGAGGGGGAAGATTTTGCTGCTACAGCTTTTTCAATGGGTTTATTTTGAACATTGCATCTTCATAGGAGGATGCTGGTTTGTCGGAAAGGAACGTTTACGGCGCCACAATGTCATCTGAGGGGCACATTGCAACGCAGAAAAAAAAGGAAAGAAAGAAAGTGGAGACATTGTGGCGCTGATGCAAAGAAATGTGAAAACAGAACGTGCTTCGTATTTTTGTTAATCCTAAACTTACGTGAGACCATATTGACTATAGCATTGAGCCCATTAGTTGGTCACTGGAAGCCTAATTTCCACTTCATTTTCGTGAGATTTTCATGGAGTATTATTTAATTCCAATCTCAATCCAGAAAAAATTTGCACCAAGATTTTTCAATTGGAGTTTACCACACTCAAAAGAGCAGAGGAGAGTTTTCATTTTTGTTACCTCAAACTTGAGGATAGGTCCTTTTTCTACGTTGGGTGATGTTAGTTACTCAATATAACTTGGTTAATATTACTTTAGTTGTTAAGTTAACTTTAGACTAATTAGTTTTAAATTAGTATTTTGCATTTAACTCGTTTGTGTTAATTGTATTGCAAGCCTATAAGTAGCCACCTTTTAGGATTGTGAGGAAACTCTTGAATATTATTTTGAAATAGAACTTTAACTTTAGTGTTCTCTCAACTAAAGGAATGGATTTCTCTTGTTTGGCCTTGGATGGGACCTAATCACAAATTCTGTGCATCATAAGAACAGTAGGGTCAGGAGTAATAAAACTTTGTAATTCCTGCATGCTATTAGGAGTGTTTGTTTTTAGCTGATTTGTGGTTTCGTTGGTTCATTAGCGGGAGTAAGACTCTTGGGTTCATTGGCCAGAGTAAGACTCGTGTGCATCTTTTTCTTTGTTCAGCAGCACATAGGTTCTTGCTTTCCTCCGTGAACTCTAAGACCTTCAGGCATGCGTGTCTCTGTGTGTGTTTTACCTCTCTAATATGACTTTCACCCTCTTGAATTTGGATTATTTCCCACAACCATCGCCCCTATTCCACAGCCTGGAGTGCTGTTCTCATCTGATAATTTTTCTTCTATCTAAGTGCTGCAAGTTTACTCATCATTTCTTCTAAAATGTTCTGCAATGTCAGCATATTCTCCTAGACTCTGAATCCTCAAGTTTGAATTGCTTTAATACCAATTTGATTGGATTGTTATGGTAGAAATTAATGAAAGCTACTACCAAATTAGAAAGGAAAATTAATTTGAGAGGCTAACCACTCAACTTTGACGAACTTATTAAGTTACTATGTATATGCCACTGGTAATATTCCTTATATTGCTCAATTGCATTTCATATTAATAACTCACATATCCCATGCAAATATAAACGCCACACCCACATTTTAGGGGATGTGTGGTAAATACTTTACTTTAGTCGTCATTTGGTACATGTCTTGCTATGCTGGCTCTAAGCTTACAAGTATAGGAACTTTTCCTGTGATGAACAGGAAACTATTTGAGAAAACATGGTTTTGCTGAGCTTTAAAATCAGCATAGTGCAATTTTCATTTACTTTATCTAGCCTAGAACAGCTTAGGTTCTATTGTCTTGATAATTAACTTACAAGAGCATCATCAGAGGGGTTTAATTCCTCTTCTTTCTTGGCAATAGCGTCCTACTTCACATAGAATCTGATGTATGAACAGGTTAATTGATGAGAAAAAACATCACATTCTCAAAGCCGCGCATCCTTCTGGTTTGTCGGCCAACAGAGGCTTCTTTGGTTGCAGGTTAGTTCCATTGTTTAATTCCATTATAAATTTTTTAAACTTTCCAGTTCAAATATTAATTGGAATTTGCTCACACCTTTGTACTTCATCATTTCCCTTCAATATGGTTCTTACAATAACAACATTTGCTATATGCCACTTGGTATTTTGTGTATATACAGAACACTTGGTTCTCTGCCTTTTTTCTCTCTCAAAGACTTGAAAACACTTAAGGAACATAGATGTTTATGGATAAGAGAGAAAATGCTTCAAGTGATTCACTAATATGAGGCAAAGTTGATGGATGCCTTTTCATTTCCCTTGGAATGTTTTCTCATCAAGATTGTTTACTGCCCATTGTTGATTTTCTACAGGCATTTTTCTCGAACAAACATGCTTCTCAAGGAACTGGGTATTGGCGCCATAGATTGGCAACTCTGATCAAAATACTTGCTTGAACCATTGCAGTTAATTTTGTGGCTCCACAGTCTGCTGGATTGACGATTTTTGTGGAATTTTCACTTGTTGATTCATCATTGTTTAAATGATGAGATGCCCATCCTTTCCCATTTTTGTATTATAGTGTTTATACTTGTAGTTAGATTTGATCCCAAAGCAGCAAAACCCTTCTTTATGAATTAGGGTAAAATTAGAAGAAAGTTGGGCTAAATGTGTCTTGGTAGCAAATGTTTGGGTGTTCTTTCAACTCTATTGTTTAAAAGATTTTCTTTTAGCCCCCTTGTGCCTTTAGACTTGTTATCGATGAATGTTCAAATGAATTGTATAATACTTAGTAAGTTAATATAAGTTTAAAGGAGAAATTAGGTCCTTCAATAATTAAAATAGATGATATTTCTACGATTA

mRNA sequence

AGTGACGCATATCCCATAGGCTACGATCAAATTTCAAACAAGCGGGGAGAGTGGAAAGCTTTTAGCGGATTCTCTGTTGGTTCCCGCCATTAAAGCCTGCTCCACCTCCACTCTACCGGAGCAACCAGACCCAAAATGGCTTCCTTCTCGCCTTCACTTAAATCCAAAACCAGAACCCTAATCGACATCTTCCAGCCAGCGCTTTCCAAACGCTTAAAAACCTCACAGACGCTGAAAACCCTTGCAACCACGGACGACAAATGCGATTCAGAGCTCACATTGGCTTCCTCTTCGATGGACATCTCTTCCTCTCAGAAATCCCGCATGGAAACCAACAAATGGTTGGCCAGATCGAAGCGCAATCTCAAAATTTCCTCAGATAGGGTTTCCAAATGGGAGAATGGATGTGTGAAGTTGGAGGAGCTTTTGGTGGACGAGACATGGTTCGAAGCCCTTCCCGGAGAGTTTGAGAAGCCCTATGCTCTTAACCTCTGCAAGTTCGTAGAGACGGAGATTTGCAGCAGCGGTGTCCCTGTTTATCCTCCTCCCTCTTTGATCTTTAATGCTCTGAATTCTACCCCTTTCGATAGGGTGAAAGTTGTTATTCTCGGTCAAGATCCTTATCATGGGCCTGGTCAAGCTATGGGTCTTTCTTTTTCTGTTCCTGAAGGAGTTAAAATCCCATCAAGTCTTCTCAACATATTCAAGGAACTGAGGGAAGATCTTGGTTGTTCCATTCCATCCCACGGAAATCTCGAGAAATGGGCTGTTCAGGGAGTCTTGTTGCTTAATGCTGTTCTCACAGTTCGAAAGCATCAAGCCAACTCTCATGCTAAAAAAGGATGGGAACAATTCACCGATGCTGTCATCCAAACAATATCACAAAAGAAGGAAGGAGTTGTCTTTCTCCTTTGGGGGAACTCTGCTCAAGCGAAATTGAGGTTAATTGATGAGAAAAAACATCACATTCTCAAAGCCGCGCATCCTTCTGGTTTGTCGGCCAACAGAGGCTTCTTTGGTTGCAGGCATTTTTCTCGAACAAACATGCTTCTCAAGGAACTGGGTATTGGCGCCATAGATTGGCAACTCTGATCAAAATACTTGCTTGAACCATTGCAGTTAATTTTGTGGCTCCACAGTCTGCTGGATTGACGATTTTTGTGGAATTTTCACTTGTTGATTCATCATTGTTTAAATGATGAGATGCCCATCCTTTCCCATTTTTGTATTATAGTGTTTATACTTGTAGTTAGATTTGATCCCAAAGCAGCAAAACCCTTCTTTATGAATTAGGGTAAAATTAGAAGAAAGTTGGGCTAAATGTGTCTTGGTAGCAAATGTTTGGGTGTTCTTTCAACTCTATTGTTTAAAAGATTTTCTTTTAGCCCCCTTGTGCCTTTAGACTTGTTATCGATGAATGTTCAAATGAATTGTATAATACTTAGTAAGTTAATATAAGTTTAAAGGAGAAATTAGGTCCTTCAATAATTAAAATAGATGATATTTCTACGATTA

Coding sequence (CDS)

ATGGCTTCCTTCTCGCCTTCACTTAAATCCAAAACCAGAACCCTAATCGACATCTTCCAGCCAGCGCTTTCCAAACGCTTAAAAACCTCACAGACGCTGAAAACCCTTGCAACCACGGACGACAAATGCGATTCAGAGCTCACATTGGCTTCCTCTTCGATGGACATCTCTTCCTCTCAGAAATCCCGCATGGAAACCAACAAATGGTTGGCCAGATCGAAGCGCAATCTCAAAATTTCCTCAGATAGGGTTTCCAAATGGGAGAATGGATGTGTGAAGTTGGAGGAGCTTTTGGTGGACGAGACATGGTTCGAAGCCCTTCCCGGAGAGTTTGAGAAGCCCTATGCTCTTAACCTCTGCAAGTTCGTAGAGACGGAGATTTGCAGCAGCGGTGTCCCTGTTTATCCTCCTCCCTCTTTGATCTTTAATGCTCTGAATTCTACCCCTTTCGATAGGGTGAAAGTTGTTATTCTCGGTCAAGATCCTTATCATGGGCCTGGTCAAGCTATGGGTCTTTCTTTTTCTGTTCCTGAAGGAGTTAAAATCCCATCAAGTCTTCTCAACATATTCAAGGAACTGAGGGAAGATCTTGGTTGTTCCATTCCATCCCACGGAAATCTCGAGAAATGGGCTGTTCAGGGAGTCTTGTTGCTTAATGCTGTTCTCACAGTTCGAAAGCATCAAGCCAACTCTCATGCTAAAAAAGGATGGGAACAATTCACCGATGCTGTCATCCAAACAATATCACAAAAGAAGGAAGGAGTTGTCTTTCTCCTTTGGGGGAACTCTGCTCAAGCGAAATTGAGGTTAATTGATGAGAAAAAACATCACATTCTCAAAGCCGCGCATCCTTCTGGTTTGTCGGCCAACAGAGGCTTCTTTGGTTGCAGGCATTTTTCTCGAACAAACATGCTTCTCAAGGAACTGGGTATTGGCGCCATAGATTGGCAACTCTGA

Protein sequence

MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQKSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLCKFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGVKIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQFTDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFSRTNMLLKELGIGAIDWQL
Homology
BLAST of Cp4.1LG02g08970.1 vs. ExPASy Swiss-Prot
Match: Q9LIH6 (Uracil-DNA glycosylase, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=UNG PE=1 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 1.4e-109
Identity = 210/329 (63.83%), Postives = 245/329 (74.47%), Query Frame = 0

Query: 10  SKTRTLIDIFQPALSKRLKT---------------SQTLKTLATTDDKCDSELTLASSSM 69
           S  +TL+D FQPA  KRLK                S+ L ++A +  +     ++A  S 
Sbjct: 4   STPKTLMDFFQPA--KRLKASPSSSSFPAVSVAGGSRDLGSVANSPPRVTVTTSVADDSS 63

Query: 70  DISSSQKSRMETNKWLARSKRNLKISSDRV--SKWENGC-VKLEELLVDETWFEALPGEF 129
            ++  Q +R E NK++A+SKRNL + S+RV  +K E  C V L ELLV+E+W +ALPGEF
Sbjct: 64  GLTPEQIARAEFNKFVAKSKRNLAVCSERVTKAKSEGNCYVPLSELLVEESWLKALPGEF 123

Query: 130 EKPYALNLCKFVETEIC--SSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQA 189
            KPYA +L  F+E EI   S    +YPP  LIFNALN+TPFDRVK VI+GQDPYHGPGQA
Sbjct: 124 HKPYAKSLSDFLEREIITDSKSPLIYPPQHLIFNALNTTPFDRVKTVIIGQDPYHGPGQA 183

Query: 190 MGLSFSVPEGVKIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQA 249
           MGLSFSVPEG K+PSSLLNIFKEL +D+GCSIP HGNL+KWAVQGVLLLNAVLTVR  Q 
Sbjct: 184 MGLSFSVPEGEKLPSSLLNIFKELHKDVGCSIPRHGNLQKWAVQGVLLLNAVLTVRSKQP 243

Query: 250 NSHAKKGWEQFTDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSA 309
           NSHAKKGWEQFTDAVIQ+ISQ+KEGVVFLLWG  AQ K +LID  KHHIL AAHPSGLSA
Sbjct: 244 NSHAKKGWEQFTDAVIQSISQQKEGVVFLLWGRYAQEKSKLIDATKHHILTAAHPSGLSA 303

Query: 310 NRGFFGCRHFSRTNMLLKELGIGAIDWQL 319
           NRGFF CRHFSR N LL+E+GI  IDWQL
Sbjct: 304 NRGFFDCRHFSRANQLLEEMGIPPIDWQL 330

BLAST of Cp4.1LG02g08970.1 vs. ExPASy Swiss-Prot
Match: Q88N05 (Uracil-DNA glycosylase OS=Pseudomonas putida (strain ATCC 47054 / DSM 6125 / NCIMB 11950 / KT2440) OX=160488 GN=ung PE=3 SV=1)

HSP 1 Score: 277.3 bits (708), Expect = 2.1e-73
Identity = 132/224 (58.93%), Postives = 162/224 (72.32%), Query Frame = 0

Query: 95  EELLVDETWFEALPGEFEKPYALNLCKFVETEICSSGVPVYPPPSLIFNALNSTPFDRVK 154
           + + ++ +W  AL GEF++PY   L +F+  E  ++G  +YPP  LIFNALNSTP D+VK
Sbjct: 5   DRIKLEPSWKAALRGEFDQPYMHQLREFLRGEY-AAGKEIYPPGPLIFNALNSTPLDQVK 64

Query: 155 VVILGQDPYHGPGQAMGLSFSVPEGVKIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQG 214
           VVILGQDPYHGPGQA GL FSV  GV  P SL+NI+KEL+ DL   IPSHG L+ WA QG
Sbjct: 65  VVILGQDPYHGPGQAHGLCFSVQPGVATPPSLVNIYKELQRDLNIPIPSHGYLQSWAEQG 124

Query: 215 VLLLNAVLTVRKHQANSHAKKGWEQFTDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEK 274
           VLLLN  +TV +  A SHAKKGWE FTD +IQ +S++   VVFLLWG  AQ+K +LID  
Sbjct: 125 VLLLNTTMTVERANAASHAKKGWELFTDRIIQVVSEQCPNVVFLLWGAHAQSKQKLIDGT 184

Query: 275 KHHILKAAHPSGLSANRGFFGCRHFSRTNMLLKELGIGAIDWQL 319
           KH +LK+ HPS LSA RGF GC HFSR N  L++ G+G IDW L
Sbjct: 185 KHLVLKSVHPSPLSAYRGFIGCGHFSRANSFLEQRGLGPIDWAL 227

BLAST of Cp4.1LG02g08970.1 vs. ExPASy Swiss-Prot
Match: A5W8H2 (Uracil-DNA glycosylase OS=Pseudomonas putida (strain ATCC 700007 / DSM 6899 / BCRC 17059 / F1) OX=351746 GN=ung PE=3 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 3.6e-73
Identity = 132/224 (58.93%), Postives = 162/224 (72.32%), Query Frame = 0

Query: 95  EELLVDETWFEALPGEFEKPYALNLCKFVETEICSSGVPVYPPPSLIFNALNSTPFDRVK 154
           + + ++ +W  AL GEF++PY   L +F+  E  ++G  +YPP  LIFNALNSTP  +VK
Sbjct: 5   DRIKLEPSWKAALRGEFDQPYMHQLREFLRGEY-AAGKEIYPPGPLIFNALNSTPLGQVK 64

Query: 155 VVILGQDPYHGPGQAMGLSFSVPEGVKIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQG 214
           VVILGQDPYHGPGQA GL FSV  GV  P SL+NI+KEL+ DL   IPSHG L+ WA QG
Sbjct: 65  VVILGQDPYHGPGQAHGLCFSVQPGVATPPSLVNIYKELQRDLNIPIPSHGYLQSWAEQG 124

Query: 215 VLLLNAVLTVRKHQANSHAKKGWEQFTDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEK 274
           VLLLN  +TV +  A SHAKKGWE FTD +IQ +S++   VVFLLWG  AQ+K +LID  
Sbjct: 125 VLLLNTTMTVERANAASHAKKGWELFTDRIIQVVSEQCPNVVFLLWGAHAQSKQKLIDGT 184

Query: 275 KHHILKAAHPSGLSANRGFFGCRHFSRTNMLLKELGIGAIDWQL 319
           KH +LK+ HPS LSA RGF GC HFSRTN  L++ G+G IDW L
Sbjct: 185 KHLVLKSVHPSPLSAYRGFIGCGHFSRTNSFLEQRGLGPIDWAL 227

BLAST of Cp4.1LG02g08970.1 vs. ExPASy Swiss-Prot
Match: C1DQR0 (Uracil-DNA glycosylase OS=Azotobacter vinelandii (strain DJ / ATCC BAA-1303) OX=322710 GN=ung PE=3 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 4.7e-73
Identity = 131/224 (58.48%), Postives = 164/224 (73.21%), Query Frame = 0

Query: 95  EELLVDETWFEALPGEFEKPYALNLCKFVETEICSSGVPVYPPPSLIFNALNSTPFDRVK 154
           + + ++ +W EAL  EFEKPY   L  F+  E  ++G  +YPP SLIFNAL+STP D+VK
Sbjct: 6   DRVRLEASWKEALHDEFEKPYMQELSDFLRRE-KAAGKEIYPPGSLIFNALDSTPLDQVK 65

Query: 155 VVILGQDPYHGPGQAMGLSFSVPEGVKIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQG 214
           VVI+GQDPYHGPGQA GL FSV  GV +P SL NIFKEL+ DL   IP HG+L++WA QG
Sbjct: 66  VVIIGQDPYHGPGQAHGLCFSVQPGVPVPPSLQNIFKELKRDLNIDIPKHGHLQRWAEQG 125

Query: 215 VLLLNAVLTVRKHQANSHAKKGWEQFTDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEK 274
           VLLLN  LTV +  A SHA  GW++FTD VI+ +SQ++E VVF+LWG+ AQ+K RLID  
Sbjct: 126 VLLLNTSLTVERGNAGSHAGMGWQRFTDRVIEVVSQRREHVVFMLWGSHAQSKRRLIDSS 185

Query: 275 KHHILKAAHPSGLSANRGFFGCRHFSRTNMLLKELGIGAIDWQL 319
           KH +L +AHPS LSA+RGF G  HFSR N  L++ G+  IDW L
Sbjct: 186 KHLVLCSAHPSPLSAHRGFIGNGHFSRANQFLEQHGLTPIDWHL 228

BLAST of Cp4.1LG02g08970.1 vs. ExPASy Swiss-Prot
Match: B0KV50 (Uracil-DNA glycosylase OS=Pseudomonas putida (strain GB-1) OX=76869 GN=ung PE=3 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 1.2e-71
Identity = 131/224 (58.48%), Postives = 161/224 (71.88%), Query Frame = 0

Query: 95  EELLVDETWFEALPGEFEKPYALNLCKFVETEICSSGVPVYPPPSLIFNALNSTPFDRVK 154
           + + ++ +W  AL GEF++PY   L +F+  E  ++G  +YPP  LIFNALNSTP D+VK
Sbjct: 5   DRIKLEPSWKAALRGEFDQPYMHQLREFLRGEY-AAGKEIYPPGPLIFNALNSTPLDQVK 64

Query: 155 VVILGQDPYHGPGQAMGLSFSVPEGVKIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQG 214
           VVILGQDPYHGPGQA GL FSV  GV  P SL+NI+KEL+ DL   I SHG L+ WA QG
Sbjct: 65  VVILGQDPYHGPGQAHGLCFSVQPGVATPPSLVNIYKELQRDLNIPIASHGYLQSWAEQG 124

Query: 215 VLLLNAVLTVRKHQANSHAKKGWEQFTDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEK 274
           VLLLN  +TV +  A SHAKKGWE FTD VIQ +S++   VVFLLWG  AQ+K +LID  
Sbjct: 125 VLLLNTTMTVERANAASHAKKGWEFFTDRVIQVVSEQCPNVVFLLWGAHAQSKQKLIDGT 184

Query: 275 KHHILKAAHPSGLSANRGFFGCRHFSRTNMLLKELGIGAIDWQL 319
           KH +LK+ HPS LSA RGF GC HFSRTN  L++ G+  I+W L
Sbjct: 185 KHLVLKSVHPSPLSAYRGFLGCGHFSRTNSFLEQRGMAPINWAL 227

BLAST of Cp4.1LG02g08970.1 vs. NCBI nr
Match: XP_023525611.1 (uracil-DNA glycosylase, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 636 bits (1640), Expect = 5.83e-230
Identity = 318/318 (100.00%), Postives = 318/318 (100.00%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ
Sbjct: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
           KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC
Sbjct: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF
Sbjct: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTNMLLKELGIGAIDWQL
Sbjct: 301 RTNMLLKELGIGAIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. NCBI nr
Match: XP_022981553.1 (uracil-DNA glycosylase, mitochondrial [Cucurbita maxima])

HSP 1 Score: 619 bits (1595), Expect = 4.23e-223
Identity = 308/318 (96.86%), Postives = 314/318 (98.74%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MAS S SLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTD+KCDSELTLASSSMDISSSQ
Sbjct: 1   MASSSASLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDEKCDSELTLASSSMDISSSQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
           KSRMETNKWLARS RNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEF+KPYALNLC
Sbjct: 61  KSRMETNKWLARSNRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFQKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFVETEICSSGVP+YPPP LIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVETEICSSGVPIYPPPCLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKELREDLGCSIPSHGNL+KWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF
Sbjct: 181 KIPSSLLNIFKELREDLGCSIPSHGNLKKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTN+LLKELGIG+IDWQL
Sbjct: 301 RTNVLLKELGIGSIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. NCBI nr
Match: KAG6607671.1 (Uracil-DNA glycosylase, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 613 bits (1582), Expect = 4.05e-221
Identity = 307/318 (96.54%), Postives = 313/318 (98.43%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MAS S SLKSKTRTLIDIFQPALSKR+KTSQTLKTLATTD+K DSELTLASSSMDISSSQ
Sbjct: 1   MASSSASLKSKTRTLIDIFQPALSKRIKTSQTLKTLATTDEKGDSELTLASSSMDISSSQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
           KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEF+KPYALNLC
Sbjct: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFDKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFVETEICSSGVP+YPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVETEICSSGVPIYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF
Sbjct: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TDAVI+TISQ KEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDAVIKTISQNKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTN+LLKELGI AIDWQL
Sbjct: 301 RTNVLLKELGIDAIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. NCBI nr
Match: XP_022926296.1 (uracil-DNA glycosylase, mitochondrial [Cucurbita moschata])

HSP 1 Score: 612 bits (1578), Expect = 1.65e-220
Identity = 306/318 (96.23%), Postives = 312/318 (98.11%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MAS S SLKSKTRTLIDIFQPALSKR+KTSQTLKTLATTD+K DSELTLASSSMDISSSQ
Sbjct: 1   MASSSASLKSKTRTLIDIFQPALSKRIKTSQTLKTLATTDEKGDSELTLASSSMDISSSQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
           KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEF+KPYALNLC
Sbjct: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFDKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFVETEICSSGVP+YPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVETEICSSGVPIYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF
Sbjct: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TD VI+TISQ KEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDVVIKTISQNKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTN+LLKELGI AIDWQL
Sbjct: 301 RTNVLLKELGIDAIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. NCBI nr
Match: KAG7028480.1 (Uracil-DNA glycosylase, mitochondrial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 610 bits (1572), Expect = 1.35e-219
Identity = 305/318 (95.91%), Postives = 312/318 (98.11%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MA  S SLKSKTRTLIDIFQPALSKR+KTSQTLKTLATTD+K DSELTLASSSMDISSSQ
Sbjct: 1   MAFSSASLKSKTRTLIDIFQPALSKRIKTSQTLKTLATTDEKGDSELTLASSSMDISSSQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
           KSRMETN+WLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEF+KPYALNLC
Sbjct: 61  KSRMETNEWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFDKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFVETEICSSGVP+YPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVETEICSSGVPIYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF
Sbjct: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TDAVI+TISQ KEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDAVIKTISQNKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTN+LLKELGI AIDWQL
Sbjct: 301 RTNVLLKELGIDAIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. ExPASy TrEMBL
Match: A0A6J1J2E4 (Uracil-DNA glycosylase OS=Cucurbita maxima OX=3661 GN=LOC111480636 PE=3 SV=1)

HSP 1 Score: 619 bits (1595), Expect = 2.05e-223
Identity = 308/318 (96.86%), Postives = 314/318 (98.74%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MAS S SLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTD+KCDSELTLASSSMDISSSQ
Sbjct: 1   MASSSASLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDEKCDSELTLASSSMDISSSQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
           KSRMETNKWLARS RNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEF+KPYALNLC
Sbjct: 61  KSRMETNKWLARSNRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFQKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFVETEICSSGVP+YPPP LIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVETEICSSGVPIYPPPCLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKELREDLGCSIPSHGNL+KWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF
Sbjct: 181 KIPSSLLNIFKELREDLGCSIPSHGNLKKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTN+LLKELGIG+IDWQL
Sbjct: 301 RTNVLLKELGIGSIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. ExPASy TrEMBL
Match: A0A6J1EEH8 (Uracil-DNA glycosylase OS=Cucurbita moschata OX=3662 GN=LOC111433476 PE=3 SV=1)

HSP 1 Score: 612 bits (1578), Expect = 7.98e-221
Identity = 306/318 (96.23%), Postives = 312/318 (98.11%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MAS S SLKSKTRTLIDIFQPALSKR+KTSQTLKTLATTD+K DSELTLASSSMDISSSQ
Sbjct: 1   MASSSASLKSKTRTLIDIFQPALSKRIKTSQTLKTLATTDEKGDSELTLASSSMDISSSQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
           KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEF+KPYALNLC
Sbjct: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFDKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFVETEICSSGVP+YPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVETEICSSGVPIYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF
Sbjct: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TD VI+TISQ KEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDVVIKTISQNKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTN+LLKELGI AIDWQL
Sbjct: 301 RTNVLLKELGIDAIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. ExPASy TrEMBL
Match: A0A0A0KMG3 (Uracil-DNA glycosylase OS=Cucumis sativus OX=3659 GN=Csa_5G289610 PE=3 SV=1)

HSP 1 Score: 593 bits (1528), Expect = 3.34e-213
Identity = 291/318 (91.51%), Postives = 307/318 (96.54%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MAS S SL SKTRTLIDIFQPALSKRLKTSQTLKTLAT DDKCDS+LTLASSS DIS+SQ
Sbjct: 1   MASSSASLSSKTRTLIDIFQPALSKRLKTSQTLKTLATNDDKCDSDLTLASSSADISASQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
            SRMETNKW+ARSKRNLK  SDRVSKWENGCVKLEELLV+ETWFEALPGEF+KPYALNLC
Sbjct: 61  ISRMETNKWIARSKRNLKTCSDRVSKWENGCVKLEELLVEETWFEALPGEFQKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFV+TEICSSGVP+YPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVQTEICSSGVPIYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKELR+DLGCSIPSHGNL KWAVQGVLLLNAVL+VRKHQANSHAK+GWEQF
Sbjct: 181 KIPSSLLNIFKELRDDLGCSIPSHGNLGKWAVQGVLLLNAVLSVRKHQANSHAKRGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TDAVI+TISQKKEG++FLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDAVIKTISQKKEGIIFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTN+LLKE+G  +IDWQL
Sbjct: 301 RTNILLKEMGTASIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. ExPASy TrEMBL
Match: A0A1S3CPW3 (Uracil-DNA glycosylase OS=Cucumis melo OX=3656 GN=LOC103502882 PE=3 SV=1)

HSP 1 Score: 592 bits (1526), Expect = 6.73e-213
Identity = 291/318 (91.51%), Postives = 308/318 (96.86%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MAS S SL SKTRTLIDIFQPALSKRLKTSQTLKTLAT DDKCDS+LTLASSS D+S+SQ
Sbjct: 1   MASSSASLSSKTRTLIDIFQPALSKRLKTSQTLKTLATNDDKCDSDLTLASSSTDMSASQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
            SRMETNKW+ARSKRNLKI SDRVSKWENGC+KLEELLV+ETWFEALPGEFEKPYALNLC
Sbjct: 61  ISRMETNKWMARSKRNLKICSDRVSKWENGCMKLEELLVEETWFEALPGEFEKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFV+TEICSSGVP+YPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVQTEICSSGVPIYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQF 240
           KIPSSLLNIFKEL++DLGCSIPSHGNL KWAVQGVLLLNAVL+VR+HQANSHAK+GWEQF
Sbjct: 181 KIPSSLLNIFKELKDDLGCSIPSHGNLGKWAVQGVLLLNAVLSVREHQANSHAKRGWEQF 240

Query: 241 TDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300
           TDAVI+TISQKKEG+VFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS
Sbjct: 241 TDAVIKTISQKKEGIVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFS 300

Query: 301 RTNMLLKELGIGAIDWQL 318
           RTN+LLKELG  +IDWQL
Sbjct: 301 RTNILLKELGTASIDWQL 318

BLAST of Cp4.1LG02g08970.1 vs. ExPASy TrEMBL
Match: A0A1S3CND7 (Uracil-DNA glycosylase OS=Cucumis melo OX=3656 GN=LOC103502882 PE=3 SV=1)

HSP 1 Score: 578 bits (1490), Expect = 5.19e-207
Identity = 291/343 (84.84%), Postives = 308/343 (89.80%), Query Frame = 0

Query: 1   MASFSPSLKSKTRTLIDIFQPALSKRLKTSQTLKTLATTDDKCDSELTLASSSMDISSSQ 60
           MAS S SL SKTRTLIDIFQPALSKRLKTSQTLKTLAT DDKCDS+LTLASSS D+S+SQ
Sbjct: 1   MASSSASLSSKTRTLIDIFQPALSKRLKTSQTLKTLATNDDKCDSDLTLASSSTDMSASQ 60

Query: 61  KSRMETNKWLARSKRNLKISSDRVSKWENGCVKLEELLVDETWFEALPGEFEKPYALNLC 120
            SRMETNKW+ARSKRNLKI SDRVSKWENGC+KLEELLV+ETWFEALPGEFEKPYALNLC
Sbjct: 61  ISRMETNKWMARSKRNLKICSDRVSKWENGCMKLEELLVEETWFEALPGEFEKPYALNLC 120

Query: 121 KFVETEICSSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180
           KFV+TEICSSGVP+YPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV
Sbjct: 121 KFVQTEICSSGVPIYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGV 180

Query: 181 KIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLT----------------- 240
           KIPSSLLNIFKEL++DLGCSIPSHGNL KWAVQGVLLLNAVL+                 
Sbjct: 181 KIPSSLLNIFKELKDDLGCSIPSHGNLGKWAVQGVLLLNAVLSATSRILNQKLFKHHQTV 240

Query: 241 --------VRKHQANSHAKKGWEQFTDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKK 300
                   VR+HQANSHAK+GWEQFTDAVI+TISQKKEG+VFLLWGNSAQAKLRLIDEKK
Sbjct: 241 MKSNKADIVREHQANSHAKRGWEQFTDAVIKTISQKKEGIVFLLWGNSAQAKLRLIDEKK 300

Query: 301 HHILKAAHPSGLSANRGFFGCRHFSRTNMLLKELGIGAIDWQL 318
           HHILKAAHPSGLSANRGFFGCRHFSRTN+LLKELG  +IDWQL
Sbjct: 301 HHILKAAHPSGLSANRGFFGCRHFSRTNILLKELGTASIDWQL 343

BLAST of Cp4.1LG02g08970.1 vs. TAIR 10
Match: AT3G18630.1 (uracil dna glycosylase )

HSP 1 Score: 397.5 bits (1020), Expect = 1.0e-110
Identity = 210/329 (63.83%), Postives = 245/329 (74.47%), Query Frame = 0

Query: 10  SKTRTLIDIFQPALSKRLKT---------------SQTLKTLATTDDKCDSELTLASSSM 69
           S  +TL+D FQPA  KRLK                S+ L ++A +  +     ++A  S 
Sbjct: 4   STPKTLMDFFQPA--KRLKASPSSSSFPAVSVAGGSRDLGSVANSPPRVTVTTSVADDSS 63

Query: 70  DISSSQKSRMETNKWLARSKRNLKISSDRV--SKWENGC-VKLEELLVDETWFEALPGEF 129
            ++  Q +R E NK++A+SKRNL + S+RV  +K E  C V L ELLV+E+W +ALPGEF
Sbjct: 64  GLTPEQIARAEFNKFVAKSKRNLAVCSERVTKAKSEGNCYVPLSELLVEESWLKALPGEF 123

Query: 130 EKPYALNLCKFVETEIC--SSGVPVYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQA 189
            KPYA +L  F+E EI   S    +YPP  LIFNALN+TPFDRVK VI+GQDPYHGPGQA
Sbjct: 124 HKPYAKSLSDFLEREIITDSKSPLIYPPQHLIFNALNTTPFDRVKTVIIGQDPYHGPGQA 183

Query: 190 MGLSFSVPEGVKIPSSLLNIFKELREDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRKHQA 249
           MGLSFSVPEG K+PSSLLNIFKEL +D+GCSIP HGNL+KWAVQGVLLLNAVLTVR  Q 
Sbjct: 184 MGLSFSVPEGEKLPSSLLNIFKELHKDVGCSIPRHGNLQKWAVQGVLLLNAVLTVRSKQP 243

Query: 250 NSHAKKGWEQFTDAVIQTISQKKEGVVFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSA 309
           NSHAKKGWEQFTDAVIQ+ISQ+KEGVVFLLWG  AQ K +LID  KHHIL AAHPSGLSA
Sbjct: 244 NSHAKKGWEQFTDAVIQSISQQKEGVVFLLWGRYAQEKSKLIDATKHHILTAAHPSGLSA 303

Query: 310 NRGFFGCRHFSRTNMLLKELGIGAIDWQL 319
           NRGFF CRHFSR N LL+E+GI  IDWQL
Sbjct: 304 NRGFFDCRHFSRANQLLEEMGIPPIDWQL 330

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LIH61.4e-10963.83Uracil-DNA glycosylase, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=UNG PE=... [more]
Q88N052.1e-7358.93Uracil-DNA glycosylase OS=Pseudomonas putida (strain ATCC 47054 / DSM 6125 / NCI... [more]
A5W8H23.6e-7358.93Uracil-DNA glycosylase OS=Pseudomonas putida (strain ATCC 700007 / DSM 6899 / BC... [more]
C1DQR04.7e-7358.48Uracil-DNA glycosylase OS=Azotobacter vinelandii (strain DJ / ATCC BAA-1303) OX=... [more]
B0KV501.2e-7158.48Uracil-DNA glycosylase OS=Pseudomonas putida (strain GB-1) OX=76869 GN=ung PE=3 ... [more]
Match NameE-valueIdentityDescription
XP_023525611.15.83e-230100.00uracil-DNA glycosylase, mitochondrial [Cucurbita pepo subsp. pepo][more]
XP_022981553.14.23e-22396.86uracil-DNA glycosylase, mitochondrial [Cucurbita maxima][more]
KAG6607671.14.05e-22196.54Uracil-DNA glycosylase, mitochondrial, partial [Cucurbita argyrosperma subsp. so... [more]
XP_022926296.11.65e-22096.23uracil-DNA glycosylase, mitochondrial [Cucurbita moschata][more]
KAG7028480.11.35e-21995.91Uracil-DNA glycosylase, mitochondrial [Cucurbita argyrosperma subsp. argyrosperm... [more]
Match NameE-valueIdentityDescription
A0A6J1J2E42.05e-22396.86Uracil-DNA glycosylase OS=Cucurbita maxima OX=3661 GN=LOC111480636 PE=3 SV=1[more]
A0A6J1EEH87.98e-22196.23Uracil-DNA glycosylase OS=Cucurbita moschata OX=3662 GN=LOC111433476 PE=3 SV=1[more]
A0A0A0KMG33.34e-21391.51Uracil-DNA glycosylase OS=Cucumis sativus OX=3659 GN=Csa_5G289610 PE=3 SV=1[more]
A0A1S3CPW36.73e-21391.51Uracil-DNA glycosylase OS=Cucumis melo OX=3656 GN=LOC103502882 PE=3 SV=1[more]
A0A1S3CND75.19e-20784.84Uracil-DNA glycosylase OS=Cucumis melo OX=3656 GN=LOC103502882 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G18630.11.0e-11063.83uracil dna glycosylase [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005122Uracil-DNA glycosylase-likeSMARTSM00986UDG_2coord: 146..306
e-value: 2.9E-39
score: 146.4
IPR005122Uracil-DNA glycosylase-likePFAMPF03167UDGcoord: 151..307
e-value: 3.5E-24
score: 85.6
NoneNo IPR availableSMARTSM00987UDG_2_acoord: 146..306
e-value: 2.9E-39
score: 146.4
IPR002043Uracil-DNA glycosylase family 1TIGRFAMTIGR00628TIGR00628coord: 101..308
e-value: 3.2E-80
score: 266.9
IPR002043Uracil-DNA glycosylase family 1PANTHERPTHR11264URACIL-DNA GLYCOSYLASEcoord: 34..318
IPR002043Uracil-DNA glycosylase family 1HAMAPMF_00148UDGcoord: 100..317
score: 38.755238
IPR002043Uracil-DNA glycosylase family 1CDDcd10027UDG-F1-likecoord: 115..316
e-value: 1.84669E-131
score: 370.241
IPR036895Uracil-DNA glycosylase-like domain superfamilyGENE3D3.40.470.10coord: 79..318
e-value: 2.3E-98
score: 330.6
IPR036895Uracil-DNA glycosylase-like domain superfamilySUPERFAMILY52141Uracil-DNA glycosylase-likecoord: 98..318
IPR018085Uracil-DNA glycosylase, active sitePROSITEPS00130U_DNA_GLYCOSYLASEcoord: 154..163

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG02g08970Cp4.1LG02g08970gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g08970.1:five_prime_utr:001Cp4.1LG02g08970.1:five_prime_utr:001five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g08970.1:exon:001Cp4.1LG02g08970.1:exon:001exon
Cp4.1LG02g08970.1:exon:002Cp4.1LG02g08970.1:exon:002exon
Cp4.1LG02g08970.1:exon:003Cp4.1LG02g08970.1:exon:003exon
Cp4.1LG02g08970.1:exon:004Cp4.1LG02g08970.1:exon:004exon
Cp4.1LG02g08970.1:exon:005Cp4.1LG02g08970.1:exon:005exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g08970.1:cds:005Cp4.1LG02g08970.1:cds:005CDS
Cp4.1LG02g08970.1:cds:004Cp4.1LG02g08970.1:cds:004CDS
Cp4.1LG02g08970.1:cds:003Cp4.1LG02g08970.1:cds:003CDS
Cp4.1LG02g08970.1:cds:002Cp4.1LG02g08970.1:cds:002CDS
Cp4.1LG02g08970.1:cds:001Cp4.1LG02g08970.1:cds:001CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g08970.1:three_prime_utr:001Cp4.1LG02g08970.1:three_prime_utr:001three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG02g08970.1Cp4.1LG02g08970.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
biological_process GO:0097510 base-excision repair, AP site formation via deaminated base removal
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005634 nucleus
molecular_function GO:0016799 hydrolase activity, hydrolyzing N-glycosyl compounds
molecular_function GO:0004844 uracil DNA N-glycosylase activity