Lsi01G018860 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi01G018860
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Ureidoglycolate hydrolase, putative
Location: chr01 : 21611298 .. 21615742 (+)
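The length of the region described by the Location field can be worked out from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention for genome annotations but is not stated on this page; `span_length` is a hypothetical helper name used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this gene.
print(span_length(21611298, 21615742))  # 4445 under the stated assumption
```

If the coordinates were instead 0-based and half-open, the `+ 1` would be dropped and the result would be 4444.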
The following sequences are available for this feature:

Gene sequence (with intron)

AAAAAAACTTCTATTTGCCCCTTTTCATCAACTTCTCTTCTTTCTCTCAAACTTCAAACTCTTTTTCTCACTTTTTCCAGTTTTCCTTATTACTGATTTTTTTTTTCTTATTCAACTTCTCCTCTTTTAGTTTACCGGAAATTTTCTTTATTAAGATATTAATTAATATATTTTCTTTAATATAAATAGAGAGAGAGGAATTGGAAGTTGTTAGCGATTTGGTAATCGAAAGCGAACGGCAAAGAGAGAAAGAAATGGAAGAGAAAACAATAATAAAGTTGAAAGCCATAGACGCAACAGCAGAGAGCTTCGCAGAGTACGGGCAAGTAATCGAGGCTACAGACGACGGCGCTGAATTCGGAGCTGAAGACGCTCAATTAGACCTCAGCAATGGAATCCCTAGGTAAATTTTCACAAAACAAATCCTTCAATTATTTATATTTGCTTAGCAAAATCTAGATCCTAGAGTTCTGAATCGATTAGATCTGAGAAAACAGGTTTTACATCCTTCACATCGAGAATCGACCATTGGAATTCTCGAGGATAACATATCACGCGAGAGTAACGCAGTGCCTGGGATCGGTGGATCGGGAGCCTTGGTATCTCGGAGTTGCGAAGCCGTCGATTGTTGTGGAAGAGGAGAATGGAGGCGGCGGGCATTTGTATGTGGCGCCGAGTGTGGATGAAATTCGGGCGTTTAGGATATCGGGAGCGAAGTTTGTGAAGCTGAATAAAGGGACATGGCATGCGGGCCCTCTGTTTAGAGAAAGCGCCAGAGATTTCTACAACTTGGAATTGACTGATACTAATGTGAGTTCTTCTTCTTCTTCTTTCCTTTTCTCTCTCTCTCTTTTTGCCTTTTTGCCTTTTTGCTTTCTTTTTCACAAACAGTCTGCCTTCTGTTTGATTGATCTGGTTCTCAACAAAATCTTCTACAGTTTTTTAGAATAAGTTCAAACCAATCAATATATCAATCTACACTCTCTCTGCTTGTTTAGATAAGGGTTGGAGTTGTGAAATTTGGAGATGTGTGTAGTTGTGAAGCCCAAAATTGGAAAAAATTCTAATAACGACGCCATACTACCTTCCTTCATTGTTTCCTAGTACTAAATTATGAAGGGAGGTAGTAGGTAATATCACAGTAAAATGATGATAGCTTTTTTTTTTTTTTTTTTTTTTATCTGTGGGAGTTACAAACTCTTTGTATTAAATTTAAAATTAGAAGTGGTTCTCATTGGTTTTGAAGACATTGTGTTGGCTAACGCCGACTCTGAACGGATGGTCGGCCTTTCTCAGTTGATGCTTCTGATGAACTAAACAAAAAAAACAAAAAAACAAAAAAAAAAAGTGAGAAGAAATAAAGAAAAAACATGGAAAGGAGAAGAAGTATGATAAAAAAGAAATGAATAATATCTACTCAGAAGCTCATAAACAAAGAAGATCCACAAGAAATTAAGAAAAAAATACCAAGAGGAGAAAGAAAAAATACAGAGGAAAAAAAAGAAGGAAAACAAACTCAATCGTGATGGTAGTAGTTGGAAGAGACGGTTGTGCCCATAGCAGGAGACGGTGGGAGAAGGGAAGAGATAAAAGAGAGTAGAAAAATTAAATGAAATCAATGGGAAAGGAAAAACGTGAGAAAGAGATTGAAACATTAAGAACTATTTGGGAGGCAAATGCAATCCCTGGAAATCAGACTAGTGTATTGGAATGTGTGAGAAATCGAAGACAAAATAAAAATGGGTTGAAAACTGTGCGAATGGAATGAGTTTGGTATTATAAATTAGGTCTAAAATGCTCTATCCAAATATGAGTTTTGGATTACATTACAATCAAGTCTCATTCCAAACCTCGCCTCCAAACAATCCCTATCAACATTATTTTAGCATTTTCTTTTAAATTTTCGAAAATTTGTATTTTCCTTGACCGACTGACTAATCGGTCAATTTGTTAAAAAACAAAACCTGAATGGTTACTAAACGAAGCCTTGATTTTTTAGT
TTTTAGTTTTTGAAAATTAAGCATATTATATTTCTTTCACCTTTAAATTTATCAATTTGTTATATATTTTTTTACCAATGTTTTCAAAAACCAAGTCAATGTTTGAAAACTAAAAAAAGTTTTACCAAGCAGGAATTTAAATATCTAATGGACACAAAATTTAAAATATAAGAATATATTAAATAAATATTAAAGTTTAGGAACTTGATACTTGAGCAACATAAACTTGAAAGCTTAAAAATCTATGAGACACTTTTTAAAGTTGAAACTATTAGATACAAAATTAAAAGTTTAAGGATATATTATATTTTCAAAAATTTGAAAGTCAAATAGATACAAACTCGAGAATTTAGAGGTTGAACTTGTAATTTAACTTTGAAATTTTAATAGATACACTCTGAACTAAGAGATTTAAATTATTTTTTCTCCTATTTTGAACATTATTTTTAAAATCCAAATGTTGCTCTATTGAAGACACTACCCATTTGAACCAACCTACTATTAGGTTAATCATTTAATAAAAATTAAAGAAAAAAAAGAAAAAGAATAGAAGCCAATCTGATCCATAGATACACCTGTTTCCATAGGGCCCCTCTAAATACTCAATTGAAACTTTTTGGCACACTAGTCACTTTTTCATGTTTAATACACAAAATTTGAACTTTTTAGTTTCTTTTACAAAAAGTAATGTATGGGTTATCATATAAAAAAGTTATTCATGATTCATATTTCGCATTATCCGTATTTAAAGATGTATATGAATGTCAAAATGAACATAGTTCAATTGACACGGTTTGTATTATCAACTTTAATTTCAGAGGTTCAATTTTCACCTCACATATTGGAAAAAAAATGACTTGTGAATAAAGTGATATTTAATAATTATTTCTTACTATTTTTTTATTAAGTAAAAATACTACGTTTTTGAGAAATATGATCTTAATTAATTAATGTTTTTAACATGTTTCAGAAAAAAAAAATCGTGAATAATTAGTACAAACTAACTATATAATTTAAGTTCAATTTATAAAACAACTATGATGTTTTAATTACTTCTACTTTTTAACTAATTTCACTTAATTAACTAATCTATTTCACAAAGAAGAATTATGATCAAATTTAAGTAAATTTATTATTATTTTTTAAACGGAGTCAAGTGATTTTGTTGGTTGCTATAACTTAGTGTAAGGAGAGTTTTTAAAAATATAAAAATAAGAGAAACTATTTACACAAAATAGTAAAATTTTTAGATAGTTGTGATAGATGCTGATAGAAGTCTATCAGGGTCTATCAGTGATAGAAATGATAGATGCTGATAGAAGTCTATCAATGTCTATCTGTGTTTTTTTTTTTTTTTACTATTTTCTGTAAATAGTTTGACATTTTTTCTATGAAAATTTTCCTTAGTGTAATAAAGTAATTTTAATTGGCCAAATCCAACATAACTCAACTGGTTGTGTTGCAAACCGAGAGGTCTATGGTTCGAATCCTCCTATTCCTATCGTACTAAAAAAAGTAGTTTTAATTGATTAATATAAATCAGTATATAAATAATAGTTTACTAATTTTGTTGAAGGATGTGGAAGGATTTGCCTAAAATAATTCAAACGCAAGACTTCTATGTTAAATCACCCATAAACATGCTTGGTTTTAATTTCAAGATACGACTCCCATTAATTCCTTTTCAACTCCTTTGTTCTTTCTTAATTTGCAAATATGCCCATTTGACAATTTCATTTTCACCCGTCAATCTCTTTTCTTCAAAGAACATTTCTTAGAGCTTTCAAGTCTATTTTCTTCAAAGAGGAATCTCTTTTCTTATCTTCAATCCTTTTTAAATCTTCACCCTAAAAAAGAAGGAAAAAAAAAAAAAAGAGAATTGGTCTAAGTTTGATAAGCAAACCAAATAGGCTGATGCATAATGAAAGGTGAATCAATTTAGAAGAAATTTGAAGAAATGAAAGACAGAATGACCAAAAATATCAATTTTCATTTAAAAT
ATATTATAATAATTTAATATTACTTCAATGACTTTTGTATTTTCTTGTAGATTGTTGATCATACTTGTTACAACTTTGGAAAAGAAAATGGGGTATTATTTCACATTGAGGATTAGATCCATTGTTCTACACCACCAAGATTGTTATCCTTATGGAGATTAAAGGATTGTAGATGGAGTTGTTCTGTGAAATATTGAAATTATGTTTTATTGTTAGACTAAATTCATATGAAGATGAAGTTTGTAATGATTTGTTTGGGCTACATGGAGATTGAAGATTGTAGTATTTTACTTGAAATTAATTCTAAGAATACTAAATTTTAGAAAAAATGTATTTTGATTGTAATTAAGGACATGTATGTAATTTGGGAATGATTCTAGAATAAGTGTTCTTAGACAAAGAGATGTATTTATAATAATTCTAATTGTAAAATGTGTTTAATTTT

mRNA sequence

AAAAAAACTTCTATTTGCCCCTTTTCATCAACTTCTCTTCTTTCTCTCAAACTTCAAACTCTTTTTCTCACTTTTTCCAGTTTTCCTTATTACTGATTTTTTTTTTCTTATTCAACTTCTCCTCTTTTAGTTTACCGGAAATTTTCTTTATTAAGATATTAATTAATATATTTTCTTTAATATAAATAGAGAGAGAGGAATTGGAAGTTGTTAGCGATTTGGTAATCGAAAGCGAACGGCAAAGAGAGAAAGAAATGGAAGAGAAAACAATAATAAAGTTGAAAGCCATAGACGCAACAGCAGAGAGCTTCGCAGAGTACGGGCAAGTAATCGAGGCTACAGACGACGGCGCTGAATTCGGAGCTGAAGACGCTCAATTAGACCTCAGCAATGGAATCCCTAGGTTTTACATCCTTCACATCGAGAATCGACCATTGGAATTCTCGAGGATAACATATCACGCGAGAGTAACGCAGTGCCTGGGATCGGTGGATCGGGAGCCTTGGTATCTCGGAGTTGCGAAGCCGTCGATTGTTGTGGAAGAGGAGAATGGAGGCGGCGGGCATTTGTATGTGGCGCCGAGTGTGGATGAAATTCGGGCGTTTAGGATATCGGGAGCGAAGTTTGTGAAGCTGAATAAAGGGACATGGCATGCGGGCCCTCTGTTTAGAGAAAGCGCCAGAGATTTCTACAACTTGGAATTGACTGATACTAATATTGTTGATCATACTTGTTACAACTTTGGAAAAGAAAATGGGGTATTATTTCACATTGAGGATTAGATCCATTGTTCTACACCACCAAGATTGTTATCCTTATGGAGATTAAAGGATTGTAGATGGAGTTGTTCTGTGAAATATTGAAATTATGTTTTATTGTTAGACTAAATTCATATGAAGATGAAGTTTGTAATGATTTGTTTGGGCTACATGGAGATTGAAGATTGTAGTATTTTACTTGAAATTAATTCTAAGAATACTAAATTTTAGAAAAAATGTATTTTGATTGTAATTAAGGACATGTATGTAATTTGGGAATGATTCTAGAATAAGTGTTCTTAGACAAAGAGATGTATTTATAATAATTCTAATTGTAAAATGTGTTTAATTTT

Coding sequence (CDS)

ATGGAAGAGAAAACAATAATAAAGTTGAAAGCCATAGACGCAACAGCAGAGAGCTTCGCAGAGTACGGGCAAGTAATCGAGGCTACAGACGACGGCGCTGAATTCGGAGCTGAAGACGCTCAATTAGACCTCAGCAATGGAATCCCTAGGTTTTACATCCTTCACATCGAGAATCGACCATTGGAATTCTCGAGGATAACATATCACGCGAGAGTAACGCAGTGCCTGGGATCGGTGGATCGGGAGCCTTGGTATCTCGGAGTTGCGAAGCCGTCGATTGTTGTGGAAGAGGAGAATGGAGGCGGCGGGCATTTGTATGTGGCGCCGAGTGTGGATGAAATTCGGGCGTTTAGGATATCGGGAGCGAAGTTTGTGAAGCTGAATAAAGGGACATGGCATGCGGGCCCTCTGTTTAGAGAAAGCGCCAGAGATTTCTACAACTTGGAATTGACTGATACTAATATTGTTGATCATACTTGTTACAACTTTGGAAAAGAAAATGGGGTATTATTTCACATTGAGGATTAG

Protein sequence

MEEKTIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLEFSRITYHARVTQCLGSVDREPWYLGVAKPSIVVEEENGGGGHLYVAPSVDEIRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFHIED
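The relationship between the coding sequence and the protein sequence can be checked by translating codons. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names, not part of any tool used to build this page.

```python
# Standard genetic code, with codons ordered by base T, C, A, G
# at the first, second and third position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Raises KeyError for codons with characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGAAGAGAAAACAATAATAAAG"))  # MEEKTIIK
# Last five codons of the CDS, including the TAG stop codon.
print(translate("CACATTGAGGATTAG"))  # HIED
```

Both fragments agree with the start (`MEEKTIIK`) and end (`HIED`) of the protein sequence shown above.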
BLAST of Lsi01G018860 vs. TrEMBL
Match: A0A0A0K4Z5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G320020 PE=4 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 1.1e-78
Identity = 148/182 (81.32%), Positives = 161/182 (88.46%), Query Frame = 1

Query: 5   TIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLEFS 64
           TI+ LKAI+ATAESFAEYGQVI+ATDD AEFG EDAQLDL+NGIPRFYILHIENRP EFS
Sbjct: 9   TIMNLKAIEATAESFAEYGQVIQATDDRAEFGNEDAQLDLTNGIPRFYILHIENRPFEFS 68

Query: 65  RITYHARVTQCLGSVDREPWYLGVAKPSIVV-EEENGG----------GGHLYVAPSVDE 124
           +IT+HARVTQCLGSVDRE WYLGVAK SIV  +E NGG          GGHLYVAP+VDE
Sbjct: 69  KITHHARVTQCLGSVDREAWYLGVAKASIVEGDEVNGGGGGRKLRSESGGHLYVAPNVDE 128

Query: 125 IRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFHI 176
           IRAF+ISGAKFVKLNKGTWHAGPLFRE+ARDFYNLELT+TNIVDHTCYN G+EN V+FHI
Sbjct: 129 IRAFKISGAKFVKLNKGTWHAGPLFRENARDFYNLELTNTNIVDHTCYNIGEENRVVFHI 188

BLAST of Lsi01G018860 vs. TrEMBL
Match: A0A061FMI4_THECC (Ureidoglycolate hydrolases OS=Theobroma cacao GN=TCM_043078 PE=4 SV=1)

HSP 1 Score: 243.4 bits (620), Expect = 2.0e-61
Identity = 117/183 (63.93%), Positives = 143/183 (78.14%), Query Frame = 1

Query: 3   EKTIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLE 62
           E  ++KLK I+AT ESF EYGQVIEA+ DG EFG +DAQLDLS GIPRFYI+H+++RPLE
Sbjct: 10  EPALMKLKPIEATQESFKEYGQVIEASPDGDEFGPKDAQLDLSKGIPRFYIMHLQDRPLE 69

Query: 63  FSRITYHARVTQCLGSVDREPWYLGVAKPSIVVEEE----NG------GGGHLYVAPSVD 122
           FS+IT+HA VTQCLGS+    WYLG+AKPSI+  EE    NG        GH YV P+VD
Sbjct: 70  FSKITHHASVTQCLGSIGGHVWYLGIAKPSIMDSEEIRSDNGKILIQSHCGHRYVPPAVD 129

Query: 123 EIRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFH 176
           ++  FRISG KF+KLN+GTWHAGPLF+    DFYNLEL+DTN+VDHT ++F KENGVLF 
Sbjct: 130 DVCVFRISGPKFLKLNRGTWHAGPLFKADTMDFYNLELSDTNVVDHTTHDFIKENGVLFS 189

BLAST of Lsi01G018860 vs. TrEMBL
Match: A0A0B0PP80_GOSAR (Nucleosome assembly OS=Gossypium arboreum GN=F383_04345 PE=4 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 1.3e-60
Identity = 114/183 (62.30%), Positives = 143/183 (78.14%), Query Frame = 1

Query: 3   EKTIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLE 62
           E T++KLK I+AT ESF E+GQVIEA+ DG +FG  DAQLDLS GIPRFYI++++NRPL+
Sbjct: 9   ELTVMKLKPIEATPESFKEFGQVIEASPDGEKFGPTDAQLDLSKGIPRFYIMNLQNRPLK 68

Query: 63  FSRITYHARVTQCLGSVDREPWYLGVAKPSIVVEEE--NGGG--------GHLYVAPSVD 122
           FS IT+HA VTQCLGS+    WYLG+AKPS+V  EE  N  G        GH YV P+VD
Sbjct: 69  FSTITHHASVTQCLGSIGGHVWYLGIAKPSLVDSEEVKNEKGKIAVQSRCGHFYVPPAVD 128

Query: 123 EIRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFH 176
           ++R FRI+G KF+KLN+GTWHAGPLF+  A DFYNLEL++TN+VDHT + F KENGV+F 
Sbjct: 129 DVRVFRIAGPKFIKLNRGTWHAGPLFKADAMDFYNLELSNTNVVDHTTHVFKKENGVIFS 188

BLAST of Lsi01G018860 vs. TrEMBL
Match: A0A0D2V3K0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G093700 PE=4 SV=1)

HSP 1 Score: 240.4 bits (612), Expect = 1.7e-60
Identity = 114/183 (62.30%), Positives = 141/183 (77.05%), Query Frame = 1

Query: 3   EKTIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLE 62
           E T+IKLK I+AT ESF E+GQVIEA+ DG EFG  DAQLDLS GIPRFYI++++NRPL+
Sbjct: 4   ELTVIKLKPIEATPESFKEFGQVIEASPDGEEFGPTDAQLDLSKGIPRFYIMNLQNRPLK 63

Query: 63  FSRITYHARVTQCLGSVDREPWYLGVAKPSIVVEEE--NGGG--------GHLYVAPSVD 122
           FS IT+HA VTQCLGS+    WYLG+AKPS+V  EE  N  G        GH YV P++D
Sbjct: 64  FSTITHHASVTQCLGSIGGHVWYLGIAKPSLVDSEEVKNEKGKIAVQSRCGHFYVPPAID 123

Query: 123 EIRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFH 176
            +  FRI+G KF+KLN+GTWHAGPLF+  A DFYNLEL++TN+VDHT + F KENGV+F 
Sbjct: 124 NVHVFRIAGPKFIKLNRGTWHAGPLFKADAMDFYNLELSNTNVVDHTTHVFKKENGVIFS 183

BLAST of Lsi01G018860 vs. TrEMBL
Match: A0A068U6Y8_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00017638001 PE=4 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 2.9e-60
Identity = 113/180 (62.78%), Positives = 140/180 (77.78%), Query Frame = 1

Query: 6   IIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLEFSR 65
           ++KLK I+AT E+F E+GQVIEA+ DG EFG  DAQLDLS GIPRFYI+H+E+R L+FS 
Sbjct: 16  VVKLKPIEATPETFQEFGQVIEASPDGEEFGPADAQLDLSRGIPRFYIMHLEDRALKFSN 75

Query: 66  ITYHARVTQCLGSVDREPWYLGVAKPSIVVEEENGGG----------GHLYVAPSVDEIR 125
           IT+HA VTQCLGS+    WYLGVAKPSIV   E  G           GH +V PSVD++R
Sbjct: 76  ITHHANVTQCLGSIGGNVWYLGVAKPSIVDPSEIKGTTGVDVIQSHCGHYHVPPSVDDVR 135

Query: 126 AFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFHIED 176
           AFRISG KF+KLN+GTWHAGPLF++ A DFYNLEL++TN+VDHT +NF K+N V+F ++D
Sbjct: 136 AFRISGPKFLKLNRGTWHAGPLFKQDAMDFYNLELSNTNVVDHTTHNFVKKNNVVFMLDD 195

BLAST of Lsi01G018860 vs. TAIR10
Match: AT2G35810.1 (AT2G35810.1 unknown protein)

HSP 1 Score: 218.0 bits (554), Expect = 4.6e-57
Identity = 103/177 (58.19%), Positives = 132/177 (74.58%), Query Frame = 1

Query: 7   IKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLEFSRI 66
           + L  I+AT E+FAEYGQVIEA+ DGA +G  DAQLDLS GIPR YIL ++  PL F +I
Sbjct: 22  VNLIPIEATPETFAEYGQVIEASRDGAGYGPNDAQLDLSKGIPRLYILRLKETPLGFFKI 81

Query: 67  TYHARVTQCLGSVDREPWYLGVAKPSIVVEEENG--------GGGHLYVAPSVDEIRAFR 126
           T+HA+VTQCLGS+  + WY+GVAKPS++ ++++G          GHLY+ P V+EIR FR
Sbjct: 82  THHAKVTQCLGSIGGDIWYMGVAKPSLIEDDDDGRRVDTVKAKSGHLYIPPEVEEIRVFR 141

Query: 127 ISGAKFVKLNKGTWHAGPLFR-ESARDFYNLELTDTNIVDHTCYNFGKENGVLFHIE 175
            SG KFVKL++GTWHAGPLF   S  DFYNLEL++TN+VDHT ++F K NGV F  +
Sbjct: 142 FSGPKFVKLHRGTWHAGPLFSGSSIMDFYNLELSNTNVVDHTSHDFTKNNGVSFRFD 198

BLAST of Lsi01G018860 vs. TAIR10
Match: AT2G35830.2 (AT2G35830.2 unknown protein)

HSP 1 Score: 216.9 bits (551), Expect = 1.0e-56
Identity = 103/175 (58.86%), Positives = 131/175 (74.86%), Query Frame = 1

Query: 7   IKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLEFSRI 66
           + L  I+AT E+FAEYGQVIEA+ DGA FG  DAQLDLS G PR YIL ++  PL F +I
Sbjct: 8   VNLIPIEATPENFAEYGQVIEASRDGAGFGPHDAQLDLSRGTPRLYILRLKETPLGFFKI 67

Query: 67  TYHARVTQCLGSVDREPWYLGVAKPSIVVEEENGG---------GGHLYVAPSVDEIRAF 126
           T+HA+VTQCLGS+  + WY+GVAKPS++ ++++ G          GHLY+ P V+EIR F
Sbjct: 68  THHAKVTQCLGSIGGDVWYMGVAKPSLIEDDDDDGRSVDTVKSKSGHLYIPPEVEEIRVF 127

Query: 127 RISGAKFVKLNKGTWHAGPLFRESA-RDFYNLELTDTNIVDHTCYNFGKENGVLF 172
           R SG KFVKL++GTWHAGPLF  S+  DFYNLEL++TN+VDHT ++F K NGV F
Sbjct: 128 RFSGPKFVKLHRGTWHAGPLFSGSSFMDFYNLELSNTNVVDHTSHDFTKNNGVSF 182

BLAST of Lsi01G018860 vs. TAIR10
Match: AT2G35820.1 (AT2G35820.1 ureidoglycolate hydrolases)

HSP 1 Score: 216.1 bits (549), Expect = 1.7e-56
Identity = 101/176 (57.39%), Positives = 128/176 (72.73%), Query Frame = 1

Query: 7   IKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLEFSRI 66
           +KL  I+AT E+FA+YGQVIEA+ DGA FG  DAQLDLS GIPRFYI+ I + P +FS +
Sbjct: 8   VKLIPIEATPENFADYGQVIEASRDGAGFGPNDAQLDLSRGIPRFYIMRIRDTPFDFSVL 67

Query: 67  TYHARVTQCLGSVDREPWYLGVAKPSIVVEEENG--------GGGHLYVAPSVDEIRAFR 126
           T+HA VTQCLGS+    WYLGVAKP+++ + ++G          GHLY  P+V+EIR FR
Sbjct: 68  THHASVTQCLGSIGGHVWYLGVAKPTLIEDGDDGKMVDKLKSRSGHLYAPPAVEEIRVFR 127

Query: 127 ISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFHIE 175
           +SG KF+KLN GTWH GPLF +S  DFYNLEL++TN VD T Y+F K  GV   ++
Sbjct: 128 VSGPKFIKLNHGTWHVGPLFSDSYMDFYNLELSNTNAVDRTTYDFIKNKGVTIRVD 183

BLAST of Lsi01G018860 vs. NCBI nr
Match: gi|659123109|ref|XP_008461494.1| (PREDICTED: uncharacterized protein LOC103500077 [Cucumis melo])

HSP 1 Score: 305.8 bits (782), Expect = 4.7e-80
Identity = 151/184 (82.07%), Positives = 162/184 (88.04%), Query Frame = 1

Query: 5   TIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLEFS 64
           TI+KLKAI+AT ESFAEYGQVIEAT DGAEFG++DAQLDL+NGIPRFYILHIENRP EFS
Sbjct: 12  TIMKLKAIEATPESFAEYGQVIEATGDGAEFGSQDAQLDLTNGIPRFYILHIENRPFEFS 71

Query: 65  RITYHARVTQCLGSVDREPWYLGVAKPSIVVEEE-NGG------------GGHLYVAPSV 124
           +IT+HARVTQCLGSVDRE WYLGVAK SIV  EE NGG            GGHLYVAP+V
Sbjct: 72  KITHHARVTQCLGSVDREAWYLGVAKASIVEGEEINGGGGGGGRNLRSERGGHLYVAPNV 131

Query: 125 DEIRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLF 176
           DEIRAFRISGAKFVKLNKGTWHAGPLFRE+ARDFYNLELTDTNIVDHTCYN G+EN V+F
Sbjct: 132 DEIRAFRISGAKFVKLNKGTWHAGPLFRENARDFYNLELTDTNIVDHTCYNIGEENRVVF 191

BLAST of Lsi01G018860 vs. NCBI nr
Match: gi|449443816|ref|XP_004139672.1| (PREDICTED: uncharacterized protein LOC101212947 [Cucumis sativus])

HSP 1 Score: 300.8 bits (769), Expect = 1.5e-78
Identity = 148/182 (81.32%), Positives = 161/182 (88.46%), Query Frame = 1

Query: 5   TIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLEFS 64
           TI+ LKAI+ATAESFAEYGQVI+ATDD AEFG EDAQLDL+NGIPRFYILHIENRP EFS
Sbjct: 9   TIMNLKAIEATAESFAEYGQVIQATDDRAEFGNEDAQLDLTNGIPRFYILHIENRPFEFS 68

Query: 65  RITYHARVTQCLGSVDREPWYLGVAKPSIVV-EEENGG----------GGHLYVAPSVDE 124
           +IT+HARVTQCLGSVDRE WYLGVAK SIV  +E NGG          GGHLYVAP+VDE
Sbjct: 69  KITHHARVTQCLGSVDREAWYLGVAKASIVEGDEVNGGGGGRKLRSESGGHLYVAPNVDE 128

Query: 125 IRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFHI 176
           IRAF+ISGAKFVKLNKGTWHAGPLFRE+ARDFYNLELT+TNIVDHTCYN G+EN V+FHI
Sbjct: 129 IRAFKISGAKFVKLNKGTWHAGPLFRENARDFYNLELTNTNIVDHTCYNIGEENRVVFHI 188

BLAST of Lsi01G018860 vs. NCBI nr
Match: gi|1021570092|ref|XP_016176239.1| (PREDICTED: uncharacterized protein LOC107618640 isoform X2 [Arachis ipaensis])

HSP 1 Score: 244.2 bits (622), Expect = 1.7e-61
Identity = 113/185 (61.08%), Positives = 145/185 (78.38%), Query Frame = 1

Query: 1   MEEKTIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRP 60
           MEE  ++ LK I+AT  +F +YGQVIEA+ DG EFG  DAQLDLS GIPRFYI+HIENRP
Sbjct: 1   MEETKVVTLKPIEATPSTFKDYGQVIEASPDGDEFGPHDAQLDLSKGIPRFYIMHIENRP 60

Query: 61  LEFSRITYHARVTQCLGSVDREPWYLGVAKPSIVVEEE---NGGG-------GHLYVAPS 120
           L+FS IT+HA VTQCLGS+    WYLGVAKPSIV  +E   N G        GH YV P+
Sbjct: 61  LKFSNITHHASVTQCLGSIGGHAWYLGVAKPSIVESDELKDNTGKKIVQSRCGHSYVPPA 120

Query: 121 VDEIRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVL 176
           +D+++ F++SG+KF+KLN+GTWHAGP+F+E A DFYNLEL++TN++DHT ++F K+NGV+
Sbjct: 121 IDDVQIFKVSGSKFLKLNRGTWHAGPIFKEDAMDFYNLELSNTNVIDHTTHSFKKDNGVV 180

BLAST of Lsi01G018860 vs. NCBI nr
Match: gi|590564690|ref|XP_007009735.1| (Ureidoglycolate hydrolases [Theobroma cacao])

HSP 1 Score: 243.4 bits (620), Expect = 2.9e-61
Identity = 117/183 (63.93%), Positives = 143/183 (78.14%), Query Frame = 1

Query: 3   EKTIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRPLE 62
           E  ++KLK I+AT ESF EYGQVIEA+ DG EFG +DAQLDLS GIPRFYI+H+++RPLE
Sbjct: 10  EPALMKLKPIEATQESFKEYGQVIEASPDGDEFGPKDAQLDLSKGIPRFYIMHLQDRPLE 69

Query: 63  FSRITYHARVTQCLGSVDREPWYLGVAKPSIVVEEE----NG------GGGHLYVAPSVD 122
           FS+IT+HA VTQCLGS+    WYLG+AKPSI+  EE    NG        GH YV P+VD
Sbjct: 70  FSKITHHASVTQCLGSIGGHVWYLGIAKPSIMDSEEIRSDNGKILIQSHCGHRYVPPAVD 129

Query: 123 EIRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVLFH 176
           ++  FRISG KF+KLN+GTWHAGPLF+    DFYNLEL+DTN+VDHT ++F KENGVLF 
Sbjct: 130 DVCVFRISGPKFLKLNRGTWHAGPLFKADTMDFYNLELSDTNVVDHTTHDFIKENGVLFS 189

BLAST of Lsi01G018860 vs. NCBI nr
Match: gi|1012246807|ref|XP_015942467.1| (PREDICTED: uncharacterized protein LOC107467793 [Arachis duranensis])

HSP 1 Score: 241.5 bits (615), Expect = 1.1e-60
Identity = 112/185 (60.54%), Positives = 143/185 (77.30%), Query Frame = 1

Query: 1   MEEKTIIKLKAIDATAESFAEYGQVIEATDDGAEFGAEDAQLDLSNGIPRFYILHIENRP 60
           MEE   + LK I+AT  +F +YGQVIEA+ DG EFG  DAQLDLS GIPRFYI+HIENRP
Sbjct: 1   MEETKAVTLKPIEATPSTFKDYGQVIEASPDGDEFGPNDAQLDLSKGIPRFYIMHIENRP 60

Query: 61  LEFSRITYHARVTQCLGSVDREPWYLGVAKPSIVVEE---ENGGG-------GHLYVAPS 120
           L+FS IT+HA VTQCLGS+    WYLGVAKPSIV  +   +N G        GH YV P+
Sbjct: 61  LKFSNITHHASVTQCLGSIGGHAWYLGVAKPSIVESDDIKDNAGKKIVQSRCGHSYVPPA 120

Query: 121 VDEIRAFRISGAKFVKLNKGTWHAGPLFRESARDFYNLELTDTNIVDHTCYNFGKENGVL 176
           +D+++ F++SG+KF+KLN+GTWHAGP+F+E A DFYNLEL++TN++DHT ++F K+NGV 
Sbjct: 121 IDDVQIFKVSGSKFLKLNRGTWHAGPIFKEDAMDFYNLELSNTNVIDHTTHSFKKDNGVA 180

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name | E-value | Identity | Description
A0A0A0K4Z5_CUCSA | 1.1e-78 | 81.32 | Uncharacterized protein OS=Cucumis sativus GN=Csa_7G320020 PE=4 SV=1
A0A061FMI4_THECC | 2.0e-61 | 63.93 | Ureidoglycolate hydrolases OS=Theobroma cacao GN=TCM_043078 PE=4 SV=1
A0A0B0PP80_GOSAR | 1.3e-60 | 62.30 | Nucleosome assembly OS=Gossypium arboreum GN=F383_04345 PE=4 SV=1
A0A0D2V3K0_GOSRA | 1.7e-60 | 62.30 | Uncharacterized protein OS=Gossypium raimondii GN=B456_012G093700 PE=4 SV=1
A0A068U6Y8_COFCA | 2.9e-60 | 62.78 | Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00017638001 PE=4 SV=1

vs. TAIR10
Match Name | E-value | Identity | Description
AT2G35810.1 | 4.6e-57 | 58.19 | unknown protein
AT2G35830.2 | 1.0e-56 | 58.86 | unknown protein
AT2G35820.1 | 1.7e-56 | 57.39 | ureidoglycolate hydrolases

vs. NCBI nr
Match Name | E-value | Identity | Description
gi|659123109|ref|XP_008461494.1| | 4.7e-80 | 82.07 | PREDICTED: uncharacterized protein LOC103500077 [Cucumis melo]
gi|449443816|ref|XP_004139672.1| | 1.5e-78 | 81.32 | PREDICTED: uncharacterized protein LOC101212947 [Cucumis sativus]
gi|1021570092|ref|XP_016176239.1| | 1.7e-61 | 61.08 | PREDICTED: uncharacterized protein LOC107618640 isoform X2 [Arachis ipaensis]
gi|590564690|ref|XP_007009735.1| | 2.9e-61 | 63.93 | Ureidoglycolate hydrolases [Theobroma cacao]
gi|1012246807|ref|XP_015942467.1| | 1.1e-60 | 60.54 | PREDICTED: uncharacterized protein LOC107467793 [Arachis duranensis]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0004848 | ureidoglycolate hydrolase activity
Vocabulary: INTERPRO
Term | Definition
IPR024060 | Ureidoglycolate_lyase_dom_sf
IPR011051 | RmlC_Cupin_sf
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0000256 | allantoin catabolic process
biological_process | GO:0006144 | purine nucleobase metabolic process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0004848 | ureidoglycolate hydrolase activity
molecular_function | GO:0050385 | ureidoglycolate lyase activity

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lsi01G018860.1 | Lsi01G018860.1 | mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011051 | RmlC-like cupin domain | unknown | SSF51182 | RmlC-like cupins | coord: 7..173, score: 1.69
IPR024060 | Ureidoglycolate lyase domain | GENE3D | G3DSA:2.60.120.480 | - | coord: 8..173, score: 1.0
None | No IPR available | PANTHER | PTHR35721 | FAMILY NOT NAMED | coord: 1..175, score: 7.4