CSPI01G12020.2 (mRNA) Cucumber (PI 183967) v1

Overview
NameCSPI01G12020.2
TypemRNA
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionAllantoinase
LocationChr1: 7578796 .. 7588027 (+)
Sequence length2148
RNA-Seq ExpressionCSPI01G12020.2
SyntenyCSPI01G12020.2
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTAGTAATTGACATGTTTCCCGCCGTACTAAAAAAAATGTCACCTTAAAAGCTTCTTAATCAGAGTTTAGCTGAAAGGTGACTAAGTGAGCATCGGTAATTTTACTATAATCGTCTGATCTGACTTTAATCATTGATTAGCGAAATGGTTCACCACAGTTGCCCTTCACTGGTCAGACGATAAGCTGAATTCCTAAGCCTGGGCTGAAGCCGGCGACGGTGAAATACTTCAAATTCCTATCAACTACATGATCGTTCACTCCAATCGCATTACTGATTAGACGGAAAGAATTACGATGAATTTGCTGCAGTGGAAGCTACTTCCTCTATTAACATTGCTCGCTTCCATTTTCTTATTTTTCTACTTAAAGGATCCATCCGATGTAAGCTTCAGTGATTTTCAATCCGTGTAGCTATGTGTTTTTCTTTTTTGTCTTATTTGCTATCTACCACTGTCTGCACCGTCTGCCATCCAGTGATCTTTATGAAGAACTCGGTTTCGTAAGAGATGAAAATTGGTTCCAGTGTGATTAGGTTTCTTTTGTTTCTTCGCGATATTAGACTTAGAAGTTAGAACGAACATCGTTACATATCGCATATGACTTGTTGATGTTTTTATTCCTGCGCTGGTGTTGGGAATCAAATTTCGATAATCGAGTTCTAGAAATTTTGGGAATTAAATGGGTTCTAACTAGGAACTTAAGTCCTTAAGTAAATTAGGAGTTTCACGGGGAATTGGAATAAGAATTCATAATTGAAACATGGCAGTGTTGTTCTTTTTGACTCGTTTTGTATACGTTTTCGGCAGAATATATTTATTGAATGGAAGTTGTTGACTTCATGTTGCAATCATAGATGGGGTCCTATGTATTTTGGGCTTATACTGTTTTAGGTATGACTAATTTTGTATGTCATGTGAAAGAGTTTTACATCTTGGTTCCAACCCTCTCAGAATGAATGCAGCCTCCTTCCTCACAAGCACTTTTGGATAACAAGCAAGCGCATTGTTACGCCACAAGGAGTCATTTCTGGTGCTGGTTAGTACACTGGTTAATGCTGTAGGCTCATGAAAAGAAGTATTACCTTAGGGAGTCTACTCTACATGCTTTATTCATTTATTTCAATATACCTTATGTTTGACTCTCATAGAGTTGGCTTCGGTCTACTCTGTTGAGAAAACATCTTGTTTTCCCTCTATGTTATATTCAAGATTATCATTTTTCTCTGATAAATCAGTTCATTACTAACAAATCAGGAGGTGTTTCTTTGTCAATCATGGCTTTGTATCTTTCTTTTCTTGATGTCTTATGGATTTTCTTTATCAAATGCTAAACTGGGCAGTTGAGATAAATGGAGGGAAGATTGTATCCATTGTTAAGGAAGAAGAAAAGCATGGGAAGATTATGGGTAATCACGTGGTTGATTACAAAGATGCTGTTGTAATGCCTGGCTTGGTTGACGTGTAAGACCTAAAATTTATTTTTCAATTGTTGAATATAAATGTGATTTATAGTCTCTGGTAGAAAGTAAAATTACGTAAAACAGGATATCAAAAAGAAAATAGTTATTATTACAGCAACCATTCTATTATATGTTTCTCAGGATCAAAATTCATTAGATAGATTTTATCTGGGAGGTAAGATTCCATGTGTAGTTGGAAATTTCAAGTTCTATGATTTAGAAATCTGAACTGTACGATCATACTGTTTTTTTTTTATTTAAAAAAATTGACTGCAATTTGAAATTTAAATGAGCATTTGACTGGAACTGGTTCCTAGAATAACGTTAGGTCTATATTTCTCCTTAAATTAATTTGATCCATTTGAGGGGGTTTTCTCCAGTGGACACGGGCTGTTTCTTGATGAGAAAATTAGTAATCGTGCTATCGATTTCTTGGCAGCCATGTTCATCTCGATGATCCTGGACGGTCTGAATGGGAAGGGTTTCCATCTGGAACAAAGGCTGCAGCTGCTGGTATGATTTTTTTCTGTCATTTCTTCATTTGTTTACCTTTTTCTTTTCTTTTAATAAGAATTCCTCATTTAGAAGCTTTCGCCAACAAGAAATAGAATTACTATACCAGAAGACTTTGTCATCCACCAACAGAACGTAGTCTTTTTAAGTTCCAACTCAACGTTTATGGCTTCTAGTTAACCATCTAGAATCATTTTTATTGAATTCAGAATATTTTTGTGCAATAAGTTTTTTGTTTCTTCGATAATGTGTTCATATGAACTTTACTTTAGGCCATATGGACAACCTGTGAAACTGGTATTGATATGTTTGTATAGGTGGGGTAACTACTCTGGTTGATATGCCATTAAATAATTTTCCCTCAACTACGTCCGAAGAAACTCTAAAACTCAAGGTTTGTTTTTTATTGTTTTTTTCCTGGAAATATTTTAGACTGTTTTAAGTTCATTGATCAAATGATCTTTGCTCTCACAAACATATATAAATTTGTCCATGTTTTTTTTTTTGCTTAGTCAGTGCTCTTCCAGATTCTACTAATGTGTATTACTCTATTAATAGATTAAGGCTGCTGAAGGAAGAATCTATGTTGACGTTGGTATACTCAAAACTTTACTTTTTTAACTCAGTTGCTTTAATCTAAGGACTAATTTCATTCTATTAAATATTAGGCTTTTGGGGAGGTCTTGTTCCTGAGAATGCTTTCAATGCAAGTGCTCTGGAAAATCTCCTGAAAGCAGGGGCTCTCGGCCTAAAGGTATTTCTTGATAGTTTCTTTGTAGTTTGCTGCTTACATAGTTGATAGTGACGTCTCAGCCTTATACAATGCACACGAGCGGCAAGGCAACTCGCTATGCTAATTCAAAGTGTAGAACTCCAATTTTAGATCCTTTCTAAACCCTCGTTTTCATTATCTAAGATTTACTACAGTGTGGTTGTAAATATTGGTTTTCCATTGTATTTAGCAAATTGGCAATGCCAAACTGTCTTGCTCCCATCAAACCTTCTCATCAGGGGACTCCTAGTGAGGGGTTTCCCTCTGTTGAAGGTCCTTTCTTGAGGACAAATTTTATGATATATAACTATTCTATGGAACCTTGATGTTCGCAGAAAATTATTATTACCTTATCAATTTTCTACCACGTAACAGGTTTCTCTATCATTGTTTTGTAGTCATTTATGTGTCCTTCCGGGATCAATGACTTTCCTATGACAAATATTTCTCATATCAAGGTATATGGACATTTTAAGTTGCAATATTCATATCATCAGTTTGATCGGGAAGAATGAATTGAACACTTCTATTTGCCTTCCAAATTAACATTGCCATATGCATGAAAATATATGCTTATTAGGAAGGACTGTCAGTTTTAGCAAAATACAAAAGACCTTTGCTTGTGCATTCAGAGATTGAACAAAGTTCTCCAAGCCCTGTGCAACTTGAAGGTAGTCAAGATGACCCTCGTACTTACTCAACATATCTTGCAACCAGACCACCTTCTTGGTAAGATATTAAAATTTCCTACTTTCTTTCGTGTTTGCATTTTGATATAATCCAAGAATTTAATCATGTAGTTTCCTTCTGTTTCTTTCTATTTGAGTACTATGCTACCTTTAGTATTATTTATGTGTACAGAATTCACTAAATCATGCAGACATATTTGTCTTTTTCATTGTTGGTTCTGATTTCCTAGATATTTTCTTCATAACACAATAATTAATAAAATGAAGATTACTGTTGATGATATATAATTAAATTTGCGTTCAACCACTAGCTTAGGTTTTTGGGTGAATTGGTGGTTTAATATGGTATCAGAGCAGGTGGTCCAGAGAGGTCCTGTGTTCAAGCCTCTGCATTGTCGTTTTCTTCCCAATTAAAATTGATTTCCACTTGTTGGACCTTCAAATATTTCAAGCCCACAAGTGAGGGGAGTGTTGGTGATATATAATTAAATTTGCCTTCAACCACCAACTTAAACATTTGGGTGAATTTGTGGTTTAATAATTTTTGCACGAGTTCAGGTTGAGATCATCTAGTTGTGGGCAGTGAAGATATAGTGCATCTAGACCAGAACATCCCCATAAACTCAGCCTTTTCAGGTTGTTGTGTTTAATGATCAACTTTTGGTAAATACTGTGAAACTTAGAGTTTGGTGGTGGTTTTGATAACTTGTGATCACCAGTATCAGATTTAGAATCCGTAGCATTAGGGTCGCAAATAGTCATCCCACAATCCATGAGTTCAATTGGCAATTCAACCATAGCAAATTGTTTGCCACTTGAAGTGATATTAGGACAGAGGGCAAGGAGAAGTCTAGACAATGAATCTGGGAAAACATTGCAGCTCATTCCTATGCCACTGTCACTGATGCTAGACCCGCTAAGATCAAGCAACTCCAAGTTTGGATAGCTTGAAGCAATAGAAGCATCTGATGCATTAGTGATGTCAGCTCCAAGAACAAGTGAAAGCATCCGTAATCCCCTCAATTGTGCAGCTGTGAGTGCTAGAACTACAGCATGAGAAAGACGGAAAGACGATATATGTATGTTTTGTAATCTTGGGCAACCCCTTCCCATACCATCAACCATTGCAACAAGATCAGAGCTATCATTTTCTTGACAAGAAAACTCCGATGAAATCTCCTTTAAATTGGGGCAATTGAAAGCCATCTTAGAGAGCAAGCAAAGATCAAAAAGCCAAAGTGTTTCAAGACTTGATGAACAGAGGGAAAAACTCCCCAAACTAGAACAACCTTCCATCTTAAAACTCTTAAGATAGCGTTTATCGACAACAAATCGACCAAGTTCACCACCTGTGATGCGATTTATCGATGATTTAGACTTGGTGATCTTCAGAACTTCAAGACTAGGGCACGAGAATGCAATGCAAGCCAACAGCATCGAATCCAAGTCACTTTCCTATCTAAACGAGTGTCACAAGTCCAGAATACATCTGCAGCAACGAACCAATAAATCCAATTGGCAATGAGAGGAGTAGCACATCTATCTTATATAGAGTGTGAGTCTCTTGGTTTTTCCAATGTGAGACTCTCAACATCTTGTTGGTGGTTGAAGGTTATTAAACCACCAATTCACCAAAAAGCTTAAGCTGGTGGTTGAATATGGTATCAGAGTCGGTGGTCCAGGGACTTATGTTCAAGCCCATGCATTGTTGTTTCTTTCCCAATTAAAATCGATTTCCACTTGTTAGACCTTTCAAATATTTCAAGCCCATAAGTGAGGTAAAATTGATTCCACATGTTGAGCCTTTCAAATATTTCAAGCCCACAAGTGAGGGGAATGTTGTTGCTGATATATAATTAAATTTGCCTTCACATCTTAACATGCCCCCTCAAGATGGTGCCTTTTTGGGTTCACCTATCTTGGACCGAATACCCGTTTGGGTTTAATGGACTCTGATACCATATTTGATAGTATAGAGTTCCATCTCAAAACAAATTGACAATTAGAGGACTAGTCCATCTATCTTATATAGAGTGTGAGTCTCACGGTTTTTCCAATGTGGGACTCTCAACATCTCATCAGTTACAAATGGTGTTGTGCTTATCAAACTTACAAAGCTGAAGGGATGATGTAATAAGTTCCTCTTTATTGCATTGATTATAAAAAATTATAGCTGCTACAACATACTTTTGAAGAAGTGGTTGAAGGGAGACTTTGAATTGTAACATTTCATTATTCTTTTTTATTTTCAGAAGAAACTTCATAAATATTATTTATTGCCAATAGGAAACAATCTTGTTGATGGAATGAAATATTGAAAGAAAAATAAAAGGAAAAATAAAGCTTATCTTAATTATGCTACAAAAAGAAAAAGTCTCCAGTTGGATGTAAAAATGTTTAATCTATACTTGGTGAGAGGAGGAACAAGTTCACTGAGATATAAAGCCAACCAAAGAAAAAAGAAATTGATGTGTTCAACTATAACCTCGATGGCTCTCTCCCCGTCACAGAATATCTTTGCATTTGGCTTTTGCTCATTCCAAGCAATGCTTATGAAAGGTCTATTGATCTAAAGCTGTATAGTATTTTTCTGAAGCTTGGTAGGATGATCGGATAGAATTTGAGGCAACCAATATTGCACCTCAAAAGGATGGTCAATGTCCAGCAAATGGCTGTGAGGAAATGGGACCATATCTTTGAAGTGAAAGGGCAGTAAATGAATATATGTTTCTGGGACACTATTTGATTTACACGAACTGCAGCATTGAGGAGATAGAGCCATCCAGTGTTTTCTTTTTTGTAACTGATCTGGGTTGTTTAGGTGAGTGTGGCAGAGTTTCCAAGGGCAGAACTTGATCTTATTTGGATAAAATCCTTTCAAATGACTTTGAATTCGGGTACAGTTTGATGAGGATCCCACCAAAAGATTGCTGGTCACAGATTTGGTGGAGTAATAGCCTTCCTTGTCTAGATCTGAAAACCACAAGTCTTCCCTGTCTATATGGGTTGATAAGATAAAAGATATTTCCTCATACTTTAGATTTCCTCAAAAGTTCAAAACTCATGACGCTTTTTAGTGGTCCCAAATGTCTGCCACGGAGGCATCATTTGTTGCCATCAGTGCAAATAAGAGAGGAAATAGTGTAGTTTTACATTACAATGTTAACAAATTTGATATCATAAATCTTAGAAACTACACATCCAAGTTTTCTTCTATTAGAAAAAAACCTTCTGTTTAACTTGATTTGTATGAACTCAATTTCTTGGTAATATTCTTTTCTTGTATCATTAAACAGGGAAGAGGCAGCTGTAAGAGAGCTCTTAAAGGTGACAAGTAATACAAGGCCAGGTGGCCCGGCAGAAGGAGCTCATATTCACGTTGCTCATTTGTCTGATTCAGGTTCTACCTTAGAACTTATTAAGGTACTTTAGACTACTCTGACAAATGCATGTATGTTCCTGTTTCCCAATGAGCATATACAAATTGGCTATTCAACCCATAAGTTTTCACTCTTGAGGTAATTTTGGTCTGCACATTTTGCAATGGGCTTCCTTAATGTCGTCTGATAGTTTTTTCTGTGACTCTTGGTTTAATATACACATCAGCTTCGATATATTATGTACTATGCCACTGTATCCAAGAGCTATAATCTAATGTCTTAAGTGACTGAAAAATGAAGCATCCAAAGAATGTTATTTTCACAGGAAGCCAAAAGGAGTGGAGATAGTGTGTCAGTTGAGACGTGCACCCACTATCTAGCTTTCTCAGAAGAAGATATAAAAGATGGAGATACTCGTTTCAAGTGTGCTCCACCAATTCGTGATAAAGCCAACAAAGAAAAACTATGGGATGCTTTGATGGTCGGTGTTTTTGTAGATCTTTCTTGCTATATATTTCATAAGTCCTCTCTGAACTTGCATCTGTTTCAACTAACATTACAGGAAGGACATATTGACATGTTAAGTTCTGATCATTCGCCAACAGTGCCACATCTAAAGCTACCTGATTCTGGGGATTTTTTAAAGGCTTGGGGAGGCGTATCATCTTTGCAGGTCACAATTCAATATTTACTATAGATTCTCTCACGTCAGTACCATCTGCCGCTTTTTTATTTTCCATCTTGCACTCGTAGGCTTGGAGAAACAATGTATGCACACTGAAAATTATACTTCAAGTTGATGAATGAGGATGAATAGTCTTGTTGCTGATTTTGATAAAACAGTCTTTGAATCGTATTTGGATTTGGATCATTAGGTTTTCAGACATTTATAGCCATTCCCTTAATAACTGCTCGCCATTTATCGGCTGACCTCATTTGAGAGATTTGAAACGGGTTGGTCCCTTAGTTGTAAGTAACATATCACGGGCAGACGCATTTAAATTATATCCTGCGTGGTACAAAGACAAGTGATATACTTGGATTCCTTGTGCTAGCTGTTGATAGATGCTTATATAAGAAAATTGTCAACCAAGTTATGAATAGTGACATCATTTTCTTGAGAAACTGCAGTTTGATCTCTCTGCAACCTGGTCACATGCAAAGAAACGTGGAGTAACAATGGAGCAAATTGCTTTGTGGTGGAGTGAGCGGCCTGCCAAGCTTGCTGGTCTAGAATTAAAGGTTACAAAATTTTCATACTTTTGTTGTCAATTTTACATTACCACTCTTTGGAAGAGTAGATTGAGATACTTTTATTAATTTCTTCACGAGCTTGTTATGTTATGAATCTATCATGAGTTGTATCTCAATAAATAGTCTAAATAACATCTTGTGTTTTCATTAAGGGTAGCAAATTTGCTTATAAATGTTTAAACAATGACATTATAGCCAGTTTGATTGCACCTTCAAGGAAAATATATATCTATATCTCTATCTCTTTCTATCTATCTATATATGTATATATTACCATGTTTAGTGGAGATACTTGGCTACTGGCAATCATCCCTATATCATCGTTGACATAAGAAAGCCTGGTTTTCTCTTACAGGGTGCTATTGCTATTGGAAAGCATGCAGATATTGTTGCGTGGGCACCAGATGAAGAGTACGATGTCAATGACATTCCCGTATACTTGAAACATCCCGTATGTCCAGTTTTTTCTTTTCACCACTGTTTCCTCTTCATATATCCATTTTCAAGTTGCAATTTTCTCAAACTGAATCGTATTTCCAGCATTTTAATTCTACAGATTCAGAAACATCAAATATAAACTGTACATACTCCAAGCACTGAAACTCGTACTGCTGTATGAAACAAGAATTTGCTTTCGTTGAAATTAAACCCAAATCTCCTTACAATATAGGCCAGCATTTGTCACAAATTTTCTGTTGCAAAAATGATTGAGTTGGCTTGTTATGCAGCTATGCTGACTTGGTTTTTTCTTGTCATTCTTTTAACTCAGAGCATTTCAGCCTATATGGGAATGAAACTGTCTGGAAAAGTTTTGGCCACTTTTGTAAGAGGACAACTCGTATACGAAGAGAAGCATGCTCCTGCTGCTTGTGGAACTCCAATCCTTGCAAGAGTAACAGATTAGGACTCTGATATATATATCCATTAATCTTCATGTTATTATGACTTCCATTAGTACCTTCCATGCGCTTATAAGATAAAGAATAAAAGGTCATACTTTCTAAAAATTGTTTGGTTACATAAAGCATGAAGGATACTGATGTTTGTGCTCGACTTCATTTACATATTAAGATCAGCTCTAGGATGATGTTGCGACTGCACTCTTTTTTTGGGAGACAGACAGGTTCCTTAATCAAACTTAATTAACCCAATCAATGGCACCC

mRNA sequence

GTTAGTAATTGACATGTTTCCCGCCGTACTAAAAAAAATGTCACCTTAAAAGCTTCTTAATCAGAGTTTAGCTGAAAGGTGACTAAGTGAGCATCGGTAATTTTACTATAATCGTCTGATCTGACTTTAATCATTGATTAGCGAAATGGTTCACCACAGTTGCCCTTCACTGGTCAGACGATAAGCTGAATTCCTAAGCCTGGGCTGAAGCCGGCGACGGTGAAATACTTCAAATTCCTATCAACTACATGATCGTTCACTCCAATCGCATTACTGATTAGACGGAAAGAATTACGATGAATTTGCTGCAGTGGAAGCTACTTCCTCTATTAACATTGCTCGCTTCCATTTTCTTATTTTTCTACTTAAAGGATCCATCCGATAATGAATGCAGCCTCCTTCCTCACAAGCACTTTTGGATAACAAGCAAGCGCATTGTTACGCCACAAGGAGTCATTTCTGGTGCTGTTGAGATAAATGGAGGGAAGATTGTATCCATTGTTAAGGAAGAAGAAAAGCATGGGAAGATTATGGGTAATCACGTGGTTGATTACAAAGATGCTGTTGTAATGCCTGGCTTGGTTGACGTCCATGTTCATCTCGATGATCCTGGACGGTCTGAATGGGAAGGGTTTCCATCTGGAACAAAGGCTGCAGCTGCTGGTGGGGTAACTACTCTGGTTGATATGCCATTAAATAATTTTCCCTCAACTACGTCCGAAGAAACTCTAAAACTCAAGATTAAGGCTGCTGAAGGAAGAATCTATGTTGACGTTGGCTTTTGGGGAGGTCTTGTTCCTGAGAATGCTTTCAATGCAAGTGCTCTGGAAAATCTCCTGAAAGCAGGGGCTCTCGGCCTAAAGTCATTTATGTGTCCTTCCGGGATCAATGACTTTCCTATGACAAATATTTCTCATATCAAGGAAGGACTGTCAGTTTTAGCAAAATACAAAAGACCTTTGCTTGTGCATTCAGAGATTGAACAAAGTTCTCCAAGCCCTGTGCAACTTGAAGGTAGTCAAGATGACCCTCGTACTTACTCAACATATCTTGCAACCAGACCACCTTCTTGGGAAGAGGCAGCTGTAAGAGAGCTCTTAAAGGTGACAAGTAATACAAGGCCAGGTGGCCCGGCAGAAGGAGCTCATATTCACGTTGCTCATTTGTCTGATTCAGGTTCTACCTTAGAACTTATTAAGGAAGCCAAAAGGAGTGGAGATAGTGTGTCAGTTGAGACGTGCACCCACTATCTAGCTTTCTCAGAAGAAGATATAAAAGATGGAGATACTCGTTTCAAGTGTGCTCCACCAATTCGTGATAAAGCCAACAAAGAAAAACTATGGGATGCTTTGATGGTCGGTGTTTTTGTAGATCTTTCTTGCTATATATTTCATAAGTCCTCTCTGAACTTGCATCTGTTTCAACTAACATTACAGGAAGGACATATTGACATGTTAAGTTCTGATCATTCGCCAACAGTGCCACATCTAAAGCTACCTGATTCTGGGGATTTTTTAAAGGCTTGGGGAGGCGTATCATCTTTGCAGTTTGATCTCTCTGCAACCTGGTCACATGCAAAGAAACGTGGAGTAACAATGGAGCAAATTGCTTTGTGGTGGAGTGAGCGGCCTGCCAAGCTTGCTGGTCTAGAATTAAAGGGTGCTATTGCTATTGGAAAGCATGCAGATATTGTTGCGTGGGCACCAGATGAAGAGTACGATGTCAATGACATTCCCGTATACTTGAAACATCCCAGCATTTCAGCCTATATGGGAATGAAACTGTCTGGAAAAGTTTTGGCCACTTTTGTAAGAGGACAACTCGTATACGAAGAGAAGCATGCTCCTGCTGCTTGTGGAACTCCAATCCTTGCAAGAGTAACAGATTAGGACTCTGATATATATATCCATTAATCTTCATGTTATTATGACTTCCATTAGTACCTTCCATGCGCTTATAAGATAAAGAATAAAAGGTCATACTTTCTAAAAATTGTTTGGTTACATAAAGCATGAAGGATACTGATGTTTGTGCTCGACTTCATTTACATATTAAGATCAGCTCTAGGATGATGTTGCGACTGCACTCTTTTTTTGGGAGACAGACAGGTTCCTTAATCAAACTTAATTAACCCAATCAATGGCACCC

Coding sequence (CDS)

ATGAATTTGCTGCAGTGGAAGCTACTTCCTCTATTAACATTGCTCGCTTCCATTTTCTTATTTTTCTACTTAAAGGATCCATCCGATAATGAATGCAGCCTCCTTCCTCACAAGCACTTTTGGATAACAAGCAAGCGCATTGTTACGCCACAAGGAGTCATTTCTGGTGCTGTTGAGATAAATGGAGGGAAGATTGTATCCATTGTTAAGGAAGAAGAAAAGCATGGGAAGATTATGGGTAATCACGTGGTTGATTACAAAGATGCTGTTGTAATGCCTGGCTTGGTTGACGTCCATGTTCATCTCGATGATCCTGGACGGTCTGAATGGGAAGGGTTTCCATCTGGAACAAAGGCTGCAGCTGCTGGTGGGGTAACTACTCTGGTTGATATGCCATTAAATAATTTTCCCTCAACTACGTCCGAAGAAACTCTAAAACTCAAGATTAAGGCTGCTGAAGGAAGAATCTATGTTGACGTTGGCTTTTGGGGAGGTCTTGTTCCTGAGAATGCTTTCAATGCAAGTGCTCTGGAAAATCTCCTGAAAGCAGGGGCTCTCGGCCTAAAGTCATTTATGTGTCCTTCCGGGATCAATGACTTTCCTATGACAAATATTTCTCATATCAAGGAAGGACTGTCAGTTTTAGCAAAATACAAAAGACCTTTGCTTGTGCATTCAGAGATTGAACAAAGTTCTCCAAGCCCTGTGCAACTTGAAGGTAGTCAAGATGACCCTCGTACTTACTCAACATATCTTGCAACCAGACCACCTTCTTGGGAAGAGGCAGCTGTAAGAGAGCTCTTAAAGGTGACAAGTAATACAAGGCCAGGTGGCCCGGCAGAAGGAGCTCATATTCACGTTGCTCATTTGTCTGATTCAGGTTCTACCTTAGAACTTATTAAGGAAGCCAAAAGGAGTGGAGATAGTGTGTCAGTTGAGACGTGCACCCACTATCTAGCTTTCTCAGAAGAAGATATAAAAGATGGAGATACTCGTTTCAAGTGTGCTCCACCAATTCGTGATAAAGCCAACAAAGAAAAACTATGGGATGCTTTGATGGTCGGTGTTTTTGTAGATCTTTCTTGCTATATATTTCATAAGTCCTCTCTGAACTTGCATCTGTTTCAACTAACATTACAGGAAGGACATATTGACATGTTAAGTTCTGATCATTCGCCAACAGTGCCACATCTAAAGCTACCTGATTCTGGGGATTTTTTAAAGGCTTGGGGAGGCGTATCATCTTTGCAGTTTGATCTCTCTGCAACCTGGTCACATGCAAAGAAACGTGGAGTAACAATGGAGCAAATTGCTTTGTGGTGGAGTGAGCGGCCTGCCAAGCTTGCTGGTCTAGAATTAAAGGGTGCTATTGCTATTGGAAAGCATGCAGATATTGTTGCGTGGGCACCAGATGAAGAGTACGATGTCAATGACATTCCCGTATACTTGAAACATCCCAGCATTTCAGCCTATATGGGAATGAAACTGTCTGGAAAAGTTTTGGCCACTTTTGTAAGAGGACAACTCGTATACGAAGAGAAGCATGCTCCTGCTGCTTGTGGAACTCCAATCCTTGCAAGAGTAACAGATTAG

Protein sequence

MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEINGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAAAAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENLLKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEGSQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELIKEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDLSCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDLSATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIPVYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD*
Homology
BLAST of CSPI01G12020.2 vs. ExPASy Swiss-Prot
Match: Q94AP0 (Allantoinase OS=Arabidopsis thaliana OX=3702 GN=ALN PE=1 SV=1)

HSP 1 Score: 707.2 bits (1824), Expect = 1.4e-202
Identity = 352/531 (66.29%), Postives = 420/531 (79.10%), Query Frame = 0

Query: 3   LLQWKLLPLLTLLASIFLFFYLKDPS---DNECSLLPHKHFWITSKRIVTPQGVISGAVE 62
           LLQW+LLPLL L+ ++F FF+    S   +N+CSLLPH H+WI+SKRIVTP G+ISG+VE
Sbjct: 5   LLQWRLLPLLALIVALFSFFFASPRSLQGNNKCSLLPHDHYWISSKRIVTPNGLISGSVE 64

Query: 63  INGGKIVSIVKEEEKHGKIMGN-HVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTK 122
           + GG IVS+VKE + H        V+DY +AV+MPGL+DVHVHLDDPGRSEWEGFPSGTK
Sbjct: 65  VKGGIIVSVVKEVDWHKSQRSRVKVIDYGEAVLMPGLIDVHVHLDDPGRSEWEGFPSGTK 124

Query: 123 AAAAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALE 182
           AAAAGG+TTLVDMPLN+FPST S ETLKLKI+AA+ RI+VDVGFWGGLVP+NA N+SALE
Sbjct: 125 AAAAGGITTLVDMPLNSFPSTVSPETLKLKIEAAKNRIHVDVGFWGGLVPDNALNSSALE 184

Query: 183 NLLKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQL 242
           +LL AG LGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVH+EIE+     +++
Sbjct: 185 SLLDAGVLGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHAEIERD----LEI 244

Query: 243 E-GSQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTL 302
           E GS++DPR+Y TYL TRP SWEE A+R LL VT NTR GG AEGAH+H+ HLSD+ S+L
Sbjct: 245 EDGSENDPRSYLTYLKTRPTSWEEGAIRNLLSVTENTRIGGSAEGAHLHIVHLSDASSSL 304

Query: 303 ELIKEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVF 362
           +LIKEAK  GDSV+VETC HYLAFS E+I +GDTRFKC+PPIRD AN+EKLW+ALM    
Sbjct: 305 DLIKEAKGKGDSVTVETCPHYLAFSAEEIPEGDTRFKCSPPIRDAANREKLWEALM---- 364

Query: 363 VDLSCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQ 422
                                  EG IDMLSSDHSPT P LKL   G+FLKAWGG+SSLQ
Sbjct: 365 -----------------------EGDIDMLSSDHSPTKPELKLMSDGNFLKAWGGISSLQ 424

Query: 423 FDLSATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVN 482
           F L  TWS+ KK GVT+EQ+  WWS+RP+KLAGL  KGA+ +GKHAD+V W P+ E+DV+
Sbjct: 425 FVLPITWSYGKKYGVTLEQVTSWWSDRPSKLAGLHSKGAVTVGKHADLVVWEPEAEFDVD 484

Query: 483 -DIPVYLKHPSISAYMGMKLSGKVLATFVRGQLVY-EEKHAPAACGTPILA 527
            D P++ KHPSISAY+G +LSGKV++TFVRG LV+ E KHA  ACG+  LA
Sbjct: 485 EDHPIHFKHPSISAYLGRRLSGKVVSTFVRGNLVFGEGKHASDACGSLQLA 504

BLAST of CSPI01G12020.2 vs. ExPASy Swiss-Prot
Match: B9FDB8 (Probable allantoinase OS=Oryza sativa subsp. japonica OX=39947 GN=ALN PE=2 SV=1)

HSP 1 Score: 610.1 bits (1572), Expect = 2.3e-173
Identity = 299/528 (56.63%), Postives = 387/528 (73.30%), Query Frame = 0

Query: 7   KLLPLLTLLASIFLFFYLKDPSDNE-----CSLLPHKHFWITSKRIVTPQGVISGAVEIN 66
           ++LPLL + A++      + P         CSLLPH HFWI S+R+VT   V   AVE+ 
Sbjct: 9   RVLPLLAVAAALAAALLYRAPFSKSLGGEGCSLLPHDHFWIASERVVTLGRVGPAAVEVK 68

Query: 67  GGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAAA 126
           GG +++ +   +    ++   VVDY DAV+MPGL+DVH HLD+PGR+EWEGF +GT+AAA
Sbjct: 69  GG-LINAIAVGDYRSFLLRRPVVDYGDAVIMPGLIDVHAHLDEPGRAEWEGFSTGTRAAA 128

Query: 127 AGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENLL 186
           AGG+TTLVDMPLN++PST SEETLKLK+ AA+ +++VDVGFWGGLVPENA N SALE+LL
Sbjct: 129 AGGITTLVDMPLNSYPSTVSEETLKLKLDAAKDKLHVDVGFWGGLVPENALNPSALESLL 188

Query: 187 KAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEGS 246
            AG LGLKSFMCPSGINDFPMTN +HI+EGL  LAKYKRPLL+H+E      +   ++G 
Sbjct: 189 NAGVLGLKSFMCPSGINDFPMTNSTHIEEGLVTLAKYKRPLLIHAERIPDVQNEDGIDG- 248

Query: 247 QDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELIK 306
           + DP+ Y+TYL +RPP+WEEAA+++L +   +T  GG +EGAHIH+ HLSD+ ++L L+K
Sbjct: 249 ELDPKAYTTYLKSRPPAWEEAAIKDLQRAMKDTEIGGRSEGAHIHIVHLSDAKTSLGLLK 308

Query: 307 EAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDLS 366
           +AK++G  VSVETC HYLAFS E++ DGDTRFKCAPPIRD  N++ LW+AL+        
Sbjct: 309 DAKQNGARVSVETCPHYLAFSAEEVPDGDTRFKCAPPIRDSTNRDNLWEALL-------- 368

Query: 367 CYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDLS 426
                              +GHIDMLSSDHSP+ P LKL + G+FL+AWGG+SSLQF L 
Sbjct: 369 -------------------DGHIDMLSSDHSPSAPDLKLMEEGNFLRAWGGISSLQFVLP 428

Query: 427 ATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDI-P 486
            TWSH KK G+++ Q+A WWSERPA LAGL+ KGA+  G  ADIV W P+ ++ ++D  P
Sbjct: 429 VTWSHGKKYGISLNQLASWWSERPAMLAGLKKKGAVLPGYRADIVVWKPEAQFHLDDSHP 488

Query: 487 VYLKHPSISAYMGMKLSGKVLATFVRGQLVY-EEKHAPAACGTPILAR 528
           VY KH +ISAY+G +LSGK+L+TFV G LV+ E+KHA AACG PILA+
Sbjct: 489 VYHKHRNISAYLGKQLSGKILSTFVGGNLVFAEDKHAKAACGAPILAK 507

BLAST of CSPI01G12020.2 vs. ExPASy Swiss-Prot
Match: Q82LL4 (Allantoinase OS=Streptomyces avermitilis (strain ATCC 31267 / DSM 46492 / JCM 5070 / NBRC 14893 / NCIMB 12804 / NRRL 8165 / MA-4680) OX=227882 GN=allB PE=3 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 1.0e-88
Identity = 184/470 (39.15%), Postives = 265/470 (56.38%), Query Frame = 0

Query: 42  ITSKRIVTPQGVISGAVEINGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVH 101
           + S R++TP+G    AV +  GKI +++  + +     G  + D  D V++PGLVD HVH
Sbjct: 8   LRSTRVITPEGTRPAAVAVAAGKITAVLPHDAE--VPAGARLEDLGDDVLLPGLVDTHVH 67

Query: 102 LDDPGRSEWEGFPSGTKAAAAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVG 161
           ++DPGR+ WEGF + T+AAAAGG+TTLVDMPLN+ P TT+   L+ K   A  + ++DVG
Sbjct: 68  VNDPGRTHWEGFWTATRAAAAGGITTLVDMPLNSLPPTTTVGNLRTKRDVAADKAHIDVG 127

Query: 162 FWGGLVPENAFNASALENLLKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRP 221
           FWGG +P+   N   L  L  AG  G K+F+ PSG+++FP  +   +   ++ +A +   
Sbjct: 128 FWGGALPD---NVKDLRPLHDAGVFGFKAFLSPSGVDEFPELDQERLARSMAEIAGFGGL 187

Query: 222 LLVHSEIEQSSPSPVQLEGSQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAE 281
           L+VH+E     P  +     +  PR Y+ +LA+RP   E+ A+  LL             
Sbjct: 188 LIVHAE----DPHHLAAAPQRGGPR-YTDFLASRPRDAEDTAIANLLAQAKRL------- 247

Query: 282 GAHIHVAHLSDSGSTLELIKEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRD 341
            A +HV HLS S   L LI  AK  G  V+VETC HYL  + E++ DG + FKC PPIR+
Sbjct: 248 NARVHVLHLS-SSDALPLIAGAKAEGVRVTVETCPHYLTLTAEEVPDGASEFKCCPPIRE 307

Query: 342 KANKEKLWDALMVGVFVDLSCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLP 401
            AN++ LW A                           L +G ID + +DHSP+   LK  
Sbjct: 308 AANQDLLWQA---------------------------LADGTIDCVVTDHSPSTADLK-- 367

Query: 402 DSGDFLKAWGGVSSLQFDLSATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGK 461
            + DF  AWGG+S LQ  L A W+ A++RG ++E +  W S R A+L GL  KGAI  G+
Sbjct: 368 -TDDFATAWGGISGLQLSLPAIWTEARRRGHSLEDVVRWMSARTARLVGLAQKGAIEAGR 427

Query: 462 HADIVAWAPDEEYDVNDIPVYLKHPS-ISAYMGMKLSGKVLATFVRGQLV 511
            AD    APDE + V+  P  L+H + ++AY G  LSG V +T++RG+ +
Sbjct: 428 DADFAVLAPDETFTVD--PAALQHRNRVTAYAGKTLSGVVKSTWLRGERI 427

BLAST of CSPI01G12020.2 vs. ExPASy Swiss-Prot
Match: Q9RKU5 (Allantoinase OS=Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) OX=100226 GN=allB PE=3 SV=1)

HSP 1 Score: 328.2 bits (840), Expect = 1.8e-88
Identity = 186/471 (39.49%), Postives = 269/471 (57.11%), Query Frame = 0

Query: 42  ITSKRIVTPQGVISGAVEINGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVH 101
           + S R++TP+G  + +V + G KI +++  +       G  + D  D VV+PGLVD HVH
Sbjct: 8   LRSTRVITPEGTRAASVAVTGEKITAVLPYDAP--VPAGARLEDVGDHVVLPGLVDTHVH 67

Query: 102 LDDPGRSEWEGFPSGTKAAAAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVG 161
           ++DPGR+EWEGF + T+AAAAGG+TTLVDMPLN+ P TT+ + L+ K + A  + ++DVG
Sbjct: 68  VNDPGRTEWEGFWTATRAAAAGGITTLVDMPLNSIPPTTTVDNLRTKREVAADKAHIDVG 127

Query: 162 FWGGLVPENAFNASALENLLKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRP 221
           FWGG +P+   N   L  L +AG  G K+F+ PSG+++FP  +   +   L+ +A +   
Sbjct: 128 FWGGALPD---NVKDLRPLHEAGVFGFKAFLSPSGVDEFPHLDQEQLARSLAEIAAFDGL 187

Query: 222 LLVHSEIEQSSPSPVQLEGSQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAE 281
           L+VH+E     P  +     Q  P+ Y+ +LA+RP   E+ A+  LL             
Sbjct: 188 LIVHAE----DPHHLAAAPQQGGPK-YTHFLASRPRDAEDTAIATLLAQAKRF------- 247

Query: 282 GAHIHVAHLSDSGSTLELIKEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRD 341
            A +HV HLS S   L LI EA+  G  V+VETC HYL  + E++ DG + FKC PPIR+
Sbjct: 248 NARVHVLHLS-SSDALPLIAEARADGVRVTVETCPHYLTLTAEEVPDGASEFKCCPPIRE 307

Query: 342 KANKEKLWDALMVGVFVDLSCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLP 401
            AN++ LW A                           L +G ID + +DHSP+   LK  
Sbjct: 308 AANQDLLWQA---------------------------LADGTIDCVVTDHSPSTADLK-- 367

Query: 402 DSGDFLKAWGGVSSLQFDLSATWSHAKKRGVTMEQIALWWSERPAKLAGLEL-KGAIAIG 461
            + DF  AWGG++ LQ  L A W+ A+ RG+ +E +  W SER A L GL+  KGAIA G
Sbjct: 368 -TDDFATAWGGIAGLQLSLPAMWTAARGRGLGLEDVVRWMSERTAALVGLDARKGAIAPG 427

Query: 462 KHADIVAWAPDEEYDVNDIPVYLKHPS-ISAYMGMKLSGKVLATFVRGQLV 511
             AD    APDE + V+  P  L+H + ++AY G  L G V +T++RG+ +
Sbjct: 428 HDADFAVLAPDETFTVD--PAALQHRNRVTAYAGKTLYGVVKSTWLRGERI 428

BLAST of CSPI01G12020.2 vs. ExPASy Swiss-Prot
Match: Q55C91 (Probable allantoinase 2 OS=Dictyostelium discoideum OX=44689 GN=allB2 PE=3 SV=1)

HSP 1 Score: 310.1 bits (793), Expect = 4.9e-83
Identity = 193/530 (36.42%), Postives = 291/530 (54.91%), Query Frame = 0

Query: 6   WKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEINGGKI 65
           WK++  +  L  +F  F L   +D+      +K   I  + ++    VI  ++ I  GK 
Sbjct: 4   WKVIFSIWFL--LFQNFVLSAKNDD------NKLKVIRGRNVIYNGNVIPLSILIRNGKT 63

Query: 66  VSIVKEEEKHGKIMGNHVVDY--------KDAVVMPGLVDVHVHLDDPGRSEWEGFPSGT 125
           + I        K+  N+ + Y        +D ++M GLVD HVH+++PGR+EWEGF S T
Sbjct: 64  IGIKDYSFNPKKLNENYEILYDDRECNNNEDFIIMGGLVDSHVHVNEPGRTEWEGFESAT 123

Query: 126 KAAAAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASAL 185
            AAAAGGVTT+VDMPLN+ P TTS + L  KI++ +G++ VDVG  GG+VP N+     +
Sbjct: 124 SAAAAGGVTTIVDMPLNSSPVTTSFKNLLDKIESMKGKLRVDVGLLGGIVPGNSKEIKKM 183

Query: 186 ENLLKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRP-------LLVHSEIEQ 245
             +L+ G LG KSF+ PSGI++FP  N + I+E ++ +   K         ++ H+E+E+
Sbjct: 184 --VLQGGVLGFKSFLLPSGIDEFPPVNENDIQEAMNEMKLLKCQYNNSDVIMMFHAEVEE 243

Query: 246 S-SPSPVQLEGSQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAH 305
               + V+L+    DP+ Y TYL +RP   E  A+ +L+ +T         +    H+ H
Sbjct: 244 PIKEATVRLKNENADPKLYKTYLDSRPKISENQAISKLIDITRQN------QIVSTHIVH 303

Query: 306 LSDSGSTLELIKEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLW 365
           LS S S +E I+EA   G  +S ET  +YL  + E +  G+T FK APP+R+  NKE LW
Sbjct: 304 LSSSES-IEQIREAMDQGVPISAETTYNYLHLTSESVPYGNTLFKSAPPVREHENKELLW 363

Query: 366 DALMVGVFVDLSCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKL-----PDSG 425
           +A++                            G I ++ SDHSP   +LK         G
Sbjct: 364 NAII---------------------------NGTIKLIVSDHSPCTINLKQLKEDNQSIG 423

Query: 426 DFLKAWGGVSSLQFDLSATWSHAKKRGVTMEQIALWWSERPAKLAGL-ELKGAIAIGKHA 485
           DFLKAWGG+SSL+  L   W+  K RG+ + Q++ W S  P+KL GL + KG+I IG+ A
Sbjct: 424 DFLKAWGGISSLELGLPIIWTECKNRGIPITQLSEWLSNGPSKLVGLNDRKGSIEIGRDA 483

Query: 486 DIVAWAPDEEYDVNDIPVYLKHPSISAYMGMKLSGKVLATFVRGQLVYEE 514
           D V + P+E + VN+  ++LK+   SAY G KL G V  T +RG  ++++
Sbjct: 484 DFVIFNPNESFIVNEKKLFLKN-KFSAYNGEKLFGVVYETILRGNSIFKK 488

BLAST of CSPI01G12020.2 vs. ExPASy TrEMBL
Match: A0A0A0LUK0 (Allantoinase OS=Cucumis sativus OX=3659 GN=Csa_1G073760 PE=3 SV=1)

HSP 1 Score: 1006.1 bits (2600), Expect = 5.3e-290
Identity = 501/530 (94.53%), Postives = 502/530 (94.72%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           NGGKIVSIVKEEEKHGKIMGNHVVDY DAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYADAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL
Sbjct: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           LKAGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG
Sbjct: 181 LKAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
           SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI
Sbjct: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM       
Sbjct: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP 480
           SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP
Sbjct: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP 480

Query: 481 VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD 531
           VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD
Sbjct: 481 VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD 503

BLAST of CSPI01G12020.2 vs. ExPASy TrEMBL
Match: A0A1S3B681 (Allantoinase OS=Cucumis melo OX=3656 GN=LOC103486263 PE=3 SV=1)

HSP 1 Score: 985.7 bits (2547), Expect = 7.5e-284
Identity = 488/530 (92.08%), Postives = 497/530 (93.77%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLTLLAS+FL FYLKDPS+NECSLLPHKHFWITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTLLASVFLVFYLKDPSENECSLLPHKHFWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           NGG+IVSIVKEEE+HGKIMGNHVVDY DAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NGGRIVSIVKEEERHGKIMGNHVVDYADAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL
Sbjct: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           LKAGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQ EG
Sbjct: 181 LKAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQREG 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
           SQDDPR+YSTYL TRPPSWEEAAVRELLKVT+NTRPGGPAEGAHIHVAHLSDSGSTLELI
Sbjct: 241 SQDDPRSYSTYLTTRPPSWEEAAVRELLKVTNNTRPGGPAEGAHIHVAHLSDSGSTLELI 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM       
Sbjct: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPTVP LKLPDSGDFLKAWGG+SSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTVPDLKLPDSGDFLKAWGGISSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP 480
           SATWSHAKKRGVTMEQ+ALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP
Sbjct: 421 SATWSHAKKRGVTMEQLALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP 480

Query: 481 VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD 531
           VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILA VTD
Sbjct: 481 VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILATVTD 503

BLAST of CSPI01G12020.2 vs. ExPASy TrEMBL
Match: A0A6J1GUY1 (Allantoinase OS=Cucurbita moschata OX=3662 GN=LOC111457155 PE=3 SV=1)

HSP 1 Score: 912.5 bits (2357), Expect = 8.0e-262
Identity = 449/527 (85.20%), Postives = 477/527 (90.51%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLT++ SIF+ FYL+ PS+N+CSLLP+KH+WITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTVIVSIFVVFYLQHPSENKCSLLPYKHYWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           N GKIVSIVKEEE+HGKIMG HVVDY DAVV PGLVD+HVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NEGKIVSIVKEEERHGKIMGAHVVDYSDAVVFPGLVDIHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPST SEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNA+ALE L
Sbjct: 121 AAGGVTTLVDMPLNNFPSTVSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNATALERL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           L AGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSE++ SSPS  QLE 
Sbjct: 181 LSAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEVQPSSPSSTQLED 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
            QDDPR+Y TYLATRPPSWEEAAVRELL VT NTRPGGPAEGAH+HVAHLSDSGSTLELI
Sbjct: 241 MQDDPRSYLTYLATRPPSWEEAAVRELLTVTKNTRPGGPAEGAHLHVAHLSDSGSTLELI 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKR GDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLW+ALM       
Sbjct: 301 KEAKRHGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWEALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPT+P LKL DSGDFLKAWGG+SSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTLPELKLLDSGDFLKAWGGISSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVND-I 480
           SATWSHAKKRGVT+EQ+ALWWSERPAKLAGL+LKGAIAIGKHADIVAWAPDEE+DV+D  
Sbjct: 421 SATWSHAKKRGVTIEQLALWWSERPAKLAGLDLKGAIAIGKHADIVAWAPDEEFDVDDKF 480

Query: 481 PVYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILA 527
           P+++KHPSISAYMGMKLSGKVLATFVRGQLVYE+KHAPAACGTPILA
Sbjct: 481 PIHIKHPSISAYMGMKLSGKVLATFVRGQLVYEDKHAPAACGTPILA 500

BLAST of CSPI01G12020.2 vs. ExPASy TrEMBL
Match: A0A6J1K128 (Allantoinase OS=Cucurbita maxima OX=3661 GN=LOC111490122 PE=3 SV=1)

HSP 1 Score: 907.9 bits (2345), Expect = 2.0e-260
Identity = 447/527 (84.82%), Postives = 477/527 (90.51%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLT++ASIF+ FYL+  S+N+CSLLP+KH+WITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTVIASIFVVFYLQHLSENKCSLLPYKHYWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           N GKIVSIVKEEE+HGKIMG HVVDY DAVV PGLVD+HVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NEGKIVSIVKEEERHGKIMGAHVVDYSDAVVFPGLVDIHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPST SEETLKLKIKAAEGRIYVD+GFWGGLVPENAFNA+ALE L
Sbjct: 121 AAGGVTTLVDMPLNNFPSTVSEETLKLKIKAAEGRIYVDLGFWGGLVPENAFNATALERL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           L AGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSE++ SSPS  QLE 
Sbjct: 181 LSAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEVQPSSPSSTQLED 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
            QDDPR+Y TYLATRPPSWEEAAVRELL VT NTRPGGPAEGAH+HVAHLSDSGSTL+LI
Sbjct: 241 MQDDPRSYLTYLATRPPSWEEAAVRELLTVTKNTRPGGPAEGAHLHVAHLSDSGSTLDLI 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKR GDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLW+ALM       
Sbjct: 301 KEAKRHGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWEALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPT+P LKL DSGDFLKAWGG+SSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTLPELKLLDSGDFLKAWGGISSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVND-I 480
           SATWSHAKKRGVT+EQ+ALWWSERPAKLAGL+LKGAIAIGKHADIVAWAPDEE+DV+D  
Sbjct: 421 SATWSHAKKRGVTIEQLALWWSERPAKLAGLDLKGAIAIGKHADIVAWAPDEEFDVDDKF 480

Query: 481 PVYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILA 527
           P+++KHPSISAYMGMKLSGKVLATFVRGQLVYE+KHAPAACGTPILA
Sbjct: 481 PIHIKHPSISAYMGMKLSGKVLATFVRGQLVYEDKHAPAACGTPILA 500

BLAST of CSPI01G12020.2 vs. ExPASy TrEMBL
Match: A0A6J1CH40 (Allantoinase OS=Momordica charantia OX=3673 GN=LOC111010798 PE=3 SV=1)

HSP 1 Score: 903.3 bits (2333), Expect = 4.9e-259
Identity = 446/527 (84.63%), Postives = 482/527 (91.46%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           +NLLQWKLLPLLTLLASIF+ FYL+ PS N+CSLLP++H+WITSKRIVTPQGVISGAVEI
Sbjct: 2   LNLLQWKLLPLLTLLASIFVVFYLQYPSKNDCSLLPYQHYWITSKRIVTPQGVISGAVEI 61

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           N GKIVSIV+EEE+HGKI G+HVVD+ DAVVMPGLVD+HVHLDDPGRSEWEGFPSGTKAA
Sbjct: 62  NEGKIVSIVREEERHGKITGSHVVDFSDAVVMPGLVDIHVHLDDPGRSEWEGFPSGTKAA 121

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALE+L
Sbjct: 122 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALESL 181

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           L AGALGLKSFMCPSGI+DFPMT+I+HIKEGLSVLAKYKRPLLVHSEI++SSPS  QLE 
Sbjct: 182 LSAGALGLKSFMCPSGIDDFPMTDITHIKEGLSVLAKYKRPLLVHSEIQKSSPSSSQLED 241

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
           SQDDPR+YSTYLATRPPSWEEAAVRELL VT+NTRPGGPAEGAH+HV HLSDSGSTLELI
Sbjct: 242 SQDDPRSYSTYLATRPPSWEEAAVRELLTVTNNTRPGGPAEGAHLHVVHLSDSGSTLELI 301

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKR GDSVSVETCTHYLAFSEEDIK+GDTRFKCAPP+RDKANKEKLW+ALM       
Sbjct: 302 KEAKRIGDSVSVETCTHYLAFSEEDIKNGDTRFKCAPPLRDKANKEKLWEALM------- 361

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPT+P LKL DSGDFLKAWGGVSSLQFDL
Sbjct: 362 --------------------EGHIDMLSSDHSPTLPELKLLDSGDFLKAWGGVSSLQFDL 421

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVND-I 480
           SATWSHAKKRGV++EQ+ALWWSERPAKLAGLELKGAIA+GKHADIVA+ P+EE+DVND +
Sbjct: 422 SATWSHAKKRGVSIEQLALWWSERPAKLAGLELKGAIAVGKHADIVAFVPEEEFDVNDKL 481

Query: 481 PVYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILA 527
           PVYL+HPSISAYMGMKLSGKVLATFVRGQLV++EKHAPAACG PILA
Sbjct: 482 PVYLRHPSISAYMGMKLSGKVLATFVRGQLVFKEKHAPAACGAPILA 501

BLAST of CSPI01G12020.2 vs. NCBI nr
Match: XP_004146596.1 (allantoinase [Cucumis sativus] >KGN64669.1 hypothetical protein Csa_014334 [Cucumis sativus])

HSP 1 Score: 1006.1 bits (2600), Expect = 1.1e-289
Identity = 501/530 (94.53%), Postives = 502/530 (94.72%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           NGGKIVSIVKEEEKHGKIMGNHVVDY DAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYADAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL
Sbjct: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           LKAGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG
Sbjct: 181 LKAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
           SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI
Sbjct: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM       
Sbjct: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP 480
           SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP
Sbjct: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP 480

Query: 481 VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD 531
           VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD
Sbjct: 481 VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD 503

BLAST of CSPI01G12020.2 vs. NCBI nr
Match: XP_008442380.1 (PREDICTED: allantoinase [Cucumis melo])

HSP 1 Score: 985.7 bits (2547), Expect = 1.5e-283
Identity = 488/530 (92.08%), Postives = 497/530 (93.77%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLTLLAS+FL FYLKDPS+NECSLLPHKHFWITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTLLASVFLVFYLKDPSENECSLLPHKHFWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           NGG+IVSIVKEEE+HGKIMGNHVVDY DAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NGGRIVSIVKEEERHGKIMGNHVVDYADAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL
Sbjct: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           LKAGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQ EG
Sbjct: 181 LKAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQREG 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
           SQDDPR+YSTYL TRPPSWEEAAVRELLKVT+NTRPGGPAEGAHIHVAHLSDSGSTLELI
Sbjct: 241 SQDDPRSYSTYLTTRPPSWEEAAVRELLKVTNNTRPGGPAEGAHIHVAHLSDSGSTLELI 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM       
Sbjct: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPTVP LKLPDSGDFLKAWGG+SSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTVPDLKLPDSGDFLKAWGGISSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP 480
           SATWSHAKKRGVTMEQ+ALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP
Sbjct: 421 SATWSHAKKRGVTMEQLALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDIP 480

Query: 481 VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD 531
           VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILA VTD
Sbjct: 481 VYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILATVTD 503

BLAST of CSPI01G12020.2 vs. NCBI nr
Match: XP_038895576.1 (allantoinase isoform X1 [Benincasa hispida])

HSP 1 Score: 959.5 bits (2479), Expect = 1.2e-275
Identity = 474/531 (89.27%), Postives = 493/531 (92.84%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLTL+ASIFL FYL+DPS N CSLLPHKHFWITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTLVASIFLVFYLQDPSQNGCSLLPHKHFWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           N GKIVSIVKEEE+HGKIMG+HV+DY DAVVMPGLVD+HVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NEGKIVSIVKEEERHGKIMGHHVIDYVDAVVMPGLVDIHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFN+SALE+L
Sbjct: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNSSALESL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           L AGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSEI+QSSPSP+QLEG
Sbjct: 181 LSAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEIQQSSPSPLQLEG 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
           SQDDPR+YSTYLATRPPSWEEAAVRELLKVT+NTRPGG AEGAHIHVAHLSDSGSTLEL+
Sbjct: 241 SQDDPRSYSTYLATRPPSWEEAAVRELLKVTNNTRPGGSAEGAHIHVAHLSDSGSTLELL 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM       
Sbjct: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPT+P LKLPDSGDFLKAWGG+SSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTLPDLKLPDSGDFLKAWGGISSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVND-I 480
           SATWSHAKKRGVTMEQ+ALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVND  
Sbjct: 421 SATWSHAKKRGVTMEQLALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVNDKF 480

Query: 481 PVYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILARVTD 531
           P+YLKHPSISAYMGM+LSGKVLATFVRGQLVYEEKHAP ACGTPILARVTD
Sbjct: 481 PIYLKHPSISAYMGMRLSGKVLATFVRGQLVYEEKHAPTACGTPILARVTD 504

BLAST of CSPI01G12020.2 vs. NCBI nr
Match: XP_022955084.1 (allantoinase [Cucurbita moschata])

HSP 1 Score: 912.5 bits (2357), Expect = 1.7e-261
Identity = 449/527 (85.20%), Postives = 477/527 (90.51%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLT++ SIF+ FYL+ PS+N+CSLLP+KH+WITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTVIVSIFVVFYLQHPSENKCSLLPYKHYWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           N GKIVSIVKEEE+HGKIMG HVVDY DAVV PGLVD+HVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NEGKIVSIVKEEERHGKIMGAHVVDYSDAVVFPGLVDIHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPST SEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNA+ALE L
Sbjct: 121 AAGGVTTLVDMPLNNFPSTVSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNATALERL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           L AGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSE++ SSPS  QLE 
Sbjct: 181 LSAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEVQPSSPSSTQLED 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
            QDDPR+Y TYLATRPPSWEEAAVRELL VT NTRPGGPAEGAH+HVAHLSDSGSTLELI
Sbjct: 241 MQDDPRSYLTYLATRPPSWEEAAVRELLTVTKNTRPGGPAEGAHLHVAHLSDSGSTLELI 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKR GDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLW+ALM       
Sbjct: 301 KEAKRHGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWEALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPT+P LKL DSGDFLKAWGG+SSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTLPELKLLDSGDFLKAWGGISSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVND-I 480
           SATWSHAKKRGVT+EQ+ALWWSERPAKLAGL+LKGAIAIGKHADIVAWAPDEE+DV+D  
Sbjct: 421 SATWSHAKKRGVTIEQLALWWSERPAKLAGLDLKGAIAIGKHADIVAWAPDEEFDVDDKF 480

Query: 481 PVYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILA 527
           P+++KHPSISAYMGMKLSGKVLATFVRGQLVYE+KHAPAACGTPILA
Sbjct: 481 PIHIKHPSISAYMGMKLSGKVLATFVRGQLVYEDKHAPAACGTPILA 500

BLAST of CSPI01G12020.2 vs. NCBI nr
Match: KAG6573259.1 (Allantoinase, partial [Cucurbita argyrosperma subsp. sororia] >KAG7012428.1 Allantoinase [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 912.1 bits (2356), Expect = 2.2e-261
Identity = 448/527 (85.01%), Postives = 477/527 (90.51%), Query Frame = 0

Query: 1   MNLLQWKLLPLLTLLASIFLFFYLKDPSDNECSLLPHKHFWITSKRIVTPQGVISGAVEI 60
           MNLLQWKLLPLLT++ SIF+ FYL+ PS+N+CSLLP+KH+WITSKRIVTPQGVISGAVEI
Sbjct: 1   MNLLQWKLLPLLTVIVSIFVVFYLQHPSENKCSLLPYKHYWITSKRIVTPQGVISGAVEI 60

Query: 61  NGGKIVSIVKEEEKHGKIMGNHVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTKAA 120
           N GKIVSIVKEEE+HGKIMG HVVDY DAVV PGLVD+HVHLDDPGRSEWEGFPSGTKAA
Sbjct: 61  NEGKIVSIVKEEERHGKIMGAHVVDYSDAVVFPGLVDIHVHLDDPGRSEWEGFPSGTKAA 120

Query: 121 AAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALENL 180
           AAGGVTTLVDMPLNNFPST SEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNA+ALE L
Sbjct: 121 AAGGVTTLVDMPLNNFPSTVSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNATALERL 180

Query: 181 LKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQLEG 240
           L AGALGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVHSE++ SSPS  QLE 
Sbjct: 181 LSAGALGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHSEVQPSSPSSTQLED 240

Query: 241 SQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTLELI 300
            QDDPR+Y TYLATRPPSWEEAAVRELL VT NTRPGGPAEGAH+HVAHLSDSGSTLELI
Sbjct: 241 MQDDPRSYLTYLATRPPSWEEAAVRELLTVTKNTRPGGPAEGAHLHVAHLSDSGSTLELI 300

Query: 301 KEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVFVDL 360
           KEAKR GDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLW+ALM       
Sbjct: 301 KEAKRHGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWEALM------- 360

Query: 361 SCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQFDL 420
                               EGHIDMLSSDHSPT+P LKL DSGDFLKAWGG+SSLQFDL
Sbjct: 361 --------------------EGHIDMLSSDHSPTLPELKLLDSGDFLKAWGGISSLQFDL 420

Query: 421 SATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVND-I 480
           SATWSHAKKRGVT+EQ+ALWWSERPAKLAGL+LKGAIAIGKHADIVAWAPDEE+DV+D  
Sbjct: 421 SATWSHAKKRGVTIEQLALWWSERPAKLAGLDLKGAIAIGKHADIVAWAPDEEFDVDDKF 480

Query: 481 PVYLKHPSISAYMGMKLSGKVLATFVRGQLVYEEKHAPAACGTPILA 527
           P+++KHPSISAYMGMKLSGKVLATF+RGQLVYE+KHAPAACGTPILA
Sbjct: 481 PIHIKHPSISAYMGMKLSGKVLATFIRGQLVYEDKHAPAACGTPILA 500

BLAST of CSPI01G12020.2 vs. TAIR 10
Match: AT4G04955.1 (allantoinase )

HSP 1 Score: 707.2 bits (1824), Expect = 9.9e-204
Identity = 352/531 (66.29%), Postives = 420/531 (79.10%), Query Frame = 0

Query: 3   LLQWKLLPLLTLLASIFLFFYLKDPS---DNECSLLPHKHFWITSKRIVTPQGVISGAVE 62
           LLQW+LLPLL L+ ++F FF+    S   +N+CSLLPH H+WI+SKRIVTP G+ISG+VE
Sbjct: 5   LLQWRLLPLLALIVALFSFFFASPRSLQGNNKCSLLPHDHYWISSKRIVTPNGLISGSVE 64

Query: 63  INGGKIVSIVKEEEKHGKIMGN-HVVDYKDAVVMPGLVDVHVHLDDPGRSEWEGFPSGTK 122
           + GG IVS+VKE + H        V+DY +AV+MPGL+DVHVHLDDPGRSEWEGFPSGTK
Sbjct: 65  VKGGIIVSVVKEVDWHKSQRSRVKVIDYGEAVLMPGLIDVHVHLDDPGRSEWEGFPSGTK 124

Query: 123 AAAAGGVTTLVDMPLNNFPSTTSEETLKLKIKAAEGRIYVDVGFWGGLVPENAFNASALE 182
           AAAAGG+TTLVDMPLN+FPST S ETLKLKI+AA+ RI+VDVGFWGGLVP+NA N+SALE
Sbjct: 125 AAAAGGITTLVDMPLNSFPSTVSPETLKLKIEAAKNRIHVDVGFWGGLVPDNALNSSALE 184

Query: 183 NLLKAGALGLKSFMCPSGINDFPMTNISHIKEGLSVLAKYKRPLLVHSEIEQSSPSPVQL 242
           +LL AG LGLKSFMCPSGINDFPMTNI+HIKEGLSVLAKYKRPLLVH+EIE+     +++
Sbjct: 185 SLLDAGVLGLKSFMCPSGINDFPMTNITHIKEGLSVLAKYKRPLLVHAEIERD----LEI 244

Query: 243 E-GSQDDPRTYSTYLATRPPSWEEAAVRELLKVTSNTRPGGPAEGAHIHVAHLSDSGSTL 302
           E GS++DPR+Y TYL TRP SWEE A+R LL VT NTR GG AEGAH+H+ HLSD+ S+L
Sbjct: 245 EDGSENDPRSYLTYLKTRPTSWEEGAIRNLLSVTENTRIGGSAEGAHLHIVHLSDASSSL 304

Query: 303 ELIKEAKRSGDSVSVETCTHYLAFSEEDIKDGDTRFKCAPPIRDKANKEKLWDALMVGVF 362
           +LIKEAK  GDSV+VETC HYLAFS E+I +GDTRFKC+PPIRD AN+EKLW+ALM    
Sbjct: 305 DLIKEAKGKGDSVTVETCPHYLAFSAEEIPEGDTRFKCSPPIRDAANREKLWEALM---- 364

Query: 363 VDLSCYIFHKSSLNLHLFQLTLQEGHIDMLSSDHSPTVPHLKLPDSGDFLKAWGGVSSLQ 422
                                  EG IDMLSSDHSPT P LKL   G+FLKAWGG+SSLQ
Sbjct: 365 -----------------------EGDIDMLSSDHSPTKPELKLMSDGNFLKAWGGISSLQ 424

Query: 423 FDLSATWSHAKKRGVTMEQIALWWSERPAKLAGLELKGAIAIGKHADIVAWAPDEEYDVN 482
           F L  TWS+ KK GVT+EQ+  WWS+RP+KLAGL  KGA+ +GKHAD+V W P+ E+DV+
Sbjct: 425 FVLPITWSYGKKYGVTLEQVTSWWSDRPSKLAGLHSKGAVTVGKHADLVVWEPEAEFDVD 484

Query: 483 -DIPVYLKHPSISAYMGMKLSGKVLATFVRGQLVY-EEKHAPAACGTPILA 527
            D P++ KHPSISAY+G +LSGKV++TFVRG LV+ E KHA  ACG+  LA
Sbjct: 485 EDHPIHFKHPSISAYLGRRLSGKVVSTFVRGNLVFGEGKHASDACGSLQLA 504

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q94AP01.4e-20266.29Allantoinase OS=Arabidopsis thaliana OX=3702 GN=ALN PE=1 SV=1[more]
B9FDB82.3e-17356.63Probable allantoinase OS=Oryza sativa subsp. japonica OX=39947 GN=ALN PE=2 SV=1[more]
Q82LL41.0e-8839.15Allantoinase OS=Streptomyces avermitilis (strain ATCC 31267 / DSM 46492 / JCM 50... [more]
Q9RKU51.8e-8839.49Allantoinase OS=Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) OX=... [more]
Q55C914.9e-8336.42Probable allantoinase 2 OS=Dictyostelium discoideum OX=44689 GN=allB2 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LUK05.3e-29094.53Allantoinase OS=Cucumis sativus OX=3659 GN=Csa_1G073760 PE=3 SV=1[more]
A0A1S3B6817.5e-28492.08Allantoinase OS=Cucumis melo OX=3656 GN=LOC103486263 PE=3 SV=1[more]
A0A6J1GUY18.0e-26285.20Allantoinase OS=Cucurbita moschata OX=3662 GN=LOC111457155 PE=3 SV=1[more]
A0A6J1K1282.0e-26084.82Allantoinase OS=Cucurbita maxima OX=3661 GN=LOC111490122 PE=3 SV=1[more]
A0A6J1CH404.9e-25984.63Allantoinase OS=Momordica charantia OX=3673 GN=LOC111010798 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
XP_004146596.11.1e-28994.53allantoinase [Cucumis sativus] >KGN64669.1 hypothetical protein Csa_014334 [Cucu... [more]
XP_008442380.11.5e-28392.08PREDICTED: allantoinase [Cucumis melo][more]
XP_038895576.11.2e-27589.27allantoinase isoform X1 [Benincasa hispida][more]
XP_022955084.11.7e-26185.20allantoinase [Cucurbita moschata][more]
KAG6573259.12.2e-26185.01Allantoinase, partial [Cucurbita argyrosperma subsp. sororia] >KAG7012428.1 Alla... [more]
Match NameE-valueIdentityDescription
AT4G04955.19.9e-20466.29allantoinase [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006680Amidohydrolase-relatedPFAMPF01979Amidohydro_1coord: 90..510
e-value: 9.6E-25
score: 87.6
NoneNo IPR availableGENE3D3.20.20.140coord: 30..362
e-value: 2.5E-80
score: 272.5
NoneNo IPR availableGENE3D3.20.20.140coord: 366..514
e-value: 2.7E-34
score: 120.7
NoneNo IPR availablePANTHERPTHR43668ALLANTOINASEcoord: 373..527
NoneNo IPR availablePANTHERPTHR43668ALLANTOINASEcoord: 35..355
NoneNo IPR availablePANTHERPTHR43668:SF2ZGC:103559coord: 35..355
NoneNo IPR availablePANTHERPTHR43668:SF2ZGC:103559coord: 373..527
IPR011059Metal-dependent hydrolase, composite domain superfamilySUPERFAMILY51338Composite domain of metallo-dependent hydrolasescoord: 40..515
IPR032466Metal-dependent hydrolaseSUPERFAMILY51556Metallo-dependent hydrolasescoord: 94..454

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CSPI01G12020CSPI01G12020gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI01G12020.2.utr5p1CSPI01G12020.2.utr5p1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI01G12020.2.cds1CSPI01G12020.2.cds1CDS
CSPI01G12020.2.cds2CSPI01G12020.2.cds2CDS
CSPI01G12020.2.cds3CSPI01G12020.2.cds3CDS
CSPI01G12020.2.cds4CSPI01G12020.2.cds4CDS
CSPI01G12020.2.cds5CSPI01G12020.2.cds5CDS
CSPI01G12020.2.cds6CSPI01G12020.2.cds6CDS
CSPI01G12020.2.cds7CSPI01G12020.2.cds7CDS
CSPI01G12020.2.cds8CSPI01G12020.2.cds8CDS
CSPI01G12020.2.cds9CSPI01G12020.2.cds9CDS
CSPI01G12020.2.cds10CSPI01G12020.2.cds10CDS
CSPI01G12020.2.cds11CSPI01G12020.2.cds11CDS
CSPI01G12020.2.cds12CSPI01G12020.2.cds12CDS
CSPI01G12020.2.cds13CSPI01G12020.2.cds13CDS
CSPI01G12020.2.cds14CSPI01G12020.2.cds14CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI01G12020.2.utr3p1CSPI01G12020.2.utr3p1three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CSPI01G12020.2CSPI01G12020.2-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000256 allantoin catabolic process
biological_process GO:0006995 cellular response to nitrogen starvation
biological_process GO:0006145 purine nucleobase catabolic process
biological_process GO:0010136 ureide catabolic process
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
molecular_function GO:0004038 allantoinase activity
molecular_function GO:0050897 cobalt ion binding
molecular_function GO:0008270 zinc ion binding