CSPI01G10060 (gene) Wild cucumber (PI 183967)

Name: CSPI01G10060
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: allantoate amidohydrolase
Location: Chr1 : 6268988 .. 6273349 (-)
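The location field can be split into chromosome, coordinates, and strand with a short script. The following is a minimal sketch, not part of the database record; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a 'Chr : start .. end (strand)' string into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 6268988 .. 6273349 (-)")
print(loc)
```

Under the inclusive-coordinate assumption the feature spans 4362 bp on the minus strand of Chr1.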
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GTATTAATTGATTTTAGAATAATTAAGGAGAAGTTTAGTTTAGTAAGGTATTTTAAATGGAATTCGTAATATTATTAGAAATGGCGTAAACAGTAATAGCCTCCACCAATAATTAATGTGTAGACTCATGGAGGCCCACCTGAATTCCGCACCACAAATACAGTGAAGATTCCGGAATTCACAACCCGGGAGCCACAAGCCAACAAAGCTAATTTCCAAATTCCACCAAAAGGCTTAAGGCTTCCGCCCCTCCGCCCCTCCGCCGTCTCTTCTTCCTCCTCCTATCCCTTCACCCATGGCCATTGCCTTTTCCAACTCCGATTCTCTTTCCGCTTTTCTCATCAAAGATCATCACTATCACCACCACCATCATTCTTCCTTCTCCGTTTTGTTCCCTTATCTGCTCTTCTTCCTTTTGTTTTCATCTCCTCCCACCGCTTACGCATTCACTGGTATATTCATCATCTCTACCCTTTTACCCTTCATCTTCGTCATATGCTATTTTTCTCAACTGCCTCGGGATAGAGAAGGGATGTTTCGTTGCTAAAATACATGGAGATGAATCGAAATGAATATCAACCACTCGTTTATGCTTTTACGGATAACTTACATGATTGAATTTCCCTCAACGTTAATGAACTCCTTTGTTTTAATTTCTTTTTCTCATACTTAGTTTCTTCTTGGAGGATTTGACAGATCTGTTCCTTAATATTCATCGAGTTAACTATCGTGGTATAATCATAAATACTGTATATGGCTCATTAAAAGTAGGTTTTCTTTTCGTTCCTTCGTAAGTCGGGAGTAGACTGTTTGGCTGAGTAGTTCAATCAGCTTTTATGATGATATCTTAGTTATCAGAACTATACGTCGAATGAAATTATTTTTCATGTTTAGTGAAGGATTCTGTGTTGAACACTTCATATAATATTCATTTTTTTCAGGAGATGTAGCCGAGGATTCAAAGAACAGAAGGGCTGATTTGTTTGTGCAAATTCTTAAGGACGAAGCGGTAGGAAGATTGAACGAACTAGGGAAGGTAGCGGACCACGATACGATTTTTTTCATTTCTCTGTTTGCTAAGATATGCGTGCGCTTGCAAATGTTTTGAACTTGTTAGATTCAGTGAGTAATTGGACGATTTGCATGTGGATCTTCGTCTGTTTTATGTGATTCTGCTGTAGTTTTCTATTCATAATATAAAAATGTATAGTATGGACCATGTAAATTGATTTTGTTGACTGTTTGGGTCCTTTGAAGTATCCCCTTCATTGATTCGAATATTAAAAAGTTCCGGCTATTTTATATAATTTTATTATTCTTTCTTAGAGGTCTTAATGTAAATTTTGATTGAAAGACTTGGGGAATTTCCCGATTTCTTTTTTATCTCTGAAGTTTTTTCCTTGGTTGCTTTCGTTTTCAATCATCAAACACTTTATTCTATGCCCTACTGCATATATTATAGCTTAGTGCATGAAAAATTGTCCTCCCCTTCCCCCAGATAAACCTGATGAACACCAGAAGTTGAGTTCTAATCCAACTAGAAACTGTTCGATGGTGAGTTTTCTGTGTTGTGTTAAATATTCCGAGTTGGTAACACGAGTTGTCTTACGATTGTTCCCAATTACTAGGTGAGTGATGCTGCTCGATATCTTGAGAGAACGTTCTTGAGTCCAGCTTCTATCAAAGCAAGCTTTCTTCTTCAAAAATGGATGGAGGATGCTGGATTAAGAACGTTAGATTTCTACTTCTCCTCATCTCGAAGAATTCTTAAATTGTTGTTGACTTTTACTCTTGAACATTTTTCAGGTGGGTCGACTGCATGGGCAATCTACATGGTCGAACTGAGGGAAGGAATGCAAGTGCTGAAGCATTACTGATTGGTTCTCATTTGGTAATTATTTGACTTCTCCTAAGCAGATGCAAGAGTTTAGCTTACACGTCAAGCTAAGAATTAAATTTTCTTCTTTTCCTTGTAGGATACCGTTGTTGATGCTGGAAAA
TTTGATGGTGCATTAGGCATCATTTCTGCTATCTCTGCTTTAAAAGTTCTTAATATGAATGGGAAGTTAGAGGAACTAAAGAGGCCAATTGAGGTTTGAGTTATTTACTTTTTCATTAGCAATGGTCAATATGGTAGAAATAACTGACTTTGTTCACTATTAGTTCTGATAATGACATCATTTCTATGTTGTCAAATTTCGTTATAATCAACTACAAAATAATGCAGGTAATTGCTTTCAGTGATGAGGAAGGCGTGAGGTTTCAATCAACATTCTTAGGAAGTGCTGCTATTGCTGGTATTTTACCAGTTTCATCTTTGGAAATATCAGATAAAAGGTTTTGCTGCTCATAGTTAAACTCTCTGGATGCTTACCGCTCATTATTGTCTGCATATTAACGTGCAATGCTTGTATCAGTGGCATTACTATAAAAGATGTAATTAAGGAGAGTGGAGTACAGATAACAGAGGAGAACTTGTTGCAACTCAAGTATGACCGCAAATCTGTTTGGGGATATGTTGAGGTATGGCTCTGGGTCTCTTATGAAATTTATAGTCTGTTGCAATCTTAAATCTTGGTACCTTAATCAGTACATTTCAAACATTATGGGAACATCATCTTTTGCTTTGATCTTCAGGTTCATATTGAACAAGGTCCCGTACTTGAGTGGTCTGGTTTTCCTCTGGGGGTGGTTAGAGGCATAGCTGGGCAGACACGGCTAAAGGTAACATGAGTTAACTAATCAGGATGTTGCAGAGATCTACTTGAACAATAACTACTCTAGTAAAAGAATTGCGATCATGTTTATTCACATAACATAATGGCCATGTTTCTTGACTGCTAACTTGCTTGTATCATGATTGATAATCCCACAGAAGATTCTTATAACTTGTTTTAGTTGCTCTATACTTGCCATCCAAACCGTGGAAAATCTAATTAATGGAATAATCTTGAAGCTCAGTCATTTTGACTTCACACTACATTGATTCTAGACGTTGTTATCTTCCTAGAAATCTGAATTATTCTCCCACCCAATCCTAGAAGTTGATACATTATATTGTGGATAATTTACCTACTCCGTCCAAATTTATGAGGTAGCTGAGGTCGCTGATGTATAAAACTGAAAGGAAGGAAATCTAATTTTCTTGTTCTATAAAAAAAAATCTAAAATGGATTTACATTAAGGAGCAACTTTTGATAGAGCGATCCGTGCACAAAGATTGCAGGTTACAGTGAGAGGTTCTCAGGGGCATGCCGGAACGGTTCCAATGCCTATGCGCCAAGATCCCATGGCAGCTTCAGCTGAATTGATCGTACAATTGGAAAAACTCTGTAAGCAACCAGAGAGCTACTTATCATTTGATGGGCATTGCACTGATTCTACATTGAAATCACTTTCCACATCCCTTGTGTGTACGGTTGGAGAGATATCAACATGGCCAAGTGCAAGCAATGTCATTCCAGGCCAGGCAAGAATTTGATATAAGAGCCACTGCAGTGCAGACCCTACCTTTTTACCAGACAATTTTTAATGTTTGGAATGTTACAGGTGACCTTCACTGTAGATTTACGTACTATCGATGACATAGGACGAGAAGCTGTAATTTATGAATTCTCTAACCAGGTACATAATATATGCAGCAGCCGGTCAGTTTCGTGCAATATCGAACGTAAGGTGTGTATCTTCCATCATGTACATTGTCCGAATTTATCATTTGCTTCCATTTCACAATCTTAATCTGCTGTCTATTCATTTCAGCATGATGCAAATGCCATAATTAGCAATTCGGAGCTGAGCTCACAACTGAAATCTGCTGCTTCAACTGCACTCAAAAAGATGGTAGGCGAGATTCAGGAGGAAGTACCTGTATTAATGAGTGGAGCCGGGCACGATGCAATGGCAATGTCTCATTTGACAAAGGTTTGTTCTCTTTAACTTTTGTACCGCAACAATTTCAAGTTTTAGTTTTGCACCTGCATCAACATGGTTATATCAAATT
TGATAGGTTGGAATGTTGTTCGTCCGTTGTCGTGGAGGCGTGAGTCACTCCCCTGCCGAGCATGTACTGGATGACGACATTTGGGCTGCGGGTTTGGCTGTCTTGGAATTTCTAGAAAACCATCTGTAATATTTGTACCCGTTTTGTTCCTTGTACGCAAAAATCTAGTTAGTAACTTTAGTATAAGTGCTACATGTTAATTCAATTGTACAATCCATCTTCGTAGACTTGCCTATTCCTCCACTTCATCGAACCCAAAACTAATAAACTTTATGATGCTATATGTAAATTCAATTTTTACAACCCATTTTCGTGGGTTGTATTCAAACTCAGATTTTTTTTTTTTTTTGTTTAATCCCTGA

mRNA sequence

ATGGCCATTGCCTTTTCCAACTCCGATTCTCTTTCCGCTTTTCTCATCAAAGATCATCACTATCACCACCACCATCATTCTTCCTTCTCCGTTTTGTTCCCTTATCTGCTCTTCTTCCTTTTGTTTTCATCTCCTCCCACCGCTTACGCATTCACTGGAGATGTAGCCGAGGATTCAAAGAACAGAAGGGCTGATTTGTTTGTGCAAATTCTTAAGGACGAAGCGGTAGGAAGATTGAACGAACTAGGGAAGGTGAGTGATGCTGCTCGATATCTTGAGAGAACGTTCTTGAGTCCAGCTTCTATCAAAGCAAGCTTTCTTCTTCAAAAATGGATGGAGGATGCTGGATTAAGAACGTGGGTCGACTGCATGGGCAATCTACATGGTCGAACTGAGGGAAGGAATGCAAGTGCTGAAGCATTACTGATTGGTTCTCATTTGGATACCGTTGTTGATGCTGGAAAATTTGATGGTGCATTAGGCATCATTTCTGCTATCTCTGCTTTAAAAGTTCTTAATATGAATGGGAAGTTAGAGGAACTAAAGAGGCCAATTGAGGTAATTGCTTTCAGTGATGAGGAAGGCGTGAGGTTTCAATCAACATTCTTAGGAAGTGCTGCTATTGCTGGTATTTTACCAGTTTCATCTTTGGAAATATCAGATAAAAGTGGCATTACTATAAAAGATGTAATTAAGGAGAGTGGAGTACAGATAACAGAGGAGAACTTGTTGCAACTCAAGTATGACCGCAAATCTGTTTGGGGATATGTTGAGGTTCATATTGAACAAGGTCCCGTACTTGAGTGGTCTGGTTTTCCTCTGGGGGTGGTTAGAGGCATAGCTGGGCAGACACGGCTAAAGGTAACATGA

Coding sequence (CDS)

ATGGCCATTGCCTTTTCCAACTCCGATTCTCTTTCCGCTTTTCTCATCAAAGATCATCACTATCACCACCACCATCATTCTTCCTTCTCCGTTTTGTTCCCTTATCTGCTCTTCTTCCTTTTGTTTTCATCTCCTCCCACCGCTTACGCATTCACTGGAGATGTAGCCGAGGATTCAAAGAACAGAAGGGCTGATTTGTTTGTGCAAATTCTTAAGGACGAAGCGGTAGGAAGATTGAACGAACTAGGGAAGGTGAGTGATGCTGCTCGATATCTTGAGAGAACGTTCTTGAGTCCAGCTTCTATCAAAGCAAGCTTTCTTCTTCAAAAATGGATGGAGGATGCTGGATTAAGAACGTGGGTCGACTGCATGGGCAATCTACATGGTCGAACTGAGGGAAGGAATGCAAGTGCTGAAGCATTACTGATTGGTTCTCATTTGGATACCGTTGTTGATGCTGGAAAATTTGATGGTGCATTAGGCATCATTTCTGCTATCTCTGCTTTAAAAGTTCTTAATATGAATGGGAAGTTAGAGGAACTAAAGAGGCCAATTGAGGTAATTGCTTTCAGTGATGAGGAAGGCGTGAGGTTTCAATCAACATTCTTAGGAAGTGCTGCTATTGCTGGTATTTTACCAGTTTCATCTTTGGAAATATCAGATAAAAGTGGCATTACTATAAAAGATGTAATTAAGGAGAGTGGAGTACAGATAACAGAGGAGAACTTGTTGCAACTCAAGTATGACCGCAAATCTGTTTGGGGATATGTTGAGGTTCATATTGAACAAGGTCCCGTACTTGAGTGGTCTGGTTTTCCTCTGGGGGTGGTTAGAGGCATAGCTGGGCAGACACGGCTAAAGGTAACATGA
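As a quick consistency check, the start of the coding sequence can be translated and compared with the query protein shown in the BLAST alignments. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the record. Only the first 30 nucleotides of the CDS are used.

```python
# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a DNA string in frame 1; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, usable, 3))

# First 30 nt of the CDS listed above.
cds_start = "ATGGCCATTGCCTTTTCCAACTCCGATTCT"
protein_start = translate(cds_start)
print(protein_start)  # MAIAFSNSDS
```

The result, MAIAFSNSDS, agrees with the first ten residues of the query sequence in the alignments on this page.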
BLAST of CSPI01G10060 vs. Swiss-Prot
Match: AAH_ARATH (Allantoate deiminase OS=Arabidopsis thaliana GN=AAH PE=1 SV=2)

HSP 1 Score: 682.9 bits (1761), Expect = 2.6e-195
Identity = 343/501 (68.46%), Positives = 421/501 (84.03%), Query Frame = 1

Query: 19  HHYHHHHHSSFSVLFPYLLFFLL----FSSPPTAYAFTGDVAEDSKNR-----------R 78
           HH+HHH+H S  VLF  L+F LL     SS  ++ + + D +  S +            +
Sbjct: 26  HHHHHHNHPSL-VLFWCLVFSLLSPLALSSSSSSSSSSSDSSSSSSSHISLGIGETEGTK 85

Query: 79  ADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDC 138
            DL   IL+DEAV RL+ELG+VSDAA +LERTF+SPASI+A  L++ WMEDAGL TWVD 
Sbjct: 86  HDLHQAILRDEAVARLHELGQVSDAATHLERTFMSPASIRAIPLIRGWMEDAGLSTWVDY 145

Query: 139 MGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKR 198
           MGN+HGR E +N S++ALLIGSH+DTV+DAGK+DG+LGIISAISALKVL ++G+L ELKR
Sbjct: 146 MGNVHGRVEPKNGSSQALLIGSHMDTVIDAGKYDGSLGIISAISALKVLKIDGRLGELKR 205

Query: 199 PIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENL 258
           P+EVIAFSDEEGVRFQSTFLGSAA+AGI+PVS LE++DKSGI+++D +KE+ + IT+ENL
Sbjct: 206 PVEVIAFSDEEGVRFQSTFLGSAALAGIMPVSRLEVTDKSGISVQDALKENSIDITDENL 265

Query: 259 LQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMP 318
           +QLKYD  SVWGYVEVHIEQGPVLEW G+PLGVV+GIAGQTRLKVTV+GSQGHAGTVPM 
Sbjct: 266 MQLKYDPASVWGYVEVHIEQGPVLEWVGYPLGVVKGIAGQTRLKVTVKGSQGHAGTVPMS 325

Query: 319 MRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASN 378
           MRQDPM  +AELIV LE +CK P+ YLS +  C + T++SL+ SLVCTVGEISTWPSASN
Sbjct: 326 MRQDPMTGAAELIVLLESVCKNPKDYLSCNVQCNEDTVESLANSLVCTVGEISTWPSASN 385

Query: 379 VIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELS 438
           VIPGQVTFTVDLRTIDD+GR+A++++ S +++ IC  RS+ C+IERKHDA+A++S+ +LS
Sbjct: 386 VIPGQVTFTVDLRTIDDVGRKAILHDLSTRMYQICDKRSLLCSIERKHDADAVMSDPQLS 445

Query: 439 SQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEH 498
            QLKSAA +ALKKM GE+Q+EVPVLMSGAGHDAMAM+HLTKVGMLFVRCRGG+SHSPAEH
Sbjct: 446 LQLKSAAQSALKKMTGEVQDEVPVLMSGAGHDAMAMAHLTKVGMLFVRCRGGISHSPAEH 505

Query: 499 VLDDDIWAAGLAVLEFLENHL 505
           VLDDD+ AAGLA+LEFLE+ +
Sbjct: 506 VLDDDVGAAGLAILEFLESQM 525

BLAST of CSPI01G10060 vs. Swiss-Prot
Match: AAH_ORYSJ (Probable allantoate deiminase OS=Oryza sativa subsp. japonica GN=AAH PE=1 SV=1)

HSP 1 Score: 540.8 bits (1392), Expect = 1.6e-152
Identity = 273/441 (61.90%), Positives = 344/441 (78.00%), Query Frame = 1

Query: 66  LFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMG 125
           L+ +IL+DE V RL ELGK+SD   YLERTFLSPASI+AS ++  WM+DAGL TW+D MG
Sbjct: 40  LYREILRDETVLRLKELGKISDGEGYLERTFLSPASIRASAVIISWMKDAGLTTWIDQMG 99

Query: 126 NLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPI 185
           N+HGR E  N++ EALLIGSH+DTV+DAG +DGALGIISAISALKVL + G+L+ L RP+
Sbjct: 100 NIHGRFEPTNSTKEALLIGSHMDTVIDAGMYDGALGIISAISALKVLKVTGRLQRLTRPV 159

Query: 186 EVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENLLQ 245
           EVIAFSDEEGVRFQ+TFLGSAA+AG LP S L++SDKSG T++DV+K + ++ T   L +
Sbjct: 160 EVIAFSDEEGVRFQTTFLGSAAVAGTLPESILQVSDKSGTTVQDVLKLNSLEGTANALGE 219

Query: 246 LKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMR 305
           ++Y  +SV  YVEVHIEQGPVLE   +PLGVV+GIAGQTRLKV + GSQGHAGTVPM +R
Sbjct: 220 VRYSPESVGSYVEVHIEQGPVLEALRYPLGVVKGIAGQTRLKVIINGSQGHAGTVPMKLR 279

Query: 306 QDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVI 365
           +DPM A+AEL++ LE LCK+P  +L++D  C   T +SL+  LVCTVGE+ TWPSASNVI
Sbjct: 280 RDPMVAAAELVLTLETLCKEPNKFLTYDEECGCFTEESLA-GLVCTVGELLTWPSASNVI 339

Query: 366 PGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQ 425
           PGQV FTVD+R +DD  RE ++  FS  V   C  R V C +E+KH A A   ++EL+S+
Sbjct: 340 PGQVNFTVDIRAMDDKVRETIVTSFSRLVLQRCDDRLVDCAVEQKHAAAATPCDAELTSR 399

Query: 426 LKSAASTALKKMVGEIQE---EVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAE 485
           L+ A  + +  M   ++    E PVLMSGAGHDAMAM+ LTKVGMLFVRCRGGVSHSP E
Sbjct: 400 LERATRSTISSMAAGVRRAGGETPVLMSGAGHDAMAMARLTKVGMLFVRCRGGVSHSPEE 459

Query: 486 HVLDDDIWAAGLAVLEFLENH 504
            V+DDD+WAAGLA++ F++ +
Sbjct: 460 SVMDDDVWAAGLALVNFIDQN 479

BLAST of CSPI01G10060 vs. Swiss-Prot
Match: HYUC_PSESN (Hydantoin utilization protein C OS=Pseudomonas sp. (strain NS671) GN=hyuC PE=1 SV=1)

HSP 1 Score: 263.5 bits (672), Expect = 4.9e-69
Identity = 149/429 (34.73%), Positives = 236/429 (55.01%), Query Frame = 1

Query: 76  VGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRN 135
           + +L E+GK  D    ++R  LS    +A+ L+ +WM +AGL    D  GNL GR EG  
Sbjct: 15  IEQLGEIGKTKDKG--VQRLALSKEDREATLLVSEWMREAGLTVTHDHFGNLIGRKEGET 74

Query: 136 ASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEG 195
            S  +++IGSH+D+V + GKFDG +G+++ I  +  ++    + E    IEV+AF +EEG
Sbjct: 75  PSLPSVMIGSHIDSVRNGGKFDGVIGVLAGIEIVHAISEANVVHE--HSIEVVAFCEEEG 134

Query: 196 VRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWG 255
            RF     GS  + G +    L+  D + +T  + +K  G  I  +   Q   +   +  
Sbjct: 135 SRFNDGLFGSRGMVGKVKPEDLQKVDDNNVTRYEALKTFGFGIDPDFTHQSIREIGDIKH 194

Query: 256 YVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAEL 315
           Y E+HIEQGP LE + +P+G+V GIAG +  KV + G  GHAGTVPM +R+DP+  +AE+
Sbjct: 195 YFEMHIEQGPYLEKNNYPIGIVSGIAGPSWFKVRLVGEAGHAGTVPMSLRKDPLVGAAEV 254

Query: 316 IVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDL 375
           I ++E LC                 +   +   V TVG I+ +P  SN+IP  V FT+D+
Sbjct: 255 IKEVETLC-----------------MNDPNAPTVGTVGRIAAFPGGSNIIPESVEFTLDI 314

Query: 376 RTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALK 435
           R I+   R  +I +   ++  + ++R +   IE+   A  +  +  L + LK +      
Sbjct: 315 RDIELERRNKIIEKIEEKIKLVSNTRGLEYQIEKNMAAVPVKCSENLINSLKQSCK---- 374

Query: 436 KMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLA 495
               E++ + P+++SGAGHDAM ++ +T++GM+FVRCR G+SHSP E    DDI      
Sbjct: 375 ----ELEIDAPIIVSGAGHDAMFLAEITEIGMVFVRCRNGISHSPKEWAEIDDILTGTKV 414

Query: 496 VLEFLENHL 505
           + E +  H+
Sbjct: 435 LYESIIKHI 414

BLAST of CSPI01G10060 vs. Swiss-Prot
Match: AMAB2_GEOSE (N-carbamoyl-L-amino acid hydrolase OS=Geobacillus stearothermophilus GN=amaB PE=1 SV=1)

HSP 1 Score: 250.0 bits (637), Expect = 5.6e-65
Identity = 156/421 (37.05%), Positives = 238/421 (56.53%), Query Frame = 1

Query: 78  RLNELGKVS-DAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNA 137
           RL ELG+V    +  + R   +    +A  L+  +M +AGL  + D  GNL GR EG N 
Sbjct: 10  RLMELGEVGKQPSGGVTRLSFTAEERRAKDLVASYMREAGLFVYEDAAGNLIGRKEGTNP 69

Query: 138 SAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGV 197
            A  +L+GSHLD+V + G FDG LG+++ +  ++ +N +G +     PIEV+AF+DEEG 
Sbjct: 70  DATVVLVGSHLDSVYNGGCFDGPLGVLAGVEVVQTMNEHGVVTH--HPIEVVAFTDEEGA 129

Query: 198 RFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGY 257
           RF+   +GS A+AG LP  +LE  D  GI++ + +K++G+    + L Q      +V  Y
Sbjct: 130 RFRFGMIGSRAMAGTLPPEALECRDAEGISLAEAMKQAGLD--PDRLPQAARKPGTVKAY 189

Query: 258 VEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELI 317
           VE+HIEQG VLE +G P+G+V GIAG   +K T+ G   HAG  PM +R+DPMAA+A++I
Sbjct: 190 VELHIEQGRVLEETGLPVGIVTGIAGLIWVKFTIEGKAEHAGATPMSLRRDPMAAAAQII 249

Query: 318 VQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLR 377
           + +E+  ++                   + + V TVG++  +P   NVIP +V F +DLR
Sbjct: 250 IVIEEEARR-------------------TGTTVGTVGQLHVYPGGINVIPERVEFVLDLR 309

Query: 378 TIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKK 437
            +    R+ V    + +   I   R+V    ER  +   ++     S ++K AA  A +K
Sbjct: 310 DLKAEVRDQVWKAIAVRAETIAKERNVRVTTERLQEMPPVL----CSDEVKRAAEAACQK 369

Query: 438 MVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAV 497
           + G     +P   SGA HD++ ++ +  +GM+FVR + GVSHSPAE    +D  AAG  V
Sbjct: 370 L-GYPSFWLP---SGAAHDSVQLAPICPIGMIFVRSQDGVSHSPAEWSTKEDC-AAGAEV 398

BLAST of CSPI01G10060 vs. Swiss-Prot
Match: AMAB1_GEOSE (N-carbamoyl-L-amino acid hydrolase OS=Geobacillus stearothermophilus GN=amaB PE=1 SV=2)

HSP 1 Score: 245.4 bits (625), Expect = 1.4e-63
Identity = 155/421 (36.82%), Positives = 235/421 (55.82%), Query Frame = 1

Query: 78  RLNELGKVS-DAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNA 137
           RL ELG+V    +  + R   +    +A  L+  +M +AGL  + D  GNL GR EG N 
Sbjct: 10  RLMELGEVGKQPSGGVTRLSFTAEERRAKDLVASYMREAGLFVYEDAAGNLIGRKEGTNP 69

Query: 138 SAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGV 197
            A  +L+GSHLD+V + G FDG LG+++ +  ++ +N +G +     PIEV+AF+DEEG 
Sbjct: 70  DATVVLVGSHLDSVYNGGCFDGPLGVLAGVEVVQTMNEHGVVTH--HPIEVVAFTDEEGA 129

Query: 198 RFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGY 257
           RF+   +GS A+AG LP  +LE  D  GI++ + +K++G+    + L Q      +V  Y
Sbjct: 130 RFRFGMIGSRAMAGTLPPEALECRDAEGISLAEAMKQAGLD--PDRLPQAARKPGTVKAY 189

Query: 258 VEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELI 317
           VE+HIEQG VLE +G P+G+V GIAG   +K T+ G   HAG  PM +R+DPMAA+A++I
Sbjct: 190 VELHIEQGRVLEEAGLPVGIVTGIAGLIWVKFTIAGPAEHAGATPMSLRRDPMAAAAQII 249

Query: 318 VQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLR 377
           + +E+  ++                   + + V TVG++  +P   NVIP +V F +DLR
Sbjct: 250 IVIEEEARR-------------------TGTTVGTVGQLHVYPGGINVIPERVEFVLDLR 309

Query: 378 TIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKK 437
            +    R+ V    + +   I   R+V    ER  +   ++     S  +K AA  A K+
Sbjct: 310 DLKAEVRDQVWKAIAVRAETIAKERNVRLTTERLQEMAPVL----CSEVVKQAAERACKQ 369

Query: 438 MVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAV 497
           + G     +P   SGA HD + ++ +  +GM+FVR + GVSHSPAE    +D  A G  V
Sbjct: 370 L-GYPPFWLP---SGAAHDGVQLAPICPIGMIFVRSQDGVSHSPAEWSTKEDC-AVGAEV 398

BLAST of CSPI01G10060 vs. TrEMBL
Match: A0A0A0LWV2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G056970 PE=4 SV=1)

HSP 1 Score: 984.9 bits (2545), Expect = 3.5e-284
Identity = 504/504 (100.00%), Positives = 504/504 (100.00%), Query Frame = 1

Query: 1   MAIAFSNSDSLSAFLIKDHHYHHHHHSSFSVLFPYLLFFLLFSSPPTAYAFTGDVAEDSK 60
           MAIAFSNSDSLSAFLIKDHHYHHHHHSSFSVLFPYLLFFLLFSSPPTAYAFTGDVAEDSK
Sbjct: 1   MAIAFSNSDSLSAFLIKDHHYHHHHHSSFSVLFPYLLFFLLFSSPPTAYAFTGDVAEDSK 60

Query: 61  NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTW 120
           NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTW
Sbjct: 61  NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTW 120

Query: 121 VDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEE 180
           VDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEE
Sbjct: 121 VDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEE 180

Query: 181 LKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITE 240
           LKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITE
Sbjct: 181 LKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITE 240

Query: 241 ENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTV 300
           ENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTV
Sbjct: 241 ENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTV 300

Query: 301 PMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPS 360
           PMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPS
Sbjct: 301 PMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPS 360

Query: 361 ASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNS 420
           ASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNS
Sbjct: 361 ASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNS 420

Query: 421 ELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSP 480
           ELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSP
Sbjct: 421 ELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSP 480

Query: 481 AEHVLDDDIWAAGLAVLEFLENHL 505
           AEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 481 AEHVLDDDIWAAGLAVLEFLENHL 504

BLAST of CSPI01G10060 vs. TrEMBL
Match: A0A067JV84_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_27110 PE=4 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 9.8e-202
Identity = 353/474 (74.47%), Positives = 413/474 (87.13%), Query Frame = 1

Query: 31  VLFPYLLFFLLFSSPPTAYAFTGDVAEDSKNRRADLFVQILKDEAVGRLNELGKVSDAAR 90
           + F  ++ FLL +   +AY F+    ED  +RR DL+ +IL+DEAV RLN+LGKVSDA  
Sbjct: 6   IFFLVMVIFLLSTCYASAYTFSDIGNEDVDSRRNDLYREILRDEAVARLNDLGKVSDADG 65

Query: 91  YLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTV 150
           YLERTF+S AS+KA  L++ WMEDAGL TW+D MGN+HGR EG N SAEALLIGSHLDTV
Sbjct: 66  YLERTFMSAASVKAGNLIRSWMEDAGLMTWMDHMGNVHGRVEGSNPSAEALLIGSHLDTV 125

Query: 151 VDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAG 210
           VDAG FDG+LGIISA+SALKVL   G L +LKRP+EVIAFSDEEGVRFQSTFLGSAA+AG
Sbjct: 126 VDAGIFDGSLGIISALSALKVLKSKGMLSKLKRPVEVIAFSDEEGVRFQSTFLGSAAVAG 185

Query: 211 ILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWS 270
           ILPV++L+ISDKSG+T+++ +KE  + ITEE+LLQLKYD +SVWGY+EVHIEQGPVLEW+
Sbjct: 186 ILPVTALQISDKSGVTVQESLKEKSIGITEESLLQLKYDPRSVWGYIEVHIEQGPVLEWA 245

Query: 271 GFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYL 330
           GFPLGVV+GIAGQTRLKV V+GSQGHAGTVPM +RQDPMAA+AELIV LE LCK P+ +L
Sbjct: 246 GFPLGVVKGIAGQTRLKVMVKGSQGHAGTVPMSLRQDPMAAAAELIVLLESLCKHPKDFL 305

Query: 331 SFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEF 390
           S+DG+C DS ++SLS+SLVCTVGEISTWPSASNVIPGQVTFTVDLR +DD+GREAV+YE 
Sbjct: 306 SYDGYCNDSIVESLSSSLVCTVGEISTWPSASNVIPGQVTFTVDLRAMDDMGREAVLYEL 365

Query: 391 SNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEEVPVLMS 450
           SNQ+++IC  RSVSC IERKHDA A+I +SELS QLKSAA+ ALK+M GEIQ+EVPVLMS
Sbjct: 366 SNQIYHICDRRSVSCIIERKHDAKAVICDSELSLQLKSAANAALKRMTGEIQDEVPVLMS 425

Query: 451 GAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENHL 505
           GAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDD+WAAGLAV+ FLE  +
Sbjct: 426 GAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDVWAAGLAVMAFLETQM 479

BLAST of CSPI01G10060 vs. TrEMBL
Match: U5GU02_POPTR (Peptidase M20/M25/M40 family protein OS=Populus trichocarpa GN=POPTR_0001s16010g PE=4 SV=1)

HSP 1 Score: 707.6 bits (1825), Expect = 1.1e-200
Identity = 352/460 (76.52%), Positives = 404/460 (87.83%), Query Frame = 1

Query: 45  PPTAYAFTGDVAEDSKNRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKA 104
           P T+ AF     ED      DLF +IL+DEAV RLN+LGKVSDA  YLERTF+SPAS++A
Sbjct: 18  PTTSSAFMLSGRED------DLFAEILRDEAVSRLNQLGKVSDADGYLERTFMSPASVRA 77

Query: 105 SFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIIS 164
           + L++ WMEDAGL TWVD MGN+HGR EG NASAEALLIGSHLDTVVDAG FDG+LGIIS
Sbjct: 78  ANLIRAWMEDAGLTTWVDYMGNVHGRVEGLNASAEALLIGSHLDTVVDAGIFDGSLGIIS 137

Query: 165 AISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSG 224
           AISALKVL  NG L  L RP+EVIAFSDEEGVRFQSTFLGSAA+AGILPVS+L+ISDKSG
Sbjct: 138 AISALKVLKSNGTLTNLIRPVEVIAFSDEEGVRFQSTFLGSAAVAGILPVSALQISDKSG 197

Query: 225 ITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQT 284
           + ++D +KE+ + ITEE+L QLKYD +SVWGY+EVHIEQGPVLEW GFPLGVV+GIAGQT
Sbjct: 198 VNVQDALKENSIAITEESLFQLKYDPQSVWGYIEVHIEQGPVLEWVGFPLGVVKGIAGQT 257

Query: 285 RLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSL 344
           RLKVTVRGSQGHAGTVPM +RQDPMAASAELI+ LE LCK P+ +LS+DGHC DST++SL
Sbjct: 258 RLKVTVRGSQGHAGTVPMSLRQDPMAASAELIMLLESLCKNPKDFLSYDGHCNDSTVESL 317

Query: 345 STSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVS 404
           S SLVCTVGEISTWPSASNVIPGQVTFTVDLR +D++GREAV+YE SN+++ IC  RSVS
Sbjct: 318 SNSLVCTVGEISTWPSASNVIPGQVTFTVDLRAMDNMGREAVLYELSNRMYEICERRSVS 377

Query: 405 CNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTK 464
           C IERKHDANA+I +SEL+S+LK AA+ ALK++ GEIQ+EVPVLMSGAGHDAMAMSHLTK
Sbjct: 378 CIIERKHDANAVICDSELTSELKFAANAALKRITGEIQDEVPVLMSGAGHDAMAMSHLTK 437

Query: 465 VGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENHL 505
           VGMLFVRCRGGVSHSPAEHVLDDD+WAAGL++L FLE H+
Sbjct: 438 VGMLFVRCRGGVSHSPAEHVLDDDVWAAGLSILAFLETHM 471

BLAST of CSPI01G10060 vs. TrEMBL
Match: A0A0B2R419_GLYSO (Allantoate deiminase, chloroplastic OS=Glycine soja GN=glysoja_019851 PE=4 SV=1)

HSP 1 Score: 704.5 bits (1817), Expect = 9.2e-200
Identity = 354/466 (75.97%), Positives = 406/466 (87.12%), Query Frame = 1

Query: 37  LFFLLFSSPPTAYAFTGDVAEDSKNRRADLFVQILKDEAVGRLNELGKVSDAARYLERTF 96
           L F L S+P     F+G    D + +R DLF QIL+DEAV RL ELGKVSDA+ YLERTF
Sbjct: 17  LLFCLLSAPSCVSMFSGIETGDLE-KRDDLFPQILRDEAVARLYELGKVSDASGYLERTF 76

Query: 97  LSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKF 156
           LSPAS++A  L++KWMEDAGLRTWVD MGN+HGR +G NA+AEALLIGSH+DTVVDAG F
Sbjct: 77  LSPASMRAINLIRKWMEDAGLRTWVDQMGNVHGRVDGANANAEALLIGSHMDTVVDAGMF 136

Query: 157 DGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSS 216
           DG+LGI+SAISALK +++NGKL++LKRP+EVIAFSDEEGVRFQ+TFLGS AIAGILP ++
Sbjct: 137 DGSLGIVSAISALKAMHVNGKLQKLKRPVEVIAFSDEEGVRFQTTFLGSGAIAGILPGTT 196

Query: 217 LEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGV 276
           LEISDK  + IKD +KE+ + ITEE+LL+LKYD KSVWGYVEVHIEQGPVLE  GFPLGV
Sbjct: 197 LEISDKREVMIKDFLKENSIDITEESLLKLKYDPKSVWGYVEVHIEQGPVLEQVGFPLGV 256

Query: 277 VRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHC 336
           V+GIAGQTRLKVTVRGSQGHAGTVPM MRQDPMAA+AE IV LE LCK PE YLS+DGHC
Sbjct: 257 VKGIAGQTRLKVTVRGSQGHAGTVPMSMRQDPMAAAAEQIVVLESLCKHPEEYLSYDGHC 316

Query: 337 TDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHN 396
           +DST+KSLSTSLVCTVGEISTWPSASNVIPGQVT+TVD+R IDD+GREAVIY+ S Q++ 
Sbjct: 317 SDSTVKSLSTSLVCTVGEISTWPSASNVIPGQVTYTVDIRAIDDLGREAVIYDLSKQIYQ 376

Query: 397 ICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDA 456
           IC  RSVSC IE KHDA A+I +S+LSSQLKSAA +ALKKM G+IQ+EVP LMSGAGHDA
Sbjct: 377 ICDKRSVSCIIEHKHDAGAVICDSDLSSQLKSAAYSALKKMEGDIQDEVPTLMSGAGHDA 436

Query: 457 MAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLEN 503
           MA+SHLTKVGMLFVRCRGG+SHSP EHVLD+D+WAA LA L FLEN
Sbjct: 437 MAISHLTKVGMLFVRCRGGISHSPQEHVLDNDVWAASLATLSFLEN 481

BLAST of CSPI01G10060 vs. TrEMBL
Match: I1L153_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_09G050800 PE=4 SV=1)

HSP 1 Score: 704.5 bits (1817), Expect = 9.2e-200
Identity = 354/466 (75.97%), Positives = 406/466 (87.12%), Query Frame = 1

Query: 37  LFFLLFSSPPTAYAFTGDVAEDSKNRRADLFVQILKDEAVGRLNELGKVSDAARYLERTF 96
           L F L S+P     F+G    D + +R DLF QIL+DEAV RL ELGKVSDA+ YLERTF
Sbjct: 17  LLFCLLSAPSCVSMFSGIETGDLE-KRDDLFPQILRDEAVARLYELGKVSDASGYLERTF 76

Query: 97  LSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKF 156
           LSPAS++A  L++KWMEDAGLRTWVD MGN+HGR +G NA+AEALLIGSH+DTVVDAG F
Sbjct: 77  LSPASMRAINLIRKWMEDAGLRTWVDQMGNVHGRVDGANANAEALLIGSHMDTVVDAGMF 136

Query: 157 DGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSS 216
           DG+LGI+SAISALK +++NGKL++LKRP+EVIAFSDEEGVRFQ+TFLGS AIAGILP ++
Sbjct: 137 DGSLGIVSAISALKAMHVNGKLQKLKRPVEVIAFSDEEGVRFQTTFLGSGAIAGILPGTT 196

Query: 217 LEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGV 276
           LEISDK  + IKD +KE+ + ITEE+LL+LKYD KSVWGYVEVHIEQGPVLE  GFPLGV
Sbjct: 197 LEISDKREVMIKDFLKENSIDITEESLLKLKYDPKSVWGYVEVHIEQGPVLEQVGFPLGV 256

Query: 277 VRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHC 336
           V+GIAGQTRLKVTVRGSQGHAGTVPM MRQDPMAA+AE IV LE LCK PE YLS+DGHC
Sbjct: 257 VKGIAGQTRLKVTVRGSQGHAGTVPMSMRQDPMAAAAEQIVVLESLCKHPEEYLSYDGHC 316

Query: 337 TDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHN 396
           +DST+KSLSTSLVCTVGEISTWPSASNVIPGQVT+TVD+R IDD+GREAVIY+ S Q++ 
Sbjct: 317 SDSTVKSLSTSLVCTVGEISTWPSASNVIPGQVTYTVDIRAIDDLGREAVIYDLSKQIYQ 376

Query: 397 ICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDA 456
           IC  RSVSC IE KHDA A+I +S+LSSQLKSAA +ALKKM G+IQ+EVP LMSGAGHDA
Sbjct: 377 ICDKRSVSCIIEHKHDAGAVICDSDLSSQLKSAAYSALKKMEGDIQDEVPTLMSGAGHDA 436

Query: 457 MAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLEN 503
           MA+SHLTKVGMLFVRCRGG+SHSP EHVLD+D+WAA LA L FLEN
Sbjct: 437 MAISHLTKVGMLFVRCRGGISHSPQEHVLDNDVWAASLATLSFLEN 481

BLAST of CSPI01G10060 vs. TAIR10
Match: AT4G20070.1 (AT4G20070.1 allantoate amidohydrolase)

HSP 1 Score: 682.9 bits (1761), Expect = 1.5e-196
Identity = 343/501 (68.46%), Positives = 421/501 (84.03%), Query Frame = 1

Query: 19  HHYHHHHHSSFSVLFPYLLFFLL----FSSPPTAYAFTGDVAEDSKNR-----------R 78
           HH+HHH+H S  VLF  L+F LL     SS  ++ + + D +  S +            +
Sbjct: 26  HHHHHHNHPSL-VLFWCLVFSLLSPLALSSSSSSSSSSSDSSSSSSSHISLGIGETEGTK 85

Query: 79  ADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDC 138
            DL   IL+DEAV RL+ELG+VSDAA +LERTF+SPASI+A  L++ WMEDAGL TWVD 
Sbjct: 86  HDLHQAILRDEAVARLHELGQVSDAATHLERTFMSPASIRAIPLIRGWMEDAGLSTWVDY 145

Query: 139 MGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKR 198
           MGN+HGR E +N S++ALLIGSH+DTV+DAGK+DG+LGIISAISALKVL ++G+L ELKR
Sbjct: 146 MGNVHGRVEPKNGSSQALLIGSHMDTVIDAGKYDGSLGIISAISALKVLKIDGRLGELKR 205

Query: 199 PIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENL 258
           P+EVIAFSDEEGVRFQSTFLGSAA+AGI+PVS LE++DKSGI+++D +KE+ + IT+ENL
Sbjct: 206 PVEVIAFSDEEGVRFQSTFLGSAALAGIMPVSRLEVTDKSGISVQDALKENSIDITDENL 265

Query: 259 LQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMP 318
           +QLKYD  SVWGYVEVHIEQGPVLEW G+PLGVV+GIAGQTRLKVTV+GSQGHAGTVPM 
Sbjct: 266 MQLKYDPASVWGYVEVHIEQGPVLEWVGYPLGVVKGIAGQTRLKVTVKGSQGHAGTVPMS 325

Query: 319 MRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASN 378
           MRQDPM  +AELIV LE +CK P+ YLS +  C + T++SL+ SLVCTVGEISTWPSASN
Sbjct: 326 MRQDPMTGAAELIVLLESVCKNPKDYLSCNVQCNEDTVESLANSLVCTVGEISTWPSASN 385

Query: 379 VIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELS 438
           VIPGQVTFTVDLRTIDD+GR+A++++ S +++ IC  RS+ C+IERKHDA+A++S+ +LS
Sbjct: 386 VIPGQVTFTVDLRTIDDVGRKAILHDLSTRMYQICDKRSLLCSIERKHDADAVMSDPQLS 445

Query: 439 SQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEH 498
            QLKSAA +ALKKM GE+Q+EVPVLMSGAGHDAMAM+HLTKVGMLFVRCRGG+SHSPAEH
Sbjct: 446 LQLKSAAQSALKKMTGEVQDEVPVLMSGAGHDAMAMAHLTKVGMLFVRCRGGISHSPAEH 505

Query: 499 VLDDDIWAAGLAVLEFLENHL 505
           VLDDD+ AAGLA+LEFLE+ +
Sbjct: 506 VLDDDVGAAGLAILEFLESQM 525

BLAST of CSPI01G10060 vs. TAIR10
Match: AT5G43600.1 (AT5G43600.1 ureidoglycolate amidohydrolase)

HSP 1 Score: 187.6 bits (475), Expect = 1.9e-47
Identity = 134/430 (31.16%), Positives = 214/430 (49.77%), Query Frame = 1

Query: 78  RLNELGKVSDA-ARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNA 137
           +++EL   SDA +  + R   +   + A   ++  M  AGL    D +GN+ G+ +G   
Sbjct: 69  QIDELSSFSDAPSPSVTRVLYTDKDVSARRYVKNLMALAGLTVREDAVGNIFGKWDGLEP 128

Query: 138 SAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGV 197
           +  A+  GSH+D +  +GK+DG +G++ AI A+ VL  +G   + KR +E+I F+ EE  
Sbjct: 129 NLPAVATGSHIDAIPYSGKYDGVVGVLGAIEAINVLKRSGF--KPKRSLEIILFTSEEPT 188

Query: 198 RFQSTFLGSAAIAG---ILPVSSLEISDKSGITIKDVIKESG-VQITEENLLQLKYDRKS 257
           RF  + LGS  +AG   +       + D   ++  +  + +G  +  +++L  +   + S
Sbjct: 189 RFGISCLGSRLLAGSKELAEALKTTVVDGQNVSFIEAARSAGYAEDKDDDLSSVFLKKGS 248

Query: 258 VWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAAS 317
            + ++E+HIEQGP+LE  G  +GVV  IA    LKV   G+ GHAG V MP R D   A+
Sbjct: 249 YFAFLELHIEQGPILEDEGLDIGVVTAIAAPASLKVEFEGNGGHAGAVLMPYRNDAGLAA 308

Query: 318 AELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFT 377
           AEL + +EK                   L+S S   V TVG +   P A N IP +    
Sbjct: 309 AELALAVEK-----------------HVLESESIDTVGTVGILELHPGAINSIPSKSHLE 368

Query: 378 VDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAAST 437
           +D R ID+  R  VI +     + I   R V             +S  ++ +Q   A S 
Sbjct: 369 IDTRDIDEARRNTVIKKIQESANTIAKKRKVK------------LSEFKIVNQDPPALSD 428

Query: 438 AL--KKM---VGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDD 497
            L  KKM     E+     +++S A HD++ M+ ++ +GM+F+ C  G SH P E+   +
Sbjct: 429 KLVIKKMAEAATELNLSHKMMISRAYHDSLFMARISPMGMIFIPCYKGYSHKPEEYSSPE 466

BLAST of CSPI01G10060 vs. NCBI nr
Match: gi|449454780|ref|XP_004145132.1| (PREDICTED: allantoate deiminase isoform X1 [Cucumis sativus])

HSP 1 Score: 984.9 bits (2545), Expect = 5.1e-284
Identity = 504/504 (100.00%), Positives = 504/504 (100.00%), Query Frame = 1

Query: 1   MAIAFSNSDSLSAFLIKDHHYHHHHHSSFSVLFPYLLFFLLFSSPPTAYAFTGDVAEDSK 60
           MAIAFSNSDSLSAFLIKDHHYHHHHHSSFSVLFPYLLFFLLFSSPPTAYAFTGDVAEDSK
Sbjct: 1   MAIAFSNSDSLSAFLIKDHHYHHHHHSSFSVLFPYLLFFLLFSSPPTAYAFTGDVAEDSK 60

Query: 61  NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTW 120
           NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTW
Sbjct: 61  NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTW 120

Query: 121 VDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEE 180
           VDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEE
Sbjct: 121 VDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEE 180

Query: 181 LKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITE 240
           LKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITE
Sbjct: 181 LKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITE 240

Query: 241 ENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTV 300
           ENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTV
Sbjct: 241 ENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTV 300

Query: 301 PMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPS 360
           PMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPS
Sbjct: 301 PMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPS 360

Query: 361 ASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNS 420
           ASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNS
Sbjct: 361 ASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNS 420

Query: 421 ELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSP 480
           ELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSP
Sbjct: 421 ELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSP 480

Query: 481 AEHVLDDDIWAAGLAVLEFLENHL 505
           AEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 481 AEHVLDDDIWAAGLAVLEFLENHL 504

BLAST of CSPI01G10060 vs. NCBI nr
Match: gi|659068799|ref|XP_008446305.1| (PREDICTED: allantoate deiminase, partial [Cucumis melo])

HSP 1 Score: 899.8 bits (2324), Expect = 2.1e-258
Identity = 460/471 (97.66%), Positives = 464/471 (98.51%), Query Frame = 1

Query: 34  PYLLFFLLFSSPPTAYAFTGDVAEDSKNRRADLFVQILKDEAVGRLNELGKVSDAARYLE 93
           P    FLLFSSPPTAYAFTGDVAEDSKNRRADLFVQILKDEAVGRLNELGKVSDA RYLE
Sbjct: 68  PLSALFLLFSSPPTAYAFTGDVAEDSKNRRADLFVQILKDEAVGRLNELGKVSDADRYLE 127

Query: 94  RTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDA 153
           RTFLSPASIKA FLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDA
Sbjct: 128 RTFLSPASIKARFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDA 187

Query: 154 GKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILP 213
           GKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILP
Sbjct: 188 GKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILP 247

Query: 214 VSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFP 273
           VSSLEISDKSG+TIKDVI ESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFP
Sbjct: 248 VSSLEISDKSGMTIKDVITESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFP 307

Query: 274 LGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYLSFD 333
           LGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYLSFD
Sbjct: 308 LGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYLSFD 367

Query: 334 GHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQ 393
           GHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQ
Sbjct: 368 GHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQ 427

Query: 394 VHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEEVPVLMSGAG 453
           VHNICSSRSVSCNIERKHDANAIIS+SELSSQLKSAASTALKKMVGEIQEEVPVLMSGAG
Sbjct: 428 VHNICSSRSVSCNIERKHDANAIISDSELSSQLKSAASTALKKMVGEIQEEVPVLMSGAG 487

Query: 454 HDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENHL 505
           HDAMA+SHLTKVGMLFVRCRGG+SHSPAEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 488 HDAMAISHLTKVGMLFVRCRGGISHSPAEHVLDDDIWAAGLAVLEFLENHL 538

BLAST of CSPI01G10060 vs. NCBI nr
Match: gi|778658264|ref|XP_011652398.1| (PREDICTED: allantoate deiminase isoform X2 [Cucumis sativus])

HSP 1 Score: 822.0 bits (2122), Expect = 5.7e-235
Identity = 420/420 (100.00%), Positives = 420/420 (100.00%), Query Frame = 1

Query: 85  VSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIG 144
           VSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIG
Sbjct: 2   VSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIG 61

Query: 145 SHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLG 204
           SHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLG
Sbjct: 62  SHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLG 121

Query: 205 SAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQG 264
           SAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQG
Sbjct: 122 SAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQG 181

Query: 265 PVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCK 324
           PVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCK
Sbjct: 182 PVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCK 241

Query: 325 QPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGRE 384
           QPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGRE
Sbjct: 242 QPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGRE 301

Query: 385 AVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEE 444
           AVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEE
Sbjct: 302 AVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEE 361

Query: 445 VPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENHL 504
           VPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 362 VPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENHL 421

BLAST of CSPI01G10060 vs. NCBI nr
Match: gi|1012110344|ref|XP_015959808.1| (PREDICTED: allantoate deiminase [Arachis duranensis])

HSP 1 Score: 723.0 bits (1865), Expect = 3.6e-205
Identity = 363/490 (74.08%), Positives = 421/490 (85.92%), Query Frame = 1

Query: 19  HHYHHHHHSSFSVLFPYL----LFFLLFSSPPTAYA--FTGDVAEDSKNRRADLFVQILK 78
           HH+HHHHH   +  F +L     FF L S+PP A A  F+G+   D + ++ DLF QIL+
Sbjct: 30  HHHHHHHHQHHAFTFTFLHSCFFFFFLLSAPPPACASTFSGNETGDLE-KQGDLFPQILR 89

Query: 79  DEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTE 138
           DEAVGRL +LGKVSD   YLERTFLSPAS++A  ++++WMEDAGLRTWVD MGN+HGR E
Sbjct: 90  DEAVGRLYQLGKVSDGNGYLERTFLSPASMRAINVIREWMEDAGLRTWVDQMGNVHGRVE 149

Query: 139 GRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSD 198
           G N +AEALLIGSH+DTVVDAG FDG+LGI+SAISALK L +NGKLE+LKRP+EVIAF D
Sbjct: 150 GANPNAEALLIGSHMDTVVDAGMFDGSLGIVSAISALKALKVNGKLEKLKRPVEVIAFCD 209

Query: 199 EEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKS 258
           EEGVRFQ+TFLGS AIAGILP ++LEISDK    IKDV+KE+ ++ITEE+LLQLKYD KS
Sbjct: 210 EEGVRFQTTFLGSGAIAGILPATTLEISDKRDAMIKDVLKENSIEITEESLLQLKYDPKS 269

Query: 259 VWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAAS 318
           VWGYVE+HIEQGPVLE  GFPLGVV+GIAGQTRLKVTVRGSQGHAGTVPM MRQDPMAA+
Sbjct: 270 VWGYVEIHIEQGPVLEQKGFPLGVVKGIAGQTRLKVTVRGSQGHAGTVPMSMRQDPMAAA 329

Query: 319 AELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFT 378
           AELIV +E LCK PE +LS+D HC+DST+KSLS+SLVCTVGEISTWPSASNVIPGQVT+T
Sbjct: 330 AELIVVMENLCKNPEEFLSYDDHCSDSTVKSLSSSLVCTVGEISTWPSASNVIPGQVTYT 389

Query: 379 VDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAAST 438
           VD+R IDD+GREAVIY+ S +++ IC  RSVSCNIE KHDA A+I +SELSSQLKSAA +
Sbjct: 390 VDIRAIDDLGREAVIYDLSKRMYQICDKRSVSCNIEHKHDAGAVICDSELSSQLKSAAYS 449

Query: 439 ALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAA 498
           ALK+M G+IQ+EVP LMSGAGHDAMAMSHLTKVGMLFVRCRGG+SHSP EHVLD+D+WAA
Sbjct: 450 ALKRMEGDIQDEVPTLMSGAGHDAMAMSHLTKVGMLFVRCRGGISHSPQEHVLDNDVWAA 509

Query: 499 GLAVLEFLEN 503
           GLA L FLEN
Sbjct: 510 GLATLSFLEN 518

BLAST of CSPI01G10060 vs. NCBI nr
Match: gi|802753778|ref|XP_012088570.1| (PREDICTED: allantoate deiminase isoform X1 [Jatropha curcas])

HSP 1 Score: 711.1 bits (1834), Expect = 1.4e-201
Identity = 353/474 (74.47%), Positives = 413/474 (87.13%), Query Frame = 1

Query: 31  VLFPYLLFFLLFSSPPTAYAFTGDVAEDSKNRRADLFVQILKDEAVGRLNELGKVSDAAR 90
           + F  ++ FLL +   +AY F+    ED  +RR DL+ +IL+DEAV RLN+LGKVSDA  
Sbjct: 6   IFFLVMVIFLLSTCYASAYTFSDIGNEDVDSRRNDLYREILRDEAVARLNDLGKVSDADG 65

Query: 91  YLERTFLSPASIKASFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTV 150
           YLERTF+S AS+KA  L++ WMEDAGL TW+D MGN+HGR EG N SAEALLIGSHLDTV
Sbjct: 66  YLERTFMSAASVKAGNLIRSWMEDAGLMTWMDHMGNVHGRVEGSNPSAEALLIGSHLDTV 125

Query: 151 VDAGKFDGALGIISAISALKVLNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAG 210
           VDAG FDG+LGIISA+SALKVL   G L +LKRP+EVIAFSDEEGVRFQSTFLGSAA+AG
Sbjct: 126 VDAGIFDGSLGIISALSALKVLKSKGMLSKLKRPVEVIAFSDEEGVRFQSTFLGSAAVAG 185

Query: 211 ILPVSSLEISDKSGITIKDVIKESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWS 270
           ILPV++L+ISDKSG+T+++ +KE  + ITEE+LLQLKYD +SVWGY+EVHIEQGPVLEW+
Sbjct: 186 ILPVTALQISDKSGVTVQESLKEKSIGITEESLLQLKYDPRSVWGYIEVHIEQGPVLEWA 245

Query: 271 GFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYL 330
           GFPLGVV+GIAGQTRLKV V+GSQGHAGTVPM +RQDPMAA+AELIV LE LCK P+ +L
Sbjct: 246 GFPLGVVKGIAGQTRLKVMVKGSQGHAGTVPMSLRQDPMAAAAELIVLLESLCKHPKDFL 305

Query: 331 SFDGHCTDSTLKSLSTSLVCTVGEISTWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEF 390
           S+DG+C DS ++SLS+SLVCTVGEISTWPSASNVIPGQVTFTVDLR +DD+GREAV+YE 
Sbjct: 306 SYDGYCNDSIVESLSSSLVCTVGEISTWPSASNVIPGQVTFTVDLRAMDDMGREAVLYEL 365

Query: 391 SNQVHNICSSRSVSCNIERKHDANAIISNSELSSQLKSAASTALKKMVGEIQEEVPVLMS 450
           SNQ+++IC  RSVSC IERKHDA A+I +SELS QLKSAA+ ALK+M GEIQ+EVPVLMS
Sbjct: 366 SNQIYHICDRRSVSCIIERKHDAKAVICDSELSLQLKSAANAALKRMTGEIQDEVPVLMS 425

Query: 451 GAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENHL 505
           GAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDD+WAAGLAV+ FLE  +
Sbjct: 426 GAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDVWAAGLAVMAFLETQM 479
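
The percentages on each HSP summary line follow directly from the residue counts: for the Jatropha curcas hit above, 353 identical positions over an alignment length of 474 gives 74.47% identity. The following is a minimal sketch, not part of the database output, of how such a line could be parsed and the percentages re-derived. The function name `parse_hsp_summary` and the returned keys are hypothetical, and the pattern assumes the line layout shown in this record.

```python
import re

# Pattern for the HSP summary line as it appears in this record.
# "Posi?tives" also accepts the "Postives" spelling some report pages use.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Parse one HSP summary line (hypothetical helper, for illustration).

    Returns the raw counts, the percentages recomputed from those counts,
    and the percentages as printed in the report.
    """
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + repr(line))
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Recompute from the counts so the printed values can be cross-checked.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


hsp = parse_hsp_summary(
    "Identity = 353/474 (74.47%), Positives = 413/474 (87.13%), Query Frame = 1"
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Comparing the recomputed and reported fields is a quick sanity check when collecting many hits from pages like this one.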

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
AAH_ARATH | 2.6e-195 | 68.46 | Allantoate deiminase OS=Arabidopsis thaliana GN=AAH PE=1 SV=2
AAH_ORYSJ | 1.6e-152 | 61.90 | Probable allantoate deiminase OS=Oryza sativa subsp. japonica GN=AAH PE=1 SV=1
HYUC_PSESN | 4.9e-69 | 34.73 | Hydantoin utilization protein C OS=Pseudomonas sp. (strain NS671) GN=hyuC PE=1 S...
AMAB2_GEOSE | 5.6e-65 | 37.05 | N-carbamoyl-L-amino acid hydrolase OS=Geobacillus stearothermophilus GN=amaB PE=...
AMAB1_GEOSE | 1.4e-63 | 36.82 | N-carbamoyl-L-amino acid hydrolase OS=Geobacillus stearothermophilus GN=amaB PE=...

Match Name | E-value | Identity | Description
A0A0A0LWV2_CUCSA | 3.5e-284 | 100.00 | Uncharacterized protein OS=Cucumis sativus GN=Csa_1G056970 PE=4 SV=1
A0A067JV84_JATCU | 9.8e-202 | 74.47 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_27110 PE=4 SV=1
U5GU02_POPTR | 1.1e-200 | 76.52 | Peptidase M20/M25/M40 family protein OS=Populus trichocarpa GN=POPTR_0001s16010g...
A0A0B2R419_GLYSO | 9.2e-200 | 75.97 | Allantoate deiminase, chloroplastic OS=Glycine soja GN=glysoja_019851 PE=4 SV=1
I1L153_SOYBN | 9.2e-200 | 75.97 | Uncharacterized protein OS=Glycine max GN=GLYMA_09G050800 PE=4 SV=1

Match Name | E-value | Identity | Description
AT4G20070.1 | 1.5e-196 | 68.46 | allantoate amidohydrolase
AT5G43600.1 | 1.9e-47 | 31.16 | ureidoglycolate amidohydrolase

Match Name | E-value | Identity | Description
gi|449454780|ref|XP_004145132.1| | 5.1e-284 | 100.00 | PREDICTED: allantoate deiminase isoform X1 [Cucumis sativus]
gi|659068799|ref|XP_008446305.1| | 2.1e-258 | 97.66 | PREDICTED: allantoate deiminase, partial [Cucumis melo]
gi|778658264|ref|XP_011652398.1| | 5.7e-235 | 100.00 | PREDICTED: allantoate deiminase isoform X2 [Cucumis sativus]
gi|1012110344|ref|XP_015959808.1| | 3.6e-205 | 74.08 | PREDICTED: allantoate deiminase [Arachis duranensis]
gi|802753778|ref|XP_012088570.1| | 1.4e-201 | 74.47 | PREDICTED: allantoate deiminase isoform X1 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR002933 | Peptidase_M20
IPR010158 | Amidase_Cbmase
IPR011650 | Peptidase_M20_dimer

Vocabulary: Biological Process
Term | Definition
GO:0008152 | metabolic process

Vocabulary: Molecular Function
Term | Definition
GO:0016787 | hydrolase activity
GO:0016813 | hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amidines

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015996 chlorophyll catabolic process
biological_process GO:0006508 proteolysis
biological_process GO:0006145 purine nucleobase catabolic process
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0010136 ureide catabolic process
biological_process GO:0008152 metabolic process
biological_process GO:0044270 cellular nitrogen compound catabolic process
biological_process GO:0071704 organic substance metabolic process
biological_process GO:0006807 nitrogen compound metabolic process
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0005575 cellular_component
molecular_function GO:0047652 allantoate deiminase activity
molecular_function GO:0008237 metallopeptidase activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016813 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amidines

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI01G10060.5 | CSPI01G10060.5 | mRNA
CSPI01G10060.4 | CSPI01G10060.4 | mRNA
CSPI01G10060.3 | CSPI01G10060.3 | mRNA
CSPI01G10060.2 | CSPI01G10060.2 | mRNA
CSPI01G10060.1 | CSPI01G10060.1 | mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002933 | Peptidase M20 | PFAM | PF01546 | Peptidase_M20 | coord: 142..277, score: 2.
None | No IPR available | GENE3D | G3DSA:3.40.630.10 | | coord: 69..289, score: 4.5
None | No IPR available | PANTHER | PTHR32494 | FAMILY NOT NAMED | coord: 26..289, score: 6.5E
None | No IPR available | PANTHER | PTHR32494:SF7 | ALLANTOATE DEIMINASE | coord: 26..289, score: 6.5E
None | No IPR available | unknown | SSF53187 | Zn-dependent exopeptidases | coord: 67..288, score: 9.88