HG10021011 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10021011
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Allantoate deiminase isoform X1
Location: Chr05: 4461651 .. 4465452 (-)
RNA-Seq Expression: HG10021011
Synteny: HG10021011
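The location field above packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it and derive the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text: str):
    """Split a 'Chr: start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1. The page itself does not say this.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    strand = m.group(4)
    return chrom, start, end, strand, end - start + 1

chrom, start, end, strand, length = parse_location("Chr05: 4461651 .. 4465452 (-)")
print(chrom, start, end, strand, length)
```

Under that assumption the gene spans 3802 bp on the minus strand of Chr05.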
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCATTGCCTATGCCAACACCGATTGTATTACCGCTTTTCTCATCAAGGATCATCAACATCATCACCACCACCACCATTCTTCCTTCTCCATCTTGTTTTCCTATCTACTCTTTATCCTTTTGTTTTCATCTACTCCCACTGCTTACTCATTTACTGGTATGTTCCTCATCTTCATCTTTTGTTCCATCTTCATATGCTATTTTTCTCTACAGCCTCGGTTGGAAAATGGATATTCCTTTGCCAAAATACAAGAAACTGAAACGAAATGAATACGAACCACTCGTTTATGCTTTAACGGATAACGTACATGATTAAATTCACTCACCGTCAATGACCTCTGGAAAATCGTTTGTTCTAATTTCTGCTCATACTTAGTTTTTTCTCGGAGGATTTGACAGATCTGTTTCTTCTCCATCGAGTTATCTATCGTGGTATAATCATAAATATTGTATGTTGCTCATTAAAGTCAGGTTTTCTATTCTTTTTCTTTGTAAGTCGAGAATAGACTGTTTGGTTGGGTACTTTTATCAGCTTTTATCAAACGAAATTATTTTTCATGTTAGTGAACGATTGTATGCTGAACATTGAAAATATTATTCAATTTTTTTCAGGAGATGTAGCCGAAGATTCAGAGAATAGAAGGGCTGATCTGTTTGTTCAAATTCTTAAGGACGAAGCAGTAGGAAGATTGAATGAACTAGGGAAGGTAGCGGACCACGATTCCACTTTTTCCTCTCTCTGTTAGCTATGATATGCGCGCCCTTGCAAATGTTTGGTACTTGTTAGATTCAGTGAATAATTGGATGATTTGCATATGATATGCGCGCCCTTGCAAATGTTTTTTTCTAGTTTCTTGTATGTGATTCTGCTGTAGTTTTCTATTCATAATATAAAAATGTCTAGTGTTGACCATGTAAATTGATTAGGTTGACTGTTTGGCTCGTTTGAAATATTCCCTTGGTTTGGTTATTAAAAAGTTCCGGCAATTTTATATAATTTTATTATTCTTTCTTAGAGGCCTTATTGTAATTTTTGATTGAAAGATTTAGGGAATTCCTGATTCTTTCCTCTCTGAAGTCTTTACCTTGGTTACTTCCATTTCCAATTATTAAACACATAATTCTATGCTGTACTGCATATATTATAGCTTAGTGCATGAAAAGCTGTCCTCCCCTTCCCCCAGATAAACCTGATGAATACCAGAGGTTGAGTTGTAATCCAACTTGAAACTGTTCCATGGTGAGTTGTCTGTGTTGTGTTAAATATTCCCAGTTGGTAACTTGAGTTGTCTTACAATTGTTCCCAATTACTAGGTGAGTGATGCTGCTCGATATCTTGAGAGAACGTTCTTGAGTCCAGCTTCTATCAAAGCAAGATTTCTTCTTCAAAAATGGATGGAGGATGCTGGATTAAGAACGTTAGATTTCTACTTCTGCTCTTCTTGAAGAAGTCTTAATTGTTAATTGACTCCTCTTGAACATTTTTCAGGTGGGTTGACTGCATGGGCAATCTACACGGTCGAACTGAGGGAAGGAATGCAAGTGCTGAAGCATTATTGATTGGTTCTCACTTGGTAATTATTTGACTTCTCCTAAGCAGAGACAAGAATTTATGTTACATATCAAGCTAAGAATTAAATTTTCTTCCTTTCCTCGCAGGATACTGTTGTTGATGCTGGAAAATTTGATGGCGCATTAGGCATCATATCTGCTATCTCTGCTTTGAAAGTTTTTAATATGAATGGGAAGTTAGAAGAACTAAAGAGGCCAATTGAGGTTTGAGTTACTTTCTCATTTGCAATATTTGGAAGAAATAAGTGACTTGTTCATTATTAGTCCTGATAATGACAATATTTTCTACCTTGTCACATTTAGTTATAATCAACTACAAAATTATGCAGGTGATTGCTTTCAGTGATGAGGAGGGCGTGAGGTTTCAATCAACCTTCTTAGGAAGTGCTGCTATTGCTGGTATTTTACCAGTTTCGTCTTTGGAA
ATATCAGATAAAAGGTTTTGCTGCGTAACTCCAAACTCTCTGGATGCTTATCACTCATTATTACCTGTATATTAATTCACAATCCTTTTATCAGTGGCATGACTATAAAAGATGTAATTACGGAGAGTGGAGTACAGATAACAGAGGAAAACTTGTTGCAACTCAAGTATGACCGCAAGTCTGTCTGGGGATATGTGGAGGTATGGCTCTGGGTGCTCTTCTGGAATTTCTAGTGTTTGTTCTACACTTAAATCTTATTTTTAGAACGTATGTACGTTAATCAGTACTTTTCGAACATTATTCTTTGCTTTGATGTTCAGGTTCATATTGAACAAGGCCCTGTACTTGAGTGGTCTGGTTTTCCTCTGGGAGTGGTTAGAGGCATAGCTGGGCAGACACGGCTAAAGGTACATGAGTTAAAAAACCTGTAGGTCACGGAGAAATATACTTGCACAATAACTAATTTAGTCAAAGAATCATGAAAATTCTTATAACTTATTAAGTCATTCTATACTACTTACCATCCAAACCGAAGAAAATCTAATTAATGGAATAATTTTGAAGCTTGGTCATTTTGACTTCACACTACAATCGATTCTAGAAATGTTTAGGTCGAATCAACCCAAGGTCCACTATTGGACGTATTTATCTTCCTGAAAAACTGTATTATTCCCCTACCTAATACTAAAAGTTGATACATTATATTGTGATTAATTTACCTACTTCTCCTCCCAAATTTATAATTTAGCTGGGGTCTCTGATGTATAAAACTGAAAGGAAGGAAATCTAATTTTCTTGATTCATAAAAAAAACTAAAATGGATTTATTCAAGGATGTGAGTTTCGTTTGATAGAGCGATCCGTGCACAAAGATTGCAGGTTACAGTGAGAGGTTCTCAGGGGCATGCAGGAACGGTTCCAATGCCTATGCGCCAAGATCCCATGGCAGCTTCAGCCGAATTGATTGTACAATTGGAAAAACTCTGTAAGCAACCAGAGAGCTACTTATCTTTTGATGGGCATTGCACTGATACTACCTTGAAATCACTTTCCACATCCCTTGTCTGTACGGTTGGAGAGATATCGACATGGCCCAGTGCAAGCAATGTCATTCCAGGCCAGGCAAGAATTTGATATAGAGGCACTGCAGACCGTCATACCTTTTTACCAGACAATTTTTAATGTTTGGCATGTTACAGGTGACCTTCACTGTAGATTTACGTACTATCGATGACATAGGACGAGAAGCTGTAATTTATGAATTCTCTAATCAGGTACATAAAATTTGCAGCAGCCGGTCAGTTTCGTGCAATATTGAACGTAAGGTGTGTATCTTCCATCAAGTCCATTGTCCACATTTATATCATTTGCTTCCCTTCCACAACCTTAATCTGCTATGCCTGTTCCTTTCAGCATGATGCAAATGCCATAATCAGCGATTCGAAGCTGAGCTCGCAGCTGAAATCTGCTGCTTCCACTGCACTCAAAAAAATGGTAGGCGAGCTTCAGGAGGAAGTTCCTGTATTAATGAGCGGAGCAGGGCATGATGCGATGGCAATGTCTCATTTGACGAAGGTTTGTTCTCAACTTATATACCGTACCTCTTAAAGTTCTAGTTTTGCACCTGCATCGACACGGTTATACATGTTATACAATGATGTTAATGAGAGAATTTGATAGGTGGGAATGTTGTTTGTCCGCTGTCGTGGAGGCGTAAGTCACTCTCCTGCCGAGCATGTATTGGACGACGACATTTGGGCTGCGGGTTTGGCCGTCTTGGAATTCTTAGAAAACCATCTCTAG

mRNA sequence

ATGGCCATTGCCTATGCCAACACCGATTGTATTACCGCTTTTCTCATCAAGGATCATCAACATCATCACCACCACCACCATTCTTCCTTCTCCATCTTGTTTTCCTATCTACTCTTTATCCTTTTGTTTTCATCTACTCCCACTGCTTACTCATTTACTGGAGATGTAGCCGAAGATTCAGAGAATAGAAGGGCTGATCTGTTTGTTCAAATTCTTAAGGACGAAGCAGTAGGAAGATTGAATGAACTAGGGAAGGTGAGTGATGCTGCTCGATATCTTGAGAGAACGTTCTTGAGTCCAGCTTCTATCAAAGCAAGATTTCTTCTTCAAAAATGGATGGAGGATGCTGGATTAAGAACGTGGGTTGACTGCATGGGCAATCTACACGGTCGAACTGAGGGAAGGAATGCAAGTGCTGAAGCATTATTGATTGGTTCTCACTTGGATACTGTTGTTGATGCTGGAAAATTTGATGGCGCATTAGGCATCATATCTGCTATCTCTGCTTTGAAAGTTTTTAATATGAATGGGAAGTTAGAAGAACTAAAGAGGCCAATTGAGGTGATTGCTTTCAGTGATGAGGAGGGCGTGAGGTTTCAATCAACCTTCTTAGGAAGTGCTGCTATTGCTGGTATTTTACCAGTTTCGTCTTTGGAAATATCAGATAAAAGTGGCATGACTATAAAAGATGTAATTACGGAGAGTGGAGTACAGATAACAGAGGAAAACTTGTTGCAACTCAAGTATGACCGCAAGTCTGTCTGGGGATATGTGGAGGTTCATATTGAACAAGGCCCTGTACTTGAGTGGTCTGGTTTTCCTCTGGGAGTGGTTAGAGGCATAGCTGGGCAGACACGGCTAAAGGTTACAGTGAGAGGTTCTCAGGGGCATGCAGGAACGGTTCCAATGCCTATGCGCCAAGATCCCATGGCAGCTTCAGCCGAATTGATTGTACAATTGGAAAAACTCTGTAAGCAACCAGAGAGCTACTTATCTTTTGATGGGCATTGCACTGATACTACCTTGAAATCACTTTCCACATCCCTTGTCTGTACGGTTGGAGAGATATCGACATGGCCCAGTGCAAGCAATGTGACCTTCACTGTAGATTTACGTACTATCGATGACATAGGACGAGAAGCTGTAATTTATGAATTCTCTAATCAGCATGATGCAAATGCCATAATCAGCGATTCGAAGCTGAGCTCGCAGCTGAAATCTGCTGCTTCCACTGCACTCAAAAAAATGGTAGGCGAGCTTCAGGAGGAAGTTCCTGTATTAATGAGCGGAGCAGGGCATGATGCGATGGCAATGTCTCATTTGACGAAGGTGGGAATGTTGTTTGTCCGCTGTCGTGGAGGCGTAAGTCACTCTCCTGCCGAGCATGTATTGGACGACGACATTTGGGCTGCGGGTTTGGCCGTCTTGGAATTCTTAGAAAACCATCTCTAG

Coding sequence (CDS)

ATGGCCATTGCCTATGCCAACACCGATTGTATTACCGCTTTTCTCATCAAGGATCATCAACATCATCACCACCACCACCATTCTTCCTTCTCCATCTTGTTTTCCTATCTACTCTTTATCCTTTTGTTTTCATCTACTCCCACTGCTTACTCATTTACTGGAGATGTAGCCGAAGATTCAGAGAATAGAAGGGCTGATCTGTTTGTTCAAATTCTTAAGGACGAAGCAGTAGGAAGATTGAATGAACTAGGGAAGGTGAGTGATGCTGCTCGATATCTTGAGAGAACGTTCTTGAGTCCAGCTTCTATCAAAGCAAGATTTCTTCTTCAAAAATGGATGGAGGATGCTGGATTAAGAACGTGGGTTGACTGCATGGGCAATCTACACGGTCGAACTGAGGGAAGGAATGCAAGTGCTGAAGCATTATTGATTGGTTCTCACTTGGATACTGTTGTTGATGCTGGAAAATTTGATGGCGCATTAGGCATCATATCTGCTATCTCTGCTTTGAAAGTTTTTAATATGAATGGGAAGTTAGAAGAACTAAAGAGGCCAATTGAGGTGATTGCTTTCAGTGATGAGGAGGGCGTGAGGTTTCAATCAACCTTCTTAGGAAGTGCTGCTATTGCTGGTATTTTACCAGTTTCGTCTTTGGAAATATCAGATAAAAGTGGCATGACTATAAAAGATGTAATTACGGAGAGTGGAGTACAGATAACAGAGGAAAACTTGTTGCAACTCAAGTATGACCGCAAGTCTGTCTGGGGATATGTGGAGGTTCATATTGAACAAGGCCCTGTACTTGAGTGGTCTGGTTTTCCTCTGGGAGTGGTTAGAGGCATAGCTGGGCAGACACGGCTAAAGGTTACAGTGAGAGGTTCTCAGGGGCATGCAGGAACGGTTCCAATGCCTATGCGCCAAGATCCCATGGCAGCTTCAGCCGAATTGATTGTACAATTGGAAAAACTCTGTAAGCAACCAGAGAGCTACTTATCTTTTGATGGGCATTGCACTGATACTACCTTGAAATCACTTTCCACATCCCTTGTCTGTACGGTTGGAGAGATATCGACATGGCCCAGTGCAAGCAATGTGACCTTCACTGTAGATTTACGTACTATCGATGACATAGGACGAGAAGCTGTAATTTATGAATTCTCTAATCAGCATGATGCAAATGCCATAATCAGCGATTCGAAGCTGAGCTCGCAGCTGAAATCTGCTGCTTCCACTGCACTCAAAAAAATGGTAGGCGAGCTTCAGGAGGAAGTTCCTGTATTAATGAGCGGAGCAGGGCATGATGCGATGGCAATGTCTCATTTGACGAAGGTGGGAATGTTGTTTGTCCGCTGTCGTGGAGGCGTAAGTCACTCTCCTGCCGAGCATGTATTGGACGACGACATTTGGGCTGCGGGTTTGGCCGTCTTGGAATTCTTAGAAAACCATCTCTAG

Protein sequence

MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDSENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWPSASNVTFTVDLRTIDDIGREAVIYEFSNQHDANAIISDSKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENHL
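The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code (the page does not name the translation table used), the first and last codons of the CDS can be translated as follows. The helper `translate` is illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First ten codons and last three codons of the CDS listed above.
cds_start = "ATGGCCATTGCCTATGCCAACACCGATTGT"
cds_end = "CATCTCTAG"
print(translate(cds_start))  # MAIAYANTDC
print(translate(cds_end))    # HL*
```

The output agrees with the start (MAIAYANTDC) and end (HL) of the protein sequence, with the final TAG codon acting as the stop.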
Homology
BLAST of HG10021011 vs. NCBI nr
Match: XP_038876942.1 (allantoate deiminase 2 isoform X1 [Benincasa hispida])

HSP 1 Score: 902.9 bits (2332), Expect = 1.2e-258
Identity = 468/509 (91.94%), Positives = 480/509 (94.30%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQH----HHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDV 60
           MAIAYANTDCI AFLIKDHQH    HHHHHHSSFSILFSYLLF LLFSS PTAY+FTGDV
Sbjct: 1   MAIAYANTDCIAAFLIKDHQHHHHNHHHHHHSSFSILFSYLLFFLLFSSPPTAYAFTGDV 60

Query: 61  AEDSENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDA 120
           AEDS+NRRADLFVQILKDEAVG+LNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDA
Sbjct: 61  AEDSKNRRADLFVQILKDEAVGKLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDA 120

Query: 121 GLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMN 180
           GLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMN
Sbjct: 121 GLRTWVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMN 180

Query: 181 GKLEELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESG 240
           GKLE+LKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESG
Sbjct: 181 GKLEQLKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESG 240

Query: 241 VQITEENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQG 300
           VQITEE+LLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQG
Sbjct: 241 VQITEESLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQG 300

Query: 301 HAGTVPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEI 360
           HAGTVPMPMRQDPMAA+AELIVQLEKLCKQPESYLSFDGHCTD+T+KSLSTSLVCTVGEI
Sbjct: 301 HAGTVPMPMRQDPMAATAELIVQLEKLCKQPESYLSFDGHCTDSTMKSLSTSLVCTVGEI 360

Query: 361 STWPSASN-----VTFTVDLRTIDDIGREAVIYEFSN-----------------QHDANA 420
           STWPSASN     VTFTVDLRTIDDIGREAVIYEFSN                 +HDANA
Sbjct: 361 STWPSASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNRVHQICSSRSVSCNIERKHDANA 420

Query: 421 IISDSKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGG 480
           IISDS+LSSQLKSAASTALKKMVGE+QEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGG
Sbjct: 421 IISDSELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGG 480

Query: 481 VSHSPAEHVLDDDIWAAGLAVLEFLENHL 484
           VSHSPAEHVLDDDIWAAGLAV+EFLENHL
Sbjct: 481 VSHSPAEHVLDDDIWAAGLAVMEFLENHL 509
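The percentages in each HSP header are the match counts divided by the alignment length, gaps included. The sketch below recomputes them from the header of the hit above (468/509 and 480/509). The function name `parse_hsp_fractions` is hypothetical, and the regular expression only targets the "label = a/b" fields as they appear in this report.

```python
import re

def parse_hsp_fractions(header: str) -> dict:
    """Return {label: (matches, aligned_length, percent)} for each 'label = a/b' field."""
    fields = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        matches, length = int(num), int(den)
        fields[label] = (matches, length, round(100 * matches / length, 2))
    return fields

header = "Identity = 468/509 (91.94%), Positives = 480/509 (94.30%), Query Frame = 0"
stats = parse_hsp_fractions(header)
for label, (matches, length, pct) in stats.items():
    print(f"{label}: {matches}/{length} = {pct:.2f}%")
```

The denominator (509) is the alignment length rather than the query length, which is why it differs between hits with different gap counts.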

BLAST of HG10021011 vs. NCBI nr
Match: XP_004145132.1 (allantoate deiminase 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 886.7 bits (2290), Expect = 8.9e-254
Identity = 461/505 (91.29%), Positives = 475/505 (94.06%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           MAIA++N+D ++AFLIKDH H+HHHHHSSFS+LF YLLF LLFSS PTAY+FTGDVAEDS
Sbjct: 1   MAIAFSNSDSLSAFLIKDH-HYHHHHHSSFSVLFPYLLFFLLFSSPPTAYAFTGDVAEDS 60

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
           +NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKA FLLQKWMEDAGLRT
Sbjct: 61  KNRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRT 120

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKV NMNGKLE
Sbjct: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLE 180

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSG+TIKDVI ESGVQIT
Sbjct: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQIT 240

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT
Sbjct: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTD+TLKSLSTSLVCTVGEISTWP
Sbjct: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWP 360

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDDIGREAVIYEFSNQ                 HDANAIIS+
Sbjct: 361 SASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISN 420

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAASTALKKMVGE+QEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS
Sbjct: 421 SELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           PAEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 481 PAEHVLDDDIWAAGLAVLEFLENHL 504

BLAST of HG10021011 vs. NCBI nr
Match: XP_023542649.1 (allantoate deiminase 2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 874.0 bits (2257), Expect = 5.9e-250
Identity = 451/505 (89.31%), Positives = 470/505 (93.07%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           MAIAYANTD + AFLIKDH HHHHHHHSS S  FSYLLF LLFSS PTAY+FTGDVAEDS
Sbjct: 1   MAIAYANTDSMAAFLIKDH-HHHHHHHSSLSFFFSYLLFFLLFSSPPTAYAFTGDVAEDS 60

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
           +NRR DLFV+ILKDEAVGRLNELGKVSDA RYLERTFLSPASIKARFLL+KWMED+GLRT
Sbjct: 61  KNRRDDLFVEILKDEAVGRLNELGKVSDADRYLERTFLSPASIKARFLLKKWMEDSGLRT 120

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGR EGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVF MNGKLE
Sbjct: 121 WVDCMGNLHGRAEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFYMNGKLE 180

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGM+IKDVITESGV+IT
Sbjct: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMSIKDVITESGVEIT 240

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEW+GFPLGVVRGIAGQ+RLKVTVRGSQGHAGT
Sbjct: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWTGFPLGVVRGIAGQSRLKVTVRGSQGHAGT 300

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMP+RQDPMAASAELIVQLEKLCKQPESYLSFDGHC+D TLKSLSTSLVCTVGEISTWP
Sbjct: 301 VPMPLRQDPMAASAELIVQLEKLCKQPESYLSFDGHCSDFTLKSLSTSLVCTVGEISTWP 360

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDDIGREAVIYEFSNQ                 HDANA+ISD
Sbjct: 361 SASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHKICSSRSVSCNIERKHDANAVISD 420

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAASTALKKMVG++QEEVP+LMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS
Sbjct: 421 SELSSQLKSAASTALKKMVGDIQEEVPILMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           PAEHVLDDD+WAAGLAV+EFLENHL
Sbjct: 481 PAEHVLDDDVWAAGLAVMEFLENHL 504

BLAST of HG10021011 vs. NCBI nr
Match: KAG7012345.1 (Allantoate deiminase 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 870.2 bits (2247), Expect = 8.6e-249
Identity = 451/505 (89.31%), Positives = 469/505 (92.87%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           MAIAYANTD + AFLIKDH HHHHH HSS S  FSYLLF LLFSS PTAY+FTGDVAEDS
Sbjct: 1   MAIAYANTDSMAAFLIKDH-HHHHHQHSSLSFFFSYLLFFLLFSSPPTAYAFTGDVAEDS 60

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
           +NRR DLFV+ILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLL+KWMED+GLRT
Sbjct: 61  KNRRDDLFVEILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLKKWMEDSGLRT 120

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGR EGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVF MNGKLE
Sbjct: 121 WVDCMGNLHGRAEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFYMNGKLE 180

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGM+IKDVITESGV+IT
Sbjct: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMSIKDVITESGVEIT 240

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEW+GFPLGVVRGIAGQ+RLKVTVRGSQGHAGT
Sbjct: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWTGFPLGVVRGIAGQSRLKVTVRGSQGHAGT 300

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMP+RQDPMAASAELIVQLEKLCKQPESYLSFDGHC+D TLKSLSTSLVCTVGEISTWP
Sbjct: 301 VPMPLRQDPMAASAELIVQLEKLCKQPESYLSFDGHCSDFTLKSLSTSLVCTVGEISTWP 360

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDDIGREAVIYEFSNQ                 HDANA+ISD
Sbjct: 361 SASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHKICSSRSVSCNIERKHDANAVISD 420

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAAS ALKKMVG++QEEVP+LMSGAGHDAMAMS+LTKVGMLFVRCRGGVSHS
Sbjct: 421 SELSSQLKSAASAALKKMVGDIQEEVPILMSGAGHDAMAMSYLTKVGMLFVRCRGGVSHS 480

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           PAEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 481 PAEHVLDDDIWAAGLAVLEFLENHL 504

BLAST of HG10021011 vs. NCBI nr
Match: XP_022954835.1 (allantoate deiminase 2-like [Cucurbita moschata])

HSP 1 Score: 868.2 bits (2242), Expect = 3.3e-248
Identity = 450/505 (89.11%), Positives = 469/505 (92.87%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           MAIAYANTD + AFLIKDH HHHHH HSS S  FSYLLF LLFSS PTAY+FTGDVAEDS
Sbjct: 1   MAIAYANTDSMAAFLIKDH-HHHHHQHSSLSFFFSYLLFFLLFSSPPTAYAFTGDVAEDS 60

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
           +NRR DLFV+ILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLL+KWMED+GLRT
Sbjct: 61  KNRRDDLFVEILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLKKWMEDSGLRT 120

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGR EGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVF M+GKLE
Sbjct: 121 WVDCMGNLHGRAEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFYMSGKLE 180

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGM+IKDVITESGV+IT
Sbjct: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMSIKDVITESGVEIT 240

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEW+GFPLGVVRGIAGQ+RLKVTVRGSQGHAGT
Sbjct: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWTGFPLGVVRGIAGQSRLKVTVRGSQGHAGT 300

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMP+RQDPMAASAELIVQLEKLCKQPESYLSFDGHC+D TLKSLSTSLVCTVGEISTWP
Sbjct: 301 VPMPLRQDPMAASAELIVQLEKLCKQPESYLSFDGHCSDFTLKSLSTSLVCTVGEISTWP 360

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDDIGREAVIYEFSNQ                 HDANA+ISD
Sbjct: 361 SASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHKICSSRSVSCNIERKHDANAVISD 420

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAAS ALKKMVG++QEEVP+LMSGAGHDAMAMS+LTKVGMLFVRCRGGVSHS
Sbjct: 421 SELSSQLKSAASAALKKMVGDIQEEVPILMSGAGHDAMAMSYLTKVGMLFVRCRGGVSHS 480

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           PAEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 481 PAEHVLDDDIWAAGLAVLEFLENHL 504

BLAST of HG10021011 vs. ExPASy Swiss-Prot
Match: C0M0V4 (Allantoate deiminase 1 OS=Glycine max OX=3847 GN=AAH1 PE=1 SV=1)

HSP 1 Score: 652.5 bits (1682), Expect = 3.7e-186
Identity = 338/476 (71.01%), Positives = 391/476 (82.14%), Query Frame = 0

Query: 28  SSFSILFSYLLFILLFSSTPTAYSFTGDVAEDSENRRADLFVQILKDEAVGRLNELGKVS 87
           ++F +L  +LLF LL  S P+  S    +      +R DLF QIL+DEAV RL ELGKVS
Sbjct: 8   NTFFLLSCFLLFCLL--SAPSCVSMFSGIETGDLEKRDDLFPQILRDEAVARLYELGKVS 67

Query: 88  DAARYLERTFLSPASIKARFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSH 147
           DA+ YLERTFLSPAS+KA  L++KWMEDAGLRTWVD MGN+HGR +G N +AEALLIGSH
Sbjct: 68  DASGYLERTFLSPASMKAIDLIRKWMEDAGLRTWVDQMGNVHGRVDGANENAEALLIGSH 127

Query: 148 LDTVVDAGKFDGALGIISAISALKVFNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSA 207
           +DTVVDAG FDG+LGI+SAISA+K  ++NGKL++LKRP+EVIAFSDEEGVRFQ+TFLGS 
Sbjct: 128 MDTVVDAGMFDGSLGIVSAISAVKAMHVNGKLQKLKRPVEVIAFSDEEGVRFQTTFLGSG 187

Query: 208 AIAGILPVSSLEISDKSGMTIKDVITESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPV 267
           AIAGILP ++LEISDK  + IKD + E+ + ITEE+LL+LKYD KSVWGYVEVHIEQGPV
Sbjct: 188 AIAGILPGTTLEISDKREVMIKDFLKENSMDITEESLLKLKYDPKSVWGYVEVHIEQGPV 247

Query: 268 LEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQP 327
           LE  GFPLGVV+GIAGQTRLKVTVRGSQGHAGTVPM MRQDPMAA+AE IV LE LCK P
Sbjct: 248 LEQVGFPLGVVKGIAGQTRLKVTVRGSQGHAGTVPMSMRQDPMAAAAEQIVVLESLCKHP 307

Query: 328 ESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWPSASN-----VTFTVDLRTIDDIGREAV 387
           E YLS+DGHC+D+T+KSLS+SLVCTVGEISTWPSASN     VT+TVD+R IDD+GREAV
Sbjct: 308 EEYLSYDGHCSDSTVKSLSSSLVCTVGEISTWPSASNVIPGQVTYTVDIRAIDDLGREAV 367

Query: 388 IYEFSNQ-----------------HDANAIISDSKLSSQLKSAASTALKKMVGELQEEVP 447
           IY+ S Q                 HDA A+I DS LSSQLKSAA +ALKKM G++Q+EVP
Sbjct: 368 IYDLSKQIYQICDKRSVSCIIEHKHDAGAVICDSDLSSQLKSAAYSALKKMEGDIQDEVP 427

Query: 448 VLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLEN 482
            LMSGAGHDAMA+SHLTKVGMLFVRCRGG+SHSP EHVLD+D+WAAGLA L FLEN
Sbjct: 428 TLMSGAGHDAMAISHLTKVGMLFVRCRGGISHSPQEHVLDNDVWAAGLATLSFLEN 481

BLAST of HG10021011 vs. ExPASy Swiss-Prot
Match: I1L153 (Allantoate deiminase 2 OS=Glycine max OX=3847 GN=AAH2 PE=1 SV=1)

HSP 1 Score: 651.7 bits (1680), Expect = 6.3e-186
Identity = 338/476 (71.01%), Positives = 390/476 (81.93%), Query Frame = 0

Query: 28  SSFSILFSYLLFILLFSSTPTAYSFTGDVAEDSENRRADLFVQILKDEAVGRLNELGKVS 87
           ++F +   +LLF LL  S P+  S    +      +R DLF QIL+DEAV RL ELGKVS
Sbjct: 8   NTFFLHSCFLLFCLL--SAPSCVSMFSGIETGDLEKRDDLFPQILRDEAVARLYELGKVS 67

Query: 88  DAARYLERTFLSPASIKARFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNASAEALLIGSH 147
           DA+ YLERTFLSPAS++A  L++KWMEDAGLRTWVD MGN+HGR +G NA+AEALLIGSH
Sbjct: 68  DASGYLERTFLSPASMRAINLIRKWMEDAGLRTWVDQMGNVHGRVDGANANAEALLIGSH 127

Query: 148 LDTVVDAGKFDGALGIISAISALKVFNMNGKLEELKRPIEVIAFSDEEGVRFQSTFLGSA 207
           +DTVVDAG FDG+LGI+SAISALK  ++NGKL++LKRP+EVIAFSDEEGVRFQ+TFLGS 
Sbjct: 128 MDTVVDAGMFDGSLGIVSAISALKAMHVNGKLQKLKRPVEVIAFSDEEGVRFQTTFLGSG 187

Query: 208 AIAGILPVSSLEISDKSGMTIKDVITESGVQITEENLLQLKYDRKSVWGYVEVHIEQGPV 267
           AIAGILP ++LEISDK  + IKD + E+ + ITEE+LL+LKYD KSVWGYVEVHIEQGPV
Sbjct: 188 AIAGILPGTTLEISDKREVMIKDFLKENSIDITEESLLKLKYDPKSVWGYVEVHIEQGPV 247

Query: 268 LEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAELIVQLEKLCKQP 327
           LE  GFPLGVV+GIAGQTRLKVTVRGSQGHAGTVPM MRQDPMAA+AE IV LE LCK P
Sbjct: 248 LEQVGFPLGVVKGIAGQTRLKVTVRGSQGHAGTVPMSMRQDPMAAAAEQIVVLESLCKHP 307

Query: 328 ESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWPSASN-----VTFTVDLRTIDDIGREAV 387
           E YLS+DGHC+D+T+KSLSTSLVCTVGEISTWPSASN     VT+TVD+R IDD+GREAV
Sbjct: 308 EEYLSYDGHCSDSTVKSLSTSLVCTVGEISTWPSASNVIPGQVTYTVDIRAIDDLGREAV 367

Query: 388 IYEFSNQ-----------------HDANAIISDSKLSSQLKSAASTALKKMVGELQEEVP 447
           IY+ S Q                 HDA A+I DS LSSQLKSAA +ALKKM G++Q+EVP
Sbjct: 368 IYDLSKQIYQICDKRSVSCIIEHKHDAGAVICDSDLSSQLKSAAYSALKKMEGDIQDEVP 427

Query: 448 VLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLEN 482
            LMSGAGHDAMA+SHLTKVGMLFVRCRGG+SHSP EHVLD+D+WAA LA L FLEN
Sbjct: 428 TLMSGAGHDAMAISHLTKVGMLFVRCRGGISHSPQEHVLDNDVWAASLATLSFLEN 481

BLAST of HG10021011 vs. ExPASy Swiss-Prot
Match: O49434 (Allantoate deiminase OS=Arabidopsis thaliana OX=3702 GN=AAH PE=1 SV=2)

HSP 1 Score: 634.8 bits (1636), Expect = 8.0e-181
Identity = 327/500 (65.40%), Positives = 400/500 (80.00%), Query Frame = 0

Query: 21  HHHHHHHSSFSILFSYLLFILL----FSSTPTAYSFTGDVAEDS-----------ENRRA 80
           HHHHHH+    +LF  L+F LL     SS+ ++ S + D +  S           E  + 
Sbjct: 26  HHHHHHNHPSLVLFWCLVFSLLSPLALSSSSSSSSSSSDSSSSSSSHISLGIGETEGTKH 85

Query: 81  DLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRTWVDCM 140
           DL   IL+DEAV RL+ELG+VSDAA +LERTF+SPASI+A  L++ WMEDAGL TWVD M
Sbjct: 86  DLHQAILRDEAVARLHELGQVSDAATHLERTFMSPASIRAIPLIRGWMEDAGLSTWVDYM 145

Query: 141 GNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLEELKRP 200
           GN+HGR E +N S++ALLIGSH+DTV+DAGK+DG+LGIISAISALKV  ++G+L ELKRP
Sbjct: 146 GNVHGRVEPKNGSSQALLIGSHMDTVIDAGKYDGSLGIISAISALKVLKIDGRLGELKRP 205

Query: 201 IEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQITEENLL 260
           +EVIAFSDEEGVRFQSTFLGSAA+AGI+PVS LE++DKSG++++D + E+ + IT+ENL+
Sbjct: 206 VEVIAFSDEEGVRFQSTFLGSAALAGIMPVSRLEVTDKSGISVQDALKENSIDITDENLM 265

Query: 261 QLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPM 320
           QLKYD  SVWGYVEVHIEQGPVLEW G+PLGVV+GIAGQTRLKVTV+GSQGHAGTVPM M
Sbjct: 266 QLKYDPASVWGYVEVHIEQGPVLEWVGYPLGVVKGIAGQTRLKVTVKGSQGHAGTVPMSM 325

Query: 321 RQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWPSASN- 380
           RQDPM  +AELIV LE +CK P+ YLS +  C + T++SL+ SLVCTVGEISTWPSASN 
Sbjct: 326 RQDPMTGAAELIVLLESVCKNPKDYLSCNVQCNEDTVESLANSLVCTVGEISTWPSASNV 385

Query: 381 ----VTFTVDLRTIDDIGREAVIYEFS-----------------NQHDANAIISDSKLSS 440
               VTFTVDLRTIDD+GR+A++++ S                  +HDA+A++SD +LS 
Sbjct: 386 IPGQVTFTVDLRTIDDVGRKAILHDLSTRMYQICDKRSLLCSIERKHDADAVMSDPQLSL 445

Query: 441 QLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHV 484
           QLKSAA +ALKKM GE+Q+EVPVLMSGAGHDAMAM+HLTKVGMLFVRCRGG+SHSPAEHV
Sbjct: 446 QLKSAAQSALKKMTGEVQDEVPVLMSGAGHDAMAMAHLTKVGMLFVRCRGGISHSPAEHV 505

BLAST of HG10021011 vs. ExPASy Swiss-Prot
Match: Q655X8 (Probable allantoate deiminase OS=Oryza sativa subsp. japonica OX=39947 GN=AAH PE=1 SV=1)

HSP 1 Score: 496.5 bits (1277), Expect = 3.4e-139
Identity = 258/441 (58.50%), Positives = 328/441 (74.38%), Query Frame = 0

Query: 67  LFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRTWVDCMG 126
           L+ +IL+DE V RL ELGK+SD   YLERTFLSPASI+A  ++  WM+DAGL TW+D MG
Sbjct: 40  LYREILRDETVLRLKELGKISDGEGYLERTFLSPASIRASAVIISWMKDAGLTTWIDQMG 99

Query: 127 NLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLEELKRPI 186
           N+HGR E  N++ EALLIGSH+DTV+DAG +DGALGIISAISALKV  + G+L+ L RP+
Sbjct: 100 NIHGRFEPTNSTKEALLIGSHMDTVIDAGMYDGALGIISAISALKVLKVTGRLQRLTRPV 159

Query: 187 EVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQITEENLLQ 246
           EVIAFSDEEGVRFQ+TFLGSAA+AG LP S L++SDKSG T++DV+  + ++ T   L +
Sbjct: 160 EVIAFSDEEGVRFQTTFLGSAAVAGTLPESILQVSDKSGTTVQDVLKLNSLEGTANALGE 219

Query: 247 LKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMR 306
           ++Y  +SV  YVEVHIEQGPVLE   +PLGVV+GIAGQTRLKV + GSQGHAGTVPM +R
Sbjct: 220 VRYSPESVGSYVEVHIEQGPVLEALRYPLGVVKGIAGQTRLKVIINGSQGHAGTVPMKLR 279

Query: 307 QDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWPSASN-- 366
           +DPM A+AEL++ LE LCK+P  +L++D  C   T +SL+  LVCTVGE+ TWPSASN  
Sbjct: 280 RDPMVAAAELVLTLETLCKEPNKFLTYDEECGCFTEESLA-GLVCTVGELLTWPSASNVI 339

Query: 367 ---VTFTVDLRTIDDIGREAVIYEFS-----------------NQHDANAIISDSKLSSQ 426
              V FTVD+R +DD  RE ++  FS                  +H A A   D++L+S+
Sbjct: 340 PGQVNFTVDIRAMDDKVRETIVTSFSRLVLQRCDDRLVDCAVEQKHAAAATPCDAELTSR 399

Query: 427 LKSAASTALKKMVGELQE---EVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAE 483
           L+ A  + +  M   ++    E PVLMSGAGHDAMAM+ LTKVGMLFVRCRGGVSHSP E
Sbjct: 400 LERATRSTISSMAAGVRRAGGETPVLMSGAGHDAMAMARLTKVGMLFVRCRGGVSHSPEE 459

BLAST of HG10021011 vs. ExPASy Swiss-Prot
Match: Q01264 (N-carbamoyl-L-amino-acid hydrolase OS=Pseudomonas sp. (strain NS671) OX=29441 GN=hyuC PE=1 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 5.7e-62
Identity = 146/421 (34.68%), Positives = 226/421 (53.68%), Query Frame = 0

Query: 77  VGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRTWVDCMGNLHGRTEGRN 136
           + +L E+GK  D  + ++R  LS    +A  L+ +WM +AGL    D  GNL GR EG  
Sbjct: 15  IEQLGEIGKTKD--KGVQRLALSKEDREATLLVSEWMREAGLTVTHDHFGNLIGRKEGET 74

Query: 137 ASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLEELKRPIEVIAFSDEEG 196
            S  +++IGSH+D+V + GKFDG +G+++ I  +   +    + E    IEV+AF +EEG
Sbjct: 75  PSLPSVMIGSHIDSVRNGGKFDGVIGVLAGIEIVHAISEANVVHE--HSIEVVAFCEEEG 134

Query: 197 VRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQITEENLLQLKYDRKSVWG 256
            RF     GS  + G +    L+  D + +T  + +   G  I  +   Q   +   +  
Sbjct: 135 SRFNDGLFGSRGMVGKVKPEDLQKVDDNNVTRYEALKTFGFGIDPDFTHQSIREIGDIKH 194

Query: 257 YVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAASAEL 316
           Y E+HIEQGP LE + +P+G+V GIAG +  KV + G  GHAGTVPM +R+DP+  +AE+
Sbjct: 195 YFEMHIEQGPYLEKNNYPIGIVSGIAGPSWFKVRLVGEAGHAGTVPMSLRKDPLVGAAEV 254

Query: 317 IVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWPSASN-----VTFTVDL 376
           I ++E LC                 +   +   V TVG I+ +P  SN     V FT+D+
Sbjct: 255 IKEVETLC-----------------MNDPNAPTVGTVGRIAAFPGGSNIIPESVEFTLDI 314

Query: 377 RTIDDIGREAVIYEF-------SNQHDANAIISDSKLSSQLKSAAS--TALKKMVGELQE 436
           R I+   R  +I +        SN       I  +  +  +K + +   +LK+   EL+ 
Sbjct: 315 RDIELERRNKIIEKIEEKIKLVSNTRGLEYQIEKNMAAVPVKCSENLINSLKQSCKELEI 374

Query: 437 EVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVLEFLENH 484
           + P+++SGAGHDAM ++ +T++GM+FVRCR G+SHSP E    DDI      + E +  H
Sbjct: 375 DAPIIVSGAGHDAMFLAEITEIGMVFVRCRNGISHSPKEWAEIDDILTGTKVLYESIIKH 414

BLAST of HG10021011 vs. ExPASy TrEMBL
Match: A0A0A0LWV2 (M20_dimer domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G056970 PE=4 SV=1)

HSP 1 Score: 886.7 bits (2290), Expect = 4.3e-254
Identity = 461/505 (91.29%), Positives = 475/505 (94.06%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           MAIA++N+D ++AFLIKDH H+HHHHHSSFS+LF YLLF LLFSS PTAY+FTGDVAEDS
Sbjct: 1   MAIAFSNSDSLSAFLIKDH-HYHHHHHSSFSVLFPYLLFFLLFSSPPTAYAFTGDVAEDS 60

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
           +NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKA FLLQKWMEDAGLRT
Sbjct: 61  KNRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKASFLLQKWMEDAGLRT 120

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKV NMNGKLE
Sbjct: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVLNMNGKLE 180

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSG+TIKDVI ESGVQIT
Sbjct: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGITIKDVIKESGVQIT 240

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT
Sbjct: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTD+TLKSLSTSLVCTVGEISTWP
Sbjct: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDSTLKSLSTSLVCTVGEISTWP 360

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDDIGREAVIYEFSNQ                 HDANAIIS+
Sbjct: 361 SASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHNICSSRSVSCNIERKHDANAIISN 420

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAASTALKKMVGE+QEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS
Sbjct: 421 SELSSQLKSAASTALKKMVGEIQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           PAEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 481 PAEHVLDDDIWAAGLAVLEFLENHL 504

BLAST of HG10021011 vs. ExPASy TrEMBL
Match: A0A6J1GTI4 (allantoate deiminase 2-like OS=Cucurbita moschata OX=3662 GN=LOC111456979 PE=4 SV=1)

HSP 1 Score: 868.2 bits (2242), Expect = 1.6e-248
Identity = 450/505 (89.11%), Positives = 469/505 (92.87%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           MAIAYANTD + AFLIKDH HHHHH HSS S  FSYLLF LLFSS PTAY+FTGDVAEDS
Sbjct: 1   MAIAYANTDSMAAFLIKDH-HHHHHQHSSLSFFFSYLLFFLLFSSPPTAYAFTGDVAEDS 60

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
           +NRR DLFV+ILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLL+KWMED+GLRT
Sbjct: 61  KNRRDDLFVEILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLKKWMEDSGLRT 120

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGR EGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVF M+GKLE
Sbjct: 121 WVDCMGNLHGRAEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFYMSGKLE 180

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGM+IKDVITESGV+IT
Sbjct: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMSIKDVITESGVEIT 240

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEW+GFPLGVVRGIAGQ+RLKVTVRGSQGHAGT
Sbjct: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWTGFPLGVVRGIAGQSRLKVTVRGSQGHAGT 300

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMP+RQDPMAASAELIVQLEKLCKQPESYLSFDGHC+D TLKSLSTSLVCTVGEISTWP
Sbjct: 301 VPMPLRQDPMAASAELIVQLEKLCKQPESYLSFDGHCSDFTLKSLSTSLVCTVGEISTWP 360

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDDIGREAVIYEFSNQ                 HDANA+ISD
Sbjct: 361 SASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHKICSSRSVSCNIERKHDANAVISD 420

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAAS ALKKMVG++QEEVP+LMSGAGHDAMAMS+LTKVGMLFVRCRGGVSHS
Sbjct: 421 SELSSQLKSAASAALKKMVGDIQEEVPILMSGAGHDAMAMSYLTKVGMLFVRCRGGVSHS 480

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           PAEHVLDDDIWAAGLAVLEFLENHL
Sbjct: 481 PAEHVLDDDIWAAGLAVLEFLENHL 504

BLAST of HG10021011 vs. ExPASy TrEMBL
Match: A0A6J1K2Q5 (allantoate deiminase 2-like OS=Cucurbita maxima OX=3661 GN=LOC111490121 PE=4 SV=1)

HSP 1 Score: 866.3 bits (2237), Expect = 6.0e-248
Identity = 446/505 (88.32%), Positives = 470/505 (93.07%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           +AIAYANTD +  FLIKD  HHHHHH+SS S  FSYLLF+LLFSS PTAY+FTGDVAEDS
Sbjct: 3   IAIAYANTDSMADFLIKD--HHHHHHYSSLSYFFSYLLFLLLFSSPPTAYAFTGDVAEDS 62

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
           +NRR DLFV+ILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLL+KWMED+GLRT
Sbjct: 63  KNRRDDLFVEILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLKKWMEDSGLRT 122

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGR EGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVF MNGKLE
Sbjct: 123 WVDCMGNLHGRAEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFYMNGKLE 182

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGM+IKDVITESGV+IT
Sbjct: 183 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMSIKDVITESGVEIT 242

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEW+GFPLGVVRGIAGQ+RLKVTVRGSQGHAGT
Sbjct: 243 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWTGFPLGVVRGIAGQSRLKVTVRGSQGHAGT 302

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMP+RQDPMAASAELIVQLEKLCKQPESYLSFDGHC+D TLKSLSTSLVCTVGEIS+WP
Sbjct: 303 VPMPLRQDPMAASAELIVQLEKLCKQPESYLSFDGHCSDFTLKSLSTSLVCTVGEISSWP 362

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDD+GREAVIYEFSNQ                 HDANA+ISD
Sbjct: 363 SASNVIPGQVTFTVDLRTIDDVGREAVIYEFSNQVHKICSSRSVLCNIERKHDANAVISD 422

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAASTALKKMVG++QEEVP+LMSGAGHDAMAMSHLTKVGMLFVRCRGG+SHS
Sbjct: 423 SELSSQLKSAASTALKKMVGDIQEEVPILMSGAGHDAMAMSHLTKVGMLFVRCRGGISHS 482

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           PAEHVLDDD+WAAGLAVLEFLENHL
Sbjct: 483 PAEHVLDDDVWAAGLAVLEFLENHL 505

BLAST of HG10021011 vs. ExPASy TrEMBL
Match: A0A6J1E7N8 (allantoate deiminase 1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431551 PE=4 SV=1)

HSP 1 Score: 866.3 bits (2237), Expect = 6.0e-248
Identity = 452/505 (89.50%), Positives = 470/505 (93.07%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           MAIA A+T   T F IK+HQ HH+ ++SSFSILFSYL+F+LLFSS PTAY+FTGDVAEDS
Sbjct: 1   MAIASAST---TVFFIKNHQRHHYPYYSSFSILFSYLIFLLLFSSPPTAYAFTGDVAEDS 60

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
            NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT
Sbjct: 61  NNRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVF MNGKLE
Sbjct: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFYMNGKLE 180

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT
Sbjct: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT
Sbjct: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMP+RQDPMAASAELIVQLEKLCKQPESYLSFDGHC+D+TLKSLSTSLVCTVGEISTWP
Sbjct: 301 VPMPLRQDPMAASAELIVQLEKLCKQPESYLSFDGHCSDSTLKSLSTSLVCTVGEISTWP 360

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDDIGREAVIYEFSNQ                 HDANA+ISD
Sbjct: 361 SASNVIPGQVTFTVDLRTIDDIGREAVIYEFSNQVHKICSSRSVLCNIERKHDANAVISD 420

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAAS ALKKMVGE+QEEVPVLMSGAGHDAMAMS+LTKVGMLFVRCRGG SHS
Sbjct: 421 SELSSQLKSAASAALKKMVGEIQEEVPVLMSGAGHDAMAMSYLTKVGMLFVRCRGGASHS 480

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           P+EHVL+DD+WAAGLAVLEFLENHL
Sbjct: 481 PSEHVLEDDVWAAGLAVLEFLENHL 502

BLAST of HG10021011 vs. ExPASy TrEMBL
Match: A0A6J1KP96 (allantoate deiminase 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495274 PE=4 SV=1)

HSP 1 Score: 862.4 bits (2227), Expect = 8.7e-247
Identity = 450/505 (89.11%), Positives = 468/505 (92.67%), Query Frame = 0

Query: 1   MAIAYANTDCITAFLIKDHQHHHHHHHSSFSILFSYLLFILLFSSTPTAYSFTGDVAEDS 60
           MAI  ++T   T F IK+HQ HH+ +HSSFSILFSYL F+LLFSS PTAY+FTGDV EDS
Sbjct: 1   MAITSSST---TVFFIKNHQRHHYPYHSSFSILFSYLFFLLLFSSPPTAYAFTGDVVEDS 60

Query: 61  ENRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120
            NRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT
Sbjct: 61  NNRRADLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRT 120

Query: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLE 180
           WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVF MNGKLE
Sbjct: 121 WVDCMGNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFYMNGKLE 180

Query: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240
           ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT
Sbjct: 181 ELKRPIEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQIT 240

Query: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300
           EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT
Sbjct: 241 EENLLQLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGT 300

Query: 301 VPMPMRQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWP 360
           VPMP+RQDPMAASAELIVQLEKLCKQPESYLSFDGHC+D+TLKSLSTSLVCTVGEISTWP
Sbjct: 301 VPMPLRQDPMAASAELIVQLEKLCKQPESYLSFDGHCSDSTLKSLSTSLVCTVGEISTWP 360

Query: 361 SASN-----VTFTVDLRTIDDIGREAVIYEFSNQ-----------------HDANAIISD 420
           SASN     VTFTVDLRTIDDIGREAVI+EFSNQ                 HDANA+ISD
Sbjct: 361 SASNVIPGQVTFTVDLRTIDDIGREAVIFEFSNQVHKICSSRSVLCNIERKHDANAVISD 420

Query: 421 SKLSSQLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHS 480
           S+LSSQLKSAAS AL+KMVGE+QEEVPVLMSGAGHDAMAMS+LTKVGMLFVRCRGGVSHS
Sbjct: 421 SELSSQLKSAASAALQKMVGEIQEEVPVLMSGAGHDAMAMSYLTKVGMLFVRCRGGVSHS 480

Query: 481 PAEHVLDDDIWAAGLAVLEFLENHL 484
           P+EHVL+DDIWAAGLAVLEFLENHL
Sbjct: 481 PSEHVLEDDIWAAGLAVLEFLENHL 502

BLAST of HG10021011 vs. TAIR 10
Match: AT4G20070.1 (allantoate amidohydrolase )

HSP 1 Score: 634.8 bits (1636), Expect = 5.7e-182
Identity = 327/500 (65.40%), Positives = 400/500 (80.00%), Query Frame = 0

Query: 21  HHHHHHHSSFSILFSYLLFILL----FSSTPTAYSFTGDVAEDS-----------ENRRA 80
           HHHHHH+    +LF  L+F LL     SS+ ++ S + D +  S           E  + 
Sbjct: 26  HHHHHHNHPSLVLFWCLVFSLLSPLALSSSSSSSSSSSDSSSSSSSHISLGIGETEGTKH 85

Query: 81  DLFVQILKDEAVGRLNELGKVSDAARYLERTFLSPASIKARFLLQKWMEDAGLRTWVDCM 140
           DL   IL+DEAV RL+ELG+VSDAA +LERTF+SPASI+A  L++ WMEDAGL TWVD M
Sbjct: 86  DLHQAILRDEAVARLHELGQVSDAATHLERTFMSPASIRAIPLIRGWMEDAGLSTWVDYM 145

Query: 141 GNLHGRTEGRNASAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLEELKRP 200
           GN+HGR E +N S++ALLIGSH+DTV+DAGK+DG+LGIISAISALKV  ++G+L ELKRP
Sbjct: 146 GNVHGRVEPKNGSSQALLIGSHMDTVIDAGKYDGSLGIISAISALKVLKIDGRLGELKRP 205

Query: 201 IEVIAFSDEEGVRFQSTFLGSAAIAGILPVSSLEISDKSGMTIKDVITESGVQITEENLL 260
           +EVIAFSDEEGVRFQSTFLGSAA+AGI+PVS LE++DKSG++++D + E+ + IT+ENL+
Sbjct: 206 VEVIAFSDEEGVRFQSTFLGSAALAGIMPVSRLEVTDKSGISVQDALKENSIDITDENLM 265

Query: 261 QLKYDRKSVWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPM 320
           QLKYD  SVWGYVEVHIEQGPVLEW G+PLGVV+GIAGQTRLKVTV+GSQGHAGTVPM M
Sbjct: 266 QLKYDPASVWGYVEVHIEQGPVLEWVGYPLGVVKGIAGQTRLKVTVKGSQGHAGTVPMSM 325

Query: 321 RQDPMAASAELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWPSASN- 380
           RQDPM  +AELIV LE +CK P+ YLS +  C + T++SL+ SLVCTVGEISTWPSASN 
Sbjct: 326 RQDPMTGAAELIVLLESVCKNPKDYLSCNVQCNEDTVESLANSLVCTVGEISTWPSASNV 385

Query: 381 ----VTFTVDLRTIDDIGREAVIYEFS-----------------NQHDANAIISDSKLSS 440
               VTFTVDLRTIDD+GR+A++++ S                  +HDA+A++SD +LS 
Sbjct: 386 IPGQVTFTVDLRTIDDVGRKAILHDLSTRMYQICDKRSLLCSIERKHDADAVMSDPQLSL 445

Query: 441 QLKSAASTALKKMVGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHV 484
           QLKSAA +ALKKM GE+Q+EVPVLMSGAGHDAMAM+HLTKVGMLFVRCRGG+SHSPAEHV
Sbjct: 446 QLKSAAQSALKKMTGEVQDEVPVLMSGAGHDAMAMAHLTKVGMLFVRCRGGISHSPAEHV 505

BLAST of HG10021011 vs. TAIR 10
Match: AT5G43600.1 (ureidoglycolate amidohydrolase )

HSP 1 Score: 178.7 bits (452), Expect = 1.1e-44
Identity = 135/420 (32.14%), Positives = 211/420 (50.24%), Query Frame = 0

Query: 79  RLNELGKVSDA-ARYLERTFLSPASIKARFLLQKWMEDAGLRTWVDCMGNLHGRTEGRNA 138
           +++EL   SDA +  + R   +   + AR  ++  M  AGL    D +GN+ G+ +G   
Sbjct: 69  QIDELSSFSDAPSPSVTRVLYTDKDVSARRYVKNLMALAGLTVREDAVGNIFGKWDGLEP 128

Query: 139 SAEALLIGSHLDTVVDAGKFDGALGIISAISALKVFNMNGKLEELKRPIEVIAFSDEEGV 198
           +  A+  GSH+D +  +GK+DG +G++ AI A+ V   +G   + KR +E+I F+ EE  
Sbjct: 129 NLPAVATGSHIDAIPYSGKYDGVVGVLGAIEAINVLKRSG--FKPKRSLEIILFTSEEPT 188

Query: 199 RFQSTFLGSAAIAG---ILPVSSLEISDKSGMTIKDVITESG-VQITEENLLQLKYDRKS 258
           RF  + LGS  +AG   +       + D   ++  +    +G  +  +++L  +   + S
Sbjct: 189 RFGISCLGSRLLAGSKELAEALKTTVVDGQNVSFIEAARSAGYAEDKDDDLSSVFLKKGS 248

Query: 259 VWGYVEVHIEQGPVLEWSGFPLGVVRGIAGQTRLKVTVRGSQGHAGTVPMPMRQDPMAAS 318
            + ++E+HIEQGP+LE  G  +GVV  IA    LKV   G+ GHAG V MP R D   A+
Sbjct: 249 YFAFLELHIEQGPILEDEGLDIGVVTAIAAPASLKVEFEGNGGHAGAVLMPYRNDAGLAA 308

Query: 319 AELIVQLEKLCKQPESYLSFDGHCTDTTLKSLSTSLVCTVGEISTWPSASNVT-----FT 378
           AEL + +EK                   L+S S   V TVG +   P A N         
Sbjct: 309 AELALAVEK-----------------HVLESESIDTVGTVGILELHPGAINSIPSKSHLE 368

Query: 379 VDLRTIDDIGREAVIYEFSNQHDANAI-------ISDSKLSSQLKSAASTAL--KKM--- 438
           +D R ID+  R  VI +   Q  AN I       +S+ K+ +Q   A S  L  KKM   
Sbjct: 369 IDTRDIDEARRNTVIKKI--QESANTIAKKRKVKLSEFKIVNQDPPALSDKLVIKKMAEA 428

Query: 439 VGELQEEVPVLMSGAGHDAMAMSHLTKVGMLFVRCRGGVSHSPAEHVLDDDIWAAGLAVL 477
             EL     +++S A HD++ M+ ++ +GM+F+ C  G SH P E+   +D+ A G+ VL
Sbjct: 429 ATELNLSHKMMISRAYHDSLFMARISPMGMIFIPCYKGYSHKPEEYSSPEDM-ANGVKVL 466
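
The percentages in the HSP summary lines above follow directly from the residue counts (for example, 450 identical positions over an alignment length of 505 gives 89.11%). The sketch below shows one way to parse a pair of summary lines and recompute those values. The function name `parse_hsp` and the returned field names are hypothetical, chosen for illustration only; the pattern accepts both "Positives" and the "Postives" spelling that some exports of this report carry.

```python
import re

# Matches e.g. "HSP 1 Score: 868.2 bits (2242), Expect = 1.6e-248"
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*Expect\s*=\s*(?P<evalue>\S+)"
)
# Matches e.g. "Identity = 450/505 (89.11%), Positives = 469/505 (92.87%)"
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+).*?Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(score_line, identity_line):
    """Return the HSP statistics as numbers (hypothetical helper)."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP summary lines")
    length = int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * int(i["ident"]) / length, 2),
        "positives_pct": round(100 * int(i["pos"]) / length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 868.2 bits (2242), Expect = 1.6e-248",
    "Identity = 450/505 (89.11%), Positives = 469/505 (92.87%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when reports are copied between tools.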

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038876942.1  1.2e-258  91.94     allantoate deiminase 2 isoform X1 [Benincasa hispida]
XP_004145132.1  8.9e-254  91.29     allantoate deiminase 2 isoform X1 [Cucumis sativus]
XP_023542649.1  5.9e-250  89.31     allantoate deiminase 2-like [Cucurbita pepo subsp. pepo]
KAG7012345.1    8.6e-249  89.31     Allantoate deiminase 2, partial [Cucurbita argyrosperma subsp. argyrosperma]
XP_022954835.1  3.3e-248  89.11     allantoate deiminase 2-like [Cucurbita moschata]

Match Name      E-value   Identity  Description
C0M0V4          3.7e-186  71.01     Allantoate deiminase 1 OS=Glycine max OX=3847 GN=AAH1 PE=1 SV=1
I1L153          6.3e-186  71.01     Allantoate deiminase 2 OS=Glycine max OX=3847 GN=AAH2 PE=1 SV=1
O49434          8.0e-181  65.40     Allantoate deiminase OS=Arabidopsis thaliana OX=3702 GN=AAH PE=1 SV=2
Q655X8          3.4e-139  58.50     Probable allantoate deiminase OS=Oryza sativa subsp. japonica OX=39947 GN=AAH PE...
Q01264          5.7e-62   34.68     N-carbamoyl-L-amino-acid hydrolase OS=Pseudomonas sp. (strain NS671) OX=29441 GN...

Match Name      E-value   Identity  Description
A0A0A0LWV2      4.3e-254  91.29     M20_dimer domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G056970 P...
A0A6J1GTI4      1.6e-248  89.11     allantoate deiminase 2-like OS=Cucurbita moschata OX=3662 GN=LOC111456979 PE=4 S...
A0A6J1K2Q5      6.0e-248  88.32     allantoate deiminase 2-like OS=Cucurbita maxima OX=3661 GN=LOC111490121 PE=4 SV=...
A0A6J1E7N8      6.0e-248  89.50     allantoate deiminase 1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431551 ...
A0A6J1KP96      8.7e-247  89.11     allantoate deiminase 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495274 PE...

Match Name      E-value   Identity  Description
AT4G20070.1     5.7e-182  65.40     allantoate amidohydrolase
AT5G43600.1     1.1e-44   32.14     ureidoglycolate amidohydrolase
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                             Source       Source Term    Source Description                          Alignment
IPR002933  Peptidase M20                               PFAM         PF01546        Peptidase_M20                               coord: 143..480; e-value: 6.7E-22; score: 78.2
IPR010158  Amidase, carbamoylase-type                  TIGRFAM      TIGR01879      TIGR01879                                   coord: 78..476; e-value: 1.1E-88; score: 295.8
IPR010158  Amidase, carbamoylase-type                  PANTHER      PTHR32494      ALLANTOATE DEIMINASE-RELATED                coord: 64..476
None       No IPR available                            GENE3D       3.40.630.10    Zn peptidases                               coord: 86..476; e-value: 9.7E-118; score: 395.2
None       No IPR available                            GENE3D       3.30.70.360    -                                           coord: 283..404; e-value: 9.7E-118; score: 395.2
None       No IPR available                            PIRSR        PIRSR001235-1  PIRSR001235-1                               coord: 80..479; e-value: 5.2E-105; score: 349.2
None       No IPR available                            PANTHER      PTHR32494:SF5  ALLANTOATE DEIMINASE-RELATED                coord: 64..476
None       No IPR available                            CDD          cd03884        M20_bAS                                     coord: 79..480; e-value: 4.34711E-149; score: 429.251
None       No IPR available                            SUPERFAMILY  53187          Zn-dependent exopeptidases                  coord: 69..482
IPR036264  Bacterial exopeptidase dimerisation domain  SUPERFAMILY  55031          Bacterial exopeptidase dimerisation domain  coord: 283..400
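
The "coord: start..end" entries in the InterPro alignments can be turned into span lengths and containment checks with a few lines of code. The sketch below is illustrative only: the helper names `parse_coord`, `span_length` and `contains` are hypothetical, and it assumes the coordinates are 1-based, inclusive residue positions on the protein, which this page does not state explicitly.

```python
import re

# Matches the "coord: 143..480" part of an InterPro alignment entry.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(alignment):
    """Extract (start, end) from an alignment string (hypothetical helper)."""
    m = COORD_RE.search(alignment)
    if m is None:
        raise ValueError("no 'coord: start..end' entry found")
    return int(m.group(1)), int(m.group(2))


def span_length(alignment):
    """Number of residues covered, assuming inclusive 1-based coordinates."""
    start, end = parse_coord(alignment)
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    o_start, o_end = parse_coord(outer)
    i_start, i_end = parse_coord(inner)
    return o_start <= i_start and i_end <= o_end


m20 = "coord: 143..480"      # PFAM PF01546, Peptidase_M20
dimer = "coord: 283..400"    # SUPERFAMILY 55031, dimerisation domain
print(span_length(m20), span_length(dimer), contains(m20, dimer))
```

Under the stated assumption, the dimerisation domain span falls inside the Peptidase M20 span reported by PFAM.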

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10021011.1  HG10021011.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006145      purine nucleobase catabolic process
biological_process  GO:0010136      ureide catabolic process
cellular_component  GO:0005783      endoplasmic reticulum
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0047652      allantoate deiminase activity
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0016813      hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amidines