Cp4.1LG08g07140 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g07140
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Cupin 2 conserved barrel domain protein
Location: Cp4.1LG08 : 5529045 .. 5533587 (-)
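As a rough illustration of how the Location field can be read, the sketch below computes the span of the gene and shows a reverse-complement helper, since the feature lies on the minus strand. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this record does not state explicitly); `feature_length` and `reverse_complement` are hypothetical helper names, not part of any database API.

```python
# Minimal sketch, assuming 1-based inclusive coordinates (GFF-style).
# Both helper names are hypothetical and used for illustration only.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def feature_length(start: int, end: int) -> int:
    """Number of bases covered by a feature with inclusive coordinates."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string, as needed for minus-strand features."""
    return seq.translate(COMPLEMENT)[::-1]


# Coordinates copied from the Location field of this record.
gene_length = feature_length(5529045, 5533587)
print(gene_length)  # 4543
```

Under that assumption the gene spans 4543 bases on Cp4.1LG08. A minus-strand feature would typically be reported as the reverse complement of the reference interval, which is what `reverse_complement` illustrates.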
The following sequences are available for this feature:

Gene sequence (with intron)

CATCGTCAAAAAACGAGAAATTATCACGTGACTTCAAAATCAACCAGTGAATGTGAAGCTCACTGTGCTCATCGGCTGCCATTTCTCCACCTCGAGCATCCAATAAAAAGCGCCACCATGGATTGCTATTCCTACCGATTCTAATTCGTCCTGATCAAAGAGAAAAAAAGTTCGATCGGAATTCAATAAACAGAATCAGAAGATAAAATCCGTCTCCCGTTTGATATGTGGAGCTTCTCAGCGCACTCCCATGGAGCATCTCTTCTAATTATCATCACCACTTCAAGTAAACCATCTCTTGTATCTCTTTGTATCTTCCCCCATTTGATTCCCGATACAGCTTTGAAGAATGGCGAATTGAAACTCATGTTACTGTAATTTATTTCAGGTTTGCTTGGAGTTACATTTGGTGGAGAAGGGTTTTGTTCAACGCCCTCCATTATAGATTCGGATGCCGATTCGAAGCCTCTGTATTATAAAGTGACCAATCCTACTCTTTCTCCTTCTCATCTGCAAGGTTGGTAATTTTGTATGGGGCGAGAAATTGTTCTTGTTATGAAGTGGTTTTGCAGAATTCCGGCATTTCCATTTTTGTTATTTAGTTTGATCAAGTACTTAGTTCTGGGCGTGAAGTTGAATATTCAATTCATCTTATGTGGAGTATGCTGTTGCAATTTTCCAGTTTATTAACGACTCTCTGGCTTTTCTGATACAGTTTATGTGGAGTATAATGTTAGAATTTTCCCACTAGTTGATCTACACCTTATGAAAATTTGCTTAGAGTTCAATTCATGGTAACCACCTACCTAGACAACCAAATGTAGTGAAATCAAATGATTTTCTTTTGTGATCAGTGAAGGTATGCTCATAAATTTCAAAAGAAGGGTAAAAAGAAAATAAAGAAGAACGTAGTTACTATTTGAACGAGATGGAAACTGCTTGAAACTAATTAACATTTTTTAAAGTGAAACGAAATGTAGAAATGGATCCTACCAAAGCTCTAGAGATGTCTTAAGACTCTTAACAATGTTTTGGTGATGACATTAAGAACTTAAGCGTCATGGTTGATGGGAGCCCATTTTTGTTTTAGATTGGCTTGTATTCTCCTTGTTTTCTTTCATTTGTTCTCGATTGAGGTTTCGTTATTGATAAATTTAAGAAAAAACACGAAAAAAATTACTAATGTCCATGATTGGCTATCACCAGATCGTAATGTTTAACATTCTTATTCTGGTAAATGATCTTTATTTTTCAAGAATGTAACAGTAAAACATAACTTCTACTTTTGGGGACTTGATTTGAATAATTTACAATGAAAGTATGATAGCTACGTATTGCTATCTCGACTTCACTTGTTTTTTATTTGCAGATTTGCCTGGTTACACCCGCAGTGTTTACAAAAGGGATCATGCCTTGATAACACCGGAAAGTCAAGTGTTTAGTCCTCTACCTGAGTGGTAATTTTGGTCACTTTTGAACATTCTTGTTAATTATTCTTTGCAAGGCCTGTATTATTTTTTTCCCTTTGCAGGACTAAAACACTTGGCGCATATTTAATCACACCAGCACTTGGTGCACATTTTGTGATGTATCTTGCGCAGATGCAAGGTTTAGTTGAACTTTGCCACGAACCTTTACATAATGCGCATATATACACAACGAGTATTTTTTATAATATTTTATTCATATTTTCTTTTATTGTATGAGCTGCAGAGAAATCAAAATCAGGATTGCCCCCACATGATGTTGAGAGGTGTACTTTCTTTTGCCTTCTTTATGTAGTCCCATTGCAGAGAAATTGATAATATTTACCTAATACCTATATCATGCAACTGACATTTCAAATTTTCATCTATGGTTTTCATACTAGATTTCTATTTGTTGTTCAAGGAGCAGTGAAACTTACCAATTCGTCAGGCATAAGCGAAAAACTTACGGTGAGACAGAGTCGTCAATTTACTTGTTCATATTTTTTCTCCAGTGTCAGATGCTTGCTTTGT
AGTTAAGAATAAGATTTAATGGTTTATTTCCTAGAAATGTCTATAAGATTCATTCTTCATAGTTCATACATTTGGATTTCATTTGATATTCTCAAGGCTGCAAATCAGCTCATACAAGAAAGGTTTGTGTTGAATAAAGTACCTAATTCCCTCATCATATTATTTCATTGTCCTGTTTGAATGGTTGGCTAGCATGAAGGCCGATGATTTGTTGCTAGTGAGGTTCACAGATAAGAACAGCACCTCGAATTTTATAGATCGAACGTTGAAATACAAGTATCTAAATATATTCATTTTTATTCAATGAAAACAGTGATTTGTGCTCATGTAGACTGTCTGTTATGGTTGATAATTGGATTGTTTGCGGAGTGAAACTGCTCCCGTGATACGATCAATAGTCGAGAAGTCTGATGATATTTATGAATCTGTGAAATGAAGGTTGATTCATTTGCTTATCTACCTCCGAACTTTGACCATTCCGTGAAGTGTGACTCTTCTGCTACCCTTGTGGTTTTCGAAAGAAGGTTTGTTTCCACCTATTAACTATAATATACATGAAATGGCTTCACATTTGTGCACTAGTCATCTTCACTCCAGTTCATGATAAGCATTTATCAATACTTGCATCATGTATGTATTTGTTCTTTCACCTGAAACCAATGTTCTAAGGAATGATTATGAAACTGTTCAGTATATCTAGTGTCACATATATCATAGGAAATACTTAAATAACATTGCTATAATTGGCTTATAAACAGTTCACCTCTCGTGACTTCGAATTGTCTCATTTAATCAGGTATGCTTCTCTGGAGAATCATCACCCCAAGCAGATCATTGGTTCGACAGACAAGCAGCCCCTCCTCGAAACTCCTGGTGAGGTAAGCGAGGATGTTTCTCCTTGCTTTAGGAAGGTTGAAAGTTTCTTTTGGAAAAGGAAACACAAATATTCATTGATAATATGCAAATTACGAGTAAAGATGCAAGTTCTCAGTAAGAACTGCTATAGATTTAAACTTACAAAAATTAATAAACTATCCTTGGAACAAATGGTTCTTCTGAACATCTTACAGATATGTTCTTACATCATCCTATAATATATTTGAAAGTTCTTCCTTGTGTTGTATCACATTACCAGATCTTTCAACTTAGGAAGCTCCTTCCCATGTCCATGGCGTACGATTTCAATGTCCATGTAAGTTATGAGTTGCAATTGAAAGATTAGTTCTTTTGTCTCGGTCATGAGCTTTCTCATAGTTATATTGAACATTTTTTCCCACTAGATCATGGATTTTGAGCCTGGGGAGTTTCTTAATGTGAAGGTAATTTCATGAAGTTCAAATCTTACATTTCTTAGCTTTGGTTGGTTTTAAACTTCCTTCTAATTATTCGGTTTATACGTTATTCGATGGAAAAATACATTCCTTTTAAATTGGATCTATGTTATGCCTCTTAATTATAGTTTTTGTTGAATGGTTGACAGGAGGTTCATTATAATCAGCATGGTTTGTTGCTCTTAGAGGGCCAGGGCATTTATCGTTTGGGCGAAAACTGGTAAATATTTCATTCCTCTTGCTCGGCGTTGATCGCCTTTCATATATATATATATATATATATAATAAGCTTGAAACTAGTTTCACTTTTTCTTGGGCAGAGCTGGAAATTTTCTTAGGGTGGGATAAATTTTTTATTTGAACCTCTTGTGCACATAAATCTTTTAAATTTGGTTAACTAGACAAGGTAGAGACATCTATCCTCAACCAAGAAAGAAGTTAGAAGTAGAGAAACATAGAAATCTTTTGGTTTAAAAAATTTAGGAGGGCAAGTTAGAGACAAGATTTAGAACTAAGAGGCGCCCCCTGCTCTGCCCCTCTCCTTGTTTACTCCACTCTTCTGGACATCATCCATTCATCACAAAGTCCATCATATAACAAATAAATCAGCAAACGAACATGTATTTTAAATATAAGATAGATATGCATAGCTTTTATCTTGTAAGCTGC
ACTGTAGGGTTCTTATGACATGTGACTAAAATCAAAGAAATACAATTCATTTTTCGTTCTTGTTCAAAATATTTCTCATGGACTTCGCACTTGTTCTTATTATTGCTGTTCTTGTAGGTACCCTGTTCAATCTGGTGATGTCATTTGGATGGCTCCCTTTGTACCGCAATGGTTAGTCAAATATGTTATACTCGTTCTTCTTCATCCTCATTACAATATAGTGTAAACTAACTGGGTTTGGGTTTGGGTTTATATTATATGATCTTTTCAGGTATGCTGCACTCGGGAAAACACGAAGTCGCTATCTATTGTACAAGGATACGAACCGTGACCCCCTCGATCACAAGTAGTATGTTGATGCAACATTATAAGATTACTATCATATGAACACAATGAACTGGTAGGATTTATTTTAGATTGTTGAAATGGAATAGTGCATTTGCCCCTGCAGACCCCTAAGTTTTTAAGGTTAAGCTTACAATAAATATTTGATGCAAATTTCATAGTCCATTCAATTCATATTGAAGGACTTTTTTCTCCTAA

mRNA sequence

CATCGTCAAAAAACGAGAAATTATCACGTGACTTCAAAATCAACCAGTGAATGTGAAGCTCACTGTGCTCATCGGCTGCCATTTCTCCACCTCGAGCATCCAATAAAAAGCGCCACCATGGATTGCTATTCCTACCGATTCTAATTCGTCCTGATCAAAGAGAAAAAAAGTTCGATCGGAATTCAATAAACAGAATCAGAAGATAAAATCCGTCTCCCGTTTGATATGTGGAGCTTCTCAGCGCACTCCCATGGAGCATCTCTTCTAATTATCATCACCACTTCAAGTTTGCTTGGAGTTACATTTGGTGGAGAAGGGTTTTGTTCAACGCCCTCCATTATAGATTCGGATGCCGATTCGAAGCCTCTGTATTATAAAGTGACCAATCCTACTCTTTCTCCTTCTCATCTGCAAGATTTGCCTGGTTACACCCGCAGTGTTTACAAAAGGGATCATGCCTTGATAACACCGGAAAGTCAAGTGTTTAGTCCTCTACCTGAGTGGACTAAAACACTTGGCGCATATTTAATCACACCAGCACTTGGTGCACATTTTGTGATGTATCTTGCGCAGATGCAAGAGAAATCAAAATCAGGATTGCCCCCACATGATGTTGAGAGATTTCTATTTGTTGTTCAAGGAGCAGTGAAACTTACCAATTCGTCAGGCATAAGCGAAAAACTTACGGTTGATTCATTTGCTTATCTACCTCCGAACTTTGACCATTCCGTGAAGTGTGACTCTTCTGCTACCCTTGTGGTTTTCGAAAGAAGGTATGCTTCTCTGGAGAATCATCACCCCAAGCAGATCATTGGTTCGACAGACAAGCAGCCCCTCCTCGAAACTCCTGGTGAGATCTTTCAACTTAGGAAGCTCCTTCCCATGTCCATGGCGTACGATTTCAATGTCCATATCATGGATTTTGAGCCTGGGGAGTTTCTTAATGTGAAGGAGGTTCATTATAATCAGCATGGTTTGTTGCTCTTAGAGGGCCAGGGCATTTATCGTTTGGGCGAAAACTGGTACCCTGTTCAATCTGGTGATGTCATTTGGATGGCTCCCTTTGTACCGCAATGGTATGCTGCACTCGGGAAAACACGAAGTCGCTATCTATTGTACAAGGATACGAACCGTGACCCCCTCGATCACAAGTAGTATGTTGATGCAACATTATAAGATTACTATCATATGAACACAATGAACTGGTAGGATTTATTTTAGATTGTTGAAATGGAATAGTGCATTTGCCCCTGCAGACCCCTAAGTTTTTAAGGTTAAGCTTACAATAAATATTTGATGCAAATTTCATAGTCCATTCAATTCATATTGAAGGACTTTTTTCTCCTAA

Coding sequence (CDS)

ATGTGGAGCTTCTCAGCGCACTCCCATGGAGCATCTCTTCTAATTATCATCACCACTTCAAGTTTGCTTGGAGTTACATTTGGTGGAGAAGGGTTTTGTTCAACGCCCTCCATTATAGATTCGGATGCCGATTCGAAGCCTCTGTATTATAAAGTGACCAATCCTACTCTTTCTCCTTCTCATCTGCAAGATTTGCCTGGTTACACCCGCAGTGTTTACAAAAGGGATCATGCCTTGATAACACCGGAAAGTCAAGTGTTTAGTCCTCTACCTGAGTGGACTAAAACACTTGGCGCATATTTAATCACACCAGCACTTGGTGCACATTTTGTGATGTATCTTGCGCAGATGCAAGAGAAATCAAAATCAGGATTGCCCCCACATGATGTTGAGAGATTTCTATTTGTTGTTCAAGGAGCAGTGAAACTTACCAATTCGTCAGGCATAAGCGAAAAACTTACGGTTGATTCATTTGCTTATCTACCTCCGAACTTTGACCATTCCGTGAAGTGTGACTCTTCTGCTACCCTTGTGGTTTTCGAAAGAAGGTATGCTTCTCTGGAGAATCATCACCCCAAGCAGATCATTGGTTCGACAGACAAGCAGCCCCTCCTCGAAACTCCTGGTGAGATCTTTCAACTTAGGAAGCTCCTTCCCATGTCCATGGCGTACGATTTCAATGTCCATATCATGGATTTTGAGCCTGGGGAGTTTCTTAATGTGAAGGAGGTTCATTATAATCAGCATGGTTTGTTGCTCTTAGAGGGCCAGGGCATTTATCGTTTGGGCGAAAACTGGTACCCTGTTCAATCTGGTGATGTCATTTGGATGGCTCCCTTTGTACCGCAATGGTATGCTGCACTCGGGAAAACACGAAGTCGCTATCTATTGTACAAGGATACGAACCGTGACCCCCTCGATCACAAGTAG

Protein sequence

MWSFSAHSHGASLLIIITTSSLLGVTFGGEGFCSTPSIIDSDADSKPLYYKVTNPTLSPSHLQDLPGYTRSVYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVERFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHHPKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGLLLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPLDHK
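The protein sequence above appears to follow from the CDS by ordinary translation. The following is a minimal sketch of that step, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, and the two input strings are the first ten codons and the last four codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the record does not say which translation table was used).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS in this record.
print(translate("ATGTGGAGCTTCTCAGCGCACTCCCATGGA"))  # MWSFSAHSHG
# Last four codons of the CDS, ending in the TAG stop codon.
print(translate("GATCACAAGTAG"))  # DHK
```

Both results agree with the start (MWSFSAHSHG) and end (DHK) of the protein sequence shown above.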
BLAST of Cp4.1LG08g07140 vs. Swiss-Prot
Match: UGHY_ARATH ((S)-ureidoglycine aminohydrolase OS=Arabidopsis thaliana GN=UGLYAH PE=1 SV=1)

HSP 1 Score: 487.3 bits (1253), Expect = 1.3e-136
Identity = 224/293 (76.45%), Positives = 257/293 (87.71%), Query Frame = 1

Query: 14  LIIITTSSLLGVTFGGEGFCSTPSIIDSDADSKPLYYKVTNPTLSPSHLQDLPGYTRSVY 73
           LI+    SL+  +   +GFCS PSI++SD  + P+Y+K TNPTLSPSHLQDLPG+TRSVY
Sbjct: 6   LIVFIVISLVKASKSDDGFCSAPSIVESDEKTNPIYWKATNPTLSPSHLQDLPGFTRSVY 65

Query: 74  KRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVERF 133
           KRDHALITPES V+SPLP+WT TLGAYLITPA G+HFVMYLA+M+E S SGLPP D+ER 
Sbjct: 66  KRDHALITPESHVYSPLPDWTNTLGAYLITPATGSHFVMYLAKMKEMSSSGLPPQDIERL 125

Query: 134 LFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHHPK 193
           +FVV+GAV LTN+S  S+KLTVDS+AYLPPNF HS+ C  SATLVVFERRY  L +H  +
Sbjct: 126 IFVVEGAVTLTNTSSSSKKLTVDSYAYLPPNFHHSLDCVESATLVVFERRYEYLGSHTTE 185

Query: 194 QIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGLLL 253
            I+GSTDKQPLLETPGE+F+LRKLLPMS+AYDFN+H MDF+PGEFLNVKEVHYNQHGLLL
Sbjct: 186 LIVGSTDKQPLLETPGEVFELRKLLPMSVAYDFNIHTMDFQPGEFLNVKEVHYNQHGLLL 245

Query: 254 LEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           LEGQGIYRLG+NWYPVQ+GDVIWMAPFVPQWYAALGKTRSRYLLYKD NR+PL
Sbjct: 246 LEGQGIYRLGDNWYPVQAGDVIWMAPFVPQWYAALGKTRSRYLLYKDVNRNPL 298
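The percentages in an HSP summary line are simply match counts divided by the alignment length. The sketch below parses such a line and recomputes both values for the Arabidopsis hit; the regular expression and the `percent` helper are illustrative assumptions, not part of any BLAST tool, and the pattern also accepts the misspelled label "Postives".

```python
import re

# Hypothetical pattern for the HSP summary line; "Posi?tives" tolerates
# both "Positives" and the misspelled "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches: int, length: int) -> float:
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


line = "Identity = 224/293 (76.45%), Positives = 257/293 (87.71%), Query Frame = 1"
m = HSP_STATS.search(line)
identical, aligned = int(m.group(1)), int(m.group(2))
positive = int(m.group(4))

print(percent(identical, aligned))  # 76.45
print(percent(positive, aligned))   # 87.71
```

The recomputed values match the figures reported for this HSP, so the alignment length (293 columns) is the denominator for both statistics.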

BLAST of Cp4.1LG08g07140 vs. Swiss-Prot
Match: UGHY_ORYSJ (Probable (S)-ureidoglycine aminohydrolase OS=Oryza sativa subsp. japonica GN=UGLYAH PE=1 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 2.1e-123
Identity = 208/301 (69.10%), Positives = 242/301 (80.40%), Query Frame = 1

Query: 13  LLIIITTSSLLGVTFG----GEGFCSTPSIIDSDADS---KPLYYKVTNPTLSPSHLQDL 72
           LL++ +   L  V  G    GEGFCS      S   S    PLY+K TNPTL+P+HLQDL
Sbjct: 8   LLVVASALPLASVAAGAVGVGEGFCSAEPSAASGGCSGVRPPLYWKATNPTLAPAHLQDL 67

Query: 73  PGYTRSVYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGL 132
           PG+TRSVYKRDHALITPES VFSPLP+W  TLGAYLI+PA+GAHF MYLA+M + SKS L
Sbjct: 68  PGFTRSVYKRDHALITPESHVFSPLPDWINTLGAYLISPAIGAHFTMYLAKMHDGSKSAL 127

Query: 133 PPHDVERFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYA 192
           PP  VER +FV+QG++ L+  SG +  L VDS+AYLP N  HSV  D   TLV+FERRY 
Sbjct: 128 PPKGVERLIFVLQGSILLSEESGNTHTLLVDSYAYLPANMKHSVISDEVTTLVIFERRYT 187

Query: 193 SLENHHPKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVH 252
           ++E +HP  I+GSTDKQPLLETPGE+F+LRKLLP S+ YDFN+HIMDF+PGE+LNVKEVH
Sbjct: 188 TIEGYHPDLIVGSTDKQPLLETPGEVFELRKLLPTSLPYDFNIHIMDFQPGEYLNVKEVH 247

Query: 253 YNQHGLLLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDP 307
           YNQHGLLLLEGQGIYRLG++WYPVQSGD IWMAPFVPQWYAALGKT++RYLLYKD NRDP
Sbjct: 248 YNQHGLLLLEGQGIYRLGDSWYPVQSGDTIWMAPFVPQWYAALGKTKTRYLLYKDVNRDP 307

BLAST of Cp4.1LG08g07140 vs. Swiss-Prot
Match: ALLE_ECOLI ((S)-ureidoglycine aminohydrolase OS=Escherichia coli (strain K12) GN=allE PE=1 SV=1)

HSP 1 Score: 125.9 bits (315), Expect = 7.4e-28
Identity = 78/259 (30.12%), Positives = 128/259 (49.42%), Query Frame = 1

Query: 61  HLQDLPGY------TRSVYKRDH-ALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMY 120
           +L ++ GY       R++ K  + AL+TP+  V + +P +       L TP LGA FV Y
Sbjct: 3   YLNNVTGYREDLLANRAIVKHGNFALLTPDGLVKNIIPGFENCDATILSTPKLGASFVDY 62

Query: 121 LAQMQEK--SKSGLPPHDVERFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPN----FDH 180
           L  + +   ++ G     +E FL+V+ G +    + G +  L+   + Y PP     F +
Sbjct: 63  LVTLHQNGGNQQGFGGEGIETFLYVISGNIT-AKAEGKTFALSEGGYLYCPPGSLMTFVN 122

Query: 181 SVKCDSSATLVVFERRYASLENHHPKQIIGSTDKQPLLETPG-EIFQLRKLLPMSMAYDF 240
           +   DS   + +++RRY  +E + P  + G+  +   +   G +   L   LP  + +D 
Sbjct: 123 AQAEDSQ--IFLYKRRYVPVEGYAPWLVSGNASELERIHYEGMDDVILLDFLPKELGFDM 182

Query: 241 NVHIMDFEPGEFLNVKEVHYNQHGLLLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYA 300
           N+HI+ F PG      E H  +HG  +L GQG+Y L  NW PV+ GD I+M  +  Q   
Sbjct: 183 NMHILSFAPGASHGYIETHVQEHGAYILSGQGVYNLDNNWIPVKKGDYIFMGAYSLQAGY 242

Query: 301 ALGKTRS-RYLLYKDTNRD 305
            +G+  +  Y+  KD NRD
Sbjct: 243 GVGRGEAFSYIYSKDCNRD 258

BLAST of Cp4.1LG08g07140 vs. TrEMBL
Match: A0A0A0L7K1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G073830 PE=4 SV=1)

HSP 1 Score: 587.0 bits (1512), Expect = 1.3e-164
Identity = 278/309 (89.97%), Positives = 291/309 (94.17%), Query Frame = 1

Query: 1   MWSFSAHSHGASLLIIITTSSLLGVTFGGEGFCSTPSIIDSDADSKPLYYKVTNPTLSPS 60
           MWSFS +SHG SLLI+  TSSLLG  FGGEGFCS PS++DSDADSK LYYKVTNPTLSPS
Sbjct: 1   MWSFSTYSHGLSLLILFATSSLLGFAFGGEGFCSAPSVVDSDADSKALYYKVTNPTLSPS 60

Query: 61  HLQDLPGYTRSVYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEK 120
           HLQDLPG+TRSVYKRDHALITPESQVFSPLPEWT TLGAYLITPALG+HFVMYLAQM+EK
Sbjct: 61  HLQDLPGFTRSVYKRDHALITPESQVFSPLPEWTNTLGAYLITPALGSHFVMYLAQMKEK 120

Query: 121 SKSGLPPHDVERFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVF 180
           SKSGLPP DVERFLFV+QGAVKLTNSSGISEKLTVDSFAYLPPNFDHSV  DSSATLVVF
Sbjct: 121 SKSGLPPTDVERFLFVIQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVMSDSSATLVVF 180

Query: 181 ERRYASLENHHPKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLN 240
           ERRYASL +HH KQI+GSTDKQPLLETPGE+FQLRKLLPMSM YDFNVHIMDFEPGEFLN
Sbjct: 181 ERRYASLVDHHTKQIVGSTDKQPLLETPGEVFQLRKLLPMSMPYDFNVHIMDFEPGEFLN 240

Query: 241 VKEVHYNQHGLLLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKD 300
           VKEVHYNQHGLLLLEGQGIYRLG+ WYPVQSGD IWMAPFVPQWYAALGKTRSRYLLYKD
Sbjct: 241 VKEVHYNQHGLLLLEGQGIYRLGDYWYPVQSGDAIWMAPFVPQWYAALGKTRSRYLLYKD 300

Query: 301 TNRDPLDHK 310
            NR+PLDHK
Sbjct: 301 MNRNPLDHK 309

BLAST of Cp4.1LG08g07140 vs. TrEMBL
Match: A0A0D2TB23_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G230800 PE=4 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 1.2e-141
Identity = 236/295 (80.00%), Positives = 267/295 (90.51%), Query Frame = 1

Query: 13  LLIIITTSSLLGVTFGGEGFCSTPSIID-SDADSKPLYYKVTNPTLSPSHLQDLPGYTRS 72
           LL+  T +S+     G EGFCS PSI+D +D+ SKPLY+KVT+PTLSPSHLQDLPG+TRS
Sbjct: 10  LLLSFTLTSVFNQVLGDEGFCSAPSILDQTDSSSKPLYWKVTSPTLSPSHLQDLPGFTRS 69

Query: 73  VYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVE 132
           VY+RDHALITPES VFSPLP+WT TLGAYLITPA+G+HFVMYLA+MQE S+SGLPP+DVE
Sbjct: 70  VYRRDHALITPESHVFSPLPDWTNTLGAYLITPAIGSHFVMYLAKMQENSRSGLPPNDVE 129

Query: 133 RFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHH 192
           R +FV QGAV LTNSSGIS KL VDS+AYLPPNFDHS+KCD SATLVVFERRYA L+NH 
Sbjct: 130 RLIFVTQGAVTLTNSSGISNKLVVDSYAYLPPNFDHSLKCDGSATLVVFERRYAFLDNHI 189

Query: 193 PKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGL 252
            + I+GSTDKQPLLETPGE+F+LRKLLP SM YDFN+H+MDF+PGEFLNVKEVHYNQHGL
Sbjct: 190 TEHIVGSTDKQPLLETPGEVFELRKLLPASMPYDFNIHVMDFQPGEFLNVKEVHYNQHGL 249

Query: 253 LLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           LLLEGQGIYRLG++WYPVQ+GDVIWMAPFVPQWYAALGKTRSRYLLYKD NR+PL
Sbjct: 250 LLLEGQGIYRLGDSWYPVQAGDVIWMAPFVPQWYAALGKTRSRYLLYKDVNRNPL 304

BLAST of Cp4.1LG08g07140 vs. TrEMBL
Match: A0A0B0PV98_GOSAR (Putative ylbA OS=Gossypium arboreum GN=F383_14213 PE=4 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 1.3e-140
Identity = 235/295 (79.66%), Positives = 266/295 (90.17%), Query Frame = 1

Query: 13  LLIIITTSSLLGVTFGGEGFCSTPSIID-SDADSKPLYYKVTNPTLSPSHLQDLPGYTRS 72
           LL+  T  S+  +  G EGFCS PSI+D +D+ SKPLY+KVT+PTLSPSHLQDLPG+TRS
Sbjct: 10  LLLSFTLISVFNLVLGDEGFCSAPSILDQTDSSSKPLYWKVTSPTLSPSHLQDLPGFTRS 69

Query: 73  VYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVE 132
           VY+RDHALITPES VFSPLP+WT TLGAYLITPA+G+HFVMYLA+MQE S+SGLPP+DVE
Sbjct: 70  VYRRDHALITPESHVFSPLPDWTNTLGAYLITPAIGSHFVMYLAKMQENSRSGLPPNDVE 129

Query: 133 RFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHH 192
           R +FV QGAV LTNSSGIS KL VDS+AYLPPNFDHS+KCD SATLVVFERRYA L+NH 
Sbjct: 130 RLIFVTQGAVTLTNSSGISNKLVVDSYAYLPPNFDHSLKCDGSATLVVFERRYAFLDNHI 189

Query: 193 PKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGL 252
            + I+GSTDKQPLLETPGE+F+LRKLLP SM YDFN+HIMDF+PGEFLNVKE+HYNQHGL
Sbjct: 190 TEHIVGSTDKQPLLETPGEVFELRKLLPASMPYDFNIHIMDFQPGEFLNVKELHYNQHGL 249

Query: 253 LLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           LLLEG GIYRLG++WYPVQ+GDVIWMAPFVPQWYAALGKTRSRYLLYKD NR+PL
Sbjct: 250 LLLEGLGIYRLGDSWYPVQAGDVIWMAPFVPQWYAALGKTRSRYLLYKDVNRNPL 304

BLAST of Cp4.1LG08g07140 vs. TrEMBL
Match: A0A0D2Q007_GOSRA (Uncharacterized protein (Fragment) OS=Gossypium raimondii GN=B456_008G230800 PE=4 SV=1)

HSP 1 Score: 506.1 bits (1302), Expect = 2.9e-140
Identity = 235/297 (79.12%), Positives = 265/297 (89.23%), Query Frame = 1

Query: 11  ASLLIIITTSSLLGVTFGGEGFCSTPSIID-SDADSKPLYYKVTNPTLSPSHLQDLPGYT 70
           A + I    S +     G EGFCS PSI+D +D+ SKPLY+KVT+PTLSPSHLQDLPG+T
Sbjct: 5   ACVRISFAFSGVFNQVLGDEGFCSAPSILDQTDSSSKPLYWKVTSPTLSPSHLQDLPGFT 64

Query: 71  RSVYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHD 130
           RSVY+RDHALITPES VFSPLP+WT TLGAYLITPA+G+HFVMYLA+MQE S+SGLPP+D
Sbjct: 65  RSVYRRDHALITPESHVFSPLPDWTNTLGAYLITPAIGSHFVMYLAKMQENSRSGLPPND 124

Query: 131 VERFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLEN 190
           VER +FV QGAV LTNSSGIS KL VDS+AYLPPNFDHS+KCD SATLVVFERRYA L+N
Sbjct: 125 VERLIFVTQGAVTLTNSSGISNKLVVDSYAYLPPNFDHSLKCDGSATLVVFERRYAFLDN 184

Query: 191 HHPKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQH 250
           H  + I+GSTDKQPLLETPGE+F+LRKLLP SM YDFN+H+MDF+PGEFLNVKEVHYNQH
Sbjct: 185 HITEHIVGSTDKQPLLETPGEVFELRKLLPASMPYDFNIHVMDFQPGEFLNVKEVHYNQH 244

Query: 251 GLLLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           GLLLLEGQGIYRLG++WYPVQ+GDVIWMAPFVPQWYAALGKTRSRYLLYKD NR+PL
Sbjct: 245 GLLLLEGQGIYRLGDSWYPVQAGDVIWMAPFVPQWYAALGKTRSRYLLYKDVNRNPL 301

BLAST of Cp4.1LG08g07140 vs. TrEMBL
Match: M5WMV3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009303mg PE=4 SV=1)

HSP 1 Score: 501.1 bits (1289), Expect = 9.5e-139
Identity = 234/295 (79.32%), Positives = 265/295 (89.83%), Query Frame = 1

Query: 12  SLLIIITTSSLLGVTFGGEGFCSTPSIIDSDADSKPLYYKVTNPTLSPSHLQDLPGYTRS 71
           +L II TT SLL +    E FCS P I+DS ++SK LY+KVTNPTLSPSHLQDLPG+TRS
Sbjct: 4   ALAIIFTTLSLLKMAVAEEEFCSAPLIVDSGSNSKHLYWKVTNPTLSPSHLQDLPGFTRS 63

Query: 72  VYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVE 131
           VYK+DHALITPES VFSPLPEWT TLGAYLITPA+G+HFVMYLA+MQE S SGLPP+D E
Sbjct: 64  VYKQDHALITPESHVFSPLPEWTMTLGAYLITPAMGSHFVMYLAKMQENSLSGLPPYDAE 123

Query: 132 RFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHH 191
           RF+FVVQGAV LTN SGIS KLTVDS+AYLPPN +HS+KCD SATLVVFERR+ASLEN  
Sbjct: 124 RFIFVVQGAVTLTNVSGISHKLTVDSYAYLPPNVEHSLKCDGSATLVVFERRHASLENQP 183

Query: 192 PKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGL 251
            +QI+GSTD+QPLLETPGE+FQLRKL+P S+ YDFN+HIMDF+PGE+LNVKEVHYNQHGL
Sbjct: 184 TEQIVGSTDQQPLLETPGEVFQLRKLIPTSIPYDFNIHIMDFQPGEYLNVKEVHYNQHGL 243

Query: 252 LLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           LLLEGQGIYRLG++WYPVQ+GD IWMAPFVPQWYAALGK+RSRYLLYKD NR+PL
Sbjct: 244 LLLEGQGIYRLGDSWYPVQAGDAIWMAPFVPQWYAALGKSRSRYLLYKDVNRNPL 298

BLAST of Cp4.1LG08g07140 vs. TAIR10
Match: AT4G17050.1 (AT4G17050.1 ureidoglycine aminohydrolase)

HSP 1 Score: 487.3 bits (1253), Expect = 7.2e-138
Identity = 224/293 (76.45%), Positives = 257/293 (87.71%), Query Frame = 1

Query: 14  LIIITTSSLLGVTFGGEGFCSTPSIIDSDADSKPLYYKVTNPTLSPSHLQDLPGYTRSVY 73
           LI+    SL+  +   +GFCS PSI++SD  + P+Y+K TNPTLSPSHLQDLPG+TRSVY
Sbjct: 6   LIVFIVISLVKASKSDDGFCSAPSIVESDEKTNPIYWKATNPTLSPSHLQDLPGFTRSVY 65

Query: 74  KRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVERF 133
           KRDHALITPES V+SPLP+WT TLGAYLITPA G+HFVMYLA+M+E S SGLPP D+ER 
Sbjct: 66  KRDHALITPESHVYSPLPDWTNTLGAYLITPATGSHFVMYLAKMKEMSSSGLPPQDIERL 125

Query: 134 LFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHHPK 193
           +FVV+GAV LTN+S  S+KLTVDS+AYLPPNF HS+ C  SATLVVFERRY  L +H  +
Sbjct: 126 IFVVEGAVTLTNTSSSSKKLTVDSYAYLPPNFHHSLDCVESATLVVFERRYEYLGSHTTE 185

Query: 194 QIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGLLL 253
            I+GSTDKQPLLETPGE+F+LRKLLPMS+AYDFN+H MDF+PGEFLNVKEVHYNQHGLLL
Sbjct: 186 LIVGSTDKQPLLETPGEVFELRKLLPMSVAYDFNIHTMDFQPGEFLNVKEVHYNQHGLLL 245

Query: 254 LEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           LEGQGIYRLG+NWYPVQ+GDVIWMAPFVPQWYAALGKTRSRYLLYKD NR+PL
Sbjct: 246 LEGQGIYRLGDNWYPVQAGDVIWMAPFVPQWYAALGKTRSRYLLYKDVNRNPL 298

BLAST of Cp4.1LG08g07140 vs. NCBI nr
Match: gi|449463234|ref|XP_004149339.1| (PREDICTED: (S)-ureidoglycine aminohydrolase [Cucumis sativus])

HSP 1 Score: 587.0 bits (1512), Expect = 1.9e-164
Identity = 278/309 (89.97%), Positives = 291/309 (94.17%), Query Frame = 1

Query: 1   MWSFSAHSHGASLLIIITTSSLLGVTFGGEGFCSTPSIIDSDADSKPLYYKVTNPTLSPS 60
           MWSFS +SHG SLLI+  TSSLLG  FGGEGFCS PS++DSDADSK LYYKVTNPTLSPS
Sbjct: 1   MWSFSTYSHGLSLLILFATSSLLGFAFGGEGFCSAPSVVDSDADSKALYYKVTNPTLSPS 60

Query: 61  HLQDLPGYTRSVYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEK 120
           HLQDLPG+TRSVYKRDHALITPESQVFSPLPEWT TLGAYLITPALG+HFVMYLAQM+EK
Sbjct: 61  HLQDLPGFTRSVYKRDHALITPESQVFSPLPEWTNTLGAYLITPALGSHFVMYLAQMKEK 120

Query: 121 SKSGLPPHDVERFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVF 180
           SKSGLPP DVERFLFV+QGAVKLTNSSGISEKLTVDSFAYLPPNFDHSV  DSSATLVVF
Sbjct: 121 SKSGLPPTDVERFLFVIQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVMSDSSATLVVF 180

Query: 181 ERRYASLENHHPKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLN 240
           ERRYASL +HH KQI+GSTDKQPLLETPGE+FQLRKLLPMSM YDFNVHIMDFEPGEFLN
Sbjct: 181 ERRYASLVDHHTKQIVGSTDKQPLLETPGEVFQLRKLLPMSMPYDFNVHIMDFEPGEFLN 240

Query: 241 VKEVHYNQHGLLLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKD 300
           VKEVHYNQHGLLLLEGQGIYRLG+ WYPVQSGD IWMAPFVPQWYAALGKTRSRYLLYKD
Sbjct: 241 VKEVHYNQHGLLLLEGQGIYRLGDYWYPVQSGDAIWMAPFVPQWYAALGKTRSRYLLYKD 300

Query: 301 TNRDPLDHK 310
            NR+PLDHK
Sbjct: 301 MNRNPLDHK 309

BLAST of Cp4.1LG08g07140 vs. NCBI nr
Match: gi|823213343|ref|XP_012439412.1| (PREDICTED: (S)-ureidoglycine aminohydrolase [Gossypium raimondii])

HSP 1 Score: 510.8 bits (1314), Expect = 1.7e-141
Identity = 236/295 (80.00%), Positives = 267/295 (90.51%), Query Frame = 1

Query: 13  LLIIITTSSLLGVTFGGEGFCSTPSIID-SDADSKPLYYKVTNPTLSPSHLQDLPGYTRS 72
           LL+  T +S+     G EGFCS PSI+D +D+ SKPLY+KVT+PTLSPSHLQDLPG+TRS
Sbjct: 10  LLLSFTLTSVFNQVLGDEGFCSAPSILDQTDSSSKPLYWKVTSPTLSPSHLQDLPGFTRS 69

Query: 73  VYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVE 132
           VY+RDHALITPES VFSPLP+WT TLGAYLITPA+G+HFVMYLA+MQE S+SGLPP+DVE
Sbjct: 70  VYRRDHALITPESHVFSPLPDWTNTLGAYLITPAIGSHFVMYLAKMQENSRSGLPPNDVE 129

Query: 133 RFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHH 192
           R +FV QGAV LTNSSGIS KL VDS+AYLPPNFDHS+KCD SATLVVFERRYA L+NH 
Sbjct: 130 RLIFVTQGAVTLTNSSGISNKLVVDSYAYLPPNFDHSLKCDGSATLVVFERRYAFLDNHI 189

Query: 193 PKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGL 252
            + I+GSTDKQPLLETPGE+F+LRKLLP SM YDFN+H+MDF+PGEFLNVKEVHYNQHGL
Sbjct: 190 TEHIVGSTDKQPLLETPGEVFELRKLLPASMPYDFNIHVMDFQPGEFLNVKEVHYNQHGL 249

Query: 253 LLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           LLLEGQGIYRLG++WYPVQ+GDVIWMAPFVPQWYAALGKTRSRYLLYKD NR+PL
Sbjct: 250 LLLEGQGIYRLGDSWYPVQAGDVIWMAPFVPQWYAALGKTRSRYLLYKDVNRNPL 304

BLAST of Cp4.1LG08g07140 vs. NCBI nr
Match: gi|728849502|gb|KHG28945.1| (putative ylbA [Gossypium arboreum])

HSP 1 Score: 507.3 bits (1305), Expect = 1.9e-140
Identity = 235/295 (79.66%), Positives = 266/295 (90.17%), Query Frame = 1

Query: 13  LLIIITTSSLLGVTFGGEGFCSTPSIID-SDADSKPLYYKVTNPTLSPSHLQDLPGYTRS 72
           LL+  T  S+  +  G EGFCS PSI+D +D+ SKPLY+KVT+PTLSPSHLQDLPG+TRS
Sbjct: 10  LLLSFTLISVFNLVLGDEGFCSAPSILDQTDSSSKPLYWKVTSPTLSPSHLQDLPGFTRS 69

Query: 73  VYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVE 132
           VY+RDHALITPES VFSPLP+WT TLGAYLITPA+G+HFVMYLA+MQE S+SGLPP+DVE
Sbjct: 70  VYRRDHALITPESHVFSPLPDWTNTLGAYLITPAIGSHFVMYLAKMQENSRSGLPPNDVE 129

Query: 133 RFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHH 192
           R +FV QGAV LTNSSGIS KL VDS+AYLPPNFDHS+KCD SATLVVFERRYA L+NH 
Sbjct: 130 RLIFVTQGAVTLTNSSGISNKLVVDSYAYLPPNFDHSLKCDGSATLVVFERRYAFLDNHI 189

Query: 193 PKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGL 252
            + I+GSTDKQPLLETPGE+F+LRKLLP SM YDFN+HIMDF+PGEFLNVKE+HYNQHGL
Sbjct: 190 TEHIVGSTDKQPLLETPGEVFELRKLLPASMPYDFNIHIMDFQPGEFLNVKELHYNQHGL 249

Query: 253 LLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           LLLEG GIYRLG++WYPVQ+GDVIWMAPFVPQWYAALGKTRSRYLLYKD NR+PL
Sbjct: 250 LLLEGLGIYRLGDSWYPVQAGDVIWMAPFVPQWYAALGKTRSRYLLYKDVNRNPL 304

BLAST of Cp4.1LG08g07140 vs. NCBI nr
Match: gi|763784631|gb|KJB51702.1| (hypothetical protein B456_008G230800, partial [Gossypium raimondii])

HSP 1 Score: 506.1 bits (1302), Expect = 4.2e-140
Identity = 235/297 (79.12%), Positives = 265/297 (89.23%), Query Frame = 1

Query: 11  ASLLIIITTSSLLGVTFGGEGFCSTPSIID-SDADSKPLYYKVTNPTLSPSHLQDLPGYT 70
           A + I    S +     G EGFCS PSI+D +D+ SKPLY+KVT+PTLSPSHLQDLPG+T
Sbjct: 5   ACVRISFAFSGVFNQVLGDEGFCSAPSILDQTDSSSKPLYWKVTSPTLSPSHLQDLPGFT 64

Query: 71  RSVYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHD 130
           RSVY+RDHALITPES VFSPLP+WT TLGAYLITPA+G+HFVMYLA+MQE S+SGLPP+D
Sbjct: 65  RSVYRRDHALITPESHVFSPLPDWTNTLGAYLITPAIGSHFVMYLAKMQENSRSGLPPND 124

Query: 131 VERFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLEN 190
           VER +FV QGAV LTNSSGIS KL VDS+AYLPPNFDHS+KCD SATLVVFERRYA L+N
Sbjct: 125 VERLIFVTQGAVTLTNSSGISNKLVVDSYAYLPPNFDHSLKCDGSATLVVFERRYAFLDN 184

Query: 191 HHPKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQH 250
           H  + I+GSTDKQPLLETPGE+F+LRKLLP SM YDFN+H+MDF+PGEFLNVKEVHYNQH
Sbjct: 185 HITEHIVGSTDKQPLLETPGEVFELRKLLPASMPYDFNIHVMDFQPGEFLNVKEVHYNQH 244

Query: 251 GLLLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           GLLLLEGQGIYRLG++WYPVQ+GDVIWMAPFVPQWYAALGKTRSRYLLYKD NR+PL
Sbjct: 245 GLLLLEGQGIYRLGDSWYPVQAGDVIWMAPFVPQWYAALGKTRSRYLLYKDVNRNPL 301

BLAST of Cp4.1LG08g07140 vs. NCBI nr
Match: gi|645278924|ref|XP_008244458.1| (PREDICTED: ureidoglycine aminohydrolase [Prunus mume])

HSP 1 Score: 505.8 bits (1301), Expect = 5.5e-140
Identity = 237/295 (80.34%), Positives = 265/295 (89.83%), Query Frame = 1

Query: 12  SLLIIITTSSLLGVTFGGEGFCSTPSIIDSDADSKPLYYKVTNPTLSPSHLQDLPGYTRS 71
           +L II TT SLL +    E FCS PSI+DS  +SK LY+KVTNPTLSPSHLQDLPG+TRS
Sbjct: 54  ALAIIFTTLSLLKMAVAEEEFCSAPSIVDSGLNSKHLYWKVTNPTLSPSHLQDLPGFTRS 113

Query: 72  VYKRDHALITPESQVFSPLPEWTKTLGAYLITPALGAHFVMYLAQMQEKSKSGLPPHDVE 131
           VYKRDHALITPES VFSPLPEWT TLGAYLITPA+G+HFVMYLA+MQE S SGLPP+D E
Sbjct: 114 VYKRDHALITPESHVFSPLPEWTMTLGAYLITPAMGSHFVMYLAKMQENSLSGLPPYDAE 173

Query: 132 RFLFVVQGAVKLTNSSGISEKLTVDSFAYLPPNFDHSVKCDSSATLVVFERRYASLENHH 191
           RF+FVVQGAV LTN SGIS KLTVDS+AYLPPN DHS+KCD SATLVVFERR+ASLEN  
Sbjct: 174 RFIFVVQGAVTLTNVSGISHKLTVDSYAYLPPNVDHSLKCDGSATLVVFERRHASLENQP 233

Query: 192 PKQIIGSTDKQPLLETPGEIFQLRKLLPMSMAYDFNVHIMDFEPGEFLNVKEVHYNQHGL 251
            +QI+GSTD+QPLLETPGE+FQLRKL+P S+ YDFN+HIMDF+PGE+LNVKEVHYNQHGL
Sbjct: 234 TEQIVGSTDQQPLLETPGEVFQLRKLIPTSIPYDFNIHIMDFQPGEYLNVKEVHYNQHGL 293

Query: 252 LLLEGQGIYRLGENWYPVQSGDVIWMAPFVPQWYAALGKTRSRYLLYKDTNRDPL 307
           LLLEGQGIYRLG++WYPVQ+GD IWMAPFVPQWYAALGK+RSRYLLYKD NR+PL
Sbjct: 294 LLLEGQGIYRLGDSWYPVQAGDAIWMAPFVPQWYAALGKSRSRYLLYKDVNRNPL 348

The following BLAST results are available for this feature:
Swiss-Prot
Match Name    E-value     Identity (%)    Description
UGHY_ARATH    1.3e-136    76.45           (S)-ureidoglycine aminohydrolase OS=Arabidopsis thaliana GN=UGLYAH PE=1 SV=1
UGHY_ORYSJ    2.1e-123    69.10           Probable (S)-ureidoglycine aminohydrolase OS=Oryza sativa subsp. japonica GN=UGL...
ALLE_ECOLI    7.4e-28     30.12           (S)-ureidoglycine aminohydrolase OS=Escherichia coli (strain K12) GN=allE PE=1 S...

TrEMBL
Match Name          E-value     Identity (%)    Description
A0A0A0L7K1_CUCSA    1.3e-164    89.97           Uncharacterized protein OS=Cucumis sativus GN=Csa_3G073830 PE=4 SV=1
A0A0D2TB23_GOSRA    1.2e-141    80.00           Uncharacterized protein OS=Gossypium raimondii GN=B456_008G230800 PE=4 SV=1
A0A0B0PV98_GOSAR    1.3e-140    79.66           Putative ylbA OS=Gossypium arboreum GN=F383_14213 PE=4 SV=1
A0A0D2Q007_GOSRA    2.9e-140    79.12           Uncharacterized protein (Fragment) OS=Gossypium raimondii GN=B456_008G230800 PE=...
M5WMV3_PRUPE        9.5e-139    79.32           Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009303mg PE=4 SV=1

TAIR10
Match Name     E-value     Identity (%)    Description
AT4G17050.1    7.2e-138    76.45           ureidoglycine aminohydrolase

NCBI nr
Match Name                          E-value     Identity (%)    Description
gi|449463234|ref|XP_004149339.1|    1.9e-164    89.97           PREDICTED: (S)-ureidoglycine aminohydrolase [Cucumis sativus]
gi|823213343|ref|XP_012439412.1|    1.7e-141    80.00           PREDICTED: (S)-ureidoglycine aminohydrolase [Gossypium raimondii]
gi|728849502|gb|KHG28945.1|         1.9e-140    79.66           putative ylbA [Gossypium arboreum]
gi|763784631|gb|KJB51702.1|         4.2e-140    79.12           hypothetical protein B456_008G230800, partial [Gossypium raimondii]
gi|645278924|ref|XP_008244458.1|    5.5e-140    80.34           PREDICTED: ureidoglycine aminohydrolase [Prunus mume]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR014710    RmlC-like_jellyroll
IPR011051    RmlC_Cupin_sf
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0000256        allantoin catabolic process
biological_process    GO:0006145        purine nucleobase catabolic process
biological_process    GO:0010136        ureide catabolic process
biological_process    GO:0000023        maltose metabolic process
biological_process    GO:0043085        positive regulation of catalytic activity
biological_process    GO:0006355        regulation of transcription, DNA-templated
biological_process    GO:0019252        starch biosynthetic process
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0005667        transcription factor complex
molecular_function    GO:0071522        ureidoglycine aminohydrolase activity
molecular_function    GO:0003700        transcription factor activity, sequence-specific DNA binding
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG08g07140.1    Cp4.1LG08g07140.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description              Source     Source Term          Source Description              Alignment
IPR011051    RmlC-like cupin domain       unknown    SSF51182             RmlC-like cupins                coord: 65..305, score: 9.98
IPR014710    RmlC-like jelly roll fold    GENE3D     G3DSA:2.60.120.10    -                               coord: 60..306, score: 2.6
None         No IPR available             PANTHER    PTHR34571            FAMILY NOT NAMED                coord: 10..307, score: 8.5E
None         No IPR available             PANTHER    PTHR34571:SF1        UREIDOGLYCINE AMINOHYDROLASE    coord: 10..307, score: 8.5E

The following gene(s) are orthologous to this gene:
Gene               Orthologue        Organism                     Block
Cp4.1LG08g07140    CmaCh06G008170    Cucurbita maxima (Rimu)      cmacpeB852
Cp4.1LG08g07140    CmoCh06G008390    Cucurbita moschata (Rifu)    cmocpeB797
Cp4.1LG08g07140    Carg12048         Silver-seed gourd            carcpeB0808
The following gene(s) are paralogous to this gene:

None