Cp4.1LG20g05000 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g05000
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Urease accessory protein ureG, putative
Location: Cp4.1LG20 : 2935490 .. 2939115 (+)
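As a small illustration of how the location field can be read, the sketch below splits it into linkage group, start, end and strand, and computes the span of the gene. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(location: str):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    match = re.match(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG20 : 2935490 .. 2939115 (+)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 3626 bp on the plus strand of Cp4.1LG20.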
The following sequences are available for this feature:

Gene sequence (with intron)

GGGGGAAAAAAAATAAAAAAACCATCTACGTTTCAATCCTGGATGGATTGCAGACATTGGAATCCGTAGTTCAAACAAAAAACTTGTGAAGCCCTAAAAGTTGGAACATTTTGAGAATTGCGTTAAAAATTGGCTCCATCATTTATGAACACTGATGTTTGTTTCTGACATGTTTGTCTTTGTTCGACAGCTGTGTGGAGGAAGGAGGATCTTGCACCTGTGATACTCTTTTGGATTTTTCTTTCTTCCCTCAACAATCGAGCTCTGTTCTTGATTCTATTTCGATTTCTCTTCAAAATCCGTTTGAATTTTGATCCGCCGTTGTCAATCTCCGCCGCATTTTACGATCTTCCTCCTCTCGAACTCTTCTTCGAGCTTCATCAGTGTAACAATGGCTTCGCAAGATACTCATAGCCATAGCCACGACCACCACCATGACCACGACCACGGACATGATCATCACCACAGCCACGAGTAATTTCCATTCAATTTGTGCATTTAGGTCTTGATATCTGATCTCATTTCTCTGCTTAGCGTTCATTGTTTACTTGTAACAGAAAACCTCAGGGGGATTCATCCTTTGTTGGAGCAGATGGTAGGGTTTATCACAGTCATGATGGACTGGCGCCTCACTCCCATGAACCCATTTACTCTCCTGGCTTCTTCAGCCGGAGAGCTCCGCCTCTTCTTACTAGAGATTTCACTGAAAGGGCTTTTACCGTCGGTATTGGTGGCCCAGTTGGTACTGGGTATATTTCCATTTGCTTTCATTGTTTTTTACTCATTCCCATAGAAGGAAAATGCAGAGGAGATTTAGTTTTGAGCCTCTAAGAACATTCTACTGTTTTTTTTTATGAACTTTGAAGCCAATTTTGTTTTACTATTGTTTAGGAAGACAGCTCTGATGCTAGCTCTGTGTACCTTCTTACGTGACAAATACAGTCTAGCTGCGGTAAGTGAGATTTTGAATCGATTCTCTATATCACTTTCACTTTTCAGTTTCTGTGCTTCACTTCCTAATTTAATATTTGTGGAATCTTGGGTCTGTTATGCTTGACTTACTTTGTATTTGTTCATTCTAAAATAAAGCTGTCTATTCAGCTTTTTGTTGAGTTTTATTGAACCAGTTTCTCTGTTAGTCTCAGACCCTGCTTCTTCCCAGACTGATAATGTAACACCTTAAGCCTACCGCTAGGAGATATTGTCCTTTGAGCTTTCCCTTTCGAACTTCTTCTCAAGATTTTTAAAACGCGTCTGCTAGGAAGAGGTTTCCACACCCTTTTAAAGAATGATTCGTTCTCTTCCCCAACTGATGTGAGATAACCTACCCCATTTGGAGCCCAGTGTTCTCTCTAGCATTCGTTCCCTTCTCCAATCGATGTTGGACCCCCTAATCCATCCCCTTCGGGCCCAACATCCTTGCTGGCACACCCCCTCATGTCCACCCCCTTCGGGGCTCAGCCTCCTTGCTAGCACATCGTCCAGTGTCTGGCTATGATACCATTTGTAATAGCCTAAGTCCACCACTAACAGATATTGTCCTCTTTGGGCTTTTCCTATCAGGGGAACGTTCCTTTCTCCAATCGATGTGGGACCCTCCAATCCATCCCCTTCGGGGCCCAACGTCCTTGCTGGCACACCCCTTCAGGGCTCAGCCTCCTTGCTAGCACATCGTCTAGTGTTTGGCTCTGATACCATTTGTAATAGTCTAAACCCACCACCGACAAATATTGTCCTCTTTGGGCTTTTCCTCTCAGGCTTCTCCTAAAGGTTTTTAGAAAACGTCTGCTAAACAACATCTTTATAAAGAATGCTTCCGAGGTGGGATTTCACAGATAAAAGCACCAATTCCATTGTGTTTTAGAATATAATCAGCTGGATAACCTTCACTGCTCTATACATGTATACAAGAGGTTGTATACTATGATTATTATCTGCTGATATTGTTTTAAAGTAGGCTGATTCAATTACATCAAGTTCTTTGTTGTGGTAGAATGAAA
TGTTTTTCTGCTAGAAACCAAACGATATTTATTTGACATGATCACTGTTAGGTCACAAATGACATCTTCACGAAAGAGGATGGAGAATTTTTGGTGAAGCATGGAGCTCTACCAGAGGAAAGGATTAGAGCAGTGGAAACTGGTGGCTGCCCGCATGCTGCTATTCGCGAAGACATCAGCATTAATCTTGGACCTCTTGAGGAGCTTTCTAACTTGTACAAAACAGACATACTTCTATGTGAATCTGGTGGAGGTAATTTGGAAATCCCCAATTATGAAGCTTGAATAACATGTGTGTTAGATTACTTAACTTCACTGGATCTTTGTTAGCCCTTCCTTCTCGTTAATATACTTGCATTAGCACCCGAAAAATCAATTGTCGGTTCAAATGGTATGTTGAATGTTTAGCATTCAAAAGAAAAATAGTTCACAGTAATTTAGATATCTCGGGTTTATCTCGGGTTCTCATGTTTGATGTTAAAGAAGAAAATAATCTTGAATGTGAGATCCCACATCGGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCCAACAGACGCCTTCTAAAAACCTTGAGGGAAGCCCAAAAGGAAATGTCCAAAAAGGACAATATTTGGTAGCGGTGGGCTTGGACCGTTAAAATGTTGCAGTGGGTGGTTTGAGTTTAGTACCTGGAAGTCTTTGATTCATGTTTTATCTGTTTTGCTGTGCAGATAACTTAGCTGCCAACTTCAGCAGGGAACTTGCTGACTATATCATCTATATCATAGATGTGTCTGGTGGGGATAAAATTCCTCGAAAAGGCGGCCCCGGAATCACTCAGGCTGATCTCCTTGTATGTCATTTCCTAATTTTGTTTCAAAGTATCCATCTCAGTCAGTATTGCAATCAAGACAGGCATCTCTCTAAATGGTTCCATTTCAACAACAGGTAATAAACAAAACTGACCTTGCACAAGCTGTTGGAGCTGATCTAGCGGTTATGGAGCGTGACGCACTGAAAATGCGAGATGGGGGGCCGTTCGTATTTGCTCAGGTTAGCTCCCCCCTCGGTTCGATATCTTGATCACACTTTGTAGCTAGAGAATCAAGTAGTTGATTATCTCAGTTAATGTCTTCGAGTCTGAACGAGAACGAAGAACAGAAAACCTTATTTTTGTCGATGTTAAGCGTATCTGCCTGACACGCAGGTCAAACATGGAGTAGGCGTCGAGGAAATCGTTAACCACATTATACAGGCGTGGGAAGGAGCGACAGGGAAGAAACGGCATTGACATCTCTCAGCTGGGAAATGAGAGGGTCTGGTTTAAGTTTTAAGCAGTGCAATAGGAGCCGTTGTACACAGCTTTGGTGGCTTCGTTTCAGAATTATGTAAAATCTTTGTTCTAAAAAGCTTAGGAGGTTTGTTCCTCTTGGCAGAGTTCAAACTTTTAGGTTGGAAACAGGACTTGTGAAGTTCTAAGATAGATCATAAGAGGCTCCAAACTCATATGGGAATTTTCCAAGGCTCAAACTTATTATGCCGCTTGGGATGATTTGAAAAAATACAATCTAATCCGACTTGAATTGTTGATCATTCATTAGAATTTTACAAATATTGACCAAAATAT

mRNA sequence

GGGGGAAAAAAAATAAAAAAACCATCTACGTTTCAATCCTGGATGGATTGCAGACATTGGAATCCGTAGTTCAAACAAAAAACTTGTGAAGCCCTAAAAGTTGGAACATTTTGAGAATTGCGTTAAAAATTGGCTCCATCATTTATGAACACTGATGTTTGTTTCTGACATGTTTGTCTTTGTTCGACAGCTGTGTGGAGGAAGGAGGATCTTGCACCTGTGATACTCTTTTGGATTTTTCTTTCTTCCCTCAACAATCGAGCTCTGTTCTTGATTCTATTTCGATTTCTCTTCAAAATCCGTTTGAATTTTGATCCGCCGTTGTCAATCTCCGCCGCATTTTACGATCTTCCTCCTCTCGAACTCTTCTTCGAGCTTCATCAGTGTAACAATGGCTTCGCAAGATACTCATAGCCATAGCCACGACCACCACCATGACCACGACCACGGACATGATCATCACCACAGCCACGAAAAACCTCAGGGGGATTCATCCTTTGTTGGAGCAGATGGTAGGGTTTATCACAGTCATGATGGACTGGCGCCTCACTCCCATGAACCCATTTACTCTCCTGGCTTCTTCAGCCGGAGAGCTCCGCCTCTTCTTACTAGAGATTTCACTGAAAGGGCTTTTACCGTCGGTATTGGTGGCCCAGTTGGTACTGGGAAGACAGCTCTGATGCTAGCTCTGTGTACCTTCTTACGTGACAAATACAGTCTAGCTGCGGTCACAAATGACATCTTCACGAAAGAGGATGGAGAATTTTTGGTGAAGCATGGAGCTCTACCAGAGGAAAGGATTAGAGCAGTGGAAACTGGTGGCTGCCCGCATGCTGCTATTCGCGAAGACATCAGCATTAATCTTGGACCTCTTGAGGAGCTTTCTAACTTGTACAAAACAGACATACTTCTATGTGAATCTGGTGGAGATAACTTAGCTGCCAACTTCAGCAGGGAACTTGCTGACTATATCATCTATATCATAGATGTGTCTGGTGGGGATAAAATTCCTCGAAAAGGCGGCCCCGGAATCACTCAGGCTGATCTCCTTGTAATAAACAAAACTGACCTTGCACAAGCTGTTGGAGCTGATCTAGCGGTTATGGAGCGTGACGCACTGAAAATGCGAGATGGGGGGCCGTTCGTATTTGCTCAGGTCAAACATGGAGTAGGCGTCGAGGAAATCGTTAACCACATTATACAGGCGTGGGAAGGAGCGACAGGGAAGAAACGGCATTGACATCTCTCAGCTGGGAAATGAGAGGGTCTGGTTTAAGTTTTAAGCAGTGCAATAGGAGCCGTTGTACACAGCTTTGGTGGCTTCGTTTCAGAATTATGTAAAATCTTTGTTCTAAAAAGCTTAGGAGGTTTGTTCCTCTTGGCAGAGTTCAAACTTTTAGGTTGGAAACAGGACTTGTGAAGTTCTAAGATAGATCATAAGAGGCTCCAAACTCATATGGGAATTTTCCAAGGCTCAAACTTATTATGCCGCTTGGGATGATTTGAAAAAATACAATCTAATCCGACTTGAATTGTTGATCATTCATTAGAATTTTACAAATATTGACCAAAATAT

Coding sequence (CDS)

ATGGCTTCGCAAGATACTCATAGCCATAGCCACGACCACCACCATGACCACGACCACGGACATGATCATCACCACAGCCACGAAAAACCTCAGGGGGATTCATCCTTTGTTGGAGCAGATGGTAGGGTTTATCACAGTCATGATGGACTGGCGCCTCACTCCCATGAACCCATTTACTCTCCTGGCTTCTTCAGCCGGAGAGCTCCGCCTCTTCTTACTAGAGATTTCACTGAAAGGGCTTTTACCGTCGGTATTGGTGGCCCAGTTGGTACTGGGAAGACAGCTCTGATGCTAGCTCTGTGTACCTTCTTACGTGACAAATACAGTCTAGCTGCGGTCACAAATGACATCTTCACGAAAGAGGATGGAGAATTTTTGGTGAAGCATGGAGCTCTACCAGAGGAAAGGATTAGAGCAGTGGAAACTGGTGGCTGCCCGCATGCTGCTATTCGCGAAGACATCAGCATTAATCTTGGACCTCTTGAGGAGCTTTCTAACTTGTACAAAACAGACATACTTCTATGTGAATCTGGTGGAGATAACTTAGCTGCCAACTTCAGCAGGGAACTTGCTGACTATATCATCTATATCATAGATGTGTCTGGTGGGGATAAAATTCCTCGAAAAGGCGGCCCCGGAATCACTCAGGCTGATCTCCTTGTAATAAACAAAACTGACCTTGCACAAGCTGTTGGAGCTGATCTAGCGGTTATGGAGCGTGACGCACTGAAAATGCGAGATGGGGGGCCGTTCGTATTTGCTCAGGTCAAACATGGAGTAGGCGTCGAGGAAATCGTTAACCACATTATACAGGCGTGGGAAGGAGCGACAGGGAAGAAACGGCATTGA

Protein sequence

MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQGDSSFVGADGRVYHSHDGLAPHSHEPIYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVMERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH
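The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons of the CDS listed above.
cds_start = "ATGGCTTCGCAAGATACTCATAGCCATAGC"
print(translate(cds_start))  # MASQDTHSHS
```

The result matches the first ten residues of the protein sequence; passing the full CDS to `translate` should reproduce the whole protein if the standard code applies.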
BLAST of Cp4.1LG20g05000 vs. Swiss-Prot
Match: UREG_ORYSI (Urease accessory protein G OS=Oryza sativa subsp. indica GN=UREG PE=2 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 2.9e-135
Identity = 245/285 (85.96%), Positives = 258/285 (90.53%), Query Frame = 1

Query: 1   MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQ----GDSSFVGADGRVYHSHDGLAPHSHE 60
           MAS D H H H HHH HD G DHHHSH +      G  S+VG DGRV+HSHDGLAPHSHE
Sbjct: 1   MASHD-HDHHHHHHHSHDDG-DHHHSHHQDGSHGGGGGSWVGEDGRVWHSHDGLAPHSHE 60

Query: 61  PIYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTND 120
           PIYSPG FS+RAPPL++R F ERAFTVGIGGPVGTGKTALMLALC  LR+KYSLAAVTND
Sbjct: 61  PIYSPGDFSKRAPPLISRRFAERAFTVGIGGPVGTGKTALMLALCRSLREKYSLAAVTND 120

Query: 121 IFTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCE 180
           IFTKEDGEFL+KHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYK D+LLCE
Sbjct: 121 IFTKEDGEFLIKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKADLLLCE 180

Query: 181 SGGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLA 240
           SGGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLL+INKTDLA AVGADLA
Sbjct: 181 SGGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLIINKTDLAPAVGADLA 240

Query: 241 VMERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKR 282
           VMERDAL+MR+GGPFVFAQVKHGVGVEEIVNHI+QAWE ATG KR
Sbjct: 241 VMERDALRMREGGPFVFAQVKHGVGVEEIVNHILQAWEIATGNKR 283
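The percentages reported for an HSP are the match counts divided by the alignment length, gaps included. The sketch below redoes that arithmetic for the UREG_ORYSI hit above (245 identical and 258 similar positions over 285 aligned columns); the helper name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

identity = percent(245, 285)   # identical positions
positives = percent(258, 285)  # identical or conservatively substituted positions
print(identity, positives)     # 85.96 90.53
```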

BLAST of Cp4.1LG20g05000 vs. Swiss-Prot
Match: UREG_ARATH (Urease accessory protein G OS=Arabidopsis thaliana GN=UREG PE=2 SV=1)

HSP 1 Score: 479.9 bits (1234), Expect = 1.9e-134
Identity = 238/275 (86.55%), Positives = 255/275 (92.73%), Query Frame = 1

Query: 10  SHDHHHDHDHGHDHHHSHEKP---QGDSSFVGADGRVYHSHDGLAPHSHEPIYSPGFFSR 69
           SHDHHH H   HDH H HEK    +G +S+VG DG+VYHSHDGLAPHSHEPIYSPG+FSR
Sbjct: 3   SHDHHHHH---HDHEHDHEKSDGGEGKASWVGKDGKVYHSHDGLAPHSHEPIYSPGYFSR 62

Query: 70  RAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTKEDGEFL 129
           RAPPL  R+F+ERAFTVGIGGPVGTGKTALMLALC FLRDKYSLAAVTNDIFTKEDGEFL
Sbjct: 63  RAPPLHDRNFSERAFTVGIGGPVGTGKTALMLALCRFLRDKYSLAAVTNDIFTKEDGEFL 122

Query: 130 VKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGDNLAANF 189
           VK+GALPEERIRAVETGGCPHAAIREDISINLGPLEELSNL+K D+LLCESGGDNLAANF
Sbjct: 123 VKNGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLFKADLLLCESGGDNLAANF 182

Query: 190 SRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVMERDALKMR 249
           SRELADYIIYIIDVS GDKIPRKGGPGITQADLLVINKTDLA AVGADL+VMERD+L+MR
Sbjct: 183 SRELADYIIYIIDVSAGDKIPRKGGPGITQADLLVINKTDLAAAVGADLSVMERDSLRMR 242

Query: 250 DGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKR 282
           DGGPFVFAQVKHG+GVEEIVNH++ +WE ATGKKR
Sbjct: 243 DGGPFVFAQVKHGLGVEEIVNHVMHSWEHATGKKR 274

BLAST of Cp4.1LG20g05000 vs. Swiss-Prot
Match: UREG_ORYSJ (Urease accessory protein G OS=Oryza sativa subsp. japonica GN=UREG PE=2 SV=1)

HSP 1 Score: 479.6 bits (1233), Expect = 2.4e-134
Identity = 244/285 (85.61%), Positives = 257/285 (90.18%), Query Frame = 1

Query: 1   MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQ----GDSSFVGADGRVYHSHDGLAPHSHE 60
           MAS D H H H HHH HD G DHHHSH +      G  S+VG DGRV+HSHDGLAPHSHE
Sbjct: 1   MASHD-HDHDHHHHHSHDDG-DHHHSHHQDGSHGGGGGSWVGEDGRVWHSHDGLAPHSHE 60

Query: 61  PIYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTND 120
           PIYSPG FS+RAPPL++R F ERAFTVGIGGPVGTGKTALMLALC  LR+KYSLAAVTND
Sbjct: 61  PIYSPGDFSKRAPPLISRRFAERAFTVGIGGPVGTGKTALMLALCRSLREKYSLAAVTND 120

Query: 121 IFTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCE 180
           IFTKEDGEFL+KHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNL K D+LLCE
Sbjct: 121 IFTKEDGEFLIKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLCKADLLLCE 180

Query: 181 SGGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLA 240
           SGGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLL+INKTDLA AVGADLA
Sbjct: 181 SGGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLIINKTDLAPAVGADLA 240

Query: 241 VMERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKR 282
           VMERDAL+MR+GGPFVFAQVKHGVGVEEIVNHI+QAWE ATG KR
Sbjct: 241 VMERDALRMREGGPFVFAQVKHGVGVEEIVNHILQAWEIATGNKR 283

BLAST of Cp4.1LG20g05000 vs. Swiss-Prot
Match: UREG_SCHPO (Uncharacterized urease accessory protein ureG-like OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPCPB16A4.05c PE=3 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 2.9e-87
Identity = 171/282 (60.64%), Positives = 215/282 (76.24%), Query Frame = 1

Query: 2   ASQDTHSHSHDH-HHDHDH-GHDHHHSHEKPQGDSSFVGADGRVYHSHDGLAPHSHEPIY 61
           +   TH H+HD+ HH+HDH GHDHH SH+     SS   A  +    H     HSH+ + 
Sbjct: 11  SDDSTHHHTHDYDHHNHDHHGHDHH-SHDSSSNSSS-EAARLQFIQEHG----HSHDAME 70

Query: 62  SPG-FFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIF 121
           +PG +  R  P    RDF+ RAFT+G+GGPVG+GKTAL+L LC  L +KYS+  VTNDIF
Sbjct: 71  TPGSYLKRELPQFNHRDFSRRAFTIGVGGPVGSGKTALLLQLCRLLGEKYSIGVVTNDIF 130

Query: 122 TKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESG 181
           T+ED EFL+++ ALPEERIRA+ETGGCPHAAIRED+S NL  LEEL + + T++LL ESG
Sbjct: 131 TREDQEFLIRNKALPEERIRAIETGGCPHAAIREDVSGNLVALEELQSEFNTELLLVESG 190

Query: 182 GDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVM 241
           GDNLAAN+SR+LAD+IIY+IDVSGGDKIPRKGGPGIT++DLL+INKTDLA+ VGADL+VM
Sbjct: 191 GDNLAANYSRDLADFIIYVIDVSGGDKIPRKGGPGITESDLLIINKTDLAKLVGADLSVM 250

Query: 242 ERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKK 281
           +RDA K+R+ GP VFAQVK+ VG++EI   I+ A + A   K
Sbjct: 251 DRDAKKIRENGPIVFAQVKNQVGMDEITELILGAAKSAGALK 286

BLAST of Cp4.1LG20g05000 vs. Swiss-Prot
Match: UREG_ANADF (Urease accessory protein UreG OS=Anaeromyxobacter sp. (strain Fw109-5) GN=ureG PE=3 SV=1)

HSP 1 Score: 315.1 bits (806), Expect = 7.9e-85
Identity = 155/232 (66.81%), Positives = 177/232 (76.29%), Query Frame = 1

Query: 47  HDGLAPHSHEPIYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRD 106
           HD      H+    PG F  R  P    D   RAFTVG+GGPVG+GKTAL+LALC  LRD
Sbjct: 2   HDHSLHSGHDHGLGPGSFHDRGAPHARGDLRRRAFTVGVGGPVGSGKTALVLALCRALRD 61

Query: 107 KYSLAAVTNDIFTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSN 166
             SL  VTNDIFT+ED EFLV++ ALP ERIRAVETGGCPHAAIRED++ NL  LEEL+ 
Sbjct: 62  SRSLGVVTNDIFTREDAEFLVRNDALPAERIRAVETGGCPHAAIREDVTANLLALEELTE 121

Query: 167 LYKTDILLCESGGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTD 226
            ++ +IL CESGGDNLAA+FSRELADY IY+IDV+GGDK+PRKGGPGITQADLLV+NKTD
Sbjct: 122 AHRPEILFCESGGDNLAAHFSRELADYTIYVIDVAGGDKVPRKGGPGITQADLLVVNKTD 181

Query: 227 LAQAVGADLAVMERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATG 279
           LA AVGADL VM RDA +MR  GP VFAQV  GVGV EI  H++ A+  A G
Sbjct: 182 LATAVGADLDVMARDAARMRGDGPVVFAQVTRGVGVPEIAGHVLHAYRHAVG 233

BLAST of Cp4.1LG20g05000 vs. TrEMBL
Match: A0A0A0LUY7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G447410 PE=3 SV=1)

HSP 1 Score: 533.9 bits (1374), Expect = 1.2e-148
Identity = 266/282 (94.33%), Positives = 270/282 (95.74%), Query Frame = 1

Query: 1   MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQGDSSFVGADGRVYHSHDGLAPHSHEPIYS 60
           MASQDTH     HHH H  GHDHHH+HEKP+GDSSFVGADGRVYHSHDGLAPHSHEPIYS
Sbjct: 1   MASQDTH-----HHHHHHDGHDHHHTHEKPKGDSSFVGADGRVYHSHDGLAPHSHEPIYS 60

Query: 61  PGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTK 120
           PGFF+RRAPPLLTR+F ERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTK
Sbjct: 61  PGFFTRRAPPLLTRNFNERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTK 120

Query: 121 EDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGD 180
           EDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGD
Sbjct: 121 EDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGD 180

Query: 181 NLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVMER 240
           NLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLA AVGADLAVMER
Sbjct: 181 NLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLATAVGADLAVMER 240

Query: 241 DALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 283
           DALKMRDGGPFVFAQVKHGVGV EIVNHIIQAWE ATGKKRH
Sbjct: 241 DALKMRDGGPFVFAQVKHGVGVGEIVNHIIQAWEAATGKKRH 277

BLAST of Cp4.1LG20g05000 vs. TrEMBL
Match: A0A061GKY2_THECC (Urease accessory protein G OS=Theobroma cacao GN=TCM_029508 PE=3 SV=1)

HSP 1 Score: 506.9 bits (1304), Expect = 1.6e-140
Identity = 256/284 (90.14%), Positives = 261/284 (91.90%), Query Frame = 1

Query: 1   MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQGD---SSFVGADGRVYHSHDGLAPHSHEP 60
           MAS D H H H HHHDHDH HDHHH H     D   +S+VGADGRVYHSHDGLAPHSHEP
Sbjct: 27  MASNDHHVHDH-HHHDHDHDHDHHHHHHDHDHDKSTTSWVGADGRVYHSHDGLAPHSHEP 86

Query: 61  IYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDI 120
           IYSPGFFSRRAPPL  RDF ERAFTVGIGGPVGTGKTALMLALC FLRDKYSLAAVTNDI
Sbjct: 87  IYSPGFFSRRAPPLGNRDFNERAFTVGIGGPVGTGKTALMLALCKFLRDKYSLAAVTNDI 146

Query: 121 FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES 180
           FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES
Sbjct: 147 FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES 206

Query: 181 GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAV 240
           GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLA AVGADL V
Sbjct: 207 GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAPAVGADLGV 266

Query: 241 MERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKR 282
           MERDAL+MRDGGPFVFAQVKHG GVE+IVNHI+QAWE ATGKKR
Sbjct: 267 MERDALRMRDGGPFVFAQVKHGHGVEDIVNHILQAWEAATGKKR 309

BLAST of Cp4.1LG20g05000 vs. TrEMBL
Match: W9SHA3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_021695 PE=3 SV=1)

HSP 1 Score: 501.5 bits (1290), Expect = 6.6e-139
Identity = 252/285 (88.42%), Positives = 262/285 (91.93%), Query Frame = 1

Query: 1   MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQGDS---SFVGADGRVYHSHDGLAPHSHEP 60
           MAS D H H HDHHH HDH HDHH       GDS   ++VG DGRVYHSHDGLAPHSHEP
Sbjct: 1   MASDD-HHHHHDHHHHHDHDHDHH-------GDSRTGAWVGPDGRVYHSHDGLAPHSHEP 60

Query: 61  IYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDI 120
           IYSPGFFS+RAPPLLTRDF ERAFT+GIGGPVGTGKTALMLALC FLRDKYSLAAVTNDI
Sbjct: 61  IYSPGFFSKRAPPLLTRDFNERAFTIGIGGPVGTGKTALMLALCKFLRDKYSLAAVTNDI 120

Query: 121 FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES 180
           FTKEDGEFLVK+GALPEERIRAVETGGCPHAAIREDISINLGPLEELSNL+K DILLCES
Sbjct: 121 FTKEDGEFLVKNGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLFKADILLCES 180

Query: 181 GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAV 240
           GGDNLAANFSRELADYIIYIIDVS GDKIPRKGGPGITQADLLVINKTDLA AVGADLAV
Sbjct: 181 GGDNLAANFSRELADYIIYIIDVSAGDKIPRKGGPGITQADLLVINKTDLAPAVGADLAV 240

Query: 241 MERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 283
           MERDAL+MRDGGPFVFAQVKHGVG+EEIVNH++QAWE ATGKKRH
Sbjct: 241 MERDALRMRDGGPFVFAQVKHGVGIEEIVNHVLQAWEAATGKKRH 277

BLAST of Cp4.1LG20g05000 vs. TrEMBL
Match: B9S131_RICCO (Urease accessory protein ureG, putative OS=Ricinus communis GN=RCOM_0633220 PE=3 SV=1)

HSP 1 Score: 500.7 bits (1288), Expect = 1.1e-138
Identity = 245/273 (89.74%), Positives = 259/273 (94.87%), Query Frame = 1

Query: 10  SHDHHHDHDHGHDHHHSHEKPQGDSSFVGADGRVYHSHDGLAPHSHEPIYSPGFFSRRAP 69
           SHDHHH HDH HDH H H  P   +S++GADGRVYHSHDGLAPHSHEPIYSPGFFSRRAP
Sbjct: 4   SHDHHHTHDHEHDHQH-HRHPNEKTSWLGADGRVYHSHDGLAPHSHEPIYSPGFFSRRAP 63

Query: 70  PLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTKEDGEFLVKH 129
           P+LTRDF+ERAFTVGIGGPVGTGKTALMLA+C FLRDKYSLAAVTNDIFTKEDGEFL+K+
Sbjct: 64  PILTRDFSERAFTVGIGGPVGTGKTALMLAICKFLRDKYSLAAVTNDIFTKEDGEFLIKN 123

Query: 130 GALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGDNLAANFSRE 189
           GALPEERIRAVETGGCPHAAIREDISINLGPLEELS L+KTDILLCESGGDNLAANFSRE
Sbjct: 124 GALPEERIRAVETGGCPHAAIREDISINLGPLEELSKLFKTDILLCESGGDNLAANFSRE 183

Query: 190 LADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVMERDALKMRDGG 249
           LADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLA AVGADL VMERDA++MRDGG
Sbjct: 184 LADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLASAVGADLTVMERDAVRMRDGG 243

Query: 250 PFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 283
           PFVFAQVKHGVGVEEIVNH++QAWE ATGKK+H
Sbjct: 244 PFVFAQVKHGVGVEEIVNHVLQAWEVATGKKQH 275

BLAST of Cp4.1LG20g05000 vs. TrEMBL
Match: A0A067L1H1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04602 PE=3 SV=1)

HSP 1 Score: 500.0 bits (1286), Expect = 1.9e-138
Identity = 247/277 (89.17%), Positives = 261/277 (94.22%), Query Frame = 1

Query: 7   HSHSHDHHHDHDHGHDHHH-SHEKPQGDSSFVGADGRVYHSHDGLAPHSHEPIYSPGFFS 66
           H H HDH HDHDH HDHHH SHE+    +S+VG DGRVYHSHDGLAPHSHEPIYSPG+FS
Sbjct: 14  HGHGHDHDHDHDHDHDHHHHSHEQT---TSWVGPDGRVYHSHDGLAPHSHEPIYSPGYFS 73

Query: 67  RRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTKEDGEF 126
           RRAPP+LTRDF ERAFTVGIGGPVGTGKTALMLA+C FLRDKYSLAAVTNDIFTKEDGEF
Sbjct: 74  RRAPPILTRDFNERAFTVGIGGPVGTGKTALMLAICKFLRDKYSLAAVTNDIFTKEDGEF 133

Query: 127 LVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGDNLAAN 186
           L+K+GALPEERIRAVETGGCPHAAIREDISINLGPLEELS L+K DILLCESGGDNLAAN
Sbjct: 134 LIKNGALPEERIRAVETGGCPHAAIREDISINLGPLEELSRLFKADILLCESGGDNLAAN 193

Query: 187 FSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVMERDALKM 246
           FSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLA AVGADLAVMERDAL+M
Sbjct: 194 FSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAPAVGADLAVMERDALRM 253

Query: 247 RDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 283
           RDGGPFVFAQVKHGVGV+EIVNH++QAWE ATGKK+H
Sbjct: 254 RDGGPFVFAQVKHGVGVQEIVNHVLQAWEAATGKKQH 287

BLAST of Cp4.1LG20g05000 vs. TAIR10
Match: AT2G34470.2 (AT2G34470.2 urease accessory protein G)

HSP 1 Score: 477.6 bits (1228), Expect = 5.2e-135
Identity = 236/274 (86.13%), Positives = 254/274 (92.70%), Query Frame = 1

Query: 8   SHSHDHHHDHDHGHDHHHSHEKPQGDSSFVGADGRVYHSHDGLAPHSHEPIYSPGFFSRR 67
           SH H HHH HDH HDH    +  +G +S+VG DG+VYHSHDGLAPHSHEPIYSPG+FSRR
Sbjct: 3   SHDHHHHH-HDHEHDHDRKSDGGEGKASWVGKDGKVYHSHDGLAPHSHEPIYSPGYFSRR 62

Query: 68  APPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTKEDGEFLV 127
           APPL  R+F+ERAFTVGIGGPVGTGKTALMLALC FLRDKYSLAAVTNDIFTKEDGEFLV
Sbjct: 63  APPLHDRNFSERAFTVGIGGPVGTGKTALMLALCRFLRDKYSLAAVTNDIFTKEDGEFLV 122

Query: 128 KHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGDNLAANFS 187
           K+GALPEERIRAVETGGCPHAAIREDISINLGPLEELSNL+K D+LLCESGGDNLAANFS
Sbjct: 123 KNGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLFKADLLLCESGGDNLAANFS 182

Query: 188 RELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVMERDALKMRD 247
           RELADYIIYIIDVS GDKIPRKGGPGITQADLLVINKTDLA AVGADL+VMERD+L+MRD
Sbjct: 183 RELADYIIYIIDVSAGDKIPRKGGPGITQADLLVINKTDLAAAVGADLSVMERDSLRMRD 242

Query: 248 GGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKR 282
           GGPFVFAQVKHG+GVEEIVNH++ +WE ATGKKR
Sbjct: 243 GGPFVFAQVKHGLGVEEIVNHVMHSWEHATGKKR 275

BLAST of Cp4.1LG20g05000 vs. NCBI nr
Match: gi|659068374|ref|XP_008444095.1| (PREDICTED: urease accessory protein G [Cucumis melo])

HSP 1 Score: 543.1 bits (1398), Expect = 2.8e-151
Identity = 272/285 (95.44%), Positives = 276/285 (96.84%), Query Frame = 1

Query: 1   MASQDTHSH--SHDHHHDHD-HGHDHHHSHEKPQGDSSFVGADGRVYHSHDGLAPHSHEP 60
           MASQDTH H   HDHHH HD H H HHH+HEKP+GDSSFVGADGRVYHSHDGLAPHSHEP
Sbjct: 1   MASQDTHHHHHEHDHHHHHDEHDHHHHHTHEKPKGDSSFVGADGRVYHSHDGLAPHSHEP 60

Query: 61  IYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDI 120
           IYSPGFF+RRAPPLLTR+F ERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDI
Sbjct: 61  IYSPGFFTRRAPPLLTRNFNERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDI 120

Query: 121 FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES 180
           FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES
Sbjct: 121 FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES 180

Query: 181 GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAV 240
           GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAV
Sbjct: 181 GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAV 240

Query: 241 MERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 283
           MERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH
Sbjct: 241 MERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 285

BLAST of Cp4.1LG20g05000 vs. NCBI nr
Match: gi|778661023|ref|XP_011657367.1| (PREDICTED: urease accessory protein G [Cucumis sativus])

HSP 1 Score: 533.9 bits (1374), Expect = 1.7e-148
Identity = 266/282 (94.33%), Positives = 270/282 (95.74%), Query Frame = 1

Query: 1   MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQGDSSFVGADGRVYHSHDGLAPHSHEPIYS 60
           MASQDTH     HHH H  GHDHHH+HEKP+GDSSFVGADGRVYHSHDGLAPHSHEPIYS
Sbjct: 1   MASQDTH-----HHHHHHDGHDHHHTHEKPKGDSSFVGADGRVYHSHDGLAPHSHEPIYS 60

Query: 61  PGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTK 120
           PGFF+RRAPPLLTR+F ERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTK
Sbjct: 61  PGFFTRRAPPLLTRNFNERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTK 120

Query: 121 EDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGD 180
           EDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGD
Sbjct: 121 EDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGD 180

Query: 181 NLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVMER 240
           NLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLA AVGADLAVMER
Sbjct: 181 NLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLATAVGADLAVMER 240

Query: 241 DALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 283
           DALKMRDGGPFVFAQVKHGVGV EIVNHIIQAWE ATGKKRH
Sbjct: 241 DALKMRDGGPFVFAQVKHGVGVGEIVNHIIQAWEAATGKKRH 277

BLAST of Cp4.1LG20g05000 vs. NCBI nr
Match: gi|590622651|ref|XP_007025107.1| (Urease accessory protein G [Theobroma cacao])

HSP 1 Score: 506.9 bits (1304), Expect = 2.3e-140
Identity = 256/284 (90.14%), Positives = 261/284 (91.90%), Query Frame = 1

Query: 1   MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQGD---SSFVGADGRVYHSHDGLAPHSHEP 60
           MAS D H H H HHHDHDH HDHHH H     D   +S+VGADGRVYHSHDGLAPHSHEP
Sbjct: 27  MASNDHHVHDH-HHHDHDHDHDHHHHHHDHDHDKSTTSWVGADGRVYHSHDGLAPHSHEP 86

Query: 61  IYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDI 120
           IYSPGFFSRRAPPL  RDF ERAFTVGIGGPVGTGKTALMLALC FLRDKYSLAAVTNDI
Sbjct: 87  IYSPGFFSRRAPPLGNRDFNERAFTVGIGGPVGTGKTALMLALCKFLRDKYSLAAVTNDI 146

Query: 121 FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES 180
           FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES
Sbjct: 147 FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES 206

Query: 181 GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAV 240
           GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLA AVGADL V
Sbjct: 207 GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAPAVGADLGV 266

Query: 241 MERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKR 282
           MERDAL+MRDGGPFVFAQVKHG GVE+IVNHI+QAWE ATGKKR
Sbjct: 267 MERDALRMRDGGPFVFAQVKHGHGVEDIVNHILQAWEAATGKKR 309

BLAST of Cp4.1LG20g05000 vs. NCBI nr
Match: gi|703154657|ref|XP_010110999.1| (hypothetical protein L484_021695 [Morus notabilis])

HSP 1 Score: 501.5 bits (1290), Expect = 9.5e-139
Identity = 252/285 (88.42%), Positives = 262/285 (91.93%), Query Frame = 1

Query: 1   MASQDTHSHSHDHHHDHDHGHDHHHSHEKPQGDS---SFVGADGRVYHSHDGLAPHSHEP 60
           MAS D H H HDHHH HDH HDHH       GDS   ++VG DGRVYHSHDGLAPHSHEP
Sbjct: 1   MASDD-HHHHHDHHHHHDHDHDHH-------GDSRTGAWVGPDGRVYHSHDGLAPHSHEP 60

Query: 61  IYSPGFFSRRAPPLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDI 120
           IYSPGFFS+RAPPLLTRDF ERAFT+GIGGPVGTGKTALMLALC FLRDKYSLAAVTNDI
Sbjct: 61  IYSPGFFSKRAPPLLTRDFNERAFTIGIGGPVGTGKTALMLALCKFLRDKYSLAAVTNDI 120

Query: 121 FTKEDGEFLVKHGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCES 180
           FTKEDGEFLVK+GALPEERIRAVETGGCPHAAIREDISINLGPLEELSNL+K DILLCES
Sbjct: 121 FTKEDGEFLVKNGALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLFKADILLCES 180

Query: 181 GGDNLAANFSRELADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAV 240
           GGDNLAANFSRELADYIIYIIDVS GDKIPRKGGPGITQADLLVINKTDLA AVGADLAV
Sbjct: 181 GGDNLAANFSRELADYIIYIIDVSAGDKIPRKGGPGITQADLLVINKTDLAPAVGADLAV 240

Query: 241 MERDALKMRDGGPFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 283
           MERDAL+MRDGGPFVFAQVKHGVG+EEIVNH++QAWE ATGKKRH
Sbjct: 241 MERDALRMRDGGPFVFAQVKHGVGIEEIVNHVLQAWEAATGKKRH 277

BLAST of Cp4.1LG20g05000 vs. NCBI nr
Match: gi|255557339|ref|XP_002519700.1| (PREDICTED: urease accessory protein G isoform X1 [Ricinus communis])

HSP 1 Score: 500.7 bits (1288), Expect = 1.6e-138
Identity = 245/273 (89.74%), Positives = 259/273 (94.87%), Query Frame = 1

Query: 10  SHDHHHDHDHGHDHHHSHEKPQGDSSFVGADGRVYHSHDGLAPHSHEPIYSPGFFSRRAP 69
           SHDHHH HDH HDH H H  P   +S++GADGRVYHSHDGLAPHSHEPIYSPGFFSRRAP
Sbjct: 4   SHDHHHTHDHEHDHQH-HRHPNEKTSWLGADGRVYHSHDGLAPHSHEPIYSPGFFSRRAP 63

Query: 70  PLLTRDFTERAFTVGIGGPVGTGKTALMLALCTFLRDKYSLAAVTNDIFTKEDGEFLVKH 129
           P+LTRDF+ERAFTVGIGGPVGTGKTALMLA+C FLRDKYSLAAVTNDIFTKEDGEFL+K+
Sbjct: 64  PILTRDFSERAFTVGIGGPVGTGKTALMLAICKFLRDKYSLAAVTNDIFTKEDGEFLIKN 123

Query: 130 GALPEERIRAVETGGCPHAAIREDISINLGPLEELSNLYKTDILLCESGGDNLAANFSRE 189
           GALPEERIRAVETGGCPHAAIREDISINLGPLEELS L+KTDILLCESGGDNLAANFSRE
Sbjct: 124 GALPEERIRAVETGGCPHAAIREDISINLGPLEELSKLFKTDILLCESGGDNLAANFSRE 183

Query: 190 LADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLAQAVGADLAVMERDALKMRDGG 249
           LADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLA AVGADL VMERDA++MRDGG
Sbjct: 184 LADYIIYIIDVSGGDKIPRKGGPGITQADLLVINKTDLASAVGADLTVMERDAVRMRDGG 243

Query: 250 PFVFAQVKHGVGVEEIVNHIIQAWEGATGKKRH 283
           PFVFAQVKHGVGVEEIVNH++QAWE ATGKK+H
Sbjct: 244 PFVFAQVKHGVGVEEIVNHVLQAWEVATGKKQH 275

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name  E-value   Identity (%)  Description
UREG_ORYSI  2.9e-135  85.96         Urease accessory protein G OS=Oryza sativa subsp. indica GN=UREG PE=2 SV=1
UREG_ARATH  1.9e-134  86.55         Urease accessory protein G OS=Arabidopsis thaliana GN=UREG PE=2 SV=1
UREG_ORYSJ  2.4e-134  85.61         Urease accessory protein G OS=Oryza sativa subsp. japonica GN=UREG PE=2 SV=1
UREG_SCHPO  2.9e-87   60.64         Uncharacterized urease accessory protein ureG-like OS=Schizosaccharomyces pombe ...
UREG_ANADF  7.9e-85   66.81         Urease accessory protein UreG OS=Anaeromyxobacter sp. (strain Fw109-5) GN=ureG P...

vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LUY7_CUCSA  1.2e-148  94.33         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G447410 PE=3 SV=1
A0A061GKY2_THECC  1.6e-140  90.14         Urease accessory protein G OS=Theobroma cacao GN=TCM_029508 PE=3 SV=1
W9SHA3_9ROSA      6.6e-139  88.42         Uncharacterized protein OS=Morus notabilis GN=L484_021695 PE=3 SV=1
B9S131_RICCO      1.1e-138  89.74         Urease accessory protein ureG, putative OS=Ricinus communis GN=RCOM_0633220 PE=3...
A0A067L1H1_JATCU  1.9e-138  89.17         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04602 PE=3 SV=1

vs. TAIR10
Match Name   E-value   Identity (%)  Description
AT2G34470.2  5.2e-135  86.13         urease accessory protein G

vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659068374|ref|XP_008444095.1|  2.8e-151  95.44         PREDICTED: urease accessory protein G [Cucumis melo]
gi|778661023|ref|XP_011657367.1|  1.7e-148  94.33         PREDICTED: urease accessory protein G [Cucumis sativus]
gi|590622651|ref|XP_007025107.1|  2.3e-140  90.14         Urease accessory protein G [Theobroma cacao]
gi|703154657|ref|XP_010110999.1|  9.5e-139  88.42         hypothetical protein L484_021695 [Morus notabilis]
gi|255557339|ref|XP_002519700.1|  1.6e-138  89.74         PREDICTED: urease accessory protein G isoform X1 [Ricinus communis]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0016151  nickel cation binding
GO:0003924  GTPase activity
Vocabulary: Biological Process
Term        Definition
GO:0006807  nitrogen compound metabolic process
Vocabulary: INTERPRO
Term       Definition
IPR027417  P-loop_NTPase
IPR004400  UreG
IPR003495  CobW/HypB/UreG_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006807 nitrogen compound metabolic process
biological_process GO:0043085 positive regulation of catalytic activity
cellular_component GO:0016020 membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003924 GTPase activity
molecular_function GO:0016151 nickel cation binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g05000.1  Cp4.1LG20g05000.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                      Source    Source Term        Source Description                                    Alignment
IPR003495  CobW/HypB/UreG, nucleotide-binding domain            PFAM      PF02492            cobW                                                  coord: 83..252; score: 2.9
IPR004400  Urease accessory protein UreG                        HAMAP     MF_01389           UreG                                                  coord: 79..276; score: 42
IPR004400  Urease accessory protein UreG                        PANTHER   PTHR31715          FAMILY NOT NAMED                                      coord: 14..281; score: 1.1E
IPR004400  Urease accessory protein UreG                        TIGRFAMs  TIGR00101          TIGR00101                                             coord: 82..270; score: 2.2
IPR027417  P-loop containing nucleoside triphosphate hydrolase  GENE3D    G3DSA:3.40.50.300  -                                                     coord: 79..271; score: 6.4
IPR027417  P-loop containing nucleoside triphosphate hydrolase  unknown   SSF52540           P-loop containing nucleoside triphosphate hydrolases  coord: 75..274; score: 4.85

The following gene(s) are paralogous to this gene:

None