Clc02G15970 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc02G15970
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: urease accessory protein F
Location: ClcChr02: 28496917 .. 28498467 (+)
RNA-Seq Expression: Clc02G15970
Synteny: Clc02G15970
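The Location field gives a start and an end coordinate on ClcChr02. As a minimal sketch, assuming these coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly), the genomic span of the gene can be computed as follows. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_span = span_length(28496917, 28498467)
print(gene_span)  # 1551
```

Under a 0-based, half-open convention the same coordinates would give 1550, so it is worth confirming the convention before comparing this number against a sequence length.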
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATTATTTCTTATTCTTTTTCTTTTCTGAATGAAAGTCAAAATTCTCATATAAAAAAGAAACTATTATGGATTCAGATTGTTGTGACCTCGAGTAGAGATGTGTAGTTTATTGATTTTTGCATTTCTTGGTGAACCCTTTGTATATGCATACCATCCAGAGGCAATATTCCCTGATAAAAATCTTATTGACAGTGTGAACATGTTTAGTTATTGTCACTCGTTAGATAATTTTTAAGCTGGCCACTTAAAACTCAAAGTTGTTTTATCCATTTTCTAGATAGGTATCTTTAATGCAGTTGTGTCTTCATGTTTAAGTTTTGCAGAATATATGCAATAGTCCAAACTTGACATGGACGACAAACATTTTCACTGGAGCCAGTGGCAATTGCTCGATTCTATCCTTCCTACTGGTGGCTTTGCCCATTCGTTCGGTCTCGAGGCTGCAATACAAGCTCGCGTTGTCTCACATCAGGAAGATCTGAAAACTTTTGTGATCCATTTATTGGACAACACAGGAAGTTTGCTTCTTCCCTTTGTGCATTCAGCCACACTATCACCTGATTTAGAAACTTGGAAGAAAAATGACAGATTATTGGATGCTCTGTTGATTAATGAAGTTAGTCGAAAGGCATCAGTTACGCAAGGGTCAGCGCTTCTGAGAGTTGCAGCAATAGTATTTTCTGAAATCTCATATCTGAAAACCATGAGGGAATCCTTGTGTGGGACCAGGGCTGTTTCGTTCCACCACGCTCCCATCTTCGGGCTAATCTGCGGTCTACTTGGATGGGATAGCACGATGTCACAGAGGGCATATCTGTTTATTACCATGAGAGATGTTATTTCTGCTGCAACAAGATTGAATTTAGTAGGACCCTTGGGTGCAGCTGTGTTGCAGCATCAGCTTGCGCTTGTAGCTGAAGACATACTGAAAAAATGGATGAACCGTCCTGTTGAAGAAGCTTGCCAAACGGTTCCTCTGTTAGAGACAGTACAAGGATGCCATATGTACCTGTCTTCTAGACTGTTTTGTTCTTGAAAAGTTTTGAAGTGTTAAACTGTTTTCCGGTGATTTTGGGTTATAGGAGTCCCCTTCTTTCCTTGTCTGTGCTTGAAACTAGTTGTGGATTATGAGATAGTAGTGCCACTTTAGAAATTCTCCTACTTACATATATGTAGCCTTCAACCCTCCGAGTATATGCTTCAACCATGCAGATAAAGGATATTACATAGTTTGTTGTAGGGATTGTCGATTTACCCATTAGTGACTTCAGCACGAGACATTTCAAAATTTGGATACTATTAGGTCGAGTGAATTCAATAATAGAGGCTTGTTGGCGTTCCAACATTCCTTCTTTCATGGCATACCCGAAAGAAAATCTAGTGTTTCTTAGTCATGAATATATGAATGTCATGCCATGATTTTTCCTGGCTCATATATACCATAATTTTGTCTCTTGTTTTTAAGGATCTTAAATATTCTAATACTTTGCTAGTCTATGGTCATGATTCAAAATATCAAGTGAACTTAAATGAACTTATACATACTTTGAA

mRNA sequence

ATTATTTCTTATTCTTTTTCTTTTCTGAATGAAAGTCAAAATTCTCATATAAAAAAGAAACTATTATGGATTCAGATTGTTGTGACCTCGAGTAGAGATGTGTAGTTTATTGATTTTTGCATTTCTTGGTGAACCCTTTGTATATGCATACCATCCAGAGGCAATATTCCCTGATAAAAATCTTATTGACAGTGTGAACATGTTTAGTTATTGTCACTCGTTAGATAATTTTTAAGCTGGCCACTTAAAACTCAAAGTTGTTTTATCCATTTTCTAGATAGGTATCTTTAATGCAGTTGTGTCTTCATGTTTAAGTTTTGCAGAATATATGCAATAGTCCAAACTTGACATGGACGACAAACATTTTCACTGGAGCCAGTGGCAATTGCTCGATTCTATCCTTCCTACTGGTGGCTTTGCCCATTCGTTCGGTCTCGAGGCTGCAATACAAGCTCGCGTTGTCTCACATCAGGAAGATCTGAAAACTTTTGTGATCCATTTATTGGACAACACAGGAAGTTTGCTTCTTCCCTTTGTGCATTCAGCCACACTATCACCTGATTTAGAAACTTGGAAGAAAAATGACAGATTATTGGATGCTCTGTTGATTAATGAAGTTAGTCGAAAGGCATCAGTTACGCAAGGGTCAGCGCTTCTGAGAGTTGCAGCAATAGTATTTTCTGAAATCTCATATCTGAAAACCATGAGGGAATCCTTGTGTGGGACCAGGGCTGTTTCGTTCCACCACGCTCCCATCTTCGGGCTAATCTGCGGTCTACTTGGATGGGATAGCACGATGTCACAGAGGGCATATCTGTTTATTACCATGAGAGATGTTATTTCTGCTGCAACAAGATTGAATTTAGTAGGACCCTTGGGTGCAGCTGTGTTGCAGCATCAGCTTGCGCTTGTAGCTGAAGACATACTGAAAAAATGGATGAACCGTCCTGTTGAAGAAGCTTGCCAAACGGTTCCTCTGTTAGAGACAGTACAAGGATGCCATATGTACCTGTCTTCTAGACTGTTTTGTTCTTGAAAAGTTTTGAAGTGTTAAACTGTTTTCCGGTGATTTTGGGTTATAGGAGTCCCCTTCTTTCCTTGTCTGTGCTTGAAACTAGTTGTGGATTATGAGATAGTAGTGCCACTTTAGAAATTCTCCTACTTACATATATGTAGCCTTCAACCCTCCGAGTATATGCTTCAACCATGCAGATAAAGGATATTACATAGTTTGTTGTAGGGATTGTCGATTTACCCATTAGTGACTTCAGCACGAGACATTTCAAAATTTGGATACTATTAGGTCGAGTGAATTCAATAATAGAGGCTTGTTGGCGTTCCAACATTCCTTCTTTCATGGCATACCCGAAAGAAAATCTAGTGTTTCTTAGTCATGAATATATGAATGTCATGCCATGATTTTTCCTGGCTCATATATACCATAATTTTGTCTCTTGTTTTTAAGGATCTTAAATATTCTAATACTTTGCTAGTCTATGGTCATGATTCAAAATATCAAGTGAACTTAAATGAACTTATACATACTTTGAA

Coding sequence (CDS)

ATGGACGACAAACATTTTCACTGGAGCCAGTGGCAATTGCTCGATTCTATCCTTCCTACTGGTGGCTTTGCCCATTCGTTCGGTCTCGAGGCTGCAATACAAGCTCGCGTTGTCTCACATCAGGAAGATCTGAAAACTTTTGTGATCCATTTATTGGACAACACAGGAAGTTTGCTTCTTCCCTTTGTGCATTCAGCCACACTATCACCTGATTTAGAAACTTGGAAGAAAAATGACAGATTATTGGATGCTCTGTTGATTAATGAAGTTAGTCGAAAGGCATCAGTTACGCAAGGGTCAGCGCTTCTGAGAGTTGCAGCAATAGTATTTTCTGAAATCTCATATCTGAAAACCATGAGGGAATCCTTGTGTGGGACCAGGGCTGTTTCGTTCCACCACGCTCCCATCTTCGGGCTAATCTGCGGTCTACTTGGATGGGATAGCACGATGTCACAGAGGGCATATCTGTTTATTACCATGAGAGATGTTATTTCTGCTGCAACAAGATTGAATTTAGTAGGACCCTTGGGTGCAGCTGTGTTGCAGCATCAGCTTGCGCTTGTAGCTGAAGACATACTGAAAAAATGGATGAACCGTCCTGTTGAAGAAGCTTGCCAAACGGTTCCTCTGTTAGAGACAGTACAAGGATGCCATATGTACCTGTCTTCTAGACTGTTTTGTTCTTGA

Protein sequence

MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLLPFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMRESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAVLQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS
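The protein sequence above should be the conceptual translation of the coding sequence. The sketch below shows how this can be checked, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical, and only short fragments copied from the start and end of the CDS are used so that the example stays readable.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 18 bases of the CDS listed in this record.
cds_start = "ATGGACGACAAACATTTTCACTGGAGCCAG"
cds_end = "AGACTGTTTTGTTCTTGA"

print(translate(cds_start))  # MDDKHFHWSQ
print(translate(cds_end))    # RLFCS*
```

Passing the full CDS string to `translate` and dropping the trailing `*` should reproduce the protein sequence shown above, if the standard-code assumption holds for this annotation.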
Homology
BLAST of Clc02G15970 vs. NCBI nr
Match: XP_038890480.1 (urease accessory protein F [Benincasa hispida])

HSP 1 Score: 431.0 bits (1107), Expect = 6.3e-117
Identity = 214/229 (93.45%), Positives = 221/229 (96.51%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQAR+VSHQEDLK +VIHLLDNTGSLLL
Sbjct: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARIVSHQEDLKNYVIHLLDNTGSLLL 60

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHSATLSPD ETWKKNDRLLD++L NEVSRKASVTQGSAL+RVAAIVFSEIS LK MR
Sbjct: 61  PFVHSATLSPDFETWKKNDRLLDSMLTNEVSRKASVTQGSALMRVAAIVFSEISSLKAMR 120

Query: 121 E-SLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAA 180
           E +L GT AVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAA
Sbjct: 121 ETTLSGTGAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAA 180

Query: 181 VLQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           VLQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYL+SRLFCS
Sbjct: 181 VLQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLNSRLFCS 229
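Each HSP reports identity and positives as counts over the alignment length, followed by a percentage. The following sketch parses such a statistics line and recomputes the percentages from the counts. The function name `parse_hsp_stats` and the returned keys are hypothetical, and the pattern is an assumption based only on the line layout used in this report (it also tolerates the "Postives" spelling that some report renderers emit).

```python
import re

# Matches e.g. "Identity = 214/229 (93.45%), Positives = 221/229 (96.51%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 214/229 (93.45%), Positives = 221/229 (96.51%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 93.45 96.51
```

Note that the alignment length (229 here) counts gap columns, so it can exceed the number of residues in the query.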

BLAST of Clc02G15970 vs. NCBI nr
Match: XP_022946416.1 (urease accessory protein F [Cucurbita moschata])

HSP 1 Score: 417.9 bits (1073), Expect = 5.5e-113
Identity = 207/228 (90.79%), Positives = 214/228 (93.86%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDK FHWSQWQLLDSILPTGGFAHSFGLEAAIQAR+VS  EDLKTFVIHLLDNTGSLLL
Sbjct: 8   MDDKQFHWSQWQLLDSILPTGGFAHSFGLEAAIQARIVSQPEDLKTFVIHLLDNTGSLLL 67

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHS TLSPDLETW+KNDR L+A+L NEVSRKASVTQGSAL+RVAAIVFSEIS LK MR
Sbjct: 68  PFVHSTTLSPDLETWEKNDRFLNAILTNEVSRKASVTQGSALMRVAAIVFSEISSLKAMR 127

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E+  GT AVSFHHAPIFGLICGLLGWDSTMSQRAYLFIT+RDVISAATRLNLVGPLGAAV
Sbjct: 128 ETTTGTGAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITLRDVISAATRLNLVGPLGAAV 187

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           LQHQLALVAED LKKWMNRPVEEACQTVPLLETVQGCH YL SRLFCS
Sbjct: 188 LQHQLALVAEDTLKKWMNRPVEEACQTVPLLETVQGCHTYLFSRLFCS 235

BLAST of Clc02G15970 vs. NCBI nr
Match: XP_023545319.1 (urease accessory protein F [Cucurbita pepo subsp. pepo])

HSP 1 Score: 417.5 bits (1072), Expect = 7.2e-113
Identity = 207/228 (90.79%), Positives = 214/228 (93.86%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDK FHWSQWQLLDSILPTGGFAHSFGLEAAIQAR+VS  EDLKTFVIHLLDNTGSLLL
Sbjct: 14  MDDKQFHWSQWQLLDSILPTGGFAHSFGLEAAIQARIVSQPEDLKTFVIHLLDNTGSLLL 73

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHS TLSPDLETW+KNDR L+A+L NEVSRKASVTQGSAL+RVAAIVFSEIS LK MR
Sbjct: 74  PFVHSTTLSPDLETWEKNDRFLNAILTNEVSRKASVTQGSALMRVAAIVFSEISSLKAMR 133

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E+  GT AVSFHHAPIFGLICGLLGWDSTMSQRAYLFIT+RDVISAATRLNLVGPLGAAV
Sbjct: 134 ETSTGTGAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITLRDVISAATRLNLVGPLGAAV 193

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           LQHQLALVAED LKKWMNRPVEEACQTVPLLETVQGCH YL SRLFCS
Sbjct: 194 LQHQLALVAEDTLKKWMNRPVEEACQTVPLLETVQGCHTYLFSRLFCS 241

BLAST of Clc02G15970 vs. NCBI nr
Match: XP_022999571.1 (urease accessory protein F [Cucurbita maxima] >XP_022999572.1 urease accessory protein F [Cucurbita maxima])

HSP 1 Score: 414.5 bits (1064), Expect = 6.1e-112
Identity = 205/228 (89.91%), Positives = 213/228 (93.42%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDK FHWSQWQLLDSILPTGGFAHSFGLEAAIQAR+VS  EDLKTFVIHLLDNTGSLLL
Sbjct: 8   MDDKQFHWSQWQLLDSILPTGGFAHSFGLEAAIQARIVSQPEDLKTFVIHLLDNTGSLLL 67

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHS TLSPDLETW+KNDR L+A+L NEVSRKAS+TQGSAL+RVAAIVFSEIS LK MR
Sbjct: 68  PFVHSTTLSPDLETWEKNDRFLNAILTNEVSRKASITQGSALMRVAAIVFSEISSLKAMR 127

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E+  GT AVSFHHAPIFGLICGLLGWDSTMSQRAYLFIT+RDVISAATRLNLVGPLGAAV
Sbjct: 128 ETSTGTGAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITLRDVISAATRLNLVGPLGAAV 187

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           LQHQLALVAE  LKKWMNRPVEEACQTVPLLETVQGCH YL SRLFCS
Sbjct: 188 LQHQLALVAEGTLKKWMNRPVEEACQTVPLLETVQGCHTYLFSRLFCS 235

BLAST of Clc02G15970 vs. NCBI nr
Match: KAG7030122.1 (Urease accessory protein F, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 414.5 bits (1064), Expect = 6.1e-112
Identity = 205/228 (89.91%), Positives = 213/228 (93.42%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDK FHWSQWQLLDSILP+GGFAHSFGLEAAIQAR+VS  EDLKTFVIHLLDNTGSLLL
Sbjct: 1   MDDKQFHWSQWQLLDSILPSGGFAHSFGLEAAIQARIVSQPEDLKTFVIHLLDNTGSLLL 60

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHS TLSPDLETW+ NDR L+A+L NEVSRKASVTQGSAL+RVAAIVFSEIS LK MR
Sbjct: 61  PFVHSTTLSPDLETWENNDRFLNAILTNEVSRKASVTQGSALMRVAAIVFSEISSLKAMR 120

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E+  GT AVSFHHAPIFGLICGLLGWDSTMSQRAYLFIT+RDVISAATRLNLVGPLGAAV
Sbjct: 121 ETTTGTGAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITLRDVISAATRLNLVGPLGAAV 180

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           LQHQLALVAED LKKWMNRPVEEACQTVPLLETVQGCH YL SRLFCS
Sbjct: 181 LQHQLALVAEDTLKKWMNRPVEEACQTVPLLETVQGCHTYLFSRLFCS 228

BLAST of Clc02G15970 vs. ExPASy Swiss-Prot
Match: Q9XHZ3 (Urease accessory protein F OS=Arabidopsis thaliana OX=3702 GN=UREF PE=2 SV=1)

HSP 1 Score: 323.9 bits (829), Expect = 1.4e-87
Identity = 154/221 (69.68%), Positives = 187/221 (84.62%), Query Frame = 0

Query: 8   WSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLLPFVHSAT 67
           WSQWQLLDSILPTGGFAHSFGLEAAIQ R+VS  EDL+T +IH+LDNT SLLLPFV+SA 
Sbjct: 20  WSQWQLLDSILPTGGFAHSFGLEAAIQTRLVSSPEDLETHIIHVLDNTASLLLPFVYSAL 79

Query: 68  LSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMRESLCGTR 127
            SPD+ETW K D +L+A L N+VS KAS++QGSAL R+AA VF+E+  LK +R++  G++
Sbjct: 80  KSPDIETWHKLDGILNATLTNQVSSKASMSQGSALFRIAASVFTEVPNLKMIRDASLGSK 139

Query: 128 AVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAVLQHQLAL 187
            V FHHAPIFGL+CGLLG DS  SQRAYLF+T+RDV+SAATRLN+VGP+GA+V+QH++A+
Sbjct: 140 NVCFHHAPIFGLVCGLLGMDSETSQRAYLFVTLRDVLSAATRLNIVGPMGASVMQHRIAI 199

Query: 188 VAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           V E +L+KWMNR   EACQT PLL+ VQGCH YL SRLFCS
Sbjct: 200 VTETVLEKWMNREAGEACQTSPLLDVVQGCHGYLFSRLFCS 240

BLAST of Clc02G15970 vs. ExPASy Swiss-Prot
Match: E0ZS46 (Urease accessory protein F OS=Oryza sativa subsp. indica OX=39946 GN=UREF PE=2 SV=1)

HSP 1 Score: 314.3 bits (804), Expect = 1.1e-84
Identity = 151/226 (66.81%), Positives = 193/226 (85.40%), Query Frame = 0

Query: 3   DKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLLPF 62
           ++H  WSQWQLLDSILPTGGFAHS+GLEAA+Q+R+V++ E+L++FV+ +L+NTGSLLLPF
Sbjct: 36  NQHSLWSQWQLLDSILPTGGFAHSYGLEAAMQSRMVNNPEELRSFVVQVLENTGSLLLPF 95

Query: 63  VHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMRES 122
           V  A  SPD  TW K D+LL+A+L NEVSRKAS++QGSALLRVAA VF+EI  L+ +R++
Sbjct: 96  VCCANKSPDAATWVKLDQLLEAMLTNEVSRKASMSQGSALLRVAASVFTEIQSLQDLRQT 155

Query: 123 LCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAVLQ 182
             G++ VSFHHAPIFGLICGL+G+DS  +QRAY+F+TMRDVISAATRLNL+GPL A+VLQ
Sbjct: 156 FLGSKIVSFHHAPIFGLICGLVGFDSETTQRAYMFVTMRDVISAATRLNLIGPLAASVLQ 215

Query: 183 HQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           HQ+A  AE +++KW +R VEEA QT PLL+ +QGCH Y+ SRLFC+
Sbjct: 216 HQVAEDAERMVQKWKDRGVEEATQTSPLLDALQGCHAYMFSRLFCT 261

BLAST of Clc02G15970 vs. ExPASy Swiss-Prot
Match: Q0E3L5 (Urease accessory protein F OS=Oryza sativa subsp. japonica OX=39947 GN=UREF PE=2 SV=2)

HSP 1 Score: 314.3 bits (804), Expect = 1.1e-84
Identity = 151/226 (66.81%), Positives = 193/226 (85.40%), Query Frame = 0

Query: 3   DKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLLPF 62
           ++H  WSQWQLLDSILPTGGFAHS+GLEAA+Q+R+V++ E+L++FV+ +L+NTGSLLLPF
Sbjct: 36  NQHSLWSQWQLLDSILPTGGFAHSYGLEAAMQSRMVNNPEELRSFVVQVLENTGSLLLPF 95

Query: 63  VHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMRES 122
           V  A  SPD  TW K D+LL+A+L NEVSRKAS++QGSALLRVAA VF+EI  L+ +R++
Sbjct: 96  VCCANKSPDAATWVKLDQLLEAMLTNEVSRKASMSQGSALLRVAASVFTEIQSLQDLRQT 155

Query: 123 LCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAVLQ 182
             G++ VSFHHAPIFGLICGL+G+DS  +QRAY+F+TMRDVISAATRLNL+GPL A+VLQ
Sbjct: 156 FLGSKIVSFHHAPIFGLICGLVGFDSETTQRAYMFVTMRDVISAATRLNLIGPLAASVLQ 215

Query: 183 HQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           HQ+A  AE +++KW +R VEEA QT PLL+ +QGCH Y+ SRLFC+
Sbjct: 216 HQVAEDAERMVQKWKDRGVEEATQTSPLLDALQGCHAYMFSRLFCT 261

BLAST of Clc02G15970 vs. ExPASy Swiss-Prot
Match: B7K911 (Urease accessory protein UreF OS=Gloeothece citriformis (strain PCC 7424) OX=65393 GN=ureF PE=3 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 1.6e-14
Identity = 65/221 (29.41%), Positives = 102/221 (46.15%), Query Frame = 0

Query: 12  QLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDN----TGSLLLPFVHSAT 71
           QL DS  P+G F  S GLEA  Q   +   +DL+ F+  LL N    T  + L   +  +
Sbjct: 14  QLSDSFFPSGSFTLSHGLEALAQGEQIHSIKDLEIFLQILLHNKLGTTDLVALIHAYRGS 73

Query: 72  LSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMRESLCGTR 131
              DL+  ++ D  L A  + E +R+     G ALL VA   + +       +E+  GT 
Sbjct: 74  KREDLDAVREADHQLFAQTLIEKNREMGRKSGRALLMVARETWQDTQLETLEKETAKGT- 133

Query: 132 AVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAVLQHQLAL 191
            ++  H  IF ++  + G   T +  A+L   +  ++ AA RLN++G L A  +  +LA 
Sbjct: 134 -INGLHPIIFAVVGRVAGLSETDTGLAFLHSFITGLLGAAIRLNIIGHLQAQKILLKLAP 193

Query: 192 VAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
             E   +K     + E     PL++  Q  H  L  RLF +
Sbjct: 194 DLETTYQKATEINLNEMFSCTPLIDIAQMNHQNLDYRLFAN 232

BLAST of Clc02G15970 vs. ExPASy Swiss-Prot
Match: Q117Z4 (Urease accessory protein UreF OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=ureF PE=3 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 3.9e-13
Identity = 60/222 (27.03%), Positives = 106/222 (47.75%), Query Frame = 0

Query: 12  QLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSL--LLPFVHS--AT 71
           QL DS  PTG F  S GLE  +Q   +  Q ++  F+  LL N   +  ++  +HS    
Sbjct: 14  QLSDSFFPTGSFTFSHGLETLVQTGKIQSQPEILYFLQILLRNKVGVSDVVALIHSYRGC 73

Query: 72  LSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR-ESLCGT 131
            + D+E  +  DR+L        +R+     G ALL VA+  + +   LKT+  +++ G 
Sbjct: 74  KNGDIEAVRVADRMLFVQTAIAKNRETQRQSGRALLMVASSTWQD-ERLKTLNIDAVSGN 133

Query: 132 RAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAVLQHQLA 191
             +   H  IFG +  ++G D   +  A+L   + +++ AA RL ++G + A  +  QL 
Sbjct: 134 --IHCLHPVIFGAVTSVVGLDERNAVLAFLHGLVTNILGAAIRLGILGHIQAQQILLQLV 193

Query: 192 LVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
              E +    ++  +E+     P ++  Q  H  L+ +LF +
Sbjct: 194 PDIEAVWSTAVSMNLEQMWSCTPFIDIAQMQHPKLAHKLFAN 232

BLAST of Clc02G15970 vs. ExPASy TrEMBL
Match: A0A6J1G3M3 (urease accessory protein F OS=Cucurbita moschata OX=3662 GN=LOC111450480 PE=4 SV=1)

HSP 1 Score: 417.9 bits (1073), Expect = 2.7e-113
Identity = 207/228 (90.79%), Positives = 214/228 (93.86%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDK FHWSQWQLLDSILPTGGFAHSFGLEAAIQAR+VS  EDLKTFVIHLLDNTGSLLL
Sbjct: 8   MDDKQFHWSQWQLLDSILPTGGFAHSFGLEAAIQARIVSQPEDLKTFVIHLLDNTGSLLL 67

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHS TLSPDLETW+KNDR L+A+L NEVSRKASVTQGSAL+RVAAIVFSEIS LK MR
Sbjct: 68  PFVHSTTLSPDLETWEKNDRFLNAILTNEVSRKASVTQGSALMRVAAIVFSEISSLKAMR 127

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E+  GT AVSFHHAPIFGLICGLLGWDSTMSQRAYLFIT+RDVISAATRLNLVGPLGAAV
Sbjct: 128 ETTTGTGAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITLRDVISAATRLNLVGPLGAAV 187

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           LQHQLALVAED LKKWMNRPVEEACQTVPLLETVQGCH YL SRLFCS
Sbjct: 188 LQHQLALVAEDTLKKWMNRPVEEACQTVPLLETVQGCHTYLFSRLFCS 235

BLAST of Clc02G15970 vs. ExPASy TrEMBL
Match: A0A6J1KB78 (urease accessory protein F OS=Cucurbita maxima OX=3661 GN=LOC111493898 PE=4 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 2.9e-112
Identity = 205/228 (89.91%), Positives = 213/228 (93.42%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDK FHWSQWQLLDSILPTGGFAHSFGLEAAIQAR+VS  EDLKTFVIHLLDNTGSLLL
Sbjct: 8   MDDKQFHWSQWQLLDSILPTGGFAHSFGLEAAIQARIVSQPEDLKTFVIHLLDNTGSLLL 67

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHS TLSPDLETW+KNDR L+A+L NEVSRKAS+TQGSAL+RVAAIVFSEIS LK MR
Sbjct: 68  PFVHSTTLSPDLETWEKNDRFLNAILTNEVSRKASITQGSALMRVAAIVFSEISSLKAMR 127

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E+  GT AVSFHHAPIFGLICGLLGWDSTMSQRAYLFIT+RDVISAATRLNLVGPLGAAV
Sbjct: 128 ETSTGTGAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITLRDVISAATRLNLVGPLGAAV 187

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           LQHQLALVAE  LKKWMNRPVEEACQTVPLLETVQGCH YL SRLFCS
Sbjct: 188 LQHQLALVAEGTLKKWMNRPVEEACQTVPLLETVQGCHTYLFSRLFCS 235

BLAST of Clc02G15970 vs. ExPASy TrEMBL
Match: A0A6J1DLZ1 (urease accessory protein F OS=Momordica charantia OX=3673 GN=LOC111021278 PE=4 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 6.6e-112
Identity = 202/228 (88.60%), Positives = 213/228 (93.42%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MD KH  W+QWQLLDS+LPTGGFAHSFGLEAAIQAR+VS  EDLKTFVIH+LDNTGSLLL
Sbjct: 1   MDGKHLQWNQWQLLDSVLPTGGFAHSFGLEAAIQARIVSQPEDLKTFVIHVLDNTGSLLL 60

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHSATLSPDLETW+KNDRLLDA+L NEV RKAS+TQGSAL+RVAA+VFSEI  LKTMR
Sbjct: 61  PFVHSATLSPDLETWQKNDRLLDAMLTNEVGRKASITQGSALMRVAAVVFSEIPSLKTMR 120

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E+  GT AV FHHAPIFGLICGLLGWDS MSQRAYLFITMRDVISAATRLNLVGPLGAAV
Sbjct: 121 ETFSGTGAVPFHHAPIFGLICGLLGWDSAMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           LQHQLALVAEDILKKWMNRPVEEACQ+VPLLETVQGCHMYL SRLFCS
Sbjct: 181 LQHQLALVAEDILKKWMNRPVEEACQSVPLLETVQGCHMYLFSRLFCS 228

BLAST of Clc02G15970 vs. ExPASy TrEMBL
Match: A0A0A0LJ08 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G033980 PE=4 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 3.1e-109
Identity = 199/228 (87.28%), Positives = 210/228 (92.11%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDKH HWSQWQLLDSILPTGGFAHSFGLEAAIQA++VS  +DLKTFVIHLLDNTGSL L
Sbjct: 1   MDDKHCHWSQWQLLDSILPTGGFAHSFGLEAAIQAQIVSSPDDLKTFVIHLLDNTGSLFL 60

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFVHSAT SPD ETWKKND LLDA+L NEVSRKASVTQGSAL+RV+AIVFSEI  LK MR
Sbjct: 61  PFVHSATQSPDFETWKKNDMLLDAMLTNEVSRKASVTQGSALMRVSAIVFSEIPSLKAMR 120

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E+L GT AVSFHHAPIFGLICGLLGWD TMSQRAYLFIT+RDVISAATRLNLVGPLGAAV
Sbjct: 121 ENLYGTGAVSFHHAPIFGLICGLLGWDGTMSQRAYLFITLRDVISAATRLNLVGPLGAAV 180

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           LQHQLA VAEDILK+WMNRPVEEACQTVPLLETVQGCH  L S++FCS
Sbjct: 181 LQHQLAFVAEDILKRWMNRPVEEACQTVPLLETVQGCHSCLFSKMFCS 228

BLAST of Clc02G15970 vs. ExPASy TrEMBL
Match: A0A5D3B9D0 (Lon protease-like protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold546G001170 PE=3 SV=1)

HSP 1 Score: 381.3 bits (978), Expect = 2.8e-102
Identity = 189/214 (88.32%), Positives = 199/214 (92.99%), Query Frame = 0

Query: 1   MDDKHFHWSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLL 60
           MDDKH HWSQWQLLDSILPTGGFAHSFGLEAAIQA++VS  EDLKTFVIHLLDNTGSL L
Sbjct: 1   MDDKHCHWSQWQLLDSILPTGGFAHSFGLEAAIQAQIVSSPEDLKTFVIHLLDNTGSLFL 60

Query: 61  PFVHSATLSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMR 120
           PFV+SATLSPD ETWKKNDRLLDA+L NEVSRKASVTQGSAL+RV+AIVFSEIS LK MR
Sbjct: 61  PFVYSATLSPDFETWKKNDRLLDAILTNEVSRKASVTQGSALMRVSAIVFSEISSLKAMR 120

Query: 121 ESLCGTRAVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAV 180
           E L GT  VSFHHAPIFGLICGLLGWD TMSQRAYLFIT+RDVISAATRLNLVGPLGAAV
Sbjct: 121 EHLYGTGTVSFHHAPIFGLICGLLGWDGTMSQRAYLFITLRDVISAATRLNLVGPLGAAV 180

Query: 181 LQHQLALVAEDILKKWMNRPVEEACQTVPLLETV 215
           LQHQLA VAE++LK+WMNRPVEEACQ VPLLETV
Sbjct: 181 LQHQLAQVAENVLKRWMNRPVEEACQAVPLLETV 214

BLAST of Clc02G15970 vs. TAIR 10
Match: AT1G21840.1 (urease accessory protein F )

HSP 1 Score: 323.9 bits (829), Expect = 1.0e-88
Identity = 154/221 (69.68%), Positives = 187/221 (84.62%), Query Frame = 0

Query: 8   WSQWQLLDSILPTGGFAHSFGLEAAIQARVVSHQEDLKTFVIHLLDNTGSLLLPFVHSAT 67
           WSQWQLLDSILPTGGFAHSFGLEAAIQ R+VS  EDL+T +IH+LDNT SLLLPFV+SA 
Sbjct: 20  WSQWQLLDSILPTGGFAHSFGLEAAIQTRLVSSPEDLETHIIHVLDNTASLLLPFVYSAL 79

Query: 68  LSPDLETWKKNDRLLDALLINEVSRKASVTQGSALLRVAAIVFSEISYLKTMRESLCGTR 127
            SPD+ETW K D +L+A L N+VS KAS++QGSAL R+AA VF+E+  LK +R++  G++
Sbjct: 80  KSPDIETWHKLDGILNATLTNQVSSKASMSQGSALFRIAASVFTEVPNLKMIRDASLGSK 139

Query: 128 AVSFHHAPIFGLICGLLGWDSTMSQRAYLFITMRDVISAATRLNLVGPLGAAVLQHQLAL 187
            V FHHAPIFGL+CGLLG DS  SQRAYLF+T+RDV+SAATRLN+VGP+GA+V+QH++A+
Sbjct: 140 NVCFHHAPIFGLVCGLLGMDSETSQRAYLFVTLRDVLSAATRLNIVGPMGASVMQHRIAI 199

Query: 188 VAEDILKKWMNRPVEEACQTVPLLETVQGCHMYLSSRLFCS 229
           V E +L+KWMNR   EACQT PLL+ VQGCH YL SRLFCS
Sbjct: 200 VTETVLEKWMNREAGEACQTSPLLDVVQGCHGYLFSRLFCS 240

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038890480.1 | 6.3e-117 | 93.45 | urease accessory protein F [Benincasa hispida]
XP_022946416.1 | 5.5e-113 | 90.79 | urease accessory protein F [Cucurbita moschata]
XP_023545319.1 | 7.2e-113 | 90.79 | urease accessory protein F [Cucurbita pepo subsp. pepo]
XP_022999571.1 | 6.1e-112 | 89.91 | urease accessory protein F [Cucurbita maxima] >XP_022999572.1 urease accessory p...
KAG7030122.1 | 6.1e-112 | 89.91 | Urease accessory protein F, partial [Cucurbita argyrosperma subsp. argyrosperma]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9XHZ3 | 1.4e-87 | 69.68 | Urease accessory protein F OS=Arabidopsis thaliana OX=3702 GN=UREF PE=2 SV=1
E0ZS46 | 1.1e-84 | 66.81 | Urease accessory protein F OS=Oryza sativa subsp. indica OX=39946 GN=UREF PE=2 S...
Q0E3L5 | 1.1e-84 | 66.81 | Urease accessory protein F OS=Oryza sativa subsp. japonica OX=39947 GN=UREF PE=2...
B7K911 | 1.6e-14 | 29.41 | Urease accessory protein UreF OS=Gloeothece citriformis (strain PCC 7424) OX=653...
Q117Z4 | 3.9e-13 | 27.03 | Urease accessory protein UreF OS=Trichodesmium erythraeum (strain IMS101) OX=203...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1G3M3 | 2.7e-113 | 90.79 | urease accessory protein F OS=Cucurbita moschata OX=3662 GN=LOC111450480 PE=4 SV...
A0A6J1KB78 | 2.9e-112 | 89.91 | urease accessory protein F OS=Cucurbita maxima OX=3661 GN=LOC111493898 PE=4 SV=1
A0A6J1DLZ1 | 6.6e-112 | 88.60 | urease accessory protein F OS=Momordica charantia OX=3673 GN=LOC111021278 PE=4 S...
A0A0A0LJ08 | 3.1e-109 | 87.28 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G033980 PE=4 SV=1
A0A5D3B9D0 | 2.8e-102 | 88.32 | Lon protease-like protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G21840.1 | 1.0e-88 | 69.68 | urease accessory protein F
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002639 | Urease accessory protein UreF | PIRSF | PIRSF009467 | Urease_acces_UreF | coord: 3..228; e-value: 4.6E-46; score: 155.4
IPR002639 | Urease accessory protein UreF | PFAM | PF01730 | UreF | coord: 38..185; e-value: 5.9E-19; score: 69.0
IPR038277 | UreF domain superfamily | GENE3D | 1.10.4190.10 | Urease accessory protein UreF | coord: 3..207; e-value: 8.0E-50; score: 171.7
None | No IPR available | PANTHER | PTHR33620 | UREASE ACCESSORY PROTEIN F | coord: 1..228
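
The alignment coordinates above indicate how much of the protein each signature covers. The sketch below computes that fraction for each hit. It assumes a protein length of 228 residues, inferred from the PANTHER hit spanning 1..228 rather than stated on this page, and assumes the coordinates are 1-based and inclusive; the names `coverage`, `PROTEIN_LENGTH`, and `HITS` are hypothetical.

```python
PROTEIN_LENGTH = 228  # assumption: inferred from the PANTHER coord 1..228

# Source term -> (start, end), copied from the InterPro alignment column.
HITS = {
    "PIRSF009467": (3, 228),
    "PF01730": (38, 185),
    "1.10.4190.10": (3, 207),
    "PTHR33620": (1, 228),
}


def coverage(start: int, end: int, length: int) -> float:
    """Fraction of the protein covered by a 1-based inclusive interval."""
    if not 1 <= start <= end <= length:
        raise ValueError("interval must lie within the protein")
    return (end - start + 1) / length


for term, (start, end) in HITS.items():
    print(f"{term}: {coverage(start, end, PROTEIN_LENGTH):.1%}")
```

On these numbers the Pfam UreF hit covers roughly two thirds of the protein, while the PIRSF and PANTHER family-level signatures span almost its full length.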

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Clc02G15970.1 | Clc02G15970.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030163 protein catabolic process
biological_process GO:0006508 proteolysis
biological_process GO:0006807 nitrogen compound metabolic process
molecular_function GO:0005524 ATP binding
molecular_function GO:0004176 ATP-dependent peptidase activity
molecular_function GO:0016887 ATP hydrolysis activity
molecular_function GO:0016151 nickel cation binding
molecular_function GO:0004252 serine-type endopeptidase activity