HG10005029 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10005029
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: UPF0587 protein C1orf123 homolog
Location: Chr08: 22383590 .. 22386295 (+)
RNA-Seq Expression: HG10005029
Synteny: HG10005029
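The location string in the overview can be split into chromosome, coordinates, and strand. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual convention for this kind of feature page, but not stated in the record).

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr08: 22383590 .. 22386295 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

location = parse_location("Chr08: 22383590 .. 22386295 (+)")
print(location)
```

Under the inclusive-coordinate assumption, the gene spans 2706 bases on the plus strand of Chr08.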
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTTTCAAAGTATTCCCTTAATATCATCCAATCTTGTTTTCTTCGTTCATTTGATTAATCTTTTGGAATGTTTTTTCTTCTTCCATGGTATTTCTTCAATCGCTTTCTGAATTTGTAACTTATTCGCTAGAACTTGTGGAATTCGAAGTTTATAGCCTTCGTAGCTGGATTTTTATATGTCTGCATGTTCCTCTTTATCATCCTCCTCCTGTTCGAAACTATATATTGTCGTTTTCGGAGTTTTGTCCCTGGAATCGATCCAGGGAAATCATGTATAGTGATGATTCTTGGCTTCGTTTTTCTTTCTAGCTGCTGTTGATTGGTTGTTACCTTATTTTCCTCCTTGATTGTGTTAATTGAGCTTGTCTGTTTATCTTGTTTGTGTGATGTTTTCGATTTAAAGGTGAAATGCGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTAGTTCAAAAGGTAATTCTCTTTTTTCTGGAAAGTATGCAGATTAGTTATTGACATAAATATGATTGTTACATGGTTGATGAGAAAATGAATGGAAAGAGAAGGAAATTTCTGCCTCCCATACTCTTAGGTCCATAGCAATGGAGAAGTGTTTTTATGAGAATGTGTGGTTTTGCCCTAATAACCATTAGAAATAGATGTAGATCTGAAATACGTTTTGTAATTCCACTTTCATTTATCCAATGGTATAAACTACTAGTCGTACTGTGGTGGTCTCATTTCAATAAATTTTCAAAGCAATGGTACTAGTAGGATCCAATCCTTCCTGATTATATTACTAGTCACTTGATAACTTGGTAACTTTTTAACGCTCAAGGCTAGCCTAAGTGTAGTTCAACTGGTTAAGACATATGCCCTCCACCAAGATTTTAGAGGTTCAACTTCTCCACCCTCACTTGTTGTCGAAGAATAAATAAAAAATTGACACTCAAGATCACTATTTCTTAGTTAAACACCATGGGTTGACCTAGTGTTTAGTAAGGGCCATGTAAATAATAAAAGGCTCAGAGACAACGAGTTCAAGTTGGTCCCATGAGAATAGACGAGGTGTGCACAAGTTGGTCCTAACAGTCGCAGCTGGCTTAGTTAGCTTATAATTTGTTTCTTACTTAAGCTTCATCCGTTACTGATTCTAATTTGATTGGTGTACTTTTTTAGAAAAATCAAATAAATGTAGTTTGCCACTGCTGTTGTAAATTTTAAAAGTAATTTATTACAGTGTCTTTGGAAATTTGGGTCAATAGTCAAACTGACATGAAGATATCAGTTTTCAAGTGATATGGAAAACAAAATTGCTAATATGTTTTCCAATATCATTTGACCATGAAGTAATAACATTTTTCAGCAATTCGGGTTTAGTGTGAATGTCAGTTCTGAAATTAGGGACTTGTGGCGATACTGAATGTGATATTCAGTTTACTTTCCACTTCCACCACTCCTTTCATGGGGATACTGAAATTATATTATATAGATTTATTTTTCTTTTTTTAAGGGAACGGCTTGAAAATTAAATATGATAAGTAGTCAAAGATGATTGTCAGTTCCATTTGCACTTAAAATAATTACCTCCAAACTCGATTGCAATATTCGTGCCCATTTTCTTATAGTGCAAGTTCTGTGGGAGGGATGGAACGATTACAATGATTCCAGGGCGAGGTCAAGCATTGACTCAGGAAACAAGTGAATCAGGGAAGTCATCTCCCTTGATGTTATTTGATTGCAGAGGTTATGAGCCTATGGACTTTGTATTTGGACCTGGATGGAAAGTAGAATCTGTAAGTCCTCCAATCTATCTACCTTCCAACTCATGTAACGAATTCCATGATCATGTATAACTTTATTATGAAAT
CAATATATATAACTTGTTAATCATGCAGATTCACAACGTTCTAGACTTGCCATGAGACTAGAATTTCCTTCCCCTTTCTTAACCCCCCCAGCCCCCCTTTCCTGCCGAGGTTTTCTATTCTTGACTTAATAGCTCATTATGGATGTCTGTTATGTGACCTTAGATGTCTGTAATTGTAAATTTGATACAATAAATTTTCTTTACTTGGTCCTTTCAACTTCAGTGAGCATCATTCTTTTCCTTGAAGATTTTTTCTTCAAAATTTTGATGCTGACCATACTATAACAGCCCCACTGTCTGGCATTTTGATGTAATTGAGAACTGGTAGTAATTCGTATAGCCTGTGTTCTTGCAGCCTATCCTCGAACATTAATATTTTTGAAATAGAGGCATGAAAGTTTAATAACTAGATAAAAAATGAAGAATATGACGTAGATACTTACCCTTTTATCTTAATCATGCTGTCTAATAGTTTGCCCAACCTACTAATTATGCACAATAATGTTTGCCTATATTCACTTTATTCCTAAAGTTGGCATGATTTTGTATAGAAGTTTTAAAACAGCTGCCTTATTTTTCGCAGATAGAGGGGACTAAATTTGAGGATATTGACTTGAGTGGAGGTGAGTTTGCAGAGTATGATGAGAAGGGAGAATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGACTTGGTAAAGTAG

mRNA sequence

ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTTTCAAAGTGAAATGCGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTAGTTCAAAAGTGCAAGTTCTGTGGGAGGGATGGAACGATTACAATGATTCCAGGGCGAGGTCAAGCATTGACTCAGGAAACAAGTGAATCAGGGAAGTCATCTCCCTTGATGTTATTTGATTGCAGAGGTTATGAGCCTATGGACTTTGTATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTGAGTGGAGGTGAGTTTGCAGAGTATGATGAGAAGGGAGAATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGACTTGGTAAAGTAG

Coding sequence (CDS)

ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCTCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTTTCAAAGTGAAATGCGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTAGTTCAAAAGTGCAAGTTCTGTGGGAGGGATGGAACGATTACAATGATTCCAGGGCGAGGTCAAGCATTGACTCAGGAAACAAGTGAATCAGGGAAGTCATCTCCCTTGATGTTATTTGATTGCAGAGGTTATGAGCCTATGGACTTTGTATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATATTGACTTGAGTGGAGGTGAGTTTGCAGAGTATGATGAGAAGGGAGAATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGACTTGGTAAAGTAG

Protein sequence

MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGPGWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK
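The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the snippet below translates the first 60 and the last 15 nucleotides of the CDS; the helper name `translate` and the variable names are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 60 nt and last 15 nt of the CDS listed above.
cds_start = "ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCT"
cds_end = "GACTTGGTAAAGTAG"

print(translate(cds_start))  # first 20 residues
print(translate(cds_end))    # last 4 residues, stop codon excluded
```

The same function can be applied to the full CDS string; the terminal TAG stop codon is dropped from the returned protein.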
Homology
BLAST of HG10005029 vs. NCBI nr
Match: XP_038884333.1 (CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida])

HSP 1 Score: 339.0 bits (868), Expect = 2.4e-89
Identity = 159/166 (95.78%), Positives = 161/166 (96.99%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA
Sbjct: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGR+GTITMIPGRGQ LTQETSE GKSSPLMLFDCRGYEPMDFVFGP
Sbjct: 61  GKGTTNLVQKCKFCGREGTITMIPGRGQPLTQETSELGKSSPLMLFDCRGYEPMDFVFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS LEATF+LVK
Sbjct: 121 GWKAESIEGTKFEDIDLSEGEFAEYDEKGECPVMISKLEATFELVK 166
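The Identity figure in each HSP header is the number of aligned columns with identical residues divided by the alignment length. A minimal sketch of that arithmetic follows, using the last block of the alignment above; the function names are illustrative, and the Positives figure is not reproduced because it depends on the substitution matrix used by BLAST.

```python
def count_identities(query, sbjct):
    """Count aligned columns where both sequences carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")

def percent_identity(identities, aligned_length):
    """Identity as a percentage, rounded to two decimals as in the report."""
    return round(100.0 * identities / aligned_length, 2)

# Last alignment block (query 121 onward) of the Benincasa hispida hit.
query_block = "GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK"
sbjct_block = "GWKAESIEGTKFEDIDLSEGEFAEYDEKGECPVMISKLEATFELVK"

block_identities = count_identities(query_block, sbjct_block)
print(block_identities, "of", len(query_block), "columns identical in this block")

# Whole-HSP figure reported in the header: 159 identities over 166 columns.
print(percent_identity(159, 166))
```

Summing the per-block counts over all three blocks gives the 159 identities reported for this hit.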

BLAST of HG10005029 vs. NCBI nr
Match: XP_022952030.1 (UPF0587 protein C1orf123 homolog [Cucurbita moschata] >XP_023002822.1 UPF0587 protein C1orf123 homolog [Cucurbita maxima] >XP_023536671.1 UPF0587 protein C1orf123 homolog [Cucurbita pepo subsp. pepo] >KAG7020640.1 hypothetical protein SDJN02_17326 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 336.7 bits (862), Expect = 1.2e-88
Identity = 156/166 (93.98%), Positives = 161/166 (96.99%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQA
Sbjct: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQA 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKSSPLMLFDCRGYEP+ F+FGP
Sbjct: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLFDCRGYEPVGFIFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Sbjct: 121 GWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK 166

BLAST of HG10005029 vs. NCBI nr
Match: KAG6585732.1 (CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 336.7 bits (862), Expect = 1.2e-88
Identity = 156/166 (93.98%), Positives = 161/166 (96.99%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQA
Sbjct: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQA 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKSSPLMLFDCRGYEP+ F+FGP
Sbjct: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLFDCRGYEPVGFIFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Sbjct: 121 GWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK 166

BLAST of HG10005029 vs. NCBI nr
Match: XP_008444495.1 (PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo] >KAA0060987.1 UPF0587 protein C1orf123-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 327.8 bits (839), Expect = 5.5e-86
Identity = 152/166 (91.57%), Positives = 160/166 (96.39%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQA
Sbjct: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQA 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGR+GTITMIPGRG+ LTQE SESG  SPLMLFDCRGYEP+ FVFGP
Sbjct: 61  GKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDFSPLMLFDCRGYEPIGFVFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Sbjct: 121 GWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK 166

BLAST of HG10005029 vs. NCBI nr
Match: XP_022132352.1 (UPF0587 protein C1orf123 [Momordica charantia])

HSP 1 Score: 326.6 bits (836), Expect = 1.2e-85
Identity = 149/166 (89.76%), Positives = 157/166 (94.58%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKI AELENLTNLQPQDGCDDPNF YLFK+KCGRCGEVSQKETC+TLNET+ L  
Sbjct: 1   MVNFLLKINAELENLTNLQPQDGCDDPNFAYLFKLKCGRCGEVSQKETCLTLNETVALHG 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGRDGT+TMIPGRG+ LTQETSESGKSSPLMLFDCRGYEP+DF+FGP
Sbjct: 61  GKGTTNLVQKCKFCGRDGTVTMIPGRGKPLTQETSESGKSSPLMLFDCRGYEPLDFIFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS L+ATFDLVK
Sbjct: 121 GWKAESIEGTKFEDIDLSDGEFAEYDEKGECPVMISKLKATFDLVK 166

BLAST of HG10005029 vs. ExPASy Swiss-Prot
Match: Q3B8G0 (CXXC motif containing zinc binding protein OS=Xenopus laevis OX=8355 GN=czib PE=2 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 2.4e-28
Identity = 65/163 (39.88%), Positives = 98/163 (60.12%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MV F L+ KA LENLT L+P       +F +  K+KCG CGEVS K   +TL +++PL+ 
Sbjct: 1   MVKFALQFKASLENLTQLRPH----GEDFRWFLKLKCGNCGEVSDKWQYITLMDSVPLKG 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           G+G+ ++VQ+CK C R+ +I ++         E SE+ K+  ++ F+CRG EP+DF    
Sbjct: 61  GRGSASMVQRCKLCSRENSIDILAASLHPYNAEDSETFKT--IVEFECRGLEPIDFQPQA 120

Query: 121 GWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATF 163
           G+  E  E GT F +I+L   ++ +YDEK +  V I  +E  F
Sbjct: 121 GFAAEGAETGTPFHEINLQEKDWTDYDEKAKESVGIYEVEHRF 157

BLAST of HG10005029 vs. ExPASy Swiss-Prot
Match: Q8BHG2 (CXXC motif containing zinc binding protein OS=Mus musculus OX=10090 GN=Czib PE=1 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 2.3e-26
Identity = 63/167 (37.72%), Positives = 100/167 (59.88%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ 
Sbjct: 1   MGKIALQLKATLENVTNLRPV----GEDFRWYLKMKCGNCGEISEKWQYIRLMDSVALKG 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           G+G+ ++VQKCK C R+ +I ++    +A   E +E  K+  ++ F+CRG EP+DF    
Sbjct: 61  GRGSASMVQKCKLCARENSIDILSSTIKAYNAEDNEKFKT--IVEFECRGLEPVDFQPQA 120

Query: 121 GWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           G+  + +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Sbjct: 121 GFAADGVESGTVFSDINLQEKDWTDYDEKAQESVGI--FEVTHQFVK 159

BLAST of HG10005029 vs. ExPASy Swiss-Prot
Match: Q32P66 (CXXC motif containing zinc binding protein OS=Bos taurus OX=9913 GN=CZIB PE=2 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 3.9e-26
Identity = 62/162 (38.27%), Positives = 99/162 (61.11%), Query Frame = 0

Query: 6   LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTT 65
           L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ 
Sbjct: 6   LQLKATLENVTNLRPV----GEDFRWYLKMKCGNCGEISEKWQYIRLMDSVALKGGRGSA 65

Query: 66  NLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGPGWKVE 125
           ++VQKCK C R+ +I ++    ++   E +E  K+  ++ F+CRG EP+DF    G+  E
Sbjct: 66  SMVQKCKLCSRENSIEILSSTIKSYNAEDNEKFKT--IVEFECRGLEPVDFQPQAGFAAE 125

Query: 126 SIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
            +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Sbjct: 126 GVESGTVFSDINLQEKDWTDYDEKAQESVGI--YEVTHQFVK 159

BLAST of HG10005029 vs. ExPASy Swiss-Prot
Match: Q9NWV4 (CXXC motif containing zinc binding protein OS=Homo sapiens OX=9606 GN=CZIB PE=1 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 3.9e-26
Identity = 63/167 (37.72%), Positives = 99/167 (59.28%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L +++ L+ 
Sbjct: 1   MGKIALQLKATLENITNLRPV----GEDFRWYLKMKCGNCGEISDKWQYIRLMDSVALKG 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           G+G+ ++VQKCK C R+ +I ++    +    E +E+ K+  ++ F+CRG EP+DF    
Sbjct: 61  GRGSASMVQKCKLCARENSIEILSSTIKPYNAEDNENFKT--IVEFECRGLEPVDFQPQA 120

Query: 121 GWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           G+  E +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Sbjct: 121 GFAAEGVESGTAFSDINLQEKDWTDYDEKAQESVGI--YEVTHQFVK 159

BLAST of HG10005029 vs. ExPASy Swiss-Prot
Match: Q498R7 (CXXC motif containing zinc binding protein OS=Rattus norvegicus OX=10116 GN=Czib PE=2 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 3.9e-26
Identity = 63/167 (37.72%), Positives = 100/167 (59.88%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ 
Sbjct: 1   MGKIALQLKATLENVTNLRPV----GEDFRWYLKMKCGNCGEISEKWQYIRLMDSVALKG 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           G+G+ ++VQKCK C R+ +I ++    ++   E +E  K+  ++ F+CRG EP+DF    
Sbjct: 61  GRGSASMVQKCKLCARENSIEILSSTIKSYNAEDNEKFKT--IVEFECRGLEPVDFQPQA 120

Query: 121 GWKVESIE-GTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           G+  E +E GT F DI+L   ++ +YDEK +  V I   E T   VK
Sbjct: 121 GFAAEGVESGTVFSDINLQEKDWTDYDEKTQESVGI--FEVTHQFVK 159

BLAST of HG10005029 vs. ExPASy TrEMBL
Match: A0A6J1KKK0 (UPF0587 protein C1orf123 homolog OS=Cucurbita maxima OX=3661 GN=LOC111496570 PE=4 SV=1)

HSP 1 Score: 336.7 bits (862), Expect = 5.7e-89
Identity = 156/166 (93.98%), Positives = 161/166 (96.99%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQA
Sbjct: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQA 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKSSPLMLFDCRGYEP+ F+FGP
Sbjct: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLFDCRGYEPVGFIFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Sbjct: 121 GWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK 166

BLAST of HG10005029 vs. ExPASy TrEMBL
Match: A0A6J1GKL1 (UPF0587 protein C1orf123 homolog OS=Cucurbita moschata OX=3662 GN=LOC111454740 PE=4 SV=1)

HSP 1 Score: 336.7 bits (862), Expect = 5.7e-89
Identity = 156/166 (93.98%), Positives = 161/166 (96.99%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQA
Sbjct: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFPYLFKVKCGRCGELSQKETCVTLNETIPLQA 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGRDGTITMIPGRGQ LTQETSESGKSSPLMLFDCRGYEP+ F+FGP
Sbjct: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLFDCRGYEPVGFIFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWK ESIEGTKFEDIDLSGGE+AEYDEKGECPVMISNLEATF+ VK
Sbjct: 121 GWKAESIEGTKFEDIDLSGGEYAEYDEKGECPVMISNLEATFESVK 166

BLAST of HG10005029 vs. ExPASy TrEMBL
Match: A0A5A7V299 (UPF0587 protein C1orf123-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold501G001220 PE=4 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 2.6e-86
Identity = 152/166 (91.57%), Positives = 160/166 (96.39%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQA
Sbjct: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQA 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGR+GTITMIPGRG+ LTQE SESG  SPLMLFDCRGYEP+ FVFGP
Sbjct: 61  GKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDFSPLMLFDCRGYEPIGFVFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Sbjct: 121 GWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK 166

BLAST of HG10005029 vs. ExPASy TrEMBL
Match: A0A1S3BAF1 (UPF0587 protein C1orf123 homolog OS=Cucumis melo OX=3656 GN=LOC103487797 PE=4 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 2.6e-86
Identity = 152/166 (91.57%), Positives = 160/166 (96.39%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQA
Sbjct: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLSETIPLQA 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGR+GTITMIPGRG+ LTQE SESG  SPLMLFDCRGYEP+ FVFGP
Sbjct: 61  GKGTTNLVQKCKFCGREGTITMIPGRGKPLTQEISESGDFSPLMLFDCRGYEPIGFVFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWKVESIEGTKFEDIDL+GGEFAEYDEKGECPVMISNL+A F+L+K
Sbjct: 121 GWKVESIEGTKFEDIDLNGGEFAEYDEKGECPVMISNLDAKFELLK 166

BLAST of HG10005029 vs. ExPASy TrEMBL
Match: A0A6J1BTL8 (UPF0587 protein C1orf123 OS=Momordica charantia OX=3673 GN=LOC111005220 PE=4 SV=1)

HSP 1 Score: 326.6 bits (836), Expect = 5.9e-86
Identity = 149/166 (89.76%), Positives = 157/166 (94.58%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVNFLLKI AELENLTNLQPQDGCDDPNF YLFK+KCGRCGEVSQKETC+TLNET+ L  
Sbjct: 1   MVNFLLKINAELENLTNLQPQDGCDDPNFAYLFKLKCGRCGEVSQKETCLTLNETVALHG 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           GKGTTNLVQKCKFCGRDGT+TMIPGRG+ LTQETSESGKSSPLMLFDCRGYEP+DF+FGP
Sbjct: 61  GKGTTNLVQKCKFCGRDGTVTMIPGRGKPLTQETSESGKSSPLMLFDCRGYEPLDFIFGP 120

Query: 121 GWKVESIEGTKFEDIDLSGGEFAEYDEKGECPVMISNLEATFDLVK 167
           GWK ESIEGTKFEDIDLS GEFAEYDEKGECPVMIS L+ATFDLVK
Sbjct: 121 GWKAESIEGTKFEDIDLSDGEFAEYDEKGECPVMISKLKATFDLVK 166

BLAST of HG10005029 vs. TAIR 10
Match: AT4G32930.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF866, eukaryotic (InterPro:IPR008584); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 251.9 bits (642), Expect = 3.5e-67
Identity = 114/167 (68.26%), Positives = 136/167 (81.44%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTLNET     
Sbjct: 1   MVNYVLKITADLENLTNLQPSGGCDDSNFPYLFKLKCERCGEVTPKETCVTLNETFTPPG 60

Query: 61  GKGTTNLVQKCKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYEPMDFVFGP 120
           G+GT +LVQKCKFCGR+G +TMIPG+G+ LT E SE+G+ +PLM+FDCRGYEP+DF FG 
Sbjct: 61  GRGTCHLVQKCKFCGREGNVTMIPGKGRPLTLEDSEAGEHAPLMVFDCRGYEPIDFGFGG 120

Query: 121 GWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK 167
            WK ++  GTKF++IDLS G EF EYDEKGECPVMISN  A+F + K
Sbjct: 121 YWKAQAESGTKFDEIDLSSGEEFTEYDEKGECPVMISNFRASFSVTK 167

BLAST of HG10005029 vs. TAIR 10
Match: AT4G32930.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF866, eukaryotic (InterPro:IPR008584). )

HSP 1 Score: 244.6 bits (623), Expect = 5.7e-65
Identity = 114/175 (65.14%), Positives = 136/175 (77.71%), Query Frame = 0

Query: 1   MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQA 60
           MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTLNET     
Sbjct: 1   MVNYVLKITADLENLTNLQPSGGCDDSNFPYLFKLKCERCGEVTPKETCVTLNETFTPPG 60

Query: 61  GKGTTNLVQK--------CKFCGRDGTITMIPGRGQALTQETSESGKSSPLMLFDCRGYE 120
           G+GT +LVQK        CKFCGR+G +TMIPG+G+ LT E SE+G+ +PLM+FDCRGYE
Sbjct: 61  GRGTCHLVQKKANIGSDLCKFCGREGNVTMIPGKGRPLTLEDSEAGEHAPLMVFDCRGYE 120

Query: 121 PMDFVFGPGWKVESIEGTKFEDIDLSGG-EFAEYDEKGECPVMISNLEATFDLVK 167
           P+DF FG  WK ++  GTKF++IDLS G EF EYDEKGECPVMISN  A+F + K
Sbjct: 121 PIDFGFGGYWKAQAESGTKFDEIDLSSGEEFTEYDEKGECPVMISNFRASFSVTK 175

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038884333.1 | 2.4e-89 | 95.78 | CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida]
XP_022952030.1 | 1.2e-88 | 93.98 | UPF0587 protein C1orf123 homolog [Cucurbita moschata] >XP_023002822.1 UPF0587 pr...
KAG6585732.1 | 1.2e-88 | 93.98 | CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subs...
XP_008444495.1 | 5.5e-86 | 91.57 | PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo] >KAA0060987.1 UPF0587...
XP_022132352.1 | 1.2e-85 | 89.76 | UPF0587 protein C1orf123 [Momordica charantia]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q3B8G0 | 2.4e-28 | 39.88 | CXXC motif containing zinc binding protein OS=Xenopus laevis OX=8355 GN=czib PE=...
Q8BHG2 | 2.3e-26 | 37.72 | CXXC motif containing zinc binding protein OS=Mus musculus OX=10090 GN=Czib PE=1...
Q32P66 | 3.9e-26 | 38.27 | CXXC motif containing zinc binding protein OS=Bos taurus OX=9913 GN=CZIB PE=2 SV...
Q9NWV4 | 3.9e-26 | 37.72 | CXXC motif containing zinc binding protein OS=Homo sapiens OX=9606 GN=CZIB PE=1 ...
Q498R7 | 3.9e-26 | 37.72 | CXXC motif containing zinc binding protein OS=Rattus norvegicus OX=10116 GN=Czib...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1KKK0 | 5.7e-89 | 93.98 | UPF0587 protein C1orf123 homolog OS=Cucurbita maxima OX=3661 GN=LOC111496570 PE=...
A0A6J1GKL1 | 5.7e-89 | 93.98 | UPF0587 protein C1orf123 homolog OS=Cucurbita moschata OX=3662 GN=LOC111454740 P...
A0A5A7V299 | 2.6e-86 | 91.57 | UPF0587 protein C1orf123-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A1S3BAF1 | 2.6e-86 | 91.57 | UPF0587 protein C1orf123 homolog OS=Cucumis melo OX=3656 GN=LOC103487797 PE=4 SV...
A0A6J1BTL8 | 5.9e-86 | 89.76 | UPF0587 protein C1orf123 OS=Momordica charantia OX=3673 GN=LOC111005220 PE=4 SV=...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G32930.1 | 3.5e-67 | 68.26 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF866,...
AT4G32930.2 | 5.7e-65 | 65.14 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR008584 | CXXC motif containing zinc binding protein, eukaryotic | PFAM | PF05907 | DUF866 | coord: 6..162; e-value: 9.0E-53; score: 178.3
IPR008584 | CXXC motif containing zinc binding protein, eukaryotic | PANTHER | PTHR12857 | UNCHARACTERIZED | coord: 1..166
None | No IPR available | SUPERFAMILY | 141678 | MAL13P1.257-like | coord: 1..162

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10005029.1 | HG10005029.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0008270 | zinc ion binding