Cp4.1LG04g08850 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG04g08850
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionHydroxymethylbilane hydrolyase [cyclizing]
LocationCp4.1LG04: 10334790 .. 10335725 (-)
RNA-Seq ExpressionCp4.1LG04g08850
SyntenyCp4.1LG04g08850
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCACCGGCGCCGCCCACCCCAGCCCAATCCACCACCTAAATCCACTCGCTTCATCTCCTTCTCCCGCCATTCCCACCAACTCCCCTTCGCGTACGGTGGCGTTTACGACGCCTCAGAACTATGCCGGCAGCCTCTCGAACCTCCTCTCTCTCAAAGGCTTCGACCCCCTCTGGTGCCCCACCGTCACCGTCCACCCCACTCCCATCGCCATCAAATCCCATCTCGTTCCTCCAAATCTCCATTTCTACTCCGCTGTCGCCTTCACCTCCCGCTCTGGCATCACAGCCCTCCTCGACGCCGCTACTGAAATCGACGAGCCCTTGCTATCGCCTCAGGGCGACACTTTTCTAATCGCAGCCCTAGGTAAGGACTCGGAGCTTCTCGATCATGGAGTTCTTTCCAAATTTTGCCCTAACGCGAACCGGATTAGAGTCGTCGTACCTAAAATAGCTTCGCCGAGTGGTCTAGTGGAGGCTCTTGGGATTGGAAACCACCGTAGGGTTCTGTGTCCGGTTCCTCGCGTCGTGGGGCTGAACGAGCCTCCGGTGGTTCCAAACTTCCTCCGCGACCTCGCGGCGAGCGGGTGGGTTCCGGTTCGTGTCGATGCGTACGAGACCCGATGGGTCGGACCCGAGTGCGCGAGGAAGCTAGCAGAGAGAGGGGAGGATGAGAAATTGGATGCCATTGTGTTTACTAGTACTGGGGAAGTGGAGGGGCTGCTAAAAAGCTTGAGGGCTTTGGGATTGCAGTGGGAGATGATGAGAAAAAAGTGGCCGGAAATGGTGTTGGCGGCGCACGGTCCGGTGACGGCGGCGGGAGCTGAGAGGCTCGGCGTTAAGATTGATTTGGTGAGTTCTAAATTCGACAGCTTCAATGGTGTGGTCGATGCTCTTCATTCGAGATGGCAGAGCTTAGAACAGAACCCGGAGTAA

mRNA sequence

ATGAGCACCGGCGCCGCCCACCCCAGCCCAATCCACCACCTAAATCCACTCGCTTCATCTCCTTCTCCCGCCATTCCCACCAACTCCCCTTCGCGTACGGTGGCGTTTACGACGCCTCAGAACTATGCCGGCAGCCTCTCGAACCTCCTCTCTCTCAAAGGCTTCGACCCCCTCTGGTGCCCCACCGTCACCGTCCACCCCACTCCCATCGCCATCAAATCCCATCTCGTTCCTCCAAATCTCCATTTCTACTCCGCTGTCGCCTTCACCTCCCGCTCTGGCATCACAGCCCTCCTCGACGCCGCTACTGAAATCGACGAGCCCTTGCTATCGCCTCAGGGCGACACTTTTCTAATCGCAGCCCTAGGTAAGGACTCGGAGCTTCTCGATCATGGAGTTCTTTCCAAATTTTGCCCTAACGCGAACCGGATTAGAGTCGTCGTACCTAAAATAGCTTCGCCGAGTGGTCTAGTGGAGGCTCTTGGGATTGGAAACCACCGTAGGGTTCTGTGTCCGGTTCCTCGCGTCGTGGGGCTGAACGAGCCTCCGGTGGTTCCAAACTTCCTCCGCGACCTCGCGGCGAGCGGGTGGGTTCCGGTTCGTGTCGATGCGTACGAGACCCGATGGGTCGGACCCGAGTGCGCGAGGAAGCTAGCAGAGAGAGGGGAGGATGAGAAATTGGATGCCATTGTGTTTACTAGTACTGGGGAAGTGGAGGGGCTGCTAAAAAGCTTGAGGGCTTTGGGATTGCAGTGGGAGATGATGAGAAAAAAGTGGCCGGAAATGGTGTTGGCGGCGCACGGTCCGGTGACGGCGGCGGGAGCTGAGAGGCTCGGCGTTAAGATTGATTTGGTGAGTTCTAAATTCGACAGCTTCAATGGTGTGGTCGATGCTCTTCATTCGAGATGGCAGAGCTTAGAACAGAACCCGGAGTAA

Coding sequence (CDS)

ATGAGCACCGGCGCCGCCCACCCCAGCCCAATCCACCACCTAAATCCACTCGCTTCATCTCCTTCTCCCGCCATTCCCACCAACTCCCCTTCGCGTACGGTGGCGTTTACGACGCCTCAGAACTATGCCGGCAGCCTCTCGAACCTCCTCTCTCTCAAAGGCTTCGACCCCCTCTGGTGCCCCACCGTCACCGTCCACCCCACTCCCATCGCCATCAAATCCCATCTCGTTCCTCCAAATCTCCATTTCTACTCCGCTGTCGCCTTCACCTCCCGCTCTGGCATCACAGCCCTCCTCGACGCCGCTACTGAAATCGACGAGCCCTTGCTATCGCCTCAGGGCGACACTTTTCTAATCGCAGCCCTAGGTAAGGACTCGGAGCTTCTCGATCATGGAGTTCTTTCCAAATTTTGCCCTAACGCGAACCGGATTAGAGTCGTCGTACCTAAAATAGCTTCGCCGAGTGGTCTAGTGGAGGCTCTTGGGATTGGAAACCACCGTAGGGTTCTGTGTCCGGTTCCTCGCGTCGTGGGGCTGAACGAGCCTCCGGTGGTTCCAAACTTCCTCCGCGACCTCGCGGCGAGCGGGTGGGTTCCGGTTCGTGTCGATGCGTACGAGACCCGATGGGTCGGACCCGAGTGCGCGAGGAAGCTAGCAGAGAGAGGGGAGGATGAGAAATTGGATGCCATTGTGTTTACTAGTACTGGGGAAGTGGAGGGGCTGCTAAAAAGCTTGAGGGCTTTGGGATTGCAGTGGGAGATGATGAGAAAAAAGTGGCCGGAAATGGTGTTGGCGGCGCACGGTCCGGTGACGGCGGCGGGAGCTGAGAGGCTCGGCGTTAAGATTGATTTGGTGAGTTCTAAATTCGACAGCTTCAATGGTGTGGTCGATGCTCTTCATTCGAGATGGCAGAGCTTAGAACAGAACCCGGAGTAA

Protein sequence

MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWCPTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIAALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLNEPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEGLLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALHSRWQSLEQNPE
Homology
BLAST of Cp4.1LG04g08850 vs. ExPASy Swiss-Prot
Match: Q8KCJ3 (Uroporphyrinogen-III synthase OS=Chlorobaculum tepidum (strain ATCC 49652 / DSM 12025 / NBRC 103806 / TLS) OX=194439 GN=hemD PE=3 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 9.8e-07
Identity = 61/266 (22.93%), Postives = 111/266 (41.73%), Query Frame = 0

Query: 32  RTVAFTTPQNYAGSLSNLLSLKGFDPLWCPTVTVHPTPIAIKSHLVPPNLHFYSAVAFTS 91
           +TV  T P++ A      L+  G D +  PT+ + P      +    P+L  ++ + FTS
Sbjct: 2   KTVLVTRPKHQAEPFVRELAQYGLDSVVFPTIEIRPV-----TGWSVPDLTRFAGIFFTS 61

Query: 92  RSGITALLDAATEIDEPLLSPQGDTFLIAALGKDS--ELLDHGVLSKFCPNANRIRVVVP 151
            + +   L+   E + P   P      + A+GK +  +L  HGV  +           +P
Sbjct: 62  PNSVQFFLERLLE-ESPDELPNLQQARVWAVGKTTGGDLEKHGVSIE----------PLP 121

Query: 152 KIASPSGLVEALGIGN-HRRVLCPVPRVVGLNEPPVVPNFLRDLAASGWVPVRVDAYETR 211
           K A    L+  +       +    V   + L   P V      +A  G + V +  Y+  
Sbjct: 122 KSADAVSLMSGIDASEIEGKTFLFVRGSLSLGTIPEV------IAKRGGICVELTVYDNI 181

Query: 212 WVGPECARKLAERGEDEKLDAIVFTSTGEVEGLLKSLRALGLQWEMMRKKWP-EMVLAAH 271
               E  +K+     + K+D + FTS        +++ +         K+ P ++++AA 
Sbjct: 182 QPSLEETQKIKSLLTEGKIDCLSFTSPSTAINFFEAIDS---------KEVPSDVLIAAI 236

Query: 272 GPVTAAGAERLGVKIDLVSSKFDSFN 294
           G  T++  E+LGVK+D++   FD  N
Sbjct: 242 GTTTSSALEKLGVKVDIIPEYFDGPN 236

BLAST of Cp4.1LG04g08850 vs. NCBI nr
Match: XP_023530963.1 (uncharacterized protein LOC111793350 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 630 bits (1625), Expect = 6.54e-228
Identity = 311/311 (100.00%), Postives = 311/311 (100.00%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC
Sbjct: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA
Sbjct: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH
Sbjct: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300

Query: 301 SRWQSLEQNPE 311
           SRWQSLEQNPE
Sbjct: 301 SRWQSLEQNPE 311

BLAST of Cp4.1LG04g08850 vs. NCBI nr
Match: KAG6587847.1 (hypothetical protein SDJN03_16412, partial [Cucurbita argyrosperma subsp. sororia] >KAG7035661.1 hypothetical protein SDJN02_02459, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 625 bits (1611), Expect = 8.92e-226
Identity = 307/311 (98.71%), Postives = 311/311 (100.00%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC
Sbjct: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PTVTVHPTP+AIKSHL+PPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA
Sbjct: 61  PTVTVHPTPLAIKSHLLPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDHGVLSKFCPNA+RIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHGVLSKFCPNASRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSLRALGL+WEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH
Sbjct: 241 LLKSLRALGLKWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300

Query: 301 SRWQSLEQNPE 311
           SRWQSLEQNPE
Sbjct: 301 SRWQSLEQNPE 311

BLAST of Cp4.1LG04g08850 vs. NCBI nr
Match: XP_022926996.1 (uncharacterized protein LOC111433956 [Cucurbita moschata])

HSP 1 Score: 617 bits (1591), Expect = 9.98e-223
Identity = 304/311 (97.75%), Postives = 308/311 (99.04%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHPSPI HLNPLASSPSPAI TNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC
Sbjct: 1   MSTGAAHPSPIRHLNPLASSPSPAISTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PTVTVHPTP+AIKSHL+PPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA
Sbjct: 61  PTVTVHPTPLAIKSHLLPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDHGV SKFCPNA+RIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHGVFSKFCPNASRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSLRALGL+WEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH
Sbjct: 241 LLKSLRALGLKWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300

Query: 301 SRWQSLEQNPE 311
           SRWQSLEQNPE
Sbjct: 301 SRWQSLEQNPE 311

BLAST of Cp4.1LG04g08850 vs. NCBI nr
Match: XP_023003818.1 (uncharacterized protein LOC111497286 [Cucurbita maxima])

HSP 1 Score: 601 bits (1549), Expect = 2.43e-216
Identity = 297/311 (95.50%), Postives = 304/311 (97.75%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHPS IH LNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC
Sbjct: 1   MSTGAAHPSLIH-LNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PTVTVHPTP+AIKSHL+PPNLHFYSAVAFTSRSGITALLDAATEID+PLLSPQGDTFLIA
Sbjct: 61  PTVTVHPTPLAIKSHLLPPNLHFYSAVAFTSRSGITALLDAATEIDDPLLSPQGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDHGVLSKFCPN +RIRVVVPKIA+PSGLVEALGIGNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHGVLSKFCPNTSRIRVVVPKIATPSGLVEALGIGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDLAASGWVPVRVDAYETRW GPECAR L +RGEDEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWAGPECARMLVKRGEDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSLR LGL+WEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH
Sbjct: 241 LLKSLRTLGLKWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300

Query: 301 SRWQSLEQNPE 311
           SRWQSLEQNPE
Sbjct: 301 SRWQSLEQNPE 310

BLAST of Cp4.1LG04g08850 vs. NCBI nr
Match: XP_008444040.1 (PREDICTED: uncharacterized protein LOC103487492 [Cucumis melo] >KAA0050031.1 Tetrapyrrole biosynthesis, uroporphyrinogen III synthase [Cucumis melo var. makuwa])

HSP 1 Score: 527 bits (1357), Expect = 3.95e-187
Identity = 255/307 (83.06%), Postives = 281/307 (91.53%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHP+   H++PL SSPSPAIPT+S  RTVAFTTPQNYAGSLS+LLSLKGF+PLWC
Sbjct: 1   MSTGAAHPNGPFHISPLTSSPSPAIPTHSSPRTVAFTTPQNYAGSLSHLLSLKGFEPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PT+TV PTP+AIKSHL+PP LH +SAVAFTSRSGITALLDAATEI EPLL   GDTFLIA
Sbjct: 61  PTLTVQPTPLAIKSHLLPPILHSFSAVAFTSRSGITALLDAATEIGEPLLPSHGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDH  L+  CPN +RIRVVVP+IA+P+GLVEALG+GNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHEFLTTICPNTSRIRVVVPEIATPTGLVEALGVGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDL A GWVPVRVDAYETRW GP+CARKL ERG+DEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLEAKGWVPVRVDAYETRWAGPDCARKLVERGKDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSL  LGL+W+MM+K+WPEMV+AAHGPVTAAGAERLGVK+DLVS KFDSFNGVVD+LH
Sbjct: 241 LLKSLEHLGLEWDMMKKRWPEMVVAAHGPVTAAGAERLGVKVDLVSPKFDSFNGVVDSLH 300

Query: 301 SRWQSLE 307
            RWQSL+
Sbjct: 301 WRWQSLD 307

BLAST of Cp4.1LG04g08850 vs. ExPASy TrEMBL
Match: A0A6J1EGG3 (Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucurbita moschata OX=3662 GN=LOC111433956 PE=3 SV=1)

HSP 1 Score: 617 bits (1591), Expect = 4.83e-223
Identity = 304/311 (97.75%), Postives = 308/311 (99.04%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHPSPI HLNPLASSPSPAI TNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC
Sbjct: 1   MSTGAAHPSPIRHLNPLASSPSPAISTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PTVTVHPTP+AIKSHL+PPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA
Sbjct: 61  PTVTVHPTPLAIKSHLLPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDHGV SKFCPNA+RIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHGVFSKFCPNASRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSLRALGL+WEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH
Sbjct: 241 LLKSLRALGLKWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300

Query: 301 SRWQSLEQNPE 311
           SRWQSLEQNPE
Sbjct: 301 SRWQSLEQNPE 311

BLAST of Cp4.1LG04g08850 vs. ExPASy TrEMBL
Match: A0A6J1KXQ2 (Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucurbita maxima OX=3661 GN=LOC111497286 PE=3 SV=1)

HSP 1 Score: 601 bits (1549), Expect = 1.18e-216
Identity = 297/311 (95.50%), Postives = 304/311 (97.75%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHPS IH LNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC
Sbjct: 1   MSTGAAHPSLIH-LNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PTVTVHPTP+AIKSHL+PPNLHFYSAVAFTSRSGITALLDAATEID+PLLSPQGDTFLIA
Sbjct: 61  PTVTVHPTPLAIKSHLLPPNLHFYSAVAFTSRSGITALLDAATEIDDPLLSPQGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDHGVLSKFCPN +RIRVVVPKIA+PSGLVEALGIGNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHGVLSKFCPNTSRIRVVVPKIATPSGLVEALGIGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDLAASGWVPVRVDAYETRW GPECAR L +RGEDEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWAGPECARMLVKRGEDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSLR LGL+WEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH
Sbjct: 241 LLKSLRTLGLKWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300

Query: 301 SRWQSLEQNPE 311
           SRWQSLEQNPE
Sbjct: 301 SRWQSLEQNPE 310

BLAST of Cp4.1LG04g08850 vs. ExPASy TrEMBL
Match: A0A5A7U8V3 (Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold675G00050 PE=3 SV=1)

HSP 1 Score: 527 bits (1357), Expect = 1.91e-187
Identity = 255/307 (83.06%), Postives = 281/307 (91.53%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHP+   H++PL SSPSPAIPT+S  RTVAFTTPQNYAGSLS+LLSLKGF+PLWC
Sbjct: 1   MSTGAAHPNGPFHISPLTSSPSPAIPTHSSPRTVAFTTPQNYAGSLSHLLSLKGFEPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PT+TV PTP+AIKSHL+PP LH +SAVAFTSRSGITALLDAATEI EPLL   GDTFLIA
Sbjct: 61  PTLTVQPTPLAIKSHLLPPILHSFSAVAFTSRSGITALLDAATEIGEPLLPSHGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDH  L+  CPN +RIRVVVP+IA+P+GLVEALG+GNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHEFLTTICPNTSRIRVVVPEIATPTGLVEALGVGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDL A GWVPVRVDAYETRW GP+CARKL ERG+DEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLEAKGWVPVRVDAYETRWAGPDCARKLVERGKDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSL  LGL+W+MM+K+WPEMV+AAHGPVTAAGAERLGVK+DLVS KFDSFNGVVD+LH
Sbjct: 241 LLKSLEHLGLEWDMMKKRWPEMVVAAHGPVTAAGAERLGVKVDLVSPKFDSFNGVVDSLH 300

Query: 301 SRWQSLE 307
            RWQSL+
Sbjct: 301 WRWQSLD 307

BLAST of Cp4.1LG04g08850 vs. ExPASy TrEMBL
Match: A0A1S3BA78 (Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucumis melo OX=3656 GN=LOC103487492 PE=3 SV=1)

HSP 1 Score: 527 bits (1357), Expect = 1.91e-187
Identity = 255/307 (83.06%), Postives = 281/307 (91.53%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHP+   H++PL SSPSPAIPT+S  RTVAFTTPQNYAGSLS+LLSLKGF+PLWC
Sbjct: 1   MSTGAAHPNGPFHISPLTSSPSPAIPTHSSPRTVAFTTPQNYAGSLSHLLSLKGFEPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PT+TV PTP+AIKSHL+PP LH +SAVAFTSRSGITALLDAATEI EPLL   GDTFLIA
Sbjct: 61  PTLTVQPTPLAIKSHLLPPILHSFSAVAFTSRSGITALLDAATEIGEPLLPSHGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDH  L+  CPN +RIRVVVP+IA+P+GLVEALG+GNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHEFLTTICPNTSRIRVVVPEIATPTGLVEALGVGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDL A GWVPVRVDAYETRW GP+CARKL ERG+DEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLEAKGWVPVRVDAYETRWAGPDCARKLVERGKDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSL  LGL+W+MM+K+WPEMV+AAHGPVTAAGAERLGVK+DLVS KFDSFNGVVD+LH
Sbjct: 241 LLKSLEHLGLEWDMMKKRWPEMVVAAHGPVTAAGAERLGVKVDLVSPKFDSFNGVVDSLH 300

Query: 301 SRWQSLE 307
            RWQSL+
Sbjct: 301 WRWQSLD 307

BLAST of Cp4.1LG04g08850 vs. ExPASy TrEMBL
Match: A0A5D3BX13 (Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1369G00630 PE=3 SV=1)

HSP 1 Score: 525 bits (1353), Expect = 7.78e-187
Identity = 254/307 (82.74%), Postives = 281/307 (91.53%), Query Frame = 0

Query: 1   MSTGAAHPSPIHHLNPLASSPSPAIPTNSPSRTVAFTTPQNYAGSLSNLLSLKGFDPLWC 60
           MSTGAAHP+   H++PL SSPSPAIPT+S  RTVAFTTPQNYAGSLS+LLSLKGF+PLWC
Sbjct: 1   MSTGAAHPNGPFHISPLTSSPSPAIPTHSSPRTVAFTTPQNYAGSLSHLLSLKGFEPLWC 60

Query: 61  PTVTVHPTPIAIKSHLVPPNLHFYSAVAFTSRSGITALLDAATEIDEPLLSPQGDTFLIA 120
           PT+TV PTP+AIKSHL+PP LH +SAVAFTSRSGITALLDAATEI EPLL   GDTFLIA
Sbjct: 61  PTLTVQPTPLAIKSHLLPPILHSFSAVAFTSRSGITALLDAATEIGEPLLPSHGDTFLIA 120

Query: 121 ALGKDSELLDHGVLSKFCPNANRIRVVVPKIASPSGLVEALGIGNHRRVLCPVPRVVGLN 180
           ALGKDSELLDH  L+  CPN +RIRVVVP+IA+P+GLVEALG+GNHRRVLCPVPRVVGLN
Sbjct: 121 ALGKDSELLDHEFLTTICPNTSRIRVVVPEIATPTGLVEALGVGNHRRVLCPVPRVVGLN 180

Query: 181 EPPVVPNFLRDLAASGWVPVRVDAYETRWVGPECARKLAERGEDEKLDAIVFTSTGEVEG 240
           EPPVVPNFLRDL A GWVPVRVDAYETRW GP+CARKL ERG+DEKLDAIVFTSTGEVEG
Sbjct: 181 EPPVVPNFLRDLEAKGWVPVRVDAYETRWAGPDCARKLVERGKDEKLDAIVFTSTGEVEG 240

Query: 241 LLKSLRALGLQWEMMRKKWPEMVLAAHGPVTAAGAERLGVKIDLVSSKFDSFNGVVDALH 300
           LLKSL  LGL+W++M+K+WPEMV+AAHGPVTAAGAERLGVK+DLVS KFDSFNGVVD+LH
Sbjct: 241 LLKSLEHLGLEWDIMKKRWPEMVVAAHGPVTAAGAERLGVKVDLVSPKFDSFNGVVDSLH 300

Query: 301 SRWQSLE 307
            RWQSL+
Sbjct: 301 WRWQSLD 307

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8KCJ39.8e-0722.93Uroporphyrinogen-III synthase OS=Chlorobaculum tepidum (strain ATCC 49652 / DSM ... [more]
Match NameE-valueIdentityDescription
XP_023530963.16.54e-228100.00uncharacterized protein LOC111793350 [Cucurbita pepo subsp. pepo][more]
KAG6587847.18.92e-22698.71hypothetical protein SDJN03_16412, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022926996.19.98e-22397.75uncharacterized protein LOC111433956 [Cucurbita moschata][more]
XP_023003818.12.43e-21695.50uncharacterized protein LOC111497286 [Cucurbita maxima][more]
XP_008444040.13.95e-18783.06PREDICTED: uncharacterized protein LOC103487492 [Cucumis melo] >KAA0050031.1 Tet... [more]
Match NameE-valueIdentityDescription
A0A6J1EGG34.83e-22397.75Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucurbita moschata OX=3662 GN=LOC1... [more]
A0A6J1KXQ21.18e-21695.50Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucurbita maxima OX=3661 GN=LOC111... [more]
A0A5A7U8V31.91e-18783.06Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucumis melo var. makuwa OX=119469... [more]
A0A1S3BA781.91e-18783.06Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucumis melo OX=3656 GN=LOC1034874... [more]
A0A5D3BX137.78e-18782.74Hydroxymethylbilane hydrolyase [cyclizing] OS=Cucumis melo var. makuwa OX=119469... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036108Tetrapyrrole biosynthesis, uroporphyrinogen III synthase superfamilyGENE3D3.40.50.10090coord: 185..305
e-value: 2.1E-13
score: 52.1
coord: 28..161
e-value: 2.4E-5
score: 26.0
IPR036108Tetrapyrrole biosynthesis, uroporphyrinogen III synthase superfamilySUPERFAMILY69618HemD-likecoord: 32..303
IPR003754Tetrapyrrole biosynthesis, uroporphyrinogen III synthasePFAMPF02602HEM4coord: 45..293
e-value: 3.5E-23
score: 82.2
IPR003754Tetrapyrrole biosynthesis, uroporphyrinogen III synthaseCDDcd06578HemDcoord: 34..299
e-value: 2.19158E-25
score: 99.6891
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..31
NoneNo IPR availablePANTHERPTHR38020UROPORPHYRINOGEN-III SYNTHASEcoord: 18..305

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g08850.1Cp4.1LG04g08850.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006782 protoporphyrinogen IX biosynthetic process
biological_process GO:0033014 tetrapyrrole biosynthetic process
molecular_function GO:0004852 uroporphyrinogen-III synthase activity