Cp4.1LG03g04790 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g04790
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Porphobilinogen deaminase
Location: Cp4.1LG03 : 5288781 .. 5292330 (-)
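
The location value can be split into its parts programmatically. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state). The function name parse_location is hypothetical.

```python
import re

def parse_location(text):
    """Split a location like 'Cp4.1LG03 : 5288781 .. 5292330 (-)' into fields.

    Assumes 1-based, inclusive coordinates (an assumption, not confirmed
    by the source page).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Inclusive span, valid only under the 1-based assumption above.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG03 : 5288781 .. 5292330 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 3550 bases on the minus strand of Cp4.1LG03.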
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TGGAAGTGAAATGGATGAGAATCCTATCTGCCAACGCGAATGCAGAGATTTGGCCTGTTGGGTCATCAAAAACAATCCCTTCTAAGCTAACCCATCTCCTTTTTTCTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTCAAATGGTTGCTTTATCTTCCCCTTCTGTAAGCCACTCTCCCATGGCTCGCCCTTGTAATCTTGCCTCCCTTTCTCTTCCTGGGTTCTCCTCACTTTCCCTTAAACCTCCGGCTTTCATTAATGCCCCCAAGAAACTTCATGGGCTTGGCCTCATAAGAGCCTCTGTTGCTGTGGAGCAGCAGGCAGGGAAGACCAAGGTTGCTCTCCTCAGAATTGGCACCAGAGGAAGGTATAACTTCTTTGTTTCGTGGGTCTTTTTTCGATTTTCTTTTGGTTTATGTTTTAGGATGTGTTCAGTTTCTAATCATGAGTTTGTGGCTGATTTGAGGGATAATGTAGGGCATTCAGTTGTTGTTTAAGGGTTTGGATCACTTCAATTTGTGCCTTTTAACTGTGATTAAGTTCTGATAAAGTGTTTATGCTGCTCATGCTATCAATCATGGCAGTGATAGTGTTTTACCTTACAATCTAAATGTTTGGATCCAATCTGGTGTTGTAGAGTAGAATCAGTTCTTCGTTTGAAGAGTAAAACTAATGTGAGATCCTACGTCGGTTGTGGAGGAGAATGGAACATTCTTTATAAGGGTGTGGAAACCCCTCCCTGGCATACGCATTTTAAAATCTTGAGGGGAAGCCCAGAAGGGGAAGTTCAAAGAGGACAATATATGTTAGTGGTGGGATTAGGTCGTTATAAATGGTATTGTCACTAGACATTGGGCGATGTGCCAGTGGAAGGATGAGCCCCGAAGGAGGGTGGACACGAGGCAGTGTGCCAACAAGAACGTTGGGCTACGAAGAGGTGTGGATTGGGGTCCCACGTTGATTTGAGAAGGGAACAAGTGCGAGAGAGGACGTTGGACCCTAAAGGGGGTGGATTGGGAGATCCCACATCGGTTGAGGAGAACAAAACATTCTCCATAAGAGTGTGGAAACTTCTCCGTAGCAAACGCGTTTTAAAACCTTGAGGGGAAGCCCGGAAGGGAAGCCCAGAGAGGACAATATTTACTAGCGATGGGCTTAGGTCGTTACAAATGGTATCAGAGCCAGACACCGGGCAATGTGCCAGTGGAAGGTTGAGCCCCGAAGGAGGGTGGACACGAGGCGGTGTGCTAGCAAGGTCATTGGGCTCAAAAGGGGTGTGGATTAGGGGTCTCACGTCGATTTGAGAAGAGAACAAGTGCCAGTGAGGACGCTGGGACCCAAAGGGGGTGGATTGTGAGATCCCACATCGGTTGGGGAGGGCGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTACCAAACGCGTTTTAAAAACCTTAAAGGGAAGCCTAGAAGGAAAAGCGTAAAGAGGACAAGGTTACTTCATTTTCTATATGCTTGTTTGATTGGGTTTACTTTATCTTCTTCTGTGTCAGGACAGCCCATTAGCACTAGCCCAGGCTCATGAGACAAGAGACAAGCTCATGGCTTCCCATCCTGAGCTAGCTGAAGATGGTGCCATTCAGATTGTTGTAATCAAAACCACTGGTGACAAAATACTCTCTCAGCCACTTGCAGACATTGGTGGGAAAGGCTTATTTACAAAAGAGATTGATGATGCACTTATCAATGGTGACATAGACATTGCTGTTCACTCAATGAAAGATGTCCCAACTTACTTACCTGAAAAAACCATCCTCCCGTGTAACCTTCCGAGGGAGGACGTCCGGGACGCATTCATTTCTCTACGTGCAGGTTCACTTGCCGAGCTTCCAGCTGGAAGCGTCGTCGGTACAGCTTCACTGAGAAGAAAGTCTCAGCTACTGAATCGATATCCGTCCCTCGAAGTAAGTATGGTGTGTTATTTAATGGAAAAACATCCAAAAAGTTTATGT
TCTACGTTTACATTGTTGTTGTGTATGTTGAATTCTGATTCTTGATGGAATGTTTATGTGTGAGATCCCACGTTGGTTGGAGAGGTGAACGAAGCATTTCTTATAAGGGTGTGGAAACCTCTTCCTAGTAGACTCGTTTTAAAATTTTGAGAGGAAGCCCTTGAAGGGAAAGCCCAAAGAGATCAGTATCTTCTAGCGATGGGCTTGGACTGTTACAAATGGTATTAGAATCAGACACTGGACGGTGTGCCAGCGAGTACACTGGGCCCCCAAGGGGGTGGATTGTAAGATCTCACATCGGTTAGAGAGAGGAACAAAGCATTCTTTATACGGGTGTGGAAACCTCTCACTAGCATACGCGTTTTAAAACCGTGAGATTGATAGCGATATGTAACGGGCCAAAACGGGCAATATCTACTAGCATGGTAGTGTTTAACGAGTCTTAAATCTTCAATCTTGTATGTTGGTTCTGATGCTCGCTCTCGTTTTTTCCATTGTTTGTTGGTGTCACTCGTTAGGTACTCGAGAATTTTCGGGGTAATGTCCAAACAAGGTTGAAAAAACTAAACGAAGGAGTAGTCCAAGCAACACTATTAGCTCTAGCTGGACTTCGACGTTTAAATATGACAGAAAACGTTACTTCGATCCTTTCAATCGATGAAATGCTTCCAGCTGTTGCTCAGGGAGCTATCGGTATCGCCTGTCGAAGCGACGATGACAAAATGGTGCGTTTTCATTGGCTATACAAATCAATATTTGTCCATGAAAGTTCATTGCTTGAGAAACACAACTAAATGACATCTTATTTGAGCTGAACTTCGCCATTCCCGTTTGGTTTCTCATTTGCAGGCTGAGTACTTAGCCTCATTAAACCATGAGGAAACAAGACTGGCTGTTGTTTGTGAGAGAGCTTTCCTCGAGACTCTTGATGGATCATGCCGAACCCCGATTGCGGGATATGCGTCGAGGGATGAAGATGGCAATTGTATATTCAAAGGGTTGGTAGCTTCCCCGGACGGAACCCGAGGTACGATCTCAAACGTTCATGTTCGAATGAAGTAACTTCTACATTGCTAGAACATTGAATATCTACATATTTTACAGTTCTTGAAACTTCTCGACGAGGTTCGTATGCCATCGACGATATGATAGCAATGGGGAAGGATGCTGGGAAGGAGCTACTTTCTCGAGCAGGTCCGGGTTTTTTTGATAGCTAGTTAGTTTAAAATTTAGGATAAAGATGAAGGCTTCATATCCAAATCTTCAAGTGAGTAGGAGTTATGTTCATCCTTTGTTTTGTTAATTGGTTGTGTAAATTAGAATAGAAATCATTTAGACTATCTTGGGTTCCTCAAATAAGTCTATGCAAACTAGCCATTTCTAATCCTCAAGCTCTATACAAACATGTGTTACTCTTTAATGGTCGTATTTGATCCGTTGATTGGATGATCGAACGCTCCGACCCGAACCGAAAGCTACCGAGCCTCTTCATGTTCACTTCTTCATTTACAAATGATACGTTTGGAATATAAACTCTCTTGTACCAGCTC

mRNA sequence

TGGAAGTGAAATGGATGAGAATCCTATCTGCCAACGCGAATGCAGAGATTTGGCCTGTTGGGTCATCAAAAACAATCCCTTCTAAGCTAACCCATCTCCTTTTTTCTTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTCAAATGGTTGCTTTATCTTCCCCTTCTGTAAGCCACTCTCCCATGGCTCGCCCTTGTAATCTTGCCTCCCTTTCTCTTCCTGGGTTCTCCTCACTTTCCCTTAAACCTCCGGCTTTCATTAATGCCCCCAAGAAACTTCATGGGCTTGGCCTCATAAGAGCCTCTGTTGCTGTGGAGCAGCAGGCAGGGAAGACCAAGGTTGCTCTCCTCAGAATTGGCACCAGAGGAAGGATGTGTTCAGTTTCTAATCATGAGTTTGTGGCTGATTTGAGGGATAATGACAGCCCATTAGCACTAGCCCAGGCTCATGAGACAAGAGACAAGCTCATGGCTTCCCATCCTGAGCTAGCTGAAGATGGTGCCATTCAGATTGTTGTAATCAAAACCACTGGTGACAAAATACTCTCTCAGCCACTTGCAGACATTGGTGGGAAAGGCTTATTTACAAAAGAGATTGATGATGCACTTATCAATGGTGACATAGACATTGCTGTTCACTCAATGAAAGATGTCCCAACTTACTTACCTGAAAAAACCATCCTCCCGTGTAACCTTCCGAGGGAGGACGTCCGGGACGCATTCATTTCTCTACGTGCAGGTTCACTTGCCGAGCTTCCAGCTGGAAGCGTCGTCGGTACAGCTTCACTGAGAAGAAAGTCTCAGCTACTGAATCGATATCCGTCCCTCGAAGTACTCGAGAATTTTCGGGGTAATGTCCAAACAAGGTTGAAAAAACTAAACGAAGGAGTAGTCCAAGCAACACTATTAGCTCTAGCTGGACTTCGACGTTTAAATATGACAGAAAACGTTACTTCGATCCTTTCAATCGATGAAATGCTTCCAGCTGTTGCTCAGGGAGCTATCGGTATCGCCTGTCGAAGCGACGATGACAAAATGGCTGAGTACTTAGCCTCATTAAACCATGAGGAAACAAGACTGGCTGTTGTTTGTGAGAGAGCTTTCCTCGAGACTCTTGATGGATCATGCCGAACCCCGATTGCGGGATATGCGTCGAGGGATGAAGATGGCAATTGTATATTCAAAGGGTTGGTAGCTTCCCCGGACGGAACCCGAGTTCTTGAAACTTCTCGACGAGGTTCGTATGCCATCGACGATATGATAGCAATGGGGAAGGATGCTGGGAAGGAGCTACTTTCTCGAGCAGGTCCGGGTTTTTTTGATAGCTAGTTAGTTTAAAATTTAGGATAAAGATGAAGGCTTCATATCCAAATCTTCAAGTGAGTAGGAGTTATGTTCATCCTTTGTTTTGTTAATTGGTTGTGTAAATTAGAATAGAAATCATTTAGACTATCTTGGGTTCCTCAAATAAGTCTATGCAAACTAGCCATTTCTAATCCTCAAGCTCTATACAAACATGTGTTACTCTTTAATGGTCGTATTTGATCCGTTGATTGGATGATCGAACGCTCCGACCCGAACCGAAAGCTACCGAGCCTCTTCATGTTCACTTCTTCATTTACAAATGATACGTTTGGAATATAAACTCTCTTGTACCAGCTC

Coding sequence (CDS)

ATGGTTGCTTTATCTTCCCCTTCTGTAAGCCACTCTCCCATGGCTCGCCCTTGTAATCTTGCCTCCCTTTCTCTTCCTGGGTTCTCCTCACTTTCCCTTAAACCTCCGGCTTTCATTAATGCCCCCAAGAAACTTCATGGGCTTGGCCTCATAAGAGCCTCTGTTGCTGTGGAGCAGCAGGCAGGGAAGACCAAGGTTGCTCTCCTCAGAATTGGCACCAGAGGAAGGATGTGTTCAGTTTCTAATCATGAGTTTGTGGCTGATTTGAGGGATAATGACAGCCCATTAGCACTAGCCCAGGCTCATGAGACAAGAGACAAGCTCATGGCTTCCCATCCTGAGCTAGCTGAAGATGGTGCCATTCAGATTGTTGTAATCAAAACCACTGGTGACAAAATACTCTCTCAGCCACTTGCAGACATTGGTGGGAAAGGCTTATTTACAAAAGAGATTGATGATGCACTTATCAATGGTGACATAGACATTGCTGTTCACTCAATGAAAGATGTCCCAACTTACTTACCTGAAAAAACCATCCTCCCGTGTAACCTTCCGAGGGAGGACGTCCGGGACGCATTCATTTCTCTACGTGCAGGTTCACTTGCCGAGCTTCCAGCTGGAAGCGTCGTCGGTACAGCTTCACTGAGAAGAAAGTCTCAGCTACTGAATCGATATCCGTCCCTCGAAGTACTCGAGAATTTTCGGGGTAATGTCCAAACAAGGTTGAAAAAACTAAACGAAGGAGTAGTCCAAGCAACACTATTAGCTCTAGCTGGACTTCGACGTTTAAATATGACAGAAAACGTTACTTCGATCCTTTCAATCGATGAAATGCTTCCAGCTGTTGCTCAGGGAGCTATCGGTATCGCCTGTCGAAGCGACGATGACAAAATGGCTGAGTACTTAGCCTCATTAAACCATGAGGAAACAAGACTGGCTGTTGTTTGTGAGAGAGCTTTCCTCGAGACTCTTGATGGATCATGCCGAACCCCGATTGCGGGATATGCGTCGAGGGATGAAGATGGCAATTGTATATTCAAAGGGTTGGTAGCTTCCCCGGACGGAACCCGAGTTCTTGAAACTTCTCGACGAGGTTCGTATGCCATCGACGATATGATAGCAATGGGGAAGGATGCTGGGAAGGAGCTACTTTCTCGAGCAGGTCCGGGTTTTTTTGATAGCTAG

Protein sequence

MVALSSPSVSHSPMARPCNLASLSLPGFSSLSLKPPAFINAPKKLHGLGLIRASVAVEQQAGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYAIDDMIAMGKDAGKELLSRAGPGFFDS
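
The protein sequence should match the CDS read codon by codon. The following small Python sketch assumes the standard genetic code (NCBI translation table 1) and uses only the first six and the last three codons of the CDS listed above. The helper name translate is hypothetical.

```python
# Standard genetic code laid out in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGGTTGCTTTATCTTCC"))  # first six codons of the CDS
print(translate("GATAGCTAG"))           # last three codons of the CDS
```

The two fragments translate to MVALSS and DS followed by a stop, which agrees with the start and end of the protein sequence shown above.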
BLAST of Cp4.1LG03g04790 vs. Swiss-Prot
Match: HEM3_PEA (Porphobilinogen deaminase, chloroplastic OS=Pisum sativum GN=HEMC PE=1 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 1.6e-155
Identity = 291/373 (78.02%), Positives = 320/373 (85.79%), Query Frame = 1

Query: 24  SLPGFSSLSLKPPAFINAPKKLHGLGL--IRASVAVEQQAGKTKVALLRIGTRGRMCSVS 83
           S P   SLSL   +F  +  K        IRAS+AVEQQ  + K AL+RIGTRG      
Sbjct: 15  SAPSNPSLSLFTSSFRFSSFKTSPFSKCRIRASLAVEQQTQQNKTALIRIGTRG------ 74

Query: 84  NHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADI 143
                       SPLALAQAHETRDKLMASH ELAE+GAIQIV+IKTTGDKILSQPLADI
Sbjct: 75  ------------SPLALAQAHETRDKLMASHTELAEEGAIQIVIIKTTGDKILSQPLADI 134

Query: 144 GGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISLRAGSL 203
           GGKGLFTKEID+ALINGDIDIAVHSMKDVPTYLPE+TILPCNLPREDVRDAFISL A SL
Sbjct: 135 GGKGLFTKEIDEALINGDIDIAVHSMKDVPTYLPEETILPCNLPREDVRDAFISLSAASL 194

Query: 204 AELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEGVVQATLLALAGLR 263
           A+LPAGSV+GTASLRRKSQ+L+RYPSL V +NFRGNVQTRL+KL+EGVV+ATLLALAGL+
Sbjct: 195 ADLPAGSVIGTASLRRKSQILHRYPSLTVQDNFRGNVQTRLRKLSEGVVKATLLALAGLK 254

Query: 264 RLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHEETRLAVVCERAFL 323
           RLNMTENVTS LSID+MLPAVAQGAIGIACRS+DDKMAEYLASLNHEETRLA+ CERAFL
Sbjct: 255 RLNMTENVTSTLSIDDMLPAVAQGAIGIACRSNDDKMAEYLASLNHEETRLAISCERAFL 314

Query: 324 ETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYAIDDMIAMGKDAGK 383
            TLDGSCRTPIAGYASRD+DGNC+F+GLVASPDGTRVLETSR GSY  +DM+ +GKDAG+
Sbjct: 315 TTLDGSCRTPIAGYASRDKDGNCLFRGLVASPDGTRVLETSRIGSYTYEDMMKIGKDAGE 369

Query: 384 ELLSRAGPGFFDS 395
           ELLSRAGPGFF+S
Sbjct: 375 ELLSRAGPGFFNS 369
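
The percentages in a hit summary are the identity and positive counts divided by the alignment length. A quick Python check using the figures reported for HEM3_PEA follows. The helper name percent is an assumption for illustration.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)

# Values from the HEM3_PEA hit: 291 identical and 320 positive positions
# over an alignment of 373 columns.
print(percent(291, 373))
print(percent(320, 373))
```

Both values reproduce the reported 78.02% and 85.79%.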

BLAST of Cp4.1LG03g04790 vs. Swiss-Prot
Match: HEM3_ARATH (Porphobilinogen deaminase, chloroplastic OS=Arabidopsis thaliana GN=HEMC PE=1 SV=1)

HSP 1 Score: 537.3 bits (1383), Expect = 1.4e-151
Identity = 286/399 (71.68%), Positives = 319/399 (79.95%), Query Frame = 1

Query: 2   VALSSPSVSH--------SPMARPCNLASLSLPGFSSLSLKPPAFINAPKKLHGLGLIRA 61
           +A SS S +H        S     C+L S+S  GFS   +  PA     +K    G ++A
Sbjct: 3   IASSSLSQAHKVVLTRQPSSRVNTCSLGSVSAIGFSLPQISSPALGKCRRKQSSSGFVKA 62

Query: 62  SVAVEQQAGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHP 121
            VAVEQ   KT+ A++RIGTRG                  SPLALAQA+ETR+KL   HP
Sbjct: 63  CVAVEQ---KTRTAIIRIGTRG------------------SPLALAQAYETREKLKKKHP 122

Query: 122 ELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTY 181
           EL EDGAI I +IKTTGDKILSQPLADIGGKGLFTKEID+ALING IDIAVHSMKDVPTY
Sbjct: 123 ELVEDGAIHIEIIKTTGDKILSQPLADIGGKGLFTKEIDEALINGHIDIAVHSMKDVPTY 182

Query: 182 LPEKTILPCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLEN 241
           LPEKTILPCNLPREDVRDAFI L A +LAELPAGSVVGTASLRRKSQ+L++YP+L V EN
Sbjct: 183 LPEKTILPCNLPREDVRDAFICLTAATLAELPAGSVVGTASLRRKSQILHKYPALHVEEN 242

Query: 242 FRGNVQTRLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRS 301
           FRGNVQTRL KL  G VQATLLALAGL+RL+MTENV SILS+DEMLPAVAQGAIGIACR+
Sbjct: 243 FRGNVQTRLSKLQGGKVQATLLALAGLKRLSMTENVASILSLDEMLPAVAQGAIGIACRT 302

Query: 302 DDDKMAEYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASP 361
           DDDKMA YLASLNHEETRLA+ CERAFLETLDGSCRTPIAGYAS+DE+GNCIF+GLVASP
Sbjct: 303 DDDKMATYLASLNHEETRLAISCERAFLETLDGSCRTPIAGYASKDEEGNCIFRGLVASP 362

Query: 362 DGTRVLETSRRGSYAIDDMIAMGKDAGKELLSRAGPGFF 393
           DGT+VLETSR+G Y  +DM+ MGKDAG+ELLSRAGPGFF
Sbjct: 363 DGTKVLETSRKGPYVYEDMVKMGKDAGQELLSRAGPGFF 380

BLAST of Cp4.1LG03g04790 vs. Swiss-Prot
Match: HEM3_ORYSJ (Porphobilinogen deaminase, chloroplastic OS=Oryza sativa subsp. japonica GN=HEMC PE=2 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 1.4e-135
Identity = 251/344 (72.97%), Positives = 285/344 (82.85%), Query Frame = 1

Query: 51  IRASVAVEQQAGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMA 110
           +RA+VAV+ +A + KV+L+RIGTRG                  SPLALAQAHETRDKL A
Sbjct: 33  VRAAVAVQAEA-QAKVSLIRIGTRG------------------SPLALAQAHETRDKLKA 92

Query: 111 SHPELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDV 170
           +H ELAE+GA++IV+IKTTGD IL +PLADIGGKGLFTKEIDDAL+ G IDIAVHSMKDV
Sbjct: 93  AHSELAEEGAVEIVIIKTTGDMILDKPLADIGGKGLFTKEIDDALLQGRIDIAVHSMKDV 152

Query: 171 PTYLPEKTILPCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEV 230
           PTYLPE TILPCNLPREDVRDAFI L A SLAELPAGSVVG+ASLRR+SQ+L +YPSL+V
Sbjct: 153 PTYLPEGTILPCNLPREDVRDAFICLTASSLAELPAGSVVGSASLRRQSQILYKYPSLKV 212

Query: 231 LENFRGNVQTRLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIA 290
           + NFRGNVQTRL+KL EG V ATLLALAGL+RLNM E  TS+LS+DEMLPAVAQGAIGIA
Sbjct: 213 V-NFRGNVQTRLRKLKEGDVHATLLALAGLKRLNMAETATSVLSVDEMLPAVAQGAIGIA 272

Query: 291 CRSDDDKMAEYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLV 350
           CRS DD M  YL+SLNHE+TRLAV CER FL  LDG+CRTPIA YASRD+DGNC F+GL+
Sbjct: 273 CRSSDDTMMNYLSSLNHEDTRLAVACEREFLSVLDGNCRTPIAAYASRDKDGNCSFRGLL 332

Query: 351 ASPDGTRVLETSRRGSYAIDDMIAMGKDAGKELLSRAGPGFFDS 395
           ASPDG+ V ETSR G Y  D M+ MGKDAG EL ++AGPGFFDS
Sbjct: 333 ASPDGSTVYETSRTGPYDFDIMVEMGKDAGHELKAKAGPGFFDS 356

BLAST of Cp4.1LG03g04790 vs. Swiss-Prot
Match: HEM3_NITWN (Porphobilinogen deaminase OS=Nitrobacter winogradskyi (strain Nb-255 / ATCC 25391) GN=hemC PE=3 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 6.7e-82
Identity = 169/316 (53.48%), Positives = 212/316 (67.09%), Query Frame = 1

Query: 77  MCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKTTGDKILSQ 136
           M S    + +A +    SPLALAQAHE RD+L  +H    E   I I  I+T+GD I  +
Sbjct: 1   MQSSDETDILATIGTRGSPLALAQAHEVRDRLARAHQVAPE--RIAIKTIRTSGDAIQDR 60

Query: 137 PLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISL 196
           PL D+GGKGLFTKEI++AL+ G ID AVHS KDVPT+LP+ T LP  LPREDVRD FIS 
Sbjct: 61  PLFDVGGKGLFTKEIEEALLAGTIDFAVHSSKDVPTFLPDATWLPAFLPREDVRDVFISP 120

Query: 197 RAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEGVVQATLLA 256
            AGSL +LPAG+ VGTASLRR++ +L   P L+V  + RGNV+TRL+K++ G   ATLLA
Sbjct: 121 HAGSLNDLPAGATVGTASLRRQAMVLKLRPDLKV-NSLRGNVETRLRKISVGEADATLLA 180

Query: 257 LAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHEETRLAVVC 316
           LAGL RL + +  T IL  DE LPAV QGAI I  R DDD++  ++ ++   ET +A+  
Sbjct: 181 LAGLNRLGLQDKATRILETDEFLPAVGQGAIAIESRRDDDRINAFVKAIGDPETEVALSA 240

Query: 317 ERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYAIDDMIAMG 376
           ER+FL  LDGSCRTPI G+   + D    F+GL+ SPDGT   ET+R G+ A  D  A+G
Sbjct: 241 ERSFLALLDGSCRTPIGGHCRVNGD-RIDFRGLIISPDGTEFYETTREGARA--DAAALG 300

Query: 377 KDAGKELLSRAGPGFF 393
            DA  EL  RAG  FF
Sbjct: 301 ADAAHELRERAGEKFF 310

BLAST of Cp4.1LG03g04790 vs. Swiss-Prot
Match: HEM3_PARDP (Porphobilinogen deaminase OS=Paracoccus denitrificans (strain Pd 1222) GN=hemC PE=3 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 6.7e-82
Identity = 173/325 (53.23%), Positives = 219/325 (67.38%), Query Frame = 1

Query: 69  LRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKT 128
           +RIGTRG                  S LALAQAHETRD+LMA+H   A+  A +IVVIKT
Sbjct: 11  IRIGTRG------------------SALALAQAHETRDRLMAAHGLAAD--AFRIVVIKT 70

Query: 129 TGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPRED 188
           TGD++L +PL +IGGKGLFT+EI+DAL+  +IDIAVHSMKD+PT  PE  ++ C LPRED
Sbjct: 71  TGDRVLDRPLKEIGGKGLFTREIEDALLAHEIDIAVHSMKDMPTIQPEGLVIDCYLPRED 130

Query: 189 VRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEG 248
           VRDAF+S +  +++ELP G+VVG++SLRR++QL  R P L+++E FRGNVQTRLKKL +G
Sbjct: 131 VRDAFVSAQFAAISELPQGAVVGSSSLRRRAQLAARRPDLKLVE-FRGNVQTRLKKLEDG 190

Query: 249 VVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHE 308
           V  AT LA+AGL RL M       +  DEMLPAVAQG IG+  R+DD + A  LA+++  
Sbjct: 191 VAVATFLAMAGLTRLGMLHVARGAVEPDEMLPAVAQGCIGVERRADDARTASLLAAISDR 250

Query: 309 ETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYA 368
           ++ L V  ERAFL  LDGSC+TPIAG A    D     +G +  PDG+ V+   R G  A
Sbjct: 251 DSALRVTAERAFLARLDGSCQTPIAGLAELQGD-RLRLRGEILRPDGSEVIAAERVGPAA 310

Query: 369 IDDMIAMGKDAGKELLSRAGPGFFD 394
             D  AMG D  +EL  RA   FFD
Sbjct: 311 --DGAAMGTDLAEELRGRAPADFFD 311

BLAST of Cp4.1LG03g04790 vs. TrEMBL
Match: A0A0A0L9F2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G682170 PE=3 SV=1)

HSP 1 Score: 600.5 bits (1547), Expect = 1.5e-168
Identity = 315/349 (90.26%), Positives = 322/349 (92.26%), Query Frame = 1

Query: 46  HGLGLIRASVAVEQQAGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETR 105
           HG+GLIRA VA EQQ  KTKVALLRIGTRG                  SPLALAQAHETR
Sbjct: 35  HGVGLIRA-VAAEQQVEKTKVALLRIGTRG------------------SPLALAQAHETR 94

Query: 106 DKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVH 165
           DKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVH
Sbjct: 95  DKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVH 154

Query: 166 SMKDVPTYLPEKTILPCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRY 225
           SMKDVPTYLPEKTILPCNLPREDVRDAFISL AGS AELPAGS++GTASLRRKSQLLNRY
Sbjct: 155 SMKDVPTYLPEKTILPCNLPREDVRDAFISLSAGSFAELPAGSIIGTASLRRKSQLLNRY 214

Query: 226 PSLEVLENFRGNVQTRLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQG 285
           PSL+VLENFRGNVQTRL+KLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQG
Sbjct: 215 PSLKVLENFRGNVQTRLRKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQG 274

Query: 286 AIGIACRSDDDKMAEYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCI 345
           AIGIACRSDDD MA YLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCI
Sbjct: 275 AIGIACRSDDDIMANYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCI 334

Query: 346 FKGLVASPDGTRVLETSRRGSYAIDDMIAMGKDAGKELLSRAGPGFFDS 395
           FKGLVASPDGTRVLETSRRG YAI+DMIAMGKDAG+ELLSRAGPGFFDS
Sbjct: 335 FKGLVASPDGTRVLETSRRGPYAIEDMIAMGKDAGQELLSRAGPGFFDS 364

BLAST of Cp4.1LG03g04790 vs. TrEMBL
Match: M5XYM7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007124mg PE=3 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 3.2e-163
Identity = 303/373 (81.23%), Positives = 330/373 (88.47%), Query Frame = 1

Query: 22  SLSLPGFSSLSLKPPAFINAPKKLHGLGLIRASVAVEQQAGKTKVALLRIGTRGRMCSVS 81
           S+S+PGFS  SLK  AF +  +K   +G+ RASVAVEQQ  K K+AL+RIGTRG      
Sbjct: 27  SVSVPGFSLPSLKTRAFPHCIRKHSAVGIPRASVAVEQQTQKAKLALIRIGTRG------ 86

Query: 82  NHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADI 141
                       SPLALAQAHETRDKLMASHP+LAE+GAIQIV+IKTTGDKILSQPLADI
Sbjct: 87  ------------SPLALAQAHETRDKLMASHPDLAEEGAIQIVIIKTTGDKILSQPLADI 146

Query: 142 GGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISLRAGSL 201
           GGKGLFTKEID+ALING+IDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISL A SL
Sbjct: 147 GGKGLFTKEIDEALINGEIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISLTASSL 206

Query: 202 AELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEGVVQATLLALAGLR 261
           A+LPAGS +GTASLRRKSQ+LNRYPSL VLENFRGNVQTRL+KLNE VVQATLLALAGL+
Sbjct: 207 ADLPAGSTIGTASLRRKSQILNRYPSLNVLENFRGNVQTRLRKLNEKVVQATLLALAGLK 266

Query: 262 RLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHEETRLAVVCERAFL 321
           RL+MTENVTSILS+DEMLPAVAQGAIGIACRS+DDKMA Y+ASLNHEETRLAV CERAFL
Sbjct: 267 RLDMTENVTSILSLDEMLPAVAQGAIGIACRSNDDKMANYIASLNHEETRLAVACERAFL 326

Query: 322 ETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYAIDDMIAMGKDAGK 381
            TLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSR+G+YA  DMI MGK+AG+
Sbjct: 327 LTLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRKGTYAFQDMINMGKEAGQ 381

Query: 382 ELLSRAGPGFFDS 395
           ELLS+AGPGFFDS
Sbjct: 387 ELLSQAGPGFFDS 381

BLAST of Cp4.1LG03g04790 vs. TrEMBL
Match: W9SZ70_9ROSA (Porphobilinogen deaminase OS=Morus notabilis GN=L484_007545 PE=3 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 7.0e-163
Identity = 304/382 (79.58%), Positives = 328/382 (85.86%), Query Frame = 1

Query: 14  MARPCNLASLSLPG-FSSLSLKPPAFINAPKKLHGLGLIRASVAVEQQAGKTKVALLRIG 73
           MARPC   S S  G  S L    P+     ++ HG+G+ RASVAVEQQ  K++VALLRIG
Sbjct: 17  MARPCFPVSFSSSGSVSVLGFSLPSLKTTSRRKHGIGVTRASVAVEQQTQKSRVALLRIG 76

Query: 74  TRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKTTGDK 133
           TRG                  SPLALAQAHETRDKL ASHPELAE+GAI+IV+IKTTGDK
Sbjct: 77  TRG------------------SPLALAQAHETRDKLKASHPELAEEGAIEIVIIKTTGDK 136

Query: 134 ILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDA 193
           ILSQPLADIGGKGLFTKEID+ALIN DIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDA
Sbjct: 137 ILSQPLADIGGKGLFTKEIDEALINSDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDA 196

Query: 194 FISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEGVVQA 253
           FISL A SLAELPAGS+VGTASLRRKSQ+L RYPSL+V +NFRGNVQTRL+KLNEGVVQA
Sbjct: 197 FISLSAASLAELPAGSIVGTASLRRKSQILYRYPSLKVEDNFRGNVQTRLRKLNEGVVQA 256

Query: 254 TLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHEETRL 313
           TLLALAGL+RLNMTENVT ILSIDEMLPAVAQGAIGIACRSDDDKMA Y+ASLNHEETRL
Sbjct: 257 TLLALAGLKRLNMTENVTCILSIDEMLPAVAQGAIGIACRSDDDKMASYIASLNHEETRL 316

Query: 314 AVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYAIDDM 373
           A+ CERAFL  LDGSCRTPIAGYAS+DEDGNCIFKGLVASPDGTRVLETSR+G YA +DM
Sbjct: 317 AIACERAFLTKLDGSCRTPIAGYASKDEDGNCIFKGLVASPDGTRVLETSRKGPYAFEDM 376

Query: 374 IAMGKDAGKELLSRAGPGFFDS 395
           + MGKDAG+ELLSRAGPGFFDS
Sbjct: 377 MKMGKDAGEELLSRAGPGFFDS 380

BLAST of Cp4.1LG03g04790 vs. TrEMBL
Match: B9HGK9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s07680g PE=3 SV=2)

HSP 1 Score: 568.2 bits (1463), Expect = 8.0e-159
Identity = 305/399 (76.44%), Positives = 332/399 (83.21%), Query Frame = 1

Query: 1   MVALSSPSVSHSPMARP------CNLASLSLPGFSSLSLKPPAFINAPKKLHGLGLIRAS 60
           M  LSS   S + M+RP      C   S+S  GFS   LK  AF    KK   L  ++AS
Sbjct: 1   METLSSLCTSQALMSRPSSPAIFCTSGSVSFTGFS---LKTQAF---SKKKQTLSFVKAS 60

Query: 61  VAVEQQAGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHPE 120
           VAVEQQ  + KVAL+RIGTRG                  SPLALAQAHETRDKLMASH +
Sbjct: 61  VAVEQQTQEAKVALIRIGTRG------------------SPLALAQAHETRDKLMASHSD 120

Query: 121 LAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYL 180
           LAE+GAIQIV+IKTTGDKI SQPLADIGGKGLFTKEID+ALINGDIDIAVHSMKDVPTYL
Sbjct: 121 LAEEGAIQIVIIKTTGDKIQSQPLADIGGKGLFTKEIDEALINGDIDIAVHSMKDVPTYL 180

Query: 181 PEKTILPCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLENF 240
           PEKTILPCNLPREDVRDAFISL A SLA+LPAGS++GTASLRRKSQ+L+RYPSL V ENF
Sbjct: 181 PEKTILPCNLPREDVRDAFISLSAASLADLPAGSIIGTASLRRKSQILHRYPSLSVEENF 240

Query: 241 RGNVQTRLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSD 300
           RGNVQTRL+KLNEGVV+ATLLALAGL+RLNMTENVTSIL +D+MLPAVAQGAIGIACRS+
Sbjct: 241 RGNVQTRLRKLNEGVVKATLLALAGLKRLNMTENVTSILPLDDMLPAVAQGAIGIACRSN 300

Query: 301 DDKMAEYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPD 360
           DDKM  YLASLNHEETRLAV CERAFLETLDGSCRTPIAGYA +DE+G+CIFKGLVASPD
Sbjct: 301 DDKMVNYLASLNHEETRLAVACERAFLETLDGSCRTPIAGYARKDENGDCIFKGLVASPD 360

Query: 361 GTRVLETSRRGSYAIDDMIAMGKDAGKELLSRAGPGFFD 394
           G RVLETSR+G YA DDMIAMGKDAGKELLS+AGPGFFD
Sbjct: 361 GRRVLETSRKGPYAFDDMIAMGKDAGKELLSQAGPGFFD 375

BLAST of Cp4.1LG03g04790 vs. TrEMBL
Match: B9S2Z0_RICCO (Porphobilinogen deaminase, putative OS=Ricinus communis GN=RCOM_1534040 PE=3 SV=1)

HSP 1 Score: 567.4 bits (1461), Expect = 1.4e-158
Identity = 305/394 (77.41%), Positives = 329/394 (83.50%), Query Frame = 1

Query: 1   MVALSSPSVSHSPMARPCNLASLSLPGFSSLSLKPPAFINAPKKLHGLGLIRASVAVEQQ 60
           M  LSS S     M       S+S+ G S    K P  I    K   L + RASVAVEQQ
Sbjct: 1   MDTLSSISTMQGLMVPRPAAVSVSVLGSSLPQFKSPNCI----KKQSLRITRASVAVEQQ 60

Query: 61  AGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGA 120
               KVAL+RIGTRG                  SPLALAQAHETRDKLMA H ELAE+GA
Sbjct: 61  TQDPKVALIRIGTRG------------------SPLALAQAHETRDKLMAKHSELAEEGA 120

Query: 121 IQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTIL 180
           IQIV+IKTTGDKIL+QPLADIGGKGLFTKEID+ALING+IDIAVHSMKDVPTYLPEKTIL
Sbjct: 121 IQIVIIKTTGDKILTQPLADIGGKGLFTKEIDEALINGEIDIAVHSMKDVPTYLPEKTIL 180

Query: 181 PCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQT 240
           PCNLPREDVRDAFISL A SLAELP+GSV+GTASLRRKSQ+L+RYPSL VLENFRGNVQT
Sbjct: 181 PCNLPREDVRDAFISLSASSLAELPSGSVIGTASLRRKSQILHRYPSLSVLENFRGNVQT 240

Query: 241 RLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAE 300
           RL+KLNEGVVQATLLALAGL+RLNMTENVTS+LSID+MLPAVAQGAIGIACRS+DDKMA 
Sbjct: 241 RLRKLNEGVVQATLLALAGLKRLNMTENVTSVLSIDDMLPAVAQGAIGIACRSNDDKMAN 300

Query: 301 YLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLE 360
           YLASLNHEETRLAV CERAFLETLDGSCRTPIAGYAS+DE+G+CIFKGLVASPDGTRVLE
Sbjct: 301 YLASLNHEETRLAVACERAFLETLDGSCRTPIAGYASKDENGDCIFKGLVASPDGTRVLE 360

Query: 361 TSRRGSYAIDDMIAMGKDAGKELLSRAGPGFFDS 395
           TSR+G YA+DDMI MGKDAGKELL +AGPGFFDS
Sbjct: 361 TSRKGPYALDDMIMMGKDAGKELLLQAGPGFFDS 372

BLAST of Cp4.1LG03g04790 vs. TAIR10
Match: AT5G08280.1 (AT5G08280.1 hydroxymethylbilane synthase)

HSP 1 Score: 537.3 bits (1383), Expect = 7.7e-153
Identity = 286/399 (71.68%), Positives = 319/399 (79.95%), Query Frame = 1

Query: 2   VALSSPSVSH--------SPMARPCNLASLSLPGFSSLSLKPPAFINAPKKLHGLGLIRA 61
           +A SS S +H        S     C+L S+S  GFS   +  PA     +K    G ++A
Sbjct: 3   IASSSLSQAHKVVLTRQPSSRVNTCSLGSVSAIGFSLPQISSPALGKCRRKQSSSGFVKA 62

Query: 62  SVAVEQQAGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHP 121
            VAVEQ   KT+ A++RIGTRG                  SPLALAQA+ETR+KL   HP
Sbjct: 63  CVAVEQ---KTRTAIIRIGTRG------------------SPLALAQAYETREKLKKKHP 122

Query: 122 ELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTY 181
           EL EDGAI I +IKTTGDKILSQPLADIGGKGLFTKEID+ALING IDIAVHSMKDVPTY
Sbjct: 123 ELVEDGAIHIEIIKTTGDKILSQPLADIGGKGLFTKEIDEALINGHIDIAVHSMKDVPTY 182

Query: 182 LPEKTILPCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLEN 241
           LPEKTILPCNLPREDVRDAFI L A +LAELPAGSVVGTASLRRKSQ+L++YP+L V EN
Sbjct: 183 LPEKTILPCNLPREDVRDAFICLTAATLAELPAGSVVGTASLRRKSQILHKYPALHVEEN 242

Query: 242 FRGNVQTRLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRS 301
           FRGNVQTRL KL  G VQATLLALAGL+RL+MTENV SILS+DEMLPAVAQGAIGIACR+
Sbjct: 243 FRGNVQTRLSKLQGGKVQATLLALAGLKRLSMTENVASILSLDEMLPAVAQGAIGIACRT 302

Query: 302 DDDKMAEYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASP 361
           DDDKMA YLASLNHEETRLA+ CERAFLETLDGSCRTPIAGYAS+DE+GNCIF+GLVASP
Sbjct: 303 DDDKMATYLASLNHEETRLAISCERAFLETLDGSCRTPIAGYASKDEEGNCIFRGLVASP 362

Query: 362 DGTRVLETSRRGSYAIDDMIAMGKDAGKELLSRAGPGFF 393
           DGT+VLETSR+G Y  +DM+ MGKDAG+ELLSRAGPGFF
Sbjct: 363 DGTKVLETSRKGPYVYEDMVKMGKDAGQELLSRAGPGFF 380

BLAST of Cp4.1LG03g04790 vs. NCBI nr
Match: gi|449464030|ref|XP_004149732.1| (PREDICTED: porphobilinogen deaminase, chloroplastic [Cucumis sativus])

HSP 1 Score: 660.2 bits (1702), Expect = 2.2e-186
Identity = 348/394 (88.32%), Positives = 356/394 (90.36%), Query Frame = 1

Query: 1   MVALSSPSVSHSPMARPCNLASLSLPGFSSLSLKPPAFINAPKKLHGLGLIRASVAVEQQ 60
           M  LSSPSVSHSPM RPCN+ SLS  GFSSLSLKPP F N  KK HG+GLIRA VA EQQ
Sbjct: 1   MGVLSSPSVSHSPMPRPCNVGSLSFLGFSSLSLKPPTFSNGAKKFHGVGLIRA-VAAEQQ 60

Query: 61  AGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGA 120
             KTKVALLRIGTRG                  SPLALAQAHETRDKLMASHPELAEDGA
Sbjct: 61  VEKTKVALLRIGTRG------------------SPLALAQAHETRDKLMASHPELAEDGA 120

Query: 121 IQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTIL 180
           IQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTIL
Sbjct: 121 IQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTIL 180

Query: 181 PCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQT 240
           PCNLPREDVRDAFISL AGS AELPAGS++GTASLRRKSQLLNRYPSL+VLENFRGNVQT
Sbjct: 181 PCNLPREDVRDAFISLSAGSFAELPAGSIIGTASLRRKSQLLNRYPSLKVLENFRGNVQT 240

Query: 241 RLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAE 300
           RL+KLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDD MA 
Sbjct: 241 RLRKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDIMAN 300

Query: 301 YLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLE 360
           YLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLE
Sbjct: 301 YLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLE 360

Query: 361 TSRRGSYAIDDMIAMGKDAGKELLSRAGPGFFDS 395
           TSRRG YAI+DMIAMGKDAG+ELLSRAGPGFFDS
Sbjct: 361 TSRRGPYAIEDMIAMGKDAGQELLSRAGPGFFDS 375

BLAST of Cp4.1LG03g04790 vs. NCBI nr
Match: gi|700203422|gb|KGN58555.1| (hypothetical protein Csa_3G682170 [Cucumis sativus])

HSP 1 Score: 600.5 bits (1547), Expect = 2.1e-168
Identity = 315/349 (90.26%), Positives = 322/349 (92.26%), Query Frame = 1

Query: 46  HGLGLIRASVAVEQQAGKTKVALLRIGTRGRMCSVSNHEFVADLRDNDSPLALAQAHETR 105
           HG+GLIRA VA EQQ  KTKVALLRIGTRG                  SPLALAQAHETR
Sbjct: 35  HGVGLIRA-VAAEQQVEKTKVALLRIGTRG------------------SPLALAQAHETR 94

Query: 106 DKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVH 165
           DKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVH
Sbjct: 95  DKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADIGGKGLFTKEIDDALINGDIDIAVH 154

Query: 166 SMKDVPTYLPEKTILPCNLPREDVRDAFISLRAGSLAELPAGSVVGTASLRRKSQLLNRY 225
           SMKDVPTYLPEKTILPCNLPREDVRDAFISL AGS AELPAGS++GTASLRRKSQLLNRY
Sbjct: 155 SMKDVPTYLPEKTILPCNLPREDVRDAFISLSAGSFAELPAGSIIGTASLRRKSQLLNRY 214

Query: 226 PSLEVLENFRGNVQTRLKKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQG 285
           PSL+VLENFRGNVQTRL+KLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQG
Sbjct: 215 PSLKVLENFRGNVQTRLRKLNEGVVQATLLALAGLRRLNMTENVTSILSIDEMLPAVAQG 274

Query: 286 AIGIACRSDDDKMAEYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCI 345
           AIGIACRSDDD MA YLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCI
Sbjct: 275 AIGIACRSDDDIMANYLASLNHEETRLAVVCERAFLETLDGSCRTPIAGYASRDEDGNCI 334

Query: 346 FKGLVASPDGTRVLETSRRGSYAIDDMIAMGKDAGKELLSRAGPGFFDS 395
           FKGLVASPDGTRVLETSRRG YAI+DMIAMGKDAG+ELLSRAGPGFFDS
Sbjct: 335 FKGLVASPDGTRVLETSRRGPYAIEDMIAMGKDAGQELLSRAGPGFFDS 364

BLAST of Cp4.1LG03g04790 vs. NCBI nr
Match: gi|596287238|ref|XP_007225744.1| (hypothetical protein PRUPE_ppa007124mg [Prunus persica])

HSP 1 Score: 582.8 bits (1501), Expect = 4.5e-163
Identity = 303/373 (81.23%), Positives = 330/373 (88.47%), Query Frame = 1

Query: 22  SLSLPGFSSLSLKPPAFINAPKKLHGLGLIRASVAVEQQAGKTKVALLRIGTRGRMCSVS 81
           S+S+PGFS  SLK  AF +  +K   +G+ RASVAVEQQ  K K+AL+RIGTRG      
Sbjct: 27  SVSVPGFSLPSLKTRAFPHCIRKHSAVGIPRASVAVEQQTQKAKLALIRIGTRG------ 86

Query: 82  NHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADI 141
                       SPLALAQAHETRDKLMASHP+LAE+GAIQIV+IKTTGDKILSQPLADI
Sbjct: 87  ------------SPLALAQAHETRDKLMASHPDLAEEGAIQIVIIKTTGDKILSQPLADI 146

Query: 142 GGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISLRAGSL 201
           GGKGLFTKEID+ALING+IDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISL A SL
Sbjct: 147 GGKGLFTKEIDEALINGEIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISLTASSL 206

Query: 202 AELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEGVVQATLLALAGLR 261
           A+LPAGS +GTASLRRKSQ+LNRYPSL VLENFRGNVQTRL+KLNE VVQATLLALAGL+
Sbjct: 207 ADLPAGSTIGTASLRRKSQILNRYPSLNVLENFRGNVQTRLRKLNEKVVQATLLALAGLK 266

Query: 262 RLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHEETRLAVVCERAFL 321
           RL+MTENVTSILS+DEMLPAVAQGAIGIACRS+DDKMA Y+ASLNHEETRLAV CERAFL
Sbjct: 267 RLDMTENVTSILSLDEMLPAVAQGAIGIACRSNDDKMANYIASLNHEETRLAVACERAFL 326

Query: 322 ETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYAIDDMIAMGKDAGK 381
            TLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSR+G+YA  DMI MGK+AG+
Sbjct: 327 LTLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRKGTYAFQDMINMGKEAGQ 381

Query: 382 ELLSRAGPGFFDS 395
           ELLS+AGPGFFDS
Sbjct: 387 ELLSQAGPGFFDS 381

BLAST of Cp4.1LG03g04790 vs. NCBI nr
Match: gi|703160461|ref|XP_010112535.1| (Porphobilinogen deaminase [Morus notabilis])

HSP 1 Score: 581.6 bits (1498), Expect = 1.0e-162
Identity = 304/382 (79.58%), Positives = 328/382 (85.86%), Query Frame = 1

Query: 14  MARPCNLASLSLPG-FSSLSLKPPAFINAPKKLHGLGLIRASVAVEQQAGKTKVALLRIG 73
           MARPC   S S  G  S L    P+     ++ HG+G+ RASVAVEQQ  K++VALLRIG
Sbjct: 17  MARPCFPVSFSSSGSVSVLGFSLPSLKTTSRRKHGIGVTRASVAVEQQTQKSRVALLRIG 76

Query: 74  TRGRMCSVSNHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKTTGDK 133
           TRG                  SPLALAQAHETRDKL ASHPELAE+GAI+IV+IKTTGDK
Sbjct: 77  TRG------------------SPLALAQAHETRDKLKASHPELAEEGAIEIVIIKTTGDK 136

Query: 134 ILSQPLADIGGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDA 193
           ILSQPLADIGGKGLFTKEID+ALIN DIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDA
Sbjct: 137 ILSQPLADIGGKGLFTKEIDEALINSDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDA 196

Query: 194 FISLRAGSLAELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEGVVQA 253
           FISL A SLAELPAGS+VGTASLRRKSQ+L RYPSL+V +NFRGNVQTRL+KLNEGVVQA
Sbjct: 197 FISLSAASLAELPAGSIVGTASLRRKSQILYRYPSLKVEDNFRGNVQTRLRKLNEGVVQA 256

Query: 254 TLLALAGLRRLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHEETRL 313
           TLLALAGL+RLNMTENVT ILSIDEMLPAVAQGAIGIACRSDDDKMA Y+ASLNHEETRL
Sbjct: 257 TLLALAGLKRLNMTENVTCILSIDEMLPAVAQGAIGIACRSDDDKMASYIASLNHEETRL 316

Query: 314 AVVCERAFLETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYAIDDM 373
           A+ CERAFL  LDGSCRTPIAGYAS+DEDGNCIFKGLVASPDGTRVLETSR+G YA +DM
Sbjct: 317 AIACERAFLTKLDGSCRTPIAGYASKDEDGNCIFKGLVASPDGTRVLETSRKGPYAFEDM 376

Query: 374 IAMGKDAGKELLSRAGPGFFDS 395
           + MGKDAG+ELLSRAGPGFFDS
Sbjct: 377 MKMGKDAGEELLSRAGPGFFDS 380

BLAST of Cp4.1LG03g04790 vs. NCBI nr
Match: gi|645228588|ref|XP_008221067.1| (PREDICTED: porphobilinogen deaminase, chloroplastic [Prunus mume])

HSP 1 Score: 578.9 bits (1491), Expect = 6.5e-162
Identity = 302/373 (80.97%), Positives = 327/373 (87.67%), Query Frame = 1

Query: 22  SLSLPGFSSLSLKPPAFINAPKKLHGLGLIRASVAVEQQAGKTKVALLRIGTRGRMCSVS 81
           S+S+PGFS  SLK  AF    +K   +G+ RASVAVEQQ  K K+AL+RIGTRG      
Sbjct: 22  SVSVPGFSLPSLKTRAFPVCIRKHSAVGIPRASVAVEQQTQKAKLALIRIGTRG------ 81

Query: 82  NHEFVADLRDNDSPLALAQAHETRDKLMASHPELAEDGAIQIVVIKTTGDKILSQPLADI 141
                       SPLALAQAHETRDKLMASHP+LAE+GAIQIV+IKTTGDKILSQPLADI
Sbjct: 82  ------------SPLALAQAHETRDKLMASHPDLAEEGAIQIVIIKTTGDKILSQPLADI 141

Query: 142 GGKGLFTKEIDDALINGDIDIAVHSMKDVPTYLPEKTILPCNLPREDVRDAFISLRAGSL 201
           GGKGLFTKEID+ALING+IDIAVHSMKDVPTYLP+KTILPCNLPREDVRDAFISL   SL
Sbjct: 142 GGKGLFTKEIDEALINGEIDIAVHSMKDVPTYLPDKTILPCNLPREDVRDAFISLTVSSL 201

Query: 202 AELPAGSVVGTASLRRKSQLLNRYPSLEVLENFRGNVQTRLKKLNEGVVQATLLALAGLR 261
           A+LPAGS +GTASLRRKSQ+LNRYPSL VLENFRGNVQTRL+KLNE VVQATLLALAGL+
Sbjct: 202 ADLPAGSTIGTASLRRKSQILNRYPSLNVLENFRGNVQTRLRKLNEKVVQATLLALAGLK 261

Query: 262 RLNMTENVTSILSIDEMLPAVAQGAIGIACRSDDDKMAEYLASLNHEETRLAVVCERAFL 321
           RL+MTENVTSILS+DEMLPAVAQGAIGIACRS+DDKMA Y+ASLNHEETRLAV CERAFL
Sbjct: 262 RLDMTENVTSILSLDEMLPAVAQGAIGIACRSNDDKMANYIASLNHEETRLAVACERAFL 321

Query: 322 ETLDGSCRTPIAGYASRDEDGNCIFKGLVASPDGTRVLETSRRGSYAIDDMIAMGKDAGK 381
            TLDGSCRTPIAGYASRD DGNCIFKGLVASPDGTRVLETSR+G+YA  DMI MGKDAG+
Sbjct: 322 LTLDGSCRTPIAGYASRDGDGNCIFKGLVASPDGTRVLETSRKGTYAFQDMINMGKDAGQ 376

Query: 382 ELLSRAGPGFFDS 395
           ELLSRAGPGFFDS
Sbjct: 382 ELLSRAGPGFFDS 376
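
The percentages in each HSP summary line are the stated counts divided by the alignment length. The following is a minimal sketch, not part of the database output, of how such a line could be parsed and the percentages rechecked. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions; the input string is copied from the Prunus mume hit above.

```python
import re

def parse_hsp_stats(line):
    """Return {name: (count, length, percent)} for each 'Name = a/b (p%)' field.

    Hypothetical helper: fields without an a/b ratio (e.g. 'Query Frame = 1')
    are ignored.
    """
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    return {
        name: (int(count), int(length), float(pct))
        for name, count, length, pct in re.findall(pattern, line)
    }

# Summary line of the Prunus mume HSP shown above.
hsp_line = "Identity = 302/373 (80.97%), Positives = 327/373 (87.67%), Query Frame = 1"
stats = parse_hsp_stats(hsp_line)

# Recompute each percentage from its count and the alignment length.
recomputed = {
    name: round(100 * count / length, 2)
    for name, (count, length, _) in stats.items()
}
```

Comparing `recomputed[name]` with the third element of `stats[name]` gives a quick consistency check on a reported HSP line.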

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
HEM3_PEA	1.6e-155	78.02	Porphobilinogen deaminase, chloroplastic OS=Pisum sativum GN=HEMC PE=1 SV=1
HEM3_ARATH	1.4e-151	71.68	Porphobilinogen deaminase, chloroplastic OS=Arabidopsis thaliana GN=HEMC PE=1 SV...
HEM3_ORYSJ	1.4e-135	72.97	Porphobilinogen deaminase, chloroplastic OS=Oryza sativa subsp. japonica GN=HEMC...
HEM3_NITWN	6.7e-82	53.48	Porphobilinogen deaminase OS=Nitrobacter winogradskyi (strain Nb-255 / ATCC 2539...
HEM3_PARDP	6.7e-82	53.23	Porphobilinogen deaminase OS=Paracoccus denitrificans (strain Pd 1222) GN=hemC P...

Match Name	E-value	Identity (%)	Description
A0A0A0L9F2_CUCSA	1.5e-168	90.26	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G682170 PE=3 SV=1
M5XYM7_PRUPE	3.2e-163	81.23	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007124mg PE=3 SV=1
W9SZ70_9ROSA	7.0e-163	79.58	Porphobilinogen deaminase OS=Morus notabilis GN=L484_007545 PE=3 SV=1
B9HGK9_POPTR	8.0e-159	76.44	Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s07680g PE=3 SV=2
B9S2Z0_RICCO	1.4e-158	77.41	Porphobilinogen deaminase, putative OS=Ricinus communis GN=RCOM_1534040 PE=3 SV=...

Match Name	E-value	Identity (%)	Description
AT5G08280.1	7.7e-153	71.68	hydroxymethylbilane synthase

Match Name	E-value	Identity (%)	Description
gi|449464030|ref|XP_004149732.1|	2.2e-186	88.32	PREDICTED: porphobilinogen deaminase, chloroplastic [Cucumis sativus]
gi|700203422|gb|KGN58555.1|	2.1e-168	90.26	hypothetical protein Csa_3G682170 [Cucumis sativus]
gi|596287238|ref|XP_007225744.1|	4.5e-163	81.23	hypothetical protein PRUPE_ppa007124mg [Prunus persica]
gi|703160461|ref|XP_010112535.1|	1.0e-162	79.58	Porphobilinogen deaminase [Morus notabilis]
gi|645228588|ref|XP_008221067.1|	6.5e-162	80.97	PREDICTED: porphobilinogen deaminase, chloroplastic [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term	Definition
GO:0018160	peptidyl-pyrromethane cofactor linkage
GO:0033014	tetrapyrrole biosynthetic process
Vocabulary: Molecular Function
Term	Definition
GO:0004418	hydroxymethylbilane synthase activity
Vocabulary: INTERPRO
Term	Definition
IPR022419	Porphobilin_deaminase_cofac_BS
IPR022418	Porphobilinogen_deaminase_C
IPR022417	Porphobilin_deaminase_N
IPR000860	HemC
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009073 aromatic amino acid family biosynthetic process
biological_process GO:0019684 photosynthesis, light reaction
biological_process GO:0006744 ubiquinone biosynthetic process
biological_process GO:0033014 tetrapyrrole biosynthetic process
biological_process GO:0009697 salicylic acid biosynthetic process
biological_process GO:0006364 rRNA processing
biological_process GO:0009409 response to cold
biological_process GO:0030154 cell differentiation
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0018160 peptidyl-pyrromethane cofactor linkage
biological_process GO:0006098 pentose-phosphate shunt
biological_process GO:0009965 leaf morphogenesis
biological_process GO:0019288 isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway
biological_process GO:0009814 defense response, incompatible interaction
biological_process GO:0019344 cysteine biosynthetic process
biological_process GO:0015994 chlorophyll metabolic process
cellular_component GO:0048046 apoplast
cellular_component GO:0009941 chloroplast envelope
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0005575 cellular_component
molecular_function GO:0004418 hydroxymethylbilane synthase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG03g04790.1	Cp4.1LG03g04790.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000860	Porphobilinogen deaminase	PRINTS	PR00151	PORPHBDMNASE	coord: 130..150, score: 2.0E-44; coord: 161..180, score: 2.0E-44; coord: 209..226, score: 2.0E-44; coord: 229..246, score: 2.0E-44; coord: 316..333, score: 2.0
IPR000860	Porphobilinogen deaminase	HAMAP	MF_00260	Porphobil_deam	coord: 86..386, score: 32
IPR000860	Porphobilinogen deaminase	PANTHER	PTHR11557	PORPHOBILINOGEN DEAMINASE	coord: 94..393, score: 2.3E-163; coord: 47..75, score: 2.3E
IPR000860	Porphobilinogen deaminase	TIGRFAMs	TIGR00212	TIGR00212	coord: 93..385, score: 1.3
IPR022417	Porphobilinogen deaminase, N-terminal	PFAM	PF01379	Porphobil_deam	coord: 93..299, score: 1.7
IPR022418	Porphobilinogen deaminase, C-terminal	GENE3D	G3DSA:3.30.160.40		coord: 307..388, score: 6.2
IPR022418	Porphobilinogen deaminase, C-terminal	PFAM	PF03900	Porphobil_deamC	coord: 312..384, score: 2.2
IPR022418	Porphobilinogen deaminase, C-terminal	unknown	SSF54782	Porphobilinogen deaminase (hydroxymethylbilane synthase), C-terminal domain	coord: 306..393, score: 1.24
IPR022419	Porphobilinogen deaminase, dipyrromethane cofactor binding site	PROSITE	PS00533	PORPHOBILINOGEN_DEAM	coord: 317..333, scor
None	No IPR available	GENE3D	G3DSA:3.40.190.10		coord: 196..285, score: 8.6E-33; coord: 93..195, score: 4.9
None	No IPR available	unknown	SSF53850	Periplasmic binding protein-like II	coord: 93..304, score: 2.82

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG03g04790	Cp4.1LG02g15840	Cucurbita pepo (Zucchini)	cpecpeB449