Homology
BLAST of HG10009343 vs. NCBI nr
Match:
CAB4301873.1 (unnamed protein product [Prunus armeniaca])
HSP 1 Score: 1004.2 bits (2595), Expect = 5.9e-289
Identity = 496/751 (66.05%), Postives = 584/751 (77.76%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILCG P++ECV CL C RW WKRCLHTAGHDSETWG AT +EFEP+PR+CRYILAVYE
Sbjct: 14 MSILCGCPLIECVYCLACTRWAWKRCLHTAGHDSETWGIATAEEFEPVPRLCRYILAVYE 73
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R+PLWEP GGYGI PDWL++KKTY+DTQG APPYILYLDHDHADIVLA RGLN+A+E
Sbjct: 74 DDLRQPLWEPPGGYGIKPDWLILKKTYEDTQGQAPPYILYLDHDHADIVLAFRGLNLARE 133
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVL+DN+LGKKKFDGGYVHNGLLKAA WVLD E E LKDLV+KYP+YTLTF GHSLG
Sbjct: 134 SDYAVLMDNKLGKKKFDGGYVHNGLLKAAEWVLDAECENLKDLVEKYPNYTLTFTGHSLG 193
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+VVVQ+ ++L NIDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ
Sbjct: 194 SGVAALLTMVVVQSRDRLGNIDRKRVRGYAIAPARCVSLNLAVRYADVINSVVLQ----- 253
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
TPLEDIFKSLFCLPCLLC+RC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR G
Sbjct: 254 -ATTPLEDIFKSLFCLPCLLCIRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRLG 313
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D++MEIPP+QKM
Sbjct: 314 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEREAQRALKLMLEKDQIMEIPPKQKM 373
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGGSSRRK 420
ERQ+TLA+EH+EEY+AALQRAVTLAVPHAY+ S YGTF + D EEE S SSG SS
Sbjct: 374 ERQETLAKEHTEEYRAALQRAVTLAVPHAYSPSMYGTFDEKD--EEEHSYGSSGESS--- 433
Query: 421 KETWDELIERLYDKDDSRHAVLKKSLSTTAFSFSTCLAQSNLISGRKGLRDQLLDRP--- 480
+ KKS T +A+S RK LR + ++
Sbjct: 434 ------------------FSSAKKS--------KTFVARS-----RKELRSEEANKETFI 493
Query: 481 -LSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN 540
+S HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+ LA E+N N
Sbjct: 494 HFGHSVHSNRIDPSRAVQLSWRPRVFLYQGFLSDEECDHLVSLAHGGEENSLTEYDDLGN 553
Query: 541 TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYG 600
T + L S + LN D+I++RIE RI+ WT LPK++S Q+ + EEAE F+G
Sbjct: 554 TNTIRLRKSLQIPLNMEDEIVSRIEERISAWTFLPKENSRALQVSRNGVEEAEKNLNFFG 613
Query: 601 NRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILF 660
N+S + SEPL+ATV+LY+S+ RGGE+LFPES+++S+ WS K ++ L P KGNAILF
Sbjct: 614 NKSTLEQSEPLIATVILYISNVTRGGEILFPESELRSEVWSDCGKSSSILKPTKGNAILF 673
Query: 661 FSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQ 720
F++ NASPDKSS H+R P+L GE+W ATKF Y + G K + +S+ C DED +CP
Sbjct: 674 FTLRPNASPDKSSPHSRCPVLEGEMWCATKFIYAKAIGGEKVSPDSESSECTDEDDNCPN 722
Query: 721 WAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
WA+IGEC+RN VFM+GSPDYYGTCRKSCN C
Sbjct: 734 WASIGECQRNPVFMVGSPDYYGTCRKSCNVC 722
BLAST of HG10009343 vs. NCBI nr
Match:
CAB4271435.1 (unnamed protein product [Prunus armeniaca])
HSP 1 Score: 1003.4 bits (2593), Expect = 1.0e-288
Identity = 496/751 (66.05%), Postives = 584/751 (77.76%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILCG P++ECV CL C RW WKRCLHTAGHDSETWG AT +EFEP+PR+CRYILAVYE
Sbjct: 1 MSILCGCPLIECVYCLACTRWAWKRCLHTAGHDSETWGIATAEEFEPVPRLCRYILAVYE 60
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R+PLWEP GGYGI PDWL++KKTY+DTQG APPYILYLDHDHADIVLA RGLN+A+E
Sbjct: 61 DDLRQPLWEPPGGYGIKPDWLILKKTYEDTQGQAPPYILYLDHDHADIVLAFRGLNLARE 120
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVL+DN+LGKKKFDGGYVHNGLLKAA WVLD E E LKDLV+KYP+YTLTF GHSLG
Sbjct: 121 SDYAVLMDNKLGKKKFDGGYVHNGLLKAAEWVLDAECENLKDLVEKYPNYTLTFTGHSLG 180
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+VVVQ+ ++L NIDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ
Sbjct: 181 SGVAALLTMVVVQSRDRLGNIDRKRVRGYAIAPARCVSLNLAVRYADVINSVVLQ----- 240
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
TPLEDIFKSLFCLPCLLC+RC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR G
Sbjct: 241 -ATTPLEDIFKSLFCLPCLLCIRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRLG 300
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D++MEIPP+QKM
Sbjct: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEREAQRALKLMLEKDQIMEIPPKQKM 360
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGGSSRRK 420
ERQ+TLA+EH+EEY+AALQRAVTLAVPHAY+ S YGTF + D EEE S SSG SS
Sbjct: 361 ERQETLAKEHTEEYRAALQRAVTLAVPHAYSPSMYGTFDEKD--EEEHSYGSSGESS--- 420
Query: 421 KETWDELIERLYDKDDSRHAVLKKSLSTTAFSFSTCLAQSNLISGRKGLRDQLLDRP--- 480
+ KKS T +A+S RK LR + ++
Sbjct: 421 ------------------FSSAKKS--------KTFVARS-----RKELRSEEANKETFI 480
Query: 481 -LSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN 540
+S HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+ LA E+N N
Sbjct: 481 HFGHSVHSNRIDPSRAVQLSWRPRVFLYQGFLSDEECDHLVSLAHGGEENSLTEYDDLGN 540
Query: 541 TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYG 600
T + L S + LN D+I++RIE RI+ WT LPK++S Q+ + EEAE F+G
Sbjct: 541 TNTIRLRISLQIPLNMEDEIVSRIEERISAWTFLPKENSRALQVSRNGVEEAEKNINFFG 600
Query: 601 NRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILF 660
N+S + SEPL+ATV+LY+S+ RGGE+LFPES+++S+ WS K ++ L P KGNAILF
Sbjct: 601 NKSTLEQSEPLIATVILYISNVTRGGEILFPESELRSEVWSDCGKSSSILKPTKGNAILF 660
Query: 661 FSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQ 720
F++ NASPDKSS H+R P+L GE+W ATKF Y + G K + +S+ C DED +CP
Sbjct: 661 FTLRPNASPDKSSPHSRCPVLEGEMWCATKFIYAKAIGGEKVSSDSESSECTDEDDNCPN 709
Query: 721 WAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
WA+IGEC+RN VFM+GSPDYYGTCRKSCN C
Sbjct: 721 WASIGECQRNPVFMVGSPDYYGTCRKSCNVC 709
BLAST of HG10009343 vs. NCBI nr
Match:
RXH95088.1 (hypothetical protein DVH24_024772 [Malus domestica])
HSP 1 Score: 978.8 bits (2529), Expect = 2.6e-281
Identity = 478/748 (63.90%), Postives = 587/748 (78.48%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILC P+LECV CL C RW WKRCLHTAGHDSETWG +T +EFEP+PR+CRYILAVYE
Sbjct: 994 MSILCACPVLECVYCLACTRWAWKRCLHTAGHDSETWGLSTAEEFEPVPRLCRYILAVYE 1053
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R PLWEP GGYGINPDWL++KKTY+DT G APPYILYLDH+HADIVLA RGLN+A+E
Sbjct: 1054 DDLRCPLWEPPGGYGINPDWLILKKTYEDTGGLAPPYILYLDHNHADIVLAFRGLNLARE 1113
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVL+DN+LG++KFDGGYVHNGLLK+A WV+D E E LKDLV+ YP+YTLTFAGHSLG
Sbjct: 1114 SDYAVLMDNKLGQRKFDGGYVHNGLLKSAQWVMDAECEILKDLVQNYPNYTLTFAGHSLG 1173
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+VVV+N ++L +IDRKR+R YAIAPARCMSLNLAVRYADVINSVVLQDDFLP
Sbjct: 1174 SGVAALLTMVVVKNRDRLGDIDRKRVRGYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 1233
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
RTATPLEDIF LPC+LCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFRCG
Sbjct: 1234 RTATPLEDIFN----LPCILCLRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRCG 1293
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D +MEIP +Q+M
Sbjct: 1294 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEREAQRALDLMLQKDHIMEIPSKQRM 1353
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGGS---S 420
ERQ+TLA+EH+EEYKAALQRAVTLAVPHAY+ SPYGTF + D EE+ S SSG S S
Sbjct: 1354 ERQETLAKEHTEEYKAALQRAVTLAVPHAYSPSPYGTFDEKD--EEDHSYGSSGESSFGS 1413
Query: 421 RRKKETWDELIERLYDKDDSRHAVLKKSLSTTAFSFSTCLAQSNLISGRKGLRDQLLDRP 480
+K +++ L S++++ FS S +++ L + + +++ ++
Sbjct: 1414 TKKSKSFTARRGGSASMASLASIFLLLSVTSSFFSSSAEISRKELRTNQT-IQETVIH-- 1473
Query: 481 LSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNT 540
+S HS RIDPSRVVQ+SW+PR SDEECDHL+ LA ED NT
Sbjct: 1474 FGHSVHSNRIDPSRVVQLSWQPR--------SDEECDHLVSLALGGEDKSVTEYDELGNT 1533
Query: 541 VSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYGN 600
+ L+ S + L+ D++++RIE RI+ WT LPK++S Q+ + EE + + ++GN
Sbjct: 1534 NTMRLIKSLEIPLDMEDEVVSRIEARISAWTFLPKENSRAIQVFHFGNEEGDKNFNYFGN 1593
Query: 601 RSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFF 660
+S + +EPL+ATV+LYLS+ RGGE+LFPES++ SK S R+ ++ L PVKGNAILFF
Sbjct: 1594 KSTLEQTEPLLATVILYLSNVTRGGEILFPESELTSKVQSDCRRSSSILRPVKGNAILFF 1653
Query: 661 SVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQW 720
++H NASPDKSS HTR P+L GE+W ATKF + + G K + +S C DED +CP+W
Sbjct: 1654 TLHPNASPDKSSPHTRCPVLEGEMWCATKFLHAKAIAGEKISSDSGSSECTDEDDNCPRW 1713
Query: 721 AAIGECERNAVFMIGSPDYYGTCRKSCN 745
A++GEC+RN VFM+GSPDYYGTCRKSCN
Sbjct: 1714 ASMGECQRNPVFMVGSPDYYGTCRKSCN 1724
BLAST of HG10009343 vs. NCBI nr
Match:
PON44192.1 (Mono-/di-acylglycerol lipase [Trema orientale])
HSP 1 Score: 976.9 bits (2524), Expect = 1.0e-280
Identity = 495/767 (64.54%), Postives = 588/767 (76.66%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILCG+P+LECV CL CARW WKRCLHTAGHDSETWG AT +EFEP+PRIC YILAVYE
Sbjct: 1 MSILCGLPLLECVYCLACARWAWKRCLHTAGHDSETWGLATAEEFEPVPRICCYILAVYE 60
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R PLWEP GYGINPDWL++K+TY+DT+G APPYILYLDHDHADIVLA RGLN+AKE
Sbjct: 61 DDLRHPLWEPPEGYGINPDWLVLKRTYEDTKGQAPPYILYLDHDHADIVLAFRGLNLAKE 120
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVLLDN+LGK+KFDGGYVHNGLLKAAGWVL TE++ LKDLV+KYP+YTLTFAGHSLG
Sbjct: 121 SDYAVLLDNKLGKRKFDGGYVHNGLLKAAGWVLQTESDILKDLVEKYPNYTLTFAGHSLG 180
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+V VQN +KL NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP
Sbjct: 181 SGVAALLTMVAVQNRDKLGNIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
RTATPLEDIFKSLFCLPCLLCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR G
Sbjct: 241 RTATPLEDIFKSLFCLPCLLCLRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRMG 300
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALELM + D +MEIP +Q+M
Sbjct: 301 RFPPVVRTAVPVDGRFEHIVLSCNATSDHAIIWIEREARRALELMLEKDHIMEIPAKQRM 360
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLA---SSGGSS 420
ERQ+TLA+E SEEYKAALQRAVTLAVPHAY+ S YGTF DEG SS GSS
Sbjct: 361 ERQETLAKEKSEEYKAALQRAVTLAVPHAYSPSQYGTFD--DEGGSSAGSTPGESSFGSS 420
Query: 421 RRK--KETWDELIERLYDKDDSRHAVLKKSLSTTAFSFSTCLAQSNLISGRKG--LRDQL 480
R+ KETWDELIERL+DKDDS H + ++ +SFS Q + G G +
Sbjct: 421 RKTKGKETWDELIERLFDKDDSGHILAPEA----EYSFSK--HQIFTLEGFHGFIFSEIF 480
Query: 481 LDRPLSYSNHSGRIDPSRVVQVS----------WRPRVFLYKGFLSDEECDHLIFLASSS 540
+ R +S SG + + +S + R F+ G L +
Sbjct: 481 VGRGISV-EFSGISNGFPCLSLSPIGGLFFFALFFFRNFIPAGLTHQGLSSSLGNQGQAQ 540
Query: 541 EDN--PSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIM 600
E N S ++ S +T+ LL SSG DDI++ IE RI+ WT LPK++ Q++
Sbjct: 541 ERNKKSSRDTDDSGDTIQKELLKSSGTPQQNEDDIVSSIEERISAWTFLPKENGKALQVL 600
Query: 601 QYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRK 660
Y E++E ++GN S + +PL+ATVVLYLS+ +GG++LFP+S+VK K WS K
Sbjct: 601 HYENEDSEKNLNYFGNSSLLDRGKPLIATVVLYLSNVTKGGQILFPQSEVKGKIWSDCTK 660
Query: 661 KN-NFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTV 720
+ N P+KGNAILFF+++ N + D SS H R P+L GE+W ATKFF ++ T G K ++
Sbjct: 661 SSGNIPRPIKGNAILFFNLYPNTTSDSSSSHARCPVLEGEMWFATKFFQVKSTAGEKLSL 720
Query: 721 ESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
ESDE+ C D+D++CP WAA+GEC+RN VFM+GSPDYYGTCR+SCNAC
Sbjct: 721 ESDEEECTDQDENCPGWAAMGECQRNPVFMVGSPDYYGTCRESCNAC 758
BLAST of HG10009343 vs. NCBI nr
Match:
PON51727.1 (Mono-/di-acylglycerol lipase [Parasponia andersonii])
HSP 1 Score: 968.0 bits (2501), Expect = 4.7e-278
Identity = 481/764 (62.96%), Postives = 581/764 (76.05%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILCG+P+LECV CL CARW WKRCLHTAGHDSETWG AT +EFEP+PRIC YILAVYE
Sbjct: 1 MSILCGLPLLECVYCLACARWAWKRCLHTAGHDSETWGLATAEEFEPVPRICCYILAVYE 60
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R+PLWEP GYGINPDWL++K+TY+DT+G APPYILYLDHDHADIVLA RGLN+AKE
Sbjct: 61 DDLRRPLWEPPEGYGINPDWLVLKRTYEDTKGQAPPYILYLDHDHADIVLAFRGLNLAKE 120
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVLLDN+LGK+KFDGGYVHNGLLKAAGWVL TE++ LKDLV++YP+YTLTFAGHSLG
Sbjct: 121 SDYAVLLDNKLGKRKFDGGYVHNGLLKAAGWVLQTESDILKDLVERYPNYTLTFAGHSLG 180
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+V VQN +KL NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP
Sbjct: 181 SGVAALLTMVAVQNRDKLGNIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
RTATPLEDIFKSLFCLPCLLCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR G
Sbjct: 241 RTATPLEDIFKSLFCLPCLLCLRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRMG 300
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALEL+ + D +MEIP +Q+M
Sbjct: 301 RFPPVVRTAVPVDGRFEHIVLSCNATSDHAIIWIEREARRALELVSEKDHIMEIPAKQRM 360
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGGSSRRK 420
ERQ+TLA+E SEEYKAALQRAVTLAVPHAY+ S YGTF + S S RK
Sbjct: 361 ERQETLAKEKSEEYKAALQRAVTLAVPHAYSPSQYGTFDDEGGSSAGSTPGESSFDSSRK 420
Query: 421 ---KETWDELIERLYDKDDSRHAV-------LKKSLSTTAFSFSTCLAQSNLISGRKGLR 480
KETWDELIERL+DKDDS H K T F + + N + +G+
Sbjct: 421 TKGKETWDELIERLFDKDDSGHITAPEAEYSFSKHQILTLEGFHSFIFSENFVG--RGIS 480
Query: 481 DQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVF----LYKGFLSDEECDHLIFLASSSEDN 540
+ SN ++ S + + + F + G L + E N
Sbjct: 481 VEF----SGISNGFPCLNLSPIGGLFFFALFFFRNSIPTGLTHQGLSSSLGNQGQAQERN 540
Query: 541 --PSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYR 600
S ++ S +T+ LL+SSG DD+++ IE RI+ WT LPK++ Q++ Y
Sbjct: 541 KKSSRDTDDSGDTIQKGLLNSSGPPQQNEDDVVSSIEERISAWTFLPKENGKALQVLHYE 600
Query: 601 GEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNN 660
E++E ++GN S + S+PL+ATVVLYLS+ +GG++LFP+S+VK K WS K +
Sbjct: 601 NEDSEKNLNYFGNSSLLDHSKPLIATVVLYLSNVTKGGQILFPQSQVKGKIWSDCTKSSG 660
Query: 661 FL-TPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESD 720
+ P+KGNAILFF+++ N + D SS H R P++ GE+W ATKFF ++ T G K ++ESD
Sbjct: 661 KIPRPIKGNAILFFNLYPNTTSDSSSSHARCPVVEGEMWFATKFFQVKSTAGEKLSLESD 720
Query: 721 EDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
++ C D+D++CP WAA+GEC+RN VFM+GSPDYYGTCR+SCNAC
Sbjct: 721 KEECTDQDENCPGWAAMGECQRNPVFMVGSPDYYGTCRESCNAC 758
BLAST of HG10009343 vs. ExPASy Swiss-Prot
Match:
Q8GXT7 (Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 SV=1)
HSP 1 Score: 235.0 bits (598), Expect = 2.8e-60
Identity = 130/288 (45.14%), Postives = 176/288 (61.11%), Query Frame = 0
Query: 466 RKGLRDQLL-----DRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA 525
RK LRD+ + D SY S +DP+RV+Q+SW PRVFLY+GFLS+EECDHLI L
Sbjct: 28 RKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSWLPRVFLYRGFLSEEECDHLISLR 87
Query: 526 SSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQI 585
+ + S ++ G D ++A IE +++ WT LP ++ ++
Sbjct: 88 KETTEVYSVDADGK----------------TQLDPVVAGIEEKVSAWTFLPGENGGSIKV 147
Query: 586 MQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR 645
Y E++ K ++G + E L+ATVVLYLS++ +GGE+LFP S++K K +S
Sbjct: 148 RSYTSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSEMKPK--NSCL 207
Query: 646 KKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTV 705
+ N L PVKGNAILFF+ LNAS D S H R P++ GEL VATK Y K
Sbjct: 208 EGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYA------KKQA 267
Query: 706 ESDEDG-CIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
+E G C DED++C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Sbjct: 268 RIEESGECSDEDENCGRWAKLGECKKNPVYMIGSPDYYGTCRKSCNAC 291
BLAST of HG10009343 vs. ExPASy Swiss-Prot
Match:
F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)
HSP 1 Score: 210.3 bits (534), Expect = 7.5e-53
Identity = 112/276 (40.58%), Postives = 165/276 (59.78%), Query Frame = 0
Query: 481 SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS-EDNPSGNSAGSRNTVS 540
S+ S +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA E + S +
Sbjct: 21 SSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESED 80
Query: 541 TNLLSSSGVIL-NTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRG---EEAEHKYFYG 600
+ + +SSG+ L DDI+A +E ++A WT LP+++ QI+ Y + YFY
Sbjct: 81 SEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFY- 140
Query: 601 NRSAMSSSEPLMATVVLYLSDSARGGEMLFPESK-----VKSKFWSSRRKKNNFLTPVKG 660
++ A+ +ATV++YLS+ +GGE +FP K +K WS K+ + P KG
Sbjct: 141 DKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKG 200
Query: 661 NAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDED 720
+A+LFF++HLN + D +S H P++ GE W AT++ ++R + G K V C+D+
Sbjct: 201 DALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVR-SFGKKKLV------CVDDH 260
Query: 721 KSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
+SC +WA GECE+N ++M+GS G CRKSC AC
Sbjct: 261 ESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288
BLAST of HG10009343 vs. ExPASy Swiss-Prot
Match:
Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)
HSP 1 Score: 205.7 bits (522), Expect = 1.8e-51
Identity = 103/267 (38.58%), Postives = 166/267 (62.17%), Query Frame = 0
Query: 488 DPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSG 547
DP+RV Q+SW PRVFLY+GFLSDEECDH I LA + S +V + + +SSG
Sbjct: 52 DPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSG 111
Query: 548 VILN-TTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSE 607
+ L+ DDI++ +E ++A WT LP+++ QI+ Y G++ E H ++ +++ +
Sbjct: 112 MFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGG 171
Query: 608 PLMATVVLYLSDSARGGEMLFP-----ESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVH 667
+ATV++YLS+ +GGE +FP +++K W+ K+ + P KG+A+LFF++H
Sbjct: 172 HRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLH 231
Query: 668 LNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAI 727
NA+ D +S H P++ GE W AT++ +++ + + + GC+DE+ SC +WA
Sbjct: 232 PNATTDSNSLHGSCPVVEGEKWSATRWIHVK----SFERAFNKQSGCMDENVSCEKWAKA 291
Query: 728 GECERNAVFMIGSPDYYGTCRKSCNAC 747
GEC++N +M+GS +G CRKSC AC
Sbjct: 292 GECQKNPTYMVGSDKDHGYCRKSCKAC 314
BLAST of HG10009343 vs. ExPASy Swiss-Prot
Match:
F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)
HSP 1 Score: 182.6 bits (462), Expect = 1.7e-44
Identity = 103/282 (36.52%), Postives = 165/282 (58.51%), Query Frame = 0
Query: 481 SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSGNSAGS 540
S+ S I+PS+V QVS +PR F+Y+GFL+D ECDHLI LA S+ DN +G S S
Sbjct: 27 SSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVS 86
Query: 541 RNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKY 600
S+ S G D I++ IE +++ WT LPK++ Q+++Y G++ + H
Sbjct: 87 DVRTSSGTFISKG-----KDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFD 146
Query: 601 FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNF 660
++ ++ ++ +ATV+LYLS+ +GGE +FP+++ S+ S KK
Sbjct: 147 YFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIA 206
Query: 661 LTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDED 720
+ P KGNA+LFF++ +A PD S H P++ GE W ATK+ ++ + + + +
Sbjct: 207 VKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHV----DSFDKILTHDG 266
Query: 721 GCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
C D ++SC +WA +GEC +N +M+G+P+ G CR+SC AC
Sbjct: 267 NCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299
BLAST of HG10009343 vs. ExPASy Swiss-Prot
Match:
Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)
HSP 1 Score: 179.9 bits (455), Expect = 1.1e-43
Identity = 99/283 (34.98%), Postives = 162/283 (57.24%), Query Frame = 0
Query: 481 SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS------EDNPSGNSAGS 540
S+ S ++PS+V QVS +PR F+Y+GFL++ ECDH++ LA +S DN SG S S
Sbjct: 26 SSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFS 85
Query: 541 RNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQY---RGEEAEHK 600
S+ S G D I++ IE +I+ WT LPK++ Q+++Y + +A
Sbjct: 86 EVRTSSGTFISKG-----KDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFD 145
Query: 601 YFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNN 660
YF+ + + MAT+++YLS+ +GGE +FP++++ S+ S K+
Sbjct: 146 YFHDKVNIVRGGH-RMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGI 205
Query: 661 FLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDE 720
+ P KG+A+LFF++H +A PD S H P++ GE W ATK+ ++ + + +
Sbjct: 206 AVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHV----DSFDRIVTPS 265
Query: 721 DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
C D ++SC +WA +GEC +N +M+G+ + G CR+SC AC
Sbjct: 266 GNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298
BLAST of HG10009343 vs. ExPASy TrEMBL
Match:
A0A6J5WND9 (Procollagen-proline 4-dioxygenase OS=Prunus armeniaca OX=36596 GN=ORAREDHAP_LOCUS17440 PE=3 SV=1)
HSP 1 Score: 1004.2 bits (2595), Expect = 2.8e-289
Identity = 496/751 (66.05%), Postives = 584/751 (77.76%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILCG P++ECV CL C RW WKRCLHTAGHDSETWG AT +EFEP+PR+CRYILAVYE
Sbjct: 14 MSILCGCPLIECVYCLACTRWAWKRCLHTAGHDSETWGIATAEEFEPVPRLCRYILAVYE 73
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R+PLWEP GGYGI PDWL++KKTY+DTQG APPYILYLDHDHADIVLA RGLN+A+E
Sbjct: 74 DDLRQPLWEPPGGYGIKPDWLILKKTYEDTQGQAPPYILYLDHDHADIVLAFRGLNLARE 133
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVL+DN+LGKKKFDGGYVHNGLLKAA WVLD E E LKDLV+KYP+YTLTF GHSLG
Sbjct: 134 SDYAVLMDNKLGKKKFDGGYVHNGLLKAAEWVLDAECENLKDLVEKYPNYTLTFTGHSLG 193
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+VVVQ+ ++L NIDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ
Sbjct: 194 SGVAALLTMVVVQSRDRLGNIDRKRVRGYAIAPARCVSLNLAVRYADVINSVVLQ----- 253
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
TPLEDIFKSLFCLPCLLC+RC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR G
Sbjct: 254 -ATTPLEDIFKSLFCLPCLLCIRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRLG 313
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D++MEIPP+QKM
Sbjct: 314 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEREAQRALKLMLEKDQIMEIPPKQKM 373
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGGSSRRK 420
ERQ+TLA+EH+EEY+AALQRAVTLAVPHAY+ S YGTF + D EEE S SSG SS
Sbjct: 374 ERQETLAKEHTEEYRAALQRAVTLAVPHAYSPSMYGTFDEKD--EEEHSYGSSGESS--- 433
Query: 421 KETWDELIERLYDKDDSRHAVLKKSLSTTAFSFSTCLAQSNLISGRKGLRDQLLDRP--- 480
+ KKS T +A+S RK LR + ++
Sbjct: 434 ------------------FSSAKKS--------KTFVARS-----RKELRSEEANKETFI 493
Query: 481 -LSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN 540
+S HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+ LA E+N N
Sbjct: 494 HFGHSVHSNRIDPSRAVQLSWRPRVFLYQGFLSDEECDHLVSLAHGGEENSLTEYDDLGN 553
Query: 541 TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYG 600
T + L S + LN D+I++RIE RI+ WT LPK++S Q+ + EEAE F+G
Sbjct: 554 TNTIRLRKSLQIPLNMEDEIVSRIEERISAWTFLPKENSRALQVSRNGVEEAEKNLNFFG 613
Query: 601 NRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILF 660
N+S + SEPL+ATV+LY+S+ RGGE+LFPES+++S+ WS K ++ L P KGNAILF
Sbjct: 614 NKSTLEQSEPLIATVILYISNVTRGGEILFPESELRSEVWSDCGKSSSILKPTKGNAILF 673
Query: 661 FSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQ 720
F++ NASPDKSS H+R P+L GE+W ATKF Y + G K + +S+ C DED +CP
Sbjct: 674 FTLRPNASPDKSSPHSRCPVLEGEMWCATKFIYAKAIGGEKVSPDSESSECTDEDDNCPN 722
Query: 721 WAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
WA+IGEC+RN VFM+GSPDYYGTCRKSCN C
Sbjct: 734 WASIGECQRNPVFMVGSPDYYGTCRKSCNVC 722
BLAST of HG10009343 vs. ExPASy TrEMBL
Match:
A0A6J5U8N9 (Procollagen-proline 4-dioxygenase OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS17819 PE=3 SV=1)
HSP 1 Score: 1003.4 bits (2593), Expect = 4.9e-289
Identity = 496/751 (66.05%), Postives = 584/751 (77.76%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILCG P++ECV CL C RW WKRCLHTAGHDSETWG AT +EFEP+PR+CRYILAVYE
Sbjct: 1 MSILCGCPLIECVYCLACTRWAWKRCLHTAGHDSETWGIATAEEFEPVPRLCRYILAVYE 60
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R+PLWEP GGYGI PDWL++KKTY+DTQG APPYILYLDHDHADIVLA RGLN+A+E
Sbjct: 61 DDLRQPLWEPPGGYGIKPDWLILKKTYEDTQGQAPPYILYLDHDHADIVLAFRGLNLARE 120
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVL+DN+LGKKKFDGGYVHNGLLKAA WVLD E E LKDLV+KYP+YTLTF GHSLG
Sbjct: 121 SDYAVLMDNKLGKKKFDGGYVHNGLLKAAEWVLDAECENLKDLVEKYPNYTLTFTGHSLG 180
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+VVVQ+ ++L NIDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ
Sbjct: 181 SGVAALLTMVVVQSRDRLGNIDRKRVRGYAIAPARCVSLNLAVRYADVINSVVLQ----- 240
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
TPLEDIFKSLFCLPCLLC+RC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR G
Sbjct: 241 -ATTPLEDIFKSLFCLPCLLCIRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRLG 300
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D++MEIPP+QKM
Sbjct: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEREAQRALKLMLEKDQIMEIPPKQKM 360
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGGSSRRK 420
ERQ+TLA+EH+EEY+AALQRAVTLAVPHAY+ S YGTF + D EEE S SSG SS
Sbjct: 361 ERQETLAKEHTEEYRAALQRAVTLAVPHAYSPSMYGTFDEKD--EEEHSYGSSGESS--- 420
Query: 421 KETWDELIERLYDKDDSRHAVLKKSLSTTAFSFSTCLAQSNLISGRKGLRDQLLDRP--- 480
+ KKS T +A+S RK LR + ++
Sbjct: 421 ------------------FSSAKKS--------KTFVARS-----RKELRSEEANKETFI 480
Query: 481 -LSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRN 540
+S HS RIDPSR VQ+SWRPRVFLY+GFLSDEECDHL+ LA E+N N
Sbjct: 481 HFGHSVHSNRIDPSRAVQLSWRPRVFLYQGFLSDEECDHLVSLAHGGEENSLTEYDDLGN 540
Query: 541 TVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYG 600
T + L S + LN D+I++RIE RI+ WT LPK++S Q+ + EEAE F+G
Sbjct: 541 TNTIRLRISLQIPLNMEDEIVSRIEERISAWTFLPKENSRALQVSRNGVEEAEKNINFFG 600
Query: 601 NRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILF 660
N+S + SEPL+ATV+LY+S+ RGGE+LFPES+++S+ WS K ++ L P KGNAILF
Sbjct: 601 NKSTLEQSEPLIATVILYISNVTRGGEILFPESELRSEVWSDCGKSSSILKPTKGNAILF 660
Query: 661 FSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQ 720
F++ NASPDKSS H+R P+L GE+W ATKF Y + G K + +S+ C DED +CP
Sbjct: 661 FTLRPNASPDKSSPHSRCPVLEGEMWCATKFIYAKAIGGEKVSSDSESSECTDEDDNCPN 709
Query: 721 WAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
WA+IGEC+RN VFM+GSPDYYGTCRKSCN C
Sbjct: 721 WASIGECQRNPVFMVGSPDYYGTCRKSCNVC 709
BLAST of HG10009343 vs. ExPASy TrEMBL
Match:
A0A498JHB5 (Procollagen-proline 4-dioxygenase OS=Malus domestica OX=3750 GN=DVH24_024772 PE=3 SV=1)
HSP 1 Score: 978.8 bits (2529), Expect = 1.3e-281
Identity = 478/748 (63.90%), Postives = 587/748 (78.48%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILC P+LECV CL C RW WKRCLHTAGHDSETWG +T +EFEP+PR+CRYILAVYE
Sbjct: 994 MSILCACPVLECVYCLACTRWAWKRCLHTAGHDSETWGLSTAEEFEPVPRLCRYILAVYE 1053
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R PLWEP GGYGINPDWL++KKTY+DT G APPYILYLDH+HADIVLA RGLN+A+E
Sbjct: 1054 DDLRCPLWEPPGGYGINPDWLILKKTYEDTGGLAPPYILYLDHNHADIVLAFRGLNLARE 1113
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVL+DN+LG++KFDGGYVHNGLLK+A WV+D E E LKDLV+ YP+YTLTFAGHSLG
Sbjct: 1114 SDYAVLMDNKLGQRKFDGGYVHNGLLKSAQWVMDAECEILKDLVQNYPNYTLTFAGHSLG 1173
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+VVV+N ++L +IDRKR+R YAIAPARCMSLNLAVRYADVINSVVLQDDFLP
Sbjct: 1174 SGVAALLTMVVVKNRDRLGDIDRKRVRGYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 1233
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
RTATPLEDIF LPC+LCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFRCG
Sbjct: 1234 RTATPLEDIFN----LPCILCLRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRCG 1293
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D +MEIP +Q+M
Sbjct: 1294 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEREAQRALDLMLQKDHIMEIPSKQRM 1353
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGGS---S 420
ERQ+TLA+EH+EEYKAALQRAVTLAVPHAY+ SPYGTF + D EE+ S SSG S S
Sbjct: 1354 ERQETLAKEHTEEYKAALQRAVTLAVPHAYSPSPYGTFDEKD--EEDHSYGSSGESSFGS 1413
Query: 421 RRKKETWDELIERLYDKDDSRHAVLKKSLSTTAFSFSTCLAQSNLISGRKGLRDQLLDRP 480
+K +++ L S++++ FS S +++ L + + +++ ++
Sbjct: 1414 TKKSKSFTARRGGSASMASLASIFLLLSVTSSFFSSSAEISRKELRTNQT-IQETVIH-- 1473
Query: 481 LSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNT 540
+S HS RIDPSRVVQ+SW+PR SDEECDHL+ LA ED NT
Sbjct: 1474 FGHSVHSNRIDPSRVVQLSWQPR--------SDEECDHLVSLALGGEDKSVTEYDELGNT 1533
Query: 541 VSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYGN 600
+ L+ S + L+ D++++RIE RI+ WT LPK++S Q+ + EE + + ++GN
Sbjct: 1534 NTMRLIKSLEIPLDMEDEVVSRIEARISAWTFLPKENSRAIQVFHFGNEEGDKNFNYFGN 1593
Query: 601 RSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFF 660
+S + +EPL+ATV+LYLS+ RGGE+LFPES++ SK S R+ ++ L PVKGNAILFF
Sbjct: 1594 KSTLEQTEPLLATVILYLSNVTRGGEILFPESELTSKVQSDCRRSSSILRPVKGNAILFF 1653
Query: 661 SVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQW 720
++H NASPDKSS HTR P+L GE+W ATKF + + G K + +S C DED +CP+W
Sbjct: 1654 TLHPNASPDKSSPHTRCPVLEGEMWCATKFLHAKAIAGEKISSDSGSSECTDEDDNCPRW 1713
Query: 721 AAIGECERNAVFMIGSPDYYGTCRKSCN 745
A++GEC+RN VFM+GSPDYYGTCRKSCN
Sbjct: 1714 ASMGECQRNPVFMVGSPDYYGTCRKSCN 1724
BLAST of HG10009343 vs. ExPASy TrEMBL
Match:
A0A2P5B5Y0 (Procollagen-proline 4-dioxygenase OS=Trema orientale OX=63057 GN=TorRG33x02_331640 PE=3 SV=1)
HSP 1 Score: 976.9 bits (2524), Expect = 4.9e-281
Identity = 495/767 (64.54%), Postives = 588/767 (76.66%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILCG+P+LECV CL CARW WKRCLHTAGHDSETWG AT +EFEP+PRIC YILAVYE
Sbjct: 1 MSILCGLPLLECVYCLACARWAWKRCLHTAGHDSETWGLATAEEFEPVPRICCYILAVYE 60
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R PLWEP GYGINPDWL++K+TY+DT+G APPYILYLDHDHADIVLA RGLN+AKE
Sbjct: 61 DDLRHPLWEPPEGYGINPDWLVLKRTYEDTKGQAPPYILYLDHDHADIVLAFRGLNLAKE 120
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVLLDN+LGK+KFDGGYVHNGLLKAAGWVL TE++ LKDLV+KYP+YTLTFAGHSLG
Sbjct: 121 SDYAVLLDNKLGKRKFDGGYVHNGLLKAAGWVLQTESDILKDLVEKYPNYTLTFAGHSLG 180
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+V VQN +KL NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP
Sbjct: 181 SGVAALLTMVAVQNRDKLGNIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
RTATPLEDIFKSLFCLPCLLCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR G
Sbjct: 241 RTATPLEDIFKSLFCLPCLLCLRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRMG 300
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALELM + D +MEIP +Q+M
Sbjct: 301 RFPPVVRTAVPVDGRFEHIVLSCNATSDHAIIWIEREARRALELMLEKDHIMEIPAKQRM 360
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLA---SSGGSS 420
ERQ+TLA+E SEEYKAALQRAVTLAVPHAY+ S YGTF DEG SS GSS
Sbjct: 361 ERQETLAKEKSEEYKAALQRAVTLAVPHAYSPSQYGTFD--DEGGSSAGSTPGESSFGSS 420
Query: 421 RRK--KETWDELIERLYDKDDSRHAVLKKSLSTTAFSFSTCLAQSNLISGRKG--LRDQL 480
R+ KETWDELIERL+DKDDS H + ++ +SFS Q + G G +
Sbjct: 421 RKTKGKETWDELIERLFDKDDSGHILAPEA----EYSFSK--HQIFTLEGFHGFIFSEIF 480
Query: 481 LDRPLSYSNHSGRIDPSRVVQVS----------WRPRVFLYKGFLSDEECDHLIFLASSS 540
+ R +S SG + + +S + R F+ G L +
Sbjct: 481 VGRGISV-EFSGISNGFPCLSLSPIGGLFFFALFFFRNFIPAGLTHQGLSSSLGNQGQAQ 540
Query: 541 EDN--PSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIM 600
E N S ++ S +T+ LL SSG DDI++ IE RI+ WT LPK++ Q++
Sbjct: 541 ERNKKSSRDTDDSGDTIQKELLKSSGTPQQNEDDIVSSIEERISAWTFLPKENGKALQVL 600
Query: 601 QYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRK 660
Y E++E ++GN S + +PL+ATVVLYLS+ +GG++LFP+S+VK K WS K
Sbjct: 601 HYENEDSEKNLNYFGNSSLLDRGKPLIATVVLYLSNVTKGGQILFPQSEVKGKIWSDCTK 660
Query: 661 KN-NFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTV 720
+ N P+KGNAILFF+++ N + D SS H R P+L GE+W ATKFF ++ T G K ++
Sbjct: 661 SSGNIPRPIKGNAILFFNLYPNTTSDSSSSHARCPVLEGEMWFATKFFQVKSTAGEKLSL 720
Query: 721 ESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
ESDE+ C D+D++CP WAA+GEC+RN VFM+GSPDYYGTCR+SCNAC
Sbjct: 721 ESDEEECTDQDENCPGWAAMGECQRNPVFMVGSPDYYGTCRESCNAC 758
BLAST of HG10009343 vs. ExPASy TrEMBL
Match:
A0A2P5BSE2 (Procollagen-proline 4-dioxygenase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_214270 PE=3 SV=1)
HSP 1 Score: 968.0 bits (2501), Expect = 2.3e-278
Identity = 481/764 (62.96%), Postives = 581/764 (76.05%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILCG+P+LECV CL CARW WKRCLHTAGHDSETWG AT +EFEP+PRIC YILAVYE
Sbjct: 1 MSILCGLPLLECVYCLACARWAWKRCLHTAGHDSETWGLATAEEFEPVPRICCYILAVYE 60
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
DD+R+PLWEP GYGINPDWL++K+TY+DT+G APPYILYLDHDHADIVLA RGLN+AKE
Sbjct: 61 DDLRRPLWEPPEGYGINPDWLVLKRTYEDTKGQAPPYILYLDHDHADIVLAFRGLNLAKE 120
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
SDYAVLLDN+LGK+KFDGGYVHNGLLKAAGWVL TE++ LKDLV++YP+YTLTFAGHSLG
Sbjct: 121 SDYAVLLDNKLGKRKFDGGYVHNGLLKAAGWVLQTESDILKDLVERYPNYTLTFAGHSLG 180
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
SGVAA+LT+V VQN +KL NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP
Sbjct: 181 SGVAALLTMVAVQNRDKLGNIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
RTATPLEDIFKSLFCLPCLLCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR G
Sbjct: 241 RTATPLEDIFKSLFCLPCLLCLRCMRDTCIPEEKMLKDPRRLYAPGRLYHIVERKPFRMG 300
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
RFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALEL+ + D +MEIP +Q+M
Sbjct: 301 RFPPVVRTAVPVDGRFEHIVLSCNATSDHAIIWIEREARRALELVSEKDHIMEIPAKQRM 360
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGGSSRRK 420
ERQ+TLA+E SEEYKAALQRAVTLAVPHAY+ S YGTF + S S RK
Sbjct: 361 ERQETLAKEKSEEYKAALQRAVTLAVPHAYSPSQYGTFDDEGGSSAGSTPGESSFDSSRK 420
Query: 421 ---KETWDELIERLYDKDDSRHAV-------LKKSLSTTAFSFSTCLAQSNLISGRKGLR 480
KETWDELIERL+DKDDS H K T F + + N + +G+
Sbjct: 421 TKGKETWDELIERLFDKDDSGHITAPEAEYSFSKHQILTLEGFHSFIFSENFVG--RGIS 480
Query: 481 DQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVF----LYKGFLSDEECDHLIFLASSSEDN 540
+ SN ++ S + + + F + G L + E N
Sbjct: 481 VEF----SGISNGFPCLNLSPIGGLFFFALFFFRNSIPTGLTHQGLSSSLGNQGQAQERN 540
Query: 541 --PSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYR 600
S ++ S +T+ LL+SSG DD+++ IE RI+ WT LPK++ Q++ Y
Sbjct: 541 KKSSRDTDDSGDTIQKGLLNSSGPPQQNEDDVVSSIEERISAWTFLPKENGKALQVLHYE 600
Query: 601 GEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNN 660
E++E ++GN S + S+PL+ATVVLYLS+ +GG++LFP+S+VK K WS K +
Sbjct: 601 NEDSEKNLNYFGNSSLLDHSKPLIATVVLYLSNVTKGGQILFPQSQVKGKIWSDCTKSSG 660
Query: 661 FL-TPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESD 720
+ P+KGNAILFF+++ N + D SS H R P++ GE+W ATKFF ++ T G K ++ESD
Sbjct: 661 KIPRPIKGNAILFFNLYPNTTSDSSSSHARCPVVEGEMWFATKFFQVKSTAGEKLSLESD 720
Query: 721 EDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
++ C D+D++CP WAA+GEC+RN VFM+GSPDYYGTCR+SCNAC
Sbjct: 721 KEECTDQDENCPGWAAMGECQRNPVFMVGSPDYYGTCRESCNAC 758
BLAST of HG10009343 vs. TAIR 10
Match:
AT3G49050.1 (alpha/beta-Hydrolases superfamily protein )
HSP 1 Score: 689.9 bits (1779), Expect = 2.3e-198
Identity = 342/473 (72.30%), Postives = 399/473 (84.36%), Query Frame = 0
Query: 1 MSILCG-VPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVY 60
MSILCG P+LECV CLGCARW +KRCL+TAGHDSE WG AT DEFEP+PR CRYILAVY
Sbjct: 1 MSILCGCCPLLECVYCLGCARWGYKRCLYTAGHDSEDWGLATTDEFEPVPRFCRYILAVY 60
Query: 61 EDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAK 120
EDDIR PLWEP GYGINPDWLL+KKTY+DTQG AP YILYLDH H DIV+AIRGLN+AK
Sbjct: 61 EDDIRNPLWEPPEGYGINPDWLLLKKTYEDTQGRAPAYILYLDHVHQDIVVAIRGLNLAK 120
Query: 121 ESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSL 180
ESDYA+LLDN+LG++KFDGGYVHNGL+K+AG+VLD E + LK+LVKKYP YTLTFAGHSL
Sbjct: 121 ESDYAMLLDNKLGERKFDGGYVHNGLVKSAGYVLDEECKVLKELVKKYPSYTLTFAGHSL 180
Query: 181 GSGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFL 240
GSGVA ML L+VV++ E+L NIDRKR+RC+AIAPARCMSLNLAVRYADVINSV+LQDDFL
Sbjct: 181 GSGVATMLALLVVRHPERLGNIDRKRVRCFAIAPARCMSLNLAVRYADVINSVILQDDFL 240
Query: 241 PRTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRC 300
PRTATPLEDIFKS+FCLPCLLC+RC++DTCV E KMLKDPRRLYAPGR+YHIVERKP R
Sbjct: 241 PRTATPLEDIFKSVFCLPCLLCIRCMKDTCVPEQKMLKDPRRLYAPGRMYHIVERKPCRL 300
Query: 301 GRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQK 360
GR+PPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL LM +++K MEIP +Q+
Sbjct: 301 GRYPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEREAQRALNLMMENEKKMEIPEKQR 360
Query: 361 MERQKTLAREHSEEYKAALQRAVTLAVPHAYALS-PYGTFSQT--DEGEEEK-------- 420
MERQ++LAREH+ EY+AAL+RAVTL VPHA +++ YGTF +T DE EEE+
Sbjct: 361 MERQESLAREHNLEYRAALRRAVTLDVPHAESMAYEYGTFDKTQEDETEEEEVETEEEEE 420
Query: 421 ---SLA-----SSGGSS--------RRKKETWDELIERLYDKDDSRHAVLKKS 446
S+A SS SS R ++ +WDELIE L+++D+S + +KS
Sbjct: 421 DTDSIAPMVGESSSSSSVKPTYRIRRNRRVSWDELIEHLFERDESGNLTFEKS 473
BLAST of HG10009343 vs. TAIR 10
Match:
AT4G00500.1 (alpha/beta-Hydrolases superfamily protein )
HSP 1 Score: 586.6 bits (1511), Expect = 2.7e-167
Identity = 277/452 (61.28%), Postives = 355/452 (78.54%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILC VP+LECV CLGC W+WK+CL++AGH+SE WG AT DEFEPIPRICR ILAVYE
Sbjct: 1 MSILCCVPVLECVYCLGCTHWLWKKCLYSAGHESENWGLATSDEFEPIPRICRLILAVYE 60
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
+++ P+W P GYGI+P+ +++KK Y T+G PY++YLDH++ D+VLAIRGLN+AKE
Sbjct: 61 ENLHDPMWAPPDGYGIDPNHVILKKDYDQTEGRVTPYMIYLDHENGDVVLAIRGLNLAKE 120
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
DYAVLLDN+LG+ KFDGGYVHNGLLKAA WV + E+ L++L++ P Y+LTF GHSLG
Sbjct: 121 CDYAVLLDNKLGQTKFDGGYVHNGLLKAAMWVFEEEHVVLRELLEANPSYSLTFVGHSLG 180
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
+GV ++L L V+QN +L NI+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQDDFLP
Sbjct: 181 AGVVSLLVLFVIQNRVRLGNIERKRIRCFAIAPPRCMSLHLAVTYADVINSVVLQDDFLP 240
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
RT T LE++FKS+ CLPCLLCL C++DT E++ LKD RRLYAPGRLYHIV RKP R G
Sbjct: 241 RTTTALENVFKSIICLPCLLCLTCLKDTFTFEERKLKDARRLYAPGRLYHIVVRKPLRLG 300
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
R+PPVV+TAVPVDGRFE IVLSCNAT+DHAIIWIE+E++ AL+LM ++D+VM+IP +QK+
Sbjct: 301 RYPPVVRTAVPVDGRFEQIVLSCNATADHAIIWIERESQRALDLMVEEDQVMQIPVEQKI 360
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSL----ASSGGS 420
RQK++ +H EEY+AA+ +A +L +P + + S YGTF T+EGE + SG S
Sbjct: 361 VRQKSIVEDHDEEYRAAIMKAASLNIPMSPSPS-YGTFHDTEEGESSAGSGMEGSPSGWS 420
Query: 421 SRRKKETWDELIERLYD-KDDSRHAVLKKSLS 448
+ + WD+ I+ + D+S H + K S
Sbjct: 421 FKGMRRKWDQFIDCHFPVNDNSEHMIFKNQES 451
BLAST of HG10009343 vs. TAIR 10
Match:
AT4G00500.2 (alpha/beta-Hydrolases superfamily protein )
HSP 1 Score: 586.6 bits (1511), Expect = 2.7e-167
Identity = 277/452 (61.28%), Postives = 355/452 (78.54%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MSILC VP+LECV CLGC W+WK+CL++AGH+SE WG AT DEFEPIPRICR ILAVYE
Sbjct: 1 MSILCCVPVLECVYCLGCTHWLWKKCLYSAGHESENWGLATSDEFEPIPRICRLILAVYE 60
Query: 61 DDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAKE 120
+++ P+W P GYGI+P+ +++KK Y T+G PY++YLDH++ D+VLAIRGLN+AKE
Sbjct: 61 ENLHDPMWAPPDGYGIDPNHVILKKDYDQTEGRVTPYMIYLDHENGDVVLAIRGLNLAKE 120
Query: 121 SDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKKYPDYTLTFAGHSLG 180
DYAVLLDN+LG+ KFDGGYVHNGLLKAA WV + E+ L++L++ P Y+LTF GHSLG
Sbjct: 121 CDYAVLLDNKLGQTKFDGGYVHNGLLKAAMWVFEEEHVVLRELLEANPSYSLTFVGHSLG 180
Query: 181 SGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLP 240
+GV ++L L V+QN +L NI+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQDDFLP
Sbjct: 181 AGVVSLLVLFVIQNRVRLGNIERKRIRCFAIAPPRCMSLHLAVTYADVINSVVLQDDFLP 240
Query: 241 RTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCG 300
RT T LE++FKS+ CLPCLLCL C++DT E++ LKD RRLYAPGRLYHIV RKP R G
Sbjct: 241 RTTTALENVFKSIICLPCLLCLTCLKDTFTFEERKLKDARRLYAPGRLYHIVVRKPLRLG 300
Query: 301 RFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRKDDKVMEIPPQQKM 360
R+PPVV+TAVPVDGRFE IVLSCNAT+DHAIIWIE+E++ AL+LM ++D+VM+IP +QK+
Sbjct: 301 RYPPVVRTAVPVDGRFEQIVLSCNATADHAIIWIERESQRALDLMVEEDQVMQIPVEQKI 360
Query: 361 ERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSL----ASSGGS 420
RQK++ +H EEY+AA+ +A +L +P + + S YGTF T+EGE + SG S
Sbjct: 361 VRQKSIVEDHDEEYRAAIMKAASLNIPMSPSPS-YGTFHDTEEGESSAGSGMEGSPSGWS 420
Query: 421 SRRKKETWDELIERLYD-KDDSRHAVLKKSLS 448
+ + WD+ I+ + D+S H + K S
Sbjct: 421 FKGMRRKWDQFIDCHFPVNDNSEHMIFKNQES 451
BLAST of HG10009343 vs. TAIR 10
Match:
AT5G37710.1 (alpha/beta-Hydrolases superfamily protein )
HSP 1 Score: 513.1 bits (1320), Expect = 3.8e-145
Identity = 257/451 (56.98%), Postives = 335/451 (74.28%), Query Frame = 0
Query: 1 MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYE 60
MS+ CG LECV C+G +RW WKRC H DS TW ATP+EFEPIPRI R ILAVYE
Sbjct: 1 MSVACG---LECVFCVGFSRWAWKRCTHVGSDDSATWTSATPEEFEPIPRISRVILAVYE 60
Query: 61 DDIRKPLWEP-VGGYGINPDWLLMKKTYKDTQGWAPPYILYLDHDHADIVLAIRGLNMAK 120
D+R P P +G + +NP+W++ + T++ TQG +PPYI+Y+DHDH +IVLAIRGLN+AK
Sbjct: 61 PDLRNPKISPSLGTFDLNPEWVIKRVTHEKTQGRSPPYIIYIDHDHREIVLAIRGLNLAK 120
Query: 121 ESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETL-KDLVKKYPDYTLTFAGHS 180
ESDY +LLDN+LG+K GGYVH GLLK+A WVL+ E+ETL + + +Y L FAGHS
Sbjct: 121 ESDYKILLDNKLGQKMLGGGYVHRGLLKSAAWVLNQESETLWRVWEENGREYDLVFAGHS 180
Query: 181 LGSGVAAMLTLVVVQNHEKLENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDF 240
LGSGVAA++ ++VV + +I R ++RC+A+APARCMSLNLAV+YADVI+SV+LQDDF
Sbjct: 181 LGSGVAALMAVLVVNTPAMIGDIPRNKVRCFALAPARCMSLNLAVKYADVISSVILQDDF 240
Query: 241 LPRTATPLEDIFKSLFCLPCLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFR 300
LPRTATPLEDIFKS+FCLPCLL L C+RDT + E + L+DPRRLYAPGR+YHIVERK
Sbjct: 241 LPRTATPLEDIFKSVFCLPCLLFLVCLRDTFIPEGRKLRDPRRLYAPGRIYHIVERK--- 300
Query: 301 CGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMRK---DDKVMEIP 360
RFPP V+TA+PVDGRFEHIVLS NATSDHAI+WIE+EA+ AL+++R+ + V P
Sbjct: 301 FCRFPPEVRTAIPVDGRFEHIVLSSNATSDHAILWIEREAEKALQILREKSSETVVTMAP 360
Query: 361 PQQKMERQKTLAREHSEEYKAALQRAVTLAVPHAYALSPYGTFSQTDEGEEEKSLASSGG 420
+++MER TL +EH K AL+RAV+L +PHA + T E EEE + +
Sbjct: 361 KEKRMERLSTLEKEH----KDALERAVSLNIPHAVS---------TAEEEEECNNGEASA 420
Query: 421 SSRRKKETWDELIERLYDKDDSRHAVLKKSL 447
+ KK+ WDE++++L+ + +S VL ++
Sbjct: 421 ELKTKKKNWDEVVDKLFHRSNSGEFVLNDNV 432
BLAST of HG10009343 vs. TAIR 10
Match:
AT4G25600.1 (Oxoglutarate/iron-dependent oxygenase )
HSP 1 Score: 235.0 bits (598), Expect = 2.0e-61
Identity = 130/288 (45.14%), Postives = 176/288 (61.11%), Query Frame = 0
Query: 466 RKGLRDQLL-----DRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA 525
RK LRD+ + D SY S +DP+RV+Q+SW PRVFLY+GFLS+EECDHLI L
Sbjct: 28 RKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSWLPRVFLYRGFLSEEECDHLISLR 87
Query: 526 SSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQI 585
+ + S ++ G D ++A IE +++ WT LP ++ ++
Sbjct: 88 KETTEVYSVDADGK----------------TQLDPVVAGIEEKVSAWTFLPGENGGSIKV 147
Query: 586 MQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR 645
Y E++ K ++G + E L+ATVVLYLS++ +GGE+LFP S++K K +S
Sbjct: 148 RSYTSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSEMKPK--NSCL 207
Query: 646 KKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTV 705
+ N L PVKGNAILFF+ LNAS D S H R P++ GEL VATK Y K
Sbjct: 208 EGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYA------KKQA 267
Query: 706 ESDEDG-CIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 747
+E G C DED++C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Sbjct: 268 RIEESGECSDEDENCGRWAKLGECKKNPVYMIGSPDYYGTCRKSCNAC 291
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
CAB4301873.1 | 5.9e-289 | 66.05 | unnamed protein product [Prunus armeniaca] | [more] |
CAB4271435.1 | 1.0e-288 | 66.05 | unnamed protein product [Prunus armeniaca] | [more] |
RXH95088.1 | 2.6e-281 | 63.90 | hypothetical protein DVH24_024772 [Malus domestica] | [more] |
PON44192.1 | 1.0e-280 | 64.54 | Mono-/di-acylglycerol lipase [Trema orientale] | [more] |
PON51727.1 | 4.7e-278 | 62.96 | Mono-/di-acylglycerol lipase [Parasponia andersonii] | [more] |
Match Name | E-value | Identity | Description | |
Q8GXT7 | 2.8e-60 | 45.14 | Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 S... | [more] |
F4J0A8 | 7.5e-53 | 40.58 | Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=... | [more] |
Q8L970 | 1.8e-51 | 38.58 | Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=... | [more] |
F4JAU3 | 1.7e-44 | 36.52 | Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1 | [more] |
Q8LAN3 | 1.1e-43 | 34.98 | Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J5WND9 | 2.8e-289 | 66.05 | Procollagen-proline 4-dioxygenase OS=Prunus armeniaca OX=36596 GN=ORAREDHAP_LOCU... | [more] |
A0A6J5U8N9 | 4.9e-289 | 66.05 | Procollagen-proline 4-dioxygenase OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS17... | [more] |
A0A498JHB5 | 1.3e-281 | 63.90 | Procollagen-proline 4-dioxygenase OS=Malus domestica OX=3750 GN=DVH24_024772 PE=... | [more] |
A0A2P5B5Y0 | 4.9e-281 | 64.54 | Procollagen-proline 4-dioxygenase OS=Trema orientale OX=63057 GN=TorRG33x02_3316... | [more] |
A0A2P5BSE2 | 2.3e-278 | 62.96 | Procollagen-proline 4-dioxygenase OS=Parasponia andersonii OX=3476 GN=PanWU01x14... | [more] |