Homology
BLAST of HG10016739 vs. NCBI nr
Match:
XP_038881516.1 (uncharacterized protein LOC120073023 isoform X2 [Benincasa hispida])
HSP 1 Score: 2320.4 bits (6012), Expect = 0.0e+00
Identity = 1176/1275 (92.24%), Postives = 1205/1275 (94.51%), Query Frame = 0
Query: 1 MTGVYPFGLFRGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNN 60
MTGVYPFGLFRGLFH DFAKAIIS+LVLSCAFFHHAACGPCFISELQSASNEDSGHYMNN
Sbjct: 1 MTGVYPFGLFRGLFHSDFAKAIISMLVLSCAFFHHAACGPCFISELQSASNEDSGHYMNN 60
Query: 61 PASANGIHTTFPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLG 120
P NGIH TFPADISSGSNPTTHLSFESVCTDS LFCFPSTV DFSF EKGIGVEASLG
Sbjct: 61 P--TNGIHGTFPADISSGSNPTTHLSFESVCTDSHLFCFPSTVTDFSFKEKGIGVEASLG 120
Query: 121 LFDGSSPPVGSNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDST 180
LFDGSSP VGS QDDKLAANKSQSSDYG+FELFEGGIISCSLNSRQDVNELSSIQK++ST
Sbjct: 121 LFDGSSPAVGSTQDDKLAANKSQSSDYGIFELFEGGIISCSLNSRQDVNELSSIQKHEST 180
Query: 181 SKVDLSTCRGDPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPS 240
SKVDLSTCRGDPHYQTSPSSTQKKN DVTNS +SDSS+SPFVDISPTELDWEHKFLYLPS
Sbjct: 181 SKVDLSTCRGDPHYQTSPSSTQKKNLDVTNSDYSDSSLSPFVDISPTELDWEHKFLYLPS 240
Query: 241 LASITVTNTCNRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAH 300
LA ITVTNTCN+S LHIYEPFSTDSQFYSCNFSE VLGPGEAVSIYFVFLPKYLGLSSAH
Sbjct: 241 LALITVTNTCNQSTLHIYEPFSTDSQFYSCNFSEVVLGPGEAVSIYFVFLPKYLGLSSAH 300
Query: 301 LILQTSFGGFLVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTG 360
LILQT+FGGFLVPAKGFAIQSPYGIQPL SLNV SSGRWTKNLSLFNPY+DVLYVEELTG
Sbjct: 301 LILQTNFGGFLVPAKGFAIQSPYGIQPLLSLNVQSSGRWTKNLSLFNPYDDVLYVEELTG 360
Query: 361 WISVFKEDKCYHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIE 420
WIS+FKEDK YHTEAVCRVDRYQVFDEPKPS+IKEGLVVQHGH+ SPLLSMRPYKQWKIE
Sbjct: 361 WISIFKEDKHYHTEAVCRVDRYQVFDEPKPSVIKEGLVVQHGHIDSPLLSMRPYKQWKIE 420
Query: 421 PHSNETIIEVDLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSV 480
PHSNETIIEVDLSFEYG TIIGTFWLQLLRPSQDKPDVVAVSLEAELE GSTHDDHKGS+
Sbjct: 421 PHSNETIIEVDLSFEYGGTIIGTFWLQLLRPSQDKPDVVAVSLEAELERGSTHDDHKGSI 480
Query: 481 FASFEPLLYHGNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA 540
FASFEPLLYHGNVFVAL+LKNSA HL SVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA
Sbjct: 481 FASFEPLLYHGNVFVALSLKNSASHLFSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA 540
Query: 541 LITCNEEHAHFHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTE 600
LITCNE+ AHFHK SPEI NMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEY + SF E
Sbjct: 541 LITCNEQDAHFHKASPEIVNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYWKDSFME 600
Query: 601 DQKQNEHFSYGNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHV 660
D KQNEHFS G V T LANHV LQ EIKAVERAEADE+VLENWASMGT KSMSVLDEH
Sbjct: 601 DGKQNEHFSSGFVGTGFLANHVRLQPEIKAVERAEADELVLENWASMGTTKSMSVLDEHE 660
Query: 661 VFFPMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDS 720
VFFPMVEVGSHS KWITVKNPS+WPV+MQLIINSGEIIDECRDPEGFIHL SGG+IHNDS
Sbjct: 661 VFFPMVEVGSHSAKWITVKNPSEWPVVMQLIINSGEIIDECRDPEGFIHLSSGGLIHNDS 720
Query: 721 TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLR 780
TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPI F+PS+RCHWRSSVLIRNNLSGVEWLSLR
Sbjct: 721 TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPIFFYPSERCHWRSSVLIRNNLSGVEWLSLR 780
Query: 781 GYGGSSSLLLLEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKN 840
GYGGSSSLLLLE SKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSK+FYAKN
Sbjct: 781 GYGGSSSLLLLEGSKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKDFYAKN 840
Query: 841 TGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLEL 900
TGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGES KLTISY+TDLSA+VVYRDLEL
Sbjct: 841 TGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESKKLTISYKTDLSATVVYRDLEL 900
Query: 901 ALATGILVIPMKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISL 960
ALATGILVIPMKASLPFYMLNNCRKSV WTRLKKF+FAVLLISSVM LFFCWILPHMISL
Sbjct: 901 ALATGILVIPMKASLPFYMLNNCRKSVSWTRLKKFSFAVLLISSVMFLFFCWILPHMISL 960
Query: 961 SSLDCLCKNDIKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVI 1020
SLD CKN+IK ISSST+S+EK SV HSEKSSQ SDVWSVFEGE APQ SLQS SLVI
Sbjct: 961 GSLDFSCKNEIKRISSSTKSVEKTYSVRHSEKSSQLSDVWSVFEGEGAPQPSLQSKSLVI 1020
Query: 1021 ENSGAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTA 1080
ENS AVEASQPNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTA
Sbjct: 1021 ENSDAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTA 1080
Query: 1081 SSTPKRTWPMSPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDT 1140
SSTPKRTWPMSPDVNQSIEVSSLFARVVDETQCHKAQTS TSV NSPKPEVSVKNCIDT
Sbjct: 1081 SSTPKRTWPMSPDVNQSIEVSSLFARVVDETQCHKAQTSELTSVTNSPKPEVSVKNCIDT 1140
Query: 1141 LVSSSKETLPESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKL 1200
VSSSKET ESRKSYSKPIL PSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKL
Sbjct: 1141 FVSSSKETPLESRKSYSKPILQPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKL 1200
Query: 1201 FNQKAPLEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSP 1260
FNQKA LEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEK SDSFFETSP
Sbjct: 1201 FNQKASLEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKDSDSFFETSP 1260
Query: 1261 QTLIAKSQPMSDGKF 1276
QTLIAKSQPMS F
Sbjct: 1261 QTLIAKSQPMSVSSF 1273
BLAST of HG10016739 vs. NCBI nr
Match:
XP_038881515.1 (uncharacterized protein LOC120073023 isoform X1 [Benincasa hispida])
HSP 1 Score: 2298.9 bits (5956), Expect = 0.0e+00
Identity = 1166/1265 (92.17%), Postives = 1195/1265 (94.47%), Query Frame = 0
Query: 11 RGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNPASANGIHTT 70
RGLFH DFAKAIIS+LVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNP NGIH T
Sbjct: 20 RGLFHSDFAKAIISMLVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNP--TNGIHGT 79
Query: 71 FPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLGLFDGSSPPVG 130
FPADISSGSNPTTHLSFESVCTDS LFCFPSTV DFSF EKGIGVEASLGLFDGSSP VG
Sbjct: 80 FPADISSGSNPTTHLSFESVCTDSHLFCFPSTVTDFSFKEKGIGVEASLGLFDGSSPAVG 139
Query: 131 SNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDSTSKVDLSTCRG 190
S QDDKLAANKSQSSDYG+FELFEGGIISCSLNSRQDVNELSSIQK++STSKVDLSTCRG
Sbjct: 140 STQDDKLAANKSQSSDYGIFELFEGGIISCSLNSRQDVNELSSIQKHESTSKVDLSTCRG 199
Query: 191 DPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPSLASITVTNTC 250
DPHYQTSPSSTQKKN DVTNS +SDSS+SPFVDISPTELDWEHKFLYLPSLA ITVTNTC
Sbjct: 200 DPHYQTSPSSTQKKNLDVTNSDYSDSSLSPFVDISPTELDWEHKFLYLPSLALITVTNTC 259
Query: 251 NRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAHLILQTSFGGF 310
N+S LHIYEPFSTDSQFYSCNFSE VLGPGEAVSIYFVFLPKYLGLSSAHLILQT+FGGF
Sbjct: 260 NQSTLHIYEPFSTDSQFYSCNFSEVVLGPGEAVSIYFVFLPKYLGLSSAHLILQTNFGGF 319
Query: 311 LVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTGWISVFKEDKC 370
LVPAKGFAIQSPYGIQPL SLNV SSGRWTKNLSLFNPY+DVLYVEELTGWIS+FKEDK
Sbjct: 320 LVPAKGFAIQSPYGIQPLLSLNVQSSGRWTKNLSLFNPYDDVLYVEELTGWISIFKEDKH 379
Query: 371 YHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIEPHSNETIIEV 430
YHTEAVCRVDRYQVFDEPKPS+IKEGLVVQHGH+ SPLLSMRPYKQWKIEPHSNETIIEV
Sbjct: 380 YHTEAVCRVDRYQVFDEPKPSVIKEGLVVQHGHIDSPLLSMRPYKQWKIEPHSNETIIEV 439
Query: 431 DLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSVFASFEPLLYH 490
DLSFEYG TIIGTFWLQLLRPSQDKPDVVAVSLEAELE GSTHDDHKGS+FASFEPLLYH
Sbjct: 440 DLSFEYGGTIIGTFWLQLLRPSQDKPDVVAVSLEAELERGSTHDDHKGSIFASFEPLLYH 499
Query: 491 GNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNEEHAH 550
GNVFVAL+LKNSA HL SVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNE+ AH
Sbjct: 500 GNVFVALSLKNSASHLFSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNEQDAH 559
Query: 551 FHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTEDQKQNEHFSY 610
FHK SPEI NMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEY + SF ED KQNEHFS
Sbjct: 560 FHKASPEIVNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYWKDSFMEDGKQNEHFSS 619
Query: 611 GNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHVVFFPMVEVGS 670
G V T LANHV LQ EIKAVERAEADE+VLENWASMGT KSMSVLDEH VFFPMVEVGS
Sbjct: 620 GFVGTGFLANHVRLQPEIKAVERAEADELVLENWASMGTTKSMSVLDEHEVFFPMVEVGS 679
Query: 671 HSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDSTMPKKYGFSL 730
HS KWITVKNPS+WPV+MQLIINSGEIIDECRDPEGFIHL SGG+IHNDSTMPKKYGFSL
Sbjct: 680 HSAKWITVKNPSEWPVVMQLIINSGEIIDECRDPEGFIHLSSGGLIHNDSTMPKKYGFSL 739
Query: 731 AEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL 790
AEGAVTEAYVHPYGDVLFGPI F+PS+RCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL
Sbjct: 740 AEGAVTEAYVHPYGDVLFGPIFFYPSERCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL 799
Query: 791 LEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKNTGDLPLEFKK 850
LE SKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSK+FYAKNTGDLPLEFKK
Sbjct: 800 LEGSKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKDFYAKNTGDLPLEFKK 859
Query: 851 IKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLELALATGILVIP 910
IKISGTECALDGFLVHNCKDFALEPGES KLTISY+TDLSA+VVYRDLELALATGILVIP
Sbjct: 860 IKISGTECALDGFLVHNCKDFALEPGESKKLTISYKTDLSATVVYRDLELALATGILVIP 919
Query: 911 MKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISLSSLDCLCKND 970
MKASLPFYMLNNCRKSV WTRLKKF+FAVLLISSVM LFFCWILPHMISL SLD CKN+
Sbjct: 920 MKASLPFYMLNNCRKSVSWTRLKKFSFAVLLISSVMFLFFCWILPHMISLGSLDFSCKNE 979
Query: 971 IKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVIENSGAVEASQ 1030
IK ISSST+S+EK SV HSEKSSQ SDVWSVFEGE APQ SLQS SLVIENS AVEASQ
Sbjct: 980 IKRISSSTKSVEKTYSVRHSEKSSQLSDVWSVFEGEGAPQPSLQSKSLVIENSDAVEASQ 1039
Query: 1031 PNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPM 1090
PNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPM
Sbjct: 1040 PNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPM 1099
Query: 1091 SPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDTLVSSSKETLP 1150
SPDVNQSIEVSSLFARVVDETQCHKAQTS TSV NSPKPEVSVKNCIDT VSSSKET
Sbjct: 1100 SPDVNQSIEVSSLFARVVDETQCHKAQTSELTSVTNSPKPEVSVKNCIDTFVSSSKETPL 1159
Query: 1151 ESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKAPLEGE 1210
ESRKSYSKPIL PSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKA LEGE
Sbjct: 1160 ESRKSYSKPILQPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKASLEGE 1219
Query: 1211 GKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSPQTLIAKSQPM 1270
GKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEK SDSFFETSPQTLIAKSQPM
Sbjct: 1220 GKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKDSDSFFETSPQTLIAKSQPM 1279
Query: 1271 SDGKF 1276
S F
Sbjct: 1280 SVSSF 1282
BLAST of HG10016739 vs. NCBI nr
Match:
XP_038881517.1 (uncharacterized protein LOC120073023 isoform X3 [Benincasa hispida])
HSP 1 Score: 2296.5 bits (5950), Expect = 0.0e+00
Identity = 1165/1264 (92.17%), Postives = 1194/1264 (94.46%), Query Frame = 0
Query: 12 GLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNPASANGIHTTF 71
GLFH DFAKAIIS+LVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNP NGIH TF
Sbjct: 7 GLFHSDFAKAIISMLVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNP--TNGIHGTF 66
Query: 72 PADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLGLFDGSSPPVGS 131
PADISSGSNPTTHLSFESVCTDS LFCFPSTV DFSF EKGIGVEASLGLFDGSSP VGS
Sbjct: 67 PADISSGSNPTTHLSFESVCTDSHLFCFPSTVTDFSFKEKGIGVEASLGLFDGSSPAVGS 126
Query: 132 NQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDSTSKVDLSTCRGD 191
QDDKLAANKSQSSDYG+FELFEGGIISCSLNSRQDVNELSSIQK++STSKVDLSTCRGD
Sbjct: 127 TQDDKLAANKSQSSDYGIFELFEGGIISCSLNSRQDVNELSSIQKHESTSKVDLSTCRGD 186
Query: 192 PHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPSLASITVTNTCN 251
PHYQTSPSSTQKKN DVTNS +SDSS+SPFVDISPTELDWEHKFLYLPSLA ITVTNTCN
Sbjct: 187 PHYQTSPSSTQKKNLDVTNSDYSDSSLSPFVDISPTELDWEHKFLYLPSLALITVTNTCN 246
Query: 252 RSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAHLILQTSFGGFL 311
+S LHIYEPFSTDSQFYSCNFSE VLGPGEAVSIYFVFLPKYLGLSSAHLILQT+FGGFL
Sbjct: 247 QSTLHIYEPFSTDSQFYSCNFSEVVLGPGEAVSIYFVFLPKYLGLSSAHLILQTNFGGFL 306
Query: 312 VPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTGWISVFKEDKCY 371
VPAKGFAIQSPYGIQPL SLNV SSGRWTKNLSLFNPY+DVLYVEELTGWIS+FKEDK Y
Sbjct: 307 VPAKGFAIQSPYGIQPLLSLNVQSSGRWTKNLSLFNPYDDVLYVEELTGWISIFKEDKHY 366
Query: 372 HTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIEPHSNETIIEVD 431
HTEAVCRVDRYQVFDEPKPS+IKEGLVVQHGH+ SPLLSMRPYKQWKIEPHSNETIIEVD
Sbjct: 367 HTEAVCRVDRYQVFDEPKPSVIKEGLVVQHGHIDSPLLSMRPYKQWKIEPHSNETIIEVD 426
Query: 432 LSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSVFASFEPLLYHG 491
LSFEYG TIIGTFWLQLLRPSQDKPDVVAVSLEAELE GSTHDDHKGS+FASFEPLLYHG
Sbjct: 427 LSFEYGGTIIGTFWLQLLRPSQDKPDVVAVSLEAELERGSTHDDHKGSIFASFEPLLYHG 486
Query: 492 NVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNEEHAHF 551
NVFVAL+LKNSA HL SVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNE+ AHF
Sbjct: 487 NVFVALSLKNSASHLFSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNEQDAHF 546
Query: 552 HKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTEDQKQNEHFSYG 611
HK SPEI NMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEY + SF ED KQNEHFS G
Sbjct: 547 HKASPEIVNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYWKDSFMEDGKQNEHFSSG 606
Query: 612 NVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHVVFFPMVEVGSH 671
V T LANHV LQ EIKAVERAEADE+VLENWASMGT KSMSVLDEH VFFPMVEVGSH
Sbjct: 607 FVGTGFLANHVRLQPEIKAVERAEADELVLENWASMGTTKSMSVLDEHEVFFPMVEVGSH 666
Query: 672 STKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDSTMPKKYGFSLA 731
S KWITVKNPS+WPV+MQLIINSGEIIDECRDPEGFIHL SGG+IHNDSTMPKKYGFSLA
Sbjct: 667 SAKWITVKNPSEWPVVMQLIINSGEIIDECRDPEGFIHLSSGGLIHNDSTMPKKYGFSLA 726
Query: 732 EGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLLL 791
EGAVTEAYVHPYGDVLFGPI F+PS+RCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLLL
Sbjct: 727 EGAVTEAYVHPYGDVLFGPIFFYPSERCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLLL 786
Query: 792 EASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKNTGDLPLEFKKI 851
E SKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSK+FYAKNTGDLPLEFKKI
Sbjct: 787 EGSKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKDFYAKNTGDLPLEFKKI 846
Query: 852 KISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLELALATGILVIPM 911
KISGTECALDGFLVHNCKDFALEPGES KLTISY+TDLSA+VVYRDLELALATGILVIPM
Sbjct: 847 KISGTECALDGFLVHNCKDFALEPGESKKLTISYKTDLSATVVYRDLELALATGILVIPM 906
Query: 912 KASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISLSSLDCLCKNDI 971
KASLPFYMLNNCRKSV WTRLKKF+FAVLLISSVM LFFCWILPHMISL SLD CKN+I
Sbjct: 907 KASLPFYMLNNCRKSVSWTRLKKFSFAVLLISSVMFLFFCWILPHMISLGSLDFSCKNEI 966
Query: 972 KPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVIENSGAVEASQP 1031
K ISSST+S+EK SV HSEKSSQ SDVWSVFEGE APQ SLQS SLVIENS AVEASQP
Sbjct: 967 KRISSSTKSVEKTYSVRHSEKSSQLSDVWSVFEGEGAPQPSLQSKSLVIENSDAVEASQP 1026
Query: 1032 NYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPMS 1091
NYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPMS
Sbjct: 1027 NYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPMS 1086
Query: 1092 PDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDTLVSSSKETLPE 1151
PDVNQSIEVSSLFARVVDETQCHKAQTS TSV NSPKPEVSVKNCIDT VSSSKET E
Sbjct: 1087 PDVNQSIEVSSLFARVVDETQCHKAQTSELTSVTNSPKPEVSVKNCIDTFVSSSKETPLE 1146
Query: 1152 SRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKAPLEGEG 1211
SRKSYSKPIL PSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKA LEGEG
Sbjct: 1147 SRKSYSKPILQPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKASLEGEG 1206
Query: 1212 KSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSPQTLIAKSQPMS 1271
KSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEK SDSFFETSPQTLIAKSQPMS
Sbjct: 1207 KSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKDSDSFFETSPQTLIAKSQPMS 1266
Query: 1272 DGKF 1276
F
Sbjct: 1267 VSSF 1268
BLAST of HG10016739 vs. NCBI nr
Match:
XP_038881518.1 (uncharacterized protein LOC120073023 isoform X4 [Benincasa hispida])
HSP 1 Score: 2217.2 bits (5744), Expect = 0.0e+00
Identity = 1134/1265 (89.64%), Postives = 1163/1265 (91.94%), Query Frame = 0
Query: 11 RGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNPASANGIHTT 70
RGLFH DFAKAIIS+LVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNP NGIH T
Sbjct: 20 RGLFHSDFAKAIISMLVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNP--TNGIHVT 79
Query: 71 FPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLGLFDGSSPPVG 130
DFSF EKGIGVEASLGLFDGSSP VG
Sbjct: 80 ----------------------------------DFSFKEKGIGVEASLGLFDGSSPAVG 139
Query: 131 SNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDSTSKVDLSTCRG 190
S QDDKLAANKSQSSDYG+FELFEGGIISCSLNSRQDVNELSSIQK++STSKVDLSTCRG
Sbjct: 140 STQDDKLAANKSQSSDYGIFELFEGGIISCSLNSRQDVNELSSIQKHESTSKVDLSTCRG 199
Query: 191 DPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPSLASITVTNTC 250
DPHYQTSPSSTQKKN DVTNS +SDSS+SPFVDISPTELDWEHKFLYLPSLA ITVTNTC
Sbjct: 200 DPHYQTSPSSTQKKNLDVTNSDYSDSSLSPFVDISPTELDWEHKFLYLPSLALITVTNTC 259
Query: 251 NRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAHLILQTSFGGF 310
N+S LHIYEPFSTDSQFYSCNFSE VLGPGEAVSIYFVFLPKYLGLSSAHLILQT+FGGF
Sbjct: 260 NQSTLHIYEPFSTDSQFYSCNFSEVVLGPGEAVSIYFVFLPKYLGLSSAHLILQTNFGGF 319
Query: 311 LVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTGWISVFKEDKC 370
LVPAKGFAIQSPYGIQPL SLNV SSGRWTKNLSLFNPY+DVLYVEELTGWIS+FKEDK
Sbjct: 320 LVPAKGFAIQSPYGIQPLLSLNVQSSGRWTKNLSLFNPYDDVLYVEELTGWISIFKEDKH 379
Query: 371 YHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIEPHSNETIIEV 430
YHTEAVCRVDRYQVFDEPKPS+IKEGLVVQHGH+ SPLLSMRPYKQWKIEPHSNETIIEV
Sbjct: 380 YHTEAVCRVDRYQVFDEPKPSVIKEGLVVQHGHIDSPLLSMRPYKQWKIEPHSNETIIEV 439
Query: 431 DLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSVFASFEPLLYH 490
DLSFEYG TIIGTFWLQLLRPSQDKPDVVAVSLEAELE GSTHDDHKGS+FASFEPLLYH
Sbjct: 440 DLSFEYGGTIIGTFWLQLLRPSQDKPDVVAVSLEAELERGSTHDDHKGSIFASFEPLLYH 499
Query: 491 GNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNEEHAH 550
GNVFVAL+LKNSA HL SVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNE+ AH
Sbjct: 500 GNVFVALSLKNSASHLFSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNEQDAH 559
Query: 551 FHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTEDQKQNEHFSY 610
FHK SPEI NMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEY + SF ED KQNEHFS
Sbjct: 560 FHKASPEIVNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYWKDSFMEDGKQNEHFSS 619
Query: 611 GNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHVVFFPMVEVGS 670
G V T LANHV LQ EIKAVERAEADE+VLENWASMGT KSMSVLDEH VFFPMVEVGS
Sbjct: 620 GFVGTGFLANHVRLQPEIKAVERAEADELVLENWASMGTTKSMSVLDEHEVFFPMVEVGS 679
Query: 671 HSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDSTMPKKYGFSL 730
HS KWITVKNPS+WPV+MQLIINSGEIIDECRDPEGFIHL SGG+IHNDSTMPKKYGFSL
Sbjct: 680 HSAKWITVKNPSEWPVVMQLIINSGEIIDECRDPEGFIHLSSGGLIHNDSTMPKKYGFSL 739
Query: 731 AEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL 790
AEGAVTEAYVHPYGDVLFGPI F+PS+RCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL
Sbjct: 740 AEGAVTEAYVHPYGDVLFGPIFFYPSERCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL 799
Query: 791 LEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKNTGDLPLEFKK 850
LE SKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSK+FYAKNTGDLPLEFKK
Sbjct: 800 LEGSKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKDFYAKNTGDLPLEFKK 859
Query: 851 IKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLELALATGILVIP 910
IKISGTECALDGFLVHNCKDFALEPGES KLTISY+TDLSA+VVYRDLELALATGILVIP
Sbjct: 860 IKISGTECALDGFLVHNCKDFALEPGESKKLTISYKTDLSATVVYRDLELALATGILVIP 919
Query: 911 MKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISLSSLDCLCKND 970
MKASLPFYMLNNCRKSV WTRLKKF+FAVLLISSVM LFFCWILPHMISL SLD CKN+
Sbjct: 920 MKASLPFYMLNNCRKSVSWTRLKKFSFAVLLISSVMFLFFCWILPHMISLGSLDFSCKNE 979
Query: 971 IKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVIENSGAVEASQ 1030
IK ISSST+S+EK SV HSEKSSQ SDVWSVFEGE APQ SLQS SLVIENS AVEASQ
Sbjct: 980 IKRISSSTKSVEKTYSVRHSEKSSQLSDVWSVFEGEGAPQPSLQSKSLVIENSDAVEASQ 1039
Query: 1031 PNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPM 1090
PNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPM
Sbjct: 1040 PNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPM 1099
Query: 1091 SPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDTLVSSSKETLP 1150
SPDVNQSIEVSSLFARVVDETQCHKAQTS TSV NSPKPEVSVKNCIDT VSSSKET
Sbjct: 1100 SPDVNQSIEVSSLFARVVDETQCHKAQTSELTSVTNSPKPEVSVKNCIDTFVSSSKETPL 1159
Query: 1151 ESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKAPLEGE 1210
ESRKSYSKPIL PSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKA LEGE
Sbjct: 1160 ESRKSYSKPILQPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKASLEGE 1219
Query: 1211 GKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSPQTLIAKSQPM 1270
GKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEK SDSFFETSPQTLIAKSQPM
Sbjct: 1220 GKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKDSDSFFETSPQTLIAKSQPM 1248
Query: 1271 SDGKF 1276
S F
Sbjct: 1280 SVSSF 1248
BLAST of HG10016739 vs. NCBI nr
Match:
TYK12899.1 (O-Glycosyl hydrolases family 17 protein, putative isoform 2 [Cucumis melo var. makuwa])
HSP 1 Score: 2213.0 bits (5733), Expect = 0.0e+00
Identity = 1128/1289 (87.51%), Postives = 1175/1289 (91.16%), Query Frame = 0
Query: 1 MTGVYPFGLFRGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNN 60
MTGVYPFGLFRGLFH DF KA+ISILVL C FF HAACGPCFISELQSASNEDSGHYMNN
Sbjct: 1 MTGVYPFGLFRGLFHQDFVKAVISILVLLCVFFQHAACGPCFISELQSASNEDSGHYMNN 60
Query: 61 PASANGIHTTFPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLG 120
ANGI + FPADISSGSNPTTHLSFESVCTDSRLFCFPS V DFS+NEKGIGV AS G
Sbjct: 61 --HANGIRSNFPADISSGSNPTTHLSFESVCTDSRLFCFPSMVTDFSYNEKGIGVVASSG 120
Query: 121 LFDGSSPPVGSNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDST 180
LFDGSS PVGS QDDKLAAN++Q SDYGMFELFEGGIISCSLNSR DVNELSSIQKY ST
Sbjct: 121 LFDGSSSPVGSPQDDKLAANETQLSDYGMFELFEGGIISCSLNSRNDVNELSSIQKYGST 180
Query: 181 SKVDLSTCRGDPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPS 240
SKVDLSTCR DP+YQTSPSSTQKKN DVTNS +SDS M+PFVD+SPTEL+WEHKFLYLPS
Sbjct: 181 SKVDLSTCRRDPYYQTSPSSTQKKNLDVTNSDYSDSYMAPFVDVSPTELNWEHKFLYLPS 240
Query: 241 LASITVTNTCNRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAH 300
LASITV NTCN+S LHIYEPFSTDSQFYSCNFSE VLGPGEAVSIYFVFLPKYLGLSSAH
Sbjct: 241 LASITVMNTCNQSFLHIYEPFSTDSQFYSCNFSEVVLGPGEAVSIYFVFLPKYLGLSSAH 300
Query: 301 LILQTSFGGFLVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTG 360
LILQT+FGGFLVPAKGFAIQSPYGIQPL SLN+HSSG+WTKNLSLFNPY+DVLYVEELTG
Sbjct: 301 LILQTNFGGFLVPAKGFAIQSPYGIQPLLSLNIHSSGKWTKNLSLFNPYDDVLYVEELTG 360
Query: 361 WISVFKEDKCYHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIE 420
WISV KEDKCYHTEAVCRVDRY+VF EPKP IIKEGLVVQHGH+GSPLLSMRPYKQWKIE
Sbjct: 361 WISVLKEDKCYHTEAVCRVDRYKVFHEPKPLIIKEGLVVQHGHIGSPLLSMRPYKQWKIE 420
Query: 421 PHSNETIIEVDLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSV 480
PHSNETIIEVDLSFEYG TIIGTFWLQLLRPSQDK DVVAVSLEAELEGGSTHDDHKGSV
Sbjct: 421 PHSNETIIEVDLSFEYGGTIIGTFWLQLLRPSQDKSDVVAVSLEAELEGGSTHDDHKGSV 480
Query: 481 FASFEPLLYHGNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA 540
FASFEP+LYHGNVFVAL+LKNSA HL SVLKIIEVAE KVFEFKSLEGLLLFP TVTQVA
Sbjct: 481 FASFEPILYHGNVFVALSLKNSASHLFSVLKIIEVAERKVFEFKSLEGLLLFPETVTQVA 540
Query: 541 LITCNEEHAHFHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTE 600
LITCNE+HAHFHK SPEI NMY KCKLLVLTNESTSSHIEVPCKDIFLLCSEY + SF E
Sbjct: 541 LITCNEQHAHFHKDSPEIVNMYGKCKLLVLTNESTSSHIEVPCKDIFLLCSEYRKDSFME 600
Query: 601 DQKQNEHFSYGNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHV 660
++KQNEHFS GNVRT SL NHV QSEIK VERAEADE+VLENWASMGT KSMSVLDEH
Sbjct: 601 NEKQNEHFSSGNVRTGSLVNHVRSQSEIKDVERAEADELVLENWASMGTTKSMSVLDEHE 660
Query: 661 VFFPMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDS 720
VFFPMVEVGSHSTKWITVKNPS+WPV+MQLIINSGEIIDECR+PEGFIHL SG +I NDS
Sbjct: 661 VFFPMVEVGSHSTKWITVKNPSEWPVVMQLIINSGEIIDECRNPEGFIHLSSGALIQNDS 720
Query: 721 TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLR 780
TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPI+F+PS+RCHWRSSVLIRNNLSGVEWLSLR
Sbjct: 721 TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPIIFYPSERCHWRSSVLIRNNLSGVEWLSLR 780
Query: 781 GYGGSSSLLLLEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKN 840
GYGGSSSLLLLE SKPV SIEFELESPILLNISPSERSVHMEEISHACTLPLSK+FYAKN
Sbjct: 781 GYGGSSSLLLLEGSKPVFSIEFELESPILLNISPSERSVHMEEISHACTLPLSKDFYAKN 840
Query: 841 TGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLEL 900
+GDLPLEFKKIKISGTEC LDGFLVHNCK+FALEPGES KLTISYETDLSA+VVYRDLEL
Sbjct: 841 SGDLPLEFKKIKISGTECGLDGFLVHNCKNFALEPGESKKLTISYETDLSATVVYRDLEL 900
Query: 901 ALATGILVIPMKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISL 960
+LATGILV+PMKASLPFYMLNNCR+SVLWTRLKKF+FAVLLISS M LFFCWI+PHMISL
Sbjct: 901 SLATGILVVPMKASLPFYMLNNCRRSVLWTRLKKFSFAVLLISSAMFLFFCWIVPHMISL 960
Query: 961 SSLDCLCKNDIKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVI 1020
S LD L KN+IK I SST+S+EK CSVHHSEKSSQ SDVWSVFEGE PQS L S SLVI
Sbjct: 961 SPLDFLSKNEIKRILSSTKSVEKTCSVHHSEKSSQLSDVWSVFEGEGTPQSPLHSKSLVI 1020
Query: 1021 ENSGAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTA 1080
NS AVEASQPNYLTVKTGKERGRRRKKKKAGGMKL GLFEVSSSQSGNSTPSSPLSPT
Sbjct: 1021 GNSDAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLPGLFEVSSSQSGNSTPSSPLSPTV 1080
Query: 1081 SSTPKRTWPMSPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDT 1140
S TPKRTWPMSPDVNQSIE SS FARVVD T KAQTS PTSV N PKPE
Sbjct: 1081 SGTPKRTWPMSPDVNQSIEESSPFARVVDGT---KAQTSEPTSVTNLPKPE--------- 1140
Query: 1141 LVSSSKETLPESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKL 1200
++SSK T ESRK YSKPILL SATFPSAGRPAPNVICSPLAASTSKIALHARAPGSK
Sbjct: 1141 -MTSSKGTPSESRKCYSKPILLSSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKP 1200
Query: 1201 FNQKAPLEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSP 1260
FNQKA LEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDV PMIPS IEK SDSFFETSP
Sbjct: 1201 FNQKASLEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVHPMIPSAIEKDSDSFFETSP 1260
Query: 1261 QTLIAKSQPMSDGKFLLG--VDTQPDGLE 1288
QTLIAKSQP SD FL+ V+T D LE
Sbjct: 1261 QTLIAKSQPTSDDIFLVAGWVETNADELE 1274
BLAST of HG10016739 vs. ExPASy Swiss-Prot
Match:
A2VDJ0 (Transmembrane protein 131-like OS=Homo sapiens OX=9606 GN=TMEM131L PE=1 SV=2)
HSP 1 Score: 89.4 bits (220), Expect = 3.4e-16
Identity = 83/316 (26.27%), Postives = 140/316 (44.30%), Query Frame = 0
Query: 663 FPMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHN-DST 722
F + S K+ V+NPS WPV +QL+ + PE +HL +H T
Sbjct: 594 FSATALRSRMIKYFVVQNPSSWPVSLQLL-----PLSLYPKPEALVHL-----LHRWFGT 653
Query: 723 MPKKYGFSLAEGAVTEA--YVHPYG-DVLFG--------------PILFHPSDRCHWRSS 782
+ F+ E +TEA Y+ + + FG ++F P+D S
Sbjct: 654 DMQMINFTTGEFQLTEACPYLGTHSEESRFGILHLHLQPLEMKRVGVVFTPADYGKVTSL 713
Query: 783 VLIRNNLSGVEWLSLRGYGGSSSLLLLEASKPVI--SIEFELESPILLNISPSERSVHME 842
+LIRNNL+ ++ + + G+ G+ LL + P S+ F++ L++ +
Sbjct: 714 ILIRNNLTVIDMIGVEGF-GARELLKVGGRLPGAGGSLRFKVPESTLMDCRRQLKDSKQ- 773
Query: 843 EISHACTLPLSKEFYAKNTGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLT 902
L ++K F +N G LP+ +KI+G C GF V +C F+L+P S ++
Sbjct: 774 ------ILSITKNFKVENIGPLPITVSSLKINGYNCQGYGFEVLDCHQFSLDPNTSRDIS 833
Query: 903 ISYETDLSASVVYRDLELALATGI-LVIPMKASLPFYMLNNCRKSV--------LWTRLK 950
I + D ++S V RDL L A + + +LP ++L C V W RL
Sbjct: 834 IVFTPDFTSSWVIRDLSLVTAADLEFRFTLNVTLPHHLLPLCADVVPGPSWEESFW-RLT 890
BLAST of HG10016739 vs. ExPASy Swiss-Prot
Match:
Q08DV9 (Transmembrane protein 131-like OS=Bos taurus OX=9913 GN=TMEM131L PE=2 SV=2)
HSP 1 Score: 81.3 bits (199), Expect = 9.1e-14
Identity = 77/312 (24.68%), Postives = 138/312 (44.23%), Query Frame = 0
Query: 660 VVFFPMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHND 719
V+ F + + K+ VKNPS WPV +QL+ S + + +H G +
Sbjct: 590 VLNFSATALRNSMVKYFVVKNPSSWPVSLQLLPVS--LYPKPEAAARLLHKWFGTDMQMI 649
Query: 720 STMPKKYGFSLA--------EGA---VTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIR 779
+ ++ + A EG+ + ++ P G ++F P+D S +LIR
Sbjct: 650 NLTTSEFQLTKACPYLGVRSEGSRFGILHLHLQPLERKRVG-VVFTPADYGKVSSLILIR 709
Query: 780 NNLSGVEWLSLRGYGGSSSLLLLEASKPVI--SIEFELESPILLNISPSERSVHMEEISH 839
NNL+ ++ + + G+ G+ LL + P S+ F++ L++ +
Sbjct: 710 NNLTVIDMIGVEGF-GARELLKVGGRLPGTGGSLRFKVPESTLMDCRRQLKDSKQ----- 769
Query: 840 ACTLPLSKEFYAKNTGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLTISYE 899
L ++K F +N G LP+ +KI+G C GF V +C F+L P S ++I +
Sbjct: 770 --ILSITKNFKVENIGPLPITVTSLKINGYNCQGYGFEVLDCHQFSLGPNTSRDISIVFT 829
Query: 900 TDLSASVVYRDLELALATGI-LVIPMKASLPFYMLNNCRKSV--------LWTRLKKFTF 950
D ++S V R+L L A + + +LP ++L C V W RL F
Sbjct: 830 PDFTSSWVIRELTLVTAADLEFRFTLNVTLPHHLLPLCADVVPGPSWEESFW-RLTVFFV 889
BLAST of HG10016739 vs. ExPASy Swiss-Prot
Match:
Q3U3D7 (Transmembrane protein 131-like OS=Mus musculus OX=10090 GN=Tmem131l PE=1 SV=1)
HSP 1 Score: 58.2 bits (139), Expect = 8.3e-07
Identity = 133/588 (22.62%), Postives = 225/588 (38.27%), Query Frame = 0
Query: 660 VVFFPMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHL--------- 719
V+ F + + + K+ V+NP+ PV +QL+ + PE + L
Sbjct: 591 VLNFSATALRNSAVKYFVVRNPTPQPVSLQLL-----PLSLYPRPEAAVRLLHKWFGTDM 650
Query: 720 ----PSGGMIHNDSTMPKKYGFSLAEGAVTEAYVHPYG-DVLFGPILFHPSDRCHWRSSV 779
S G P + G E ++ +VH + ++F P+D S +
Sbjct: 651 QMVNLSTGEFQLTQACPYQ-GEPSEESSLGALHVHLQALETRRVGVVFTPADYGKVTSLI 710
Query: 780 LIRNNLSGVEWLSLRGYGGSSSLLLLEASKPVI--SIEFELESPILLNISPSERSVHMEE 839
LIRNNL+ V+ + + G+ G+ LL + P S+ F++ L++ H +
Sbjct: 711 LIRNNLTVVDMVGVEGF-GAQELLKVGGRLPGAGGSLRFKVPESTLMD-------CHRQL 770
Query: 840 ISHACTLPLSKEFYAKNTGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLTI 899
L ++K F +N G LP+ +KI+G C GF V +C F+L P S ++I
Sbjct: 771 KDSKQILSITKNFKVENIGPLPITVTSLKINGYNCQGYGFEVLDCHPFSLSPNTSRDISI 830
Query: 900 SYETDLSASVVYRDLELALATGI-LVIPMKASLPFYMLNNCRKSV--------LWTRLKK 959
+ D ++S V R+L L A + + +LP +ML C + V W RL
Sbjct: 831 VFTPDFTSSWVIRELTLVTAADLEFHFTLNVTLPHHMLPLCAEVVPGPSWEESFW-RLTV 890
Query: 960 FTFAVLLISSVMLLF--FCWILPHMISLSSLDCLCKNDIKPISSSTRSLEKNCSVHHSEK 1019
F ++ L+ +++ F +IL + +N + + S H
Sbjct: 891 FFVSLSLLGVILIAFQQAQYILMEFMKTRQR----QNGSSSSQQNGDPVAMISSHPHKST 950
Query: 1020 SSQFSDVWSVFEGERAPQSSLQSNSLVIENSGAVEASQPNYLTVKTGKERGRRRKKKKAG 1079
F D +S + R +S L + A + S Y G +KK K
Sbjct: 951 CKNFLDTYSPSDKGRG-KSCLPVGPSLSRLQNAAKRSPATY---------GHSQKKHKCS 1010
Query: 1080 GMKLAGLFEVSSSQSGNSTPSSPLSPT-ASSTPKRTWPMSPDVNQSIEVSSLFARVVD-- 1139
S++ S N T + T ASS + +V VS +A ++
Sbjct: 1011 FYYSKQKPSASAASSANVTTEEKQTVTLASSLSVAKEDICTNVLSENWVSLRYASGINGS 1070
Query: 1140 --------ETQCHKAQTSGPTSVKNSPKPEVSVKNCIDTL-------------VSSSKET 1197
+ HK ++S +V + E S+K + T V+ KE
Sbjct: 1071 LQKNLTLPKNVLHKEESSLKNTVVTNTPSECSMKEGVHTYMFPKETDSKISENVAELKEQ 1130
BLAST of HG10016739 vs. ExPASy Swiss-Prot
Match:
Q9V7H4 (Transmembrane protein 131 homolog OS=Drosophila melanogaster OX=7227 GN=CG8370 PE=1 SV=2)
HSP 1 Score: 52.4 bits (124), Expect = 4.5e-05
Identity = 37/160 (23.12%), Postives = 66/160 (41.25%), Query Frame = 0
Query: 204 KNHDVTNSGFSDSSMSPFVDIS-----PTELDWEHKFLYLPSLASITVTNTCNRSILHIY 263
K V F +P D+ P+ LD+ + ++T+ N + L +
Sbjct: 31 KGEKVLQETFLGLQEAPIHDLGDLRLVPSRLDFGTWSVGQARSQTVTLFNQHSNRTLQLN 90
Query: 264 EPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAHLILQTSFGGFLVPAKGFA 323
FYS + P + VFLP+ LG +A L++ TSFG + +G
Sbjct: 91 AVAGPSPAFYSSFLGTREVPPQGNTTFNVVFLPRQLGAIAADLLIHTSFGQAELAVQGEG 150
Query: 324 IQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEEL 359
+ PY ++PL + + T + ++NP+ L + E+
Sbjct: 151 SECPYRLKPLVGIKAPMNATLTPEIHMYNPHERPLQILEI 190
HSP 2 Score: 43.1 bits (100), Expect = 2.8e-02
Identity = 104/509 (20.43%), Postives = 188/509 (36.94%), Query Frame = 0
Query: 664 PMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEII--DECRDPEGFIHLPSGGMIHNDST 723
P +EVG +WIT+ NPS+ P+++ ++ + P I + S D
Sbjct: 789 PAIEVGQVQRQWITLTNPSQSPLLLDYFLSDPAFARRTQLSLPHEVIDVSSTSCYLTD-- 848
Query: 724 MPKKYGFSLAEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVE--WLSL 783
K FSL E + + P G L PI F + + + +R+NL+ E WL
Sbjct: 849 ---KEVFSLPEAG--DPILLPGGASLTIPITFSAQLPEKYCTLLHVRSNLTLYEAVWLQA 908
Query: 784 RGYGGSSSLLLLEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAK 843
R S +P + SP+L ++ ++ S + +++ F A+
Sbjct: 909 RAV---QSQFRFGNRRPGAA------SPLLFEMATNQFQGCQ---SGNEAVVVTRSFTAR 968
Query: 844 NTGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLE 903
N+G +P+ + I C GF V +C F L E+ K+ I++ D + S V R L
Sbjct: 969 NSGVIPIRIEGFLIGSLPCEDFGFKVMDCAGFDLGENEARKVEIAFSADFTTSAVKRSLT 1028
Query: 904 LAL-ATGILVIPMKASLPFYMLNNCRKSVL---WTRLKKFTFAVLLISSVMLLFFCWILP 963
L T + + A +P + C ++ W K V+L++S L+ +
Sbjct: 1029 LLTNLTYDISYKLLAQMPAESVELCASLLVRPGWESSLKNAALVVLLASFGLVLVAAVF- 1088
Query: 964 HMISLSSLDCLCKNDIKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQS 1023
D K I + + + + + ++ + E A ++
Sbjct: 1089 --------------DAKAIMVQQNAYDAARNKGPLQPTFNLRNIVKLQAEEAAAKAESVQ 1148
Query: 1024 NSLVIENSGAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSP 1083
++N E + V + + + + M + L + + S+P
Sbjct: 1149 QQQKVKNGQLKELRKRT--VVNSTNSKSKSKSSWSPWSMDMNALSKHLQKAKPKTVVSTP 1208
Query: 1084 LSPTASSTPKRT---WPMSPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSV--KNSPKP 1143
++P A+S P P + V +S S + + + K P V PK
Sbjct: 1209 VTPPAASAPAAAPVPLPEAKPVKKSSTPSPQGVPISVQVRPQKKVKPTPAVVLGTTKPKQ 1261
Query: 1144 EVSVKNCIDTLVSSSKETLPESRKSYSKP 1160
EVS S +K + P+ KP
Sbjct: 1269 EVSTPVADQHEKSLAKSSPPQQENISPKP 1261
BLAST of HG10016739 vs. ExPASy TrEMBL
Match:
A0A5D3CRD8 (O-Glycosyl hydrolases family 17 protein, putative isoform 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004930 PE=3 SV=1)
HSP 1 Score: 2213.0 bits (5733), Expect = 0.0e+00
Identity = 1128/1289 (87.51%), Postives = 1175/1289 (91.16%), Query Frame = 0
Query: 1 MTGVYPFGLFRGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNN 60
MTGVYPFGLFRGLFH DF KA+ISILVL C FF HAACGPCFISELQSASNEDSGHYMNN
Sbjct: 1 MTGVYPFGLFRGLFHQDFVKAVISILVLLCVFFQHAACGPCFISELQSASNEDSGHYMNN 60
Query: 61 PASANGIHTTFPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLG 120
ANGI + FPADISSGSNPTTHLSFESVCTDSRLFCFPS V DFS+NEKGIGV AS G
Sbjct: 61 --HANGIRSNFPADISSGSNPTTHLSFESVCTDSRLFCFPSMVTDFSYNEKGIGVVASSG 120
Query: 121 LFDGSSPPVGSNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDST 180
LFDGSS PVGS QDDKLAAN++Q SDYGMFELFEGGIISCSLNSR DVNELSSIQKY ST
Sbjct: 121 LFDGSSSPVGSPQDDKLAANETQLSDYGMFELFEGGIISCSLNSRNDVNELSSIQKYGST 180
Query: 181 SKVDLSTCRGDPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPS 240
SKVDLSTCR DP+YQTSPSSTQKKN DVTNS +SDS M+PFVD+SPTEL+WEHKFLYLPS
Sbjct: 181 SKVDLSTCRRDPYYQTSPSSTQKKNLDVTNSDYSDSYMAPFVDVSPTELNWEHKFLYLPS 240
Query: 241 LASITVTNTCNRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAH 300
LASITV NTCN+S LHIYEPFSTDSQFYSCNFSE VLGPGEAVSIYFVFLPKYLGLSSAH
Sbjct: 241 LASITVMNTCNQSFLHIYEPFSTDSQFYSCNFSEVVLGPGEAVSIYFVFLPKYLGLSSAH 300
Query: 301 LILQTSFGGFLVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTG 360
LILQT+FGGFLVPAKGFAIQSPYGIQPL SLN+HSSG+WTKNLSLFNPY+DVLYVEELTG
Sbjct: 301 LILQTNFGGFLVPAKGFAIQSPYGIQPLLSLNIHSSGKWTKNLSLFNPYDDVLYVEELTG 360
Query: 361 WISVFKEDKCYHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIE 420
WISV KEDKCYHTEAVCRVDRY+VF EPKP IIKEGLVVQHGH+GSPLLSMRPYKQWKIE
Sbjct: 361 WISVLKEDKCYHTEAVCRVDRYKVFHEPKPLIIKEGLVVQHGHIGSPLLSMRPYKQWKIE 420
Query: 421 PHSNETIIEVDLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSV 480
PHSNETIIEVDLSFEYG TIIGTFWLQLLRPSQDK DVVAVSLEAELEGGSTHDDHKGSV
Sbjct: 421 PHSNETIIEVDLSFEYGGTIIGTFWLQLLRPSQDKSDVVAVSLEAELEGGSTHDDHKGSV 480
Query: 481 FASFEPLLYHGNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA 540
FASFEP+LYHGNVFVAL+LKNSA HL SVLKIIEVAE KVFEFKSLEGLLLFP TVTQVA
Sbjct: 481 FASFEPILYHGNVFVALSLKNSASHLFSVLKIIEVAERKVFEFKSLEGLLLFPETVTQVA 540
Query: 541 LITCNEEHAHFHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTE 600
LITCNE+HAHFHK SPEI NMY KCKLLVLTNESTSSHIEVPCKDIFLLCSEY + SF E
Sbjct: 541 LITCNEQHAHFHKDSPEIVNMYGKCKLLVLTNESTSSHIEVPCKDIFLLCSEYRKDSFME 600
Query: 601 DQKQNEHFSYGNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHV 660
++KQNEHFS GNVRT SL NHV QSEIK VERAEADE+VLENWASMGT KSMSVLDEH
Sbjct: 601 NEKQNEHFSSGNVRTGSLVNHVRSQSEIKDVERAEADELVLENWASMGTTKSMSVLDEHE 660
Query: 661 VFFPMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDS 720
VFFPMVEVGSHSTKWITVKNPS+WPV+MQLIINSGEIIDECR+PEGFIHL SG +I NDS
Sbjct: 661 VFFPMVEVGSHSTKWITVKNPSEWPVVMQLIINSGEIIDECRNPEGFIHLSSGALIQNDS 720
Query: 721 TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLR 780
TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPI+F+PS+RCHWRSSVLIRNNLSGVEWLSLR
Sbjct: 721 TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPIIFYPSERCHWRSSVLIRNNLSGVEWLSLR 780
Query: 781 GYGGSSSLLLLEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKN 840
GYGGSSSLLLLE SKPV SIEFELESPILLNISPSERSVHMEEISHACTLPLSK+FYAKN
Sbjct: 781 GYGGSSSLLLLEGSKPVFSIEFELESPILLNISPSERSVHMEEISHACTLPLSKDFYAKN 840
Query: 841 TGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLEL 900
+GDLPLEFKKIKISGTEC LDGFLVHNCK+FALEPGES KLTISYETDLSA+VVYRDLEL
Sbjct: 841 SGDLPLEFKKIKISGTECGLDGFLVHNCKNFALEPGESKKLTISYETDLSATVVYRDLEL 900
Query: 901 ALATGILVIPMKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISL 960
+LATGILV+PMKASLPFYMLNNCR+SVLWTRLKKF+FAVLLISS M LFFCWI+PHMISL
Sbjct: 901 SLATGILVVPMKASLPFYMLNNCRRSVLWTRLKKFSFAVLLISSAMFLFFCWIVPHMISL 960
Query: 961 SSLDCLCKNDIKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVI 1020
S LD L KN+IK I SST+S+EK CSVHHSEKSSQ SDVWSVFEGE PQS L S SLVI
Sbjct: 961 SPLDFLSKNEIKRILSSTKSVEKTCSVHHSEKSSQLSDVWSVFEGEGTPQSPLHSKSLVI 1020
Query: 1021 ENSGAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTA 1080
NS AVEASQPNYLTVKTGKERGRRRKKKKAGGMKL GLFEVSSSQSGNSTPSSPLSPT
Sbjct: 1021 GNSDAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLPGLFEVSSSQSGNSTPSSPLSPTV 1080
Query: 1081 SSTPKRTWPMSPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDT 1140
S TPKRTWPMSPDVNQSIE SS FARVVD T KAQTS PTSV N PKPE
Sbjct: 1081 SGTPKRTWPMSPDVNQSIEESSPFARVVDGT---KAQTSEPTSVTNLPKPE--------- 1140
Query: 1141 LVSSSKETLPESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKL 1200
++SSK T ESRK YSKPILL SATFPSAGRPAPNVICSPLAASTSKIALHARAPGSK
Sbjct: 1141 -MTSSKGTPSESRKCYSKPILLSSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKP 1200
Query: 1201 FNQKAPLEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSP 1260
FNQKA LEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDV PMIPS IEK SDSFFETSP
Sbjct: 1201 FNQKASLEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVHPMIPSAIEKDSDSFFETSP 1260
Query: 1261 QTLIAKSQPMSDGKFLLG--VDTQPDGLE 1288
QTLIAKSQP SD FL+ V+T D LE
Sbjct: 1261 QTLIAKSQPTSDDIFLVAGWVETNADELE 1274
BLAST of HG10016739 vs. ExPASy TrEMBL
Match:
A0A6J1GFE5 (uncharacterized protein LOC111453427 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111453427 PE=3 SV=1)
HSP 1 Score: 2210.3 bits (5726), Expect = 0.0e+00
Identity = 1122/1271 (88.28%), Postives = 1173/1271 (92.29%), Query Frame = 0
Query: 1 MTGVYPFGLFRGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNN 60
MTG+YPFGLFRGLFHPDFA+AII IL+L CAFFHHAACGPCF S+LQ SNED+GHYMN+
Sbjct: 1 MTGIYPFGLFRGLFHPDFARAIIYILILLCAFFHHAACGPCFTSDLQPVSNEDNGHYMND 60
Query: 61 PASANGIHTTFPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLG 120
P A GIH+T PADISSGSNPT+ LSFESVCTDSRLFCFPSTV +FSFN+KGI VEAS
Sbjct: 61 P--AYGIHSTLPADISSGSNPTSRLSFESVCTDSRLFCFPSTVLEFSFNKKGIDVEAS-- 120
Query: 121 LFDGSSPPVGSNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDST 180
L GSSPPVGS QDDKLAA KSQSSDYGMFELFEGGI+SCSLNS QDV+ELSSIQKYDST
Sbjct: 121 LVGGSSPPVGSTQDDKLAAYKSQSSDYGMFELFEGGIVSCSLNSGQDVSELSSIQKYDST 180
Query: 181 SKVDLSTCRGDPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPS 240
SK DLSTCRGD H Q SPSS QKKN DVTNS SDSS+SP VDISPTELDWEHKFLYLPS
Sbjct: 181 SKFDLSTCRGDHHCQKSPSSGQKKNLDVTNSDLSDSSISPLVDISPTELDWEHKFLYLPS 240
Query: 241 LASITVTNTCNRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAH 300
LAS+TVTNTCNRS+LHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVF PKYLGLSS H
Sbjct: 241 LASLTVTNTCNRSVLHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFYPKYLGLSSGH 300
Query: 301 LILQTSFGGFLVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTG 360
LILQTSFGG LVPAKGFAIQSPYGIQPL SLN+HSSGRWTKNLSLFNPY+DVLYVEELTG
Sbjct: 301 LILQTSFGGLLVPAKGFAIQSPYGIQPLLSLNIHSSGRWTKNLSLFNPYDDVLYVEELTG 360
Query: 361 WISVFKEDKCYHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIE 420
WISV KEDKCYHTE VCRVDRYQVF+EPKPSI+KEGLVVQ GH+GSP LSMRPYKQWKIE
Sbjct: 361 WISVLKEDKCYHTEVVCRVDRYQVFEEPKPSIVKEGLVVQLGHIGSPSLSMRPYKQWKIE 420
Query: 421 PHSNETIIEVDLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSV 480
PHS E IIEVDLSFEYG TIIGTFWLQLLRPSQDKPDVVAV LEAELEGGSTH DHKGSV
Sbjct: 421 PHSTENIIEVDLSFEYGGTIIGTFWLQLLRPSQDKPDVVAVPLEAELEGGSTHADHKGSV 480
Query: 481 FASFEPLLYHGNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA 540
FASFEPLLYHGNVFVA+ALKNSA HLLSVLKIIEVAESKVFEFKSLEGLLLFPGTV+QVA
Sbjct: 481 FASFEPLLYHGNVFVAIALKNSASHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVSQVA 540
Query: 541 LITCNEEHAHFHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTE 600
LITCNE+HA K SPEIF+MYSKCKLL+LTNESTSSHIEVPCKDIFLLCSEY ++SF E
Sbjct: 541 LITCNEQHADVDKASPEIFSMYSKCKLLMLTNESTSSHIEVPCKDIFLLCSEYWKYSFME 600
Query: 601 DQKQNEHFSYGNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHV 660
KQNEHFS GNVR +LANHV LQSEIKAV AEADE+VLENWASMGTR+SMSVLDEH
Sbjct: 601 YGKQNEHFSSGNVREGTLANHVQLQSEIKAVAGAEADELVLENWASMGTRRSMSVLDEHD 660
Query: 661 VFFPMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDS 720
VFFPMVEVGSHSTKWITVKNPSKWPV+MQLIINSGEIIDEC+DPE FIHLPSGG+IHNDS
Sbjct: 661 VFFPMVEVGSHSTKWITVKNPSKWPVVMQLIINSGEIIDECKDPEEFIHLPSGGLIHNDS 720
Query: 721 TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLR 780
TMPKKYGFSLAE A+TEAYVHPYGDVLFGPILF+PS RCHWRSSVLIRNNLSGVEWLS+R
Sbjct: 721 TMPKKYGFSLAEDAITEAYVHPYGDVLFGPILFYPSGRCHWRSSVLIRNNLSGVEWLSMR 780
Query: 781 GYGGSSSLLLLEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKN 840
GYGGSSSLLLLE SKPVISI+FELESPILLNISPSERSVH EEISHACTLPL KEFYAKN
Sbjct: 781 GYGGSSSLLLLEGSKPVISIDFELESPILLNISPSERSVHKEEISHACTLPLLKEFYAKN 840
Query: 841 TGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLEL 900
TGDLPLEFKKIKISGTECALDGFLVHNCK FALEPGES KLTISY+TDLSASVVYRDLEL
Sbjct: 841 TGDLPLEFKKIKISGTECALDGFLVHNCKYFALEPGESKKLTISYQTDLSASVVYRDLEL 900
Query: 901 ALATGILVIPMKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISL 960
ALATGILVIPMKASLPFYML+NCRKSVLWTRLKKF+FAVLLISSVM L FCWI PHMISL
Sbjct: 901 ALATGILVIPMKASLPFYMLDNCRKSVLWTRLKKFSFAVLLISSVMFLLFCWIFPHMISL 960
Query: 961 SSLDCLCKNDIKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVI 1020
SSLD LCKN+IK +SSSTRS+EK CSVHH+EK SQFSDVWSVFEG+ AP+SSLQS SL I
Sbjct: 961 SSLDFLCKNEIKHLSSSTRSVEKACSVHHNEKRSQFSDVWSVFEGKGAPESSLQSKSLAI 1020
Query: 1021 ENSGAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTA 1080
ENS AVEASQPNYLTVKTGKERGRRRKKKK GGM LAGLFEVSSSQSGNSTPSSPLSPTA
Sbjct: 1021 ENSDAVEASQPNYLTVKTGKERGRRRKKKKGGGMNLAGLFEVSSSQSGNSTPSSPLSPTA 1080
Query: 1081 SSTPKRTWPMSPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDT 1140
S TPKR WPMSPDVNQSIE SSLF RV+DET HKAQTS PTSV +SPKPEVSVKNCID+
Sbjct: 1081 SGTPKRRWPMSPDVNQSIEASSLFDRVIDET--HKAQTSKPTSVMSSPKPEVSVKNCIDS 1140
Query: 1141 LVSSSKETLPESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKL 1200
LVSSSKET ESRKS SKPILLPSATFPSAGRPAPNVICSPLAAS SKI L ARAPGSKL
Sbjct: 1141 LVSSSKETPSESRKSCSKPILLPSATFPSAGRPAPNVICSPLAASASKIDLQARAPGSKL 1200
Query: 1201 FNQKAPLEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSP 1260
FN+KA LEGEGKSGIQDKYKYDIWGDHFSGLHLI KSKDV PMIPS IEK SDSFFETSP
Sbjct: 1201 FNRKASLEGEGKSGIQDKYKYDIWGDHFSGLHLIKKSKDVLPMIPSAIEKDSDSFFETSP 1260
Query: 1261 QTLIAKSQPMS 1272
QTLIAK+QP S
Sbjct: 1261 QTLIAKTQPTS 1265
BLAST of HG10016739 vs. ExPASy TrEMBL
Match:
A0A6J1IKS9 (uncharacterized protein LOC111477951 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111477951 PE=3 SV=1)
HSP 1 Score: 2204.9 bits (5712), Expect = 0.0e+00
Identity = 1120/1271 (88.12%), Postives = 1166/1271 (91.74%), Query Frame = 0
Query: 1 MTGVYPFGLFRGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNN 60
MTG+YPFGLFRGLFHPDFA+AII ILVL CAFFHHAACGPCF S+LQ SNEDSGH+MN+
Sbjct: 1 MTGIYPFGLFRGLFHPDFARAIIYILVLLCAFFHHAACGPCFTSDLQPVSNEDSGHFMND 60
Query: 61 PASANGIHTTFPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLG 120
P A GIH+T PADISSGSNPT+ LSFESVCTDSRLFCFPSTV +FSFN+KGI VEASLG
Sbjct: 61 P--AYGIHSTLPADISSGSNPTSRLSFESVCTDSRLFCFPSTVLEFSFNKKGIDVEASLG 120
Query: 121 LFDGSSPPVGSNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDST 180
LF GSSPPVGS Q+DKLAA KSQSSDYGMFELFEGGI+SCSLNS Q V+ELSSIQKYDST
Sbjct: 121 LFGGSSPPVGSTQNDKLAAYKSQSSDYGMFELFEGGIVSCSLNSGQGVSELSSIQKYDST 180
Query: 181 SKVDLSTCRGDPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPS 240
SK DLSTCRGD H + SPSS K DVTNS SDSS+SP VDISPTELDWEHKFLYLPS
Sbjct: 181 SKFDLSTCRGDHHCRKSPSSALKIKLDVTNSDLSDSSISPLVDISPTELDWEHKFLYLPS 240
Query: 241 LASITVTNTCNRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAH 300
LAS+TVTN CNRS+L IYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVF PKYLGLSS H
Sbjct: 241 LASLTVTNICNRSVLRIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFYPKYLGLSSGH 300
Query: 301 LILQTSFGGFLVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTG 360
LILQTSFGG LVPAKGFAIQSPYGIQPL SLN+HSSGRWTKNLSLFNPY+DVLYVEELTG
Sbjct: 301 LILQTSFGGLLVPAKGFAIQSPYGIQPLLSLNIHSSGRWTKNLSLFNPYDDVLYVEELTG 360
Query: 361 WISVFKEDKCYHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIE 420
WISV KEDKCYHTE VCRVDRYQVF+EPKPSI+KEGLVVQ GH+GSP SMRPYKQWKIE
Sbjct: 361 WISVLKEDKCYHTEVVCRVDRYQVFEEPKPSIVKEGLVVQLGHIGSPSFSMRPYKQWKIE 420
Query: 421 PHSNETIIEVDLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSV 480
P SNE IIEVDLSFEYG TIIGTFWLQLLRPSQDKPDVVAV EA+LEGGSTH DHKGSV
Sbjct: 421 PLSNENIIEVDLSFEYGGTIIGTFWLQLLRPSQDKPDVVAVPFEAQLEGGSTHADHKGSV 480
Query: 481 FASFEPLLYHGNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA 540
FASFEPLLYHGNVFVA+ALKNSA HLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA
Sbjct: 481 FASFEPLLYHGNVFVAIALKNSASHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVA 540
Query: 541 LITCNEEHAHFHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTE 600
LITCNE+HA K SPEIFNMYSKCKLL+LTNESTSSHIEVPC DIFLLCSEY ++SF E
Sbjct: 541 LITCNEQHADVDKASPEIFNMYSKCKLLMLTNESTSSHIEVPCNDIFLLCSEYWKYSFME 600
Query: 601 DQKQNEHFSYGNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHV 660
KQNEHFS GNVR SLANHV LQSEIKAV AEADE+VLENWASMGTR+SMSVLDEH
Sbjct: 601 YGKQNEHFSSGNVRAGSLANHVQLQSEIKAVAGAEADELVLENWASMGTRRSMSVLDEHD 660
Query: 661 VFFPMVEVGSHSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDS 720
VFFPMVEVGSHS KWITVKNPSKWPV+MQLIINSGEIIDEC+DPE FIHLPSG +IHNDS
Sbjct: 661 VFFPMVEVGSHSNKWITVKNPSKWPVVMQLIINSGEIIDECKDPEEFIHLPSGSLIHNDS 720
Query: 721 TMPKKYGFSLAEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLR 780
TMPKKYGFSLAE A+TEAYVHPYGDVLFGPILF+PS RCHWRSSVLIRNNLSGVEWLS+R
Sbjct: 721 TMPKKYGFSLAEDAITEAYVHPYGDVLFGPILFYPSGRCHWRSSVLIRNNLSGVEWLSMR 780
Query: 781 GYGGSSSLLLLEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKN 840
GYGGSSSLLLLE SKPVISIEFELESPILLNISPSERSVH EEISHACTLPL KEFYAKN
Sbjct: 781 GYGGSSSLLLLEGSKPVISIEFELESPILLNISPSERSVHKEEISHACTLPLLKEFYAKN 840
Query: 841 TGDLPLEFKKIKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLEL 900
TGDLPLEFKKIKISGTECALDGFLVHNCK FALEPGES KLTISY+TDLSASVVYRDLEL
Sbjct: 841 TGDLPLEFKKIKISGTECALDGFLVHNCKYFALEPGESKKLTISYQTDLSASVVYRDLEL 900
Query: 901 ALATGILVIPMKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISL 960
ALATGILVIPMKASLP YML+NCRKSVLWTRLKKF+FAVLLISSVM L FCWI PHMISL
Sbjct: 901 ALATGILVIPMKASLPIYMLDNCRKSVLWTRLKKFSFAVLLISSVMFLLFCWIFPHMISL 960
Query: 961 SSLDCLCKNDIKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVI 1020
SSLD L KN+IK ISSSTRS+EK CSVHH+EK SQFSDVWSVFEG+ AP+SSLQS SLVI
Sbjct: 961 SSLDFLYKNEIKHISSSTRSVEKACSVHHNEKRSQFSDVWSVFEGKGAPESSLQSKSLVI 1020
Query: 1021 ENSGAVEASQPNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTA 1080
ENS AVEASQPNYLTVKTGKERGRRRKKKK G M LAGLFEVSSSQSGNSTPSSPLSPTA
Sbjct: 1021 ENSDAVEASQPNYLTVKTGKERGRRRKKKKGGAMNLAGLFEVSSSQSGNSTPSSPLSPTA 1080
Query: 1081 SSTPKRTWPMSPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDT 1140
SSTPKR WPMSPDVNQSIE SSLF RV+DETQCHKAQTS PTSV +SPKPEVSVKNCID+
Sbjct: 1081 SSTPKRRWPMSPDVNQSIEASSLFDRVIDETQCHKAQTSKPTSVMSSPKPEVSVKNCIDS 1140
Query: 1141 LVSSSKETLPESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKL 1200
LVSSSKET ESRKS SKPILLPSATFPSAGRPAPNVICSPLAAS SKI L ARAPGSKL
Sbjct: 1141 LVSSSKETPSESRKSCSKPILLPSATFPSAGRPAPNVICSPLAASASKIDLQARAPGSKL 1200
Query: 1201 FNQKAPLEGEGKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSP 1260
FNQKA LEGEGKSGIQDKYKYDIWGDHFSGLHLI KSKDV PMIPS IEK SDSFFETSP
Sbjct: 1201 FNQKASLEGEGKSGIQDKYKYDIWGDHFSGLHLIKKSKDVLPMIPSAIEKDSDSFFETSP 1260
Query: 1261 QTLIAKSQPMS 1272
QTLIAKSQP S
Sbjct: 1261 QTLIAKSQPTS 1269
BLAST of HG10016739 vs. ExPASy TrEMBL
Match:
A0A0A0KJI8 (TMEM131_like domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G490270 PE=3 SV=1)
HSP 1 Score: 2202.2 bits (5705), Expect = 0.0e+00
Identity = 1120/1265 (88.54%), Postives = 1167/1265 (92.25%), Query Frame = 0
Query: 11 RGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNPASANGIHTT 70
RGL HPDFAKAIISILVL CAFF +AACGPCFISELQSASNED+GHYMNN ANGI +
Sbjct: 20 RGLLHPDFAKAIISILVLLCAFFQYAACGPCFISELQSASNEDTGHYMNN--HANGIRSN 79
Query: 71 FPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLGLFDGSSPPVG 130
FPADISSGSNPTTHLSFESVCTDSRLFCFPSTV DFSFNEKGIGV AS GLFDGSS PVG
Sbjct: 80 FPADISSGSNPTTHLSFESVCTDSRLFCFPSTVTDFSFNEKGIGVVASSGLFDGSSSPVG 139
Query: 131 SNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDSTSKVDLSTCRG 190
S QDDKLAANKSQSSDYGMFELFEGGIISCSLNSR+DVNELSSIQKY STS+VDLSTCRG
Sbjct: 140 STQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRKDVNELSSIQKYGSTSRVDLSTCRG 199
Query: 191 DPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPSLASITVTNTC 250
DP+YQTSPSSTQKKN DVTNS +SDSSM+PFVD+SPTEL+WEHKFLYLPSLASITVTNTC
Sbjct: 200 DPYYQTSPSSTQKKNLDVTNSDYSDSSMAPFVDVSPTELNWEHKFLYLPSLASITVTNTC 259
Query: 251 NRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAHLILQTSFGGF 310
N+S LHIYEPFSTDSQFYSCNFSE VLGPGEAVSIYFVFLPKYLGLSSAHLILQT+FGGF
Sbjct: 260 NQSFLHIYEPFSTDSQFYSCNFSEVVLGPGEAVSIYFVFLPKYLGLSSAHLILQTNFGGF 319
Query: 311 LVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTGWISVFKEDKC 370
LVPAKGFAIQSPYGIQPL SLN+HSSGRWTKNLSLFNPY+DVLYVEELTGWISVFKEDKC
Sbjct: 320 LVPAKGFAIQSPYGIQPLLSLNIHSSGRWTKNLSLFNPYDDVLYVEELTGWISVFKEDKC 379
Query: 371 YHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIEPHSNETIIEV 430
YHTEAVCRVDRY+VF EPKPSIIKEGLV+QHGH+GSPLLSMRPYKQWKIEPHSNETIIEV
Sbjct: 380 YHTEAVCRVDRYKVFHEPKPSIIKEGLVIQHGHIGSPLLSMRPYKQWKIEPHSNETIIEV 439
Query: 431 DLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSVFASFEPLLYH 490
DLSFEYG TIIGTFWLQLLRPSQDK DVVAVSLEAELEG STH+DHKGSVFASFEP+LYH
Sbjct: 440 DLSFEYGGTIIGTFWLQLLRPSQDKSDVVAVSLEAELEGWSTHNDHKGSVFASFEPILYH 499
Query: 491 GNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNEEHAH 550
GNVFVAL+LKNSA HL SVLK+IEVAESKVFEFKSLEGLLLFP TVTQVALITCNE+HAH
Sbjct: 500 GNVFVALSLKNSASHLFSVLKVIEVAESKVFEFKSLEGLLLFPETVTQVALITCNEQHAH 559
Query: 551 FHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTEDQKQNEHFSY 610
FHK SPEI N Y KCKLLVLTNESTS HIEVPC+DIFLLCS+Y + SF ED+KQNEHFS
Sbjct: 560 FHKDSPEIVNTYGKCKLLVLTNESTSPHIEVPCEDIFLLCSKYWKDSFMEDEKQNEHFSS 619
Query: 611 GNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHVVFFPMVEVGS 670
GNVRT SLANHV LQSEIK V+RAEADE+VLENWASMGTRKSMSVLDEH VFFPMVEVGS
Sbjct: 620 GNVRTGSLANHVSLQSEIKDVKRAEADELVLENWASMGTRKSMSVLDEHEVFFPMVEVGS 679
Query: 671 HSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDSTMPKKYGFSL 730
HSTKWITVKNPS+WPV+MQLIINSGEIIDEC DPEGF HL SG +I NDST+PKKYGFSL
Sbjct: 680 HSTKWITVKNPSEWPVVMQLIINSGEIIDECHDPEGFTHLSSGALIQNDSTLPKKYGFSL 739
Query: 731 AEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL 790
AE AVTEAYVHPYGDV FGPI+F+PS RCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL
Sbjct: 740 AEDAVTEAYVHPYGDVHFGPIIFYPSKRCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL 799
Query: 791 LEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKNTGDLPLEFKK 850
LE SKPV SIEFELESPILLNISPSERSVHMEEISHACTLPLSK+FYAKN+GDLPLEFKK
Sbjct: 800 LEGSKPVFSIEFELESPILLNISPSERSVHMEEISHACTLPLSKDFYAKNSGDLPLEFKK 859
Query: 851 IKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLELALATGILVIP 910
IKISGTEC LDGFLVHNCK+FALEPGES KLTISYETDLSA+VVYRDLELALATGILVIP
Sbjct: 860 IKISGTECGLDGFLVHNCKNFALEPGESKKLTISYETDLSATVVYRDLELALATGILVIP 919
Query: 911 MKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISLSSLDCLCKND 970
MKASLPFYMLNNCR+SVLWTRLKKF+FAVLLISS M LFFCWI+PHMISLS LD L KN+
Sbjct: 920 MKASLPFYMLNNCRRSVLWTRLKKFSFAVLLISSAMFLFFCWIVPHMISLSPLDFLSKNE 979
Query: 971 IKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVIENSGAVEASQ 1030
IK I SST+S+EK CSVHH EKSSQ SDVWSVFEGE P SSL S S+VIENS AVEASQ
Sbjct: 980 IKRILSSTKSVEKTCSVHHGEKSSQLSDVWSVFEGEGTPPSSLLSKSVVIENSDAVEASQ 1039
Query: 1031 PNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPM 1090
NYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPT S TPKRTWPM
Sbjct: 1040 SNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTVSGTPKRTWPM 1099
Query: 1091 SPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDTLVSSSKETLP 1150
SPDVNQSIEVSSLFARVVDET KAQTS PTSV NSPKPE ++SSK T
Sbjct: 1100 SPDVNQSIEVSSLFARVVDET---KAQTSEPTSVTNSPKPE----------ITSSKGTPL 1159
Query: 1151 ESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKAPLEGE 1210
ES KSYSKPILL SATFPSAGRPAPNVICSPLAASTSKIALHARAPGSK FNQKA LEGE
Sbjct: 1160 ESGKSYSKPILLSSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKPFNQKASLEGE 1219
Query: 1211 GKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSPQTLIAKSQPM 1270
GKSGIQDKYKYDIWGDHFSGLHLINKSKDV PMIPSTIEK SDSFFETSPQTLIAKSQP
Sbjct: 1220 GKSGIQDKYKYDIWGDHFSGLHLINKSKDVHPMIPSTIEKDSDSFFETSPQTLIAKSQPT 1269
Query: 1271 SDGKF 1276
S F
Sbjct: 1280 SVSSF 1269
BLAST of HG10016739 vs. ExPASy TrEMBL
Match:
A0A6J1GEH9 (uncharacterized protein LOC111453427 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453427 PE=3 SV=1)
HSP 1 Score: 2189.1 bits (5671), Expect = 0.0e+00
Identity = 1113/1261 (88.26%), Postives = 1163/1261 (92.23%), Query Frame = 0
Query: 11 RGLFHPDFAKAIISILVLSCAFFHHAACGPCFISELQSASNEDSGHYMNNPASANGIHTT 70
RGLFHPDFA+AII IL+L CAFFHHAACGPCF S+LQ SNED+GHYMN+P A GIH+T
Sbjct: 20 RGLFHPDFARAIIYILILLCAFFHHAACGPCFTSDLQPVSNEDNGHYMNDP--AYGIHST 79
Query: 71 FPADISSGSNPTTHLSFESVCTDSRLFCFPSTVPDFSFNEKGIGVEASLGLFDGSSPPVG 130
PADISSGSNPT+ LSFESVCTDSRLFCFPSTV +FSFN+KGI VEAS L GSSPPVG
Sbjct: 80 LPADISSGSNPTSRLSFESVCTDSRLFCFPSTVLEFSFNKKGIDVEAS--LVGGSSPPVG 139
Query: 131 SNQDDKLAANKSQSSDYGMFELFEGGIISCSLNSRQDVNELSSIQKYDSTSKVDLSTCRG 190
S QDDKLAA KSQSSDYGMFELFEGGI+SCSLNS QDV+ELSSIQKYDSTSK DLSTCRG
Sbjct: 140 STQDDKLAAYKSQSSDYGMFELFEGGIVSCSLNSGQDVSELSSIQKYDSTSKFDLSTCRG 199
Query: 191 DPHYQTSPSSTQKKNHDVTNSGFSDSSMSPFVDISPTELDWEHKFLYLPSLASITVTNTC 250
D H Q SPSS QKKN DVTNS SDSS+SP VDISPTELDWEHKFLYLPSLAS+TVTNTC
Sbjct: 200 DHHCQKSPSSGQKKNLDVTNSDLSDSSISPLVDISPTELDWEHKFLYLPSLASLTVTNTC 259
Query: 251 NRSILHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFLPKYLGLSSAHLILQTSFGGF 310
NRS+LHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVF PKYLGLSS HLILQTSFGG
Sbjct: 260 NRSVLHIYEPFSTDSQFYSCNFSEAVLGPGEAVSIYFVFYPKYLGLSSGHLILQTSFGGL 319
Query: 311 LVPAKGFAIQSPYGIQPLSSLNVHSSGRWTKNLSLFNPYNDVLYVEELTGWISVFKEDKC 370
LVPAKGFAIQSPYGIQPL SLN+HSSGRWTKNLSLFNPY+DVLYVEELTGWISV KEDKC
Sbjct: 320 LVPAKGFAIQSPYGIQPLLSLNIHSSGRWTKNLSLFNPYDDVLYVEELTGWISVLKEDKC 379
Query: 371 YHTEAVCRVDRYQVFDEPKPSIIKEGLVVQHGHMGSPLLSMRPYKQWKIEPHSNETIIEV 430
YHTE VCRVDRYQVF+EPKPSI+KEGLVVQ GH+GSP LSMRPYKQWKIEPHS E IIEV
Sbjct: 380 YHTEVVCRVDRYQVFEEPKPSIVKEGLVVQLGHIGSPSLSMRPYKQWKIEPHSTENIIEV 439
Query: 431 DLSFEYGATIIGTFWLQLLRPSQDKPDVVAVSLEAELEGGSTHDDHKGSVFASFEPLLYH 490
DLSFEYG TIIGTFWLQLLRPSQDKPDVVAV LEAELEGGSTH DHKGSVFASFEPLLYH
Sbjct: 440 DLSFEYGGTIIGTFWLQLLRPSQDKPDVVAVPLEAELEGGSTHADHKGSVFASFEPLLYH 499
Query: 491 GNVFVALALKNSAFHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVTQVALITCNEEHAH 550
GNVFVA+ALKNSA HLLSVLKIIEVAESKVFEFKSLEGLLLFPGTV+QVALITCNE+HA
Sbjct: 500 GNVFVAIALKNSASHLLSVLKIIEVAESKVFEFKSLEGLLLFPGTVSQVALITCNEQHAD 559
Query: 551 FHKTSPEIFNMYSKCKLLVLTNESTSSHIEVPCKDIFLLCSEYLEHSFTEDQKQNEHFSY 610
K SPEIF+MYSKCKLL+LTNESTSSHIEVPCKDIFLLCSEY ++SF E KQNEHFS
Sbjct: 560 VDKASPEIFSMYSKCKLLMLTNESTSSHIEVPCKDIFLLCSEYWKYSFMEYGKQNEHFSS 619
Query: 611 GNVRTESLANHVPLQSEIKAVERAEADEMVLENWASMGTRKSMSVLDEHVVFFPMVEVGS 670
GNVR +LANHV LQSEIKAV AEADE+VLENWASMGTR+SMSVLDEH VFFPMVEVGS
Sbjct: 620 GNVREGTLANHVQLQSEIKAVAGAEADELVLENWASMGTRRSMSVLDEHDVFFPMVEVGS 679
Query: 671 HSTKWITVKNPSKWPVIMQLIINSGEIIDECRDPEGFIHLPSGGMIHNDSTMPKKYGFSL 730
HSTKWITVKNPSKWPV+MQLIINSGEIIDEC+DPE FIHLPSGG+IHNDSTMPKKYGFSL
Sbjct: 680 HSTKWITVKNPSKWPVVMQLIINSGEIIDECKDPEEFIHLPSGGLIHNDSTMPKKYGFSL 739
Query: 731 AEGAVTEAYVHPYGDVLFGPILFHPSDRCHWRSSVLIRNNLSGVEWLSLRGYGGSSSLLL 790
AE A+TEAYVHPYGDVLFGPILF+PS RCHWRSSVLIRNNLSGVEWLS+RGYGGSSSLLL
Sbjct: 740 AEDAITEAYVHPYGDVLFGPILFYPSGRCHWRSSVLIRNNLSGVEWLSMRGYGGSSSLLL 799
Query: 791 LEASKPVISIEFELESPILLNISPSERSVHMEEISHACTLPLSKEFYAKNTGDLPLEFKK 850
LE SKPVISI+FELESPILLNISPSERSVH EEISHACTLPL KEFYAKNTGDLPLEFKK
Sbjct: 800 LEGSKPVISIDFELESPILLNISPSERSVHKEEISHACTLPLLKEFYAKNTGDLPLEFKK 859
Query: 851 IKISGTECALDGFLVHNCKDFALEPGESIKLTISYETDLSASVVYRDLELALATGILVIP 910
IKISGTECALDGFLVHNCK FALEPGES KLTISY+TDLSASVVYRDLELALATGILVIP
Sbjct: 860 IKISGTECALDGFLVHNCKYFALEPGESKKLTISYQTDLSASVVYRDLELALATGILVIP 919
Query: 911 MKASLPFYMLNNCRKSVLWTRLKKFTFAVLLISSVMLLFFCWILPHMISLSSLDCLCKND 970
MKASLPFYML+NCRKSVLWTRLKKF+FAVLLISSVM L FCWI PHMISLSSLD LCKN+
Sbjct: 920 MKASLPFYMLDNCRKSVLWTRLKKFSFAVLLISSVMFLLFCWIFPHMISLSSLDFLCKNE 979
Query: 971 IKPISSSTRSLEKNCSVHHSEKSSQFSDVWSVFEGERAPQSSLQSNSLVIENSGAVEASQ 1030
IK +SSSTRS+EK CSVHH+EK SQFSDVWSVFEG+ AP+SSLQS SL IENS AVEASQ
Sbjct: 980 IKHLSSSTRSVEKACSVHHNEKRSQFSDVWSVFEGKGAPESSLQSKSLAIENSDAVEASQ 1039
Query: 1031 PNYLTVKTGKERGRRRKKKKAGGMKLAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPM 1090
PNYLTVKTGKERGRRRKKKK GGM LAGLFEVSSSQSGNSTPSSPLSPTAS TPKR WPM
Sbjct: 1040 PNYLTVKTGKERGRRRKKKKGGGMNLAGLFEVSSSQSGNSTPSSPLSPTASGTPKRRWPM 1099
Query: 1091 SPDVNQSIEVSSLFARVVDETQCHKAQTSGPTSVKNSPKPEVSVKNCIDTLVSSSKETLP 1150
SPDVNQSIE SSLF RV+DET HKAQTS PTSV +SPKPEVSVKNCID+LVSSSKET
Sbjct: 1100 SPDVNQSIEASSLFDRVIDET--HKAQTSKPTSVMSSPKPEVSVKNCIDSLVSSSKETPS 1159
Query: 1151 ESRKSYSKPILLPSATFPSAGRPAPNVICSPLAASTSKIALHARAPGSKLFNQKAPLEGE 1210
ESRKS SKPILLPSATFPSAGRPAPNVICSPLAAS SKI L ARAPGSKLFN+KA LEGE
Sbjct: 1160 ESRKSCSKPILLPSATFPSAGRPAPNVICSPLAASASKIDLQARAPGSKLFNRKASLEGE 1219
Query: 1211 GKSGIQDKYKYDIWGDHFSGLHLINKSKDVPPMIPSTIEKVSDSFFETSPQTLIAKSQPM 1270
GKSGIQDKYKYDIWGDHFSGLHLI KSKDV PMIPS IEK SDSFFETSPQTLIAK+QP
Sbjct: 1220 GKSGIQDKYKYDIWGDHFSGLHLIKKSKDVLPMIPSAIEKDSDSFFETSPQTLIAKTQPT 1274
Query: 1271 S 1272
S
Sbjct: 1280 S 1274
BLAST of HG10016739 vs. TAIR 10
Match:
AT5G66820.1 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 154.1 bits (388), Expect = 7.9e-37
Identity = 161/513 (31.38%), Postives = 230/513 (44.83%), Query Frame = 0
Query: 763 SSVLIRNNLSGVEWLSLRGYGGSSSLLLLEASKPVISIEFELESPILLNISPSERSVHME 822
SS LIR NLSGV WLSL KPV IEF+ P H
Sbjct: 121 SSALIRKNLSGVVWLSL---------------KPVHIIEFQ----------PFTGFFH-- 180
Query: 823 EISHACTLPLSKEFYAKNTGDLPLEFKKIKISGTECALDGFLV-HNCKDFALEPGESIKL 882
I C P+SKE Y K T I +SG +C +GF+V H C+ F+LEPG+SIK
Sbjct: 181 -IGDTCYEPMSKELYTKKT----TRELSITVSGKQCGGNGFMVNHPCEGFSLEPGDSIKF 240
Query: 883 TISYETDLSASVVYRDLELALATGILVIPMKASLPFYMLNNCRKSVLWTRLKKFTFAVLL 942
Y+++LS A + +PMKA+ P ML+ +K V W R KKF AVL+
Sbjct: 241 LFFYQSELS---------WASGVAVFAVPMKATAPVLMLSLYKKPVFWVRTKKFAIAVLI 300
Query: 943 ISSVMLLFFCW---ILPHMISLSSLDCLCKNDIKPISSSTRSLEKNCSVHHSEKSSQFSD 1002
+++++L FC+ + ++ + + +++ S+ T S E + + K S
Sbjct: 301 AAALLILIFCFNDHFIEENNKRNNSNHMESREVEKPSTITISPEMDSLLRSISKES---- 360
Query: 1003 VWSVFEGERAPQSSLQSNSLVIENSGAVEASQPNYLTVKTGKERGRRR-KKKKAGGMK-- 1062
VF + P++S S+ + +S EAS+ LTVKT K++ RRR KKKK GG+
Sbjct: 361 -LQVF--DEVPKNS--SSVKPVASSHEEEASEAVNLTVKTAKDKKRRRNKKKKKGGINGL 420
Query: 1063 LAGLFEVSSSQSGNSTPSSPLSPTASSTPKRTWPMSPDVNQSIEVSSLFARVVDETQCHK 1122
+VSSS SGNSTP SP+SP +T T + P
Sbjct: 421 TPECTDVSSSYSGNSTPRSPISPEPPTTQAATKLVKP----------------------- 480
Query: 1123 AQTSGPTSVKNSPKPEVSVKNCIDTLVSSSKETLPESRKSYSKPILLPSATFPSAGRPAP 1182
PT KP+L SATFP +G
Sbjct: 481 -----PT-----------------------------------KPVLSHSATFPVSG---- 507
Query: 1183 NVICSPLAASTSKIALHARAPGSKLFNQKAPLEGEGKSGIQDKYKYDIWGDHFSGLHLIN 1242
+ S +A + RAPG+ K+ E + + + +Y YDIWGDH +GL+L++
Sbjct: 541 ---VKSMIIQRSSLAPNVRAPGA-----KSRTEVKEEKAKEYRY-YDIWGDHLTGLNLMD 507
Query: 1243 KSKDVPPMIPSTIE-KVSDSFFETSPQTLIAKS 1268
K K+V S + + +SFF PQ L+A S
Sbjct: 601 KLKEVREGKSSGFDGEECESFFVKGPQNLLADS 507
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038881516.1 | 0.0e+00 | 92.24 | uncharacterized protein LOC120073023 isoform X2 [Benincasa hispida] | [more] |
XP_038881515.1 | 0.0e+00 | 92.17 | uncharacterized protein LOC120073023 isoform X1 [Benincasa hispida] | [more] |
XP_038881517.1 | 0.0e+00 | 92.17 | uncharacterized protein LOC120073023 isoform X3 [Benincasa hispida] | [more] |
XP_038881518.1 | 0.0e+00 | 89.64 | uncharacterized protein LOC120073023 isoform X4 [Benincasa hispida] | [more] |
TYK12899.1 | 0.0e+00 | 87.51 | O-Glycosyl hydrolases family 17 protein, putative isoform 2 [Cucumis melo var. m... | [more] |
Match Name | E-value | Identity | Description | |
A2VDJ0 | 3.4e-16 | 26.27 | Transmembrane protein 131-like OS=Homo sapiens OX=9606 GN=TMEM131L PE=1 SV=2 | [more] |
Q08DV9 | 9.1e-14 | 24.68 | Transmembrane protein 131-like OS=Bos taurus OX=9913 GN=TMEM131L PE=2 SV=2 | [more] |
Q3U3D7 | 8.3e-07 | 22.62 | Transmembrane protein 131-like OS=Mus musculus OX=10090 GN=Tmem131l PE=1 SV=1 | [more] |
Q9V7H4 | 4.5e-05 | 23.13 | Transmembrane protein 131 homolog OS=Drosophila melanogaster OX=7227 GN=CG8370 P... | [more] |
Match Name | E-value | Identity | Description | |
A0A5D3CRD8 | 0.0e+00 | 87.51 | O-Glycosyl hydrolases family 17 protein, putative isoform 2 OS=Cucumis melo var.... | [more] |
A0A6J1GFE5 | 0.0e+00 | 88.28 | uncharacterized protein LOC111453427 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1IKS9 | 0.0e+00 | 88.12 | uncharacterized protein LOC111477951 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A0A0KJI8 | 0.0e+00 | 88.54 | TMEM131_like domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G49027... | [more] |
A0A6J1GEH9 | 0.0e+00 | 88.26 | uncharacterized protein LOC111453427 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT5G66820.1 | 7.9e-37 | 31.38 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ... | [more] |