HG10023069 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023069
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: tolB protein-related
Location: Chr05: 30889675 .. 30891612 (+)
RNA-Seq Expression: HG10023069
Synteny: HG10023069
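The location field in the overview can be turned into a feature length with a few lines of Python. This is only a sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(text):
    """Parse a location such as 'Chr05: 30889675 .. 30891612 (+)'.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    m = re.search(r"(\w+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive coordinates, so add 1 to the difference.
        "length": end - start + 1,
    }


loc = parse_location("Chr05: 30889675 .. 30891612 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr05 + 1938
```

Under that assumption the feature spans 1938 bp, a multiple of three, which is what one would expect if the gene model consists of a single uninterrupted coding exon.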
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATAATCCCACCGGCGCCGTCCTCTTCACCACCATCGGCCTCCAACAATACGGCTTCGACATCTTCTCTGTTCCACTCAATTCTCCCACCACCGAACGCCGCCTCACCGACGGAATCTCCGTCAATTTCAACGCCCAATTCCTCGATAACCAACTTTCCGTCGTCTTCATCTCTGAACGCTCCGGCTCCTCTAGAATTTACCTCTCCGATTCCCCTAATTCCGCCCCTAAATTGCTCCCTTCCGCCCCCGGAAGCTTCTTTAACGACCGACCCATCGTCAGAAGTGGTCGACTCTTTTTCATCTCTGCTCATGAAAATCCCCAAAAGCCCTTCACGAGCTGGTCCGCTCTGTATTCCACTGCGTTGGACGGTGCCGATTCGATTACCCGGCTGACTCCTCCCGGGTCCGTCGATTTCAGTCCGGCGGTTTCGGAAAGTGGTAAGTTTGTGGCGGTTGCTTCTTATGGCTCTCGTTCTTGGGGCGGCGAATTTCACGAGCTCCACACAGAAATTGTTGTTTTTAAGTCTTCTGACCCGGATCGACGGGTTGTGGTTTCCGGTCGGGGTGGATGGCCGTCGTGGTCCGGCGACTCCACTGTCTTCTTCCACCGGAAGGCCGACGATGGGTGGTGGAGCATTTTCAGAGTGGAGATTCCTGAAAATCTTGATTCCTCTGTTTCCCCTGTCCCGACCCGGGTCACTCCAGCGGGTCTCCATTGCTTCACTCCGGCTGCCATGAACGACAGAAAACGACTTGTCGTCGCTACCCGAAGAGCCGACAGTAAATTCCGACACATCGAGATTTTCAATTCCGAATTGAACGAATTCATCCCAATTACCCAGAAATTGAATCCCGATTTCCATCATTACAATCCGTTTGTCTCGCCGGATTCCAACTCCATCGGGTACCACCGATTTCGCGGCGAGTCAGCTCAGGGCGAGTTAACAATTCCATATCTCAACCCGGTAATTTCCCCAATTAATGAACTCCGTATGATCCGAATAAACGGTTCGTTCCCAACGCCGTCACCTGACGGCGATTTAATCGCGTTTAATCCCAATTTCATGGGATTACAAATCGTCAAATCAGACGGCTCAAAATGTCGAACCGTTTTAAAAGATCGAACCGCTTTTTACAATTCGTGGAGCCCAACGGAAAAAAACGTGATTTACACTTCGTTGGGGCCCATTTTCGGGGCGGTAACAGCGACGGTCCAGATCGCTCGGATTACGATCAACTCTGATGATCTCAACGGTGGTCGTAGCGATGAAATTGCCAGTGAAGTGAAGATTCTCACAAAAGACGACACCGGAAACAACGCCTTTCCGGCGTGCTCGCCGGACGGTAAGACTCTGGTCTTCCGATCCGGTCGGTCAGGGCACAAGAATCTTTACATCGTCGACGCCGTGAATGGCGAGTTTGACGGCGAATTACGGCGGCTGACCGACGGACCGTGGATAGACACGATGCCAAGTTGGTCGCCGGCGGGAGATCTCATAGTATTTTCTTCAAACATGCACAATCCAAAGAACACCGAAGCATTTAGTATTTATGTCATCAGACCGGACGGTTCGGGGTTGAGGAGAGTACACGTGGCTGGACCGGAAGGGTCTAGTGACGTGGACAGAGAGAGAATTAACCACGTGTGTTTTAGTAGGGACGGTAAATGGCTTTTATTTACGGCGAATTTGAGTGGGGTGACAGCGGAGCCGGTGTCGTTGCCCAATCAGTTTCAGCCATATGGCGATTTGTTTGTGGTGAAATTGGATGGGACCGGTTTGAGGAGGCTGACGTGTAGTGGATATGAAAATGGAACTCCCATGTGGTATTATGGGAGTGAGCTGGCGCTTTCTGGGTTGAGTTTGAAAGATGAGGTGGTTGGTGAGAAGTTGAAGGGAGAGTTTGATGAACCGCTTTGGATAAAGTTCGATTGA

mRNA sequence

ATGGATAATCCCACCGGCGCCGTCCTCTTCACCACCATCGGCCTCCAACAATACGGCTTCGACATCTTCTCTGTTCCACTCAATTCTCCCACCACCGAACGCCGCCTCACCGACGGAATCTCCGTCAATTTCAACGCCCAATTCCTCGATAACCAACTTTCCGTCGTCTTCATCTCTGAACGCTCCGGCTCCTCTAGAATTTACCTCTCCGATTCCCCTAATTCCGCCCCTAAATTGCTCCCTTCCGCCCCCGGAAGCTTCTTTAACGACCGACCCATCGTCAGAAGTGGTCGACTCTTTTTCATCTCTGCTCATGAAAATCCCCAAAAGCCCTTCACGAGCTGGTCCGCTCTGTATTCCACTGCGTTGGACGGTGCCGATTCGATTACCCGGCTGACTCCTCCCGGGTCCGTCGATTTCAGTCCGGCGGTTTCGGAAAGTGGTAAGTTTGTGGCGGTTGCTTCTTATGGCTCTCGTTCTTGGGGCGGCGAATTTCACGAGCTCCACACAGAAATTGTTGTTTTTAAGTCTTCTGACCCGGATCGACGGGTTGTGGTTTCCGGTCGGGGTGGATGGCCGTCGTGGTCCGGCGACTCCACTGTCTTCTTCCACCGGAAGGCCGACGATGGGTGGTGGAGCATTTTCAGAGTGGAGATTCCTGAAAATCTTGATTCCTCTGTTTCCCCTGTCCCGACCCGGGTCACTCCAGCGGGTCTCCATTGCTTCACTCCGGCTGCCATGAACGACAGAAAACGACTTGTCGTCGCTACCCGAAGAGCCGACAGTAAATTCCGACACATCGAGATTTTCAATTCCGAATTGAACGAATTCATCCCAATTACCCAGAAATTGAATCCCGATTTCCATCATTACAATCCGTTTGTCTCGCCGGATTCCAACTCCATCGGGTACCACCGATTTCGCGGCGAGTCAGCTCAGGGCGAGTTAACAATTCCATATCTCAACCCGGTAATTTCCCCAATTAATGAACTCCGTATGATCCGAATAAACGGTTCGTTCCCAACGCCGTCACCTGACGGCGATTTAATCGCGTTTAATCCCAATTTCATGGGATTACAAATCGTCAAATCAGACGGCTCAAAATGTCGAACCGTTTTAAAAGATCGAACCGCTTTTTACAATTCGTGGAGCCCAACGGAAAAAAACGTGATTTACACTTCGTTGGGGCCCATTTTCGGGGCGGTAACAGCGACGGTCCAGATCGCTCGGATTACGATCAACTCTGATGATCTCAACGGTGGTCGTAGCGATGAAATTGCCAGTGAAGTGAAGATTCTCACAAAAGACGACACCGGAAACAACGCCTTTCCGGCGTGCTCGCCGGACGGTAAGACTCTGGTCTTCCGATCCGGTCGGTCAGGGCACAAGAATCTTTACATCGTCGACGCCGTGAATGGCGAGTTTGACGGCGAATTACGGCGGCTGACCGACGGACCGTGGATAGACACGATGCCAAGTTGGTCGCCGGCGGGAGATCTCATAGTATTTTCTTCAAACATGCACAATCCAAAGAACACCGAAGCATTTAGTATTTATGTCATCAGACCGGACGGTTCGGGGTTGAGGAGAGTACACGTGGCTGGACCGGAAGGGTCTAGTGACGTGGACAGAGAGAGAATTAACCACGTGTGTTTTAGTAGGGACGGTAAATGGCTTTTATTTACGGCGAATTTGAGTGGGGTGACAGCGGAGCCGGTGTCGTTGCCCAATCAGTTTCAGCCATATGGCGATTTGTTTGTGGTGAAATTGGATGGGACCGGTTTGAGGAGGCTGACGTGTAGTGGATATGAAAATGGAACTCCCATGTGGTATTATGGGAGTGAGCTGGCGCTTTCTGGGTTGAGTTTGAAAGATGAGGTGGTTGGTGAGAAGTTGAAGGGAGAGTTTGATGAACCGCTTTGGATAAAGTTCGATTGA

Coding sequence (CDS)

ATGGATAATCCCACCGGCGCCGTCCTCTTCACCACCATCGGCCTCCAACAATACGGCTTCGACATCTTCTCTGTTCCACTCAATTCTCCCACCACCGAACGCCGCCTCACCGACGGAATCTCCGTCAATTTCAACGCCCAATTCCTCGATAACCAACTTTCCGTCGTCTTCATCTCTGAACGCTCCGGCTCCTCTAGAATTTACCTCTCCGATTCCCCTAATTCCGCCCCTAAATTGCTCCCTTCCGCCCCCGGAAGCTTCTTTAACGACCGACCCATCGTCAGAAGTGGTCGACTCTTTTTCATCTCTGCTCATGAAAATCCCCAAAAGCCCTTCACGAGCTGGTCCGCTCTGTATTCCACTGCGTTGGACGGTGCCGATTCGATTACCCGGCTGACTCCTCCCGGGTCCGTCGATTTCAGTCCGGCGGTTTCGGAAAGTGGTAAGTTTGTGGCGGTTGCTTCTTATGGCTCTCGTTCTTGGGGCGGCGAATTTCACGAGCTCCACACAGAAATTGTTGTTTTTAAGTCTTCTGACCCGGATCGACGGGTTGTGGTTTCCGGTCGGGGTGGATGGCCGTCGTGGTCCGGCGACTCCACTGTCTTCTTCCACCGGAAGGCCGACGATGGGTGGTGGAGCATTTTCAGAGTGGAGATTCCTGAAAATCTTGATTCCTCTGTTTCCCCTGTCCCGACCCGGGTCACTCCAGCGGGTCTCCATTGCTTCACTCCGGCTGCCATGAACGACAGAAAACGACTTGTCGTCGCTACCCGAAGAGCCGACAGTAAATTCCGACACATCGAGATTTTCAATTCCGAATTGAACGAATTCATCCCAATTACCCAGAAATTGAATCCCGATTTCCATCATTACAATCCGTTTGTCTCGCCGGATTCCAACTCCATCGGGTACCACCGATTTCGCGGCGAGTCAGCTCAGGGCGAGTTAACAATTCCATATCTCAACCCGGTAATTTCCCCAATTAATGAACTCCGTATGATCCGAATAAACGGTTCGTTCCCAACGCCGTCACCTGACGGCGATTTAATCGCGTTTAATCCCAATTTCATGGGATTACAAATCGTCAAATCAGACGGCTCAAAATGTCGAACCGTTTTAAAAGATCGAACCGCTTTTTACAATTCGTGGAGCCCAACGGAAAAAAACGTGATTTACACTTCGTTGGGGCCCATTTTCGGGGCGGTAACAGCGACGGTCCAGATCGCTCGGATTACGATCAACTCTGATGATCTCAACGGTGGTCGTAGCGATGAAATTGCCAGTGAAGTGAAGATTCTCACAAAAGACGACACCGGAAACAACGCCTTTCCGGCGTGCTCGCCGGACGGTAAGACTCTGGTCTTCCGATCCGGTCGGTCAGGGCACAAGAATCTTTACATCGTCGACGCCGTGAATGGCGAGTTTGACGGCGAATTACGGCGGCTGACCGACGGACCGTGGATAGACACGATGCCAAGTTGGTCGCCGGCGGGAGATCTCATAGTATTTTCTTCAAACATGCACAATCCAAAGAACACCGAAGCATTTAGTATTTATGTCATCAGACCGGACGGTTCGGGGTTGAGGAGAGTACACGTGGCTGGACCGGAAGGGTCTAGTGACGTGGACAGAGAGAGAATTAACCACGTGTGTTTTAGTAGGGACGGTAAATGGCTTTTATTTACGGCGAATTTGAGTGGGGTGACAGCGGAGCCGGTGTCGTTGCCCAATCAGTTTCAGCCATATGGCGATTTGTTTGTGGTGAAATTGGATGGGACCGGTTTGAGGAGGCTGACGTGTAGTGGATATGAAAATGGAACTCCCATGTGGTATTATGGGAGTGAGCTGGCGCTTTCTGGGTTGAGTTTGAAAGATGAGGTGGTTGGTGAGAAGTTGAAGGGAGAGTTTGATGAACCGCTTTGGATAAAGTTCGATTGA

Protein sequence

MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISERSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYSTALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDPDRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSVSPVPTRVTPAGLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSPDSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFMGLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDDLNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDGELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPEGSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRRLTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD
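The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first 30 and last 12 nucleotides of the CDS shown on this page; the names `CODON_TABLE` and `translate` are illustrative and not part of any database API.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_at_stop=True):
    """Translate a DNA coding sequence in frame 0.

    Unknown codons become 'X'; a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGATAATCCCACCGGCGCCGTCCTCTTC"  # first 30 nt of the CDS above
cds_end = "AAGTTCGATTGA"                      # last 12 nt, ending in the stop codon
print(translate(cds_start))  # MDNPTGAVLF
print(translate(cds_end))    # KFD
```

Both fragments reproduce the corresponding ends of the protein sequence (`MDNPTGAVLF...` and `...KFD`). Passing the full CDS string to `translate` would be the natural next step for checking the whole record.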
Homology
BLAST of HG10023069 vs. NCBI nr
Match: XP_038899648.1 (uncharacterized protein LOC120086912 [Benincasa hispida])

HSP 1 Score: 1246.1 bits (3223), Expect = 0.0e+00
Identity = 600/648 (92.59%), Positives = 618/648 (95.37%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTE RLTDGISVNFNAQFLDNQLSVVFISE
Sbjct: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTEHRLTDGISVNFNAQFLDNQLSVVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGSSRIYLSDSPNSAPKLLPSAPGSFF+DRPIVR+ RLFFISAHENPQKPFTSWSALYS
Sbjct: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFHDRPIVRNDRLFFISAHENPQKPFTSWSALYS 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T+LDG DSITRLTPPGSVDFSPAVS+SGKFVAVASY SRSWGGEFHELHT IVVFKSSDP
Sbjct: 121 TSLDGGDSITRLTPPGSVDFSPAVSKSGKFVAVASYESRSWGGEFHELHTAIVVFKSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLD---SSVSPVPTRVTPA 240
           +RRVVV+GRGGWPSWSGDSTVFFHRKADDGWWSIF+VEIPE LD   SSVSPVP RVTPA
Sbjct: 181 NRRVVVAGRGGWPSWSGDSTVFFHRKADDGWWSIFKVEIPEKLDSSVSSVSPVPIRVTPA 240

Query: 241 GLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSP 300
           GLHCFTPAA+ND KRL VATRRADSKFRHIEIFNSE +EFIPITQKLNPDFHHYNPF+SP
Sbjct: 241 GLHCFTPAALNDSKRLAVATRRADSKFRHIEIFNSESDEFIPITQKLNPDFHHYNPFISP 300

Query: 301 DSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFM 360
           DSNSIGYHRFRGESAQ ELTIPYL+PVISPI ELRMIRINGSFPTPSPDGDLIAFNPNF 
Sbjct: 301 DSNSIGYHRFRGESAQSELTIPYLDPVISPIKELRMIRINGSFPTPSPDGDLIAFNPNFN 360

Query: 361 GLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420
           GLQI+KSDGSKC TVLKDRTAFYNSWSP+EKNVIYTSLGPIFG VT TVQIARITINSDD
Sbjct: 361 GLQIIKSDGSKCWTVLKDRTAFYNSWSPSEKNVIYTSLGPIFGEVTVTVQIARITINSDD 420

Query: 421 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 480
           LN G  DE+ASEVKILTKDDTGNNAFPACSPDGK LVFRSGRSGHKNLYI+DAVNGEFDG
Sbjct: 421 LNSGDGDEVASEVKILTKDDTGNNAFPACSPDGKFLVFRSGRSGHKNLYIIDAVNGEFDG 480

Query: 481 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 540
           ELRRLTDGPWIDTMPSWSP GDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE
Sbjct: 481 ELRRLTDGPWIDTMPSWSPTGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 540

Query: 541 GSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRR 600
           GSSDV RERINHVCFSRDG+WLLFTANLSGVTAEPVS PNQFQPYGDLFVV+LDGTGLRR
Sbjct: 541 GSSDVGRERINHVCFSRDGEWLLFTANLSGVTAEPVSFPNQFQPYGDLFVVRLDGTGLRR 600

Query: 601 LTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           LT +GYENGTP WYYGSELAL GLSLKDEVVGEKLKGEFDEPLWI FD
Sbjct: 601 LTWNGYENGTPTWYYGSELALYGLSLKDEVVGEKLKGEFDEPLWITFD 648
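Each hit in this section starts with two summary lines: a score line (`HSP 1 Score: ... bits (...), Expect = ...`) and a counts line (`Identity = n/N (...), Positives = n/N (...)`). A minimal sketch of how those lines could be parsed and the percentages recomputed is shown below. The function name `parse_hsp` and the returned keys are hypothetical, and the regular expressions are written for the layout seen on this page only, not for BLAST output in general.

```python
import re

HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
# Also accepts the "Postives" spelling that some exports of this report contain.
HSP_COUNTS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp(score_line, counts_line):
    """Extract score, E-value and recomputed percentages for one HSP."""
    m = HSP_SCORE.search(score_line)
    if m is None:
        raise ValueError(f"unrecognized score line: {score_line!r}")
    hsp = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, num, den in HSP_COUNTS.findall(counts_line):
        key = "identity" if label == "Identity" else "positives"
        hsp[key + "_pct"] = round(100 * int(num) / int(den), 2)
        hsp["align_len"] = int(den)  # number of alignment columns
    return hsp


hsp = parse_hsp(
    "HSP 1 Score: 1246.1 bits (3223), Expect = 0.0e+00",
    "Identity = 600/648 (92.59%), Positives = 618/648 (95.37%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])  # 92.59 95.37
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, gives a simple consistency check when many hits are collected into a table.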

BLAST of HG10023069 vs. NCBI nr
Match: KAA0058562.1 (TolB protein-related isoform 1 [Cucumis melo var. makuwa] >TYK10360.1 TolB protein-related isoform 1 [Cucumis melo var. makuwa])

HSP 1 Score: 1205.7 bits (3118), Expect = 0.0e+00
Identity = 586/648 (90.43%), Positives = 610/648 (94.14%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           MDNPTGAVLFTTIGL QYGFDIFSV LNSPT ERRLTDGISVNFNAQFL+NQLSVVFISE
Sbjct: 1   MDNPTGAVLFTTIGLNQYGFDIFSVALNSPTVERRLTDGISVNFNAQFLNNQLSVVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGSSRIYLSDSPNS+PKLL SAPGS F+DRPIVR+GRL FISAHENP KPFTSW+ALYS
Sbjct: 61  RSGSSRIYLSDSPNSSPKLLASAPGSCFHDRPIVRNGRLLFISAHENPHKPFTSWAALYS 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG DSITRLTP GSVDFSPAVSESGKFVAVASYGSRSWGGEFHEL+ EIVVFKSSDP
Sbjct: 121 TPLDGDDSITRLTPLGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELNLEIVVFKSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSV---SPVPTRVTPA 240
           DRRVVV+ RGGWPSWSGDSTVFFHRKA+DGWWSIF+VEIPENLDSS+   SPVP RVTPA
Sbjct: 181 DRRVVVASRGGWPSWSGDSTVFFHRKAEDGWWSIFKVEIPENLDSSMSSDSPVPIRVTPA 240

Query: 241 GLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSP 300
           GLHCFTPAAMND + +VVATRRADSKFRHIEIF+SEL EFIPITQKLNPDFHHYNPFVSP
Sbjct: 241 GLHCFTPAAMNDSRLVVVATRRADSKFRHIEIFDSELEEFIPITQKLNPDFHHYNPFVSP 300

Query: 301 DSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFM 360
           DSN IGYHRFRGES Q EL IPYL+PVISPI EL+MIRINGSFPTPSPDGDLIAFNPNF 
Sbjct: 301 DSNFIGYHRFRGESTQSEL-IPYLSPVISPIKELQMIRINGSFPTPSPDGDLIAFNPNFN 360

Query: 361 GLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420
           GLQIVK DGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD
Sbjct: 361 GLQIVKFDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420

Query: 421 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 480
           L  G SDE++SEVKILTKDDTGNNAFPACSPDGK LVFRSGRSGHKNLYIVDAV GEF+G
Sbjct: 421 LKDGDSDEVSSEVKILTKDDTGNNAFPACSPDGKFLVFRSGRSGHKNLYIVDAVKGEFEG 480

Query: 481 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 540
           ELRRLTDG WIDTMP+WSP GDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRV+VAGPE
Sbjct: 481 ELRRLTDGAWIDTMPNWSPRGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVYVAGPE 540

Query: 541 GSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRR 600
           GSS+VDRERINHVCFSRDGKWLLFTANLSGVTAEPVS PNQFQPYGDLFVV+LDGTGLRR
Sbjct: 541 GSSEVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSWPNQFQPYGDLFVVRLDGTGLRR 600

Query: 601 LTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           LT +GYENGTP WYYGSE+ALSGLSLKDEVVGEKLKG FDEPLWI FD
Sbjct: 601 LTWNGYENGTPTWYYGSEVALSGLSLKDEVVGEKLKGNFDEPLWITFD 647

BLAST of HG10023069 vs. NCBI nr
Match: XP_008461387.1 (PREDICTED: uncharacterized protein LOC103499982 [Cucumis melo])

HSP 1 Score: 1198.7 bits (3100), Expect = 0.0e+00
Identity = 583/648 (89.97%), Positives = 607/648 (93.67%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           MDNPTGAVLFTTIGL QYGFDIFSV LNSPT ERRLTDGISVNFNAQFL+NQLSVVFISE
Sbjct: 1   MDNPTGAVLFTTIGLNQYGFDIFSVALNSPTVERRLTDGISVNFNAQFLNNQLSVVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGSSRIYLSDSPNS+PKLL SAPGS F+DRPIVR+GRL FISAHENP KPFTSW+ALYS
Sbjct: 61  RSGSSRIYLSDSPNSSPKLLASAPGSCFHDRPIVRNGRLLFISAHENPHKPFTSWAALYS 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG DSITRLTP GSVDFSPAVS SGKFVAVASYGSRSWGGEFHEL+ EIVVFKSSDP
Sbjct: 121 TPLDGDDSITRLTPLGSVDFSPAVSGSGKFVAVASYGSRSWGGEFHELNLEIVVFKSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSV---SPVPTRVTPA 240
           DRRVVV+ RGGWPSWSGDSTVFFHRKA+DGWWSIF+VEIPENLDSS+   SPVP RVTPA
Sbjct: 181 DRRVVVASRGGWPSWSGDSTVFFHRKAEDGWWSIFKVEIPENLDSSMSSDSPVPIRVTPA 240

Query: 241 GLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSP 300
           GLHCFTPAAMND + +VVATRRADSKFRHIEIF+SEL EFIPITQKLNPDFHHYNPFVSP
Sbjct: 241 GLHCFTPAAMNDSRLVVVATRRADSKFRHIEIFDSELEEFIPITQKLNPDFHHYNPFVSP 300

Query: 301 DSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFM 360
           DSN IGYHRFRGES Q EL IPYL+PVISPI EL+MIRINGSFP PSPDGDLIAFNPNF 
Sbjct: 301 DSNFIGYHRFRGESTQSEL-IPYLSPVISPIKELQMIRINGSFPAPSPDGDLIAFNPNFN 360

Query: 361 GLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420
           GLQIVK DGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD
Sbjct: 361 GLQIVKFDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420

Query: 421 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 480
           L  G SDE++SEVKILTKDDTGNNAFPACSPDGK LVFRSGRSGHKNLYIVDAV GEF+G
Sbjct: 421 LKDGDSDEVSSEVKILTKDDTGNNAFPACSPDGKFLVFRSGRSGHKNLYIVDAVKGEFEG 480

Query: 481 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 540
           ELRRLTDG WIDTMP+WSP GDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRV+VAGPE
Sbjct: 481 ELRRLTDGAWIDTMPNWSPRGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVYVAGPE 540

Query: 541 GSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRR 600
           GSS+VDRERINHVCFSRDGKWLLFTANLSGVTAEPVS PNQFQPYGDLFVV+LDGTGL R
Sbjct: 541 GSSEVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSWPNQFQPYGDLFVVRLDGTGLSR 600

Query: 601 LTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           LT +GYENGTP WYYGSE+ALSGLSLKDEVVGEKLKG FDEPLWI FD
Sbjct: 601 LTWNGYENGTPTWYYGSEVALSGLSLKDEVVGEKLKGNFDEPLWITFD 647

BLAST of HG10023069 vs. NCBI nr
Match: XP_004135965.1 (uncharacterized protein LOC101214858 [Cucumis sativus] >KGN45063.1 hypothetical protein Csa_015982 [Cucumis sativus])

HSP 1 Score: 1181.0 bits (3054), Expect = 0.0e+00
Identity = 572/648 (88.27%), Positives = 607/648 (93.67%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           MDNPTGAVLFTT+GL QYGFDIFSVPLNS T ER+LTDGISVNFNAQFL+NQLSVVFISE
Sbjct: 1   MDNPTGAVLFTTVGLNQYGFDIFSVPLNSLTVERQLTDGISVNFNAQFLNNQLSVVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGSSRIYLSDSPNS+PKLL SAPGS F+DRPIV +GRL FISAHENP KPFTSW+ALYS
Sbjct: 61  RSGSSRIYLSDSPNSSPKLLASAPGSCFHDRPIVTNGRLLFISAHENPHKPFTSWAALYS 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG DS+TRLTP GSVDFSPAVSESGKFVAVASYGSRSWGGEFHEL+ EIVVFKSSDP
Sbjct: 121 TPLDGHDSVTRLTPLGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELNLEIVVFKSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLD---SSVSPVPTRVTPA 240
            +RVVV+GRGGWPSWSGDSTVFFHRKADDGWWSIF+VEIPENLD   SSVSPV  RVTPA
Sbjct: 181 GQRVVVAGRGGWPSWSGDSTVFFHRKADDGWWSIFKVEIPENLDSSRSSVSPVAIRVTPA 240

Query: 241 GLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSP 300
           GLHCFTPAAMND +R+VVATRRADSK+RHIEIF+SEL EFIPITQKLNP+FHHYNPFVSP
Sbjct: 241 GLHCFTPAAMNDGRRVVVATRRADSKYRHIEIFDSELEEFIPITQKLNPEFHHYNPFVSP 300

Query: 301 DSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFM 360
           DSN IGYHRFRGES Q EL IPYL PVISPI EL++IR+NGSFPTPSPDGDLIAFNP F+
Sbjct: 301 DSNFIGYHRFRGESTQSEL-IPYLYPVISPIKELQIIRVNGSFPTPSPDGDLIAFNPGFI 360

Query: 361 GLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420
           GLQIVK DGSKCRTVLKDRTAF NSWSPTEKNVIYTSLGPIFGAVTATVQIARITINS D
Sbjct: 361 GLQIVKFDGSKCRTVLKDRTAFCNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSGD 420

Query: 421 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 480
                SDE+++EVKILTKD+TGNNAFPACSPDGK LVFRSGR+GHKNLYIVDA+ GEF+G
Sbjct: 421 -----SDEVSNEVKILTKDNTGNNAFPACSPDGKFLVFRSGRTGHKNLYIVDAMKGEFEG 480

Query: 481 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 540
           ELR+LTDGPWIDTMP+WSP GDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRV+VAGPE
Sbjct: 481 ELRQLTDGPWIDTMPNWSPRGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVYVAGPE 540

Query: 541 GSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRR 600
           GSS+VDRERINHVCFSRDG WLLFTANLSGVTAEPVSLPNQFQPYGDLFVV+LDGTGLRR
Sbjct: 541 GSSEVDRERINHVCFSRDGNWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVRLDGTGLRR 600

Query: 601 LTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           LTC+ YENGTP WYYGSELALSGLSLKDEVVGEKLKG+FDEPLWIKFD
Sbjct: 601 LTCNAYENGTPTWYYGSELALSGLSLKDEVVGEKLKGDFDEPLWIKFD 642

BLAST of HG10023069 vs. NCBI nr
Match: XP_023547812.1 (uncharacterized protein LOC111806666 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1179.1 bits (3049), Expect = 0.0e+00
Identity = 558/645 (86.51%), Positives = 598/645 (92.71%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           M NPTG V+FTT+G  QYGFD FSVPLNSPTTE  LTDGISVNFNAQF+DNQLS+VFISE
Sbjct: 1   MTNPTGTVIFTTVGRTQYGFDTFSVPLNSPTTEHCLTDGISVNFNAQFVDNQLSIVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGS R+YLS+SPNSAPKLLPSAPGS F+DRPI+R+ RL+FISAHENP KPFTSWSALY 
Sbjct: 61  RSGSPRVYLSNSPNSAPKLLPSAPGSCFHDRPIIRNDRLYFISAHENPHKPFTSWSALYF 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG+DS+TRLTP GSVDFSPAVSESGKFVAVASYGSRSWGGEF ELHTEIVVF+SSDP
Sbjct: 121 TGLDGSDSVTRLTPRGSVDFSPAVSESGKFVAVASYGSRSWGGEFQELHTEIVVFRSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSVSPVPTRVTPAGLH 240
           DRRVVVSGRGGWPSWSGDSTV+FHR+A+DGWWSIFRVEIPENLDSSV PVP RVTPAGLH
Sbjct: 181 DRRVVVSGRGGWPSWSGDSTVYFHRQAEDGWWSIFRVEIPENLDSSVPPVPIRVTPAGLH 240

Query: 241 CFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSPDSN 300
           CFTPAAMNDRKR+VVATRR D+KFRHIEI+NS  +EF PITQKLNP FHHYNPFVSPDSN
Sbjct: 241 CFTPAAMNDRKRVVVATRRPDNKFRHIEIYNSGTDEFDPITQKLNPSFHHYNPFVSPDSN 300

Query: 301 SIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFMGLQ 360
            IGYHRFRGES+ GELTIP+   +ISPINELRMIRINGSFPT SPDG+ IAFNP+F+GL+
Sbjct: 301 YIGYHRFRGESSHGELTIPHFERIISPINELRMIRINGSFPTQSPDGNFIAFNPDFVGLR 360

Query: 361 IVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDDLNG 420
           IVK+DGSKC TVLKDRTAFYNSWSPTEKNVIY+SLGPIFG   ATVQIAR TINSDDLN 
Sbjct: 361 IVKADGSKCVTVLKDRTAFYNSWSPTEKNVIYSSLGPIFGPARATVQIARTTINSDDLNN 420

Query: 421 GRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDGELR 480
           G SDE+A EVKILTK+DTGNNAFPACSPDGK LVFRSGRSGHKNLYI+DAVNG+F GE R
Sbjct: 421 GDSDEVAGEVKILTKEDTGNNAFPACSPDGKFLVFRSGRSGHKNLYIIDAVNGDFGGEAR 480

Query: 481 RLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPEGSS 540
           RLTDGPWIDTMPSWSPAGDLI FSSNMHNPKNTEAFSIYVIRPDGS LRRVHVAGPEGSS
Sbjct: 481 RLTDGPWIDTMPSWSPAGDLIAFSSNMHNPKNTEAFSIYVIRPDGSNLRRVHVAGPEGSS 540

Query: 541 DVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRRLTC 600
           DVD+ERINHVCFSRDG+WLLFT+NL GV+AEPVS+PNQFQPYGDLFVV+LDGTGLRRLT 
Sbjct: 541 DVDKERINHVCFSRDGEWLLFTSNLGGVSAEPVSMPNQFQPYGDLFVVRLDGTGLRRLTW 600

Query: 601 SGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           S YENGTP WYYGSELALSGLSLKDEVVGEKLKGEFDEPLWI F+
Sbjct: 601 SAYENGTPTWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWITFN 645

BLAST of HG10023069 vs. ExPASy Swiss-Prot
Match: A8LHQ6 (Tol-Pal system protein TolB OS=Dinoroseobacter shibae (strain DSM 16493 / NCIMB 14021 / DFL 12) OX=398580 GN=tolB PE=3 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 5.4e-15
Identity = 63/196 (32.14%), Positives = 89/196 (45.41%), Query Frame = 0

Query: 419 NGGRSD----EIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGE 478
           NGG SD    +++S  +            P+ SPDG+ +VF S RSG + LY++ A    
Sbjct: 277 NGGNSDIYRRDLSSGAQTRLTATPAIETAPSFSPDGRQIVFESDRSGSQQLYVMSATG-- 336

Query: 479 FDGELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVA 538
             GE RR++ GP     P WSP GDLI F+      +N   F I V+R DGS  R +   
Sbjct: 337 --GEARRISFGPGRYGTPVWSPRGDLIAFTK-----QNQGRFHIGVMRTDGSEERLL--- 396

Query: 539 GPEGSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTG 598
               SS +D        +S +G+ ++FT   SG    P            L+ V + G  
Sbjct: 397 ---TSSFLDES----PTWSPNGRVIMFTRETSGAGGAP-----------SLYSVDISGRN 441

Query: 599 LRRLTCSGYENGTPMW 611
           LRR+   G  +  P W
Sbjct: 457 LRRVPTPGAAS-DPAW 441

BLAST of HG10023069 vs. ExPASy Swiss-Prot
Match: Q3APB5 (Protein TolB homolog OS=Chlorobium chlorochromatii (strain CaD3) OX=340177 GN=tolB PE=3 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 9.1e-15
Identity = 56/184 (30.43%), Positives = 83/184 (45.11%), Query Frame = 0

Query: 427 ASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDGELRRLTDGP 486
           A  VK      +G +  P  SPDG+ + F S RSG+  +++ D  +    G+ +RLT   
Sbjct: 283 AGAVKQRLTSSSGIDLSPTFSPDGRKMAFVSARSGNPQIFVYDFSS----GKSQRLTFSG 342

Query: 487 WIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPEGSSDVDRER 546
             +T P+WSP GD I FS+     ++    +I+VI  DGSGL ++     E  S      
Sbjct: 343 RYNTQPAWSPIGDKIAFST----WESGGEINIFVINTDGSGLTQLTTQSGENESP----- 402

Query: 547 INHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRRLTCSGYENG 606
                +S DG+ ++F +N  GV                L+V+  DG   R L   G E  
Sbjct: 403 ----SWSPDGRMIVFASNRQGVK--------------KLYVMMADGKNQRPLLAIGGEQT 435

Query: 607 TPMW 611
            P W
Sbjct: 463 QPSW 435

BLAST of HG10023069 vs. ExPASy Swiss-Prot
Match: Q0AC40 (Tol-Pal system protein TolB OS=Alkalilimnicola ehrlichii (strain ATCC BAA-1101 / DSM 17681 / MLHE-1) OX=187272 GN=tolB PE=3 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 3.5e-14
Identity = 72/252 (28.57%), Positives = 108/252 (42.86%), Query Frame = 0

Query: 359 LQIVKSDGSKCRTVLKDRTAFYN-SWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 418
           L +  +DG + R +L+ R    + +WSP    + Y S            +  R  I   +
Sbjct: 183 LMVADADGQRPREILESRQPIMSPAWSPERDRLAYVSF-----------EGGRSEIYVQN 242

Query: 419 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 478
           L  G+ D IAS          G N+ PA SPDG+ L     R G  N+Y++       DG
Sbjct: 243 LRSGQRDRIAS--------FRGINSAPAWSPDGRHLAVTLSRDGRANIYLLRL----DDG 302

Query: 479 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDG-SGLRRV----- 538
            +RRLTD   IDT P++SP G+ I F+S+           +Y +  +G  G+ RV     
Sbjct: 303 HVRRLTDHWAIDTEPTFSPDGERIAFTSDRGGRP-----QVYTLAVNGPGGVERVTFDGD 362

Query: 539 HVAGPEGSSDVDRERINHVCFSRDGKWLLFTANLSG-----VTAEPVSLPNQFQPYGDLF 598
           + A P  S D  R  + H     +G + +   +L       +T  P      F P GD+ 
Sbjct: 363 YNARPNWSPDGRRIAMVH---RHNGSFRIAVHDLESDRTRVLTDGPWDESPVFAPNGDMI 403

BLAST of HG10023069 vs. ExPASy Swiss-Prot
Match: Q167Z6 (Tol-Pal system protein TolB OS=Roseobacter denitrificans (strain ATCC 33942 / OCh 114) OX=375451 GN=tolB PE=3 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 2.2e-13
Identity = 57/166 (34.34%), Positives = 84/166 (50.60%), Query Frame = 0

Query: 417 DLNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFD 476
           ++NGG +  + S   I T         P+ SPDG  +VF S RSG + LY++ A NG   
Sbjct: 284 NVNGGSATRLTSAPSIETA--------PSYSPDGTQIVFESDRSGSQQLYVMPA-NG--- 343

Query: 477 GELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAG- 536
           GE RR++ GP     P WSP GDL+ F+      +N   F I V+R DGS  R +  +  
Sbjct: 344 GEARRISFGPGRYGTPVWSPRGDLVAFTK-----QNAGRFHIGVMRLDGSEERLLTASFL 403

Query: 537 PEGSSDVDRERINHVCFSRD-----GKWLLFTANLSGVTAEPVSLP 577
            EG +     R+  + FSR+     G+  L++ +++G    PV  P
Sbjct: 404 DEGPTWSPNGRV--IMFSRETQGAQGRATLYSVDITGRNLRPVRTP 430

BLAST of HG10023069 vs. ExPASy Swiss-Prot
Match: B2JGA0 (Tol-Pal system protein TolB OS=Paraburkholderia phymatum (strain DSM 17167 / CIP 108236 / LMG 21445 / STM815) OX=391038 GN=tolB PE=3 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 1.1e-12
Identity = 64/210 (30.48%), Positives = 90/210 (42.86%), Query Frame = 0

Query: 359 LQIVKSDGSKCRTVLKDRTAFYN-SWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 418
           LQI  SDG      L       + +WSP    V Y S            +  +  +   D
Sbjct: 178 LQISDSDGQDAHIALSSPEPIISPAWSPDGTKVAYVSF-----------EKKKPIVYIHD 237

Query: 419 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 478
           L  GR        +++  D  GNN+ PA SPDG+TL     R+G+  ++   AVN +  G
Sbjct: 238 LPTGR--------RVVVSDQKGNNSAPAWSPDGRTLAVALSRTGNTQIF---AVNADGSG 297

Query: 479 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 538
            LRRLT G  IDT PS+SP G  I F+S+       + + +        G +RV   G  
Sbjct: 298 -LRRLTQGSSIDTEPSYSPDGQSIYFTSDRGG--QPQIYKMPASGEASGGAQRVTFTGNY 353

Query: 539 GSSDVDRERINHVCFSRDGKWLLFTANLSG 568
            +S     R+     S DGK L + +   G
Sbjct: 358 NTS----PRV-----SPDGKQLAYISRTGG 353

BLAST of HG10023069 vs. ExPASy TrEMBL
Match: A0A5A7URS2 (TolB protein-related isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G00020 PE=4 SV=1)

HSP 1 Score: 1205.7 bits (3118), Expect = 0.0e+00
Identity = 586/648 (90.43%), Positives = 610/648 (94.14%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           MDNPTGAVLFTTIGL QYGFDIFSV LNSPT ERRLTDGISVNFNAQFL+NQLSVVFISE
Sbjct: 1   MDNPTGAVLFTTIGLNQYGFDIFSVALNSPTVERRLTDGISVNFNAQFLNNQLSVVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGSSRIYLSDSPNS+PKLL SAPGS F+DRPIVR+GRL FISAHENP KPFTSW+ALYS
Sbjct: 61  RSGSSRIYLSDSPNSSPKLLASAPGSCFHDRPIVRNGRLLFISAHENPHKPFTSWAALYS 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG DSITRLTP GSVDFSPAVSESGKFVAVASYGSRSWGGEFHEL+ EIVVFKSSDP
Sbjct: 121 TPLDGDDSITRLTPLGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELNLEIVVFKSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSV---SPVPTRVTPA 240
           DRRVVV+ RGGWPSWSGDSTVFFHRKA+DGWWSIF+VEIPENLDSS+   SPVP RVTPA
Sbjct: 181 DRRVVVASRGGWPSWSGDSTVFFHRKAEDGWWSIFKVEIPENLDSSMSSDSPVPIRVTPA 240

Query: 241 GLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSP 300
           GLHCFTPAAMND + +VVATRRADSKFRHIEIF+SEL EFIPITQKLNPDFHHYNPFVSP
Sbjct: 241 GLHCFTPAAMNDSRLVVVATRRADSKFRHIEIFDSELEEFIPITQKLNPDFHHYNPFVSP 300

Query: 301 DSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFM 360
           DSN IGYHRFRGES Q EL IPYL+PVISPI EL+MIRINGSFPTPSPDGDLIAFNPNF 
Sbjct: 301 DSNFIGYHRFRGESTQSEL-IPYLSPVISPIKELQMIRINGSFPTPSPDGDLIAFNPNFN 360

Query: 361 GLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420
           GLQIVK DGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD
Sbjct: 361 GLQIVKFDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420

Query: 421 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 480
           L  G SDE++SEVKILTKDDTGNNAFPACSPDGK LVFRSGRSGHKNLYIVDAV GEF+G
Sbjct: 421 LKDGDSDEVSSEVKILTKDDTGNNAFPACSPDGKFLVFRSGRSGHKNLYIVDAVKGEFEG 480

Query: 481 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 540
           ELRRLTDG WIDTMP+WSP GDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRV+VAGPE
Sbjct: 481 ELRRLTDGAWIDTMPNWSPRGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVYVAGPE 540

Query: 541 GSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRR 600
           GSS+VDRERINHVCFSRDGKWLLFTANLSGVTAEPVS PNQFQPYGDLFVV+LDGTGLRR
Sbjct: 541 GSSEVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSWPNQFQPYGDLFVVRLDGTGLRR 600

Query: 601 LTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           LT +GYENGTP WYYGSE+ALSGLSLKDEVVGEKLKG FDEPLWI FD
Sbjct: 601 LTWNGYENGTPTWYYGSEVALSGLSLKDEVVGEKLKGNFDEPLWITFD 647

BLAST of HG10023069 vs. ExPASy TrEMBL
Match: A0A1S3CEJ8 (uncharacterized protein LOC103499982 OS=Cucumis melo OX=3656 GN=LOC103499982 PE=4 SV=1)

HSP 1 Score: 1198.7 bits (3100), Expect = 0.0e+00
Identity = 583/648 (89.97%), Positives = 607/648 (93.67%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           MDNPTGAVLFTTIGL QYGFDIFSV LNSPT ERRLTDGISVNFNAQFL+NQLSVVFISE
Sbjct: 1   MDNPTGAVLFTTIGLNQYGFDIFSVALNSPTVERRLTDGISVNFNAQFLNNQLSVVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGSSRIYLSDSPNS+PKLL SAPGS F+DRPIVR+GRL FISAHENP KPFTSW+ALYS
Sbjct: 61  RSGSSRIYLSDSPNSSPKLLASAPGSCFHDRPIVRNGRLLFISAHENPHKPFTSWAALYS 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG DSITRLTP GSVDFSPAVS SGKFVAVASYGSRSWGGEFHEL+ EIVVFKSSDP
Sbjct: 121 TPLDGDDSITRLTPLGSVDFSPAVSGSGKFVAVASYGSRSWGGEFHELNLEIVVFKSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSV---SPVPTRVTPA 240
           DRRVVV+ RGGWPSWSGDSTVFFHRKA+DGWWSIF+VEIPENLDSS+   SPVP RVTPA
Sbjct: 181 DRRVVVASRGGWPSWSGDSTVFFHRKAEDGWWSIFKVEIPENLDSSMSSDSPVPIRVTPA 240

Query: 241 GLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSP 300
           GLHCFTPAAMND + +VVATRRADSKFRHIEIF+SEL EFIPITQKLNPDFHHYNPFVSP
Sbjct: 241 GLHCFTPAAMNDSRLVVVATRRADSKFRHIEIFDSELEEFIPITQKLNPDFHHYNPFVSP 300

Query: 301 DSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFM 360
           DSN IGYHRFRGES Q EL IPYL+PVISPI EL+MIRINGSFP PSPDGDLIAFNPNF 
Sbjct: 301 DSNFIGYHRFRGESTQSEL-IPYLSPVISPIKELQMIRINGSFPAPSPDGDLIAFNPNFN 360

Query: 361 GLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420
           GLQIVK DGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD
Sbjct: 361 GLQIVKFDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420

Query: 421 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 480
           L  G SDE++SEVKILTKDDTGNNAFPACSPDGK LVFRSGRSGHKNLYIVDAV GEF+G
Sbjct: 421 LKDGDSDEVSSEVKILTKDDTGNNAFPACSPDGKFLVFRSGRSGHKNLYIVDAVKGEFEG 480

Query: 481 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 540
           ELRRLTDG WIDTMP+WSP GDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRV+VAGPE
Sbjct: 481 ELRRLTDGAWIDTMPNWSPRGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVYVAGPE 540

Query: 541 GSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRR 600
           GSS+VDRERINHVCFSRDGKWLLFTANLSGVTAEPVS PNQFQPYGDLFVV+LDGTGL R
Sbjct: 541 GSSEVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSWPNQFQPYGDLFVVRLDGTGLSR 600

Query: 601 LTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           LT +GYENGTP WYYGSE+ALSGLSLKDEVVGEKLKG FDEPLWI FD
Sbjct: 601 LTWNGYENGTPTWYYGSEVALSGLSLKDEVVGEKLKGNFDEPLWITFD 647

BLAST of HG10023069 vs. ExPASy TrEMBL
Match: A0A0A0K6G2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G419590 PE=4 SV=1)

HSP 1 Score: 1181.0 bits (3054), Expect = 0.0e+00
Identity = 572/648 (88.27%), Positives = 607/648 (93.67%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           MDNPTGAVLFTT+GL QYGFDIFSVPLNS T ER+LTDGISVNFNAQFL+NQLSVVFISE
Sbjct: 1   MDNPTGAVLFTTVGLNQYGFDIFSVPLNSLTVERQLTDGISVNFNAQFLNNQLSVVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGSSRIYLSDSPNS+PKLL SAPGS F+DRPIV +GRL FISAHENP KPFTSW+ALYS
Sbjct: 61  RSGSSRIYLSDSPNSSPKLLASAPGSCFHDRPIVTNGRLLFISAHENPHKPFTSWAALYS 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG DS+TRLTP GSVDFSPAVSESGKFVAVASYGSRSWGGEFHEL+ EIVVFKSSDP
Sbjct: 121 TPLDGHDSVTRLTPLGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELNLEIVVFKSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLD---SSVSPVPTRVTPA 240
            +RVVV+GRGGWPSWSGDSTVFFHRKADDGWWSIF+VEIPENLD   SSVSPV  RVTPA
Sbjct: 181 GQRVVVAGRGGWPSWSGDSTVFFHRKADDGWWSIFKVEIPENLDSSRSSVSPVAIRVTPA 240

Query: 241 GLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSP 300
           GLHCFTPAAMND +R+VVATRRADSK+RHIEIF+SEL EFIPITQKLNP+FHHYNPFVSP
Sbjct: 241 GLHCFTPAAMNDGRRVVVATRRADSKYRHIEIFDSELEEFIPITQKLNPEFHHYNPFVSP 300

Query: 301 DSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFM 360
           DSN IGYHRFRGES Q EL IPYL PVISPI EL++IR+NGSFPTPSPDGDLIAFNP F+
Sbjct: 301 DSNFIGYHRFRGESTQSEL-IPYLYPVISPIKELQIIRVNGSFPTPSPDGDLIAFNPGFI 360

Query: 361 GLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDD 420
           GLQIVK DGSKCRTVLKDRTAF NSWSPTEKNVIYTSLGPIFGAVTATVQIARITINS D
Sbjct: 361 GLQIVKFDGSKCRTVLKDRTAFCNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSGD 420

Query: 421 LNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDG 480
                SDE+++EVKILTKD+TGNNAFPACSPDGK LVFRSGR+GHKNLYIVDA+ GEF+G
Sbjct: 421 -----SDEVSNEVKILTKDNTGNNAFPACSPDGKFLVFRSGRTGHKNLYIVDAMKGEFEG 480

Query: 481 ELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPE 540
           ELR+LTDGPWIDTMP+WSP GDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRV+VAGPE
Sbjct: 481 ELRQLTDGPWIDTMPNWSPRGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVYVAGPE 540

Query: 541 GSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRR 600
           GSS+VDRERINHVCFSRDG WLLFTANLSGVTAEPVSLPNQFQPYGDLFVV+LDGTGLRR
Sbjct: 541 GSSEVDRERINHVCFSRDGNWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVRLDGTGLRR 600

Query: 601 LTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           LTC+ YENGTP WYYGSELALSGLSLKDEVVGEKLKG+FDEPLWIKFD
Sbjct: 601 LTCNAYENGTPTWYYGSELALSGLSLKDEVVGEKLKGDFDEPLWIKFD 642
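
The percentages in an HSP header are the match counts divided by the alignment length. The following minimal Python sketch recomputes them from the A0A0A0K6G2 header values above. The function name `check_hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches fields of the form "Label = matches/length (percent%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_percentages(header: str) -> list[tuple[str, float, float]]:
    """Return (label, reported %, recomputed %) for each count field."""
    results = []
    for label, matches, length, reported in HSP_STAT.findall(header):
        recomputed = round(100.0 * int(matches) / int(length), 2)
        results.append((label, float(reported), recomputed))
    return results


# Header values copied from the A0A0A0K6G2 hit.
line = "Identity = 572/648 (88.27%), Positives = 607/648 (93.67%), Query Frame = 0"
for label, reported, recomputed in check_hsp_percentages(line):
    print(label, reported, recomputed)
```

The `Query Frame = 0` field is skipped because it has no `matches/length` part.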

BLAST of HG10023069 vs. ExPASy TrEMBL
Match: A0A6J1GRK1 (uncharacterized protein LOC111456480 OS=Cucurbita moschata OX=3662 GN=LOC111456480 PE=4 SV=1)

HSP 1 Score: 1168.7 bits (3022), Expect = 0.0e+00
Identity = 552/645 (85.58%), Positives = 596/645 (92.40%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           M NPTG V+FTT+G  QYGFD FSVPL+SPTTE  LTDGISVNFNAQF+DNQ S+VFISE
Sbjct: 1   MTNPTGTVIFTTVGRTQYGFDTFSVPLSSPTTEHCLTDGISVNFNAQFVDNQQSIVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGS R+YLS+SPNSAPKLLPSAPGS F+DRPI+R+ RL+FISAHENP KPFTSWSALY 
Sbjct: 61  RSGSPRVYLSNSPNSAPKLLPSAPGSCFHDRPIIRNDRLYFISAHENPHKPFTSWSALYF 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG+DS+TRLTP GSVDFSPAVSESGKFVAVASYGSRSWGGEF ELHTEIVVF+SSDP
Sbjct: 121 TGLDGSDSVTRLTPCGSVDFSPAVSESGKFVAVASYGSRSWGGEFQELHTEIVVFRSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSVSPVPTRVTPAGLH 240
           DRRVVVSGRGGWPSWSGDSTV+FHR+A+DGWWSIFRVEIPENLDSSV PVP RVTPAGLH
Sbjct: 181 DRRVVVSGRGGWPSWSGDSTVYFHRQAEDGWWSIFRVEIPENLDSSVPPVPIRVTPAGLH 240

Query: 241 CFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSPDSN 300
           CFTPAAMNDRKR+VVATRR D+KFRHIEI+NS  +EF PITQKLNP FHHYNPFVSPDS 
Sbjct: 241 CFTPAAMNDRKRVVVATRRPDNKFRHIEIYNSGTDEFDPITQKLNPSFHHYNPFVSPDSK 300

Query: 301 SIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFMGLQ 360
            IGYHRFRGES+ GELTIP+   +ISPINELR+IRINGSFPT SPDG+ IAFNP+F+GL+
Sbjct: 301 YIGYHRFRGESSHGELTIPHFERIISPINELRLIRINGSFPTQSPDGNFIAFNPDFVGLK 360

Query: 361 IVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDDLNG 420
           IVK+DGSKC TVLKDRTAFYNSWSPTEKNVIY+SLGPIFG   ATVQIAR TINSDDLN 
Sbjct: 361 IVKADGSKCWTVLKDRTAFYNSWSPTEKNVIYSSLGPIFGPARATVQIARTTINSDDLNN 420

Query: 421 GRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDGELR 480
           G SDE+A EVKILTK+DTGNNAFPACSPDGK LVFRSGRSGHKNLYI+DAVNG+F+GE R
Sbjct: 421 GDSDEVAGEVKILTKEDTGNNAFPACSPDGKFLVFRSGRSGHKNLYIIDAVNGDFNGEAR 480

Query: 481 RLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPEGSS 540
           RLTDGPWIDTMPSWSPAGDLI FSSNMHNP+NTE FSIYVIRPDGS LRRVHVAGPEGSS
Sbjct: 481 RLTDGPWIDTMPSWSPAGDLIAFSSNMHNPENTETFSIYVIRPDGSNLRRVHVAGPEGSS 540

Query: 541 DVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRRLTC 600
           DVD+ERINHVCFSRDG+WLLFT+NL GV+AEPVS+PNQFQPYGDLFVV+LDGTGLRRLT 
Sbjct: 541 DVDKERINHVCFSRDGEWLLFTSNLGGVSAEPVSMPNQFQPYGDLFVVRLDGTGLRRLTW 600

Query: 601 SGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           S YENGTP WYYGSELALSGLSLKDEVVGEKLKGEFDEPLWI F+
Sbjct: 601 SAYENGTPTWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWITFN 645
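
In each alignment block, the middle line repeats the residue where query and subject are identical, shows `+` for a substitution that scores positively under the substitution matrix, and is blank otherwise. The Python sketch below assumes that convention and counts both for the last block of the A0A6J1GRK1 alignment above. The function names are hypothetical.

```python
def count_midline(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identities plus the '+' (conservative) positions.
    positives = identities + midline.count("+")
    return identities, positives


def count_identities(query: str, subject: str) -> int:
    """Count identical, non-gap residue pairs in two aligned rows."""
    return sum(q == s and q != "-" for q, s in zip(query, subject))


# Last block of the A0A6J1GRK1 alignment, without coordinates.
query   = "SGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD"
midline = "S YENGTP WYYGSELALSGLSLKDEVVGEKLKGEFDEPLWI F+"
subject = "SAYENGTPTWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWITFN"

identities, positives = count_midline(midline)
# The midline count should agree with a direct residue-by-residue comparison.
assert identities == count_identities(query, subject)
print(identities, positives, len(query))
```

Summed over every block of an HSP, these per-block counts should reproduce the Identity and Positives numerators in the HSP header.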

BLAST of HG10023069 vs. ExPASy TrEMBL
Match: A0A6J1JTR3 (uncharacterized protein LOC111488250 OS=Cucurbita maxima OX=3661 GN=LOC111488250 PE=4 SV=1)

HSP 1 Score: 1159.4 bits (2998), Expect = 0.0e+00
Identity = 546/645 (84.65%), Positives = 595/645 (92.25%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFLDNQLSVVFISE 60
           M NPTG V+FTT+G  QYGFD FSVPL+SPTTE  LTDGISVNFNAQF+DNQ S+VFISE
Sbjct: 1   MTNPTGTVIFTTVGRTQYGFDTFSVPLSSPTTEHCLTDGISVNFNAQFVDNQQSIVFISE 60

Query: 61  RSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYS 120
           RSGS R+YLS+SPNSAPKLLPSAPGS F+DRPI+R+ RL+FISAHENP KPFTSWSALY 
Sbjct: 61  RSGSPRVYLSNSPNSAPKLLPSAPGSCFHDRPIIRNDRLYFISAHENPHKPFTSWSALYV 120

Query: 121 TALDGADSITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDP 180
           T LDG+ S+TRLTP GSVDFSPAVSESGKFVAVASYGSRSWGGEF ELHTEIVVF+SSDP
Sbjct: 121 TGLDGSVSVTRLTPRGSVDFSPAVSESGKFVAVASYGSRSWGGEFQELHTEIVVFRSSDP 180

Query: 181 DRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSVSPVPTRVTPAGLH 240
           DRRV+VSGRGGWPSWSGDSTVFFHR+A+DGWWSIFR EIPENLDSSV PVP RVTPAGLH
Sbjct: 181 DRRVIVSGRGGWPSWSGDSTVFFHRQAEDGWWSIFRGEIPENLDSSVPPVPIRVTPAGLH 240

Query: 241 CFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSPDSN 300
           CFTPAAMNDRKR+VVATRR D+KFRHIEI+NS  +EF PITQKLNP FHHYNPFVSPDSN
Sbjct: 241 CFTPAAMNDRKRVVVATRRPDNKFRHIEIYNSGTDEFDPITQKLNPSFHHYNPFVSPDSN 300

Query: 301 SIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFMGLQ 360
            IGYHRFRGES+ GE+TIP+   +ISPINELR+IRINGSFPT SPDG+ IAFNP+F+GL+
Sbjct: 301 YIGYHRFRGESSHGEITIPHFERIISPINELRLIRINGSFPTQSPDGNFIAFNPDFVGLK 360

Query: 361 IVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDDLNG 420
           IVK+DGSKC TVLKDRTAFYNSWSPTEKNVIY+SLGPIFG   ATVQIAR TINSDDLN 
Sbjct: 361 IVKADGSKCWTVLKDRTAFYNSWSPTEKNVIYSSLGPIFGPARATVQIARTTINSDDLNN 420

Query: 421 GRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDGELR 480
           G +DE++ EVKILTK+DTGNNAFPACSPDGK LVFRSGRSGHKNLYI+DAVNG+F+GE R
Sbjct: 421 GDNDEVSGEVKILTKEDTGNNAFPACSPDGKFLVFRSGRSGHKNLYIIDAVNGDFNGEAR 480

Query: 481 RLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPEGSS 540
           +LT+GPWIDTMPSWSPAGDLI FSSNMHNP+NTE FSIYVIRPDGS LRRVHVAGPEGSS
Sbjct: 481 QLTNGPWIDTMPSWSPAGDLIAFSSNMHNPENTETFSIYVIRPDGSNLRRVHVAGPEGSS 540

Query: 541 DVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRRLTC 600
           DVD+ERINHVCFSRDG+WLLFT+NL GV+AEPVS+PNQFQPYGDLFVV+LDGTGLRRLT 
Sbjct: 541 DVDKERINHVCFSRDGEWLLFTSNLGGVSAEPVSMPNQFQPYGDLFVVRLDGTGLRRLTW 600

Query: 601 SGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           S YENGTP WYYGSELALSGLSLKDEVVGEKLKGEFDEPLWI F+
Sbjct: 601 SAYENGTPTWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWITFN 645

BLAST of HG10023069 vs. TAIR 10
Match: AT4G01870.1 (tolB protein-related )

HSP 1 Score: 873.6 bits (2256), Expect = 9.7e-254
Identity = 415/654 (63.46%), Positives = 516/654 (78.90%), Query Frame = 0

Query: 1   MDNPTGAVLFTTIGLQQYGFDIFSVPLNSPTTERRLTDGISVNFNAQFL-DNQLSVVFIS 60
           M+ P G ++FTT+G   YGFD+FS+ + + + ERRLTDG+SVNFNAQF+ D    VVF+S
Sbjct: 1   METPKGTIIFTTVGRTHYGFDVFSLNI-ATSVERRLTDGVSVNFNAQFVNDKSDDVVFVS 60

Query: 61  ERSGSSRIYLSDSPNSAPKLLPSAPGSFFNDRPIV-RSGRLFFISAHENPQKPFTSWSAL 120
           ER+GS+RIY + S  S P+ +P AP S+F+DRPI+ ++ RL+FISAHE P + F +WSAL
Sbjct: 61  ERNGSARIYKTRSGISKPEQIPGAPESYFHDRPIITQNNRLYFISAHEQPDRYFKNWSAL 120

Query: 121 YSTALDGAD-SITRLTPPGSVDFSPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKS 180
           Y+  L+ A   +TR+TPP + DFSPAVS+SG F+AVASYG+RSWGGEFHE++T+I VFK+
Sbjct: 121 YTVELNSAKREVTRVTPPDTADFSPAVSQSGDFLAVASYGTRSWGGEFHEINTDITVFKA 180

Query: 181 SDPDRRVVVSGRGGWPSWSGDSTVFFHRKADDGWWSIFRVEIPENLDSSVS-PV-PTRVT 240
           S P+ RVV+  RGGWP+WSGDSTVFFH +ADDGWWSIFRV+IPEN       P+ P RVT
Sbjct: 181 SKPETRVVICERGGWPTWSGDSTVFFHHQADDGWWSIFRVDIPENFTEYTDFPITPIRVT 240

Query: 241 PAGLHCFTPAAMNDRKRLVVATRRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFV 300
           P+GLHCFTPAA  D KR+ +ATRR     RHIEI++ E   F P+T+ LNP FHHYNPFV
Sbjct: 241 PSGLHCFTPAAFRDGKRIALATRRRGVNHRHIEIYDLENTTFQPVTESLNPSFHHYNPFV 300

Query: 301 SPDSNSIGYHRFRGESAQGELTIPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPN 360
           SPDS  +GYHRFRGES QGE  +P +  ++SPI  LR++RINGSFP+ SP+GDLIA N +
Sbjct: 301 SPDSEFLGYHRFRGESTQGESIVPNIESIVSPIKTLRLLRINGSFPSSSPNGDLIALNSD 360

Query: 361 F---MGLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARIT 420
           F    G+++ KSDGSK  T++KDRTAFYNSWSPTE++VIYTSLGPIF      VQIARI 
Sbjct: 361 FDINGGIKVSKSDGSKRWTLIKDRTAFYNSWSPTERHVIYTSLGPIFSPARIAVQIARIK 420

Query: 421 INSDDLNGGRSDEIASEVKILTKDDTGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVN 480
            +  DL   + D +  +VKILT ++TGNNAFP+CSPDGK++VFRSGRSGHKNLYIVDAVN
Sbjct: 421 FDPSDLTADKED-LPCDVKILTLENTGNNAFPSCSPDGKSIVFRSGRSGHKNLYIVDAVN 480

Query: 481 GEFD-GELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRV 540
           GE + G +RRLTDGPWIDTMP WSP GDLI FSSN HNP+NT  F  YV+RPDG+GLRR+
Sbjct: 481 GESNGGGIRRLTDGPWIDTMPCWSPKGDLIGFSSNRHNPENTAVFGAYVVRPDGTGLRRI 540

Query: 541 HVAGPEGSSDVDRERINHVCFSRDGKWLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLD 600
            ++GPEGS +  RER+NHV F++DG WL+F ANLSGVTAEPV++PNQFQPYGDL+VVKLD
Sbjct: 541 QISGPEGSEEAARERVNHVSFNKDGDWLVFAANLSGVTAEPVTMPNQFQPYGDLYVVKLD 600

Query: 601 GTGLRRLTCSGYENGTPMWYYGSELALSGLSLKDEVVGEKLKGEFDEPLWIKFD 646
           GTGLRRLT +GYE+GTP W+   EL LS L+L  +  G+KL+G+F+EPLWI  D
Sbjct: 601 GTGLRRLTWNGYEDGTPTWHTADELDLSQLNLNGQ-DGDKLEGQFEEPLWISCD 651

BLAST of HG10023069 vs. TAIR 10
Match: AT1G21680.1 (DPP6 N-terminal domain-like protein )

HSP 1 Score: 483.4 bits (1243), Expect = 2.8e-136
Identity = 269/653 (41.19%), Positives = 384/653 (58.81%), Query Frame = 0

Query: 8   VLFTTIGLQQYGFDIFSVPLNSPTT---ERRLTDGISVNFNAQFL--------------- 67
           ++FTT+G   Y FDIF++    P +   E R+TDG SVNFN  F                
Sbjct: 34  IIFTTLGRSHYEFDIFALSTTQPPSVSGELRITDGESVNFNGYFPSPSPALLSLLPDETL 93

Query: 68  -----DNQLSVVFISERSGSSRIY-----------------LSDSPN--SAPKL-----L 127
                 + L +++++ER+G+S +Y                 + ++P+    P L     L
Sbjct: 94  IQMEDSSPLHLIYVTERNGTSSLYYDLVYGGNSDFKTKRRSVLEAPSRVQVPLLSRFDHL 153

Query: 128 PSAPGSFFNDRPIVRSGRLFFISAHENPQKPFTSWSALYSTALDGADSITRLTPPGSVDF 187
                + F D+P +    + ++S HE+  +P  SW+A+YST L       RLTP G  DF
Sbjct: 154 SGMTVNSFKDKPSLSGEFIVYVSTHESSGEPRASWTAVYSTELK-TGLTRRLTPSGVADF 213

Query: 188 SPAVSESGKFVAVASYGSRSWGGEFHELHTEIVVFKSSDPDRRVVVSGRGGWPSWSGDST 247
           SPAVS SG   AVASYG R W GE  EL T+I VF + D   RV V   GGWP W  +ST
Sbjct: 214 SPAVSPSGNLTAVASYGERGWTGEVEELRTDIYVFLTRDGSHRVKVVEHGGWPCWVDEST 273

Query: 248 VFFHRKA-DDGWWSIFRVEIPENLDSSVSPVP-TRVTPAGLHCFTPA-AMNDRKRLVVAT 307
           ++FHR++ +DGW S++R  +PEN   +   V   RVTP G+H FTPA + N+ + + VAT
Sbjct: 274 LYFHRRSEEDGWISVYRAILPENGPLTTESVTIQRVTPPGVHAFTPATSPNNHEFVAVAT 333

Query: 308 RRADSKFRHIEIFNSELNEFIPITQKLNPDFHHYNPFVSPDSNSIGYHRFRGESAQGELT 367
           RR  S +RH+E+F+ + NEFI +T+ + P  HH NPF+SPDS+ +GYH  RG++      
Sbjct: 334 RRPGSDYRHVELFDLKRNEFIELTRLVAPKSHHLNPFLSPDSSRVGYHSCRGDANGRRSP 393

Query: 368 IPYLNPVISPINELRMIRINGSFPTPSPDGDLIAFNPNFMGLQIVKSDGSKCRTVLKDRT 427
           + +L  + +   +L + RI+GSFP+ SP GD IA+     G+ +VK DGS  R V K   
Sbjct: 394 LLFLENIQTTTRDLSLFRIDGSFPSFSPGGDRIAY-VKMPGVFVVKPDGSGQREVYKG-M 453

Query: 428 AFYNSWSPTEKNVIYTSLGPIFGAVTATVQIARITINSDDLNGGRSDEIASEVKILTKDD 487
           AF  +W P    ++Y+S GP F      V +  I +++ D         +S V+ LT + 
Sbjct: 454 AFSTAWDPVRPGIVYSSSGPTFATERTEVDVISIDVDAADK--------SSSVRRLTTNG 513

Query: 488 TGNNAFPACSPDGKTLVFRSGRSGHKNLYIVDAVNGEFDGELRRLTDGPWIDTMPSWSPA 547
             NNAFP  SPDGK +VFRSGR+GHKNLYI+DA  GE  G L RLT+G W DTM +WSP 
Sbjct: 514 K-NNAFPWPSPDGKRIVFRSGRTGHKNLYIMDAEKGE-SGGLWRLTEGAWTDTMCNWSPD 573

Query: 548 GDLIVFSSNMHNPKNTEAFSIYVIRPDGSGLRRVHVAGPEGSSDVDRERINHVCFSRDGK 607
           G+ I F+S+  +P  + +F +++I P+G+GLR++  +G  G       R NH  FS D K
Sbjct: 574 GEWIAFASDRESP-GSGSFELFLIHPNGTGLRKLIQSGTGG-------RTNHPIFSPDSK 633

Query: 608 WLLFTANLSGVTAEPVSLPNQFQPYGDLFVVKLDGTGLRRLTCSGYENGTPMW 611
            L+FT++ +G++AEP+S P+ +QPYGD+F VKLDG+ +RRLT + YE+GTP W
Sbjct: 634 SLVFTSDYAGISAEPISNPHHYQPYGDIFTVKLDGSNVRRLTHNSYEDGTPAW 665

BLAST of HG10023069 vs. TAIR 10
Match: AT1G21670.1 (LOCATED IN: cell wall, plant-type cell wall; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: WD40-like Beta Propeller (InterPro:IPR011659), Six-bladed beta-propeller, TolB-like (InterPro:IPR011042); BEST Arabidopsis thaliana protein match is: DPP6 N-terminal domain-like protein (TAIR:AT1G21680.1); Has 8461 Blast hits to 5060 proteins in 1257 species: Archae - 79; Bacteria - 5567; Metazoa - 37; Fungi - 70; Plants - 117; Viruses - 0; Other Eukaryotes - 2591 (source: NCBI BLink). )

HSP 1 Score: 452.2 bits (1162), Expect = 6.9e-127
Identity = 259/642 (40.34%), Positives = 372/642 (57.94%), Query Frame = 0

Query: 8   VLFTTIGLQQYGFDIFSVPLN----SPTTERRLTDGISVNFNAQFLD------------- 67
           +LFTTIG   + FDIF++P +    SP  E RLTDG S+NFN  F               
Sbjct: 34  ILFTTIGRPTFEFDIFTLPTSHRPPSPADEHRLTDGKSINFNGYFASPSTALISLLPKRT 93

Query: 68  ----NQLSVVFISERSGSSRIY--LSDSPNSAPKL-LPSAPG---------SFFNDRPIV 127
                 + +++++ER+G+  +   +  S N   ++ +P   G         +   D P++
Sbjct: 94  QIQPQDVHLIYVTERAGTPSLNYDVVHSDNVGSRIQVPLFSGEEQQSGMNVNSMKDTPVL 153

Query: 128 RSGRLFFISAHENPQKPFTSWSALYSTALDGADSITRLTPPGSVDFSPAVSESGKFVAVA 187
            +G L  +S HENP KP  SW+A+YST L    S  RLTP G  DFSPAVS SGK+ AVA
Sbjct: 154 TNGYLVHVSTHENPGKPMASWAAVYSTEL-RTKSTRRLTPLGIADFSPAVSPSGKWTAVA 213

Query: 188 SYGSRSWGGEF--HELHTEIVVFKSSDPDRRVVVSGRGGWPSWSGDSTVFFHRKADDGWW 247
           S+G + W       E+ +++ VF + D  +RV V  +GGWP W  DST++FHRK+DDGW 
Sbjct: 214 SFGEKGWTWSMVEKEISSDVYVFLTQDGTQRVKVVEQGGWPRWVDDSTLYFHRKSDDGWI 273

Query: 248 SIFRVEIPENLDSSVSPVP-TRVTPAGLHCFTPA-AMNDRKRLVVATRRADSKFRHIEIF 307
           S++R  +P+    +   V   RVTP GLH FTPA + N+   + VATRR  S+ RH+E+F
Sbjct: 274 SVYRAILPKTGPVTTKSVTIQRVTPPGLHAFTPATSPNNNNFIAVATRRPGSEIRHVELF 333

Query: 308 NSELNEFIPITQKLNPDFHHYNPFVSPDSNSIGYHRFRGESAQGELTIPYLNPVISPINE 367
           + + NEF+ +T+ ++P  HH+NPF+SPDS+ +GYH  RG++   +     L  + +  N+
Sbjct: 334 DLKKNEFVELTRLVSPKSHHFNPFLSPDSSRVGYHSCRGDATGRKTPRNLLQSLKTTSND 393

Query: 368 LRMIRINGSFPTPSPDGDLIAFNPNFMGLQIVKSDGSKCRTVLKDRTAFYNSWSPTEKNV 427
           L + R +G+FP+ SP+GD  AF  +F G+ +V  DGS  R +L  +  F   W P    +
Sbjct: 394 LSLFRFDGAFPSISPEGDRFAF-VSFTGVFVVNPDGSGLRQLL-PQMGFGTVWDPIRHGI 453

Query: 428 IYTSLGPIFGAVTATVQIARITINSDDLNGGRSDEIASEVKILTKDDTG-NNAFPACSPD 487
           +YTS GP      + + I  I ++        +   A+ VK LT   TG NNAFP  SPD
Sbjct: 454 VYTSSGPALAPGKSQIDILAINVD--------APSPATAVKKLT--TTGENNAFPWPSPD 513

Query: 488 GKTLVFRSGRSGHKNLYIVDAVNGEFDGELRRLTDGPWIDTMPSWSPAGDLIVFSSNMHN 547
           GK +VFRS RSG KNLYI+DA  GE  G L RLT+G W DT+ +WSP G+ IVF+SN   
Sbjct: 514 GKRIVFRSARSGTKNLYIMDAEKGE-SGGLFRLTNGNWNDTIATWSPDGNWIVFASNREF 573

Query: 548 PKNTEAFSIYVIRPDGSGLRRVHVAGPEGSSDVDRERINHVCFSRDGKWLLFTANLSGVT 607
           P  T   +IYV+ PDG+GLR++       + ++      H  FS D K ++FT   +G++
Sbjct: 574 P-GTLLMNIYVVHPDGTGLRKL-------AQNLTGLVSMHPMFSPDSKRIVFTTIYAGIS 633

Query: 608 AEPVSLPNQFQPYGDLFVVKLDGTGLRRLTCSGYENGTPMWY 612
           AE +  P+   P  ++F V LDG+GL RLT +  E+G PMW+
Sbjct: 634 AEQIGNPHFNVPSSEIFTVNLDGSGLTRLTHNSVEDGPPMWF 653

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_038899648.1  0.0e+00  92.59     uncharacterized protein LOC120086912 [Benincasa hispida]
KAA0058562.1    0.0e+00  90.43     TolB protein-related isoform 1 [Cucumis melo var. makuwa] >TYK10360.1 TolB prote...
XP_008461387.1  0.0e+00  89.97     PREDICTED: uncharacterized protein LOC103499982 [Cucumis melo]
XP_004135965.1  0.0e+00  88.27     uncharacterized protein LOC101214858 [Cucumis sativus] >KGN45063.1 hypothetical ...
XP_023547812.1  0.0e+00  86.51     uncharacterized protein LOC111806666 [Cucurbita pepo subsp. pepo]

Match Name  E-value  Identity  Description
A8LHQ6      5.4e-15  32.14     Tol-Pal system protein TolB OS=Dinoroseobacter shibae (strain DSM 16493 / NCIMB ...
Q3APB5      9.1e-15  30.43     Protein TolB homolog OS=Chlorobium chlorochromatii (strain CaD3) OX=340177 GN=to...
Q0AC40      3.5e-14  28.57     Tol-Pal system protein TolB OS=Alkalilimnicola ehrlichii (strain ATCC BAA-1101 /...
Q167Z6      2.2e-13  34.34     Tol-Pal system protein TolB OS=Roseobacter denitrificans (strain ATCC 33942 / OC...
B2JGA0      1.1e-12  30.48     Tol-Pal system protein TolB OS=Paraburkholderia phymatum (strain DSM 17167 / CIP...

Match Name  E-value  Identity  Description
A0A5A7URS2  0.0e+00  90.43     TolB protein-related isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A1S3CEJ8  0.0e+00  89.97     uncharacterized protein LOC103499982 OS=Cucumis melo OX=3656 GN=LOC103499982 PE=...
A0A0A0K6G2  0.0e+00  88.27     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G419590 PE=4 SV=1
A0A6J1GRK1  0.0e+00  85.58     uncharacterized protein LOC111456480 OS=Cucurbita moschata OX=3662 GN=LOC1114564...
A0A6J1JTR3  0.0e+00  84.65     uncharacterized protein LOC111488250 OS=Cucurbita maxima OX=3661 GN=LOC111488250...

Match Name   E-value   Identity  Description
AT4G01870.1  9.7e-254  63.46     tolB protein-related
AT1G21680.1  2.8e-136  41.19     DPP6 N-terminal domain-like protein
AT1G21670.1  6.9e-127  40.34     LOCATED IN: cell wall, plant-type cell wall; EXPRESSED IN: 23 plant structures; ...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                       Source       Source Term    Source Description                   Alignment
IPR011659  WD40-like Beta Propeller              PFAM         PF07676        PD40                                 coord: 438..467; e-value: 5.1E-5; score: 23.1
IPR011659  WD40-like Beta Propeller              PFAM         PF07676        PD40                                 coord: 479..506; e-value: 0.0013; score: 18.6
IPR011042  Six-bladed beta-propeller, TolB-like  GENE3D       2.120.10.30    -                                    coord: 178..311; e-value: 9.7E-6; score: 27.0
IPR011042  Six-bladed beta-propeller, TolB-like  GENE3D       2.120.10.30    -                                    coord: 357..572; e-value: 2.3E-33; score: 117.6
None       No IPR available                      GENE3D       2.140.10.30    -                                    coord: 19..160; e-value: 5.2E-6; score: 27.5
None       No IPR available                      PANTHER      PTHR32161:SF9  TOLB PROTEIN-LIKE PROTEIN            coord: 3..645
None       No IPR available                      PANTHER      PTHR32161      DPP6 N-TERMINAL DOMAIN-LIKE PROTEIN  coord: 3..645
None       No IPR available                      SUPERFAMILY  82171          DPP6 N-terminal domain-like          coord: 96..395
None       No IPR available                      SUPERFAMILY  82171          DPP6 N-terminal domain-like          coord: 341..611
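
The `coord:` values above give start and end positions on the protein. The Python sketch below assumes they are 1-based and inclusive, which is an assumption about this report rather than something stated in it. It turns a range into a span length and measures how far two ranges overlap. The function names are hypothetical.

```python
import re

# Matches "coord: START..END" as written in the alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from a 'coord: a..b' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coord range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span: tuple[int, int]) -> int:
    """Residues covered by an inclusive (start, end) range."""
    return span[1] - span[0] + 1


def overlap_length(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Residues shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


pd40_first = parse_coord("coord: 438..467")
sf_first = parse_coord("coord: 96..395")
sf_second = parse_coord("coord: 341..611")
print(span_length(pd40_first), overlap_length(sf_first, sf_second))
```

Under that assumption, the two SUPERFAMILY matches overlap partially rather than tiling the protein end to end.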

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10023069.1  HG10023069.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis