Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAACTCCCTTTGAACAACCCCTCTCTTTGACCTTCCCTACTGAGTTTCCTTACGATTTTGACTCCTTTCTCTCTAACTCCGACCTCAACTCCCCCGTCGAATCGCTTGGGAGTTCTACTACGGATTCCACTGATAGCTGTGGCAGCGATGATGATGACTTCTTTGTGGGTTTGGCTCAGCAACTTGCATGGACCTCCCTTTGTGAAACTGAGAAATCTGTATCTCCTTCCTTTAATCCAAACGAATTTGAGGTACCCTTTTCATGAAATCTTCACTTCTCCCTATCTTTTATGTGTTTGAATTAGTTTGTTTCTGATTTGATTTTTTTTGGTTTTTTTTAGAAGAAGTATGTTAAAGCTGGCTCTCCTCAGTCGACTTTAAGTGGAATAGACGCCTGGTTTCGCCCTGAAAGTCCTTCCTCTCAGTTGCAATCGCCTCCCATGGCGGTTTTTGGGGCTGAGAATGATGCTAGAGCCCTTCTACATGCAGCTGCTAGAGAGGCGGCGAGGTTGAAGATGAGCGGCGAAACAACCCTGTTTCACAACAATGACCCTTTTATGAGAGGATTTGTGGGTGCTCGATCTTCCATTCCATTTAAATCTGCAAACAATGTGGATTATGGGCTTTACTCGAATCAAAACTGTGCTCGGAATCTGGCATTTGCCGCGCAGGTAAATTACTTCCAACTTCCAAATGTCCAAATCCTTCCATCTGTTGAATATCTCAGAACAGAGTATTTTAACAGTTGGGTACTCTGTTTAAGTTATGGGTCCTCTTTCATTCCTGAGGTTTTTCTGTTTTGTTGTTCTATTAAGTTATGGGTCCTCTTTCATTCCTGAGGTTTTTCTGTTTTGTTGTTCTACAGGTGCAGCAAGTGAAACACGATCTTGTATTACAGGCGATTCGTGCCTCTTCTTGGGGCGGAAGACAAGCTAAAGTCAGCTGGTCTGCTCAGCCACACTGGAAGCCGGAGATTCAAAACAGAGAGAGATATGTTGTTAATGCAAGTCGCAACGGCGGATTGTACCCTTCCCCATGGCTTCCGCCGCCCCAAATTCAGCAACCACCGTCCAACGCCTCCGTCATGCGTTGCATTCATCCCAGCAGATCTGGCGTGAAAAGAGCTTCCTCCTGCACCGGCGTTTTCTTGCCTCGCAGATATGTAAACCCTTCAGAATGCCGCCAGAAACAAGGTATTTCAGAACCTCCCTTCCTTCATCTGAAGAATCACCTTTGTTTCTTTAATTAAGCTTCATAATCATCATTGTCATGCCTAATTAGGAAGCCCAGCCGTTCGATTTCCAGAGGAGATGAAAAGCCCCATTCAAGCCCCATTAAACGGTTGCCTAGCACCTGGCTCCGGTAAGCAAGAATTTCCCCAAAATTTTCACCACATGGCCCAAAATTAAGATTGTTGTAATGGGTTTCCTTGATGGACTTTTACAAATCACAGATTCCATTTTATCTCGAAGAAATAATCCCCTTCTGCCATCGCCAAGGATTTTCCGGACAGATGGAGCCATGAATCAAGAACATCACCTACCTCAGGAATGGACATACTGA
mRNA sequence
ATGGCAACTCCCTTTGAACAACCCCTCTCTTTGACCTTCCCTACTGAGTTTCCTTACGATTTTGACTCCTTTCTCTCTAACTCCGACCTCAACTCCCCCGTCGAATCGCTTGGGAGTTCTACTACGGATTCCACTGATAGCTGTGGCAGCGATGATGATGACTTCTTTGTGGGTTTGGCTCAGCAACTTGCATGGACCTCCCTTTGTGAAACTGAGAAATCTGTATCTCCTTCCTTTAATCCAAACGAATTTGAGAAGAAGTATGTTAAAGCTGGCTCTCCTCAGTCGACTTTAAGTGGAATAGACGCCTGGTTTCGCCCTGAAAGTCCTTCCTCTCAGTTGCAATCGCCTCCCATGGCGGTTTTTGGGGCTGAGAATGATGCTAGAGCCCTTCTACATGCAGCTGCTAGAGAGGCGGCGAGGTTGAAGATGAGCGGCGAAACAACCCTGTTTCACAACAATGACCCTTTTATGAGAGGATTTGTGGGTGCTCGATCTTCCATTCCATTTAAATCTGCAAACAATGTGGATTATGGGCTTTACTCGAATCAAAACTGTGCTCGGAATCTGGCATTTGCCGCGCAGGTGCAGCAAGTGAAACACGATCTTGTATTACAGGCGATTCGTGCCTCTTCTTGGGGCGGAAGACAAGCTAAAGTCAGCTGGTCTGCTCAGCCACACTGGAAGCCGGAGATTCAAAACAGAGAGAGATATGTTGTTAATGCAAGTCGCAACGGCGGATTGTACCCTTCCCCATGGCTTCCGCCGCCCCAAATTCAGCAACCACCGTCCAACGCCTCCGTCATGCGTTGCATTCATCCCAGCAGATCTGGCGTGAAAAGAGCTTCCTCCTGCACCGGCGTTTTCTTGCCTCGCAGATATGTAAACCCTTCAGAATGCCGCCAGAAACAAGGAAGCCCAGCCGTTCGATTTCCAGAGGAGATGAAAAGCCCCATTCAAGCCCCATTAAACGGTTGCCTAGCACCTGGCTCCGATTCCATTTTATCTCGAAGAAATAATCCCCTTCTGCCATCGCCAAGGATTTTCCGGACAGATGGAGCCATGAATCAAGAACATCACCTACCTCAGGAATGGACATACTGA
Coding sequence (CDS)
ATGGCAACTCCCTTTGAACAACCCCTCTCTTTGACCTTCCCTACTGAGTTTCCTTACGATTTTGACTCCTTTCTCTCTAACTCCGACCTCAACTCCCCCGTCGAATCGCTTGGGAGTTCTACTACGGATTCCACTGATAGCTGTGGCAGCGATGATGATGACTTCTTTGTGGGTTTGGCTCAGCAACTTGCATGGACCTCCCTTTGTGAAACTGAGAAATCTGTATCTCCTTCCTTTAATCCAAACGAATTTGAGAAGAAGTATGTTAAAGCTGGCTCTCCTCAGTCGACTTTAAGTGGAATAGACGCCTGGTTTCGCCCTGAAAGTCCTTCCTCTCAGTTGCAATCGCCTCCCATGGCGGTTTTTGGGGCTGAGAATGATGCTAGAGCCCTTCTACATGCAGCTGCTAGAGAGGCGGCGAGGTTGAAGATGAGCGGCGAAACAACCCTGTTTCACAACAATGACCCTTTTATGAGAGGATTTGTGGGTGCTCGATCTTCCATTCCATTTAAATCTGCAAACAATGTGGATTATGGGCTTTACTCGAATCAAAACTGTGCTCGGAATCTGGCATTTGCCGCGCAGGTGCAGCAAGTGAAACACGATCTTGTATTACAGGCGATTCGTGCCTCTTCTTGGGGCGGAAGACAAGCTAAAGTCAGCTGGTCTGCTCAGCCACACTGGAAGCCGGAGATTCAAAACAGAGAGAGATATGTTGTTAATGCAAGTCGCAACGGCGGATTGTACCCTTCCCCATGGCTTCCGCCGCCCCAAATTCAGCAACCACCGTCCAACGCCTCCGTCATGCGTTGCATTCATCCCAGCAGATCTGGCGTGAAAAGAGCTTCCTCCTGCACCGGCGTTTTCTTGCCTCGCAGATATGTAAACCCTTCAGAATGCCGCCAGAAACAAGGAAGCCCAGCCGTTCGATTTCCAGAGGAGATGAAAAGCCCCATTCAAGCCCCATTAAACGGTTGCCTAGCACCTGGCTCCGATTCCATTTTATCTCGAAGAAATAATCCCCTTCTGCCATCGCCAAGGATTTTCCGGACAGATGGAGCCATGAATCAAGAACATCACCTACCTCAGGAATGGACATACTGA
Protein sequence
MATPFEQPLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRNGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQEHHLPQEWTY
Homology
BLAST of HG10018445 vs. NCBI nr
Match:
XP_038884388.1 (uncharacterized protein LOC120075245 isoform X1 [Benincasa hispida])
HSP 1 Score: 629.8 bits (1623), Expect = 1.5e-176
Identity = 317/372 (85.22%), Postives = 332/372 (89.25%), Query Frame = 0
Query: 1 MATPFEQPLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLA 60
MATP SL+FPTEFPYDFDSF SNSDLNSPVES+GSS TDSTDS GSDDDDFFVGLA
Sbjct: 1 MATP-----SLSFPTEFPYDFDSFFSNSDLNSPVESVGSSVTDSTDSSGSDDDDFFVGLA 60
Query: 61 QQLAWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMA 120
QQLAWTSLCETEKS SPSFNP FEK YVKAGSPQSTL+GID WFRPESPSSQLQSPPMA
Sbjct: 61 QQLAWTSLCETEKSTSPSFNPKNFEKMYVKAGSPQSTLTGIDTWFRPESPSSQLQSPPMA 120
Query: 121 VFGAENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGL 180
VFGAENDARALLHAAAREAARLKMSGETT F NNDPFMR +VGARSSIP KS NNVDYG+
Sbjct: 121 VFGAENDARALLHAAAREAARLKMSGETTPFRNNDPFMREYVGARSSIPVKSTNNVDYGV 180
Query: 181 YSNQNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVV 240
+SNQNCARNLAFAAQ+QQVK DLVLQA+RASSWGGRQAKVSWSAQPHWK EIQ+RER V+
Sbjct: 181 FSNQNCARNLAFAAQMQQVKQDLVLQALRASSWGGRQAKVSWSAQPHWKAEIQSRERNVL 240
Query: 241 NAS-----RNGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYV 300
NAS GGLY SPWLPPPQ QQPPSN SVMRCIHP RSGVKRASS TGVFLPRRY+
Sbjct: 241 NASGRCGGNTGGLYHSPWLPPPQNQQPPSNPSVMRCIHPGRSGVKRASSGTGVFLPRRYI 300
Query: 301 NPSECRQKQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAM 360
+PSECRQKQGSPAVRF EEMKSPIQAPLNG L+P DS+LSRRNNPLLP PR FRT+GAM
Sbjct: 301 SPSECRQKQGSPAVRFAEEMKSPIQAPLNGWLSPSIDSMLSRRNNPLLPLPRSFRTEGAM 360
Query: 361 NQEHHLPQEWTY 368
NQE HLPQEWTY
Sbjct: 361 NQELHLPQEWTY 367
BLAST of HG10018445 vs. NCBI nr
Match:
XP_038884389.1 (uncharacterized protein LOC120075245 isoform X2 [Benincasa hispida])
HSP 1 Score: 623.6 bits (1607), Expect = 1.1e-174
Identity = 316/372 (84.95%), Postives = 331/372 (88.98%), Query Frame = 0
Query: 1 MATPFEQPLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLA 60
MATP SL+FPTEFPYDFDSF SNSDLNSPVES+GSS TDSTDS GSDDDDFFVGLA
Sbjct: 1 MATP-----SLSFPTEFPYDFDSFFSNSDLNSPVESVGSSVTDSTDSSGSDDDDFFVGLA 60
Query: 61 QQLAWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMA 120
QQLAWTSLCETEKS SPSFNP FE YVKAGSPQSTL+GID WFRPESPSSQLQSPPMA
Sbjct: 61 QQLAWTSLCETEKSTSPSFNPKNFE-MYVKAGSPQSTLTGIDTWFRPESPSSQLQSPPMA 120
Query: 121 VFGAENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGL 180
VFGAENDARALLHAAAREAARLKMSGETT F NNDPFMR +VGARSSIP KS NNVDYG+
Sbjct: 121 VFGAENDARALLHAAAREAARLKMSGETTPFRNNDPFMREYVGARSSIPVKSTNNVDYGV 180
Query: 181 YSNQNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVV 240
+SNQNCARNLAFAAQ+QQVK DLVLQA+RASSWGGRQAKVSWSAQPHWK EIQ+RER V+
Sbjct: 181 FSNQNCARNLAFAAQMQQVKQDLVLQALRASSWGGRQAKVSWSAQPHWKAEIQSRERNVL 240
Query: 241 NAS-----RNGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYV 300
NAS GGLY SPWLPPPQ QQPPSN SVMRCIHP RSGVKRASS TGVFLPRRY+
Sbjct: 241 NASGRCGGNTGGLYHSPWLPPPQNQQPPSNPSVMRCIHPGRSGVKRASSGTGVFLPRRYI 300
Query: 301 NPSECRQKQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAM 360
+PSECRQKQGSPAVRF EEMKSPIQAPLNG L+P DS+LSRRNNPLLP PR FRT+GAM
Sbjct: 301 SPSECRQKQGSPAVRFAEEMKSPIQAPLNGWLSPSIDSMLSRRNNPLLPLPRSFRTEGAM 360
Query: 361 NQEHHLPQEWTY 368
NQE HLPQEWTY
Sbjct: 361 NQELHLPQEWTY 366
BLAST of HG10018445 vs. NCBI nr
Match:
XP_008441420.1 (PREDICTED: uncharacterized protein LOC103485542 [Cucumis melo] >KAA0041485.1 uncharacterized protein E6C27_scaffold6G00720 [Cucumis melo var. makuwa] >TYK24358.1 uncharacterized protein E5676_scaffold205G001390 [Cucumis melo var. makuwa])
HSP 1 Score: 595.9 bits (1535), Expect = 2.4e-166
Identity = 304/364 (83.52%), Postives = 319/364 (87.64%), Query Frame = 0
Query: 10 SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
SLTFPTEFPYDFDSFLSNSDLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQQLAWTSLC
Sbjct: 5 SLTFPTEFPYDFDSFLSNSDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQQLAWTSLC 64
Query: 70 ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
E E N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65 EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124
Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
A+LHAAAREAARLKMSGETT F N DPFMRGFVGARSSIP KS NNVDYG++SNQN ARN
Sbjct: 125 AILHAAAREAARLKMSGETTPFQNIDPFMRGFVGARSSIPVKSTNNVDYGVFSNQNSARN 184
Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRN---- 249
LAFAAQVQQVK DLVLQA+RASS GGRQAKVSWSAQPHWK EIQNRER VVNAS
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLGGRQAKVSWSAQPHWKQEIQNRERNVVNASGRCGVG 244
Query: 250 -GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
GGLY SPWLPP Q QQP N +V+RCIHP RSGVKRASS TGVFLPRRY+NP++CRQKQ
Sbjct: 245 AGGLYHSPWLPPLQNQQPTPNTTVVRCIHPVRSGVKRASSGTGVFLPRRYINPTDCRQKQ 304
Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
G P+VRF EEMKSPIQAPLNGCL+PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFAEEMKSPIQAPLNGCLSPGFDPILSRRNNPLLPLPRSFRTEGGMNQEHHHLPQ 359
BLAST of HG10018445 vs. NCBI nr
Match:
XP_004138440.2 (uncharacterized protein LOC101208139 [Cucumis sativus] >KGN45759.1 hypothetical protein Csa_005113 [Cucumis sativus])
HSP 1 Score: 583.9 bits (1504), Expect = 9.3e-163
Identity = 298/364 (81.87%), Postives = 315/364 (86.54%), Query Frame = 0
Query: 10 SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
+LTFPTEFPYDFDSFLSN DLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQ+LAWTSLC
Sbjct: 5 TLTFPTEFPYDFDSFLSNCDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQKLAWTSLC 64
Query: 70 ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
E E N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65 EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124
Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
A+LHAAAREAA+LKMSGETT F NNDPFMRGFVGARSS+P KS NNVDYG++S QN ARN
Sbjct: 125 AILHAAAREAAKLKMSGETTPFQNNDPFMRGFVGARSSVPVKSTNNVDYGVFSTQNSARN 184
Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNAS-----R 249
LAFAAQVQQVK DLVLQA+RASS RQAK SWSAQPHWK EIQNRER VVNAS
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLRERQAKASWSAQPHWKQEIQNRERNVVNASGRCGGG 244
Query: 250 NGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
GGLY SPWLPP Q QQP SN +V+RCIHP RSGVKRASS TGVFLPRRY+NPSECRQKQ
Sbjct: 245 TGGLYHSPWLPPLQNQQPTSNPTVVRCIHPVRSGVKRASSGTGVFLPRRYINPSECRQKQ 304
Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
G P+VRF EEMKSPIQAPLNGC +PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFVEEMKSPIQAPLNGCHSPGFDPILSRRNNPLLPLPRSFRTEGVMNQEHHHLPQ 359
BLAST of HG10018445 vs. NCBI nr
Match:
XP_022134429.1 (uncharacterized protein LOC111006679 [Momordica charantia])
HSP 1 Score: 537.7 bits (1384), Expect = 7.7e-149
Identity = 274/362 (75.69%), Postives = 305/362 (84.25%), Query Frame = 0
Query: 10 SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
+L+FPT+FPY+FDS SN DLNSPVES+ SS T+STDS SDDDDFFVGLA+QLAWT LC
Sbjct: 18 TLSFPTDFPYEFDSLASNFDLNSPVESVVSS-TESTDS--SDDDDFFVGLARQLAWTYLC 77
Query: 70 ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
ETE S +P FNPN+FEKKYVKAGSPQSTLSGIDAWFRP SPSSQL+SPP+AVFGAENDAR
Sbjct: 78 ETESSPTPCFNPNKFEKKYVKAGSPQSTLSGIDAWFRPHSPSSQLKSPPIAVFGAENDAR 137
Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
L+HAAAREAARLKMS ETT F +NDPF+RGF+GARSSIP KS +NVDYGL+SN+ CARN
Sbjct: 138 VLVHAAAREAARLKMSAETTPFQSNDPFVRGFMGARSSIPVKSTSNVDYGLFSNEGCARN 197
Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYV----VNASRN 249
LAF+AQVQQV+HDLVLQAI ASSW GRQAKV W+A PH KPEIQNRER + S
Sbjct: 198 LAFSAQVQQVRHDLVLQAICASSW-GRQAKVDWAAPPHRKPEIQNRERNIGLGGGRCSGA 257
Query: 250 GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQG 309
GLY S WLPPPQ Q PP NAS +RCIHP VKRASS TGVFLPRRYVNPSECRQKQG
Sbjct: 258 AGLYQSAWLPPPQSQPPPPNASAVRCIHPGGPAVKRASSGTGVFLPRRYVNPSECRQKQG 317
Query: 310 SPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQEHHLPQEW 368
+PAVRFPEEM +PIQAP NGCL+PG D++L+RRN LLP PR R + A+NQE HLPQEW
Sbjct: 318 TPAVRFPEEMINPIQAPFNGCLSPGFDAMLARRNTSLLPLPRSLRGEAAINQELHLPQEW 375
BLAST of HG10018445 vs. ExPASy TrEMBL
Match:
A0A5A7TJ83 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold205G001390 PE=4 SV=1)
HSP 1 Score: 595.9 bits (1535), Expect = 1.1e-166
Identity = 304/364 (83.52%), Postives = 319/364 (87.64%), Query Frame = 0
Query: 10 SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
SLTFPTEFPYDFDSFLSNSDLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQQLAWTSLC
Sbjct: 5 SLTFPTEFPYDFDSFLSNSDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQQLAWTSLC 64
Query: 70 ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
E E N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65 EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124
Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
A+LHAAAREAARLKMSGETT F N DPFMRGFVGARSSIP KS NNVDYG++SNQN ARN
Sbjct: 125 AILHAAAREAARLKMSGETTPFQNIDPFMRGFVGARSSIPVKSTNNVDYGVFSNQNSARN 184
Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRN---- 249
LAFAAQVQQVK DLVLQA+RASS GGRQAKVSWSAQPHWK EIQNRER VVNAS
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLGGRQAKVSWSAQPHWKQEIQNRERNVVNASGRCGVG 244
Query: 250 -GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
GGLY SPWLPP Q QQP N +V+RCIHP RSGVKRASS TGVFLPRRY+NP++CRQKQ
Sbjct: 245 AGGLYHSPWLPPLQNQQPTPNTTVVRCIHPVRSGVKRASSGTGVFLPRRYINPTDCRQKQ 304
Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
G P+VRF EEMKSPIQAPLNGCL+PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFAEEMKSPIQAPLNGCLSPGFDPILSRRNNPLLPLPRSFRTEGGMNQEHHHLPQ 359
BLAST of HG10018445 vs. ExPASy TrEMBL
Match:
A0A1S3B425 (uncharacterized protein LOC103485542 OS=Cucumis melo OX=3656 GN=LOC103485542 PE=4 SV=1)
HSP 1 Score: 595.9 bits (1535), Expect = 1.1e-166
Identity = 304/364 (83.52%), Postives = 319/364 (87.64%), Query Frame = 0
Query: 10 SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
SLTFPTEFPYDFDSFLSNSDLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQQLAWTSLC
Sbjct: 5 SLTFPTEFPYDFDSFLSNSDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQQLAWTSLC 64
Query: 70 ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
E E N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65 EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124
Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
A+LHAAAREAARLKMSGETT F N DPFMRGFVGARSSIP KS NNVDYG++SNQN ARN
Sbjct: 125 AILHAAAREAARLKMSGETTPFQNIDPFMRGFVGARSSIPVKSTNNVDYGVFSNQNSARN 184
Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRN---- 249
LAFAAQVQQVK DLVLQA+RASS GGRQAKVSWSAQPHWK EIQNRER VVNAS
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLGGRQAKVSWSAQPHWKQEIQNRERNVVNASGRCGVG 244
Query: 250 -GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
GGLY SPWLPP Q QQP N +V+RCIHP RSGVKRASS TGVFLPRRY+NP++CRQKQ
Sbjct: 245 AGGLYHSPWLPPLQNQQPTPNTTVVRCIHPVRSGVKRASSGTGVFLPRRYINPTDCRQKQ 304
Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
G P+VRF EEMKSPIQAPLNGCL+PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFAEEMKSPIQAPLNGCLSPGFDPILSRRNNPLLPLPRSFRTEGGMNQEHHHLPQ 359
BLAST of HG10018445 vs. ExPASy TrEMBL
Match:
A0A0A0KA94 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G009380 PE=4 SV=1)
HSP 1 Score: 583.9 bits (1504), Expect = 4.5e-163
Identity = 298/364 (81.87%), Postives = 315/364 (86.54%), Query Frame = 0
Query: 10 SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
+LTFPTEFPYDFDSFLSN DLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQ+LAWTSLC
Sbjct: 5 TLTFPTEFPYDFDSFLSNCDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQKLAWTSLC 64
Query: 70 ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
E E N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65 EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124
Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
A+LHAAAREAA+LKMSGETT F NNDPFMRGFVGARSS+P KS NNVDYG++S QN ARN
Sbjct: 125 AILHAAAREAAKLKMSGETTPFQNNDPFMRGFVGARSSVPVKSTNNVDYGVFSTQNSARN 184
Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNAS-----R 249
LAFAAQVQQVK DLVLQA+RASS RQAK SWSAQPHWK EIQNRER VVNAS
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLRERQAKASWSAQPHWKQEIQNRERNVVNASGRCGGG 244
Query: 250 NGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
GGLY SPWLPP Q QQP SN +V+RCIHP RSGVKRASS TGVFLPRRY+NPSECRQKQ
Sbjct: 245 TGGLYHSPWLPPLQNQQPTSNPTVVRCIHPVRSGVKRASSGTGVFLPRRYINPSECRQKQ 304
Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
G P+VRF EEMKSPIQAPLNGC +PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFVEEMKSPIQAPLNGCHSPGFDPILSRRNNPLLPLPRSFRTEGVMNQEHHHLPQ 359
BLAST of HG10018445 vs. ExPASy TrEMBL
Match:
A0A6J1BYQ6 (uncharacterized protein LOC111006679 OS=Momordica charantia OX=3673 GN=LOC111006679 PE=4 SV=1)
HSP 1 Score: 537.7 bits (1384), Expect = 3.7e-149
Identity = 274/362 (75.69%), Postives = 305/362 (84.25%), Query Frame = 0
Query: 10 SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
+L+FPT+FPY+FDS SN DLNSPVES+ SS T+STDS SDDDDFFVGLA+QLAWT LC
Sbjct: 18 TLSFPTDFPYEFDSLASNFDLNSPVESVVSS-TESTDS--SDDDDFFVGLARQLAWTYLC 77
Query: 70 ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
ETE S +P FNPN+FEKKYVKAGSPQSTLSGIDAWFRP SPSSQL+SPP+AVFGAENDAR
Sbjct: 78 ETESSPTPCFNPNKFEKKYVKAGSPQSTLSGIDAWFRPHSPSSQLKSPPIAVFGAENDAR 137
Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
L+HAAAREAARLKMS ETT F +NDPF+RGF+GARSSIP KS +NVDYGL+SN+ CARN
Sbjct: 138 VLVHAAAREAARLKMSAETTPFQSNDPFVRGFMGARSSIPVKSTSNVDYGLFSNEGCARN 197
Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYV----VNASRN 249
LAF+AQVQQV+HDLVLQAI ASSW GRQAKV W+A PH KPEIQNRER + S
Sbjct: 198 LAFSAQVQQVRHDLVLQAICASSW-GRQAKVDWAAPPHRKPEIQNRERNIGLGGGRCSGA 257
Query: 250 GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQG 309
GLY S WLPPPQ Q PP NAS +RCIHP VKRASS TGVFLPRRYVNPSECRQKQG
Sbjct: 258 AGLYQSAWLPPPQSQPPPPNASAVRCIHPGGPAVKRASSGTGVFLPRRYVNPSECRQKQG 317
Query: 310 SPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQEHHLPQEW 368
+PAVRFPEEM +PIQAP NGCL+PG D++L+RRN LLP PR R + A+NQE HLPQEW
Sbjct: 318 TPAVRFPEEMINPIQAPFNGCLSPGFDAMLARRNTSLLPLPRSLRGEAAINQELHLPQEW 375
BLAST of HG10018445 vs. ExPASy TrEMBL
Match:
A0A6J1JZ67 (uncharacterized protein LOC111489193 OS=Cucurbita maxima OX=3661 GN=LOC111489193 PE=4 SV=1)
HSP 1 Score: 498.0 bits (1281), Expect = 3.3e-137
Identity = 265/374 (70.86%), Postives = 295/374 (78.88%), Query Frame = 0
Query: 1 MATPFEQPLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLA 60
MATP + P TFPTEFPY+ DSF S SDLNSP+ES SS TDSTDS GSDDDDFF GLA
Sbjct: 1 MATPLD-PSLFTFPTEFPYESDSFASFSDLNSPLESAVSS-TDSTDSSGSDDDDFFEGLA 60
Query: 61 QQLAWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMA 120
Q AWTSL ET+KS SPS N FE KYVK GSPQSTL+GID WFR E+P SQ+QSPP+A
Sbjct: 61 HQFAWTSLSETDKSTSPSSKANNFENKYVKTGSPQSTLAGIDTWFRSETP-SQVQSPPLA 120
Query: 121 VFGAENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGL 180
+ GA NDARAL+HAAA EA R +MS TTLFH N PF+RGF+GARSS+P +S N +DYGL
Sbjct: 121 LSGAVNDARALVHAAAIEATRFQMSRGTTLFHTNVPFVRGFLGARSSVPVESVNELDYGL 180
Query: 181 YSNQNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVV 240
+SN+NC RNLAFAAQ QQVK DLVLQA+ ASSW GRQAKV WSAQPHWKPE +NRE V
Sbjct: 181 FSNRNCDRNLAFAAQAQQVKRDLVLQALSASSW-GRQAKVGWSAQPHWKPEFRNREGTFV 240
Query: 241 NA-----SRNGGLYPSPWLPPPQIQQPPSNAS-VMRCIHPSRSGVKRASSCTGVFLPRRY 300
+A +G Y SPWLPPP+ QQP NAS RCIHP RSGVKRASS TGVFLPR+Y
Sbjct: 241 DARGRCSGGSGDFYHSPWLPPPRHQQPTPNASAATRCIHPGRSGVKRASSGTGVFLPRKY 300
Query: 301 VNPSECRQKQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGA 360
VNPSE QK+GSPAVRFPEEM+SPIQAPLNG L PG +SI SRRN P+LP PR FR + A
Sbjct: 301 VNPSESFQKKGSPAVRFPEEMRSPIQAPLNGFLWPGFNSISSRRNKPVLPLPRSFRGEVA 360
Query: 361 MNQEH-HLPQEWTY 368
+NQE HLPQEWTY
Sbjct: 361 INQEQLHLPQEWTY 370
BLAST of HG10018445 vs. TAIR 10
Match:
AT3G55690.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G39870.1); Has 76 Blast hits to 69 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 3; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 88.6 bits (218), Expect = 1.1e-17
Identity = 104/365 (28.49%), Postives = 150/365 (41.10%), Query Frame = 0
Query: 5 FEQPL-SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQL 64
FEQP+ LTFP EFPY+ F S++ SP +S T++ D D+DDF GL ++L
Sbjct: 21 FEQPMEKLTFPNEFPYE---FASSTFSTSPEDS-----TETEDETTDDEDDFLAGLTRRL 80
Query: 65 AWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFG 124
A T++ SPSF +K +K +ST SG+ + P P SQ+ SPP +
Sbjct: 81 A----LSTQRLSSPSF---VTDKSQMKPKVTESTQSGLGS---PNGPFSQVPSPPTSP-S 140
Query: 125 AENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSN 184
E D+ +L AAA E A++K + N D + + NV Y
Sbjct: 141 REEDSLKVLSAAAGEVAKIKKA-------NFDAKPISYPNPNPNYLTSFPQNVAY----- 200
Query: 185 QNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNAS 244
NC W +PH+ P+ Q
Sbjct: 201 YNC----------------------------------YWLWEPHY-PQSQM--------- 260
Query: 245 RNGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSEC-RQ 304
G+ P+ W PP S +R + + VK S+ TGVFLPR+Y NPS+ ++
Sbjct: 261 ---GIVPNAWHIPP---------SPVRAFYTLPTAVKSPSTGTGVFLPRKYSNPSDSPKK 293
Query: 305 KQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQEHHLP 364
K G V+ + K I+ C P S + LS + + G + E L
Sbjct: 321 KSGDGCVKVVNQQKPKIEVLPVRC-KPNSKAGLSTGRSKI----DYVAGGGCLKHEKPLL 293
Query: 365 QEWTY 368
QEW Y
Sbjct: 381 QEWMY 293
BLAST of HG10018445 vs. TAIR 10
Match:
AT2G39870.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55690.1); Has 73 Blast hits to 71 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 2; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 86.3 bits (212), Expect = 5.7e-17
Identity = 98/325 (30.15%), Postives = 133/325 (40.92%), Query Frame = 0
Query: 8 PLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTS 67
P L FP EFPY+FDS + SP +S T++ D D++DF GL ++LA
Sbjct: 26 PTRLGFPNEFPYEFDSPSFSPGFTSPGDS-----TETEDESSDDEEDFLAGLTRRLA--- 85
Query: 68 LCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAEND 127
T++ SP F EK+ V A SPQSTLSG+ ++ S S L SPP D
Sbjct: 86 -PSTQRLPSPLFKSE--EKRQVAATSPQSTLSGLGSFSNSGSRSPILPSPPAPTSSFRRD 145
Query: 128 -ARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNC 187
A ++ AAA E ARLK+ G H +P ++ + L QN
Sbjct: 146 NAWDVISAAAGEVARLKL-GSYEPHH---------------LPLQTPES----LLRRQNA 205
Query: 188 ARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRNG 247
A + +L Q + W SAQ +K R VVN
Sbjct: 206 A-----------IHAELQHQRLIEQMW-------LCSAQSRFKLSENRIPRRVVNEE--- 265
Query: 248 GLYPSP---------WLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNP 307
GL+ +P WLPP Q P +KR S+ TGVFLPRRY
Sbjct: 266 GLFENPRYVRRNNPTWLPPQQAAAP----------------LKRPSAGTGVFLPRRY--- 270
Query: 308 SECRQKQGSPAVRFPEEMKSPIQAP 323
P+ + +K+P+ P
Sbjct: 326 ---------PSAAPSDSLKTPVNTP 270
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038884388.1 | 1.5e-176 | 85.22 | uncharacterized protein LOC120075245 isoform X1 [Benincasa hispida] | [more] |
XP_038884389.1 | 1.1e-174 | 84.95 | uncharacterized protein LOC120075245 isoform X2 [Benincasa hispida] | [more] |
XP_008441420.1 | 2.4e-166 | 83.52 | PREDICTED: uncharacterized protein LOC103485542 [Cucumis melo] >KAA0041485.1 unc... | [more] |
XP_004138440.2 | 9.3e-163 | 81.87 | uncharacterized protein LOC101208139 [Cucumis sativus] >KGN45759.1 hypothetical ... | [more] |
XP_022134429.1 | 7.7e-149 | 75.69 | uncharacterized protein LOC111006679 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7TJ83 | 1.1e-166 | 83.52 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3B425 | 1.1e-166 | 83.52 | uncharacterized protein LOC103485542 OS=Cucumis melo OX=3656 GN=LOC103485542 PE=... | [more] |
A0A0A0KA94 | 4.5e-163 | 81.87 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G009380 PE=4 SV=1 | [more] |
A0A6J1BYQ6 | 3.7e-149 | 75.69 | uncharacterized protein LOC111006679 OS=Momordica charantia OX=3673 GN=LOC111006... | [more] |
A0A6J1JZ67 | 3.3e-137 | 70.86 | uncharacterized protein LOC111489193 OS=Cucurbita maxima OX=3661 GN=LOC111489193... | [more] |
Match Name | E-value | Identity | Description | |
AT3G55690.1 | 1.1e-17 | 28.49 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT2G39870.1 | 5.7e-17 | 30.15 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |