Batch Bayesian Optimization via Local Penalization

Javier GonzalezZhenwen DaiPhilipp HennigNeil D. Lawrence
Proceedings of the Nineteenth International Workshop on Artificial Intelligence and Statistics, PMLR 51:648-657, 2016.

Abstract

The popularity of Bayesian optimization methods for efficient exploration of parameter spaces has lead to a series of papers applying Gaussian processes as surrogates in the optimization of functions. However, most proposed approaches only allow the exploration of the parameter space to occur sequentially. Often, it is desirable to simultaneously propose batches of parameter values to explore. This is particularly the case when large parallel processing facilities are available. These could either be computational or physical facets of the process being optimized. Batch methods, however, require the modeling of the interaction between the different evaluations in the batch, which can be expensive in complex scenarios. We investigate this issue and propose a highly effective heuristic based on an estimate of the function's Lipschitz constant that captures the most important aspect of this interaction---local repulsion---at negligible computational overhead. A penalized acquisition function is used to collect batches of points minimizing the non-parallelizable computational effort. The resulting algorithm compares very well, in run-time, with much more elaborate alternatives.

Cite this Paper


BibTeX
@InProceedings{Gonzalez-batch16, title = {Batch Bayesian Optimization via Local Penalization}, author = {Gonzalez, Javier and Dai, Zhenwen and Hennig, Philipp and Lawrence, Neil D.}, booktitle = {Proceedings of the Nineteenth International Workshop on Artificial Intelligence and Statistics}, pages = {648--657}, year = {2016}, editor = {Gretton, Arthur and Robert, Cristian}, volume = {51}, address = {Cadiz, Spain}, publisher = {PMLR}, pdf = {http://jmlr.org/proceedings/papers/v51/gonzalez16a.pdf}, url = {http://inverseprobability.com/publications/gonzalez-batch16.html}, abstract = {The popularity of Bayesian optimization methods for efficient exploration of parameter spaces has lead to a series of papers applying Gaussian processes as surrogates in the optimization of functions. However, most proposed approaches only allow the exploration of the parameter space to occur sequentially. Often, it is desirable to simultaneously propose batches of parameter values to explore. This is particularly the case when large parallel processing facilities are available. These could either be computational or physical facets of the process being optimized. Batch methods, however, require the modeling of the interaction between the different evaluations in the batch, which can be expensive in complex scenarios. We investigate this issue and propose a highly effective heuristic based on an estimate of the function's Lipschitz constant that captures the most important aspect of this interaction---local repulsion---at negligible computational overhead. A penalized acquisition function is used to collect batches of points minimizing the non-parallelizable computational effort. The resulting algorithm compares very well, in run-time, with much more elaborate alternatives. } }
Endnote
%0 Conference Paper %T Batch Bayesian Optimization via Local Penalization %A Javier Gonzalez %A Zhenwen Dai %A Philipp Hennig %A Neil D. Lawrence %B Proceedings of the Nineteenth International Workshop on Artificial Intelligence and Statistics %D 2016 %E Arthur Gretton %E Cristian Robert %F Gonzalez-batch16 %I PMLR %P 648--657 %U http://inverseprobability.com/publications/gonzalez-batch16.html %V 51 %X The popularity of Bayesian optimization methods for efficient exploration of parameter spaces has lead to a series of papers applying Gaussian processes as surrogates in the optimization of functions. However, most proposed approaches only allow the exploration of the parameter space to occur sequentially. Often, it is desirable to simultaneously propose batches of parameter values to explore. This is particularly the case when large parallel processing facilities are available. These could either be computational or physical facets of the process being optimized. Batch methods, however, require the modeling of the interaction between the different evaluations in the batch, which can be expensive in complex scenarios. We investigate this issue and propose a highly effective heuristic based on an estimate of the function's Lipschitz constant that captures the most important aspect of this interaction---local repulsion---at negligible computational overhead. A penalized acquisition function is used to collect batches of points minimizing the non-parallelizable computational effort. The resulting algorithm compares very well, in run-time, with much more elaborate alternatives.
RIS
TY - CPAPER TI - Batch Bayesian Optimization via Local Penalization AU - Javier Gonzalez AU - Zhenwen Dai AU - Philipp Hennig AU - Neil D. Lawrence BT - Proceedings of the Nineteenth International Workshop on Artificial Intelligence and Statistics DA - 2016/01/01 ED - Arthur Gretton ED - Cristian Robert ID - Gonzalez-batch16 PB - PMLR VL - 51 SP - 648 EP - 657 L1 - http://jmlr.org/proceedings/papers/v51/gonzalez16a.pdf UR - http://inverseprobability.com/publications/gonzalez-batch16.html AB - The popularity of Bayesian optimization methods for efficient exploration of parameter spaces has lead to a series of papers applying Gaussian processes as surrogates in the optimization of functions. However, most proposed approaches only allow the exploration of the parameter space to occur sequentially. Often, it is desirable to simultaneously propose batches of parameter values to explore. This is particularly the case when large parallel processing facilities are available. These could either be computational or physical facets of the process being optimized. Batch methods, however, require the modeling of the interaction between the different evaluations in the batch, which can be expensive in complex scenarios. We investigate this issue and propose a highly effective heuristic based on an estimate of the function's Lipschitz constant that captures the most important aspect of this interaction---local repulsion---at negligible computational overhead. A penalized acquisition function is used to collect batches of points minimizing the non-parallelizable computational effort. The resulting algorithm compares very well, in run-time, with much more elaborate alternatives. ER -
APA
Gonzalez, J., Dai, Z., Hennig, P. & Lawrence, N.D.. (2016). Batch Bayesian Optimization via Local Penalization. Proceedings of the Nineteenth International Workshop on Artificial Intelligence and Statistics 51:648-657 Available from http://inverseprobability.com/publications/gonzalez-batch16.html.

Related Material