Computationally Efficient Convolved Multiple Output Gaussian Processes

Mauricio A. Álvarez; Neil D. Lawrence

Journal of Machine Learning Research, 12:1425-1466, 2011.

Abstract

Recently there has been an increasing interest in regression methods that deal with multiple outputs. This has been motivated partly by frameworks like multitask learning, multisensor networks or structured output data. From a Gaussian processes perspective, the problem reduces to specifying an appropriate covariance function that, whilst being positive semi-definite, captures the dependencies between all the data points and across all the outputs. One approach to account for non-trivial correlations between outputs employs convolution processes. Under a latent function interpretation of the convolution transform we establish dependencies between output variables. The main drawbacks of this approach are the associated computational and storage demands. In this paper we address these issues. We present different efficient approximations for dependent output Gaussian processes constructed through the convolution formalism. We exploit the conditional independencies present naturally in the model. This leads to a form of the covariance similar in spirit to the so-called PITC and FITC approximations for a single output. We show experimental results with synthetic and real data; in particular, we show results in school exam score prediction, pollution prediction and gene expression data.
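
The construction described in the abstract can be illustrated with a small numerical sketch. The code below is not taken from the paper; it assumes a single one-dimensional latent Gaussian process u with a squared-exponential covariance, Gaussian smoothing kernels for each output, and made-up values for the sensitivities, widths, inputs and inducing inputs (S, ell, ell_u, X and Z are all hypothetical names). Under those assumptions every covariance has a closed form, and a PITC-style approximation keeps the exact covariance within each output while routing the cross-output covariance through a few inducing values of u.

```python
import numpy as np

def gauss(r, var):
    """Density of a zero-mean Gaussian with variance `var`, evaluated at r."""
    return np.exp(-0.5 * r**2 / var) / np.sqrt(2.0 * np.pi * var)

def pairwise(xa, xb, var):
    """Gaussian density of all pairwise differences xa[i] - xb[j]."""
    return gauss(xa[:, None] - xb[None, :], var)

# Hypothetical settings, chosen only for illustration.
ell_u = 0.5                       # length-scale of the latent process u
S = [1.0, 0.7]                    # sensitivity of each output to u
ell = [0.3, 0.6]                  # width of each output's smoothing kernel
X = [np.linspace(0.0, 1.0, 5),    # inputs of output 0
     np.linspace(0.0, 1.0, 4)]    # inputs of output 1
Z = np.linspace(0.0, 1.0, 3)      # inducing inputs for u
D = len(X)

# k_u(r) = exp(-r^2 / (2 ell_u^2)) equals c times a Gaussian density,
# so convolving it with Gaussian smoothing kernels just adds variances.
c = np.sqrt(2.0 * np.pi) * ell_u

def k_ff(d, e):
    """Exact covariance between outputs d and e (convolution of three Gaussians)."""
    return S[d] * S[e] * c * pairwise(X[d], X[e], ell[d]**2 + ell[e]**2 + ell_u**2)

# Full multi-output covariance, outputs stacked one after another.
K_ff = np.block([[k_ff(d, e) for e in range(D)] for d in range(D)])

# Covariance between the outputs and the inducing values u(Z), and among u(Z).
K_fu = np.vstack([S[d] * c * pairwise(X[d], Z, ell[d]**2 + ell_u**2) for d in range(D)])
K_uu = c * pairwise(Z, Z, ell_u**2) + 1e-8 * np.eye(len(Z))  # small jitter

# Low-rank part: covariance explained by the inducing values alone.
Q = K_fu @ np.linalg.solve(K_uu, K_fu.T)

# PITC-style approximation: exact blocks on the diagonal (one per output),
# low-rank covariance everywhere else.
K_pitc = Q.copy()
start = 0
for x in X:
    stop = start + len(x)
    K_pitc[start:stop, start:stop] = K_ff[start:stop, start:stop]
    start = stop

print("largest absolute error of the approximation:", np.abs(K_ff - K_pitc).max())
```

Only the small K_uu matrix and the per-output diagonal blocks need to be factorised in this form, which is where the computational and storage savings come from. Replacing each diagonal block by just its diagonal would give the FITC-style variant.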

Cite this Paper

BibTeX


@Article{Alvarez-computationally11,
  title = 	 {Computationally Efficient Convolved Multiple Output Gaussian Processes},
  author = 	 {Álvarez, Mauricio A. and Lawrence, Neil D.},
  journal =      {Journal of Machine Learning Research},
  pages = 	 {1425--1466},
  year = 	 {2011},
  volume = 	 {12},
  pdf = 	 {http://www.jmlr.org/papers/volume12/alvarez11a/alvarez11a.pdf},
  url = 	 {/publications/alvarez-computationally11.html},
  abstract = 	 {Recently there has been an increasing interest in regression methods that deal with multiple outputs. This has been motivated partly by frameworks like multitask learning, multisensor networks or structured output data. From a Gaussian processes perspective, the problem reduces to specifying an appropriate covariance function that, whilst being positive semi-definite, captures the dependencies between all the data points and across all the outputs. One approach to account for non-trivial correlations between outputs employs convolution processes. Under a latent function interpretation of the convolution transform we establish dependencies between output variables. The main drawbacks of this approach are the associated computational and storage demands. In this paper we address these issues. We present different efficient approximations for dependent output Gaussian processes constructed through the convolution formalism. We exploit the conditional independencies present naturally in the model. This leads to a form of the covariance similar in spirit to the so-called PITC and FITC approximations for a single output. We show experimental results with synthetic and real data; in particular, we show results in school exam score prediction, pollution prediction and gene expression data.}
}

Endnote

%0 Journal Article
%T Computationally Efficient Convolved Multiple Output Gaussian Processes
%A Mauricio A. Álvarez
%A Neil D. Lawrence
%J Journal of Machine Learning Research
%D 2011	
%F Alvarez-computationally11
%P 1425--1466
%U /publications/alvarez-computationally11.html
%V 12
%X Recently there has been an increasing interest in regression methods that deal with multiple outputs. This has been motivated partly by frameworks like multitask learning, multisensor networks or structured output data. From a Gaussian processes perspective, the problem reduces to specifying an appropriate covariance function that, whilst being positive semi-definite, captures the dependencies between all the data points and across all the outputs. One approach to account for non-trivial correlations between outputs employs convolution processes. Under a latent function interpretation of the convolution transform we establish dependencies between output variables. The main drawbacks of this approach are the associated computational and storage demands. In this paper we address these issues. We present different efficient approximations for dependent output Gaussian processes constructed through the convolution formalism. We exploit the conditional independencies present naturally in the model. This leads to a form of the covariance similar in spirit to the so-called PITC and FITC approximations for a single output. We show experimental results with synthetic and real data; in particular, we show results in school exam score prediction, pollution prediction and gene expression data.

RIS


TY  - JOUR
TI  - Computationally Efficient Convolved Multiple Output Gaussian Processes
AU  - Mauricio A. Álvarez
AU  - Neil D. Lawrence
DA  - 2011/05/01	
ID  - Alvarez-computationally11
VL  - 12
SP  - 1425
EP  - 1466
L1  - http://www.jmlr.org/papers/volume12/alvarez11a/alvarez11a.pdf
UR  - /publications/alvarez-computationally11.html
AB  - Recently there has been an increasing interest in regression methods that deal with multiple outputs. This has been motivated partly by frameworks like multitask learning, multisensor networks or structured output data. From a Gaussian processes perspective, the problem reduces to specifying an appropriate covariance function that, whilst being positive semi-definite, captures the dependencies between all the data points and across all the outputs. One approach to account for non-trivial correlations between outputs employs convolution processes. Under a latent function interpretation of the convolution transform we establish dependencies between output variables. The main drawbacks of this approach are the associated computational and storage demands. In this paper we address these issues. We present different efficient approximations for dependent output Gaussian processes constructed through the convolution formalism. We exploit the conditional independencies present naturally in the model. This leads to a form of the covariance similar in spirit to the so-called PITC and FITC approximations for a single output. We show experimental results with synthetic and real data; in particular, we show results in school exam score prediction, pollution prediction and gene expression data.
ER  -

APA


Álvarez, M. A. & Lawrence, N. D. (2011). Computationally Efficient Convolved Multiple Output Gaussian Processes. Journal of Machine Learning Research, 12:1425-1466. Available from /publications/alvarez-computationally11.html.