# Gaussian Process Modelling of Latent Chemical Species: Applications to Inferring Transcription Factor Activities

Pei Gao, Peking University
Antti Honkela, University of Helsinki
Magnus Rattray, University of Manchester
Neil D. Lawrence, University of Sheffield

Bioinformatics 24, pp 0-0

#### Abstract

Motivation: Inference of latent chemical species in biochemical interaction networks is a key problem in estimation of the structure and parameters of the genetic, metabolic and protein interaction networks that underpin all biological processes. We present a framework for Bayesian marginalisation of these latent chemical species through Gaussian process priors.\ \ Results: We demonstrate our general approach on three different biological examples of single input motifs, including both activation and repression of transcription. We focus in particular on the problem of inferring transcription factor activity when the concentration of active protein cannot easily be measured. We show how the uncertainty in the inferred transcription factor activity can be integrated out in order to derive a likelihood function that can be used for the estimation of regulatory model parameters. An advantage of our approach is that we avoid the use of a coarse-grained discretization of continuous-time functions, which would lead to a large number of additional parameters to be estimated. We develop efficient exact and approximate inference schemes, which are much more efficient than competing sampling-based schemes and therefore provide us with a practical toolkit for model-based inference.\ \ Availability: The software and data for recreating all the experiments in this paper is available in MATLAB from http://inverseprobability.com/gpsim\ \ Contact: Neil Lawrence

  @Article{gao-latent08, title = {Gaussian Process Modelling of Latent Chemical Species: Applications to Inferring Transcription Factor Activities}, journal = {Bioinformatics}, author = {Pei Gao and Antti Honkela and Magnus Rattray and Neil D. Lawrence}, pages = {0}, year = {2008}, volume = {24}, month = {00}, edit = {https://github.com/lawrennd//publications/edit/gh-pages/_posts/2008-01-01-gao-latent08.md}, url = {http://inverseprobability.com/publications/gao-latent08.html}, abstract = {**Motivation:** Inference of *latent chemical species* in biochemical interaction networks is a key problem in estimation of the structure and parameters of the genetic, metabolic and protein interaction networks that underpin all biological processes. We present a framework for Bayesian marginalisation of these latent chemical species through Gaussian process priors.\ \ **Results:** We demonstrate our general approach on three different biological examples of single input motifs, including both activation and repression of transcription. We focus in particular on the problem of inferring transcription factor activity when the concentration of active protein cannot easily be measured. We show how the uncertainty in the inferred transcription factor activity can be integrated out in order to derive a likelihood function that can be used for the estimation of regulatory model parameters. An advantage of our approach is that we avoid the use of a coarse-grained discretization of continuous-time functions, which would lead to a large number of additional parameters to be estimated. We develop efficient exact and approximate inference schemes, which are much more efficient than competing sampling-based schemes and therefore provide us with a practical toolkit for model-based inference.\ \ **Availability:** The software and data for recreating all the experiments in this paper is available in MATLAB from \ \ **Contact:** Neil Lawrence}, key = {Gao-latent08}, doi = {10.1093/bioinformatics/btn278}, linkpdf = {http://bioinformatics.oxfordjournals.org/cgi/reprint/24/16/i70.pdf?ijkey=FauSn114lAUC1Ey&keytype=ref}, linksoftware = {http://inverseprobability.com/gpsim/}, group = {gene networks, TFA, gp} }
 %T Gaussian Process Modelling of Latent Chemical Species: Applications to Inferring Transcription Factor Activities %A Pei Gao and Antti Honkela and Magnus Rattray and Neil D. Lawrence %B %C Bioinformatics %D %F gao-latent08 %J Bioinformatics %P 0--0 %R 10.1093/bioinformatics/btn278 %U http://inverseprobability.com/publications/gao-latent08.html %V 24 %X **Motivation:** Inference of *latent chemical species* in biochemical interaction networks is a key problem in estimation of the structure and parameters of the genetic, metabolic and protein interaction networks that underpin all biological processes. We present a framework for Bayesian marginalisation of these latent chemical species through Gaussian process priors.\ \ **Results:** We demonstrate our general approach on three different biological examples of single input motifs, including both activation and repression of transcription. We focus in particular on the problem of inferring transcription factor activity when the concentration of active protein cannot easily be measured. We show how the uncertainty in the inferred transcription factor activity can be integrated out in order to derive a likelihood function that can be used for the estimation of regulatory model parameters. An advantage of our approach is that we avoid the use of a coarse-grained discretization of continuous-time functions, which would lead to a large number of additional parameters to be estimated. We develop efficient exact and approximate inference schemes, which are much more efficient than competing sampling-based schemes and therefore provide us with a practical toolkit for model-based inference.\ \ **Availability:** The software and data for recreating all the experiments in this paper is available in MATLAB from \ \ **Contact:** Neil Lawrence 
 TY - CPAPER TI - Gaussian Process Modelling of Latent Chemical Species: Applications to Inferring Transcription Factor Activities AU - Pei Gao AU - Antti Honkela AU - Magnus Rattray AU - Neil D. Lawrence PY - 2008/01/01 DA - 2008/01/01 ID - gao-latent08 SP - 0 EP - 0 DO - 10.1093/bioinformatics/btn278 L1 - http://bioinformatics.oxfordjournals.org/cgi/reprint/24/16/i70.pdf?ijkey=FauSn114lAUC1Ey&keytype=ref UR - http://inverseprobability.com/publications/gao-latent08.html AB - **Motivation:** Inference of *latent chemical species* in biochemical interaction networks is a key problem in estimation of the structure and parameters of the genetic, metabolic and protein interaction networks that underpin all biological processes. We present a framework for Bayesian marginalisation of these latent chemical species through Gaussian process priors.\ \ **Results:** We demonstrate our general approach on three different biological examples of single input motifs, including both activation and repression of transcription. We focus in particular on the problem of inferring transcription factor activity when the concentration of active protein cannot easily be measured. We show how the uncertainty in the inferred transcription factor activity can be integrated out in order to derive a likelihood function that can be used for the estimation of regulatory model parameters. An advantage of our approach is that we avoid the use of a coarse-grained discretization of continuous-time functions, which would lead to a large number of additional parameters to be estimated. We develop efficient exact and approximate inference schemes, which are much more efficient than competing sampling-based schemes and therefore provide us with a practical toolkit for model-based inference.\ \ **Availability:** The software and data for recreating all the experiments in this paper is available in MATLAB from \ \ **Contact:** Neil Lawrence ER - 
 Gao, P., Honkela, A., Rattray, M. & Lawrence, N.D.. (2008). Gaussian Process Modelling of Latent Chemical Species: Applications to Inferring Transcription Factor Activities. Bioinformatics 24:0-0