Questions tagged [gaussian-process]

Gaussian processes refer to stochastic processes whose realization consists of normally distributed random variables, with the additional property that any finite collection of these random variables have a multivariate normal distribution. The machinery of Gaussian processes can be employed in regression and classification problems.

Overview

Gaussian processes refer to stochastic processes whose realization consists of normally distributed random variables, with the additional property that any finite collection of these random variables have a multivariate normal distribution. The machinery of Gaussian processes can be employed in regression and classification problems.

Formulation

We consider a Gaussian process as an infinite version of a multivariate Gaussian distribution and has two parameters:

  1. mean function $\mu$ corresponding to the mean vector
  2. positive definite covariance or kernel function $K$ corresponding to a positive definite covariance matrix

Consider $x_1,...x_n$ as a finite collection of points in $\mathcal{X}$. For a Gaussian process over function $f \in \mathcal{H}$ the probability density of $\mathbf{f} = \\{f(x_1),...,f(x_n)\\}^T$ is a multivariate normal:

$$\mathbf{f} \sim MVN({\bf \mu}, {\bf \Sigma})$$

with:

  • mean vector $\mathbf{\mu} = \\{\mu(x_1),...,\mu(x_n)\\}^T$
  • covariance $\mathbf{\Sigma}_{ij} = K(x_i,x_j)$
  • $\mu(x) = E[f(x)]$
  • $K(x_i,x_j) = E[(f(x_i) - \mu(x_i))(f(x_j) - \mu(x_j)]$

Software Packages

  • mlegp, GPfit and CGP for R.
  • GPy, pygp and gaussian_processes for Python

References

1073 questions
43
votes
3 answers

Why is the mean function in Gaussian Process uninteresting?

I have just started reading about GPs and analogous to the regular Gaussian distribution it is characterized by a mean function and the covariance function or the kernel. I was at a talk and the speaker said that the mean function is usually quite…
Luca
  • 4,650
18
votes
3 answers

Confusion related to difference of kriging and gaussian processes

I am having a hard time understanding what is the difference between kriging and gaussian processes. I mean wiki says they are the same but their formulas for prediction are so different. I am a bit confused why they are called similar.…
user34790
  • 6,757
  • 10
  • 46
  • 69
16
votes
1 answer

Gaussian process : function approximation properties

I'm learning about Gaussian Process and have heard only bits and pieces. Would really appreciate comments and answers. For any set of data, is it true that a Gaussian Process function approximation would give zero or negligible fitting error at the…
oalah
  • 297
15
votes
2 answers

Gaussian processes benefits

I have this confusion related to the benefits of Gaussian processes. I mean comparing it to simple linear regression, where we have defined that the linear function models the data. However, in Gaussian processes we define the distribution of the…
user34790
  • 6,757
  • 10
  • 46
  • 69
13
votes
3 answers

Main advantages of Gaussian process models

The Gaussian process has been widely used, especially in emulation. It is known that the computational demand is high ($0(n^3)$). What makes them popular? What are their main and hidden advantages? Why are they used instead of parametric models…
Wis
  • 2,164
11
votes
1 answer

Criteria for choosing a mean function for a GP

When choosing a covariance function for a Gaussian Process, there are several criteria one can use to choose a class of covariance functions, for example how 'much smoothness' we want, whether we want some periodicity, etc. However, I cannot find…
8
votes
1 answer

derivation of predictive distribution of Gaussian Process

There is a duplicate, and the reason why I still ask this question is that, the answer to that duplicate doesn't answer the question well. The Gaussian Process prior is $$u\sim GP(0,k(x,x'))$$ I tend to write it this way, $$p(u)=N(0,K)$$ The…
avocado
  • 3,581
  • 6
  • 35
  • 49
8
votes
4 answers

Understanding the predictive distribution in gaussian linear regression

I'm reading through the Gaussian Process book http://www.gaussianprocess.org/gpml/chapters/RW2.pdf and there's one section here I don't quite understand (page 11). The author says: "the predictive distribution is given by averaging the output of…
victor
  • 211
8
votes
2 answers

Gaussian processes with finite sampling area

I apologize in advance if this question is poorly-posed: I'm an astronomer, not a statistician. My question is specifically aimed to help me figure out whether Gaussian processes is an appropriate technique for my problem. Using a telescope and a…
8
votes
1 answer

Is removing duplicate data necessary for Gaussian Process Regression (GPR)?

I will consider Non-Noisy Observations i.e. $y=f(x)$ Lets say we have the following data set of 5 training examples with one of the examples duplicated $(1,2,3,4,4)$ maps to $(2,4,6,8,8)$. Since for GPR we have to invert a Kernel Matrix and a…
Buna
  • 153
7
votes
2 answers

How to implement Gaussian process using GPML toolbox with known output noise?

I want to implement a simple regression model using Gaussian process. I chose GPML toolbox of Rasmussen for simplicity. My question is how we can let the toolbox know that we already have a different known output noise for each data point. So the…
Phong Le
  • 341
6
votes
2 answers

Why Gaussian process has marginalisation/consistency property?

According to the book GPML, " ... A Gaussian process is defined as a collection of random variables. Thus, the definition automatically implies a consistency requirement, ...". Can I know why this definition automatically defines the consistency…
user10024395
  • 1
  • 2
  • 11
  • 21
5
votes
1 answer

Intuition behind variance of Gaussian Process Classification

I found this post very helpful in learning about GPC: https://krasserm.github.io/2020/11/04/gaussian-processes-classification/ Looking at the last figure in the post, I'm puzzled by the variances of the latent logit variables. I assumed the…
Glen
  • 7,250
4
votes
1 answer

Interpolation and extrapolation in Gaussian Process Regression

I'm looking for some references that have studied the behaviour of Gaussian process regression for the different settings of interpolation, and extrapolation. I've found answers(e.g. like this one) that states when extrapolating far enough from a…
4
votes
1 answer

Sigma in Gaussian Process Regression

I am studying the Gaussian Process Regression for a personal project and knowledge. I know that Sigma is the noise standard deviation used by the algorithm. Anyway, looking in Matlab library, it set the initial value equal to std(y)/sqrt(2). Why did…
youngz
  • 43
  • 4
1
2 3 4 5 6