What is the real-life benefit and application of Bayesian regression
Question
What is a real-life example of the benefit and application of Bayesian regression?
Having read the items below, it seems the benefit is having a range of inference (possible values and their likelihood) available.
What's the advantages of bayesian version of linear regression, logistic regression etc (1)
compare bayesian linear regression VS linear regression [closed] (2)
第14回 ベイズ線形回帰を実装してみよう (Part 14: Let's implement Bayesian linear regression) (3)
But how would it be utilised in real life (finance, engineering, ...)? I suppose that without real-life applications of the benefit, it would not be so useful.
It seems both linear regression and Bayesian regression can produce similar predictions, as below.
According to (3), the predictive distribution can give confidence in a prediction when it falls within the densely coloured area, because the data there is dense, but not in a sparse area; e.g. a prediction at x=5 may not be trustworthy. But I suppose there should be more to it than that.
What difference does having statistical inference make, and how is it utilised in real life? Or is it simply a matter of choice whether to use linear or Bayesian regression?
Code
Taken from (3) with a few changes. I am not completely familiar with the logic, so if there are mistakes, please correct them.
import numpy as np
import matplotlib.pyplot as plt
from numpy.random import rand

# --------------------------------------------------------------------------------
# Gaussian basis function
# --------------------------------------------------------------------------------
def gaussian(mean, sigma):
    """Return a Gaussian basis-function evaluator.

    Args:
        mean: array of basis-function centres
        sigma: common width of the basis functions
    """
    def _gaussian(x):
        return np.exp(-0.5 * ((x - mean) / sigma) ** 2)
    return _gaussian

# --------------------------------------------------------------------------------
# Design matrix row: bias term followed by the basis-function values
# --------------------------------------------------------------------------------
def phi(f, x):
    bias = np.array([1])  # bias parameter (i = 0)
    return np.append(bias, f(x))  # bias + basis

# --------------------------------------------------------------------------------
# Data generation utilities
# --------------------------------------------------------------------------------
def uniform_variable_generator(samples):
    return rand(samples)

def noise_generator(samples, mu=0.0, beta=0.1):
    # NB: `beta` here is the noise standard deviation, not the precision used below
    return np.random.normal(mu, beta, samples)

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

def generator_t_function(x):
    # return np.sin(x)
    return sigmoid(x)

def generator_X_function(x):
    return 2 * np.pi * x

# --------------------------------------------------------------------------------
# Observations
# --------------------------------------------------------------------------------
samples = 20
X = generator_X_function(uniform_variable_generator(samples))
t = generator_t_function(X) + noise_generator(samples, beta=0.1)
MAX_X = max(X)

# --------------------------------------------------------------------------------
# Gaussian basis function parameters
# --------------------------------------------------------------------------------
sigma = 0.1 * MAX_X
# centres of the Gaussian basis functions (about 11 centres, plus a bias weight)
mean = np.arange(0, MAX_X + sigma, sigma)
f = gaussian(mean, sigma)

# --------------------------------------------------------------------------------
# Design matrix and posterior over the weights
# --------------------------------------------------------------------------------
PHI = np.array([phi(f, x) for x in X])
alpha = 0.5  # prior precision: larger alpha shrinks w, preventing overfitting (alpha -> 0 recovers plain least squares)
beta = 5     # noise precision: smaller beta allows more variance (deviation)
Sigma_N = np.linalg.inv(alpha * np.identity(PHI.shape[1]) + beta * np.dot(PHI.T, PHI))
mean_N = beta * np.dot(Sigma_N, np.dot(PHI.T, t))

# --------------------------------------------------------------------------------
# Bayesian regression: posterior-mean prediction
# --------------------------------------------------------------------------------
xlist = np.arange(0, MAX_X, 0.01)
plt.title("Bayesian regression")
plt.plot(xlist, [np.dot(mean_N, phi(f, x)) for x in xlist], 'b')
plt.plot(X, t, 'o', color='r')
plt.show()

# --------------------------------------------------------------------------------
# Linear regression (ridge-regularised least squares)
# --------------------------------------------------------------------------------
l = 0.05
regularization = np.identity(PHI.shape[1])
w = np.linalg.solve(
    np.dot(PHI.T, PHI) + (l * regularization),
    np.dot(PHI.T, t)
)
plt.title("Linear regression")
plt.plot(xlist, [np.dot(w, phi(f, x)) for x in xlist], 'g')
plt.plot(X, t, 'o', color='r')
plt.show()

# --------------------------------------------------------------------------------
# Predictive distribution
# --------------------------------------------------------------------------------
def normal_dist_pdf(x, mean, var):
    return np.exp(-(x - mean) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)

def quad_form(A, x):
    return np.dot(x, np.dot(A, x))

tlist = np.arange(
    np.mean(t) - (np.max(t) - np.min(t)),
    np.mean(t) + (np.max(t) - np.min(t)),
    0.01
)
# predictive variance = observation noise (1 / beta) + parameter uncertainty (phi^T Sigma_N phi)
z = np.array([
    normal_dist_pdf(tlist, np.dot(mean_N, phi(f, x)),
                    1 / beta + quad_form(Sigma_N, phi(f, x))) for x in xlist
]).T
plt.contourf(xlist, tlist, z, 5, cmap=plt.cm.coolwarm)
plt.title("Predictive distribution")
plt.plot(xlist, [np.dot(mean_N, phi(f, x)) for x in xlist], 'r')
plt.plot(X, t, 'go')
plt.show()
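For reference, the quantities computed above are the standard conjugate Bayesian linear regression results: the posterior over the weights has covariance $S_N = (\alpha I + \beta \Phi^T \Phi)^{-1}$ and mean $m_N = \beta S_N \Phi^T \mathbf{t}$, and the predictive variance at a point $x$ is $\sigma_N^2(x) = 1/\beta + \phi(x)^T S_N \phi(x)$. If I understand correctly, this is what makes the predictive distribution actionable: one can read off an interval at any $x$. A sketch using the variables above (the 1.96 factor assumes the normal predictive distribution of this conjugate model):

x_new = 2.0
mu = np.dot(mean_N, phi(f, x_new))                  # predictive mean at x_new
var = 1 / beta + quad_form(Sigma_N, phi(f, x_new))  # predictive variance at x_new
lo, hi = mu - 1.96 * np.sqrt(var), mu + 1.96 * np.sqrt(var)
print(f"95% predictive interval at x={x_new}: [{lo:.2f}, {hi:.2f}]")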
Tags: regression, bayesian

asked 7 hours ago by mon, edited 4 hours ago
Comments

Starting from the first image in your question you have lost me... How do the images and the attached Python code relate to the question about general use?
– Frans Rodenburg, 6 hours ago

Related/duplicate: stats.stackexchange.com/q/8347/176202
– Frans Rodenburg, 6 hours ago

@FransRodenburg, thanks for the follow-up. I intended to show that linear and Bayesian regression can produce similar results, as far as I experimented in the attached code, which makes me think there may not be much benefit; the only difference I saw was the density plot. I suppose this density is what makes the difference, so I wonder whether this density is utilised in practice and, if so, how.
– mon, 6 hours ago

A related question: stats.stackexchange.com/q/252577/103153
– Lerner Zhang, 5 hours ago
1 Answer
We use a Bayesian model in forecasting retail sales with the SAP Unified Demand Forecast (UDF). The Bayesian approach offers two advantages:
We can use priors. For instance, suppose your supermarket stocks a new item and we would like to forecast its first Christmas sales. We can simply use the average effect of similar items as a prior for the new item's Christmas effect. This has the added benefit that the prior gets automatically updated once we have seen the new item's first Christmas sales, so both the prior and the item's own data influence the prediction for the second Christmas (see the sketch after this list).
The priors regularize. Our model is heavily over-parameterized, with seasonality, day of week, trend, holidays and tons of promotion predictors, so regularization is hugely important to keep the predictions under control. (Note that the pictures you show in your question do not show such an overparameterized model.)
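To make the first aspect concrete, here is a minimal sketch of this kind of prior updating in a conjugate normal-normal model. The numbers and the setting are invented for illustration; this is not the actual UDF implementation:

# Hypothetical prior for a new item's Christmas uplift, taken from the
# average uplift of similar items (illustrative numbers only).
prior_mean, prior_var = 1.30, 0.04  # similar items sell ~30% more at Christmas
noise_var = 0.09                    # assumed observation noise variance

# The new item's own observed uplift after its first Christmas:
observed_uplift = 1.55

# Conjugate normal-normal update: the posterior precision is the sum of the
# precisions, and the posterior mean is the precision-weighted average.
post_var = 1 / (1 / prior_var + 1 / noise_var)
post_mean = post_var * (prior_mean / prior_var + observed_uplift / noise_var)

print(f"Second-Christmas belief: uplift ~ N({post_mean:.2f}, {post_var:.3f})")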
Aspect 1 is more important to our marketers and business users; aspect 2 is more important to me and to statistician or data-scientist users.
Yes, there are ways to address both issues without a Bayesian model, e.g., by pooling for aspect 1, and the Lasso or Elastic Net for aspect 2. We just chose the Bayesian approach, and I find the updating particularly elegant.
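As a concrete check on the regularization point: with a Gaussian prior, the Bayesian posterior mean coincides exactly with ridge regression with penalty $\alpha/\beta$, which you can verify against the code in the question (a sketch with a random design matrix; alpha and beta as in the question's code):

import numpy as np

rng = np.random.default_rng(0)
PHI = rng.normal(size=(20, 11))  # any design matrix
t = rng.normal(size=20)
alpha, beta = 0.5, 5.0           # prior precision and noise precision

# Bayesian posterior mean of the weights ...
Sigma_N = np.linalg.inv(alpha * np.identity(11) + beta * PHI.T @ PHI)
mean_N = beta * Sigma_N @ (PHI.T @ t)

# ... equals ridge regression with penalty lambda = alpha / beta
w_ridge = np.linalg.solve(PHI.T @ PHI + (alpha / beta) * np.identity(11), PHI.T @ t)
print(np.allclose(mean_N, w_ridge))  # True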
So, yes, Bayesian regression is used in real life; it has paid my salary for the last couple of years, along with the salaries of a number of my colleagues, all while making sure your supermarket does not run out of your favorite shampoo.
answered 2 hours ago by Stephan Kolassa