Inverse Gaussian distribution

Family of continuous probability distributions From Wikipedia, the free encyclopedia

In probability theory, the inverse Gaussian distribution (also known as the Wald distribution) is a two-parameter family of continuous probability distributions with support on .

Notation
Parameters
Support
PDF
Quick facts Notation, Parameters ...
Inverse Gaussian
Probability density function
Cumulative distribution function
Notation
Parameters
Support
PDF
CDF

where is the standard normal (standard Gaussian) distribution c.d.f.
Mean


Mode
Variance


Skewness
Excess kurtosis
MGF
CF
Close

Its probability density function is given by

for , where is the mean and is a shape parameter.[1] Either or (or more generally any combination of the form for any real ) can serve as a scale parameter, so a proper (i.e., unscaled) shape parameter would be any non-zero power of : Tweedie proposed to use the and parametrizations in addition to the standard parametrization (“Each of these forms is convenient or suggestive for some purpose.”[2]), and later on uses exclusively the parametrization.[3]

The inverse Gaussian distribution has several properties analogous to a Gaussian distribution. The name can be misleading: it is an inverse only in that, while the Gaussian describes a Brownian motion's level at a fixed time, the inverse Gaussian describes the distribution of the time a Brownian motion with positive drift takes to reach a fixed positive level. The relationship between the Gaussian and inverse Gaussian distributions is thus the same as the relationship between the binomial (number of successes for a fixed number of Bernoulli trials) and negative binomial (number of Bernoulli trials for a fixed number of successes) distributions.[4]

The y-axis reflections of the cumulant generating functions of the Gaussian and inverse Gaussian distributions are inverse of each other (i.e., the graphs of the two cumulant generating functions are reflections of each other across the line ), a property that is also shared between the binomial and negative binomial distributions (after dividing their cumulant generating functions by their respective fixed parameter).[4]

To indicate that a random variable is inverse Gaussian-distributed with mean and shape parameter we write .

Properties

Single parameter form

The probability density function (pdf) of the inverse Gaussian distribution has a single parameter form given by

In this form, the mean and variance of the distribution are equal, .

Also, the cumulative distribution function (cdf) of the single parameter inverse Gaussian distribution is related to the standard normal distribution by

where , , and the is the cdf of standard normal distribution. The variables and are related to each other by the identity .

In the single parameter form, the MGF simplifies to

An inverse Gaussian distribution in double parameter form can be transformed into a single parameter form by appropriate scaling , where .

The above paragraph can be re-written as: if , then [5]. This approach is better in the sense that it clearly shows dimensionless nature of the single parameter form (note that ). This property follows from a more general fact: if and , then [2].

The standard form of inverse Gaussian distribution is

Summation

If has an distribution for and all are independent, then

The special case shows that the inverse Gaussian distribution is infinitely divisible.

Note that

is constant for all . This is a necessary condition for the summation. Otherwise would not be Inverse Gaussian distributed.

Scaling

For any it holds that

Exponential family

The inverse Gaussian distribution is a two-parameter exponential family with natural parameters and , and natural statistics and .

For fixed, it is also a single-parameter natural exponential family distribution[6] where the base distribution has density

Indeed, with ,

is a density over the reals. Evaluating the integral, we get

Substituting makes the above expression equal to .

Relationship with Brownian motion

Example of stopped random walks with . The upper figure shows the histogram of waiting times, along with the prediction according to inverse gaussian distribution. The lower figure shows the trajectories.

Let the stochastic process be given by

where is a standard Brownian motion. That is, is a Brownian motion with drift .

Then the first passage time for a fixed level by is distributed according to an inverse-Gaussian:

i.e

(cf. Schrödinger[7] equation 19, Smoluchowski[8], equation 8, and Folks[5], equation 1).

More information Suppose that we have a Brownian motion ...
Close

When drift is zero

A common special case of the above arises when the Brownian motion has no drift. In that case, parameter tends to infinity, and the first passage time for fixed level has probability density function

(see also Bachelier[9]:74[10]:39). This is a Lévy distribution with parameters and .

Maximum likelihood

The model where

with all known, unknown and all independent has the following likelihood function:

Solving the likelihood equation yields the following maximum likelihood estimates

and are independent and

Sampling from an inverse-Gaussian distribution

The following algorithm may be used.[11]

Generate a random variate from a normal distribution with mean and standard deviation equal

Square the value

and use the relation

Generate another random variate, this time sampled from a uniform distribution between and

If then return else return

Sample code in Java:

public double inverseGaussian(double mu, double lambda) {
    Random rand = new Random();
    double v = rand.nextGaussian();  // Sample from a normal distribution with a mean of 0 and 1 standard deviation
    double y = v * v;
    double x = mu + (mu * mu * y) / (2 * lambda) - (mu / (2 * lambda)) * Math.sqrt(4 * mu * lambda * y + mu * mu * y * y);
    double test = rand.nextDouble();  // Sample from a uniform distribution between 0 and 1
    if (test <= (mu) / (mu + x))
        return x;
    else
        return (mu * mu) / x;
}
Wald distribution using Python with aid of matplotlib and NumPy

And to plot Wald distribution in Python using matplotlib and NumPy:

import matplotlib.pyplot as plt
import numpy as np

h = plt.hist(np.random.wald(3, 2, 100000), bins = 200, density = True)

plt.show()
  • If , then for any number .[1]
  • If then .
  • If for then .
  • If then .
  • If , then .[12]

The convolution of an inverse Gaussian distribution (a Wald distribution) and an exponential (an ex-Wald distribution) is used as a model for response times in psychology,[13] with visual search as one example.[14]

History

This distribution appears to have been first derived in 1900 by Louis Bachelier[9][10] as the time a stock reaches a certain price for the first time. In 1915 it was used independently by Erwin Schrödinger[7] and Marian v. Smoluchowski[8] as the time to first passage of a Brownian motion. In the field of reproduction modeling it is known as the Hadwiger function, after Hugo Hadwiger who described it in 1940.[15] Abraham Wald re-derived this distribution in 1944[16] as the limiting form of a sample in a sequential probability ratio test. The name inverse Gaussian was proposed by Maurice Tweedie in 1945.[4] Tweedie investigated this distribution in 1956[17] and 1957[2][3] and established some of its statistical properties. The distribution was extensively reviewed by Folks and Chhikara in 1978.[5]

Rated inverse Gaussian distribution

Assuming that the time intervals between occurrences of a random phenomenon follow an inverse Gaussian distribution, the probability distribution for the number of occurrences of this event within a specified time window is referred to as rated inverse Gaussian.[18] While, first and second moment of this distribution are calculated, the derivation of the moment generating function remains an open problem.

Numeric computation and software

Despite the simple formula for the probability density function, numerical probability calculations for the inverse Gaussian distribution nevertheless require special care to achieve full machine accuracy in floating point arithmetic for all parameter values.[19] Functions for the inverse Gaussian distribution are provided for the R programming language by several packages including rmutil,[20][21] SuppDists,[22] STAR,[23] invGauss,[24] LaplacesDemon,[25] and statmod.[26]

See also

References

Further reading

Related Articles

Wikiwand AI