11  Stochastic equations

11.1 Introduction to the stochastic simulation

The problem of identifiability is basic to all statistical methods and data analysis and it occurs in diverse areas such as reliability theory, survival analysis, econometrics, etc., where stochastic modelling is widely used (Prakasa Rao 1992).

The module Stochastic of the Expert frame has been implemented for launching multiple simulations depending of the object’s selected parameters. This module uses probability distributions to calculate the simulation parameters. The defaults results are two CSV files, one of them is a statistics series of all the simulations and the other is a parameter series with a selected distribution. The user can save all the simulation in a single file when the process is finished.

11.2 Probability distributions

A probability distribution assigns a probability to each measurable subset of the possible outcomes of a random experiment. The predefined distributions are uniform, normal, log normal, exponential and finally it is possible use a customized distribution (Figure 11.1).

Figure 11.1: Probability distributions of the stochastic simulations module

For define a correct probability distribution, the area under the probability density function must be equal to 1. For other hand, cumulative distribution function is bounded between 0 and 1 and also it must be non-decreasing and right-continuous.

Uniform

The continuous uniform distribution or rectangular distribution is a family of symmetric probability distributions such that for each member of the family, all intervals of the same length on the distribution’s support are equally probable. Figure 11.2 shows an example with a bounded \(X\) between [-5, 5].

Figure 11.2: Uniform distribution. Probability density function (left), cumulative distribution function (right)

Normal

The normal distribution (or Gaussian distribution) is a very common continuous probability distribution. Normal distributions are important in statistics and are often used to represent distributions are not known. Several characterizations of the univariate and the multivariate normal distribution are known (Kagan and Shalaevsky 1967; Prakasa Rao 1992). The normal distribution is sometimes informally called the bell curve. The following Figure 11.3 shows an example with a bounded \(X\) axis between [-5,5], where is more probable to find an \(X\) value.

Figure 11.3: Normal distribution. Probability density function (left), cumulative distribution function (right)

Probability density function:

\[ \varphi = f(x\ |\ \mu,\ \sigma) = \frac{1}{\sigma\sqrt{2\pi}} \cdot e^{- \frac{(x - \mu)^{2}}{2\sigma^{2}}} \tag{11.1}\]

with \(\mu\): mean or expectation of the distribution (and also its median and mode); \(\sigma^{2}\): variance; \(\sigma\): standard deviation.

Cumulative distribution function:

\[ \phi = f(x\ |\ \mu,\ \sigma) = \frac{1}{2}\cdot \Bigg( 1 + \frac{1}{\sqrt{\pi}}\int_{- \Big( \frac{x - \mu}{\sigma\sqrt{2}} \Big)}^{\Big( \frac{x - \mu}{\sigma\sqrt{2}} \Big)}e^{{- t}^{2}} \mathrm{d}t \Bigg) \tag{11.2}\]

The normal distribution is useful because of the central limit theorem. In its most general form, under some conditions, it states that averages of random variables independently drawn from independent distributions converge in distribution to the normal, that is, become normally distributed when the number of random variables is sufficiently large.

Log normal

A log-normal distribution is a continuous probability distribution of a random variable whose logarithm is normally distributed. Consequently, if the random variable \(X\) is log-normally distributed, then \(Y = ln(X)\) has a normal distribution. Likewise, if \(Y\) has a normal distribution, then \(X = e^y\) has a log-normal distribution. A random variable which is log-normally distributed takes only positive real values (Johnson, Kotz, and Balakrishnan 1994). Figure 11.4 shows an example with a bounded \(X\) axis between [0, 2], where is more probable to find an \(X\) value.

Figure 11.4: Log normal distribution. Probability density function (left), cumulative distribution function (right)

Probability density function:

\[ \varphi = f(x\ |\ \mu,\ \sigma) = \frac{1}{x\sigma\sqrt{2\pi}}e^{- \frac{\left( \ln x - \mu \right)^{2}}{2\sigma^{2}}} \tag{11.3}\]

with \(\mu\): mean or expectation of the distribution (and also its median and mode); \(\sigma^{2}\): variance; \(\sigma\) standard deviation.

Cumulative distribution function:

\[ \phi = f(x\ |\ \mu,\ \sigma) = \frac{1}{2}\left\lbrack 1 + \frac{1}{\sqrt{\pi}}\int_{- \left( \frac{\ln x - \mu}{\sigma\sqrt{2}} \right)}^{\left( \frac{\ln x - \mu}{\sigma\sqrt{2}} \right)}e^{{- t}^{2}}dt \right\rbrack \tag{11.4}\]

A log-normal process is the statistical realization of the multiplicative product of many independent random variables, each of which is positive. This is justified by considering the central limit theorem in the log domain. The log-normal distribution is the maximum entropy probability distribution for a random variate \(X\) for which the mean and variance of \(ln(X)\) are specified (Park and Bera 2009).

Exponential

The exponential distribution describes the time between events in a Poisson point process. Is a type of random mathematical object that consists of points randomly located on a mathematical space (Chiu et al. 2013). It is a particular case of the gamma distribution. It is the continuous analogue of the geometric distribution, and it has the key property of being memoryless. Figure 11.5 shows an example with a bounded \(X\) axis between [0, 10], where is more probable to find an \(X\) value.

Figure 11.5: Exponential distribution. Probability density function (left), cumulative distribution function (right)

Probability density function:

\[ \varphi = f(x\ |\ \lambda) = \lambda e^{- \lambda x} \tag{11.5}\]

with \(\lambda\): the parameter of the distribution, often called the rate parameter (\(λ>0\)).

Cumulative distribution function:

\[ \phi = f(x\ |\ \lambda) = 1 - e^{- \lambda x} \tag{11.6}\]

User defined

For define a custom probability distribution, the user must create a text file with two columns (Figure 11.6, left). The first column is the cumulated probability and the second it is for the \(X\) value associated. The middle values are calculated with a linear interpolation method. Figure 11.6 right is the cumulative distribution for a user defined distribution.

Figure 11.6: Custom distribution. Format distribution file (left). Cumulative distribution function (right).

Figure 11.7 shows the probability density function for the user defined distribution. As same to cumulative distribution, program calculate middle point through linear interpolation.

Figure 11.7: User distribution. Probability density function.

References

Chiu, Sung Nok, Dietrich Stoyan, Wilfrid S. Kendall, and Joseph Mecke, eds. 2013. Stochastic Geometry and Its Applications. 1st ed. Wiley Series in Probability and Statistics. Wiley. https://doi.org/10.1002/9781118658222.
Johnson, N. L., S. Kotz, and N. Balakrishnan. 1994. “14: Lognormal Distributions.” In Continuous Univariate Distributions. Vol. 1. New York, USA: John Wiley & Sons.
Kagan, A., and O. Shalaevsky. 1967. “Characterization of Normal Law by a Property of the Non-Central Χ²-Distribution.” Lithuanian Mathematical Journal 7 (1): 57–58. https://doi.org/10.15388/LMJ.1967.19934.
Park, Sung Y., and Anil K. Bera. 2009. “Maximum Entropy Autoregressive Conditional Heteroskedasticity Model.” Journal of Econometrics 150 (2): 219–30. https://doi.org/10.1016/j.jeconom.2008.12.014.
Prakasa Rao, B. L. S. 1992. Identifiability in Stochastic Models : Characterization of Probability Distributions. Probability and Mathematical Statistics. Boston: Academic Press.