Probability density function

Box plot and probability density function of a normal distribution N(0, σ²).

In probability theory, the probability density function, density function, or simply density of a continuous random variable describes the relative likelihood that the random variable takes on a given value.
The probability that the random variable falls within a particular region of its range is given by the integral of the density of this variable between the limits of that region.
The probability density function (PDF) is non-negative on its entire domain, and its integral over the entire space equals one.

Definition

A probability density function characterizes the probable behavior of a population in that it specifies the relative likelihood that a continuous random variable X takes a value close to x.

A random variable X has density function f_X, where f_X is a non-negative Lebesgue-integrable function, if:

\operatorname{P}[a \leq X \leq b] = \int_a^b f_X(x)\,dx

If F_X is the distribution function of X, then

F_X(x) = \int_{-\infty}^{x} f_X(u)\,du

and (if f_X is continuous at x)

f_X(x) = \frac{d}{dx} F_X(x)

Intuitively, f_X(x) dx can be thought of as the probability that X falls in the infinitesimal interval [x, x + dx].
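
To make these relations concrete, here is a minimal numerical sketch (not part of the original article), assuming an exponential distribution with rate 1 and SciPy's standard pdf, cdf, and quadrature routines: the interval probability equals the integral of the density, and the density equals the derivative of the distribution function.

    # Sketch: P[a <= X <= b] = ∫_a^b f_X(x) dx and f_X = dF_X/dx,
    # illustrated with an exponential random variable (rate 1).
    from scipy import stats
    from scipy.integrate import quad

    f = stats.expon.pdf          # density f_X (assumed distribution)
    F = stats.expon.cdf          # distribution function F_X

    a, b = 0.5, 2.0
    integral, _ = quad(f, a, b)                   # ∫_a^b f_X(x) dx
    print(integral, F(b) - F(a))                  # both ≈ 0.4712

    x, h = 1.0, 1e-6
    print((F(x + h) - F(x - h)) / (2 * h), f(x))  # dF_X/dx at x = 1 ≈ f_X(1) ≈ 0.3679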

Formal definition

The formal definition of the density function requires concepts from measure theory.

A continuous random variable X with values in a measurable space (𝒳, 𝒜) (usually ℝⁿ with the Borel sets as measurable subsets) has as its probability distribution the pushforward measure X_*P on (𝒳, 𝒜); the density of X with respect to a reference measure μ on (𝒳, 𝒜) is the Radon–Nikodym derivative

f = \frac{dX_*P}{d\mu}.

That is, f is a measurable function with the following property:

\operatorname{P}[X \in A] = \int_{X^{-1}A} dP = \int_A f\,d\mu

for every measurable set A ∈ 𝒜.
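
A numerical illustration of this identity (the choice of X as a standard normal, μ as the Lebesgue measure on ℝ, and A = [0, 1] ∪ [2, 3] is an assumption made here for the sketch, not something stated in the article):

    # Sketch: P[X ∈ A] = ∫_A f dμ for A = [0, 1] ∪ [2, 3],
    # with X standard normal and μ the Lebesgue measure on R.
    from scipy import stats
    from scipy.integrate import quad

    f = stats.norm.pdf
    F = stats.norm.cdf

    # Right-hand side: integrate the density over each piece of A.
    rhs = quad(f, 0, 1)[0] + quad(f, 2, 3)[0]
    # Left-hand side: the probability of A, read off the distribution function.
    lhs = (F(1) - F(0)) + (F(3) - F(2))
    print(lhs, rhs)   # both ≈ 0.3627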

Properties

From the definition of the density function, the following properties of the PDF follow:

  • f_X(x) ≥ 0 for all x.
  • The total area enclosed under the curve is equal to 1:
\int_{-\infty}^{\infty} f_X(x)\,dx = 1
  • The probability that X takes a value in the interval [a, b] is the area under the curve of the density function over that interval, or equivalently, the definite integral over that interval. The graph of f(x) is sometimes known as the density curve.
\operatorname{P}[a \leq X \leq b] = \int_a^b f_X(x)\,dx = F(b) - F(a)

Some PDFs are defined over the whole range from −∞ to +∞, like that of the normal distribution.
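
The following sketch checks these properties numerically for the standard normal density (the choice of distribution and of the evaluation grid is illustrative only):

    # Sketch: non-negativity, unit total area, and interval probability
    # for the standard normal PDF.
    import numpy as np
    from scipy import stats
    from scipy.integrate import quad

    f = stats.norm.pdf

    xs = np.linspace(-10, 10, 1001)
    print(bool(np.all(f(xs) >= 0)))         # True: f_X(x) >= 0 on the grid

    total, _ = quad(f, -np.inf, np.inf)
    print(total)                            # ≈ 1.0: total area under the curve

    a, b = -1.0, 1.0
    print(quad(f, a, b)[0], stats.norm.cdf(b) - stats.norm.cdf(a))  # both ≈ 0.6827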

Densities associated with multiple variables

For continuous random variables X_1, X_2, …, X_n it is also possible to define a probability density function, called in this case the joint density function. It is defined as a function of n variables such that, for any domain D in the n-dimensional space of the values of X_1, X_2, …, X_n, the probability that the variables take values within D is

\operatorname{P}[(X_1, \dots, X_n) \in D] = \int \cdots \int_D f_{X_1,\dots,X_n}(x_1, \dots, x_n)\,dx_1 \cdots dx_n

If F(x_1, …, x_n) = P[X_1 ≤ x_1, …, X_n ≤ x_n] is the distribution function of the vector (X_1, X_2, …, X_n), then the joint density function can be obtained as the partial derivative

f(x_1, \dots, x_n) = \frac{\partial^n F}{\partial x_1 \cdots \partial x_n}
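
A minimal sketch of the joint-density formula, assuming (purely for illustration) two independent standard normal variables and the rectangular domain D = [0, 1] × [0, 2]:

    # Sketch: P[(X1, X2) ∈ D] as a double integral of the joint density
    # f(x1, x2) = φ(x1) φ(x2) over D = [0, 1] × [0, 2].
    from scipy import stats
    from scipy.integrate import dblquad

    def joint(x2, x1):                       # dblquad expects f(y, x) argument order
        return stats.norm.pdf(x1) * stats.norm.pdf(x2)

    prob, _ = dblquad(joint, 0, 1, lambda x1: 0, lambda x1: 2)
    # Independence lets us cross-check against one-dimensional distribution functions.
    check = (stats.norm.cdf(1) - stats.norm.cdf(0)) * (stats.norm.cdf(2) - stats.norm.cdf(0))
    print(prob, check)                       # both ≈ 0.1629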

Marginal density

For i = 1, …, n, let f_{X_i}(x_i) be the density function associated with the variable X_i; this function is called the marginal density function and can be obtained from the joint density function of the variables X_1, X_2, …, X_n as

f_{X_i}(x_i) = \underbrace{\int \cdots \int}_{n-1} f_{X_1,\dots,X_n}(x_1, \dots, x_n)\,dx_1 \cdots dx_{i-1}\,dx_{i+1} \cdots dx_n
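
As a sketch of this recipe, assume the illustrative joint density f(x, y) = x + y on the unit square (a valid density whose marginal in x is x + 1/2); the marginal is obtained by integrating out the other variable:

    # Sketch: marginal density obtained by integrating out one variable
    # of the joint density f(x, y) = x + y on [0, 1]^2.
    from scipy.integrate import quad

    def joint(x, y):
        return x + y if (0 <= x <= 1 and 0 <= y <= 1) else 0.0

    def marginal_x(x):
        # f_X(x) = ∫ f(x, y) dy over the support of y
        return quad(lambda y: joint(x, y), 0, 1)[0]

    for x in (0.25, 0.5, 0.75):
        print(marginal_x(x), x + 0.5)        # numerical marginal vs. exact x + 1/2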

Sum of independent random variables

The density function of the sum of two independent random variables U and V, each of which has a density function, is the convolution of their density functions:

f_{U+V}(x) = \int_{-\infty}^{\infty} f_U(y)\, f_V(x - y)\,dy = (f_U * f_V)(x)

The previous result can be generalized to the sum of N independent random variables U_1, …, U_N, each with a density:

f_{U_1 + \cdots + U_N}(x) = (f_{U_1} * \cdots * f_{U_N})(x)
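
A sketch of the convolution formula, assuming U and V are independent standard normal variables, so that the convolution of their densities should reproduce the N(0, 2) density:

    # Sketch: density of U + V via discrete convolution of the densities
    # of U, V ~ N(0, 1), compared with the exact N(0, 2) density.
    import numpy as np
    from scipy import stats

    x = np.linspace(-10, 10, 2001)                 # symmetric grid, step 0.01
    dx = x[1] - x[0]
    fU = stats.norm.pdf(x)
    fV = stats.norm.pdf(x)

    conv = np.convolve(fU, fV, mode="same") * dx   # discretized convolution integral
    exact = stats.norm.pdf(x, scale=np.sqrt(2))    # density of N(0, 2)
    print(np.max(np.abs(conv - exact)))            # close to 0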

Example

Suppose bacteria of a certain species typically live 4 to 6 hours. The probability that a bacterium lives exactly 5 hours is equal to zero. Many bacteria live for approximately 5 hours, but there is no chance that any given bacterium dies at exactly 5.00... hours. However, the probability that the bacterium dies between 5 hours and 5.01 hours is quantifiable. Suppose the answer is 0.02 (i.e., 2%). Then, the probability that the bacterium dies between 5 hours and 5.001 hours should be about 0.002, since this time interval is one-tenth as long as the previous one. The probability that the bacterium dies between 5 hours and 5.0001 hours should be about 0.0002, and so on.

In this example, the ratio (probability of dying during an interval) / (duration of the interval) is approximately constant and equal to 2 per hour (or 2 hour⁻¹). For example, there is a 0.02 probability of dying in the 0.01-hour interval between 5 and 5.01 hours, and (0.02 probability / 0.01 hours) = 2 hour⁻¹. This quantity 2 hour⁻¹ is called the probability density of dying at around 5 hours. Therefore, the probability that the bacterium dies at around 5 hours can be written as (2 hour⁻¹) dt. This is the probability that the bacterium dies within an infinitesimal window of time around 5 hours, where dt is the duration of this window. For example, the probability that it lives longer than 5 hours but less than (5 hours + 1 nanosecond) is (2 hour⁻¹) × (1 nanosecond) ≈ 6×10⁻¹³ (using the unit conversion 3.6×10¹² nanoseconds = 1 hour).

There is a probability density function f with f(5 hours) = 2 hour⁻¹. The integral of f over any time window (not only infinitesimal windows but also large ones) is the probability that the bacterium dies in that window.
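
A tiny sketch of the arithmetic in this example (the constant density of 2 hour⁻¹ is taken from the text above; the code itself is only illustrative):

    # Sketch: constant probability density of 2 per hour around t = 5 hours.
    density = 2.0                     # hour^-1, i.e. f(5 hours)

    print(density * 0.01)             # P(die in [5, 5.01] h)   ≈ 0.02
    print(density * 0.001)            # P(die in [5, 5.001] h)  ≈ 0.002
    one_ns = 1 / 3.6e12               # one nanosecond, in hours
    print(density * one_ns)           # ≈ 5.6e-13, matching the ≈ 6e-13 above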

Probability density function of the normal distribution.

