Hypergeometric distribution

format_list_bulleted Contenido keyboard_arrow_down

ImprimirCitar

In theory of probability and statistics, the hypergeometric distribution is a discreet probability distribution related to random sampling and no replacement. Suppose you have a population of $N$ of which, $K$ belong to the category $A$ and ${displaystyle N-K}$ belong to the category $B$ . Hypergeometric distribution measures the likelihood of obtaining $x$ ( ${displaystyle 0leq xleq K}$ ) elements of the category $A$ in a sample without replacement $n$ elements of the original population.

Definition

Probability Function

A discreet random variable $X$ has a hypergeometric distribution with parameters ${displaystyle N=0,1,dots }$ , ${displaystyle K=0,1,dotsN}$ and ${displaystyle n=0,1,dotsN}$ and write ${displaystyle Xsim operatorname {HG} (N,K,n)}$ if your probability function is

{displaystyle operatorname {P} [X=x]={frac {{K choose x}{N-K choose n-x}}{N choose n}},}

for values $x$ between ${displaystyle max{0,n-N+K}}$ and ${displaystyle min{K,N-K}}$ where $N$ is the size of population, $n$ is the size of the sample extracted, $K$ is the number of elements in the original population belonging to the desired category and $x$ is the number of elements in the sample that belong to that category.

The notation

{displaystyle {b choose a}={frac {b!}{a!(b-a)!}}}

refers to the binomial coefficient, i.e. the number of possible combinations when selecting $a$ elements of a total $b$ .

Recursive formula

Yeah. ${displaystyle Xsim operatorname {HG} (N,K,n)}$ Then it can be proved that

{displaystyle {begin{aligned}operatorname {P} [X=x+1]&={frac {(K-x)(n-x)}{(x+1)(N-K-n+x-1)}};operatorname {P} [X=x]end{aligned}}}

Properties

Yeah. ${displaystyle Xsim operatorname {HG} (N,K,n)}$ then. $X$ fulfills some properties:

The expected value of the random variable $X$ That's it.

{displaystyle operatorname {E} [X]={frac {nK}{N}}}

and its variance is given by

{displaystyle operatorname {Var} [X]={frac {nK}{N}}{bigg (}{frac {N-K}{N}}{bigg)}{bigg (}{frac {N-n}{N-1}}{bigg)}}

The hypergeometric distribution is applicable to sampling without replacement and the binomial to sampling with replacement. In situations where the expected number of repetitions in the sample is presumably low, the first can be approximated by the second. This is so when N is large and the relative size of the drawn sample, n/N, is small.

Related Distributions

If a random variable ${displaystyle Xsim operatorname {HG} (N,K,1)}$ then. ${displaystyle Xsim operatorname {Bernoulli} left({frac {K}{N}}right)}$ .
Yeah. ${displaystyle Xsim operatorname {HG} (N,K,n)}$ then. ${displaystyle Xsim operatorname {Binomial} (n,p)}$ When $Ntoinfty$ and ${displaystyle Kto infty }$ in such a way that ${displaystyle K/Nto p}$ .

Contenido relacionado

Más resultados...