# Statistics for Beginners

## Statistics for Beginners in Excel – Distribution Property Functions using Real Statistics

(Basic Statistics for Citizen Data Scientist) Distribution Property Functions In the descriptions of the distributions described throughout the website, we have provided formulas for the distribution mean and variance. Real Statistics provides the following functions to carry out these calculations. Real Statistics Functions: The Real Statistics Resource Pack contains the following functions. MEAN_DIST(dist, param1, param2, …

## Statistics for Beginners in Excel – Laplace Distribution

(Basic Statistics for Citizen Data Scientist) Laplace Distribution The pdf of the Laplace distribution (aka the double exponential distribution) with location parameter μ and scale parameter β is where β > 0. The cdf is The inverse of the Laplace distribution is Key statistical properties of the Laplace distribution are shown in Figure 1.   Figure 1 – Statistical properties of …

## Statistics for Beginners in Excel – Gumbel Distribution

(Basic Statistics for Citizen Data Scientist) Gumbel Distribution The Gumbel distribution is used to model the largest value from a relatively large set of independent elements from distributions whose tails decay relatively fast, such as a normal or exponential distribution. As a result, it can be used to analyze annual maximum daily rainfall volumes. In …

## Statistics for Beginners in Excel – Logistic Distribution

(Basic Statistics for Citizen Data Scientist) Logistic Distribution The pdf of the Logistic distribution at location parameter µ and scale parameter β is where β > 0. The cdf is The inverse of the logistic distribution is The standard Gumbel distribution is the case where μ = 0 and β = 1. Key statistical properties of the Logistic distribution are shown in Figure 1.   …

## Statistics for Beginners in Excel – Weibull Distribution

(Basic Statistics for Citizen Data Scientist) Weibull Distribution Definition 1: The Weibull distribution has the probability density function (pdf) for x ≥ 0. Here β > 0 is the shape parameter and α > 0 is the scale parameter. The cumulative distribution function (cdf) is The inverse cumulative distribution function is I(p) = Observation: There is also a three-parameter version of the Weibull distribution. Observation: If x represents “time-to-failure”, the …

## Statistics for Beginners in Excel – Uniform Distribution

(Basic Statistics for Citizen Data Scientist) Uniform Distribution When you ask for a random set of say 100 numbers between 1 and 10, you are looking for a sample from a continuous uniform distribution, where α = 1 and β = 10 according to the following definition. Definition 1: The continuous uniform distribution has probability density function (pdf) given by where α and β are any parameters …

## Statistics for Beginners in Excel – Exponential Distribution

(Basic Statistics for Citizen Data Scientist) Exponential Distribution The exponential distribution can be used to determine the probability that it will take a given number of trials to arrive at the first success in a Poisson distribution; i.e. it describes the inter-arrival times in a Poisson process. It is the continuous counterpart to the geometric distribution, and …

## Statistics for Beginners in Excel – Gamma Distribution

(Basic Statistics for Citizen Data Scientist) Gamma Distribution The gamma distribution has the same relationship to the Poisson distribution that the negative binomial distribution has to the binomial distribution. We aren’t going to study the gamma distribution directly, but it is related to the exponential distribution and especially to the chi-square distribution which will receive a lot more attention in this website. Definition 1: …

## Statistics for Beginners in Excel – Two Sample Hypothesis Testing to Compare Variances

(Basic Statistics for Citizen Data Scientist) Two Sample Hypothesis Testing to Compare Variances Theorem 1 of F Distribution can be used to test whether the variances of two populations are equal, using the Excel functions and tools which follows. In order to deal exclusively with the right tail of the distribution, when taking ratios of sample variances from …

## Statistics for Beginners in Excel – F Distribution

(Basic Statistics for Citizen Data Scientist) F Distribution The F-distribution is primarily used to compare the variances of two populations, as described in Hypothesis Testing to Compare Variances. This is particularly relevant in the analysis of variance testing (ANOVA) and in regression analysis. Definition 1: The The F-distribution with n1, n2degrees of freedom is defined by Theorem 1: If we draw two …

## Statistics for Beginners in Excel – Fisher’s Exact Test

(Basic Statistics for Citizen Data Scientist) Fisher’s Exact Test When the conditions for Pearson’s chi-square test are not met, especially when one of more of the cells have expi < 5, an alternative approach with 2 × 2 contingency tables is to use Fisher’s exact test. Since this method is more computationally intense, it is best used for smaller …

## Statistics for Beginners in Excel – Independence Testing

(Basic Statistics for Citizen Data Scientist) Independence Testing The method described in Goodness of Fit can also be used to determine whether two sets of data are independent of each other. Such data are organized in what are called contingency tables, as described in Example 1. In these cases df = (row count – 1) (column count – 1). Excel …