Statistics for Beginners Archives

Statistics for Beginners in Excel – Identifying Outliers and Missing Data using Real Statistics

By SETScholars Team on Wednesday, January 27, 2021

(Basic Statistics for Citizen Data Scientist) Identifying Outliers and Missing Data The Real Statistics Resource Pack provides an option for identifying potential outliers in a sample. Assuming the sample is normally distributed (based on the Central Limit Theorem), we know that NORM.S.DIST(-2.5,TRUE) = 0.621% of the data should have a z-score less than -2.5. Similarly …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Power and Sample Size using Real Statistics

By SETScholars Team on Wednesday, January 27, 2021

(Basic Statistics for Citizen Data Scientist) Power and Sample Size using Real Statistics Real Statistics Functions: The Real Statistics Resource Pack supplies the following functions for calculating the power and sample size requirements for one-sample and two-sample hypothesis testing of the mean using the normal distribution. NORM1_POWER(d, n, tails, α) = the power of a …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Sampling using Real Statistics

By SETScholars Team on Wednesday, January 27, 2021

(Basic Statistics for Citizen Data Scientist) Sampling Excel provides a Sampling data analysis tool that can be used to create samples. The tool works by defining the population as an array in an Excel worksheet and then using the following input parameters to determine how you would like to carry out the sampling. Input Range – Specify …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Simulation using Real Statistics

By SETScholars Team on Tuesday, January 26, 2021

(Basic Statistics for Citizen Data Scientist) Simulation It is often useful to create a model using simulation. Usually, this takes the form of generating a series of random observations (often based on a specific statistical distribution) and then studying the resulting observations using techniques described throughout the rest of this website. This approach is commonly called Monte …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Comparing two means when variances are known

By SETScholars Team on Tuesday, January 26, 2021

(Basic Statistics for Citizen Data Scientist) Comparing two means when variances are known Theorem 1: Let x̄ and ȳ be the means of two samples of size nx and ny respectively. If x and y are normal or nx and ny are sufficiently large for the Central Limit Theorem to hold, then x̄ – ȳ has normal distribution with mean μx – μy and standard deviation Proof: Since the samples are random, x̄ and ȳ are normally and independently distributed. By the Central Limit …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Hypothesis Testing using the Central Limit Theorem

By SETScholars Team on Tuesday, January 26, 2021

(Basic Statistics for Citizen Data Scientist) Hypothesis Testing using the Central Limit Theorem Using the Central Limit Theorem we can extend the approach employed in Single Sample Hypothesis Testing for normally distributed populations to those that are not normally distributed. Suppose we take a sample of size n, where n is sufficiently large, and pose a null hypothesis that the …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Central Limit Theorem

By SETScholars Team on Monday, January 25, 2021

(Basic Statistics for Citizen Data Scientist) Central Limit Theorem Theorem 1 – Central Limit Theorem: If x has a distribution with mean μ and standard deviation σ then for n sufficiently large, the variable has a distribution which is approximately the standard normal distribution. Observation: The larger the value of n the better the approximation will be. For practical purposes when n ≥ 30, then the approximation will be …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Confidence Intervals for Sampling Distributions

By SETScholars Team on Monday, January 25, 2021

(Basic Statistics for Citizen Data Scientist) Confidence Intervals for Sampling Distributions Suppose we take a sample of size n from a normal population N(μ, σ) and ask whether the sample mean differs significantly from the overall population mean. As we have seen in Single Sample Hypothesis Testing The exact point of rejection (at the right tail), zcrit, has value And so …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Standardized Effect Size

By SETScholars Team on Monday, January 25, 2021

(Basic Statistics for Citizen Data Scientist) Standardized Effect Size Definition 1: Cohen’s d, a statistic which is independent of the sample size and is defined as where m1 and m2 represent two means and σpooled is some combined value for the standard deviation. The effect size given by d is conventionally viewed as small, medium or large as follows: d = 0.20 – small effect d = …

Applied Statistics Data Analytics Data Science Statistics for Beginners

Statistics for Beginners in Excel – Single Sample Hypothesis Testing

By SETScholars Team on Sunday, January 24, 2021

(Basic Statistics for Citizen Data Scientist) Single Sample Hypothesis Testing Suppose we take a sample of size n from a normal population N(μ, σ) and ask whether the sample mean differs significantly from the overall population mean. This is equivalent to testing the following null hypothesis H0: We use a two-tailed hypothesis, although sometimes a one-tailed hypothesis is preferred …

M	T	W	T	F	S	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Towards Advanced Analytics Specialist & Analytics Engineer

Tag Archives: Statistics for Beginners

Statistics for Beginners in Excel – Identifying Outliers and Missing Data using Real Statistics

Statistics for Beginners in Excel – Power and Sample Size using Real Statistics

Statistics for Beginners in Excel – Sampling using Real Statistics

Statistics for Beginners in Excel – Simulation using Real Statistics

Statistics for Beginners in Excel – Comparing two means when variances are known

Statistics for Beginners in Excel – Hypothesis Testing using the Central Limit Theorem

Statistics for Beginners in Excel – Central Limit Theorem

Statistics for Beginners in Excel – Confidence Intervals for Sampling Distributions

Statistics for Beginners in Excel – Standardized Effect Size

Statistics for Beginners in Excel – Single Sample Hypothesis Testing