Month: January 2021

Year 1 – Math Worksheet 003 – Understanding Measurements Length and Width of an Object

  Year 1 – Math Worksheet 003 – Understanding Measurements Length and Width of an Object   Year 1 – Math Worksheet 002 – Properties of Shapes   Personal Career & Learning Guide for Data Analyst, Data Engineer and Data Scientist Applied Machine Learning & Data Science Projects and Coding Recipes for Beginners A list …

Statistics for Beginners in Excel – Normal Distribution

(Basic Statistics for Citizen Data Scientist) Basic Characteristics of the Normal Distribution Definition 1: The probability density function of the normal distribution is defined as: Here is the constant e = 2.7183…, and is the constant π = 3.1415… . The normal distribution is completely determined by the parameters µ and σ. It turns out that µ is the mean of the normal distribution and σ is …

Statistics for Beginners in Excel – Real Statistics Power Data Analysis Tool

(Basic Statistics for Citizen Data Scientist) Real Statistics Power Data Analysis Tool Real Statistics Data Analysis Tool: The Real Statistics Resource Pack supplies the Statistical Power and Sample Size data analysis tool to determine the power which results from a statistical test for a specified effect size, sample size and alpha, as well as the sample size …

Statistics for Beginners in Excel – Null and Alternative Hypothesis

(Basic Statistics for Citizen Data Scientist) Null and Alternative Hypothesis Generally to understand some characteristic of the general population we take a random sample and study the corresponding property of the sample. We then determine whether any conclusions we reach about the sample are representative of the population. This is done by choosing an estimator function for …

Statistics for Beginners in Excel – Dealing with Missing Data

(Basic Statistics for Citizen Data Scientist) Dealing with Missing Data This tutorial is based on the Real Statistics Resource Pack. Another problem faced when collecting data is that some data may be missing. For example, in conducting a survey with ten questions, perhaps some of the people who take the survey don’t answer all ten …

Statistics for Beginners in Excel – Box Plots with Outliers

(Basic Statistics for Citizen Data Scientist) Box Plots with Outliers Excel 2016 has added a Box and Whiskers chart capability. To access this capability for Example 1 of Creating Box Plots in Excel, highlight the data range A2:C11 (from Figure 1) and select Insert > Charts|Statistical > Box and Whiskers. The chart shown on the right side of Figure 1 will …

Statistics for Beginners in Excel – Outliers and Robustness

(Basic Statistics for Citizen Data Scientist) Outliers and Robustness One problem that we face in analyzing data is the presence of outliers, i.e. a data element that is much bigger or much smaller than the other data elements. For example, the mean of the sample {2, 3, 4, 5, 6} is 4, while the mean of …

Statistics for Beginners in Excel – ROC and Classification Table Data Analysis Tool

(Basic Statistics for Citizen Data Scientist) ROC and Classification Table Data Analysis Tool Real Statistics Data Analysis Tools: The Real Statistics Resource Pack supplies the ROC Curve and Classification Table data analysis tool which provides an easier way to construct the ROC curve and classification table. We show how this is done for Example 1 of Classification Table and ROC Curve. …

Statistics for Beginners in Excel – AUC Confidence Interval

(Basic Statistics for Citizen Data Scientist) AUC Confidence Interval For large samples, AUC (area under the curve for a ROC curve) is approximately normally distributed, and so a 1-α confidence interval for AUC may be calculated as described in Confidence Interval for Sampling Distributions. The confidence interval is equal to AUC  ± se · zcrit where zcrit is the two-tailed critical value …