Tag Archives: Applied Statistics

Statistics with R for Business Analysts – Analysis of Covariance

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Analysis of Covariance We use Regression analysis to create models which describe the effect of variation in predictor variables on the response variable. Sometimes, if we have a categorical variable with values like Yes/No or Male/Female etc. The simple regression analysis gives …

Statistics with R for Business Analysts – Poisson Regression

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Poisson Regression Poisson Regression involves regression models in which the response variable is in the form of counts and not fractional numbers. For example, the count of number of births or number of wins in a football match series. Also the values …

Statistics with R for Business Analysts – Binomial Distribution

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Binomial Distribution The binomial distribution model deals with finding the probability of success of an event which has only two possible outcomes in a series of experiments. For example, tossing of a coin always gives a head or a tail. The probability …

Statistics with R for Business Analysts – Normal Distribution

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Normal Distribution In a random collection of data from independent sources, it is generally observed that the distribution of data is normal. Which means, on plotting a graph with the value of the variable in the horizontal axis and the count of …

Statistics with R for Business Analysts – Logistic Regression

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Logistic Regression The Logistic Regression is a regression model in which the response variable (dependent variable) has categorical values such as True/False or 0/1. It actually measures the probability of a binary response as the value of response variable based on the …

Statistics with R for Business Analysts – Multiple Regression

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Multiple Regression Multiple regression is an extension of linear regression into relationship between more than two variables. In simple linear relation we have one predictor and one response variable, but in multiple regression we have more than one predictor variable and one …

Statistics with R for Business Analysts – Linear Regression

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Linear Regression Regression analysis is a very widely used statistical tool to establish a relationship model between two variables. One of these variable is called predictor variable whose value is gathered through experiments. The other variable is called response variable whose value …

Time Series Forecasting in Python using SARIMAX Model with BJ Sales Dataset | Data Science tutorials

Time Series Forecasting in Python using SARIMAX Model with BJ Sales Dataset       The BJ Sales dataset from UCI (University of California, Irvine) is a collection of 42 observations and 1 feature that are used to forecast the number of sales of a certain product in Beijing. Each observation represents a month, and …

How to get Statistical Summary of a Dataset | Jupyter Notebook | Python Data Science for beginners

How to get Statistical Summary of a Dataset   A statistical summary is a set of descriptive statistics that provide an overview of the key features of a dataset. In this essay, we will go over the steps needed to get a statistical summary of a dataset in Python. The first step is to load …

Applied Data Science Coding | Forecasting in R | SARIMA model | Air Quality Dataset

Applied Data Science Coding | Forecasting in R | SARIMA model | Air Quality Dataset   Data science is a field that uses various techniques to extract insights and knowledge from data. One important aspect of data science is forecasting, which involves using historical data to predict future events. Forecasting is important in many industries …