R Tutorials for Citizen Data Scientists

Statistics with R for Business Analysts – Analysis of Covariance

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Analysis of Covariance We use Regression analysis to create models which describe the effect of variation in predictor variables on the response variable. Sometimes, if we have a categorical variable with values like Yes/No or Male/Female etc. The simple regression analysis gives …

Statistics with R for Business Analysts – Poisson Regression

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Poisson Regression Poisson Regression involves regression models in which the response variable is in the form of counts and not fractional numbers. For example, the count of number of births or number of wins in a football match series. Also the values …

Statistics with R for Business Analysts – Binomial Distribution

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Binomial Distribution The binomial distribution model deals with finding the probability of success of an event which has only two possible outcomes in a series of experiments. For example, tossing of a coin always gives a head or a tail. The probability …

Statistics with R for Business Analysts – Normal Distribution

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Normal Distribution In a random collection of data from independent sources, it is generally observed that the distribution of data is normal. Which means, on plotting a graph with the value of the variable in the horizontal axis and the count of …

Statistics with R for Business Analysts – Logistic Regression

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Logistic Regression The Logistic Regression is a regression model in which the response variable (dependent variable) has categorical values such as True/False or 0/1. It actually measures the probability of a binary response as the value of response variable based on the …

Statistics with R for Business Analysts – Multiple Regression

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Multiple Regression Multiple regression is an extension of linear regression into relationship between more than two variables. In simple linear relation we have one predictor and one response variable, but in multiple regression we have more than one predictor variable and one …

Statistics with R for Business Analysts – Linear Regression

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Linear Regression Regression analysis is a very widely used statistical tool to establish a relationship model between two variables. One of these variable is called predictor variable whose value is gathered through experiments. The other variable is called response variable whose value …

Statistics with R for Business Analysts – Mean, Median and Mode

(R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Mean, Median and Mode Statistical analysis in R is performed by using many in-built functions. Most of these functions are part of the R base package. These functions take R vector as an input along with the arguments and give the result. …

R Visualisation for Beginners – Scatterplots

(R Tutorials for Citizen Data Scientist) R Visualisation for Beginners – Scatterplots Scatterplots show many points plotted in the Cartesian plane. Each point represents the values of two variables. One variable is chosen in the horizontal axis and another in the vertical axis. The simple scatterplot is created using the plot() function. Syntax The basic syntax for …

R Visualisation for Beginners – Line Graphs

(R Tutorials for Citizen Data Scientist) R Visualisation for Beginners – Line Graphs A line chart is a graph that connects a series of points by drawing line segments between them. These points are ordered in one of their coordinate (usually the x-coordinate) value. Line charts are usually used in identifying the trends in data. …