R Tutorials for Citizen Data Scientists

Statistics with R for Business Analysts – Random Forest

Hits: 39 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Random Forest In the random forest approach, a large number of decision trees are created. Every observation is fed into every decision tree. The most common outcome for each observation is used as the final output. A new observation is …

Statistics with R for Business Analysts – Decision Tree

Hits: 23 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Decision Tree Decision tree is a graph to represent choices and their results in form of a tree. The nodes in the graph represent an event or choice and the edges of the graph represent the decision rules or conditions. …

Statistics with R for Business Analysts – Nonlinear Least Square

Hits: 8 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Nonlinear Least Square When modeling real world data for regression analysis, we observe that it is rarely the case that the equation of the model is a linear equation giving a linear graph. Most of the time, the equation of …

Statistics with R for Business Analysts – Time Series Analysis

Hits: 11 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Time Series Analysis Time series is a series of data points in which each data point is associated with a timestamp. A simple example is the price of a stock in the stock market at different points of time on …

Statistics with R for Business Analysts – Analysis of Covariance

Hits: 17 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Analysis of Covariance We use Regression analysis to create models which describe the effect of variation in predictor variables on the response variable. Sometimes, if we have a categorical variable with values like Yes/No or Male/Female etc. The simple regression …

Statistics with R for Business Analysts – Poisson Regression

Hits: 8 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Poisson Regression Poisson Regression involves regression models in which the response variable is in the form of counts and not fractional numbers. For example, the count of number of births or number of wins in a football match series. Also …

Statistics with R for Business Analysts – Binomial Distribution

Hits: 11 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Binomial Distribution The binomial distribution model deals with finding the probability of success of an event which has only two possible outcomes in a series of experiments. For example, tossing of a coin always gives a head or a tail. …

Statistics with R for Business Analysts – Normal Distribution

Hits: 8 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Normal Distribution In a random collection of data from independent sources, it is generally observed that the distribution of data is normal. Which means, on plotting a graph with the value of the variable in the horizontal axis and the …

Statistics with R for Business Analysts – Logistic Regression

Hits: 3 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Logistic Regression The Logistic Regression is a regression model in which the response variable (dependent variable) has categorical values such as True/False or 0/1. It actually measures the probability of a binary response as the value of response variable based …

Statistics with R for Business Analysts – Multiple Regression

Hits: 23 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Multiple Regression Multiple regression is an extension of linear regression into relationship between more than two variables. In simple linear relation we have one predictor and one response variable, but in multiple regression we have more than one predictor variable …

Statistics with R for Business Analysts – Linear Regression

Hits: 8 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Linear Regression Regression analysis is a very widely used statistical tool to establish a relationship model between two variables. One of these variable is called predictor variable whose value is gathered through experiments. The other variable is called response variable …

Statistics with R for Business Analysts – Mean, Median and Mode

Hits: 14 (R Tutorials for Citizen Data Scientist) Statistics with R for Business Analysts – Mean, Median and Mode Statistical analysis in R is performed by using many in-built functions. Most of these functions are part of the R base package. These functions take R vector as an input along with the arguments and give …

R Visualisation for Beginners – Scatterplots

Hits: 5 (R Tutorials for Citizen Data Scientist) R Visualisation for Beginners – Scatterplots Scatterplots show many points plotted in the Cartesian plane. Each point represents the values of two variables. One variable is chosen in the horizontal axis and another in the vertical axis. The simple scatterplot is created using the plot() function. Syntax The basic …

R Visualisation for Beginners – Line Graphs

Hits: 3 (R Tutorials for Citizen Data Scientist) R Visualisation for Beginners – Line Graphs A line chart is a graph that connects a series of points by drawing line segments between them. These points are ordered in one of their coordinate (usually the x-coordinate) value. Line charts are usually used in identifying the trends …

R Visualisation for Beginners – Histograms

Hits: 12 (R Tutorials for Citizen Data Scientist) R Visualisation for Beginners – Histograms A histogram represents the frequencies of values of a variable bucketed into ranges. Histogram is similar to bar chat but the difference is it groups the values into continuous ranges. Each bar in histogram represents the height of the number of …

R Visualisation for Beginners – Boxplots

Hits: 6 (R Tutorials for Citizen Data Scientist) R Visualisation for Beginners – Boxplots Boxplots are a measure of how well distributed is the data in a data set. It divides the data set into three quartiles. This graph represents the minimum, maximum, median, first quartile and third quartile in the data set. It is …