Data Frame Archives

IRIS Flower Classification using SKLEARN DecisionTree Classifier with Grid Search Cross Validation

By SETScholars Team on Thursday, January 2, 2020

The Iris flower dataset is a popular example in machine learning. It covers three species of iris: setosa, virginica, and versicolor. In this blog, we will be discussing how to classify the …

Classification Data Analytics Data Science Machine Learning Recipe R Classification R for Beginners R for Data Science R Machine Learning R Machine Learning Crash Course Supervised Learning Tabular Data Analytics

End-to-End Machine Learning: logloss metric in R

By SETScholars Team on Saturday, November 23, 2019

When training a machine learning model, it’s important to evaluate its performance to understand how well it will work on new, unseen data. One common way to evaluate the performance of a model is by using a metric called “log loss” or “cross-entropy loss”. Log loss is a …
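As a rough illustration of the metric named in this excerpt, binary log loss is the negative mean log-likelihood of the true labels under the predicted probabilities. The sketch below is written in Python rather than R, and the function name `log_loss`, the clipping constant `eps`, and the sample data are assumptions made for illustration, not code from the post.

```python
import math

def log_loss(y_true, y_prob, eps=1e-15):
    """Binary log loss: negative mean log-likelihood of the true labels."""
    if len(y_true) != len(y_prob):
        raise ValueError("y_true and y_prob must have the same length")
    total = 0.0
    for y, p in zip(y_true, y_prob):
        # Clip probabilities away from 0 and 1 so log() stays finite.
        p = min(max(p, eps), 1 - eps)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return -total / len(y_true)

# Hypothetical labels and predicted probabilities of class 1.
labels = [1, 0, 1, 1]
confident = [0.9, 0.1, 0.8, 0.7]
uncertain = [0.5, 0.5, 0.5, 0.5]

print(log_loss(labels, confident))  # lower: predictions agree with the labels
print(log_loss(labels, uncertain))  # higher: every prediction is a coin flip
```

A model that always predicts 0.5 scores ln(2) on this metric, which is a convenient baseline when reading log loss values.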

Classification Data Analytics Data Science R for Beginners R for Business Analytics R for Data Science R for Data Visualisation R for Excel Users R Machine Learning R Machine Learning Crash Course R Regression Regression

Support Vector Machine in R

By SETScholars Team on Saturday, November 9, 2019

Support Vector Machine (SVM) is a type of supervised machine learning algorithm that can be used for both classification and regression tasks. It works by finding the best boundary, called a hyperplane, that separates different classes or predicts the target variable with the highest accuracy. In R, there are several …

Applied Statistics Data Analytics Data Science R for Beginners R for Business Analytics R for Data Science R Machine Learning R Machine Learning Crash Course Tabular Data Analytics

How to do Feature Selection – remove highly correlated features in R

By SETScholars Team on Sunday, October 27, 2019

When working with a large dataset, it’s common to have features that are highly correlated with each other. These correlated features provide redundant information to the model and can negatively impact the performance. To overcome this issue, we can use feature selection techniques …
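One simple way to act on this idea is a greedy filter: keep a feature only if its absolute Pearson correlation with every feature already kept stays at or below a threshold. The sketch below is in Python rather than R and is not the post's method; the helper names `pearson` and `drop_correlated`, the 0.9 threshold, and the toy columns are all assumptions. It also assumes no column is constant, since a constant column has zero variance and an undefined correlation.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

def drop_correlated(columns, threshold=0.9):
    """Return the column names kept by a greedy correlation filter."""
    kept = []
    for name, values in columns.items():
        # Keep the column only if it is not too correlated with any kept one.
        if all(abs(pearson(values, columns[k])) <= threshold for k in kept):
            kept.append(name)
    return kept

# Hypothetical data: height_in is height_cm in different units.
data = {
    "height_cm": [150, 160, 170, 180, 190],
    "height_in": [59.1, 63.0, 66.9, 70.9, 74.8],
    "shoe_size": [40, 38, 44, 41, 43],
}
print(drop_correlated(data))
```

Because the filter is greedy, the result depends on column order: the first member of a correlated pair is the one that survives.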

Applied Statistics Data Analytics Data Science Data Visualisation R for Beginners R for Data Science R for Data Visualisation R Machine Learning R Machine Learning Crash Course

How to do Feature Selection – recursive feature elimination in R

By SETScholars Team on Friday, October 25, 2019

Recursive feature elimination (RFE) is a feature selection technique that recursively removes the least important features from the dataset. The goal of RFE is to select a subset of features that are most informative and relevant to the target variable, while reducing the dimensionality …

Applied Statistics Data Analytics Data Science R for Beginners R for Business Analytics R for Data Science R for Excel Users R Machine Learning Crash Course

Data Cleaning in R – mark missing values in R

By SETScholars Team on Thursday, October 24, 2019

Data cleaning is an important step in the data analysis process, and one of the first tasks is often identifying and marking missing values. Missing values can occur for a variety of reasons, such as data entry errors or survey respondents not answering certain questions. …
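To make "marking" concrete, the sketch below replaces placeholder strings with an explicit missing marker and then counts missing entries per column. It is written in Python rather than R, where `None` plays the role that `NA` plays in R. The placeholder set `MISSING_MARKERS`, the function names, and the sample survey rows are assumptions for illustration only.

```python
# Placeholder strings treated as missing; this set is an assumption.
MISSING_MARKERS = {"", "NA", "N/A", "?"}

def mark_missing(rows, markers=MISSING_MARKERS):
    """Return copies of the rows with placeholder strings replaced by None."""
    cleaned = []
    for row in rows:
        cleaned.append({
            key: None if isinstance(value, str) and value.strip() in markers else value
            for key, value in row.items()
        })
    return cleaned

def count_missing(rows):
    """Count None entries per column."""
    counts = {}
    for row in rows:
        for key, value in row.items():
            counts[key] = counts.get(key, 0) + (value is None)
    return counts

# Hypothetical survey responses with inconsistent placeholders.
survey = [
    {"age": 34, "income": "NA", "city": "Leeds"},
    {"age": "?", "income": 52000, "city": ""},
    {"age": 29, "income": 48000, "city": "York"},
]
cleaned = mark_missing(survey)
print(count_missing(cleaned))
```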

Applied Statistics Data Analytics Data Science Data Visualisation R for Beginners R for Business Analytics R for Data Visualisation R for Excel Users

Visualize Multivariate Data – Correlation plot in R

By SETScholars Team on Tuesday, October 15, 2019

A correlation plot is a useful tool for visualizing the relationship between multiple variables in a dataset. It allows you to quickly identify patterns and trends in the data, and to determine whether variables are positively or negatively correlated. In R, there are different ways to create a …

Data Analytics Data Science Data Visualisation R Classification R for Beginners R for Business Analytics R for Data Science R for Data Visualisation R for Excel Users R Machine Learning Crash Course

Visualize Univariate Data – BOX plot in R

By SETScholars Team on Monday, October 7, 2019

In R, a box plot is a useful tool for visualizing univariate data, or data that has only one variable. A box plot is a graph that uses boxes to represent the distribution of the data and to identify any potential outliers. To create a box plot …
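The numbers behind a box plot can be computed without drawing anything: the box spans the first and third quartiles, and points beyond 1.5 times the interquartile range from the box are commonly flagged as potential outliers. The sketch below is in Python rather than R; the function name `box_plot_stats` and the sample values are assumptions, and Python's `statistics.quantiles` uses a different quartile convention from the hinges R's `boxplot` uses, so the two can differ slightly on small samples.

```python
import statistics

def box_plot_stats(values):
    """Quartiles, 1.5*IQR fences, and flagged outliers for one variable."""
    q1, median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    outliers = [v for v in values if v < lower_fence or v > upper_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "lower_fence": lower_fence,
        "upper_fence": upper_fence,
        "outliers": outliers,
    }

# Hypothetical sample with one extreme value.
sample = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]
stats = box_plot_stats(sample)
print(stats["median"], stats["outliers"])
```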

Applied Statistics Data Analytics Data Science Python Machine Learning Python Machine Learning Crash Course R for Beginners R for Business Analytics R for Data Science R for Excel Users R Machine Learning R Machine Learning Crash Course

Summarise Data in R – How to know datatypes in R

By SETScholars Team on Tuesday, October 1, 2019

In R, it is important to know the data types of variables in a dataset, as different data types require different types of analysis and processing. The most common data types in R are numeric, character, and factor. To check the data types of …

Applied Statistics Data Analytics Data Science R for Beginners R for Business Analytics R for Data Science R for Excel Users R Machine Learning R Machine Learning Crash Course

Summarise Data in R – How to get summary statistics in R