Tag Archives: python data science

Understanding ROC Curves with Python

Hits: 5  Understanding ROC Curves with Python In the current age where Data Science / AI is booming, it is important to understand how Machine Learning is used in the industry to solve complex business problems. In order to select which Machine Learning model should be used in production, a selection metric is chosen upon …

How to do Cross Validation and Grid Search for Model Selection in Python

Hits: 2 How to do Cross Validation and Grid Search for Model Selection in Python Introduction A typical machine learning process involves training different models on the dataset and selecting the one with best performance. However, evaluating the performance of algorithm is not always a straight forward task. There are several factors that can help …

Hierarchical Clustering with Python and Scikit-Learn

Hits: 4  Hierarchical Clustering with Python and Scikit-Learn Hierarchical clustering is a type of unsupervised machine learning algorithm used to cluster unlabeled data points. Like K-means clustering, hierarchical clustering also groups together the data points with similar characteristics. In some cases the result of hierarchical and K-Means clustering can be similar. Before implementing hierarchical clustering using Scikit-Learn, let’s first …

The Naive Bayes Algorithm in Python with Scikit-Learn

Hits: 1  The Naive Bayes Algorithm in Python with Scikit-Learn When studying Probability & Statistics, one of the first and most important theorems students learn is the Bayes’ Theorem. This theorem is the foundation of deductive reasoning, which focuses on determining the probability of an event occurring based on prior knowledge of conditions that might be …

How to implement Random Forest Algorithm with Python and Scikit-Learn

Hits: 2  How to implement Random Forest Algorithm with Python and Scikit-Learn Random forest is a type of supervised machine learning algorithm based on ensemble learning. Ensemble learning is a type of learning where you join different types of algorithms or same algorithm multiple times to form a more powerful prediction model. The random forest algorithm combines multiple …

How to Implement LDA in Python with Scikit-Learn

Hits: 2  How to Implement LDA in Python with Scikit-Learn In our previous article Implementing PCA in Python with Scikit-Learn, we studied how we can reduce dimensionality of the feature set using PCA. In this article we will study another very important dimensionality reduction technique: linear discriminant analysis (or LDA). But first let’s briefly discuss how PCA and …

How to Implement PCA in Python with Scikit-Learn

Hits: 3  How to Implement PCA in Python with Scikit-Learn With the availability of high performance CPUs and GPUs, it is pretty much possible to solve every regression, classification, clustering and other related problems using machine learning and deep learning models. However, there are still various factors that cause performance bottlenecks while developing such models. …

How to implement Decision Trees in Python with Scikit-Learn

Hits: 1 How to implement Decision Trees in Python with Scikit-Learn Introduction A decision tree is one of most frequently and widely used supervised machine learning algorithms that can perform both regression and classification tasks. The intuition behind the decision tree algorithm is simple, yet also very powerful. For each attribute in the dataset, the decision …

Introduction to Neural Networks with Scikit-Learn in Python

Hits: 1 Introduction to Neural Networks with Scikit-Learn in Python What is a Neural Network? Humans have an ability to identify patterns within the accessible information with an astonishingly high degree of accuracy. Whenever you see a car or a bicycle you can immediately recognize what they are. This is because we have learned over …

Loops in Python for Business Analyst

Hits: 3 Loops in Python for Business Analyst Choosing the Right Loop Construct Python offers a variety of constructs to do loops. This article presents them and gives advice on their specific usage. Furthermore, we will also have a look at the performance of each looping construct in your Python code. It might be surprising …

TensorFlow Neural Network Tutorial in Python

Hits: 4  TensorFlow Neural Network Tutorial in Python TensorFlow is an open-source library for machine learning applications. It’s the Google Brain’s second generation system, after replacing the close-sourced DistBelief, and is used by Google for both research and production applications. TensorFlow applications can be written in a few languages: Python, Go, Java and C. This …

Python Exception Handling

Hits: 3  Python Exception Handling This tutorial will give an introduction to what Python exceptions are, the most common types of exceptions, and how to handle raised exceptions with the try and except clauses. What is a Python Exception? A Python exception is a construct used to signal an important event, usually an error, that occurs when executing a …

How to List Files in a Directory using Python

Hits: 9 How to List Files in a Directory using Python Using os.walk() The os module contains a long list of methods that deal with the filesystem, and the operating system. One of them is walk(), which generates the filenames in a directory tree by walking the tree either top-down or bottom-up (with top-down being the default setting). os.walk() returns …