Month: May 2021

Data Wrangling in Python – How to Do Descriptive Statistics For pandas Dataframe

Descriptive Statistics For pandas Dataframe
Import modules
    import pandas as pd
Create dataframe
    data = {'name': ['Jason', 'Molly', 'Tina', 'Jake', 'Amy'],
            'age': [42, 52, 36, 24, 73],
            'preTestScore': [4, 24, 31, 2, 3],
            'postTestScore': [25, 94, 57, 62, 70]}
    df = pd.DataFrame(data, columns = ['name', 'age', 'preTestScore', 'postTestScore'])
    df
       name  age  preTestScore  postTestScore
    0  Jason …
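The excerpt stops before any statistics are computed. As a minimal sketch of what such a summary could look like on the same dataframe, the block below uses standard pandas reductions and `describe()`. The variable names `age_sum`, `pre_mean`, `post_max`, and `summary` are illustrative and do not come from the original post.

```python
import pandas as pd

data = {'name': ['Jason', 'Molly', 'Tina', 'Jake', 'Amy'],
        'age': [42, 52, 36, 24, 73],
        'preTestScore': [4, 24, 31, 2, 3],
        'postTestScore': [25, 94, 57, 62, 70]}
df = pd.DataFrame(data, columns=['name', 'age', 'preTestScore', 'postTestScore'])

# Single-column reductions (names below are illustrative)
age_sum = df['age'].sum()              # total of all ages
pre_mean = df['preTestScore'].mean()   # average pre-test score
post_max = df['postTestScore'].max()   # highest post-test score

# describe() reports count, mean, std, min, quartiles, and max
# for every numeric column in one table
summary = df.describe()
print(summary)
```

`describe()` skips the non-numeric `name` column by default, so the summary has one column per numeric field.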

Data Wrangling in Python – Delete Duplicates In pandas

Delete Duplicates In pandas
Import modules
    import pandas as pd
Create dataframe with duplicates
    raw_data = {'first_name': ['Jason', 'Jason', 'Jason', 'Tina', 'Jake', 'Amy'],
                'last_name': ['Miller', 'Miller', 'Miller', 'Ali', 'Milner', 'Cooze'],
                'age': [42, 42, 1111111, 36, 24, 73],
                'preTestScore': [4, 4, 4, 31, 2, 3],
                'postTestScore': [25, 25, 25, 57, 62, 70]}
    df = pd.DataFrame(raw_data, columns = ['first_name', …
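The excerpt ends before the duplicates are actually removed. The sketch below shows one way this might be done with `duplicated()` and `drop_duplicates()`, assuming the dataframe is built from all five keys in `raw_data`. The names `dup_mask`, `deduped`, and `deduped_by_name` are illustrative.

```python
import pandas as pd

raw_data = {'first_name': ['Jason', 'Jason', 'Jason', 'Tina', 'Jake', 'Amy'],
            'last_name': ['Miller', 'Miller', 'Miller', 'Ali', 'Milner', 'Cooze'],
            'age': [42, 42, 1111111, 36, 24, 73],
            'preTestScore': [4, 4, 4, 31, 2, 3],
            'postTestScore': [25, 25, 25, 57, 62, 70]}
# Assumption: all five keys are used as columns
df = pd.DataFrame(raw_data)

# True for every row that exactly repeats an earlier row
dup_mask = df.duplicated()

# Drop rows that are identical across all columns (keeps the first copy)
deduped = df.drop_duplicates()

# Drop rows that repeat a first name, keeping the last occurrence
deduped_by_name = df.drop_duplicates(subset=['first_name'], keep='last')

print(deduped)
print(deduped_by_name)
```

Only the second row is a full duplicate; the third differs in `age`, so it survives `drop_duplicates()` unless the comparison is restricted with `subset`.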

Data Wrangling in Python – Crosstabs In pandas

Crosstabs In pandas
Import pandas
    import pandas as pd
    raw_data = {'regiment': ['Nighthawks', 'Nighthawks', 'Nighthawks', 'Nighthawks', 'Dragoons', 'Dragoons', 'Dragoons', 'Dragoons', 'Scouts', 'Scouts', 'Scouts', 'Scouts'],
                'company': ['infantry', 'infantry', 'cavalry', 'cavalry', 'infantry', 'infantry', 'cavalry', 'cavalry', 'infantry', 'infantry', 'cavalry', 'cavalry'],
                'experience': ['veteran', 'rookie', 'veteran', 'rookie', 'veteran', 'rookie', 'veteran', 'rookie', 'veteran', 'rookie', 'veteran', 'rookie'],
                'name': ['Miller', 'Jacobson', 'Ali', 'Milner', 'Cooze', 'Jacon', …
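Because the excerpt is cut off inside the `name` column, the sketch below uses only the three columns that appear in full (`regiment`, `company`, `experience`) to illustrate what `pd.crosstab` produces. The variable names `by_company` and `by_company_experience` are illustrative.

```python
import pandas as pd

# Only the columns given in full in the excerpt are used here
raw_data = {
    'regiment': ['Nighthawks'] * 4 + ['Dragoons'] * 4 + ['Scouts'] * 4,
    'company': ['infantry', 'infantry', 'cavalry', 'cavalry'] * 3,
    'experience': ['veteran', 'rookie'] * 6,
}
df = pd.DataFrame(raw_data)

# Count rows for each regiment/company pair; margins=True adds 'All' totals
by_company = pd.crosstab(df['regiment'], df['company'], margins=True)

# Passing a list of columns produces hierarchical column labels
by_company_experience = pd.crosstab(df['regiment'],
                                    [df['company'], df['experience']])

print(by_company)
print(by_company_experience)
```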

Data Wrangling in Python – Creating Lists From Dictionary Keys And Values

Creating Lists From Dictionary Keys And Values
Create a dictionary
    dict = {'county': ['Cochice', 'Pima', 'Santa Cruz', 'Maricopa', 'Yuma'],
            'year': [2012, 2012, 2013, 2014, 2014],
            'fireReports': [4, 24, 31, 2, 3]}
Create a list from the dictionary keys
    # Create a list of keys
    list(dict.keys())
    ['county', 'year', 'fireReports']
Create a list from the dictionary …
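As a self-contained sketch of the same idea, the block below builds lists from both the keys and the values. It uses the hypothetical name `fire_data` instead of `dict` so that the built-in `dict` type is not shadowed. On Python 3.7 and later, dictionaries preserve insertion order, so both lists follow the order in which the keys were defined.

```python
# Hypothetical variable name; avoids shadowing the built-in dict type
fire_data = {'county': ['Cochice', 'Pima', 'Santa Cruz', 'Maricopa', 'Yuma'],
             'year': [2012, 2012, 2013, 2014, 2014],
             'fireReports': [4, 24, 31, 2, 3]}

# List of keys, in insertion order
keys = list(fire_data.keys())

# List of values (each value is itself a list), in the same order
values = list(fire_data.values())

print(keys)
print(values)
```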

Data Wrangling in Python – How to Create a Column Based on a Conditional in pandas

Create a Column Based on a Conditional in pandas
Preliminaries
    # Import required modules
    import pandas as pd
    import numpy as np
Make a dataframe
    data = {'name': ['Jason', 'Molly', 'Tina', 'Jake', 'Amy'],
            'age': [42, 52, 36, 24, 73],
            'preTestScore': [4, 24, 31, 2, 3],
            'postTestScore': [25, 94, 57, 62, 70]}
    df = pd.DataFrame(data, …
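The excerpt ends before the conditional column is created. One common way to do this is `np.where`, sketched below. The column name `elderly` and the threshold of 50 are assumptions chosen for illustration, not details confirmed by the excerpt.

```python
import pandas as pd
import numpy as np

data = {'name': ['Jason', 'Molly', 'Tina', 'Jake', 'Amy'],
        'age': [42, 52, 36, 24, 73],
        'preTestScore': [4, 24, 31, 2, 3],
        'postTestScore': [25, 94, 57, 62, 70]}
df = pd.DataFrame(data)

# Assumed rule for illustration: flag anyone aged 50 or older
# np.where(condition, value_if_true, value_if_false) works element-wise
df['elderly'] = np.where(df['age'] >= 50, 'yes', 'no')

print(df[['name', 'age', 'elderly']])
```

`np.where` evaluates the condition for the whole column at once, which is usually faster and shorter than looping over rows.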

Data Wrangling in Python – How to Create Counts Of Items

Create Counts Of Items
Preliminaries
    from collections import Counter
Create A Counter
    # Create a counter of the fruits eaten today
    fruit_eaten = Counter(['Apple', 'Apple', 'Apple', 'Banana', 'Pear', 'Pineapple'])
    # View counter
    fruit_eaten
    Counter({'Apple': 3, 'Banana': 1, 'Pear': 1, 'Pineapple': 1})
Update The Count For An Element
    # Update the count for 'Pineapple' …
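The update step is cut off in the excerpt. As a hedged sketch, the block below shows one way a count might be incremented with `Counter.update`, followed by `most_common` to read off the top items. Which method the original post uses is not visible here, and the name `top_item` is illustrative.

```python
from collections import Counter

# Create a counter of the fruits eaten today
fruit_eaten = Counter(['Apple', 'Apple', 'Apple', 'Banana', 'Pear', 'Pineapple'])

# update() adds to existing counts rather than replacing them
fruit_eaten.update(['Pineapple'])

# most_common(n) returns the n highest counts as (item, count) pairs
top_item = fruit_eaten.most_common(1)

print(fruit_eaten['Pineapple'])
print(top_item)
```

Looking up an item that was never counted returns 0 instead of raising `KeyError`, which makes `Counter` convenient for tallying.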

Data Wrangling in Python – How to Create A pandas Column With A For Loop

Create A pandas Column With A For Loop
Preliminaries
    import pandas as pd
    import numpy as np
Create an example dataframe
    raw_data = {'student_name': ['Miller', 'Jacobson', 'Ali', 'Milner', 'Cooze', 'Jacon', 'Ryaner', 'Sone', 'Sloan', 'Piger', 'Riani', 'Ali'],
                'test_score': [76, 88, 84, 67, 53, 96, 64, 91, 77, 73, 52, np.nan]}
    df = pd.DataFrame(raw_data, columns = ['student_name', …
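The loop itself is not shown in the excerpt. A minimal sketch of the pattern follows: iterate over an existing column, build a plain Python list, and assign it as a new column. The grade thresholds and the `'Missing'` label are hypothetical choices for illustration. The block uses `np.nan`, since the `np.NaN` alias was removed in NumPy 2.0.

```python
import pandas as pd
import numpy as np

raw_data = {'student_name': ['Miller', 'Jacobson', 'Ali', 'Milner', 'Cooze', 'Jacon',
                             'Ryaner', 'Sone', 'Sloan', 'Piger', 'Riani', 'Ali'],
            'test_score': [76, 88, 84, 67, 53, 96, 64, 91, 77, 73, 52, np.nan]}
df = pd.DataFrame(raw_data, columns=['student_name', 'test_score'])

# Hypothetical grading scheme, chosen only to illustrate the loop
grades = []
for score in df['test_score']:
    if pd.isna(score):
        grades.append('Missing')   # no score recorded
    elif score >= 90:
        grades.append('A')
    elif score >= 80:
        grades.append('B')
    elif score >= 70:
        grades.append('C')
    else:
        grades.append('F')

# The list has one entry per row, so it can be assigned directly
df['grade'] = grades
print(df)
```

The missing-value check comes first because any comparison with NaN is false, which would otherwise send the missing score to the final `else` branch.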

Data Wrangling in Python – How to Create A Pipeline In Pandas

Create A Pipeline In Pandas
Pandas’ pipeline feature lets you chain Python functions together into a data-processing pipeline.
Preliminaries
    import pandas as pd
Create Dataframe
    # Create empty dataframe
    df = pd.DataFrame()
    # Create a column
    df['name'] = ['John', 'Steve', 'Sarah']
    df['gender'] = ['Male', 'Male', 'Female']
    df['age'] …
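The excerpt stops before any functions are chained. The sketch below shows how `DataFrame.pipe` can pass a dataframe through a sequence of functions. The age values are placeholders invented for this example because the original values are truncated, and the helper functions `mean_age_by_group` and `uppercase_column_names` are hypothetical.

```python
import pandas as pd

df = pd.DataFrame()
df['name'] = ['John', 'Steve', 'Sarah']
df['gender'] = ['Male', 'Male', 'Female']
# Placeholder ages: the original values are not visible in the excerpt
df['age'] = [31, 32, 19]

def mean_age_by_group(dataframe, col):
    """Return the mean age for each value of the grouping column."""
    return dataframe.groupby(col)[['age']].mean()

def uppercase_column_names(dataframe):
    """Return a copy of the dataframe with upper-cased column names."""
    dataframe = dataframe.copy()
    dataframe.columns = dataframe.columns.str.upper()
    return dataframe

# Each pipe() call hands the previous result to the next function;
# extra keyword arguments are forwarded to that function
result = (df.pipe(mean_age_by_group, col='gender')
            .pipe(uppercase_column_names))

print(result)
```

Chaining with `pipe` reads top to bottom in execution order, which nested calls such as `uppercase_column_names(mean_age_by_group(df, col='gender'))` do not.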

Data Wrangling in Python – How to Count Values In Pandas Dataframe

Count Values In Pandas Dataframe
Import the pandas module
    import pandas as pd
Create all the columns of the dataframe as series
    year = pd.Series([1875, 1876, 1877, 1878, 1879, 1880, 1881, 1882, 1883, 1884, 1885, 1886, 1887, 1888, 1889, 1890, 1891, 1892, 1893, 1894])
    guardCorps = pd.Series([0,2,2,1,0,0,1,1,0,3,0,2,1,0,0,1,0,1,0,1])
    corps1 = pd.Series([0,0,0,2,0,3,0,2,0,0,0,1,1,1,0,2,0,3,1,0])
    corps2 = pd.Series([0,0,0,2,0,2,0,0,1,1,0,0,2,1,1,0,0,2,0,0])
    corps3 = …
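The counting step falls after the truncation point. As a sketch, the block below uses only the two series given in full (`guardCorps` and `corps1`) and counts how often each value occurs with `value_counts()`. The names `guard_counts` and `counts_per_column` are illustrative.

```python
import pandas as pd

# Only series that appear in full in the excerpt are used here
guardCorps = pd.Series([0,2,2,1,0,0,1,1,0,3,0,2,1,0,0,1,0,1,0,1])
corps1 = pd.Series([0,0,0,2,0,3,0,2,0,0,0,1,1,1,0,2,0,3,1,0])
df = pd.DataFrame({'guardCorps': guardCorps, 'corps1': corps1})

# Frequency of each distinct value in a single column
guard_counts = df['guardCorps'].value_counts()

# Frequency of each distinct value in every column; values that never
# occur in a column come back as NaN, so they are filled with 0
counts_per_column = (df.apply(lambda col: col.value_counts())
                       .fillna(0)
                       .astype(int))

print(guard_counts)
print(counts_per_column)
```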