Data Analysis with Pandas
When you think of Python...
Meet Jupyter Notebook
And me
job_title != "Developer"
I’m a Consultant at Distilled (since September 2015)
I do build some software in Python
But I mainly use it for data analysis
Getting Started
Python for scientific computing
Huge community
Fantastic ecosystem of packages other people have written
Can be tedious to actually install everything
Just use this!
(https://continuum.io/downloads)
What is Anaconda?
Essentially a large (~400 MB) Python installation
But contains everything* you need for data analysis
Unless you have a special reason not to, you should just
install and use this
You need the command line (but only for a minute)
On Windows, open PowerShell
On Mac, open Terminal or iTerm2
Just one line, though:
1. Just type “jupyter notebook”
2. Wait
3. ...
Back to safety
Open a new Notebook
Your very own data analysis environment
So that was fairly easy...
but why is it better than Excel?
There’s not enough room to list everything, but:
1. Handle larger data sets—no set limit on rows
2. Combine multiple files and data sources together
instantaneously. Pull data straight from APIs or scraping
3. Everything is completely customisable—if you can
imagine a query, it can be done (though not always easily)
4. It’s a safe place to mess things up
...and it’s the perfect playground for
learning Python
Side note: don’t know any Python?
Can’t cover it all today, so go here:
1. Learn Python the Hard Way (free)
2. Real Python ($60, but good)
3. Writing Idiomatic Python (~$15)
Unless you’re building applications:
1. Stick with the small building blocks
2. Learn how to write a function (we’ll do this today)
3. Learn about loops, conditional statements, and handling
data
4. Probably no need to learn about managing projects and
Jupyter Notebook
Save notebooks for later
Run and re-run Python code
Really cool features like post-mortem debugging if you make
a mistake
Cells
1. Type all the code you want
2. Shift+Enter to run it
3. View the result
Now that we have Jupyter Notebook up and running, you
can start playing around with almost any Python code
We’re going to look at Pandas, though—a data analysis
library written in Python
Started its life in finance
Great for fast, flexible computation
The Star of the Show
A little setup, first
You’ll do this more or less at the beginning of each session
It’ll become second nature; just import the workhorse
libraries we always use: numpy, pandas, pyplot.
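A setup cell along these lines is typical. The aliases np, pd and plt are community conventions, not something the slides prescribe:

```python
# Conventional aliases for the three workhorse libraries.
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

# Quick sanity check that the imports worked.
print(np.__version__, pd.__version__)
```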
The DataFrame
If you’re used to spreadsheets, the DataFrame isn’t too
difficult to understand
It’s the fundamental, flexible building block in Pandas
At its simplest, it looks rather like a spreadsheet would
The only obvious difference from Excel is the column
labels, which default to numbers (0, 1, 2...) instead of A, B, C...
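A minimal sketch of a DataFrame built "by hand", assuming random numbers as filler data. With no column names supplied, pandas falls back to numeric labels:

```python
import numpy as np
import pandas as pd

# Build a small DataFrame from a 4x3 array of random numbers.
# With no column names given, pandas labels the columns 0, 1, 2.
rng = np.random.default_rng(seed=0)
df = pd.DataFrame(rng.standard_normal((4, 3)))

print(df)
print(list(df.columns))  # [0, 1, 2]
```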
You’ll usually create them from some other source:
The Pandas library provides some nice functions for
importing from common file formats, so you won’t usually be
building “by hand”:
1. pd.read_csv()
2. pd.read_table()
3. pd.read_sql()
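The three readers can be tried without any real files. In this sketch the data, the column names and the table name "pages" are all made up for illustration, and an in-memory SQLite database stands in for a real one:

```python
import io
import sqlite3
import pandas as pd

# Hypothetical CSV content, held in memory so the example needs no file.
csv_text = "url,links\nhttp://example.com/a,12\nhttp://example.com/b,7\n"
from_csv = pd.read_csv(io.StringIO(csv_text))

# read_table works like read_csv but expects tab-separated fields by default.
from_table = pd.read_table(io.StringIO(csv_text.replace(",", "\t")))

# read_sql runs a query against a database connection.
conn = sqlite3.connect(":memory:")
from_csv.to_sql("pages", conn, index=False)
from_sql = pd.read_sql("SELECT * FROM pages WHERE links > 10", conn)
conn.close()

print(from_sql)
```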
We have so much data stored in CSVs
Our first function call will just read some data into the
DataFrame, where we can analyse it
Reading a CSV
Get help at any time with Shift+Tab
1. pd.read_csv() will read in the data
2. Fields are separated by tabs
3. The encoding is UTF-16 (don’t ask…)
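The call on the slide is not preserved in this transcript, so the following is only a sketch of what those three points amount to. The file name and the "URL" column are assumptions; a tiny file is written first so the example runs on its own:

```python
import os
import tempfile
import pandas as pd

# Write a tiny tab-separated, UTF-16 file to stand in for the real export.
path = os.path.join(tempfile.mkdtemp(), "links.csv")
with open(path, "w", encoding="utf-16") as f:
    f.write("URL\tLink Active?\n")
    f.write("http://example.com/a\tTrue\n")
    f.write("http://example.com/b\tFalse\n")

# sep="\t" handles the tab-separated fields; encoding matches the file.
df = pd.read_csv(path, sep="\t", encoding="utf-16")
print(df)
```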
Get a quick sense of the data (658k rows, here)
See the columns
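A few calls that give this kind of overview, shown on a small stand-in DataFrame (the real one had roughly 658k rows; the "URL" column is an assumption):

```python
import pandas as pd

# Stand-in data for illustration only.
df = pd.DataFrame({
    "URL": ["http://example.com/a", "http://example.com/b", "http://example.com/c"],
    "Link Active?": [True, False, True],
})

print(df.shape)    # (number of rows, number of columns)
print(df.head())   # first five rows
df.info()          # column types and non-null counts
print(df.columns)  # column labels
```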
Filtering
What’s happening there?
df['Link Active?'] is:
1. Checking that whole column for values that are True or
False
2. Returning an array of True/False values
3. This is fast, and lets us filter in an amazing variety of ways
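The filtering code itself is missing from this transcript. A minimal sketch of the idea, using made-up rows, might look like this:

```python
import pandas as pd

# Stand-in data for illustration only.
df = pd.DataFrame({
    "URL": ["http://example.com/a", "http://example.com/b", "http://example.com/c"],
    "Link Active?": [True, False, True],
})

# Comparing a column to a value checks every row at once and
# returns a Series of True/False values (a boolean mask).
mask = df["Link Active?"] == True
print(mask.tolist())  # [True, False, True]

# Passing the mask back into df[...] keeps only the rows marked True.
active = df[mask]
print(len(active))  # 2
```

Masks can be combined with & (and), | (or) and ~ (not), which is where the variety of filters comes from.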
Filtering (again)
We’re probably ready for this one, now:
Example project: Getting data from
SEMRush
Writing your own function
Call our function, get a DataFrame!
Write to disk in case anything goes wrong
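The function from the talk is not in this transcript, so this sketch only illustrates the pattern: wrap the parsing in a function, get a DataFrame back, and save it immediately. The helper name, the semicolon-separated format and the column names are all assumptions, and the text stands in for a real API response:

```python
import io
import os
import tempfile
import pandas as pd

def keywords_to_frame(raw_text):
    """Hypothetical helper: turn semicolon-separated text into a DataFrame."""
    return pd.read_csv(io.StringIO(raw_text), sep=";")

# Made-up response text standing in for whatever the real API returned.
raw = "Keyword;Search Volume\nhotels in texas;1000\ncheap hotels tx;500\n"
df = keywords_to_frame(raw)

# Save a copy straight away so the data survives a crashed session.
out_path = os.path.join(tempfile.mkdtemp(), "keywords.csv")
df.to_csv(out_path, index=False)
print(df)
```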
Reading in multiple files
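One common way to do this, assuming the files share the same columns, is to glob for them and concatenate. The two files here are created on the fly with made-up contents:

```python
import glob
import os
import tempfile
import pandas as pd

# Create two small CSV files to stand in for a folder of exports.
folder = tempfile.mkdtemp()
samples = {
    "a.csv": [("hotels in texas", 1000)],
    "b.csv": [("cheap hotels tx", 500)],
}
for name, rows in samples.items():
    frame = pd.DataFrame(rows, columns=["Keyword", "Search Volume"])
    frame.to_csv(os.path.join(folder, name), index=False)

# Read every matching file and stack the results into one DataFrame.
paths = sorted(glob.glob(os.path.join(folder, "*.csv")))
combined = pd.concat([pd.read_csv(p) for p in paths], ignore_index=True)
print(combined)
```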
Apply custom filters
Drill down into individual words:
Counter() will save you a huge amount of work
Here we wanted to home in on modifier words
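A short sketch of word counting with Counter from the standard library. The keyword phrases are invented for illustration:

```python
from collections import Counter
import pandas as pd

# Made-up search phrases standing in for the real keyword column.
keywords = pd.Series([
    "cheap hotels in texas",
    "luxury hotels in dallas",
    "cheap motels texas",
])

# Split each phrase into words and tally them all in one Counter.
word_counts = Counter(word for phrase in keywords for word in phrase.split())

# The most frequent words, with their counts.
print(word_counts.most_common(3))
```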
More detailed questions
How local are the searches?
Do people search by state code or full name?
Do people search by hotel category?
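Questions like these can often be answered by flagging rows with a text match. This sketch checks a single state only, on invented keywords; the new column names are assumptions:

```python
import pandas as pd

# Made-up search phrases for illustration.
df = pd.DataFrame({
    "Keyword": ["hotels in texas", "cheap hotels tx", "5 star hotels dallas"],
})

# Flag keywords that mention the full state name versus the state code.
df["full_name"] = df["Keyword"].str.contains(r"\btexas\b", case=False)
df["state_code"] = df["Keyword"].str.contains(r"\btx\b", case=False)

# Summing a True/False column counts the matching rows.
print(df[["full_name", "state_code"]].sum())
```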
Second example: Custom Rank Tracking
Charts
Where to begin?
If you don’t know Python, start with those books I shared
earlier.
If you do, check out Python for Data Analysis
Keep Jupyter Notebook open at all times
Experiment!
Questions?

一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 

Data analysis with pandas

  • 2. When you think of Python...
  • 4. And me job_title != “Developer” I’m a Consultant at Distilled (since September 2015) I do build some software in Python But I mainly use it for data analysis
  • 6. Python for scientific computing Huge community Fantastic ecosystem of packages other people have written Can be tedious to actually install everything
  • 8. What is Anaconda? Essentially a large (~400 MB) Python installation But contains everything* you need for data analysis Unless you have a special reason not to, you should just install and use this
  • 9. You need the command line (but only for a minute) On Windows, open PowerShell On Mac, Terminal or iTerm2
  • 10. Just one line, though: 1. Just type “jupyter notebook” 2. Wait 3. ...
  • 12. Open a new Notebook
  • 13. Your very own data analysis environment
  • 14. So that was fairly easy...
  • 15. but why is it better than Excel?
  • 16. There’s not enough room to list everything, but: 1. Handle larger data sets—no set limit on rows 2. Combine multiple files and data sources together instantaneously. Pull data straight from APIs or scraping 3. Everything is completely customisable—if you can imagine a query, it can be done (though not always easily) 4. It’s a safe place to mess things up
  • 17. ...and it’s the perfect playground for learning Python
  • 18. Side note: don’t know any Python?
  • 19. Can’t cover it all today, so go here: 1. Learn Python the Hard Way (free) 2. Real Python ($60, but good) 3. Writing Idiomatic Python (~$15)
  • 20. Unless you’re building applications: 1. Stick with the small building blocks 2. Learn how to write a function (we’ll do this today) 3. Learn about loops, conditional statements, and handling data 4. Probably no need to learn about managing projects and
  • 21. Jupyter Notebook Save notebooks for later Run and re-run Python code Really cool features like post-mortem debugging if you make a mistake
  • 22. Cells 1. Type all the code you want 2. Shift+Enter to run it 3. View the result
  • 23. The Star of the Show: Now we have our Jupyter Notebook up and running, you can start playing around with almost any Python code. We’re going to look at Pandas, though: a data analysis library written in Python. Started its life in finance. Great for fast, flexible computation.
  • 24. A little setup, first You’ll do this more or less at the beginning of each session It’ll become second nature; just import the workhorse libraries we always use: numpy, pandas, pyplot.
  • 25. The DataFrame If you’re used to spreadsheets, the DataFrame isn’t too difficult to understand It’s the fundamental, flexible building block in Pandas
  • 26. At its simplest, it looks rather like a spreadsheet would The only obvious difference with Excel is the column indexes, which are numeric instead of A, B, C...
  • 27. You’ll usually create them from some other source: The Pandas library provides some nice functions for importing from common file formats, so you won’t usually be building “by hand”: 1. pd.read_csv() 2. pd.read_table() 3. pd.read_sql()
  • 28. Reading a CSV: We have so much data stored in CSVs. Our first function call will just read some data into the DataFrame, where we can analyse it.
  • 29. Get help at any time with Shift+Tab
  • 30. 1. pd.read_csv() will read in the data 2. Fields are separated by tabs 3. The encoding is UTF-16 (don’t ask…)
  • 31. Get a quick sense of the data (658k rows, here)
  • 34. What’s happening there? df[‘Link Active?’] is: 1. Checking that whole column for values that are True or False 2. Returning an array of True/False values 3. This is fast, and lets us filter in an amazing variety of ways
  • 36. We’re probably ready for this one, now:
  • 37. Example project: Getting data from SEMRush
  • 38. Writing your own function
  • 39. Call our function, get a DataFrame!
  • 40. Write to disk in case anything goes wrong
  • 43. Drill down into individual words: Counter() will save you a huge amount of work. Here we wanted to home in on modifier words.
  • 44. More detailed questions How local are the searches? Do people search by state code or full name? Do people search by hotel category?
  • 45. Second example: Custom Rank Tracking Charts
  • 46. Where to begin? If you don’t know Python, start with those books I shared earlier. If you do, check out Python for Data Analysis. Keep Jupyter Notebook open at all times. Experiment!
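
Slides 30 and 34 can be tried out with something like the sketch below. The file contents, the file name links.txt, and the URL column are made up for illustration; only pd.read_csv(), the tab separator, the UTF-16 encoding, and the ‘Link Active?’ column come from the slides. It writes a tiny tab-separated UTF-16 file, reads it back, and then filters with the True/False column.

```python
import os
import tempfile

import pandas as pd

# Hypothetical sample data standing in for the real export on the slides.
rows = (
    "URL\tLink Active?\n"
    "http://a.example/\tTrue\n"
    "http://b.example/\tFalse\n"
    "http://c.example/\tTrue\n"
)

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "links.txt")  # hypothetical file name
    with open(path, "w", encoding="utf-16") as f:
        f.write(rows)
    # Fields are separated by tabs and the file is UTF-16, as on slide 30.
    df = pd.read_csv(path, sep="\t", encoding="utf-16")

# Selecting the column returns a Series of True/False values...
mask = df["Link Active?"]
# ...and passing that Series back to the DataFrame keeps only the True rows.
active_links = df[mask]
print(active_links)
```

The same pattern works with any expression that produces a True/False value per row, which is what makes this style of filtering so flexible.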

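For the word-level drill-down on slide 43, a minimal sketch with Counter from the standard library might look like this. The keyword phrases are invented for illustration and are not the data from the talk.

```python
from collections import Counter

# Hypothetical search phrases; the real project pulled keywords from SEMRush.
keywords = [
    "cheap hotels in boston",
    "best hotels boston",
    "cheap boston hotels",
    "luxury hotels in ma",
]

# Split every phrase into words and count how often each word appears.
word_counts = Counter(word for phrase in keywords for word in phrase.split())

# The most frequent words, highest count first.
for word, count in word_counts.most_common(3):
    print(word, count)
```

Words such as "cheap" or "best" then stand out as modifiers once the obvious head terms are set aside.
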
Editor's Notes

  1. We think of an IDE but, to oversimplify, there are two main workflows. There’s this one, which I also use
  2. Exploratory data analysis Loading data from somewhere, cleaning and preparing it,