SlideShare a Scribd company logo
1 of 20
Download to read offline
Prediction of the direction of Bitcoin
Price using “Web Search” Data
Presented By:
Group 9: Pengfei Zhao, Rohit Khurana, Haixin Wang
ST 502
Under the guidance of:
Dr Ana-Maria Staicu
Stephanie Chen
Problem Statement &
Motivation
01
Data Exploration, Cleaning &
Aggregation
02
Experiment Design &
Parameter Fitting
03
Results, Recommendations
& Future Enhancements04
Problem Statement &
Motivation
Motivation & Problem Statement
• How to predict direction for tomorrow’s bitcoin price using public
sentiment & Emotions?
• One of Bitcoin’s uniqueness's is that its price highly relies on people’s
opinion and attitude instead of institutionalized money regulation.
Data Exploration, Cleaning &
Aggregation
Association Rule for Bitcoin
increase profit
Bitcoin
Future
USD
value
Price
China
https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2018/3601-2018.pdf
Data Exploration & Cleaning
• Bitcoin Prices (daily prices)
• Last 5 years
• Date
• Opening Price
• Highest Price
• Lowest Price
• Closing Price
• Volume
• Market Cap
https://data.bitcoinity.org/markets/volum
e/30d?c=e&t=b
• Emotions/ Web Search Data
(weekly)
• Last 5 years
• Date
• “String”
• Increase
• Future
• Bitcoin
• USD
• Bitcoin Price
• Normalized Data (0-100)
• Global Data
https://trends.google.com/trends/explore?d
ate=all&q=bitcoin,increase,future
Data Aggregation
• Date Merged based upon the date
• Bitcoin Price data for the day (T) has been merged with Web Search Data of
T-1.
Association of Bitcoin Price Various strings
Vs Increase_Str Vs Bitcoin_Str
Vs USD_Str
Vs Future_Str Vs Bitcoin_Price_Str
Merged Data
• Date column has been dropped
• ‘Direction’ Column has been added based upon the returns/ change of price
Experiment Design &
Parameter Fitting
Linear Regression
• Y – Closing Price
• X1 – Bitcoin_str
• X2 - USD_str
• X3 - Bitcoin_price_str
Logistics Regression
Y – Direction (1 – up, 0-down)
X1 – Bitcoin_str
X2 - USD_str
X3 - Bitcoin_price_str
Y – Direction (1 – up, 0-down)
X1 - USD_str
X2 - Bitcoin_price_str
• Beta0 = 1.79, Beta1 = -0.16, Beta2 = 0.87
Distribution Fitting of Independent Variables
By MLE Parameters
Usd_Str – Normal Distribution Bitcoin_price_Str – Beta Distribution
AD test, p-value: 0.06 AD Test, Beta p-value: 0.01
Results, Recommendations &
Future Enhancements
Sampling from the Existing Data (Non-Parametric BS)
Beta0.hat from MC Beta1.hat from MC Beta2.hat from MC
n=100
n=200
Sampling from the Existing Data (Non-Parametric BS)
Parameters Fitted n=100 n=200
CI Fitted CI Mean SD CI Mean SD
Beta0 (0.05, 3.60) 1.80 (-7.36, 1.81) -2.00 2.40 (-0.63, 4.65) 1.89 1.35
Beta1 (-0.26, -0.06) -0.16 (-0.16, 0.30) 0.04 0.124 (-0.35, -0.03) -0.17 0.08
Beta2 (0.57, 1.21) 0.87 (0.07, 1.40) 0.46 0.35 (0.46, 1.57) 0.92 0.29
Conclusion
• Learnings
• Usage of Logistic Regression & Non-Parametric BS for prediction of bitcoin price movement and its
dependencies on Web Search Data
• With large sample size, the average length of 95% CI of the slopes becomes smaller and standard
error decreases
• Business Outcome
• In order to predict the bitcoin price, count of following strings from the Web Search Data can be use
as per logistic regression model
• USD
• Bitcoin Price
• Limitations
• Limited Data, i.e. weekly data
• Could have back tested to ascertain the claim
• Distribution Fitting to lognormal / transformed data might give better result
• Association of Price to the Google trends
• Usage of ARIMA for predicting the price
References
Data Reference
● https://cran.r-project.org/web/packages/gtrendsR/gtrendsR.pdf
● https://data.bitcoinity.org/markets/volume/30d?c=e&t=b
● https://trends.google.com/trends/explore?date=all&q=bitcoin,increase,future
● https://www.google.com/trends/correlate/search?e=bitcoin&t=weekly&p=us
Business Reference
● https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-
proceedings/2018/3601-2018.pdf
● https://www.reddit.com/r/CryptoCurrency/comments/aaacft/google_trends_vs_bitcoin_pri
ce_interesting/
● https://www.chepicap.com/en/news/3336/can-you-predict-price-raises-by-looking-at-
search-trends-check-out-this-chart.html
● https://www.google.com/trends/correlate/whitepaper.pdf
Thank you
Question?

More Related Content

Recently uploaded

Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotecAbortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
fztigerwe
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Stephen266013
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
ppy8zfkfm
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 

Recently uploaded (20)

Sensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
Sensing the Future: Anomaly Detection and Event Prediction in Sensor NetworksSensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
Sensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
 
123.docx. .
123.docx.                                 .123.docx.                                 .
123.docx. .
 
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotecAbortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic information
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptx
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancing
 
社内勉強会資料  Mamba - A new era or ephemeral
社内勉強会資料   Mamba - A new era or ephemeral社内勉強会資料   Mamba - A new era or ephemeral
社内勉強会資料  Mamba - A new era or ephemeral
 

Featured

Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 

Forecast bitcoin price direction

  • 1. Prediction of the direction of Bitcoin Price using “Web Search” Data Presented By: Group 9: Pengfei Zhao, Rohit Khurana, Haixin Wang ST 502 Under the guidance of: Dr Ana-Maria Staicu Stephanie Chen
  • 2. Problem Statement & Motivation 01 Data Exploration, Cleaning & Aggregation 02 Experiment Design & Parameter Fitting 03 Results, Recommendations & Future Enhancements04
  • 4. Motivation & Problem Statement • How to predict direction for tomorrow’s bitcoin price using public sentiment & Emotions? • One of Bitcoin’s uniqueness's is that its price highly relies on people’s opinion and attitude instead of institutionalized money regulation.
  • 6. Association Rule for Bitcoin increase profit Bitcoin Future USD value Price China https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2018/3601-2018.pdf
  • 7. Data Exploration & Cleaning • Bitcoin Prices (daily prices) • Last 5 years • Date • Opening Price • Highest Price • Lowest Price • Closing Price • Volume • Market Cap https://data.bitcoinity.org/markets/volum e/30d?c=e&t=b • Emotions/ Web Search Data (weekly) • Last 5 years • Date • “String” • Increase • Future • Bitcoin • USD • Bitcoin Price • Normalized Data (0-100) • Global Data https://trends.google.com/trends/explore?d ate=all&q=bitcoin,increase,future
  • 8. Data Aggregation • Date Merged based upon the date • Bitcoin Price data for the day (T) has been merged with Web Search Data of T-1.
  • 9. Association of Bitcoin Price Various strings Vs Increase_Str Vs Bitcoin_Str Vs USD_Str Vs Future_Str Vs Bitcoin_Price_Str
  • 10. Merged Data • Date column has been dropped • ‘Direction’ Column has been added based upon the returns/ change of price
  • 12. Linear Regression • Y – Closing Price • X1 – Bitcoin_str • X2 - USD_str • X3 - Bitcoin_price_str
  • 13. Logistics Regression Y – Direction (1 – up, 0-down) X1 – Bitcoin_str X2 - USD_str X3 - Bitcoin_price_str Y – Direction (1 – up, 0-down) X1 - USD_str X2 - Bitcoin_price_str • Beta0 = 1.79, Beta1 = -0.16, Beta2 = 0.87
  • 14. Distribution Fitting of Independent Variables By MLE Parameters Usd_Str – Normal Distribution Bitcoin_price_Str – Beta Distribution AD test, p-value: 0.06 AD Test, Beta p-value: 0.01
  • 16. Sampling from the Existing Data (Non-Parametric BS) Beta0.hat from MC Beta1.hat from MC Beta2.hat from MC n=100 n=200
  • 17. Sampling from the Existing Data (Non-Parametric BS) Parameters Fitted n=100 n=200 CI Fitted CI Mean SD CI Mean SD Beta0 (0.05, 3.60) 1.80 (-7.36, 1.81) -2.00 2.40 (-0.63, 4.65) 1.89 1.35 Beta1 (-0.26, -0.06) -0.16 (-0.16, 0.30) 0.04 0.124 (-0.35, -0.03) -0.17 0.08 Beta2 (0.57, 1.21) 0.87 (0.07, 1.40) 0.46 0.35 (0.46, 1.57) 0.92 0.29
  • 18. Conclusion • Learnings • Usage of Logistic Regression & Non-Parametric BS for prediction of bitcoin price movement and its dependencies on Web Search Data • With large sample size, the average length of 95% CI of the slopes becomes smaller and standard error decreases • Business Outcome • In order to predict the bitcoin price, count of following strings from the Web Search Data can be use as per logistic regression model • USD • Bitcoin Price • Limitations • Limited Data, i.e. weekly data • Could have back tested to ascertain the claim • Distribution Fitting to lognormal / transformed data might give better result • Association of Price to the Google trends • Usage of ARIMA for predicting the price
  • 19. References Data Reference ● https://cran.r-project.org/web/packages/gtrendsR/gtrendsR.pdf ● https://data.bitcoinity.org/markets/volume/30d?c=e&t=b ● https://trends.google.com/trends/explore?date=all&q=bitcoin,increase,future ● https://www.google.com/trends/correlate/search?e=bitcoin&t=weekly&p=us Business Reference ● https://www.sas.com/content/dam/SAS/support/en/sas-global-forum- proceedings/2018/3601-2018.pdf ● https://www.reddit.com/r/CryptoCurrency/comments/aaacft/google_trends_vs_bitcoin_pri ce_interesting/ ● https://www.chepicap.com/en/news/3336/can-you-predict-price-raises-by-looking-at- search-trends-check-out-this-chart.html ● https://www.google.com/trends/correlate/whitepaper.pdf