Fields of Gold
Scraping Web Data
for Marketing Insights
Boegershausen, Datta, Borah, and Stephen (2022)
A Wealth of Data for Marketing Research
is Created on the Internet
~ 244m reviews
> 1b reviews & opinions
556K projects
500m/day
7:11 hours: time spent online per day by the average American consumer
85%: proportion of US consumers that use the Internet every single day
(Based on available company and market research statistics in May 2022)
Web Scraping & APIs Can be Used to Extract Web Data at Scale
APIs … allow programmatic access to the internal databases or algorithms of data providers
Example articles: Tellis et al. (2019); Toubia and Stephen (2013)
Web Scraping … the process of developing software to automatically collect information displayed in a web browser
Example articles: Chevalier and Mayzlin (2006); Ludwig et al. (2013)
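To make the distinction concrete, here is a minimal sketch in Python that uses only the standard library. The HTML fragment, the JSON payload, and all names (`SAMPLE_HTML`, `SAMPLE_JSON`, `ReviewParser`, `scrape_reviews`, `reviews_from_api`) are hypothetical and do not come from the deck or from any real website or API. The scraping path has to recover fields from markup meant for display, while the API path receives the same fields already structured.

```python
import json
from html.parser import HTMLParser

# Hypothetical page fragment, standing in for what a browser would display.
SAMPLE_HTML = """
<div class="review"><span class="rating">4</span><p class="text">Great product</p></div>
<div class="review"><span class="rating">2</span><p class="text">Stopped working</p></div>
"""

# Hypothetical API payload carrying the same information in structured form.
SAMPLE_JSON = (
    '{"reviews": [{"rating": 4, "text": "Great product"},'
    ' {"rating": 2, "text": "Stopped working"}]}'
)


class ReviewParser(HTMLParser):
    """Collects rating/text pairs from elements with class 'rating' or 'text'."""

    def __init__(self):
        super().__init__()
        self.reviews = []
        self._field = None  # name of the field currently being read, if any

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if css_class == "review":
            self.reviews.append({})  # start a new review record
        elif css_class in ("rating", "text") and self.reviews:
            self._field = css_class

    def handle_data(self, data):
        value = data.strip()
        if self._field and value:
            # Ratings are stored as integers, review text as a string.
            self.reviews[-1][self._field] = int(value) if self._field == "rating" else value
            self._field = None


def scrape_reviews(html):
    """Web scraping: recover the data from markup written for display."""
    parser = ReviewParser()
    parser.feed(html)
    return parser.reviews


def reviews_from_api(payload):
    """API: the provider returns the data in a documented, structured format."""
    return json.loads(payload)["reviews"]
```

In this toy case both routes yield the same records. In practice the scraper depends on the page layout staying stable, which is one reason the validity concerns discussed in the deck matter.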
This Data Collection Technique can be Used in a Variety of Settings
Pathway ①: Studying new phenomena (e.g., Zervas et al. (2017); Datta et al. (2018))
Pathway ②: Boosting ecological value (e.g., Du et al. (2015); Ludwig et al. (2013))
Pathway ③: Facilitating methodological advancement (e.g., Netzer et al. (2012); Liu et al. (2020))
Pathway ④: Improving measurement (e.g., Li et al. (2017); Datta et al. (2022))
Collecting Valid Web Data Poses Many Challenges…
Validity concerns may arise from:
• Failing to capture contextual information in a rapidly changing environment (e.g., updates to the website’s data-generating process, such as changes to how and where information is displayed)
• Not sufficiently aligning the psychological processes of interest with the frequency of data extraction on review platforms (e.g., the collected information does not capture the time when the behavior occurred)
• Overlooking the influence of algorithmic interference on e-commerce websites (e.g., the effect of personalization algorithms on information display)
• …and many more.
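One way to address the first two concerns is to store context alongside every extracted record. The sketch below is an illustration under stated assumptions, not a procedure from the deck: the function names `make_record` and `page_changed` and the record layout are hypothetical. It saves the retrieval time in UTC and a hash of the raw page, so that later analysis can separate when the data were collected from when the behavior occurred, and can flag pages whose content differs between two collection runs.

```python
import hashlib
from datetime import datetime, timezone


def make_record(url, html, extracted):
    """Wrap extracted fields with the context needed to audit them later."""
    return {
        "url": url,
        # Time of extraction, which may differ from when the behavior occurred.
        "retrieved_at": datetime.now(timezone.utc).isoformat(),
        # Fingerprint of the raw page as it was displayed at extraction time.
        "page_sha256": hashlib.sha256(html.encode("utf-8")).hexdigest(),
        "data": extracted,
    }


def page_changed(earlier, later):
    """Coarse signal: True if the raw page differs between two extractions."""
    return earlier["page_sha256"] != later["page_sha256"]
```

The hash comparison is deliberately coarse: it reacts to any change in the page, including new content, so a flagged page still needs a manual look to tell a layout change from ordinary updates.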
How to Extract Valid Web Data?
Validity
Technical feasibility
Legal and ethical risks
1. Source Selection
2. Collection Design
3. Data Extraction
- Jointly consider validity concerns, alongside technical and legal/ethical questions
- Selected examples and solutions
  - Collecting user data from social networks may infringe upon users’ privacy rights → anonymize user IDs
  - Product review data may be biased by personalization algorithms → check whether own browsing behavior affects information display
  - Extraction of all of the information from a website may take too long → consider taking a sample
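Two of the solutions above, anonymizing user IDs and taking a sample, can be sketched in a few lines of Python. This is a minimal illustration with hypothetical names (`anonymize_user_id`, `sample_pages`) and a hypothetical key and URL list. It uses a keyed hash (HMAC-SHA256) as one possible way to replace user IDs; the deck does not prescribe a specific technique.

```python
import hashlib
import hmac
import random


def anonymize_user_id(user_id, secret_key):
    """Replace a user ID with a keyed hash so records from one user stay linkable."""
    return hmac.new(secret_key, user_id.encode("utf-8"), hashlib.sha256).hexdigest()


def sample_pages(urls, k, seed=2022):
    """Draw a reproducible simple random sample of pages to extract."""
    rng = random.Random(seed)  # fixed seed so the sample can be re-created
    return rng.sample(urls, min(k, len(urls)))


# Hypothetical usage: the key and URLs are placeholders.
key = b"replace-with-a-secret-kept-outside-the-dataset"
pseudonym = anonymize_user_id("user_12345", key)
pages = sample_pages([f"https://example.com/product/{i}" for i in range(1000)], k=50)
```

Strictly speaking, a keyed hash is pseudonymization rather than full anonymization: anyone holding the key can recompute the mapping, so the key should be stored separately from the data or discarded. A simple random sample is also only one option; whether it suits a given project depends on the validity considerations the slide lists.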
Want to get started collecting and using web data?
Read the paper, and visit https://web-scraping.org.
o Explore a database with 300+ published marketing articles using web data & get inspired!
o Discover web datasets & APIs for your research projects.
o Find tutorials and example code for collecting web data using web scraping & APIs.
