This document describes how to summarize sales data by state and time period. It involves transforming customer and sales records to key-value pairs with state, joining on customer ID, and then reducing by state and time period (hourly, daily, monthly, yearly) to sum the sales amounts. The results are unioned and saved as a text file with the state, year, month, day, hour and total sales for each time period. Techniques like broadcasting the smaller customer dataset, using a partitioner based on state, and persisting datasets are discussed to reduce shuffling during the joins and reductions.
The document provides an overview of search engine marketing (SEM) and search engine optimization (SEO). It defines key SEM concepts like paid search, pay-per-click advertising, and discusses how SEM campaigns can be optimized. It also covers SEO best practices like on-page and off-page optimization factors and how to structure content to affect search engine rankings. Resources for learning more about SEM, SEO, and related topics are listed at the end.
This document discusses how Groupon uses big data and machine learning to optimize search engine marketing campaigns at scale. Key points include:
- Groupon runs marketing campaigns across multiple platforms that are automatically refreshed daily based on deals. Machine learning is used to optimize the process.
- Billions of performance records like bids, impressions, clicks and purchases must be managed for thousands of keywords across locations. Analyzing this data at scale using traditional systems can take many minutes.
- Groupon developed a new architecture using Cassandra, Redis, and external indexing to enable insights on keywords and performance to be retrieved in under a second, compared to 5-6 minutes previously. This allows for much faster optimization of campaigns.
Put Your Money Where Your Mouth Is - Pay Per Click Advertising - A Case Study...The Creative Collective
What would you say if we were to tell you that rather than throwing valuable marketing and advertising dollars out there month after month in the hope that the right people will see your advertisement or marketing and take some action, that THERE IS A WAY TO KNOW the right people will see your advertisement for sure, and that you won't pay a dime unless they actually notice and click on your ad? Sound good? Welcome to the world of pay per click advertising!
At our Web Wednesday event, an exciting line up of knowledgeable panellists took attendees through the ins and outs of pay per click advertising as it applies to Google Adwords and Facebook.
SEO with Google Analytics - Organic KeywordsTobias Kraeft
This document discusses combining Google Search Console and Google Analytics data to better understand organic search traffic and conversions. It proposes:
1. Exporting keyword, device, landing page and other data from GSC and GA daily into BigQuery for storage and analysis.
2. Combining the datasets in BigQuery to attribute conversions to specific keywords and landing pages, and split traffic into brand vs non-brand categories.
3. Using DataStudio to provide interfaces for exploring, filtering and automatically updating the combined analytics data.
This approach aims to provide more granular keyword-level insights than is possible with Google Analytics alone, after Google stopped reporting organic keyword data directly.
Presentation to Foster Tech Club (MBA student organization) on Search 101 - and advertising with Google - the largest media company on earth.
What are Adwords? How does PPC search work? What fuels search engines?
This document provides an overview of Adwords tools and scripts. It discusses the differences between Adwords scripts and the API, and outlines various use cases for scripts. Examples of scripts are presented, including a campaign generator, anomaly detector, settings change detector, ad group duplicator, and call tracking tools. Additional sources for useful Adwords scripts are also listed.
Google Analytics Website Optimizer Slidesharetmg_ltd
How to use web optimiser to test multiple designs and identify which is the best performing. The use of analytics and testing is a vital part of any web strategy and takes the guess work out of identifying design that will work for you.
The document discusses Google Analytics and Google Website Optimizer tools for analyzing website traffic and optimizing website content. It provides an overview of using profiles and filters in Google Analytics to segment traffic data. It also covers setting up goals and funnels to track conversions. For Google Website Optimizer, it describes how to design A/B tests to compare variations of content and evaluate which performs better. The overall purpose is to help users plan, measure, and improve their websites through data-driven optimization.
The document provides an overview of search engine marketing (SEM) and search engine optimization (SEO). It defines key SEM concepts like paid search, pay-per-click advertising, and discusses how SEM campaigns can be optimized. It also covers SEO best practices like on-page and off-page optimization factors and how to structure content to affect search engine rankings. Resources for learning more about SEM, SEO, and related topics are listed at the end.
This document discusses how Groupon uses big data and machine learning to optimize search engine marketing campaigns at scale. Key points include:
- Groupon runs marketing campaigns across multiple platforms that are automatically refreshed daily based on deals. Machine learning is used to optimize the process.
- Billions of performance records like bids, impressions, clicks and purchases must be managed for thousands of keywords across locations. Analyzing this data at scale using traditional systems can take many minutes.
- Groupon developed a new architecture using Cassandra, Redis, and external indexing to enable insights on keywords and performance to be retrieved in under a second, compared to 5-6 minutes previously. This allows for much faster optimization of campaigns.
Put Your Money Where Your Mouth Is - Pay Per Click Advertising - A Case Study...The Creative Collective
What would you say if we were to tell you that rather than throwing valuable marketing and advertising dollars out there month after month in the hope that the right people will see your advertisement or marketing and take some action, that THERE IS A WAY TO KNOW the right people will see your advertisement for sure, and that you won't pay a dime unless they actually notice and click on your ad? Sound good? Welcome to the world of pay per click advertising!
At our Web Wednesday event, an exciting line up of knowledgeable panellists took attendees through the ins and outs of pay per click advertising as it applies to Google Adwords and Facebook.
SEO with Google Analytics - Organic KeywordsTobias Kraeft
This document discusses combining Google Search Console and Google Analytics data to better understand organic search traffic and conversions. It proposes:
1. Exporting keyword, device, landing page and other data from GSC and GA daily into BigQuery for storage and analysis.
2. Combining the datasets in BigQuery to attribute conversions to specific keywords and landing pages, and split traffic into brand vs non-brand categories.
3. Using DataStudio to provide interfaces for exploring, filtering and automatically updating the combined analytics data.
This approach aims to provide more granular keyword-level insights than is possible with Google Analytics alone, after Google stopped reporting organic keyword data directly.
Presentation to Foster Tech Club (MBA student organization) on Search 101 - and advertising with Google - the largest media company on earth.
What are Adwords? How does PPC search work? What fuels search engines?
This document provides an overview of Adwords tools and scripts. It discusses the differences between Adwords scripts and the API, and outlines various use cases for scripts. Examples of scripts are presented, including a campaign generator, anomaly detector, settings change detector, ad group duplicator, and call tracking tools. Additional sources for useful Adwords scripts are also listed.
Google Analytics Website Optimizer Slidesharetmg_ltd
How to use web optimiser to test multiple designs and identify which is the best performing. The use of analytics and testing is a vital part of any web strategy and takes the guess work out of identifying design that will work for you.
The document discusses Google Analytics and Google Website Optimizer tools for analyzing website traffic and optimizing website content. It provides an overview of using profiles and filters in Google Analytics to segment traffic data. It also covers setting up goals and funnels to track conversions. For Google Website Optimizer, it describes how to design A/B tests to compare variations of content and evaluate which performs better. The overall purpose is to help users plan, measure, and improve their websites through data-driven optimization.
The document discusses Google Analytics and Google Website Optimizer tools for analyzing website traffic and optimizing website content. It provides an overview of using profiles and filters in Google Analytics to segment traffic data. It also covers setting up goals and funnels to track conversions. For Google Website Optimizer, it describes how to design A/B tests to compare variations of content and evaluate which performs better. The overall purpose is to help users plan, measure, and improve their websites through data-driven optimization.
[SMX ADVANCED - BERLIN 2019] Automate & Scale STR Treasure Hunt - Marco Fri...Marco Frighetto
Search term reports optimisation can be a tedious process. A PPC Specialist will invest many hours do dive in into each campaigns trying to extract the most and worst performing keywords. In this presentation, we will share how the heavy lifting of the Search Term report can be automated and how technology can help to identify trends invisible to the human eye.
SMX Advanced Europe 2019 Automate and Scale Google Ads Search Term ReportBooster Box
Search term reports optimization can be a tedious process. A PPC Specialist will invest many hours do dive in into each campaigns trying to extract the most and worst performing keywords. In this presentation, we will share how the heavy lifting of the Search Term report can be automated and how technology can help to identify trends invisible to the human eye.
This document discusses search engine marketing (SEM) strategies, including pay-per-click (PPC) advertising on Google networks. It covers the different ad formats available on Google like text, display, video, and shopping ads. It also discusses factors that affect paid search results like ad rank, domain authority, bids, and quality score. Additionally, it provides guidance on keyword selection, match types, bidding strategies, budgeting, landing page optimization, and A/B testing.
Google Analytics (GA) is a Web Analytic tool, that helps a user analyze the traffic to their website site. It is free, requires no installation and easy to setup. This presentation illustrates how to setup an account, understand how GA works, the dashboard, Key Performance Index (KPI), and Balance Score Card (BSC).
This document discusses how artificial intelligence and machine learning are being applied in pay-per-click advertising. It provides definitions of AI and machine learning, and gives examples of how they are used in areas like automated bid management, ad rotation, and opportunities in the Google Ads interface. The document also outlines several automated bidding options in Google Ads like Target CPA, Target ROAS, and Enhanced CPC that utilize machine learning. It notes how these options can analyze contextual signals to optimize bids in real-time.
Better conversion with Intelligent AnalyticsTim Stewart
As presented at #BrightonSEO and #SearchElite Manchester September 2017
Uploaded version is the longer deck as used for #SASCon 2017 - examples and use cases for Google Analytics features and tools to make more from the data you can collect.
This document provides an overview of artificial intelligence and machine learning applications in pay-per-click advertising. It defines AI and machine learning, and gives examples of how they are used in automated bid management, attribution models, ad rotation settings, and new AdWords experiences. The document also discusses future AI uses in Google Optimize and attribution, and reviews several AdWords automated bidding options like Target CPA, Target ROAS, and Enhanced CPC. It notes that machine learning uses contextual signals to optimize bids. In closing, it mentions coming AI applications to BingAds and new attribution modeling features in AdWords.
This document provides an overview of search engine marketing (SEM) and how to boost website presence and build a brand online. It discusses key aspects of SEM including search engine optimization (SEO), pay-per-click (PPC) advertising, keyword research, optimizing web pages, submitting sites to search engines, and measuring success. The document also covers setting up and monitoring PPC ad campaigns and balancing SEM and SEO strategies.
This document summarizes a presentation on search marketing. It includes the following key points:
1. Search marketing includes both paid search engine marketing (PPC) and search engine optimization (SEO). PPC involves bidding on search keywords, while SEO improves natural search rankings.
2. Google is the dominant search engine, accounting for over 90% of searches in many European countries. Search will represent nearly half of all online advertising spending by 2010.
3. Paid search allows for precision targeting of consumers actively seeking information on specific topics. Advertisers bid on keywords and pay only when ads are clicked.
4. Running a successful paid search campaign involves choosing relevant keywords, writing effective ad copy,
The document discusses using Google's API and services to optimize AdWords campaigns. It covers using the TargetingIdeaService to generate keyword ideas and get statistics on existing keywords. It also discusses using the TrafficEstimatorService to estimate traffic and costs for new campaigns. Finally, it discusses how to optimize existing campaigns by analyzing report data and replacing underperforming keywords.
Back to SEO/SEM Basics - Search Marketing Math for Fun and ProfitDavid Langrock
SEO and SEM are over 20 years old, and lots has changed over the years. But the fundamental KPIs, analytical practices, and basic economics of Search Marketing are all the same. Learn/relearn some of the oldest tricks and maybe some new ones as we take a mathematical tour through a handful or two of entertaining and useful search scenarios.
Originally presented at the Seattle Search Meetup on January 10, 2018
1. The document discusses search analytics and tracking visitors across various platforms like search engines, websites, and applications.
2. It highlights the need for tools that can track user behavior throughout their journey, from search to clicks to purchases, and verify that online ad campaigns are properly executing.
3. Challenges mentioned include the difficulty in measuring performance for search engine optimization and paid search campaigns, and ensuring ads appear where and when specified for paid search.
Gianluca Binelli — Advanced PPC: Create Your Own Automated Bid StrategiesSemrush
These slides were presented during the SEMrush webinar 'Advanced PPC #3: Create Your Own Automated Bid Strategies'. You can access the video and the transcript here >>> https://www.semrush.com/webinars/advanced-ppc-3-create-your-own-automated-bid-strategies/
Digital marketing road map for a real estate company in indiaJishnu Brahma
This document outlines a digital marketing roadmap for a real estate company in India. It includes:
1. An analysis of potential digital lead sources like Google AdWords, Facebook, email marketing and their quality.
2. A strategy for testing different channels, measuring results, and optimizing based on cost per lead and quality.
3. Details on implementing specific tactics like search and display ads, landing pages, chat support, CRM integration, and closed-loop marketing to improve leads.
The goal is to generate quality leads across multiple channels in a cost-effective way and funnel them through the sales process. Testing and measurement is key to optimizing the strategy over time.
Search Ads 360 is a tool that allows users to centralize, automate, and optimize search marketing campaigns at scale. It includes features like unified reporting, strategic bid optimization, and maximizing retail performance. Users can manage campaigns across multiple search engines and platforms. Search Ads 360 uses machine learning to enhance performance through features like predictive modeling, portfolio bidding, and automated bid strategies. It also allows integration of offline data sources and custom metrics to improve optimization.
Learnings from Successful Jobs SearchersBruce Bennett
Are you interested to know what actions help in a job search? This webinar is the summary of several individuals who discussed their job search journey for others to follow. You will learn there are common actions that helped them succeed in their quest for gainful employment.
A Guide to a Winning Interview June 2024Bruce Bennett
This webinar is an in-depth review of the interview process. Preparation is a key element to acing an interview. Learn the best approaches from the initial phone screen to the face-to-face meeting with the hiring manager. You will hear great answers to several standard questions, including the dreaded “Tell Me About Yourself”.
The document discusses Google Analytics and Google Website Optimizer tools for analyzing website traffic and optimizing website content. It provides an overview of using profiles and filters in Google Analytics to segment traffic data. It also covers setting up goals and funnels to track conversions. For Google Website Optimizer, it describes how to design A/B tests to compare variations of content and evaluate which performs better. The overall purpose is to help users plan, measure, and improve their websites through data-driven optimization.
[SMX ADVANCED - BERLIN 2019] Automate & Scale STR Treasure Hunt - Marco Fri...Marco Frighetto
Search term reports optimisation can be a tedious process. A PPC Specialist will invest many hours do dive in into each campaigns trying to extract the most and worst performing keywords. In this presentation, we will share how the heavy lifting of the Search Term report can be automated and how technology can help to identify trends invisible to the human eye.
SMX Advanced Europe 2019 Automate and Scale Google Ads Search Term ReportBooster Box
Search term reports optimization can be a tedious process. A PPC Specialist will invest many hours do dive in into each campaigns trying to extract the most and worst performing keywords. In this presentation, we will share how the heavy lifting of the Search Term report can be automated and how technology can help to identify trends invisible to the human eye.
This document discusses search engine marketing (SEM) strategies, including pay-per-click (PPC) advertising on Google networks. It covers the different ad formats available on Google like text, display, video, and shopping ads. It also discusses factors that affect paid search results like ad rank, domain authority, bids, and quality score. Additionally, it provides guidance on keyword selection, match types, bidding strategies, budgeting, landing page optimization, and A/B testing.
Google Analytics (GA) is a Web Analytic tool, that helps a user analyze the traffic to their website site. It is free, requires no installation and easy to setup. This presentation illustrates how to setup an account, understand how GA works, the dashboard, Key Performance Index (KPI), and Balance Score Card (BSC).
This document discusses how artificial intelligence and machine learning are being applied in pay-per-click advertising. It provides definitions of AI and machine learning, and gives examples of how they are used in areas like automated bid management, ad rotation, and opportunities in the Google Ads interface. The document also outlines several automated bidding options in Google Ads like Target CPA, Target ROAS, and Enhanced CPC that utilize machine learning. It notes how these options can analyze contextual signals to optimize bids in real-time.
Better conversion with Intelligent AnalyticsTim Stewart
As presented at #BrightonSEO and #SearchElite Manchester September 2017
Uploaded version is the longer deck as used for #SASCon 2017 - examples and use cases for Google Analytics features and tools to make more from the data you can collect.
This document provides an overview of artificial intelligence and machine learning applications in pay-per-click advertising. It defines AI and machine learning, and gives examples of how they are used in automated bid management, attribution models, ad rotation settings, and new AdWords experiences. The document also discusses future AI uses in Google Optimize and attribution, and reviews several AdWords automated bidding options like Target CPA, Target ROAS, and Enhanced CPC. It notes that machine learning uses contextual signals to optimize bids. In closing, it mentions coming AI applications to BingAds and new attribution modeling features in AdWords.
This document provides an overview of search engine marketing (SEM) and how to boost website presence and build a brand online. It discusses key aspects of SEM including search engine optimization (SEO), pay-per-click (PPC) advertising, keyword research, optimizing web pages, submitting sites to search engines, and measuring success. The document also covers setting up and monitoring PPC ad campaigns and balancing SEM and SEO strategies.
This document summarizes a presentation on search marketing. It includes the following key points:
1. Search marketing includes both paid search engine marketing (PPC) and search engine optimization (SEO). PPC involves bidding on search keywords, while SEO improves natural search rankings.
2. Google is the dominant search engine, accounting for over 90% of searches in many European countries. Search will represent nearly half of all online advertising spending by 2010.
3. Paid search allows for precision targeting of consumers actively seeking information on specific topics. Advertisers bid on keywords and pay only when ads are clicked.
4. Running a successful paid search campaign involves choosing relevant keywords, writing effective ad copy,
The document discusses using Google's API and services to optimize AdWords campaigns. It covers using the TargetingIdeaService to generate keyword ideas and get statistics on existing keywords. It also discusses using the TrafficEstimatorService to estimate traffic and costs for new campaigns. Finally, it discusses how to optimize existing campaigns by analyzing report data and replacing underperforming keywords.
Back to SEO/SEM Basics - Search Marketing Math for Fun and ProfitDavid Langrock
SEO and SEM are over 20 years old, and lots has changed over the years. But the fundamental KPIs, analytical practices, and basic economics of Search Marketing are all the same. Learn/relearn some of the oldest tricks and maybe some new ones as we take a mathematical tour through a handful or two of entertaining and useful search scenarios.
Originally presented at the Seattle Search Meetup on January 10, 2018
1. The document discusses search analytics and tracking visitors across various platforms like search engines, websites, and applications.
2. It highlights the need for tools that can track user behavior throughout their journey, from search to clicks to purchases, and verify that online ad campaigns are properly executing.
3. Challenges mentioned include the difficulty in measuring performance for search engine optimization and paid search campaigns, and ensuring ads appear where and when specified for paid search.
Gianluca Binelli — Advanced PPC: Create Your Own Automated Bid StrategiesSemrush
These slides were presented during the SEMrush webinar 'Advanced PPC #3: Create Your Own Automated Bid Strategies'. You can access the video and the transcript here >>> https://www.semrush.com/webinars/advanced-ppc-3-create-your-own-automated-bid-strategies/
Digital marketing road map for a real estate company in indiaJishnu Brahma
This document outlines a digital marketing roadmap for a real estate company in India. It includes:
1. An analysis of potential digital lead sources like Google AdWords, Facebook, email marketing and their quality.
2. A strategy for testing different channels, measuring results, and optimizing based on cost per lead and quality.
3. Details on implementing specific tactics like search and display ads, landing pages, chat support, CRM integration, and closed-loop marketing to improve leads.
The goal is to generate quality leads across multiple channels in a cost-effective way and funnel them through the sales process. Testing and measurement is key to optimizing the strategy over time.
Search Ads 360 is a tool that allows users to centralize, automate, and optimize search marketing campaigns at scale. It includes features like unified reporting, strategic bid optimization, and maximizing retail performance. Users can manage campaigns across multiple search engines and platforms. Search Ads 360 uses machine learning to enhance performance through features like predictive modeling, portfolio bidding, and automated bid strategies. It also allows integration of offline data sources and custom metrics to improve optimization.
Learnings from Successful Jobs SearchersBruce Bennett
Are you interested to know what actions help in a job search? This webinar is the summary of several individuals who discussed their job search journey for others to follow. You will learn there are common actions that helped them succeed in their quest for gainful employment.
A Guide to a Winning Interview June 2024Bruce Bennett
This webinar is an in-depth review of the interview process. Preparation is a key element to acing an interview. Learn the best approaches from the initial phone screen to the face-to-face meeting with the hiring manager. You will hear great answers to several standard questions, including the dreaded “Tell Me About Yourself”.
Resumes, Cover Letters, and Applying OnlineBruce Bennett
This webinar showcases resume styles and the elements that go into building your resume. Every job application requires unique skills, and this session will show you how to improve your resume to match the jobs to which you are applying. Additionally, we will discuss cover letters and learn about ideas to include. Every job application requires unique skills so learn ways to give you the best chance of success when applying for a new position. Learn how to take advantage of all the features when uploading a job application to a company’s applicant tracking system.
Jill Pizzola's Tenure as Senior Talent Acquisition Partner at THOMSON REUTERS...dsnow9802
Jill Pizzola's tenure as Senior Talent Acquisition Partner at THOMSON REUTERS in Marlton, New Jersey, from 2018 to 2023, was marked by innovation and excellence.
Joyce M Sullivan, Founder & CEO of SocMediaFin, Inc. shares her "Five Questions - The Story of You", "Reflections - What Matters to You?" and "The Three Circle Exercise" to guide those evaluating what their next move may be in their careers.
Job Finding Apps Everything You Need to Know in 2024SnapJob
SnapJob is revolutionizing the way people connect with work opportunities and find talented professionals for their projects. Find your dream job with ease using the best job finding apps. Discover top-rated apps that connect you with employers, provide personalized job recommendations, and streamline the application process. Explore features, ratings, and reviews to find the app that suits your needs and helps you land your next opportunity.
5 Common Mistakes to Avoid During the Job Application Process.pdfAlliance Jobs
The journey toward landing your dream job can be both exhilarating and nerve-wracking. As you navigate through the intricate web of job applications, interviews, and follow-ups, it’s crucial to steer clear of common pitfalls that could hinder your chances. Let’s delve into some of the most frequent mistakes applicants make during the job application process and explore how you can sidestep them. Plus, we’ll highlight how Alliance Job Search can enhance your local job hunt.
Leadership Ambassador club Adventist modulekakomaeric00
Aims to equip people who aspire to become leaders with good qualities,and with Christian values and morals as per Biblical teachings.The you who aspire to be leaders should first read and understand what the ambassador module for leadership says about leadership and marry that to what the bible says.Christians sh
IT Career Hacks Navigate the Tech Jungle with a RoadmapBase Camp
Feeling overwhelmed by IT options? This presentation unlocks your personalized roadmap! Learn key skills, explore career paths & build your IT dream job strategy. Visit now & navigate the tech world with confidence! Visit https://www.basecamp.com.sg for more details.
IT Career Hacks Navigate the Tech Jungle with a Roadmap
Intuit0810
1. ABOUT ME
➤ EE/CS/CB
➤ BS EE, Robotics, Shanghai Univ
➤ MS Computational Biology,
School of Computer Science, CMU
2. ABOUT ME
Tech~
➤ Large Genome Dataset ~ Open Big
data world
➤ Interested in data and open source
➤ Familiar with Hadoop ecosystem
and various open source data tools
➤ Insight Data Engineer Fellow
3. ABOUT ME
Non-tech~
➤ Enjoy most of the indoor non-
physical activities ~
➤ Like drawing small picture stories
to record fun facts happen around
➤ Like dressing little yellow and
observing his love life
5. MOTIVATION
➤ Every website, every application with search engine can do
search advertisement business
url:searchad.fun or http://107.23.229.58/
Ads
Ads
6. SEARCH ADVERTISEMENT RANKING
“Don’t be evil.” —— Google
How do these ads being selected?
Quality Bid Price
Relevancy P-Click
Ad
Search
Result
url:searchad.fun or http://107.23.229.58/
7. ONLINE SEARCH AD PLATFORM
➤ Quality Score = Relevancy * Click_Prediction (p-click)
➤ Final Ranking Score = Quality Score * Bid Price
Search Query
batch ETL
product data
user click log
Top ads
“black chair”
Ad Engine
url:searchad.fun or http://107.23.229.58/
8. ETL
➤ Clean raw product data - store in MySql
➤ Train word2vector - generate synonyms - store in Memcached
➤ Build inverted index in Memcached
➤ Process log file to extract features
➤ Process log file to generate regression model
9. CTR FEATURES
➤ pClick = probability of click/estimated Click Through Rate
(CTR)
➤ pClick Features extracted from search log and stored in key-
value store
log: Device IP, Device id,Session id,Query,AdId,CampaignId,Ad_category_Query_category(0/1),clicked(0/1)
10. REALTIME MONITORING
➤ Let advertiser know …
➤ Can be extend to monitor activities in other system
url:searchad.fun or http://107.23.229.58/
11. REALTIME MONITORING
➤ Log events
Ad Impression Events
Ad Click Events
Chance Events
AD Server
url:searchad.fun or http://107.23.229.58/
12. ANALYTICS DASHBOARD
➤ Query by one product category
➤ Query by one campaign (an ad set follows similar strategy)
Top Key Words Cloud Top Winning Brands Average Winning Bid
Top Impression Ads Top Click Ads Top CTR Ads
Current Budget Current spend Total Impression
Total Click Cost per Click Click/Impression
url:searchad.fun or http://107.23.229.58/
13. FURTHER CHALLENGE - ONLINE SYSTEM
Scaling up
➤ Sharding by campaignId
➤ Select top k in distributed way
➤ More replicas
A/B Testing
➤ Test different algorithms, give 10% load to server with
alternative algorithm
14. FURTHER CHALLENGE - REAL TIME MONITOR
➤ Log raw records to S3
➤ Other charge option(CPM) - optimize counting
15. FURTHER CHALLENGE
➤ Feed the real time click stream to update the prediction model
➤ Fast reaction to some special event
➤ More click, more income (advertiser charged by cost per click)
25. OVERVIEW
➤ Problem
➤ Transform -> Join -> Reduce By State and Time
Customer information
<customer_id, name, street, city,state,zip>
123#AAA Inc#1 First Ave Mountain View CA#94040
Sales information
<timestamp,customer_id,sales_price>
1454313600#123#123456
State Sales
<State, Time, Total Sales>
State - hourly sales
State - daily sales
State - monthly sales
State - yearly sales
26. BASIC IDEA - TRANSFORM TO PAIR ADD
➤ Map Transformation - customer_rdd
➤ Map Transformation - sales_rdd
Customer information
<customer_id, name, street, city,state,zip>
123#AAA Inc#1 First Ave Mountain View CA#94040
(customer_id, state)
Sales information
<timestamp,customer_id,sales_price>
1454313600#123#123456
(customer_id, (year, month, date, hour, price))
27. BASIC IDEA - JOIN TWO RDD
➤ Join customer and sales on customer id
(customer_id, state)(customer_id, (year, month, date, hour, price))
(customer_id, (year, month, date, hour, price), state)
28. BASIC IDEA - REDUCE BY STATE AND TIME
➤ Extract
(customer_id, (year, month, date, hour, price), state)
Hourly: (state#year#month#date#hour, price))
Daily: (state#year#month#date##, price))
Monthly: (state#year#month###, price))
yearly: (state#year####, price))
29. BASIC IDEA - REDUCE BY STATE AND HOUR
➤ Reduce by state and hourly time to sum the sales
""" key: AL#2017#08#01#09 “""
hourly = joined_rdd.map(hourly_state_records).reduceByKey(lambda v1,v2)
pair: (customer_id, (year, month, date, hour, price), state) -> (state#year#month#date#hour, price)
34. (JOIN) IF BOTH DATASET CANNOT FIT INTO MEMORY
➤ Hard to avoid shuffling without prior knowledge
➤ Take advantage of pre-existing data distribution - like sorted
➤ Many missing values in inner join - pre-filter