A CLOSER LOOK AT BOTS
A Vindico Investigation – 1Q2014
What’s a Bot?
• An Internet bot is a software application that runs automated
tasks over the Internet
• Bots can be used for good (search indexing) or bad (ad
impressions, hacking, etc.)
• Reports now indicate there is more bot traffic than human traffic
on the Internet
• There are 3 main ‘types’ of bots:
• Crawler/Spider
• Covert Crawler
• Zombie Computers (Botnet)
• Bad bots are impacting the video advertising industry
Bot: Crawler / Spider
• USES: Automated data collection, indexing
• HARDWARE: Typically runs on a cluster of Virtual Machines (VM) on servers located in a
datacenter
• ACTIVITY: Generally just makes ‘GET’ requests to static webpages and analyzes responses
for links, content, etc. Crawlers/spiders do not render the webpage in a browser
• EXAMPLES: GoogleBot, BingBot
• DETECTION: These bots usually identify themselves in their user-agent string
• ADS: Typically would not render an ad. In addition, these bots are almost always on the
IAB Bot List, and MRC-accredited ad servers exclude them from impression counts by
leveraging the fact that they identify themselves in their user-agent string
• VIEWABILITY: Not Applicable (ads not rendered, impressions filtered)
Benign
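The user-agent filtering described on this slide can be sketched roughly as follows. This is a minimal illustration, not the IAB list or any ad server's actual logic: the pattern list, `is_declared_bot`, and `count_valid_impressions` are hypothetical names introduced here.

```python
import re

# Hypothetical, abbreviated pattern list for illustration only. A real ad
# server would load a maintained industry bot list instead of hard-coding it.
KNOWN_BOT_PATTERNS = ["googlebot", "bingbot", "spider"]

_BOT_RE = re.compile(
    "|".join(re.escape(p) for p in KNOWN_BOT_PATTERNS), re.IGNORECASE
)


def is_declared_bot(user_agent: str) -> bool:
    """Return True when the user-agent string matches a known bot pattern."""
    return bool(_BOT_RE.search(user_agent or ""))


def count_valid_impressions(user_agents) -> int:
    """Count impressions whose user agent does not declare itself as a bot."""
    return sum(1 for ua in user_agents if not is_declared_bot(ua))
```

A check like this only catches bots that announce themselves. A bot presenting an ordinary browser user-agent string passes it unchanged.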
Bot: Covert Crawler
• USES: Generally malicious – associated with ad fraud, spam, hacking, scraping
• HARDWARE: Typically runs on a cluster of Virtual Machines (VM) on servers located in a
datacenter
• ACTIVITY: Mimics a human with full browsing and rendering behavior (plugins, cookies,
user-agent, mouse movement, time delays, engagement with pages of the site)
• EXAMPLES: Client Connections Media, VERSA*, DDC*
• ADS: Attempts to trick ad tracking systems so it registers as a true impression. These
crawlers do not identify themselves. In fact, they use a variety of real user-agent strings
that are indistinguishable from those of real users
• VIEWABILITY: Both geometric and browser optimization approaches to viewability will
register ads as viewable
Generally Malicious
*Source: detailed within this deck
Bot: Zombie Computer (Botnet)
Real machines ‘infected’ with software (‘virus,’ ‘worm,’ ‘malware’) that allows
a remote party to take control of various parts of the system.
• USES: Malicious – associated with ad fraud, hacking (bank accounts, emails, credit cards),
Bitcoin Mining, Ransomware
• HARDWARE: Can take over any PC, smartphone, or device. Typically created for
Windows (PC) and Android (mobile) environments, but not limited to those
• ACTIVITY: ‘Borrows’ the user’s machine, processing power, or Internet connection / IP
(as a proxy) to open invisible browser windows, load sites/ads, and snoop on users.
Replicates over the network
• EXAMPLES: CryptoLocker, ZeuS, TDSS, ZeroAccess, ASPROX
• ADS: Attempts to trick ad tracking systems so the operators get paid. Uses real user
machines and inherits real user IP addresses, real user-agent strings, cookies, etc.
• VIEWABILITY: Exploits geometric viewability flaws
Malicious
Bots Have Negative Impact on Video Advertising
• Ad fraud has become an incredibly lucrative business for bot operators, especially with
the rise of online video where CPMs are much higher and detection capabilities have
historically been much lower.
• This has caused two major trends in the industry over the past 2 years:
• The number of impressions has skyrocketed
• CPMs have decreased
• The two parties that are negatively impacted the most are advertisers and real
publishers.
• Middlemen are still able to make their margin, but lower CPMs force them to use the
(cheaper) fraudulent inventory sources, which in turn continue to feed the beast and
grow the problem.
Source: Vindico Adtricity, Q1 2014; Annual Estimate based on $15 CPM
What Vindico Bot Detection Uncovered
Using the Adtricity system we’ve identified the top 700,000 bots and zombie machines
(botnets) over Q1 2014.
• Initial launch will focus on the top 50% of Bots:
• 11.23% of all Vindico-Adtricity VPAID Imps in Q1
• 7.9B Vindico-Adtricity Bot Impressions in Q1
• $76 million* in fraud in Q1 alone, just in US online video.
• 66% of bot impressions were from ‘zombie computers’; 34% were from
‘covert crawlers’
• Affected Advertisers
• Avg: 10.06% of impressions
• Highest: 52.66% of impressions
• Breakdown by Publishers:
• Highest: 50.9% of impressions
• Media Companies: <2% of impressions
• Networks: 24% of impressions
*Estimate based on $15 CPM
The number of bots and the number of impressions affected
are both rising (see Q1 trend graph above)
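The headline figures on this slide can be cross-checked with some simple arithmetic. The sketch below assumes that the 11.23% share and the 7.9B bot impressions describe the same Q1 population, which the slide implies but does not state outright. All variable names are illustrative.

```python
# Figures quoted on the slide.
BOT_IMPRESSIONS_Q1 = 7.9e9   # Vindico-Adtricity bot impressions in Q1
BOT_SHARE = 0.1123           # 11.23% of all VPAID impressions
ZOMBIE_SHARE = 0.66          # share of bot impressions from zombie computers
COVERT_SHARE = 0.34          # share of bot impressions from covert crawlers
FRAUD_USD_Q1 = 76e6          # estimated fraud, US online video only
CPM_USD = 15.0               # cost per thousand impressions used for estimates

# Implied total VPAID impressions (assumption: same population as the share).
total_impressions = BOT_IMPRESSIONS_Q1 / BOT_SHARE

# Split of bot impressions by bot type.
zombie_impressions = BOT_IMPRESSIONS_Q1 * ZOMBIE_SHARE
covert_impressions = BOT_IMPRESSIONS_Q1 * COVERT_SHARE

# Impressions implied by the dollar estimate at the stated CPM.
impressions_behind_fraud_estimate = FRAUD_USD_Q1 / CPM_USD * 1000
```

Under these assumptions the total comes to roughly 70 billion impressions, with about 5.2 billion bot impressions from zombie computers and about 2.7 billion from covert crawlers. The $76 million estimate corresponds to roughly 5.1 billion impressions at a $15 CPM, which is smaller than 7.9B. That gap is presumably because the dollar figure covers only US online video, though the slide does not spell this out.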
Exposing Bots: Covert Crawlers
Covert Crawler ‘Versa’
• Stats: 55 million imps / month = $825k / month*
• Total Sites: 5 Core with at least 100 total
• Notes: Sites share the same template, fake display ads, tokenized URLs, VMs spoofing
user agents, exact cap on ads per IP, rotated screen resolutions, etc.
Distributed Data Center
• 150 million imps / month = $2.2 million / month*
• Top Sites: techbrowsing.com (1/2 the size of all Versa), anchorfree.us,
recipeaccess.com
• Total Sites: 15 – 20 Core with at least 100 total
• 7-10 Core datacenters
• Examples: Host Protocol, EGIHosting, MyPrivateProxy.net, GIGLINX, Alentus,
ManageDNS
Generally Malicious
*Estimate based on $15 CPM
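The monthly dollar estimates on this slide follow directly from the $15 CPM assumption in the footnote. A small sketch of the arithmetic, where `monthly_spend` is a hypothetical helper name introduced here:

```python
CPM_USD = 15.0  # cost per thousand impressions, as stated in the footnote


def monthly_spend(impressions_per_month: float, cpm_usd: float = CPM_USD) -> float:
    """Convert a monthly impression count into dollars at a given CPM."""
    return impressions_per_month / 1000 * cpm_usd


versa_usd = monthly_spend(55_000_000)    # the 'Versa' covert crawler
ddc_usd = monthly_spend(150_000_000)     # the Distributed Data Center
```

This gives $825,000 per month for Versa and $2.25 million per month for the Distributed Data Center, which the slide shows as $2.2 million.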
Exposing Bots: Botnets
The Asprox / Kuluoz Botnet
• Currently this botnet is extremely active
• Current main method of initial infection: malware-phishing emails
• WhatsApp Message (via a link)
• Notice to Appear in Court (via an attachment)
• Once installed, it follows the below chain to PPC networks *:
*Source: techhelplist.com
Malicious
Exposing Bots: Reality v. Perception
[Figure: side-by-side panels labeled ‘Reality’ and ‘Perception’]
*Source: techhelplist.com
How to Fight Bots in Video Advertising
Viewability alone is not enough.
• Bots can fool viewability
• Good viewability vendors will record bot impressions as non-viewable, but some bots can
manipulate viewability metrics for the campaign
Bot filtering alone is not enough.
• 1x1 iframes can still be manipulated
Bot filtering + viewability is not enough.
• Certain sites and measurements can be manipulated (e.g. porn sites, player size, etc.)
A combination of multiple metrics, including viewability, execution,
content, and traffic, is the only way to truly protect ad dollars and grow
the ecosystem to the point where it can truly complement TV for brand
advertisers.
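The argument that no single metric is sufficient can be illustrated with a small sketch in which an impression counts only when every signal passes. This is a hypothetical model, not Vindico's implementation: the `ImpressionSignals` fields are one possible reading of the four metric names on the slide, and `is_valid_impression` is an assumed helper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ImpressionSignals:
    """Hypothetical per-impression signals, one per metric named on the slide."""
    viewable: bool      # viewability measurement passed
    executed: bool      # ad actually executed in a real player (assumed meaning)
    content_ok: bool    # surrounding content is acceptable to the advertiser
    traffic_ok: bool    # traffic was not flagged by bot filtering


def is_valid_impression(s: ImpressionSignals) -> bool:
    """Accept an impression only when all four signals pass."""
    return s.viewable and s.executed and s.content_ok and s.traffic_ok


# Passes bot filtering and viewability, yet still fails on content.
example = ImpressionSignals(viewable=True, executed=True,
                            content_ok=False, traffic_ok=True)
```

Requiring all signals together means an impression that fools one or two measurements is still rejected by the others.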
How Vindico Helps
There are 3 strategic components to our Detection System:
1. Data Collection
• 40% of all online videos. More data points than anyone else.
2. Data Processing
• Big Data.
• Adtricity servers process over 1 million events every minute.
• This data has to be logged, loaded, and ready for analysis in real time.
• Even Hadoop, the best-known Big Data framework, was not
enough.
• Adtricity utilizes a cutting-edge Big Data framework called Spark.
3. Data Analysis
• More data than a human could ever analyze.
• Adtricity uses cognitive thinking (artificial intelligence) through machine
learning to detect and block bots in real time.
Adtricity is a comprehensive measure of quality
offering a standardized and transparent system of
measurement to the industry. Adtricity brings together
viewability and verification into a single solution.
Conclusion
Bots have infiltrated the video advertising industry and are increasing in scale and impressions at an alarming rate.
Vindico’s Bot Detection technology was developed to help advertisers combat fraudulent activity in video advertising. Bot detection is most powerful when
part of a buy-side platform as it is organically integrated from the point of delivery and can be used across the full scope of the advertiser’s buy.
Appendix: Top 100 Domains Affected by Bots (pg1)
• sekindo.com
• menscraft.com
• recipegroove.com
• menswheels.com
• tonightsrecipe.com
• tophomegardens.com
• outfox.tv
• sportsfave.com
• allsportshub.com
• videolulu.com
• sportsidea.com
• recipeaccess.com
• automotiveboss.com
• suggestrecipe.com
• clipsgo.com
• sportspond.com
• beautytrend.tv
• athletesvenue.com
• everymansfitness.com
• expertbites.com
Appendix: Top 100 Domains Affected by Bots (pg2)
• trendyidea.com
• sportsflare.com
• cookingniche.com
• sportsadvise.com
• cooltraveller.com
• athleticsplay.com
• financeknow.com
• hobbymind.com
• homesinspiration.com
• loveablehomes.com
• sheglamour.com
• glamourvibe.com
• bettermotorcars.com
• plantingforum.com
• outstandingvacations.com
• recipegrandma.com
• financesadviser.com
• fitnesstrue.com
• leisurenook.com
• journeyexplorer.com
Appendix: Top 100 Domains Affected by Bots (pg3)
• travellersdirect.com
• motorcarsplus.com
• cliptimes.com
• kitchensview.com
• womenschatter.com
• fancyrides.com
• fitnesswow.com
• culinaryswap.com
• growersgreen.com
• cookingkudos.com
• femalevogue.com
• motherhoodchic.com
• babywhat.com
• currenciesforum.com
• culinaryflare.com
• craftseasy.com
• planterstime.com
• womenhour.com
• insiderfoodie.com
• lifestyleanswer.com
Appendix: Top 100 Domains Affected by Bots (pg4)
• magazinebaby.com
• kitchencuisines.com
• plantersforum.com
• cookingmogul.com
• tastekitchens.com
• lifestyleselection.com
• gardenleisures.com
• travelconnoisseurs.com
• extendgame.com
• sportsmansmag.com
• athleticsleague.com
• beautykittens.com
• chefspoon.com
• travelleralert.com
• leisurelocator.com
• sportsthrive.com
• medicineshub.com
• greenflourish.com
• athleticsinteractive.com
• clipsindex.com
Appendix: Top 100 Domains Affected by Bots (pg5)
• lifestylereader.com
• sportscompete.com
• cookinghours.com
• travellerstube.com
• sportscircular.com
• womenconcierge.com
• athleteman.com
• leisuretourist.com
• travelleradventures.com
• carsmenu.com
• athleteinsight.com
• athletestoday.com
• gardenswise.com
• sportsrevealed.com
• sightscenes.com
• foodsac.com
• sportsyards.com
• makeupbag.tv
• womenvenue.com
• leisureadventure.com
Bot detection deck 042514 final

Bot detection deck 042514 final

  • 1.
  • 2. A CLOSER LOOK AT BOTS A Vindico Investigation – 1Q2014
  • 3. What’s a Bot? • An Internet bot is a software application that runs automated tasks over the Internet • Bots can be used for good (search indexing) or bad (ad impressions, hacking, etc.) • Reports now indicate there is more bot traffic than human traffic on the Internet • There are 3 main ‘types’ of bots: • Crawler/Spider • Covert Crawler • Zombie Computers (Botnet) • Bad bots are impacting the video advertising industry Crawler/Spider Covert Crawler Zombie Computers
  • 4. Bot: Crawler / Spider • USES: Automated data collection, indexing • HARDWARE: Typically runs on a cluster of Virtual Machines (VM) on servers located in a datacenter • ACTIVITY: Generally just makes ‘GET’ requests to static webpages and analyzes responses for links, content, etc. Crawler/spiders do not render the webpage in a browser • EXAMPLES: GoogleBot, BingBot • DETECTION: These bots usually identify themselves in their user-agent string • ADS: Typically would not render an ad. In addition, these bots are almost always on the IAB Bot List and are excluded in impression accounts for MRC accredited ad servers by leveraging the fact that they identify themselves in their user agent string • VIEWABILITY: Not Applicable (ads not rendered, impressions filtered) Benign
  • 5. Bot: Covert Crawler • USES: Generally malicious – associated with ad fraud, spam, hacking, scraping • HARDWARE: Typically runs on a cluster of Virtual Machines (VM) on servers located in a datacenter • ACTIVITY: Mimics a human with full browsing and rendering behavior (plugins, cookies, user-agent, mouse movement, time delays, engage with pages of site) • EXAMPLES: Client Connections Media, VERSA*, DDC* • ADS: Attempts to trick ad tracking systems so it registers as a true impression. These crawlers do not identify themselves. In fact, they use a variety of real user-agent strings that are undistinguishable from real users • VIEWABILITY: Both geometric and browser optimization approaches to viewability will think ads are viewable Generally Malicious *Source: detailed within this deck
  • 6. Bot: Zombie Computer (Botnet) Real machines ‘infected’ with software (‘virus,’ ‘worm,’ ‘malware’) that allows a remote party to take control of various parts of the system. • USES: Malicious – associated with ad fraud, hacking (bank accounts, emails, credit cards), Bitcoin Mining, Ransomware • HARDWARE: Can take over any PC, smart phone, or device. Typically created for Windows (PC) and Android (mobile) environments, but not limited to those • ACTIVITY: ‘Borrows’ users’ machine, processing or Internet / IP as a proxy, for opening invisible browser windows and loading sites/ads, snooping on users. Replication over network • EXAMPLES: CryptoLocker, ZeuS, TDSS, ZeroAccess, ASPROX • ADS: Attempt to trick ad tracking systems so they get paid. Use real user machines, inherit real user IP addresses, real user agent strings, cookies, etc. • VIEWABILITY: Exploits geometric viewability flaws Malicious
  • 7. Bots Have Negative Impact on Video Advertising • Ad fraud has become an incredibly lucrative business for bot operators, especially with the rise of online video where CPMs are much higher and detection capabilities have historically been much lower. • This has caused two major trends in the industry over the past 2 years: • Number of impressions to skyrocket • CPMs to decrease • The two parties that are negatively impacted the most are advertisers and real publishers. • Middle men are still able to make their margin, but lower CPMs force them to use the (cheaper) fraudulent inventory sources, which therefore continue to feed the beast and grow the problem. Soure: Vindico Adtricity, Q1 2014; Annual Estimate based on $15 CPM
  • 8. What Vindico Bot Detection Uncovered Using the Adtricity system, we’ve identified the top 700,000 bots and zombie machines (botnets) over Q1 2014. • Initial launch will focus on the top 50% of Bots: • 11.23% of all Vindico-Adtricity VPAID Imps in Q1 • 7.9B Vindico-Adtricity Bot Impressions in Q1 • $76 million* in fraud in Q1 alone, just in US online video. • 66% of bot impressions were from ‘zombie computers’; 34% were from ‘covert crawlers’ • Affected Advertisers • Avg: 10.06% of impressions • Highest: 52.66% of impressions • Breakdown by Publishers: • Highest: 50.9% of impressions • Media Companies: <2% of impressions • Networks: 24% of impressions *Estimate based on $15 CPM The number of bots and the number of impressions affected are both rising (see Q1 trend graph above)
  • 9. Exposing Bots: Covert Crawlers Covert Crawler ‘Versa’ • Stats: 55 million imps / month = $825k / month* • Total Sites: 5 Core with at least 100 total • Notes: Sites are same template, fake display ads, tokenized urls, VMs spoofing user agents, exact amount of caps ads / IP, rotated screen resolutions, etc. Distributed Data Center • 150 million imps / month = $2.2 million / month* • Top Sites: techbrowsing.com (1/2 the size of all Versa), anchorfree.us, recipeaccess.com • Total Sites: 15 – 20 Core with at least 100 total • 7-10 Core datacenters • Examples: Host Protocol, EGIHosting, MyPrivateProxy.net, GIGLINX, Alentus%, ManageDNS Generally Malicious *Estimate based on $15 CPM
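The monthly dollar figures on slide 9 follow from the deck's stated $15 CPM assumption (CPM is the cost per 1,000 impressions). The arithmetic can be reproduced with a minimal sketch; the function name `cpm_revenue` is hypothetical and is not part of any Vindico tooling.

```python
def cpm_revenue(impressions: int, cpm_usd: float = 15.0) -> float:
    """Estimated spend in USD for a number of impressions at a given CPM.

    CPM is the price per 1,000 impressions; the $15 default is the
    estimate stated in the deck's footnote, not a measured value.
    """
    return impressions / 1000 * cpm_usd


# Monthly impression counts quoted on slide 9.
versa = cpm_revenue(55_000_000)
distributed_data_center = cpm_revenue(150_000_000)

print(f"Versa: ${versa:,.0f} / month")
print(f"Distributed Data Center: ${distributed_data_center:,.0f} / month")
```

At $15 CPM, 55 million impressions works out to $825,000 per month, matching the slide, and 150 million impressions works out to $2,250,000, which the slide rounds to $2.2 million.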
  • 10. Exposing Bots: Botnets The Asprox / Kuluoz Botnet • Currently this botnet is extremely active • Current main method of initial infection: malware-phishing emails • WhatsApp Message (via a link) • Notice to Appear in Court (via an attachment) • Once installed, it follows the below chain to PPC networks *: *Source: techhelplist.com Malicious
  • 11. Exposing Bots: Reality v. Perception [Figure: Reality vs. Perception] *Source: techhelplist.com
  • 12. How to Fight Bots in Video Advertising Viewability alone is not enough. • Bots can fool viewability • Good viewability vendors will record bot impressions as non-viewable, but some bots can manipulate viewability metrics for the campaign Bot filtering alone is not enough. • 1x1 iframes can still be manipulated Bot filtering + viewability is not enough. • Certain sites and measurements can be manipulated (e.g., porn sites, player size, etc.) A combination of multiple metrics, including viewability, execution, content, and traffic, is the only way to truly protect ad dollars and grow the ecosystem to the point where it can truly complement TV for brand advertisers.
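The point of slide 12 is that no single check is sufficient, so an impression should count only when every signal agrees. A minimal sketch of that idea follows; the `Impression` type, its four boolean fields, and both function names are hypothetical illustrations of the four metrics named on the slide, not Adtricity's actual data model or logic.

```python
from dataclasses import asdict, dataclass


@dataclass
class Impression:
    """Hypothetical per-impression signals, one per metric on the slide."""
    viewable: bool    # viewability measurement passed
    executed: bool    # the ad actually executed in the player
    content_ok: bool  # the surrounding content is acceptable
    traffic_ok: bool  # the traffic source was not flagged as a bot


def failed_checks(imp: Impression) -> list[str]:
    """Return the names of the signals that failed, in field order."""
    return [name for name, passed in asdict(imp).items() if not passed]


def is_valid(imp: Impression) -> bool:
    """An impression counts only if every signal passes."""
    return not failed_checks(imp)


# A bot that fools viewability but arrives from flagged traffic still fails.
suspect = Impression(viewable=True, executed=True, content_ok=True, traffic_ok=False)
print(is_valid(suspect), failed_checks(suspect))
```

Requiring all signals to pass is the simplest possible combination rule; a production system would more plausibly weight or score the signals, which the deck does not describe.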
  • 13. How Vindico Helps There are 3 strategic components to our Detection System: 1. Data Collection • 40% of all online videos. More data points than anyone else. 2. Data Processing • Big Data. • Adtricity servers process over 1 million events every minute. • This data has to be logged, loaded, and ready for analysis in real time. • Even Hadoop, the most well-known Big Data framework, was not enough. • Adtricity utilizes a cutting-edge Big Data framework called Spark. 3. Data Analysis • More data than a human could ever analyze. • Adtricity uses cognitive thinking (artificial intelligence) through machine learning to detect and block bots in real time. Adtricity is a comprehensive measure of quality offering a standardized and transparent system of measurement to the industry. Adtricity brings together viewability and verification into a single solution.
  • 14. Conclusion Bots have infiltrated the video advertising industry and are increasing in scale and impressions at an alarming rate. Vindico’s Bot Detection technology was developed to help advertisers combat fraudulent activity in video advertising. Bot detection is most powerful when part of a buy-side platform, as it is organically integrated from the point of delivery and can be used across the full scope of the advertiser’s buy.
  • 15. Appendix: Top 100 Domains Affected by Bots (pg1) • sekindo.com • menscraft.com • recipegroove.com • menswheels.com • tonightsrecipe.com • tophomegardens.com • outfox.tv • sportsfave.com • allsportshub.com • videolulu.com • sportsidea.com • recipeaccess.com • automotiveboss.com • suggestrecipe.com • clipsgo.com • sportspond.com • beautytrend.tv • athletesvenue.com • everymansfitness.com • expertbites.com
  • 16. Appendix: Top 100 Domains Affected by Bots (pg2) • trendyidea.com • sportsflare.com • cookingniche.com • sportsadvise.com • cooltraveller.com • athleticsplay.com • financeknow.com • hobbymind.com • homesinspiration.com • loveablehomes.com • sheglamour.com • glamourvibe.com • bettermotorcars.com • plantingforum.com • outstandingvacations.com • recipegrandma.com • financesadviser.com • fitnesstrue.com • leisurenook.com • journeyexplorer.com
  • 17. Appendix: Top 100 Domains Affected by Bots (pg3) • travellersdirect.com • motorcarsplus.com • cliptimes.com • kitchensview.com • womenschatter.com • fancyrides.com • fitnesswow.com • culinaryswap.com • growersgreen.com • cookingkudos.com • femalevogue.com • motherhoodchic.com • babywhat.com • currenciesforum.com • culinaryflare.com • craftseasy.com • planterstime.com • womenhour.com • insiderfoodie.com • lifestyleanswer.com
  • 18. Appendix: Top 100 Domains Affected by Bots (pg4) • magazinebaby.com • kitchencuisines.com • plantersforum.com • cookingmogul.com • tastekitchens.com • lifestyleselection.com • gardenleisures.com • travelconnoisseurs.com • extendgame.com • sportsmansmag.com • athleticsleague.com • beautykittens.com • chefspoon.com • travelleralert.com • leisurelocator.com • sportsthrive.com • medicineshub.com • greenflourish.com • athleticsinteractive.com • clipsindex.com
  • 19. Appendix: Top 100 Domains Affected by Bots (pg5) • lifestylereader.com • sportscompete.com • cookinghours.com • travellerstube.com • sportscircular.com • womenconcierge.com • athleteman.com • leisuretourist.com • travelleradventures.com • carsmenu.com • athleteinsight.com • athletestoday.com • gardenswise.com • sportsrevealed.com • sightscenes.com • foodsac.com • sportsyards.com • makeupbag.tv • womenvenue.com • leisureadventure.com