Harnessing Big Data to Better Serve Your Audience - Core Audience / iCrossingiCrossing
Understanding big data is important. But mastering audience data creates break-throughs for CMOs. During a December 5 Direct Marketing Association webinar, "Owning Your Audience: Harnessing Big Data to Better Serve Your Audience," Core Audience CEO Adam Lavelle helped marketing executives understand how to apply big data to optimize audience relationships, not channels, especially with the use of a data management platform (DMP).
Webinar Audience Insight in the Real-time EraiCrossing
How well do you know your customer? On March 6, iCrossing and our guest Forrester Research hosted a webinar that gave attendees the tools to better understand your audience across the customer journey. For more information visit www.icrossing.com
Before The Moment of Intent: Utilizing Content to Shape The Buyer's Journeyjohn harris
Marketers often focus on the moment of intent—that time period when the consumer is ready to move forward and take action. Maybe it’s a phone call, maybe it’s an in-store visit or maybe just a specific search query. But the consumer’s journey does not start at the moment of intent. It begins many steps back with needs, desires and interest. To reach these consumers at their pre-intent phases, brands need to develop a content marketing strategy that accounts for the consumer’s mindset while in the interest and discovery phases, where content focus is less confined.
Best Practices from the World's Most Social Big BrandsSprinklr
Sprinklr has assembled over 20 of the leading enterprise social practitioners (we call them the “All-Star Team”) from some of the world’s most social brands and put their advice in one eBook.
What you will learn:
- How the biggest, most social brands align teams and functions for impact
- Strategies for successfully being social at scale
- Guidelines for policies, security, and governance
- Insights into metrics and ROI across geographies and divisions
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Retargeting is one of the most effective means of reaching and converting visitors to your site, but it's even more effective when augmented with data mined from social media resources.
"From Likes to Loyalty: The Power of Social Media"Ma Foi Analytics
This paper, based on investigation of distributed writing in the Indian market, expects to highlight the developing significance of social media for advertisers in India by aggregating how a few brands are now harvesting advantages from their ventures and moving beyond just “likes” and “followers”
Harnessing Big Data to Better Serve Your Audience - Core Audience / iCrossingiCrossing
Understanding big data is important. But mastering audience data creates break-throughs for CMOs. During a December 5 Direct Marketing Association webinar, "Owning Your Audience: Harnessing Big Data to Better Serve Your Audience," Core Audience CEO Adam Lavelle helped marketing executives understand how to apply big data to optimize audience relationships, not channels, especially with the use of a data management platform (DMP).
Webinar Audience Insight in the Real-time EraiCrossing
How well do you know your customer? On March 6, iCrossing and our guest Forrester Research hosted a webinar that gave attendees the tools to better understand your audience across the customer journey. For more information visit www.icrossing.com
Before The Moment of Intent: Utilizing Content to Shape The Buyer's Journeyjohn harris
Marketers often focus on the moment of intent—that time period when the consumer is ready to move forward and take action. Maybe it’s a phone call, maybe it’s an in-store visit or maybe just a specific search query. But the consumer’s journey does not start at the moment of intent. It begins many steps back with needs, desires and interest. To reach these consumers at their pre-intent phases, brands need to develop a content marketing strategy that accounts for the consumer’s mindset while in the interest and discovery phases, where content focus is less confined.
Best Practices from the World's Most Social Big BrandsSprinklr
Sprinklr has assembled over 20 of the leading enterprise social practitioners (we call them the “All-Star Team”) from some of the world’s most social brands and put their advice in one eBook.
What you will learn:
- How the biggest, most social brands align teams and functions for impact
- Strategies for successfully being social at scale
- Guidelines for policies, security, and governance
- Insights into metrics and ROI across geographies and divisions
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Retargeting is one of the most effective means of reaching and converting visitors to your site, but it's even more effective when augmented with data mined from social media resources.
"From Likes to Loyalty: The Power of Social Media"Ma Foi Analytics
This paper, based on investigation of distributed writing in the Indian market, expects to highlight the developing significance of social media for advertisers in India by aggregating how a few brands are now harvesting advantages from their ventures and moving beyond just “likes” and “followers”
Internet advertising an interplay among advertisers,online publishers, ad exc...Trieu Nguyen
Internet Advertising An Interplay among Advertisers,Online Publishers, Ad Exchanges and Web Users, Computational Advertising, contextual advertising, sponsored search, user behaviour targeting
The following report contains statements that are forwardlooking, including expectations and predictions regarding future industry trends and developments. Actual results may differ materially from our expectations or projections. This report also contains opinions, estimates, and forwardlooking statements by industry leaders.
Summer Internship Project Report | The Fundamental of Digital Marketing | Lov...Kodexhub
Summer Internship Project Report on ✨The Fundamental of Digital Marketing ✨ With 6% Plagiarism | MRX.
Submitted in Lovely Professional University Jalandhar, Punjab.
✨ 9 October 2021 ✨
This Report Contain are these Things:-
Abstract
Introduction
Why Digital Marketing
Online Marketing
Website
Site design improvement:
Google Search Console/Google website admin
Slithering and Indexing
Web-based Media Marketing:
Web-based Media Marketers' Platform Usage Trends
Google Analytics
AdWords
AdSense
Pay-per-click (PPC)
How the PPC Model Works
Marketing is Future
Project on Facebook Ads
Reference
Social Media CRM is just as important as your traditional CRM strategies. The objectives of this slide are in the following.
1. Define CRM & Social CRM
2. Explain the usage and relevance of social media for CRM
3. Talk about the different platforms
4. Define how the platforms can be used
a. Events / Calendar
b. Inviting People
c. Sharing content, information
d. Feedback
5. Discuss analytical data
6. Define engagements
a. Like, dislike, comment, share, Direct Messages (DM)
7. Define listening tools
8. Look at 4 specific tools
9. Define ownership of the information
10. Conclusion / Recap
a. Summarize the objectives Q/A
Performics SVP of Communications Planning Jon Wegman brings to light the true meaning behind "performance marketing" in his June presentation at an L2 event. Learn how to harness actions through observing, measuring, analyzing and applying data to drive results.
This is a presentation for The Global Big Data Conference where I am focusing on how creative people can leverage algorithms and Big Data without having it take over their entire lives.
My presentation is on the afternoon of October 23rd, 2017 and I am publishing the deck a few days earlier than that. This presentation also coincides with the publication of the Digital Analytics for Marketing (Sponder/Khan) textbook that is now available (Routledge, 2018) online and in bookstores.
In this introductory course, industry experts and leading publishers will provide valuable tips and guidance on managing search campaigns and maximizing performance.
An Exploration of Volunteered Geographic Information stakeholdersChristopher J. Parker
Volunteered Geographic Information (VGI) has huge potential for influencing the use of geographic information systems. However, there is a wide range of individuals involved in this process, each with their own motivations for contributing and using volunteered data. This paper investigates the range of stakeholders involved with VGI, their relationships and the main tensions and issues involved. The research was based on a series of detailed interviews and theory-driven coding of data. From this, a Rich Picture (Monk, Howard 1998) was developed to graphically present and relate stakeholder relationship information. The findings have implications for how stakeholder groups may be described, and how VGI can lead to enhanced products and services.
PARKER, C.J., 2010. An Exploration of Volunteered Geographic Information Stakeholders, M. HAKLAY, J. MORLEY and H. RAHEMTULLA, eds. In: Proceedings of the GIS Research UK 18th Annual Conference, 14-16 April 2010 2010, UCL pp137-142.
Internet advertising an interplay among advertisers,online publishers, ad exc...Trieu Nguyen
Internet Advertising An Interplay among Advertisers,Online Publishers, Ad Exchanges and Web Users, Computational Advertising, contextual advertising, sponsored search, user behaviour targeting
The following report contains statements that are forwardlooking, including expectations and predictions regarding future industry trends and developments. Actual results may differ materially from our expectations or projections. This report also contains opinions, estimates, and forwardlooking statements by industry leaders.
Summer Internship Project Report | The Fundamental of Digital Marketing | Lov...Kodexhub
Summer Internship Project Report on ✨The Fundamental of Digital Marketing ✨ With 6% Plagiarism | MRX.
Submitted in Lovely Professional University Jalandhar, Punjab.
✨ 9 October 2021 ✨
This Report Contain are these Things:-
Abstract
Introduction
Why Digital Marketing
Online Marketing
Website
Site design improvement:
Google Search Console/Google website admin
Slithering and Indexing
Web-based Media Marketing:
Web-based Media Marketers' Platform Usage Trends
Google Analytics
AdWords
AdSense
Pay-per-click (PPC)
How the PPC Model Works
Marketing is Future
Project on Facebook Ads
Reference
Social Media CRM is just as important as your traditional CRM strategies. The objectives of this slide are in the following.
1. Define CRM & Social CRM
2. Explain the usage and relevance of social media for CRM
3. Talk about the different platforms
4. Define how the platforms can be used
a. Events / Calendar
b. Inviting People
c. Sharing content, information
d. Feedback
5. Discuss analytical data
6. Define engagements
a. Like, dislike, comment, share, Direct Messages (DM)
7. Define listening tools
8. Look at 4 specific tools
9. Define ownership of the information
10. Conclusion / Recap
a. Summarize the objectives Q/A
Performics SVP of Communications Planning Jon Wegman brings to light the true meaning behind "performance marketing" in his June presentation at an L2 event. Learn how to harness actions through observing, measuring, analyzing and applying data to drive results.
This is a presentation for The Global Big Data Conference where I am focusing on how creative people can leverage algorithms and Big Data without having it take over their entire lives.
My presentation is on the afternoon of October 23rd, 2017 and I am publishing the deck a few days earlier than that. This presentation also coincides with the publication of the Digital Analytics for Marketing (Sponder/Khan) textbook that is now available (Routledge, 2018) online and in bookstores.
In this introductory course, industry experts and leading publishers will provide valuable tips and guidance on managing search campaigns and maximizing performance.
An Exploration of Volunteered Geographic Information stakeholdersChristopher J. Parker
Volunteered Geographic Information (VGI) has huge potential for influencing the use of geographic information systems. However, there is a wide range of individuals involved in this process, each with their own motivations for contributing and using volunteered data. This paper investigates the range of stakeholders involved with VGI, their relationships and the main tensions and issues involved. The research was based on a series of detailed interviews and theory-driven coding of data. From this, a Rich Picture (Monk, Howard 1998) was developed to graphically present and relate stakeholder relationship information. The findings have implications for how stakeholder groups may be described, and how VGI can lead to enhanced products and services.
PARKER, C.J., 2010. An Exploration of Volunteered Geographic Information Stakeholders, M. HAKLAY, J. MORLEY and H. RAHEMTULLA, eds. In: Proceedings of the GIS Research UK 18th Annual Conference, 14-16 April 2010 2010, UCL pp137-142.
Influence of donor-donor transport on excitation energy transfer in binary sy...iosrjce
IOSR Journal of Applied Physics (IOSR-JAP) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of physics and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in applied physics. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.
Data storytelling connects two worlds, As a result, storytelling is fast becoming the most effective way to reach people. Storytelling is the new way to get consumers and the answers they need about you and your business. It's also becoming a vital element of a strong campaign, and stories are a proven medium for teaching, explaining, and influencing. Data storytelling is a communication technique that uses data as the guiding source. For more just check it out this presentation, and you will get best ideas from it. For more email us on info@nds.group
Running head FINAL REPORT2FINAL REPORT2.docxjeanettehully
Running head: FINAL REPORT 2
FINAL REPORT 2
Data Analytics and prediction for Travel Companies
Umair Afzal
IGlobal University
EXECUTIVE OVERVIEW
Customers demand more personalized services, presented to them without even having to look for themselves, with faster access. It is that time, companies are proactively notifying customers of what is relevant to their preferences and push customers to act. This is no different in the travel industry. As a matter of fact, travel industry is one of the industries that need the magic of Big Data Analytics the most, to please their customers. Therefore, our company “Global Tech” will collect and analyze the big data generated from social media pertaining to travel industry. We will get our required data via a subscription to NapoleonCat, a social media marketing and analytics platform. Our analytical tools will provide information allowing our customers to understand and predict consumers patterns, behaviors, pre and post travel feedback. In other words, our company will gather, analyze, and sell social media trends and predictions to companies across the travel industry.
Who is our customer? Everyone related to travel industry companies are our customers. Travel agencies travel focused marketing companies, foreign and domestic tourist and travel government agencies. What need will our company fill? Our company will help the customer’s design business strategy by leveraging customer insights, allowing them to personalize their offerings, improving their marketing and pricing strategies, and gaining competitive differentiation. We believe that companies need to understand their customer’s preferences to build business strategies. Our company’s Big data analytics will allow travel companies to understand their consumers patterns, behaviors, and feedback collected from various sources to help our customers design the right business and marketing strategy. Tracking, analyzing and understanding this valuable data will help our customers determine what offerings and services they should offer in the future.
According to our initial research, most travel agencies are either not performing social media analysis or are doing it in house. Most of the research articles outlined that the companies performing big data analysis within the travel industry employ B2C business model, which makes my offering quite unique in the current marketplace. Furthermore, our company will provide analytical report, and consulting and explanatory services regarding our analytics, the tools we use, and our insights gained from these tool and methods.
Our company’s expenses will be divided to three main categories: infrastructure, software and human resources. Estimated cost of $160,000/month for infrastructure, $200,000/month for software, $100,000/month for human resources, and $40,000 as miscellaneous cost are budgeted. The total amount of $500,000 per month is required to run Global Tech smo ...
Evolution of Digital Marketing and its Impact on Privacyijtsrd
The trend of digital marketing had a big impact on the company’s marketing strategy. However digital marketing is the huge concepts contain various types of digital marketing but this paper focuses on how digital marketing came into existence. We subsequently discuss how social media, E commerce, Over the top media services used for digital marketing, how big data analysis and behavior analytics introduced in digital marketing, and how smartphones changed the digital marketing scenario. Finally, we discuss the impact of data collection of the users on the user’s privacy. Piyush Sharma | Ms. Deepali Shahane "Evolution of Digital Marketing and its Impact on Privacy" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-4 | Issue-6 , October 2020, URL: https://www.ijtsrd.com/papers/ijtsrd33386.pdf Paper Url: https://www.ijtsrd.com/computer-science/computer-security/33386/evolution-of-digital-marketing-and-its-impact-on-privacy/piyush-sharma
SAAL A - 1445 - KEYNOTE - The Content Marketing Overhaul with Mike King, iPul...PerformanceIN
A recognised authority in SEO, content strategy, social media and measurement, iPullRank's Mike King will take you through a theoretical content ‘overhaul’, covering the distinctions between your media strategy and marketing, idea generation tools, amplification and models for predicting success.
With brands and advertisers harnessing the power of engaging and relevant web media, it’s no surprise to see content marketing arrive in the mainstream. It’s often overlooked, however, that content is operating right at the heart of many B2B and B2C strategies, and is continually proving a key point of focus for businesses.
But with adoption and crowds of experts comes rivalry of the highest level. With so many on the content trail, how can you develop your own strategy to stand a cut above the rest?
All this and more will be ‘contentualised’ during the keynote session, with plenty of time for questions at the close.
A strategic framework for digital measurementPeter Isaksson
Presentationen innehåller ett övergripande synsätt om hur man bör hantera mätning av den digitala kanalen. Ofta fastnar organisationer i frågor kring verktyg eller hur man ska mäta den enskilda webbplatsen/-erna. Insikten kring vad målgruppen anser viktigt har på senare tid uppmärksammats genom att "Consumer decision Journey" har blivit ett begrepp och arbetssätt som ligger till grund för hur företag ska möta sin målgrupp. Svårigheten där är för många att skifta synsätt från att se ur ett företagsperspektiv till ett målgruppsperspektiv. Kommer vi till mätning så blir det än svårare för många att översätta och få något vettig ur sina analyser av målgruppens agerande och hur detta knyter an till den egna verksamheten. Lättare är då att falla tillbaka på att optimera mätning av flöden och KPI:r på den egna webbplatsen eller appen. Med detta material vill jag bidra till att belysa diskussionen och synsättet genom att introducera en ny indikator bredvid KPI:n som jag gett namnet SPI. SPI står för "Story Performance Indicator" och ska reflektera målgruppens agerande i den digitala kanalen. En "story" kan vara det mest diskuterade ämnet, mest nedladdade appen inom ert verksamhetsområde eller ett beteende. Idén är att man genom att mäta detta kontinuerligt och sätta det i ett sammanhang där man i slutändan knyter det till företagets KPI kan identifiera företagets totala potential (SPI) och utnyttjandegrad (KPI). En KPI mäter den sista delen i en beslutsresa medans en SPI mäter början.
Kommentera och dela gärna materialet. Materialet är på intet sätt statiskt utan kommer att utvecklas med nya erfarenheter och kommentarer.
// Peter Isaksson, PI Exponent AB
__________________________________________________________________________________
Peter Isaksson har arbetat med digitala frågeställningar i 14 år. Bland erfarenheterna kan följande företag och organisationer räknas in; SSAB, Postnord, Stora Enso, Riskpolisstyrelsen, Hero AG, AFA Försäkring, SJ, Lantmäteriet, Stockholms Läns Landsting, SEB, Folksam, Skandia, PRV, Statens Fastighetsverk, INCF, Kläppen, Ostnor, Electrolux och ett flertal andra uppdragsgivare.
Performics 2014 Digital Trends: Participation ActivatedPerformics
Performics 2014 Participation Activated Trends Report explores our major themes in digital marketing in 2014, including Data Shadowing, Content A.I. and Virtual Marketplaces.
Social Media is one of the key channels to reach consumers and the possibilities of tracking and analyzing certain actions throughout such media is simply infinite. Marketers should use and incorporate Social Media Analytics more into their overall market plan. Media monitoring platforms such as Newspoint accurately gather and present data from all the different media sources into one interface.
Digital marketing analytics paths of value - 12-4-17Marshall Sponder
Digital Marketing Analytics Paths of Value is an adaption of many of the highlights of my book along with the Connecting the Dots methodology used in the book, and that sets it apart.
Similar to Big Data Final Paper - Warriors Final (20)
Digital marketing analytics paths of value - 12-4-17
Big Data Final Paper - Warriors Final
1. Project Final Report
Exploratory Analysis of Social Media
Images to Inform Product Innovation,
Marketing & Promotions
Big Data Analytics Summer2015
Matthew Blough, Eric DeFina, Zixin Mao, Sandilya Tumma
8/12/2015
2. 1
Abstract
Social Listening is an established activity allowing organizations to generate
consumer/customer insights and make more informed business decisions from public data in
social media. While traditionally based on text analytics tools, the rise of platforms such as
Instagram, Pinterest, Snapchat, Tumblr and more, have transformed the content, and therefore
data, generated in social media. As such, the analysis of unstructured data from images will be
critical to “social listening” on today’s platforms to fully understand context, sentiment,
meaning, and more. Through this research, we will explore whether we can use big data
platforms to read, analyze (trends, commonalities) and summarize unstructured data from social
media images to develop insights that feed business and marketing decisions for an online travel
agency company (e.g. Travelocity).
Introduction
Social Listening is an activity that allows brands and organizations to learn from public
data generated by consumers in social networks. By mining this unstructured data, companies
can generate insights from observing online consumer conversation, and then use these insights
to make smarter and more informed business decisions, such as product innovations, decisions
and changes, marketing campaigns, promotional offers, and more.
Over the past couple years, however, there has been a transformational shift in the
content published by consumers to social networks. With the rise of platforms such as
Instagram, Pinterest, Snapchat, and more, social conversation has become dominated by visual
communication and content. In addition, “traditional” social networks such as Facebook and
Twitter have also seen an influx of visual posts verses traditional comments, tags, and other text-
3. 2
based content. In 2014, 500 million image-based posts shared each day in social media, often
without the pairing of text to provide context to the image. This shift has been largely enabled
by the advancement and adoption of smartphones, as well as faster data connection speeds. For
users, visuals are easier and faster to consume.
In order to continue to mine the full sphere of social media for business insights and
questions, we must go beyond text analytics and use big data tools to collect and analyze
imagery quickly and efficiently. As image content now dominates the social web, it will be
critical to understand the context, sentiment and meaning of images in the same way tools have
historically parsed this data from text.
Business Significance
Market research, and specifically understanding consumer needs and the market
environment, has long been a tenant of running a successful and profitable business. In 2014
alone, the market research industry boasted of over $40 billion in sales globally. Today, the
Internet and rise of social media has created new opportunities for research and insights. It is no
longer necessary, in many cases, to set up formal, and expensive, studies in order to understand
and listen to consumers. In addition, the scale and size of the data has offered the ability to
analyze behavior of much larger groups of people compared to the smaller sample sizes of
traditional research studies. Since access to social media has become freely available to
interested organizations, many have turned to the analysis of this massive public data set as a
new form of consumer, market, and competitive insights.
Advertising has been a key activity for large online travel agencies to convert consumers
and drive sales. Expedia and Travelocity along spent over $4 billion in advertising in 2014
4. 3
alone. To make that advertising most effective, it is critical to understand the consumer insight
and create advertisement plans that can drive consumers down the purchase funnel, from product
awareness, to actually purchasing a trip. Analysis of social media images can provide these key
insights that can bolster our advertisement effectiveness and ultimately sales. By knowing what
the types of images consumers are posting, and what the images consist of, we can draw
conclusions of what travel options consumers are looking for, who people most commonly travel
with, and what activities they are doing. This information, far more insightful than transactional
data we have traditionally had access to, can be utilized to create more engaging opportunities
for our advertising creativity and promotional bundles to better meet the wants and needs of our
target audience.
Problem Statement
An online travel agency, TravelWeb, would like to determine the most effective new
advertising and promotion campaign based on consumers’ travel behavior, activity and trends, in
order to increase sales. Based on information extracted from social media imagery, we want to
answer:
• What imagery and creative should we be using for our marketing campaigns, ads,
website imagery and social media content?
• What deals and packages should we be offering?
• How should we structure our offers to best meet the wants and desires of
consumers?
• What bundles and deals should we create?
5. 4
Methodology
The process of this project is to take a large set of unstructured data from social media in
the form of images, transform it into a structured data set by using a computer vision algorithm,
then analyze the structured data set via data mining techniques in order to gain insights into
consumer compositions and preferences.
The first step of our methodology is hypothesis development, in other words, we needed
to outline our business interests. Our hypothesis spans across several topics. One is to understand
type of imagery, whether it may be hiking, camping, skiing, cruise travel, can be used to create
marketing campaigns or ads for prospective clients. Another hypothesis is to see who people are
traveling with in order to understand how to cater to their needs and interests on vacation trips.
What kind of bundles should be offered in terms of activities, foods, excursions based off of
these social media images. For example, people generally enjoy taking pictures on water skiing
more than when sitting around a campfire due to the thrill of the activity. Traveling companies
can offer packages for jet skiing in order to maximize revenue for that specific activity.
The dataset of images has already been given to us. Following Figure 1.2, the next task is
to take the images and run them through Microsoft ComputerVision API to extract structured
quantitative and qualitative information in each picture. This provides information on facial
recognition features such as gender and age, image colors, object categories, and how well these
predictions are doing in terms of a score. The dataset has over 150,000 images and the big data
platform can be useful in running these pictures quickly and efficiently. Since API calls are made
per image and one output in JSON format is produced for each image, we end up with 150,000
individual JSON records.
6. 5
Figure 1.1
While these JSON records we obtained from ComputerVision API are structured data, in
order to conveniently perform analysis, the data set needs to be further transformed into a
relational data structure. To achieve this takes two steps. The first step is to aggregate the
individual JSON records into one single file. This is necessary due to the flexible nature of JSON
format, i.e. it doesn’t require individual files to share the same number of fields. Therefore, in
order to make sure fields from all the files are included, these files need to be properly
aggregated. The second step is converting the single JSON data file into a simple relational
structures with columns and rows. To accomplish this, we utilized a tool named Konklone.
•Background Information
•Hypothesis Statement
•Business Insight
•Big Data usefulness
Hypothesis
•Vision API
•Python to extract raw data
•Clean up the data for analysis (ETL process)
Data Collection/ ETL
•Data Mining Tool IBM SPSS Modeler
•Classification
•Clustering
•Association
•Exploratory analysis
Analysis
•Find insight for business value
•Business decisions for advertising companies
Expected
Conclusion/Implication
7. 6
After a relational database is constructed, it is time to perform analytics. The platforms
we selected are Microsoft Azure and its Machine Learning Studio component. Azure is a
powerful big data platform with easy navigation and access to numerous plug-ins. Its Machine
Learning Studio allows us to apply different types of analytical techniques on the data. We can
easily perform descriptive analytics by slicing and dicing the data set using SQLite queries then
calculate their statistics. At a little more advanced level, we can create and run data mining
models such as classification, clustering and association. Here we can attempt to find different
patterns, trends, and correlations which can be useful to the business insight at hand. The
business implication here is to see what kind of images consumers are taking and begin advising
traveling companies and/or agencies on how to better promote their advertisements to specific
activities and leisure events. The goal here is to assist these companies in increasing revenue and
maximizing profits so that there are no dead costs in promoting the wrong activities. Why
promote parasailing at a location which isn’t suited for that type of activity as opposed to
parasailing on some off shore islands which is much better with consumers taking daily pictures?
8. 7
Figure 1.2
Project Domain
The project domain is broken down by the ETL process - Extraction, Transformation,
Load. Extraction is the challenging aspect of the process where we must connect the online
vision API to Microsoft Azure. This will allow us to feed images into python so that the script
can pull information from the API and give us output JSON files on each image. Python will be
running a loop function to run through all the images on the directory folder and spit out
thousands of files for analysis. Once we have compiled these JSON files together, we are ready
to transform these files into useful data. We have a couple options here. We can go either go
through Amazon Web Service’s MapReduce in order to compress the data into one big file. Once
we have one big file of the unstructured data, we can run this through Microsoft Excel as a CSV
file and clean it up as a proper dataset. This dataset will be our primary source of analysis once
we load it into the IBM SPSS Modeler application. The load process takes us to the modeler
where we can perform exploratory analysis and find key insights into the data. The key findings
Data Extraction
• Vision API
• Py thon
• JSON
Data Transf ormation
(Classif ication)
• Clean up data
f rom JSON output
• AWS/Cloudera
• Microsof t Excel
Data Loading
• Creation of CSV
f iles
• Load into IBM
SPSS Modeler
• Use of Text
Mining application
of SPSS Modeler
Data Analy sis
• Classif ication
• Clustering
• Association
• Word
count/f requency
9. 8
are what will be useful to businesses promoting their vacation packages in a more efficient
manner and spending resources where they find it best to maximize revenue.
Analytical Methods
We are looking to use three main categories of analytics: Classification, clustering, and
association. Classification will allow us to understand how categories of color, objects, age, and
scores are seen together as a large collection of images. Which image characteristics are more
common with social media images? Is there a commonality of pictures being taken of a younger
generation than the elder generation? Classification can help us understand the trend of these
images. Another tool we can look into is clustering. Clustering allows us to group together
characteristics of images which have more relation with each other. This can be with colors or
category images, just to name a few. The association method gives us connection analysis on
images of various categories. Age, gender, and color attributes can be analyzed to see which
combination of characteristics are closely associated with each other. Observing images which
are associated with each other should have a high confidence % in determining how closely they
are related to each other. One example of this: Are there more buildings in the background vs.
pictures of faces with buildings in the background?
Output/Results
We were interested in looking into insightful results through exploratory analysis and see
if we can identify any patterns of trends throughout the dataset of image information. To start
this off we built a simple model in Microsoft Azure identifying descriptive statistics from
important variables most relevant to the insight. Figure 1.3 shows the different nodes connected
10. 9
as we imported our dataset in the reader node and connected it to the project columns to select
the columns which we were most interested in looking at it. The project columns resembles a
“filter” node from other applications and we are able to concentrate on specific variables which
are of value to us. Lastly the descriptive statistics node had to be connected to the project
columns in order to spit out the statistical values of our categories for analysis.
Figure 1.3
In order to get some deeper insights, we needed to drill down the data set by slicing and
dicing it. For instance, an interesting aspect to look at is consumer composition by gender,
gender association and age groups. Figure 1.4 shows us the different slices we created for our
analysis. For example, we ran the following query to isolate records about images with two male
faces.
11. 10
Another absolutely important use for queries is to filter trustworthy data from noisy data.
This is important because although computer vision is getting more accurate day by day, it is still
far from being 100% accurate. Therefore, we need to take into account of filtering out data that
have very low prediction accuracy or ambiguous categories that are too generic for any
meaningful analysis. Take the following query for example, it removes records in which
ComputerVision API produced a prediction accuracy of lower than 10% in the first object
category. In addition, it takes out records that are categorized as “abstract” or “others”.
Figure 1.5 shows us the detailed results of the statistics of various types. The count,
median, mode, range, min, max, average statistics are displayed for numeric variables. Especially
for variables such as category score and face age, where numeric values are given to these
categories and we can identify certain patterns such as the average age of faces being produced
from this collection of images is around the age of 30. But you can also identify face ages which
vary as low as 1 and as high as 96.
13. 12
Clustering analysis helps identify similar characteristics grouped together. With image
clustering, one can identify the different types of images which are similar to each other through
various measurements. One of them being the Euclidian distance which allows to determine how
far one cluster is from another. Figure 1.6 shows how the cluster model was built through
Microsoft Azure. A reader and project column node were once again inserted to filter through
selected variables. We were most interested in identifying 3 columns of categories along with
their main color categorization. Then we added the “train clustering model” model in order to
train the variables into forming four different clusters. This process took some extra time as it
had to train the model so that we can extract results from it. Running it through azure did take 5
times as fast as running through other platforms such as IBM SPSS Modeler. This was a major
advantage to us from a big data perspective. Running 120,000+ images can be done in a quicker
process through azure than other platforms. The K-Means clustering node was added to cluster
our final results together. The metadata editor was used to name the cluster names as 1, 2, 3, 4
numeric values.
Figure 1.6
14. 13
The clustering results were extracted into a CSV format through the K-Means Clustering node.
From here we compiled the CSV table results into clustering results shown in Figure 1.7. Four
clusters formed categories and colors which were in close proximity to each other in
characteristics. They were all distinctively different and shows the type of category names
associated with the colors.
Figure 1.7
Cluster 1:
Outdoor
Building
Street Tree Text
Grey
Black
Cluster 2:
Food
Drinks
Crowd People
Yellow Blue
Green Black
Cluster 3:
Abstract
Others
Cluster 4:
Beach
Water
Sky
Blue
White
15. 14
Scope & Limitations
The main limit of this project is the amount of data. With more data the scores generated
would be more robust resulting in greater precision and accuracy in analyzing the images.
Moreover, our data being images from just one source is another limit. While the old adage, “a
picture is worth a thousand words,” may stand true, we are hoping to narrow results to find the
most important elements of description for analysis of an image. Additionally, due to the lack of
specificity of the ComputerVision API, classifications of the data were unable to be performed
for enhanced insights.
Policy/Managerial Implications
Improved picture recognition and description can allow managers to bolster “social
listening” on today’s media platforms to better understand context, sentiment, and the meaning
behind why a picture is shared and the context of the image. With enhanced image recognition,
more informed business decisions, such as product innovations, decisions and changes,
marketing campaigns, and promotional offers can emulate successful targeted fields for own
campaigns. Moreover, this project can help understand which elements of an image make it go
viral. Greater analysis and understanding of what customers take pictures of and what they share
allows a business to create a better product search for customers as well.
Conclusions & Future Research
Images are the new text on the web. They are easy to share and more engaging than text.
The trend will continue in favor of images and we believe that analysis of images will grow
tremendously in the coming years. Expanding on the importance of social listening, more
insights can be drawn from interpreting pictures. Enhancements in image perspective analysis,
16. 15
GPS and sentiment overlay, will improve clustering and classification in order to better predict
what appeals to specific customers to increase sales. Social media company Snapchat, an
ephemeral photo and video sharing app currently charges $400,000 worth of ad space for a story
generating 20 million views. Meanwhile, Facebook has utilized its massive storage of photos to
developed a way to recognize people in photos even if their faces are obstructed, identified
individuals with 83% accuracy using a method dubbed PIPER, an acronym for pose invariant
person recognition. As the quantity of images shared online increases the quality of data
algorithms processing photos will bolster analysis. Why pictures were taken, understanding the
important elements, what sparked the instance, and how to better react and cater to what
customer desires are the driving forces on how image analytics will proceed in the future.
Sources
1. http://www.fastcompany.com/3000794/rise-visual-social-media
2. http://blogs.adobe.com/digitalmarketing/social-media/visual-social-snapchat-pinterest-
and-the-rise-of-media-rich-marketing/
3. http://wersm.com/visual-web-the-next-big-thing/
4. http://blogs.wsj.com/digits/2015/06/23/facebook-claims-photo-recognition-breakthrough/
5. http://recode.net/2015/06/17/snapchats-making-some-pretty-serious-money-from-live-
stories/
6. https://www.esomar.org/uploads/industry/reports/global-market-research-
2014/ESOMAR-GMR2014-Preview.pdf
7. http://skift.com/2015/02/20/priceline-and-expedias-advertising-arms-race-in-2014/
8. Mary Meeker; 2014 Internet Trends Report
9. https://www.forrester.com/Big+Datas+Big+Meaning+For+Marketing/quickscan/-/E-
res114782
10. http://www.forbes.com/sites/groupthink/2015/05/01/visual-listening-social-medias-next-
frontier/3/