Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at Big Data Spain 2017

MeStrata Conference
Hadoop Summit
Big
Data
Spain

Miguel Romero [short bio]
@donkelito
• Working in SDG Group.
• My professional experience.
• 7 years as Data Engineer
• 6 years as Business Developer and
Solution Architect in BIG Data &
Advanced Analytics practice
• 1 year as Big Data Head
• Executive MBA
• Not working on my PhD 
• Big Data Master Professor
• Member of M team.
• Agile mode in each instant
• Currently: Developing my
speaker facet! #BDS15 #BDS17

W TF is a m em e?
• Big Data technology has brought an unprecedented explosion in
unstructured data.
• A meme (neologism modeled on gene) that is a shortening of
mimeme (greek) that means imitated thing, is an attempt to
explain the way cultural information spreads in terms of
evolutionary principles between humans;
• Internet memes are
– a subset of this general meme concept specific to the
culture and environment of the Internet.
– a piece of art which spreads from Person to Person via the
Internet which carries an additional property that
ordinary memes do not (the media through which they
propagate that renders them traceable and analyzable)

Memes could be
your advisor
[fads & sensations]

The Big M em e
IndexAnalyzing fads and sensations
on the Internet

tw itter
D ata
Sources
A PIs # m em e
Sites
D ata
Collect
D ata Fast A nalytics
Stream M edia
Processor
R D D
D Stream
FA Stream Media Collector
[http source]
FA Stream Media Collector
[http source]
M em e R epository
R D D
{ photo:’distracted-boyfriend’,
sense:‘opinionchange’,
w ords:[{w :‘H IV E’,p:1},
{w :’D S’,p:2},
{w :’SPA R K ’,p:3}]}
# spark
# ds
# hive
Enrich R epository
R D D
D Stream
D ata
Services
microservices
What do you have?
What do you like?
Breadth of the Influencers
according to trend memes
Cultural VS Education
VS Trending
Stream M edia
B uffer

Template matching is a technique for finding areas of an image
that match (are similar) to a template image (patch).
• Use the OpenCV function matchTemplate to search for matches between an image patch and an input
image
• Use the OpenCV function minMaxLoc to find the maximum and minimum values (as well as their
positions) in a given array
Tem plate m atching

tesseract.exe
staktreak_meme1_youwanadata.png
staktreak_meme1_youwanadata.txt
--oem 2 -l eng –psm 6
A You want data?
3 im} 3~
Here‘s your data
O penCV interface
Tesseract
[plain text]

Takeaw ay points
• Memes are a vehicle whose information
could be used as advisory
• Memes are unstructured data and until
now was not easy to analyze
• With Big Data and Advance Analytics
technology an architecture to analyze
memes (image + text) in NRT or even
video on live can be built.

B ig D ata Spain 2018
idea!W hat is there in com m on am ong
the m usic that data-lovers listen
to?

Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at Big Data Spain 2017

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at Big Data Spain 2017

Similar to Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at Big Data Spain 2017 (20)

More from Big Data Spain

More from Big Data Spain (20)

Recently uploaded

Recently uploaded (20)

Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at Big Data Spain 2017