Dan Kuster led a talk at Sentiment Analysis Symposium discussing why businesses should consider adopting deep learning solutions. Key takeaways include simplicity, accuracy, flexibility, and some hacks for working with the tech.
About the Session:
Machine learning is becoming the tool of choice for analyzing text and image data. While traditional text processing solutions rely on the ability of experts to encode domain knowledge, machine learning models learn this directly from the data. Deep learning is a branch of machine learning that like the human brain quickly learns hierarchical representations of concepts, and it has been key to unlocking state-of-the-art results on a range of text and image classification tasks such as sentiment analysis and beyond.
In this session, we will show the impact of a deep learning based approach over NLP and traditional machine learning based methods for text analysis across key dimensions such as accuracy, flexibility, and the amount of required training data. Specifically, we will discuss how deep learning models are now setting the records for state-of-the-art accuracy in sentiment analysis. We will also demonstrate the flexibility of this approach by showing how the features learned by one model can be easily reused in different domains (e.g., handling additional languages, or predicting new categories) to drastically reduce the time to deployment. Finally, we will touch on the ability of this method to handle additional types of data beyond text, e.g, images, for maximum insight.
2. “All good researchers will tell you
that the most promising direction is
the one they are currently pursuing.
If they thought something else was
more promising, they would be
doing that instead.”
— G. Hinton
3. What is deep learning?
…a method for applying simple
mathematical functions to data.
4. web: search, facial recognition
smartphones: speech -> text
email: smart reply
mail: handwriting -> digits
cars: pedestrian detection
art & design: artistic style transfer
5. Wait, why now?
~1960’s (visual cortex is a deep neural network)
Simple neurons ⇾ hierarchical features ⇾ complex
~1990’s (computational models)
Neural networks ⇾ simple functions, applied piecewise
~ 2000’s (the internet + cheap storage)
Lots of data
~2012 (2 GPUs beat Google’s 16,000 CPU cluster)
Very fast and cheap parallel computing power
Deep neural networks ≈ mathematics + data
11. Maybe you are skeptical that deep learning
will have a lasting impact…
12. Maybe you are skeptical that mathematics
will have a lasting impact?
13. “The enormous usefulness of
mathematics in the natural sciences is
something bordering the mysterious and
there is no rational explanation for it.”
—Eugene Wigner (1960)
“The Unreasonable Effectiveness of
Mathematics in the Natural Sciences”
Unreasonable Benefits of Deep Learning
14. “…mathematical formulation…leads in an uncanny
number of cases to an amazingly accurate
description of a large class of phenomena.”
“…the concepts of mathematics are not chosen for
their conceptual simplicity…but for their
amenability to clever manipulations and to
striking, brilliant arguments.”
—Eugene Wigner (1960)
“The Unreasonable Effectiveness of
Mathematics in the Natural Sciences”
Unreasonable Benefits of Deep Learning
17. Simple? Compared to what?
• Expert systems
Domain expertise ⇾ think a lot ⇾ codify rules (e.g., 1700 pages of English grammar)
More data, more pain.
Previous wave of “A.I.” (good rules can seem magical).
• Traditional machine learning
Data ⇾ domain expertise ⇾ feature extraction ⇾ learned weights
Learn everything from scratch.
Manual feature engineering, biased and tedious.
More data helps!
• Deep neural networks
Data ⇾ model ⇾ learned weights
End-to-end learning, directly from examples. Like we (humans) do.
Can learn transferable features.
More data really helps!
27. Content filtering
(especially for user-generated content)
You have a brand
Your brand has an identity
(Disney vs. Calvin Klein)
Your audience might
have different sensibilities
than you do, about
what is appropriate
for your brand
Filter out the
inappropriate content at
your own custom threshold
49. Q: What problems can be solved with a
deep neural network?
A: If a human mind can do it in 1/10th of a second, a
deep neural network can probably do it well enough…
assuming you have data!
50. “Many scientists (myself included) take a sadistic
pleasure in proving other people wrong.
— Y. LeCun
51. The Unreasonable Benefits of Deep Learning:
simplicity, accuracy, flexibility, hacks
Questions?
Daniel
Kuster,
Ph.D.
@djkust @indicodata
52. Image credits:
Unsplash (backgrounds)
Google search, Facebook AI Research (DeepFace), nVidia, Gatys et al. (arXiv: 1508.06576)
Brigitewear International (Borat swimsuit)
Imgur (skeptical dogs)
Jack the cat
…and the team at indico!
53. “A machine learning researcher,
a crypto-currency expert,
and an Erlang programmer
walk into a bar.
Facebook buys the bar for $27 billion.”
@ML_Hipster