(Technical) Big Data
Analytics
for non-technical end-users
Erik Tromp – CEO UnderstandLing
Contents
• (Short) intro
• Rationale
• Tuktu platform
• Deep learning for computational linguistics
• CEMistry – Customer Experience Monitoring on steroids
(Short) Intro
• Big data science experts
• Specialisms
• Computational Linguistics
• Customer Experience Management
• Service: from strategic advice all the way to operational implementation
• Own platform: Tuktu
• Soon: own product: CEMistry
• Training/education on big data science
(Short) Intro
Quantify every touchpoint of a customer with your company
4 major areas
• Text Analytics
• Web Analytics
• Mobile Analytics
• CRM/Backend Analytics
(Short) Intro
• Erik Tromp
• Age: 28
• CEO UnderstandLing
• Graduated in 2011 on Sentiment Analysis
• Multilingual Sentiment Analysis on Social Media
• Software engineer – Scala
• Machine learning
• Author of the Tuktu platform
Rationale
Big data science makes it possible to seize opportunities
Big data science drives business
But it is very much a technical revolution, with business implications
Rationale
Many companies want to utilize the opportunities big data science brings
These companies do not have sufficient capabilities to do so
Nor are there many suppliers that can handle the technology and the analytics and also know the customer's business
Rationale
But these companies often do have their own (business) analysts
Rationale
IDEA
Make big data science accessible to non-technical users
Tuktu
http://www.tuktu.io
https://github.com/UnderstandLingBV/Tuktu
Tuktu – Early Days
• Started off as a personal project to make life easier
• Grew out of a collaboration with Maastricht University
• Idea: save time on coding/engineering, focus on logic and functionality
Instead of writing the same code over and over again, keep it available as building blocks and configure them
In a visual and straightforward way!
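The "configure building blocks instead of rewriting code" idea can be illustrated with a minimal Python sketch. It is not Tuktu's implementation or API: the block names (`lowercase`, `keep_if_contains`, `add_length`), the config layout and the `build_job` helper are all assumptions made up for this example. The point is only that the job is plain data, which is what makes a visual editor on top of it feasible.

```python
import json

# Hypothetical building blocks; the names are illustrative, not Tuktu's own.
def lowercase(records, field):
    for record in records:
        yield {**record, field: record[field].lower()}

def keep_if_contains(records, field, needle):
    for record in records:
        if needle in record[field]:
            yield record

def add_length(records, field, target):
    for record in records:
        yield {**record, target: len(record[field])}

# Registry that maps a name used in the configuration to the code behind it.
BLOCKS = {
    "lowercase": lowercase,
    "keep_if_contains": keep_if_contains,
    "add_length": add_length,
}

def build_job(config):
    """Turn a list of {"block": ..., "params": ...} steps into one function."""
    def run(records):
        stream = iter(records)
        for step in config:
            stream = BLOCKS[step["block"]](stream, **step["params"])
        return list(stream)
    return run

# The job itself is plain data, so it could come from a file or a visual editor.
JOB_CONFIG = json.loads("""
[
  {"block": "lowercase", "params": {"field": "text"}},
  {"block": "keep_if_contains", "params": {"field": "text", "needle": "tuktu"}},
  {"block": "add_length", "params": {"field": "text", "target": "length"}}
]
""")

job = build_job(JOB_CONFIG)
result = job([{"text": "Tuktu is great"}, {"text": "Something else"}])
print(result)
```

Changing the job means editing the configuration, not the code: reordering or removing a step in `JOB_CONFIG` yields a different job without touching any block.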
Tuktu – Now
Your one-stop shop for everything big data science
Tuktu – Now
• Real-time and batch processing
• Synchronous and asynchronous processing
• REST API
• Drag-and-drop modelling of jobs
• Distributed file system: TDFS
• Key/value store: TuktuDB
• Real-time visualization
• Web analytics support
• Scheduling
• No master/slave architecture
• Local or distributed computing
• Machine learning
• Deep learning
• Cross-platform thanks to the JVM
• Easy installation: just unzip!
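One way to read "real-time and batch processing" is that the processing logic is written once and only the way results are consumed differs. The sketch below shows that reading in plain Python; the function names `count_words`, `run_batch` and `run_realtime` are assumptions for illustration and say nothing about how Tuktu does this internally.

```python
from collections import Counter

def count_words(texts):
    """Processing logic, written once: yield a running word count per text."""
    totals = Counter()
    for text in texts:
        totals.update(text.lower().split())
        yield dict(totals)

def run_batch(texts):
    """Batch mode: consume everything, return only the final result."""
    final = {}
    for final in count_words(texts):
        pass
    return final

def run_realtime(source):
    """Real-time mode: hand every intermediate result to the caller."""
    yield from count_words(source)

messages = ["big data", "Big deal"]
batch_result = run_batch(messages)
stream_results = list(run_realtime(iter(messages)))
```

In batch mode the caller sees one final count; in real-time mode it sees an updated count after every incoming message, which is the kind of output a live visualization would consume.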
Tuktu
DEMO
Deep Learning for Computational Linguistics
IDEA
Learn language models generically
Model every CL problem on top of the generic model
Deep Learning for Computational Linguistics
This way, we can do almost any task in almost any language
With less effort, or at least without too much
Deep Learning for Computational Linguistics
How?
Deep Learning for Computational Linguistics
There are many linguistic resources available
Sadly, most are for English
In particular: annotated treebanks for deep parsing
Deep Learning for Computational Linguistics
We can use this, however
Deep Learning for Computational Linguistics
1. Co-train word vectors for the target language and English
2. Train parsing models on English
3. Co-fine-tune the models on the co-trained word vectors
4. Pre-train a (recursive) auto-encoder using the parsing model for the target language
5. Use the recursive auto-encoder for a specific task in the target language
• Topic detection, sentiment analysis, named entity recognition, authorship profiling
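Steps 4 and 5 rely on a recursive auto-encoder. As a rough sketch of what such a model computes, the Python below runs only the forward pass: two child vectors are compressed into one parent vector along a binary parse tree, and the reconstruction error is the signal that unsupervised pre-training would minimise. Everything concrete here is an assumption for illustration: the weights are fixed by hand rather than learned, the vectors are two-dimensional, and the Dutch phrase and its parse are made up.

```python
import math

DIM = 2  # toy vector size; real models use far larger vectors

# Hypothetical fixed weights; in practice these are learned during pre-training.
W_ENC = [[0.5, -0.2, 0.1, 0.3],
         [0.0, 0.4, -0.3, 0.2]]   # DIM rows, 2 * DIM columns
W_DEC = [[0.6, 0.1],
         [-0.1, 0.5],
         [0.2, -0.4],
         [0.3, 0.3]]              # 2 * DIM rows, DIM columns

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]

def encode(left, right):
    """Compress two child vectors into one parent vector of the same size."""
    return [math.tanh(v) for v in matvec(W_ENC, left + right)]

def reconstruction_error(left, right, parent):
    """Squared error between the children and their decoded reconstruction."""
    decoded = matvec(W_DEC, parent)
    return sum((d - c) ** 2 for d, c in zip(decoded, left + right))

def encode_tree(tree, vectors):
    """Encode a binary parse tree bottom-up.

    A tree is either a word (str) or a (left, right) tuple.
    Returns the root vector and the summed reconstruction error.
    """
    if isinstance(tree, str):
        return vectors[tree], 0.0
    left_vec, left_err = encode_tree(tree[0], vectors)
    right_vec, right_err = encode_tree(tree[1], vectors)
    parent = encode(left_vec, right_vec)
    error = reconstruction_error(left_vec, right_vec, parent)
    return parent, left_err + right_err + error

# Hypothetical word vectors and parse for the Dutch phrase "de film is goed".
VECTORS = {"de": [0.1, 0.0], "film": [0.4, 0.2], "is": [0.0, 0.1], "goed": [0.3, 0.9]}
PARSE = (("de", "film"), ("is", "goed"))

root_vector, total_error = encode_tree(PARSE, VECTORS)
```

The root vector is a fixed-size representation of the whole phrase, which is what a task-specific model in step 5 (for example a sentiment classifier) could take as input.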
Deep Learning for Computational Linguistics
DEMO
Unsupervised parsing in Dutch
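Why co-trained word vectors make English resources usable for another language can be shown with a deliberately small stand-in. The sketch below does not train a parser; it swaps in a nearest-centroid sentiment classifier, trained on English labels only and then applied to Dutch words. The shared vectors are invented by hand so that translations lie close together, which is the property co-training is meant to produce.

```python
import math

# Hypothetical co-trained vectors: translations sit close together in one space.
SHARED_VECTORS = {
    "good": [0.9, 0.1], "great": [0.8, 0.2], "bad": [0.1, 0.9], "awful": [0.2, 0.8],
    "goed": [0.85, 0.15], "slecht": [0.15, 0.85],
}

def mean_vector(words):
    """Average the vectors of all known words in a text."""
    known = [SHARED_VECTORS[w] for w in words if w in SHARED_VECTORS]
    if not known:
        raise ValueError("no known words in text")
    return [sum(col) / len(known) for col in zip(*known)]

def train_centroids(labelled_texts):
    """Average the text vectors per label (a nearest-centroid classifier)."""
    by_label = {}
    for text, label in labelled_texts:
        by_label.setdefault(label, []).append(mean_vector(text.split()))
    return {label: [sum(col) / len(vecs) for col in zip(*vecs)]
            for label, vecs in by_label.items()}

def classify(text, centroids):
    """Return the label whose centroid is closest to the text vector."""
    vector = mean_vector(text.split())
    return min(centroids, key=lambda label: math.dist(vector, centroids[label]))

# Train on English labels only...
ENGLISH_TRAINING = [("good great", "positive"), ("bad awful", "negative")]
centroids = train_centroids(ENGLISH_TRAINING)

# ...then classify Dutch text that the classifier has never seen.
dutch_labels = [classify(text, centroids) for text in ["goed", "slecht"]]
print(dutch_labels)
```

No Dutch labels are used anywhere; the transfer works only because the Dutch and English words share one vector space.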
CEMistry
[Diagram: four analytics areas feeding one customer profile]
• Web analytics: Tuktu.js, page views
• Mobile analytics: "SDK", app triggers
• Text analytics: collectors (NLP), communication
• Backend/CRM: connectors (database), transactions
• Every area records events and links its visitors or users to customers
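The diagram's central idea, events from several channels ending up in one customer profile, can be sketched in a few lines of Python. The event fields, the IDs and the `VISITOR_TO_CUSTOMER` identity map are assumptions invented for this example; they do not describe CEMistry's actual data model.

```python
from collections import defaultdict

# Hypothetical identity map: anonymous web/app visitor IDs resolved to customers.
VISITOR_TO_CUSTOMER = {"web-123": "cust-1", "app-9": "cust-1"}

# Hypothetical events, one or more per area of the diagram.
EVENTS = [
    {"source": "web", "actor": "web-123", "type": "page_view", "ts": 1},
    {"source": "mobile", "actor": "app-9", "type": "app_trigger", "ts": 3},
    {"source": "text", "actor": "cust-1", "type": "communication", "ts": 2},
    {"source": "crm", "actor": "cust-1", "type": "transaction", "ts": 4},
    {"source": "web", "actor": "web-777", "type": "page_view", "ts": 5},
]

def build_profiles(events, identity_map):
    """Group events per customer, ordered by time.

    Visitors that cannot be resolved to a customer keep their own ID.
    """
    profiles = defaultdict(list)
    for event in events:
        customer = identity_map.get(event["actor"], event["actor"])
        profiles[customer].append(event)
    return {customer: sorted(items, key=lambda e: e["ts"])
            for customer, items in profiles.items()}

profiles = build_profiles(EVENTS, VISITOR_TO_CUSTOMER)
timeline = [event["type"] for event in profiles["cust-1"]]
print(timeline)
```

Once a visitor is linked to a customer, web, mobile, text and CRM events line up on a single timeline; the unresolved visitor `web-777` stays a separate profile until it can be matched.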
Questions?
Erik Tromp
CEO UnderstandLing
erik@understandling.com
http://www.understandling.com
http://www.tuktu.io
http://www.linkedin.com/in/eriktromp
https://github.com/UnderstandLingBV/Tuktu
Talk to us on Gitter!
https://gitter.im/UnderstandLingBV/Tuktu

Deep learning for text analytics
