Big Data introduction - Café Numérique Bruxelles

16,518 views
16,146 views

Published on

General introduction to Big Data terms and technologies: Velocity, Volume, Variety (3V) and Veracity (4V), NoSQL, Data Science, main data stores (key-value, column, document, graph), Elasticsearch, ...
Presentation of data.be products leveraging Big Data & Elasticsearch

0 Comments
8 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
16,518
On SlideShare
0
From Embeds
0
Number of Embeds
14
Actions
Shares
0
Downloads
100
Comments
0
Likes
8
Embeds 0
No embeds

No notes for slide

Big Data introduction - Café Numérique Bruxelles

  1. 1. Big Data Introduction
  2. 2. About Me Eric Rodriguez Founder of data.be ! • Web entrepreneur • Data addict • Multi-Language: PHP, Java/ Groovy/Grails, .Net, … be.linkedin.com/in/erodriguez ! github.com/wavyx ! @wavyx
  3. 3. Big? Data!
  4. 4. Big Data is like teenage sex Everyone talks about it Nobody really knows how to do it Everyone thinks everyone else is doing it So everyone claims they are doing it… Quote: Dan Ariely
  5. 5. 3V -Volume,Variety,Velocity
  6. 6. Source: http://pennystocks.la/internet-in-real-time/
  7. 7. Variety of Data
  8. 8. • Health & Body sensors • Smart Home • Smart City • Industry applications • Environment
  9. 9. Health & Body Sensors Source: http://postscapes.com/internet-of-things-examples/
  10. 10. Smart Home Source: http://postscapes.com/internet-of-things-examples/
  11. 11. Smart Cities Source: http://postscapes.com/internet-of-things-examples/
  12. 12. Industry Source: http://postscapes.com/internet-of-things-examples/
  13. 13. Environment Source: http://postscapes.com/internet-of-things-examples/
  14. 14. The rise of Data Science
  15. 15. NoSQL
  16. 16. Big Data Landscape 2.0
  17. 17. Keep Calm and Big Data
  18. 18. Big DataTools
  19. 19. Technologies Source:
  20. 20. Not Only SQL Key-Value Column Document Graph
  21. 21. real time, search and analytics engine open-source Lucene JSON schema free document
 store RESTful API documentation scalability high availability distributed multi tenancy per-operation
 persistence
  22. 22. Elasticsearch core • Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java • Elasticsearch added value: “Simple is best” • Simple API (with documentation) • JSON & RESTful • Sharding & Replication • Extensibility: plugins and scripts • Interoperability: clients and integrations
  23. 23. Use Cases • Full-Text Search • Data Store • Analytics • Alerts • Ads • …
  24. 24. 4V -Veracity !
  25. 25. 4V -Veracity !
  26. 26. From Big Data toValue Wisdom! Knowledge! Information! Data!
  27. 27. HOW TO FIND RELEVANT COMPANY INFORMATION ?
  28. 28. BEFORE...
  29. 29. WHY IS IT SO HARD TO FIND COMPREHENSIVE INFORMATION ?
  30. 30. AFTER !
  31. 31. COMPANY PAGE
  32. 32. • VATValidity • Company Information • Geographic Search • EuropeanVAT Check API.DATA.BE
  33. 33. PUBLICATION SEARCH
  34. 34. Thank you! eric@data.be be.linkedin.com/in/erodriguez @wavyx

×