Big Data Introduction
About Me
Eric Rodriguez
Founder of data.be
• Web entrepreneur
• Data addict
• Multi-Language: PHP, Java/Groovy/Grails, ...
Big? Data!
Big Data is like teenage sex
Everyone talks about it
Nobody really knows how to do it
Everyone thinks everyone else is doi...
3V - Volume, Variety, Velocity
Source: http://pennystocks.la/internet-in-real-time/
Variety of Data
• Health & Body sensors
• Smart Home
• Smart City
• Industry applications
• Environment
Health & Body Sensors
Source: http://postscapes.com/internet-of-things-examples/
Smart Home
Source: http://postscapes.com/internet-of-things-examples/
Smart Cities
Source: http://postscapes.com/internet-of-things-examples/
Industry
Source: http://postscapes.com/internet-of-things-examples/
Environment
Source: http://postscapes.com/internet-of-things-examples/
The rise of Data Science
NoSQL
Big Data Landscape 2.0
Keep Calm and Big Data
Big Data Tools
Technologies
Source:
Not Only SQL
Key-Value, Column, Document, Graph
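The four NoSQL families named on this slide differ mainly in how they shape a record. The sketch below is an illustration only: it stores one made-up company record in each of the four shapes using plain Python structures. The record, its field names, and the `as_*` helper functions are all hypothetical and do not come from the talk or from any particular database.

```python
import json

# Hypothetical record, invented for illustration only.
RECORD = {"id": "c1", "name": "Example SA", "city": "Brussels", "partner_id": "c2"}


def as_key_value(record):
    """Key-value: one opaque blob per key; the store never looks inside it."""
    return {f"company:{record['id']}": json.dumps(record)}


def as_column(record):
    """Column (wide-column): row key -> column family -> individual columns."""
    return {
        record["id"]: {
            "profile": {"name": record["name"], "city": record["city"]},
            "links": {"partner_id": record["partner_id"]},
        }
    }


def as_document(record):
    """Document: one self-describing, possibly nested document per entity."""
    return {
        "_id": record["id"],
        "name": record["name"],
        "address": {"city": record["city"]},
        "partners": [record["partner_id"]],
    }


def as_graph(record):
    """Graph: entities become nodes, relationships become labelled edges."""
    nodes = {record["id"]: {"name": record["name"], "city": record["city"]}}
    edges = [(record["id"], "PARTNER_OF", record["partner_id"])]
    return nodes, edges
```

The same data ends up in all four, but each shape suits a different access pattern: lookup by key, reads of a few columns across many rows, retrieval of a whole entity, or traversal of relationships.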
real time, search and analytics engine
open-source
Lucene
JSON
schema free
document store
RESTful API
docume...
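The keywords on this slide (JSON, schema free, document store, RESTful API) describe how a client talks to Elasticsearch: documents are JSON bodies sent over HTTP. The sketch below only builds the requests and does not send them. The base URL, the `companies` index, and the helper names are assumptions made for illustration. The `_doc` path segment follows recent Elasticsearch versions; releases from the time of this talk used a mapping type name in that position.

```python
import json

# Assumption: a local node listening on the default HTTP port.
BASE_URL = "http://localhost:9200"


def index_request(index, doc_id, document):
    """Return (method, url, body) for storing one JSON document.

    No schema is declared up front: the document is sent as-is.
    """
    return "PUT", f"{BASE_URL}/{index}/_doc/{doc_id}", json.dumps(document)


def search_request(index, field, text):
    """Return (method, url, body) for a full-text `match` query on one field."""
    body = {"query": {"match": {field: text}}}
    return "GET", f"{BASE_URL}/{index}/_search", json.dumps(body)


if __name__ == "__main__":
    # Hypothetical document and query, printed instead of sent.
    for request in (
        index_request("companies", "1", {"name": "Example SA", "city": "Brussels"}),
        search_request("companies", "name", "example"),
    ):
        print(*request)
```

Any HTTP client can send these tuples. Keeping request construction separate from transport makes the payloads easy to inspect before a cluster is involved.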
Elasticsearch core
• Apache Lucene is a high-performance, full-featured text search engine library
written entirely in Jav...
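The full version of this slide in the transcript lists sharding and replication among the things Elasticsearch adds on top of Lucene. The sketch below shows the general idea under simplifying assumptions: a document id is hashed to pick a primary shard, and each shard's copies are spread over distinct nodes. `zlib.crc32` is a stand-in hash and the round-robin placement is invented for illustration. Elasticsearch uses its own hash function and allocation logic, and the node names here are hypothetical.

```python
import zlib


def shard_for(doc_id, num_primary_shards):
    """Pick a primary shard by hashing the document id (stand-in hash)."""
    return zlib.crc32(doc_id.encode("utf-8")) % num_primary_shards


def place_copies(num_primary_shards, num_replicas, nodes):
    """Map each shard to the nodes holding its primary and replica copies.

    Copies of one shard go to distinct nodes, so losing a single node
    never removes every copy of a shard.
    """
    copies_per_shard = num_replicas + 1
    if copies_per_shard > len(nodes):
        raise ValueError("need at least one node per copy of a shard")
    placement = {}
    for shard in range(num_primary_shards):
        placement[shard] = [
            nodes[(shard + offset) % len(nodes)] for offset in range(copies_per_shard)
        ]
    return placement


if __name__ == "__main__":
    nodes = ["node-a", "node-b", "node-c"]  # hypothetical node names
    print(shard_for("company-42", 3))
    print(place_copies(3, 1, nodes))
```

Because the shard is derived from the id modulo the number of primary shards, changing that number would send existing ids to different shards. This is why the primary shard count is normally fixed when an index is created.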
Use Cases
• Full-Text Search
• Data Store
• Analytics
• Alerts
• Ads
• …
4V - Veracity!
4V - Veracity!
From Big Data to Value
Wisdom!
Knowledge!
Information!
Data!
HOW TO FIND RELEVANT COMPANY INFORMATION?
BEFORE...
WHY IS IT SO HARD TO FIND COMPREHENSIVE INFORMATION?
AFTER!
COMPANY PAGE
• VAT Validity
• Company Information
• Geographic Search
• European VAT Check
API.DATA.BE
PUBLICATION SEARCH
Thank you!
eric@data.be
be.linkedin.com/in/erodriguez
@wavyx
Big Data introduction - Café Numérique Bruxelles

General introduction to Big Data terms and technologies: Velocity, Volume, Variety (3V) and Veracity (4V), NoSQL, Data Science, main data stores (key-value, column, document, graph), Elasticsearch, ...
Presentation of data.be products leveraging Big Data & Elasticsearch

Transcript of "Big Data introduction - Café Numérique Bruxelles"

  1. Big Data Introduction
  2. About Me: Eric Rodriguez, Founder of data.be • Web entrepreneur • Data addict • Multi-Language: PHP, Java/Groovy/Grails, .Net, … • be.linkedin.com/in/erodriguez • github.com/wavyx • @wavyx
  3. Big? Data!
  4. Big Data is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it… (Quote: Dan Ariely)
  5. 3V - Volume, Variety, Velocity
  6. Source: http://pennystocks.la/internet-in-real-time/
  7. Variety of Data
  8. • Health & Body sensors • Smart Home • Smart City • Industry applications • Environment
  9. Health & Body Sensors. Source: http://postscapes.com/internet-of-things-examples/
  10. Smart Home. Source: http://postscapes.com/internet-of-things-examples/
  11. Smart Cities. Source: http://postscapes.com/internet-of-things-examples/
  12. Industry. Source: http://postscapes.com/internet-of-things-examples/
  13. Environment. Source: http://postscapes.com/internet-of-things-examples/
  14. The rise of Data Science
  15. NoSQL
  16. Big Data Landscape 2.0
  17. Keep Calm and Big Data
  18. Big Data Tools
  19. Technologies Source:
  20. Not Only SQL: Key-Value, Column, Document, Graph
  21. real time, search and analytics engine • open-source • Lucene • JSON • schema free • document store • RESTful API • documentation • scalability • high availability • distributed • multi tenancy • per-operation persistence
  22. Elasticsearch core • Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java • Elasticsearch added value: “Simple is best” • Simple API (with documentation) • JSON & RESTful • Sharding & Replication • Extensibility: plugins and scripts • Interoperability: clients and integrations
  23. Use Cases • Full-Text Search • Data Store • Analytics • Alerts • Ads • …
  24. 4V - Veracity!
  25. 4V - Veracity!
  26. From Big Data to Value: Wisdom! Knowledge! Information! Data!
  27. HOW TO FIND RELEVANT COMPANY INFORMATION?
  28. BEFORE...
  29. WHY IS IT SO HARD TO FIND COMPREHENSIVE INFORMATION?
  30. AFTER!
  31. COMPANY PAGE
  32. • VAT Validity • Company Information • Geographic Search • European VAT Check API.DATA.BE
  33. PUBLICATION SEARCH
  34. Thank you! eric@data.be be.linkedin.com/in/erodriguez @wavyx