SlideShare a Scribd company logo
1 of 19
Download to read offline
Evolution of Big Data 
ICT Business Breakfast 
Durban, 17 September 2014 
Willy Govender
What is Big Data? 
“Large volumes of a wide variety of data collected from various sources across the enterprise including transactional data from enterprise applications/databases, social media data, mobile device data, unstructured data/documents, machine-generated data and more.“ Source: IDG: Big Data – Growing Trends and Emerging Opportunities
Data Sources 
Structured 
•Spreadsheets 
•Relational Databases 
•ERP 
•CRM 
•Legacy systems 
•File share 
Unstructured 
•Documents 
•Machine Data 
•Messaging 
•Photographs 
•Video 
•Social Media 
•Web traffic logs 
"90% of all data ever created, was created in the past two years. From now on, the amount of data in the world will double every two years." 
Enterprise 
Cloud
The Evolution of Big Data 
Big data is traditionally referred to as 3Vs (now 5V, 7V) 
Volume (amount of data collected – terabytes/exabytes) 
Velocity (speed/frequency at which data is collected) 
Variety (different types of data collected) 
Now experts are adding “veracity, variability, visualization, and value” 
Big data is not new 
Supercomputers have been collecting scientific/research data for decades 
However, now its uses are being seen in commercial competitive advantages 
And now we are able to collect a variety of data from multiple devices and sources 
Is the evolution of the BI ecosystem from data warehousing 
Does not make DW obsolete 
Big Data approaches are reducing the costs of data management 
Data still needs to be standardized, data quality maintained, and access provided to constituent communities. 
Data management will continue to be an evolutionary process. 
Big data is simply a new data challenge that requires leveraging existing systems in a different way
So, what does Big Data do? 
Focuses on finding hidden threads, trends, or patterns which may be invisible to the naked eye 
Data store of clusters of servers (eg. Apache Hadoop used for Amazon Cloud) 
A set of tasks that processes the data in different segments of the cluster then breaks down the results to more manageable chunks which are 
Requires mathematical and statistical expertise as well as creative, communicative, problem-solving, and business skills summarized 
Obviates the need for Data alignment or Data migration, or the requirement to move data into one place for cross-referencing. This achieved through indexes and crawlers (like Google) which constantly mine data update the indexes.
Framework and Data Flows 
Data Models, Structures, Types 
•Data formats, non/relational, file systems, etc. 
•Big Data Management 
Big Data Lifecycle (Management) 
•Big Data transformation/staging 
•Recording, Storage, Archiving 
Big Data Analytics and Tools 
•Big Data Applications 
•Target use, presentation, visualisation 
Big Data Infrastructure (BDI) 
•Storage, Compute, (High Performance Computing,) Network 
•Sensor network, target/actionable devices 
•Big Data Operational support 
Big Data Security 
•Data security in-rest, in-move, trusted processing environments 
Collection and Registration 
Filtering, Classification and Enrichment 
Analytics, Modelling and Prediction 
Presentation and Visualization
What challenges can you expect 
Platforms 
•High end data warehousing tools 
•Open source technologies challenging with accessing data from multiple servers rapidly in native form 
•Selection of Enterprise Search Tools 
Skills 
•Managing Data Volumes 
•Ability to really understand what can be achieved 
•Open source platforms not easy to use 
•Data scientists now required 
Leadership 
•New territory for IT professionals, so planning, marketing, ROI etc is an issue 
•Getting Data on the Board's agenda 
Walmart analyses real-time social media data for trend to guide online ad purchases
Enterprise Search: Vendors 
TCO 
FEATURE SET 
Low 
High 
Low 
High 
Niche Progressive 
Niche Traditional 
Niche Progressive 
Niche Traditional
Challenges in Big Data 
— Increasing Amount of Disorganized Data and Data Sources (structured & unstructured) 
Provides greater opportunity for failure – lack of information can lead to wrong decisions 
Limits productivity – more time and effort needed to find information 
Frustrates search users – 
information is expected to be readily available and complete 
— 
Not tackling Big Data in enterprises … 
Marketing Data 
Data Warehouse 
Social Media 
Research Databases 
Office Files 
Transactional Data 
Acquisition Data 
→ 
DIGITAL DATA VOLUME 
2010 
2012 
2014 
2016 
2018 
2020 
Etc.
Opportunity in Big Data 
Source: IDC 
35 Zetabytes 
DIGITAL DATA VOLUME 
2010 
2012 
2014 
2016 
2018 
2020 
STATUS QUO 
— Accessible Data Has Value 
48% CAGR1 
No Specific Solutions Too hard and expensive 
Homegrown 
Hard to maintain and insufficient 
Traditional Solutions 
Waste countless months on inflexible solutions 
— 
Solution Types
Q-Sensei Product – Aimed at bringing Big Data approach to all Enterprises 
— 
Traditional Approaches 
— Q-Sensei Revolution 
•Complex products 
•Rigid delivery model 
•Pre-defined usage 
•Expensive 
•Limited audience 
•Exhausting implementation 
•Disparate solutions 
•Poor interaction design 
•Simple 
•Powerful 
•Fast 
•Flexible 
•Broad application 
•Interactive 
•Easy delivery model 
•For everyone
Case Study mention in Wall Street Journal in 2012 
They were able to analyze traffic details for various devices, spot problem areas and add network throughput to help prepare for future demand. Netflix was also able to get more insight into the type of content customers preferred, which enabled them to make more accurate suggestions as to what subscribers might like.
Case Study 
— 
Overview 
•Premiere Internet subscription service for streaming media and DVD-by-mail services 
•Over 50 million subscribers in 40+ countries; Revenue 2013: $4.37 billion 
•Contract Management: Permission/licensing agreements with content creators 
•Leader in interactive, contextual search changing the way companies search and analyze data 
•Patented powerful multidimensional search and index capability 
•Gives developers full access to award- winning technology and empowers them to built robust search and analytics applications for all data needs 
World's Leading Internet television network (ITN)
Case Study – Search in Contracts 
— 
Goals and Key Challenges 
1.Make searching their copious contract documentation better manageable and easier to use for end users 
2.Integrate and unify their highly structured metadata with their unstructured content data 
3.Incorporate Optical Character Recognition (OCR) of scanned documents during data ingestion process 
4.Integrate with in-house, Drupal-based content management system 
5.Flexibility to consume the data from their custom system 
6.Data model that meets various needs of personnel 
7.Timeline of only 3 month
Case Study – Search in Contracts 
— Solution and Successes 
1.In 3 month Q-Sensei conceptualized and deployed a solution for contract search needs using Fuse (including usability testing) 
2.Addition of further capabilities based on end user feedback: 
•n-gram phrase search 
•date range search 
•multi-sort of facets 
•grid view of results 
3.The flexibility and modular architecture of Fuse enables customer to implement the platform for further use cases (knowledge base search, log analysis, usage analysis, etc.)
Demo 
— Q-Sensei Medical Demo 
•Unified Access to Publications, Grants, Patents, Office Files, Person 
•Content-Based Faceted Auto Complete 
•Dynamic Faceting 
•Search-within-a-search capability 
•Data Interaction and deep Data Correlations 
•360-degree view of information 
•Multi-Dimensional Visualization 
•Customizable Search Interface 
•Integrated Data Sources (21m Publications, 1,8m Grants, 1,5m Patents, Office Files (DOC, XLS, PPT, PDF,…) , Person DB ) 
Set-up (Harvesting, Importing, Data Transformation, Indexing) in 5 days
Performance Metrics 
Sample System 
System Configuration 
Performance 
Based on Sample System 
•Intel Ivy Bridge Quadcore 3.4GHz 
•32GB RAM 
•1TB HD 
•64-bit Linux 
•Up to 80 million documents can be indexed 
•Up to 20 million records can be uploaded per hour (more than 5,000/sec) 
•100,000 search queries can be processed per minute per million documents; a query includes: 
•processing of search expression (including fulltext) 
•computation of eight (8) standard facets 
(Latest test: September 2013)
Contract Management Search 
•Create a more accurate and efficient contract search by exposing all metadata and using facets 
•Search scanned documents with advanced OCR capabilities Knowledge Base / Support Center Search 
•Increase the efficiency of finding answers by utilizing more metadata in your knowledge base 
•Embrace tags and faceted search over hierarchy to find answers more quickly Enterprise Search 
•Unify your company’s information by searching all sources simultaneously 
•Increase the productivity of everyone with better data accessibility 
Usage Analysis 
•Increase speed and agility of customer activity analysis by embracing a multidimensional view of your data 
•Drive dynamic visualizations and build complex queries Structured Data Analysis 
•Understand the composition of data, find relationships, and identify trends 
•View data more accurately by analyzing all attributes simultaneously E-Commerce Faceted Navigation 
•More accurately represent your products with dynamically updating facets that perform at scale 
•Power more meaningful recommendations with the capability to use more metadata 
Further Use Cases 
— 
A Single Platform for Everything
Other Examples East London Rural Mapping

More Related Content

What's hot

Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notesMohit Saini
 
Big Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesBig Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesAshraf Uddin
 
Big Data Ppt PowerPoint Presentation Slides
Big Data Ppt PowerPoint Presentation Slides Big Data Ppt PowerPoint Presentation Slides
Big Data Ppt PowerPoint Presentation Slides SlideTeam
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analyticsUmasree Raghunath
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingMinhazul Arefin
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big datahktripathy
 
Introduction to Business Intelligence
Introduction to Business IntelligenceIntroduction to Business Intelligence
Introduction to Business IntelligenceAlmog Ramrajkar
 
Big data visualization
Big data visualizationBig data visualization
Big data visualizationAnurag Gupta
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data AnalyticsRohithND
 

What's hot (20)

Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Dynamic Itemset Counting
Dynamic Itemset CountingDynamic Itemset Counting
Dynamic Itemset Counting
 
Big Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesBig Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture Capabilities
 
Big Data Ppt PowerPoint Presentation Slides
Big Data Ppt PowerPoint Presentation Slides Big Data Ppt PowerPoint Presentation Slides
Big Data Ppt PowerPoint Presentation Slides
 
Big_data_ppt
Big_data_ppt Big_data_ppt
Big_data_ppt
 
BIG DATA and USE CASES
BIG DATA and USE CASESBIG DATA and USE CASES
BIG DATA and USE CASES
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
 
Big data analysis
Big data analysisBig data analysis
Big data analysis
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Data analytics
Data analyticsData analytics
Data analytics
 
Big Data Ecosystem
Big Data EcosystemBig Data Ecosystem
Big Data Ecosystem
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computing
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
 
Big data and Hadoop
Big data and HadoopBig data and Hadoop
Big data and Hadoop
 
Introduction to Business Intelligence
Introduction to Business IntelligenceIntroduction to Business Intelligence
Introduction to Business Intelligence
 
Big data visualization
Big data visualizationBig data visualization
Big data visualization
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 

Similar to Big Data Evolution

Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigManish Chopra
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewDenodo
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadhMithlesh Sadh
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusersBob Hardaway
 
Manoj Kolhe - Presentation - ITW_PPT_Big_Data_Testingv1.6
Manoj Kolhe - Presentation - ITW_PPT_Big_Data_Testingv1.6Manoj Kolhe - Presentation - ITW_PPT_Big_Data_Testingv1.6
Manoj Kolhe - Presentation - ITW_PPT_Big_Data_Testingv1.6Manoj Kolhe
 
Digital intelligence satish bhatia
Digital intelligence satish bhatiaDigital intelligence satish bhatia
Digital intelligence satish bhatiaSatish Bhatia
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataSpringPeople
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Denodo
 
Driving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsDriving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsEmbarcadero Technologies
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Denodo
 
SMAC - Social, Mobile, Analytics and Cloud - An overview
SMAC - Social, Mobile, Analytics and Cloud - An overview SMAC - Social, Mobile, Analytics and Cloud - An overview
SMAC - Social, Mobile, Analytics and Cloud - An overview Rajesh Menon
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Denodo
 
Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedcedrinemadera
 

Similar to Big Data Evolution (20)

Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
 
Manoj Kolhe - Presentation - ITW_PPT_Big_Data_Testingv1.6
Manoj Kolhe - Presentation - ITW_PPT_Big_Data_Testingv1.6Manoj Kolhe - Presentation - ITW_PPT_Big_Data_Testingv1.6
Manoj Kolhe - Presentation - ITW_PPT_Big_Data_Testingv1.6
 
Digital intelligence satish bhatia
Digital intelligence satish bhatiaDigital intelligence satish bhatia
Digital intelligence satish bhatia
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
Driving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsDriving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data Assets
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 
SMAC - Social, Mobile, Analytics and Cloud - An overview
SMAC - Social, Mobile, Analytics and Cloud - An overview SMAC - Social, Mobile, Analytics and Cloud - An overview
SMAC - Social, Mobile, Analytics and Cloud - An overview
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
 
Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-shared
 

More from itnewsafrica

Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Kenneth Palliam- Cybersecurity Maturity: The Role of the GITO Considering New...
Kenneth Palliam- Cybersecurity Maturity: The Role of the GITO Considering New...Kenneth Palliam- Cybersecurity Maturity: The Role of the GITO Considering New...
Kenneth Palliam- Cybersecurity Maturity: The Role of the GITO Considering New...itnewsafrica
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...itnewsafrica
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sectoritnewsafrica
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
Ansgar Pabst- Disruptive Innovation through Corporate Collaboration with Star...
Ansgar Pabst- Disruptive Innovation through Corporate Collaboration with Star...Ansgar Pabst- Disruptive Innovation through Corporate Collaboration with Star...
Ansgar Pabst- Disruptive Innovation through Corporate Collaboration with Star...itnewsafrica
 
Koen den Hollander- The Future is Omni
Koen den Hollander- The Future is OmniKoen den Hollander- The Future is Omni
Koen den Hollander- The Future is Omniitnewsafrica
 
Wongama Millie- South African Social Media Insights 2023
Wongama Millie- South African Social Media Insights 2023Wongama Millie- South African Social Media Insights 2023
Wongama Millie- South African Social Media Insights 2023itnewsafrica
 
Emphasising Personalization and Customer Journey Mapping in Digital Retail
Emphasising Personalization and  Customer Journey Mapping in Digital  RetailEmphasising Personalization and  Customer Journey Mapping in Digital  Retail
Emphasising Personalization and Customer Journey Mapping in Digital Retailitnewsafrica
 
Munyaradzi Nyikavaranda- Assessing the intersect between UX, AI, Big Data: Cr...
Munyaradzi Nyikavaranda- Assessing the intersect between UX, AI, Big Data: Cr...Munyaradzi Nyikavaranda- Assessing the intersect between UX, AI, Big Data: Cr...
Munyaradzi Nyikavaranda- Assessing the intersect between UX, AI, Big Data: Cr...itnewsafrica
 
Data Analytics & Customer Insights as enablers of businesses to employ predic...
Data Analytics & Customer Insights as enablers of businesses to employ predic...Data Analytics & Customer Insights as enablers of businesses to employ predic...
Data Analytics & Customer Insights as enablers of businesses to employ predic...itnewsafrica
 
Mark Cockerell- A New Era of Retail Data Integration Mark Cockerell Retail ...
Mark Cockerell- A New Era of  Retail Data  Integration Mark Cockerell Retail ...Mark Cockerell- A New Era of  Retail Data  Integration Mark Cockerell Retail ...
Mark Cockerell- A New Era of Retail Data Integration Mark Cockerell Retail ...itnewsafrica
 
Pravir Ishvarlal- Artificial Intelligence in Healthcare
Pravir Ishvarlal- Artificial Intelligence in HealthcarePravir Ishvarlal- Artificial Intelligence in Healthcare
Pravir Ishvarlal- Artificial Intelligence in Healthcareitnewsafrica
 
Braden van Breda- The Role of AI, Robotics in African Healthcare
Braden van Breda- The Role of AI, Robotics in African HealthcareBraden van Breda- The Role of AI, Robotics in African Healthcare
Braden van Breda- The Role of AI, Robotics in African Healthcareitnewsafrica
 
Rodney Taylor- AVA Disrupts Primary Healthcare with the Latest Asynchronous I...
Rodney Taylor- AVA Disrupts Primary Healthcare with the Latest Asynchronous I...Rodney Taylor- AVA Disrupts Primary Healthcare with the Latest Asynchronous I...
Rodney Taylor- AVA Disrupts Primary Healthcare with the Latest Asynchronous I...itnewsafrica
 
Anish Gupta- Smart Care Coordination Platform
Anish Gupta- Smart Care Coordination PlatformAnish Gupta- Smart Care Coordination Platform
Anish Gupta- Smart Care Coordination Platformitnewsafrica
 
Andrew Roberts- How Technology can Transform Healthcare for the Better
Andrew Roberts- How Technology can Transform Healthcare for the BetterAndrew Roberts- How Technology can Transform Healthcare for the Better
Andrew Roberts- How Technology can Transform Healthcare for the Betteritnewsafrica
 
Andrew Roberts - Mobile Health Apps for Improved Patient Engagement and Educa...
Andrew Roberts - Mobile Health Apps for Improved Patient Engagement and Educa...Andrew Roberts - Mobile Health Apps for Improved Patient Engagement and Educa...
Andrew Roberts - Mobile Health Apps for Improved Patient Engagement and Educa...itnewsafrica
 

More from itnewsafrica (20)

Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Kenneth Palliam- Cybersecurity Maturity: The Role of the GITO Considering New...
Kenneth Palliam- Cybersecurity Maturity: The Role of the GITO Considering New...Kenneth Palliam- Cybersecurity Maturity: The Role of the GITO Considering New...
Kenneth Palliam- Cybersecurity Maturity: The Role of the GITO Considering New...
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
Ansgar Pabst- Disruptive Innovation through Corporate Collaboration with Star...
Ansgar Pabst- Disruptive Innovation through Corporate Collaboration with Star...Ansgar Pabst- Disruptive Innovation through Corporate Collaboration with Star...
Ansgar Pabst- Disruptive Innovation through Corporate Collaboration with Star...
 
Koen den Hollander- The Future is Omni
Koen den Hollander- The Future is OmniKoen den Hollander- The Future is Omni
Koen den Hollander- The Future is Omni
 
Wongama Millie- South African Social Media Insights 2023
Wongama Millie- South African Social Media Insights 2023Wongama Millie- South African Social Media Insights 2023
Wongama Millie- South African Social Media Insights 2023
 
Emphasising Personalization and Customer Journey Mapping in Digital Retail
Emphasising Personalization and  Customer Journey Mapping in Digital  RetailEmphasising Personalization and  Customer Journey Mapping in Digital  Retail
Emphasising Personalization and Customer Journey Mapping in Digital Retail
 
Munyaradzi Nyikavaranda- Assessing the intersect between UX, AI, Big Data: Cr...
Munyaradzi Nyikavaranda- Assessing the intersect between UX, AI, Big Data: Cr...Munyaradzi Nyikavaranda- Assessing the intersect between UX, AI, Big Data: Cr...
Munyaradzi Nyikavaranda- Assessing the intersect between UX, AI, Big Data: Cr...
 
Data Analytics & Customer Insights as enablers of businesses to employ predic...
Data Analytics & Customer Insights as enablers of businesses to employ predic...Data Analytics & Customer Insights as enablers of businesses to employ predic...
Data Analytics & Customer Insights as enablers of businesses to employ predic...
 
Mark Cockerell- A New Era of Retail Data Integration Mark Cockerell Retail ...
Mark Cockerell- A New Era of  Retail Data  Integration Mark Cockerell Retail ...Mark Cockerell- A New Era of  Retail Data  Integration Mark Cockerell Retail ...
Mark Cockerell- A New Era of Retail Data Integration Mark Cockerell Retail ...
 
Pravir Ishvarlal- Artificial Intelligence in Healthcare
Pravir Ishvarlal- Artificial Intelligence in HealthcarePravir Ishvarlal- Artificial Intelligence in Healthcare
Pravir Ishvarlal- Artificial Intelligence in Healthcare
 
Braden van Breda- The Role of AI, Robotics in African Healthcare
Braden van Breda- The Role of AI, Robotics in African HealthcareBraden van Breda- The Role of AI, Robotics in African Healthcare
Braden van Breda- The Role of AI, Robotics in African Healthcare
 
Rodney Taylor- AVA Disrupts Primary Healthcare with the Latest Asynchronous I...
Rodney Taylor- AVA Disrupts Primary Healthcare with the Latest Asynchronous I...Rodney Taylor- AVA Disrupts Primary Healthcare with the Latest Asynchronous I...
Rodney Taylor- AVA Disrupts Primary Healthcare with the Latest Asynchronous I...
 
Anish Gupta- Smart Care Coordination Platform
Anish Gupta- Smart Care Coordination PlatformAnish Gupta- Smart Care Coordination Platform
Anish Gupta- Smart Care Coordination Platform
 
Andrew Roberts- How Technology can Transform Healthcare for the Better
Andrew Roberts- How Technology can Transform Healthcare for the BetterAndrew Roberts- How Technology can Transform Healthcare for the Better
Andrew Roberts- How Technology can Transform Healthcare for the Better
 
Andrew Roberts - Mobile Health Apps for Improved Patient Engagement and Educa...
Andrew Roberts - Mobile Health Apps for Improved Patient Engagement and Educa...Andrew Roberts - Mobile Health Apps for Improved Patient Engagement and Educa...
Andrew Roberts - Mobile Health Apps for Improved Patient Engagement and Educa...
 

Recently uploaded

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

Big Data Evolution

  • 1. Evolution of Big Data ICT Business Breakfast Durban, 17 September 2014 Willy Govender
  • 2. What is Big Data? “Large volumes of a wide variety of data collected from various sources across the enterprise including transactional data from enterprise applications/databases, social media data, mobile device data, unstructured data/documents, machine-generated data and more.“ Source: IDG: Big Data – Growing Trends and Emerging Opportunities
  • 3. Data Sources Structured •Spreadsheets •Relational Databases •ERP •CRM •Legacy systems •File share Unstructured •Documents •Machine Data •Messaging •Photographs •Video •Social Media •Web traffic logs "90% of all data ever created, was created in the past two years. From now on, the amount of data in the world will double every two years." Enterprise Cloud
  • 4. The Evolution of Big Data Big data is traditionally referred to as 3Vs (now 5V, 7V) Volume (amount of data collected – terabytes/exabytes) Velocity (speed/frequency at which data is collected) Variety (different types of data collected) Now experts are adding “veracity, variability, visualization, and value” Big data is not new Supercomputers have been collecting scientific/research data for decades However, now its uses are being seen in commercial competitive advantages And now we are able to collect a variety of data from multiple devices and sources Is the evolution of the BI ecosystem from data warehousing Does not make DW obsolete Big Data approaches are reducing the costs of data management Data still needs to be standardized, data quality maintained, and access provided to constituent communities. Data management will continue to be an evolutionary process. Big data is simply a new data challenge that requires leveraging existing systems in a different way
  • 5. So, what does Big Data do? Focuses on finding hidden threads, trends, or patterns which may be invisible to the naked eye Data store of clusters of servers (eg. Apache Hadoop used for Amazon Cloud) A set of tasks that processes the data in different segments of the cluster then breaks down the results to more manageable chunks which are Requires mathematical and statistical expertise as well as creative, communicative, problem-solving, and business skills summarized Obviates the need for Data alignment or Data migration, or the requirement to move data into one place for cross-referencing. This achieved through indexes and crawlers (like Google) which constantly mine data update the indexes.
  • 6. Framework and Data Flows Data Models, Structures, Types •Data formats, non/relational, file systems, etc. •Big Data Management Big Data Lifecycle (Management) •Big Data transformation/staging •Recording, Storage, Archiving Big Data Analytics and Tools •Big Data Applications •Target use, presentation, visualisation Big Data Infrastructure (BDI) •Storage, Compute, (High Performance Computing,) Network •Sensor network, target/actionable devices •Big Data Operational support Big Data Security •Data security in-rest, in-move, trusted processing environments Collection and Registration Filtering, Classification and Enrichment Analytics, Modelling and Prediction Presentation and Visualization
  • 7. What challenges can you expect Platforms •High end data warehousing tools •Open source technologies challenging with accessing data from multiple servers rapidly in native form •Selection of Enterprise Search Tools Skills •Managing Data Volumes •Ability to really understand what can be achieved •Open source platforms not easy to use •Data scientists now required Leadership •New territory for IT professionals, so planning, marketing, ROI etc is an issue •Getting Data on the Board's agenda Walmart analyses real-time social media data for trend to guide online ad purchases
  • 8. Enterprise Search: Vendors TCO FEATURE SET Low High Low High Niche Progressive Niche Traditional Niche Progressive Niche Traditional
  • 9. Challenges in Big Data — Increasing Amount of Disorganized Data and Data Sources (structured & unstructured) Provides greater opportunity for failure – lack of information can lead to wrong decisions Limits productivity – more time and effort needed to find information Frustrates search users – information is expected to be readily available and complete — Not tackling Big Data in enterprises … Marketing Data Data Warehouse Social Media Research Databases Office Files Transactional Data Acquisition Data → DIGITAL DATA VOLUME 2010 2012 2014 2016 2018 2020 Etc.
  • 10. Opportunity in Big Data Source: IDC 35 Zetabytes DIGITAL DATA VOLUME 2010 2012 2014 2016 2018 2020 STATUS QUO — Accessible Data Has Value 48% CAGR1 No Specific Solutions Too hard and expensive Homegrown Hard to maintain and insufficient Traditional Solutions Waste countless months on inflexible solutions — Solution Types
  • 11. Q-Sensei Product – Aimed at bringing Big Data approach to all Enterprises — Traditional Approaches — Q-Sensei Revolution •Complex products •Rigid delivery model •Pre-defined usage •Expensive •Limited audience •Exhausting implementation •Disparate solutions •Poor interaction design •Simple •Powerful •Fast •Flexible •Broad application •Interactive •Easy delivery model •For everyone
  • 12. Case Study mention in Wall Street Journal in 2012 They were able to analyze traffic details for various devices, spot problem areas and add network throughput to help prepare for future demand. Netflix was also able to get more insight into the type of content customers preferred, which enabled them to make more accurate suggestions as to what subscribers might like.
  • 13. Case Study — Overview •Premiere Internet subscription service for streaming media and DVD-by-mail services •Over 50 million subscribers in 40+ countries; Revenue 2013: $4.37 billion •Contract Management: Permission/licensing agreements with content creators •Leader in interactive, contextual search changing the way companies search and analyze data •Patented powerful multidimensional search and index capability •Gives developers full access to award- winning technology and empowers them to built robust search and analytics applications for all data needs World's Leading Internet television network (ITN)
  • 14. Case Study – Search in Contracts — Goals and Key Challenges 1.Make searching their copious contract documentation better manageable and easier to use for end users 2.Integrate and unify their highly structured metadata with their unstructured content data 3.Incorporate Optical Character Recognition (OCR) of scanned documents during data ingestion process 4.Integrate with in-house, Drupal-based content management system 5.Flexibility to consume the data from their custom system 6.Data model that meets various needs of personnel 7.Timeline of only 3 month
  • 15. Case Study – Search in Contracts — Solution and Successes 1.In 3 month Q-Sensei conceptualized and deployed a solution for contract search needs using Fuse (including usability testing) 2.Addition of further capabilities based on end user feedback: •n-gram phrase search •date range search •multi-sort of facets •grid view of results 3.The flexibility and modular architecture of Fuse enables customer to implement the platform for further use cases (knowledge base search, log analysis, usage analysis, etc.)
  • 16. Demo — Q-Sensei Medical Demo •Unified Access to Publications, Grants, Patents, Office Files, Person •Content-Based Faceted Auto Complete •Dynamic Faceting •Search-within-a-search capability •Data Interaction and deep Data Correlations •360-degree view of information •Multi-Dimensional Visualization •Customizable Search Interface •Integrated Data Sources (21m Publications, 1,8m Grants, 1,5m Patents, Office Files (DOC, XLS, PPT, PDF,…) , Person DB ) Set-up (Harvesting, Importing, Data Transformation, Indexing) in 5 days
  • 17. Performance Metrics Sample System System Configuration Performance Based on Sample System •Intel Ivy Bridge Quadcore 3.4GHz •32GB RAM •1TB HD •64-bit Linux •Up to 80 million documents can be indexed •Up to 20 million records can be uploaded per hour (more than 5,000/sec) •100,000 search queries can be processed per minute per million documents; a query includes: •processing of search expression (including fulltext) •computation of eight (8) standard facets (Latest test: September 2013)
  • 18. Contract Management Search •Create a more accurate and efficient contract search by exposing all metadata and using facets •Search scanned documents with advanced OCR capabilities Knowledge Base / Support Center Search •Increase the efficiency of finding answers by utilizing more metadata in your knowledge base •Embrace tags and faceted search over hierarchy to find answers more quickly Enterprise Search •Unify your company’s information by searching all sources simultaneously •Increase the productivity of everyone with better data accessibility Usage Analysis •Increase speed and agility of customer activity analysis by embracing a multidimensional view of your data •Drive dynamic visualizations and build complex queries Structured Data Analysis •Understand the composition of data, find relationships, and identify trends •View data more accurately by analyzing all attributes simultaneously E-Commerce Faceted Navigation •More accurately represent your products with dynamically updating facets that perform at scale •Power more meaningful recommendations with the capability to use more metadata Further Use Cases — A Single Platform for Everything
  • 19. Other Examples East London Rural Mapping