SlideShare a Scribd company logo
Beverages

Big Data : Market Research
Business Situation
                                                                                      Client Profile
The client sources the market research data from one of the largest market
research companies, which provides around 34 trillion data points of relevant         The client is division of
market research data, which was made available to client using their proprietary      the world's largest
tool. The market research department of client would then work through this data      beverage company.
for about two months to manually create a 220 tabbed excel report, after doing        Globally, through the
analytics and calculations on raw data manually, using tools provided by one of       world's largest beverage
the largest market research companies, and using other tools, to be used by           distribution system,
marketers and brand managers to make rightful business decisions.                     consumers in more than
                                                                                      200 countries enjoy their
This was a very inefficient approach since there was loss of valuable time until the    sparkling beverages,
report was published. Most of the process was manual; giving scope to human           ready-to-drink coffees,
errors. The reports were shared through emails, physical documents creating lack      juices, juice drinks and
of flexibility and easy availability. Creating new reports was extremely time          beverages at the rate of
consuming and an expensive process. The client has to source some of the data         1.8 billion servings a day!
from one of the largest market research companies, which usually turned out to
be expensive.

Solution Approach

We got the data from the market research company in the form of raw database
files, and processed this data, to generate a macro level database in the form of
OLAP cubes, to be served to end users through a web based, advanced data
visualization platform called iCharts. Since this was a huge data set that needs to
be processed and analyzed, the data was processed in parallel through a
MapReduce implementation.

The highly complex, proprietary format raw data files provided by the research
companies firm , amounted to around 130GB! We had to deal with the data at the
SKU level to get some of the KPIs required by client. The reports to be generated




You can read more            Reach out to us at     Via email
about Compassites at         +91 - 80- 4203 2572    info@compassitesinc.com
www.compassitesinc.com       +91 - 80- 6500 2371
needed at highly complex groupings of products and markets. The technical solution designed was as shown below.
The complex data is decoded and loaded by custom written decoders into the tables in the MySQL database in the
form of readable raw data. MapReduce was used to split the data into multiple EC2 instances for massive parallel
processing, with each EC2 instance executing a pipeline of Linux and MySQL processes on the raw data to reduce it to
macro level data, suitable for analytical processing and load it into the star schema. This data is then loaded into OLAP
cubes for analytical processing.

Technology Used

Adobe Flex, Adobe Life Cycle Data Services, J2EE, MySQL & Pentaho



Benefits & Results

   Fully automatic process preventing any human errors.

   Quick turnaround of reports and the monthly data is now available to managers within 3 days.

   Highly organized dashboard, with advanced data visualization capabilities like filtering.

   Access controlled dashboard to give selective access to different users.

   Individual reports can be downloaded, to be shared with management.

   Availability of filtered raw data allows for creating new reports in short time, with fractional cost.

   Intermediate database allows for merging data from other sources to create an integrated report.

   Built on open source technology stack, thereby saving tremendous cost on software licenses.

   The whole process happens on the elastic cloud of Amazon EC2, which is instantiated only for the duration of
    process, reducing the need to have permanent infrastructure to process the data.




Bangalore Center                                       Pune Center
"Gaayatthri Chambers", #948, 24th Main Rd,2nd Phase,   608 - Kapil Zenith Building, Near Chandani Chowk,
 J.P.Nagar, Bangalore - 560078 Karnataka, India        Off Bangalore - Mumbai bypass, Bavdhan, Pune - 411021
Phone: +91 -80 42032572                                Maharashtra, India Phone:+91-20-6500 2371

More Related Content

What's hot

Business case for Big Data Analytics
Business case for Big Data AnalyticsBusiness case for Big Data Analytics
Business case for Big Data Analytics
Vijay Rao
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
 
Intro to big data and applications - day 2
Intro to big data and applications - day 2Intro to big data and applications - day 2
Intro to big data and applications - day 2
Parviz Vakili
 
Delivering Healthcare Value Through Transformation to Big Data Streams
Delivering Healthcare Value Through Transformation to Big Data StreamsDelivering Healthcare Value Through Transformation to Big Data Streams
Delivering Healthcare Value Through Transformation to Big Data Streams
Andy Ashta
 
Denodo DataFest 2016: ROI Justification in Data Virtualization
Denodo DataFest 2016: ROI Justification in Data VirtualizationDenodo DataFest 2016: ROI Justification in Data Virtualization
Denodo DataFest 2016: ROI Justification in Data Virtualization
Denodo
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo
 
5 Big Data Use Cases for 2013
5 Big Data Use Cases for 20135 Big Data Use Cases for 2013
5 Big Data Use Cases for 2013
Infochimps, a CSC Big Data Business
 
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB
 
Why Business Intelligence Should Consider Agile Modern Data Delivery Platform
Why Business Intelligence Should Consider Agile Modern Data Delivery PlatformWhy Business Intelligence Should Consider Agile Modern Data Delivery Platform
Why Business Intelligence Should Consider Agile Modern Data Delivery Platform
syed_javed
 
What is Big Data ?
What is Big Data ?What is Big Data ?
What is Big Data ?
AkhmadZakiAlsafi
 
Ensuring compliance of patient data with big data
Ensuring compliance of patient data with big dataEnsuring compliance of patient data with big data
Ensuring compliance of patient data with big data
Ayad Shammout
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
Denodo
 
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
AYESHA JAVED
 
Exercise solution of chapter3 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter3 of datawarehouse cs614(solution of exercise)Exercise solution of chapter3 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter3 of datawarehouse cs614(solution of exercise)
AYESHA JAVED
 
IoTconnect Data Sheet Final Version
IoTconnect Data Sheet Final VersionIoTconnect Data Sheet Final Version
IoTconnect Data Sheet Final VersionDaniel Sapir
 
Big data
Big dataBig data
Big data
kalyani reddy
 
Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop
iACT Global
 
Log analyzer Needle in a haystack
Log analyzer  Needle in a haystackLog analyzer  Needle in a haystack
Log analyzer Needle in a haystack
CenterRetro
 
Data Virtualization at Logitech = #Winning
Data Virtualization at Logitech = #WinningData Virtualization at Logitech = #Winning
Data Virtualization at Logitech = #Winning
Denodo
 

What's hot (20)

Business case for Big Data Analytics
Business case for Big Data AnalyticsBusiness case for Big Data Analytics
Business case for Big Data Analytics
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Intro to big data and applications - day 2
Intro to big data and applications - day 2Intro to big data and applications - day 2
Intro to big data and applications - day 2
 
Delivering Healthcare Value Through Transformation to Big Data Streams
Delivering Healthcare Value Through Transformation to Big Data StreamsDelivering Healthcare Value Through Transformation to Big Data Streams
Delivering Healthcare Value Through Transformation to Big Data Streams
 
Denodo DataFest 2016: ROI Justification in Data Virtualization
Denodo DataFest 2016: ROI Justification in Data VirtualizationDenodo DataFest 2016: ROI Justification in Data Virtualization
Denodo DataFest 2016: ROI Justification in Data Virtualization
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
 
5 Big Data Use Cases for 2013
5 Big Data Use Cases for 20135 Big Data Use Cases for 2013
5 Big Data Use Cases for 2013
 
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
MongoDB World 2019: Managing a Heterogeneous Data Stack with Informatica and ...
 
Why Business Intelligence Should Consider Agile Modern Data Delivery Platform
Why Business Intelligence Should Consider Agile Modern Data Delivery PlatformWhy Business Intelligence Should Consider Agile Modern Data Delivery Platform
Why Business Intelligence Should Consider Agile Modern Data Delivery Platform
 
What is Big Data ?
What is Big Data ?What is Big Data ?
What is Big Data ?
 
Ensuring compliance of patient data with big data
Ensuring compliance of patient data with big dataEnsuring compliance of patient data with big data
Ensuring compliance of patient data with big data
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
 
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
 
Exercise solution of chapter3 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter3 of datawarehouse cs614(solution of exercise)Exercise solution of chapter3 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter3 of datawarehouse cs614(solution of exercise)
 
IoTconnect Data Sheet Final Version
IoTconnect Data Sheet Final VersionIoTconnect Data Sheet Final Version
IoTconnect Data Sheet Final Version
 
Big data
Big dataBig data
Big data
 
Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop
 
Big Data analytics
Big Data analyticsBig Data analytics
Big Data analytics
 
Log analyzer Needle in a haystack
Log analyzer  Needle in a haystackLog analyzer  Needle in a haystack
Log analyzer Needle in a haystack
 
Data Virtualization at Logitech = #Winning
Data Virtualization at Logitech = #WinningData Virtualization at Logitech = #Winning
Data Virtualization at Logitech = #Winning
 

Viewers also liked

ejercicios tercera evaluación
ejercicios tercera evaluación ejercicios tercera evaluación
ejercicios tercera evaluación Raul Diaz Alvarez
 
Que es un podcast axel sosa
Que es un podcast axel sosaQue es un podcast axel sosa
Que es un podcast axel sosaaxelsosa
 
Captains cool on M&E revenue
Captains cool on M&E revenueCaptains cool on M&E revenue
Captains cool on M&E revenuemxmindia
 
STC Associates - Social Media Presentation
STC Associates - Social Media PresentationSTC Associates - Social Media Presentation
STC Associates - Social Media PresentationSTC Associates
 
West point bridge
West point bridgeWest point bridge
West point bridge
Elba Sepúlveda
 
New realities 2014
New realities 2014New realities 2014
New realities 2014mxmindia
 
Droidcon 2011: Mosync mobile framework, Stefan Sels, Tronicum
Droidcon 2011: Mosync mobile framework, Stefan Sels, TronicumDroidcon 2011: Mosync mobile framework, Stefan Sels, Tronicum
Droidcon 2011: Mosync mobile framework, Stefan Sels, Tronicum
Droidcon Berlin
 
Legislacion[2]
Legislacion[2]Legislacion[2]
Legislacion[2]
guesta17bc2
 
Los pasos del Cristiano 7
Los pasos del Cristiano 7Los pasos del Cristiano 7
Los pasos del Cristiano 7
Adriana Delgadillo
 
My last summer
My last summerMy last summer
My last summernujaen
 
An introduction to "BatAAr" & a brief overview of 'Art Metal'.
An introduction to "BatAAr" & a brief overview of 'Art Metal'. An introduction to "BatAAr" & a brief overview of 'Art Metal'.
An introduction to "BatAAr" & a brief overview of 'Art Metal'. Jordan Ellis
 
Fortum Klaipėda: new energy phase
Fortum Klaipėda: new energy phaseFortum Klaipėda: new energy phase
Fortum Klaipėda: new energy phaseFortum Heat Lietuva
 
Google docs axel sosa10
Google docs axel sosa10Google docs axel sosa10
Google docs axel sosa10axelsosa
 
Pw c table03
Pw c table03Pw c table03
Pw c table03mxmindia
 
10mandamientos
10mandamientos10mandamientos
10mandamientos
Adriana Delgadillo
 
رضا شهاب المكّي - الصحافة
رضا شهاب المكّي - الصحافةرضا شهاب المكّي - الصحافة
رضا شهاب المكّي - الصحافةSonia Charbti
 
Kierkegaard. seduzitzailearen egunkaria
Kierkegaard. seduzitzailearen egunkariaKierkegaard. seduzitzailearen egunkaria
Kierkegaard. seduzitzailearen egunkariahausnartzen
 
Presocraticos
PresocraticosPresocraticos
Presocraticosmrlnlrnt
 

Viewers also liked (20)

ejercicios tercera evaluación
ejercicios tercera evaluación ejercicios tercera evaluación
ejercicios tercera evaluación
 
Que es un podcast axel sosa
Que es un podcast axel sosaQue es un podcast axel sosa
Que es un podcast axel sosa
 
Captains cool on M&E revenue
Captains cool on M&E revenueCaptains cool on M&E revenue
Captains cool on M&E revenue
 
STC Associates - Social Media Presentation
STC Associates - Social Media PresentationSTC Associates - Social Media Presentation
STC Associates - Social Media Presentation
 
43 Oportunismo
43 Oportunismo43 Oportunismo
43 Oportunismo
 
West point bridge
West point bridgeWest point bridge
West point bridge
 
New realities 2014
New realities 2014New realities 2014
New realities 2014
 
Sistémica subir
Sistémica subirSistémica subir
Sistémica subir
 
Droidcon 2011: Mosync mobile framework, Stefan Sels, Tronicum
Droidcon 2011: Mosync mobile framework, Stefan Sels, TronicumDroidcon 2011: Mosync mobile framework, Stefan Sels, Tronicum
Droidcon 2011: Mosync mobile framework, Stefan Sels, Tronicum
 
Legislacion[2]
Legislacion[2]Legislacion[2]
Legislacion[2]
 
Los pasos del Cristiano 7
Los pasos del Cristiano 7Los pasos del Cristiano 7
Los pasos del Cristiano 7
 
My last summer
My last summerMy last summer
My last summer
 
An introduction to "BatAAr" & a brief overview of 'Art Metal'.
An introduction to "BatAAr" & a brief overview of 'Art Metal'. An introduction to "BatAAr" & a brief overview of 'Art Metal'.
An introduction to "BatAAr" & a brief overview of 'Art Metal'.
 
Fortum Klaipėda: new energy phase
Fortum Klaipėda: new energy phaseFortum Klaipėda: new energy phase
Fortum Klaipėda: new energy phase
 
Google docs axel sosa10
Google docs axel sosa10Google docs axel sosa10
Google docs axel sosa10
 
Pw c table03
Pw c table03Pw c table03
Pw c table03
 
10mandamientos
10mandamientos10mandamientos
10mandamientos
 
رضا شهاب المكّي - الصحافة
رضا شهاب المكّي - الصحافةرضا شهاب المكّي - الصحافة
رضا شهاب المكّي - الصحافة
 
Kierkegaard. seduzitzailearen egunkaria
Kierkegaard. seduzitzailearen egunkariaKierkegaard. seduzitzailearen egunkaria
Kierkegaard. seduzitzailearen egunkaria
 
Presocraticos
PresocraticosPresocraticos
Presocraticos
 

Similar to Beverages

Offshore Projects
Offshore ProjectsOffshore Projects
Offshore Projects
Steve Sugulas
 
Denodo DataFest 2017: Conquering the Edge with Data Virtualization
Denodo DataFest 2017: Conquering the Edge with Data VirtualizationDenodo DataFest 2017: Conquering the Edge with Data Virtualization
Denodo DataFest 2017: Conquering the Edge with Data Virtualization
Denodo
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Sreedhar Chowdam
 
Paris FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationParis FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant Presentation
Abdelkrim Hadjidj
 
Streaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsStreaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of Things
DatawatchCorporation
 
Hitachi Streaming Data Platform_v8
Hitachi Streaming Data Platform_v8Hitachi Streaming Data Platform_v8
Hitachi Streaming Data Platform_v8Navaid Khan
 
Hitachi Streaming Data Platform
Hitachi Streaming Data PlatformHitachi Streaming Data Platform
Hitachi Streaming Data Platform
Navaid Khan
 
Hitachi streaming data platform v8
Hitachi streaming data platform v8Hitachi streaming data platform v8
Hitachi streaming data platform v8
Navaid Khan
 
Big Data Paris - A Modern Enterprise Architecture
Big Data Paris - A Modern Enterprise ArchitectureBig Data Paris - A Modern Enterprise Architecture
Big Data Paris - A Modern Enterprise Architecture
MongoDB
 
Bigdata
BigdataBigdata
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Denodo
 
Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"
Lviv Startup Club
 
intelligent-data-lake_executive-brief
intelligent-data-lake_executive-briefintelligent-data-lake_executive-brief
intelligent-data-lake_executive-briefLindy-Anne Botha
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Denodo
 
Big data – A Review
Big data – A ReviewBig data – A Review
Big data – A Review
IRJET Journal
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
Experfy
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
VMware Tanzu
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
FredReynolds2
 
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Databricks
 
Unlocking Operational Intelligence from the Data Lake
Unlocking Operational Intelligence from the Data LakeUnlocking Operational Intelligence from the Data Lake
Unlocking Operational Intelligence from the Data Lake
MongoDB
 

Similar to Beverages (20)

Offshore Projects
Offshore ProjectsOffshore Projects
Offshore Projects
 
Denodo DataFest 2017: Conquering the Edge with Data Virtualization
Denodo DataFest 2017: Conquering the Edge with Data VirtualizationDenodo DataFest 2017: Conquering the Edge with Data Virtualization
Denodo DataFest 2017: Conquering the Edge with Data Virtualization
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Paris FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationParis FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant Presentation
 
Streaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of ThingsStreaming and Visual Data Discovery for the Internet of Things
Streaming and Visual Data Discovery for the Internet of Things
 
Hitachi Streaming Data Platform_v8
Hitachi Streaming Data Platform_v8Hitachi Streaming Data Platform_v8
Hitachi Streaming Data Platform_v8
 
Hitachi Streaming Data Platform
Hitachi Streaming Data PlatformHitachi Streaming Data Platform
Hitachi Streaming Data Platform
 
Hitachi streaming data platform v8
Hitachi streaming data platform v8Hitachi streaming data platform v8
Hitachi streaming data platform v8
 
Big Data Paris - A Modern Enterprise Architecture
Big Data Paris - A Modern Enterprise ArchitectureBig Data Paris - A Modern Enterprise Architecture
Big Data Paris - A Modern Enterprise Architecture
 
Bigdata
BigdataBigdata
Bigdata
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
 
Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"Borys Pratsiuk "How to be NVidia partner"
Borys Pratsiuk "How to be NVidia partner"
 
intelligent-data-lake_executive-brief
intelligent-data-lake_executive-briefintelligent-data-lake_executive-brief
intelligent-data-lake_executive-brief
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
Big data – A Review
Big data – A ReviewBig data – A Review
Big data – A Review
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
 
Unlocking Operational Intelligence from the Data Lake
Unlocking Operational Intelligence from the Data LakeUnlocking Operational Intelligence from the Data Lake
Unlocking Operational Intelligence from the Data Lake
 

More from Compassites Software Solutions

Acrowit
AcrowitAcrowit
Zyme
ZymeZyme
Transformanz
TransformanzTransformanz
Seedling
SeedlingSeedling
Payback
PaybackPayback
Jiffle
JiffleJiffle
WPA
WPAWPA
Nri touch
Nri touchNri touch
Immumetrix
ImmumetrixImmumetrix
iCharts
iChartsiCharts
Hydratech
HydratechHydratech
Graymatics
GraymaticsGraymatics
Creatlive
CreatliveCreatlive
Authentix
AuthentixAuthentix
9 Lenses
9 Lenses9 Lenses
Compassites Is Hiring
Compassites Is HiringCompassites Is Hiring
Compassites Is Hiring
Compassites Software Solutions
 

More from Compassites Software Solutions (16)

Acrowit
AcrowitAcrowit
Acrowit
 
Zyme
ZymeZyme
Zyme
 
Transformanz
TransformanzTransformanz
Transformanz
 
Seedling
SeedlingSeedling
Seedling
 
Payback
PaybackPayback
Payback
 
Jiffle
JiffleJiffle
Jiffle
 
WPA
WPAWPA
WPA
 
Nri touch
Nri touchNri touch
Nri touch
 
Immumetrix
ImmumetrixImmumetrix
Immumetrix
 
iCharts
iChartsiCharts
iCharts
 
Hydratech
HydratechHydratech
Hydratech
 
Graymatics
GraymaticsGraymatics
Graymatics
 
Creatlive
CreatliveCreatlive
Creatlive
 
Authentix
AuthentixAuthentix
Authentix
 
9 Lenses
9 Lenses9 Lenses
9 Lenses
 
Compassites Is Hiring
Compassites Is HiringCompassites Is Hiring
Compassites Is Hiring
 

Recently uploaded

Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
ViralQR
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 

Recently uploaded (20)

Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 

Beverages

  • 1. Beverages Big Data : Market Research Business Situation Client Profile The client sources the market research data from one of the largest market research companies, which provides around 34 trillion data points of relevant The client is division of market research data, which was made available to client using their proprietary the world's largest tool. The market research department of client would then work through this data beverage company. for about two months to manually create a 220 tabbed excel report, after doing Globally, through the analytics and calculations on raw data manually, using tools provided by one of world's largest beverage the largest market research companies, and using other tools, to be used by distribution system, marketers and brand managers to make rightful business decisions. consumers in more than 200 countries enjoy their This was a very inefficient approach since there was loss of valuable time until the sparkling beverages, report was published. Most of the process was manual; giving scope to human ready-to-drink coffees, errors. The reports were shared through emails, physical documents creating lack juices, juice drinks and of flexibility and easy availability. Creating new reports was extremely time beverages at the rate of consuming and an expensive process. The client has to source some of the data 1.8 billion servings a day! from one of the largest market research companies, which usually turned out to be expensive. Solution Approach We got the data from the market research company in the form of raw database files, and processed this data, to generate a macro level database in the form of OLAP cubes, to be served to end users through a web based, advanced data visualization platform called iCharts. Since this was a huge data set that needs to be processed and analyzed, the data was processed in parallel through a MapReduce implementation. The highly complex, proprietary format raw data files provided by the research companies firm , amounted to around 130GB! We had to deal with the data at the SKU level to get some of the KPIs required by client. The reports to be generated You can read more Reach out to us at Via email about Compassites at +91 - 80- 4203 2572 info@compassitesinc.com www.compassitesinc.com +91 - 80- 6500 2371
  • 2. needed at highly complex groupings of products and markets. The technical solution designed was as shown below. The complex data is decoded and loaded by custom written decoders into the tables in the MySQL database in the form of readable raw data. MapReduce was used to split the data into multiple EC2 instances for massive parallel processing, with each EC2 instance executing a pipeline of Linux and MySQL processes on the raw data to reduce it to macro level data, suitable for analytical processing and load it into the star schema. This data is then loaded into OLAP cubes for analytical processing. Technology Used Adobe Flex, Adobe Life Cycle Data Services, J2EE, MySQL & Pentaho Benefits & Results  Fully automatic process preventing any human errors.  Quick turnaround of reports and the monthly data is now available to managers within 3 days.  Highly organized dashboard, with advanced data visualization capabilities like filtering.  Access controlled dashboard to give selective access to different users.  Individual reports can be downloaded, to be shared with management.  Availability of filtered raw data allows for creating new reports in short time, with fractional cost.  Intermediate database allows for merging data from other sources to create an integrated report.  Built on open source technology stack, thereby saving tremendous cost on software licenses.  The whole process happens on the elastic cloud of Amazon EC2, which is instantiated only for the duration of process, reducing the need to have permanent infrastructure to process the data. Bangalore Center Pune Center "Gaayatthri Chambers", #948, 24th Main Rd,2nd Phase, 608 - Kapil Zenith Building, Near Chandani Chowk, J.P.Nagar, Bangalore - 560078 Karnataka, India Off Bangalore - Mumbai bypass, Bavdhan, Pune - 411021 Phone: +91 -80 42032572 Maharashtra, India Phone:+91-20-6500 2371