Implementation challenges in Big Data - Dr. Nilesh Karnik
Upcoming SlideShare
Loading in...5
×
 

Implementation challenges in Big Data - Dr. Nilesh Karnik

on

  • 461 views

In todays competitive environment companies are faced with different types of challenges. Implementation of Big Data is one of them. Dr. Nilesh Karnik takes us through some of them.

In todays competitive environment companies are faced with different types of challenges. Implementation of Big Data is one of them. Dr. Nilesh Karnik takes us through some of them.

Statistics

Views

Total Views
461
Views on SlideShare
461
Embed Views
0

Actions

Likes
0
Downloads
3
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Implementation challenges in Big Data - Dr. Nilesh Karnik Implementation challenges in Big Data - Dr. Nilesh Karnik Presentation Transcript

  • Aureus Claims Solution Implementation Challenges in Big Data Footer Option 2 Analytics • Dr. Nilesh N. Karnik Copyright 2013 RESTRICTED CIRCULATION
  • What we will discuss The Challenge of BIG Data ADVANCED Analytics SOLUTIONS in the Pipeline Copyright 2013 RESTRICTED CIRCULATION 2
  • Big Data : Distributed Processing Aureus Claims Solution ! Footer Option 2 OLD IDEA Copyright 2013 RESTRICTED CIRCULATION NEW IDEA 3
  • EXAMPLE 1: Task of storing books on a shelf Aureus Claims Solution Footer Option 2 Image source Flickr. Image copyright belongs with original artist. Simple, right? Copyright 2013 RESTRICTED CIRCULATION 5
  • EXAMPLE 1: Task of storing books on a shelf Aureus Claims Solution And now? Footer Option 2 Image source Flickr. Image copyright belongs with original artist. Copyright 2013 RESTRICTED CIRCULATION 6
  • Aureus Claims Solution Footer Option 2 Image source Flickr. Image copyright belongs with original artist. Copyright 2013 RESTRICTED CIRCULATION 7
  • EXAMPLE 2 : Summarizing a Report Aureus Claims Solution SUMMER PROJECT REPORT Footer Option 2 Simple, right? Copyright 2013 RESTRICTED CIRCULATION 8
  • EXAMPLE 2 : Summarizing a Report Aureus Claims Solution Footer Option 2 And now? Copyright 2013 RESTRICTED CIRCULATION 9
  • EXAMPLE 3 : Baking a Cake Simple, right? Aureus Claims Solution And now? Footer Option 2 Image source PINTEREST. Image copyright belongs with original artist. Copyright 2013 RESTRICTED CIRCULATION 10
  • Advanced Analytics • Well developed tool set for “small data” environment • Aureus Claims Solution Challenges in Big Data environment Footer Option 2 Copyright 2013 RESTRICTED CIRCULATION 11
  • Advanced Analytics: MapReduce Difficulties Aureus Claims Solution Footer Option 2 ITERATIVE Image source Flickr. Image copyright belongs with original artist. Copyright 2013 RESTRICTED CIRCULATION 12
  • Advanced Analytics: MapReduce Difficulties Aureus Claims Solution Footer Option 2 INCREMENTAL PROCESSING REQUIRES RESTART Image source Flickr. Image copyright belongs with original artist. Copyright 2013 RESTRICTED CIRCULATION 13
  • Advanced Analytics: MapReduce Difficulties Aureus Claims Solution Footer Option 2 BATCH LEARNING SCANS ALL DATA IN ONE GO Copyright 2013 RESTRICTED CIRCULATION 14
  • Some Solutions Data Scientists are working on Aureus Claims Solution New frameworks • E.g., HaLoop*, PrIter# (Extensions of Hadoop) • Percolator$ (Proprietary Google framework) Footer Option 2 * Y. Bu, B. Howe, M. Balazinska, and M. Ernst, “HaLoop: Efficient iterative data processing on large clusters”, VLDB, 2010. # Y. Zhang, Q. Gao, L. Gao and C. Wang, “PrIter: A distributed framework for prioritized iterative computations”, SoCC, 2011. $ D. Peng and F. Dabek, “Large-scale incremental processing using distributed transactions and notifications”, OSDI, 2010 Copyright 2013 RESTRICTED CIRCULATION 15
  • Some Solutions Data Scientists are working on Aureus Claims Solution Smarter algorithms / Different implementations • Random forest • Parallelized Stochastic Gradient Descent Footer Option 2 Copyright 2013 RESTRICTED CIRCULATION 16
  • @nilesh_karnik Nilesh@aureusanalytics.com Copyright 2013 RESTRICTED CIRCULATION 17
  • Aureus Claims Solution ThankOption 2 Footer You! SINGAPORE INDIA Aureus Analytics Pte. Ltd. 17, Phillip Street, #05-01, Grand Building Singapore (048695) Aureus Analytics Pvt. Ltd. 706, Powai Plaza Hiranandani Gardens, Powai Mumbai – 400076 info@aureusanalytics.com Copyright 2013 RESTRICTED CIRCULATION www.aureusanalytics.com