Your SlideShare is downloading. ×
Webinar - Introducing Datameer 4.0: Visual, End-to-End
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Saving this for later?

Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime - even offline.

Text the download link to your phone

Standard text messaging rates apply

Webinar - Introducing Datameer 4.0: Visual, End-to-End

433
views

Published on

In Big Data projects, analysts often spend 80% of their time preparing data for analysis. In addition, users don’t have a good understanding of their data quality. …

In Big Data projects, analysts often spend 80% of their time preparing data for analysis. In addition, users don’t have a good understanding of their data quality.

Today there are multiple tools that assist with integration, data preparation, analysis and visualization. However, data quality continues to be one of the biggest challenges businesses face when deploying big data analytics.

People need to profile their data and ensure data quality to get accurate insights and make informed business decisions. Join Datameer as we address these pain points with visualizations at every step.

This webinar will highlight and showcase:

-How visual data profiling reduces the guesswork in the data wrangling process
-Enhanced interactive data mining capabilities reduce time to insight
-A demonstration of the new 4.0 features and functions

Published in: Technology, Business

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
433
On Slideshare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
22
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Introducing Datameer 4.0! Visual, End-to-End!
  • 2. © 2014 Datameer, Inc. All rights reserved. View Recording!! ! You can view the recording of this webinar at:! ! http://info.datameer.com/Online-Slideshare- Datameer-4-0-Visual-End-to-End- OnDemand.html!
  • 3. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. About Our Speakers! Matt Schumpert @datameer! Senior Director, Solutions Engineering! ! Matt has been working in the enterprise infrastructure software space for over 14 years in various capacities, including sales engineering, strategic alliances and consulting.! ! Matt currently runs the pre-sales engineering team at Datameer, supporting all technical aspects of customer engagement from initial contact through roll-out of customers into production.! ! Matt holds a BS in Computer Science from the University of Virginia. ! #datameer @datameer!
  • 4. © 2013 Datameer, Inc. All rights reserved. About Our Speaker ! Matt McManus @datameer Vice President, Engineering Matt has been building enterprise software products for over 10 years with deep experience in architecture, software engineering and team management roles. Matt currently leads the engineering team at Datameer, managing all aspects of product development, releases and quality assurance. Matt attended Boston University where he earned a Bachelor’s degree in Computer Science. #datameer @datameer!
  • 5. © 2014 Datameer, Inc. All rights reserved. The Lean Data Supply Chain!
  • 6. Classical Data Pipeline!
  • 7. Modern Data Pipeline!
  • 8. © 2014 Datameer, Inc. All rights reserved. The Lean Data Supply Chain!
  • 9. © 2014 Datameer, Inc. All rights reserved. Informatica! Talend! Flume! Sqoop! Trifacta! Paxata! PIG! Hive! Impala! Tableau! Platfora! © 2013 Datameer, Inc. All rights reserved. The Lean Data Supply Chain! Integrate! Analyze! Visualize!Prepare!
  • 10. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. An end-to-end Solution! Analytics! Visualization!Data Integration! Any Distro!
  • 11. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Smart Analytics! Clustering gg Column Dependencies Recommendation Decision Trees
  • 12. © 2014 Datameer, Inc. All rights reserved. Enterprise Integration!
  • 13. Introducing Datameer 4.0! Visual Insights at Every Step!
  • 14. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Introducing ‘Flip-Side’
  • 15. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Before ! Integrate! Analyze! Visualize!Prepare!
  • 16. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Now! Integrate! Analyze! Visualize!Prepare!
  • 17. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Problems Solved Before:! With Datameer 4.0:! Multiple Tools! Not for business! Visualize at End! Single Platform! Self-Service! Visual Insights at Every Step!
  • 18. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. Use Cases and Impact Industry! Challenge! Impact! Banking! Identify credit scores that were out of range based on zip code (credit scores in affluent areas tend to be higher than in others)! ! Identify loans that have highest risk and better quantify risk exposure (>$13M)! ! Retail! Identify missing product id or inaccurate product descriptions! ! Inventory: Slower turnover of stock! Fulfillment: Out of stock at customers! Logistics: Distribution errors and rework, extra shipping costs (>$1M)! Telco! Identify incorrect subscriber data (e.g. invalid email addresses) that will skew results on usage in particular area! By correlating subscriber data with network performance data, meet existing and forecasted demand, but not excess capacity resulting in inflated capital expenditures. (>$140M)! Telco! Identify incorrect subscriber data (e.g. negative ages) that will skew segments used for churn analysis! Discount and retention campaigns are executed optimally and targeted to the right clusters, avoiding lost revenue!
  • 19. © 2014 Datameer, Inc. All rights reserved. 4.0 Technical Details! Matt McManus! VP, Engineering!
  • 20. © 2014 Datameer, Inc. All rights reserved. Column Metrics Collection! Metric! Supported Column Types! Cardinality*! All! Histogram*! Numeric + Date! Frequency* (Top K)! All! Summary (min/max/mean)! Numeric + Date! Null vs. Present! All! * indicates estimated value!
  • 21. © 2014 Datameer, Inc. All rights reserved. Performance Implications! !   Metrics are calculated using streaming techniques designed to minimize performance impacts! !   Often an estimate is provided to achieve high performance! !   Collection can be disabled on a per job or cluster wide basis!
  • 22. © 2014 Datameer, Inc. All rights reserved. Visual Profiling of Full Results! !   Column statistics available on full results of every worksheet (without leaving workbook)! !   Column statistics fall back to “preview” in certain circumstances! ! Visual cues guide users:!
  • 23. © 2014 Datameer, Inc. All rights reserved. Flip-side with Smart Analytics! !   Visualize model on full results! • Decision trees! • Column dependencies! !   Visually explore cluster composition! •  Compare data shape across clusters ! !   Enhancements to recommendation visualizations!
  • 24. © 2014 Datameer, Inc. All rights reserved. Demo …! Customer Churn!
  • 25. @Datameer!
  • 26. © 2014 Datameer, Inc. All rights reserved. © 2013 Datameer, Inc. All rights reserved. For More Information! #datameer @datameer! !   http://www.datameer.com! !  @datameer! mschumpert@datameer.com! mmcmanus@datameer.com!