Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Making a Small Job Out of Big Data With In-Database Connections - Inspire 2017

120 views

Published on

The volume of data is growing exponentially and it's now the norm to store every bit of it. With the innovative in-database tools, Alteryx puts Big Data in the hands of all analysts. Learn how to maximize your existing Big Data infrastructure, when to move data out for processing, and when to use in-database blending. To watch a recording of this session from Inspire 2017, visit www.alteryx.com/inspire-2017-tracks

Published in: Data & Analytics
  • Be the first to comment

  • Be the first to like this

Making a Small Job Out of Big Data With In-Database Connections - Inspire 2017

  1. 1. MAKING A SMALL JOB OUT OF BIG DATAWITH IN- DATABASE CONNECTIONS Presented by Alex Patten, Product Manager- Data Platforms 6/7/17
  2. 2. FORWARD-LOOKING STATEMENTS This presentation includes “forward-looking statements” within the meaning of the Private Securities Litigation Reform Act of 1995. These forward-looking statements may be identified by the use of terminology such as “believe,” “may,” “will,” “intend,” “expect,” “plan,” “anticipate,” “estimate,” “potential,” or “continue,” or other comparable terminology. All statements other than statements of historical fact could be deemed forward-looking, including any projections of product availability, growth and financial metrics and any statements regarding product roadmaps, strategies, plans or use cases. Although Alteryx believes that the expectations reflected in any of these forward-looking statements are reasonable, these expectations or any of the forward-looking statements could prove to be incorrect, and actual results or outcomes could differ materially from those projected or assumed in the forward-looking statements. Alteryx’s future financial condition and results of operations, as well as any forward-looking statements, are subject to risks and uncertainties, including but not limited to the factors set forth in Alteryx’s press releases, public statements and/or filings with the Securities and Exchange Commission, especially the “Risk Factors” sections of Alteryx’s Quarterly Report on Form 10-Q.These documents and others containing important disclosures are available at www.sec.gov or in the “Investors” section of Alteryx’s website at www.alteryx.com. All forward-looking statements are made as of the date of this presentation and Alteryx assumes no obligation to update any such forward-looking statements. Any unreleased services or features referenced in this or other presentations, press releases or public statements are only intended to outline Alteryx’s general product direction. They are intended for information purposes only, and may not be incorporated into any contract. This is not a commitment to deliver any material, code, or functionality (which may not be released on time or at all) and customers should not rely upon this presentation or any such statements to make purchasing decisions. The development, release, and timing of any features or functionality described for Alteryx’s products remains at the sole discretion of Alteryx.
  3. 3. AGENDA • In-DB Overview • In-DB Benefits • In-DBTips andTricks • In-DB Roadmap & Strategy
  4. 4. IN-DB OVERVIEW 4
  5. 5. WHY IN-DATABASE? • Data is Growing • More data has been created in the past two years than in the entire previous history of the human race • By 2020, 1.7 megabytes of new information will be created every second for every human being on the planet • By 2020, digital universe of data will contain 44 zettabytes, up from 4.4 zettabytes today 5 Scenario: This is a lot of data to move if you need to clean, blend, prep, or analyze it Solution: With In-DB, you don’t have to! Source: http://www.forbes.com/sites/bernardmarr/2015/ 09/30/big-data-20-mind-boggling-facts- everyone-must-read/#35f9a3786c1d
  6. 6. WHY IN-DATABASE? • Data Storage is Changing • By 2020, at least a third of all data will pass through the cloud • The Hadoop market is forecast to grow at a compound annual growth of 58%, surpassing $1 billion by 2020 • Every day, Google uses distributed computing to involve about 1,000 computers in answering a single search query.This takes no more than .2 seconds to compute. 6 Source: http://www.forbes.com/sites/bernardmarr/2015/ 09/30/big-data-20-mind-boggling-facts- everyone-must-read/#35f9a3786c1d Scenario: Infrastructure is costly and customized Solution: In-DB leverages this infrastructure
  7. 7. WHY IN-DATABASE? • Access to Data is Key toValuable Insights • Currently, less than 0.5% of all data is ever analyzed and used • For a typical Fortune 1000 company, a 10% increase in data accessibility will increase net income by $65 million • Retailers who leverage the full power of big data could increase their operating margins by 60% 7 Source: http://www.forbes.com/sites/bernardmarr/2015/ 09/30/big-data-20-mind-boggling-facts- everyone-must-read/#35f9a3786c1d Scenario: Data is valuable, but hard to access Solution: In-DB tools make this data accessible to anyone, no coding required!
  8. 8. IN-DB BENEFITS
  9. 9. IN- DB BENEFITS • SPEED
  10. 10. IN- DB BENEFITS • Speed • Flexibility • HybridWorkflows
  11. 11. IN- DB BENEFITS
  12. 12. Requirements: • Gather, clean, and blend daily purchase activity across ecommerce sites trafficked by millions of consumers • Use and create massive datasets stored in Redshift SPEED 13
  13. 13. Requirements: • Blend data from local file with tables from Redshift • Provide recommendations for which markets to target based on that data FLEXIBILITY 14
  14. 14. Business Need: • Make predictions based on gathered data. • Create reports and datasets that can be accessed by all users. HYBRIDWORKFLOWS 15
  15. 15. IN-DB DATA SOURCES
  16. 16. NEW IN-DB DATA SOURCES 1
  17. 17. IN-DB DATA SOURCES
  18. 18. PREDICTIVE IN-DB CAPABILITIES
  19. 19. TIPS &TRICKS
  20. 20. *THINK ABOUT USING IN-DB IF… • You’re working with large datasets • Data source is sitting on top of HDFS (Hive, Impala, Spark) • Data source is in the cloud (Azure and Redshift) • Data source is not on a local server • Data source is supported in Alteryx • Any part of your workflow is working with a reduced dataset
  21. 21. USETHE BROWSE IN-DB AND DATA STREAM OUT TOOLSWISELY 22
  22. 22. REDUCEYOUR DATASET AS QUICKLY AS POSSIBLE 23
  23. 23. TRICK! 24 How toView a Query
  24. 24. ANOTHERTRICK! 25 Using In-DB PredictiveTools
  25. 25. IN-DB ROADMAP & STRATEGY
  26. 26. NEW DATA SOURCES 27 • Vertica • EXASOL • Greenplum/Postgres • Snowflake
  27. 27. IN-DB INITIATIVES • Extending In-DBToolset and Functionality • Simplify Connection Configuration
  28. 28. KEYTAKEAWAYS • In-DB = faster workflows • Continued expansion of In-DB capabilities • In-DB Predictive model support
  29. 29. ADDITIONAL RESOURCES 30 • Alteryx Community - Ideas and Solutions (community.alteryx.com) • Alteryx Help - Complete Help Documentation (help.alteryx.com) • Alteryx Downloads - Latest Release and release notes (downloads.alteryx.com) • Alteryx - FreeTrial Download (alteryx.com)
  30. 30. SESSION SURVEY –WEWANTYOUR FEEDBACK! 4 1. Open the mobile app 2. Go to your schedule 3. Find the session 4. Take Survey!
  31. 31. THANK YOU Please complete a feedback survey 720.259.0566 | apatten@alteryx.com Alex Patten

×