Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Making a Small Job Out of Big Data With In-Database Connections - Inspire Europe 2017

331 views

Published on

The volume of data is growing exponentially and it's now the norm to store every bit of it. With the innovative in-database tools, Alteryx puts Big Data in the hands of all analysts. Learn how to maximize your existing Big Data infrastructure, when to move data out for processing, and when to use in-database blending.

To watch a recording of this session from Inspire 2017, visit https://www.alteryx.com/inspire-europe-2017-tracks

Published in: Data & Analytics
  • Be the first to comment

  • Be the first to like this

Making a Small Job Out of Big Data With In-Database Connections - Inspire Europe 2017

  1. 1. MAKING A SMALL JOB OUT OF BIG DATAWITH IN- DATABASE CONNECTIONS Presented by Alex Patten, Product Manager- Data Platforms
  2. 2. FORWARD-LOOKING STATEMENTS This presentation includes “forward-looking statements” within the meaning of the Private Securities Litigation Reform Act of 1995. These forward-looking statements may be identified by the use of terminology such as “believe,” “may,” “will,” “intend,” “expect,” “plan,” “anticipate,” “estimate,” “potential,” or “continue,” or other comparable terminology. All statements other than statements of historical fact could be deemed forward-looking, including any projections of product availability, growth and financial metrics and any statements regarding product roadmaps, strategies, plans or use cases. Although Alteryx believes that the expectations reflected in any of these forward-looking statements are reasonable, these expectations or any of the forward-looking statements could prove to be incorrect, and actual results or outcomes could differ materially from those projected or assumed in the forward-looking statements. Alteryx’s future financial condition and results of operations, as well as any forward-looking statements, are subject to risks and uncertainties, including but not limited to the factors set forth in Alteryx’s press releases, public statements and/or filings with the Securities and Exchange Commission, especially the “Risk Factors” sections of Alteryx’s Quarterly Report on Form 10-Q.These documents and others containing important disclosures are available at www.sec.gov or in the “Investors” section of Alteryx’s website at www.alteryx.com. All forward-looking statements are made as of the date of this presentation and Alteryx assumes no obligation to update any such forward-looking statements. Any unreleased services or features referenced in this or other presentations, press releases or public statements are only intended to outline Alteryx’s general product direction. They are intended for information purposes only, and may not be incorporated into any contract. This is not a commitment to deliver any material, code, or functionality (which may not be released on time or at all) and customers should not rely upon this presentation or any such statements to make purchasing decisions. The development, release, and timing of any features or functionality described for Alteryx’s products remains at the sole discretion of Alteryx.
  3. 3. AGENDA • In-DB Overview • In-DB Benefits • In-DBTips andTricks • In-DB Roadmap &Vision • In-DB Demo
  4. 4. PRESENTER
  5. 5. To watch a recording of this session from Inspire Europe 2017, visit alteryx.com/inspire-europe-2017-tracks
  6. 6. IN-DB OVERVIEW
  7. 7. WHY IN-DATABASE? • Data is Growing • More data has been created in the past two years than in the entire previous history of the human race • By 2020, 1.7 megabytes of new information will be created every second for every human being on the planet • By 2020, digital universe of data will contain 44 zettabytes, up from 4.4 zettabytes today Scenario: This is a lot of data to move if you need to clean, blend, prep, or analyze it Solution: With In-DB, you don’t have to! Source: http://www.forbes.com/sites/bernardmarr/2015/ 09/30/big-data-20-mind-boggling-facts- everyone-must-read/#35f9a3786c1d
  8. 8. WHY IN-DATABASE? • Data Storage is Changing • By 2020, at least a third of all data will pass through the cloud • The Hadoop market is forecast to grow at a compound annual growth of 58%, surpassing $1 billion by 2020 • Every day, Google uses distributed computing to involve about 1,000 computers in answering a single search query.This takes no more than .2 seconds to compute. Source: http://www.forbes.com/sites/bernardmarr/2015/ 09/30/big-data-20-mind-boggling-facts- everyone-must-read/#35f9a3786c1d Scenario: Infrastructure is costly and customized Solution: In-DB leverages this infrastructure
  9. 9. WHY IN-DATABASE? • Access to Data is Key toValuable Insights • Currently, less than 0.5% of all data is ever analyzed and used • For a typical Fortune 1000 company, a 10% increase in data accessibility will increase net income by $65 million • Retailers who leverage the full power of big data could increase their operating margins by 60% Source: http://www.forbes.com/sites/bernardmarr/2015/ 09/30/big-data-20-mind-boggling-facts- everyone-must-read/#35f9a3786c1d Scenario: Data is valuable, but hard to access Solution: In-DB tools make this data accessible to anyone, no coding required!
  10. 10. IN-DB BENEFITS
  11. 11. PERFORMANCE GAINS • Workflow using 5 gigabyte dataset Transfer data into Alteryx 2-4 minutes Transfer data back to data source 3-5 minutesStandardWorkflow: Transfer data into Alteryx 2-4 minutes Transfer data back to data source 3-5 minutesIn-DBWorkflow: Total Runtime: ~8 min Total Runtime: ~30 sec
  12. 12. IN- DB BENEFITS • SPEED
  13. 13. IN- DB BENEFITS • Speed • Flexibility • HybridWorkflows
  14. 14. IN- DB BENEFITS
  15. 15. Requirements: • Use and create massive datasets stored in Redshift SPEED
  16. 16. Requirements: • Blend data from local file with tables from Redshift FLEXIBILITY 17
  17. 17. Business Need: • Create reports and datasets that can be accessed by all users. HYBRIDWORKFLOWS
  18. 18. IN-DB DATA SOURCES
  19. 19. NEW IN-DB DATA SOURCES
  20. 20. IN-DB DATA SOURCES
  21. 21. PREDICTIVE IN-DB CAPABILITIES
  22. 22. TIPS &TRICKS
  23. 23. *THINK ABOUT USING IN-DB IF… • You’re working with large datasets • Data source is sitting on top of HDFS (Hive, Impala, Spark) • Data source is in the cloud (Azure, Redshift, Snowflake) • Data source is not on a local server • Data source is supported in Alteryx • Any part of your workflow is working with a reduced dataset
  24. 24. USETHE BROWSE IN-DB AND DATA STREAM OUT TOOLSWISELY
  25. 25. REDUCEYOUR DATASET AS QUICKLY AS POSSIBLE
  26. 26. TRICK! How toView a Query
  27. 27. USING IN-DB PREDICTIVETOOLS
  28. 28. FASTERWRITETO HIVE AND IMPALA
  29. 29. LASTTRICK! USINGTHE SORT FUNCTIONALITY
  30. 30. IN-DB ROADMAP & STRATEGY
  31. 31. IN-DB STRATEGY • Make In-DB More Accessible • Make In-DB experience feel the same as the standard experience • Build out In-DB sources, functionality, and tools • Make In-DB “smarter”
  32. 32. IN-DB ROADMAP • Extending In-DBToolset • Simplify Connection Configuration
  33. 33. KEYTAKEAWAYS • In-DB = faster workflows • Continued expansion of In-DB capabilities • In-DB Predictive model support
  34. 34. ADDITIONAL RESOURCES • Alteryx Community - Ideas and Solutions (community.alteryx.com) • Alteryx Help - Complete Help Documentation (help.alteryx.com) • Alteryx Downloads - Latest Release and Release Notes (downloads.alteryx.com) • Alteryx - FreeTrial Download (alteryx.com)
  35. 35. DEMO – GETTING STARTEDWITH IN-DB
  36. 36. FORWARD-LOOKING STATEMENTS This presentation includes “forward-looking statements” within the meaning of the Private Securities Litigation Reform Act of 1995. These forward-looking statements may be identified by the use of terminology such as “believe,” “may,” “will,” “intend,” “expect,” “plan,” “anticipate,” “estimate,” “potential,” or “continue,” or other comparable terminology. All statements other than statements of historical fact could be deemed forward-looking, including any projections of product availability, growth and financial metrics and any statements regarding product roadmaps, strategies, plans or use cases. Although Alteryx believes that the expectations reflected in any of these forward-looking statements are reasonable, these expectations or any of the forward-looking statements could prove to be incorrect, and actual results or outcomes could differ materially from those projected or assumed in the forward-looking statements. Alteryx’s future financial condition and results of operations, as well as any forward-looking statements, are subject to risks and uncertainties, including but not limited to the factors set forth in Alteryx’s press releases, public statements and/or filings with the Securities and Exchange Commission, especially the “Risk Factors” sections of Alteryx’s Quarterly Report on Form 10-Q.These documents and others containing important disclosures are available at www.sec.gov or in the “Investors” section of Alteryx’s website at www.alteryx.com. All forward-looking statements are made as of the date of this presentation and Alteryx assumes no obligation to update any such forward-looking statements. Any unreleased services or features referenced in this or other presentations, press releases or public statements are only intended to outline Alteryx’s general product direction. They are intended for information purposes only, and may not be incorporated into any contract. This is not a commitment to deliver any material, code, or functionality (which may not be released on time or at all) and customers should not rely upon this presentation or any such statements to make purchasing decisions. The development, release, and timing of any features or functionality described for Alteryx’s products remains at the sole discretion of Alteryx.
  37. 37. THANK YOU Please complete a feedback survey 720-259-0566 | apatten@alteryx.com Alex Patten
  38. 38. #inspire16# alteryx.com/trial Ready to bring these incredible and tangible benefits to your organization? Download a FREETrial of Alteryx and start making your data work for you, instead of you working for your data

×