Azure Data Lake Store is a hyper-scale repository for big data analytics workloads that allows storing petabytes of data in its native format with unlimited storage. Azure Data Lake Analytics is an on-demand analytics job service that runs massively parallel data processing programs and integrates with Visual Studio, charging only for jobs run. U-SQL is a query language that allows querying multiple Azure data sources and includes cognitive capabilities like image tagging and sentiment analysis.
1. Azure Data Lake Store
&
Azure Data Lake Analytics
Sergio Zenatti Filho,
Associate Director Data & Analytics,
Satalyst
2. Sergio Zenatti Filho
Associate Director Data &Analytics - Satalyst
I am Data and Analytics Director with over 16 years
experience in the delivery of Business Intelligence
and Analytics Solutions. I worked internationally
around Australia, New Zealand and Brazil, in sectors
that include Mining, Oil & Gas, Government,
Healthcare, Financial Services, Telecom, Automotive
and dairy. I enjoy learning new technology and help
people to learn.
Place your
photo here
/sergiozenatti @SergioZenatti zenatti.net
5. What is Data Lake?
Ingest all data
regardless of requirements
Store all data
in native format
without schema
definition
Do analysis
Hadoop, Spark, R,
Azure Data Lake
Analytics (ADLA)
Interactive queries
Batch queries
Machine Learning
Data warehouse
Devices
7. Azure Data Lake (ADL) Store
• A hyper-scale repository for Big Data
analytics workloads;
• Hadoop File System (HDFS) for the cloud;
• Unlimited storage and can host petabyte files;
• Store any data in its native format;
• Enterprise-grade access control and
encryption;
10. Azure Data Lake Analytics
• An on-demand analytics job service in the cloud;
• Run massively parallel data transformation and processing programs
in U-SQL, R, Python, and .NET;
• No infrastructure to manage, you can process data on demand, scale
instantly, and only pay per job;
• Integrates with Visual Studio to develop, debug and tune code faster;
Azure Data Lake Analytics Unit (AU): is a unit of computation made
available to your U-SQL job. Each AU gives your job access to a set of
underlying resources such as CPU and memory.
12. U-SQL
• It’s a framework for Big Data;
• Familiar syntax to millions of
SQL & .NET developers;
• Built on the same distributed
runtime that powers the big
data systems inside
Microsoft;
• Querying multiple Azure Data
Sources (Federated Query);
13. Cognitive Capabilities in U-SQL
• Image Tagging
• Emotion Extraction
• Face Detection
• Optical Character Recognition
• Key Phrases Extraction
• Sentiment Analysis