This document presents an approach to building a virtual assistant ecosystem that uses analytics and machine learning to optimize workflows. It describes how intent recognition and natural language processing enable dynamic access to insights derived from device data, and it concludes with a demo of a chatbot interface that understands user requests and fulfills them through predictive analytics services.
2. Rafael Zotto, HP
Franco Vieira, HP
A Virtual Assistant Ecosystem for Workflow and Workplace Optimization
#UnifiedAnalytics #SparkAISummit
3. //ABOUT US
//RAFAEL ZOTTO
Holds a master's degree in Computer Science focused on high-performance computing. Specialized in parallel and distributed computing, with a particular interest in cloud and serverless computing. Has worked at HP Inc. for more than a decade as a software engineer on print firmware and wearable technologies. Currently works in the data science research team as a software engineer and solutions architect, with most of his work related to applied AI and conversational interfaces.
//FRANCO VIEIRA
A professional with broad experience in information technology, working on software development, cloud computing, and machine learning projects. For the past five years, Franco has been developing new services focused on activity recognition, content sharing, and device health. He currently works in HP Personal Systems Software, building machine learning solutions on the edge. Franco holds a degree in computer science.
11. //CHATBOT PLATFORM ANATOMY
[Diagram: an INTENT is COMPOSED BY UTTERANCES, MIGHT HAVE SLOTS, and is resolved by FULFILLMENT.]
//EXAMPLE
_“I need to travel from San Francisco to Philadelphia next weekend”
SLOT 1: Origin SLOT 2: Destination SLOT 3: Date
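This anatomy maps directly onto the intent schemas used by commercial chatbot platforms. Below is a minimal sketch, assuming an Amazon Lex-style schema (the deck does not name a platform); the intent name, slot types, and fulfillment reference are illustrative only:

```python
# Hypothetical intent definition for the travel example above: an intent is
# COMPOSED BY sample utterances, MIGHT HAVE slots, and points at fulfillment
# code. Every name here is illustrative, not taken from the presentation.
book_travel_intent = {
    "name": "BookTravel",
    "utterances": [
        "I need to travel from {Origin} to {Destination} {Date}",
        "book me a trip from {Origin} to {Destination}",
    ],
    "slots": [
        {"name": "Origin",      "type": "AMAZON.City", "required": True},
        {"name": "Destination", "type": "AMAZON.City", "required": True},
        {"name": "Date",        "type": "AMAZON.Date", "required": True},
    ],
    # Fulfillment runs once every required slot has been filled.
    "fulfillment": {"type": "lambda", "function": "fulfillBookTravel"},
}
```

For the slide's utterance, the recognizer would fill Origin = San Francisco, Destination = Philadelphia, and Date = next weekend before invoking fulfillment.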
12. //CHATBOT AS PART OF A SOLUTION
//Our INSIGHTS and PREDICTIONS refer to a DOMAIN
//Our DOMAINS can be mapped to INTENTS
_Battery
_Thermal
_CPU
_…
//INTENTS can be fulfilled with INSIGHTS and PREDICTIONS
_MULTI-CONTEXT dialog management
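The presentation does not show how its multi-context dialog management is implemented; below is a minimal sketch of one way it could work, keeping a separate slot context per domain so a user can switch between battery, thermal, and CPU questions without losing state:

```python
# Illustrative multi-context session state (hypothetical names throughout).
from collections import defaultdict

DOMAIN_OF_INTENT = {
    "GetBatteryHealth":   "battery",
    "PredictBatteryWear": "battery",
    "GetThermalProfile":  "thermal",
    "GetCpuUtilization":  "cpu",
}

class Session:
    def __init__(self):
        # One slot context per domain, so contexts do not clobber each other.
        self.contexts = defaultdict(dict)

    def handle(self, intent, slots):
        domain = DOMAIN_OF_INTENT[intent]
        self.contexts[domain].update(slots)  # merge the newly filled slots
        return self.contexts[domain]         # context handed to fulfillment
```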
13. //HIGH-LEVEL OVERVIEW
[Architecture diagram: channels (WEB PORTAL, SLACK, CALL CENTER, MOBILE APP, DESKTOP APP) feed INTENT RECOGNITION; a LAMBDA FUNCTION handles FULFILLMENT, using GRAPHQL to QUERY the ANALYTICS SERVICES, which SAVE TO RDS.]
_DYNAMIC response created
_CONTEXT is created for the SESSION
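A minimal sketch of the fulfillment path in the diagram, assuming a Python AWS Lambda handler in front of a GraphQL endpoint for the analytics services. The endpoint URL, query shape, and field names are hypothetical; the slide names the components but not their schemas:

```python
# Hypothetical Lambda fulfillment: resolve the recognized intent by querying
# the analytics services over GraphQL; those services persist results to RDS.
import requests  # assumed to be packaged with the Lambda deployment

GRAPHQL_ENDPOINT = "https://analytics.example.com/graphql"  # hypothetical

def lambda_handler(event, context):
    intent = event["intent"]        # e.g. "GetBatteryHealth"
    slots = event.get("slots", {})  # e.g. {"fleetId": "...", "date": "..."}

    query = """
      query Insight($intent: String!, $slots: JSON) {
        insight(intent: $intent, slots: $slots) { summary }
      }
    """
    resp = requests.post(
        GRAPHQL_ENDPOINT,
        json={"query": query, "variables": {"intent": intent, "slots": slots}},
        timeout=10,
    )
    summary = resp.json()["data"]["insight"]["summary"]

    # A DYNAMIC response is created, and CONTEXT is kept for the SESSION.
    return {"response": summary, "sessionContext": {"lastIntent": intent}}
```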
14. //EASY access to our GOLD DATA
//UNDERSTAND user needs.
//DEMO
15. //CREATING A PLATFORM
[Architecture diagram, as on slide 13: channels feed INTENT RECOGNITION; the LAMBDA FUNCTION handles FULFILLMENT, using GRAPHQL to QUERY the ANALYTICS SERVICES backed by RDS.]
_Support for MULTIPLE INTENTS
_Gathering data from MULTIPLE SOURCES
16. //LEARN FROM MISSED UTTERANCES
//UNDERSTAND what we still DON’T KNOW
[Pipeline diagram: MISSED UTTERANCES supply the INPUT TEXT; ENTITIES and KEY PHRASES are extracted and pass through FILTERING to yield candidate SLOTS and the KNOWLEDGE NEEDED, which feed INSIGHT DISCOVERY and INTENT RECOGNITION.]
//EXAMPLE
_“What was the health grade of my device fleet yesterday”
KNOWLEDGE: 0.99+ KNOWLEDGE: 0.99+ DATE: 0.98
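The ENTITIES and KEY PHRASES stages suggest a managed text-analysis service. A minimal sketch, assuming Amazon Comprehend (an assumption; the slide does not name the service) and a confidence threshold in line with the scores shown above:

```python
# Hypothetical missed-utterance analysis using Amazon Comprehend via boto3.
import boto3

comprehend = boto3.client("comprehend")

def analyze_missed_utterance(text, threshold=0.9):
    """Extract candidate SLOTS and KNOWLEDGE NEEDED from unrecognized input."""
    phrases = comprehend.detect_key_phrases(Text=text, LanguageCode="en")
    entities = comprehend.detect_entities(Text=text, LanguageCode="en")

    # FILTERING: keep only high-confidence key phrases and entities.
    knowledge_needed = [
        p["Text"] for p in phrases["KeyPhrases"] if p["Score"] >= threshold
    ]
    candidate_slots = [
        (e["Type"], e["Text"]) for e in entities["Entities"] if e["Score"] >= threshold
    ]
    # INSIGHT DISCOVERY and INTENT RECOGNITION would consume these outputs
    # to decide what the platform still does not know how to answer.
    return {"knowledge_needed": knowledge_needed, "candidate_slots": candidate_slots}
```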
17. //ONGOING WORK
_DEVICES are THINGS connected to our stack.
_We have INSIGHTS and PREDICTIONS ready to be used.
_Why not DELIVER them to the interested party?
//IMMEDIATE delivery
_As soon as detected, NOTIFICATION is delivered to the USER.
//SCHEDULED delivery
_Held for delivery as a FUTURE or RECURRENT NOTIFICATION.
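An illustrative dispatcher for the two delivery modes (a sketch, not the authors' implementation): immediate notifications go straight to the user, while scheduled ones are held in a time-ordered queue for future or recurrent delivery:

```python
import heapq, itertools, time

class NotificationDispatcher:
    """Sketch of IMMEDIATE vs. SCHEDULED insight delivery (illustrative)."""

    def __init__(self, send):
        self.send = send               # callable that delivers to the user
        self._queue = []               # min-heap of (due_time, seq, msg, every)
        self._seq = itertools.count()  # tie-breaker for equal due times

    def immediate(self, message):
        # IMMEDIATE delivery: notify as soon as the insight is detected.
        self.send(message)

    def schedule(self, message, due_time, every=None):
        # SCHEDULED delivery: hold for a FUTURE or RECURRENT notification.
        heapq.heappush(self._queue, (due_time, next(self._seq), message, every))

    def tick(self):
        # Deliver everything that has come due; re-enqueue recurrent items.
        now = time.time()
        while self._queue and self._queue[0][0] <= now:
            due, _, message, every = heapq.heappop(self._queue)
            self.send(message)
            if every is not None:
                self.schedule(message, due + every, every)
```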
18. //DELIVERING INSIGHTS
[Diagram: ANALYTICS SERVICES PRODUCE EVENTS, which are written to an EVENT SINK (SIMPLE QUEUE, MQTT); PROCESSING of the sink TRIGGERS NOTIFICATIONS and INITIATES ACTIONS, DELIVERED AS an EXTENSIBLE list of PLUGINS that ACT to address an issue or WATCH the system.]
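A minimal consumer for the event sink above, assuming MQTT as the transport (the slide lists both a SIMPLE QUEUE and MQTT) and the paho-mqtt client library; the broker address, topic, and payload fields are hypothetical:

```python
import json
import paho.mqtt.client as mqtt  # assumed client library for the MQTT sink

def deliver_notification(event):
    # Placeholder for a notification plugin (the real list is EXTENSIBLE).
    print("notify:", event)

def trigger_action(event):
    # Placeholder for an action plugin that ACTs to address an issue.
    print("act:", event)

def on_message(client, userdata, msg):
    # PROCESSING: each event pulled from the sink either initiates an action
    # or is delivered as a notification. The severity field is hypothetical.
    event = json.loads(msg.payload)
    if event.get("severity") == "critical":
        trigger_action(event)
    else:
        deliver_notification(event)

client = mqtt.Client()
client.on_message = on_message
client.connect("broker.example.com", 1883)  # hypothetical broker address
client.subscribe("devices/+/events")        # hypothetical topic hierarchy
client.loop_forever()
```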