Quoc Le at AI Frontiers : Automated Machine Learning

•Download as PPTX, PDF•

1 like•1,307 views

Traditional machine learning systems are hand-designed and tuned by machine learning experts. To scale up the impact of machine learning to many real-world applications, we must figure out a way to automate the designing process of these pipelines. In this talk, I will discuss the use of machine learning to automate the process of designing neural architectures and data augmentation strategies (Neural Architecture Search and AutoAugment).

Technology

AutoML - Automated Machine Learning
Quoc Le
@quocleix
Joint work with many colleagues in the Google Brain team

Current:
Solution =
ML Expertise + Data + Computation

Current:
Solution =
ML Expertise + Data + Computation
But can we turn this into:
Solution =
Data + 100X Computation

Data processing
Machine
Learning Model
Data

Importance of architectures for Vision
● Designing neural network architectures is hard
● Lots of human efforts go into tuning them
● Can we try and learn good architectures automatically?
Canziani et al, 2017

Convolutional Architectures
Krizhevsky et al, 2012

Neural Architecture Search
● We can specify the structure and connectivity of a neural network by a
configuration string
○ [“Filter Width: 5”, “Filter Height: 3”, “Num Filters: 24”]
● Use a RNN (“Controller”) to generate this string (the “Child Network”)
● Train the Child Network to see how well it performs on a validation set
● Use reinforcement learning to update the Controller based on the accuracy of
the Child Network

Controller: proposes Child Networks Train & evaluate Child Networks
20K
times
Iterate to
find the
most
accurate
Child
Network

Controller: proposes Child Networks Train & evaluate Child Networks
20K
times
Iterate to
find the
most
accurate
Child
Network
Reinforcement Learning
or Evolution Search

Neural Architecture Search for Convolutional Networks
Controller RNN
Softmax classifier
Embedding

Confidential + Proprietary
Strange hump
Basically
linear

Confidential + Proprietary
Mobile NASNet-A on ImageNet

Data processing
Machine
Learning Model
Data
Focus of machine
learning research

Data processing
Machine
Learning Model
Data
Focus of machine
learning research
Very important but
manually tuned

AutoAugment: Example Policy
Probability of applying
Magnitude

CIFAR-10
State-of-art: 2.1% error
AutoAugment: 1.5% error
ImageNet
State-of-art: 3.9% error
AutoAugment: 3.5% error

ImageNet
AutoML
Top-1Accuracy
ML Experts

References
● Neural Architecture Search with Reinforcement Learning. Barret Zoph and Quoc
V. Le. ICLR, 2017
● Learning Transferable Architectures for Large Scale Image Recognition. Barret
Zoph, Vijay Vasudevan, Jonathon Shlens, Quoc V. Le. CVPR, 2018
● AutoAugment: Learning Augmentation Policies from Data. Ekin D. Cubuk, Barret
Zoph, Dandelion Mane, Vijay Vasudevan, Quoc V. Le. Arxiv, 2018
● Searching for Activation Functions. Prajit Ramachandran, Barret Zoph, Quoc Le.
ICLR Workshop, 2018

Topic: How to use big data to enhance AI Outline: 1. Spark ETL Spark SQL Spark Streaming 2. Spark ML Spark ML pipeline Distributed model tuning Spark ML model and data lineage management 3. Spark XGboost XGboost introduction XGboost with Spark XGboost with GPU 4. Spark Deep Learning pipeline Transfer learning Build Spark ML pipeline with TensorFlow Model selection on distributed TF model

Jay Yagnik at AI Frontiers : A History Lesson on AI

AI Frontiers

We have reached a remarkable point in history with the evolution of AI, from applying this technology to incredible use cases in healthcare, to addressing the world's biggest humanitarian and environmental issues. Our ability to learn task-specific functions for vision, language, sequence and control tasks is getting better at a rapid pace. This talk will survey some of the current advances in AI, compare AI to other fields that have historically developed over time, and calibrate where we are in the relative advancement timeline. We will also speculate about the next inflection points and capabilities that AI can offer down the road, and look at how those might intersect with other emergent fields, e.g. Quantum computing.

Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016

MLconf

Building a Machine Learning Platform at Quora: Each month, over 100 million people use Quora to share and grow their knowledge. Machine learning has played a critical role in enabling us to grow to this scale, with applications ranging from understanding content quality to identifying users’ interests and expertise. By investing in a reusable, extensible machine learning platform, our small team of ML engineers has been able to productionize dozens of different models and algorithms that power many features across Quora. In this talk, I’ll discuss the core ideas behind our ML platform, as well as some of the specific systems, tools, and abstractions that have enabled us to scale our approach to machine learning.

Anomaly Detection using Deep Auto-Encoders | Gianmario Spacagna

Data Science Milan

One of the determinants for a good anomaly detector is finding smart data representations that can easily evince deviations from the normal distribution. Traditional supervised approaches would require a strong assumption about what is normal and what not plus a non negligible effort in labeling the training dataset. Deep auto-encoders work very well in learning high-level abstractions and non-linear relationships of the data without requiring data labels. In this talk we will review a few popular techniques used in shallow machine learning and propose two semi-supervised approaches for novelty detection: one based on reconstruction error and another based on lower-dimensional feature compression.

Think Big | Enterprise Artificial Intelligence

Data Science Milan

TensorFlow London: Cutting edge generative models

Seldon

Speaker: Pierre Richemond, Data Science Institute of Imperial College Title: Cutting edge generative models: Applications and implications Abstract: This talk will examine recent developments in deep learning content generation at scale. Whether it be images or text, the latest methods have now reached a level of quality making it hard to discriminate between human- and AI-generated content. We will review recent examples of such generative models, and put their significance in a broader context, in light of such powerful tools’ potential for dual use. Bio: Pierre is currently researching his PhD in deep reinforcement learning at the Data Science Institute of Imperial College. He also teaches Deep Learning at the Graduate School, and helps to run the Deep Learning Network and organises thematic reading groups. His background is in mathematics - he has studied electrical engineering at ENST, probability theory and stochastic processes at Universite Paris VI - Ecole Polytechnique, and business management at HEC.

Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016

MLconf

DL4J and DataVec for Enterprise Deep Learning Workflows: Applications in NLP, sensor processing (IoT), image processing, and audio processing have all emerged as prime deep learning applications. In this session we will take a look at a practical review of building practical and secure Deep Learning workflows in the enterprise. We’ll see how DL4J’s DataVec tool enables scalable ETL and vectorization pipelines to be created for a single machine or scale out to Spark on Hadoop. We’ll also see how Deep Networks such as Recurrent Neural Networks are able to leverage DataVec to more quickly process data for modeling.

MLConf 2016 SigOpt Talk by Scott Clark

SigOpt

As an e-commerce company leading in fashion and lifestyle in the Netherlands, Wehkamp dedicates itself to provide a better shopping experience for customers. Using Spark, the data science team is able to develop various machine-learning projects that improve the shopping experience. One of the applications is to create a service for retrieving visually similar products, which can then be used to show substitutional products, to build visual recommenders and to improve the overall recommendation system. In this project, Spark is used throughout the entire pipeline: retrieving and processing the image data, training model distributedly with Tensorflow, extracting image features, and computing similarity. In this talk, we are going to demonstrate how Spark and the Databricks enable a small team to unify data and AI workflows, develop a pipeline for visual similarity and train dedicated neural network models.

Driverless AI - Arno Candel, H2O.ai

Sri Ambati

This talk was given at H2O World 2018 NYC and can be viewed here: https://youtu.be/oxLZZMR1lVY Description: Driverless AI is H2O.ai's latest flagship product for automatic machine learning. It fully automates some of the most challenging and productive tasks in applied data science such as feature engineering, model tuning, model ensembling and model deployment. Driverless AI turns Kaggle-winning grandmaster recipes into production-ready code, and is specifically designed to avoid common mistakes such as under- or overfitting, data leakage or improper model validation, some of the hardest challenges in data science. Avoiding these pitfalls alone can save weeks or more for each model, and is necessary to achieve high modeling accuracy, especially for time-series problems. With Driverless AI, data scientists of all proficiency levels can train and deploy modeling pipelines with just a few clicks from the GUI. Advanced users can use the client API from Python. Driverless AI builds hundreds or thousands of models under the hood to select the best feature engineering and modeling pipeline for every specific problem such as churn prediction, fraud detection, real-estate pricing, store sales prediction, marketing ad campaigns and many more. To speed up training, Driverless AI uses highly optimized C++/CUDA algorithms to take full advantage of the latest compute hardware. For example, Driverless AI runs orders of magnitudes faster on the latest Nvidia GPU supercomputers on Intel and IBM platforms, both in the cloud or on premise. Driverless AI is fully supported on all major cloud providers. There are two more product innovations in Driverless AI: statistically rigorous automatic data visualization and machine learning interpretability with reason codes and explanations in plain English. Both help data scientists and analysts to quickly validate the data and the models. In this talk, we explain how Driverless AI works and show how easy it is to reach top 5% rankings for several highly competitive Kaggle competitions. (edited) Speaker's Bio: Arno Candel is the Chief Technology Officer at H2O.ai. He is the main committer of H2O-3 and Driverless AI and has been designing and implementing high-performance machine-learning algorithms since 2012. Previously, he spent a decade in supercomputing at ETH and SLAC and collaborated with CERN on next-generation particle accelerators. Arno holds a PhD and Masters summa cum laude in Physics from ETH Zurich, Switzerland. He was named “2014 Big Data All-Star” by Fortune Magazine and featured by ETH GLOBE in 2015. Follow him on Twitter: @ArnoCandel.

Video Analytics on Hadoop webinar victor fang-201309

DrVictorFang

Introduction to Generative Adversarial Networks (GAN) with Apache MXNet

Amazon Web Services

GANs are a type of deep neural network that allow us to generate data. In this webinar, we’ll take a look at the concept and theory behind GANs, which can be used to train neural nets with data that is generated by the network. We’ll explore the GAN framework along with its components -- generator and discriminator networks. We’ll then learn how to use Apache MXNet on AWS using the popular MNIST dataset, which contains images of handwritten numbers. In the end, we’ll create a GAN model that is able to generate similar images of handwritten numbers from our test dataset.

Python for Data Science with Anaconda

Travis Oliphant

Webinar - Analyzing Video

Turi, Inc.

Hussein Mehanna, Engineering Director, ML Core - Facebook at MLconf ATL 2016

MLconf

Applying Deep Learning at Facebook Scale: Facebook leverages Deep Learning for various applications including event prediction, machine translation, natural language understanding and computer vision at a very large scale. There are more than a billion users logging on to Facebook every daily generating thousands of posts per second and uploading more than a billion images and videos every day. This talk will explain how Facebook scaled Deep Learning inference for realtime applications with latency budgets in the milliseconds.

Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...

Sri Ambati

This talk was given at H2O World 2018 NYC and can be viewed here: https://youtu.be/xc3j20Om3UM Description: Data science is indeed one of the sexy jobs of the 21st century. But it is also a lot of hard work. And the hard work is seldom about the math or the algorithms. It is about building relevant machine learning products for the real world. We will go over some of the must-haves as you take your machine learning model out of the sandbox and make it work in the big, bad world outside. Speaker's Bio: Krish Swamy is an experienced professional with deep skills in applying analytics and BigData capabilities to challenging business problems and driving customer insights. Krish's analytic experience includes marketing and pricing, credit risk, digital analytics and most recently, big data analytics and data transformation. His key experiences lie in banking and financial services, the digital customer experience domain, with a background in management consulting. Other key skills include influencing organizational change towards a data and analytics driven culture, and building teams of analysts, statisticians and data scientists.

Image Classification Done Simply using Keras and TensorFlow

Rajiv Shah

This presentation walks through the process of building an image classifier using Keras with a TensorFlow backend. It will give a basic understanding of image classification and show the techniques used in industry to build image classifiers. The presentation will start with building a simple convolutional network, augmenting the data, using a pretrained network, and finally using transfer learning by modifying the last few layers of a pretrained network. The classification will be based on the classic example of classifying cats and dogs. The code for the presentation can be found at https://github.com/rajshah4/image_keras, and the presentation will discuss how to extend the code to your own pictures to make a custom image classifier.

Predicting Medical Test Results using Driverless AI

Sri Ambati

This talk was given at H2O World 2018 NYC and can be viewed here: https://youtu.be/n9g9GxIJoT4 Description: The goal of the research was to develop an approach to predict individual medical test results based on longitudinal medical and pharma claims data without direct lab measures using data-driven techniques. Such discoveries may result in improved treatment strategies. In the presentation we demonstrate how Driverless AI was used both for estimating highly accurate model and results explanations. Speaker's Bio: Alexander is the Data Science leader at poder.IO. He is responsible for data flow architecture and insight mining, all powered by machine learning. Before joining poder.IO, Alexander made an academic career at Belarusian State University and Minsk Innovation University where he was Head of the Informatics and Mathematics Department.

SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...

Impetus Technologies

SPARK SUMMIT SESSION - A majority of the electricity in the U.S. is traded in independent system operator (ISO) based wholesale markets. ISO-based markets typically function in a two-step settlement process with day-ahead (DA) financial settlements followed by physical real-time (spot) market settlements for electricity. In this work, we focus on obtaining equilibrium bidding strategies for electricity generators in DA markets. Electricity prices in DA markets are determined by the ISO, which matches competing supply offers from power generators with demand bids from load serving entities. Since there are multiple generators competing with one another to supply power, this can be modeled as a competitive Markov decision problem, which we solve using a reinforcement learning approach. For power networks of realistic sizes, the state-action space could explode, making the RL procedure computationally intensive. This has motivated us to solve the above problem over Spark. The talk provides the following takeaways: 1. Modeling the day-ahead market as a Markov decision process 2. Code sketches to show the markov decision process solution over Spark and Mahout over Apache Tez 3. Performance results comparing Mahout over Apache Tez and Spark.

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

Data Science Milan

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrigoni, Senior Data Scientist, Pirelli (pirelli.com) Abstract: Pirelli, a global performance tire manufacturer, uses data science in its 20 factories to improve quality and efficiency, and reduce energy consumption. For this “Smart Manufacturing” initiative, Pirelli’s data science team has developed predictive models and analytics tools to monitor processes, machines and materials on the factory floors. In this talk we will show some of the solutions we deploy, demonstrate how we used Domino’s data science platform and Plot.ly to build these solutions, and discuss the next steps in this journey towards predictive maintenance. Bio: Alberto Arrigoni is a data scientist at Pirelli, where he works to process sensors and telemetry data for IoT, Smart Factories and connected-vehicle applications. He works closely with all major business units such as R&D, industrial engineering and BI to develop tailored machine learning algorithms and production systems. He holds a PhD in biostatistics from the University of Milan Bicocca and prior to joining Pirelli was a staff data scientist at the National Institute of Molecular Genetics (Milan), as well as a Fulbright student at the Santa Clara University and visiting PhD student at Pacific Biosciences (Menlo Park, CA).

Nitin sharma - Deep Learning Applications to Online Payment Fraud Detection

MLconf

Highly-scalable Reinforcement Learning RLlib for Real-world Applications

Bill Liu

website: https://learn.xnextcon.com/event/eventdetails/W20051110 video: https://www.youtube.com/watch?v=8tG8PJC6oaU In reinforcement learning (RL), an agent learns how to optimize performance solely by collecting experience in the real world or via a simulator. RL is being applied to problems such as decision making, process optimization (e.g., manufacturing and supply chains), ad serving, recommendations, self-driving cars, and algorithmic trading. In this talk, I will discuss RLlib, a reinforcement learning library built on Ray with a strong focus on large-scale execution and scalability, ease-of-use for general users, as well as customizability for developers and researchers. RLlib offers autonomous task-learning via many common RL algorithms and it scales from a laptop to a cluster with hundreds of machines. It is used by dozens of organizations, from startups to research labs to large organizations. You will see RLlib in action with a live demo.

Scaling AI in production using PyTorch

geetachauhan

Slides from my talk at MLOps World' 21 Deploying AI models in production and scaling the ML services is still a big challenge. In this talk we will cover details of how to deploy your AI models, best practices for the deployment scenarios, and techniques for performance optimization and scaling the ML services. Come join us to learn how you can jumpstart the journey of taking your PyTorch models from Research to production.

Deep learning for FinTech

geetachauhan

B4UConference_machine learning_deeplearning

Hoa Le

Amazon SageMaker 內建機器學習演算法 (Level 400)

Amazon Web Services

What's hot

Love & Innovative technology presented by a technology pioneer and an AI expe...

Romeo Kienzler

IBM Middle East Data Science Connect 2016 - Doha, Qatar

Romeo Kienzler

Production ready big ml workflows from zero to hero daniel marcous @ waze

Ido Shilon

DeepLearning and Advanced Machine Learning on IoT

Romeo Kienzler

Retrieving Visually-Similar Products for Shopping Recommendations using Spark...

Databricks

Driverless AI - Arno Candel, H2O.ai

Sri Ambati

Video Analytics on Hadoop webinar victor fang-201309

DrVictorFang

Introduction to Generative Adversarial Networks (GAN) with Apache MXNet

Amazon Web Services

Python for Data Science with Anaconda

Travis Oliphant

Webinar - Analyzing Video

Turi, Inc.

Hussein Mehanna, Engineering Director, ML Core - Facebook at MLconf ATL 2016

MLconf

Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...

Sri Ambati

Image Classification Done Simply using Keras and TensorFlow

Rajiv Shah

Predicting Medical Test Results using Driverless AI

Sri Ambati

SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...

Impetus Technologies

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

Data Science Milan

Nitin sharma - Deep Learning Applications to Online Payment Fraud Detection

MLconf

Highly-scalable Reinforcement Learning RLlib for Real-world Applications

Bill Liu

Scaling AI in production using PyTorch

geetachauhan

Deep learning for FinTech

geetachauhan

What's hot (20)

Love & Innovative technology presented by a technology pioneer and an AI expe...

IBM Middle East Data Science Connect 2016 - Doha, Qatar

Production ready big ml workflows from zero to hero daniel marcous @ waze

DeepLearning and Advanced Machine Learning on IoT

Retrieving Visually-Similar Products for Shopping Recommendations using Spark...

Driverless AI - Arno Candel, H2O.ai

Video Analytics on Hadoop webinar victor fang-201309

Introduction to Generative Adversarial Networks (GAN) with Apache MXNet

Python for Data Science with Anaconda

Webinar - Analyzing Video

Hussein Mehanna, Engineering Director, ML Core - Facebook at MLconf ATL 2016

Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...

Image Classification Done Simply using Keras and TensorFlow

Predicting Medical Test Results using Driverless AI

SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

Nitin sharma - Deep Learning Applications to Online Payment Fraud Detection

Highly-scalable Reinforcement Learning RLlib for Real-world Applications

Scaling AI in production using PyTorch

Deep learning for FinTech

Similar to Quoc Le at AI Frontiers : Automated Machine Learning

B4UConference_machine learning_deeplearning

Hoa Le

Amazon SageMaker 內建機器學習演算法 (Level 400)

Amazon Web Services

Amazon SageMaker

Amazon Web Services

Build a Custom Model for Object & Logo Detection (AIM421) - AWS re:Invent 2018

Amazon Web Services

Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...

gdgsurrey

From notebook to production with Amazon Sagemaker

Amazon Web Services

Semi-Supervised Insight Generation from Petabyte Scale Text Data

Tech Triveni

Existing state-of-the-art supervised methods in Machine Learning require large amounts of annotated data to achieve good performance and generalization. However, manually constructing such a training data set with sentiment labels is a labor-intensive and time-consuming task. With the proliferation of data acquisition in domains such as images, text and video, the rate at which we acquire data is greater than the rate at which we can label them. Techniques that reduce the amount of labeled data needed to achieve competitive accuracies are of paramount importance for deploying scalable, data-driven, real-world solutions. At Envestnet | Yodlee, we have deployed several advanced state-of-the-art Machine Learning solutions that process millions of data points on a daily basis with very stringent service level commitments. A key aspect of our Natural Language Processing solutions is Semi-supervised learning (SSL): A family of methods that also make use of unlabelled data for training – typically a small amount of labeled data with a large amount of unlabelled data. Pure supervised solutions fail to exploit the rich syntactic structure of the unlabelled data to improve decision boundaries. There is an abundance of published work in the field - but few papers have succeeded in showing significantly better results than state-of-the-art supervised learning. Often, methods have simplifying assumptions that fail to transfer to real-world scenarios. There is a lack of practical guidelines for deploying effective SSL solutions. We attempt to bridge that gap by sharing our learning from successful SSL models deployed in production

Machine Learning Algorithms

DezyreAcademy

Machine Learning 2 deep Learning: An Intro

Si Krishan

Computer Vision for Beginners

Sanghamitra Deb

Computer Vision abbreviated as CV aims to teach computers to achieve human level vision capabilities. Applications of CV in self driving cars, robotics, healthcare, education and the multitude of apps that allow customers to use the smartphone cameras to convey information has made it one of the most popular fields in Artificial Intelligence. The recent advances in Deep Learning, data storage and computing capabilities has lead to the huge success of CV. There are several tasks in computer vision, such as classification, object detection, image segmentation, optical character recognition, scene reconstruction and many others. In this presentation I will talk about applying Transfer Learning, Image classification, object detection and the metrics required to measure them on still images. The increase in accuracy over of CV tasks over the past decade is due to Convolutional Neural Networks (CNN), CNN is the base used in architectures such as RESNET or VGGNET. I will go through how to use these pre-trained models for image classification and feature extraction. One of the break throughs in object detection has come with one-shot learning, where the bounding box and the class of the object is predicted simultaneously. This leads to low latency during inference (155 frames per second) and high accuracy. This is the framework behind object detection using YOLO , I will explain how to use yolo for specific use cases.

Build, train, and deploy machine learning models at scale - AWS Summit Cape T...

Amazon Web Services

Speaker: Adrian Hornsby, AWS Level: 300 Machine learning often feels a lot harder than it should be to most developers because the process to build and train models, and then deploy them into production is too complicated and too slow. Amazon SageMaker is a fully-managed service that enables developers and data scientists to quickly and easily build, train, and deploy machine learning models at any scale. In this session, I will make a quick introduction to machine learning and walk through leveraging Sagemaker for your machine learning projects.

MLOps and Reproducible ML on AWS with Kubeflow and SageMaker

Provectus

Looking to implement MLOps using AWS services and Kubeflow? Come and learn about machine learning from the experts of Provectus and Amazon Web Services (AWS)! Businesses recognize that machine learning projects are important but go beyond just building and deploying models, which is mostly done by organizations. Successful ML projects entail a complete lifecycle involving ML, DevOps, and data engineering and are built on top of ML infrastructure. AWS and Amazon SageMaker provide a foundation for building infrastructure for machine learning while Kubeflow is a great open source project, which is not given enough credit in the AWS community. In this webinar, we show how to design and build an end-to-end ML infrastructure on AWS. Agenda - Introductions - Case Study: GoCheck Kids - Overview of AWS Infrastructure for Machine Learning - Provectus ML Infrastructure on AWS - Experimentation - MLOps - Feature Store Intended Audience Technology executives & decision makers, manager-level tech roles, data engineers & data scientists, ML practitioners & ML engineers, and developers Presenters - Stepan Pushkarev, Chief Technology Officer, Provectus - Qingwei Li, ML Specialist Solutions Architect, AWS Feel free to share this presentation with your colleagues and don't hesitate to reach out to us at info@provectus.com if you have any questions! REQUEST WEBINAR: https://provectus.com/webinar-mlops-and-reproducible-ml-on-aws-with-kubeflow-and-sagemaker-aug-2020/

Performance evaluation of GANs in a semisupervised OCR use case

Florian Wilhelm

Performance evaluation of GANs in a semisupervised OCR use case

inovex GmbH

Online vehicle marketplaces are embracing artificial intelligence to ease the process of selling a vehicle on their platform. The tedious work of copying information from the vehicle registration document into some web form can be automated with the help of smart text-spotting systems, in which the seller takes a picture of the document, and the necessary information is extracted automatically. Florian Wilhelm details the components of a text-spotting system, including the subtasks of object detection and optical character recognition (OCR). Florian elaborates on the challenges of OCR in documents with various distortions and artifacts, which rule out off-the-shelf products for this task. After offering an overview of semisupervised learning based on generative adversarial networks (GANs), Florian evaluates the performance gains of this method compared to supervised learning. More specifically, for a varying amount of labeled data, he compares the accuracy of a convolution neural network (CNN) to a GANthat uses additional unlabeled data during the training phase, showing that GANs significantly outperform classical CNNs in use cases with a lack of labeled data. What you'll learn: Understand how semisupervised learning with GANs works Explore beneficial semisupervised methods based on GANs for use cases with a limited amount of labeled data Gain insight into an interesting OCR use case of an online vehicle marketplace Event: O'Reilly Artificial Intelligence Conference, London, 11.10.2018 Speaker: Dr. Florian Wilhelm Mehr Tech-Vorträge: www.inovex.de/vortraege Mehr Tech-Artikel: www.inovex.de/blog

K anonymity for crowdsourcing database

LeMeniz Infotech

Task Adaptive Neural Network Search with Meta-Contrastive Learning

MLAI2

Most conventional Neural Architecture Search (NAS) approaches are limited in that they only generate architectures without searching for the optimal parameters. While some NAS methods handle this issue by utilizing a supernet trained on a large-scale dataset such as ImageNet, they may be suboptimal if the target tasks are highly dissimilar from the dataset the supernet is trained on. To address such limitations, we introduce a novel problem of Neural Network Search (NNS), whose goal is to search for the optimal pretrained network for a novel dataset and constraints (e.g. number of parameters), from a model zoo. Then, we propose a novel framework to tackle the problem, namely Task-Adaptive Neural Network Search (TANS). Given a model-zoo that consists of network pretrained on diverse datasets, we use a novel amortized meta-learning framework to learn a cross-modal latent space with contrastive loss, to maximize the similarity between a dataset and a high-performing network on it, and minimize the similarity between irrelevant dataset-network pairs. We validate the effectiveness and efficiency of our method on ten real-world datasets, against existing NAS/AutoML baselines. The results show that our method instantly retrieves networks that outperform models obtained with the baselines with significantly fewer training steps to reach the target performance, thus minimizing the total cost of obtaining a task-optimal network. Our code and the model-zoo are available at https://anonymous.4open.science/r/TANS-33D6

[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...

Amazon Web Services Korea

발표자료 다시보기: https://youtu.be/xnimaVNTWfc 여러분의 애플리케이션에 인공 지능 기능을 추가하는 방법 중 하나로, GluonCV 및 AutoGluon 라이브러리를 이용해서 간단한 Python 코드로 높은 성능의 기계 학습 모델을 만들고 이를 예측에 사용하는 방법을 소개합니다. 정형 데이터에 대한 분류 또는 수치 예측 모델 생성부터 이미지 분류, 객체 탐지, 세그먼테이션, 행동 인식 등의 모델을 기계 학습에 대한 전문 지식이 없이도 자동으로 만들고 활용하는 방법을 알아봅니다.

AI @ Microsoft, How we do it and how you can too!

Microsoft Tech Community

The importance of model fairness and interpretability in AI systems

Francesca Lazzeri, PhD

Machine learning model fairness and interpretability are critical for data scientists, researchers and developers to explain their models and understand the value and accuracy of their findings. Interpretability is also important to debug machine learning models and make informed decisions about how to improve them. In this session, Francesca will go over a few methods and tools that enable you to "unpack” machine learning models, gain insights into how and why they produce specific results, assess your AI systems fairness and mitigate any observed fairness issues. Using open-source fairness and interpretability packages, attendees will learn how to: - Explain model prediction by generating feature importance values for the entire model and/or individual data points. - Achieve model interpretability on real-world datasets at scale, during training and inference. - Use an interactive visualization dashboard to discover patterns in data and explanations at training time. - Leverage additional interactive visualizations to assess which groups of users might be negatively impacted by a model and compare multiple models in terms of their fairness and performance.

深度學習在AOI的應用

CHENHuiMei

Similar to Quoc Le at AI Frontiers : Automated Machine Learning (20)

B4UConference_machine learning_deeplearning

Amazon SageMaker 內建機器學習演算法 (Level 400)

Amazon SageMaker

Build a Custom Model for Object & Logo Detection (AIM421) - AWS re:Invent 2018

Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...

From notebook to production with Amazon Sagemaker

Semi-Supervised Insight Generation from Petabyte Scale Text Data

Machine Learning Algorithms

Machine Learning 2 deep Learning: An Intro

Computer Vision for Beginners

Build, train, and deploy machine learning models at scale - AWS Summit Cape T...

MLOps and Reproducible ML on AWS with Kubeflow and SageMaker

Performance evaluation of GANs in a semisupervised OCR use case

K anonymity for crowdsourcing database

Task Adaptive Neural Network Search with Meta-Contrastive Learning

[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...

AI @ Microsoft, How we do it and how you can too!

The importance of model fairness and interpretability in AI systems

深度學習在AOI的應用

More from AI Frontiers

Divya Jain at AI Frontiers : Video Summarization

AI Frontiers

As video content is becoming mainstream, video summarization is becoming a hot research topic in academia and industry. Video thumbnail generation and summarization has been worked on for years, but deep learning and reinforcement learning is changing the landscape and emerging as the winner for optimal frame selection. Recent advances in GANs are improving the quality, aesthetics and relevancy of the frames to represent the original videos. Come join this session to get an understanding of various challenges and emerging solutions around video summarization.

Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 1: Heuristi...

AI Frontiers

Topic：Heuristic Search and how does it apply to self-driving cars? (for Beginners) Outline: 1. Technologies behind self-driving vehicles - Motion Planning - Decision Making 2. Graph Search Algorithms - Depth-First Search - Breadth-First Search - A* Search 3. Incremental Heuristic Search Algorithms - Repeated A* Search - Adaptive A* - Generalized Fringe-Retrieving A*

Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...

AI Frontiers

In this tutorial I will introduce recent work in applying weak supervision and reinforcement learning to Questions Answering (QA) systems. Specifically we discuss the semantic parsing task for which natural language queries are converted to computation steps on knowledge graphs or data tables and produce the expected answers. State-of-the-art results can be achieved by novel memory structure for sequence models and improvements in reinforcement learning algorithms. Related code and experiment setup can be found at https://github.com/crazydonkey200/neural-symbolic-machines. Related paper: https://openreview.net/pdf?id=SyK00v5xx.

Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-lecture 2: Incremen...

AI Frontiers

Training at AI Frontiers 2018 - Udacity: Enhancing NLP with Deep Neural Networks

AI Frontiers

Instructor: Mat Leonard Outline 1. Text Processing Using Python + NLTK Cleaning Normalization Tokenization Part-of-speech Tagging Stemming and Lemmatization 2. Feature Extraction Bag of Words TF-IDF Word Embeddings Word2Vec GloVe 3. Topic Modeling Latent Variables Beta and Dirichlet Distributions Laten Dirichlet Allocation 4. NLP with Deep Learning Neural Networks Recurrent Neural Networks (RNNs) Word Embeddings Sentiment Analysis with RNNs

Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 3: Any-Angl...

AI Frontiers

Training at AI Frontiers 2018 - Lukasz Kaiser: Sequence to Sequence Learning ...

AI Frontiers

Percy Liang at AI Frontiers : Pushing the Limits of Machine Learning

AI Frontiers

In recent years, machine learning has undoubtedly been hugely successful in driving progress in AI applications. However, as we will explore in this talk, even state-of-the-art systems have "blind spots" which make them generalize poorly out of domain and render them vulnerable to adversarial examples. We then suggest that more unsupervised learning settings can encourage the development of more robust systems. We show positive results on two tasks: (i) text style and attribute transfer, the task of converting a sentence with one attribute (e.g., sentiment) to one with another; and (ii) solving SAT instances (classical problems requiring logical reasoning) using end-to-end neural networks.

Ilya Sutskever at AI Frontiers : Progress towards the OpenAI mission

AI Frontiers

I will present several advances in deep learning from OpenAI. First, I will present OpenAI Five, a neural network that learned to play on par with some of the strongest professional Dota 2 teams in the world in an 18-hero version of the game. Next, I will present Dactyl, a human-like robot hand trained entirely in simulation with reinforcement learning that has achieved unprecedented dexterity on a physical robot. I will also present our results on unsupervised learning in language, that show that pre-training and finetuning can achieve a significant improvement over state of the art. Finally, I will present an overview of the historical progress in the field.

Mark Moore at AI Frontiers : Uber Elevate

AI Frontiers

Mario Munich at AI Frontiers : Consumer robotics: embedding affordable AI in ...

AI Frontiers

The availability of affordable electronics components, powerful embedded microprocessors, and ubiquitous internet access and WiFi in the household has enabled a new generation of connected consumer robots. In 2015, iRobot launched the Roomba 980, introducing intelligent visual navigation to its successful line of vacuum cleaning robots. In 2018, iRobot launched the Roomba i7, equipped with the latest mapping and navigation technology that provides spatial information to the broader ecosystem of connected devices in the home. In this talk, I will describe the challenges and the potential of introducing consumer robots capable of developing spatial context by exploring the physical space of the home, and I will elaborate on the impact of AI in the future of robotics applications. Moreover, I will describe our vision of the Smart Home, an AI-powered home that maintains itself and magically just does the right thing in anticipation of occupant needs. This home will be built on an ecosystem of connected and coordinated robots, sensors, and devices that provides the occupants with a high quality of life by seamlessly responding to the needs of daily living – from comfort to convenience to security to efficiency.

Arnaud Thiercelin at AI Frontiers : AI in the Sky

AI Frontiers

Anima Anandkumar at AI Frontiers : Modern ML : Deep, distributed, Multi-dimen...

AI Frontiers

As the data and models scale, it becomes necessary to have multiple processing units for both training and inference. SignSGD is a gradient compression algorithm that only transmits the sign of the stochastic gradients during distributed training. This algorithm uses 32 times less communication per iteration than distributed SGD. We show that signSGD obtains free lunch both in theory and practice: no loss in accuracy while yielding speedups. Pushing the current boundaries of deep learning also requires using multiple dimensions and modalities. These can be encoded into tensors, which are natural extensions of matrices. These functionalities are available in the Tensorly package with multiple backend interfaces for large-scale deep learning.

Wei Xu at AI Frontiers : Language Learning in an Interactive and Embodied Set...

AI Frontiers

Sumit Gupta at AI Frontiers : AI for Enterprise

AI Frontiers

The use of AI for voice search and image recognition is talked about often. Enterprises, however, have different challenges and requirements. In this talk, we will focus on talking about use cases in the enterprise and challenges in building out AI solutions. We will talk about how an Auto-machine learning software for videos and images called PowerAI Vision enables quick AI model training & deployment for various enterprise use cases.

Yuandong Tian at AI Frontiers : Planning in Reinforcement Learning

AI Frontiers

Deep Reinforcement Learning (DRL) has made strong progress in many tasks, such as board games, robotics, navigation, neural architecture search, etc. I will present our recent open-sourced DRL frameworks to facilitate game research and development. Our framework is scalable so we can can reproduce AlphaGoZero and AlphaZero using 2000 GPUs, achieving super-human performance of Go AI that beats 4 top-30 professional players. We also show usability of our platform by training agents in real-time strategy games, and show interesting behaviors with a small amount of resource.

Alex Ermolaev at AI Frontiers : Major Applications of AI in Healthcare

AI Frontiers

The latest AI advances have the potential to massively improve our health and well being. However, most of the work is yet to be done. In this talk, we will explore the most important opportunities for AI in healthcare. For example, we will explore how AI can diagnose major life-threatening conditions even before those conditions emerge. We will talk about AI ability to recommend dramatically more effective and less harmful treatment plans based on AI understanding of patient's medical history and current conditions. Finally, we will talk about AI role in making our healthcare system effective and affordable for everyone.

Long Lin at AI Frontiers : AI in Gaming

AI Frontiers

Games have been leveraging AI since the 1950s, when people built a rules-based AI engine that played tic-tac-toe. With technological advances over the years, AI has become increasingly popular and widely used in the gaming industry. The typical characteristics of games and game development makes them an ideal playground for practicing and implementing AI techniques, especially deep learning and reinforcement learning. Most games are well scoped; it is relatively easy to generate and use the data; and states/actions/rewards are relatively clear. In this talk, I will show a couple of use cases where ML/AI helps in-game development and enhances player experience. Examples include AI agents playing game and services that provide personalized experience to players.

Melissa Goldman at AI Frontiers : AI & Finance

AI Frontiers

Li Deng at AI Frontiers : From Modeling Speech/Language to Modeling Financial...

AI Frontiers

More from AI Frontiers (20)