The document discusses Bayesian optimization and the apsis toolkit. It summarizes that apsis is an open source framework for automated hyperparameter optimization using techniques like random search and Bayesian optimization. It implements state-of-the-art Bayesian optimization methods with Gaussian processes and expected improvement acquisition. The document outlines apsis' architecture, parameter representation methods, optimization procedures, and development practices like testing, documentation, and issue tracking to encourage collaboration.
Industrial project and machine scheduling with Constraint ProgrammingPhilippe Laborie
More often than not, project and machine scheduling problems are addressed either by generic mathematical programming techniques (like MILP) or by problem-specific exact or heuristic approaches. MILP formulations are commonly used to describe the problem in mathematical terms and to provide optimal solutions or bounds to small problem instances. As they usually do not scale well, one usually resorts to using heuristics for handling large and complex industrial problems.
Though constraint programming (CP) techniques represent the state of the art in several classical project and machine scheduling benchmarks and have been used for almost 30 years for solving industrial problems, they are still seldom considered as an alternative approach in the scheduling community. A possible reason is that, for years, in the absence of efficient and robust automatic search algorithms, CP techniques have been difficult to use for non-CP experts.
We will explain why we think this time is over and illustrate our arguments with CP Optimizer, a generic system, largely based on CP, for modeling and solving real-world scheduling problems.
CP Optimizer extends linear programming with an algebraic language using simple mathematical concepts (such as intervals, sequences and functions) to capture the temporal dimension of scheduling problems in a combinatorial optimization framework. CP Optimizer implements a model-and-run paradigm that vastly reduces the burden on the user to understand CP or scheduling algorithms: modeling is by far the most important. The automatic search combines a wide variety of techniques from Artificial Intelligence (constraint programming, temporal reasoning, learning etc.) and Operations Research (mathematical programming, graph algorithms, local search, etc.) in an exact algorithm that provides good performance out of the box and which is continuously improving.
new optimization algorithm for topology optimizationSeonho Park
authors devise new convex approximation called DQA which utilizes information of two consecutive points at iterates. Also, to guarantee global convergence, filter method is illustrated.
Accelerated life testing (ALT) is widely used to expedite failures of a product in a short time period for predicting the product’s reliability under normal operating conditions. The resulting ALT data are often characterized by a probability distribution, such as Weibull, Lognormal, Gamma distribution, along with a life-stress relationship. However, if the selected failure time distribution is not adequate in describing the ALT data, the resulting reliability prediction would be misleading. In this talk, we provide a generic method for modeling ALT data which will assist engineers in dealing with a variety of failure time distributions. The method uses Erlang-Coxian (EC) distributions, which belong to a particular subset of phase-type (PH) distributions, to approximate the underlying failure time distributions arbitrarily closely. To estimate the parameters of such an EC-based ALT model, two statistical inference approaches are proposed. First, a mathematical programming approach is formulated to simultaneously match the moments of the EC-based ALT model to the ALT data collected at all test stress levels. This approach resolves the feasibility issue of the method of moments. In addition, the maximum likelihood estimation (MLE) approach is proposed to handle ALT data with type-I censoring. Numerical examples are provided to illustrate the capability of the generic method in modeling ALT data.
Industrial project and machine scheduling with Constraint ProgrammingPhilippe Laborie
More often than not, project and machine scheduling problems are addressed either by generic mathematical programming techniques (like MILP) or by problem-specific exact or heuristic approaches. MILP formulations are commonly used to describe the problem in mathematical terms and to provide optimal solutions or bounds to small problem instances. As they usually do not scale well, one usually resorts to using heuristics for handling large and complex industrial problems.
Though constraint programming (CP) techniques represent the state of the art in several classical project and machine scheduling benchmarks and have been used for almost 30 years for solving industrial problems, they are still seldom considered as an alternative approach in the scheduling community. A possible reason is that, for years, in the absence of efficient and robust automatic search algorithms, CP techniques have been difficult to use for non-CP experts.
We will explain why we think this time is over and illustrate our arguments with CP Optimizer, a generic system, largely based on CP, for modeling and solving real-world scheduling problems.
CP Optimizer extends linear programming with an algebraic language using simple mathematical concepts (such as intervals, sequences and functions) to capture the temporal dimension of scheduling problems in a combinatorial optimization framework. CP Optimizer implements a model-and-run paradigm that vastly reduces the burden on the user to understand CP or scheduling algorithms: modeling is by far the most important. The automatic search combines a wide variety of techniques from Artificial Intelligence (constraint programming, temporal reasoning, learning etc.) and Operations Research (mathematical programming, graph algorithms, local search, etc.) in an exact algorithm that provides good performance out of the box and which is continuously improving.
new optimization algorithm for topology optimizationSeonho Park
authors devise new convex approximation called DQA which utilizes information of two consecutive points at iterates. Also, to guarantee global convergence, filter method is illustrated.
Accelerated life testing (ALT) is widely used to expedite failures of a product in a short time period for predicting the product’s reliability under normal operating conditions. The resulting ALT data are often characterized by a probability distribution, such as Weibull, Lognormal, Gamma distribution, along with a life-stress relationship. However, if the selected failure time distribution is not adequate in describing the ALT data, the resulting reliability prediction would be misleading. In this talk, we provide a generic method for modeling ALT data which will assist engineers in dealing with a variety of failure time distributions. The method uses Erlang-Coxian (EC) distributions, which belong to a particular subset of phase-type (PH) distributions, to approximate the underlying failure time distributions arbitrarily closely. To estimate the parameters of such an EC-based ALT model, two statistical inference approaches are proposed. First, a mathematical programming approach is formulated to simultaneously match the moments of the EC-based ALT model to the ALT data collected at all test stress levels. This approach resolves the feasibility issue of the method of moments. In addition, the maximum likelihood estimation (MLE) approach is proposed to handle ALT data with type-I censoring. Numerical examples are provided to illustrate the capability of the generic method in modeling ALT data.
Conventionally when we talk about Recommender Systems, we talk about collaborative filtering. While providing personalized recommendations through collaborative filtering is an essential aspect to providing effective recommendations, it is but a piece of a much broader ecosystem of functionality, tools, and development pipelines. This presentation will discuss an holistic approach to building recommendation systems including 1) iterating towards better recommendations, 2) the data pipelines required, 3) a machine-learned ranking approach based on an Information Retrieval formulation that leverages collaborative filtering, 4) ways to make recommendations more relevant and interpretable.
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...Data Con LA
The advent of modern deep learning techniques has given organizations new tools to understand, query, and structure their data. However, maintaining complex pipelines, versioning models, and tracking accuracy regressions over time remain ongoing struggles of even the most advanced data engineering teams. This talk presents a simple architecture for deploying machine learning at scale and offer suggestions for how companies can get their feet wet with open source technologies they already deploy.
Spark, Deep Learning and Life Sciences, Systems Biology in the Big Data Agebatchinsights
In this talk I will outline current advances in the use of Spark for next generation sequencing, protein interaction networks and folding challenges. I will outline how Spark with Cassandra can be used with Deep Learning to predict biological function and disease. I also outline use cases for virtual screening and drug discovery.
One of the most important, yet often overlooked, aspects of predictive modeling is the transformation of data to create model inputs, better known as feature engineering (FE). This talk will go into the theoretical background behind FE, showing how it leverages existing data to produce better modeling results. It will then detail some important FE techniques that should be in every data scientist’s tool kit.
Feature engineering--the underdog of machine learning. This deck provides an overview of feature generation methods for text, image, audio, feature cleaning and transformation methods, how well they work and why.
TensorFrames: Google Tensorflow on Apache SparkDatabricks
Presentation at Bay Area Spark Meetup by Databricks Software Engineer and Spark committer Tim Hunter.
This presentation covers how you can use TensorFrames with Tensorflow to distributed computing on GPU.
Scikit-Learn is a powerful machine learning library implemented in Python with numeric and scientific computing powerhouses Numpy, Scipy, and matplotlib for extremely fast analysis of small to medium sized data sets. It is open source, commercially usable and contains many modern machine learning algorithms for classification, regression, clustering, feature extraction, and optimization. For this reason Scikit-Learn is often the first tool in a Data Scientists toolkit for machine learning of incoming data sets.
The purpose of this one day course is to serve as an introduction to Machine Learning with Scikit-Learn. We will explore several clustering, classification, and regression algorithms for a variety of machine learning tasks and learn how to implement these tasks with our data using Scikit-Learn and Python. In particular, we will structure our machine learning models as though we were producing a data product, an actionable model that can be used in larger programs or algorithms; rather than as simply a research or investigation methodology.
Automatic Task-based Code Generation for High Performance DSELJoel Falcou
Providing high level tools for parallel programming while sustaining a high level of performance has been a challenge that techniques like Domain Specific Embedded Languages try to solve. In previous works, we investigated the design of such a DSEL – NT2 – providing a Matlab -like syntax for parallel numerical computations inside a C++ library.
Main issues addressed here is how liimtaions of classical DSEL generation and multithreaded code generation can be overcome.
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016MLconf
Using Bayesian Optimization to Tune Machine Learning Models: In this talk we briefly introduce Bayesian Global Optimization as an efficient way to optimize machine learning model parameters, especially when evaluating different parameters is time-consuming or expensive. We will motivate the problem and give example applications.
We will also talk about our development of a robust benchmark suite for our algorithms including test selection, metric design, infrastructure architecture, visualization, and comparison to other standard and open source methods. We will discuss how this evaluation framework empowers our research engineers to confidently and quickly make changes to our core optimization engine.
We will end with an in-depth example of using these methods to tune the features and hyperparameters of a real world problem and give several real world applications.
International Refereed Journal of Engineering and Science (IRJES) is a peer reviewed online journal for professionals and researchers in the field of computer science. The main aim is to resolve emerging and outstanding problems revealed by recent social and technological change. IJRES provides the platform for the researchers to present and evaluate their work from both theoretical and technical aspects and to share their views.
www.irjes.com
Conventionally when we talk about Recommender Systems, we talk about collaborative filtering. While providing personalized recommendations through collaborative filtering is an essential aspect to providing effective recommendations, it is but a piece of a much broader ecosystem of functionality, tools, and development pipelines. This presentation will discuss an holistic approach to building recommendation systems including 1) iterating towards better recommendations, 2) the data pipelines required, 3) a machine-learned ranking approach based on an Information Retrieval formulation that leverages collaborative filtering, 4) ways to make recommendations more relevant and interpretable.
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...Data Con LA
The advent of modern deep learning techniques has given organizations new tools to understand, query, and structure their data. However, maintaining complex pipelines, versioning models, and tracking accuracy regressions over time remain ongoing struggles of even the most advanced data engineering teams. This talk presents a simple architecture for deploying machine learning at scale and offer suggestions for how companies can get their feet wet with open source technologies they already deploy.
Spark, Deep Learning and Life Sciences, Systems Biology in the Big Data Agebatchinsights
In this talk I will outline current advances in the use of Spark for next generation sequencing, protein interaction networks and folding challenges. I will outline how Spark with Cassandra can be used with Deep Learning to predict biological function and disease. I also outline use cases for virtual screening and drug discovery.
One of the most important, yet often overlooked, aspects of predictive modeling is the transformation of data to create model inputs, better known as feature engineering (FE). This talk will go into the theoretical background behind FE, showing how it leverages existing data to produce better modeling results. It will then detail some important FE techniques that should be in every data scientist’s tool kit.
Feature engineering--the underdog of machine learning. This deck provides an overview of feature generation methods for text, image, audio, feature cleaning and transformation methods, how well they work and why.
TensorFrames: Google Tensorflow on Apache SparkDatabricks
Presentation at Bay Area Spark Meetup by Databricks Software Engineer and Spark committer Tim Hunter.
This presentation covers how you can use TensorFrames with Tensorflow to distributed computing on GPU.
Scikit-Learn is a powerful machine learning library implemented in Python with numeric and scientific computing powerhouses Numpy, Scipy, and matplotlib for extremely fast analysis of small to medium sized data sets. It is open source, commercially usable and contains many modern machine learning algorithms for classification, regression, clustering, feature extraction, and optimization. For this reason Scikit-Learn is often the first tool in a Data Scientists toolkit for machine learning of incoming data sets.
The purpose of this one day course is to serve as an introduction to Machine Learning with Scikit-Learn. We will explore several clustering, classification, and regression algorithms for a variety of machine learning tasks and learn how to implement these tasks with our data using Scikit-Learn and Python. In particular, we will structure our machine learning models as though we were producing a data product, an actionable model that can be used in larger programs or algorithms; rather than as simply a research or investigation methodology.
Automatic Task-based Code Generation for High Performance DSELJoel Falcou
Providing high level tools for parallel programming while sustaining a high level of performance has been a challenge that techniques like Domain Specific Embedded Languages try to solve. In previous works, we investigated the design of such a DSEL – NT2 – providing a Matlab -like syntax for parallel numerical computations inside a C++ library.
Main issues addressed here is how liimtaions of classical DSEL generation and multithreaded code generation can be overcome.
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016MLconf
Using Bayesian Optimization to Tune Machine Learning Models: In this talk we briefly introduce Bayesian Global Optimization as an efficient way to optimize machine learning model parameters, especially when evaluating different parameters is time-consuming or expensive. We will motivate the problem and give example applications.
We will also talk about our development of a robust benchmark suite for our algorithms including test selection, metric design, infrastructure architecture, visualization, and comparison to other standard and open source methods. We will discuss how this evaluation framework empowers our research engineers to confidently and quickly make changes to our core optimization engine.
We will end with an in-depth example of using these methods to tune the features and hyperparameters of a real world problem and give several real world applications.
International Refereed Journal of Engineering and Science (IRJES) is a peer reviewed online journal for professionals and researchers in the field of computer science. The main aim is to resolve emerging and outstanding problems revealed by recent social and technological change. IJRES provides the platform for the researchers to present and evaluate their work from both theoretical and technical aspects and to share their views.
www.irjes.com
HDR Defence - Software Abstractions for Parallel ArchitecturesJoel Falcou
Performing large, intensive or non-trivial computing on array like data
structures is one of the most common task in scientific computing, video game
development and other fields. This matter of fact is backed up by the large number
of tools, languages and libraries to perform such tasks. If we restrict ourselves to
C++ based solutions, more than a dozen such libraries exists from BLAS/LAPACK
C++ binding to template meta-programming based Blitz++ or Eigen.
If all of these libraries provide good performance or good abstraction, none of
them seems to fit the need of so many different user types. Moreover, as parallel
system complexity grows, the need to maintain all those components quickly
become unwieldy. This thesis explores various software design techniques - like
Generative Programming, MetaProgramming and Generic Programming - and their
application to the implementation of various parallel computing libraries in such a
way that abstraction and expressiveness are maximized while efficiency overhead is
minimized.
top nidhi software solution freedownloadvrstrong314
This presentation emphasizes the importance of data security and legal compliance for Nidhi companies in India. It highlights how online Nidhi software solutions, like Vector Nidhi Software, offer advanced features tailored to these needs. Key aspects include encryption, access controls, and audit trails to ensure data security. The software complies with regulatory guidelines from the MCA and RBI and adheres to Nidhi Rules, 2014. With customizable, user-friendly interfaces and real-time features, these Nidhi software solutions enhance efficiency, support growth, and provide exceptional member services. The presentation concludes with contact information for further inquiries.
Enterprise Resource Planning System includes various modules that reduce any business's workload. Additionally, it organizes the workflows, which drives towards enhancing productivity. Here are a detailed explanation of the ERP modules. Going through the points will help you understand how the software is changing the work dynamics.
To know more details here: https://blogs.nyggs.com/nyggs/enterprise-resource-planning-erp-system-modules/
Cyaniclab : Software Development Agency Portfolio.pdfCyanic lab
CyanicLab, an offshore custom software development company based in Sweden,India, Finland, is your go-to partner for startup development and innovative web design solutions. Our expert team specializes in crafting cutting-edge software tailored to meet the unique needs of startups and established enterprises alike. From conceptualization to execution, we offer comprehensive services including web and mobile app development, UI/UX design, and ongoing software maintenance. Ready to elevate your business? Contact CyanicLab today and let us propel your vision to success with our top-notch IT solutions.
Prosigns: Transforming Business with Tailored Technology SolutionsProsigns
Unlocking Business Potential: Tailored Technology Solutions by Prosigns
Discover how Prosigns, a leading technology solutions provider, partners with businesses to drive innovation and success. Our presentation showcases our comprehensive range of services, including custom software development, web and mobile app development, AI & ML solutions, blockchain integration, DevOps services, and Microsoft Dynamics 365 support.
Custom Software Development: Prosigns specializes in creating bespoke software solutions that cater to your unique business needs. Our team of experts works closely with you to understand your requirements and deliver tailor-made software that enhances efficiency and drives growth.
Web and Mobile App Development: From responsive websites to intuitive mobile applications, Prosigns develops cutting-edge solutions that engage users and deliver seamless experiences across devices.
AI & ML Solutions: Harnessing the power of Artificial Intelligence and Machine Learning, Prosigns provides smart solutions that automate processes, provide valuable insights, and drive informed decision-making.
Blockchain Integration: Prosigns offers comprehensive blockchain solutions, including development, integration, and consulting services, enabling businesses to leverage blockchain technology for enhanced security, transparency, and efficiency.
DevOps Services: Prosigns' DevOps services streamline development and operations processes, ensuring faster and more reliable software delivery through automation and continuous integration.
Microsoft Dynamics 365 Support: Prosigns provides comprehensive support and maintenance services for Microsoft Dynamics 365, ensuring your system is always up-to-date, secure, and running smoothly.
Learn how our collaborative approach and dedication to excellence help businesses achieve their goals and stay ahead in today's digital landscape. From concept to deployment, Prosigns is your trusted partner for transforming ideas into reality and unlocking the full potential of your business.
Join us on a journey of innovation and growth. Let's partner for success with Prosigns.
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Globus
The Earth System Grid Federation (ESGF) is a global network of data servers that archives and distributes the planet’s largest collection of Earth system model output for thousands of climate and environmental scientists worldwide. Many of these petabyte-scale data archives are located in proximity to large high-performance computing (HPC) or cloud computing resources, but the primary workflow for data users consists of transferring data, and applying computations on a different system. As a part of the ESGF 2.0 US project (funded by the United States Department of Energy Office of Science), we developed pre-defined data workflows, which can be run on-demand, capable of applying many data reduction and data analysis to the large ESGF data archives, transferring only the resultant analysis (ex. visualizations, smaller data files). In this talk, we will showcase a few of these workflows, highlighting how Globus Flows can be used for petabyte-scale climate analysis.
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus
As part of the DOE Integrated Research Infrastructure (IRI) program, NERSC at Lawrence Berkeley National Lab and ALCF at Argonne National Lab are working closely with General Atomics on accelerating the computing requirements of the DIII-D experiment. As part of the work the team is investigating ways to speedup the time to solution for many different parts of the DIII-D workflow including how they run jobs on HPC systems. One of these routes is looking at Globus Compute as a way to replace the current method for managing tasks and we describe a brief proof of concept showing how Globus Compute could help to schedule jobs and be a tool to connect compute at different facilities.
Experience our free, in-depth three-part Tendenci Platform Corporate Membership Management workshop series! In Session 1 on May 14th, 2024, we began with an Introduction and Setup, mastering the configuration of your Corporate Membership Module settings to establish membership types, applications, and more. Then, on May 16th, 2024, in Session 2, we focused on binding individual members to a Corporate Membership and Corporate Reps, teaching you how to add individual members and assign Corporate Representatives to manage dues, renewals, and associated members. Finally, on May 28th, 2024, in Session 3, we covered questions and concerns, addressing any queries or issues you may have.
For more Tendenci AMS events, check out www.tendenci.com/events
Understanding Globus Data Transfers with NetSageGlobus
NetSage is an open privacy-aware network measurement, analysis, and visualization service designed to help end-users visualize and reason about large data transfers. NetSage traditionally has used a combination of passive measurements, including SNMP and flow data, as well as active measurements, mainly perfSONAR, to provide longitudinal network performance data visualization. It has been deployed by dozens of networks world wide, and is supported domestically by the Engagement and Performance Operations Center (EPOC), NSF #2328479. We have recently expanded the NetSage data sources to include logs for Globus data transfers, following the same privacy-preserving approach as for Flow data. Using the logs for the Texas Advanced Computing Center (TACC) as an example, this talk will walk through several different example use cases that NetSage can answer, including: Who is using Globus to share data with my institution, and what kind of performance are they able to achieve? How many transfers has Globus supported for us? Which sites are we sharing the most data with, and how is that changing over time? How is my site using Globus to move data internally, and what kind of performance do we see for those transfers? What percentage of data transfers at my institution used Globus, and how did the overall data transfer performance compare to the Globus users?
In software engineering, the right architecture is essential for robust, scalable platforms. Wix has undergone a pivotal shift from event sourcing to a CRUD-based model for its microservices. This talk will chart the course of this pivotal journey.
Event sourcing, which records state changes as immutable events, provided robust auditing and "time travel" debugging for Wix Stores' microservices. Despite its benefits, the complexity it introduced in state management slowed development. Wix responded by adopting a simpler, unified CRUD model. This talk will explore the challenges of event sourcing and the advantages of Wix's new "CRUD on steroids" approach, which streamlines API integration and domain event management while preserving data integrity and system resilience.
Participants will gain valuable insights into Wix's strategies for ensuring atomicity in database updates and event production, as well as caching, materialization, and performance optimization techniques within a distributed system.
Join us to discover how Wix has mastered the art of balancing simplicity and extensibility, and learn how the re-adoption of the modest CRUD has turbocharged their development velocity, resilience, and scalability in a high-growth environment.
How to Position Your Globus Data Portal for Success Ten Good PracticesGlobus
Science gateways allow science and engineering communities to access shared data, software, computing services, and instruments. Science gateways have gained a lot of traction in the last twenty years, as evidenced by projects such as the Science Gateways Community Institute (SGCI) and the Center of Excellence on Science Gateways (SGX3) in the US, The Australian Research Data Commons (ARDC) and its platforms in Australia, and the projects around Virtual Research Environments in Europe. A few mature frameworks have evolved with their different strengths and foci and have been taken up by a larger community such as the Globus Data Portal, Hubzero, Tapis, and Galaxy. However, even when gateways are built on successful frameworks, they continue to face the challenges of ongoing maintenance costs and how to meet the ever-expanding needs of the community they serve with enhanced features. It is not uncommon that gateways with compelling use cases are nonetheless unable to get past the prototype phase and become a full production service, or if they do, they don't survive more than a couple of years. While there is no guaranteed pathway to success, it seems likely that for any gateway there is a need for a strong community and/or solid funding streams to create and sustain its success. With over twenty years of examples to draw from, this presentation goes into detail for ten factors common to successful and enduring gateways that effectively serve as best practices for any new or developing gateway.
Listen to the keynote address and hear about the latest developments from Rachana Ananthakrishnan and Ian Foster who review the updates to the Globus Platform and Service, and the relevance of Globus to the scientific community as an automation platform to accelerate scientific discovery.
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamtakuyayamamoto1800
In this slide, we show the simulation example and the way to compile this solver.
In this solver, the Helmholtz equation can be solved by helmholtzFoam. Also, the Helmholtz equation with uniformly dispersed bubbles can be simulated by helmholtzBubbleFoam.
SOCRadar Research Team: Latest Activities of IntelBrokerSOCRadar
The European Union Agency for Law Enforcement Cooperation (Europol) has suffered an alleged data breach after a notorious threat actor claimed to have exfiltrated data from its systems. Infamous data leaker IntelBroker posted on the even more infamous BreachForums hacking forum, saying that Europol suffered a data breach this month.
The alleged breach affected Europol agencies CCSE, EC3, Europol Platform for Experts, Law Enforcement Forum, and SIRIUS. Infiltration of these entities can disrupt ongoing investigations and compromise sensitive intelligence shared among international law enforcement agencies.
However, this is neither the first nor the last activity of IntekBroker. We have compiled for you what happened in the last few days. To track such hacker activities on dark web sources like hacker forums, private Telegram channels, and other hidden platforms where cyber threats often originate, you can check SOCRadar’s Dark Web News.
Stay Informed on Threat Actors’ Activity on the Dark Web with SOCRadar!
Globus Connect Server Deep Dive - GlobusWorld 2024Globus
We explore the Globus Connect Server (GCS) architecture and experiment with advanced configuration options and use cases. This content is targeted at system administrators who are familiar with GCS and currently operate—or are planning to operate—broader deployments at their institution.
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...Anthony Dahanne
Les Buildpacks existent depuis plus de 10 ans ! D’abord, ils étaient utilisés pour détecter et construire une application avant de la déployer sur certains PaaS. Ensuite, nous avons pu créer des images Docker (OCI) avec leur dernière génération, les Cloud Native Buildpacks (CNCF en incubation). Sont-ils une bonne alternative au Dockerfile ? Que sont les buildpacks Paketo ? Quelles communautés les soutiennent et comment ?
Venez le découvrir lors de cette session ignite
Quarkus Hidden and Forbidden ExtensionsMax Andersen
Quarkus has a vast extension ecosystem and is known for its subsonic and subatomic feature set. Some of these features are not as well known, and some extensions are less talked about, but that does not make them less interesting - quite the opposite.
Come join this talk to see some tips and tricks for using Quarkus and some of the lesser known features, extensions and development techniques.
Unleash Unlimited Potential with One-Time Purchase
BoxLang is more than just a language; it's a community. By choosing a Visionary License, you're not just investing in your success, you're actively contributing to the ongoing development and support of BoxLang.
apsis - Automatic Hyperparameter Optimization Framework for Machine Learning
1. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
apsis
Automated Hyperparameter Optimization Using Bayesian
Optimization
Frederik Diehl Andreas Jauch
March 10, 2015
State March 10, 2015
2. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
AGENDA
Problem Description
Bayesian Optimization
apsis and its Architecture
Project Organisation
Performance Evaluation
3. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
MOTIVATION
Why Hyperparameter Optimization and why automating it?
hyperparameter tuning often leads to huge performance
gain
”more of an art than a science”
reproducibility of published results
automatic methods might be better than humans
provide ml algorithms to non-expert users
4. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
ML PROCESS OVERVIEW
5. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
FORMAL PROBLEM DESCRIPTION
λ : hyperparameter vector λ = (λ(1)
, ..., λ(n)
)
L(X, f) : loss function evaluated for model f and dataset x
Aλ(X) : learning algorithm with hyperparameter vector λ
learning on dataset X
Xtrain : training data, Xvalid : validation data, Xtest : test data
Ψ(λ) : hyperparameter response function/surface
Hyperparameter Optimization Problem
ˆλ ≈ argmin
λ Λ
mean
Xi∈Xvalid
(L(xi, Aλ(Xtrain))
Ψ(λ)
(1)
= argmin
λ Λ
(Ψ(λ)) (2)
6. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
KEY PROBLEM PROPERTIES
unknown, probably non-convex response surface Ψ
no derivative-based optimization possible
every evaluation of Ψ is expensive
evaluation time of Ψ depends on individual value of λ
low effective dimensionality of Ψ
which dimensions are important is dataset dependent
tree-structured configuration space1
1
not addressed in apsis yet
7. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
STATE OF THE ART
optimization still manual in many projects
grid search most common method
often the only provided method by many ml frameworks
random search
bayesian optimization
code of Jasper Snoek et. al. Harvard/Toronto
whetlab - bay opt in the cloud
8. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
BAYESIAN OPTIMIZATION
approximate Ψ(λ) by a surrogate function M(λ) = y
surrogate function cheaper to evaluate than Ψ
interpret model to find minimization candidates for Ψ
evaluate Ψ for promising candidates
9. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
BAYESIAN OPTIMIZATION FUNDAMENTALS
We need two design choices
Surrogate Modelling Function - Gaussian Processes
universal approximation
very flexible and have many useful properties
closed under sampling
Acquisition Function
Probability of Improvement
Expected Improvement
10. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
ACQUISITION FUNCTION u
measures the expected utility of evaluating the objective
function at a point λnext
exploitation vs. exploration trade-off
(picture adopted from [1])
11. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
OPTIMIZATION - SUCCESSIVELY UPDATING THE GP
1. find
max
λ
(u(λ)) = λnext
max of acquisition
2. Evaluate M at λnext
3. Update the GP
(picture adopted from [1])
12. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
FITTING THE GP TO THE PROBLEM
by tuning the covariance!
Squared Exponential Kernel
KSE(λ, λ ) = exp −
1
2l2
·
1..dim(D)
(λd − λd)2
use Automatic Relevance Determination (ARD)
KSE(λ, λ ) = θ0 · exp
−
1
2
·
1..dim(D)
1
θd
2
(λd − λd)2
with ARD vector θ
θ = ( θ0
bias
, θ1, . . . , θd
dimension weights
)
13. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
HOW DO ACQUISITION FUNCTIONS LOOK LIKE?
Expected Improvement (EI)
uEI(λ) =
−∞
∞
max(Ψ(λ∗
) − y, 0) · pM(y|λ) dy
closed form solution for GPs available
uEI(λ|Mt) = σ(λ) ·
f(λ∗) − µ(λ)
σ(λ)
· Φ(λ) + φ(λ)
gradient analytically derived in apsis for more effective
optimization
14. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
THE apsis TOOLKIT
Automated Hyperparameter Optimization Framework for
random search
bayesian optimization
as an open source framework featuring
flexible architecture, ready to be extended for more
optimizers
ready for use with scikit-learn and theano
implemented in Python
15. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
PROJECT OBJECTIVES
open source implementation of state of the art research in
bayesian optimization
extendible project to encourage collaboration with other
researchers
easy integration with existing machine learning
frameworks
multi core support
16. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
apsis ARCHITECTURE OVERVIEW
17. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
apsis CORE MODEL COMPONENTS
Parameter Definitions
define the meta information for each hyperparameter
Candidates
represent a specific hyperparameter vector and its value
holds function value if available
Experiments
represent an optimization object
keeps track of finished and unfinished Candidates
18. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
USING apsis - EXPERIMENT ASSISTANTS
single experiment interaction interface
provides plots and result bookkeeping
19. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
USING apsis - LAB ASSISTANTS
multiple experiments to compare different optimization
techniques
cross validation
20. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
EXTENSIVE EXPERIMENT TRACKING
automated plot writing
automated results writing
write out information at every step
21. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
AUTOMATED PLOTTING
plot function evaluations and best results
plot confidence bars when using cross validation
write out plots at every step
22. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
PARAMETER REPRESENTATION IN apsis
different representation by parameter type
various nominal and numeric types
23. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
NUMERIC PARAMETERS IN apsis
Warping Mechanism
parameters are warped into [0, 1] interval
optimization core can assume a uniform and equal
distribution in [0, 1] space
warping can be user defined
Provided Warpings for
normalization of arbitrary intervals [a, b] into [0, 1] space.
asymptotic parameters, e.g. learning rate asymptotic at 0
24. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
NOMINAL PARAMETERS IN apsis
generally supported in apsis
no support in Bayesian Optimization
GP kernels based on distance metrics between parameters
interesting topic for further research
no publications on this topic so far
whetlab pretends to deal well with them - but doesn’t say
how
25. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
EXPECTED IMPROVEMENT OPTIMIZATION
gradient analytically derived2
1000 Steps Random Search for Initialization
Several iterative optimization methods integrated
L-BFGS-B Bounded Low Memory Quasi Newton Method
BFGS Quasi Newton Method
Nelder-Mead
Inexact Newton with Conjugate Gradient Solver
...
2
See our paper for derivation.
26. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
DEALING WITH GP HYPERPARAMETERS
the GP surrogate model introduces new hyperparameter
not subject of optimization ⇒ hyper-hyperparameters
optimization by maximum likelihood method
integrating over these parameters in the acquisition
function using Hybrid Monte Carlo sampling
27. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
apsis PROJECT SET UP
Open-Source project from the beginning
MIT-License
active issue tracking
PEP-8 code styling convention
Fully automated sphinx documentation build on every
commit
90% test coverage
clear commit messages
28. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
apsis GITHUB REPOSITORY
Check out http://github.com/FrederikDiehl/apsis !
29. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
ISSUE TRACKING AND DISCUSSION
30. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
UNIT TESTS
90% overall test coverage
100% in most core components
31. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
GOOD CODE DOCUMENTATION
almost 50:50 ratio of code vs. doc
documented according to sphinx standard
32. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
FULLY AUTOMATED DOCUMENTATION BUILD
builds on every commit
Visit http://apsis.readthedocs.org !
33. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
BRANIN HOO OPTIMIZATION
Best Value by Optimizer
Values by Optimizer
random search finds better end result but bay opt is more
stable
similar performance as in other bay opt literature
no other group publishes comparison to random search
34. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
OPTIMIZING ARTIFICIAL NOISE FUNCTION
One dimensional noise function with several smoothing
variances
35. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
OPTIMIZING ARTIFICIAL NOISE FUNCTION
Minimization result on 3d noise by smoothing factor.
36. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
BREZE MNIST NEURAL NETWORK
Neural Network on MNIST using uniform parameters
37. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
BREZE MNIST NEURAL NETWORK (2)
Neural Network on MNIST using asymptotic parameters for
learning rate and learning rate decay
38. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
TAKING THE PROJECT TO THE NEXT LEVEL -
PROGRAM
implement full multicore support
implement a REST web-service to offer interoperability
with any language
improve integration of matplotlib
39. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
TAKING THE PROJECT TO THE NEXT LEVEL -
BAYESIAN OPTIMIZATION
deal with nominal parameters
try replacing GPs with Student-t processes
try to take tree structured configuration space into account
account for evaluation cost depending on hyperparameter
setting
implement freeze-thaw optimization idea [3]
automated learning of input warping [2]
40. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
Thank You!
41. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
REFERENCES I
Eric Brochu, Vlad M. Cora, and Nando de Freitas.
A Tutorial on Bayesian Optimization of Expensive Cost
Functions, with Application to Active User Modeling and
Hierarchical Reinforcement Learning.
IEEE Transactions on Reliability, abs/1012.2, 2010.
Jasper Snoek, Kevin Swersky, Richard S Zemel, and Ryan P
Adams.
Input warping for bayesian optimization of non-stationary
functions.
arXiv preprint arXiv:1402.0929, 2014.
Kevin Swersky, Jasper Snoek, and Ryan Prescott Adams.
Freeze-thaw bayesian optimization.
arXiv preprint arXiv:1406.3896, 2014.
42. Problem Description Bayesian Optimization apsis and its Architecture Project Organisation Performance Evaluation
REFERENCES II