Presentation delivered during Data Science Rzeszow meetup:
I will present reasons for optimization being superior to predictive algorithms in data science practical applications. I will cover exemplary case studies, tools and hints from my experience on delivering hybrid solutions that exploit both prediction and optimization
Always Be Deploying. How to make R great for machine learning in (not only) E...Wit Jakuczun
The presentation I delivered at WhyR 2019.
Abstract:
For many years software engineers have put enormous effort to develop best practices to deliver stable and maintainable software. How R users can benefit from this experience? I will try to answer this question going through several concepts and tools that are natural for software engineers but are often undervalued by R users.
I will start with a description of the deployment process because this is the ultimate step that exposes all weaknesses. You will learn about structuring R project, using abstractions to manage model’s features, automating models building process, optimizing the performance of the solution and the challenges of the deployment process itself.
Driving your marketing automation with multi-armed bandits in real timeWit Jakuczun
Presentation delivered at Big Data Tech Warsaw 2019 by me and Maciej Próchniak from TouK.
Multiarmed bandits vs simple A/B testing. Architecture of solution – how to connect Flink, Nussknacker and R? Other uses cases – what are other good fits for similar architecture.
We observe that many of our customers are actively adapting various marketing automation solutions. While most of them offer some basic A/B testing modules they are often too simple for highly dynamic conditions. Better outcomes can be achieved using e.g. multiarmed bandits algorithms, however, it’s not so straightforward to deploy them in a realtime production environment.
In our presentation, we will use a platform based on Apache Flink, Nussknacker – our custom GUI and R Studio + R Suite – everything deployed on Kubernetes.The main goal of our talk is to show how using proposed tools we can create complete flow – from model creation, through deployment and reinforcement learning – that helps to automate marketing communication without the need for custom code development.
The talk is partially based on our former deployments of similar solutions, many ideas are new, however.
Continuous Delivery for Machine LearningThoughtworks
Your Data Scientists or Machine Learning experts have developed a machine learning model which runs perfectly in your notebook? Now you want to deploy it into corporate IT to let it run “in the wild”. And a bunch of new problems comes up: How to integrate the work of your data scientists and machine learning experts into the development processes like CI/CD of your corporate IT? How to prevent a “throw-it-over-the-fence” mentality? How to test, monitor and continuously improve your machine learning application “in the wild”?
In a compact workshop, we will discuss the new challenges of integrating machine learning approaches in modern IT development processes and demonstrate our “Continuous Delivery for Machine Learning” (CD4ML) methodology with some live coding examples.
EuroSTAR Software Testing Conference 2008 presentation on SOA - Agile or Fragile? by Colin Robb. See more at conferences.eurostarsoftwaretesting.com/past-presentations/
Driving Innovation with Kanban at Jaguar Land RoverLeanKit
Find out how Kanban is accelerating product design and development at Jaguar Land Rover.
Watch the recorded webinar here: https://vimeo.com/172780037
Hamish McMinn, Automotive and IT Project Manager, will explain how Kanban is improving time, cost and quality across new vehicle development projects at Jaguar Land Rover.
You'll learn:
-Why new product development provides rich opportunities for continuous process improvement.
-Benefits and challenges of transferring agile software techniques to hardware design and development.
-How to visualize work, focus on flow and increase cross-functional collaboration using LeanKit.
Hamish will share learnings from the initial pilot project, and how Kanban is now being scaled across multiple engineering teams.
Fix-Price Projects And Agile – PyCon SettePeter Bittner
You are a digital agency struggling with your Django projects. You’re over budget and you’ve run out of time, that’s the norm not the exception. And of course you promise to deliver all features on time for a fixed budget, don’t you? – And nobody told you this is a problem?
See the original presentation at http://slides.com/bittner/pycon7-fix-price-projects-and-agile
Always Be Deploying. How to make R great for machine learning in (not only) E...Wit Jakuczun
The presentation I delivered at WhyR 2019.
Abstract:
For many years software engineers have put enormous effort to develop best practices to deliver stable and maintainable software. How R users can benefit from this experience? I will try to answer this question going through several concepts and tools that are natural for software engineers but are often undervalued by R users.
I will start with a description of the deployment process because this is the ultimate step that exposes all weaknesses. You will learn about structuring R project, using abstractions to manage model’s features, automating models building process, optimizing the performance of the solution and the challenges of the deployment process itself.
Driving your marketing automation with multi-armed bandits in real timeWit Jakuczun
Presentation delivered at Big Data Tech Warsaw 2019 by me and Maciej Próchniak from TouK.
Multiarmed bandits vs simple A/B testing. Architecture of solution – how to connect Flink, Nussknacker and R? Other uses cases – what are other good fits for similar architecture.
We observe that many of our customers are actively adapting various marketing automation solutions. While most of them offer some basic A/B testing modules they are often too simple for highly dynamic conditions. Better outcomes can be achieved using e.g. multiarmed bandits algorithms, however, it’s not so straightforward to deploy them in a realtime production environment.
In our presentation, we will use a platform based on Apache Flink, Nussknacker – our custom GUI and R Studio + R Suite – everything deployed on Kubernetes.The main goal of our talk is to show how using proposed tools we can create complete flow – from model creation, through deployment and reinforcement learning – that helps to automate marketing communication without the need for custom code development.
The talk is partially based on our former deployments of similar solutions, many ideas are new, however.
Continuous Delivery for Machine LearningThoughtworks
Your Data Scientists or Machine Learning experts have developed a machine learning model which runs perfectly in your notebook? Now you want to deploy it into corporate IT to let it run “in the wild”. And a bunch of new problems comes up: How to integrate the work of your data scientists and machine learning experts into the development processes like CI/CD of your corporate IT? How to prevent a “throw-it-over-the-fence” mentality? How to test, monitor and continuously improve your machine learning application “in the wild”?
In a compact workshop, we will discuss the new challenges of integrating machine learning approaches in modern IT development processes and demonstrate our “Continuous Delivery for Machine Learning” (CD4ML) methodology with some live coding examples.
EuroSTAR Software Testing Conference 2008 presentation on SOA - Agile or Fragile? by Colin Robb. See more at conferences.eurostarsoftwaretesting.com/past-presentations/
Driving Innovation with Kanban at Jaguar Land RoverLeanKit
Find out how Kanban is accelerating product design and development at Jaguar Land Rover.
Watch the recorded webinar here: https://vimeo.com/172780037
Hamish McMinn, Automotive and IT Project Manager, will explain how Kanban is improving time, cost and quality across new vehicle development projects at Jaguar Land Rover.
You'll learn:
-Why new product development provides rich opportunities for continuous process improvement.
-Benefits and challenges of transferring agile software techniques to hardware design and development.
-How to visualize work, focus on flow and increase cross-functional collaboration using LeanKit.
Hamish will share learnings from the initial pilot project, and how Kanban is now being scaled across multiple engineering teams.
Fix-Price Projects And Agile – PyCon SettePeter Bittner
You are a digital agency struggling with your Django projects. You’re over budget and you’ve run out of time, that’s the norm not the exception. And of course you promise to deliver all features on time for a fixed budget, don’t you? – And nobody told you this is a problem?
See the original presentation at http://slides.com/bittner/pycon7-fix-price-projects-and-agile
When we design together - Sabrina Mach, Ammara Gafoor and James Emmott
From three distinct perspectives, this talk will contend that design is an activity undertaken by everyone in a software development team. It occurs throughout the process of delivery — not only at the beginning or the end — and it is a powerful instrument for learning about and adapting to the problems our work seeks to solve, which is a shared responsibility. Making the best use of our multidisciplinary expertise in the activity of design requires forms of collaboration that are too often disrupted by the role-based silos that keep us separated and weaken the valuable contribution our diverse approaches could make to our collective efforts. If you care about accelerating time to market, improving customer experience, or building happy and productive teams, you will want to know why and how it matters that we believe ‘design is in everything that we do’.
Why change code that works - On Technical Debt and RefactoringCarsten Windler
Why should you change code that works? In this presentation we'll cover why Technical debt is harmful, how you deal with existing debt and how you can prevent to pile up new debt.
Structured Authoring for Business-Critical ContentLavaCon
By Jason Aiken, Quark
For years DITA has armed technical documentation professionals with a componentized approach to content that overcomes the many challenges caused by stand-alone, static documents created in silos. The problem, however, is that there is so much other business-critical content out there that could benefit from a structured approach to authoring.
In this session you will learn:
--The definition of business-critical content and understand just how much exists.
--How expansive the market opportunity is for helping non-technical authors transform their businesses with structured authoring.
--What it takes for a non-technical author to adopt a structured authoring tool.
--Why it is critical for technical documentation experts to translate their best practices into solutions that non-technical content creators can apply to business-critical content.
When you're responsible for testing, it's almost a given that you will find yourself in a situation in which you feel alone and out in the cold. Management’s commitment for testing might be lacking, your colleagues in the project might be ignoring you, your team members might lack motivation, or the automated testing you had planned is more complicated and difficult than you anticipated. You feel you can't test enough, and you will be blamed for post-release quality problems. Hans Buwalda shares a number of chilly situations and offers suggestions for overcoming them, based on his experiences worldwide in large projects. Specifically, Hans focuses on management commitment, politics, project dependencies, managing expectations, motivating team members, testing and automation difficulties, and dealing with overwhelming numbers of day-to-day problems. Take away more than forty-five tips and approaches to use when temperatures drop on you.
Thoughts on productivity in software developmentMichael Vax
This talks explores different aspects of productivity in developing software.
- What is productivity and how to measure it
- Productivity on multiple levels
- Individual
- Team
- Organization
Patterns & Anti-patterns
ML Products have become a prolific and integral part of taking the insights of Data Science from theory to reality. Oddly though, the path from conception to implementation is often unclear with seemingly few similar examples to work from. The result is often a sea of agony between sliding deadlines, heroic efforts of people working though unforeseen challenges and haphazard innovation. Each time a beautiful model makes its impact on the business bottom line, something worked. In this talk we present the ML Playbook. It pulls together the best aspects from a variety of successful ML Product launches into a cohesive strategy to Plan, Build, Test, Learn, and Release ML Products. We'll demonstrate the ML Playbook in action with the story of launching an alert monitoring product for the world's most powerful jet engines, the GE90-115B.
Testing Transformation: The Art and Science for SuccessTechWell
Technologies, testing processes, and the role of the tester have evolved significantly in the past few years with the advent of agile, DevOps, and other new technologies. It is critical that we testing professionals evaluate ourselves and continue to add tangible value to our organizations. In your work, are you focused on the trivial or on real game changers? Jennifer Bonine describes critical elements that help you artfully blend people, process, and technology to create a synergistic relationship that adds value. Jennifer shares ideas on mastering politics, maneuvering core vs. context, and innovating your technology strategies and processes. She explores how new processes can be introduced in an organization, what the role of organizational culture is in determining the success of a project, and how you can know what tools will add value vs. simply adding overhead and complexity. Jennifer reviews critically needed tester skills and discusses a continual learning model to evolve your skills and stay relevant. This discussion can lead you to technologies, processes, and skills you can stake your career on.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2021/09/may-2021-embedded-vision-summit-opening-remarks-may-26/
Jeff Bier, Founder of the Edge AI and Vision Alliance, welcomes attendees to the May 2021 Embedded Vision Summit on May 26, 2021. Bier provides an overview of the edge AI and vision market opportunities, challenges, solutions and trends. He also introduces the Edge AI and Vision Alliance and the resources it offers for both product creators and members, and reviews the day’s agenda and other logistics.
NUS-ISS 4th Agile Community of Practice (CoP): Agile Development with Agile Contract, by Mr Steven Koh, Assistant Director (Agile Coach), IDA-Government Digital Services (GDS)
My presentation in Agile4U (Agile for University) program of HanoiScrum in 2013.
This presentation may have some customised content for University of Science and Technology of Ha Noi.
Evidence Based Management - Measuring value to enable improvement and agilityScrum Australia Pty Ltd
by Mia Horrigan
Organisations invest in agile processes, tools, training, and coaching, but how much are they getting back?
Has product delivery improved?
How much happier are users and the business customers?
Are employees empowered and enabled?
Traditional metrics might give you insight into improvements of operational efficiency, but the real conversation is about the value created for your organisation by the improved processes. Without measuring value, the success of any agile initiative is based on nothing more than intuition and assumption.
Mia will discuss Evidence based management and how this empirical process can help agile transformations measure and manage the value derived from the transformation initiative. Mia will focus on the 4 Key Value Areas: Current Value, Ability to Innovate, Unrealised Value and time to market, and how these contribute to an organisation’s ability to deliver business value.
Despite the belief that a shared context and collaboration drives quality, too often, software testers and quality professionals struggle to find their place within today's integrated agile teams. This session is a practitioner’s view of testing and testing practices within an iterative/incremental development environment. We will begin with a discussion of some of the challenges of testing within an agile environment and delve into the guiding principles of Agile Testing and key enabling practices. Agile Testing necessitates a change in mindset, and it is as much, if not more, about behavior, as it is about skills and tooling, all of which will be explored.
Technical Debt is a gap between Computer Science and Software Engineering. Common understanding of causes for the Technical Debt is centered on the careless software development choices for the sake of speed and expediency. However Technical Debt usually goes beyond just Technology. This presentation covers the origins of Technical and Product Debt, how to manage it and mitigate it
The product is not "the product". Who owns it anyway? donato mangialardo
The business of software is not about the product really Does "P" mean Product or Project? Does it matter? We always talk about Product though... are we talking about the same Product here? Answer: "A product is something you build a sustainable business around."
When we design together - Sabrina Mach, Ammara Gafoor and James Emmott
From three distinct perspectives, this talk will contend that design is an activity undertaken by everyone in a software development team. It occurs throughout the process of delivery — not only at the beginning or the end — and it is a powerful instrument for learning about and adapting to the problems our work seeks to solve, which is a shared responsibility. Making the best use of our multidisciplinary expertise in the activity of design requires forms of collaboration that are too often disrupted by the role-based silos that keep us separated and weaken the valuable contribution our diverse approaches could make to our collective efforts. If you care about accelerating time to market, improving customer experience, or building happy and productive teams, you will want to know why and how it matters that we believe ‘design is in everything that we do’.
Why change code that works - On Technical Debt and RefactoringCarsten Windler
Why should you change code that works? In this presentation we'll cover why Technical debt is harmful, how you deal with existing debt and how you can prevent to pile up new debt.
Structured Authoring for Business-Critical ContentLavaCon
By Jason Aiken, Quark
For years DITA has armed technical documentation professionals with a componentized approach to content that overcomes the many challenges caused by stand-alone, static documents created in silos. The problem, however, is that there is so much other business-critical content out there that could benefit from a structured approach to authoring.
In this session you will learn:
--The definition of business-critical content and understand just how much exists.
--How expansive the market opportunity is for helping non-technical authors transform their businesses with structured authoring.
--What it takes for a non-technical author to adopt a structured authoring tool.
--Why it is critical for technical documentation experts to translate their best practices into solutions that non-technical content creators can apply to business-critical content.
When you're responsible for testing, it's almost a given that you will find yourself in a situation in which you feel alone and out in the cold. Management’s commitment for testing might be lacking, your colleagues in the project might be ignoring you, your team members might lack motivation, or the automated testing you had planned is more complicated and difficult than you anticipated. You feel you can't test enough, and you will be blamed for post-release quality problems. Hans Buwalda shares a number of chilly situations and offers suggestions for overcoming them, based on his experiences worldwide in large projects. Specifically, Hans focuses on management commitment, politics, project dependencies, managing expectations, motivating team members, testing and automation difficulties, and dealing with overwhelming numbers of day-to-day problems. Take away more than forty-five tips and approaches to use when temperatures drop on you.
Thoughts on productivity in software developmentMichael Vax
This talks explores different aspects of productivity in developing software.
- What is productivity and how to measure it
- Productivity on multiple levels
- Individual
- Team
- Organization
Patterns & Anti-patterns
ML Products have become a prolific and integral part of taking the insights of Data Science from theory to reality. Oddly though, the path from conception to implementation is often unclear with seemingly few similar examples to work from. The result is often a sea of agony between sliding deadlines, heroic efforts of people working though unforeseen challenges and haphazard innovation. Each time a beautiful model makes its impact on the business bottom line, something worked. In this talk we present the ML Playbook. It pulls together the best aspects from a variety of successful ML Product launches into a cohesive strategy to Plan, Build, Test, Learn, and Release ML Products. We'll demonstrate the ML Playbook in action with the story of launching an alert monitoring product for the world's most powerful jet engines, the GE90-115B.
Testing Transformation: The Art and Science for SuccessTechWell
Technologies, testing processes, and the role of the tester have evolved significantly in the past few years with the advent of agile, DevOps, and other new technologies. It is critical that we testing professionals evaluate ourselves and continue to add tangible value to our organizations. In your work, are you focused on the trivial or on real game changers? Jennifer Bonine describes critical elements that help you artfully blend people, process, and technology to create a synergistic relationship that adds value. Jennifer shares ideas on mastering politics, maneuvering core vs. context, and innovating your technology strategies and processes. She explores how new processes can be introduced in an organization, what the role of organizational culture is in determining the success of a project, and how you can know what tools will add value vs. simply adding overhead and complexity. Jennifer reviews critically needed tester skills and discusses a continual learning model to evolve your skills and stay relevant. This discussion can lead you to technologies, processes, and skills you can stake your career on.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2021/09/may-2021-embedded-vision-summit-opening-remarks-may-26/
Jeff Bier, Founder of the Edge AI and Vision Alliance, welcomes attendees to the May 2021 Embedded Vision Summit on May 26, 2021. Bier provides an overview of the edge AI and vision market opportunities, challenges, solutions and trends. He also introduces the Edge AI and Vision Alliance and the resources it offers for both product creators and members, and reviews the day’s agenda and other logistics.
NUS-ISS 4th Agile Community of Practice (CoP): Agile Development with Agile Contract, by Mr Steven Koh, Assistant Director (Agile Coach), IDA-Government Digital Services (GDS)
My presentation in Agile4U (Agile for University) program of HanoiScrum in 2013.
This presentation may have some customised content for University of Science and Technology of Ha Noi.
Evidence Based Management - Measuring value to enable improvement and agilityScrum Australia Pty Ltd
by Mia Horrigan
Organisations invest in agile processes, tools, training, and coaching, but how much are they getting back?
Has product delivery improved?
How much happier are users and the business customers?
Are employees empowered and enabled?
Traditional metrics might give you insight into improvements of operational efficiency, but the real conversation is about the value created for your organisation by the improved processes. Without measuring value, the success of any agile initiative is based on nothing more than intuition and assumption.
Mia will discuss Evidence based management and how this empirical process can help agile transformations measure and manage the value derived from the transformation initiative. Mia will focus on the 4 Key Value Areas: Current Value, Ability to Innovate, Unrealised Value and time to market, and how these contribute to an organisation’s ability to deliver business value.
Despite the belief that a shared context and collaboration drives quality, too often, software testers and quality professionals struggle to find their place within today's integrated agile teams. This session is a practitioner’s view of testing and testing practices within an iterative/incremental development environment. We will begin with a discussion of some of the challenges of testing within an agile environment and delve into the guiding principles of Agile Testing and key enabling practices. Agile Testing necessitates a change in mindset, and it is as much, if not more, about behavior, as it is about skills and tooling, all of which will be explored.
Technical Debt is a gap between Computer Science and Software Engineering. Common understanding of causes for the Technical Debt is centered on the careless software development choices for the sake of speed and expediency. However Technical Debt usually goes beyond just Technology. This presentation covers the origins of Technical and Product Debt, how to manage it and mitigate it
The product is not "the product". Who owns it anyway? donato mangialardo
The business of software is not about the product really Does "P" mean Product or Project? Does it matter? We always talk about Product though... are we talking about the same Product here? Answer: "A product is something you build a sustainable business around."
New Principles for Digital Experiences That PerformOptimizely
Whether you’re an executive, marketer, or developer, you’re likely in on the secret that digital customer experiences matter more than ever. Despite this, it can feel increasingly daunting to design the amazing experiences your customers desire — and that perform for your business.
Leaning on his decades of experience as a digital operator, Matty Wishnow, Managing Director of Experience Design & Optimization at Accenture Interactive and Founder of Clearhead, will share key principles for data-driven experience design.
Using real world examples, Matty will share how these principles enable enterprises to transform customer experiences through evidence-based design and a focus on solving the biggest problems your customers face.
How to justify technical debt mitigations in Software EngineeringAndré Agostinho
In this presentation André Agostinho e Cassio Silva covers the importance in dealing with technical debt in software engineering showing the real impacts, daily approaches and best practices for mitigations
The art of problem solving --> ensure you right the right business requiremen...Chris Lamoureux
This presentation was initially developed a couple of years ago and presented to the leadership team of a business banking area in a Global Financial Institution. It's focus was to give the practitioner some philosophical guidance on thinking through problems in the context of writing better business requirements. The goal here was to foster thinking about what problem you are solving for first before jumping into writing business requirements for project related activities
Half Double – An Entrepreneurial Approach to Agile Projects - Implement Consu...Wellingtone
The Half Double approach can help teams take ownership of their impact vision and deliver results much faster than normally. With just three core elements, Impact, Flow & Leadership, the method help teams and the key leadership stakeholders to focus on the essentials.
The presentation will contain case examples and tools from each element. As a conclusion we will discuss/ reflect on how you can unleash entrepreneurial power in your projects.
Henrik Sonnenberg presented at FuturePMO 2019 which is a 1 day PMO conference hosted by Wellingtone. www.FuturePMO.com
The Gift of Scripts: Unwrap Their Potential For Google Ads + MoreHanapin Marketing
Scripts are, well…confusing. You might have an idea that they’re awesome and useful and will make your life easier but, where do you start? Let’s turn those overwhelming feelings into excitement, because scripts are cheap, profitable, AND easy! That’s why we have experts here to walk you through the idea behind them and how to implement.
In this presentation, Hanapin’s Jacob Fairclough and Optmyzr’s Fred Vallaeys will provide you the pro tips you need to get started with scripts, provide examples of scripts to use in Google Ads, and take it a step further with some advanced strategies as well. Don’t miss it!
You’ll learn:
- What scripts are, and why they are useful for automation in Google Ads
- Why they are better than the API solution
- Creative and unique ideas of how to use scripts
Recording available here: https://youtu.be/zZVoo5AbANI
As technologists, we love to build things. And we sometimes forget that our customers (or potential customers) don’t care about what we’re building-- they care about what they’re building, doing, or feeling. In this talk, we’ll explore methodologies that help us continually focus on our customers’ needs, building just enough to learn and iterate towards their desired outcomes.
This SolidWorks World 2007 presentation from Paul Gimbel of Razorleaf Corporation focuses on preparing your company, your engineering design process, and your SolidWorks models for design automation.
Rethinking an organization in an Agile manner is a challenge that affects every organizational aspects and is surrounded by risks that must be appropriately managed.
Beyond the used methodologies and frameworks, the goal is always to develop a mindset that allows the organization to " stand on their own feet" and embrace antifragility.
In this talk we will describe a concrete transformation experience in a company working on the medical sector, with the operational office in Italy, and how it has been completely revolutionized. We will talk about successful changes and the less fortunate experiments, how the company developed its Way of Working (WoW) in agile manner, even going so far as to reorganize of the internal physical spaces. We will also take a look at how the aspects of the Program were developed: from the Portfolio to the Risk Management System, up to the revision of the Quality procedures.
Challenges are consistent in Big Data environments; resource-intensive processes, unwieldy time commitments, and challenging variations in infrastructure. Big Data has grown so large that traditional data analysis and management solutions are too slow, too small and too expensive to handle it. Many companies are in the discovery stage of evaluating the best means of extracting value from it. This Enterprise Tech Journal interview with Kevin Goulet, VP Product Management, CA Technologies, explores the challenges of Big Data, the approach to resolving them. With Big Data environments, the challenges are consistent – resource-intensive processes, unwieldy time commitments, and challenging variations in infrastructure. For more information visit http://www.ca.com/us/products/detail/business-intelligence-and-big-data-management.aspx?mrm=425887
Know your R usage workflow to handle reproducibility challengesWit Jakuczun
R is used in a vast ways. From pure ad-hoc by hobbysts to an organized and structured way in an enterprise. Each way of R usage brings different reproducibility challenges. Going through range of typical workflows we will show that understanding reproducibility must start with understanding your workflow. Presenting workflows we will show how we deal reproducibiilty challenges with open-source R Suite (http://rsuite.io) solution developed by us to support our large scale R development.
Large scale machine learning projects with r suiteWit Jakuczun
Agenda for the workshop I conducted at ML@Enterprise conference that took place on 14th of December 2017 in Warsaw.
Machine Learning is not only about algorithms. Machine learning is about value and this can be achieved only after proper deployment of Machine Learning solutions. I will present best practices regarding managing R based ML projects. I will use our open-source tool R Suite (http://rsuite.io/). During the workshop I will talk about:
– project structure
– development cycle
– deployment
– test
Managing large (and small) R based solutions with R SuiteWit Jakuczun
The presentation I gave at DataMass Gdańsk Summit in 2017:
R is a great tool for data scientist. Being very dynamic and popular is now one of the most important technology on the market. Unfortunately out-of-the-box R is not suited for large scale applications. I will present R Suite that is an open-source solution developed by us for us to manage R development process.
20170928 why r_r jako główna platforma do zaawansowanej analityki w enterpriseWit Jakuczun
Presentation (in polish) I gave at WhyR conference in Warsaw. The abstract:
The world of hermetic analytical platforms is slowly becoming history. Today, advanced analytics is being pushed forward by the open-source world supported by the biggest players. In various discussions R's maturity is being questioned if Enterprise point of view is considered. Based on the R deployment in large telecom, I will tell why I claim R can be number one in advanced analytics in any large corporation. I will show what virtues and vices of migrating to R.
Prezentacja z Data Science Summit 2017:
R spowodował wywrócenie świata analityki. Widzą to duzi gracze jak np. Microsoft czy Oracle. Ale powstaje pytanie jak nowoczesność i zmienność R przełożyć na wartość w stabilnym świecie Enterprise? Ile to kosztuje czasu i pieniędzy? I jak to zrobić bezpiecznie? Odpowiem na te pytania na podstawie wdrażania R w dużym telekomie.
Case Studies in advanced analytics with RWit Jakuczun
A talk I gave at SQLDay 2017:
About 1,5 years ago Microsoft finalised acquisition of Revolution Analytics – a provider of software and services for R. In my opinion this was one of the most important event for R community. Now it is crucial to present its capabilities to SQL Server community. It will be beneficial for both parties. I will present three case studies: cash optimisation in Deutsche Bank, midterm model for energy prices forecasting, workforce demand optimising. The case studies were implemented with our analytical workflow R Suite that will be also shortly presented.
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...pchutichetpong
M Capital Group (“MCG”) expects to see demand and the changing evolution of supply, facilitated through institutional investment rotation out of offices and into work from home (“WFH”), while the ever-expanding need for data storage as global internet usage expands, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
2. Once upon a time Wit met a Customer that needed
demand forecasts to ...
3. Customer: I need better demand forecasts.
Me: I understand. Can I have a simple question?
Customer: Yes
Me: Imagine I created a demand forecasting model and provided you with
desired 5M forecasts (numbers). What are you going to do with them?
Customer: Well… I will take the forecasts and optimize my logistics decisions using
the numbers.
Me: I see. Why don’t we talk about the whole decision problem? Maybe the
inefficiency is not in demand forecasts but in optimization part?
Customer: Can you create a math model for such complex business problem with
many constraints and exceptions? I thought it was impossible.
Recommendations are calculated in “Sheet” and it is a bottleneck.
Me: It is possible to build decision support system that uses mathematical
optimization for your problem
Customer: Great, let’s talk about the details.
4. What has happened in the past?
What is an optimal course of
actions for the past?
What can happen in the future?
What is an optimal course of
actions for the future?
DataAnalyticsOptimization
7. Forecasts/predictions are “just” a tool for better decisions.
And better decisions are based on right recommendations.
And right recommendations are result of optimizing business KPIs that are linked
to business decisions.
Optimization model deals helps to improve robustness of decision making
process robust.
8. Automation of the complex business process.
Transition to central/global planning.
Learning from best (optimal) decision.
9. I was selling vehicle routing solution to a logistics company.
I managed to persuade manager/owner to meet and talk with the team.
After 1 hour presentation of the solution to the company I got one question
How can I create an invoice in this solution?
This is real story that happened to me. Similar story can be read in “Being wrong with Clarke & Wright” by Robert E.D.Woolsey
10.
11. There are three commercial highly efficient solvers for mixed integer problems.
Solutions have been on the market for 25+ years…. and are still in development.
Very resistant to parallelization and distributed computing techniques.
Very sensitive to data.
Tightly coupled with business.
Only “auto” for mip.
12. There is only one good library for mixed integer programming that is open-source.
14. Most common ML libs are open-source.
Open-source is very efficient compared to commercial solutions.
Easy to create distributed implementations
Fairly insensitive to data.
Less tightly coupled with business.
Quite a few AutoML solutions that work.
17. Mixed-integer
programming
black-box
hard to customize
limited applicability
(still wide!)
Constraint
programming
white-box
easy to customize
not limited applicability
Metaheuristics
custom-box
easy to customize
not limited applicability
Easy Difficult
18. Mixed Integer Programming
IBM CPLEX
Gurobi
Fico
Local Solver
Constraint programming
Sicstus
IBM CP Optimizer
Commercial world
Mixed Integer Programming
CBC (solver)
MIP (wrapper)
Or-tools (wrapper)
Constraint programming
ECLiPSe
Choco
Gecode
Or-tools
Open-source world
19. Easy
I can declare model using existing solver.
Fairly difficult
I can solve problem with a sequence of easy problems.
Very Difficult
I must implement custom solver.
20. Business requirements are almost impossible to be collected upfront.
Performance is not satisfactory.
Solution quality is not satisfactory.
No solution found is not acceptable.
22. Start with business process and decisions
Start with small and iterate.
Use real data since the first day of the project.
Assume problems are infeasible or internally contradictor.
Deal with must vs nice to have requirements.
23.
24. Saving up to 20% of cash management costs in Deutsche Bank
25. Challenge
• Factory throughput was too low
• Upgrade or not to increase throughput
Solution
• Integrated planning and scheduling optimization model
• Scenario generation to support investment decision
• Tailor made optimisation model
Effects
• Ability to support investment decision with numbers
Based on academic work by Roman Barták
26. Challenge
• Dynamic and unpredictable orders flow
• Complex tasks
Solution
• Automation by optimisation
• Tailor made optimisation model
Effects
• In progress - feasibility tests of the
working solution
Collect Pay Drive Deliver Drive
Collect Pay Drive Deliver Drive
Collect Pay Drive Deliver Drive