Despite huge progress in machine learning over the past decade, building production-ready machine learning systems is still hard. Three years ago, when we set out to build machine learning capabilities into the Salesforce platform, we learned that building enterprise-scale machine learning systems is even harder. To solve the problems we encountered, we built TransmogrifAI (https://transmogrif.ai, pronounced trans-mog-ri-phi), an end-to-end automated machine learning library for structured data that is used in production today to help power our Salesforce Einstein AI platform. This talk highlights key capabilities of the TransmogrifAI library and demonstrates them in action on a real-life machine learning application.
Meet TransmogrifAI, Open Source AutoML That Powers Einstein Predictions
1. Meet TransmogrifAI, Open Source AutoML That Powers Einstein Predictions
Matthew Tovbin, Principal Engineer, Einstein
mtovbin@salesforce.com, @tovbinm
2. Forward Looking Statement
Statement under the Private Securities Litigation Reform Act of 1995:
This presentation may contain forward-looking statements that involve risks, uncertainties, and assumptions. If any such uncertainties
materialize or if any of the assumptions proves incorrect, the results of salesforce.com, inc. could differ materially from the results
expressed or implied by the forward-looking statements we make. All statements other than statements of historical fact could be deemed
forward-looking, including any projections of product or service availability, subscriber growth, earnings, revenues, or other financial items
and any statements regarding strategies or plans of management for future operations, statements of belief, any statements concerning
new, planned, or upgraded services or technology developments and customer contracts or use of our services.
The risks and uncertainties referred to above include – but are not limited to – risks associated with developing and delivering new
functionality for our service, new products and services, our new business model, our past operating losses, possible fluctuations in our
operating results and rate of growth, interruptions or delays in our Web hosting, breach of our security measures, the outcome of any
litigation, risks associated with completed and any possible mergers and acquisitions, the immature market in which we operate, our
relatively limited operating history, our ability to expand, retain, and motivate our employees and manage our growth, new releases of our
service and successful customer deployment, our limited history reselling non-salesforce.com products, and utilization and selling to larger
enterprise customers. Further information on potential factors that could affect the financial results of salesforce.com, inc. is included in
our annual report on Form 10-K for the most recent fiscal year and in our quarterly report on Form 10-Q for the most recent fiscal quarter.
These documents and others containing important disclosures are available on the SEC Filings section of the Investor Information section
of our Web site.
Any unreleased services or features referenced in this or other presentations, press releases or public statements are not currently
available and may not be delivered on time or at all. Customers who purchase our services should make the purchase decisions based
upon features that are currently available. Salesforce.com, inc. assumes no obligation and does not intend to update these
forward-looking statements.
4. Machine Learning is Hard and Even Harder for the Enterprise
Lessons our Data Scientists Learned while Building Einstein
1. Customer-specific models beat global models
2. Majority of business data is structured
3. Too many use cases, too few data scientists
5. 1. Customer-specific Models Beat Global Models
● Customers care about data privacy
● Every customer’s data is different
Enterprise Machine Learning
6. 2. Majority of Business Data is Structured
https://www.kaggle.com/surveys/2017
8. 3. Too Many Use Cases, Too Few Data Scientists
ML is exponentially harder in the Enterprise with many, customer-specific models
[Diagram: the same pipeline repeated once per customer-specific model: Data Prep → Feat. Eng → Feat. Selection → Model Training → Model]
10. Introducing TransmogrifAI
Automated Machine Learning for Structured Data
● Automated feature engineering, feature selection & model selection
● ML abstractions that improve developer productivity & collaboration
● Model explainability to improve debuggability and transparency
>90% accuracy with 100x reduction in time
11. What’s in a name?
transmogrify: transform in a surprising or magical manner
12. The AutoML Engine in the Einstein Platform
5B+ predictions per day
[Diagram labels: Einstein Platform; Lead Scoring, Engagement Scoring, Case Classification, Prediction Builder, ...; Machine Learning (DL, TransmogrifAI); Compute; Orchestration; Data Store; Model Lifecycle Management; Data Science Experience; Configuration Services; Infrastructure; Metrics; Health Monitoring; ETL/GDPR/Data Processing]
13. Einstein Prediction Builder
• Product: Point. Click. Predict.
• Engineering: any customer can create any number of ML applications on any data?! Impossible!
14. Under the Hood
● Automated Feature Engineering
● Automated Feature Selection
● Automated Model Selection
16. Type Hierarchy For Machine Learning
[Diagram: feature type hierarchy rooted at FeatureType. Type and trait names shown: OPNumeric, OPCollection, OPSet, OPList, OPVector, OPMap, NonNullable, Text, Email, Base64, Phone, ID, URL, ComboBox, PickList, TextArea, City, Street, Country, PostalCode, State, Location, Integral, Real, RealNN, Binary, Percent, Currency, Date, DateTime, TextList, DateList, DateTimeList, Geolocation, MultiPickList, TextMap, RealMap, BinaryMap, IntegralMap, StateMap, Prediction, SingleResponse, MultiResponse, Categorical, ...]
Legend: bold - abstract type, regular - concrete type, italic - trait, solid line - inheritance, dashed line - trait mixin
https://developer.salesforce.com/docs/atlas.en-us.api.meta/api/field_types.htm
17. Automatic Feature Engineering
transmogrify()
[Diagram: raw features Lat Lon, Subject, Phone, Email and Age pass through transmogrify() into a single Feature Vector made of: Age [0-15], Age [15-35], Age [>35], Email Is Spammy, Top Email Domains, Country Code, Phone Is Valid, Top TF-IDF Terms, City, State]
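The idea on this slide, choosing transformations from each feature's type and concatenating the results into one vector, can be sketched in plain Python. This is not TransmogrifAI's API (the library is written in Scala). The function names, the bucket edge handling, the phone rule and the list of "top" domains are all assumptions made for illustration.

```python
from typing import Callable, Dict, List

def age_buckets(age: float) -> List[float]:
    """One-hot encode age into the slide's buckets: [0-15], (15-35], (>35)."""
    return [float(age <= 15), float(15 < age <= 35), float(age > 35)]

def phone_is_valid(phone: str) -> List[float]:
    """Toy validity rule (assumption): 10 to 15 digits after stripping punctuation."""
    digits = [c for c in phone if c.isdigit()]
    return [float(10 <= len(digits) <= 15)]

def email_domain_flags(email: str, top_domains=("gmail.com", "salesforce.com")) -> List[float]:
    """Pivot the email domain against a fixed, hypothetical list of 'top' domains."""
    domain = email.rsplit("@", 1)[-1].lower() if "@" in email else ""
    return [float(domain == d) for d in top_domains]

# Dispatch table: feature type name -> vectorizer (hypothetical type names).
VECTORIZERS: Dict[str, Callable] = {
    "Age": age_buckets,
    "Phone": phone_is_valid,
    "Email": email_domain_flags,
}

def transmogrify_row(row: dict, schema: dict) -> List[float]:
    """Concatenate the per-type vectors into a single feature vector."""
    vector: List[float] = []
    for name, type_name in schema.items():
        vector.extend(VECTORIZERS[type_name](row[name]))
    return vector

schema = {"age": "Age", "phone": "Phone", "email": "Email"}
row = {"age": 27, "phone": "+1 (415) 555-0100", "email": "jane@gmail.com"}
feature_vector = transmogrify_row(row, schema)
```

The caller only declares types in `schema`; which transformations run is decided by the dispatch table, which is the property the slide illustrates.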
21. Temporal: Circular Statistics
Circular distributions are those that have no true zero. They suit temporal features and capture seasonality:
● Hours of the Day
● Weeks of the Month
● Months of the Year
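One common way to give a model a periodic feature is to project it onto the unit circle, so that the end of a cycle sits next to its start. The sketch below shows that sine/cosine encoding in Python. It is a generic illustration of the technique, not TransmogrifAI's implementation, and the function names are hypothetical.

```python
import math

def circular_encode(value: float, period: float) -> tuple:
    """Map a periodic value onto the unit circle as (sin, cos)."""
    angle = 2.0 * math.pi * (value % period) / period
    return (math.sin(angle), math.cos(angle))

def circular_distance(a: float, b: float, period: float) -> float:
    """Euclidean distance between two encoded points on the unit circle."""
    (sa, ca), (sb, cb) = circular_encode(a, period), circular_encode(b, period)
    return math.hypot(sa - sb, ca - cb)

# 23:00 and 01:00 are two hours apart, so they should be as close as 10:00 and 12:00.
late_vs_early = circular_distance(23, 1, period=24)
mid_morning = circular_distance(10, 12, period=24)
```

With a raw hour value, 23 and 1 would look 22 units apart. After encoding, both pairs above are the same distance from each other.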
22. Automatic Feature Engineering
Numeric
● Imputation
● Track null value
● Scaling - zNormalize, log, linear
● Smart Binning
Categorical
● Imputation
● Track null value
● One Hot Encoding
● Dynamic Top K pivot
Spatial
● Reverse Geocoding
● Nearest POI
Temporal
● Time difference
● Circular Statistics
● Time extraction (day, week, month, year)
Text
● Language Detection
● Language-wise Tokenization
● Hash Encoding
● Tf-Idf
● Word2Vec
● Name Entity Resolution
● Smart Categorical
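Two of the listed steps, imputation with null tracking and a top-K pivot, can be sketched in a few lines of Python. The helper names, the mean-imputation choice and the extra "other" column are assumptions for illustration and are not taken from the library.

```python
from collections import Counter
from typing import List, Optional

def impute_with_null_flag(values: List[Optional[float]]) -> List[List[float]]:
    """Replace missing values with the mean and append an is-null indicator."""
    present = [v for v in values if v is not None]
    mean = sum(present) / len(present) if present else 0.0
    return [[mean if v is None else v, float(v is None)] for v in values]

def top_k_pivot(values: List[Optional[str]], k: int) -> List[List[float]]:
    """One-hot encode the k most frequent categories, plus 'other' and 'null' columns."""
    counts = Counter(v for v in values if v is not None)
    top = [category for category, _ in counts.most_common(k)]
    rows = []
    for v in values:
        one_hot = [float(v == category) for category in top]
        is_other = float(v is not None and v not in top)
        rows.append(one_hot + [is_other, float(v is None)])
    return rows

ages = impute_with_null_flag([20.0, None, 40.0])
stages = top_k_pivot(["won", "lost", "won", None, "stalled", "lost", "won"], k=2)
```

Keeping the null indicator as its own column lets a model learn from the fact that a value was missing, which imputation alone would hide.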
24. Problems with doing Machine
Learning on Enterprise Data
1. Hindsight Bias
2. Field Usage Changes
3. Bulk Uploads
4. Field Type Abuse
5. More...
25. Problem #1 – Hindsight Bias (aka Label Leakage)
[Figure: Lead Before Conversion vs. Lead At Conversion]
26. In layman’s terms, it is like Marty McFly traveling to the future, getting his hands on the Sports Almanac, and using it to bet on the games of the present.
28. Problem #3 – Bulk Upload by Business Workflow
A business process updated records that have a different distribution, biased towards the negative outcome
29. Problem #4 – Feature types abused
Typical Text Feature (filler text on the slide): "The quick, brown fox jumps over a lazy dog. DJs flock by when MTV ax quiz prog. ..."
‘Last Open Stage’ Text Feature:
● align
● answer
● collect
● contracting
● negotiate
● opportunity won
● qualify
● qualify/align
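A field declared as text but holding only a handful of repeated values, like 'Last Open Stage' above, is better treated as a categorical. One simple way to detect this is a cardinality check, sketched below in Python. The thresholds and the function name are assumptions chosen for illustration, not values from the talk or the library.

```python
from typing import List

def looks_categorical(values: List[str], max_cardinality: int = 30,
                      max_unique_ratio: float = 0.5) -> bool:
    """Heuristic: few distinct values, and those values repeat often."""
    non_empty = [v.strip().lower() for v in values if v and v.strip()]
    if not non_empty:
        return False
    unique = len(set(non_empty))
    return unique <= max_cardinality and unique / len(non_empty) <= max_unique_ratio

# Toy samples: a stage-like field and a free-text notes field.
last_open_stage = ["qualify", "align", "qualify", "negotiate", "align",
                   "qualify", "opportunity won", "qualify", "align", "negotiate"]
free_text = ["Customer asked about pricing for the new plan",
             "Follow up next week regarding the renewal",
             "Left a voicemail, no answer so far"]

stage_is_categorical = looks_categorical(last_open_stage)
notes_are_categorical = looks_categorical(free_text)
```

A field flagged this way could be pivoted like any other categorical instead of being tokenized as free text.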
31. Automatic Feature Selection
● Analyze every feature and output descriptive statistics
○ Mean
○ Min
○ Max
○ Variance
○ Number of Nulls
● Ensure Features have acceptable ranges
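The statistics listed on this slide are cheap to compute per feature. The Python sketch below computes them and applies a simple acceptance check. The thresholds (`min_variance`, `max_null_fraction`) and function names are hypothetical and only illustrate what "acceptable ranges" could mean.

```python
from typing import Dict, List, Optional

def describe(values: List[Optional[float]]) -> Dict[str, float]:
    """Mean, min, max, population variance and null count for one feature."""
    present = [v for v in values if v is not None]
    n = len(present)
    mean = sum(present) / n if n else float("nan")
    variance = sum((v - mean) ** 2 for v in present) / n if n else float("nan")
    return {
        "mean": mean,
        "min": min(present) if n else float("nan"),
        "max": max(present) if n else float("nan"),
        "variance": variance,
        "nulls": float(len(values) - n),
    }

def is_acceptable(stats: Dict[str, float], total: int,
                  min_variance: float = 1e-5, max_null_fraction: float = 0.9) -> bool:
    """Reject near-constant features and features that are almost always null."""
    if total == 0 or stats["nulls"] / total > max_null_fraction:
        return False
    return stats["variance"] >= min_variance

revenue = [10.0, None, 30.0, 20.0]
constant = [1.0, 1.0, 1.0, 1.0]
revenue_stats = describe(revenue)
```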
32. Automatic Feature Selection
● Analyze each feature's correlation with the label: which features have the most and least predictive power?
● Drop features with low predictive power
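As a rough illustration of correlation-based selection, the Python sketch below ranks features by absolute Pearson correlation with the label and drops those under a threshold. The threshold, the feature names and the toy data are made up for the example. A real pipeline might use other association measures as well.

```python
import math
from typing import Dict, List

def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation; returns 0.0 when either input is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)

def select_features(features: Dict[str, List[float]], label: List[float],
                    min_abs_corr: float = 0.1) -> List[str]:
    """Keep features whose absolute correlation with the label meets the
    threshold, ordered from most to least predictive."""
    scores = {name: abs(pearson(col, label)) for name, col in features.items()}
    kept = [name for name, score in scores.items() if score >= min_abs_corr]
    return sorted(kept, key=lambda name: scores[name], reverse=True)

# Toy data (made up for illustration).
label = [0.0, 0.0, 1.0, 1.0, 1.0, 0.0]
features = {
    "emails_opened": [1.0, 0.0, 5.0, 4.0, 6.0, 1.0],  # tracks the label
    "weekday":       [2.0, 4.0, 2.0, 4.0, 3.0, 3.0],  # same mean in both classes
    "constant":      [3.0, 3.0, 3.0, 3.0, 3.0, 3.0],  # no variance
}
selected = select_features(features, label)
```

The same scores could also flag the opposite problem: a feature that correlates almost perfectly with the label is a candidate for the hindsight bias described earlier in the talk.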
34. Evaluating Models
Need to know the true label to evaluate the model
● Usually do a random train/holdout split on the labeled data and use cross-validation on the training set
[Figure: labeled data split into Training set and Holdout set]
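The split described on this slide can be sketched with index lists alone. The Python below reserves a random holdout set and then builds cross-validation folds from the remaining training rows. The function names, the 20% holdout fraction and the fold count are assumptions.

```python
import random
from typing import List, Tuple

def train_holdout_split(n_rows: int, holdout_fraction: float = 0.2,
                        seed: int = 42) -> Tuple[List[int], List[int]]:
    """Shuffle row indices and reserve a fraction as the holdout set."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    n_holdout = int(n_rows * holdout_fraction)
    return indices[n_holdout:], indices[:n_holdout]

def k_folds(train: List[int], k: int = 3) -> List[Tuple[List[int], List[int]]]:
    """Split the training indices into k (fit, validate) pairs."""
    folds = [train[i::k] for i in range(k)]
    return [
        ([idx for j, fold in enumerate(folds) if j != i for idx in fold], folds[i])
        for i in range(k)
    ]

train, holdout = train_holdout_split(n_rows=10)
folds = k_folds(train, k=4)
```

The holdout indices never appear in any fold, so the final evaluation uses rows that played no part in model or hyperparameter selection.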
35. Evaluating Models
● A time-based evaluation dataset is the true test of how well a model is performing
○ Wait for existing (or new) records to have their label determined
○ Predict from an older state of that record and compare to the true label
● Biggest problem is usually waiting for enough data to be available
● We can also switch over to constructing the model from the true event sequence rather than a snapshot
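A time-based split can be expressed as a filter on two dates per record: when its features were captured and when its label became known. The Python sketch below assumes such fields exist. The `Record` type, its field names and the sample dates are hypothetical.

```python
from datetime import date
from typing import List, NamedTuple, Tuple

class Record(NamedTuple):
    record_id: str
    snapshot_date: date   # when the features were captured
    label_date: date      # when the true label became known
    label: int

def time_based_split(records: List[Record], cutoff: date) -> Tuple[List[Record], List[Record]]:
    """Train on records labeled on or before the cutoff. Evaluate on records
    whose features were captured by the cutoff but labeled afterwards."""
    train = [r for r in records if r.label_date <= cutoff]
    evaluate = [r for r in records if r.snapshot_date <= cutoff < r.label_date]
    return train, evaluate

# Made-up records for illustration.
records = [
    Record("lead-1", date(2018, 1, 10), date(2018, 3, 1), 1),
    Record("lead-2", date(2018, 5, 20), date(2018, 8, 15), 0),
    Record("lead-3", date(2018, 7, 5), date(2018, 9, 30), 1),
]
train_set, eval_set = time_based_split(records, cutoff=date(2018, 6, 30))
```

Here `lead-3` lands in neither set because it did not exist at the cutoff, which mirrors the point about having to wait for enough data.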
41. Automated Model Selection
● Many hyperparameters for each algorithm
● Automated hyperparameter tuning
○ Faster model creation with improved metrics
○ Search algorithms to find the optimal hyperparameters, e.g. grid search, random search
[Figure panels: Grid Search, Random Search, Bayesian Search]
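Grid search and random search differ only in how candidate settings are generated. The Python sketch below shows both against a stand-in scoring function. `validation_error` is a made-up surface that replaces "train a model and score it", and the parameter names `reg` and `depth` are hypothetical.

```python
import itertools
import random
from typing import Callable, Dict, List, Tuple

def validation_error(params: Dict[str, float]) -> float:
    """Stand-in for 'train a model and score it on validation data' (assumption).
    The minimum is at reg=0.1, depth=6."""
    return (params["reg"] - 0.1) ** 2 + 0.01 * (params["depth"] - 6) ** 2

def grid_search(grid: Dict[str, List[float]], score: Callable) -> Tuple[Dict[str, float], float]:
    """Evaluate every combination in the grid and return the best one."""
    names = list(grid)
    candidates = [dict(zip(names, combo)) for combo in itertools.product(*grid.values())]
    best = min(candidates, key=score)
    return best, score(best)

def random_search(ranges: Dict[str, Tuple[float, float]], score: Callable,
                  n_trials: int = 50, seed: int = 0) -> Tuple[Dict[str, float], float]:
    """Sample points uniformly from each range and return the best one.
    Treating depth as continuous is a simplification."""
    rng = random.Random(seed)
    candidates = [{name: rng.uniform(lo, hi) for name, (lo, hi) in ranges.items()}
                  for _ in range(n_trials)]
    best = min(candidates, key=score)
    return best, score(best)

grid_best, grid_error = grid_search(
    {"reg": [0.01, 0.1, 1.0], "depth": [3, 6, 12]}, validation_error)
random_best, random_error = random_search(
    {"reg": (0.0, 1.0), "depth": (3, 12)}, validation_error)
```

Grid search costs one evaluation per combination, so it grows multiplicatively with each added hyperparameter, while random search keeps a fixed budget of `n_trials`.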
42. Automated Model Selection
Competing algorithms and evaluation metric per problem type:
Binary Classification (AuROC)
● Random Forests
● Decision Trees
● Logistic Regression w/ ElasticNet Regularization
● Naive Bayes
● Gradient Boosted Trees
Regression (RMSE)
● Decision Trees
● Random Forests
● Linear Regression w/ ElasticNet Regularization
Multi-Class Classification (Accuracy)
● Random Forests
● Decision Trees
● Multinomial Logistic Regression w/ ElasticNet
● Naive Bayes
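The three metrics named on this slide are short enough to write out, which makes the "competition" concrete: score every candidate on validation data and keep the best. The Python below is a minimal sketch. The candidate names and their scores are made up, and the selection helper is hypothetical rather than a library function.

```python
import math
from typing import Dict, List

def rmse(y_true: List[float], y_pred: List[float]) -> float:
    """Root mean squared error (lower is better)."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def accuracy(y_true: List[int], y_pred: List[int]) -> float:
    """Fraction of exact matches (higher is better)."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def au_roc(y_true: List[int], scores: List[float]) -> float:
    """Probability that a random positive outranks a random negative (ties count half)."""
    positives = [s for t, s in zip(y_true, scores) if t == 1]
    negatives = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in positives for n in negatives)
    return wins / (len(positives) * len(negatives))

def pick_winner(y_true: List[int], candidate_scores: Dict[str, List[float]]) -> str:
    """Return the candidate with the highest AuROC on the validation labels."""
    return max(candidate_scores, key=lambda name: au_roc(y_true, candidate_scores[name]))

# Made-up validation labels and model scores for illustration.
y_valid = [0, 0, 1, 1]
candidates = {
    "logistic_regression": [0.1, 0.6, 0.4, 0.9],  # one positive ranked below a negative
    "random_forest":       [0.2, 0.3, 0.7, 0.8],  # ranks every positive first
}
winner = pick_winner(y_valid, candidates)
```

For regression the same loop would minimize `rmse` instead of maximizing `au_roc`.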
45. How well does it work?
• TransmogrifAI empowers:
• Predictive Journeys
• Lead Scoring
• Prediction Builder
• Case Classification
• Most of the models deployed in production are completely hands-free
• Serves 5B+ predictions per day
46. Where do WE go next?
• Deeper model & score insights – LOCO, LIME
• Hyperparameter search strategies – Bayesian, Bandit-based
• Feature engineering – text embeddings, model specific
• Model portability
• Enable more applications – recommenders, unsupervised learning
• Perf tuning, bug fixes, docs, examples
• <Your requirements / feedback>
47. Where do YOU go next?
• Read the blog post - https://www.sfdc.co/open-sourcing-transmogrifai
• Try it out - https://transmogrif.ai
• Reach out and contribute - https://sfdc.co/transmogrifai-contributing
• Student? Apply to Google Summer of Code (GSoC) 2019 to work with us!
• Feeling creative? We need a logo.