The document discusses using MARS (Multivariate Adaptive Regression Splines) modeling to predict trends in time series data. MARS is a nonparametric regression technique that can capture nonlinear relationships and variable interactions. It builds a prediction model as a series of piecewise regression splines. The document demonstrates applying MARS to predict software license usage over time, accounting for factors like time of day, day of week, holidays, and previous usage levels. MARS identifies the most important predictors and builds an autoregressive license usage model with good predictive performance.
This document summarizes research using random forests and archetypal analysis to analyze dietary patterns in the Cache County Memory Study. Random forests were used to predict cookbook authors based on recipe nutrition and predict dementia status based on diet. Archetypal analysis identified extreme dietary patterns or "archetypes" that approximated individual diets. Different weighting schemes in random forest algorithms and varying numbers of archetypes were explored. Important predictive variables and archetypes were identified.
This document proposes a video genre classification method using only audio features extracted from video clips. It uses Multivariate Adaptive Regression Splines (MARS) to build classification models for different genres based on low-level audio features like MFCCs, zero crossing rate, short-time energy, etc. extracted from a dataset of news, cartoons, sports, music and dahmas video clips. The models are able to accurately classify video genres with an overall classification rate of 91.83% based on the important audio features identified for each genre by the MARS models.
On Foundations of Parameter Estimation for Generalized Partial Linear Models ...SSA KPI
1) The document discusses estimation methods for generalized linear models (GLMs) and generalized partial linear models (GPLMs). 2) GPLMs extend GLMs by adding a single nonparametric component to the linear predictor. 3) Parameter estimation for GPLMs is performed by maximizing a penalized likelihood function, where the penalty term controls the tradeoff between model fit and smoothness of the nonparametric component. 4) An iterative algorithm such as Newton-Raphson is used to solve the penalized maximum likelihood estimation problem.
This document provides an overview of support vector machines (SVMs) as kernel machines. It discusses how SVMs can be formulated as optimization problems in reproducing kernel Hilbert spaces using kernels. Specifically, it covers:
1) How the SVM primal optimization problem can be solved using Lagrange multipliers and the representer theorem to obtain the dual quadratic program.
2) How the regularization parameter C in the C-SVM formulation allows data points to lie on or outside the margin.
3) The active set method for solving the SVM quadratic program, which iteratively optimizes over the sets of active and inactive constraints.
Fundamentals of Machine Learning Bootcamp - 24 Nov London 2014 Persontyle
Fundamentals of Machine Learning Bootcamp will take you through the conceptual and applied foundations of the subject. Topics covered will include Machine Learning theory, types of learning, techniques, models and methods. Labs are developed to practically learn how to use the R programming language and packages for applying the main concepts and techniques of Machine Learning.
For corporate bookings or to organize on-site training email hello@persontyle.comor call now +44 (0)20 3239 3141
www.persontyle.com
The document discusses various techniques for analyzing relationships between two variables using scatter plots and regression analysis. It explains how to use scatter plots to identify different types of relationships between two variables, including no relationship, strong simple relationships, and multivariate relationships. It also discusses linear regression, its limitations, and how to identify inappropriate uses through residuals. Additional techniques covered include smoothing noisy data using splines and LOESS, dealing with non-linear relationships through logarithmic transforms, and optimizing models through variable scaling and transformations. Examples are provided to illustrate key points about revealing relationships and optimizing models for two-variable data.
The document outlines various regression techniques including:
1) Ordinary Least Squares regression which finds the best fitting linear model by minimizing the sum of squared errors between predicted and actual values.
2) Overfitting can occur when the model learns from noise in the data. Regularization addresses this by imposing a penalty on complexity.
3) Kernel regression, spline regression, and Multiple Adaptive Regression Splines (MARS) allow for nonlinear relationships between variables.
The key ingredients of a strong relationship between a buyer, broker, and insurer include open communication, understanding each other's businesses, and working together as long-term partners. Regular meetings allow each to provide expertise that helps the others achieve their risk management goals. Clear roles and ongoing collaboration allow the buyer to reduce risks, the broker to design suitable coverage, and the insurer to provide loss control services, leading to better risk programs and outcomes over the long term.
This document summarizes research using random forests and archetypal analysis to analyze dietary patterns in the Cache County Memory Study. Random forests were used to predict cookbook authors based on recipe nutrition and predict dementia status based on diet. Archetypal analysis identified extreme dietary patterns or "archetypes" that approximated individual diets. Different weighting schemes in random forest algorithms and varying numbers of archetypes were explored. Important predictive variables and archetypes were identified.
This document proposes a video genre classification method using only audio features extracted from video clips. It uses Multivariate Adaptive Regression Splines (MARS) to build classification models for different genres based on low-level audio features like MFCCs, zero crossing rate, short-time energy, etc. extracted from a dataset of news, cartoons, sports, music and dahmas video clips. The models are able to accurately classify video genres with an overall classification rate of 91.83% based on the important audio features identified for each genre by the MARS models.
On Foundations of Parameter Estimation for Generalized Partial Linear Models ...SSA KPI
1) The document discusses estimation methods for generalized linear models (GLMs) and generalized partial linear models (GPLMs). 2) GPLMs extend GLMs by adding a single nonparametric component to the linear predictor. 3) Parameter estimation for GPLMs is performed by maximizing a penalized likelihood function, where the penalty term controls the tradeoff between model fit and smoothness of the nonparametric component. 4) An iterative algorithm such as Newton-Raphson is used to solve the penalized maximum likelihood estimation problem.
This document provides an overview of support vector machines (SVMs) as kernel machines. It discusses how SVMs can be formulated as optimization problems in reproducing kernel Hilbert spaces using kernels. Specifically, it covers:
1) How the SVM primal optimization problem can be solved using Lagrange multipliers and the representer theorem to obtain the dual quadratic program.
2) How the regularization parameter C in the C-SVM formulation allows data points to lie on or outside the margin.
3) The active set method for solving the SVM quadratic program, which iteratively optimizes over the sets of active and inactive constraints.
Fundamentals of Machine Learning Bootcamp - 24 Nov London 2014 Persontyle
Fundamentals of Machine Learning Bootcamp will take you through the conceptual and applied foundations of the subject. Topics covered will include Machine Learning theory, types of learning, techniques, models and methods. Labs are developed to practically learn how to use the R programming language and packages for applying the main concepts and techniques of Machine Learning.
For corporate bookings or to organize on-site training email hello@persontyle.comor call now +44 (0)20 3239 3141
www.persontyle.com
The document discusses various techniques for analyzing relationships between two variables using scatter plots and regression analysis. It explains how to use scatter plots to identify different types of relationships between two variables, including no relationship, strong simple relationships, and multivariate relationships. It also discusses linear regression, its limitations, and how to identify inappropriate uses through residuals. Additional techniques covered include smoothing noisy data using splines and LOESS, dealing with non-linear relationships through logarithmic transforms, and optimizing models through variable scaling and transformations. Examples are provided to illustrate key points about revealing relationships and optimizing models for two-variable data.
The document outlines various regression techniques including:
1) Ordinary Least Squares regression which finds the best fitting linear model by minimizing the sum of squared errors between predicted and actual values.
2) Overfitting can occur when the model learns from noise in the data. Regularization addresses this by imposing a penalty on complexity.
3) Kernel regression, spline regression, and Multiple Adaptive Regression Splines (MARS) allow for nonlinear relationships between variables.
The key ingredients of a strong relationship between a buyer, broker, and insurer include open communication, understanding each other's businesses, and working together as long-term partners. Regular meetings allow each to provide expertise that helps the others achieve their risk management goals. Clear roles and ongoing collaboration allow the buyer to reduce risks, the broker to design suitable coverage, and the insurer to provide loss control services, leading to better risk programs and outcomes over the long term.
This document is a menu sheet that lists and describes various coffee, tea, and hot chocolate options available. There are categories for dark and intense coffees, flavored and decaf coffees, black teas, green teas, flavored teas, herbal infusions, espresso, light and smooth coffees, medium and balanced coffees, medium and bright coffees, licensed coffee brands including Segafredo and Peet's, and fabulous froth options for lattes and cappuccinos. Each listing provides a brief description of the taste profile and flavor notes for that particular item.
GAMs (Generalized Additive Models) allow for nonlinear relationships between predictor and response variables using splines. Splines break data into sections with different formulas to better model threshold effects and complex relationships. The mgcv package in R is used to fit GAMs and includes functions for building, summarizing, checking, and comparing GAMs. Key decisions in GAM modeling include which variables to include, which to model nonlinearly using what type of splines, and the number of knots.
The document provides an in-depth company profile report on Mars, Incorporated from Company Profiles and Conferences. The report contains a detailed company overview, products, services, SWOT analysis, history, locations, subsidiaries, and executive biographies. It is a crucial resource for industry executives and analysts seeking key information about Mars, Incorporated and their operations. The report utilizes primary and secondary research sources to objectively study the company's strengths, weaknesses, opportunities and threats.
Cook with your kids and enter to win $15,000 for your family and $30,000 for your child's school cafeteria. To enter, cook a recipe with your child, make a fun 3 minute or less video about it, and upload the video to unclebens.com between July 29th and October 6th. Your school can also win $30,000 if families from that school collectively submit the most entries, so encourage other families to enter as well. The contest is open to US parents of children in grades K-8 and no purchase is necessary to enter.
The document provides information about the Seattle Seahawks American football team. It notes that Marshawn Lynch enjoys skittles and was given a two year supply by Mars Inc., Russell Wilson was the shortest quarterback in 2013, and the Seahawks won the Super Bowl in 2014. It also mentions Pete Carroll is the head coach and the team plays home games at Century Link Field in Seattle, Washington which holds 67,000 fans.
The document discusses MARS (Multivariate Adaptive Regression Splines), a new tool for regression analysis. MARS can automatically select variables, detect interactions between variables, and produce models that are protected against overfitting. It was developed by Jerome Friedman and produces smooth curves rather than step functions like CART. The document provides an introduction to MARS concepts and guidelines for using MARS in practice.
This document provides an overview of Mars, Incorporated, a global manufacturer founded in 1911 that generates $33 billion annually in sales. It discusses Mars' cocoa supply chain management through training programs and certification. The report also examines Mars' corporate social responsibility initiatives in energy/climate, water impact, waste, and treatment of employees and customers. Mars' global operations, leadership, and impacts on communities are reviewed. The company is praised for its successful environmental cooperation and innovative supply chain management.
M&M's are colorful candy shells with lowercase "m" printed on one side that surround various fillings. They were invented in the 1930s by Forrest Mars who saw soldiers eating chocolate pellets with a hard shell during the Spanish Civil War. Over the years, M&M's were introduced internationally and in new varieties like Peanut Butter, Crispy, Pretzel, and Premium varieties sold in cartons. Production of M&M's has increased over time to perfect producing over 3,300 pounds of chocolate centers per hour.
Mars incorporated interview questions and answersPenelopeCruz99
This document provides interview preparation materials for Mars Incorporated, including answers to common interview questions, tips for researching the company, and additional job interview resources. Sample answers are provided for questions like "What is your greatest weakness?" and "Why should we hire you?". The document emphasizes being honest, relating your skills to the role, and showing passion for the company. Other sections list interview question types, thank you letter guidelines, and suggested questions for candidates to ask.
Customer Success Story: Mars Inc. [New York]SAP Ariba
1) Mars Inc. is a global manufacturer and distributor of food products operating in over 74 countries with over $33 billion in annual sales and 75,000 employees.
2) In 2011, Mars Inc. conducted a survey that found their buyers were dissatisfied with their existing procurement technologies and processes.
3) Mars Inc. embarked on a journey to implement a new strategic sourcing platform called eMSSM from Ariba to transform their procurement function, aiming to be in the top 10% of companies within 3-5 years and increase value creation by 25%.
M&M's is known for its colorful candy shells and the tagline "Melts in your mouth, not in your hand." The document discusses M&M's marketing history including joint campaigns to build the brand. It highlights a 2014 Super Bowl ad that introduced a new character which brought confusion, and provides links to videos of the teaser and full ad. The document also discusses M&M's social media marketing strategies on sites like Twitter, including using the hashtag #RedForHire to promote humor around the Super Bowl spot. Graphs are included showing M&M's Twitter engagement.
Using data from over 200,000 pets, standardised growth charts for puppies have been developed which can help veterinarians spot abnormal patterns early on and recommend interventions. Early obesity often leads to overweight in adult dogs so the charts can potentially make a contribution to pet obesity reduction.
This document discusses Mars Inc., a large multinational manufacturer of confectionery, pet food, and other food products. It operates factories and offices across the UK, employing over 4,000 people. However, Mars is facing some challenges related to childhood obesity rates in the UK and increasing public health concerns. The document examines Mars' business strategies, product portfolio, marketing campaigns, and potential areas of diversification.
Mars, Incorporated is a global confectionery and pet care company. It manufactures and distributes confectionery, food, and pet care products under several brands such as M&M's, Snickers, Pedigree, and Whiskas. The report provides an in-depth profile of Mars, including its history, operations, products, competitors, and executives. It also includes a detailed SWOT analysis and business strategy overview to help understand the company's partners, customers, and competitors.
This document provides a history of Whiskas cat food, including that it was originally called Kal Kan foods but changed its name to Whiskas in 1988. It discusses Whiskas' advertising strategies over the years, which have used humor and portrayed cats' love for their owners. The document also outlines UK legislation regarding food claims and Whiskas' dealings with the Advertising Standards Authority regarding preference claims.
In this digital transformation era, we have seen the rise of digital platforms and increased usages of devices particularly in the area of wearables and the Internet of Things (IoT). Given the fast pace change to the IoT landscape and devices, data has become one of the important source of truth for analytics and continuous streaming of data from sensors have also emerged as one of the fuel that revolutionise the emergence of IoT. These includes health telematics, vehicle telematics, predictive maintenance of equipment, manufacturing quality management, consumer behaviour, and more. With this, we will give you an introduction on how to leverage the power of data science and machine learning to understand and explore feature engineering of IoT and sensor data.
The DREAM approach makes it possible to find 100% of differences in data sets and to analyze the root causes of these differences. DREAM is able to aggregate differences to a higher level but it is also able to drill down to see details. DREAM is based on XML and is thereby compatible to all types of databases, like RMBMS, NOSQL, Excel etc. DREAM is an excellent means for testers to reduce time and costs of comparing data and testing, especially regression testing.
This document contains project details including a team structure, project timeline with milestones, project budget breakdown, and work breakdown structure. It outlines the key tasks and timelines for the project across multiple quarters and years. The project budget is split into sections for labour, materials, and travel costs. The work breakdown structure lists specific tasks and associated duration and resources.
This document provides an overview of the Android mobile ecosystem and operating system. It discusses the key players in the ecosystem including OEMs, service providers, and developers. It also summarizes Android's history and growth in the mobile market. The document then describes the architecture and core components of the Android operating system including its Linux kernel, native libraries, application framework, and building blocks like activities, services, content providers, and intents. It provides examples of how these components work together in applications.
"You can download this product from SlideTeam.net"
Present the information related to project output using our presentation deck. In project management, output is the word used to describe the goals and objectives that are associated with it. Our 30 slide PowerPoint deck includes the details that are required to be a part of the presentation which is based on project management such as project timeline, team structure, budget, work breakdown structure, activities sequence, communication plan, project work plan, task matrix, project management dashboard etc. Creating a presentation to highlight all these points may require you to spend lot of time on research and then come up with individual slides. However, with our PPT deck you can access all this in just one download. You can communicate the information using this deck with your clients. Outputs help to describe business activities such as measurements, tracking processes and status report related to planning, managing and completing the project itself. Create the most innovative presentation using project output PowerPoint deck that can help you to engage your audience with it. Explain the intent behind each clause with our Project Output Powerpoint Presentation Slides. Be able to decipher the judgement. https://bit.ly/3tIgD8G
What is the Lifecycle Modeling Language?SarahCraig7
The document discusses the Lifecycle Modeling Language (LML). It provides the following key details:
- LML combines logical constructs and an ontology to capture systems engineering information across the full lifecycle.
- The LML ontology has 12 primary element classes and standardized relationships between classes.
- LML supports requirements management, modeling, simulation, and verification through diagrams and models.
- LML is intended to simplify modeling compared to other languages while supporting the full lifecycle.
- The LML standard is overseen by a steering committee and SPEC Innovations' Innoslate product supports its implementation.
This document is a menu sheet that lists and describes various coffee, tea, and hot chocolate options available. There are categories for dark and intense coffees, flavored and decaf coffees, black teas, green teas, flavored teas, herbal infusions, espresso, light and smooth coffees, medium and balanced coffees, medium and bright coffees, licensed coffee brands including Segafredo and Peet's, and fabulous froth options for lattes and cappuccinos. Each listing provides a brief description of the taste profile and flavor notes for that particular item.
GAMs (Generalized Additive Models) allow for nonlinear relationships between predictor and response variables using splines. Splines break data into sections with different formulas to better model threshold effects and complex relationships. The mgcv package in R is used to fit GAMs and includes functions for building, summarizing, checking, and comparing GAMs. Key decisions in GAM modeling include which variables to include, which to model nonlinearly using what type of splines, and the number of knots.
The document provides an in-depth company profile report on Mars, Incorporated from Company Profiles and Conferences. The report contains a detailed company overview, products, services, SWOT analysis, history, locations, subsidiaries, and executive biographies. It is a crucial resource for industry executives and analysts seeking key information about Mars, Incorporated and their operations. The report utilizes primary and secondary research sources to objectively study the company's strengths, weaknesses, opportunities and threats.
Cook with your kids and enter to win $15,000 for your family and $30,000 for your child's school cafeteria. To enter, cook a recipe with your child, make a fun 3 minute or less video about it, and upload the video to unclebens.com between July 29th and October 6th. Your school can also win $30,000 if families from that school collectively submit the most entries, so encourage other families to enter as well. The contest is open to US parents of children in grades K-8 and no purchase is necessary to enter.
The document provides information about the Seattle Seahawks American football team. It notes that Marshawn Lynch enjoys skittles and was given a two year supply by Mars Inc., Russell Wilson was the shortest quarterback in 2013, and the Seahawks won the Super Bowl in 2014. It also mentions Pete Carroll is the head coach and the team plays home games at Century Link Field in Seattle, Washington which holds 67,000 fans.
The document discusses MARS (Multivariate Adaptive Regression Splines), a new tool for regression analysis. MARS can automatically select variables, detect interactions between variables, and produce models that are protected against overfitting. It was developed by Jerome Friedman and produces smooth curves rather than step functions like CART. The document provides an introduction to MARS concepts and guidelines for using MARS in practice.
This document provides an overview of Mars, Incorporated, a global manufacturer founded in 1911 that generates $33 billion annually in sales. It discusses Mars' cocoa supply chain management through training programs and certification. The report also examines Mars' corporate social responsibility initiatives in energy/climate, water impact, waste, and treatment of employees and customers. Mars' global operations, leadership, and impacts on communities are reviewed. The company is praised for its successful environmental cooperation and innovative supply chain management.
M&M's are colorful candy shells with lowercase "m" printed on one side that surround various fillings. They were invented in the 1930s by Forrest Mars who saw soldiers eating chocolate pellets with a hard shell during the Spanish Civil War. Over the years, M&M's were introduced internationally and in new varieties like Peanut Butter, Crispy, Pretzel, and Premium varieties sold in cartons. Production of M&M's has increased over time to perfect producing over 3,300 pounds of chocolate centers per hour.
Mars incorporated interview questions and answersPenelopeCruz99
This document provides interview preparation materials for Mars Incorporated, including answers to common interview questions, tips for researching the company, and additional job interview resources. Sample answers are provided for questions like "What is your greatest weakness?" and "Why should we hire you?". The document emphasizes being honest, relating your skills to the role, and showing passion for the company. Other sections list interview question types, thank you letter guidelines, and suggested questions for candidates to ask.
Customer Success Story: Mars Inc. [New York]SAP Ariba
1) Mars Inc. is a global manufacturer and distributor of food products operating in over 74 countries with over $33 billion in annual sales and 75,000 employees.
2) In 2011, Mars Inc. conducted a survey that found their buyers were dissatisfied with their existing procurement technologies and processes.
3) Mars Inc. embarked on a journey to implement a new strategic sourcing platform called eMSSM from Ariba to transform their procurement function, aiming to be in the top 10% of companies within 3-5 years and increase value creation by 25%.
M&M's is known for its colorful candy shells and the tagline "Melts in your mouth, not in your hand." The document discusses M&M's marketing history including joint campaigns to build the brand. It highlights a 2014 Super Bowl ad that introduced a new character which brought confusion, and provides links to videos of the teaser and full ad. The document also discusses M&M's social media marketing strategies on sites like Twitter, including using the hashtag #RedForHire to promote humor around the Super Bowl spot. Graphs are included showing M&M's Twitter engagement.
Using data from over 200,000 pets, standardised growth charts for puppies have been developed which can help veterinarians spot abnormal patterns early on and recommend interventions. Early obesity often leads to overweight in adult dogs so the charts can potentially make a contribution to pet obesity reduction.
This document discusses Mars Inc., a large multinational manufacturer of confectionery, pet food, and other food products. It operates factories and offices across the UK, employing over 4,000 people. However, Mars is facing some challenges related to childhood obesity rates in the UK and increasing public health concerns. The document examines Mars' business strategies, product portfolio, marketing campaigns, and potential areas of diversification.
Mars, Incorporated is a global confectionery and pet care company. It manufactures and distributes confectionery, food, and pet care products under several brands such as M&M's, Snickers, Pedigree, and Whiskas. The report provides an in-depth profile of Mars, including its history, operations, products, competitors, and executives. It also includes a detailed SWOT analysis and business strategy overview to help understand the company's partners, customers, and competitors.
This document provides a history of Whiskas cat food, including that it was originally called Kal Kan foods but changed its name to Whiskas in 1988. It discusses Whiskas' advertising strategies over the years, which have used humor and portrayed cats' love for their owners. The document also outlines UK legislation regarding food claims and Whiskas' dealings with the Advertising Standards Authority regarding preference claims.
In this digital transformation era, we have seen the rise of digital platforms and increased usages of devices particularly in the area of wearables and the Internet of Things (IoT). Given the fast pace change to the IoT landscape and devices, data has become one of the important source of truth for analytics and continuous streaming of data from sensors have also emerged as one of the fuel that revolutionise the emergence of IoT. These includes health telematics, vehicle telematics, predictive maintenance of equipment, manufacturing quality management, consumer behaviour, and more. With this, we will give you an introduction on how to leverage the power of data science and machine learning to understand and explore feature engineering of IoT and sensor data.
The DREAM approach makes it possible to find 100% of differences in data sets and to analyze the root causes of these differences. DREAM is able to aggregate differences to a higher level but it is also able to drill down to see details. DREAM is based on XML and is thereby compatible to all types of databases, like RMBMS, NOSQL, Excel etc. DREAM is an excellent means for testers to reduce time and costs of comparing data and testing, especially regression testing.
This document contains project details including a team structure, project timeline with milestones, project budget breakdown, and work breakdown structure. It outlines the key tasks and timelines for the project across multiple quarters and years. The project budget is split into sections for labour, materials, and travel costs. The work breakdown structure lists specific tasks and associated duration and resources.
This document provides an overview of the Android mobile ecosystem and operating system. It discusses the key players in the ecosystem including OEMs, service providers, and developers. It also summarizes Android's history and growth in the mobile market. The document then describes the architecture and core components of the Android operating system including its Linux kernel, native libraries, application framework, and building blocks like activities, services, content providers, and intents. It provides examples of how these components work together in applications.
"You can download this product from SlideTeam.net"
Present the information related to project output using our presentation deck. In project management, output is the word used to describe the goals and objectives that are associated with it. Our 30 slide PowerPoint deck includes the details that are required to be a part of the presentation which is based on project management such as project timeline, team structure, budget, work breakdown structure, activities sequence, communication plan, project work plan, task matrix, project management dashboard etc. Creating a presentation to highlight all these points may require you to spend lot of time on research and then come up with individual slides. However, with our PPT deck you can access all this in just one download. You can communicate the information using this deck with your clients. Outputs help to describe business activities such as measurements, tracking processes and status report related to planning, managing and completing the project itself. Create the most innovative presentation using project output PowerPoint deck that can help you to engage your audience with it. Explain the intent behind each clause with our Project Output Powerpoint Presentation Slides. Be able to decipher the judgement. https://bit.ly/3tIgD8G
What is the Lifecycle Modeling Language?SarahCraig7
The document discusses the Lifecycle Modeling Language (LML). It provides the following key details:
- LML combines logical constructs and an ontology to capture systems engineering information across the full lifecycle.
- The LML ontology has 12 primary element classes and standardized relationships between classes.
- LML supports requirements management, modeling, simulation, and verification through diagrams and models.
- LML is intended to simplify modeling compared to other languages while supporting the full lifecycle.
- The LML standard is overseen by a steering committee and SPEC Innovations' Innoslate product supports its implementation.
Allscripts will host monthly two-day Developer Workshops from October 2012 to June 2013 in Raleigh, North Carolina to help developers refine their skills and integrate with Allscripts platforms. The workshops will cover connecting to SDKs, commonly used APIs and web services, and sample code. Developers should bring their laptops. The workshops will be held at Allscripts' Raleigh office and hotel recommendations are provided.
Allscripts will host monthly two-day Developer Workshops from October 2012 to June 2013 in Raleigh, North Carolina to help developers refine their skills and integrate with Allscripts platforms. The workshops will cover connecting to SDKs, commonly used APIs and web services, and sample code. Developers should bring their laptops. The workshops will be held at Allscripts' Raleigh office and hotel recommendations are provided.
Allscripts will host monthly two-day Developer Workshops from October 2012 to June 2013 in Raleigh, North Carolina to help developers refine their skills and integrate with Allscripts platforms. The workshops will cover connecting to SDKs, commonly used APIs and web services, and sample code. Developers should bring their laptops. The workshops will be held at Allscripts' Raleigh office and hotel recommendations are provided.
This is one of 7 reports provided in work package 3: Micro services for small and medium institutions.
Authors:
Odo Benda, Astrid Höller and Gerda Koch
AIT Angewandte Informationstechnik Forschungsgesellschaft mbH
This document discusses meter testing, including field testing and shop testing. Field testing involves testing meters at customer locations and can involve issues with using customer load versus artificial load. Various equipment is used for field testing like adapters, test switches, and artificial loads. Best practices for field testing include being professional, keeping equipment in good repair, and being consistent in setup, execution, and reporting. Shop testing focuses on accuracy, communications performance, software/firmware verification, and functionality testing. A demonstration of meter testing tools, setup, testing, and reporting is also provided.
The document summarizes updates and new features for The Raiser's Edge 7.92 including improved query lists, event improvements, better duplicate prevention, and the introduction of The Giving Score to help target fundraising efforts. It also previews upcoming mobile functionality and discusses Product Discovery for providing feedback on desired new features.
Demys&fying
Cloud
Security
J o u r n e y
f r o m
P r o j e c t
t o
P a t e n t
t o
P u b l i c
C o n s u m p & o n
Platform as a Service
Software as a Service Database as a Service Load Balancing as a Service
Monitoring as a Service
Central Access Control as a Service
Infrastructure as a Service
Notification as a Service
Validation as a Service
Health Information Exchange as a Service
This document discusses PhoneGap Build, a cloud service that allows developers to package mobile apps for multiple platforms with just a few clicks. Some key benefits of PhoneGap Build include not having to manage hardware, easy code signing, building for multiple platforms from one codebase, and new features being rolled out with minimal maintenance. It also allows for features like Git/SVN uploads, collaborators, debugging tools, and using a single configuration file across platforms. Usage and new account creation on PhoneGap Build have grown significantly since its launch. Contact information and a demo application are provided.
From 12 to 3500 deployments per year in production Archie Cowan
Practices that enabled ITHAKA's engineering team to increase its change velocity in production from 12 releases per year to over 70 per week on average.
Project Result PowerPoint Presentation Slides SlideTeam
Select our visually appealing content ready Project Result PowerPoint Presentation Slides for project performance evaluation. The project performance management PowerPoint complete deck has various pre-made PPT slides such as project team, budgeting and time management, timeline, work breakdown structure, activities sequence, communication plan, task matrix, project work plan, cost estimate project management dashboard, etc. Furthermore, there are lots of advantages of using this creative professional looking presentation deck like all slides are editable, you can change the color, image, size, font in each template to make it suitable for your business presentation. Showcase the process of measuring business project performance using easy to use project outcomes PPT visuals. Utilize project outcome PPT slides to convey your ideas more effectively. Download project deliverables presentation graphics to monitor & optimize your business project performance. Create a unique and amazing business presentation to present in conferences, meetings, and seminars to surprise the audience. Delve on examples of fraud with our Project Result PowerPoint Presentation Slides. Enlighten folks about different attempts at cheating.
This document summarizes several projects completed by the individual. It includes:
1) A password saving iOS application developed using Swift that stores all data locally without internet connectivity.
2) Several POS terminal applications, barcode generators, and a loyalty program website, mobile app, and POS app developed using languages like C, C#, Java, PHP, and Android.
3) Additional projects completed including SMS applications, a social security model, and more developed using technologies such as Asp.NET, Oracle, PHP, and Java.
This document provides an overview of the Android mobile platform, including:
1) It describes the Android ecosystem and key components like OEMs, service providers, developers, and users.
2) It outlines the major mobile operating systems and highlights some key differences between Android and iOS.
3) It provides a brief history of Android and the Open Handset Alliance.
4) It discusses Google services that are integrated with Android and the Android Marketplace.
5) It explains why Android is growing in popularity with developers, OEMs, and service providers.
Innovative it project management practicesTathagat Varma
The document discusses innovative IT project management practices for delivering projects faster, better, and cheaper. It covers Agile, Scrum, Lean, Kanban, and Theory of Constraints approaches and how tighter integration of people, process, and technology can help achieve the goals of Faster, Better, Cheaper 2.0. The missing piece in many projects is the interlock between these elements, and optimizing the whole system rather than just parts. Failing fast allows for faster success.
Improve Your Regression with CART and RandomForestsSalford Systems
Why You Should Watch: Learn the fundamentals of tree-based machine learning algorithms and how to easily fine tune and improve your Random Forest regression models.
Abstract: In this webinar we'll introduce you to two tree-based machine learning algorithms, CART® decision trees and RandomForests®. We will discuss the advantages of tree based techniques including their ability to automatically handle variable selection, variable interactions, nonlinear relationships, outliers, and missing values. We'll explore the CART algorithm, bootstrap sampling, and the Random Forest algorithm (all with animations) and compare their predictive performance using a real world dataset.
Improved Predictions in Structure Based Drug Design Using Cart and Bayesian M...Salford Systems
The document discusses using in silico methods like virtual screening and predictive modeling to improve drug discovery. It presents results from applying techniques like receptor docking, machine learning algorithms, and Bayesian modeling to develop improved scoring functions that better distinguish active from inactive compounds. These scoring functions helped identify key molecular properties that correlated with active hits. The methods showed improved ability to find active hits compared to previous scoring functions.
Churn Modeling-For-Mobile-Telecommunications Salford Systems
This document summarizes a study on predicting customer churn for a major mobile provider. TreeNet models were used to predict the probability of customers churning (switching providers) within a 30-60 day period. TreeNet models significantly outperformed other methods, increasing accuracy and the proportion of high-risk customers identified. Applying the most accurate TreeNet models could translate to millions in additional annual revenue by helping the provider preemptively retain more customers.
This document provides dos and don'ts for data mining based on experiences from various practitioners. It lists important steps like clearly defining objectives, simplifying solutions, preparing data, using multiple techniques, and checking models. It warns against underestimating preparation, overfitting models, and collecting excessive unhelpful data. Practitioners emphasize the importance of domain knowledge, transparency, and creating models that are understandable to stakeholders.
9 Data Mining Challenges From Data Scientists Like YouSalford Systems
The document outlines 9 challenges faced by data scientists: 1) poor quality data issues like dirty, missing, or inadequate data, 2) lack of understanding of data mining techniques, 3) lack of good literature on important topics and techniques, 4) difficulty for academic institutions accessing commercial-grade software at reasonable costs, 5) accommodating data from different sources and formats, 6) updating models constantly with new incoming data for online machine learning, 7) dealing with huge datasets requiring distributed approaches, 8) determining the right questions to ask of the data, and 9) remaining objective and letting the data lead rather than preconceptions.
This document contains a collection of quotes related to statistics and data. Some key quotes emphasize that while data and information are important, they must be used carefully and combined with human intelligence, judgement, and insight. Other quotes note that statistics can be flexible and misleading if not interpreted carefully, and that collecting quality data over long periods of time is important for analysis. The overall message is that statistics are a useful tool but have limitations, and human discernment is still needed.
Using CART For Beginners with A Teclo Example DatasetSalford Systems
Familiarize yourself with CART Decision Tree technology in this beginner's tutorial using a telecommunications example dataset from the 1990s. By the end of this tutorial you should feel comfortable using CART on your own with sample or real-world data.
The document provides an overview of a 4-part webinar covering the evolution of regression techniques from classical least squares to more advanced machine learning methods like random forests and gradient boosting. It outlines the topics to be covered in each part, including classical regression, regularized regression techniques like ridge regression, LASSO, and MARS, and ensemble methods like random forests and TreeNet gradient boosted trees. Examples using the Boston housing data set are provided to illustrate some of these techniques.
This document discusses how educational institutions can use data mining software to better understand and support their students. It outlines several areas where data analysis can provide insights, such as predicting student performance based on more than just grades, understanding factors that lead to success or failure and graduation, determining the effectiveness of support programs, identifying which recruitment strategies and financial packages attract students, and predicting those most at risk of dropping out or defaulting on loans. The overall goal is to enhance student outcomes and institutional management through analytics.
Comparison of statistical methods commonly used in predictive modelingSalford Systems
This document compares four statistical methods commonly used in predictive modelling: Logistic Multiple Regression (LMR), Principal Component Regression (PCR), Classification and Regression Tree analysis (CART), and Multivariate Adaptive Regression Splines (MARS). It applies these methods to two ecological data sets to test their accuracy, reliability, ease of use, and implementation in a geographic information system (GIS). The results show that independent data is needed to validate models, and that MARS and CART achieved the best prediction success, although CART models became too complex for cartographic purposes with a large number of data points.
This document discusses Dr. Wayne Danter's research using artificial intelligence tools to predict biological activity of molecular structures. His method involves using CART to analyze public HIV data and build predictive models. CART generates decision trees to identify important variables that predict if a molecule is biologically active against HIV. Dr. Danter then uses MARS and NeuroShell Classifier to further improve prediction accuracy. His proprietary CHEMSASTM algorithm teaches neural networks to relate molecular structure to function for screening potential HIV drugs. Using these methods, Dr. Danter has achieved over 96% accuracy in classifying 311 drugs' activity against HIV.
TreeNet Tree Ensembles & CART Decision Trees: A Winning CombinationSalford Systems
Understand CART decision tree pros/cons, how TreeNet stochastic gradient boosting ca n help overcome single-tree challenges, and what the advantages are when using CART and TreeNet in combination for predictive modeling success.
Salford Systems offers several products for data mining and predictive modeling. The table compares features of their Basic, Pro, ProEx, and Ultra components. The Basic component includes basic modeling, reporting, and automation features. Pro adds additional modeling engines and missing data handling capabilities. ProEx further expands the supported modeling techniques and automations. Ultra provides the most extensive set of features, including additional modeling pipelines, ensemble methods, and tree-based algorithms.
This document provides an introduction to MARS (Multivariate Adaptive Regression Splines), an automated regression modeling tool. MARS can build accurate predictive models for continuous and binary dependent variables by automatically selecting variables, determining transformations and interactions between variables, and handling missing data. It efficiently searches through all possible models to identify an optimal solution. The document explains how MARS works, provides settings to configure MARS, and uses the Boston housing dataset to demonstrate the basic steps of building a MARS model.
The document discusses combining CART (Classification and Regression Tree) and logistic regression models to take advantage of their respective strengths in classification and data mining tasks. It describes how running a logistic regression on the entire dataset using CART terminal node assignments as dummy variables allows the logistic model to find effects across nodes that CART cannot detect. This improves CART's predictions by imposing slopes on cases within nodes and providing a more granular, continuous response than CART alone. The approach also allows compensating for some of CART's weaknesses like coarse-grained responses.
When building a predictive model in SPM, you'll want to know exactly what you did to get your results. This short slide deck will show you how to review your work in the session logs.
The document discusses techniques for compressing and extracting rules from TreeNet models. It describes how TreeNet has achieved high predictive performance but its models can be refined further. Regularized regression can be applied to the trees or nodes in a TreeNet model to combine similar trees, reweight trees, and select a compressed subset of trees without much loss in accuracy. This "model compression" technique aims to simplify TreeNet models for improved deployment while maintaining good predictive performance.
Taking AI to the Next Level in Manufacturing.pdfssuserfac0301
Read Taking AI to the Next Level in Manufacturing to gain insights on AI adoption in the manufacturing industry, such as:
1. How quickly AI is being implemented in manufacturing.
2. Which barriers stand in the way of AI adoption.
3. How data quality and governance form the backbone of AI.
4. Organizational processes and structures that may inhibit effective AI adoption.
6. Ideas and approaches to help build your organization's AI strategy.
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
UiPath Test Automation using UiPath Test Suite series, part 6DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI.
UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and Open AI.
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfChart Kalyan
A Mix Chart displays historical data of numbers in a graphical or tabular form. The Kalyan Rajdhani Mix Chart specifically shows the results of a sequence of numbers over different periods.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
Driving Business Innovation: Latest Generative AI Advancements & Success StorySafe Software
Are you ready to revolutionize how you handle data? Join us for a webinar where we’ll bring you up to speed with the latest advancements in Generative AI technology and discover how leveraging FME with tools from giants like Google Gemini, Amazon, and Microsoft OpenAI can supercharge your workflow efficiency.
During the hour, we’ll take you through:
Guest Speaker Segment with Hannah Barrington: Dive into the world of dynamic real estate marketing with Hannah, the Marketing Manager at Workspace Group. Hear firsthand how their team generates engaging descriptions for thousands of office units by integrating diverse data sources—from PDF floorplans to web pages—using FME transformers, like OpenAIVisionConnector and AnthropicVisionConnector. This use case will show you how GenAI can streamline content creation for marketing across the board.
Ollama Use Case: Learn how Scenario Specialist Dmitri Bagh has utilized Ollama within FME to input data, create custom models, and enhance security protocols. This segment will include demos to illustrate the full capabilities of FME in AI-driven processes.
Custom AI Models: Discover how to leverage FME to build personalized AI models using your data. Whether it’s populating a model with local data for added security or integrating public AI tools, find out how FME facilitates a versatile and secure approach to AI.
We’ll wrap up with a live Q&A session where you can engage with our experts on your specific use cases, and learn more about optimizing your data workflows with AI.
This webinar is ideal for professionals seeking to harness the power of AI within their data management systems while ensuring high levels of customization and security. Whether you're a novice or an expert, gain actionable insights and strategies to elevate your data processes. Join us to see how FME and AI can revolutionize how you work with data!
Project Management Semester Long Project - Acuityjpupo2018
Acuity is an innovative learning app designed to transform the way you engage with knowledge. Powered by AI technology, Acuity takes complex topics and distills them into concise, interactive summaries that are easy to read & understand. Whether you're exploring the depths of quantum mechanics or seeking insight into historical events, Acuity provides the key information you need without the burden of lengthy texts.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Speck&Tech
ABSTRACT: A prima vista, un mattoncino Lego e la backdoor XZ potrebbero avere in comune il fatto di essere entrambi blocchi di costruzione, o dipendenze di progetti creativi e software. La realtà è che un mattoncino Lego e il caso della backdoor XZ hanno molto di più di tutto ciò in comune.
Partecipate alla presentazione per immergervi in una storia di interoperabilità, standard e formati aperti, per poi discutere del ruolo importante che i contributori hanno in una comunità open source sostenibile.
BIO: Sostenitrice del software libero e dei formati standard e aperti. È stata un membro attivo dei progetti Fedora e openSUSE e ha co-fondato l'Associazione LibreItalia dove è stata coinvolta in diversi eventi, migrazioni e formazione relativi a LibreOffice. In precedenza ha lavorato a migrazioni e corsi di formazione su LibreOffice per diverse amministrazioni pubbliche e privati. Da gennaio 2020 lavora in SUSE come Software Release Engineer per Uyuni e SUSE Manager e quando non segue la sua passione per i computer e per Geeko coltiva la sua curiosità per l'astronomia (da cui deriva il suo nickname deneb_alpha).
5th LF Energy Power Grid Model Meet-up SlidesDanBrown980551
5th Power Grid Model Meet-up
It is with great pleasure that we extend to you an invitation to the 5th Power Grid Model Meet-up, scheduled for 6th June 2024. This event will adopt a hybrid format, allowing participants to join us either through an online Mircosoft Teams session or in person at TU/e located at Den Dolech 2, Eindhoven, Netherlands. The meet-up will be hosted by Eindhoven University of Technology (TU/e), a research university specializing in engineering science & technology.
Power Grid Model
The global energy transition is placing new and unprecedented demands on Distribution System Operators (DSOs). Alongside upgrades to grid capacity, processes such as digitization, capacity optimization, and congestion management are becoming vital for delivering reliable services.
Power Grid Model is an open source project from Linux Foundation Energy and provides a calculation engine that is increasingly essential for DSOs. It offers a standards-based foundation enabling real-time power systems analysis, simulations of electrical power grids, and sophisticated what-if analysis. In addition, it enables in-depth studies and analysis of the electrical power grid’s behavior and performance. This comprehensive model incorporates essential factors such as power generation capacity, electrical losses, voltage levels, power flows, and system stability.
Power Grid Model is currently being applied in a wide variety of use cases, including grid planning, expansion, reliability, and congestion studies. It can also help in analyzing the impact of renewable energy integration, assessing the effects of disturbances or faults, and developing strategies for grid control and optimization.
What to expect
For the upcoming meetup we are organizing, we have an exciting lineup of activities planned:
-Insightful presentations covering two practical applications of the Power Grid Model.
-An update on the latest advancements in Power Grid -Model technology during the first and second quarters of 2024.
-An interactive brainstorming session to discuss and propose new feature requests.
-An opportunity to connect with fellow Power Grid Model enthusiasts and users.
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc
How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024?
In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores.
See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe.
This webinar will review:
- The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey
- The top challenges for privacy leaders, practitioners, and organizations in 2024
- Key themes to consider in developing and maintaining your privacy program
1. May 2012
Maria Lupetini
Engineering Asset Management & Analytics
Qualcomm Incorporated
2. Advantages of MARS Modeling
Predicting Demand for an Asset
Capturing Trends and Seasonal Effects
Finding Interactive Effects
Weighting More Recent Data
Autoregressive Model for Time Series
Using Lag Variables
Don’t be Afraid of Missing Values
Summary of Findings
3. Regression: Linear, Logistic, GLM, MARS
ARIMA Time Series
Decision Trees
Neural Networks
Support Vector Machines
And more
Need to pick one or more approaches tailored
to problem you are tackling
4. Sales - Dollars, Number of Chips
Resources - People, Software Assets
Performance of a Semiconductor - Seconds
to load a web page
…You name it.
5. Data contains continuous numbers
$123,456.00
Number of employees
Understand influences of categories
Geographical regions
Operating system: Windows, Android
Seasonal or repeated trends
Months of the year
Christmas season
Special Effects
Consumer Promotions and Advertising
Switch turned on
6. What do you do if you want to predict a trend or find a pattern in data….and
There are hundreds of possible variables that influence your outcome -
◦ Which ones matter?
What if the variables interact with each other and effect the outcome
◦ How do you find that those relationships?
What if variables are not linearly related to the outcome
◦ How do determine the what the relationship curves will look like?
◦ Threshold or plateau relationship
What if the data you are using to predict is a mixture of numbers and categories
◦ How do you build a prediction formula?
How do I build a prediction model that is easy to understand?
… USE MARS
7. MARS short for Multivariate Adaptive Regression Splines
Technique introduced in 1991, Jerome Friedman, Stanford
University
Nonparametric, data driven algorithm
Prediction is a regression model with additional side
equations (basis functions)
Uses piecewise regression splines to build the prediction
Provides data reduction to select which variables matter
8. Software Used in Designing Semiconductor Chips
Is the use of the software growing?
What time of day are the software licenses most
demanded?
Does demand change over the weekend?
How many copies do we need next week?
9. 100
150
200
250
300
350
50
0
8/28/2011 12…
9/2/2011 4 PM
9/8/2011 8 AM
9/14/2011 12…
9/19/2011 4 PM
9/25/2011 8 AM
10/1/2011 12…
10/6/2011 4 PM
10/12/2011 8…
10/18/2011 12…
10/23/2011 4…
10/29/2011 8…
11/4/2011 12…
11/9/2011 4 PM
11/15/2011 8…
11/21/2011 12…
11/26/2011 4…
12/2/2011 8 AM
12/8/2011 12…
12/13/2011 4…
12/19/2011 8…
12/25/2011 12…
12/30/2011 4…
1/5/2012 8 AM
1/11/2012 12…
1/16/2012 4 PM
1/22/2012 8 AM
1/28/2012 12…
from Aug 2011 to April 2012
2/2/2012 4 PM
2/8/2012 8 AM
2/14/2012 12…
2/19/2012 4 PM
Number of Software Licenses Used in an Hour
2/25/2012 8 AM
3/2/2012 12 AM
3/7/2012 4 PM
3/13/2012 9 AM
3/19/2012 1 AM
3/24/2012 5 PM
How do you forecast this time series of demand data?
3/30/2012 9 AM
4/5/2012 1 AM
4/10/2012 5 PM
10. Actual
Licenses Week Day Week
Time Used Number WeekDay Name end Holiday Hour
9/4/2011 9 PM 58
37 1 Sun 1 Y 21
9/4/2011 10 PM 75
37 1 Sun 1 Y 22
9/4/2011 11 PM 88
37 1 Sun 1 Y 23
9/5/2011 12 AM 81
37 2 Mon 0 Y 0
9/5/2011 1 AM 74
37 2 Mon 0 Y 1
9/5/2011 2 AM 80
37 2 Mon 0 Y 2
9/5/2011 3 AM 81
37 2 Mon 0 Y 3
• Real Continuous or Integer Variables: License Counts, Week Number
• Categorical Text Variables: Holiday flag, Day Name
• Binary Numbers: Weekend flag
• Choice of Categorical or Real Number: Week Day, Hour
11. Can we building a prediction model of the form?
Demand =
Constant Base+
Baseline trend +
Hour of day effect +
Day of Week effect +
Holiday effect
13. Trend line captures:
• Growing use of this software product from Sep 20112 to Apr 2012
• Deadlines of semiconductor chip projects (Jan. and March)
14. Additional
licenses
needed as
function of
hour of the
day
Hour Predictor Captures:
• Highest use of licenses during 10 to 1pm US Pacific time
• Effect of Use in European/Indian time zones
15. Additional Weekday was coded as
licenses a continuous variable.
needed as Coding it as a
function of categorical can also
day of the work here.
week 1= Sunday,
2=Monday, etc
Day of Week Predictor Captures:
• Highest use of licenses during Wednesday to Friday
16. Possible Interactive Effects Between Variables
Look to find an interactive
effects between hour of day
and day of week.
Did not want to allow
interactive effects between
week_number and holiday
variables with other variables
17. Additional
licenses needed
as function of
hour and day
Interactive effect
• Work patterns are different on the weekends when
compared to the work week.
18. Additional
licenses
needed on
non-holidays
Holiday Predictor Captures:
• The difference in demand in a hour if it is a holiday
19. Weighting of Observations
5/21/2012 12 AM
Day and Hour Observation
4/1/2012 12 AM
2/11/2012 12 AM
12/23/2011 12 AM
11/3/2011 12 AM
9/14/2011 12 AM
7/26/2011 12 AM
0 1 2 3 4
Weight Applied to Observations
MARS will consider a “variable” as a weighting factor.
Here, the observations in April 2012 were 3 times
more important than observations in Sep 2011.
20. 100
150
200
250
300
350
50
0
4/8/2012 12 AM
4/8/2012 8 AM
4/8/2012 4 PM
4/9/2012 12 AM
4/9/2012 8 AM
4/9/2012 4 PM
4/10/2012 12 AM
4/10/2012 8 AM
4/10/2012 4 PM
4/11/2012 12 AM
4/11/2012 8 AM
4/11/2012 4 PM
4/12/2012 12 AM
4/12/2012 8 AM
Blue line Actual Licenses Used
4/12/2012 4 PM
Part of the Training Dataset
4/13/2012 12 AM
4/13/2012 8 AM
4/13/2012 4 PM
4/14/2012 12 AM
4/14/2012 8 AM
4/14/2012 4 PM
4/15/2012 12 AM
4/15/2012 8 AM
4/15/2012 4 PM
4/16/2012 12 AM
4/16/2012 8 AM
4/16/2012 4 PM
4/17/2012 12 AM
4/17/2012 8 AM
4/17/2012 4 PM
4/18/2012 12 AM
4/18/2012 8 AM
4/18/2012 4 PM
4/19/2012 12 AM
Number of Software Licenses Used and Predicted
4/19/2012 8 AM
4/19/2012 4 PM
4/20/2012 12 AM
Prediction on Unseen Data
4/20/2012 8 AM
4/20/2012 4 PM
Red line is MARS fit on Training Data for 4/18 to 4/15 and Prediction on 4/15 to 4/21
4/21/2012 12 AM
4/21/2012 8 AM
4/21/2012 4 PM
21. 100
150
200
250
300
350
50
0
8/28/2011 12 AM
9/2/2011 4 PM
9/8/2011 8 AM
9/14/2011 12 AM
9/19/2011 4 PM
9/25/2011 8 AM
10/1/2011 12 AM
10/6/2011 4 PM
10/12/2011 8 AM
10/18/2011 12 AM
10/23/2011 4 PM
10/29/2011 8 AM
11/4/2011 12 AM
11/9/2011 4 PM
11/15/2011 8 AM
11/21/2011 12 AM
11/26/2011 4 PM
12/2/2011 8 AM
12/8/2011 12 AM
12/13/2011 4 PM
12/19/2011 8 AM
12/25/2011 12 AM
Prediction Model
• Overall trend
12/30/2011 4 PM
1/5/2012 8 AM
Training Dataset
1/11/2012 12 AM
1/16/2012 4 PM
1/22/2012 8 AM
1/28/2012 12 AM
Actual
MARS was able to capture:
2/2/2012 4 PM
Number of Software Licenses Used
2/8/2012 8 AM
2/14/2012 12 AM
• Hourly and Week Day effect
2/19/2012 4 PM
2/25/2012 8 AM
• Somewhat captured US holidays
3/2/2012 12 AM
3/7/2012 4 PM
3/13/2012 9 AM
3/19/2012 1 AM
3/24/2012 5 PM
3/30/2012 9 AM
4/5/2012 1 AM
4/10/2012 5 PM
22. Variable Importance -gcv
--------------------------------------------------------------- MARS tells you
WEEKDAY 100.00000 2713.86182 which variables
are most
HOUR 93.20326 2418.96997
WEEK_NUMBER 44.00605 903.06390
HOLIDAY$ 21.76427 574.55463 important.
Great R-Squared
==============================
of 90%. Other
diagnostics, not
N: 15217.52 R-SQUARED: 0.90281 presented here,
MEAN DEP VAR: 158.15640 ADJ R-SQUARED: 0.90214
UNCENTERED R-SQUARED = R-0 SQUARED: 0.98493 looked good too.
F-STATISTIC = 1344.99320 S.E. OF REGRESSION = 35.12427
P-VALUE = 0.00000 RESIDUAL SUM OF SQUARES = .678790E+07
[MDF,NDF] = [ 38, 5502 ] REGRESSION SUM OF SQUARES = .630548E+08
Actual Used: Range 45 to 344 Licenses
Average 95
Standard Dev. 70
23. Can we build a prediction model of the
autoregressive form?
Demand =
Constant Base+
Baseline trend +
Effect of Licenses Used from a week ago +
Workweek vs. Weekend effect +
Holiday effect
24.
25. Set Up Autoregressive Model, Part 2
Creating lag variable for “Used Lag168.”
This predictor is the number of licenses
used in the same hour, in the same day,
in the prior week.
26. MARS found underlying trend when adjusting for other
factors in the Autoregressive model version.
Adjusting for underlying trend makes series
stationary. This is necessary for ARIMA models.
28. Selected MARS Output Showing Model Form and Fit
BF1 = ( USED<168> ne . );
BF2 = ( USED<168> = . ); Basis Functions and
BF3 = max( 0, USED<168> - 42) * BF1; Prediction Equation
BF4 = max( 0, 42 - USED<168>) * BF1; from MARS.
BF5 = (HOLIDAY$ in ( "Y" ));
BF7 = (MON_TO_FRI in ( 0 )); Note the handling of
BF9 = max( 0, WEEK_NUMBER - 50) * BF1;
missing values.
BF10 = max( 0, 50 - WEEK_NUMBER) * BF1;
BF11 = max( 0, USED<168> - 137) * BF1;
BF13 = max( 0, USED<168> - 265) * BF1; Reasonable fit with
BF15 = (MON_TO_FRI in ( 0 )) * BF2; 82% R-squared
Number of Lucenses Needed = 134- 39 * BF1 + 0.58 * BF3 - 2.12 * BF4
- 42* BF5 - 21.6 * BF7 - 0.235 * BF9 - 1.598 * BF10 + 0.338 * BF11
- 0.535 * BF13 - 38 * BF15;
N: 15055.88 R-SQUARED: 0.82525
MEAN DEP VAR: 158.75413 ADJ R-SQUARED: 0.82493
F-STATISTIC = 2533.14901 S.E. OF REGRESSION = 47.37796
29. For observations where the 168 lag of the “Used” variable is not missing:
Holiday = 1 if it’s a holiday, else 0
Weekend = 1 if it’s Saturday or Sunday, else 0
A = max( 0, USED<168> - 42)
B = max( 0, 42 - USED<168>) Autoregressive
C = max( 0, USED<168> - 137) Splines
D = max( 0, USED<168> - 265)
E = max( 0, WEEK_NUMBER - 50)
F = max( 0, 50 - WEEK_NUMBER) Trend line Splines
Forecasted License Need= 95 - 42*Holiday - 22 * Weekend
[0.6 * A - 2.1 * B + 0.3 * C - 0.5 * D] +
[- 0.2 * E - 1.6 * F]
30. 100
150
200
250
350
400
300
50
0
9/4/2011 12 AM
9/10/2011 6 AM
9/16/2011 12 PM
9/22/2011 6 PM
9/29/2011 12 AM
10/5/2011 6 AM
10/11/2011 12 PM
10/17/2011 6 PM
10/24/2011 12 AM
10/30/2011 6 AM
11/5/2011 12 PM
11/11/2011 6 PM
11/18/2011 12 AM
11/24/2011 6 AM
11/30/2011 12 PM
12/6/2011 6 PM
12/13/2011 12 AM
12/19/2011 6 AM
12/25/2011 12 PM
12/31/2011 6 PM
1/7/2012 12 AM
1/13/2012 6 AM
1/19/2012 12 PM
1/25/2012 6 PM
2/1/2012 12 AM
2/7/2012 6 AM
2/13/2012 12 PM
2/19/2012 6 PM
2/26/2012 12 AM
3/3/2012 6 AM
3/9/2012 12 PM
3/15/2012 7 PM
3/22/2012 1 AM
3/28/2012 7 AM
4/3/2012 1 PM
4/9/2012 7 PM
4/16/2012 1 AM
USED
Predicted
31. 100
150
200
250
300
350
400
0
50
4/8/2012 12 AM
4/8/2012 8 AM
4/8/2012 4 PM
4/9/2012 12 AM
4/9/2012 8 AM
4/9/2012 4 PM
4/10/2012 12 AM
4/10/2012 8 AM
4/10/2012 4 PM
4/11/2012 12 AM
4/11/2012 8 AM
Blue line is Actual Used
4/11/2012 4 PM
Part of Training Dataset
4/12/2012 12 AM
4/12/2012 8 AM
4/12/2012 4 PM
4/13/2012 12 AM
4/13/2012 8 AM
4/13/2012 4 PM
4/14/2012 12 AM
4/14/2012 8 AM
4/14/2012 4 PM
4/15/2012 12 AM
4/15/2012 8 AM
4/15/2012 4 PM
4/16/2012 12 AM
4/16/2012 8 AM
4/16/2012 4 PM
4/17/2012 12 AM
4/17/2012 8 AM
4/17/2012 4 PM
4/18/2012 12 AM
4/18/2012 8 AM
4/18/2012 4 PM
Number of Licenses Used and Predicted
4/19/2012 12 AM
4/19/2012 8 AM
Forecasting Unseen Data
4/19/2012 4 PM
4/20/2012 12 AM
4/20/2012 8 AM
4/20/2012 4 PM
4/21/2012 12 AM
Red line is MARS fit on Training data for 4/8 to 4/14 and Prediction on 4/15 to 4/21 data
4/21/2012 8 AM
4/21/2012 4 PM
32. Number of Licenses
100
150
200
250
300
350
400
50
0
4/8/2012 12 AM
4/8/2012 9 AM
4/8/2012 6 PM
4/9/2012 3 AM
4/9/2012 12 PM
4/9/2012 9 PM
4/10/2012 6 AM
4/10/2012 3 PM
4/11/2012 12 AM
4/11/2012 9 AM
4/11/2012 6 PM
4/12/2012 3 AM
4/12/2012 12 PM
4/12/2012 9 PM
4/13/2012 6 AM
Predicted_AutoRegressive
4/13/2012 3 PM
4/14/2012 12 AM
4/14/2012 9 AM
4/14/2012 6 PM
4/15/2012 3 AM
4/15/2012 12 PM
4/15/2012 9 PM
Actual Used
4/16/2012 6 AM
to Actual Licenses Used
4/16/2012 3 PM
4/17/2012 12 AM
4/17/2012 9 AM
Compare Forecast of Two Models
4/17/2012 6 PM
4/18/2012 3 AM
4/18/2012 12 PM
4/18/2012 9 PM
4/19/2012 6 AM
4/19/2012 3 PM
4/20/2012 12 AM
4/20/2012 9 AM
Predicted Not Auto Reg
4/20/2012 6 PM
4/21/2012 3 AM
4/21/2012 12 PM
4/21/2012 9 PM
33. Mathematically
MARS is versatile; it models most data types
Selects best predictors
Models nonlinear relationships
Easily finds selective interactive effects
Simple to create lag variables as predictors
Flexible weighting schemes for observations
Can handle missing values
Operationally
Don’t call me for more software license copies on
Thursday at noon; everyone else is!