2
Using Xpress-Mosel for Modeling and
Solving Data Mining Problems
Alkis Vazacopoulos
Dash Optimization
3
Agenda
• New Customers
• Mosel (modeling environment)
• IVE (Integrated Visual Env.)
• Optimization Technologies
• Data Mining Problems & applications
• Applications
4
New Customers
• Frito-Lay
• Carmen Systems
• Du Pont
• Deutsche Bank
• Siemens
• Toyota
5
Mosel: key features
• Integration of modeling and solving
• Programming facilities for pre/post processing, algorithms
– No separation between modeling statement and procedure
to solve the problem
• Open, modular architecture
• Highly flexible and extensible
6
Embedding a Mosel model
[Diagram labels: Program Starts, Data Input, Model Execution (Problem Solving), Result Retrieval, Output Results, Program Terminates]
7
Mosel: A modeling language
• Decision variables, linear constraints
• Arrays, (index-) sets
• Operators: standard arithmetic, aggregate and set
operators e.g. sum, prod, max, and, or, union, inter
• Loops, Selections e.g. forall-[do], if-then-[elif-then]
• Subroutines: functions and procedures
8
Architecture
[Diagram]
• Enterprise data: customer history, available products, profitability models, content, …
• Xpress-Mosel: e.g. decision variables and constraints, etc.; pre- and post-processing; algorithms
• Solver technologies: LP, MIP, Constraint Programming, Stochastic Programming
• Results: e.g. optimal solutions
9
Mosel: components and interfaces
• Mosel Language: to implement problems and solution algorithms
→ Model or Mosel program
• Mosel Model Compiler and Run-time Libraries: to compile,
execute and access models from a programming language
→ C, C++ , Java or VB program
• Mosel Native Interface (NI): to provide new or extend existing
functionality of Mosel Language
→ module
10
XPRESS-IVE
[Diagram labels: Modeling, Programming, Solving, Benchmarking]
11
Xpress-IVE
• Development environment
• Enables rapid prototyping and testing
• Entity tree for data, variables and constraints
• Matrix visualization
• Branch and Bound tree visualization
• LP, MIP and user defined charts
12
13
Matrix Visualization
14
Optimization Technologies
[Diagram labels: Linear, Mixed Integer, Quadratic Programming (QP), MIQP, Nonlinear, Stochastic, Constraint Programming, Heuristics]
15
Data Mining Application Areas
Extracting useful information from large datasets
of various nature and origin arising in
• Finance
• Manufacturing
• Biomedicine
• Telecommunications
• Military Systems
• Other areas
16
Problems
• Revealing internal structure and
patterns of the data:
–Classification
–Regression
–Clustering
17
Approaches
• LP, MIP
• QP, MIQP
• Network Optimization
• Statistical Preprocessing
• Combinations of these Approaches
18
Classification Problems:
general setup
• “Training dataset”: N elements $(x_i, y_i)$, $i = 1,\dots,N$.
$x_i$ is an n-dimensional vector of the element’s attributes (features)
$y_i$ denotes the class attribute
(the number of classes is specified)
19
Classification Problems:
general setup
• A new element with known attributes x,
but unknown class attribute y
• The problem is to determine which class
this element belongs to
• The classification model is “trained” on
the training dataset and applied to new
elements
20
Classification Problems:
general setup
• Main Idea: Constructing separating
surfaces in the n-dimensional space that
would divide it into several regions
• Each region corresponds to a certain
class
• The new element is classified according
to its geometrical location in the vector
space
21
Classification Problems:
example
22
Classification Problems:
LP approach
• Consider binary classification, one
separating plane
• The plane is represented by the standard
equation
• The problem is to find the optimal values
of the parameters w and γ
23
Classification Problems:
LP approach
• Suppose that the vectors $x_i$ from the training
dataset are stored in two matrices,
A (m×n) and B (k×n), corresponding to m
elements of the 1st class and k elements
of the 2nd class.
• The plane will perfectly separate
elements in A and elements in B if
24
Classification Problems:
LP approach
• Extra variables y and z are introduced to
model classification errors:
• The parameters w and γ are determined from
the LP problem of minimizing the total
misclassification error
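As a minimal sketch of how the error variables behave, the snippet below evaluates them for a fixed plane. It assumes the plane is written as x·w = γ with a unit margin, so that y_i = max(0, 1 − (A_i·w − γ)) and z_j = max(0, 1 + (B_j·w − γ)). This normalization, the per-class averaging of the errors, the toy data and the helper names `dot` and `misclassification_error` are assumptions made for illustration and are not taken from the slides.

```python
def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def misclassification_error(A, B, w, gamma):
    """Return (y, z, total) for the plane x.w = gamma.

    y[i] is the violation of A_i.w - gamma >= 1 (rows of the 1st class),
    z[j] is the violation of B_j.w - gamma <= -1 (rows of the 2nd class),
    total is the sum of the per-class average violations.
    """
    y = [max(0.0, 1.0 - (dot(a, w) - gamma)) for a in A]
    z = [max(0.0, 1.0 + (dot(b, w) - gamma)) for b in B]
    total = sum(y) / len(A) + sum(z) / len(B)
    return y, z, total


# Toy data (hypothetical): class 1 lies to the right, class 2 to the left.
A = [[4.0, 1.0], [5.0, 3.0]]
B = [[0.0, 2.0], [1.0, 0.0]]

y, z, total = misclassification_error(A, B, w=[1.0, 0.0], gamma=2.5)
print(y, z, total)
```

An LP solver would search over w and γ to drive `total` as low as possible; here the plane is fixed and only the error terms are computed.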
25
Classification Problems:
LP formulation
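One common form of such an LP in the classification literature is sketched below. It is an assumption about the intended formulation rather than a transcription: e denotes a vector of ones of matching length, the unit margin is a normalization, and the weights 1/m and 1/k are one usual choice for averaging the errors of each class.

```latex
\begin{aligned}
\min_{w,\,\gamma,\,y,\,z}\quad & \frac{1}{m}\, e^{T} y + \frac{1}{k}\, e^{T} z \\
\text{s.t.}\quad & A w - e\gamma + y \ge e, \\
                 & -B w + e\gamma + z \ge e, \\
                 & y \ge 0,\quad z \ge 0 .
\end{aligned}
```

Under these assumptions, an optimal value of zero means that the plane $x^{T} w = \gamma$ separates the rows of A from the rows of B; otherwise y and z measure how far the offending points fall on the wrong side of the margin.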
26
Classification Problems:
generalized approaches
• Using multiple, non-linear separating
surfaces (e.g., polynomial, exponential,
logarithmic)
– Finding parameters of these surfaces can also
be reduced to LP
• Selecting a minimum number of attributes
(features) that are taken into account in
classification – feature selection
27
Classification Problems:
Application Examples
• Cancer Diagnosis
(Mangasarian et al., 1995:
linear separating surfaces)
• Classification of Credit Card Applications,
Bond Rating
28
Regression Problems:
General Setup
• N elements $(x_i, y_i)$, $i = 1,\dots,N$, where $x_i$ is a vector in
$\mathbb{R}^n$ and $y_i$ is a scalar in $\mathbb{R}$
• Find a linear relationship between $x_i$ and $y_i$,
i.e., find a vector $\beta$ in $\mathbb{R}^n$ such that
• We need to minimize
or
29
Regression Problems: LP
formulation
• The problem
can be reformulated as LP:
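As a sketch of the reformulation, assuming the criterion being minimized is the sum of absolute residuals (the auxiliary variables $t_i$ are introduced here for illustration):

```latex
\min_{\beta}\ \sum_{i=1}^{N} \left| y_i - \beta^{T} x_i \right|
\quad\Longleftrightarrow\quad
\begin{aligned}
\min_{\beta,\,t}\quad & \sum_{i=1}^{N} t_i \\
\text{s.t.}\quad & -t_i \le y_i - \beta^{T} x_i \le t_i, \qquad i = 1,\dots,N .
\end{aligned}
```

At an optimum each $t_i$ equals $|y_i - \beta^{T} x_i|$, so both problems have the same minimizers $\beta$. A sum-of-squares criterion, by contrast, gives a quadratic rather than a linear program.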
30
Clustering Problems
• Given a dataset, we need to assign the
elements to K clusters according to an
appropriate similarity criterion. The number of
clusters K is usually not known a priori.
• Standard algorithms for a fixed number of
clusters:
– K-median
– K-means
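For a fixed K, the K-means idea can be sketched in a few lines of plain Python. This is an illustrative version of Lloyd's alternating scheme, not code from the presentation; the function names, the toy points and the choice of passing the initial centers explicitly (which fixes K) are assumptions.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def k_means(points, centers, max_iter=100):
    """Lloyd's algorithm: alternate assignment and center update.

    points  -- list of equal-length numeric tuples
    centers -- initial centers; their count fixes K
    Returns (centers, labels).
    """
    centers = [tuple(c) for c in centers]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest center.
        new_labels = [
            min(range(len(centers)), key=lambda j: squared_distance(p, centers[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centers are too
        labels = new_labels
        # Update step: move each center to the mean of its cluster.
        for j in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return centers, labels


# Hypothetical toy data: two well-separated groups of three points.
points = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]
centers, labels = k_means(points, centers=[(0.0, 0.0), (10.0, 10.0)])
print(centers, labels)
```

A K-median variant would replace the mean in the update step with a coordinate-wise median, which makes the centers less sensitive to outliers.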
31
Integer Programming approach to
classification and regression using
clustering techniques
• CRIO software package (Bertsimas & Shioda, 2002)
• Similar approaches for both classification and
regression
• Outline
– Preprocess data by assigning points to small clusters to
reduce the dimensionality
– Solve a mixed integer problem that assigns clusters to groups
and removes outliers. In the case of regression the model also
selects the regression coefficients for each group.
– Solve continuous optimization problems (quadratic
optimization problems for classification and linear optimization
problems for regression) that assign groups to polyhedral
regions.
32
Extending MOSEL-Native Int.
• Modular environment and open
architecture
• Module = dynamic library
• Not dedicated to any particular use:
– Solvers: Xpress-Optimizer, CHIP, OptQuest
– Database access: ODBC
– System commands
33
[Diagram labels: Stochastic Modeling, Uncertainty, Stochastic Solvers, Solution Techniques]
34
Stochastic Programming (SP)
• Stochastic Programming: Decision
making under uncertainty
– Model future uncertainty into mathematical
programming as scenarios
– Make optimal decisions to hedge against
future uncertainty
35
Available features
New Types
• Svalue: Stochastic values that take different
values with certain probabilities, e.g. demand
• Smpvar: Stochastic decision variables that
take different values under different scenarios
• Slinctr: Stochastic constraints built on linear
expressions containing real, Svalue and
Smpvar
36
Example
Stage: 1 2 3
Svalue
Dem1 = 2 w.p. 0.6, 8 w.p. 0.4
Dem2 = 3 w.p. 0.3, 7 w.p. 0.6, 9 w.p. 0.1
Smpvar x1 x2 x3
Slinctr x1+x2+x3<=Inventory
x1>=Dem1
x2>=Dem2
37
Advantages
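• Automatic scenario tree generation
[Figure: scenario tree with two branches at the first stage (2, 8), each followed by three branches at the second stage]
Scenario  w.p.
1  0.18
2  0.36
3  0.06
4  0.12
5  0.24
6  0.04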
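The scenario probabilities can be reproduced by enumerating every path through the tree. The sketch below is plain Python rather than Mosel, and it assumes that Dem1 and Dem2 are independent, so that each scenario probability is the product of its branch probabilities; the function name `enumerate_scenarios` is hypothetical.

```python
from itertools import product

# Discrete distributions from the example: value -> probability.
dem1 = {2: 0.6, 8: 0.4}
dem2 = {3: 0.3, 7: 0.6, 9: 0.1}


def enumerate_scenarios(*stages):
    """Return a list of (values, probability), one entry per scenario.

    Assumes the stage distributions are independent, so a path's
    probability is the product of its branch probabilities.
    """
    scenarios = []
    for path in product(*(stage.items() for stage in stages)):
        values = tuple(value for value, _ in path)
        prob = 1.0
        for _, p in path:
            prob *= p
        scenarios.append((values, prob))
    return scenarios


scenarios = enumerate_scenarios(dem1, dem2)
for number, (values, prob) in enumerate(scenarios, start=1):
    print(number, values, round(prob, 2))
```

With two outcomes for Dem1 and three for Dem2 this yields six scenarios, and the number of scenarios grows multiplicatively with each added stage, which is why generating the tree automatically is convenient.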
38
Advantages
• Elimination of scenario-indexed entities, e.g.
T=3
x: array(1..T) of Smpvar
Dem: array(1..T-1) of Svalue
c: Slinctr
c:=sum(t in 1..T) x(t)<=Inventory
instead of
Scenarios=1..6
x: array(1..T,Scenarios) of mpvar
Dem: array(1..T-1,Scenarios) of real
c: array(Scenarios) of linctr
forall(s in Scenarios)
c(s):=sum(t in 1..T) x(t,s)<=Inventory
39
Advantages
• Elimination of writing Non-Anticipative Constraints
Scenarios=1..6
x: array(1..T,Scenarios) of mpvar
x(t,s)=x(t,s’) t=1; s,s’ ∈ {1..6} : s ≠ s’
x(t,s)=x(t,s’) t=2; s,s’ ∈ {1..3} : s ≠ s’
x(t,s)=x(t,s’) t=2; s,s’ ∈ {4..6} : s ≠ s’
[Figure: scenario tree over t = 1, 2, 3 with leaves numbered 1 to 6]
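To make concrete what would otherwise be written by hand, the sketch below derives the non-anticipativity groups from the scenario paths. It is plain Python, not Mosel. It assumes that the decision at stage t may depend only on the first t−1 realized demands, and it chains each group into consecutive equalities, which is equivalent to the pairwise form above. The scenario numbering follows the six-scenario example; the function names are hypothetical.

```python
from collections import defaultdict

# Scenario number -> realized (Dem1, Dem2) path, as in the 6-scenario example.
paths = {1: (2, 3), 2: (2, 7), 3: (2, 9), 4: (8, 3), 5: (8, 7), 6: (8, 9)}


def non_anticipativity_groups(paths, horizon):
    """Map each stage t to the groups of scenarios that must share x(t, .).

    Assumes the decision at stage t sees only the first t-1 realized
    values, so scenarios with an identical history form one group.
    """
    groups = {}
    for t in range(1, horizon + 1):
        by_history = defaultdict(list)
        for s, path in sorted(paths.items()):
            by_history[path[: t - 1]].append(s)
        # Singleton groups need no constraint.
        groups[t] = [g for g in by_history.values() if len(g) > 1]
    return groups


def constraints(groups):
    """Chain each group into equalities x(t,s) = x(t,s_next)."""
    return [
        f"x({t},{a}) = x({t},{b})"
        for t, stage_groups in groups.items()
        for g in stage_groups
        for a, b in zip(g, g[1:])
    ]


groups = non_anticipativity_groups(paths, horizon=3)
for line in constraints(groups):
    print(line)
```

For this tree the groups are {1..6} at t=1 and {1..3}, {4..6} at t=2, with no constraint needed at t=3, which matches the three constraint families listed on the slide.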
40
Statistical Preprocessing of the
Data
• In many cases, it is helpful to use
statistical preprocessing of the data
before applying mathematical
programming techniques

More Related Content

Similar to Using Xpress-Mosel for Modeling and Solving Data Mining Problems

Computational Giants_nhom.pptx
Computational Giants_nhom.pptxComputational Giants_nhom.pptx
Computational Giants_nhom.pptxThAnhonc
 
UNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptx
UNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptxUNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptx
UNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptxMinilikDerseh1
 
Machine learning and linear regression programming
Machine learning and linear regression programmingMachine learning and linear regression programming
Machine learning and linear regression programmingSoumya Mukherjee
 
Stochastic Optimization: Solvers and Tools
Stochastic Optimization: Solvers and ToolsStochastic Optimization: Solvers and Tools
Stochastic Optimization: Solvers and ToolsSSA KPI
 
6 data envelopment_analysis
6 data envelopment_analysis6 data envelopment_analysis
6 data envelopment_analysisFEG
 
Final Presentation - Edan&Itzik
Final Presentation - Edan&ItzikFinal Presentation - Edan&Itzik
Final Presentation - Edan&Itzikitzik cohen
 
Deep learning with keras
Deep learning with kerasDeep learning with keras
Deep learning with kerasMOHITKUMAR1379
 
General Tips for participating Kaggle Competitions
General Tips for participating Kaggle CompetitionsGeneral Tips for participating Kaggle Competitions
General Tips for participating Kaggle CompetitionsMark Peng
 
Automated Testing of Autonomous Driving Assistance Systems
Automated Testing of Autonomous Driving Assistance SystemsAutomated Testing of Autonomous Driving Assistance Systems
Automated Testing of Autonomous Driving Assistance SystemsLionel Briand
 
AlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in PythonAlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in PythonMark Conway
 
Machine Learning and AI: Core Methods and Applications
Machine Learning and AI: Core Methods and ApplicationsMachine Learning and AI: Core Methods and Applications
Machine Learning and AI: Core Methods and ApplicationsQuantUniversity
 
Multi-class Classification on Riemannian Manifolds for Video Surveillance
Multi-class Classification on Riemannian Manifolds for Video SurveillanceMulti-class Classification on Riemannian Manifolds for Video Surveillance
Multi-class Classification on Riemannian Manifolds for Video SurveillanceDiego Tosato
 

Similar to Using Xpress-Mosel for Modeling and Solving Data Mining Problems (20)

lecture.ppt
lecture.pptlecture.ppt
lecture.ppt
 
Computational Giants_nhom.pptx
Computational Giants_nhom.pptxComputational Giants_nhom.pptx
Computational Giants_nhom.pptx
 
UNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptx
UNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptxUNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptx
UNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptx
 
Machine learning and linear regression programming
Machine learning and linear regression programmingMachine learning and linear regression programming
Machine learning and linear regression programming
 
Stochastic Optimization: Solvers and Tools
Stochastic Optimization: Solvers and ToolsStochastic Optimization: Solvers and Tools
Stochastic Optimization: Solvers and Tools
 
Machine learning meetup
Machine learning meetupMachine learning meetup
Machine learning meetup
 
Lecture 1 (bce-7)
Lecture   1 (bce-7)Lecture   1 (bce-7)
Lecture 1 (bce-7)
 
6 data envelopment_analysis
6 data envelopment_analysis6 data envelopment_analysis
6 data envelopment_analysis
 
Final Presentation - Edan&Itzik
Final Presentation - Edan&ItzikFinal Presentation - Edan&Itzik
Final Presentation - Edan&Itzik
 
RBF2.ppt
RBF2.pptRBF2.ppt
RBF2.ppt
 
Deep learning with keras
Deep learning with kerasDeep learning with keras
Deep learning with keras
 
Manufacturing Data Analytics
Manufacturing Data AnalyticsManufacturing Data Analytics
Manufacturing Data Analytics
 
General Tips for participating Kaggle Competitions
General Tips for participating Kaggle CompetitionsGeneral Tips for participating Kaggle Competitions
General Tips for participating Kaggle Competitions
 
Automated Testing of Autonomous Driving Assistance Systems
Automated Testing of Autonomous Driving Assistance SystemsAutomated Testing of Autonomous Driving Assistance Systems
Automated Testing of Autonomous Driving Assistance Systems
 
lecture_16.pptx
lecture_16.pptxlecture_16.pptx
lecture_16.pptx
 
AlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in PythonAlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in Python
 
AlphaPy
AlphaPyAlphaPy
AlphaPy
 
Machine Learning and AI: Core Methods and Applications
Machine Learning and AI: Core Methods and ApplicationsMachine Learning and AI: Core Methods and Applications
Machine Learning and AI: Core Methods and Applications
 
Supervised Learning.pptx
Supervised Learning.pptxSupervised Learning.pptx
Supervised Learning.pptx
 
Multi-class Classification on Riemannian Manifolds for Video Surveillance
Multi-class Classification on Riemannian Manifolds for Video SurveillanceMulti-class Classification on Riemannian Manifolds for Video Surveillance
Multi-class Classification on Riemannian Manifolds for Video Surveillance
 

More from Alkis Vazacopoulos

Automatic Fine-tuning Xpress-MP to Solve MIP
Automatic Fine-tuning Xpress-MP to Solve MIPAutomatic Fine-tuning Xpress-MP to Solve MIP
Automatic Fine-tuning Xpress-MP to Solve MIPAlkis Vazacopoulos
 
Amazing results with ODH|CPLEX
Amazing results with ODH|CPLEXAmazing results with ODH|CPLEX
Amazing results with ODH|CPLEXAlkis Vazacopoulos
 
Bia project poster fantasy football
Bia project poster  fantasy football Bia project poster  fantasy football
Bia project poster fantasy football Alkis Vazacopoulos
 
NFL Game schedule optimization
NFL Game schedule optimization NFL Game schedule optimization
NFL Game schedule optimization Alkis Vazacopoulos
 
2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...
2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...
2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...Alkis Vazacopoulos
 
Very largeoptimizationparallel
Very largeoptimizationparallelVery largeoptimizationparallel
Very largeoptimizationparallelAlkis Vazacopoulos
 
Optimization Direct: Introduction and recent case studies
Optimization Direct: Introduction and recent case studiesOptimization Direct: Introduction and recent case studies
Optimization Direct: Introduction and recent case studiesAlkis Vazacopoulos
 
Informs 2016 Solving Planning and Scheduling Problems with CPLEX
Informs 2016 Solving Planning and Scheduling Problems with CPLEX Informs 2016 Solving Planning and Scheduling Problems with CPLEX
Informs 2016 Solving Planning and Scheduling Problems with CPLEX Alkis Vazacopoulos
 
Missing-Value Handling in Dynamic Model Estimation using IMPL
Missing-Value Handling in Dynamic Model Estimation using IMPL Missing-Value Handling in Dynamic Model Estimation using IMPL
Missing-Value Handling in Dynamic Model Estimation using IMPL Alkis Vazacopoulos
 
Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...
Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...
Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...Alkis Vazacopoulos
 
Industrial Modeling Service (IMS-IMPL)
Industrial Modeling Service (IMS-IMPL)Industrial Modeling Service (IMS-IMPL)
Industrial Modeling Service (IMS-IMPL)Alkis Vazacopoulos
 
Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...
Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...
Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...Alkis Vazacopoulos
 
Distillation Curve Optimization Using Monotonic Interpolation
Distillation Curve Optimization Using Monotonic InterpolationDistillation Curve Optimization Using Monotonic Interpolation
Distillation Curve Optimization Using Monotonic InterpolationAlkis Vazacopoulos
 
Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...
Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...
Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...Alkis Vazacopoulos
 
Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB) Indust...
Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB)  Indust...Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB)  Indust...
Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB) Indust...Alkis Vazacopoulos
 
Hybrid Dynamic Simulation (HDS) Industrial Modeling Framework (HDS-IMF)
Hybrid Dynamic Simulation (HDS)  Industrial Modeling Framework (HDS-IMF)Hybrid Dynamic Simulation (HDS)  Industrial Modeling Framework (HDS-IMF)
Hybrid Dynamic Simulation (HDS) Industrial Modeling Framework (HDS-IMF)Alkis Vazacopoulos
 

More from Alkis Vazacopoulos (20)

Automatic Fine-tuning Xpress-MP to Solve MIP
Automatic Fine-tuning Xpress-MP to Solve MIPAutomatic Fine-tuning Xpress-MP to Solve MIP
Automatic Fine-tuning Xpress-MP to Solve MIP
 
Amazing results with ODH|CPLEX
Amazing results with ODH|CPLEXAmazing results with ODH|CPLEX
Amazing results with ODH|CPLEX
 
Bia project poster fantasy football
Bia project poster  fantasy football Bia project poster  fantasy football
Bia project poster fantasy football
 
NFL Game schedule optimization
NFL Game schedule optimization NFL Game schedule optimization
NFL Game schedule optimization
 
2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...
2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...
2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...
 
Posters 2017
Posters 2017Posters 2017
Posters 2017
 
Very largeoptimizationparallel
Very largeoptimizationparallelVery largeoptimizationparallel
Very largeoptimizationparallel
 
Retail Pricing Optimization
Retail Pricing Optimization Retail Pricing Optimization
Retail Pricing Optimization
 
Optimization Direct: Introduction and recent case studies
Optimization Direct: Introduction and recent case studiesOptimization Direct: Introduction and recent case studies
Optimization Direct: Introduction and recent case studies
 
Informs 2016 Solving Planning and Scheduling Problems with CPLEX
Informs 2016 Solving Planning and Scheduling Problems with CPLEX Informs 2016 Solving Planning and Scheduling Problems with CPLEX
Informs 2016 Solving Planning and Scheduling Problems with CPLEX
 
ODHeuristics
ODHeuristicsODHeuristics
ODHeuristics
 
Missing-Value Handling in Dynamic Model Estimation using IMPL
Missing-Value Handling in Dynamic Model Estimation using IMPL Missing-Value Handling in Dynamic Model Estimation using IMPL
Missing-Value Handling in Dynamic Model Estimation using IMPL
 
Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...
Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...
Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...
 
Industrial Modeling Service (IMS-IMPL)
Industrial Modeling Service (IMS-IMPL)Industrial Modeling Service (IMS-IMPL)
Industrial Modeling Service (IMS-IMPL)
 
Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...
Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...
Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...
 
Xmr im
Xmr imXmr im
Xmr im
 
Distillation Curve Optimization Using Monotonic Interpolation
Distillation Curve Optimization Using Monotonic InterpolationDistillation Curve Optimization Using Monotonic Interpolation
Distillation Curve Optimization Using Monotonic Interpolation
 
Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...
Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...
Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...
 
Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB) Indust...
Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB)  Indust...Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB)  Indust...
Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB) Indust...
 
Hybrid Dynamic Simulation (HDS) Industrial Modeling Framework (HDS-IMF)
Hybrid Dynamic Simulation (HDS)  Industrial Modeling Framework (HDS-IMF)Hybrid Dynamic Simulation (HDS)  Industrial Modeling Framework (HDS-IMF)
Hybrid Dynamic Simulation (HDS) Industrial Modeling Framework (HDS-IMF)
 

Recently uploaded

B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 

Recently uploaded (20)

B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 

Using Xpress-Mosel for Modeling and Solving Data Mining Problems

  • 1.
  • 2. 2 Using Xpress-Mosel for Modeling and Solving Data Mining Problems Alkis Vazacopoulos Dash Optimization
  • 3. 3 Agenda • New Customers • Mosel (modeling environment) • IVE (Integrated Visual Env.) • Optimization Technologies • Data Mining Problems & applications • Applications
  • 4. 4 New Customers • Frito-Lay • Carmen Systems • Du Pont • Deutsche Bank • Siemens • Toyota
  • 5. 5 Mosel: key features • Integration of modeling and solving • Programming facilities for pre/post processing, algorithms – No separation between modeling statement and procedure to solve the problem • Open, modular architecture • Highly flexible and extensible
  • 6. 6 Embedding a Mosel model Problem Solving Program Starts Program Terminates Model Execution Result Retrieval Data Input Output Results
  • 7. 7 Mosel: A modeling language • Decision variables, linear constraints • Arrays, (index-) sets • Operators: standard arithmetic, aggregate and set operators e.g. sum, prod, max, and, or, union, inter • Loops, Selections e.g. forall-[do], if-then-[elif-then] • Subroutines: functions and procedures
  • 8. 8 Architecture LP Xpress-Mosel e.g. decision variables and constracts, etc. Enterprise dataEnterprise data • Customer history • Available products • Profitability models • Content • …Pre- and Post- processing Algorithms e.g.Optimal solutions MIP Constraint Programming Stochastic Programming
  • 9. Mosel: components and interfaces
      • Mosel Language: to implement problems and solution algorithms → Model or Mosel program
      • Mosel Model Compiler and Run-time Libraries: to compile, execute and access models from a programming language → C, C++, Java or VB program
      • Mosel Native Interface (NI): to provide new or extend existing functionality of the Mosel Language → module
  • 11. Xpress-IVE
      • Development environment
      • Enables rapid prototyping and testing
      • Entity tree for data, variables and constraints
      • Matrix visualization
      • Branch and Bound tree visualization
      • LP, MIP and user-defined charts
  • 15. Data Mining Application Areas: extracting useful information from large datasets of various nature and origin arising in
      • Finance
      • Manufacturing
      • Biomedicine
      • Telecommunications
      • Military Systems
      • Other areas
  • 16. Problems
      • Revealing internal structure and patterns of the data:
          – Classification
          – Regression
          – Clustering
  • 17. Approaches
      • LP, MIP
      • QP, MIQP
      • Network Optimization
      • Statistical Preprocessing
      • Combinations of these approaches
  • 18. Classification Problems: general setup
      • “Training dataset”: N elements $(x_i, y_i)$, $i = 1,\dots,N$
          – $x_i$ is an n-dimensional vector of the element’s attributes (features)
          – $y_i$ denotes the class attribute (the number of classes is specified)
  • 19. Classification Problems: general setup
      • A new element with known attributes x, but unknown class attribute y
      • The problem is to determine which class this element belongs to
      • The classification model is “trained” on the training dataset and applied to new elements
  • 20. Classification Problems: general setup
      • Main idea: constructing separating surfaces in the n-dimensional space that divide it into several regions
      • Each region corresponds to a certain class
      • The new element is classified according to its geometrical location in the vector space
  • 22. Classification Problems: LP approach
      • Consider binary classification, one separating plane
      • The plane is represented by the standard equation
      • The problem is to find the optimal values of the parameters w and γ
  • 23. Classification Problems: LP approach
      • Suppose that vectors $x_i$ from the training dataset are stored in two matrices A (m×n) and B (k×n), corresponding to m elements of the 1st class and k elements of the 2nd class
      • The plane will perfectly separate elements in A and elements in B if
  • 24. Classification Problems: LP approach
      • Extra variables y and z are introduced to model classification errors:
      • The parameters w and γ are determined from the LP problem of minimizing the total misclassification error
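The equations on slides 22 to 24 were images and did not survive in this transcript. As a hedged reconstruction, the block below shows the robust LP separation model of Bennett and Mangasarian, which is the usual formulation behind this notation (w, γ, A, B, y, z). The vector of ones e, the unit-margin normalization and the weights 1/m and 1/k are assumptions taken from that standard formulation, not from the slides. Here y and z are the error vectors of slide 24, not the class attribute of slide 18.

```latex
% Separating plane (w in R^n, gamma scalar):
x^{T} w = \gamma

% Perfect separation of the rows of A from the rows of B,
% after scaling (w, gamma) to a unit margin; e is a vector of ones:
A w \ge e\gamma + e, \qquad B w \le e\gamma - e

% LP with error variables y in R^m and z in R^k:
\min_{w,\gamma,y,z} \; \frac{1}{m} e^{T} y + \frac{1}{k} e^{T} z
\quad \text{s.t.} \quad
A w + y \ge e\gamma + e, \quad
B w - z \le e\gamma - e, \quad
y \ge 0, \; z \ge 0
```

If the two classes are linearly separable, the optimal objective value is zero and y = z = 0; otherwise the positive entries of y and z measure how far the violating points lie on the wrong side of their margin.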
  • 26. Classification Problems: generalized approaches
      • Using multiple, non-linear separating surfaces (e.g., polynomial, exponential, logarithmic)
          – Finding the parameters of these surfaces can also be reduced to LP
      • Feature selection: selecting a minimum number of attributes (features) that are taken into account in classification
  • 27. Classification Problems: Application Examples
      • Cancer diagnosis (Mangasarian et al., 1995: linear separating surfaces)
      • Classification of credit card applications, bond rating
  • 28. Regression Problems: General Setup
      • N elements $(x_i, y_i)$, $i = 1,\dots,N$; $x_i$ is a vector in $\mathbb{R}^n$, $y_i$ is a scalar in $\mathbb{R}$
      • Find a linear relationship between $x_i$ and $y_i$, i.e., find a vector $\beta$ in $\mathbb{R}^n$, such that
      • We need to minimize or
  • 29. Regression Problems: LP formulation
      • The problem can be reformulated as LP:
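The formulas on slides 28 and 29 were images and are missing from this transcript, so the two objectives the slide offers cannot be recovered. As a sketch under that caveat, the block below shows one objective that does reduce to an LP: the sum of absolute residuals (least absolute deviations). The auxiliary variables δ_i are introduced here for illustration and are not from the slides.

```latex
% Linear model sought:
y_i \approx x_i^{T} \beta, \qquad i = 1,\dots,N

% Least-absolute-deviations objective:
\min_{\beta} \; \sum_{i=1}^{N} \left| y_i - x_i^{T} \beta \right|

% Equivalent LP with one auxiliary variable delta_i per residual:
\min_{\beta,\delta} \; \sum_{i=1}^{N} \delta_i
\quad \text{s.t.} \quad
-\delta_i \le y_i - x_i^{T} \beta \le \delta_i, \qquad i = 1,\dots,N
```

At an optimum each δ_i equals the absolute residual of element i, because the objective pushes δ_i down onto the larger of the two bounds.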
  • 30. Clustering Problems
      • Given a dataset, we need to assign the elements to K clusters according to an appropriate similarity criterion. The number of clusters K is usually not known a priori.
      • Standard algorithms for a fixed number of clusters:
          – K-median
          – K-means
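To make the fixed-K case concrete, here is a minimal pure-Python sketch of K-means (Lloyd's algorithm). The data points, the initial centers and the function names are made up for illustration; none of them come from the slides.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def assign(points, centers):
    """Assign every point to the cluster of its nearest center."""
    clusters = [[] for _ in centers]
    for p in points:
        nearest = min(range(len(centers)),
                      key=lambda j: squared_distance(p, centers[j]))
        clusters[nearest].append(p)
    return clusters

def k_means(points, centers, max_iterations=100):
    """Alternate assignment and center update until the centers stop moving."""
    centers = list(centers)
    for _ in range(max_iterations):
        clusters = assign(points, centers)
        # New center = coordinate-wise mean; an empty cluster keeps its old center.
        new_centers = [
            tuple(sum(c) / len(cluster) for c in zip(*cluster)) if cluster else center
            for cluster, center in zip(clusters, centers)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, assign(points, centers)

# Hypothetical toy data: two well-separated groups in the plane.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centers, clusters = k_means(points, centers=[(0.0, 0.0), (10.0, 10.0)])
print(centers)
print(clusters)
```

K-median follows the same loop but replaces the coordinate-wise mean with the coordinate-wise median and typically uses the L1 distance.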
  • 31. Integer Programming approach to classification and regression using clustering techniques
      • CRIO software package (Bertsimas & Shioda, 2002)
      • Similar approaches for both classification and regression
      • Outline
          – Preprocess data by assigning points to small clusters to reduce the dimensionality
          – Solve a mixed integer problem that assigns clusters to groups and removes outliers. In the case of regression, the model also selects the regression coefficients for each group.
          – Solve continuous optimization problems (quadratic optimization problems for classification and linear optimization problems for regression) that assign groups to polyhedral regions
  • 32. Extending Mosel: Native Interface
      • Modular environment and open architecture
      • Modules = dynamic libraries
      • Not dedicated to any particular use:
          – Solvers: Xpress-Optimizer, CHIP, OptQuest
          – Database access: ODBC
          – System commands
  • 34. Stochastic Programming (SP)
      • Stochastic Programming: decision making under uncertainty
          – Model future uncertainty in a mathematical program as scenarios
          – Make optimal decisions to hedge against future uncertainty
  • 35. Available features: new types
      • Svalue: stochastic values that take different values with certain probabilities, e.g. demand
      • Smpvar: stochastic decision variables that take different values under different scenarios
      • Slinctr: stochastic constraints built on linear expressions containing real, Svalue and Smpvar
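  • 36. Example (stages 1, 2, 3)
      • Svalue:
          – Dem1 = 2 w.p. 0.6, 8 w.p. 0.4
          – Dem2 = 3 w.p. 0.3, 7 w.p. 0.6, 9 w.p. 0.1
      • Smpvar: x1, x2, x3
      • Slinctr:
          – x1 + x2 + x3 <= Inventory
          – x1 >= Dem1
          – x2 >= Dem2
  • 37. Advantages
      • Automatic scenario tree generation (tree diagram: two Dem1 branches, 2 and 8, each followed by three Dem2 branches)
      • Scenario probabilities: scenario 1 w.p. 0.18; 2 w.p. 0.36; 3 w.p. 0.06; 4 w.p. 0.12; 5 w.p. 0.24; 6 w.p. 0.04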
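The six scenario probabilities on slide 37 are the products of the stage-wise probabilities from the example on slide 36. The Python sketch below reproduces that enumeration; it assumes Dem1 and Dem2 are independent, which is consistent with the numbers on the slide. The function name `build_scenarios` is illustrative and is not part of Mosel.

```python
from itertools import product

# Stage-wise demand distributions from the example slide: value -> probability.
dem1 = {2: 0.6, 8: 0.4}
dem2 = {3: 0.3, 7: 0.6, 9: 0.1}

def build_scenarios(*stages):
    """Enumerate every path through independent stage distributions.

    Returns a list of (values, probability) pairs, one per scenario, in the
    order produced by taking the stages left to right.
    """
    scenarios = []
    for path in product(*(stage.items() for stage in stages)):
        values = tuple(value for value, _ in path)
        probability = 1.0
        for _, p in path:
            probability *= p
        scenarios.append((values, probability))
    return scenarios

scenarios = build_scenarios(dem1, dem2)
for number, (values, probability) in enumerate(scenarios, start=1):
    print(f"scenario {number}: Dem1={values[0]}, Dem2={values[1]}, w.p. {probability:.2f}")
```

Scenario 1 is the path Dem1 = 2, Dem2 = 3 with probability 0.6 × 0.3 = 0.18, and the six probabilities sum to 1.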
  • 38. Advantages
      • Elimination of scenario-indexed entities, e.g. with T=3:
          x: array(1..T) of Smpvar
          Dem: array(1..T-1) of Svalue
          c: Slinctr
          c := sum(t in 1..T) x(t) <= Inventory
        instead of
          Scenarios = 1..6
          x: array(1..T, Scenarios) of mpvar
          Dem: array(1..T-1, Scenarios) of real
          c: array(Scenarios) of linctr
          forall(s in Scenarios) c(s) := sum(t in 1..T) x(t,s) <= Inventory
  • 39. Advantages
      • No need to write non-anticipativity constraints by hand. With Scenarios = 1..6 and x: array(1..T, Scenarios) of mpvar, these would be:
          x(t,s) = x(t,s’) for t=1; s,s’ ∈ {1..6}, s ≠ s’
          x(t,s) = x(t,s’) for t=2; s,s’ ∈ {1..3}, s ≠ s’
          x(t,s) = x(t,s’) for t=2; s,s’ ∈ {4..6}, s ≠ s’
      • (Tree diagram: scenarios 1 to 6 over stages t = 1, 2, 3)
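The grouping behind these constraints can be derived mechanically from the scenario tree: two scenarios must share the stage-t decision whenever they have the same demand history before stage t. The Python sketch below illustrates this for the six scenarios of the example. It assumes the decision x(t) is taken before Dem_t is observed, which matches the groups on the slide. The function names are hypothetical and the output is plain text, not Mosel syntax.

```python
from collections import defaultdict
from itertools import product

dem1_values = [2, 8]
dem2_values = [3, 7, 9]
# Scenario number (1-based) -> realized demand path (Dem1, Dem2).
paths = {s: path for s, path in enumerate(product(dem1_values, dem2_values), start=1)}

def nonanticipativity_groups(paths, t):
    """Group scenarios that share the same demand history before stage t."""
    groups = defaultdict(list)
    for s in sorted(paths):
        history = paths[s][:t - 1]  # demands observed in stages 1..t-1
        groups[history].append(s)
    # Groups of size one need no constraint.
    return [members for members in groups.values() if len(members) > 1]

def nonanticipativity_constraints(paths, stages):
    """Link consecutive scenarios in each group; equality is transitive."""
    constraints = []
    for t in range(1, stages + 1):
        for members in nonanticipativity_groups(paths, t):
            for s, s_next in zip(members, members[1:]):
                constraints.append(f"x({t},{s}) = x({t},{s_next})")
    return constraints

for line in nonanticipativity_constraints(paths, stages=3):
    print(line)
```

Chaining consecutive scenarios produces 9 equalities rather than one per pair s ≠ s’; both describe the same feasible set.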
  • 40. Statistical Preprocessing of the Data
      • In many cases, it is helpful to use statistical preprocessing of the data before applying mathematical programming techniques