This document discusses using the Xpress-Mosel modeling environment for solving data mining problems. It provides an overview of key Mosel features like integration of modeling and solving. It then discusses how Mosel can be used to model various common data mining problems like classification, regression, and clustering as optimization problems. These include formulations as linear programs, mixed integer programs, and stochastic programs.
Mosel: key features
• Integration of modeling and solving
• Programming facilities for pre-/post-processing and algorithms
  – No separation between the modeling statement and the procedure to solve the problem
• Open, modular architecture
• Highly flexible and extensible
Embedding a Mosel model
[Diagram: life cycle of an embedded model — the host program starts, performs data input, executes the model to solve the problem, retrieves the results, outputs them, and terminates.]
Mosel: a modeling language
• Decision variables, linear constraints
• Arrays, (index) sets
• Operators: standard arithmetic, aggregate and set operators, e.g. sum, prod, max, and, or, union, inter
• Loops and selections, e.g. forall-[do], if-then-[elif-then]
• Subroutines: functions and procedures
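These language features can be illustrated with a small self-contained model. This is a hypothetical toy example, not taken from the slides; it assumes the mmxprs solver module is available:

```
model "toy_lp"
  uses "mmxprs"                          ! Xpress-Optimizer solver module

  declarations
    REGIONS = 1..3                       ! an index set
    demand: array(REGIONS) of real       ! a data array
    ship: array(REGIONS) of mpvar        ! decision variables
  end-declarations

  demand :: [20, 35, 15]

  ! loop over the index set to state linear constraints
  forall(r in REGIONS) ship(r) >= demand(r)

  ! aggregate operator "sum" in the objective; solve as an LP
  minimize(sum(r in REGIONS) ship(r))

  writeln("Total shipped: ", getobjval)  ! post-processing / output
end-model
```

Note how the constraint statements and the call to minimize live in the same file: there is no separation between the modeling statement and the solution procedure.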
Architecture
[Diagram: Xpress-Mosel at the center, holding the model entities (e.g. decision variables, constraints, etc.); connected to solver technologies — LP, MIP, Constraint Programming, Stochastic Programming; to pre- and post-processing algorithms producing e.g. optimal solutions; and to enterprise data: customer history, available products, profitability models, content, …]
Mosel: components and interfaces
• Mosel Language: to implement problems and solution algorithms
  → model or Mosel program
• Mosel Model Compiler and Run-time Libraries: to compile, execute and access models from a programming language
  → C, C++, Java or VB program
• Mosel Native Interface (NI): to provide new functionality or extend existing functionality of the Mosel language
  → module
Xpress-IVE
• Development environment
• Enables rapid prototyping and testing
• Entity tree for data, variables and constraints
• Matrix visualization
• Branch-and-bound tree visualization
• LP, MIP and user-defined charts
Data Mining Application Areas
Extracting useful information from large datasets of various natures and origins arising in
• Finance
• Manufacturing
• Biomedicine
• Telecommunications
• Military Systems
• Other areas
Approaches
• LP, MIP
• QP, MIQP
• Network Optimization
• Statistical Preprocessing
• Combinations of these Approaches
Classification Problems: general setup
• "Training dataset": N elements (xi, yi), i = 1,…,N
  – xi is an n-dimensional vector of the element's attributes (features)
  – yi denotes the class attribute (the number of classes is specified)
Classification Problems: general setup
• A new element arrives with known attributes x but unknown class attribute y
• The problem is to determine which class this element belongs to
• The classification model is "trained" on the training dataset and applied to new elements
Classification Problems: general setup
• Main idea: construct separating surfaces in the n-dimensional space that divide it into several regions
• Each region corresponds to a certain class
• The new element is classified according to its geometric location in the vector space
Classification Problems: LP approach
• Consider binary classification with one separating plane
• The plane is represented by the standard equation w·x = γ, where w is the normal vector and γ is the threshold
• The problem is to find the optimal values of the parameters w and γ
Classification Problems: LP approach
• Suppose that the vectors xi from the training dataset are stored in two matrices A (m×n) and B (k×n), corresponding to the m elements of the 1st class and the k elements of the 2nd class
• The plane will perfectly separate the elements in A from the elements in B if Aw ≥ eγ + e and Bw ≤ eγ − e (componentwise, where e denotes a vector of ones)
Classification Problems: LP approach
• Extra variables y and z are introduced to model the classification errors of the elements in A and B, respectively
• The parameters w and γ are determined from the LP problem of minimizing the total misclassification error
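The LP itself did not survive extraction; a standard robust-LP formulation consistent with the setup above (in the spirit of Bennett and Mangasarian's linear-programming discrimination) reads:

```latex
\begin{aligned}
\min_{w,\,\gamma,\;y \ge 0,\;z \ge 0}\quad
  & \frac{1}{m}\sum_{i=1}^{m} y_i \;+\; \frac{1}{k}\sum_{j=1}^{k} z_j \\
\text{s.t.}\quad
  & Aw - e\gamma + y \ge e \\
  & -Bw + e\gamma + z \ge e
\end{aligned}
```

Here y and z absorb the violations of the perfect-separation conditions Aw ≥ eγ + e and Bw ≤ eγ − e, so the objective is the average misclassification error per class.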
Classification Problems: generalized approaches
• Using multiple, non-linear separating surfaces (e.g., polynomial, exponential, logarithmic)
  – Finding the parameters of these surfaces can also be reduced to LP
• Selecting a minimum number of attributes (features) to take into account in classification (feature selection)
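The slides do not show how feature selection is formulated; one common device (an illustrative assumption, not necessarily the formulation intended here) augments the misclassification objective with a weighted 1-norm of w, which drives many components of w to zero and thus removes the corresponding attributes:

```latex
\min_{w,\,\gamma}\;\; \text{(misclassification error)} \;+\; \lambda \sum_{j=1}^{n} |w_j|,
\qquad
|w_j| \text{ linearized via } -v_j \le w_j \le v_j,\; v_j \ge 0,
```

which keeps the problem an LP.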
Regression Problems: General Setup
• N elements (xi, yi), i = 1,…,N; xi is a vector in Rn, yi is a scalar in R
• Find a linear relationship between xi and yi, i.e., find a vector β in Rn such that xi·β ≈ yi for all i
• We need to minimize Σi |xi·β − yi| (which reduces to an LP) or Σi (xi·β − yi)² (least squares, a QP)
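The 1-norm objective can be modeled directly as an LP in Mosel. The following is a minimal hypothetical sketch with invented toy data (it assumes the mmxprs module; "dev" are auxiliary variables for the absolute deviations):

```
model "l1_regression"
  uses "mmxprs"

  declarations
    N = 4                            ! observations (toy data for illustration)
    DIM = 2                          ! attributes, incl. a constant column
    x: array(1..N, 1..DIM) of real
    y: array(1..N) of real
    beta: array(1..DIM) of mpvar     ! regression coefficients
    dev: array(1..N) of mpvar        ! dev(i) >= |x(i)·beta - y(i)|
  end-declarations

  x :: [1,1, 2,1, 3,1, 4,1]
  y :: [2.1, 3.9, 6.0, 8.1]

  forall(j in 1..DIM) beta(j) is_free   ! coefficients may be negative

  forall(i in 1..N) do
    dev(i) >=   sum(j in 1..DIM) x(i,j)*beta(j) - y(i)
    dev(i) >= -(sum(j in 1..DIM) x(i,j)*beta(j) - y(i))
  end-do

  minimize(sum(i in 1..N) dev(i))       ! total absolute deviation
  writeln("beta: ", getsol(beta(1)), " ", getsol(beta(2)))
end-model
```

At the optimum each dev(i) equals the absolute residual of observation i, so the LP minimizes the sum of absolute errors.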
Clustering Problems
• Given a dataset, we need to assign the elements to K clusters according to an appropriate similarity criterion. The number of clusters K is usually not known a priori.
• Standard algorithms for a fixed number of clusters:
  – k-median
  – k-means
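For reference, in one common statement these heuristics aim to minimize the following within-cluster objectives over the cluster centers c1,…,cK:

```latex
\text{k-means:}\quad \min_{c_1,\dots,c_K} \sum_{i=1}^{N} \min_{k} \lVert x_i - c_k \rVert_2^2
\qquad\qquad
\text{k-median:}\quad \min_{c_1,\dots,c_K} \sum_{i=1}^{N} \min_{k} \lVert x_i - c_k \rVert_1
```

(k-median variants often additionally restrict the centers to be data points.)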
Integer Programming approach to classification and regression using clustering techniques
• CRIO software package (Bertsimas & Shioda, 2002)
• Similar approaches for both classification and regression
• Outline
  – Preprocess the data by assigning points to small clusters to reduce the dimensionality
  – Solve a mixed integer problem that assigns clusters to groups and removes outliers; in the case of regression, the model also selects the regression coefficients for each group
  – Solve continuous optimization problems (quadratic optimization problems for classification and linear optimization problems for regression) that assign groups to polyhedral regions
Extending Mosel: the Native Interface
• Modular environment and open architecture
• A module is a dynamic library
• Not dedicated to any particular use:
  – Solvers: Xpress-Optimizer, CHIP, OptQuest
  – Database access: ODBC
  – System commands
Stochastic Programming (SP)
• Stochastic Programming: decision making under uncertainty
  – Model future uncertainty in the mathematical program as scenarios
  – Make optimal decisions to hedge against the future
Available features
New types
• Svalue: stochastic values that take different values with certain probabilities, e.g. demand
• Smpvar: stochastic decision variables that take different values under different scenarios
• Slinctr: stochastic constraints built on linear expressions containing reals, Svalues and Smpvars
Advantages
• Elimination of scenario-indexed entities, e.g. with T = 3:

  x: array(1..T) of Smpvar
  Dem: array(1..T-1) of Svalue
  c: Slinctr
  c := sum(t in 1..T) x(t) <= Inventory

  instead of

  Scenarios = 1..6
  x: array(1..T, Scenarios) of mpvar
  Dem: array(1..T-1, Scenarios) of real
  c: array(Scenarios) of linctr
  forall(s in Scenarios)
    c(s) := sum(t in 1..T) x(t,s) <= Inventory
Statistical Preprocessing of the Data
• In many cases, it is helpful to use statistical preprocessing of the data before applying mathematical programming techniques