Query Processing and Query Optimization (Niraj Gandha)
This presentation on query processing and query optimization uses the most basic, fundamental examples and topics for the explanation.
Generalized Linear Models in Spark MLlib and SparkR (Databricks)
Generalized linear models (GLMs) unify various statistical models such as linear regression and logistic regression through the specification of a model family and link function. They are widely used in modeling, inference, and prediction with applications in numerous fields. In this talk, we will summarize recent community efforts in supporting GLMs in Spark MLlib and SparkR. We will review supported model families, link functions, and regularization types, as well as their use cases, e.g., logistic regression for classification and log-linear model for survival analysis. Then we discuss the choices of solvers and their pros and cons given training datasets of different sizes, and implementation details in order to match R’s model output and summary statistics. We will also demonstrate the APIs in MLlib and SparkR, including R model formula support, which make building linear models a simple task in Spark. This is joint work with Eric Liang, Yanbo Liang, and some other Spark contributors.
Inter Task Communication On Volatile Nodes (nagarajan_ka)
Idle desktop computers are already used for high-performance computing, but their wider use for parallel computing is limited by the available programming models. We have built a new communication library that facilitates execution of parallel scientific applications on virtual clusters composed of volatile, ordinary PC nodes.
STATISTICAL APPROACH TO DETERMINE MOST EFFICIENT VALUE FOR TIME QUANTUM IN RO... (ijcsit)
Scheduling various processes is one of the most fundamental functions of the operating system. In that context, one of the most common scheduling algorithms used in most operating systems is the Round Robin method, in which the ready processes waiting in the ready queue take control of the processor circularly, each for a short period of time known as the time quantum (or time slice). Here we discuss the use of statistics and develop a mathematical model to determine the most efficient value for the time quantum. This is strictly theoretical, as we do not know the times of the various processes beforehand. However, the proposed approach is compared with recently developed algorithms in this regard to determine the efficiency of the proposed algorithm.
A report on designing a model for improving CPU Scheduling by using Machine L... (MuskanRath1)
Disclaimer: Please let me know if any portion of this article matches your research; I will include a link to your research in the description section of my article.
Description:
Our paper proposes a model for improving CPU scheduling on a uniprocessor system. The model is implemented in a low-level (assembly) language, and Linux is used for the implementation because it is an open-source environment with an editable kernel.
There are several methods to predict the length of CPU bursts, such as the exponential averaging method; however, these methods may not give accurate or reliable predictions. In this paper, we propose a Machine Learning (ML) based approach to estimate the length of CPU bursts for processes. We use Bayesian theory in our model as a classifier tool that decides which process in the ready queue executes first. The proposed approach selects the most significant attributes of the process using feature selection techniques and then predicts the CPU burst for the process in the grid. Furthermore, applying attribute selection techniques improves performance in terms of space, time, and estimation.
Now in focus: the linear control system functions in MATLAB.
CONTENTS:
INTRODUCTION OF MATLAB
CONTROL SYSTEM TOOLBOX
TRANSFER FUNCTION
Poles & Zeroes
Multiplication Of Transfer Functions
Closed-loop Transfer Function
TIME RESPONSE OF A CONTROL SYSTEM
Impulse
Step
Ramp
STATE SPACE REPRESENTATION
State space to transfer function
Transfer function to state space
Presented at Proc. of the 11th International Conference on Information Integration and Web-based Applications & Services (iiWAS2009), Kuala Lumpur, Malaysia, December 14-16, 2009
PhD defense, Department of Information Technology, Uppsala University, Sweden (Sabesan Manivasakan)
Querying Data Providing Web Services
Manivasakan Sabesan
Department of Information Technology
Uppsala University
Sweden.
Abstract
Web services are often used for search computing, where data is retrieved from servers providing information of different kinds. Such data-providing web services return a set of objects for a given set of parameters without any side effects. There is a need to enable general and scalable search capabilities over data from data-providing web services, which is the topic of this thesis.
The Web Service MEDiator (WSMED) system automatically provides relational views of any data-providing web service operations by reading the WSDL documents describing them. These views can be queried with SQL. Without any knowledge of the costs of executing specific web service operations, the WSMED query processor automatically and adaptively finds an optimized parallel execution plan calling the queried data-providing web services.
For scalable execution of queries to data-providing web services, an algebra operator, PAP, adaptively parallelizes calls to web service operations in execution plans until no significant performance improvement is measured, based on monitoring the flow from the web service operations without any cost knowledge or extensive memory usage.
To comply with the Everything as a Service (XaaS) paradigm, WSMED is itself implemented as a web service that provides operations to query and combine data from data-providing web services. A web-based demonstration of the WSMED web service allows general SQL queries to any data-providing web service operations from a browser.
WSMED assumes that all queried data sources are available as web services. To turn any data-providing system into a data-providing web service, WSMED includes a subsystem, the web service generator, which generates and deploys the web service operations needed to access a data source. The WSMED web service is itself generated by the web service generator.
A Study of the Behavior of Floating-Point Errors (ijpla)
The dangers of programs performing floating-point computations are well known. They stem from numerical reliability issues resulting from rounding errors arising during the computations. In general, these round-off errors are neglected because they are small. However, they can accumulate and propagate, leading to faulty execution and failures. In critical embedded systems, such faults may cause dramatic damage (e.g., the failures of the Ariane 5 launch and the Patriot missile system). The ufp (unit in the first place) and ulp (unit in the last place) functions are used to estimate the maximum value of round-off errors. In this paper, we study the behavior of round-off errors, check their numerical stability using a set of constraints, and ensure that the computed round-off errors do not grow larger when solving constraints on the ufp and ulp values.
Timo Klerx and Kalman Graffi. Bootstrapping Skynet: Calibration and Autonomic Self-Control of Structured Peer-to-Peer Networks. In IEEE P2P ’13: Proceedings of the International Conference on Peer-to-Peer Computing, 2013.
Abstract—Peer-to-peer systems scale to millions of nodes and provide routing and storage functions with best effort quality. In order to provide a guaranteed quality of the overlay functions, even under strong dynamics in the network with regard to peer capacities, online participation and usage patterns, we propose to calibrate the peer-to-peer overlay and to autonomously learn which qualities can be reached. For that, we simulate the peer-to-peer overlay systematically under a wide range of parameter configurations and use neural networks to learn the effects of the configurations on the quality metrics. Thus, by choosing a specific quality setting by the overlay operator, the network can tune itself to the learned parameter configurations that lead to the desired quality. Evaluation shows that the presented self-calibration succeeds in learning the configuration-quality interdependencies and that peer-to-peer systems can learn and adapt their behavior according to desired quality goals.
E2 – Fundamentals, Functions & Arrays (jacksnathalie)
E2 – Fundamentals, Functions & Arrays
Please refer to announcements for details about this exam. Make sure you fill in the information below so that you are graded properly:
Last Name
First Name
Student ID #
Here is the grading matrix where the TA will leave feedback. If you scored 100%, you will most likely not see any feedback.
Question         Max points   Scored points   Feedback
1 - Tracing      3
2 - Testing      2
3 - Refactoring  2
4 - Debugging    3
Interlude – How to Trace Recursive Programs
To help you fill the trace table in question #1, we start with an example of a small recursive program & provide you with the trace table we expect you to draw.
Example Program to Trace
This program implements a power function recursively. We do not have local variables in either function, thus making the information in each activation record a bit shorter.
 1  #include <stdio.h>
 2  #include <stdlib.h>
 3  int pwr(int x, int y){
 4      if( y == 0 )
            return 1;
 5      if( y == 1 )
            return x;
 6      return x * pwr(x, y-1);
 7  }
 8  int main(){
 9      printf("%d to power %d is %d\n", 5, 3, pwr(5,3));
10      return EXIT_SUCCESS;
11  }
Example Trace Table
Please note the following about the table below:
· We only write down the contents of the stack when it is changed, i.e. when we enter a function or assign a new value to a variable in the current activation record.
· When we write the contents of the stack, we write the contents of the whole stack, including previous activation records.
· Each activation record is identified by a bold line specifying the name of the function & the parameters passed to it when it was invoked.
· It is followed by a bullet list with one item per parameter or local variable.
· New activation records are added at the end of the contents of the previous stack
Line #  What happens?                                      Stack is
10      Entering main function                             main's activation record
                                                             · No local vars / parameters
11      Invoking function pwr as part of                   Stack is the same, no need to repeat it
        executing the printf
4       Entering pwr function with arguments 5 & 3         main's activation record
                                                             · No local vars / parameters
                                                           pwr(5,3) activation record
                                                             · x is 5
                                                             · y is 3
5       Testing if y is 0 -> false
6       Testing if y is 1 -> false
7       Invoking pwr(5,2) as part of return statement
4       Entering pwr function with arguments 5 & 2         main's activation record
                                                             · No local vars / parameters
                                                           pwr(5,3) activation record
                                                             · x is 5
                                                             · y is 3
                                                           pwr(5,2) activation record
                                                             · x is 5
                                                             · y is 2
5       Testing if y is 0 -> false
6       Testing if y is 1 -> false
7       Invoking pwr(5,1)
4       Entering pwr function with arguments 5 & 1         main's activation record
                                                             · No local vars / parameters
                                                           pwr(5,3) activation record
                                                             · x is 5
                                                             · y is 3
                                                           pwr(5,2) activation record
                                                             · x is 5
                                                             · y is 2
                                                           pwr(5,1) activation record
                                                             · x is 5
                                                             · y is 1
5       Testing if y is 0 -> false
6       Testing if y is 1 -> true
6       Return value x which is 5
7       Back from invocation of pwr(5,1) with result 5.    main's activation record
                                                             · No local vars / parameters
                                                           pwr(5,3) activation r ...
Big Data Day LA 2016 / Big Data Track - Portable Stream and Batch Processing w... (Data Con LA)
This talk explores deploying a series of small and large batch and streaming pipelines locally, to Spark and Flink clusters and to Google Cloud Dataflow services to give the audience a feel for the portability of Beam, a new portable Big Data processing framework recently submitted by Google to the Apache foundation. This talk will look at how the programming model handles late arriving data in a stream with event time, windows, and triggers.
Adaptive Parallelization of Queries over Dependent Web Service Calls
1. Adaptive Parallelization of Queries over Dependent Web Service Calls Manivasakan Sabesan and Tore Risch Uppsala Database Laboratory Dept. of Information Technology Uppsala University Sweden
3. WSMED System (Web Service MEDiator): architecture diagram. WSDL metadata for web services WS 1 ... WS n is imported into WSMED's meta store; an Operation Wrapper Function (OWF) is automatically generated for each operation, making the web service operations queryable, so that an SQL query over the OWFs is executed as SOAP calls to the underlying web service operations.
10. Query Processing in WSMED: diagram of the two-phase query processor. Phase 1: the calculus generator and central plan creator translate the SQL query into a central plan. Phase 2: the plan splitter, plan function generator, and parallel pipeliner produce the parallel query plan.
12. Plan Splitting and Plan Function Generation (Phase 2): the plan is split into plan functions. PF1(Charstring st1) -> Stream of Charstring str applies GetPlacesWithin('Atlanta', st1, 15.0, 'City') and concat(city, ', ', state2) to produce <city, state2> strings; PF2(Charstring str) -> Stream of <Charstring pl, Charstring st> applies GetPlaceList(str, 100, 'true') to produce <pl, st> pairs.
13. WSMED Process Tree: diagram of query processes qi (i = 0, 1, ..., n) for Query1, with a coordinator q0 and child processes q1 ... q8 arranged in two levels, the levels calling the plan functions GetAllStates, PF1, and PF2.
14. Make Parallel Pipeline: diagram of the pipeline GetAllStates() -> FF_APPLYP(PF1, 2, st1) -> FF_APPLYLP(PF2, 3, str), producing <pl, st> tuples; the fanouts (2 and 3) are manually set on both levels.
24. (continued) 2. A monitoring cycle for a non-leaf query process is complete when the number of received end-of-call messages equals the number of children. 2.1 After the first monitoring cycle, an AFF_APPLYP adds p new child processes (an add stage). 3. When an added node has several levels of children, the init stages of the AFF_APPLYPs in the children produce a binary sub-tree. (Diagram: the process tree after an add stage, with coordinator q0 and children q1 ... q11 across Levels 1 and 2.)
25. (continued) 4. An AFF_APPLYP records, per monitoring cycle i, the average time ti to produce an incoming tuple from the children. 4.1 If ti decreases by more than a threshold (25%), the add stage is rerun. 4.2 If ti increases, we either stop or run a drop stage that drops one child and its children. (Diagram: the process tree after a drop stage, with coordinator q0 and the remaining children across Levels 1 and 2.)
We have developed a system, WSMED, that provides general query capabilities over data accessible through web services by reading WSDL metadata descriptions. A WSDL URI is given to WSMED to import the metadata into its local store. While importing the metadata, WSMED automatically creates each OWF as a declarative function that looks like a regular table; the OWF makes the web service operation queryable. Users can then view these OWFs in a GUI that illustrates their signatures, and can pose SQL queries that treat the OWFs as regular relations, calling any web service without any programming.
There is a common need to search information through data-providing web services, which, without any side effects, return a set of objects for a given set of parameters.
By starting separate query processes, each calling a plan function for different parameter tuples.
The views can be queried with SQL
Central plan: built with a heuristic cost model based on the web service signatures, assuming a web service call is expensive. Sequential execution is slow.
Multilevel execution plans are generated with several layers of parallelism; the process tree fanout transforms the central query plan into a parallel query plan. The coordinator initiates communication between the child processes and ships the plan functions to them; it then streams different parameter tuples to the children, and the results are delivered as streams from the child processes.
End-of-call message.
In these cases the query's process tree is close to a homogeneous fanout tree. The properties of the web services are unknown.