SciQL: A Scientific Query Language

  • Explain that these slides show concept maps: boxes are concepts, and the labels on the relationships give their meanings, purposes, or reasons. Arrows and lines are equivalent; the difference is a tool issue. Data-driven science. These are just some attributes of scientific data.
  • Related work. Tools have a single focus or are general, covering processing and visualization. Versioning and provenance issues. Data processing pipeline. Impedance mismatch: format, data type, unit, accuracy. Shaping data to work in workflows. Multi-tool integration.
  • Is customized to work on scientific data. Considers versions. Produces provenance data. Point out the differences from and similarities to the previous slide.
  • BKR: Again, a layout more similar to the previous one would make it easier for the listener to get the picture ;-)
  • Describe the sample briefly. Point to the last Select statement and say that you would like to investigate what happens to it.
  • User input. State information.
  • Input parsing. Tree construction.
  • CSV adapter. Default adapter. Adapter capability matching. AST node selection based on the adapter's capabilities.
  • Spreadsheet Adapter
  • Database Adapter
  • The designed grammar is implemented in a language design framework such as ANTLR/Java. BKR: I think this won't be readable.
  • QEE: optimization, caching, state management, AST node selection and delegation, result compilation. Adapter: implements E-AST nodes and executes each received node against the actual data source. Data source: data plus functionality. Data sources such as spreadsheets and DBMSs have functions that the adapter may rely on.
  • The sample is finished here.
  • Unification, decoupling, adaptability, integration, extensibility, collaboration.
  • The overall duration: 27-39 months.
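The adapter capability matching mentioned in the notes (AST node selection based on the adapter's capabilities, with a default executor for everything else) could be sketched roughly as follows. This is only an illustration in Python, not the SciQL implementation: the `Adapter` class, the `annotate` function, and the capability sets assigned to each adapter are all assumptions made for the example. Only the node kinds (Project, Fetch, Filter, Aggregate) and the adapter names come from the slides.

```python
from dataclasses import dataclass

# Node kinds as they appear in the AST slides.
NODE_KINDS = ["Project", "Fetch", "Filter", "Aggregate"]


@dataclass(frozen=True)
class Adapter:
    """Hypothetical adapter description: a name plus the node kinds it can run."""
    name: str
    capabilities: frozenset


def annotate(node_kinds, adapter, default_name="Default"):
    """Map each node kind to the component that would execute it.

    A node is delegated to the adapter when the adapter declares the
    capability; otherwise it falls back to the default executor.
    """
    return {
        kind: adapter.name if kind in adapter.capabilities else default_name
        for kind in node_kinds
    }


# Assumed capability sets, chosen only for illustration.
CSV_ADAPTER = Adapter("CSV", frozenset({"Fetch"}))
DB_ADAPTER = Adapter("DB", frozenset({"Project", "Fetch", "Filter", "Aggregate"}))

if __name__ == "__main__":
    print(annotate(NODE_KINDS, CSV_ADAPTER))
    print(annotate(NODE_KINDS, DB_ADAPTER))
```

The same query tree yields a different annotation for each adapter, which is the idea behind producing a separate E-AST per data source.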

    1. SciQL: A Query Language for Unified Scientific Data Processing and Management. Javad Chamanara, University of Jena, Germany, javad.chamanara@uni-jena.de. At: CIKM 2012, Maui, HI, USA, Nov. 2, 2012
    2. What is scientific data? (Slide footer, repeated on every slide: November 2, 2012; javad.chamanara@uni-jena.de; SciQL: A Query Language for Unified Scientific Data Processing and Management, 5th Ph.D. Workshop (PIKM) at CIKM 2012, Maui, HI, USA)
    3. What is available?
    4. What is proposed here?
    5. What does it provide?
    6. A Sample
        Define Perspective p1 As {
            Attribute Temp_Fahrenheit MapTo Function(1.8 * Temp_Celsius + 32)
            Attribute SN_mg MapTo Function(SN_g * 1000)
            Attribute Year MapTo Function(Year(Timestamp)) DataType=Integer
        }
        Connection d Adapter=Spreadsheet Source_URI="c:datadata1.xls"
        Bind Perspective=p1 Connection=d Version=Latest As pdLatest
        Var pdAll = Select From pdLatest
        Draw Data=pdLatest GraphType=Scatter V-Axis=SN_mg H-Axis=Temp_Fahrenheit
        Var pdGroupped = Select Average(Temp_Fahrenheit) As Avg From pdLatest GroupBy Year
    7. How does it work?
        Var x = Select Average(Temp_Fahrenheit) As Avg From pdLatest Where Year > 2001 Group By Year
    8. How does it work? (AST) [Tree diagram: root "="; nodes VAR DEF (Var x) and Select; under Select: Project (Avg), Fetch (pdLatest), Filter (Year > 2001), Aggregate (Group Year).]
    9. How does it work? (E-AST, CSV Adapter) [Same tree, with one node labelled CSV.]
    10. How does it work? (E-AST, Excel Adapter) [Same tree, with adapter labels in this order: Default beside Select, then Default, Excel, Excel, Excel, Default.]
    11. How does it work? (E-AST, Database Adapter) [Same tree, with adapter labels in this order: DB beside Select, then Default, DB, DB, DB, DB.]
    12. Design
        • Grammar
        • Architecture
        • Execution Engine
    13. SciQL Language Constructs
    14. The Grammar
    15. General Architecture [Component diagram. Clients: Custom Application, Matlab, R Console, Declarative Console. Core: SciQL. Adapters: Spreadsheet Adapter, RDBMS Adapter, Vendor Specific Adapter. Data sources: CSV, Spreadsheet, RDBMS, Other.]
    16. Query Execution Engine [Diagram elements: Query Engine, Data Source Adapter; labels: E-AST, Result set.]
    17. Mapping [Component diagram "Perspective". Elements: Perspective 1 (Attribute A, Attribute B, Attribute C), Perspective 2 (Attribute 1, Attribute 2, Attribute 3), Data (Data Field 1 to Data Field 4), Port1.]
    18. What would be the benefits?
        • Scientists deal with just one language
        • It has a data-source-independent instruction set
        • It's easier to learn and share
        • Integration with other tools is easy
        • Mitigates the need for computer knowledge
    19. The Evaluation Plan
        • To be used in the context of BExIS
            – Big and diverse user community
            – Various data
        • Open source and free
            – Early feedback
            – Contribution
    20. The Work Plan
        • Define the grammar of the language – 6-9 months
        • Compare to related works and revise – 3-6 months
        • Compile the formal specification of the language – 3-6 months
        • Develop the proof of concept implementation – 9-12 months
        • Evaluation – 6 months
    21. Thanks
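
To make the sample on slides 6 and 7 more concrete, here is one way its semantics might be read, written in Python rather than SciQL. It is a sketch under assumptions, not the author's implementation: the rows in `RAW_ROWS` are invented, and `bind` and `average_by_year` are hypothetical helpers that stand in for the Bind statement and for the Select with Where and Group By. Only the three attribute mappings are taken from the slide.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

# Invented raw rows standing in for the spreadsheet behind connection d.
RAW_ROWS = [
    {"Temp_Celsius": 0.0, "SN_g": 1.5, "Timestamp": datetime(2001, 6, 1)},
    {"Temp_Celsius": 10.0, "SN_g": 2.0, "Timestamp": datetime(2002, 6, 1)},
    {"Temp_Celsius": 20.0, "SN_g": 2.5, "Timestamp": datetime(2002, 7, 1)},
    {"Temp_Celsius": 30.0, "SN_g": 3.0, "Timestamp": datetime(2003, 6, 1)},
]

# Perspective p1: each attribute is a function of the underlying data fields.
PERSPECTIVE_P1 = {
    "Temp_Fahrenheit": lambda row: 1.8 * row["Temp_Celsius"] + 32,
    "SN_mg": lambda row: row["SN_g"] * 1000,
    "Year": lambda row: int(row["Timestamp"].year),
}


def bind(perspective, rows):
    """Apply a perspective to raw rows (a stand-in for the Bind statement)."""
    return [{attr: fn(row) for attr, fn in perspective.items()} for row in rows]


def average_by_year(rows, min_year):
    """Average Temp_Fahrenheit per Year, keeping only rows with Year > min_year."""
    groups = defaultdict(list)
    for row in rows:
        if row["Year"] > min_year:
            groups[row["Year"]].append(row["Temp_Fahrenheit"])
    return {year: mean(values) for year, values in sorted(groups.items())}


pd_latest = bind(PERSPECTIVE_P1, RAW_ROWS)
x = average_by_year(pd_latest, 2001)

if __name__ == "__main__":
    print(x)
```

The query only ever refers to perspective attributes such as `Temp_Fahrenheit` and `Year`, never to the raw fields, which is what lets the same statement run against different data sources.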
