A presentation on how OSS projects may be used for providing practical training for undergraduate software engineering students. Host: UNU-IIST, Macau-SAR, China
Transforming repositories: from repository managers to institutional data man...JISC KeepIt project
The last decade has seen support for digital preservation transformed. There are now a multitude of organisations, training courses, and software development tools to help guide managers of digital data towards preservation decisions and solutions. But how well do these approaches understand the needs and requirements of users? This presentation was given at ECA 2010, a conference for digital archiving professionals. But not everyone can be a digital archiving specialist. At a time of exploding volumes of digital content, especially on the Web, many non-specialists need help in preserving digital content. The presentation looks at the applicability and practicality of all this support for one class of user, digital repositories, and in particular institutional repositories (IRs) and their managers. We report on a course on digital preservation tools, designed by repository managers as part of the JISC KeepIt project. Positive feedback from the evaluations of this course have show that the emergence of the tools used in this course is a great story for digital preservation.
Due to the increasing uptake of semantic technologies, ontologies are becoming part of a growing number of software development projects. As a result, ontology development teams have to combine their activities with software development practices. In this presentation some practices, tools and examples of new trends in ontological engineering are provided.
Congresso Sociedade Brasileira de Computação CSBC2016 Porto Alegre (Brazil)
Workshop on Cloud Networks & Cloudscape Brazil
Priscila Solis - University of Brasilia and EUBrasilCloudFORUM Brazilian coordinator, Brazil
The excellence of cloud computing research and industry in Europe and Brazil. Opening and welcome messaged from representatives of the Brazilian Government and the European Commission.
Congresso Sociedade Brasileira de Computação CSBC2016 Porto Alegre (Brazil)
Workshop on Cloud Networks & Cloudscape Brazil
Priscila Solis - University of Brasilia and EUBrasilCloudFORUM Brazilian coordinator, Brazil
Funded jointly by the European Commission (EC) and the Ministry of Science, Technology and Innovation; Portuguese: Ministério da Ciência, Tecnologia e Inovação (MCTI) of Brazil, the EUBrasilCloudFORUM project supports EU-BR collaborative projects in the collection and promotion of their results and activities. The results will be used to draft a research Roadmap on cloud computing, identifying collaboration needs and opportunities between Europe and Brazil for the European Commission and to MCTI, thus contributing to the definition of future cooperation priorities between the two regions.
A presentation on how OSS projects may be used for providing practical training for undergraduate software engineering students. Host: UNU-IIST, Macau-SAR, China
Transforming repositories: from repository managers to institutional data man...JISC KeepIt project
The last decade has seen support for digital preservation transformed. There are now a multitude of organisations, training courses, and software development tools to help guide managers of digital data towards preservation decisions and solutions. But how well do these approaches understand the needs and requirements of users? This presentation was given at ECA 2010, a conference for digital archiving professionals. But not everyone can be a digital archiving specialist. At a time of exploding volumes of digital content, especially on the Web, many non-specialists need help in preserving digital content. The presentation looks at the applicability and practicality of all this support for one class of user, digital repositories, and in particular institutional repositories (IRs) and their managers. We report on a course on digital preservation tools, designed by repository managers as part of the JISC KeepIt project. Positive feedback from the evaluations of this course have show that the emergence of the tools used in this course is a great story for digital preservation.
Due to the increasing uptake of semantic technologies, ontologies are becoming part of a growing number of software development projects. As a result, ontology development teams have to combine their activities with software development practices. In this presentation some practices, tools and examples of new trends in ontological engineering are provided.
Congresso Sociedade Brasileira de Computação CSBC2016 Porto Alegre (Brazil)
Workshop on Cloud Networks & Cloudscape Brazil
Priscila Solis - University of Brasilia and EUBrasilCloudFORUM Brazilian coordinator, Brazil
The excellence of cloud computing research and industry in Europe and Brazil. Opening and welcome messaged from representatives of the Brazilian Government and the European Commission.
Congresso Sociedade Brasileira de Computação CSBC2016 Porto Alegre (Brazil)
Workshop on Cloud Networks & Cloudscape Brazil
Priscila Solis - University of Brasilia and EUBrasilCloudFORUM Brazilian coordinator, Brazil
Funded jointly by the European Commission (EC) and the Ministry of Science, Technology and Innovation; Portuguese: Ministério da Ciência, Tecnologia e Inovação (MCTI) of Brazil, the EUBrasilCloudFORUM project supports EU-BR collaborative projects in the collection and promotion of their results and activities. The results will be used to draft a research Roadmap on cloud computing, identifying collaboration needs and opportunities between Europe and Brazil for the European Commission and to MCTI, thus contributing to the definition of future cooperation priorities between the two regions.
Fundamental physics and accelerator science in developing countriesChristine Darve
APS2020
Session M19: Physics for Development
Abstract: Education and industrialization are essential to promote positive developments in the least developed countries. The recent evolution of technologies and ICT open the doors to innovative way to support education in developing countries. To enhance the existing African School of Fundamental Physics and Applications (ASP), Massive Open On-line Courses (MOOC) have been prepared and implemented to teach accelerator physics and technologies.
The goal of those initiatives is to catalyze the development of world-class institutions through the production of high–quality scientists and engineers to stimulate economic growth and employment creation. Pursuant to this goal, the objective is to produce the next generation of African scientists and engineers by training them in the necessary technical, entrepreneurial and leadership capacities to solve African problems thus contributing to economic and social transformation.
A Jisc perspective of digital notebooks including a summary of work on e-Lab notebooks, VREs, the next generation research environment and the research data shared service. How might ELNs be incorporated into a future open science shared service? Presented at "Digital Notebooks - how to provide solutions for researchers?" workshop in TU Delft (16 March 2018)
Presentation during the 14th Association of African Universities (AAU) Conference and African Open Science Platform (AOSP)/Research Data Alliance (RDA) Workshop in Accra, Ghana, 7-8 June 2017.
Immersive informatics - research data management at Pitt iSchool and Carnegie...Keith Webster
A joint presentation by Liz Lyon and Keith Webster on providing education for librarians engaged in research data management. This was delivered at Library Research Seminar VI, at the University of Illinois Urbana Champaign in September 2014. The presentation looks at a class delivered by Lyon at the University of Pittsburgh's iSchool in 2014, and the related needs for immersive training opportunities amongst experienced practicing librarians, using Carnegie Mellon University's library, led by Webster, as a case study.
This presentation gives an oiverview of the Sci-GaIA project, in the context of the CHAIN-REDS workshop at EGI2015 (Lisbon).
Aspects covered are :
1. The Sci-GaIA project: facts, figures and bjectives
2. The legacy of other projects (ei4Africa and CHAIN-REDS
3. The Sci-GaIA work programme
2011.10.10 Multi-Disciplinary Research Themes and TrainingNUI Galway
Dr Diane Payne, Director of the Dynamics Lab, Geary Institute, University College Dublin talked about the Geary Institute in this seminar "Multi-Disciplinary Research Themes and Training" at the Whitaker Institute on 10th October 2011.
28_09_2018 eMadrid seminar on MOOCs by Pedro Plaza, UNEDeMadrid network
«Local MOOC solution for thight budgets or limited internet access», eMadrid seminar on «MOOCs as part of the future of digital learning» at UNED, as part of LWMOOCS Conference
Data models in precision agriculture: from IoT to big data analyticsUniversity of Bologna
Data models are abstract models that standardize data formats and relationships.
In other words, data models describe the concepts that belong to a certain application domain (e.g., "Device" and "Farm" are concepts that belong to the "Agriculture" domain, and "Device" is also a concept in the domain of "Smart Cities").
Over the years, many data models (and ontologies) have been produced for the precision agriculture domain.
On the one hand, such models provide standards for data transmission and representation.
On the other hand, these models are not suited for (automated) data integration and analysis, which are core tasks in building decision support systems for precision agriculture ---digitalized systems that support farmers and technicians in making data-driven decisions.
Following the advancements in big data technologies and internet of things systems, managing such systems is increasingly harder and requires not only standards to transmit and represent the data, but also to automatically integrate heterogeneous data into a uniform medium and to automate data analysis and fruition.
While this is a well-known issue in the field of precision agriculture, where data models usually fuel data silos for ad-hoc independent applications (e.g., smart watering management, autonomous weeding systems, vegetation index computation), the synergy with computer science and database techniques could both answer these challenges and open novel research directions.
In this poster, we (i) describe some of the state-of-the-art models for precision agriculture and their application (e.g., from the FIWARE ecosystem), (ii) factorize the limitations and issues of such models (e.g., inter-domain ambiguities, intra-domain inconsistency, wrong modeling practices), (iii) show how computer science techniques (e.g., entity resolution, data normalization, data provenance collection) can answer these issues, and (iv) introduce novel data-driven research directions for building unifying decision support systems.
[EDBT2023] Describing and Assessing Cubes Through Intentional Analytics (demo...University of Bologna
The Intentional Analytics Model (IAM) has been envisioned as a way to tightly couple OLAP and analytics by (i) letting users explore multidimensional cubes stating their intentions, and (ii) returning multidimensional data coupled with knowledge insights in the form of annotations of subsets of data. Goal of this demonstration is to showcase the IAM approach using a notebook where the user can create a data exploration session by writing describe and assess statements, whose results are displayed by combining tabular data and charts so as to bring the highlights discovered to the user's attention. The demonstration plan will show the effectiveness of the IAM approach in supporting data exploration and analysis and its added value as compared to a traditional OLAP session by proposing two scenarios with guided interaction and letting users run custom sessions.
Fundamental physics and accelerator science in developing countriesChristine Darve
APS2020
Session M19: Physics for Development
Abstract: Education and industrialization are essential to promote positive developments in the least developed countries. The recent evolution of technologies and ICT open the doors to innovative way to support education in developing countries. To enhance the existing African School of Fundamental Physics and Applications (ASP), Massive Open On-line Courses (MOOC) have been prepared and implemented to teach accelerator physics and technologies.
The goal of those initiatives is to catalyze the development of world-class institutions through the production of high–quality scientists and engineers to stimulate economic growth and employment creation. Pursuant to this goal, the objective is to produce the next generation of African scientists and engineers by training them in the necessary technical, entrepreneurial and leadership capacities to solve African problems thus contributing to economic and social transformation.
A Jisc perspective of digital notebooks including a summary of work on e-Lab notebooks, VREs, the next generation research environment and the research data shared service. How might ELNs be incorporated into a future open science shared service? Presented at "Digital Notebooks - how to provide solutions for researchers?" workshop in TU Delft (16 March 2018)
Presentation during the 14th Association of African Universities (AAU) Conference and African Open Science Platform (AOSP)/Research Data Alliance (RDA) Workshop in Accra, Ghana, 7-8 June 2017.
Immersive informatics - research data management at Pitt iSchool and Carnegie...Keith Webster
A joint presentation by Liz Lyon and Keith Webster on providing education for librarians engaged in research data management. This was delivered at Library Research Seminar VI, at the University of Illinois Urbana Champaign in September 2014. The presentation looks at a class delivered by Lyon at the University of Pittsburgh's iSchool in 2014, and the related needs for immersive training opportunities amongst experienced practicing librarians, using Carnegie Mellon University's library, led by Webster, as a case study.
This presentation gives an oiverview of the Sci-GaIA project, in the context of the CHAIN-REDS workshop at EGI2015 (Lisbon).
Aspects covered are :
1. The Sci-GaIA project: facts, figures and bjectives
2. The legacy of other projects (ei4Africa and CHAIN-REDS
3. The Sci-GaIA work programme
2011.10.10 Multi-Disciplinary Research Themes and TrainingNUI Galway
Dr Diane Payne, Director of the Dynamics Lab, Geary Institute, University College Dublin talked about the Geary Institute in this seminar "Multi-Disciplinary Research Themes and Training" at the Whitaker Institute on 10th October 2011.
28_09_2018 eMadrid seminar on MOOCs by Pedro Plaza, UNEDeMadrid network
«Local MOOC solution for thight budgets or limited internet access», eMadrid seminar on «MOOCs as part of the future of digital learning» at UNED, as part of LWMOOCS Conference
Data models in precision agriculture: from IoT to big data analyticsUniversity of Bologna
Data models are abstract models that standardize data formats and relationships.
In other words, data models describe the concepts that belong to a certain application domain (e.g., "Device" and "Farm" are concepts that belong to the "Agriculture" domain, and "Device" is also a concept in the domain of "Smart Cities").
Over the years, many data models (and ontologies) have been produced for the precision agriculture domain.
On the one hand, such models provide standards for data transmission and representation.
On the other hand, these models are not suited for (automated) data integration and analysis, which are core tasks in building decision support systems for precision agriculture ---digitalized systems that support farmers and technicians in making data-driven decisions.
Following the advancements in big data technologies and internet of things systems, managing such systems is increasingly harder and requires not only standards to transmit and represent the data, but also to automatically integrate heterogeneous data into a uniform medium and to automate data analysis and fruition.
While this is a well-known issue in the field of precision agriculture, where data models usually fuel data silos for ad-hoc independent applications (e.g., smart watering management, autonomous weeding systems, vegetation index computation), the synergy with computer science and database techniques could both answer these challenges and open novel research directions.
In this poster, we (i) describe some of the state-of-the-art models for precision agriculture and their application (e.g., from the FIWARE ecosystem), (ii) factorize the limitations and issues of such models (e.g., inter-domain ambiguities, intra-domain inconsistency, wrong modeling practices), (iii) show how computer science techniques (e.g., entity resolution, data normalization, data provenance collection) can answer these issues, and (iv) introduce novel data-driven research directions for building unifying decision support systems.
[EDBT2023] Describing and Assessing Cubes Through Intentional Analytics (demo...University of Bologna
The Intentional Analytics Model (IAM) has been envisioned as a way to tightly couple OLAP and analytics by (i) letting users explore multidimensional cubes stating their intentions, and (ii) returning multidimensional data coupled with knowledge insights in the form of annotations of subsets of data. Goal of this demonstration is to showcase the IAM approach using a notebook where the user can create a data exploration session by writing describe and assess statements, whose results are displayed by combining tabular data and charts so as to bring the highlights discovered to the user's attention. The demonstration plan will show the effectiveness of the IAM approach in supporting data exploration and analysis and its added value as compared to a traditional OLAP session by proposing two scenarios with guided interaction and letting users run custom sessions.
The Intentional Analytics Model (IAM) has been devised to couple OLAP and analytics by (i) letting users express their analysis intentions on multidimensional data cubes and (ii) returning enhanced cubes, i.e., multidimensional data annotated with knowledge insights in the form of models (e.g., correlations). Five intention operators were proposed to this end; of these, describe and assess have been investigated in previous papers. In this work we enrich the IAM picture by focusing on the explain operator, whose goal is to provide an answer to the user asking "why does a measure show these values?". Specifically, we propose a syntax for the operator and discuss how enhanced cubes are built by (i) finding the polynomials that best approximate the relationship between a measure and the other cube measures, and (ii) highlighting the most interesting one. Finally, we test the operator implementation in terms of efficiency.
Carrying out OLAP analyses in hands-free scenarios requires lean forms of communication between the users and the system, based for instance on natural language. In this paper we introduce VOOL, a framework specifically devised for vocalizing the insights resulting from OLAP sessions. VOOL is self-configurable, extensible, and is aware of the user's intentions expressed by OLAP operators. To avoid overwhelming the user with very long descriptions, we pursue the vocalization of selected insights automatically extracted from query results. These insights are detected by a set of modules, each returning a set of independent insights that characterize data. After describing and formalizing our approach, we evaluate it in terms of efficiency and effectiveness.
The democratization of data access and the adoption of OLAP in scenarios requiring hand-free interfaces push towards the creation of smart OLAP interfaces. We describe COOL, a framework devised for COnversational OLap applications. COOL interprets and translates a natural language dialog into an OLAP session that starts with a GPSJ (Generalized Projection, Selection, and Join) query and continues with the application of OLAP operators. The interpretation relies on a formal grammar and on a repository storing metadata and values from a multidimensional cube. In case of ambiguous text description, COOL can obtain the correct query either through automatic inference or user interactions to disambiguate the text.
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...University of Bologna
The volume, variety, and high availability of data backing decision support systems have impacted on business intelligence, the discipline providing strategies to transform raw data into decision-making insights. Such transformation is usually abstracted in the “knowledge pyramid,” where data collected from the real world are processed into meaningful patterns. In this context, volume, variety, and data availability have opened for challenges in augmenting the knowledge pyramid. On the one hand, the volume and variety of unconventional data (i.e., unstructured non-relational data generated by heterogeneous sources such as sensor networks) demand novel and type-specific data management, integration, and analysis techniques. On the other hand, the high availability of unconventional data is increasingly attracting data scientists with high competence in the business domain but low competence in computer science and data engineering; enabling effective participation requires the investigation of new paradigms to drive and ease knowledge extraction. The goal of this thesis is to augment the knowledge pyramid from two points of view, namely, by including unconventional data and by providing advanced analytics. As to unconventional data, we focus on mobility data and on the privacy issues related to them by providing (de-)anonymization models. As to analytics, we introduce a higher abstraction level than writing formal queries. Specifically, we design advanced techniques that allow data scientists to explore data either by expressing intentions or by interacting with smart assistants in hand-free scenarios.
[EDBT2021] Conversational OLAP in Action (Best Demo Award EDBT2021)University of Bologna
Demo Paper presented at EDBT 2021: Conversational OLAP in Action (Best Demo Award)
Link to the paper: https://edbt2021proceedings.github.io/docs/p145.pdf
The democratization of data access and the adoption of OLAP in scenarios requiring hand-free interfaces push towards the creation of smart OLAP interfaces. In this demonstration we present COOL, a tool supporting natural language COnversational OLap sessions. COOL interprets and translates a natural language dialogue into an OLAP session that starts with a GPSJ (Generalized Projection, Selection and Join) query. The interpretation relies on a formal grammar and a knowledge base storing metadata from a multidimensional cube. COOL is portable, robust, and requires minimal user intervention. It adopts an n-gram based model and a string similarity function to match known entities in the natural language description. In case of incomplete text description, COOL can obtain the correct query either through automatic inference or through interactions with the user to disambiguate the text. The goal of the demonstration is to let the audience evaluate the usability of COOL and its capabilities in assisting query formulation and ambiguity/error resolution.
[EDBT2021] Assess Queries for Interactive Analysis of Data CubesUniversity of Bologna
Paper presented at EDBT 2021: Assess Queries for Interactive Analysis of Data Cubes
Link to the paper: https://edbt2021proceedings.github.io/docs/p41.pdf
Assessment is the process of comparing the actual to the expected behavior of a business phenomenon and judging the outcome of the comparison. In this paper, we propose `assess`, a novel querying operator that supports assessment based on the results of a query on a data cube. This operator requires (1) the specification of an OLAP query over a measure of a data cube, to define the target cube to be assessed; (2) the specification of a reference cube of comparison (benchmark), which represents the expected performance of the measure; (3) the specification of how to perform the comparison between the target cube and the benchmark, and (4) a labeling function that classifies the result of this comparison using a set of labels. After introducing an SQL-like syntax for our operator, we formally define its semantics in terms of a set of logical operators. To support the computation of `assess` we propose a basic plan as well as some optimization strategies, then we experimentally evaluate their performance using a prototype.
[SEBD2020] OLAP Querying of Document Stores in the Presence of Schema VarietyUniversity of Bologna
Paper presented at SEBD 2020
Document stores are preferred to relational ones for storing heterogeneous data due to their schemaless nature. However, the absence of a unique schema adds complexity to analytical applications. In a previous paper we have proposed an original approach to OLAP on document stores; its basic idea was to stop fighting against schema variety and welcome it as an inherent source of information wealth in schemaless sources. In this paper we focus on the querying phase, showing how queries can be directly rewritten on a heterogeneous collection in an inclusive way, i.e., also including the concepts present in a subset of documents only.
Authors: Matteo Francia, Enrico Gallinucci, Matteo Golfarelli, Stefano Rizzi
Paper presented at DOLAP 2020: Towards Conversational OLAP
Link to the presentation: https://youtu.be/IfBc1H46s8Y
Abstract: The democratization of data access and the adoption of OLAP in scenarios requiring hand-free interfaces push towards the creation of smart OLAP interfaces. In this paper, we envisage a conversational framework specifically devised for OLAP applications. The system converts natural language text in GPSJ (Generalized Projection, Selection and Join) queries. The approach relies on an ad-hoc grammar and a knowledge base storing multidimensional metadata and cubes values. In case of ambiguous or incomplete query description, the system is able to obtain the correct query either through automatic inference or through interactions with the user to disambiguate the text. Our tests show very promising results both in terms of effectiveness and efficiency.
Authors: Matteo Francia, Enrico Gallinucci, Matteo Golfarelli
[MIPRO2019] Map-Matching on Big Data: a Distributed and Efficient Algorithm w...University of Bologna
In urban mobility, map-matching aims to project GPS points generated by moving objects onto the road segments representing the actual object positions. Up to now, map-matching has found interesting applications in traffic analysis, frequent path extraction, and location prediction. However, state-of-art implementations of map-matching algorithms are either private, sequential or inefficient. In this paper, we propose an extension of an existing serial algorithm of known efficiency by reformulating it in a distributed way, in order to achieve great scalability on real big data scenarios. Furthermore, we enhance the robustness of the algorithm, which is based on a first-order Hidden Markov Model, by introducing a smart strategy to avoid gaps in the matched road segments; indeed, this problem may occur under sparse GPS sampling or in urban areas with highly fragmented road segments. Our implementation is based on Apache Spark and is publicly available on Github. The implementation is tested against a dataset with 7.8 million GPS points in Milan.
Augmented reality allows users to superimpose digital information (typically, of operational type) upon real world entities. The synergy of analytical frameworks and augmented reality opens the door to a new wave of situated OLAP, in which users within a physical environment are provided with immersive analyses of local contextual data. In this paper we propose an approach that, based on the sensed augmented context (provided by wearable and smart devices), proposes a set of relevant analytical queries to the user. This is done by relying on a mapping between the entities that can be recognized by the devices and the elements of the enterprise data, and also taking into account the queries preferred by users during previous interactions that occurred in similar contexts. A set of experimental tests evaluates the proposed approach in terms of efficiency and effectiveness.
http://ceur-ws.org/Vol-2324/Paper02-MGolfarelli.pdf
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfGetInData
Recently we have observed the rise of open-source Large Language Models (LLMs) that are community-driven or developed by the AI market leaders, such as Meta (Llama3), Databricks (DBRX) and Snowflake (Arctic). On the other hand, there is a growth in interest in specialized, carefully fine-tuned yet relatively small models that can efficiently assist programmers in day-to-day tasks. Finally, Retrieval-Augmented Generation (RAG) architectures have gained a lot of traction as the preferred approach for LLMs context and prompt augmentation for building conversational SQL data copilots, code copilots and chatbots.
In this presentation, we will show how we built upon these three concepts a robust Data Copilot that can help to democratize access to company data assets and boost performance of everyone working with data platforms.
Why do we need yet another (open-source ) Copilot?
How can we build one?
Architecture and evaluation
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfEnterprise Wired
In this guide, we'll explore the key considerations and features to look for when choosing a Trusted analytics platform that meets your organization's needs and delivers actionable intelligence you can trust.
2. 28/03/2023 DataPlat & CoMoNoS workshops 2023
CoMoNoS
DataPlat: Program Chairs
2nd Int. Workshop on Data Platform Design, Management, & Optimization
Many thanks to the fellow organizers…
▪ Matteo Francia – DISI, University of Bologna
▪ Enrico Gallinucci – DISI, University of Bologna
▪ Patrick Marcel – LIFAT, Université de Tours
▪ Veronika Peralta – LIFAT, Université de Tours
…and to EDBT/ICDT Workshop Chairs
▪ Verena Kantere and George Fletcher
2
3. 28/03/2023 DataPlat & CoMoNoS workshops 2023
CoMoNoS
CoMoNoS: Program Chairs
3rd Workshop on Conceptual Modeling for NoSQL Data Stores
Many thanks to the fellow organizers…
▪ Meike Klettke, University of Regensburg
▪ Stefanie Scherzinger, University of Passau
▪ Uta Störl, University of Hagen
…and to EDBT/ICDT Workshop Chairs
▪ Verena Kantere and George Fletcher
3
4. 28/03/2023 DataPlat & CoMoNoS workshops 2023
CoMoNoS
DataPlat & CoMoNos scope
Information systems have evolved into complex data platforms
▪ A paradigm change imposed by Big Data
▪ Support to data-intensive storage, computation, and analysis of heterogeneous data
Address lack of smart support to govern data through the whole life-cycle
▪ Need for metadata collection and activation
▪ Need to cope with the heterogeneity of storage and computation engines
Explore opportunities for conceptual modeling for NoSQL data stores
▪ Addressing real-world problems that arise with NoSQL data stores
▪ Addressing challenges that arise from data model evolution
4
5. 28/03/2023 DataPlat & CoMoNoS workshops 2023
CoMoNoS
their contributions and revisions
● Paolo Atzeni - Roma Tre University and NCA of Italy, Italy
● Mohamed-Amine Baazizi - Sorbonne University, Paris, France
● Francesca Bugiotti - University Paris-Saclay, France
● Anthony Cleve - University of Namur, Belgium
● Dario Colazzo - Dauphine University, Paris, France
● Isabelle Comyn-Wattiau - ESSEC Business School Paris, France
● Irena Holubova - Charles University of Prag, Czech Republic
● Jiaheng Lu - University of Helsinki, Finland
● Michael Mior - Rochester Institute of Technology, New York, USA
● Carlo Sartiani - University of Pisa, Italy
● Diego Sevilla Ruiz - University of Murcia, Spain
● Heiko Schuldt - University of Basel, Switzerland
● Arnon Sturm - Ben Gurion University of the Negev, Israel
Program Committee
Many thanks to the PC members for
● Duncan Ruiz - Escola Politécnica - PUCRS, Brazil
● Franck Ravat - Université Paul Sabatier, France
● Jérome Darmont - University of Lion, France
● Sana Sellami - Aix Marseille University, France
● Sandra Sampaio - University of Manchester, UK
● Sandro Bimonte - INRAE Clermont Ferrand, France
● Sergi Nadal - Universitat Politècnica de Catalunya,
Spain
● Shaleen Deep - Microsoft, US
● Theodoros Toliopoulos - Aristotle University of
Thessaloniki, Greece
5
6. 28/03/2023 DataPlat & CoMoNoS workshops 2023
CoMoNoS
Received 14 submissions around the globe
Accepted: 6 long papers, 4 short papers
Statistics
6
7. 28/03/2023 DataPlat & CoMoNoS workshops 2023
CoMoNoS
Review phase: 22/01 - 15/02
2-4 reviews per paper
Selection procedure
7
8. 28/03/2023 DataPlat & CoMoNoS workshops 2023
CoMoNoS
Program
• 11.00 Opening
• 11.05 1st Research paper session
Session chair: Matteo Francia
• 13.00 Lunch break
• 14.30 2nd Research paper session
Session chairs: Meike Klettke & Stefanie Scherzinger
• 16.15 Farewell
8
9. 28/03/2023 DataPlat & CoMoNoS workshops 2023
CoMoNoS
Special issue
Special Issue on Data Platform Design, Management, and Optimization
▪ Journal: Information System Frontiers (IF: 5.261)
Invitation for the best DataPlat papers
▪ Based on originality, reviews, quality of presentation
▪ Does not ensure publication
Timeline
▪ Submissions are now open (to anyone)
▪ Best DataPlat papers invitation: April 5th, 2023
▪ Deadline for paper submission: July 15th, 2023
▪ Reviewing: Continuous basis
▪ Revision deadline: September 15th, 2023
▪ Latest acceptance deadline for all papers: December 16th, 2023
9