Benchmarking the Effectiveness of Associating Chains of Links for Exploratory Semantic Search


Linked Data offers an entity-based infrastructure to resolve indirect relations between resources, expressed as chains of links. If we can benchmark how effectively chains of links are retrieved from these sources, we can motivate why they are a reliable addition to exploratory search interfaces. A vast number of applications could benefit from encouraging insights in this field, especially knowledge discovery tasks related to, for instance, ad-hoc decision support and digital assistance systems. In this paper, we explain a benchmark model for evaluating the effectiveness of associating chains of links with keyword-based queries. We illustrate the benchmark model with an example case using academic library and conference metadata, where we measured precision with targeted expert users and used it as an indicator of search effectiveness. This kind of semantic search engine evaluation, which focuses on information retrieval metrics such as precision, is typically biased towards the final result only. However, in an exploratory search scenario, the dynamics of the intermediary links that could lead to potentially relevant discoveries should not be neglected.


Laurens De Vocht
Selver Softic, Ruben Verborgh, Erik Mannens, Martin Ebner, Rik Van de Walle

:Paris :mayor :Anne_Hidalgo
:Bethlehem,_PA ?

:Anne_Hidalgo
:Bethlehem,_PA
Exploratory Semantic Search Engine

Chain A:
:Paris
→ :mayor :Anne_Hidalgo
← :birthPlace :San_Fernando,_Caldiz
→ :country :Spain
← :birthPlace :Edward_Ferrero
→ :battle :Battle_of_Roanoke_Island
← :battle :Charles_Adam_Heckman
→ :birthPlace :Easton,_Pennsylvania
→ :mouthMountain :Lehigh_River
→ :city :Bethlehem,_Pennsylvania

Chain B:
:Paris
← :capital :France
← :citizenship :Cyril_Bourlon_de_Rouvre
→ :education :Aerospace_engineering
← :occupation :Dick_Johnson_(glider_pilot)
→ :almaMater :Mississippi_State_University
← :almaMater :Clara_Southmayd_Ludlow
→ :birthPlace :Easton,_Pennsylvania
← :mouthMountain :Lehigh_River
→ :city :Bethlehem,_Pennsylvania
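As a rough illustration of what "a chain of links" between two resources means, the sketch below stores the links of the second chain (B) as subject-predicate-object triples and runs a breadth-first search that ignores link direction. The triple directions are an assumption based on the usual meaning of each predicate, and `find_chain` is a hypothetical helper written for this example, not the algorithm used by the engine described in the slides.

```python
from collections import deque

# Links of chain B, written as (subject, predicate, object).
# The direction of each triple is an assumption, not stated in the source.
TRIPLES = [
    (":France", ":capital", ":Paris"),
    (":Cyril_Bourlon_de_Rouvre", ":citizenship", ":France"),
    (":Cyril_Bourlon_de_Rouvre", ":education", ":Aerospace_engineering"),
    (":Dick_Johnson_(glider_pilot)", ":occupation", ":Aerospace_engineering"),
    (":Dick_Johnson_(glider_pilot)", ":almaMater", ":Mississippi_State_University"),
    (":Clara_Southmayd_Ludlow", ":almaMater", ":Mississippi_State_University"),
    (":Clara_Southmayd_Ludlow", ":birthPlace", ":Easton,_Pennsylvania"),
    (":Lehigh_River", ":mouthMountain", ":Easton,_Pennsylvania"),
    (":Lehigh_River", ":city", ":Bethlehem,_Pennsylvania"),
]


def find_chain(triples, start, goal):
    """Return the shortest chain from start to goal as a list of
    (direction, predicate, node) steps, or None if no chain exists.
    Links are followed in both directions."""
    neighbours = {}
    for s, p, o in triples:
        neighbours.setdefault(s, []).append((p, "->", o))  # forward link
        neighbours.setdefault(o, []).append((p, "<-", s))  # backward link

    previous = {start: None}  # node -> (previous node, predicate, direction)
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            break
        for predicate, direction, nxt in neighbours.get(node, []):
            if nxt not in previous:
                previous[nxt] = (node, predicate, direction)
                queue.append(nxt)

    if goal not in previous:
        return None

    # Walk back from the goal to the start to rebuild the chain.
    chain = []
    node = goal
    while previous[node] is not None:
        prev, predicate, direction = previous[node]
        chain.append((direction, predicate, node))
        node = prev
    chain.reverse()
    return chain


chain = find_chain(TRIPLES, ":Paris", ":Bethlehem,_Pennsylvania")
for direction, predicate, node in chain:
    print(direction, predicate, node)
```

Breadth-first search is used here only because it returns a shortest chain on a small in-memory graph; a real engine over Linked Data would need indexing and ranking that this sketch does not attempt.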

How effectively does an exploratory semantic
search engine reveal initially hidden associations,
as chains of links between interlinked resources?

Introduction
Exploratory Search
Benchmark Model
Motivating Example
Discussion and Conclusion

[EXPLORATORY SEARCH: FROM FINDING TO UNDERSTANDING, Marchionini, 2006]
Lookup, Learn, Investigate
Exploratory Search
“Learning searches involve multiple iterations and return sets of objects that require cognitive processing and interpretation.”
“Searches that support investigation involve multiple iterations that take place over perhaps very long periods of time and may return results that are critically assessed before being integrated into personal and professional knowledge bases.”
Definition

1. Lookup
2. Relate/Expand
Lookup and learn: interpretation

lookup
expand relate
An exploratory semantic search engine

Iterative Exploratory Queries
Exploratory Semantic Search Engine
Datasets
Baseline
Effectiveness

Effectiveness
The effectiveness E indicates the overall perception of the results by the users, taking into account expert-user feedback.
$$E = \frac{\#\,\text{user-marked relevant objects}}{\#\,\text{retrieved objects}}$$
Note:
E can be interpreted as precision in traditional IR.
Typical IR evaluations examine both precision and recall.
[TALKEXPLORER, Verbert et al., 2013]
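The ratio E can be computed directly from the set of retrieved objects and the set of objects that users marked as relevant. The sketch below is a minimal illustration; the function name `effectiveness`, the identifiers `r1` to `r4`, and the choice to return 0.0 when nothing was retrieved are assumptions made for this example.

```python
def effectiveness(retrieved, marked_relevant):
    """E = (# user-marked relevant objects) / (# retrieved objects).

    Only marks on objects that were actually retrieved are counted.
    Returning 0.0 for an empty result set is an assumption of this sketch.
    """
    retrieved = set(retrieved)
    if not retrieved:
        return 0.0
    relevant_retrieved = retrieved & set(marked_relevant)
    return len(relevant_retrieved) / len(retrieved)


# Hypothetical example: four retrieved objects, two marked relevant by users.
print(effectiveness(["r1", "r2", "r3", "r4"], ["r1", "r3"]))  # 0.5
```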

Motivating Example
ResXplorer.org
Everything Is Connected Engine
Virtuoso
User Study Extracted Queries

User Study Extracted Queries
1. lookup; 2. expand; 3. relate

Effectiveness: Interpretation
LDOW: P(0), P(1), P(2), P(3)

Sample Results
Everything Is Connected Engine

Limitations
• Only indicates comparisons to the baseline within the same use case.
• The benchmark cannot be used as leverage to compare different approaches across use cases.
• Could better demonstrate in which aspects an exploratory approach outperforms traditional systems.

Future Work
• Put the results in perspective by indicating the nuances among different expert user ratings, especially when there is expert disagreement or inconsistency.
• Facilitate generalization of the preliminary search context so that results for engines are reusable across datasets, avoiding that a certain engine’s results differ strongly when the data and queries change.
• Make sure that the approach is generic and can be applied to other search contexts with different data and use cases.

Benefits
• Compare exploratory search engines to a baseline:
  - show use cases in which the baseline can be outperformed;
  - show for which queries the ‘engine under test’ is relatively more effective.
• Sensitive to the initial query keywords as entered by the user: when there are inconsistencies or vague terms, even mismatches in the query context, or when expert users disagree.
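One way to read the comparison against a baseline is to compute the effectiveness E per query for both systems and look at the difference. The sketch below assumes E has already been computed per query; the function name, the query labels `q1` to `q3`, and the scores are hypothetical placeholders, not results from the study.

```python
def compare_to_baseline(engine_scores, baseline_scores):
    """Per query, return E(engine under test) - E(baseline).

    Only queries that were evaluated on both systems are compared.
    """
    return {
        query: engine_scores[query] - baseline_scores[query]
        for query in engine_scores
        if query in baseline_scores
    }


# Hypothetical per-query effectiveness values (placeholders, not study data).
engine = {"q1": 0.75, "q2": 0.25, "q3": 0.5}
baseline = {"q1": 0.5, "q2": 0.5, "q3": 0.5}

differences = compare_to_baseline(engine, baseline)
outperformed = [q for q, diff in differences.items() if diff > 0]
print(differences)
print("Baseline outperformed for:", outperformed)
```

A positive difference marks a query for which the engine under test is relatively more effective; as the limitations note, such differences are only meaningful within the same use case.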

Contact
@laurens_d_v
laurens.devocht@ugent.be
http://slideshare.net/laurensdv
http://semweb.mmlab.be/




Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...

US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure

family therapy psychotherapy types .pdf

haseebahmeddrama

SCHISTOSOMA HEAMATOBIUM life cycle .pdf

DebdattaGhosh6

Cyanobacteria (also known as blue-green algae) are ubiquitous photosynthetic microorganisms found in diverse habitats such as fresh water, marine water, moist rocks, etc. The photosynthetic mode of nutrition makes them significant global oxygen producers along with nitrogen-fixing ability of heterocyst and carbon sequestration. Some cyanobacterial species have the ability to perform a dual mode of nutritional procurement. This unique capability of cyanobacteria to utilize both organic (heterotrophic) and inorganic (autotrophic) carbon sources for energy production and growth is termed as mixotrophy which impart nutritional flexibility and competitive ability to them. Cyanobacterial mixotrophy provides the promising avenues in biotechnological applications such as wastewater treatment, bioremediation, pharmaceuticals, food supplements, biofertilizer, coloring agents, synthesis of bioactive compounds and as an agent for eco-friendly bio-fuels generation, etc Mixotrophically grown cyanobacteria, demonstrate significant potential for efficient and economical applications beyond their conventional agricultural application, thereby offering a versatile and impactful resource for future technological and environmental challenges.

mixotrophy in cyanobacteria: a dual nutritional strategy

MansiBishnoi1

Factor Causing low production and physiology of mamary Gland

Rcvets

SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx

Pat (JS) Heslop-Harrison

NuGOweek 2024 full programme - hosted by Ghent University

pablovgd

This edition of our Newsletter is a testament to our collective dedication and the exciting progress we’ve achieved. The completion of our first Periodic Report marks a significant milestone, and the advancements in tetrahedrite mineral-based thermoelectric materials are not just promising -they are a lap towards a sustainable future. We’re excited to share updates on our ongoing activities, our synergistic collaborations with the EHRASE cluster and THERMOS project, and insightful technical information on thermoelectric generators. But that’s not all, join us on the Consortium Tour, where this time SGUDS and IGME-CSIC take centre stage. Plus, don’t miss the insightful interview with Doug Crane from our Scientific Advisory Board, whose expertise enriches our understanding of thermoelectrics. This edition also features the fascinating adventures of Starty, exploring the practical uses of thermoelectric devices in a narrative that’s both educational and engaging. Looking ahead, we eagerly anticipate your visit to the START booth at the upcoming 40th International and 20th European Thermoelectric Conference, ICT/ECT 2024, in Krakow. We hope this Newsletter serves not only as a source of information but also as an inspiration for continued excellence. Stay connected with us for more exciting updates from START on our website and social media channels.

EU START PROJECT. START-Newsletter_Issue_4.pdf

Start Project

It should be no surprise that AI is treading a similar path to computing which began with single-purpose machines tasked for payroll calculations, banking transactions, or weapons targeting et al, but nothing more! It took decades for General Purpose Computing to emerge in the form of the now ubiquitous PC. Today, AI is still in a single-purpose/task-specific phase, and we have no general-purpose platforms, but their emergence is only a matter of time! Recent AI progress has seen a repeat of the media debate and alarmist warnings for our computing past, compounded by consequential advances in robotics. In turn, this has promoted numerous attempts to draw biological equivalences defining the time when machines will overtake humans. But without any workable definitions or framework that tend to little more than un/educated guesses. Recourse to IQ measures and the Touring test have proved to be irrelevant, and without a reference framework or formal characterisation, continued discussion and debate remain futile We therefore approach this AI problem from the bottom up by defining the simplest of machines and lifeforms to derive clues, pointers and basic boundary conditions . This sees a fundamental Entropic description emerge that is applicable to both machine and lifeforms. This presentation is suitable for professionals and the public alike, and is fully illustrated by high-quality graphics, animations and, movies. Inevitably, it contains some mathematics that non-practitioners will have to take on trust, but the focus is on defining the key characteristics, parameters, and important features of AI, our total dependence, and the future! Note: A 40 min session for a predominantly ley audience and not all the slides presented here were used on the day. Their inclusion here is in response to those audience members requesting more detail at the end of/during the event.

Quantifying Artificial Intelligence and What Comes Next!

University of Hertfordshire

The highest priority recommendation of the Astro2020 Decadal Survey for space-based astronomy was the construction of an observatory capable of characterizing habitable worlds. In this paper series we explore the detectability of and interference from exomoons and exorings serendipitously observed with the proposed Habitable Worlds Observatory (HWO) as it seeks to characterize exoplanets, starting in this manuscript with Earth-Moon analog mutual events. Unlike transits, which only occur in systems viewed near edge-on, shadow (i.e., solar eclipse) and lunar eclipse mutual events occur in almost every star-planet-moon system. The cadence of these events can vary widely from ∼yearly to multiple events per day, as was the case in our younger Earth-Moon system. Leveraging previous space-based (EPOXI) lightcurves of a Moon transit and performance predictions from the LUVOIR-B concept, we derive the detectability of Moon analogs with HWO. We determine that Earth-Moon analogs are detectable with observation of ∼2-20 mutual events for systems within 10 pc, and larger moons should remain detectable out to 20 pc. We explore the extent to which exomoon mutual events can mimic planet features and weather. We find that HWO wavelength coverage in the near-IR, specifically in the 1.4 µm water band where large moons can outshine their host planet, will aid in differentiating exomoon signals from exoplanet variability. Finally, we predict that exomoons formed through collision processes akin to our Moon are more likely to be detected in younger systems, where shorter orbital periods and favorable geometry enhance the probability and frequency of mutual events.

Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...

Sérgio Sacani

Recently uploaded (20)

Manganese‐RichSandstonesasanIndicatorofAncientOxic LakeWaterConditionsinGale...

Plasma proteins_ Dr.Muralinath_Dr.c. kalyan

NuGOweek 2024 programme final FLYER short.pdf

Climate extremes likely to drive land mammal extinction during next supercont...

Jet reorientation in central galaxies of clusters and groups: insights from V...

GBSN - Microbiology (Unit 6) Human and Microbial interaction

Virulence Analysis of Citrus canker caused by Xanthomonas axonopodis pv. citr...

MIP Award presentation at the IEEE International Conference on Software Analy...

GBSN - Microbiology Lab (Compound Microscope)

The solar dynamo begins near the surface

Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...

family therapy psychotherapy types .pdf

SCHISTOSOMA HEAMATOBIUM life cycle .pdf

mixotrophy in cyanobacteria: a dual nutritional strategy

Factor Causing low production and physiology of mamary Gland

SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx

NuGOweek 2024 full programme - hosted by Ghent University

EU START PROJECT. START-Newsletter_Issue_4.pdf

Quantifying Artificial Intelligence and What Comes Next!

Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...

Benchmarking the Effectiveness of Associating Chains of Links for Exploratory Semantic Search

1. Benchmarking the Effectiveness of Associating Chains of Links for Exploratory Semantic Search. Laurens De Vocht, Selver Softic, Ruben Verborgh, Erik Mannens, Martin Ebner, Rik Van de Walle

2. :Paris ?

3. :Paris :Anne_Hidalgo :mayor

4. :Paris :Anne_Hidalgo :mayor :Bethlehem,_PA ?

5. :Anne_Hidalgo ?

6. :Anne_Hidalgo ? :Bethlehem,_PA

7. :Anne_Hidalgo :Bethlehem,_PA Exploratory Semantic Search Engine

8. ?

9. Path A: :Paris; :mayor :Anne_Hidalgo; :birthPlace :San_Fernando,_Caldiz; :country :Spain; :birthPlace :Edward_Ferrero; :battle :Battle_of_Roanoke_Island; :battle :Charles_Adam_Heckman; :birthPlace :Easton,_Pennsylvania; :mouthMountain :Lehigh_River; :city :Bethlehem,_Pennsylvania

10. Path B: :Paris; :capital :France; :citizenship :Cyril_Bourlon_de_Rouvre; :education :Aerospace_engineering; :occupation :Dick_Johnson_(glider_pilot); :almaMater :Mississippi_State_University; :almaMater :Clara_Southmayd_Ludlow; :birthPlace :Easton,_Pennsylvania; :mouthMountain :Lehigh_River; :city :Bethlehem,_Pennsylvania

11. BA ?

12. How effectively does an exploratory semantic search engine reveal initially hidden associations, as chains of links between interlinked resources?

13. Introduction; Exploratory Search; Benchmark Model; Motivating Example; Discussion and Conclusion

14. Introduction; Exploratory Search; Benchmark Model; Motivating Example; Discussion and Conclusion

15. Definition. Exploratory Search: Lookup, Learn, Investigation. “Learning searches involve multiple iterations and return sets of objects that require cognitive processing and interpretation.” “Searches that support investigation involve multiple iterations that take place over perhaps very long periods of time and may return results that are critically assessed before being integrated into personal and professional knowledge bases.” [Exploratory Search: From Finding to Understanding, Marchionini, 2006]

16. Lookup and learn: interpretation. 1. Lookup; 2. Relate/Expand

17. An exploratory semantic search engine: lookup, expand, relate

18. lookup :Paris Paris

19. expand :Paris :Paris

20. relate
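
The three operations named on the slides above (lookup, expand, relate) can be illustrated with a small sketch. This is not the engine used in the talk: the in-memory triple list, the substring matching in `lookup`, the undirected treatment of links, and the breadth-first search in `relate` are all assumptions made for illustration. The resource names are taken from path A on slide 9; the direction of each triple is assumed.

```python
from collections import deque

# Toy graph built from the resources on slide 9 (path A).
# The subject/object direction of each triple is an assumption.
TRIPLES = [
    (":Paris", ":mayor", ":Anne_Hidalgo"),
    (":Anne_Hidalgo", ":birthPlace", ":San_Fernando,_Caldiz"),
    (":San_Fernando,_Caldiz", ":country", ":Spain"),
    (":Edward_Ferrero", ":birthPlace", ":Spain"),
    (":Edward_Ferrero", ":battle", ":Battle_of_Roanoke_Island"),
    (":Charles_Adam_Heckman", ":battle", ":Battle_of_Roanoke_Island"),
    (":Charles_Adam_Heckman", ":birthPlace", ":Easton,_Pennsylvania"),
    (":Lehigh_River", ":mouthMountain", ":Easton,_Pennsylvania"),
    (":Lehigh_River", ":city", ":Bethlehem,_Pennsylvania"),
]


def lookup(keyword):
    """Map a keyword to matching resources (naive substring match)."""
    needle = keyword.lower().replace(" ", "_")
    resources = {node for s, _, o in TRIPLES for node in (s, o)}
    return sorted(r for r in resources if needle in r.lower())


def expand(resource):
    """Return (predicate, neighbour) pairs, ignoring link direction."""
    neighbours = []
    for s, p, o in TRIPLES:
        if s == resource:
            neighbours.append((p, o))
        elif o == resource:
            neighbours.append((p, s))
    return neighbours


def relate(source, target):
    """Breadth-first search for a shortest chain of resources, or None."""
    queue = deque([[source]])
    visited = {source}
    while queue:
        chain = queue.popleft()
        if chain[-1] == target:
            return chain
        for _, neighbour in expand(chain[-1]):
            if neighbour not in visited:
                visited.add(neighbour)
                queue.append(chain + [neighbour])
    return None


if __name__ == "__main__":
    start = lookup("Paris")[0]
    goal = lookup("Bethlehem")[0]
    print(expand(start))
    print(relate(start, goal))
```

Treating links as undirected keeps the sketch short; a real engine over Linked Data would likely rank candidate chains rather than return only one shortest chain.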

21. Introduction; Exploratory Search; Benchmark Model; Motivating Example; Discussion and Conclusion

22. Iterative Exploratory Queries; Exploratory Semantic Search Engine; Datasets; Baseline; Effectiveness

23. Effectiveness. The effectiveness E indicates the overall perception of the results by the users, taking into account expert-user feedback: E = (# user-marked relevant objects) / (# retrieved objects). Note: E can be interpreted as precision in traditional IR. Typical IR evaluations examine both precision and recall. [TalkExplorer, Verbert et al., 2013]
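
The effectiveness ratio defined on slide 23 can be computed as in the following minimal sketch. The function name, the argument names, and the choice to score an empty result list as 0 are assumptions, not part of the benchmark model as presented; the result identifiers are hypothetical.

```python
def effectiveness(retrieved, marked_relevant):
    """E = (# user-marked relevant objects) / (# retrieved objects)."""
    retrieved = list(retrieved)
    if not retrieved:
        return 0.0  # assumption: an empty result list scores 0
    relevant = set(marked_relevant)
    # Count only retrieved objects that an expert user marked as relevant.
    hits = sum(1 for obj in retrieved if obj in relevant)
    return hits / len(retrieved)


if __name__ == "__main__":
    results = ["r1", "r2", "r3", "r4"]   # hypothetical retrieved objects
    expert_marks = {"r1", "r4"}          # hypothetical expert feedback
    print(effectiveness(results, expert_marks))  # 0.5
```

Because only retrieved objects are counted, this matches the slide's reading of E as precision; recall would additionally need the set of all relevant objects.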

24. Introduction; Exploratory Search; Benchmark Model; Motivating Example; Discussion and Conclusion

25. Motivating Example: ResXplorer.org; Everything Is Connected Engine; Virtuoso; User Study; Extracted Queries

26. User Study, Extracted Queries: 1. lookup; 2. expand; 3. relate

27. Effectiveness: Interpretation. LDOW: P(0), P(1), P(2), P(3)

28. Sample Results: Everything Is Connected Engine

29. Sample Results: Virtuoso

30. Introduction; Exploratory Search; Benchmark Model; Motivating Example; Discussion and Conclusion

31. Limitations • Results only indicate comparisons to a baseline within the same use case. • The benchmark cannot be used as leverage to compare different approaches across use cases. • The benchmark could better demonstrate in which aspects an exploratory approach excels over traditional systems.

32. Future Work • Put the results in perspective by indicating the nuances among different expert-user ratings, especially when experts disagree or are inconsistent. • Facilitate generalization of the preliminary search context so that results for engines are reusable across datasets, avoiding that a certain engine’s results differ strongly when the data and queries change. • Make sure that the approach is generic and can be applied to other search contexts with different data and use cases.

33. Benefits • Compare exploratory search engines to a baseline: show use cases in which the baseline can be outperformed, and for which queries the ‘engine under test’ is relatively more effective. • Sensitive to the initial query keywords as entered by the user: when there are inconsistencies or vague terms, even mismatches in the query context, or when expert users disagree.

34. Contact: @laurens_d_v; laurens.devocht@ugent.be; http://slideshare.net/laurensdv; http://semweb.mmlab.be/

Editor's Notes

Black slide with the outline structure that keeps recurring.
Benchmark model components: input, engine, output.
In this part of exploratory search we consider only precision, because we are interested in how effectively each search result helps the user reach their search goal, not in returning complete results at this point.
