Our evaluation reveals that there is a preference for certain features depending on the search task. In addition, we touch on the current pain point of faceted search: the acquisition of faceted subject metadata for unstructured documents. We found a strong preference for prototypes displaying just a few facets generated based on either the query or the matching documents.
Performance Evaluation of Query Processing Techniques in Information Retrievalidescitation
The first element of the search process is the query.
The user query being on an average restricted to two or three
keywords makes the query ambiguous to the search engine.
Given the user query, the goal of an Information Retrieval
[IR] system is to retrieve information which might be useful
or relevant to the information need of the user. Hence, the
query processing plays an important role in IR system.
The query processing can be divided into four categories
i.e. query expansion, query optimization, query classification and
query parsing. In this paper an attempt is made to evaluate the
performance of query processing algorithms in each of the
category. The evaluation was based on dataset as specified by
Forum for Information Retrieval [FIRE15]. The criteria used
for evaluation are precision and relative recall. The analysis is
based on the importance of each step in query processing. The
experimental results show that the significance of each step
in query processing and also the relevance of web semantics
and spelling correction in the user query.
ONTOLOGY-DRIVEN INFORMATION RETRIEVAL FOR HEALTHCARE INFORMATION SYSTEM : ...IJNSA Journal
In health research, one of the major tasks is to retrieve, and analyze heterogeneous databases containing
one single patient’s information gathered from a large volume of data over a long period of time. The
main objective of this paper is to represent our ontology-based information retrieval approach for
clinical Information System. We have performed a Case Study in the real life hospital settings. The results
obtained illustrate the feasibility of the proposed approach which significantly improved the information
retrieval process on a large volume of data over a long period of time from August 2011 until January
2012
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...ijdmtaiir
-In this study a comprehensive evaluation of two
supervised feature selection methods for dimensionality
reduction is performed - Latent Semantic Indexing (LSI) and
Principal Component Analysis (PCA). This is gauged against
unsupervised techniques like fuzzy feature clustering using
hard fuzzy C-means (FCM) . The main objective of the study is
to estimate the relative efficiency of two supervised techniques
against unsupervised fuzzy techniques while reducing the
feature space. It is found that clustering using FCM leads to
better accuracy in classifying documents in the face of
evolutionary algorithms like LSI and PCA. Results show that
the clustering of features improves the accuracy of document
classification
Performance Evaluation of Query Processing Techniques in Information Retrievalidescitation
The first element of the search process is the query.
The user query being on an average restricted to two or three
keywords makes the query ambiguous to the search engine.
Given the user query, the goal of an Information Retrieval
[IR] system is to retrieve information which might be useful
or relevant to the information need of the user. Hence, the
query processing plays an important role in IR system.
The query processing can be divided into four categories
i.e. query expansion, query optimization, query classification and
query parsing. In this paper an attempt is made to evaluate the
performance of query processing algorithms in each of the
category. The evaluation was based on dataset as specified by
Forum for Information Retrieval [FIRE15]. The criteria used
for evaluation are precision and relative recall. The analysis is
based on the importance of each step in query processing. The
experimental results show that the significance of each step
in query processing and also the relevance of web semantics
and spelling correction in the user query.
ONTOLOGY-DRIVEN INFORMATION RETRIEVAL FOR HEALTHCARE INFORMATION SYSTEM : ...IJNSA Journal
In health research, one of the major tasks is to retrieve, and analyze heterogeneous databases containing
one single patient’s information gathered from a large volume of data over a long period of time. The
main objective of this paper is to represent our ontology-based information retrieval approach for
clinical Information System. We have performed a Case Study in the real life hospital settings. The results
obtained illustrate the feasibility of the proposed approach which significantly improved the information
retrieval process on a large volume of data over a long period of time from August 2011 until January
2012
Scaling Down Dimensions and Feature Extraction in Document Repository Classif...ijdmtaiir
-In this study a comprehensive evaluation of two
supervised feature selection methods for dimensionality
reduction is performed - Latent Semantic Indexing (LSI) and
Principal Component Analysis (PCA). This is gauged against
unsupervised techniques like fuzzy feature clustering using
hard fuzzy C-means (FCM) . The main objective of the study is
to estimate the relative efficiency of two supervised techniques
against unsupervised fuzzy techniques while reducing the
feature space. It is found that clustering using FCM leads to
better accuracy in classifying documents in the face of
evolutionary algorithms like LSI and PCA. Results show that
the clustering of features improves the accuracy of document
classification
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLSIJDKP
The biomedical research literature is one among many other domains that hides a precious knowledge, and
the biomedical community made an extensive use of this scientific literature to discover the facts of
biomedical entities, such as disease, drugs,etc.MEDLINE is a huge database of biomedical research
papers which remain a significantly underutilized source of biological information. Discovering the useful
knowledge from such huge corpus leads to various problems related to the type of information such as the
concepts related to the domain of texts and the semantic relationship associated with them. In this paper,
we propose a Two-level model for Self-supervised relation extraction from MEDLINE using Unified
Medical Language System (UMLS) Knowledge base. The model uses a Self-supervised Approach for
Relation Extraction (RE) by constructing enhanced training examples using information from UMLS. The
model shows a better result in comparison with current state of the art and naïve approaches
Context Driven Technique for Document ClassificationIDES Editor
In this paper we present an innovative hybrid Text
Classification (TC) system that bridges the gap between
statistical and context based techniques. Our algorithm
harnesses contextual information at two stages. First it extracts
a cohesive set of keywords for each category by using lexical
references, implicit context as derived from LSA and wordvicinity
driven semantics. And secondly, each document is
represented by a set of context rich features whose values are
derived by considering both lexical cohesion as well as the extent
of coverage of salient concepts via lexical chaining. After
keywords are extracted, a subset of the input documents is
apportioned as training set. Its members are assigned categories
based on their keyword representation. These labeled
documents are used to train binary SVM classifiers, one for
each category. The remaining documents are supplied to the
trained classifiers in the form of their context-enhanced feature
vectors. Each document is finally ascribed its appropriate
category by an SVM classifier.
A Semantic Retrieval System for Extracting Relationships from Biological Corpusijcsit
The World Wide Web holds a large size of different information. Sometimes while searching the World Wide Web, users always do not gain the type of information they expect. In the subject of information extraction, extracting semantic relationships between terms from documents become a challenge. This
paper proposes a system helps in retrieving documents based on the query expansion and tackles the extracting of semantic relationships from biological documents. This system retrieved documents that are relevant to the input terms then it extracts the existence of a relationship. In this system, we use Boolean
model and the pattern recognition which helps in determining the relevant documents and determining the place of the relationship in the biological document. The system constructs a term-relation table that accelerates the relation extracting part. The proposed method offers another usage of the system so the
researchers can use it to figure out the relationship between two biological terms through the available information in the biological documents. Also for the retrieved documents, the system measures the percentage of the precision and recall.
https://www.youtube.com/watch?v=Y_-o-4rKxUk
Machine learning powered metabolomic network analysis
Dmitry Grapov PhD,
Director of Data Science and Bioinformatics,
CDS- Creative Data Solutions
www.createdatasol.com
Metabolomic network analysis can be used to interpret experimental results within a variety of contexts including: biochemical relationships, structural and spectral similarity and empirical correlation. Machine learning is useful for modeling relationships in the context of pattern recognition, clustering, classification and regression based predictive modeling. The combination of developed metabolomic networks and machine learning based predictive models offer a unique method to visualize empirical relationships while testing key experimental hypotheses. The following presentation focuses on data analysis, visualization, machine learning and network mapping approaches used to create richly mapped metabolomic networks. Learn more at www.createdatasol.com
Using data mining methods knowledge discovery for text miningeSAT Journals
Abstract Many data mining techniques have been proposed for mining useful patterns in text documents. However, how to effectively use and update discovered patterns is still an open research issue, especially in the domain of text mining. Since most existing text mining methods adopted term-based approaches, they all suffer from the problems of polysemy and synonymy. Over the years, people have often held the hypothesis that pattern (or phrase)-based approaches should perform better than the term-based ones, but many experiments do not support this hypothesis. Proposed work presents an innovative and effective pattern discovery technique which includes the processes of pattern deploying and pattern evolving, to improve the effectiveness of using and updating discovered patterns for finding relevant and interesting information. Keywords:-Text mining, text classification, pattern mining, pattern evolving, information filtering.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Clinical Information Models (CIMs) expressed as archetypes play an essential role in the design and development of current Electronic Health Record (EHR) information structures. Although there exist many experiences about using archetypes in the literature, a comprehensive and formal methodology for archetype modeling does not exist. Having a modeling methodology is essential to develop quality archetypes, in order to guide the development of EHR systems and to allow the semantic interoperability of health data. In this work, an archetype modeling methodology is proposed. This paper describes its phases, the inputs and outputs of each phase, and the involved participants and tools. It also includes the description of the possible strategies to organize the modeling process. The proposed methodology is inspired by existing best practices of CIMs, software and ontology development. The methodology has been applied and evaluated in regional and national EHR projects. The application of the methodology provided useful feedback and improvements, and confirmed its advantages. The conclusion of this work is that having a formal methodology for archetype development facilitates the definition and adoption of interoperable archetypes, improves their quality, and facilitates their reuse among different information systems and EHR projects. Moreover, the proposed methodology can be also a reference for CIMs development using any other formalism.
The 10th Annual Utah Health Services Research Conference: Data Quality in Multi-Site Health Services and Comparative Effectiveness Research: Lessons from PHIS+ By: Ram Gouripeddi
Health Services Research Conference: March 16, 2015
Patient Centered Research Methods Core, University of Utah, CCTS
Call for paper 2012, hard copy of Certificate, research paper publishing, where to publish research paper,
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJCER, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, research and review articles, IJCER Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathematics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer review journal, indexed journal, research and review articles, engineering journal, www.ijceronline.com, research journals,
yahoo journals, bing journals, International Journal of Computational Engineering Research, Google journals, hard copy of Certificate,
journal of engineering, online Submission
Domain ontology development for communicable diseasescsandit
Web has become the very first resource to search for any kind of information. With the
emergence of semantic web, our search queries have started generating more informed results.
Ontologies are at the core of any semantic web application. They help in rapid development of
distributed systems by providing information on the fly. This key feature of distribution and
sharing of information has made ontologies as a new knowledge representation mechanism. A
mechanism which is strongly backed by a sound inference system. In this paper, we shall discuss
the development, verification and validation of an ontology in a health domain.
DOMAIN ONTOLOGY DEVELOPMENT FOR COMMUNICABLE DISEASEScscpconf
Web has become the very first resource to search for any kind of information. With the emergence of semantic web, our search queries have started generating more informed results.Ontologies are at the core of any semantic web application. They help in rapid development of
distributed systems by providing information on the fly. This key feature of distribution and
sharing of information has made ontologies as a new knowledge representation mechanism. A
mechanism which is strongly backed by a sound inference system. In this paper, we shall discuss the development, verification and validation of an ontology in a health domain.
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLSIJDKP
The biomedical research literature is one among many other domains that hides a precious knowledge, and
the biomedical community made an extensive use of this scientific literature to discover the facts of
biomedical entities, such as disease, drugs,etc.MEDLINE is a huge database of biomedical research
papers which remain a significantly underutilized source of biological information. Discovering the useful
knowledge from such huge corpus leads to various problems related to the type of information such as the
concepts related to the domain of texts and the semantic relationship associated with them. In this paper,
we propose a Two-level model for Self-supervised relation extraction from MEDLINE using Unified
Medical Language System (UMLS) Knowledge base. The model uses a Self-supervised Approach for
Relation Extraction (RE) by constructing enhanced training examples using information from UMLS. The
model shows a better result in comparison with current state of the art and naïve approaches
Context Driven Technique for Document ClassificationIDES Editor
In this paper we present an innovative hybrid Text
Classification (TC) system that bridges the gap between
statistical and context based techniques. Our algorithm
harnesses contextual information at two stages. First it extracts
a cohesive set of keywords for each category by using lexical
references, implicit context as derived from LSA and wordvicinity
driven semantics. And secondly, each document is
represented by a set of context rich features whose values are
derived by considering both lexical cohesion as well as the extent
of coverage of salient concepts via lexical chaining. After
keywords are extracted, a subset of the input documents is
apportioned as training set. Its members are assigned categories
based on their keyword representation. These labeled
documents are used to train binary SVM classifiers, one for
each category. The remaining documents are supplied to the
trained classifiers in the form of their context-enhanced feature
vectors. Each document is finally ascribed its appropriate
category by an SVM classifier.
A Semantic Retrieval System for Extracting Relationships from Biological Corpusijcsit
The World Wide Web holds a large size of different information. Sometimes while searching the World Wide Web, users always do not gain the type of information they expect. In the subject of information extraction, extracting semantic relationships between terms from documents become a challenge. This
paper proposes a system helps in retrieving documents based on the query expansion and tackles the extracting of semantic relationships from biological documents. This system retrieved documents that are relevant to the input terms then it extracts the existence of a relationship. In this system, we use Boolean
model and the pattern recognition which helps in determining the relevant documents and determining the place of the relationship in the biological document. The system constructs a term-relation table that accelerates the relation extracting part. The proposed method offers another usage of the system so the
researchers can use it to figure out the relationship between two biological terms through the available information in the biological documents. Also for the retrieved documents, the system measures the percentage of the precision and recall.
https://www.youtube.com/watch?v=Y_-o-4rKxUk
Machine learning powered metabolomic network analysis
Dmitry Grapov PhD,
Director of Data Science and Bioinformatics,
CDS- Creative Data Solutions
www.createdatasol.com
Metabolomic network analysis can be used to interpret experimental results within a variety of contexts including: biochemical relationships, structural and spectral similarity and empirical correlation. Machine learning is useful for modeling relationships in the context of pattern recognition, clustering, classification and regression based predictive modeling. The combination of developed metabolomic networks and machine learning based predictive models offer a unique method to visualize empirical relationships while testing key experimental hypotheses. The following presentation focuses on data analysis, visualization, machine learning and network mapping approaches used to create richly mapped metabolomic networks. Learn more at www.createdatasol.com
Using data mining methods knowledge discovery for text miningeSAT Journals
Abstract Many data mining techniques have been proposed for mining useful patterns in text documents. However, how to effectively use and update discovered patterns is still an open research issue, especially in the domain of text mining. Since most existing text mining methods adopted term-based approaches, they all suffer from the problems of polysemy and synonymy. Over the years, people have often held the hypothesis that pattern (or phrase)-based approaches should perform better than the term-based ones, but many experiments do not support this hypothesis. Proposed work presents an innovative and effective pattern discovery technique which includes the processes of pattern deploying and pattern evolving, to improve the effectiveness of using and updating discovered patterns for finding relevant and interesting information. Keywords:-Text mining, text classification, pattern mining, pattern evolving, information filtering.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Clinical Information Models (CIMs) expressed as archetypes play an essential role in the design and development of current Electronic Health Record (EHR) information structures. Although there exist many experiences about using archetypes in the literature, a comprehensive and formal methodology for archetype modeling does not exist. Having a modeling methodology is essential to develop quality archetypes, in order to guide the development of EHR systems and to allow the semantic interoperability of health data. In this work, an archetype modeling methodology is proposed. This paper describes its phases, the inputs and outputs of each phase, and the involved participants and tools. It also includes the description of the possible strategies to organize the modeling process. The proposed methodology is inspired by existing best practices of CIMs, software and ontology development. The methodology has been applied and evaluated in regional and national EHR projects. The application of the methodology provided useful feedback and improvements, and confirmed its advantages. The conclusion of this work is that having a formal methodology for archetype development facilitates the definition and adoption of interoperable archetypes, improves their quality, and facilitates their reuse among different information systems and EHR projects. Moreover, the proposed methodology can be also a reference for CIMs development using any other formalism.
The 10th Annual Utah Health Services Research Conference: Data Quality in Multi-Site Health Services and Comparative Effectiveness Research: Lessons from PHIS+ By: Ram Gouripeddi
Health Services Research Conference: March 16, 2015
Patient Centered Research Methods Core, University of Utah, CCTS
Call for paper 2012, hard copy of Certificate, research paper publishing, where to publish research paper,
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJCER, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, research and review articles, IJCER Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathematics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer review journal, indexed journal, research and review articles, engineering journal, www.ijceronline.com, research journals,
yahoo journals, bing journals, International Journal of Computational Engineering Research, Google journals, hard copy of Certificate,
journal of engineering, online Submission
Domain ontology development for communicable diseasescsandit
Web has become the very first resource to search for any kind of information. With the
emergence of semantic web, our search queries have started generating more informed results.
Ontologies are at the core of any semantic web application. They help in rapid development of
distributed systems by providing information on the fly. This key feature of distribution and
sharing of information has made ontologies as a new knowledge representation mechanism. A
mechanism which is strongly backed by a sound inference system. In this paper, we shall discuss
the development, verification and validation of an ontology in a health domain.
DOMAIN ONTOLOGY DEVELOPMENT FOR COMMUNICABLE DISEASEScscpconf
Web has become the very first resource to search for any kind of information. With the emergence of semantic web, our search queries have started generating more informed results.Ontologies are at the core of any semantic web application. They help in rapid development of
distributed systems by providing information on the fly. This key feature of distribution and
sharing of information has made ontologies as a new knowledge representation mechanism. A
mechanism which is strongly backed by a sound inference system. In this paper, we shall discuss the development, verification and validation of an ontology in a health domain.
Co-Extracting Opinions from Online ReviewsEditor IJCATR
Exclusion of opinion targets and words from online reviews is an important and challenging task in opinion mining. The
opinion mining is the use of natural language processing, text analysis and computational process to identify and recover the subjective
information in source materials. This paper propose a Supervised word alignment model, which identifying the opinion relation. Rather
than this paper focused on topical relation, in which to extract the relevant information or features only from a particular online reviews.
It is based on feature extraction algorithm to identify the potential features. Finally the items are ranked based on the frequency of
positive and negative reviews. Compared to previous methods, our model captures opinion relation and feature extraction more precisely.
One of the most advantages that our model obtain better precision because of supervised alignment model. In addition, an opinion
relation graph is used to refer the relationship between opinion targets and opinion words.
Custom-Made Ranking in Databases Establishing and Utilizing an Appropriate Wo...ijsrd.com
Custom Rating System which provides a facility to the users, that they can search and download best articles or anything on the system in the database. The article or anything can be any text content which can describe a product, a book, an institution, an application, a company or anything. This system consists of two set of users, one is the normal user and another is the administrator. The users have to register and login to the system first, in order to use the system. The users have the following privileges. Write Article and Upload Relevant Files, Post Related URL to each article for other users reference, Search and Read Article posted by other users, Rate the articles posted by other users. The articles which are written by any user are sent to the Administrator for Approval. After approval of the articles by the administrator, they are available for the users to search and download. Based on the Description provided in an article, it can be searched by any registered user on the system. The user can see the article, download a file if available and the user can rate the article based on the article. Rating can be given in terms of 1 Star to 5 Star. The users can search the article. The list of articles displayed can be sorted based on following parameters: Rating, Popularity (Number of Clicks on the Article),Relevance (Based on number of matching keywords provided),All Articles uploaded by a specific user.
Quest Trail: An Effective Approach for Construction of Personalized Search En...Editor IJCATR
Personalized search refers to search experiences that are tailored specifically to an individual's interests by incorporating information about the individual beyond specific query provided. Especially people working in a software development organization (analysts, developers, testers, maintenance team members), find it increasingly difficult to get relevant results to their searches. We propose methods to personalize searches by resolving the ambiguity of query terms, and increase the relevance of search results in order to match the user’s interests. Difficulty in web searches has given rise to the need for development of personalized search engines. Personalized search engines create user profiles to capture the users’ personal preferences and as such identify the actual goal of the input query. Since users are usually reluctant to explicitly provide their preferences due to the extra manual effort involved, the search engine faces the entire burden of predicting the user’s preferences and intentions behind a query in order to yield more relevant search results. In this paper we define a QUEST to be the objective of user’s search; here we combine quest level analysis of user’s search logs and semantic analysis of the user’s query in order to personalize user’s search results. Most personalization methods focus on the creation of one single profile for a user and apply the same profile to all of the user’s queries. Hence we propose a personalized search for a software development organization by creating QUEST or domain based profile rather than individual user based profile.
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...ISAR Publications
Mobile search engine is a meta search engine that imprisonments the user’s favorite in
the form of concepts by mining their click through data. But the search query is limited to small
words unlike those used when interacting with search engines through computers. It has become
popular because of presence of huge number of applications. Smartphone’s carry large amount of
personal information, such as user’s personal details, contacts, messages, emails, credit card
information, etc. User type specific search and finally Ontology based Search. Moreover opinion
mining is conducted to provide feedback and valuable suggestions given by the mobile users. Due
to the different characteristics of the content concepts and location concepts, use different
techniques for their concept extraction and ontology formulation. Moreover the individual users
can use this search engine, which runs on android platform. They can give feedbacks and
suggestions about the search result. Based on the feedback other users can get valuable
information about the services available in their location or nearby location.
A presentation that I gave at the Query Log Analysis: From Research to Best Practice Workshop 27 - 28 May 20098 in London, UK http://ir.shef.ac.uk/cloughie/qlaw2009/index.html
PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...ijseajournal
There is a wide range of available biological databases developed by bioinformatics experts, employing different methods to extract biological data. In this paper, we investigate and evaluate the performance of some of these methods in terms of their ability to efficiently access bioinformatics databases using webbased interfaces. These methods retrieve bioinformatics information using structured and semi-structured data tools, which are able to retrieve data from remote database servers. This study distinguishes each of these approaches and contrasts these tools. We used Sequence Retrieval System (SRS) and Entrez search tools for structured data, while Perl and BioPerl search programs were used for semi-structured data to retrieve complex queries including a combination of text and numeric information. The study concludes that the use of semi-structured data tools for accessing bioinformatics databases is a viable alternative to the structured tools, though each method is shown to have certain inherent advantages and disadvantages.
PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...ijseajournal
There is a wide range of available biological databases developed by bioinformatics experts, employing different methods to extract biological data. In this paper, we investigate and evaluate the performance of some of these methods in terms of their ability to efficiently access bioinformatics databases using webbased interfaces. These methods retrieve bioinformatics information using structured and semi-structured
data tools, which are able to retrieve data from remote database servers. This study distinguishes each of these approaches and contrasts these tools. We used Sequence Retrieval System (SRS) and Entrez search tools for structured data, while Perl and BioPerl search programs were used for semi-structured data to retrieve complex queries including a combination of text and numeric information. The study concludes that the use of semi-structured data tools for accessing bioinformatics databases is a viable alternative to the structured tools, though each method is shown to have certain inherent advantages and disadvantages.
Intranet and Enterprise Content Management Systems (ECMS) that do not deliver information quickly and efficiently are expensive electronic dumping grounds. Service desks, search, security, records handling, information sharing, all require timely information retrieval. These information retrieval applications need to be powered by categorizing and tagging your content just as e-commerce sites such as Amazon or online publishers like BBC do.
Findability is the ease of which someone can locate the information they want. Often, it is confused with search – but search is just one method of achieving findability. Search allows people to enter in words that they hope are contained in the content they want to retrieve. Findability includes any method of locating this content, including but not limited to searching. Pingar DiscoveryOne improves findability.
The storage requirements for organizations is growing between 45-60% each year. Much of this content is emails and documents rather than databases. This means that your IT storage budget is probably growing at least 5% annually just to keep up.
M&A deal teams work under heavy time pressure. During due- diligence and post M&A integration, Pingar reduces this pressure by making it easier to find and collect the right information, based on our deep understanding of text analytics.
Discover the cost of not finding documents. Familiarize yourself with statistics and risks associated to poor findability in Enterprise Content Managment System.
For more information go to: http://pingar.com/information-management/
The news is that SharePoint 2016 will improve search.
This can mean many things. It may, for example, mean that something that impacts search quality has improved. Taken in this sense, maybe the news means that SharePoint usability has improved, and this improvement supports user adoption, which translates into more inputs for an adaptive system to work with, which in turn can be expected to encourage adaptation in the search mechanism in a way that bene ts search, which itself supports further adoption and utilization, and creates a feedback loop that results in search improvement.
Global launch of the Healthy Ageing and Prevention Index 2nd wave – alongside...ILC- UK
The Healthy Ageing and Prevention Index is an online tool created by ILC that ranks countries on six metrics including, life span, health span, work span, income, environmental performance, and happiness. The Index helps us understand how well countries have adapted to longevity and inform decision makers on what must be done to maximise the economic benefits that comes with living well for longer.
Alongside the 77th World Health Assembly in Geneva on 28 May 2024, we launched the second version of our Index, allowing us to track progress and give new insights into what needs to be done to keep populations healthier for longer.
The speakers included:
Professor Orazio Schillaci, Minister of Health, Italy
Dr Hans Groth, Chairman of the Board, World Demographic & Ageing Forum
Professor Ilona Kickbusch, Founder and Chair, Global Health Centre, Geneva Graduate Institute and co-chair, World Health Summit Council
Dr Natasha Azzopardi Muscat, Director, Country Health Policies and Systems Division, World Health Organisation EURO
Dr Marta Lomazzi, Executive Manager, World Federation of Public Health Associations
Dr Shyam Bishen, Head, Centre for Health and Healthcare and Member of the Executive Committee, World Economic Forum
Dr Karin Tegmark Wisell, Director General, Public Health Agency of Sweden
The dimensions of healthcare quality refer to various attributes or aspects that define the standard of healthcare services. These dimensions are used to evaluate, measure, and improve the quality of care provided to patients. A comprehensive understanding of these dimensions ensures that healthcare systems can address various aspects of patient care effectively and holistically. Dimensions of Healthcare Quality and Performance of care include the following; Appropriateness, Availability, Competence, Continuity, Effectiveness, Efficiency, Efficacy, Prevention, Respect and Care, Safety as well as Timeliness.
Deep Leg Vein Thrombosis (DVT): Meaning, Causes, Symptoms, Treatment, and Mor...The Lifesciences Magazine
Deep Leg Vein Thrombosis occurs when a blood clot forms in one or more of the deep veins in the legs. These clots can impede blood flow, leading to severe complications.
Health Education on prevention of hypertensionRadhika kulvi
Hypertension is a chronic condition of concern due to its role in the causation of coronary heart diseases. Hypertension is a worldwide epidemic and important risk factor for coronary artery disease, stroke and renal diseases. Blood pressure is the force exerted by the blood against the walls of the blood vessels and is sufficient to maintain tissue perfusion during activity and rest. Hypertension is sustained elevation of BP. In adults, HTN exists when systolic blood pressure is equal to or greater than 140mmHg or diastolic BP is equal to or greater than 90mmHg. The
Explore our infographic on 'Essential Metrics for Palliative Care Management' which highlights key performance indicators crucial for enhancing the quality and efficiency of palliative care services.
This visual guide breaks down important metrics across four categories: Patient-Centered Metrics, Care Efficiency Metrics, Quality of Life Metrics, and Staff Metrics. Each section is designed to help healthcare professionals monitor and improve care delivery for patients facing serious illnesses. Understand how to implement these metrics in your palliative care practices for better outcomes and higher satisfaction levels.
Navigating Challenges: Mental Health, Legislation, and the Prison System in B...Guillermo Rivera
This conference will delve into the intricate intersections between mental health, legal frameworks, and the prison system in Bolivia. It aims to provide a comprehensive overview of the current challenges faced by mental health professionals working within the legislative and correctional landscapes. Topics of discussion will include the prevalence and impact of mental health issues among the incarcerated population, the effectiveness of existing mental health policies and legislation, and potential reforms to enhance the mental health support system within prisons.
Antibiotic Stewardship by Anushri Srivastava.pptxAnushriSrivastav
Stewardship is the act of taking good care of something.
Antimicrobial stewardship is a coordinated program that promotes the appropriate use of antimicrobials (including antibiotics), improves patient outcomes, reduces microbial resistance, and decreases the spread of infections caused by multidrug-resistant organisms.
WHO launched the Global Antimicrobial Resistance and Use Surveillance System (GLASS) in 2015 to fill knowledge gaps and inform strategies at all levels.
ACCORDING TO apic.org,
Antimicrobial stewardship is a coordinated program that promotes the appropriate use of antimicrobials (including antibiotics), improves patient outcomes, reduces microbial resistance, and decreases the spread of infections caused by multidrug-resistant organisms.
ACCORDING TO pewtrusts.org,
Antibiotic stewardship refers to efforts in doctors’ offices, hospitals, long term care facilities, and other health care settings to ensure that antibiotics are used only when necessary and appropriate
According to WHO,
Antimicrobial stewardship is a systematic approach to educate and support health care professionals to follow evidence-based guidelines for prescribing and administering antimicrobials
In 1996, John McGowan and Dale Gerding first applied the term antimicrobial stewardship, where they suggested a causal association between antimicrobial agent use and resistance. They also focused on the urgency of large-scale controlled trials of antimicrobial-use regulation employing sophisticated epidemiologic methods, molecular typing, and precise resistance mechanism analysis.
Antimicrobial Stewardship(AMS) refers to the optimal selection, dosing, and duration of antimicrobial treatment resulting in the best clinical outcome with minimal side effects to the patients and minimal impact on subsequent resistance.
According to the 2019 report, in the US, more than 2.8 million antibiotic-resistant infections occur each year, and more than 35000 people die. In addition to this, it also mentioned that 223,900 cases of Clostridoides difficile occurred in 2017, of which 12800 people died. The report did not include viruses or parasites
VISION
Being proactive
Supporting optimal animal and human health
Exploring ways to reduce overall use of antimicrobials
Using the drugs that prevent and treat disease by killing microscopic organisms in a responsible way
GOAL
to prevent the generation and spread of antimicrobial resistance (AMR). Doing so will preserve the effectiveness of these drugs in animals and humans for years to come.
being to preserve human and animal health and the effectiveness of antimicrobial medications.
to implement a multidisciplinary approach in assembling a stewardship team to include an infectious disease physician, a clinical pharmacist with infectious diseases training, infection preventionist, and a close collaboration with the staff in the clinical microbiology laboratory
to prevent antimicrobial overuse, misuse and abuse.
to minimize the developme
Defecation
Normal defecation begins with movement in the left colon, moving stool toward the anus. When stool reaches the rectum, the distention causes relaxation of the internal sphincter and an awareness of the need to defecate. At the time of defecation, the external sphincter relaxes, and abdominal muscles contract, increasing intrarectal pressure and forcing the stool out
The Valsalva maneuver exerts pressure to expel faeces through a voluntary contraction of the abdominal muscles while maintaining forced expiration against a closed airway. Patients with cardiovascular disease, glaucoma, increased intracranial pressure, or a new surgical wound are at greater risk for cardiac dysrhythmias and elevated blood pressure with the Valsalva maneuver and need to avoid straining to pass the stool.
Normal defecation is painless, resulting in passage of soft, formed stool
CONSTIPATION
Constipation is a symptom, not a disease. Improper diet, reduced fluid intake, lack of exercise, and certain medications can cause constipation. For example, patients receiving opiates for pain after surgery often require a stool softener or laxative to prevent constipation. The signs of constipation include infrequent bowel movements (less than every 3 days), difficulty passing stools, excessive straining, inability to defecate at will, and hard feaces
IMPACTION
Fecal impaction results from unrelieved constipation. It is a collection of hardened feces wedged in the rectum that a person cannot expel. In cases of severe impaction the mass extends up into the sigmoid colon.
DIARRHEA
Diarrhea is an increase in the number of stools and the passage of liquid, unformed feces. It is associated with disorders affecting digestion, absorption, and secretion in the GI tract. Intestinal contents pass through the small and large intestine too quickly to allow for the usual absorption of fluid and nutrients. Irritation within the colon results in increased mucus secretion. As a result, feces become watery, and the patient is unable to control the urge to defecate. Normally an anal bag is safe and effective in long-term treatment of patients with fecal incontinence at home, in hospice, or in the hospital. Fecal incontinence is expensive and a potentially dangerous condition in terms of contamination and risk of skin ulceration
HEMORRHOIDS
Hemorrhoids are dilated, engorged veins in the lining of the rectum. They are either external or internal.
FLATULENCE
As gas accumulates in the lumen of the intestines, the bowel wall stretches and distends (flatulence). It is a common cause of abdominal fullness, pain, and cramping. Normally intestinal gas escapes through the mouth (belching) or the anus (passing of flatus)
FECAL INCONTINENCE
Fecal incontinence is the inability to control passage of feces and gas from the anus. Incontinence harms a patient’s body image
PREPARATION AND GIVING OF LAXATIVESACCORDING TO POTTER AND PERRY,
An enema is the instillation of a solution into the rectum and sig
CHAPTER 1 SEMESTER V PREVENTIVE-PEDIATRICS.pdfSachin Sharma
This content provides an overview of preventive pediatrics. It defines preventive pediatrics as preventing disease and promoting children's physical, mental, and social well-being to achieve positive health. It discusses antenatal, postnatal, and social preventive pediatrics. It also covers various child health programs like immunization, breastfeeding, ICDS, and the roles of organizations like WHO, UNICEF, and nurses in preventive pediatrics.
2. ABSTRACT
This paper reports findings on desirable interface features for different
search tasks in the biomedical domain. We conducted a user study where
we asked bioscientists to evaluate the usefulness of autocomplete, query
expansions, faceted refinement, related searches and results preview
implementations in new pilot interfaces and publicly available systems
while using baseline and their own queries. Our evaluation reveals that
there is a preference for certain features depending on the search task.
In addition, we touch on the current pain point of faceted search: the
acquisition of faceted subject metadata for unstructured documents.
We found a strong preference for prototypes displaying just a few facets
generated based on either the query or the matching documents.
Topics:
Design Tools and Techniques, Measurement, User Interfaces, Search User
Interfaces, UI Design, Human Factors, Qualitative User Study
01
Search Interface Feature Evaluation in Biosciences
3. INTRO
Interface features are elements of search user interfaces, which facilitate
the search process. Examples of such features are autocomplete and query
expansion suggestions, faceted navigation, and document surrogates in
search results previews. Vast research exists on the usefulness of interface
features on the web [6], although less so in the biomedical domain.
We identified and addressed two open questions in studies of search user
interfaces: Queries and search tasks can be classified into categories [7],
but how should interface differ depending on the task? Faceted navigation
has been demonstrated useful for search in structured data [11], but which
approach to generating facet categories for unstructured documents works
best?
We conducted a qualitative user study to systematically evaluate
techniques to compute and present individual search features. In
extended interviews, bioscientists rated the usefulness of features in
common interfaces on baseline and their own queries, which we classified
as browsing, gathering information, and search for facts. Side-by-side
comparison allowed us to identify clear preferences in interface features
depending on these tasks.
After an overview of state-of-the-art in computing the features and
user-study outcomes, we discuss search tasks in the biomedical
domain. We then turn to the experiment and participating systems.
Finally, we discuss how the study was conducted and its findings.
02
Search Interface Feature Evaluation in Biosciences
1
4. Related work can be grouped based on the studied features.
Autocomplete (autosuggest) provides dynamic search suggestions
as the user types the query. Commonly suggestions originate from existing
query logs, but could also be computed using biomedical terminology
resources [9]. Users intuitively interact with autocomplete, increasingly so
with time [1]. It is recommended to display results before suggesting new
queries and to compute suggestions starting from all existing terms [6].
Query expansion suggests alternative query terms when users’ guesses
result in only a few or incorrect results. Such terms can be computed from
query logs or thesauri. Users react positively to search expansions as long
as their number is limited [6].
Faceted refinement helps to narrow down results based on a
dimension (facet) of the searched item. Computing facets for products,
accommodation or any structured data search is straightforward; however
search in unstructured documents limits facets to pre-existing metadata
such as author, subject headings or social bookmarking tags, and such may
not exist. Various approaches address this shortcoming. Named entity
extraction can generate facets such as people and organizations names [12].
Hierarchical clustering of search results allows using clusters’ labels as facet
categories (e.g. clusty.com). In biomedicine, existing controlled vocabularies
and ontologies are used to derive facets [5, 9] (see section 4.1.1). Faceted
refinement is welcomed by users in all studies, but good execution is the
key [11].
03
Search Interface Feature Evaluation in Biosciences
“Faceted refinement
is welcomed by users
in all studies, but good
execution is the key.”
2RELATED WORK
5. Related searches are query suggestions that lead to new searches
by either changing the query focus or refining it. Such suggestions
can be derived dynamically from top search results [2]. A quarter
of search sessions made use of these suggestions, but their
effectiveness is questionable. A study of related searches in the
biomedical domain reports a strong desire for gene and organism
names that can be driven from bioscience ontologies [4].
Preview of results helps users judge the relevance of their searches
by listing surrogates for each document containing its title, URL,
preview and sometimes keywords. Document previews can be
most relevant sentences derived via query-based summarization
[10] or snippets that combine multiple sentences while replacing
their irrelevant parts with ellipses [3]. Studies indicate that users
prefer non-truncated sentences and the preview including
document summary should put all query terms in context [6].
The above studies provide insights into the usefulness and
effectiveness of interface features, but do not tell if there is a
preference in certain features depending on the search task.
Another gap is a comparison of different techniques that
implement faceted navigation in unstructured documents.
04
Search Interface Feature Evaluation in Biosciences
RELATED WORK
“The above studies
provide insights
into usefulness and
effectiveness of interface
features.”
6. Kellar et al. classify information seeking tasks into four major
categories and analyze how often people conduct and repeat these
tasks [7]. Nearly 50% of search queries relate to Transactions
(email, banking, shopping), all of which are frequently repeated.
Other queries are somewhat equally split between Browsing
(blogs, news), Information Gathering (e.g. graduate schools to
apply) and Fact Finding (e.g. weather forecasts). The latter three
are conducted by bioscientists in their daily work when they
browse for new publications, gather information on particular
genes, proteins or diseases, or search for facts, for instance in
biomedical databases. Our study takes this differentiation into
account when analyzing scientists’ rankings of interface features.
05
Search Interface Feature Evaluation in Biosciences
3SEARCH TASKS IN BIOSCIENCE
I need to collect publications
by others on connexins
& how they relate to our studies
BROWSING
FACT FINDING
7. The aim of this study is to identify which search interface features
are useful for searching the biomedical literature. Additionally, we
strived to understand which approaches to faceted navigation for
this domain work best. Based on the current knowledge of user
preferences we hypothesized that users prefer different interface
features depending on the search task.
4.1 Interface Features & Evaluated Systems
To test our hypothesis, we implemented two prototype systems. To
test the usefulness of the features more broadly, we included two
systems primarily used in the biomedical domain (PubMed) and
general search (Google), as well as additional publicly available
systems that handle such features in different ways in bioscience
(GoPubMed, Semedico, NextBio) and general search (Bing).
06
Search Interface Feature Evaluation in Biosciences
4EXPERIMENT DESCRIPTION
8. 4.1.1 Overview of the studied systems
PubMed (ncbi.nlm.nih.gov/pubmed) is the primary search engine used by
bioscientists for their research, as it comprises over 20 million citations
for biomedical literature from MEDLINE, life science journals, and books.
Articles are indexed with Medical Subject Headings (MeSH) [8]. GoPubMed
(gopubmed.com) is a semantic search engine for the biomedical domain. It
provides refinement of PubMed search results using the original hierarchy
of structured vocabularies: Gene Ontology (GO) and MeSH [5]. Semedico
(semedico.org) [9] is a faceted biomedical search system with a ranked
list interface. The facets are populated from a semantic index generated
by disambiguating words in articles to corresponding concepts in MeSH
and UniProt. Its hierarchy of top 20 categories was defined by biologists.
NextBio (nextbio.com) is a commercial ontology-based semantic framework
based on gene, tissue, disease and compound ontologies. It combines
literature with data such as clinical trials. Google (google.com) is the second
most common search engine used by scientists for their work. According
to our findings, it is often preferred to PubMed for searching methodology
and techniques (laboratory protocols). Bing (bing.com) handles queries,
ranking and some of the features we study in this paper somewhat different
to Google.
Our two prototypes were built to test additional ways of implementing and
representing features of a search user interface. We used the Pingar API
(pingar.com) for semantic analysis of queries and documents and Apache
Solr (lucene.apache.org/solr) for full-text indexing and searching. We
indexed the 85,000 articles in the Open Access PubMed dataset for this
purpose (ncbi.nlm.nih.gov/pmc/tools/openftlist). The prototypes allowed
us to test Pingar’s tools for generating query expansions, related searches,
keywords, summaries and taxonomy mapping, as well as Solr’s built-in
faceted search and snippet extraction features.
07
Search Interface Feature Evaluation in Biosciences
EXPERIMENT DESCRIPTION
“Our two prototypes were
built to test additional
ways ofimplementing and
representing features of a
search user interface.”
9. 4.1.2 Implementation of tested interface features
Only certain systems were selected for testing each feature. Here, we
list how each feature is supported by the systems we tested. Information
provided on systems’ websites and publications.
Autocomplete: PubMed employs Automatic Term Mapping that compares
and maps user’s search terms to lists of pre-indexed terms. GoPubMed
matches typed terms to MeSH and GO terms. Semedico places the
suggestions in a taxonomy tree allowing users to select a broader term as
their query. Synonyms are listed in brackets. NextBio lists matching genes,
compounds, SNPs, diseases, tissues, biogroups and authors. Google predicts
suggestions based on other users’ search activities – for certain queries
it analyzes just the last two words. Bing also computes suggestions using
user’s queries and boosts trending queries.
Query expansion: PubMed displays the “search details” that combine (sub)
headings, fields and Boolean. Users can edit them and re-submit. Semedico
displays terms identified in the query and users may remove one from
the search. Pingar suggests misspellings, grammatically similar terms and
synonyms as checkboxes to add to the query using OR.
08
Search Interface Feature Evaluation in Biosciences
EXPERIMENT DESCRIPTION
“Pingar suggests
misspellings,
grammatically similar
terms and synonyms.”
10. Faceted search: PubMed allows filtering by “free full text” or
“reviews” and shows the number of matching results in brackets.
Suggestions are displayed as links. GoPubMed categorizes
filtering suggestions into “Top Terms” (more specific) and
“Knowledge Base” (more generic), ordered by relevance.
Suggestions are displayed as checkboxes allowing multiple
selections. Semedico displays 9 top level MeSH terms as facet
categories each in a differently colored box. Per category, top 3
most frequent terms are shown (numbers in brackets). Expanding
leads to more terms or their child terms. Solr was chosen to
evaluate faceted refinement based on indexed metadata: journal
year and title, and keywords, generated by Pingar. Single and
multiple selections were tested (links vs. checkboxes). Pingar
dynamically generates facet categories by first mapping top 10
search results to terms in multiple biomedical taxonomies and
then walking up the taxonomy tree to find common broader terms.
A different variation of top 3 most relevant facets is displayed for
each query. Each facet lists top 5 most frequent terms and can be
expanded to see more. Some screenshots showed terms computed
by analyzing only text surrounding the query terms (QB), others
the entire content of the document (DB). The intention was to
evaluate whether search query should aid as a context when
computing facet values. We also tested preference over choosing
one or multiple terms per facet category (links vs. checkboxes).
09
Search Interface Feature Evaluation in Biosciences
EXPERIMENT DESCRIPTION
11. Related searches: PubMed suggests variations of the query in the “also
try” area, but such terms are not always available. Google offers two kinds
of related searches in different parts of the interface: Searches for things
of similar kind (e.g. “aquaporin” for “connexin”) and more specific searches
(e.g. “connexin 26” for “connexin”). Bing’s two areas designated to related
searches show the same suggestions formatted as one or two columns. The
suggestions are variations of the original query with added or modified
parts. Pingar also computes related searches, but instead of query logs, top
search results are analyzed for suggestions.
Results preview: We limited the evaluation of this interface feature to
prototype systems only and tested the following features: (1) Keywords are
usually defined by authors or extracted automatically to represent the key
topics in an article. We used Pingar API to compare two cases: extracting
keywords from the text surrounding the query terms and from the entire
document. The intention was to evaluate whether search query should
aid as a context when computing keywords. (2) Document preview is
commonly implemented using sentence snippets containing the query
terms. One prototype used Solr to extract top 3 such snippets per
document. Another one used Pingar’s query-based summary extraction tool
to display the top scoring sentence in the document, and the top scoring
paragraph on mouse-over.
step
10
Search Interface Feature Evaluation in Biosciences
EXPERIMENT DESCRIPTION
“Keywords are usually
defined by authors or
extracted automatically to
represent the key topics in
an article.”
12. 11
Search Interface Feature Evaluation in Biosciences
4.2 The Study
We run an exploratory short study with 6 bioscientists (2 faculty,
2 postdocs, 2 PhD students), where we explained them the 3 types
of search and asked them for examples of such searches they use
for their work: queries and resources/systems. The study was
conducted in person with each participant (10-15 min sessions).
We recruited 10 bioscientists to participate in the main study. All
of them are researchers in academia in various biological areas:
Developmental, Molecular, Cell, Evolutionary, Transcriptional
and Systems Biology, as well as Biochemistry, Immunology,
Genetics, Population Genetics, and Neuroscience. 2 of them are
faculty; 7 are postdoctoral researchers (having received PhDs
from 2005 to 2010) and 1 final year PhD student.
We selected the interface features we wanted to study for the 3
different types of search and identified systems that handle these
features in a different manner. We asked the participants, via email
prior the sessions, to supply us with 4 different queries they use.
We also asked for use frequency, resources, to elaborate on what
information they are looking for, and to fill out an informed
consent form. We selected two baseline queries based on our
exploratory study. We also selected one personal query from each
participant. The selection was done aiming at having a range for
the 3 search types. For each query we took screenshots in different
systems available for that query, and isolated the part of the system
that shows the interface feature in question (logos were removed).
EXPERIMENT DESCRIPTION
13. 12
Search Interface Feature Evaluation in Biosciences
EXPERIMENT DESCRIPTION
During the in-person sessions (each lasted 1-2 hours), we showed
participants in PowerPoint presentation screenshots for one baseline
query (of their choice) and for one of their own queries (Table 1). For each
feature, we asked them to rate overall usefulness and aesthetics using a 5
point Likert scale. Then for each system demonstrating possible handling
of each feature, we asked them to rate usefulness and aesthetics using
again the Likert scale. Finally for each feature, we asked them to rank the
systems in order of preference. Throughout the sessions, we applied the
talk aloud protocol and encouraged comments and suggestions.
Table 1. Queries and search type classification
14. Below we discuss participants’ reactions to content and overall usefulness
of the interface features. Given the space limitations, we plan to present
our aesthetics preference findings in a different venue. Overall, participants
told us that aesthetics are important (and need to be “good enough” to
use a system) but what really matters is the content. We would also like to
emphasize that all scores presented in this paper are for specific features
supported by each system and are not reflective of the systems as whole.
5.1 Interface Features vs. Search Tasks
Figure 1 summarizes how participants judged the usefulness of interface
features for their queries. The length of each bar equals the number of
ratings. All 4 browsing participants liked autocomplete, whereas 5 out of
6 participants with info gathering queries rated it as neutral and 1 out of 6
as not useful. This shows a clear difference in the usefulness of this feature.
For query expansions, participants expressed positive or neutral opinions
for
browsing and mixed opinions for the other search types. Faceted
refinement was rated mostly useful for all search types with equal neutral
scores for information gathering. Related searches got predominantly
neutral or negative reactions, except for browsing, for which they are
equally spread. Not surprisingly, it’s useful to see document previews
for all search types. Comments analysis shows that snippets are better
for browsing, whereas summaries with full sentences for finding specific
or specialized information. The results are intuitive and participant’s
comments confirm these. We encourage the interface designers to give
priority to “green” elements of the search display and do not bother much
about related searches for biologists. Most of them told us that their
searches are usually specific and even correctly suggested related searches
are not of interest. Access to query expansions is important to experts, but
should not be a default feature.
13
Search Interface Feature Evaluation in Biosciences
5RESULTS
“Faceted refinement
was rated mostly useful
for all search types.”
15. 5.2 System Rankings
For each interface feature participants ranked the systems in order of
preference. Table 2 shows the systems ranked at the top and at the bottom
(top/bottom two were considered if 5 or more systems were compared,
top/bottom one if 4 or less were compared).
5.3 Comparisons with Previous Findings
Our results agree with those reported by Scheider et al. that facets are well
received by bioscientists, but autocomplete is less important [9]. Scheider
et al. also argue that in biosciences, a large number of facets are needed
per query, which they grouped into collapsible tabs in Semedico. While
our participants were positive on collapsible tabs (judging by comments
but not explicit testing), 9 out of 10 did not want to see a large number of
faceted groups. The large number of choice and the inevitable redundancy
overwhelmed them. They commented that they would not spend time
inspecting the facets despite speculating that some might be useful. Divoli
et al. found that bioscientists like to refine searches by organism names [4]
and two our participants also commented they really liked Semedico’s facet
“Organisms” (again not explicitly tested). Our results also confirm findings
in [4] that users prefer selecting multiple suggestions using checkboxes.
14
Search Interface Feature Evaluation in Biosciences
RESULTS
Figure 1. Usefulness ratings for interface features & search tasks:
browsing (br), fact finding (ff) & information gathering (ig)
16. 5.4 More Findings on Interface Features
During the course of the study, participants provided us with interesting
suggestions that are not currently implemented by the systems. Below we
categorize them by feature.
Autocomplete was preferred when the major part of the query is typed
and users feel pigeon holed if suggestions come up with the first characters.
Specific suggestions work best here. Query expansions do not need to
include misspellings and close grammatical forms. These should be included
in search automatically. Overall biologists mostly refine and focus searches,
but not expand them. Faceted refinement is always desired, with
checkboxes that enable multiple selections. Too much information and too
many categories scare them away, as does redundancy of terms (across
categories and within each category). Simpler designs are better (e.g., not
too many colours) - this is why Pingar’s top 3 ranked facets with a few
values each scored highly. Users expect facet categories to reflect query
types, e.g. if the query mentions a disease, conditions should be shown, but
not other diseases. Many liked the ability to refine search by a specific
keyword related to their query, offered by GoPubMed’s “top terms” and
Pingar’s keywords. Some commented that year, publication and even the
entire faceted refinement column should not be displayed by default.
PubMed’s option to refine by reviews was highly favored. Some explained
that besides offering comprehensive information, reviews help to discover
important papers by navigating through the references. Pingar DB’s
document preview was preferred for general searches (baseline queries)
and Pingar QB for specific searches (their own queries). Related searches
are not desired, except for browsing. Scientists dislike clicking on links
leading them to new or broader searches.
step
15
Search Interface Feature Evaluation in Biosciences
RESULTS
Table 2. Top and bottom ranked systems
“Faceted refinement is always
desired, with checkboxes that
enable multiple selections.”
17. 16
Search Interface Feature Evaluation in Biosciences
This paper demonstrates user preferences for different search features
depending on search types in the biomedical domain. Although the search
tasks bioscientists perform are not clearly distinct from each other, in the
future we would like to study which tasks to prioritize and how to integrate
features on the interface to allow optimization for switching search types.
6CONCLUSIONS & FUTURE WORK
18. 17
Search Interface Feature Evaluation in Biosciences
7. Acknowledgments
We are extremely grateful to all participants for their contribution.
8. References
[1] Anick, P. and Kantamneni, R. 2008. A longitudinal study of
real-time search assistance adoption. In SIGIR’08: 701-702.
[2] Anick, P. 2003. Using terminological feedback for web
search refinement. In SIGIR’03: 88-95.
[3] Cutrell, E. and Guan, Z. 2007. An eye-tracking study of
information usage in web search. In CHI’07: 407-416.
[4] Divoli, A., Hearst, M.A. and Wooldridge, M.A. 2008.
Evidence for Showing Gene/Protein Name Suggestions in
Bioscience Literature Search, PSB 2008.
[5] Doms, A. and Schroeder, M. 2005. GoPubMed: Exploring
PubMed with the Gene Ontology.Nucl Acids Res,33:783-786
[6] Hearst, M.A. Search User Interfaces. Cambridge UP, 2009.
[7] Kellar, M., Watters, C. and Shepherd, M. 2007. A field study
characterizing web-based information seeking tasks. JASIST,
58(7), 999-1018.
[8] Lu, Z., Wilbur, J.W., et al. 2009. Finding query suggestions
for PubMed, AMIA Annu Symp Proc. 2009: 396–400.
[9] Schneider, A., Landefeld, R., Wermter, J. and Hahn, U.
(2009) Do users appreciate novel interface features for
literature search? Systems, Man and Cybernetics, SMC 2009.
[10] Tombros, A. and Sanderson, M. 1998. Advantages of query
biased summaries in IR. In SIGIR’98: 2-10.
[11] Tunkelang, D. Faceted Search. Morgan and Claypool, 2009.
[12] Tunkelang, D. 2006. Dynamic category sets: An approach
for facetted search. In SIGIR’06 Faceted Search Workshop.
ACKNOWLEDGMENTS & REFERENCES
19. DiscoveryOneTM
DiscoveryOne Content Enrichment is the
easiest way to improve search, enable
defensible deletion and identify document
security risks. By reading, categorizing
and tagging documents, DiscoveryOne
automatically creates metadata. This
metadata can be used in systems such as
enterprise search, document management,
email and CRM.
CONTENT ENRICHMENT
DiscoveryOne Content Inventory reads file
systems to present an overview of what
they contain. It identifies documents that
are redundant, outdated, trivial, or useful
and worth retaining. Usually, a file system
contains only 25% of relevant, valuable and
useful content, the rest are candidates for
disposal.
CONTENT INVENTORY
www.pingar.com | North America +1 408 663 2328 | Asia Pacific +64 9 950 3299 | Europe and Asia +91 80 4212 7047 | info@pingar.com
The study presented here was originally published as a workshop paper entitled: “Search interface feature evaluation in biosciences”
by Anna Divoli and Alyona Medelyan.
It was presented at the HCIR 2011 workshop that took place at Google’s Mountain View campus.