A comprehensive presentation on Federated Search (FS) Technologies including the types of FS, FS Challenges & Benefits, a case study, FS Evaluation Criteria, Examples of FS Solutions, Best Practices and Future Vision of where FS Technologies may go.
Findability Primer by Information Architected - the IA Primer SeriesDan Keldsen
Findability - The Art and Science of Making Content Findable
Why Findability is Critical Today
Content without access is worthless. With the advent and maturity of the Internet, what was once exclusively the domain of libraries and the private collections of enterprises is now a broadly understood issue.
Case in point: Moments ago, I entered the word “Findability” into a search tool that indexes the Internet.
More than 543,000 individual bodies of content were retrieved. Eureka – Findability solved, right? With a simple search, I am able to retrieve “all” of that content. No. The rules of the game have changed significantly.
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
When your colleagues say they want Google, they don’t mean the Google Search Appliance. They mean the Google Search user experience: pervasive, expedient and delivering the information that they need. Successful enterprise search does not start with the application features, is not part of the information architecture, does not come from a controlled vocabulary and does not emerge on its own from the developers. It requires enterprise-specific data mining, enterprise-specific user-centered design and fine tuning to turn “search sucks” into search success within the firewall. This presentation looks at action items, tools and deliverables for Discovery, Planning, Design and Post Launch phases of an enterprise search deployment.
Using the LucidWorks REST API to Support User-Configuration Big Data Search E...lucenerevolution
Presented by Mark Davis, CTO Kitenga - See conference video - http://www.lucidimagination.com/devzone/events/conferences/lucene-revolution-2012
Kitenga's Analyst system uses the LucidWorks Enterprise REST API in a variety of ways, including for configuring collections and managing Solr schema. As part of the Kitenga platform, the ZettaSearch Designer empowers the end-user to dynamically drag-and-drop search widgets to create a specialized search interface. For a user to effectively design search UIs that meet their needs, they need to be able to understand the available schema fields that populate a given collection. ZettaSearch Designer interrogates the Solr infrastructure using the Lucid REST API to provide an overview of the available metadata. It is then easy for the user to build rich, facetted search experiences around the metadata library indexed into the collection. In this implementation overview, I will describe the design of ZettaSearch Designer, how it interacts with big data technologies like Hadoop as part of the indexing pipeline, and how it uses the LucidWorks API to enable user discovery of the metadata needed to create novel search user interfaces on the fly.
Findability Primer by Information Architected - the IA Primer SeriesDan Keldsen
Findability - The Art and Science of Making Content Findable
Why Findability is Critical Today
Content without access is worthless. With the advent and maturity of the Internet, what was once exclusively the domain of libraries and the private collections of enterprises is now a broadly understood issue.
Case in point: Moments ago, I entered the word “Findability” into a search tool that indexes the Internet.
More than 543,000 individual bodies of content were retrieved. Eureka – Findability solved, right? With a simple search, I am able to retrieve “all” of that content. No. The rules of the game have changed significantly.
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
When your colleagues say they want Google, they don’t mean the Google Search Appliance. They mean the Google Search user experience: pervasive, expedient and delivering the information that they need. Successful enterprise search does not start with the application features, is not part of the information architecture, does not come from a controlled vocabulary and does not emerge on its own from the developers. It requires enterprise-specific data mining, enterprise-specific user-centered design and fine tuning to turn “search sucks” into search success within the firewall. This presentation looks at action items, tools and deliverables for Discovery, Planning, Design and Post Launch phases of an enterprise search deployment.
Using the LucidWorks REST API to Support User-Configuration Big Data Search E...lucenerevolution
Presented by Mark Davis, CTO Kitenga - See conference video - http://www.lucidimagination.com/devzone/events/conferences/lucene-revolution-2012
Kitenga's Analyst system uses the LucidWorks Enterprise REST API in a variety of ways, including for configuring collections and managing Solr schema. As part of the Kitenga platform, the ZettaSearch Designer empowers the end-user to dynamically drag-and-drop search widgets to create a specialized search interface. For a user to effectively design search UIs that meet their needs, they need to be able to understand the available schema fields that populate a given collection. ZettaSearch Designer interrogates the Solr infrastructure using the Lucid REST API to provide an overview of the available metadata. It is then easy for the user to build rich, facetted search experiences around the metadata library indexed into the collection. In this implementation overview, I will describe the design of ZettaSearch Designer, how it interacts with big data technologies like Hadoop as part of the indexing pipeline, and how it uses the LucidWorks API to enable user discovery of the metadata needed to create novel search user interfaces on the fly.
Exploring Process Barriers to Release Public Sector Information in Local Gove...Peter Conradie
Conradie, P. & Choenni, S., 2012. Exploring Process Barriers to Release Public Sector Information in Local Government. In 6th International Conference on Theory and Practice of Electronic Governance, Albany. NY. Albany, New York, pp. 5–13.
DataCite and Campus Data Services
Paul Bracke, Associate Dean for Digital Programs and Information Services, Purdue University
Research libraries are increasingly interested in developing data services for their campuses. There are many perspectives, however, on how to develop services that are responsive to the many needs of scientists; sensitive to the concerns of scientists who are not always accustomed to sharing their data; and that are attractive to campus administrators. This presentation will discuss the development of campus-based data services programs, the centrality of data citation to these efforts, and the ways in which engagement with DataCite can enhance local programs.
"At the toolbar (menu, whatever) associated with a document there is a button marked "Oh, yeah?". You press it when you lose that feeling of trust. It says to the Web, 'so how do I know I can trust this information?'. The software then goes directly or indirectly back to metainformation about the document, which suggests a number of reasons."
Tim Berners-Lee, W3C Chair, Web Design Issues, September 1997
Provenance is focused on the description and understanding of where and how data is produced, the actors involved in the production of such data, and the processes by which the data was manipulated and transformed until it arrived to the collection from which it is being accessed. Provenance aims at providing the ability to trace the sources of data, enabling the exploration not just of the relationships between datasets, but also of their authors and affiliations, with the goal of preserving data ownership and establishing a notion of trust based on authenticity and reliability.
The Future Internet poses important challenges for provenance, derived from complex and rich scenarios characterized by the presence of large amounts of data stemming from heterogeneous sources like user communities, services, and things. Such challenges span across technical but also socioeconomic dimensions. The former includes aspects like vocabularies for representing provenance, interoperability and scalability issues, and means to produce, acquire, and reason with provenance in order to provide measures of trust and information quality. However, it is probably in the socieconomic dimension where more significant efforts need to be made as to addressing issues like the role of provenance in the overall picture of the Future Internet, entry barriers preventing the generation of provenance-aware internet content, means required to incentivate the production of such content, and ways to prevent provenance forgery.
In this talk, we provide and overview on provenance and the above mentioned challenges and introduce ongoing work in order to address trust issues from the provenance perspective in the Future Internet. We also link provenance to other relevant aspects for trust discussed in the session, like security, legal frameworks, and economics.
Scientific discovery and innovation in an era of data-intensive science
William (Bill) Michener, Professor and Director of e-Science Initiatives for University Libraries, University of New Mexico; DataONE Principal Investigator
The scope and nature of biological, environmental and earth sciences research are evolving rapidly in response to environmental challenges such as global climate change, invasive species and emergent diseases. Scientific studies are increasingly focusing on long-term, broad-scale, and complex questions that require massive amounts of diverse data collected by remote sensing platforms and embedded environmental sensor networks; collaborative, interdisciplinary science teams; and new tools that promote scientific data preservation, discovery, and innovation. This talk describes the challenges facing scientists as they transition into this new era of data intensive science, presents current solutions, and lays out a roadmap to the future where new information technologies significantly increase the pace of scientific discovery and innovation.
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Findwise
With data volumes growing by 200 percent a year, knowledge workers are spending around 30 percent of their time trying to extract useful information. Furthermore a recent U.S. study asserted that knowledge workers spend more than twice as much time re-creating already created content as they spend creating new content. In addition to this time spent on maintaining structures for storing incoming unstructured information (e.g. mail, documents etc) is increasing rapidly.
Enabling search solutions makes information easy to find, however the key is to transform this information into knowledge. This is normally not done by simple intranet search functionality, however the intranet portal can act as a portal to a knowledge management system based on advanced search functionality withadded collaborative functions. This transforms your organization into a “knowledge finding organization”, creating an even more competitive organization.
Knowledge Management systems based on an Enterprise Search Platform (ESP) can, if implemented properly, significantly improve the efficiency of an organization. IDC Research suggests in their latest report (April 2006) “Hidden cost of information Work” that the cost for wasted time on the part of professional searching, but not finding relevant information, amounts to $5.3 million annually for an enterprise with 1000 knowledge workers.
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint SummitJoel Oleson
The Four Pillars of Search really help you focus your search planning. In this session we dig into the context, content, metadata and UX or user experience that really matter. We also dig into a variety of publicly accessible SharePoint 2013 real world search pages to demonstrate the value.
Finding, or not finding, information is consistently the most called out issue in the enterprise. Technology companies spend millions developing features that remain idle because, while everyone is concerned about optimizing enterprise search, no one is doing anything about it. The PM cuts the budget because "the devs will do it." The IA/UX architects do not have the specific expertise. The developers want to do it but do not have appropriate guidance.
This is a call-to-action for developers and ITpros to make sure that they get what they need to make search in the enterprise work. Because, after the interactive marketing agency has left the building, they are the ones that will be hearing "search sucks" directed at them.
Organizations are beginning to recognize that search is not a stand-alone technology or application, but must be integrated with business processes and corporate objectives as a key infrastructure component.
Why? Providing enriched metadata to the search engine index significantly improves search applications, eDiscovery, FOIA requests, and collaboration.
In this webinar COMPU-DATA International and Concept Searching will demonstrate their combined offering that uses unique, language independent technology and integrated enterprise metadata repository management, to deliver intelligent metadata enabled search.
What you will learn about during this session:
• How our innovative technology delivers both high precision and high recall, using industry unique compound term processing
• How to accomplish federated search as content is created or ingested
• How to enable true concept based searching
• How to eliminate end user tagging
• How to integrate the combined solution with any search engine including SharePoint, the former FAST products, Google Search Appliance, IBM Vivisimo, and Solr
• How the combined solution can be extended to address records identification, protection of privacy information, migration, and text analytics with the same technology
• Benefit from industry-specific use cases:
• Developing a powerful search solution for the US Army, creating easy access to millions of records, with an integrated solution to consolidate many data sources, accessing high volumes of data
• Solving search, migration, records management, and data privacy challenges to manage the intranet for a global company which designs, manufactures, and distributes appliances to more than 70 countries
Aiim Webinar Helen Mitchell Unified Search Final 7 21 2010Helen Mitchell
About the available tools & techniques for leveraging information from cloud-borne, social & internal applications to aggregate ideas, impact thinking, & drive business decisions.
Exploring Process Barriers to Release Public Sector Information in Local Gove...Peter Conradie
Conradie, P. & Choenni, S., 2012. Exploring Process Barriers to Release Public Sector Information in Local Government. In 6th International Conference on Theory and Practice of Electronic Governance, Albany. NY. Albany, New York, pp. 5–13.
DataCite and Campus Data Services
Paul Bracke, Associate Dean for Digital Programs and Information Services, Purdue University
Research libraries are increasingly interested in developing data services for their campuses. There are many perspectives, however, on how to develop services that are responsive to the many needs of scientists; sensitive to the concerns of scientists who are not always accustomed to sharing their data; and that are attractive to campus administrators. This presentation will discuss the development of campus-based data services programs, the centrality of data citation to these efforts, and the ways in which engagement with DataCite can enhance local programs.
"At the toolbar (menu, whatever) associated with a document there is a button marked "Oh, yeah?". You press it when you lose that feeling of trust. It says to the Web, 'so how do I know I can trust this information?'. The software then goes directly or indirectly back to metainformation about the document, which suggests a number of reasons."
Tim Berners-Lee, W3C Chair, Web Design Issues, September 1997
Provenance is focused on the description and understanding of where and how data is produced, the actors involved in the production of such data, and the processes by which the data was manipulated and transformed until it arrived to the collection from which it is being accessed. Provenance aims at providing the ability to trace the sources of data, enabling the exploration not just of the relationships between datasets, but also of their authors and affiliations, with the goal of preserving data ownership and establishing a notion of trust based on authenticity and reliability.
The Future Internet poses important challenges for provenance, derived from complex and rich scenarios characterized by the presence of large amounts of data stemming from heterogeneous sources like user communities, services, and things. Such challenges span across technical but also socioeconomic dimensions. The former includes aspects like vocabularies for representing provenance, interoperability and scalability issues, and means to produce, acquire, and reason with provenance in order to provide measures of trust and information quality. However, it is probably in the socieconomic dimension where more significant efforts need to be made as to addressing issues like the role of provenance in the overall picture of the Future Internet, entry barriers preventing the generation of provenance-aware internet content, means required to incentivate the production of such content, and ways to prevent provenance forgery.
In this talk, we provide and overview on provenance and the above mentioned challenges and introduce ongoing work in order to address trust issues from the provenance perspective in the Future Internet. We also link provenance to other relevant aspects for trust discussed in the session, like security, legal frameworks, and economics.
Scientific discovery and innovation in an era of data-intensive science
William (Bill) Michener, Professor and Director of e-Science Initiatives for University Libraries, University of New Mexico; DataONE Principal Investigator
The scope and nature of biological, environmental and earth sciences research are evolving rapidly in response to environmental challenges such as global climate change, invasive species and emergent diseases. Scientific studies are increasingly focusing on long-term, broad-scale, and complex questions that require massive amounts of diverse data collected by remote sensing platforms and embedded environmental sensor networks; collaborative, interdisciplinary science teams; and new tools that promote scientific data preservation, discovery, and innovation. This talk describes the challenges facing scientists as they transition into this new era of data intensive science, presents current solutions, and lays out a roadmap to the future where new information technologies significantly increase the pace of scientific discovery and innovation.
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Findwise
With data volumes growing by 200 percent a year, knowledge workers are spending around 30 percent of their time trying to extract useful information. Furthermore a recent U.S. study asserted that knowledge workers spend more than twice as much time re-creating already created content as they spend creating new content. In addition to this time spent on maintaining structures for storing incoming unstructured information (e.g. mail, documents etc) is increasing rapidly.
Enabling search solutions makes information easy to find, however the key is to transform this information into knowledge. This is normally not done by simple intranet search functionality, however the intranet portal can act as a portal to a knowledge management system based on advanced search functionality withadded collaborative functions. This transforms your organization into a “knowledge finding organization”, creating an even more competitive organization.
Knowledge Management systems based on an Enterprise Search Platform (ESP) can, if implemented properly, significantly improve the efficiency of an organization. IDC Research suggests in their latest report (April 2006) “Hidden cost of information Work” that the cost for wasted time on the part of professional searching, but not finding relevant information, amounts to $5.3 million annually for an enterprise with 1000 knowledge workers.
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint SummitJoel Oleson
The Four Pillars of Search really help you focus your search planning. In this session we dig into the context, content, metadata and UX or user experience that really matter. We also dig into a variety of publicly accessible SharePoint 2013 real world search pages to demonstrate the value.
Finding, or not finding, information is consistently the most called out issue in the enterprise. Technology companies spend millions developing features that remain idle because, while everyone is concerned about optimizing enterprise search, no one is doing anything about it. The PM cuts the budget because "the devs will do it." The IA/UX architects do not have the specific expertise. The developers want to do it but do not have appropriate guidance.
This is a call-to-action for developers and ITpros to make sure that they get what they need to make search in the enterprise work. Because, after the interactive marketing agency has left the building, they are the ones that will be hearing "search sucks" directed at them.
Organizations are beginning to recognize that search is not a stand-alone technology or application, but must be integrated with business processes and corporate objectives as a key infrastructure component.
Why? Providing enriched metadata to the search engine index significantly improves search applications, eDiscovery, FOIA requests, and collaboration.
In this webinar COMPU-DATA International and Concept Searching will demonstrate their combined offering that uses unique, language independent technology and integrated enterprise metadata repository management, to deliver intelligent metadata enabled search.
What you will learn about during this session:
• How our innovative technology delivers both high precision and high recall, using industry unique compound term processing
• How to accomplish federated search as content is created or ingested
• How to enable true concept based searching
• How to eliminate end user tagging
• How to integrate the combined solution with any search engine including SharePoint, the former FAST products, Google Search Appliance, IBM Vivisimo, and Solr
• How the combined solution can be extended to address records identification, protection of privacy information, migration, and text analytics with the same technology
• Benefit from industry-specific use cases:
• Developing a powerful search solution for the US Army, creating easy access to millions of records, with an integrated solution to consolidate many data sources, accessing high volumes of data
• Solving search, migration, records management, and data privacy challenges to manage the intranet for a global company which designs, manufactures, and distributes appliances to more than 70 countries
Aiim Webinar Helen Mitchell Unified Search Final 7 21 2010Helen Mitchell
About the available tools & techniques for leveraging information from cloud-borne, social & internal applications to aggregate ideas, impact thinking, & drive business decisions.
AMCTO presentation on moving from records managment to information managementChristopher Wynder
This presentation was given to AMCTO zones 1 and 4/5. It presents how to use the records classification as the core for a faceted classification schema that can be used to enable workflow and processes across the organization.
An overview of Digital Science - a new company started out of Macmillan Publishers dedicated to making research more efficient through better use of technology.
Introduction to Enterprise Search. A two hour class to introduce Enterprise Search. It covers:
The problems enterprise search can solve
History of (web) search
How we search and find?
Current state of Enterprise Search + stats
Technical concept
Information quality
Feedback cycle
Five dimensions of Findability
Similar to Federated Search Webinar for SLA (Special Libraries Assoc.) (20)
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
UiPath Test Automation using UiPath Test Suite series, part 6DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI.
UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and Open AI.
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
GridMate - End to end testing is a critical piece to ensure quality and avoid...ThomasParaiso2
End to end testing is a critical piece to ensure quality and avoid regressions. In this session, we share our journey building an E2E testing pipeline for GridMate components (LWC and Aura) using Cypress, JSForce, FakerJS…
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Federated Search Webinar for SLA (Special Libraries Assoc.)
1. Federated Search in a Disparate
Environment
PREPARED FOR:
SLA Webinar Series
Evidence-Based Practice in Libraries
2040 Corbett Rd
Monkton, Md 21111
(410.472.4631
Helen L. Mitchell Curtis
* hmitchell5@gmail.com
Principal, Enterprising Solutions
September 9, 2009
2. Enterprising Solutions
Biography
Helen L. Mitchell Curtis – Principal, Enterprising
Solutions
32+ years at FDA leading one of the largest
enterprise search implementations among Civilian
Federal Agencies
Develop enterprise-wide search strategies &
solutions
Integrate search technologies across IT
applications and disparate document repositories
Build governance, management and end user
buy-in
Promote collaboration, standards, findability and
improved organization of data and document
assets
Passion – to help clients to reduce costs, improve
quality and efficiency, reduce 'pain points' and
achieve a positive search experience
2
3. Enterprising Solutions
Polling Question
• What is Your Role? (select all that apply, if group participants)
• CIO, Executive Director
• Library Director (Corporate, Gov’t, Academia, Solo)
• Librarian/Information Management Professional
• IT Professional or Consultant
• Project/Product Manager
• Sales/Marketing/Communications
• End User (i.e., Scientist, Researcher, Engineering Professional)
• Federated Search Vendor
• Other
3
4. Enterprising Solutions
Agenda
1. Terms Clarified
2. Types of Federated Search (FS)
3. FS Challenges & Benefits
4. FDA Case Study
5. FS Evaluation Criteria
6. Examples of FS Solutions
7. Live Federated Search Demo
8. Best Practices
9. Future Vision
10. Questions & Answers
4
5. 1. Definition by AIIM Market IQ
2. Definition by CMS Watch
Enterprising Solutions
Clarify Terms
3. A Federated Search Primer – Part II
4. Deep Web Technologies
5. Federated Search Rpt & Toolkit-Jill Hurst-Wahl
• Reliable and complete retrieval of content based on user need,
i.e. everything relevant is recalled (recall) while simultaneously
Findability returning only that content relevant to the user’s focus
(precision), thus eliminating the review of irrelevant content by the
1
user.
• Systems…within an organization…seeking information held
Enterprise internally…in a variety of formats and locations, including
Search databases, document management systems, and other
2
repositories. Content is pre-indexed, simultaneously searched,
(ES) and displayed to authorized users.
• The process of performing a simultaneous real-time search of
Federated multiple diverse and distributed sources from a single search
3
page, with the federated search engine acting as intermediary.
Search (FS)
• The set of web-sites and their documents that cannot be accessed
via crawler-type search engines such as Google. Deep web content
Deep Web typically lives inside of databases, and is accessed through search
4
forms. It is also referred to as the Hidden or Invisible Web.
• SW written to access a content source that must know the URL of
Connector the source, how to send search commands, its search syntax, &
5
how to process the search results returned from a source.
5
6. Enterprising Solutions
Polling Question
Information Accessibility (select all that apply)
1. I can easily find information to do my job
2. Less than 50% of our organization’s info is searchable online
3. More than 50% of our organization's info is searchable online
4. I reference less than 5 systems (info sources) in any given
week
5. I reference 5 or more systems (info sources) in any given
week
6
7. Enterprising Solutions
Findability Issues
AIIM Market IQ Research on Findability (of 528 end users):
50% believe Findability in their organization is ―Worse to Much Worse‖
than their consumer-facing web sites
49% have no formal goal for Enterprise Findability within their
organizations
49% ―Agreed or Strongly Agreed‖ that finding the information to do their
job is difficult and time consuming
69% believe less than 50% of their organization's information is
searchable online
36% reference five or more systems in any given week
7
Source: AIIM Market Intelligence, 2008
8. Enterprising Solutions
Why Use Federated Search
To increase findability to better accomplish business objectives.
To issue a single query across multiple content sources through a common
search interface.
When not feasible to re-index all of the content available from large public
sites like PubMed.
To increase user awareness of all content sources such as deep web for
scientific, technical and business content.
To eliminate using multiple database search protocols & passwords.
When don‘t have the rights to index the content (e.g. subscription sites).
Real-time search: for content constantly being updated & impractical to
8 keep the data as timely as it needs to be.
9. Federated Search Sources
Enterprising Solutions
(examples)
Reason Corporate Academic Gov’t Public
Library
Subscription Databases X X X X
Internal or External Repositories X X
Library Catalog(s) X X X X
News X X
Digitized Material X X X
Blogs & Wikis X X X
Intranet/Internet Sites X X
Industry Specific Sources X
DB‘s available to customers X X
Historical Collections X
9
12. Enterprising Solutions
Federated „Master Index‟ Search
Index multiple data sources content into a single master index
Queries & results come from that one master index
Many Enterprise Search products integrate FS via ‗connectors‘ to
accomplish this (ex., FAST, Autonomy, Endeca)
12 Source: New Idea Engineering, Inc.
13. Enterprising Solutions
Federated „Data Silos‟ Search
‗Search Federator‘ processes queries for each data source silo
Transforms search terms to match each content source requirements
Submits query to each of the sources simultaneously
Merges each source‘s results together - single look & feel
Maintains no indices of its own, relies on linked systems capabilities
13
Source: New Idea Engineering, Inc.
14. Enterprising Solutions
Surface vs. Deep Web Search
Popular search engines (Google, Yahoo…) ―crawl‖ surface web
FS can drill down to the deep web where specialized content (i.e.,
scientific and technical databases) reside
Deep Web FS Examples:
www.completeplanet.com -
70,000+ searchable DBs & specialty
search engines
www.science.gov- federates U.S.
federal agency science info
http://imlsdcc.grainger.uiuc.edu/ -
Institute of Museum & Library
Services (IMLS) - Digital Collections
& Content w/descriptions of digital
resources developed by IMLS
grantees
14
Source: Juanico-Environmental Consultants, Ltd.
15. Enterprising Solutions
Vertical Search Engine
Closely related to Deep Web – searches for a particular niche i.e.,
a specific industry, topic, type of content (e.g., scientific research,
travel, movies, images, blogs)
Example: www.vetseek.info - is a search engine focusing on veterinary science and
related topics
15
16. Enterprising Solutions
Polling Question
Federated Search Solutions (select one)
1. We are currently conducting an evaluation to procure a
Federated Search Product
2. We currently have a Federated Search Solution installed that
satisfies our requirements
3. We have a Federated Search Solution by are considering
replacing it or enhancing its capabilities & features
16
17. Enterprising Solutions
Challenges
Authentication
Showing each record‘s branding and copyright information
Licensed or subscription databases
True De-duplication
Virtually impossible because DBs return 10-20 results at a
time
Vendors usually just de-dupe the first results set returned
Security
Mapping user credentials and access rights to each
repository security model
Speed
Limited by slowest search engine‘s performance
17
18. Enterprising Solutions
Challenges (continued)
Lack of data standardization
Each source has a unique access method & needs
translation
Metadata mapping between FSS and underlying systems
Access methods to sources may change
Requires an interface rewrite or modification
Rules for error handling
Ex. Query term not available—exclude the query, the
repository, or proceed without the term?
Ex. Timeouts or connection problem
Complex searches usually not available
Fielded searches
Known Items, i.e. Article Name
Best to directly search database
18
19. Enterprising Solutions
Challenges (continued)
Relevancy scores
Can‘t identify a single relevancy ranking model
Relevancy rankings for repository‘s results refers to its own
May be not be useful when comparing the results with
those from another system
Access to content stored in a variety of
places
Results page may not let user obtain identified documents
This may involve a built-in viewer or invoking the owning
product‘s interface.
Combining navigators from each result set
i.e., faceted search, taxonomies and auto-generate
clusters
Selecting the right FS engine
Depends on business goals, type of content sources –
structured vs. unstructured, licensed/subscriptions
19
20. Enterprising Solutions
Benefits
• Single master index
• Quicker response times
• No need to access original data sources
• Relevancy algorithms applied uniformly
• Dynamic navigators are available for all documents
• Time savings
• Searches many sources at one time
• Combines results into a single results page
• Quality of results
• Client selects the sources to search
• Minimum impact on the data silos
• Only accessed when a user performs a query
• Eliminates increased load crawling/indexing the data source
20
21. Enterprising Solutions
Benefits (continued)
• Improve productivity
• Reduces number of searches executed to find relevant results
• Save, reuse, schedule, and share effective search queries
• Leverage security controls at queried source
• Access repositories secured against crawls but can be accessed
by search queries
• Reduce costs
• No additional capacity requirements for content index since its
not crawled by search server
• Most current content
• Real time searches - as soon as the source is updated, the info is
available to the searcher on the very next query
• Increase awareness
• Identify most relevant sources to search based on # of results
each source produced
21
22. Enterprising Solutions
FDA Case Study Success
(Federated „Master Index‟ Search System)
ACTIONS RESULT
Started small with high ‘pain Increased productivity & popularity.
points’.
Modified business processes. Standardized nomenclature improved
efficiencies.
Users across organization Produced more timely & QUALITY
could find content in silos. work products.
Indexed structured & Grew from 1 repository of 500 docs
unstructured content with to 50 with 30 million docs. Accessed
document level security. on ‘need to know’ basis.
Introduced standardized Reduced development time & costs.
search web services into Increased mgmt & user acceptance.
applications. Integrated in more applications.
Increased user awareness Used more & content added. Search
with training, newsletters & requirements now captured at
meetings. BEGINNING of project development.
22
23. Enterprising Solutions
Evaluation Criteria Overview
Identify Goals
Create an Effective Search
Strategy
Collect Business Requirements
Conduct needs assessment
Work Closely with User
Community
23
24. Evaluation Criteria Overview
Enterprising Solutions
(continued)
Define Features and Functions
Eliminate emotional decisions re: product,
company or others using the product
High Precision
Return content relevant to user‘s focus
High Recall
Recall everything relevant to user‘s need
Thoroughly Research
Products, Users & Product
Reviewers
24
30. Digital Library FSS Example
Enterprising Solutions
http://www.calisphere.universityofcalifornia.edu/
Features of Interest
30
31. Digital Library FSS Example
Enterprising Solutions
http://www.calisphere.universityofcalifornia.edu
1 2
3
31
32. Enterprising Solutions
FSS Example
(LibraryFind® developed by Oregon State Univ Libraries)
Features of Interest
32
33. Enterprising Solutions
Semantic Federated Search
(prototype by Collexis & Deep Web Technologies)
SOURCES:
•PubMed
•NCI=Nat‘l Cancer Inst
DeepWeb Technologies (a federated search provider) and •DTIC=Defense Tech. Info Ctr
•PMC=PubMed Central
Collexis (a developer of semantic search & knowledge •ScrDOEIB=DOE Info Bridge
discovery solutions) teamed up to deliver the world’s first •Eurekalert=Science News
semantic federated search. THESAURI Used:
•MeSH
•DTIC=Defense Tech. Info Ctr
•How does semantic federated search work?
•All results from your initial query are processed
through one or more thesauri. (i.e., MeSH & DTIC.)
•The system then returns terms that are found both in
the top results and in the thesauri.
33
34. Enterprising Solutions
Collexis & Deep Web Technologies
(Search Results – screenshot 1)
Unlike clustering, which
simply lumps together
words that are
frequently found near
each other, these terms
are being suggested
from an expert-
developed thesaurus
(taxonomy) in which 2429 hits
terms are meaningfully
& consistently
organized.
The longer the
Semantic terms. blue bar, the
more semantic
evidence found
for that term.
34
35. Enterprising Solutions
Collexis & Deep Web Technologies
(Search Results – screenshot 2)
•Clicking on term
“Mental Recall” from
prior screen added
term to search, reduced
relevant hits to 3; &
terms suggested are
organized.
•Thesaurus-based search will
consistently suggest terms in
the same organized way.
•Clustering changes the way it
organizes suggestions with
every query.
• Clustering tends to be useful
for very broad, general or
unpredictable content.
*Thesaurus-based semantic search tends to be better
when you are working consistently in knowledge
domains, such as medicine, physics or electronics.
35
36. Enterprising Solutions
Best Practices
Strategically plan how to deliver your
mission and just DO IT!
Do proof of concept – demos can be
deceiving
Establish common set of standards &
governance model
Measure results by establishing key
performance indicators
Leverage lessons learned to reduce
project cycles, increase trust and
empower communities
36
37. Enterprising Solutions
Future Vision
Personalized Search
• A simple, persistent box on a users‘ browser, cell, or entertainment screen
that initiates a search based on what the user was doing, their previous
keystrokes, & perhaps using historical data.
Better Quality of Search Results
• Number of results retrieved, Relevance Ranking, De-Duplication
Enterprise Mashups
• Combine real-time searching with social networking tools, maps, etc.
Users build the index by their searching
• Know Web pages people display, what‘s on them & what apps are
showing up on users' computers
37
38. Enterprising Solutions
Future Vision (continued)
Query analysis & predictive modeling on the fly
• Business users expect to access info behind company firewalls &
from the larger web world using the same tools and consistency
Improved Navigators, Facets, Clustering
• Filter result sets dynamically for more relevant results
Web of Interconnected Data
• Automate analysis of database structures and cross-reference
results. Ex.- Health site cross-references data from pharmaceutical
companies with the latest findings from medical researchers
Visualization Technologies
38
• Enable extreme-scale knowledge discovery
39. Enterprising Solutions
Resources
1. Great resource for many Federated Search topics:
www.federatedsearchblog.com – Author: Sol Lederman
2. Open Source & commercial search components & tools list:
http://tinyurl.com/l3w8of
3. Federated Search Vendors: http://tinyurl.com/92s8qv
4. Deep Web Databases: http://tinyurl.com/yam3sw
5. Deep Web resources: http://www.internettutorials.net/deepweb.asp
6. Digital Image Resources on the Deep Web: http://tinyurl.com/46vcqp
7. Info on Vertical Search Engines: http://tinyurl.com/lpcufw
8. 50 Niche Search Engines: http://tinyurl.com/lukxwx
9. Library of Congress FS Portal Products/Vendors list:
http://tinyurl.com/l6mdy8
10. Resources to Research & Mine the Deep Web: http://tinyurl.com/6g5768
39