Allex Lyons, a programmer at Access Innovations, Inc., explains the company's decision to replace the random access file search in XIS with a faster, more reliable, and more efficient Lucene index for searching Docsets.
2. What is XIS?
XIS is an XML schema-based database system used to store user data
All records are stored in individual XML files
Option to zip XML files available with XIS Project DTD
3. How XIS Data Is Stored
Docsets
Store records with multiple fields (similar to a SQL table)
Can also have subfields and lists of field values nested within a record
Can look up values from other fields in other Docsets or other Tables
Tables
Store a single list of values
Can be referenced by other Docsets
Can be directly accessible for editing or kept hidden from user view
4. How to Create a XIS Project
Create DTD file for XIS project
Specify MAI Thesaurus to link to project
Create Docsets and Tables
Specify ID lengths for each Docset
Create fields for Docsets
Save DTD to dhserver/projects/projects/xml folder
Create XIS Project folder under dhserver/data
Create subfolders for each Docset under the XIS Project folder, as well as a Tables directory
XIS Projects can only be created by administrators
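The folder-related steps above can be sketched in Python as follows. This is only an illustration of the layout the slide describes, not a Data Harmony tool: the function name, the project name `DemoProject`, the Docset names, and the exact spelling of the `Tables` folder are assumptions, and the sketch works in a temporary directory rather than a real server install.

```python
from pathlib import Path
import tempfile


def create_xis_layout(dhserver: Path, project: str, docsets: list[str]) -> list[Path]:
    """Create the folders named on the slide and return them in creation order.

    Hypothetical helper: folder names beyond those on the slide are assumptions.
    """
    dtd_dir = dhserver / "projects" / "projects" / "xml"  # where the project DTD is saved
    project_dir = dhserver / "data" / project              # XIS Project folder
    folders = [dtd_dir, project_dir]
    folders += [project_dir / name for name in docsets]    # one subfolder per Docset
    folders.append(project_dir / "Tables")                 # assumed name of the Tables directory
    for folder in folders:
        folder.mkdir(parents=True, exist_ok=True)
    return folders


# Demo in a throwaway directory so nothing on a real server is touched.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp) / "dhserver"
    for folder in create_xis_layout(root, "DemoProject", ["Articles", "Authors"]):
        print(folder.relative_to(tmp).as_posix())
```

Writing the DTD itself, linking the MAI Thesaurus, and setting ID lengths are not covered here, since the slides do not show their format.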
5. Starting a XIS Project
Start Data Harmony server where project is located
Log in to Admin module
Start MAI Thesaurus
Start XIS Project
Index XIS Project, especially if just created
Run startXis program
Enter server, port, thesaurus, username, and password to log in
11. XIS Record Format
Saved in an XML file
Starts with a tag that represents the Docset name, with the ID as an attribute
Fields are listed within Docset tag along with values.
Subfields are nested within their parent fields
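A record of the shape described above might look like the output of the following Python sketch. The Docset name `Article`, the attribute name `ID`, and all field and subfield names are hypothetical; the slides only say that the root tag carries the Docset name with the ID as an attribute and that subfields nest inside their parent fields.

```python
import xml.etree.ElementTree as ET


def add_fields(parent: ET.Element, fields: dict) -> None:
    """Append each field as a child element; a dict value becomes nested subfields."""
    for name, value in fields.items():
        child = ET.SubElement(parent, name)
        if isinstance(value, dict):
            add_fields(child, value)
        else:
            child.text = str(value)


def build_record(docset: str, record_id: str, fields: dict) -> ET.Element:
    """Build one record: root tag = Docset name, ID stored as an attribute (assumed name 'ID')."""
    root = ET.Element(docset, {"ID": record_id})
    add_fields(root, fields)
    return root


record = build_record(
    "Article",
    "000042",
    {"Title": "Sample title", "Author": {"First": "Pat", "Last": "Example"}},
)
print(ET.tostring(record, encoding="unicode"))
# → <Article ID="000042"><Title>Sample title</Title><Author><First>Pat</First><Last>Example</Last></Author></Article>
```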
14. Current XIS Indexing and Search
Uses text-based indexes
Creates a large number of index files (one for each field)
Generates temporary files for results
Relies on a less reliable RandomAccessFile search
Has a limited number of search operators
Does not take into account numerical values
15. Lucene vs. Current XIS Index
Fewer index files needed
Allows for broader searches
Fuzzy matching
Start and end wildcard searches
Recognizes numerical and date fields as such
Can be utilized to remove stopwords
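Why it matters that numeric fields are recognized as such can be shown with a small, generic Python illustration. This is not XIS or Lucene code; it only contrasts comparing values as text with comparing them as numbers, using made-up values.

```python
values = ["2009", "950", "1987", "12"]

# Compared as text, values are ordered character by character.
print(sorted(values))           # → ['12', '1987', '2009', '950']

# Compared as numbers, the order is the expected one.
print(sorted(values, key=int))  # → ['12', '950', '1987', '2009']


def in_range_as_text(value: str, low: str, high: str) -> bool:
    """Range test on raw strings, as a purely text-based index would see them."""
    return low <= value <= high


def in_range_as_number(value: str, low: str, high: str) -> bool:
    """Range test after converting the same strings to integers."""
    return int(low) <= int(value) <= int(high)


# "150" falls inside "1000".."2000" as text, but not as a number.
print(in_range_as_text("150", "1000", "2000"))    # → True
print(in_range_as_number("150", "1000", "2000"))  # → False
```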
16. New Lucene Search Process
Establish index reader to perform search
Submit query string containing fields and parameters
Return results
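The second step, composing a query string from fields and parameters, could look roughly like the Python sketch below. The slides do not say which query parser or syntax XIS uses, so this assumes the syntax of Lucene's classic query parser (`field:term`, quoted phrases, `AND`/`OR`, a trailing `~` for fuzzy and `*` for wildcard). The helper names and the field names `Title` and `Year` are hypothetical.

```python
# Characters with special meaning in Lucene's classic query syntax.
LUCENE_SPECIAL = set('\\+-!():^[]"{}~*?|&/')


def escape_term(text: str) -> str:
    """Backslash-escape special characters so user input is treated literally."""
    return "".join("\\" + ch if ch in LUCENE_SPECIAL else ch for ch in text)


def field_clause(field: str, value: str, suffix: str = "") -> str:
    """Build one field:value clause; suffix may be '~' (fuzzy) or '*' (wildcard)."""
    escaped = escape_term(value)
    if " " in value:
        return f'{field}:"{escaped}"'  # multi-word values become a phrase
    return f"{field}:{escaped}{suffix}"


def build_query(criteria: dict, operator: str = "AND") -> str:
    """Join one clause per field with the given boolean operator."""
    return f" {operator} ".join(field_clause(f, v) for f, v in criteria.items())


print(build_query({"Title": "taxonomy", "Year": "2013"}))  # → Title:taxonomy AND Year:2013
print(field_clause("Title", "peer review"))                # → Title:"peer review"
print(field_clause("Title", "taxonmy", suffix="~"))        # → Title:taxonmy~
print(field_clause("Title", "tax", suffix="*"))            # → Title:tax*
```

The resulting string would then be handed to whatever search call the server exposes; opening the index reader and returning results happen on the Lucene side and are not shown here.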
17. Other Lucene Functions
Will be used for adding, updating, and deleting XIS records
Indexes will be housed on Data Harmony server