SlideShare a Scribd company logo
MAIstro and the State of Iowa
 Legislative Services Agency

  The journey from paper and “back-of-
   the-book” indexing to an electronic
 indexing system and use of a controlled
               vocabulary
Talking points:
• Life before MAIstro and use of a controlled
  vocabulary
• Integration of MAIstro and new XML database
• Learning a new way to index
• Life after MAIstro
• Print publication
• Internet search and retrieval
• Moving forward and looking ahead…
Life before MAIstro
              • Wordy index entries with lots
                of detail and description

              • Lengthy index: A 300-page
                index was commonplace for
                the Iowa Acts book. 2010
                Iowa Acts consisted of 1200
                pages, with over 300 pages
                being the index.

              • Indexing work for our Acts
                publication consisted of
                seven staffers and seven
                months of work.

              • Because of the amount of
                work involved, indexing
                production would delay
                publication of the Acts book.
Life before MAIstro


                  • Paper printout, editing,
                    and rewrites. Rewrites
                    take time…

                  • Example to the right is a
                    fairly clean edit. Often edit
                    marks would fill page,
                    especially for new indexers

                  • 3 years was typical
                    learning curve for new
                    indexers
Life before MAIstro



            • Example of recent Iowa Code
              Index. Yellow highlight indicates
              “see” and “see also” references.

            • Common user complaints of
              confusing “directions” and going
              in circles.

            • Such tactics were used to
              condense index, already around
              900 pages.
Life before MAIstro

… And we had these too. Up until around 2010, the index for the over 20,000-page Administrative Code was
still maintained using index cards. Updates were marked in pencil.
Integration of MAIstro and new XML database


• Access Innovations collaborated with the Iowa Legislative
  Services Agency to build a customized thesaurus.

• The six-month project utilized Access Innovations’ Data
  Harmony software suite. The project team created the
  thesaurus using MAISTRO, a software tool which includes
  both Thesaurus Master (thesaurus and taxonomy
  management) and Machine Aided Indexer (M.A.I.).

• Thesaurus and controlled vocabulary were integrated as
  part of indexing interface of new XML-based system.
A new way of indexing




Completely electronic system. Paperless. Indexing terms are “tagged” to XML database
content. Tags can be assigned to all types of content. Not restricted by type of
document or publication.
A new way of indexing




Machine-aided indexing interface. Users can choose top terms or select
other terms from thesaurus. MAIstro runs behind the background.
Life after MAIstro
•   Previously Iowa Acts indexing was completed in about 7 months with 7 staffers.
    For the 2011 Iowa Acts, indexing was completed in about 4 weeks with 2 staffers.

•   Biggest surprise to me: How much time the use of a controlled vocabulary saved. I
    did not realize how much time we had been spending writing, rewriting, and
    editing index entries.

•   Less learning curve for new indexers. In the new system, the concept applies or it
    doesn’t.

•   The entire Administrative Code (now in XML database) has indexing terms
    assigned to its content electronically… No indexing cards to be found.

•   The possibility now exists to index legislative documents that have not been
    indexed before.

•   Indexing for historical documents, current documents, and future documents can
    evolve and change.
Print publications

                • Example of 2011 Acts,
                  chapter 3 content. This
                  content is generated
                  without “see refs”
                  (nonpreferred terms)

                • Interface allows us
                  ability to generate
                  with or without
                  nonpreferred terms
Print publications


           • This is an example of chapter 3
             content with “see refs”.

           • For a simple output and design,
             we have restricted our print
             output to utilize only preferred
             and nonpreferred terms.

           • We have debated about
             accounting for broader, narrower,
             related terms in print output, but
             for now we prefer the simpler,
             streamlined approach.
Print publications

           • For 2011 Acts, index size was
             around 60 pages, compared to the
             300 some pages of the 2010 Acts
             index.

           • Same concepts indexed, but with
             less level of detail and description.

           • This same print output and design
             was also used for our 2011 Code
             Supplement. Terms used are the
             same, but output can be stylized
             to fit publication.
Internet search




The next phase of development: Use of indexing tags to help users find and retrieve
documents. Combination of document types, indexing tags, keyword, and metadata for
search criteria. This is currently exposed only in test environment.
Internet search




This is an example of keyword “doves” coupled with “Iowa Acts” document type. Note
chapter 3 indexing tags “Birds, Hunting, Game animals, Doves” cited as Related
Topic(s).
Moving forward…
• One of the fears that my staff had at the start of
  this journey was that the “machines” might
  replace the “people”. In fact, quite the opposite
  has occurred. Because of the ability to index
  documents so quickly and efficiently with
  machine-aided tools, we are being asked to do
  more as an indexing staff.

• There is still a lot to learn, and I believe we have
  only really scratched the surface regarding the
  true potential of this technology.
Contact information:
        Roger Karns
Legislative Services Agency
 rkarns@legis.state.ia.us
       515-242-6459

More Related Content

Similar to MAIstro and the State of Iowa Legislative Services Agency

10 mistakes when moving to topic-based authoring
10 mistakes when moving to topic-based authoring10 mistakes when moving to topic-based authoring
10 mistakes when moving to topic-based authoring
Sharon Burton
 
Guidelines for indexing and tools
Guidelines for indexing and toolsGuidelines for indexing and tools
Guidelines for indexing and tools
NagaVarthini
 
European SharePoint Conference Automated Tagging and Metadata Management w...
European SharePoint Conference   Automated Tagging and Metadata  Management w...European SharePoint Conference   Automated Tagging and Metadata  Management w...
European SharePoint Conference Automated Tagging and Metadata Management w...
B-S-S Business Software Solutions GmbH
 
Arabic Text mining Classification
Arabic Text mining Classification Arabic Text mining Classification
Arabic Text mining Classification
Zakaria Zubi
 
Inverted files for text search engines
Inverted files for text search enginesInverted files for text search engines
Inverted files for text search engines
unyil96
 
SharePoint 2010 for Document Compliance
SharePoint 2010 for Document ComplianceSharePoint 2010 for Document Compliance
SharePoint 2010 for Document Compliance
ntenany
 

Similar to MAIstro and the State of Iowa Legislative Services Agency (20)

Cataloging roundtable discussion questions
Cataloging roundtable discussion questionsCataloging roundtable discussion questions
Cataloging roundtable discussion questions
 
Web of science,Scopus,bibtex,latex
Web of science,Scopus,bibtex,latexWeb of science,Scopus,bibtex,latex
Web of science,Scopus,bibtex,latex
 
10 mistakes when moving to topic-based authoring
10 mistakes when moving to topic-based authoring10 mistakes when moving to topic-based authoring
10 mistakes when moving to topic-based authoring
 
Guidelines for indexing and tools
Guidelines for indexing and toolsGuidelines for indexing and tools
Guidelines for indexing and tools
 
IRS-Cataloging and Indexing-2.1.pptx
IRS-Cataloging and Indexing-2.1.pptxIRS-Cataloging and Indexing-2.1.pptx
IRS-Cataloging and Indexing-2.1.pptx
 
European SharePoint Conference Automated Tagging and Metadata Management w...
European SharePoint Conference   Automated Tagging and Metadata  Management w...European SharePoint Conference   Automated Tagging and Metadata  Management w...
European SharePoint Conference Automated Tagging and Metadata Management w...
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
 
Enterprise Search Share Point2009 Best Practices Final
Enterprise Search Share Point2009 Best Practices FinalEnterprise Search Share Point2009 Best Practices Final
Enterprise Search Share Point2009 Best Practices Final
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
Arabic Text mining Classification
Arabic Text mining Classification Arabic Text mining Classification
Arabic Text mining Classification
 
ISSN, DOI ,IMPACT FACTOR ,CITATIONS.pptx
ISSN, DOI ,IMPACT FACTOR ,CITATIONS.pptxISSN, DOI ,IMPACT FACTOR ,CITATIONS.pptx
ISSN, DOI ,IMPACT FACTOR ,CITATIONS.pptx
 
Citation tools in research
Citation tools in researchCitation tools in research
Citation tools in research
 
Tabloid
TabloidTabloid
Tabloid
 
Inverted files for text search engines
Inverted files for text search enginesInverted files for text search engines
Inverted files for text search engines
 
Word processing and ms excel
Word processing and ms excelWord processing and ms excel
Word processing and ms excel
 
Reports and DITA Metrics IXIASOFT User Conference 2016
Reports and DITA Metrics IXIASOFT User Conference 2016Reports and DITA Metrics IXIASOFT User Conference 2016
Reports and DITA Metrics IXIASOFT User Conference 2016
 
Searching of Web and Electronic Resources
Searching of Web and Electronic Resources Searching of Web and Electronic Resources
Searching of Web and Electronic Resources
 
SharePoint 2010 for Document Compliance
SharePoint 2010 for Document ComplianceSharePoint 2010 for Document Compliance
SharePoint 2010 for Document Compliance
 
How to Apply Your Taxonomy to Your Content Automatically
How to Apply Your Taxonomy to Your Content AutomaticallyHow to Apply Your Taxonomy to Your Content Automatically
How to Apply Your Taxonomy to Your Content Automatically
 
Presentacion tics (1)
Presentacion tics (1)Presentacion tics (1)
Presentacion tics (1)
 

More from Access Innovations, Inc.

More from Access Innovations, Inc. (20)

Eureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationEureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 Presentation
 
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
 
Smart submit
Smart submitSmart submit
Smart submit
 
Plos taxonomy beyond search dhug 2021
Plos taxonomy beyond search   dhug 2021Plos taxonomy beyond search   dhug 2021
Plos taxonomy beyond search dhug 2021
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)
 
Data harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacingData harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacing
 
Data harmony update 2021
Data harmony update 2021 Data harmony update 2021
Data harmony update 2021
 
Atypon dhug2021
Atypon dhug2021Atypon dhug2021
Atypon dhug2021
 
Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021
 
Asce more than just topic taxonomies
Asce more than just topic taxonomiesAsce more than just topic taxonomies
Asce more than just topic taxonomies
 
Acs discoverability-dhug2021
Acs discoverability-dhug2021Acs discoverability-dhug2021
Acs discoverability-dhug2021
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut It
 
Health Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut ItHealth Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut It
 
Why Keywords Don't Cut It
Why Keywords Don't Cut ItWhy Keywords Don't Cut It
Why Keywords Don't Cut It
 
Data Harmony update 2020 final
Data Harmony update 2020 finalData Harmony update 2020 final
Data Harmony update 2020 final
 
Data Harmony Update 2020 final
Data Harmony Update 2020 finalData Harmony Update 2020 final
Data Harmony Update 2020 final
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository Interoperability
 
DHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCRDHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCR
 

Recently uploaded

Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 

Recently uploaded (20)

To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
The architecture of Generative AI for enterprises.pdf
The architecture of Generative AI for enterprises.pdfThe architecture of Generative AI for enterprises.pdf
The architecture of Generative AI for enterprises.pdf
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 

MAIstro and the State of Iowa Legislative Services Agency

  • 1. MAIstro and the State of Iowa Legislative Services Agency The journey from paper and “back-of- the-book” indexing to an electronic indexing system and use of a controlled vocabulary
  • 2. Talking points: • Life before MAIstro and use of a controlled vocabulary • Integration of MAIstro and new XML database • Learning a new way to index • Life after MAIstro • Print publication • Internet search and retrieval • Moving forward and looking ahead…
  • 3. Life before MAIstro • Wordy index entries with lots of detail and description • Lengthy index: A 300-page index was commonplace for the Iowa Acts book. 2010 Iowa Acts consisted of 1200 pages, with over 300 pages being the index. • Indexing work for our Acts publication consisted of seven staffers and seven months of work. • Because of the amount of work involved, indexing production would delay publication of the Acts book.
  • 4. Life before MAIstro • Paper printout, editing, and rewrites. Rewrites take time… • Example to the right is a fairly clean edit. Often edit marks would fill page, especially for new indexers • 3 years was typical learning curve for new indexers
  • 5. Life before MAIstro • Example of recent Iowa Code Index. Yellow highlight indicates “see” and “see also” references. • Common user complaints of confusing “directions” and going in circles. • Such tactics were used to condense index, already around 900 pages.
  • 6. Life before MAIstro … And we had these too. Up until around 2010, the index for the over 20,000-page Administrative Code was still maintained using index cards. Updates were marked in pencil.
  • 7. Integration of MAIstro and new XML database • Access Innovations collaborated with the Iowa Legislative Services Agency to build a customized thesaurus. • The six-month project utilized Access Innovations’ Data Harmony software suite. The project team created the thesaurus using MAISTRO, a software tool which includes both Thesaurus Master (thesaurus and taxonomy management) and Machine Aided Indexer (M.A.I.). • Thesaurus and controlled vocabulary were integrated as part of indexing interface of new XML-based system.
  • 8. A new way of indexing Completely electronic system. Paperless. Indexing terms are “tagged” to XML database content. Tags can be assigned to all types of content. Not restricted by type of document or publication.
  • 9. A new way of indexing Machine-aided indexing interface. Users can choose top terms or select other terms from thesaurus. MAIstro runs behind the background.
  • 10. Life after MAIstro • Previously Iowa Acts indexing was completed in about 7 months with 7 staffers. For the 2011 Iowa Acts, indexing was completed in about 4 weeks with 2 staffers. • Biggest surprise to me: How much time the use of a controlled vocabulary saved. I did not realize how much time we had been spending writing, rewriting, and editing index entries. • Less learning curve for new indexers. In the new system, the concept applies or it doesn’t. • The entire Administrative Code (now in XML database) has indexing terms assigned to its content electronically… No indexing cards to be found. • The possibility now exists to index legislative documents that have not been indexed before. • Indexing for historical documents, current documents, and future documents can evolve and change.
  • 11. Print publications • Example of 2011 Acts, chapter 3 content. This content is generated without “see refs” (nonpreferred terms) • Interface allows us ability to generate with or without nonpreferred terms
  • 12. Print publications • This is an example of chapter 3 content with “see refs”. • For a simple output and design, we have restricted our print output to utilize only preferred and nonpreferred terms. • We have debated about accounting for broader, narrower, related terms in print output, but for now we prefer the simpler, streamlined approach.
  • 13. Print publications • For 2011 Acts, index size was around 60 pages, compared to the 300 some pages of the 2010 Acts index. • Same concepts indexed, but with less level of detail and description. • This same print output and design was also used for our 2011 Code Supplement. Terms used are the same, but output can be stylized to fit publication.
  • 14. Internet search The next phase of development: Use of indexing tags to help users find and retrieve documents. Combination of document types, indexing tags, keyword, and metadata for search criteria. This is currently exposed only in test environment.
  • 15. Internet search This is an example of keyword “doves” coupled with “Iowa Acts” document type. Note chapter 3 indexing tags “Birds, Hunting, Game animals, Doves” cited as Related Topic(s).
  • 16. Moving forward… • One of the fears that my staff had at the start of this journey was that the “machines” might replace the “people”. In fact, quite the opposite has occurred. Because of the ability to index documents so quickly and efficiently with machine-aided tools, we are being asked to do more as an indexing staff. • There is still a lot to learn, and I believe we have only really scratched the surface regarding the true potential of this technology.
  • 17. Contact information: Roger Karns Legislative Services Agency rkarns@legis.state.ia.us 515-242-6459