
Society of indexers keynote



"Publishing Revolutions" keynote presentation to the Society of Indexers 2011 conference

Published in: Technology, Business


  1. Publishing Revolutions • Nic Gibson • 2nd September 2011
  2. What’s Happening? • Publishing is changing • Outsourcing • Digital Publishing • XML • A single retailer has gained huge power
  3. Digital markets are maturing • More books will become digital only • Printing may be only a small part of the cost, but it is still a part • Traditional publishing will be competing against a different cost base
  4. Change happens • The business is changing • Publishing is paper centric • It needs to be content centric • The paper mindset needs to be replaced by a content mindset.
  5. The world has changed • When did you last • use an encyclopaedia? • pick up a dictionary? • These are digital activities now
  6. Publishers are using XML • Text + tags • It’s a mark-up language • Made up of elements, text and attributes • Elements can contain other elements • Ideally we represent the meaning of the content, not its formatting
  7. A little bit more… • XML files are basically text files with rules • The rules are • Content is marked up with elements • Elements contain other elements or text • An element is made up of an opening tag and a closing tag • Elements can contain attributes • <para id="para001">This is a paragraph.</para>
  8. XML…
     <?xml version="1.0" encoding="UTF-8"?>
     <article xmlns="" xmlns:xlink="" version="5.0">
       <info>
         <title>Publishing Revolutions</title>
         <abstract><para>Let's talk about all the changes in publishing going on right now.</para></abstract>
       </info>
       <section>
         <title>What's Happening?</title>
         <itemizedlist>
           <listitem><para>Publishing is changing</para></listitem>
           <listitem>
             <itemizedlist>
               <listitem><para>Outsourcing</para></listitem>
               <listitem><para>Digital Publishing</para></listitem>
               <listitem><para>XML</para></listitem>
             </itemizedlist>
           </listitem>
         </itemizedlist>
       </section>
     </article>
  9. Indexing the XML • <para xml:id="the0000116">As a business strategy, the Internet giants’ formula is simple: The more personally relevant their information offerings are, the more ads they can sell, and the more likely you are to buy the products they’re offering. And the formula works. <indexterm><primary>Amazon</primary></indexterm> Amazon sells billions of dollars in merchandise by predicting what each customer is interested in and putting it in the front of the virtual store. Up to 60 percent of <indexterm><primary>Netflix</primary></indexterm>Netflix’s rentals come from the personalized guesses it can make about each customer’s movie preferences—and at this point, Netflix can predict how much you’ll like a given movie within about half a star. Personalization is a core strategy for the top five sites on the Internet— Yahoo, Google, Facebook, YouTube, and Microsoft Live—as well as countless others.</para>
  10. Standards • The mark-up on the last slide is a language called DocBook • A standard for (originally) technical publishing • It provides • <indexterm> • <primary>, <secondary>, <tertiary> • <see>, <seealso> • The indexing mark-up was not created by indexers • Why not? • Please get involved: the XML community needs you
  11. Why XML? • Agile content • XML is an enabler for multi-format publishing • EPUB • Kindle • Web • multiple print formats • custom book publishing
  12. Traditional workflows • Traditional publishing workflows do not handle digital content well • They are well known • Developed over a long period • An efficient way to produce books
  13. Traditional workflows • [Workflow diagram; box labels: Manuscript, Prepare for copyediting, Copyediting, Author queries and corrections, Prepare for typesetting, Typesetting, Proofs, Indexing, Proofreading, Print ready proofs, Printing, Other formats?]
  14. Content driven publishing • XML is a valuable tool • It’s not an answer in itself • An XML-first workflow doesn’t necessarily get you anywhere • It’s the neutral format for your content
  15. XML first workflows • [Workflow diagram; box labels: Manuscript, Copyediting and Tagging, Conversion to XML, Place into XML Repository, Author queries and corrections, Indexing, Transformation, Proofs, Proofreading/Copyediting; outputs: HTML, EPUB, Kindle, Print Ready PDF, eBook PDF, …]
  16. How does this affect indexers? • Publishers are outsourcing more and more • Indexers have almost always been freelance anyway • Publishers are now outsourcing the entire process • Mostly to India and China • Non-native speakers • That might affect indexers • Increasing automation • I don’t think automated indexing is a good idea, but not everyone agrees
  17. eBooks • Indexes in eBooks are a bit of a problem • Random House: “We don’t need indexes any more because we can search the text” • Many eBooks have an unlinked index with page numbers at the back • Some have links with page numbers • Some publishers take out the index completely
  18. eBooks • Right now, Amazon dominate the market with the Kindle • Everyone else supports the EPUB format • Apple have about 10% of the European market and Amazon have 85% • The other retailers are basically irrelevant • The organisations controlling this market are not the publishers
  19. eBook Indexes
  20. Less bad?
  21. Better?
  22. Indexes and eBooks • There is no point in talking to publishers about the way indexes work in eBooks • Talk to Apple, Amazon, Sony, Barnes & Noble • Think of the amazing things you could do with an index on an eBook • intelligent search • new ways to present the index • view the index and display snippets of content
  23. What Now?
  24. This may affect you • As publishers move towards that XML-first workflow, the old ways of indexing may not be appropriate • As publishers move to entirely digital workflows, the index becomes a digital artefact • The concept of the page is going to become much less important • A true XML index has no presentation • Whether an entry is italic, bold, etc. is no longer important to the publisher, who needs to present it in five different ways
  25. The index is more important • The digital publishing world includes the Web • On the web, the primary way we can find things is through searches • A good index is a fantastic search corpus • search the index, not the book • Semantic search has been promised “next year” for twenty years • you are good at this and computers aren’t • Convergence is happening in search: taxonomies and indexes are both used
  26. What can the Society do? • Talk to the industry • Not necessarily publishers • Join the International Digital Publishing Forum • Talk to your software suppliers about indexing XML directly • Join the Organization for the Advancement of Structured Information Standards (OASIS) • Have a corporate policy on XML: create some XML indexing standards yourselves! • Create relationships with bibliographic communities because they are already thinking hard about these things
  27. What can individuals do? • Learn some XML • It’s not hard • Go on a course? • Improve your digital skills • The industry is changing and being aware of the new environment can only help you • Push back on clients: make sure they understand what an index is
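
As a rough illustration of the points on slides 9, 10 and 24, the sketch below reads inline <indexterm> mark-up and builds an index whose locators are paragraph ids rather than page numbers. It is only a sketch under stated assumptions, not how any publisher's toolchain works: the sample is abridged from the slide 9 paragraph, the second paragraph (id "the0000117") and its <secondary> term "predictions" are hypothetical, no DocBook namespace is declared, and the function names collect_index and render are invented for this example. It uses only Python's standard xml.etree.ElementTree module.

```python
import xml.etree.ElementTree as ET

# xml:id belongs to the reserved XML namespace; ElementTree exposes it under this key.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Abridged from slide 9. The second <para>, its id and its <secondary>
# term are hypothetical additions for illustration only.
SAMPLE = """<section>
  <para xml:id="the0000116">And the formula works.
    <indexterm><primary>Amazon</primary></indexterm>Amazon sells billions of dollars in merchandise.
    Up to 60 percent of <indexterm><primary>Netflix</primary></indexterm>Netflix's rentals come from personalized guesses.</para>
  <para xml:id="the0000117"><indexterm><primary>Netflix</primary><secondary>predictions</secondary></indexterm>Netflix can predict how much you will like a given movie.</para>
</section>"""


def collect_index(xml_text):
    """Map each (primary, secondary) heading to the ids of the paragraphs that carry it."""
    root = ET.fromstring(xml_text)
    index = {}
    for para in root.iter("para"):
        locator = para.get(XML_ID, "(no id)")
        for term in para.iter("indexterm"):
            primary = term.findtext("primary", default="").strip()
            secondary = term.findtext("secondary", default="").strip()
            index.setdefault((primary, secondary), []).append(locator)
    return index


def render(index):
    """One possible plain-text presentation; the XML itself prescribes none."""
    lines = []
    for (primary, secondary), locators in sorted(index.items()):
        heading = primary if not secondary else f"{primary}, {secondary}"
        lines.append(f"{heading}: {', '.join(locators)}")
    return lines


if __name__ == "__main__":
    for line in render(collect_index(SAMPLE)):
        print(line)
```

The collected index carries no formatting at all. render is just one of the "five different ways" a publisher might present it, and the same data could equally feed a linked eBook index or a search over index headings.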