How Does the USA Today Network Provide Its Readers With Meaningful Content? - Devansh Dhutia, USA Today Network

This document summarizes Devansh Dhutia's presentation about content delivery at USA Today Network. It describes a network with roughly 1.1 billion monthly views and about 3,000 journalists, where fewer than 1% of end users use active search while more than 75% of pages rely on passive search. It also outlines initiatives such as My Topics for personalized alerts, News Near Me for local content, and the different types of backfill content used to supplement news coverage. The presentation concludes with lessons learned and opportunities to improve the distributed platform and the content pipeline.

Delivering Meaningful Content at USA Today Network
Devansh Dhutia
Manager, Development – USA Today Network
Montreal
October 15-18

Agenda
• About USA Today Network
• Where are the readers?
• My Topics
• News Near Me
• Content Backfill
– Types of backfill
• Distributed Platform
– Time series data
– Rights Management
• Lessons Learned
• Enter Content Pipeline
• Where to next?
• Q&A

About USA Today Network
~ 1.1 billion monthly views
~ 3000 journalists creating content
< 1% of end users use active search
> 75% of pages leverage passive search
100% of authors leverage active/passive search to package & curate content
~ 25 million monthly syndication requests

My Topics
• Automated push alerts
• Customized headline feed
• Increased reader engagement
• Native apps only
“This is just one of the first ways we’re making personalized consumer experiences a priority”
Jason Jedlinski, VP Product Management

News Near Me
• Surface local content to a national audience
• Smart closest market detection
LOCAL IS NATIONAL.
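
The slides do not say how closest market detection works. As a minimal sketch, assuming it is a two-step lookup (find the nearest market by geographic distance, then return that market's content), it could look like the following. The market table, the coordinates, and the `market` field on articles are hypothetical, and great-circle distance is a swapped-in technique, not one the presentation confirms.

```python
from math import asin, cos, radians, sin, sqrt

# Hypothetical market table: name -> (latitude, longitude) in degrees.
MARKETS = {
    "Indianapolis": (39.77, -86.16),
    "Phoenix": (33.45, -112.07),
    "Detroit": (42.33, -83.05),
}


def haversine_km(a, b):
    """Great-circle distance in kilometres between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(radians, (a[0], a[1], b[0], b[1]))
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))


def closest_market(reader_location, markets=MARKETS):
    """Step 1: pick the market whose centre is nearest to the reader."""
    return min(markets, key=lambda name: haversine_km(reader_location, markets[name]))


def news_near_me(reader_location, articles, markets=MARKETS):
    """Step 2: keep only the articles published by that market."""
    market = closest_market(reader_location, markets)
    return [a for a in articles if a["market"] == market]
```

A real system would likely fall back to a wider radius when the nearest market has little fresh content; that refinement is left out here.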

Types of backfill
• Chronological
• Popular
• Trending
• Personalized
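
The slides name the backfill types without giving formulas. The sketch below shows one way the first three could be expressed as orderings over the same article list. The field names (`published`, `views`) and the trending formula (views discounted by age, with an assumed `gravity` exponent) are illustrative assumptions, not the network's actual scoring.

```python
from datetime import datetime, timedelta, timezone


def chronological(articles):
    """Newest first."""
    return sorted(articles, key=lambda a: a["published"], reverse=True)


def popular(articles):
    """Most viewed first, regardless of age."""
    return sorted(articles, key=lambda a: a["views"], reverse=True)


def trending_score(article, now, gravity=1.5):
    """Views discounted by age in hours; gravity is an assumed tuning knob."""
    age_hours = (now - article["published"]).total_seconds() / 3600
    return article["views"] / (age_hours + 2) ** gravity


def trending(articles, now, gravity=1.5):
    """Highest age-discounted view count first."""
    return sorted(articles, key=lambda a: trending_score(a, now, gravity), reverse=True)
```

With this scoring, a two-day-old story with many views can lead the popular list while a one-hour-old story with fewer views leads the trending list.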

Chronological
• Historical solution
• Type Filters
• Publish Behaviors

Popular
• Straightforward sort
• Publish Behaviors

Personalization
• Many options
• News is temporal
• Too many one & dones
• Collaborative filtering is “hard”
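
One reading of "one & dones" is readers who view a single article and never return; that interpretation is an assumption. Under it, the sketch below shows why they make collaborative filtering hard: an item-to-item co-occurrence signal only exists for readers with at least two distinct reads. The function names and the input shape are hypothetical.

```python
from collections import Counter
from itertools import combinations


def co_occurrence(reads_by_user):
    """Count how often two articles were read by the same user."""
    pairs = Counter()
    for articles in reads_by_user.values():
        for a, b in combinations(sorted(set(articles)), 2):
            pairs[(a, b)] += 1
    return pairs


def usable_fraction(reads_by_user):
    """Share of users who read at least two distinct articles."""
    if not reads_by_user:
        return 0.0
    usable = sum(1 for articles in reads_by_user.values() if len(set(articles)) >= 2)
    return usable / len(reads_by_user)
```

Users with a single read contribute no pairs at all, so the more of them there are, the sparser the signal a recommender has to work with.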

Distributed Platform
• Not all readers are on platform
• Push vs pull
Off platform

Lessons Learned
• Writing data the way you want to read it is fast
– Challenge: keeping the various views consistent
– Challenge: related data changes require large reprocessing
– Challenge: lack of a strict schema makes changes unpredictable
– Challenge: business logic is spread across multiple tiers
• Denormalization simplifies queries
– Challenge: simple data changes require re-indexing large chunks of data
– Challenge: denormalization makes your index look different from your data

Lessons Learned (cont.)
• Expose the raw search engine’s power to users
– Challenge: most users don’t care to craft Solr-specific queries
– Challenge: new use cases can go to production without query review
– Challenge: query sprawl

Enter Content Pipeline
• Write the data once
– Benefit: single view to maintain
– Benefit: all consumers work off a single model
– Benefit: business logic pushed to production tier
• Normalize models in storage and in index
– Benefit: related model updates do not require reprocessing
– Benefit: retrieve only the data you care about
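
To illustrate the reprocessing trade-off between denormalized and normalized models, here is a small sketch using in-memory dictionaries in place of a real store and index. The document shapes (`author_id`, `author_name`) and the function names are hypothetical; the point is only the difference in how many records an author rename touches.

```python
def rename_author_denormalized(index, author_id, new_name):
    """Each article embeds the author name, so every matching document is rewritten."""
    touched = 0
    for doc in index.values():
        if doc["author_id"] == author_id:
            doc["author_name"] = new_name
            touched += 1
    return touched


def rename_author_normalized(authors, author_id, new_name):
    """Articles store only author_id, so a single author record is rewritten."""
    authors[author_id]["name"] = new_name
    return 1


def byline(article, authors):
    """Resolve the author name at read time from the normalized author record."""
    return authors[article["author_id"]]["name"]
```

The normalized form pays for this with a lookup at read time, which is the cost that denormalization was originally avoiding.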

Enter Content Pipeline (cont.)
• GraphQL: customers choose what they want
– Benefit: customers have an à la carte selection of data to query
– Benefit: all data access becomes uniform
– Benefit: API engineers can understand what data is actually used and what isn’t
• Abstract the search index nuances away from the user
– Benefit: search becomes another GraphQL query
– Benefit: new searches are reviewed
– Benefit: relevance engineering can happen independently from application development
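
The field-selection idea can be sketched without a GraphQL server. The plain-Python function below is not GraphQL and does not use any GraphQL library; it only mimics two of the listed benefits: callers receive just the fields they ask for, and the API side can count which fields are actually requested. The record shape and function name are hypothetical.

```python
from collections import Counter


def select_fields(record, fields, usage):
    """Return only the requested fields and tally which ones were asked for."""
    unknown = [f for f in fields if f not in record]
    if unknown:
        raise KeyError(f"unknown fields: {unknown}")
    usage.update(fields)
    return {f: record[f] for f in fields}


# Roughly what a hypothetical GraphQL query such as
#   { article(id: "123") { headline url } }
# asks for, expressed as a plain field list.
article = {
    "id": "123",
    "headline": "Local is national",
    "url": "https://example.com/story/123",
    "body": "Full text of the story",
}
usage = Counter()
summary = select_fields(article, ["headline", "url"], usage)
```

After a period of traffic, fields whose count in `usage` stays at zero are candidates for removal from the model.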

Where to next?
• Feedback loops from the distributed platform
• Solve the “hard” personalization problem
• Switch more of our customers to the GraphQL-based content pipeline
• Faster “on-the-fly” access management

We’re hiring!
usatodaynetworkcareers.com

THANK YOU.
CONTACT
Devansh Dhutia
ddhutia@gannett.com
linkedin.com/in/devanshdhutia

Editor's Notes

• Microservice polling Solr
• Near real time
• High adoption rate: 8% lift in PV depth and high lift on return frequency
• Increased local market app downloads through cross-pollination of content
• Two-step detection: closest market first, then content from that market
• Widening capability for news near you
• Unlike retail, whose lifecycle continues, news is short-lived
• AN: 14.5M users/month
• Significant amount of content used by various partners
• DRM
• Pre-tagging content: reindex everything on changes
• Time series sharding
