The document discusses improving content categorization accuracy using taxonomies. It describes the goal of the IAB taxonomy working group to create an enhanced taxonomy that enables more accurate and consistent content description to facilitate relevant advertising. The document compares the Sprig and IAB taxonomy structures and notes they are designed for advertising rather than classifying news content. It proposes using IPTC-IAB taxonomy mapping and deep learning as a solution to better classify news while acknowledging limitations of existing IAB categories.
3. Taxonomy Structures
Sprig; A complete list of content categories returned for the ‘classifyText’ method are shown below.
The Natural Language API filters the categories returned by the ‘classifyText’ method to include only the most relevant
categories for a request.
IAB- “The working group was conscious that the taxonomy should be leveraged by publishers and advertisers to describe
the topical “aboutness” of content with the primary purpose of facilitating relevant, brand safe, and effective advertising. “
The goal for the group was to create an enhanced and more powerful taxonomy, enabling content creators to
more accurately and consistently describe content, facilitating more relevant advertising and providing a higher quality
and more granular foundation for data analysis.
4. Taxonomy Structures
Sprig:
“A complete list of content categories returned for the ‘classifyText’ method are shown below.
The Natural Language API filters the categories returned by the ‘classifyText’ method to include only the most relevant
categories for a request.”
IAB:
“The working group was conscious that the taxonomy should be leveraged by publishers and advertisers to describe
the topical “aboutness” of content with the primary purpose of facilitating relevant, brand safe, and effective advertising. “
The goal for the group was to create an enhanced and more powerful taxonomy, enabling content creators to
more accurately and consistently describe content, facilitating more relevant advertising and providing a higher quality
and more granular foundation for data analysis.
13. The crux of the matter is
❏ Taxonomy Structures are designed to classify content, not
news/information
❏ Publisher in advertising means content publisher, not news
publisher
❏ IAB & Sprig both are designed to cater to advertising, not news
webpage
❏ It also explains why contextual keyword types have always
struggled in News domains
18. Everyone knows the limitations of IAB categories,
So let’s keep that aside and try to achieve
the maximum accuracy level with the current structure.
The more important question is