Artificial intelligence breaks into our lives. In the future, everything will probably be clear, but so far, some questions have arisen, and increasingly these issues affect aspects of morality and ethics. Which principles do we need to keep in mind while surfacing machine learning algorithms? How the editorial team affects the day to day development of applications at BBC?
Place: Kharkiv National University of Radio Electronics, Ukraine
When: 17th November 2019.
Networld - Our Data Journey (2016-09-29)Patrick Ng
An introduction to the data science related projects currently undertaken at Networld. The talk was given at the Programmatic Connect 2016 event hosted by AquaMedia.
The Music Information Retrieval Evaluation eXchange (MIREX) is a valuable community service, having established standard datasets, metrics, baselines, methodologies, and infrastructure for comparing MIR methods. While MIREX has managed to successfully maintain operations for over a decade, its long-term sustainability is at risk without considerable ongoing financial support. The imposed constraint that input data cannot be made freely available to participants necessitates that all algorithms run on centralized computational resources, which are administered by a limited number of people. This incurs an approximately linear cost with the number of submissions, exacting significant tolls on both human and financial resources, such that the current paradigm becomes less tenable as participation increases. To alleviate the recurring costs of future evaluation campaigns, we propose a distributed, community-centric paradigm for system evaluation, built upon the principles of openness, transparency, reproducibility, and incremental evaluation. We argue that this proposal has the potential to reduce operating costs to sustainable levels. Moreover, the proposed paradigm would improve scalability, and eventually result in the release of large, open datasets for improving both MIR techniques and evaluation methods.
Ever felt that expecting a personalized user experience is a bit like tossing a wish to a genie, uncertain if it'll truly come to life? You're not alone.
Many platforms are in a wrestling match with the intricacies of A/B testing, turning the journey to user engagement into a bit of a head-scratcher.
Join us in this session as we zoom into the specifics of building and implementing an A/B testing ecosystem for personalized meditation recommendations. Our guest, Rohan will dive deep into key metrics, testing strategies, and top-notch practices to roll out personalized recommendations that truly enhance user experience and engagement.
My slides for my talk regarding machine learning and data science. Includes working examples with accompanying repo with reproducible code and data sets available.
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...CambridgeshireInsight
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected
Hendrik Grothuis
Research Manager - Local Intelligence & Data Management
Cambridgeshire County Council
Making Transparency Work, Birmingham,
09th June 2014.
A presentation on the Cambridgeshire Insight Open Data project with a general overview of project progress and development.
Networld - Our Data Journey (2016-09-29)Patrick Ng
An introduction to the data science related projects currently undertaken at Networld. The talk was given at the Programmatic Connect 2016 event hosted by AquaMedia.
The Music Information Retrieval Evaluation eXchange (MIREX) is a valuable community service, having established standard datasets, metrics, baselines, methodologies, and infrastructure for comparing MIR methods. While MIREX has managed to successfully maintain operations for over a decade, its long-term sustainability is at risk without considerable ongoing financial support. The imposed constraint that input data cannot be made freely available to participants necessitates that all algorithms run on centralized computational resources, which are administered by a limited number of people. This incurs an approximately linear cost with the number of submissions, exacting significant tolls on both human and financial resources, such that the current paradigm becomes less tenable as participation increases. To alleviate the recurring costs of future evaluation campaigns, we propose a distributed, community-centric paradigm for system evaluation, built upon the principles of openness, transparency, reproducibility, and incremental evaluation. We argue that this proposal has the potential to reduce operating costs to sustainable levels. Moreover, the proposed paradigm would improve scalability, and eventually result in the release of large, open datasets for improving both MIR techniques and evaluation methods.
Ever felt that expecting a personalized user experience is a bit like tossing a wish to a genie, uncertain if it'll truly come to life? You're not alone.
Many platforms are in a wrestling match with the intricacies of A/B testing, turning the journey to user engagement into a bit of a head-scratcher.
Join us in this session as we zoom into the specifics of building and implementing an A/B testing ecosystem for personalized meditation recommendations. Our guest, Rohan will dive deep into key metrics, testing strategies, and top-notch practices to roll out personalized recommendations that truly enhance user experience and engagement.
My slides for my talk regarding machine learning and data science. Includes working examples with accompanying repo with reproducible code and data sets available.
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected - He...CambridgeshireInsight
Cambridgeshire Insight Open Data: What we’ve learnt from the unexpected
Hendrik Grothuis
Research Manager - Local Intelligence & Data Management
Cambridgeshire County Council
Making Transparency Work, Birmingham,
09th June 2014.
A presentation on the Cambridgeshire Insight Open Data project with a general overview of project progress and development.
Building trust and accountability - the role User Experience design can play ...Pistoia Alliance
In this webinar our panel of UX specialists give a brief introduction to User Experience before presenting the design opportunities UX can bring to AI. We all know that AI has great potential but has some significant hurdles to overcome not least so the human aspect of trust and ethical considerations when designing in the life sciences.
Slides for keynote "Social Media and AI: Don’t forget the users" at WWW 2017 workshop "International Workshop on Modeling Social Media: Machine Learning and AI for Modeling and Analyzing Social Media". I am arguing that we need consider two things: the source of what we use to make good algorithms and whether users are impacted the way we want to impact them. The talk is based on two uses cases around providing diversity (something many of us believe is good) to users:
1. Engaging through diversity: serendipity (same algorithm, different sources)
2. Engaging through diversity: awareness (effective algorithm, perception)
My goal is to say, we may have the best AI, but we may get it wrong if we forget the users. I don't have answers, but it is important that we ask the right questions in today's world.
Natural Intelligence the human factor in AIBill Liu
Presented at AI NEXTCon Seattle 1/17-20, 2018
http://aisea18.xnextcon.com
join our free online AI group with 50,000+ tech engineers to learn and practice AI technology, including: latest AI news, tech articles/blogs, tech talks, tutorial videos, and hands-on workshop/codelabs, on machine learning, deep learning, data science, etc..
The Freedom to Grow: How Standards in Communication Facilitate Our Industry, ...dclsocialmedia
Standards – either in the XML sense or simply communication best practices – help grow, accelerate and “professionalize” an industry. Where would construction be without material standards for width and strengths, or certification for specific skills? How could we have transportation without standards for traffic and processes? Standards are what help ad-hoc processes become enterprise-class, and allow them to scale beyond our expectations.
Technical communication is in an era of rapid, disruptive and revolutionary change. The true nature of the challenge is understood by a few, and pros and cons of potential solutions by even fewer. The future therefore will require that we work together to exchange knowledge as best we can to help each other hit the many moving targets. We must do this because our old techniques and processes just can’t keep up, and no organization has the time or funds to reinvent every solution on their own.
In “The Freedom to Grow,” Noz Urbina will explain how standards can help an organization with little funds tackle larger challenges, and larger organizations implement profound change with reduced risk. The alternative is potentially getting left behind as the industry and community rush forward.
Noz Urbina is an established content strategy thought leader, consultant and trainer specializing in cutting edge, multi-channel, business-driven content projects for marketing, business, technical and omnichannel communications. He is co-author of “Content Strategy: Connecting the dots between business brand and benefits”. Since 2000, he has provided customer experience focused services to Fortune 500 organizations and small-to-medium enterprises. Noz is the founder of Urbina Consulting, and since 2006 has been Events Chair and Content Director for Congility.com.
Recommendation systems today are widely used across many applications such as in multimedia content platforms, social networks, and ecommerce, to provide suggestions to users that are most likely to fulfill their needs, thereby improving the user experience. Academic research, to date, largely focuses on the performance of recommendation models in terms of ranking quality or accuracy measures, which often don’t directly translate into improvements in the real-world. In this talk, we present some of the most interesting challenges that we face in the personalization efforts at Netflix. The goal of this talk is to sunshine challenging research problems in industrial recommendation systems and start a conversation about exciting areas of future research.
Catch a comprehensive overview of the transformative intersection between AI and User Experience (UX). Dive into practical applications, understand the nuances, and engage with the ethical challenges. Ideal for professionals, enthusiasts, and anyone curious about the future of digital experiences.
Future of land use project overview - august 2019Future Agenda
Future of Land Use
With all the challenges on the horizon, we are pleased to be exploring the future of land use via another Open Foresight major project kicking off in October and running through until next summer.
Addressing pivotal issues from food production, soil quality, water scarcity and biosphere protection to urbanisation, leisure use and land ownership, this global collaborative project is focused on the critical issues and potential solutions for the future.
Undertaken in collaboration with a wide range of major organisations, including the WWF as our global knowledge partner, the locations and schedule for the programme are now being detailed.
This is the project overview.
If you would like to be involved in this major and important topic and host one or more of the expert workshops around the world, do let us know.
Website Content Planning For Law Firms | LawLytics WebinarsDan Jaffe
Learn how to plan your law firm's content marketing strategy, and create a content plan for your law firm website that engages potential clients and referral sources, builds your reputation as a attorney, and works and plays well with the search engines and social media.
Presentation at the Netflix Expo session at RecSys 2020 virtual conference on 2020-09-24. It provides an overview of recommendation and personalization at Netflix and then highlights some of the things we’ve been working on as well as some important open research questions in the field of recommendations.
Software libre en la banca - Experiencias del grupo Santander con OSSLibreCon
Banco Santander es la empresa de mayor capitalización bursátil de España y uno de los bancos más importantes del mundo. Exposición de las razones que les han llevado no solo a utilizar software abierto en el core del software del Banco Santander, sino a liderar y desarrollar una de las iniciativas de código abierto: Open Nebula. Open Nebula es una plataforma de cloud computing para manejar infraestructuras heterogéneas de data center.
¿Por qué un banco líder apuesta por el software libre? Autor: Jesus Ruiz Martínez (Director of Open Innovation en Banco Santander). Librecon.io
A talk about how to conduct usability research without a massive budget or it being a huge undertaking. Case Studies about past experiences in guerrilla UX as well as the "patent-pending" $1000 UX Lab.
Talk originally prepared for ProductTank Madison 2018-03-14.
See https://tinyurl.com/guerrilla-ux for slides, transitions, etc.
This slide is a recruitment materials for NABLAS Inc. It provides a brief introduction of NABLAS's mission, business activities, and working environment.
NABLAS aims to create a world where people can live as human beings through human resource development, research and development, and consulting activities in the field of AI.
Mahara offers basic statistics by default, but advanced reporting functionnality is not available. By
using a separate tool that integrates well with Mahara, such as the open source Piwik, Mahara
administrators can benefit from a multitude of insightful analytics reports.
This presentation will showcase some examples of such analytics and how they can help:
- Understand user behavior and their interactions with Mahara.
- Increase user engagement by analysing the data and implementing changes.
- monitor over time the availability, speed, and evolution of the Mahara system.
Talk given at the London AICamp meet up on the 13 July 2023. It's an introduction on building open-source ChatGPT-like chat bots and some of the considerations to have while training/tuning them using Airflow.
Building trust and accountability - the role User Experience design can play ...Pistoia Alliance
In this webinar our panel of UX specialists give a brief introduction to User Experience before presenting the design opportunities UX can bring to AI. We all know that AI has great potential but has some significant hurdles to overcome not least so the human aspect of trust and ethical considerations when designing in the life sciences.
Slides for keynote "Social Media and AI: Don’t forget the users" at WWW 2017 workshop "International Workshop on Modeling Social Media: Machine Learning and AI for Modeling and Analyzing Social Media". I am arguing that we need consider two things: the source of what we use to make good algorithms and whether users are impacted the way we want to impact them. The talk is based on two uses cases around providing diversity (something many of us believe is good) to users:
1. Engaging through diversity: serendipity (same algorithm, different sources)
2. Engaging through diversity: awareness (effective algorithm, perception)
My goal is to say, we may have the best AI, but we may get it wrong if we forget the users. I don't have answers, but it is important that we ask the right questions in today's world.
Natural Intelligence the human factor in AIBill Liu
Presented at AI NEXTCon Seattle 1/17-20, 2018
http://aisea18.xnextcon.com
join our free online AI group with 50,000+ tech engineers to learn and practice AI technology, including: latest AI news, tech articles/blogs, tech talks, tutorial videos, and hands-on workshop/codelabs, on machine learning, deep learning, data science, etc..
The Freedom to Grow: How Standards in Communication Facilitate Our Industry, ...dclsocialmedia
Standards – either in the XML sense or simply communication best practices – help grow, accelerate and “professionalize” an industry. Where would construction be without material standards for width and strengths, or certification for specific skills? How could we have transportation without standards for traffic and processes? Standards are what help ad-hoc processes become enterprise-class, and allow them to scale beyond our expectations.
Technical communication is in an era of rapid, disruptive and revolutionary change. The true nature of the challenge is understood by a few, and pros and cons of potential solutions by even fewer. The future therefore will require that we work together to exchange knowledge as best we can to help each other hit the many moving targets. We must do this because our old techniques and processes just can’t keep up, and no organization has the time or funds to reinvent every solution on their own.
In “The Freedom to Grow,” Noz Urbina will explain how standards can help an organization with little funds tackle larger challenges, and larger organizations implement profound change with reduced risk. The alternative is potentially getting left behind as the industry and community rush forward.
Noz Urbina is an established content strategy thought leader, consultant and trainer specializing in cutting edge, multi-channel, business-driven content projects for marketing, business, technical and omnichannel communications. He is co-author of “Content Strategy: Connecting the dots between business brand and benefits”. Since 2000, he has provided customer experience focused services to Fortune 500 organizations and small-to-medium enterprises. Noz is the founder of Urbina Consulting, and since 2006 has been Events Chair and Content Director for Congility.com.
Recommendation systems today are widely used across many applications such as in multimedia content platforms, social networks, and ecommerce, to provide suggestions to users that are most likely to fulfill their needs, thereby improving the user experience. Academic research, to date, largely focuses on the performance of recommendation models in terms of ranking quality or accuracy measures, which often don’t directly translate into improvements in the real-world. In this talk, we present some of the most interesting challenges that we face in the personalization efforts at Netflix. The goal of this talk is to sunshine challenging research problems in industrial recommendation systems and start a conversation about exciting areas of future research.
Catch a comprehensive overview of the transformative intersection between AI and User Experience (UX). Dive into practical applications, understand the nuances, and engage with the ethical challenges. Ideal for professionals, enthusiasts, and anyone curious about the future of digital experiences.
Future of land use project overview - august 2019Future Agenda
Future of Land Use
With all the challenges on the horizon, we are pleased to be exploring the future of land use via another Open Foresight major project kicking off in October and running through until next summer.
Addressing pivotal issues from food production, soil quality, water scarcity and biosphere protection to urbanisation, leisure use and land ownership, this global collaborative project is focused on the critical issues and potential solutions for the future.
Undertaken in collaboration with a wide range of major organisations, including the WWF as our global knowledge partner, the locations and schedule for the programme are now being detailed.
This is the project overview.
If you would like to be involved in this major and important topic and host one or more of the expert workshops around the world, do let us know.
Website Content Planning For Law Firms | LawLytics WebinarsDan Jaffe
Learn how to plan your law firm's content marketing strategy, and create a content plan for your law firm website that engages potential clients and referral sources, builds your reputation as a attorney, and works and plays well with the search engines and social media.
Presentation at the Netflix Expo session at RecSys 2020 virtual conference on 2020-09-24. It provides an overview of recommendation and personalization at Netflix and then highlights some of the things we’ve been working on as well as some important open research questions in the field of recommendations.
Software libre en la banca - Experiencias del grupo Santander con OSSLibreCon
Banco Santander es la empresa de mayor capitalización bursátil de España y uno de los bancos más importantes del mundo. Exposición de las razones que les han llevado no solo a utilizar software abierto en el core del software del Banco Santander, sino a liderar y desarrollar una de las iniciativas de código abierto: Open Nebula. Open Nebula es una plataforma de cloud computing para manejar infraestructuras heterogéneas de data center.
¿Por qué un banco líder apuesta por el software libre? Autor: Jesus Ruiz Martínez (Director of Open Innovation en Banco Santander). Librecon.io
A talk about how to conduct usability research without a massive budget or it being a huge undertaking. Case Studies about past experiences in guerrilla UX as well as the "patent-pending" $1000 UX Lab.
Talk originally prepared for ProductTank Madison 2018-03-14.
See https://tinyurl.com/guerrilla-ux for slides, transitions, etc.
This slide is a recruitment materials for NABLAS Inc. It provides a brief introduction of NABLAS's mission, business activities, and working environment.
NABLAS aims to create a world where people can live as human beings through human resource development, research and development, and consulting activities in the field of AI.
Mahara offers basic statistics by default, but advanced reporting functionnality is not available. By
using a separate tool that integrates well with Mahara, such as the open source Piwik, Mahara
administrators can benefit from a multitude of insightful analytics reports.
This presentation will showcase some examples of such analytics and how they can help:
- Understand user behavior and their interactions with Mahara.
- Increase user engagement by analysing the data and implementing changes.
- monitor over time the availability, speed, and evolution of the Mahara system.
Similar to Responsible Machine Learning at the BBC (20)
Talk given at the London AICamp meet up on the 13 July 2023. It's an introduction on building open-source ChatGPT-like chat bots and some of the considerations to have while training/tuning them using Airflow.
From an idea to production: building a recommender for BBC SoundsTatiana Al-Chueyr
This presentation was given on the 28th of September 2021 at the first MLOps London Meetup
Event website: https://www.meetup.com/mlopslondon/events/280295841/
Presentation given on the 21st of September 2021 at the London Beam Meet-up
Event website: https://www.meetup.com/London-Apache-Beam-Meetup/events/280442419/
Presentation given on the 15th July 2021 at the Airflow Summit 2021
Conference website: https://airflowsummit.org/sessions/2021/clearing-airflow-obstructions/
Recording: https://www.crowdcast.io/e/airflowsummit2021/40
Presented at PyCon UK 2018 (18 September 2018, Cardiff).
The slides are incomplete.
Recording available at:
https://www.youtube.com/watch?v=-weU0Zy4Yd8
Presentation about some common mistakes English learners make - and how it is possible to try to identify part of them automatically (spelling, capitalization and article). This presentation was made during PyCon SK on the 12th of March 2016. Many of the results are due to the partnership of the University of Cambridge and Education First.
Slides presenting some numbers of PythonBrasil[8] conference (PyCon Brasil), that happened in Rio de Janeiro, during November 2012. Authors: @tati_alchueyr and @turicas
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Responsible Machine Learning at the BBC
1. Kharkiv National University of Radio Electronics
17 November 2019
@tati_alchueyr
Ethical Machine Learning
building recommendation engines with
editorial support
3. About me
● Brazilian living in London since 2014
● Senior Data Engineer at the BBC Datalab team
● Graduated in Computer Engineering at Unicamp
● Passionate software developer for 16 years
● Experience in the private and public sectors
● Developed software for Medicine, Media and Education
● Loves Open Source
● Loves Brazilian Jiu Jitsu
● Proud mother of Amanda
4. BBC
● British Broadcasting Corporation
● Values
○ Independent, impartial and honest
○ Audiences are at the heart of everything we do
○ We take pride in delivering quality and value for
money
○ Creativity is the lifeblood of our organisation
○ We respect each other and celebrate our diversity
so that everyone can give their best
5. BBC
● Founded in 1922
● Purpose
○ Inform
○ Educate
○ Entertain
● “Our organisation exists in order to serve individuals and
society as a whole rather than a small set of stakeholders.”
Reference: Gabriel Straub (BBC)
6. bbc.stats()
➢ BBC TV reaches 91% UK adult population
➢ BBC News reaches 426 million global audience weekly
Reference 1: BBC
Reference 2: BBC
Image Credit: BBC
7. BBC. .
“Bring the BBC’s data together
accessible through a common platform,
along with flexible and scalable tools to
support machine learning to enable
content enrichment and deeper
personalisation”
8. Some of the Datalab team members (15 August 2019)
BBC. .
9. BBC. .
● Multi-disciplinary team
○ Editorial
○ Data scientists
○ Engineers
○ Product Manager
○ Project Manager
15. BBC+ app experiment
● Fully personalised experience on short videos, on Android & iPhone
● Allow users to find gems that they didn’t know at a time that suits them
18. Content-based recommendations content
We create a content representation (*):
{
"genres": {
"science": 0.8,
"nature": 0.2,
}
}
(*) simplified for didactic purposes
19. Content-based recommendations user
We learn about the user indirectly
● news you read
● videos you watch
● things you search
● quizzes you answer
● things you like
● things you comment
20. Content-based recommendations user
We create a user representation (*):
{
"genres": {
"science": 0.4,
"folk-music": 0.5,
"judo": 0.1,
}
}
(*) simplified for didactic purposes
21. Content-based recommendations prediction
We use the user representation to search for content similar to it,
using Elasticsearch. As an output, we have a ranked list of content.
22. BBC+ app experiment
● How to get from algorithm to product
○ Start with content-based recommendations
○ Apply business rules
24. Legal Policies
Programme: BBC
Contempt of court
● The recommendations should not affect the
outcome of a legal case
● The BBC can be held accountable for
influencing the jury’s opinion
Action
● Create a “contempt of court risk” label by
detecting keywords such as arrest, assault,
allegation etc
● Avoid items with this label
25. Legal Policies
Electoral law
● During elections we should not surface
political content that could influence the vote
Action
● Create a “political risk” label by detecting
political content sources
● Avoid items when appropriate
26. Editorial Policies
Quality criteria
● Avoid content that shows little care has been
taken in the metadata
Action
● Avoid content with poor titles and descriptions
27. Editorial Policies
Under 16 audience
● Provide children-safe content
● BBC’s 9PM watershed
Action
● Avoid items with warnings of sex, violence,
strong language
29. GDPR
Explainability
● Choose simple models over complex ones
● UI features to provide explanations
Agency
● UI features for users to interact with the algorithm
● Eg. delete history items, like, dislike, report
30. Curation values
● Affection
● Authenticity
● Compelling
● Fresh
● Warm
● Quirky
● Relatable
● Aspirational
● Entertaining
● Reassuring
Reference: Anna McGovern
“Website editor, manager, analyst and
digital nurturer” at the BBC
Much more than click rates
31. Business values & objectives
Quantitative offline evaluation
● NDCG, hit rate, diversity, recency, surprisal
● Prioritise diversity and recency over accuracy
Qualitative offline evaluation
● Prioritise content for young audiences
● Prioritise content of editorial importance
33. BBC+ app experiment
Takeaways
● The editorial partnership is key to how we work
● The company’s principles are at the heart of all of our decisions
● There is a significant path between implementation and
production ready
36. ● 9 to 12 items on native apps and web
● Current provider: content-based algorithm
○ Poor metadata, poor recommendations
○ Popularity biases towards heritage audience
○ Cold start using editorially curated lists
○ Opportunity for improvement of performance
We decided to try a different approach: Factorisation Machines
Recommended for you
37. Recommendation strategy content-based
How it works
● Given a user, find similar content to their preferences
● Characterising item using genres, masterbrand, etc.
● Based on user’s historical data and content metadata
Challenges
● Potential lack of diversity and relies on good content description
Where can we find this?
● “You may also be interested in …”
38. How it works
● Given a user, find similar users and the content they watched
● Based on all users’ historical data
● Uses implicit feedback (user-item interactions)
Challenges
● Sparse matrix
○ SVM very efficient except in sparse settings where
not enough data to estimate interactions
● Cold start
Where can we find this?
● “Customers who viewed this item, also viewed...”
Recommendation strategy collaborative filtering
39. How it works
● Hybrid content-based and collaborative filtering
● SVM and factorisation techniques
● Based on all users’ historical data and content metadata
● Based on reliable information (latent features)
● Linear time complexity
Recommendation strategy factorisation machine
Reference: Academic Paper
40. Example
● Estimate interaction between Alice and
Star Trek
a. No case where A and ST > wA,ST= 0
b. Use factorized interaction parameters
{vA, vST}
c. Dot product of the factor vectors of A and
ST will be similar to the one of A and SW
Recommendation strategy factorisation machine
User Item Rating
Alie (A) Titanic (T) 5
Alice (A) Notting Hill (NH) 3
Alice (A) Star Wars (SW) 1
Bob (B) Star Wars (SW) 4
Bob (B) Star Trek (ST) 5
Charlie
(C)
Titanic (T) 1
Charlie
(C)
Star Wars (SW) 5
41. Qualitative Experiment
Who
● ~30 test users recruited
○ From non-editorial and editorial
teams from BBC audio networks
○ Under 35
How
● Two sets of recommendations
displayed
● Users have to pick either the best list,
or “both”, or “neither”
● And explain why
42. Qualitative Experiment Feedback
● “Need to categorize speech vs music,
background listening vs ‘serious’
content”
● “Need to consider the age of the item”
● “Looking for diverse content durations
…”
Reducing item/user biases helped to
generate more personalised
recommendations than the current state
Neither Content-
Based
Hybrid
approach
Both
2 8 17 1
7% 28.5% 61% 3.5%
45. The BBC Machile Learning Values
1. Audiences at the heart of everything we do. We celebrate diversity
○ Good value for money and focusing on using the audience-based
data to improve their experience
3. Our algorithms serve our audiences equally and fairly, so that the
full breadth of the BBC is available to everyone
6. Algorithms form only part of the content discovery process for our
audiences, and sit alongside (human) editorial curation
Reference: Gabriel Straub (BBC)
47. Flourishing in the age of AI
● Research
● 11,000 people
● 7 markets
● What people want from their lives
● How technology might enable that
Reference: Flourishing in AI report
48. Flourishing in the age of AI
“(...) people in the UK don’t think technology is being
developed with their best interests at heart”
Reference: Flourishing in AI report
49. Flourishing in the age of AI
Reference: Flourishing in AI report
● How satisfied are you with
your life?
● To what extent the thing
you do in life is
worthwhile?
● How anxious did you feel
yesterday?
Base: 5432, May 2019
63. Ethical Machine Learning
● How do you make decisions about what is fair?
● What metrics can you use?
● How to achieve an ethical machine learning in your work?
Reference: Avoiding the Fate of Icarus
Medium
мне приятно быть здесь с тобой
it's a pleasure to be here with you
большое Вам спасибо
thank you very much
UK population: 66.44 million
Ukraine: ~ 42.22 million
World wide population: 7.7 billion people as of April 2019
Image from Seven worlds, one planet
~12 million penguins live in Antarctica
https://oceanites.org/wp-content/uploads/2019/06/SOAP-2019-Online.pdf
Program: Made by Machine: when AI met the archive
https://www.bbc.co.uk/rd/blog/2018-09-artificial-intelligence-archive-made-machine
https://www.bbc.co.uk/programmes/b0bhwk3p
The General Data Protection Regulation 2016/679 is a regulation in EU law on data protection and privacy for all individual citizens of the European Union and the European Economic Area. It also addresses the transfer of personal data outside the EU and EEA
(normalised)
Discounted cumulative gain (DCG) is a measure of ranking quality. In information retrieval, it is often used to measure effectiveness of web search engine algorithms or related applications. Using a graded relevance scale of documents in a search-engine result set, DCG measures the usefulness, or gain, of a document based on its position in the result list. The gain is accumulated from the top of the result list to the bottom, with the gain of each result discounted at lower ranks.
приємно бути тут з тобою
pryyemno buty tut z toboyu
it's a pleasure to be here with you
дуже тобі дякую
duzhe tobi dyakuyu
thank you very much