google_logo Open Source
WikiLoop tools for Wikimedia projects
Exploring how Big Tech can contribute to the Open Knowledge world.
Zainan Victor Zhou, Software engineer, Google Search
Elan Hourticolon-Retzler, Software engineer, Google Search
María Cruz, Program manager, Google Open Source
Agenda
● The WikiLoop program (2 min)
● How WikiLoop uses machine learning (5 min)
● Community participation (8 min)
● WikiLoop DoubleCheck demo (5 min)
● Civility Check demo (5 min)
● Q&A, feedback (20 min)
“This program was originally conceived as a virtuous circle: providing data and tools to enhance human editors' productivity, and making the Wikipedia editorial input more machine-readable for open knowledge institutions, academia and researchers interested in advancing machine learning technology.”
Source Improvements
Maximize Neutrality
Empower Human
Bootstrap Machine
*Find image licensing info in the speaker notes
What projects fit in WikiLoop
Source Improvements
In areas where algorithms have saturated the extraction maximum, source noise becomes the dominant bottleneck.
WikiLoop provides a platform that empowers the community to reduce noise at the source itself.
What kinds of projects fit in the WikiLoop landscape
Maximize Neutrality
In areas where opinions are sharply divided and controversial, any fact/opinion choice Google makes could dissatisfy one side and be deemed "biased".
WikiLoop provides a means to obtain consensus and maximize neutrality by relying on a peer-reviewed, self-governed community.
What projects fit in WikiLoop
"Bootstrap effect"
In areas where, for lack of training data, certain AI-based services are not yet good enough to serve general Google users and use cases, community members are hand-crafting content anyway.
WikiLoop provides a platform that offers machine assistance for such knowledge creation. Although imperfect, this assistance improves their productivity and sends training data back to improve the AI models, hopefully to the point that service quality improves and bootstraps itself in the process.
[Figure: current AI precision level versus desired AI precision level; AI available to users: (none)]
What projects fit in WikiLoop
WikiLoop program - Communication & Impact
Tool in production: DoubleCheck
A web-based app to review edits on Wikipedia.
● 100K reviews milestone.
● 316 users
Outreach to Wikipedians
We wrote a story for The Signpost: 830+ pageviews, lots of engagement.
http://goo.gle/WLDCSignpost
Consultation with the community
Changed the name of the tool from “Battlefield” to “DoubleCheck”.
http://goo.gle/WLDCVote
RfC for user trust levels
Consulted with the community to implement trust levels for WLDC users.
http://goo.gle/WLDCrfc
WikiLoop DoubleCheck
(in production)
Civility Check
(proof of concept)
Combatting Incivility on Wikipedia
“Wikipedia Has a Civility Issue That
Creates a Toxic Editing Environment”
“Wikimedia Takes Steps To Fight
Harassment, Build Inclusivity”
“Wikimedia is writing new policies to
fight Wikipedia harassment”
“Wikipedia is instituting new rules to
battle "toxic behavior" by volunteers”
Proprietary + Confidential
How do we determine if a comment is potentially toxic?
Perspective API
● Created by Jigsaw and Google's Counter Abuse Technology Team
● Perspective API uses machine learning models to classify text under certain labels
○ A comment can be classified as Toxic, Severe Toxicity, Insult, Sexually Explicit, Profanity, or Threat; the API returns the probability of each classification
● Since this is a machine learning model, there are bound to be some edge cases and false positives
○ Therefore, we allow users to flag false positives/true negatives
● The API is rate limited, so we currently process only a subset of the total namespace
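As a rough illustration of the scoring step, the sketch below builds a Perspective API request for the labels named above and reads the probabilities out of a response. It is written in Python for brevity and is not the Civility Check code; the helper names (`build_request`, `extract_scores`, `analyze`) and the choice of English as the language are assumptions made for this example.

```python
import json
import urllib.request
from typing import Dict, Sequence

# Public endpoint of the Perspective API comment analyzer.
ANALYZE_URL = "https://commentanalyzer.googleapis.com/v1alpha1/comments:analyze"

# Attribute names corresponding to the labels listed on the slide.
ATTRIBUTES = (
    "TOXICITY",
    "SEVERE_TOXICITY",
    "INSULT",
    "SEXUALLY_EXPLICIT",
    "PROFANITY",
    "THREAT",
)


def build_request(text: str, attributes: Sequence[str] = ATTRIBUTES) -> Dict:
    """Build the JSON body for one comment (hypothetical helper)."""
    return {
        "comment": {"text": text},
        "languages": ["en"],  # assumption: English-language comments
        "requestedAttributes": {name: {} for name in attributes},
    }


def extract_scores(response: Dict) -> Dict[str, float]:
    """Map each returned attribute to its summary probability (0.0 to 1.0)."""
    scores = response.get("attributeScores", {})
    return {name: body["summaryScore"]["value"] for name, body in scores.items()}


def analyze(text: str, api_key: str) -> Dict[str, float]:
    """Send one comment to the API; needs network access and a valid key."""
    request = urllib.request.Request(
        f"{ANALYZE_URL}?key={api_key}",
        data=json.dumps(build_request(text)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as resp:
        return extract_scores(json.load(resp))
```

Because the API is rate limited, a caller would typically throttle calls to `analyze` and score only a sample of comments, as the last bullet describes.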
Perspective API: Calculate the incivility percentage of a comment
Wikimedia API: Get the revision comments from Wikipedia
Discover View: Load potentially uncivil comments with a high toxicity %
Detailed Edit View: Allow users to see the incivility report and the actions taken
User View: Allow users to see their own history of potentially uncivil comments
Discover Servlet: Load potentially toxic comments with a high toxicity %
Retrieve Edit Servlet: Load toxic comments and their incivility reports
User Servlet: Load potentially uncivil comments with a high toxicity %
Google Datastore: Store comments with incivility score and label; store actions taken
History View: Allow users to see the action history on all edits by all users
Action Servlet: Load actions taken by users and display them on the Action view
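The ingestion path in this architecture (fetch revision comments, score them, store them) can be sketched as follows. This is a minimal illustration in Python, not the project's servlet code: it assumes the MediaWiki Action API's `recentchanges` list as the source of revision comments, uses a plain list in place of Google Datastore, and takes the scoring function as a parameter. `recent_changes_url`, `ScoredComment`, and `ingest` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List
from urllib.parse import urlencode

# English Wikipedia's MediaWiki Action API endpoint.
API_URL = "https://en.wikipedia.org/w/api.php"


def recent_changes_url(limit: int = 50, namespace: int = 0) -> str:
    """Build a query URL for recent changes, including edit comments."""
    params = {
        "action": "query",
        "list": "recentchanges",
        "rcprop": "ids|user|comment|timestamp",
        "rcnamespace": namespace,
        "rclimit": limit,
        "format": "json",
    }
    return f"{API_URL}?{urlencode(params)}"


@dataclass
class ScoredComment:
    """One stored record: a revision comment with its score and label."""
    rev_id: int
    user: str
    comment: str
    score: float
    label: str


def ingest(response: Dict, score_fn: Callable[[str], float],
           store: List[ScoredComment]) -> None:
    """Score every non-empty revision comment and append it to the store."""
    for change in response.get("query", {}).get("recentchanges", []):
        comment = change.get("comment", "")
        if not comment:
            continue  # nothing to score
        store.append(ScoredComment(
            rev_id=change["revid"],
            user=change.get("user", ""),
            comment=comment,
            score=score_fn(comment),
            label="TOXICITY",  # assumption: a single label per record
        ))
```

Passing the scorer in as `score_fn` keeps the sketch independent of any particular API client, so it can be exercised with a stub.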
Discover View
● Displays toxic comments
● Query by:
○ User
○ Revision ID(s)
● Access to Detailed Edit View
● Hides already seen edits
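One way to picture the Discover View's behaviour is as a filter over stored comments. The Python sketch below is illustrative only: the record fields (`rev_id`, `user`, `score`), the 0.8 threshold, and the function name `discover` are assumptions, since the slides do not state how "high toxicity" is defined or how records are stored.

```python
from typing import Dict, Iterable, List, Optional, Set


def discover(comments: Iterable[Dict], seen_rev_ids: Set[int],
             threshold: float = 0.8, user: Optional[str] = None,
             rev_ids: Optional[Set[int]] = None) -> List[Dict]:
    """Return high-scoring, not-yet-seen comments, highest score first."""
    results = []
    for c in comments:
        if c["score"] < threshold:
            continue  # not toxic enough to surface
        if c["rev_id"] in seen_rev_ids:
            continue  # hide edits the reviewer has already seen
        if user is not None and c["user"] != user:
            continue  # optional query by user
        if rev_ids is not None and c["rev_id"] not in rev_ids:
            continue  # optional query by revision ID(s)
        results.append(c)
    return sorted(results, key=lambda c: c["score"], reverse=True)
```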
Detailed Edit View
● Display incivility scores in detail
● User can take actions
○ Looks good
○ Not sure
○ Should report
● The “Looks good” action acts as a flag for false positives
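The three reviewer actions and the false-positive flag could be modelled as below. This is a hedged Python sketch rather than the tool's actual data model; `Action`, `ActionRecord`, and `false_positive_rev_ids` are names invented for this example.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Iterable, Set


class Action(Enum):
    """The three actions offered in the Detailed Edit View."""
    LOOKS_GOOD = "Looks good"
    NOT_SURE = "Not sure"
    SHOULD_REPORT = "Should report"


@dataclass(frozen=True)
class ActionRecord:
    """One action taken by a reviewer on one revision (hypothetical shape)."""
    rev_id: int
    reviewer: str
    action: Action


def false_positive_rev_ids(records: Iterable[ActionRecord]) -> Set[int]:
    """Revisions marked "Looks good", i.e. flagged as false positives."""
    return {r.rev_id for r in records if r.action is Action.LOOKS_GOOD}
```

Collected this way, the flagged revisions form a set of human-labelled corrections that could be fed back to improve the model.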
User Profile View
● Displays the user’s comments and their incivility scores
● Displays the user’s average incivility score
● Same functionality as the Discover page
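The average shown on the profile can be illustrated with a short Python sketch. The slides do not say how the average is computed, so the arithmetic mean used here is an assumption, as is the function name `average_incivility`.

```python
from statistics import mean
from typing import Optional, Sequence


def average_incivility(scores: Sequence[float]) -> Optional[float]:
    """Arithmetic mean of a user's comment scores; None if there are none."""
    return mean(scores) if scores else None
```

Returning `None` for a user with no scored comments avoids presenting a misleading score of zero.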
Q&A
Thank you!

WikiLoop: Big tech's Open Knowledge contributions

  • 1. Google Open Source. WikiLoop tools for Wikimedia projects: Exploring how Big Tech can contribute to the Open Knowledge world. Zainan Victor Zhou (Software engineer, Google Search), Elan Hourticolon-Retzler (Software engineer, Google Search), María Cruz (Program manager, Google Open Source)
  • 2. Agenda
      ● The WikiLoop program (2 min)
      ● How WikiLoop uses machine learning (5 min)
      ● Community participation (8 min)
      ● WikiLoop DoubleCheck demo (5 min)
      ● Civility Check demo (5 min)
      ● Q&A and feedback (20 min)
  • 3. “This program was originally conceived as a virtuous circle: providing data and tools to enhance human editor's productivity, and making the Wikipedia editorial input more machine-readable for open knowledge institutions, academia and researchers interested in advancing machine learning technology.”
  • 4. What projects fit in WikiLoop: Source Improvements; Maximize Neutrality; Empower Human; Bootstrap Machine. *Find image licensing info in the speaker notes
  • 5. What kinds of projects fit in the WikiLoop landscape: Source Improvements. In areas where algorithms have saturated the extraction maximum, source noise becomes the dominant bottleneck. WikiLoop provides a platform to empower the community to reduce noise in the source itself.
  • 6. What projects fit in WikiLoop: Maximize Neutrality. In areas where opinions are largely divided and controversial, any fact/opinion choice Google makes could dissatisfy one side and be deemed "biased". WikiLoop provides a means to obtain consensus and maximize neutrality by relying on a peer-reviewed, self-governed community.
  • 7. What projects fit in WikiLoop: "Bootstrap effect". In areas where a lack of training data means certain AI-based services are not good enough to serve general Google users and use cases, community members are hand-crafting content anyway. WikiLoop provides a platform that offers machine assistance for such knowledge creation. Though imperfect, this assistance improves their productivity and sends training data back to improve AI models, hopefully to the point that service quality improves and bootstraps in this process. [Figure labels: Current AI precision level; Desired AI precision level; AI available to users (none)]
  • 8. WikiLoop program - Communication & Impact
      Tool in production: DoubleCheck. A web-based app to review edits on Wikipedia.
      ● 100K reviews milestone
      ● 316 users
      Outreach to Wikipedians: We wrote a story for The Signpost, +830 pageviews, lots of engagement. http://goo.gle/WLDCSignpost
      Consultation with the community: Change the name of the tool from “Battlefield” to “DoubleCheck”. http://goo.gle/WLDCVote
      RfC for user trust levels: Consulted with the community to implement trust levels for WLDC users. http://goo.gle/WLDCrfc
  • 9. WikiLoop DoubleCheck (in production)
  • 10. Civility Check (proof of concept)
  • 11. Combatting Incivility on Wikipedia “Wikipedia Has a Civility Issue That Creates a Toxic Editing Environment” “Wikimedia Takes Steps To Fight Harassment, Build Inclusivity” “Wikimedia is writing new policies to fight Wikipedia harassment” “Wikipedia is instituting new rules to battle "toxic behavior" by volunteers”
  • 12. Proprietary + Confidential How do we determine if a comment is potentially toxic? Perspective API
      ● Created by Jigsaw and Google's Counter Abuse Technology Team
      ● Perspective API uses machine learning models to classify text under certain labels
          ○ A comment can be classified as Toxic, Severe Toxicity, Insult, Sexually Explicit, Profanity, or Threat, and the API returns the probability of each classification
      ● Since this is a machine learning model, there are bound to be some edge cases and false positives
          ○ Therefore, we allow users to flag false positives (comments scored as toxic that are actually fine)
      ● The API is rate limited, so we currently process only a subset of the total namespace
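As a rough illustration of the scoring step described on this slide, the sketch below builds a Perspective API request body and reads per-label probabilities out of a response. It makes no network call: `sample_response` is hand-written for illustration, and the helper names and the 0.8 threshold are assumptions, not values from the talk.

```python
from typing import Dict, List

# Attribute identifiers as published in the Perspective API documentation,
# matching the labels listed on the slide.
ATTRIBUTES = [
    "TOXICITY",
    "SEVERE_TOXICITY",
    "INSULT",
    "SEXUALLY_EXPLICIT",
    "PROFANITY",
    "THREAT",
]


def build_request(text: str) -> dict:
    """Build the JSON body for a comments:analyze request."""
    return {
        "comment": {"text": text},
        "requestedAttributes": {name: {} for name in ATTRIBUTES},
    }


def extract_scores(response: dict) -> Dict[str, float]:
    """Map each returned attribute to its summary probability."""
    scores = response.get("attributeScores", {})
    return {name: entry["summaryScore"]["value"] for name, entry in scores.items()}


def labels_above(scores: Dict[str, float], threshold: float = 0.8) -> List[str]:
    """Return labels whose probability meets the (hypothetical) threshold."""
    return sorted(name for name, value in scores.items() if value >= threshold)


# Hand-written response in the documented shape, for illustration only.
sample_response = {
    "attributeScores": {
        "TOXICITY": {"summaryScore": {"value": 0.91, "type": "PROBABILITY"}},
        "INSULT": {"summaryScore": {"value": 0.85, "type": "PROBABILITY"}},
        "THREAT": {"summaryScore": {"value": 0.04, "type": "PROBABILITY"}},
    }
}

scores = extract_scores(sample_response)
flagged = labels_above(scores)
```

Keeping the request builder and the response parser as pure functions means the HTTP client, and any rate limiting around it, can be swapped without touching the scoring logic.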
  • 13. [Architecture diagram]
      ● Perspective API: Calculate the incivility percentage of a comment
      ● Wikimedia API: Get the revision comments from Wikipedia
      ● Discover View: Load potentially uncivil comments with a high toxicity %
      ● Detailed Edit View: Allow users to see the incivility report and take actions
      ● User View: Allow users to see their own history of potentially uncivil comments
      ● History View: Allow users to see the action history on all edits by all users
      ● Discover Servlet: Load potentially toxic comments with a high toxicity %
      ● Retrieve Edit Servlet: Load toxic comments and their incivility reports
      ● User Servlet: Load potentially uncivil comments with a high toxicity %
      ● Action Servlet: Load actions taken by users and display them on the Action view
      ● Google Datastore:
          ○ Store comments with incivility score & label
          ○ Store actions taken
  • 14. Discover View
      ● Displays toxic comments
      ● Query by:
          ○ User
          ○ Revision ID(s)
      ● Access to Detailed Edit View
      ● Hides already seen edits
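The Discover View behaviour listed above (high-toxicity comments, optional filters by user or revision ID, already seen edits hidden) could be sketched as a single query function. This is a minimal in-memory sketch: the `ScoredComment` record, the `discover` function, and the 0.8 threshold are hypothetical names and values, and the real tool reads from Google Datastore rather than a Python list.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List, Optional, Set


@dataclass(frozen=True)
class ScoredComment:
    """Hypothetical record for a revision comment and its toxicity score."""
    revision_id: int
    user: str
    text: str
    toxicity: float


def discover(
    comments: Iterable[ScoredComment],
    threshold: float = 0.8,
    user: Optional[str] = None,
    revision_ids: Optional[Set[int]] = None,
    seen: FrozenSet[int] = frozenset(),
) -> List[ScoredComment]:
    """Return unseen high-toxicity comments, most toxic first."""
    results = []
    for c in comments:
        if c.toxicity < threshold:
            continue  # below the (assumed) display threshold
        if user is not None and c.user != user:
            continue  # "Query by: User"
        if revision_ids is not None and c.revision_id not in revision_ids:
            continue  # "Query by: Revision ID(s)"
        if c.revision_id in seen:
            continue  # "Hides already seen edits"
        results.append(c)
    return sorted(results, key=lambda c: c.toxicity, reverse=True)


# Made-up sample data for illustration.
comments = [
    ScoredComment(101, "Alice", "first comment", 0.95),
    ScoredComment(102, "Bob", "second comment", 0.40),
    ScoredComment(103, "Alice", "third comment", 0.85),
    ScoredComment(104, "Bob", "fourth comment", 0.90),
]

all_ids = [c.revision_id for c in discover(comments)]
alice_ids = [c.revision_id for c in discover(comments, user="Alice")]
unseen_ids = [c.revision_id for c in discover(comments, seen=frozenset({101}))]
```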
  • 15. Detailed Edit View
      ● Displays incivility scores in detail
      ● User can take actions:
          ○ Looks good
          ○ Not sure
          ○ Should report
      ● The "Looks good" action acts as a flag for false positives
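One way to picture how the three reviewer actions feed back into the tool is shown below, with "Looks good" doubling as a false-positive flag. This is a sketch under stated assumptions: the `Action` enum, the tuple layout, and `false_positive_revisions` are illustrative names and do not come from the Civility Check code.

```python
from enum import Enum
from typing import Iterable, List, Tuple


class Action(Enum):
    """The three reviewer actions named on the slide."""
    LOOKS_GOOD = "Looks good"
    NOT_SURE = "Not sure"
    SHOULD_REPORT = "Should report"


# Hypothetical record layout: (revision_id, reviewer, action)
ActionRecord = Tuple[int, str, Action]


def false_positive_revisions(actions: Iterable[ActionRecord]) -> List[int]:
    """Revisions where at least one reviewer chose "Looks good"."""
    return sorted({rev for rev, _, act in actions if act is Action.LOOKS_GOOD})


# Made-up sample data for illustration.
recorded = [
    (7, "Alice", Action.LOOKS_GOOD),
    (8, "Bob", Action.SHOULD_REPORT),
    (9, "Alice", Action.NOT_SURE),
    (7, "Bob", Action.LOOKS_GOOD),
]

flagged_as_false_positive = false_positive_revisions(recorded)
```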
  • 16. User Profile View
      ● Displays the user’s comments and their incivility scores
      ● Displays the user’s average incivility score
      ● Same functionalities as the Discover page
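The per-user average shown in the profile view can be read as a plain arithmetic mean over that user's comment scores. The sketch below assumes exactly that, since the slide does not say how the average is computed. The function name and the input layout are hypothetical.

```python
from statistics import mean
from typing import Dict, List


def average_incivility(scores_by_user: Dict[str, List[float]]) -> Dict[str, float]:
    """Arithmetic mean of each user's scores; users with no scores are skipped."""
    return {user: mean(scores) for user, scores in scores_by_user.items() if scores}


# Made-up sample data for illustration.
averages = average_incivility({"Alice": [0.25, 0.75], "Bob": []})
```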