Soylent: A Word Processor with a Crowd Inside
Michael Bernstein (msbernst@csail.mit.edu), Greg Little, Rob Miller, David Karger, David Crowell, Katrina Panovich (MIT CSAIL)
Björn Hartmann (UC Berkeley)
Mark Ackerman (University of Michigan)
MIT Human-Computer Interaction
Shortening A Paper to Ten Pages
1) Do it yourself
2) Use an AI
3) Ask colleagues
4) Recruit a crowd
Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks.
Soylent is people. Soylent’s core algorithms are human-powered.
Amazon Mechanical Turk
- Find Unnecessary Text. Requester: Matt C. Reward: $0.01. Tasks Available: 7
- Shorten Rambling Text. Requester: Gordon L. Reward: $0.04. Tasks Available: 12
Find-Fix-Verify: Crowd control design pattern
- Find a problem
- Fix each problem
- Verify quality of each fix
Embed paid crowd workers in user interfaces to support cognition and manipulation tasks on demand
demo
State of the Art
Word processors don’t:
- parse semantics well
Paid crowds don’t:
- guarantee quality
- control costs
[Kittur ’08]
Word with a Crowd Inside
Paid crowds can:
- parse semantics well
Soylent can:
- guarantee quality
- control costs
Challenges in Programming Crowds
This project has interacted with ~9000 Turkers on ~2000 different tasks.
Key Problem: crowd workers often produce poor output on open-ended tasks.
30% Rule: ~30% of the results from open-ended tasks will be unsatisfactory.
Outline: Introduction, Demo, Find-Fix-Verify, Evaluation, Discussion
Two Personas: An Example Proofread and correct the following paragraph: The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
The Lazy Turker
Does as little work as necessary to be paid
The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradeship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
The Eager Beaver
Goes beyond task requirements to be helpful, but introduces errors in the process
The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections of this story. This theme occurs during many circumstances but is not present from start to finish. In my mind, for a theme to be pervasive it must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradeship. But in my opinion there is only one theme that is present from beginning to end: this theme is pursuit of dreams.
Find-Fix-Verify
Programming crowds today is haphazard, similar to UI technology before design patterns like Model-View-Controller.
Find-Fix-Verify is a design pattern for programming crowds to complete open-ended tasks.
Find: “Identify at least one area that can be shortened without changing the meaning of the paragraph.”
- Independent agreement to identify patches
Fix: “Edit the highlighted section to shorten its length without changing the meaning of the paragraph.”
- Randomize order of suggestions
Verify: “Choose at least one rewrite that has style errors, and at least one rewrite that changes the meaning of the sentence.”
- Keep suggestions that do not get voted out
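The three stages above can be sketched as a small pipeline. This is a minimal illustration, not Soylent's implementation: workers are modeled as plain Python callables, and the function name `find_fix_verify`, the `min_agreement` threshold, exact-span matching for agreement, and the "rejected by a majority" rule for voting out a rewrite are all assumptions made for the example.

```python
import random
from collections import Counter


def find_fix_verify(paragraph, finders, fixers, verifiers, min_agreement=2, seed=0):
    """Hypothetical sketch of Find-Fix-Verify with simulated workers.

    finders:   callables paragraph -> list of (start, end) spans
    fixers:    callables (paragraph, span) -> rewrite string
    verifiers: callables (paragraph, span, rewrites) -> rewrites they reject
    """
    rng = random.Random(seed)

    # Find: keep only spans that several workers marked independently.
    # (Assumption: agreement means the exact same span.)
    votes = Counter()
    for finder in finders:
        votes.update(set(finder(paragraph)))
    patches = sorted(span for span, n in votes.items() if n >= min_agreement)

    accepted = {}
    for span in patches:
        # Fix: a separate set of workers each proposes a rewrite for the patch.
        rewrites = sorted({fixer(paragraph, span) for fixer in fixers})
        # Randomize the order in which suggestions are shown to verifiers.
        rng.shuffle(rewrites)

        # Verify: workers flag bad rewrites; keep those not voted out.
        # (Assumption: "voted out" means rejected by at least half.)
        rejections = Counter()
        for verifier in verifiers:
            rejections.update(set(verifier(paragraph, span, rewrites)))
        accepted[span] = [r for r in rewrites if 2 * rejections[r] < len(verifiers)]
    return accepted


paragraph = "In order to get people to pay, publishers must offer something new."
wordy = (0, 11)  # paragraph[0:11] == "In order to"

finders = [lambda p: [wordy], lambda p: [wordy], lambda p: [(12, 15)]]
fixers = [lambda p, s: "To", lambda p, s: "To", lambda p, s: "So as to hopefully"]


def strict(p, span, rewrites):
    # Rejects any rewrite that is not shorter than the original span.
    return [r for r in rewrites if len(r) >= span[1] - span[0]]


def lenient(p, span, rewrites):
    # A Lazy Turker style verifier that rejects nothing.
    return []


result = find_fix_verify(paragraph, finders, fixers, [strict, strict, lenient])
print(result)  # {(0, 11): ['To']}
```

The span marked by only one finder is dropped in Find, and the longer rewrite is voted out in Verify, so a single lazy or overeager worker cannot decide the outcome alone.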
Find-Fix-Verify Discussion
Why split Find and Fix?
- Forces Lazy Turkers to work on a problem of our choice
- Allows us to merge work completed in parallel
Why add Verify?
- Quality rises when we place Turkers in productive tension
- Allows us to trade off lag time with quality
Evaluation Goals
Is Soylent’s crowdsourced user interface approach feasible?
1) How high is the quality?
2) How long is the delay?
3) How much does it cost?
Shortn
Blog: Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.
Classic UIST Paper: The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper introduced the metaDESK along with two companion platforms, the transBOARD and ambientROOM.
Draft UIST Paper: In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool that uses lightweight text input to capture richly structured information for later retrieval and navigation.
Technical Writing: Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key (the index bits) and the next 15 low order bits (the key fragment).
Rambling E-mail: A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
Shortn: Results
Cut 15% of original paragraph length on average.

Input                 Length after Shortn   Paragraphs   People   Cost
Blog                  83%                   3            158      $4.57
Classic UIST Paper    87%                   7            264      $7.45
Draft UIST Paper      90%                   5            284      $7.47
Technical Writing     82%                   3            188      $4.84
Rambling E-mail       78%                   6            362      $9.72

Observations
- Focus on unnecessarily wordy phrases. Example passage (Blog): “But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.”
- Merged sentences when patches crossed sentence boundaries. Example passage (Classic UIST Paper): “The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper introduced the metaDESK along with two companion platforms, the transBOARD and ambientROOM.”
- Workers introduced style errors when not part of the community of practice. Example passage (Draft UIST Paper): “In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.”
- Parallelism can result in inconsistent changes. Example passage (Technical Writing): “FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key (the index bits) and the next 15 low order bits (the key fragment).”
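As a quick check on the per-document figures above, the following sketch totals paragraphs, people, and cost across the five Shortn inputs and derives a cost per paragraph. The numbers are copied from the slide. The variable names are illustrative, and cost is held in integer cents to avoid floating-point rounding.

```python
# (input, paragraphs, people, cost in cents), as reported for each Shortn run
shortn_runs = [
    ("Blog", 3, 158, 457),
    ("Classic UIST Paper", 7, 264, 745),
    ("Draft UIST Paper", 5, 284, 747),
    ("Technical Writing", 3, 188, 484),
    ("Rambling E-mail", 6, 362, 972),
]

total_paragraphs = sum(paras for _, paras, _, _ in shortn_runs)
total_people = sum(people for _, _, people, _ in shortn_runs)
total_cents = sum(cents for _, _, _, cents in shortn_runs)

for name, paras, people, cents in shortn_runs:
    print(f"{name}: ${cents / paras / 100:.2f} per paragraph, "
          f"{people / paras:.1f} people per paragraph")

cost_per_paragraph = total_cents / total_paragraphs / 100
people_per_paragraph = total_people / total_paragraphs
print(f"Overall: ${cost_per_paragraph:.2f} per paragraph, "
      f"{people_per_paragraph:.1f} people per paragraph")
```

Across all 24 paragraphs this comes to roughly $1.42 and about 52 people per paragraph.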
Results: Lag
Soylent posts task → worker accepts task: most time is spent waiting.
- Median 18.5 minutes (summed medians across Find, Fix and Verify)
- q1 = 8.3 minutes, q3 = 41.6 minutes
Worker accepts task → worker submits task: actual work time is very short.
- Median 2.0 minutes (summed medians across Find, Fix and Verify)
- q1 = 60 seconds, q3 = 3.6 minutes
Crowdproof
ESL (English as a Second Language): “However, while GUI made using computers be more intuitive and easier to learn, it didn’t let people be able to control computers efficiently. Masses only can use the software developed by software companies.”
Passes Word’s Grammar Checker: “Marketing are bad for brand big and small. You Know What I am Saying. It is no wondering that advertisings are bad for company in America, Chicago and Germany.”
Wikipedia: “DanduMonara (Flying Peacock, Wooden Peacock), The Flying machine able to fly. The King Ravana (Sri Lanka) built it. Accorinding to hindu believes in Ramayanaya King Ravana used "DanduMonara" for abduct queen Seetha from Rama. According to believes "DanduMonara" landed at Werangatota.”
Notes: “Blah blah blah—argument about whether there should be a standard “nosql storage” API to protect developers storing their stuff in proprietary services in the cloud. Probably unrealistic.”
UIST Draft: “Many of these problems vanish if we turn to a much older recording technology---text. When we enter text, each (pen or key) stroke is being used to record the actual information we care about---; none is wasted on application navigation or configuration.”
Crowdproof: Results
ESL (English as a Second Language), with Crowdproof’s corrections applied: “However, while GUI made using computers be more intuitive and easier to learn, it didn’t allow people to control computers efficiently. The masses can only use the software developed by software companies, unless they know how to write programs.”
- Crowdproof alone found 67% of errors.
- Crowdproof fixed 88% of the errors it found.
- Crowdproof and Word together found 82% of errors.
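The Crowdproof percentages above combine into two derived figures, worked through below. The 59% and 15-point numbers are computed here from the reported rates, not stated on the slide, and the variable names are illustrative.

```python
found_by_crowdproof = 0.67   # share of all errors Crowdproof found
fixed_given_found = 0.88     # share of found errors it also fixed
found_with_word = 0.82       # share found by Crowdproof and Word together

# Errors both found and fixed by Crowdproof, as a share of all errors.
fixed_overall = found_by_crowdproof * fixed_given_found

# Extra coverage gained by adding Word's checker on top of Crowdproof.
gain_from_word = found_with_word - found_by_crowdproof

print(f"Found and fixed by Crowdproof: {fixed_overall:.0%}")
print(f"Additional errors found with Word: {gain_from_word:.0%}")
```

So Crowdproof on its own both finds and repairs a little under 60% of all errors, and pairing it with Word's checker raises detection by about 15 percentage points.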
Human Macro
Find BibTeX: “Hi, please find the bibtex references for the 3 papers in brackets. You can located these by Google Scholar searches and clicking on bibtex.”
Find Creative Commons Figures: “Pick out keywords from the paragrah like Yosemite, rock, half dome, park. Go to a site which hsa CC licensed images […]”
Blog Feedback: “Please tell me how to make this paragraph communicate better. Say what's wrong, and what I can improve. Thanks!”
Tense Change: “Please change text in document from past tense to present tense”
Find and Format Addresses: “Please complete the addresses below to include all informtion needed as in example below. [...]”
Human Macro: Results
Find BibTeX: “Hi, please find the bibtex references for the 3 papers in brackets. You can located these by Google Scholar searches and clicking on bibtex.”
Input: Duncan and Watts [Duncan and watts HCOMP 09 anchoring] found that Turkers will do more work, but quality is no higher.
Output:
@conference {
  title={{Financial incentives […]}},
  author={Mason, W. and Watts, D.J.},
  booktitle={HCOMP ‘09},
  […]
}
The Human Macro executed requests perfectly 71% of the time.
Wizard of Turk: The New Wizard of Oz
Wizard of Oz prototyping is a tried-and-true technique in HCI and AI
- Put a human behind the curtain
- …until we understand how to engineer it
It’s now possible to wire a wizard permanently into an interactive system
- Fully deployable from day one
- AI vs. Turk is a cost / performance optimization
- Crowd contributions can provide training data
Soylent: A Word Processor with a Crowd Inside
- A new class of crowd-powered interfaces
- The Find-Fix-Verify design pattern
- Crowd Personas: The Lazy Turker and the Eager Beaver
Soylent soylent@csail.mit.edu
Effect of Price on Wait Time
Paying more had no effect on early arrivals, but sped up the latecomers.
Privacy, Legality, Ethics
Unknown third parties can see your document
- One solution: develop long-term relationships
- Or: enterprises hire wage workers under NDA
Who owns the edits to your document?
- Work-for-hire contract means the author retains rights
- Is this the correct model?
Taylorism and the Turker as API call
- Embed human-human contract ethics into your system
- Adjusting to minimum wage
Related Work
Powering novel interactions with the wisdom of crowds [ChaCha; Sala et al., Pervasive 2007; Bigham et al., UIST 2010]
Improving quality on Mechanical Turk [Kittur et al., CHI 2008; Heer et al., CHI 2010]
Artificial intelligence techniques for word processing
- Automatic proofreading [Kukich, CSUR 1992]
- Sentence compression [Clarke and Lapata, ACL 2006]
- AI for EUP [Cypher 1993]
Results: Cost
$0.08 per Find, $0.05 per Fix, and $0.04 per Verify
Average paragraph cost $1.41 to Shortn:
- $0.55 to Find an average of two patches
- $0.48 to Fix each patch
- $0.38 to Verify each patch
Lower bound with $0.01 per task: $0.30 per paragraph
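The cost breakdown above can be checked with a few lines of arithmetic. This sketch only sums the three reported stage costs and shows each stage's share of the paragraph total. The dictionary name is illustrative, and amounts are kept in integer cents so the sum is exact.

```python
# Average Shortn cost per paragraph by stage, in cents, as reported above
stage_cents = {"Find": 55, "Fix": 48, "Verify": 38}

total_cents = sum(stage_cents.values())
print(f"Total per paragraph: ${total_cents / 100:.2f}")

# Share of the per-paragraph cost attributable to each stage
shares = {stage: cents / total_cents for stage, cents in stage_cents.items()}
for stage, share in shares.items():
    print(f"{stage}: {share:.0%} of the cost")
```

The three stages sum to the stated $1.41, with Find taking the largest share (about 39%) and Verify the smallest (about 27%).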
Soylent: A Word Processor with a Crowd Inside
Soylent: A Word Processor with a Crowd Inside
Soylent: A Word Processor with a Crowd Inside
Soylent: A Word Processor with a Crowd Inside

More Related Content

Similar to Soylent: A Word Processor with a Crowd Inside

10 Tips For Best Writing In Exams. Online assignment writing service.
10 Tips For Best Writing In Exams. Online assignment writing service.10 Tips For Best Writing In Exams. Online assignment writing service.
10 Tips For Best Writing In Exams. Online assignment writing service.Tiffany Rose
 
Designing Narrative Content Workshop
Designing Narrative Content WorkshopDesigning Narrative Content Workshop
Designing Narrative Content WorkshopMartha Rotter
 
Cloud AI GenAI Overview.pptx
Cloud AI GenAI Overview.pptxCloud AI GenAI Overview.pptx
Cloud AI GenAI Overview.pptxSahithiGurlinka
 
Grammarly AI-NLP Club #2 - Recent advances in applied chatbot technology - Jo...
Grammarly AI-NLP Club #2 - Recent advances in applied chatbot technology - Jo...Grammarly AI-NLP Club #2 - Recent advances in applied chatbot technology - Jo...
Grammarly AI-NLP Club #2 - Recent advances in applied chatbot technology - Jo...Grammarly
 
Invasion of the dynamic language weenies
Invasion of the dynamic language weeniesInvasion of the dynamic language weenies
Invasion of the dynamic language weeniesSrijit Kumar Bhadra
 
Microblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyMicroblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyLeon Derczynski
 
Deep Learning from Scratch - Building with Python from First Principles.pdf
Deep Learning from Scratch - Building with Python from First Principles.pdfDeep Learning from Scratch - Building with Python from First Principles.pdf
Deep Learning from Scratch - Building with Python from First Principles.pdfYungSang1
 
Large Components in the Rearview Mirror
Large Components in the Rearview MirrorLarge Components in the Rearview Mirror
Large Components in the Rearview MirrorMichelle Brush
 
Module 8: Natural language processing Pt 1
Module 8:  Natural language processing Pt 1Module 8:  Natural language processing Pt 1
Module 8: Natural language processing Pt 1Sara Hooker
 
Un actor (model) per amico - Alessandro Melchiori - Codemotion Milan 2016
Un actor (model) per amico - Alessandro Melchiori - Codemotion Milan 2016Un actor (model) per amico - Alessandro Melchiori - Codemotion Milan 2016
Un actor (model) per amico - Alessandro Melchiori - Codemotion Milan 2016Codemotion
 
Seo automation using gpt 3 and transformer-based language models
Seo automation using gpt 3 and transformer-based language modelsSeo automation using gpt 3 and transformer-based language models
Seo automation using gpt 3 and transformer-based language modelsAndrea Volpini
 
How to tell a better story (in code)(final)
How to tell a better story (in code)(final)How to tell a better story (in code)(final)
How to tell a better story (in code)(final)Bonnie Pan
 
To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb...
To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb...To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb...
To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb...LavaConConference
 
CHAPTER 6FunctionsChapter Topics6.1 Introductio.docx
CHAPTER 6FunctionsChapter Topics6.1  Introductio.docxCHAPTER 6FunctionsChapter Topics6.1  Introductio.docx
CHAPTER 6FunctionsChapter Topics6.1 Introductio.docxrobertad6
 
The Four Principles Of Object Oriented Programming
The Four Principles Of Object Oriented ProgrammingThe Four Principles Of Object Oriented Programming
The Four Principles Of Object Oriented ProgrammingDiane Allen
 

Similar to Soylent: A Word Processor with a Crowd Inside (20)

10 Tips For Best Writing In Exams. Online assignment writing service.
10 Tips For Best Writing In Exams. Online assignment writing service.10 Tips For Best Writing In Exams. Online assignment writing service.
10 Tips For Best Writing In Exams. Online assignment writing service.
 
Designing Narrative Content Workshop
Designing Narrative Content WorkshopDesigning Narrative Content Workshop
Designing Narrative Content Workshop
 
Cloud AI GenAI Overview.pptx
Cloud AI GenAI Overview.pptxCloud AI GenAI Overview.pptx
Cloud AI GenAI Overview.pptx
 
Grammarly AI-NLP Club #2 - Recent advances in applied chatbot technology - Jo...
Grammarly AI-NLP Club #2 - Recent advances in applied chatbot technology - Jo...Grammarly AI-NLP Club #2 - Recent advances in applied chatbot technology - Jo...
Grammarly AI-NLP Club #2 - Recent advances in applied chatbot technology - Jo...
 
Invasion of the dynamic language weenies
Invasion of the dynamic language weeniesInvasion of the dynamic language weenies
Invasion of the dynamic language weenies
 
Microblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyMicroblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracy
 
Deep Learning from Scratch - Building with Python from First Principles.pdf
Deep Learning from Scratch - Building with Python from First Principles.pdfDeep Learning from Scratch - Building with Python from First Principles.pdf
Deep Learning from Scratch - Building with Python from First Principles.pdf
 
Large Components in the Rearview Mirror
Large Components in the Rearview MirrorLarge Components in the Rearview Mirror
Large Components in the Rearview Mirror
 
Module 8: Natural language processing Pt 1
Module 8:  Natural language processing Pt 1Module 8:  Natural language processing Pt 1
Module 8: Natural language processing Pt 1
 
A class action
A class actionA class action
A class action
 
Os Goodger
Os GoodgerOs Goodger
Os Goodger
 
Un actor (model) per amico - Alessandro Melchiori - Codemotion Milan 2016
Un actor (model) per amico - Alessandro Melchiori - Codemotion Milan 2016Un actor (model) per amico - Alessandro Melchiori - Codemotion Milan 2016
Un actor (model) per amico - Alessandro Melchiori - Codemotion Milan 2016
 
Seo automation using gpt 3 and transformer-based language models
Seo automation using gpt 3 and transformer-based language modelsSeo automation using gpt 3 and transformer-based language models
Seo automation using gpt 3 and transformer-based language models
 
How to tell a better story (in code)(final)
How to tell a better story (in code)(final)How to tell a better story (in code)(final)
How to tell a better story (in code)(final)
 
To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb...
To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb...To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb...
To Make Your Chatbot Smart, You Need to Feed It Right: How to Write for Chatb...
 
CHAPTER 6FunctionsChapter Topics6.1 Introductio.docx
CHAPTER 6FunctionsChapter Topics6.1  Introductio.docxCHAPTER 6FunctionsChapter Topics6.1  Introductio.docx
CHAPTER 6FunctionsChapter Topics6.1 Introductio.docx
 
Naming Things
Naming ThingsNaming Things
Naming Things
 
Flow based-1994
Flow based-1994Flow based-1994
Flow based-1994
 
Essay About Mothers.pdf
Essay About Mothers.pdfEssay About Mothers.pdf
Essay About Mothers.pdf
 
The Four Principles Of Object Oriented Programming
The Four Principles Of Object Oriented ProgrammingThe Four Principles Of Object Oriented Programming
The Four Principles Of Object Oriented Programming
 

Soylent: A Word Processor with a Crowd Inside

  • 1. Soylent: A Word Processor with a Crowd Inside. Michael Bernstein (msbernst@csail.mit.edu), Greg Little, Rob Miller, David Karger, David Crowell, Katrina Panovich (MIT CSAIL); Björn Hartmann (UC Berkeley); Mark Ackerman (University of Michigan). MIT Human-Computer Interaction.
  • 2. Shortening A Paper to Ten Pages 1) Do it yourself 2) Use an AI 3) Ask colleagues
  • 3. Shortening A Paper to Ten Pages 4) Recruit a crowd
  • 5. Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks.
  • 6. Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks. Soylent is people: Soylent’s core algorithms are human-powered. Amazon Mechanical Turk task listings: “Find Unnecessary Text” (Requester: Matt C., Reward: $0.01, Tasks Available: 7); “Shorten Rambling Text” (Requester: Gordon L., Reward: $0.04, Tasks Available: 12).
  • 7. Soylent is a word processing interface that uses crowd contributions to aid complex writing tasks. Find-Fix-Verify, a crowd control design pattern: Find a problem; Fix each problem; Verify the quality of each fix.
  • 8. Embed paid crowd workers in user interfaces to support cognition and manipulation tasks on demand
  • 12. Challenges in Programming Crowds. This project has interacted with ~9000 Turkers on ~2000 different tasks. Key problem: crowd workers often produce poor output on open-ended tasks. 30% Rule: ~30% of the results from open-ended tasks will be unsatisfactory. Outline: Introduction, Demo, Find-Fix-Verify, Evaluation, Discussion.
  • 13. Two Personas: An Example Proofread and correct the following paragraph: The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
  • 14. Two Personas: An Example. Proofread and correct the following paragraph: The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
  • 15. The Lazy Turker: does as little work as necessary to be paid. Input: The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
  • 16. The Lazy Turker: does as little work as necessary to be paid. Result (only the spelling of “comradeship” is fixed): The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradeship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
  • 17. The Eager Beaver: goes beyond task requirements to be helpful, but introduces errors in the process. Input: The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections during this story. This theme occurs during many circumstances but is not present from start to finish. In my mind for a theme to be pervasive is must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradship. But in my opinion there is only one theme that is present from beginning to end, this theme is pursuit of dreams.
  • 18. The Eager Beaver: goes beyond task requirements to be helpful, but introduces errors in the process. Result: The theme of loneliness features throughout many scenes in Of Mice and Men and is often the dominant theme of sections of this story. This theme occurs during many circumstances but is not present from start to finish. In my mind, for a theme to be pervasive it must be present during every element of the story. There are many themes that are present most of the way through such as sacrifice, friendship and comradeship. But in my opinion there is only one theme that is present from beginning to end: this theme is pursuit of dreams.
  • 19. The Eager Beaver: goes beyond task requirements to be helpful, but introduces errors in the process.
  • 20. Find-Fix-Verify. Programming crowds today is haphazard, similar to UI technology before design patterns like Model-View-Controller. Find-Fix-Verify is a design pattern for programming crowds to complete open-ended tasks.
  • 21. Find: “Identify at least one area that can be shortened without changing the meaning of the paragraph.” Independent agreement to identify patches. Fix: “Edit the highlighted section to shorten its length without changing the meaning of the paragraph.” Randomize order of suggestions. Verify: “Choose at least one rewrite that has style errors, and at least one rewrite that changes the meaning of the sentence.”
  • 22. Verify: “Choose at least one rewrite that has style errors, and at least one rewrite that changes the meaning of the sentence.” Keep suggestions that do not get voted out.
  • 23. Find-Fix-Verify Discussion. Why split Find and Fix? It forces Lazy Turkers to work on a problem of our choice, and it allows us to merge work completed in parallel. Why add Verify? Quality rises when we place Turkers in productive tension, and it allows us to trade off lag time against quality.
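The Find, Fix, and Verify stages on slides 21 to 23 can be sketched in code. The following is a minimal, illustrative Python sketch and not Soylent's implementation: crowd workers are stood in for by plain functions, and the names `find_fix_verify`, `min_agree`, `max_reject`, and `seed`, as well as the threshold values, are assumptions made for this example.

```python
import random
from collections import Counter
from typing import Callable, Sequence, Tuple

Span = Tuple[int, int]  # (start, end) character offsets into the text
Finder = Callable[[str], Sequence[Span]]                  # marks spans to shorten
Fixer = Callable[[str], str]                              # rewrites one span
Verifier = Callable[[str, Sequence[str]], Sequence[int]]  # indices of bad rewrites


def find_fix_verify(text, finders, fixers, verifiers,
                    min_agree=2, max_reject=0.5, seed=0):
    """Return {span: surviving rewrites}. Thresholds are assumed values."""
    rng = random.Random(seed)
    # Find: keep only spans that several workers marked independently.
    votes = Counter(span for find in finders for span in set(find(text)))
    patches = sorted(span for span, n in votes.items() if n >= min_agree)

    kept = {}
    for start, end in patches:
        original = text[start:end]
        # Fix: collect distinct rewrites; an unchanged span is discarded.
        candidates = sorted({fix(original) for fix in fixers} - {original})
        rng.shuffle(candidates)  # randomize the order shown to verifiers
        # Verify: separate workers vote against flawed rewrites by index.
        rejects = Counter(i for verify in verifiers
                          for i in set(verify(original, candidates)))
        limit = max_reject * max(len(verifiers), 1)
        kept[(start, end)] = [c for i, c in enumerate(candidates)
                              if rejects[i] < limit]
    return kept


# Simulated workers (hypothetical stand-ins for paid crowd tasks).
text = ("But in order to get people to pay, publishers "
        "are going to have to offer something different.")


def span_of(phrase):
    start = text.index(phrase)
    return (start, start + len(phrase))


wordy = span_of("are going to have to")
finders = [
    lambda t: [span_of("in order to"), wordy],
    lambda t: [wordy],
    lambda t: [wordy, span_of("something different")],
]
fixers = [
    lambda s: "must",
    lambda s: "have to",
    lambda s: s,      # Lazy Turker: returns the span unchanged
    lambda s: "may",  # a rewrite that changes the meaning
]


def flags_may(original, candidates):
    return [i for i, c in enumerate(candidates) if c == "may"]


verifiers = [flags_may, flags_may, lambda original, candidates: []]

result = find_fix_verify(text, finders, fixers, verifiers)
print(result)
```

In this toy run only the span that all three finders marked becomes a patch, and the meaning-changing rewrite is voted out by two of three verifiers, so the two shorter, faithful rewrites remain as suggestions.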
  • 24. Evaluation Goals. Is Soylent’s crowdsourced user interface approach feasible? (1) How high is the quality? (2) How long is the delay? (3) How much does it cost?
  • 25. Shortn: input texts. Blog: Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web. Classic UIST Paper: The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper introduced the metaDESK along with two companion platforms, the transBOARD and ambientROOM. Draft UIST Paper: In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool that uses lightweight text input to capture richly structured information for later retrieval and navigation. Technical Writing: Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key (the index bits) and the next 15 low order bits (the key fragment). Rambling E-mail: A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 26. Shortn: Cut 15% of original paragraph length on average. Blog – 83%: Print publishers are in a tizzy over Apple’s new iPad because they hope to finally be able to charge for their digital editions. But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web. Classic UIST Paper – 87%: The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESK along with and two companion platforms, the transBOARD and ambientROOM. Draft UIST Paper – 90%: In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications. We present WenSo, a tool that which uses lightweight text input to capture richly structured information for later retrieval and navigation. Technical Writing – 82%: Figure 3 shows the pseudocode that implements this design for Lookup. FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key (the index bits) and the next 15 low order bits (the key fragment). Rambling E-mail – 78%: A previous board member, Steve Burleigh, created our web site last year and gave me alot of ideas. For this year, I found a web site called eTeamZ that hosts web sites for sports groups. Check out our new page: […]
  • 27. Shortn: Blog – 83%, 3 para., 158 people, $4.57. Classic UIST Paper – 87%, 7 para., 264 people, $7.45. Draft UIST Paper – 90%, 5 para., 284 people, $7.47. Technical Writing – 82%, 3 para., 188 people, $4.84. Rambling E-mail – 78%, 6 para., 362 people, $9.72.
  • 28. Shortn: Focus on unnecessarily wordy phrases. Example: But in order to get people to pay for their magazine and newspaper apps, they are going to have to offer something different that readers cannot get at the newsstand or on the open Web.
  • 29. Shortn: Merged sentences when patches crossed sentence boundaries. Example: The metaDESK effort is part of the larger Tangible Bits project. The Tangible Bits vision paper, which introduced the metaDESK along with and two companion platforms, the transBOARD and ambientROOM.
  • 30. Shortn: Workers introduced style errors when not part of the community of practice. Example: In this paper we argue that it is possible and desirable to combine the easy input affordances of text with the powerful retrieval and visualization capabilities of graphical applications.
  • 31. Shortn: Parallelism can result in inconsistent changes. Example: FAWN-DS extracts two fields from the 160-bit key: the i low order bits of the key (the index bits) and the next 15 low order bits (the key fragment).
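To put the Shortn cost figures from slide 27 on a common scale, the short Python sketch below normalizes them per paragraph. The numbers are copied from the slide; the variable and function names are chosen for this example, and the "people" counts are summed as given, since the slides do not say whether workers overlap across texts.

```python
# Rows copied from slide 27: text -> (paragraphs, people, cost in USD).
SHORTN_RUNS = {
    "Blog": (3, 158, 4.57),
    "Classic UIST Paper": (7, 264, 7.45),
    "Draft UIST Paper": (5, 284, 7.47),
    "Technical Writing": (3, 188, 4.84),
    "Rambling E-mail": (6, 362, 9.72),
}


def per_paragraph(paragraphs, people, cost):
    """Return (people per paragraph, dollars per paragraph)."""
    return people / paragraphs, cost / paragraphs


for name, run in SHORTN_RUNS.items():
    people, dollars = per_paragraph(*run)
    print(f"{name:<20} {people:6.1f} people/para  ${dollars:.2f}/para")

# Totals across all five texts.
total_paragraphs = sum(p for p, _, _ in SHORTN_RUNS.values())
total_people = sum(n for _, n, _ in SHORTN_RUNS.values())
total_cost = sum(c for _, _, c in SHORTN_RUNS.values())
print(f"Overall: {total_people / total_paragraphs:.1f} people/para, "
      f"${total_cost / total_paragraphs:.2f}/para")
```

Across the 24 paragraphs this works out to roughly 52 people and about $1.42 per paragraph.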
  • 32. Results: Lag. Timeline: Soylent posts task, worker accepts task, worker submits task. Most time is spent waiting: median 18.5 minutes (summed medians across Find, Fix, and Verify; q1 = 8.3 minutes, q3 = 41.6 minutes). Actual work time is very short: median 2.0 minutes (summed medians across Find, Fix, and Verify; q1 = 60 seconds, q3 = 3.6 minutes).
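The lag figures above are described as summed medians across the Find, Fix, and Verify stages. A minimal Python sketch of that statistic follows; the timings in it are made-up placeholder values for illustration, not data from the talk, and `summed_medians` is a name chosen for this example.

```python
from statistics import median


def summed_medians(times_by_stage):
    """Sum the per-stage medians (one median per stage, then add them up)."""
    return sum(median(times) for times in times_by_stage.values())


# Hypothetical wait times in minutes for each stage (placeholder values).
wait_minutes = {
    "find": [2.0, 4.0, 9.0],
    "fix": [1.0, 6.0, 30.0],
    "verify": [3.0, 5.0, 7.0, 20.0],
}

total_wait = summed_medians(wait_minutes)
print(f"Summed median wait: {total_wait:.1f} minutes")
```

Note that a sum of per-stage medians is generally not the same as the median of end-to-end times, so it is best read as a typical-case estimate of total lag.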
  • 33. Crowdproof: input texts. ESL (English as a Second Language): However, while GUI made using computers be more intuitive and easier to learn, it didn’t let people be able to control computers efficiently. Masses only can use the software developed by software companies. Passes Word’s Grammar Checker: Marketing are bad for brand big and small. You Know What I am Saying. It is no wondering that advertisings are bad for company in America, Chicago and Germany. Wikipedia: DanduMonara (Flying Peacock, Wooden Peacock), The Flying machine able to fly. The King Ravana (Sri Lanka) built it. Accorinding to hindu believes in Ramayanaya King Ravana used "DanduMonara" for abduct queen Seetha from Rama. According to believes "DanduMonara" landed at Werangatota. Notes: Blah blah blah—argument about whether there should be a standard “nosql storage” API to protect developers storing their stuff in proprietary services in the cloud. Probably unrealistic. UIST Draft: Many of these problems vanish if we turn to a much older recording technology---text. When we enter text, each (pen or key) stroke is being used to record the actual information we care about---; none is wasted on application navigation or configuration.
  • 34. ESL: English as a Second Language (after Crowdproof) However, while GUI made using computers be more intuitive and easier to learn, it didn’t allow people to control computers efficiently. The masses can only use the software developed by software companies, unless they know how to write programs. Crowdproof alone found 67% of errors. Crowdproof fixed 88% of the errors it found. Crowdproof and Word together found 82% of errors.
  • 35. Find BibTeX: “Hi, please find the bibtex references for the 3 papers in brackets. You can located these by Google Scholar searches and clicking on bibtex.’’ Find Creative Commons Figures: “Pick out keywords from the paragrah like Yosemite, rock, half dome, park. Go to a site which hsa CC licensed images […]’’ Human Macro Blog Feedback: “Please tell me how to make this paragraph communicate better. Say what's wrong, and what I can improve. Thanks!’’ Tense Change: “Please change text in document from past tense to present tense” Find and Format Addresses: “Please complete the addresses below to include all informtion needed as in example below. [...]”
  • 36. Find BibTeX: “Hi, please find the bibtex references for the 3 papers in brackets. You can located these by Google Scholar searches and clicking on bibtex.’’ Duncan and Watts [Duncan and watts HCOMP 09 anchoring] found that Turkers will do more work, but quality is no higher. @conference { title={{Financial incentives […]}}, author={Mason, W. and Watts, D.J.}, booktitle={HCOMP ‘09}, […] } Human Macro The Human Macro executed requests perfectly 71% of the time.
  • 37. Introduction Demo Find-Fix-Verify Evaluation Discussion Soylent
  • 38. Wizard of Turk: The New Wizard of Oz. Wizard of Oz prototyping is a tried-and-true technique in HCI and AI: put a human behind the curtain … until we understand how to engineer it. It’s now possible to wire a wizard permanently into an interactive system: fully deployable from day one. AI vs. Turk is a cost / performance optimization. Crowd contributions can provide training data. Wizard of Oz Interface Wizard of Turk
  • 39. Soylent: A Word Processor with a Crowd Inside. A new class of crowd-powered interfaces. The Find-Fix-Verify design pattern. Crowd Personas: The Lazy Turker and the Eager Beaver
  • 42. Effect of Price on Wait Time. Paying more had no effect on early arrivals, but sped up the latecomers
  • 43. Privacy, Legality, Ethics. Unknown third parties can see your document. One solution: develop long-term relationships. Or: enterprises hire wage workers under NDA. Who owns the edits to your document? Work-for-hire contract means the author retains rights. Is this the correct model? Taylorism and the Turker as API call. Embed human-human contract ethics into your system. Adjusting to minimum wage
  • 44. Related Work. Powering novel interactions with the wisdom of crowds [ChaCha; Sala et al., Pervasive 2007; Bigham et al., UIST 2010]. Improving quality on Mechanical Turk [Kittur et al., CHI 2008; Heer et al., CHI 2010]. Artificial intelligence techniques for word processing: automatic proofreading [Kukich, CSUR 1992]; sentence compression [Clarke and Lapata, ACL 2006]; AI for EUP [Cypher 1993]
  • 45. Results: Cost. $0.08 per Find, $0.05 per Fix, and $0.04 per Verify. Average paragraph cost $1.41 to Shortn: $0.55 to Find an average of two patches, $0.48 to Fix each patch, $0.38 to Verify each patch. Lower bound with $0.01 per task: $0.30 per paragraph
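The cost figures on this slide can be checked with a few lines of arithmetic. The sketch below works in integer cents and confirms that the three stage costs add up to the $1.41 average. The "implied tasks" figure is only an assumption of this example (stage cost divided by per-task reward); the slide does not state task counts or how the $0.30 lower bound was derived.

```python
# All amounts in cents, copied from the cost slide.
stage_cost_cents = {"find": 55, "fix": 48, "verify": 38}
reward_cents = {"find": 8, "fix": 5, "verify": 4}

# The per-stage costs sum to the reported average paragraph cost.
total_cents = sum(stage_cost_cents.values())
print(f"Average paragraph cost: ${total_cents / 100:.2f}")

# Assumption (not stated on the slide): dividing each stage's cost by its
# per-task reward approximates the average number of paid tasks per stage.
implied_tasks = {
    stage: stage_cost_cents[stage] / reward_cents[stage]
    for stage in stage_cost_cents
}
for stage, count in implied_tasks.items():
    print(f"{stage}: about {count:.1f} tasks per paragraph")
```

Note that the three component costs only sum to $1.41 if the Fix and Verify figures are read as per-paragraph totals rather than per-patch prices.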

Editor's Notes

  1. Let's start with a scenario many of us are familiar with. We have a pretty strict page limit on a paper, and somewhere in these 7500 words we are just a few lines overlength. We've already spent time editing this paper. It's a pretty painful scenario. Move quickly through this or you’ll go overtime. Ask colleagues: "but they're doing CHI papers too!"
  2. We're introducing a fourth option with this work: a crowd. We want to recruit an entire crowd of helpers to support your writing process, to get 100 people to help you cut down your text. And what we're going to do is build it into the interface. (picture)
  3. We're introducing a fourth option with this work: a crowd. We want to recruit an entire crowd of helpers to support your writing process, to get 100 people to help you cut down your text. And what we're going to do is build it into the interface. (picture)
  4. In this talk I'm introducing Soylent, a word processor with a crowd inside. Soylent is a new kind of word processing interface that is powered by crowd contributions. Three major features: ...
  5. Uses TurKit
  6. This is hard because the crowd pulls in different directions and doesn’t always do the work you want it to do. We control it with a new design pattern we call Find-Fix-Verify.
  7. You hit a button, and some dude goes off and does something
  8. "we're just a few references overlength" --> "we're just a few lines overlength". Shortn: authors are bad at hacking their own text. Nobody hacks it in tiny bits. Say out loud that the user is specifying how "tall" the text should be. Rob's reason why the authors missed the error -- we're tired by page 7. Fresh eyeballs looking at a small bit of text.
  9. Go FAST
  10. You can reject their work
  11. They’re working on the same sentence, trying to fix the same underlying problem
  12. Go faster through this. “We evaluated Soylent, and we wanted to ask three questions: quality, delay, and cost.”
  13. Cut more than a page from a CHI paper without touching figures or refs, and just removing fat from the writing.
  14. Cost works out to $1.40 per paragraph
  15. Cost works out to $1.40 per paragraph
  16. Cost works out to $1.40 per paragraph
  17. Cost works out to $1.40 per paragraph
  18. Cost works out to $1.40 per paragraph
  19. "our next step is to find ways to reduce wait time"
  20. note that they are misspelled and still worked
  21. We think that Soylent is an example of a new class of user interfaces. (not, “and here’s the discussion!”) "There are AIs that do sentence compression like Shortn, but their biggest problem right now is lack of data. We can feed it that data, and eventually save the user money."
  22. Task: Crowdproof; input was the ESL first paragraph. Separated Find from Fix+Verify so that Fix always had the same patches. All tasks had at least 3 workers in the first 10 minutes. After about 15 minutes, arrival time starts looking exponential. So, you can get away with paying very little for interaction.
  23. Snooping on CHI papers right before the deadline. Add 9000 Turkers to the author list.