Introduction to Mechanical Turk
Artificial Artificial Intelligence




AWS User Group Berlin
Thomas Metschke
25.03.2010

AWS User Group Berlin - Introduction To Amazon Mechanical Turk


Slides for a short introduction to Amazon Mechanical Turk, presented at the AWS User Group in Berlin on 2010/03/25.

Published in: Technology

Transcript of "AWS User Group Berlin - Introduction To Amazon Mechanical Turk"

  1. Introduction to Mechanical Turk: Artificial Artificial Intelligence
     AWS User Group Berlin, Thomas Metschke, 25.03.2010, Peritor GmbH
  2. Amazon Mechanical Turk is a marketplace for work.
  3. Mechanical Turk Marketplace
     • 400,000+ Workers
     • In 100+ Countries
     • Available 24/7
     • Programmatically Accessible
     Image: http://www.flickr.com/photos/diamond_rain/2543837414/
  4. So there are basically: Workers and Requesters
     Images: http://www.flickr.com/photos/saad/1968774 and http://www.flickr.com/photos/chicagobart/4181533461
  5. Mechanical Turk as a Worker
     Workers
     • Make money by working on Human Intelligence Tasks (HITs)
     • Can work from home and choose their own work hours
     Image: http://www.flickr.com/photos/saad/1968774
  6. Your Dashboard
  7. Your Dashboard: The number of available tasks.
  8. Your Dashboard: Total Earnings and Bonuses.
  9. Your Dashboard: HIT Status and Totals.
  10. How do I get the money?
      • U.S. Bank account
      • Amazon Gift Certificate
      • Bank Checks in Rupees
  11. Mechanical Turk as a Requester
      Requesters
      • Have access to a global, on-demand, 24 x 7 workforce
      • Can get thousands of HITs completed in minutes
      • Pay only when they are satisfied with the results
      Image: http://www.flickr.com/photos/chicagobart/4181533461
  12. Requesting HITs
      Requesters
      • define and create your HITs
      • load HITs to Mechanical Turk
      Workers
      • work on your HITs
      • submit results
      Requesters
      • approve and pay for completed HITs
      • use the results
  13. Design HITs
      • Enter Properties
      • Design Layout
  14. Design HITs - faster
      Take a developer and use
      • CSV files
      • SOAP / REST or the Amazon Mechanical Turk developer tools
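The CSV approach on this slide can be sketched roughly as follows: one row per item, one question text per row. This is only an illustration; the column name `image_url`, the question wording, and the helper `hits_from_csv` are hypothetical and are not taken from the slides or from the Mechanical Turk tools.

```python
import csv
import io
from string import Template

# Hypothetical question template; ${image_url} is filled in from each CSV row.
QUESTION_TEMPLATE = Template("What kind of dress is shown at ${image_url}?")


def hits_from_csv(csv_text):
    """Return one question string per CSV row (the header row names the fields)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [QUESTION_TEMPLATE.substitute(row) for row in reader]


if __name__ == "__main__":
    sample = "image_url\nhttp://example.com/a.jpg\nhttp://example.com/b.jpg\n"
    for question in hits_from_csv(sample):
        print(question)
```

Each resulting string would then be wrapped in whatever question format the chosen submission path expects.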
  15. What would it look like
      http://mechanicalturk.amazonaws.com/
        ?Service=AWSMechanicalTurkRequester
        &AWSAccessKeyId=[the Requester's Access Key ID]
        &Version=2008-08-02
        &Operation=CreateHIT
        &Signature=[signature for this request]
        &Timestamp=[your system's local time]
        &Title=Location%20and%20Photograph%20Identification
        &Description=Select%20the%20image%20that%20best%20represents
        &Reward.1.Amount=5
        &Reward.1.CurrencyCode=USD
        &Question=[URL-encoded question data]
        &AssignmentDurationInSeconds=30
        &LifetimeInSeconds=604800
        &Keywords=location,%20photograph,%20image,%20identification,%20opinion
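A request like the one on this slide could be assembled with the standard library along these lines. This is a sketch only: the helper name `build_create_hit_url` is made up here, the signature is passed in as an opaque value rather than computed, and the endpoint and `Version=2008-08-02` are copied from the slide and may no longer be served.

```python
from urllib.parse import parse_qs, quote, urlencode, urlsplit

# Endpoint exactly as shown on the slide (2010).
ENDPOINT = "http://mechanicalturk.amazonaws.com/"


def build_create_hit_url(access_key_id, signature, timestamp, question_data):
    """Assemble the CreateHIT query string from the slide's parameters.

    The signature is supplied by the caller; computing it is out of scope here.
    """
    params = [
        ("Service", "AWSMechanicalTurkRequester"),
        ("AWSAccessKeyId", access_key_id),
        ("Version", "2008-08-02"),
        ("Operation", "CreateHIT"),
        ("Signature", signature),
        ("Timestamp", timestamp),
        ("Title", "Location and Photograph Identification"),
        ("Description", "Select the image that best represents"),
        ("Reward.1.Amount", "5"),
        ("Reward.1.CurrencyCode", "USD"),
        ("Question", question_data),
        ("AssignmentDurationInSeconds", "30"),
        ("LifetimeInSeconds", "604800"),  # 7 days * 24 h * 3600 s
        ("Keywords", "location, photograph, image, identification, opinion"),
    ]
    # quote_via=quote encodes spaces as %20 (as on the slide) instead of "+".
    return ENDPOINT + "?" + urlencode(params, quote_via=quote)


if __name__ == "__main__":
    print(build_create_hit_url("ACCESS_KEY_ID", "SIGNATURE",
                               "2010-03-25T12:00:00Z", "<QuestionForm/>"))
```

Unlike the slide, `urlencode` also percent-encodes the commas in `Keywords`; both forms decode to the same value.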
  16. Publish HITs
      • HITs have to be paid in advance
      • Amazon takes 10% on top
      Diagram labels: credit card, debit card, U.S. bank account, Amazon Payments account
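The prepayment on this slide is simple arithmetic: total rewards plus the 10% commission. A small sketch, assuming the commission is applied to the sum of rewards and rounded to cents (the function name and the rounding rule are assumptions; the 10% rate is the 2010 figure from the slide):

```python
from decimal import ROUND_HALF_UP, Decimal

# Commission rate as quoted on the slide (2010); treat it as an assumption today.
FEE_RATE = Decimal("0.10")


def prepaid_cost(reward, assignments, fee_rate=FEE_RATE):
    """Return the amount to prepay: rewards for all assignments plus the fee."""
    rewards_total = Decimal(str(reward)) * assignments
    fee = (rewards_total * fee_rate).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)
    return rewards_total + fee


if __name__ == "__main__":
    # 1,000 assignments at USD 0.05 each: 50.00 in rewards + 5.00 fee
    print(prepaid_cost("0.05", 1000))
```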
  17. Use Mechanical Turk for
      • Work that requires Human Judgment
      • Work that algorithms cannot completely solve
      • Work that has unpredictable or spiky volume
  18. Improving Data Quality
      Background
      • Data is the company’s business
      • Accuracy and breadth are key to differentiation
      Process
      • 1 million data points to ingest each day
      • 200 data sources
      Problem
      • Data needs to be normalized, enhanced and de-duplicated
      • Algorithms could get the data only about 70% clean
      Example HIT: Are these two businesses the same? (YES / NO)
      • Peritor GmbH, Blücherstraße 22, 10961 Berlin, http://peritor.com
      • Peritor Consulting, Blücherstraße 22, Hof III Aufgang 6, 10961 Berlin
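One way to read this slide is as a triage: let an algorithm settle the clear cases and send only the ambiguous pairs to workers. The sketch below illustrates that idea with `difflib` string similarity; the thresholds, the function name `triage_pair`, and the choice of similarity measure are all assumptions, not the process described in the talk.

```python
from difflib import SequenceMatcher

# Hypothetical cut-offs; real values would have to be tuned on labelled data.
SAME_THRESHOLD = 0.9
DIFFERENT_THRESHOLD = 0.3


def triage_pair(name_a, name_b):
    """Classify a pair of business names, or defer the decision to a worker."""
    score = SequenceMatcher(None, name_a.lower(), name_b.lower()).ratio()
    if score >= SAME_THRESHOLD:
        return "same"
    if score <= DIFFERENT_THRESHOLD:
        return "different"
    return "ask_worker"  # ambiguous: publish as a YES/NO HIT


if __name__ == "__main__":
    print(triage_pair("Peritor GmbH", "Peritor Consulting"))
```

With these cut-offs the slide's own example pair falls in the ambiguous band, which is the kind of case a YES/NO HIT is meant to settle.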
  19. Moderating User Generated Content
      Background
      • User generated content is a key part of a web 2.0 experience
      Process
      • Millions of photos uploaded every day
      Problem
      • Need to ensure user generated content meets site guidelines
      Example HIT: Is this image explicit? (YES / NO)
      Image: http://www.flickr.com/photos/cmak/1521356521/
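For YES/NO questions like this one, a requester might ask several workers and keep the majority answer. The slides do not describe this step, so the sketch below is an assumption, including the `min_agreement` threshold and the function name.

```python
from collections import Counter


def majority_answer(answers, min_agreement=0.6):
    """Return the most common answer if enough workers agree, else None."""
    if not answers:
        return None
    answer, votes = Counter(answers).most_common(1)[0]
    if votes / len(answers) >= min_agreement:
        return answer
    return None  # no clear majority: collect more answers or review manually


if __name__ == "__main__":
    print(majority_answer(["YES", "YES", "NO"]))
```

Asking more workers per item raises cost and turnaround time, which is the trade-off named on the "Price, Accuracy, Speed" slide.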
  20. Categorization
      Background
      • Consumers need to be able to quickly find a product when shopping online
      The Business Process
      • Millions of new products are introduced every day
      • Products are sourced from hundreds of merchants and manufacturers, each with their own taxonomy
      Problem
      • Need to properly categorize new products quickly in order to monetize
      Example HIT: What kind of dress is this? (Cocktail / Bridal dress)
      Image: http://www.flickr.com/photos/34801476@N00/296743627/
  21. Optimizing your HITs for Price, Accuracy, Speed
  22. Check it out!
      http://mturk.com
      http://turkers.proboards.com
  23. Thank you for your attention
      Peritor GmbH
      Blücherstr. 22, Hof III Aufgang 6
      10961 Berlin
      Tel.: +49 (0)30 69 20 09 84 0
      Fax: +49 (0)30 69 20 09 84 9
      Internet: www.peritor.com
      E-Mail: info@peritor.com
      © Peritor GmbH - All rights reserved