• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
AWS User Group Berlin - Introduction To Amazon Mechanical Turk

AWS User Group Berlin - Introduction To Amazon Mechanical Turk



SIldes for a short introduction to Amazon mechanical Turk on the AWS User Group on 2010/03/25 in Berlin.

SIldes for a short introduction to Amazon mechanical Turk on the AWS User Group on 2010/03/25 in Berlin.



Total Views
Views on SlideShare
Embed Views



8 Embeds 165

http://redbug0314.blogspot.com 100
http://redbug0314.blogspot.tw 53
http://www.slideshare.net 3
http://redbug0314.blogspot.ca 3
http://webcache.googleusercontent.com 2
http://redbug0314.blogspot.sg 2
http://redbug0314.blogspot.hk 1
http://redbug0314.blogspot.de 1



Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
Post Comment
Edit your comment

    AWS User Group Berlin - Introduction To Amazon Mechanical Turk AWS User Group Berlin - Introduction To Amazon Mechanical Turk Presentation Transcript

    • Introduction to Mechanical Turk Artificial Artificial Intelligence AWS User Group Berlin Thomas Metschke 25.03.2010 Peritor GmbH
    • Amazon Mechanical Turk is a marketplace for work. 2
    • Mechanical Turk Marketplace  400,000+ Workers  In 100+ Countries  Available 24/7  Programmatically Accessible http://www.flickr.com/photos/diamond_rain/2543837414/ 3
    • So there are basically Workers Requesters http://www.flickr.com/photos/saad/1968774 http://www.flickr.com/photos/chicagobart/4181533461 4
    • Mechanical Turk as a Worker Workers  Make money by working on Human Intelligence Tasks  Workers can work from home and choose their own work hours http://www.flickr.com/photos/saad/1968774 5
    • Your Dashboard 6
    • Your Dashboard The number of available tasks. 7
    • Your Dashboard Total Earnings and Bonuses. 8
    • Your Dashboard HIT Status and Totals. 9
    • How do I get the money? Amazon Bank U.S. Bank Gift Checks in account Certificate Rupees 10
    • Mechanical Turk as a Requester Requesters  Have access to a global, on-demand, 24 x 7 workforce  Can get thousands of HITs completed in minutes  Pay only when they are satisfied with the results http://www.flickr.com/photos/chicagobart/4181533461 11
    • Requesting HITs Requesters Workers Requesters • define and create • work on your • approve and pay your HITs HITs for completed • load HITs to • submit results HITs Mechanical Turk • use the results 12
    • Design HITs  Enter Properties  Design Layout 13
    • Design HITs - faster Take developer and use CSV files SOAP / REST or Amazon Mechanical Turk developer tools 14
    • What would it look like http://mechanicalturk.amazonaws.com/ ?Service=AWSMechanicalTurkRequester &AWSAccessKeyId=[the Requester's Access Key ID] &Version=2008-08-02 &Operation=CreateHIT &Signature=[signature for this request] &Timestamp=[your system's local time] &Title=Location%20and%20Photograph%20Identification &Description=Select%20the%20image%20that%20best%20represents &Reward.1.Amount=5 &Reward.1.CurrencyCode=USD &Question=[URL-encoded question data] &AssignmentDurationInSeconds=30 &LifetimeInSeconds=604800 &Keywords=location,%20photograph,%20image,%20identification,%20opinion 15
    • Publish HITs credit card debit card  HITs have to be paid in advance Amazon  Amazon takes 10% on top U.S. bank Payments account account 16
    • Use Mechanical Turk for  Work that requires Human Judgment  Work that algorithms cannot completely solve  Work that has unpredictable or spiky volume 17
    • Improving Data Quality Background Are these two  Data is the company’s business businesses the same?  Accuracy and breadth are key to differentiation Process Peritor GmbH Peritor Consulting  1 MM data points to ingest each day Blücherstraße 22 Blücherstraße 22  200 data sources 10961 Berlin Hof III Aufgang 6 http://peritor.com 10961 Berlin Problem  Data needs to be normalized, enhanced and de-dupped  Algorithms could get data about 70% YES NO clean 18
    • Moderating User Generated Content Is this image explicit? Background  User generated content is a key part of a web 2.0 experience Process  Millions of photos uploaded every day Problem  Need to ensure user generated http://www.flickr.com/photos/cmak/1521356521/ content meets site guidelines YES NO 19
    • Categorization Background What kind of dress is  Consumers need to be able to this? quickly find a product when shopping online The Business Process  Millions of new products are introduced everyday  Products are sourced from hundreds of merchants and manufacturers, http://www.flickr.com/photos/34801476@N00/296743627/ each with their own taxonomy Cocktail Problem  Need to properly categorize new Bridal dress products quickly in order to monetize 20
    • Optimizing your HITs for Price Accuracy Speed 21
    • Check it out! http://mturk.com http://turkers.proboards.com 22
    • Thank you for your attention Peritor GmbH Blücherstr. 22, Hof III Aufgang 6 10961 Berlin Tel.: +49 (0)30 69 20 09 84 0 Fax: +49 (0)30 69 20 09 84 9 Internet: www.peritor.com E-Mail: info@peritor.com © Peritor GmbH - Alle Rechte vorbehalten