More Related Content
Similar to 데이터 라벨링 노가다는 이제 그만 - Amazon Sagemaker Ground Truth :: 소성운 - AWS Community Day 2019 (20)
More from AWSKRUG - AWS한국사용자모임 (20)
데이터 라벨링 노가다는 이제 그만 - Amazon Sagemaker Ground Truth :: 소성운 - AWS Community Day 2019
- 1. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon SageMaker Ground Truth
Data Scientist
- 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Sagemaker
Amazon Sagemaker Ground Truth
Amazon Sagemaker Ground Truth
- 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
(Yan So)
AWSKRUG #datascience
E: 13imso@gmail.com
L: https://www.linkedin.com/in/yanso
- 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Sagemaker
Build
Train
Tune
Deploy
● Jupyter
● Tensorflow, mxnet, Pytorch, Glueon
●
●
●
●
●
●
●
●
●
●
● API
●
●
- 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Sagemaker Ground Truth
- 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
?
,
- 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
?
- 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
?
/
- 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
?
/
ML Model
RGB
- 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
?
/
: AWSKRUG Hands-on Lab 2018 -
https://github.com/yansonz/2018-handson-data-02
- 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
?
/
/
: AWSKRUG Hands-on Lab 2018 -
https://github.com/yansonz/2018-handson-data-02
- 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
?
ML
Imagenet: 14M , 22K
Microsoft COCO: 330K , 80
MNIST: 70K , 10
Open Images Datasets: 9M URL , 5K
CIFAR-10: 60K , 10
Fashion-MNIST: 70K , 10
ML
- 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Label Build Train Tune Deploy
- 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Bounding Boxes Image Classification Semantic Segmentation
Text Classification Custom Tasks
- 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Active learning Auto Data Labeling
Input datasets
Human Labeling
Large datasets
- 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Vendors
A curated list of third-party
vendors that specialize in
providing data labeling services,
available via the AWS Marketplace
( )
Private
A team of workers that you have
sourced yourself, including
your own employees or contractors
for handling data that needs to stay
within your organization
Public
An on-demand 24 x7 workforce
of over 500,000 independent
Contractors worldwide, powered
by Amazon Mechanical Turn
- 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
- 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
( )
- 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
( )
- 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
( )
- 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
( )
- 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
( )
- 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
( )
- 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
( )
- 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Active Learning Auto Data Labeling
Input datasets
Human Labeling
Large datasets
- 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Labeling Consolidation: Major Voting
bulldog sharpei bulldogbulldog
bulldog (3/4)
- 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Labeling Consolidation: Probabilities
bulldog sharpei bulldogbulldog bulldog 0.1
sharpei 0.9
Probabilities of
correct labels
- 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Active Learning Auto Data Labeling
Input datasets
Human Labeling
Large datasets
- 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Active Learning Auto Data Labeling
Input datasets
Labeling +
Consolidation
Labeled datasets
Active Learning
Auto Labeling
- 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
- 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
- 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Label Build Train Tune Deploy
- 33. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
https://aws.amazon.com/sagemaker/groundtruth/
https://aws.amazon.com/blogs/aws/amazon-sagemaker-ground-truth-
build-highly-accurate-datasets-and-reduce-labeling-costs-by-up-to-70/
https://docs.aws.amazon.com/sagemaker/latest/dg/sms.html
https://aws.amazon.com/sagemaker/groundtruth/pricing/
https://github.com/awslabs/amazon-sagemaker-examples/tree/master/
ground_truth_labeling_jobs
- 34. Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Scientist