More Related Content
Similar to Hadoopを業務で使ってみた
Similar to Hadoopを業務で使ってみた (20)
More from Tatsuya Sasaki (8)
Hadoopを業務で使ってみた
- 19. Hive
• (each do ... end)
• Hive DB,
• (HiveQL)
• MySQL
EXISTS
…
- 29. Map
2 aaa
0 bbb
key 1 ccc value
1 ddd
0 eee
- 30. Map
2 aaa
0 bbb
1 ccc Reducer
1 ddd
0 eee
- 33. Map
2 aaa
0 bbb
1 ccc
1 ddd
0 eee
Reducer
1
※
- 35. 2 aaa
0 bbb
1 ccc
1 ddd
0 eee
- 36. 2 aaa
0 bbb
1 ccc
1 ddd
0 eee
- 37. 2 aaa
0 bbb
1 ccc
1 ddd
0 eee
key Reducer
- 44. EC2 S3
Amazon
• EC2 •••
※
• S3 •••
- 47. 1. (CSV or
Marshal) S3
2. EC2 Hadoop 1.
S3
3. S3 2.
MySQL
- 48. DB
1. (CSV or
Marshal) S3
2. EC2 Hadoop 1.
S3
3. S3 2.
MySQL
- 49. Hadoop
1. (CSV or
Marshal) S3
2. EC2 Hadoop 1.
S3
3. S3 2.
MySQL
- 50. DB
1. (CSV or
Marshal) S3
2. EC2 Hadoop 1.
S3
3. S3 2.
MySQL
- 56. •
• Mapper, Reducer
•
- 57. Hadoop S3
`hadoop dfs -cat s3://xxx/
input/user_info`
- 62. • Hadoop
• MapReduce
MapReduce
• Hadoop