19. Sample Problem
Count the occurrences of each letter:
Water is a transparent and nearly colorless chemical
substance that is the main constituent of Earth's
streams, lakes, and oceans, and the fluids of most living
organisms. Its chemical formula is H2O, meaning that its
molecule contains one oxygen and two hydrogen atoms,
that are connected by covalent bonds …
20. Sample Map
map("Water is a transparent and nearly colorless
chemical substance …")
⇒ [(w, 1), (a, 1), (t, 1), (e, 1), (r, 1), (i, 1),
(s, 1) …]
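A minimal sketch of the map step shown above, in Python. The function name `map_letters` is hypothetical, and lowercasing letters while skipping non-alphabetic characters is an assumption based on the sample output, which starts with a lowercase `w`.

```python
def map_letters(text):
    """Emit a (letter, 1) pair for every alphabetic character, lowercased."""
    # Assumption: spaces, digits and punctuation are skipped.
    return [(ch.lower(), 1) for ch in text if ch.isalpha()]


pairs = map_letters("Water is a transparent and nearly colorless chemical substance")
print(pairs[:7])
# → [('w', 1), ('a', 1), ('t', 1), ('e', 1), ('r', 1), ('i', 1), ('s', 1)]
```

Each pair carries a count of 1; adding up the counts per letter is left to a later step.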
23. Dividing Data
Water is a transparent and nearly colorless chemical
substance that is the main constituent of Earth's
streams, lakes, and oceans, and the fluids of most living
organisms. Its chemical formula is H2O, meaning that its
molecule contains one oxygen and two hydrogen atoms,
that are connected by covalent bonds …
24. Dividing Data
1. Water is a transparent and nearly colorless chemical
substance that is the main constituent of Earth's
streams, lakes, and oceans, and the fluids of most
living organisms.
2. Its chemical formula is H2O, meaning that its molecule
contains one oxygen and two hydrogen atoms, that
are connected by covalent bonds
3. …
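The division above can be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names (`split_sentences`, `count_letters`, `merge`) are hypothetical, the split on ". " is deliberately naive, and the chunks are processed one after another in a single process, standing in for separate nodes.

```python
from collections import Counter


def split_sentences(text):
    """Divide the input into sentence-sized chunks (naive split on '. ')."""
    return [s.strip() for s in text.split(". ") if s.strip()]


def count_letters(chunk):
    """Count the letters in one chunk, independently of all other chunks."""
    return Counter(ch.lower() for ch in chunk if ch.isalpha())


def merge(partial_counts):
    """Combine the per-chunk counts into one total."""
    total = Counter()
    for counts in partial_counts:
        total.update(counts)
    return total


text = ("Water is a transparent and nearly colorless chemical substance. "
        "Its chemical formula is H2O.")
chunks = split_sentences(text)                   # 1. divide the data
partials = [count_letters(c) for c in chunks]    # 2. process each chunk on its own
total = merge(partials)                          # 3. combine the partial results
print(total.most_common(3))
```

Because each chunk is counted without looking at any other chunk, a lost partial result can be recomputed from its chunk alone, and the merged total is the same as counting the undivided text.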
28. In Summary
- Big Data Analysis ≈ Cloud Computing ≈
Distributed Computing
- Big Data is a technology for saving money
- Dividing data and guaranteeing fault
tolerance are the key factors of Big Data
- It is hard to analyze highly coupled data (e.g.
graphs), so ML across multiple nodes remains difficult