2. • We introduce new sampling methods for efficient cloud
data storage.
• Compared with MapReduce, our solution is more efficient
in terms of processing time.
• Refer to the link for the full paper:
https://ieeexplore.ieee.org/abstract/document/7172493
4. • The NDA/PDA methods exploit the
distribution characteristics of the data:
Normal distribution approximation (NDA) and
Poisson distribution approximation
(PDA).
• The algorithm is simple: create a
buffer zone Delta, which controls the
size of the data inserted from the original
datasets, as shown on the right.
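The buffer-zone idea can be illustrated with a minimal sketch. It is not the paper's implementation; it only shows one plausible reading under stated assumptions: records are read from the original dataset in buffers of at most `delta` items, a normal distribution is fitted to the buffered sample, and equal-probability cut points are derived from it. The names `buffered_chunks`, `nda_split_points`, `max_buffers`, and `n_parts` are hypothetical and do not come from the slides.

```python
from itertools import islice
from statistics import NormalDist, fmean, pstdev


def buffered_chunks(records, delta):
    """Yield successive lists of at most `delta` records.

    `delta` plays the role of the buffer zone size (an assumption
    about how Delta is used; the slides do not give the details).
    """
    it = iter(records)
    while True:
        chunk = list(islice(it, delta))
        if not chunk:
            return
        yield chunk


def nda_split_points(records, delta, max_buffers, n_parts):
    """Fit a normal distribution to the first `max_buffers` buffers,
    then return the n_parts - 1 cut points that separate regions of
    equal probability mass under that fitted distribution.
    """
    sample = []
    for i, chunk in enumerate(buffered_chunks(records, delta)):
        if i >= max_buffers:
            break
        sample.extend(chunk)
    # Population standard deviation of the buffered sample; it must be
    # non-zero, otherwise inv_cdf raises StatisticsError.
    dist = NormalDist(fmean(sample), pstdev(sample))
    return [dist.inv_cdf(k / n_parts) for k in range(1, n_parts)]


if __name__ == "__main__":
    values = list(range(1, 101))
    print(nda_split_points(values, delta=10, max_buffers=10, n_parts=4))
```

In this sketch a larger `delta` (or more buffers) gives a better estimate of the distribution at the cost of reading more of the original data. A Poisson variant would replace the fitted distribution; it is not shown here.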
7. Reference Paper:
“Splitting large medical data sets based on normal
distribution in cloud environment”, H Zhang, Y Zhao, C
Pang, J He, IEEE Transactions on Cloud Computing 8 (2),
518-531.
https://ieeexplore.ieee.org/abstract/document/7172493