SlideShare a Scribd company logo
Performance Improvement and Analysis for Data
       Privacy of Cloud Computing Documents
                                  指導教授:葉生正
                                     作者:林俊淵
                                   報告者:劉孝皇
Outline
•   緒論
•   文獻探討
•   研究方法
•   實驗環境與數據分析
•   結論及未來展望
緒論 - Cloud Computing
•   Cloud Computing Technologies, use virtualization and
    automation technologies to create and popularize
    computer in a variety of computing resources
•   Cloud Computing Services, obtaining services from a
    remote network connection

                      Software-as-a-Service (SaaS)
                                          SaaS


                      Platform-as-a-Service (PaaS)
                                          PaaS


                    Infrastructure-as-a-Service (IaaS)
                                           IaaS
Cloud computing - Definition
Cloud Collaboration
Data Outsourcing
•   Security issues
    •   Confidentiality, (data encryption and access control)
    •   Integrity, (hashing and CRC)
研究動機
研究目的
•   The document which is large scale but needs to edit few
    parts

•   When modified, it does not have to encrypt the whole
    document but only the modified parts
文獻探討 (1/2)
•   Cong Wang (2010), ”Privacy-Preserving Public Auditing for
    Data Storage Security in Cloud Computing”, INFOCOM, pp. 1-9.
文獻探討 (2/2)
•   J. Hur and D. Noh, "Attribute-Based Access Control with Efficient Revocation in Data
    Outsourcing Systems," IEEE Transactions on Parallel and Distributed Systems, 2010.
•   Q. Liu, et al., "Efficient Sharing of Secure Cloud Storage Services," , IEEE
    International Conference on Computer and Information Technology, 2010, pp. 922-
    929.
•   S. Ning, et al., "A privacy-preserving approach to policy-based content dissemination,"
    in Data Engineering (ICDE), 2010, pp. 944-955.
Red-Black tree
•   Red-Black tree was invented in 1972 by Rudolf Bayer. It
    is a type of self-balancing binary search tree
                                    Operation   Time complexity in worse case

                                     Search               O(log n)

                                      Insert              O(log n)

                                     Delete               O(log n)
•   Properties
    •   Every node is either red or black
    •   The root is black
    •   Every leaf is black
    •   If a node is red, both its children are black
    •   Each path from the root to a leaf contains the same number of
        black nodes
研究方法
Distribution




Access
Rebuilding
                        (p,{align:center};si:1;ei:9),(b;si:1;ei:4),

              Content                                       Stylesheet
 Encryption
Update TreeMap ~Insertion~




                             Tree Node {
                               int : key
                               StringBuilder : content
                             }
Update TreeMap ~Insertion~
cont.
   Step. 1




   Step. 2




   Step. 3



   Step. 4
Update TreeMap ~ Remove~




                           Tree Node {
                             int : key
                             StringBuilder : content
                           }
Update TreeMap ~ Remove~
cont.
   Step. 1




   Step. 2




   Step. 3



   Step. 4
Environment
•   Windows 7 64bit
    •   Intel Core 2 Quad CPU Q8200 @2.33Ghz
    •   4.00 RAM

•   Windows XP 32bit (Run on VM)
    •   Core 2 @2.33Ghz
    •   2.00GB RAM

•   Programming language : Java (JDK 1.6)
3DES & AES Encryption/Decryption time
Document size   1MB     4MB 8MB 12MB
                       3DES Encrypt    187.4   618.6 1131.4 1721.8
Random Insert/Remove   AES Encrypt     112.8   327.8 628     915.8
Processing time
Processing time
Future work
•   In this study, although it pays the extra cost of establish and
    maintain the red-black tree, our method is more efficient if a
    large document was modified a small part
•   Improve the efficiency
    •   3DES is 31.04%
    •   AES is 23.94%

•   In the future, I want to improve this study and develop a
    collaboration service

More Related Content

What's hot

Cavity Data
Cavity DataCavity Data
Cavity Data
Alan Dean
 
Hadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridEvert Lammerts
 
Session 09 learning relationships.pptx
Session 09 learning relationships.pptxSession 09 learning relationships.pptx
Session 09 learning relationships.pptx
bodaceacat
 
MongoDB IoT City Tour STUTTGART: Managing the Database Complexity, by Arthur ...
MongoDB IoT City Tour STUTTGART: Managing the Database Complexity, by Arthur ...MongoDB IoT City Tour STUTTGART: Managing the Database Complexity, by Arthur ...
MongoDB IoT City Tour STUTTGART: Managing the Database Complexity, by Arthur ...
MongoDB
 
Intro to hadoop ecosystem
Intro to hadoop ecosystemIntro to hadoop ecosystem
Intro to hadoop ecosystem
Grzegorz Kolpuc
 
ORDS, research data network
ORDS, research data networkORDS, research data network
ORDS, research data network
Jisc RDM
 
Optimization Techniques
Optimization TechniquesOptimization Techniques
Optimization Techniques
Joud Khattab
 
Introduction To Using TensorFlow & Deep Learning
Introduction To Using TensorFlow & Deep LearningIntroduction To Using TensorFlow & Deep Learning
Introduction To Using TensorFlow & Deep Learning
ali alemi
 
Mongo db nosql (1)
Mongo db nosql (1)Mongo db nosql (1)
Mongo db nosql (1)
Bhavesh Sarvaiya
 
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
Deltares
 
Squeeeze models
Squeeeze modelsSqueeeze models
Squeeeze models
Dong Heon Cho
 
WoSC19: Serverless Workflows for Indexing Large Scientific Data
WoSC19: Serverless Workflows for Indexing Large Scientific DataWoSC19: Serverless Workflows for Indexing Large Scientific Data
WoSC19: Serverless Workflows for Indexing Large Scientific Data
University of Chicago
 
3 python packages
3 python packages3 python packages
3 python packages
FEG
 
Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)
Robert Grossman
 
Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...The HDF-EOS Tools and Information Center
 
Finding Association Rules in Linked Data
Finding Association Rules in Linked DataFinding Association Rules in Linked Data
Finding Association Rules in Linked Data
Reza Ramezani
 
Provenance state-of-art
Provenance state-of-artProvenance state-of-art
Provenance state-of-art
vty
 
An Introduction to Neural Architecture Search
An Introduction to Neural Architecture SearchAn Introduction to Neural Architecture Search
An Introduction to Neural Architecture Search
Bill Liu
 

What's hot (20)

Cavity Data
Cavity DataCavity Data
Cavity Data
 
Hadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG Grid
 
Session 09 learning relationships.pptx
Session 09 learning relationships.pptxSession 09 learning relationships.pptx
Session 09 learning relationships.pptx
 
MongoDB IoT City Tour STUTTGART: Managing the Database Complexity, by Arthur ...
MongoDB IoT City Tour STUTTGART: Managing the Database Complexity, by Arthur ...MongoDB IoT City Tour STUTTGART: Managing the Database Complexity, by Arthur ...
MongoDB IoT City Tour STUTTGART: Managing the Database Complexity, by Arthur ...
 
Intro to hadoop ecosystem
Intro to hadoop ecosystemIntro to hadoop ecosystem
Intro to hadoop ecosystem
 
ORDS, research data network
ORDS, research data networkORDS, research data network
ORDS, research data network
 
iRODS: Interoperability in Data Management
iRODS: Interoperability in Data ManagementiRODS: Interoperability in Data Management
iRODS: Interoperability in Data Management
 
Optimization Techniques
Optimization TechniquesOptimization Techniques
Optimization Techniques
 
Hota hadoop
Hota hadoopHota hadoop
Hota hadoop
 
Introduction To Using TensorFlow & Deep Learning
Introduction To Using TensorFlow & Deep LearningIntroduction To Using TensorFlow & Deep Learning
Introduction To Using TensorFlow & Deep Learning
 
Mongo db nosql (1)
Mongo db nosql (1)Mongo db nosql (1)
Mongo db nosql (1)
 
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
 
Squeeeze models
Squeeeze modelsSqueeeze models
Squeeeze models
 
WoSC19: Serverless Workflows for Indexing Large Scientific Data
WoSC19: Serverless Workflows for Indexing Large Scientific DataWoSC19: Serverless Workflows for Indexing Large Scientific Data
WoSC19: Serverless Workflows for Indexing Large Scientific Data
 
3 python packages
3 python packages3 python packages
3 python packages
 
Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)
 
Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...
 
Finding Association Rules in Linked Data
Finding Association Rules in Linked DataFinding Association Rules in Linked Data
Finding Association Rules in Linked Data
 
Provenance state-of-art
Provenance state-of-artProvenance state-of-art
Provenance state-of-art
 
An Introduction to Neural Architecture Search
An Introduction to Neural Architecture SearchAn Introduction to Neural Architecture Search
An Introduction to Neural Architecture Search
 

Similar to 報告

Apache Cassandra training. Overview and Basics
Apache Cassandra training. Overview and BasicsApache Cassandra training. Overview and Basics
Apache Cassandra training. Overview and Basics
Oleg Magazov
 
Distributed Framework for Data Mining As a Service on Private Cloud
Distributed Framework for Data Mining As a Service on Private CloudDistributed Framework for Data Mining As a Service on Private Cloud
Distributed Framework for Data Mining As a Service on Private Cloud
IJERA Editor
 
Technologies For Appraising and Managing Electronic Records
Technologies For Appraising and Managing Electronic RecordsTechnologies For Appraising and Managing Electronic Records
Technologies For Appraising and Managing Electronic Records
pbajcsy
 
BDSE 2015 Evaluation of Big Data Platforms with HiBench
BDSE 2015 Evaluation of Big Data Platforms with HiBenchBDSE 2015 Evaluation of Big Data Platforms with HiBench
BDSE 2015 Evaluation of Big Data Platforms with HiBench
t_ivanov
 
Cloud computing infrastructure
Cloud computing infrastructure Cloud computing infrastructure
Cloud computing infrastructure
Dr. Anita Goel
 
Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017
Dr. Anita Goel
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservices
Bigstep
 
Grid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the CloudGrid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the Cloud
Adianto Wibisono
 
Big Process for Big Data @ NASA
Big Process for Big Data @ NASABig Process for Big Data @ NASA
Big Process for Big Data @ NASA
Ian Foster
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
musrath mohammad
 
Brad stack - Digital Health and Well-Being Festival
Brad stack - Digital Health and Well-Being Festival Brad stack - Digital Health and Well-Being Festival
Brad stack - Digital Health and Well-Being Festival
Digital Health Enterprise Zone
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
SoftServe
 
Accelerating TensorFlow with RDMA for high-performance deep learning
Accelerating TensorFlow with RDMA for high-performance deep learningAccelerating TensorFlow with RDMA for high-performance deep learning
Accelerating TensorFlow with RDMA for high-performance deep learning
DataWorks Summit
 
Preservation Planning: Choosing a suitable digital preservation strategy
Preservation Planning: Choosing a suitable digital preservation strategyPreservation Planning: Choosing a suitable digital preservation strategy
Preservation Planning: Choosing a suitable digital preservation strategy
GarethKnight
 
140320702029 maurya ppt
140320702029 maurya ppt140320702029 maurya ppt
140320702029 maurya ppt
Maurya Shah
 
From a student to an apache committer practice of apache io tdb
From a student to an apache committer  practice of apache io tdbFrom a student to an apache committer  practice of apache io tdb
From a student to an apache committer practice of apache io tdb
jixuan1989
 
Atom: A cloud native deep learning platform at Supremind
Atom: A cloud native deep learning platform at SupremindAtom: A cloud native deep learning platform at Supremind
Atom: A cloud native deep learning platform at Supremind
Alluxio, Inc.
 
PhD Thesis Proposal
PhD Thesis Proposal PhD Thesis Proposal
PhD Thesis Proposal
Ziqiang Feng
 
E Afgan - Zero to a bioinformatics analysis platform in four minutes
E Afgan - Zero to a bioinformatics analysis platform in four minutesE Afgan - Zero to a bioinformatics analysis platform in four minutes
E Afgan - Zero to a bioinformatics analysis platform in four minutes
Jan Aerts
 

Similar to 報告 (20)

Apache Cassandra training. Overview and Basics
Apache Cassandra training. Overview and BasicsApache Cassandra training. Overview and Basics
Apache Cassandra training. Overview and Basics
 
Distributed Framework for Data Mining As a Service on Private Cloud
Distributed Framework for Data Mining As a Service on Private CloudDistributed Framework for Data Mining As a Service on Private Cloud
Distributed Framework for Data Mining As a Service on Private Cloud
 
Technologies For Appraising and Managing Electronic Records
Technologies For Appraising and Managing Electronic RecordsTechnologies For Appraising and Managing Electronic Records
Technologies For Appraising and Managing Electronic Records
 
BDSE 2015 Evaluation of Big Data Platforms with HiBench
BDSE 2015 Evaluation of Big Data Platforms with HiBenchBDSE 2015 Evaluation of Big Data Platforms with HiBench
BDSE 2015 Evaluation of Big Data Platforms with HiBench
 
Cloud computing infrastructure
Cloud computing infrastructure Cloud computing infrastructure
Cloud computing infrastructure
 
Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservices
 
Grid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the CloudGrid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the Cloud
 
Big Process for Big Data @ NASA
Big Process for Big Data @ NASABig Process for Big Data @ NASA
Big Process for Big Data @ NASA
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
 
Bertenthal
BertenthalBertenthal
Bertenthal
 
Brad stack - Digital Health and Well-Being Festival
Brad stack - Digital Health and Well-Being Festival Brad stack - Digital Health and Well-Being Festival
Brad stack - Digital Health and Well-Being Festival
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
 
Accelerating TensorFlow with RDMA for high-performance deep learning
Accelerating TensorFlow with RDMA for high-performance deep learningAccelerating TensorFlow with RDMA for high-performance deep learning
Accelerating TensorFlow with RDMA for high-performance deep learning
 
Preservation Planning: Choosing a suitable digital preservation strategy
Preservation Planning: Choosing a suitable digital preservation strategyPreservation Planning: Choosing a suitable digital preservation strategy
Preservation Planning: Choosing a suitable digital preservation strategy
 
140320702029 maurya ppt
140320702029 maurya ppt140320702029 maurya ppt
140320702029 maurya ppt
 
From a student to an apache committer practice of apache io tdb
From a student to an apache committer  practice of apache io tdbFrom a student to an apache committer  practice of apache io tdb
From a student to an apache committer practice of apache io tdb
 
Atom: A cloud native deep learning platform at Supremind
Atom: A cloud native deep learning platform at SupremindAtom: A cloud native deep learning platform at Supremind
Atom: A cloud native deep learning platform at Supremind
 
PhD Thesis Proposal
PhD Thesis Proposal PhD Thesis Proposal
PhD Thesis Proposal
 
E Afgan - Zero to a bioinformatics analysis platform in four minutes
E Afgan - Zero to a bioinformatics analysis platform in four minutesE Afgan - Zero to a bioinformatics analysis platform in four minutes
E Afgan - Zero to a bioinformatics analysis platform in four minutes
 

報告

  • 1. Performance Improvement and Analysis for Data Privacy of Cloud Computing Documents 指導教授:葉生正 作者:林俊淵 報告者:劉孝皇
  • 2. Outline • 緒論 • 文獻探討 • 研究方法 • 實驗環境與數據分析 • 結論及未來展望
  • 3. 緒論 - Cloud Computing • Cloud Computing Technologies, use virtualization and automation technologies to create and popularize computer in a variety of computing resources • Cloud Computing Services, obtaining services from a remote network connection Software-as-a-Service (SaaS) SaaS Platform-as-a-Service (PaaS) PaaS Infrastructure-as-a-Service (IaaS) IaaS
  • 4. Cloud computing - Definition
  • 6. Data Outsourcing • Security issues • Confidentiality, (data encryption and access control) • Integrity, (hashing and CRC)
  • 8. 研究目的 • The document which is large scale but needs to edit few parts • When modified, it does not have to encrypt the whole document but only the modified parts
  • 9. 文獻探討 (1/2) • Cong Wang (2010), ”Privacy-Preserving Public Auditing for Data Storage Security in Cloud Computing”, INFOCOM, pp. 1-9.
  • 10. 文獻探討 (2/2) • J. Hur and D. Noh, "Attribute-Based Access Control with Efficient Revocation in Data Outsourcing Systems," IEEE Transactions on Parallel and Distributed Systems, 2010. • Q. Liu, et al., "Efficient Sharing of Secure Cloud Storage Services," , IEEE International Conference on Computer and Information Technology, 2010, pp. 922- 929. • S. Ning, et al., "A privacy-preserving approach to policy-based content dissemination," in Data Engineering (ICDE), 2010, pp. 944-955.
  • 11. Red-Black tree • Red-Black tree was invented in 1972 by Rudolf Bayer. It is a type of self-balancing binary search tree Operation Time complexity in worse case Search O(log n) Insert O(log n) Delete O(log n) • Properties • Every node is either red or black • The root is black • Every leaf is black • If a node is red, both its children are black • Each path from the root to a leaf contains the same number of black nodes
  • 14. Rebuilding (p,{align:center};si:1;ei:9),(b;si:1;ei:4), Content Stylesheet Encryption
  • 15. Update TreeMap ~Insertion~ Tree Node { int : key StringBuilder : content }
  • 16. Update TreeMap ~Insertion~ cont. Step. 1 Step. 2 Step. 3 Step. 4
  • 17. Update TreeMap ~ Remove~ Tree Node { int : key StringBuilder : content }
  • 18. Update TreeMap ~ Remove~ cont. Step. 1 Step. 2 Step. 3 Step. 4
  • 19. Environment • Windows 7 64bit • Intel Core 2 Quad CPU Q8200 @2.33Ghz • 4.00 RAM • Windows XP 32bit (Run on VM) • Core 2 @2.33Ghz • 2.00GB RAM • Programming language : Java (JDK 1.6)
  • 20. 3DES & AES Encryption/Decryption time
  • 21. Document size 1MB 4MB 8MB 12MB 3DES Encrypt 187.4 618.6 1131.4 1721.8 Random Insert/Remove AES Encrypt 112.8 327.8 628 915.8
  • 24. Future work • In this study, although it pays the extra cost of establish and maintain the red-black tree, our method is more efficient if a large document was modified a small part • Improve the efficiency • 3DES is 31.04% • AES is 23.94% • In the future, I want to improve this study and develop a collaboration service