SlideShare a Scribd company logo
1 of 12
Download to read offline
sones Whitepaper


GraphDB
The sones GraphDB


The sones GraphDB is a new, object-oriented data management system based on graphs.
It enables efficient saving, management and evaluation of extremely complex data bases
that are networked to a high degree. The integrated solution combines the advantages of
file storage with the possibilities of a Database Management System. Unstructured data
(e.g. video files), semi-structured data (Meta data, e.g. log files) and structured data can be
linked to each other and thus be managed from one source and be evaluated as needed.


The idea for a new development of a database based on the graph theory arose in the
beginning of 2005 due to dissatisfaction with the existing solutions available in the
market. Often, those offered no or only insufficient ad-hoc analysis functions for queries
into strongly linked data. Besides, these systems were mostly not able to cope with the
frequently changing data and information structures applications have to process
especially in the web environment. Therefore, the development focused on providing
efficient data handling and real time analysis for these tasks which could previously due to
their complexity hardly be solved or only with the highest efforts. A fundamentally new
database technology was developed based on totally independent and new concepts.


The solution includes an object-oriented storage solution (sones GraphFS) for high-
performance data storage as well as the graph-oriented database management system
(sones GraphDB), which links the data to each other and makes their high-performance
evaluation possible. The summation of these two parts into one software offers the
advantage of highly efficient processing since any abstraction layers can be avoided. The
otherwise customary efforts for the operation of a database solution will be extremely
decreased by this solution.


The object-oriented storage solution (sones GraphFS) also stores unstructured and semi-
structured data, like e.g. document formats, photos, more efficiently than file systems,
which are the basis for the operation system. This principle successfully overcomes one of
the significant performance problems in the previous object-oriented
databases, which is the mapping of object models in relational tables. A patent has been
filed for the file system.


The graph-oriented data management (sones GraphDB) offers high flexibility of the data
structuring and enables the direct linking of the data without additional auxiliary
constructs like, for example, allocation tables in relational data bases. Since the data
model is oriented on the established programming languages, there is no discrepancy
between application logic and data storage. The complexity is in part drastically smaller
compared to the existing relational solutions. This leads amongst others to the following
advantage: With increasing linkage of the data, the complexity does not increase
exponentially, but in a linear manner. A further decisive advantage is the ability of the
data base to change or amend data structures during runtime. New requirements on the
part of the application can therefore be realized without extensive re-programming of a
relational database schematic.


The Graph Query Language (sones GQL) was developed to facilitate the data
management for the user. While its extent and syntax is oriented on SQL, it was adjusted
to the graph-oriented approach and makes the advantages which exist in this model
accessible to a direct and easily comprehensible query.
The market


The Database Management System is the most important instrument for electronic (mass)
data management. It is its job to store data efficiently, unambiguous and durably and to provide
it to the user on an as needed basis.
To this end, several concepts have been developed during the software development, which are
fundamentally different from each other.


Hierarchical databases
The model of the hierarchical
databases is the oldest. Data
was stored in a tree structure
starting from a “root“ data set.
Dependencies between data
sets are described as “parent-
child“    relations    in    the
hierarchy.
Hierarchical database systems have now been almost completely replaced. These are only still
used in banks and insurances for older applications. A representative is IMS/DB by IBM.
The concept of the hierarchical databases is partially re-used in document-oriented data bases,
e.g. when information is stored in a hierarchical structure in XML files.


                                                             Relational databases
                                                             The relational database model is
                                                             currently the most prevalent.
                                                             Data is stored in tables line by line.
                                                             For each data set (line) any number
                                                             of attributes can be stored (in
                                                             columns). Dependencies between
                                                             data sets can be described by way
                                                             of common use of clear attributes
(keys). The structure of the data model will be determined at the time of compilation (or
modification). In the past 20 years relational databases have achieved acceptance since
the requirements of structuring (in tables) and linking (by referencing the data sets to each
other) can be well fulfilled with the model. Additionally, there is a standardized query language
(SQL), which can be easily used for evaluations. The relational database model reaches its
limitation when data are not directly dependent, but through several relations (For example, a
multi-level hierarchy cannot be specifically evaluated with a SELECT Statement). Moreover,
often additional artificial key attributes have to be applied to reduce the complexity of links and
thereby ensure the required performance. This in turn increases the complexity of the model
and requires additional storage space.
There are many standardized interfaces and a large number of useful tools for relational data
base systems.
The most well-known commercial databases are those by Oracle (as of Jan. 2010: version 11g),
Microsoft SQL Server and IBM DB/2. The most well-known Open Source databases are
MySQL and Postgres.

Object-oriented databases
Object-oriented       databases        are      a
“comparatively“ new concept (Developed
since the end of the 80s – together with the
object-oriented programming languages).
The structure of the data model is determined
at its compilation (or modification). All
attributes   that   describe    an   object    are
summarized into one object class by the data
modeling. Relations between objects are
described through references (as in the object-
oriented     programming       languages).    The
advantage compared to the relational data bases
is that no auxiliary constructs (artificial key
attributes, allocation tables) have to be planned
for dependencies. This decreases the effort of
the compilation as well as the complexity of
the data base. Since they do not reach the performance of relational data base management
systems, object-oriented data bases do not yet have a large distribution. Commercial solutions
are for example Versant, InterSystems (Cachè) and Objectivity (Objectivity/DB).
Document-oriented databases
It is the goal of the document-oriented databases not to determine the data structures at the time
of the conception of the database models. Data are stored in an unstructured manner and
completely evaluated when needed. Providers of document-oriented data bases usually offer
other solutions to ensure the performance of the evaluations for large amounts of data.
The advantage of this solution is the flexibility at the time of the data import, since no
specifications are made to the data structure. This, however, increases the effort when
evaluating the (so-called “semi-structured“) data.
Apache CouchDB, MongoDB or Amazons SimpleDB are examples for document-oriented
data bases.


Graph-oriented databases
Graph-oriented data bases store
nodes and relations between
nodes. An indirect (complex)
relation (e.g. between object 1
and 3) exist if an object
references a second and this in
turn a third.


Multilevel relations are called graphs.
Graph-oriented databases allow a query of indirect relations over multiple edges. The edges can
also be weighed to describe the degree of the relation. Thus, complex and indirect dependencies
in data bases can be easily described and their high-performance evaluation can be enabled.
The evaluation of the data is not limited to the lists, but the data can also be processed structured
as graphs. The advantage of this approach is that the work step of data structuring will be saved
in the data processing application. The GraphDB of sones is a graph-oriented database.
The architecture


Contrary to most object-oriented approaches the sones GraphDB is a complete internal
development. The approach of a complete re-development was in particular chosen to preclude
potential legacy problems. The whole development is based on the approach to challenge
established approaches and to re-think them if necessary.


The graphic illustrates the structure of the sones GraphDB. It consists of four application layers
– the Storage Layer, the File System, the Database Management System as well as the Data
Access Layer.
These as well as the available interfaces are explained in detail below.
Storage Engines

The Storage Engines separate the physical storage media from the application logic. The
interface furthermore offers functions to measure performance and for fault management.


File System – GraphFS

The decision to re-develop the File System from the ground up is based on avoiding the
paradigm shift between graph-oriented data base logic and structural storage of the data in files.


Instead the objects are stored as data streams based on blocks in the Storage Engines.
The storage in unstructured data streams is high-performing. The extend based storage ensures
defined entry points.
The interface between File System and Storage Engines is reduced to the provision of a data
stream. Thus, the access is not limited to locally tied storage systems, but can also be tied via
TCP/IP or iSCSI to external systems.
To guarantee high-performance access to the objects, different Meta data (e.g. type of data
objects) are indexed together with a clear Object Location. This contains all information to
specifically access any object. Moreover, one’s own Meta data can be allocated to the data
objects and can be evaluated.


The File System offers the following functions:
       Storage of the data in netlike structures
       High-performance storage of both structured data (objects) as well as binary data
       Automatic data structuring and optimization
       Information LifeCycle Management beyond hardware limitations
       High performance through direct connection with the Data Base Management System
       Versioning of data objects
Database Management System - GraphDB


Software internal data organization is the job of the Database Management System. This
includes for example the structured and high-performance object storage and provisioning. It
also provides the interfaces, which enable the interaction with the system.
All data access takes place formalized in the query language Graph Query Language (GQL).
The syntax is oriented on SQL, but has been adapted for the interaction with graph-oriented
data structures.


The main feature of sones GraphDB is the data management of objects, which can be linked to
each other in graph structures. When making a query it is possibly to also evaluate data, which
is indirectly linked (through several objects). The objects are also provided in the graph
structure for evaluation. An additional mapping (like e.g. in the case of relational data bases) is
therefore not necessary.


The attributes of the data objects can be defined free of type errors (C# data types). Complex
structures can be shown through lists and object dependencies.
The database offers the following functions:
        Attributes of objects can be defined free of type errors (C# data types)
        Complex types can for example by shown through object referencing
        Inheritance of objects (for more specifics standardization of types or for expansion)
        Editing of objects
        Versioning of objects
        Data safety (redundancy, mirroring, striping, backup) can be defined on object level
        Data integrity is ensured via checksums (CRC32, SHA1, MD5)
        Authorization concept for the data base can be defined


DataStorage – GraphDS


GraphDS provides the interface to the product. A GQL command line as well as a user
interface is provided for administration. The system can also be accessed via WebService,
REST or WebShell.
The Graph Query Language (GQL)


The challenge of the graph-oriented data bases is to make the advantages of this approach also
easy to use for the users and applications.
Currently 2 approaches are being pursued:
There is the query language SPARQL. It is oriented on RDF, which makes an adaption for
Database Management Systems more difficult.
The second approach is to specifically track resp. evaluate the edges. This solution is flexible
and high-performing, however, this solution is difficult to implement without knowledge of
programming and requires proximity to the data model (e.g. API).
sones therefore decided to develop its own query language modeled on SQL (Graph Query
Language - GQL). Thus, the learning requirements are very small for persons with SQL
knowledge. GQL can be accessed through several interfaces. Support of SPARQL is planned.


The Graph Query Language offers the following functions:
       Functionalities analog SQL (CREATE, INSERT, UPDATE, ALTER, SELECT,
        etc.)
       Graph theoretical functions, e.g. pathfinding or weighing of edges
At a glance (USP’s)


!          Universal access
With Graph DS sones offers a data base solution for problems which can only be shown in the
relational models with big efforts without giving up their advantages (standardized query
language, structuring of the data). The model recommends itself for the illustration of netlike
(e.g. social networks) or hierarchical structures.


!          More speed
A higher performance for queries than what was possible with the previously existing solutions
is reached by using graph theoretical algorithms.


The system is optimized for the use of multi / many core processors. Complex queries are
processed simultaneously in several threads.


!          Scalability
The performance of the Database Management System is independent from the data amount.


GraphDS can be integrated in existing structures, since it is a transparent solution, which
supports standardized interfaces and offers an easy to learn query language.


!          Flexibility
The flexible storage system makes object and attribute changes possible at any time during
runtime.


!          Lower costs
Significant decrease in expenses for implementation as well as operating costs.


Sones GraphDB uses the .NET framework consistently. Therefore, the system is only offered
on .NET supported operating systems. The compatibility with the mono project will be ensured
in the product test.
sones GmbH                      © 2010 sones GmbH. All rights reserved. sones as well as the according logos are registered trademarks of the sones.
February 2010                   All other names of products and services are trademarks of their respective owners. This document is provided for
                                information purposes only and the contents hereof are subject to change without notice.
sones GmbH
Eugen-Richter-Str. 44           This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed
99085 Erfurt                    orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular
Germany                         purpose. We specifically disclaim any liability with respect to this document and no contractual obligations are
                                formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any
Tel.: +49(0) 361 - 30 26 25 0   form or by any means, electronical or mechanical, for any purpose, without our prior written permission.
Fax: +49(0) 361 - 244 500 8

More Related Content

What's hot

Database models unit 1 part 2
Database models unit 1  part 2Database models unit 1  part 2
Database models unit 1 part 2Ram Paliwal
 
Database system concepts
Database system conceptsDatabase system concepts
Database system conceptsKumar
 
Lecture 01 introduction to database
Lecture 01 introduction to databaseLecture 01 introduction to database
Lecture 01 introduction to databaseemailharmeet
 
No sql – rise of the clusters
No sql – rise of the clustersNo sql – rise of the clusters
No sql – rise of the clustersresponseteam
 
Dbms classification according to data models
Dbms classification according to data modelsDbms classification according to data models
Dbms classification according to data modelsABDUL KHALIQ
 
Data base management systems ppt
Data base management systems pptData base management systems ppt
Data base management systems pptsuthi
 
data resource management
 data resource management data resource management
data resource managementsoodsurbhi123
 
DBMS & Data Models - In Introduction
DBMS & Data Models - In IntroductionDBMS & Data Models - In Introduction
DBMS & Data Models - In IntroductionRajeev Srivastava
 
Birthof Relation Database
Birthof Relation DatabaseBirthof Relation Database
Birthof Relation DatabaseRaj Bhat
 
Introduction to database
Introduction to databaseIntroduction to database
Introduction to databaseArpee Callejo
 

What's hot (20)

Database models unit 1 part 2
Database models unit 1  part 2Database models unit 1  part 2
Database models unit 1 part 2
 
Lecture 04 data resource management
Lecture 04 data resource managementLecture 04 data resource management
Lecture 04 data resource management
 
Database system concepts
Database system conceptsDatabase system concepts
Database system concepts
 
Unit 1
Unit 1Unit 1
Unit 1
 
Lecture 01 introduction to database
Lecture 01 introduction to databaseLecture 01 introduction to database
Lecture 01 introduction to database
 
No sql – rise of the clusters
No sql – rise of the clustersNo sql – rise of the clusters
No sql – rise of the clusters
 
Database
DatabaseDatabase
Database
 
Managing data resources
Managing  data resourcesManaging  data resources
Managing data resources
 
Dbms classification according to data models
Dbms classification according to data modelsDbms classification according to data models
Dbms classification according to data models
 
Data base management systems ppt
Data base management systems pptData base management systems ppt
Data base management systems ppt
 
Different data models
Different data modelsDifferent data models
Different data models
 
Session#5; data resource managment
Session#5;  data resource managmentSession#5;  data resource managment
Session#5; data resource managment
 
Dbms database models
Dbms database modelsDbms database models
Dbms database models
 
data resource management
 data resource management data resource management
data resource management
 
DBMS & Data Models - In Introduction
DBMS & Data Models - In IntroductionDBMS & Data Models - In Introduction
DBMS & Data Models - In Introduction
 
DBMS and its Models
DBMS and its ModelsDBMS and its Models
DBMS and its Models
 
Dbms module i
Dbms module iDbms module i
Dbms module i
 
Dbms Networks
Dbms NetworksDbms Networks
Dbms Networks
 
Birthof Relation Database
Birthof Relation DatabaseBirthof Relation Database
Birthof Relation Database
 
Introduction to database
Introduction to databaseIntroduction to database
Introduction to database
 

Viewers also liked

GQL CheatSheet 1.1
GQL CheatSheet 1.1GQL CheatSheet 1.1
GQL CheatSheet 1.1sones GmbH
 
Scalling through Couchbase at Sky Deutschland (Couchbase Live France 2015)
Scalling through Couchbase at Sky Deutschland (Couchbase Live France 2015)Scalling through Couchbase at Sky Deutschland (Couchbase Live France 2015)
Scalling through Couchbase at Sky Deutschland (Couchbase Live France 2015)Norbert Gergely
 
GQL cheat sheet latest
GQL cheat sheet latestGQL cheat sheet latest
GQL cheat sheet latestsones GmbH
 
指陸科技行銷GRO公司簡介
指陸科技行銷GRO公司簡介指陸科技行銷GRO公司簡介
指陸科技行銷GRO公司簡介netvcd
 
sones on windows azure whitepaper (english)
sones on windows azure whitepaper (english)sones on windows azure whitepaper (english)
sones on windows azure whitepaper (english)sones GmbH
 
sones company presentation
sones company presentationsones company presentation
sones company presentationsones GmbH
 
sones auf windows azure whitepaper (german)
sones auf windows azure whitepaper (german)sones auf windows azure whitepaper (german)
sones auf windows azure whitepaper (german)sones GmbH
 
Whitepaper sones GraphDB (ger)
Whitepaper sones GraphDB (ger)Whitepaper sones GraphDB (ger)
Whitepaper sones GraphDB (ger)sones GmbH
 

Viewers also liked (8)

GQL CheatSheet 1.1
GQL CheatSheet 1.1GQL CheatSheet 1.1
GQL CheatSheet 1.1
 
Scalling through Couchbase at Sky Deutschland (Couchbase Live France 2015)
Scalling through Couchbase at Sky Deutschland (Couchbase Live France 2015)Scalling through Couchbase at Sky Deutschland (Couchbase Live France 2015)
Scalling through Couchbase at Sky Deutschland (Couchbase Live France 2015)
 
GQL cheat sheet latest
GQL cheat sheet latestGQL cheat sheet latest
GQL cheat sheet latest
 
指陸科技行銷GRO公司簡介
指陸科技行銷GRO公司簡介指陸科技行銷GRO公司簡介
指陸科技行銷GRO公司簡介
 
sones on windows azure whitepaper (english)
sones on windows azure whitepaper (english)sones on windows azure whitepaper (english)
sones on windows azure whitepaper (english)
 
sones company presentation
sones company presentationsones company presentation
sones company presentation
 
sones auf windows azure whitepaper (german)
sones auf windows azure whitepaper (german)sones auf windows azure whitepaper (german)
sones auf windows azure whitepaper (german)
 
Whitepaper sones GraphDB (ger)
Whitepaper sones GraphDB (ger)Whitepaper sones GraphDB (ger)
Whitepaper sones GraphDB (ger)
 

Similar to Whitepaper sones GraphDB (eng)

data base system to new data science lerne
data base system to new data science lernedata base system to new data science lerne
data base system to new data science lernetarunprajapati0t
 
Student POST  Database processing models showcase the logical s.docx
Student POST  Database processing models showcase the logical s.docxStudent POST  Database processing models showcase the logical s.docx
Student POST  Database processing models showcase the logical s.docxorlandov3
 
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENTHYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENTIJCSEA Journal
 
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENTHYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENTIJCSEA Journal
 
DATABASE MANAGEMENT SYSTEM-MRS. LAXMI B PANDYA FOR 25TH AUGUST,2022.pptx
DATABASE MANAGEMENT SYSTEM-MRS. LAXMI B PANDYA FOR 25TH AUGUST,2022.pptxDATABASE MANAGEMENT SYSTEM-MRS. LAXMI B PANDYA FOR 25TH AUGUST,2022.pptx
DATABASE MANAGEMENT SYSTEM-MRS. LAXMI B PANDYA FOR 25TH AUGUST,2022.pptxLaxmi Pandya
 
Spatial Database and Database Management System
Spatial Database and Database Management SystemSpatial Database and Database Management System
Spatial Database and Database Management SystemLal Mohammad
 
Comparison of Relational Database and Object Oriented Database
Comparison of Relational Database and Object Oriented DatabaseComparison of Relational Database and Object Oriented Database
Comparison of Relational Database and Object Oriented DatabaseEditor IJMTER
 
Database and Database Management (DBM): Health Informatics
Database and Database Management (DBM): Health InformaticsDatabase and Database Management (DBM): Health Informatics
Database and Database Management (DBM): Health InformaticsZulfiquer Ahmed Amin
 
No Sql On Social And Sematic Web
No Sql On Social And Sematic WebNo Sql On Social And Sematic Web
No Sql On Social And Sematic WebStefan Ceriu
 
NoSQL On Social And Sematic Web
NoSQL On Social And Sematic WebNoSQL On Social And Sematic Web
NoSQL On Social And Sematic WebStefan Prutianu
 
Database Systems - introduction
Database Systems - introductionDatabase Systems - introduction
Database Systems - introductionJananath Banuka
 
2. Chapter Two.pdf
2. Chapter Two.pdf2. Chapter Two.pdf
2. Chapter Two.pdffikadumola
 
Query Optimization Techniques in Graph Databases
Query Optimization Techniques in Graph DatabasesQuery Optimization Techniques in Graph Databases
Query Optimization Techniques in Graph Databasesijdms
 

Similar to Whitepaper sones GraphDB (eng) (20)

data base system to new data science lerne
data base system to new data science lernedata base system to new data science lerne
data base system to new data science lerne
 
Presentation1
Presentation1Presentation1
Presentation1
 
Student POST  Database processing models showcase the logical s.docx
Student POST  Database processing models showcase the logical s.docxStudent POST  Database processing models showcase the logical s.docx
Student POST  Database processing models showcase the logical s.docx
 
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENTHYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
 
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENTHYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
HYBRID DATABASE SYSTEM FOR BIG DATA STORAGE AND MANAGEMENT
 
Unit-10.pptx
Unit-10.pptxUnit-10.pptx
Unit-10.pptx
 
Computer Science Dissertation Literature Review Example
Computer Science Dissertation Literature Review ExampleComputer Science Dissertation Literature Review Example
Computer Science Dissertation Literature Review Example
 
Computer Science Dissertation Literature Review Example
Computer Science Dissertation Literature Review ExampleComputer Science Dissertation Literature Review Example
Computer Science Dissertation Literature Review Example
 
DATABASE MANAGEMENT SYSTEM-MRS. LAXMI B PANDYA FOR 25TH AUGUST,2022.pptx
DATABASE MANAGEMENT SYSTEM-MRS. LAXMI B PANDYA FOR 25TH AUGUST,2022.pptxDATABASE MANAGEMENT SYSTEM-MRS. LAXMI B PANDYA FOR 25TH AUGUST,2022.pptx
DATABASE MANAGEMENT SYSTEM-MRS. LAXMI B PANDYA FOR 25TH AUGUST,2022.pptx
 
Spatial Database and Database Management System
Spatial Database and Database Management SystemSpatial Database and Database Management System
Spatial Database and Database Management System
 
Lecture#5
Lecture#5Lecture#5
Lecture#5
 
Comparison of Relational Database and Object Oriented Database
Comparison of Relational Database and Object Oriented DatabaseComparison of Relational Database and Object Oriented Database
Comparison of Relational Database and Object Oriented Database
 
Database and Database Management (DBM): Health Informatics
Database and Database Management (DBM): Health InformaticsDatabase and Database Management (DBM): Health Informatics
Database and Database Management (DBM): Health Informatics
 
No Sql On Social And Sematic Web
No Sql On Social And Sematic WebNo Sql On Social And Sematic Web
No Sql On Social And Sematic Web
 
NoSQL On Social And Sematic Web
NoSQL On Social And Sematic WebNoSQL On Social And Sematic Web
NoSQL On Social And Sematic Web
 
View of data DBMS
View of data DBMSView of data DBMS
View of data DBMS
 
Database Systems - introduction
Database Systems - introductionDatabase Systems - introduction
Database Systems - introduction
 
2. Chapter Two.pdf
2. Chapter Two.pdf2. Chapter Two.pdf
2. Chapter Two.pdf
 
Mis chapter 7 database systems
Mis chapter 7 database systemsMis chapter 7 database systems
Mis chapter 7 database systems
 
Query Optimization Techniques in Graph Databases
Query Optimization Techniques in Graph DatabasesQuery Optimization Techniques in Graph Databases
Query Optimization Techniques in Graph Databases
 

Recently uploaded

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 

Recently uploaded (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 

Whitepaper sones GraphDB (eng)

  • 2. The sones GraphDB The sones GraphDB is a new, object-oriented data management system based on graphs. It enables efficient saving, management and evaluation of extremely complex data bases that are networked to a high degree. The integrated solution combines the advantages of file storage with the possibilities of a Database Management System. Unstructured data (e.g. video files), semi-structured data (Meta data, e.g. log files) and structured data can be linked to each other and thus be managed from one source and be evaluated as needed. The idea for a new development of a database based on the graph theory arose in the beginning of 2005 due to dissatisfaction with the existing solutions available in the market. Often, those offered no or only insufficient ad-hoc analysis functions for queries into strongly linked data. Besides, these systems were mostly not able to cope with the frequently changing data and information structures applications have to process especially in the web environment. Therefore, the development focused on providing efficient data handling and real time analysis for these tasks which could previously due to their complexity hardly be solved or only with the highest efforts. A fundamentally new database technology was developed based on totally independent and new concepts. The solution includes an object-oriented storage solution (sones GraphFS) for high- performance data storage as well as the graph-oriented database management system (sones GraphDB), which links the data to each other and makes their high-performance evaluation possible. The summation of these two parts into one software offers the advantage of highly efficient processing since any abstraction layers can be avoided. The otherwise customary efforts for the operation of a database solution will be extremely decreased by this solution. The object-oriented storage solution (sones GraphFS) also stores unstructured and semi- structured data, like e.g. document formats, photos, more efficiently than file systems, which are the basis for the operation system. This principle successfully overcomes one of the significant performance problems in the previous object-oriented
  • 3. databases, which is the mapping of object models in relational tables. A patent has been filed for the file system. The graph-oriented data management (sones GraphDB) offers high flexibility of the data structuring and enables the direct linking of the data without additional auxiliary constructs like, for example, allocation tables in relational data bases. Since the data model is oriented on the established programming languages, there is no discrepancy between application logic and data storage. The complexity is in part drastically smaller compared to the existing relational solutions. This leads amongst others to the following advantage: With increasing linkage of the data, the complexity does not increase exponentially, but in a linear manner. A further decisive advantage is the ability of the data base to change or amend data structures during runtime. New requirements on the part of the application can therefore be realized without extensive re-programming of a relational database schematic. The Graph Query Language (sones GQL) was developed to facilitate the data management for the user. While its extent and syntax is oriented on SQL, it was adjusted to the graph-oriented approach and makes the advantages which exist in this model accessible to a direct and easily comprehensible query.
  • 4. The market The Database Management System is the most important instrument for electronic (mass) data management. It is its job to store data efficiently, unambiguous and durably and to provide it to the user on an as needed basis. To this end, several concepts have been developed during the software development, which are fundamentally different from each other. Hierarchical databases The model of the hierarchical databases is the oldest. Data was stored in a tree structure starting from a “root“ data set. Dependencies between data sets are described as “parent- child“ relations in the hierarchy. Hierarchical database systems have now been almost completely replaced. These are only still used in banks and insurances for older applications. A representative is IMS/DB by IBM. The concept of the hierarchical databases is partially re-used in document-oriented data bases, e.g. when information is stored in a hierarchical structure in XML files. Relational databases The relational database model is currently the most prevalent. Data is stored in tables line by line. For each data set (line) any number of attributes can be stored (in columns). Dependencies between data sets can be described by way of common use of clear attributes (keys). The structure of the data model will be determined at the time of compilation (or modification). In the past 20 years relational databases have achieved acceptance since
  • 5. the requirements of structuring (in tables) and linking (by referencing the data sets to each other) can be well fulfilled with the model. Additionally, there is a standardized query language (SQL), which can be easily used for evaluations. The relational database model reaches its limitation when data are not directly dependent, but through several relations (For example, a multi-level hierarchy cannot be specifically evaluated with a SELECT Statement). Moreover, often additional artificial key attributes have to be applied to reduce the complexity of links and thereby ensure the required performance. This in turn increases the complexity of the model and requires additional storage space. There are many standardized interfaces and a large number of useful tools for relational data base systems. The most well-known commercial databases are those by Oracle (as of Jan. 2010: version 11g), Microsoft SQL Server and IBM DB/2. The most well-known Open Source databases are MySQL and Postgres. Object-oriented databases Object-oriented databases are a “comparatively“ new concept (Developed since the end of the 80s – together with the object-oriented programming languages). The structure of the data model is determined at its compilation (or modification). All attributes that describe an object are summarized into one object class by the data modeling. Relations between objects are described through references (as in the object- oriented programming languages). The advantage compared to the relational data bases is that no auxiliary constructs (artificial key attributes, allocation tables) have to be planned for dependencies. This decreases the effort of the compilation as well as the complexity of the data base. Since they do not reach the performance of relational data base management systems, object-oriented data bases do not yet have a large distribution. Commercial solutions are for example Versant, InterSystems (Cachè) and Objectivity (Objectivity/DB).
  • 6. Document-oriented databases It is the goal of the document-oriented databases not to determine the data structures at the time of the conception of the database models. Data are stored in an unstructured manner and completely evaluated when needed. Providers of document-oriented data bases usually offer other solutions to ensure the performance of the evaluations for large amounts of data. The advantage of this solution is the flexibility at the time of the data import, since no specifications are made to the data structure. This, however, increases the effort when evaluating the (so-called “semi-structured“) data. Apache CouchDB, MongoDB or Amazons SimpleDB are examples for document-oriented data bases. Graph-oriented databases Graph-oriented data bases store nodes and relations between nodes. An indirect (complex) relation (e.g. between object 1 and 3) exist if an object references a second and this in turn a third. Multilevel relations are called graphs. Graph-oriented databases allow a query of indirect relations over multiple edges. The edges can also be weighed to describe the degree of the relation. Thus, complex and indirect dependencies in data bases can be easily described and their high-performance evaluation can be enabled. The evaluation of the data is not limited to the lists, but the data can also be processed structured as graphs. The advantage of this approach is that the work step of data structuring will be saved in the data processing application. The GraphDB of sones is a graph-oriented database.
  • 7. The architecture Contrary to most object-oriented approaches the sones GraphDB is a complete internal development. The approach of a complete re-development was in particular chosen to preclude potential legacy problems. The whole development is based on the approach to challenge established approaches and to re-think them if necessary. The graphic illustrates the structure of the sones GraphDB. It consists of four application layers – the Storage Layer, the File System, the Database Management System as well as the Data Access Layer. These as well as the available interfaces are explained in detail below.
  • 8. Storage Engines The Storage Engines separate the physical storage media from the application logic. The interface furthermore offers functions to measure performance and for fault management. File System – GraphFS The decision to re-develop the File System from the ground up is based on avoiding the paradigm shift between graph-oriented data base logic and structural storage of the data in files. Instead the objects are stored as data streams based on blocks in the Storage Engines. The storage in unstructured data streams is high-performing. The extend based storage ensures defined entry points. The interface between File System and Storage Engines is reduced to the provision of a data stream. Thus, the access is not limited to locally tied storage systems, but can also be tied via TCP/IP or iSCSI to external systems. To guarantee high-performance access to the objects, different Meta data (e.g. type of data objects) are indexed together with a clear Object Location. This contains all information to specifically access any object. Moreover, one’s own Meta data can be allocated to the data objects and can be evaluated. The File System offers the following functions:  Storage of the data in netlike structures  High-performance storage of both structured data (objects) as well as binary data  Automatic data structuring and optimization  Information LifeCycle Management beyond hardware limitations  High performance through direct connection with the Data Base Management System  Versioning of data objects
  • 9. Database Management System - GraphDB Software internal data organization is the job of the Database Management System. This includes for example the structured and high-performance object storage and provisioning. It also provides the interfaces, which enable the interaction with the system. All data access takes place formalized in the query language Graph Query Language (GQL). The syntax is oriented on SQL, but has been adapted for the interaction with graph-oriented data structures. The main feature of sones GraphDB is the data management of objects, which can be linked to each other in graph structures. When making a query it is possibly to also evaluate data, which is indirectly linked (through several objects). The objects are also provided in the graph structure for evaluation. An additional mapping (like e.g. in the case of relational data bases) is therefore not necessary. The attributes of the data objects can be defined free of type errors (C# data types). Complex structures can be shown through lists and object dependencies. The database offers the following functions:  Attributes of objects can be defined free of type errors (C# data types)  Complex types can for example by shown through object referencing  Inheritance of objects (for more specifics standardization of types or for expansion)  Editing of objects  Versioning of objects  Data safety (redundancy, mirroring, striping, backup) can be defined on object level  Data integrity is ensured via checksums (CRC32, SHA1, MD5)  Authorization concept for the data base can be defined DataStorage – GraphDS GraphDS provides the interface to the product. A GQL command line as well as a user interface is provided for administration. The system can also be accessed via WebService, REST or WebShell.
  • 10. The Graph Query Language (GQL) The challenge of the graph-oriented data bases is to make the advantages of this approach also easy to use for the users and applications. Currently 2 approaches are being pursued: There is the query language SPARQL. It is oriented on RDF, which makes an adaption for Database Management Systems more difficult. The second approach is to specifically track resp. evaluate the edges. This solution is flexible and high-performing, however, this solution is difficult to implement without knowledge of programming and requires proximity to the data model (e.g. API). sones therefore decided to develop its own query language modeled on SQL (Graph Query Language - GQL). Thus, the learning requirements are very small for persons with SQL knowledge. GQL can be accessed through several interfaces. Support of SPARQL is planned. The Graph Query Language offers the following functions:  Functionalities analog SQL (CREATE, INSERT, UPDATE, ALTER, SELECT, etc.)  Graph theoretical functions, e.g. pathfinding or weighing of edges
  • 11. At a glance (USP’s) ! Universal access With Graph DS sones offers a data base solution for problems which can only be shown in the relational models with big efforts without giving up their advantages (standardized query language, structuring of the data). The model recommends itself for the illustration of netlike (e.g. social networks) or hierarchical structures. ! More speed A higher performance for queries than what was possible with the previously existing solutions is reached by using graph theoretical algorithms. The system is optimized for the use of multi / many core processors. Complex queries are processed simultaneously in several threads. ! Scalability The performance of the Database Management System is independent from the data amount. GraphDS can be integrated in existing structures, since it is a transparent solution, which supports standardized interfaces and offers an easy to learn query language. ! Flexibility The flexible storage system makes object and attribute changes possible at any time during runtime. ! Lower costs Significant decrease in expenses for implementation as well as operating costs. Sones GraphDB uses the .NET framework consistently. Therefore, the system is only offered on .NET supported operating systems. The compatibility with the mono project will be ensured in the product test.
  • 12. sones GmbH © 2010 sones GmbH. All rights reserved. sones as well as the according logos are registered trademarks of the sones. February 2010 All other names of products and services are trademarks of their respective owners. This document is provided for information purposes only and the contents hereof are subject to change without notice. sones GmbH Eugen-Richter-Str. 44 This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed 99085 Erfurt orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular Germany purpose. We specifically disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any Tel.: +49(0) 361 - 30 26 25 0 form or by any means, electronical or mechanical, for any purpose, without our prior written permission. Fax: +49(0) 361 - 244 500 8