The document discusses limitations of relational databases for large, unstructured data and introduces NoSQL technology as an alternative. It describes four types of NoSQL databases - key-value, column-oriented, graph and document - and how they address issues like flexibility, distribution and scaling. Examples are given showing how NoSQL databases like Cassandra can process large amounts of data much faster than MySQL. The conclusion is that NoSQL databases are better suited than relational databases for big data and dynamic applications needing fast, distributed processing of unstructured information at low cost.
Building next generation data warehousesAlex Meadows
All Things Open 2016 Talk - discussing technologies used to augment traditional data warehousing. Those technologies are:
* data vault
* anchor modeling
* linked data
* NoSQL
* data virtualization
* textual disambiguation
Triple stores are finally seeing mainstream use, but what exactly is all this talk about linked data? In this deck, we discuss what the semantic web is and how to map your relational data sets into a triple store database using open source software.
How Linked Data Can Speed Information DiscoveryAlex Meadows
Linked data platforms are now making it easier than ever to perform data exploration and discovery without having to wait to get the data integrated into the data warehouse. In this presentation, we discuss what linked data is and show a case study on integrating separate source systems so that scientists don't have to learn the source systems structures to get to their data.
Semantic Graph Databases: The Evolution of Relational DatabasesCambridge Semantics
In this webinar, Barry Zane, our Vice President of Engineering, discusses the evolution of databases from Relational to Semantic Graph and the Anzo Graph Query Engine, the key element of scale in the Anzo Smart Data Lake. Based on elastic clustered, in-memory computing, the Anzo Graph Query Engine offers interactive ad hoc query and analytics on datasets with billions of triples. With this powerful layer over their data, end users can effect powerful analytic workflows in a self-service manner.
Selecting the right database type for your knowledge management needs.Synaptica, LLC
This presentation looks at relational vs. graph databases and their advantages and disadvantages in storing semantic data for taxonomies and ontologies.
Building next generation data warehousesAlex Meadows
All Things Open 2016 Talk - discussing technologies used to augment traditional data warehousing. Those technologies are:
* data vault
* anchor modeling
* linked data
* NoSQL
* data virtualization
* textual disambiguation
Triple stores are finally seeing mainstream use, but what exactly is all this talk about linked data? In this deck, we discuss what the semantic web is and how to map your relational data sets into a triple store database using open source software.
How Linked Data Can Speed Information DiscoveryAlex Meadows
Linked data platforms are now making it easier than ever to perform data exploration and discovery without having to wait to get the data integrated into the data warehouse. In this presentation, we discuss what linked data is and show a case study on integrating separate source systems so that scientists don't have to learn the source systems structures to get to their data.
Semantic Graph Databases: The Evolution of Relational DatabasesCambridge Semantics
In this webinar, Barry Zane, our Vice President of Engineering, discusses the evolution of databases from Relational to Semantic Graph and the Anzo Graph Query Engine, the key element of scale in the Anzo Smart Data Lake. Based on elastic clustered, in-memory computing, the Anzo Graph Query Engine offers interactive ad hoc query and analytics on datasets with billions of triples. With this powerful layer over their data, end users can effect powerful analytic workflows in a self-service manner.
Selecting the right database type for your knowledge management needs.Synaptica, LLC
This presentation looks at relational vs. graph databases and their advantages and disadvantages in storing semantic data for taxonomies and ontologies.
DBPedia past, present and future - Dimitris Kontokostas. Reveals recent developments in the Linked Data and knowledge graphs field and how DBPedia progress with wikipedia data.
The Power of Semantic Technologies to Explore Linked Open DataOntotext
Atanas Kiryakov's, Ontotext’s CEO, presentation at the first edition of Graphorum (http://graphorum2017.dataversity.net/) – a new forum that taps into the growing interest in Graph Databases and Technologies. Graphorum is co-located with the Smart Data Conference, organized by the digital publishing platform Dataversity.
The presentation demonstrates the capabilities of Ontotext’s own approach to contributing to the discipline of more intelligent information gathering and analysis by:
- graphically explorinh the connectivity patterns in big datasets;
- building new links between identical entities residing in different data silos;
- getting insights of what type of queries can be run against various linked data sets;
- reliably filtering information based on relationships, e.g., between people and organizations, in the news;
- demonstrating the conversion of tabular data into RDF.
Learn more at http://ontotext.com/.
A modern resource view for tabular data.
This talks shows a modern drop-in replacement for the current default Recline.js based table view in CKAN, which is well beyond a normal table viewer.
Within the Islandora context there have been a number of ways paged content has been handled. One of the simplest approaches to paged content is to store it as a PDF document. This continues to be a viable option in Islandora and is supported by a number of solution packs including the PDF Solution Pack. A more sophisticated and preservation friendly approach to paged content is to treat pages as individual digital objects that are related to a parent object. The Book Solution Pack uses this model and it can be repurposed to satisfy many paged content models including journal content by modifying the metadata required. For instance, the Newspaper Solution Pack shares much of the same code as the Book Solution pack, but metadata requirements and the viewer are different.
A talk given at the Ingenta Publisher Forum, in November 2008.
See: http://www.ldodds.com/blog/archives/000264.html
For a detailed description of the talk.
Now I See You, Now I Understand You - New Web SemanticsRicardo Castelhano
My talk about Web Semantics, the new HTML5 structure tags, the usage of microdata and rdfa lite, choosing vocabularies/taxonomies and the schema.org project.
A comparison between Relational and Non relational Databases.
The full article is available at this link: https://towardsdatascience.com/relational-vs-non-relational-databases-f2ac792482e3
Presentation given at ODW2013 (http://www.w3.org/2013/04/odw/). Goes over the need for institutions doing digital archiving to publish their meta-data as LOD and ensure formats round-tripping for the data
The Web is until now mostly considered to be a Web of documents, more specifically a Web of HTML pages. However, the inventor of the Web Tim Berners Lee considers the Web not to have reached its fullest potential. The Data Web and Linked Data will enable more precise search services transforming the Web into a smarter and richer Web. Google for example uses Linked Data concepts to realize its own knowledge graph to process voice commands and voice queries for users. Linked Data concepts are not limited to the public Web. They can also be used to capture private knowledge in private company Webs making them potentially applicable as the backbone for future PLM solutions.
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data21Style
MuseoTorino, is the first italian project using Web 3.0 tecnologies. NOSQL-GraphDB (Neo4J), RDFa, Linked Open Data.
MuseoTorino is a 21style (www.21-style.com) project for the municipality of Torino, Italy.
These slides come from CodeMotion, the best Italian conference for developers and IT entusiast !
The Bounties of Semantic Data Integration for the Enterprise Ontotext
If you are looking for solutions that allow you not only to manage all of your data (structured, semi-structured and unstructured) but to also make the most out of them, using a common language is critical.
Adding Semantic Technology to data integration is the glue that holds together all your enterprise data and their relationships in a meaningful way.
Learn how you can quickly design data processing jobs and integrate massive amounts of data and see what semantic integration can do for your data and your business.
www.ontotext.com
XML (eXtensible Markup Language) é uma recomendação da W3C para gerar linguagens de marcação para necessidades especiais.
É um dos subtipos da SGML (acrônimo de Standard Generalized Markup Language ou Linguagem Padronizada de Marcação Genérica) capaz de descrever diversos tipos de dados. Seu propósito principal é a facilidade de compartilhamento de informações através da internet.
DBPedia past, present and future - Dimitris Kontokostas. Reveals recent developments in the Linked Data and knowledge graphs field and how DBPedia progress with wikipedia data.
The Power of Semantic Technologies to Explore Linked Open DataOntotext
Atanas Kiryakov's, Ontotext’s CEO, presentation at the first edition of Graphorum (http://graphorum2017.dataversity.net/) – a new forum that taps into the growing interest in Graph Databases and Technologies. Graphorum is co-located with the Smart Data Conference, organized by the digital publishing platform Dataversity.
The presentation demonstrates the capabilities of Ontotext’s own approach to contributing to the discipline of more intelligent information gathering and analysis by:
- graphically explorinh the connectivity patterns in big datasets;
- building new links between identical entities residing in different data silos;
- getting insights of what type of queries can be run against various linked data sets;
- reliably filtering information based on relationships, e.g., between people and organizations, in the news;
- demonstrating the conversion of tabular data into RDF.
Learn more at http://ontotext.com/.
A modern resource view for tabular data.
This talks shows a modern drop-in replacement for the current default Recline.js based table view in CKAN, which is well beyond a normal table viewer.
Within the Islandora context there have been a number of ways paged content has been handled. One of the simplest approaches to paged content is to store it as a PDF document. This continues to be a viable option in Islandora and is supported by a number of solution packs including the PDF Solution Pack. A more sophisticated and preservation friendly approach to paged content is to treat pages as individual digital objects that are related to a parent object. The Book Solution Pack uses this model and it can be repurposed to satisfy many paged content models including journal content by modifying the metadata required. For instance, the Newspaper Solution Pack shares much of the same code as the Book Solution pack, but metadata requirements and the viewer are different.
A talk given at the Ingenta Publisher Forum, in November 2008.
See: http://www.ldodds.com/blog/archives/000264.html
For a detailed description of the talk.
Now I See You, Now I Understand You - New Web SemanticsRicardo Castelhano
My talk about Web Semantics, the new HTML5 structure tags, the usage of microdata and rdfa lite, choosing vocabularies/taxonomies and the schema.org project.
A comparison between Relational and Non relational Databases.
The full article is available at this link: https://towardsdatascience.com/relational-vs-non-relational-databases-f2ac792482e3
Presentation given at ODW2013 (http://www.w3.org/2013/04/odw/). Goes over the need for institutions doing digital archiving to publish their meta-data as LOD and ensure formats round-tripping for the data
The Web is until now mostly considered to be a Web of documents, more specifically a Web of HTML pages. However, the inventor of the Web Tim Berners Lee considers the Web not to have reached its fullest potential. The Data Web and Linked Data will enable more precise search services transforming the Web into a smarter and richer Web. Google for example uses Linked Data concepts to realize its own knowledge graph to process voice commands and voice queries for users. Linked Data concepts are not limited to the public Web. They can also be used to capture private knowledge in private company Webs making them potentially applicable as the backbone for future PLM solutions.
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data21Style
MuseoTorino, is the first italian project using Web 3.0 tecnologies. NOSQL-GraphDB (Neo4J), RDFa, Linked Open Data.
MuseoTorino is a 21style (www.21-style.com) project for the municipality of Torino, Italy.
These slides come from CodeMotion, the best Italian conference for developers and IT entusiast !
The Bounties of Semantic Data Integration for the Enterprise Ontotext
If you are looking for solutions that allow you not only to manage all of your data (structured, semi-structured and unstructured) but to also make the most out of them, using a common language is critical.
Adding Semantic Technology to data integration is the glue that holds together all your enterprise data and their relationships in a meaningful way.
Learn how you can quickly design data processing jobs and integrate massive amounts of data and see what semantic integration can do for your data and your business.
www.ontotext.com
XML (eXtensible Markup Language) é uma recomendação da W3C para gerar linguagens de marcação para necessidades especiais.
É um dos subtipos da SGML (acrônimo de Standard Generalized Markup Language ou Linguagem Padronizada de Marcação Genérica) capaz de descrever diversos tipos de dados. Seu propósito principal é a facilidade de compartilhamento de informações através da internet.
In this lecture we analyze key-values databases. At first we introduce key-value characteristics, advantages and disadvantages.
Then we analyze the major Key-Value data stores and finally we discuss about Dynamo DB.
In particular we consider how Dynamo DB: How is implemented
1. Motivation Background
2. Partitioning: Consistent Hashing
3. High Availability for writes: Vector Clocks
4. Handling temporary failures: Sloppy Quorum
5. Recovering from failures: Merkle Trees
6. Membership and failure detection: Gossip Protocol
Apresentação de Alex Martins e Laercio de Souza. Estudantes de Sistemas para Internet. Agradecimento as pessoas que apoiaram no desenvolvimento da apresentação. Faltou algumas referências. Mas para ficar a disposição para vocês.
In this lecture we analyze document oriented databases. In particular we consider why there are the first approach to nosql and what are the main features. Then, we analyze as example MongoDB. We consider the data model, CRUD operations, write concerns, scaling (replication and sharding).
Finally we presents other document oriented database and when to use or not document oriented databases.
NativeX (formerly W3i) recently transitioned a large portion of their backend infrastructure from MS SQL Server to Apache Cassandra. Today, its Cassandra cluster backs its mobile advertising network supporting over 10 million daily active users producing over 10,000 transactions per second with an average database request latency of under 2 milliseconds. Going from relational to noSQL required NativeX's engineers to re-train, re-tool and re-think the way it architects applications and infrastructure. Learn why Cassandra was selected as a replacement, what challenges were encountered along the way, and what architecture and infrastructure were involved in the implementation.
This ppt explain about choosing your NoSQL database. This also contains factors which needs to be consider while choosing NoSQL database. Thanks Arun Chandrasekaran(https://www.linkedin.com/profile/view?id=AAMAAAQKxWsB9tkk7s2ll2T2BvLvR9QDv_OdJXs&trk=hp-identity-name) for helping me.
What is NoSQL? NoSQL describes a family of approaches to managing data at an enterprise level that have key similarities, but - at the same time - are very different from classic SQL based relational databases.
NoSQL has emerged as a 'movement' over the last 5 years and many specific noSQL datastores - Mongo, Redis, HBase, Cassandra, Neo4J - are being used for mission critical systems by many organizations including Facebook, LinkedIn, Dropbox, American Express, NSA, & the CIA. Does NoSQL spell the end of SQL based relational datastores like Oracle, MySQL, SQLServer, & Sybase? Definitely not, but the world is moving in the direction of "Polyglot Persistence" and away from the "Relational Persistence" hegemony. In my presentation I will explain why this shift is occurring and will speculate about what the future will hold.
This deck talks about the basic overview of NoSQL technologies, implementation vendors/products, case studies, and some of the core implementation algorithms. The presentation also describes a quick overview of "Polyglot Persistency", "NewSQL" like emerging trends.
The deck is targeted to beginners who wants to get an overview of NoSQL databases.
Similar to Improvement of no sql technology for relational databases v2 (20)
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Honest Reviews of Tim Han LMA Course Program.pptxtimhan337
Personal development courses are widely available today, with each one promising life-changing outcomes. Tim Han’s Life Mastery Achievers (LMA) Course has drawn a lot of interest. In addition to offering my frank assessment of Success Insider’s LMA Course, this piece examines the course’s effects via a variety of Tim Han LMA course reviews and Success Insider comments.
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Francesca Gottschalk - How can education support child empowerment.pptxEduSkills OECD
Francesca Gottschalk from the OECD’s Centre for Educational Research and Innovation presents at the Ask an Expert Webinar: How can education support child empowerment?
Model Attribute Check Company Auto PropertyCeline George
In Odoo, the multi-company feature allows you to manage multiple companies within a single Odoo database instance. Each company can have its own configurations while still sharing common resources such as products, customers, and suppliers.
Embracing GenAI - A Strategic ImperativePeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
Biological screening of herbal drugs: Introduction and Need for
Phyto-Pharmacological Screening, New Strategies for evaluating
Natural Products, In vitro evaluation techniques for Antioxidants, Antimicrobial and Anticancer drugs. In vivo evaluation techniques
for Anti-inflammatory, Antiulcer, Anticancer, Wound healing, Antidiabetic, Hepatoprotective, Cardio protective, Diuretics and
Antifertility, Toxicity studies as per OECD guidelines
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
Improvement of no sql technology for relational databases v2
1. Improvement of NoSQL Technology for Relational Databases TsendsurenMunkhdalai twitter: @tsendeemts
2. Contents Limitation of relational database NoSQL technology Types of NoSQL database Conclusion
3. Nowadays, statement of data Large data Some data generators Facebook photos +25TB/week Twitter +7TB/day Flickr +21GB/hour Data size is repeatedly increased every year Not structured data New kinds of applications are growing up Such as Web 2.0, Enterprise applications and Cloud computing They needed not structured data There are many no structured data generators
4. Limitation of relational database Static, normalized data schema Have to store structured data There is complex join operation Not flexible datastore Data is centralized in one place Not distributed Data overflowing Nothing
5. NoSQL technology NoSQL: Not Only SQL Handle huge amount of data at full speed Distributed Natively support clustering Have Map/Reduce mechanism Support replication and sharding Schema free More flexible Have hashing and B-tree indexing There are four types of NoSQL databases
6. Distributed: NoSQL database Support replication and sharding Map/Reduce mechanism Similarity, parallel processing Sharding/Partitioned res res Big Result res job Big Job Data 1 2 3 4 5 6 job job replication 3 4 5 6 1 2
7. Types of NoSQL database 1/4 Key-Value database Stores value based on its key Designed to handle massive load Data model: Collection of Key-Value pairs Given key, get value Data hashing indexed Some systems do that automatically Good for Cashe aside Simple, id based interactions
8. Types of NoSQL database 2/4 Column oriented database Column oriented Relational database Tables similarly to RDBMS, but handles semi-structured Each row can have a different number of columns Table is sparse Columns are dynamic
9. Types of NoSQL database 3/4 Graph database These store data structure as graph Focus on modeling the structure of data Represent complex relation between objects as graph Data model: Nodes, relationships between theirs Each node can have key/value properties C P A
10. Types of NoSQL database 4/4 Document database Stores data as document More complex Key-Value database Data model: Collection of Key-Value, collections as JSON or XML types document { “name” : “Lady Gaga”, “ssn” : “213445”, “hobbies” : [“Dressing up”, “Singing”], “albums” : [{“name” : “The fame” “release_year” : “2008”}, {“name” : “Born this away” “release_year” : “2011”}] } { {….} } { {….} }
11. Some statistic Facebook search MySQL > 50 GB Data Writes Average : ~300 ms Reads Average : ~350 ms Rewritten with Cassandra (NoSQL) > 50 GB Data Writes Average : 0.12 ms Reads Average : 15 ms
13. Conclusion NoSQL databases Data process quite faster than relational database Distributed Dynamically determine new attributes Cheap Mostly, open source Have natively clustering, don’t need supercomputer (Expensive) Map/Reduce mechanism is provided Have B-tree and hashing indexing
Hello everyone, I’m tsendee from Database/Bioinformatics lab, Chungbuk national university. Welcome to today’s my presentation. I will try to talk improvement of NoSQL technology for relational databases. It is topic of my paper. Ok let’s begin.
-> I am going to introduce contents of my presentation-> First I will describe limitation of relational database, in this section we present what are there limitations of relational database for today’s data.-> NoSQL technology, this is our primary section. I will give you what is actually NoSQL what does NoSQL do that is better than relational databases.-> and there are a several types of nosql, that is presented in types of nosql section.-> Finally I will conclude to my presentation
-> We consider two things those would recently be features for data science-> Large volume of data was generated, there are a number of huge data generators Here showed some data generators, for example facebook photos are increased by twenty five terrabyteseveryweek , size of twitter database is increased by seven terrabytes per one day, so big data that is one special point for data science and data management system.-> Next thing is not structureddata there is not structured data everywhere
-> Now, I’m going to talk about limitation aspects of relational database.-> schema consists of tables and theirs relations, already we designed a schema , next time we difficulty modify the schema. So schema is hard. You have to store structured data, Your data must fit a table. After did that datastore can be more complex or not flexible-> data centralized in one place, not distributed depend on ACID property, join operation. There is one node failure. If the node fail, entire system That is big problem for developer.
-> So some problems fail in RDBMS,NoSQL technology aimed to improve relational database. That is one kind of database.-> NoSQL is standing Not Only SQL, -> Itcan handle huge amount of data at full speed. -> because such databases work well distributed over multiple nodes in a cluster.-> these are explained in next slide-> Schema free, At any time we can define new attribute for object in NoSQL database so it is more flexible-> There are four types of NoSQL databases according their data model. after some slide I will give you in more detailed
-> Sharding: big data is partitioned over individual nodes in cluster, those are connected in network-> replication: it means multiple write, same data, that is written on more than one nodes. There is not one node failure if a master computer failed then system automatically chooses another the data replicated computer.-> map/reduce mechanism consist of two phase first one is map next phase is reduce. In some case We need to process big job. We can easily do the big job by using map/reduce mechanism. This our big job Firstly, the big job is separated into a several small sections and distributed over nodes then these are processed on each nodes now we have small results finally bring to big result by combining to them, final process is called reduce the other one is map phase.