Big data and security involves managing huge amounts of data from various sources. Some key points:
- The amount of data generated annually is expected to grow exponentially to over 6.6 zettabytes by 2016. Individual companies like Facebook generate over 400 terabytes of data per day.
- Big data comes from a variety of structured and unstructured sources, and is distributed across multiple locations and systems. Both batch-based and real-time streaming approaches are used.
- Effectively organizing, analyzing, and deriving value from large, diverse datasets requires new approaches that can handle different data types and structures from many online and offline sources.
AORTA BI Solutions is specialized in implementing the Oracle BI Suite. The BI Server is the integrated platform to achieve al the necessary business requirements.
Rick van der Lans referred to it. He calls the BI Server an good example of which he calls the Data Delivery Platform.
AvePoint - Death of the FileShare, as you know it.garthluke
With files stored in myriad locations – file shares, cloud storage, and SQL Server to name a few – business users simply need one place to search and access the documents they need to complete their tasks.
Learn how to place all file share content at users’ fingertips via SharePoint 2010 – no migration, no additional storage costs, and no headaches
"A Study of I/O and Virtualization Performance with a Search Engine based on ...Lucidworks (Archived)
Documentum xPlore provides an integrated Search facility for the Documentum Content Server. The standalone search engine is based on EMC's xDB (Native XML database) and Lucene. In this talk we will introduce xPlore and some of its key components and capabilities. These include aspects of a tight integration of Lucene with the XML database: xQuery translation and optimization into Lucene query/API's as well as transactional update Lucene). In addition, xPlore is being deployed aggressively into virtualized environments (both disk I/O and VM). We cover some performance results and tuning tips in these areas.
AORTA BI Solutions is specialized in implementing the Oracle BI Suite. The BI Server is the integrated platform to achieve al the necessary business requirements.
Rick van der Lans referred to it. He calls the BI Server an good example of which he calls the Data Delivery Platform.
AvePoint - Death of the FileShare, as you know it.garthluke
With files stored in myriad locations – file shares, cloud storage, and SQL Server to name a few – business users simply need one place to search and access the documents they need to complete their tasks.
Learn how to place all file share content at users’ fingertips via SharePoint 2010 – no migration, no additional storage costs, and no headaches
"A Study of I/O and Virtualization Performance with a Search Engine based on ...Lucidworks (Archived)
Documentum xPlore provides an integrated Search facility for the Documentum Content Server. The standalone search engine is based on EMC's xDB (Native XML database) and Lucene. In this talk we will introduce xPlore and some of its key components and capabilities. These include aspects of a tight integration of Lucene with the XML database: xQuery translation and optimization into Lucene query/API's as well as transactional update Lucene). In addition, xPlore is being deployed aggressively into virtualized environments (both disk I/O and VM). We cover some performance results and tuning tips in these areas.
HP Microsoft SQL Server Data Management SolutionsEduardo Castro
In this presentation was used in the MSDN WebCast and we cover some details about the hardware offerings to run SQL Server DataWarehouse, some detail about HP Hardware is shown.
Best Regards,
Ing. Eduardo Castro Martinez
http://ecastrom.blogspot.com
This month C/D/H, with partners BA Insight and Microsoft, hosted a half-day seminar on SharePoint 2010 & FAST Search for SharePoint – and using it as a single, enterprise-wide search tool.
View C/D/H’s FAST SharePoint slide deck to see real-world examples of search-driven information portals. We’ll also show you how FAST can dramatically improve end-user productivity.
And for more on Search and other topics, visit our blog at www.cdhtalkstech.com.
Database Architechs is a database-focused consulting company for 17 years bringing you the most skilled and experienced data and database experts with a wide variety of service offering covering all database and data related aspects.
AORTA BI Solutions is specialized in implementing the Oracle BI Suite. The BI Server is the integrated information platform the suite is build on. Many people don't know it, but it's one of the best technologies Oracle has ever acquired.
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
In this one-hour webinar, Caserta Concepts and Talend described an approach to achieve an architectural framework and roadmap to extend a traditional enterprise data warehouse environment, into a Big Data ecosystem.
They illustrated the architectural components involved for collecting, analyzing and delivering Big Data, with a focus on the importance of Hadoop, Data Integration, Machine Learning, NoSQL, Business Intelligence and Analytics.
Attendees learned:
Which Big Data technologies can’t be ignored
Considerations when extending the data ecosystem
What happens to your existing investment
What are the points of integration
Does Big Data = better data?
To find access the recorded webinar or to learn more, visit http://www.casertaconcepts.com/.
Transaction-based Capacity Planning for greater IT Reliability™ webinar Metron
Do you need to predict the true impact of business growth for a specific department or product line?
Are you unsure which infrastructure items (servers and their logical software components) are serving which business applications and on which tiers response time for your transactions are taking place?
Now you can get a valuable insight into the performance across all tiers of your enterprise data center environments.
We’ll show you how you can combine business forecast information with infrastructure performance metrics and predict whether you have sufficient capacity to meet the needs of your business at both the component and service levels.
Join us and find out how the combination of Correlsense SharePath and Metron athene® will provide you with a complete Capacity Management solution
HP Microsoft SQL Server Data Management SolutionsEduardo Castro
In this presentation was used in the MSDN WebCast and we cover some details about the hardware offerings to run SQL Server DataWarehouse, some detail about HP Hardware is shown.
Best Regards,
Ing. Eduardo Castro Martinez
http://ecastrom.blogspot.com
This month C/D/H, with partners BA Insight and Microsoft, hosted a half-day seminar on SharePoint 2010 & FAST Search for SharePoint – and using it as a single, enterprise-wide search tool.
View C/D/H’s FAST SharePoint slide deck to see real-world examples of search-driven information portals. We’ll also show you how FAST can dramatically improve end-user productivity.
And for more on Search and other topics, visit our blog at www.cdhtalkstech.com.
Database Architechs is a database-focused consulting company for 17 years bringing you the most skilled and experienced data and database experts with a wide variety of service offering covering all database and data related aspects.
AORTA BI Solutions is specialized in implementing the Oracle BI Suite. The BI Server is the integrated information platform the suite is build on. Many people don't know it, but it's one of the best technologies Oracle has ever acquired.
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
In this one-hour webinar, Caserta Concepts and Talend described an approach to achieve an architectural framework and roadmap to extend a traditional enterprise data warehouse environment, into a Big Data ecosystem.
They illustrated the architectural components involved for collecting, analyzing and delivering Big Data, with a focus on the importance of Hadoop, Data Integration, Machine Learning, NoSQL, Business Intelligence and Analytics.
Attendees learned:
Which Big Data technologies can’t be ignored
Considerations when extending the data ecosystem
What happens to your existing investment
What are the points of integration
Does Big Data = better data?
To find access the recorded webinar or to learn more, visit http://www.casertaconcepts.com/.
Transaction-based Capacity Planning for greater IT Reliability™ webinar Metron
Do you need to predict the true impact of business growth for a specific department or product line?
Are you unsure which infrastructure items (servers and their logical software components) are serving which business applications and on which tiers response time for your transactions are taking place?
Now you can get a valuable insight into the performance across all tiers of your enterprise data center environments.
We’ll show you how you can combine business forecast information with infrastructure performance metrics and predict whether you have sufficient capacity to meet the needs of your business at both the component and service levels.
Join us and find out how the combination of Correlsense SharePath and Metron athene® will provide you with a complete Capacity Management solution
As new technologies emerge, it can be difficult to identify the benefits of the many different options available. In an effort to understand the NOSQL options better, specifically graph databases, Objectivity, Inc. has formed an internal Performance Center to evaluate the features, performance and functionality of different graph database solutions that are available today. This webinar will focus on understanding the complementary nature, use cases and value of graph databases for “Big Data” solutions. Please join us with guest speaker Noel Yuhanna, Principal Analyst serving Enterprise Architecture Professionals, Forrester Research Inc, for an overview of the NOSQL market and Brian Clark, Vice President Objectivity, presenting an overview of initial Performance Center Findings.
Guest Speaker:
Noel Yuhanna
Principal Analyst serving Enterprise Architecture Professionals, Forrester Research, Inc.
Noel serves Enterprise Architecture Professionals. He primarily covers database management systems (DBMSes), infrastructure-as-a-service (IaaS), data replication and integration, data security, data management tools, and related online transaction processing issues. His current primary research focus is on customer usage experiences and broad industry trends of DBMS, IaaS, data security, enterprise data grids, outsourcing, information life-cycle management, open source databases, and other emerging database technologies.
Presenter:
Brian Clark
Corporate Vice President, Objectivity
Brian Clark has nearly 30 years of software and technology experience, and was one of the early architects of Objectivity/DB. Before joining Objectivity, Brian worked at Automation Technology Products, providing leading tools in the MCAD market. Prior to that, he was with Project Management Services at International Computers Limited, one of Europe’s leading computer companies at the time. Brian holds a B.S
View the webinar at: https://attendee.gotowebinar.com/recording/5730303120063488770
Webinar 3/12/14: Using Social Media to Drive ValueInfiniteGraph
Social networks are everywhere. Realize value from publicly available social relationships and connections to understand customer preferences, behaviors and buying patterns. This webinar presentation explores key consumer analytics use-cases and the connection platform enabling real-time, relevant customer analytics data.
NoSQL Simplified: Schema vs. Schema-lessInfiniteGraph
A look at the many facets of schema-less approaches vs a rich schema approach, ranging from performance and query support to heterogeneity and code/data migration issues. Presented by Leon Guzenda, Founder, Objectivity
The Value of Explicit Schema for Graph Use CasesInfiniteGraph
A look at the many facets of schema-less approaches vs a rich schema approach, ranging from performance and query support to heterogeneity and code/data migration issues. Presented by Nick Quinn, Principal Engineer, InfiniteGraph
Solution Use Case Demo: The Power of Relationships in Your Big DataInfiniteGraph
In this security solution demo, we have integrated Oracle NoSQL DB with InfiniteGraph to demonstrate the power of using the right tools for the solution. By integrating the key value technology of Oracle with the InfiniteGraph distributed graph database, we are able to create new views of existing Call Detail Record (CDR) details to enable discovery of connections, paths and behaviors that may otherwise be missed.
Discover how to add value to your existing Big Data to increase revenues and performance!
In this security solution demo, we have integrated Oracle NoSQL DB with InfiniteGraph to demonstrate the power of using the right tools for the solution. By integrating the key value technology of Oracle with the InfiniteGraph distributed graph database, we are able to create new views of existing Call Detail Record (CDR) details to enable discovery of connections, paths and behaviors that may otherwise be missed.
Discover how to add value to your existing Big Data to increase revenues and performance!
Objectivity/DB: A Multipurpose NoSQL DatabaseInfiniteGraph
The speakers will describe the flexible configuration possibilities that Objectivity/DB provides, with an emphasis on how best to distribute data across multiple storage nodes. The session will start by describing the distributed processing architecture of Objectivity/DB before covering the new Placement Manager features. The speakers will also describe how Objectivity/DB compares and contrasts with other NoSQL solutions.
In 2013:
- 1.4 Trillion digital interactions happen per month.
- 2.9 million emails are sent every second.
- 72.9 products are ordered on Amazon per second.
That is a lot of connected data, graphs are truly everywhere. Companies are finding that graph database technology is helping them make sense of their big data.
Objectivity’s Nick Quinn, Chief Architect of InfiniteGraph, shows us just how popular graph databases have become and where they are being used, as well as showing us the ins and outs.
Do you want to build technology that does great things with big data? You might want to find out what your colleagues are Tweeting about, make recommendations for apps, music or other retail that result in higher purchase rates, discover hidden connections between new and recorded medical research data, or maybe even leverage intel across government agencies to catch the bad guys.
All this is possible with a graph database.
This tutorial will provide you with a basic understanding of graph database technology and the ability to quickly begin development of a graph database application. You will have the capability to recognize graph-based problems and present the benefits of using graph technology for problem resolution.
The tutorial will give you an understanding of:
• Graph theory - origins and concepts
• Benefits of graph databases
• Different types of graph databases
• Typical graph database API
• Programming basics
• Use cases
Bring your laptops for a hands-on opportunity to practice some sample codes. A basic understanding of Java programming is a recommended prerequisite to understand this course. This session is led by the InfiniteGraph technical team and the demonstration code will be drawn from InfiniteGraph examples, however the broader educational presentation is product-neutral and not a commercial presentation of their products.
To participate in the hands-on portion of the graph tutorial users must have:
• Java programming experience
• Java Developer Kit (JDK)
• Current InfiniteGraph installed on laptop. (To download visit www.objectivity.com/infinitegraph)
• HelloGraph test – Upon installing IG, run HelloGraph to test the install. (HelloGraph can be found online at http://wiki.infinitegraph.com/2.1/w/index.php?title=Download_Sample_Code)
Leon Guzenda was one of the founding members of Objectivity in 1988 and one of the original architects of Objectivity/DB. He currently works with Objectivity's major customers to help them effectively develop and deploy complex applications and systems that use the industry's highest-performing, most reliable DBMS technology, Objectivity/DB. He also liaises with technology partners and industry groups to help ensure that Objectivity/DB remains at the forefront of database and distributed computing technology. Leon has more than 35 years experience in the software industry. At Automation Technology Products, he managed the development of the ODBMS for the Cimplex solid modeling and numerical control system. Before that, he was Principal Project Director for International Computers Ltd. in the United Kingdom, delivering major projects for NATO and leading multinationals. He was also design and development manager for ICL's 2900 IDMS product. He spent the first 7 years of his career working in defense and government systems. Leon has a B.S. degree in Electronic Engineering from the University of Wales.
Using A Distributed Graph Database To Make Sense Of Disparate Data StoresInfiniteGraph
Presented at DataWeek SF Oct 13
Most analytics depend on data-mining and statistical correlation of information held in single data stores. It is generally inefficient to replicate diverse data, which may be stored in enterprise databases or NoSQL "Big Data" repositories and consolidate them using a single database technology. Although federated queries can help with statistical correlation of data values across data stores the technique is not very good at handling the data stored in relationships because the data stores generally have no knowledge of one another. The speaker describes a different approach that uses graph (relationship) analytics to extract structural data from existing repositories, store representations of the nodes and connections in a graph database, then analyze them to extract additional value.
Turning Big Data into Smart Data with Graph TechnologiesInfiniteGraph
join Objectivity, Inc.’s, Nick Quinn in a discussion of the latest trends in Big Data Analytics, defining what “Big Data” is and understanding how to maximize your existing architectures by utilizing NOSQL technologies to improve functionality and provide real-time results.
How Graph Database technology, like InfiniteGraph, can support complex relationship analytics problems.
How to turn your Big Data into Smart Data.
How to develop applications with significant time-to-market advantages and technical cost savings.
NoSQL Technology and Real-time, Accurate Predictive AnalyticsInfiniteGraph
Big Data: NoSQL Technology and Real-time, Accurate Predictive Analytics
Enjoy this insightful webinar moderated by Matt Aslett, Research Director at 451 Group beginning with a brief overview of Objectivity, Inc. and its products Objectivity/DB, a world class object database and InfiniteGraph, the enterprise proven, scalable and distributed graph database with deployments across multiple major verticals including government, telecom, finance, security, and social networking. Learn how Georgetown University is taking advantage of Objectivity’s products to develop one of the most interconnected databases today. Examining information from all types of sources worldwide in real-time.
J.C. Smart, Director Global Insight Laboratory, Georgetown University- Coming Soon
Leon Guzenda, Founder, Objectivity – a founding member of Objectivity, Inc. in 1988, one of the original architects of Objectivity/DB and Chief Technology Officer. He now consults with the company and works with Objectivity’s Big Data and Analytics customers/partners to deploy Objectivity/DB and InfiniteGraph, a high performance, scalable graph database.
Matt Aslett, Research Director, 451 Group – As Research Director for data management and analytics within 451 Research’s Information Management practice, Matt has overall responsibility for the coverage of operational and analytic databases, data integration, data quality, and business intelligence. Matt’s own primary area of focus is on relational and non-relational databases, data warehousing, data caching, and Hadoop. Matthew is also an expert in open source software and regularly contributes to 451 Research’s open source-related research.
How we Learned to Stop Worrying and Solve the Distributed Graph ProblemInfiniteGraph
Graphs, what are they and why?
Graph Data Management. Why do we need it?
Problems in Distributed Graph
How we solved the problems?
Finding the value in Big Data.
Everything Goes Better With Bacon: Revisiting the Six Degrees Problem with a ...InfiniteGraph
The six degrees problem is a classic party game and typical use-case for the power and efficiency of graph databases. But even with a powerful graph database, a complex ecosystem of data like IMDB (Internet Movie Database) can return a dizzying amount of data within six degrees of separation from the source. With this amount of data, how do you draw business value from large sets of highly connected data? In this session, we will discuss some powerful strategies for using a distributed graph database, to perform analysis to derive business value from highly connected, complex data sets using navigational queries and visualization.
Oracle NoSQL DB & InfiniteGraph - Trends in Big Data and Graph TechnologyInfiniteGraph
Join Oracle NoSQL DB and InfiniteGraph development teams in a discussion of the latest trends in Big Data and Graph Technology. Learn what Oracle’s view of Big Data is and how Oracle NoSQL Database technologies enable you to manage vast amounts of real-time key-value data.
Join Objectivity, Inc.’s VP of Product Management, Brian Clark, in a discussion of the latest trends in Big Data Analytics, defining what is Big Data and understanding how to maximize your existing architectures by utilizing NOSQL technologies to improve functionality and provide real-time results. There will be a focus on relationship analytics as well as an introduction to NOSQL data stores, object and graph databases, such as the architecture behind Objectivity/DB and InfiniteGraph.
NOSQL Now! Presentation, August 24, 2011: Graph Databases: Connecting the Dot...InfiniteGraph
Darren Wood is the Architect and Lead Developer of InfiniteGraph, the distributed graph database, produced by Objectivity, Inc. Darren has spent the majority of his career architecting and building distributed systems with an emphasis on elastic scalability and data management. Prior to joining Objectivity, Inc. in 2007, Darren held positions as a Senior Consultant with IONA Technologies and a Development Team Lead for Citect Australia. Darren holds a First Class Honors Degree in Computer Systems Engineering from the University of Technology in Sydney, Australia.
3. About 11850 Amps to generate
around 8.4 Tesla fields (about
150000 times the earth
magnetic field) but they
operate at low Voltage
A lot of what LHC is about is electricity flow management
4. How BIG?
BIG data is like the LHC combined with gold
extraction
- Huge amount of data -> 6.6 Zettabytes/year by 2016 (Cisco
Cloud Index)
- Big flow of data -> 400TB/day (Facebook)
- LHC generates 10-15 Petabytes/year of data for each
experiment
5. The essence of new service
providers BI Based Revenue Models
(eg Advertisement)
User
Core Semantic
Improves Consumes
experience
Data Set
Mindmap
Revenue from
Value enriched Data
existing services
generates
revenue Data Service will shrink
Service
Produces Service
Additional
revenue from
new services
The more context
the more efficient and
One data set Many free services
the more value and common semantic
Example:
Search/Information Management :
Rated auction/Selling:
6. Classic Approach
• Structured Data
• Data in the range of Gigabytes to Terabytes
• Centralized (Data is imported in analytics)
• Batch based
• Data silos
ETL ETL ETL
Transaction Relational Data Analyse
Database Warehouse
Where is the data that answer my questions ?
7. Big Data Approach
• Multi Structured Data
• Data in the range of Terabytes to Petabytes
• Distributed/Federated (Analytics grab the data)
• Streaming based
• Holistic Data Clusters
1
Stream 2
Organize Analyse
3
n
Here are the questions and the data for the answers
8. A new pattern
• Many different data structures
• Many different ways to extract the data
Knowledge • Structured
• Many different locations (even for the
References
API
Services Content
Sources
Applications
Social Networks
Buffering
same type of data) • Proprietary
•
RAN
Graph
• Batch and Realtime based
Data card
•
Data as a
Service
Neural Network
• Buffered or stream
Sim Card Premise Network Core
•
Connected Things
Connected
(Consumer, Enterprise)
Gateway
Relational
• Correlation parameters • Unstructured
Devices IT Infrastructure
Consumption
Buffering
Report
Statistics
• Streaming
• Taping at Source
Real-time
Cheap Storage High Efficient Storage
Low level Semantic
• Buffering, Routing, Filtering • Taping on Stream
• Structured/Unstructured • Consumption to
Stream
Graph
Network/
store Source Analysis
• Event Collector
• Batch Process/Multi
Non Real-time
Rich Semantic
Structure Stream
• Multi Stage Store/Process Neura l
Network/
Analysis
9. With added security
Knowledge
References
API
Services Content
Sources
Social Networks
Applications
• Strong access
RAN
control based
Data as a
Service
Data card
Sim Card
Connected Things
(Consumer, Enterprise)
Premise
Gateway
Network Core
on industry
Connected
Devices IT Infrastructure
standard
Consumption
(user, dev, app
lication)
Report
Statistics
• Securing the infrastructure (public, private) • Strong
• Policy (internal/external) authorization
• On-going assessment (DDOS, Penetration …) control based
• Data leakage
•
on open
Stream
Migration Graph
standard
Network/
• Securing the identity
Analysis
• Validating ID • Analytics
• Anonymization applied to
• Securing the access Analytics
• Distributed permission/preference
• 3rd party permission Neura l
Network/
Analysis
10. Final thoughts
1. We need to eliminate the silos
– Sources or Usage
2. Still very much a collection of technologies
– The assembly is still very complex
3. Is everything about events?
4. We need to handle the CAP theorem more appropriately
5. What is the user experience (not just the end user but also
the admin)