Infinispan is a distributed, scalable, and transactional data grid that can be used as a NoSQL key-value store. It supports indexing and querying of data through integration with Apache Lucene. Queries can be executed on the data grid to search for objects by fields or perform more complex searches. Infinispan also supports MapReduce-style processing on the data grid. Hibernate Search leverages Infinispan to provide full-text search capabilities for Hibernate entities in a clustered environment.
This document discusses JBoss Web Services (JBoss WS), which provides a web services framework that integrates multiple Java web services stacks like Apache CXF and Metro on top of JBoss Application Server versions. It summarizes the benefits of standards and specifications that JBoss WS supports and how its integration of other web services stacks benefits the JBoss community and integrated projects by allowing choices, joint support, and focus on added value rather than reimplementing existing functionality. It also provides overviews of JBoss WS's management, configuration, features, application server integration, and common testing suite.
This document provides an overview of using Python and MongoDB together. It discusses MongoDB concepts and architecture, how to get started with MongoDB using the interactive console, and basic CRUD operations. It then covers installing and using PyMongo, the main Python driver for MongoDB, and some popular high-level Python frameworks built on top of PyMongo like MongoEngine and MongoAlchemy.
CQL performance with Apache Cassandra 3.0 (Aaron Morton, The Last Pickle) | C...DataStax
The 3.0 storage engine re-write is the biggest and most exciting change to ever happen in Apache Cassandra. The new storage engine can efficiently store and read data from disk using the same concepts present in the CQL 3 language. This has delivered large space savings, and creates new performance characteristics.
In this talk Aaron Morton, Co Founder at The Last Pickle and Apache Cassandra Committer, will discuss the 3.0 storage engine, it's layout and performance characteristics.
About the Speaker
Aaron Morton CEO, The Last Pickle
Aaron Morton is the Co Founder & CEO at The Last Pickle (thelastpickle.com). A professional services company that works with clients to deliver and improve Apache Cassandra based solutions. He's based in New Zealand, is an Apache Cassandra Committer and a DataStax MVP for Apache Cassandra.
The document discusses frustrations experienced using Scala in a large project that ported a Java web application to Scala, including long compilation times, issues with ScalaIDE and sbt, and limitations of frameworks like Anorm; it then provides solutions the author developed to address these challenges, such as libraries to improve validation, ORM usage, and integration with Play frameworks.
A presentation to introduce the Lobos project made at the Bonjure group meeing on 2011/01/21. For more information on Lobos, visit the website: http://budu.github.com/lobos/
SenchaCon 2016: Modernizing the Ext JS Class System - Don GriffinSencha
JavaScript is advancing and ES2015 (formerly ES6) is the foundation of its future. Sencha is committed to delivering cutting-edge technology for your applications, and supporting the evolution of JavaScript is a central part of that commitment. The expansive feature set of ES2015 formally enables coding paradigms: modules to better organize your code, classes to cleanly declare reusable units of functionality, and so much more. See how Ext JS is embracing these new language and toolset features, and how they will expand your development horizons.
The document discusses cloning Twitter using HBase. It describes some key features of Twitter like allowing users to post status updates, follow other users, mention users, and re-tweet posts. It then provides an overview of HBase including its features like consistency, automatic sharding and failover. It discusses how to install HBase in single node, pseudo-distributed and fully distributed modes using Docker. It also demonstrates some common HBase shell commands like creating and listing tables, putting and getting data. Finally, it discusses how to model the user, tweet, follower and following relationships in HBase.
Cassandra 3.0 - JSON at scale - StampedeCon 2015StampedeCon
This session will explore the new features in Cassandra 3.0, starting with JSON support. Cassandra now allows storing JSON directly to Cassandra rows and vice versa, making it trivial to deploy Cassandra as a component in modern service-oriented architectures.
Cassandra 3.0 also delivers other enhancements to developer productivity: user defined functions let developers deploy custom application logic server side with any language conforming to the Java scripting API, including Javascript. Global indexes allow scaling indexed queries linearly with the size of the cluster, a first for open-source NoSQL databases.
Finally, we will cover the performance improvements in Cassandra 3.0 as well.
This document discusses JBoss Web Services (JBoss WS), which provides a web services framework that integrates multiple Java web services stacks like Apache CXF and Metro on top of JBoss Application Server versions. It summarizes the benefits of standards and specifications that JBoss WS supports and how its integration of other web services stacks benefits the JBoss community and integrated projects by allowing choices, joint support, and focus on added value rather than reimplementing existing functionality. It also provides overviews of JBoss WS's management, configuration, features, application server integration, and common testing suite.
This document provides an overview of using Python and MongoDB together. It discusses MongoDB concepts and architecture, how to get started with MongoDB using the interactive console, and basic CRUD operations. It then covers installing and using PyMongo, the main Python driver for MongoDB, and some popular high-level Python frameworks built on top of PyMongo like MongoEngine and MongoAlchemy.
CQL performance with Apache Cassandra 3.0 (Aaron Morton, The Last Pickle) | C...DataStax
The 3.0 storage engine re-write is the biggest and most exciting change to ever happen in Apache Cassandra. The new storage engine can efficiently store and read data from disk using the same concepts present in the CQL 3 language. This has delivered large space savings, and creates new performance characteristics.
In this talk Aaron Morton, Co Founder at The Last Pickle and Apache Cassandra Committer, will discuss the 3.0 storage engine, it's layout and performance characteristics.
About the Speaker
Aaron Morton CEO, The Last Pickle
Aaron Morton is the Co Founder & CEO at The Last Pickle (thelastpickle.com). A professional services company that works with clients to deliver and improve Apache Cassandra based solutions. He's based in New Zealand, is an Apache Cassandra Committer and a DataStax MVP for Apache Cassandra.
The document discusses frustrations experienced using Scala in a large project that ported a Java web application to Scala, including long compilation times, issues with ScalaIDE and sbt, and limitations of frameworks like Anorm; it then provides solutions the author developed to address these challenges, such as libraries to improve validation, ORM usage, and integration with Play frameworks.
A presentation to introduce the Lobos project made at the Bonjure group meeing on 2011/01/21. For more information on Lobos, visit the website: http://budu.github.com/lobos/
SenchaCon 2016: Modernizing the Ext JS Class System - Don GriffinSencha
JavaScript is advancing and ES2015 (formerly ES6) is the foundation of its future. Sencha is committed to delivering cutting-edge technology for your applications, and supporting the evolution of JavaScript is a central part of that commitment. The expansive feature set of ES2015 formally enables coding paradigms: modules to better organize your code, classes to cleanly declare reusable units of functionality, and so much more. See how Ext JS is embracing these new language and toolset features, and how they will expand your development horizons.
The document discusses cloning Twitter using HBase. It describes some key features of Twitter like allowing users to post status updates, follow other users, mention users, and re-tweet posts. It then provides an overview of HBase including its features like consistency, automatic sharding and failover. It discusses how to install HBase in single node, pseudo-distributed and fully distributed modes using Docker. It also demonstrates some common HBase shell commands like creating and listing tables, putting and getting data. Finally, it discusses how to model the user, tweet, follower and following relationships in HBase.
Cassandra 3.0 - JSON at scale - StampedeCon 2015StampedeCon
This session will explore the new features in Cassandra 3.0, starting with JSON support. Cassandra now allows storing JSON directly to Cassandra rows and vice versa, making it trivial to deploy Cassandra as a component in modern service-oriented architectures.
Cassandra 3.0 also delivers other enhancements to developer productivity: user defined functions let developers deploy custom application logic server side with any language conforming to the Java scripting API, including Javascript. Global indexes allow scaling indexed queries linearly with the size of the cluster, a first for open-source NoSQL databases.
Finally, we will cover the performance improvements in Cassandra 3.0 as well.
This document provides an overview of Database Jones, a Node.js API for highly scalable database access to MySQL Cluster. It introduces J.D. Duncan and Craig Russell, the creators of Database Jones, and describes how Database Jones provides an asynchronous JavaScript API that can be used with MySQL Cluster and other databases. It also summarizes the key features and capabilities of Database Jones, including its data modeling approaches, operations, and usage with Node.js applications.
Slides for presentation on Cloudera Impala I gave at the DC/NOVA Java Users Group on 7/9/2013. It is a slightly updated set of slides from the ones I uploaded a few months ago on 4/19/2013. It covers version 1.0.1 and also includes some new slides on HortonWorks' Stinger Initiative.
This document provides an overview of MySQL Cluster and NoSQL. It discusses how to set up nodes in a multi-node MySQL Cluster, including connecting to the network and firewall configuration. It also outlines the tutorial agenda, which will first cover deploying a MySQL Cluster and then developing applications using ClusterJ, Memcache, and Node.js connectors. Presenter biographies and a high-level introduction to database concepts, MySQL Cluster architecture, and the basics of MySQL Cluster are also included.
The document discusses JavaFX Mobile and provides an overview of its features and current status. It summarizes a speaker's presentation on designing a JavaFX Mobile application and examples shown, including calling JME code, posting content via HTTP requests, and loading resources via binding. While JavaFX Mobile was promising in 2009-2010, its future is uncertain given the dominance of Android and iOS. The speaker plans to research porting their application to Android and JME to catch platforms not adopting newer technologies.
Learn how to develop with Couchbase Lite for .NET. The session will include a look at the development environment and required C# APIs using a walkthrough of a demo app.
This document provides an overview of Elasticsearch and how to use it with .NET. It discusses what Elasticsearch is, how to install it, how Elasticsearch provides scalability through its architecture of clusters, nodes, shards and replicas. It also covers topics like indexing and querying data through the REST API or NEST client for .NET, performing searches, aggregations, highlighting hits, handling human language through analyzers, and using suggesters.
We describe the features of Oak Lucene indexes and how they can be used to get your queries perform better. In the second part we will talk about how asynchronous indexing works in general and how it can be monitored.
This was presented as part of AEM Gem Series -http://dev.day.com/content/ddc/en/gems/oak-lucene-indexes.html
Omnisearch in AEM 6.2 - Search All the ThingsJustin Edelson
This document provides an overview of Omnisearch in AEM 6.2, including how to use it, add new locations, and implement a custom Omnisearch handler. Omnisearch provides a unified search experience across AEM authoring tools and content. Key points covered include how to build queries for search results, suggestions, and spell check, as well as configure the UI display of search locations and results. The presentation includes code examples for implementing a custom Omnisearch handler that searches content fragments. It concludes with a demonstration and overview of potential future enhancements.
Scalable XQuery Processing with Zorba on top of MongoDBWilliam Candillon
Since a couple of years, the NoSQL movement has developed a variety of open-source document stores. Most of them focus on high availability, horizontal scalability, and are designed to run on commodity hardware. These products have gained great traction in the industry to store large amounts of flexible data (mostly JSON). In the meantime, XQuery has evolved to a standardized, full-fledged programming language for XML with native support for complex queries, indexes, updates, full-text search, and scripting. Moreover, JSON has recently been added as a first-level datatype into the language. As of today, it is without doubt the most robust and productive technology to process flexible data.
The aim of this talk is to showcase the benefits that can be achieved by integrating the Zorba XQuery Processor with MongoDB. We will introduce the 28msec platform that seamlessly stores, indexes, and manages flexible data entirely in XQuery. The data itself is stored in MongoDB. The platform leverages MongoDB’s indexes, sharding, and consistency guarantees to scale-out horizontally. The talk will conclude by showing a benchmark of the platform and discuss perspectives of the outlined approach.
Getting started with Elasticsearch and .NETTomas Jansson
This document provides an introduction to using Elasticsearch with .NET and the NEST client library. It demonstrates how to install and configure Elasticsearch on Windows, define mappings and index documents using NEST, perform various queries including fuzzy, highlighted, faceted queries, and add filters. NEST provides a strongly typed, fluent abstraction over Elasticsearch that allows querying and filtering Elasticsearch in a similar manner to Elasticsearch JSON queries. The document encourages attendees to view the demo code on GitHub for examples of indexing, querying, highlighting, faceting, and filtering documents using NEST.
Slides accompanying a presentation on Dropwizard I gave at the DevIgnition conference ( www.devignition.com ) on April 29, 2016. The sample code is on GitHub at https://github.com/sleberknight/dropwizard-devignition-2016
The document provides information about new features in Cassandra 2.2 and 3.0, including materialized views. Materialized views allow data to be pre-computed and denormalized to relieve the pain of manual denormalization. Materialized views are implemented by taking a write lock on the base table partition, reading the current values, constructing a batch log of delete and insert operations for the view, executing this asynchronously on the view replica, and then applying the base table update locally. This allows views to be kept in sync with the base table in an efficient manner.
This document discusses best practices for completing pragmatic Plone projects based on lessons learned from a large Plone project for Weishaupt, a major heating systems company. It addresses common project constraints like limited budgets, resources and time. It provides tips for getting work done on schedule and budget through customizations, patching, overriding and extending existing Plone and third-party packages. Sample content and pre-defined site structures are also recommended to help designers. The key lessons are to contribute fixes and enhancements back to packages when possible and always keep custom code separate from third-party packages.
MongoDB Munich 2012: MongoDB for official documents in BavariaMongoDB
Christian Brensing, Senior Developer, State of Bavaria
The Bavarian government runs a document template application (RTF or ODF with Groovy, Python, Ruby or Tcl as scripting language) serving different government offices. Having complex and hierarchical data structures to organize the templates, MongoDB was selected to replace the Oracle-based persistence layer. In this talk you will hear about the improvements we have achieved with the migration to MongoDB, problems we had to solve underway and unit testing of the persistence layer in order to keep our quality level.
Replacing Oracle with MongoDB for a templating application at the Bavarian go...Comsysto Reply GmbH
Bavarian government runs a document template application (RTF or ODF with Groovy, Python, Ruby or Tcl as scripting language) serving different government offices. Having complex and hierarchical data structures to organize the templates, MongoDB was selected to replace the Oracle-based persistence layer. This presentation is about the improvements they have achieved with the migration to MongoDB, problems they had to solve underway and unit testing of the persistence layer in order to keep their quality level. Presentation slides by Christian Brensing, Senior Developer at Rechenzentrum Süd, shown at Munich MongoDB User Group Meetup on 18th September 2012
This document outlines how to create and use inline tasks in MSBuild. It discusses the structure of an inline task including the UsingTask attributes, elements, and Code element. It provides examples of simple "Hello World" and BMI calculation inline tasks to demonstrate how input and output parameters are defined and used. The examples show how to perform basic operations and return output within an inline task coded directly in the MSBuild project file.
Chef is an infrastructure configuration management platform that allows users to define infrastructure as code. It uses a client-server architecture with components like Ohai for data collection, Chef Server as a code repository, Chef Client software, Knife as a CLI, and roles, cookbooks, recipes and resources as basic units of configuration. Users define configurations through cookbooks containing recipes written in Ruby code, templates, attributes and other components. Chef enforces configurations through its client-server model and provides tools for testing, deployment and management of infrastructure through code.
This document provides a high-level overview of polyglot persistence and different database technologies. It begins by discussing the benefits of polyglot persistence in allowing the use of multiple data storage technologies based on application needs. It then summarizes several common database types including relational, document-oriented, key-value, and column-oriented databases. It also discusses database properties like ACID compliance, scaling, and data consistency models. The document concludes with examples of Amazon's SimpleDB and DynamoDB key-value stores.
PostgreSQL is a well-known relational database. But in the last few years, it has gained capabilities that previously belonged only to "NoSQL" databases. In this talk, I describe several of PostgreSQL that give it such capabilities.
In this session, Galder Zamarreño, a senior software engineer at Red Hat, will:
- Provide a brief introduction to RESTful principles
Discuss how cloud-scale APIs are done best with REST
- Introduce Infinispan REST server, focusing on its cloud capabilities and simple REST API
- Detail how REST can apply to many APIs, focusing on some of the deeper principles and practices behind it and how easy it is to implement and use
Infinispan – the open source data grid platform by Mircea MarkusCodemotion
This presentation describes what the project is and focuses on the main scenarios in which the audience can make use of it.
Mircea Markus, project’s lead and co-founder, will give you an overview of the Infinispan ecosystem from which you’ll take home:
- what Infinispan is
- the main use cases in which you can benefit from it
- its key features and differentiators in the data grid wold
This document provides an overview of Database Jones, a Node.js API for highly scalable database access to MySQL Cluster. It introduces J.D. Duncan and Craig Russell, the creators of Database Jones, and describes how Database Jones provides an asynchronous JavaScript API that can be used with MySQL Cluster and other databases. It also summarizes the key features and capabilities of Database Jones, including its data modeling approaches, operations, and usage with Node.js applications.
Slides for presentation on Cloudera Impala I gave at the DC/NOVA Java Users Group on 7/9/2013. It is a slightly updated set of slides from the ones I uploaded a few months ago on 4/19/2013. It covers version 1.0.1 and also includes some new slides on HortonWorks' Stinger Initiative.
This document provides an overview of MySQL Cluster and NoSQL. It discusses how to set up nodes in a multi-node MySQL Cluster, including connecting to the network and firewall configuration. It also outlines the tutorial agenda, which will first cover deploying a MySQL Cluster and then developing applications using ClusterJ, Memcache, and Node.js connectors. Presenter biographies and a high-level introduction to database concepts, MySQL Cluster architecture, and the basics of MySQL Cluster are also included.
The document discusses JavaFX Mobile and provides an overview of its features and current status. It summarizes a speaker's presentation on designing a JavaFX Mobile application and examples shown, including calling JME code, posting content via HTTP requests, and loading resources via binding. While JavaFX Mobile was promising in 2009-2010, its future is uncertain given the dominance of Android and iOS. The speaker plans to research porting their application to Android and JME to catch platforms not adopting newer technologies.
Learn how to develop with Couchbase Lite for .NET. The session will include a look at the development environment and required C# APIs using a walkthrough of a demo app.
This document provides an overview of Elasticsearch and how to use it with .NET. It discusses what Elasticsearch is, how to install it, how Elasticsearch provides scalability through its architecture of clusters, nodes, shards and replicas. It also covers topics like indexing and querying data through the REST API or NEST client for .NET, performing searches, aggregations, highlighting hits, handling human language through analyzers, and using suggesters.
We describe the features of Oak Lucene indexes and how they can be used to get your queries perform better. In the second part we will talk about how asynchronous indexing works in general and how it can be monitored.
This was presented as part of AEM Gem Series -http://dev.day.com/content/ddc/en/gems/oak-lucene-indexes.html
Omnisearch in AEM 6.2 - Search All the ThingsJustin Edelson
This document provides an overview of Omnisearch in AEM 6.2, including how to use it, add new locations, and implement a custom Omnisearch handler. Omnisearch provides a unified search experience across AEM authoring tools and content. Key points covered include how to build queries for search results, suggestions, and spell check, as well as configure the UI display of search locations and results. The presentation includes code examples for implementing a custom Omnisearch handler that searches content fragments. It concludes with a demonstration and overview of potential future enhancements.
Scalable XQuery Processing with Zorba on top of MongoDBWilliam Candillon
Since a couple of years, the NoSQL movement has developed a variety of open-source document stores. Most of them focus on high availability, horizontal scalability, and are designed to run on commodity hardware. These products have gained great traction in the industry to store large amounts of flexible data (mostly JSON). In the meantime, XQuery has evolved to a standardized, full-fledged programming language for XML with native support for complex queries, indexes, updates, full-text search, and scripting. Moreover, JSON has recently been added as a first-level datatype into the language. As of today, it is without doubt the most robust and productive technology to process flexible data.
The aim of this talk is to showcase the benefits that can be achieved by integrating the Zorba XQuery Processor with MongoDB. We will introduce the 28msec platform that seamlessly stores, indexes, and manages flexible data entirely in XQuery. The data itself is stored in MongoDB. The platform leverages MongoDB’s indexes, sharding, and consistency guarantees to scale-out horizontally. The talk will conclude by showing a benchmark of the platform and discuss perspectives of the outlined approach.
Getting started with Elasticsearch and .NETTomas Jansson
This document provides an introduction to using Elasticsearch with .NET and the NEST client library. It demonstrates how to install and configure Elasticsearch on Windows, define mappings and index documents using NEST, perform various queries including fuzzy, highlighted, faceted queries, and add filters. NEST provides a strongly typed, fluent abstraction over Elasticsearch that allows querying and filtering Elasticsearch in a similar manner to Elasticsearch JSON queries. The document encourages attendees to view the demo code on GitHub for examples of indexing, querying, highlighting, faceting, and filtering documents using NEST.
Slides accompanying a presentation on Dropwizard I gave at the DevIgnition conference ( www.devignition.com ) on April 29, 2016. The sample code is on GitHub at https://github.com/sleberknight/dropwizard-devignition-2016
The document provides information about new features in Cassandra 2.2 and 3.0, including materialized views. Materialized views allow data to be pre-computed and denormalized to relieve the pain of manual denormalization. Materialized views are implemented by taking a write lock on the base table partition, reading the current values, constructing a batch log of delete and insert operations for the view, executing this asynchronously on the view replica, and then applying the base table update locally. This allows views to be kept in sync with the base table in an efficient manner.
This document discusses best practices for completing pragmatic Plone projects based on lessons learned from a large Plone project for Weishaupt, a major heating systems company. It addresses common project constraints like limited budgets, resources and time. It provides tips for getting work done on schedule and budget through customizations, patching, overriding and extending existing Plone and third-party packages. Sample content and pre-defined site structures are also recommended to help designers. The key lessons are to contribute fixes and enhancements back to packages when possible and always keep custom code separate from third-party packages.
MongoDB Munich 2012: MongoDB for official documents in BavariaMongoDB
Christian Brensing, Senior Developer, State of Bavaria
The Bavarian government runs a document template application (RTF or ODF with Groovy, Python, Ruby or Tcl as scripting language) serving different government offices. Having complex and hierarchical data structures to organize the templates, MongoDB was selected to replace the Oracle-based persistence layer. In this talk you will hear about the improvements we have achieved with the migration to MongoDB, problems we had to solve underway and unit testing of the persistence layer in order to keep our quality level.
Replacing Oracle with MongoDB for a templating application at the Bavarian go...Comsysto Reply GmbH
Bavarian government runs a document template application (RTF or ODF with Groovy, Python, Ruby or Tcl as scripting language) serving different government offices. Having complex and hierarchical data structures to organize the templates, MongoDB was selected to replace the Oracle-based persistence layer. This presentation is about the improvements they have achieved with the migration to MongoDB, problems they had to solve underway and unit testing of the persistence layer in order to keep their quality level. Presentation slides by Christian Brensing, Senior Developer at Rechenzentrum Süd, shown at Munich MongoDB User Group Meetup on 18th September 2012
This document outlines how to create and use inline tasks in MSBuild. It discusses the structure of an inline task including the UsingTask attributes, elements, and Code element. It provides examples of simple "Hello World" and BMI calculation inline tasks to demonstrate how input and output parameters are defined and used. The examples show how to perform basic operations and return output within an inline task coded directly in the MSBuild project file.
Chef is an infrastructure configuration management platform that allows users to define infrastructure as code. It uses a client-server architecture with components like Ohai for data collection, Chef Server as a code repository, Chef Client software, Knife as a CLI, and roles, cookbooks, recipes and resources as basic units of configuration. Users define configurations through cookbooks containing recipes written in Ruby code, templates, attributes and other components. Chef enforces configurations through its client-server model and provides tools for testing, deployment and management of infrastructure through code.
This document provides a high-level overview of polyglot persistence and different database technologies. It begins by discussing the benefits of polyglot persistence in allowing the use of multiple data storage technologies based on application needs. It then summarizes several common database types including relational, document-oriented, key-value, and column-oriented databases. It also discusses database properties like ACID compliance, scaling, and data consistency models. The document concludes with examples of Amazon's SimpleDB and DynamoDB key-value stores.
PostgreSQL is a well-known relational database. But in the last few years, it has gained capabilities that previously belonged only to "NoSQL" databases. In this talk, I describe several of PostgreSQL that give it such capabilities.
In this session, Galder Zamarreño, a senior software engineer at Red Hat, will:
- Provide a brief introduction to RESTful principles
Discuss how cloud-scale APIs are done best with REST
- Introduce Infinispan REST server, focusing on its cloud capabilities and simple REST API
- Detail how REST can apply to many APIs, focusing on some of the deeper principles and practices behind it and how easy it is to implement and use
Infinispan – the open source data grid platform by Mircea MarkusCodemotion
This presentation describes what the project is and focuses on the main scenarios in which the audience can make use of it.
Mircea Markus, project’s lead and co-founder, will give you an overview of the Infinispan ecosystem from which you’ll take home:
- what Infinispan is
- the main use cases in which you can benefit from it
- its key features and differentiators in the data grid wold
Presentation delivered by Mircea Markus (JBoss) at the London JBoss User Group Event on Wednesday, the 4th of December 2013.
In this talk, Mircea Markus (Infinispan Lead) covers the major features added in the 6.0 release to the Infinispan ecosystem:
- querying in client/server mode
- better integration with the persistent storage
- multi-master cross-site replication
- support for heterogeneous clusters
Participants will take home a better understanding of Infinispan capabilities and the use cases in which it can be put at work. Ideally the attendees should have a basic knowledge of the in-memory data grids.
The document discusses the rise of cloud computing and challenges with databases in the cloud. It introduces Infinispan, an open source data grid that provides an alternative to traditional databases in cloud environments. Infinispan offers scalability, low latency access to data, and elasticity to address the demands of cloud computing. It is suitable as a cloud data store and can help overcome bottlenecks and single points of failure for both cloud and non-cloud applications. The document encourages participants to try out Infinispan, contribute feedback and suggestions, and get involved in its open development process.
The document introduces the Infinispan data grid platform. It discusses how Infinispan can be used as a distributed in-memory cache both as a library and server. Key features of Infinispan are clustering, persistence, transactions, querying, and map-reduce capabilities. Examples of using Infinispan for session clustering and as a state store for Storm processing are provided.
London JBUG April 2015 - Performance Tuning Apps with WildFly Application ServerJBUG London
This document discusses performance tuning for Wildfly8 applications. It outlines reasons for tuning like contractual obligations and user experience. It describes benchmarking methodology like defining test objectives and harnessing test tools. Common bottlenecks like the web tier, EJB tier, and JMS/JDBC connections are discussed. Wildfly tuning controls like thread pools, bean instance counts, and pool sizes are covered. Ideal request flow and queuing with timeouts are addressed. Specific thread pool types like unbounded, bounded, and blocking-bounded are explained. The presentation ends with questions.
This document introduces Infinispan, an open source distributed in-memory data grid. It provides high availability and elasticity through data replication across nodes. The API allows developers to treat Infinispan as a distributed concurrent hash map. Key features include transactions, persistence, querying, and map/reduce capabilities. Infinispan can be used as a local cache, cluster of caches, or autonomous data store accessed via protocols like REST and Memcached. Future plans include improvements to transactions, non-blocking state transfers, and support for cross-datacenter replication and eventual consistency.
This document summarizes a presentation about data grids versus databases. It discusses how data grids provide extremely fast in-memory access to distributed data and can be used for caching to improve database performance. While data grids offer benefits like scalability, their use requires a different programming model than databases. They may replace databases for some use cases like analytics but databases will remain important for their maturity and existing implementations. Data grids are best viewed as complementing rather than replacing databases.
Infinispan, a distributed in-memory key/value data grid and cacheSebastian Andrasoni
This document provides an introduction to distributed in-memory data grids and caches, including Infinispan. It discusses hash tables, distributed hash tables, consistent hashing, and the Chord lookup protocol. It then describes data grids and Infinispan's architecture, which uses consistent hashing to distribute data across clusters and allows for high availability even when nodes fail or partitions occur. The document also briefly discusses Infinispan's features like transactions, querying, map-reduce, and more.
Galder Zamarreño from Red Hat presented on Infinispan, an open source in-memory data grid platform. Infinispan can be used as a local cache, clustered cache, or as a data grid. As a data grid, it provides a highly available, distributed, and elastic data store. Infinispan also enables users to build their own data-as-a-service solutions in private clouds by virtualizing data and making it accessible in an elastic and scalable manner. Major companies use Infinispan both as a cache (e.g. for Hibernate) and as a data grid for applications requiring real-time access to distributed data.
This document summarizes a presentation about Infinispan, an open source in-memory data grid platform. The presentation covers how Infinispan can be used as a local cache, clustered cache, and distributed data grid. It also discusses challenges of clustering large numbers of nodes and migrating data between grids. Finally, it provides examples of companies that use Infinispan for caching and as an authoritative data store.
Examiness hints and tips from the trenchesIsmail Mayat
This document provides an overview of tools and techniques for working with the Examine search engine in Umbraco, including:
- Tools like Luke and the Examine Dashboard for debugging indexes.
- Using the GatheringNodeData event to merge fields, add fields like node type aliases, and handle errors during indexing.
- Indexing different media types like PDFs using Tika.
- Techniques for search highlighting, boosting documents, and deploying index changes across environments.
- Faceted search capabilities and using the index as an object database.
The presenter encourages exploring the full capabilities of Examine and provides examples of how to optimize indexing and searching.
A comparison of different solutions for full-text search in web applications using PostgreSQL and other technology. Presented at the PostgreSQL Conference West, in Seattle, October 2009.
JavaEdge09 : Java Indexing and SearchingShay Sofer
From AlphaCSP's Java conference - JavaEdge09. The presentation of myself and Evgeny Borisov about 'Java Indexing and Searching'
In this session we discussed the need of Full Test Search (as opposed to regular textual/SQL search) , Lucene and it's OO mismatches, the solution that Hibernate Search provides to those mismatches and then a bit about Lucene's scoring algorithm.
Hypothesis 1 proposes a single network and common schema. Hypothesis 2 proposes an object-oriented design approach. Hypothesis 3 states that the resource is the message. The document then discusses Catmandu, a Perl toolkit for working with complex data, and LibreCat, an example program for repository functions like search, citations, and more. It outlines the project plan and thanks collaborators.
This document discusses Elasticsearch, including understanding how it works and optimizing performance. It covers Elasticsearch concepts like clusters, indexes, shards and nodes. It also discusses installing and configuring Elasticsearch, modeling data, indexing and querying optimizations. Lastly it discusses integrating Elasticsearch with Hadoop and using SQL on Elasticsearch.
This document discusses understanding and performance optimization of Elasticsearch. It covers:
1. Understanding Elasticsearch including its architecture, nodes, indexing and querying.
2. Optimizing Elasticsearch performance by understanding factors that impact performance and configuring settings, indexing, and querying for better performance.
3. Utilizing Elasticsearch for big data by integrating with Hadoop and using SQL on Elasticsearch.
Introduction to Elasticsearch with basics of LuceneRahul Jain
Rahul Jain gives an introduction to Elasticsearch and its basic concepts like term frequency, inverse document frequency, and boosting. He describes Lucene as a fast, scalable search library that uses inverted indexes. Elasticsearch is introduced as an open source search platform built on Lucene that provides distributed indexing, replication, and load balancing. Logstash and Kibana are also briefly described as tools for collecting, parsing, and visualizing logs in Elasticsearch.
The importance of search for modern applications is evident and nowadays it is higher than ever. A lot of projects use search forms as a primary interface for communication with a user. Though implementation of an intelligent search functionality is still a challenge and we need a good set of tools.
In this presentation, I will talk through the high-level architecture and benefits of Elasticsearch with some examples. Aside from that, we will also take a look at its existing competitors, their similarities, and differences.
The document provides an overview of some key classes and utilities available in the Darwino API framework. The Darwino APIs are designed to be portable across devices and provide lightweight wrappers to similar device APIs. The APIs aim to make common tasks easy while allowing developers to use more advanced features when needed. Some notable utilities covered include the Platform object for accessing services, plugins for extension mechanisms, JSON and XML processing libraries, HTTP client, task scheduling, logging, internationalization support, and application manifests.
The document describes a presentation about rapidly prototyping with Solr. It will demonstrate ingesting documents into Solr, adjusting Solr's schema, and showcasing data in a flexible search UI. The presentation will cover faceting, highlighting, spellchecking, and debugging. Time will also be spent outlining next steps to develop and take the search application to production.
Introducing Hibernate OGM: porting JPA applications to NoSQL, Sanne Grinovero...OpenBlend society
Hibernate OGM allows developers to use JPA to access NoSQL databases like Infinispan. It stores entities as tuples in the key-value store and uses Hibernate Search to index entities and support JP-QL queries. While still early, it aims to reuse familiar Hibernate concepts while taking advantage of NoSQL databases' flexibility, high availability, and scalability. It currently supports Infinispan but contributions are welcome to integrate other NoSQL databases.
ElasticSearch is a flexible and powerful open source, distributed real-time search and analytics engine for the cloud. It is JSON-oriented, uses a RESTful API, and has a schema-free design. Logstash is a tool for collecting, parsing, and storing logs and events in ElasticSearch for later use and analysis. It has many input, filter, and output plugins to collect data from various sources, parse it, and send it to destinations like ElasticSearch. Kibana works with ElasticSearch to visualize and explore stored logs and data.
Using ElasticSearch as a fast, flexible, and scalable solution to search occu...kristgen
Elasticsearch is an open source search engine that provides fast, flexible, and scalable search of occurrence records and checklists. It allows adding and querying data through a REST API or Java API. Data can be imported from databases or other sources using rivers. Mappings customize indexing and querying. Elasticsearch has been used at Canadensys to index vascular plant names with filters for autocompletion, genus filtering, and epithet hierarchy. It is also used at GBIF France to search biodiversity data from MongoDB with filters and calculate statistics with facets.
Using JPA applications in the era of NoSQL: Introducing Hibernate OGMPT.JUG
Sanne Grinovero presented on Hibernate Object/Grid Mapper (OGM), which provides an object-oriented interface for NoSQL databases using JPA. OGM stores entities as serialized tuples and uses Lucene/Hibernate Search for querying. It reuses Hibernate Core and is targeted at Infinispan but also works with other NoSQL databases. The goals are to encourage new data usage patterns with a familiar programming model and ease of use while pushing NoSQL exploration in enterprises.
Effiziente Datenpersistierung mit JPA 2.1 und HibernateThorben Janssen
This document discusses techniques for improving data persistence performance using JPA 2.1 and Hibernate. It covers analyzing performance issues, optimizing queries, caching, and entity fetching strategies. Specific techniques include using Hibernate statistics, native queries, fetch joins, named entity graphs, and the first- and second-level caches. The goal is to recognize and address performance problems early using Hibernate's features.
This document provides an overview of Lucene and Solr. It introduces Erik Hatcher, who is a committer to Lucene and Solr projects and co-founder of Lucid Imagination, a company that provides commercial support for Lucene and Solr. It then provides brief descriptions of Lucene, its inverted index structure, segments and merging, and scoring. Finally, it discusses Solr architecture and some extension points for customizing Lucene and Solr functionality.
Here is one way to build a custom search component that automatically selects facets based on the results:
1. Create a class that extends SearchComponent and implements the prepare and process methods.
2. In prepare, analyze the query and use Lucene's term vectors or other analysis to determine which fields are likely to provide useful facets. Add these fields to the response builder.
3. In process, after the normal query processing, generate facet counts for the fields added in prepare. Add the facet counts to the response.
4. Register the component in solrconfig.xml and configure it to run after the query and facet components.
Now facets will be automatically selected without needing to specify them in the request.
This document discusses improving performance when using Hibernate for data persistence. It recommends recognizing performance problems early, analyzing slow queries and database indexes, and optimizing fetch strategies. Specific tips include using fetch joins, entity graphs, caching, and Hibernate statistics to identify issues and optimize data loading. Resources for further information on Java persistence and Hibernate are also listed.
This document summarizes a presentation about rapid prototyping with Solr. It discusses getting documents indexed into Solr quickly, adjusting Solr's schema to better match needs, and showcasing data in a flexible search UI. It outlines how to leverage faceting, highlighting, spellchecking and debugging in rapid prototyping. Finally, it discusses next steps in developing a search application and taking it to production.
This document discusses using the Spoon tool to analyze and transform source code when an IDE is no longer sufficient for large codebases with many modules. It begins with an introduction to Spoon and outlines some common code transformation tasks. It then walks through an example of using Spoon to introduce a common base class for unit tests across many modules by reading the code, identifying test classes, determining a common package, generating the base class, extending the test classes, and writing the transformed code. The document emphasizes that Spoon provides a programmatic API and AST representation that allows complex automated refactoring and analysis of large codebases.
JBoss Wise: breaking barriers to WS testingJBug Italy
JBoss Wise is a Java library that makes it easy to test web services with little to no code. It allows dynamic invocation of web service operations by browsing WSDL models and populating request parameters. Users can define their own data models to map to service parameters using mappers like Smooks. Wise also includes a web-based GUI that allows testing services visually without XML or Java knowledge. The goal is to lower the barrier to web service testing and enable business analysts to perform acceptance tests.
The document discusses implementing enterprise integration patterns through Apache Camel. It provides an overview of enterprise integration patterns, describes what Apache Camel is and how it is based on these patterns, and gives examples of implementing the Message Filter pattern in XML, Java, Scala and Spring configurations. It also discusses using beans with Camel for message translation and binding beans to endpoints.
1. JBoss AS7 is a lightweight, modular Java application server with features like hot parallel deployment, elegant administration, and domain management.
2. The CLI provides a command line interface for managing AS7 resources through a detyped management model and supports features like tab completion, scripting, and deployment management.
3. The CLI allows viewing and modifying resources, connecting to controllers, sending operations, and deploying/undeploying packages through both interactive and non-interactive usage.
The document outlines the agenda for a JBoss User Group meeting in Milano on September 26, 2012. The agenda includes presentations on using TEIID in the European Open-DAI project and on JBoss Application Server 7 CLI administration. It also provides updates on JBoss news including webinars, products, pricing, acquisitions, and upcoming events.
Faster & Greater Messaging System HornetQ zzzJBug Italy
This document provides an overview of HornetQ, an open source messaging system. It describes key features of HornetQ including its core architecture, modes of operation in both standalone and JBoss EAP environments, transport options, persistence, flow control, clustering, high availability, and support for large messages. Performance benchmarks are cited showing HornetQ can process over 8 million messages per second, significantly outperforming other messaging systems.
Stefano Maestri is a committer for various JBoss projects including JBoss WS and IronJacamar. He is a member of the JBoss AS7 team and leads the Wise project. AS7 provides a modular, lightweight Java application server with fast startup times, easy management across multiple instances using domains, and simplified configuration.
JBoss BRMS - The enterprise platform for business logicJBug Italy
The document provides an agenda for a presentation on JBoss BRMS. It includes sections on the JBoss BRMS overview and benefits, how it integrates with Guvnor, the different types of assets that can be defined in JBoss BRMS like packages, facts, rules, decision tables, and test scenarios. It also discusses how JBoss BRMS supports business processes with JBPM5, authoring, deployment, and integration with Eclipse. Key assets include packages to organize logic, a fact model, rules, decision tables to define rules visually, and test scenarios to validate the system.
JBoss Application Server 7 (AS7) introduces major changes from previous versions including a new modular architecture, support for domain mode management across multiple servers, and a unified configuration model. AS7 aims to improve usability, manageability, and performance of the application server through these changes. The new architecture in AS7 includes concepts such as server groups that allow consistent configuration and deployment of applications across multiple server instances.
The document announces a JBoss User Group meeting in Milano on January 24th 2012. The agenda includes welcome coffee, news on JBoss, an introduction to Drools, Infinispan clustering in AS7.1 and Enterprise Data Grid, and a buffet lunch. Updates are provided on JBoss AS7 webinars, the release of AS7.1 CR1, additions to OpenShift, the release of Teiid 7.6 and RichFaces 4.1.0, and the Ceylon IDE M1 release. Information is also given on RHQ driftmonitoring and samples projects, the release of JBoss Operations Network 3.0, and HornetQ support for JBoss EAP 5
This document discusses JBoss Web Services and how it integrates Apache CXF into JBoss Application Server. It provides an overview of how JBoss WS works at runtime and during deployment. Key points include that JBoss WS allows CXF to be used on JBoss AS, addresses classloading issues, and provides features like web service reference injection. It also demonstrates configuring security using the WS-Security UT Profile and JAAS login modules.
Stefano Maestri is a long-time committer to JBoss projects who has worked at Red Hat since 2010. He is involved with JBoss AS7, leading the Wise project and serving on the AS7 team. AS7 features a highly modular, lightweight architecture with fast startup times, easy administration across a domain, and improved usability over previous versions.
The document outlines the agenda for a JBoss User Group meeting in Milan on September 20th, 2011. The agenda includes presentations on JBoss AS7, JBoss AS7 web services, using JBoss on OpenShift cloud, and time for networking and questions. Additional sections provide news and information on JBoss projects, events, books, and the differences between community and enterprise versions of JBoss middleware.
The document provides an overview of the key features and benefits of a Business Rules Management System (BRMS). It discusses what a BRMS is, its main components like Guvnor, assets, rules, processes, and how it can be used with Eclipse. A BRMS provides a centralized repository for business logic, enables separation of logic and data, and allows non-technical users to define rules through a graphical interface.
Infinispan is an in-memory data grid that provides a distributed key-value store. It allows for data replication across nodes for high availability and partitions data using consistent hashing to enable horizontal scalability. Infinispan supports transactions, caching, querying and more. It can be configured programmatically or via XML and integrates with various Java technologies like JPA, CDI and Spring.
Drools was originally created as a rule engine but has expanded to be a full business modeling platform through the integration of business rule management (Drools Expert), business process management (Drools Flow), and complex event processing (Drools Fusion). Drools 5 provides a single platform for developing business logic applications using these complementary techniques. It allows modeling problems as rules, processes, events, and more through tools like Drools Guvnor for managing knowledge bases.
This document discusses barriers to integration testing and introduces Arquillian and ShrinkWrap as tools to help address those barriers. It describes how Arquillian handles container lifecycles and test deployment, allowing tests to focus on logic. ShrinkWrap provides a fluent API for programmatically creating deployment archives. The presentation provides an overview of their capabilities and benefits, such as running tests directly in containers without full application builds. It also outlines future plans like additional container and framework support. Attendees are encouraged to get involved in the open source projects.
This document provides an overview of REST (Representational State Transfer), including the key aspects of RESTful architectures such as:
- Resources are addressed through URIs
- Standard HTTP methods like GET, PUT, POST, DELETE are used to manipulate resources
- Data is represented in various formats like JSON, XML, HTML
- Communication is stateless between client and server
It then discusses how these REST principles are implemented in RESTEasy, the JBoss RESTful Web Services framework, through annotations and APIs. Features like content negotiation, interceptors, asynchronous calls and caching are also covered.
Digital Marketing Trends in 2024 | Guide for Staying AheadWask
https://www.wask.co/ebooks/digital-marketing-trends-in-2024
Feeling lost in the digital marketing whirlwind of 2024? Technology is changing, consumer habits are evolving, and staying ahead of the curve feels like a never-ending pursuit. This e-book is your compass. Dive into actionable insights to handle the complexities of modern marketing. From hyper-personalization to the power of user-generated content, learn how to build long-term relationships with your audience and unlock the secrets to success in the ever-shifting digital landscape.
5th LF Energy Power Grid Model Meet-up SlidesDanBrown980551
5th Power Grid Model Meet-up
It is with great pleasure that we extend to you an invitation to the 5th Power Grid Model Meet-up, scheduled for 6th June 2024. This event will adopt a hybrid format, allowing participants to join us either through an online Mircosoft Teams session or in person at TU/e located at Den Dolech 2, Eindhoven, Netherlands. The meet-up will be hosted by Eindhoven University of Technology (TU/e), a research university specializing in engineering science & technology.
Power Grid Model
The global energy transition is placing new and unprecedented demands on Distribution System Operators (DSOs). Alongside upgrades to grid capacity, processes such as digitization, capacity optimization, and congestion management are becoming vital for delivering reliable services.
Power Grid Model is an open source project from Linux Foundation Energy and provides a calculation engine that is increasingly essential for DSOs. It offers a standards-based foundation enabling real-time power systems analysis, simulations of electrical power grids, and sophisticated what-if analysis. In addition, it enables in-depth studies and analysis of the electrical power grid’s behavior and performance. This comprehensive model incorporates essential factors such as power generation capacity, electrical losses, voltage levels, power flows, and system stability.
Power Grid Model is currently being applied in a wide variety of use cases, including grid planning, expansion, reliability, and congestion studies. It can also help in analyzing the impact of renewable energy integration, assessing the effects of disturbances or faults, and developing strategies for grid control and optimization.
What to expect
For the upcoming meetup we are organizing, we have an exciting lineup of activities planned:
-Insightful presentations covering two practical applications of the Power Grid Model.
-An update on the latest advancements in Power Grid -Model technology during the first and second quarters of 2024.
-An interactive brainstorming session to discuss and propose new feature requests.
-An opportunity to connect with fellow Power Grid Model enthusiasts and users.
Webinar: Designing a schema for a Data WarehouseFederico Razzoli
Are you new to data warehouses (DWH)? Do you need to check whether your data warehouse follows the best practices for a good design? In both cases, this webinar is for you.
A data warehouse is a central relational database that contains all measurements about a business or an organisation. This data comes from a variety of heterogeneous data sources, which includes databases of any type that back the applications used by the company, data files exported by some applications, or APIs provided by internal or external services.
But designing a data warehouse correctly is a hard task, which requires gathering information about the business processes that need to be analysed in the first place. These processes must be translated into so-called star schemas, which means, denormalised databases where each table represents a dimension or facts.
We will discuss these topics:
- How to gather information about a business;
- Understanding dictionaries and how to identify business entities;
- Dimensions and facts;
- Setting a table granularity;
- Types of facts;
- Types of dimensions;
- Snowflakes and how to avoid them;
- Expanding existing dimensions and facts.
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Speck&Tech
ABSTRACT: A prima vista, un mattoncino Lego e la backdoor XZ potrebbero avere in comune il fatto di essere entrambi blocchi di costruzione, o dipendenze di progetti creativi e software. La realtà è che un mattoncino Lego e il caso della backdoor XZ hanno molto di più di tutto ciò in comune.
Partecipate alla presentazione per immergervi in una storia di interoperabilità, standard e formati aperti, per poi discutere del ruolo importante che i contributori hanno in una comunità open source sostenibile.
BIO: Sostenitrice del software libero e dei formati standard e aperti. È stata un membro attivo dei progetti Fedora e openSUSE e ha co-fondato l'Associazione LibreItalia dove è stata coinvolta in diversi eventi, migrazioni e formazione relativi a LibreOffice. In precedenza ha lavorato a migrazioni e corsi di formazione su LibreOffice per diverse amministrazioni pubbliche e privati. Da gennaio 2020 lavora in SUSE come Software Release Engineer per Uyuni e SUSE Manager e quando non segue la sua passione per i computer e per Geeko coltiva la sua curiosità per l'astronomia (da cui deriva il suo nickname deneb_alpha).
Building Production Ready Search Pipelines with Spark and MilvusZilliz
Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfChart Kalyan
A Mix Chart displays historical data of numbers in a graphical or tabular form. The Kalyan Rajdhani Mix Chart specifically shows the results of a sequence of numbers over different periods.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
Fueling AI with Great Data with Airbyte WebinarZilliz
This talk will focus on how to collect data from a variety of sources, leveraging this data for RAG and other GenAI use cases, and finally charting your course to productionalization.
Generating privacy-protected synthetic data using Secludy and MilvusZilliz
During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc
How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024?
In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores.
See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe.
This webinar will review:
- The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey
- The top challenges for privacy leaders, practitioners, and organizations in 2024
- Key themes to consider in developing and maintaining your privacy program
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceIndexBug
Imagine a world where machines not only perform tasks but also learn, adapt, and make decisions. This is the promise of Artificial Intelligence (AI), a technology that's not just enhancing our lives but revolutionizing entire industries.
Project Management Semester Long Project - Acuityjpupo2018
Acuity is an innovative learning app designed to transform the way you engage with knowledge. Powered by AI technology, Acuity takes complex topics and distills them into concise, interactive summaries that are easy to read & understand. Whether you're exploring the depths of quantum mechanics or seeking insight into historical events, Acuity provides the key information you need without the burden of lengthy texts.
Monitoring and Managing Anomaly Detection on OpenShift.pdfTosin Akinosho
Monitoring and Managing Anomaly Detection on OpenShift
Overview
Dive into the world of anomaly detection on edge devices with our comprehensive hands-on tutorial. This SlideShare presentation will guide you through the entire process, from data collection and model training to edge deployment and real-time monitoring. Perfect for those looking to implement robust anomaly detection systems on resource-constrained IoT/edge devices.
Key Topics Covered
1. Introduction to Anomaly Detection
- Understand the fundamentals of anomaly detection and its importance in identifying unusual behavior or failures in systems.
2. Understanding Edge (IoT)
- Learn about edge computing and IoT, and how they enable real-time data processing and decision-making at the source.
3. What is ArgoCD?
- Discover ArgoCD, a declarative, GitOps continuous delivery tool for Kubernetes, and its role in deploying applications on edge devices.
4. Deployment Using ArgoCD for Edge Devices
- Step-by-step guide on deploying anomaly detection models on edge devices using ArgoCD.
5. Introduction to Apache Kafka and S3
- Explore Apache Kafka for real-time data streaming and Amazon S3 for scalable storage solutions.
6. Viewing Kafka Messages in the Data Lake
- Learn how to view and analyze Kafka messages stored in a data lake for better insights.
7. What is Prometheus?
- Get to know Prometheus, an open-source monitoring and alerting toolkit, and its application in monitoring edge devices.
8. Monitoring Application Metrics with Prometheus
- Detailed instructions on setting up Prometheus to monitor the performance and health of your anomaly detection system.
9. What is Camel K?
- Introduction to Camel K, a lightweight integration framework built on Apache Camel, designed for Kubernetes.
10. Configuring Camel K Integrations for Data Pipelines
- Learn how to configure Camel K for seamless data pipeline integrations in your anomaly detection workflow.
11. What is a Jupyter Notebook?
- Overview of Jupyter Notebooks, an open-source web application for creating and sharing documents with live code, equations, visualizations, and narrative text.
12. Jupyter Notebooks with Code Examples
- Hands-on examples and code snippets in Jupyter Notebooks to help you implement and test anomaly detection models.
Main news related to the CCS TSI 2023 (2023/1695)Jakub Marek
An English 🇬🇧 translation of a presentation to the speech I gave about the main changes brought by CCS TSI 2023 at the biggest Czech conference on Communications and signalling systems on Railways, which was held in Clarion Hotel Olomouc from 7th to 9th November 2023 (konferenceszt.cz). Attended by around 500 participants and 200 on-line followers.
The original Czech 🇨🇿 version of the presentation can be found here: https://www.slideshare.net/slideshow/hlavni-novinky-souvisejici-s-ccs-tsi-2023-2023-1695/269688092 .
The videorecording (in Czech) from the presentation is available here: https://youtu.be/WzjJWm4IyPk?si=SImb06tuXGb30BEH .
3. Infinispan
• Cache distribuita
• Datagrid scalabile e transazionale:
performance estreme e cloud
• NoSQL “DataBase”: key-value store
– Come si interroga un data grid ?
SELECT * FROM GRID
8. Test sulla mia
libreria
• Dov'é Hibernate
Search in Action?
• Mi passi
ISBN 978-1-
933988-17-7 ?
• Prendi i libri su
Gaudí ?
9.
10.
11.
12. Come implementare
queste funzioni su un
Key/Value store?
• Dov'é Hibernate Search in Action?
• Mi passi ISBN 978-1-933988-17-7 ?
• Trovi i libri su Gaudí ?
13. document based NoSQL:
Map/Reduce
Infinispan non é propriamente document based ma
offre Map/Reduce.
Eppure non é escluso l'uso di JSON, XML, YAML, Java:
public class Book implements Serializable {
final String title;
final String author;
final String editor;
public Book(String title, String author, String editor) {
this.title = title;
this.author = author;
this.editor = editor;
}
}
14. Iterate & collect
class TitleBookSearcher implements
Mapper<String, Book, String, Book> {
final String title;
public TitleBookSearcher(String t) { title = t; }
public void map(String key, Book value, Collector collector){
if ( title.equals( value.title ) )
collector.emit( key, value );
}
class BookReducer implements
Reducer<String, Book> {
public Book reduce(String reducedKey, Iterator<Book> iter) {
return iter.next();
}
}
15. Implementare queste
semplici funzioni:
✔ Trova “Hibernate Search in Action”?
✔ Trova per codice “ISBN 978-1-933988-17-7” ?
✗ Quanti libri a proposito di
“Shakespeare” ?
• Per uno score corretto in ricerche fulltext
servono le frequenze dei frammenti di
testo relative al corpus.
• Il Pre-tagging é poco pratico e limitante
16. Apache Lucene
• Progetto open source Apache™
• Integrato in innumerevoli progetti
• .. tra cui Hibernate via Hibernate Search
• Clusterizzabile via Infinispan
– Performance
– Real time
– High availability
23. Dov'é la fregatura?
• Necessita di un indice: risorse fisiche e di
amministrazione.
– in memory
– on filesystem
– in Infinispan
• Sostanzialmente immutable segments
– Ottimizzato per data mining / query, non per
updates.
• Un mondo di stringhe e vettori di frequenze
24. Infinispan Query quickstart
• Abilita indexing=true nella
configurazione
• Aggiungi il modulo infinispan-
query.jar al classpath
• Annota i POJO inseriti nella cache
per le modalitá di indicizzazione
<dependency>
<groupId>org.infinispan</groupId>
<artifactId>infinispan-query</artifactId>
<version>5.1.3.FINAL</version>
</dependency>
25. Configurazione tramite
codice
Configuration c = new Configuration()
.fluent()
.indexing()
.addProperty(
"hibernate.search.default.directory_provider",
"ram")
.build();
CacheManager manager = new DefaultCacheManager(c);
27. Annotazioni sul modello
@ProvidedId @Indexed
public class Book implements Serializable {
@Field String title;
@Field String author;
@Field String editor;
public Book(String title, String author, String editor) {
this.title = title;
this.author = author;
this.editor = editor;
}
}
28. Esecuzione di Query
SearchManager sm = Search.getSearchManager(cache);
Query query = sm.buildQueryBuilderForClass(Book.class)
.get()
.phrase()
.onField("title")
.sentence("in action")
.createQuery();
List<Object> list = sm.getQuery(query).list();
29. Architettura
• Integra Hibernate Search (engine)
– Listener a eventi Hibernate &
transazioni
• Eventi Infinispan & transazioni
– Mappa tipi Java e grafi del modello a
Documents di Lucene
– Thin-layer design
38. Quickstart Hibernate
Search
• Aggiungi la dipendenza ad hibernate-
search:
<dependency>
<groupId>org.hibernate</groupId>
<artifactId>hibernatesearchorm</artifactId>
<version>4.1.0.Final</version>
</dependency>
39. Quickstart Hibernate
Search
• Tutto il resto é opzionale:
– Come gestire gli indici
– Moduli di estensione, Analyzer custom
– Performance tuning
– Mapping custom dei tipi
– Clustering
• JGroups
• Infinispan
• JMS
40. Quickstart Hibernate
@Entity
Search
public class Essay {
@Id
public Long getId() { return id; }
public String getSummary() { return
summary; }
@Lob
public String getText() { return text; }
@ManyToOne
public Author getAuthor() { return
author; }
...
41. Quickstart Hibernate
@Entity @Indexed
Search
public class Essay {
@Id
public Long getId() { return id; }
public String getSummary() { return
summary; }
@Lob
public String getText() { return text; }
@ManyToOne
public Author getAuthor() { return
author; }
...
42. Quickstart Hibernate
@Entity @Indexed
Search
public class Essay {
@Id
public Long getId() { return id; }
@Field
public String getSummary() { return
summary; }
@Lob
public String getText() { return text; }
@ManyToOne
public Author getAuthor() { return
author; }
...
43. Quickstart Hibernate
@Entity @Indexed
Search
public class Essay {
@Id
public Long getId() { return id; }
@Field
public String getSummary() { return
summary; }
@Lob @Field @Boost(0.8)
public String getText() { return text; }
@ManyToOne
public Author getAuthor() { return
author; }
...
44. Quickstart Hibernate
@Entity @Indexed
Search
public class Essay {
@Id
public Long getId() { return id; }
@Field
public String getSummary() { return
summary; }
@Lob @Field @Boost(0.8)
public String getText() { return text; }
@ManyToOne @IndexedEmbedded
public Author getAuthor() { return
author; }
...
45. Un secondo esempio
@Entity @Entity
public class Author { public class Book {
@Id @GeneratedValue private Integer id;
private Integer id; private String title;
private String name; }
@OneToMany
private Set<Book>
books;
}
46. Struttura dell'indice
@Entity @Indexed @Entity
public class Author { public class Book {
@Id @GeneratedValue private Integer id;
private Integer id; @Field(store=Store.YES)
private String title;
@Field(store=Store.YES) }
private String name;
@OneToMany
@IndexedEmbedded
private Set<Book>
books;
}
56. Suggerimenti per
performance ottimali
• Calibra il chunk_size per l'uso effettivo
del vostro indice (evita i read lock
evitando la frammentazione)
• Verifica la dimensione dei pacchetti
network: blob size, JGroups packets,
network interface and hardware.
• Scegli e configura un CacheLoader
adatto
57. Requisiti di memoria
• RAMDirectory: tutto l'indice (e piú) in RAM.
• FSDirectory: un buon OS sa fare un ottimo
lavoro di caching di IO – spesso meglio di
RAMDirectory.
• Infinispan: configurabile, fino alla memoria
condivisa tra nodi
– Flexible
– Fast
– Network vs. disk
58. Moduli per cloud
deployment scalabili
One Infinispan to rule them all
– Store Lucene indexes
– Hibernate second level cache
– Application managed cache
– Datagrid
– EJB, session replication in AS7
– As a JPA “store” via Hibernate OGM
59. Ingredienti per la cloud
• JGroups DISCOVERY protocol
– MPING
– TCP_PING
– JDBC_PING
– S3_PING
• Scegli un CacheLoader
– Database based, Jclouds,
Cassandra, ...
60. Futuro prossimo
• Semplificare la scalabilitá in scrittura
• Auto-tuning dei parametri di
clustering – ergonomics!
• Parallel searching: multi/core +
multi/node
• A component of
– http://www.cloudtm.eu
63. NoSQL:
la flessibilitá costa
• Programming model
• one per product :-(
• no schema => app driven schema
• query (Map Reduce, specific DSL, ...)
• data structure transpires
• Transaction
• durability / consistency
64. Esempio: Infinispan
Distributed Key/Value store
(or Replicated, local only efficient cache,
•
invalidating cache)
Each node is equal
Just start more nodes, or kill some
•
No bottlenecks
by design
•
Cloud-network friendly
JGroups
•
And “cloud storage” friendly too!
•
66. É una ConcurrentMap !
map.put( “user-34”, userInstance );
map.get( “user-34” );
map.remove( “user-34” );
map.putIfAbsent( “user-38”,
another );
67. Qualche altro dettaglio su
Infinispan
● Support for Transactions (XA)
● CacheLoaders
● Cassandra, JDBC, Amazon S3 (jclouds),...
● Tree API for JBossCache compatibility
● Lucene integration
● Two-fold
● Some Hibernate integrations
● Second level cache
● Hibernate Search indexing backend
68. Obiettivi di Hibernate
OGM
Encourage new data usage patterns
•
Familiar environment
•
Ease of use
•
easy to jump in
•
easy to jump out
•
Push NoSQL exploration in enterprises
•
“PaaS for existing API” initiative
•
69. Cos'é
• JPA front end to key/value stores
• Object CRUD (incl polymorphism and
associations)
• OO queries (JP-QL)
• Reuses
• Hibernate Core
• Hibernate Search (and Lucene)
• Infinispan
• Is not a silver bullet
• not for all NoSQL use cases
70. Entitá come blob
serializzati?
• Serialize objects into the (key) value
• store the whole graph?
• maintain consistency with duplicated
objects
• guaranteed identity a == b
• concurrency / latency
• structure change and (de)serialization,
class definition changes
71. OGM’s approach to
schema
• Keep what’s best from relational model
• as much as possible
• tables / columns / pks
• Decorrelate object structure from data
structure
• Data stored as (self-described) tuples
• Core types limited
• portability
72.
73. Query
• Hibernate Search indexes entities
• Store Lucene indexes in Infinispan
• JP-QL to Lucene query transformation
• Works for simple queries
• Lucene is not a relational SQL engine