Presenter: Puneet Oberai, Senior Software Engineer at Netflix
In this session, we'll cover a quick introduction to the Astyanax Java client driver, powerful features, comparison to Java Driver and what to do with CQL3.
Algolia's Fury Road to a Worldwide API - Take Off Conference 2016Olivier Lance
From beta service to a worldwide distributed API.
From 1 to over 2000 customers all around the world.
This presentation takes you through 12 of the steps Algolia has been through to build and scale its hosted search API, always keeping in mind high availability and speed.
Herding a Cat with Antlers - Catalyst 5.80Tomas Doran
Catalyst 5.80 - the new major version allows you lots of new ways to build applications. This talk looks at some of the technologies you may want to use, and points out some examples and other modules you might want to look at on CPAN.
Algolia is a distributed Search-as-a-Service API that processes more than 4 billions user-generated queries per month. Algolia’s DNA is performance: Algolia's service is optimized to reply in milliseconds from any location worldwide while maintaining high availability. Sylvain will provide you details on Algolia architecture, showing how they have designed their fault tolerant service and how they cracked the website & in-app search.
http://www.meetup.com/Enterprise-Search-and-Analytics-Meetup/events/223926122/
Sylvain Utard presents on Build Realtime Search. Realtime Search is a search API provider that processes 2 billion operations per month across 30+ servers. The backend is built in C++, analytics in Java, and the website in Rails. It supports 12 API clients in various languages. Realtime Search provides an average query time of less than 10ms across multiple datacenters globally. The REST API is CORS compliant for use in JavaScript applications and aims for an overall end-user latency of less than 100ms.
How LogicMonitor Automates Deployments with Bamboo and AnsibleRandall Thomson
This document discusses automating deployments with Bamboo and Ansible. It describes how Bamboo is used for building and deploying software and Ansible is used for automation and configuration management. Previously, deployments were done manually by copying files but this led to inconsistencies. The solution implemented automates deployments through Bamboo deployment plans using Ansible playbooks for consistency and notifications for success/failure. This provides empowered developers with an automated, consistent and resilient deployment process.
Rails 5 will include several performance improvements and new features while maintaining backwards compatibility. Notable changes include faster development mode, optional belongs_to validations, support for JSON columns in MySQL, and ActionCable for real-time features. While Turbolinks 5 is included, Rails 5 will not upgrade to Turbolinks 2. The release aims to improve APIs, add Active Record query methods like or, and refine existing functionality with few breaking changes.
The document discusses AREL (Arel Relational Algebra Library), which is an abstract syntax tree manager and the engine behind ActiveRecord queries in Ruby on Rails. It compares using raw SQL, ActiveRecord queries, and AREL for database queries, noting the pros and cons of each. Examples are provided of typical ActiveRecord queries, rewriting them using AREL, and using raw SQL. Resources for further reading on AREL are also listed.
Algolia's Fury Road to a Worldwide API - Take Off Conference 2016Olivier Lance
From beta service to a worldwide distributed API.
From 1 to over 2000 customers all around the world.
This presentation takes you through 12 of the steps Algolia has been through to build and scale its hosted search API, always keeping in mind high availability and speed.
Herding a Cat with Antlers - Catalyst 5.80Tomas Doran
Catalyst 5.80 - the new major version allows you lots of new ways to build applications. This talk looks at some of the technologies you may want to use, and points out some examples and other modules you might want to look at on CPAN.
Algolia is a distributed Search-as-a-Service API that processes more than 4 billions user-generated queries per month. Algolia’s DNA is performance: Algolia's service is optimized to reply in milliseconds from any location worldwide while maintaining high availability. Sylvain will provide you details on Algolia architecture, showing how they have designed their fault tolerant service and how they cracked the website & in-app search.
http://www.meetup.com/Enterprise-Search-and-Analytics-Meetup/events/223926122/
Sylvain Utard presents on Build Realtime Search. Realtime Search is a search API provider that processes 2 billion operations per month across 30+ servers. The backend is built in C++, analytics in Java, and the website in Rails. It supports 12 API clients in various languages. Realtime Search provides an average query time of less than 10ms across multiple datacenters globally. The REST API is CORS compliant for use in JavaScript applications and aims for an overall end-user latency of less than 100ms.
How LogicMonitor Automates Deployments with Bamboo and AnsibleRandall Thomson
This document discusses automating deployments with Bamboo and Ansible. It describes how Bamboo is used for building and deploying software and Ansible is used for automation and configuration management. Previously, deployments were done manually by copying files but this led to inconsistencies. The solution implemented automates deployments through Bamboo deployment plans using Ansible playbooks for consistency and notifications for success/failure. This provides empowered developers with an automated, consistent and resilient deployment process.
Rails 5 will include several performance improvements and new features while maintaining backwards compatibility. Notable changes include faster development mode, optional belongs_to validations, support for JSON columns in MySQL, and ActionCable for real-time features. While Turbolinks 5 is included, Rails 5 will not upgrade to Turbolinks 2. The release aims to improve APIs, add Active Record query methods like or, and refine existing functionality with few breaking changes.
The document discusses AREL (Arel Relational Algebra Library), which is an abstract syntax tree manager and the engine behind ActiveRecord queries in Ruby on Rails. It compares using raw SQL, ActiveRecord queries, and AREL for database queries, noting the pros and cons of each. Examples are provided of typical ActiveRecord queries, rewriting them using AREL, and using raw SQL. Resources for further reading on AREL are also listed.
1. Algolia started as a beta with two machines in 2013 and has since expanded to 15 regions across 50 data centers, serving over 2600 customers globally.
2. Early challenges included risky deployments without easy rollbacks and intermittent slowness caused by DNS issues.
3. Algolia now has a distributed search network for worldwide synchronization and spreads clusters across independent providers and datacenters for high reliability.
How Netflix tests in production to augment more traditional testing methods. This talk covers the Simian Army (Chaos Monkey & friends, code coverage in production, and canary testing.
Rails 5 introduces several new features including Action Cable for real-time functionality using WebSockets, Active Model Serializers for JSON APIs, Turbolinks 3 as a partial replacement for the traditional Rails form submission process, and performance improvements for Active Record and Ruby. It also requires Ruby 2.2.1.
With C# 8 the yield and foreach statements have been extended to offer asynchronously iterating through streams. IAsyncEnumerable, IAsyncEnumerator, and IAsyncDisposable have been added as asynchronous counterparts of the synchronous interfaces IEnumerable, IEnumerator, and IDisposable. This session demonstrates the foundation of asynchronous streams, as well how async streams can be used with SignalR, gRPC, EF Core (.NET 6), ASP.NET Core 6, and Azure Storage..
C# 8 and .NET Core 3.0 will be released in 2019. In this session you learn what’s new with these new major versions. Influenced on language enhancements such as async streams and nullable reference types, enhancements based on this are coming to .NET Core and EF Core. With ASP.NET Core, a new routing foundation is available. You’ll see advantages of the new endpoint routing in addition to the Blazor Components, and some new project templates. Last but not least, you learn about different aspects and features creating WPF applications with .NET Core 3.0.
What's new with .NET Core 3 - covering features from C#, .NET Core, ASP.NET Core, WPF - including nullability, indices and ranges, switch expressions, enhanced pattern matching, changes with ASP.NET Core, Blazor server-side components, and WPF with .NET Core.
The document discusses ReasonML, a dialect of OCaml that compiles to JavaScript. It summarizes:
1) The author uses ReasonML for various projects at Viska including 8 cloud functions, a web admin, and a React Native app with over 8,000 lines of ReasonML code and several open source binding libraries.
2) ReasonML allows developing both web and mobile apps with one codebase and team as it compiles to JavaScript, unlike React Native which requires separate teams.
3) ReasonML is advantageous over React Native as it is typed, has a fast compiler, and the functional features of OCaml, unlike the untyped and slow nature of JavaScript in React Native.
Spark UDFs are EviL, Catalyst to the rEsCue!Adi Polak
Processing data at scale usually involves struggling with performance, strict SLA, limited hardware capabilities and more. After struggling with Spark SQL query run-time, I found the felon! In this lecture, I would like to share with you the change in perspective and process we had to go through in order to find the felon (and the solution!). Today in the world of Big Data and Spark we are processing high volume transactions. Catalyst is the Spark SQL query optimizer, in this talk, we will reveal how you can fully utilize Catalyst’s optimization power in order to make queries run as fast as possible, by pushing down actions and avoiding UDFs as much as possible, while still maximizing performance.
Fore features of .NET Core: dependency injection, logging, and configuration, and using the .NET Core 3.0 Host class.
Only few slides but live coding with many samples available at: https://github.com/christiannagel/bastafrankfurt2020
This presentation explains how to use the Spark framework together with the Kotlin programming language and the JVM to build robust APIs. The presentation begins with some basics about the Kotlin language from JetBrains. It then gives a brief introduction to Spark, a Sinatra-inspired framework for setting up Web routes. It concludes with some references to three blog posts with more explanation and a GitHub repository where the sample code can be found.
The document discusses integration hell, which can occur when developing software if changes and deployments happen too frequently without proper processes. It provides details on a real-world project with 6 developers, over 900 files, and a deployment every 43 minutes on average. Recommendations are made around using tools like Git, Jenkins, virtualenv, and others to help manage the integration process and spot problems early.
In this presentation I will show you how to setup Laravel and Elasticsearch to quickly build a search engine. This was given at a local meetup in Groningen (Netherlands).
From the perspective of software developers, you must still build, integrate, and deploy the software that makes up your Serverless Stack, be it Lambda functions, APIs in API gateway, databases in DynamoDB, streams in Kinesis, and so on. What does provisioning, continuous integration, continuous deployment, and monitoring look like in the Serverless world? We will look at effective end-to-end approaches for to achieve all of the above.
Speaker: Krishnan Mani,
Solutions Architect, Amazon India
- The document provides best practices for using Ansible including organizing content with roles and hierarchies, making playbooks readable with tasks, comments and whitespace, tagging tasks for organization, avoiding duplication, ensuring idempotency, templating with Jinja2, and testing playbooks. It emphasizes documenting processes, non-idempotent tasks can cause issues, and validating changes with testing.
The document discusses a company that transitioned from a Java codebase to Scala for developing their content API. They started by writing Scala test code and enjoyed features like sensible class constructors, type inference, and the REPL. After a month, they decided to convert the whole app to Scala due to these benefits. Over time, they embraced more Scala features while continuing to use the same tools as with Java. The transition was smooth by starting with "java without semicolons" and gradually adopting Scala.
How and why test Azure Front Door with AWS Lambda & PowerShell? | Osman Sahin...UK DevOps Collective
Osman Sahin, a regular attendee of our events, explains how and why he is using AWS Lambda & PowerShell to test the new Azure Front Door service.
Presented Wednesday 28th July 2019 at the London PowerShell User Group Meetup hosted by dotdigital Group.
Connect with Osman Sahin:
- LinkedIn: https://www.linkedin.com/in/osmanysahin/
Thanks to dotdigital Group (https://dotdigital.com / https://twitter.com/dotdigital) for providing the venue, food and drinks. We very much appreciate your continued support of our community of PowerShell & DevOps tech enthusiasts.
Join our next event at https://www.meetup.com/PowerShell-London-UK/. We are running at least one Meetup every month.
#PowerShell #Azure #AWS #DevOps
Cassandra Day SV 2014: Netflix’s Astyanax Java Client Driver for Apache Cassa...DataStax Academy
Astyanax is the thrift protocol based C* driver widely used and open sourced by Netflix. It was recently integrated with the Java Driver released by DataStax. This talk focusses on the different options available with Astyanax and how it complements the Java Driver.
About Puneet Oberai, Senior Software Engineer at Netflix
Senior Software Engineer at Netflix and proud team member of Netflix CDE (Cloud Data Engineering).
Van Wilson
Senior Consultant with Cardinal Solutions
Find more by Van Wilson: https://speakerdeck.com/vjwilson
All Things Open
October 26-27, 2016
Raleigh, North Carolina
1. Algolia started as a beta with two machines in 2013 and has since expanded to 15 regions across 50 data centers, serving over 2600 customers globally.
2. Early challenges included risky deployments without easy rollbacks and intermittent slowness caused by DNS issues.
3. Algolia now has a distributed search network for worldwide synchronization and spreads clusters across independent providers and datacenters for high reliability.
How Netflix tests in production to augment more traditional testing methods. This talk covers the Simian Army (Chaos Monkey & friends, code coverage in production, and canary testing.
Rails 5 introduces several new features including Action Cable for real-time functionality using WebSockets, Active Model Serializers for JSON APIs, Turbolinks 3 as a partial replacement for the traditional Rails form submission process, and performance improvements for Active Record and Ruby. It also requires Ruby 2.2.1.
With C# 8 the yield and foreach statements have been extended to offer asynchronously iterating through streams. IAsyncEnumerable, IAsyncEnumerator, and IAsyncDisposable have been added as asynchronous counterparts of the synchronous interfaces IEnumerable, IEnumerator, and IDisposable. This session demonstrates the foundation of asynchronous streams, as well how async streams can be used with SignalR, gRPC, EF Core (.NET 6), ASP.NET Core 6, and Azure Storage..
C# 8 and .NET Core 3.0 will be released in 2019. In this session you learn what’s new with these new major versions. Influenced on language enhancements such as async streams and nullable reference types, enhancements based on this are coming to .NET Core and EF Core. With ASP.NET Core, a new routing foundation is available. You’ll see advantages of the new endpoint routing in addition to the Blazor Components, and some new project templates. Last but not least, you learn about different aspects and features creating WPF applications with .NET Core 3.0.
What's new with .NET Core 3 - covering features from C#, .NET Core, ASP.NET Core, WPF - including nullability, indices and ranges, switch expressions, enhanced pattern matching, changes with ASP.NET Core, Blazor server-side components, and WPF with .NET Core.
The document discusses ReasonML, a dialect of OCaml that compiles to JavaScript. It summarizes:
1) The author uses ReasonML for various projects at Viska including 8 cloud functions, a web admin, and a React Native app with over 8,000 lines of ReasonML code and several open source binding libraries.
2) ReasonML allows developing both web and mobile apps with one codebase and team as it compiles to JavaScript, unlike React Native which requires separate teams.
3) ReasonML is advantageous over React Native as it is typed, has a fast compiler, and the functional features of OCaml, unlike the untyped and slow nature of JavaScript in React Native.
Spark UDFs are EviL, Catalyst to the rEsCue!Adi Polak
Processing data at scale usually involves struggling with performance, strict SLA, limited hardware capabilities and more. After struggling with Spark SQL query run-time, I found the felon! In this lecture, I would like to share with you the change in perspective and process we had to go through in order to find the felon (and the solution!). Today in the world of Big Data and Spark we are processing high volume transactions. Catalyst is the Spark SQL query optimizer, in this talk, we will reveal how you can fully utilize Catalyst’s optimization power in order to make queries run as fast as possible, by pushing down actions and avoiding UDFs as much as possible, while still maximizing performance.
Fore features of .NET Core: dependency injection, logging, and configuration, and using the .NET Core 3.0 Host class.
Only few slides but live coding with many samples available at: https://github.com/christiannagel/bastafrankfurt2020
This presentation explains how to use the Spark framework together with the Kotlin programming language and the JVM to build robust APIs. The presentation begins with some basics about the Kotlin language from JetBrains. It then gives a brief introduction to Spark, a Sinatra-inspired framework for setting up Web routes. It concludes with some references to three blog posts with more explanation and a GitHub repository where the sample code can be found.
The document discusses integration hell, which can occur when developing software if changes and deployments happen too frequently without proper processes. It provides details on a real-world project with 6 developers, over 900 files, and a deployment every 43 minutes on average. Recommendations are made around using tools like Git, Jenkins, virtualenv, and others to help manage the integration process and spot problems early.
In this presentation I will show you how to setup Laravel and Elasticsearch to quickly build a search engine. This was given at a local meetup in Groningen (Netherlands).
From the perspective of software developers, you must still build, integrate, and deploy the software that makes up your Serverless Stack, be it Lambda functions, APIs in API gateway, databases in DynamoDB, streams in Kinesis, and so on. What does provisioning, continuous integration, continuous deployment, and monitoring look like in the Serverless world? We will look at effective end-to-end approaches for to achieve all of the above.
Speaker: Krishnan Mani,
Solutions Architect, Amazon India
- The document provides best practices for using Ansible including organizing content with roles and hierarchies, making playbooks readable with tasks, comments and whitespace, tagging tasks for organization, avoiding duplication, ensuring idempotency, templating with Jinja2, and testing playbooks. It emphasizes documenting processes, non-idempotent tasks can cause issues, and validating changes with testing.
The document discusses a company that transitioned from a Java codebase to Scala for developing their content API. They started by writing Scala test code and enjoyed features like sensible class constructors, type inference, and the REPL. After a month, they decided to convert the whole app to Scala due to these benefits. Over time, they embraced more Scala features while continuing to use the same tools as with Java. The transition was smooth by starting with "java without semicolons" and gradually adopting Scala.
How and why test Azure Front Door with AWS Lambda & PowerShell? | Osman Sahin...UK DevOps Collective
Osman Sahin, a regular attendee of our events, explains how and why he is using AWS Lambda & PowerShell to test the new Azure Front Door service.
Presented Wednesday 28th July 2019 at the London PowerShell User Group Meetup hosted by dotdigital Group.
Connect with Osman Sahin:
- LinkedIn: https://www.linkedin.com/in/osmanysahin/
Thanks to dotdigital Group (https://dotdigital.com / https://twitter.com/dotdigital) for providing the venue, food and drinks. We very much appreciate your continued support of our community of PowerShell & DevOps tech enthusiasts.
Join our next event at https://www.meetup.com/PowerShell-London-UK/. We are running at least one Meetup every month.
#PowerShell #Azure #AWS #DevOps
Cassandra Day SV 2014: Netflix’s Astyanax Java Client Driver for Apache Cassa...DataStax Academy
Astyanax is the thrift protocol based C* driver widely used and open sourced by Netflix. It was recently integrated with the Java Driver released by DataStax. This talk focusses on the different options available with Astyanax and how it complements the Java Driver.
About Puneet Oberai, Senior Software Engineer at Netflix
Senior Software Engineer at Netflix and proud team member of Netflix CDE (Cloud Data Engineering).
Van Wilson
Senior Consultant with Cardinal Solutions
Find more by Van Wilson: https://speakerdeck.com/vjwilson
All Things Open
October 26-27, 2016
Raleigh, North Carolina
The document discusses using Apache Camel and Apache Karaf to build distributed, asynchronous systems in a similar way to AKKA. It provides examples of building a dynamic routing system using Camel routing and JMS, as well as a modular ETL system for processing CSV files using a configurable, hot-deployable mutation framework. The examples demonstrate how to achieve scalability, modularity, and asynchronous behavior without deep knowledge of the underlying technologies through an event-driven architecture based on messaging.
Lucene, Solr and java 9 - opportunities and challengesCharlie Hull
Apache Lucene and Solr needed to be updated to work with Java 9's new module system. This introduced challenges around strong encapsulation and reflective access. The talk discussed changes like compact strings and performance improvements from intrinsics and the G1 garbage collector. It also recommended using multi-release JARs to include Java 9 specific implementations of utils classes for compatibility. Migrating to Java 9 could improve security and performance in some cases for Elasticsearch users.
The document provides an overview of Netflix's approach to continuous delivery using their open source tools. It discusses how Netflix builds immutable infrastructure by baking software packages into pre-configured server images. It also describes how their build system tools like Gradle and Nebula plugins help standardize builds at scale. Finally, it outlines how tools like Eureka, Ribbon, and Asgard help enable ongoing deployment and management of cloud resources through concepts like service discovery and application clusters.
1. The document discusses how OpsWorks has made the presenter's life easier as a developer who also handles operations. OpsWorks provides hosted infrastructure on AWS for deploying applications using Chef recipes.
2. It describes the main structures in OpsWorks - stacks, layers, apps, and instances. Stacks represent entire applications, layers define different parts like web servers, apps contain specific settings, and instances define the servers.
3. The presenter discusses using OpsWorks with Ruby on Rails applications, including customizing Chef recipes, deploying code, and integrating other AWS services for monitoring, security, and scaling. While documentation can be confusing, OpsWorks provides an easy way for developers to manage operations.
Visual Studio 2010 and the .NET Framework 4 enhance support for parallel programming by providing a new runtime, new class library types, and new diagnostic tools. This presentation is all about parallel programming and its features.
Automation for Anyone at Nutanix NEXT 2017 USChris Wahl
Are you wondering how to solve repetitive tasks with software automation, but you struggle every time you see the word “developer” or “code?” Do you know what APIs are and how they make these tasks easy to solve? In this session, we’ll explore the framework that can apply to mundane tasks (i.e. PowerShell, Pester), and we’ll discuss what open source tools are available to help solve these problems. Walk away with the advice you need to get started!
The document outlines Viresh Doshi's top 10 Ansible tips based on his experience as a DevOps engineer. The tips include: adding a README to repositories to document purpose; unit testing roles with Molecule; using name and debug modules for readability and troubleshooting; connecting roles to Jenkins CI; using Python virtual environments; templating with Jinja2; ensuring idempotency; using variables consistently; splitting tasks across files; and sharing roles openly. Additional tips are to write modules, use filters, leverage Ansible Galaxy, manage inventories, and enjoy Python and coffee.
Scala Native is a tool that compiles Scala code to native machine code instead of the JVM bytecode. It provides faster startup times compared to JVM-based Scala code since it avoids the JIT compilation overhead. The speaker discusses how to set up a basic Scala Native project, the benefits of native compilation like small binary sizes and low memory usage. He also covers interoperability with C libraries via FFI and the current ecosystem support. As an example, the speaker demonstrates a 2D physics simulation game built with Scala Native, SDL and the Chipmunk physics engine.
15015 SRV318 Serverless Breakout Session Research at PNNL: Powered by AWS Pacific Northwest National Laboratory's rich data sciences capability has produced novel solutions in numerous research areas including image analysis, statistical modeling, and social media (and many more!). See how PNNL software engineers utilize AWS to enable better collaboration between researchers and engineers, and to power the data processing systems required to facilitate this work, with a focus on Lambda, EC2, S3, Apache Nifi and other technologies. Several approaches will be covered including lessons learned. AWS re:Invent 2017, Amazon, Giardinelli, Serverless, SRV318, EC2 11/28/2017 1:00:00 PM Tue Breakout Session
Research at PNNL: Powered by AWS - SRV318 - re:Invent 2017Amazon Web Services
Pacific Northwest National Laboratory's rich data sciences capability has produced novel solutions in numerous research areas including image analysis, statistical modeling, and social media (and many more!). See how PNNL software engineers utilize AWS to enable better collaboration between researchers and engineers, and to power the data processing systems required to facilitate this work, with a focus on Lambda, EC2, S3, Apache Nifi and other technologies. Several approaches will be covered including lessons learned.
This document provides an overview of an Amazon EKS hands-on workshop. It introduces the workshop agenda which includes deploying example microservices, logging with Elasticsearch Fluentd and Kibana, monitoring with Prometheus and Grafana, and continuous integration/continuous delivery using GitOps with Weave Flux. Key concepts covered are Kubernetes pods, services, deployments, container networking with CNI plugins, observability tools, and CI/CD approaches.
Fargate is a compute engine for containers that allows you to run and scale containerized applications without having to manage servers or clusters. With Fargate, you no longer have to provision, configure, and scale clusters of virtual machines to run containers. You simply specify your containerized application, and Fargate provisions the infrastructure and ensures the containers have compute capacity. Fargate automatically handles tasks like cluster management, node provisioning and replacement, load balancing, scaling and application health monitoring. This allows developers to focus on building applications rather than managing the underlying infrastructure.
IaC? VSTS to the rescue! Abbreviations explainedJeroen Niesen
This document discusses DevOps and infrastructure as code (IaC) using Azure Resource Manager. It begins with an overview of how Agile development processes led to the need for immutable infrastructure and DevOps. Infrastructure is now defined as code using ARM templates to ensure consistency and deployability. The document then outlines how IaC, DevOps tools like VSTS, and a continuous delivery pipeline can be used together for automated deployments in a production environment every sprint. It concludes by advertising an upcoming session on continuous delivery for IT professionals.
Distributed Model Validation with EpsilonSina Madani
Scalable performance is a major challenge with current model management tools. As the size and complexity of models and model management programs increases and the cost of computing falls, one solution for improving performance of model management programs is to perform computations on multiple computers. The developed prototype demonstrates a low-overhead data-parallel approach for distributed model validation in the context of an OCL-like language. The approach minimises communication costs by exploiting the deterministic structure of programs and can take advantage of multiple cores on each (heterogenous) machine with highly configurable computational granularity. Performance evaluation shows linear improvements with more machines and processor cores, being up to 340x faster than the baseline sequential program with 88 computers.
This document discusses Chef Cookbook workflow testing. It begins with an introduction to Chef and its core components like resources, recipes, and cookbooks. It then emphasizes the importance of testing infrastructure code like Chef cookbooks. Various testing techniques for Chef cookbooks are presented, including linting with Rubocop, style checking with FoodCritic, unit testing with ChefSpec, and integration testing using Test Kitchen. The document stresses treating infrastructure code like any other codebase by implementing practices like version control, continuous integration, and separation of concerns. It provides examples of implementing some of these testing techniques and outlines an example pipeline for testing and releasing Chef cookbooks.
This document discusses application delivery patterns used by REA. It begins with an agenda and mission statement. It then provides examples of "Hello World" programs in various languages. It discusses development and delivery lifecycles, including the use of pipelines. It describes characteristics of good pipelines and pipeline design considerations. It outlines REA's journey with application delivery on AWS and lessons learned, including the use of multiple accounts and decoupling deployment tools from applications. Key recommendations include deploying fully resolved artifacts, keeping metrics, and giving deployment teams response powers.
Similar to Cassandra Summit 2014: Astyanax — To Be or Not To Be (20)
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftDataStax Academy
Companies today are innovating with real-time data to deliver truly amazing customer experiences in the moment. Real-time data management for real-time customer experience is core to staying ahead of competition and driving revenue growth. Join Trays to learn how Comcast is differentiating itself from it's own historical reputation with Customer Experience strategies.
Introduction to DataStax Enterprise Graph DatabaseDataStax Academy
DataStax Enterprise (DSE) Graph is a built to manage, analyze, and search highly connected data. DSE Graph, built on NoSQL Apache Cassandra delivers continuous uptime along with predictable performance and scales for modern systems dealing with complex and constantly changing data.
Download DataStax Enterprise: Academy.DataStax.com/Download
Start free training for DataStax Enterprise Graph: Academy.DataStax.com/courses/ds332-datastax-enterprise-graph
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraDataStax Academy
DataStax Enterprise Advanced Replication supports one-way distributed data replication from remote database clusters that might experience periods of network or internet downtime. Benefiting use cases that require a 'hub and spoke' architecture.
Learn more at http://www.datastax.com/2016/07/stay-100-connected-with-dse-advanced-replication
Advanced Replication docs – https://docs.datastax.com/en/latest-dse/datastax_enterprise/advRep/advRepTOC.html
This document discusses using Docker containers to run Cassandra clusters at Walmart. It proposes transforming existing Cassandra hardware into containers to better utilize unused compute. It also suggests building new Cassandra clusters in containers and migrating old clusters to double capacity on existing hardware and save costs. Benchmark results show Docker containers outperforming virtual machines on OpenStack and Azure in terms of reads, writes, throughput and latency for an in-house application.
The document discusses the evolution of Cassandra's data modeling capabilities over different versions of CQL. It covers features introduced in each version such as user defined types, functions, aggregates, materialized views, and storage attached secondary indexes (SASI). It provides examples of how to create user defined types, functions, materialized views, and SASI indexes in CQL. It also discusses when each feature should and should not be used.
Cisco has a large global IT infrastructure supporting many applications, databases, and employees. The document discusses Cisco's existing customer service and commerce systems (CSCC/SMS3) and some of the performance, scalability, and user experience issues. It then presents a proposed new architecture using modern technologies like Elasticsearch, Cassandra, and microservices to address these issues and improve agility, performance, scalability, uptime, and the user interface.
Data Modeling is the one of the first things to sink your teeth into when trying out a new database. That's why we are going to cover this foundational topic in enough detail for you to get dangerous. Data Modeling for relational databases is more than a touch different than the way it's approached with Cassandra. We will address the quintessential query-driven methodology through a couple of different use cases, including working with time series data for IoT. We will also demo a new tool to get you bootstrapped quickly with MovieLens sample data. This talk should give you the basics you need to get serious with Apache Cassandra.
Hear about how Coursera uses Cassandra as the core of its scalable online education platform. I'll discuss the strengths of Cassandra that we leverage, as well as some limitations that you might run into as well in practice.
In the second part of this talk, we'll dive into how best to effectively use the Datastax Java drivers. We'll dig into how the driver is architected, and use this understanding to develop best practices to follow. I'll also share a couple of interesting bug we've run into at Coursera.
This document promotes Datastax Academy and Certification resources for learning Cassandra including a three step process of learning Cassandra, getting certified, and profiting. It lists community evangelists like Luke Tillman, Patrick McFadin, Jon Haddad, and Duy Hai Doan who can provide help and resources.
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonDataStax Academy
This document summarizes three presentations from a Cassandra Meetup:
1. Jason Cacciatore discussed monitoring Cassandra health at scale across hundreds of clusters and thousands of nodes using the reactive stream processing system Mantis.
2. Minh Do explained how Cassandra uses the gossip protocol for tasks like discovering cluster topology and sharing load information. Gossip also has limitations and race conditions that can cause problems.
3. Chris Kalantzis presented Cassandra Tickler, an open source tool he created to help repair operations that get stuck by running lightweight consistency checks on an old Cassandra version or a node with space issues.
Cassandra @ Sony: The good, the bad, and the ugly part 1DataStax Academy
This talk covers scaling Cassandra to a fast growing user base. Alex and Isaias will cover new best practices and how to work with the strengths and weaknesses of Cassandra at large scale. They will discuss how to adapt to bottlenecks while providing a rich feature set to the playstation community.
Cassandra @ Sony: The good, the bad, and the ugly part 2DataStax Academy
The document discusses Cassandra's use by Sony Network Entertainment to handle the large amount of user and transaction data from the growing PlayStation Network. It describes how the relational database they previously used did not scale sufficiently, so they transitioned to using Cassandra in a denormalized and customized way. Some of the techniques discussed include caching user data locally on application servers, secondary indexing, and using a real-time indexer to enable personalized search by friends.
This document provides guidance on setting up server monitoring, application metrics, log aggregation, time synchronization, replication strategies, and garbage collection for a Cassandra cluster. Key recommendations include:
1. Use monitoring tools like Monit, Munin, Nagios, or OpsCenter to monitor processes, disk usage, and system performance. Aggregate all logs centrally with tools like Splunk, Logstash, or Greylog.
2. Install NTP to synchronize server times which are critical for consistency.
3. Use the NetworkTopologyStrategy replication strategy and avoid SimpleStrategy for production.
4. Avoid shared storage and focus on low latency and high throughput using multiple local disks.
5. Understand
This document discusses real time analytics using Spark and Spark Streaming. It provides an introduction to Spark and highlights limitations of Hadoop for real-time analytics. It then describes Spark's advantages like in-memory processing and rich APIs. The document discusses Spark Streaming and the Spark Cassandra Connector. It also introduces DataStax Enterprise which integrates Spark, Cassandra and Solr to allow real-time analytics without separate clusters. Examples of streaming use cases and demos are provided.
Introduction to Data Modeling with Apache CassandraDataStax Academy
This document provides an introduction to data modeling with Apache Cassandra. It discusses how Cassandra data models are designed based on the queries an application will perform, unlike relational databases which are designed based on normalization rules. Key aspects covered include avoiding joins by denormalizing data, using a partition key to group related data on nodes, and controlling the clustering order of columns. The document provides examples of modeling time series and tag data in Cassandra.
The document discusses different data storage options for small, medium, and large datasets. It argues that relational databases do not scale well for large datasets due to limitations with replication, normalization, sharding, and high availability. The document then introduces Apache Cassandra as a fast, distributed, highly available, and linearly scalable database that addresses these limitations through its use of a hash ring architecture and tunable consistency levels. It describes Cassandra's key features including replication, compaction, and multi-datacenter support.
Enabling Search in your Cassandra Application with DataStax EnterpriseDataStax Academy
This document provides an overview of using Datastax Enterprise (DSE) Search to enable full-text search capabilities in Cassandra applications. It discusses how DSE Search integrates Solr/Lucene indexing with the Cassandra database to allow searching of application data without requiring a separate search cluster, external ETL processes, or custom application code for data management. The document also includes examples of different types of searches that can be performed, such as filtering, faceting, geospatial searches, and joins. It concludes with basic steps for getting started with DSE Search such as creating a Solr core and executing search queries using CQL.
The document discusses common bad habits that can occur when working with Apache Cassandra and provides recommendations to avoid them. Specifically, it addresses issues like sliding back into a relational mindset when the data model is different, improperly benchmarking Cassandra systems, having slow client performance, and neglecting important operations tasks. The presentation provides guidance on how to approach data modeling, querying, benchmarking, driver usage, and operations management in a Cassandra-oriented way.
This document provides an overview and examples of modeling data in Apache Cassandra. It begins with an introduction to thinking about data models and queries before modeling, and emphasizes that Cassandra requires modeling around queries due to its limitations on joins and indexes. The document then provides examples of modeling user, video, and other entity data for a video sharing application to support common queries. It also discusses techniques for handling queries that could become hotspots, such as bucketing or adding random values. The examples illustrate best practices for data duplication, materialized views, and time series data storage in Cassandra.
The document discusses best practices for using Apache Cassandra, including:
- Topology considerations like replication strategies and snitches
- Booting new datacenters and replacing nodes
- Security techniques like authentication, authorization, and SSL encryption
- Using prepared statements for efficiency
- Asynchronous execution for request pipelining
- Batch statements and their appropriate uses
- Improving performance through techniques like the new row cache
Monitoring and Managing Anomaly Detection on OpenShift.pdfTosin Akinosho
Monitoring and Managing Anomaly Detection on OpenShift
Overview
Dive into the world of anomaly detection on edge devices with our comprehensive hands-on tutorial. This SlideShare presentation will guide you through the entire process, from data collection and model training to edge deployment and real-time monitoring. Perfect for those looking to implement robust anomaly detection systems on resource-constrained IoT/edge devices.
Key Topics Covered
1. Introduction to Anomaly Detection
- Understand the fundamentals of anomaly detection and its importance in identifying unusual behavior or failures in systems.
2. Understanding Edge (IoT)
- Learn about edge computing and IoT, and how they enable real-time data processing and decision-making at the source.
3. What is ArgoCD?
- Discover ArgoCD, a declarative, GitOps continuous delivery tool for Kubernetes, and its role in deploying applications on edge devices.
4. Deployment Using ArgoCD for Edge Devices
- Step-by-step guide on deploying anomaly detection models on edge devices using ArgoCD.
5. Introduction to Apache Kafka and S3
- Explore Apache Kafka for real-time data streaming and Amazon S3 for scalable storage solutions.
6. Viewing Kafka Messages in the Data Lake
- Learn how to view and analyze Kafka messages stored in a data lake for better insights.
7. What is Prometheus?
- Get to know Prometheus, an open-source monitoring and alerting toolkit, and its application in monitoring edge devices.
8. Monitoring Application Metrics with Prometheus
- Detailed instructions on setting up Prometheus to monitor the performance and health of your anomaly detection system.
9. What is Camel K?
- Introduction to Camel K, a lightweight integration framework built on Apache Camel, designed for Kubernetes.
10. Configuring Camel K Integrations for Data Pipelines
- Learn how to configure Camel K for seamless data pipeline integrations in your anomaly detection workflow.
11. What is a Jupyter Notebook?
- Overview of Jupyter Notebooks, an open-source web application for creating and sharing documents with live code, equations, visualizations, and narrative text.
12. Jupyter Notebooks with Code Examples
- Hands-on examples and code snippets in Jupyter Notebooks to help you implement and test anomaly detection models.
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceIndexBug
Imagine a world where machines not only perform tasks but also learn, adapt, and make decisions. This is the promise of Artificial Intelligence (AI), a technology that's not just enhancing our lives but revolutionizing entire industries.
Webinar: Designing a schema for a Data WarehouseFederico Razzoli
Are you new to data warehouses (DWH)? Do you need to check whether your data warehouse follows the best practices for a good design? In both cases, this webinar is for you.
A data warehouse is a central relational database that contains all measurements about a business or an organisation. This data comes from a variety of heterogeneous data sources, which includes databases of any type that back the applications used by the company, data files exported by some applications, or APIs provided by internal or external services.
But designing a data warehouse correctly is a hard task, which requires gathering information about the business processes that need to be analysed in the first place. These processes must be translated into so-called star schemas, which means, denormalised databases where each table represents a dimension or facts.
We will discuss these topics:
- How to gather information about a business;
- Understanding dictionaries and how to identify business entities;
- Dimensions and facts;
- Setting a table granularity;
- Types of facts;
- Types of dimensions;
- Snowflakes and how to avoid them;
- Expanding existing dimensions and facts.
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
OpenID AuthZEN Interop Read Out - AuthorizationDavid Brossard
During Identiverse 2024 and EIC 2024, members of the OpenID AuthZEN WG got together and demoed their authorization endpoints conforming to the AuthZEN API
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Speck&Tech
ABSTRACT: A prima vista, un mattoncino Lego e la backdoor XZ potrebbero avere in comune il fatto di essere entrambi blocchi di costruzione, o dipendenze di progetti creativi e software. La realtà è che un mattoncino Lego e il caso della backdoor XZ hanno molto di più di tutto ciò in comune.
Partecipate alla presentazione per immergervi in una storia di interoperabilità, standard e formati aperti, per poi discutere del ruolo importante che i contributori hanno in una comunità open source sostenibile.
BIO: Sostenitrice del software libero e dei formati standard e aperti. È stata un membro attivo dei progetti Fedora e openSUSE e ha co-fondato l'Associazione LibreItalia dove è stata coinvolta in diversi eventi, migrazioni e formazione relativi a LibreOffice. In precedenza ha lavorato a migrazioni e corsi di formazione su LibreOffice per diverse amministrazioni pubbliche e privati. Da gennaio 2020 lavora in SUSE come Software Release Engineer per Uyuni e SUSE Manager e quando non segue la sua passione per i computer e per Geeko coltiva la sua curiosità per l'astronomia (da cui deriva il suo nickname deneb_alpha).
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfChart Kalyan
A Mix Chart displays historical data of numbers in a graphical or tabular form. The Kalyan Rajdhani Mix Chart specifically shows the results of a sequence of numbers over different periods.
Digital Marketing Trends in 2024 | Guide for Staying AheadWask
https://www.wask.co/ebooks/digital-marketing-trends-in-2024
Feeling lost in the digital marketing whirlwind of 2024? Technology is changing, consumer habits are evolving, and staying ahead of the curve feels like a never-ending pursuit. This e-book is your compass. Dive into actionable insights to handle the complexities of modern marketing. From hyper-personalization to the power of user-generated content, learn how to build long-term relationships with your audience and unlock the secrets to success in the ever-shifting digital landscape.
Main news related to the CCS TSI 2023 (2023/1695)Jakub Marek
An English 🇬🇧 translation of a presentation to the speech I gave about the main changes brought by CCS TSI 2023 at the biggest Czech conference on Communications and signalling systems on Railways, which was held in Clarion Hotel Olomouc from 7th to 9th November 2023 (konferenceszt.cz). Attended by around 500 participants and 200 on-line followers.
The original Czech 🇨🇿 version of the presentation can be found here: https://www.slideshare.net/slideshow/hlavni-novinky-souvisejici-s-ccs-tsi-2023-2023-1695/269688092 .
The videorecording (in Czech) from the presentation is available here: https://youtu.be/WzjJWm4IyPk?si=SImb06tuXGb30BEH .
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
Taking AI to the Next Level in Manufacturing.pdfssuserfac0301
Read Taking AI to the Next Level in Manufacturing to gain insights on AI adoption in the manufacturing industry, such as:
1. How quickly AI is being implemented in manufacturing.
2. Which barriers stand in the way of AI adoption.
3. How data quality and governance form the backbone of AI.
4. Organizational processes and structures that may inhibit effective AI adoption.
6. Ideas and approaches to help build your organization's AI strategy.
Ivanti’s Patch Tuesday breakdown goes beyond patching your applications and brings you the intelligence and guidance needed to prioritize where to focus your attention first. Catch early analysis on our Ivanti blog, then join industry expert Chris Goettl for the Patch Tuesday Webinar Event. There we’ll do a deep dive into each of the bulletins and give guidance on the risks associated with the newly-identified vulnerabilities.
Project Management Semester Long Project - Acuityjpupo2018
Acuity is an innovative learning app designed to transform the way you engage with knowledge. Powered by AI technology, Acuity takes complex topics and distills them into concise, interactive summaries that are easy to read & understand. Whether you're exploring the depths of quantum mechanics or seeking insight into historical events, Acuity provides the key information you need without the burden of lengthy texts.
4. Top Level Features
• Load balancing
• Connection pooling - resilient with failover and retries
• Pluggable host discovery mechanism
• Metrics (there are a lot!)
• Highly configurable and pluggable
15. No wait! Astyanax has a new adaptor
• Built on top of Java Driver
• All your queries are now async
• Astyanax APIs (structured queries) are supported
• All Astyanax recipes work
18. What About Performance?
If you use prepared statements, you’re good!
More findings on our blog
http://techblog.netflix.com/2013/12/astyanax-update.html
19. Prepared Statements
• There is no magic here
• You “prepare”
• Then you re-use
For generic DAOs this means - prepared statement
management.
20. Astyanax Value Add
Structured Queries Naturally Have Some
Query Signature
keyspace.prepareQuery( myCF )!
.withRow( myRowKey )!
.withColumnSlice( start, end)!
.execute();!
Translates to
select * from ks.myCF where key=? and column1 =? and
column1 =?;
21. But wait! Something is not right with the new
model
• Are columns
really columns?
• Are rows still
rows?
• Are columns
really rows?
22. Simple Schema
key validator – int
col comparator – int
default validator – utf8
24. Astyanax preserves original semantics
• Astyanax maintains backwards compatibility
with the api
• Hence rows are still rows and columns are
still columns
25. Takeaway for Astyanax
Apps really care about high level abstractions
• Time series
• Sliding window
• Objects with attribute based indexing
26. So What Should I Use?
• Astyanax presents a higher level abstraction
• Astyanax has recipes
• Astyanax is good for structured queries
• If all you want is CQL3, then use Java Driver