What is it about? In-Stream Event Processing is a new approach for building near real time big data systems with rapidly growing user base and applications like clickstream analytics, preventive maintenance or fraud detection. Maturity of some open source projects enables building an enterprise grade In-Stream Processing service in-house. However the open source world comprises of many competing projects of different maturity, having different perspectives so the task to select effective and efficient projects is not straightforward. In the talk I’ll present a blueprint of an In-Stream Processing Service, enterprise grade reliable and scalable, cloud ready, build from 100% open source components.
How the Big Data of APM can Supercharge DevOpsCA Technologies
In the age where applications reign supreme, your organizations must be agile in application performance management and app development in order to meet the market demands and stay competitive. Even with mature APM solutions, developer, test and operations teams are strained by operational complexity, accelerated release schedules, and big data challenges to quickly find the root cause of issues affecting end user experience.
The power of advanced analytics and data science can help us make the most of the vast cache of APM data we collect and help our DevOps teams supercharge user experience. It’s time to take some of the load off of our humans and let technology make it easier to focus on meaningful changes in user, application and system behavior. Analytics are becoming a valuable component of APM solutions to redefine triage, improve application quality, and delight the end-user.
In a webcast on August 7th, 2014, Ken Godskind, Chief blogger and Analyst, APMExaminer.com shared how the big data of APM can supercharge your DevOps transformation. Chris Kline, Senior Director, CA Technologies followed Ken and discussed how the Advanced Behavior Analytics capability of CA APM can assist in this journey.
Ken and Chris used this slide set during the webcast which can be viewed at http://goo.gl/TZYEuq
Cloud-Native Data: What data questions to ask when building cloud-native appsVMware Tanzu
While a number of patterns and architectural guidelines exist for cloud-native applications, a discussion about data often leads to more questions than answers. For example, what are some of the typical data problems encountered, why are they different, and how can they be overcome?
Join Prasad Radhakrishnan from Pivotal and Dave Nielsen from Redis Labs as they discuss:
- Expectations and requirements of cloud-native data
- Common faux pas and strategies on how you can avoid them
Presenters:
Prasad Radhakrishnan, Platform Architecture for Data at Pivotal
Dave Nielsen, Head of Ecosystem Programs at Redis Labs
Cloud-Native Fundamentals: Accelerating Development with Continuous IntegrationVMware Tanzu
DevOps. Microservices. Containers. These terms have a lot of buzz for their role in cloud-native application development and operations. But, if you haven't automated your tests and builds with continuous integration (CI), none of them matter.
Continuous integration is the automation of building and testing new code. Development teams that use CI can catch bugs early and often; resulting in code that is always production ready. Compared to manual testing, CI eliminates a lot of toil and improves code quality. At the end of the day, it's those code defects that slip into production that slow down teams and cause apps to fall over.
The journey to continuous integration maturity has some requirements. Join Pivotal's James Ma, product manager for Concourse, and Dormain Drewitz, product marketing to learn about:
- How Test-Driven Development feeds the CI process
- What is different about CI in a cloud-native context
- How to measure progress and success in adopting CI
Dormain is a Senior Director of Product and Customer Marketing with Pivotal. She has published extensively on cloud computing topics for ten years, demystifying the changing requirements of the infrastructure software stack. She’s presented at the Gartner Application Architecture, Development, and Integration Summit; Open Source Summit; Cloud Foundry Summit, and numerous software user events.
James Ma is a product manager at Pivotal and is based out of their office in Toronto, Canada. As a consultant for the Pivotal Labs team, James worked with Fortune 500 companies to hone their agile software development practices and adopt a user-centered approach to product development. He has worked with companies across multiple industries including: mobile e-commerce, finance, heath and hospitality. James is currently a part of the Pivotal Cloud Foundry R&D group and is the product manager for Concourse CI, the continuous "thing do-er".
Presenters : Dormain Drewitz & James Ma, Pivotal
Creating Complete Test Environments in the Cloud: Skytap & Parasoft WebinarSkytap Cloud
By utilizing virtualization technology in the SDLC, specifically service virtualization and virtual dev/test labs, companies can increase test coverage in less time and ultimately produce better software faster.
Download this complimentary webinar from Skytap and Parasoft now and learn how to to combine service virtualization with cloud-based dev/test environments.
How the Big Data of APM can Supercharge DevOpsCA Technologies
In the age where applications reign supreme, your organizations must be agile in application performance management and app development in order to meet the market demands and stay competitive. Even with mature APM solutions, developer, test and operations teams are strained by operational complexity, accelerated release schedules, and big data challenges to quickly find the root cause of issues affecting end user experience.
The power of advanced analytics and data science can help us make the most of the vast cache of APM data we collect and help our DevOps teams supercharge user experience. It’s time to take some of the load off of our humans and let technology make it easier to focus on meaningful changes in user, application and system behavior. Analytics are becoming a valuable component of APM solutions to redefine triage, improve application quality, and delight the end-user.
In a webcast on August 7th, 2014, Ken Godskind, Chief blogger and Analyst, APMExaminer.com shared how the big data of APM can supercharge your DevOps transformation. Chris Kline, Senior Director, CA Technologies followed Ken and discussed how the Advanced Behavior Analytics capability of CA APM can assist in this journey.
Ken and Chris used this slide set during the webcast which can be viewed at http://goo.gl/TZYEuq
Cloud-Native Data: What data questions to ask when building cloud-native appsVMware Tanzu
While a number of patterns and architectural guidelines exist for cloud-native applications, a discussion about data often leads to more questions than answers. For example, what are some of the typical data problems encountered, why are they different, and how can they be overcome?
Join Prasad Radhakrishnan from Pivotal and Dave Nielsen from Redis Labs as they discuss:
- Expectations and requirements of cloud-native data
- Common faux pas and strategies on how you can avoid them
Presenters:
Prasad Radhakrishnan, Platform Architecture for Data at Pivotal
Dave Nielsen, Head of Ecosystem Programs at Redis Labs
Cloud-Native Fundamentals: Accelerating Development with Continuous IntegrationVMware Tanzu
DevOps. Microservices. Containers. These terms have a lot of buzz for their role in cloud-native application development and operations. But, if you haven't automated your tests and builds with continuous integration (CI), none of them matter.
Continuous integration is the automation of building and testing new code. Development teams that use CI can catch bugs early and often; resulting in code that is always production ready. Compared to manual testing, CI eliminates a lot of toil and improves code quality. At the end of the day, it's those code defects that slip into production that slow down teams and cause apps to fall over.
The journey to continuous integration maturity has some requirements. Join Pivotal's James Ma, product manager for Concourse, and Dormain Drewitz, product marketing to learn about:
- How Test-Driven Development feeds the CI process
- What is different about CI in a cloud-native context
- How to measure progress and success in adopting CI
Dormain is a Senior Director of Product and Customer Marketing with Pivotal. She has published extensively on cloud computing topics for ten years, demystifying the changing requirements of the infrastructure software stack. She’s presented at the Gartner Application Architecture, Development, and Integration Summit; Open Source Summit; Cloud Foundry Summit, and numerous software user events.
James Ma is a product manager at Pivotal and is based out of their office in Toronto, Canada. As a consultant for the Pivotal Labs team, James worked with Fortune 500 companies to hone their agile software development practices and adopt a user-centered approach to product development. He has worked with companies across multiple industries including: mobile e-commerce, finance, heath and hospitality. James is currently a part of the Pivotal Cloud Foundry R&D group and is the product manager for Concourse CI, the continuous "thing do-er".
Presenters : Dormain Drewitz & James Ma, Pivotal
Creating Complete Test Environments in the Cloud: Skytap & Parasoft WebinarSkytap Cloud
By utilizing virtualization technology in the SDLC, specifically service virtualization and virtual dev/test labs, companies can increase test coverage in less time and ultimately produce better software faster.
Download this complimentary webinar from Skytap and Parasoft now and learn how to to combine service virtualization with cloud-based dev/test environments.
Capgemini: Observability within the Dutch governmentElasticsearch
The digital landscape within Dutch government is a complex and heterogeneous mix of technologies. Within this scenario, Capgemini is tasked with continuous integration and maintenance of key infrastructure. The results connect major organizational parts of the country with a large volume of daily traffic. To keep the lights on in operation and allow for quick turn-around times, Elastic is the dominant choice for generating reliable insight. It facilitates a thorough insight into the inner workings of modern amalgamated java deployments, databases and legacy systems spanning a multitude of decades.
SoftwareCircus 2020 "The Past, Present, and Future of Cloud Native API Gateways"Daniel Bryant
An API gateway is at the core of how APIs are managed, secured, and presented within any web-based system. Although the technology has been in use for many years, it has not always kept pace with recent developments within the cloud native space, and many engineers are confused about how a cloud native API gateway relates to Kubernetes Ingress or a Service load balancer.
Join this session to learn about:
– The evolution of API gateways over the past ten years, and how the original problems they were solving have shifted in relation to cloud native technologies and workflow
– Current challenges of using an API gateway within Kubernetes: scaling the developer workflow; and supporting multiple architecture styles and protocols
– Strategies for exposing Kubernetes services and APIs at the edge of your system
– A brief guide to the (potential) future of cloud native API gateways
DevOpsCon 2020: The Past, Present, and Future of Cloud Native API GatewaysDaniel Bryant
An API gateway is at the core of how APIs are managed, secured, and presented within any web-based system. Although the technology has been in use for many years, it has not always kept pace with recent developments within the cloud native space, and many engineers are confused about how a cloud native API gateway relates to Kubernetes Ingress or a Service load balancer.
Join this session to learn about:
– The evolution of API gateways over the past ten years, and how the original problems they were solving have shifted in relation to cloud native technologies and workflow
– Current challenges of using an API gateway within Kubernetes: scaling the developer workflow; and supporting multiple architecture styles and protocols
– Strategies for exposing Kubernetes services and APIs at the edge of your system
– A brief guide to the (potential) future of cloud native API gateways
Building Reactive Applications With Node.Js And Red Hat JBoss Data Grid (Gald...Red Hat Developers
Node.js is a very popular framework for developing asynchronous, event-driven, reactive applications. Red Hat JBoss Data Grid, an in-memory distributed database designed for fast access to large volumes of data and scalability, has recently gained compatibility with Node.js letting reactive applications use it as a persistence layer. Thanks to near caching, JBoss Data Grid offers excellent response times for data queried regularly, and its continuous remote event support means data can get pushed from the data grid to the Node.js application instead of having to wait for the data grid to serve it. In this session, we'll show how to build Node.js applications that use JBoss Data Grid as a persistence layer.
Achieving scale and performance using cloud native environmentRakuten Group, Inc.
ID Platform Product can be used by every Rakuten Group Companies and can easily serve millions of users. Multi-Region product challenges are many, example:
- Ensure 4 9’s availability
- Management across each region
- Alerting and Monitoring across each region
- Auto scaling (Scale up and Scale down) across each region
- Performance (vertical scale up)
- Cost
- DB Consistency Across Multiple Regions
- Resiliency
At Ecosystem Platform Layer for Rakuten, we handle each of these and this presentation is about how we handle these challenging scenarios.
Using Apache Spark for Predicting Degrading and Failing Parts in AviationDatabricks
Throughout naval aviation, data lakes provide the raw material for generating insights into predictive maintenance and increasing readiness across many platforms. Successfully leveraging these data lakes can be technically challenging.
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)DevOps.com
As containers and Kubernetes are adopted in production, security is a critical concern and DevOps teams need to go beyond image scanning. Use cases such as runtime security, network visibility and segmentation, incident response and compliance become priorities as your Kubernetes security framework matures.
In this talk, we’ll share an overview of runtime security, discuss approaches used by open source and commercial tools, and hear how users are getting started quickly without impacting developer productivity.
This presentation was given at the London Nutanix user group (NUG) on Oct 26 by Grant Friend. If you would like to join a NUG, you can find more information here http://bit.ly/NTNXUG - Hope to see you at a community meeting!
Download link - https://www.kovair.com/adapters/gitlab-integration-adapter/
The Kovair Omnibus Adapter for GitLab helps enrich the workflows in the ALM ecosystem and boost productivity for the entire DevOps chain. It supports the integration of GitLab with multiple tools in the ALM and DevOps ecosystem.
The adapter helps bridge the gap between multiple teams by bi-directionally flowing issues from GitLab to separate purpose-built tools. As the issue is present in both the systems, it becomes easier to keep the information up-to-date in a seamless manner.
Digital Transformation: Highly Resilient Streaming Architecture and StrategiesHostedbyConfluent
Failure is inevitable in any distributed system but anticipating failures and building systems to recover from failures instantaneous makes the system highly resilient. At Capital One we process billions of events everyday and we leverage cloud, microservices, streaming and machine learning technologies to solve customer problems and provide the best customer experience.
As part of this session I will be talking about highly resilient streaming architecture that is supporting processing of billions of events every day then some of the strategies & best practices to build highly available and fault-tolerant systems utilizing Kafka and Cloud environments.
dA Platform is a production-ready platform for stream processing with Apache Flink®. The Platform includes open source Apache Flink, a stateful stream processing and event-driven application framework, and dA Application Manager, a central deployment and management component. dA Platform schedules clusters on Kubernetes, deploys stateful Flink applications, and controls these applications and their state.
Moving from application automation to true DevOps by including the databaseRed Gate Software
As software teams face increasing pressure to speed up the delivery cycle, the DevOps approach is gaining momentum. But the database is commonly a bottleneck. This presentation will explore how you can include the database in continuous integration and automated release management. Sharing best practices from high-performing IT organisations, we will discuss how you can extend DevOps practices to the database; increasing productivity, agility and optimizing performance.
We are back with another set of interesting topics. Want to learn all about processing high data volumes with an ERP? We got you covered! Or maybe your heart is more at the Front-End and you want to know how to use SASS with OutSystems? We also got you covered.
Эталонная архитектура сервиса из компонентов со 100% открытым исходным кодом, готового к развертыванию в облаках, с масштабируемостью и надежностью уровня предприятия.
Антон Овчинников, Grid Dynamics
Capgemini: Observability within the Dutch governmentElasticsearch
The digital landscape within Dutch government is a complex and heterogeneous mix of technologies. Within this scenario, Capgemini is tasked with continuous integration and maintenance of key infrastructure. The results connect major organizational parts of the country with a large volume of daily traffic. To keep the lights on in operation and allow for quick turn-around times, Elastic is the dominant choice for generating reliable insight. It facilitates a thorough insight into the inner workings of modern amalgamated java deployments, databases and legacy systems spanning a multitude of decades.
SoftwareCircus 2020 "The Past, Present, and Future of Cloud Native API Gateways"Daniel Bryant
An API gateway is at the core of how APIs are managed, secured, and presented within any web-based system. Although the technology has been in use for many years, it has not always kept pace with recent developments within the cloud native space, and many engineers are confused about how a cloud native API gateway relates to Kubernetes Ingress or a Service load balancer.
Join this session to learn about:
– The evolution of API gateways over the past ten years, and how the original problems they were solving have shifted in relation to cloud native technologies and workflow
– Current challenges of using an API gateway within Kubernetes: scaling the developer workflow; and supporting multiple architecture styles and protocols
– Strategies for exposing Kubernetes services and APIs at the edge of your system
– A brief guide to the (potential) future of cloud native API gateways
DevOpsCon 2020: The Past, Present, and Future of Cloud Native API GatewaysDaniel Bryant
An API gateway is at the core of how APIs are managed, secured, and presented within any web-based system. Although the technology has been in use for many years, it has not always kept pace with recent developments within the cloud native space, and many engineers are confused about how a cloud native API gateway relates to Kubernetes Ingress or a Service load balancer.
Join this session to learn about:
– The evolution of API gateways over the past ten years, and how the original problems they were solving have shifted in relation to cloud native technologies and workflow
– Current challenges of using an API gateway within Kubernetes: scaling the developer workflow; and supporting multiple architecture styles and protocols
– Strategies for exposing Kubernetes services and APIs at the edge of your system
– A brief guide to the (potential) future of cloud native API gateways
Building Reactive Applications With Node.Js And Red Hat JBoss Data Grid (Gald...Red Hat Developers
Node.js is a very popular framework for developing asynchronous, event-driven, reactive applications. Red Hat JBoss Data Grid, an in-memory distributed database designed for fast access to large volumes of data and scalability, has recently gained compatibility with Node.js letting reactive applications use it as a persistence layer. Thanks to near caching, JBoss Data Grid offers excellent response times for data queried regularly, and its continuous remote event support means data can get pushed from the data grid to the Node.js application instead of having to wait for the data grid to serve it. In this session, we'll show how to build Node.js applications that use JBoss Data Grid as a persistence layer.
Achieving scale and performance using cloud native environmentRakuten Group, Inc.
ID Platform Product can be used by every Rakuten Group Companies and can easily serve millions of users. Multi-Region product challenges are many, example:
- Ensure 4 9’s availability
- Management across each region
- Alerting and Monitoring across each region
- Auto scaling (Scale up and Scale down) across each region
- Performance (vertical scale up)
- Cost
- DB Consistency Across Multiple Regions
- Resiliency
At Ecosystem Platform Layer for Rakuten, we handle each of these and this presentation is about how we handle these challenging scenarios.
Using Apache Spark for Predicting Degrading and Failing Parts in AviationDatabricks
Throughout naval aviation, data lakes provide the raw material for generating insights into predictive maintenance and increasing readiness across many platforms. Successfully leveraging these data lakes can be technically challenging.
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)DevOps.com
As containers and Kubernetes are adopted in production, security is a critical concern and DevOps teams need to go beyond image scanning. Use cases such as runtime security, network visibility and segmentation, incident response and compliance become priorities as your Kubernetes security framework matures.
In this talk, we’ll share an overview of runtime security, discuss approaches used by open source and commercial tools, and hear how users are getting started quickly without impacting developer productivity.
This presentation was given at the London Nutanix user group (NUG) on Oct 26 by Grant Friend. If you would like to join a NUG, you can find more information here http://bit.ly/NTNXUG - Hope to see you at a community meeting!
Download link - https://www.kovair.com/adapters/gitlab-integration-adapter/
The Kovair Omnibus Adapter for GitLab helps enrich the workflows in the ALM ecosystem and boost productivity for the entire DevOps chain. It supports the integration of GitLab with multiple tools in the ALM and DevOps ecosystem.
The adapter helps bridge the gap between multiple teams by bi-directionally flowing issues from GitLab to separate purpose-built tools. As the issue is present in both the systems, it becomes easier to keep the information up-to-date in a seamless manner.
Digital Transformation: Highly Resilient Streaming Architecture and StrategiesHostedbyConfluent
Failure is inevitable in any distributed system but anticipating failures and building systems to recover from failures instantaneous makes the system highly resilient. At Capital One we process billions of events everyday and we leverage cloud, microservices, streaming and machine learning technologies to solve customer problems and provide the best customer experience.
As part of this session I will be talking about highly resilient streaming architecture that is supporting processing of billions of events every day then some of the strategies & best practices to build highly available and fault-tolerant systems utilizing Kafka and Cloud environments.
dA Platform is a production-ready platform for stream processing with Apache Flink®. The Platform includes open source Apache Flink, a stateful stream processing and event-driven application framework, and dA Application Manager, a central deployment and management component. dA Platform schedules clusters on Kubernetes, deploys stateful Flink applications, and controls these applications and their state.
Moving from application automation to true DevOps by including the databaseRed Gate Software
As software teams face increasing pressure to speed up the delivery cycle, the DevOps approach is gaining momentum. But the database is commonly a bottleneck. This presentation will explore how you can include the database in continuous integration and automated release management. Sharing best practices from high-performing IT organisations, we will discuss how you can extend DevOps practices to the database; increasing productivity, agility and optimizing performance.
We are back with another set of interesting topics. Want to learn all about processing high data volumes with an ERP? We got you covered! Or maybe your heart is more at the Front-End and you want to know how to use SASS with OutSystems? We also got you covered.
Эталонная архитектура сервиса из компонентов со 100% открытым исходным кодом, готового к развертыванию в облаках, с масштабируемостью и надежностью уровня предприятия.
Антон Овчинников, Grid Dynamics
Open Blueprint for Real-Time Analytics in Retail: Strata Hadoop World 2017 S...Grid Dynamics
This presentation outlines key business drivers for real-time analytics applications in retail and describes the emerging architectures based on In-Stream Processing (ISP) technologies. The slides present a complete open blueprint for an ISP platform - including a demo application for real-time Twitter Sentiment Analytics - designed with 100% open source components and deployable to any cloud.
To learn more, read an adjoining blog series on this topic here : https://blog.griddynamics.com/in-stream-processing-service-blueprint
How should IT architecture look like in the ages of Cloud and Devops? How do the changes and the technological evolution of the last years affect our systems and processes? Why is innovation and digitalization important? What means 'cloud-native', how should I build my solutions? To anwer these questions we will look at the "Big Five": Microservices and Containers, cloud and DevOps and finally BigData, IoT and last-but-not-least artificial intelligence.
Best Practices for Streaming IoT Data with MQTT and Apache KafkaKai Wähner
Organizations today are looking to stream IoT data to Apache Kafka. However, connecting tens of thousands or even millions of devices over unreliable networks can create some architecture challenges. In this session, we will identify and demo some best practices for implementing a large scale IoT system that can stream MQTT messages to Apache Kafka.
We use HiveMQ as open source MQTT broker to ingest data from IoT devices, ingest the data in real time into an Apache Kafka cluster for preprocessing (using Kafka Streams / KSQL), and model training + inference (using TensorFlow 2.0 and its TensorFlow I/O Kafka plugin).
We leverage additional enterprise components from HiveMQ and Confluent to allow easy operations, scalability and monitoring.
Assessing New Databases– Translytical Use CasesDATAVERSITY
Organizations run their day-in-and-day-out businesses with transactional applications and databases. On the other hand, organizations glean insights and make critical decisions using analytical databases and business intelligence tools.
The transactional workloads are relegated to database engines designed and tuned for transactional high throughput. Meanwhile, the big data generated by all the transactions require analytics platforms to load, store, and analyze volumes of data at high speed, providing timely insights to businesses.
Thus, in conventional information architectures, this requires two different database architectures and platforms: online transactional processing (OLTP) platforms to handle transactional workloads and online analytical processing (OLAP) engines to perform analytics and reporting.
Today, a particular focus and interest of operational analytics includes streaming data ingest and analysis in real time. Some refer to operational analytics as hybrid transaction/analytical processing (HTAP), translytical, or hybrid operational analytic processing (HOAP). We’ll address if this model is a way to create efficiencies in our environments.
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Dataconomy Media
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder of DataTorrent presented "Streaming Analytics with Apache Apex" as part of the Big Data, Berlin v 8.0 meetup organised on the 14th of July 2016 at the WeWork headquarters.
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Precisely
Tackling the challenge of designing a machine learning model and putting it into production is the key to getting value back – and the roadblock that stops many promising machine learning projects. After the data scientists have done their part, engineering robust production data pipelines has its own set of challenges. Syncsort software helps the data engineer every step of the way.
Building on the process of finding and matching duplicates to resolve entities, the next step is to set up a continuous streaming flow of data from data sources so that as the sources change, new data automatically gets pushed through the same transformation and cleansing data flow – into the arms of machine learning models.
Some of your sources may already be streaming, but the rest are sitting in transactional databases that change hundreds or thousands of times a day. The challenge is that you can’t affect performance of data sources that run key applications, so putting something like database triggers in place is not the best idea. Using Apache Kafka or similar technologies as the backbone to moving data around doesn’t solve the problem of needing to grab changes from the source pushing them into Kafka and consuming the data from Kafka to be processed. If something unexpected happens – like connectivity is lost on either the source or the target side, you don’t want to have to fix it or start over because the data is out of sync.
View this 15-minute webcast on-demand to learn how to tackle these challenges in large scale production implementations.
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
Whether to take data ingestion cycles off the ETL tool and the data warehouse or to facilitate competitive Data Science and building algorithms in the organization, the data lake – a place for unmodeled and vast data – will be provisioned widely in 2020.
Though it doesn’t have to be complicated, the data lake has a few key design points that are critical, and it does need to follow some principles for success. Avoid building the data swamp, but not the data lake! The tool ecosystem is building up around the data lake and soon many will have a robust lake and data warehouse. We will discuss policy to keep them straight, send data to its best platform, and keep users’ confidence up in their data platforms.
Data lakes will be built in cloud object storage. We’ll discuss the options there as well.
Get this data point for your data lake journey.
Oracle in the 2014 edition of its Open World rolled out new database public cloud service with its DBaaS offerings, but this is just a piece in each company's technological architecture. Businesses still have the need to create a Private cloud and discover the driver to create it; Wether it is a measured service,consolidation or rapid provisioning, finding this driver will be the initial building block for it. This presentation will give you an insight on how a Private Cloud is architected, how the service catalog is the most important brick and how get the benefit of this upcoming era of Databases.
Building a Real-Time Security Application Using Log Data and Machine Learning...Sri Ambati
Building a Real-Time Security Application Using Log Data and Machine Learning- Karthik Aaravabhoomi
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
From Data to Services at the Speed of BusinessAli Hodroj
From Data to Services at the Speed of Business: Applying cloud-native paradigm to combine fast data analytics with microservices architecture for hybrid workloads.
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®confluent
Watch this talk here: https://www.confluent.io/online-talks/best-practices-for-streaming-iot-data-with-MQTT-and-apache-kafka-on-demand
Organizations today are looking to stream IoT data to Apache Kafka. However, connecting tens of thousands or even millions of devices over unreliable networks can create some architecture challenges.
In this session, we will identify and demo some best practices for implementing a large scale IoT system that can stream MQTT messages to Apache Kafka.
How to scale your PaaS with OVH infrastructure?OVHcloud
ForePaaS has developed an “as-a-service” platform which lets you automate an infrastructure designed for analytical applications. The company has formed a cloud partnership with OVH in order to deliver flexible solutions for containerised and high-performance tools, such as Kunernetes and Docker.
VMworld 2013: Separating Cloud Hype from Reality in Healthcare – a Real-Life ...VMworld
VMworld 2013
Tim Graf, VMware
Matthew Ritchart, Health Management Associates
Learn more about VMworld and register at http://www.vmworld.com/index.jspa?src=socmed-vmworld-slideshare
Similar to In-Stream Processing Service Blueprint, Reference architecture for real-time Big Data applications (20)
Industry-leading brands have adapted to the ever-changing business environment and consumer expectations significantly faster than the competition. The key driving factor behind this speed-to-market is the adoption of intelligent cloud-native digital commerce architectures. In our talk, we will describe the best practices and case studies of achieving business agility that is imperative to excel in the post-pandemic era.
"Implementing data quality automation with open source stack" - Max Martynov,...Grid Dynamics
The quality of business decisions, machine learning insights, and executive reports depend on the quality and integrity of the underlying data. There are many ways that data can get corrupted in an analytical data platform from de-synchronization with the system-of-record to defects in data pipelines. We will show how to detect and prevent data corruption with automation, open-source tools, and machine learning.
"How to build cool & useful voice commerce applications (such as devices like...Grid Dynamics
In this talk, we describe how to build conversational e-commerce applications for the growing market of voice-powered AI devices using Dialog Flow. This talk demonstrates the capabilities of "Flower Genie," a teaching-oriented chatbot that can recommend a bouquet for any occasion, then take an order and deliver the flower arrangement via Alexa. We present the overall architecture of a voice application developed with Dialog Flow, including dialog management and NLU, and then discuss the finer points of testing and publishing a voice application for Alexa.
"Challenges for AI in Healthcare" - Peter Graven Ph.DGrid Dynamics
Dynamic Talks Portland: The use of AI in many industries has revolutionized operations and efficiency. In healthcare, the progress is just beginning. Despite the promise of AI, why has the development lagged other industries? What issues are unique to healthcare that create challenges for common approaches? How can data scientists overcome these challenges and deliver on the promise of using data to reach multiple goals of improved quality, decreased cost, and greater patient satisfaction?
Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...Grid Dynamics
The global HIV pandemic continues, particularly in Sub-Saharan Africa. By 2025, 40 million people will be living with HIV. The global cost of the pandemic is in the hundreds of billions of dollars. Better treatment means more people are living longer and costs will increase. The use of existing and emerging technologies is rare. Research institutions don’t share data. Data that drive US HIV policy in 2019 are from 2017 because of the time it takes for the CDC and NIH to combine data. The are many opportunities for big data, ML and AI to have a broader and continued impact on the HIV crisis. The use of these technologies can identify new avenues of research and help prioritize and focus efforts. We are starting to see these technologies used more and more. Several case studies will be presented. For example, advances in HIV vaccine research by Dr. David Heckerman; research at UCLA and Georgetown University looking at how social media can be used for tracking and predicting the spread of the epidemic; and work by researchers to improve care utilization in South Carolina. The opportunities for commercial and non-profit ventures to apply existing and emerging technologies like big data, ML and AI are countless. Tech4HIV is an organization working to drive these efforts into the tech sector and provides opportunities and resources for engagement.
Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...Grid Dynamics
Retail industry has gone through a significant transformation in the last decade; caused by the change in consumer behaviour and expectations which have been driven by the convenience and low cost offered by Amazon. There are clear signs of a similar disruption in the financial services industry now, driven by new entrants who are changing consumer expectations. In this talk, Rahul looks at the evolution of retailers and draws conclusions about the path players in the financial services have to take to compete and provide customer experiences that will be necessary to compete.
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...Grid Dynamics
Organizations need to tap into the huge potential of their vast volumes of data, but a use case tactical approach is not going to work. Instead, they need to work in the definition of a data strategy linked to the most relevant goals for the enterprise.
Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...Grid Dynamics
Apache Spark is a general-purpose big data execution engine. You can work with different data sources with the same set of API in both batch and streaming mode. Such flexibility is great if you are experienced Spark developer solving a complicated data engineering problem, which might include ML or streaming. In Airbnb, 95% of all data pipelines are daily batch jobs, which read from Hive tables and write to Hive tables. For such jobs, you would like to trade some flexibility for more extensive functionality around writing to Hive or multiple days processing orchestration. Another advantage of reducing flexibility is creating "best practices", which can be followed by less experienced data engineers. In Airbnb, we've created a framework called "Sputnik," which tries to address these issues. In this talk, I'll show the typical boilerplate code, which Sputnik tries to reduce and concepts it introduces to simplify pipeline development.
The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019Grid Dynamics
For businesses large and small, cloud-based software platforms are breaking down data siloes between teams, adding new capabilities, and boosting efficiency. While not known as early adopters, there's a compelling case for public safety to do the same. In this session, we'll explore how a comprehensive records and evidence platform can simplify operations and streamline data management and access, securely and responsibly. We'll also highlight the role of artificial intelligence, and how it can help agencies respond more quickly and improve outcomes.
Dynamic Talks: "Implementing data quality automation with open source stack" ...Grid Dynamics
The quality of business decisions, machine learning insights, and executive reports depend on the quality and integrity of the underlying data. There are many ways that data can get corrupted in an analytical data platform from de-synchronization with the system-of-record to defects in data pipelines. We will show how to detect and prevent data corruption with automation, open source tools, and machine learning.
"Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav...Grid Dynamics
Dynamic talks Seattle: Artificial Intelligence (AI) and data are foundational to ideation of new business models that bring growth and efficiencies. This is in action in pharmaceutical supply chain for reducing costs, increasing volumes, and optimizing the contracts with suppliers. The implementation involves process and tools that make data usable, and overcome the challenges of culture, ethics, data scalability, compute and engineering, and. Learn about data collection, data management, and metadata management tools implementation and modern data architecture to support them. Discuss machine learning algorithms for growth and efficiency scenarios.
Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...Grid Dynamics
Dynamic Talks Seattle: Patient No-shows and Late Cancelations is estimated to cost around $150B across the nation. Providence St. Joseph Health, Healthcare Intelligence Group has been working on this pain point for the past couple of years and has developed a software solution leveraging a predictive engine to identify high risk patients at risk of No-show and Late Cancellation on a daily basis. More than 25 clinics have been using this solution for about 2 years now to optimize their reminder call strategy and to have a meaningful reduction of No-shows and Late Cancellations. In addition, a call center pilot was designed recently to standardize this process end-end. Pilot results will be evaluated to prove the impact at scale.
Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...Grid Dynamics
In this talk, we will discuss automatic decision-making and AI techniques for customer relationship management. First, we will present a methodology that helps to develop highly automated promotion and loyalty management systems. Next, we will walk through practical examples of how advanced customer and content signals can be generated using predictive models, and how optimization and reinforcement learning techniques can be used for targeting, budgeting, and pricing decisions. This talk is for Data Scientists, Product Owners, and Software Engineers involved in marketing operations or development of marketing automation software and interested in ML-based decision automation techniques.
"ML Services - How do you begin and when do you start scaling?" - Madhura Dud...Grid Dynamics
Dynamic talks SF: So you have heard all the hype around how Machine Learning is going to change the world. But within your business context, where do you start? How do you get leadership buy-in for investment? And how and when you start scaling your ML Services?
In this session, you will walk away with an actionable framework to bootstrap and scale a machine learning services team. You will see the framework in action through an actual 0 to 1 product journey involving deep learning where we delivered value in record speed in-spite of not having a dataset when we started. You will get practical tips on how to make decisions about when and how to scale your capability to scale ML Services and platform. Some of the tips are pretty counterintuitive and revealed themselves with our experience of productizing ML services over the last 5+ years. (Using a diverse range of technologies - Vision, Language, Graph, Anomaly Detection, Search Relevance, Personalization)
Realtime Contextual Product Recommendations…that scale and generate revenue -...Grid Dynamics
Dynamic Talks SF: Recommendation systems are all around us. E-commerce companies like Amazon recommend products we are likely to buy based on our browsing behavior. Netflix suggests what shows we should watch based on our binging habits. Spotify builds a personalized playlist we would enjoy listening to, based on their understanding of what musical genre we are into.
In this talk, we will explore recent advances in the area of product recommendations in both research and practice. We will see how machine learning, design thinking and solid data engineering principles are combined to create an engaging customer experience that positively impacts the bottom line.
We will look at how we use various deep learning architectures to obtain image and text embeddings that supplement user and product based features to generate product recommendations that align closely with a consumer’s aesthetic preferences.
The talk would be of interest to data scientists, data engineers, product managers, UX designers and anyone interested in machine learning.
Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...Grid Dynamics
In this talk, we will discuss automatic decision-making and AI techniques for customer relationship management. First, we will present a methodology that helps to develop highly automated promotion and loyalty management systems. Next, we will walk through practical examples of how predictive models can be used to characterize customer intent, and how optimization and reinforcement learning techniques can be used to build next best action models that incorporate targeting, budgeting, and pricing decisions.
Best practices for enterprise-grade microservices implementations with Google...Grid Dynamics
When migrating to a cloud and microservices architecture, companies need to invest in foundational capabilities, such as a microservices platform, continuous delivery, and an immutable infrastructure. In this talk, we will discuss our experience implementing these capabilities on the enterprise-scale with Google Cloud, Kubernetes, Istio, Envoy, Spinnaker, and Hashicorp stack. We will also discuss best practices of onboarding the cloud to facilitate DevOps, SRE without sacrificing quality or control.
Attribution Modelling 101: Credit Where Credit is Due!: Dynamic talks Seattle...Grid Dynamics
Abstract: In this talk, Scott will go over the basics of Attribution Modelling, its advantages and disadvantages, and strategic ways to implement in the cloud leveraging big data.
Building an algorithmic price management system using ML: Dynamic talks Seatt...Grid Dynamics
In this talk, we will discuss how predictive modeling and reinforcement learning can be used to build advanced price management systems that unlock the potential of dynamic and personalized pricing. We will present price optimization methods for a number of use cases including introductory pricing, promotion calendars, replenishable and seasonal products, targeted offers, and flash sales. We will also review case studies that demonstrate how these methods were applied in practice and how algorithmic price management components were fitted into pricing strategies.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
In-Stream Processing Service Blueprint, Reference architecture for real-time Big Data applications
1. 1 Private and Confidential1 Public
In-Stream Processing Service Blueprint
Reference architecture for real-time Big Data applications
Anton Ovchinnikov, Data Scientist, Grid Dynamics
2. 2
What we’ll talk about today
• What’s In-Stream Processing?
• How it’s used to process huge data streams in real time
• How to build in-stream processing with open source
• All about scale and reliability considerations
• Example of large-scale customer implementation
• Where can you learn more (hint: blog.griddynamics.com)
9. 9
Target performance & reliability SLAs
Throughput Up to 100,000 events per second
Latency 1-60 seconds
Retention Raw data and results archived for 30 days
Reliability Built-in data loss mitigation mechanism in case of faults
Availability 99.999 on commodity cloud infrastructure
12. 12
Every component is scalable in its own way
• No single point of failure
• Automatic failover
• Data replication
13. 13
Multi-node Kafka cluster
• Undisputed modern choice
for real time MOM
• Retention and replay
• Scalable via partitioning
• Persistent
• Super-fast
14. 14
Single Zookeeper cluster for all components
• Distributed coordination
service, facilitates HA
of other clustered services
• Guaranteed
consistent storage
• Client monitoring
• Leader election
15. 15
Spark streaming:
Central component of the platform
Why Spark streaming?
• Leading In-Stream Processing middleware
• Active community
• Rate of adoption
• Vendor support
• Excellent integration with Hadoop eco-
system
Key architectural considerations
• Runs on top of Hadoop
• Out-of-the-box integration with Kafka
• Support for machine learning pipelines
16. 16
Cassandra as operational store
• Massively scalable, highly available
NoSQL database
• Ideal choice as large operational
store (100s of GB) for streaming
applications
• Needed when event processing is
stateful, and the state is quite large
Example: stores user profiles as
they are being updated in real time
from clickstreams
17. 17
REDIS as a lookup database
• Simple, cheap and super-
performant lookup store
• Needed when event processing
requires frequent access to GBs of
reference data
• Can be updated from outside
• Master/slave architecture
Example: IP geo-mapping,
dictionaries, training sets
18. 18
Putting all the pieces together:
end-to-end platform configuration
• No single points of failure
• No bottlenecks
• Scaling or recovering any
component cluster mitigates
availability issues
• Caveat: pathologies do
happen, even in this design –
for example, dynamic
repartitioning is not supported
19. 19
Out of scope
• Dashboards for visualization, monitoring, and reporting
of In-Stream Processing results
• Tools and modeling environments for data scientists
• Logging and monitoring
• Data encryption, in motion or at rest
• Dynamic re-partitioning is not supported and will require down-time
• Disaster recovery
• Infrastructure provisioning and deployment automation
20. 20
Case study:
Large media agency
Business opportunity
• Real-time popularity trends for
the content across all client’s
properties drives audience
coverage with the most
interesting, trending articles
Work done
• Implementation used Amazon
Kinesis & Amazon
EMR/Spark Streaming stack
• Data egress: S3 and Amazon
Redshift
22. 22
Summary
• In-Stream Processing is a hot new technology
• It can process mind-boggling volumes of events in real-time and discover insights
• You can build a whole platform with 100% open source components
• We give you a complete blueprint on how to put it together
• It will run on any public cloud
23. 23
What we didn’t get to talk about today
• Docker, Docker, Docker: how to make auto-deployment and auto-scaling work
• Data scientist’s kitchen: what they are doing when no one is watching
• Cloud sandbox for our In-Stream Processing Blueprint: how to take it for a spin on AWS
• Demo app: see how social analytics is done using the blueprint
• All this, and more: coming up soon in our blog (blog.griddynamics.com)
• Please, subscribe!
24. 24
Thank you!
Anton Ovchinnikov: aovchinnikov@griddynamics.com
Grid Dynamics blog: blog.griddynamics.com
Follow up on twitter: @griddynamics
We are hiring!
https://www.griddynamics.com/careers