How can you improve complex video software when your performance indicators are highly variable? The answer is proper methodology, proper data infrastructure and analysis.
Worldsensing: A Real World Use Case for Flux by Albert Zaragoza, CTO & Head o... InfluxData
Albert Zaragoza, Fuad Mimoun and Daniel Lázaro from Worldsensing will be sharing how their team built an end-to-end IoT solution for cities — from traffic flow management and smart parking to emergency & security response and critical infrastructure monitoring. They will focus this talk on how they used Flux to pull together lots of data sources into their real-time platform to provide alerts to the many constituents of the data.
Presented at: All Things Open RTP Meetup
Presented by: John Hammink, Aiven.io
Abstract:
With mobile devices and emerging IoT connected infrastructure and devices, we’re seeing the amount of generated data explode, while continuing to transform in form and function. With 16.1 zettabytes of data generated in 2016 expected to grow tenfold by 2025, we’ll look at what we believe data pipelines and data-pipeline components will need to be able to achieve in terms of functionality, design, compliance, usability, performance, and scalability to handle this growth.
Postgres Vision 2018: Will Postgres Live Forever? EDB
Bruce Momjian, co-founder of the PostgreSQL Global Development Group, believes that some organizations choose Postgres because it is cost-effective, open source, and flexible. But the database has also reached a point of maturity such that organizations are choosing it because it is the only database that does what they need it to do. Delivered at Postgres Vision 2018, this presentation by Momjian, a Senior Database Architect at EnterpriseDB, outlines the history of Postgres, the major advances, and the reasons why it has taken hold in organizations.
http://PredixTransform.com
How do you securely connect industrial devices to the Cloud? What if you could save a plant millions with a $250 thermal camera? This is what our team wanted to find out. We sent a member of the team to a power plant to capture thermal images of the site and then created a Predix-based MATLAB/Python algorithm to identify potential issues.
PCF1: Cloud Foundry Diego (Predix Transform 2016) Predix
http://PredixTransform.com
Get ahead of the curve by knowing what's in Cloud Foundry Diego. We'll cover architecture, DEA and Diego, use cases, and a demo of Diego runtime with Docker.
Presentation of the paper by M Ponce de Leon, W. Yao, M. Angel Diaz, Scenarios Designed for the Verification of Mobile IPv6 Enabling Technologies, Federation of Telecommunications Engineers of the European Community (FITCE) Congress 2007, Aug. 2007.
Accelerate Your C/C++ Applications with Amazon EC2 F1 Instances (CMP405) - AW... Amazon Web Services
Amazon EC2 F1 OpenCL development workflow helps software developers with little to no FPGA experience to supercharge their applications with Amazon EC2 F1. Join us for an overview and demonstration of accelerating your C/C++ applications in the cloud using OpenCL with Amazon EC2 F1 instances. In this workshop, we walk through the development flow for creating a custom hardware acceleration for a software algorithm. Attendees get hands-on and creative by optimizing an algorithm for maximum acceleration on Amazon EC2 F1 instances. All attendees must bring their own laptop (Windows, macOS, and Linux all supported). Tablets are not appropriate. We also recommend having the current version of Chrome or Firefox installed.
Frank Brockners' presentation from the 2017 Open Networking Summit.
While troubleshooting or planning, did you ever wish to get full insight into which paths *all* your packets take in your network, understand whether your SLA is really in place, or were you ever asked to prove that your traffic really follows the path you specified by service chaining or traffic engineering? We approach this problem by adding meta-data to *all* packets. In-band OAM adds forwarding path information and other information/stats to every data packet - as opposed to relying on probe packets, which is the traditional method that tools like ping or traceroute use. This session will introduce In-band OAM, explain the technology and outline the reference implementation in FD.io/VPP and OpenDaylight using example demos.
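As a rough illustration of the idea in this abstract, the sketch below models a packet that carries its own per-hop trace, so the path can be read from the data packet itself rather than inferred from probes. It is a toy model only: the `Packet` class, the `forward`, `path_of`, and `follows_path` helpers, and the node names are all hypothetical, and nothing here reflects the actual In-band OAM header layout or the FD.io/VPP and OpenDaylight implementations.

```python
from dataclasses import dataclass, field


@dataclass
class Packet:
    payload: bytes
    # Hypothetical in-band trace: one (node_id, timestamp_ns) record per hop.
    trace: list = field(default_factory=list)


def forward(packet, node_id, timestamp_ns):
    """Simulate one hop: the node appends its own record to the packet."""
    packet.trace.append((node_id, timestamp_ns))
    return packet


def path_of(packet):
    """Return the ordered list of nodes this packet actually traversed."""
    return [node_id for node_id, _ in packet.trace]


def follows_path(packet, expected):
    """Check the recorded path against the path the operator intended."""
    return path_of(packet) == list(expected)


# Illustrative hop names and timestamps (assumptions, not real devices).
pkt = Packet(payload=b"hello")
for hop, ts in [("edge-1", 100), ("fw-1", 250), ("core-2", 400)]:
    forward(pkt, hop, ts)
```

Because every data packet carries its trace, a check such as `follows_path(pkt, ["edge-1", "fw-1", "core-2"])` answers the "did my traffic take the path I specified" question for real traffic, not for a separate probe.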
Bandwidth Requirements for Delivering UltraHD TV Sckipio
The bandwidth requirements for service providers to deliver UltraHD TV are very high. This presentation describes the requirements and the solution - G.fast.
Replaying KStreams Apps Using State Snapshots (Nishchay Sinha & Yan Wang, Blo... confluent
Are you a Kafka Streams application developer who needs a faster, more efficient way to reproduce a bug or issue from past events? Do you need to test a new algorithm patch near a discrete point-in-time? Do you have a standard methodology for investigating these issues, or does each team member devise an ad hoc solution? How much time does your simulation take?
It is technically impossible to solve this challenge using only changelog topics, since a compact, topic-based changelog doesn't capture the full change history. The Derivatives Data team at Bloomberg augments this through periodic snapshots. To make the snapshotted state accessible to our replay system, we built a query service that leverages Interactive Queries using the Kafka Streams API, along with gRPC-based coordination, to fetch the distributed snapshot states from different snapshotting instances.
In this talk, we will cover:
● Overview of this system architecture
● Deep dive into the mechanics of snapshots with Kafka Streams, state-store changelogs, and a query service to serve replay requests
● How two modes, normal and replay, are used on the Kafka Streams application runtime
● Some use cases that benefit from this replay system
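The point about compacted changelogs can be made concrete with a small sketch. It uses plain Python lists and dictionaries rather than the Kafka Streams API, and it omits Interactive Queries and gRPC coordination entirely; the record layout, the function names, and the ticker keys are assumptions made for illustration. It shows that compaction keeps only the latest value per key, while periodic snapshots let a replay start from the state near a chosen point in time.

```python
def compact(changelog):
    """Keep only the latest (offset, value) per key, as compaction eventually does."""
    latest = {}
    for offset, key, value in changelog:
        latest[key] = (offset, value)
    return latest


def take_snapshots(changelog, every):
    """Materialize a copy of the full state after every `every` records."""
    state, snapshots = {}, {}
    for offset, key, value in changelog:
        state[key] = value
        if (offset + 1) % every == 0:
            snapshots[offset] = dict(state)
    return snapshots


def state_at(snapshots, target_offset):
    """Return the newest snapshot taken at or before `target_offset`."""
    eligible = [offset for offset in snapshots if offset <= target_offset]
    if not eligible:
        return {}
    return snapshots[max(eligible)]


# Hypothetical changelog records: (offset, key, value).
changelog = [
    (0, "AAPL", 10),
    (1, "MSFT", 20),
    (2, "AAPL", 11),
    (3, "AAPL", 12),
    (4, "MSFT", 21),
    (5, "AAPL", 13),
]

compacted = compact(changelog)
snapshots = take_snapshots(changelog, every=2)
```

After compaction only offsets 4 and 5 survive, so the state as of offset 3 cannot be rebuilt from the changelog alone, whereas `state_at(snapshots, 3)` still recovers it. A replay would then apply later records on top of that snapshot.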
Bringing Streaming Data To The Masses: Lowering The “Cost Of Admission” For Y... confluent
(Bob Lehmann, Bayer) Kafka Summit SF 2018
You’ve built your streaming data platform. The early adopters are “all in” and have developed producers, consumers and stream processing apps for a number of use cases. A large percentage of the enterprise, however, has expressed interest but hasn’t made the leap. Why?
In 2014, Bayer Crop Science (formerly Monsanto) adopted a cloud-first strategy and started a multi-year transition to the cloud. A Kafka-based cross-datacenter DataHub was created to facilitate this migration and to drive the shift to real-time stream processing. The DataHub has seen strong enterprise adoption and supports a myriad of use cases. Data is ingested from a wide variety of sources and the data can move effortlessly between an on-premises datacenter, AWS and Google Cloud. The DataHub has evolved continuously over time to meet the current and anticipated needs of our internal customers. The “cost of admission” for the platform has been lowered dramatically over time via our DataHub Portal and technologies such as Kafka Connect, Kubernetes and Presto. Most operations are now self-service, onboarding of new data sources is relatively painless and stream processing via KSQL and other technologies is being incorporated into the core DataHub platform.
In this talk, Bob Lehmann will describe the origins and evolution of the Enterprise DataHub with an emphasis on steps that were taken to drive user adoption. Bob will also talk about integrations between the DataHub and other key data platforms at Bayer, lessons learned and the future direction for streaming data and stream processing at Bayer.
FPGAs as Components in Heterogeneous HPC Systems (paraFPGA 2015 keynote) Wim Vanderbauwhede
Keynote I gave at the ParCo conference (http://www.parco2015.org) workshop paraFPGA in Edinburgh, Sept 2015, on the need to raise the abstraction level for programming of heterogeneous systems.
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to... Datacratic
Join us for a Virtual RTBkit Meetup to kick off 2015 with knowledge, best practices and tips from the RTBkit developer community.
Agenda:
- RTBkit Developer Speaker Spotlight with Nicolas Emiliani - RTB Technical Lead, Motrixi
- RTBkit Behind the Scenes Update
- Intro to the RTBkit Stack
- Open Q&A
Join us for this virtual meetup from the comfort of your home or office. Virtual seats are limited so register early to claim your spot.
Gain Deep Visibility into APIs and Integrations with Anypoint Monitoring InfluxData
On average, a business supporting digital transactions now crosses 35 backend systems—and legacy tools haven’t been able to keep up. This session will cover how MuleSoft uses InfluxCloud to help power their monitoring and diagnostic solutions as well as provide end-to-end actionable visibility to APIs and integrations to help customers identify and resolve issues quickly.
Curing the Kafka Blindness – Streams Messaging Manager DataWorks Summit
Companies who use Kafka today struggle with monitoring and managing Kafka clusters. Kafka is a key backbone of IoT streaming analytics applications. The challenge is understanding what is going on overall in the Kafka cluster including performance, issues and message flows. No open source tool caters to the needs of different users that work with Kafka: DevOps/developers, platform team, and security/governance teams. See how the new Hortonworks Streams Messaging Manager enables users to visualize their entire Kafka environment end-to-end and simplifies Kafka operations.
In this session learn how SMM visualizes the intricate details of how Apache Kafka functions in real time while simultaneously surfacing every nuance of tuning, optimizing, and measuring input and output. SMM will assist users to quickly understand and operate Kafka while providing the much-needed transparency that sophisticated and experienced users need to avoid all the pitfalls of running a Kafka cluster.
Speaker: Andrew Psaltis, Principal Solution Engineer, Hortonworks
Stay up-to-date on the latest news, events and resources for the OpenACC community. This month’s highlights cover the upcoming NVIDIA GTC 2019, a complete schedule of GPU hackathons and more!
TeleHealth Platform: DevOps-Based Progressive Delivery
The talk covers real-life experience building a DevOps-driven AI platform for a doctors’ community and telehealth support for patients during the COVID-19 lockdown. Doctors in the community discuss cases and triage for different patients, and they can extend telehealth support using medical practice management solutions. Patients can order medicines online through pharmacies integrated with the platform. The AI platform has digital, voice, and knowledge assistants that provide information to the doctor. DevOps is enabled using Jenkins on AWS, which supports continuous integration and progressive delivery of features to mobile and web apps (Apple and Google app stores). Historical data is used for predictive analytics by the machine learning platform. The platform helps healthcare enterprises:
1. Deploy voice tech to facilitate clinical documentation
2. Reduce physicians’ administrative burden
3. Increase patient volume and billable revenue
4. Eliminate transcription costs
5. Use voice to increase touchpoints and patient engagement
(Joseph deBuzna + Zulfikar Quereshi, HVR) Kafka Summit SF 2018
This presentation is a customer story about France-based regional airline HOP! and their need to make better use of data that was contained in various applications. They also needed this information to be available in real time. As one can imagine, airlines manage a wide variety of information such as weather, customer information, flight plans, sensor data from planes and much more.
In this presentation, Joe will discuss how HOP! was delivering their data before and the limitations associated with delivering this data. Joe will then talk about HOP!’s selection of Kafka and HVR as a solution to enabling data availability and real-time information for analysis and action.
In this session, attendees will learn:
- How Kafka was selected as a solution for HOP!’s complex challenges
- Architecture and capabilities implemented that enabled data feeding from multiple sources to Kafka
- Considerations and challenges with this approach
- Business results and future plans
Advanced technologies and techniques for debugging HPC applications Rogue Wave Software
Presented at Supercomputing 18. Debugging and analyzing today’s complex HPC applications requires a tool with the capabilities and features to match their demands. Debugging tools must be able to handle the extensive use of C++ templates and the STL, many shared libraries, optimized code, code leveraging GPU accelerators, and applications constructed with multiple languages.
This presentation walks through the different advanced technologies provided by the debugger, TotalView for HPC, and shows how they can be used to easily understand complex code and quickly solve difficult problems. Showcasing TotalView’s new user interface, you will learn how to leverage the amazing technology of reverse debugging to replay how your program ran. You will also see how TotalView provides a unified view across applications that utilize Python and C++, debug CUDA applications, find memory leaks in your HPC codes and other powerful techniques for improving the quality of your code.
Bitmovin LIVE Tech Talks: Analytics for Workflow Automation (ft. Touchstream ... Bitmovin Inc
As part of Bitmovin's NAB 2020 Virtual event series, we were joined by live video monitoring solutions provider Touchstream Media and had the chance to discuss how live-streaming organizations (such as sports broadcasters) should automate analytics and data to best improve their video workflows.
View our on-demand discussion featuring case studies from a few major sports broadcasters: https://go.bitmovin.com/techtalk-live-analytics-automation-touchstream?utm_source=slideshare
Industrial Internet of Things: Protocols and Standards Javier Povedano
Presentation for the Distributed Systems Master at the University of Cordoba (Spain). In this presentation we review the state of the art in communication middleware for the Industrial Internet of Things.
Apache Geode (incubating) is the core of Pivotal Gemfire, now available as an open source project governed by the Apache Software Foundation Incubator. The legacy of Pivotal Gemfire and the ASF community uniquely position Geode as a secret ingredient for modern-day data management architectures.
These types of architectures require a robust in-memory data grid solution to handle a variety of use cases, ranging from enterprise-wide caching to real-time transactional applications at scale. In addition, as growth in memory size and network bandwidth continues to outpace that of disk, the importance of managing large pools of RAM at scale increases. It is essential to innovate at the same pace.
Apache Geode (incubating) has all the right ingredients to do for RAM what HDFS has done for direct attach disks. The excitement (and funding!) in this area of the big data ecosystem is palpable, and the ASF is the place where the innovation is happening. Come to this session to understand: a brief history of Geode, architecture and use cases, design philosophy and principles, but most importantly: how you too can participate in the in-memory data center revolution.
Scale Your Load Balancer from 0 to 1 million TPS on Azure Avi Networks
For years, enterprises have relied on appliance-based (hardware or virtual) load balancers. Unfortunately, these legacy ADCs are inflexible at scale, costly due to overprovisioning for peak traffic, and slow to respond to changes or security incidents.
These problems are amplified as applications migrate to the cloud. In contrast, the Avi Vantage Platform not only elastically scales up and down based on real-time traffic patterns, but also offers ludicrous scale at a fraction of the cost.
Watch this webinar to see how Avi can scale up and down quickly on the Microsoft Azure Cloud.
- Configure load balancing on Azure to scale up from 0 to 1 million transactions per second (TPS) and back down in under 10 minutes
- Learn why hardware or virtual appliances are not an option for modern load balancing in public clouds
- Understand how Avi’s elastic scale dramatically lowers TCO and enhances security, including protection against DDoS attacks
Watch the full webinar: https://info.avinetworks.com/webinars-ludicrous-scale-on-azure
From Disaster to Recovery: Preparing Your IT for the Unexpected DataCore Software
Did you know that 22% of data center outages are caused by human error? Or that 10% are caused by weather incidents?
The impact of an unexpected outage for just a few hours or even days could be catastrophic to your business.
How would you like to minimize or even eliminate these business interruptions, and more?
Join us to discover:
• Simple, useful measures that can help you keep the lights on
• How to quickly recover when the worst-case scenario occurs
• How to achieve zero downtime and high availability
Intro to GitOps with Weave GitOps, Flagger and Linkerd Weaveworks
You may not think of "GitOps" and "service mesh" together – but maybe you should! These two wildly different technologies are each enormously capable independently, and combined they deliver far more than the sum of their parts: a single Git commit can control workflows customized for your exact situation by taking advantage of the service mesh's ability to measure and manipulate traffic anywhere in your application's call graph, and you can rest easy knowing that Git is preserving the complete configuration for your entire application every step of the way.
See how these technologies can work together to tackle complex problems in cloud-native applications.
What you’ll get out of this:
* Understand what GitOps and service meshes can - and can't - do for you.
* Understand basic operations with GitOps and Linkerd.
* Understand the basics of continuous deployment with Weave GitOps and Linkerd.
Observability for Modern Applications (CON306-R1) - AWS re:Invent 2018 Amazon Web Services
In modern, microservices-based applications, it’s critical to have end-to-end observability of each microservice and the communications between them in order to quickly identify and debug issues. In this session, we cover the techniques and tools to achieve consistent, full-application observability, including monitoring, tracing, logging, and service mesh.
Unlocking the Power of IoT: A comprehensive approach to real-time insights confluent
In today's data-driven world, the Internet of Things (IoT) is revolutionizing industries and unlocking new possibilities. Join Data Reply, Confluent, and Imply as we unveil a comprehensive solution for IoT that harnesses the power of real-time insights.
An Introduction to Apache Geode (incubating) Anthony Baker
Geode is a data management platform that provides real-time, consistent access to data-intensive applications throughout widely distributed cloud architectures.
Geode pools memory (along with CPU, network and optionally local disk) across multiple processes to manage application objects and behavior. It uses dynamic replication and data partitioning techniques for high availability, improved performance, scalability, and fault tolerance. Geode is both a distributed data container and an in-memory data management system providing reliable asynchronous event notifications and guaranteed message delivery.
Pivotal GemFire has had a long and winding journey, starting in 2002, passing through VMware and Pivotal, and finding its way to Apache in 2015. Companies using GemFire have deployed it in some of the most mission-critical, latency-sensitive applications in their enterprises, making sure tickets are purchased in a timely fashion, hotel rooms are booked, trades are made, and credit card transactions are cleared. This presentation discusses:
- A brief history of GemFire
- Architecture and use cases
- Why we are taking GemFire Open Source
- Design philosophy and principles
But most importantly: how you can join this exciting community to work on a bleeding-edge in-memory platform.
Pivotal - Advanced Analytics for Telecommunications Hortonworks
Innovative mobile operators need to mine the vast troves of unstructured data now available to them to help develop compelling customer experiences and uncover new revenue opportunities. In this webinar, you’ll learn how HDB’s in-database analytics enable advanced use cases in network operations, customer care, and marketing for better customer experience. Join us, and get started on your advanced analytics journey today!
Cheryl Wiebe - Advanced Analytics in the Industrial World Rehgan Avon
2018 Women in Analytics Conference
https://www.womeninanalytics.org/
Cheryl will talk about her consulting practice in Industrial Solutions: analytic solutions for industrial IoT-enabled businesses, including connected factory, connected supply chain, smart mobility, and connected assets. Her path to this practice has bounced between hands-on systems development, IT strategy, business process reengineering, supply chain analytics, manufacturing quality analytics, and now industrial IoT analytics. She worked in industry as a developer and as a management consultant, and started and sold a company, before settling in to pursue this topic as a career analytics consultant. Cheryl will shed light on what's happening in industrial companies struggling to make the transition to digital, what that means, and what barriers they face. She'll touch on how and where artificial intelligence, deep learning, and machine learning technologies are being used most effectively in industrial companies, and on the unique challenges those companies face. Reflecting on what's changed over the years, and on her journey witnessing it, Cheryl will share ideas she considers important for women (and men) pursuing an analytics career successfully and meaningfully.
Doing Cloud Right! Five Keys to Becoming an Elite DevOps Performer DevOps.com
The recently published results from the 2018 DORA State of DevOps Report show that “doing cloud right” is the largest predictor of DevOps success. It’s not enough to just move to the cloud. Instead, organizations must adopt the 5 Key Cloud Practices of on-demand self-service, broad access, resource pooling, elasticity, and measured service to be truly successful. The result? Organizations that do are 23 times more likely to be elite performers!
"Industrial Internet IoT bootcamp" meetup, 11-5-2015 hosted by GE Digital at HackerDojo. Discussing topics ranging from IoT architecture to connectivity and protocols, cyber security, data science and industrial UX design.
Similar to Charles sonigo - Demuxed 2018 - How to be data-driven when you aren't Netflix (or even if you are) (20)
Adjusting primitives for graph : SHORT REPORT / NOTES Subhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ... Subhajit Sahu
Abstract: Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation of ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It does, however, come with the precondition that the input graph has no dead ends. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. The slowdown on the GPU is likely caused by the submission of a large number of small workloads, and is expected to be a non-issue when the computation is performed on massive graphs.
Techniques to optimize the PageRank algorithm usually fall into two categories. One tries to reduce the work per iteration; the other tries to reduce the number of iterations. These goals are often at odds with one another. Skipping computation on vertices that have already converged can save iteration time. Skipping in-identical vertices (vertices with the same in-links) helps avoid duplicate computation and could also reduce iteration time. Road networks often have chains that can be short-circuited before PageRank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time and the number of iterations. If a graph has no dangling nodes, the PageRank of each strongly connected component can be computed in topological order. This could help reduce the iteration time and the number of iterations, and also enable multi-iteration concurrency in PageRank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
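As a point of reference for the two cost axes named above (work per iteration and number of iterations), here is a minimal sketch of plain, unoptimized PageRank by power iteration. It is not the STICD algorithm or any code from the report; the function name `pagerank`, the adjacency-list input format, and the default damping and tolerance values are assumptions made for illustration. Like the levelwise method, it assumes the graph has no dead ends.

```python
def pagerank(out_links, damping=0.85, tol=1e-10, max_iter=500):
    """Plain pull-based PageRank (illustrative sketch).

    out_links[u] lists the vertices that u links to. Every vertex is
    assumed to have at least one out-link (no dead ends).
    Returns (ranks, iterations_used).
    """
    n = len(out_links)
    # Build in-link lists once so each iteration can pull contributions.
    in_links = [[] for _ in range(n)]
    for u, targets in enumerate(out_links):
        if not targets:
            raise ValueError("dead end at vertex %d; this sketch assumes none" % u)
        for v in targets:
            in_links[v].append(u)

    ranks = [1.0 / n] * n
    iterations = 0
    while iterations < max_iter:
        iterations += 1
        # Work per iteration: every vertex is recomputed, converged or not.
        new_ranks = [
            (1.0 - damping) / n
            + damping * sum(ranks[u] / len(out_links[u]) for u in in_links[v])
            for v in range(n)
        ]
        # L1 change between successive rank vectors drives the stopping rule.
        error = sum(abs(a - b) for a, b in zip(new_ranks, ranks))
        ranks = new_ranks
        if error < tol:
            break
    return ranks, iterations


if __name__ == "__main__":
    # Hypothetical 3-vertex graph: 0 -> 1, 1 -> 2, 2 -> 0 and 2 -> 1.
    ranks, iterations = pagerank([[1], [2], [0, 1]])
    print(ranks, iterations)
```

Every vertex is recomputed in every iteration here, which is the cost that skipping converged or in-identical vertices aims to cut, while the returned iteration count is what the component-ordering techniques aim to reduce.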
Opendatabay - Open Data Marketplace.pptx Opendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay's AI-driven features streamline the data workflow. Finding the data you need shouldn't be complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with dedicated, AI-generated synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits, Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. The marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23... John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
27. 201: Chi-Squared Test
Test dependency of two variables: Label vs. Metric X
● If X and Label are truly independent (null hypothesis), the distribution of the Chi-Squared statistic is known.
● Comparing the observed Chi-Squared statistic with the theoretical distribution gives us the probability of seeing a difference this large if it were just noise.
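The test described on this slide can be sketched for the simplest case, a 2x2 table of Label (for example, two configurations) against a binary Metric X. This is an illustrative sketch, not the presenters' code: the function name `chi_squared_2x2` and the counts are hypothetical, and the p-value formula used is specific to one degree of freedom.

```python
import math


def chi_squared_2x2(table):
    """Pearson chi-squared test of independence for a 2x2 table.

    table is [[a, b], [c, d]]: rows are the two labels, columns are the
    two outcomes of metric X. Assumes every row and column total is
    non-zero. Returns (statistic, p_value).
    """
    (a, b), (c, d) = table
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    total = a + b + c + d

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if label and metric were independent.
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected

    # With 1 degree of freedom, P(chi2 > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


if __name__ == "__main__":
    # Hypothetical counts: label A vs. label B, metric outcome yes vs. no.
    statistic, p_value = chi_squared_2x2([[30, 10], [10, 30]])
    print(statistic, p_value)
```

A small p-value means a difference this large would rarely arise from noise alone if Label and X were independent. For tables larger than 2x2, the degrees of freedom change and a library routine is the safer choice.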
32. 5 What’s Next for us?
● More statistical significance visuals
● User clusters based on how they interact with our tech
● ML for client configuration optimization