The document describes the Cassandra platform, which is currently in alpha version. The platform allows users to create library-based scenarios using pre-existing components, run simulations, and train models using measurement data. It also has modules for estimating consumer response to incentives and identifying consumer social networks. The platform is open source and its development is ongoing. The next steps include a beta release in October 2013 and integrating external modules. The presenter welcomes feedback to improve the platform.
On June 11th, 2013 we hosted the First CASSANDRA Webinar for NOI Members, titled:
"Modeling consumer behaviour from the ground up – A presentation of the CASSANDRA platform"
During the Webinar, the platform was explained and demonstrated, and participants had the opportunity to ask questions and discuss the early development of the platform.
The presenters and their topics were as follows:
1. Introduction
- By Jessica Stromback, Chairman, VaasaETT
2. Concept and theoretical aspects of CASSANDRA
- By Giorgos Andreou, Lecturer at the Power Systems Laboratory, Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki
3. Explanation and demonstration of CASSANDRA Modeling Platform
- By Christos Diou, Postdoctoral Research Associate at the Centre for Research and Technology Hellas (ITI - CERTH)
We hope you enjoy the presenters' slides and we look forward to your continued feedback!
Sincerely,
The CASSANDRA Team
2. Cassandra platform – Christos Diou
Outline
• Library-based scenarios in the alpha platform version
– Use pre-existing library components
• Measurement-based scenarios
– Model training to build models automatically
• Response models
– Consumer response to different incentives
• Consumer Social Network analysis
– Grouping of small-scale consumers into Consumer Social Networks
• Development status and next steps
• NoI members and the platform
Cassandra 1st Webinar 2
3. About the platform
• Currently in alpha version, development is highly active
– Some functionality has not been integrated yet
• Open source platform, publicly available through GitHub
– Apache license
• Your feedback is highly appreciated!
20. What if…
• The residents of Apartment 1 use the water heater less
– Because they have installed solar water heating
• The residents of Apartment 2 use their air conditioning unit less
– Because they have upgraded their apartment’s thermal insulation properties
• The residents of Apartment 3 replace their refrigeration unit
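A what-if comparison of this kind can be sketched in a few lines of Python. This is only an illustration of the idea, not the platform's own API: the `Appliance` class, the `modify` helper, and every power and usage figure below are hypothetical values chosen for the example.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Appliance:
    name: str
    power_w: float        # average power while running (W), illustrative value
    hours_per_day: float  # average daily running time (h), illustrative value

    def daily_kwh(self) -> float:
        return self.power_w * self.hours_per_day / 1000.0


def daily_kwh(apartments):
    """Daily energy per apartment, in kWh."""
    return {apt: sum(a.daily_kwh() for a in apps) for apt, apps in apartments.items()}


def modify(apps, name, **changes):
    """Return a copy of the appliance list with one appliance changed."""
    return [replace(a, **changes) if a.name == name else a for a in apps]


# Baseline scenario built from (hypothetical) library components.
baseline = {
    "apartment_1": [Appliance("water_heater", 3000.0, 2.0)],
    "apartment_2": [Appliance("air_conditioner", 1500.0, 4.0)],
    "apartment_3": [Appliance("refrigerator", 150.0, 8.0)],
}

# The three hypotheses from the slide, expressed as changes to the baseline.
what_if = {
    "apartment_1": modify(baseline["apartment_1"], "water_heater", hours_per_day=0.5),
    "apartment_2": modify(baseline["apartment_2"], "air_conditioner", hours_per_day=2.0),
    "apartment_3": modify(baseline["apartment_3"], "refrigerator", power_w=100.0),
}

before = daily_kwh(baseline)
after = daily_kwh(what_if)
savings = {apt: before[apt] - after[apt] for apt in before}
```

Keeping the baseline immutable and deriving each hypothesis from it makes the two scenarios directly comparable, which is the point of benchmarking in simulation before a real-life test.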
30. One step further
• Library-based scenarios can be used to simulate working hypotheses
• The real power of Cassandra is in its disaggregation and model training functionality
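The slides do not describe the training algorithm itself. As a rough illustration of what "training a model from measurements" can mean, the sketch below estimates, for one appliance, the probability that it is switched on in each hour of the day from a list of observed start times. The function name, the smoothing parameter, and the sample data are all assumptions made for this example.

```python
from collections import Counter


def hourly_start_distribution(start_hours, smoothing=1.0):
    """Estimate P(start in hour h) for h = 0..23 from observed start hours.

    Additive smoothing keeps hours with no observations at a small
    non-zero probability.
    """
    counts = Counter(h % 24 for h in start_hours)
    total = len(start_hours) + 24 * smoothing
    return [(counts.get(h, 0) + smoothing) / total for h in range(24)]


# Hypothetical start times (hour of day) recovered from measurement data.
observed = [7, 7, 8, 19, 20, 20, 20, 21]

dist = hourly_start_distribution(observed)
most_likely = max(range(24), key=lambda h: dist[h])
```

A distribution like this could then drive a simulation in place of a hand-picked library component, which is the difference between a library-based and a measurement-based scenario.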
51. CSNs (Consumer Social Networks)
• Groups of similar consumers
– Multiple similarity criteria
• CSNs have potential:
– Increased market power of aggregated small-scale consumers
– Coordination of consumption activities at group level
– Targeted incentives at group level
52. CSN module
• CSN module: A tool for identifying links and grouping of consumers in a meaningful way
– Existing social network connections
– Explicit attributes (e.g. working, non-working person, locality in the grid topology)
– Implicit attributes (e.g. consumption similarity, peak similarity, behavioural similarity)
• Early version implemented for experimentation
• Next version:
– More similarity criteria
– Estimation of group response to incentives
– GUI integration
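Grouping by an implicit attribute such as consumption similarity can be sketched as follows. This is a minimal illustration, not the CSN module's implementation: it assumes Pearson correlation as the similarity measure, links two consumers when their similarity reaches a hypothetical `min_edge_weight` threshold, and treats each connected component of the resulting graph as one group. The consumer names and load profiles are invented.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if one is flat)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def group_consumers(profiles, min_edge_weight=0.8):
    """Link consumers whose similarity >= min_edge_weight, return components."""
    neighbours = {name: set() for name in profiles}
    for a, b in combinations(profiles, 2):
        if pearson(profiles[a], profiles[b]) >= min_edge_weight:
            neighbours[a].add(b)
            neighbours[b].add(a)

    groups, seen = [], set()
    for start in profiles:
        if start in seen:
            continue
        stack, group = [start], set()
        while stack:  # depth-first walk over one connected component
            node = stack.pop()
            if node in group:
                continue
            group.add(node)
            stack.extend(neighbours[node] - group)
        seen |= group
        groups.append(sorted(group))
    return groups


# Hypothetical consumption profiles (kWh per time slot).
profiles = {
    "A": [0.2, 0.3, 1.5, 1.8],  # evening peak
    "B": [0.1, 0.4, 1.4, 2.0],  # evening peak
    "C": [1.6, 1.2, 0.3, 0.2],  # morning peak
    "D": [1.9, 1.0, 0.4, 0.1],  # morning peak
}

groups = group_consumers(profiles, min_edge_weight=0.8)
```

Raising the threshold splits the consumers into smaller, more homogeneous groups; lowering it merges them. Other similarity criteria from the list above could replace `pearson` without changing the grouping step.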
67. Various statistics are available, as well as the consumption and information of selected nodes
Sliders for minimum edge weight and clustering values
68. In summary, with Cassandra you can
• Simulate working scenarios/pilots
• Benchmark different energy efficiency solutions/products in simulation before testing them in real life
• Create detailed models that describe consumer behaviour
• Identify and evaluate optimal consumption schedules
• Estimate consumer response to a range of incentives
– Pricing schemes
– Consumer awareness
– Environmental impact
• Identify meaningful consumer groups and benchmark the application of targeted incentives
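The slides do not specify how consumer response is modelled. One common textbook simplification, used here purely as an assumption, is a constant price elasticity: demand in each time slot is scaled by the ratio of the new price to the reference price, raised to the elasticity. The function name, the elasticity value, and the prices below are illustrative only.

```python
def respond_to_prices(demand_kwh, ref_prices, new_prices, elasticity=-0.2):
    """Scale each slot's demand by (new_price / ref_price) ** elasticity.

    A negative elasticity means demand falls when the price rises.
    """
    return [
        d * (p_new / p_ref) ** elasticity
        for d, p_ref, p_new in zip(demand_kwh, ref_prices, new_prices)
    ]


# Hypothetical two-slot day: off-peak and peak.
demand = [1.0, 2.0]       # kWh per slot under the flat tariff
flat = [0.10, 0.10]       # reference price per kWh
time_of_use = [0.10, 0.20]  # peak price doubled

responded = respond_to_prices(demand, flat, time_of_use)
peak_reduction = demand[1] - responded[1]
```

Running the same function over several candidate tariffs gives a simple way to compare pricing schemes side by side, in the spirit of the benchmarking described in this summary.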
69. So, what’s next?
• Beta release is planned for October 2013
• Integration of external modules with the platform
• Evaluation of Cassandra in our three project pilot cases
• Evaluation of Cassandra in a limited number of NoI pilots (external evaluation)
• We can use Cassandra to simulate your context and benchmark a number of working scenarios.
– Use measurements and the training module to model your context
– Simulate a range of scenarios appropriate for your business case
– Prepare an analysis/report of our findings
– Ask for your feedback
– Improve the platform based on your comments