Big data projects often fail to deliver on the promise of working out of the box, for any data, with regular DBAs. Extracting data from legacy systems into big data systems can mean losing the original systems' controls and governance around security, access, and metadata. It is important to have the hoses securely attached before opening the fire hydrant: map security and controls and develop a shared vocabulary before combining disparate data sources or implementing complex big data technologies.
DataEngConf SF16 - Three lessons learned from building a production machine l... – Hakka Labs
This document discusses three lessons learned from building machine learning systems at Stripe.
1. Don't treat models as black boxes. Early on, Stripe focused only on training with more data and features without understanding algorithms, results, or deeper reasons behind results. This led to overfitting. Introspecting models using "score reasons" helped debug issues.
2. Have a plan for counterfactual evaluation before production. Stripe's validation results did not predict poor production performance because the environment changed. Counterfactual evaluation using A/B testing with probabilistic reversals of block decisions allows estimating true precision and recall.
3. Invest in production monitoring of models. Monitoring inputs, outputs, action rates, score
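Lesson 2's "probabilistic reversals of block decisions" can be made concrete: let a small random fraction of would-be blocks through, log each charge's propensity of being allowed, and use inverse-propensity weighting to estimate precision over everything the model would have blocked. A minimal sketch under invented assumptions (the threshold, reversal rate, and function names are illustrative, not Stripe's actual system):

```python
import random

THRESHOLD = 0.8       # hypothetical fraud-score block threshold
REVERSAL_PROB = 0.05  # fraction of would-be blocks allowed through anyway

def decide(score):
    """Return the action taken and the probability that this charge
    was allowed through (its propensity), which gets logged."""
    if score < THRESHOLD:
        return "allow", 1.0
    # The model wants to block; probabilistically reverse the decision
    # so the true outcome (fraud or not) stays observable.
    action = "allow" if random.random() < REVERSAL_PROB else "block"
    return action, REVERSAL_PROB

def estimated_precision(outcomes):
    """outcomes: (propensity, was_fraud) pairs for the reversed blocks
    we got to observe. Inverse-propensity weighting estimates the fraud
    rate (precision) over the full population the model would block."""
    total = sum(1.0 / p for p, _ in outcomes)
    fraud = sum(f / p for p, f in outcomes)
    return fraud / total if total else 0.0

# Hypothetical observed outcomes: 4 of 5 reversed charges were fraud.
sample = [(0.05, 1), (0.05, 1), (0.05, 0), (0.05, 1), (0.05, 1)]
print(round(estimated_precision(sample), 2))  # 0.8
```

With equal propensities the estimate reduces to the plain fraud rate among reversed charges; unequal propensities (e.g. score-dependent reversal rates) are where the weighting matters.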
From Data Analytics to Fast Data Intelligence – Trieu Nguyen
1) How to understand users with Data Analytics?
2) How to build a Real-time Music Recommender System from a Data Stream?
3) How to boost profit with Cross-Sale in Real-time?
Key Ideas to build a Fast Data Intelligence Platform from Open Source Tools:
+ Apache Kafka
+ Apache Spark
+ RFX framework
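The Kafka-to-Spark pattern behind such a platform boils down to aggregating an event stream in time windows. A stdlib-only sketch of that core aggregation (the event data, window size, and names are invented for illustration; a real deployment would consume a Kafka topic and run the aggregation as a Spark job):

```python
from collections import Counter

# Simulated play events standing in for a Kafka topic:
# (timestamp_sec, song_id). All values are illustrative.
events = [
    (0, "song-a"), (1, "song-b"), (2, "song-a"),
    (11, "song-a"), (12, "song-c"), (19, "song-a"),
]

WINDOW = 10  # tumbling 10-second windows

def windowed_play_counts(stream):
    """Bucket events into tumbling windows and count plays per song --
    the aggregation behind a real-time 'trending now' recommender."""
    windows = {}
    for ts, song in stream:
        windows.setdefault(ts // WINDOW, Counter())[song] += 1
    return windows

for bucket, counts in sorted(windowed_play_counts(events).items()):
    print(bucket, counts.most_common(1)[0])  # top song per window
```

The same shape (key by window, reduce by count) carries over directly to Spark's windowed aggregations over a Kafka source.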
UX Analytics for Data-driven Product Development – Trieu Nguyen
- UX analytics can help companies turn their user data into real products by discovering user interests in real-time.
- Mobile analytics is important because mobile devices are becoming the dominant way users access the web, and big data and analytics are major trends.
- Core KPIs for mobile analytics include users, sessions, events, and other metrics to understand user behavior and how to engage app users.
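Of the KPIs above, sessions are the least direct: they must be derived from raw events. A small sketch of standard sessionization, assuming a 30-minute inactivity timeout (the timeout value and function name are illustrative):

```python
SESSION_TIMEOUT = 30 * 60  # 30-minute inactivity gap, a common default

def count_sessions(timestamps):
    """Derive the 'sessions' KPI from one user's raw event timestamps
    (in seconds): any gap longer than the timeout starts a new session."""
    sessions, last = 0, None
    for ts in sorted(timestamps):
        if last is None or ts - last > SESSION_TIMEOUT:
            sessions += 1
        last = ts
    return sessions

# Three bursts of activity separated by more than 30 minutes
print(count_sessions([0, 60, 120, 4000, 4100, 10000]))  # 3
```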
A Statistician Walks into a Tech Company: R at a Rapidly Scaling Healthcare S... – Work-Bench
This document summarizes a statistician's experience working at a healthcare technology startup that uses electronic health record data. It describes how the company initially had just one quantitative scientist but grew its team to include 70 software engineers and 10 quantitative scientists. It discusses how the company cultivated an R culture through internal packages, training, and hiring. It provides examples of when the company uses R for prototyping but implements in other languages for production, when R is used as a long-term solution, and when R and other languages are used in parallel for analysis.
Reactive Realtime Big Data with Open Source Lambda Architecture - TechCampVN 2014 – Trieu Nguyen
This document discusses using a reactive lambda architecture with open source tools to solve real-time big data problems. It begins by defining big data and explaining that simply having data is not enough - you need to solve the right problems with the right team and tools. It then presents three example problems that could benefit from real-time big data solutions: disaster prediction and response, understanding customers through social media data, and optimizing marketing campaigns in real-time. The document proposes using a reactive lambda architecture along with open source frameworks like Hadoop, Spark and Storm, and storage systems like Redis, HDFS and HBase, to build streaming data pipelines and query data in real-time. It demonstrates this through a social media user tracking and personalized recommendations use case.
Cutting through the noise – a digital space to help line managers - Sarah Mof... – Intranet Now
This document discusses the development of Arthur, an internal social media platform launched by ACCA in 2013. It was initially aimed at ACCA's 400 line managers to provide a centralized place for news and information. The platform later expanded and was renamed to help people managers by providing curated content on topics like coaching, communication, and motivation. Over time, the platform refined its content personalization, expanded its readership, and strengthened its measurement of user activity and roles. The goal is for the platform to continue cutting through noise and serving as a core information channel for managers.
Beyond Data Discovery: The Value Unlocked by Modern Data Modeling – Looker
In this webinar we will discuss Looker’s novel approach to data modeling and how it powers a data exploration environment with unprecedented depth and agility.
Some topics we will cover:
-A new architecture beyond direct connect
-Language-based, git-integrated data modeling
-Abstractions that make SQL more powerful and more efficient
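The "abstractions that make SQL more powerful" idea can be illustrated with a toy declarative model compiled down to executable SQL. The model format below is invented for illustration and is not LookML; it just shows the pattern of defining dimensions and measures once and generating queries from them:

```python
import sqlite3

# A toy declarative model: dimensions map to columns, measures to
# aggregate expressions. All names here are hypothetical.
model = {
    "table": "orders",
    "dimensions": {"status": "status"},
    "measures": {"order_count": "COUNT(*)", "revenue": "SUM(amount)"},
}

def to_sql(model, dims, measures):
    """Expand requested dimensions and measures into a GROUP BY query."""
    select = [f"{model['dimensions'][d]} AS {d}" for d in dims]
    select += [f"{model['measures'][m]} AS {m}" for m in measures]
    sql = f"SELECT {', '.join(select)} FROM {model['table']}"
    if dims:
        sql += " GROUP BY " + ", ".join(model["dimensions"][d] for d in dims)
    return sql

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (status TEXT, amount REAL)")
conn.executemany("INSERT INTO orders VALUES (?, ?)",
                 [("paid", 10.0), ("paid", 5.0), ("refunded", 10.0)])
sql = to_sql(model, ["status"], ["order_count", "revenue"])
print(sorted(conn.execute(sql).fetchall()))
```

The point of the abstraction: analysts pick dimensions and measures, and consistent, correct SQL is generated for them rather than hand-written per query.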
Data Con LA 2020
Description
Coming from a deep belief in data democratization, I believe that for any team to be successful collaborators, it must be data-centric and data should be accessible to all.
*To ensure that your team, whether software-engineering-centric or not, has maximum efficiency, data should be visible and the data lake should be accessible.
*Form a database for analytics summaries; discuss the different technologies (SQL, NoSQL), cost of deployment, need, and team-driven structure. Build an API for this database for external and inter-team crosstalk.
*Build an analytics and visualization layer on top of it (Flask/Django/Node, etc.) to give the team high visibility into their analysis and a faster turnaround of data.
*Discuss an easy way of enabling the team to run code, whether locally or in the cloud; JupyterHub is a great way of doing so, and it adds tremendous value and potential.
*Cover the common tools used for version control, CI/CD, coding technologies, etc.
*Finally, summarize how the mix of all these tools and technologies ensures maximum efficiency.
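The "database for analytics summaries" step above can be sketched with stdlib sqlite3: roll raw events up into a summary table that an API layer (Flask/Django/Node) would then serve to other teams. Table names, columns, and the endpoint shape are all illustrative:

```python
import sqlite3

# In-memory stand-in for the analytics-summaries database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (day TEXT, user_id TEXT)")
conn.executemany("INSERT INTO events VALUES (?, ?)", [
    ("2020-10-01", "u1"), ("2020-10-01", "u2"),
    ("2020-10-01", "u1"), ("2020-10-02", "u3"),
])

# Roll raw events up into a per-day summary table.
conn.execute("""
    CREATE TABLE daily_summary AS
    SELECT day,
           COUNT(*)                AS events,
           COUNT(DISTINCT user_id) AS active_users
    FROM events GROUP BY day
""")

def get_summary(day):
    """The query a hypothetical /summary/<day> API endpoint might run."""
    row = conn.execute(
        "SELECT events, active_users FROM daily_summary WHERE day = ?",
        (day,)).fetchone()
    return {"day": day, "events": row[0], "active_users": row[1]}

print(get_summary("2020-10-01"))
```

Serving precomputed summaries rather than raw events is what keeps the cross-team API cheap to query and safe to expose broadly.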
Speaker
Nawar Khabbaz, Rivian, Data Engineer
This document discusses DevOps and how it can accelerate innovation while reducing risk. DevOps combines software development ("Dev") and IT operations ("Ops") practices to shorten the systems development life cycle and provide continuous delivery. It allows organizations to develop and release software faster and more reliably. DevOps reduces risk by decreasing the likelihood and impact of adverse events through practices like continuous integration, automated testing, and infrastructure as code which allow for faster recovery from failures and security issues.
To rephrase an old saying: ‘It takes a village to raise an Analyst.’ Data Analysts and Scientists are working in teams delivering insight and analysis on an ongoing basis. So how do you get the team to support experimentation and insight delivery without ending up in an IT Engineer vs Analyst vs Data Governance war? We present 5 shocking steps to get these teams of people working together with practical, doable steps that can help you achieve data agility. The speaker has decades of hands on and executive management experience in data, analytics, and software development.
See the recording at http://looker.com/learn#ufh-i-225858450-driving-data-democracy-hadoop-amazon-redshift
The Hadoop ecosystem has improved markedly over the past few years. Moreover, MPP databases seem to slot in nicely as complementary tools to map-reduce batch jobs, in that they allow analytics teams to easily query massive structured data sets.
Rex Gibson, Manager of Data Engineering at Knewton and Scott Hoover, Data Scientist at Looker walk through how these pipelines work. They discuss:
- their technology and data stacks
- possible drawbacks to Hadoop + Redshift
- the merits and drawbacks associated with making data processing and querying more “democratic.”
This document discusses the technical architecture work package for the ViBRANT project. It covers hosting architecture, failover and mirroring, providing technical support, multisite integration, a dynamic site registry, measuring and publishing data usage, developing a citation metric, integrating Scratchpads, prioritizing development, code testing, managing training resources, developing a financial model for sustainability, and providing a service level agreement for users.
How The Economist with Cloud BI and Looker have improved data-driven decision... – Looker
This session by The Economist Group, Cloud BI Ltd and Looker explores the challenges of data-driven decision making and how powerful the approach can be. Hear how the solution was implemented quickly and evolved in the cloud and the benefits of being able to see and understand customer preferences through a 360-degree view.
Where is my big data: security, privacy and jurisdictions in the cloud – Chris Swan
This document summarizes Chris Swan's presentation on big data security, privacy, and jurisdiction in the cloud. The presentation covers Swan's background in technology, defines big data, discusses cloud security concerns and challenges of regulation across jurisdictions. It concludes by suggesting some steps individuals can take to protect their data, such as only using services from providers with strong privacy policies and avoiding services from countries with surveillance laws that compromise privacy.
Watch this webinar in full here: https://buff.ly/2MVTKqL
Self-Service BI promises to remove the bottleneck that exists between IT and business users. The truth is, if data is handed over to a wide range of data consumers without proper guardrails in place, it can result in data anarchy.
Attend this session to learn why data virtualization:
• Is a must for implementing the right self-service BI
• Makes self-service BI useful for every business user
• Accelerates any self-service BI initiative
The Maturity Model: Taking the Growing Pains Out of Hadoop – Inside Analysis
The Briefing Room with Rick van der Lans and Think Big, a Teradata Company
Live Webcast on June 16, 2015
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=197f8106531874cc5c14081ca214eaff
Hadoop is arguably one of the most disruptive technologies of the last decade. Once lauded solely for its ability to transform the speed of batch processing, it has marched steadily forward and promulgated an array of performance-enhancing accessories, notably Spark and YARN. Hadoop has evolved into much more than a file system and batch processor, and it now promises to stand as the data management and analytics backbone for enterprises.
Register for this episode of The Briefing Room to learn from veteran Analyst Rick van der Lans, as he discusses the emerging roles of Hadoop within the analytics ecosystem. He’ll be briefed by Ron Bodkin of Think Big, a Teradata Company, who will explore Hadoop’s maturity spectrum, from typical entry use cases all the way up the value chain. He’ll show how enterprises that already use Hadoop in production are finding new ways to exploit its power and build creative, dynamic analytics environments.
Visit InsideAnalysis.com for more information.
Architecting a Data Platform For Enterprise Use (Strata NY 2018) – mark madsen
Building a data lake involves more than installing Hadoop or putting data into AWS. The goal in most organizations is to build multi-use data infrastructure that is not subject to past constraints. This tutorial covers design assumptions, design principles, and how to approach the architecture and planning for multi-use data infrastructure in IT.
Long:
The goal in most organizations is to build multi-use data infrastructure that is not subject to past constraints. This session will discuss hidden design assumptions, review design principles to apply when building multi-use data infrastructure, and provide a reference architecture to use as you work to unify your analytics infrastructure.
The focus in our market has been on acquiring technology, and that ignores the more important part: the larger IT landscape within which this technology lives and the data architecture that lies at its core. If one expects longevity from a platform then it should be a designed rather than accidental architecture.
Architecture is more than just software. It starts from use and includes the data, technology, methods of building and maintaining, and organization of people. What are the design principles that lead to good design and a functional data architecture? What are the assumptions that limit older approaches? How can one integrate with, migrate from or modernize an existing data environment? How will this affect an organization's data management practices? This tutorial will help you answer these questions.
Topics covered:
* A brief history of data infrastructure and past design assumptions
* Categories of data and data use in organizations
* Data architecture
* Functional architecture
* Technology planning assumptions and guidance
Driving Real Insights Through Data Science – VMware Tanzu
Major changes in industries have been brought about by the emergence of data-driven discoveries and applications. Many organizations are bringing together their data and looking to drive change. But the ability to generate new insights in real time from massive sets of data is still far from commonplace.
At this event, data technology experts and data scientists from Pivotal provided the latest business perspective on how data science and engineering can be used to accelerate the generation of new insights.
For information about upcoming Pivotal events, please visit: http://pivotal.io/news-events/#events
Architecting a Platform for Enterprise Use - Strata London 2018 – mark madsen
The goal in most organizations is to build multi-use data infrastructure that is not subject to past constraints. This session will discuss hidden design assumptions, review design principles to apply when building multi-use data infrastructure, and provide a reference architecture to use as you work to unify your analytics infrastructure.
The focus in our market has been on acquiring technology, and that ignores the more important part: the larger IT landscape within which this technology lives and the data architecture that lies at its core. If one expects longevity from a platform then it should be a designed rather than accidental architecture.
Architecture is more than just software. It starts from use and includes the data, technology, methods of building and maintaining, and organization of people. What are the design principles that lead to good design and a functional data architecture? What are the assumptions that limit older approaches? How can one integrate with, migrate from or modernize an existing data environment? How will this affect an organization's data management practices? This tutorial will help you answer these questions.
Topics covered:
* A brief history of data infrastructure and past design assumptions
* Categories of data and data use in organizations
* Analytic workload characteristics and constraints
* Data architecture
* Functional architecture
* Tradeoffs between different classes of technology
* Technology planning assumptions and guidance
#strataconf
2014-10 DevOps NFi - Why it's a good idea to deploy 10 times per day v1.0 – Joakim Lindbom
Corporations are struggling with overly complex systems and system landscapes. DevOps is presented as one piece of the puzzle for moving to much leaner and simpler landscapes, all in order to increase readiness for change and innovation.
The presentation also discusses the basic thought error behind organising according to Design-Build-Run, which is the basis for most ICT IM outsourcing.
Data technology experts from Pivotal give the latest perspective on how big data analytics and applications are transforming organizations across industries.
This event provides an opportunity to learn about new developments in the rapidly-changing world of big data and understand best practices in creating Internet of Things (IoT) applications.
Learn more about the Pivotal Big Data Roadshow: http://pivotal.io/big-data/data-roadshow
Fit For Purpose: Preventing a Big Data Letdown – Inside Analysis
The Briefing Room with Dr. Robin Bloor and RedPoint Global
Live Webcast October 6, 2015
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=9982ad3a2603345984895f279e849d35
Gartner recently placed Big Data in its “trough of disillusionment,” reflective of many leaders’ struggle to prove the value of Hadoop within their organization. While the promise of enhanced data integration and enrichment is obvious, measurable results have remained elusive. This episode of The Briefing Room will outline how to successfully tie Big Data to existing business applications, preventing your next Hadoop project from being another “Big Data letdown.”
Register today to learn from veteran Analyst Dr. Robin Bloor as he discusses the importance of converging enterprise data integration with intelligence and scalability. He’ll be briefed by George Corugedo of RedPoint Global, who will provide concrete examples of how the convergence of scalable cloud platforms, ever-expanding data sources and intelligent execution can turn the Big Data hype into demonstrable business value.
Visit InsideAnalysis.com for more information.
Lewis Crawford's presentation from the BI Boss event in Leeds, focussing on our perspective on Big Data, Big Data projects, what to avoid, and how to make it work for you.
Top Business Intelligence Trends for 2016 – Panorama Software
10 top BI trends for 2016 – by Panorama
It's all about the insight
Visual perception rules
The learning suggestive system - AI gets real
The data product chain becomes democratized
Cloud (finally)
“Mobile”
Automated data integration
Internet of Things data accelerating into reality
Hadoop accelerators are the last chance for Hadoop
Fading of the centralized on-premise DWH
Cutting through the noise – a digital space to help line managers - Sarah Mof...Intranet Now
This document discusses the development of Arthur, an internal social media platform launched by ACCA in 2013. It was initially aimed at ACCA's 400 line managers to provide a centralized place for news and information. The platform later expanded and was renamed to help people managers by providing curated content on topics like coaching, communication, and motivation. Over time, the platform refined its content personalization, expanded its readership, and strengthened its measurement of user activity and roles. The goal is for the platform to continue cutting through noise and serving as a core information channel for managers.
Beyond Data Discovery: The Value Unlocked by Modern Data ModelingLooker
In this webinar we will discuss Looker’s novel approach to data modeling and how it powers a data exploration environment with unprecedented depth and agility.
Some topics we will cover:
-A new architecture beyond direct connect
-Language-based, git-integrated data modeling
-Abstractions that make SQL more powerful and more efficient
Data Con LA 2020
Description
Coming from a grand belief of data democratization, I believe that in order for any team to be successful collaborators, it has to be data centric and data should be accessible to all.
*To ensure that your non software or software engineering centric team has maximum efficiency, data should be visible, data lake should be accessible.
*Form a database for analytics summaries, talk about the different technologies(SQL, NoSQL) cost of deployment, need, team driven structure. Build an API for this database for external/inter team crosstalk.
*Build analytics and visual layer on top of it. Flask/Django/Node, etc.., to enable the team to have high visibility in their analysis, and to ensure a higher turnaround of data.
*Talk about an easy way of enabling the team to run code, could be local/cloud, JupyterHub is a great way of doing so, talk about the tremendous value added in that and the potential it enables
*Talk about the common tools user for version control/CICD/Coding technologies, etc..
*Finally summarize the value of the mixture of all these tools and technologies in order to ensure the maximum efficiency.
Speaker
Nawar Khabbaz, Rivian, Data Engineer
This document discusses DevOps and how it can accelerate innovation while reducing risk. DevOps combines software development ("Dev") and IT operations ("Ops") practices to shorten the systems development life cycle and provide continuous delivery. It allows organizations to develop and release software faster and more reliably. DevOps reduces risk by decreasing the likelihood and impact of adverse events through practices like continuous integration, automated testing, and infrastructure as code which allow for faster recovery from failures and security issues.
To rephrase an old saying: ‘It takes a village to raise an Analyst.’ Data Analysts and Scientists are working in teams delivering insight and analysis on an ongoing basis. So how do you get the team to support experimentation and insight delivery without ending up in an IT Engineer vs Analyst vs Data Governance war? We present 5 shocking steps to get these teams of people working together with practical, doable steps that can help you achieve data agility. The speaker has decades of hands on and executive management experience in data, analytics, and software development.
See the recording at http://looker.com/learn#ufh-i-225858450-driving-data-democracy-hadoop-amazon-redshift
The Hadoop ecosystem has improved markedly over the past few years. Moreover, MPP databases seem to slot in nicely as complementary tools to map-reduce batch jobs, in that they allow analytics teams to easily query massive structured data sets.
Rex Gibson, Manager of Data Engineering at Knewton and Scott Hoover, Data Scientist at Looker walk through how these pipelines work. They discuss:
- their technology and data stacks
- possible drawbacks to Hadoop + Redshift
- the merits and drawbacks associated with making data processing and querying more “democratic.”
This document discusses the technical architecture work package for the ViBRANT project. It covers hosting architecture, failover and mirroring, providing technical support, multisite integration, a dynamic site registry, measuring and publishing data usage, developing a citation metric, integrating Scratchpads, prioritizing development, code testing, managing training resources, developing a financial model for sustainability, and providing a service level agreement for users.
How the economist with cloud BI and Looker have improved data-driven decision...Looker
This session by The Economist Group, Cloud BI Ltd and Looker explores the challenges of data-driven decision making and how powerful the approach can be. Hear how the solution was implemented quickly and evolved in the cloud and the benefits of being able to see and understand customer preferences through a 360-degree view.
Where is my big data: security, privacy and jurisdictions in the cloudChris Swan
This document summarizes Chris Swan's presentation on big data security, privacy, and jurisdiction in the cloud. The presentation covers Swan's background in technology, defines big data, discusses cloud security concerns and challenges of regulation across jurisdictions. It concludes by suggesting some steps individuals can take to protect their data, such as only using services from providers with strong privacy policies and avoiding services from countries with surveillance laws that compromise privacy.
Watch this webinar in full here: https://buff.ly/2MVTKqL
Self-Service BI promises to remove the bottleneck that exists between IT and business users. The truth is, if data is handed over to a wide range of data consumers without proper guardrails in place, it can result in data anarchy.
Attend this session to learn why data virtualization:
• Is a must for implementing the right self-service BI
• Makes self-service BI useful for every business user
• Accelerates any self-service BI initiative
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
The Briefing Room with Rick van der Lans and Think Big, a Teradata Company
Live Webcast on June 16, 2015
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=197f8106531874cc5c14081ca214eaff
Hadoop is arguably one of the most disruptive technologies of the last decade. Once lauded solely for its ability to transform the speed of batch processing, it has marched steadily forward and promulgated an array of performance-enhancing accessories, notably Spark and YARN. Hadoop has evolved into much more than a file system and batch processor, and it now promises to stand as the data management and analytics backbone for enterprises.
Register for this episode of The Briefing Room to learn from veteran Analyst Rick van der Lans, as he discusses the emerging roles of Hadoop within the analytics ecosystem. He’ll be briefed by Ron Bodkin of Think Big, a Teradata Company, who will explore Hadoop’s maturity spectrum, from typical entry use cases all the way up the value chain. He’ll show how enterprises that already use Hadoop in production are finding new ways to exploit its power and build creative, dynamic analytics environments.
Visit InsideAnalysis.com for more information.
Architecting a Data Platform For Enterprise Use (Strata NY 2018)mark madsen
Building a data lake involves more than installing Hadoop or putting data into AWS. The goal in most organizations is to build multi-use data infrastructure that is not subject to past constraints. This tutorial covers design assumptions, design principles, and how to approach the architecture and planning for multi-use data infrastructure in IT.
Long:
The goal in most organizations is to build multi-use data infrastructure that is not subject to past constraints. This session will discuss hidden design assumptions, review design principles to apply when building multi-use data infrastructure, and provide a reference architecture to use as you work to unify your analytics infrastructure.
The focus in our market has been on acquiring technology, and that ignores the more important part: the larger IT landscape within which this technology lives and the data architecture that lies at its core. If one expects longevity from a platform then it should be a designed rather than accidental architecture.
Architecture is more than just software. It starts from use and includes the data, technology, methods of building and maintaining, and organization of people. What are the design principles that lead to good design and a functional data architecture? What are the assumptions that limit older approaches? How can one integrate with, migrate from or modernize an existing data environment? How will this affect an organization's data management practices? This tutorial will help you answer these questions.
Topics covered:
* A brief history of data infrastructure and past design assumptions
* Categories of data and data use in organizations
* Data architecture
* Functional architecture
* Technology planning assumptions and guidance
Driving Real Insights Through Data ScienceVMware Tanzu
Major changes in industries have been brought about by the emergence of data-driven discoveries and applications. Many organizations are bringing together their data, and looking to drive change. But the ability to generate new insights in real time from a massive sets of data is still far from commonplace.
At this event, data technology experts and data scientists from Pivotal provided the latest business perspective on how data science and engineering can be used to accelerate the generation of new insights.
For information about upcoming Pivotal events, please visit: http://pivotal.io/news-events/#events
Architecting a Platform for Enterprise Use - Strata London 2018mark madsen
The goal in most organizations is to build multi-use data infrastructure that is not subject to past constraints. This session will discuss hidden design assumptions, review design principles to apply when building multi-use data infrastructure, and provide a reference architecture to use as you work to unify your analytics infrastructure.
The focus in our market has been on acquiring technology, and that ignores the more important part: the larger IT landscape within which this technology lives and the data architecture that lies at its core. If one expects longevity from a platform then it should be a designed rather than accidental architecture.
Architecture is more than just software. It starts from use and includes the data, technology, methods of building and maintaining, and organization of people. What are the design principles that lead to good design and a functional data architecture? What are the assumptions that limit older approaches? How can one integrate with, migrate from or modernize an existing data environment? How will this affect an organization's data management practices? This tutorial will help you answer these questions.
Topics covered:
* A brief history of data infrastructure and past design assumptions
* Categories of data and data use in organizations
* Analytic workload characteristics and constraints
* Data architecture
* Functional architecture
* Tradeoffs between different classes of technology
* Technology planning assumptions and guidance
#strataconf
2014-10 DevOps NFi - Why it's a good idea to deploy 10 times per day v1.0Joakim Lindbom
Corporations are struggling with overly complex systems and system landscapes. DevOps is presented as one piece of the puzzle to go for much leaner and simpler landscapes - all in order to increase the readiness for change and innovation.
The presentation also discusses the basic thought error behind organising according to Design-Build-Run, which is the basis for most ICT IM outsourcing.
Data technology experts from Pivotal give the latest perspective on how big data analytics and applications are transforming organizations across industries.
This event provides an opportunity to learn about new developments in the rapidly-changing world of big data and understand best practices in creating Internet of Things (IoT) applications.
Learn more about the Pivotal Big Data Roadshow: http://pivotal.io/big-data/data-roadshow
Fit For Purpose: Preventing a Big Data LetdownInside Analysis
The Briefing Room with Dr. Robin Bloor and RedPoint Global
Live Webcast October 6, 2015
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=9982ad3a2603345984895f279e849d35
Gartner recently placed Big Data in its “trough of disillusionment,” reflective of many leaders’ struggle to prove the value of Hadoop within their organization. While the promise of enhanced data integration and enrichment is obvious, measurable results have remained elusive. This episode of The Briefing Room will outline how to successfully tie Big Data to existing business applications, preventing your next Hadoop project from being another “Big Data letdown.”
Register today to learn from veteran Analyst Dr. Robin Bloor as he discusses the importance of converging enterprise data integration with intelligence and scalability. He’ll be briefed by George Corugedo of RedPoint Global, who will provide concrete examples of how the convergence of scalable cloud platforms, ever-expanding data sources and intelligent execution can turn the Big Data hype into demonstrable business value.
Visit InsideAnalysis.com for more information.
Lewis Crawford's presentation from the BI Boss event in Leeds, focussing on our perspective on Big Data, Big Data projects, what to avoid, and how to make it work for you.
Top Business Intelligence Trends for 2016 by Panorama SoftwarePanorama Software
10 top BI trends for 2016 – by Panorama
Its all about the insight
Visual perception rules
The learning suggestive system - AI gets real
The data product chain becomes democratized
Cloud (finally)
“Mobile”
Automated data integration
Internet of Things data accelerating into reality
Hadoop accelerators are the last chance for Hadoop
Fading of the centralized on-premise DWH
There are 250 Database products, are you running the right one?Aerospike, Inc.
This webinar discusses choosing the right database for organizations. It will cover industry trends driving data and database evolution, real-world use cases where speed and scale are important, and an architecture overview. Speakers from Forrester and Aerospike will discuss how new applications are challenging traditional databases and how Aerospike's in-memory database provides extremely high performance for large-scale, data-intensive workloads. The agenda includes an industry overview, tips for choosing a database, how data has evolved, examples where low latency is critical, and a question and answer session.
A Connected Data Landscape: Virtualization and the Internet of ThingsInside Analysis
The Briefing Room with Dr. Robin Bloor and Cisco
Live Webcast March 3, 2015
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=a75f0f379405de155800a37b2bf104db
Data at rest, data in motion - regardless of its trajectory, data remains the lifeblood of today's information economy. But finding a way to bridge old systems with new opportunities requires an innovative data strategy, one that takes advantage of multiple processing technologies. With the optimal architecture in place, companies can harness years of work in traditional information systems, while opening the door to the flood of new data sources available.
Register for this episode of The Briefing Room to learn from veteran Analyst Dr. Robin Bloor, as he explains how data virtualization and other data technologies fundamentally change what's possible with data access, movement and analysis. He'll be briefed by David Besemer of Cisco, who will discuss how this new kind of data strategy can enable the integration of legacy systems, Cloud computing and the Internet of Things. He'll also answer questions about how Big Data and the IoT are helping to redefine the practice of data management.
Visit InsideAnalysis.com for more information.
S ba0881 big-data-use-cases-pearson-edge2015-v7Tony Pearson
IBM is a market leader in big data and analytics solutions. This session explains the basics of Big Data, with actual use cases of clients who have benefited from IBM solutions in this space, followed by architectures with IBM BigInsights, BigSQL, Platform Symphony and Spectrum Scale.
Fri benghiat gil-odsc-data-kitchen-data science to dataopsDataKitchen
This document outlines seven steps for transitioning from data science to data operations (DataOps):
1. Orchestrate the data science and production workflows.
2. Add testing at each step to monitor quality.
3. Use a version control system to manage code changes.
4. Implement branching and merging to allow parallel development.
5. Maintain separate environments for experiments, development and production.
6. Containerize components and practice environment version control.
7. Parameterize processes to increase flexibility and reuse.
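The seven steps above can be sketched in miniature. The following pure-Python sketch is a toy illustration, not DataKitchen's implementation; all step names, parameters and thresholds are hypothetical. It shows orchestration (step 1), a quality test after each step (step 2) and parameterization (step 7):

```python
# Minimal DataOps-style pipeline sketch: orchestrated steps,
# a quality test after each step, and parameterized behaviour.
# Step names and thresholds are hypothetical.

def extract(params):
    # Stand-in for reading rows from a source system.
    return list(range(params["n_rows"]))

def transform(rows, params):
    # Stand-in for a transformation step.
    return [r * params["scale"] for r in rows]

def quality_check(rows, min_rows):
    # Step 2: test each step's output before moving on.
    if len(rows) < min_rows:
        raise ValueError(f"expected at least {min_rows} rows, got {len(rows)}")
    return rows

def run_pipeline(params):
    # Step 1: orchestrate the steps in order.
    # Step 7: behaviour is driven entirely by params, so the same
    # code can serve development and production environments.
    rows = quality_check(extract(params), params["min_rows"])
    return quality_check(transform(rows, params), params["min_rows"])

dev_params = {"n_rows": 10, "scale": 2, "min_rows": 1}
result = run_pipeline(dev_params)
```

Under this setup, swapping `dev_params` for a production parameter set (step 5) reuses the same pipeline code unchanged.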
WSO2Con USA 2015: Keynote - Helping You Connect the WorldWSO2
The document discusses Sanjiva Weerawarana, founder and CEO of WSO2, and his vision for the company: thinking long-term and building a comprehensive middleware platform rather than chasing hype. It also outlines WSO2's product strategy updates to support microservices, containers, cloud, analytics, mobile/IoT, and the consumerization of IT through a series of new and updated products.
The Analytic Platform: Empowering the Business NowInside Analysis
The Briefing Room with Dr. Robin Bloor and Actuate
Live Webcast on October 7, 2014
Watch the archive:
https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=475312d15f46d095797f5842de84925f
As businesses grapple with more and more data, analysts and data consumers have a growing expectation to get at those assets fast. All too often, business users are stymied by governance and performance roadblocks, making time-to-insight a relatively slow process. One solution is to leverage the power of an analytic platform, one that keeps data management in IT’s hands, and lets business analysts jump right in without the need for modeling and provisioning.
Register for this episode of The Briefing Room to hear veteran Analyst Dr. Robin Bloor as he explains the principles behind a meaningful analytic platform. He’ll be briefed by Peter Hoopes and Allen Bonde of Actuate, who will tout their company’s BIRT Analytics, a solution that combines columnar database technology with pre-built algorithms and puts analytics in the hands of the business user in minutes, not days. They will show how their platform makes it easy to perform complex analytics on enterprise data and visualize results, without slowing down other systems or interfering with governance needs.
Visit InsideAnalysis.com for more information.
Similar to Big data debunking some of the myths (20)
LNETM - Atsign - Privacy with Personal Data ServicesChris Swan
London Enterprise Technology Meetup (LNETM) presentation on Atsign's atPlatform, which uses personal data services (PDS) and end-to-end encryption to build privacy-preserving applications for everybody, every organisation and everyTHING.
SOOCon24 - Showing that you care about security - OpenSSF ScorecardsChris Swan
Open Source Security Foundation (OpenSSF) Scorecards provide a way for open source users to determine whether maintainers are being diligent about securing their link in the software security supply chain. Practices such as pinning dependencies, branch protection, required reviews, continuous integration tests etc. are measured to provide a score and accompanying badge.
This presentation will provide a walkthrough of the steps involved in securing a first repository, and then what it takes to repeat that process across an organization with multiple repos. It will also look at the ongoing maintenance involved once scorecards have been implemented, and how aspects of that maintenance can be better automated to minimize toil.
All Day DevOps 2023 - Implementing OSSF Scorecards Across an Organisation.pdfChris Swan
Open Source Security Foundation (OpenSSF) Scorecards provide a way for open source users to determine whether maintainers are being diligent about securing their link in the software security supply chain. Practices such as pinning dependencies, branch protection, required reviews, continuous integration tests etc. are measured to provide a score and accompanying badge.
This presentation will provide a walkthrough of the steps involved in securing a first repository, and then what it takes to repeat that process across an organization with multiple repos. It will also look at the ongoing maintenance involved once scorecards have been implemented, and how aspects of that maintenance can be better automated to minimize toil.
Fluttercon Berlin 23 - Dart & Flutter on RISC-VChris Swan
Arm has dominated the mobile space since the dawn of smartphones, but systems based on the open source RISC-V instruction set architecture will bring new choices for manufacturers and us, their customers. RISC-V SDKs showed up in the Dart dev channel in Apr 22, but it's still pretty hard to build stuff due to lots of missing dependencies. As always happens with new stuff, the hardware people are waiting for broader software support, and the software people are waiting for a larger hardware installed base. This talk examines the forces that are driving RISC-V forward, and what developers can expect from a world that will have RISC-V devices, mobile phones, tablets and cloud services.
QConNY 2023 - Implementing OSSF Scorecards Across an OrganisationChris Swan
Open Source Security Foundation (OpenSSF) Scorecards provide a way for open source users to determine whether maintainers are being diligent about securing their link in the software security supply chain. Practices such as pinning dependencies, branch protection, required reviews, continuous integration tests etc. are measured to provide a score and accompanying badge.
This presentation will provide a walkthrough of the steps involved in securing a first repository, and then what it takes to repeat that process across an organization with multiple repos. It will also look at the ongoing maintenance involved once scorecards have been implemented, and how aspects of that maintenance can be better automated to minimize toil.
Flutter SV Meetup Oct 2022 - End to end encrypted IoT with Dart and FlutterChris Swan
Walkthrough of how Internet of Things (IoT) devices can run full stack Dart and connect to Flutter apps using end to end encryption to provide security and privacy.
Dart's popularity has surged in the past few years, as it's the language behind Flutter - Google's cross platform front end framework. That's now driving a notion of 'Full Stack Dart', where if you've spent time learning Dart for the front end, why not also use it for the back end.
London IoT Meetup Sep 2022 - End to end encrypted IoTChris Swan
Your thing, your data.
An overview of why end-to-end encryption is desirable for the Internet of Things (IoT), and how it can be done using personal data stores such as atSigns on the atPlatform.
Flutter Vikings 2022 - End to end IoT with Dart and FlutterChris Swan
Things need apps to manage them, which Flutter is great for, providing an easy way to build cross platform support. But things also need to get their data (securely and privately) to their apps, and Dart can be used for that. This presentation will walk through a use case demonstrated at Mobile World Congress (and now open sourced) that uses Dart to read sensor data through to Flutter for user presentation.
EMFcamp2022 - What if apps logged into you, instead of you logging into apps?Chris Swan
As a hacker and engineer I've been interested in identity and privacy since the dawn of the Internet and the online services it's enabled. For the past year I've been helping to build and open source The @ Platform, which inverts the usual model by giving everybody (and every thing) their own place to store data and control who (and what) has access to it. This talk will give an overview of the platform and its underlying protocol, and illustrate how it can be used to build privacy preserving apps and Internet connected things. It will also cover how the platform can be self hosted on devices like the Raspberry Pi, and how people can get involved in the open source community growing around it.
Devoxx UK 2022 - Application security: What should the attack landscape look ...Chris Swan
What do we need to do in the next few years to ensure that the attack landscape for 2030 isn't the same as 2020? Better languages and frameworks have already brought substantial improvements in memory safety, eliminating whole classes of vulnerabilities caused by buffer overflows. Yet despite a major reshuffle in 2021, the OWASP top 10 remains full of things that boil down to a lack of input validation, an issue that has bedevilled tech since its inception. We're all told that we shouldn't trust the input to our programs, and that validation is our best defence. But developers get precious little help on that front from today's languages and frameworks; something that can and should change. This talk will examine a hypothetical evolution of TypeScript - ValidScript - to consider a future where input validation is baked in.
Flutter Festival London 2022 - End to end IoT with Dart and FlutterChris Swan
A walk through of a demo system that was built for Mobile World Congress 2022 showing how Dart can be used to read data from a biometric sensor and send it to a Flutter front end application using end to end encryption.
Full Stack Squared 2022 - Power of Open SourceChris Swan
The document discusses the power of open source software and how people can get involved. It begins with an introduction of the author and covers the three types of "free" that define open source - free like beer meaning no cost, free like speech meaning freedom over the code, and free like puppy meaning ongoing maintenance is required. Famous people in open source like Richard Stallman, Eric Raymond, and Linus Torvalds are profiled. The document outlines how readers can get involved through contributing code, being considerate of maintainers, and participating in challenges. It concludes with contact information and a call for questions.
Flutter provides an excellent way to build Android, iOS, web and desktop apps, but what about the back end services? Full stack Dart is all about using that investment in Dart programming to build the services used by applications, whether it's in the cloud or on the Internet of Things. This presentation will look at the tradeoffs between just in time (JIT) and ahead of time (AOT) compilation, Dart on Docker, the Functions Framework for Dart, Profiling and Performance Management. Choices of back end architecture (x86_64 vs Arm) will also be examined, along with some of the challenges this can present for Continuous Delivery.
Why Dart?
Language features
JIT vs AOT
Dart on Docker
Functions Framework for Dart
Profiling and performance management
Other places you can learn more
Call to action - try out the Functions Framework Examples
This document summarizes a Raspberry Pi Sous Vide project that has been running for over 8 years. It details the project's longevity with stats on uptime, logs, and failed hardware components like temperature sensors and SD cards over time. The software has also evolved, including upgrades to the Raspberry Pi OS, changes to key dependencies, and a rewrite from Python 2 to Python 3. More details on the long-running project can be found online at the provided URL.
Dart on Arm - Flutter Bangalore June 2021Chris Swan
Running Dart on Arm servers, covering the trade offs between JIT and AOT. The dependencies needed for building and running AOT binaries, and how to cross compile Arm binaries.
The RC2014 system is built around a Z80 CPU, but is open and flexible enough to be used with alternatives. The presentation walks through a project to use Texas Instruments' TMS99xx parts, through to running 'Hello World' in BASIC and Forth.
The document contains summaries of several short talks or presentations on various topics such as ethics in technology, data bias, climate change, and social impact. The summaries are represented visually through maps or models linking different stages of product or service development to relevant approaches, tools, or considerations for each topic. Overall the document demonstrates using maps or models as a concise way to summarize key points that would be discussed in short talks.
DevSecOps Days London - Teaching 'Shift Left on Security'Chris Swan
Deck with backup screenshots of live demo of DevOps Dojo Yellow belt module 'Shift Left on Security' where students incorporate the OWASP dependency checking into a Jenkins CD pipeline around the Springboot Pet Clinic app.
The Rising Future of CPaaS in the Middle East 2024Yara Milbes
Explore "The Rising Future of CPaaS in the Middle East in 2024" with this comprehensive PPT presentation. Discover how Communication Platforms as a Service (CPaaS) is transforming communication across various sectors in the Middle East.
Everything You Need to Know About X-Sign: The eSign Functionality of XfilesPr...XfilesPro
Wondering how X-Sign gained popularity in such a short time span? This eSign functionality of XfilesPro DocuPrime has many advancements to offer for Salesforce users. Explore them now!
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsPeter Muessig
The UI5 tooling is the development and build tooling of UI5. It is built in a modular and extensible way so that it can be easily extended by your needs. This session will showcase various tooling extensions which can boost your development experience by far so that you can really work offline, transpile your code in your project to use even newer versions of EcmaScript (than 2022 which is supported right now by the UI5 tooling), consume any npm package of your choice in your project, using different kind of proxies, and even stitching UI5 projects during development together to mimic your target environment.
Preparing Non - Technical Founders for Engaging a Tech AgencyISH Technologies
Preparing non-technical founders before engaging a tech agency is crucial for the success of their projects. It starts with clearly defining their vision and goals, conducting thorough market research, and gaining a basic understanding of relevant technologies. Setting realistic expectations and preparing a detailed project brief are essential steps. Founders should select a tech agency with a proven track record and establish clear communication channels. Additionally, addressing legal and contractual considerations and planning for post-launch support are vital to ensure a smooth and successful collaboration. This preparation empowers non-technical founders to effectively communicate their needs and work seamlessly with their chosen tech agency. Visit our site for more details, or contact us today: www.ishtechnologies.com.au
14 th Edition of International conference on computer visionShulagnaSarkar2
About the event
14th Edition of International conference on computer vision
Computer conferences organized by the ScienceFather group. ScienceFather takes the privilege to invite speakers, participants, students, delegates and exhibitors from across the globe to its International Conference on Computer Vision, to be held in various cities around the world. The conferences are a discussion of common invention-related issues, and additionally a place to trade information and share ideas and insight into advanced developments in science and inventions. New technology may create many materials and devices with a vast range of applications, such as in science, medicine, electronics, biomaterials, energy production and consumer products.
Nominations are open! Don't miss it.
Visit: computer.scifat.com
Award Nomination: https://x-i.me/ishnom
Conference Submission: https://x-i.me/anicon
For Enquiry: Computer@scifat.com
UI5con 2024 - Bring Your Own Design SystemPeter Muessig
How do you combine the OpenUI5/SAPUI5 programming model with a design system that makes its controls available as Web Components? Since OpenUI5/SAPUI5 1.120, the framework supports the integration of any Web Components. This makes it possible, for example, to natively embed own Web Components of your design system which are created with Stencil. The integration embeds the Web Components in a way that they can be used naturally in XMLViews, like with standard UI5 controls, and can be bound with data binding. Learn how you can also make use of the Web Components base class in OpenUI5/SAPUI5 to also integrate your Web Components and get inspired by the solution to generate a custom UI5 library providing the Web Components control wrappers for the native ones.
How Can Hiring A Mobile App Development Company Help Your Business Grow?ToXSL Technologies
ToXSL Technologies is an award-winning Mobile App Development Company in Dubai that helps businesses reshape their digital possibilities with custom app services. As a top app development company in Dubai, we offer highly engaging iOS & Android app solutions. https://rb.gy/necdnt
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdfBaha Majid
IBM watsonx Code Assistant for Z, our latest Generative AI-assisted mainframe application modernization solution. Mainframe (IBM Z) application modernization is a topic that every mainframe client is addressing to various degrees today, driven largely from digital transformation. With generative AI comes the opportunity to reimagine the mainframe application modernization experience. Infusing generative AI will enable speed and trust, help de-risk, and lower total costs associated with heavy-lifting application modernization initiatives. This document provides an overview of the IBM watsonx Code Assistant for Z which uses the power of generative AI to make it easier for developers to selectively modernize COBOL business services while maintaining mainframe qualities of service.
Unveiling the Advantages of Agile Software Development.pdfbrainerhub1
Learn about Agile Software Development's advantages. Simplify your workflow to spur quicker innovation. Jump right in! We have also discussed the advantages.
Project Management: The Role of Project Dashboards.pdfKarya Keeper
Project management is a crucial aspect of any organization, ensuring that projects are completed efficiently and effectively. One of the key tools used in project management is the project dashboard, which provides a comprehensive view of project progress and performance. In this article, we will explore the role of project dashboards in project management, highlighting their key features and benefits.
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...The Third Creative Media
"Navigating Invideo: A Comprehensive Guide" is an essential resource for anyone looking to master Invideo, an AI-powered video creation tool. This guide provides step-by-step instructions, helpful tips, and comparisons with other AI video creators. Whether you're a beginner or an experienced video editor, you'll find valuable insights to enhance your video projects and bring your creative ideas to life.
Microservice Teams - How the cloud changes the way we workSven Peters
A lot of technical challenges and complexity come with building a cloud-native and distributed architecture. The way we develop backend software has fundamentally changed in the last ten years. Managing a microservices architecture demands a lot of us to ensure observability and operational resiliency. But did you also change the way you run your development teams?
Sven will talk about Atlassian’s journey from a monolith to a multi-tenanted architecture and how it affected the way the engineering teams work. You will learn how we shifted to service ownership, moved to more autonomous teams (and its challenges), and established platform and enablement teams.
2. copyright 2015
Agenda
• My background
• What do I mean by big data?
• Know your algorithm
• Know your data
• Performance
3. My background
CTO
CTO Client Experience
Co-head CTO Security
Corporate Finance
fintech, early stage
IT R&D – Networks and security
Grid, app server engineering
Combat System Engineer
5. Misquoting Roger Needham
"Whoever thinks their analytics problem is solved by big data, doesn't understand their analytics problem and doesn't understand big data"
7. Overview
Based on a blog post from April 2012 – http://is.gd/swbdla
[Chart: Problem Types. Axes: Algorithm Complexity vs. Data Volume; regions: Simple, Big Data, Quant]
9. Quant Problems
Any data volume, high algorithm complexity
10. Big Data Problems
High data volume, low algorithm complexity
Types of Big Data Problem:
1. Inherent
2. More data gives better result than more complex algorithm
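The second type (more data giving a better result than a more complex algorithm) can be illustrated with a toy sketch in pure Python, using hypothetical numbers: hold the estimator fixed at a simple sample mean and let added data alone improve the result.

```python
# Toy illustration: keep the algorithm simple (a sample mean)
# and watch the estimate improve as the data volume grows.
# The data source and noise level are hypothetical.
import random

random.seed(42)
TRUE_MEAN = 5.0

def noisy_sample(n):
    # Hypothetical data source: the true value plus Gaussian noise.
    return [TRUE_MEAN + random.gauss(0, 2) for _ in range(n)]

def estimate(n):
    # The "algorithm" never changes; only the data volume does.
    data = noisy_sample(n)
    return sum(data) / len(data)

small_error = abs(estimate(100) - TRUE_MEAN)
big_error = abs(estimate(100_000) - TRUE_MEAN)
# With high probability the larger sample lands closer to the truth.
```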
11. The good, the bad and the ugly of Big Data
Good
- Lots of new tools, mostly open source
Bad
- Term being abused by marketing departments
Ugly
- Can easily lead to over-reliance on systems that lack transparency and ignore specific data points
'Computer says no', but nobody can explain why
23. Don't agonise over distros
"The performance of Hadoop distros is all the same to within 1 server within a cluster"
Stefan Groschupf
One of the creators of Hadoop