There are a number of issues production applications need to solve to be scalable and fault tolerant. In this talk, we explore some tips for efficiently running Python apps, particularly with Flask, on App Engine. We also share some collective experience and best practices on GAE.
How we solved Real-time User Segmentation using HBaseDataWorks Summit
At RichRelevance, we service 10 of the top 20 Internet retailer chains and deliver more than $5.5 billions in attributable sales. Every 21 milliseconds a shopper clicks on a recommendation that we have delivered, and we serve over 850 million product recommendations daily. Our Hadoop infrastructure has a capacity to handle upwards of 1.5+ PB. Behavioral Targeting, specifically user segmentation and building personas, is critical for us in generating triggers when a user is added to a segment or switches from a segment. In this presentation, we intend to demonstrate not only how the events are captured, but also how they are stored in HBase in real-time. It is critical to design the system so it can handle thousands of writes per second and, at the same time, be able to query any combination of behavioral attributes in HBase through real-time APIs. This session will walk attendees through the entire design & architecture starting from data Ingestion, schema design, and access patterns, as well as some major problems like sharing & hot spotting. Furthermore, performance metrics will be presented, including the number of read/write per second and details around cluster configuration.
We present our solution for building an AI Architecture that provides engineering teams the ability to leverage data to drive insight and help our customers solve their problems. We started with siloed data, entities that were described differently by each product, different formats, complicated security and access schemes, data spread over numerous locations and systems.
Domain Driven Design - Strategic Patterns and MicroservicesRadosław Maziarka
Presentation describes Domain Driven Design - approach to create applications driven by business domain. I show how to split your monolith base on DDD strategic patterns.
Event-driven architecture is a versatile approach to designing and integrating complex software systems. These systems tend to be easier to model and build. Event-driven architecture is not a new concept, but as more organizations contemplate microservices, this approach to system design has become appropriate in more situations and is worth a fresh look.
Mind Map of Big Data Technologies and ConceptsAmir Hadad
My first try to capture #bigdata related concepts and technologies in a #mindmap!
#NoSQL vs #SQL https://db-engines.com/
#Streaming vs #Batchprocessing https://lnkd.in/gNEdsrC
#BigDataSecurity https://lnkd.in/gA8e7RC
#OLAP https://lnkd.in/gh3z6r9
How we solved Real-time User Segmentation using HBaseDataWorks Summit
At RichRelevance, we service 10 of the top 20 Internet retailer chains and deliver more than $5.5 billions in attributable sales. Every 21 milliseconds a shopper clicks on a recommendation that we have delivered, and we serve over 850 million product recommendations daily. Our Hadoop infrastructure has a capacity to handle upwards of 1.5+ PB. Behavioral Targeting, specifically user segmentation and building personas, is critical for us in generating triggers when a user is added to a segment or switches from a segment. In this presentation, we intend to demonstrate not only how the events are captured, but also how they are stored in HBase in real-time. It is critical to design the system so it can handle thousands of writes per second and, at the same time, be able to query any combination of behavioral attributes in HBase through real-time APIs. This session will walk attendees through the entire design & architecture starting from data Ingestion, schema design, and access patterns, as well as some major problems like sharing & hot spotting. Furthermore, performance metrics will be presented, including the number of read/write per second and details around cluster configuration.
We present our solution for building an AI Architecture that provides engineering teams the ability to leverage data to drive insight and help our customers solve their problems. We started with siloed data, entities that were described differently by each product, different formats, complicated security and access schemes, data spread over numerous locations and systems.
Domain Driven Design - Strategic Patterns and MicroservicesRadosław Maziarka
Presentation describes Domain Driven Design - approach to create applications driven by business domain. I show how to split your monolith base on DDD strategic patterns.
Event-driven architecture is a versatile approach to designing and integrating complex software systems. These systems tend to be easier to model and build. Event-driven architecture is not a new concept, but as more organizations contemplate microservices, this approach to system design has become appropriate in more situations and is worth a fresh look.
Mind Map of Big Data Technologies and ConceptsAmir Hadad
My first try to capture #bigdata related concepts and technologies in a #mindmap!
#NoSQL vs #SQL https://db-engines.com/
#Streaming vs #Batchprocessing https://lnkd.in/gNEdsrC
#BigDataSecurity https://lnkd.in/gA8e7RC
#OLAP https://lnkd.in/gh3z6r9
SCS 4120 - Software Engineering IV
BACHELOR OF SCIENCE HONOURS IN COMPUTER SCIENCE
BACHELOR OF SCIENCE HONOURS IN SOFTWARE ENGINEERING
All in One Place Lecture Notes
Distribution Among Friends Only
All copyrights belong to their respective owners
Viraj Brian Wijesuriya
vbw@ucsc.cmb.ac.lk
SQL vs NoSQL, an experiment with MongoDBMarco Segato
A simple experiment with MongoDB compared to Oracle classic RDBMS database: what are NoSQL databases, when to use them, why to choose MongoDB and how we can play with it.
ELT vs. ETL - How they’re different and why it mattersMatillion
ELT is a fundamentally better way to load and transform your data. It’s faster. It’s more efficient. And Matillion’s browser-based interface makes it easier than ever to work with your data. You’re using data to improve your world: shouldn’t the tools you use return the favor?
In this webinar:
- Explore the differences between ELT and ETL
- Learn why ELT is a better, more modern process
- Discover the latest trends in ELT and how they apply to your business
- Find out how Matillion ETL makes loading large amounts of data easier
Applying Anti-Reversing Techniques to Machine CodeTeodoro Cipresso
CS266 Software Reverse Engineering (SRE)Applying Anti-Reversing Techniques to Machine Code
Teodoro (Ted) Cipresso, teodoro.cipresso@sjsu.edu
Department of Computer Science
San José State University
Spring 2015
Slidedeck presented at http://devternity.com/ around MongoDB internals. We review the usage patterns of MongoDB, the different storage engines and persistency models as well has the definition of documents and general data structures.
Building Cloud-Native App Series - Part 1 of 11
Microservices Architecture Series
Design Thinking, Lean Startup, Agile (Kanban, Scrum),
User Stories, Domain-Driven Design
How to Choose the Right Database for Your WorkloadsInfluxData
Learn how to make the right choice for your workloads with this walkthrough of a set of distinct database types (graph, in-memory, search, columnar, document, relational, key-value, and time series databases). In this webinar, we will review the strengths and qualities of each database type from their particular use-case perspectives.
A coleta e a análise de grandes quantidades de dados têm se tornado tarefas essenciais em muitas empresas. Os data lakes se tornaram uma escolha popular para o armazenamento de todo tipo de dados, estruturados e não-estruturados, servindo como uma “fonte única da verdade”. Veja este webinar para descobrir como você pode facilmente criar e gerenciar, de forma segura, um data lake utilizando serviços da AWS.
In this presentation, Raghavendra BM of Valuebound has discussed the basics of MongoDB - an open-source document database and leading NoSQL database.
----------------------------------------------------------
Get Socialistic
Our website: http://valuebound.com/
LinkedIn: http://bit.ly/2eKgdux
Facebook: https://www.facebook.com/valuebound/
Twitter: http://bit.ly/2gFPTi8
SCS 4120 - Software Engineering IV
BACHELOR OF SCIENCE HONOURS IN COMPUTER SCIENCE
BACHELOR OF SCIENCE HONOURS IN SOFTWARE ENGINEERING
All in One Place Lecture Notes
Distribution Among Friends Only
All copyrights belong to their respective owners
Viraj Brian Wijesuriya
vbw@ucsc.cmb.ac.lk
SQL vs NoSQL, an experiment with MongoDBMarco Segato
A simple experiment with MongoDB compared to Oracle classic RDBMS database: what are NoSQL databases, when to use them, why to choose MongoDB and how we can play with it.
ELT vs. ETL - How they’re different and why it mattersMatillion
ELT is a fundamentally better way to load and transform your data. It’s faster. It’s more efficient. And Matillion’s browser-based interface makes it easier than ever to work with your data. You’re using data to improve your world: shouldn’t the tools you use return the favor?
In this webinar:
- Explore the differences between ELT and ETL
- Learn why ELT is a better, more modern process
- Discover the latest trends in ELT and how they apply to your business
- Find out how Matillion ETL makes loading large amounts of data easier
Applying Anti-Reversing Techniques to Machine CodeTeodoro Cipresso
CS266 Software Reverse Engineering (SRE)Applying Anti-Reversing Techniques to Machine Code
Teodoro (Ted) Cipresso, teodoro.cipresso@sjsu.edu
Department of Computer Science
San José State University
Spring 2015
Slidedeck presented at http://devternity.com/ around MongoDB internals. We review the usage patterns of MongoDB, the different storage engines and persistency models as well has the definition of documents and general data structures.
Building Cloud-Native App Series - Part 1 of 11
Microservices Architecture Series
Design Thinking, Lean Startup, Agile (Kanban, Scrum),
User Stories, Domain-Driven Design
How to Choose the Right Database for Your WorkloadsInfluxData
Learn how to make the right choice for your workloads with this walkthrough of a set of distinct database types (graph, in-memory, search, columnar, document, relational, key-value, and time series databases). In this webinar, we will review the strengths and qualities of each database type from their particular use-case perspectives.
A coleta e a análise de grandes quantidades de dados têm se tornado tarefas essenciais em muitas empresas. Os data lakes se tornaram uma escolha popular para o armazenamento de todo tipo de dados, estruturados e não-estruturados, servindo como uma “fonte única da verdade”. Veja este webinar para descobrir como você pode facilmente criar e gerenciar, de forma segura, um data lake utilizando serviços da AWS.
In this presentation, Raghavendra BM of Valuebound has discussed the basics of MongoDB - an open-source document database and leading NoSQL database.
----------------------------------------------------------
Get Socialistic
Our website: http://valuebound.com/
LinkedIn: http://bit.ly/2eKgdux
Facebook: https://www.facebook.com/valuebound/
Twitter: http://bit.ly/2gFPTi8
90-minute October 2015 Los Angeles CTO Forum presentation on AngularJS, other JavaScript frameworks including ReactJS, and the state of web development in 2015.
Topics covered:
- State of web development in 2015
- AngularJS code examples
- Analysis of JavaScript MVC frameworks suitable for 2015-2019 development
- AngularJS pros/cons
- ReactJS
- Hybrid mobile apps
IT 8003 Cloud ComputingFor this activi.docxvrickens
IT 8003 Cloud Computing
For this activity you need to divide your class in groups
1
Group Activity 1 “SuperTAX Software”
2
SuperTax Overview
Did you know President Abraham Lincoln, one of America's most beloved leaders, also instituted one of its least liked obligations - the income tax? In this brief history of taxes, see the historical events which shaped income taxes in the United States today.
SuperTax is an American tax preparation software package developed in the mid-1980s.
SuperTax Corporation is headquartered in Mountain View, California.
2
Group Activity 1 “SuperTAX Software”
3
SuperTax Information
Desktop Software.
Support MS Windows and Mac OS.
Software method: CD/DVD media format.
Different versions:
SuperTAX Basic, Deluxe, Premier, and Home & Business.
Used by millions of users and organizations.
Group Activity 1 “SuperTAX Software”
4
SuperTAX Project
SuperTAX has hired your group as a consultant to move their Desktop Software to a Traditional IT Hosted Software, available Online.
Group Activity 1 “SuperTAX Software”
5
For Discussion:
Find the challenges that your team will encounter attempting to move SuperTAX Software to the new platform.
Prepared a presentation for the class.
On your Group you will need to define positions.
For example:
Project Manager, Senior Project Network, Senior Project Engineer, etc.
Group Activity 1 “SuperTAX Software”
6
Infrastructure
Software Development
Software Testing
Marketing & Business Model
Project Management
CHALLENGES
Group Activity 1 “SuperTAX Software”
7
Infrastructure
No more test in a single machine. (CD/DVD format model)
Test in a production cluster. (20, 30 users?)
A larger cluster can bring problems. (1000’s of users)
Testing must be done for different clients (mobile, desktops, OS)
Small performance bottleneck. Slow performance.
CHALLENGES
Group Activity 1 “SuperTAX Software”
8
Marketing & Business Model
One time fixed cost vs. subscription model
Before a CD was sold, now a subscription model.
Maintenance and replacement of cooling, power, and server is required
CHALLENGES
Group Activity 1 “SuperTAX Software”
9
Project Management
Project can take many months to years for Software Development cycle.
What model is appropriate for Hosted application. (Agile vs. waterfall)
Ability to try new features faster.
CHALLENGES
RUNNING HEAD: INTERSESSION 5 FINAL PROJECT PROJECTION 1
INTERSESSION 5 FINAL PROJECT PROJECTION 5
INTERSESSION 5 FINAL PROJECT PROJECTION
Shalini Kantamneni
Ottawa University
Intersession 5 Final Project Projection
The Design Process
This process involves the formulation of a model to be used in deriving a comprehensive cloud application. In this case, the model-view-controller design pattern will be used. This type of design pattern partitions the logic of the application into three distinct domains that are to be interconnected to provide a working cloud application (Jailia et al., 2016). ...
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB
To successfully implement our clients' unique use cases and data patterns, it is mandatory that we unlearn many relational concepts while designing and rapidly developing efficient applications in NoSQL.
In this session, we will talk about some of our client use cases and the strategies we adopted using features of MongoDB.
Accelerate Enterprise Software Engineering with PlatformlessWSO2
Key takeaways:
Challenges of building platforms and the benefits of platformless.
Key principles of platformless, including API-first, cloud-native middleware, platform engineering, and developer experience.
How Choreo enables the platformless experience.
How key concepts like application architecture, domain-driven design, zero trust, and cell-based architecture are inherently a part of Choreo.
Demo of an end-to-end app built and deployed on Choreo.
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisGlobus
JASMIN is the UK’s high-performance data analysis platform for environmental science, operated by STFC on behalf of the UK Natural Environment Research Council (NERC). In addition to its role in hosting the CEDA Archive (NERC’s long-term repository for climate, atmospheric science & Earth observation data in the UK), JASMIN provides a collaborative platform to a community of around 2,000 scientists in the UK and beyond, providing nearly 400 environmental science projects with working space, compute resources and tools to facilitate their work. High-performance data transfer into and out of JASMIN has always been a key feature, with many scientists bringing model outputs from supercomputers elsewhere in the UK, to analyse against observational or other model data in the CEDA Archive. A growing number of JASMIN users are now realising the benefits of using the Globus service to provide reliable and efficient data movement and other tasks in this and other contexts. Further use cases involve long-distance (intercontinental) transfers to and from JASMIN, and collecting results from a mobile atmospheric radar system, pushing data to JASMIN via a lightweight Globus deployment. We provide details of how Globus fits into our current infrastructure, our experience of the recent migration to GCSv5.4, and of our interest in developing use of the wider ecosystem of Globus services for the benefit of our user community.
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Globus
The U.S. Geological Survey (USGS) has made substantial investments in meeting evolving scientific, technical, and policy driven demands on storing, managing, and delivering data. As these demands continue to grow in complexity and scale, the USGS must continue to explore innovative solutions to improve its management, curation, sharing, delivering, and preservation approaches for large-scale research data. Supporting these needs, the USGS has partnered with the University of Chicago-Globus to research and develop advanced repository components and workflows leveraging its current investment in Globus. The primary outcome of this partnership includes the development of a prototype enterprise repository, driven by USGS Data Release requirements, through exploration and implementation of the entire suite of the Globus platform offerings, including Globus Flow, Globus Auth, Globus Transfer, and Globus Search. This presentation will provide insights into this research partnership, introduce the unique requirements and challenges being addressed and provide relevant project progress.
Unleash Unlimited Potential with One-Time Purchase
BoxLang is more than just a language; it's a community. By choosing a Visionary License, you're not just investing in your success, you're actively contributing to the ongoing development and support of BoxLang.
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamtakuyayamamoto1800
In this slide, we show the simulation example and the way to compile this solver.
In this solver, the Helmholtz equation can be solved by helmholtzFoam. Also, the Helmholtz equation with uniformly dispersed bubbles can be simulated by helmholtzBubbleFoam.
TROUBLESHOOTING 9 TYPES OF OUTOFMEMORYERRORTier1 app
Even though at surface level ‘java.lang.OutOfMemoryError’ appears as one single error; underlyingly there are 9 types of OutOfMemoryError. Each type of OutOfMemoryError has different causes, diagnosis approaches and solutions. This session equips you with the knowledge, tools, and techniques needed to troubleshoot and conquer OutOfMemoryError in all its forms, ensuring smoother, more efficient Java applications.
Code reviews are vital for ensuring good code quality. They serve as one of our last lines of defense against bugs and subpar code reaching production.
Yet, they often turn into annoying tasks riddled with frustration, hostility, unclear feedback and lack of standards. How can we improve this crucial process?
In this session we will cover:
- The Art of Effective Code Reviews
- Streamlining the Review Process
- Elevating Reviews with Automated Tools
By the end of this presentation, you'll have the knowledge on how to organize and improve your code review proces
Listen to the keynote address and hear about the latest developments from Rachana Ananthakrishnan and Ian Foster who review the updates to the Globus Platform and Service, and the relevance of Globus to the scientific community as an automation platform to accelerate scientific discovery.
First Steps with Globus Compute Multi-User EndpointsGlobus
In this presentation we will share our experiences around getting started with the Globus Compute multi-user endpoint. Working with the Pharmacology group at the University of Auckland, we have previously written an application using Globus Compute that can offload computationally expensive steps in the researcher's workflows, which they wish to manage from their familiar Windows environments, onto the NeSI (New Zealand eScience Infrastructure) cluster. Some of the challenges we have encountered were that each researcher had to set up and manage their own single-user globus compute endpoint and that the workloads had varying resource requirements (CPUs, memory and wall time) between different runs. We hope that the multi-user endpoint will help to address these challenges and share an update on our progress here.
How Does XfilesPro Ensure Security While Sharing Documents in Salesforce?XfilesPro
Worried about document security while sharing them in Salesforce? Fret no more! Here are the top-notch security standards XfilesPro upholds to ensure strong security for your Salesforce documents while sharing with internal or external people.
To learn more, read the blog: https://www.xfilespro.com/how-does-xfilespro-make-document-sharing-secure-and-seamless-in-salesforce/
We describe the deployment and use of Globus Compute for remote computation. This content is aimed at researchers who wish to compute on remote resources using a unified programming interface, as well as system administrators who will deploy and operate Globus Compute services on their research computing infrastructure.
Check out the webinar slides to learn more about how XfilesPro transforms Salesforce document management by leveraging its world-class applications. For more details, please connect with sales@xfilespro.com
If you want to watch the on-demand webinar, please click here: https://www.xfilespro.com/webinars/salesforce-document-management-2-0-smarter-faster-better/
top nidhi software solution freedownloadvrstrong314
This presentation emphasizes the importance of data security and legal compliance for Nidhi companies in India. It highlights how online Nidhi software solutions, like Vector Nidhi Software, offer advanced features tailored to these needs. Key aspects include encryption, access controls, and audit trails to ensure data security. The software complies with regulatory guidelines from the MCA and RBI and adheres to Nidhi Rules, 2014. With customizable, user-friendly interfaces and real-time features, these Nidhi software solutions enhance efficiency, support growth, and provide exceptional member services. The presentation concludes with contact information for further inquiries.
Multiple Your Crypto Portfolio with the Innovative Features of Advanced Crypt...Hivelance Technology
Cryptocurrency trading bots are computer programs designed to automate buying, selling, and managing cryptocurrency transactions. These bots utilize advanced algorithms and machine learning techniques to analyze market data, identify trading opportunities, and execute trades on behalf of their users. By automating the decision-making process, crypto trading bots can react to market changes faster than human traders
Hivelance, a leading provider of cryptocurrency trading bot development services, stands out as the premier choice for crypto traders and developers. Hivelance boasts a team of seasoned cryptocurrency experts and software engineers who deeply understand the crypto market and the latest trends in automated trading, Hivelance leverages the latest technologies and tools in the industry, including advanced AI and machine learning algorithms, to create highly efficient and adaptable crypto trading bots
Experience our free, in-depth three-part Tendenci Platform Corporate Membership Management workshop series! In Session 1 on May 14th, 2024, we began with an Introduction and Setup, mastering the configuration of your Corporate Membership Module settings to establish membership types, applications, and more. Then, on May 16th, 2024, in Session 2, we focused on binding individual members to a Corporate Membership and Corporate Reps, teaching you how to add individual members and assign Corporate Representatives to manage dues, renewals, and associated members. Finally, on May 28th, 2024, in Session 3, we covered questions and concerns, addressing any queries or issues you may have.
For more Tendenci AMS events, check out www.tendenci.com/events
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Globus
Large Language Models (LLMs) are currently the center of attention in the tech world, particularly for their potential to advance research. In this presentation, we'll explore a straightforward and effective method for quickly initiating inference runs on supercomputers using the vLLM tool with Globus Compute, specifically on the Polaris system at ALCF. We'll begin by briefly discussing the popularity and applications of LLMs in various fields. Following this, we will introduce the vLLM tool, and explain how it integrates with Globus Compute to efficiently manage LLM operations on Polaris. Attendees will learn the practical aspects of setting up and remotely triggering LLMs from local machines, focusing on ease of use and efficiency. This talk is ideal for researchers and practitioners looking to leverage the power of LLMs in their work, offering a clear guide to harnessing supercomputing resources for quick and effective LLM inference.
How Recreation Management Software Can Streamline Your Operations.pptxwottaspaceseo
Recreation management software streamlines operations by automating key tasks such as scheduling, registration, and payment processing, reducing manual workload and errors. It provides centralized management of facilities, classes, and events, ensuring efficient resource allocation and facility usage. The software offers user-friendly online portals for easy access to bookings and program information, enhancing customer experience. Real-time reporting and data analytics deliver insights into attendance and preferences, aiding in strategic decision-making. Additionally, effective communication tools keep participants and staff informed with timely updates. Overall, recreation management software enhances efficiency, improves service delivery, and boosts customer satisfaction.
Software Engineering, Software Consulting, Tech Lead.
Spring Boot, Spring Cloud, Spring Core, Spring JDBC, Spring Security,
Spring Transaction, Spring MVC,
Log4j, REST/SOAP WEB-SERVICES.
2. Emmanuel
Olowosulu
CTO, Formplus
@seyisulu
o Also, software developer with
experience building Flask
applications and community builder.
Yemisi
Obielodan
Product Manager, Formplus
@ye_misi
o Over 8 years experience consulting
and building production apps with a
variety of stacks and technologies.
3. o Formplus is a cloud-based
data collection software as a
service. We help businesses
gather data from their users
who are online and offline,
analyze the data and then
help them make data-driven
decisions from it.
o Also a fun place to work.
4. o Python 2.7
o Flask
o Google App Engine
o Cloud Datastore
o Cloud Tasks
o AngularJS 1.5
(don’t judge !)
Our Stack
5. What We Will Be Covering
OVERVIEW
Large applications,
Flask and Google App
Engine
FLASK
Flask application
structure, blueprints
and request routing
APP
ENGINE
App Engine, Cloud
Datastore and the
Google Cloud Platform
SCALING
Performance, effective
GCP utilization,
problems
QUESTIONS
?
7. What is a Large
Application?
o Large number of users
o High requests/second
o Distributed servers
o Large data volume
8. Flask
Framework
o Flask is a micro web
framework written in
Python
o Supports extensions for
additional functionality
o Compatible with Google
App Engine
9. Google App
Engine
o GAE is a web
framework and cloud
computing platform for
developing and hosting
web applications in
Google-managed data
centers
o Applications are
sandboxed and run
across multiple servers
11. Flask Application Structure
o Large Flask apps should be broken down into
microservices. There should be at least two: the
frontend handling user facing requests and backend
handling longer running data and event processing.
o This ensures that the application is responsive.
o Also, there is nothing micro about microservices; the
frontend and backend may be monoliths themselves and
could benefit from refactoring.
12. Flask Structure
w/Blueprints
o In the structure shown, each
directory represents a
microservice
o In each microservice, group
related functionality using
Blueprints: the blog
microservice has frontend and
backend Blueprints
o App Engine handles requests
using rules contained in
dispatch.yaml
14. Monorepo vs Multirepo
o Microservice purist may insist on having different
repositories for each microservice but we have had
success with a Monorepo.
o There are pros and cons associated with both
approaches and a Multirepo is better suited for large
teams each working on distinct microservices.
o Also, Conway’s Law: software ends up "shaped like" the
organizational structure it's designed in.
15. organizations which design
systems ... are constrained
to produce designs which
are copies of the
communication structures
of these organizations.
Conway’s Law
16. In Flask there are two
ways to create routes:
class based and function
based. A lot of people are
familiar with the latter but
using the class based
method can lead to better
code organization as you
can use inheritance to
share functionality
between routes.
Request Routing
18. App Engine & Google Cloud Platform
Image credits: https://cloud.google.com/appengine/
19. Cloud Storage
o Google Cloud Storage is file storage solution intended
for developers and system admins and has APIs allowing
apps to store and retrieve objects. It features global
edge-caching, versioning, OAuth and granular access
controls, configurable security, etc.
20. Cloud CDN
o Google Cloud CDN (Content Delivery Network) uses
Google's servers worldwide to load balanced content
close to your users. This provides faster delivery of
content to your users and reduces serving costs.
o App Engine also automatically pushes application assets
to Cloud CDN on deployment.
21. Cloud Tasks
o Task queues let applications do work, asynchronously
outside of a user request. If an app needs to execute
work in the background, it adds tasks to task queues.
The tasks are executed later, by worker services. Task
queues come in two flavors, push and pull.
o Push queues run tasks by delivering HTTP requests to
App Engine worker services.
o Pull queues rely on other worker services to fetch tasks
from the queue on their own.
22. Cloud Pub/Sub
o Cloud Pub/Sub is a fully-managed real-time messaging
service that allows you to send and receive messages
between independent applications.
o Pub/Sub broadcasts messages from publishers to
subscribers via channels. Many subscribers can
subscribe to the same channel.
o Allows the decoupling of background, long running data
and event processing from user-facing requests.
23. Cloud Dataflow
o Cloud Dataflow is a fully-managed service for
transforming and enriching data in stream (real time)
and batch (historical) modes. Cloud Dataflow seamlessly
integrates with GCP services for streaming events
ingestion (Cloud Pub/Sub), data warehousing
(BigQuery), and more.
24. Google BigQuery
o BigQuery is Google's serverless, highly scalable,
enterprise cloud data warehouse. It is fast, cost-
effective, fully managed and can be used for analytics. It
also has built-in machine learning (beta).
o BigQuery is used for Online Analytical Processing
(OLAP): generating reports, time series and trend
analysis views and can be paired with Business
Intelligence (BI) tools like Metabase or Google Data
Studio (beta).
25. Cloud Datastore
o Cloud Datastore is a highly-scalable NoSQL database for
your web and mobile applications.
o Datastore is to App Engine what MongoDB is to the
MEAN stack and to App Engine what MySQL is to the
LAMP stack.
o It is a highly-scalable NoSQL database for your
applications. Cloud Datastore automatically handles
sharding and replication, providing you with a highly
available and durable database that scales
automatically.
26. Datastore vs. MongoDB vs. MySQL
Datastore
oCategory of object:
Kind
oOne object:
Entity
oIndividual object data:
Property
oUnique ID for object:
Key
MongoDB
oCategory of object
Collection
oOne object
Document
oIndividual object data
Field
oUnique ID for object
Key
MySQL
statementoCategory of object:
Table
oOne object:
Row
oIndividual object data:
Column
oUnique ID for object:
Primary Key
NoSQL SQL
27. Merits of Using
App Engine
App Engine is similar to Heroku
o Platform as a Service (PaaS)
o Provide abstracted machine
instances to run your code
o Prevent you from worrying
about ops: infrastructure
upgrades, software updates
and security patches
o Automatic load balancing,
scaling and fault tolerance
28. Google App Engine
o Task queues let applications do work, asynchronously
outside of a user request. If an app needs to execute
work in the background, it adds tasks to task queues.
The tasks are executed later, by worker services. Task
queues come in two flavors, push and pull.
o Push queues run tasks by delivering HTTP requests to
App Engine worker services.
o Pull queues rely on other worker services to fetch tasks
from the queue on their own.
30. - Pete Edochie
He who thinks the
soup is too much for the
eba cannot handle
greatness scale.
31. Problem 1: Out of Memory Errors
Background
Some tasks are
unable to be
completed because
they ran out of
memory.
One of what we’ve
found causes this
error is
communicating
with Sheets.
Cause
A library used consumed
~30MB per instance
when created and
instances cannot be
reused across requests.
This resulted in OOM
Errors when serving a lot
of requests to update
Google Sheets.
Solution
Cloud Functions
We recently refactored
the part of our code that
updates users
spreadsheets to use a
Cloud Function.
This means the
additional memory used
does not affect
subsequent requests.
32. - Phil Karlton
There are only two hard
things in Computer
Science: cache
invalidation and naming
things.
33. Problem 2: Cache Invalidation
Background
Users not having
their monthly
submissions quota
reset
Cause
When updating a large
number of entities in
Cloud Datastore, the
recommended approach
is to use Datastore
Migrations Pipelines.
Batching Datastore
writes for efficiency
bypassed Memcache and
that stale data could
potentially overwrite the
Datastore update.
Solution
Skip the in-process
cache but update
Memcache and
Datastore.
Use transactions when
modifying entities likely
to be updated in the
datastore but outdated
in the Memcache.
35. Problem 3: Datastore Contention
Background
Updating form
submission count
for high traffic
forms
Cause
A job was created on the
submissions task queue
for each submission and
they all tried to update
the same entity at the
same time.
Solution
When the submission
comes in, a task is
created named with the
form ID and the time of
submission and then we
schedule it to run in
future.
We combine this with
an entity in datastore
that holds the
submission status for
that task to batch
updates.